Query lcl|Aclame:protein:vir:78942|NCBI_annot:putative head-tail connector protein|genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Match_columns 510 No_of_seqs 115 out of 173 Neff 7.8 Searched_HMMs 1612 Date Tue Dec 3 04:49:58 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_21 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_21_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:78942 Length: 510 100.0 8E-183 5E-186 1018.8 59.0 510 1-510 1-510 (510) 2 protein:vir:6322 Length: 510 # 100.0 6E-182 4E-185 1014.0 58.8 510 1-510 1-510 (510) 3 protein:vir:80211 Length: 514 100.0 5E-170 3E-173 948.8 57.4 505 1-510 1-511 (514) 4 protein:vir:78696 Length: 542 100.0 7E-165 5E-168 920.4 55.5 502 1-510 1-519 (542) 5 protein:vir:99672 Length: 532 100.0 2E-162 1E-165 907.1 55.7 506 1-510 10-529 (532) 6 protein:vir:2198 Length: 536 # 100.0 8E-162 5E-165 903.9 57.1 505 1-510 9-532 (536) 7 protein:vir:10447 Length: 536 100.0 1E-161 6E-165 903.3 57.0 505 1-510 9-532 (536) 8 protein:vir:103330 Length: 517 100.0 2E-161 1E-164 901.4 57.1 500 1-510 8-516 (517) 9 protein:vir:94572 Length: 535 100.0 4E-161 2E-164 900.0 56.7 504 1-510 11-532 (535) 10 protein:vir:1538 Length: 535 # 100.0 1E-160 8E-164 897.2 58.1 506 1-510 10-532 (535) 11 protein:vir:96988 Length: 516 100.0 3E-161 2E-164 900.3 53.3 494 1-510 12-511 (516) 12 protein:vir:8883 Length: 543 # 100.0 2E-160 1E-163 896.7 54.1 505 1-510 10-541 (543) 13 protein:vir:7017 Length: 515 # 100.0 2E-160 1E-163 896.3 54.1 494 1-510 11-510 (515) 14 protein:vir:100039 Length: 522 100.0 3E-160 2E-163 895.1 54.5 497 1-510 1-505 (522) 15 protein:vir:3361 Length: 535 # 100.0 8E-160 5E-163 892.7 56.4 506 1-510 10-532 (535) 16 protein:vir:105641 Length: 516 100.0 6E-160 4E-163 893.5 54.2 491 1-510 12-508 (516) 17 protein:vir:1785 Length: 555 # 100.0 3E-159 2E-162 889.4 54.6 503 1-510 1-541 (555) 18 protein:vir:94709 Length: 522 100.0 1E-157 8E-161 880.8 56.5 501 1-510 8-517 (522) 19 protein:vir:103765 Length: 549 100.0 6E-157 4E-160 876.9 54.0 500 1-510 8-544 (549) 20 protein:vir:98506 Length: 555 100.0 1E-156 7E-160 875.4 55.1 498 1-510 6-536 (555) 21 protein:vir:107404 Length: 555 100.0 1E-156 7E-160 875.4 55.1 498 1-510 6-536 (555) 22 protein:vir:107822 Length: 555 100.0 1E-156 7E-160 875.4 55.1 498 1-510 6-536 (555) 23 protein:vir:7321 Length: 556 # 100.0 6E-153 4E-156 855.0 54.4 495 1-510 5-536 (556) 24 protein:vir:102668 Length: 547 100.0 1E-152 7E-156 853.5 54.7 496 1-510 2-543 (547) 25 protein:vir:95315 Length: 559 100.0 3E-152 2E-155 851.0 55.3 497 1-510 5-536 (559) 26 protein:vir:94599 Length: 641 100.0 1.3E-87 8.3E-91 496.9 44.6 494 1-510 25-619 (641) 27 protein:vir:80165 Length: 651 100.0 5.4E-68 3.4E-71 389.4 46.1 493 1-510 21-631 (651) 28 protein:vir:95449 Length: 584 100.0 2.8E-37 1.7E-40 221.0 36.7 476 1-507 17-584 (584) 29 protein:vir:3139 Length: 599 # 100.0 1.2E-32 7.6E-36 195.5 37.8 481 1-510 21-596 (599) 30 protein:vir:8846 Length: 705 # 100.0 6.7E-30 4.2E-33 180.5 42.7 486 1-510 15-632 (705) 31 protein:vir:95821 Length: 763 99.9 2.7E-23 1.7E-26 144.3 44.3 487 1-510 28-669 (763) 32 protein:vir:93630 Length: 776 99.7 3.7E-16 2.3E-19 105.2 37.6 484 1-510 43-678 (776) 33 protein:vir:108295 Length: 711 99.6 1.4E-13 8.7E-17 91.1 40.3 497 1-510 28-676 (711) 34 protein:vir:105429 Length: 708 99.4 3.3E-11 2E-14 78.1 32.9 486 1-510 1-649 (708) 35 protein:vir:100920 Length: 725 99.3 2.2E-10 1.4E-13 73.5 32.9 487 1-510 4-645 (725) 36 protein:vir:9263 Length: 725 # 99.2 3.3E-10 2.1E-13 72.5 32.6 487 1-510 4-645 (725) 37 protein:vir:77597 Length: 725 99.2 3.8E-10 2.3E-13 72.3 35.6 487 1-510 4-645 (725) 38 protein:vir:3520 Length: 720 # 99.2 2.1E-10 1.3E-13 73.6 27.8 483 1-510 1-639 (720) 39 protein:vir:104437 Length: 714 99.2 8.9E-10 5.5E-13 70.2 38.0 484 1-510 20-672 (714) 40 protein:vir:105520 Length: 706 99.1 1.2E-09 7.5E-13 69.5 33.6 488 1-510 1-645 (706) 41 protein:vir:172 Length: 708 # 99.1 1.3E-09 8.1E-13 69.3 31.3 483 1-510 1-646 (708) 42 protein:vir:3296 Length: 714 # 99.1 1.3E-09 8.3E-13 69.2 40.1 484 1-510 17-666 (714) 43 protein:vir:817 Length: 714 # 99.1 1.3E-09 8.3E-13 69.2 40.1 484 1-510 17-666 (714) 44 protein:vir:9950 Length: 714 # 99.1 1.3E-09 8.3E-13 69.2 40.1 484 1-510 17-666 (714) 45 protein:vir:2764 Length: 714 # 99.1 1.3E-09 8.3E-13 69.2 40.1 484 1-510 17-666 (714) 46 protein:vir:10117 Length: 714 99.1 1.3E-09 8.3E-13 69.2 40.1 484 1-510 17-666 (714) 47 protein:vir:105619 Length: 772 98.9 1.2E-08 7.7E-12 63.9 39.4 476 1-510 20-660 (772) 48 protein:vir:2341 Length: 488 # 98.6 2.5E-07 1.5E-10 56.8 35.9 429 1-510 12-479 (488) 49 protein:vir:1587 Length: 508 # 98.5 3.2E-07 2E-10 56.2 33.9 429 1-510 3-506 (508) 50 protein:vir:80453 Length: 535 98.5 6E-07 3.7E-10 54.7 28.5 441 1-510 32-535 (535) 51 protein:vir:78227 Length: 480 98.4 7.5E-07 4.7E-10 54.2 34.9 421 1-510 4-474 (480) 52 protein:vir:78907 Length: 518 98.4 7.5E-07 4.7E-10 54.2 32.2 450 1-509 7-518 (518) 53 protein:vir:80959 Length: 499 98.4 7.7E-07 4.8E-10 54.1 31.5 436 1-510 5-494 (499) 54 protein:vir:4782 Length: 522 # 98.4 9E-07 5.6E-10 53.7 33.4 449 1-510 7-520 (522) 55 protein:vir:3028 Length: 500 # 98.4 9.3E-07 5.7E-10 53.7 26.9 425 1-510 11-500 (500) 56 protein:vir:9815 Length: 500 # 98.4 9.3E-07 5.7E-10 53.7 26.9 425 1-510 11-500 (500) 57 protein:vir:99916 Length: 504 98.4 1.1E-06 6.5E-10 53.4 30.9 413 1-510 18-479 (504) 58 protein:vir:79703 Length: 505 98.4 1.1E-06 6.8E-10 53.3 37.5 424 1-506 3-505 (505) 59 protein:vir:104082 Length: 485 98.3 1.5E-06 9.6E-10 52.4 37.0 416 1-508 17-485 (485) 60 protein:vir:98883 Length: 517 98.3 2E-06 1.2E-09 51.8 34.1 456 1-510 3-515 (517) 61 protein:vir:78537 Length: 480 98.2 2.3E-06 1.4E-09 51.5 34.4 423 1-510 4-474 (480) 62 protein:vir:80680 Length: 441 98.2 2.4E-06 1.5E-09 51.4 37.8 410 1-507 1-441 (441) 63 protein:vir:38 Length: 496 # N 98.2 3E-06 1.8E-09 50.9 37.7 425 1-510 18-494 (496) 64 protein:vir:98444 Length: 434 98.2 3.4E-06 2.1E-09 50.6 34.6 392 28-510 1-424 (434) 65 protein:vir:101494 Length: 527 98.1 3.9E-06 2.4E-09 50.3 29.5 439 1-510 28-525 (527) 66 protein:vir:5961 Length: 503 # 98.1 3.9E-06 2.4E-09 50.2 35.9 436 1-509 26-503 (503) 67 protein:vir:102239 Length: 527 98.1 4.3E-06 2.7E-09 50.0 29.5 439 1-510 28-525 (527) 68 protein:vir:97447 Length: 474 98.0 7.8E-06 4.9E-09 48.6 33.7 413 1-510 28-468 (474) 69 protein:vir:94498 Length: 474 98.0 7.8E-06 4.9E-09 48.6 33.7 413 1-510 28-468 (474) 70 protein:vir:79043 Length: 479 97.8 1.5E-05 9.3E-09 47.0 37.8 425 1-510 20-475 (479) 71 protein:vir:96240 Length: 511 97.8 1.8E-05 1.1E-08 46.7 37.2 419 1-510 13-494 (511) 72 protein:vir:9306 Length: 511 # 97.8 1.8E-05 1.1E-08 46.6 36.5 424 1-510 13-494 (511) 73 protein:vir:7768 Length: 484 # 97.8 1.9E-05 1.2E-08 46.5 36.0 412 1-509 16-484 (484) 74 protein:vir:4223 Length: 486 # 97.7 2.3E-05 1.4E-08 46.0 37.0 413 1-510 17-479 (486) 75 protein:vir:9922 Length: 489 # 97.7 2.6E-05 1.6E-08 45.7 30.3 418 1-507 13-489 (489) 76 protein:vir:95806 Length: 440 97.7 2.9E-05 1.8E-08 45.4 33.6 402 1-510 5-436 (440) 77 protein:vir:2427 Length: 485 # 97.6 3.3E-05 2E-08 45.2 37.9 429 1-510 13-476 (485) 78 protein:vir:345 Length: 663 # 97.6 3.4E-05 2.1E-08 45.1 31.4 465 1-510 1-645 (663) 79 protein:vir:95113 Length: 474 97.6 3.5E-05 2.2E-08 45.0 34.4 412 1-510 28-468 (474) 80 protein:vir:106571 Length: 499 97.6 3.9E-05 2.4E-08 44.7 36.9 424 1-510 5-475 (499) 81 protein:vir:4898 Length: 502 # 97.6 4.2E-05 2.6E-08 44.6 38.3 430 1-510 34-493 (502) 82 protein:vir:96179 Length: 468 97.6 4.6E-05 2.8E-08 44.4 32.6 407 1-506 27-468 (468) 83 protein:vir:97171 Length: 512 97.5 5.4E-05 3.4E-08 44.0 38.6 417 1-510 38-495 (512) 84 protein:vir:2500 Length: 501 # 97.5 5.5E-05 3.4E-08 44.0 35.8 423 1-510 28-495 (501) 85 protein:vir:99781 Length: 511 97.5 5.6E-05 3.5E-08 43.9 39.1 424 1-510 13-494 (511) 86 protein:vir:99072 Length: 479 97.5 6.1E-05 3.8E-08 43.7 35.3 412 1-510 14-474 (479) 87 protein:vir:7430 Length: 563 # 97.5 6.3E-05 3.9E-08 43.6 28.6 464 1-510 1-535 (563) 88 protein:vir:95899 Length: 474 97.3 9.7E-05 6E-08 42.6 34.8 411 1-510 28-468 (474) 89 protein:vir:96266 Length: 474 97.3 9.7E-05 6E-08 42.6 34.8 411 1-510 28-468 (474) 90 protein:vir:733 Length: 453 # 97.3 0.00011 6.5E-08 42.4 40.1 415 1-507 17-453 (453) 91 protein:vir:94101 Length: 474 97.2 0.00012 7.5E-08 42.1 32.5 425 1-510 16-472 (474) 92 protein:vir:105889 Length: 474 97.2 0.00012 7.5E-08 42.1 32.5 425 1-510 16-472 (474) 93 protein:vir:94805 Length: 492 97.2 0.00013 7.8E-08 42.0 34.5 415 1-510 45-482 (492) 94 protein:vir:107112 Length: 478 97.2 0.00014 8.8E-08 41.7 35.5 412 1-510 27-477 (478) 95 protein:vir:1236 Length: 483 # 97.1 0.00016 9.8E-08 41.4 34.6 408 1-510 36-473 (483) 96 protein:vir:96366 Length: 511 97.1 0.00018 1.1E-07 41.2 39.4 421 1-510 13-494 (511) 97 protein:vir:78805 Length: 511 97.1 0.00018 1.1E-07 41.2 39.4 421 1-510 13-494 (511) 98 protein:vir:99522 Length: 470 97.0 0.0002 1.2E-07 40.9 39.1 420 1-510 25-467 (470) 99 protein:vir:95014 Length: 491 97.0 0.00021 1.3E-07 40.8 30.8 430 1-506 1-491 (491) 100 protein:vir:106639 Length: 481 97.0 0.00023 1.4E-07 40.6 41.2 417 1-510 30-476 (481) 101 protein:vir:95149 Length: 501 97.0 0.00023 1.4E-07 40.5 27.8 431 1-507 1-501 (501) 102 protein:vir:103951 Length: 511 96.8 0.00032 2E-07 39.7 39.0 414 1-510 13-494 (511) 103 protein:vir:97336 Length: 492 96.8 0.00033 2E-07 39.7 35.3 412 1-510 45-481 (492) 104 protein:vir:96494 Length: 501 96.7 0.00037 2.3E-07 39.4 38.8 425 1-510 40-488 (501) 105 protein:vir:105461 Length: 470 96.7 0.00043 2.6E-07 39.1 34.0 420 1-510 2-466 (470) 106 protein:vir:3964 Length: 453 # 96.7 0.00043 2.7E-07 39.0 41.7 413 1-510 18-447 (453) 107 protein:vir:105292 Length: 478 96.5 0.00053 3.3E-07 38.5 38.2 405 1-510 26-467 (478) 108 protein:vir:78083 Length: 537 96.5 0.00058 3.6E-07 38.3 37.8 444 1-510 13-502 (537) 109 protein:vir:9871 Length: 429 # 96.3 0.00072 4.4E-07 37.8 38.6 404 1-510 2-427 (429) 110 protein:vir:97265 Length: 513 96.3 0.00075 4.7E-07 37.7 25.9 432 1-510 6-512 (513) 111 protein:vir:9751 Length: 422 # 96.3 0.00076 4.7E-07 37.7 31.2 386 1-486 1-422 (422) 112 protein:vir:8184 Length: 474 # 96.2 0.00083 5.1E-07 37.5 34.4 410 1-507 12-474 (474) 113 protein:vir:94742 Length: 409 96.1 0.00097 6E-07 37.1 33.1 375 1-467 1-409 (409) 114 protein:vir:3609 Length: 452 # 96.1 0.001 6.2E-07 37.0 40.7 406 1-510 18-442 (452) 115 protein:vir:9568 Length: 410 # 96.0 0.0011 6.8E-07 36.8 35.8 377 12-488 1-410 (410) 116 protein:vir:94546 Length: 506 96.0 0.0011 6.8E-07 36.8 37.3 425 1-510 23-500 (506) 117 protein:vir:2732 Length: 501 # 95.9 0.0013 7.8E-07 36.5 38.6 425 1-510 40-494 (501) 118 protein:vir:93747 Length: 472 95.8 0.0014 8.6E-07 36.2 37.4 410 1-510 25-464 (472) 119 protein:vir:78393 Length: 489 95.8 0.0015 9.4E-07 36.0 31.9 422 1-510 1-484 (489) 120 protein:vir:80040 Length: 461 94.9 0.0032 2E-06 34.3 21.6 417 1-499 1-461 (461) 121 protein:vir:94956 Length: 452 94.8 0.0036 2.2E-06 34.0 23.8 420 1-503 1-452 (452) 122 protein:vir:1634 Length: 409 # 94.4 0.0044 2.7E-06 33.5 32.7 376 1-467 1-409 (409) 123 protein:vir:102950 Length: 471 94.4 0.0045 2.8E-06 33.4 33.5 414 1-507 1-471 (471) 124 protein:vir:96839 Length: 474 93.5 0.0075 4.6E-06 32.2 34.5 412 1-510 27-466 (474) 125 protein:vir:4995 Length: 384 # 91.3 0.017 1E-05 30.4 24.4 353 1-448 1-384 (384) 126 protein:vir:96783 Length: 488 91.1 0.017 1.1E-05 30.2 33.0 405 1-465 7-488 (488) 127 protein:vir:3989 Length: 392 # 90.7 0.019 1.2E-05 30.0 25.9 331 27-446 1-392 (392) 128 protein:vir:1023 Length: 392 # 90.7 0.019 1.2E-05 30.0 25.9 331 27-446 1-392 (392) 129 protein:vir:7987 Length: 456 # 89.3 0.027 1.7E-05 29.2 33.9 423 1-510 1-456 (456) 130 protein:vir:7407 Length: 392 # 88.5 0.032 2E-05 28.8 25.1 311 63-446 1-392 (392) 131 protein:vir:100150 Length: 437 88.0 0.035 2.2E-05 28.5 17.9 393 1-510 1-428 (437) 132 protein:vir:78161 Length: 355 85.3 0.054 3.3E-05 27.6 16.1 296 163-509 1-355 (355) 133 protein:vir:5249 Length: 437 # 84.4 0.06 3.7E-05 27.3 17.5 396 18-510 1-433 (437) 134 protein:vir:101647 Length: 460 83.8 0.065 4.1E-05 27.1 24.1 406 1-486 1-460 (460) 135 protein:vir:4828 Length: 382 # 83.2 0.071 4.4E-05 26.9 24.5 343 6-449 1-382 (382) 136 protein:vir:102602 Length: 456 83.1 0.071 4.4E-05 26.9 34.1 428 1-510 6-455 (456) 137 protein:vir:105819 Length: 456 83.1 0.071 4.4E-05 26.9 34.1 428 1-510 6-455 (456) 138 protein:vir:4337 Length: 434 # 81.1 0.089 5.5E-05 26.4 18.6 396 1-498 1-434 (434) 139 protein:vir:93610 Length: 454 79.9 0.1 6.2E-05 26.1 21.1 390 8-510 1-430 (454) 140 protein:vir:4854 Length: 386 # 79.8 0.1 6.3E-05 26.0 21.1 361 6-506 1-386 (386) 141 protein:vir:102330 Length: 451 79.5 0.1 6.4E-05 26.0 35.3 414 1-505 1-451 (451) 142 protein:vir:6240 Length: 457 # 78.6 0.11 7E-05 25.8 17.5 405 6-510 1-442 (457) 143 protein:vir:79538 Length: 502 75.0 0.15 9.4E-05 25.1 29.4 425 1-510 11-489 (502) 144 protein:vir:81152 Length: 411 74.7 0.15 9.6E-05 25.0 24.4 364 1-465 3-411 (411) 145 protein:vir:95542 Length: 548 71.2 0.2 0.00012 24.4 26.2 443 1-510 12-523 (548) 146 protein:vir:94599 Length: 641 65.1 0.29 0.00018 23.5 22.8 443 1-510 39-623 (641) 147 protein:vir:4698 Length: 251 # 64.5 0.3 0.00019 23.5 12.7 237 6-360 1-251 (251) 148 protein:vir:4952 Length: 386 # 64.2 0.3 0.00019 23.4 25.4 344 1-449 1-386 (386) 149 protein:vir:3153 Length: 467 # 61.3 0.36 0.00022 23.0 21.8 408 44-510 1-467 (467) 150 protein:vir:1326 Length: 457 # 50.2 0.62 0.00038 21.7 21.3 400 6-510 1-442 (457) 151 protein:vir:107742 Length: 537 49.9 0.62 0.00039 21.7 19.7 431 1-510 39-533 (537) 152 protein:vir:100187 Length: 385 47.3 0.71 0.00044 21.4 20.3 348 1-467 1-385 (385) 153 protein:vir:78749 Length: 337 46.8 0.72 0.00045 21.4 24.3 308 1-416 1-337 (337) 154 protein:vir:99563 Length: 862 44.9 0.79 0.00049 21.1 16.1 428 1-510 92-584 (862) 155 protein:vir:3420 Length: 533 # 44.3 0.81 0.0005 21.1 26.4 437 1-509 1-533 (533) 156 protein:vir:95315 Length: 559 40.8 0.96 0.00059 20.7 21.5 453 1-510 1-533 (559) 157 protein:vir:4454 Length: 414 # 40.3 0.98 0.00061 20.6 17.8 382 6-509 1-414 (414) 158 protein:vir:9408 Length: 441 # 39.5 1 0.00063 20.5 19.9 366 1-467 15-441 (441) 159 protein:vir:79984 Length: 441 39.5 1 0.00063 20.5 19.9 366 1-467 15-441 (441) 160 protein:vir:5737 Length: 419 # 37.7 1.1 0.00068 20.3 21.1 377 1-510 1-415 (419) 161 protein:vir:96738 Length: 505 36.8 1.2 0.00071 20.2 27.8 432 1-510 19-494 (505) 162 protein:vir:78641 Length: 278 34.1 1.3 0.00082 19.9 21.6 252 69-416 1-278 (278) 163 protein:vir:10362 Length: 432 34.1 1.3 0.00082 19.9 22.8 352 1-455 2-432 (432) 164 protein:vir:3843 Length: 397 # 33.9 1.3 0.00082 19.9 21.6 359 29-508 1-397 (397) 165 protein:vir:4598 Length: 416 # 31.7 1.5 0.00091 19.7 21.3 361 6-467 1-416 (416) 166 protein:vir:81095 Length: 416 31.7 1.5 0.00091 19.7 21.3 361 6-467 1-416 (416) 167 protein:vir:104338 Length: 422 30.9 1.5 0.00095 19.6 21.5 387 18-496 1-422 (422) 168 protein:vir:80644 Length: 551 29.6 1.6 0.001 19.4 19.8 423 1-510 1-522 (551) 169 protein:vir:6382 Length: 553 # 28.9 1.7 0.0011 19.3 24.6 445 1-510 19-553 (553) 170 protein:vir:81072 Length: 432 28.3 1.8 0.0011 19.2 23.6 359 1-455 1-432 (432) 171 protein:vir:483 Length: 413 # 27.6 1.8 0.0011 19.1 22.3 383 7-508 1-413 (413) 172 protein:vir:99232 Length: 526 23.7 2.3 0.0014 18.6 16.8 395 1-510 17-485 (526) 173 protein:vir:98853 Length: 219 22.7 2.4 0.0015 18.5 14.9 190 204-417 1-219 (219) 174 protein:vir:102118 Length: 409 22.4 2.4 0.0015 18.4 22.2 358 7-464 1-409 (409) 175 protein:vir:98396 Length: 441 21.9 2.5 0.0016 18.4 20.5 366 1-467 15-441 (441) 176 protein:vir:389 Length: 530 # 21.6 2.6 0.0016 18.3 27.2 422 1-510 13-516 (530) 177 protein:vir:8418 Length: 409 # 21.5 2.6 0.0016 18.3 22.6 347 6-452 1-409 (409) 178 protein:vir:10321 Length: 495 21.4 2.6 0.0016 18.3 27.6 422 1-510 16-487 (495) No 1 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=100.00 E-value=8.2e-183 Score=1018.85 Aligned_cols=510 Identities=100% Similarity=1.391 Sum_probs=500.2 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccCCC Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELT 80 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~ 80 (510) ||+|+++||++|||++|+++|+||++||+|++|+++++.++++..++|||||++|+++||||||++||||++|||||+++ T Consensus 1 mk~~~~~~~~~lkr~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~ 80 (510) T protein:vir:78 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELT 80 (510) T ss_pred ChhHHHHHHHHHhccchHHHHHHHHHhhccccccCCCCcccccccCcccchHHHHHHHHHHHHHHhhcCCCCcccccCCC Confidence 99999999999999999999999999999999999998888888899999999999999999999999999999999999 Q ss_pred hhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCCeEEEEEeceEEEeeCCC Q lcl|Aclame:pro 81 DAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRDAT 160 (510) Q Consensus 81 d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~~~~~~pl~~~~v~~d~~ 160 (510) |..++++++.+.+.+++++||++||+.++.+|++||||.++|++|+||++|||+|+|++++.++|++|||++|||.+|++ T Consensus 81 d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~pl~~y~v~~d~~ 160 (510) T protein:vir:78 81 DAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRDAT 160 (510) T ss_pred hHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEeCCCCeEEEEEcceeEEeeCCC Confidence 99999998888999999999999999999999999999999999999999999999999998899999999999999999 Q ss_pred CceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccccccccCceE Q lcl|Aclame:pro 161 GRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPYI 240 (510) Q Consensus 161 G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~ 240 (510) |+||+|||||++|+++|+++||+++.++..+++|+++|+|||||+|+++++|||||+|+|+||+.++.+|+|++++|||+ T Consensus 161 G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~i~~~~~~~~~e~P~~ 240 (510) T protein:vir:78 161 GRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPYI 240 (510) T ss_pred cCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEEEEeecCCCCcEEEEEEEecCeeeccccccccccCCee Confidence 99999999999999999999999999888889999999999999999999999999999999999999999999999999 Q ss_pred EEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCccccccccC Q lcl|Aclame:pro 241 VPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYER 320 (510) Q Consensus 241 ~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~~v~~~~~ 320 (510) ++||++.+||+|||||++++|||+|+||.|+++.+++++++++|+|+|+|+|+++|+++..+++|.++||++++|+++++ T Consensus 241 ~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~l~~~~~g~~v~g~~~~v~~~~~ 320 (510) T protein:vir:78 241 VPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYER 320 (510) T ss_pred eeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCCccccchhhhccCCCceeecCCccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCccchHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 321 GDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALL 400 (510) Q Consensus 321 ~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l 400 (510) ++++||+++++.|++++++|+++||+++.+++++|||||||++|++|++.+||||||||++|||.|||+|+|++|+++++ T Consensus 321 ~~~~d~~~~~~~i~~~~~rI~~aF~~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl 400 (510) T protein:vir:78 321 GDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALL 400 (510) T ss_pred CcccchHHHHHHHHHHHHHHHHHHhhccccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHH Q lcl|Aclame:pro 401 QGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAE 480 (510) Q Consensus 401 ~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~ 480 (510) ||+|++.+++.+|+++++|+|+|+++++.++.|+++++++++|++++||+|++++++++++|||+..|+||+||++++++ T Consensus 401 ~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~ 480 (510) T protein:vir:78 401 QGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAE 480 (510) T ss_pred CCCCcccccceeeecccHHHHHHHHHHHHHHHHHHHHhcChhhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999977789999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 481 EQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 481 ~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ++++|++++++++++++.+|++++++++|| T Consensus 481 ~~~~q~~~~~~~~~a~~~~~~~~~~~~~g~ 510 (510) T protein:vir:78 481 EQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhcccCCCC Confidence 999999999999999999999999999999 No 2 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=100.00 E-value=6.3e-182 Score=1013.99 Aligned_cols=510 Identities=98% Similarity=1.372 Sum_probs=500.3 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccCCC Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELT 80 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~ 80 (510) ||+||++||++|||++|+++|+||++||+|++|+++++.++.+..++|||||++|+++||||||++||||++|||||+++ T Consensus 1 mk~~~~~~~~~lkR~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~ 80 (510) T protein:vir:63 1 MKTTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELT 80 (510) T ss_pred ChhHHHHHHHHHhccchHHHHHHHHHhhccccCCCCCCccccccCCCccchHHHHHHHHHHHHHhhhcCCCCcccccCCC Confidence 99999999999999999999999999999999999998888888899999999999999999999999999999999999 Q ss_pred hhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCCeEEEEEeceEEEeeCCC Q lcl|Aclame:pro 81 DAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRDAT 160 (510) Q Consensus 81 d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~~~~~~pl~~~~v~~d~~ 160 (510) |..+++.++.+.+.+++++||++||++++.+|++||||.++|++|+||++|||+|+|++++..+|++|||++|||.+|++ T Consensus 81 d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~~~~~~~~~~~pl~~y~v~~d~~ 160 (510) T protein:vir:63 81 DAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRDSDAATVVAWSLRSYAVRRDAT 160 (510) T ss_pred hHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEcCCCcEEEEEEcceeEEeeCCC Confidence 99999998888999999999999999999999999999999999999999999999999998889999999999999999 Q ss_pred CceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccccccccCceE Q lcl|Aclame:pro 161 GRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPYI 240 (510) Q Consensus 161 G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~ 240 (510) |+||+||||+++|+++|+++|+.+..++..+++|+++|+|||||+|+++++|||||||+|++|+.++.+++|++++|||+ T Consensus 161 G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~~~~~~~~~~~e~P~~ 240 (510) T protein:vir:63 161 GRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGKEGRWPIHLCPYI 240 (510) T ss_pred cCeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEEEeecCCCceEEEEEEEecCceeccccccccccCcee Confidence 99999999999999999999999998888889999999999999999999999999999999999999999999999999 Q ss_pred EEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCccccccccC Q lcl|Aclame:pro 241 VPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYER 320 (510) Q Consensus 241 ~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~~v~~~~~ 320 (510) ++||++.+||+||||||+++|||+|+||.|+++.+++++++++|||+|+|+|+++|+++..+++|.+++|++++++++++ T Consensus 241 ~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~~v~~~~~ 320 (510) T protein:vir:63 241 VPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYER 320 (510) T ss_pred eeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhhccCCCceeecCCcccceeeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCccchHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCC Q lcl|Aclame:pro 321 GDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALL 400 (510) Q Consensus 321 ~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l 400 (510) ++++||+++++.|++++++|+++||+++.+++++|||||||++|++|++.+||||||||++|||.|||+|+|++|+++++ T Consensus 321 ~~~~d~~~~~~~i~~~~~rI~~af~~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl 400 (510) T protein:vir:63 321 GDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALL 400 (510) T ss_pred CcccchHHHHHHHHHHHHHHHHHHHhhcccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHH Q lcl|Aclame:pro 401 QGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAE 480 (510) Q Consensus 401 ~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~ 480 (510) ||+|++.+++.+|+|+++|+|+++++++.++.|+++++++++|++++||+|++++++|+++|||+..|+||+||++++++ T Consensus 401 ~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~aq~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~ 480 (510) T protein:vir:63 401 QGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAE 480 (510) T ss_pred CCCCchhcccceecchhHHHHHHHHHHHHHHHHHHHHhcCchhhhccCCHHHHHHHHHHHhCCChhHhcCCHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999987789999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 481 EQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 481 ~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) +++||+++|+++++.++.+|+++++++||| T Consensus 481 ~~~qq~~~~~~~~~~~~~~a~~~~~~~~g~ 510 (510) T protein:vir:63 481 QQRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccCC Confidence 999999999999999999999999999999 No 3 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=100.00 E-value=4.8e-170 Score=948.83 Aligned_cols=505 Identities=46% Similarity=0.734 Sum_probs=477.4 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhccccc--CCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLM--VDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~~~--~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) ||+++.+.|.+.+|++|+++|+||++||+|+++ +.++++...+..++|||||++|+++||||||++||||++|||||+ T Consensus 1 m~~~~~~l~~k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 80 (514) T protein:vir:80 1 MRQQASAMWAEYRDSTAIRKAEDFAKFTIASLMVDPLDKTHQAEVVEYDFQSAGAFLVNNLTAKLALTLFPPGRPSFQIE 80 (514) T ss_pred CccchHHHHHHhhcchHHHHHHHHHHHhcccccCCCCCCcccccccccccchhHHHHHHHHHHHHHhhhcCCCCcccccc Confidence 999999999999999999999999999999976 445566677788999999999999999999999999999999999 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCCeEEEEEeceEEEeeC Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRD 158 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~~~~~~pl~~~~v~~d 158 (510) ++|...+.....+.+..++++||++||++++++|++||||.++|++|+||++|||+|+|++++..+|++|||++|||.+| T Consensus 81 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~pl~~y~v~~d 160 (514) T protein:vir:80 81 LDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYREPGTGKMLVWTMQSYTVRRT 160 (514) T ss_pred cCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEecCCCcEEEEEcCeEEEeeC Confidence 99887777777788889999999999999999999999999999999999999999999999888999999999999999 Q ss_pred CCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccccccccCc Q lcl|Aclame:pro 159 ATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCP 238 (510) Q Consensus 159 ~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P 238 (510) ++|+|++||||+++|+++|+++|+.+..+...+++++++|+|||||+|++++++||+|||+|++|++++++|+|++++|| T Consensus 161 ~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~g~~i~~es~y~~~e~P 240 (514) T protein:vir:80 161 SHGDPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVIEWQPTPNGKRCAVWHELEGKRVGPESSYPAHLCP 240 (514) T ss_pred CCcCeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEEEeecCCCCeEEEEEEeccceeecccCccccccCC Confidence 99999999999999999999999998887777888999999999999999999999999999999999999999999999 Q ss_pred eEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCccccccc Q lcl|Aclame:pro 239 YIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAY 318 (510) Q Consensus 239 ~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~~v~~~ 318 (510) |+++||++.+||+|||||++++|||+|+||.|+++++++++++++|+|+|+|+|+++|+++..+++|.+++|++++|+++ T Consensus 241 ~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~~~~~~l~~~~~g~~v~g~~~~v~~~ 320 (514) T protein:vir:80 241 YVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNLVDEAKGGAVDDYRDAETGDFVPGQVGSVASY 320 (514) T ss_pred eeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCcccccchhhhcccCCceeecCCCccceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCccchHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhh- Q lcl|Aclame:pro 319 ERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD- 397 (510) Q Consensus 319 ~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~- 397 (510) +.++++||+++++.|++++++|+++||++...+++++||||||++|++|++.+||||||||++|||.|||+|+|.+|++ T Consensus 321 ~~~~~~d~~~~~~~i~~~~~rI~~aFml~~~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~ 400 (514) T protein:vir:80 321 ERGDYNKIAQASASVESIVMRLNRAFMYTGQVRDAERVTVEEIRTVAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRG 400 (514) T ss_pred ecCcccchHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 9999999999999999999999999999877789999999999999999999999999999999999999999999976 Q ss_pred --cCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcCh-HhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHH Q lcl|Aclame:pro 398 --ALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI-AQLDPRISLPKMMDTIWAAFSVDTSQFYKSADE 474 (510) Q Consensus 398 --~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~-~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee 474 (510) +.+|++|++.++++|+|+|++|+|+++++++.+|++.++.+++. |+++++||+|++++.+|+++|||++.|++|+|+ T Consensus 401 ~~g~lP~~p~~l~~~~~vs~la~l~r~~~~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~i~~~~e~ 480 (514) T protein:vir:80 401 NGGMLLGIAQGVYRPSIITGIPALTRNIETANILRATQEASAIVPALVQLSKRFDPEKLVERIFANNSVDLSTLSKDPDV 480 (514) T ss_pred ccCCCCCCCchhhcceeeecHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHhCCCHhhccCCHHH Confidence 46788888899999999999999999999999999999999996 899999999999999999999999889999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 475 LQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 475 ~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ++++++++++++++|++ ++++..+++.++|| T Consensus 481 ~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~ 511 (514) T protein:vir:80 481 VAAEAEQEAALAQQQLD-----VASGALAAETSAGV 511 (514) T ss_pred HHHHHHHHHHHHHHHHH-----HHHHHHHHhhhccc Confidence 99988888776644442 34566677788888 No 4 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=100.00 E-value=7.3e-165 Score=920.44 Aligned_cols=502 Identities=24% Similarity=0.361 Sum_probs=457.5 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) ||+++++||++|+ |++|+++|+||++||+|++++++++.++++..++|||||++|+++|||||||+||||++|||||. T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 80 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLPYLLTEDGHASGGRLQQPYQSLGSKGVNALSSKLMLSLFPIQTSFFKLQ 80 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 9999999999996 89999999999999999999999988888889999999999999999999999999999999999 Q ss_pred CChhhhhhhcc-CchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCCeEEEEEeceEEEee Q lcl|Aclame:pro 79 LTDAIRREADS-RDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRR 157 (510) Q Consensus 79 ~~d~~~~~~~~-~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~~~~~~pl~~~~v~~ 157 (510) ++|..+.+..+ .....++++.||++||++++++|++||||.++|++|+||++|||+|+|++++ +|++|||++|||.+ T Consensus 81 ~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~--~~~~~pl~~y~v~~ 158 (542) T protein:vir:78 81 INDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFAGKK--TLKVYPLDRYVIER 158 (542) T ss_pred CCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEecCC--CceEEecceeEEee Confidence 99999888655 4445688999999999999999999999999999999999999999999886 69999999999999 Q ss_pred CCCCceeEEEEEEEecHHHHhHHhhHHhh----cccccCCCCceEEEEEEEEeecC--------CCeeEEEEEEeeCCee Q lcl|Aclame:pro 158 DATGRWMDIVLKQRYKSKDLDDVYKQDLM----RAGRNLSGSGSVDLYTHVQRRKG--------TAMDYAEMYHEIDGVR 225 (510) Q Consensus 158 d~~G~v~~i~r~~~~t~~~l~~~~~~~~~----~~~~~~~~~~~v~v~~~v~~~~~--------~~~~~~sv~~e~~~~~ 225 (510) |++|+||+|||||+||+++|+++||++.. +....++++.+++|+|+|+|+++ ++++|||||++++|+. T Consensus 159 d~~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~~~~~~~s~~~e~~g~~ 238 (542) T protein:vir:78 159 DGDGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCKLVDGQHRWHQECDGKE 238 (542) T ss_pred CCCCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccccccCCCeEEEEEEecccc Confidence 99999999999999999999999997543 33456788999999999999864 4689999999999998 Q ss_pred e-ccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCC Q lcl|Aclame:pro 226 V-GETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEM 304 (510) Q Consensus 226 ~-~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~ 304 (510) + +..+.|++++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|||+|+|+|+++|.++..+++ T Consensus 239 v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~~~~~~~~~~~~ 318 (542) T protein:vir:78 239 IKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSPSATTKPQSLARAGT 318 (542) T ss_pred ccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCC Confidence 7 445666669999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHH Q lcl|Aclame:pro 305 GDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQ 384 (510) Q Consensus 305 G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l 384 (510) |.+++|.+++++++++++++||+++++.|++++++|+++||++. .+++++||||||++|++|++.+||||||||++||| T Consensus 319 g~iv~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~~~-~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L 397 (542) T protein:vir:78 319 GAIIQGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFLILN-VRQSERTTATEVREVQMELDRQLSGIYGSLTVELL 397 (542) T ss_pred ceeecCCccceeeeecccccchhHHHHHHHHHHHHHHHHhcccc-cCCcccccHHHHHHHHHHHHHHhhHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999864 58999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhcC-CCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCC Q lcl|Aclame:pro 385 SPLAYVCLSEVDDAL-LQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSV 463 (510) Q Consensus 385 ~Pli~r~~~il~~~~-l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gv 463 (510) .|||+|+|++|++++ +|++|++.++++|+|+|++++|+++++++.+|++.++++.++++++++||+|++++++++++|| T Consensus 398 ~Pli~R~~~il~r~g~lP~~p~~lv~~~~~s~La~~~r~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gv 477 (542) T protein:vir:78 398 TPYLNRKLHLMQRSKQLPSLPKGLVMPTVVAGLGGVGRGEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGI 477 (542) T ss_pred HHHHHHHHHHHHhcCCCCCCchhceeeeeechHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCC Confidence 999999999998765 6777888899999999999999999999999999999987888899999999999999999999 Q ss_pred CHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 464 DTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 464 p~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) |+..|++|+||+++++++++++++++. ++.+|++.+++.+|= T Consensus 478 p~~~i~~s~e~~~~~~~q~q~~~~~~a-----l~~~a~~~a~~~~~~ 519 (542) T protein:vir:78 478 DTLNLVKSPETMANEAQQAQQQQMTAS-----LMGQAGQLAKSPIGE 519 (542) T ss_pred CHhhccCCHHHHHHHHHHHHHHHHHHH-----HHHhhhhcccccccc Confidence 988899999999988777655443322 233344433322222 No 5 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=100.00 E-value=1.9e-162 Score=907.14 Aligned_cols=506 Identities=31% Similarity=0.455 Sum_probs=461.1 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) =+++|++||++|| |++|+++|+||++||+|++|+++++.+.++..++|||||++|+++|||||||+||||++|||||+ T Consensus 10 ~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp~~~WF~l~ 89 (532) T protein:vir:99 10 AADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLN 89 (532) T ss_pred cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCcchhhccccccchHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 2678999999996 89999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC------CeEEEEEece Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE------ATVVAWSLRS 152 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~------~~~~~~pl~~ 152 (510) ++|..+++....+.+.++|+.||++||++++++|++||||.++|++|+||++|||+|+|+++++ .+|++|||++ T Consensus 90 ~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~f~~~pl~~ 169 (532) T protein:vir:99 90 VSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHN 169 (532) T ss_pred CCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEecccccccCcccceEEEEcCe Confidence 9999999888888899999999999999999999999999999999999999999999998653 2699999999 Q ss_pred EEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCe-eeccccc Q lcl|Aclame:pro 153 YAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGV-RVGETGR 231 (510) Q Consensus 153 ~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~-~~~~~~~ 231 (510) |||.+|++|+|++||||+++++++|+++|+.++.+...+++|+++|+|||||+|+++ +++|+++ ++++|+ .++.+|+ T Consensus 170 y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~-~~~~~~~-~~~~g~~~~~~~~~ 247 (532) T protein:vir:99 170 FVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHVYRDPE-AMVFRSY-QEIDGEIVAGTEGE 247 (532) T ss_pred EEEeeCCCCCeeeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEEEEecCC-CCeeEEE-EeecCceecccccc Confidence 999999999999999999999999999999998877778899999999999999876 5777755 566665 4578899 Q ss_pred cccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCC Q lcl|Aclame:pro 232 WPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGG 311 (510) Q Consensus 232 y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~ 311 (510) |++++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|+|+|+|+|+++|.++..+++|.+++|. T Consensus 248 ~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~ 327 (532) T protein:vir:99 248 YPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGDFVAGR 327 (532) T ss_pred cccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCceeccccccchhhhccCCCcceecCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccCCCccchHHHHHHHHHHHHHHHHHHhhccc-CCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 312 AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYV 390 (510) Q Consensus 312 ~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~-~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 390 (510) ++++++++.++++||+++++.|++++++|+++||++.+ +++++|||||||++|++|++.+||||||||++|||.|||+| T Consensus 328 ~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 407 (532) T protein:vir:99 328 KQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKI 407 (532) T ss_pred cccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999854 69999999999999999999999999999999999999999 Q ss_pred HHHHHhhcC-CCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhcc Q lcl|Aclame:pro 391 CLSEVDDAL-LQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFY 469 (510) Q Consensus 391 ~~~il~~~~-l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~ 469 (510) +|++|++++ +|++|++.+++.+++++++|+|+|+++++.+|++.++++.| +++++||+|++++.+|+++|||+..|+ T Consensus 408 ~~~il~r~g~lP~~p~~~~~~~iv~~is~Laraq~~~~l~~~~~~laq~~p--~~~d~id~d~~~~~~a~~~GV~~~~i~ 485 (532) T protein:vir:99 408 LLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAG--LQDDDINLLDVKMRLANSLGMDTTGLI 485 (532) T ss_pred HHHHHHhcCCCCCCChhhcccceeecchHHHHHHHHHHHHHHHHHHHhhcc--hhhhhCCHHHHHHHHHHHhCCChhhcc Confidence 999998765 56677777899999999999999999999999999888754 578999999999999999999877899 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHhhcccCCC Q lcl|Aclame:pro 470 KSADELQAEAEEQRRQAAQAQAAQE---TLLEGASDMTNALAGV 510 (510) Q Consensus 470 ~s~ee~~~~~~~~~qqa~~~~~a~~---~~~~~a~~~~~~~ag~ 510 (510) ||+||+++++++++++++++++..+ ...+.+++...+++|- T Consensus 486 r~~ee~~~~~~q~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 529 (532) T protein:vir:99 486 LTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGM 529 (532) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhHHhhcCC Confidence 9999999988776665544433322 1122234456677777 No 6 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=100.00 E-value=7.7e-162 Score=903.87 Aligned_cols=505 Identities=29% Similarity=0.419 Sum_probs=457.3 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) -.++|++||++|| |++|+++|+||++||+|++|+++++.++++..++|||||++|+++|||||||+|||+ +|||||+ T Consensus 9 ~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~-~~WFrl~ 87 (536) T protein:vir:21 9 AEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALFPM-QTWMRLT 87 (536) T ss_pred hHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCC-Ccccccc Confidence 5669999999996 899999999999999999999999888888899999999999999999999999976 6999999 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC----eEEEEEeceEE Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA----TVVAWSLRSYA 154 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~----~~~~~pl~~~~ 154 (510) ++|..+++........+++++||+.||++++.+|++||||.++|++|+||++|||+|+|++++.. .|++|||++|| T Consensus 88 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~f~~~pl~~~~ 167 (536) T protein:vir:21 88 ISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYV 167 (536) T ss_pred cChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEEcCeEE Confidence 99999988887788889999999999999999999999999999999999999999999987754 38899999999 Q ss_pred EeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeec-cccccc Q lcl|Aclame:pro 155 VRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVG-ETGRWP 233 (510) Q Consensus 155 v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~-~~~~y~ 233 (510) |.+|++|+||+|||||+||+++|+++||+++.+...+++|+++|+|||||+|+++ +++ +++|++++|+++. ++|.|+ T Consensus 168 v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~-~~~-~~~~~e~~g~~v~~~~g~~~ 245 (536) T protein:vir:21 168 VQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDED-SGE-YLRYEEVEGMEVQGSDGTYP 245 (536) T ss_pred EeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecC-CCc-EEEEeccCCeeeccccCccc Confidence 9999999999999999999999999999999888888999999999999999865 455 4789999999884 556667 Q ss_pred cccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCcc Q lcl|Aclame:pro 234 IHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAE 313 (510) Q Consensus 234 ~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~ 313 (510) |++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|+|+|+|+|+++|.++..+++|.+++|.++ T Consensus 246 f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~ 325 (536) T protein:vir:21 246 KEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPE 325 (536) T ss_pred cccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCCCcceecCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 314 AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCL 392 (510) Q Consensus 314 ~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~ 392 (510) +++++++++++||+++++.|++++++|+++||++. .++++++||||||++|++|++.+|||||+||++|||.|||+|+| T Consensus 326 ~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~ 405 (536) T protein:vir:21 326 DISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLL 405 (536) T ss_pred cceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999975 46999999999999999999999999999999999999999999 Q ss_pred HHHhhcC-CCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhh-cCCHHHHHHHHHHHcCCCHhhccC Q lcl|Aclame:pro 393 SEVDDAL-LQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP-RISLPKMMDTIWAAFSVDTSQFYK 470 (510) Q Consensus 393 ~il~~~~-l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~-~id~d~~~~~~a~~~Gvp~~~i~~ 470 (510) ++|++++ +|++|++.++++|+|+|++++|+++++++.+|++.+++++| ++++ +||+|++++++|+++||++..++| T Consensus 406 ~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~P--e~ld~~id~d~~~~~~a~~~Gv~p~~~ir 483 (536) T protein:vir:21 406 KQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAP--MRDDPDINLAMIKLRIANAIGIDTSGILL 483 (536) T ss_pred HHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhch--hhhcccCCHHHHHHHHHHHcCCChhhhcC Confidence 9998755 66677788999999999999999999999999999988653 4454 699999999999999995567999 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHhhcccCCC Q lcl|Aclame:pro 471 SADELQAEAEEQRRQAAQAQAAQETLLEG---------ASDMTNALAGV 510 (510) Q Consensus 471 s~ee~~~~~~~~~qqa~~~~~a~~~~~~~---------a~~~~~~~ag~ 510 (510) |+||+++.++++++++++++++.++..+. ++..+..++|+ T Consensus 484 t~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g~ 532 (536) T protein:vir:21 484 TEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGL 532 (536) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhHHhhhhcccc Confidence 99999998877666665555554322211 12233334444 No 7 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=100.00 E-value=9.8e-162 Score=903.28 Aligned_cols=505 Identities=29% Similarity=0.425 Sum_probs=456.9 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) -.++|++||++|| |++|+++|+||++||+|++|+++++.++++..++|||||++|+++|||||||+|||+ +|||||+ T Consensus 9 ~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~-~~WFrl~ 87 (536) T protein:vir:10 9 AEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALFPM-QTWMRLT 87 (536) T ss_pred hHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhhcCC-Ccccccc Confidence 5669999999996 899999999999999999999999888888899999999999999999999999976 6999999 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC----eEEEEEeceEE Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA----TVVAWSLRSYA 154 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~----~~~~~pl~~~~ 154 (510) ++|..+++........+++++||+.||++++.+|++||||.++|++|+||++|||+|+|++++.. .|++|||++|| T Consensus 88 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~~~~~pl~~~~ 167 (536) T protein:vir:10 88 ISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYV 167 (536) T ss_pred cChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEEcCeEE Confidence 99999988887788889999999999999999999999999999999999999999999987754 38899999999 Q ss_pred EeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeee-ccccccc Q lcl|Aclame:pro 155 VRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRV-GETGRWP 233 (510) Q Consensus 155 v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~-~~~~~y~ 233 (510) |.+|++|+||+|||||+||+++|+++||+++.+...+++|+++|+|||||+|+++. ++ +++|++++|+.+ .++|.|+ T Consensus 168 v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~-~~-~~~~~e~~g~~v~~~~g~~~ 245 (536) T protein:vir:10 168 VQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEAS-GE-YLRYEEVEGMEVQGSDGTYP 245 (536) T ss_pred EeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecCC-Cc-EEEEEeecCccccccccccc Confidence 99999999999999999999999999999998888889999999999999998643 44 478889999988 4566678 Q ss_pred cccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCcc Q lcl|Aclame:pro 234 IHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAE 313 (510) Q Consensus 234 ~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~ 313 (510) |++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|+|+|+|+|+++|.++..+++|.+++|.++ T Consensus 246 f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~ 325 (536) T protein:vir:10 246 KEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPE 325 (536) T ss_pred cccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCCCcceecCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 314 AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCL 392 (510) Q Consensus 314 ~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~ 392 (510) +++++++++++||+++++.|++++++|+++||++. .++++++||||||++|++|++.+|||||+||++|||.|||+|+| T Consensus 326 ~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~ 405 (536) T protein:vir:10 326 DISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLL 405 (536) T ss_pred cceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999975 46999999999999999999999999999999999999999999 Q ss_pred HHHhhcC-CCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhh-cCCHHHHHHHHHHHcCCCHhhccC Q lcl|Aclame:pro 393 SEVDDAL-LQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP-RISLPKMMDTIWAAFSVDTSQFYK 470 (510) Q Consensus 393 ~il~~~~-l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~-~id~d~~~~~~a~~~Gvp~~~i~~ 470 (510) ++|++++ +|++|++.++++|+|+|++++|+++++++.+|++.+++++| ++++ .||+|++++++|+++||++..++| T Consensus 406 ~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~P--~~ld~~id~d~~~~~~a~~~Gv~p~~~ir 483 (536) T protein:vir:10 406 KQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAP--MRDDPDINLAMIKLRIANAIGIDTSGILL 483 (536) T ss_pred HHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhch--hhhcccCCHHHHHHHHHHHcCCCchhhcC Confidence 9998755 66677788999999999999999999999999999988653 5555 699999999999999995557999 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHhhcccCCC Q lcl|Aclame:pro 471 SADELQAEAEEQRRQAAQAQAAQETLLEG---------ASDMTNALAGV 510 (510) Q Consensus 471 s~ee~~~~~~~~~qqa~~~~~a~~~~~~~---------a~~~~~~~ag~ 510 (510) |+||+++.++++++++++++++.++.... ++..+..++|+ T Consensus 484 t~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g~ 532 (536) T protein:vir:10 484 TEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGL 532 (536) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhHHhhhhcccc Confidence 99999998877666665555554322211 12233334444 No 8 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=100.00 E-value=2.1e-161 Score=901.42 Aligned_cols=500 Identities=30% Similarity=0.415 Sum_probs=453.7 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) =|+||++||++|| |++|+++|+||++||+|++++++++. .+.+++|||||++|+++||||||++||||++|||||+ T Consensus 8 e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~--~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 85 (517) T protein:vir:10 8 NKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDD--LSSQNAWQDDGASATNFLSNKLSQVLFPAQRSFFRID 85 (517) T ss_pred cHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCC--ccccccccchHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 2599999999995 99999999999999999999877643 3346899999999999999999999999999999999 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCCeEEEEEeceEEEeeC Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRD 158 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~~~~~~pl~~~~v~~d 158 (510) ++|..+++.+......++++.||++||++++.+|++||||.++|++|+||++|||+|+|+++...+|++|||++|||.+| T Consensus 86 ~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~~~~pl~~y~v~~d 165 (517) T protein:vir:10 86 LTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYHPDKTSPIQAVPLHHYCVRRD 165 (517) T ss_pred CCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEeCCCCcEEEEEcCeEEEeeC Confidence 99999999998999999999999999999999999999999999999999999999999998888899999999999999 Q ss_pred CCCceeEEEEEEEecHHHHhHHhhHHhhc--ccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeecccccccccc Q lcl|Aclame:pro 159 ATGRWMDIVLKQRYKSKDLDDVYKQDLMR--AGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHL 236 (510) Q Consensus 159 ~~G~v~~i~r~~~~t~~~l~~~~~~~~~~--~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~ 236 (510) ++|+|++||||+++|+++|+++||.+... ....++|+++|+|||||+|+.++ ++++|+++||+.++.+|+|++++ T Consensus 166 ~~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~---~~~~~~~~d~~~~~~~s~y~~~e 242 (517) T protein:vir:10 166 NNGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRTKDG---KYLIRQSADDVPVGKESTVTEDK 242 (517) T ss_pred CCcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEEeCCC---ceEEEEEeCceeecccccccccc Confidence 99999999999999999999999987643 23467899999999999998654 46899999999999999999999 Q ss_pred CceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCccccc Q lcl|Aclame:pro 237 CPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVR 316 (510) Q Consensus 237 ~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~~v~ 316 (510) |||+++||++.+||+||||||+++|||+|+||.|+++++++++++++|||+|+|+|+++|.++..+++|.+++|+++++. T Consensus 243 ~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~l~~~~~g~~~~g~~~~v~ 322 (517) T protein:vir:10 243 SPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQFVEGGSGAVLHGVEGDIH 322 (517) T ss_pred CCeeeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhccCCCccccccCCcccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCccchHHHHHHHHHHHHHHHHHHhhcc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 317 AYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEV 395 (510) Q Consensus 317 ~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il 395 (510) ++++++++||+++++.|++++++|+++||+++ .++++++||||||++|++|++.+||||||||++|||.|||+|+|.+| T Consensus 323 ~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l 402 (517) T protein:vir:10 323 IVQLGKYADYTPIQAVLNDYRQRIGRVFMMEAMTRRDAERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGPLARWFMNGI 402 (517) T ss_pred eeecccccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999999985 56899999999999999999999999999999999999999999998 Q ss_pred hhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcCh-HhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHH Q lcl|Aclame:pro 396 DDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI-AQLDPRISLPKMMDTIWAAFSVDTSQFYKSADE 474 (510) Q Consensus 396 ~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~-~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee 474 (510) .+. +|+..++|+|+|+|++|+|+++++++.+|+++++++++. ++++++||+|++++++|+++|||+ .++||+|| T Consensus 403 ~~~----l~~~~v~~~~~s~la~l~r~~~~~~i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~Gvp~-~~irs~~e 477 (517) T protein:vir:10 403 SSI----LTSKNVSPTILTGIEALGRMAELDKLGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQISANF-PFFKTQDE 477 (517) T ss_pred hhh----cCCCCccceeeccHHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHhCCCh-hhcCCHHH Confidence 743 444568999999999999999999999999999999875 668889999999999999999998 59999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc---cCCC Q lcl|Aclame:pro 475 LQAEAEEQRRQAAQAQAAQETLLEGASDMTNA---LAGV 510 (510) Q Consensus 475 ~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~---~ag~ 510 (510) +++++++++++++++++++++..+-+..+... ++|= T Consensus 478 v~~~~~~~~~~~~~~~~~~~ag~~~~~~~~~~~~~~~~~ 516 (517) T protein:vir:10 478 LNAEAQAQQEQEATKYAAEQAGKAIPDMVKNGQINPQGG 516 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCC Confidence 99888777766665555432211111112222 2222 No 9 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=100.00 E-value=3.9e-161 Score=900.00 Aligned_cols=504 Identities=30% Similarity=0.444 Sum_probs=454.2 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) =++++++||++|| |++|+++|+||++||+|++++++++.++++..++|||||++|+++|||||||+|||+ +|||||+ T Consensus 11 ~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~-~~WF~l~ 89 (535) T protein:vir:94 11 AENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGLNNLASKLMLALFPM-QTWMKLT 89 (535) T ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCccccccCCcccccHHHHHHHHHHHHHhhhcCC-CCccccc Confidence 4677999999996 889999999999999999999999888888999999999999999999999999976 6999999 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC---eEEEEEeceEEE Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRSYAV 155 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~---~~~~~pl~~~~v 155 (510) ++|..++++...+.+.+++++||++||++++.+|++||||.++|++|+||++|||+|+|++++.+ +|++|||++||| T Consensus 90 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~~y~v 169 (535) T protein:vir:94 90 ISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPEPEGTYNPMKLYRLSSYVV 169 (535) T ss_pred cChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeccCcCcccceEEEEcCeEEE Confidence 99999998888889999999999999999999999999999999999999999999999988754 699999999999 Q ss_pred eeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeee-cccccccc Q lcl|Aclame:pro 156 RRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRV-GETGRWPI 234 (510) Q Consensus 156 ~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~-~~~~~y~~ 234 (510) .+|++|+|++|||||++++++|+++|++++.++. +++++++|+|||||+|++ ++|+|.+ |++++|+.+ +.++.|++ T Consensus 170 ~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~-~~~~~~~v~v~~~v~~~~-~~~~~~~-~~e~~g~~~~~~~~~~g~ 246 (535) T protein:vir:94 170 QRDAFGTVLQIVTLDKTAYAALPEDVRNSMDSSQ-EHKGDEMIDVYTHIYLDE-ESGEYLK-YEEIDGVEVEGTDASYPV 246 (535) T ss_pred eeCCCCCeEEEEeeeeccHHHhhHHHHHHHHhcc-ccCCCceeEEEEEEEeeC-CCCcEEE-EEEecCeeeccccccCcc Confidence 9999999999999999999999999999887654 578999999999999865 4688765 568888776 56787888 Q ss_pred ccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCccc Q lcl|Aclame:pro 235 HLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEA 314 (510) Q Consensus 235 ~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~~ 314 (510) ++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|+|+++|+|+++|.++..+++|.+++|.+++ T Consensus 247 ~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~~ 326 (535) T protein:vir:94 247 DACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQVRRLTKAQTGDFVSGRPED 326 (535) T ss_pred ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccccchhhcccCCCceeecCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 315 VRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLS 393 (510) Q Consensus 315 v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~ 393 (510) +++++.++++||+.+.+.|++++++|+++||++. .++++++||||||++|++|++++||||||||++|||.|||+|+|+ T Consensus 327 v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~ 406 (535) T protein:vir:94 327 ISFLQLEKAADFSVARAVSEQIEGRLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLK 406 (535) T ss_pred ceeeecccccchhHHHHHHHHHHHHHHHHHhHhhhccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999975 469999999999999999999999999999999999999999999 Q ss_pred HHhhcC-CCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhh-cCCHHHHHHHHHHHcCCCHhhccCC Q lcl|Aclame:pro 394 EVDDAL-LQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP-RISLPKMMDTIWAAFSVDTSQFYKS 471 (510) Q Consensus 394 il~~~~-l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~-~id~d~~~~~~a~~~Gvp~~~i~~s 471 (510) +|++++ +|++|++.++++|+|+|++++|+++++++.+|++.+++++| ++++ +||+|++++.+++++|||+..|+|| T Consensus 407 il~r~g~lP~~p~~~v~~~~vs~la~l~r~~~~~~l~~~~~~laq~~P--~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs 484 (535) T protein:vir:94 407 QLQATNQIPELPKEAVEPTISTGMEALGRGQDLDKLERCIAAWSALAP--MQGDPDINIATIKLRIANAIGIDTSGILKT 484 (535) T ss_pred HHHhCCCCCCCChhhccceEeehHHHHHHHHHHHHHHHHHHHHHhhCh--HHhhhcCCHHHHHHHHHHHhCCChhhhcCC Confidence 998755 66778888999999999999999999999999999888653 5555 7999999999999999998789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHhhcccCCC Q lcl|Aclame:pro 472 ADELQAEAEEQRRQAAQAQAAQETLLEG---------ASDMTNALAGV 510 (510) Q Consensus 472 ~ee~~~~~~~~~qqa~~~~~a~~~~~~~---------a~~~~~~~ag~ 510 (510) +||++++++++++|++++.++.++..+. +++....+.|. T Consensus 485 ~eev~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~ 532 (535) T protein:vir:94 485 PEEKQQEMAEAAQGTAMQNAAASAGAGAGTMATASPENMKAAAAQAGM 532 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccChHHHHHHHHHhcc Confidence 9999988866666554443332211100 12223344455 No 10 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=100.00 E-value=1.2e-160 Score=897.25 Aligned_cols=506 Identities=30% Similarity=0.411 Sum_probs=459.0 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) =+++|++||+.|| |++|+++|+||++||+|++|+++++.++++..++|||||++|+++|||||||+|||+ +|||||+ T Consensus 10 ~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~-~~WF~l~ 88 (535) T protein:vir:15 10 GEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALFPM-QSWMKLT 88 (535) T ss_pred chHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCC-Ccccccc Confidence 4677889999996 899999999999999999999999888888899999999999999999999999986 7999999 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC---eEEEEEeceEEE Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRSYAV 155 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~---~~~~~pl~~~~v 155 (510) ++|..+++....+...++++.||++||++++.+|++||||.++|++|+||++|||+|+|++++.+ +|++|||++||| T Consensus 89 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~~~~v 168 (535) T protein:vir:15 89 ISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYRLSSYVV 168 (535) T ss_pred cChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEEcCeeEE Confidence 99999999888889999999999999999999999999999999999999999999999988754 599999999999 Q ss_pred eeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeee-cccccccc Q lcl|Aclame:pro 156 RRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRV-GETGRWPI 234 (510) Q Consensus 156 ~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~-~~~~~y~~ 234 (510) .+|++|+||+|||||+||+++|+++|+.++.+...+++++++|+|||||+++++ +++ +++|++++|..+ +.+++|++ T Consensus 169 ~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~-~~~-~~~~~e~~g~~~~~~~~~~~~ 246 (535) T protein:vir:15 169 QRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEE-SGD-YLKYEEVEDVEIDGSDATYPT 246 (535) T ss_pred eeCCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEecC-CCc-EEEEEEeeCcccccccccccc Confidence 999999999999999999999999999999888888999999999999999754 344 467788888776 67899999 Q ss_pred ccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCccc Q lcl|Aclame:pro 235 HLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEA 314 (510) Q Consensus 235 ~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~~ 314 (510) ++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|||+|+|+|++++.++..+++|.+++|.+++ T Consensus 247 ~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~l~~~~~g~~v~g~~~~ 326 (535) T protein:vir:15 247 DAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGDFVPGRRED 326 (535) T ss_pred ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccchhcccCCceeeecCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 315 VRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLS 393 (510) Q Consensus 315 v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~ 393 (510) +++++.++++||+.+++.|++++++|+++||++. .+++++|||||||++|++|++++|||||+||++|||.|||+|+|+ T Consensus 327 v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~ 406 (535) T protein:vir:15 327 IDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLK 406 (535) T ss_pred ceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999985 469999999999999999999999999999999999999999999 Q ss_pred HHhhc-CCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCH Q lcl|Aclame:pro 394 EVDDA-LLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSA 472 (510) Q Consensus 394 il~~~-~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ 472 (510) +|++. .+|++|+++++++|+|+|++++|+++++++.+|++.++++.| +.++++||+|++++++++++|||++.|+||+ T Consensus 407 il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P-~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~ 485 (535) T protein:vir:15 407 QLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAP-MQGDPDINLAVIKLRIANAIGIDTSGILLTD 485 (535) T ss_pred HHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhcCh-hhhhccCCHHHHHHHHHHHcCCChhhhcCCH Confidence 99875 567788889999999999999999999999999999988654 4455579999999999999999998899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHhhcccCCC Q lcl|Aclame:pro 473 DELQAEAEEQRRQAAQAQAAQETLL---------EGASDMTNALAGV 510 (510) Q Consensus 473 ee~~~~~~~~~qqa~~~~~a~~~~~---------~~a~~~~~~~ag~ 510 (510) ||++++++++++++++++++.++.. +.+++..-.++|+ T Consensus 486 eev~~~~~q~~~~~~~~~~a~~~g~~~~~~~~~~p~~~~~~~~~~g~ 532 (535) T protein:vir:15 486 EQKQALMMQDAAQTGIENAAATGGAGVGALATSSPEAMQGAAAQAGL 532 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccchhccChHHHHHHHhccCC Confidence 9999988777666655544432111 1112222234444 No 11 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=100.00 E-value=3.4e-161 Score=900.30 Aligned_cols=494 Identities=30% Similarity=0.449 Sum_probs=452.1 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) =|++|++||++|| |++|+++|+||++||+|++|+++++. .+.+++|||||++|+++|||||||+||||++|||||+ T Consensus 12 ~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~--~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~L~ 89 (516) T protein:vir:96 12 KRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGDN--ETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVD 89 (516) T ss_pred hHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcccccCCCCCc--cccCCcccchHHHHHHHHHHHHHhhhcCCCCcccccc Confidence 6789999999996 89999999999999999999877643 3456899999999999999999999999999999999 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCCeEEEEEeceEEEeeC Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRD 158 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~~~~~~pl~~~~v~~d 158 (510) ++|..++++++.+.+..++++||++||++++.+|++||||.++|++|+||++|||+|+|++++ ++|++|||++|||.+| T Consensus 90 ~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~-~~~~~~pl~~y~v~~d 168 (516) T protein:vir:96 90 LTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSK-GAISAIPMHHYVVNRD 168 (516) T ss_pred cChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCC-CCEEEEEcCeEEEeeC Confidence 999999888888889999999999999999999999999999999999999999999999876 4799999999999999 Q ss_pred CCCceeEEEEEEEecHHHHhHHhhHHhh--cccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeecccccccccc Q lcl|Aclame:pro 159 ATGRWMDIVLKQRYKSKDLDDVYKQDLM--RAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHL 236 (510) Q Consensus 159 ~~G~v~~i~r~~~~t~~~l~~~~~~~~~--~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~ 236 (510) ++|+|++||||+++++++|+++|+.... +...+++++.+|+|||||+|++++ |+++|+++|+++++.+|+|++++ T Consensus 169 ~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~---~~~~~~~~d~~~~~~es~~~~~e 245 (516) T protein:vir:96 169 TNGDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDG---FWELKQSADDIPVGKVSKIKSEK 245 (516) T ss_pred CCCCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCc---eeEEEEEeCceeecccccccccc Confidence 9999999999999999999999976542 334567899999999999998764 78999999999999999999999 Q ss_pred CceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCccccc Q lcl|Aclame:pro 237 CPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVR 316 (510) Q Consensus 237 ~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~~v~ 316 (510) |||+++||++.+||+||||||+++|||+|+||.|+++++++++++++|+|+|+|+|+++|+++..+++|.+++|++++|+ T Consensus 246 ~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~~~~g~i~~g~~~~v~ 325 (516) T protein:vir:96 246 LPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVNSGTGEVVTGVEEDIH 325 (516) T ss_pred CCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhhhccCCCceeecCCcccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCccchHHHHHHHHHHHHHHHHHHhhc-ccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 317 AYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEV 395 (510) Q Consensus 317 ~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~-~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il 395 (510) ++++++++||+.++..|++++++|+++||++ +.++++++||||||++|++|++.+||||||||++|||.|||+|++.++ T Consensus 326 ~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~r~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~ 405 (516) T protein:vir:96 326 IVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEA 405 (516) T ss_pred eeecCcccchhHHHHHHHHHHHHHHHHHhhhhhccCCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhc Confidence 9999999999999999999999999999997 567899999999999999999999999999999999999999998876 Q ss_pred hhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcCh-HhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHH Q lcl|Aclame:pro 396 DDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI-AQLDPRISLPKMMDTIWAAFSVDTSQFYKSADE 474 (510) Q Consensus 396 ~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~-~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee 474 (510) . |++|+.+++++|+++|++|+|+++++++.++++.++.+++. |+++++||+|++++++++++|||++ ++||+|| T Consensus 406 ~----p~lp~~~v~~~~vs~l~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~-~irs~ee 480 (516) T protein:vir:96 406 G----ESFTSDLVDPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELP-FLKSAEE 480 (516) T ss_pred C----CCCccccccceeechHHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHhCCCcc-ccCCHHH Confidence 4 78888899999999999999999999999999999999985 8999999999999999999999985 9999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 475 LQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 475 ~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ++++++++++++++++++.+ +++....++|= T Consensus 481 v~~~~~~~~~~q~~~~~a~~-----~~~~~~~~~~~ 511 (516) T protein:vir:96 481 MAQEQEAQMQAQQAQMLEEG-----VAKAVPGVIQQ 511 (516) T ss_pred HHHHHHHHHHHHHHHHHHHH-----hhhhhhHHhhc Confidence 99887766665544443322 11111111111 No 12 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=100.00 E-value=1.5e-160 Score=896.73 Aligned_cols=505 Identities=27% Similarity=0.423 Sum_probs=456.5 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) =|++|++||+.|+ |++|+++|+||++||+|++|+++++.++++..++|||||++|+++|||||||+|||+ +|||||+ T Consensus 10 ~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~-~~WF~l~ 88 (543) T protein:vir:88 10 AEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSSTDYTTPWQAVGARGLNNLSAKVMLALFPL-QSWMKLK 88 (543) T ss_pred hHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCC-Ccccccc Confidence 4778999999996 899999999999999999999998888888889999999999999999999999986 7999999 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCCe------EEEEEece Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEAT------VVAWSLRS 152 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~~------~~~~pl~~ 152 (510) ++|..+++......+.++++.||++||++++++|++||||.++|++|+||++|||+|+|++++.++ |+.|||++ T Consensus 89 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~~~~~~~~~~pl~~ 168 (543) T protein:vir:88 89 VSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDASSNSYNPMKLYTLHN 168 (543) T ss_pred cChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCccccceecceEEeEcce Confidence 999999888788888999999999999999999999999999999999999999999999987642 67799999 Q ss_pred EEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeee-ccccc Q lcl|Aclame:pro 153 YAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRV-GETGR 231 (510) Q Consensus 153 ~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~-~~~~~ 231 (510) |+|.+|++|+|++|||||++|+++|+++|++++.+.. +++|+++|+|||+|+|+++++ + +++|++++|+.+ +.+++ T Consensus 169 y~v~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~-~~~p~~~~~v~~~V~pr~~~~-~-~~~~~~~~~~~v~~~~~~ 245 (543) T protein:vir:88 169 HVVQRDAFGNVLQIVTLDKVAYAALPEDVRNSLSGGQ-EYKPEQELEVYTHIYIDDESG-D-FLSYQEIEGVEVDGSDGQ 245 (543) T ss_pred EEEeeCCCCCeeeeeeeeeccHHHHhHHhhHHHHHHh-hcCCccceEEEEEEEeecCCC-c-ccccccccCeeeecCCCc Confidence 9999999999999999999999999999998886544 678999999999999987643 3 457889999888 57788 Q ss_pred cccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCC Q lcl|Aclame:pro 232 WPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGG 311 (510) Q Consensus 232 y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~ 311 (510) |++++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|||+|+|+|++++.++..+++|.+++|. T Consensus 246 ~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~~~~~~~g~~v~g~ 325 (543) T protein:vir:88 246 YPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVRRLVKAQTGDFVAGR 325 (543) T ss_pred cccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCCceeecCC Confidence 98999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccCCCccchHHHHHHHHHHHHHHHHHHhhccc-CCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 312 AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYV 390 (510) Q Consensus 312 ~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~-~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 390 (510) ++++.+++.++++||+.+++.|++++++|+++||++.+ ++++++||||||++|++|++++|||||+||++|||.|||+| T Consensus 326 ~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r 405 (543) T protein:vir:88 326 KADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLNSAVQRSGERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRV 405 (543) T ss_pred CCcceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCCCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999854 69999999999999999999999999999999999999999 Q ss_pred HHHHHhhc-CCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhcc Q lcl|Aclame:pro 391 CLSEVDDA-LLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFY 469 (510) Q Consensus 391 ~~~il~~~-~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~ 469 (510) +|++|++. .+|++|+++++++|+|+|++|+|+++++++.+++++++.+++ |+++|+||+|++++++++++|||+..|+ T Consensus 406 ~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~v~~~~~-p~vld~id~d~~~~~~a~~~Gv~~~~i~ 484 (543) T protein:vir:88 406 LLNQLQATQQIPNLPQEAVEPTVTTGAEALGRGQDLDKLTQFLNAVATVSQ-LNGDPDLNVNNIKLRLANAIGIDTAGLL 484 (543) T ss_pred HHHHHHhcCCCCCCchhceeeeEEecHHHHHHHHHHHHHHHHHHHHHhccc-hhhhccCCHHHHHHHHHHHhCCChhhhc Confidence 99999875 567788889999999999999999999999999999999987 7889999999999999999999877899 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHHHhhc---ccCCC Q lcl|Aclame:pro 470 KSADELQAEAEEQRRQAAQAQAAQE-------------TLLEGASDMTN---ALAGV 510 (510) Q Consensus 470 ~s~ee~~~~~~~~~qqa~~~~~a~~-------------~~~~~a~~~~~---~~ag~ 510 (510) ||+||++++++++++|+++++++.+ +.+++|.+.++ +++|- T Consensus 485 r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 541 (543) T protein:vir:88 485 LTEAEKAQAQSQEMLKQGGLNAAAGIGSGVAAQATASPEAMESAMDTAGVQPGPIAT 541 (543) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhccChHHHHHHhhhcCCCCCCCCC Confidence 9999999887766555443333332 11222222222 11221 No 13 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=100.00 E-value=1.9e-160 Score=896.28 Aligned_cols=494 Identities=31% Similarity=0.455 Sum_probs=451.4 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) =|++|++||++|| |++|+++|+||++||+|++|+++++.. ..+++|||||++|+++||||||++||||++|||||+ T Consensus 11 ~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~~~~--~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 88 (515) T protein:vir:70 11 QRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNE--TSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVD 88 (515) T ss_pred CHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccccCCCCCcc--cccccccchHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 4789999999995 999999999999999999998776543 346899999999999999999999999999999999 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCCeEEEEEeceEEEeeC Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRD 158 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~~~~~~pl~~~~v~~d 158 (510) ++|...++++..+.+..++++||+.||+.++.+|++||||.++|++|+||++|||+|+|++++ ++|++|||++|||.+| T Consensus 89 ~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~-~~~~~~pl~~y~v~~d 167 (515) T protein:vir:70 89 LTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSK-GAMSAVPMHHYVVNRD 167 (515) T ss_pred cChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEEeCC-CCeEEEEcCeEEEeeC Confidence 999988888888889999999999999999999999999999999999999999999999876 4699999999999999 Q ss_pred CCCceeEEEEEEEecHHHHhHHhhHHhhc--ccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeecccccccccc Q lcl|Aclame:pro 159 ATGRWMDIVLKQRYKSKDLDDVYKQDLMR--AGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHL 236 (510) Q Consensus 159 ~~G~v~~i~r~~~~t~~~l~~~~~~~~~~--~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~ 236 (510) ++|+||+|||||++|+++|+++||.+... ...+++|+++|+|||||+|+++ +||++|++++|+.++.+|+|++++ T Consensus 168 ~~G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~---~~~~~~~e~d~~~~~~es~y~~~e 244 (515) T protein:vir:70 168 TNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGE---GFWKINQSADDIPVGKESRIKSEK 244 (515) T ss_pred CCcCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecCC---CceEEEEecCceeecccccccccc Confidence 99999999999999999999999987643 2345678999999999999863 689999999999999999999999 Q ss_pred CceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCccccc Q lcl|Aclame:pro 237 CPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVR 316 (510) Q Consensus 237 ~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~~v~ 316 (510) |||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|+|+++|+|+++|.++..+++|.++||.+++++ T Consensus 245 ~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~l~~~~~g~iv~g~~~~v~ 324 (515) T protein:vir:70 245 LPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGVAEDIH 324 (515) T ss_pred CCceeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhhccccCCceeecCCcccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCccchHHHHHHHHHHHHHHHHHHhhc-ccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 317 AYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEV 395 (510) Q Consensus 317 ~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~-~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il 395 (510) ++++++++||+.++..|++++++|+++||++ +.++++++||||||++|++|++.+||||||||++|||.|||.|++. T Consensus 325 ~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~srL~~Ell~Pli~r~~~-- 402 (515) T protein:vir:70 325 IVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ-- 402 (515) T ss_pred eeecCcccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH-- Confidence 9999999999999999999999999999997 5678999999999999999999999999999999999999999754 Q ss_pred hhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcCh-HhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHH Q lcl|Aclame:pro 396 DDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI-AQLDPRISLPKMMDTIWAAFSVDTSQFYKSADE 474 (510) Q Consensus 396 ~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~-~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee 474 (510) +.+|++|++++++++++++++|+|+++++++.+|+|.++.+++. |++.++||+|++++++++.+|+|.. ++||+|| T Consensus 403 --~~~p~~P~~~v~~~~vs~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~-~~rs~ee 479 (515) T protein:vir:70 403 --EAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELP-FLKSEEE 479 (515) T ss_pred --hhCCCCChhhcccceehhHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhhCCHHHHHHHHHHHhCCCcc-ccCCHHH Confidence 34588888999999999999999999999999999999988876 6799999999999999999999985 9999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 475 LQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 475 ~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) +++++++++|++++++. .++++++++..+|= T Consensus 480 v~~~r~q~~~~~~~~~~-----~~~~~~a~~~~~~~ 510 (515) T protein:vir:70 480 MQQEMAQQAQAQQEAML-----NEGVAKAVPGVIQQ 510 (515) T ss_pred HHHHHHHHHHHHHHHHH-----HHhhhhhcccchhh Confidence 99988776665544333 33344433332222 No 14 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=100.00 E-value=3e-160 Score=895.13 Aligned_cols=497 Identities=27% Similarity=0.354 Sum_probs=452.0 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCC--CCccccccccccchHHHHHHHHHHHHHHhhcCccCcccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPM--SGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~--~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~ 76 (510) || +++||+.|+ |++|+++|+||++||+|+++++++ +.++++..++|||||++|+++||||||++||||++|||| T Consensus 1 m~--~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 78 (522) T protein:vir:10 1 MK--ARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPNHKSLTVPWQSVGAKCCVTLAAKLMLAVLPPQTSFFK 78 (522) T ss_pred Cc--hHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCcccc Confidence 87 889999996 899999999999999999988764 455677789999999999999999999999999999999 Q ss_pred cCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCCeEEEEEeceEEEe Q lcl|Aclame:pro 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVR 156 (510) Q Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~~~~~~pl~~~~v~ 156 (510) |+++|+.+.+. ......+++++||++||++++++|++||||.++|++|+||++|||+|+|++++ +|++|||++|||. T Consensus 79 l~~~d~~l~~~-~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~--~~~~~pl~~y~v~ 155 (522) T protein:vir:10 79 LQVRDDKLGEE-LDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFMGKD--GLKTFPLTRYVIN 155 (522) T ss_pred ccCChHHHhhh-cChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEEcCC--CceEEEcceEEEe Confidence 99999887764 34556788999999999999999999999999999999999999999999987 5899999999999 Q ss_pred eCCCCceeEEEEEEEecHHHHhHHhhHHhhccc--ccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeee-ccccccc Q lcl|Aclame:pro 157 RDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAG--RNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRV-GETGRWP 233 (510) Q Consensus 157 ~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~--~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~-~~~~~y~ 233 (510) +|++|+||+|||||+||+++|+++||.+..+.. ..++++++|+|||||+|+++.+ ++++|++++|+.+ +.++.|+ T Consensus 156 ~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~~~--~~~~~~~~~~~~~~~~~s~~g 233 (522) T protein:vir:10 156 RDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKSSG--RWVWHQEAFDKIIPDSRSTAP 233 (522) T ss_pred eCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeeccCC--ceEEEEccCCccccccccccc Confidence 999999999999999999999999998764332 3468999999999999986643 4678888888654 6678788 Q ss_pred cccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCcc Q lcl|Aclame:pro 234 IHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAE 313 (510) Q Consensus 234 ~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~ 313 (510) +++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|+|+|+|+|++++.++..+++|.+++|.++ T Consensus 234 ~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~l~~~~~~~~v~g~~~ 313 (522) T protein:vir:10 234 KNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPATIAKAGNGAIVQGRPE 313 (522) T ss_pred cccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeccccccccccccCCCCcceecCCCc Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccCCCccchHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 314 AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLS 393 (510) Q Consensus 314 ~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~ 393 (510) ++.+++.++++||+++.+.|++++++|+++||+. .++++++||||||++|++|++.+||||||||++|||.|||+|+|. T Consensus 314 ~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~~-~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~ 392 (522) T protein:vir:10 314 DVAVIQVGKTADFSTAANMATAIEKRLLEAFLVM-NVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLL 392 (522) T ss_pred cceeecccccccchHHHHHHHHHHHHHHHHHhhc-cCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999875 589999999999999999999999999999999999999999999 Q ss_pred HHhhcC-CCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCH Q lcl|Aclame:pro 394 EVDDAL-LQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSA 472 (510) Q Consensus 394 il~~~~-l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ 472 (510) +|++.+ +|++|++.+++.+|+|+++|+|+|+++++.+|++.++++.++++++++||+|++++.+|+++|||+..|+||+ T Consensus 393 il~r~g~lP~~p~~~~~~~~v~~is~Laraq~~~~l~~~~~~i~~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~~ivrt~ 472 (522) T protein:vir:10 393 VLQRSNQIPKLPKDIVRPTIVAGVNALGRGQDRESLTAFVGTIAQTLGPEALMQYLNPLEAIKRLAAAQGIDVLNLVKTE 472 (522) T ss_pred HHHhcCCCCCCCccccccccccchhHHHHHHHHHHHHHHHHHHHHhhCchhhhhcCCHHHHHHHHHHHhCCChhhhcCCH Confidence 998866 5666666779999999999999999999999999999988888899999999999999999999977899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 473 DELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 473 ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ||+++.+|+++++++++ +++++|++++++.++- T Consensus 473 eev~~~~q~~q~~~~~~-----~~~~~a~~~~~~~~~~ 505 (522) T protein:vir:10 473 QQLAEEQQAAQQQAAQQ-----SLVDQAGQMTGSPLMD 505 (522) T ss_pred HHHHHHHHHHHHHHHHH-----HHHHHHHHHhcccccC Confidence 99998776665544433 3456677777777776 No 15 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=100.00 E-value=8.3e-160 Score=892.72 Aligned_cols=506 Identities=30% Similarity=0.417 Sum_probs=458.6 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) =+++|++||+.|+ |++|+++|+||++||+|++|+++++.++++..++|||||++|+++|||||||+|||+ +|||||+ T Consensus 10 ~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~-~~WF~l~ 88 (535) T protein:vir:33 10 GEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALFPM-QSWMKLT 88 (535) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCC-Ccccccc Confidence 4677889999996 899999999999999999999999888888899999999999999999999999986 7999999 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC---eEEEEEeceEEE Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRSYAV 155 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~---~~~~~pl~~~~v 155 (510) ++|..+++.+..+...++++.||++||++++.+|++||||.++|++|+||++|||+|+|++++.+ +|++|||++||| T Consensus 89 ~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~~~~v 168 (535) T protein:vir:33 89 ISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYRLSSYVV 168 (535) T ss_pred cChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEEcCeeEE Confidence 99999999888889999999999999999999999999999999999999999999999988754 599999999999 Q ss_pred eeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeee-cccccccc Q lcl|Aclame:pro 156 RRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRV-GETGRWPI 234 (510) Q Consensus 156 ~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~-~~~~~y~~ 234 (510) .+|++|+||+|||||+||+++|+++||.+..+...++++++++++||||+++ .+|++|. +|++++|..+ +.+++|++ T Consensus 169 ~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~~-~~~~~~~-~~~~~~~~~~~~~~~~~~~ 246 (535) T protein:vir:33 169 QRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLD-EESGDYL-KYEEVEDVEIDGSDATYPT 246 (535) T ss_pred eeCCCCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEEEee-CCCCcEE-EEEEEeCcccccccccccc Confidence 9999999999999999999999999999998888889999999999999885 4456665 4567788776 68899999 Q ss_pred ccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCccc Q lcl|Aclame:pro 235 HLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEA 314 (510) Q Consensus 235 ~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~~ 314 (510) ++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|||+|+|+|++++.++..+++|.+++|.+++ T Consensus 247 ~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~~~~~~~g~~v~g~~~~ 326 (535) T protein:vir:33 247 DAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGDFVPGRRED 326 (535) T ss_pred ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCceeeecCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 315 VRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLS 393 (510) Q Consensus 315 v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~ 393 (510) +++++.++++||+.+++.|++++++|+++||++. .+++++|||||||++|++|++++|||||+||++|||.|||+|+|+ T Consensus 327 v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~ 406 (535) T protein:vir:33 327 IDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLK 406 (535) T ss_pred ceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999985 469999999999999999999999999999999999999999999 Q ss_pred HHhhc-CCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCH Q lcl|Aclame:pro 394 EVDDA-LLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSA 472 (510) Q Consensus 394 il~~~-~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ 472 (510) +|++. .+|++|+++++++|+|+|++++|+++++++.+|++.++++.| +.++++||+|++++++++++|||++.|+||+ T Consensus 407 il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P-~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ 485 (535) T protein:vir:33 407 QLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAP-MQGDPDINLAVIKLRIANAIGIDTSGILLTD 485 (535) T ss_pred HHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhhCh-hhhhccCCHHHHHHHHHHHcCCCHhHhcCCH Confidence 99875 567788889999999999999999999999999999988654 4445579999999999999999998899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHhhcccCCC Q lcl|Aclame:pro 473 DELQAEAEEQRRQAAQAQAAQETLL---------EGASDMTNALAGV 510 (510) Q Consensus 473 ee~~~~~~~~~qqa~~~~~a~~~~~---------~~a~~~~~~~ag~ 510 (510) ||+++.++++++++++++++.+.-. +.+++.....+|+ T Consensus 486 ee~~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~ 532 (535) T protein:vir:33 486 EQKQALMMQDAAQTGVENAAAAGGAGVGALATSSPEAMQGAAAKAGL 532 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhcchhhcCChhHHHHHHhccC Confidence 9999988766655544444432111 1122333344555 No 16 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=100.00 E-value=6e-160 Score=893.49 Aligned_cols=491 Identities=30% Similarity=0.460 Sum_probs=450.5 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) =|++|++||++|| |++|+++|+||++||+|++|+++++.. ..+++|||||++|+++||||||++||||++|||||+ T Consensus 12 ~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~--~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~L~ 89 (516) T protein:vir:10 12 KRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGDNE--TSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVD 89 (516) T ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccCCCCCcc--cccccccchHHHHHHHHHHHHHhhhcCCCCcccccc Confidence 4578999999995 999999999999999999998876543 345899999999999999999999999999999999 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCCeEEEEEeceEEEeeC Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRD 158 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~~~~~~pl~~~~v~~d 158 (510) ++|..++++++.+.+..++++||++||++++.+|++||||.++|++|+||++|||+|+|+|++. +|++|||++|||.+| T Consensus 90 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~-~~~~~pl~~y~v~~d 168 (516) T protein:vir:10 90 LTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSKG-AISAIPMHHYVVNRD 168 (516) T ss_pred CChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCCC-CeEEEEcCeEEEeeC Confidence 9999988888888889999999999999999999999999999999999999999999998764 699999999999999 Q ss_pred CCCceeEEEEEEEecHHHHhHHhhHHhh--cccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeecccccccccc Q lcl|Aclame:pro 159 ATGRWMDIVLKQRYKSKDLDDVYKQDLM--RAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHL 236 (510) Q Consensus 159 ~~G~v~~i~r~~~~t~~~l~~~~~~~~~--~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~ 236 (510) ++|+|+++|||+++|+++|+++|++... +...+++|+.+++|||||++++++ ||++|+++|+++++++|+|++++ T Consensus 169 ~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~~~~~---~~~~~~~~d~~~~~~~s~~~~~e 245 (516) T protein:vir:10 169 TNGDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKYLGEG---FWELKQSADDIPVGKVSKIKSEK 245 (516) T ss_pred CCCCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEecCCC---ceEEEEeeCceeecccccccccc Confidence 9999999999999999999999986432 234566899999999999997654 79999999999999999999999 Q ss_pred CceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCccccc Q lcl|Aclame:pro 237 CPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVR 316 (510) Q Consensus 237 ~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~~v~ 316 (510) |||+++||++.+||+||||||+++|||+|+||.|+++++++++++++|+|+|+|+|+++|+++..+++|.++||.+++|+ T Consensus 246 ~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~~~~g~~~~g~~~~v~ 325 (516) T protein:vir:10 246 LPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVNSGTGEVVTGVEEDIH 325 (516) T ss_pred CCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhhhccCCCceeecCCcccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCccchHHHHHHHHHHHHHHHHHHhhc-ccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 317 AYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEV 395 (510) Q Consensus 317 ~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~-~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il 395 (510) ++++++++||+.++..|++++++|+++||++ +.++++++||||||++|++|++.+||||||||++|||.|||+|++..+ T Consensus 326 ~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~ 405 (516) T protein:vir:10 326 IVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEA 405 (516) T ss_pred eeecCcccchHHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhh Confidence 9999999999999999999999999999997 567899999999999999999999999999999999999999998655 Q ss_pred hhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcCh-HhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHH Q lcl|Aclame:pro 396 DDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI-AQLDPRISLPKMMDTIWAAFSVDTSQFYKSADE 474 (510) Q Consensus 396 ~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~-~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee 474 (510) +|++|++.+++.+++||++|+|+++++++.+|+|+++.+++. |+++++||+|++++.+++.+|||+ .++||+|| T Consensus 406 ----~p~~P~~lv~~~~v~~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~~~~~a~~~gvp~-~~irs~ee 480 (516) T protein:vir:10 406 ----GDSFTSDLVDPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAEL-PFLKSAEE 480 (516) T ss_pred ----CCCCChhhcCcceehhHHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHHHHHHHHHhCCCh-hccCCHHH Confidence 488999999999999999999999999999999999999975 789999999999999999999998 49999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 475 LQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 475 ~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ++++++++++++..+++ |.+.++++.|. T Consensus 481 v~~~r~~~~~~q~~~~~--------~~~~~~~~~~~ 508 (516) T protein:vir:10 481 MEQEQEAQMQAQQAQML--------EEGVAKAVPGV 508 (516) T ss_pred HHHHHHHHHHHHHHHHH--------HHHhhhcccch Confidence 99988777655443332 22222233333 No 17 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=100.00 E-value=3.4e-159 Score=889.38 Aligned_cols=503 Identities=26% Similarity=0.375 Sum_probs=446.1 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) ||++|++||++|+ |++|+++|+||++||+|++++++++.++.+..++|||||++|+++|||||||+||||++|||||. T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~ 80 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNTSFFKLQ 80 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 9999999999996 89999999999999999999999988888889999999999999999999999999999999999 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCCeEEEEEeceEEEeeC Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRD 158 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~~~~~~pl~~~~v~~d 158 (510) ++|..+++.......+.+++.||++||++++.+|++||||.++|++|+||++|||+|+|++++ ++++|||++|||.+| T Consensus 81 ~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~--~~~~~pl~~y~v~~d 158 (555) T protein:vir:17 81 INDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLYQGKK--NLKLYPLDRFVVSRD 158 (555) T ss_pred cCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEecCC--ceeEEEcCeEEEeeC Confidence 999999888877788899999999999999999999999999999999999999999999876 588999999999999 Q ss_pred CCCceeEEEEEEEecHHHHhHHhhHHhh----cccccC-----------------CCCceEEEEEEEEeecCCCeeEEEE Q lcl|Aclame:pro 159 ATGRWMDIVLKQRYKSKDLDDVYKQDLM----RAGRNL-----------------SGSGSVDLYTHVQRRKGTAMDYAEM 217 (510) Q Consensus 159 ~~G~v~~i~r~~~~t~~~l~~~~~~~~~----~~~~~~-----------------~~~~~v~v~~~v~~~~~~~~~~~sv 217 (510) ++|+||+|||||+||+++|+++||++.. +...++ +++.++++|+++.++++ +++| T Consensus 159 ~~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~~~~~~~----~~~~ 234 (555) T protein:vir:17 159 GEGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTYVCRKDG----QVKW 234 (555) T ss_pred CCcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcceeEeecccccCC----eeEE Confidence 9999999999999999999999997532 111222 34455666666655443 5789 Q ss_pred EEeeCCeee-ccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccch Q lcl|Aclame:pro 218 YHEIDGVRV-GETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV 296 (510) Q Consensus 218 ~~e~~~~~~-~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~ 296 (510) |++++|+.+ +.++.+++++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|||+|+|+|++++ T Consensus 235 ~~e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~lv~~~g~~~~ 314 (555) T protein:vir:17 235 HQECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVVFMVSPSATTKP 314 (555) T ss_pred EEecCceeccccccccCcccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccCc Confidence 999999887 4456666689999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhcCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCHHHHHHHHHHHHHHhhhhH Q lcl|Aclame:pro 297 DDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTY 376 (510) Q Consensus 297 ~~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~ 376 (510) .++..+++|.+++|.+++|++++.++++||+.+++.|++++++|+++||++ ..+++++||||||++|++|++.+||||| T Consensus 315 ~~l~~~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~~-~~~d~~r~TAtEV~~r~~E~~~~LGpv~ 393 (555) T protein:vir:17 315 QNLALAANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLML-QVRQSERTTATEVQATVQELNEQIGGIY 393 (555) T ss_pred ceeecCCCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhhc-CCCCcccchHHHHHHHHHHHHHHHhHHH Confidence 999999999999999999999999999999999999999999999999985 4689999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcC-CCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHH Q lcl|Aclame:pro 377 SLLAENLQSPLAYVCLSEVDDAL-LQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMD 455 (510) Q Consensus 377 ~rl~~E~l~Pli~r~~~il~~~~-l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~ 455 (510) +||++|||.|||+|+|.+|++.+ +|++|++.+++.+++++.+++|+++++++.+|++.++++.++|+++++||+|++++ T Consensus 394 ~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~~~l~~l~r~~~~~~l~~~~~~laq~~~~p~~~d~id~d~~~~ 473 (555) T protein:vir:17 394 SNLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVVAGLWGVGRGQDKQQLMEFITTLAQTMGPEIAMKYINPTEFIK 473 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhccceeehHHHHHHHHHHHHHHHHHHHHHhhcCchhHhhcCCHHHHHH Confidence 99999999999999999998866 66777788999999999999999999999999999999998899999999999999 Q ss_pred HHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH------HH-hhccc-----CCC Q lcl|Aclame:pro 456 TIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQE-TLLEGA------SD-MTNAL-----AGV 510 (510) Q Consensus 456 ~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~-~~~~~a------~~-~~~~~-----ag~ 510 (510) .+++++|||+..|++|+||+++.+|++++++++++.+++ ++++++ ++ ...++ +|. T Consensus 474 ~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~~~qa~~~~~~~~~~~~~~~~~~~~~~a~~~~~ 541 (555) T protein:vir:17 474 RLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASLINQAGQLAKTPMAEQAMQLIQQQQEGAQDAGA 541 (555) T ss_pred HHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHhccccchhhhhHHHH Confidence 999999998788999999999877655544433332222 222111 11 01111 111 No 18 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=100.00 E-value=1.2e-157 Score=880.83 Aligned_cols=501 Identities=30% Similarity=0.448 Sum_probs=453.7 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) =+++|++||++|| |++|+++|+||++||+|++++++++.++.+..++|||||++|+++|||||||+||| ++|||||. T Consensus 8 ~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltP-~~~WFrl~ 86 (522) T protein:vir:94 8 AAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQAVGARCLNNLAAKLMLALFP-QSPWMRLT 86 (522) T ss_pred hHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhcCC-CCcccccc Confidence 4788999999996 89999999999999999999999988888888999999999999999999999996 67999999 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC----eEEEEEeceEE Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA----TVVAWSLRSYA 154 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~----~~~~~pl~~~~ 154 (510) +.|..+++.........++++||++||++++++|++||||.++|++|+||++|||+++|++++.. +|++|||++|| T Consensus 87 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~~~pl~~y~ 166 (522) T protein:vir:94 87 VSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPEQGTYSPMRMYRLVSYV 166 (522) T ss_pred cchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccCCCceeeEEEEEcceEE Confidence 99988888777788888999999999999999999999999999999999999999999977642 48999999999 Q ss_pred EeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeee-ccccccc Q lcl|Aclame:pro 155 VRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRV-GETGRWP 233 (510) Q Consensus 155 v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~-~~~~~y~ 233 (510) |.+|++|+||+|||||++++++|+++|++++.. ++++|+++|+|||+|+|++++ +++|++++|+.+ +.+|+|+ T Consensus 167 v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~--~~~~p~~~v~v~~~v~~~~~~----~~~~~~~~g~~~~~~~~~~~ 240 (522) T protein:vir:94 167 VQRDAFGNILQIVTIDKVAFSALPEDVKSQLNA--DDYEPDTELEVYTHIYRQDDE----YLRYEEVEGIEVTGTDGSYP 240 (522) T ss_pred EeeCCCcCeEEEeeeeeccHHhcchHHHHHHhc--ccCCccceEEEEEEEEeeCCc----eeEEeeccCceecccCCCCc Confidence 999999999999999999999999999988754 356889999999999998876 357788888877 6788899 Q ss_pred cccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCcc Q lcl|Aclame:pro 234 IHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAE 313 (510) Q Consensus 234 ~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~ 313 (510) +++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|||+|+|+|+++|+++..+++|.+++|.++ T Consensus 241 ~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~~~~~~g~~v~g~~~ 320 (522) T protein:vir:94 241 LTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQPRRLNKAATGEFVAGRVE 320 (522) T ss_pred cccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccchheeccCCceeecCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccCCCccchHHHHHHHHHHHHHHHHHHhhccc-CCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 314 AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCL 392 (510) Q Consensus 314 ~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~-~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~ 392 (510) +++++++++++||+++++.|++++++|+++||++.+ ++++++||||||++|++|++++|||||+||++|||.|||+|+| T Consensus 321 ~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~ 400 (522) T protein:vir:94 321 DINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNSAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLM 400 (522) T ss_pred cceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999854 6999999999999999999999999999999999999999999 Q ss_pred HHHhhcC-CCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCC Q lcl|Aclame:pro 393 SEVDDAL-LQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKS 471 (510) Q Consensus 393 ~il~~~~-l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s 471 (510) ++|++.+ +|++|+++++++|+|+|++++|+++++++.+|++.+++++|. .++++||+|++++.+++++|||+..|+|| T Consensus 401 ~il~r~g~lP~~p~~~v~v~~~s~La~~qr~~~~~~l~~~~~~ia~l~P~-~~~~~id~d~~~~~~a~~~Gv~~~~ivr~ 479 (522) T protein:vir:94 401 NQLQSAGMIPDLPKEAVEPTVSTGLEALGRGQDLEKLTQAVNMMTGLQPL-SQDPDINLPTLKLRLLNALGIDTAGLLLT 479 (522) T ss_pred HHHHhcCCCCCCCcccEEeeEecHHHHHHHHHHHHHHHHHHHHHHhccch-hhhhcCCHHHHHHHHHHHcCCChhhccCC Confidence 9998755 677888889999999999999999999999999999987653 34578999999999999999987889999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 472 ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 472 ~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) +||++++++|+++++++++++.++....++. .++.++= T Consensus 480 ~ee~~~~~~q~~~~~~~~~~~~~~~~~~~a~-~~~~~~~ 517 (522) T protein:vir:94 480 QDEKIQRMAEQSSQQAVVQGASAAGANMGAA-VGQGAGE 517 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhcccch Confidence 9999998888777666655554433222221 1121111 No 19 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=100.00 E-value=6.3e-157 Score=876.94 Aligned_cols=500 Identities=14% Similarity=0.099 Sum_probs=434.3 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccc------cCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYL------MVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~------~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~ 72 (510) +.+++++||+.|+ |++|+++|+||++||+|++ .+.++..+..+..++|||||++|+++||||||++||||++ T Consensus 8 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAs~l~~~ltpp~~ 87 (549) T protein:vir:10 8 ILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFDSTAPLALRNFVAAMDSMITPATQ 87 (549) T ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccccccccchHHHHHHHHHHHHHhhccCCCC Confidence 6788899999996 9999999999999999986 2334556677788999999999999999999999999999 Q ss_pred cccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHH--HhcCCHHHHHHHHHHHHhhCceEEEEeCCCC---eEEE Q lcl|Aclame:pro 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRL--FQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVA 147 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l--~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~---~~~~ 147 (510) |||||.++|+.+.+ ..+++.||++||++++..+ ++||||.++|++|+||++|||+|+|++++.+ +|++ T Consensus 88 ~wF~l~~~~~~~~e-------~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~~~~~~f~~ 160 (549) T protein:vir:10 88 LWHRLKTGNDALNE-------IASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVGKGIVYRN 160 (549) T ss_pred ccccccCCccchhh-------hhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeecCCCeeEEEE Confidence 99999999987654 3578999999999999855 5899999999999999999999999998754 4889 Q ss_pred EEeceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhh----cccccCCCCceEEEEEEEEeec--------CCCeeEE Q lcl|Aclame:pro 148 WSLRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLM----RAGRNLSGSGSVDLYTHVQRRK--------GTAMDYA 215 (510) Q Consensus 148 ~pl~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~----~~~~~~~~~~~v~v~~~v~~~~--------~~~~~~~ 215 (510) |||++|||.+|++|+||+|||||+||++||+++||.+.. +...+++|+++|+|||+|+|++ .++|||. T Consensus 161 ~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~~~~pf~ 240 (549) T protein:vir:10 161 VPMQRLWFAENNSGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADRDPRKLDGRNMQFA 240 (549) T ss_pred EEcCeEEEeeCCCCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEEEeecCCCCCccccccccCceE Confidence 999999999999999999999999999999999997532 3344678999999999999874 4679999 Q ss_pred EEEEeeCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccc Q lcl|Aclame:pro 216 EMYHEIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAV 295 (510) Q Consensus 216 sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~ 295 (510) |||+++++.+++++|+| ++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|||+++++|+++ T Consensus 241 sv~~e~~~~~il~esg~--~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~ 318 (549) T protein:vir:10 241 SYWLDEGRDRIVQNSGF--RTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPPLLANEDGVLD 318 (549) T ss_pred EEEEEecCCEeeccCCc--ccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccc Confidence 99999999999999999 7999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhcCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcc--cCCCCCCCCHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 296 VDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA--NQRDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 296 ~~~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~--~~~~~~~vTAtEi~~r~~E~~~~LG 373 (510) +.++..+..+.+..|..++....+++++++|+.+++.|++++++|+++||.++ .++++++||||||++|++|++++|| T Consensus 319 ~~~l~pgg~~~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV~~r~~E~~~~LG 398 (549) T protein:vir:10 319 GFDLRSGALNWGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLA 398 (549) T ss_pred cceeccCCccccccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHHHHHHHHHHHHhh Confidence 98887666655555544554455566778999999999999999999999985 4589999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcCC-CCCCccc------eeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcCh-HhHh Q lcl|Aclame:pro 374 GTYSLLAENLQSPLAYVCLSEVDDALL-QGLITKQ------HKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI-AQLD 445 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~l-~~~p~~~------~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~-~q~~ 445 (510) |+|+||++|||.|||+|+|++|++++. |++|++. ++++|||+|+.++++++++++.++++.++.++++ |+++ T Consensus 399 pv~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i~yis~La~aq~~~~~~~i~~~~~~~~~laq~~Pe~l 478 (549) T protein:vir:10 399 PTLGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQLGIVSQFDPAAA 478 (549) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhHH Confidence 999999999999999999999998775 5555543 5688888888888888999999999999999886 8899 Q ss_pred hcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH-HhhcccCCC Q lcl|Aclame:pro 446 PRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQA-QAAQETLLEGAS-DMTNALAGV 510 (510) Q Consensus 446 ~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~-~~a~~~~~~~a~-~~~~~~ag~ 510 (510) ++||+|++++++++++|||++ +++|+||+++.++++++|++++ +.+.+.+.++++ +...++++- T Consensus 479 d~id~d~~~~~~a~~~Gvp~~-~irs~eev~~~r~~~~~qqq~~~~~~~a~~a~~~a~~~~~~~ta~ 544 (549) T protein:vir:10 479 KVPNGARIARLLADYGGVPVE-AMSTDEELQAQQAAEAQAAQMQQMLAAAPVAAGAIKDLSDAQTAA 544 (549) T ss_pred hcCCHHHHHHHHHHhcCCCcc-ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCC Confidence 999999999999999999985 9999999998776554444333 333333444443 334444444 No 20 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=100.00 E-value=1.2e-156 Score=875.39 Aligned_cols=498 Identities=16% Similarity=0.145 Sum_probs=436.1 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccc---cCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYL---MVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~---~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF 75 (510) -+++|++||+.|+ |++|+++|+||++||+|++ +.++++.+.++.+++|||||++|+++||||||++||||++||| T Consensus 6 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp~~~WF 85 (555) T protein:vir:98 6 ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSPARPWF 85 (555) T ss_pred cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCCCCccc Confidence 7889999999996 9999999999999999994 5677777888889999999999999999999999999999999 Q ss_pred ccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC---eEEEEEece Q lcl|Aclame:pro 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRS 152 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~---~~~~~pl~~ 152 (510) ||++.|+.+++ ..+++.||++||++++++|++||||.++|++|+||++|||+|+|++++.. +|++|||++ T Consensus 86 ~l~~~d~~l~e-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~ 158 (555) T protein:vir:98 86 RLTTSIPELDE-------SAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGE 158 (555) T ss_pred ccccCcccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecce Confidence 99999877654 46799999999999999999999999999999999999999999987743 477899999 Q ss_pred EEEeeCCCCceeEEEEEEEecHHHHhHHhhHHh-----hcccccCCCCceEEEEEEEEeec--------CCCeeEEEEEE Q lcl|Aclame:pro 153 YAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDL-----MRAGRNLSGSGSVDLYTHVQRRK--------GTAMDYAEMYH 219 (510) Q Consensus 153 ~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~-----~~~~~~~~~~~~v~v~~~v~~~~--------~~~~~~~sv~~ 219 (510) |||.+|+.|+||+|||||+||+++|+++||.+. .+...+++++.+|+|||+|+|++ .++|||.|||+ T Consensus 159 ~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~ 238 (555) T protein:vir:98 159 YAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYF 238 (555) T ss_pred eEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEEEE Confidence 999999999999999999999999999999653 33444445577899999999874 35799999999 Q ss_pred e--eCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchh Q lcl|Aclame:pro 220 E--IDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD 297 (510) Q Consensus 220 e--~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~ 297 (510) + .+|++++++|+| ++|||+++||++.+||+|||||++++|||+|+||.|+++.+++++++++|||+++++|.+++. T Consensus 239 ~~~~d~~~vl~esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~ 316 (555) T protein:vir:98 239 EPGADETRTLRESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQDI 316 (555) T ss_pred EeccCCccccccCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccc Confidence 6 467889999999 799999999999999999999999999999999999999999999999999999999988887 Q ss_pred hhhcCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc----ccCCCCCCCCHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 298 DYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG----ANQRDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 298 ~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~----~~~~~~~~vTAtEi~~r~~E~~~~LG 373 (510) ++..+..+.+.+|..+++...+.++..||+.+.+.|++++++|+++||.| +.++++++||||||++|++|++.+|| T Consensus 317 ~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG 396 (555) T protein:vir:98 317 STVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLG 396 (555) T ss_pred eeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhh Confidence 77666666677887777655567777899999999999999999999987 55689999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCcc-----ceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcCh-HhHhhc Q lcl|Aclame:pro 374 GTYSLLAENLQSPLAYVCLSEVDDALLQGLITK-----QHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI-AQLDPR 447 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~-----~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~-~q~~~~ 447 (510) |||+||++|||.|||+|+|++|++++++|++|+ .++++|+|+|+.++|+.++.++.++++.++.++|+ |+++++ T Consensus 397 ~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~ 476 (555) T protein:vir:98 397 PVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDK 476 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhc Confidence 999999999999999999999998776554443 36788899999999999999999999999999996 899999 Q ss_pred CCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 448 ISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 448 id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ||+|++++++++++|||+ .++||+||+++.++++++++++|++++ ++.+|.+.+.+.++. T Consensus 477 id~d~~~~~~a~~~Gvp~-~~irs~eev~~~r~qr~~~~q~~~~a~--~~~q~~~~~~~~~~~ 536 (555) T protein:vir:98 477 FDADRWADTYADMLGIDP-ELIVPGNQVALIRKQRADQQQAAQQAA--LLNQGADTAAKLGSV 536 (555) T ss_pred CCHHHHHHHHHHHhCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHhccc Confidence 999999999999999998 599999999988777665554444332 333333333333333 No 21 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=100.00 E-value=1.2e-156 Score=875.39 Aligned_cols=498 Identities=16% Similarity=0.145 Sum_probs=436.1 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccc---cCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYL---MVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~---~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF 75 (510) -+++|++||+.|+ |++|+++|+||++||+|++ +.++++.+.++.+++|||||++|+++||||||++||||++||| T Consensus 6 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp~~~WF 85 (555) T protein:vir:10 6 ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSPARPWF 85 (555) T ss_pred cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCCCCccc Confidence 7889999999996 9999999999999999994 5677777888889999999999999999999999999999999 Q ss_pred ccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC---eEEEEEece Q lcl|Aclame:pro 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRS 152 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~---~~~~~pl~~ 152 (510) ||++.|+.+++ ..+++.||++||++++++|++||||.++|++|+||++|||+|+|++++.. +|++|||++ T Consensus 86 ~l~~~d~~l~e-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~ 158 (555) T protein:vir:10 86 RLTTSIPELDE-------SAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGE 158 (555) T ss_pred ccccCcccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecce Confidence 99999877654 46799999999999999999999999999999999999999999987743 477899999 Q ss_pred EEEeeCCCCceeEEEEEEEecHHHHhHHhhHHh-----hcccccCCCCceEEEEEEEEeec--------CCCeeEEEEEE Q lcl|Aclame:pro 153 YAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDL-----MRAGRNLSGSGSVDLYTHVQRRK--------GTAMDYAEMYH 219 (510) Q Consensus 153 ~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~-----~~~~~~~~~~~~v~v~~~v~~~~--------~~~~~~~sv~~ 219 (510) |||.+|+.|+||+|||||+||+++|+++||.+. .+...+++++.+|+|||+|+|++ .++|||.|||+ T Consensus 159 ~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~ 238 (555) T protein:vir:10 159 YAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYF 238 (555) T ss_pred eEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEEEE Confidence 999999999999999999999999999999653 33444445577899999999874 35799999999 Q ss_pred e--eCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchh Q lcl|Aclame:pro 220 E--IDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD 297 (510) Q Consensus 220 e--~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~ 297 (510) + .+|++++++|+| ++|||+++||++.+||+|||||++++|||+|+||.|+++.+++++++++|||+++++|.+++. T Consensus 239 ~~~~d~~~vl~esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~ 316 (555) T protein:vir:10 239 EPGADETRTLRESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQDI 316 (555) T ss_pred EeccCCccccccCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccc Confidence 6 467889999999 799999999999999999999999999999999999999999999999999999999988887 Q ss_pred hhhcCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc----ccCCCCCCCCHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 298 DYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG----ANQRDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 298 ~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~----~~~~~~~~vTAtEi~~r~~E~~~~LG 373 (510) ++..+..+.+.+|..+++...+.++..||+.+.+.|++++++|+++||.| +.++++++||||||++|++|++.+|| T Consensus 317 ~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG 396 (555) T protein:vir:10 317 STVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLG 396 (555) T ss_pred eeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhh Confidence 77666666677887777655567777899999999999999999999987 55689999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCcc-----ceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcCh-HhHhhc Q lcl|Aclame:pro 374 GTYSLLAENLQSPLAYVCLSEVDDALLQGLITK-----QHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI-AQLDPR 447 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~-----~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~-~q~~~~ 447 (510) |||+||++|||.|||+|+|++|++++++|++|+ .++++|+|+|+.++|+.++.++.++++.++.++|+ |+++++ T Consensus 397 ~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~ 476 (555) T protein:vir:10 397 PVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDK 476 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhc Confidence 999999999999999999999998776554443 36788899999999999999999999999999996 899999 Q ss_pred CCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 448 ISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 448 id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ||+|++++++++++|||+ .++||+||+++.++++++++++|++++ ++.+|.+.+.+.++. T Consensus 477 id~d~~~~~~a~~~Gvp~-~~irs~eev~~~r~qr~~~~q~~~~a~--~~~q~~~~~~~~~~~ 536 (555) T protein:vir:10 477 FDADRWADTYADMLGIDP-ELIVPGNQVALIRKQRADQQQAAQQAA--LLNQGADTAAKLGSV 536 (555) T ss_pred CCHHHHHHHHHHHhCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHhccc Confidence 999999999999999998 599999999988777665554444332 333333333333333 No 22 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=100.00 E-value=1.2e-156 Score=875.39 Aligned_cols=498 Identities=16% Similarity=0.145 Sum_probs=436.1 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccc---cCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYL---MVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~---~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF 75 (510) -+++|++||+.|+ |++|+++|+||++||+|++ +.++++.+.++.+++|||||++|+++||||||++||||++||| T Consensus 6 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp~~~WF 85 (555) T protein:vir:10 6 ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSPARPWF 85 (555) T ss_pred cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCCCCccc Confidence 7889999999996 9999999999999999994 5677777888889999999999999999999999999999999 Q ss_pred ccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC---eEEEEEece Q lcl|Aclame:pro 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRS 152 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~---~~~~~pl~~ 152 (510) ||++.|+.+++ ..+++.||++||++++++|++||||.++|++|+||++|||+|+|++++.. +|++|||++ T Consensus 86 ~l~~~d~~l~e-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~ 158 (555) T protein:vir:10 86 RLTTSIPELDE-------SAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGE 158 (555) T ss_pred ccccCcccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecce Confidence 99999877654 46799999999999999999999999999999999999999999987743 477899999 Q ss_pred EEEeeCCCCceeEEEEEEEecHHHHhHHhhHHh-----hcccccCCCCceEEEEEEEEeec--------CCCeeEEEEEE Q lcl|Aclame:pro 153 YAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDL-----MRAGRNLSGSGSVDLYTHVQRRK--------GTAMDYAEMYH 219 (510) Q Consensus 153 ~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~-----~~~~~~~~~~~~v~v~~~v~~~~--------~~~~~~~sv~~ 219 (510) |||.+|+.|+||+|||||+||+++|+++||.+. .+...+++++.+|+|||+|+|++ .++|||.|||+ T Consensus 159 ~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~ 238 (555) T protein:vir:10 159 YAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYF 238 (555) T ss_pred eEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEEEE Confidence 999999999999999999999999999999653 33444445577899999999874 35799999999 Q ss_pred e--eCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchh Q lcl|Aclame:pro 220 E--IDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD 297 (510) Q Consensus 220 e--~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~ 297 (510) + .+|++++++|+| ++|||+++||++.+||+|||||++++|||+|+||.|+++.+++++++++|||+++++|.+++. T Consensus 239 ~~~~d~~~vl~esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~~ 316 (555) T protein:vir:10 239 EPGADETRTLRESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKNQDI 316 (555) T ss_pred EeccCCccccccCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccccc Confidence 6 467889999999 799999999999999999999999999999999999999999999999999999999988887 Q ss_pred hhhcCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc----ccCCCCCCCCHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 298 DYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG----ANQRDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 298 ~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~----~~~~~~~~vTAtEi~~r~~E~~~~LG 373 (510) ++..+..+.+.+|..+++...+.++..||+.+.+.|++++++|+++||.| +.++++++||||||++|++|++.+|| T Consensus 317 ~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG 396 (555) T protein:vir:10 317 STVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEKLLMLG 396 (555) T ss_pred eeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhh Confidence 77666666677887777655567777899999999999999999999987 55689999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCcc-----ceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcCh-HhHhhc Q lcl|Aclame:pro 374 GTYSLLAENLQSPLAYVCLSEVDDALLQGLITK-----QHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI-AQLDPR 447 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~-----~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~-~q~~~~ 447 (510) |||+||++|||.|||+|+|++|++++++|++|+ .++++|+|+|+.++|+.++.++.++++.++.++|+ |+++++ T Consensus 397 ~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~ 476 (555) T protein:vir:10 397 PVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDK 476 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhc Confidence 999999999999999999999998776554443 36788899999999999999999999999999996 899999 Q ss_pred CCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 448 ISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 448 id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ||+|++++++++++|||+ .++||+||+++.++++++++++|++++ ++.+|.+.+.+.++. T Consensus 477 id~d~~~~~~a~~~Gvp~-~~irs~eev~~~r~qr~~~~q~~~~a~--~~~q~~~~~~~~~~~ 536 (555) T protein:vir:10 477 FDADRWADTYADMLGIDP-ELIVPGNQVALIRKQRADQQQAAQQAA--LLNQGADTAAKLGSV 536 (555) T ss_pred CCHHHHHHHHHHHhCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHhccc Confidence 999999999999999998 599999999988777665554444332 333333333333333 No 23 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=100.00 E-value=6.3e-153 Score=854.99 Aligned_cols=495 Identities=14% Similarity=0.134 Sum_probs=425.8 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccC---CCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMV---DPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~---~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF 75 (510) -+++|++||+.|+ |++|+++|+||++||+|++++ ++.+.++.+..++|||||++|+++||||||++||||++||| T Consensus 5 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp~~~WF 84 (556) T protein:vir:73 5 EKERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMSGITSPARPWF 84 (556) T ss_pred hHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcCccccchHHHHHHHHHHHHHHhhcCCCCccc Confidence 6777899999996 999999999999999999854 33445566778999999999999999999999999999999 Q ss_pred ccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC---eEEEEEece Q lcl|Aclame:pro 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRS 152 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~---~~~~~pl~~ 152 (510) +|+++|+.+.+ ..++++||++||++++++|++||||.++|++|+||++|||+++|++++.. +|++|||++ T Consensus 85 ~l~~~d~~~~~-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~r~~~~~l~~ 157 (556) T protein:vir:73 85 KLATPDPDMMD-------YGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDDQDVIRTMPFPIGS 157 (556) T ss_pred ccccCcccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecCCceEEEEEeecce Confidence 99999876544 45799999999999999999999999999999999999999999988743 477899999 Q ss_pred EEEeeCCCCceeEEEEEEEecHHHHhHHhhHH-----hhcccccCCCCceEEEEEEEEeec--------CCCeeEEEEEE Q lcl|Aclame:pro 153 YAVRRDATGRWMDIVLKQRYKSKDLDDVYKQD-----LMRAGRNLSGSGSVDLYTHVQRRK--------GTAMDYAEMYH 219 (510) Q Consensus 153 ~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~-----~~~~~~~~~~~~~v~v~~~v~~~~--------~~~~~~~sv~~ 219 (510) |||.+|+.|+||+|||||+||+++|+++||.+ +.+...+++++.+|+|+|+|+|++ .++|||.|+|+ T Consensus 158 ~~~~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~~~~p~~s~~~ 237 (556) T protein:vir:73 158 YYLANSPRGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHCITPNVNRDSGKMDSKNKPYRSVYF 237 (556) T ss_pred eEEeeCCCCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEEEeccccccccccCcccceEEEEEE Confidence 99999999999999999999999999999865 334444555577899999999863 35799999999 Q ss_pred ee--CCeeeccccccccccCceEEEeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccch Q lcl|Aclame:pro 220 EI--DGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRG-HVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV 296 (510) Q Consensus 220 e~--~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrg-p~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~ 296 (510) +. ++++++++|+| ++|||+++||++.+||+|||| |++++|||+|+||.++++++++++++++|||++++++.+.+ T Consensus 238 ~~~~~~~~vl~esg~--~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~ 315 (556) T protein:vir:73 238 ESGGDSDKLLRESGF--DEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPMVAPTSLKNQR 315 (556) T ss_pred EecCCCceecccCCc--ccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCceeccccccccc Confidence 75 56788999998 789999999999999999999 89999999999999999999999999999999999986654 Q ss_pred hhhhcCCCc---ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcc----cCCCCCCCCHHHHHHHHHHHH Q lcl|Aclame:pro 297 DDYQDAEMG---DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA----NQRDAERVTAEEVRITAEEAE 369 (510) Q Consensus 297 ~~~~~~~~G---~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~----~~~~~~~vTAtEi~~r~~E~~ 369 (510) .++ .++| ...+|+.+++.|++.++ +|++.+.+.|++++++|+++||.|+ .++++++||||||++|++|++ T Consensus 316 ~~~--~pgg~~~~~~~~~~~~i~p~~~~~-~d~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r~TAtEv~~r~~E~~ 392 (556) T protein:vir:73 316 VSL--LPGDVTYLDVISGQDGFKPAYLVN-PNTADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKL 392 (556) T ss_pred eee--ccCccccccCCCCccceeeecccc-ccHHHHHHHHHHHHHHHHHHhhcchhhhhccCCCCCccHHHHHHHHHHHH Confidence 333 3433 23467778888887665 6799999999999999999999874 568999999999999999999 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCcc-----ceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcCh-Hh Q lcl|Aclame:pro 370 NTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITK-----QHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI-AQ 443 (510) Q Consensus 370 ~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~-----~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~-~q 443 (510) .+|||+|+||++|||.|||+|+|++|++.+.+|++|+ +++++|+|+|+.++++.+++++.++++.++.++++ |+ T Consensus 393 ~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aqk~~~~~~i~~~~~~~~~laq~~Pe 472 (556) T protein:vir:73 393 LMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRIEYISVMAQAQKSIGLTSLSQTVGFIGQLAQFKPE 472 (556) T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChh Confidence 9999999999999999999999999998775544443 47889999999999999999999999999999996 89 Q ss_pred HhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 444 LDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 444 ~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ++++||+|++++.+++++|||+ .+++|+||+++.+|++++|+++|++++ ++++|++.++.++.+ T Consensus 473 ~~d~id~d~~~~~~a~~~Gvp~-~~irs~eev~~~rq~r~~~qq~~~~~~--~~~~a~~~~~~~~~~ 536 (556) T protein:vir:73 473 ALDKLDVDQAIDAFSEMSGVSP-TVIVPQEQVQGIREERAKQAQAAQAMA--MGQAAAQGAKTLSET 536 (556) T ss_pred hHhcCCHHHHHHHHHHHcCCCh-hhcCCHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHhhhc Confidence 9999999999999999999998 599999999987776555444333222 233333333333333 No 24 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=100.00 E-value=1.2e-152 Score=853.50 Aligned_cols=496 Identities=16% Similarity=0.129 Sum_probs=421.2 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCC------CCccccccccccchHHHHHHHHHHHHHHhhcCccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPM------SGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~------~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~ 72 (510) =+++|++||+.|+ |++|+++|+||++||+|++++..+ .....+..++|||||++|+++|||||||+||||++ T Consensus 2 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~ltPp~~ 81 (547) T protein:vir:10 2 ENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLTSPAT 81 (547) T ss_pred CHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhcCCCC Confidence 4677999999996 899999999999999999854222 22235677899999999999999999999999999 Q ss_pred cccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC---C--eEEE Q lcl|Aclame:pro 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE---A--TVVA 147 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~---~--~~~~ 147 (510) |||||+++|..+.+ ..++++||++||+.|+++|++||||.++|++|+||++|||+++|++++. + +|++ T Consensus 82 ~WF~l~~~d~~~~~-------~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~ 154 (547) T protein:vir:10 82 KWFELAFRDKELNS-------DDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQS 154 (547) T ss_pred cccccccCCccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEEE Confidence 99999999876643 4579999999999999999999999999999999999999999997653 2 4889 Q ss_pred EEeceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhh----cccccCCCCc---eEEEEEEEEeecC----------- Q lcl|Aclame:pro 148 WSLRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLM----RAGRNLSGSG---SVDLYTHVQRRKG----------- 209 (510) Q Consensus 148 ~pl~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~----~~~~~~~~~~---~v~v~~~v~~~~~----------- 209 (510) |||++|||.+|++|+|++|||||+||++||+++||.+.. ++..++++++ ++++||+|+|+.+ T Consensus 155 ~pl~~~~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~~~~ 234 (547) T protein:vir:10 155 SPIQDSYFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAGTVL 234 (547) T ss_pred eecceEEEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCcccccee Confidence 999999999999999999999999999999999996532 2222344544 8999999999742 Q ss_pred --CCeeEEEEEEeeCC-eeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCce Q lcl|Aclame:pro 210 --TAMDYAEMYHEIDG-VRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLN 286 (510) Q Consensus 210 --~~~~~~sv~~e~~~-~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~ 286 (510) ++|||.|+|++++| ++++++|+| ++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++||| T Consensus 235 ~~~~~p~~s~~~e~~~~~~~l~esg~--~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~ 312 (547) T protein:vir:10 235 APTERPFGKKWILKEGAVQLGEEGGY--YEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAI 312 (547) T ss_pred eccccceeEEEEEecCceeeeecCCc--ccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 47999999999886 788999998 6899999999999999999999999999999999999999999999999999 Q ss_pred eeCCCCccchhhhhcCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-cCCCCCCCCHHHHHHHH Q lcl|Aclame:pro 287 LVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-NQRDAERVTAEEVRITA 365 (510) Q Consensus 287 lv~~~g~~~~~~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~vTAtEi~~r~ 365 (510) +|+|+|++++.+ ..++|.++.|+.++++|++.+ +||+.+++.|++++++|+++||.++ .++++++||||||++|+ T Consensus 313 ~v~~~g~~~~~~--~~pgg~~~~~~~~~v~pl~~~--~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~TAtEV~~r~ 388 (547) T protein:vir:10 313 MVTERGLISDID--LGASGLTVVRDMESMKPFESR--ARFDVSSIQLTDLRSAVRRIYYVDQLQMKDSPAMTATEVQVRY 388 (547) T ss_pred ecccccccccce--ecCCeeeecCCcccceeeecc--cchHHHHHHHHHHHHHHHHHhhhhhhhcCCCccccHHHHHHHH Confidence 999999999754 557788888999999987654 7999999999999999999999986 57899999999999999 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCC-CCCCccc-------eeeEEeecHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 366 EEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALL-QGLITKQ-------HKPAIETGLPALSRSAAVQSMLNASQVIAG 437 (510) Q Consensus 366 ~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l-~~~p~~~-------~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~ 437 (510) +|++++|||+|+||++|||.|||+|+|++|++.++ |++|++. ++++++++|+.++++.+++++.+++++++. T Consensus 389 ~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v~~ 468 (547) T protein:vir:10 389 ELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGSTAQ 468 (547) T ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999988765 4455443 345666666666666777888888888898 Q ss_pred hcCh-HhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH-HHhhcccCCC Q lcl|Aclame:pro 438 LAPI-AQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQ-AAQETLLEGA-SDMTNALAGV 510 (510) Q Consensus 438 ~~~~-~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~-~a~~~~~~~a-~~~~~~~ag~ 510 (510) ++++ |+++++||+|++++.+++++|||++ +++|+||++++++++++++++++ ++.+++..++ .+++...|.. T Consensus 469 laq~~P~vld~id~d~~~~~~a~~~Gvp~~-~irs~eev~~~r~qr~~~~q~~~qaa~~~~~g~~m~~~~~~~a~~ 543 (547) T protein:vir:10 469 LAEINPEVLDIPDWDEMVRMLGSLLGAPQT-LMRPKAKVTSIRKNRSQTQQKAEQAAIAEAEGNAMEAQGKGQAAL 543 (547) T ss_pred hhccChhhhhcCCHHHHHHHHHHHhCCChh-ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccch Confidence 8886 8899999999999999999999985 99999999987765544333333 2222222222 2333333333 No 25 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=100.00 E-value=3.4e-152 Score=850.96 Aligned_cols=497 Identities=13% Similarity=0.128 Sum_probs=423.4 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccC---CCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMV---DPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~---~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF 75 (510) .+++|++||+.|+ |++|+++|+||++||+|++++ ++.+.++.+..++|||||++|+++||||||++||||++||| T Consensus 5 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp~~~WF 84 (559) T protein:vir:95 5 TKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPARPWF 84 (559) T ss_pred hHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCccc Confidence 7888999999996 999999999999999999865 33355666778999999999999999999999999999999 Q ss_pred ccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC---eEEEEEece Q lcl|Aclame:pro 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRS 152 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~---~~~~~pl~~ 152 (510) ||+++|+.+.+ ..++++||++||+.++++|++||||.++|++|+||++|||+|+|++++.. +|++|||++ T Consensus 85 ~l~~~d~~~~e-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~~~~r~~~~~l~~ 157 (559) T protein:vir:95 85 RLATPDPEMMD-------YGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDDDEDIIRTMPFPIGS 157 (559) T ss_pred ccccCCccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecCCCceeEEEEeecCe Confidence 99999876543 45799999999999999999999999999999999999999999987643 478899999 Q ss_pred EEEeeCCCCceeEEEEEEEecHHHHhHHhhHHh-----hcccccCCCCceEEEEEEEEeec--------CCCeeEEEEEE Q lcl|Aclame:pro 153 YAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDL-----MRAGRNLSGSGSVDLYTHVQRRK--------GTAMDYAEMYH 219 (510) Q Consensus 153 ~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~-----~~~~~~~~~~~~v~v~~~v~~~~--------~~~~~~~sv~~ 219 (510) |||.+|++|+||+|||||+||+++|+++||.+. .+...++.++++|+|||+|+|+. .++|||.|+|+ T Consensus 158 ~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~pf~s~~~ 237 (559) T protein:vir:95 158 YYLANSPRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYY 237 (559) T ss_pred EEEeeCCCCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEEEeccccccccccccccceEEEEEE Confidence 999999999999999999999999999999653 34444445566899999999874 35799999999 Q ss_pred eeC--CeeeccccccccccCceEEEeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccch Q lcl|Aclame:pro 220 EID--GVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRG-HVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV 296 (510) Q Consensus 220 e~~--~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrg-p~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~ 296 (510) +.+ +++++++|+| ++|||+++||++.+||+|||| |++++|||+|+||.|+++.+++++++++|||++++++.+++ T Consensus 238 e~~~~~~~~l~esg~--~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~~ 315 (559) T protein:vir:95 238 EVGGDNDKLLRESGF--DEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQR 315 (559) T ss_pred EecCCCceeeecCCc--ccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCceeccccccccc Confidence 974 4678999998 789999999999999999999 89999999999999999999999999999999999998877 Q ss_pred hhhhcCCCcceecCC-ccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc----ccCCCCCCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 297 DDYQDAEMGDYVPGG-AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG----ANQRDAERVTAEEVRITAEEAENT 371 (510) Q Consensus 297 ~~~~~~~~G~~~~g~-~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~----~~~~~~~~vTAtEi~~r~~E~~~~ 371 (510) .++..+..+.+..+. .+.+.|....+ .+++.+.++|++++++|+++||.| +.++++++||||||++|++|++.+ T Consensus 316 ~~l~pgg~~~~~~~~~~~~i~p~~~~~-~~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV~~r~~E~~~~ 394 (559) T protein:vir:95 316 ASLLPGDITYIDQITGQDGFRPAYLVN-PSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLM 394 (559) T ss_pred eeeeccceeeeCCCCCcccceeecccc-cchHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHHHHHHHHHHHHH Confidence 665432222222222 24466655443 578888999999999999999987 467899999999999999999999 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhhcCCCC-CCcc----ceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcCh-HhHh Q lcl|Aclame:pro 372 LGGTYSLLAENLQSPLAYVCLSEVDDALLQG-LITK----QHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI-AQLD 445 (510) Q Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~-~p~~----~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~-~q~~ 445 (510) |||+|+||++|||.|||+|+|++|++++++| +|++ +++++|+|+|+.++|+.+++++.++++.++.++++ |+++ T Consensus 395 LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~~~~laq~~Pevl 474 (559) T protein:vir:95 395 LGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQVKPEAL 474 (559) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhh Confidence 9999999999999999999999999877554 4443 37889999999999999999999999999999996 8999 Q ss_pred hcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 446 PRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 446 ~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ++||+|++++.+++++|||++ ++||+||+++.++++++++++||+++ ++.+|++.++..+.. T Consensus 475 d~id~d~~~~~~a~~~Gvp~~-~irs~~ev~~~rqqr~~~qq~~q~~~--~~~~aa~~~~~~~~~ 536 (559) T protein:vir:95 475 DKLNVDQAIDAFADMSGVSPT-VIVPQEQVEQARQQRAQQQQQQQMMA--MGMAAAQGVKTLSEA 536 (559) T ss_pred hcCCHHHHHHHHHHHhCCchh-hcCCHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHhhhccccc Confidence 999999999999999999984 99999999987766655544444332 333344433332222 No 26 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=100.00 E-value=1.3e-87 Score=496.87 Aligned_cols=494 Identities=12% Similarity=0.079 Sum_probs=375.2 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcc----------cccCCCCCCccccccccccchHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLP----------YLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLF 68 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P----------~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~lt 68 (510) +=+.+.+||+.+| |++|+.+|+||++|..+ ..+...++.....+.+++|++..+++++|+++||+++| T Consensus 25 ~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ki~~~~~~~~~~~l~s~Lm~~~~ 104 (641) T protein:vir:94 25 IGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRHRINTGHTFEVVETLVAYFKGATF 104 (641) T ss_pred HHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcccccccchhHHHHHHHHhhHHhhhhc Confidence 6677999999996 99999999999977655 33333333344445689999999999999999999999 Q ss_pred CccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeC-------- Q lcl|Aclame:pro 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS-------- 140 (510) Q Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~-------- 140 (510) | +++||++.+.+....+ +.++ ++..+...+.+++|+...++.+.+.+.+||+++-++- T Consensus 105 p-~~~wf~~~p~~~ed~~----------~A~~---~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv~~~w~~~~~~~~ 170 (641) T protein:vir:94 105 P-SDDWFDLKGMVPELAD----------AARV---VKQLTKTKLEAASIRDIFETYVRNLVLYGVSTYRLGWDTSMERQF 170 (641) T ss_pred C-CCceEEEecCCCChHH----------HHHH---HHHHHHHHHhhcchHHHHHHHHHHHhhcCceEEEeehhhHHHHhh Confidence 7 8999999887654322 1112 2345556788999999999999999999998764430 Q ss_pred ---------------------CCCeEEEEEeceEEEeeCCCCcee----EEEEEEEecHHHHhHH--hhHHhhc-----c Q lcl|Aclame:pro 141 ---------------------DEATVVAWSLRSYAVRRDATGRWM----DIVLKQRYKSKDLDDV--YKQDLMR-----A 188 (510) Q Consensus 141 ---------------------~~~~~~~~pl~~~~v~~d~~G~v~----~i~r~~~~t~~~l~~~--~~~~~~~-----~ 188 (510) ....+++.||..+-|..|+.++++ ++||++++|+.+|..+ |+.+..+ . T Consensus 171 ~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~dps~~~~~~~f~~~r~t~~t~~~l~~eg~~~~d~v~~~~~~~ 250 (641) T protein:vir:94 171 KRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWLDTSGGKNTGTFVRLRHTREELHELVTSGYYDLDLTQVEQYVD 250 (641) T ss_pred hhhcccchhhcccccccceecccceeeEEecchhheeecCCCCcccccceehhhhHHHHHHHHhcCCCChhhcchhhccc Confidence 011256677766666666666665 4677888888888766 5443211 1 Q ss_pred cccCCCCce----------EEEEEEEEeecCCCeeEEEEEEeeCCeeeccccccc-cccCceEEEeeeecCCCccccchH Q lcl|Aclame:pro 189 GRNLSGSGS----------VDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWP-IHLCPYIVPTWNLAPGEHYGRGHV 257 (510) Q Consensus 189 ~~~~~~~~~----------v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~-~~~~P~~~~Rw~~~~ge~YGrgp~ 257 (510) +..+.++.. .++|++....++++++|+|+|++++|+++++++++. ++++||+++||.+.++++||+||+ T Consensus 251 ~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~Pf~~~r~~~~~~~~YG~gp~ 330 (641) T protein:vir:94 251 YKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVTTTLLPDRDSVYGMSVL 330 (641) T ss_pred ccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeCCEEeecccccccCcCCeEEecceecCCcccCCChH Confidence 111122221 234443334566789999999999999998887775 478999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCccccccccCCCccchHHHHHHHHHHH Q lcl|Aclame:pro 258 EDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVV 337 (510) Q Consensus 258 ~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~ 337 (510) +++|||+|+||.+++..++++.++++|+|+++++|+++|.++...++|.+..+..+++.|++.+. .+++..++.++.++ T Consensus 331 ~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l~~~PG~ii~~~~~~~v~pl~~~~-~~~~~~~~~~~~~~ 409 (641) T protein:vir:94 331 HPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDVKAKPGAVFKVAQHGSLQPIDMGR-QDFVVTYQEAQVQE 409 (641) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCeeeeccccccccceeeccCCcceeeCCCCcceeecCCc-cccchhHHHHHHHH Confidence 99999999999999999999999999999999999999999998888888888888999886654 58999999999999 Q ss_pred HHHHHHHhhcc----cC-CCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhh--------------- Q lcl|Aclame:pro 338 VRLNQAFMYGA----NQ-RDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD--------------- 397 (510) Q Consensus 338 ~~I~~af~~~~----~~-~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~--------------- 397 (510) .+|+++|+.+. .+ +++++||||||+++.+|+...||+++++|+.||+.||+.++++++.+ T Consensus 410 ~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~ 489 (641) T protein:vir:94 410 SSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEE 489 (641) T ss_pred HHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhh Confidence 99999997552 23 67788999999999999999999999999999999999999998866 Q ss_pred --cCCCCCCccceeeEE-eecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcC--CCHhhccCCH Q lcl|Aclame:pro 398 --ALLQGLITKQHKPAI-ETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFS--VDTSQFYKSA 472 (510) Q Consensus 398 --~~l~~~p~~~~~~~~-vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~G--vp~~~i~~s~ 472 (510) ++++|+|+++++.++ +++++..+++.+++++.++++.++.+++.|++++++|+|.+++.+++.+| +|.. ++|++ T Consensus 490 ~~~~~~~~~p~~L~~~~~iv~l~~~q~~~~~~~i~~l~~~~~~~a~~P~v~d~~d~~~~~~~~~~~~g~~~p~~-~ir~~ 568 (641) T protein:vir:94 490 QMDGFFEVSPEYLHYPYKFLALGANYVVERERMVTDLLQLLDISGRVPQIGQSLDYALILEDLLRQMRFTDPMR-YIKKA 568 (641) T ss_pred hcccCCCCCccceeeeeeEeecchhHHHHHHHHHHHHHHHHHHhhcChhhhhcCCHHHHHHHHHHHhCCCCchh-hccCc Confidence 367788888887654 36788888888888888888888888778999999999999999998755 6764 77777 Q ss_pred HHHHHHH---HHHHHHHHHHHHHH----------HHHHHHHHHhhcccCCC Q lcl|Aclame:pro 473 DELQAEA---EEQRRQAAQAQAAQ----------ETLLEGASDMTNALAGV 510 (510) Q Consensus 473 ee~~~~~---~~~~qqa~~~~~a~----------~~~~~~a~~~~~~~ag~ 510 (510) |..++.+ ++++|+++.++++. +.+++..++.--.++|+ T Consensus 569 ~~~~~~~~~~~~~~q~~~~~~a~~~~~~~~~~a~~~~~~~~~~~~~~~~~~ 619 (641) T protein:vir:94 569 EAPPAAPPIAPAEPGALPPEMMNSVGGGLNDQAIAGMTPEDVSDLASRIGI 619 (641) T ss_pred cCchhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHHHHHHhhcC Confidence 6442222 11122221111111 01111222233355555 No 27 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=100.00 E-value=5.4e-68 Score=389.37 Aligned_cols=493 Identities=13% Similarity=0.119 Sum_probs=352.3 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccc----------cCCCCCCccccccccccchHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYL----------MVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLF 68 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~----------~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~lt 68 (510) +=..+.++|++.+ |+.|+.+|++++++..+.. +.+..........+++.++-..+++++.+.|+..+| T Consensus 21 ~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~rs~~~~~~v~~~ve~~~~~l~~~~~ 100 (651) T protein:vir:80 21 VSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRHKITTGKAFEAIETIHAYLMSATF 100 (651) T ss_pred HHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCCCCCCccccChhHHHHHHHHHHHHHHhhc Confidence 4445778999884 8999999999998877741 111112222234568999999999999999999999 Q ss_pred CccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCC----- Q lcl|Aclame:pro 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSD----- 141 (510) Q Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~--~~~----- 141 (510) | +.+||++.+.++.. ..+++-+-++..+...++.++|+...+.++++.+++||+++-+ +.. T Consensus 101 ~-~~~~~~~~p~~~~d-----------~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~kv~we~~~~~~~ 168 (651) T protein:vir:80 101 P-NKNWFDVVPAKPGQ-----------DNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLALPWRVETAEVK 168 (651) T ss_pred C-CCceeEeccCCchh-----------HHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEEEeecceeeeee Confidence 7 58899998864332 1233445567777778899999999999999999999998632 210 Q ss_pred ---------------------------CCeEEEEEeceEEEeeCCCCceeEE-EEEEEecHHHHhHHhh----HH----- Q lcl|Aclame:pro 142 ---------------------------EATVVAWSLRSYAVRRDATGRWMDI-VLKQRYKSKDLDDVYK----QD----- 184 (510) Q Consensus 142 ---------------------------~~~~~~~pl~~~~v~~d~~G~v~~i-~r~~~~t~~~l~~~~~----~~----- 184 (510) ..+++.+|+.+|++..++.+.-|+- +++..+|..++.+... .+ T Consensus 169 ~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~d~~~v~~~~~t~~~l~~l~~~g~~~~~~~~~ 248 (651) T protein:vir:80 169 KKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFIRKLTKTKADILNLLSEGYYYGVDPLD 248 (651) T ss_pred hheeccccccccccceeeeccceeeeceeEEEEecHHHeeecCCCcCccccceeeeeeeeHHHHHHHHhcccccchhhHH Confidence 0146778999999999987755542 2344567666543221 00 Q ss_pred -hhcc--------------c-----ccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeecc--ccccccccCceEEE Q lcl|Aclame:pro 185 -LMRA--------------G-----RNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGE--TGRWPIHLCPYIVP 242 (510) Q Consensus 185 -~~~~--------------~-----~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~--~~~y~~~~~P~~~~ 242 (510) +.+. . ...++..+|+||+|..+.+..++.++++|+..+|+.+.. +..|+ ++|||+++ T Consensus 249 ~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~v~~~g~~il~~~~~~~~-~~~Pf~~~ 327 (651) T protein:vir:80 249 VVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIMGNEVLRFEQNPYW-CGRPFVIG 327 (651) T ss_pred HHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeeccCCceEEEEEEEcCcEEecccccCCC-CCCCeeee Confidence 0000 0 012456789999998787888889999999999988864 34443 68999999 Q ss_pred eeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCccccccccCCC Q lcl|Aclame:pro 243 TWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGD 322 (510) Q Consensus 243 Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~~v~~~~~~~ 322 (510) ||.+.+|+.||+||++.++|+++.||.+++++++++.++++|+|+|++||+++|+++...++|.++.|.++++.+++.+. T Consensus 328 ~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l~~~pg~vi~~~~~~~~~~l~~~~ 407 (651) T protein:vir:80 328 TYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDVYTEPGKVFLVSDHGDLQPLANQS 407 (651) T ss_pred cceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHhhcCCCceEEecCCCCceeeccCc Confidence 99999999999999999999999999999999999999999999999999999999988888888899999999887664 Q ss_pred ccchHHHHHHHHHHHHHHHHHHhhcc-c----CCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 323 YNKMAAIQQSLQAVVVRLNQAFMYGA-N----QRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD 397 (510) Q Consensus 323 ~~~~~~~~~~i~~~~~~I~~af~~~~-~----~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~ 397 (510) .+++.+++.|+.++++|++.|+.+. . .+..+++|||||+.+++|+...||++|++++.||+.||+.|++.++.+ T Consensus 408 -~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~ 486 (651) T protein:vir:80 408 -SNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQ 486 (651) T ss_pred -ccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5899999999999999999997642 2 245578999999999999999999999999999999999999999987 Q ss_pred cCCCC-----------------CCccceeeEE-eecHHH---HHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHH Q lcl|Aclame:pro 398 ALLQG-----------------LITKQHKPAI-ETGLPA---LSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDT 456 (510) Q Consensus 398 ~~l~~-----------------~p~~~~~~~~-vs~l~~---l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~ 456 (510) .+-.| +.+++++..+ +.++++ +.|.+.++++..++| .+++.|++.+.+|...++.. T Consensus 487 ~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~~~~q---~~~~~p~~~~~~~~~~~~~~ 563 (651) T protein:vir:80 487 FTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRLTFIQ---AVAQVPEMGQLVDYKRILVD 563 (651) T ss_pred hcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHHHHHH---hhccCCccchhhhHHHHHHH Confidence 54211 1123444332 223344 445555555555555 44445667777899999999 Q ss_pred HHHHcCCCH-hhccCCHHHHHH-HHHH-----HHHHHHHHHHHHHHHH-------HHHHHhhcccCCC Q lcl|Aclame:pro 457 IWAAFSVDT-SQFYKSADELQA-EAEE-----QRRQAAQAQAAQETLL-------EGASDMTNALAGV 510 (510) Q Consensus 457 ~a~~~Gvp~-~~i~~s~ee~~~-~~~~-----~~qqa~~~~~a~~~~~-------~~a~~~~~~~ag~ 510 (510) +++.+|++. ..++..+++.+. .+++ .++...+++.+.++.. ..+.++.+.++.. T Consensus 564 l~~~~g~~~~~~~l~~~~q~~~~~~~~~~~~q~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 631 (651) T protein:vir:80 564 LLQHWGFEEPEAYLKQQDQQAPANPQEALLSQAKDVGGQAMSNMLQNQLQADGGTQMMSEMYGTPNAD 631 (651) T ss_pred HHHHcCCCCcHHhcCCCccchhhhhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 999999942 335555443321 1111 1110000000000000 0011222233322 No 28 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=100.00 E-value=2.8e-37 Score=220.98 Aligned_cols=476 Identities=11% Similarity=0.082 Sum_probs=317.8 Q ss_pred ChhHHHHHHHHH--hccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKL--RDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~l--kr~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) +..-.+++|+.. +|++++.+|.|+.+|..-+.-...++...+...++|-+.-...+.++.+.||+.+|| ++.||++. T Consensus 17 ~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~~~~~~~~~~r~~~~~~k~~~~~~~i~~~l~~~~Fp-~~~w~~~v 95 (584) T protein:vir:95 17 SAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTTTTSNQGLPWKNSTTLPKLCQIRDNLHSNYFSSLFP-NDDWLRWV 95 (584) T ss_pred hHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhhhhhhcccccccccchhHHHHHHHHHHHHHHHhhcC-ccceeeee Confidence 667788999988 499999999999999888765555555555666888888899999999999999999 68999998 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--CC--------------C Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SD--------------E 142 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~--~~--------------~ 142 (510) ...+.... .++ =+.+++.+...|+.+||+.++...+++++++|+|.+=++ .. + T Consensus 96 ~~~~~~~~---------~~~--~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~~~~e~~e~~~v~~~~~ 164 (584) T protein:vir:95 96 GYGKGDST---------KTK--AKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEAKYKEMTDGTLVPDYIG 164 (584) T ss_pred cCCCchhh---------HHH--HHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEeecceeeecccccccccc Confidence 87654321 111 123667777888999999999999999999999875332 21 1 Q ss_pred CeEEEEEeceEEEeeCCCCceeEEE--EEEEecHHHHhHHhhH--------Hhh-----c-----ccccC--------C- Q lcl|Aclame:pro 143 ATVVAWSLRSYAVRRDATGRWMDIV--LKQRYKSKDLDDVYKQ--------DLM-----R-----AGRNL--------S- 193 (510) Q Consensus 143 ~~~~~~pl~~~~v~~d~~G~v~~i~--r~~~~t~~~l~~~~~~--------~~~-----~-----~~~~~--------~- 193 (510) .++.-++..++++..++ +.+++.. +|..+|..+|.+...+ +.. + .+... + T Consensus 165 prieriSP~d~~~Dpsa-~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (584) T protein:vir:95 165 PRLVRISPLDIVFNPLA-TSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHLGGYSVEDFDKAAGFDV 243 (584) T ss_pred ceEEeeChhheeecCCC-CCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCCCCCccccccccccccc Confidence 23444556788888888 6676532 3556899998666421 110 0 00000 0 Q ss_pred ----------CCceEEEEEE---EEeec-CCCeeEEEEEEeeCCeeecc--ccccccccCceEEEeeeecCCCccccchH Q lcl|Aclame:pro 194 ----------GSGSVDLYTH---VQRRK-GTAMDYAEMYHEIDGVRVGE--TGRWPIHLCPYIVPTWNLAPGEHYGRGHV 257 (510) Q Consensus 194 ----------~~~~v~v~~~---v~~~~-~~~~~~~sv~~e~~~~~~~~--~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~ 257 (510) ....|++++- ++-+. +.+..+.-+.. .+|..+.+ ..-|+++.+||++..|.+...+.||.|+. T Consensus 244 d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v-~~g~~iIR~~~np~~~~~~PF~~~~~~p~~~s~yG~gi~ 322 (584) T protein:vir:95 244 DGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITV-VDRSTEVRNESIPTWFGSAPIYHVGWRFRPDNLWAMGPL 322 (584) T ss_pred ccccccccccCCceeEEEeecccccccccCCCcccceEEE-EeccEEEEeeecCCCCCCCCEEEEcceeeeccccCCCch Confidence 0112444431 22211 22222222222 24444443 66788899999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCccccccccCCCccchHHHHHHHHHHH Q lcl|Aclame:pro 258 EDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVV 337 (510) Q Consensus 258 ~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~ 337 (510) +.++|-.+.||.+.+.++++...+++|++. .++++.++...+++.+..|.+.++.+++.. ..++..+...|+-+. T Consensus 323 ~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k----~~~~~~~~~~~pg~~~~~~~~~~~q~~~p~-a~~~~s~~~~lq~~e 397 (584) T protein:vir:95 323 DNLVGMQYRIDHLENAKADAVDLIIQPPLK----IIGEVEEFVWGPGAEIHLDQGGDVQEIAKN-VNYIINADNQIQMLE 397 (584) T ss_pred hhhhhHHHHHhHHHHHHHHHHHHhcCccee----eccccchhcccCCceeecCCCCCcceecCc-hhhhhHHHHHHHHHH Confidence 999999999999999999999999998533 456677778878778888888888777542 245555666666666 Q ss_pred HHHHHHHhhcccC--CCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcC---------------- Q lcl|Aclame:pro 338 VRLNQAFMYGANQ--RDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL---------------- 399 (510) Q Consensus 338 ~~I~~af~~~~~~--~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~---------------- 399 (510) +...+. +++. ..|.+.++++.+--.+.+.+..+.++.+...+|-.|++++++..|..-+ T Consensus 398 ~~me~~---sGvp~~~~G~~~~~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr~~n~e 474 (584) T protein:vir:95 398 DRMELY---AGAPREAMGIRTPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIRVMDTD 474 (584) T ss_pred HHHHhh---hCCChhhcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCceeeeccc Confidence 655542 2221 2333344444444446677788899999999999999999998875321 Q ss_pred -----CCCCCccceeeE--Eee--cHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccC Q lcl|Aclame:pro 400 -----LQGLITKQHKPA--IET--GLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYK 470 (510) Q Consensus 400 -----l~~~p~~~~~~~--~vs--~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~ 470 (510) +..+.+++++.. ++. -.+-+.|++..+++.+|+|. .++ +++++.++--.+.+.+++..+.|...|.+ T Consensus 475 ~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~--~~~--~~i~p~~~~~~l~~~ladl~~~p~~~~~~ 550 (584) T protein:vir:95 475 LGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNS--QIG--QMILPHTSGKALATFVDDVTGLQGYEIFR 550 (584) T ss_pred cccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHh--hhh--hhccccchHHHHHHHHHHHhCCCcccccC Confidence 223445565533 222 23336788889999988875 222 25777888888999999999999877776 Q ss_pred CHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH-HHhhccc Q lcl|Aclame:pro 471 SADELQAEAEEQRRQA-AQAQAAQETLLEGA-SDMTNAL 507 (510) Q Consensus 471 s~ee~~~~~~~~~qqa-~~~~~a~~~~~~~a-~~~~~~~ 507 (510) .+ .++.+|.+.||. .++| +. .+.+| +...++. T Consensus 551 ~~--~~~~~Q~~~q~~~~~~q--~~-~~~~~~~~~~~~~ 584 (584) T protein:vir:95 551 PN--VAVAEQAETQSLVAQAQ--ED-LQLQAQMPAEGAI 584 (584) T ss_pred CC--cccchhHHHHhhhHHHH--HH-HHHHHhhhhccCC Confidence 43 333233222222 1122 11 12222 2233333 No 29 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=100.00 E-value=1.2e-32 Score=195.53 Aligned_cols=481 Identities=11% Similarity=0.047 Sum_probs=298.6 Q ss_pred ChhHHHHHHHHH--hccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKL--RDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~l--kr~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) .-.-++.+|... +|+.-+..|+|+.+|+.-..-+..++..-+...+++-+.....+.+|.+.+++++|| +..||++. T Consensus 21 ~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~~~tr~t~~~~~~w~~s~t~~k~~~~~~~l~a~~~~~~fp-~~~w~d~~ 99 (599) T protein:vir:31 21 FIDELVVLFTNMENARAQKDREDKELMDYIDATDTRKTSNSKLPFKNSTTINKLAHLHLMITTSYMEHLLP-NRNWVDFV 99 (599) T ss_pred HHHHHHHHHHhhhhhhhhhhcccHHHHHHHhhhcccccccCCCCcccccchHHHHHHHHHHHHHHHhhhcC-CccceEee Confidence 222356788877 588899999999999876655555666666666788888999999999999999999 89999998 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--------CCCC------- Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--------SDEA------- 143 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~--------~~~~------- 143 (510) .-++... . +..=+.+++.+...|+.|+|+.+....+.|++.+|||..=++ +|.. T Consensus 100 ~~~~~~~--------~---~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~~~er~~~~~~d~~v~~~~~~ 168 (599) T protein:vir:31 100 GFDNDSV--------N---AEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHTRHVKRMTVTAENQVIKNYSG 168 (599) T ss_pred ecCCchh--------H---HHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEeeeEEEcceeeccccccccccc Confidence 7765432 1 112234677888899999999999999999999999875443 1211 Q ss_pred -eEEEEEeceEEEeeCCCCceeEEE--EEEEecHHHHhHHhhHH--------hh-------cc---ccc----------- Q lcl|Aclame:pro 144 -TVVAWSLRSYAVRRDATGRWMDIV--LKQRYKSKDLDDVYKQD--------LM-------RA---GRN----------- 191 (510) Q Consensus 144 -~~~~~pl~~~~v~~d~~G~v~~i~--r~~~~t~~~l~~~~~~~--------~~-------~~---~~~----------- 191 (510) +++-++..++++..++ +.+++.+ +|-.+|..+|.....+. +. +. ++. T Consensus 169 P~~ervsP~Di~~Dp~A-~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~~~~~~~~~~d~~~~~~g~D~ 247 (599) T protein:vir:31 169 TVTERLSPSDVFWDVTA-DSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLREERRTIREALADGYNGRRKFDS 247 (599) T ss_pred ceEEeecccceeeCCCC-CCCCcceeeeehhhhHHHHHHHhccCCccccchHHHHHHHhhccCCCccccchhhhhhhccc Confidence 2444556788888887 6677644 46667888887655321 10 00 000 Q ss_pred --CCCCce---------EEEEEE---EEeecCCCeeEE---EEEEeeCCeeecc--ccccccccCceEEEeeeecCCCcc Q lcl|Aclame:pro 192 --LSGSGS---------VDLYTH---VQRRKGTAMDYA---EMYHEIDGVRVGE--TGRWPIHLCPYIVPTWNLAPGEHY 252 (510) Q Consensus 192 --~~~~~~---------v~v~~~---v~~~~~~~~~~~---sv~~e~~~~~~~~--~~~y~~~~~P~~~~Rw~~~~ge~Y 252 (510) +++... |++++- ++...++. .+ .++ -++++.+.+ ...|+.++.||++..|....++.| T Consensus 248 ~~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~--~~~~~ViT-i~g~~~liR~e~np~~~g~~Pyvv~~~~P~~~~~y 324 (599) T protein:vir:31 248 LHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDE--LWNNYEIT-VIDRKIIGRKQSKDTWDGSQNLHIAVYEFQKDTLC 324 (599) T ss_pred cccccccchhhhcccchhhhhhhhhhhhcccCCc--cccceEEE-EecCcEEeecccCCCCCCCCCeEEEEeeeeccccC Confidence 011111 222210 11111111 11 222 234444433 344787889999999999999999 Q ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCccccccccCCCccchHHHHHH Q lcl|Aclame:pro 253 GRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQS 332 (510) Q Consensus 253 Grgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~ 332 (510) |.||...++|.+..||.+.+.++.+...+++| +++-.|.+.|.++...|+..+..+...++.++.-. .+...+... T Consensus 325 G~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p--~l~~~~dl~~eD~~~~P~~v~~~~d~~~vq~~~p~--s~~~~a~~~ 400 (599) T protein:vir:31 325 PIGPLHRLTGMQYKLDKRENFREDLHDRFLHP--SLKKVGDVREKGMRGGPNHVFEVEETGDVQYMTPP--AEVLQPDNQ 400 (599) T ss_pred CCCCchhcchHHHHHHHHHHHhhhhhhhhhcc--cccccccccccCccCCCCcceeecCCCccccccCc--hhhhhHHHH Confidence 99999999999999999999999999999988 34445778888888876656666666666554322 233334445 Q ss_pred HHHHHHHHHHHH----hhcccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcC--------- Q lcl|Aclame:pro 333 LQAVVVRLNQAF----MYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL--------- 399 (510) Q Consensus 333 i~~~~~~I~~af----~~~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~--------- 399 (510) |+..+.+..+.- +..+.+..++. ||+||+...++........+..+..+++.||+++++...++.. T Consensus 401 is~~e~~mee~sGvp~~~~G~~~ag~~-TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~~~tiri~ 479 (599) T protein:vir:31 401 LSITLQLMEDLSGAPKESIGQRTAGEK-TKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTIKTF 479 (599) T ss_pred HHHHHHHHHHhhccchhhcCCcccchh-hHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeee Confidence 555555444321 11122223443 9999999999999999999999999999999999988765421 Q ss_pred --------CCCCCcccee--eEEeecH--HHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhh Q lcl|Aclame:pro 400 --------LQGLITKQHK--PAIETGL--PALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQ 467 (510) Q Consensus 400 --------l~~~p~~~~~--~~~vs~l--~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~ 467 (510) +..+-.++++ ..++.-- .-+.|.+..+++.+|++ ++++++ ++|.+.-.++...++.....-.-. T Consensus 480 ~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~l~~il~--~~~~q~--~~P~~~~k~l~~~l~~~~~l~~~~ 555 (599) T protein:vir:31 480 NSELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAILG--GPLGAA--LAPHMSRTKLFNAVEYLGDLDAYG 555 (599) T ss_pred cccccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHHHHHHHhc--ccCCCc--cchhhHHHHHHHHHHHHHhccccc Confidence 2223333433 2233322 23778888999998887 333331 233333334444444433333223 Q ss_pred ccCCHHHHHHHHHHHHHHHHHHHHHHH-HHHHH-HHHhhcccCCC Q lcl|Aclame:pro 468 FYKSADELQAEAEEQRRQAAQAQAAQE-TLLEG-ASDMTNALAGV 510 (510) Q Consensus 468 i~~s~ee~~~~~~~~~qqa~~~~~a~~-~~~~~-a~~~~~~~ag~ 510 (510) |.+..--+++ ++++..|+|.... ...-+ .++.-|++..= T Consensus 556 ~~~~~va~~e----qq~~~~m~Q~~lq~~~~~~~~~~~~~~~~~~ 596 (599) T protein:vir:31 556 IFTFGIGVQE----DQQLARMAQKSTQQTEETALTQEEVGGPTTD 596 (599) T ss_pred cCCCchhHHH----HHHHHHHHHHHHHHhHhhhhhhhhcCCCCcc Confidence 4443211111 1111222221111 11111 12222222222 No 30 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=100.00 E-value=6.7e-30 Score=180.52 Aligned_cols=486 Identities=12% Similarity=0.064 Sum_probs=280.6 Q ss_pred ChhHHHHHHHHHhc--c-CchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCccccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRD--G-SVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~k~~~~~r~~~lkr--~-~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l 77 (510) +.+.+..+++..+. + .+...+.+-.+|.+-....... .+ ..+++.+.-...++.+.+.|+..+|+ +.+||++ T Consensus 15 ~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~~~-~~---~s~~~~~~v~~~v~~~~~~l~~~~~~-~~~~~~~ 89 (705) T protein:vir:88 15 VLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGNER-PG---KSGIVSRDVQETVDWIMPSLMKVFTS-GGQVVKY 89 (705) T ss_pred HHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCccc-CC---CCccccHHHHHHHHHHHHHHHHhhcC-CCceEEE Confidence 22223333332221 1 1222233333443322111111 11 23456666777889999999999886 8899999 Q ss_pred CCChhhhhhhccCchHHHHHHHHHHHHHHHHHHH-HHhcCCHHHHHHHHHHHHhhCceEE--EEeC-------------- Q lcl|Aclame:pro 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQR-LFQNASLAVLTQVIKLLIVTGNALL--YRNS-------------- 140 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~-l~~snf~~~~~~~~~~l~~~G~~~l--~~~~-------------- 140 (510) .+-.+...+ +. +.++..+.-. ...++.+..++.++++.+.+|++++ |.+. T Consensus 90 ~p~~~~D~~----------~a---~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~~~e~~~~~~~ 156 (705) T protein:vir:88 90 EPDTAEDVE----------QA---EQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLSE 156 (705) T ss_pred eeCChhHHH----------HH---HHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccchhhhhhccCCh Confidence 986544322 11 1122233222 4556678889999999999999875 2211 Q ss_pred ----------C-------------------------CCeEEEEEeceEEEeeCCCCceeE--EEEEEEecHHHHhHH--- Q lcl|Aclame:pro 141 ----------D-------------------------EATVVAWSLRSYAVRRDATGRWMD--IVLKQRYKSKDLDDV--- 180 (510) Q Consensus 141 ----------~-------------------------~~~~~~~pl~~~~v~~d~~G~v~~--i~r~~~~t~~~l~~~--- 180 (510) + ..++..+|..+|++..++.+--|. +++++.+|..+|... T Consensus 157 ~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~~~d~~~~~~~~~~t~~dl~~~g~~ 236 (705) T protein:vir:88 157 DMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHREKYTVSDLRLLGVP 236 (705) T ss_pred hhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCCcccCcEEEEEEeccHHHHHhhcCC Confidence 0 113456788899999998774443 667888999888432 Q ss_pred ------hhHHh-----------hccc------------ccCCCCceEEEEEEEEeecCCCeeEEEEEE-eeCCeeecccc Q lcl|Aclame:pro 181 ------YKQDL-----------MRAG------------RNLSGSGSVDLYTHVQRRKGTAMDYAEMYH-EIDGVRVGETG 230 (510) Q Consensus 181 ------~~~~~-----------~~~~------------~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~-e~~~~~~~~~~ 230 (510) +..+- .+.. ..++....|.+|+|..+-+..+-.+..+|. -..|.++.... T Consensus 237 ~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~~~~~~~g~~il~~~ 316 (705) T protein:vir:88 237 EDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRILYVGDYIISNE 316 (705) T ss_pred hhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEecccCCcceeeEEEEEeCccccccc Confidence 11000 0000 001122357888887654322212222222 12344555544 Q ss_pred ccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceec- Q lcl|Aclame:pro 231 RWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVP- 309 (510) Q Consensus 231 ~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~- 309 (510) .+ +.+||++.++.+.++..||+|++....+-.+.+|.+.+.+++++..+++|.++++. |.+++.++....+|.++. T Consensus 317 ~~--~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~-g~v~~~d~~~~~pg~vv~~ 393 (705) T protein:vir:88 317 PW--DCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLD-GQVNLEDLLTNEAAGIVRV 393 (705) T ss_pred cC--CCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceeccc-cccCcccccccCCCeeEEe Confidence 44 57999999999999999999999999999999999999999999999999999965 666777776666666553 Q ss_pred CCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc-ccC---CC--CCCCCHHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|Aclame:pro 310 GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-ANQ---RD--AERVTAEEVRITAEEAENTLGGTYSLLAENL 383 (510) Q Consensus 310 g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~-~~~---~~--~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~ 383 (510) .+.+.+.+++.. .-.+.+...++.+.+.|.+..-.+ ..+ .+ ..+.||+.|....+.....+..+...+...+ T Consensus 394 ~~~~~i~~~~~~--~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~ 471 (705) T protein:vir:88 394 KSMNSITPLETP--QLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETG 471 (705) T ss_pred cCCCccccccCC--cCcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 233445555433 233455677888888888766332 221 11 2357999999999999999999888888889 Q ss_pred HHHHHHHHHHHHhhcC-----------CCCCCcc----ceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcC Q lcl|Aclame:pro 384 QSPLAYVCLSEVDDAL-----------LQGLITK----QHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRI 448 (510) Q Consensus 384 l~Pli~r~~~il~~~~-----------l~~~p~~----~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~i 448 (510) +.+++.+++.++.... ..++.+. ...+.+.+++....+.+....+..+++..+.+.+.++..+.+ T Consensus 472 ~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll~~~q~l~~~~~~~~~~ 551 (705) T protein:vir:88 472 VKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQAVVGGGGLGVLV 551 (705) T ss_pred HHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhccCCceEEeeccccchHHHHHHHHHHHHHHHHHhhcccchhhhc Confidence 9999999999875432 1122222 234555566777777777777777777776666655555555 Q ss_pred CH---HHHHHHHHHHcCCCH-hhccCCHHHHHHHHHHHHH--------------HHHHHHHHHHHHHHHHHHhhccc--C Q lcl|Aclame:pro 449 SL---PKMMDTIWAAFSVDT-SQFYKSADELQAEAEEQRR--------------QAAQAQAAQETLLEGASDMTNAL--A 508 (510) Q Consensus 449 d~---d~~~~~~a~~~Gvp~-~~i~~s~ee~~~~~~~~~q--------------qa~~~~~a~~~~~~~a~~~~~~~--a 508 (510) +. .+++..++..+|+-. ..+...+...++.+.++++ |+.+++ ++..+....++....+ + T Consensus 552 ~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k-~q~e~~~~q~e~q~~q~E~ 630 (705) T protein:vir:88 552 SEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQR-AQSDALAKQAEAQMKQVEA 630 (705) T ss_pred ChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH Confidence 43 345555666655421 1122221111111100000 000000 0000000000000000 0 Q ss_pred CC Q lcl|Aclame:pro 509 GV 510 (510) Q Consensus 509 g~ 510 (510) -. T Consensus 631 q~ 632 (705) T protein:vir:88 631 QI 632 (705) T ss_pred HH Confidence 00 No 31 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=99.93 E-value=2.7e-23 Score=144.30 Aligned_cols=487 Identities=9% Similarity=-0.001 Sum_probs=250.2 Q ss_pred ChhHHHHHHHHHhcc--Cch---HHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDG--SVE---QRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~k~~~~~r~~~lkr~--~~~---~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF 75 (510) .-.+|++-++..++. ... ..|-+++-|.. ...++. ..+ +.++....-.+.++.+-+.|+-.+++ +..|| T Consensus 28 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~g---rs~vv~~~v~~~ve~~~~~l~~~f~~-~~~~~ 101 (763) T protein:vir:95 28 SLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEG-KAKPPK-VKG---RSQVQPKLVRRQAEWRYSALTEPFLG-SNKLF 101 (763) T ss_pred HHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccc-cCcccc-cCC---CccccCHHHHHHHHHHHHHHHHhhcC-CCcEE Confidence 222333333333211 111 24555543331 111111 111 23457777888899998999998888 66899 Q ss_pred ccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHH-HHHhcCCHHHHHHHHHHHHhhCceEE--EEeC------------ Q lcl|Aclame:pro 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQ-RLFQNASLAVLTQVIKLLIVTGNALL--YRNS------------ 140 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~-~l~~snf~~~~~~~~~~l~~~G~~~l--~~~~------------ 140 (510) ++.+-.+...+. ++ +.+..+.- ....++-+..++..+++.+..|++++ |.+. T Consensus 102 ~~~P~~~~D~~~---------A~----q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~~~~~~~~~ 168 (763) T protein:vir:95 102 KVTPVTWEDVQG---------AR----QNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNREIRKEKQEVPVF 168 (763) T ss_pred EEecCCcchHHH---------HH----HHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeeeeeeeeeeeehhh Confidence 999887654321 11 12222222 24566677888999999999999863 3320 Q ss_pred --------------------------------C-----------------------------------CCeEEEEEeceE Q lcl|Aclame:pro 141 --------------------------------D-----------------------------------EATVVAWSLRSY 153 (510) Q Consensus 141 --------------------------------~-----------------------------------~~~~~~~pl~~~ 153 (510) + ..+++.+|..+| T Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V~p~d~ 248 (763) T protein:vir:95 169 SLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEMLNPENI 248 (763) T ss_pred hhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEEeecHHHh Confidence 0 002345778899 Q ss_pred EEeeCCCCcee---EEEEEEEecHHHHhHH-hhHH--------hhc-----cc-----c--c--CCCCceEEEEEEEEee Q lcl|Aclame:pro 154 AVRRDATGRWM---DIVLKQRYKSKDLDDV-YKQD--------LMR-----AG-----R--N--LSGSGSVDLYTHVQRR 207 (510) Q Consensus 154 ~v~~d~~G~v~---~i~r~~~~t~~~l~~~-~~~~--------~~~-----~~-----~--~--~~~~~~v~v~~~v~~~ 207 (510) +|..++.+.++ -+++++.+|..+|.+. ++.+ ... .. . . .....+|.||.|..+- T Consensus 249 ~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v~E~y~~~ 328 (763) T protein:vir:95 249 IIDPSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVAYEYWGFW 328 (763) T ss_pred eecCCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCCcccceEEEEEeeeee Confidence 99998877544 3578899999999653 1111 000 00 0 0 0113578888876553 Q ss_pred --cCCCeeEEEEEEeeCCeee-ccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCC Q lcl|Aclame:pro 208 --KGTAMDYAEMYHEIDGVRV-GETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEV 284 (510) Q Consensus 208 --~~~~~~~~sv~~e~~~~~~-~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~ 284 (510) ++++...+.+...+++..+ ..+..|+.+.+||++..+.+.++..||+|.++.+.+..+.+|++.+..+..+..+++| T Consensus 329 d~~gdg~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~~~~d~Qr~~N~~~~~~~d~l~~~~~~ 408 (763) T protein:vir:95 329 DIEGNGVLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAELLGDNQAVLGAVMRGMIDLLGRSANG 408 (763) T ss_pred ccCCcceeEEEEEEEEcCeeeecccccccCCCcCEEEecceeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhhcCC Confidence 3333322211123344333 2446677788999999999999999999999999999999999999999999999999 Q ss_pred ceeeCCCCccchhhhhcCCCccee---cCCccc--cccccCCC-ccchHHHHHHHHHHHHHHHHHH-hhcccCCCCCCCC Q lcl|Aclame:pro 285 LNLVDEAKGAVVDDYQDAEMGDYV---PGGAEA--VRAYERGD-YNKMAAIQQSLQAVVVRLNQAF-MYGANQRDAERVT 357 (510) Q Consensus 285 ~~lv~~~g~~~~~~~~~~~~G~~~---~g~~~~--v~~~~~~~-~~~~~~~~~~i~~~~~~I~~af-~~~~~~~~~~~vT 357 (510) .|+++.+. +++.+.....+|.++ +|.... +.+.+... ...+..+.+..+...+.+.-.- +..+...++...| T Consensus 409 ~~~v~~ga-v~~~d~~~~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~~~~~~e~~TGv~~~~~G~~~~~~~~t 487 (763) T protein:vir:95 409 QRGMPKGM-LDALNSRRYREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATLQNQEAESLTGVKAFAGGVTGESYGDV 487 (763) T ss_pred cEEeeccc-ccchhhhcccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHHHHHHHHHhhCcchhhcCcCcccccch Confidence 99997655 454444444455443 333221 22221111 1233333333333322222111 1112232333469 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcC------------CCCCCccce----eeEEeecHHHHHH Q lcl|Aclame:pro 358 AEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDAL------------LQGLITKQH----KPAIETGLPALSR 421 (510) Q Consensus 358 AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~------------l~~~p~~~~----~~~~vs~l~~l~r 421 (510) |++|..+.+.....+..++.++.. .+.+++++++.++.... ..++.+++. .+.+..+.+ -.+ T Consensus 488 at~v~~l~qa~~~~~~~~~r~~~~-~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~DV~V~~~~a-s~~ 565 (763) T protein:vir:95 488 AAGIRGVLDAASKREMAILRRLAK-GMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKREDLKGNFDLEVDISTA-EVD 565 (763) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHHHhcCCcceEEecccc-hHH Confidence 999999999988888888877765 78999999999876421 223333332 222222222 223 Q ss_pred HHHHHHHHHHHHHHHhhcCh-------HhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 422 SAAVQSMLNASQVIAGLAPI-------AQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQE 494 (510) Q Consensus 422 ~~~~~~~~~~~q~~~~~~~~-------~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~ 494 (510) .+..+.+..+++.++...++ .++++..+...+++.+.....-|.. +-.-..+.++.+.+...+.++++++.+ T Consensus 566 ~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~d~-~~q~qaqle~~~~q~e~~~~~akaq~~ 644 (763) T protein:vir:95 566 NQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQPDP-VQEQLKQLAVEKAQLENEELRSKIRLN 644 (763) T ss_pred HHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCccc-hhhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 33444444444443322221 1122333444444443333222110 100000111100000000000000000 Q ss_pred -----HHHHHH----HHhhcccCCC Q lcl|Aclame:pro 495 -----TLLEGA----SDMTNALAGV 510 (510) Q Consensus 495 -----~~~~~a----~~~~~~~ag~ 510 (510) ..++++ .+..-...+. T Consensus 645 qaqa~~~~aq~e~~~~d~~~~e~~~ 669 (763) T protein:vir:95 645 DAQAQKAMAERDNKNLDYLEQESGT 669 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000 0000001111 No 32 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.74 E-value=3.7e-16 Score=105.17 Aligned_cols=484 Identities=11% Similarity=-0.039 Sum_probs=236.9 Q ss_pred ChhHHHHHHHHHh-----ccCchHHHHHHHHhhcccccCCCCCCcc---ccccccccchHHHHHHHHHHHHHHhhcCccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-----DGSVEQRAIEFAKTTLPYLMVDPMSGSR---GVVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~k~~~~~r~~~lk-----r~~~~~~w~e~~~~~~P~~~~~~~~~~~---~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~ 72 (510) +.+.+.+..+.++ ...|...+.+-.+|..-. ..++..... ...+.+.-..-...++.+.+..-. ++ T Consensus 43 ~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~-Qw~~~~~~~l~~~g~p~~~~N~i~~~i~~v~g~~~~-----nr 116 (776) T protein:vir:93 43 AVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNI-QWSQDEIDELKERGQAPTVYNVISQSVNWIIGSEKR-----GR 116 (776) T ss_pred HHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCC-CCCHHHHHHHHhcCCceEEecchHHHHHHHHHHHHh-----CC Confidence 4444444433332 112333343444443211 111111000 001112222223333333332222 55 Q ss_pred cccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCCC--e--EE Q lcl|Aclame:pro 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDEA--T--VV 146 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l--~~~~~~~--~--~~ 146 (510) +=+++.+.+... .++.+.| +..+......+++..+...++.+.++.|.+++ +.+.+.. . .+ T Consensus 117 ~~~~~~p~~~~d----------~~~Ae~l---~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~~~~~~~~~~ 183 (776) T protein:vir:93 117 SDFKVLPRRKDG----------GKAAERK---TALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQDENDGEPIYAG 183 (776) T ss_pred cceEEecCChhH----------HHHHHHH---HHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEeeccCCCCceEee Confidence 556666654321 1233333 33344445778999999999999999998874 4444322 2 34 Q ss_pred EEEeceEEEeeCCCC-c---eeEEEEEEEecHHHHhHHhhHHhh---cccc----------------------------- Q lcl|Aclame:pro 147 AWSLRSYAVRRDATG-R---WMDIVLKQRYKSKDLDDVYKQDLM---RAGR----------------------------- 190 (510) Q Consensus 147 ~~pl~~~~v~~d~~G-~---v~~i~r~~~~t~~~l~~~~~~~~~---~~~~----------------------------- 190 (510) +++..++++..++.- . ..-+|++.++|.+++...|++... .... T Consensus 184 ~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 263 (776) T protein:vir:93 184 AESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMDSPEYERSMNSV 263 (776) T ss_pred ccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhcccccccccccccccccccc Confidence 556678888766542 2 234778888999999887764211 0000 Q ss_pred ----cCCCCceEEEEEEEEeecC------------C-----------------C------eeEEE--EEEeeCCeee-cc Q lcl|Aclame:pro 191 ----NLSGSGSVDLYTHVQRRKG------------T-----------------A------MDYAE--MYHEIDGVRV-GE 228 (510) Q Consensus 191 ----~~~~~~~v~v~~~v~~~~~------------~-----------------~------~~~~s--v~~e~~~~~~-~~ 228 (510) .....+.|.|+.+.+++.. + + ..... +++..++..+ .. T Consensus 264 ~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~~g~~~l~~~ 343 (776) T protein:vir:93 264 TAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCAIMTTRDLMWAG 343 (776) T ss_pred cccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheeeeeeEEEEEecchhhhcc Confidence 0011246777777654311 0 0 00111 2223333322 23 Q ss_pred ccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhc--CCCcc Q lcl|Aclame:pro 229 TGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD--AEMGD 306 (510) Q Consensus 229 ~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~--~~~G~ 306 (510) .+.|+.+.|||++......+.+.||.|.+....+-.+.+|+....++... ....+++..+.+.+.+.+.. +.+|. T Consensus 344 ~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l---~~~~~~~~~gav~~~d~~~~~~~rp~~ 420 (776) T protein:vir:93 344 PSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYIL---STNKVLMEEGAVDDIDEFRREAARPDA 420 (776) T ss_pred CCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHhh---cCCceeeccccccchHHHHHhcccCCc Confidence 46677788999999999999999999999999999999999988877653 34567887777767776664 33444 Q ss_pred eecCCccccccccCCCc-cchHHHHHHHHHHHHHHHHHHh-hcc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|Aclame:pro 307 YVPGGAEAVRAYERGDY-NKMAAIQQSLQAVVVRLNQAFM-YGA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENL 383 (510) Q Consensus 307 ~~~g~~~~v~~~~~~~~-~~~~~~~~~i~~~~~~I~~af~-~~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~ 383 (510) ++....+....+++... .-.+...+.++...+.|...-- .+. ....+...+..-|..|.+.....+..++.++.. . T Consensus 421 vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~~~~~~~~~~~~~dn~~~-~ 499 (776) T protein:vir:93 421 VMTVKNGKLGAVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAVSGVAIQARQEQGSVATNKLFDNLRL-A 499 (776) T ss_pred eeeeCCccccccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHH-H Confidence 44322222222221111 1234455666666666666531 121 122334467778999999999999999998866 4 Q ss_pred HHHHHHHHHHHHhhc----CCC---C---------C----Cccc-----eeeEEeecHH-HHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 384 QSPLAYVCLSEVDDA----LLQ---G---------L----ITKQ-----HKPAIETGLP-ALSRSAAVQSMLNASQVIAG 437 (510) Q Consensus 384 l~Pli~r~~~il~~~----~l~---~---------~----p~~~-----~~~~~vs~l~-~l~r~~~~~~~~~~~q~~~~ 437 (510) +.=+.+.++.++... ... . + +..+ +.+.+..+.+ +..|.+..+.+...++ . T Consensus 500 ~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~~~s~r~~~~~~l~ql~~---~ 576 (776) T protein:vir:93 500 FQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEAEWRATMRQAAVAELMEVIG---K 576 (776) T ss_pred HHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeecccchhHHHHHHHHHHHHHh---h Confidence 444555555554331 111 1 1 1111 1232323222 3446665555554443 2 Q ss_pred hcCh---------HhHhhcCCHHHHHHHHHHHcCCCH-hhccCCHHHHHHHHHHHHHHHHHHHHHHHHH---H------- Q lcl|Aclame:pro 438 LAPI---------AQLDPRISLPKMMDTIWAAFSVDT-SQFYKSADELQAEAEEQRRQAAQAQAAQETL---L------- 497 (510) Q Consensus 438 ~~~~---------~q~~~~id~d~~~~~~a~~~Gvp~-~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~---~------- 497 (510) +.+. .+.++--+.+++...+-...+-+. ..-...+++.++.+.++++++++++.+.+.. . T Consensus 577 ~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~~~q~q~~~~~a~~~~~qa~a~~~~ 656 (776) T protein:vir:93 577 MPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQQQQYNDALAIATLEEQQAKARKAA 656 (776) T ss_pred cChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhHHHHHHHhhHHHHHHHHHhhhhhhHhhHHHHHHH Confidence 2211 112222356677777666655331 1122222222211111111111111110000 0 Q ss_pred -------HHHHHh--hcccCCC Q lcl|Aclame:pro 498 -------EGASDM--TNALAGV 510 (510) Q Consensus 498 -------~~a~~~--~~~~ag~ 510 (510) ++|.+. .....++ T Consensus 657 aea~~~~aqa~~~~~~a~~~~~ 678 (776) T protein:vir:93 657 AEAQVAEAKAKHISRMAIREGV 678 (776) T ss_pred HHHHHHhhhhhhhhhcchhhhh Confidence 000000 0000111 No 33 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.64 E-value=1.4e-13 Score=91.05 Aligned_cols=497 Identities=10% Similarity=0.016 Sum_probs=234.3 Q ss_pred ChhHHHHHHHHHh-ccCchHHHH----HHHHhhcccccCCCCCCcc---cccccc-ccchHHHHHHHHHHHHHHhhcCcc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-DGSVEQRAI----EFAKTTLPYLMVDPMSGSR---GVVEHD-FQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~k~~~~~r~~~lk-r~~~~~~w~----e~~~~~~P~~~~~~~~~~~---~~~~~~-~dstg~~a~~~Laa~l~~~ltpp~ 71 (510) .++-+....+.++ ...+...|+ +=.+|..-.-+ ++..... ...+.+ |+=++. .++...+..-. + T Consensus 28 ~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw-~~~~~~~l~~~g~p~~~~N~i~~-~v~~v~g~~~~-----n 100 (711) T protein:vir:10 28 DRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQW-PSQVRTERELEQRPCLVNNVLPT-FVDQVLGDQRQ-----N 100 (711) T ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCC-CHHHHHHHHhcCCCcEEEcchHH-HHHHHhhhHhh-----C Confidence 2222222222222 233444443 22233221101 1100000 001111 332222 22222222221 2 Q ss_pred CcccccCCCh------------hhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--E Q lcl|Aclame:pro 72 IPFFRSELTD------------AIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--Y 137 (510) Q Consensus 72 ~~WF~l~~~d------------~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l--~ 137 (510) ++=+++.+.+ .........+....++.+.|.. .+.-....++...+...++.+.+..|.|++ + T Consensus 101 r~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~---~~~~~~~~~~~~~~~s~af~d~~~~G~G~~ev~ 177 (711) T protein:vir:10 101 RPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTG---LIKNIEYNCDAETEYDIAFQGAVESGMGYLRVR 177 (711) T ss_pred CcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHH---HHHHHHHhcChhHHHHHHHHHhhhcCcceEEEE Confidence 3333333321 0001111111111223333333 333345677888899999999999998864 2 Q ss_pred Ee---CC--CC--eEEEEE-eceEEEeeCC---CCc-eeEEEEEEEecHHHHhHHhhHHhh----cccccC-C---CCce Q lcl|Aclame:pro 138 RN---SD--EA--TVVAWS-LRSYAVRRDA---TGR-WMDIVLKQRYKSKDLDDVYKQDLM----RAGRNL-S---GSGS 197 (510) Q Consensus 138 ~~---~~--~~--~~~~~p-l~~~~v~~d~---~G~-v~~i~r~~~~t~~~l~~~~~~~~~----~~~~~~-~---~~~~ 197 (510) .+ ++ .+ .+..++ ..++++..++ ++. ..-+|++.+++.+++...|+.... ...... + ..+. T Consensus 178 ~d~~~~d~~~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~yp~~a~~~~~~~~~~~~~~~~~~~~ 257 (711) T protein:vir:10 178 SDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKS 257 (711) T ss_pred ecccCCCCCCCCeEEeeecChhheeeCccccccChhhhcceeeeecCCHHHHHHhCCchhhhhhhcccccccCcccCcce Confidence 22 22 12 244442 4567775543 222 334888899999999999985431 111111 0 0133 Q ss_pred EEEEEEEEeec----------CCC-------------------------eeEEEEEE--eeCCeeeccccccccccCceE Q lcl|Aclame:pro 198 VDLYTHVQRRK----------GTA-------------------------MDYAEMYH--EIDGVRVGETGRWPIHLCPYI 240 (510) Q Consensus 198 v~v~~~v~~~~----------~~~-------------------------~~~~sv~~--e~~~~~~~~~~~y~~~~~P~~ 240 (510) |.|..+.++++ +.. .....+|+ -.+...+...+.|+.+.|||+ T Consensus 258 vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~G~~~L~~~~p~~~~~~P~v 337 (711) T protein:vir:10 258 VRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVI 337 (711) T ss_pred eeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEEecceeecCCCCCCCCcccEE Confidence 44443333221 000 01112222 233333334456777789999 Q ss_pred EE--eeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhc---CCCccee---cCCc Q lcl|Aclame:pro 241 VP--TWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD---AEMGDYV---PGGA 312 (510) Q Consensus 241 ~~--Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~---~~~G~~~---~g~~ 312 (510) +. .+...++..++.|.+....+-.+.+|++...++.......++++++.++.+-+.+.... ..+|.++ ||.- T Consensus 338 p~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~gai~~~~~~~~e~~~~~~~vi~~~~~~~ 417 (711) T protein:vir:10 338 PVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQ 417 (711) T ss_pred EEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCcccCChHHHHHhccccCCCeeEeccccc Confidence 65 35667888888899999999999999999999999999999999998877776665432 3455554 3332 Q ss_pred cccccccCCCccchHHHHHHHHHHHHHHHHHHh-hcccC-CCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 313 EAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM-YGANQ-RDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYV 390 (510) Q Consensus 313 ~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~-~~~~~-~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 390 (510) +.-.+-......-.+...+.++...+.|.+.-- .+..+ ..+...|+.-|..|.+.-...|..++.++.. ...-+... T Consensus 418 ~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~q~qg~~~l~~~~dn~~~-~~~~~g~~ 496 (711) T protein:vir:10 418 GDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTK-SIRRVGKI 496 (711) T ss_pred CcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Confidence 221222222233445566777777777766542 22222 2334468889999999999999998888775 33333344 Q ss_pred HHHHHhhcC----CCCCCc-----c--------------------ce-----eeEE-eecHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 391 CLSEVDDAL----LQGLIT-----K--------------------QH-----KPAI-ETGLPALSRSAAVQSMLNASQVI 435 (510) Q Consensus 391 ~~~il~~~~----l~~~p~-----~--------------------~~-----~~~~-vs~l~~l~r~~~~~~~~~~~q~~ 435 (510) ++.++.... ..-+.. + ++ .+.+ +++-.+-.|.+.+..+.++++.+ T Consensus 497 ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~~l~ql~~~~ 576 (711) T protein:vir:10 497 LVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAV 576 (711) T ss_pred HHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccceeeeEEEEeeccCchhHHHHHHHHHHHHHhhc Confidence 444432211 110100 0 11 1222 22334555566566666555543 Q ss_pred HhhcCh--Hh---HhhcCCHHHHHHHHHHHcCCCHhhccCCHHHH-HHHHHHHHHHHHHHHHHHHH------------HH Q lcl|Aclame:pro 436 AGLAPI--AQ---LDPRISLPKMMDTIWAAFSVDTSQFYKSADEL-QAEAEEQRRQAAQAQAAQET------------LL 497 (510) Q Consensus 436 ~~~~~~--~q---~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~-~~~~~~~~qqa~~~~~a~~~------------~~ 497 (510) -.+.+. +. .+|--+.++++..+....+-+. ......+. ++..+++++++.+++.+.+. .. T Consensus 577 p~~~~~~~~~il~~~d~p~~~el~e~lr~~~~~~~--~~~~~~~~~qq~~~e~qq~~~~~q~~~~~~q~~~~qa~ae~~~ 654 (711) T protein:vir:10 577 PSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNV--LSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQ 654 (711) T ss_pred chhhhHHHHHHHHhcCCCCHHHHHHHHHhhcCccc--CcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222221 22 2334477888888887766442 22222111 11111111111111111000 00 Q ss_pred HHHH--Hh------hccc-CCC Q lcl|Aclame:pro 498 EGAS--DM------TNAL-AGV 510 (510) Q Consensus 498 ~~a~--~~------~~~~-ag~ 510 (510) ++|. +. .+++ +++ T Consensus 655 Aqae~~qa~~e~~~~q~q~~~~ 676 (711) T protein:vir:10 655 AQADMLKAQLETEEAQKQLAMI 676 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHH Confidence 0110 00 0000 000 No 34 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.39 E-value=3.3e-11 Score=78.05 Aligned_cols=486 Identities=13% Similarity=0.028 Sum_probs=225.2 Q ss_pred ChhHHHHHHHHHhc-----cCchHHHHHHH----Hhh-cccccCCCCCCcc--cc-----cccc-ccchHHHHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEKLRD-----GSVEQRAIEFA----KTT-LPYLMVDPMSGSR--GV-----VEHD-FQSAGALLVNNLAAK 62 (510) Q Consensus 1 ~k~~~~~r~~~lkr-----~~~~~~w~e~~----~~~-~P~~~~~~~~~~~--~~-----~~~~-~dstg~~a~~~Laa~ 62 (510) |-+++.++..+++. ..|...|++-+ +|. .+....++.+... .+ .+.+ |+=++. .++...+. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~-~v~~v~g~ 79 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVAT-ELNRIIAE 79 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHH-HHHHHHHH Confidence 77777666666531 12334443222 221 1221111111000 00 0111 333332 23322222 Q ss_pred HHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--- Q lcl|Aclame:pro 63 LARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--- 139 (510) Q Consensus 63 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~--- 139 (510) -.. +++=+++.+.+... + .++.+.|+. .+......++...+...+|.+.+..|-|.+.+- T Consensus 80 ~~~-----nr~d~~v~P~~~~~--------d-~~~Ae~l~~---~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~ 142 (708) T protein:vir:10 80 YRN-----NRITVKFRPGDREA--------S-EELANKLNG---LFRADYEETDGGEACDNAFDDAATGGFGCFRLTSML 142 (708) T ss_pred HHh-----CCcceEEEcCCCCc--------h-HHHHHHHHH---HHHHHHHhcCchHHHHHHHHhhhhcccceeeeeecc Confidence 211 45555555553221 0 123333433 333345578899999999999999998875331 Q ss_pred --C-----C--CCeEEEE--EeceEEEeeCCC---Cc-eeEEEEEEEecHHHHhHHhhHHhhcccc----c------CCC Q lcl|Aclame:pro 140 --S-----D--EATVVAW--SLRSYAVRRDAT---GR-WMDIVLKQRYKSKDLDDVYKQDLMRAGR----N------LSG 194 (510) Q Consensus 140 --~-----~--~~~~~~~--pl~~~~v~~d~~---G~-v~~i~r~~~~t~~~l~~~~~~~~~~~~~----~------~~~ 194 (510) + + +..++++ |..++++..++. +. -.-+||..+++.+++...||+....... . ..+ T Consensus 143 ~~e~d~~~~~~~i~i~~~~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~ 222 (708) T protein:vir:10 143 VNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGA 222 (708) T ss_pred ccccCCCCCccccceEEeecchhhcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCC Confidence 1 1 1123322 445566554432 21 1236777889999999999854221100 0 001 Q ss_pred CceEEEEEE-----------EEeecCC-----------------------------CeeEEEEEE-eeCCeeec-ccccc Q lcl|Aclame:pro 195 SGSVDLYTH-----------VQRRKGT-----------------------------AMDYAEMYH-EIDGVRVG-ETGRW 232 (510) Q Consensus 195 ~~~v~v~~~-----------v~~~~~~-----------------------------~~~~~sv~~-e~~~~~~~-~~~~y 232 (510) +.+-|..+ +.+.+.. ....+.||+ -+.|..+. ..+.| T Consensus 223 -d~v~v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~ 301 (708) T protein:vir:10 223 -DVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRI 301 (708) T ss_pred -CceEEEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCC Confidence 11111111 1111100 001111222 23444444 33557 Q ss_pred ccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhc--------- Q lcl|Aclame:pro 233 PIHLCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD--------- 301 (510) Q Consensus 233 ~~~~~P~~~~Rw~~--~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~--------- 301 (510) ++..|||++.-+.. ..|..++.|.+....+-.+.+|+..-..+..+..+.+.+++++++.+.....-.. T Consensus 302 p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~ 381 (708) T protein:vir:10 302 PGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAF 381 (708) T ss_pred CCCceeeEEEeeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhh Confidence 77789998764433 3677878899999999999999999999999988888888887765433221111 Q ss_pred -------CCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cccCCCCCCCCHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 302 -------AEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 302 -------~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~~~vTAtEi~~r~~E~~~~LG 373 (510) .+.|.++++... +.......-.+...+.++...+.|.++.-. +.....+.+.+..-|..|.+.-...++ T Consensus 382 ~~~~~~~~~~G~~~~~~~~---~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~sn~SG~aI~~rq~qg~~~l~ 458 (708) T protein:vir:10 382 LPLREVRDKSGNIIAGATP---AGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASF 458 (708) T ss_pred hccccccccccccccccCC---ccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCccchHHHHHHHHHHHHHHHHH Confidence 122222222111 101111112233455666666666666422 222211234678889999999999999 Q ss_pred hhHHHHHH------HHHHHHHHHHH------HHHhhcCC-----------CCCCc-----cce---eeEEe---ecHHHH Q lcl|Aclame:pro 374 GTYSLLAE------NLQSPLAYVCL------SEVDDALL-----------QGLIT-----KQH---KPAIE---TGLPAL 419 (510) Q Consensus 374 pv~~rl~~------E~l~Pli~r~~------~il~~~~l-----------~~~p~-----~~~---~~~~v---s~l~~l 419 (510) ..+.+|.. +++.-||...+ .|+...+- .+... .++ +..++ .+-.+- T Consensus 459 ~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s 538 (708) T protein:vir:10 459 IYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTA 538 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchh Confidence 99887763 34444444433 12211110 01111 011 11222 234456 Q ss_pred HHHHHHHHHHHHHHHHHhhcCh-H-------hHhhcCCHHHHHHHHHHHcCCCHhhccCCH-HHHHHHHHHHHHHHH--- Q lcl|Aclame:pro 420 SRSAAVQSMLNASQVIAGLAPI-A-------QLDPRISLPKMMDTIWAAFSVDTSQFYKSA-DELQAEAEEQRRQAA--- 487 (510) Q Consensus 420 ~r~~~~~~~~~~~q~~~~~~~~-~-------q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~-ee~~~~~~~~~qqa~--- 487 (510) .|.+.++.+..+++.+....+. + +.+|--+.++++..+-..++.+. ..... ++.++..+++++.+. T Consensus 539 ~r~~~~~~l~qll~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~~~~~--~~~~~~~ee~q~~~~~q~~~q~q~ 616 (708) T protein:vir:10 539 RRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISG--IAKPRNEKEQQIVQQAQMAAQSQP 616 (708) T ss_pred HHHHHHHHHHHHHHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHhhcccc--cccccchhhHHHHHHHHHHHHHHH Confidence 6777677666666654332111 1 12223366778888877775542 22221 121111111111110 Q ss_pred --HHHHHHHH---HHHHHHHhhc--ccC---CC Q lcl|Aclame:pro 488 --QAQAAQET---LLEGASDMTN--ALA---GV 510 (510) Q Consensus 488 --~~~~a~~~---~~~~a~~~~~--~~a---g~ 510 (510) ++..+++. .++.+.++.. .+. ++ T Consensus 617 ~~~~~e~qa~~~~~qAe~~ka~a~a~~~~~~a~ 649 (708) T protein:vir:10 617 NPEMVLAQAQMVAAQAEAQKATNETAQTQIKAF 649 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 01011110 0011111100 000 00 No 35 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=99.27 E-value=2.2e-10 Score=73.48 Aligned_cols=487 Identities=10% Similarity=-0.003 Sum_probs=214.6 Q ss_pred ChhHHHHHHHHHh-ccCchHHHHH----HHHhhcccccCCCCCCccccc-cc-cccchHHHHHHHHHHHHHHhhcCccCc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-DGSVEQRAIE----FAKTTLPYLMVDPMSGSRGVV-EH-DFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~k~~~~~r~~~lk-r~~~~~~w~e----~~~~~~P~~~~~~~~~~~~~~-~~-~~dstg~~a~~~Laa~l~~~ltpp~~~ 73 (510) -++.+....+.++ .-.+...|++ =.+|..- ...++......+. .+ .|+-++ ..++. +.+.-- .+++ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G-~QW~~~~~~~l~~q~rp~~N~i~-~~v~~----v~g~e~-~nr~ 76 (725) T protein:vir:10 4 NENRLESILSRFDADWTASDEARREAKNDLFFSRV-SQWDDWLSQYTTLQYRGQFDVVR-PVVRK----LVSEMR-QNPI 76 (725) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcC-CCCCHHHHHHHHhcCCCcccchH-HHHHH----HHhhHH-hCCc Confidence 2322332222222 1123344432 2233221 1111111000000 11 132222 22222 222111 1444 Q ss_pred ccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEE-----EeCCCC----e Q lcl|Aclame:pro 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLY-----RNSDEA----T 144 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~-----~~~~~~----~ 144 (510) =+++.+.++.. .++.+.|+.+ +......++..-+-..++.+.+.+|.+++- .++|.. . T Consensus 77 d~~v~p~~~~d----------~~~Ae~l~~~---~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~ 143 (725) T protein:vir:10 77 DVLYRPKDGAS----------PDAADVLMGM---YRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQV 143 (725) T ss_pred ceEEecCCcch----------HHHHHHHHHH---HHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCCcee Confidence 44555543221 2234444443 333345788888999999999999988742 233321 2 Q ss_pred EEEE----EeceEEEeeCCC-Cce---eEEEEEEEecHH---HHhHHhhHHhhcc-----ccc----CCCCceEEEEEEE Q lcl|Aclame:pro 145 VVAW----SLRSYAVRRDAT-GRW---MDIVLKQRYKSK---DLDDVYKQDLMRA-----GRN----LSGSGSVDLYTHV 204 (510) Q Consensus 145 ~~~~----pl~~~~v~~d~~-G~v---~~i~r~~~~t~~---~l~~~~~~~~~~~-----~~~----~~~~~~v~v~~~v 204 (510) ++.+ |..++++..++. ... .-+||..+|+.+ ++.+.++.+...- ... -.....+.|+.+. T Consensus 144 i~~~~i~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vrv~E~~ 223 (725) T protein:vir:10 144 IRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQIAEFY 223 (725) T ss_pred eeeeecccCHhHcccCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEEEEEEE Confidence 3444 344566665542 222 235677778854 3444555332110 000 0012334444443 Q ss_pred EeecC-----------CC-----------------------------eeEEEEEEe-eCCeeeccc-cccccccCceEEE Q lcl|Aclame:pro 205 QRRKG-----------TA-----------------------------MDYAEMYHE-IDGVRVGET-GRWPIHLCPYIVP 242 (510) Q Consensus 205 ~~~~~-----------~~-----------------------------~~~~sv~~e-~~~~~~~~~-~~y~~~~~P~~~~ 242 (510) ++++. .+ .....||+. +.|..++.. +.|+.+.|||++. T Consensus 224 ~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~ 303 (725) T protein:vir:10 224 EVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPV 303 (725) T ss_pred EEEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEEEE Confidence 32210 00 011233333 345555433 3455556899965 Q ss_pred eee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcc-ee--------cCC Q lcl|Aclame:pro 243 TWN--LAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGD-YV--------PGG 311 (510) Q Consensus 243 Rw~--~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~-~~--------~g~ 311 (510) -.. ...|..|+.|.+....+-.+.+|+.....+.....+.+.++++..+.+-.-......+++. ++ .|. T Consensus 304 ~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~~~~~~~~g~ 383 (725) T protein:vir:10 304 FGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGE 383 (725) T ss_pred EeeeeccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCceeeecccccccCcc Confidence 323 3589999999999999999999999999999998888888888765443222222222222 11 111 Q ss_pred ccccccccC-CCccchHHHHHHHHHHHHHHHHHHhhc--ccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHH------ Q lcl|Aclame:pro 312 AEAVRAYER-GDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAEN------ 382 (510) Q Consensus 312 ~~~v~~~~~-~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E------ 382 (510) .. ..+++. ....-.+...+.++..++.|.+.--.+ ..-+.+..++.--|..|.+.....|..++.+|..- T Consensus 384 ~~-~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~ 462 (725) T protein:vir:10 384 MP-TQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGE 462 (725) T ss_pred cc-cccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 111111 122233456777888888887775332 22222333556678888888888888888776643 Q ss_pred HHHHHHHHHHH------HHhhcCCCC-------CC-c--------cc----eeeEEe-ecHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 383 LQSPLAYVCLS------EVDDALLQG-------LI-T--------KQ----HKPAIE-TGLPALSRSAAVQSMLNASQVI 435 (510) Q Consensus 383 ~l~Pli~r~~~------il~~~~l~~-------~p-~--------~~----~~~~~v-s~l~~l~r~~~~~~~~~~~q~~ 435 (510) .+.-||...|+ |+...+-.. .+ + .+ ..+.+. .+-.+-.|.+.+..+.++++.+ T Consensus 463 ~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~ 542 (725) T protein:vir:10 463 IYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELLGKT 542 (725) T ss_pred HHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHHHhc Confidence 33334433331 111111000 00 0 01 123232 2344555666666666666555 Q ss_pred HhhcChH-h-H---hhcC---CHHHHHHHHHHHcCCCHhhcc--CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----H Q lcl|Aclame:pro 436 AGLAPIA-Q-L---DPRI---SLPKMMDTIWAAFSVDTSQFY--KSADELQAEAEEQRRQAAQAQAAQETLLEGA----S 501 (510) Q Consensus 436 ~~~~~~~-q-~---~~~i---d~d~~~~~~a~~~Gvp~~~i~--~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a----~ 501 (510) ..+.+.. . + .+.. +.+++++.+....+.. ... .++++.++..++++++++++......++..+ . T Consensus 543 ~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~--~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~~~~~qa 620 (725) T protein:vir:10 543 PQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQM--GVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQA 620 (725) T ss_pred cccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhh--ccCCccccchhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHH Confidence 4333321 1 1 1112 3345555555443321 111 1233322211111111111111000000000 0 Q ss_pred HhhcccC----------------CC Q lcl|Aclame:pro 502 DMTNALA----------------GV 510 (510) Q Consensus 502 ~~~~~~a----------------g~ 510 (510) +...+.+ -+ T Consensus 621 e~~ka~aE~~k~~~~a~~~~~~a~~ 645 (725) T protein:vir:10 621 ELAKAQNQTLSLQIDAAKVEAQNQL 645 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000000 00 No 36 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=99.25 E-value=3.3e-10 Score=72.54 Aligned_cols=487 Identities=10% Similarity=0.018 Sum_probs=215.6 Q ss_pred ChhHHHHHHHHHh-ccCchHHHH----HHHHhhcccccCCCCCCccccc-cc-cccchHHHHHHHHHHHHHHhhcCccCc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-DGSVEQRAI----EFAKTTLPYLMVDPMSGSRGVV-EH-DFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~k~~~~~r~~~lk-r~~~~~~w~----e~~~~~~P~~~~~~~~~~~~~~-~~-~~dstg~~a~~~Laa~l~~~ltpp~~~ 73 (510) -++.+......++ .-.+...|+ +=.+|..-. ..++......+. .+ .|+-++. .++.+.+.-- .+++ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~-Qw~~~~~~~l~~q~rp~~N~i~~-~i~~v~g~e~-----~nr~ 76 (725) T protein:vir:92 4 NENRLESILSRFDADWTASDEARREAKNDLFFSRIS-QWDDWLSQYTTLQYRGQFDVVRP-VVRKLVSEMR-----QNPI 76 (725) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCC-CCCHHHHHHHHhcCCCcccchHH-HHHHHHhhHH-----hCCc Confidence 2233333333332 112333443 222332211 111110000000 01 1333332 2222221111 1444 Q ss_pred ccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEE-----EeCCCC----e Q lcl|Aclame:pro 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLY-----RNSDEA----T 144 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~-----~~~~~~----~ 144 (510) =+++.+.++.. .++.+.|+.+-+ .....|+..-+...++.+.+..|.|.+- .+++.. . T Consensus 77 d~~v~P~~~~d----------~~~Ae~l~~~~~---~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~ 143 (725) T protein:vir:92 77 DVLYRPKDGAS----------PDAADVLMGMYR---TDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQV 143 (725) T ss_pred ceEEecCCccH----------HHHHHHHHHHHH---HHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCcee Confidence 45555544321 233444444333 3345789999999999999999988742 222221 2 Q ss_pred EEEEE----eceEEEeeCCCC-ce-e--EEEEEEEecHH---HHhHHhhHHhhcc-----cccC----CCCceEEEEEEE Q lcl|Aclame:pro 145 VVAWS----LRSYAVRRDATG-RW-M--DIVLKQRYKSK---DLDDVYKQDLMRA-----GRNL----SGSGSVDLYTHV 204 (510) Q Consensus 145 ~~~~p----l~~~~v~~d~~G-~v-~--~i~r~~~~t~~---~l~~~~~~~~~~~-----~~~~----~~~~~v~v~~~v 204 (510) ++..| +.++++..++.- .. | -+||..+++.+ ++.+++|.+...- .... ...+.|.|+.+. T Consensus 144 i~~~~i~~~~~~V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~e~~ 223 (725) T protein:vir:92 144 IRREPIHSACSHVIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFY 223 (725) T ss_pred eEEeeccCChhhcccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEEEEEEE Confidence 34444 445666655431 11 1 25566677865 4555666422110 0000 012345555444 Q ss_pred EeecC-----------CC-----------------------------eeEEEEEEe-eCCeeeccc-cccccccCceEEE Q lcl|Aclame:pro 205 QRRKG-----------TA-----------------------------MDYAEMYHE-IDGVRVGET-GRWPIHLCPYIVP 242 (510) Q Consensus 205 ~~~~~-----------~~-----------------------------~~~~sv~~e-~~~~~~~~~-~~y~~~~~P~~~~ 242 (510) ++++. .+ .....||+. +.|..++.. +.|+.+.|||++. T Consensus 224 ~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~ 303 (725) T protein:vir:92 224 EVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPV 303 (725) T ss_pred EEEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeEEE Confidence 33210 00 011233333 345555433 3455566899965 Q ss_pred e--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcc-ee--------cCC Q lcl|Aclame:pro 243 T--WNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGD-YV--------PGG 311 (510) Q Consensus 243 R--w~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~-~~--------~g~ 311 (510) - .....|..|+.|.+....+-.+.+|+.....+.....+.+.++++..+.+-.-......+++. ++ .|. T Consensus 304 ~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~ 383 (725) T protein:vir:92 304 FGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGE 383 (725) T ss_pred EeeeeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCccceeecccccccccc Confidence 3 234689999999999999999999999999999988888888888765442222222222221 11 111 Q ss_pred ccccccccC-CCccchHHHHHHHHHHHHHHHHHHhhc--ccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHH------H Q lcl|Aclame:pro 312 AEAVRAYER-GDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAE------N 382 (510) Q Consensus 312 ~~~v~~~~~-~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~------E 382 (510) . ...++.. ....-.+...+.++..++.|.+.--.+ ..-+.+..++.--|..|.+.....|...+.+|.. + T Consensus 384 ~-~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~ 462 (725) T protein:vir:92 384 M-PTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGE 462 (725) T ss_pred c-cccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 1112211 122234456777788888887775332 2222233356667889999989899988876654 3 Q ss_pred HHHHHHHHHHH------HHhhcCCCC-------CCc---------cc----eeeEEe-ecHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 383 LQSPLAYVCLS------EVDDALLQG-------LIT---------KQ----HKPAIE-TGLPALSRSAAVQSMLNASQVI 435 (510) Q Consensus 383 ~l~Pli~r~~~------il~~~~l~~-------~p~---------~~----~~~~~v-s~l~~l~r~~~~~~~~~~~q~~ 435 (510) .+.-||...++ |+...+-+. .+. .+ ..+.+. .+-.+-.|.+.+..+..+++.+ T Consensus 463 ~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~ 542 (725) T protein:vir:92 463 IYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKT 542 (725) T ss_pred HHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHHHHHHhc Confidence 44444444331 121111100 000 01 222222 3344555666666666655544 Q ss_pred HhhcChH-----hHhhcCC---HHHHHHHHHHHcCCCHhhcc--CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----H Q lcl|Aclame:pro 436 AGLAPIA-----QLDPRIS---LPKMMDTIWAAFSVDTSQFY--KSADELQAEAEEQRRQAAQAQAAQETLLEGA----S 501 (510) Q Consensus 436 ~~~~~~~-----q~~~~id---~d~~~~~~a~~~Gvp~~~i~--~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a----~ 501 (510) ..+.+.. +..+..| .+++.+.+....+.. ... .++++.++..+++++++++++......+..+ . T Consensus 543 ~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~--~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~qa~~~~~qa 620 (725) T protein:vir:92 543 PQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQM--GVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQA 620 (725) T ss_pred ccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhchh--ccCCccchhhhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHH Confidence 4332211 1122223 345555554433221 111 1233322222221111111111000000000 0 Q ss_pred Hhhc----------------ccCCC Q lcl|Aclame:pro 502 DMTN----------------ALAGV 510 (510) Q Consensus 502 ~~~~----------------~~ag~ 510 (510) ++.. +.+-+ T Consensus 621 e~~kaqaE~~k~q~~a~~~~~~a~~ 645 (725) T protein:vir:92 621 ELAKAQNQTLSLQIDAAKVEAQNQL 645 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 00110 No 37 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.24 E-value=3.8e-10 Score=72.26 Aligned_cols=487 Identities=10% Similarity=0.001 Sum_probs=215.4 Q ss_pred ChhHHHHHHHHHh-ccCchHHHHH----HHHhhcccccCCCCCCcccc-ccc-cccchHHHHHHHHHHHHHHhhcCccCc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-DGSVEQRAIE----FAKTTLPYLMVDPMSGSRGV-VEH-DFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~k~~~~~r~~~lk-r~~~~~~w~e----~~~~~~P~~~~~~~~~~~~~-~~~-~~dstg~~a~~~Laa~l~~~ltpp~~~ 73 (510) -++.+....+.++ .-.+...|+. =.+|..-. ..++......+ ..+ .|+=++. .++.+.+.--. +++ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~-Qw~~~~~~~l~~q~rp~~N~i~~-~i~~v~g~~~~-----nr~ 76 (725) T protein:vir:77 4 NENRLESILSRFDADWTASDEARREAKNDLFFSRVS-QWDDWLSQYTTLQYRGQFDVVRP-VVRKLVSEMRQ-----NPI 76 (725) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCC-CCCHHHHHHHHhcCCCccccHHH-HHHHHHhhHHh-----CCc Confidence 3444443333333 1123344432 22232211 11111000000 011 1322222 22222222111 445 Q ss_pred ccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEE-----EeCCCC----e Q lcl|Aclame:pro 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLY-----RNSDEA----T 144 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~-----~~~~~~----~ 144 (510) =+++.+.++.. .++.+.|+.+ +......++..-+...++.+.+..|.|.+- .+++.. . T Consensus 77 d~~v~P~~~~d----------~~~Ae~l~~~---~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~ 143 (725) T protein:vir:77 77 DVLYRPKDGAR----------PDAADVLMGM---YRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQV 143 (725) T ss_pred ceEEecCCccH----------HHHHHHHHHH---HHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCCcee Confidence 55555544321 1234444443 333345789999999999999999988742 223321 2 Q ss_pred EEEEE----eceEEEeeCCCC-ce-e--EEEEEEEecHH---HHhHHhhHHhhcccc-----cC----CCCceEEEEEEE Q lcl|Aclame:pro 145 VVAWS----LRSYAVRRDATG-RW-M--DIVLKQRYKSK---DLDDVYKQDLMRAGR-----NL----SGSGSVDLYTHV 204 (510) Q Consensus 145 ~~~~p----l~~~~v~~d~~G-~v-~--~i~r~~~~t~~---~l~~~~~~~~~~~~~-----~~----~~~~~v~v~~~v 204 (510) ++.+| ..++++..++.- .. | -+||..+++.+ ++.++++.+...... .. ...+.|.|+.+. T Consensus 144 i~~~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~ 223 (725) T protein:vir:77 144 IRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFY 223 (725) T ss_pred eEEeecccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEE Confidence 34444 445666655431 11 1 25677778876 345555543221100 00 012344444444 Q ss_pred EeecC-----------C---------------------C--------eeEEEEEEe-eCCeeeccc-cccccccCceEEE Q lcl|Aclame:pro 205 QRRKG-----------T---------------------A--------MDYAEMYHE-IDGVRVGET-GRWPIHLCPYIVP 242 (510) Q Consensus 205 ~~~~~-----------~---------------------~--------~~~~sv~~e-~~~~~~~~~-~~y~~~~~P~~~~ 242 (510) +++.. . + .....+|+. +.|..++.. +.|+.+.|||++. T Consensus 224 ~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~ 303 (725) T protein:vir:77 224 EVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPV 303 (725) T ss_pred EEEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEE Confidence 33210 0 0 011234433 355555433 4566667999964 Q ss_pred e--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcc-e-----e---cCC Q lcl|Aclame:pro 243 T--WNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGD-Y-----V---PGG 311 (510) Q Consensus 243 R--w~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~-~-----~---~g~ 311 (510) - .....|..|+.|.+....+-.+.+|+.....+.....+.+.++++..+-+-........+++. + + .|. T Consensus 304 ~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~ 383 (725) T protein:vir:77 304 FGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENSGD 383 (725) T ss_pred eeeeeccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHhccCCceecccccccCCCc Confidence 3 335789999999999999999999999999999988888888887765432222222222221 0 1 121 Q ss_pred ccccccccCCCccch-HHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHH------H Q lcl|Aclame:pro 312 AEAVRAYERGDYNKM-AAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAE------N 382 (510) Q Consensus 312 ~~~v~~~~~~~~~~~-~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~------E 382 (510) . ..+++.......+ +...+.++...+.|.++--. +. .-..+..++.--|..|.+.....+...+.+|.. + T Consensus 384 ~-~~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~ 462 (725) T protein:vir:77 384 L-PTQPLAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGE 462 (725) T ss_pred c-cccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 1111111111222 34566777777777766522 21 222222345667888888888888877776543 3 Q ss_pred HHHHHHHHHH------HHHhhcCCC-----CCC----c-------cc----eeeEEe-ecHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 383 LQSPLAYVCL------SEVDDALLQ-----GLI----T-------KQ----HKPAIE-TGLPALSRSAAVQSMLNASQVI 435 (510) Q Consensus 383 ~l~Pli~r~~------~il~~~~l~-----~~p----~-------~~----~~~~~v-s~l~~l~r~~~~~~~~~~~q~~ 435 (510) .+.-||...| .|+...+-+ ..+ . .+ ..+.+. .+-.+-.|.+.+..+..+++.+ T Consensus 463 ~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~ 542 (725) T protein:vir:77 463 IYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKT 542 (725) T ss_pred HHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHHHHhc Confidence 4444444433 122111110 000 0 01 223332 2344555666666666666555 Q ss_pred HhhcCh-H----hHhhcCCH---HHHHHHHHHHcCCCHhhcc--CCHHHHHHHHHHHHHHHHHHHHHHHHHH-------H Q lcl|Aclame:pro 436 AGLAPI-A----QLDPRISL---PKMMDTIWAAFSVDTSQFY--KSADELQAEAEEQRRQAAQAQAAQETLL-------E 498 (510) Q Consensus 436 ~~~~~~-~----q~~~~id~---d~~~~~~a~~~Gvp~~~i~--~s~ee~~~~~~~~~qqa~~~~~a~~~~~-------~ 498 (510) ..+.+. . +..+..|. +++.+.+...... .... .++++-++..+++++++.+++.....++ + T Consensus 543 ~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~--~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~~~~qa 620 (725) T protein:vir:77 543 PQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQ--MGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQA 620 (725) T ss_pred cccchhHHHHHHHhhccccchHHHHHHHHHHhhhhh--hhccCCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH Confidence 433322 1 11222333 4455544443322 1122 2232222111111111111110000000 0 Q ss_pred HHHHhh-------------cccCCC Q lcl|Aclame:pro 499 GASDMT-------------NALAGV 510 (510) Q Consensus 499 ~a~~~~-------------~~~ag~ 510 (510) .+.++. .++|.+ T Consensus 621 ~~~kaq~e~~k~q~~a~~~~~~a~~ 645 (725) T protein:vir:77 621 ELAKAQNQTLSLQIDAAKVEAQNQL 645 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000 011111 No 38 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=99.21 E-value=2.1e-10 Score=73.59 Aligned_cols=483 Identities=12% Similarity=0.036 Sum_probs=209.9 Q ss_pred ChhHHHHHHHHHh-----ccCchHHHHHHH----Hhh-cccccCCCCCCccc-----ccccc---ccchHHHHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-----DGSVEQRAIEFA----KTT-LPYLMVDPMSGSRG-----VVEHD---FQSAGALLVNNLAAK 62 (510) Q Consensus 1 ~k~~~~~r~~~lk-----r~~~~~~w~e~~----~~~-~P~~~~~~~~~~~~-----~~~~~---~dstg~~a~~~Laa~ 62 (510) |-+++.+++.++. -..|.+.|+.-+ +|. .+....+......+ +..++ |+-++.. + ++ T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~-v----~~ 75 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTE-L----NR 75 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHH-H----HH Confidence 8888777776663 223555665332 111 22211111110000 00111 3333322 2 22 Q ss_pred HHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE---- Q lcl|Aclame:pro 63 LARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR---- 138 (510) Q Consensus 63 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~---- 138 (510) +.+.--- +++=+++.+.+..- ..++.+.|+.+ +......++...+...++.+.+.+|.|+.-+ T Consensus 76 v~g~~~~-nr~d~~v~P~~~~~---------d~~~Ae~l~~~---~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~ 142 (720) T protein:vir:35 76 IISEYRH-NRITVKFRPGDKTA---------SEALANKLNGL---FRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNL 142 (720) T ss_pred HHhHHHh-CCCceEEEcCCCcc---------hHHHHHHHHHH---HHHHHHhcCchHHHhHHHHHhhhccceeEEeeecc Confidence 3332222 44555555543210 01233344333 3334557888889999999999999887633 Q ss_pred -eCCC-C----e--EEE--EEeceEEEeeCCCC-ce---eEEEEEEEecHHHHhHHhhHHhhccc--------ccCCCCc Q lcl|Aclame:pro 139 -NSDE-A----T--VVA--WSLRSYAVRRDATG-RW---MDIVLKQRYKSKDLDDVYKQDLMRAG--------RNLSGSG 196 (510) Q Consensus 139 -~~~~-~----~--~~~--~pl~~~~v~~d~~G-~v---~~i~r~~~~t~~~l~~~~~~~~~~~~--------~~~~~~~ 196 (510) ++++ . . ++. .|..++++..++.- .. .-+++..+|+.+++...||++..... .+..... T Consensus 143 ~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~ 222 (720) T protein:vir:35 143 VNALDPMDERQRICLEPIYDPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVD 222 (720) T ss_pred cccCCCCcccceeeEecccCchhheeecccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccCCC Confidence 1111 1 1 222 24456777655431 11 12567777899999999986532100 0001122 Q ss_pred eEEEEEEEEee-----------cC---------C-------------------C-eeEEEEEE-eeCCeeecc-cccccc Q lcl|Aclame:pro 197 SVDLYTHVQRR-----------KG---------T-------------------A-MDYAEMYH-EIDGVRVGE-TGRWPI 234 (510) Q Consensus 197 ~v~v~~~v~~~-----------~~---------~-------------------~-~~~~sv~~-e~~~~~~~~-~~~y~~ 234 (510) .|.|.++.+++ +. . . ...+-||+ -+.|..+.. .+.++. T Consensus 223 ~v~i~E~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~ 302 (720) T protein:vir:35 223 VVYIAKYYEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPG 302 (720) T ss_pred ceEEEEeeEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCC Confidence 33343332211 00 0 0 01122333 235544432 244555 Q ss_pred ccCceEEEeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcC---------- Q lcl|Aclame:pro 235 HLCPYIVPTWN--LAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDA---------- 302 (510) Q Consensus 235 ~~~P~~~~Rw~--~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~---------- 302 (510) +.|||++.-.. ..+|..+..|.+....+-.+.+|+....++..+...-..+....++++-....-... T Consensus 303 ~~fP~vP~~g~r~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~ 382 (720) T protein:vir:35 303 EHIPLIPVYGKRWFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQDTGSIPIVGKSQIKTLEKYWANRNKNRPAFLP 382 (720) T ss_pred CccceEEEEeeeeccCCCcccceeeecchhHHHHHHHHHHHHHHHHHcCCccccccCcchHHHHHHHhhccccccccccc Confidence 66888865322 336788889999999999999999888888887655544444433332111111111 Q ss_pred ------CCcceec--CCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccCCCCCCCCHHHHHHHHHHHHHHh Q lcl|Aclame:pro 303 ------EMGDYVP--GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRDAERVTAEEVRITAEEAENTL 372 (510) Q Consensus 303 ------~~G~~~~--g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~vTAtEi~~r~~E~~~~L 372 (510) .+|.++. +......+.++ .+.....++.-...|.++--.+ ++-+.+ +.+.--|..|.+.-...+ T Consensus 383 ~~~~~~~~G~~~~~~~~~~~~~~~~~-----~~~~~~llq~~~~~i~~vsGi~~~~lG~~s-n~SG~Ai~~rq~qg~~~~ 456 (720) T protein:vir:35 383 LNEIVDKQGNIIAPPTPVGYTQPQPL-----NQAMAALLQQTGADIQEVTGSSQAMQPMPS-NIAKETVNHLMHRSDMSS 456 (720) T ss_pred cccccccCcccccCCCcccccCCCCC-----chHHHHHHHHHHHHHHHHhCCChHHcCccc-chHHHHHHHHHHHHHHHH Confidence 1233221 11111122222 2223444445555555553221 111222 246667888888888888 Q ss_pred hhhHHHHH------HHHHHHHHHHHHH------HHhhcCCC-----------CCCcc-----ce---e--eEEe-ecHHH Q lcl|Aclame:pro 373 GGTYSLLA------ENLQSPLAYVCLS------EVDDALLQ-----------GLITK-----QH---K--PAIE-TGLPA 418 (510) Q Consensus 373 Gpv~~rl~------~E~l~Pli~r~~~------il~~~~l~-----------~~p~~-----~~---~--~~~v-s~l~~ 418 (510) ...+..+. -+.+.-||...+. |+...+-. +.++. ++ + +.+. .+-.+ T Consensus 457 ~~~~Dnl~~~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~ 536 (720) T protein:vir:35 457 FIYLDNMAKSLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYT 536 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcc Confidence 88887755 3445555555442 22211111 11111 11 1 2222 23334 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCh-----HhHhhcCCH---HHHHHHHHHHcCCCHhhccC--CHHHHHHHH---HHHHHH Q lcl|Aclame:pro 419 LSRSAAVQSMLNASQVIAGLAPI-----AQLDPRISL---PKMMDTIWAAFSVDTSQFYK--SADELQAEA---EEQRRQ 485 (510) Q Consensus 419 l~r~~~~~~~~~~~q~~~~~~~~-----~q~~~~id~---d~~~~~~a~~~Gvp~~~i~~--s~ee~~~~~---~~~~qq 485 (510) -.|.+....+.++++.+..-.+. +.+....|+ ++++..+-..+. +..... .+++.++.. ++.+++ T Consensus 537 s~req~~~~m~qll~~~~p~~~~~~~~~~~ile~~d~p~~~e~~erirk~~~--~~~~~~~~~~e~qq~~a~~qq~~qq~ 614 (720) T protein:vir:35 537 ARRDATVSVLTNLLAGMLPQDPMRQVLQGIILDNMEGEGLDEFKEYNRKQLL--TQGVVKPRNTEEEQMVAQMIQQAQQP 614 (720) T ss_pred cHHHHHHHHHHHHHHhcCCCchhHHHHHHHHHHhcCchhHHHHHHHHHhhcc--hhcccCccChhHHHHHHHHHHHHHhH Confidence 45666666555554432211011 223333443 455555544432 111221 122222111 111111 Q ss_pred HHHHHHHHHHH-HHHHHHhhcccCCC Q lcl|Aclame:pro 486 AAQAQAAQETL-LEGASDMTNALAGV 510 (510) Q Consensus 486 a~~~~~a~~~~-~~~a~~~~~~~ag~ 510 (510) +.+++.+++.+ +.+| +...+++.. T Consensus 615 ~~e~~~aqa~l~qaqa-e~~kaqa~~ 639 (720) T protein:vir:35 615 NAELVAAQGVLMQGQA-EVQKAKNEE 639 (720) T ss_pred hHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 11111111111 0000 000000000 No 39 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.17 E-value=8.9e-10 Score=70.21 Aligned_cols=484 Identities=12% Similarity=0.027 Sum_probs=218.5 Q ss_pred ChhHHHHHHHHHhccCchHHHH----HHHHhhcccccCCCCCCcc---cccccc-ccchHHHHHHHHHHHHHHhhcCccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAI----EFAKTTLPYLMVDPMSGSR---GVVEHD-FQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~----e~~~~~~P~~~~~~~~~~~---~~~~~~-~dstg~~a~~~Laa~l~~~ltpp~~ 72 (510) +-..+..+|..- ..+.+.|+ +-.+|..-. ..++..... ...+.+ |+=++. .++...+. -- .++ T Consensus 20 ~~~~~l~~~~~~--~~~~~~~r~~a~~d~~fy~G~-Qw~~~~~~~l~~~g~p~~~~N~i~~-~v~~v~g~----~~-~nr 90 (714) T protein:vir:10 20 FSQRQLLSLCSD--IDSQPLWRDAANKACAYYDGD-QLAPEVIQVLKDRGQPMTIHNLIAP-TVDGVLGM----EA-KTR 90 (714) T ss_pred hhHHHHHHHHHH--HhhhHHHHHHHHHHHHhhcCC-CCCHHHHHHHHhcCCCcEEeccHHH-HHHHHHHH----HH-hCC Confidence 333333444322 12334554 333333211 111100000 000111 222222 22222222 11 134 Q ss_pred cccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC--C--eEE Q lcl|Aclame:pro 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDE--A--TVV 146 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l--~~~~~~--~--~~~ 146 (510) +=+++.+.+...+ ..++-+ .++..+......++...+...++.+.+..|-+.+ +++.+. + +++ T Consensus 91 ~~~~v~pr~~~~~--------~~~~Ae---~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d~d~~~~~i~i~ 159 (714) T protein:vir:10 91 TDLIVMSDDPNDE--------TEKLAE---AINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSEPFGPEFKVS 159 (714) T ss_pred cceEEecCCCChh--------hHHHHH---HHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeeeccCCCCCCeEEE Confidence 4445555332110 011222 3344455556788899999999999998887764 555442 2 356 Q ss_pred EEEeceEEEeeCCC-Ccee---EEEEEEEecHHHHhHHhhHHh--hc-cc-----------------------------c Q lcl|Aclame:pro 147 AWSLRSYAVRRDAT-GRWM---DIVLKQRYKSKDLDDVYKQDL--MR-AG-----------------------------R 190 (510) Q Consensus 147 ~~pl~~~~v~~d~~-G~v~---~i~r~~~~t~~~l~~~~~~~~--~~-~~-----------------------------~ 190 (510) .+|..++++..++. .... -++++.+++.+++...||... .. .. . T Consensus 160 ~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (714) T protein:vir:10 160 TVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWD 239 (714) T ss_pred ecChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhcccccccchhhcccc Confidence 67778888887653 2222 367888999999988887521 10 00 0 Q ss_pred ------cCCCCceEEEEEEEEeec----------CCCeeE-----------------------EEEEE-eeCCeeecc-- Q lcl|Aclame:pro 191 ------NLSGSGSVDLYTHVQRRK----------GTAMDY-----------------------AEMYH-EIDGVRVGE-- 228 (510) Q Consensus 191 ------~~~~~~~v~v~~~v~~~~----------~~~~~~-----------------------~sv~~-e~~~~~~~~-- 228 (510) ......+|.|+.|.++.. +...-| ..||+ -+.|.+++. T Consensus 240 ~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~ 319 (714) T protein:vir:10 240 RQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDR 319 (714) T ss_pred cccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccceeeEEEEEEecchhhhcC Confidence 001124577777654421 100000 01111 123334433 Q ss_pred ccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccch-hhhh---cC Q lcl|Aclame:pro 229 TGRWPIHLCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV-DDYQ---DA 302 (510) Q Consensus 229 ~~~y~~~~~P~~~~Rw~~--~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~-~~~~---~~ 302 (510) .+.|+...|||++.-... ..|..| |.+....+-.+.+|+.....+.+ +..+-. ++.++++..- +.+. .. T Consensus 320 ~~p~p~~~fp~vP~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~-~~~~gav~~~d~~~~e~~~r 394 (714) T protein:vir:10 320 PCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRV-IMDEDATQLSDNDLMEQLER 394 (714) T ss_pred CCCCCCCceeeEEecceeeeccCccc--eehhhhhhHHHHHHHHHHHHHHH--HhCCce-eeccccccccHHHHHHhccC Confidence 346777789998654333 445555 68888999999999866665543 345544 4445554332 2232 12 Q ss_pred CCccee--cC---CccccccccCCCcc-chHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 303 EMGDYV--PG---GAEAVRAYERGDYN-KMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENTLGG 374 (510) Q Consensus 303 ~~G~~~--~g---~~~~v~~~~~~~~~-~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r~~E~~~~LGp 374 (510) ++|.+. |+ +.+....++..... -.+.....++...+.|.+.--. +. .-..+...+..-|..|.+.....|+. T Consensus 395 p~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAI~~r~~qg~~~l~~ 474 (714) T protein:vir:10 395 PDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAE 474 (714) T ss_pred CCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcchhHHHHHHHHHHHHHHHHHH Confidence 233332 22 11212222222222 2345566667777777766421 11 11233345666799999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHhh----cCCCCCC-----------------------ccc-----eeeEEe-ecHHHHHH Q lcl|Aclame:pro 375 TYSLLAENLQSPLAYVCLSEVDD----ALLQGLI-----------------------TKQ-----HKPAIE-TGLPALSR 421 (510) Q Consensus 375 v~~rl~~E~l~Pli~r~~~il~~----~~l~~~p-----------------------~~~-----~~~~~v-s~l~~l~r 421 (510) ++.++..- ..=+.+.++.++.. ..+.-+. -.+ +.+.+. .+-.+-.| T Consensus 475 ~~dnl~~~-~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i~~~p~~~s~r 553 (714) T protein:vir:10 475 INDNYQFA-CQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFK 553 (714) T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccceeeeEEEEEeeccCcHHHH Confidence 88887763 22233333333321 1111000 001 122222 23345567 Q ss_pred HHHHHHHHHHHHHHHhhcC---h---HhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHH--HHHHHHHHHHHHHHHH-- Q lcl|Aclame:pro 422 SAAVQSMLNASQVIAGLAP---I---AQLDPRISLPKMMDTIWAAFSVDTSQFYKSADEL--QAEAEEQRRQAAQAQA-- 491 (510) Q Consensus 422 ~~~~~~~~~~~q~~~~~~~---~---~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~--~~~~~~~~qqa~~~~~-- 491 (510) .+.++.+.++++.+.-..+ + .+.++--+.+++++.+-+.+|.+...=-.++++- ++.+++.++++++.++ T Consensus 554 ~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~p~~~ei~~~ir~~~~~~~~~~~~~~e~q~~q~~~~~~~~~q~~l~~~e 633 (714) T protein:vir:10 554 AQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQMRE 633 (714) T ss_pred HHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCcCHHHHHHHHHHHcCCCCCccccCcchhHHHHHHHHHHHHHHHHHHHH Confidence 7777777766654321111 1 1222333678899999999988531000122221 1111111110000000 Q ss_pred HHH---HHHHHHHHh------hcccCC-----------C Q lcl|Aclame:pro 492 AQE---TLLEGASDM------TNALAG-----------V 510 (510) Q Consensus 492 a~~---~~~~~a~~~------~~~~ag-----------~ 510 (510) .++ ...+.+.+. ..+.|. + T Consensus 634 ~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~q~~~ 672 (714) T protein:vir:10 634 MAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYV 672 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000 000000000 000000 0 No 40 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=99.15 E-value=1.2e-09 Score=69.48 Aligned_cols=488 Identities=12% Similarity=0.002 Sum_probs=211.7 Q ss_pred ChhHHHHHHHHHh-----ccCchHHHHHHH----Hhhc-ccccCCCCCCccc-------cccccccchHHHHHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-----DGSVEQRAIEFA----KTTL-PYLMVDPMSGSRG-------VVEHDFQSAGALLVNNLAAKL 63 (510) Q Consensus 1 ~k~~~~~r~~~lk-----r~~~~~~w~e~~----~~~~-P~~~~~~~~~~~~-------~~~~~~dstg~~a~~~Laa~l 63 (510) |-++....+.+++ -..|...|+.-+ +|.. +....++.....- ..+...-..-.-.++...+.. T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRIISEY 80 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHHhhHH Confidence 8777766666653 123434443222 3321 2211111111000 011122222222233322222 Q ss_pred HHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE----- Q lcl|Aclame:pro 64 ARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR----- 138 (510) Q Consensus 64 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~----- 138 (510) -. +++=+++.+.++.. ..++.+.|+ ..+......++...+...++.+.+.+|.+.+-+ T Consensus 81 ~~-----nr~~~~v~P~~~~~---------d~~~Ae~l~---~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~ 143 (706) T protein:vir:10 81 RN-----NRISVKFRPGDNAA---------SEELANKLN---GLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFV 143 (706) T ss_pred Hh-----CCCceEEecCCCCc---------hHHHHHHHH---HHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccc Confidence 11 33434544432111 012233333 334444567899999999999999999887422 Q ss_pred eCC---CC--eE--EE--EEeceEEEeeCC---CCc-eeEEEEEEEecHHHHhHHhhHHh---hcccc--------cC-- Q lcl|Aclame:pro 139 NSD---EA--TV--VA--WSLRSYAVRRDA---TGR-WMDIVLKQRYKSKDLDDVYKQDL---MRAGR--------NL-- 192 (510) Q Consensus 139 ~~~---~~--~~--~~--~pl~~~~v~~d~---~G~-v~~i~r~~~~t~~~l~~~~~~~~---~~~~~--------~~-- 192 (510) +++ .. .+ .. .|+.++++..++ ++. -.-++|...|+.+++...|++.. .+... .. T Consensus 144 ~~~d~~~~~~~i~i~~v~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d~ 223 (706) T protein:vir:10 144 NEYDPMDERQRIAVEPIYDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPDV 223 (706) T ss_pred cccCCCCCCccceeeeeccchhceecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCCc Confidence 211 11 12 21 366777777653 332 12477888999999999988532 11000 00 Q ss_pred ----CCCc--eEEEEEEEEeecC-----------------------------CCeeEEEEEE-eeCCeeec-cccccccc Q lcl|Aclame:pro 193 ----SGSG--SVDLYTHVQRRKG-----------------------------TAMDYAEMYH-EIDGVRVG-ETGRWPIH 235 (510) Q Consensus 193 ----~~~~--~v~v~~~v~~~~~-----------------------------~~~~~~sv~~-e~~~~~~~-~~~~y~~~ 235 (510) ..++ ...+..+.+++.. +.++.+.+|+ .+.|..+. ..+.|+.+ T Consensus 224 ~~~~eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~ 303 (706) T protein:vir:10 224 VYIAKYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGE 303 (706) T ss_pred ceecccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCCC Confidence 0011 1111111222210 1112223333 34454444 33667778 Q ss_pred cCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCcc-------chhh-----hh- Q lcl|Aclame:pro 236 LCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGA-------VVDD-----YQ- 300 (510) Q Consensus 236 ~~P~~~~Rw~~--~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~-------~~~~-----~~- 300 (510) .|||++.-..+ .++..+..|.+....+-.+.+|+....++.....+.+.++.+.++.+- ++.. +. T Consensus 304 ~~P~vP~~g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~l~~ 383 (706) T protein:vir:10 304 HIPLIPVYGKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGLEQHWEGRNRKRPAFLPL 383 (706) T ss_pred ccceEEEeeccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHHHHHhhhcccccccchhc Confidence 89999653322 256677788999999999999998888887766555544444322111 0100 00 Q ss_pred ---cCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cccCCCCCCCCHHHHHHHHHHHHHHhhhhH Q lcl|Aclame:pro 301 ---DAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVRITAEEAENTLGGTY 376 (510) Q Consensus 301 ---~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~ 376 (510) ...+|.+++... ....+ ....-.+...+.++...+.|.++--. +.+.....+++.--|..|.+.....+...+ T Consensus 384 ~~~~~~~g~i~~~~~-~~~~~--~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~~~ 460 (706) T protein:vir:10 384 RTVTDKTGNVVAPAN-VAGYT--QAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPSNVARETVNSLLNRSDMASFIYL 460 (706) T ss_pred ccccCCCCccccccc-ccccC--CCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccchHHHHHHHHHHHHHHHHHHHH Confidence 011233222111 00111 11111223455566666666665422 211111223577778899888888888888 Q ss_pred HHHH------HHHHHHHHHHHH------HHHhhcCCCC---C------C--c-----cce-----eeEEe-ecHHHHHHH Q lcl|Aclame:pro 377 SLLA------ENLQSPLAYVCL------SEVDDALLQG---L------I--T-----KQH-----KPAIE-TGLPALSRS 422 (510) Q Consensus 377 ~rl~------~E~l~Pli~r~~------~il~~~~l~~---~------p--~-----~~~-----~~~~v-s~l~~l~r~ 422 (510) .++. -+.+.-||...+ .|+...+-.. + + + .++ .+.+. .+-.+-.|. T Consensus 461 Dnl~~~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~ 540 (706) T protein:vir:10 461 DNMAKSLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRD 540 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHHH Confidence 6544 344444444333 2222111110 0 0 0 011 12222 234455677 Q ss_pred HHHHHHHHHHHHHHhhcCh-Hh----HhhcC---CHHHHHHHHHHHcCCCHhhccCCH-HHHHHHH-HHHHHHHHHHHHH Q lcl|Aclame:pro 423 AAVQSMLNASQVIAGLAPI-AQ----LDPRI---SLPKMMDTIWAAFSVDTSQFYKSA-DELQAEA-EEQRRQAAQAQAA 492 (510) Q Consensus 423 ~~~~~~~~~~q~~~~~~~~-~q----~~~~i---d~d~~~~~~a~~~Gvp~~~i~~s~-ee~~~~~-~~~~qqa~~~~~a 492 (510) +..+.+..+++.+....++ ++ +.+.. +.++++..+-..++.. ...+.. .+.++.. ++++.|+++++.+ T Consensus 541 ~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~~q--~~~~~~~~~eq~~~~q~qq~q~~q~~~~ 618 (706) T protein:vir:10 541 ATVNALTQLLQGMLPQDPMRPALMGIIIDNMEGEGLDDFKAFNRRQLLTQ--GIVKPRNQQEQAIVQQAQQAQATQPDPN 618 (706) T ss_pred HHHHHHHHHHHhcCCcchhhHHHHHHHHhhcCccchHHHHHHHHHhhccc--CCccccchhHHHHHHHHHHHHHHHHHHH Confidence 7777777666644322111 22 22223 4455666665555432 122211 1112111 1111111111111 Q ss_pred HHHHHHHHHHhh----c-----ccCCC Q lcl|Aclame:pro 493 QETLLEGASDMT----N-----ALAGV 510 (510) Q Consensus 493 ~~~~~~~a~~~~----~-----~~ag~ 510 (510) ...+++++.+.. . .+.++ T Consensus 619 ~~~~~aq~~~~qA~~~k~~a~~~q~~~ 645 (706) T protein:vir:10 619 MLLAQAQMVVAQAEAQKSQNETVQTQI 645 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111111100 0 00011 No 41 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=99.14 E-value=1.3e-09 Score=69.29 Aligned_cols=483 Identities=13% Similarity=0.059 Sum_probs=220.1 Q ss_pred ChhHHHHHHHHHh-c---c-Cch----HHHHHHHHh-hcccccCCCCCCccc----c---cccc-ccchHHHHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-D---G-SVE----QRAIEFAKT-TLPYLMVDPMSGSRG----V---VEHD-FQSAGALLVNNLAAK 62 (510) Q Consensus 1 ~k~~~~~r~~~lk-r---~-~~~----~~w~e~~~~-~~P~~~~~~~~~~~~----~---~~~~-~dstg~~a~~~Laa~ 62 (510) |-+++.+..+++. | . .|. ..|++=.+| ..+....++.....- . ++.. |+=++.. ++.. T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~-i~~v--- 76 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATE-LNRI--- 76 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHH-HHHH--- Confidence 6666665555542 1 0 122 223222221 111111111100000 0 0111 3333322 2222 Q ss_pred HHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE-----E Q lcl|Aclame:pro 63 LARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL-----Y 137 (510) Q Consensus 63 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l-----~ 137 (510) .+.- =.+++=+++.+.+... + .++.+.|+. .+......++...+...+|.+.+..|.|++ | T Consensus 77 -~g~e-~~nr~d~~v~p~~~~~--------d-~~~Ae~l~~---l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~ 142 (708) T protein:vir:17 77 -IAEY-RNNRITVKFRPGDREA--------S-EELANKLNG---LFRADYEETDGGEACDNAFDDAATGGFGCFRLTSML 142 (708) T ss_pred -HhhH-hhCCcceEEecCCCcc--------h-HHHHHHHHH---HHHHHHHhcCchhHHhHHHHHhhhcccceeeeeecc Confidence 1111 1134444555543211 1 123334433 333445678899999999999999998875 2 Q ss_pred EeCC-------CCeEEE--EEeceEEEeeCCCC-c-ee--EEEEEEEecHHHHhHHhhHHhhc-----cccc-----CCC Q lcl|Aclame:pro 138 RNSD-------EATVVA--WSLRSYAVRRDATG-R-WM--DIVLKQRYKSKDLDDVYKQDLMR-----AGRN-----LSG 194 (510) Q Consensus 138 ~~~~-------~~~~~~--~pl~~~~v~~d~~G-~-v~--~i~r~~~~t~~~l~~~~~~~~~~-----~~~~-----~~~ 194 (510) ..++ +..+++ .|..++++..++.- . -| -+||+.+++.+++...||+.... .... .++ T Consensus 143 ~~e~d~~~~~~~i~i~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~ 222 (708) T protein:vir:17 143 VNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDA 222 (708) T ss_pred cccCCCCCCccccceEeeccchhheecCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCC Confidence 2221 112333 25567777766421 1 22 26888899999999999854211 0000 011 Q ss_pred CceEEEEEEEEee-----------c--C--------------------C-----C--eeEEEEEE-eeCCeeecc-cccc Q lcl|Aclame:pro 195 SGSVDLYTHVQRR-----------K--G--------------------T-----A--MDYAEMYH-EIDGVRVGE-TGRW 232 (510) Q Consensus 195 ~~~v~v~~~v~~~-----------~--~--------------------~-----~--~~~~sv~~-e~~~~~~~~-~~~y 232 (510) +.|-|..+.+++ + + + . ...+.||+ -+.|..+.. .+.+ T Consensus 223 -d~vrv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~~~~~ 301 (708) T protein:vir:17 223 -DVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEKPRRI 301 (708) T ss_pred -CeEEEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccCCCCC Confidence 222222221111 0 0 0 0 01112332 234554543 3446 Q ss_pred ccccCceEEE---eeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhh---------- Q lcl|Aclame:pro 233 PIHLCPYIVP---TWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDY---------- 299 (510) Q Consensus 233 ~~~~~P~~~~---Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~---------- 299 (510) +.+.|||++. ||. .+|...-.|.+..+.+-.+.+|+.....+..+.++.+-+++++.+.+-....- T Consensus 302 p~~~fP~vP~~g~r~~-~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~ 380 (708) T protein:vir:17 302 PGEHIPLIPVYGKRWF-IDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPA 380 (708) T ss_pred CCCccceEEEeccccc-ccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhh Confidence 6677898865 444 35666567999999999999999999999999888888888876432111100 Q ss_pred ------hcCCCcceecCCc--cccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cccCCCCCCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 300 ------QDAEMGDYVPGGA--EAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVRITAEEAEN 370 (510) Q Consensus 300 ------~~~~~G~~~~g~~--~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~~~vTAtEi~~r~~E~~~ 370 (510) ...+.|.+++|.. .-+++.+ -.+...+.++...+.|.++--. +..+....+++.--|..|.+.... T Consensus 381 ~~~~~~~~~~~g~v~~~a~~~~~~~~~~-----~~~~~~~llq~~~~~i~~~tGi~d~~~G~~sn~SG~Ai~~rq~qg~~ 455 (708) T protein:vir:17 381 FLPLREVRDKYGNIIAGATPAGYTQPAV-----MNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADM 455 (708) T ss_pred hhhhhccCCcccccccccCCcccCCCcc-----ccHHHHHHHHHHHHHHHHhcCCChHHccCccchHHHHHHHHHHHHHH Confidence 0012233333221 1122211 1233455556666666555322 212211234566678888888888 Q ss_pred HhhhhHHHHH------HHHHHHHHHHHHH------HHhhcCC-----------CCCCcc-----cee-----eEEe-ecH Q lcl|Aclame:pro 371 TLGGTYSLLA------ENLQSPLAYVCLS------EVDDALL-----------QGLITK-----QHK-----PAIE-TGL 416 (510) Q Consensus 371 ~LGpv~~rl~------~E~l~Pli~r~~~------il~~~~l-----------~~~p~~-----~~~-----~~~v-s~l 416 (510) .++..+.++. -+.+.-||...|. |+...+- .+.++. ++. +.+. .+- T Consensus 456 ~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~ 535 (708) T protein:vir:17 456 ASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPS 535 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccC Confidence 8888877765 5666666666552 2221110 111111 221 2222 233 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcCh-Hh----H---hhcCCHHHHHHHHHHHcCCCHhhccC--CHHHHHHHHHHHHHHH Q lcl|Aclame:pro 417 PALSRSAAVQSMLNASQVIAGLAPI-AQ----L---DPRISLPKMMDTIWAAFSVDTSQFYK--SADELQAEAEEQRRQA 486 (510) Q Consensus 417 ~~l~r~~~~~~~~~~~q~~~~~~~~-~q----~---~~~id~d~~~~~~a~~~Gvp~~~i~~--s~ee~~~~~~~~~qqa 486 (510) .+-.|.+..+.+..+++.+....+. +. + +|--+.++++..+...++... ... ++++.++..++++.++ T Consensus 536 ~~t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~--~~~~~~~e~~q~~~q~qq~~q 613 (708) T protein:vir:17 536 YTARRDATVSVLTNVLSSMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISG--IAKPRNEKEQQIVQQAQMAAQ 613 (708) T ss_pred chhHHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhccc--cccCcchhhHHHHHHHHHHHH Confidence 4456666666666665544322111 11 2 222356778888877765532 222 2232222111111111 Q ss_pred ----HHHHHHHHH---HHHHHHHhhc--ccCCC Q lcl|Aclame:pro 487 ----AQAQAAQET---LLEGASDMTN--ALAGV 510 (510) Q Consensus 487 ----~~~~~a~~~---~~~~a~~~~~--~~ag~ 510 (510) +++..+++. .++.+.+... .++.+ T Consensus 614 ~q~~~~~~eaqa~~~~~qAe~~ka~aea~~~q~ 646 (708) T protein:vir:17 614 SQPNPEMVLAQAQMVAAQAEAQKATNETAQTQI 646 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111110 0111111110 00000 No 42 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.14 E-value=1.3e-09 Score=69.23 Aligned_cols=484 Identities=13% Similarity=0.039 Sum_probs=220.1 Q ss_pred ChhHHHHHHHHHhcc-CchHHHH----HHHHhhcccccCCCCCCcc---cccccc-ccchHHHHHHHHHHHHHHhhcCcc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDG-SVEQRAI----EFAKTTLPYLMVDPMSGSR---GVVEHD-FQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~k~~~~~r~~~lkr~-~~~~~w~----e~~~~~~P~~~~~~~~~~~---~~~~~~-~dstg~~a~~~Laa~l~~~ltpp~ 71 (510) ..+...+.+..+++. ...+.|+ +-.+|..-. ..++..... ...+.+ |+=++. .++...+..- .+ T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~-Qw~~~~~~~l~~~g~p~~~~N~i~~-~v~~v~g~~~-----~n 89 (714) T protein:vir:32 17 TPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGD-QLPPEVLQVLKDRGQPMTIHNLIAP-TVDGVLGMEA-----KT 89 (714) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCC-CCCHHHHHHHHhcCCCcEEeccHHH-HHHHHHhHHH-----hC Confidence 222233333433321 2233454 444443311 111110000 001111 333322 2222222221 13 Q ss_pred CcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE--EEEeCCCC----eE Q lcl|Aclame:pro 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--LYRNSDEA----TV 145 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~--l~~~~~~~----~~ 145 (510) ++=+++.+.+... ...++.+.| +..+......+++..+...++.+.+..|-+. +|.+.|.. ++ T Consensus 90 r~~~~v~p~~~~~--------~~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i 158 (714) T protein:vir:32 90 RTDLVVMSDEPDD--------ETEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKV 158 (714) T ss_pred CcceEEecCCCCc--------hhHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEE Confidence 4444555432111 011233333 3344445567889999999999999888766 45554322 46 Q ss_pred EEEEeceEEEeeCCCC-cee---EEEEEEEecHHHHhHHhhHHh--hc-c-----------------------c------ Q lcl|Aclame:pro 146 VAWSLRSYAVRRDATG-RWM---DIVLKQRYKSKDLDDVYKQDL--MR-A-----------------------G------ 189 (510) Q Consensus 146 ~~~pl~~~~v~~d~~G-~v~---~i~r~~~~t~~~l~~~~~~~~--~~-~-----------------------~------ 189 (510) +.+|..++++..++.. .+. -++++.++|.+++...||+.. .. . + T Consensus 159 ~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (714) T protein:vir:32 159 STVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSW 238 (714) T ss_pred EecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccc Confidence 7778889998876532 222 478888999999988887521 00 0 0 Q ss_pred cc------CCCCceEEEEEEEEeec----------CC--------------------------CeeEEEEEEeeCCeeec Q lcl|Aclame:pro 190 RN------LSGSGSVDLYTHVQRRK----------GT--------------------------AMDYAEMYHEIDGVRVG 227 (510) Q Consensus 190 ~~------~~~~~~v~v~~~v~~~~----------~~--------------------------~~~~~sv~~e~~~~~~~ 227 (510) .. .....+|.|+.|.++.. +. .+-+...++ .|..++ T Consensus 239 ~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~--~g~~~L 316 (714) T protein:vir:32 239 DRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF--VGPHFI 316 (714) T ss_pred cccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE--ecCccc Confidence 00 01124566666655421 00 011111222 233343 Q ss_pred --cccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchh-hhh-- Q lcl|Aclame:pro 228 --ETGRWPIHLCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD-DYQ-- 300 (510) Q Consensus 228 --~~~~y~~~~~P~~~~Rw~~--~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~-~~~-- 300 (510) ..+.|+...|||++.-... ..|..| |.+..+.+-.+.+|+.....+.+ ++.+-.+ +.++++...+ .+. T Consensus 317 ~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~-~~~~a~~~~d~~~~e~ 391 (714) T protein:vir:32 317 VDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVI-MDEDATQLSDNDLMEQ 391 (714) T ss_pred ccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCcee-eecCcccccHHHHHHh Confidence 3456776779998654333 557777 68888999999999865555443 3566555 4444553322 222 Q ss_pred cCCCccee---cCCccc---cccccCCC-ccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 301 DAEMGDYV---PGGAEA---VRAYERGD-YNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENT 371 (510) Q Consensus 301 ~~~~G~~~---~g~~~~---v~~~~~~~-~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r~~E~~~~ 371 (510) .+.+|.++ |+..+. ..+++... .+-.+...+.++...+.|.+.--. +. .-+.+...+..-|..|.+..... T Consensus 392 ~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~ 471 (714) T protein:vir:32 392 IERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATT 471 (714) T ss_pred ccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHH Confidence 12233333 222111 11222222 233445566666666666665411 11 11233335666699999999999 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhh----cC---CC------------CC--Cc------cce---e--eEEe-ecHHH Q lcl|Aclame:pro 372 LGGTYSLLAENLQSPLAYVCLSEVDD----AL---LQ------------GL--IT------KQH---K--PAIE-TGLPA 418 (510) Q Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~~----~~---l~------------~~--p~------~~~---~--~~~v-s~l~~ 418 (510) |...+.+|..-+.. +.+.++.++.. .. +. ++ +. .++ + +.+. .+-.+ T Consensus 472 l~~~~Dnl~~~~~~-~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~ 550 (714) T protein:vir:32 472 LAEINDNYQFACQQ-VGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTP 550 (714) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCch Confidence 99888776654222 12222232211 11 10 00 00 011 2 2222 24455 Q ss_pred HHHHHHHHHHHHHHHHHHhhcC---hH---hHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHH-- Q lcl|Aclame:pro 419 LSRSAAVQSMLNASQVIAGLAP---IA---QLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQ-- 490 (510) Q Consensus 419 l~r~~~~~~~~~~~q~~~~~~~---~~---q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~-- 490 (510) -.|.+.++.+.++++.+....+ +. +.+|.=+.+++++.+-+.+|.+...=-.++++-++..++++.++.+++ T Consensus 551 t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq 630 (714) T protein:vir:32 551 AFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQ 630 (714) T ss_pred HHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHH Confidence 6677777777777765321111 11 222334678999999999987532111222222111111110100000 Q ss_pred --HHHH---HHHHHHHHh------hcccC-----CC Q lcl|Aclame:pro 491 --AAQE---TLLEGASDM------TNALA-----GV 510 (510) Q Consensus 491 --~a~~---~~~~~a~~~------~~~~a-----g~ 510 (510) .+++ ...+.+.+. ....| .- T Consensus 631 ~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~ 666 (714) T protein:vir:32 631 MREMAGRVAKLEADAARAHAAAQRDNASAQREVALT 666 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 000000000 00000 00 No 43 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.14 E-value=1.3e-09 Score=69.23 Aligned_cols=484 Identities=13% Similarity=0.039 Sum_probs=220.1 Q ss_pred ChhHHHHHHHHHhcc-CchHHHH----HHHHhhcccccCCCCCCcc---cccccc-ccchHHHHHHHHHHHHHHhhcCcc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDG-SVEQRAI----EFAKTTLPYLMVDPMSGSR---GVVEHD-FQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~k~~~~~r~~~lkr~-~~~~~w~----e~~~~~~P~~~~~~~~~~~---~~~~~~-~dstg~~a~~~Laa~l~~~ltpp~ 71 (510) ..+...+.+..+++. ...+.|+ +-.+|..-. ..++..... ...+.+ |+=++. .++...+..- .+ T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~-Qw~~~~~~~l~~~g~p~~~~N~i~~-~v~~v~g~~~-----~n 89 (714) T protein:vir:81 17 TPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGD-QLPPEVLQVLKDRGQPMTIHNLIAP-TVDGVLGMEA-----KT 89 (714) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCC-CCCHHHHHHHHhcCCCcEEeccHHH-HHHHHHhHHH-----hC Confidence 222233333433321 2233454 444443311 111110000 001111 333322 2222222221 13 Q ss_pred CcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE--EEEeCCCC----eE Q lcl|Aclame:pro 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--LYRNSDEA----TV 145 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~--l~~~~~~~----~~ 145 (510) ++=+++.+.+... ...++.+.| +..+......+++..+...++.+.+..|-+. +|.+.|.. ++ T Consensus 90 r~~~~v~p~~~~~--------~~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i 158 (714) T protein:vir:81 90 RTDLVVMSDEPDD--------ETEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKV 158 (714) T ss_pred CcceEEecCCCCc--------hhHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEE Confidence 4444555432111 011233333 3344445567889999999999999888766 45554322 46 Q ss_pred EEEEeceEEEeeCCCC-cee---EEEEEEEecHHHHhHHhhHHh--hc-c-----------------------c------ Q lcl|Aclame:pro 146 VAWSLRSYAVRRDATG-RWM---DIVLKQRYKSKDLDDVYKQDL--MR-A-----------------------G------ 189 (510) Q Consensus 146 ~~~pl~~~~v~~d~~G-~v~---~i~r~~~~t~~~l~~~~~~~~--~~-~-----------------------~------ 189 (510) +.+|..++++..++.. .+. -++++.++|.+++...||+.. .. . + T Consensus 159 ~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (714) T protein:vir:81 159 STVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSW 238 (714) T ss_pred EecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccc Confidence 7778889998876532 222 478888999999988887521 00 0 0 Q ss_pred cc------CCCCceEEEEEEEEeec----------CC--------------------------CeeEEEEEEeeCCeeec Q lcl|Aclame:pro 190 RN------LSGSGSVDLYTHVQRRK----------GT--------------------------AMDYAEMYHEIDGVRVG 227 (510) Q Consensus 190 ~~------~~~~~~v~v~~~v~~~~----------~~--------------------------~~~~~sv~~e~~~~~~~ 227 (510) .. .....+|.|+.|.++.. +. .+-+...++ .|..++ T Consensus 239 ~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~--~g~~~L 316 (714) T protein:vir:81 239 DRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF--VGPHFI 316 (714) T ss_pred cccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE--ecCccc Confidence 00 01124566666655421 00 011111222 233343 Q ss_pred --cccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchh-hhh-- Q lcl|Aclame:pro 228 --ETGRWPIHLCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD-DYQ-- 300 (510) Q Consensus 228 --~~~~y~~~~~P~~~~Rw~~--~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~-~~~-- 300 (510) ..+.|+...|||++.-... ..|..| |.+..+.+-.+.+|+.....+.+ ++.+-.+ +.++++...+ .+. T Consensus 317 ~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~-~~~~a~~~~d~~~~e~ 391 (714) T protein:vir:81 317 VDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVI-MDEDATQLSDNDLMEQ 391 (714) T ss_pred ccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCcee-eecCcccccHHHHHHh Confidence 3456776779998654333 557777 68888999999999865555443 3566555 4444553322 222 Q ss_pred cCCCccee---cCCccc---cccccCCC-ccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 301 DAEMGDYV---PGGAEA---VRAYERGD-YNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENT 371 (510) Q Consensus 301 ~~~~G~~~---~g~~~~---v~~~~~~~-~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r~~E~~~~ 371 (510) .+.+|.++ |+..+. ..+++... .+-.+...+.++...+.|.+.--. +. .-+.+...+..-|..|.+..... T Consensus 392 ~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~ 471 (714) T protein:vir:81 392 IERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATT 471 (714) T ss_pred ccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHH Confidence 12233333 222111 11222222 233445566666666666665411 11 11233335666699999999999 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhh----cC---CC------------CC--Cc------cce---e--eEEe-ecHHH Q lcl|Aclame:pro 372 LGGTYSLLAENLQSPLAYVCLSEVDD----AL---LQ------------GL--IT------KQH---K--PAIE-TGLPA 418 (510) Q Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~~----~~---l~------------~~--p~------~~~---~--~~~v-s~l~~ 418 (510) |...+.+|..-+.. +.+.++.++.. .. +. ++ +. .++ + +.+. .+-.+ T Consensus 472 l~~~~Dnl~~~~~~-~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~ 550 (714) T protein:vir:81 472 LAEINDNYQFACQQ-VGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTP 550 (714) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCch Confidence 99888776654222 12222232211 11 10 00 00 011 2 2222 24455 Q ss_pred HHHHHHHHHHHHHHHHHHhhcC---hH---hHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHH-- Q lcl|Aclame:pro 419 LSRSAAVQSMLNASQVIAGLAP---IA---QLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQ-- 490 (510) Q Consensus 419 l~r~~~~~~~~~~~q~~~~~~~---~~---q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~-- 490 (510) -.|.+.++.+.++++.+....+ +. +.+|.=+.+++++.+-+.+|.+...=-.++++-++..++++.++.+++ T Consensus 551 t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq 630 (714) T protein:vir:81 551 AFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQ 630 (714) T ss_pred HHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHH Confidence 6677777777777765321111 11 222334678999999999987532111222222111111110100000 Q ss_pred --HHHH---HHHHHHHHh------hcccC-----CC Q lcl|Aclame:pro 491 --AAQE---TLLEGASDM------TNALA-----GV 510 (510) Q Consensus 491 --~a~~---~~~~~a~~~------~~~~a-----g~ 510 (510) .+++ ...+.+.+. ....| .- T Consensus 631 ~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~ 666 (714) T protein:vir:81 631 MREMAGRVAKLEADAARAHAAAQRDNASAQREVALT 666 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 000000000 00000 00 No 44 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.14 E-value=1.3e-09 Score=69.23 Aligned_cols=484 Identities=13% Similarity=0.039 Sum_probs=220.1 Q ss_pred ChhHHHHHHHHHhcc-CchHHHH----HHHHhhcccccCCCCCCcc---cccccc-ccchHHHHHHHHHHHHHHhhcCcc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDG-SVEQRAI----EFAKTTLPYLMVDPMSGSR---GVVEHD-FQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~k~~~~~r~~~lkr~-~~~~~w~----e~~~~~~P~~~~~~~~~~~---~~~~~~-~dstg~~a~~~Laa~l~~~ltpp~ 71 (510) ..+...+.+..+++. ...+.|+ +-.+|..-. ..++..... ...+.+ |+=++. .++...+..- .+ T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~-Qw~~~~~~~l~~~g~p~~~~N~i~~-~v~~v~g~~~-----~n 89 (714) T protein:vir:99 17 TPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGD-QLPPEVLQVLKDRGQPMTIHNLIAP-TVDGVLGMEA-----KT 89 (714) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCC-CCCHHHHHHHHhcCCCcEEeccHHH-HHHHHHhHHH-----hC Confidence 222233333433321 2233454 444443311 111110000 001111 333322 2222222221 13 Q ss_pred CcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE--EEEeCCCC----eE Q lcl|Aclame:pro 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--LYRNSDEA----TV 145 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~--l~~~~~~~----~~ 145 (510) ++=+++.+.+... ...++.+.| +..+......+++..+...++.+.+..|-+. +|.+.|.. ++ T Consensus 90 r~~~~v~p~~~~~--------~~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i 158 (714) T protein:vir:99 90 RTDLVVMSDEPDD--------ETEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKV 158 (714) T ss_pred CcceEEecCCCCc--------hhHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEE Confidence 4444555432111 011233333 3344445567889999999999999888766 45554322 46 Q ss_pred EEEEeceEEEeeCCCC-cee---EEEEEEEecHHHHhHHhhHHh--hc-c-----------------------c------ Q lcl|Aclame:pro 146 VAWSLRSYAVRRDATG-RWM---DIVLKQRYKSKDLDDVYKQDL--MR-A-----------------------G------ 189 (510) Q Consensus 146 ~~~pl~~~~v~~d~~G-~v~---~i~r~~~~t~~~l~~~~~~~~--~~-~-----------------------~------ 189 (510) +.+|..++++..++.. .+. -++++.++|.+++...||+.. .. . + T Consensus 159 ~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (714) T protein:vir:99 159 STVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSW 238 (714) T ss_pred EecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccc Confidence 7778889998876532 222 478888999999988887521 00 0 0 Q ss_pred cc------CCCCceEEEEEEEEeec----------CC--------------------------CeeEEEEEEeeCCeeec Q lcl|Aclame:pro 190 RN------LSGSGSVDLYTHVQRRK----------GT--------------------------AMDYAEMYHEIDGVRVG 227 (510) Q Consensus 190 ~~------~~~~~~v~v~~~v~~~~----------~~--------------------------~~~~~sv~~e~~~~~~~ 227 (510) .. .....+|.|+.|.++.. +. .+-+...++ .|..++ T Consensus 239 ~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~--~g~~~L 316 (714) T protein:vir:99 239 DRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF--VGPHFI 316 (714) T ss_pred cccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE--ecCccc Confidence 00 01124566666655421 00 011111222 233343 Q ss_pred --cccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchh-hhh-- Q lcl|Aclame:pro 228 --ETGRWPIHLCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD-DYQ-- 300 (510) Q Consensus 228 --~~~~y~~~~~P~~~~Rw~~--~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~-~~~-- 300 (510) ..+.|+...|||++.-... ..|..| |.+..+.+-.+.+|+.....+.+ ++.+-.+ +.++++...+ .+. T Consensus 317 ~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~-~~~~a~~~~d~~~~e~ 391 (714) T protein:vir:99 317 VDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVI-MDEDATQLSDNDLMEQ 391 (714) T ss_pred ccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCcee-eecCcccccHHHHHHh Confidence 3456776779998654333 557777 68888999999999865555443 3566555 4444553322 222 Q ss_pred cCCCccee---cCCccc---cccccCCC-ccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 301 DAEMGDYV---PGGAEA---VRAYERGD-YNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENT 371 (510) Q Consensus 301 ~~~~G~~~---~g~~~~---v~~~~~~~-~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r~~E~~~~ 371 (510) .+.+|.++ |+..+. ..+++... .+-.+...+.++...+.|.+.--. +. .-+.+...+..-|..|.+..... T Consensus 392 ~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~ 471 (714) T protein:vir:99 392 IERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATT 471 (714) T ss_pred ccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHH Confidence 12233333 222111 11222222 233445566666666666665411 11 11233335666699999999999 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhh----cC---CC------------CC--Cc------cce---e--eEEe-ecHHH Q lcl|Aclame:pro 372 LGGTYSLLAENLQSPLAYVCLSEVDD----AL---LQ------------GL--IT------KQH---K--PAIE-TGLPA 418 (510) Q Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~~----~~---l~------------~~--p~------~~~---~--~~~v-s~l~~ 418 (510) |...+.+|..-+.. +.+.++.++.. .. +. ++ +. .++ + +.+. .+-.+ T Consensus 472 l~~~~Dnl~~~~~~-~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~ 550 (714) T protein:vir:99 472 LAEINDNYQFACQQ-VGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTP 550 (714) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCch Confidence 99888776654222 12222232211 11 10 00 00 011 2 2222 24455 Q ss_pred HHHHHHHHHHHHHHHHHHhhcC---hH---hHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHH-- Q lcl|Aclame:pro 419 LSRSAAVQSMLNASQVIAGLAP---IA---QLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQ-- 490 (510) Q Consensus 419 l~r~~~~~~~~~~~q~~~~~~~---~~---q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~-- 490 (510) -.|.+.++.+.++++.+....+ +. +.+|.=+.+++++.+-+.+|.+...=-.++++-++..++++.++.+++ T Consensus 551 t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq 630 (714) T protein:vir:99 551 AFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQ 630 (714) T ss_pred HHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHH Confidence 6677777777777765321111 11 222334678999999999987532111222222111111110100000 Q ss_pred --HHHH---HHHHHHHHh------hcccC-----CC Q lcl|Aclame:pro 491 --AAQE---TLLEGASDM------TNALA-----GV 510 (510) Q Consensus 491 --~a~~---~~~~~a~~~------~~~~a-----g~ 510 (510) .+++ ...+.+.+. ....| .- T Consensus 631 ~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~ 666 (714) T protein:vir:99 631 MREMAGRVAKLEADAARAHAAAQRDNASAQREVALT 666 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 000000000 00000 00 No 45 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.14 E-value=1.3e-09 Score=69.23 Aligned_cols=484 Identities=13% Similarity=0.039 Sum_probs=220.1 Q ss_pred ChhHHHHHHHHHhcc-CchHHHH----HHHHhhcccccCCCCCCcc---cccccc-ccchHHHHHHHHHHHHHHhhcCcc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDG-SVEQRAI----EFAKTTLPYLMVDPMSGSR---GVVEHD-FQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~k~~~~~r~~~lkr~-~~~~~w~----e~~~~~~P~~~~~~~~~~~---~~~~~~-~dstg~~a~~~Laa~l~~~ltpp~ 71 (510) ..+...+.+..+++. ...+.|+ +-.+|..-. ..++..... ...+.+ |+=++. .++...+..- .+ T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~-Qw~~~~~~~l~~~g~p~~~~N~i~~-~v~~v~g~~~-----~n 89 (714) T protein:vir:27 17 TPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGD-QLPPEVLQVLKDRGQPMTIHNLIAP-TVDGVLGMEA-----KT 89 (714) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCC-CCCHHHHHHHHhcCCCcEEeccHHH-HHHHHHhHHH-----hC Confidence 222233333433321 2233454 444443311 111110000 001111 333322 2222222221 13 Q ss_pred CcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE--EEEeCCCC----eE Q lcl|Aclame:pro 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--LYRNSDEA----TV 145 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~--l~~~~~~~----~~ 145 (510) ++=+++.+.+... ...++.+.| +..+......+++..+...++.+.+..|-+. +|.+.|.. ++ T Consensus 90 r~~~~v~p~~~~~--------~~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i 158 (714) T protein:vir:27 90 RTDLVVMSDEPDD--------ETEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKV 158 (714) T ss_pred CcceEEecCCCCc--------hhHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEE Confidence 4444555432111 011233333 3344445567889999999999999888766 45554322 46 Q ss_pred EEEEeceEEEeeCCCC-cee---EEEEEEEecHHHHhHHhhHHh--hc-c-----------------------c------ Q lcl|Aclame:pro 146 VAWSLRSYAVRRDATG-RWM---DIVLKQRYKSKDLDDVYKQDL--MR-A-----------------------G------ 189 (510) Q Consensus 146 ~~~pl~~~~v~~d~~G-~v~---~i~r~~~~t~~~l~~~~~~~~--~~-~-----------------------~------ 189 (510) +.+|..++++..++.. .+. -++++.++|.+++...||+.. .. . + T Consensus 159 ~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (714) T protein:vir:27 159 STVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSW 238 (714) T ss_pred EecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccc Confidence 7778889998876532 222 478888999999988887521 00 0 0 Q ss_pred cc------CCCCceEEEEEEEEeec----------CC--------------------------CeeEEEEEEeeCCeeec Q lcl|Aclame:pro 190 RN------LSGSGSVDLYTHVQRRK----------GT--------------------------AMDYAEMYHEIDGVRVG 227 (510) Q Consensus 190 ~~------~~~~~~v~v~~~v~~~~----------~~--------------------------~~~~~sv~~e~~~~~~~ 227 (510) .. .....+|.|+.|.++.. +. .+-+...++ .|..++ T Consensus 239 ~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~--~g~~~L 316 (714) T protein:vir:27 239 DRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF--VGPHFI 316 (714) T ss_pred cccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE--ecCccc Confidence 00 01124566666655421 00 011111222 233343 Q ss_pred --cccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchh-hhh-- Q lcl|Aclame:pro 228 --ETGRWPIHLCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD-DYQ-- 300 (510) Q Consensus 228 --~~~~y~~~~~P~~~~Rw~~--~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~-~~~-- 300 (510) ..+.|+...|||++.-... ..|..| |.+..+.+-.+.+|+.....+.+ ++.+-.+ +.++++...+ .+. T Consensus 317 ~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~-~~~~a~~~~d~~~~e~ 391 (714) T protein:vir:27 317 VDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVI-MDEDATQLSDNDLMEQ 391 (714) T ss_pred ccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCcee-eecCcccccHHHHHHh Confidence 3456776779998654333 557777 68888999999999865555443 3566555 4444553322 222 Q ss_pred cCCCccee---cCCccc---cccccCCC-ccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 301 DAEMGDYV---PGGAEA---VRAYERGD-YNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENT 371 (510) Q Consensus 301 ~~~~G~~~---~g~~~~---v~~~~~~~-~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r~~E~~~~ 371 (510) .+.+|.++ |+..+. ..+++... .+-.+...+.++...+.|.+.--. +. .-+.+...+..-|..|.+..... T Consensus 392 ~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~ 471 (714) T protein:vir:27 392 IERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATT 471 (714) T ss_pred ccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHH Confidence 12233333 222111 11222222 233445566666666666665411 11 11233335666699999999999 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhh----cC---CC------------CC--Cc------cce---e--eEEe-ecHHH Q lcl|Aclame:pro 372 LGGTYSLLAENLQSPLAYVCLSEVDD----AL---LQ------------GL--IT------KQH---K--PAIE-TGLPA 418 (510) Q Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~~----~~---l~------------~~--p~------~~~---~--~~~v-s~l~~ 418 (510) |...+.+|..-+.. +.+.++.++.. .. +. ++ +. .++ + +.+. .+-.+ T Consensus 472 l~~~~Dnl~~~~~~-~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~ 550 (714) T protein:vir:27 472 LAEINDNYQFACQQ-VGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTP 550 (714) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCch Confidence 99888776654222 12222232211 11 10 00 00 011 2 2222 24455 Q ss_pred HHHHHHHHHHHHHHHHHHhhcC---hH---hHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHH-- Q lcl|Aclame:pro 419 LSRSAAVQSMLNASQVIAGLAP---IA---QLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQ-- 490 (510) Q Consensus 419 l~r~~~~~~~~~~~q~~~~~~~---~~---q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~-- 490 (510) -.|.+.++.+.++++.+....+ +. +.+|.=+.+++++.+-+.+|.+...=-.++++-++..++++.++.+++ T Consensus 551 t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq 630 (714) T protein:vir:27 551 AFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQ 630 (714) T ss_pred HHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHH Confidence 6677777777777765321111 11 222334678999999999987532111222222111111110100000 Q ss_pred --HHHH---HHHHHHHHh------hcccC-----CC Q lcl|Aclame:pro 491 --AAQE---TLLEGASDM------TNALA-----GV 510 (510) Q Consensus 491 --~a~~---~~~~~a~~~------~~~~a-----g~ 510 (510) .+++ ...+.+.+. ....| .- T Consensus 631 ~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~ 666 (714) T protein:vir:27 631 MREMAGRVAKLEADAARAHAAAQRDNASAQREVALT 666 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 000000000 00000 00 No 46 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.14 E-value=1.3e-09 Score=69.23 Aligned_cols=484 Identities=13% Similarity=0.039 Sum_probs=220.1 Q ss_pred ChhHHHHHHHHHhcc-CchHHHH----HHHHhhcccccCCCCCCcc---cccccc-ccchHHHHHHHHHHHHHHhhcCcc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDG-SVEQRAI----EFAKTTLPYLMVDPMSGSR---GVVEHD-FQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~k~~~~~r~~~lkr~-~~~~~w~----e~~~~~~P~~~~~~~~~~~---~~~~~~-~dstg~~a~~~Laa~l~~~ltpp~ 71 (510) ..+...+.+..+++. ...+.|+ +-.+|..-. ..++..... ...+.+ |+=++. .++...+..- .+ T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~-Qw~~~~~~~l~~~g~p~~~~N~i~~-~v~~v~g~~~-----~n 89 (714) T protein:vir:10 17 TPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGD-QLPPEVLQVLKDRGQPMTIHNLIAP-TVDGVLGMEA-----KT 89 (714) T ss_pred hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCC-CCCHHHHHHHHhcCCCcEEeccHHH-HHHHHHhHHH-----hC Confidence 222233333433321 2233454 444443311 111110000 001111 333322 2222222221 13 Q ss_pred CcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE--EEEeCCCC----eE Q lcl|Aclame:pro 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--LYRNSDEA----TV 145 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~--l~~~~~~~----~~ 145 (510) ++=+++.+.+... ...++.+.| +..+......+++..+...++.+.+..|-+. +|.+.|.. ++ T Consensus 90 r~~~~v~p~~~~~--------~~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i 158 (714) T protein:vir:10 90 RTDLVVMSDEPDD--------ETEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKV 158 (714) T ss_pred CcceEEecCCCCc--------hhHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEE Confidence 4444555432111 011233333 3344445567889999999999999888766 45554322 46 Q ss_pred EEEEeceEEEeeCCCC-cee---EEEEEEEecHHHHhHHhhHHh--hc-c-----------------------c------ Q lcl|Aclame:pro 146 VAWSLRSYAVRRDATG-RWM---DIVLKQRYKSKDLDDVYKQDL--MR-A-----------------------G------ 189 (510) Q Consensus 146 ~~~pl~~~~v~~d~~G-~v~---~i~r~~~~t~~~l~~~~~~~~--~~-~-----------------------~------ 189 (510) +.+|..++++..++.. .+. -++++.++|.+++...||+.. .. . + T Consensus 159 ~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 238 (714) T protein:vir:10 159 STVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSW 238 (714) T ss_pred EecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccc Confidence 7778889998876532 222 478888999999988887521 00 0 0 Q ss_pred cc------CCCCceEEEEEEEEeec----------CC--------------------------CeeEEEEEEeeCCeeec Q lcl|Aclame:pro 190 RN------LSGSGSVDLYTHVQRRK----------GT--------------------------AMDYAEMYHEIDGVRVG 227 (510) Q Consensus 190 ~~------~~~~~~v~v~~~v~~~~----------~~--------------------------~~~~~sv~~e~~~~~~~ 227 (510) .. .....+|.|+.|.++.. +. .+-+...++ .|..++ T Consensus 239 ~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~--~g~~~L 316 (714) T protein:vir:10 239 DRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF--VGPHFI 316 (714) T ss_pred cccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE--ecCccc Confidence 00 01124566666655421 00 011111222 233343 Q ss_pred --cccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchh-hhh-- Q lcl|Aclame:pro 228 --ETGRWPIHLCPYIVPTWNL--APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD-DYQ-- 300 (510) Q Consensus 228 --~~~~y~~~~~P~~~~Rw~~--~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~-~~~-- 300 (510) ..+.|+...|||++.-... ..|..| |.+..+.+-.+.+|+.....+.+ ++.+-.+ +.++++...+ .+. T Consensus 317 ~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~-~~~~a~~~~d~~~~e~ 391 (714) T protein:vir:10 317 VDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVI-MDEDATQLSDNDLMEQ 391 (714) T ss_pred ccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCcee-eecCcccccHHHHHHh Confidence 3456776779998654333 557777 68888999999999865555443 3566555 4444553322 222 Q ss_pred cCCCccee---cCCccc---cccccCCC-ccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 301 DAEMGDYV---PGGAEA---VRAYERGD-YNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENT 371 (510) Q Consensus 301 ~~~~G~~~---~g~~~~---v~~~~~~~-~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r~~E~~~~ 371 (510) .+.+|.++ |+..+. ..+++... .+-.+...+.++...+.|.+.--. +. .-+.+...+..-|..|.+..... T Consensus 392 ~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~ 471 (714) T protein:vir:10 392 IERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATT 471 (714) T ss_pred ccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHH Confidence 12233333 222111 11222222 233445566666666666665411 11 11233335666699999999999 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhh----cC---CC------------CC--Cc------cce---e--eEEe-ecHHH Q lcl|Aclame:pro 372 LGGTYSLLAENLQSPLAYVCLSEVDD----AL---LQ------------GL--IT------KQH---K--PAIE-TGLPA 418 (510) Q Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~~----~~---l~------------~~--p~------~~~---~--~~~v-s~l~~ 418 (510) |...+.+|..-+.. +.+.++.++.. .. +. ++ +. .++ + +.+. .+-.+ T Consensus 472 l~~~~Dnl~~~~~~-~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~ 550 (714) T protein:vir:10 472 LAEINDNYQFACQQ-VGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTP 550 (714) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCch Confidence 99888776654222 12222232211 11 10 00 00 011 2 2222 24455 Q ss_pred HHHHHHHHHHHHHHHHHHhhcC---hH---hHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHH-- Q lcl|Aclame:pro 419 LSRSAAVQSMLNASQVIAGLAP---IA---QLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQ-- 490 (510) Q Consensus 419 l~r~~~~~~~~~~~q~~~~~~~---~~---q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~-- 490 (510) -.|.+.++.+.++++.+....+ +. +.+|.=+.+++++.+-+.+|.+...=-.++++-++..++++.++.+++ T Consensus 551 t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq 630 (714) T protein:vir:10 551 AFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQ 630 (714) T ss_pred HHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHH Confidence 6677777777777765321111 11 222334678999999999987532111222222111111110100000 Q ss_pred --HHHH---HHHHHHHHh------hcccC-----CC Q lcl|Aclame:pro 491 --AAQE---TLLEGASDM------TNALA-----GV 510 (510) Q Consensus 491 --~a~~---~~~~~a~~~------~~~~a-----g~ 510 (510) .+++ ...+.+.+. ....| .- T Consensus 631 ~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~ 666 (714) T protein:vir:10 631 MREMAGRVAKLEADAARAHAAAQRDNASAQREVALT 666 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 000000000 00000 00 No 47 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=98.93 E-value=1.2e-08 Score=63.92 Aligned_cols=476 Identities=12% Similarity=0.040 Sum_probs=212.6 Q ss_pred Chh--HHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCCCCcc---cccccc-ccchHHHHHHHHHHHHHHhhcCccC Q lcl|Aclame:pro 1 MKS--TAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSR---GVVEHD-FQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~k~--~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~~~~~---~~~~~~-~dstg~~a~~~Laa~l~~~ltpp~~ 72 (510) .+. .+..+|..-. ...|-....+-.+|..-. ..++..... ...+.+ |+=++ ..++...+..- .++ T Consensus 20 ~~~~~~~~~~~~~~~~~q~~~r~~a~~d~~fy~G~-QW~~~~~~~l~~~g~p~~~~N~i~-~~v~~v~g~~~-----~nr 92 (772) T protein:vir:10 20 TPLTVDEYADINYEIEDQPAWRAVADKEMDYADGN-QLDTELLRRQQALGIPPAVEDLIG-PALLSLQGYEA-----VTR 92 (772) T ss_pred cccCHHHHHHHHHHHhccHHHHHHHHHHHHhhcCC-CCCHHHHHHHHhcCCCcEEEcchH-HHHHHHHHHHH-----hcC Confidence 221 2223333221 112333333434443311 111110000 000111 23222 22222222221 144 Q ss_pred cccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCCC----eEE Q lcl|Aclame:pro 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDEA----TVV 146 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l--~~~~~~~----~~~ 146 (510) +=+++.+.++.. ..++.+.|+. .+......+++..+...++.+.+..|-+.+ +.++|.. +++ T Consensus 93 ~d~~v~Pr~~~~---------d~~~Ae~l~~---~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d~~~~~i~i~ 160 (772) T protein:vir:10 93 TDWRVTPNGDVG---------GQEVADALNY---RLNTAERQSGADRACSEAFRPQIACGIGWVEVSRESDPFKFPYRCR 160 (772) T ss_pred cceEEecCCCch---------HHHHHHHHHH---HHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccccCCCCCCeEEE Confidence 445555532110 1223334433 344445678999999999999998887664 3333321 356 Q ss_pred EEEeceEEEeeCCCCceeE---EEEEEEecHHHHhHHhhHHh--hc-cc---------------c--------------- Q lcl|Aclame:pro 147 AWSLRSYAVRRDATGRWMD---IVLKQRYKSKDLDDVYKQDL--MR-AG---------------R--------------- 190 (510) Q Consensus 147 ~~pl~~~~v~~d~~G~v~~---i~r~~~~t~~~l~~~~~~~~--~~-~~---------------~--------------- 190 (510) .++..++++..++.....+ +||..+|+.+++...|+++. .. .. . T Consensus 161 ~v~p~~v~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (772) T protein:vir:10 161 PIRRDEIHWDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNAWNEAR 240 (772) T ss_pred eeCcccceecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccccccccccccccccccchhh Confidence 6777888888877655555 78888999999988887431 10 00 0 Q ss_pred ---------cCCCCceEEEEEEEEeec----------CCCeeE-----------------------EEEEE-eeCCeeec Q lcl|Aclame:pro 191 ---------NLSGSGSVDLYTHVQRRK----------GTAMDY-----------------------AEMYH-EIDGVRVG 227 (510) Q Consensus 191 ---------~~~~~~~v~v~~~v~~~~----------~~~~~~-----------------------~sv~~-e~~~~~~~ 227 (510) .....++|.|+++.+++. ++...| ..||+ -+-|.+++ T Consensus 241 ~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~~~~~~g~~~L 320 (772) T protein:vir:10 241 AWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVSRVRRSYWLGPHCL 320 (772) T ss_pred ccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeeeEEEEEEEecceee Confidence 001136788888765542 110000 01111 12244444 Q ss_pred --cccccccccCceEEEeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhh--h-- Q lcl|Aclame:pro 228 --ETGRWPIHLCPYIVPTWN--LAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDD--Y-- 299 (510) Q Consensus 228 --~~~~y~~~~~P~~~~Rw~--~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~--~-- 299 (510) ..+.|+...|||++.-.. ...|..| |.+....+-.+.+|+.....+..... .. .+. +.|-++..+ + T Consensus 321 ~~~~~p~~~~~fP~vP~~g~r~~~~g~~~--G~vr~~kd~Qr~~N~~~S~~~~~l~~--~~-~~~-~~gav~~~d~~~~e 394 (772) T protein:vir:10 321 HDGPTPYTHRHFPYVPFFGFREDATGIPY--GYVRGMKYAQDSLNSGVSKLRWGMSV--AR-VER-TKGAVAMTDAQFRR 394 (772) T ss_pred ccCCCCCCCCccceEEEeeeEeccCCccc--chhhhhhhHHHHHHHHHHHHHHHHhc--cc-ccc-cCCCccchhHHHHH Confidence 457788888999965333 3456666 78999999999999876665554322 22 223 334333221 1 Q ss_pred hcCCCccee---cCCccc-cccccCCCccch-HHHHHHHHHHHHHHHHHH-hhc-ccCCCCCCCCHHHHHHHHHHHHHHh Q lcl|Aclame:pro 300 QDAEMGDYV---PGGAEA-VRAYERGDYNKM-AAIQQSLQAVVVRLNQAF-MYG-ANQRDAERVTAEEVRITAEEAENTL 372 (510) Q Consensus 300 ~~~~~G~~~---~g~~~~-v~~~~~~~~~~~-~~~~~~i~~~~~~I~~af-~~~-~~~~~~~~vTAtEi~~r~~E~~~~L 372 (510) ..+.++.++ +|..+. -..++......+ ....+.++...+.|.++- ..+ ..-+.+...+..-|..|.+.....+ T Consensus 395 ~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~na~SGvAi~~rq~qg~~~l 474 (772) T protein:vir:10 395 QIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKGTATSGIQEQQQIEQSNQSI 474 (772) T ss_pred hccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchhhHHHHHHHHHHHHHHH Confidence 112223222 332221 111222221222 344555566666666643 111 1112333457777999999999999 Q ss_pred hhhHHHHHHHH------HHHHHHHHHH------HHhhcCC-C------CCC------c-----cce---eeEEe---ecH Q lcl|Aclame:pro 373 GGTYSLLAENL------QSPLAYVCLS------EVDDALL-Q------GLI------T-----KQH---KPAIE---TGL 416 (510) Q Consensus 373 Gpv~~rl~~E~------l~Pli~r~~~------il~~~~l-~------~~p------~-----~~~---~~~~v---s~l 416 (510) +..+.+|..-. +.-||...+. |....+. + .-+ . .++ +..++ .+. T Consensus 475 ~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~~~p~ 554 (772) T protein:vir:10 475 GRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKVALEDVPS 554 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceeccceeeeEEEEeecccc Confidence 99988766543 3333333321 1111000 0 000 0 011 11121 233 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcCh-H--------hHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 417 PALSRSAAVQSMLNASQVIAGLAPI-A--------QLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAA 487 (510) Q Consensus 417 ~~l~r~~~~~~~~~~~q~~~~~~~~-~--------q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~ 487 (510) .+-.|.+.++.+.+++ +.+.+. . +.+|-=+.+++++.+-+..+-+ ++|+.++.++++.|+++ T Consensus 555 ~~t~r~~~~~~m~ql~---~~~~P~~~~~~~~~~le~~D~p~~~ei~~~ir~~~~~~------~peq~~~~~~q~~qq~~ 625 (772) T protein:vir:10 555 TNSYRGQQLNAMSEAV---KSMPPQYQAAVLPFLVSLMDVPFKRDVVEAIRAVDQQQ------TPEQIQQQIDQAVQDAL 625 (772) T ss_pred chHHHHHHHHHHHHHH---hccChhHHHHHHHHHHhhcCCCChHHHHHHHHHHhccC------ChHHHHHHHHHHHHHHH Confidence 3445555555555444 333221 1 1111124567777776665432 34444433333333222 Q ss_pred HHHHHHHH---HH-------HHHHH--hhcccCCC Q lcl|Aclame:pro 488 QAQAAQET---LL-------EGASD--MTNALAGV 510 (510) Q Consensus 488 ~~~~a~~~---~~-------~~a~~--~~~~~ag~ 510 (510) +++.+... +. +.+.+ ....+.++ T Consensus 626 ~~~~~el~~~q~~a~~~~~~A~a~~~~aqa~~~~~ 660 (772) T protein:vir:10 626 AKAGNDIKLRELEIKERKADSEISGLNAKAVQIGV 660 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22221110 00 00000 00001111 No 48 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=98.58 E-value=2.5e-07 Score=56.79 Aligned_cols=429 Identities=9% Similarity=-0.028 Sum_probs=168.8 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhc-----ccccCCCCCCccccccccccchHHHHHHHHHHHHH--HhhcCccCc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTL-----PYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLA--RSLFPTGIP 73 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~-----P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~--~~ltpp~~~ 73 (510) +.+.|...|..= + .+.+.+.+|.. |+.-.. . ....+..++..+-+...|+++|..|. +..+|.... T Consensus 12 ~i~~L~~~~~~~-~----~r~~~~~~Yy~g~~~i~~~~~~-~-~~~~~~~~~~~n~~~~ivd~~a~~l~~~Gf~~~~~~~ 84 (488) T protein:vir:23 12 LRDQLLDAFENK-Q----NELKSSKAYYDAERRPDAIGLA-V-PLDMRKYLAHVGYPRTYVDAIAERQELEGFRIPSANG 84 (488) T ss_pred HHHHHHHHHHHH-H----HHHHHHHHHHhcccchhhcCcc-c-chhhhhhhhhcchHHHHHHHHHHhhhccceeccCCcc Confidence 333333333221 1 23333333321 211100 0 00111123456666777777777653 222221111 Q ss_pred ccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC----------- Q lcl|Aclame:pro 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE----------- 142 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~----------- 142 (510) +=--..+ ..++.+. +.+.+..++|.....++.++..++|.+.+++..+. T Consensus 85 ~~~~~~~-------------d~~~~~~-------l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~ 144 (488) T protein:vir:23 85 EEPESGG-------------ENDPASE-------LWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEV 144 (488) T ss_pred ccccccc-------------chhHHHH-------HHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCc Confidence 1110011 1112222 23447788999999999999999999876654321 Q ss_pred CeEEEEEece-EEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEee Q lcl|Aclame:pro 143 ATVVAWSLRS-YAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEI 221 (510) Q Consensus 143 ~~~~~~pl~~-~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~ 221 (510) .++++++-.+ |++.-+..+++...++.+. . .+......+++|+ + +.- ..|... T Consensus 145 ~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~-~----------------~~~~~~~~~~~y~---~--~~~----~~~~~~ 198 (488) T protein:vir:23 145 PLIRVEPPTALYAEVDPRTRKVLYAIRAIY-G----------------ADGNEIVSATLYL---P--DTT----MTWLRA 198 (488) T ss_pred ceEEEeccceeEEEEecCCCceEEEEEEEE-e----------------cCCCcEEEEEEEe---c--CcE----EEEEec Confidence 1466776666 4444445677766565542 0 0001111222322 1 110 111222 Q ss_pred CCeee-ccccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhhCCceeeC---CCCc-c- Q lcl|Aclame:pro 222 DGVRV-GETGRWPIHLCPYIVPTWNLAPGEHYGRGHVED-YIGDFAKLSLLSEKLGLYELESLEVLNLVD---EAKG-A- 294 (510) Q Consensus 222 ~~~~~-~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~-~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~---~~g~-~- 294 (510) +|... .....+++..+|++.++.+...++.+|+|=..+ .++-+..++...-.+...+...+.|...+- ++.. . T Consensus 199 ~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~ 278 (488) T protein:vir:23 199 EGEWEAPTSTPHGLEMVPVIPISNRTRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGIN 278 (488) T ss_pred CCceEeccccccCCCCcceEEeccccccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCccccccc Confidence 23222 222334557899999999988899999996654 345566777776676777776666654432 1110 0 Q ss_pred --chhhhhcCCCcceecCC-ccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcccC-----CCCCC-CCHHHHHHHH Q lcl|Aclame:pro 295 --VVDDYQDAEMGDYVPGG-AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQ-----RDAER-VTAEEVRITA 365 (510) Q Consensus 295 --~~~~~~~~~~G~~~~g~-~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~-----~~~~~-vTAtEi~~r~ 365 (510) +...+.....|.+.... ..++...+.+. .++ ...++.++.-|...+...... ..... -++.-++... T Consensus 279 ~~~~~~~~~~~~~~v~~~~~g~~~~~~q~~~-~~~---~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~ 354 (488) T protein:vir:23 279 AETGQRMFDAYMARILAFEGGEGAHAEQFSA-AEL---RNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAE 354 (488) T ss_pred ccccchhhhhhhhhhccCCCCCCceeEecCC-CCh---HHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHH Confidence 01111111122221111 11222223322 233 344555555555443222111 11111 1333332221 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHh--hcCCCCCCcc--ceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcCh Q lcl|Aclame:pro 366 EEAENTLGGTYSLLAENLQSPLAYVCLSEVD--DALLQGLITK--QHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI 441 (510) Q Consensus 366 ~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~--~~~l~~~p~~--~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~ 441 (510) .-+... .++.+.. +.+-+.+++.++. .++ ...+.+ .+++.+..+. +-..++.++.+..+.+. ..+. T Consensus 355 ~~l~~k----~~~~~~~-f~~~l~~~~~l~~~~~~~-~~~~~~~~~i~v~f~~~~-~~s~~~~ada~~kl~~~---g~~~ 424 (488) T protein:vir:23 355 SRLVKK----VERKNKI-FGGAWEQAMRLAYKMVKG-GDIPTEYYRMETVWRDPS-TPTYAAKADAAAKLFAN---GAGL 424 (488) T ss_pred HHHHHH----HHHHHHH-HHHHHHHHHHHHHHHhcC-CCcchhhccceEEecCCC-CCCHHHHHHHHHHHHhc---cccc Confidence 111111 1222222 2223344443322 122 122222 3444443222 11223333333322221 1111 Q ss_pred HhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 442 AQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 442 ~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) +..+. +...+|.... ..++++++.++++.++..+..+......+..+.+.+.+|- T Consensus 425 ------~s~et----~~~~l~~~~d----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (488) T protein:vir:23 425 ------IPRER----GWVDMGYTIV----EREQMRQWLEQDQKQGLGLIGSLYGASTPEGKPGEAPVGE 479 (488) T ss_pred ------CCHHH----HHHhCCCCch----HHHHHHHHHHHHHHHHHHHHHHHhccCCCcccCCCCCCCC Confidence 11111 3333443211 1233333333332222211111100011111222233333 No 49 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=98.54 E-value=3.2e-07 Score=56.21 Aligned_cols=429 Identities=11% Similarity=0.058 Sum_probs=181.8 Q ss_pred ChhHHHHHHHH----------Hh-------------ccCchHHHHHHHHhhcccccCCCCCCcccccccccc--chHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEK----------LR-------------DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQ--SAGALL 55 (510) Q Consensus 1 ~k~~~~~r~~~----------lk-------------r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~d--stg~~a 55 (510) |-+.++..+.+ |+ ...-...|+.+++=..|.+. .....+.+..+... ..+... T Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~--~~~~~~~~~~~~~~sln~~~~i 80 (508) T protein:vir:15 3 LIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIH--YQASDGIKKKRLKNTINMAKTA 80 (508) T ss_pred hHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccc--cccCCCCccccceeecchHHHH Confidence 22222222211 11 11124457777665444321 11111111122223 445555 Q ss_pred HHHHHHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE Q lcl|Aclame:pro 56 VNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL 135 (510) Q Consensus 56 ~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~ 135 (510) ++.+|+-+.+-.. .++++++. ...++|++ .+..++|+..+.+++.+..+.|.++ T Consensus 81 ~~~~A~lv~~e~~-------~i~v~~~~------------~~~e~l~~-------il~~n~f~~~~~~~~e~a~a~G~~~ 134 (508) T protein:vir:15 81 ARRIASVVFNEKA-------EIHVKDNN------------EADKFLND-------VLEDNDFKNKFEEALEKGVALGGFA 134 (508) T ss_pred HHHHHhhhhCCCc-------eEEeCCch------------HHHHHHHH-------HHHhccHHHHHHHHHHHHhhcCceE Confidence 6666655533211 11222211 12334444 4778899999999999999999877 Q ss_pred --EEEeCCCCeEEEEEeceEEE-eeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEee---cC Q lcl|Aclame:pro 136 --LYRNSDEATVVAWSLRSYAV-RRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRR---KG 209 (510) Q Consensus 136 --l~~~~~~~~~~~~pl~~~~v-~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~---~~ 209 (510) .|++.+..++.+++...++- ..| .|++.++.+..+.... .+.+-.+|+.++.. ++ T Consensus 135 ~k~~~d~~~~~i~~v~ad~~~P~~~d-~~~~~~~af~~~~~~~------------------~~~~~~~yt~lE~h~~~~~ 195 (508) T protein:vir:15 135 MRPYIDGNHIKIAWVRADQFYPLQSN-TNDISEAAIASRTQRT------------------ESNQTKYYTLLEFHQWQDN 195 (508) T ss_pred EEEEEeCCeeEEEEEcCCeeEEEEEc-CCCeEEEEEEEEEEee------------------cCCCceEEEEEEEEEEecC Confidence 47776666678888777764 455 4556554433222110 00111223322221 11 Q ss_pred C-CeeEEEEEEeeC----Ceeec-----------ccccc-ccccCceEEEeee----ecCCCccccchHHHHHHHHHHHH Q lcl|Aclame:pro 210 T-AMDYAEMYHEID----GVRVG-----------ETGRW-PIHLCPYIVPTWN----LAPGEHYGRGHVEDYIGDFAKLS 268 (510) Q Consensus 210 ~-~~~~~sv~~e~~----~~~~~-----------~~~~y-~~~~~P~~~~Rw~----~~~ge~YGrgp~~~~l~d~~~L~ 268 (510) + ..--..+|...+ |..+. .+..+ +....||+.++.. ...+++||+|-...+.+-+..|| T Consensus 196 ~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD 275 (508) T protein:vir:15 196 GSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDIN 275 (508) T ss_pred cceEEEEEEEecCCchhcCcccchhhcccccCCCcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHH Confidence 1 111112222211 11111 01000 1122344444432 23367899999999999999999 Q ss_pred HHHHHHHHHHHHhhCCceeeCCCCccchhh--hhcCCCc-c-ee--cCCcc---ccccccCCCccchHHHHHHHHHHHHH Q lcl|Aclame:pro 269 LLSEKLGLYELESLEVLNLVDEAKGAVVDD--YQDAEMG-D-YV--PGGAE---AVRAYERGDYNKMAAIQQSLQAVVVR 339 (510) Q Consensus 269 ~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~--~~~~~~G-~-~~--~g~~~---~v~~~~~~~~~~~~~~~~~i~~~~~~ 339 (510) ..--.+.... ...++...|++ .+++++. ......+ . ++ .+..+ .+..++.. -........++.+.+. T Consensus 276 ~~~s~~~~e~-~~~~~~i~v~~-~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~--ir~e~~~~~~~~~l~~ 351 (508) T protein:vir:15 276 DTHDQFIWEI-RLGQKHIAVQP-GMLRFDDEHKPTFDTEQNVYVGVLSDDNNGLGVKDMTTP--IRTVQYKDAIDHFIKE 351 (508) T ss_pred HHHHHHHHHH-Hhcccceeech-HHhcCCCCCccccCCCCeeEEeccCCCCCCCceeEeecc--cChHHHHHHHHHHHHH Confidence 8777766655 45566655543 3333211 0000011 0 11 11111 11111111 0112233444455444 Q ss_pred HHHHHhhc--ccCCCC-CCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhh-----cCCC----CCCccc Q lcl|Aclame:pro 340 LNQAFMYG--ANQRDA-ERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-----ALLQ----GLITKQ 407 (510) Q Consensus 340 I~~af~~~--~~~~~~-~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-----~~l~----~~p~~~ 407 (510) |....-+. .+.-++ ..-|||||..+.+.......- ..+.-...|..|++-++.++.- .+.+ ..+..+ T Consensus 352 ~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t~~~-~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~ 430 (508) T protein:vir:15 352 FEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQTRSS-YLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQP 430 (508) T ss_pred HHHHhCCCchhcccccCccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCC Confidence 44433221 111122 224999999988888777665 4444445566666666555432 2222 122223 Q ss_pred --eeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 408 --HKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQ 485 (510) Q Consensus 408 --~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qq 485 (510) +.+.+-.++.. .+..+++... +.++ +|. +... ..+....|++ +||.+++.++.+.. T Consensus 431 ~~v~v~f~D~i~~-d~~~~~~~~~---~~v~--aGi------~s~e---~~i~~~~g~~-------deea~~el~ri~~E 488 (508) T protein:vir:15 431 LDIECHFDDGVFV-NKDKQLEEDA---KVLA--IGA------LSKQ---TFLQRNYGMT-------DEQAAEELAKIQSE 488 (508) T ss_pred cceEEEeCCCCCC-CHHHHHHHHH---HHHh--cCC------CCHH---HHHHhcCCCC-------hHHHHHHHHHHHHh Confidence 44443333322 1222222222 2221 121 1111 2234455654 34444433332221 Q ss_pred HHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 486 AAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 486 a~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) +. + ..+..+..+..-.|. T Consensus 489 ~~---~----~~~~~~~~~~~~g~~ 506 (508) T protein:vir:15 489 AP---T----DTFEGGRSAILNGGD 506 (508) T ss_pred cc---c----cCccccccccCCCCC Confidence 11 0 111111111111112 No 50 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=98.45 E-value=6e-07 Score=54.71 Aligned_cols=441 Identities=10% Similarity=0.010 Sum_probs=191.8 Q ss_pred Ch---------hHHHHHHHHHhcc-CchHHHHHHHHhhcccccCCCCCC-c----cccccc-cccchHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MK---------STAAMLWEKLRDG-SVEQRAIEFAKTTLPYLMVDPMSG-S----RGVVEH-DFQSAGALLVNNLAAKLA 64 (510) Q Consensus 1 ~k---------~~~~~r~~~lkr~-~~~~~w~e~~~~~~P~~~~~~~~~-~----~~~~~~-~~dstg~~a~~~Laa~l~ 64 (510) |- ..+..+|+..++- .=...|++..+-.||..-..+.+. + ..++.+ .|-+.-.+.++. |+ T Consensus 32 m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~----l~ 107 (535) T protein:vir:80 32 LPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDG----MM 107 (535) T ss_pred CCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHH----Hh Confidence 22 2234455544311 012344555444566521111111 1 111211 233444444444 44 Q ss_pred HhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC- Q lcl|Aclame:pro 65 RSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA- 143 (510) Q Consensus 65 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~- 143 (510) +.+|- ..|.+ ++++ .++.++++| -+...+.+.-+..++.+...+|-+.+++|-+.. T Consensus 108 G~vfr-k~p~~--~~p~--------------~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~ 164 (535) T protein:vir:80 108 GQVFS-RDPIR--QLPP--------------ALEAIVEDI------DGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVG 164 (535) T ss_pred chhhc-CCcce--eccH--------------HHHHHHhcc------CCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCC Confidence 44442 12333 2221 234444443 345667888888999999999999888884321 Q ss_pred ---------------eEEEEEeceEE---Eee-CCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEE Q lcl|Aclame:pro 144 ---------------TVVAWSLRSYA---VRR-DATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHV 204 (510) Q Consensus 144 ---------------~~~~~pl~~~~---v~~-d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v 204 (510) .+..|+-.+.. ..+ |+.+++.-+..+++.+.+. ..|+ .+.++.|.++ T Consensus 165 ~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~d--d~f~------------~~~~~q~RvL 230 (535) T protein:vir:80 165 RPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQD--DGFE------------TTYVQQWRVL 230 (535) T ss_pred CcccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEecC--CCcc------------cceeEEEEEE Confidence 14445533322 222 3344565566666554322 3333 3445556666 Q ss_pred EeecCCCeeEEEEEEeeCC-------eeeccccccccccCceEEEeeeecCCCcc--ccchHHHHHHHHHHHHHHH---H Q lcl|Aclame:pro 205 QRRKGTAMDYAEMYHEIDG-------VRVGETGRWPIHLCPYIVPTWNLAPGEHY--GRGHVEDYIGDFAKLSLLS---E 272 (510) Q Consensus 205 ~~~~~~~~~~~sv~~e~~~-------~~~~~~~~y~~~~~P~~~~Rw~~~~ge~Y--Grgp~~~~l~d~~~L~~l~---~ 272 (510) .+..++++-++.+..+.++ ..+...++ .+.+++|++.|.-..+..+ |..| |=|+..||.-. . T Consensus 231 ~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~g--~~~l~~IPfv~~~~~~~~~~~~~pP----Ll~LA~lni~Hy~~s 304 (535) T protein:vir:80 231 QLNAEGNYQVERWRRETQEEMYYSYSKHVPTDGN--GNPFKEIPFQFIGPLDNNADIDHPP----LLDLCEVNIGHYRNS 304 (535) T ss_pred EecCCceEEEEEEEeecCCccccccceeecccCC--CcccCeeEEEEeecCCCCCCCCccc----hHHHHHHHHHHhhch Confidence 6655444433322222221 12222222 1457788888875555444 4444 33555555332 2 Q ss_pred HHHH-HHHHhhCCceeeC-C-----CCccchhhhhcCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 273 KLGL-YELESLEVLNLVD-E-----AKGAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM 345 (510) Q Consensus 273 ~~l~-~~~~a~~~~~lv~-~-----~g~~~~~~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~ 345 (510) +-++ .+..+..|...+. . +.......+..+.+..+.-+...+.+.++... ..+. .+.++++++++.+.= T Consensus 305 sd~~~il~~~~~P~l~i~G~~~~~~~~~~~~~~i~iG~~~~~~lP~~~~~~~~e~~~-~~~a--~~~l~~~e~qM~~lG- 380 (535) T protein:vir:80 305 ADYEEMAFVAGQPTAFFTGLTKDWVEDVFKDFKVHLGSRAIIPLPQGATAGILQITP-NSVP--FEAMTHKESQMIAMG- 380 (535) T ss_pred hHHHHHHHHhcCceeeeecCchhhhhcCCCCcceEecCcccccCCCCCCcceeeecc-chhH--HHHHHHHHHHHHHHH- Confidence 2232 3444545543332 1 11222222333333222211122233333321 2232 456777777776632 Q ss_pred hcccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHH Q lcl|Aclame:pro 346 YGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAV 425 (510) Q Consensus 346 ~~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~ 425 (510) ..++.......||+|.+.+....-..|..+..++.+-+ .+++.++-+-.=..+.++.+++.+-.- -..+..+. T Consensus 381 a~ll~~~~~~~Ta~~a~~~~~~~~S~L~~~a~~le~al-----~~aL~~~A~w~G~~~~~~~~~i~~n~d--F~~~~ld~ 453 (535) T protein:vir:80 381 ANLLVKSGGNRTFGEAQQEEASEQSILSACTKNVSMAF-----RKALRWANQFQTGIVNDETVEYNLNTD--FPAARLTP 453 (535) T ss_pred HHhhccCcccccHHHHHHHHHHHhHHHHHHHHHHHHHH-----HHHHHHHHHHcCCccCCCceEEEeccc--cccccCCH Confidence 22233344457999999988888888998888877664 334444332110122334444443211 11111122 Q ss_pred HHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----- Q lcl|Aclame:pro 426 QSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGA----- 500 (510) Q Consensus 426 ~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a----- 500 (510) +.+.+++..... + .|..+.+...+ ...||... -+ +.+|.+.+.+.+.+....+.........++ T Consensus 454 ~~~~all~~~~~--G------~Is~et~~~~L-~r~gvl~~-~~-~~eee~~ri~~E~~~~~~~~g~~~d~~~~g~~~~~ 522 (535) T protein:vir:80 454 NERAELILEWQQ--G------AITFKEMRAGL-RRAGVASE-DD-AKAETEGKATVEFIAKTAAAGKVGDAASGGTNKAK 522 (535) T ss_pred HHHHHHHHHHhc--C------CCCHHHHHHHH-HhCCCCCc-cc-chHHHHHHHHhhhhhccccCCCCCCCCCCCCCcCc Confidence 333333332221 1 24445555555 55566432 12 223333333333222111110000000111 Q ss_pred ---HHhhcccCCC Q lcl|Aclame:pro 501 ---SDMTNALAGV 510 (510) Q Consensus 501 ---~~~~~~~ag~ 510 (510) ++.++++||- T Consensus 523 ~~~~~~~~~~~~~ 535 (535) T protein:vir:80 523 LNNGNGGGNQAGN 535 (535) T ss_pred ccCCccccccCCC Confidence 2334445555 No 51 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=98.42 E-value=7.5e-07 Score=54.16 Aligned_cols=421 Identities=12% Similarity=0.004 Sum_probs=166.5 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhcccc-cCCCCCC--ccccccccccchHHHHHHHHHHHHHHhhcCccCccccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYL-MVDPMSG--SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~~-~~~~~~~--~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l 77 (510) =.+.+...++++.+ ...+.+.+.+|..-.. .+..+.. ...+..++..+-+...|+.+++.| ++.+ |.. T Consensus 4 ~~~~i~~L~~~~~~--~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l----~~~g---~~~ 74 (480) T protein:vir:78 4 YHEHVERLQGLLAR--DLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL----DIEG---FRI 74 (480) T ss_pred HHHHHHHHHHHHHH--HHHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhh----ccCc---eec Confidence 23334444444421 1123333333322211 0000100 001111234455666666666655 3322 222 Q ss_pred CCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeC------C-CC--eEEEE Q lcl|Aclame:pro 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS------D-EA--TVVAW 148 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~------~-~~--~~~~~ 148 (510) +++. +..+ .+.+.+..++|.....+++++..++|.+.+++.. + .+ +++++ T Consensus 75 --~~d~------------~~~~-------~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~ 133 (480) T protein:vir:78 75 --SEDS------------EGLE-------ELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVE 133 (480) T ss_pred --CCCc------------hhHH-------HHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEE Confidence 2110 0111 2233466789999999999999999998766542 1 22 46677 Q ss_pred EeceEEEeeCC--CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCe-- Q lcl|Aclame:pro 149 SLRSYAVRRDA--TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGV-- 224 (510) Q Consensus 149 pl~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~-- 224 (510) +..+.++..|+ .+++...++.+.-. .+......+++|+. +.- .+|...++. T Consensus 134 ~p~~~~~~~D~~~~~~~~~~i~~~~~~----------------~~~~~~~~~~~y~~-----~~~----~~~~~~~~~~~ 188 (480) T protein:vir:78 134 SPLYMYAELDPRNTRRVTRAVRLYTTR----------------DDVAVPDRATLYLP-----DET----VPLRRNGGLND 188 (480) T ss_pred cccceEEEEcCCCccceEEEEEEEEee----------------cCCCceEEEEEEeC-----CeE----EEEEecCCCcc Confidence 77776666674 57787666555310 01111223334321 100 001111110 Q ss_pred ---eeccccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhh- Q lcl|Aclame:pro 225 ---RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVED-YIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDY- 299 (510) Q Consensus 225 ---~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~-~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~- 299 (510) ........++..+|++.++.+...+..||+|=..+ ..+-+-.++...-.........+.|...+. |.. ++.. T Consensus 189 ~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~--G~~-~~~~~ 265 (480) T protein:vir:78 189 QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVT-TDELT 265 (480) T ss_pred ccccccccccCCCCCcceEEeecccccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhh--cCC-ccccc Confidence 01111122346799999999888899999997765 467777888777777777776666654442 221 1110 Q ss_pred -------hcCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-----cCCCCCC-CCHHHHHHHHH Q lcl|Aclame:pro 300 -------QDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAER-VTAEEVRITAE 366 (510) Q Consensus 300 -------~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~-vTAtEi~~r~~ 366 (510) .....|.+..-...+++..+++. ++++.. ++.++.-|.+.+.... +...+.. -++.-++.+.. T Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~---~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~ 341 (480) T protein:vir:78 266 NDGENTTLDIYYGRILTLASEAAKISEFKA-AELRNF---AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDS 341 (480) T ss_pred cccccchhhhhhhhhccCCCCCceEEecCc-cCHHHH---HHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHH Confidence 01111222111112333334332 344433 4444444444332211 1111112 13332322211 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHhh-cCCCCCCcc--ceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHh Q lcl|Aclame:pro 367 EAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITK--QHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQ 443 (510) Q Consensus 367 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~p~~--~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q 443 (510) .+... .++.... +.+-+.+++.++.. .+. ..+.+ .+++.+..+. .-..++.+..+..+. ++..++ T Consensus 342 ~l~~k----a~~~~~~-f~~~l~~~~~l~~~~~g~-~~~~~~~~i~v~f~~~~-~~s~~~~ad~~~kl~---~~g~~~-- 409 (480) T protein:vir:78 342 RIVKM----AERKGRI-FGGAWERAMRIAMQIMGR-EVTEEYTRLETVWRDPS-TPTVAAKADAVSKLY---ANGQGP-- 409 (480) T ss_pred HHHHH----HHHHHHH-HHHHHHHHHHHHHHHcCC-CccccceeeeEEecCCC-CCCHHHHHHHHHHHH---Hhcccc-- Confidence 11111 1222222 22223333333221 111 11222 2334332221 112223333333222 222111 Q ss_pred HhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-H------------HHhhcccCCC Q lcl|Aclame:pro 444 LDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEG-A------------SDMTNALAGV 510 (510) Q Consensus 444 ~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~-a------------~~~~~~~ag~ 510 (510) +..+. +...+|..+ ++++.+.+.+++++..+.........+ + .++.+++.|- T Consensus 410 ----~s~et----~~~~lg~~~-------d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 474 (480) T protein:vir:78 410 ----IPKEQ----ARIDLGYTA-------TQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGF 474 (480) T ss_pred ----CCHHH----HHhcCCCCH-------hHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCCCCCCCCCccccccCCC Confidence 11111 233355543 333322221111111111111100001 0 1111222222 No 52 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=98.42 E-value=7.5e-07 Score=54.16 Aligned_cols=450 Identities=9% Similarity=0.003 Sum_probs=179.4 Q ss_pred ChhHHHHHHHHHhc----cCchHHHHHHHHhhcc--------cccCCCCCCccccccccccchHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRD----GSVEQRAIEFAKTTLP--------YLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLF 68 (510) Q Consensus 1 ~k~~~~~r~~~lkr----~~~~~~w~e~~~~~~P--------~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~lt 68 (510) ||+-++.-|. -+- -.....+..++.=..+ .+|............++--+.+...++.+|+-|.+-.. T Consensus 7 ~~~~i~~w~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~~~A~ll~~e~~ 85 (518) T protein:vir:78 7 MTRFIKGWLN-GKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHDKLMNSGTGNEIVVVAAEYISGKPL 85 (518) T ss_pred HHHHHHHhhc-CCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccccccccCChHHHHHHHHHHhhcCCCc Confidence 6664443332 110 0111122111110000 11211111111111233223466667777766544321 Q ss_pred CccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCCCeEE Q lcl|Aclame:pro 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDEATVV 146 (510) Q Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l--~~~~~~~~~~ 146 (510) =+.++..+.. +.+.+.+.|+ +.+..++|+..+.+.+.+..+.|++++ |++....++. T Consensus 86 -----~i~v~~~~~~---------d~e~~~~~l~-------~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~i~ 144 (518) T protein:vir:78 86 -----SIDVTGVNGS---------KDENLTKQLK-------EALRIDNFDSKSVKIVELAGGSGVSAVKINILNGRPSIS 144 (518) T ss_pred -----eEEecCcccc---------CcHHHHHHHH-------HHHHhccHHHHHHHHHHHhhccCceEEEEEEECCeeEEE Confidence 1333222111 1122344444 448889999999999999999998874 6666555677 Q ss_pred EEEeceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhh-----cccccCCCCceEEEEEEEEeecCC-Ce-----eEE Q lcl|Aclame:pro 147 AWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLM-----RAGRNLSGSGSVDLYTHVQRRKGT-AM-----DYA 215 (510) Q Consensus 147 ~~pl~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~-----~~~~~~~~~~~v~v~~~v~~~~~~-~~-----~~~ 215 (510) +++-..|+... .+|++..+.+-.+....+-..-+- .+. ........+....|-..+++.+.. .. |.. T Consensus 145 ~v~ad~~~P~~-~~g~~~~~~f~~~~~~~~k~~~y~-~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~ 222 (518) T protein:vir:78 145 VHSSSQFWIDF-KNNEPFRFNFFEEIPTSNKADIYY-LVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLP 222 (518) T ss_pred EEcCCeeEEEe-ecCcEEEEEEEEEeecCCcceeEE-EEEeeccccccceeecccceeEEEEEeeecCcccccccccccc Confidence 77777766654 358877766544332210000000 000 000000001111111122222111 00 000 Q ss_pred EE---EEeeCCeeec-cccccccccCceEEEeeeec-----CCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCce Q lcl|Aclame:pro 216 EM---YHEIDGVRVG-ETGRWPIHLCPYIVPTWNLA-----PGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLN 286 (510) Q Consensus 216 sv---~~e~~~~~~~-~~~~y~~~~~P~~~~Rw~~~-----~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~ 286 (510) .. ..+.++.... ...+. ...|+++...+.. .+++||+|-...+.+.++.||..--++..-... .+... T Consensus 223 ~~l~~~~~~~~~~e~~~~~tg--~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i 299 (518) T protein:vir:78 223 EQITSYLHTNDIQLNHSVSIG--LKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEK-TKTKI 299 (518) T ss_pred cccccccccccCccceeeccC--CccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHh-CCcee Confidence 00 0011111100 00011 2357777765543 467889999999999999999988877777654 66665 Q ss_pred eeCCCCccchhhhhcCCCc----------ce--ecCCc----c---ccccccCCCccchHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 287 LVDEAKGAVVDDYQDAEMG----------DY--VPGGA----E---AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG 347 (510) Q Consensus 287 lv~~~g~~~~~~~~~~~~G----------~~--~~g~~----~---~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~ 347 (510) .|++ .+++.+.- ....+ .+ +.|.. + .+..++..- ........++.+-+.|....=++ T Consensus 300 ~v~~-~~l~~~~~-~~~~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~I--r~e~~~~~~~~~l~~~~~~~G~s 375 (518) T protein:vir:78 300 AASE-RMFRKKVN-KSTDKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDF--RDGSYRETMEYFAQKAVSKSGYN 375 (518) T ss_pred eech-hHhccCCC-CCCCccccccCCCCceEEEecCcCCCCCccccceeeeeccc--ChHHHHHHHHHHHHHHHHhhCCC Confidence 6643 33321110 00000 01 11111 1 111111110 11122233333333332222111 Q ss_pred --ccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhc-C----CCCCCccceeeEEeecH--HH Q lcl|Aclame:pro 348 --ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDA-L----LQGLITKQHKPAIETGL--PA 418 (510) Q Consensus 348 --~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~-~----l~~~p~~~~~~~~vs~l--~~ 418 (510) .+--++...|||||..+.+...+.+.-.-..+. ..|.-|+...+.++.-. + ..+.++..+.+.+-.++ +. T Consensus 376 ~~tfg~~~~~~TATei~s~~~~~~~t~~~~~~~~e-~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~ 454 (518) T protein:vir:78 376 PATFNLGNREVKATEIWSLQDATVRKIEKKKRLIQ-NVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNL 454 (518) T ss_pred hhhcCcccccccHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCH Confidence 122234457999999988887666533222222 22223333333333211 1 11222223444443322 22 Q ss_pred HHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 419 LSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLE 498 (510) Q Consensus 419 l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~ 498 (510) ...++...+ .++ +|. +..+.+++.+ ..|+ |+||.+++-++-+.++++...+.+..+. T Consensus 455 ~~~~~~~~~------~v~--aGi------mS~e~~i~~~--~~~~-------~deea~~e~~ri~~E~~~~~~~~p~~~~ 511 (518) T protein:vir:78 455 NELSSTLNN------MNS--ALA------MSVEEKVKLI--HPKW-------EDEEIQAEVKRIYLENAIGEVPDPEAIG 511 (518) T ss_pred HHHHHHHHH------HHh--cCC------CCHHHHHHHh--CCCC-------CHHHHHHHHHHHHHHhcccCCCCCcccc Confidence 222211111 111 121 2223334422 1122 4555555443333322222222222222 Q ss_pred HHHHhhcccCC Q lcl|Aclame:pro 499 GASDMTNALAG 509 (510) Q Consensus 499 ~a~~~~~~~ag 509 (510) +. +...| T Consensus 512 g~----~~~~g 518 (518) T protein:vir:78 512 GM----ETKGG 518 (518) T ss_pred CC----CCCCC Confidence 22 12222 No 53 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.41 E-value=7.7e-07 Score=54.10 Aligned_cols=436 Identities=10% Similarity=0.035 Sum_probs=177.7 Q ss_pred ChhHHHHHHHHH------h---cc----------CchHHHHHHHHhhcccccCCCCC-Cc-cccccccccchHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEKL------R---DG----------SVEQRAIEFAKTTLPYLMVDPMS-GS-RGVVEHDFQSAGALLVNNL 59 (510) Q Consensus 1 ~k~~~~~r~~~l------k---r~----------~~~~~w~e~~~~~~P~~~~~~~~-~~-~~~~~~~~dstg~~a~~~L 59 (510) ++..++.-+.++ + +. .....|+++++=--|.++..... .+ .....++--..+...++.+ T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~~iv~~~ 84 (499) T protein:vir:80 5 IIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNLPKVTAKYM 84 (499) T ss_pred HHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecchHHHHHHHH Confidence 333333333321 1 00 23446776664211211111110 11 1111223335566666777 Q ss_pred HHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--E Q lcl|Aclame:pro 60 AAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--Y 137 (510) Q Consensus 60 aa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l--~ 137 (510) |+-|.+- |+ .++++|. +..++|.+ .+...+|...+.++..+....|.+.+ | T Consensus 85 a~~l~~e--p~-----~i~~~d~-------------~~~e~l~~-------~~~~n~f~~~~~~~~~~a~~~G~~~~~~~ 137 (499) T protein:vir:80 85 SKLLFNE--KV-----KINIDDE-------------TAEEFVLN-------VLKTNGFTKNMERYIEYGEAMGGFVIKVY 137 (499) T ss_pred HHhhhCC--cc-----eEeeCCH-------------HHHHHHHH-------HHhhccHHHHHHHHHHHHhhcCcEEEEEE Confidence 6654432 22 2333432 23334433 46677899999999999999998875 4 Q ss_pred EeCCC-CeEEEEEeceEEE-eeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecC-----C Q lcl|Aclame:pro 138 RNSDE-ATVVAWSLRSYAV-RRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKG-----T 210 (510) Q Consensus 138 ~~~~~-~~~~~~pl~~~~v-~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~-----~ 210 (510) .+.+. .++..++-.+++- ..| .|++..+.+...++.+. ..-..+..-......+....|-+.++..++ . T Consensus 138 ~D~~~~~~i~~v~a~~~~Pi~~d-~~~~~~~~f~~~~~~~~---~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~ 213 (499) T protein:vir:80 138 HDGNKNVKVSFATADCMYPLSND-SENVDECLIANSFHKNN---KYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGG 213 (499) T ss_pred ECCCCcEEEEEEcCCceEEEEec-CCCeEEEEEEEEEeecC---eEEEEEEEEEecccceeeEEEEEEEEeccCccccCc Confidence 55442 2467778777664 455 58887777655554211 000000000000000111112222222111 1 Q ss_pred CeeEEEEEEeeCCeeeccccccccccCceEEEe----eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCce Q lcl|Aclame:pro 211 AMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPT----WNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLN 286 (510) Q Consensus 211 ~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~R----w~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~ 286 (510) ..|...+|-.. .......++ ...||+.++ .++..++++|+|-...+.+-+..|+..--......+. .+..+ T Consensus 214 ~v~l~~~~~~~--~~~~~~~~~--~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~-~~~~i 288 (499) T protein:vir:80 214 KVSLKLLFNDI--EPVVPLPSL--TRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-GKKKV 288 (499) T ss_pred ccchhhhccCc--CCceeecCC--CccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHh-cccce Confidence 12222221110 000011122 334555544 3445688999999999999999999887777666554 34444 Q ss_pred eeCCCCccchhhhhc--------CCCcc--eecCCcc----ccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccC Q lcl|Aclame:pro 287 LVDEAKGAVVDDYQD--------AEMGD--YVPGGAE----AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQ 350 (510) Q Consensus 287 lv~~~g~~~~~~~~~--------~~~G~--~~~g~~~----~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~ 350 (510) .|++ .++.+..-.. ..... .+.+... .+..++..- .-......++.+.+.|....=++ .+. T Consensus 289 ~v~~-~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i--r~e~~~~~l~~~l~~i~~~~g~s~~~fg 365 (499) T protein:vir:80 289 LVPS-SFVKTAVNLDGSTTQYFDSTDEAFFLYQGEQDDNGKAIKDISVEI--RSTEFIESINAMLRIYAMQVGLSAGTFT 365 (499) T ss_pred ecch-hhhhccCCCCCCcccCCCcccceeeEeeccCCCCcCceeEecCcC--ChHHHHHHHHHHHHHHHHhcCCChhhcC Confidence 4532 3332210000 00000 1111111 122111110 11112233333333332222111 111 Q ss_pred -CCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCC---CCCCccceeeEEeecHHHHHHHHHHH Q lcl|Aclame:pro 351 -RDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALL---QGLITKQHKPAIETGLPALSRSAAVQ 426 (510) Q Consensus 351 -~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l---~~~p~~~~~~~~vs~l~~l~r~~~~~ 426 (510) ......|||||..+.+.......-.-..+ ..-|..|++-++.+..-.+. ...++..+.+.+-.++.. ....+++ T Consensus 366 ~~~~g~~TAtei~s~~~~l~~~~~~~~~~~-~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~-d~~~~~~ 443 (499) T protein:vir:80 366 FDENGLKTATEVVSEKSETYQTKNSHSQLI-EQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQ-DEDTTIN 443 (499) T ss_pred CCcccchhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCC-CHHHHHH Confidence 12234599999988887777755422222 33344444444433221111 112234455555333221 1122222 Q ss_pred HHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 427 SMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNA 506 (510) Q Consensus 427 ~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~ 506 (510) ... +.++ +|. +... ..++...|++ ++|.+++.++.+ +.++...+ .+. T Consensus 444 ~~~---~~~~--~Gi------~S~e---t~l~~~~~~~-------d~ea~~el~~i~--~E~~~~~~----------~~d 490 (499) T protein:vir:80 444 RYT---TAKN--QGM------IPLK---IALQRAWNIT-------EAEADEWAEMLA--KEKQAEIP----------NND 490 (499) T ss_pred HHH---HHHH--cCC------CCHH---HHHhhcCCCC-------hHHHHHHHHHHH--HHhhcCCC----------CCC Confidence 222 1111 121 1111 2245556664 333332222211 11111000 112 Q ss_pred cCCC Q lcl|Aclame:pro 507 LAGV 510 (510) Q Consensus 507 ~ag~ 510 (510) ..|. T Consensus 491 ~~g~ 494 (499) T protein:vir:80 491 MTGI 494 (499) T ss_pred cccc Confidence 2333 No 54 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=98.39 E-value=9e-07 Score=53.73 Aligned_cols=449 Identities=10% Similarity=0.022 Sum_probs=174.9 Q ss_pred ChhHHHHHHHHH-----h---cc----------CchHHHHHHHHhhcccccCCCCCCcccccccccc--chHHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEKL-----R---DG----------SVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQ--SAGALLVNNLA 60 (510) Q Consensus 1 ~k~~~~~r~~~l-----k---r~----------~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~d--stg~~a~~~La 60 (510) ||.-+.+...++ . .. ....+|+.+++=--+.+ . ..+..+....+-.. ..+...++.+| T Consensus 7 ~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~-~-~~~~~~~~~~~~~~slnl~~~i~~~~A 84 (522) T protein:vir:47 7 VKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDV-Q-YKNTDGDIKSRPMNHLPIARTASKKIA 84 (522) T ss_pred HHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccc-c-ccccCcchhcccceecchHHHHHHHHh Confidence 222222222111 1 00 11124444433100100 0 00111111112233 34445555555 Q ss_pred HHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE--EEE Q lcl|Aclame:pro 61 AKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--LYR 138 (510) Q Consensus 61 a~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~--l~~ 138 (510) +-+.+-.. .++++|+ .+.++|+ +.+..++|+..+.+++....+.|+++ .|+ T Consensus 85 ~lv~~e~~-------~i~v~d~-------------~~~~~l~-------~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~ 137 (522) T protein:vir:47 85 SLVYNEQA-------TITTKNE-------------ILQKFLD-------DMLTNDRFNKNFERYLESCLALGGLAMRPYI 137 (522) T ss_pred hhhcCCcc-------eeecCCh-------------HHHHHHH-------HHHhhcchHHHHHHHHHHhhccCCEEEEEEE Confidence 44433211 2223332 2334443 44778999999999999999988766 577 Q ss_pred eCCCCeEEEEEeceEE-EeeCCCCceeEEEEEEEe-cHHHHhH--------HhhHHhhcccccCCCCceEEEEEEEEeec Q lcl|Aclame:pro 139 NSDEATVVAWSLRSYA-VRRDATGRWMDIVLKQRY-KSKDLDD--------VYKQDLMRAGRNLSGSGSVDLYTHVQRRK 208 (510) Q Consensus 139 ~~~~~~~~~~pl~~~~-v~~d~~G~v~~i~r~~~~-t~~~l~~--------~~~~~~~~~~~~~~~~~~v~v~~~v~~~~ 208 (510) +.+..++.+++-..++ +..|..|.+..++..... +-+.-.. +|...-.........+....|-+..+.-+ T Consensus 138 d~~~~~i~~v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~ 217 (522) T protein:vir:47 138 DGDKVRVAFIQAPVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSD 217 (522) T ss_pred cCCceEEEEEcCCceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecC Confidence 7665567778877766 467777765544332221 1111000 00000000000001111222222222211 Q ss_pred -----CCCeeEEEEEEeeCCeeeccccccccccCceE-E---Eeeee-cCCCccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 209 -----GTAMDYAEMYHEIDGVRVGETGRWPIHLCPYI-V---PTWNL-APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYE 278 (510) Q Consensus 209 -----~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~-~---~Rw~~-~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~ 278 (510) +...|...+....+-.......+. .-|.. . +.++. ..+++||+|-...+.+.++.||..--++..-. T Consensus 218 ~~~~lG~~v~l~~~~e~~~l~~~~~~~~~---~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~ 294 (522) T protein:vir:47 218 VNDVLGQRVNLSELDKYKNLEPVTVFENL---SRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEV 294 (522) T ss_pred CCcccCccccccccccccCCCCceEeCCC---CcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHH Confidence 111222222111110000011111 12322 2 22333 34789999999999999999998766666544 Q ss_pred HHhhCCceeeCCCCccchhhhh-----------cCCCcceecC-----CccccccccCCCccchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 279 LESLEVLNLVDEAKGAVVDDYQ-----------DAEMGDYVPG-----GAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQ 342 (510) Q Consensus 279 ~~a~~~~~lv~~~g~~~~~~~~-----------~~~~G~~~~g-----~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~ 342 (510) ...-. ...|++ .+++...-. ....-.+++. ..+.+..++.. -........++.+-+.|.. T Consensus 295 ~~g~~-~i~v~~-~~l~~~~~~~~g~~~~~~~fd~~~~~f~~~~~~~~~~~~i~~~~~~--ir~e~~~~~~~~~l~~i~~ 370 (522) T protein:vir:47 295 RMGQR-RVIVPE-HLTQRQYQRPDGTIDFRPRFDVEQNVYMQIGGSSMDAGGITDLTSP--IRANDYILAISEGLKLFEM 370 (522) T ss_pred Hhccc-eeecch-HHhccCCCCCCcccccccccCcccceEeecCCCCCCCCcceeeccc--cChHHHHHHHHHHHHHHHH Confidence 43222 223321 222211000 0000112211 11122222111 1122233344444444433 Q ss_pred HHhhc--ccCCC-CCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCC-CC--CccceeeEEeecH Q lcl|Aclame:pro 343 AFMYG--ANQRD-AERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQ-GL--ITKQHKPAIETGL 416 (510) Q Consensus 343 af~~~--~~~~~-~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~-~~--p~~~~~~~~vs~l 416 (510) ..-+. .+.-+ +...|||||..+.+...+...-.-..+ ..-|..|+.-++.++.-.++. .. ....+.+.+-.++ T Consensus 371 ~~gls~~tf~~~~~~~kTAtEi~s~~~~~~~t~~~~~~~~-~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i 449 (522) T protein:vir:47 371 QIGVSSGMFTFDGQGMKTATEIVSENSDTYQMRSSIVALV-EQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGV 449 (522) T ss_pred HhCCCccccCccccccccHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCC Confidence 32111 12212 233499999999999988877644333 344556666666555432221 12 2223444443333 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 417 PALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETL 496 (510) Q Consensus 417 ~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~ 496 (510) .. .+..+++...+. ++ +|. +.... .+....|++ +||.+++.++.+..+. .+.+... T Consensus 450 ~~-D~~~~~~~~~~~---v~--aG~------~s~e~---~i~~~~g~~-------eeea~~el~ri~~E~~--~~~~~~~ 505 (522) T protein:vir:47 450 FT-DRHAELDYWAKM---VA--AGF------STKKR---AIGKTLNIS-------GVEAEKELNAINSELL--PMNDAEL 505 (522) T ss_pred CC-CHHHHHHHHHHH---Hh--cCC------CCHHH---HHHhcCCCC-------hHHHHHHHHHHHHhhc--cCCCCCC Confidence 22 112222222211 11 121 11122 234455654 3444433332222111 1111100 Q ss_pred HHHHH-HhhcccCCC Q lcl|Aclame:pro 497 LEGAS-DMTNALAGV 510 (510) Q Consensus 497 ~~~a~-~~~~~~ag~ 510 (510) --..+ ......++= T Consensus 506 ~~~~~~~~~~~~~d~ 520 (522) T protein:vir:47 506 AIYGMHDQNEEKADD 520 (522) T ss_pred CCCCCCCcccccCCC Confidence 00000 000011111 No 55 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=98.38 E-value=9.3e-07 Score=53.66 Aligned_cols=425 Identities=10% Similarity=0.047 Sum_probs=171.8 Q ss_pred ChhHHHHHHH-HHh-------------ccCchHHHHHHHHhhcccc-cC-CCCCCccccccccccchHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWE-KLR-------------DGSVEQRAIEFAKTTLPYL-MV-DPMSGSRGVVEHDFQSAGALLVNNLAAKLA 64 (510) Q Consensus 1 ~k~~~~~r~~-~lk-------------r~~~~~~w~e~~~~~~P~~-~~-~~~~~~~~~~~~~~dstg~~a~~~Laa~l~ 64 (510) +|+...+.+. .|+ +......|..+++=--|.+ +. .++....++ +.-=..+...++.+|+-+. T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~--~~slnl~~~i~~~~A~lv~ 88 (500) T protein:vir:30 11 VTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRD--LNHLPIARTAAKKIASLVF 88 (500) T ss_pred HHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCc--eeecchHHHHHHHHhhhhc Confidence 2222222111 111 1113345666665322222 11 111111111 1111344555555555333 Q ss_pred HhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE--EEEeCCC Q lcl|Aclame:pro 65 RSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--LYRNSDE 142 (510) Q Consensus 65 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~--l~~~~~~ 142 (510) +-. | .++++|+ ...++|++ .+..++|+..+.+++.+..+.|.++ .|.+.+. T Consensus 89 ~e~--~-----~i~~~d~-------------~~~~~l~~-------il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~ 141 (500) T protein:vir:30 89 NEQ--A-----EIKVDDD-------------AANEFISE-------TLKNDRFNKNFERYLESCLALGGLAMRPYVDGDK 141 (500) T ss_pred CCc--c-----eEecCCh-------------HHHHHHHH-------HHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCc Confidence 211 1 1233332 23334443 4778899999999999999999876 4666655 Q ss_pred CeEEEEEeceEEE-eeCCCCceeEEEEEE-EecHHHHhHHhhHHhhcccccCCCCce-----------EEEEEEEEeec- Q lcl|Aclame:pro 143 ATVVAWSLRSYAV-RRDATGRWMDIVLKQ-RYKSKDLDDVYKQDLMRAGRNLSGSGS-----------VDLYTHVQRRK- 208 (510) Q Consensus 143 ~~~~~~pl~~~~v-~~d~~G~v~~i~r~~-~~t~~~l~~~~~~~~~~~~~~~~~~~~-----------v~v~~~v~~~~- 208 (510) ..+.+++...++- .-|..|.+..+|... ..+.. .+...+.. ..|-+.++..+ T Consensus 142 ~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~--------------~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~ 207 (500) T protein:vir:30 142 VRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTIN--------------GKEVYYTLIEFHEWQSSDDYVISNELYRSDD 207 (500) T ss_pred eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeec--------------CCceEEEEEEEEEEeCCceeEEEEEEEeccc Confidence 5677888777664 455556544433222 11110 00011111 11111222211 Q ss_pred ----CCCeeEEEEEEeeCCeeeccccccccccCceEEEe----eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 209 ----GTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPT----WNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELE 280 (510) Q Consensus 209 ----~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~R----w~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~ 280 (510) +...|...+|-...... ...++ ..-||..++ =+...++++|.|-...+.+.+..|+..--++....+. T Consensus 208 ~~~lG~~v~l~~~~~~l~~~~--~~~~~--~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~ 283 (500) T protein:vir:30 208 KAKVGSRVPLSEVYKDLKDEA--KVTDV--TRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM 283 (500) T ss_pred ccccCcccccccccCCcCcce--EeccC--CCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh Confidence 11223333321111100 11122 112232222 2334578899999999999999999988777766654 Q ss_pred hhCCceeeCCCCccchhhhhcCCCcceec---------------CCcc---ccccccCCCccchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 281 SLEVLNLVDEAKGAVVDDYQDAEMGDYVP---------------GGAE---AVRAYERGDYNKMAAIQQSLQAVVVRLNQ 342 (510) Q Consensus 281 a~~~~~lv~~~g~~~~~~~~~~~~G~~~~---------------g~~~---~v~~~~~~~~~~~~~~~~~i~~~~~~I~~ 342 (510) .+....|++ .++.+.. ...+|...+ +..+ .+..++.. -........++.+-+.|.. T Consensus 284 -g~~~i~v~~-~~l~~~~--~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~--ir~e~~~~~l~~~l~~i~~ 357 (500) T protein:vir:30 284 -GQRRVAVPE-SLTALTV--RTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTP--IRADDYIKAINEGLSLFEM 357 (500) T ss_pred -Ccceeeech-HHhcccC--CCCCccccCCcccCCCcceEEEcCCCCCcCcceeEeccc--cChHHHHHHHHHHHHHHHH Confidence 444445543 3433211 001111111 1111 11111100 0111122333333333332 Q ss_pred HHhhc--ccCCC-CCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhh----cCCCCCCccceeeEEeec Q lcl|Aclame:pro 343 AFMYG--ANQRD-AERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD----ALLQGLITKQHKPAIETG 415 (510) Q Consensus 343 af~~~--~~~~~-~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~----~~l~~~p~~~~~~~~vs~ 415 (510) ..-+. .+.-+ ....|||||..+.+.......-.-.. -..-+.-|+.-++.+..- ++.++ +...+.+.+--+ T Consensus 358 ~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~-~~~al~~lv~~il~~~~~~~~~~~~~~-~~~~v~v~f~d~ 435 (500) T protein:vir:30 358 QIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIVAL-VEQSLKELVISIFEIAKAYDLYQSEVP-SMDNISISLDDG 435 (500) T ss_pred HhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcCCCCC-CCcceEEEeCCC Confidence 22111 11111 23359999998888888876653333 334444455555544321 12222 222355544333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 416 LPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQET 495 (510) Q Consensus 416 l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~ 495 (510) +.. .+..+++... +.+++ |. +....+ +.+..|++ +||.+++.++-+ +.++ + T Consensus 436 i~~-d~~~~~~~~~---~~v~a--Gi------~s~~~~---i~~~~g~~-------eeea~~~l~~i~--~E~~---~-- 486 (500) T protein:vir:30 436 VFT-DRDAELDYWI---KVVNA--GF------GTREMA---IQKVLNVT-------EEKAQEIAAEIN--TGIV---D-- 486 (500) T ss_pred CCC-CHHHHHHHHH---HHHHc--CC------CCHHHH---HHhcCCCC-------HHHHHHHHHHHH--Hhcc---c-- Confidence 221 1122222222 22221 21 111122 34555654 444433322221 1100 0 Q ss_pred HHHHHHHhhcccCCC Q lcl|Aclame:pro 496 LLEGASDMTNALAGV 510 (510) Q Consensus 496 ~~~~a~~~~~~~ag~ 510 (510) .-+..+....++|= T Consensus 487 -~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 487 -EINQQRTDTHLYGE 500 (500) T ss_pred -cCCCCCccccccCC Confidence 00000111111111 No 56 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=98.38 E-value=9.3e-07 Score=53.66 Aligned_cols=425 Identities=10% Similarity=0.047 Sum_probs=171.8 Q ss_pred ChhHHHHHHH-HHh-------------ccCchHHHHHHHHhhcccc-cC-CCCCCccccccccccchHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWE-KLR-------------DGSVEQRAIEFAKTTLPYL-MV-DPMSGSRGVVEHDFQSAGALLVNNLAAKLA 64 (510) Q Consensus 1 ~k~~~~~r~~-~lk-------------r~~~~~~w~e~~~~~~P~~-~~-~~~~~~~~~~~~~~dstg~~a~~~Laa~l~ 64 (510) +|+...+.+. .|+ +......|..+++=--|.+ +. .++....++ +.-=..+...++.+|+-+. T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~--~~slnl~~~i~~~~A~lv~ 88 (500) T protein:vir:98 11 VTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRD--LNHLPIARTAAKKIASLVF 88 (500) T ss_pred HHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCc--eeecchHHHHHHHHhhhhc Confidence 2222222111 111 1113345666665322222 11 111111111 1111344555555555333 Q ss_pred HhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE--EEEeCCC Q lcl|Aclame:pro 65 RSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--LYRNSDE 142 (510) Q Consensus 65 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~--l~~~~~~ 142 (510) +-. | .++++|+ ...++|++ .+..++|+..+.+++.+..+.|.++ .|.+.+. T Consensus 89 ~e~--~-----~i~~~d~-------------~~~~~l~~-------il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~ 141 (500) T protein:vir:98 89 NEQ--A-----EIKVDDD-------------AANEFISE-------TLKNDRFNKNFERYLESCLALGGLAMRPYVDGDK 141 (500) T ss_pred CCc--c-----eEecCCh-------------HHHHHHHH-------HHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCc Confidence 211 1 1233332 23334443 4778899999999999999999876 4666655 Q ss_pred CeEEEEEeceEEE-eeCCCCceeEEEEEE-EecHHHHhHHhhHHhhcccccCCCCce-----------EEEEEEEEeec- Q lcl|Aclame:pro 143 ATVVAWSLRSYAV-RRDATGRWMDIVLKQ-RYKSKDLDDVYKQDLMRAGRNLSGSGS-----------VDLYTHVQRRK- 208 (510) Q Consensus 143 ~~~~~~pl~~~~v-~~d~~G~v~~i~r~~-~~t~~~l~~~~~~~~~~~~~~~~~~~~-----------v~v~~~v~~~~- 208 (510) ..+.+++...++- .-|..|.+..+|... ..+.. .+...+.. ..|-+.++..+ T Consensus 142 ~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~--------------~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~ 207 (500) T protein:vir:98 142 VRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTIN--------------GKEVYYTLIEFHEWQSSDDYVISNELYRSDD 207 (500) T ss_pred eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeec--------------CCceEEEEEEEEEEeCCceeEEEEEEEeccc Confidence 5677888777664 455556544433222 11110 00011111 11111222211 Q ss_pred ----CCCeeEEEEEEeeCCeeeccccccccccCceEEEe----eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 209 ----GTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPT----WNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELE 280 (510) Q Consensus 209 ----~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~R----w~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~ 280 (510) +...|...+|-...... ...++ ..-||..++ =+...++++|.|-...+.+.+..|+..--++....+. T Consensus 208 ~~~lG~~v~l~~~~~~l~~~~--~~~~~--~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~ 283 (500) T protein:vir:98 208 KAKVGSRVPLSEVYKDLKDEA--KVTDV--TRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM 283 (500) T ss_pred ccccCcccccccccCCcCcce--EeccC--CCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh Confidence 11223333321111100 11122 112232222 2334578899999999999999999988777766654 Q ss_pred hhCCceeeCCCCccchhhhhcCCCcceec---------------CCcc---ccccccCCCccchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 281 SLEVLNLVDEAKGAVVDDYQDAEMGDYVP---------------GGAE---AVRAYERGDYNKMAAIQQSLQAVVVRLNQ 342 (510) Q Consensus 281 a~~~~~lv~~~g~~~~~~~~~~~~G~~~~---------------g~~~---~v~~~~~~~~~~~~~~~~~i~~~~~~I~~ 342 (510) .+....|++ .++.+.. ...+|...+ +..+ .+..++.. -........++.+-+.|.. T Consensus 284 -g~~~i~v~~-~~l~~~~--~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~--ir~e~~~~~l~~~l~~i~~ 357 (500) T protein:vir:98 284 -GQRRVAVPE-SLTALTV--RTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTP--IRADDYIKAINEGLSLFEM 357 (500) T ss_pred -Ccceeeech-HHhcccC--CCCCccccCCcccCCCcceEEEcCCCCCcCcceeEeccc--cChHHHHHHHHHHHHHHHH Confidence 444445543 3433211 001111111 1111 11111100 0111122333333333332 Q ss_pred HHhhc--ccCCC-CCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhh----cCCCCCCccceeeEEeec Q lcl|Aclame:pro 343 AFMYG--ANQRD-AERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD----ALLQGLITKQHKPAIETG 415 (510) Q Consensus 343 af~~~--~~~~~-~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~----~~l~~~p~~~~~~~~vs~ 415 (510) ..-+. .+.-+ ....|||||..+.+.......-.-.. -..-+.-|+.-++.+..- ++.++ +...+.+.+--+ T Consensus 358 ~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~-~~~al~~lv~~il~~~~~~~~~~~~~~-~~~~v~v~f~d~ 435 (500) T protein:vir:98 358 QIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIVAL-VEQSLKELVISIFEIAKAYDLYQSEVP-SMDNISISLDDG 435 (500) T ss_pred HhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcCCCCC-CCcceEEEeCCC Confidence 22111 11111 23359999998888888876653333 334444455555544321 12222 222355544333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 416 LPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQET 495 (510) Q Consensus 416 l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~ 495 (510) +.. .+..+++... +.+++ |. +....+ +.+..|++ +||.+++.++-+ +.++ + T Consensus 436 i~~-d~~~~~~~~~---~~v~a--Gi------~s~~~~---i~~~~g~~-------eeea~~~l~~i~--~E~~---~-- 486 (500) T protein:vir:98 436 VFT-DRDAELDYWI---KVVNA--GF------GTREMA---IQKVLNVT-------EEKAQEIAAEIN--TGIV---D-- 486 (500) T ss_pred CCC-CHHHHHHHHH---HHHHc--CC------CCHHHH---HHhcCCCC-------HHHHHHHHHHHH--Hhcc---c-- Confidence 221 1122222222 22221 21 111122 34555654 444433322221 1100 0 Q ss_pred HHHHHHHhhcccCCC Q lcl|Aclame:pro 496 LLEGASDMTNALAGV 510 (510) Q Consensus 496 ~~~~a~~~~~~~ag~ 510 (510) .-+..+....++|= T Consensus 487 -~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 487 -EINQQRTDTHLYGE 500 (500) T ss_pred -cCCCCCccccccCC Confidence 00000111111111 No 57 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.36 E-value=1.1e-06 Score=53.36 Aligned_cols=413 Identities=11% Similarity=0.004 Sum_probs=162.9 Q ss_pred ChhHH----HHHHHHHhccCchHHHHHHHHhhcccccC-CCCCCcccccc--ccccchHHHHHHHHHHHHHHhhcCccCc Q lcl|Aclame:pro 1 MKSTA----AMLWEKLRDGSVEQRAIEFAKTTLPYLMV-DPMSGSRGVVE--HDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~k~~~----~~r~~~lkr~~~~~~w~e~~~~~~P~~~~-~~~~~~~~~~~--~~~dstg~~a~~~Laa~l~~~ltpp~~~ 73 (510) |-... ...+.++.+ ...+.+.+.+|..-..-. .-+..-...+. +..-+-+..+|++||..|. +-+ T Consensus 18 l~~~e~~~i~~L~~~~~~--~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~iVd~~a~rl~----~~G-- 89 (504) T protein:vir:99 18 LNDDVVDKVNGLYQQLVD--RTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKAVDTLARRCN----LES-- 89 (504) T ss_pred CCHHHHHHHHHHHHHHHH--HhHHHHHHHHHHhccccchhccccccHHHHHHhhccCcHHHHHHHHHhhhc----cce-- Confidence 33333 333333321 123444455554322110 00111111111 1233455666666666542 212 Q ss_pred ccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeC--CCC---eEEEE Q lcl|Aclame:pro 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS--DEA---TVVAW 148 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~--~~~---~~~~~ 148 (510) |+ .++... ... .+.+....++|.....++.++..++|.+.+++.. +.. .++++ T Consensus 90 -f~--~~d~~~------------~~~-------~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~ 147 (504) T protein:vir:99 90 -FV--WPDGDY------------GSI-------GGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVK 147 (504) T ss_pred -ee--CCCCCh------------hhH-------HHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEe Confidence 22 222110 011 1233466789999999999999999998776643 322 36667 Q ss_pred EeceEEEeeC-CCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeC--Cee Q lcl|Aclame:pro 149 SLRSYAVRRD-ATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEID--GVR 225 (510) Q Consensus 149 pl~~~~v~~d-~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~--~~~ 225 (510) |-.+.++..| ..+++...++... + ........+++|. + + ..+++..+ +.. T Consensus 148 sP~~~~~iyD~~~~~~~~a~~~~~---------------~--d~~g~~~~~~~y~---~--~-----~~~~~~~~~~~~~ 200 (504) T protein:vir:99 148 SAMQATGEWNSRRNAMDSLLSITS---------------R--DAEGHPTGIALYE---D--G-----VTVTADMDDDGDW 200 (504) T ss_pred ccceeEEEEeCCCCceeEEEEEEE---------------e--cCCCeEEEEEEEc---C--C-----cEEEEEEcCCcee Confidence 6555444444 4455544443221 0 0001112233332 1 1 11122111 111 Q ss_pred eccccccccccCceEEEeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccc--------- Q lcl|Aclame:pro 226 VGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVE-DYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAV--------- 295 (510) Q Consensus 226 ~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~-~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~--------- 295 (510) ......++. .+|++.+..+...++.||+|-.. ..++-+..+|...-..+..++..+.|...+- |+.. T Consensus 201 ~~~~~~~~~-gvPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~--G~~~~~~~~~d~~ 277 (504) T protein:vir:99 201 HADVRTHKL-GVPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILL--GADAKNFRNKDGS 277 (504) T ss_pred eeccccCCC-CcceEEecccccCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--cCCcccccccccc Confidence 221222232 37999988888889999999654 5678888888888888888887776644431 1110 Q ss_pred hhhhhcCCCcce--ecCCccc-------cccccCCCccchHHHHHHHHHHHHHHHHHHhhccc-------CCCCCCCCHH Q lcl|Aclame:pro 296 VDDYQDAEMGDY--VPGGAEA-------VRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN-------QRDAERVTAE 359 (510) Q Consensus 296 ~~~~~~~~~G~~--~~g~~~~-------v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~-------~~~~~~vTAt 359 (510) +........+.+ ++.+.+. +..-++. .++++... +.++.-|....+.... ..+..+-+|. T Consensus 278 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~-~~~l~~~~---~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~ 353 (504) T protein:vir:99 278 MKPAWQIALARVFALPDDEDEPDAARARADVKQFP-ASSPQPHI---EMLEQIAMMFSGETSIPVESLGFSNRANPTSAD 353 (504) T ss_pred ccchhhhhhhhhhcCCCccccccccCccceeeecC-CCChHHHH---HHHHHHHHHHHhhhCCCHHHhcccccccccHHH Confidence 111111111111 1222111 1111221 23454333 3333333333222211 1111222443 Q ss_pred HH-------HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEe-ecHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 360 EV-------RITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIE-TGLPALSRSAAVQSMLNA 431 (510) Q Consensus 360 Ei-------~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~v-s~l~~l~r~~~~~~~~~~ 431 (510) -| ..+++++...+|..+.++ +...+.++ ++....+.+..+++++ .....-..++.+..+..+ T Consensus 354 Ai~~~~~~L~~ka~~k~~~f~~~l~~~--------~rla~~~~--~~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl 423 (504) T protein:vir:99 354 AYIASREDLIAEAEGATDDWSPAFRRS--------MIRALAIK--NGLDRIPPEWKTIDSKFRSPLYLSKAAQADAGAKM 423 (504) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHh--cCCCccccccccceeEecCCCccCHHHHHHHHHHH Confidence 33 334445555555554431 12222332 2333344444333322 111222223333333322 Q ss_pred HHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 432 SQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 432 ~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) .+....+.... +.+...+|+++ +|++...+++++++ .+.+...+.. . +.....+|. T Consensus 424 ~~ag~~l~~~~------------~~l~~~lg~~~-------~ei~r~~~e~~~~~--~~~~~~~l~~-~-~~~~~~~~~ 479 (504) T protein:vir:99 424 LGAGPEWLKET------------EVGLELLGLTP-------QQAKRALAERRRAS--SVSIIEALNR-R-QQEAATAGE 479 (504) T ss_pred Hhhccccccch------------HHHHhhcCCCH-------HHHHHHHHHHHHHh--hHHHHHHHhc-c-cCCCCCCCC Confidence 22111111111 12344557754 34332221111111 1111111110 0 000001111 No 58 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=98.36 E-value=1.1e-06 Score=53.27 Aligned_cols=424 Identities=12% Similarity=0.083 Sum_probs=181.8 Q ss_pred ChhHHHHHHHHH----------h------ccC-------chHHHHHHHHhhcccccCCCCCCcccccccccc--chHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEKL----------R------DGS-------VEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQ--SAGALL 55 (510) Q Consensus 1 ~k~~~~~r~~~l----------k------r~~-------~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~d--stg~~a 55 (510) |.+.++..+.++ + |-. ....|+.+++=--|.+.... ..+....+... ..+... T Consensus 3 ~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~--~~~~~~~~~~~slnl~~~i 80 (505) T protein:vir:79 3 FWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKN--SYGDTQKHELQSVNVTKLA 80 (505) T ss_pred hHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccc--cCCCccccceeecchHHHH Confidence 444444333331 1 111 12346666542222111111 11111112233 344555 Q ss_pred HHHHHHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE Q lcl|Aclame:pro 56 VNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL 135 (510) Q Consensus 56 ~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~ 135 (510) ++.+|+-|.+- || +++++|. ...++|++ .+..++|+..+.+++.+..+.|.++ T Consensus 81 ~~~~A~ll~~e--~~-----~i~~~d~-------------~~~e~l~~-------i~~~n~f~~~~~~~~e~a~a~G~~~ 133 (505) T protein:vir:79 81 SAKLASLIFNE--QC-----QVTVSDE-------------TANDFLDD-------VFQQNDFYTTFEEKLEEWIALGSGC 133 (505) T ss_pred HHHHHhhhcCC--Cc-----eeecCCh-------------HHHHHHHH-------HHHhccHHHHHHHHHHHHhhcCCeE Confidence 55555544332 11 2333332 23344444 4778899999999999999999876 Q ss_pred E--EEeCCCCeEEEEEeceEE-EeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEe---ecC Q lcl|Aclame:pro 136 L--YRNSDEATVVAWSLRSYA-VRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQR---RKG 209 (510) Q Consensus 136 l--~~~~~~~~~~~~pl~~~~-v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~---~~~ 209 (510) + |+|.+..++.+++-..++ +..|. |++.++.+...++. ..++.. .+|+.++. .++ T Consensus 134 ~k~~~D~~~~~i~~v~ad~~~P~~~d~-~~~~~~a~~~~~~~---------------~~~~~~---~~yt~lE~h~~~~~ 194 (505) T protein:vir:79 134 VRPYVDSGKIKLAWATADQVYPLQADT-NQVNELAIASRTTE---------------VENHRT---IYYTLLEFHQWDHG 194 (505) T ss_pred EEEEEeCCceEEEEEcCCeeEEEEEcC-CCeEEEEEEEEEEE---------------ecCCcc---eEEEEEEEEEecCc Confidence 4 666655567788877766 45565 44544433322210 011111 12333322 222 Q ss_pred CCeeEEEEEEeeC----Ceee-----------cccccc-ccccCceEEEe---e-eecCCCccccchHHHHHHHHHHHHH Q lcl|Aclame:pro 210 TAMDYAEMYHEID----GVRV-----------GETGRW-PIHLCPYIVPT---W-NLAPGEHYGRGHVEDYIGDFAKLSL 269 (510) Q Consensus 210 ~~~~~~sv~~e~~----~~~~-----------~~~~~y-~~~~~P~~~~R---w-~~~~ge~YGrgp~~~~l~d~~~L~~ 269 (510) ....-...|...+ |..+ ..+..+ +....+|..++ + +...++++|+|-...+.+.+..||. T Consensus 195 ~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~ 274 (505) T protein:vir:79 195 DYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINR 274 (505) T ss_pred eEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHH Confidence 1111112222111 1111 011111 11122333322 2 2344678999999999999999998 Q ss_pred HHHHHHHHHHHhhCCceeeCCCCccchhhhhcCC------------Ccceec--C--CccccccccCCCccchHHHHHHH Q lcl|Aclame:pro 270 LSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE------------MGDYVP--G--GAEAVRAYERGDYNKMAAIQQSL 333 (510) Q Consensus 270 l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~------------~G~~~~--g--~~~~v~~~~~~~~~~~~~~~~~i 333 (510) .--++....+. .+....|++ .++++.....+. .-.+.. + +...+..++.. -........+ T Consensus 275 ~~s~~~~e~~~-g~~~i~v~~-~~l~~~~~~~~~~~~~~~~~fd~~~~~y~~~~~~~~~~~i~~~~~~--ir~e~~~~~l 350 (505) T protein:vir:79 275 THDQFVDEVKK-GQRRLIVPA-EWLKTGSSYGGQASETHPPMFDPDETVYQAMYGDASEVGFHDATSP--IRVADYQATM 350 (505) T ss_pred HHHHHHHHHHh-cccceeech-HHhcccCCCCcccccccccCCCccceeeeeccCCCCCCceEEeccc--CCHHHHHHHH Confidence 77666665543 333333432 332221110000 000111 1 11112222111 0112223344 Q ss_pred HHHHHHHHHHHhhc--ccCC-CCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCC---------C Q lcl|Aclame:pro 334 QAVVVRLNQAFMYG--ANQR-DAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALL---------Q 401 (510) Q Consensus 334 ~~~~~~I~~af~~~--~~~~-~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l---------~ 401 (510) +.+-++|....-+. .+.- .....|||||..+.+.......-.-..+ ...|..|++.++.+..-.++ . T Consensus 351 ~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l~~t~~~~~~~~-~~al~~li~~i~~~~~~~~~~~~g~~~~~~ 429 (505) T protein:vir:79 351 DFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQTYQTRSSYITQV-EKTIKALTYAILELASVPSFYADGQARWTG 429 (505) T ss_pred HHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhcccccccccccC Confidence 44444443322111 1111 2233599999999998888877644443 55667777777765432221 2 Q ss_pred CCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHH Q lcl|Aclame:pro 402 GLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEE 481 (510) Q Consensus 402 ~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~ 481 (510) ++++..+.+.+-.++.. .+..+++...+..+ + |. +... ..+....|++ +||++++.++ T Consensus 430 ~~~~~~i~v~f~d~i~~-d~~~~~~~~~~~v~---~--Gi------~s~e---~~l~~~~~~~-------eeea~~el~r 487 (505) T protein:vir:79 430 DVDSLDITINFNDGVFV-DQESKRAADLQAVQ---A--QV------MPKK---QFLMRNYGLD-------EEEADEWLAQ 487 (505) T ss_pred CCCceeEEEEeCCCCCC-CHHHHHHHHHHHHH---c--CC------CCHH---HHHHhcCCCC-------hHHHHHHHHH Confidence 23333444444433321 22222222222221 1 21 1111 2234555654 3444433332 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 482 QRRQAAQAQAAQETLLEGASDMTNA 506 (510) Q Consensus 482 ~~qqa~~~~~a~~~~~~~a~~~~~~ 506 (510) -+..+. ..++..++.+|. T Consensus 488 i~~E~~-------~~~p~~~~~gg~ 505 (505) T protein:vir:79 488 IDAENS-------TAEPEFNQFGGD 505 (505) T ss_pred HHHhcc-------ccCCCchhccCC Confidence 221111 112334455555 No 59 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=98.30 E-value=1.5e-06 Score=52.45 Aligned_cols=416 Identities=11% Similarity=0.013 Sum_probs=163.7 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhcccc-cC--CCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCccccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYL-MV--DPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~~-~~--~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l 77 (510) +-+.|.+.|+. +. .+.+++.+|..-.. .+ +.......+..+..-+-+..+|++++..| +|.+ |+. T Consensus 17 ~~~~l~~~~~~--~~---~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l----~~~g---~~~ 84 (485) T protein:vir:10 17 ARDEMVSAFED--ST---QNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAERQ----AVEG---FRF 84 (485) T ss_pred HHHHHHHHHHH--HH---HHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhhh----cccc---eec Confidence 22223233221 11 22333333322210 00 00000011111233456667777766655 3322 222 Q ss_pred CCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC-----------CeEE Q lcl|Aclame:pro 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-----------ATVV 146 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~-----------~~~~ 146 (510) ++.. +..+ .+.+.+..++|.....++.++..++|.+.+++..+. .+++ T Consensus 85 --~~~~------------~~~~-------~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~ 143 (485) T protein:vir:10 85 --GDAD------------EADE-------ELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIR 143 (485) T ss_pred --CCCc------------hhHH-------HHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEE Confidence 1110 0111 122335678999999999999999999876655432 1367 Q ss_pred EEEeceEEEeeC-CCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCee Q lcl|Aclame:pro 147 AWSLRSYAVRRD-ATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVR 225 (510) Q Consensus 147 ~~pl~~~~v~~d-~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~ 225 (510) +++..+.++..| ..+++...++.+. . ...+.-..+++|+ ++. .+.++ ..++.. T Consensus 144 ~~~p~~~~~~~D~~~~~~~~~~~~~~-~----------------~~~~~~~~~~~y~-----~~~---~~~~~-~~~~~~ 197 (485) T protein:vir:10 144 VEPPTRMYAEIDPRIGRVSKAIRVAY-D----------------AEGNEIQAATLYT-----PND---IFGWY-RVENEW 197 (485) T ss_pred EEccceeEEEEcCCCCceeEEEEEEE-e----------------eCCCeEEEEEEEe-----CCe---EEEEE-EcCCce Confidence 777666555555 4566665555432 0 0011112233332 110 01111 111111 Q ss_pred -eccccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhhCCceeeC---CCCc-cc---h Q lcl|Aclame:pro 226 -VGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVED-YIGDFAKLSLLSEKLGLYELESLEVLNLVD---EAKG-AV---V 296 (510) Q Consensus 226 -~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~-~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~---~~g~-~~---~ 296 (510) .......++..+|++.+..+...+..||+|=... .++-+..++...-.+...++..+.|...+. ++.. .. . T Consensus 198 ~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~ 277 (485) T protein:vir:10 198 QEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETG 277 (485) T ss_pred EEeccccCCCCcccEEEeccccccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCccccccccccc Confidence 1112223446899999999999999999996654 446667777777777777777776654432 1100 00 0 Q ss_pred hhhhcCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-----cCCCCCC-CCHHHHH-------H Q lcl|Aclame:pro 297 DDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAER-VTAEEVR-------I 363 (510) Q Consensus 297 ~~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~-vTAtEi~-------~ 363 (510) ..+.....|.+......+++..+... ++++ ..++.++.-|++...... +...... .++.-++ . T Consensus 278 ~~~~~~~~~~i~~~~~~d~k~~q~~~-~~~~---~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ 353 (485) T protein:vir:10 278 QTLFDAYLARILAFEDAEGKIQQFSA-AELA---NFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIK 353 (485) T ss_pred chhhhhcccceeccCCCCceEEeecc-cchH---HHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHH Confidence 11111122332221112333333322 2333 344445555544432211 1111111 2333332 2 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccc--eeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcCh Q lcl|Aclame:pro 364 TAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQ--HKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI 441 (510) Q Consensus 364 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~--~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~ 441 (510) +++++...+++.+.++ +..++.++. . ...+.+. +++.+..++ +-..++.++.+..+.+ ...+. T Consensus 354 k~~~k~~~f~~~l~~~--------~~l~~~~~~-~--~~~~~~~~~i~v~w~~~~-~~~~~~~ada~~kl~~---ag~~~ 418 (485) T protein:vir:10 354 KVERKNSIFGGAWEEA--------MRLAYRMMK-G--GDVPPDMLRMETVWRDPS-TPTYAAKADAASKLYN---GGTGV 418 (485) T ss_pred HHHHHHHHHHHHHHHH--------HHHHHHHhC-C--CCCcccceeeeEEecCCC-CCCHHHHHHHHHHHHh---ccccC Confidence 3334444444433322 222222221 1 2222233 344443322 2122222222222222 10011 Q ss_pred HhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HH-----------Hhhccc Q lcl|Aclame:pro 442 AQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEG---AS-----------DMTNAL 507 (510) Q Consensus 442 ~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~---a~-----------~~~~~~ 507 (510) +..+.+ ...+|+.+.. .++++...++++.+...+..+.....++ +. +.+|+. T Consensus 419 ------~s~et~----~~~lg~~~~~----~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (485) T protein:vir:10 419 ------IPRERA----RKDMGYSIAE----REEMRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPAPAPKPAALESGGDA 484 (485) T ss_pred ------CCHHHH----HHhCCCCHhH----HHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCCccccccCcCCCCCCCC Confidence 111222 2345665321 1333333332222222111111100000 00 001111 Q ss_pred C Q lcl|Aclame:pro 508 A 508 (510) Q Consensus 508 a 508 (510) | T Consensus 485 ~ 485 (485) T protein:vir:10 485 A 485 (485) T ss_pred C Confidence 1 No 60 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=98.25 E-value=2e-06 Score=51.83 Aligned_cols=456 Identities=13% Similarity=0.084 Sum_probs=181.7 Q ss_pred ChhHHHHHHHHH---------h---c----------cCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEKL---------R---D----------GSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNN 58 (510) Q Consensus 1 ~k~~~~~r~~~l---------k---r----------~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~ 58 (510) |-++++.-|.+. + + .....+|+.+++=-.|.+....++....+..+.-=..+...+.. T Consensus 3 ~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~~~ 82 (517) T protein:vir:98 3 VIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSADV 82 (517) T ss_pred hHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHHHH Confidence 444444444321 1 1 01334577776533343221111111111111111233444444 Q ss_pred HHHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE--E Q lcl|Aclame:pro 59 LAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--L 136 (510) Q Consensus 59 Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~--l 136 (510) +|+ .+|.-- + .+.+++....+.. ........++|+++ +..++|+..+.+++.+..+.|+++ . T Consensus 83 ~A~----Ll~~e~-~--~i~v~d~~~~~~~--~~~~~~~~e~l~~i-------~~~n~f~~~~~~~~e~a~a~G~~a~k~ 146 (517) T protein:vir:98 83 LSG----LVFNEQ-C--EVYVSDAKDEEKK--DNSFKTAHEFIQHV-------FQHNKFIKNLSDYLEPTFALGGLTVRP 146 (517) T ss_pred hhh----hhcCCc-c--eEEeccccccccc--ccchhHHHHHHHHH-------HHhccHHHHHHHHHHHHhhhCCEEEEE Confidence 444 444311 1 1223332211100 01122345555554 888899999999999999999876 4 Q ss_pred EEeCCCCeEEEEEeceEEE-eeCCCCceeEEE-EEEEecHHHHhHHhhHHhhccc--ccCCCCceEEEEEEEEeec---- Q lcl|Aclame:pro 137 YRNSDEATVVAWSLRSYAV-RRDATGRWMDIV-LKQRYKSKDLDDVYKQDLMRAG--RNLSGSGSVDLYTHVQRRK---- 208 (510) Q Consensus 137 ~~~~~~~~~~~~pl~~~~v-~~d~~G~v~~i~-r~~~~t~~~l~~~~~~~~~~~~--~~~~~~~~v~v~~~v~~~~---- 208 (510) |++.+..++.+++-..++- .-|..|.+..+| +++..+.+.=..-+- -+..-. .....+....|-+.++... T Consensus 147 ~~d~~~~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt-~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~ 225 (517) T protein:vir:98 147 YVDNGEIEFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYT-LLEFHEWEKTEEGESLYVITNELYKSDNEGE 225 (517) T ss_pred EEeCCeeEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEE-EEEEEecCceeccCCcEEEEEEEEecCCCcc Confidence 7777666677777766654 556666444332 333322111000000 000000 0000111233333333221 Q ss_pred -CCCeeEEEEEEeeCCeeeccccccccccCceEE----Eeee-ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 209 -GTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIV----PTWN-LAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESL 282 (510) Q Consensus 209 -~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~----~Rw~-~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~ 282 (510) +...|...+|-+.... ....+. ..|..+ +-.+ ...+++||+|-...+++.++.||..--++..-... . T Consensus 226 lG~~v~L~~~~e~l~~~--~~~~g~---~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~-g 299 (517) T protein:vir:98 226 IGKRIPLEELYEGMQEK--TYIQGL---SRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKM-G 299 (517) T ss_pred ccccccccccccCCCcc--eeECCC---CcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHh-C Confidence 1112222332111000 011111 224221 1223 33378999999999999999999877777665444 4 Q ss_pred CCceeeCCCCccchhhhhc-CCCc--------cee--cCCccccccccCCCccch--HHHHHHHHHHHHHHHHHHhhc-- Q lcl|Aclame:pro 283 EVLNLVDEAKGAVVDDYQD-AEMG--------DYV--PGGAEAVRAYERGDYNKM--AAIQQSLQAVVVRLNQAFMYG-- 347 (510) Q Consensus 283 ~~~~lv~~~g~~~~~~~~~-~~~G--------~~~--~g~~~~v~~~~~~~~~~~--~~~~~~i~~~~~~I~~af~~~-- 347 (510) +....|++ .+++++.=.. ...| .+. .+..+.. .++.- ..++ ....+.++.+-+.|....-+. T Consensus 300 ~~~i~vp~-~~l~~~~~~~g~~~~~~~d~~~~~y~~~~~~~~~~-~i~~~-~~~iR~e~~~~~~~~~L~~i~~~~Gls~~ 376 (517) T protein:vir:98 300 QRTVFVSD-VMLRTVPDESGMPPPQVFDPDVNVYKSIRMGTDEE-FVKDV-THDIRTEQYKEAINQALRTLEMELKLSVG 376 (517) T ss_pred CcceecCh-hhhccccCCCCcccCCCCCcccceeeeccCCCCCC-ceeee-ccccchHHHHHHHHHHHHHHHHHhCCCcc Confidence 44545543 3432211000 0001 001 1111100 00000 0111 123334444444443322111 Q ss_pred ccCCCC-CCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCC-C--CCccceeeEEeecHHHHHHHH Q lcl|Aclame:pro 348 ANQRDA-ERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQ-G--LITKQHKPAIETGLPALSRSA 423 (510) Q Consensus 348 ~~~~~~-~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~-~--~p~~~~~~~~vs~l~~l~r~~ 423 (510) .+.-++ .--|||||..+.+...+...- +.+.-...|.-|++-++.+..-..+. . ++...+.+.+-.++.. .+.. T Consensus 377 t~~~~~~~~kTATEi~s~~~~~~~t~~~-~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~-D~~~ 454 (517) T protein:vir:98 377 TFSFDGRSMKTATEIVSENDLTYRTRND-HVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQ-DRSA 454 (517) T ss_pred cccccccccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCC-CHHH Confidence 122222 224999999999988877665 33333334444444444332211111 1 1222345554333322 2222 Q ss_pred HHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 424 AVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDM 503 (510) Q Consensus 424 ~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~ 503 (510) +++...+. +++ |. +... ..+...+|++ +||.+++.++-+ +.... +.+ .....++ T Consensus 455 ~~~~~~~~---v~a--G~------ms~~---~~i~~~~g~~-------eeeA~~e~~~i~--~E~~~-~~~--~~~~~~~ 508 (517) T protein:vir:98 455 LLRFYGQA---KTF--GF------IPTV---EAIQRIFKVP-------KKTAEQWLEEIR--KDQIE-LDP--VTISQRA 508 (517) T ss_pred HHHHHHHH---Hhc--CC------CCHH---HHHHHhCCCC-------hHHHHHHHHHHH--Hhccc-cCC--CCccccc Confidence 22222221 111 21 1112 2234555654 344433322221 11111 111 1112223 Q ss_pred hcccCCC Q lcl|Aclame:pro 504 TNALAGV 510 (510) Q Consensus 504 ~~~~ag~ 510 (510) .+..+|= T Consensus 509 ~~~~~gd 515 (517) T protein:vir:98 509 QKRMFGD 515 (517) T ss_pred cCCCCCC Confidence 3333333 No 61 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=98.23 E-value=2.3e-06 Score=51.48 Aligned_cols=423 Identities=12% Similarity=0.002 Sum_probs=166.6 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhcccc-cCCCCCC--ccccccccccchHHHHHHHHHHHHHHhhcCccCccccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYL-MVDPMSG--SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~~-~~~~~~~--~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l 77 (510) =.+.+...+.++.+ ...+.+.+.+|..-.. .+..+.. ...+..++..+-+..+|+.+++.| ++.+ |.. T Consensus 4 ~~d~i~~L~~~~~~--~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l----~~~g---~~~ 74 (480) T protein:vir:78 4 YHEHVERLQGLLAR--DLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL----DIEG---FRI 74 (480) T ss_pred HHHHHHHHHHHHHH--HHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhh----ccCc---eec Confidence 23334444444421 1233344444433211 0000000 011111234556666677776655 3322 222 Q ss_pred CCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeC-------CCC--eEEEE Q lcl|Aclame:pro 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS-------DEA--TVVAW 148 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~-------~~~--~~~~~ 148 (510) . .|.. ..+ .+.+.+..++|.....++.++..++|.+.+++.. +.+ +++++ T Consensus 75 ~-~d~~-------------~~~-------~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~ 133 (480) T protein:vir:78 75 S-EDSE-------------GLE-------ELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVE 133 (480) T ss_pred C-CCch-------------hHH-------HHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEE Confidence 1 1111 111 2233466789999999999999999998765542 122 36677 Q ss_pred EeceEEEeeCC--CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE---EE-eecCCCeeEEEEEEeeC Q lcl|Aclame:pro 149 SLRSYAVRRDA--TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH---VQ-RRKGTAMDYAEMYHEID 222 (510) Q Consensus 149 pl~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~---v~-~~~~~~~~~~sv~~e~~ 222 (510) +..+.++..|+ .+++...+|.+.-. .+......+++|+. ++ +..+. +.. .+ ..+ T Consensus 134 ~p~~~~~i~D~~~~~~~~~~i~~~~~~----------------d~~~~~~~~~~y~~~~~~~~~~~~~-~~~-~~--~~~ 193 (480) T protein:vir:78 134 SPLYMYAELDPRNTRRVTRAVRLYTTR----------------DDVAVPDRATLYLPDETVPLRRNGG-LND-QW--VVD 193 (480) T ss_pred cccceEEEEcCCCccceEEEEEEEEee----------------cCCcceEEEEEEeCCeEEEEEecCC-Ccc-cc--ccc Confidence 76776666665 46676555554211 11111223344331 10 11111 000 00 001 Q ss_pred CeeeccccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhh--- Q lcl|Aclame:pro 223 GVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVED-YIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDD--- 298 (510) Q Consensus 223 ~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~-~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~--- 298 (510) ......++..+|++.+..+...+..||+|=..+ ..+-+..++...-.+...+...+.|...+. |...... T Consensus 194 ----~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~ 267 (480) T protein:vir:78 194 ----GDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELTND 267 (480) T ss_pred ----ccccccCCCCcceEEeecccccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhh--CCCccccccc Confidence 111123346799999999888899999997765 457777888777777777777676654432 2211100 Q ss_pred ----hhcCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-----cCCCCCCC-CHHHHHHHHHHH Q lcl|Aclame:pro 299 ----YQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAERV-TAEEVRITAEEA 368 (510) Q Consensus 299 ----~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~v-TAtEi~~r~~E~ 368 (510) ......|.+..-...++...+++. ++++. .++.++.-|...+.... +...+.+. ++.-+..+-.-+ T Consensus 268 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~---~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l 343 (480) T protein:vir:78 268 GENTTLDIYYGRILTLASEAAKISEFKA-AELRN---FAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRI 343 (480) T ss_pred cccchhhhhhhhhccCCCCCceEEecCc-cCHHH---HHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHH Confidence 000111211111112233333322 24443 33444444444332211 11112222 333222221111 Q ss_pred HHHhhhhHHHHHHHHHHHHHHHHHHHHhh-cCCCCCCcc--ceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHh Q lcl|Aclame:pro 369 ENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITK--QHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLD 445 (510) Q Consensus 369 ~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~p~~--~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~ 445 (510) .. -.++.+..| .+-+.+++.++.. .+ -..+.+ .+++.+..+. +-..++.+..+..+ +++..++ T Consensus 344 ~~----k~~~~~~~f-~~~l~~~~rl~~~~~~-~~~~~~~~~i~v~w~~~~-~~s~~~~ad~~~kl---~~~g~~~---- 409 (480) T protein:vir:78 344 VK----MAERKGRIF-GGAWERAMRIAMQIMG-REVTEEYTRLETVWRDPS-TPTVAAKADAVSKL---YANGQGP---- 409 (480) T ss_pred HH----HHHHHHHHH-HHHHHHHHHHHHHHcC-CCccccceeeeEEecCCC-CCCHHHHHHHHHHH---HHhcccC---- Confidence 11 123333333 2223333333221 11 111222 2344432221 11222222222222 2222111 Q ss_pred hcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhhcccCC------------C Q lcl|Aclame:pro 446 PRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQET-LLEGASDMTNALAG------------V 510 (510) Q Consensus 446 ~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~-~~~~a~~~~~~~ag------------~ 510 (510) +..+ .+...+|+. +++++.+.+.+++++.....+... ..+++..+....+| - T Consensus 410 --~s~e----t~~~~lg~~-------~d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 474 (480) T protein:vir:78 410 --IPKE----QARIDLGYT-------ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGF 474 (480) T ss_pred --CCHH----HHHhcCCCC-------HhHHHHHHHHHHHHHHHHHHHhhccccCCCccccCCCCCCCCCccCCCcccC Confidence 1111 123345554 334433322222222111111000 00111111111112 1 No 62 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=98.22 E-value=2.4e-06 Score=51.38 Aligned_cols=410 Identities=10% Similarity=-0.060 Sum_probs=164.7 Q ss_pred ChhHHHHHHHHH----h-ccCchHHHHHHHHh--hcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCc Q lcl|Aclame:pro 1 MKSTAAMLWEKL----R-DGSVEQRAIEFAKT--TLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~k~~~~~r~~~l----k-r~~~~~~w~e~~~~--~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~ 73 (510) |...-....+.| . +.+.....+++++- .+|.... .....-+..++..+-+...|+.++..| +|.+ T Consensus 1 ~~~~~~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~--~~~~~~~~~k~~~n~~~~ivd~~~~~l----~~~g-- 72 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRLSSWHCCIEGYYEGSNRVRDLGV--AIPPELQRVQTVVSWPGIAVDALEERL----DWLG-- 72 (441) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCc--ccchhhhhhhhhcchHHHHHHHHHhhh----cccc-- Confidence 444433223322 1 11222233344422 1222110 000111122345555666666665554 3332 Q ss_pred ccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeC--CCC-eEEEEEe Q lcl|Aclame:pro 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS--DEA-TVVAWSL 150 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~--~~~-~~~~~pl 150 (510) | ..++. ++++ +....++|.....++.++..++|.+.+++-. +.. ++++++. T Consensus 73 -~--~~~d~------------~~l~-----------~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p 126 (441) T protein:vir:80 73 -W--TNGDG------------YGLD-----------GVYAANRLATASCDVHLDALIFGLSFVAIIPHGDGTVSVRPQSP 126 (441) T ss_pred -c--cCCCh------------HHHH-----------HHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCceEEEEEcc Confidence 2 12221 1122 2345679999999999999999988654443 322 4677776 Q ss_pred ceEEEeeC-CCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCC-ee-ec Q lcl|Aclame:pro 151 RSYAVRRD-ATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDG-VR-VG 227 (510) Q Consensus 151 ~~~~v~~d-~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~-~~-~~ 227 (510) .+.++..| ..+++...++++... .+....+++|. + +.-. .|.+.++ .. .. T Consensus 127 ~~~~~i~d~~~~~~~~~~~~~~~~------------------~~~~~~~~vy~---~--~~~~----~~~~~~~~~~~~~ 179 (441) T protein:vir:80 127 KNCTGKFSADGSRLDAGLVVQQTC------------------DPEVVEAELLL---P--DVIV----QVERRGSREWVEV 179 (441) T ss_pred ceEEEEEeCCCCceeEEEEEEEEe------------------cCceEEEEEEe---c--CeEE----EEEEcCCcceeec Confidence 66555445 456777666655421 01112233332 1 1100 0111111 11 11 Q ss_pred cccccccccCceEEEeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccc---hhhhhcCC Q lcl|Aclame:pro 228 ETGRWPIHLCPYIVPTWNLAPGEHYGRGHVE-DYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAV---VDDYQDAE 303 (510) Q Consensus 228 ~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~-~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~---~~~~~~~~ 303 (510) ......++.+|++.+.-+...++.||+|-.. +.++-+..++...-......+..+.|...+. |... ........ T Consensus 180 ~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~--G~~~~~~~~~~~~~~ 257 (441) T protein:vir:80 180 DRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVT--GVSADEFSQPGWVLS 257 (441) T ss_pred cccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeee--cCCccccccchhhhc Confidence 1222344679999888888888999999654 4667777888877777778887777765542 2110 11111112 Q ss_pred Cccee--cCCcc--ccccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-----cCCCCCC-CCHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 304 MGDYV--PGGAE--AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAER-VTAEEVRITAEEAENTLG 373 (510) Q Consensus 304 ~G~~~--~g~~~--~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~-vTAtEi~~r~~E~~~~LG 373 (510) .|.+. +++.+ .+...+.. .++++... +.++.-|...+.... +...+.. -++.-++.+-..+. T Consensus 258 ~~~i~~~~~~~~~~~~~~~~~~-~~~~~~~~---~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~---- 329 (441) T protein:vir:80 258 MASVWAVDKDDDGDTPNVGSFP-VNSPTPYS---DQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLV---- 329 (441) T ss_pred ccccccCCCCCCCCcceeEecC-ccchHHHH---HHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHH---- Confidence 23332 22211 12222222 13444333 334443443332211 1111111 13333332222211 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhh--cCCCCCCc--cceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCC Q lcl|Aclame:pro 374 GTYSLLAENLQSPLAYVCLSEVDD--ALLQGLIT--KQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRIS 449 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~--~~l~~~p~--~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id 449 (510) =..++.+..|- +-+.+.+.++.. +.....+. ..+++.+..++ +-..++.++.+....+ + +... +. T Consensus 330 ~k~~~~~~~f~-~~l~~~~~l~~~~~~~~~~~~~~~~~i~~~f~~~~-~~~~~e~ad~~~kl~~---~--g~~~----~s 398 (441) T protein:vir:80 330 KRAERRQTSFG-QGWLSVGFLAAKALDSRVDEADFFGDVGLRWRDAS-TPTRAATADAVTKLVG---A--GILP----AD 398 (441) T ss_pred HHHHHHHHHHH-HHHHHHHHHHHHHhcCCCcccccceeeeEEeCCCC-CcCHHHHHHHHHHHHh---c--Cccc----cc Confidence 11223222222 223333333321 11111221 23444443333 2222222222222211 1 1110 11 Q ss_pred HHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 450 LPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNAL 507 (510) Q Consensus 450 ~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ 507 (510) . +.+...+|.+ ++|++...+ +++++..+..+ ....+.....+. T Consensus 399 ~----~~~~~~l~~~-------~~e~~~~~~-e~~e~~~~~~~---~~~~~~~~~~~~ 441 (441) T protein:vir:80 399 S----RTVLEMLGLD-------DVQVEAVMR-HRAESSDPLAV---LAGAISRQTNEV 441 (441) T ss_pred H----HHHHHhCCCC-------HHHHHHHHH-HHHHHHHHHHH---HhhhhhcccccC Confidence 1 1123344443 344443222 22212111111 111122222222 No 63 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.18 E-value=3e-06 Score=50.88 Aligned_cols=425 Identities=10% Similarity=0.017 Sum_probs=176.1 Q ss_pred ChhHHHHHHHHHh------ccCchHHHHHHHHhhcccccCCCC-CCccc-cccccccchHHHHHHHHHHHHHHhhcCccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLR------DGSVEQRAIEFAKTTLPYLMVDPM-SGSRG-VVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~k~~~~~r~~~lk------r~~~~~~w~e~~~~~~P~~~~~~~-~~~~~-~~~~~~dstg~~a~~~Laa~l~~~ltpp~~ 72 (510) |+++++..-...+ +-.....|+++++=--|-...... ..+.. ...++--..+...++.+|+-|.+- || T Consensus 18 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k~i~~~~a~~l~~~--p~-- 93 (496) T protein:vir:38 18 LLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPKVTAKYMSKLLFNE--KV-- 93 (496) T ss_pred cchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHHHHHHHHhhhhhCC--cc-- Confidence 2222222111110 001234566655421121111111 11111 111222345555666665544321 11 Q ss_pred cccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCC-CeEEEEE Q lcl|Aclame:pro 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE-ATVVAWS 149 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~--~~~~-~~~~~~p 149 (510) .++.++. +..++|.+ .+..++|...+.++..+...+|.+.+++ +.+. .++.++| T Consensus 94 ---~i~~~d~-------------~~~e~l~~-------~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~ 150 (496) T protein:vir:38 94 ---KINIDDK-------------AAEEFVLN-------VLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFAT 150 (496) T ss_pred ---eEeeCCh-------------HHHHHHHH-------HHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcEEEEEEc Confidence 1233332 22333333 4667889999999999999999987654 4432 2577788 Q ss_pred eceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEE----------EEEEEEeecCC-----CeeE Q lcl|Aclame:pro 150 LRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVD----------LYTHVQRRKGT-----AMDY 214 (510) Q Consensus 150 l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~----------v~~~v~~~~~~-----~~~~ 214 (510) -.+++-..+..|++..+.+...++.+ .+.+..++ |-+.++..++. ..|. T Consensus 151 ~~~~~P~~~~~~~~~~~~f~~~~~~~----------------~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~ 214 (496) T protein:vir:38 151 ADCMYPLSNDSENVDECVIANSFHKN----------------NKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSL 214 (496) T ss_pred ccceEEEEecCCcEEEEEEEEEEEeC----------------CeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccc Confidence 88877545556888766654444221 11111111 11222221111 1122 Q ss_pred EEEEEeeCCeeecccccc-ccccCceEEEe----eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC Q lcl|Aclame:pro 215 AEMYHEIDGVRVGETGRW-PIHLCPYIVPT----WNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD 289 (510) Q Consensus 215 ~sv~~e~~~~~~~~~~~y-~~~~~P~~~~R----w~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~ 289 (510) ..+|-. +.....+ +....||+..+ .+...+++||+|-..++++-+..|+..--......+. .+..+.++ T Consensus 215 ~~~~~~-----~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~-~~~~i~v~ 288 (496) T protein:vir:38 215 TLLFDD-----IEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-GKKKVLVP 288 (496) T ss_pred cccccc-----cccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhh-cccceecc Confidence 222111 1111111 11234444332 3446678999999999999999999877766665544 45555553 Q ss_pred CCCccchhhh--------hcCCCcceec--CC----ccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccC-CC Q lcl|Aclame:pro 290 EAKGAVVDDY--------QDAEMGDYVP--GG----AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQ-RD 352 (510) Q Consensus 290 ~~g~~~~~~~--------~~~~~G~~~~--g~----~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~-~~ 352 (510) + .++....- .......+.. +. ...+..++.. -........++.+.+.|....=++ .+. .. T Consensus 289 ~-~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~--i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~ 365 (496) T protein:vir:38 289 S-SFVKTAVNLDGSTTQYFDSTDEAFFLYQGDQDDNGKAIKDISVE--IRSTEFIESINAMLRIYAMQVGLSAGTFTFDE 365 (496) T ss_pred h-HHhhccCCCCCccccCCCCccceEEEeecCCCcccccceeeccc--cCHHHHHHHHHHHHHHHHHhhCCChhhcCCCc Confidence 2 33221110 0000011111 11 1111111110 011222333444444443222111 111 12 Q ss_pred CCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhh----cCCCCCCccceeeEEeecHHHHHHHHHHHHH Q lcl|Aclame:pro 353 AERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD----ALLQGLITKQHKPAIETGLPALSRSAAVQSM 428 (510) Q Consensus 353 ~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~----~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~ 428 (510) +...|||||..+.+.......- ..+.-...+..++.-.+.+... .+.. .++..+.+.+--++.. .....++.+ T Consensus 366 ~g~~tAtei~~~~~~l~~~~~~-~~~~~~~~l~~l~~~il~~~~~~~~~~g~~-~~~~~i~v~f~d~i~~-d~~~~~~~~ 442 (496) T protein:vir:38 366 NGLKTATEVVSEKSETYQTKNS-HSQLIEQGIKEMIVSILEVGKFIEAYSGEV-VELDTITVDFDDSIAQ-DEDTTINRY 442 (496) T ss_pred cccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcCCC-CCccceEEEeCCCCCC-CHHHHHHHH Confidence 3345999999887777766543 5555566666666666554321 2222 2233455554332221 111222222 Q ss_pred HHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|Aclame:pro 429 LNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALA 508 (510) Q Consensus 429 ~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~a 508 (510) .... + +|. +....+ +....|++ +++.+++.++.+... ++..+ ....+.+. T Consensus 443 ~~~~---~--~Gi------iS~et~---l~~~~~~~-------d~ea~~el~ri~~E~--~~~~~-------~~d~~~~~ 492 (496) T protein:vir:38 443 TNAK---N--QGM------IPLKIA---LQRAWNIT-------EAEADEWAEMLAKEK--QAEMP-------NNDMNGIF 492 (496) T ss_pred HHHH---h--cCC------CCHHHH---HHhcCCCC-------hHHHHHHHHHHHHhh--hccCc-------cccccCCC Confidence 2211 1 121 111111 33445554 334333222221111 11110 11111112 Q ss_pred CC Q lcl|Aclame:pro 509 GV 510 (510) Q Consensus 509 g~ 510 (510) |= T Consensus 493 ~~ 494 (496) T protein:vir:38 493 GE 494 (496) T ss_pred CC Confidence 21 No 64 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.16 E-value=3.4e-06 Score=50.55 Aligned_cols=392 Identities=11% Similarity=-0.033 Sum_probs=162.0 Q ss_pred hcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHH Q lcl|Aclame:pro 28 TLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRK 107 (510) Q Consensus 28 ~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~ 107 (510) .+|.-.+++.. ....+..-+-+..++++++..|. +.+ |+ .+|... .+. T Consensus 1 ~l~~~~~~~~~---~~~~~~v~n~~~~ivd~~~~~l~----~~g---f~--~~d~~~---------~~~----------- 48 (434) T protein:vir:98 1 MLPKNAEQAFL---DFQRKARTNFCGLIANASVHRLL----ALG---VT--GPDGEP---------DTR----------- 48 (434) T ss_pred CCCCCccHHHH---HhhhhhhccchHHHHHHHHhhhc----cCc---ee--cCCCch---------HHH----------- Confidence 34432222111 11112234566777777777553 323 33 222211 111 Q ss_pred HHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC----------eEEEEEeceEEEeeC-CCCceeEEEEEEEecHHH Q lcl|Aclame:pro 108 ATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA----------TVVAWSLRSYAVRRD-ATGRWMDIVLKQRYKSKD 176 (510) Q Consensus 108 ~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~----------~~~~~pl~~~~v~~d-~~G~v~~i~r~~~~t~~~ 176 (510) +.+.+.+++|.....+++++..++|.+.+++..+.. .+++++-.+..+..| ..+++...++.+.... T Consensus 49 ~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~-- 126 (434) T protein:vir:98 49 ASRWWQANRLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDI-- 126 (434) T ss_pred HHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEecc-- Confidence 123356789999999999999999998766543211 266776666545555 4467665555543211 Q ss_pred HhHHhhHHhhcccccCCCCceEEEEEEEEe---ecCCC--eeEEEEEEeeCCeeeccccccccccCceEEEeeeecCCCc Q lcl|Aclame:pro 177 LDDVYKQDLMRAGRNLSGSGSVDLYTHVQR---RKGTA--MDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTWNLAPGEH 251 (510) Q Consensus 177 l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~---~~~~~--~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~ 251 (510) +......+.+++.++. +...+ ..+.+.-+.... ........+++.+|++.++-+...++ T Consensus 127 --------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~h~~g~vPvv~f~N~~~~~~- 190 (434) T protein:vir:98 127 --------------DGFGYARVFFDDTSFPYRTRERTGARLPWGPDSWVYTG-TADSGDVHDLGGMQLVEFARMPDLGE- 190 (434) T ss_pred --------------CCceEEEEEEeCcEEEEEEeeccccccccccccceecc-cccccccCCCCccceEEeccCCCcCc- Confidence 1111222222222111 11110 000000011111 11111112446799998876666555 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC---CC-------CccchhhhhcCCCcceecCCccccccccCC Q lcl|Aclame:pro 252 YGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD---EA-------KGAVVDDYQDAEMGDYVPGGAEAVRAYERG 321 (510) Q Consensus 252 YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~---~~-------g~~~~~~~~~~~~G~~~~g~~~~v~~~~~~ 321 (510) +|+|=.+..++.+..++...-..+..+...+.|...+. ++ +.+....+.....|.+..-...+++..+++ T Consensus 191 ~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~ 270 (434) T protein:vir:98 191 DPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKFAKRTDPATGMTVVDQPFVPSPSAVWASEGENTQFGQLD 270 (434) T ss_pred CCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccccchhhhhhhccccccccCCCCCceEEEec Confidence 79998899999999999998888888888877754442 00 000001111111111110001122222332 Q ss_pred CccchHHHHHHHHHHHHHHHHHHhhcc-----cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 322 DYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVD 396 (510) Q Consensus 322 ~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~ 396 (510) .++++.....+ +.-|........ +..+..+.++.-++.....+... .++.+.- +..-+.+.+.++. T Consensus 271 -~~~~~~~~~~l---~~~i~~~~~~~~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k----~~~k~~~-f~~~l~~~~rl~~ 341 (434) T protein:vir:98 271 -ATDLSGFLKEH---ASDVRDMLTISQTPTYLYATDLVNISADTIGALDILHVAK----VREHIAS-FSEGLESVLALAA 341 (434) T ss_pred -CcchHHHHHHH---HHHHHHHhcccCCCHHHhccccCChHHHHHHHHHHHHHHH----HHHHHHH-HHHHHHHHHHHHH Confidence 13444333333 333333222211 11122344666554332222222 1111111 1122223333321 Q ss_pred -hcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHH Q lcl|Aclame:pro 397 -DALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADEL 475 (510) Q Consensus 397 -~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~ 475 (510) -.+..+ ....+++.+..+. +-..++.++.+..+.+ . +.+ . +.+...+|.+ ++|+ T Consensus 342 ~~~g~~~-~~~~~~v~w~~~~-~~s~~~~ada~~kl~~---~--g~~-------~----e~~~~~lg~~-------~~e~ 396 (434) T protein:vir:98 342 AQAGVPE-DYTEAEVRWANPA-HVTMAVKADAATKLKS---I--GYP-------L----DVIAEELDES-------PARV 396 (434) T ss_pred HhcCCCh-hheeeeEEecCCC-CCCHHHHHHHHHHHHh---c--CCc-------H----HHHHHhCCCC-------HHHH Confidence 223321 1123444443322 1122222222222111 1 111 1 1233456654 3455 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 476 QAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 476 ~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) +...+++.+|+..+... +.+.+...+|. T Consensus 397 ~r~~~e~~~~~~~~~~~-------~~~~~~~~~g~ 424 (434) T protein:vir:98 397 RRIVAGAASQALLAASL-------LPAPGAPSAGN 424 (434) T ss_pred HHHHHHHHHHHHHHHhh-------hccCCCCCCCC Confidence 44333333333222111 11222233333 No 65 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=98.13 E-value=3.9e-06 Score=50.27 Aligned_cols=439 Identities=10% Similarity=0.040 Sum_probs=207.1 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhccc---c--cCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY---L--MVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~---~--~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF 75 (510) +-+.=..+|.-| .+|..=. + ....++.+ . ...++++.|...+++ .+-.+.| +..|+ T Consensus 28 ~d~~Rl~aY~l~------------~~~y~n~~~~~~~~lrg~~~~-~-~r~~~~ps~~~~~~~----~~~~~~~-g~~~~ 88 (527) T protein:vir:10 28 FDKARLASYRLY------------EDMYLTNTSDYQVILRGGDEG-D-QRPIYVPNGEKLIEA----KMRFLGQ-GLKWE 88 (527) T ss_pred HHHHHHHHHHHH------------HHHhcCchhheeeecCCcccc-c-cceeeehhhHHhhCC----cceeecc-Ccccc Confidence 333333333333 2322211 0 00011111 1 125678877444433 3333333 33342 Q ss_pred ccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--C-C--CCe--EEEE Q lcl|Aclame:pro 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--S-D--EAT--VVAW 148 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~--~-~--~~~--~~~~ 148 (510) - +... . +|+..+...+++.|++....++-.+..+.|-+++.+- + + .++ ++.+ T Consensus 89 ~----~~~~----------e-------~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~ 147 (527) T protein:vir:10 89 F----SKKD----------A-------KVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEV 147 (527) T ss_pred c----cchh----------H-------HHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeec Confidence 1 1111 1 2344445567889999999999999999998886553 2 1 124 4445 Q ss_pred EeceEEEeeCCCC--ceeEEEEEEEecHHHHhHHhhH-----HhhcccccCCCCce--------EEEEEEE--EeecCCC Q lcl|Aclame:pro 149 SLRSYAVRRDATG--RWMDIVLKQRYKSKDLDDVYKQ-----DLMRAGRNLSGSGS--------VDLYTHV--QRRKGTA 211 (510) Q Consensus 149 pl~~~~v~~d~~G--~v~~i~r~~~~t~~~l~~~~~~-----~~~~~~~~~~~~~~--------v~v~~~v--~~~~~~~ 211 (510) -.+.|+..+|++| .|..+|.. ....+++.-++ .+.+-..+.++... ++..||- .+.+... T Consensus 148 DP~~~f~~ed~d~~~~v~~v~~~---~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e 224 (527) T protein:vir:10 148 DPSTYFPYEDPRYPGQVLGVYLV---DEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPE 224 (527) T ss_pred CcceeeeeecCCCCCceeeEEEe---eeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccc Confidence 4588888899875 34444433 12223322221 11111111122211 1111111 0111111 Q ss_pred ee--EEEEEEeeCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC Q lcl|Aclame:pro 212 MD--YAEMYHEIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD 289 (510) Q Consensus 212 ~~--~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~ 289 (510) -| -.++-...++..+... .-+..-.|++..+=...++++||+|=..+.+.-+..||.........+...-.|+...+ T Consensus 225 ~p~~~~~~~~~~~~~~l~~l-p~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~t 303 (527) T protein:vir:10 225 SPLEPDDIKKLSTLTEEEPL-PEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATD 303 (527) T ss_pred cccchhhhhhhcCceeeecc-cCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeec Confidence 11 1222223444443322 11224478887777788999999999999999999999888887778777777766653 Q ss_pred CCCccchhhhhcCC-Cc-ceecCCccc----cccccCCCccchHHHHHHHHHHHHHHHHHHhhcc---cCCCCCCCCHHH Q lcl|Aclame:pro 290 EAKGAVVDDYQDAE-MG-DYVPGGAEA----VRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA---NQRDAERVTAEE 360 (510) Q Consensus 290 ~~g~~~~~~~~~~~-~G-~~~~g~~~~----v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~---~~~~~~~vTAtE 360 (510) |+...+ .. +. +. .+-||..-. -....++...++...+.-+..+..+|...--.-. ..-|..+ --+. T Consensus 304 --g~~~vd-~~-G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~-~~SG 378 (527) T protein:vir:10 304 --SAPPRD-SR-GNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAV-AESG 378 (527) T ss_pred --cccccc-cc-CCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCc-CcHH Confidence 443222 11 11 00 112222211 1122233334566666667777766655432111 1112222 1122 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHH-HHHHHHHH---------HHHhhcCCCCCCc-cceeeEEeecHHHHHHHHHHHHHH Q lcl|Aclame:pro 361 VRITAEEAENTLGGTYSLLAENLQ-SPLAYVCL---------SEVDDALLQGLIT-KQHKPAIETGLPALSRSAAVQSML 429 (510) Q Consensus 361 i~~r~~E~~~~LGpv~~rl~~E~l-~Pli~r~~---------~il~~~~l~~~p~-~~~~~~~vs~l~~l~r~~~~~~~~ 429 (510) + -+...|+|++.|.+..-| .-.+.|.| ...+.-++-+... -.+++.+ .+.-|..+++-++++. T Consensus 379 ~-----ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf-~p~lP~D~~avie~v~ 452 (527) T protein:vir:10 379 I-----ALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITF-RDPKPVNSEKRFNQLL 452 (527) T ss_pred H-----HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEe-cccCCCCHHHHHHHHH Confidence 2 123445566665555422 22223222 1111111111111 1233332 3334556666666665 Q ss_pred HHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-- Q lcl|Aclame:pro 430 NASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNAL-- 507 (510) Q Consensus 430 ~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~-- 507 (510) ...+ ++ -+....+++.+.+..|+. ..+.|+++..+++.+|+.+...|.....++|+.-+|.. T Consensus 453 tL~~-----aG------i~S~~tAv~~L~~~~g~e-----D~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~ 516 (527) T protein:vir:10 453 QLWE-----AG------LIPAKKLTEELSKIMGFE-----LTEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDE 516 (527) T ss_pred HHHH-----cC------chhHHHHHHHHHhccCCC-----ChHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCC Confidence 4443 12 134456788888877643 23556666666666666666666556666665544422 Q ss_pred ------CCC Q lcl|Aclame:pro 508 ------AGV 510 (510) Q Consensus 508 ------ag~ 510 (510) .|+ T Consensus 517 ~~d~~~~~~ 525 (527) T protein:vir:10 517 EDDQALNGQ 525 (527) T ss_pred CcccccCCC Confidence 222 No 66 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=98.13 E-value=3.9e-06 Score=50.22 Aligned_cols=436 Identities=11% Similarity=0.006 Sum_probs=185.4 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhccc-----cc---CCCCCC----ccccccccccchHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY-----LM---VDPMSG----SRGVVEHDFQSAGALLVNNLAAKLARSLF 68 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~-----~~---~~~~~~----~~~~~~~~~dstg~~a~~~Laa~l~~~lt 68 (510) +.+.......++-+..-..+++.+.+|..-. +. .+.... ......++-.+-+...++..++.|.+ T Consensus 26 ~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~yl~g--- 102 (503) T protein:vir:59 26 IAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNNRTSHAWHKLFVDQKTQYLVG--- 102 (503) T ss_pred ccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhcccccccccccccccceeecchHHHHHHHHHhhhhc--- Confidence 2222222233331111224555555554421 11 111000 00111233445556666666665532 Q ss_pred CccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC-C--eE Q lcl|Aclame:pro 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-A--TV 145 (510) Q Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~-~--~~ 145 (510) .| ++++.+|+. +.+.+ +.+..++|-....++.++..++|.+.+++..+. + ++ T Consensus 103 ---~~-~~~~~~d~~-------------~~~~l--------~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i 157 (503) T protein:vir:59 103 ---EP-VTFTSDNKT-------------LLEYV--------NELADDDFDDILNETVKNMSNKGIEYWHPFVDEEGEFDY 157 (503) T ss_pred ---CC-eeeccCcHH-------------HHHHH--------HHHHhcCHHHHHHHHHHHHhhCCeEEEEEeecCCCceEE Confidence 11 223333322 22222 123346899999999999999999876554332 2 46 Q ss_pred EEEEeceEEEeeC-C-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE--EE-eecCCCeeEE-EEEE Q lcl|Aclame:pro 146 VAWSLRSYAVRRD-A-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH--VQ-RRKGTAMDYA-EMYH 219 (510) Q Consensus 146 ~~~pl~~~~v~~d-~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~--v~-~~~~~~~~~~-sv~~ 219 (510) ++++..+++...| . .+++..++|.++.. ..+.+....+++|+. |+ ....++.... ..+. T Consensus 158 ~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~---------------~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~ 222 (503) T protein:vir:59 158 VIFPAEEMIVVYKDNTRRDILFALRYYSYK---------------GIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYG 222 (503) T ss_pred EEEccceeEEEEeCCCCCceEEEEEEEEEe---------------cCCCceEEEEEEEeCCcEEEEEEcCCccccccccc Confidence 6676656444444 3 37777666666421 011111223334331 10 0000000000 0000 Q ss_pred eeC--CeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCC-CCccch Q lcl|Aclame:pro 220 EID--GVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE-AKGAVV 296 (510) Q Consensus 220 e~~--~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~-~g~~~~ 296 (510) +.. .........+++..+|++.++- +.+|.|=...+.+-+..+|.+.-...........|.+.+.- ++-... T Consensus 223 ~~~~~~~~~~~~~~~~~~~vPiv~~~n-----n~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~ 297 (503) T protein:vir:59 223 ENNPRPHMTKGGQAIGWGRVPIIPFKN-----NEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPK 297 (503) T ss_pred ccccccceeecceeccCCccceEEecC-----CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccc Confidence 000 0000111122345678877653 45799989999999999999888888888888888877642 111111 Q ss_pred hhhh-cCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc-c-cCCCCCCCCHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 297 DDYQ-DAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-A-NQRDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 297 ~~~~-~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~-~-~~~~~~~vTAtEi~~r~~E~~~~LG 373 (510) +... ....+.+..+..+++..+... .+.+.....++.++..|...-... . ....+...|+..+..+-.-.... . T Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k-~ 374 (503) T protein:vir:59 298 EFTANLRYHSVIKVSGDGGVDTLRAE--IPVDSAAKELERIQDELYKSAQAVDNSPETIGGGATGPALENLYALLDLK-A 374 (503) T ss_pred hhhhhhhcccceeccCCCcceeEecc--CCHHHHHHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHHHHHHHH-H Confidence 1111 111223333333445554432 355667778888887776644321 1 12224456887765543322222 2 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcCCCC-CCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHH Q lcl|Aclame:pro 374 GTYSLLAENLQSPLAYVCLSEVDDALLQG-LITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPK 452 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~l~~-~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~ 452 (510) --..+.-.+.|.-++..++.++...+... .....+.+.+..++. -..+..++.+....+ . +-++ ... T Consensus 375 ~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~p-~d~~~~~~~~~kl~~---~-GiiS-------~et 442 (503) T protein:vir:59 375 NMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTRTRI-QNDSEIVQSLVQGVT---G-GIMS-------KET 442 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCcccccccceeEEeCCCCC-CCHHHHHHHHHHHHh---C-CCCc-------hHH Confidence 33444444444445555555554322222 222346666544332 122222222222111 1 1111 112 Q ss_pred HHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHHhhc---ccCC Q lcl|Aclame:pro 453 MMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEG----------ASDMTN---ALAG 509 (510) Q Consensus 453 ~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~----------a~~~~~---~~ag 509 (510) ++. .++ ++.+ ++|++...+++ .++..+.........+ ..+..+ .+|. T Consensus 443 ~l~----~l~-----~v~d~~~E~~ri~~E~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 503 (503) T protein:vir:59 443 AVA----RNP-----FVQDPEEELARIEEEM-NQYAEMQGNLLDDEGGDDDLEEDDPNAGAAESGGAGQVS 503 (503) T ss_pred HHH----hCC-----CCCCHHHHHHHHHHHH-HHHHhhhccccCccCCCCCCCcCCCCCCcccCCCCCCcC Confidence 222 121 1222 34444332222 2111111100000000 000000 1111 No 67 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=98.11 E-value=4.3e-06 Score=49.99 Aligned_cols=439 Identities=10% Similarity=0.040 Sum_probs=206.9 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhccc---c--cCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY---L--MVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~---~--~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF 75 (510) +-+.=..+|.-| .+|..=. + ....++.+ . ...++++.|...+++ .+-.+.| +..|+ T Consensus 28 ~d~~Rl~aY~l~------------~~~y~n~~~~~~~~lrg~~~~-~-~r~~~~ps~~~~~~~----~~~~~~~-g~~~~ 88 (527) T protein:vir:10 28 FDKARLASYRLY------------EDMYLTNTSDYQVILRGGDEG-D-QRPIYVPNGEKLIEA----KMRFLGQ-GLKWE 88 (527) T ss_pred HHHHHHHHHHHH------------HHHhcCchhheeeecCCcccc-c-cceeeehhhHHhhCC----cceeecc-Ccccc Confidence 333333333333 2322211 0 00011111 1 125678877444432 3333333 33342 Q ss_pred ccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--C-C--CCe--EEEE Q lcl|Aclame:pro 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--S-D--EAT--VVAW 148 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~--~-~--~~~--~~~~ 148 (510) - +... . +|+..+...+++.|++....++-.+..+.|-+++.+- + + .++ ++.+ T Consensus 89 ~----~~~~----------e-------~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~ 147 (527) T protein:vir:10 89 F----SKKD----------A-------KVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEV 147 (527) T ss_pred c----cchh----------H-------HHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeec Confidence 1 1111 1 2344445568889999999999999999998886553 2 1 124 4445 Q ss_pred EeceEEEeeCCCC--ceeEEEEEEEecHHHHhHHhhH-----HhhcccccCCCCce--------EEEEEEE--EeecCCC Q lcl|Aclame:pro 149 SLRSYAVRRDATG--RWMDIVLKQRYKSKDLDDVYKQ-----DLMRAGRNLSGSGS--------VDLYTHV--QRRKGTA 211 (510) Q Consensus 149 pl~~~~v~~d~~G--~v~~i~r~~~~t~~~l~~~~~~-----~~~~~~~~~~~~~~--------v~v~~~v--~~~~~~~ 211 (510) -.+.|+..+|++| .|..+|.. ....+++.-++ .+.+-..+.++... ++..||- .+.+... T Consensus 148 DP~~~f~~ed~d~~~~v~~v~~~---~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e 224 (527) T protein:vir:10 148 DPSTYFPYEDPRYPGQVLGVYLV---DEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPE 224 (527) T ss_pred CcceeeeeecCCCCCceeeEEEe---eeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccc Confidence 4588888899875 34444433 12223322221 11111111122211 1111111 0111111 Q ss_pred ee--EEEEEEeeCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC Q lcl|Aclame:pro 212 MD--YAEMYHEIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD 289 (510) Q Consensus 212 ~~--~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~ 289 (510) -| -.++-...++..+... .-+..-.|++..+=...++++||+|=..+.+.-+..||+........+...-.|+...+ T Consensus 225 ~p~~~~~~~~~~~~~~l~~l-p~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~t 303 (527) T protein:vir:10 225 SPLEPDDIKKLSTLTEEEPL-PEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATD 303 (527) T ss_pred cccchhhhhhhcCceeeecc-cCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeec Confidence 11 1222223444443322 11224478887777788999999999999999999999888887778777777766653 Q ss_pred CCCccchhhhhcCC-Cc-ceecCCccc----cccccCCCccchHHHHHHHHHHHHHHHHHHhhcc---cCCCCCCCCHHH Q lcl|Aclame:pro 290 EAKGAVVDDYQDAE-MG-DYVPGGAEA----VRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA---NQRDAERVTAEE 360 (510) Q Consensus 290 ~~g~~~~~~~~~~~-~G-~~~~g~~~~----v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~---~~~~~~~vTAtE 360 (510) |+...+ .. +. +. .+-||..-. -....++...++...+.-+..+..+|...--.-. ..-|..+ --+. T Consensus 304 --g~~~vd-~~-G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~-~~SG 378 (527) T protein:vir:10 304 --SAPPRD-SR-GNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAV-AESG 378 (527) T ss_pred --cccccc-cc-CCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCc-CcHH Confidence 443222 11 11 00 112222211 1122233334566666666777666655432111 1112222 1122 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHH-HHHHHHHH---------HHHhhcCCCCCCc-cceeeEEeecHHHHHHHHHHHHHH Q lcl|Aclame:pro 361 VRITAEEAENTLGGTYSLLAENLQ-SPLAYVCL---------SEVDDALLQGLIT-KQHKPAIETGLPALSRSAAVQSML 429 (510) Q Consensus 361 i~~r~~E~~~~LGpv~~rl~~E~l-~Pli~r~~---------~il~~~~l~~~p~-~~~~~~~vs~l~~l~r~~~~~~~~ 429 (510) + -+...|+|++.|.+..-| .-.+.|.| ...+.-++-+... -.+++.+ .+.-|..+++-++++. T Consensus 379 ~-----ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf-~p~lP~D~~avie~v~ 452 (527) T protein:vir:10 379 I-----ALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITF-RDPKPVNNEKRFAQLL 452 (527) T ss_pred H-----HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEe-cccCCCCHHHHHHHHH Confidence 2 123445566666555422 22233222 1111111111111 1233332 3334556666666665 Q ss_pred HHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-- Q lcl|Aclame:pro 430 NASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNAL-- 507 (510) Q Consensus 430 ~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~-- 507 (510) ...+ ++ -+....+++.+.+..|+. ..+.|+++..+++.+|+.+...|.....++|+.-+|.. T Consensus 453 tL~~-----aG------iiS~etAv~~L~~~~g~e-----D~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~ 516 (527) T protein:vir:10 453 ELWE-----AG------LIPAKKLTEELSKIMGFE-----LTEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDE 516 (527) T ss_pred HHHH-----cC------chhHHHHHHHHHhccCCC-----chHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCC Confidence 4444 12 134456788888877643 23456666666666666666666555666665544322 Q ss_pred ------CCC Q lcl|Aclame:pro 508 ------AGV 510 (510) Q Consensus 508 ------ag~ 510 (510) .|+ T Consensus 517 ~~d~~~~~~ 525 (527) T protein:vir:10 517 EDDQALNGQ 525 (527) T ss_pred CcccccCCC Confidence 222 No 68 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=97.99 E-value=7.8e-06 Score=48.58 Aligned_cols=413 Identities=10% Similarity=0.030 Sum_probs=176.9 Q ss_pred ChhHHHHHHHHHh-ccCchHHHHHHHHhh--cccc---cCCCCCC-ccccccccccchHHHHHHHHHHHHHHhhcCccCc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-DGSVEQRAIEFAKTT--LPYL---MVDPMSG-SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~k~~~~~r~~~lk-r~~~~~~w~e~~~~~--~P~~---~~~~~~~-~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~ 73 (510) ..+.+.+.+...+ |-....+++++++-- +..+ ....+.. ......++..+-+...++..++.|.+ -| T Consensus 28 ~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g--~p---- 101 (474) T protein:vir:97 28 QEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVAS--KP---- 101 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhc--CC---- Confidence 2333344433332 333444555555421 1111 1011111 11122345667777777777766654 12 Q ss_pred ccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--CCC-CeEEEEEe Q lcl|Aclame:pro 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SDE-ATVVAWSL 150 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~--~~~-~~~~~~pl 150 (510) +.++.+|+. +.+.|. .+..+||...+.++.++...+|.+.+++. ++. .++.+++. T Consensus 102 -~~~~~~d~~-------------~~~~l~--------~~~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p 159 (474) T protein:vir:97 102 -VTYSCEDEN-------------VLKVIH--------DVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPA 159 (474) T ss_pred -ceeccCcHH-------------HHHHHH--------HHHhccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcc Confidence 223333322 122221 12347899999999999999998765543 332 24666766 Q ss_pred ceEEEeeCC--CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE--E--EeecCCCeeEEEEEEeeCCe Q lcl|Aclame:pro 151 RSYAVRRDA--TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH--V--QRRKGTAMDYAEMYHEIDGV 224 (510) Q Consensus 151 ~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~--v--~~~~~~~~~~~sv~~e~~~~ 224 (510) .+.+...|. .+++.-++|.++.. ....+++|+- + +..++++ .........++. T Consensus 160 ~~~~~v~d~~~~~~~~~~ir~~~~~--------------------~~~~~~~yt~~~~~~y~~~~~~-~~~~~~~~~~~~ 218 (474) T protein:vir:97 160 EQAIPIWVDKEREELKSFIRYYKFN--------------------NEEKVEFWTDTTVTYYVLENGG-LIPDYYYGANHV 218 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEEec--------------------CeEEEEEEeCCeEEEEEEcCCc-cccccccCcCcc Confidence 665544443 57887777766421 1123344431 1 1111111 111111111111 Q ss_pred eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhh-hhcCC Q lcl|Aclame:pro 225 RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDD-YQDAE 303 (510) Q Consensus 225 ~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~-~~~~~ 303 (510) ... ....++..+|++..+. +.+|.|=.....+-+..+|.+.-......+....|.+++.-...-.... ..... T Consensus 219 ~~~-~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~ 292 (474) T protein:vir:97 219 QSH-FSNGNWGRVPFIAFKN-----NPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLK 292 (474) T ss_pred ccc-ccccCCCccceEEecC-----CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhh Confidence 111 1112335688887653 4689999999999999999988888888888888877764211111111 11111 Q ss_pred C-cceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHH-------HHHHHHHHHhh Q lcl|Aclame:pro 304 M-GDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVR-------ITAEEAENTLG 373 (510) Q Consensus 304 ~-G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~-------~r~~E~~~~LG 373 (510) . +.+.....+++..+.. ..+.......++.++..|...-.. +. ....+...|+.-+. .++.++...++ T Consensus 293 ~~~~i~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~ 370 (474) T protein:vir:97 293 YYKAINVDGDGGVETIQV--EVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKAT 370 (474) T ss_pred ccceeeccCCCceeEEee--cCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 2222222234444332 246677777888888777654322 11 11122344655433 23344444444 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHH Q lcl|Aclame:pro 374 GTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKM 453 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~ 453 (510) ..+.+ ++..++.++. . ......+.+.+.-.+ +..-++.++ .+...+. +....+ T Consensus 371 ~~l~~--------~~~li~~~~~---~-~~d~~~i~v~f~~~~-p~~~~e~a~-------~~~~~g~-------iS~et~ 423 (474) T protein:vir:97 371 VAIQE--------LISFIIDFNN---L-KTDVKDIEISFNFNR-MMNDAEQSQ-------IIAQSQY-------LSRETL 423 (474) T ss_pred HHHHH--------HHHHHHHHhC---C-CcccceeeEEeccCc-ccCHHHHHH-------HHHHcCC-------CCHHHH Confidence 44333 2222222221 1 111223444442111 211111111 1111111 222233 Q ss_pred HHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 454 MDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 454 ~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) +.. ++ ++.+ ++|+++..+++ + + .++..+. ........+....+- T Consensus 424 l~~----l~-----~v~D~~~E~eri~~E~-~-~-~~~~~~~-~~~~~~~~~~~~~~~ 468 (474) T protein:vir:97 424 VKS----SP-----LVDDYKAELERIEQEQ-M-E-YNKQLPN-LDDGGADGAQQQEGS 468 (474) T ss_pred HHh----CC-----CCCCHHHHHHHHHHHH-H-H-HHhhccc-cCCCCCCCcccCCCC Confidence 332 21 1222 23333222211 1 1 1111100 000000001111111 No 69 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=97.99 E-value=7.8e-06 Score=48.58 Aligned_cols=413 Identities=10% Similarity=0.030 Sum_probs=176.9 Q ss_pred ChhHHHHHHHHHh-ccCchHHHHHHHHhh--cccc---cCCCCCC-ccccccccccchHHHHHHHHHHHHHHhhcCccCc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-DGSVEQRAIEFAKTT--LPYL---MVDPMSG-SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~k~~~~~r~~~lk-r~~~~~~w~e~~~~~--~P~~---~~~~~~~-~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~ 73 (510) ..+.+.+.+...+ |-....+++++++-- +..+ ....+.. ......++..+-+...++..++.|.+ -| T Consensus 28 ~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g--~p---- 101 (474) T protein:vir:94 28 QEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVAS--KP---- 101 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhc--CC---- Confidence 2333344433332 333444555555421 1111 1011111 11122345667777777777766654 12 Q ss_pred ccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--CCC-CeEEEEEe Q lcl|Aclame:pro 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SDE-ATVVAWSL 150 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~--~~~-~~~~~~pl 150 (510) +.++.+|+. +.+.|. .+..+||...+.++.++...+|.+.+++. ++. .++.+++. T Consensus 102 -~~~~~~d~~-------------~~~~l~--------~~~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p 159 (474) T protein:vir:94 102 -VTYSCEDEN-------------VLKVIH--------DVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPA 159 (474) T ss_pred -ceeccCcHH-------------HHHHHH--------HHHhccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcc Confidence 223333322 122221 12347899999999999999998765543 332 24666766 Q ss_pred ceEEEeeCC--CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE--E--EeecCCCeeEEEEEEeeCCe Q lcl|Aclame:pro 151 RSYAVRRDA--TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH--V--QRRKGTAMDYAEMYHEIDGV 224 (510) Q Consensus 151 ~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~--v--~~~~~~~~~~~sv~~e~~~~ 224 (510) .+.+...|. .+++.-++|.++.. ....+++|+- + +..++++ .........++. T Consensus 160 ~~~~~v~d~~~~~~~~~~ir~~~~~--------------------~~~~~~~yt~~~~~~y~~~~~~-~~~~~~~~~~~~ 218 (474) T protein:vir:94 160 EQAIPIWVDKEREELKSFIRYYKFN--------------------NEEKVEFWTDTTVTYYVLENGG-LIPDYYYGANHV 218 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEEec--------------------CeEEEEEEeCCeEEEEEEcCCc-cccccccCcCcc Confidence 665544443 57887777766421 1123344431 1 1111111 111111111111 Q ss_pred eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhh-hhcCC Q lcl|Aclame:pro 225 RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDD-YQDAE 303 (510) Q Consensus 225 ~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~-~~~~~ 303 (510) ... ....++..+|++..+. +.+|.|=.....+-+..+|.+.-......+....|.+++.-...-.... ..... T Consensus 219 ~~~-~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~ 292 (474) T protein:vir:94 219 QSH-FSNGNWGRVPFIAFKN-----NPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLK 292 (474) T ss_pred ccc-ccccCCCccceEEecC-----CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhh Confidence 111 1112335688887653 4689999999999999999988888888888888877764211111111 11111 Q ss_pred C-cceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHH-------HHHHHHHHHhh Q lcl|Aclame:pro 304 M-GDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVR-------ITAEEAENTLG 373 (510) Q Consensus 304 ~-G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~-------~r~~E~~~~LG 373 (510) . +.+.....+++..+.. ..+.......++.++..|...-.. +. ....+...|+.-+. .++.++...++ T Consensus 293 ~~~~i~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~ 370 (474) T protein:vir:94 293 YYKAINVDGDGGVETIQV--EVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKAT 370 (474) T ss_pred ccceeeccCCCceeEEee--cCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 2222222234444332 246677777888888777654322 11 11122344655433 23344444444 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHH Q lcl|Aclame:pro 374 GTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKM 453 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~ 453 (510) ..+.+ ++..++.++. . ......+.+.+.-.+ +..-++.++ .+...+. +....+ T Consensus 371 ~~l~~--------~~~li~~~~~---~-~~d~~~i~v~f~~~~-p~~~~e~a~-------~~~~~g~-------iS~et~ 423 (474) T protein:vir:94 371 VAIQE--------LISFIIDFNN---L-KTDVKDIEISFNFNR-MMNDAEQSQ-------IIAQSQY-------LSRETL 423 (474) T ss_pred HHHHH--------HHHHHHHHhC---C-CcccceeeEEeccCc-ccCHHHHHH-------HHHHcCC-------CCHHHH Confidence 44333 2222222221 1 111223444442111 211111111 1111111 222233 Q ss_pred HHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 454 MDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 454 ~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) +.. ++ ++.+ ++|+++..+++ + + .++..+. ........+....+- T Consensus 424 l~~----l~-----~v~D~~~E~eri~~E~-~-~-~~~~~~~-~~~~~~~~~~~~~~~ 468 (474) T protein:vir:94 424 VKS----SP-----LVDDYKAELERIEQEQ-M-E-YNKQLPN-LDDGGADGAQQQEGS 468 (474) T ss_pred HHh----CC-----CCCCHHHHHHHHHHHH-H-H-HHhhccc-cCCCCCCCcccCCCC Confidence 332 21 1222 23333222211 1 1 1111100 000000001111111 No 70 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=97.85 E-value=1.5e-05 Score=47.04 Aligned_cols=425 Identities=11% Similarity=0.000 Sum_probs=183.3 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhh-----ccccc-CCCCCC-----ccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTT-----LPYLM-VDPMSG-----SRGVVEHDFQSAGALLVNNLAAKLARSLFP 69 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~-----~P~~~-~~~~~~-----~~~~~~~~~dstg~~a~~~Laa~l~~~ltp 69 (510) +=.-+.+..+.+....-.++++.+.+|. ++++- ...++. ......++-.+.+...++..++.|++- | T Consensus 20 ~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~Ivd~~~~~l~g~--p 97 (479) T protein:vir:79 20 STINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINNYHKLLVDQKVGYSVGN--P 97 (479) T ss_pred ChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecchHHHHHHHHHhhhhcC--C Confidence 1122233333332111122333334433 22210 000100 011122455566666677766666542 2 Q ss_pred ccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--CCC-CeEE Q lcl|Aclame:pro 70 TGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SDE-ATVV 146 (510) Q Consensus 70 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~--~~~-~~~~ 146 (510) + +++.+++. +.+.+ ..+..++|.....++.++..++|.+.+++. ++. .+++ T Consensus 98 ~-----~~~~~~~~-------------~~~~~--------~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~ 151 (479) T protein:vir:79 98 I-----VFNADDDN-------------LTKLL--------NDLLGEEFDDTITELYLNASNKGVEWLHPYINRKGEFKYV 151 (479) T ss_pred c-----eeccCCHH-------------HHHHH--------HHHHhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEE Confidence 2 22333321 22222 234457899999999999999998765544 332 2456 Q ss_pred EEEeceEEEeeC--CCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEE---EE-EeecCCCeeEEE---- Q lcl|Aclame:pro 147 AWSLRSYAVRRD--ATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYT---HV-QRRKGTAMDYAE---- 216 (510) Q Consensus 147 ~~pl~~~~v~~d--~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~---~v-~~~~~~~~~~~s---- 216 (510) +++..+++...| ..+++...+|.++..- .+.+.-..+++|+ .+ +...+....... T Consensus 152 ~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~---------------~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~ 216 (479) T protein:vir:79 152 IIPAEEAIPIWDSKRQRELVAFIRFYYIED---------------IDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDE 216 (479) T ss_pred EEccceeEEEEeCCCCCceEEEEEEEEEee---------------cCCceEEEEEEEeCCcEEEEEecCCcccccccccc Confidence 666555444444 3466766666654321 0111122333332 11 111111110000 Q ss_pred ---EEEeeCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCC-C Q lcl|Aclame:pro 217 ---MYHEIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEA-K 292 (510) Q Consensus 217 ---v~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~-g 292 (510) .+...++.........++..+|++..+- +.+|+|-.+...+-+..++.+.-..........+|.+++.-. + T Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~ 291 (479) T protein:vir:79 217 YGKMTDIQEGHFRINNKEQGWGKVPFIPFKN-----NEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPG 291 (479) T ss_pred cccccccccccccccccccCCCcccEEEecC-----CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCc Confidence 0000011111122223446788887654 467999999999999999988888888888888887776421 1 Q ss_pred ccchhhhhcCCC-cceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cccCCCCCCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 293 GAVVDDYQDAEM-GDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVRITAEEAEN 370 (510) Q Consensus 293 ~~~~~~~~~~~~-G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~~~vTAtEi~~r~~E~~~ 370 (510) ....+....... +.+.-....+++.+... .+.......++.++..|...-.. +...-.....|++.+..+-.-+ . T Consensus 292 ~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Ai~~~~~~l-~ 368 (479) T protein:vir:79 292 TSLQEFIDNIRYYKSIKVDGGGGVDKLEIN--IPVEAKKELLDRLEKNIIIFGQGVNPESQNTGDKSGVALKFLYSLL-D 368 (479) T ss_pred cccccchhhhhhccceecCCCCcceEEecc--CCHHHHHHHHHHHHHHHHHHhCccccccccccchhHHHHHHHHHHH-H Confidence 111111111111 22221222334444322 46777788888888887665432 2211112345666554421111 1 Q ss_pred HhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCH Q lcl|Aclame:pro 371 TLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISL 450 (510) Q Consensus 371 ~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~ 450 (510) ....-..+.-.+.+.-+++.+..++...+........+++.+...+.. ..+..++.+ ..+++. +.. T Consensus 369 ~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~i~f~~~~p~-~~~~~a~~~-------~kl~g~------iS~ 434 (479) T protein:vir:79 369 LKCSKTEKKFKKAIRELLWFVCEYLKISGNKSYDYKTVQITFNHSMII-NEAEKIDMA-------AKSTGI------VSD 434 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCc-CHHHHHHHH-------HHHhcc------CcH Confidence 122223333333444444444444433332333344556655433321 122222221 112221 222 Q ss_pred HHHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 451 PKMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 451 d~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ..++. .++ ++.+ ++|++...+++..+. +......+..-|| T Consensus 435 et~l~----~l~-----~v~d~~~E~~ri~~E~~~~~-----------~~~~~~~~~~~~~ 475 (479) T protein:vir:79 435 ETIVS----NHP-----WVEDVNDELERLKKQEDTQK-----------EYDDLIPNNQDGV 475 (479) T ss_pred HHHHH----hCC-----CCCCHHHHHHHHHHHHHHHH-----------HHHhccCcccCCC Confidence 33332 222 1222 334433222221111 1111222333333 No 71 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=97.81 E-value=1.8e-05 Score=46.65 Aligned_cols=419 Identities=10% Similarity=0.010 Sum_probs=176.5 Q ss_pred ChhHHHHHHHHHhc---------------------------cCchHHHHHHHHhhccc---ccCCCCCC-cccccccccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRD---------------------------GSVEQRAIEFAKTTLPY---LMVDPMSG-SRGVVEHDFQ 49 (510) Q Consensus 1 ~k~~~~~r~~~lkr---------------------------~~~~~~w~e~~~~~~P~---~~~~~~~~-~~~~~~~~~d 49 (510) ++.++..+|+.-.+ .....+++++.+|..-. ........ ......++.. T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~ 92 (511) T protein:vir:96 13 LRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAH 92 (511) T ss_pred hhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcceeec Confidence 33333333332211 11122344444443321 00000001 1111224455 Q ss_pred chHHHHHHHHHHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHH Q lcl|Aclame:pro 50 SAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLI 129 (510) Q Consensus 50 stg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~ 129 (510) +.+...++..++.|.+ -|+ +++.+++. +. ..+...+..++|.....++.++.. T Consensus 93 n~~k~Iv~~~~~yl~g--~p~-----~~~~~~~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~~ 145 (511) T protein:vir:96 93 DYASYISDFINGYFLG--NPI-----QYQDDDKD-------------VL-------EAIEAFNDLNDVESHNRSLGLDLS 145 (511) T ss_pred chHHHHHHHHHhhhcc--CCc-----eeecCchH-------------HH-------HHHHHHHhhcCHHHHHHHHHHHHH Confidence 6666666666654442 111 12333321 11 234455777899999999999999 Q ss_pred hhCceEEEE--eCCC-CeEEEEEece-EEEeeCC-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEE Q lcl|Aclame:pro 130 VTGNALLYR--NSDE-ATVVAWSLRS-YAVRRDA-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHV 204 (510) Q Consensus 130 ~~G~~~l~~--~~~~-~~~~~~pl~~-~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v 204 (510) ++|.+.+++ +++. .++.+++..+ |++..|. .+++...+|.++....+ ....+.-..+++|+ T Consensus 146 i~G~a~~~vy~ded~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d------------~~~~~~~~~~~iyt-- 211 (511) T protein:vir:96 146 IYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID------------KTDEDEVFTVDLFT-- 211 (511) T ss_pred hcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc------------ccccceEEEEEEEe-- Confidence 999876554 4332 2455665555 4444443 36666566555332110 00000111222222 Q ss_pred EeecCCCeeEEEEEEeeCCe------eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 205 QRRKGTAMDYAEMYHEIDGV------RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYE 278 (510) Q Consensus 205 ~~~~~~~~~~~sv~~e~~~~------~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~ 278 (510) ++.-+. |...++. ........++..+|++..+- +.+|+|-.+..++-+..++.+.-...... T Consensus 212 ---~~~i~~----~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~g~gd~e~v~~liDa~d~~~S~~~~~~ 279 (511) T protein:vir:96 212 ---SHGVYR----YLTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYM 279 (511) T ss_pred ---CCcEEE----EEecCCCcccccccccccccccCCceeeEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHH Confidence 111010 1111111 11112233445788877653 45799999999999999999888888888 Q ss_pred HHhhCCceeeCCCCccchhhhhcCCCccee--------------cCCccccccccCCCccchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 279 LESLEVLNLVDEAKGAVVDDYQDAEMGDYV--------------PGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF 344 (510) Q Consensus 279 ~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~--------------~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af 344 (510) ....+|.+++.-........+.....+... .+...+++.+. ...+.+.....++.+.+.|...- T Consensus 280 ~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~e~~~~~L~~~I~~~s 357 (511) T protein:vir:96 280 SDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMFT 357 (511) T ss_pred HHhhCceeeeecCccCCchhhcccccccceecccccccccccccCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHh Confidence 888888766543222333332221111111 11111222222 22355666777777777775533 Q ss_pred hh-cc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCc--cceeeEEee--cHHH Q lcl|Aclame:pro 345 MY-GA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLIT--KQHKPAIET--GLPA 418 (510) Q Consensus 345 ~~-~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~--~~~~~~~vs--~l~~ 418 (510) +. +. ...-+...|+..+...-. .+........+.-.+.+.-++..++.++...+-..... ..+++.+.- +.+. T Consensus 358 ~~p~~~~~~~~~n~Sg~Al~~~~~-~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~ 436 (511) T protein:vir:96 358 NTPNMKDDNFSGTQSGEAMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSL 436 (511) T ss_pred CCcccccccccccchHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCCCCCH Confidence 22 11 111123457666544322 12222233333333333333333344443222222222 245555543 2222 Q ss_pred HHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 419 LSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLL 497 (510) Q Consensus 419 l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~ 497 (510) +..+ +.+ ..+++. +....+++ .++ ++.+ ++|++...+++.. +..+++. T Consensus 437 ~e~~---~~~-------~kl~G~------iS~et~l~----~l~-----~v~D~~~E~~ri~~E~~~-~~~~~~~----- 485 (511) T protein:vir:96 437 IEEL---KAY-------IDSGGK------ISQTTLMS----LFS-----FFQDPELEVKKIEEDEKE-SIKKAQK----- 485 (511) T ss_pred HHHH---HHH-------HHHhcc------CChHHHHH----hCC-----CCCCHHHHHHHHHHHHHH-HHHHHhh----- Confidence 2222 211 111221 21122222 222 2222 3444433332211 1111100 Q ss_pred HHHHHhhcccCCC Q lcl|Aclame:pro 498 EGASDMTNALAGV 510 (510) Q Consensus 498 ~~a~~~~~~~ag~ 510 (510) ...+...+. T Consensus 486 ----~~~~~~~~~ 494 (511) T protein:vir:96 486 ----GIYKDPRDI 494 (511) T ss_pred ----ccccCCCCC Confidence 111112222 No 72 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=97.81 E-value=1.8e-05 Score=46.64 Aligned_cols=424 Identities=10% Similarity=0.009 Sum_probs=175.5 Q ss_pred ChhHHHHHHHHHhcc---------------------------CchHHHHHHHHhhccc---ccCCCCCC-cccccccccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDG---------------------------SVEQRAIEFAKTTLPY---LMVDPMSG-SRGVVEHDFQ 49 (510) Q Consensus 1 ~k~~~~~r~~~lkr~---------------------------~~~~~w~e~~~~~~P~---~~~~~~~~-~~~~~~~~~d 49 (510) ++.++..+|+.-.+. .-..+++.+.+|..-. +....... ......++.. T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~ 92 (511) T protein:vir:93 13 LRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAH 92 (511) T ss_pred hhhhhhhhhhhhhCCcccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcceeec Confidence 344444444332211 0111233333333221 00000000 1111234555 Q ss_pred chHHHHHHHHHHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHH Q lcl|Aclame:pro 50 SAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLI 129 (510) Q Consensus 50 stg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~ 129 (510) +.+...++..++.|++ -| ++++.+++. +.+ .+...+..++|.....++.++.. T Consensus 93 n~~k~Iv~~~~~yl~g--~p-----~~~~~~d~~-------------~~~-------~l~~~~~~n~~~~~~~~~~~~~~ 145 (511) T protein:vir:93 93 DYASYISDFINGYFLG--NP-----IQYQDDDKD-------------VLE-------VIEAFNDLNDVESHNRSLGLDLS 145 (511) T ss_pred chHHHHHHHHhhhhcc--cC-----eeeccCChH-------------HHH-------HHHHHHhhcCHhHHHHHHHHHHH Confidence 6666666666655543 12 122333321 222 33344667789999999999999 Q ss_pred hhCceEEEE--eCCC-CeEEEEEece-EEEeeCC-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE- Q lcl|Aclame:pro 130 VTGNALLYR--NSDE-ATVVAWSLRS-YAVRRDA-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH- 203 (510) Q Consensus 130 ~~G~~~l~~--~~~~-~~~~~~pl~~-~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~- 203 (510) ++|.+.+++ +++. .++.+++..+ |++.-|. .+++...+|.+.....+ ...++.-..+++|+. T Consensus 146 ~~G~ay~~vy~de~~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~------------~~~~~~~~~~~iyt~~ 213 (511) T protein:vir:93 146 IYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID------------KTDEDEVFTVDLFTSH 213 (511) T ss_pred hcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc------------ccccceEEEEEEEeCC Confidence 999876554 3332 2356666555 4444433 36776555555431100 000111122333321 Q ss_pred -E-EeecCCCeeEEEEEEeeCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 204 -V-QRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELES 281 (510) Q Consensus 204 -v-~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a 281 (510) | .....++.+.. .... .......+++.+|++.++- +..|+|-.+..++-+..++.+.-......... T Consensus 214 ~i~~~~~~~~~~~~-----~~~~-~~~~~~~~~g~vPvv~~~n-----n~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~ 282 (511) T protein:vir:93 214 GVYRYLTSRTNGLK-----LTPR-ENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDL 282 (511) T ss_pred cEEEEEecCCCccc-----cccc-cccccccCCCccceEEecC-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHh Confidence 1 00000000000 0001 1111222345788877653 45789999999999999998888888878878 Q ss_pred hCCceeeCCCCccchhhhhcCCCccee--------------cCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh- Q lcl|Aclame:pro 282 LEVLNLVDEAKGAVVDDYQDAEMGDYV--------------PGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY- 346 (510) Q Consensus 282 ~~~~~lv~~~g~~~~~~~~~~~~G~~~--------------~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~- 346 (510) .+|.+++.-........+.....+.+. .+...+++.+. ...+.+.....++.++..|...-.. T Consensus 283 ~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~L~~~I~~~s~~P 360 (511) T protein:vir:93 283 NDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMFTNTP 360 (511) T ss_pred hCcceeeecCcccCchhhcccccccceecccccccccccccCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCCc Confidence 888766542122222222221111111 11112222222 2235666777788888777554322 Q ss_pred cc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCcc--ceeeEEeecHHHHHHHH Q lcl|Aclame:pro 347 GA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITK--QHKPAIETGLPALSRSA 423 (510) Q Consensus 347 ~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~--~~~~~~vs~l~~l~r~~ 423 (510) +. ...-+...|+.-+...-. .+........+.-.+.+.-+++.++.++...+-...+.+ .+++.+.-.+ +-..++ T Consensus 361 ~~~~~~~~~n~Sg~Al~~~~~-~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~-p~n~~e 438 (511) T protein:vir:93 361 NMKDDNFSGTQSGEAMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNL-PKSLIE 438 (511) T ss_pred ccccccccccchHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccceEEeCCCC-CCCHHH Confidence 11 111223456665544322 222222333333333343344444444433222222222 3555553222 212222 Q ss_pred HHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 424 AVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASD 502 (510) Q Consensus 424 ~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~ 502 (510) .++.+ ..+++. +....++.. ++ ++.+ ++|++...+++. .+..++.. . T Consensus 439 ~~~~~-------~kl~g~------iS~et~~~~----l~-----~v~d~~~E~~ri~~E~~-~~~~~~~~---------~ 486 (511) T protein:vir:93 439 ELKAY-------IDSGGK------ISQTTLMSL----FS-----FFQDPELEVKKIEEDEK-ESIKKAQK---------G 486 (511) T ss_pred HHHHH-------HHHhcc------CchHHHHHh----CC-----CCCCHHHHHHHHHHHHH-HHHHHHhh---------h Confidence 22222 112221 212222322 21 2222 344443332221 11111100 1 Q ss_pred hhcccCCC Q lcl|Aclame:pro 503 MTNALAGV 510 (510) Q Consensus 503 ~~~~~ag~ 510 (510) ..+...+. T Consensus 487 ~~~~~~~~ 494 (511) T protein:vir:93 487 IYKDPRDI 494 (511) T ss_pred cccCCCCC Confidence 11111121 No 73 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=97.79 E-value=1.9e-05 Score=46.51 Aligned_cols=412 Identities=12% Similarity=0.047 Sum_probs=160.0 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHh--hcccccCCCCCCcccccc--ccccchHHHHHHHHHHHHHHhhcCccCcccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKT--TLPYLMVDPMSGSRGVVE--HDFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~--~~P~~~~~~~~~~~~~~~--~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~ 76 (510) +++.+.+++.... +-..+.+++|+= .+|++ +..-..++. +..-+-+..+|+.++..|. +-+ |+ T Consensus 16 ~~~~l~~~~~~~~--~rl~~l~~Yy~G~~~i~~~----~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~----~~g---~~ 82 (484) T protein:vir:77 16 AREEMLNLFTERT--QDLGDNTAYYESERRPDAV----GVTVPQQMQKLLAHVGYPRLYIDAIAARQE----LEG---FR 82 (484) T ss_pred HHHHHHHHHHHHH--HHHHHHHHHHhccccchhc----ccccchhHHhhhhhcCcHHHHHHHHHhhhc----cCc---ee Confidence 4555555554321 111233334321 11111 000011111 1223444555555555442 222 22 Q ss_pred cCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC-----------eE Q lcl|Aclame:pro 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA-----------TV 145 (510) Q Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~-----------~~ 145 (510) .++.. +..+ .+.+....++|.....++.++..++|.+.+++..+.. ++ T Consensus 83 --~~~~~------------~~~~-------~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i 141 (484) T protein:vir:77 83 --LGGAD------------KADE-------QLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPII 141 (484) T ss_pred --cCCcc------------hhHH-------HHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceE Confidence 22211 0111 2233466789999999999999999998765543321 36 Q ss_pred EEEEeceEEEeeC-CCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCe Q lcl|Aclame:pro 146 VAWSLRSYAVRRD-ATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGV 224 (510) Q Consensus 146 ~~~pl~~~~v~~d-~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~ 224 (510) ++++-.+.++..| ..+++...++.+.-. ....-..+++|+ + +..+ .|...+|. T Consensus 142 ~~~~p~~~~~~~D~~~~~~~~a~~~~~~~-----------------~~~~~~~~~~y~---~--~~~~----~~~~~~~~ 195 (484) T protein:vir:77 142 RVEPPTNLYAQIDPRTRQVMRAIRAIEDE-----------------EGNEVIGATLYL---P--NNTV----IWNREDGQ 195 (484) T ss_pred EEeccceeEEEecCCCCceEEEEEEEEee-----------------cCCcEEEEEEEe---c--CeEE----EEEecCCc Confidence 6676555444445 447766555544321 001112223332 1 1100 01111121 Q ss_pred ee-ccccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhhCCceeeC---CCCc-cch-- Q lcl|Aclame:pro 225 RV-GETGRWPIHLCPYIVPTWNLAPGEHYGRGHVED-YIGDFAKLSLLSEKLGLYELESLEVLNLVD---EAKG-AVV-- 296 (510) Q Consensus 225 ~~-~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~-~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~---~~g~-~~~-- 296 (510) .. ......++..+|++.++.+...++.+|+|-... ..+-+..++...-.+...+...+.|...+. ++.. ... T Consensus 196 ~~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~ 275 (484) T protein:vir:77 196 WVQVANVAHNLEMVPVIPIPNRTRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGVDPET 275 (484) T ss_pred eEeeccccCCCCCcceEEeccccccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhcccccc Confidence 11 111222346799999998888899999997654 446567777766666667666665544431 1100 000 Q ss_pred -hhhhcCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-----cCCCCCC-CCHHHHH------- Q lcl|Aclame:pro 297 -DDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAER-VTAEEVR------- 362 (510) Q Consensus 297 -~~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~-vTAtEi~------- 362 (510) ..+.....|.+......+++..++.. ++++ .-++.++.-|+....... +-....+ -++.-++ T Consensus 276 ~~~~~~~~~~~~~~~~~~~~~~~q~~~-~~~e---~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~ 351 (484) T protein:vir:77 276 GQTLFDAYLARILAFEDHESKAQQFSA-AELR---NFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLV 351 (484) T ss_pred cchhhhhhhhhhcccCCCCceeEeecC-CChH---HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHH Confidence 00111112222221112333333321 2333 344555555544332111 1111111 2333332 Q ss_pred HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccc--eeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 363 ITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQ--HKPAIETGLPALSRSAAVQSMLNASQVIAGLAP 440 (510) Q Consensus 363 ~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~--~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~ 440 (510) .+++++...+|..+.++ +..++.+. ++ ...+.+. +++.+.-+. +-..++.++.+..+.+ ...+ T Consensus 352 ~ka~~k~~~f~~~l~~~--------~~l~~~~~--~~-~~~~~~~~~i~v~w~~~~-~~s~~~~ad~~~kl~~---~g~g 416 (484) T protein:vir:77 352 KTVERKNKIFGGAWEQA--------MRVAYKVM--NG-GDIPPEYYRMESIWRDPS-TPTYAAKADAATKLYN---NGQG 416 (484) T ss_pred HHHHHHHHHHHHHHHHH--------HHHHHHHh--CC-CCcccccccceEEecCCC-CCCHHHHHHHHHHHHh---ccCC Confidence 23445555555444332 12222222 11 2222232 334432222 2122222322222212 1111 Q ss_pred hHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--------------- Q lcl|Aclame:pro 441 IAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTN--------------- 505 (510) Q Consensus 441 ~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~--------------- 505 (510) + +.- +.+...+|+... ..+|++++++++..++ ++... .+.++....+ T Consensus 417 i------~s~----et~~~~l~~~~~----~~~e~~~~~~ee~~~~--~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (484) T protein:vir:77 417 V------IPK----ERARIDMGYSIT----EREEMRKWDEEEQAQG--LGLMG--TMFGTDPSGGGNPDNPETPEPQPNP 478 (484) T ss_pred C------CCH----HHHHhcCCCChh----HHHHHHHHHHHHHHHH--HHHHh--hhccccccCCCCCCCCCcccccCCC Confidence 1 111 123333454321 1233333222222111 11110 1111111111 Q ss_pred --ccCC Q lcl|Aclame:pro 506 --ALAG 509 (510) Q Consensus 506 --~~ag 509 (510) .++| T Consensus 479 ~~~~~~ 484 (484) T protein:vir:77 479 AEEAAA 484 (484) T ss_pred ccccCC Confidence 1111 No 74 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=97.74 E-value=2.3e-05 Score=46.02 Aligned_cols=413 Identities=11% Similarity=-0.003 Sum_probs=160.3 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhcccc-cCCCCCCcccc--ccccccchHHHHHHHHHHHHHHhhcCccCccccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYL-MVDPMSGSRGV--VEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~~-~~~~~~~~~~~--~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l 77 (510) +-+.|.+.|+. +. .+.+.+.+|..-.. ...-+..-..+ ..+...+-+..+|++++..| +|.+ |++ T Consensus 17 ~~~~l~~~~~~--~~---~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l----~~~g---~~~ 84 (486) T protein:vir:42 17 VREEMISAFED--AS---KDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQ----AVEG---FRL 84 (486) T ss_pred HHHHHHHHHHH--HH---HHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHHhhh----cccc---eec Confidence 22233333332 11 22333333322110 00000000011 11223445566666655544 3433 222 Q ss_pred CCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC-----------CeEE Q lcl|Aclame:pro 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-----------ATVV 146 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~-----------~~~~ 146 (510) ++... ..+ .+.+.+..++|.....++.++..++|.+.+++..+. .+++ T Consensus 85 --~~~~~------------~~~-------~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~ 143 (486) T protein:vir:42 85 --GDADE------------ADE-------ELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIR 143 (486) T ss_pred --CCCch------------hHH-------HHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEE Confidence 22110 011 122335668899999999999999999876664321 1456 Q ss_pred EEEeceEEEeeC-CCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCee Q lcl|Aclame:pro 147 AWSLRSYAVRRD-ATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVR 225 (510) Q Consensus 147 ~~pl~~~~v~~d-~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~ 225 (510) +++-.+.++..| ..+++...+|.+.-. +.+.-..+++|+ ++.-. +|...+|.. T Consensus 144 ~~~p~~~~~i~d~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~y~-----~~~~~----~~~~~~~~~ 197 (486) T protein:vir:42 144 VEPPTRMHAEIDPRINRVSKAIRVAYDK-----------------EGNEIQAATLYT-----PMETI----GWFRADGEW 197 (486) T ss_pred EecccceEEEEeCCCCCeEEEEEEEEec-----------------CCCeEEEEEEEc-----CCcEE----EEEecCCcE Confidence 666555444444 567777666655310 001111222332 11100 111112222 Q ss_pred ec-cccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhhCCceeeC---CCCcc----ch Q lcl|Aclame:pro 226 VG-ETGRWPIHLCPYIVPTWNLAPGEHYGRGHVED-YIGDFAKLSLLSEKLGLYELESLEVLNLVD---EAKGA----VV 296 (510) Q Consensus 226 ~~-~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~-~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~---~~g~~----~~ 296 (510) .. ......++.+|++.++.+...+..+|+|=... ..+-+..++...-.+...++..+.|...+. ++... +. T Consensus 198 ~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~ 277 (486) T protein:vir:42 198 AEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDSETG 277 (486) T ss_pred EeecceecCCCCceEEEeccccccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCccccccccccc Confidence 11 12223446899999999888899999997665 346667777766666666666666654442 11000 00 Q ss_pred hhhhcCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-----cCCCCCC-CCHHHHHH------- Q lcl|Aclame:pro 297 DDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAER-VTAEEVRI------- 363 (510) Q Consensus 297 ~~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~-vTAtEi~~------- 363 (510) ..+.....|.+......+++..+.. .+++ ...++.++.-|++...... +...... .++.-++. T Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~q~~-~~~~---e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ 353 (486) T protein:vir:42 278 QTLFDAYLARILAFEDAEGKIQQFS-AAEL---ANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIK 353 (486) T ss_pred cchhhhhhchhcccCCCCceEEeec-ccCH---HHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHH Confidence 0111111222221111223333332 2233 3445555555555432221 1111111 23333322 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhh-cCCCCCCcc--ceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 364 TAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITK--QHKPAIETGLPALSRSAAVQSMLNASQVIAGLAP 440 (510) Q Consensus 364 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~p~~--~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~ 440 (510) +++++...+++.+ .+++.++.. .+....+.+ .+++.+..+. +-..++.++.+..+.+ ...+ T Consensus 354 ka~~~~~~f~~~l------------~~~~~l~~~~~~~~~~~~d~~~i~v~w~~~~-~~s~~~~ad~~~kl~~---~~~g 417 (486) T protein:vir:42 354 KVERKNLMFGGAW------------EEAMRIAYRIMKGGDVPPDMLRMETVWRDPS-TPTYAAKADAATKLYG---NGQG 417 (486) T ss_pred HHHHHHHHHHHHH------------HHHHHHHHHHhcCCCccccceeeeEEecCCC-CCCHHHHHHHHHHHHh---cccC Confidence 2233344444433 333333211 111222333 2344443221 2222233333332222 2112 Q ss_pred hHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-------hcccC---CC Q lcl|Aclame:pro 441 IAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDM-------TNALA---GV 510 (510) Q Consensus 441 ~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~-------~~~~a---g~ 510 (510) . +.- +. +...+|+... ..+|++...+++..+..+... .+..+... ..+.+ ++ T Consensus 418 ~------~s~-et---~~~~lg~~~d----~~~e~~~~~~e~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (486) T protein:vir:42 418 V------IPR-ER---ARIDMGYSVK----EREEMRRWDEEEAAMGLGLLG----TMVDADPTVPGSPSPTAPPKPQPAI 479 (486) T ss_pred C------CCH-HH---HHhcCCCChh----HHHHHHHHHHHHHHHHHHHHH----HhhcCCCCCCCCCCCCCCCCCCccc Confidence 1 111 11 1233554322 113333322222222211110 11111000 00000 00 No 75 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=97.71 E-value=2.6e-05 Score=45.69 Aligned_cols=418 Identities=12% Similarity=0.070 Sum_probs=167.5 Q ss_pred Ch---hHHHHHHHHHhccCchHHHHHHHHhhccc---ccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcc Q lcl|Aclame:pro 1 MK---STAAMLWEKLRDGSVEQRAIEFAKTTLPY---LMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPF 74 (510) Q Consensus 1 ~k---~~~~~r~~~lkr~~~~~~w~e~~~~~~P~---~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~W 74 (510) ++ +.+.+..++.+. .-..+++.+.+|..-. ..............++..+.+...++..++.|.+ -|+ T Consensus 13 ~~~~~~~~~~~i~~~~~-~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~iv~~~~~~l~g--~~~---- 85 (489) T protein:vir:99 13 SKLWIDQLKNYISRFKA-EQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYITVFEQGYMLG--VPV---- 85 (489) T ss_pred CCCCHHHHHHHHHHHHH-HHHHHHHHHHHHhcccCccccccccccccCCcceeecchHHHHHHHHhhhhcc--CCc---- Confidence 22 223333333321 1122444445443211 0010000001112245666777777777666543 122 Q ss_pred cccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeC---CCC--eEEE Q lcl|Aclame:pro 75 FRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNS---DEA--TVVA 147 (510) Q Consensus 75 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l--~~~~---~~~--~~~~ 147 (510) +++.+|+ .+.++|.. .+...+|.....++.++..++|.+.+ |+.+ ..+ ++.+ T Consensus 86 -~~~~~d~-------------~~~~~l~~-------~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~~~i~~ 144 (489) T protein:vir:99 86 -EYKNENK-------------DLQAAIDL-------MSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTEVKLYQ 144 (489) T ss_pred -eeecCCh-------------hHHHHHHH-------HHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcceEEEE Confidence 2233332 12333333 35567888899999999999998764 4422 222 3566 Q ss_pred EEeceEEEeeCCC--CceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCee Q lcl|Aclame:pro 148 WSLRSYAVRRDAT--GRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVR 225 (510) Q Consensus 148 ~pl~~~~v~~d~~--G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~ 225 (510) ++..+++...|.. +++...+|.++..- .+......+++|+ ++.-+.|.+...+.++.. T Consensus 145 ~~p~~~~~v~dd~~~~~~~~~i~~~~~~~---------------~~~~~~~~~~~y~-----~~~i~~~~~~~~~~~~~~ 204 (489) T protein:vir:99 145 LPAEQTFVIYDDTYQRNSLMAVHFYDIDY---------------GSGKRKQIIKAYT-----SDTIYTYEDYNLETKGMR 204 (489) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEEec---------------CCCceEEEEEEEe-----CCcEEEEEecCCCcccce Confidence 7666655555533 45555555443210 0011112233332 111111111111222322 Q ss_pred eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccch----hh--- Q lcl|Aclame:pro 226 VGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV----DD--- 298 (510) Q Consensus 226 ~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~----~~--- 298 (510) +......++..+|++..+. ...|+|-.....+-+..++.+.-.+.........|.+++. |...+ .. T Consensus 205 ~~~~~~~~~g~vPvv~~~n-----~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~--g~~~~~~~~~~~~~ 277 (489) T protein:vir:99 205 LKDYEGHFFKGVPVNEYAN-----NEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIA--GNAYTGADENDYLD 277 (489) T ss_pred ecccccccCCceeEEEeec-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhc--cCCcccccchhhhh Confidence 3323333446789887764 3578998889999999999988888888777777765552 11110 00 Q ss_pred -hhcCCCc-----------ceec--------CCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-ccc-CCCCCCC Q lcl|Aclame:pro 299 -YQDAEMG-----------DYVP--------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GAN-QRDAERV 356 (510) Q Consensus 299 -~~~~~~G-----------~~~~--------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~-~~~~~~v 356 (510) ....++| .+.. |...+++.+ ....+.......++.+...|...-.. +.. ...+... T Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~ 355 (489) T protein:vir:99 278 DGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFL--KKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKFSGVQ 355 (489) T ss_pred hcccccccccccccccccceeeeeccccCccccccceeee--eecCChHHHHHHHHHHHHHHHHHhCCcccccccccccc Confidence 0000011 0000 000011111 11224455566666666666432211 111 1112344 Q ss_pred CHHHHHHH-------HHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCC---CCccceeeEEeecHHHHHHHHHHH Q lcl|Aclame:pro 357 TAEEVRIT-------AEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQG---LITKQHKPAIETGLPALSRSAAVQ 426 (510) Q Consensus 357 TAtEi~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~---~p~~~~~~~~vs~l~~l~r~~~~~ 426 (510) |+..+..+ ++++...++..+. -+++.++.++...+... .-..++.+.+.-.+. -..++.++ T Consensus 356 Sg~Al~~~~~~l~~k~~~k~~~~~~~l~--------~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p-~d~~~~~~ 426 (489) T protein:vir:99 356 SGESMKYKLMASDNYREKQERLFKKGLM--------RRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLP-QNDNEIVT 426 (489) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHhhcCCccccccccccceEEeCCCCC-cCHHHHHH Confidence 66554332 4444444444433 33333333333222111 111234554432221 11222222 Q ss_pred HHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH--- Q lcl|Aclame:pro 427 SMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQA-QAAQETLLEGASD--- 502 (510) Q Consensus 427 ~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~-~~a~~~~~~~a~~--- 502 (510) .+.. +++. +....++..+ -+++ +++++++.++.++++... ...+......... T Consensus 427 ~~~k-------l~gi------is~et~~~~l---~~v~-------~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 483 (489) T protein:vir:99 427 AAQN-------LYGI------VSDQTIFEIL---NTVT-------GVDAEAELKRLKEEADKKQSLPEPRLVGDASGQEE 483 (489) T ss_pred HHHH-------Hhcc------CCHHHHHHhc---CCCC-------chhHHHHHHHHHHHHHHHhccccccccCCCCCCcC Confidence 2221 1111 2222333321 1222 112222222221111111 1111101100100 Q ss_pred -hhccc Q lcl|Aclame:pro 503 -MTNAL 507 (510) Q Consensus 503 -~~~~~ 507 (510) ....+ T Consensus 484 ~~~~~p 489 (489) T protein:vir:99 484 PTAEKP 489 (489) T ss_pred CCCCCC Confidence 11111 No 76 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=97.68 E-value=2.9e-05 Score=45.42 Aligned_cols=402 Identities=11% Similarity=0.026 Sum_probs=171.0 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhcccccCCC-CCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccCC Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDP-MSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSEL 79 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~~~~~~-~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~ 79 (510) .++.-..||++|+ ++++=.-+...... .........++..+.+...+++.++.|++- |+. | .. T Consensus 5 ~~~~~~~r~~~l~---------~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~--~~~--~---~~ 68 (440) T protein:vir:95 5 FLGSQKQRLAILA---------SYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGN--PVS--I---GV 68 (440) T ss_pred HHHHHHHHHHHHH---------HHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheecc--Cce--E---ee Confidence 3333333444432 22221111111111 011111223455566666666655554321 211 2 22 Q ss_pred ChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCC-CeEEEEEeceEEEe Q lcl|Aclame:pro 80 TDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE-ATVVAWSLRSYAVR 156 (510) Q Consensus 80 ~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~--~~~~-~~~~~~pl~~~~v~ 156 (510) .+... ++..+ .+.+.+..++|.....++.++..++|.+.+++ +++. .++++++..+.++. T Consensus 69 ~~~~~----------~~~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~i~~~~p~~~~~~ 131 (440) T protein:vir:95 69 MEGGS----------ADQLS-------TIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVDRVVLISPLEMFVI 131 (440) T ss_pred CCCcc----------HHHHH-------HHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEE Confidence 22111 11111 23445778899999999999999999987655 4432 24667776676666 Q ss_pred eCCC--CceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeecccccccc Q lcl|Aclame:pro 157 RDAT--GRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPI 234 (510) Q Consensus 157 ~d~~--G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~ 234 (510) .|+. +++.-.+|.++.. ....++||+.. .-..|.......++-........++ T Consensus 132 ~d~~~~~~~~~~i~~~~~~--------------------~~~~~~vyt~~-----~~~~~~~~~~~~~~~~~~~~~~~~~ 186 (440) T protein:vir:95 132 RDLTVEQNIIAAVHLPIYA--------------------DKVNMTVYTKD-----KVITYKPYSNNSVRLVVDDVKKHSY 186 (440) T ss_pred EcCCCCCceEEEEEEEEec--------------------CceEEEEEeCC-----eEEEEEEecCCccceeecceeeccC Confidence 6654 4565555544210 01123344210 0000000000000111111112234 Q ss_pred ccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCC--C-CccchhhhhcCC-Cccee-c Q lcl|Aclame:pro 235 HLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE--A-KGAVVDDYQDAE-MGDYV-P 309 (510) Q Consensus 235 ~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~--~-g~~~~~~~~~~~-~G~~~-~ 309 (510) ..+|++.++. +.+|.|=.+...+-+..+|.+.-...........|.+++.- . ....++...... .+.+. + T Consensus 187 g~vPvv~~~n-----~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~ 261 (440) T protein:vir:95 187 NDVPVVEWWN-----NRFRMGDYESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFLK 261 (440) T ss_pred ceeeEEEeeC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccchhhhhhccceecc Confidence 5789887653 45799999999999999999999999988888888766521 0 111122211111 11111 1 Q ss_pred --------CCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHH-------HHHHHHHHh Q lcl|Aclame:pro 310 --------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRI-------TAEEAENTL 372 (510) Q Consensus 310 --------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~-------r~~E~~~~L 372 (510) +...+++.+.. ..+.+.....++.++..|...-.. +. ...-+...|+.-+.. +++++...+ T Consensus 262 ~~~~~~~~~~~~~~~~lt~--~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~ 339 (440) T protein:vir:95 262 TGISTTGQQTTADASYIYK--QYDVNGTEAYKNRLANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYF 339 (440) T ss_pred cccccccCCCCcceeEEee--cCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 11122333322 235666677788887777553321 11 111123457665533 344445554 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecH--HHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCH Q lcl|Aclame:pro 373 GGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGL--PALSRSAAVQSMLNASQVIAGLAPIAQLDPRISL 450 (510) Q Consensus 373 Gpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l--~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~ 450 (510) +..+.+ +++..+.++....-.......+++.+.-++ +.++.++-+.++ +++ +.. T Consensus 340 ~~~l~~--------~~~li~~~~~~~~~~~~~~~~v~i~f~~~~p~~~~~~ad~~~kl----------~g~------iS~ 395 (440) T protein:vir:95 340 TKALRR--------RYELISNIHKAINGPVIEANKLTFTFHPNIPQDVWTEIKAYIEA----------GGE------ISQ 395 (440) T ss_pred HHHHHH--------HHHHHHHHHhhcCCcccccccceEEeCCCCCCCHHHHHHHHHHH----------hcc------CcH Confidence 444332 122222222211111222234555553322 222222222221 121 222 Q ss_pred HHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 451 PKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 451 d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ..+++.+ -+++ .++|+++..+++...+... ..+.+..-.|- T Consensus 396 et~~~~l---~~~d------~~~E~~ri~~E~~~~~~~~----------~~~~~~~~~~~ 436 (440) T protein:vir:95 396 ETLMENA---SFTD------YKTEHSRILKQGGSSDLEI----------GQIVGDADVGQ 436 (440) T ss_pred HHHHHhC---CCCC------cHHHHHHHHHHHHHhhhhH----------HhhccCCCCCC Confidence 2333322 1222 2344443332222111110 11111111111 No 77 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=97.65 E-value=3.3e-05 Score=45.15 Aligned_cols=429 Identities=10% Similarity=-0.025 Sum_probs=159.8 Q ss_pred ChhHHHH-HHHHHh--ccCchHHHHHHHH--hhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCccc Q lcl|Aclame:pro 1 MKSTAAM-LWEKLR--DGSVEQRAIEFAK--TTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~k~~~~~-r~~~lk--r~~~~~~w~e~~~--~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF 75 (510) |..+... ..+++. +.++. ..+++|+ +.++++-. .....-+..++.-+-+...|++++..| ++. .|+ T Consensus 13 ~~~~~~~~L~~~~~~~~~r~~-~~~~YY~G~~~i~~~~~--~~~~~~~~~~~~~n~~~~ivd~~~~~l----~~~--g~~ 83 (485) T protein:vir:24 13 DPAIARDEMVSAFEDQNQNLR-SNTSYYEAERRPEAIGV--TVPVQMQSLLAHVGYPRLYVDSIAERQ----AVE--GFR 83 (485) T ss_pred chHHHHHHHHHHHHHHHHHHH-HHHHHHhccCchhhcCc--ccchhhhhhhhccchHHHHHHHHhhhh----ccC--cee Confidence 3333221 222221 11111 1222322 11222100 000011111233455666666666554 332 222 Q ss_pred ccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC-----------e Q lcl|Aclame:pro 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA-----------T 144 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~-----------~ 144 (510) .++..- ..+. +.+.+..++|.....+..++..++|.+.+++..+.. + T Consensus 84 ---~~~~~~------------~~~~-------l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~ 141 (485) T protein:vir:24 84 ---LGDADE------------ADEE-------LWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPL 141 (485) T ss_pred ---cCCCch------------hHHH-------HHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcce Confidence 222110 1111 223355678999999999999999998876654321 4 Q ss_pred EEEEEeceEEEeeC-CCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCC Q lcl|Aclame:pro 145 VVAWSLRSYAVRRD-ATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDG 223 (510) Q Consensus 145 ~~~~pl~~~~v~~d-~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~ 223 (510) +++++-.+.++..| ..+++...++.+.-. .......+++|+ + +.- -.|...+| T Consensus 142 i~~~~p~~~~~i~D~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~y~---~--~~~----~~~~~~~~ 195 (485) T protein:vir:24 142 IRVEPPTRMYAEIDPRIGRPAKAIRVAYDA-----------------EGNEIQAATLYT---P--NET----FGWFRAEG 195 (485) T ss_pred EEEeccceeEEEeeCCcCceeEEEEEEEee-----------------cCCeEEEEEEEc---C--CcE----EEEEecCC Confidence 66676666555555 447766655554210 011112223332 1 110 01112233 Q ss_pred eeec-cccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhhCCceeeC---CCCcc-c-- Q lcl|Aclame:pro 224 VRVG-ETGRWPIHLCPYIVPTWNLAPGEHYGRGHVED-YIGDFAKLSLLSEKLGLYELESLEVLNLVD---EAKGA-V-- 295 (510) Q Consensus 224 ~~~~-~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~-~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~---~~g~~-~-- 295 (510) .... .....+++.+|++.++.+...+..||+|-..+ ..+-+..++...-.+...+...+.|...+. ++... . T Consensus 196 ~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~ 275 (485) T protein:vir:24 196 EWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPE 275 (485) T ss_pred ceEeecccccCCCcccEEEeccCcccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccccccc Confidence 2221 11123346799999998888888999997765 345567777766666667776666654432 11100 0 Q ss_pred -hhhhhcCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-----cCCCCCC-CCHHHHHHHHHHH Q lcl|Aclame:pro 296 -VDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAER-VTAEEVRITAEEA 368 (510) Q Consensus 296 -~~~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~-vTAtEi~~r~~E~ 368 (510) ...+....+|.+......+++..+.. .++++ ..++.++.-|.+...... +...... .++.-++.. ... T Consensus 276 ~~~~~~~~~~~~i~~~~~~~~~~~q~~-~~~~e---~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~-~~~ 350 (485) T protein:vir:24 276 TGQTLFDAYLARILAFEDAEGKIQQFS-AAELA---NFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAA-ESR 350 (485) T ss_pred cccchhhhcccceeccCCCCceEEeec-ccchH---HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHH-HHH Confidence 00111112232211111222322332 12333 344555555544332211 1111111 233333221 111 Q ss_pred HHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcC Q lcl|Aclame:pro 369 ENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRI 448 (510) Q Consensus 369 ~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~i 448 (510) +........+.-.+-+.-++..++.++...+.+ .....+++.+..+. +-..++.++.+..+.+ ...+. + T Consensus 351 l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~-~d~~~i~v~f~~~~-~~s~~~~ad~~~kl~~---~g~~~------~ 419 (485) T protein:vir:24 351 LIKKVERKNAIFGGAWEEAMRLAYRLMKGGDVP-PDMLRMETVWRDPS-TPTYAAKADAATKLYG---NGQGV------I 419 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCc-cccceeeEEecCCC-CCCHHHHHHHHHHHHh---ccccc------C Confidence 112222222222222222222223322222221 11123444443222 1122222222222222 11111 1 Q ss_pred CHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHhhcccCCC Q lcl|Aclame:pro 449 SLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLE---GASDMTNALAGV 510 (510) Q Consensus 449 d~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~---~a~~~~~~~ag~ 510 (510) ..+. +...+|+.... .+++++..+++..+..+...+...... ++.+.+.++.+. T Consensus 420 s~et----~~~~l~~~~d~----~~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~ 476 (485) T protein:vir:24 420 PRER----ARKDMGYSIAE----REEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPNPTPAPKPQ 476 (485) T ss_pred CHHH----HHhhCCCCHhH----HHHHHHHHHHHhhhhhhHHHhhcccCCCCCCCCCCCCCCCCc Confidence 1112 23445554321 122332222221111111111000000 000001111111 No 78 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=97.64 E-value=3.4e-05 Score=45.07 Aligned_cols=465 Identities=13% Similarity=0.094 Sum_probs=181.2 Q ss_pred Ch-----------hHHHHHHHHH---hccCchH---HHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHH Q lcl|Aclame:pro 1 MK-----------STAAMLWEKL---RDGSVEQ---RAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKL 63 (510) Q Consensus 1 ~k-----------~~~~~r~~~l---kr~~~~~---~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l 63 (510) |- +|+.++|.+. .|..|++ +.+.+-+... ......+... +.| |-|.+.+ T Consensus 1 m~~~~~~~~~~tpe~la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~--~~~~~~~~~~----~r~--------nl~~sni 66 (663) T protein:vir:34 1 MNESQPTDFADTPQGWAQRWQEEMSAAREPLEKWHTQGKEIVKRYR--DERDSAHDAE----TRW--------NLFSTNI 66 (663) T ss_pred CCccccccchhcchhHHHHHHHHHHHHHhccchHHHHHHHHHHHhh--ccccCCCccc----ccc--------chhhhhH Confidence 33 4788899865 2544443 2223322221 0111111111 112 3444444 Q ss_pred HHhhcCc-----cCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHH--HhcCCHHHHHHHHHHHHhhCceEE Q lcl|Aclame:pro 64 ARSLFPT-----GIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRL--FQNASLAVLTQVIKLLIVTGNALL 136 (510) Q Consensus 64 ~~~ltpp-----~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l--~~snf~~~~~~~~~~l~~~G~~~l 136 (510) +..+ |. -.|=++=...|.. ..-.+..-+.+||.+...+ +..+|+..+..+..+.+..|-+++ T Consensus 67 ~~i~-P~iYar~P~p~V~~rf~d~d----------~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~ 135 (663) T protein:vir:34 67 QTQM-ASLYGQTPKVSVSRRFADAD----------DDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLC 135 (663) T ss_pred HHHh-hhhhcCCCcceeeecccCcc----------cchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceE Confidence 3322 10 0111111111110 0113444455677665566 447799999999999877665554 Q ss_pred ---EEe-------------CCC----------------Ce--EEEEEeceEEEeeC-CCCceeEEEEEEEecHHHHhHHh Q lcl|Aclame:pro 137 ---YRN-------------SDE----------------AT--VVAWSLRSYAVRRD-ATGRWMDIVLKQRYKSKDLDDVY 181 (510) Q Consensus 137 ---~~~-------------~~~----------------~~--~~~~pl~~~~v~~d-~~G~v~~i~r~~~~t~~~l~~~~ 181 (510) |.. ++. +. +..+.-.+|.+..- .--.|+=|.++-.||-+++.+.| T Consensus 136 ~v~Ye~~~~~~~~~~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~dfl~~pAr~W~ev~wva~r~~mtk~e~~~rf 215 (663) T protein:vir:34 136 RIRYEVEWEEVAGVDAILDEATGAELAAAVPPTQRKAYECVETDYLHWQDVLWSPARVWHEVRWLAFRNLLDMREFNARF 215 (663) T ss_pred EEEeecccchhccccccCCCccccchhcccccchhhcccceeeeeechhhcccchhhccccccceeeeccCCHHHHHHhh Confidence 422 000 01 11122122322211 01367788899999999999999 Q ss_pred hHHhhccc----------ccC------CCCceEEEEEEEEeecCCCeeEEEEEEeeCCeee--------ccccccccccC Q lcl|Aclame:pro 182 KQDLMRAG----------RNL------SGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRV--------GETGRWPIHLC 237 (510) Q Consensus 182 ~~~~~~~~----------~~~------~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~--------~~~~~y~~~~~ 237 (510) +.++.+.. .++ +...++.|+....++..+ |||-++|... +.+.+| .-| T Consensus 216 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~VwEIWdK~~~~------V~w~~eg~~~~L~~~~p~lgl~~f--fPc 287 (663) T protein:vir:34 216 DADGSRNLWASVPKVGKPKDGKDGQSCHPWDRAEVWEIWDKGGRK------VDWYVEGYSAVLDTQPDPLGLESF--FPC 287 (663) T ss_pred cCChhhhhhhhccCcCCccccCCCCCcchhcCcceeEEEecCCcE------EEEEEcCcceecccCCCCCCCCCC--CCC Confidence 76542110 011 112355555433333332 4444444321 123333 357 Q ss_pred ceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccch-hhhhcCCCcceecC------ Q lcl|Aclame:pro 238 PYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV-DDYQDAEMGDYVPG------ 310 (510) Q Consensus 238 P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~-~~~~~~~~G~~~~g------ 310 (510) |+...=....++ .-.....--+=.-++.+|.+++.+-. ...+++|.++++-+...+. +.+..+..+.++|= T Consensus 288 Prpl~~~~~~ds-~ipvpd~~~y~~~~~E~n~~t~Rin~-l~d~ikv~gvy~~~~g~~i~~~l~~a~~n~lvpV~~~~~~ 365 (663) T protein:vir:34 288 PKPLLANWTTDK-VVPRPDFVLAQDLYKEIDLVSTRITL-LERAIRVVGVYDKSSGLTIGRLLSEAAQNDLIPVENWLTF 365 (663) T ss_pred cccccceecCCC-eecCCcHHHHHHHHHHHHHHHHHHHH-HHhhhhhceeeccccchhHHHHHHHhhCCCceecchhhhh Confidence 887766666655 44333333777778889988877654 4567889999874332222 33444444444441 Q ss_pred -----CccccccccCCCccchHHHHHHHHHHHHHHHHHHh-hccc---CCC----CCCCCHHHHHHHHHHHHHHhhhhHH Q lcl|Aclame:pro 311 -----GAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM-YGAN---QRD----AERVTAEEVRITAEEAENTLGGTYS 377 (510) Q Consensus 311 -----~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~-~~~~---~~~----~~~vTAtEi~~r~~E~~~~LGpv~~ 377 (510) ....|.-+++. ....+...+-+.+..|+...+ .+++ .|+ .+..||..|.. +.++.-+. T Consensus 366 ~~~gg~~k~I~~~pi~---~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rga~~a~ETatAQ~IKs------q~gS~RIq 436 (663) T protein:vir:34 366 ADKGGLRGVVDWFPLE---PVVAALTSLRDYRRELVDALHQVTGMADIMRGASDPRETAMAQGVKA------KFGSIRLQ 436 (663) T ss_pred hhhcCccchhhcccch---hHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhcccCcchhhHHHHHHH------HHHhHHHH Confidence 11222222222 111122222333444444442 2322 232 23334444422 33333344 Q ss_pred HHHHHHH---HHHHHHHHHHH------------hhcCCCC---CCc----------cceeeEEeec----HHHHHHHHH- Q lcl|Aclame:pro 378 LLAENLQ---SPLAYVCLSEV------------DDALLQG---LIT----------KQHKPAIETG----LPALSRSAA- 424 (510) Q Consensus 378 rl~~E~l---~Pli~r~~~il------------~~~~l~~---~p~----------~~~~~~~vs~----l~~l~r~~~- 424 (510) ..++|+. .-++...-.|| ....+|. +.+ ..+++.+-+. -..++..+. T Consensus 437 e~qdevqR~arDi~ql~AEIl~~~~~~etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~D~~~eK~~~ 516 (663) T protein:vir:34 437 RLQDEVARFASDIQRLKAEVIAEHYDVASILAQANAEFTFDKELAPKAAELIKSRFSMYRVEVKPEAVSLQDFAALRNEK 516 (663) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHhcCCCCcccchhHHHHHHhcCCCcceeeeeccCCCCcCChHHHHHHH Confidence 4443321 11222222222 1122221 001 1233444221 112222222 Q ss_pred ---HHHHHHHHHHHHhhcCh-HhHhh---------------cCCHHHHHHHHHHHcC------CCHhhccCCHHHHHHHH Q lcl|Aclame:pro 425 ---VQSMLNASQVIAGLAPI-AQLDP---------------RISLPKMMDTIWAAFS------VDTSQFYKSADELQAEA 479 (510) Q Consensus 425 ---~~~~~~~~q~~~~~~~~-~q~~~---------------~id~d~~~~~~a~~~G------vp~~~i~~s~ee~~~~~ 479 (510) +..+..++|.++.+.+. |+..+ ..+.+.+++.+.+++- .++. .-.-..+.++.. T Consensus 517 ~E~l~~i~~~~qq~~pl~~q~p~~~p~l~Ellk~~~~~f~~~~qie~ai~~~~~~~e~aa~~~~~~~-pa~~~~~~k~~~ 595 (663) T protein:vir:34 517 MEVLSGIASFMQGVAPLAQQVPGSAPFLLQMLKWSVSGLRGSSTIEGVLDKAIAAAEEAQKQAAQQS-PAPQQPDPKVVA 595 (663) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhcCChhhhHHHHHHHHHhhhHHHhhccCCCC-cccchhhHHHHH Confidence 22333344443322221 22111 1233333444333221 1110 000001111111 Q ss_pred HHHHHHHHHHHHHHHHHH----------HH-HHHhh--------cccCCC Q lcl|Aclame:pro 480 EEQRRQAAQAQAAQETLL----------EG-ASDMT--------NALAGV 510 (510) Q Consensus 480 ~~~~qqa~~~~~a~~~~~----------~~-a~~~~--------~~~ag~ 510 (510) ++.+.|...|.+...... +. ..+.. ..+.++ T Consensus 596 ~q~k~q~~~aeAq~e~q~~~~~~ql~~~~~~~k~~~~a~~~~~~a~q~~~ 645 (663) T protein:vir:34 596 QAMKGQQEMAKVQAEVQGDLLRIQAETQANETKERQQAEWNVREAAQKNL 645 (663) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH Confidence 111111111111000000 00 00011 111111 No 79 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=97.63 E-value=3.5e-05 Score=45.03 Aligned_cols=412 Identities=11% Similarity=0.040 Sum_probs=173.2 Q ss_pred ChhHHHHHHHHHh-ccCchHHHHHHHHh--hccccc---CCCCCC-ccccccccccchHHHHHHHHHHHHHHhhcCccCc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-DGSVEQRAIEFAKT--TLPYLM---VDPMSG-SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~k~~~~~r~~~lk-r~~~~~~w~e~~~~--~~P~~~---~~~~~~-~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~ 73 (510) -++.+.+..++.+ |-.....+++++.- -++.+- ...... ......++..+-+...++..++.|.+ -| T Consensus 28 ~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g--~p---- 101 (474) T protein:vir:95 28 QEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRITTNFHQNLVDQKVSYVAS--KP---- 101 (474) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhccccccccccccccccccceeccchHHHHHHHHHhhhcc--CC---- Confidence 2222222222221 22233344444431 111111 111111 11112345566777777777766543 12 Q ss_pred ccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC-C--eEEEEEe Q lcl|Aclame:pro 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-A--TVVAWSL 150 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~-~--~~~~~pl 150 (510) +.+..+|+. +.+.| ...+ .+||...+.++.++...+|.+.+++..+. + ++.+++. T Consensus 102 -~~~~~~d~~-------------~~~~l-------~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p 159 (474) T protein:vir:95 102 -VTYSCEDES-------------VLKII-------HDVL-DTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPA 159 (474) T ss_pred -ceeccCchH-------------HHHHH-------HHHH-hccHHHHHHHHHHHHhhcCcEEEEEEecCCCceEEEEEcc Confidence 123343322 11111 1222 36899999999999999998876554332 3 3555554 Q ss_pred ce-EEEeeCC-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE--E-EeecCCCeeEEEEEEeeCCee Q lcl|Aclame:pro 151 RS-YAVRRDA-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH--V-QRRKGTAMDYAEMYHEIDGVR 225 (510) Q Consensus 151 ~~-~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~--v-~~~~~~~~~~~sv~~e~~~~~ 225 (510) .+ |.+..|. .|++.-++|.++.. ....+++|+. + +.+..++...........+.. T Consensus 160 ~~~~~v~d~~~~~~~~~~i~~~~~~--------------------~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~ 219 (474) T protein:vir:95 160 EQAIPIWVDKEREELKSFIRYYKFN--------------------NEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQ 219 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEEEc--------------------CeeEEEEEeCCeEEEEEEcCCccccccccCccccc Confidence 44 5555443 57777666665421 1123444431 1 111111110000000111111 Q ss_pred eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhh-hcCC- Q lcl|Aclame:pro 226 VGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDY-QDAE- 303 (510) Q Consensus 226 ~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~-~~~~- 303 (510) .......+..+|++.++. +.+|.|=.+...+-+..+|.+.-......+....|.+++.-...-..... .... T Consensus 220 -~~~~~~~~g~iPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~ 293 (474) T protein:vir:95 220 -SHFSNGNWGRVPFIAFKN-----NPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMRGLKY 293 (474) T ss_pred -ccccccCCCccceEeecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhc Confidence 111222345788887654 46799999999999999999888888888888888776642111111111 1111 Q ss_pred CcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHH-------HHHHHHHHhhh Q lcl|Aclame:pro 304 MGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRI-------TAEEAENTLGG 374 (510) Q Consensus 304 ~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~-------r~~E~~~~LGp 374 (510) .+.+.....++++.+... .+.......++.+...|...-.. +. ....+...|+..+.. +++++...++. T Consensus 294 ~~~i~~~~~~~~~~l~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~ 371 (474) T protein:vir:95 294 YKAINVDGDGGVETIQVE--VPVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATV 371 (474) T ss_pred cceeeccCCCceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 122222233344444322 46677778888888877654322 11 111223456655432 33444444443 Q ss_pred hHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEe--ecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHH Q lcl|Aclame:pro 375 TYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPK 452 (510) Q Consensus 375 v~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~v--s~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~ 452 (510) .+.+ ++..+..++. . ......+.+.+. .+.+.+.. ++ .+...+. +.... T Consensus 372 ~l~~--------~~~li~~~~g---~-~~d~~~i~v~f~~~~p~d~~e~---a~-------~~~~~g~-------iS~et 422 (474) T protein:vir:95 372 AIQE--------LIGFIIDFNN---L-KMDVKDIEISFNFNRMMNDAEQ---SQ-------IIAQSQY-------LSRET 422 (474) T ss_pred HHHH--------HHHHHHHHhC---C-CcccceeeEEeccCCCcCHHHH---HH-------HHHhcCC-------CchHH Confidence 3332 2222222221 1 122233444442 22222211 11 1111111 22222 Q ss_pred HHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 453 MMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 453 ~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ++. .++ ++.+ ++|++...+++ +.+. ++ ++. ......+......-+ T Consensus 423 ~i~----~l~-----~v~d~~~E~~ri~~E~-~~~~-~~-~~~-~~~~~~d~~~~~~~~ 468 (474) T protein:vir:95 423 LVK----SSP-----LVDDYKAELERIEQEQ-MEYN-KQ-LPN-LDDGGADGAQQQERS 468 (474) T ss_pred HHH----hCC-----CCCCHHHHHHHHHHHH-HHHH-hc-ccc-cccccCCCCcCCCCC Confidence 232 222 1222 33443322222 1111 11 000 000001111111111 No 80 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=97.60 E-value=3.9e-05 Score=44.73 Aligned_cols=424 Identities=11% Similarity=-0.004 Sum_probs=172.7 Q ss_pred ChhH------------HHHHHHHHhccCchHHHHHHHHhhccc---ccCCCCCCccccccccccchHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MKST------------AAMLWEKLRDGSVEQRAIEFAKTTLPY---LMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLAR 65 (510) Q Consensus 1 ~k~~------------~~~r~~~lkr~~~~~~w~e~~~~~~P~---~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~ 65 (510) |.+. +.+..++.+. -..+++.+.+|.... +..... .......++-.+.+...++..++.|.+ T Consensus 5 ~~~~~~~~~~~~~~~~i~~~i~~~~~--~~~~~~~l~~Yy~g~~~i~~~~~~-~~~~~~~ki~~n~~~~Iv~~~~~~l~g 81 (499) T protein:vir:10 5 IDKDLLDDVNEPNIEAINYAIRELQN--RKKRLDKLSDYYNGKQEIEKHEFD-NATVEAANVMVNHAKYITDMNVGFMTG 81 (499) T ss_pred hhhhHHhhhhcCCHHHHHHHHHHHHH--HHHHHHHHHHHhccccchhcCCcC-cCCCCcceeecchHHHHHHHHhhhhcc Confidence 2222 2222233321 123445555554332 111111 111122344455666666666655443 Q ss_pred hhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEE--EeCCC- Q lcl|Aclame:pro 66 SLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLY--RNSDE- 142 (510) Q Consensus 66 ~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~--~~~~~- 142 (510) - |+ ++..++.. ..+ .+...+..++|.....++.++..++|.+.++ .+++. T Consensus 82 ~--p~-----~~~~~~~~-------------~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~ 134 (499) T protein:vir:10 82 N--PV-----KYVAEKGK-------------NID-------DILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDP 134 (499) T ss_pred c--Cc-----eeecCChh-------------HHH-------HHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEeccccc Confidence 1 22 22233221 111 2334466678999999999999999987654 44432 Q ss_pred -----------------CeEEEEE-eceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE- Q lcl|Aclame:pro 143 -----------------ATVVAWS-LRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH- 203 (510) Q Consensus 143 -----------------~~~~~~p-l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~- 203 (510) .++.+++ ..-|.+.-|..++....+.++..+. ..........++||+. T Consensus 135 ~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~-------------~~~~~~~~~~~~iyt~~ 201 (499) T protein:vir:10 135 ISVRDELGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKK-------------DLEGNTNGYSITVYMPQ 201 (499) T ss_pred ccccccccccccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEe-------------ecCCCceEEEEEEEeCC Confidence 1234443 3445665555554433333322110 0001111223333331 Q ss_pred -EE-eecCCCeeEEEEEEeeCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 204 -VQ-RRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELES 281 (510) Q Consensus 204 -v~-~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a 281 (510) |+ .....+ ....++........-++..+|++..+- +.+|.|=.....+-+..+|.+.-.+....... T Consensus 202 ~i~~~~~~~~------~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~ 270 (499) T protein:vir:10 202 RIVEYRTKTT------MEVSANDPIVYDGENLFGAVPIIEFRN-----NEERQGDFEQLISLIDAYNLLQTDRISDKEAF 270 (499) T ss_pred eEEEEEecCC------ccccCcceecccccCCCCccceEEecC-----CCCCCCchHhHHHHHHHHHHHHHHHHHHHHHh Confidence 00 000000 011111111112222346789887653 46799999999999999999888888888888 Q ss_pred hCCceeeCCCC-ccchhhhhcCCCcce-ec-C-CccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCC Q lcl|Aclame:pro 282 LEVLNLVDEAK-GAVVDDYQDAEMGDY-VP-G-GAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAER 355 (510) Q Consensus 282 ~~~~~lv~~~g-~~~~~~~~~~~~G~~-~~-g-~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~ 355 (510) ..|.+++.-.. ....+.......|.+ .. + ...+++.+. ...+.......++.+...|.+.-.. +. ...-+.. T Consensus 271 ~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn 348 (499) T protein:vir:10 271 VDALLVTFGFGLGDDKDDIQRLKRGAIEAPPREEGADIEWLT--KSFDETQVNLLSQSIENDIHKISYVPNMNDEKFMGN 348 (499) T ss_pred cCceeeeecCccccccchhhhhhhcceeccCCCCCCcceEEe--ccCCHHHHHHHHHHHHHHHHHHhCcccCCchhhccc Confidence 88887764211 111111111122222 21 1 122233332 2346677788888888888663321 11 1112334 Q ss_pred CCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecH--HHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 356 VTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGL--PALSRSAAVQSMLNASQ 433 (510) Q Consensus 356 vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l--~~l~r~~~~~~~~~~~q 433 (510) .|+..+..+-.-+... ..-..+.-.+.+.-++..++.++...+. ......+++.+.-.+ +.+..+.-+.+ T Consensus 349 ~Sg~Al~~~~~~l~~k-~~~k~~~~~~~l~~~~~li~~~~~~~~~-~~d~~~i~i~f~~~~p~n~~e~~~~~~k------ 420 (499) T protein:vir:10 349 VSGEAMKFKLFGLENL-LSIKQRYFFDGLRRRLKLIQTIVNIKGA-NDDASGCKISLVANIPSNLSDVVNNVKN------ 420 (499) T ss_pred chHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhccCC-ccccccceEEeCCCCCCCHHHHHHHHHH------ Confidence 5666654432221111 1112222222222233333343332221 112224455443322 22222222221 Q ss_pred HHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 434 VIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 434 ~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) +++ .+....+++.+ -+++ ..++|++...+++.. ...+... ...+.....+...+. T Consensus 421 ----l~g------~iS~et~~~~l---~~v~-----d~~~E~~ri~~E~~~-~~~~~~~---~~~~~~~~~~~~~~~ 475 (499) T protein:vir:10 421 ----ADG------IIPRKYTYSWL---PDVD-----NPQDVIDEMNQQDAE-TIKKNQE---ALRGQDPDRLELEDK 475 (499) T ss_pred ----Hhc------cCChHHHHHhC---CCCC-----CHHHHHHHHHHHHHH-HHHHHHh---hhccCCCCCCCCCCC Confidence 112 12222233221 1221 123444433222211 1111111 011111111122222 No 81 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=97.58 E-value=4.2e-05 Score=44.60 Aligned_cols=430 Identities=9% Similarity=-0.044 Sum_probs=175.7 Q ss_pred Ch-------hHHHHHHHHHhccCchHHHHHHHHhhcccc---cCCCC-CCccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 1 MK-------STAAMLWEKLRDGSVEQRAIEFAKTTLPYL---MVDPM-SGSRGVVEHDFQSAGALLVNNLAAKLARSLFP 69 (510) Q Consensus 1 ~k-------~~~~~r~~~lkr~~~~~~w~e~~~~~~P~~---~~~~~-~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltp 69 (510) .. +-+.+..+.- +..-.++.+++.+|..... ..... ........++..+.+...++..++.|.+- T Consensus 34 ~~~~~~~~~~~i~~~i~~h-~~~~~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~--- 109 (502) T protein:vir:48 34 LEELMVNNWELLKNFINHH-KLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGN--- 109 (502) T ss_pred hhhhccccHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCccccccccccccccccceeecchHHHHHHHHhhhhccc--- Confidence 00 0011111110 1111234445555544321 11110 11111122445555556666555544321 Q ss_pred ccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCCC-eEE Q lcl|Aclame:pro 70 TGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEA-TVV 146 (510) Q Consensus 70 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~--~~~~~-~~~ 146 (510) | ++++..+... ...+.+ .+.+.+..++|....+++.+++.++|.+.+++ +++.. ++. T Consensus 110 ---p-~~~~~~d~~~---------~~~~~~-------~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~ 169 (502) T protein:vir:48 110 ---P-IRVEYDDNED---------NSQNDD-------AIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDETRIK 169 (502) T ss_pred ---C-eeEecCCccc---------hhHHHH-------HHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCceEEE Confidence 1 1233333211 112222 33445777899999999999999999887555 43322 466 Q ss_pred EEEece-EEEeeCC-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCe Q lcl|Aclame:pro 147 AWSLRS-YAVRRDA-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGV 224 (510) Q Consensus 147 ~~pl~~-~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~ 224 (510) +++..+ |.+..|. .+++...+|.+.... ..+....+++|+ ++. .+++..++. T Consensus 170 ~~~p~~~~~vydd~~~~~~~~~ir~~~~~~----------------~~~~~~~~~iyt-----~~~-----i~~~~~~~~ 223 (502) T protein:vir:48 170 RLSPLETFVIYDNSLEDNSIAAVRYYNRGT----------------LQNAKDVVEIYT-----NQH-----IYTLDASDS 223 (502) T ss_pred EEcccceEEEEcCCCCCceEEEEEEEEEee----------------cCCcEEEEEEEe-----CCe-----EEEEEeCCc Confidence 665544 5555443 466766665553211 111122333432 111 122222221 Q ss_pred e-eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccc-hhhhhc- Q lcl|Aclame:pro 225 R-VGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAV-VDDYQD- 301 (510) Q Consensus 225 ~-~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~-~~~~~~- 301 (510) . ........+..+|++..+ ++..|.|-.+.+++-+..++.+.-.+.........|.+.+.-..... ...... T Consensus 224 ~~~~~~~~~~~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~ 298 (502) T protein:vir:48 224 FNEISVTPHAFGTVPITEFL-----NNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQASDM 298 (502) T ss_pred eeeccceecCCCccceEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchhhh Confidence 1 111222234578887654 34679999999999999999988888888888888876654211111 111110 Q ss_pred CCCcceec-------CCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHHHHHHHHHh Q lcl|Aclame:pro 302 AEMGDYVP-------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENTL 372 (510) Q Consensus 302 ~~~G~~~~-------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r~~E~~~~L 372 (510) ...+.+.. |....+.+-.+....+.+.....++.+.+.|...-.. +. ...-+...|+..+...-. .+... T Consensus 299 ~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~-~l~~k 377 (502) T protein:vir:48 299 KRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLF-GLDQD 377 (502) T ss_pred hhcceeeccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHH-HHHHH Confidence 01111111 1111111111222235566677778888777553221 11 111124457766654322 11122 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhcC-CCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHH Q lcl|Aclame:pro 373 GGTYSLLAENLQSPLAYVCLSEVDDAL-LQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLP 451 (510) Q Consensus 373 Gpv~~rl~~E~l~Pli~r~~~il~~~~-l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d 451 (510) .....++-.+.+.-++..++.++...+ ........+++.+.-.+ +-..+..++.+ ..+++. |.-+ T Consensus 378 ~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~-p~d~~e~a~~~-------~kl~g~------iS~e 443 (502) T protein:vir:48 378 RVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESRLKITFTPNL-PKSLYEQVSIL-------NDLGGQ------VSQE 443 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCC-CcCHHHHHHHH-------HHHhcc------CcHH Confidence 222333333333333333444443222 12222234555553322 21222222221 122221 1112 Q ss_pred HHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 452 KMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 452 ~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) .+++ .+| ++.+ ++|++...+++ ++..................-++-.++ T Consensus 444 t~l~----~l~-----~v~D~~~E~~ri~~E~-~~~~~~~~~~~~~~~~~~~~d~~~e~~ 493 (502) T protein:vir:48 444 TALS----LSG-----LVENPTEELDKINEES-SKIDFKGYPSYFYDNVGKYTDEVKETH 493 (502) T ss_pred HHHH----hCC-----CCCCHHHHHHHHHHHH-HhhhhhcccccccccccccCCCccCCC Confidence 2232 232 2222 23343332221 111101000000000001111112222 No 82 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=97.55 E-value=4.6e-05 Score=44.37 Aligned_cols=407 Identities=10% Similarity=0.027 Sum_probs=170.9 Q ss_pred ChhHHHHHHHHHh-ccCchHHHHHHHHhhcc--cccCC---CCCC-ccccccccccchHHHHHHHHHHHHHHhhcCccCc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-DGSVEQRAIEFAKTTLP--YLMVD---PMSG-SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~k~~~~~r~~~lk-r~~~~~~w~e~~~~~~P--~~~~~---~~~~-~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~ 73 (510) -++.+.+..+..+ |......++++++=.-+ .+-.. .... ......++..+.+...++..++.|.+ -|+. T Consensus 27 ~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g--~p~~-- 102 (468) T protein:vir:96 27 QEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWRMYTNYHQNLVDQKVAYAVA--NPVT-- 102 (468) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhcc--CCce-- Confidence 2233333333332 33344556666543211 11000 0000 01112244555666666666555543 2222 Q ss_pred ccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC-CeEEEEEe Q lcl|Aclame:pro 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDE-ATVVAWSL 150 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l--~~~~~~-~~~~~~pl 150 (510) ++.+++. +.+.|. ..+ ..||...+.++.++..++|.+.+ |.+++. .++.+++. T Consensus 103 ---~~~~d~~-------------~~~~l~-------~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p 158 (468) T protein:vir:96 103 ---YGTEDEK-------------SLKTIQ-------EVL-NHKWDDKLVDILTAASNKGVEWIQPYVDEQGEFKTFRVPA 158 (468) T ss_pred ---eccCChH-------------HHHHHH-------HHH-hcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEcc Confidence 2333321 222222 223 35888889999999999998874 444432 23555554 Q ss_pred ce-EEEeeCC-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE----EEeecCCCeeEEEEEEeeC-- Q lcl|Aclame:pro 151 RS-YAVRRDA-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH----VQRRKGTAMDYAEMYHEID-- 222 (510) Q Consensus 151 ~~-~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~----v~~~~~~~~~~~sv~~e~~-- 222 (510) .+ |.+..|. .|++.-.+|.++..- ...+++|+. .+...+..+- ..+..... T Consensus 159 ~~~~~v~~~~~~~~~~~~ir~~~~~~--------------------~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 217 (468) T protein:vir:96 159 EQAIPIWTNKERDELKAFIRLYELDG--------------------GERVEYWTANDVTFYELKDGQLI-PDYYQGEEHV 217 (468) T ss_pred cceEEEEcCCCCCceEEEEEEEEecC--------------------ceEEEEEeCCeEEEEEEcCCcee-eccccccccc Confidence 44 5454443 577766666654321 112222221 0111111110 00000101 Q ss_pred --CeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhh- Q lcl|Aclame:pro 223 --GVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDY- 299 (510) Q Consensus 223 --~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~- 299 (510) +..+ ....+.+..+|++.++ ++.+|.|=.+...+-+..++.+.-......+....|.+++.-...-+.... T Consensus 218 ~~~~~~-~~~~~~~~~iPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~ 291 (468) T protein:vir:96 218 QAHYYV-GNKSMSWNRVPFIPFK-----NNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDLEEFM 291 (468) T ss_pred ccceee-ccccccCCcccEEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccchhh Confidence 1111 1122334567877653 356799999999999999999888888888888888776642111111111 Q ss_pred hcC-CCcce-ecCCc-cccccccCCCccchHHHHHHHHHHHHHHHHHHhh-ccc-CCCCCCCCHHHHHHH-------HHH Q lcl|Aclame:pro 300 QDA-EMGDY-VPGGA-EAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GAN-QRDAERVTAEEVRIT-------AEE 367 (510) Q Consensus 300 ~~~-~~G~~-~~g~~-~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~-~~~~~~vTAtEi~~r-------~~E 367 (510) ... ..+.+ +++.. .+++.+... .+.+.....++.++..|...-.. +.. ...+...|+..+..+ +.+ T Consensus 292 ~~~~~~~~i~~~~d~~~~~~~l~~~--~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~ 369 (468) T protein:vir:96 292 YNLKYYKAINVDGDGSGGVDTIQID--VPVQSAKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANK 369 (468) T ss_pred hhhhcCceEEecCCCCCcceEEeec--CChHHHHHHHHHHHHHHHHHhCcccccccccccchHHHHHHHHHHHHHHHHHH Confidence 111 12222 22222 234433322 35566677788887777654321 111 122344566655432 333 Q ss_pred HHHHhhhhHHHHHHHHHHHHHHHHHHHHhh-cCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhh Q lcl|Aclame:pro 368 AENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP 446 (510) Q Consensus 368 ~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~ 446 (510) +...++..+ .+++.++.+ .+. ......+.+.+.-.+. -.-+..++ .+... + T Consensus 370 k~~~~~~~l------------~~~~~li~~~~g~-~~d~~~i~i~f~~~~p-~d~~e~a~-------~~~~~-g------ 421 (468) T protein:vir:96 370 LKNKTLTAL------------QELLQYIIDFYKL-SIKVQDVEITFNFNVM-VNELEQSQ-------IGVNS-Q------ 421 (468) T ss_pred HHHHHHHHH------------HHHHHHHHHHhCC-CcccceeeEEecCCCC-cCHHHHHH-------HHHhc-C------ Confidence 333333333 333333221 111 1222334444432221 11111111 11111 1 Q ss_pred cCCHHHHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 447 RISLPKMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNA 506 (510) Q Consensus 447 ~id~d~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~ 506 (510) .+.-..+++.+ -++ .+ ++|++...+++ ++...+ +...-+...-.++ T Consensus 422 ~iS~et~i~~l---~~v------~D~~~E~~ri~~E~-~~~~~~----~~~~~~~~~~~~~ 468 (468) T protein:vir:96 422 YLSKETVVTNH---PWV------DDPVAEMERIDQEE-LALPSI----EEGLNGKENNEPT 468 (468) T ss_pred CCchHHHHHhC---CCC------CCHHHHHHHHHHHH-HHHHHH----hhccCCCCCCCCC Confidence 12222233221 122 22 23443322221 111110 0112222222333 No 83 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=97.50 E-value=5.4e-05 Score=43.97 Aligned_cols=417 Identities=10% Similarity=-0.002 Sum_probs=174.4 Q ss_pred Ch---hHHHHHHHHHhccCchHHHHHHHHhhccc-----ccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccC Q lcl|Aclame:pro 1 MK---STAAMLWEKLRDGSVEQRAIEFAKTTLPY-----LMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~k---~~~~~r~~~lkr~~~~~~w~e~~~~~~P~-----~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~ 72 (510) .. +.+.+.-++- .....++++++.+|..-. +.... ........++..+.+...++.+++.|.+ -| T Consensus 38 ~~~~~~~i~~~i~~~-~~~~~~r~~~l~~YY~g~~~i~~~~~~~-~~~~~~~~ki~~n~~k~Ivd~~~~yl~g--~p--- 110 (512) T protein:vir:97 38 LLQNINEVSKYIEHH-MDYQRPRLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVAHDYASYISDFINGYFLG--NP--- 110 (512) T ss_pred hhhhHHHHHHHHHHH-HHhhHHHHHHHHHHhcccCccccccCcc-cccccCcceeecchHHHHHHHHhhhhcc--cC--- Confidence 11 1111111111 111223455555554321 11111 1111122345566677777777765543 11 Q ss_pred cccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCC-CeEEEEE Q lcl|Aclame:pro 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE-ATVVAWS 149 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~--~~~~-~~~~~~p 149 (510) ++++.+++. +. ..+...+..++|.....++.++..++|.+.+++ +++. .++.+++ T Consensus 111 --~~~~~~d~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~~i~~~~ 168 (512) T protein:vir:97 111 --IQCQDDDKD-------------VL-------EAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSD 168 (512) T ss_pred --ceeccCChH-------------HH-------HHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEc Confidence 122333321 11 233444667789999999999999999876544 4332 2455666 Q ss_pred ece-EEEeeCC-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCe--- Q lcl|Aclame:pro 150 LRS-YAVRRDA-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGV--- 224 (510) Q Consensus 150 l~~-~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~--- 224 (510) ..+ |++.-|. .+++...+|.++..... ....+.-..+++|+ ++.-+. |...++. T Consensus 169 p~~~~~iyd~~~~~~~~~~vr~~~~~~~~------------~~~~~~~~~~~vyt-----~~~i~~----~~~~~~~~~~ 227 (512) T protein:vir:97 169 AMSTFVIYDNTIERNSIAGVRYLRTKPID------------KTDEDEVFTVDLFT-----SHGVYR----YLTSRTNGLK 227 (512) T ss_pred ccceEEEEcCCCCCceEEEEEEEEeeecc------------ccccceEEEEEEEe-----CCcEEE----EEecCCCccc Confidence 555 4444333 36776666665431100 00011112223332 111000 1111110 Q ss_pred ---eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhc Q lcl|Aclame:pro 225 ---RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD 301 (510) Q Consensus 225 ---~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~ 301 (510) ........+++.+|++.++ ++..|+|-.+..++-+..++.+.-...........|.+++.-....++..+.. T Consensus 228 ~~~~~~~~~~~~~g~vPvv~~~-----nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~ 302 (512) T protein:vir:97 228 LTPRENGFESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRK 302 (512) T ss_pred ccccccccccccCcccceEeec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhhhh Confidence 1111222344677887654 34679999999999999999888888888888888877653222223333322 Q ss_pred CCCcceec---------------CCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHH Q lcl|Aclame:pro 302 AEMGDYVP---------------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRIT 364 (510) Q Consensus 302 ~~~G~~~~---------------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r 364 (510) ...+..+. +...++..+ ....+.......++.++..|...-+. +. ...-+...|+.-+... T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l--~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~ 380 (512) T protein:vir:97 303 QKEANVLFLEPTVYENRDTGIETEGSVDGGYI--YKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYK 380 (512) T ss_pred hhhcccccccccchhhcccccCCCCCcceEEE--eecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHH Confidence 22211110 111112222 22235566677777777777543321 11 1112234566665532 Q ss_pred HHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCc--cceeeEEee--cHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 365 AEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLIT--KQHKPAIET--GLPALSRSAAVQSMLNASQVIAGLAP 440 (510) Q Consensus 365 ~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~--~~~~~~~vs--~l~~l~r~~~~~~~~~~~q~~~~~~~ 440 (510) -. .+........+.-.+.+.-++..++.++...+-...+. ..+++.+.- +.+.++.+..+.+ +++ T Consensus 381 ~~-~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~k----------l~g 449 (512) T protein:vir:97 381 LF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYID----------SGG 449 (512) T ss_pred HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHH----------Hhc Confidence 21 11122222333333333333444444443322222222 245555543 2222222222221 112 Q ss_pred hHhHhhcCCHHHHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 441 IAQLDPRISLPKMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 441 ~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) . +....+++. ++ ++.+ ++|++...+++.. ...+... ...+...+. T Consensus 450 i------iS~et~~~~----l~-----~v~d~~~E~eri~~E~~~-~~~~~~~---------~~~~~~~~~ 495 (512) T protein:vir:97 450 K------ISQTTLMSL----FS-----FFQDPELEVKKIEEDEKE-SIKKAQK---------GIYKDPRDI 495 (512) T ss_pred c------CchHHHHHh----CC-----CCCCHHHHHHHHHHHHHH-HHHHHhh---------cccCCCCCC Confidence 1 111222222 22 1222 3444432222211 1111100 011111111 No 84 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=97.50 E-value=5.5e-05 Score=43.96 Aligned_cols=423 Identities=12% Similarity=0.019 Sum_probs=164.0 Q ss_pred ChhHHHHHHHHHh-ccCchHHHHHHHHhhccc-----ccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-DGSVEQRAIEFAKTTLPY-----LMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPF 74 (510) Q Consensus 1 ~k~~~~~r~~~lk-r~~~~~~w~e~~~~~~P~-----~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~W 74 (510) ....+.+.|..+. +. .+.+.+.+|..-. +.............+...+-+..+++.++..| +|.+ T Consensus 28 ~~~l~~~l~~~~~~~~---~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~l----~~~g--- 97 (501) T protein:vir:25 28 LGALVADMWRLHISER---QWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQNL----SVVG--- 97 (501) T ss_pred HHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHhhh----cccc--- Confidence 3333444454443 22 2344444443321 10000000000011122345555555555543 3433 Q ss_pred cccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCCCeEEEEEece Q lcl|Aclame:pro 75 FRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEATVVAWSLRS 152 (510) Q Consensus 75 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~--~~~~~~~~~~pl~~ 152 (510) |++ +|... .+. +.+....++|....+++.++..++|.+.+++ +++...+++++-.+ T Consensus 98 f~~--~d~~~---------~~~-----------l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~~~i~~~sp~~ 155 (501) T protein:vir:25 98 YRN--ALAKE---------NDP-----------AWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEGPVFRTRSPRQ 155 (501) T ss_pred eec--CCccc---------hHH-----------HHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCCeEEEecccc Confidence 332 22111 111 1233567889999999999999999987554 44444677776544 Q ss_pred -EEEeeCCC--CceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEE--EEEeecCCCe-------eEEEEEEe Q lcl|Aclame:pro 153 -YAVRRDAT--GRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYT--HVQRRKGTAM-------DYAEMYHE 220 (510) Q Consensus 153 -~~v~~d~~--G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~--~v~~~~~~~~-------~~~sv~~e 220 (510) +++..|+. .++.-.++.+...- ..+....+++|. +++.-...+. ..++.. . T Consensus 156 ~~~iy~D~~~~~~~~~ai~~~~~~~----------------~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 218 (501) T protein:vir:25 156 ILAVYADPSVDAWPQYALETWVAQK----------------DAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQ-P 218 (501) T ss_pred EEEEEecCCCCcceeEEEEEEeecc----------------ccCcceeEEEecCeeEEEEecCceeeeeccccccccc-c Confidence 55666644 23544444332111 011112222221 2221111000 000000 0 Q ss_pred eCCeee-----ccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCc-c Q lcl|Aclame:pro 221 IDGVRV-----GETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKG-A 294 (510) Q Consensus 221 ~~~~~~-----~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~-~ 294 (510) ..+..+ ...+..++..||++.+.=+.. .+.+|+|=.+..++-+..+|...-.++..+...+.|...+. |+ . T Consensus 219 ~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~-~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~--G~~~ 295 (501) T protein:vir:25 219 VNVREVTDVIEHGATFEGKPVCPVVRFVNGRD-ADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVIS--GWTG 295 (501) T ss_pred ccccccccccccccccCCccceeeEeccCccc-cCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHh--CCCC Confidence 011111 111222345788887554443 35689998888888888999888888888877777644432 22 1 Q ss_pred chhhhhcCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh--cccCCCCCCCCHHHHH-------HHH Q lcl|Aclame:pro 295 VVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY--GANQRDAERVTAEEVR-------ITA 365 (510) Q Consensus 295 ~~~~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~--~~~~~~~~~vTAtEi~-------~r~ 365 (510) ...+......|.+..-...+++..++. .++++.....++.+-..|...=-. ..+.....+.++.-+. .++ T Consensus 296 ~~~~~~~~~~~~i~~~~~~~~~~~q~~-~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~l~~ka 374 (501) T protein:vir:25 296 SKAEVLKASALRVWTFEDPEVKAQAFP-PASVEPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAEALAAAEANQQRKL 374 (501) T ss_pred CccchhhhcccceeccCCCCceEEEec-ccChHHHHHHHHHHHHHHHhhcCCChhhhccccCChHHHHHHHHHHHHHHHH Confidence 122222223333322111122222332 235554444444444433221100 0111112233555443 334 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHh Q lcl|Aclame:pro 366 EEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLD 445 (510) Q Consensus 366 ~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~ 445 (510) +.+...+|..+.+ ++..++.+.. ..-+.....+++.+..+. +-..++.++.+..+.+ . +.+. T Consensus 375 ~~k~~~f~~~l~~--------~~rl~~~~~~--~~~~~~~~~i~v~w~~~~-~~s~~~~ada~~kl~~---~--gis~-- 436 (501) T protein:vir:25 375 AAKRESFGESWEQ--------LLRLAAEMDD--DPDTAADSGAEVLWRDTE-ARSFGAVVDGITKLAS---A--GIPI-- 436 (501) T ss_pred HHHHHHHHHHHHH--------HHHHHHHHhC--CCccccceeeeEEecCCC-CCCHHHHHHHHHHHHh---c--CCCH-- Confidence 4444555554443 1222222222 111111122344332222 2122222222222211 1 1111 Q ss_pred hcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------hhcccCCC Q lcl|Aclame:pro 446 PRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASD----------MTNALAGV 510 (510) Q Consensus 446 ~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~----------~~~~~ag~ 510 (510) .. .+....|+++ ++++...++++++.+.....+ ....+..+ ...+-+|+ T Consensus 437 -----et---~~~~~~g~~~-------~~ie~~~~~~~e~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (501) T protein:vir:25 437 -----EH---LLSMVPGMTQ-------QTIQAIKDSLRGGEVKSLVDK-LLSNEPAPVPPPPPQAAAQALNEGGV 495 (501) T ss_pred -----HH---HHHHcCCCCH-------HHHHHHHHHHHHHhHHHHHHH-hhccCcCCCCCCCCCCCccccccccC Confidence 11 1334456653 333322222222111111000 00000000 01111222 No 85 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=97.50 E-value=5.6e-05 Score=43.90 Aligned_cols=424 Identities=11% Similarity=-0.000 Sum_probs=178.3 Q ss_pred ChhHHHHHHHHHh---------------------------ccCchHHHHHHHHhhccc---ccCCCC-CCcccccccccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR---------------------------DGSVEQRAIEFAKTTLPY---LMVDPM-SGSRGVVEHDFQ 49 (510) Q Consensus 1 ~k~~~~~r~~~lk---------------------------r~~~~~~w~e~~~~~~P~---~~~~~~-~~~~~~~~~~~d 49 (510) +..++..+|+.-. ......+++++.+|..-. ...... ........++.. T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~ 92 (511) T protein:vir:99 13 LRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAH 92 (511) T ss_pred hhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeec Confidence 2222222222211 111122444455553321 000011 111112234566 Q ss_pred chHHHHHHHHHHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHH Q lcl|Aclame:pro 50 SAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLI 129 (510) Q Consensus 50 stg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~ 129 (510) +.+...+++.++.|.+ -|+ +++.+++. +. ..+...+..++|.....++.++.. T Consensus 93 n~~k~Iv~~~~~yl~g--~p~-----~~~~~d~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~~ 145 (511) T protein:vir:99 93 DYASYISDFINGYFLG--NPI-----QYQDDDKD-------------VL-------EAIEAFNDLNDVESHNRSLGLDLS 145 (511) T ss_pred chHHHHHHHHHhhhcc--cCc-----eeecCchH-------------HH-------HHHHHHHhhcCHhHHHHHHHHHHH Confidence 6777777776665543 122 12333321 11 233445667789999999999999 Q ss_pred hhCceEEEE--eCCC-CeEEEEEeceEEEeeCC--CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE- Q lcl|Aclame:pro 130 VTGNALLYR--NSDE-ATVVAWSLRSYAVRRDA--TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH- 203 (510) Q Consensus 130 ~~G~~~l~~--~~~~-~~~~~~pl~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~- 203 (510) ++|.+.+++ +++. .++.+++..+.++..|. .+++...+|.+.....+ ....+.-..+++|+. T Consensus 146 i~G~a~~~vy~ded~~~~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~~------------~~~~~~~~~~~vyt~~ 213 (511) T protein:vir:99 146 IYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID------------KTDEDEVFTVDLFTSH 213 (511) T ss_pred hcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc------------cCccceEEEEEEEeCC Confidence 999876554 4332 24666666555444443 36776666665432100 000111112333321 Q ss_pred -EE-eecCCCeeEEEEEEeeCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 204 -VQ-RRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELES 281 (510) Q Consensus 204 -v~-~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a 281 (510) ++ .+..++.++.. .... ......++..+|++..+- +..|+|-.+..++-+..++.+.-......... T Consensus 214 ~i~~~~~~~~~~~~~-----~~~~-~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~ 282 (511) T protein:vir:99 214 GVYRYLTSRTNGLKL-----TPRE-NGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDL 282 (511) T ss_pred cEEEEEecCCccccc-----cccc-cccccCCCCccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHh Confidence 10 00001000000 0011 111222345788877654 35799999999999999999888888887777 Q ss_pred hCCceeeCCCCccchhhhhcCCCccee--------------cCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh- Q lcl|Aclame:pro 282 LEVLNLVDEAKGAVVDDYQDAEMGDYV--------------PGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY- 346 (510) Q Consensus 282 ~~~~~lv~~~g~~~~~~~~~~~~G~~~--------------~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~- 346 (510) ..|.+++.-.+......+.....+... .+...+++.+. ...+.+.....++.+++.|...-+. T Consensus 283 ~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~--~~~~~~~~e~~~~~L~~~I~~~s~~P 360 (511) T protein:vir:99 283 NDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMFTNTP 360 (511) T ss_pred hchhhhhccCcccCchhhcccccccceecccccccccccccCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCCc Confidence 777666532222222222211111111 11112233222 2235566677777777777543322 Q ss_pred cc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCcc--ceeeEEeecHHHHHHHH Q lcl|Aclame:pro 347 GA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITK--QHKPAIETGLPALSRSA 423 (510) Q Consensus 347 ~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~--~~~~~~vs~l~~l~r~~ 423 (510) +. ...-+...|+..+..+-. .+........+.-.+.+.-+++.++.++...+-...+.+ .+++.+.-.+ +-..+. T Consensus 361 ~~~~~~~~gn~Sg~Alk~~~~-~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~-p~n~~e 438 (511) T protein:vir:99 361 NMKDDNFSGTQSGEAMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNL-PKSLIE 438 (511) T ss_pred ccccccccccchHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCC-CcCHHH Confidence 11 111223456665544422 222223333343344444344444454543332222222 3455553322 111122 Q ss_pred HHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 424 AVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASD 502 (510) Q Consensus 424 ~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~ 502 (510) .++.+ ..+++. +....+++. + | ++.+ ++|++...+++ +.+..++. .. T Consensus 439 ~~~~~-------~kl~Gi------iS~et~l~~----l--~---~v~D~~~E~~ri~~E~-~~~~~~~~---------~~ 486 (511) T protein:vir:99 439 ELKAY-------IDSGGK------ISQTTLMSL----F--S---FFQDPELEVKKIEEDE-KESIKKAQ---------KN 486 (511) T ss_pred HHHHH-------HHHhcc------CCHHHHHHh----C--C---CCCCHHHHHHHHHHHH-HHHHHHHh---------hc Confidence 22211 111121 222233332 2 1 2222 33443332222 11111110 11 Q ss_pred hhcccCCC Q lcl|Aclame:pro 503 MTNALAGV 510 (510) Q Consensus 503 ~~~~~ag~ 510 (510) ..+...++ T Consensus 487 ~~~~~~~~ 494 (511) T protein:vir:99 487 MYQDPRNI 494 (511) T ss_pred ccccCCCC Confidence 11222222 No 86 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=97.47 E-value=6.1e-05 Score=43.68 Aligned_cols=412 Identities=12% Similarity=0.044 Sum_probs=156.7 Q ss_pred ChhHHH-HHHHHHh-ccCchHHHHHHHHhhccc-----ccCCCCCCccccccc-cccchHHHHHHHHHHHHHHhhcCccC Q lcl|Aclame:pro 1 MKSTAA-MLWEKLR-DGSVEQRAIEFAKTTLPY-----LMVDPMSGSRGVVEH-DFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~k~~~~-~r~~~lk-r~~~~~~w~e~~~~~~P~-----~~~~~~~~~~~~~~~-~~dstg~~a~~~Laa~l~~~ltpp~~ 72 (510) +++.+. ..+.++. +.+ +++.+.+|..-. +.........+++.+ +.-+-+...|+.+++.| +|.+ T Consensus 14 ~~~~~~~~l~~~~~~~~~---r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l----~~~g- 85 (479) T protein:vir:99 14 LAKYLETKVFPKMNTECE---RLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQQL----IVDG- 85 (479) T ss_pred HHHHHHHHHHHHHHHHhH---HHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHHhhc----cccc- Confidence 444343 2334442 222 333344443221 111000111111111 12344555555555433 4444 Q ss_pred cccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeC-----C-CC--e Q lcl|Aclame:pro 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS-----D-EA--T 144 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~-----~-~~--~ 144 (510) |+ .+|... .+.+ .+.+..++|....++++++..++|.+.+++.. + .+ + T Consensus 86 --f~--~~d~~~---------~~~~-----------~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~~ 141 (479) T protein:vir:99 86 --YR--KTGTNE---------NAKG-----------WDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVAR 141 (479) T ss_pred --cc--CCCchh---------hHHH-----------HHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCceE Confidence 33 222211 1112 23345678999999999999999998776642 1 12 3 Q ss_pred EEEEEece-EEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCC Q lcl|Aclame:pro 145 VVAWSLRS-YAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDG 223 (510) Q Consensus 145 ~~~~pl~~-~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~ 223 (510) +++++..+ +++..|+......+|. ++. +.+..+.+|+- +. +.+|....| T Consensus 142 i~~~~p~~~~~iydd~~~~~~~~~~---~~~------------------~~~~~~~~~~~----~~-----~~~~~~~~~ 191 (479) T protein:vir:99 142 IKCIDPRDAFAIWEDPYWDEWPKYL---LER------------------QPNGQYWWWTE----ED-----YSIFEFKQG 191 (479) T ss_pred EEEechhheEEEecCCcccceeeEE---Eee------------------cCceeEEEEec----ce-----EEEEEecCC Confidence 56665544 4455454332222221 111 11222222210 00 001111111 Q ss_pred ee-eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchh----- Q lcl|Aclame:pro 224 VR-VGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD----- 297 (510) Q Consensus 224 ~~-~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~----- 297 (510) .. ........+..+|++.++-+...+ .+|+|=.+..++-+..++...-.+...++..+.|.+.+. |...++ T Consensus 192 ~~~~~~~~~h~~g~vPvv~f~n~~~~~-~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~ 268 (479) T protein:vir:99 192 KFIYRETVSHDYGHIPFVRYVNVMDLR-GVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWAT--GLMLPEGANAD 268 (479) T ss_pred ceeeccccccCCCCcceEEeecCCCcC-cCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhc--CCCcccccccc Confidence 11 111111223579999988777664 589998889999999999888888888887777764442 221111 Q ss_pred --hhhcCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcccCC----CCCCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 298 --DYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQR----DAERVTAEEVRITAEEAENT 371 (510) Q Consensus 298 --~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~----~~~~vTAtEi~~r~~E~~~~ 371 (510) ......++.+... ..++...++. .++++ ..++.++.-|...+....... ...+.|+.-++....-+... T Consensus 269 ~~~~~~~~~~i~~~~-~~~~~~~q~~-~~~~~---~~~~~l~~~i~~i~~~t~~p~~~~g~~~n~Sg~Al~~~~~~l~~k 343 (479) T protein:vir:99 269 QEKMRFAQESMLISQ-NEKASFGAIP-AAPLD---GLLNAYKESLLEFLALAQLPPHIAGQIVNVAADALAAGTRQTMQK 343 (479) T ss_pred hhccccccccceeec-CCCceEEEec-ccchH---HHHHHHHHHHHHHhccCCCCHHHcccccchHHHHHHHHHHHHHHH Confidence 1111112222211 1223333333 23343 333444444443332221111 12234555444332222211 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhh-cCCCCCCccceeeEEe-ecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCC Q lcl|Aclame:pro 372 LGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIE-TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRIS 449 (510) Q Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~p~~~~~~~~v-s~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id 449 (510) .++.+. .+.+-+.+++.++.. .+... +.+...++++ .....-..++.++.+....+ + +.. - T Consensus 344 ----a~~~~~-~f~~al~~~~~l~~~~~~~~~-~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~---a-g~i-------s 406 (479) T protein:vir:99 344 ----LFEKQA-TWKASHNQTMRLVNKIEGRTE-EATDLDFTITWQDVTIQSLAQFADAWAKMVE---S-LKI-------P 406 (479) T ss_pred ----HHHHHH-HHHHHHHHHHHHHHHHcCCCc-cccceeeeEEecCCCCCCHHHHHHHHHHHHh---c-CCC-------C Confidence 112222 222233333433221 12221 2222333322 11111111222222222111 1 111 1 Q ss_pred HHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHhh----------------cccCCC Q lcl|Aclame:pro 450 LPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGA--SDMT----------------NALAGV 510 (510) Q Consensus 450 ~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a--~~~~----------------~~~ag~ 510 (510) .+.+ +....|++. ++++..++.++++..+.+.+. .+..+. .++. +.+|+| T Consensus 407 ~et~---l~~l~gv~~-------~~~e~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 474 (479) T protein:vir:99 407 AEGV---WDMIPNLDQ-------STVNGWKEIYDREGDFGKYMR-KLQNGPDPAEQRGGPNGATNMQQANNKTGEPASL 474 (479) T ss_pred HHHH---HHhcCCCCH-------HHHHHHHHHHHHHHHHHHHHH-HHhcccCcccccCCCCCCCCCCCCCCCCcchhcc Confidence 1222 222236653 233222111111111111111 111110 1111 122222 No 87 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=97.46 E-value=6.3e-05 Score=43.60 Aligned_cols=464 Identities=13% Similarity=0.056 Sum_probs=198.2 Q ss_pred ChhHHHHHHHHHh------ccCchH-----H---HHHHHHhhcccccCCC-CCCccccccccccchHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEKLR------DGSVEQ-----R---AIEFAKTTLPYLMVDP-MSGSRGVVEHDFQSAGALLVNNLAAKLAR 65 (510) Q Consensus 1 ~k~~~~~r~~~lk------r~~~~~-----~---w~e~~~~~~P~~~~~~-~~~~~~~~~~~~dstg~~a~~~Laa~l~~ 65 (510) |-. =.++|.-=+ ...|.+ | .+.+.+|..=.-..-. -..+.+ ...+++..|..-|++++.-| T Consensus 1 m~~-~~~q~~p~~~~fp~~~a~wV~~~D~~RlaaY~ly~d~y~n~~~el~~il~G~d-r~~~~~ps~r~~V~~~~~~L-- 76 (563) T protein:vir:74 1 MPY-NHKQYDPAKPFLRGGDDNIVDENDKNRVRAYDLYENIYLNSAETLKLVLRGDD-SVPILMPSGRKIVEAVHRFL-- 76 (563) T ss_pred CCc-cccccCCCcccccccccccCCHHHHHHHHHHHHHHHhhcCchhhhhhhcCCCc-eeeeccchHHHHHHHHHHhc-- Confidence 110 011111000 011111 1 2222232221100000 011222 23568888888888855443 Q ss_pred hhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--CC-- Q lcl|Aclame:pro 66 SLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SD-- 141 (510) Q Consensus 66 ~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~--~~-- 141 (510) +....|+- +.....+ ..... +++.+....++-|+.....++-.+..+.|-+++++- ++ T Consensus 77 ---g~~~~~~V---e~~~~de-----~~~~a-------vq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~ 138 (563) T protein:vir:74 77 ---GVGFDYLV---EPDMGDE-----GIRQS-------LNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKK 138 (563) T ss_pred ---CCCcEEec---CccccCc-----chHHH-------HHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccc Confidence 43444542 2221111 11112 456666678889999999999999999998886653 21 Q ss_pred -CCeEEEEE--eceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcc---cccCCCCceEEEEEEE----E-ee--- Q lcl|Aclame:pro 142 -EATVVAWS--LRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRA---GRNLSGSGSVDLYTHV----Q-RR--- 207 (510) Q Consensus 142 -~~~~~~~p--l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~---~~~~~~~~~v~v~~~v----~-~~--- 207 (510) .++.++.+ .+.|+-..|++. |-.+|-..-.+...++++..+.+-+- ....+++. ...+|| + +. T Consensus 139 ~g~R~rv~~vDP~~~fp~~dpd~-v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg--~~~~~~~~dae~w~lg~ 215 (563) T protein:vir:74 139 AGERISVDEVDPRQIFLIEDGST-VVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEG--MFTGRISSELTHWTLGN 215 (563) T ss_pred cCCCceEeecCCceeeeccCCCC-cccceeeecccCCCCCcchhccceeeeeeeeeeCCCC--Cccceeeeccchhcccc Confidence 23555544 467777777744 55555322222222333333222111 11112221 111122 1 11 Q ss_pred -cCCCeeEEEEEEeeCCeee----cccc--ccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 208 -KGTAMDYAEMYHEIDGVRV----GETG--RWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELE 280 (510) Q Consensus 208 -~~~~~~~~sv~~e~~~~~~----~~~~--~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~ 280 (510) +++.+.-.++--+.++... +++. --+..-.||+.++=...++++||+|-..+.+.-++.||.-....-..+.. T Consensus 216 wd~r~~~~~~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~ 295 (563) T protein:vir:74 216 WDDRGAISDEQARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVF 295 (563) T ss_pred ccccCccchhhhcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHh Confidence 2222221122123333211 1110 00112368888777778899999999999999999999765555555555 Q ss_pred hhCCceeeCCCCccchhhhh------cCCCccee--cCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-c-- Q lcl|Aclame:pro 281 SLEVLNLVDEAKGAVVDDYQ------DAEMGDYV--PGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-N-- 349 (510) Q Consensus 281 a~~~~~lv~~~g~~~~~~~~------~~~~G~~~--~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~-- 349 (510) .=.|+...+ +....+... +.++|.+. ++..+.=....++...+++.++..++++..| +..-.. . T Consensus 296 tG~pi~vl~--~~~p~d~~~g~~~~w~vgpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~er---al~~~s~tPa 370 (563) T protein:vir:74 296 QGLGMYVTN--ASAPVDPNTGELTDWNIGPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEK---GIAEGSGTPE 370 (563) T ss_pred cCCCeEEec--cccccccccccccccccCCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHH---HHHhhccCcc Confidence 545655443 222222111 11233332 1111111223444455777777777776642 221110 1 Q ss_pred ----CCCCCCC---CHHH-----HHHHHHHHHHHhhhhHHHHHHHHH---HHHHHHHHHHHhh---cCCCCCCccceeeE Q lcl|Aclame:pro 350 ----QRDAERV---TAEE-----VRITAEEAENTLGGTYSLLAENLQ---SPLAYVCLSEVDD---ALLQGLITKQHKPA 411 (510) Q Consensus 350 ----~~~~~~v---TAtE-----i~~r~~E~~~~LGpv~~rl~~E~l---~Pli~r~~~il~~---~~l~~~p~~~~~~~ 411 (510) .-|..++ .|=| +-.+.+||+..|=.++-+.-.++. .|..+|.+-.-.- .|.-++|. ...+. T Consensus 371 vA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~-~~~v~ 449 (563) T protein:vir:74 371 VAIGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLN-ECSVV 449 (563) T ss_pred eeecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCC-ceEEE Confidence 1122221 2222 234455555544444444322222 2222222211110 12222222 22333 Q ss_pred Ee-ecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 412 IE-TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQ 490 (510) Q Consensus 412 ~v-s~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~ 490 (510) ++ .+.-|.-+++-++.+....+ ++ -+-...+++.+.+. |.|- =-.++|+++..-+.-+++..|+ T Consensus 450 ivf~p~~P~d~~~vv~~~~tl~~-----aG------iiSretAv~~L~~~-g~~~---pdae~e~~~ie~~~i~~~~~a~ 514 (563) T protein:vir:74 450 CIFADPMPVNKTQVTQDTLLLQQ-----AH------LILRKMAVAKLRSI-GWEY---PEVDDQGNALTDDDIADMLLAE 514 (563) T ss_pred EEeCCCCCccHHHHHHHHHHHHH-----cC------chhHHHHHHHHHhC-CCCC---CcHHHHHhhcCHHHHHHHHHHH Confidence 33 44556677776666655444 12 24455667777776 6552 1124555444333333322222 Q ss_pred HHH-HHHHHHHHHhhcccCCC Q lcl|Aclame:pro 491 AAQ-ETLLEGASDMTNALAGV 510 (510) Q Consensus 491 ~a~-~~~~~~a~~~~~~~ag~ 510 (510) +.. +.-..+|+.-+|..-+- T Consensus 515 a~ad~~~~~~a~~~~g~~~~~ 535 (563) T protein:vir:74 515 AEADASLGLSAMDNGGAGEQQ 535 (563) T ss_pred hhccCcccceecccCCCCccc Confidence 111 11111122111111111 No 88 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=97.31 E-value=9.7e-05 Score=42.58 Aligned_cols=411 Identities=12% Similarity=0.035 Sum_probs=169.5 Q ss_pred ChhHHHHHHHHHh-ccCchHHHHHHHHhh--cccc-cCCCCCC--c-cccccccccchHHHHHHHHHHHHHHhhcCccCc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-DGSVEQRAIEFAKTT--LPYL-MVDPMSG--S-RGVVEHDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~k~~~~~r~~~lk-r~~~~~~w~e~~~~~--~P~~-~~~~~~~--~-~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~ 73 (510) .++.+.+..+..+ |-+...+..+++.-. ++.+ ...+... . .....++..+-+..-++..++.|.+ -| T Consensus 28 ~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g--~p---- 101 (474) T protein:vir:95 28 QEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAG--KP---- 101 (474) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcc--cC---- Confidence 2222222222221 212222333333321 1111 0001100 0 1112244555666666666655543 12 Q ss_pred ccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCC-CeEEEEEe Q lcl|Aclame:pro 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE-ATVVAWSL 150 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~--~~~~-~~~~~~pl 150 (510) .+++.+++. +.+.| ...+ .++|.....++.++...+|.+.+++ +++. .++.+++. T Consensus 102 -~~~~~~~~~-------------~~~~l-------~~~~-~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p 159 (474) T protein:vir:95 102 -VTYAHDDDK-------------VLDVI-------HQVL-DTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPA 159 (474) T ss_pred -ceeccCChH-------------HHHHH-------HHHH-hccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcc Confidence 223333321 11111 1122 3689999999999999999987554 4332 23556665 Q ss_pred ceEEEeeC-C-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE--E-EeecCCCeeEEEEEEeeCCee Q lcl|Aclame:pro 151 RSYAVRRD-A-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH--V-QRRKGTAMDYAEMYHEIDGVR 225 (510) Q Consensus 151 ~~~~v~~d-~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~--v-~~~~~~~~~~~sv~~e~~~~~ 225 (510) .+.++..| . .+++.-.+|.++.. ....+++|+. | +.....+. +.......+... T Consensus 160 ~~~~~v~d~~~~~~~~a~ir~~~~~--------------------~~~~~~vy~~~~i~~~~~~~~~-~~~~~~~~~~~~ 218 (474) T protein:vir:95 160 EQAIPIWTDKEREQLNAFIRIFTFN--------------------GETKVEYWTAETVTYYVYENGG-LIPDFYYGDEHI 218 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEeec--------------------CeeEEEEEeCCeEEEEEEcCCc-eeeccccccccc Confidence 55444443 3 47777666665421 1123455541 1 11111111 111111111111 Q ss_pred eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhc--CC Q lcl|Aclame:pro 226 VGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD--AE 303 (510) Q Consensus 226 ~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~--~~ 303 (510) .......++..+|++..+. +.+|.|=.+..++-+..++.+.-......+....|.+++.-....+...... .. T Consensus 219 ~~~~~~~~~~~vPvv~~~n-----n~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~ 293 (474) T protein:vir:95 219 QTHFSTGSWERVPFIAFKN-----NPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEGLKY 293 (474) T ss_pred cCcccccCCCccceEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhhhhc Confidence 1122223446788886653 4679999999999999999888888888888888876653211111111111 11 Q ss_pred CcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHH-------HHHHHHHHHhhh Q lcl|Aclame:pro 304 MGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVR-------ITAEEAENTLGG 374 (510) Q Consensus 304 ~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~-------~r~~E~~~~LGp 374 (510) .+.+..+...++..+.. ..+.......++.++..|...-.. +. ....+...|+.-+. .++.++...++. T Consensus 294 ~~~i~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~ 371 (474) T protein:vir:95 294 YKAINVSSDGGVETIQV--EVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANV 371 (474) T ss_pred cceeeccCCCceeEEec--cCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22333333344544432 245667778888888777654322 11 11122334554443 222333333333 Q ss_pred hHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEee--cHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHH Q lcl|Aclame:pro 375 TYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIET--GLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPK 452 (510) Q Consensus 375 v~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs--~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~ 452 (510) .+.+ ++..++.++ +. ......+.+.+.. +.+.+..++ .+...+. +.-.. T Consensus 372 ~l~~--------~~~~i~~~~---g~-~~d~~~i~i~f~~~~p~~~~e~a~----------~~~~~gi-------iS~et 422 (474) T protein:vir:95 372 ALQE--------LMQFILDFN---KI-KLDAKEIEITFNFNVMVNDLEQSQ----------IGAQSQY-------LSKET 422 (474) T ss_pred HHHH--------HHHHHHHHh---CC-CcccceeeEEecCCCccCHHHHHH----------HHHHcCC-------CChHH Confidence 3332 222222222 11 1223345555432 222222221 1111111 22222 Q ss_pred HHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhhcccCCC Q lcl|Aclame:pro 453 MMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGA-SDMTNALAGV 510 (510) Q Consensus 453 ~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a-~~~~~~~ag~ 510 (510) ++. .++ ++.+ ++|++...+++ .++. .+ ++. ..+. ..-.....+- T Consensus 423 ~~~----~lp-----~v~D~~~E~eri~~E~-~~~~-~~-~~~--~~~~~~~~~~~~~~~ 468 (474) T protein:vir:95 423 LVR----HHP-----WVDDPKAELERLDEEQ-LELN-KQ-LPN--LDDGGADGAQQQQQS 468 (474) T ss_pred HHH----hCC-----CCCCHHHHHHHHHHHH-HHHH-hh-ccc--cccccCCCCCCcCCC Confidence 222 222 2222 33443322222 1111 11 000 1000 0000011111 No 89 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=97.31 E-value=9.7e-05 Score=42.58 Aligned_cols=411 Identities=12% Similarity=0.035 Sum_probs=169.5 Q ss_pred ChhHHHHHHHHHh-ccCchHHHHHHHHhh--cccc-cCCCCCC--c-cccccccccchHHHHHHHHHHHHHHhhcCccCc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-DGSVEQRAIEFAKTT--LPYL-MVDPMSG--S-RGVVEHDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~k~~~~~r~~~lk-r~~~~~~w~e~~~~~--~P~~-~~~~~~~--~-~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~ 73 (510) .++.+.+..+..+ |-+...+..+++.-. ++.+ ...+... . .....++..+-+..-++..++.|.+ -| T Consensus 28 ~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g--~p---- 101 (474) T protein:vir:96 28 QEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAG--KP---- 101 (474) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcc--cC---- Confidence 2222222222221 212222333333321 1111 0001100 0 1112244555666666666655543 12 Q ss_pred ccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCC-CeEEEEEe Q lcl|Aclame:pro 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE-ATVVAWSL 150 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~--~~~~-~~~~~~pl 150 (510) .+++.+++. +.+.| ...+ .++|.....++.++...+|.+.+++ +++. .++.+++. T Consensus 102 -~~~~~~~~~-------------~~~~l-------~~~~-~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p 159 (474) T protein:vir:96 102 -VTYAHDDDK-------------VLDVI-------HQVL-DTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPA 159 (474) T ss_pred -ceeccCChH-------------HHHHH-------HHHH-hccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcc Confidence 223333321 11111 1122 3689999999999999999987554 4332 23556665 Q ss_pred ceEEEeeC-C-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE--E-EeecCCCeeEEEEEEeeCCee Q lcl|Aclame:pro 151 RSYAVRRD-A-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH--V-QRRKGTAMDYAEMYHEIDGVR 225 (510) Q Consensus 151 ~~~~v~~d-~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~--v-~~~~~~~~~~~sv~~e~~~~~ 225 (510) .+.++..| . .+++.-.+|.++.. ....+++|+. | +.....+. +.......+... T Consensus 160 ~~~~~v~d~~~~~~~~a~ir~~~~~--------------------~~~~~~vy~~~~i~~~~~~~~~-~~~~~~~~~~~~ 218 (474) T protein:vir:96 160 EQAIPIWTDKEREQLNAFIRIFTFN--------------------GETKVEYWTAETVTYYVYENGG-LIPDFYYGDEHI 218 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEeec--------------------CeeEEEEEeCCeEEEEEEcCCc-eeeccccccccc Confidence 55444443 3 47777666665421 1123455541 1 11111111 111111111111 Q ss_pred eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhc--CC Q lcl|Aclame:pro 226 VGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD--AE 303 (510) Q Consensus 226 ~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~--~~ 303 (510) .......++..+|++..+. +.+|.|=.+..++-+..++.+.-......+....|.+++.-....+...... .. T Consensus 219 ~~~~~~~~~~~vPvv~~~n-----n~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~ 293 (474) T protein:vir:96 219 QTHFSTGSWERVPFIAFKN-----NPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEGLKY 293 (474) T ss_pred cCcccccCCCccceEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhhhhc Confidence 1122223446788886653 4679999999999999999888888888888888876653211111111111 11 Q ss_pred CcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHH-------HHHHHHHHHhhh Q lcl|Aclame:pro 304 MGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVR-------ITAEEAENTLGG 374 (510) Q Consensus 304 ~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~-------~r~~E~~~~LGp 374 (510) .+.+..+...++..+.. ..+.......++.++..|...-.. +. ....+...|+.-+. .++.++...++. T Consensus 294 ~~~i~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~ 371 (474) T protein:vir:96 294 YKAINVSSDGGVETIQV--EVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANV 371 (474) T ss_pred cceeeccCCCceeEEec--cCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22333333344544432 245667778888888777654322 11 11122334554443 222333333333 Q ss_pred hHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEee--cHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHH Q lcl|Aclame:pro 375 TYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIET--GLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPK 452 (510) Q Consensus 375 v~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs--~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~ 452 (510) .+.+ ++..++.++ +. ......+.+.+.. +.+.+..++ .+...+. +.-.. T Consensus 372 ~l~~--------~~~~i~~~~---g~-~~d~~~i~i~f~~~~p~~~~e~a~----------~~~~~gi-------iS~et 422 (474) T protein:vir:96 372 ALQE--------LMQFILDFN---KI-KLDAKEIEITFNFNVMVNDLEQSQ----------IGAQSQY-------LSKET 422 (474) T ss_pred HHHH--------HHHHHHHHh---CC-CcccceeeEEecCCCccCHHHHHH----------HHHHcCC-------CChHH Confidence 3332 222222222 11 1223345555432 222222221 1111111 22222 Q ss_pred HHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhhcccCCC Q lcl|Aclame:pro 453 MMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGA-SDMTNALAGV 510 (510) Q Consensus 453 ~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a-~~~~~~~ag~ 510 (510) ++. .++ ++.+ ++|++...+++ .++. .+ ++. ..+. ..-.....+- T Consensus 423 ~~~----~lp-----~v~D~~~E~eri~~E~-~~~~-~~-~~~--~~~~~~~~~~~~~~~ 468 (474) T protein:vir:96 423 LVR----HHP-----WVDDPKAELERLDEEQ-LELN-KQ-LPN--LDDGGADGAQQQQQS 468 (474) T ss_pred HHH----hCC-----CCCCHHHHHHHHHHHH-HHHH-hh-ccc--cccccCCCCCCcCCC Confidence 222 222 2222 33443322222 1111 11 000 1000 0000011111 No 90 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=97.29 E-value=0.00011 Score=42.38 Aligned_cols=415 Identities=11% Similarity=0.005 Sum_probs=178.6 Q ss_pred Ch-hHHHHHHHHHh-ccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MK-STAAMLWEKLR-DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k-~~~~~r~~~lk-r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) |. +.+.+..++.. +.+...+++++++-.-+-+.. ..........++-.+.+...++..++.|.+- | +++. T Consensus 17 ~~~~~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~-~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~--~-----~~~~ 88 (453) T protein:vir:73 17 ITDKVVNDFMKKHQEEVERYEYLGNMYKGIMEISSQ-KAKDSWKPDNRLTNNFAKYIVDTFVGYFNGI--P-----IKKT 88 (453) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcC-CCCCccCccceeecchHHHHHHHhhhhhccc--C-----ceee Confidence 32 22333333332 323344455555533221111 1111122233555677777788777666431 2 2223 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCC-CeEEEEE-eceEE Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE-ATVVAWS-LRSYA 154 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~--~~~~-~~~~~~p-l~~~~ 154 (510) .+++. +.+ .+...+..++|.....++.++..++|.+.+++ +++. .++.+++ ..-|+ T Consensus 89 ~~d~~-------------~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~ 148 (453) T protein:vir:73 89 HDDKS-------------VLE-------AMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNESTESEVIYCSPLNVFM 148 (453) T ss_pred cCChH-------------HHH-------HHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEE Confidence 33211 122 22333666789999999999999999987554 3332 2355554 45567 Q ss_pred EeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCC-e-eecccccc Q lcl|Aclame:pro 155 VRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDG-V-RVGETGRW 232 (510) Q Consensus 155 v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~-~-~~~~~~~y 232 (510) +..|..++....+.++... .+....++||+. +. .++++.++ . .+...... T Consensus 149 v~dd~~~~~~~~~i~~~~~------------------~~~~~~~~vyt~-----~~-----i~~~~~~~~~~~~~~~~~~ 200 (453) T protein:vir:73 149 VYDDSIKQKPLFAVYYGFD------------------EEGNLSGTVYTL-----LE-----TISITGKAGEVKFGESTYN 200 (453) T ss_pred EEeCCCCceeEEEEEEEEe------------------cCceEEEEEEeC-----Ce-----EEEEEecCCceEEccceec Confidence 7777667665455444321 111233444441 10 11111111 1 11112222 Q ss_pred ccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcce----- Q lcl|Aclame:pro 233 PIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDY----- 307 (510) Q Consensus 233 ~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~----- 307 (510) .++.+|++..+ ++.+|+|-.+...+-+-.++.+.-......+....|.+++.- .....+.......+.. T Consensus 201 ~~g~vPvv~~~-----n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g-~~~~~~~~~~~~~~~~~~~~~ 274 (453) T protein:vir:73 201 VYSDLPIVEYN-----FNEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLG-AEVDEEDAKNIKDNRLINFFD 274 (453) T ss_pred cCCceeEEEec-----CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeec-CCCCchhhhcccccccccccc Confidence 34578887654 346899988999999999999888888888888888766631 1111122111111110 Q ss_pred -ecC------CccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHH Q lcl|Aclame:pro 308 -VPG------GAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVRITAEEAENTLGGTYSLL 379 (510) Q Consensus 308 -~~g------~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl 379 (510) .++ ...+++.+. ...+.......++.++..|...-.. +.........|+.-+..+-.-+.. ..--..+. T Consensus 275 ~~~~~~~~~~~~~d~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~-ka~~~~~~ 351 (453) T protein:vir:73 275 KNSNGQGTNAAKVDVKFLD--KPDSDVQTENLLNRLERSIFQFTMAANISDENFGNSSGVALAYKLQAMSN-LALSFQRK 351 (453) T ss_pred cccccccccccCceeEEee--ecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccCccHHHHHHHHHHHHH-HHHHHHHH Confidence 011 111232222 2234555677778888777553321 111111133466554332111111 11112222 Q ss_pred HHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHH Q lcl|Aclame:pro 380 AENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWA 459 (510) Q Consensus 380 ~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~ 459 (510) -.+.+.-++..+..++...+. +.....+++.+.-.+. -..++.++. ++.+.++ +..+.++ . T Consensus 352 ~~~~l~~~~~li~~~~~~~~~-~~~~~~i~v~f~~~~p-~~~~~~a~~-------~~k~~gi------is~et~~----~ 412 (453) T protein:vir:73 352 FQSALNRRYSLWSSLSTNASN-KDAWKDIEYTFTRNEP-KDIKEQAET-------ANILKGI------TSEETAL----S 412 (453) T ss_pred HHHHHHHHHHHHHHHHhccCC-ccccccceEEeCCCCC-CCHHHHHHH-------HHHHhcc------CcHHHHH----H Confidence 222222233333444433322 1222345555533321 111122221 1111121 1112222 2 Q ss_pred HcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 460 AFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNAL 507 (510) Q Consensus 460 ~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ 507 (510) .++. +.+ ++|+++.++++..+..+++.+ .+.+.-+.-+.+ T Consensus 413 ~~~~-----~~d~~~E~~ri~~E~~~~~~~~~~~---~~~~~~~~~~~~ 453 (453) T protein:vir:73 413 VISV-----IPDVQAEMEKIKKKKLLQLSLTRTS---NLVRMKQMRGNL 453 (453) T ss_pred hCCC-----CCCHHHHHHHHHHHHHHHHHHHHhc---cCCcchhhhcCC Confidence 2221 112 345544333332222222211 111112223333 No 91 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=97.24 E-value=0.00012 Score=42.06 Aligned_cols=425 Identities=8% Similarity=-0.041 Sum_probs=173.7 Q ss_pred ChhHHHHHHHHH--hccCchHHHHHHHHhhc-------c---cccCC-C-C---CCccccccccccchHHHHHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEKL--RDGSVEQRAIEFAKTTL-------P---YLMVD-P-M---SGSRGVVEHDFQSAGALLVNNLAAKL 63 (510) Q Consensus 1 ~k~~~~~r~~~l--kr~~~~~~w~e~~~~~~-------P---~~~~~-~-~---~~~~~~~~~~~dstg~~a~~~Laa~l 63 (510) -.+.+.+..+.- ++.++...++.+-.+.. | ....- . . ....+...++..+-+...++..++.| T Consensus 16 ~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~yl 95 (474) T protein:vir:94 16 LPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVDTRVGYL 95 (474) T ss_pred CHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHHhHhhhe Confidence 111112222211 12222222222211111 0 00000 0 0 00011112444555555555555544 Q ss_pred HHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--CC Q lcl|Aclame:pro 64 ARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SD 141 (510) Q Consensus 64 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~--~~ 141 (510) .+- |+. +...+. .....++.++|.+ .+..++|.....++.++..++|.+.+++. ++ T Consensus 96 ~g~--pv~-----~~~~~~--------~~~~e~~~~~l~~-------~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~ 153 (474) T protein:vir:94 96 HGV--PVT-----YDLDEN--------AEKNEKLKKFITN-------FAIRNSVDDEDSEIGKMAAICGYGARLAYIDTN 153 (474) T ss_pred ecc--cee-----EeeCCC--------CcchHHHHHHHHH-------HHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCC Confidence 331 321 222221 1112334444433 46678899999999999999999876554 33 Q ss_pred C-CeEEEEEeceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEe Q lcl|Aclame:pro 142 E-ATVVAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHE 220 (510) Q Consensus 142 ~-~~~~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e 220 (510) . .++.+++..+.++..|..+...-.+|.+... .......+++.-...+.. .+++. T Consensus 154 ~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~-------------------~~~~~~~~~~~~~y~~~~-----~~~~~ 209 (474) T protein:vir:94 154 GDIRIKNIDPYNVIFVGDNILEPTYSLRYFYEK-------------------DDDNGTDYVYAEFYDNAY-----YYVFR 209 (474) T ss_pred CeeEEEEEcccceEEEEcCCCceEEEEEEEEEe-------------------eCCCceEEEEEEEEcCce-----EEEEe Confidence 2 2455666555444456677765555444221 001111111111111111 11222 Q ss_pred eCCe---eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchh Q lcl|Aclame:pro 221 IDGV---RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD 297 (510) Q Consensus 221 ~~~~---~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~ 297 (510) .++. ........++..+|++..+ ++.+|.|=.+...+-+..++.+.-...........|.+.+.-.+ +..+ T Consensus 210 ~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~-~~~~ 283 (474) T protein:vir:94 210 GEGIDALQEVGRYEHLFDYNPLFGVP-----NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMG-MSEE 283 (474) T ss_pred ecCCCcccccccccCCCCccceEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCC-CCch Confidence 2211 1111222234567877553 46789999999999999999988888888888888877664211 1122 Q ss_pred hhh-cCCCccee-cCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 298 DYQ-DAEMGDYV-PGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 298 ~~~-~~~~G~~~-~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r~~E~~~~LG 373 (510) ... ....|.+. .+...++..+.. ..+.......++.+++.|...-.. +. ...-+...|+..+..+-.-+ .... T Consensus 284 ~~~~~~~~~~i~~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l-~~k~ 360 (474) T protein:vir:94 284 MIQETQKSGAFELFDKDMDVKYLTK--DVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMAL-ENKC 360 (474) T ss_pred hhhhhhhcceeEecCCCCceeEEec--cCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHH-HHHH Confidence 222 12234433 232334444332 235566777888888877553321 11 11123445666665432211 1112 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcC--CCCCCccceeeEEee--cHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCC Q lcl|Aclame:pro 374 GTYSLLAENLQSPLAYVCLSEVDDAL--LQGLITKQHKPAIET--GLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRIS 449 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~--l~~~p~~~~~~~~vs--~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id 449 (510) ....+.-.+.+.-+++.++.++...+ ..+.....+++.+.- +.+.+..++-+.++ ++. +. T Consensus 361 ~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl----------~g~------iS 424 (474) T protein:vir:94 361 MTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL----------KGQ------VS 424 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH----------hcc------Cc Confidence 22233333333333344444443322 222222345555543 33333333322221 121 11 Q ss_pred HHHHHHHHHHHcCCCHhhccCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 450 LPKMMDTIWAAFSVDTSQFYKSA-DELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 450 ~d~~~~~~a~~~Gvp~~~i~~s~-ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ...+++ .++ ++.+. +|++... +++.+...+. +. ..+..........= T Consensus 425 ~et~~~----~l~-----~v~d~~~E~eri~-~E~~e~~~~~--~~--~~~~~~~~~~~~~~ 472 (474) T protein:vir:94 425 ERTRLG----QSQ-----LVDDVDYELDEME-KESLEFNDKL--PD--IDEGDANDKSQNNQ 472 (474) T ss_pred hHHHHH----hCC-----CCCCHHHHHHHHH-HHHHHHHhhc--cc--ccCCCcCCCCcccc Confidence 122222 221 12222 3333222 2211111110 00 00000000000000 No 92 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=97.24 E-value=0.00012 Score=42.06 Aligned_cols=425 Identities=8% Similarity=-0.041 Sum_probs=173.7 Q ss_pred ChhHHHHHHHHH--hccCchHHHHHHHHhhc-------c---cccCC-C-C---CCccccccccccchHHHHHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEKL--RDGSVEQRAIEFAKTTL-------P---YLMVD-P-M---SGSRGVVEHDFQSAGALLVNNLAAKL 63 (510) Q Consensus 1 ~k~~~~~r~~~l--kr~~~~~~w~e~~~~~~-------P---~~~~~-~-~---~~~~~~~~~~~dstg~~a~~~Laa~l 63 (510) -.+.+.+..+.- ++.++...++.+-.+.. | ....- . . ....+...++..+-+...++..++.| T Consensus 16 ~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~yl 95 (474) T protein:vir:10 16 LPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVDTRVGYL 95 (474) T ss_pred CHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHHhHhhhe Confidence 111112222211 12222222222211111 0 00000 0 0 00011112444555555555555544 Q ss_pred HHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--CC Q lcl|Aclame:pro 64 ARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SD 141 (510) Q Consensus 64 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~--~~ 141 (510) .+- |+. +...+. .....++.++|.+ .+..++|.....++.++..++|.+.+++. ++ T Consensus 96 ~g~--pv~-----~~~~~~--------~~~~e~~~~~l~~-------~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~ 153 (474) T protein:vir:10 96 HGV--PVT-----YDLDEN--------AEKNEKLKKFITN-------FAIRNSVDDEDSEIGKMAAICGYGARLAYIDTN 153 (474) T ss_pred ecc--cee-----EeeCCC--------CcchHHHHHHHHH-------HHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCC Confidence 331 321 222221 1112334444433 46678899999999999999999876554 33 Q ss_pred C-CeEEEEEeceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEe Q lcl|Aclame:pro 142 E-ATVVAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHE 220 (510) Q Consensus 142 ~-~~~~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e 220 (510) . .++.+++..+.++..|..+...-.+|.+... .......+++.-...+.. .+++. T Consensus 154 ~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~-------------------~~~~~~~~~~~~~y~~~~-----~~~~~ 209 (474) T protein:vir:10 154 GDIRIKNIDPYNVIFVGDNILEPTYSLRYFYEK-------------------DDDNGTDYVYAEFYDNAY-----YYVFR 209 (474) T ss_pred CeeEEEEEcccceEEEEcCCCceEEEEEEEEEe-------------------eCCCceEEEEEEEEcCce-----EEEEe Confidence 2 2455666555444456677765555444221 001111111111111111 11222 Q ss_pred eCCe---eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchh Q lcl|Aclame:pro 221 IDGV---RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVD 297 (510) Q Consensus 221 ~~~~---~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~ 297 (510) .++. ........++..+|++..+ ++.+|.|=.+...+-+..++.+.-...........|.+.+.-.+ +..+ T Consensus 210 ~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~-~~~~ 283 (474) T protein:vir:10 210 GEGIDALQEVGRYEHLFDYNPLFGVP-----NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMG-MSEE 283 (474) T ss_pred ecCCCcccccccccCCCCccceEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCC-CCch Confidence 2211 1111222234567877553 46789999999999999999988888888888888877664211 1122 Q ss_pred hhh-cCCCccee-cCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 298 DYQ-DAEMGDYV-PGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 298 ~~~-~~~~G~~~-~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r~~E~~~~LG 373 (510) ... ....|.+. .+...++..+.. ..+.......++.+++.|...-.. +. ...-+...|+..+..+-.-+ .... T Consensus 284 ~~~~~~~~~~i~~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l-~~k~ 360 (474) T protein:vir:10 284 MIQETQKSGAFELFDKDMDVKYLTK--DVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMAL-ENKC 360 (474) T ss_pred hhhhhhhcceeEecCCCCceeEEec--cCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHH-HHHH Confidence 222 12234433 232334444332 235566777888888877553321 11 11123445666665432211 1112 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcC--CCCCCccceeeEEee--cHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCC Q lcl|Aclame:pro 374 GTYSLLAENLQSPLAYVCLSEVDDAL--LQGLITKQHKPAIET--GLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRIS 449 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~--l~~~p~~~~~~~~vs--~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id 449 (510) ....+.-.+.+.-+++.++.++...+ ..+.....+++.+.- +.+.+..++-+.++ ++. +. T Consensus 361 ~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl----------~g~------iS 424 (474) T protein:vir:10 361 MTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL----------KGQ------VS 424 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH----------hcc------Cc Confidence 22233333333333344444443322 222222345555543 33333333322221 121 11 Q ss_pred HHHHHHHHHHHcCCCHhhccCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 450 LPKMMDTIWAAFSVDTSQFYKSA-DELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 450 ~d~~~~~~a~~~Gvp~~~i~~s~-ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ...+++ .++ ++.+. +|++... +++.+...+. +. ..+..........= T Consensus 425 ~et~~~----~l~-----~v~d~~~E~eri~-~E~~e~~~~~--~~--~~~~~~~~~~~~~~ 472 (474) T protein:vir:10 425 ERTRLG----QSQ-----LVDDVDYELDEME-KESLEFNDKL--PD--IDEGDANDKSQNNQ 472 (474) T ss_pred hHHHHH----hCC-----CCCCHHHHHHHHH-HHHHHHHhhc--cc--ccCCCcCCCCcccc Confidence 122222 221 12222 3333222 2211111110 00 00000000000000 No 93 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=97.22 E-value=0.00013 Score=41.97 Aligned_cols=415 Identities=10% Similarity=0.008 Sum_probs=168.4 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhccc-----c---cCCCCCC-ccccccccccchHHHHHHHHHHHHHHhhcCcc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY-----L---MVDPMSG-SRGVVEHDFQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~-----~---~~~~~~~-~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~ 71 (510) ..+.+.+..++.+ . -..+++.+.+|..-. + ....... ......++..+-+...+++.++.|.+ -| T Consensus 45 ~~~~i~~~i~~~~-~-~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G--~p-- 118 (492) T protein:vir:94 45 LEEMIVRYIKQHL-E-KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVG--KP-- 118 (492) T ss_pred HHHHHHHHHHHHH-H-HHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHHhhhcc--cC-- Confidence 2333333333322 1 123445555554321 0 0000000 11112345667777778877776543 11 Q ss_pred CcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCC-CeEEEE Q lcl|Aclame:pro 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE-ATVVAW 148 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~--~~~~-~~~~~~ 148 (510) +.++.+|+. +.+.|. ..+ ..+|.....++.++..++|.+.+++ +++. .+++++ T Consensus 119 ---~~~~~~d~~-------------~~~~l~-------~~~-~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~~~~~ 174 (492) T protein:vir:94 119 ---IAFKHTDDE-------------VVKRID-------EVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRV 174 (492) T ss_pred ---ceeccCchH-------------HHHHHH-------HHH-hccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEE Confidence 122333321 111121 112 3578888899999999999987555 4332 245666 Q ss_pred Eece-EEEeeC-CCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE--E--EeecCCCeeEEEEEEeeC Q lcl|Aclame:pro 149 SLRS-YAVRRD-ATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH--V--QRRKGTAMDYAEMYHEID 222 (510) Q Consensus 149 pl~~-~~v~~d-~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~--v--~~~~~~~~~~~sv~~e~~ 222 (510) +..+ |++..| ..+++.-.+|.+... ....+++|+- | +..++...- ..+-.+.+ T Consensus 175 ~p~~~~~v~d~~~~~~~~a~ir~~~~~--------------------~~~~~~~y~~~~v~~~~~~~~~~~-~~~~~~~~ 233 (492) T protein:vir:94 175 PAEQGIPIWTDKEHEELEAFIRMYKLE--------------------NETKVEYWDKVTVNYYVYENGSLI-PDYSNNLE 233 (492) T ss_pred cccceEEEEcCCCCCceEEEEEEEeec--------------------cceeEEEEecCeEEEEEEecCeee-eccccccc Confidence 5544 555433 457776666655421 1122344331 0 111111100 00000111 Q ss_pred CeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhh-hc Q lcl|Aclame:pro 223 GVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDY-QD 301 (510) Q Consensus 223 ~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~-~~ 301 (510) +..+. ....++..+|++..+- +.+|.|=.+..++-+..+|.+.-.+....+....|.+++.-......... .. T Consensus 234 ~~~~~-~~~~~~g~vPvv~~~n-----n~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~ 307 (492) T protein:vir:94 234 NSKTH-FSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRL 307 (492) T ss_pred ccccc-ccccCCCccceEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhHHH Confidence 11111 1112335688876643 45799999999999999999888888888888888766531111111111 11 Q ss_pred -CCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHH Q lcl|Aclame:pro 302 -AEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENTLGGTYSL 378 (510) Q Consensus 302 -~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~r 378 (510) ...+.+.-+..++++.+.. ..+.......++.++..|...-.. +. ...-+...|+.-+...-.- +....-...+ T Consensus 308 ~~~~~~~~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~-l~~k~~~k~~ 384 (492) T protein:vir:94 308 LRYYGAIKVSDNGGVDTIQV--EVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTN-LNLKADKLAR 384 (492) T ss_pred HhhccceecCCCCcceeEec--cCCHHHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHH-HHHHHHHHHH Confidence 1112222233334444332 235566677778888777654322 11 1122233455433322111 1111122222 Q ss_pred HHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHH Q lcl|Aclame:pro 379 LAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIW 458 (510) Q Consensus 379 l~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a 458 (510) .-.+.+.-++..++.++. .. .....+.+.+.-.+. -..+..+ +.+..+.+. +....++ T Consensus 385 ~f~~~l~~~~~li~~~~~---~~-~~~~~i~v~f~~~~p-~~~~e~~-------~~~~kl~gi------iS~et~~---- 442 (492) T protein:vir:94 385 KAKVAIQELLWFVFEHFD---IK-GEHKDVDISFNYNKV-ANTELQV-------QTAQQSMGI------VSHETVL---- 442 (492) T ss_pred HHHHHHHHHHHHHHHHhc---CC-cccceeeEEecCCCC-CCHHHHH-------HHHHHHhcc------CchHHHH---- Confidence 222222222222233222 11 122334444432221 1111111 222222221 1112222 Q ss_pred HHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 459 AAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 459 ~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ..+| ++.+ ++|++...+ +++++ +++.. ...+.....+.. T Consensus 443 ~~l~-----~v~d~~~E~eri~~-E~~~~-~~~~~------~~~~~~~~~~~~ 482 (492) T protein:vir:94 443 ENHP-----FVEDLQAELERIEQ-EQMEY-NKQLP------NLDDGGADSAQQ 482 (492) T ss_pred HhCC-----CCCCHHHHHHHHHH-HHHHH-Hhhcc------ccccccCCCCcc Confidence 2222 1222 234433222 22111 11110 011111111111 No 94 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=97.18 E-value=0.00014 Score=41.69 Aligned_cols=412 Identities=10% Similarity=0.026 Sum_probs=169.3 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhcc-----cc-cC--CCCCC-ccccccccccchHHHHHHHHHHHHHHhhcCcc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLP-----YL-MV--DPMSG-SRGVVEHDFQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P-----~~-~~--~~~~~-~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~ 71 (510) -++.+.+..++.+ .-..+++.+.+|..- .+ .. ..... ......++-.+.+...++..++.|++ -| T Consensus 27 ~~~~i~~~i~~~~--~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~ivd~~~~yl~g--~p-- 100 (478) T protein:vir:10 27 QEEMILRLVREHK--ENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVA--NP-- 100 (478) T ss_pred hHHHHHHHHHHHH--HHHHHHHHHHHHhcccccccccchhhhcccccccccccceeccchHHHHHHHHhhhhcc--cC-- Confidence 2222223323222 112234444444321 10 00 00000 01111234455666666666666654 12 Q ss_pred CcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCC-CeEEEE Q lcl|Aclame:pro 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE-ATVVAW 148 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~--~~~~-~~~~~~ 148 (510) +.+..+++. +.+ .+...+ .++|.....++.++..++|.+.+++ +++. .++.++ T Consensus 101 ---~~~~~~~~~-------------~~~-------~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~~~~~ 156 (478) T protein:vir:10 101 ---VTFGVDNDK-------------ALK-------QIQHTL-NHKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFRV 156 (478) T ss_pred ---ceeecCChH-------------HHH-------HHHHHH-hccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEE Confidence 223333321 111 122223 3688999999999999999887655 4332 235555 Q ss_pred Eece-EEEeeC-CCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEE--EE-EeecCCCeeEEEEEEeeCC Q lcl|Aclame:pro 149 SLRS-YAVRRD-ATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYT--HV-QRRKGTAMDYAEMYHEIDG 223 (510) Q Consensus 149 pl~~-~~v~~d-~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~--~v-~~~~~~~~~~~sv~~e~~~ 223 (510) +..+ |.+..| ..|++.-.+|.+... ....+++|+ .| +.+..++..++.......+ T Consensus 157 ~p~~~~~v~d~~~~~~~~~~ir~~~~~--------------------~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~ 216 (478) T protein:vir:10 157 PAEQAVPIWTNKERDELQAFIRVYELD--------------------GAERVEYWTKDDVTFYELKEGQLIPDFYRSEDH 216 (478) T ss_pred cccceEEEEcCCCCCceEEEEEEEeee--------------------CceEEEEEeCCcEEEEEecCCeeeccccccccc Confidence 5555 445444 358887666665431 112233332 11 1111122222211111111 Q ss_pred e---eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhh-h Q lcl|Aclame:pro 224 V---RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDD-Y 299 (510) Q Consensus 224 ~---~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~-~ 299 (510) . .......+++..+|++..+. +.+|.|-.+...+-+..++.+.-......+....|.+++.-...-.... . T Consensus 217 ~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 291 (478) T protein:vir:10 217 IQPHYYQGNKLMSWGRVPFIPFKN-----NPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFM 291 (478) T ss_pred cccceecccccccCCcceEEEecc-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccccchh Confidence 1 11112233456788887765 4579999999999999999988888888888888876653111111111 1 Q ss_pred hc-CCCcce-ecCC-ccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHH-------HHHH Q lcl|Aclame:pro 300 QD-AEMGDY-VPGG-AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRI-------TAEE 367 (510) Q Consensus 300 ~~-~~~G~~-~~g~-~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~-------r~~E 367 (510) .. ...+.+ +++. ..+++.+... .+.......++.+++.|...-.. +. ....+...|+.-+.. ++.+ T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~ 369 (478) T protein:vir:10 292 HNLKYYKAISVAGESGSGVDTIKVE--VPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANK 369 (478) T ss_pred hhhhhCceeEecCCCCCcceEEeec--CCHHHHHHHHHHHHHHHHHHhCCcCcCccccccchHHHHHHHHHHHHHHHHHH Confidence 11 112222 3332 2334444322 36677778888888777654321 11 111223446554432 3333 Q ss_pred HHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecH--HHHHHHHHHHHHHHHHHHHHhhcChHhHh Q lcl|Aclame:pro 368 AENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGL--PALSRSAAVQSMLNASQVIAGLAPIAQLD 445 (510) Q Consensus 368 ~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l--~~l~r~~~~~~~~~~~q~~~~~~~~~q~~ 445 (510) +...++..+.+ ++..++.++. .......+.+.+.-.+ +.+.. ++.++.+++. T Consensus 370 ~~~~~~~~l~~--------~~~li~~~~~----~~~d~~~i~i~f~~~~p~~~~e~----------~~~~~~~~g~---- 423 (478) T protein:vir:10 370 LKNKTLTALQE--------LLQYIIDFYR----LDVRVQDIEITFNFNVMVNELEN----------SQIAMNSTGL---- 423 (478) T ss_pred HHHHHHHHHHH--------HHHHHHHHhC----CCcccccceEEeCCCCCCCHHHH----------HHHHHHHhCC---- Confidence 34444433333 2222222221 1122224455443222 22221 1112222221 Q ss_pred hcCCHHHHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hhcccCCC Q lcl|Aclame:pro 446 PRISLPKMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASD---MTNALAGV 510 (510) Q Consensus 446 ~~id~d~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~---~~~~~ag~ 510 (510) +....++. .++ ++.+ ++|++...+++ .+.. ++ . .....+..+ ..+.-.+. T Consensus 424 --iS~et~i~----~~~-----~v~d~~~E~~ri~~E~-~~~~-~~-~-~~~~~~~~d~~~~~~~d~~~ 477 (478) T protein:vir:10 424 --LSKETILG----NHS-----WVQDPVAEMERIEQEN-IELN-QQ-L-PDIEEGLNDEQQRQSEDNQS 477 (478) T ss_pred --CChHHHHH----hCC-----CCCCHHHHHHHHHHHH-HHHH-Hh-c-cccCCCCcccccccCcCCCC Confidence 22222222 221 1222 23333222221 1111 10 0 000000000 00000000 No 95 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=97.13 E-value=0.00016 Score=41.42 Aligned_cols=408 Identities=9% Similarity=0.016 Sum_probs=171.7 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhccc--ccC------CCCCC-ccccccccccchHHHHHHHHHHHHHHhhcCcc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY--LMV------DPMSG-SRGVVEHDFQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~--~~~------~~~~~-~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~ 71 (510) ..+.+.+..++.+ .-..+++.+.+|..-. .+. ..... ......++..+-+...++..++.|.+ .| T Consensus 36 ~~~~i~~~i~~~~--~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G--~p-- 109 (483) T protein:vir:12 36 LEEMIVRYIKQHL--EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVG--KP-- 109 (483) T ss_pred HHHHHHHHHHHHH--HHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHhhhhcc--cC-- Confidence 3333333333332 1123455555554332 000 00000 11122345667777777777766643 12 Q ss_pred CcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC-CeEEEE Q lcl|Aclame:pro 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDE-ATVVAW 148 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l--~~~~~~-~~~~~~ 148 (510) +++...|+. ..+. +.. +...+|.....++.++..++|.+.+ |.+++. .+++++ T Consensus 110 ---~~~~~~d~~-------------~~~~-------l~~-~~~n~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~~i~~~ 165 (483) T protein:vir:12 110 ---IAFKHTDDE-------------VVKR-------IDE-VLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRV 165 (483) T ss_pred ---ceeccCChH-------------HHHH-------HHH-HHhccHHHHHHHHHHHHhhCCeEEEEEEEcCCCceEEEEE Confidence 223333321 1111 112 2235788889999999999998764 444442 246667 Q ss_pred EeceEEEeeC--CCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE--E--EeecCCCeeEEEEEEeeC Q lcl|Aclame:pro 149 SLRSYAVRRD--ATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH--V--QRRKGTAMDYAEMYHEID 222 (510) Q Consensus 149 pl~~~~v~~d--~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~--v--~~~~~~~~~~~sv~~e~~ 222 (510) +..+.++.-| ..+++...+|.++.. ....+++|+- | +..++... ...+..+.+ T Consensus 166 ~p~~~~~v~d~~~~~~~~~~ir~~~~~--------------------~~~~~~~y~~~~v~~~~~~~~~~-~~~~~~~~~ 224 (483) T protein:vir:12 166 PAEQGIPIWTDKEHEELEAFIRMYKLE--------------------NETKVEYWDKVTVNYYVYENGSL-IPDYSNNLE 224 (483) T ss_pred cccceEEEEcCCCCCceEEEEEEEEee--------------------cceEEEEEecCeEEEEEEeCCee-eeccccccc Confidence 6655444433 457777666665421 1112344431 1 11111110 001111111 Q ss_pred CeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhh-- Q lcl|Aclame:pro 223 GVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQ-- 300 (510) Q Consensus 223 ~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~-- 300 (510) ...+. ....++..+|++..+- +.+|+|=.+...+-+..+|.+.-...........|.+++.-.+........ T Consensus 225 ~~~~~-~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~ 298 (483) T protein:vir:12 225 NSKTH-FSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRL 298 (483) T ss_pred ccccc-cccCCCCccceEEecC-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHHHh Confidence 11121 1222345688776653 457999999999999999988888888888888887766421111111111 Q ss_pred cCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHH-------HHHHHHHH Q lcl|Aclame:pro 301 DAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRI-------TAEEAENT 371 (510) Q Consensus 301 ~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~-------r~~E~~~~ 371 (510) ....+.+......+++.+.. ..+.......++.+++.|...-.. +. ...-+...|+.-+.. +++++... T Consensus 299 ~~~~~~~~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~ 376 (483) T protein:vir:12 299 LRYYGAIKVSDNGGVDTIQV--EVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARK 376 (483) T ss_pred hhhccccccCCCCcceEEee--cCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHH Confidence 11112332233334444432 235566677778777777554322 11 112223445554332 22333333 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHH Q lcl|Aclame:pro 372 LGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLP 451 (510) Q Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d 451 (510) ++..+.+ ++..++.++. .. .....+++.+.-.+ +-..+..++ .+..+++. +... T Consensus 377 f~~~l~~--------~~~li~~~~~---~~-~~~~~i~v~f~~~~-p~~~~~~a~-------~~~kl~Gi------iS~e 430 (483) T protein:vir:12 377 AKVAIQE--------LLWFVFEHFD---IK-GEHKDVDISFNYNK-VANTELQVQ-------TAQQSMGI------VSHE 430 (483) T ss_pred HHHHHHH--------HHHHHHHHhc---CC-CccceeeEEeCCCC-CCCHHHHHH-------HHHHHhcc------CchH Confidence 3333332 2222233322 11 12223444443222 111111121 22222221 2222 Q ss_pred HHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 452 KMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 452 ~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) .++. .++ ++.+ ++|++...+ ++.++. ++ .+. ........++. T Consensus 431 t~~~----~~~-----~v~d~~~E~~ri~~-E~~~~~-~~-~~~-----~~~~~~d~~~~ 473 (483) T protein:vir:12 431 TVLE----NHP-----FVEDLQAELERIEQ-EQMEYN-KQ-LPN-----LDDGGADGAQQ 473 (483) T ss_pred HHHH----hCC-----CCCCHHHHHHHHHH-HHHHHH-hh-ccc-----ccccccCCccc Confidence 2222 222 1222 334433222 221111 11 000 00000001111 No 96 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=97.09 E-value=0.00018 Score=41.17 Aligned_cols=421 Identities=10% Similarity=0.010 Sum_probs=177.1 Q ss_pred ChhHHHHHHHHHh---------------------------ccCchHHHHHHHHhhccc---ccCCCC-CCcccccccccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR---------------------------DGSVEQRAIEFAKTTLPY---LMVDPM-SGSRGVVEHDFQ 49 (510) Q Consensus 1 ~k~~~~~r~~~lk---------------------------r~~~~~~w~e~~~~~~P~---~~~~~~-~~~~~~~~~~~d 49 (510) +++++..+|+.-. .....++++++.+|..-. +..... ........++.. T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~ 92 (511) T protein:vir:96 13 LRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAH 92 (511) T ss_pred hhhhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcceeec Confidence 3333333333221 111112334444443221 101111 011111234556 Q ss_pred chHHHHHHHHHHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHH Q lcl|Aclame:pro 50 SAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLI 129 (510) Q Consensus 50 stg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~ 129 (510) +.+...++..++.|.+ -|+. ++.+++. . ...+...+...+|.....++.++.. T Consensus 93 n~~k~Iv~~~~~yl~g--~p~~-----~~~~d~~-------------~-------~~~l~~~~~~n~~~~~~~~~~~~~~ 145 (511) T protein:vir:96 93 DYASYISDFINGYFLG--NPIQ-----YQDDDKD-------------V-------LEAIEAFNDLNDVESHNRSLGLDLS 145 (511) T ss_pred chHHHHHHHHhhhhcc--cCce-----eecCchH-------------H-------HHHHHHHHhhcChhHHHHHHHHHHH Confidence 6777777777765543 1211 2233221 1 1234445667789999999999999 Q ss_pred hhCceEEEE--eCCC-CeEEEEEece-EEEeeCC-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEE Q lcl|Aclame:pro 130 VTGNALLYR--NSDE-ATVVAWSLRS-YAVRRDA-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHV 204 (510) Q Consensus 130 ~~G~~~l~~--~~~~-~~~~~~pl~~-~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v 204 (510) ++|.+.+++ +++. .++.+++..+ |++.-|. .+++...+|.+..... . ...++.-..+++|+ T Consensus 146 ~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~-----------~-~~~~~~~~~~~vyt-- 211 (511) T protein:vir:96 146 IYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPI-----------D-KTDEDEVFTVDLFT-- 211 (511) T ss_pred hcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeec-----------c-ccccceEEEEEEEe-- Confidence 999886544 4432 2455665555 4444333 3566555555432110 0 00001111223332 Q ss_pred EeecCCCeeEEEEEEeeCCe------eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 205 QRRKGTAMDYAEMYHEIDGV------RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYE 278 (510) Q Consensus 205 ~~~~~~~~~~~sv~~e~~~~------~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~ 278 (510) ++.-.. |...++. ........++..+|++..+- ..+|+|-.+..++-+..++.+.-...... T Consensus 212 ---~~~i~~----~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~gd~e~v~~liDa~~~~~S~~~~~~ 279 (511) T protein:vir:96 212 ---SHGVYR----YLTNRTNGLKLTPRENSFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYM 279 (511) T ss_pred ---CCcEEE----EEecCCCcccccccccccccCcCcccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHH Confidence 111000 1111111 01112233445678776543 45799999999999999998877777777 Q ss_pred HHhhCCceeeCCCCccchhhhhcCCCccee--------c------CCccccccccCCCccchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 279 LESLEVLNLVDEAKGAVVDDYQDAEMGDYV--------P------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF 344 (510) Q Consensus 279 ~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~--------~------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af 344 (510) +....|.+++.-......+.+.....+..+ . +...+++.+. ...+.......++.+++.|...- T Consensus 280 ~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~e~~~~~L~~~I~~~s 357 (511) T protein:vir:96 280 SDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMFT 357 (511) T ss_pred HHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHh Confidence 777788766543223333333222211111 1 1111222221 22355666777777777765433 Q ss_pred hh-cc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCc--cceeeEEeecHHHHH Q lcl|Aclame:pro 345 MY-GA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLIT--KQHKPAIETGLPALS 420 (510) Q Consensus 345 ~~-~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~--~~~~~~~vs~l~~l~ 420 (510) +. +. ...-+...|+..+...-. .+........+.-.+.+.-++..++.++...+-...+. ..+++.+.-++. -. T Consensus 358 ~~P~~~~~~~~~n~Sg~Al~~~~~-~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p-~n 435 (511) T protein:vir:96 358 NTPNMKDDNFSGTQSGEAMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLP-KS 435 (511) T ss_pred CCccccccccccccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCC-cC Confidence 21 11 111123456665544322 22222333444444444445555555554332222222 245555543221 11 Q ss_pred HHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 421 RSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEG 499 (510) Q Consensus 421 r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~ 499 (510) .+..++.+. .+++. +..+.++.. ++ ++.+ ++|++...+++ +.+..++ T Consensus 436 ~~e~~d~~~-------kl~G~------iS~et~l~~----l~-----~v~d~~~El~ri~~E~-~~~~~~~--------- 483 (511) T protein:vir:96 436 LIEELKAYI-------DSGGK------ISQTTLMSL----FS-----FFQDPELEVKKIEEDE-KESIKKA--------- 483 (511) T ss_pred HHHHHHHHH-------HHhcc------CChHHHHHh----CC-----CCCCHHHHHHHHHHHH-HHHHHHH--------- Confidence 222222221 11121 111222222 21 2222 33443322221 1111111 Q ss_pred HHHhhcccCCC Q lcl|Aclame:pro 500 ASDMTNALAGV 510 (510) Q Consensus 500 a~~~~~~~ag~ 510 (510) .....+...+. T Consensus 484 ~~~~~~~~~~~ 494 (511) T protein:vir:96 484 QKGIYKDPRDI 494 (511) T ss_pred hhccccCCCCC Confidence 00111122222 No 97 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=97.09 E-value=0.00018 Score=41.17 Aligned_cols=421 Identities=10% Similarity=0.010 Sum_probs=177.1 Q ss_pred ChhHHHHHHHHHh---------------------------ccCchHHHHHHHHhhccc---ccCCCC-CCcccccccccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR---------------------------DGSVEQRAIEFAKTTLPY---LMVDPM-SGSRGVVEHDFQ 49 (510) Q Consensus 1 ~k~~~~~r~~~lk---------------------------r~~~~~~w~e~~~~~~P~---~~~~~~-~~~~~~~~~~~d 49 (510) +++++..+|+.-. .....++++++.+|..-. +..... ........++.. T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~ 92 (511) T protein:vir:78 13 LRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAH 92 (511) T ss_pred hhhhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcceeec Confidence 3333333333221 111112334444443221 101111 011111234556 Q ss_pred chHHHHHHHHHHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHH Q lcl|Aclame:pro 50 SAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLI 129 (510) Q Consensus 50 stg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~ 129 (510) +.+...++..++.|.+ -|+. ++.+++. . ...+...+...+|.....++.++.. T Consensus 93 n~~k~Iv~~~~~yl~g--~p~~-----~~~~d~~-------------~-------~~~l~~~~~~n~~~~~~~~~~~~~~ 145 (511) T protein:vir:78 93 DYASYISDFINGYFLG--NPIQ-----YQDDDKD-------------V-------LEAIEAFNDLNDVESHNRSLGLDLS 145 (511) T ss_pred chHHHHHHHHhhhhcc--cCce-----eecCchH-------------H-------HHHHHHHHhhcChhHHHHHHHHHHH Confidence 6777777777765543 1211 2233221 1 1234445667789999999999999 Q ss_pred hhCceEEEE--eCCC-CeEEEEEece-EEEeeCC-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEE Q lcl|Aclame:pro 130 VTGNALLYR--NSDE-ATVVAWSLRS-YAVRRDA-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHV 204 (510) Q Consensus 130 ~~G~~~l~~--~~~~-~~~~~~pl~~-~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v 204 (510) ++|.+.+++ +++. .++.+++..+ |++.-|. .+++...+|.+..... . ...++.-..+++|+ T Consensus 146 ~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~-----------~-~~~~~~~~~~~vyt-- 211 (511) T protein:vir:78 146 IYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPI-----------D-KTDEDEVFTVDLFT-- 211 (511) T ss_pred hcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeec-----------c-ccccceEEEEEEEe-- Confidence 999886544 4432 2455665555 4444333 3566555555432110 0 00001111223332 Q ss_pred EeecCCCeeEEEEEEeeCCe------eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 205 QRRKGTAMDYAEMYHEIDGV------RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYE 278 (510) Q Consensus 205 ~~~~~~~~~~~sv~~e~~~~------~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~ 278 (510) ++.-.. |...++. ........++..+|++..+- ..+|+|-.+..++-+..++.+.-...... T Consensus 212 ---~~~i~~----~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~gd~e~v~~liDa~~~~~S~~~~~~ 279 (511) T protein:vir:78 212 ---SHGVYR----YLTNRTNGLKLTPRENSFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYM 279 (511) T ss_pred ---CCcEEE----EEecCCCcccccccccccccCcCcccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHH Confidence 111000 1111111 01112233445678776543 45799999999999999998877777777 Q ss_pred HHhhCCceeeCCCCccchhhhhcCCCccee--------c------CCccccccccCCCccchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 279 LESLEVLNLVDEAKGAVVDDYQDAEMGDYV--------P------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF 344 (510) Q Consensus 279 ~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~--------~------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af 344 (510) +....|.+++.-......+.+.....+..+ . +...+++.+. ...+.......++.+++.|...- T Consensus 280 ~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~e~~~~~L~~~I~~~s 357 (511) T protein:vir:78 280 SDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMFT 357 (511) T ss_pred HHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHh Confidence 777788766543223333333222211111 1 1111222221 22355666777777777765433 Q ss_pred hh-cc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCc--cceeeEEeecHHHHH Q lcl|Aclame:pro 345 MY-GA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLIT--KQHKPAIETGLPALS 420 (510) Q Consensus 345 ~~-~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~--~~~~~~~vs~l~~l~ 420 (510) +. +. ...-+...|+..+...-. .+........+.-.+.+.-++..++.++...+-...+. ..+++.+.-++. -. T Consensus 358 ~~P~~~~~~~~~n~Sg~Al~~~~~-~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p-~n 435 (511) T protein:vir:78 358 NTPNMKDDNFSGTQSGEAMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLP-KS 435 (511) T ss_pred CCccccccccccccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCC-cC Confidence 21 11 111123456665544322 22222333444444444445555555554332222222 245555543221 11 Q ss_pred HHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 421 RSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEG 499 (510) Q Consensus 421 r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~ 499 (510) .+..++.+. .+++. +..+.++.. ++ ++.+ ++|++...+++ +.+..++ T Consensus 436 ~~e~~d~~~-------kl~G~------iS~et~l~~----l~-----~v~d~~~El~ri~~E~-~~~~~~~--------- 483 (511) T protein:vir:78 436 LIEELKAYI-------DSGGK------ISQTTLMSL----FS-----FFQDPELEVKKIEEDE-KESIKKA--------- 483 (511) T ss_pred HHHHHHHHH-------HHhcc------CChHHHHHh----CC-----CCCCHHHHHHHHHHHH-HHHHHHH--------- Confidence 222222221 11121 111222222 21 2222 33443322221 1111111 Q ss_pred HHHhhcccCCC Q lcl|Aclame:pro 500 ASDMTNALAGV 510 (510) Q Consensus 500 a~~~~~~~ag~ 510 (510) .....+...+. T Consensus 484 ~~~~~~~~~~~ 494 (511) T protein:vir:78 484 QKGIYKDPRDI 494 (511) T ss_pred hhccccCCCCC Confidence 00111122222 No 98 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=97.04 E-value=0.0002 Score=40.90 Aligned_cols=420 Identities=11% Similarity=0.012 Sum_probs=171.4 Q ss_pred ChhH-HHHHHHHHhccCchHHHHHHHHhhccc--ccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCccccc Q lcl|Aclame:pro 1 MKST-AAMLWEKLRDGSVEQRAIEFAKTTLPY--LMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~k~~-~~~r~~~lkr~~~~~~w~e~~~~~~P~--~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l 77 (510) |... +.+..++- .....++++.+.+|..-. ...... .......++..+-+...++..++.|.+- | +++ T Consensus 25 ~~~~~i~~~i~~~-~~~~~~~~~~l~~Yy~g~~~i~~~~~-~~~~~~~ki~~n~~~~Ivd~~~~~l~g~--p-----~~~ 95 (470) T protein:vir:99 25 LTSNELLGFIAYN-ETVLKPRYRENMKLYLGKHKILTAPE-KETGADNRIVVNSAKYVVDVYNGYFCGI--E-----PKL 95 (470) T ss_pred cCHHHHHHHHHHH-HHhhHHHHHHHHHHhccccccccCcc-cccCCcceeecchHHHHHHHHhhhhccC--C-----eeE Confidence 2222 12211111 122223444445544321 010011 1111223445556666666666655322 2 112 Q ss_pred CCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCC-CeEEEEEeceEE Q lcl|Aclame:pro 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE-ATVVAWSLRSYA 154 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~--~~~~-~~~~~~pl~~~~ 154 (510) +..++. +..+ .+.+.+..++|.....++.++..++|.+.+++ +++. .++.+++..+.+ T Consensus 96 ~~~~d~------------~~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~ 156 (470) T protein:vir:99 96 ALLNDS------------SKID-------EIARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDARPHLMYSSPNHAF 156 (470) T ss_pred eeCCch------------hHHH-------HHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeEEEEEEccceeE Confidence 222211 0111 23344667899999999999999999876554 4432 235666666665 Q ss_pred EeeCCCCc--eeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeecccccc Q lcl|Aclame:pro 155 VRRDATGR--WMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRW 232 (510) Q Consensus 155 v~~d~~G~--v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y 232 (510) +..|..+. +...+|.++.. . +.....|-.++-. +..+.|...-...+ ......... T Consensus 157 ~i~d~~~~~~~~~~vr~~~~~-------------------~-~~~~~~~~~~~~~-~~~~~~~~~~~~~~-~~~~~~~~~ 214 (470) T protein:vir:99 157 IIYDDTVQRQPLAFVHYQIDN-------------------S-NNWTDAYGVIQYA-DKFYKFKGYDIEED-TNAAGYAIN 214 (470) T ss_pred EEEcCCCCcceEEEEEEEEEe-------------------c-CCeeEEEEEEEec-CeEEEEEecccccc-ccccccccc Confidence 55555432 33334333311 0 1111111122221 11111111000000 111112223 Q ss_pred ccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhh-----hhcC-CCcc Q lcl|Aclame:pro 233 PIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDD-----YQDA-EMGD 306 (510) Q Consensus 233 ~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~-----~~~~-~~G~ 306 (510) ++..+|++..+ +..+|+|=.+..++-+..++.+.-.+.........|.+.+. |+..+.+ +... ..+. T Consensus 215 ~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~--g~~~~~~~~g~~~~~~~~~~~ 287 (470) T protein:vir:99 215 PYGLVPAVEFF-----ENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMI--GFKLPEDDEGNPKFDFKNNRV 287 (470) T ss_pred CCCccceEeec-----CCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCcccccccchhhhhhhcce Confidence 34567877654 35689999999999999999988888888888888877764 2221111 1111 1111 Q ss_pred e-ecC----CccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHH Q lcl|Aclame:pro 307 Y-VPG----GAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENTLGGTYSLL 379 (510) Q Consensus 307 ~-~~g----~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl 379 (510) + +++ ...+++.+. ...+.......++.+.+.|...-.. +. ....+...|+..+..+-.-+... .--..+. T Consensus 288 ~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k-~~~~~~~ 364 (470) T protein:vir:99 288 LYVSQLDPDTNPQIGFIA--KPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNK-ADSKERK 364 (470) T ss_pred eeecCCCCCCCCcceEEe--ecCChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHH-HHHHHHH Confidence 1 121 122233332 2234555566677777766443221 11 11222445776665432222211 1112222 Q ss_pred HHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEee--cHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHH Q lcl|Aclame:pro 380 AENLQSPLAYVCLSEVDDALLQGLITKQHKPAIET--GLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTI 457 (510) Q Consensus 380 ~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs--~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~ 457 (510) -.+.+.-+++.++.++...+-.......+++.+.- +.+.++.++-+.++ ++. +....++..+ T Consensus 365 ~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl----------~gi------is~et~l~~l 428 (470) T protein:vir:99 365 FDKSLMQLYRIVLATLFNNKQDQELWSELDFKFTRNLPEDMASAIDNAKNA----------EGI------VSKKTQLGMI 428 (470) T ss_pred HHHHHHHHHHHHHHHHhccCCcccccccceEEeCCCCCcCHHHHHHHHHHH----------hcc------CCHHHHHHhC Confidence 22222223333344443333333333345555532 22333333332222 111 1112223221 Q ss_pred HHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 458 WAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 458 a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) -++++ ++|++...+++ ..+.+++.. .....+..++-.+- T Consensus 429 ---~~vd~------~~E~eri~~E~-~~~~~~~~~----~~~~~d~~~~d~~~ 467 (470) T protein:vir:99 429 ---PDIEP------DAEMKQIAKEK-ADAIKQTQQ----LSMPIDILKRDNNA 467 (470) T ss_pred ---CCCCH------HHHHHHHHHHH-HHHHHHHHh----hcCCCCcCCCCCCc Confidence 23332 23444322221 111111100 00001111111111 No 99 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=97.02 E-value=0.00021 Score=40.79 Aligned_cols=430 Identities=10% Similarity=0.022 Sum_probs=175.9 Q ss_pred ChhHHHHHHHHHh-cc----CchHHHHHHHHhhcccc----------cCC--CCC-Cccccccc-cccchHHHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-DG----SVEQRAIEFAKTTLPYL----------MVD--PMS-GSRGVVEH-DFQSAGALLVNNLAA 61 (510) Q Consensus 1 ~k~~~~~r~~~lk-r~----~~~~~w~e~~~~~~P~~----------~~~--~~~-~~~~~~~~-~~dstg~~a~~~Laa 61 (510) |-..--++ +... +. .+.++|+-+.+.+--.+ .++ .+. .-..++.+ .|=+.- ....+ T Consensus 1 ~~~~~~~~-~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~----~~tl~ 75 (491) T protein:vir:95 1 MLTANGQG-SGVKTKHREWLHYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFT----RRTLS 75 (491) T ss_pred CcccCCcc-CCCCccCHHHHHHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChH----HHHHH Confidence 11100000 0000 00 02335555544432110 000 000 00111111 122222 33333 Q ss_pred HHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCC Q lcl|Aclame:pro 62 KLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSD 141 (510) Q Consensus 62 ~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~ 141 (510) .|++.+|- ..|.+ ++++ .++.++++| -....+.+.-+...+.+...+|-+.+++|.+ T Consensus 76 ~l~G~vfr-k~p~~--~~p~--------------~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P 132 (491) T protein:vir:95 76 GMVGSVMR-KEPEI--NIPK--------------ELEYLLKNA------DGSGVGLIQHAQDTLMEIDSVGRGGLLVDAP 132 (491) T ss_pred HHhchhhc-CCcee--eccH--------------HHHHHHhcc------CCCCCCHHHHHHHHHHHHHHcCeEEEEEecC Confidence 34444433 12333 2221 234444444 3456778888899999999999999999865 Q ss_pred CC---------------eEEEEEeceEE---E-eeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEE Q lcl|Aclame:pro 142 EA---------------TVVAWSLRSYA---V-RRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYT 202 (510) Q Consensus 142 ~~---------------~~~~~pl~~~~---v-~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~ 202 (510) .. .+..|+-.+.. . ..|+.+++.-+..+++...++=...|+ .+.++.|. T Consensus 133 ~~~~~T~Ade~~~~~rPy~~~~~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~------------~~~~~qyR 200 (491) T protein:vir:95 133 ETAAATAAEQNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFE------------TKYGEQYR 200 (491) T ss_pred CCcccCHHHHHHhcCCcEEEEechhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcc------------cceEEEEE Confidence 32 14556544432 2 235566777777777655544444554 34445555 Q ss_pred EEEeecCCCeeEEEEEEeeCCe-------eeccccccccccCceEEEeeeecCCCcc--ccchHHHHHHHHHHHHHH--- Q lcl|Aclame:pro 203 HVQRRKGTAMDYAEMYHEIDGV-------RVGETGRWPIHLCPYIVPTWNLAPGEHY--GRGHVEDYIGDFAKLSLL--- 270 (510) Q Consensus 203 ~v~~~~~~~~~~~sv~~e~~~~-------~~~~~~~y~~~~~P~~~~Rw~~~~ge~Y--Grgp~~~~l~d~~~L~~l--- 270 (510) ++.+...+++.+..+....+|. .+..+++ ..+++|++.|--..+..+ |..|.. |+..||.- T Consensus 201 vL~l~~~g~~~~~v~r~~~~g~~~~~~~~~~~~~g~---~~l~~IPfv~~~~~~~~~~~~~pPLl----~LA~lni~Hy~ 273 (491) T protein:vir:95 201 VLDIDTDGNYRQRLFRFDAEGGAQEEVVEIYPDLGE---SLRGVIPFTFIGATNNDATIDDAPLL----PLAELNIGHYR 273 (491) T ss_pred EEeecCCCceEEEEEEEcCCCcceeeeeeeeecCCC---cccCeeEEEEEecCCCCCCCCcCchH----HHHHHHHHHhh Confidence 5555433333222222222221 1222333 356777777765555544 445533 55555532 Q ss_pred HHHHHH-HHHHhhCCceeeCC-CC-------ccchhhhhcCCC-cceecCCccccccccCCCccchHHHHHHHHHHHHHH Q lcl|Aclame:pro 271 SEKLGL-YELESLEVLNLVDE-AK-------GAVVDDYQDAEM-GDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRL 340 (510) Q Consensus 271 ~~~~l~-~~~~a~~~~~lv~~-~g-------~~~~~~~~~~~~-G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I 340 (510) +.+-++ ....+.-|...+.. +. ..++..+.-+.+ +...| ...+.+.++.. +.+ .+...+.+++.++ T Consensus 274 ~ssd~~~~l~~~~~P~l~~~G~d~~~~~~~~~~~~~~i~~g~~~~~~lP-~~~~~~~ie~~-~~~--~~~~~l~~~e~qm 349 (491) T protein:vir:95 274 NSADNEESSFVVGQPTLFIYPGDNLTPQSFKEANPNGIKFGSRCGHNLG-YGGSAQLIQAG-ENN--LARQNMLDKEQQA 349 (491) T ss_pred hhhHHHHHHHHcccceeeeecCcccCcchhhccCcceeEecCcCCcCCC-CCCccceeecC-cch--HHHHHHHHHHHHH Confidence 222233 23344445433321 11 111222221111 11111 11222333332 112 2467777777776 Q ss_pred HHHHhhcccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhh-cCCCCCCccceeeEEeecHHHH Q lcl|Aclame:pro 341 NQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIETGLPAL 419 (510) Q Consensus 341 ~~af~~~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~p~~~~~~~~vs~l~~l 419 (510) ..+=. .++... .+.||++.+.+...--..|+.+...+++-+ .+++.++-+ -|.. .++.+++.+-. +.. T Consensus 350 ~~~Ga-~l~~~~-~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al-----~~~l~~~a~w~G~~--~~~~v~i~~n~--dF~ 418 (491) T protein:vir:95 350 IQIGA-QLITPS-QQITAESARIQRGADTSVMATIARNVSQAY-----TDALRWVAMMLGKP--EDSEVEFQLNM--DFF 418 (491) T ss_pred HHHHH-HhccCC-cchhHHHHHHHHHHhhHHHHHHHHHHHHHH-----HHHHHHHHHHcCCC--CCCceEEEeec--ccc Confidence 65421 223333 357999999999999999999888877664 344444322 1222 12233322211 111 Q ss_pred HHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 420 SRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEG 499 (510) Q Consensus 420 ~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~ 499 (510) .+..+.+.+.+++..... + .|....+.. .....||+. ++.|++..+.+.+.- ...+-...+-.+++ T Consensus 419 ~~~~~~~~~~all~~~~~--G------~is~~t~~~-~L~~~~vl~----~~~e~~~~~ie~~~~-~~~~~~~~~~~~~~ 484 (491) T protein:vir:95 419 LQPMTAQDRAAWMADINA--G------LLPATAYYA-ALRKAGVTD----WTDEDILNAIEDAPL-PSGAVTQVAGEIPQ 484 (491) T ss_pred cccCCHHHHHHHHHHHhc--C------CCCHHHHHH-HHHhCCCCC----ccHHHHHHHHHhcCC-CCCccccccccchh Confidence 111112222222222221 1 122222233 223445542 233333322222110 00000000111122 Q ss_pred HHHhhcc Q lcl|Aclame:pro 500 ASDMTNA 506 (510) Q Consensus 500 a~~~~~~ 506 (510) +.++... T Consensus 485 ~~~~~~~ 491 (491) T protein:vir:95 485 AAQQQQE 491 (491) T ss_pred hhhhccC Confidence 2222111 No 100 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=96.98 E-value=0.00023 Score=40.58 Aligned_cols=417 Identities=12% Similarity=0.058 Sum_probs=172.7 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhccc---ccCCCC---CCccccccccccchHHHHHHHHHHHHHHhhcCccCcc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY---LMVDPM---SGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPF 74 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~---~~~~~~---~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~W 74 (510) |.......+=.-.+.....+|+.+.+|.... .+.... ........++..+.+...++..++.|.+ .|. T Consensus 30 ~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g------~~~ 103 (481) T protein:vir:10 30 LKEENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHNYAKYVSRFIVGYLTG------NPI 103 (481) T ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCccccccccccccceeecchHHHHHHHHHhhhcc------CCc Confidence 2222211111111223344566666665432 111110 0011112244555666666666654432 222 Q ss_pred cccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCC-CeEEEEEec Q lcl|Aclame:pro 75 FRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE-ATVVAWSLR 151 (510) Q Consensus 75 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~--~~~~-~~~~~~pl~ 151 (510) .+..++.. .. ..+.+.+..++|.....++.++..++|.+.+++ +++. .++++++.. T Consensus 104 -~~~~~d~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~~i~~~~p~ 162 (481) T protein:vir:10 104 -TITHQDNQ-------------TN-------DKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFEDRDTFKVLDPK 162 (481) T ss_pred -eEecCChh-------------HH-------HHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCeEEEEEEccc Confidence 22233221 11 123344677789999999999999999876544 4432 235667766 Q ss_pred eEEEeeCCC--CceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCee--ec Q lcl|Aclame:pro 152 SYAVRRDAT--GRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVR--VG 227 (510) Q Consensus 152 ~~~v~~d~~--G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~--~~ 227 (510) +.+...|.. +++...+|.++..- .++.....+++|+ ++. .++++.++.. .. T Consensus 163 ~~~~v~d~~~~~~~~~~i~~~~~~~---------------~~~~~~~~~~~y~-----~~~-----i~~~~~~~~~~~~~ 217 (481) T protein:vir:10 163 STFVVYDQTLDKKVVAGVRYFEKQD---------------KDKVPVQHVEVYT-----TDK-----IYYIEIKGGTYHRV 217 (481) T ss_pred ceEEEEcCCCCCceEEEEEEEEEee---------------CCCceEEEEEEEe-----cCe-----EEEEEecCCceeec Confidence 655555543 56665555543210 0111112223332 111 1222222211 11 Q ss_pred cccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCC-cc Q lcl|Aclame:pro 228 ETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEM-GD 306 (510) Q Consensus 228 ~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~-G~ 306 (510) ......+..+|++..+ ++.+|+|-.....+-+..++.+.-.+....+....|.+.+.-......+....... +. T Consensus 218 ~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 292 (481) T protein:vir:10 218 EEVEHYYNDVPIIEYL-----NDQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDAKAFRDANM 292 (481) T ss_pred ccccccCCceeEEEee-----cCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCccchhhhhhccc Confidence 1222233568877544 24679998888999999999888777777787888877664211112222111111 11 Q ss_pred ee-c--------CCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHHHHHHHHHhhhh Q lcl|Aclame:pro 307 YV-P--------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENTLGGT 375 (510) Q Consensus 307 ~~-~--------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv 375 (510) +. + +...+++.+... .+.+.....++.++..|...-.. +. ....+...|+..+..+-.-+... T Consensus 293 ~~~~~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k---- 366 (481) T protein:vir:10 293 IHLEPGTNANGSEGKAEVKYVYKQ--YDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESMKYKLFGLEQV---- 366 (481) T ss_pred eeccccccccCCCCCcceeEEeec--CCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHH---- Confidence 11 1 111223222211 23455566666666666443211 11 11222334655443322211111 Q ss_pred HHHHHHHHHHHHHHHHHH----HHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHH Q lcl|Aclame:pro 376 YSLLAENLQSPLAYVCLS----EVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLP 451 (510) Q Consensus 376 ~~rl~~E~l~Pli~r~~~----il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d 451 (510) .++.+ ..+...+.+++. ++...+........+++.+.-++ +-..+..++.+. .+++. +... T Consensus 367 ~~~~~-~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~-~~~~~~~a~~~~-------kl~g~------is~e 431 (481) T protein:vir:10 367 RAIKE-RLFKKGLMKRYKLLLNNVNLTGLKQHNYAELTITFTPNL-PKSMMESINAFN-------ALSGG------VSES 431 (481) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHhccCCCccccceeeEEeCCCC-CcCHHHHHHHHH-------HHhcc------CChH Confidence 22221 222233333333 33323322222334555553322 112222222221 11121 2112 Q ss_pred HHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 452 KMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 452 ~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) .+++ .++ ++.+ ++|++..++++..+ +. .....+.++.......| T Consensus 432 t~~~----~l~-----~i~d~~~E~~ri~~E~~~~---~~---~~~~~~~~~~~~~~~~~ 476 (481) T protein:vir:10 432 TRLS----LLD-----FIDNPKEELEKMQEEEAQR---EK---QADKRGYGEAFENHLNV 476 (481) T ss_pred HHHH----hCC-----CCCCHHHHHHHHHHHHHHH---Hh---hhhhccCCccCCCCCCC Confidence 2232 222 1222 34444322222111 11 11122223333333333 No 101 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=96.97 E-value=0.00023 Score=40.53 Aligned_cols=431 Identities=13% Similarity=0.051 Sum_probs=180.4 Q ss_pred Ch---------hHHHHHHHHHhcc-CchHHHHHHHHhhcccccCCCCCC-----ccccccc-cccchHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MK---------STAAMLWEKLRDG-SVEQRAIEFAKTTLPYLMVDPMSG-----SRGVVEH-DFQSAGALLVNNLAAKLA 64 (510) Q Consensus 1 ~k---------~~~~~r~~~lkr~-~~~~~w~e~~~~~~P~~~~~~~~~-----~~~~~~~-~~dstg~~a~~~Laa~l~ 64 (510) |- ..+..+|+..++- .=...|++..+-.||..-..+... -..++.+ .|-+.-.+.+ +.|+ T Consensus 1 m~~V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~----~~l~ 76 (501) T protein:vir:95 1 MPNVSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTL----FGLV 76 (501) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHH----HHHh Confidence 32 2234455544321 011234555555566521111111 1111212 2333333444 4444 Q ss_pred HhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC- Q lcl|Aclame:pro 65 RSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA- 143 (510) Q Consensus 65 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~- 143 (510) +.+|- .+-.++++ +.++.++++| -+...+.+.-+..++.+...+|-+.+++|.+.. T Consensus 77 G~vf~---k~p~~~~p--------------~~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~ 133 (501) T protein:vir:95 77 GQVFM---RDPVVKVP--------------ALLNPLVANA------TGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTE 133 (501) T ss_pred hhhhc---CCcceeCc--------------HHHHHHHhcc------CCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCC Confidence 44442 22222222 2244444443 345667888888999999999999889885421 Q ss_pred -----------------eEEEEEeceEE-Ee---eCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEE Q lcl|Aclame:pro 144 -----------------TVVAWSLRSYA-VR---RDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYT 202 (510) Q Consensus 144 -----------------~~~~~pl~~~~-v~---~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~ 202 (510) .+..|+-.+.. +. .|...++.-+..++..+.+. .+|+ .+.++.|. T Consensus 134 ~~~~~t~a~~~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d--~~f~------------~~~~~q~R 199 (501) T protein:vir:95 134 AEGGASIADLEAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAAD--DGFE------------MKTSGQFR 199 (501) T ss_pred CcccccHHHHHhccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecC--CCcc------------cceeEEEE Confidence 14445433321 12 22333555555555554322 3444 23445555 Q ss_pred EEEeecCCCeeEEEEEEeeCC-----------------eeeccccccccccCceEEEeeeecCCCccc--cchHHHHHHH Q lcl|Aclame:pro 203 HVQRRKGTAMDYAEMYHEIDG-----------------VRVGETGRWPIHLCPYIVPTWNLAPGEHYG--RGHVEDYIGD 263 (510) Q Consensus 203 ~v~~~~~~~~~~~sv~~e~~~-----------------~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YG--rgp~~~~l~d 263 (510) ++.+..++ ...+.+|.+-+. ......++ .+.+++|++.|.-..+...+ ..|.. | T Consensus 200 vL~~~~~g-~~~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g--~~~l~~IPfv~~~~~~~~~~~~~pPLl----~ 272 (501) T protein:vir:95 200 VLRLDEEG-YYVHEIWREPQPTKADGSKIPKGNYQQYVVYKPTDAQ--GKRLTEIPFMFIGSENNDSNPDNPNFY----D 272 (501) T ss_pred EEeeCCCc-eEEEEEEEecCCcccCcceecCCcccccceeeeeccC--CCcCCeeeEEEEecCCCCCCCCccchH----H Confidence 55553332 223333332111 11111111 24678888888755555443 33433 4 Q ss_pred HHHHHHH---HHH-HHHHHHHhhCCceeeCCCCccc-------hhhhhcCCCcc-eecCCccccccccCCCccchHHHHH Q lcl|Aclame:pro 264 FAKLSLL---SEK-LGLYELESLEVLNLVDEAKGAV-------VDDYQDAEMGD-YVPGGAEAVRAYERGDYNKMAAIQQ 331 (510) Q Consensus 264 ~~~L~~l---~~~-~l~~~~~a~~~~~lv~~~g~~~-------~~~~~~~~~G~-~~~g~~~~v~~~~~~~~~~~~~~~~ 331 (510) +..||.- +.+ .-..+..+..|...+. |... ...+..+.+.. ..| ...+...++.. +..+ ... T Consensus 273 lA~lni~hy~~ssd~~~~l~~~~~P~l~i~--G~~~~~~~~~~~~~i~~G~~~~~~lP-~~~~~~~ie~~-~~~i--~~~ 346 (501) T protein:vir:95 273 LASLNMAHYRNSADYEESCYIVGQPTPVLI--GLTEEWVTNVLKGSVNFGSRGGIPLP-VGADAKLLQAS-ENTM--LKE 346 (501) T ss_pred HHHHHHHHHhhhhHHHHHHHHcccceeeee--CCcccccccCCCCceeecccccccCC-CCCceeEEecC-hhhH--HHH Confidence 4455433 222 2223344555543332 2111 11122222111 112 11122333321 2233 356 Q ss_pred HHHHHHHHHHHHHhhcccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeE Q lcl|Aclame:pro 332 SLQAVVVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPA 411 (510) Q Consensus 332 ~i~~~~~~I~~af~~~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~ 411 (510) .+++++++++++= ..+++......||++.+.++..-...|+.+..++..-+ .+++.++-+- ....++.+++. T Consensus 347 ~l~~l~~~m~~~G-a~ll~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al-----~~~l~~~a~w--~g~~~~~~~v~ 418 (501) T protein:vir:95 347 AMDTKERQMVALG-AKLVEQKEVQRTATEAELEAASEGSTLSSATKNVSAAF-----EWALKWAARW--VGQADSGVKFE 418 (501) T ss_pred HHHHHHHHHHHHH-HhhccCCccchhHHHHHHHHHHHhHHHHHHHHHHHHHH-----HHHHHHHHHH--cCCCCCceEEE Confidence 6777877776643 23444444557999999999999999999888887663 3344433221 11223334444 Q ss_pred EeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHH-HHHH Q lcl|Aclame:pro 412 IETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQA-AQAQ 490 (510) Q Consensus 412 ~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa-~~~~ 490 (510) +-.-... +..+.+.+.+++.... ...|..+.+...+ ...||+... . +++.++.+.....+- ..+- T Consensus 419 i~~df~~--~~~~~~~~~al~~~~~--------~G~is~~t~~~~L-~~~~v~~~~--~-~~e~e~i~~~~~~~~~~~~~ 484 (501) T protein:vir:95 419 LNTDFDI--ARMTPDERRSLVEEWQ--------KGAITFEEMRTGL-RKAGVATED--D-SKAKEKIAKDTAEAMALATP 484 (501) T ss_pred Eeccccc--ccCCHHHHHHHHHHHh--------CCCCcHHHHHHHH-HhCCCCChh--H-HHHHHHHHhhhcCccccccc Confidence 3221111 1111222222222211 1124444555544 445777421 1 222222111111110 0000 Q ss_pred HHHHHHHHHHHHhhccc Q lcl|Aclame:pro 491 AAQETLLEGASDMTNAL 507 (510) Q Consensus 491 ~a~~~~~~~a~~~~~~~ 507 (510) +.......|..+.++.- T Consensus 485 ~~~~~~~~gg~~~~~~~ 501 (501) T protein:vir:95 485 ANVPGDGSGGDNVGNSE 501 (501) T ss_pred CCCCCCCcccccccCCC Confidence 00000001111111111 No 102 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=96.82 E-value=0.00032 Score=39.75 Aligned_cols=414 Identities=11% Similarity=0.041 Sum_probs=171.3 Q ss_pred ChhHHHHHHHHHhc---------------------------cCchHHHHHHHHhhccc---ccCCCCCC-cccccccccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRD---------------------------GSVEQRAIEFAKTTLPY---LMVDPMSG-SRGVVEHDFQ 49 (510) Q Consensus 1 ~k~~~~~r~~~lkr---------------------------~~~~~~w~e~~~~~~P~---~~~~~~~~-~~~~~~~~~d 49 (510) ++.++..+|+.-+. ..-..+++++.+|..-. ........ ......++.. T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~ 92 (511) T protein:vir:10 13 LRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAH 92 (511) T ss_pred hhhhhhhhhhhhhcCCccCchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeec Confidence 33333333332210 01112333344443321 00001111 1111234455 Q ss_pred chHHHHHHHHHHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHH Q lcl|Aclame:pro 50 SAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLI 129 (510) Q Consensus 50 stg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~ 129 (510) +.+...++..++.|++ -| .+++.+++. +. ..+...+..++|.....++.+++. T Consensus 93 n~~k~Iv~~~~~yl~g--~p-----~~~~~~d~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~~ 145 (511) T protein:vir:10 93 DYASYISDFINGYFLG--NP-----IQYQDDDKD-------------VL-------EAIEAFNDLNDVESHNRSLGLDLS 145 (511) T ss_pred chHHHHHHHHhhhhcc--cC-----ceeecCchH-------------HH-------HHHHHHHhhcCHHHHHHHHHHHHH Confidence 6666666666554432 11 122333221 11 233445677789999999999999 Q ss_pred hhCceEEEE--eCCC-CeEEEEEece-EEEeeCC-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEE Q lcl|Aclame:pro 130 VTGNALLYR--NSDE-ATVVAWSLRS-YAVRRDA-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHV 204 (510) Q Consensus 130 ~~G~~~l~~--~~~~-~~~~~~pl~~-~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v 204 (510) ++|.+.+++ +++. .++.+++..+ |.+.-|. .+++...+|.++....+ ....+.-..+++|+ T Consensus 146 i~G~ay~~vy~dedg~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d------------~~~~~~~~~~~iyt-- 211 (511) T protein:vir:10 146 IYGKAYEIMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID------------KTDEDEVFTVDLFT-- 211 (511) T ss_pred hcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc------------cCccceEEEEEEEe-- Confidence 999876554 4432 2455565555 4444443 35666555555331100 00001111223332 Q ss_pred EeecCCCeeEEEEEEeeCCe------eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 205 QRRKGTAMDYAEMYHEIDGV------RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYE 278 (510) Q Consensus 205 ~~~~~~~~~~~sv~~e~~~~------~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~ 278 (510) ++.-+. |...++. ........++..+|++.++- +.+|.|-.+..++-+..++.+.-...... T Consensus 212 ---~~~i~~----~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~n-----n~~g~gd~e~v~~liDa~d~~~S~~~~~~ 279 (511) T protein:vir:10 212 ---SHGVYR----YLTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYM 279 (511) T ss_pred ---CCcEEE----EEecCCCcccccccccccccccCcceeEEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHH Confidence 111000 1111110 11112223345688877653 45799999999999999998877777777 Q ss_pred HHhhCCceeeCCCCccchhhhhcCCCccee--------c------CCccccccccCCCccchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 279 LESLEVLNLVDEAKGAVVDDYQDAEMGDYV--------P------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF 344 (510) Q Consensus 279 ~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~--------~------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af 344 (510) ....+|.+++.-........+.....+.+. . +...+++.+. ...+.+.....+..++..|...- T Consensus 280 ~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~--~~~~~~~~e~~~~~L~~~I~~~s 357 (511) T protein:vir:10 280 SDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMFT 357 (511) T ss_pred HHhhCceeeeeccccCCchhhccchhccceecccccccccccccCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHh Confidence 777788666532122222222221111111 1 1111222221 22355666777777777775433 Q ss_pred hh-cc-cCCCCCCCCHHHHHHH-------HHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCcc--ceeeEEe Q lcl|Aclame:pro 345 MY-GA-NQRDAERVTAEEVRIT-------AEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITK--QHKPAIE 413 (510) Q Consensus 345 ~~-~~-~~~~~~~vTAtEi~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~--~~~~~~v 413 (510) .. +. ...-+...|+..+..+ +.++...++..+. -++..++.++...+-...+.+ .+++.+. T Consensus 358 ~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~--------~~~~li~~~~~~~~~~~~~~d~~~i~i~f~ 429 (511) T protein:vir:10 358 NTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR--------RRAKLLETILKNTRSIDANKDFNTVRYVYN 429 (511) T ss_pred CCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHhhCCcccccccceeeEEeC Confidence 21 11 1111234577666544 3344444443333 233333444433222222222 4555554 Q ss_pred ecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 414 TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAA 492 (510) Q Consensus 414 s~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a 492 (510) -++. -..+..++.+.. +.+. +....++.. ++ ++.+ ++|++...+++. .+..++.. T Consensus 430 ~~~p-~d~~~~~~~~~k-------l~G~------iS~et~~~~----l~-----~v~d~~~E~~ri~~E~~-~~~~~~~~ 485 (511) T protein:vir:10 430 RNLP-KSLIEELKAYID-------SGGK------ISQTTLMSL----FS-----FFQDPELEVKKIEEDEK-ESIKKAQK 485 (511) T ss_pred CCCC-cCHHHHHHHHHH-------Hhcc------CcHHHHHHh----CC-----CCCCHHHHHHHHHHHHH-HHHHHHhh Confidence 3221 122222222221 1121 111222222 21 2222 344443332221 11111100 Q ss_pred HHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 493 QETLLEGASDMTNALAGV 510 (510) Q Consensus 493 ~~~~~~~a~~~~~~~ag~ 510 (510) ...+...+. T Consensus 486 ---------~~~~~~~~~ 494 (511) T protein:vir:10 486 ---------GIYKDPRDI 494 (511) T ss_pred ---------hcccCCCCC Confidence 011111111 No 103 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=96.81 E-value=0.00033 Score=39.70 Aligned_cols=412 Identities=10% Similarity=0.021 Sum_probs=168.5 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhccc--ccC------CCCC-CccccccccccchHHHHHHHHHHHHHHhhcCcc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY--LMV------DPMS-GSRGVVEHDFQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~--~~~------~~~~-~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~ 71 (510) .++.+.+...+.+ .-..+++.+.+|..-. ... .... ...+...++..+-+...++..++.|.+ .| T Consensus 45 ~~~~i~~~i~~~~--~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g--~p-- 118 (492) T protein:vir:97 45 LEEMIVRYIKQHL--EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVG--KP-- 118 (492) T ss_pred HHHHHHHHHHHHH--HHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHHHhhhhcc--cC-- Confidence 2222233323322 1123445555553321 000 0000 011122245667777777877776543 12 Q ss_pred CcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCC-CeEEEE Q lcl|Aclame:pro 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE-ATVVAW 148 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~--~~~~-~~~~~~ 148 (510) +++..+|+. +.+.| ...+ ..+|.....++.+++.++|.+.+++ +++. .+++++ T Consensus 119 ---~~~~~~d~~-------------~~~~l-------~~~~-~n~~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~~~~~~ 174 (492) T protein:vir:97 119 ---IAFKHTDDE-------------VVKRI-------DEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRV 174 (492) T ss_pred ---ceeccCchH-------------HHHHH-------HHHH-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEE Confidence 123333321 11111 1222 3678889999999999999886544 4332 246666 Q ss_pred Eece-EEEeeC-CCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE--E--EeecCCCeeEEEEEEeeC Q lcl|Aclame:pro 149 SLRS-YAVRRD-ATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH--V--QRRKGTAMDYAEMYHEID 222 (510) Q Consensus 149 pl~~-~~v~~d-~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~--v--~~~~~~~~~~~sv~~e~~ 222 (510) +..+ |++..| ..+++.-.+|.++.. ....+++|+- | +..+++. .......+.+ T Consensus 175 ~p~~~~~i~d~~~~~~~~~~vr~~~~~--------------------~~~~~~~y~~~~v~~~~~~~~~-~~~~~~~~~~ 233 (492) T protein:vir:97 175 PAEQGIPIWTDKEHEELEAFIRMYKLE--------------------NETKVEYWDKVTVNYYVYENGS-LIPDYSNNLE 233 (492) T ss_pred cccceEEEEcCCCCCceEEEEEEEeec--------------------cceeEEEEecCeEEEEEEecCe-eeeccccccc Confidence 6555 444443 457787666665421 1112333331 0 0111110 0000001111 Q ss_pred CeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhh-hc Q lcl|Aclame:pro 223 GVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDY-QD 301 (510) Q Consensus 223 ~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~-~~ 301 (510) ...+ .....++..+|++..+. +..|+|=.+..++-+..++.+.-.+.........|.+++.-......... .. T Consensus 234 ~~~~-~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~ 307 (492) T protein:vir:97 234 NSKT-HFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRL 307 (492) T ss_pred cccc-ccccCCCCCcceEEecC-----CCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhHHHH Confidence 1111 11222345688876654 45799999999999999998888888888888888766531111111111 11 Q ss_pred -CCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHH Q lcl|Aclame:pro 302 -AEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENTLGGTYSL 378 (510) Q Consensus 302 -~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~r 378 (510) ...+.+.-+...++..+... .+.......++.+++.|...-.. +. ...-+...|+.-+...-.- +........+ T Consensus 308 ~~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~-l~~ka~~~~~ 384 (492) T protein:vir:97 308 LRYYGAIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTN-LNLKADKLAR 384 (492) T ss_pred HhhccceecCCCCcceeEecc--CCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHH-HHHHHHHHHH Confidence 11122222223344443322 35566677778887777654322 11 1122233455433222111 1111122222 Q ss_pred HHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEe--ecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHH Q lcl|Aclame:pro 379 LAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDT 456 (510) Q Consensus 379 l~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~v--s~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~ 456 (510) .-.+.+..++..++.++. +. .....+++.+. .+.+.+.. ++ .+..+++. +....++ T Consensus 385 ~f~~~l~~~~~li~~~~~---~~-~~~~~i~v~f~~~~p~~~~e~---a~-------~~~kl~G~------iS~et~l-- 442 (492) T protein:vir:97 385 KAKVAIQELLWFVFEHFD---IK-GEHKDVDISFNYNKVANTELQ---VQ-------TAQQSMGI------VSHETVL-- 442 (492) T ss_pred HHHHHHHHHHHHHHHHhc---CC-cccceeeEEecCCCCCCHHHH---HH-------HHHHHhcc------CchHHHH-- Confidence 222222222222233221 11 12223444442 22222222 11 12222221 2112222 Q ss_pred HHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 457 IWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 457 ~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ..++ ++.+ ++|+++..+++.. +. ++ .+ .. .-.++-.|- T Consensus 443 --~~l~-----~v~d~~~Eleri~~E~~~-~~-~~-~~--~~----~~~~~~~~~ 481 (492) T protein:vir:97 443 --ENHP-----FVEDLQAELERIEQEQTE-YN-KQ-LP--NL----DDGGADSAQ 481 (492) T ss_pred --HhCC-----CCCCHHHHHHHHHHHHHH-HH-Hh-hh--cc----ccCCCCCCc Confidence 2222 1222 3444433322211 11 11 00 00 001111111 No 104 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=96.74 E-value=0.00037 Score=39.40 Aligned_cols=425 Identities=10% Similarity=-0.038 Sum_probs=173.9 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhccc---ccCCCCCC-ccccccccccchHHHHHHHHHHHHHHhhcCccCcccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY---LMVDPMSG-SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~---~~~~~~~~-~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~ 76 (510) -.+.+.+..+.- +.....+++++.+|.... ........ ......++..+.+...++..++.|.+- | ++ T Consensus 40 ~~~~i~~~i~~~-~~~~~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~--p-----~~ 111 (501) T protein:vir:96 40 NWELLKNFINHH-KLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGN--P-----IR 111 (501) T ss_pred hHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCCcccCccccCccccccceeecchHHHHHHHHhhhhccc--C-----ee Confidence 000111111111 111123455555554432 11111111 112223556777777777777655531 1 12 Q ss_pred cCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCC-CeEEEEEeceE Q lcl|Aclame:pro 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDE-ATVVAWSLRSY 153 (510) Q Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~--~~~~-~~~~~~pl~~~ 153 (510) +...+.. +.+.+.. .+...+..++|.....++.++..++|.+.+++ +++. .++.+++..+. T Consensus 112 ~~~~~~~---------~~~~~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~ 175 (501) T protein:vir:96 112 VEYDDND---------DNSQNDD-------AIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLET 175 (501) T ss_pred EeeCCcc---------chhHHHH-------HHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEcccee Confidence 2332211 1122333 34445778899999999999999999987655 4332 24666665554 Q ss_pred EEeeCC--CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCC-eeecccc Q lcl|Aclame:pro 154 AVRRDA--TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDG-VRVGETG 230 (510) Q Consensus 154 ~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~-~~~~~~~ 230 (510) ++..|. .|++.-.+|.+..... ......++||+ + +. .+++..++ ....... T Consensus 176 ~~v~d~~~~~~~~~~v~~~~~~~~----------------~~~~~~~~vyt---~--~~-----i~~~~~~~~~~~~~~~ 229 (501) T protein:vir:96 176 FVIYDNSLEDNSIAAVRYYNRGTL----------------QSAKDVVEIYT---D--EH-----IYTLDASDDFNEISVT 229 (501) T ss_pred EEEEcCCCCCceEEEEEEEEeecC----------------CCcEEEEEEEc---C--Cc-----EEEEeeCCCceecccc Confidence 444443 3667655555432111 01111223332 1 11 11222222 1111122 Q ss_pred ccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccc-hhhh-hcCCCccee Q lcl|Aclame:pro 231 RWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAV-VDDY-QDAEMGDYV 308 (510) Q Consensus 231 ~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~-~~~~-~~~~~G~~~ 308 (510) ...+..+|++..+ ++..|+|-....++-+..++.+.-...........|.+.+.-..... .... .....+.+. T Consensus 230 ~~~~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~~~~~~~ 304 (501) T protein:vir:96 230 THAFGTVPITEYL-----NNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQ 304 (501) T ss_pred ccCCCccceEEec-----CCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchhhhhhcCeee Confidence 2234578877543 45689999999999999999988888888888888877653111100 0000 001112211 Q ss_pred c-------CCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHH Q lcl|Aclame:pro 309 P-------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENTLGGTYSLL 379 (510) Q Consensus 309 ~-------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl 379 (510) . |....+.+-.+....+.......++.+++.|...-.. +. ....+...|+..+..+-.-+. ...-...+. T Consensus 305 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~-~ka~~~~~~ 383 (501) T protein:vir:96 305 LKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKLFGLD-QDRVDTQSQ 383 (501) T ss_pred ecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHH-HHHHHHHHH Confidence 1 1111111111112234455566666666666443221 11 111234456666543322111 111222222 Q ss_pred HHHHHHHHHHHHHHHHhhcCC-CCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHH Q lcl|Aclame:pro 380 AENLQSPLAYVCLSEVDDALL-QGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIW 458 (510) Q Consensus 380 ~~E~l~Pli~r~~~il~~~~l-~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a 458 (510) -.+-+.-+++.++.++...+- .......+++.+...+ +-..+..++.+ ..+++. |....++.. T Consensus 384 ~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~-p~n~~e~ad~~-------~kl~g~------iS~et~~~~-- 447 (501) T protein:vir:96 384 FTKGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNL-PKSLNEQVSIL-------TGLGGQ------VSQETALSL-- 447 (501) T ss_pred HHHHHHHHHHHHHHHHHhcccccccccccceEEeCCCC-CcCHHHHHHHH-------HHHhcc------CchHHHHHh-- Confidence 222233333334444433221 1222234555554322 21222222221 111221 222222222 Q ss_pred HHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-cccCCC Q lcl|Aclame:pro 459 AAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMT-NALAGV 510 (510) Q Consensus 459 ~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~-~~~ag~ 510 (510) ++ ++.+ ++|++...+ +++... ... ......+.. ...-.+ T Consensus 448 --l~-----~v~D~~~E~~ri~~-E~~~~~-~~~----~~~~~~~~~~~~~~~~ 488 (501) T protein:vir:96 448 --SG-----LVESPNEELDKINK-EMSEID-FKG----YSNDFNEHVGKYTDEV 488 (501) T ss_pred --CC-----CCCCHHHHHHHHHH-HHHHhh-ccc----cccchhhcccccCCcC Confidence 21 2222 233333222 211110 000 011111111 111111 No 105 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=96.66 E-value=0.00043 Score=39.06 Aligned_cols=420 Identities=10% Similarity=-0.009 Sum_probs=176.9 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccc---cC--------CCC--CCccccccccccchHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYL---MV--------DPM--SGSRGVVEHDFQSAGALLVNNLAAKLAR 65 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~---~~--------~~~--~~~~~~~~~~~dstg~~a~~~Laa~l~~ 65 (510) -.+++.+.-+... +.....+.+.+.+|..-.- .. ... ...+....++-.+.+..-++..++.|.+ T Consensus 2 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~G 81 (470) T protein:vir:10 2 ELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVAS 81 (470) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhheec Confidence 3333444434432 2223345555666544310 00 000 0011112244455555555555544433 Q ss_pred hhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC- Q lcl|Aclame:pro 66 SLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDE- 142 (510) Q Consensus 66 ~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l--~~~~~~- 142 (510) -| ..++.+++.. .+.+.++ +. .+|...+.++.++...+|.+.+ |.+++. T Consensus 82 --~p-----~~~~~~d~~~---------~~~l~~~-----------~~-~~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~ 133 (470) T protein:vir:10 82 --VF-----PDIDVGKDAD---------NKKIIDV-----------LG-DDRALTLNGLLVDSSNAGRAWLHYWIDEDGN 133 (470) T ss_pred --cc-----eeeecCchHH---------HHHHHHH-----------Hh-hhHHHHHHHHHHHHhhcCeeEEEEEecCCCc Confidence 12 1233333211 1122222 32 3677788888899999998764 454442 Q ss_pred CeEEEEEeceEEEeeC-C-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEE--EE-Eee-cCCC-e--- Q lcl|Aclame:pro 143 ATVVAWSLRSYAVRRD-A-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYT--HV-QRR-KGTA-M--- 212 (510) Q Consensus 143 ~~~~~~pl~~~~v~~d-~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~--~v-~~~-~~~~-~--- 212 (510) .++.+++..+.++..| . .|++..++|.+...-. ........+++|+ .+ +.+ .+.+ . T Consensus 134 ~~~~~~~p~~~~~v~d~~~~~~~~a~ir~y~~~~~--------------~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~ 199 (470) T protein:vir:10 134 FRYGIIQPDQITPIYATTLDNKLLGILRSYKQLDP--------------DSGKYFTVHEYWTDKEAQFFRTNATDSTVIE 199 (470) T ss_pred eEEEEEcccceEEEEcCCCCCceEEEEEEEEeeec--------------CCceEEEEEEEEcCCcEEEEEeecCcceecc Confidence 2455566555444444 3 4777666666543110 0011112233333 11 111 1100 0 Q ss_pred ---eEEEEEE--eeCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCcee Q lcl|Aclame:pro 213 ---DYAEMYH--EIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNL 287 (510) Q Consensus 213 ---~~~sv~~--e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~l 287 (510) .+.+... ..++..+. ....++..+|++..+= +.+|.|=.+...+-+..++.+.-..........+|.++ T Consensus 200 ~~~~~~~~~~~~~~~~~~~~-~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv 273 (470) T protein:vir:10 200 PYNIITSYDLSAGYETGQSN-TLKHNFGRVPFIEFSK-----NKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILV 273 (470) T ss_pred cccccccccccccccccccc-ccccCCCeeeEEEeec-----CCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCccee Confidence 0000000 00111111 1112234577765552 46899999999999999999999999999999999888 Q ss_pred eCCCCccc-hhhhhcCC-Ccce-ecC----CccccccccCCCccchHHHHHHHHHHHHHHHHHHh-hcccCCCCCCCCHH Q lcl|Aclame:pro 288 VDEAKGAV-VDDYQDAE-MGDY-VPG----GAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM-YGANQRDAERVTAE 359 (510) Q Consensus 288 v~~~g~~~-~~~~~~~~-~G~~-~~g----~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~-~~~~~~~~~~vTAt 359 (510) +.-.+..+ .+...... .|.+ ++. ...++..+. ...+.......++.+++.|.+.-. .+...-.....|+. T Consensus 274 l~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~lt--~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~gn~Sg~ 351 (470) T protein:vir:10 274 LTNYGGADLHQFMNDLRKYKSIKINNTGNGDNSGVDKLQ--IDIPVEARDDALKITRKNIFLFGQGIDPANFESSNASGV 351 (470) T ss_pred eecCCccccchhhhhhhhcCeEeccCCCCCcCceeEEEe--ecCChHHHHHHHHHHHHHHHHHhCCCCCCccccccchHH Confidence 75322222 12222111 1222 221 122333333 234667778888888888865432 22222122345666 Q ss_pred HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhh-cCCCCCCccceeeEEeec--HHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 360 EVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIETG--LPALSRSAAVQSMLNASQVIA 436 (510) Q Consensus 360 Ei~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~p~~~~~~~~vs~--l~~l~r~~~~~~~~~~~q~~~ 436 (510) .+..+-.-+... ..+.... +.+.+.+++.++.. -++-......+.+.+... .+.+..+ +.+. T Consensus 352 Alk~~~~~l~~k----~~~~~~~-~~~~l~~~~~~i~~~l~~~~~d~~~i~i~f~~~~p~d~~e~~----------~~~~ 416 (470) T protein:vir:10 352 AIKMLYSHLELK----AAKTQTY-FEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKA----------QIVS 416 (470) T ss_pred HHHHHHHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHhcccCcccceeeEEeccCCCCCHHHHH----------HHHH Confidence 554332111111 1122221 22223333332211 111122233455555433 2222222 2222 Q ss_pred hhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 437 GLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 437 ~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) .+++ .+.-..++. .++ ++.+ ++|+++..++++. + +.. ...+. .....|| T Consensus 417 ~~~g------~iS~et~l~----~~p-----~v~D~~~E~eri~~E~~e-~-~~~------~~~~~--~~~~~~~ 466 (470) T protein:vir:10 417 TVAN------YSSKEAVAK----ANP-----IVDDWQQELKDLAKDKEE-N-DPY------SNQAD--ELNGKGV 466 (470) T ss_pred HHhc------cCcHHHHHH----hCC-----CCCCHHHHHHHHHHHHHH-H-HHh------hcccc--ccCCCCC Confidence 2222 122223332 222 2223 3344332222111 1 110 11111 1233566 No 106 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=96.66 E-value=0.00043 Score=39.05 Aligned_cols=413 Identities=10% Similarity=-0.006 Sum_probs=177.0 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhccc--ccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY--LMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~--~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) -++.+.+..++... -..+++++.+|..-. ..............++..+.+...++.+++.|++ -| ++++ T Consensus 18 ~~~~l~~~i~~~~~--~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g--~~-----~~~~ 88 (453) T protein:vir:39 18 TNEVVTKFMEKHRL--EVARYEYLKNMYRGIMAIDAEPTKDLWKPDNRLTVNFTKYIVDTFTGYFNG--IP-----VKKS 88 (453) T ss_pred CHHHHHHHHHHHHH--HHHHHHHHHHHhhccCchhcCCCccccCccceeecchHHHHHHHHhhhhcc--cC-----ceec Confidence 23333333333321 123444444443321 0000111111222345566777777777776643 11 2223 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC-C--eEEEEEece-EE Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-A--TVVAWSLRS-YA 154 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~-~--~~~~~pl~~-~~ 154 (510) .+++. .. ..+.+.+..++|.....++.++..++|.+.+++..+. + ++++++..+ ++ T Consensus 89 ~~d~~-------------~~-------~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~ 148 (453) T protein:vir:39 89 HSDKE-------------TL-------SKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEETQTNVIYNTPENMFM 148 (453) T ss_pred cCChH-------------HH-------HHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEE Confidence 33221 11 2344557778999999999999999999876654432 2 356666544 45 Q ss_pred EeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCC--eeecccccc Q lcl|Aclame:pro 155 VRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDG--VRVGETGRW 232 (510) Q Consensus 155 v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~--~~~~~~~~y 232 (510) +.-|..++....+.++... .+....+++|+ ++. .++++.++ -.+...... T Consensus 149 v~d~~~~~~~~~~ir~~~~------------------~~~~~~~~~yt-----~~~-----i~~~~~~~~~~~~~~~~~~ 200 (453) T protein:vir:39 149 VYDDTIKQEPLFAVRYGYD------------------DDYKLYGEVYT-----KET-----TYALNGTMGFYNMTEQAPN 200 (453) T ss_pred EecCCCCCeEEEEEEEEEe------------------CCeEEEEEEEe-----CCe-----EEEEEecCCceeeeccccc Confidence 5545555544444443211 01112233332 111 01112121 112222222 Q ss_pred ccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCC-Ccce-ecC Q lcl|Aclame:pro 233 PIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE-MGDY-VPG 310 (510) Q Consensus 233 ~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~-~G~~-~~g 310 (510) ++..+|++..+. +.+|+|=.+...+-+..++.+.-......+....|.+++.- ..+..+.+.... ++.+ +++ T Consensus 201 ~~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g-~~~~~~~~~~~~~~~~~~~~~ 274 (453) T protein:vir:39 201 PFDDLPVVEFYF-----NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLG-AAVEEEDLKNIRSNRVINYYG 274 (453) T ss_pred CCCceeEEEecC-----CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeec-CCCCchhhhhhhhcceeeecC Confidence 345788877653 45799999999999999999998998888888888766631 112222222211 1222 121 Q ss_pred -----CccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHH Q lcl|Aclame:pro 311 -----GAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQ 384 (510) Q Consensus 311 -----~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l 384 (510) ...+++.+.. ..+.+.....++.++..|...-.. +.....-.+.|+.-+..+-.-+... .--..+.-.+.+ T Consensus 275 ~~~~~~~~~~~~lt~--~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k-a~~~~~~~~~~l 351 (453) T protein:vir:39 275 ESSEAKNVDVKFLEK--PDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNL-ALSFQRKFQSSL 351 (453) T ss_pred CCCCCCCCceeEEee--cCCHHHHHHHHHHHHHHHHHHhCCcccccccccCChHHHHHHHHHHHHHH-HHHHHHHHHHHH Confidence 1223333332 245677777888888777553321 1111111234555443322211111 112222222233 Q ss_pred HHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCC Q lcl|Aclame:pro 385 SPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVD 464 (510) Q Consensus 385 ~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp 464 (510) ..++.-+..++...+. ......+.+.+.-.+ +-..++.++. +..++++ +....++ ..+| T Consensus 352 ~~~~~li~~~~~~~~~-~~~~~~i~v~f~~~~-p~~~~~~a~~-------~~kl~g~------is~et~l----~~l~-- 410 (453) T protein:vir:39 352 NSRYKLYCELSTNVSN-KEAWKDIEYTFTRNE-PKDIKEQAET-------ANILMGI------TSQETAL----SVIS-- 410 (453) T ss_pred HHHHHHHHHHHhccCC-ccccccceEEeCCCC-CcCHHHHHHH-------HHHHhcc------CChHHHH----HhCC-- Confidence 3333333344333222 112224455543222 1112222222 1122221 2222223 2232 Q ss_pred HhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 465 TSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 465 ~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ++.+ ++|++... +++.... +. . ........+..... T Consensus 411 ---~v~D~~~E~~ri~-~E~~~~~-~~-~----~~~~~~~~~~~~~~ 447 (453) T protein:vir:39 411 ---VIPDVQAEMEKIK-KEEASTA-IF-D----KDKQPSEKGTDTVV 447 (453) T ss_pred ---CCCCHHHHHHHHH-HHHHHHH-HH-H----HhccCCCCCCCCCC Confidence 1222 34443322 2211111 10 0 00111111222222 No 107 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=96.54 E-value=0.00053 Score=38.55 Aligned_cols=405 Identities=10% Similarity=0.038 Sum_probs=170.1 Q ss_pred ChhHHH-HHHHHHhccCchHHHHHHHHhhccc-----ccC-CCCC---CccccccccccchHHHHHHHHHHHHHHhhcCc Q lcl|Aclame:pro 1 MKSTAA-MLWEKLRDGSVEQRAIEFAKTTLPY-----LMV-DPMS---GSRGVVEHDFQSAGALLVNNLAAKLARSLFPT 70 (510) Q Consensus 1 ~k~~~~-~r~~~lkr~~~~~~w~e~~~~~~P~-----~~~-~~~~---~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp 70 (510) ..+.+. +.....+ .-..+++.+.+|.... +.. .... .......++..+.+...+++.++.|.+ -|+ T Consensus 26 ~~~~~i~~~i~~~~--~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g--~~~ 101 (478) T protein:vir:10 26 TQEEMILRLVREHK--ENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVA--NPV 101 (478) T ss_pred CcHHHHHHHHHHHH--HHHHHHHHHHHHhcCCCchhccccccccccccccccccceeccchHHHHHHHHHhhhcc--CCe Confidence 111111 1112221 1123455555554432 100 0000 011111234556667777777766543 111 Q ss_pred cCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC-C--eEEE Q lcl|Aclame:pro 71 GIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-A--TVVA 147 (510) Q Consensus 71 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~-~--~~~~ 147 (510) +++.+++. ..+. +...+ ..+|.....++.++..++|.+.+++..+. + ++.+ T Consensus 102 -----~~~~~~d~-------------~~~~-------l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~~~~~ 155 (478) T protein:vir:10 102 -----TFGVDNDK-------------ALKQ-------IQHTL-NHKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFR 155 (478) T ss_pred -----eeecCChH-------------HHHH-------HHHHH-hcCHHHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEE Confidence 12333321 1111 11223 35889999999999999999875543332 3 3455 Q ss_pred EEece-EEEeeC-CCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE--E-EeecCCCeeEEEEEEeeC Q lcl|Aclame:pro 148 WSLRS-YAVRRD-ATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH--V-QRRKGTAMDYAEMYHEID 222 (510) Q Consensus 148 ~pl~~-~~v~~d-~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~--v-~~~~~~~~~~~sv~~e~~ 222 (510) ++-.+ |.+..| ..|++.-.+|.++.. ..+.+++|+. | +.+..++........... T Consensus 156 ~~p~~~~~i~d~~~~~~~~~~v~~~~~~--------------------~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~ 215 (478) T protein:vir:10 156 VPAEQAVPIWTNKERDELQAFIRVYELD--------------------GAERVEYWTKDDVTYYELKEGQLIPDFYRSDD 215 (478) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEec--------------------CceEEEEEeCCeEEEEEEcCCeeecccccccc Confidence 55444 444443 357787777666421 1122344331 1 111111222211111111 Q ss_pred Cee---eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhh- Q lcl|Aclame:pro 223 GVR---VGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDD- 298 (510) Q Consensus 223 ~~~---~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~- 298 (510) +.. ......+++..+|++.++. ..+|+|=.....+-+..++.+.-.+....+....|.+++.-.+...... T Consensus 216 ~~~~~~~~~~~~~~~~~vPvv~~~n-----~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~ 290 (478) T protein:vir:10 216 HIQPHYYQGNKLMSWGRVPFIPFKN-----NPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDF 290 (478) T ss_pred ccccceecccccccCCccceEEecc-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchh Confidence 111 1111223445788877654 5689999999999999999888888888888888876653111111111 Q ss_pred hhcC-CCcce-ecCC-ccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHH-------HH Q lcl|Aclame:pro 299 YQDA-EMGDY-VPGG-AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRIT-------AE 366 (510) Q Consensus 299 ~~~~-~~G~~-~~g~-~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r-------~~ 366 (510) .... ..+.+ ++|. ..+++.+.. ..+.......++.++..|...-.. +. ....+...|+..+..+ +. T Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~ 368 (478) T protein:vir:10 291 MHNLKYYKAISVAGESGSGVDTIKV--EVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKAN 368 (478) T ss_pred hhhhhhcceEEecCCCCCcceEEee--cCChHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHH Confidence 1111 11222 2222 123433322 235666677777777777554321 11 1122344566655433 23 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEee--cHHHHHHHHHHHHHHHHHHHHHhhcChHhH Q lcl|Aclame:pro 367 EAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIET--GLPALSRSAAVQSMLNASQVIAGLAPIAQL 444 (510) Q Consensus 367 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs--~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~ 444 (510) ++...++..+.+ ++..++.++. .......+.+.+.- +.+.+..++ .+..+++. T Consensus 369 ~~~~~~~~~l~~--------~~~li~~~~g----~~~~~~~i~i~f~~~~p~d~~e~a~----------~~~kl~g~--- 423 (478) T protein:vir:10 369 KLKNKTLTALQE--------LLQYIIDFYR----LDVKVQDIEITFNFNVMVNELENSQ----------IAMNSTGL--- 423 (478) T ss_pred HHHHHHHHHHHH--------HHHHHHHHhC----CCcccccceEEecCCCCCCHHHHHH----------HHHHHhCC--- Confidence 333333333222 2222223221 11222334554432 222222222 12222221 Q ss_pred hhcCCHHHHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 445 DPRISLPKMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 445 ~~~id~d~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) +....+++ .+| ++.+ ++|++...+++ +..++ .+.+......+- T Consensus 424 ---iS~et~~~----~l~-----~v~D~~~E~~ri~~E~---~~~~~--------~~~~~~~~~~~~ 467 (478) T protein:vir:10 424 ---LSKETILS----NHA-----WVEDPVAEMERIEQEN---IELNQ--------QLPDIEEGLNGE 467 (478) T ss_pred ---CChHHHHH----hCC-----CCCCHHHHHHHHHHHH---HHHHh--------hccccccccCCC Confidence 22233333 232 2222 23333222111 11111 111211111111 No 108 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=96.49 E-value=0.00058 Score=38.35 Aligned_cols=444 Identities=11% Similarity=-0.029 Sum_probs=184.4 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhccc--------ccCC-CCCCc---cccccccccchHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY--------LMVD-PMSGS---RGVVEHDFQSAGALLVNNLAAKLARSLF 68 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~--------~~~~-~~~~~---~~~~~~~~dstg~~a~~~Laa~l~~~lt 68 (510) +++-+.+.+.....+.-..+-+.+.+|..-. +... .+... .....++..+.+...++..++.|++. T Consensus 13 ~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~Ivd~~~~yl~G~-- 90 (537) T protein:vir:78 13 LGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTELVDQLAQYLLSN-- 90 (537) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHHHHHHhhhhccc-- Confidence 2222222222221111123334444443321 0000 00000 01123456667777777777776543 Q ss_pred CccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE--EEEeCCCC-eE Q lcl|Aclame:pro 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--LYRNSDEA-TV 145 (510) Q Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~--l~~~~~~~-~~ 145 (510) |+. ++..+.. .+++.. .+...+ ..+|.....++.+++..+|.+. +|.+++.. ++ T Consensus 91 Pv~-----~~~~d~~----------~~e~~~-------~l~~~~-~~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~~~ 147 (537) T protein:vir:78 91 GVE-----VKVKDED----------NTQLDE-------ILQEYF-DEDFQATIDTLVTNASKKGFEGIFARTTSEGKLKF 147 (537) T ss_pred Cce-----eecCcch----------hHHHHH-------HHHHHh-hccHHHHHHHHHHHHhhcCeeEEEeeecCCCceEE Confidence 322 2222211 112222 222223 4678888899999999999875 45555532 45 Q ss_pred EEEEeceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE--E--EeecCCCe--------- Q lcl|Aclame:pro 146 VAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH--V--QRRKGTAM--------- 212 (510) Q Consensus 146 ~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~--v--~~~~~~~~--------- 212 (510) ..++-.+.+..-|..|...-++|.+.....+-... ..+.-..+++|+. | +...+.+. T Consensus 148 ~~i~p~~~~pv~d~~~~~~~~~~~y~~~~~~~~~~----------~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~ 217 (537) T protein:vir:78 148 QTVDGLTLIPVFDDYGVLKMIIRWYSEIRYSTKQQ----------STETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEA 217 (537) T ss_pred EEEccceeEEEEcCCCCceeEEEEEeeeecccccc----------CcceEEEEEEEcCCcEEEEEecCCccccccccccc Confidence 55665665555677888887777775542221100 1111223344431 1 11111100 Q ss_pred ----eEEEEEEeeCCe-------eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 213 ----DYAEMYHEIDGV-------RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELES 281 (510) Q Consensus 213 ----~~~sv~~e~~~~-------~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a 281 (510) |.-.++...++. ........++..+|++.++= +.+|.|=.++..+-+-.++.+.-......+.. T Consensus 218 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~n-----n~~~~sd~e~v~~LiDayd~~~S~~an~~~~~ 292 (537) T protein:vir:78 218 YNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYN-----NKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDF 292 (537) T ss_pred ccccccceeeeccccccccccccccccccccCCcceeEEEecc-----CccCCCchhhhHHHHHHHHHHHHhhhhHHHHh Confidence 111111100000 01111222345678776654 45799999999999999999988888888888 Q ss_pred hCCceeeCCCCccchhh-hhcC-CCcce-ecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cccCCCCCCCC Q lcl|Aclame:pro 282 LEVLNLVDEAKGAVVDD-YQDA-EMGDY-VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVT 357 (510) Q Consensus 282 ~~~~~lv~~~g~~~~~~-~~~~-~~G~~-~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~~~vT 357 (510) .+|.+.+.-.+...... .... ..|.+ +.|...++..+.. ..+.......++.+++.|.+.-+. +......+..| T Consensus 293 ~~~ilvi~g~~~~~~~~~~~~l~~~~~i~v~~d~~~v~~l~~--~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~~~gn~S 370 (537) T protein:vir:78 293 SEAIYVVKGFSGDSTDKLRQNIKAKKMIGVNGDNAGMEIQTV--SIPYEARKAKMDIDVENIYRSGMGFNSTAVGDGNVT 370 (537) T ss_pred cCceeeeecCCCccchhHHHHHhhcCceeecCCCCceeEEEe--cCCHHHHHHHHHHHHHHHHHhcCCCCCccccccCCc Confidence 89877764222222111 1111 22333 3444455655433 346777788888888888654321 22222334445 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHH--HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 358 AEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLP--ALSRSAAVQSMLNASQVI 435 (510) Q Consensus 358 AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~--~l~r~~~~~~~~~~~q~~ 435 (510) ..-+..+-.-+ .+-....++.-.+.+.-++..++.++...+........+++.+.-.+. ....++- +..+ T Consensus 371 GvAlk~~~~~l-~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~-------~~~l 442 (537) T protein:vir:78 371 NVVIKSRYTLL-AMKARKMETSLRKVLRWCADMVVSDIALRGLGEYDSNDICFEIEPHVLANELDIATT-------RKTE 442 (537) T ss_pred HHHHHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccceeeEEeccCCCCCHHHHHHH-------HHHH Confidence 54332221111 111122233323333333333444443333333334456666554332 1111111 1111 Q ss_pred HhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hcccCCC Q lcl|Aclame:pro 436 AGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDM-TNALAGV 510 (510) Q Consensus 436 ~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~-~~~~ag~ 510 (510) .+.+.+. -..++ ..++ ++.++++...+.++..++......+-....++..+. .....++ T Consensus 443 ~~~giiS-------~eT~l----~~~p-----~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (537) T protein:vir:78 443 AETEALK-------IGNIM----TVAP-----RIGDDETLKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAML 502 (537) T ss_pred HhcCcch-------HHHHH----HhCC-----CCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhc Confidence 1111111 11111 1111 222222222111111111100000000000000000 0011111 No 109 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=96.35 E-value=0.00072 Score=37.83 Aligned_cols=404 Identities=10% Similarity=0.017 Sum_probs=171.0 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhccc--ccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY--LMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~--~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) -++.+.+..+++++. ..+.+.+.+|..-. ..............++-.+.+...++..++.|++ -| +.++ T Consensus 2 ~~~~l~~~i~~~~~~--~~r~~~l~~yy~g~~~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g--~~-----~~~~ 72 (429) T protein:vir:98 2 TKDLLSELIQKHRSF--NLSYSAYKQLYEGDHAILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYFIG--VP-----VQTS 72 (429) T ss_pred CHHHHHHHHHHHHHH--HHHHHHHHHHhccccccccccccccCCCcceeecchHHHHHHHHhhhhcc--cC-----ceee Confidence 233333333444311 13333333332211 0000001111122355566777777777766643 11 2233 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC-C--eEEEEEece-EE Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-A--TVVAWSLRS-YA 154 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~-~--~~~~~pl~~-~~ 154 (510) .+++ ++.+ .+...+..++|.....++.++..++|.+.+++..+. + ++++++..+ |. T Consensus 73 ~~~~-------------~~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~~~~~~p~~~~~ 132 (429) T protein:vir:98 73 HENK-------------QVSN-------YLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDENAEAGITYLTPLEAFI 132 (429) T ss_pred cCCh-------------HHHH-------HHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEEEEEEcccceEE Confidence 3321 1222 233346667899999999999999999876554332 3 355665444 44 Q ss_pred EeeCCC-CceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeC-Ceeecccccc Q lcl|Aclame:pro 155 VRRDAT-GRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEID-GVRVGETGRW 232 (510) Q Consensus 155 v~~d~~-G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~-~~~~~~~~~y 232 (510) +.-|.. +++...+|.+. . ++. +++.++..++.-. +|...+ +..+...... T Consensus 133 v~dd~~~~~~~~~i~~~~-~------------------~~~-----~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~ 184 (429) T protein:vir:98 133 VYDDSIRQKPLFAVRYFY-N------------------KGG-----VLEGSYSDASNIT----YFKDGEKGIEIGESEPH 184 (429) T ss_pred EEeCCCCCceEEEEEEEE-e------------------cCc-----eEEEEEEeCceEE----EEEecCCceEecccccc Confidence 444433 44444444331 0 000 1122222222111 111111 1222222233 Q ss_pred ccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCC-ccee-cC Q lcl|Aclame:pro 233 PIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEM-GDYV-PG 310 (510) Q Consensus 233 ~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~-G~~~-~g 310 (510) ++..+|++..+ ++.+|+|=.+..++-+..++.+.-......+....|.+.+.- .....+....... +.+. ++ T Consensus 185 ~~g~vPvv~~~-----n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g-~~~~~~~~~~~~~~~~~~~~~ 258 (429) T protein:vir:98 185 PFDGVPMIEYV-----ENEERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILG-AELDDETLKSLRDTRIINLKD 258 (429) T ss_pred cCCccceEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec-CCCCcchhhhHhhCceeeccC Confidence 44678887643 456899999999999999999998888888888888766642 1112222222211 2222 21 Q ss_pred C---ccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cccCCCCCCCCHHHHHH-------HHHHHHHHhhhhHHHH Q lcl|Aclame:pro 311 G---AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVRI-------TAEEAENTLGGTYSLL 379 (510) Q Consensus 311 ~---~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~~~vTAtEi~~-------r~~E~~~~LGpv~~rl 379 (510) . ..+++.+. ...+.+.....++.+.+.|...-.. +.........|+.-+.. +++++...++..+. T Consensus 259 ~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-- 334 (429) T protein:vir:98 259 TDAQQLTVEFLQ--KPDADATQEHLLDRLENLIFRTAMVANISDESFGTASGIALRYRLQAMDNLAKTKERKFMSGMN-- 334 (429) T ss_pred CCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCccccCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Confidence 1 11233332 2245666777778888777554322 11111112346555433 33333333333322 Q ss_pred HHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHH Q lcl|Aclame:pro 380 AENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWA 459 (510) Q Consensus 380 ~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~ 459 (510) -++..+..++...+. +.....+.+.+.-.+ +-.-++.++.+. .+++. +..+.++ . T Consensus 335 ------~~~~li~~~~~~~~~-~~d~~~i~v~f~~~~-p~~~~~~a~~~~-------kl~g~------is~et~~----~ 389 (429) T protein:vir:98 335 ------RRYKLIASYPTSKIG-PKDWIGIKYKFTRNL-PANLLEESQIAG-------NLAGI------VSEETQV----G 389 (429) T ss_pred ------HHHHHHHHHhccCCC-ccccccceEEeCCCC-CcCHHHHHHHHH-------HHhcc------CchHHHH----H Confidence 223333333332221 122223444443222 111122222111 11221 2112222 3 Q ss_pred HcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 460 AFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 460 ~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) .+|. +.+ ++|++..++++ ..+. .+ +.++.. ....-.| T Consensus 390 ~l~~-----v~d~~~E~~ri~~E~-~~~~-~~------~~~~~~-~~~~~~~ 427 (429) T protein:vir:98 390 VLSI-----VENPQKEIERKNSDK-STLI-SR------QAGGLN-GQNTTTI 427 (429) T ss_pred hCCC-----CCCHHHHHHHHHHHH-HHHH-HH------HHhhhc-CCCCCCC Confidence 3321 222 23333322222 1111 11 001100 0011111 No 110 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=96.32 E-value=0.00075 Score=37.71 Aligned_cols=432 Identities=12% Similarity=0.037 Sum_probs=177.2 Q ss_pred Ch---------hHHHHHHHHHhccC-chHHHHHHHHhhcccccCCCC-CCccccccc-cccchHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 1 MK---------STAAMLWEKLRDGS-VEQRAIEFAKTTLPYLMVDPM-SGSRGVVEH-DFQSAGALLVNNLAAKLARSLF 68 (510) Q Consensus 1 ~k---------~~~~~r~~~lkr~~-~~~~w~e~~~~~~P~~~~~~~-~~~~~~~~~-~~dstg~~a~~~Laa~l~~~lt 68 (510) .+ ..+..+|+.++..- =....++...-.||.. +... ..-..++.+ .|-+.-.+.++.++..+..- T Consensus 6 ~~~v~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~-~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vf~k-- 82 (513) T protein:vir:97 6 PKSPATTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRH-QEETDKGYQERLASAVLLNMVEQTLDTLSGKPFSE-- 82 (513) T ss_pred CCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCC-CCCCHHHHHHHHhcccCCChHHHHHHHHhhhhhhc-- Confidence 11 11234444332110 0112233333334542 1111 111222222 45556666666666444331 Q ss_pred CccCcccccCCChhhhhhhccCchHHHHHHH-HHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC---- Q lcl|Aclame:pro 69 PTGIPFFRSELTDAIRREADSRDTDITEVTA-ALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---- 143 (510) Q Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~-~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~---- 143 (510) ||. -|.. . .+.+.+ ++++| -+...+.+.-+...+.+...+|-+.+++|.+.. T Consensus 83 ~p~-~~~~--~--------------p~~~~~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~ 139 (513) T protein:vir:97 83 PIK-LNED--V--------------PKAIEETILPDV------DLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPRE 139 (513) T ss_pred Ccc-cCcC--c--------------hHHHHHHHhhcc------CCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCcc Confidence 221 0111 1 112232 23333 245667888888999999999999889985421 Q ss_pred -----------------eEEEEEeceEE---Eee-CCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEE Q lcl|Aclame:pro 144 -----------------TVVAWSLRSYA---VRR-DATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYT 202 (510) Q Consensus 144 -----------------~~~~~pl~~~~---v~~-d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~ 202 (510) .+..|+-.+.. ..+ |..+.+.-+..+++...+ +.|+... ++.|. T Consensus 140 ~~~~~T~Ade~~~~~rPy~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~---Dgf~~~~------------~~q~r 204 (513) T protein:vir:97 140 DGQPRTLADDRREGLRPYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQ---DGFAEVC------------KRRIR 204 (513) T ss_pred chhHHhHHHHHhhccCceEEEecHhhhcCcceeccCcceeeeeEEEEEEEeec---CCCcceE------------EEEEE Confidence 14555544332 222 444455555556655422 2243221 11121 Q ss_pred EEEeecCCCeeEEEEEEeeC-------CeeeccccccccccCceEEEeeeecCCCcc--ccchHHHHHHHHHHHHHHH-- Q lcl|Aclame:pro 203 HVQRRKGTAMDYAEMYHEID-------GVRVGETGRWPIHLCPYIVPTWNLAPGEHY--GRGHVEDYIGDFAKLSLLS-- 271 (510) Q Consensus 203 ~v~~~~~~~~~~~sv~~e~~-------~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~Y--Grgp~~~~l~d~~~L~~l~-- 271 (510) +.. ++ + |.+|-..+ +-.+..+++ +.+++|++.|-...+..+ |..|.. |+..||.-. T Consensus 205 vL~--~g-~---~~v~r~~~~~~~~~~e~~~~~~g~---~~l~~IP~v~~~~~~~~~~~~~pPLl----~LA~ln~~hy~ 271 (513) T protein:vir:97 205 VLE--PG-L---VQLWEPVKKSNAQKEEWALADEWA---TGLNYVPLVTFYADRQGFMMGKPPLL----DLAHLNVAHWQ 271 (513) T ss_pred EEe--Cc-e---EEEEEeecCCCccccceEEecCCC---CcCCceeEEEEecCCCCCCCCccchH----HHHHHHHHHHh Confidence 111 11 1 12222111 112222333 246677777765554433 444533 666666432 Q ss_pred -HHHHHH-HHHhhCCceeeCCCCccc--hhhhhcCCCcce-ecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 272 -EKLGLY-ELESLEVLNLVDEAKGAV--VDDYQDAEMGDY-VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY 346 (510) Q Consensus 272 -~~~l~~-~~~a~~~~~lv~~~g~~~--~~~~~~~~~G~~-~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~ 346 (510) .+-++. +..+.-|...+. |+.. .+.+..+.+..+ .|+......-++. .+..+......+.+++++|+++= . T Consensus 272 ~~Sd~~~il~~~~~P~l~~~--G~~~~~~~~i~iG~~~~~~lpe~~~~~~yie~-~g~~i~~~~~~l~~le~qm~~~G-a 347 (513) T protein:vir:97 272 SASDQRHILTVSRFPILACS--GASGEDSDPVVVGPNKVLYNPDPAGRFYYVEH-TGQAIAAGRTDLKDLEEQMAGYG-A 347 (513) T ss_pred hhhhHHHHHHhcccceeeee--cCCcCCCCceEeeccccccCCCCCCcceeecc-CchhHHHHHHHHHHHHHHHHHHH-H Confidence 222222 333444443332 3211 123444443322 3432223333333 24577888899999999997654 2 Q ss_pred cccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHH Q lcl|Aclame:pro 347 GANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQ 426 (510) Q Consensus 347 ~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~ 426 (510) .+++..+...||++.+.+....-..|+.+..++++-+ ++++.++.+- ....++.+++.+-.-.. .+..+.+ T Consensus 348 ~ll~~~~~~~Ta~a~~~~~~~~~S~L~~~a~~le~al-----~~~l~~~a~w--lg~~~~~~~v~in~dF~--~~~~~~~ 418 (513) T protein:vir:97 348 EFLKRKTGGQTATARALDSAEATSDLSAMTGLFEDAL-----AQALDITADW--LRLGPNGGTVELVKDYD--LEEMDAP 418 (513) T ss_pred HhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHH--hCCCCCccEEEeccccC--cccCCHH Confidence 3334334457999999999999999999887766543 4444444321 11122334444422111 1111222 Q ss_pred HHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHH-HHHHHHHHHHHHHH--HHHHHHHHHHH-H-- Q lcl|Aclame:pro 427 SMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADE-LQAEAEEQRRQAAQ--AQAAQETLLEG-A-- 500 (510) Q Consensus 427 ~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee-~~~~~~~~~qqa~~--~~~a~~~~~~~-a-- 500 (510) .+.++++.... + .|....+.+.+-+ .||=...+ ++|+ .+.++.+-..+... .....+...++ + T Consensus 419 ~~~al~~a~~~--G------~is~~t~~~~L~r-~gvl~~d~--d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 487 (513) T protein:vir:97 419 GLQALQVAREK--R------DISRKTYLNGLRL-RGVLPEDF--DEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGE 487 (513) T ss_pred HHHHHHHHHhC--C------CCCHHHHHHHHHh-ccCCCccC--CHHHHHHHHHHhhhhccCCCCccccccCCCCCCCCC Confidence 22222222211 1 1222333333332 33311111 2121 12211111000000 00000000000 0 Q ss_pred ---------------HHhhcccCCC Q lcl|Aclame:pro 501 ---------------SDMTNALAGV 510 (510) Q Consensus 501 ---------------~~~~~~~ag~ 510 (510) +..++.+.|- T Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (513) T protein:vir:97 488 GEGEGEGEGGEGGEGGEGGGNPGGE 512 (513) T ss_pred CCCCCCCCCCCCCCccccCCCCCCC Confidence 0111111111 No 111 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=96.31 E-value=0.00076 Score=37.69 Aligned_cols=386 Identities=11% Similarity=-0.014 Sum_probs=172.8 Q ss_pred ChhHHHHHH-HHHh-ccCchHHHHHHHHhhcccc-cCCCCCCcccccc---ccccchHHHHHHHHHHHHHHhhcCccCcc Q lcl|Aclame:pro 1 MKSTAAMLW-EKLR-DGSVEQRAIEFAKTTLPYL-MVDPMSGSRGVVE---HDFQSAGALLVNNLAAKLARSLFPTGIPF 74 (510) Q Consensus 1 ~k~~~~~r~-~~lk-r~~~~~~w~e~~~~~~P~~-~~~~~~~~~~~~~---~~~dstg~~a~~~Laa~l~~~ltpp~~~W 74 (510) |-+++..+. .++. +.+ +.+.+.+|..-.. .+.-+..-...+. +..-+-+..+|+.||..|. .-+ T Consensus 1 m~~~~i~~L~~~~~~~~~---r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~----~~G--- 70 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKT---GVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRII----FRE--- 70 (422) T ss_pred CChHHHHHHHHHHHHHHH---HHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhccc----cce--- Confidence 766554433 3332 222 3333444433221 0001111111111 1222344555555554321 111 Q ss_pred cccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCC--CC--eEEEEEe Q lcl|Aclame:pro 75 FRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSD--EA--TVVAWSL 150 (510) Q Consensus 75 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~--~~--~~~~~pl 150 (510) | ..+|. + +++....+++.....++.++..++|.+.+++..+ .+ .++++|- T Consensus 71 f--~~~d~-------------~-----------l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p~i~~~sp 124 (422) T protein:vir:97 71 F--TNDDF-------------N-----------AWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAEDGLPKMQVIEA 124 (422) T ss_pred e--eCCch-------------h-----------HHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCCCeeEEEEech Confidence 1 11221 1 1234667899999999999999999998877532 23 3666666 Q ss_pred ceEEEeeCCC-CceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccc Q lcl|Aclame:pro 151 RSYAVRRDAT-GRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGET 229 (510) Q Consensus 151 ~~~~v~~d~~-G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~ 229 (510) .+.++..|+. +++...++++.. ..+.. ... .++++.. ..+++..++...... T Consensus 125 ~~~~~i~D~~~~~~~~a~~~~~~------------------~~~~~--~~~-~~~~~~~------~~~~~~~~~~~~~~~ 177 (422) T protein:vir:97 125 SKATGILDPTTFLLTEGYAILES------------------DSNGN--PTL-EAYFTDK------DIWYYPKKGKPYNIK 177 (422) T ss_pred hhEEEEEeCCCCcceeeEEEEEe------------------cCCCc--EEE-EEEEcCc------eEEEEcCCCcccccc Confidence 5554444643 444433333211 01111 111 1111211 112222233222223 Q ss_pred cccccccCceEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHhhCCceee---CCCCccchhhhhcCCCc Q lcl|Aclame:pro 230 GRWPIHLCPYIVPTWNLAPGEHYGRGHV-EDYIGDFAKLSLLSEKLGLYELESLEVLNLV---DEAKGAVVDDYQDAEMG 305 (510) Q Consensus 230 ~~y~~~~~P~~~~Rw~~~~ge~YGrgp~-~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv---~~~g~~~~~~~~~~~~G 305 (510) -++ +.+|++++..+...++.||+|-. +..++-+..+|...-..+..++..+.|...+ .++|.. ........| T Consensus 178 ~~~--g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~--~~~~~~~~~ 253 (422) T protein:vir:97 178 NPT--GHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDPDAKP--MEKWRATVS 253 (422) T ss_pred CCC--CCcceEEecccCCCccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccCccccc--Cchhhhhhh Confidence 334 46899999999999999999976 5688999999999888888888888775444 233321 111111122 Q ss_pred ce--ecCCcc--ccccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-----cCCCCCC-CCHHHH-------HHHHHHH Q lcl|Aclame:pro 306 DY--VPGGAE--AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-----NQRDAER-VTAEEV-------RITAEEA 368 (510) Q Consensus 306 ~~--~~g~~~--~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-----~~~~~~~-vTAtEi-------~~r~~E~ 368 (510) .+ +|...+ .++.-++. .++++.. ++.++.-|........ +.....+ .+|.-| ..+++++ T Consensus 254 ~i~~~~~de~~~~~~v~q~~-~~~l~~~---~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k 329 (422) T protein:vir:97 254 TLLEISKDEDGDKPTVGQFT-TASMAPF---MEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKA 329 (422) T ss_pred hhhccCCCCCCCcceeeecC-CCChhHH---HHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHH Confidence 22 222222 12222222 2455543 3444444433322221 1111111 233332 3445666 Q ss_pred HHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccc--eeeEEe--ecHHHHHHHHHHHHHHHHHHHHHhhcChHhH Q lcl|Aclame:pro 369 ENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQ--HKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQL 444 (510) Q Consensus 369 ~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~--~~~~~v--s~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~ 444 (510) ...+|..+.++ +..++.+. ++....+.+. +.+... .+.+....++.+..+..+.+.+. ++ T Consensus 330 ~~~fg~~l~~~--------~rla~~~~--~~~~~~~~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~---~~--- 393 (422) T protein:vir:97 330 QRSFSSGFLNV--------AYIAVCLR--DEFPYLRNQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQAIP---GF--- 393 (422) T ss_pred HHHHHHHHHHH--------HHHHHHHh--cCCcccchhhccceEEEccCCCCChHHHHHHHHHHHHHHhhcc---cc--- Confidence 66677666552 22223322 3333333332 333332 23455455555555444444211 11 Q ss_pred hhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 445 DPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQA 486 (510) Q Consensus 445 ~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa 486 (510) .+.+ .+.+.+|+... +++.. +.++..++. T Consensus 394 ---~~~~----~~~~~lg~~~~-----~~~~~-~~~~~~~d~ 422 (422) T protein:vir:97 394 ---MDAD----VIRDLTGVKGA-----DKPIP-AITEVTTDG 422 (422) T ss_pred ---ccHH----HHHHHcCCCch-----hHHHH-HHHhhhccC Confidence 1222 23344566431 22222 122222222 No 112 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=96.25 E-value=0.00083 Score=37.49 Aligned_cols=410 Identities=12% Similarity=0.034 Sum_probs=169.6 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccc-cCCCCCCcccccc--ccccchHHHHHHHHHHHHHHhhcCccCccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYL-MVDPMSGSRGVVE--HDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~-~~~~~~~~~~~~~--~~~dstg~~a~~~Laa~l~~~ltpp~~~WF 75 (510) |-++-....+.|. -.....+.+.+.+|..-.. .+.-+..-...+. +..-+-+..+|+.||..|.-- + | T Consensus 12 l~~~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~rl~~~----G---f 84 (474) T protein:vir:81 12 LSNDENALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDALARRCNLE----G---F 84 (474) T ss_pred CChhHHHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHHHHhhhccc----c---e Confidence 4444333333331 1111112333334322211 0000000011111 223455566666666644311 1 2 Q ss_pred ccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeC--CCC---eEEEEEe Q lcl|Aclame:pro 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS--DEA---TVVAWSL 150 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~--~~~---~~~~~pl 150 (510) ++ ++.... .. .+++...++++.....+++++..++|.+.+++.. +.. .++++|- T Consensus 85 ~~--~d~~~~--------~~-----------~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp 143 (474) T protein:vir:81 85 VW--PDGDLD--------SL-----------GGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDA 143 (474) T ss_pred EC--CCCCcc--------ch-----------HHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEecc Confidence 22 221110 01 1234467889999999999999999999877753 221 3677776 Q ss_pred ceEEEeeCCC-CceeEEEEEEEecHHHHhHHhhHHhhcccccCCCC-ceEEEEE--EE--EeecCCCeeEEEEEEeeCCe Q lcl|Aclame:pro 151 RSYAVRRDAT-GRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGS-GSVDLYT--HV--QRRKGTAMDYAEMYHEIDGV 224 (510) Q Consensus 151 ~~~~v~~d~~-G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~-~~v~v~~--~v--~~~~~~~~~~~sv~~e~~~~ 224 (510) .+.++..|+. +++...++.... ..+.+ ...++|. .+ +.+++.+.. +..+. T Consensus 144 ~~~~~~~D~~~~~~~~al~~~~~------------------~~~g~~~~~~ly~~~~~~~~~~~~~~~~-----w~~~~- 199 (474) T protein:vir:81 144 SEATGEWNRRRRGLNNLLSIIDK------------------DKEGKVLSLALYLDNETVTAQRDKATLK-----WQVDR- 199 (474) T ss_pred ceEEEEEeCCCCcceeeeEEEEE------------------cCCCcEEEEEEEeCCcEEEEEEcCccce-----eeecc- Confidence 5554444543 444333322210 11111 1222221 11 111221111 11111 Q ss_pred eeccccccccccCceEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccc-------- Q lcl|Aclame:pro 225 RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHV-EDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAV-------- 295 (510) Q Consensus 225 ~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~-~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~-------- 295 (510) ..-.+ .+|.+.+..+...++.+|+|-. +..++-+..+|+..-.++..++..+.|...+- |+.. T Consensus 200 ---~~~~~---gvPvV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~--G~~~~~~~d~d~ 271 (474) T protein:vir:81 200 ---DEHVY---GVPAQVLPYKPAPKRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLL--GADESALKNADG 271 (474) T ss_pred ---CCCCC---CcceEEecccccccCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheee--cCChhhcccccc Confidence 11222 2799999999888999999965 57788889999988888888888887755441 2211 Q ss_pred -hhhhhcCCCcce--ecCCcccc-------ccccCCCccchHHHHHHHHHHHHHHHHHHhhcccC-------CCCCCCCH Q lcl|Aclame:pro 296 -VDDYQDAEMGDY--VPGGAEAV-------RAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQ-------RDAERVTA 358 (510) Q Consensus 296 -~~~~~~~~~G~~--~~g~~~~v-------~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~-------~~~~~vTA 358 (510) +........|.+ ++++.+.. +.-|+. .++++.. ++.++.-|......+... ....+-+| T Consensus 272 ~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~-~a~l~~~---~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~Sa 347 (474) T protein:vir:81 272 TIKSVWEARLGRIKGLPDDADADIPQLARADVKQFP-AASPDAH---WSDINGLAKLFAREASLPDTAVAISGLSNPTSA 347 (474) T ss_pred cccchhhhhHHHHhcCCCcccccccccccccccccC-CCChhHH---HHHHHHHHHHHHhhhCCCHHHhcccccccccHH Confidence 111111111222 22222211 111221 2344433 333444443332222211 11111234 Q ss_pred HHH-------HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEe----ecHHHHHHHHHHHH Q lcl|Aclame:pro 359 EEV-------RITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIE----TGLPALSRSAAVQS 427 (510) Q Consensus 359 tEi-------~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~v----s~l~~l~r~~~~~~ 427 (510) .-| ..+++++...+|.-+.+ ++..++.+......-..+.+..+++++ ..-+.++++..+.+ T Consensus 348 eAi~a~~~~l~~kae~k~~~fg~~l~~--------~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~K 419 (474) T protein:vir:81 348 ESYDASQYELIAEAEGAVDDFTPALRK--------AFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMK 419 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHhCCCCccccchhhccceeEecCCCccCHHHHHHHHHH Confidence 333 34566667777765554 223333333332233344444443332 22222333222222 Q ss_pred HHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 428 MLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNAL 507 (510) Q Consensus 428 ~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ 507 (510) +. + +..+++. ..+ ....+|+. ++|++.....+++++.+.. ..++.......+.+| T Consensus 420 l~---~---a~~~~~~-------~~~---~~~~lg~t-------~~~i~~~~~~~~~~~~~~~--~~~l~~~~~~~~~aq 474 (474) T protein:vir:81 420 QL---A---AVPWLAE-------TEV---GLELIGLT-------PQQARRAMADKRRVQGRGT--LQALIDRSNNGATAQ 474 (474) T ss_pred HH---h---cccCCCc-------HHH---HHhhcCCC-------HHHHHHHHHHHHHHhHHHH--HHHHHhcCCCCCCCC Confidence 22 2 2112211 011 22334654 4555443322222222111 111222112222233 No 113 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=96.13 E-value=0.00097 Score=37.11 Aligned_cols=375 Identities=10% Similarity=0.010 Sum_probs=171.9 Q ss_pred ChhHHHHHHHH-Hh-ccCchHHHHHHHHhh--cccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccc Q lcl|Aclame:pro 1 MKSTAAMLWEK-LR-DGSVEQRAIEFAKTT--LPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~k~~~~~r~~~-lk-r~~~~~~w~e~~~~~--~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~ 76 (510) |-..+..+..+ +. +.+...+..++|+-. +|++-+ .-...-...-+..-+-+..+|++||..|. ..+ | T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~-~~p~~~~~~~~~v~nw~~~iVds~a~rl~----~~G---f- 71 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHKRRAEMRYDQYAMKYVDRFKGI-TIPQALSQQYRSILGWCAKGVDSLADRLV----FRE---F- 71 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHhcccCchhhcCh-hhhHHHHHHHhhhcchhHHHHHHhHhhcc----cCc---c- Confidence 66555544432 32 222222223343321 111100 00000001112333455556666555432 112 1 Q ss_pred cCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC---eEEEEEeceE Q lcl|Aclame:pro 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA---TVVAWSLRSY 153 (510) Q Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~---~~~~~pl~~~ 153 (510) ..+| .+ +.+....++|.....++.++..++|.+.+++..++. .++++|..+. T Consensus 72 -~~~d-------------~~-----------l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~~i~~~sp~~~ 126 (409) T protein:vir:94 72 -ENDD-------------FT-----------VNEIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAVRLQVIEAVNA 126 (409) T ss_pred -cCCc-------------hH-----------HHHHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCceEEEEeccceE Confidence 1111 11 223466788999999999999999998877665422 4777777666 Q ss_pred EEeeCCC-CceeEEEEEEEecHHHHhHHhhHHhhcccccCCCC-ceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccc Q lcl|Aclame:pro 154 AVRRDAT-GRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGS-GSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGR 231 (510) Q Consensus 154 ~v~~d~~-G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~-~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~ 231 (510) ++..|+. +++...++...- ..+.. ....+|. ++....| ...++.......+ T Consensus 127 ~~i~D~~~~~~~~a~~~~~~------------------d~~~~~~~~~~~~-----~~~~~~~----~~~~~~~~~~~n~ 179 (409) T protein:vir:94 127 TGIIDPITGLLTEGYAVLER------------------DENNNVVLEAHFL-----PDRTDYY----YRDSRNNISIANP 179 (409) T ss_pred EEEEecCCCceeeeEEEEEe------------------cCCCceEEEEEEe-----cCcEEEE----EecCceeEeeeCC Confidence 6666654 555544443210 00111 1111111 1111100 1111211122233 Q ss_pred cccccCceEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHhhCCceee---CCCCccchhhhhcCCCcce Q lcl|Aclame:pro 232 WPIHLCPYIVPTWNLAPGEHYGRGHV-EDYIGDFAKLSLLSEKLGLYELESLEVLNLV---DEAKGAVVDDYQDAEMGDY 307 (510) Q Consensus 232 y~~~~~P~~~~Rw~~~~ge~YGrgp~-~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv---~~~g~~~~~~~~~~~~G~~ 307 (510) + +.+|.+.+..+...++.||+|-. +..++-+..+|+..-..+..++..+.|...+ .+++. ..+.++. ..+.+ T Consensus 180 ~--g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~-~~~~~~~-~~~~i 255 (409) T protein:vir:94 180 T--GHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAE-PMETWKA-TVSSM 255 (409) T ss_pred C--CCcceEEeccccccccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCc-ccchhhh-hHHHh Confidence 4 47999999999889999999966 5688888999999888888888888885444 34332 1122222 12222 Q ss_pred e--cCCccc--cccccCCCccchHHHHHHHHHHHHHHHHHHhhcccC-----CCCCC-CCHHHH-------HHHHHHHHH Q lcl|Aclame:pro 308 V--PGGAEA--VRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQ-----RDAER-VTAEEV-------RITAEEAEN 370 (510) Q Consensus 308 ~--~g~~~~--v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~-----~~~~~-vTAtEi-------~~r~~E~~~ 370 (510) . |...++ ++.-++. .++++.. ++.++.-|+...+..... ....+ -+|.-| ..+++++.. T Consensus 256 ~~~~~d~dg~~~~v~q~~-~~~l~~~---~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~ 331 (409) T protein:vir:94 256 LQFTKDEDGDKPTLGQFT-QPSMSPF---TEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQR 331 (409) T ss_pred hcCCCCCCCCCceEEecC-CCChhHH---HHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHH Confidence 2 222221 2222332 2455543 444444444433332211 11122 233322 335556666 Q ss_pred HhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccc--eeeEEe--ecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhh Q lcl|Aclame:pro 371 TLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQ--HKPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP 446 (510) Q Consensus 371 ~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~--~~~~~v--s~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~ 446 (510) .+|..+.+ ++..++.+. ++....+.+. +++..- .+-+..+.++.+..+..+.+.. .+++ T Consensus 332 ~fg~~~~~--------~~rla~~i~--~~~~~~~~~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag---~~~~---- 394 (409) T protein:vir:94 332 SLGAGLLN--------VAYLAACLR--DDAPYLREQFRKTKPKWEPLFEADASMLSLIGDGAIKLNQAI---PEFI---- 394 (409) T ss_pred HHHHHHHH--------HHHHHHHHh--CCCCccccccccceEEeccCCCcchHHHHHHHHHHHHHHHhc---cccc---- Confidence 66655544 223333333 3333344433 333332 2334444455554444444321 1111 Q ss_pred cCCHHHHHHHHHHHcCCCHhh Q lcl|Aclame:pro 447 RISLPKMMDTIWAAFSVDTSQ 467 (510) Q Consensus 447 ~id~d~~~~~~a~~~Gvp~~~ 467 (510) +. +.+.+.+|.+... T Consensus 395 --~~----~~~~~~lG~~~~d 409 (409) T protein:vir:94 395 --NK----DTIRDLTGIEGGE 409 (409) T ss_pred --ch----hHHHHHcCCCCCC Confidence 11 2345556665432 No 114 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=96.10 E-value=0.001 Score=37.03 Aligned_cols=406 Identities=12% Similarity=0.005 Sum_probs=176.7 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhcccc--cCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYL--MVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~~--~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 78 (510) -.+.+.+..++.+. -..+.+.+.+|..-.- .............++..+.+...++..++.|.+ .| +++. T Consensus 18 ~~~~i~~~i~~~~~--~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g------~~-~~~~ 88 (452) T protein:vir:36 18 TVEVVTKFMEKHKL--EVARYEYLKNMYLGIMAIDDEPAKDSWKPDNRLAVNFTKYIVDTFTGYFNG------IP-VKKS 88 (452) T ss_pred CHHHHHHHHHHHHH--HHHHHHHHHHHhccccccccCccccccCccceeecchHHHHHHHHhhhhcc------cC-ceee Confidence 22344444443321 1234455555544321 111111111222345566777777777766643 11 2233 Q ss_pred CChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC-C--eEEEEEeceE-E Q lcl|Aclame:pro 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-A--TVVAWSLRSY-A 154 (510) Q Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~-~--~~~~~pl~~~-~ 154 (510) .+++. . ...+.+.+..++|....+++.++...+|.+.+++..+. + ++.+++..+. . T Consensus 89 ~~d~~-------------~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~ 148 (452) T protein:vir:36 89 HSDKE-------------I-------LTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDEDTQTNVVYNSPENMFM 148 (452) T ss_pred cCChh-------------H-------HHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEE Confidence 33321 1 12334456678999999999999999998876553322 3 3556665554 4 Q ss_pred EeeCC-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeC--Ceeeccccc Q lcl|Aclame:pro 155 VRRDA-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEID--GVRVGETGR 231 (510) Q Consensus 155 v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~--~~~~~~~~~ 231 (510) +..|. .+.+.-.+|.+.- .+....++||+. +. .++++.+ +..+..... T Consensus 149 v~d~~~~~~~~~~i~~~~~-------------------~~~~~~~~vyt~-----~~-----i~~~~~~~~~~~~~~~~~ 199 (452) T protein:vir:36 149 VYDDTVKQEPLFAVRYGVD-------------------EDKKLQGEVYTL-----LE-----TIKISGENDEISFGEGTY 199 (452) T ss_pred EEcCCCCCceEEEEEEEEe-------------------cCceEEEEEEec-----Ce-----EEEEEEcCCceEEeccee Confidence 44333 2444444443321 011223444431 11 1111111 111221222 Q ss_pred cccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCC-cce-ec Q lcl|Aclame:pro 232 WPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEM-GDY-VP 309 (510) Q Consensus 232 y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~-G~~-~~ 309 (510) .++..+|++..+. +..|+|-.+...+-+..++.+.-...........|.+++.- .....+....... +.+ ++ T Consensus 200 ~~~g~iPvv~~~n-----~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g-~~~~~~~~~~~~~~~~~~~~ 273 (452) T protein:vir:36 200 NPYPDLPVVEFYF-----NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLG-AAVEEEDLKNIRSNRVINYY 273 (452) T ss_pred ccCCcccEEEecC-----CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeec-CCcCchhhhhhhhcceEEec Confidence 2335688776644 34689988999999999999988888888888898777642 2222333322222 211 12 Q ss_pred C-C---ccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHH Q lcl|Aclame:pro 310 G-G---AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQ 384 (510) Q Consensus 310 g-~---~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l 384 (510) . + ..+++.+. ...+.......++.+++.|...-.. +.........|+.-+..+-.-+... .--..+.-...+ T Consensus 274 ~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k-~~~~~~~~~~~l 350 (452) T protein:vir:36 274 ADGEGKNVDVKFLE--KPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNL-ALSFQRKFQSSL 350 (452) T ss_pred CCCCccCCcceeEe--ecCCHHHHHHHHHHHHHHHHHHhCccccCcccccCCcHHHHHHHHHHHHHH-HHHHHHHHHHHH Confidence 1 1 11233322 2235666677777777777443321 2111112345666554332221111 111222223333 Q ss_pred HHHHHHHHHHHhhcCCCCCCccceeeEEeecH--HHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcC Q lcl|Aclame:pro 385 SPLAYVCLSEVDDALLQGLITKQHKPAIETGL--PALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFS 462 (510) Q Consensus 385 ~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l--~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~G 462 (510) ..++.-++.++...+.. .....+++.+.-.+ +.++.++ . ++.+++. +....++ ..+| T Consensus 351 ~~~~~li~~~~~~~~~~-~~~~~i~i~f~~~~p~d~~~~a~---~-------~~k~~g~------iS~et~~----~~~~ 409 (452) T protein:vir:36 351 NSRYKLFCELSTNVSNK-DSWKDIEYTFTRNEPKDIKEQAE---T-------ANILMGI------TSQETAL----SVIS 409 (452) T ss_pred HHHHHHHHHHHhccCCc-cccccceEEeCCCCCcCHHHHHH---H-------HHHHhcc------CChHHHH----HhCC Confidence 33444444444432221 12234555543322 2222222 1 1111221 2222222 2332 Q ss_pred CCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 463 VDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 463 vp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) . +.+ ++|++...+ ++.++ ++.. . +..+.-.|. T Consensus 410 ~-----~~d~~~E~~ri~~-E~~~~--~~~~------~--~~~~~~~~~ 442 (452) T protein:vir:36 410 V-----IPDVQAEMEKIKK-EEAST--AIFD------K--DKQPSEKGT 442 (452) T ss_pred C-----CCCHHHHHHHHHH-HHHHH--HHHH------h--hccCCCCcc Confidence 1 222 334433222 21111 1100 0 011111111 No 115 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=96.03 E-value=0.0011 Score=36.81 Aligned_cols=377 Identities=11% Similarity=0.028 Sum_probs=164.9 Q ss_pred HhccCchHHHHHHHHhhcccc-cCCCCCCcccc---ccccccchHHHHHHHHHHHHHHhhcCccCcccccCCChhhhhhh Q lcl|Aclame:pro 12 LRDGSVEQRAIEFAKTTLPYL-MVDPMSGSRGV---VEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREA 87 (510) Q Consensus 12 lkr~~~~~~w~e~~~~~~P~~-~~~~~~~~~~~---~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~ 87 (510) |. -+.++-+.+.+|..-.. .+.-+..-... ..+..-+-+..+|++||..|. ..+ | ..+|. T Consensus 1 l~--~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~----~~G---f--~~~d~----- 64 (410) T protein:vir:95 1 MN--LYQSRVNLRYKHYAMQHYEAPTGITIPAHIRAKYQAVLGWAAKGVDSLADRLI----FRA---F--ANDDF----- 64 (410) T ss_pred CC--cchhhHHHHHHHhcCCCCccccchhccHHHHhHHHhhcchhHHHHHHhHhhhc----ccc---c--cCCCc----- Confidence 21 11222222333322211 00001000001 112334555666666655443 111 1 11111 Q ss_pred ccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCC--C-CeEEEEEeceEEEeeCC-CCce Q lcl|Aclame:pro 88 DSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSD--E-ATVVAWSLRSYAVRRDA-TGRW 163 (510) Q Consensus 88 ~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~--~-~~~~~~pl~~~~v~~d~-~G~v 163 (510) + +.+....++|.....++.++..++|.+.+++..+ . .++++++..+.++..|+ .+++ T Consensus 65 --------~-----------l~~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~~i~~~sP~~~~~i~Dp~~~~~ 125 (410) T protein:vir:95 65 --------N-----------VTEIFDRNNPDIFFDSAILSALIGSCSFVYISKGEDDEVRLQVIESSNATGVIDPITGLL 125 (410) T ss_pred --------h-----------HHHHHhhcChHHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEEeCCCCce Confidence 1 2233567899999999999999999988777543 2 25777766554444454 3555 Q ss_pred eEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccccccccCceEEEe Q lcl|Aclame:pro 164 MDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPT 243 (510) Q Consensus 164 ~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~R 243 (510) ..-++... . .+......+.+|+ + +. .+++.-++..-...-++ ..||++.+. T Consensus 126 ~~al~~~~----------------~-~~~~~~~~~~~~~---~--~~-----~~~~~~~~~~~~~~~~~--g~vPvV~f~ 176 (410) T protein:vir:95 126 VEGYAVLA----------------R-DDYNRPTLEAYFE---P--NA-----THFIPKDGEPYSVTNET--GIPLLVPVI 176 (410) T ss_pred EEEEEEEE----------------e-cCCCeEEEEEEEe---C--Cc-----EEEEeeCCccccccCCC--CCcceEEec Confidence 54443211 0 0111111222332 1 11 12222222111112333 469999999 Q ss_pred eeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHhhCCceee---CCCCccchhhhhcCCCccee--cCCccc--c Q lcl|Aclame:pro 244 WNLAPGEHYGRGHV-EDYIGDFAKLSLLSEKLGLYELESLEVLNLV---DEAKGAVVDDYQDAEMGDYV--PGGAEA--V 315 (510) Q Consensus 244 w~~~~ge~YGrgp~-~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv---~~~g~~~~~~~~~~~~G~~~--~g~~~~--v 315 (510) .+...++.||+|=. +..++-+..+|...-.++..++..+.|...+ .++|... +.+. ...|.+. +...++ + T Consensus 177 n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~-~~~~-~~~~~i~~~~~~~~~~~~ 254 (410) T protein:vir:95 177 HRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQKYILGLDPDAEPM-EKWK-ATVSSLLTISSSDKGVKP 254 (410) T ss_pred ccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeeccCCCCCcC-chhh-hhhhhheeccCCCCCCcc Confidence 99998999999944 5688888899998888888888888775443 2333211 1111 1122222 222111 2 Q ss_pred ccccCCCccchHHHHHHHHHHHHHHHHHHhhcccC-----CCCCC-CCHHHH-------HHHHHHHHHHhhhhHHHHHHH Q lcl|Aclame:pro 316 RAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQ-----RDAER-VTAEEV-------RITAEEAENTLGGTYSLLAEN 382 (510) Q Consensus 316 ~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~-----~~~~~-vTAtEi-------~~r~~E~~~~LGpv~~rl~~E 382 (510) +.-++ ..++++.. ++.++.-|.......... ....+ -+|.-| ..+++++...+|.-+.+ T Consensus 255 ~v~q~-~~~~l~~~---~~~l~~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~---- 326 (410) T protein:vir:95 255 SVGQF-TTASMSPF---TEQLRTAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLN---- 326 (410) T ss_pred eEEec-CCCChHHH---HHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Confidence 22222 23456543 344444444333322211 11122 233322 33556666666665554 Q ss_pred HHHHHHHHHHHHHhhcCCCCCCccceeeEEe-e---cHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHH Q lcl|Aclame:pro 383 LQSPLAYVCLSEVDDALLQGLITKQHKPAIE-T---GLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIW 458 (510) Q Consensus 383 ~l~Pli~r~~~il~~~~l~~~p~~~~~~~~v-s---~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a 458 (510) ++..++.+. ++....+.+..+.+++ . ..+.-+.++.+..+..+.+. ..+++ +. +.+. T Consensus 327 ----~~rla~~i~--~~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~Kl~~a---~~g~~------~~----~~~~ 387 (410) T protein:vir:95 327 ----VAYVAACLR--DEFRYTRSQFVRTAVKWEPLFEADANTMTMIGDGVVKLNQA---LPGYI------NA----ETIR 387 (410) T ss_pred ----HHHHHHHHh--cCCCCcccccceeeEEeeecCCcchhhHHHHHHHHHHHHHh---ccCCc------cH----HHHH Confidence 223333333 3334444444443332 1 22222333333333332232 11211 11 2244 Q ss_pred HHcCCCHhhccCCHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 459 AAFSVDTSQFYKSADELQAEAEEQRRQAAQ 488 (510) Q Consensus 459 ~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~ 488 (510) +.+|... +++..+..+++++..+ T Consensus 388 ~~lg~~~-------~~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 388 DLTGIAG-------DMSAKPVVSEGGSNGE 410 (410) T ss_pred HhcCCCh-------HHHHHHHHHHHHhCCC Confidence 5566653 2222111111111111 No 116 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=96.03 E-value=0.0011 Score=36.80 Aligned_cols=425 Identities=10% Similarity=0.006 Sum_probs=171.4 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhcccc---cCCCC--CCccccccccccchHHHHHHHHHHHHHHhhcCccCccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYL---MVDPM--SGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~~---~~~~~--~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF 75 (510) ..+.+.+..++- ......+++++.+|....- +.... ........++-.+.+...++..++.|.+- | + T Consensus 23 ~~~~i~~li~~~-~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~G~--p-----~ 94 (506) T protein:vir:94 23 TPNKIMKFITHH-FNYQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAKYIADFQTSYSVGN--P-----I 94 (506) T ss_pred CHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCccccccccccccccCCcceeecchHHHHHHHhhhhhccc--C-----c Confidence 122222222221 1122345666666654431 11100 00111223455566666777766665541 2 1 Q ss_pred ccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC-C--eEEEEEece Q lcl|Aclame:pro 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-A--TVVAWSLRS 152 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~-~--~~~~~pl~~ 152 (510) .+..+++. .. ..+.+.+..++|.....++.++..++|.+.+++..+. + ++.+++..+ T Consensus 95 ~~~~~d~~-------------~~-------~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~~i~~~~p~~ 154 (506) T protein:vir:94 95 NVKLPDDG-------------SN-------SGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGEDNEEHLAKLDPLD 154 (506) T ss_pred eeecCcch-------------HH-------HHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccc Confidence 22333221 11 1233446678999999999999999999875544332 2 355565555 Q ss_pred -EEEeeCC-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEE-eeCCeeeccc Q lcl|Aclame:pro 153 -YAVRRDA-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYH-EIDGVRVGET 229 (510) Q Consensus 153 -~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~-e~~~~~~~~~ 229 (510) |++.-|. .+++.-.+|.+...-. ..+.......++.++.. .. ..+|- ...+...... T Consensus 155 ~~~v~dd~~~~~~~~~v~~~~~~~~---------------~~~~~~~~~~~~~~yt~-~~----~~~~~~~~~~~~~~~~ 214 (506) T protein:vir:94 155 TFVIYSTDVDPKPIMAVRYHQIELV---------------DDNQVSTINYVPETWTA-DT----YTLYNPTPIMGKMQVD 214 (506) T ss_pred eEEEecCCCCCceEEEEEEEeeeec---------------cCCceeEEEEEEEEEeC-ce----EEEeccccCccceecc Confidence 4444443 3666555555533210 11111122222222221 11 01111 0011111111 Q ss_pred cccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccch------------- Q lcl|Aclame:pro 230 GRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVV------------- 296 (510) Q Consensus 230 ~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~------------- 296 (510) ...++..+|++..+= ...|.|-.+...+-+-.++.+.-..+...+...+|.+++.-...... T Consensus 215 ~~~~~g~vPvv~~~n-----~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~ 289 (506) T protein:vir:94 215 TTKPITTFPVVEFKN-----SNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPND 289 (506) T ss_pred ccccCCccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccc Confidence 122345688876532 34688888888898888888877777777766666555421100000 Q ss_pred ------------hhhhc--------CCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCC Q lcl|Aclame:pro 297 ------------DDYQD--------AEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAE 354 (510) Q Consensus 297 ------------~~~~~--------~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~ 354 (510) ..+.. ...+....|......+-.+....+.+.....++.+...|...-.. +. ....+. T Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~ 369 (506) T protein:vir:94 290 EDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFAS 369 (506) T ss_pred cccccccccchhHHHhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc Confidence 00000 000011111111111112222345677777777777777543321 11 111224 Q ss_pred CCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhh-cCCCCCCccceeeEEeecH--HHHHHHHHHHHHHHH Q lcl|Aclame:pro 355 RVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIETGL--PALSRSAAVQSMLNA 431 (510) Q Consensus 355 ~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~p~~~~~~~~vs~l--~~l~r~~~~~~~~~~ 431 (510) ..|+..+..+-.-+... .-...+.-.+.+..+++.++.++.. .+...+....+++.+.-++ +.++.++-+.+ T Consensus 370 n~Sg~Aik~~~~~l~~k-~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~k---- 444 (506) T protein:vir:94 370 NSSGVAMQYKVLGTVEL-ASTKRRMFERGLYARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQ---- 444 (506) T ss_pred cchHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHH---- Confidence 45666554432211111 1222233333344444444454432 2222222334555553332 22222222111 Q ss_pred HHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH--hhccc Q lcl|Aclame:pro 432 SQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGA-SD--MTNAL 507 (510) Q Consensus 432 ~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a-~~--~~~~~ 507 (510) +++. +....++..+ -++ .+ ++|++...+++ +++.... ...+. ++ ..... T Consensus 445 ------l~g~------iS~et~~~~l---p~v------~d~~~E~~ri~~E~-~~~~~~~-----~~~~~~~~~~~~~~~ 497 (506) T protein:vir:94 445 ------AGAT------LPQKYLYQQL---PGV------TNPQDIVDMMKEQS-ANGDYSF-----DQNGVISNDGQTNTT 497 (506) T ss_pred ------Hhcc------CChHHHHHhC---CCC------CCHHHHHHHHHHHH-HHHhhcc-----hhhcCCCcccCcccc Confidence 1221 2222233221 122 22 23333322221 1111110 01110 11 11111 Q ss_pred CCC Q lcl|Aclame:pro 508 AGV 510 (510) Q Consensus 508 ag~ 510 (510) +.. T Consensus 498 ~~~ 500 (506) T protein:vir:94 498 ATQ 500 (506) T ss_pred ccc Confidence 111 No 117 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=95.92 E-value=0.0013 Score=36.49 Aligned_cols=425 Identities=9% Similarity=-0.053 Sum_probs=175.7 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhcccc---cCCC-CCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYL---MVDP-MSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~~---~~~~-~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~ 76 (510) -.+.+.+..+.- +..-.++++++.+|..... .... .........++..+.+...++..++.|++- | ++ T Consensus 40 ~~~~l~~~i~~~-~~~~~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~------p-~~ 111 (501) T protein:vir:27 40 NWELLKNFINHH-KLRQAPRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYLAGN------P-IR 111 (501) T ss_pred cHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCccccccCccCccccccceeccchHHHHHHHHhhhhccc------C-ee Confidence 001111111110 1112234555555554321 1111 111111223455666777777766666431 1 22 Q ss_pred cCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeC--CC-CeEEEEEece- Q lcl|Aclame:pro 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS--DE-ATVVAWSLRS- 152 (510) Q Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~--~~-~~~~~~pl~~- 152 (510) ++..+... .+.+. ..+.+....++|.....++.++..++|.+.+++.. +. .++.+++..+ T Consensus 112 ~~~~d~~~---------~~~~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~~i~~~~p~~~ 175 (501) T protein:vir:27 112 VEYDDNDN---------NSQND-------DTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEYDETRIKRLNPLET 175 (501) T ss_pred EecCCccc---------hHHHH-------HHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCCCceEEEEEcccee Confidence 23332211 11222 23444577789999999999999999998765543 32 2466665544 Q ss_pred EEEeeCC-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCC-eeecccc Q lcl|Aclame:pro 153 YAVRRDA-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDG-VRVGETG 230 (510) Q Consensus 153 ~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~-~~~~~~~ 230 (510) |.+.-|. .+++...+|.+..... .+....++||+ ++. .+++..++ ....... T Consensus 176 ~~v~d~~~~~~~~~~ir~~~~~~~----------------~~~~~~~~vyt-----~~~-----v~~~~~~~~~~~~~~~ 229 (501) T protein:vir:27 176 FVIYDNSLEDNSIAAVRYYNRGTL----------------QNAKDVVEIYT-----NEH-----IYTLDASDDFNEISVT 229 (501) T ss_pred EEEecCCCCCceEEEEEEEEeeec----------------CCcEEEEEEEe-----CCe-----EEEEEeCCceeecccc Confidence 4554444 3566555555432110 11112233332 111 12222222 1111122 Q ss_pred ccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCcc-chhhhh-cCCCccee Q lcl|Aclame:pro 231 RWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGA-VVDDYQ-DAEMGDYV 308 (510) Q Consensus 231 ~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~-~~~~~~-~~~~G~~~ 308 (510) ..++..+|++..+ ++..|+|-....++-+..++.+.-.+.........|.+.+.-.... ...... ....+.+. T Consensus 230 ~~~~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~ 304 (501) T protein:vir:27 230 THAFGTVPITEFL-----NNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQ 304 (501) T ss_pred ccCCCcccEEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhhhhcCcee Confidence 2234578887764 3467999999999999999999888888888888887665311111 111000 00112221 Q ss_pred c-------CCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHH Q lcl|Aclame:pro 309 P-------GGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENTLGGTYSLL 379 (510) Q Consensus 309 ~-------g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl 379 (510) . |....+.+-.+....+.+.....++.+++.|...-+. +. ....+...|+..+...-.- +....-...+. T Consensus 305 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~-l~~ka~~~~~~ 383 (501) T protein:vir:27 305 LKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKLFG-LDQDRVDTQSQ 383 (501) T ss_pred ecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHHHHHHHHHH-HHHHHHHHHHH Confidence 1 1111111111112234455666677777766553322 11 1112234566555433211 12222333333 Q ss_pred HHHHHHHHHHHHHHHHhhcC-CCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHH Q lcl|Aclame:pro 380 AENLQSPLAYVCLSEVDDAL-LQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIW 458 (510) Q Consensus 380 ~~E~l~Pli~r~~~il~~~~-l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a 458 (510) -.+.+.-++..++.++...+ ........+++.+.-.+ +-..+..++.+. .+++. +....++. T Consensus 384 ~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~-p~n~~e~ad~~~-------kl~g~------iS~et~l~--- 446 (501) T protein:vir:27 384 FTQGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNL-PKSLNEQVSILT-------GLGGQ------VSQETALS--- 446 (501) T ss_pred HHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCC-CcCHHHHHHHHH-------HHhcc------CcHHHHHH--- Confidence 33444444444455543222 12222234555553322 212222222211 11221 11122222 Q ss_pred HHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-------ccCCC Q lcl|Aclame:pro 459 AAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTN-------ALAGV 510 (510) Q Consensus 459 ~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~-------~~ag~ 510 (510) .++ ++.+ ++|++..++++ .+...+.. .....+..+ .-.|= T Consensus 447 -~l~-----~v~D~~~E~eri~~E~-~e~~~~~~-----~~~~~~~~~~~~d~~~~~~~d 494 (501) T protein:vir:27 447 -LSG-----LVESPNEELDKINKEV-SEIDFKGY-----SNDFNEHVGKYTDEVKETHTD 494 (501) T ss_pred -hCC-----CCCCHHHHHHHHHHHH-HhhhHhhh-----cCccccccccccCCCCCCccc Confidence 221 2222 34444332222 11111110 000111000 00000 No 118 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=95.83 E-value=0.0014 Score=36.24 Aligned_cols=410 Identities=10% Similarity=0.022 Sum_probs=172.0 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhccc-----cc-C--CCCCC-ccccccccccchHHHHHHHHHHHHHHhhcCcc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY-----LM-V--DPMSG-SRGVVEHDFQSAGALLVNNLAAKLARSLFPTG 71 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~-----~~-~--~~~~~-~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~ 71 (510) .++.+.+..++.+ .-..+++.+.+|..-. +- . ..... ......++..+-+...++.+++.|.+ .| T Consensus 25 ~~~~i~~~i~~~~--~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g--~~-- 98 (472) T protein:vir:93 25 LEEMIVRYIKQHL--EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVG--KP-- 98 (472) T ss_pred HHHHHHHHHHHHH--HHHHHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHHHHHhhhhcc--cC-- Confidence 2332333333322 1123555555554332 10 0 00000 11122345667788888888876643 11 Q ss_pred CcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC-C--eEEEE Q lcl|Aclame:pro 72 IPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-A--TVVAW 148 (510) Q Consensus 72 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~-~--~~~~~ 148 (510) +.+...|+. +.+.| ...+ .++|-..+.++.++..++|.+.+++..+. + ++.++ T Consensus 99 ---~~~~~~d~~-------------~~~~l-------~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~~i~~~ 154 (472) T protein:vir:93 99 ---IAFKHTDDE-------------VVKRI-------DEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRV 154 (472) T ss_pred ---eeeccCChH-------------HHHHH-------HHHH-hccHHHHHHHHHHHHhhcCeEEEEEEECCCCceEEEEE Confidence 222333321 11112 1122 46889999999999999998866554332 2 45666 Q ss_pred Eece-EEEeeC-CCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE--E--EeecCCCeeEEEEEEeeC Q lcl|Aclame:pro 149 SLRS-YAVRRD-ATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH--V--QRRKGTAMDYAEMYHEID 222 (510) Q Consensus 149 pl~~-~~v~~d-~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~--v--~~~~~~~~~~~sv~~e~~ 222 (510) +..+ |++.-| ..+++.-.+|.++.. ....+++|+- + +..++... ...+..+.+ T Consensus 155 ~p~~~~~i~d~~~~~~~~~~ir~~~~~--------------------~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 213 (472) T protein:vir:93 155 PAEQGIPIWTDKEHEELEAFIRMYKLE--------------------NETKVEYWDKVTVNYYVYENGSL-IPDYSNNLE 213 (472) T ss_pred cccceEEEEcCCCCCceEEEEEEEEee--------------------cceeEEEEecCeEEEEEEecCee-eeccccccc Confidence 6555 444333 357776666665421 0112333321 0 11111111 000111111 Q ss_pred CeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhh-c Q lcl|Aclame:pro 223 GVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQ-D 301 (510) Q Consensus 223 ~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~-~ 301 (510) ...+. ....++..+|++.++. +.+|+|=.+...+-+..++.+.-.+....+....|.+++.-.......... . T Consensus 214 ~~~~~-~~~~~~~~vPvv~~~n-----n~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 287 (472) T protein:vir:93 214 NSKTH-FSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRL 287 (472) T ss_pred ccccc-cccCCCCCcceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccchhhHHH Confidence 11111 2223345788887764 458999999999999999988888888888888887666311111111111 0 Q ss_pred -CCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc-c-cCCCCCCCCHHHHH-------HHHHHHHHH Q lcl|Aclame:pro 302 -AEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-A-NQRDAERVTAEEVR-------ITAEEAENT 371 (510) Q Consensus 302 -~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~-~-~~~~~~~vTAtEi~-------~r~~E~~~~ 371 (510) ...+.+......+++.+... .+.......++.++..|...-..- . ...-+...|+.-+. .+++++... T Consensus 288 ~~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~ 365 (472) T protein:vir:93 288 LRYYGAIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARK 365 (472) T ss_pred HhhccccccCCCCcceeEeec--CCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHHHHHHHHHHH Confidence 11123322333344444322 345666777777777775543221 1 11222334554432 233444444 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHH Q lcl|Aclame:pro 372 LGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLP 451 (510) Q Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d 451 (510) ++..+.+ ++..++.++. .. .....+.+.+. +..+-..+..++. +..+++. +... T Consensus 366 ~~~~l~~--------~~~li~~~~~---~~-~~~~~i~v~f~-~~~p~~~~~~~~~-------~~k~~gi------is~e 419 (472) T protein:vir:93 366 AKVAIQE--------LLWFVFEHFD---IK-GEHKDVDISFN-YNKVANTELQVQT-------AQQSMGI------VSHE 419 (472) T ss_pred HHHHHHH--------HHHHHHHHhC---CC-cccceeeEEeC-CCCCCCHHHHHHH-------HHHHhcc------CchH Confidence 4443333 2222222221 11 11123444332 1112112222222 1112221 1112 Q ss_pred HHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 452 KMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 452 ~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) .++ ..++ ++.+ ++|++...+ ++.++..++ .. ..+ +.+.+...+- T Consensus 420 t~l----~~l~-----~~~d~~~E~~ri~~-E~~~~~~~~-~~---~~~-~~~d~~~~~~ 464 (472) T protein:vir:93 420 TVL----ENHP-----FVEDLQAELERIEQ-EQMEYNKQL-PN---LDD-GGADGAQQQE 464 (472) T ss_pred HHH----HhCC-----CCCCHHHHHHHHHH-HHHHHHHhc-cC---cCc-ccCCCCCCCC Confidence 222 2222 1222 344433222 222111111 00 000 0000011111 No 119 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=95.75 E-value=0.0015 Score=36.04 Aligned_cols=422 Identities=9% Similarity=0.040 Sum_probs=172.1 Q ss_pred ChhHHHHHHHHHh-cc----CchHHHHHHHHhhcccc---cC--------CCC--CCccccccc-cccchHHHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-DG----SVEQRAIEFAKTTLPYL---MV--------DPM--SGSRGVVEH-DFQSAGALLVNNLAA 61 (510) Q Consensus 1 ~k~~~~~r~~~lk-r~----~~~~~w~e~~~~~~P~~---~~--------~~~--~~~~~~~~~-~~dstg~~a~~~Laa 61 (510) |-..--++ +... +. .+..+|+-|.+.+--.. .+ ... +....++.+ .|=+. +....+ T Consensus 1 ~~~~~~~~-~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~----~~~tl~ 75 (489) T protein:vir:78 1 MLTENGQG-SGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYNF----TRRTLS 75 (489) T ss_pred CccCCCcc-CCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCCh----HHHHHH Confidence 11000000 0000 00 02334554444322210 00 000 000111111 12222 233334 Q ss_pred HHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCC Q lcl|Aclame:pro 62 KLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSD 141 (510) Q Consensus 62 ~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~ 141 (510) .|++.+|- ..|++. +++ .++.++++| -+...+.+.-+...+.+...+|-+.+++|.+ T Consensus 76 ~l~G~vfr-k~p~~~--~p~--------------~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P 132 (489) T protein:vir:78 76 GMVGSVMR-KEPEIN--IPK--------------ELEYLLKNA------DGSGVGLIQHAQDTLMEIDSVGRGGLLVDAP 132 (489) T ss_pred HHhchhhc-CCccee--ccH--------------HHHHHHhcc------CCCCCCHHHHHHHHHHHHHhcCeEEEEEeeC Confidence 44444443 234442 222 234454444 3556778888899999999999999999865 Q ss_pred CC---------------eEEEEEeceE---EEee-CCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEE Q lcl|Aclame:pro 142 EA---------------TVVAWSLRSY---AVRR-DATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYT 202 (510) Q Consensus 142 ~~---------------~~~~~pl~~~---~v~~-d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~ 202 (510) .. .+..|+-.+. -..+ |+.+++.-+..+++...++=...|+ .+.++.|. T Consensus 133 ~~~~~T~ade~~~~~rPy~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~------------~~~~~q~R 200 (489) T protein:vir:78 133 ETGAATAAEQNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFE------------TKYGEQYR 200 (489) T ss_pred CCCCcCHHHHHHhcCCcEEEEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCcc------------ceeEEEEE Confidence 32 1555654443 2222 4444666666677555443333443 34555555 Q ss_pred EEEeecCCCeeEEEEEEeeCCeee-------ccccccccccCceEEEeeeecCCCccc--cchHHHHHHHHHHHHHH--- Q lcl|Aclame:pro 203 HVQRRKGTAMDYAEMYHEIDGVRV-------GETGRWPIHLCPYIVPTWNLAPGEHYG--RGHVEDYIGDFAKLSLL--- 270 (510) Q Consensus 203 ~v~~~~~~~~~~~sv~~e~~~~~~-------~~~~~y~~~~~P~~~~Rw~~~~ge~YG--rgp~~~~l~d~~~L~~l--- 270 (510) ++.+...+.+.+..+....+|... ..+++ +.+++|++.|--..+..+. ..|.. |+..||.- T Consensus 201 vL~~~~~g~~~~~~~r~~~~g~~~~~~~~~~~~~g~---~~l~~IPfv~~~~~~~~~~~~~pPLl----~LA~lni~Hy~ 273 (489) T protein:vir:78 201 VLDIDSDGNYRQRLFRFDAEGGAQEDVVEIYPDLGE---SLRGVIPFTFIGATNNDATIDDAPLL----PLAELNIGHYR 273 (489) T ss_pred EEecCCCcceEEEEEEeecCCcccceeeEEeccCCC---CccCeeeEEEEecCCCCCCCCcCchH----HHHHHHHHHhh Confidence 555543333333333323233211 12222 3578888888766665554 44533 55555532 Q ss_pred HHHHHH-HHHHhhCCceeeCC-CCccchhhhhcCCCcceecCCcc--------ccccccCCCccchHHHHHHHHHHHHHH Q lcl|Aclame:pro 271 SEKLGL-YELESLEVLNLVDE-AKGAVVDDYQDAEMGDYVPGGAE--------AVRAYERGDYNKMAAIQQSLQAVVVRL 340 (510) Q Consensus 271 ~~~~l~-~~~~a~~~~~lv~~-~g~~~~~~~~~~~~G~~~~g~~~--------~v~~~~~~~~~~~~~~~~~i~~~~~~I 340 (510) +.+-++ ....+.-|.+.+.. +.. ....+..+....++-|... +.+.++.. ......+.+.++++++ T Consensus 274 ~ssd~~~~l~~~~~P~l~i~G~d~~-~~~~~~~~~~~~i~~g~~~~~~lp~~~~~~~ie~~---~~~~~r~~l~~le~qm 349 (489) T protein:vir:78 274 NSADNEESSFVVGQPTLFIYPGENL-TPQAFKEANPNGIKFGSRRGHNLGYGGSAQLIQAG---ENNLARQNMLDKEQQA 349 (489) T ss_pred hhhHHHHHHHHcccceeeeecCccC-CcccccccCccceeeCCcccccCCCCCCcceeccC---cchHHHHHHHHHHHHH Confidence 222233 33444445443321 111 0111111111112222211 12222322 1233467777777776 Q ss_pred HHHHhhcccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhh-cCCCCCCccceeeEEeecHHHH Q lcl|Aclame:pro 341 NQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIETGLPAL 419 (510) Q Consensus 341 ~~af~~~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~p~~~~~~~~vs~l~~l 419 (510) .++=. .++... .+.||++.+.+...--..|+.+...+++-+ .+++.++-+ -|.. .+..+.+.+-. +.. T Consensus 350 ~~lGa-~l~~~~-~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al-----~~~l~~~a~w~G~~--~~~~~~i~~n~--dF~ 418 (489) T protein:vir:78 350 IQIGA-QLITPT-QQITAQSARIQRGADTSVMATIARNVSQAY-----TDALRWVAVMLGKP--EDTEVEFRLNM--DFF 418 (489) T ss_pred HHHhh-hhccCC-cchhHHHHHHHHHHhhHHHHHHHHHHHHHH-----HHHHHHHHHHcCCC--CCCceEEEeec--ccC Confidence 65321 122332 358999999999999999999888876653 444444432 1221 12223222211 000 Q ss_pred HHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 420 SRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEG 499 (510) Q Consensus 420 ~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~ 499 (510) .+..+.+.+.+++..... + .|..+.+.+.+ ...||.. .+.++++.+.+.+- .+. T Consensus 419 ~~~~d~~~~~al~~~~~~--G------~is~~t~~~~L-~~~gv~d----~~~e~~~~ei~~~~-------------~~~ 472 (489) T protein:vir:78 419 LEPMTAQDRAAWMADINA--G------LLPATAYYAAL-RKAGVTD----WTDADIKDAVADQP-------------LPV 472 (489) T ss_pred cccCCHHHHHHHHHHHhc--C------CCCHHHHHHHH-HhCCCCC----ccHHHHHHHHhhcC-------------CCc Confidence 111122222222222211 1 12222233322 2234431 12222222211100 000 Q ss_pred HHHh-hcccCCC Q lcl|Aclame:pro 500 ASDM-TNALAGV 510 (510) Q Consensus 500 a~~~-~~~~ag~ 510 (510) +.+. +..+++- T Consensus 473 ~~~~~g~~~~~~ 484 (489) T protein:vir:78 473 ATEVQGEIPQSA 484 (489) T ss_pred ccCCcccCCCCc Confidence 0000 0000000 No 120 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=94.92 E-value=0.0032 Score=34.27 Aligned_cols=417 Identities=12% Similarity=0.023 Sum_probs=168.2 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhcccccCC-CCC------------Cccccccccc--cchHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVD-PMS------------GSRGVVEHDF--QSAGALLVNNLAAKLAR 65 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~~~~~-~~~------------~~~~~~~~~~--dstg~~a~~~Laa~l~~ 65 (510) |-+-=.++-.+.. ........|.. ..... ..+ -+...+...| +..+-.+|++.|..+ T Consensus 1 ~~~~~~a~~~~~~-----~~a~~~~~~~~-~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~iVd~~a~d~-- 72 (461) T protein:vir:80 1 MYSIDKAKQAKID-----SKIVNRNDFMV-GHGKANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMNIVDIISEDM-- 72 (461) T ss_pred Cccchhhhhhhhh-----hhhhhhhHHHh-hcCCcchhhhhhccccCcccccCHHHHHHHHHhCCccchhhccchHHh-- Confidence 3221111111111 11111122221 00000 000 0011111112 223333444444333 Q ss_pred hhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCC-- Q lcl|Aclame:pro 66 SLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA-- 143 (510) Q Consensus 66 ~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~-- 143 (510) | +.|+.+.-.++.. .+.++.++ .+-+....+.++++.--.+|.+.+++.-.++ T Consensus 73 --~---r~g~~i~~~~~~~---------~~~~~~~~-----------~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~ 127 (461) T protein:vir:80 73 --V---RAGWSLKTDNKEM---------KKNIESKW-----------RKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNR 127 (461) T ss_pred --h---cCCeeeecCCHHH---------HHHHHHHH-----------HHhhHHHHHHHHHHhhcccccEEEEEEeecCCc Confidence 3 4688876544321 11233333 2336788999999999999988777643211 Q ss_pred --eEEEEEeceEEEeeCCCCce--eEEEEEEEecHHHHhHHhhHHhhcccccCCC-CceEEEEEEEEeecCCCeeEEEEE Q lcl|Aclame:pro 144 --TVVAWSLRSYAVRRDATGRW--MDIVLKQRYKSKDLDDVYKQDLMRAGRNLSG-SGSVDLYTHVQRRKGTAMDYAEMY 218 (510) Q Consensus 144 --~~~~~pl~~~~v~~d~~G~v--~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~-~~~v~v~~~v~~~~~~~~~~~sv~ 218 (510) ....-|| ....-+.+ ..+|.+..++...+.. +.. +| +-+-+.|+........ .+ T Consensus 128 ~~~~~~~pl-----~~~~~~~~~~l~~~~~~~i~~~~~~~----dp~------sp~fg~P~~y~i~~~~~~~------~~ 186 (461) T protein:vir:80 128 EQADLSTAI-----DPKTIKSIPYINTFNTQKVTQLYLNQ----DMF------SEHFGEVEFFEVNRVSQLG------EE 186 (461) T ss_pred cccCccCCc-----ccccccceeEEEeccccccchhhhcc----cCc------CcccccceEEEEecccccc------cc Confidence 1111122 11111111 1233333333222211 111 11 1111222221111000 00 Q ss_pred EeeCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCC------CC Q lcl|Aclame:pro 219 HEIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE------AK 292 (510) Q Consensus 219 ~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~------~g 292 (510) .. .+ .........|..+++...-...++..||+|..+..++.++..........+-...+.-..+-.+. +. T Consensus 187 ~~-~~--~~~~~~~~iH~SRii~~~~~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~ 263 (461) T protein:vir:80 187 IL-SG--TTASTSEQIHRSRIIHEQGLRFEGETKGRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDD 263 (461) T ss_pred cc-cc--ccCccceEEccccEEEecCCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhhchH Confidence 00 00 00011111233445555555566778899999999999999988887777655444433333321 00 Q ss_pred ---ccchhhhhcCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHh--hc--ccCCCCCCCCHHHHHHHH Q lcl|Aclame:pro 293 ---GAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM--YG--ANQRDAERVTAEEVRITA 365 (510) Q Consensus 293 ---~~~~~~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~--~~--~~~~~~~~vTAtEi~~r~ 365 (510) ....-.......|..+-+..+++..+.. ++.-+...+....+.|.-+-= .. ..+..+..-|..+ T Consensus 264 ~~~~~~~~~~~~~~~g~~~~d~~e~~e~~~~----~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge~----- 334 (461) T protein:vir:80 264 KANLTAMLDFMFRTEALAIIKGDEQLTKEST----NVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQY----- 334 (461) T ss_pred HHHHHHHHHHhcCCceEEEEcCCcceEEEec----CcCCHHHHHHHHHHHHhhhhcCCeeeeecccCCccccchH----- Confidence 0100111111234444444444443322 333445566666777766541 11 1233344434332 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhc--C-CCCCCcc--ceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 366 EEAENTLGGTYSLLAENLQSPLAYVCLSEVDDA--L-LQGLITK--QHKPAIETGLPALSRSAAVQSMLNASQVIAGLAP 440 (510) Q Consensus 366 ~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~--~-l~~~p~~--~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~ 440 (510) =....---+.+++...+.|.+++++.+|.+. + -+++.+. ++.+++- +|.++.-..+++......+..+.+.+ T Consensus 335 --D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~-~L~~~s~kekAe~~~~~a~a~~~~~~ 411 (461) T protein:vir:80 335 --DVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFN-PLWNLDSKTDAEVRKLTAEADQIYIV 411 (461) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEeC-CCCCCCHHHHHHHHHHHHHHHHHHHh Confidence 1222333456667778899999999987542 2 2333333 4444442 33332333333322222222222221 Q ss_pred hHhHhhcCCHHHHHHHHHHHcCCCHhhccC----CHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 441 IAQLDPRISLPKMMDTIWAAFSVDTSQFYK----SADELQAEAEEQRRQAAQAQAAQETLLEG 499 (510) Q Consensus 441 ~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~----s~ee~~~~~~~~~qqa~~~~~a~~~~~~~ 499 (510) ...|+.+++.+.+....|+++..... ..|+++....+ ..+.+.. .| T Consensus 412 ----~g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~e~~-----~g 461 (461) T protein:vir:80 412 ----NGVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVYD----AYAKKNA-----DG 461 (461) T ss_pred ----cCCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhccc----cccccCC-----CC Confidence 12488888888887777765432222 11221111110 0000000 00 No 121 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=94.76 E-value=0.0036 Score=34.00 Aligned_cols=420 Identities=13% Similarity=0.093 Sum_probs=175.5 Q ss_pred Ch--------hHHHHHHHHHhccC-chHHHHHHHHhhcccccCCCCC-Cccccccc-cccchHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 1 MK--------STAAMLWEKLRDGS-VEQRAIEFAKTTLPYLMVDPMS-GSRGVVEH-DFQSAGALLVNNLAAKLARSLFP 69 (510) Q Consensus 1 ~k--------~~~~~r~~~lkr~~-~~~~w~e~~~~~~P~~~~~~~~-~~~~~~~~-~~dstg~~a~~~Laa~l~~~ltp 69 (510) |= ..+..+|+.++..- =...+++..+-.||.. +.+.+ .-..++.+ .|-+.-.+.++. |++.+|. T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~-~~E~~~~Y~~rl~rA~~~n~~~~t~~~----~~G~vf~ 75 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKL-SGQTDDMYNAYKQRALFYSITSKTLSA----LSGMVLD 75 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCC-CCCCHHHHHHHHhhccCCchHHHHHHH----Hhchhhc Confidence 11 11233444332110 0122333333334432 11111 11222222 234444444444 4444443 Q ss_pred ccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCCe----E Q lcl|Aclame:pro 70 TGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEAT----V 145 (510) Q Consensus 70 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~~----~ 145 (510) .++ .++.++. ++.+. .-....+.+.-+...+.+...+|-+.+++|-+... + T Consensus 76 --k~p-~~~~p~~--------------l~~~~--------~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~ 130 (452) T protein:vir:94 76 --QPP-VITHPDA--------------MSKYF--------EDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYI 130 (452) T ss_pred --CCc-eecccHH--------------HHHHH--------hcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEE Confidence 111 2233321 12211 12557788888899999999999998999876432 4 Q ss_pred EEEEece-EEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCe Q lcl|Aclame:pro 146 VAWSLRS-YAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGV 224 (510) Q Consensus 146 ~~~pl~~-~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~ 224 (510) ..|+-.+ .=+..|..|+..-+..+++...++-.++|+.+.. +.|.+....++ ++- ..+|-..++. T Consensus 131 ~~~~~~~Ii~W~~~~~g~l~~v~lre~~~~~d~~d~f~~~~~------------~~yRvL~l~~g-~~~-v~~~~~~~~~ 196 (452) T protein:vir:94 131 SVYTTENILNWEEDEDGRLLMVVLREFYTVRDTADRYVQNIR------------VRYRCLELVDG-LLQ-ITVHETQDGK 196 (452) T ss_pred EEechhhhcCccccccCCeeEEEEEEEEEEecCCCcccceeE------------EEEEEEEEeCC-eEE-EEEEEccCCc Confidence 4555433 2244566677766777776666665566664322 23332222221 110 0011111111 Q ss_pred -------eeccccccccccCceEEEeeeecCCCc--cccchHHHHHHHHHHHHHH----HHHHHHHHHHhhCCceeeCCC Q lcl|Aclame:pro 225 -------RVGETGRWPIHLCPYIVPTWNLAPGEH--YGRGHVEDYIGDFAKLSLL----SEKLGLYELESLEVLNLVDEA 291 (510) Q Consensus 225 -------~~~~~~~y~~~~~P~~~~Rw~~~~ge~--YGrgp~~~~l~d~~~L~~l----~~~~l~~~~~a~~~~~lv~~~ 291 (510) .....++ +.+++|++.|-...+.. -|..|.. |+..||.- +-..-+.+..+..|...+. T Consensus 197 ~~~~~~~~~~~~~~---~~l~~IP~v~~~~~~~~~~~~~pPLl----~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~-- 267 (452) T protein:vir:94 197 VWELAKTSTIQNVG---VTMDYIPFFCITPSGLSMTPAKPPMI----DIVDINYSHYRTSADLEHGRHFTGLPTPWIT-- 267 (452) T ss_pred eeeeccceeecCCC---cccceeEEEEEcCCCCCCCCCccchH----HHHHHHHHHhcchhHHHHHHHHcccceeEee-- Confidence 1112222 24667777776555543 3445533 55555432 2223334455555644442 Q ss_pred CccchhhhhcCCCcce-ecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCHHHH-HHHHHHHH Q lcl|Aclame:pro 292 KGAVVDDYQDAEMGDY-VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEEV-RITAEEAE 369 (510) Q Consensus 292 g~~~~~~~~~~~~G~~-~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~vTAtEi-~~r~~E~~ 369 (510) |....+.+..+++..+ .|.......-++. .+..+......|+++++++.++=- .++.......|++|- ..+..... T Consensus 268 g~~~~~~i~iG~~~~~~lpe~~~~~~yie~-~g~~i~~~~~~l~~le~~m~~~Ga-~ll~~~~~~~~s~ea~~~~~~~~~ 345 (452) T protein:vir:94 268 GAESQSTMHIGSTKAWVIPEVAAKVGFLEF-TGQGLQSLEKALSEKQAQLASLSA-RLIDNSTRGSEATETVKLRYMSET 345 (452) T ss_pred cCcCCCceEecccccccCCCCCCcceEEcc-CchhHHHHHHHHHHHHHHHHHHHH-HhhccCCCcchHHHHHHHHHHHhh Confidence 3333334444332222 2321122333332 245678888889999988866332 233333323445544 44555456 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHhh-cCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcC Q lcl|Aclame:pro 370 NTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRI 448 (510) Q Consensus 370 ~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~i 448 (510) ..|..+..+++.-+ ++++.++.+ -|. ...+++.+-... ..+..+.+.+.++++... ++ .| T Consensus 346 s~L~~~a~~~e~al-----~~~l~~~a~w~g~----~~~~~v~~n~dF--~~~~~~~~~~~al~~~~~--~G------~i 406 (452) T protein:vir:94 346 ASLKSVTRAVEALL-----NKAYSCIMDMESM----GGTLNIKLNSAF--LDSKLTAAELKAWVEAYL--SG------GI 406 (452) T ss_pred HHHHHHHHHHHHHH-----HHHHHHHHHHcCC----CCceEEEecccc--ccccCCHHHHHHHHHHHh--cC------CC Confidence 88888888876654 455555433 122 123344332211 011111222222332211 11 23 Q ss_pred CHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 449 SLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDM 503 (510) Q Consensus 449 d~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~ 503 (510) ..+.+...+- ..||+. . +.|+.....+...+ + .........+.++. T Consensus 407 s~~t~~~~L~-~~gvl~--~--~~e~~~i~~E~~~~-~---~~~~~~~~~~~~~~ 452 (452) T protein:vir:94 407 SKEIYIHALK-VGKVLP--P--PGESMGVIPDPPAP-E---PSPSNTPPNPSSKA 452 (452) T ss_pred cHHHHHHHHH-hCCCCC--C--ccCHHHHHHHhhcc-C---cccCCCCCCCccCC Confidence 3334444433 356652 1 11111111111100 0 00000001111111 No 122 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=94.44 E-value=0.0044 Score=33.50 Aligned_cols=376 Identities=11% Similarity=0.018 Sum_probs=169.8 Q ss_pred ChhHHHHHHH-HHh-ccCchHHHHHHHHhh--cccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccc Q lcl|Aclame:pro 1 MKSTAAMLWE-KLR-DGSVEQRAIEFAKTT--LPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~k~~~~~r~~-~lk-r~~~~~~w~e~~~~~--~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~ 76 (510) |-..+..+.. ++. +.+...+..++|+-. +|++-+ .-...-...-+..-+-+..+|++||..|. ..+ |+ T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~-~~p~~~~~~~~~v~nw~~~iVds~a~rl~----~~G---f~ 72 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHKRRAEMRYEQYAMKHVDRFKGI-TIPQALSQQYRSILGWCAKGVDSLADRLV----FRE---FE 72 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHHhccCchhhcch-hhhHHHHHHHhhhcChhHHHHHHhHhhcc----ccc---cc Confidence 6555544433 332 222222333344321 111100 00000000112233455666666655442 111 11 Q ss_pred cCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC---CeEEEEEeceE Q lcl|Aclame:pro 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE---ATVVAWSLRSY 153 (510) Q Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~---~~~~~~pl~~~ 153 (510) .+| .+ +.+....++|.....++.++..++|.+.+++..++ .++++++..+. T Consensus 73 --~~d-------------~~-----------l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~~i~~~sP~~~ 126 (409) T protein:vir:16 73 --NDD-------------FT-----------VNEIFEENNPDIFFDSTVLSALIASCSFTYISKGENDAVRLQVIEATNA 126 (409) T ss_pred --Ccc-------------hH-----------HHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccce Confidence 111 11 22345678999999999999999999887766532 24777766554 Q ss_pred EEeeCC-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeecccccc Q lcl|Aclame:pro 154 AVRRDA-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRW 232 (510) Q Consensus 154 ~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y 232 (510) ++..|+ .+++...++...- ..........+| .+. ... ++...++......-++ T Consensus 127 ~~i~D~~~~~~~~a~~~~~~-----------------d~~~~~~~~~~~---~~~--~~~----~~~~~~~~~~~~~~~~ 180 (409) T protein:vir:16 127 TGIIDPITGLLTEGYAVLER-----------------DENNNVVLEAHF---LPD--RTD----YYYRDSRNNISIANPT 180 (409) T ss_pred EEEeecccccceeeeEEEEe-----------------cCCCceEEEEEE---ecC--cEE----EEEecCccccceecCC Confidence 444454 3555544432210 000111111222 111 100 1111222222222334 Q ss_pred ccccCceEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHhhCCceee---CCCCccchhhhhcCCCcce- Q lcl|Aclame:pro 233 PIHLCPYIVPTWNLAPGEHYGRGHV-EDYIGDFAKLSLLSEKLGLYELESLEVLNLV---DEAKGAVVDDYQDAEMGDY- 307 (510) Q Consensus 233 ~~~~~P~~~~Rw~~~~ge~YGrgp~-~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv---~~~g~~~~~~~~~~~~G~~- 307 (510) ..||++.+..+...++.||+|=. +..++-+..+|...-..+..++..+.|...+ .++|. ..+.+.. ..|.+ T Consensus 181 --g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~-~~~~~~~-~~~~i~ 256 (409) T protein:vir:16 181 --GNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAE-PMETWKA-TVSSML 256 (409) T ss_pred --CCcceEEecccccccccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCC-ccchhhh-hhhHhh Confidence 57999999999999999999954 5688888999998888888888888775544 23331 1111211 12222 Q ss_pred -ecCCccc--cccccCCCccchHHHHHHHHHHHHHHHHHHhhcccC-----CCCCC-CCHHHH-------HHHHHHHHHH Q lcl|Aclame:pro 308 -VPGGAEA--VRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQ-----RDAER-VTAEEV-------RITAEEAENT 371 (510) Q Consensus 308 -~~g~~~~--v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~-----~~~~~-vTAtEi-------~~r~~E~~~~ 371 (510) +|...++ ++.-++ ..++++.. ++.++.-|+...+..... ....+ -+|.-| ..+++++... T Consensus 257 ~~~~d~~g~~~~v~q~-~~~~l~~~---~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~ 332 (409) T protein:vir:16 257 QFTKDEDGDKPTLGQF-TQPSMSPF---TEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRS 332 (409) T ss_pred ccCCCCCCCCceEEec-CCCChhHH---HHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHH Confidence 2322221 222223 23455543 444444444433322211 11122 233322 3356667777 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccce--eeEEe--ecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhc Q lcl|Aclame:pro 372 LGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQH--KPAIE--TGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPR 447 (510) Q Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~--~~~~v--s~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~ 447 (510) +|..+.++ +..++.+. ++....+.+.. ++..- .+-+..+.++.+..+.-+.+.. .++++ T Consensus 333 fg~~l~~~--------~rla~~~~--~~~~~~~~~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~---~~~~~---- 395 (409) T protein:vir:16 333 LGAGLLNV--------AYLAACLR--DDVPYLREQFSKTKPKWEPLFEADASMLSLIGDGAIKLNQAI---PEFIN---- 395 (409) T ss_pred HHHHHHHH--------HHHHHHHh--cCCCccchhhccceEEecCCCCcchhhHHHHHHHHHHHHhhc---ccccc---- Confidence 77666542 22223332 33344444433 33332 1222323344444444333321 11111 Q ss_pred CCHHHHHHHHHHHcCCCHhh Q lcl|Aclame:pro 448 ISLPKMMDTIWAAFSVDTSQ 467 (510) Q Consensus 448 id~d~~~~~~a~~~Gvp~~~ 467 (510) -+.+.+.+|+.... T Consensus 396 ------~~v~~~~~g~~~~d 409 (409) T protein:vir:16 396 ------KDTIRDLTGIKGAE 409 (409) T ss_pred ------hhHHHHhccCCCCC Confidence 12234445555432 No 123 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=94.40 E-value=0.0045 Score=33.44 Aligned_cols=414 Identities=11% Similarity=0.062 Sum_probs=167.2 Q ss_pred Chh-HHHHHHHHH--hccCchHHHHHHHHhhccc-----c-cCCCC-----------CCccccccccccchHHHHHHHHH Q lcl|Aclame:pro 1 MKS-TAAMLWEKL--RDGSVEQRAIEFAKTTLPY-----L-MVDPM-----------SGSRGVVEHDFQSAGALLVNNLA 60 (510) Q Consensus 1 ~k~-~~~~r~~~l--kr~~~~~~w~e~~~~~~P~-----~-~~~~~-----------~~~~~~~~~~~dstg~~a~~~La 60 (510) |.- ++.+..+.+ +.+.-..+++++.+|..-. + ...+. +.......++..+.+...++..+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 221 112222222 1111223444444443211 0 00000 00011112344555555555555 Q ss_pred HHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EE Q lcl|Aclame:pro 61 AKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YR 138 (510) Q Consensus 61 a~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l--~~ 138 (510) +.|.+ -|+. +..+++. +.+.|.. +...+|.....++.++...+|.+.+ |. T Consensus 81 ~yl~G--~p~~-----~~~~~~~-------------~~~~l~~--------~~~n~~~~~~~~~~~~~~~~G~~~~~v~~ 132 (471) T protein:vir:10 81 AYALT--YPPT-----FDVDDKK-------------VNDMIVD--------VLGDDYERISKQLCVNAGNAGIAWLHVWK 132 (471) T ss_pred hhhcc--cCce-----eccCChH-------------HHHHHHH--------HHhcCHHHHHHHHHHHHhhCCeEEEEEEe Confidence 54443 2222 2333321 2222211 2246889999999999999998764 45 Q ss_pred eCCCCe--EEEEEeceEEEeeCC--CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEE----EEeecCC Q lcl|Aclame:pro 139 NSDEAT--VVAWSLRSYAVRRDA--TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH----VQRRKGT 210 (510) Q Consensus 139 ~~~~~~--~~~~pl~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~----v~~~~~~ 210 (510) +...+. +.+++..+.++.-|. .+++...+|.|..... ...+....+++|+- .+...+. T Consensus 133 d~~~g~~~~~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~--------------~~~~~~~~~~vy~~~~~~~y~~~~~ 198 (471) T protein:vir:10 133 DASDNSFRYACVDSKEVIPIYSKSLDKKSIGVLRVYSSIDE--------------TDGKNYTVYEYWNDKECSFYRHEKE 198 (471) T ss_pred eCCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEeecc--------------CCCceeEEEEEEeCCcEEEEEecCC Confidence 543344 555655554443333 4567666666643211 11122223333321 0111111 Q ss_pred CeeEEEEE--------EeeCCee-eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 211 AMDYAEMY--------HEIDGVR-VGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELES 281 (510) Q Consensus 211 ~~~~~sv~--------~e~~~~~-~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a 281 (510) .....+. .-.+|.. .......++..+|++..+. +.+|.|=.+...+-+-.++.+.-......+.. T Consensus 199 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~ 272 (471) T protein:vir:10 199 -KPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFKN-----NEIETNDLKPIKDLVDVYDKVFSGFVNDTDDV 272 (471) T ss_pred -cccccccccccccccccccccccccccccCCCCceeEEEecc-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHh Confidence 1111110 0011111 1111122345688876654 45789989999999999998888888888888 Q ss_pred hCCceeeCC-CCccchhhhhcCC-Cccee-cCC----ccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cccCCCC Q lcl|Aclame:pro 282 LEVLNLVDE-AKGAVVDDYQDAE-MGDYV-PGG----AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDA 353 (510) Q Consensus 282 ~~~~~lv~~-~g~~~~~~~~~~~-~G~~~-~g~----~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~ 353 (510) .+|.+++.- ++....+...... .+.+. ++. ..++..+. ...+.+.....++.+++.|...-.. +...-.. T Consensus 273 ~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~ 350 (471) T protein:vir:10 273 QEVIFVLTNYGGQDKQEFLEDLKRYKMIKMDNDGMGDQSGVTTIA--IDIPTEARNLILERTKKQIFISGQGVNPETDKL 350 (471) T ss_pred hCceeeeecCCccccchhHHHhhcCCeEEecCCCCccCccceEEe--ecCChHHHHHHHHHHHHHHHHHhCCcCCCcccc Confidence 888766632 1222222222211 12222 111 11333332 2246677788888888887654322 1111111 Q ss_pred CCCCHHHHHHH-------HHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecH--HHHHHHHH Q lcl|Aclame:pro 354 ERVTAEEVRIT-------AEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGL--PALSRSAA 424 (510) Q Consensus 354 ~~vTAtEi~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l--~~l~r~~~ 424 (510) ...|+.-+..+ +.++...++..+.+ ++..+..++. .. ...++.+.+.-.+ +....+ T Consensus 351 gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~--------~~~li~~~~~---~~--d~~~i~i~f~~~~p~n~~e~~-- 415 (471) T protein:vir:10 351 GNSSGVALKFLYSLLELKAGNMETQFRSGYAT--------LVKMILKHLG---LS--DKLKIKQTWTRNSINNDTEMA-- 415 (471) T ss_pred cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHhc---cC--CCceeEEEeCCCCCCCHHHHH-- Confidence 23455444332 33333333333322 2222222221 11 1234555554322 222222 Q ss_pred HHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 425 VQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGASDM 503 (510) Q Consensus 425 ~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s-~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~ 503 (510) +.+..+++ .+.-..++. .++ ++.+ ++|++...+++..+ ++ +.....++... T Consensus 416 --------~~~~kl~g------~iS~et~~~----~~p-----~v~D~~~E~eri~~E~~~~---~~--~~~~~~~~~~~ 467 (471) T protein:vir:10 416 --------QVVSTLAT------ITSRENVAK----SNP-----IVEDWQDELRLQKAEQEGR---SE--KLYDMEEVEHE 467 (471) T ss_pred --------HHHHHHhc------cCchHHHHH----hCC-----CCCCHHHHHHHHHHHHHHH---Hh--cccccCCCCCc Confidence 12222222 122222222 221 2222 33443322222111 11 00011111111 Q ss_pred hccc Q lcl|Aclame:pro 504 TNAL 507 (510) Q Consensus 504 ~~~~ 507 (510) .-.. T Consensus 468 ~e~~ 471 (471) T protein:vir:10 468 SEVE 471 (471) T ss_pred cccC Confidence 0011 No 124 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=93.47 E-value=0.0075 Score=32.25 Aligned_cols=412 Identities=11% Similarity=0.021 Sum_probs=169.4 Q ss_pred ChhHHHHHHHHHh-ccCchHHHHHHHHh--hccccc---CCCCCCcc-ccccccccchHHHHHHHHHHHHHHhhcCccCc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-DGSVEQRAIEFAKT--TLPYLM---VDPMSGSR-GVVEHDFQSAGALLVNNLAAKLARSLFPTGIP 73 (510) Q Consensus 1 ~k~~~~~r~~~lk-r~~~~~~w~e~~~~--~~P~~~---~~~~~~~~-~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~ 73 (510) ..+.+.+..++.+ |-....+.+++++- -++.+- ........ ....++..+-+...++..++.|++ -| T Consensus 27 ~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g--~p---- 100 (474) T protein:vir:96 27 QEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWRMFTNYHQNLVDQKVAYAVA--NP---- 100 (474) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhcccccccccccchhcccchHHHHHHhhhhhhcc--cC---- Confidence 2222333333332 22222233333321 111111 11111111 111234555666666666655543 12 Q ss_pred ccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC-CeEEEEEe Q lcl|Aclame:pro 74 FFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDE-ATVVAWSL 150 (510) Q Consensus 74 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l--~~~~~~-~~~~~~pl 150 (510) .+++.+++.. ...++.| + ..||.....++.++..++|.+.+ |.+++. .++.+++. T Consensus 101 -~~~~~~d~~~---------~~~l~~~-----------~-~n~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~i~~~~p 158 (474) T protein:vir:96 101 -VTFSSDDDKS---------LKTIQEV-----------L-NHKWDDKLVDILTAASNKGIEWLQPYIDENGEFKTFRVPA 158 (474) T ss_pred -ceeecCchHH---------HHHHHHH-----------H-hcCHHHHHHHHHHHHHhcCeeEEEEEecCCCceEEEEEcc Confidence 1223333221 1112222 2 35788888999999999998764 454442 23556665 Q ss_pred ceEEEeeC--CCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEE--EE--EeecCCCeeEEEEEEe---e Q lcl|Aclame:pro 151 RSYAVRRD--ATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYT--HV--QRRKGTAMDYAEMYHE---I 221 (510) Q Consensus 151 ~~~~v~~d--~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~--~v--~~~~~~~~~~~sv~~e---~ 221 (510) .+.++..| ..+++...+|.++.. ....+++|+ .| +...++.......+.. . T Consensus 159 ~~~~~v~d~~~~~~~~~~vr~~~~~--------------------~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~ 218 (474) T protein:vir:96 159 EQAIPIWTNKERDTLKAFIRYYRLD--------------------GAERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQ 218 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEeec--------------------CceEEEEEeCCeEEEEEecCCceeecccccccccc Confidence 55554444 357776666665321 112233332 11 1111111111111111 0 Q ss_pred CCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhh-hh Q lcl|Aclame:pro 222 DGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDD-YQ 300 (510) Q Consensus 222 ~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~-~~ 300 (510) .+..+. ....++..+|++.++. +.+|+|=.+...+-+..+|.+.-......+....|.+++.-...-...+ .. T Consensus 219 ~~~~~~-~~~~~~g~iPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~ 292 (474) T protein:vir:96 219 SHYYVG-NKRVSWGRVPFIPFKN-----NPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDEFMR 292 (474) T ss_pred cccccc-ccccCCCceeEEEecc-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccccchhh Confidence 111111 1223446789887765 4679999999999999999988888888888888876653211111111 11 Q ss_pred cC-CCcce-ecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cc-cCCCCCCCCHHHHHHHHHHHHHHhhhhH Q lcl|Aclame:pro 301 DA-EMGDY-VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GA-NQRDAERVTAEEVRITAEEAENTLGGTY 376 (510) Q Consensus 301 ~~-~~G~~-~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~ 376 (510) .. ..+.+ ++|...++..+.. ..+.+.....++.+++.|...-.. +. ....+...|+.-+..+-.- ..+-.... T Consensus 293 ~~~~~~~i~~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~-l~~k~~~k 369 (474) T protein:vir:96 293 NLKYYKAINVDGDGSGVDTIQI--EVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSN-LDLKANKL 369 (474) T ss_pred hhhcCceEEecCCCCceeEEee--cCChHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHH-HHHHHHHH Confidence 11 11222 2444445554433 246677777788877777553321 11 1111233455544322111 11111222 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEee--cHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHH Q lcl|Aclame:pro 377 SLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIET--GLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMM 454 (510) Q Consensus 377 ~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs--~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~ 454 (510) .+.-.+.+.-++..++.++. . ......+.+.+.- |.+.+..+ +.+.. ++ .+.-..++ T Consensus 370 ~~~~~~~l~~~~~~i~~~~~---~-~~~~~~i~i~f~~~~p~~~~e~~----------~~~~~-ag------~iS~et~~ 428 (474) T protein:vir:96 370 KNKTLTALQELLQYIIDFYK---L-NIKVQDVEITFNFNVMVNELEQS----------QIGVQ-SQ------YLSKETVV 428 (474) T ss_pred HHHHHHHHHHHHHHHHHHhC---C-CcccceeeEEeccCCCcCHHHHH----------HHHHh-cC------CCchHHHH Confidence 22222223223333333321 1 1222334444432 22222111 11111 11 12233333 Q ss_pred HHHHHHcCCCHhhccCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccCCC Q lcl|Aclame:pro 455 DTIWAAFSVDTSQFYKSA-DELQAEAEEQRRQAAQAQAAQETLLEGASDMTN-ALAGV 510 (510) Q Consensus 455 ~~~a~~~Gvp~~~i~~s~-ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~-~~ag~ 510 (510) .. ++ ++.++ +|++. .++++.... +.. ....+ ...+. T Consensus 429 ~~----~~-----~v~d~~~E~~r-i~~E~~e~~--~~~--------~~~~~~~~~~~ 466 (474) T protein:vir:96 429 TN----HP-----WVDDPVAELER-IEQDNIDFN--KQL--------PPLEGDANGRA 466 (474) T ss_pred Hh----CC-----CCCCHHHHHHH-HHHHHHHHH--hcc--------ccccccccccc Confidence 32 21 22222 23332 222221111 111 11111 11112 No 125 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=91.30 E-value=0.017 Score=30.35 Aligned_cols=353 Identities=11% Similarity=0.016 Sum_probs=137.7 Q ss_pred ChhHHHHHHHHHhccCchH-HH-HHHHHhhcccccCCCCCCccccccccc-cchHHHHHHHHHHHHHHhhcCccCccccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQ-RA-IEFAKTTLPYLMVDPMSGSRGVVEHDF-QSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~-~w-~e~~~~~~P~~~~~~~~~~~~~~~~~~-dstg~~a~~~Laa~l~~~ltpp~~~WF~l 77 (510) |+ .|+.+...+... .+ ..+..+..|..+........-...+.. .++--.|++.+|+.+.+. ||- + T Consensus 1 Mg-----lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~V~~~i~~Ia~~ia~l------~~~-~ 68 (384) T protein:vir:49 1 MP-----IFNITNLATESPPSNQDSFFDITDPEFLDALNGSEWVSAETALKNSDLFSIISQLSNDLATA------KIT-T 68 (384) T ss_pred Cc-----cccccccCcccccccchhhccccchhhcccccCCceechhhhhccHHHHHHHHHHHHHHhhC------cee-e Confidence 32 234332111111 01 112233333332211110000001112 233334445554444432 221 1 Q ss_pred CCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcC----CHHHHHHHHHHHHhhCceEEEEeCCC-C-eEEEEEe- Q lcl|Aclame:pro 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNA----SLAVLTQVIKLLIVTGNALLYRNSDE-A-TVVAWSL- 150 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~sn----f~~~~~~~~~~l~~~G~~~l~~~~~~-~-~~~~~pl- 150 (510) .+... . ..+.+-| .+.=....+.++...||+.+++..+. + ....+|| T Consensus 69 --~~~~~-------------~-----------~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~ 122 (384) T protein:vir:49 69 --SRKQL-------------Q-----------GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLR 122 (384) T ss_pred --ecchh-------------h-----------hhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEc Confidence 11100 0 0122222 34444566677888999988876543 2 2344554 Q ss_pred -ceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccc Q lcl|Aclame:pro 151 -RSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGET 229 (510) Q Consensus 151 -~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~ 229 (510) ..+-+..+.++. ..++.++. ++...... T Consensus 123 ~~~v~v~~~~~~~-------------------------------------------------~~~y~~~~--~~~~~~~~ 151 (384) T protein:vir:49 123 PSQVSFNRLDNQN-------------------------------------------------GLYYNITF--DDPRIPPK 151 (384) T ss_pred CceeEEEEcCCCc-------------------------------------------------eEEEEEEe--cCccccce Confidence 333333332221 11111111 11000111 Q ss_pred cccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhc-------- Q lcl|Aclame:pro 230 GRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD-------- 301 (510) Q Consensus 230 ~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~-------- 301 (510) ..++.++ .+..|+....+..||.||...+...+.......+.......-...|..++.-.+....+.... T Consensus 152 ~~~~~~e--Vih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~ 229 (384) T protein:vir:49 152 QHVPQGD--ILHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQSRSRQAM 229 (384) T ss_pred eEecCcc--EEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhc Confidence 1111111 455565556677999999999999999999888888888777778877765434433322110 Q ss_pred CC-Ccc--eecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccC-CCCCCCCHHHHHHHHHH-HHHHhhh Q lcl|Aclame:pro 302 AE-MGD--YVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQ-RDAERVTAEEVRITAEE-AENTLGG 374 (510) Q Consensus 302 ~~-~G~--~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~-~~~~~vTAtEi~~r~~E-~~~~LGp 374 (510) .. .|. +++++. ++.++.. +..+.+. .+..+..++.|.++|-.. .+. .....-|++.+.+...+ ....|-| T Consensus 230 ~~n~~~~~vl~~g~-~~~~l~~-~~~d~q~-~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~~l~p 306 (384) T protein:vir:49 230 KQMQGGPLVLDDLE-DFTPLEI-KSNVAQL-LSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAVSRFLRP 306 (384) T ss_pred ccCCccceecCCCc-eEEEccC-ChhhHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHHHHHHHH Confidence 01 111 222222 2333332 2234443 456677888999998332 111 12233455554433222 2334667 Q ss_pred hHHHHHHHHHHHHHHHHHHHHhhcC-CCC-CCccceeeEEeecHHHHHHHHHHHHH-HHHHHHHHhhcChH--hHhhcC Q lcl|Aclame:pro 375 TYSLLAENLQSPLAYVCLSEVDDAL-LQG-LITKQHKPAIETGLPALSRSAAVQSM-LNASQVIAGLAPIA--QLDPRI 448 (510) Q Consensus 375 v~~rl~~E~l~Pli~r~~~il~~~~-l~~-~p~~~~~~~~vs~l~~l~r~~~~~~~-~~~~q~~~~~~~~~--q~~~~i 448 (510) +.++++.+|..-+..-........+ ... -..+.++..+.+.-.......+..-+ .. +.....+.+.+ ...++- T Consensus 307 i~~~i~~~l~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~ne-~r~~~~~~p~~gGd~~~~~ 384 (384) T protein:vir:49 307 FVSELSKKLSCEVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILPKD-LPEGETDSTLKGGETNEQY 384 (384) T ss_pred HHHHHHHHhchhhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCChh-HHHHcCCCCCCCCCCCCCC Confidence 7777766654322100000000000 000 00001111111111111110000000 00 11122232321 122222 No 126 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=91.10 E-value=0.017 Score=30.22 Aligned_cols=405 Identities=12% Similarity=0.038 Sum_probs=167.4 Q ss_pred Ch---------------hHHHHHHHHHhccCchHHHHHHHHhhcccccCCCC----CCcc----ccccccccc------h Q lcl|Aclame:pro 1 MK---------------STAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPM----SGSR----GVVEHDFQS------A 51 (510) Q Consensus 1 ~k---------------~~~~~r~~~lkr~~~~~~w~e~~~~~~P~~~~~~~----~~~~----~~~~~~~ds------t 51 (510) +| ..+..+|+..+. .+...=+...+-.||.....+. +... .+..+.|+. + T Consensus 7 ~~~~~~~m~V~~~hp~y~a~~~~W~~~~d-~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~~~rA~ 85 (488) T protein:vir:96 7 IKHRGFFMLTPIYHPDYLVNAPQWLRNLD-CVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLTWRLAN 85 (488) T ss_pred EeecceeecccccCHHHHHHhhhhhHhhh-hhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhhhhccc Confidence 11 123344443322 2333334444555675321111 1000 001111111 1 Q ss_pred HHHHHHHHHHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh Q lcl|Aclame:pro 52 GALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVT 131 (510) Q Consensus 52 g~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~ 131 (510) =.-+.+...+.|++.+|- .--.++.++ ..+++.++++| -....+.+.-+...+.+...+ T Consensus 86 ~~n~~~~tl~~l~G~vfr---k~p~~~~~~------------~~~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~ 144 (488) T protein:vir:96 86 YVNIVNPTMNAITGAVMR---REPEFDTMD------------NPVLIGLRDNI------DGKGNGIDQECKQALNALQWG 144 (488) T ss_pred cCchhHHHHHHhcchhhc---cCceeccCC------------cHHHHHHHhcc------CCCCCCHHHHHHHHHHHHHhc Confidence 111223333334444432 111111111 12345555554 355678888889999999999 Q ss_pred CceEEEEeCCCC--------------eEEEEEece---EEEee-CCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCC Q lcl|Aclame:pro 132 GNALLYRNSDEA--------------TVVAWSLRS---YAVRR-DATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLS 193 (510) Q Consensus 132 G~~~l~~~~~~~--------------~~~~~pl~~---~~v~~-d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~ 193 (510) |-+.+++|.+.. .+..|+-.+ +-..+ |+...+.-+..+++....+ ... . T Consensus 145 G~~~ilVD~P~~~~T~ade~~~~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D------------~~~-~ 211 (488) T protein:vir:96 145 SRCGWLVRSHPESATMADWNKGKKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERD------------GGT-Y 211 (488) T ss_pred CeEEEEEecCCCcCCHHHHHHhcCCcEEEEechhhhcCcceeccCCceeeEEEEEEEEEEecc------------CCC-c Confidence 999999987632 144555433 22222 2222344454555333111 000 0 Q ss_pred CCceEEEEEEEEeecCCCeeEEEEEEeeCCee----eccccccccccCceEEEeeeecCCCcc--ccchHHHHHHHHHHH Q lcl|Aclame:pro 194 GSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVR----VGETGRWPIHLCPYIVPTWNLAPGEHY--GRGHVEDYIGDFAKL 267 (510) Q Consensus 194 ~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~----~~~~~~y~~~~~P~~~~Rw~~~~ge~Y--Grgp~~~~l~d~~~L 267 (510) ....+..+..+ .++ . |.++...++.. +...++ .+.+++|++.|....+..+ |..|.. |+..| T Consensus 212 ~~~~~~~~~~l--~~g-~---~~v~~~~~~~~~~e~~~~~~g--~~~l~~IP~v~~~~~~~~~~~~~pPLl----dLA~l 279 (488) T protein:vir:96 212 VSKQRLINHRL--VDG-L---CEFQEVTDDEYSDEWTPVLIN--SKQSDTIPFFLASSQSNEWCIDSTPLT----SLAEI 279 (488) T ss_pred ccceEEEEEEE--ECc-E---EEEEEEecCCcccceEeecCC--CcccCeeEEEEEecCCCCCCCCCCchH----HHHHH Confidence 11111111111 122 1 23332222211 111121 1356777777776665554 444533 55555 Q ss_pred HHH---HHHHHHHH-HHhhCCceeeCCCCccchhhhhcCCCcceecCC-------ccccccccCCCccchHHHHHHHHHH Q lcl|Aclame:pro 268 SLL---SEKLGLYE-LESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGG-------AEAVRAYERGDYNKMAAIQQSLQAV 336 (510) Q Consensus 268 ~~l---~~~~l~~~-~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~-------~~~v~~~~~~~~~~~~~~~~~i~~~ 336 (510) |.- +.+-++.+ ..+.-|+|+...++. .+........+.+..|. .++...++.+ +.. .+.+.++++ T Consensus 280 nl~Hy~~ssd~~~il~~~~~p~lv~~~~~~-~~~~~~~~~~~g~~~~~~~~~~~~~g~~~~~e~~-~~~--l~~~~l~~l 355 (488) T protein:vir:96 280 SLSIYVMNAYSNKAMILANEAKWMVDMGDM-NKTMASEMNPLGFTLAGRMPYYVKNGDVKVIQAQ-FSP--ETENKVEKL 355 (488) T ss_pred HHHHHhhhhHHHHHHHhcCCceeeeccCCC-CcccccccccceeeecccccccccCCceeecCCc-hhH--HHHHHHHHH Confidence 532 22222332 344455555433332 22211111111111211 1122222221 111 246677777 Q ss_pred HHHHHHHHhhcccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhh-cCC--CCCCccceeeEEe Q lcl|Aclame:pro 337 VVRLNQAFMYGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALL--QGLITKQHKPAIE 413 (510) Q Consensus 337 ~~~I~~af~~~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~-~~l--~~~p~~~~~~~~v 413 (510) ++++.++=.. +++.. .+.||++.+.+...--..|+.+...+++-+ ++++.++.+ -|. ....++.+++.+- T Consensus 356 ~~qm~~~Ga~-l~~~~-~~~Ta~~~~~~~~~~~S~L~~~a~~le~al-----~~~l~~~A~w~g~~~~~~~~~~~~~~in 428 (488) T protein:vir:96 356 FEQAVKVGAS-LFTQQ-SNETATGAAIRSGSSTASMATLGNNVEDTV-----RNMLRFIMRYFEGTNLYVNPDELVFKLN 428 (488) T ss_pred HHHHHHHhHh-hccCC-CcchHHHHHHHHHHhhHHHHHHHHHHHHHH-----HHHHHHHHHHcCCCCCCcCccceEEEec Confidence 7777553321 22332 346999999999999999999888876653 444444432 111 1122233333332 Q ss_pred ec--HHHHHHHHHHHHHHHHHH--------HHHhhcChHhHhhcCCHHHHHHHHHHHcCCCH Q lcl|Aclame:pro 414 TG--LPALSRSAAVQSMLNASQ--------VIAGLAPIAQLDPRISLPKMMDTIWAAFSVDT 465 (510) Q Consensus 414 s~--l~~l~r~~~~~~~~~~~q--------~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~ 465 (510) .- ...+ -++++..+....+ ....+-.-.-+.+.+++++..+++.+ -|+.- T Consensus 429 ~dF~~~~l-d~~~~~al~~~~~~G~Is~~t~~~~L~~~gvl~~d~~~e~~~~~ie~-~g~~~ 488 (488) T protein:vir:96 429 RDYFDVEV-NPQMLQVAYAAMMEGNLPQVSWFELLKRARVVRGDMSKEEFDEHIAE-LGFGM 488 (488) T ss_pred cCCCCccC-CHHHHHHHHHHHhcCCCCHHHHHHHHHhCCcCCccCCHHHHHHHHhh-cCCCC Confidence 11 0011 0112222222111 11111111112345677777777764 33331 No 127 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=90.72 E-value=0.019 Score=29.97 Aligned_cols=331 Identities=10% Similarity=0.022 Sum_probs=122.4 Q ss_pred hhcccc--cCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHH Q lcl|Aclame:pro 27 TTLPYL--MVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARV 104 (510) Q Consensus 27 ~~~P~~--~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~v 104 (510) -+++-+ +....+.........+-+.+. ...+.+.++..+ ...++..... ....|....+.+ T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~----~~~v~~~~al-------~~~~v~~~i~~i 63 (392) T protein:vir:39 1 MILPILNFINQTNDPPEVGSVQSYFPDGN------DAQIMESLLGDN----NEWVSARAAL-------RNSDLFSIILQL 63 (392) T ss_pred CcchhhhhhhcccccccccccccccccCc------hhhhhhhhcCCC----CceechHHhh-------ccHHHHHHHHHH Confidence 111110 000000000000000100000 000011111000 0011111000 001111111111 Q ss_pred HHHH------------HHHHHhcCC----HHHHHHHHHHHHhhCceEEEEeCCC-C-eEEEEEe--ceEEEeeCCCCcee Q lcl|Aclame:pro 105 DRKA------------TQRLFQNAS----LAVLTQVIKLLIVTGNALLYRNSDE-A-TVVAWSL--RSYAVRRDATGRWM 164 (510) Q Consensus 105 e~~~------------~~~l~~snf----~~~~~~~~~~l~~~G~~~l~~~~~~-~-~~~~~pl--~~~~v~~d~~G~v~ 164 (510) ...+ ...+.+-|- +.-+...+.++..+||+.+++..+. + ....+|| ..+-+..|.+|. T Consensus 64 a~~ia~lp~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~-- 141 (392) T protein:vir:39 64 SSDLAIVKINAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYEN-- 141 (392) T ss_pred HHhhccCceeeccchhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCc-- Confidence 1111 011222332 3444556668888999887765432 2 2344444 333333433321 Q ss_pred EEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccccccccCceEEEee Q lcl|Aclame:pro 165 DIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTW 244 (510) Q Consensus 165 ~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw 244 (510) .+++.+++. +........|+.++ .+..|+ T Consensus 142 -----------------------------------------------~~~y~~~~~--~~~~~~~~~~~~~e--iih~~~ 170 (392) T protein:vir:39 142 -----------------------------------------------GMYYNITFD--DPKIEPILQAPQSD--LIHMKL 170 (392) T ss_pred -----------------------------------------------eEEEEEEec--CcccceeEEEcccc--EEEecC Confidence 111111111 11111111122222 555667 Q ss_pred eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCC-CCccchhh--------hhcCCC-c--ceecCCc Q lcl|Aclame:pro 245 NLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE-AKGAVVDD--------YQDAEM-G--DYVPGGA 312 (510) Q Consensus 245 ~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~-~g~~~~~~--------~~~~~~-G--~~~~g~~ 312 (510) ...+|..||.||...+...+.....+.+.......-...|..++.- ++....+. +....+ | .+++++. T Consensus 171 ~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~ 250 (392) T protein:vir:39 171 LSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLE 250 (392) T ss_pred CCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecCCCc Confidence 6677889999999999999999999988888888877888766542 22222111 111111 1 1222222 Q ss_pred cccccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 313 EAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVC 391 (510) Q Consensus 313 ~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~ 391 (510) .+.++... ..+.+. .+..+..+..|.++|-... .-.+...-|..+ .+...=....|-|.+.++.+++-.-|+..+ T Consensus 251 -~~~~l~~~-~~d~~~-~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~-~~~~~f~~~~l~P~~~~ie~~l~~~L~~~~ 326 (392) T protein:vir:39 251 -EFTALEIK-SNVAQL-LSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSI-QQISGMYASALNRYLRPAISELEYKLSDHI 326 (392) T ss_pred -eEEEccCC-hhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 23333322 234443 3556777788888883321 111222222211 112223455677777777777644332210 Q ss_pred HHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHH-----------HHH----------HHHHhhcChH-----hHh Q lcl|Aclame:pro 392 LSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSML-----------NAS----------QVIAGLAPIA-----QLD 445 (510) Q Consensus 392 ~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~-----------~~~----------q~~~~~~~~~-----q~~ 445 (510) .-+++..+- .+...++..+..+. +++ .....+.+.+ +-. T Consensus 327 -------------~~d~~~~~~--~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~e~l~~~~~Gd~~~p~ 391 (392) T protein:vir:39 327 -------------SVNMRPAID--PLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQSNEPV 391 (392) T ss_pred -------------cccchhhhc--cCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccchhcCCCCCCCCCCCCCC Confidence 000000000 00011111111110 000 0001122111 111 Q ss_pred h Q lcl|Aclame:pro 446 P 446 (510) Q Consensus 446 ~ 446 (510) | T Consensus 392 p 392 (392) T protein:vir:39 392 P 392 (392) T ss_pred C Confidence 2 No 128 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=90.72 E-value=0.019 Score=29.97 Aligned_cols=331 Identities=10% Similarity=0.022 Sum_probs=122.4 Q ss_pred hhcccc--cCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHH Q lcl|Aclame:pro 27 TTLPYL--MVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARV 104 (510) Q Consensus 27 ~~~P~~--~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~v 104 (510) -+++-+ +....+.........+-+.+. ...+.+.++..+ ...++..... ....|....+.+ T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~----~~~v~~~~al-------~~~~v~~~i~~i 63 (392) T protein:vir:10 1 MILPILNFINQTNDPPEVGSVQSYFPDGN------DAQIMESLLGDN----NEWVSARAAL-------RNSDLFSIILQL 63 (392) T ss_pred CcchhhhhhhcccccccccccccccccCc------hhhhhhhhcCCC----CceechHHhh-------ccHHHHHHHHHH Confidence 111110 000000000000000100000 000011111000 0011111000 001111111111 Q ss_pred HHHH------------HHHHHhcCC----HHHHHHHHHHHHhhCceEEEEeCCC-C-eEEEEEe--ceEEEeeCCCCcee Q lcl|Aclame:pro 105 DRKA------------TQRLFQNAS----LAVLTQVIKLLIVTGNALLYRNSDE-A-TVVAWSL--RSYAVRRDATGRWM 164 (510) Q Consensus 105 e~~~------------~~~l~~snf----~~~~~~~~~~l~~~G~~~l~~~~~~-~-~~~~~pl--~~~~v~~d~~G~v~ 164 (510) ...+ ...+.+-|- +.-+...+.++..+||+.+++..+. + ....+|| ..+-+..|.+|. T Consensus 64 a~~ia~lp~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~-- 141 (392) T protein:vir:10 64 SSDLAIVKINAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYEN-- 141 (392) T ss_pred HHhhccCceeeccchhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCc-- Confidence 1111 011222332 3444556668888999887765432 2 2344444 333333433321 Q ss_pred EEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccccccccCceEEEee Q lcl|Aclame:pro 165 DIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTW 244 (510) Q Consensus 165 ~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw 244 (510) .+++.+++. +........|+.++ .+..|+ T Consensus 142 -----------------------------------------------~~~y~~~~~--~~~~~~~~~~~~~e--iih~~~ 170 (392) T protein:vir:10 142 -----------------------------------------------GMYYNITFD--DPKIEPILQAPQSD--LIHMKL 170 (392) T ss_pred -----------------------------------------------eEEEEEEec--CcccceeEEEcccc--EEEecC Confidence 111111111 11111111122222 555667 Q ss_pred eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCC-CCccchhh--------hhcCCC-c--ceecCCc Q lcl|Aclame:pro 245 NLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE-AKGAVVDD--------YQDAEM-G--DYVPGGA 312 (510) Q Consensus 245 ~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~-~g~~~~~~--------~~~~~~-G--~~~~g~~ 312 (510) ...+|..||.||...+...+.....+.+.......-...|..++.- ++....+. +....+ | .+++++. T Consensus 171 ~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~ 250 (392) T protein:vir:10 171 LSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLE 250 (392) T ss_pred CCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecCCCc Confidence 6677889999999999999999999988888888877888766542 22222111 111111 1 1222222 Q ss_pred cccccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 313 EAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVC 391 (510) Q Consensus 313 ~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~ 391 (510) .+.++... ..+.+. .+..+..+..|.++|-... .-.+...-|..+ .+...=....|-|.+.++.+++-.-|+..+ T Consensus 251 -~~~~l~~~-~~d~~~-~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~-~~~~~f~~~~l~P~~~~ie~~l~~~L~~~~ 326 (392) T protein:vir:10 251 -EFTALEIK-SNVAQL-LSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSI-QQISGMYASALNRYLRPAISELEYKLSDHI 326 (392) T ss_pred -eEEEccCC-hhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 23333322 234443 3556777788888883321 111222222211 112223455677777777777644332210 Q ss_pred HHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHH-----------HHH----------HHHHhhcChH-----hHh Q lcl|Aclame:pro 392 LSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSML-----------NAS----------QVIAGLAPIA-----QLD 445 (510) Q Consensus 392 ~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~-----------~~~----------q~~~~~~~~~-----q~~ 445 (510) .-+++..+- .+...++..+..+. +++ .....+.+.+ +-. T Consensus 327 -------------~~d~~~~~~--~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~e~l~~~~~Gd~~~p~ 391 (392) T protein:vir:10 327 -------------SVNMRPAID--PLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQSNEPV 391 (392) T ss_pred -------------cccchhhhc--cCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccchhcCCCCCCCCCCCCCC Confidence 000000000 00011111111110 000 0001122111 111 Q ss_pred h Q lcl|Aclame:pro 446 P 446 (510) Q Consensus 446 ~ 446 (510) | T Consensus 392 p 392 (392) T protein:vir:10 392 P 392 (392) T ss_pred C Confidence 2 No 129 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=89.31 E-value=0.027 Score=29.17 Aligned_cols=423 Identities=13% Similarity=0.009 Sum_probs=148.2 Q ss_pred Ch--------hHHHHHHHHHhccCchHHHHHHHHhh--cccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCc Q lcl|Aclame:pro 1 MK--------STAAMLWEKLRDGSVEQRAIEFAKTT--LPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPT 70 (510) Q Consensus 1 ~k--------~~~~~r~~~lkr~~~~~~w~e~~~~~--~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp 70 (510) |. +.|.+.|+. +.+...+.+++++-- ++++-............+...+-+..+|+.+++.|++- T Consensus 1 ~~~~t~~~~~~~l~~~~~~--~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~---- 74 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDD--GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPN---- 74 (456) T ss_pred CCCCCHHHHHHHHHHHHHH--HHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccC---- Confidence 11 112222221 111112223333321 11110000000111111233456666677666665432 Q ss_pred cCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--CCC-CeEEE Q lcl|Aclame:pro 71 GIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SDE-ATVVA 147 (510) Q Consensus 71 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~--~~~-~~~~~ 147 (510) + |+....++. +..+ .+.+.+.+++|.....++.++..++|.+.+++. ++. .++++ T Consensus 75 --g-~~~~~~~d~------------~~~~-------~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~~i~~ 132 (456) T protein:vir:79 75 --G-ITVGGSADS------------DLAL-------RARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITA 132 (456) T ss_pred --C-eecCCCCCc------------cHHH-------HHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCCceEEEE Confidence 2 222222111 0111 122345667899999999999999998865543 332 24666 Q ss_pred EEeceEEEeeC-CCC-ceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCee Q lcl|Aclame:pro 148 WSLRSYAVRRD-ATG-RWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVR 225 (510) Q Consensus 148 ~pl~~~~v~~d-~~G-~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~ 225 (510) ++..+.++..| ..+ ++...+|.++ ..+ +.. ....-..++..+..+...+...+. ..+. .....++-. T Consensus 133 ~~p~~~~~i~d~~~~~~~~~~~~~~~-~~d----~~~----~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~~~ 201 (456) T protein:vir:79 133 DSPETMVVSVDPLQPWRIRSAMRWWR-DLD----AES----DFAIVWSGDGWQKFARPCFVQSSS-RRRL-VTRISDSWV 201 (456) T ss_pred eccceeEEEEcCCCCCceEEEEEEEE-ecC----Cce----eEEEEEcCCceEEEEEEEEeeccc-ccee-eeccCCcee Confidence 66555544445 344 3444444442 110 000 000011122222222222111110 0000 000111111 Q ss_pred eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC----------CCC-cc Q lcl|Aclame:pro 226 VGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD----------EAK-GA 294 (510) Q Consensus 226 ~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~----------~~g-~~ 294 (510) ........+..+|++.++ +..|.|=.+..++-+-.++...-..+..++..+.|...+. ..| .+ T Consensus 202 ~~~~~~~~~~~~pvv~~~------N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i 275 (456) T protein:vir:79 202 PVGDAVVTGSPPPVVVYQ------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAI 275 (456) T ss_pred ecccccCCCCceeEEEec------CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCccccccccccccc Confidence 111222223456665542 4678888888887777777665555555555544433321 111 01 Q ss_pred chhhhhcCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhccc-----CCCCCCCCHHHHHHHHHHHH Q lcl|Aclame:pro 295 VVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGAN-----QRDAERVTAEEVRITAEEAE 369 (510) Q Consensus 295 ~~~~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~-----~~~~~~vTAtEi~~r~~E~~ 369 (510) ++........|.+..+. .+....++. ..+++.....++.+...| +..... ..+..+.++.-+......+. T Consensus 276 ~~~~~~~~~~~~~~~~~-~~~~~~q~~-~~~~~~~~~~l~~~i~~i---~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~ 350 (456) T protein:vir:79 276 DYASIFEAAPGALWELP-PGVDIWESQ-TNDFTPMLSAIKEHIRQL---SSATKTPLPMLMPDSANQSAEGAHNIEKGFL 350 (456) T ss_pred chhhhhhhhccccccCC-CCcceeeec-ccChHHHHHHHHHHHHHH---HhhcCCChhHhcccccCcHHHHHHHHHHHHH Confidence 11111112223222221 122222222 234444344344333333 322211 12223446654433322222 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHh-hcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcC Q lcl|Aclame:pro 370 NTLGGTYSLLAENLQSPLAYVCLSEVD-DALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRI 448 (510) Q Consensus 370 ~~LGpv~~rl~~E~l~Pli~r~~~il~-~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~i 448 (510) .. .++.+ ..+.+-+.+.+.++. -.+.++ ...+++.+..+. +-..++.++.+....+ +|.+. T Consensus 351 ~k----~~~~~-~~f~~~l~~~~~l~~~~~g~~~--~~~i~v~w~~~~-~~s~~~~ada~~kl~~-----~G~~~----- 412 (456) T protein:vir:79 351 FK----CEDRL-SIAKIGLEAILVKALQIEGESV--EDTVDVSFESPD-RVTLGEKYSAASLAKA-----AGESW----- 412 (456) T ss_pred HH----HHHHH-HHHHHHHHHHHHHHHHhcCCCc--cccceEEeCCCC-CcCHHHHHHHHHHHHh-----cCCCh----- Confidence 21 12222 223333444444432 123221 223455443321 1122222222222111 12211 Q ss_pred CHHHHHHHHHHHcCCCHhhccCCHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 449 SLPKMMDTIWAAFSVDTSQFYKSADELQA-EAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 449 d~d~~~~~~a~~~Gvp~~~i~~s~ee~~~-~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ... ....+|+.+. ++++ +.++...+.. +++ .++.+.+...|.- T Consensus 413 --~~~---~~~~lg~~~~-------~i~~~e~~r~~~e~~-~~~------~~~~~~~~~~~~~ 456 (456) T protein:vir:79 413 --ASI---RRNILNYNAD-------QIKQDDLDRAREQIT-LFA------GNPVQRPQEDGSR 456 (456) T ss_pred --HHH---HHhcCCCCHH-------HHHHHHHHHHHHHHH-HHh------hhHhhcCCCCCCC Confidence 011 2234566543 2221 1111111111 111 1111111111111 No 130 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=88.52 E-value=0.032 Score=28.79 Aligned_cols=311 Identities=10% Similarity=0.021 Sum_probs=119.9 Q ss_pred HHHhhcC------------ccCcccccCCChhhh-hh--------hcc-CchHHHHHHHHHHHHHHHH------------ Q lcl|Aclame:pro 63 LARSLFP------------TGIPFFRSELTDAIR-RE--------ADS-RDTDITEVTAALARVDRKA------------ 108 (510) Q Consensus 63 l~~~ltp------------p~~~WF~l~~~d~~~-~~--------~~~-~~~~~~~v~~~L~~ve~~~------------ 108 (510) |+.++|. .-..||.-. .+... .- +.. .-.....|..-.+.+...+ T Consensus 1 m~m~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~~~~~ 79 (392) T protein:vir:74 1 MILPILNFINQTNDPPEAGSVQSYFPDG-NDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN 79 (392) T ss_pred CcchhhhhhhcccCcccccccccccccC-chhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHhhccCceeeccchh Confidence 2222221 000111000 00000 00 000 0000111221111111111 Q ss_pred HHHHHhcCC----HHHHHHHHHHHHhhCceEEEEeCCC-Ce-EEEEEe--ceEEEeeCCCCceeEEEEEEEecHHHHhHH Q lcl|Aclame:pro 109 TQRLFQNAS----LAVLTQVIKLLIVTGNALLYRNSDE-AT-VVAWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDDV 180 (510) Q Consensus 109 ~~~l~~snf----~~~~~~~~~~l~~~G~~~l~~~~~~-~~-~~~~pl--~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~ 180 (510) ...+.+-|- +.-+...+.++..+||+.+++..+. ++ ...+|| ..+-+..+.+|. T Consensus 80 ~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~------------------ 141 (392) T protein:vir:74 80 QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYEN------------------ 141 (392) T ss_pred hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCc------------------ Confidence 111222232 3444556667788888887765432 22 333444 333333333332 Q ss_pred hhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccccccccCceEEEeeeecCCCccccchHHHH Q lcl|Aclame:pro 181 YKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDY 260 (510) Q Consensus 181 ~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~ 260 (510) .+++.++.. +........+..++ .+..|+...+|..||.||...+ T Consensus 142 -------------------------------~~~y~~~~~--~~~~~~~~~~~~~e--vih~~~~~~~~~~~G~s~i~~~ 186 (392) T protein:vir:74 142 -------------------------------GMYYNITFD--DPKIEPILQAPQSD--LIHMKLLSIDGGKTGISPLYSL 186 (392) T ss_pred -------------------------------eEEEEEEec--CCccceeEEEcCcc--EEEecCCCCCCccccccHHHHH Confidence 111211111 11111111121122 4445555667788999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhCCceeeC-CCCccchhh--------hhcCCC-c--ceecCCccccccccCCCccchHH Q lcl|Aclame:pro 261 IGDFAKLSLLSEKLGLYELESLEVLNLVD-EAKGAVVDD--------YQDAEM-G--DYVPGGAEAVRAYERGDYNKMAA 328 (510) Q Consensus 261 l~d~~~L~~l~~~~l~~~~~a~~~~~lv~-~~g~~~~~~--------~~~~~~-G--~~~~g~~~~v~~~~~~~~~~~~~ 328 (510) ...+.......+.......-...|..++. +++....+. +....+ | .+++++. .+.++.+. ..+.+. T Consensus 187 ~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~-~~~~l~~~-~~d~q~ 264 (392) T protein:vir:74 187 RRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLE-EFTALEIK-SNVAQL 264 (392) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecCCCc-eEEEccCC-hhHHHH Confidence 99999999998888888888888876654 323222221 111111 1 1222222 23333332 234444 Q ss_pred HHHHHHHHHHHHHHHHhhcc-cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccc Q lcl|Aclame:pro 329 IQQSLQAVVVRLNQAFMYGA-NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQ 407 (510) Q Consensus 329 ~~~~i~~~~~~I~~af~~~~-~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~ 407 (510) .+..+..+..|.++|-... .-.+...-|..+ .+..+-....|.|.+.++.+++-.-|+..+ .-+ T Consensus 265 -~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~-e~~~~~~~~~l~p~~~~ie~~l~~~l~~~~-------------~~~ 329 (392) T protein:vir:74 265 -LSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSI-QQISGMYASALNRYLRPAISELEYKLSDHI-------------SVN 329 (392) T ss_pred -HHHHHHHHHHHHHHhCCCHHHhCCCCCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHhccchh-------------ccc Confidence 4556777788989883321 001122222211 122223455677777777776543332110 000 Q ss_pred eeeEEeecHHHHHHHHHHHHHHH-----------HH----------HHHHhhcChH-----hHhh Q lcl|Aclame:pro 408 HKPAIETGLPALSRSAAVQSMLN-----------AS----------QVIAGLAPIA-----QLDP 446 (510) Q Consensus 408 ~~~~~vs~l~~l~r~~~~~~~~~-----------~~----------q~~~~~~~~~-----q~~~ 446 (510) ++..+ -.+...++..+..+.+ ++ .....+.+.+ +=.| T Consensus 330 ~~~~~--~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pne~r~~enl~~~~~Gd~~~p~p 392 (392) T protein:vir:74 330 MRPAI--DPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQSNEPVP 392 (392) T ss_pred chhhh--cCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCccccchhcCCCCCCCCCCCCCCC Confidence 00000 0011111111111110 00 0001222111 1112 No 131 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=87.97 E-value=0.035 Score=28.55 Aligned_cols=393 Identities=9% Similarity=-0.004 Sum_probs=152.3 Q ss_pred ChhHHHHHHHHHhcc--CchHHHHHHHHhhcccc---cCCCCCCccccc--cccc-cchHHHHHHHHHHHHHHhhcCccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDG--SVEQRAIEFAKTTLPYL---MVDPMSGSRGVV--EHDF-QSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~k~~~~~r~~~lkr~--~~~~~w~e~~~~~~P~~---~~~~~~~~~~~~--~~~~-dstg~~a~~~Laa~l~~~ltpp~~ 72 (510) ||+...+++.+++.+ .|... .++ ...|+. +....+...... .... .++--.|++.+|+.+.+ - T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~--~~s-~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~------l 71 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGV--PIS-LTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIAT------L 71 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCC--ccc-CCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhh------C Confidence 999999988887532 23211 000 000111 110001111100 1112 23334456666665543 2 Q ss_pred cccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCceEEEEeCCCCeE-E Q lcl|Aclame:pro 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLF-QNA----SLAVLTQVIKLLIVTGNALLYRNSDEATV-V 146 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~l~~~G~~~l~~~~~~~~~-~ 146 (510) ||.-....+..-.+. + .+..+...|. +-| .+.-....+.++...||+.+++..+.+.. . T Consensus 72 p~~~~~~~~~g~~~~---------~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~g~~~~ 136 (437) T protein:vir:10 72 PLNLYQTKPDGTRVL---------A------KQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSAGVLIG 136 (437) T ss_pred ceeEEEEcCCCceee---------c------cccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEE Confidence 554322221110000 0 0111222233 333 44445666778889999998877665543 3 Q ss_pred EEEe--ceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCe Q lcl|Aclame:pro 147 AWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGV 224 (510) Q Consensus 147 ~~pl--~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~ 224 (510) .||| ..+.+.++.+|.+. +. |...+|. T Consensus 137 L~~l~p~~v~i~~~~~g~~~--------------------------------------------------y~-~~~~~g~ 165 (437) T protein:vir:10 137 LELMLPQRTTVKRLTSGALQ--------------------------------------------------YT-YRNVDGT 165 (437) T ss_pred EEEEcCcceEEEECCCCeEE--------------------------------------------------EE-EEecCce Confidence 4555 44444444444221 00 1111111 Q ss_pred eeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCC- Q lcl|Aclame:pro 225 RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE- 303 (510) Q Consensus 225 ~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~- 303 (510) .. .+..++ .+..|....+ ..||.||..-+...+.....+.+.......-...|-.++.-++.+.++...... T Consensus 166 ~~----~~~~~d--Iih~r~~~~d-~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~ 238 (437) T protein:vir:10 166 VS----TLAEDD--VFHVRGFSLD-GLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQILQKEKRAEIRT 238 (437) T ss_pred EE----EEcccc--EEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHH Confidence 10 001011 2333433223 379999999999998888888888777777777787777655666655433221 Q ss_pred ------Cc-------ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--cc-CCCCCCCCHHHHHHHHHH Q lcl|Aclame:pro 304 ------MG-------DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--AN-QRDAERVTAEEVRITAEE 367 (510) Q Consensus 304 ------~G-------~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~-~~~~~~vTAtEi~~r~~E 367 (510) .| .+++++. ...++.. +..+.+. .+..+..+..|..+|-.. .+ ..+....+..-+.+.... T Consensus 239 ~~~~~~~g~~nag~~~vl~~g~-~~~~l~~-~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~ 315 (437) T protein:vir:10 239 DLAEQFGGAMQAGKTMVLEAGM-KYQAITM-NPGDVQL-LETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLG 315 (437) T ss_pred HHHHHhcCccccCcceeccCCc-eEEeccC-ChhhHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHH Confidence 11 1122222 2233322 1234443 344455677888888432 11 112222223333333222 Q ss_pred HHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhc Q lcl|Aclame:pro 368 AENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPR 447 (510) Q Consensus 368 ~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~ 447 (510) . ...-|.|++.+.-..|.+..+++-......+++- ++.+-|+--.+. ..+.+.+-..+- T Consensus 316 f-----------~~~tl~P~~~~ie~~l~~kll~~~e~~~~~~~fd--~~~ll~~d~~~r-~~~~~~~~~~G~------- 374 (437) T protein:vir:10 316 F-----------LTFTLRPWLTRIEQAARRSLLRPGERDQFYAEFS--VEGLLRADSAGR-AAFYSTMTQNGL------- 374 (437) T ss_pred H-----------HHHHHHHHHHHHHHHHHhhccCccccCceEEEEe--chhhhccCHHHH-HHHHHHHHhCCC------- Confidence 2 2333445544444444444444322222223321 233333211111 112221111111 Q ss_pred CCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--hhcccCCC Q lcl|Aclame:pro 448 ISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASD--MTNALAGV 510 (510) Q Consensus 448 id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~--~~~~~ag~ 510 (510) +..++ +-+.+|.|+- ...+++-. .+..-.-......+ ..+.+.+ ..+.-.+. T Consensus 375 ~T~NE----~R~~~gl~pi---~gg~~~~~--~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 428 (437) T protein:vir:10 375 MTRDE----CRAKENLPPM---GGNAAVLT--VQSALLPIDKLGEH--TTATAAQDALKAWLYQE 428 (437) T ss_pred cCHHH----HHHHhCCCCC---CCCcceEe--ecCcccchhhccCc--CCCcchhccccccCCCC Confidence 11111 1122344331 11111000 00000000000000 0000000 00111111 No 132 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=85.33 E-value=0.054 Score=27.55 Aligned_cols=296 Identities=12% Similarity=0.053 Sum_probs=116.7 Q ss_pred eeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeee-ccc---ccc---ccc Q lcl|Aclame:pro 163 WMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRV-GET---GRW---PIH 235 (510) Q Consensus 163 v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~-~~~---~~y---~~~ 235 (510) |-+++++++ +..+.+ ..+.+++-+...| +.+..++..+ ... .+. ... T Consensus 1 v~Eivw~~~-----------------------~g~~~~-~~l~~r~~~~~~~--f~~~~~~~l~~~~~~~~~g~~~~~lp 54 (355) T protein:vir:78 1 MFEQVYRIE-----------------------NGRARL-GKLAWRPPRTISR--FDVAPDGGLVAIEQWGVFGKATVRIP 54 (355) T ss_pred CeEEEEEee-----------------------CCeEEE-eeeeecCccceee--eeeccCCceeEEEecCCCCCCcceec Confidence 222222210 000000 0111121111111 1122222221 111 111 001 Q ss_pred cCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCC-ceeeCCCCccc--hh--------------- Q lcl|Aclame:pro 236 LCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEV-LNLVDEAKGAV--VD--------------- 297 (510) Q Consensus 236 ~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~-~~lv~~~g~~~--~~--------------- 297 (510) .+=|++.|....+|+.||.|+...+..-..--+...+..+..+++-.-| |+..-|.|... .+ T Consensus 55 ~~kfi~~~~~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l 134 (355) T protein:vir:78 55 VDRLVVFVNEREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEG 134 (355) T ss_pred cCCEEEEEeCCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHH Confidence 1238999999999999999999999999888888899999998875433 33333322111 10 Q ss_pred -----hhhcC-CCcceecCCccccccccC-CCccchHHHHHHHHHHHHHHHHHHhhcccCC----CCCCCCHHHHHHHHH Q lcl|Aclame:pro 298 -----DYQDA-EMGDYVPGGAEAVRAYER-GDYNKMAAIQQSLQAVVVRLNQAFMYGANQR----DAERVTAEEVRITAE 366 (510) Q Consensus 298 -----~~~~~-~~G~~~~g~~~~v~~~~~-~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~----~~~~vTAtEi~~r~~ 366 (510) .+..+ ..|.++|-+. .+..++. +..+++ ...|+.+.+.|+++++-..+.. ++..-...|+.... T Consensus 135 ~~~~~~i~~g~~a~~iip~g~-~ie~~ea~g~~~~~---~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~vh~~v- 209 (355) T protein:vir:78 135 LQLAKEFRAGEAAGGYIPHGA-NFTLTGVQGKLPEM---DGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGDTFASF- 209 (355) T ss_pred HHHHHHhhCCcceeEeecCCc-eEEEeecCCCcccH---HHHHHHHHHHHHHHHhhhhhccccCCccchhhHHHHHHHH- Confidence 01111 1244555432 3444432 222343 4688999999999997653322 12223445654321 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhh Q lcl|Aclame:pro 367 EAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP 446 (510) Q Consensus 367 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~ 446 (510) ....+-.-...|...|..-||..++.+-. +...+.| ++++-. +.. +...+...++.+..++- T Consensus 210 -~~~~~~aD~~~i~~~ln~~li~~l~~lN~-~~~~~~P----~~~~~~-~~~-----~~~~~a~~~~~l~~~G~------ 271 (355) T protein:vir:78 210 -FTGSLNAVMKHIADVTQQHVVEDLVDQNW-GPEEPAP----RLVPAQ-LGK-----EQPVTAEAIRALVECGA------ 271 (355) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCCCCCC----EEEecC-cCh-----hHHHHHHHHHHHHhCCC------ Confidence 11122222233333333344444433211 0111111 122211 111 11112233333333322 Q ss_pred cCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHH--HHHHH----H--HHH---------HHHHH-HHHHh----- Q lcl|Aclame:pro 447 RISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRR--QAAQA----Q--AAQ---------ETLLE-GASDM----- 503 (510) Q Consensus 447 ~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~q--qa~~~----~--~a~---------~~~~~-~a~~~----- 503 (510) .+..+....++.+.+|+|.. -..++++..-.+.... ++... . .++ .+..+ -+... T Consensus 272 ~~~~~~~~~~~~e~~gip~p--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~a~~~~~~~~~~~~~~~~~~~ 349 (355) T protein:vir:78 272 FTADPELEKDLRARYGLPAP--AERDDGADAAAAKAAGRRRAKRLPGQRQGAALPSRSPRADPPRRRGPLRRRPRHPAHR 349 (355) T ss_pred ccccHHHHHHHHHHhCCCCC--CCCCcccCCccccccccccccccCCccccccccccCCCCCChhhhHHHHHHhhccccC Confidence 13334556778888998743 1112222110000000 00000 0 000 00000 01111 Q ss_pred hcccCC Q lcl|Aclame:pro 504 TNALAG 509 (510) Q Consensus 504 ~~~~ag 509 (510) -...+| T Consensus 350 ~~~~~~ 355 (355) T protein:vir:78 350 RCAPDG 355 (355) T ss_pred CCCCCC Confidence 012222 No 133 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=84.45 E-value=0.06 Score=27.27 Aligned_cols=396 Identities=14% Similarity=0.077 Sum_probs=161.9 Q ss_pred hHHHHHHHHhhcccccCCCCCCccccccccc------------cchHHHHHHHHHHHHHHhhcCcc---CcccccCCChh Q lcl|Aclame:pro 18 EQRAIEFAKTTLPYLMVDPMSGSRGVVEHDF------------QSAGALLVNNLAAKLARSLFPTG---IPFFRSELTDA 82 (510) Q Consensus 18 ~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~------------dstg~~a~~~Laa~l~~~ltpp~---~~WF~l~~~d~ 82 (510) ....+-+..+.. +.+++..+.+ +=...-.-+-|+.+++.. |+. +.|+.+.-.+. T Consensus 1 ~~~~D~~~~~~~---------~~g~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~vd~--~a~d~~r~~~~i~~~d~ 69 (437) T protein:vir:52 1 MKFFDGIKSLAL---------KLGSKQEQTYYSPSLSLTDDLVQLEALWRDNWIANKVCIK--RPEDMVRNWREIYSNDL 69 (437) T ss_pred CchhhhhHhHHh---------cCCCccccceeecCccccccHHHHHHHHHhCchhhHHhhc--chHHhhcCCceEecCCC Confidence 222222222211 0011111111 111222234444555544 332 67888865332 Q ss_pred hhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCCeEEEEEeceEEEeeCCCCc Q lcl|Aclame:pro 83 IRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRDATGR 162 (510) Q Consensus 83 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~~~~~~pl~~~~v~~d~~G~ 162 (510) .. +.+ +.+.+.+.+-++...+.++++.--.+|.+++++..+... -.-|+. ..|. T Consensus 70 ~~----------~~~--------~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~-~~~pl~-------~~~~ 123 (437) T protein:vir:52 70 NS----------KQL--------DLFTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQN-TSAPLK-------PTER 123 (437) T ss_pred CH----------HHH--------HHHHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCC-cccccc-------cCCc Confidence 11 111 122333445578999999999888899998887665432 122331 1233 Q ss_pred eeE--EEEEEEecHHHHhHHhhHHhhcccccCCC-CceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccccccccCce Q lcl|Aclame:pro 163 WMD--IVLKQRYKSKDLDDVYKQDLMRAGRNLSG-SGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPY 239 (510) Q Consensus 163 v~~--i~r~~~~t~~~l~~~~~~~~~~~~~~~~~-~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~ 239 (510) +.. ++-++.+++. .-.+. +-.+| +-+.+.|+.. ++... .-|| -.++..-.+. ..| T Consensus 124 ~~~~~v~~~~~v~~~-----~~~~~----dp~s~~fg~p~~y~v~----~~~~~-~~iH----~SRii~~~~~---~~~- 181 (437) T protein:vir:52 124 LKRLIILPKWKISPT-----GTKDD----DVLSPNFGRYSEYSIL----GGSQS-ITVH----HSRLIILNAN---DAP- 181 (437) T ss_pred eeEEEEechhhcccc-----ccccc----cccccccCcceEEEEe----cCCcc-eeEc----cceeEEecCc---cCC- Confidence 321 1111111110 00000 00011 1122223221 11100 0111 1112111111 122 Q ss_pred EEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC-C-----CCc----cch-hhhhc--CCCcc Q lcl|Aclame:pro 240 IVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD-E-----AKG----AVV-DDYQD--AEMGD 306 (510) Q Consensus 240 ~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~-~-----~g~----~~~-~~~~~--~~~G~ 306 (510) .....-||+|+.+-.+..++..+.......+.+..+....+-++ . +++ ... +.+.. ...|. T Consensus 182 ------~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 255 (437) T protein:vir:52 182 ------LSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKIDIFKIAGLSDKIAAGMENEVASVISAVQEIKSATNS 255 (437) T ss_pred ------CccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCCceecchHHHHhcCCcHHHHHHHHHHHHHhcCCCce Confidence 12356789999999999999999888887776655544433332 0 010 000 11101 11233 Q ss_pred eecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHh--hc-c-cCCCCCCCCHH-HHHHHHHHHHHHhhhhHHHHHH Q lcl|Aclame:pro 307 YVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM--YG-A-NQRDAERVTAE-EVRITAEEAENTLGGTYSLLAE 381 (510) Q Consensus 307 ~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~--~~-~-~~~~~~~vTAt-Ei~~r~~E~~~~LGpv~~rl~~ 381 (510) ++-+..+++..+.. ++.-+...+....+.|..++= .. + .+..++--|.. +++.=. --+..++. T Consensus 256 ~~~d~~~~~e~~~~----~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D~~~yy--------d~i~~~Qe 323 (437) T protein:vir:52 256 LLLDAENEYDRKEL----TFTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGLASGDEDIQNYH--------EAIRRLQE 323 (437) T ss_pred EEEcCCcceEEEec----CcCCHHHHHHHHHHHHHHHhcCchhhhcCcCcccccccHHHHHHHH--------HHHHHHHH Confidence 33333334444332 333345666777788888771 11 2 22233321322 222211 22455666 Q ss_pred HHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHc Q lcl|Aclame:pro 382 NLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAF 461 (510) Q Consensus 382 E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~ 461 (510) ..+.|++++++.++.+..+.++++ ++.+++- +|..+.-..+++......+....+.+. ..++++++.+.+.+. T Consensus 324 ~~l~p~le~l~~~i~~~~~g~~~~-~~~~~f~-pL~~~s~kekae~~~~~a~a~~~~~~~----g~i~~~e~r~~L~~~- 396 (437) T protein:vir:52 324 TRLRPIFEIIDPLICNELFGGLPA-DWWFEFV-PLTTVKQEQQINMLNTFATAANTLIQN----GVLNEYQIANELRES- 396 (437) T ss_pred HHHHHHHHHHHHHHHHHhcCCCCC-cceEEeC-CcCCcCHHHHHHHHHHHHHHHHHHHhc----CCCCHHHHHHHHHhc- Confidence 789999999999887655544443 4665543 343333333333322222333222221 236777777766553 Q ss_pred CCCHhhccCCHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 462 SVDTSQFYKSADELQAEAE-EQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 462 Gvp~~~i~~s~ee~~~~~~-~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) |+=. .| +++++....- .+.... .+++-+..+++.+= T Consensus 397 g~~~-~i--~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~ 433 (437) T protein:vir:52 397 GLFA-NI--SAEHIEELKNADEFAGN----------FEEPEKMEGAQVQN 433 (437) T ss_pred CCCC-CC--CccccccccCCCCCCCc----------cCCCCCCCCCCCCC Confidence 3211 11 1111111000 000000 00000111111111 No 134 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=83.83 E-value=0.065 Score=27.08 Aligned_cols=406 Identities=7% Similarity=0.031 Sum_probs=160.3 Q ss_pred ChhHHHHHHHHHh--ccCchHHHHHHHHhhcccccCCCCCCcccccc--ccccchHHHHHHHHHHHHHHhhcCccCcccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVE--HDFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~k~~~~~r~~~lk--r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~--~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~ 76 (510) |=.-+.+...+++ ..++.+.| .++.-|...+.. ..+.+-.. -...++--.|++.+|+.+.+ -||.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~g~~~~~~~-~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~------lp~~v 70 (460) T protein:vir:10 1 MANRIIRALRELTGLDNKFNDAF---IKYIGQTFTKYD-NNGKTYLEQGYNINPDVYSCISQMAAKTVA------VPYTI 70 (460) T ss_pred CchhHHHHHhhhhccCCCchHHH---HHhhccccCCCc-cchhhhhHHHHhcchHHHHHHHHHHHhhhh------CceEE Confidence 4333333333222 22344455 455555432211 11111111 12345556777777777643 35543 Q ss_pred cCCChhh-hhhhccCchHHHH-----------HHHHHHHHHHHHHHHHHhcC----CHHHHHHHHHHHHhhCceEEEEeC Q lcl|Aclame:pro 77 SELTDAI-RREADSRDTDITE-----------VTAALARVDRKATQRLFQNA----SLAVLTQVIKLLIVTGNALLYRNS 140 (510) Q Consensus 77 l~~~d~~-~~~~~~~~~~~~~-----------v~~~L~~ve~~~~~~l~~sn----f~~~~~~~~~~l~~~G~~~l~~~~ 140 (510) ....... ..+.......... ....+...+......+.+-| .+.-...++.++..+||+.+|+.. T Consensus 71 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r 150 (460) T protein:vir:10 71 KVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMS 150 (460) T ss_pred EeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 2222110 0000000000000 00111222223333344444 344456667788899999988764 Q ss_pred CC-----Ce-EEEEEe--ceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCe Q lcl|Aclame:pro 141 DE-----AT-VVAWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAM 212 (510) Q Consensus 141 ~~-----~~-~~~~pl--~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~ 212 (510) +. +. ...||| ..+-+..+.+|.+.. +++ .++ T Consensus 151 ~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~--~~~----------------------------~~~----------- 189 (460) T protein:vir:10 151 PDDGINAGVPSQMYVLPAHLIKIVLKDDINLLS--TDS----------------------------PIK----------- 189 (460) T ss_pred cCCCccCceeEEEEEEcCceEEEEEcCCCceee--eee----------------------------eee----------- Confidence 32 22 234444 556666665553321 100 000 Q ss_pred eEEEEEEeeCCeeeccccccccccCceEEEeeee-----cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCcee Q lcl|Aclame:pro 213 DYAEMYHEIDGVRVGETGRWPIHLCPYIVPTWNL-----APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNL 287 (510) Q Consensus 213 ~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~-----~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~l 287 (510) .+.+..+|.... |..++ .+..|+.. ..+..||.||...+...+.......+...........|-++ T Consensus 190 ---~~~~~~~g~~~~----~~~~e--vih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i 260 (460) T protein:vir:10 190 ---SYMLIQGDQFIE----FNEDE--VIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFGFI 260 (460) T ss_pred ---EEEEecCceeEE----ecccc--eEEEecCCCCcccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccee Confidence 011111111110 11111 23334322 22457999999999999999888888777777766777777 Q ss_pred eCCCCccchhhhhcCC-------Cc-------ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--cc-C Q lcl|Aclame:pro 288 VDEAKGAVVDDYQDAE-------MG-------DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--AN-Q 350 (510) Q Consensus 288 v~~~g~~~~~~~~~~~-------~G-------~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~-~ 350 (510) +..++.+.++...... .| .+++++. ++.++... ..+.+. .+..+..+..|.++|=.. .+ . T Consensus 261 ~~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~ 337 (460) T protein:vir:10 261 HGGSTGLTQPQADSLKQRLTEMDKSPDRLSQIAGASGEI-AFTKISLN-TDELKP-FDYLKYDQKAICNALGWSDKLLNN 337 (460) T ss_pred eecCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCc-eEEEccCC-hhHHHH-HHHHHHHHHHHHHHhCCCHHHhCC Confidence 7766666655433221 11 1122221 22333221 234443 455577778888888322 11 1 Q ss_pred CCCCCCCHHHHHHHHHH-HHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCC-ccceeeEE-eecHHHHHHHHHHHH Q lcl|Aclame:pro 351 RDAERVTAEEVRITAEE-AENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLI-TKQHKPAI-ETGLPALSRSAAVQS 427 (510) Q Consensus 351 ~~~~~vTAtEi~~r~~E-~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p-~~~~~~~~-vs~l~~l~r~~~~~~ 427 (510) .++...|-.-+.+.... ....|.|...++..+|-.-| +++.. .....+++ .+.+..+. .+... T Consensus 338 ~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl------------~~~~~~~~~~~i~~d~~~l~~l~--~d~~~ 403 (460) T protein:vir:10 338 NEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKKF------------IKRFKGYENAVIEWDISELPEMQ--TDMVA 403 (460) T ss_pred CCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhh------------cCcccccCCceEEeecchhhhHH--HHHHH Confidence 22222222222222222 23356666666666543322 22211 11223333 22332222 12222 Q ss_pred HHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHH-HHHHHHHH Q lcl|Aclame:pro 428 MLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAE-AEEQRRQA 486 (510) Q Consensus 428 ~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~-~~~~~qqa 486 (510) ...+++ ..+..+-++...++.+-+-+.-+|.+=+|. .+++.++.-+.. -..+.|++ T Consensus 404 ~~~~~~--~g~~T~NE~R~~~g~~pi~~~~gD~~~~~~-n~~~~~~~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 404 MASWLN--TIPVTPNEIRIAMKYETLNQDGMDIVFMPS-NKVRIDDVSNNLIDSAFNQNQ 460 (460) T ss_pred HHHHHh--CCCCCHHHHHHHhCCCCCCCCCCCeeeecc-cccchhhcccccCCCcccCCC Confidence 222221 011111222222222222111122222221 122211100000 00000000 No 135 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=83.15 E-value=0.071 Score=26.89 Aligned_cols=343 Identities=10% Similarity=0.040 Sum_probs=139.6 Q ss_pred HHHHHHHhccCchHHHHHHHHhhcccccCCCCCCccccc--cccc-cchHHHHHHHHHHHHHHhhcCccCcccccCCChh Q lcl|Aclame:pro 6 AMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVV--EHDF-QSAGALLVNNLAAKLARSLFPTGIPFFRSELTDA 82 (510) Q Consensus 6 ~~r~~~lkr~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~--~~~~-dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~ 82 (510) =+.|.++...+-.+. .....++.+...... .++... .... .++--.|++.+|+.+.+. ||--...... T Consensus 1 Mg~f~~~~~~~~~~~-~~~~~~~~~~~~~~~--~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~------~~~~~~~~~~ 71 (382) T protein:vir:48 1 MPIFNLATESPPDNQ-GGFFDVVDSDFLASL--KGNEWVSAETALRNSDLFSIINQLSNDLATV------KLITSRKKLQ 71 (382) T ss_pred CccccccccCCcccc-cccccchhhhccccc--cCCcccchHhhhccHHHHHHHHHHHHhhccC------ceeeecchhh Confidence 122233311100000 011111111111000 000000 0111 233334555555555332 3321111100 Q ss_pred hhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcC----CHHHHHHHHHHHHhhCceEEEEeCCC-C-eEEEEEe--ceEE Q lcl|Aclame:pro 83 IRREADSRDTDITEVTAALARVDRKATQRLFQNA----SLAVLTQVIKLLIVTGNALLYRNSDE-A-TVVAWSL--RSYA 154 (510) Q Consensus 83 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~sn----f~~~~~~~~~~l~~~G~~~l~~~~~~-~-~~~~~pl--~~~~ 154 (510) . .+.+-| .+.=+...+.+|...||+.+++..+. + ....+|| ..+- T Consensus 72 ~---------------------------L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~ 124 (382) T protein:vir:48 72 G---------------------------IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVS 124 (382) T ss_pred h---------------------------hhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeE Confidence 0 122233 34445666778889999998876543 2 2344554 4444 Q ss_pred EeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeecccccccc Q lcl|Aclame:pro 155 VRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPI 234 (510) Q Consensus 155 v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~ 234 (510) +..+.+|... ++-+ ..++........++. T Consensus 125 v~~~~~~~~~-------------------------------------------------~y~~--~~~~~~~~~~~~~~~ 153 (382) T protein:vir:48 125 FNRLDNKDGI-------------------------------------------------YYNI--TFDDPRIPPKQHVPQ 153 (382) T ss_pred EEEcCCCCeE-------------------------------------------------EEEE--EecCccccceeEEcC Confidence 4454443211 1111 111111111112221 Q ss_pred ccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcC----------CC Q lcl|Aclame:pro 235 HLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDA----------EM 304 (510) Q Consensus 235 ~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~----------~~ 304 (510) ++ .+..|+....+..||.||...+...+...+...+.......-...|.+++.-++.+.++..... .. T Consensus 154 ~e--vih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~n~g 231 (382) T protein:vir:48 154 ND--VLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQG 231 (382) T ss_pred cc--EEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhhccCCC Confidence 22 5566666677889999999999999999999999888888888888887765555555433211 11 Q ss_pred cce-ecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHH Q lcl|Aclame:pro 305 GDY-VPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAE 381 (510) Q Consensus 305 G~~-~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~ 381 (510) |.+ ++++. .+.++... ..+.+. .+..+..+..|.++|-.. .+...+. -|..| .....-....|-|.+.++.. T Consensus 232 ~~~vl~~g~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~afgVp~~~lg~~~~-~~~~~-~~~~~~~~~~l~p~~~~i~~ 306 (382) T protein:vir:48 232 GPLVLDDLE-DFTPLEIK-SNVSQL-LKQADWTTGQFAKVYGIPDNVVGGQGD-QQSSL-EMSSDLYSKAVSRYLRPFLS 306 (382) T ss_pred CeeEcCCCc-eEEEccCC-hhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCC-cccHH-HHHHHHHHHHHHHHHHHHHH Confidence 212 22222 23333322 234443 355677788899998433 1111111 12222 11233455667777777777 Q ss_pred HHHHHHHHHHHHHHhhcCCCCC--Cc--------cceeeEEeecHHHHHHHHH----HHHHHHHHHHHHhh-cChHhHhh Q lcl|Aclame:pro 382 NLQSPLAYVCLSEVDDALLQGL--IT--------KQHKPAIETGLPALSRSAA----VQSMLNASQVIAGL-APIAQLDP 446 (510) Q Consensus 382 E~l~Pli~r~~~il~~~~l~~~--p~--------~~~~~~~vs~l~~l~r~~~----~~~~~~~~q~~~~~-~~~~q~~~ 446 (510) |+-.-|..+. .....+.+ .. +.++.-+.+.-.......+ ...+.........+ +|...- T Consensus 307 ~l~~~l~~~~----~~~~~~~~~~~~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~~~~~~~~~GGd~~~-- 380 (382) T protein:vir:48 307 ELSQKLSCDV----DADIFPAVDPTGSNYISRINSLVKTGTLAQNQGLYILQQAEILPKELPNGENPNSTLKGGEEDG-- 380 (382) T ss_pred HHHHHhcChh----hhhhhhhhccchhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcchhhhhcCCCCCCCCCCCC-- Confidence 7544332111 00000000 00 0111111111111110000 00011111110111 111111 Q ss_pred cCC Q lcl|Aclame:pro 447 RIS 449 (510) Q Consensus 447 ~id 449 (510) =| T Consensus 381 -~~ 382 (382) T protein:vir:48 381 -QD 382 (382) T ss_pred -CC Confidence 11 No 136 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=83.11 E-value=0.071 Score=26.88 Aligned_cols=428 Identities=12% Similarity=-0.000 Sum_probs=147.7 Q ss_pred ChhHHHHHHHHHh-ccCchHHHHHHHHhh--cccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCccccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-DGSVEQRAIEFAKTT--LPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~k~~~~~r~~~lk-r~~~~~~w~e~~~~~--~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l 77 (510) =-+.+.....++. +.+-....+++|+-- +|++-+..........+++..+-+...++.+++.|++- ++. + T Consensus 6 ~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~------~~~-~ 78 (456) T protein:vir:10 6 PAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPN------GIT-V 78 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccC------Cee-c Confidence 0111111111221 111111223333321 11110000011111122344555666666666654322 222 2 Q ss_pred CCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeC--CC-CeEEEEEeceEE Q lcl|Aclame:pro 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS--DE-ATVVAWSLRSYA 154 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~--~~-~~~~~~pl~~~~ 154 (510) ...++. +.. ..+.+.+.++++.....++.++..++|.+.+++.. +. .++++++..+.+ T Consensus 79 ~~~~d~------------~~~-------~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~ 139 (456) T protein:vir:10 79 GGSADS------------DLA-------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMV 139 (456) T ss_pred CCCCCc------------chH-------HHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeE Confidence 211110 001 11223356678889999999999999998655543 32 246777666655 Q ss_pred EeeCCC-Cc-eeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeecccccc Q lcl|Aclame:pro 155 VRRDAT-GR-WMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRW 232 (510) Q Consensus 155 v~~d~~-G~-v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y 232 (510) +..|+. ++ +...+|.++ ..+ ..+ .......++.....|..++....... .......+.......... T Consensus 140 ~i~d~~~~~~~~~~i~~~~-~~d----~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 208 (456) T protein:vir:10 140 VSVDPLQPWRIRAAMRWWR-DLD----AES----DFAIVWSGDGWQKFARPCFVQSSSRR--RLVTRISDSWVPVGDAVV 208 (456) T ss_pred EEEcCCCCcceEEEEEEEE-ecC----Cce----eEEEEEeccceeEEEEEEEEeecccc--eeeeecCCceeeccccCC Confidence 555543 33 333444432 110 000 00000111222222221111111100 111111111111111222 Q ss_pred ccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceee----------CCCCc-cchhhhhc Q lcl|Aclame:pro 233 PIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLV----------DEAKG-AVVDDYQD 301 (510) Q Consensus 233 ~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv----------~~~g~-~~~~~~~~ 301 (510) .+..+|++.. .+..|.|-.+..++-+..++...-..+..+...+.|...+ +.+|- +++..... T Consensus 209 ~~~~~pvv~~------~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~ 282 (456) T protein:vir:10 209 TGSPPPVVVY------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFE 282 (456) T ss_pred CCCceeEEEe------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhh Confidence 2234555432 2357899999999988888877666555555544433222 11111 11111222 Q ss_pred CCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh--cccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHH Q lcl|Aclame:pro 302 AEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY--GANQRDAERVTAEEVRITAEEAENTLGGTYSLL 379 (510) Q Consensus 302 ~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~--~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl 379 (510) ...|.+.... .+.+..++. .++++.....++.+...|...=-. ..+..+..+.|+.-|.....-+.. -.++. T Consensus 283 ~~~~~~~~~~-~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~----k~~~~ 356 (456) T protein:vir:10 283 AAPGALWELP-PGVDIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLF----KCEDR 356 (456) T ss_pred hhccccccCC-CCcceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHH----HHHHH Confidence 2223322211 112222222 234554444444444433221000 011112234455533322111111 11111 Q ss_pred HHHHHHHHHHHHHHHHhh-cCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHH Q lcl|Aclame:pro 380 AENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIW 458 (510) Q Consensus 380 ~~E~l~Pli~r~~~il~~-~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a 458 (510) + ..+.+-+.+.+.++.. .+.+ ....+++...-+. +-..++.++.+..+.+ ++.+. .. ... T Consensus 357 ~-~~f~~~l~~~~rl~~~~~g~~--~~~~~~v~w~~~~-~~~~~~~ada~~kl~~-----~gi~~-------~~---~~~ 417 (456) T protein:vir:10 357 L-SIAKIGLEAILVKALQIEGES--VEDTVDVSFESPD-RVTLGEKYSAASLAKA-----AGESW-------AS---IRR 417 (456) T ss_pred H-HHHHHHHHHHHHHHHHhcCCC--cccceeEEecCCC-CcCHHHHHHHHHHHHH-----cCCCh-------HH---HHH Confidence 1 2223333444444321 2222 1223454443221 1111222222221111 12111 01 112 Q ss_pred HHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 459 AAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 459 ~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ..+|+.+..+ ++.+..+.++++.+ + .+++.+. +..-|- T Consensus 418 ~~lg~~~~~i----~~~e~er~~~e~~~---~------~~~~~~~-~~~~~~ 455 (456) T protein:vir:10 418 NILNYNADQI----KQDDLDRAREQITL---F------AGNPVQR-PQEDGS 455 (456) T ss_pred hhCCCCHHHH----HHHHHHHHHHHHHH---H------hhhhhhc-CCCCCC Confidence 3456654311 11111111111111 1 1111111 112222 No 137 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=83.11 E-value=0.071 Score=26.88 Aligned_cols=428 Identities=12% Similarity=-0.000 Sum_probs=147.7 Q ss_pred ChhHHHHHHHHHh-ccCchHHHHHHHHhh--cccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCccccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-DGSVEQRAIEFAKTT--LPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) Q Consensus 1 ~k~~~~~r~~~lk-r~~~~~~w~e~~~~~--~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l 77 (510) =-+.+.....++. +.+-....+++|+-- +|++-+..........+++..+-+...++.+++.|++- ++. + T Consensus 6 ~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~------~~~-~ 78 (456) T protein:vir:10 6 PAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPN------GIT-V 78 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccC------Cee-c Confidence 0111111111221 111111223333321 11110000011111122344555666666666654322 222 2 Q ss_pred CCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeC--CC-CeEEEEEeceEE Q lcl|Aclame:pro 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNS--DE-ATVVAWSLRSYA 154 (510) Q Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~--~~-~~~~~~pl~~~~ 154 (510) ...++. +.. ..+.+.+.++++.....++.++..++|.+.+++.. +. .++++++..+.+ T Consensus 79 ~~~~d~------------~~~-------~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~ 139 (456) T protein:vir:10 79 GGSADS------------DLA-------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMV 139 (456) T ss_pred CCCCCc------------chH-------HHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeE Confidence 211110 001 11223356678889999999999999998655543 32 246777666655 Q ss_pred EeeCCC-Cc-eeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeecccccc Q lcl|Aclame:pro 155 VRRDAT-GR-WMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRW 232 (510) Q Consensus 155 v~~d~~-G~-v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y 232 (510) +..|+. ++ +...+|.++ ..+ ..+ .......++.....|..++....... .......+.......... T Consensus 140 ~i~d~~~~~~~~~~i~~~~-~~d----~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 208 (456) T protein:vir:10 140 VSVDPLQPWRIRAAMRWWR-DLD----AES----DFAIVWSGDGWQKFARPCFVQSSSRR--RLVTRISDSWVPVGDAVV 208 (456) T ss_pred EEEcCCCCcceEEEEEEEE-ecC----Cce----eEEEEEeccceeEEEEEEEEeecccc--eeeeecCCceeeccccCC Confidence 555543 33 333444432 110 000 00000111222222221111111100 111111111111111222 Q ss_pred ccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceee----------CCCCc-cchhhhhc Q lcl|Aclame:pro 233 PIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLV----------DEAKG-AVVDDYQD 301 (510) Q Consensus 233 ~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv----------~~~g~-~~~~~~~~ 301 (510) .+..+|++.. .+..|.|-.+..++-+..++...-..+..+...+.|...+ +.+|- +++..... T Consensus 209 ~~~~~pvv~~------~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~ 282 (456) T protein:vir:10 209 TGSPPPVVVY------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFE 282 (456) T ss_pred CCCceeEEEe------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhh Confidence 2234555432 2357899999999988888877666555555544433222 11111 11111222 Q ss_pred CCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhh--cccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHH Q lcl|Aclame:pro 302 AEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY--GANQRDAERVTAEEVRITAEEAENTLGGTYSLL 379 (510) Q Consensus 302 ~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~--~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl 379 (510) ...|.+.... .+.+..++. .++++.....++.+...|...=-. ..+..+..+.|+.-|.....-+.. -.++. T Consensus 283 ~~~~~~~~~~-~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~----k~~~~ 356 (456) T protein:vir:10 283 AAPGALWELP-PGVDIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLF----KCEDR 356 (456) T ss_pred hhccccccCC-CCcceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHH----HHHHH Confidence 2223322211 112222222 234554444444444433221000 011112234455533322111111 11111 Q ss_pred HHHHHHHHHHHHHHHHhh-cCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHH Q lcl|Aclame:pro 380 AENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIW 458 (510) Q Consensus 380 ~~E~l~Pli~r~~~il~~-~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a 458 (510) + ..+.+-+.+.+.++.. .+.+ ....+++...-+. +-..++.++.+..+.+ ++.+. .. ... T Consensus 357 ~-~~f~~~l~~~~rl~~~~~g~~--~~~~~~v~w~~~~-~~~~~~~ada~~kl~~-----~gi~~-------~~---~~~ 417 (456) T protein:vir:10 357 L-SIAKIGLEAILVKALQIEGES--VEDTVDVSFESPD-RVTLGEKYSAASLAKA-----AGESW-------AS---IRR 417 (456) T ss_pred H-HHHHHHHHHHHHHHHHhcCCC--cccceeEEecCCC-CcCHHHHHHHHHHHHH-----cCCCh-------HH---HHH Confidence 1 2223333444444321 2222 1223454443221 1111222222221111 12111 01 112 Q ss_pred HHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 459 AAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 459 ~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ..+|+.+..+ ++.+..+.++++.+ + .+++.+. +..-|- T Consensus 418 ~~lg~~~~~i----~~~e~er~~~e~~~---~------~~~~~~~-~~~~~~ 455 (456) T protein:vir:10 418 NILNYNADQI----KQDDLDRAREQITL---F------AGNPVQR-PQEDGS 455 (456) T ss_pred hhCCCCHHHH----HHHHHHHHHHHHHH---H------hhhhhhc-CCCCCC Confidence 3456654311 11111111111111 1 1111111 112222 No 138 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=81.11 E-value=0.089 Score=26.35 Aligned_cols=396 Identities=12% Similarity=0.029 Sum_probs=143.6 Q ss_pred ChhHHHHHHHHHh---ccCchHHH-HHHHHhhcccc---cCCCCCCccccc--cccc-cchHHHHHHHHHHHHHHhhcCc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR---DGSVEQRA-IEFAKTTLPYL---MVDPMSGSRGVV--EHDF-QSAGALLVNNLAAKLARSLFPT 70 (510) Q Consensus 1 ~k~~~~~r~~~lk---r~~~~~~w-~e~~~~~~P~~---~~~~~~~~~~~~--~~~~-dstg~~a~~~Laa~l~~~ltpp 70 (510) |-+.+.+.....+ ++... .| ......+.|.. |.......+... .+.. .++--.|++.+|+.+.+ T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~~al~~~~V~~~i~~ia~~ia~----- 74 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPRSSLF-GWGGKTIRLTDGAFWSQFLGRESSSGKKVTVDKAMKLSAVWACVRLISTSVAG----- 74 (434) T ss_pred Cccchhhhhhhcccccchhhh-cccccccccCchHHHHHHhcCCccCCceechhhhhccHHHHHHHHHHHHhhhh----- Confidence 5555544444332 22211 11 11111111211 111111111110 0111 23334555666655554 Q ss_pred cCcccccCCC-hhhhhhhccCchHHHHHHHHHHHHHHHHHHHHH-hcCCH----HHHHHHHHHHHhhCceEEEEeCCCCe Q lcl|Aclame:pro 71 GIPFFRSELT-DAIRREADSRDTDITEVTAALARVDRKATQRLF-QNASL----AVLTQVIKLLIVTGNALLYRNSDEAT 144 (510) Q Consensus 71 ~~~WF~l~~~-d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~snf~----~~~~~~~~~l~~~G~~~l~~~~~~~~ 144 (510) -||.-..-. +....+ + .+..+...|. +-|-+ .-....+.++...||+.+|+..+.++ T Consensus 75 -lp~~~~~~~~~g~~~~----------~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~~G~ 137 (434) T protein:vir:43 75 -LPLGVYERKADGSRVD----------A------RSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRAAGR 137 (434) T ss_pred -CceEEEEEcCCCcccc----------c------cccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCc Confidence 255322211 110000 0 0112223343 34443 33455577888999999888766554 Q ss_pred -EEEEEe--ceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEee Q lcl|Aclame:pro 145 -VVAWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEI 221 (510) Q Consensus 145 -~~~~pl--~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~ 221 (510) ...+|| ..+-+..|.+|++- |+. + .. T Consensus 138 ~~~L~~l~p~~v~~~~~~~g~~~--y~~------------------------------------------------~-~~ 166 (434) T protein:vir:43 138 PAALDFLLPSRVDLECDENGRLK--YFY------------------------------------------------T-TK 166 (434) T ss_pred EEEEEEEcCcceEEEEcCCCeEE--EEE------------------------------------------------E-ec Confidence 344555 45555565555321 110 0 00 Q ss_pred CCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhc Q lcl|Aclame:pro 222 DGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD 301 (510) Q Consensus 222 ~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~ 301 (510) +|... .+..++ .+..|....+| .||.||...+...+.......+.......-...|..++.-++.+.++.... T Consensus 167 ~g~~~----~~~~~e--Vih~~~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~ 239 (434) T protein:vir:43 167 KGARR----EIERTN--MLHIPAFTLDG-RIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRILQPAQREE 239 (434) T ss_pred CceEE----EEcccc--EEEecCcCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCCCCHHHHHH Confidence 11000 000011 22233332334 799999999988888888777777766666666766665455555543221 Q ss_pred CC----------C-c--ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--cc-CCCCCCCCHHHHHHHH Q lcl|Aclame:pro 302 AE----------M-G--DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--AN-QRDAERVTAEEVRITA 365 (510) Q Consensus 302 ~~----------~-G--~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~-~~~~~~vTAtEi~~r~ 365 (510) .. + | .+++++. ...++.. +..+.+. .+..+.....|.++|=.. .+ ..+....+.+-+.+.. T Consensus 240 ~r~~~~~~~g~~nag~~~vl~~g~-~~~~l~~-~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~ 316 (434) T protein:vir:43 240 FREYVKSVSGAMNSGRSPVLEQGI-TPETIGI-NPVDAQL-LETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQM 316 (434) T ss_pred HHHHHHHhcCccccCCccccCCCc-eEEEccC-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHH Confidence 11 1 1 1223222 2233322 2235554 344566678888888332 11 1122222222222221 Q ss_pred HH-HHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHh-hcChHh Q lcl|Aclame:pro 366 EE-AENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAG-LAPIAQ 443 (510) Q Consensus 366 ~E-~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~-~~~~~q 443 (510) .. ....|.|.+.+++.+ |.+..+++.......+++- ++.+-|+--......+...+.. +.-+-+ T Consensus 317 ~~f~~~~L~P~~~~ie~~------------ln~kL~~~~~~~~~~~~fd--~~~llr~d~~~r~~~~~~~~~~G~~T~NE 382 (434) T protein:vir:43 317 LAFLTFSISSITNQIQQC------------VNKRLLTAPERIRYYAEFS--LEGFLKADSAGRAAWYSTMAQNGFMTRNE 382 (434) T ss_pred HHHHHHHHHHHHHHHHHH------------HHhhcCChhhhcCceEEEe--chhhhccCHHHHHHHHHHHHhCCCcCHHH Confidence 12 222355555555544 3333333322122333332 2223332111111111111111 111122 Q ss_pred HhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHH-HHHHHHHHHHHHHHHH Q lcl|Aclame:pro 444 LDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQ-RRQAAQAQAAQETLLE 498 (510) Q Consensus 444 ~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~-~qqa~~~~~a~~~~~~ 498 (510) +...++.+.+ .. +|.+=+| ..+++- |++......+ .+.+...+..+++..+ T Consensus 383 ~R~~~gl~p~-~g-gD~~~~~-~n~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 383 GRRKENLPEL-PG-GDILTVQ-SNLVPI-DQLGQSNKSQAVRAALMNWFSQPEPQE 434 (434) T ss_pred HHHHhCCCCC-CC-CCeEeec-cCccch-hhhhccCCCcchhhhhhccCCCCCCCC Confidence 2222211111 00 1111111 011111 1111000000 0000000000011111 No 139 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=79.86 E-value=0.1 Score=26.06 Aligned_cols=390 Identities=11% Similarity=0.044 Sum_probs=145.5 Q ss_pred HHHHHhccC---------chHHHHHHHHhhcccccCCCCCCccccc--ccccc-chHHHHHHHHHHHHHHhhcCccCccc Q lcl|Aclame:pro 8 LWEKLRDGS---------VEQRAIEFAKTTLPYLMVDPMSGSRGVV--EHDFQ-SAGALLVNNLAAKLARSLFPTGIPFF 75 (510) Q Consensus 8 r~~~lkr~~---------~~~~w~e~~~~~~P~~~~~~~~~~~~~~--~~~~d-stg~~a~~~Laa~l~~~ltpp~~~WF 75 (510) .|+-++|+. -...|-.+..+.- ..|.+.. .++... .+... ++--.|++.+|..+. . -||. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~-~~g~~v~~~~al~~~~V~~~v~~Ia~~iA-~-----lp~~ 72 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVA-EPFAGAW-QQGVKADPEAVLSFHAVFACISLISQDIA-K-----MRLR 72 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhh-hhhcchh-hcCcccChHHhhccHHHHHHHHHHHHhhc-c-----CceE Confidence 666554322 1223544433211 1111110 111110 01122 222234444444333 2 2564 Q ss_pred ccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcC----CHHHHHHHHHHHHhhCceEEEEeCCC-Ce-EEEEE Q lcl|Aclame:pro 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNA----SLAVLTQVIKLLIVTGNALLYRNSDE-AT-VVAWS 149 (510) Q Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~sn----f~~~~~~~~~~l~~~G~~~l~~~~~~-~~-~~~~p 149 (510) -..-..... ..++... .++..+.+-| .+.=...++.++...||+.+++..+. ++ ...|| T Consensus 73 ~~~~~~~g~---------~~~~~~~------~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~ 137 (454) T protein:vir:93 73 LMQTDAQGI---------RRETRRG------DIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRI 137 (454) T ss_pred EEEeccCCc---------cchhhhH------HHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEE Confidence 322221111 0111111 1222344434 34455666778889999998875542 22 23444 Q ss_pred e--ceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeec Q lcl|Aclame:pro 150 L--RSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVG 227 (510) Q Consensus 150 l--~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~ 227 (510) + ..+-+..+.+|.+- | ++ +.... .... T Consensus 138 i~~~~v~v~~~~~g~~~--y-~~-----------------------------------------------~~~~~-~~~~ 166 (454) T protein:vir:93 138 LDWNRVEPLVADDGEVF--Y-RI-----------------------------------------------TPDRN-CGIT 166 (454) T ss_pred EcCcceEEEEcCCCcEE--E-EE-----------------------------------------------Eeccc-cccc Confidence 4 44444455554321 1 11 00000 0000 Q ss_pred cccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCC---- Q lcl|Aclame:pro 228 ETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE---- 303 (510) Q Consensus 228 ~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~---- 303 (510) ....+..++ .+..|+....+..||.||...+...+.....+.+.......-...|..++.-++.+.++...... T Consensus 167 ~~~~~~~~e--ViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~ 244 (454) T protein:vir:93 167 EAVTVPARE--VIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPGSITEENAKKLKSNWD 244 (454) T ss_pred eeEEecCcc--eEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHHHHHHH Confidence 000111111 33444444456789999999999999888888887777766667777776645555554433221 Q ss_pred ---C----c--ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcc-cCCCCCCCCHHHHHHHHH-HHHHHh Q lcl|Aclame:pro 304 ---M----G--DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA-NQRDAERVTAEEVRITAE-EAENTL 372 (510) Q Consensus 304 ---~----G--~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~vTAtEi~~r~~-E~~~~L 372 (510) . | .+++++. ++.++.. +..+.+.+ +..+..+..|.++|-... .-.+.+.-|-.-+.+... =....| T Consensus 245 ~~~~g~n~g~~~vl~~g~-~~~~l~~-~~~d~q~l-e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l 321 (454) T protein:vir:93 245 SGYTGENAGKTAILSNGA-KYNPTTF-SPVDSQTV-EQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQYYSQCL 321 (454) T ss_pred HHhcccccCCceeccCCc-eEEEccc-ChhHHHHH-HHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHHHHHHHH Confidence 1 1 1122222 2233332 22345443 445667788888883221 001111112211111111 234456 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHH Q lcl|Aclame:pro 373 GGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPK 452 (510) Q Consensus 373 Gpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~ 452 (510) .|.+.++..++-. ..+++ ....+++- +..|-|+--..+.. +...+-..+- +.. T Consensus 322 ~P~~~~ie~~ln~------------~L~~~---~~~~~~f~--~~~ll~~D~~~r~~-~~~~~~~~G~-------~T~-- 374 (454) T protein:vir:93 322 QTLIESIELLLDE------------ALETG---ENESTEFD--VTTLLRMDSERRMK-TLGDAVKNTL-------LTP-- 374 (454) T ss_pred HHHHHHHHHHHHH------------hhcCC---CCcEEEee--chhhhccCHHHHHH-HHHHHHhCCC-------cCH-- Confidence 6777776666432 11111 12233331 12333321111111 1111111111 111 Q ss_pred HHHHHHHHcCCCHhhccCCHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 453 MMDTIWAAFSVDTSQFYKSADELQA-----EAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 453 ~~~~~a~~~Gvp~~~i~~s~ee~~~-----~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) +++-..+|.|+- ...|++-. -.....++....+ ........++ ......++ T Consensus 375 --NE~R~~~gl~pi---~ggD~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~-~~~~~~~~ 430 (454) T protein:vir:93 375 --NEARKRENLPPL---AGGDALYLQQQNYSLEALSRRDARED-PFASSGKTAS-VPQAVAAS 430 (454) T ss_pred --HHHHHHhCCCCC---CCCCeeeeccCccchHhhhccCcccC-CCCCCccCCC-CCCCCCCC Confidence 112223344431 11111100 0000000000000 0000000000 01122222 No 140 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=79.78 E-value=0.1 Score=26.04 Aligned_cols=361 Identities=11% Similarity=0.045 Sum_probs=139.2 Q ss_pred HHHHHHHhccC--chHHHHHHHHhhcccccCCCCCCcccccc--ccccchHHHHHHHHHHHHHHhhcCccCcccccCCCh Q lcl|Aclame:pro 6 AMLWEKLRDGS--VEQRAIEFAKTTLPYLMVDPMSGSRGVVE--HDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELTD 81 (510) Q Consensus 6 ~~r~~~lkr~~--~~~~w~e~~~~~~P~~~~~~~~~~~~~~~--~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d 81 (510) =..|+.+++.+ ....-.+...++.|..+.... .+..-.. -.-.++--.|++.+|+.+.+. |+. + -+ T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~------p~~-~--~~ 70 (386) T protein:vir:48 1 MPIFNITNLATESPPISQGGFFDITDPDFLSTLN-GSEWVSAESALRNSDLFSIINQLSNDLATV------KLT-A--SR 70 (386) T ss_pred Ccccccccccccccccccccccccccchhccccc-CCceechhhhhcchHHHHHHHHHHHhhccC------cee-e--cc Confidence 22344443211 111111112222222111110 0000000 112344446667676666553 221 1 11 Q ss_pred hhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCH----HHHHHHHHHHHhhCceEEEEeCCCC--eEEEEEe--ceE Q lcl|Aclame:pro 82 AIRREADSRDTDITEVTAALARVDRKATQRLFQNASL----AVLTQVIKLLIVTGNALLYRNSDEA--TVVAWSL--RSY 153 (510) Q Consensus 82 ~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~----~~~~~~~~~l~~~G~~~l~~~~~~~--~~~~~pl--~~~ 153 (510) .. . ...+.+.|.+ .-+...+.++...||+.+++..+.. ....+|+ ..+ T Consensus 71 ~~-------------~-----------~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v 126 (386) T protein:vir:48 71 KQ-------------L-----------QGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQV 126 (386) T ss_pred ch-------------h-----------HHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCcee Confidence 00 0 1123344443 3334556678889999888765432 2334444 555 Q ss_pred EEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccccc Q lcl|Aclame:pro 154 AVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWP 233 (510) Q Consensus 154 ~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~ 233 (510) .+.++.+|.. ++.+ + ..++........|. T Consensus 127 ~v~~~~~~~~--~~y~-----------------------------------------------~--~~~~~~~~~~~~~~ 155 (386) T protein:vir:48 127 SFNRLDNKDG--IYYN-----------------------------------------------I--TFDDPRIPPKQHVP 155 (386) T ss_pred EEEEcCCCce--EEEE-----------------------------------------------E--EecCccccceeEec Confidence 5555544321 1111 1 11111111111111 Q ss_pred cccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCC---------C Q lcl|Aclame:pro 234 IHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE---------M 304 (510) Q Consensus 234 ~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~---------~ 304 (510) .++ .+..|.....+..||.||..-+...+.....+.+.......-...|..++..++.+.++...... . T Consensus 156 ~~e--vih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~e~~~~~~~~~~~~~~n~ 233 (386) T protein:vir:48 156 QGD--VLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQ 233 (386) T ss_pred Ccc--EEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHhhcCC Confidence 122 34445555667789999999999999999999998888888878887777655555554332211 1 Q ss_pred c--ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHH Q lcl|Aclame:pro 305 G--DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRDAERVTAEEVRITAEEAENTLGGTYSLLA 380 (510) Q Consensus 305 G--~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~ 380 (510) | .+++++. ++.++... ..+.+. .+..+..++.|-.+|-.. ++...+..-+++|-. + .=....|.|++..+. T Consensus 234 g~~~vl~~g~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~e~~~-~-~~~~~~l~P~~~~ie 308 (386) T protein:vir:48 234 GGPLVLDDLE-EFTPLEIK-SNVSQL-LKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMS-L-DLYNKAVSRYLRPFL 308 (386) T ss_pred CCceecCCCc-eEEEcCCC-hhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHH-H-HHHHHHHHHHHHHHH Confidence 1 1112221 23333221 234443 455677778888888332 111111111222221 1 123344556666555 Q ss_pred HHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHH Q lcl|Aclame:pro 381 ENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAA 460 (510) Q Consensus 381 ~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~ 460 (510) .++-.-|+ +....+ +...+- .+...++..+..+.+ ++ -+..+++ +..... T Consensus 309 ~~l~~~l~------------~~~~~~-~~~~~~--~d~~~~~~~~~~l~~--------~g------~~t~nE~-r~~lg~ 358 (386) T protein:vir:48 309 SELSQKLS------------CDVDAD-ILPAVD--PTGSNSVSRINSMVK--------SG------TLAQNQG-LYILQQ 358 (386) T ss_pred HHHHHhhc------------chhhcc-hhhhhc--cChHHHHHHHHHHHh--------CC------CcCHHHH-HHHhhc Confidence 55433222 111000 000000 011112222211111 00 1111221 111112 Q ss_pred cCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 461 FSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNA 506 (510) Q Consensus 461 ~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~ 506 (510) .|+++..+. ... . ......++.-.-+.. T Consensus 359 ~~~~~~~~~-------~~~------~-----~~~~~~~gGd~~~~~ 386 (386) T protein:vir:48 359 AEILPKELP-------EGE------N-----PNKTTLKGGEINGED 386 (386) T ss_pred CCCCCccch-------hhc------C-----CCCCccCCCCCCCCC Confidence 222211000 000 0 000000000000000 No 141 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=79.49 E-value=0.1 Score=25.97 Aligned_cols=414 Identities=8% Similarity=0.047 Sum_probs=164.6 Q ss_pred ChhHH-HHHHHHHh-ccCchHHHHHHHHh--hcccccCC--CCCC--ccccccccccchHHHHHHHHHHHHHHhhcCccC Q lcl|Aclame:pro 1 MKSTA-AMLWEKLR-DGSVEQRAIEFAKT--TLPYLMVD--PMSG--SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~k~~~-~~r~~~lk-r~~~~~~w~e~~~~--~~P~~~~~--~~~~--~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~ 72 (510) |-... .+.+++.+ |.......+++++= -++.+-.. .... ......++..+.....++..++.|.+ . T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G------~ 74 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFT------Y 74 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheec------c Confidence 32222 22222221 21222222323221 11111000 0000 01111234445555555555543322 1 Q ss_pred cccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC-------C Q lcl|Aclame:pro 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDE-------A 143 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l--~~~~~~-------~ 143 (510) | ..++.++.. +..+.+ ..+..++|.....++.++...+|.+.+ |.+++. + T Consensus 75 p-~~~~~~~~~------------~~~~~~--------~~~~~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~ 133 (451) T protein:vir:10 75 P-VLFDIDNNK------------ELNEKV--------TDVLGNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQ 133 (451) T ss_pred c-ceeecCCcH------------HHHHHH--------HHHhccCHHHHHHHHHHHHhhcCeEEEEEeecCCccccccccc Confidence 1 112222211 111111 123347899999999999999998764 555431 2 Q ss_pred --eEEEEEece-EEEeeCC-CCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEE Q lcl|Aclame:pro 144 --TVVAWSLRS-YAVRRDA-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYH 219 (510) Q Consensus 144 --~~~~~pl~~-~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~ 219 (510) ++.+++..+ |++..|. .+++.-.+|.+......-... .++-...+++|+ ++.-..|..... T Consensus 134 ~~~~~~i~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~----------~~~~~~~~e~yt-----~~~~~~~~~~~~ 198 (451) T protein:vir:10 134 TFKYGVVNTEEIIPIYRNGIERELEAVIRYYIQLEDVKGQI----------QKQAYTYVEFWT-----DKILDKYKFFGV 198 (451) T ss_pred ceeEEEEcccceEEEEcCCCCCceEEEEEEEEeeecccccc----------cceEEEEEEEEe-----CCeEEEEEeccc Confidence 355554444 5555453 477776666663322110000 000011122222 211111111111 Q ss_pred eeCCeee-ccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCC-Cccchh Q lcl|Aclame:pro 220 EIDGVRV-GETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEA-KGAVVD 297 (510) Q Consensus 220 e~~~~~~-~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~-g~~~~~ 297 (510) ...+... ......++..+|++..+. +.+|.|=.+...+-+..+|.+.-......+...+|.+.+.-- +-...+ T Consensus 199 ~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~ 273 (451) T protein:vir:10 199 SCCGSQIEHITVQHRFNSVPFVEFSN-----NIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSE 273 (451) T ss_pred CccccccccccccCCCCeeeEEEecc-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchh Confidence 1111111 111122346788776543 457889899999999999988888888888888887666311 111112 Q ss_pred hhhcCC-Cccee-cC----CccccccccCCCccchHHHHHHHHHHHHHHHHHHhh-cccCCCCCCCCHHHHHH------- Q lcl|Aclame:pro 298 DYQDAE-MGDYV-PG----GAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVRI------- 363 (510) Q Consensus 298 ~~~~~~-~G~~~-~g----~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~-~~~~~~~~~vTAtEi~~------- 363 (510) ...... .+.+. ++ ...+++.+. ...+.+.....++.++..|...-.. +.........|+.-+.. T Consensus 274 ~~~~~~~~~~i~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~ 351 (451) T protein:vir:10 274 FLKELKRYKTIKTETDSEGDSGGLKTMQ--IEIPTEARKIILEILKKQIYESGQGLQQDTENFGNASGVALKFFYRKLEL 351 (451) T ss_pred hHHHHhhCCeEEecCcCCccCCcceEEe--ecCCHHHHHHHHHHHHHHHHHHhCcccccccccccccHHHHHHHHHHHHH Confidence 112111 12221 11 122343332 2346777788888888877664322 11111112345543332 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHh Q lcl|Aclame:pro 364 TAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQ 443 (510) Q Consensus 364 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q 443 (510) ++.+++..++..+. -+++.++.++. .. ....+++.+.-.+.. +....++.+..+++. T Consensus 352 k~~~k~~~f~~~l~--------~~~~li~~~~~---~~--d~~~i~i~f~~~~p~--------n~~e~~~~~~kl~g~-- 408 (451) T protein:vir:10 352 KSGLLETEFRTSFD--------KLIKAILYFLG---VT--DYKKIQQTYTRNMMS--------NDLEDADIATKSVGI-- 408 (451) T ss_pred HHHHHHHHHHHHHH--------HHHHHHHHHhC---CC--CccceeEEecCCCCC--------CHHHHHHHHHHHhcc-- Confidence 23333333333222 22222222221 11 223455555433221 111111222222221 Q ss_pred HhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 444 LDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTN 505 (510) Q Consensus 444 ~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~ 505 (510) +.-..++.. + | ++.++++..++..++++.+. ++ .+...+.... T Consensus 409 ----iS~et~~~~----~--p---~v~d~~~e~~~~~ee~~~~~-~~-----~~~~~~~~~~ 451 (451) T protein:vir:10 409 ----IPTKIILRH----H--P---WVDDVEEAEKLYLEEKKIQA-SK-----VSDDYNNFTE 451 (451) T ss_pred ----CchHHHHHh----C--C---CCCCHHHHHHHHHHHHHHHH-HH-----HHhhcCCCCC Confidence 222222222 2 1 33344433333322222111 11 1111122222 No 142 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=78.64 E-value=0.11 Score=25.79 Aligned_cols=405 Identities=13% Similarity=0.055 Sum_probs=148.1 Q ss_pred HHHHHHHh-c--cC-----chHHHHHHHHhhcccccCCCCCCcccccc--ccc-cchHHHHHHHHHHHHHHhhcCccCcc Q lcl|Aclame:pro 6 AMLWEKLR-D--GS-----VEQRAIEFAKTTLPYLMVDPMSGSRGVVE--HDF-QSAGALLVNNLAAKLARSLFPTGIPF 74 (510) Q Consensus 6 ~~r~~~lk-r--~~-----~~~~w~e~~~~~~P~~~~~~~~~~~~~~~--~~~-dstg~~a~~~Laa~l~~~ltpp~~~W 74 (510) =..|..|. | .+ -...|..+.-... ..+. .. .++.... ... .++--.|++.+|..+.+. || T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~-~~g~~v~~~~al~~~~v~~~i~~ia~~iA~l------p~ 71 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIY-NLGA-TA-SSGERVTPHDALQVSAVFASVRLLSETIATL------PL 71 (457) T ss_pred Cchhhhhhccccccccccccccccccchhhhh-hccc-cc-cCCceechHHhhccHHHHHHHHHHHHhHhhC------ce Confidence 23333331 1 01 0111111110000 0010 00 1111100 011 233344555555555433 33 Q ss_pred cccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHh----cCCHHHHHHHHHHHHhhCceEEEEeCCCCe-EEEEE Q lcl|Aclame:pro 75 FRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQ----NASLAVLTQVIKLLIVTGNALLYRNSDEAT-VVAWS 149 (510) Q Consensus 75 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~----snf~~~~~~~~~~l~~~G~~~l~~~~~~~~-~~~~p 149 (510) .=..-.+.... +++. ..+...+.+ -+.+.-+..+..++...||+.+++..+.++ ...+| T Consensus 72 ~~~~~~~~~~~----------~~~~------~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~~g~~~~l~~ 135 (457) T protein:vir:62 72 STYSKRGGTRK----------EIDT------PEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWAGPNIAGLDV 135 (457) T ss_pred EEEEecCCccc----------cccc------hHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCcEEEEEE Confidence 21111111110 0110 011112222 235666777888889999999888665543 34455 Q ss_pred e--ceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeec Q lcl|Aclame:pro 150 L--RSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVG 227 (510) Q Consensus 150 l--~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~ 227 (510) | ..+.+.++..+... ...|..+.+..+|.... T Consensus 136 l~p~~v~v~~~~~~~~~----------------------------------------------~~~~~~y~~~~~g~~~~ 169 (457) T protein:vir:62 136 LDPTKIHVHMVMVDGLR----------------------------------------------RKVFEAYDIDADGNEVL 169 (457) T ss_pred EcCcceEEEEeccCCcc----------------------------------------------ceeEEEEEEccCCceeE Confidence 4 23333333222110 00111111222221110 Q ss_pred cccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCC---- Q lcl|Aclame:pro 228 ETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE---- 303 (510) Q Consensus 228 ~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~---- 303 (510) ...|..++ .|..|.....+..||.||...+...+.....+.+.......-...|..++.-++.+.++...... T Consensus 170 -~~~~~~~e--iih~r~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~ 246 (457) T protein:vir:62 170 -LGWFTPRD--VLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGTMSEEGLARAREAWR 246 (457) T ss_pred -EEeeCccc--eEEecCCCCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCCCCHHHHHHHHHHHH Confidence 11111122 45555555567789999999999888888888877777766667777766655666665443221 Q ss_pred ---Cc-------ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--cc-CCCCCCCCHHHHHHHHHH-HH Q lcl|Aclame:pro 304 ---MG-------DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--AN-QRDAERVTAEEVRITAEE-AE 369 (510) Q Consensus 304 ---~G-------~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~-~~~~~~vTAtEi~~r~~E-~~ 369 (510) .| .+++++. +..++.+. ..+.+. .+..+..+..|.++|-.. .+ ..+....+..-+.+.... .. T Consensus 247 ~~~~G~~nag~~~vl~~g~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~ 323 (457) T protein:vir:62 247 AANSGVDNAHRVALLTEGA-KFSKVAMS-PDEAQF-LQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTM 323 (457) T ss_pred HHhcCccccCcceecCCCc-eEEEccCC-hhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHH Confidence 11 1122221 22233221 234444 344456777888888332 11 111111222223222222 22 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHh-hcChHhHhhcC Q lcl|Aclame:pro 370 NTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAG-LAPIAQLDPRI 448 (510) Q Consensus 370 ~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~-~~~~~q~~~~i 448 (510) ..|.|.+.++..+ +.+..+++.......+++- ++.+-|+--......+...++. +.-+-++...+ T Consensus 324 ~~l~P~~~~ie~~------------ln~~L~~~~~~~~~~i~fd--~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~ 389 (457) T protein:vir:62 324 FSLRPWLERIEAG------------FNRLLFAETADRFRFVKFN--LDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAE 389 (457) T ss_pred HHHHHHHHHHHHH------------HHhhhcCccccCceEEEee--chhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Confidence 2445555554444 3333333322222233331 2223332111111111111111 11112222222 Q ss_pred CHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 449 SLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 449 d~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) +.+.+=.-.++.+=+|. .+....++.+ .+.+.......+...+.+ ..+.+.|. T Consensus 390 gl~pi~~g~~D~~~~~~-n~~~~~~~~~------~~~~~~~~~~~~~~~~~~--~~~~~~~~ 442 (457) T protein:vir:62 390 DMTPLPDGLGEKYRVPL-NLGEIGEEPE------PEPAPAPPAIDPPAEEPA--DDEEPDNA 442 (457) T ss_pred CCCCCCCCCcceeeecc-cccccccccc------ccccCCCccCCCCccCCC--CCCCCCCC Confidence 22211111112222221 1221111110 000000000000000111 11122222 No 143 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=75.01 E-value=0.15 Score=25.08 Aligned_cols=425 Identities=8% Similarity=-0.009 Sum_probs=160.4 Q ss_pred Chh-------HHH---HHHHHHhccCchHHHHHHHHhhcccccCCC-CCCc----cccccc--cccchHHHHHHHHHHHH Q lcl|Aclame:pro 1 MKS-------TAA---MLWEKLRDGSVEQRAIEFAKTTLPYLMVDP-MSGS----RGVVEH--DFQSAGALLVNNLAAKL 63 (510) Q Consensus 1 ~k~-------~~~---~r~~~lkr~~~~~~w~e~~~~~~P~~~~~~-~~~~----~~~~~~--~~dstg~~a~~~Laa~l 63 (510) +.- .+. ..|+.-..++ -..|. -|..-.+. .... ..+... .-++.+..+++.+++.+ T Consensus 11 ~sP~~~~~R~~ar~~~~~y~aa~~~r-~~~~~------~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nv 83 (502) T protein:vir:79 11 FSPGWKAARLRSRAVIQAYEAVKTTR-THKAR------RENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLEERV 83 (502) T ss_pred cChHHHHHHHhhHHHHhhccccCccc-ccCCC------CCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhh Confidence 211 111 1122111111 00010 00000000 0000 011111 24788999999999999 Q ss_pred HH--hhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe-- Q lcl|Aclame:pro 64 AR--SLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN-- 139 (510) Q Consensus 64 ~~--~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~-- 139 (510) ++ +++|..++=. .+...++ .....++.....-.+. ...=.+.+||.....++...+.-|-+++... T Consensus 84 VG~ggi~~~~~~~~----~~~~~~~-----~~~~~ie~~w~~Wa~~-~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~ 153 (502) T protein:vir:79 84 VGKNGIIVEPHPVL----RNGAIAR-----DLAAEIRTRWSEWSVS-PEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSG 153 (502) T ss_pred ccCCceeeeeccCC----CChhHHH-----HHHHHHHHHHHHhhcC-cCccccCCHHHHHHHHHHHHHhCCceEEEEeec Confidence 96 5666554411 1111110 1112222211111111 1222356899999999999999998775432 Q ss_pred CCC-----C----eEEEEEeceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCC Q lcl|Aclame:pro 140 SDE-----A----TVVAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGT 210 (510) Q Consensus 140 ~~~-----~----~~~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~ 210 (510) ++. . .++.++...+-.-.+ +|+ ..+.+.+.+.+.+..=||.....++. T Consensus 154 ~~~~~~~g~~~~l~lq~iepd~l~~~~~-~~~----------------------~i~~GVe~d~~Gr~~aY~i~~~hPgd 210 (502) T protein:vir:79 154 RINSLTPSAGVHFWLEALEPDFIPMTSD-ESN----------------------RLNQGVFVDDWGRPEKYLVYKSRPVS 210 (502) T ss_pred ccCccCCCcccceEEEEecchhcCCCCC-CCC----------------------eeEeeeEECCCCceEEEEEeecCCCC Confidence 211 0 122222211110000 000 00111112222222223322222111 Q ss_pred CeeEEEEEEeeCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC- Q lcl|Aclame:pro 211 AMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD- 289 (510) Q Consensus 211 ~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~- 289 (510) .....+ ..+.. ++ ++..-....+|..-|.+...-+|..++.|..+..+.+.++..++.....+. T Consensus 211 ~~~~~~-------~rvpA------~~--vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~ 275 (502) T protein:vir:79 211 GRQMET-------KEVDA------ER--MLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRK 275 (502) T ss_pred Ccccce-------eEech------hh--eEEeecccCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeec Confidence 100000 01110 00 222223456889999999999999999999999999999988887766654 Q ss_pred CCCcc-chh--------hhhcCCCcceecC-C-ccccccccCC-CccchHHHHHHHHHHHHHHHHHHhh--cccCCCCCC Q lcl|Aclame:pro 290 EAKGA-VVD--------DYQDAEMGDYVPG-G-AEAVRAYERG-DYNKMAAIQQSLQAVVVRLNQAFMY--GANQRDAER 355 (510) Q Consensus 290 ~~g~~-~~~--------~~~~~~~G~~~~g-~-~~~v~~~~~~-~~~~~~~~~~~i~~~~~~I~~af~~--~~~~~~~~~ 355 (510) +++-. .+. ......+|.+++. . ...++..... ...++. .-...+...|..++=+ ..+..|-.. T Consensus 276 ~~~~~~~~~~~~~~~~~~~~~l~pG~i~~~L~pGe~i~~~~p~~p~~~~~---~f~~~~lr~iaaglGi~ye~lt~D~s~ 352 (502) T protein:vir:79 276 GDGQSYEPDGNGSKENERELTIQPGIIYDDLKPGEEIGMVKSDRPNPNLE---TFRNGQLRAVAAGSRLSFSSTARNYNG 352 (502) T ss_pred CCCcccccccCCCCCccccccccCCccccccCCCceeeeeCCCCCCCCHH---HHHHHHHHHHHhhcCCCHHHHhccccc Confidence 22110 000 0111234544332 1 2234444332 223443 2223333334444311 123333221 Q ss_pred CCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccc-----eeeEEeec----HHHHHHHHHHH Q lcl|Aclame:pro 356 VTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQ-----HKPAIETG----LPALSRSAAVQ 426 (510) Q Consensus 356 vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~-----~~~~~vs~----l~~l~r~~~~~ 426 (510) +=.-++.-..|....+--.=..+...|+.|+..+.+..+.-.|..++|... .+.+.+.+ ++++--++ T Consensus 353 -nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~--- 428 (502) T protein:vir:79 353 -TYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAE--- 428 (502) T ss_pred -hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHH--- Confidence 323333333333333333233344567778877777766545555555321 22222221 23322111 Q ss_pred HHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 427 SMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNA 506 (510) Q Consensus 427 ~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~ 506 (510) ... ..+. ++. .. ...++...|.++. ++-.++.++.+..... .......+......++ T Consensus 429 a~~---~~i~--~Gl-------~t---~~~~~a~~G~D~~-------~v~~q~a~e~~~~~~~-Gl~~~~~~~~~~~~~~ 485 (502) T protein:vir:79 429 AWK---IQIR--GGA-------AT---ESDWVRAGGRNPD-------DVKRRRKAEIDENRKL-DLVFDTDPASDKGGSS 485 (502) T ss_pred HHH---HHHH--cCC-------CC---HHHHHHHcCCCHH-------HHHHHHHHHHHHHHHc-CCCCCCCCCCCCCCCC Confidence 100 0000 010 00 1122223344442 2211111111111000 0000000000000001 Q ss_pred cCCC Q lcl|Aclame:pro 507 LAGV 510 (510) Q Consensus 507 ~ag~ 510 (510) ..+- T Consensus 486 ~~~~ 489 (502) T protein:vir:79 486 AATK 489 (502) T ss_pred CCCC Confidence 0111 No 144 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=74.73 E-value=0.15 Score=25.03 Aligned_cols=364 Identities=10% Similarity=-0.017 Sum_probs=143.5 Q ss_pred ChhHHHHHHHHHhccCchHHHH-HHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccCC Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAI-EFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSEL 79 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~-e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~ 79 (510) +.+.+...+.. ++.-...+. .+..+ +.....+.... .-.++--.|++.+|+.+.+. ||.-..- T Consensus 3 ~~~~~~~~~~~--~~~~~~~~~~~~~~~-----~g~~~~~~~~a---l~~~~V~~~v~~Ia~~iA~l------p~~~~~~ 66 (411) T protein:vir:81 3 WWSRLTRFFRP--RNETVDMTNPLLLQW-----LGVDPDTPRNQ---LSEATYFACLKILSESLGKL------PLKMYQK 66 (411) T ss_pred hHHHHHhhccC--cccccccchHHHHHH-----hcCcccChhhh---hccHHHHHHHHHHHHhHhhC------ceeEEEe Confidence 22222222211 111111010 01111 11111111111 11233344555555544422 4432222 Q ss_pred ChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCceEEEEeCCCCe-EEEEE--ec Q lcl|Aclame:pro 80 TDAIRREADSRDTDITEVTAALARVDRKATQRLF-QNA----SLAVLTQVIKLLIVTGNALLYRNSDEAT-VVAWS--LR 151 (510) Q Consensus 80 ~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~l~~~G~~~l~~~~~~~~-~~~~p--l~ 151 (510) .+....+. . +..+...|. +-| .+.-+...+.++...||+.+++..+.++ ...|| .. T Consensus 67 ~~~~~~~~---------~-------~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~g~~~~l~~l~~~ 130 (411) T protein:vir:81 67 TERGIVKS---------D-------REELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYSGPQLQALWILPSQ 130 (411) T ss_pred cCCceeee---------c-------ccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCceEEEEEECCc Confidence 22111110 0 011122232 333 3444566677888999999887766554 22344 35 Q ss_pred eEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccc Q lcl|Aclame:pro 152 SYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGR 231 (510) Q Consensus 152 ~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~ 231 (510) .+.+..|..|.+.. . +..++.+....+|... . T Consensus 131 ~v~~~~~~~~~~~~------------------------------~--------------~~~~~~~~~~~~g~~~----~ 162 (411) T protein:vir:81 131 YVTIVVDDRGLLGE------------------------------K--------------NAIWYRYNDPYDGKMY----V 162 (411) T ss_pred eEEEEEcCcccccc------------------------------c--------------ceEEEEEEecCCceEE----E Confidence 55566665553110 0 0001111111122211 1 Q ss_pred cccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCC-------- Q lcl|Aclame:pro 232 WPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE-------- 303 (510) Q Consensus 232 y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~-------- 303 (510) ++.++ .+..|+....+..||.||..-+...+.......+.......-...|..++.-++.+.++...... T Consensus 163 ~~~~e--iih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~ 240 (411) T protein:vir:81 163 FRNDE--ILHFKTSVTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQEARDRLVKGFEQFAN 240 (411) T ss_pred Ecccc--EEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhc Confidence 11122 55566655556789999999999999999988888888877777787776555555555322111 Q ss_pred ---C-c--ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--cc--CCCCCCCCHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 304 ---M-G--DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--AN--QRDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 304 ---~-G--~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~--~~~~~~vTAtEi~~r~~E~~~~LG 373 (510) + | .+++++. ++.++... ..+.+.+ +..+..+..|..+|-.. .+ ..++..-++++.. ..=....|. T Consensus 241 g~~n~g~~~vl~~g~-~~~~l~~~-~~d~q~~-e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~--~~f~~~~l~ 315 (411) T protein:vir:81 241 GSKNAGKIIPVPLGM-KLVPLDIK-LTDSQFF-ELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQN--LAFYVDTLL 315 (411) T ss_pred CccccCCceecCCCc-eEEEccCC-HHHHHHH-HHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHH--HHHHHHHHH Confidence 1 1 1122222 23333322 2345443 44567788898888332 11 1122222333321 112333455 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcCCCCCC-ccceeeEE-eecH---HHHHHHHHHHHHH-----HHHH--HHHhhcCh Q lcl|Aclame:pro 374 GTYSLLAENLQSPLAYVCLSEVDDALLQGLI-TKQHKPAI-ETGL---PALSRSAAVQSML-----NASQ--VIAGLAPI 441 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~l~~~p-~~~~~~~~-vs~l---~~l~r~~~~~~~~-----~~~q--~~~~~~~~ 441 (510) |.+.++..++-.-| +++.. +....+++ ++.+ +...|+.-...+. +.-. ..-.+.+. T Consensus 316 P~~~~ie~~l~~~l------------l~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~ 383 (411) T protein:vir:81 316 YVLKQYEEEITYKI------------LSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPAD 383 (411) T ss_pred HHHHHHHHHHHhhc------------CChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 66655555543322 22110 01111111 0000 1111221111111 0000 11111122 Q ss_pred Hh---Hh---hcCCHHHHHHHHHHHcCCCH Q lcl|Aclame:pro 442 AQ---LD---PRISLPKMMDTIWAAFSVDT 465 (510) Q Consensus 442 ~q---~~---~~id~d~~~~~~a~~~Gvp~ 465 (510) +. .. ..+-.+.+.+...+ |=+. T Consensus 384 ~ggD~~~~~~n~~pl~~~~~~~~k--gGd~ 411 (411) T protein:vir:81 384 DYGNNLMANGNYIPLSMLGANYGK--GGDS 411 (411) T ss_pred CCCCeeeeccCccchhhhhhhhcc--CCCC Confidence 11 00 01222232232221 1111 No 145 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=71.22 E-value=0.2 Score=24.43 Aligned_cols=443 Identities=11% Similarity=0.001 Sum_probs=177.2 Q ss_pred ChhHHHHH---------HHHHhccCchHHHHHHHHhhcccccCC-----CCCCccccccc--cccchHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAML---------WEKLRDGSVEQRAIEFAKTTLPYLMVD-----PMSGSRGVVEH--DFQSAGALLVNNLAAKLA 64 (510) Q Consensus 1 ~k~~~~~r---------~~~lkr~~~~~~w~e~~~~~~P~~~~~-----~~~~~~~~~~~--~~dstg~~a~~~Laa~l~ 64 (510) --....+| |+.-.+++-...|. |.+-.+ +.+.-..+... --+..+..+++.+++.++ T Consensus 12 sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~-------~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvV 84 (548) T protein:vir:95 12 APELVARRLAAREAIQAYEAARPGRTHKAKR-------QPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLEERVV 84 (548) T ss_pred chHHHHHHHHhHHHhccccccCccccccccC-------CCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhcc Confidence 11111111 22222111111111 100000 00000111111 157889999999999999 Q ss_pred H--hhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--C Q lcl|Aclame:pro 65 R--SLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--S 140 (510) Q Consensus 65 ~--~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~--~ 140 (510) + ++.+..++ +..++...+++.. .-...-+.|.+.| +.-.+.+||.....++...+.-|-+++-.. . T Consensus 85 G~~G~~i~p~~---l~~d~~~a~~l~~--~ie~~w~~Wa~~~-----D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~ 154 (548) T protein:vir:95 85 GGSGIGVEPLP---LRLDGSVHAELAM--EIRSAWAEWSLSP-----ETSGELTRPQVERLMCRTWLRDGEGLAQKLMGR 154 (548) T ss_pred Cccccceeeee---cCCCHHHHHHHHH--HHHHHHHHhhcCc-----cccccCCHHHHHHHHHHHHHhCCceEEEeeecc Confidence 7 35443343 3333222111100 0011122232211 222356899999999999999998775432 2 Q ss_pred CCC---------eEEEEEeceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCC Q lcl|Aclame:pro 141 DEA---------TVVAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTA 211 (510) Q Consensus 141 ~~~---------~~~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~ 211 (510) ... .++.+....+....+..|+ . .....+.+.+....-||.....++.. T Consensus 155 ~~~~~~g~~~~~~lqliepd~l~~~~~~~~~---------------------~-i~~GIE~D~~Grp~aY~i~~~hPgd~ 212 (548) T protein:vir:95 155 VPNYTFATSVPFALELLEPDYLPFSYNNLSK---------------------G-IVQGIERDTWRRKRAYHLLKDHPGNL 212 (548) T ss_pred cccccCCcccceEEEEechhhcCCCCCCCCC---------------------c-eeeeeEECCCCceEEEEEeecCCCcc Confidence 111 1222221111110110000 0 01122233334444455443333321 Q ss_pred eeEEEEEEeeCCeeeccccccccccCceEEEe-eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC- Q lcl|Aclame:pro 212 MDYAEMYHEIDGVRVGETGRWPIHLCPYIVPT-WNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD- 289 (510) Q Consensus 212 ~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~R-w~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~- 289 (510) ...... ..-+.++.+ -|+.- ....+|..-|.+...-+|..++.|.....+.+.++..++.....+. T Consensus 213 ~~~~~~---~~~~rvpA~---------~VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~ 280 (548) T protein:vir:95 213 QTLGGS---LAVKRVEAE---------RIIHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKK 280 (548) T ss_pred cccccc---cceeeechh---------HheecccccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeec Confidence 100000 000111111 12222 3455889999999999999999999999999999988888766664 Q ss_pred CCCccc---h-----hhhhcCCCcceecC--CccccccccCCC-ccchHHHHHHHHHHHHHHHHHHh--hcccCCCCCCC Q lcl|Aclame:pro 290 EAKGAV---V-----DDYQDAEMGDYVPG--GAEAVRAYERGD-YNKMAAIQQSLQAVVVRLNQAFM--YGANQRDAERV 356 (510) Q Consensus 290 ~~g~~~---~-----~~~~~~~~G~~~~g--~~~~v~~~~~~~-~~~~~~~~~~i~~~~~~I~~af~--~~~~~~~~~~v 356 (510) +++... . .......+|.+++. ...+++...... ..++. .-...+...|..++= +..+..|-. . T Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~pG~iv~~L~pGe~i~~~~p~~p~~~~~---~f~~~~lr~IAaglGipYe~ltgD~s-~ 356 (548) T protein:vir:95 281 GNPDSYTVEPGKDRKNRTIPIAPGMVFDDLEPGEDVGMIESNRPNPFLE---GFRNGQLRMIGAGTRSTYSSVSRAYD-G 356 (548) T ss_pred CCCccccCCCCcccccccccccCCccccccCCCceeeecCCCCCCCCHH---HHHHHHHHHHHhhcCCCHHHHhcccc-h Confidence 211110 0 11111224544332 123455544332 23443 222333344444441 123344432 2 Q ss_pred CHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCcc-----ceeeEEeec----HHHHHHHHHH-H Q lcl|Aclame:pro 357 TAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITK-----QHKPAIETG----LPALSRSAAV-Q 426 (510) Q Consensus 357 TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~-----~~~~~~vs~----l~~l~r~~~~-~ 426 (510) |=.=+++-..|....+...=..+...|+.|+..+.+..+.-.|..++|.. .++...+.+ ++++--++.. . T Consensus 357 nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~ 436 (548) T protein:vir:95 357 TYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWEL 436 (548) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHH Confidence 43344444444444444433445566777887777776655555555432 234444322 3443332211 1 Q ss_pred HH----HHHHHHHHhhcChHhHhhcCCHHHHHHH------HHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHH--- Q lcl|Aclame:pro 427 SM----LNASQVIAGLAPIAQLDPRISLPKMMDT------IWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQ--- 493 (510) Q Consensus 427 ~~----~~~~q~~~~~~~~~q~~~~id~d~~~~~------~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~--- 493 (510) .+ .+..+.++..+ .|+++.++. .++.+|++...--+..-. ....+..+..+++.. T Consensus 437 ~i~~Gl~T~~~~~a~~G--------~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~ 504 (548) T protein:vir:95 437 LVKAGFADEAEVARARG--------RDPRELKKSRETEIKANRAAGLVFSSDAYHQLV----KSGMDPVEAVQKVYLGVG 504 (548) T ss_pred HHHcCCCCHHHHHHHhC--------CCHHHHHHHHHHHHHHHHHcCCCCCCccccccc----ccccCCCCchhhhccccc Confidence 11 11112222222 233333332 334556553211111000 000000000000000 Q ss_pred -HHHHHHHHH-hhcccCCC Q lcl|Aclame:pro 494 -ETLLEGASD-MTNALAGV 510 (510) Q Consensus 494 -~~~~~~a~~-~~~~~ag~ 510 (510) ++..-.+-+ ...-.||. T Consensus 505 ~~~~~~~~~~~~~~~~~~~ 523 (548) T protein:vir:95 505 KMLTADEARELVNRYGAGL 523 (548) T ss_pred cccccchhHHhhccCCCCC Confidence 000000111 11233443 No 146 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=65.08 E-value=0.29 Score=23.54 Aligned_cols=443 Identities=11% Similarity=0.003 Sum_probs=147.8 Q ss_pred ChhHHHHHHHHHh----------------------------ccC-chHHHHHHHHhhcccccCCCCCCccccc---cccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR----------------------------DGS-VEQRAIEFAKTTLPYLMVDPMSGSRGVV---EHDF 48 (510) Q Consensus 1 ~k~~~~~r~~~lk----------------------------r~~-~~~~w~e~~~~~~P~~~~~~~~~~~~~~---~~~~ 48 (510) =|+.+.++|..+- |+. +.+...+..+...|+++.... .+.... +... T Consensus 39 ~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ki~~~~~~~~~~~l~s~Lm~~~~-p~~~wf~~~p~~~ 117 (641) T protein:vir:94 39 KRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRHRINTGHTFEVVETLVAYFKGATF-PSDDWFDLKGMVP 117 (641) T ss_pred hhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcccccccchhHHHHHHHHhhHHhhhhc-CCCceEEEecCCC Confidence 3334444454331 111 333445666666666533211 111111 0111 Q ss_pred -cchHHHHHHHHHHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCC------HHHH Q lcl|Aclame:pro 49 -QSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNAS------LAVL 121 (510) Q Consensus 49 -dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf------~~~~ 121 (510) |..+++.++. .+...+. -. .|++.++..+...+...|. .... T Consensus 118 ed~~~A~~~~~----~~~~~l~-~~--------------------------~~~~~~~~~~~d~~~~g~~iv~~~w~~~~ 166 (641) T protein:vir:94 118 ELADAARVVKQ----LTKTKLE-AA--------------------------SIRDIFETYVRNLVLYGVSTYRLGWDTSM 166 (641) T ss_pred ChHHHHHHHHH----HHHHHHh-hc--------------------------chHHHHHHHHHHHhhcCceEEEeehhhHH Confidence 1122221111 1111111 01 1233333344444444432 2222 Q ss_pred HHHHHHHHh-hCce------------------------EEEEeCCCC----eEEEEE-----eceEEEeeCCCC--ceeE Q lcl|Aclame:pro 122 TQVIKLLIV-TGNA------------------------LLYRNSDEA----TVVAWS-----LRSYAVRRDATG--RWMD 165 (510) Q Consensus 122 ~~~~~~l~~-~G~~------------------------~l~~~~~~~----~~~~~p-----l~~~~v~~d~~G--~v~~ 165 (510) .+...+.-+ +|.. -+|.|+... .|..+. +.+. +.+...| .++. T Consensus 167 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~dps~~~~~~~f~~~r~t~~t~~~l-~~eg~~~~d~v~~ 245 (641) T protein:vir:94 167 ERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWLDTSGGKNTGTFVRLRHTREELHEL-VTSGYYDLDLTQV 245 (641) T ss_pred HHhhhhhcccchhhcccccccceecccceeeEEecchhheeecCCCCcccccceehhhhHHHHHHH-HhcCCCChhhcch Confidence 222222111 1110 124443221 122211 1111 0110011 1110 Q ss_pred E-EEEEEecHHHHhHHh---hH---Hhhccccc-CCCCceEEEEEEE------EeecC----CCeeEEEEEEeeCCeeec Q lcl|Aclame:pro 166 I-VLKQRYKSKDLDDVY---KQ---DLMRAGRN-LSGSGSVDLYTHV------QRRKG----TAMDYAEMYHEIDGVRVG 227 (510) Q Consensus 166 i-~r~~~~t~~~l~~~~---~~---~~~~~~~~-~~~~~~v~v~~~v------~~~~~----~~~~~~sv~~e~~~~~~~ 227 (510) . ...++.+-.+-..++ .. .+...+.. ...+..+.-||.. .+..+ ..+||....|.... T Consensus 246 ~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~Pf~~~r~~~~~---- 321 (641) T protein:vir:94 246 EQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVTTTLLPDR---- 321 (641) T ss_pred hhcccccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeCCEEeecccccccCcCCeEEecceecC---- Confidence 0 001111100000000 00 01000000 0111122222221 11111 23577665554222 Q ss_pred cccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHH-HHHHHH-----HHhhCCc-eeeCCCCccchhhhh Q lcl|Aclame:pro 228 ETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSE-KLGLYE-----LESLEVL-NLVDEAKGAVVDDYQ 300 (510) Q Consensus 228 ~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~-~~l~~~-----~~a~~~~-~lv~~~g~~~~~~~~ 300 (510) .+.|+. -|. +. .-|....|-.+.....-.- ..+..- ....+|. +-..|+|++...... T Consensus 322 -~~~YG~--gp~----------~~--~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l~~~PG~ii~~~~~~ 386 (641) T protein:vir:94 322 -DSVYGM--SVL----------HP--NLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDVKAKPGAVFKVAQHG 386 (641) T ss_pred -CcccCC--ChH----------HH--HHHHHHHHHHHHHHHHHHHHHHhCCeeeeccccccccceeeccCCcceeeCCCC Confidence 122321 111 00 1133334433332221111 111111 1112333 223455554322111 Q ss_pred cCCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHh------hcccCCCCCCC--CHHHHH--------HH Q lcl|Aclame:pro 301 DAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM------YGANQRDAERV--TAEEVR--------IT 364 (510) Q Consensus 301 ~~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~------~~~~~~~~~~v--TAtEi~--------~r 364 (510) ..+-+.+|..+. ...++.+...-..++.+..-.++ -++....+.-| -..|.. .- T Consensus 387 --~v~pl~~~~~~~--------~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l 456 (641) T protein:vir:94 387 --SLQPIDMGRQDF--------VVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHI 456 (641) T ss_pred --cceeecCCcccc--------chhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHH Confidence 011122333221 11223333333344444443332 22221121111 122221 12 Q ss_pred HHHHHH-HhhhhHHHHHHHHHHHHHHHHHHH----HhhcCC-CCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 365 AEEAEN-TLGGTYSLLAENLQSPLAYVCLSE----VDDALL-QGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGL 438 (510) Q Consensus 365 ~~E~~~-~LGpv~~rl~~E~l~Pli~r~~~i----l~~~~l-~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~ 438 (510) .+|... +|+++|++++.+++.|++.|+|++ -.--.+ |.--...+++..+.....+.+++.++++.++++.+++. T Consensus 457 ~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~~~~iv~l~~~q~~~~~~~i~~l~~~~~~~a~~ 536 (641) T protein:vir:94 457 EDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLHYPYKFLALGANYVVERERMVTDLLQLLDISGRV 536 (641) T ss_pred HHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCccceeeeeeEeecchhHHHHHHHHHHHHHHHHHHhhcC Confidence 345554 799999999999999999999985 211111 11123466767788889999999999999999998866 Q ss_pred cChHhH----------hhc--C-CHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHH-------HHHHHHHHHHHHH Q lcl|Aclame:pro 439 APIAQL----------DPR--I-SLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQ-------AAQAQAAQETLLE 498 (510) Q Consensus 439 ~~~~q~----------~~~--i-d~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qq-------a~~~~~a~~~~~~ 498 (510) .++.+. +.. + ++...++. .+ -++....--+.+.++...+++|. ++++.+.+. .++ T Consensus 537 P~v~d~~d~~~~~~~~~~~~g~~~p~~~ir~-~~---~~~~~~~~~~~~~q~~~~~~a~~~~~~~~~~a~~~~~~~-~~~ 611 (641) T protein:vir:94 537 PQIGQSLDYALILEDLLRQMRFTDPMRYIKK-AE---APPAAPPIAPAEPGALPPEMMNSVGGGLNDQAIAGMTPE-DVS 611 (641) T ss_pred hhhhhcCCHHHHHHHHHHHhCCCCchhhccC-cc---CchhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHhhHH-HHH Confidence 654221 111 1 22222221 11 11111110111111111111110 111222222 222 Q ss_pred HHHHhhcccCCC Q lcl|Aclame:pro 499 GASDMTNALAGV 510 (510) Q Consensus 499 ~a~~~~~~~ag~ 510 (510) ++..+.+.+.|= T Consensus 612 ~~~~~~~~~~~~ 623 (641) T protein:vir:94 612 DLASRIGIDTSD 623 (641) T ss_pred HHHHhhcCCchh Confidence 222221111111 No 147 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=64.51 E-value=0.3 Score=23.46 Aligned_cols=237 Identities=10% Similarity=-0.015 Sum_probs=92.1 Q ss_pred HHHHHHH-hccCchHHHHHHHHhhcccccCCCCCCccccc--cc-cccchHHHHHHHHHHHHHHhhcCccCcccccCCCh Q lcl|Aclame:pro 6 AMLWEKL-RDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVV--EH-DFQSAGALLVNNLAAKLARSLFPTGIPFFRSELTD 81 (510) Q Consensus 6 ~~r~~~l-kr~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~--~~-~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d 81 (510) =..|.+. +|+. ...+.....+.- .++.-........ .. .-.++--.|++.+|+.+.+. ||.-.. .. T Consensus 1 MglF~~~~~r~~-~~~~~~~~~~~~--~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~iA~l------p~~~~~-~~ 70 (251) T protein:vir:46 1 MGIFYKNEKRDL-QYNEDDLQMMVQ--TLPSFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARM------PIRVTV-NG 70 (251) T ss_pred CCcccccccccc-CCCccchhhhhh--hhccccCcCcceechhhhhccHHHHHHHHHHHHhHhhC------ceEEee-Cc Confidence 2222221 1211 111111111100 0000000000000 01 12233334555555555443 443221 11 Q ss_pred hhhhhhccCchHHHHHHHHHHHHHHHHHHHH-HhcCCHH----HHHHHHHHHHhhCceEEEEeCCC-C-eEEEEEe--ce Q lcl|Aclame:pro 82 AIRREADSRDTDITEVTAALARVDRKATQRL-FQNASLA----VLTQVIKLLIVTGNALLYRNSDE-A-TVVAWSL--RS 152 (510) Q Consensus 82 ~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~----~~~~~~~~l~~~G~~~l~~~~~~-~-~~~~~pl--~~ 152 (510) ... . +.-+...| .+-|-+. -+.....++..+||+.+|+..+. + ....+|| .. T Consensus 71 ~~~------------~-------~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~ 131 (251) T protein:vir:46 71 QIN------------Y-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSE 131 (251) T ss_pred ccc------------c-------cchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCce Confidence 100 0 11112223 3444443 34455667788999998876543 2 2444555 55 Q ss_pred EEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeecccccc Q lcl|Aclame:pro 153 YAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRW 232 (510) Q Consensus 153 ~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y 232 (510) +-+..|.+|++- | .+ ...++........| T Consensus 132 v~v~~~~~g~~~--~------------------------------------------------~~-~~~~~~~~g~~~~~ 160 (251) T protein:vir:46 132 IELKSDARGRLY--Y------------------------------------------------FH-QRIDSNGNNIERNV 160 (251) T ss_pred EEEEECCCCcEE--E------------------------------------------------EE-EEeccCCcceeEEE Confidence 656666666321 0 00 00000000000111 Q ss_pred ccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCc-cchhhhhcCCCcceecCC Q lcl|Aclame:pro 233 PIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKG-AVVDDYQDAEMGDYVPGG 311 (510) Q Consensus 233 ~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~-~~~~~~~~~~~G~~~~g~ 311 (510) ..++ .+..|+...+| .||.||...+...+...+...+.......-...|..++.-++. .+++. T Consensus 161 ~~~d--iiH~r~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~~e~------------- 224 (251) T protein:vir:46 161 KFED--MLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKA------------- 224 (251) T ss_pred CCcc--EEEecCcCCCC-eeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCCCCHHH------------- Confidence 1111 34445443334 7999999999999999888888777776665555544432222 22221 Q ss_pred ccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCHHH Q lcl|Aclame:pro 312 AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAERVTAEE 360 (510) Q Consensus 312 ~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~vTAtE 360 (510) .+.+++++.+.+--. .....-.+--.| T Consensus 225 ---------------------~~~~~~~~~~~~~g~-~n~g~~~~gm~~ 251 (251) T protein:vir:46 225 ---------------------RDRAREEFPKVLVEL-NKLGKLSYSMNQ 251 (251) T ss_pred ---------------------HHHHHHHHHHHhcCc-ccccccccccCC Confidence 122222222222100 000000000000 No 148 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=64.15 E-value=0.3 Score=23.41 Aligned_cols=344 Identities=9% Similarity=0.012 Sum_probs=130.5 Q ss_pred ChhHHHHHHHHHhc--cCchHHHHHHHHhhcccccCCCCCCccc-cccccc-cchHHHHHHHHHHHHHHhhcCccCcccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLRD--GSVEQRAIEFAKTTLPYLMVDPMSGSRG-VVEHDF-QSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~k~~~~~r~~~lkr--~~~~~~w~e~~~~~~P~~~~~~~~~~~~-~~~~~~-dstg~~a~~~Laa~l~~~ltpp~~~WF~ 76 (510) |+ .|+++++ ......-.....+..+..+.... .+.. ...+.. .++--.|++.+|+.+.+ + |+. T Consensus 1 M~-----~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~~~~al~~~~v~~~i~~ia~~ia~--~----p~~- 67 (386) T protein:vir:49 1 MP-----IFNITNLATESPPINQESFFDIADSDFLASLN-SSEWVSAENALKNSDLFSIISQLSNDLAT--A----KIT- 67 (386) T ss_pred Cc-----hhhhhccCCCCcccchhhhhhhhhcccccccc-CCceechhhhhccHHHHHHHHHHHHHhhh--C----cee- Confidence 43 3555532 21111111112222222111111 1100 000111 23333455555554433 2 221 Q ss_pred cCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcC----CHHHHHHHHHHHHhhCceEEEEeCCC-C-eEEEEEe Q lcl|Aclame:pro 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNA----SLAVLTQVIKLLIVTGNALLYRNSDE-A-TVVAWSL 150 (510) Q Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~sn----f~~~~~~~~~~l~~~G~~~l~~~~~~-~-~~~~~pl 150 (510) +-+.. ... .+.+-| .+.-....+.++...||+.+++..+. + ....+|+ T Consensus 68 --~~~~~-------------~~~-----------l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i 121 (386) T protein:vir:49 68 --TSRKQ-------------LQG-----------IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYL 121 (386) T ss_pred --eccch-------------hhh-----------hhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEe Confidence 11111 000 122223 34445666778889999998875432 2 2333444 Q ss_pred --ceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeecc Q lcl|Aclame:pro 151 --RSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGE 228 (510) Q Consensus 151 --~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~ 228 (510) ..+-+..+.+|.. .++.++. .+..... T Consensus 122 ~~~~v~v~~~~~~~~-------------------------------------------------~~y~~~~--~~~~~~~ 150 (386) T protein:vir:49 122 RPSQVSFNRLDNQNG-------------------------------------------------LYYNITF--DDPHIAP 150 (386) T ss_pred cCceeEEEEcCCCce-------------------------------------------------EEEEEEE--cCccccc Confidence 3444444443321 1111111 0101111 Q ss_pred ccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhh--------- Q lcl|Aclame:pro 229 TGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDY--------- 299 (510) Q Consensus 229 ~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~--------- 299 (510) ...++.++ .+..|+....+..||.||..-+...+.......+.......-...|..++.-++.+.++.. T Consensus 151 ~~~~~~~e--vih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~ 228 (386) T protein:vir:49 151 KQHVPQND--ILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDFKTKVSRSRQA 228 (386) T ss_pred eeEEcccc--EEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHHHHHHHHHHHH Confidence 11111122 4555666666889999999999999999998888888887777778776643344443221 Q ss_pred -hcCCCcc-eecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccCCC-CCCCCHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 300 -QDAEMGD-YVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRD-AERVTAEEVRITAEEAENTLGG 374 (510) Q Consensus 300 -~~~~~G~-~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~-~~~vTAtEi~~r~~E~~~~LGp 374 (510) .....+. +++++. .+.++... ..+.+. .+..+..+..|..+|-.. .+..+ ...-+++.+. +-....+-| T Consensus 229 ~~~n~g~~~vl~~g~-~~~~l~~~-~~d~~~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~---~~~~~~i~~ 302 (386) T protein:vir:49 229 MKQMQGGPLVLDDLE-DFTPLEIK-SNVAQL-LSQADWTTGQFAKVYGIPESIVGGDGDQQSSLEMIY---NIYFKSVSR 302 (386) T ss_pred hccCCCCceecCCCc-eEEEccCC-hhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCccchHHHHH---HHHHHHHHH Confidence 1111111 222222 33343322 234443 445677888899998432 22222 2222333222 122334445 Q ss_pred hHHHHHHHHHHHHHHHH-HHHHhhcCCCCCCcc--------ceeeEEeecHHHHHHHHH----HHHHHHHHHHHHhhcCh Q lcl|Aclame:pro 375 TYSLLAENLQSPLAYVC-LSEVDDALLQGLITK--------QHKPAIETGLPALSRSAA----VQSMLNASQVIAGLAPI 441 (510) Q Consensus 375 v~~rl~~E~l~Pli~r~-~~il~~~~l~~~p~~--------~~~~~~vs~l~~l~r~~~----~~~~~~~~q~~~~~~~~ 441 (510) .+..+..++-.-|...+ |++ ..+...... .++.-+.|+-.......+ ...+.... ....+. T Consensus 303 ~l~~i~~~~~~~l~~~~~~~~---~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~~~~~~~---~~~~~~ 376 (386) T protein:vir:49 303 YLRPFVSEMSKKLSCEVDVDI---SPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILPKELPDGK---NPNRTS 376 (386) T ss_pred HHHHHHHHHHHHhcchhcccc---hhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCCcCcchh---ccCCCC Confidence 55544444322221110 000 000000000 000001111110000000 00000000 000111 Q ss_pred ---HhHhhcCC Q lcl|Aclame:pro 442 ---AQLDPRIS 449 (510) Q Consensus 442 ---~q~~~~id 449 (510) .+. +.=| T Consensus 377 ~~gGd~-~~~~ 386 (386) T protein:vir:49 377 LKGGEI-NEQD 386 (386) T ss_pred CCCCCC-CCCC Confidence 011 1111 No 149 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=61.26 E-value=0.36 Score=23.04 Aligned_cols=408 Identities=12% Similarity=-0.039 Sum_probs=138.6 Q ss_pred cccc--ccchHHHHHHHHHHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHH--HHHH-HHHhcCCH Q lcl|Aclame:pro 44 VEHD--FQSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDR--KATQ-RLFQNASL 118 (510) Q Consensus 44 ~~~~--~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~--~~~~-~l~~snf~ 118 (510) +..+ -+++.-.|++.+|..+.+ .||. +......... .........+..+|...+- .+.. .+....+. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~------~p~~-i~~~~~~~~~-~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~ 72 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAG------FGIN-IIPHPEAEDP-DRDGEQYERVWDFWFGDDSNWQVGPMESERATAT 72 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhc------CCeE-EEEccCcccc-cchhhhhhhHHHHhhccCCCccccchhhHhhHHH Confidence 3333 245666777777777753 2332 2111100000 0000000111111110000 0000 01122345 Q ss_pred HHHHHHHHHHHhhCceEEEEeCCC--CeEEEEEeceEE--EeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCC Q lcl|Aclame:pro 119 AVLTQVIKLLIVTGNALLYRNSDE--ATVVAWSLRSYA--VRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSG 194 (510) Q Consensus 119 ~~~~~~~~~l~~~G~~~l~~~~~~--~~~~~~pl~~~~--v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~ 194 (510) .-+...+.|+..+||+.+++..+. ..+..+||..-+ +..|..+.+. .+ ... T Consensus 73 ~~~~~~~~~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~~~~-~~------------------------~~~ 127 (467) T protein:vir:31 73 NVLQTAWTDYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERGFVQ-LL------------------------EEK 127 (467) T ss_pred HHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecceeEe-ec------------------------CCc Confidence 566778888999999998876543 335666664333 3333222111 00 000 Q ss_pred CceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 195 SGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKL 274 (510) Q Consensus 195 ~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~ 274 (510) ...+.++...+..+..+ ...-.+.+...........++.++ .+..|.....+..||.+|..-++..+.......+.. T Consensus 128 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~d--iih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 204 (467) T protein:vir:31 128 EKYFGVAGDRYQTNGNG-DLDPVFVDADDGSTGTSVSNPANE--LIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYN 204 (467) T ss_pred eeeEEeccccceeeccc-ceeeeeeeeccccccceeEecccc--EEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHH Confidence 11111111111111111 111122221111111111222122 455566666678999999999998887777766666 Q ss_pred HHHHHHhhCCceeeC-CCCccchhhhhcCCC-------c------------------ceecCCcc--c--cccccCC--C Q lcl|Aclame:pro 275 GLYELESLEVLNLVD-EAKGAVVDDYQDAEM-------G------------------DYVPGGAE--A--VRAYERG--D 322 (510) Q Consensus 275 l~~~~~a~~~~~lv~-~~g~~~~~~~~~~~~-------G------------------~~~~g~~~--~--v~~~~~~--~ 322 (510) .....-...|..++. +++.++++....... | .+++++.. . +...++. . T Consensus 205 ~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~ 284 (467) T protein:vir:31 205 IDFFENDGVPRIAIIVKGAELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGI 284 (467) T ss_pred HHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccC Confidence 655555556655543 455665554322110 0 01111111 0 0000110 1 Q ss_pred ccchHHHHHHHHHHHHHHHHHHhhc--c--cCCCCCCCC-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 323 YNKMAAIQQSLQAVVVRLNQAFMYG--A--NQRDAERVT-AEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD 397 (510) Q Consensus 323 ~~~~~~~~~~i~~~~~~I~~af~~~--~--~~~~~~~vT-AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~ 397 (510) ..+.+ ..+..+.....|..+|-.. . ...++..-| +++... .=....|.|.+.++..++-.-|+.+... T Consensus 285 ~~d~q-f~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~s~~e~~~~--~f~~~~l~P~~~~ie~~ln~~l~~~~~~---- 357 (467) T protein:vir:31 285 DEEAS-FLEFRGRNEHDILKVHDVPPVIAGVVESGAFSTDAEEQRK--EFAEETIQPKQHDFGELLYELVHKQGLD---- 357 (467) T ss_pred hhhHH-HHHHHHHHHHHHHHHhCCCHHHcccCCCCCcccCHHHHHH--HHHHHHHHHHHHHHHHHHHHhhcchhhc---- Confidence 11222 2334455667788888332 1 111221112 222222 2234446666666655544333221111 Q ss_pred cCCCCCCccceeeEEeec--HHHHHHHHHHHHHHH--H--HHHHHhhcChHhHhhcC-CHHHHHHHHHHHcCCCHhhccC Q lcl|Aclame:pro 398 ALLQGLITKQHKPAIETG--LPALSRSAAVQSMLN--A--SQVIAGLAPIAQLDPRI-SLPKMMDTIWAAFSVDTSQFYK 470 (510) Q Consensus 398 ~~l~~~p~~~~~~~~vs~--l~~l~r~~~~~~~~~--~--~q~~~~~~~~~q~~~~i-d~d~~~~~~a~~~Gvp~~~i~~ 470 (510) ....-++...... .+...|+.-...+.. + ...+-..-+.+.+.+.. ...........+--.|.. . T Consensus 358 -----~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi~d~~~~~~~~~~~~~~~~~~~~~---~ 429 (467) T protein:vir:31 358 -----APDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDEFGFEPFPEEHVYGGETLVAEVTGGSGPGG---G 429 (467) T ss_pred -----cCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcccccCCcccccccccccCCCC---c Confidence 0111122222221 223333332222211 0 01111111111111100 000000000000000000 0 Q ss_pred CHHHHHHH----HHH--HHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 471 SADELQAE----AEE--QRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 471 s~ee~~~~----~~~--~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ++++..+. .++ ...++.... +.....|+ +|-- T Consensus 430 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~------~~~~ 467 (467) T protein:vir:31 430 IGDQIEQLVEDRADEIIDSYQADLET--EQLIEIGA------NADS 467 (467) T ss_pred ccCcCCCCCCCcccchHhhhhhcccc--chhhhhcc------ccCC Confidence 00000000 000 000000011 01111111 1111 No 150 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=50.17 E-value=0.62 Score=21.73 Aligned_cols=400 Identities=14% Similarity=0.069 Sum_probs=147.0 Q ss_pred HHHHHHHh-cc------CchH-HHH--HHHHhhcccccCCCCCCccccc--cccc-cchHHHHHHHHHHHHHHhhcCccC Q lcl|Aclame:pro 6 AMLWEKLR-DG------SVEQ-RAI--EFAKTTLPYLMVDPMSGSRGVV--EHDF-QSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 6 ~~r~~~lk-r~------~~~~-~w~--e~~~~~~P~~~~~~~~~~~~~~--~~~~-dstg~~a~~~Laa~l~~~ltpp~~ 72 (510) =..|..|. |. ..+. .|. +-+-+.+ +.. . .++... .... .++--.|++.+|+.+.+. T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~~---~~~-~-~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~l------ 69 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYNL---GAV-A-ASGETVTPHDALQVSAVFASVRLLSETIATL------ 69 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHhh---ccc-c-cCCceechHHhhccHHHHHHHHHHHHhhccC------ Confidence 12233331 11 1111 110 0011110 000 0 000100 0111 233345566666665542 Q ss_pred cccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhc----CCHHHHHHHHHHHHhhCceEEEEeCCCCe-EEE Q lcl|Aclame:pro 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQN----ASLAVLTQVIKLLIVTGNALLYRNSDEAT-VVA 147 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~~l~~~G~~~l~~~~~~~~-~~~ 147 (510) ||.-..-.+... .++. ...+...+... +.+.-+...+.++..+||+.+++..+.++ ... T Consensus 70 p~~~~~~~~~~~----------~~~~------~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~~g~~~~l 133 (457) T protein:vir:13 70 PLSTYSKRGGSR----------KEIV------TPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQGPNIVGL 133 (457) T ss_pred ceEEEEecCCcc----------cccc------cchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEE Confidence 332111111110 0111 11223334432 23445667777888999999887655443 334 Q ss_pred EEe--ceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCee Q lcl|Aclame:pro 148 WSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVR 225 (510) Q Consensus 148 ~pl--~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~ 225 (510) +|| ..+.+..+..+... ...|..+.+..+|.. T Consensus 134 ~~l~p~~v~v~~~~~~~~~----------------------------------------------~~~~~~y~~~~~~~~ 167 (457) T protein:vir:13 134 DVLDPTKIHVHMVMVDGLR----------------------------------------------RKVFEAYDIDADGNE 167 (457) T ss_pred EEEccCceEEEEecCCCcc----------------------------------------------ceeEEEEEEecCCce Confidence 454 23333332222110 011111112222211 Q ss_pred eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCC-- Q lcl|Aclame:pro 226 VGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE-- 303 (510) Q Consensus 226 ~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~-- 303 (510) .. ...|..+ -.+..|+....+..||.||...+...+.....+.+.......-...|..++.-++.+.++...... T Consensus 168 ~~-~~~~~~~--diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~ 244 (457) T protein:vir:13 168 VL-LGWFTPR--DVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMSEEGLARAREA 244 (457) T ss_pred ee-EEeeCcc--ceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCCHHHHHHHHHH Confidence 11 1111111 245556666667789999999999999998888888887777777787777666666665433221 Q ss_pred -----Cc-------ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--cc-CCCCCCCCHHHHHHHHHH- Q lcl|Aclame:pro 304 -----MG-------DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--AN-QRDAERVTAEEVRITAEE- 367 (510) Q Consensus 304 -----~G-------~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~-~~~~~~vTAtEi~~r~~E- 367 (510) .| .+++++. +..++.+. ..+.+. .+..+..+..|.++|-.. ++ ..+....+..-+.+.... T Consensus 245 ~~~~~~g~~nag~~~vl~~g~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f 321 (457) T protein:vir:13 245 WRAANSGVDNAHRVALLTEGA-KFSKVAMS-PDEAQF-LQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAF 321 (457) T ss_pred HHHHhcCccccCcceecCCCc-eEEEccCC-hhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHH Confidence 11 1222221 22333321 234444 344456777888888332 11 111111222323333222 Q ss_pred HHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcC--hHhHh Q lcl|Aclame:pro 368 AENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAP--IAQLD 445 (510) Q Consensus 368 ~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~--~~q~~ 445 (510) ....|.|.+.++..+| .+..+++.......+++. +..|-|. +......+...+-..+- +-++. T Consensus 322 ~~~tl~P~~~~ie~~l------------n~~L~~~~~~~~~~i~fd--~~~l~~~-D~~~r~~~~~~~~~~G~~T~NE~R 386 (457) T protein:vir:13 322 TMFSLRPWLERIEAGF------------NRLLFAETADRFRFVKFN--LDEIKRG-APKERMELWSLGLQNGIYSIDEVR 386 (457) T ss_pred HHHHHHHHHHHHHHHH------------HHhhcCccccCceeEEee--chhhhcc-CHHHHHHHHHHHHhCCCcCHHHHH Confidence 2334556555555553 333333222222223321 2223332 11111111111111111 11222 Q ss_pred hcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--hcccCCC Q lcl|Aclame:pro 446 PRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDM--TNALAGV 510 (510) Q Consensus 446 ~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~--~~~~ag~ 510 (510) ..++.+-+=.-.++.+=+|. .+..-.+... .+.+. ++.+..+.+.+. ..+..|= T Consensus 387 ~~~gl~Pi~~g~~d~~~~~~-n~~~~~~~~~------~~~~~----~~~~~~~~~~~~~~~~~~~g~ 442 (457) T protein:vir:13 387 AAEDMTPLPDGLGEKYRVPL-NLGEVGEEPE------PEPAP----APPAIEPPAEEPDEEPEPEGK 442 (457) T ss_pred HHhCCCCCCCCcccceeecc-cccccccccc------ccccC----CCCCCCCCccccCCCCCCCCC Confidence 22222111111111111221 1111111000 00000 000000000000 0011111 No 151 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=49.93 E-value=0.62 Score=21.71 Aligned_cols=431 Identities=8% Similarity=-0.043 Sum_probs=159.9 Q ss_pred ChhHHHHHHHHH------------------hccCchHHHHH------------HHHhhcccccCCCCCCcccccccc--c Q lcl|Aclame:pro 1 MKSTAAMLWEKL------------------RDGSVEQRAIE------------FAKTTLPYLMVDPMSGSRGVVEHD--F 48 (510) Q Consensus 1 ~k~~~~~r~~~l------------------kr~~~~~~w~e------------~~~~~~P~~~~~~~~~~~~~~~~~--~ 48 (510) ...+-....-.+ .+++.-..... +..|..+..|+ + ..+... . T Consensus 39 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~-~~l~a~Y~~ 112 (537) T protein:vir:10 39 RAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFI-----G-HQMCALIAT 112 (537) T ss_pred HHHhhhhccCCCCCccCcccccccccccchhccccccchhhhhhhccccccchhhhhccccCCc-----c-HHHHHHHHh Confidence 000000001111 11110000000 00011111010 0 001111 1 Q ss_pred cchHHHHHHHHHHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHH Q lcl|Aclame:pro 49 QSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLL 128 (510) Q Consensus 49 dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l 128 (510) ...+-.+|++.|..+ -+.|+.+...+....+ .+. .+.+.+.+.+-+++..+.++++.. T Consensus 113 ~~l~r~iVd~~A~d~-------~r~~~~i~~~~~~~~~--------~~~-------~~~l~~~~~~l~~~~~l~~a~~~~ 170 (537) T protein:vir:10 113 HWLVNKACSQMPRDA-------MRKGYKIISDDGNELD--------PKD-------AKFIDRYDRAFNIKKHAIQFVRKG 170 (537) T ss_pred CchhhhhhhhhhHHh-------hcCCceeecCCccccc--------HHH-------HHHHHHHHHHhhHHHHHHHHHHhc Confidence 233344444444433 3688888765432111 111 123334455558899999999998 Q ss_pred HhhCceEEEEeC--CCCeEEEEEeceEEEeeCCCCceeEE--EEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEE Q lcl|Aclame:pro 129 IVTGNALLYRNS--DEATVVAWSLRSYAVRRDATGRWMDI--VLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHV 204 (510) Q Consensus 129 ~~~G~~~l~~~~--~~~~~~~~pl~~~~v~~d~~G~v~~i--~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v 204 (510) -.+|.+.+++.- +++..-.=||.--.| ..|.+..+ +-++..++ .++..+-.+..+.. -.+| +.|. T Consensus 171 rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i---~kg~~k~l~vidp~~~~~-~~~~~~~~dp~sp~-fg~P----~~y~-- 239 (537) T protein:vir:10 171 RIFGIRIALFKVDSPDPYYYEKPFNIDGV---MPGAYKGIVQIDPYWCAP-LLDAQASSNPVSMH-FYEP----TYWL-- 239 (537) T ss_pred ccccceEEEEeecCcCCcccccccccccc---cccceeEEEEechhhccc-ccchhhhccCCccc-cCCc----eeee-- Confidence 888887666542 222222223311111 12222211 11111111 00111111111000 0011 1121 Q ss_pred EeecCCCeeEEEEEEeeCCeeecccccccc--ccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 205 QRRKGTAMDYAEMYHEIDGVRVGETGRWPI--HLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESL 282 (510) Q Consensus 205 ~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~--~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~ 282 (510) +.|+.+....-..+ ...|+ +.+....-||++..+.++..++..........+...++. T Consensus 240 ----------------v~g~~iH~SRli~f~g~~~p~----~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~ 299 (537) T protein:vir:10 240 ----------------INGKKYHRSHLAIYINDEVVD----FLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKR 299 (537) T ss_pred ----------------ecCeEecceeEEEecCCCCch----hhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 12222211111111 12233 233334456999999999999999988888777776666 Q ss_pred CCceeeCCCC-ccchhhhhc---------CCCcceecCCc-cccccccCCCccchHHHHHHHHHHHHHHHHHHh--hc-c Q lcl|Aclame:pro 283 EVLNLVDEAK-GAVVDDYQD---------AEMGDYVPGGA-EAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM--YG-A 348 (510) Q Consensus 283 ~~~~lv~~~g-~~~~~~~~~---------~~~G~~~~g~~-~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~--~~-~ 348 (510) -..+-++-.. +.+.+.+.. ...|.++-+.. ..+..+. .++..+...+....+.|.-++= +. + T Consensus 300 ~~v~k~~~~~~l~~~~~~~~r~~~~~~~r~n~g~~~id~e~e~~e~~~----~~lsgl~~~l~~~~~~iAa~~~IP~t~L 375 (537) T protein:vir:10 300 QTVLKVDAAQVLANKQQFDETMSWWTATRDNYQVRVVDKDNEDVVQID----TTLNDLDKVIMNQYQLVCAIARTPAPKM 375 (537) T ss_pred CceeeechHHhhcCHHHHHHHHHHHHhhcCCcceeEecCCCceeEEEe----ccCCCHHHHHHHHHHHHHhhhCCCceee Confidence 5555443111 111222211 11233443332 3333322 2333345666777777777651 11 2 Q ss_pred -cCC-CCCCCCHH-HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHH Q lcl|Aclame:pro 349 -NQR-DAERVTAE-EVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAV 425 (510) Q Consensus 349 -~~~-~~~~vTAt-Ei~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~ 425 (510) .+. .|-.-|.+ ++..= ---+..++.+ +.|++++++.++.+..+-+.+ ++.+++- +|..+.-..++ T Consensus 376 ~G~sp~GlnatGe~D~~~y--------yd~I~~~Qe~-l~p~l~~l~~ll~~~~~~~~~--~~~i~f~-pL~~~s~kEkA 443 (537) T protein:vir:10 376 LGTVPTGFNSTGDYEEASY--------HEECESTQDD-MRPLIDRHHQLVCRSHLRKRI--RVKVEFP-PMDAPKESERA 443 (537) T ss_pred ccCCccccccchhHHHHHH--------HHHHHHHHHH-HHHHHHHHHHHHHHhcCCCCc--ceEEEeC-CCCCCCHHHHH Confidence 222 22222333 22211 1123444544 688888888887765443322 3444432 33333333333 Q ss_pred HHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccC--CHHHHHHHH-HHHHHHHHHHHHHH-HHHHHHH- Q lcl|Aclame:pro 426 QSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYK--SADELQAEA-EEQRRQAAQAQAAQ-ETLLEGA- 500 (510) Q Consensus 426 ~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~--s~ee~~~~~-~~~~qqa~~~~~a~-~~~~~~a- 500 (510) +......++...+.+ ...|+.+++-+.+...-..+...|.. +++..+... +.+.+.......+. +...+++ T Consensus 444 ei~~~~a~a~~~~~~----~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (537) T protein:vir:10 444 DTFLKKMQAAKLAFE----MGAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDIDVDDEGKPVRIIEDQPAPSEMFGAT 519 (537) T ss_pred HHHHHHHHHHHHHHH----cCCCCHHHHHHHHhccCccccccccCCCChhhhhcccCCccCCcCCCCCCCCCccccCCCC Confidence 322222222222211 12477888888877642221122322 111111100 00000000000000 0000000 Q ss_pred --HHh--hcccCCC Q lcl|Aclame:pro 501 --SDM--TNALAGV 510 (510) Q Consensus 501 --~~~--~~~~ag~ 510 (510) ++. ...-+|- T Consensus 520 ~~~~~~~~~~~~~a 533 (537) T protein:vir:10 520 SSGESANDPRDSGA 533 (537) T ss_pred ccccccCCCccCcc Confidence 000 0011111 No 152 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=47.32 E-value=0.71 Score=21.42 Aligned_cols=348 Identities=11% Similarity=0.012 Sum_probs=134.2 Q ss_pred ChhHHHHHHHHHh---ccCchHHHHHHHHhhcccccCCCCCCccccccccc-cchHHHHHHHHHHHHHHhhcCccCcccc Q lcl|Aclame:pro 1 MKSTAAMLWEKLR---DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDF-QSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) Q Consensus 1 ~k~~~~~r~~~lk---r~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~-dstg~~a~~~Laa~l~~~ltpp~~~WF~ 76 (510) |- +-.++...+ +............+..... . ....+.. ... .++--.|++.+|+.+.+. || + T Consensus 1 Mg--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~v~~~---~al~~~~v~~~i~~ia~~ia~~------p~-~ 66 (385) T protein:vir:10 1 MG--LLTPRNFNKRKAKNMVYPSNPAFFTTTVGGM-Q-LSYVSAL---SALQNTNVYSVINRIASDVASA------HF-K 66 (385) T ss_pred Cc--cccchhcccccccccccccchhhhhhhcccc-C-ccccCHH---HhhccHHHHHHHHHHHHHHhhC------ce-e Confidence 32 111111111 1111111111112111110 0 0001111 112 233334455555554432 32 2 Q ss_pred cCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCC----HHHHHHHHHHHHhhCceEEEEeCCCCeEEEEEece Q lcl|Aclame:pro 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNAS----LAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRS 152 (510) Q Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf----~~~~~~~~~~l~~~G~~~l~~~~~~~~~~~~pl~~ 152 (510) + .+.. .. ..+.+-|- +.=...+..++..+||+.+++..+ .+..+|+.. T Consensus 67 v--~~~~-------------~~-----------~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~--~~~~~p~~~ 118 (385) T protein:vir:10 67 T--ENTA-------------TL-----------NRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ--NLEHIPNSD 118 (385) T ss_pred e--eccc-------------hh-----------hhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC--ceeEeecCC Confidence 2 1100 11 11333333 333445566788899999888654 344556533 Q ss_pred EEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeecccccc Q lcl|Aclame:pro 153 YAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRW 232 (510) Q Consensus 153 ~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y 232 (510) ..|....++ +..++.++...++... .| T Consensus 119 ~~v~~~~~~-------------------------------------------------~~~~~~~~~~~~~~~~----~~ 145 (385) T protein:vir:10 119 VQINYLPGN-------------------------------------------------MGIVYTVLESNDRPQM----VL 145 (385) T ss_pred ceEEEEEcC-------------------------------------------------CceEEEEEEcCCceEE----EE Confidence 322211111 0011111111111110 01 Q ss_pred ccccCceEEEeeeecC--CCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCC-CCccchhhhhcC------- Q lcl|Aclame:pro 233 PIHLCPYIVPTWNLAP--GEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE-AKGAVVDDYQDA------- 302 (510) Q Consensus 233 ~~~~~P~~~~Rw~~~~--ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~-~g~~~~~~~~~~------- 302 (510) ..++ .+..|....+ +..||.||...+...+.......+.......-...|.+++.- ++..+++..... T Consensus 146 ~~~e--iihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~ 223 (385) T protein:vir:10 146 RQDQ--MLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKA 223 (385) T ss_pred cccc--EEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHH Confidence 1111 3333432222 346899999999999999999988888888877888777753 344444332211 Q ss_pred ---C-Cc--ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccC-CCCCCCCHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 303 ---E-MG--DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQ-RDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 303 ---~-~G--~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~-~~~~~vTAtEi~~r~~E~~~~LG 373 (510) . .| .+++++ -++.++.. +..+.+.+.+..+.....|.++|-.. .+. .+.+.-|.+.+-+........|. T Consensus 224 ~~~~n~~~~~vl~~g-~~~~~l~~-~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~~~~~~l~ 301 (385) T protein:vir:10 224 NTGDNSGRLMVLPDG-FDYTQLEM-KTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATYLANLN 301 (385) T ss_pred hCccccCCccccCCC-ceEEecCC-ChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHHHHHHHHH Confidence 1 11 122222 23334433 22455655566677788899998432 111 23333333333222333344566 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeec--HHHHHHHHHHHHHHH--HH--H---HHHhhcChH-h Q lcl|Aclame:pro 374 GTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETG--LPALSRSAAVQSMLN--AS--Q---VIAGLAPIA-Q 443 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~--l~~l~r~~~~~~~~~--~~--q---~~~~~~~~~-q 443 (510) |.+.++.+++-. ..+.+ .++...... .+.-.|+.-.+.+.. ++ . .+-.+.+.| . T Consensus 302 P~~~~ie~~l~~------------~l~~~----~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~ 365 (385) T protein:vir:10 302 SYVNPIVDELRL------------KMNAP----DLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLPD 365 (385) T ss_pred HHHHHHHHHHHH------------hhCCc----eEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCCC Confidence 666666666532 21111 233222111 122333333332222 11 1 111112321 1 Q ss_pred HhhcCCHHHHHHHHHHHcCCCHhh Q lcl|Aclame:pro 444 LDPRISLPKMMDTIWAAFSVDTSQ 467 (510) Q Consensus 444 ~~~~id~d~~~~~~a~~~Gvp~~~ 467 (510) =.+.+......-. -|=.-+. T Consensus 366 ~~~~~~~~~~~~~----~g~~~dn 385 (385) T protein:vir:10 366 NLPEFKPLTTQVK----GGDEGDN 385 (385) T ss_pred CCccccCcccccC----CCCCCCC Confidence 1111000000000 0000000 No 153 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=46.80 E-value=0.72 Score=21.36 Aligned_cols=308 Identities=11% Similarity=0.036 Sum_probs=120.0 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccCCC Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELT 80 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~ 80 (510) |+++-.++=..-..++.+ .| -| -|+.+...-..+-. .+....-.+-.|++.=++ T Consensus 1 m~~~~~~~~~~~~~~~~~-------~~----~~--------------~~p~~~~~~~~~~~-~~~~~~~~~~~~~~pP~~ 54 (337) T protein:vir:78 1 MTKRQQQPAQAAASSPRP-------SV----VF--------------SMPEAIDPTAWMTD-YTGVFYNPYGEYYQPPID 54 (337) T ss_pred CCCcccCcccccccCcee-------EE----Ee--------------cCcccccCcchhHh-hhhhhhccCcceecCCCC Confidence 443222110000000000 00 00 01111100000000 111122334566654443 Q ss_pred hhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCC---HHHHHHHHHHHHhhCceEEEEeCCC--CeEEEEEeceEEE Q lcl|Aclame:pro 81 DAIRREADSRDTDITEVTAALARVDRKATQRLFQNAS---LAVLTQVIKLLIVTGNALLYRNSDE--ATVVAWSLRSYAV 155 (510) Q Consensus 81 d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf---~~~~~~~~~~l~~~G~~~l~~~~~~--~~~~~~pl~~~~v 155 (510) -..++++-........ .|. .......+.| +..+..+..|+.++||+.+++..+. .....+||..-++ T Consensus 55 ~~~La~l~~~~~~h~~---~L~-----~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~~rn~~G~~~~L~pl~~~~v 126 (337) T protein:vir:78 55 RKGLAKVARANAHHGA---ILM-----ARRNMVAGRFTNQRATITAFVHNYLQFGDGGLLKLRNSFGQVVGLHPLSSVYL 126 (337) T ss_pred HHHHHHHhhcchhhhh---HHH-----hhhccccccCcCcHHHHHHHHHHHHhhCCeEEEEEECCCCcEEEEEEeCCcee Confidence 3333332211111100 000 0011112223 3567788889999999988865542 2456667654445 Q ss_pred eeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccccccc Q lcl|Aclame:pro 156 RRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIH 235 (510) Q Consensus 156 ~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~ 235 (510) .+..+|+. +| ...++... .|..+ T Consensus 127 ~~~~d~~~--~~---------------------------------------------------~~~~~~~~----~~~~~ 149 (337) T protein:vir:78 127 RRREDGCF--VY---------------------------------------------------LQQGKPNL----IYRPD 149 (337) T ss_pred EeeeCCeE--EE---------------------------------------------------EEcCCceE----EECCc Confidence 54443321 00 00000000 01111 Q ss_pred cCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceee-CCCCccchhhhhcC----------CC Q lcl|Aclame:pro 236 LCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLV-DEAKGAVVDDYQDA----------EM 304 (510) Q Consensus 236 ~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv-~~~g~~~~~~~~~~----------~~ 304 (510) + .+..|.....+.+||.+|..-++..+..-+...+-..+.-.-.+.|-.++ -+++.++++..... .| T Consensus 150 e--IiHik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~~~l~~e~~~~lk~~~~~~~G~~n 227 (337) T protein:vir:78 150 D--VIWLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYATDPNMDDDTEEEMKEMIANSKGVGN 227 (337) T ss_pred c--EEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHhcCccc Confidence 2 23344333446799999999888877766655555455444445565554 35555555432221 11 Q ss_pred c----ceecCC-ccccccccCCCc-cchHHHHHHHHHHHHHHHHHHhhc----ccCCCCCCCC---HHHHHHHHHHHHHH Q lcl|Aclame:pro 305 G----DYVPGG-AEAVRAYERGDY-NKMAAIQQSLQAVVVRLNQAFMYG----ANQRDAERVT---AEEVRITAEEAENT 371 (510) Q Consensus 305 G----~~~~g~-~~~v~~~~~~~~-~~~~~~~~~i~~~~~~I~~af~~~----~~~~~~~~vT---AtEi~~r~~E~~~~ 371 (510) + ...||+ .+.++..+++.. .+.+ ..+.-+-.++.|-.+|-.. +...++..-| +++.... =.... T Consensus 228 ~~~~~v~~~~g~~~Gi~~~pis~~~~d~q-fle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n~e~~~~~--f~~~~ 304 (337) T protein:vir:78 228 FRSMFVNIPDGKPDGIKLIPVGDIATKDE-FAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLGDPEKYDAT--YARNE 304 (337) T ss_pred ccceEEEcCCCCccceeEEEcCCChhHHH-HHHHHHHhHHHHHHHhCCCHHHcccccCCCcCccccHHHHHHH--HHHHH Confidence 1 112333 233444444432 2444 2344456667788888321 1112222222 3333221 22334 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecH Q lcl|Aclame:pro 372 LGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGL 416 (510) Q Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l 416 (510) |.|...++.+++- +.+++..--..++..+-..+ T Consensus 305 L~P~~~~ie~~~n------------~~ll~~~~~~~f~~~~~~~~ 337 (337) T protein:vir:78 305 VLPLCELVQDAIN------------SAGLPRALWVTFRETIGAAV 337 (337) T ss_pred HHHHHHHHHHHHh------------hhcCChhhceeccccccccC Confidence 4555555555443 22222111011111111111 No 154 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=44.91 E-value=0.79 Score=21.15 Aligned_cols=428 Identities=9% Similarity=-0.050 Sum_probs=157.6 Q ss_pred Ch-------hHHHHHHHHHh--c--cC-chHHHHHHHHhhcccccCCCCCCcccccccc--ccchHHHHHHHHHHHHHHh Q lcl|Aclame:pro 1 MK-------STAAMLWEKLR--D--GS-VEQRAIEFAKTTLPYLMVDPMSGSRGVVEHD--FQSAGALLVNNLAAKLARS 66 (510) Q Consensus 1 ~k-------~~~~~r~~~lk--r--~~-~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~--~dstg~~a~~~Laa~l~~~ 66 (510) |+ ..+...-..+- + ++ ..+++ .+.+..+..|++ - .+-.. ....+-.+|++.|-.++ T Consensus 92 ~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~--~~~~~~~~~f~g-----y-ql~alY~~~~larkiVd~pAeDat-- 161 (862) T protein:vir:99 92 IKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEA--LQDWYLSQGFIG-----H-QACALIAQHWLVDKACSLAGEDAI-- 161 (862) T ss_pred hhhhhhhhhhcchhhhhhccccccccccccchh--ccccccccCccc-----H-HHHHHHHhCchhhhhhhhhhHHHh-- Confidence 22 11222222221 1 11 11111 112222211111 0 11122 24445555555555553 Q ss_pred hcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCCCe Q lcl|Aclame:pro 67 LFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEAT 144 (510) Q Consensus 67 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~--~~~~~~ 144 (510) +.|+.+...++..+ .+.+ + .+.+.+.+.+-+....+.++++.--.||.+.+++ +.++.. T Consensus 162 -----R~g~~I~~~~d~~e------~~~e----~----~~~ie~~~~rL~v~~~l~eair~~RLyGga~ililv~~~D~~ 222 (862) T protein:vir:99 162 -----RNGWHLKSLGEGEE------IDEE----S----LEKFKAIDVEFKVKENLIEFNRFKNVFGIRVAIFVVDSEDPD 222 (862) T ss_pred -----hCCceEeecCcccc------cCHH----H----HHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEEecCcCch Confidence 57999876432211 0011 1 1223344555578888999999777777665443 222222 Q ss_pred EEEEEeceEEEeeCCCCceeE--EEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeC Q lcl|Aclame:pro 145 VVAWSLRSYAVRRDATGRWMD--IVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEID 222 (510) Q Consensus 145 ~~~~pl~~~~v~~d~~G~v~~--i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~ 222 (510) .-.=||.- ..=..|.+-. ++-.+..+. ..+.++-.+.... + +-+.+.|. | .+. -|| T Consensus 223 ~LsqPLn~---e~I~kG~lkgl~vlDp~w~~p-~~v~~~~~Dp~sp----~-yGkP~~y~-I---~g~-----~IH---- 280 (862) T protein:vir:99 223 YYEKPFNP---DGITPGSYRGISQIDPYWMMP-MLTAESTADPSSQ----F-FYEPEFWI-I---SGQ-----KYH---- 280 (862) T ss_pred hhhcCcCc---ccccccceeEEEEechhhhcc-ccccccccccccc----c-cCCceeee-e---cCe-----eec---- Confidence 11123311 1111222221 111111111 0011111111100 0 01112221 1 000 111 Q ss_pred CeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC-------CCCccc Q lcl|Aclame:pro 223 GVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD-------EAKGAV 295 (510) Q Consensus 223 ~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~-------~~g~~~ 295 (510) -.++....+ +..|+ +.+....-||+|..+-++..++..........+.+..+.-..+-++ ++++.. T Consensus 281 ~SRliif~g---~~vpd----~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~v~ktd~l~~l~~ed~l~~ 353 (862) T protein:vir:99 281 RSHLIIARG---PQPAD----ILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTTAIHTDTAKAIANEDKFIQ 353 (862) T ss_pred cceeEEecC---CCchh----hhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeechhHhhhccHHHHHH Confidence 111211111 12344 2233344579999998998888888777766665554443333222 122211 Q ss_pred hhh-hhcC--CCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHh--hc-cc-C-CCCCCCCHHH-HHHHHH Q lcl|Aclame:pro 296 VDD-YQDA--EMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM--YG-AN-Q-RDAERVTAEE-VRITAE 366 (510) Q Consensus 296 ~~~-~~~~--~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~--~~-~~-~-~~~~~vTAtE-i~~r~~ 366 (510) ... +... ..|.++-+..+.+..+. .+|.-+...+....+.|.-++= +. ++ + ..|-+-|..+ ++. T Consensus 354 r~~~~~~~rdN~Gi~liD~eEe~e~ls----~slSGL~dll~~~~q~IAaas~IP~tiLfGqspaGlnATGE~D~~n--- 426 (862) T protein:vir:99 354 RLMFWVRYRDNHAVKVLGTDETMEQFD----TSLADFDAVIMGQYQLVASIAKTPATKLLGTAPKGFNSTGEFETIS--- 426 (862) T ss_pred HHHHHHhccCcceeEEecCCCceeEEe----cccCChHHHHHHHHHHHHhhhCCCceeecccCcccccCchHHHHHH--- Confidence 111 1111 11333333333443332 2344445666777777877761 11 22 2 2333335443 221 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhh Q lcl|Aclame:pro 367 EAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDP 446 (510) Q Consensus 367 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~ 446 (510) .---+..++...+.|+++|++.++.... ..+ .++.+++ .+|..+.-..+++......++.+.+.+ .. T Consensus 427 -----YyD~I~s~QE~~L~P~LerL~~li~~~l--g~~-~d~~ieF-npL~~~sekEkAEi~kk~Aea~~~lv~----sG 493 (862) T protein:vir:99 427 -----YHEELESIQEHVYMPFLQRHYLISRLSL--GIQ-HEIDVVM-EPVASMTAQQQADLNKTKAEGGKVLID----GG 493 (862) T ss_pred -----HHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCC-CcceEEe-CCCCCCCHHHHHHHHHHHHHHHHHHHh----cC Confidence 1112334455678899999988775432 222 3466554 234333333333332222222222211 12 Q ss_pred cCCHHHHHHHHHHH--cCCCHhhccCCHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhhccc--------------- Q lcl|Aclame:pro 447 RISLPKMMDTIWAA--FSVDTSQFYKSADELQA--EAEEQRRQAAQAQAAQETLLEGASDMTNAL--------------- 507 (510) Q Consensus 447 ~id~d~~~~~~a~~--~Gvp~~~i~~s~ee~~~--~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~--------------- 507 (510) .|+.+++.+.++.. .|.+ .| +++++.. -.......+.+...+.....+.-...++++ T Consensus 494 vispdEvR~~L~~~~~~g~~--~l--~ded~E~d~~~~~e~~~~~e~~g~a~~~ap~de~~aga~~~~~e~d~~~~p~~~ 569 (862) T protein:vir:99 494 VISPDEERNRIRDDKRSGYN--RL--TKEDAEETPGASPENLAAYQKAGAAQETASAKETQAGAAVTTAEGDQPNVQMVP 569 (862) T ss_pred CCCHHHHHHHHHhcCCcCCC--CC--CcccccccCCCCcccccccccCCcccccccccccccccCCccccCCcccccccC Confidence 46777777766531 1211 11 1111110 000000000000000000000000000111 Q ss_pred ------------CCC Q lcl|Aclame:pro 508 ------------AGV 510 (510) Q Consensus 508 ------------ag~ 510 (510) +++ T Consensus 570 ~~~~g~~~~~t~~~~ 584 (862) T protein:vir:99 570 SMKPGQMVGPEVGIT 584 (862) T ss_pred CCCCCCccccccccc Confidence 111 No 155 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=44.28 E-value=0.81 Score=21.08 Aligned_cols=437 Identities=10% Similarity=0.006 Sum_probs=174.3 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhccc---c----cCCCCCC------ccc----ccccc--ccchHHHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY---L----MVDPMSG------SRG----VVEHD--FQSAGALLVNNLAA 61 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~---~----~~~~~~~------~~~----~~~~~--~dstg~~a~~~Laa 61 (510) ||.-...+--.+.+..-...+.-+..- -++ + .+...+. ..+ +...+ -++.+..+++.+++ T Consensus 1 ~~~p~~~~~~~~~~~~~~~~~~~y~~~-a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~ 79 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMTSLREYAGYHGG-GSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQD 79 (533) T ss_pred CCCchhhhhhcccccchHHHHHhhhhc-cCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHH Confidence 666655555544332222233322221 111 0 0111110 011 11111 47788999999988 Q ss_pred HHHHh-hcCccCccc-ccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHH----------HHHhcCCHHHHHHHHHHHH Q lcl|Aclame:pro 62 KLARS-LFPTGIPFF-RSELTDAIRREADSRDTDITEVTAALARVDRKATQ----------RLFQNASLAVLTQVIKLLI 129 (510) Q Consensus 62 ~l~~~-ltpp~~~WF-~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~----------~l~~snf~~~~~~~~~~l~ 129 (510) .+++. ++|..+|=. .|..++... ++|-..||+.-.. .-.+.+||.....++..++ T Consensus 80 nvVG~Gi~~~~~p~~~~lg~~~~~~-------------~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~ 146 (533) T protein:vir:34 80 HIVGSFFRLSHRPSWRYLGIGEEEA-------------RAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHA 146 (533) T ss_pred HhhCCCceeeeccchhhcCCChhHH-------------HHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHH Confidence 88765 888776633 344443322 2233333333322 2235589999999999999 Q ss_pred hhCceEEEE--eCCCCeEEEEEeceEEEeeCCCCceeEEEEEEEecHHHHhHHhhH---HhhcccccCCCCceEEEEEEE Q lcl|Aclame:pro 130 VTGNALLYR--NSDEATVVAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQ---DLMRAGRNLSGSGSVDLYTHV 204 (510) Q Consensus 130 ~~G~~~l~~--~~~~~~~~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~---~~~~~~~~~~~~~~v~v~~~v 204 (510) +-|-+++-. .+..+. .||+. -.-+-++.|...... .-.+.+.+.+.+.+..=||.. T Consensus 147 ~dGE~f~~~~~~~~~g~--~~~~~-----------------lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~ 207 (533) T protein:vir:34 147 FNGELFVQATWDTSSSR--LFRTQ-----------------FRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVS 207 (533) T ss_pred hCCceEEEeeeccCCCC--ccceE-----------------EEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEe Confidence 999876432 222110 11111 000111111110000 000011112222222333322 Q ss_pred EeecCCCeeEEEEEEeeCCeeeccccccccccCceEEEee-eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhC Q lcl|Aclame:pro 205 QRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTW-NLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLE 283 (510) Q Consensus 205 ~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw-~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~ 283 (510) ...+.....+.. ..+......+ -+-|+.-+ ...+|..-|.+...-+|..++.|+....+.+.++..++. T Consensus 208 ~~~~~~~~~~~~-------~~~~~~~~v~---a~~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~ 277 (533) T protein:vir:34 208 EDGYPGWMPQKW-------TWIPRELPGG---RASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAM 277 (533) T ss_pred ecCCCCcccccc-------ceeeeeeccC---hhHeeeeccccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 221111110000 0011111111 01233333 345899999999999999999999999999999988888 Q ss_pred CceeeC-CCCccch-------------hhhh---------------cCCCcceecCCc-cccccccCC-CccchHHHHHH Q lcl|Aclame:pro 284 VLNLVD-EAKGAVV-------------DDYQ---------------DAEMGDYVPGGA-EAVRAYERG-DYNKMAAIQQS 332 (510) Q Consensus 284 ~~~lv~-~~g~~~~-------------~~~~---------------~~~~G~~~~g~~-~~v~~~~~~-~~~~~~~~~~~ 332 (510) ....+. +.+.-.+ ..+. ..++|.+..-.+ .+++....+ ...++. .- T Consensus 278 ~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~---~f 354 (533) T protein:vir:34 278 YAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYS---VF 354 (533) T ss_pred heeeeecCCCcccccccccCCCcccccccccccchhhhhccCcceeeccCceeeecCCCCeeeecCCCCCCCCHH---HH Confidence 776654 2111000 0000 012232221111 223333322 223443 22 Q ss_pred HHHHHHHHHHHHh--hcccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCcc---- Q lcl|Aclame:pro 333 LQAVVVRLNQAFM--YGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITK---- 406 (510) Q Consensus 333 i~~~~~~I~~af~--~~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~---- 406 (510) ...+...|..++= +..+..|-..++=.-++.-..|.....--.=..+...|+.|+..+.+..+--.|..++|.. T Consensus 355 ~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~ 434 (533) T protein:vir:34 355 EQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFS 434 (533) T ss_pred HHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCC Confidence 2333444444442 2234455555554444444444444433333444555667777777765544554444432 Q ss_pred -------ceeeEEee----cHHHHHHHHHH-HHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHH Q lcl|Aclame:pro 407 -------QHKPAIET----GLPALSRSAAV-QSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADE 474 (510) Q Consensus 407 -------~~~~~~vs----~l~~l~r~~~~-~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee 474 (510) ..+...+. .++++--++.. ..+..-+. -.. .++...|.++. | T Consensus 435 ~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~------s~~-------------~~~a~~G~D~~-------e 488 (533) T protein:vir:34 435 FQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLS------TYE-------------KECAKRGDDYQ-------E 488 (533) T ss_pred chhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCC------CHH-------------HHHHHcCCCHH-------H Confidence 12344432 23333222111 11110000 011 11222343332 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-----HHHHHHH-----hhcccCC Q lcl|Aclame:pro 475 LQAEAEEQRRQAAQAQAAQET-----LLEGASD-----MTNALAG 509 (510) Q Consensus 475 ~~~~~~~~~qqa~~~~~a~~~-----~~~~a~~-----~~~~~ag 509 (510) +-.++.++.+.....-..... ...+..+ ..+.-|+ T Consensus 489 v~~q~a~e~~~~~~~gl~~~~~~~~~~~s~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:34 489 IFAQQVRETMERRAAGLKPPAWAAAAFESGLRQSTEEEKSDSRAA 533 (533) T ss_pred HHHHHHHHHHHHHhcCCCCCCCCCcCccCCCCCCCCCCcccCCCC Confidence 111111111111000000000 0000000 0000000 No 156 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=40.76 E-value=0.96 Score=20.69 Aligned_cols=453 Identities=13% Similarity=0.101 Sum_probs=175.7 Q ss_pred ChhHHHHHHHHHhccCch--HHHHHHHHhhcccccCCCCCCcc-cc-ccccccchHHHHH-HHHHHHHHHhhcCccCc-c Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVE--QRAIEFAKTTLPYLMVDPMSGSR-GV-VEHDFQSAGALLV-NNLAAKLARSLFPTGIP-F 74 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~--~~w~e~~~~~~P~~~~~~~~~~~-~~-~~~~~dstg~~a~-~~Laa~l~~~ltpp~~~-W 74 (510) |-++.++...+.-..--. ..|+...+=+.=+..|.-+.-.. .. ......++..++. -..+..|.++|+..=.| + T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp~ 80 (559) T protein:vir:95 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPA 80 (559) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 888777654332111111 13555555444344443332111 11 1112233333433 56677888888874333 2 Q ss_pred cccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCCeEEE-EEece- Q lcl|Aclame:pro 75 FRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVA-WSLRS- 152 (510) Q Consensus 75 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~~~~~-~pl~~- 152 (510) -.+ .. +...+.+..+. ..+.+.|+ .+...+.+.+..+| .|..-- .-+.. +-.|+ T Consensus 81 ~~W------F~-l~~~d~~~~e~--------~~v~~~L~------~ve~~~~~~l~~sn--f~~~~~-~~~~~L~~~Gta 136 (559) T protein:vir:95 81 RPW------FR-LATPDPEMMDY--------GPVKLWLE------AVQNRMNDMFNKSN--LYQSLP-QLYGSLGTYSTG 136 (559) T ss_pred Ccc------cc-cccCCccccch--------HHHHHHHH------HHHHHHHHHHHhcC--cHHHHH-HHHHHHHhhCce Confidence 221 11 11112111111 12233332 22222223233233 111100 00111 11244 Q ss_pred -EEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceE-EEEEEEEee----------------------c Q lcl|Aclame:pro 153 -YAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSV-DLYTHVQRR----------------------K 208 (510) Q Consensus 153 -~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v-~v~~~v~~~----------------------~ 208 (510) .++..|+.+ .+|-..++..+. + ...+++..+ +||+..+.. . T Consensus 137 ~l~~~~d~~~----~~r~~~~~l~~~-------~----v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~ 201 (559) T protein:vir:95 137 AMAVLDDDED----IIRTMPFPIGSY-------Y----LANSPRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWES 201 (559) T ss_pred eeEeecCCCc----eeEEEEeecCeE-------E----EeeCCCCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhc Confidence 445555443 222222332221 1 111222222 223222110 0 Q ss_pred CCCeeEEEEEEeeCCeeeccccccccccCceEEEeeeecCCC-ccc----------------cchHHHH---HHHHHHH- Q lcl|Aclame:pro 209 GTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTWNLAPGE-HYG----------------RGHVEDY---IGDFAKL- 267 (510) Q Consensus 209 ~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge-~YG----------------rgp~~~~---l~d~~~L- 267 (510) +....+..+++.+.-......+.+....+||--.-|....++ +-| +.+++.| .|-...| T Consensus 202 ~~~~~~v~v~~~V~pr~~~~~~~~~~~~~pf~s~~~e~~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~al~ 281 (559) T protein:vir:95 202 GTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALG 281 (559) T ss_pred CCCCCeEEEEEEEeccccccccccccccceEEEEEEEecCCCceeeecCCcccCCccceeeeecCCccccccchHHHhhH Confidence 111122334433332222222233334688888888753321 111 1222222 1122222 Q ss_pred -HHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCCCcceecCCccccccccCCCccchHHH---HHHHHHHHHHHHHH Q lcl|Aclame:pro 268 -SLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAI---QQSLQAVVVRLNQA 343 (510) Q Consensus 268 -~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~~G~~~~g~~~~v~~~~~~~~~~~~~~---~~~i~~~~~~I~~a 343 (510) -+.-+.+.+....++. ..++| .++.|++... .+-+..||+.+.+....... .++.. ...++.++..|.+. T Consensus 282 d~k~L~~l~~~~l~~~~--~~~~p-p~~v~~~~~~-~~~~l~pgg~~~~~~~~~~~--~i~p~~~~~~~~~~~~~~i~~~ 355 (559) T protein:vir:95 282 PVKALQLLQKRKSQLID--KATNP-PMVAPTSLKN-QRASLLPGDITYIDQITGQD--GFRPAYLVNPSTADLVADIQDT 355 (559) T ss_pred HHHHHHHHHHHHHHHHH--HHhcC-ceeccccccc-cceeeeccceeeeCCCCCcc--cceeecccccchHHHHHHHHHH Confidence 2222333333333333 24555 5555665543 33567888888765443222 23322 12233333333221 Q ss_pred --HhhcccCCCC----CCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHh-----hc-CC---CCCCccce Q lcl|Aclame:pro 344 --FMYGANQRDA----ERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVD-----DA-LL---QGLITKQH 408 (510) Q Consensus 344 --f~~~~~~~~~----~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~-----~~-~l---~~~p~~~~ 408 (510) =+-..+.-|- ...++. +.-+.|.... ..--...|+|++.|+-..+. |. .+ -..-|..- T Consensus 356 ~~rI~~af~~d~~~~l~~r~~~--rvTAtEV~~r-----~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p 428 (559) T protein:vir:95 356 RQIINSAYFVDLFMMLQNINTR--SMPVEAVIEM-----KEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPP 428 (559) T ss_pred HHHHHHHhhhhhHHHhhcCCCC--CCCHHHHHHH-----HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCc Confidence 0122333331 122333 1122232222 11112347888877644322 21 11 12333333 Q ss_pred ee-----EEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHH-HHHHHHHcCCCHhhccCCHHHH--HHHHH Q lcl|Aclame:pro 409 KP-----AIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKM-MDTIWAAFSVDTSQFYKSADEL--QAEAE 480 (510) Q Consensus 409 ~~-----~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~-~~~~a~~~Gvp~~~i~~s~ee~--~~~~~ 480 (510) +. ..++++++|+|+++..++..+.++++.+++++|+.|.+ .|.+ .+.+.+.+.--- + + +.+-+ +++.+ T Consensus 429 ~~l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~~~~laq~~Pev-ld~id~d~~~~~~a~~~-G-v-p~~~irs~~ev~ 504 (559) T protein:vir:95 429 DVMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQVKPEA-LDKLNVDQAIDAFADMS-G-V-SPTVIVPQEQVE 504 (559) T ss_pred ccccCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhh-hhcCCHHHHHHHHHHHh-C-C-chhhcCCHHHHH Confidence 32 12789999999999999999999999999989876653 1111 122222221110 0 1 12222 12222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 481 EQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 481 ~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) +.+||.++|| +++.+++.+.+.++..... T Consensus 505 ~~rqqr~~~q-q~~q~~~~~~~aa~~~~~~ 533 (559) T protein:vir:95 505 QARQQRAQQQ-QQQQMMAMGMAAAQGVKTL 533 (559) T ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHhhhcc Confidence 2222222222 2222222223332222222 No 157 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=40.33 E-value=0.98 Score=20.64 Aligned_cols=382 Identities=9% Similarity=-0.014 Sum_probs=146.8 Q ss_pred HHHHHHH-hc--cCchHHHHHHHHhhcccccCCCCCCcccccc--c-cccchHHHHHHHHHHHHHHhhcCccCcccccCC Q lcl|Aclame:pro 6 AMLWEKL-RD--GSVEQRAIEFAKTTLPYLMVDPMSGSRGVVE--H-DFQSAGALLVNNLAAKLARSLFPTGIPFFRSEL 79 (510) Q Consensus 6 ~~r~~~l-kr--~~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~--~-~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~ 79 (510) =..|+.| +| ..-...+.+..++.-... .+.+..... . .-.++--.|++.+|+.+. +-||--... T Consensus 1 Mg~f~~lf~r~~~~~~~~~~~~~~~~~~~~----~~~~g~~v~~~~al~~~~v~~~i~~Ia~~ia------~~p~~~~~~ 70 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPAELADAIGLSY----DTYTGKQISSQRAMRLTAVFSCVRVLAESVG------MLPCNLYHL 70 (414) T ss_pred CchhhhhhccCccCcccchhhHhHhhccCc----cccCCceechhhhhccHHHHHHHHHHHHHhc------cCceEEEEe Confidence 5555555 21 222233333333321111 011111110 1 112333344555544443 224322222 Q ss_pred ChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHH-hc----CCHHHHHHHHHHHHhhCceEEEEeCCCCe-EEEEEe--c Q lcl|Aclame:pro 80 TDAIRREADSRDTDITEVTAALARVDRKATQRLF-QN----ASLAVLTQVIKLLIVTGNALLYRNSDEAT-VVAWSL--R 151 (510) Q Consensus 80 ~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~~l~~~G~~~l~~~~~~~~-~~~~pl--~ 151 (510) .+..... ..... +...|. +- +.+.-+..+..++...||+.+|+..+.++ ...+|| + T Consensus 71 ~~~~~~~---------~~~~~-------~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~~g~~~~L~~l~~~ 134 (414) T protein:vir:44 71 NGSLKQR---------ATGER-------LHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKAFGEVAELLPVDPG 134 (414) T ss_pred cCCceee---------cccch-------HHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCcEEEEEEEcCc Confidence 2111000 00111 112232 22 34444566677788899998887655444 345666 3 Q ss_pred eEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccc Q lcl|Aclame:pro 152 SYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGR 231 (510) Q Consensus 152 ~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~ 231 (510) .+.+..+..|++ + |..... +|... . T Consensus 135 ~v~~~~~~~~~~--~----------------------------------y~~~~~---------------~g~~~----~ 159 (414) T protein:vir:44 135 CVVPKLNSSWEP--V----------------------------------YQVTFP---------------DGSTD----V 159 (414) T ss_pred eEEEEECCCCcE--E----------------------------------EEEEec---------------CceEE----E Confidence 444444444432 1 111100 01000 0 Q ss_pred cccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCC-------- Q lcl|Aclame:pro 232 WPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE-------- 303 (510) Q Consensus 232 y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~-------- 303 (510) |..++ .+..|.... +..||.||..-+...+.....+.+.......-...|..++.-++.+.++...... T Consensus 160 ~~~~e--vih~~~~~~-d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~ 236 (414) T protein:vir:44 160 LSQED--IWHVRTLTL-DGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKDFEERHT 236 (414) T ss_pred Ecccc--EEEecCCCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhc Confidence 10011 233332222 3379999999999888888888888877777777787777655655555332211 Q ss_pred ---C-cc--eecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccC--CCCCCCCHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 304 ---M-GD--YVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQ--RDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 304 ---~-G~--~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~--~~~~~vTAtEi~~r~~E~~~~LG 373 (510) + |. +++++. +..++... ..+.+. .+..+..+..|.++|-.. .+. ..+..-+++|.. T Consensus 237 g~~n~~~~~vl~~g~-~~~~l~~~-~~d~~~-~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~----------- 302 (414) T protein:vir:44 237 GLGNAHRPMILEMGL-DWKSMALN-AEDSQF-LETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELG----------- 302 (414) T ss_pred CccccCcceecCCCc-eEEEccCC-hHHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH----------- Confidence 0 11 122222 23333322 234554 344556667888888332 111 112222333322 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHH Q lcl|Aclame:pro 374 GTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKM 453 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~ 453 (510) ..+...-+.|++.+.-..|.+..+++.......+++- +..+.|+-.... ..+++.+-.. + -+..++ T Consensus 303 ---~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd--~~~ll~~d~~~~-~~~~~~~~~~-G------~~t~NE- 368 (414) T protein:vir:44 303 ---LGFINYSLVPYLTRIEQRINTGLVRKSKQGVFYAKFN--AGALLRGDMKSR-FEAYATGINW-G------IYSPND- 368 (414) T ss_pred ---HHHHHHHHHHHHHHHHHHHHhhcCCccccCceEEEEe--chhhhccCHHHH-HHHHHHHHhC-C------CcCHHH- Confidence 1233445666666655555555554443333334432 223333211111 1111111111 1 112222 Q ss_pred HHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCC Q lcl|Aclame:pro 454 MDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAG 509 (510) Q Consensus 454 ~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag 509 (510) +-+.+|.|+- ...|+.-.-..-........+ ...++....+..... T Consensus 369 ---~R~~~gl~p~---~ggD~~~~~~n~~~~~~~~~~----~~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 369 ---CRDLEDMNPR---PGGDVYLTPMNMTTKPSDGSK----AGKQKDNANADETTS 414 (414) T ss_pred ---HHHHhCCCCC---CCcceecccccccccCCcccc----CCCCCCCCCCCCCCC Confidence 2233455541 112211100000000000000 000000000000001 No 158 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=39.45 E-value=1 Score=20.54 Aligned_cols=366 Identities=11% Similarity=0.079 Sum_probs=137.5 Q ss_pred ChhHHHH------HHHHH-hccC-chHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccC Q lcl|Aclame:pro 1 MKSTAAM------LWEKL-RDGS-VEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~k~~~~~------r~~~l-kr~~-~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~ 72 (510) =|++-.+ .|.+- +|+. .-..|-...--++|......+..-+. ..-+=.++--.|++.+|+.+.+. T Consensus 15 ~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~al~~~~V~~cv~~Ia~~iA~l------ 87 (441) T protein:vir:94 15 SRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKD-IEAIRHSDIFTAVMMIASDLARM------ 87 (441) T ss_pred ccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccch-hhhhccHHHHHHHHHHHHhhccC------ Confidence 1222211 12221 1221 11111111111122211111100000 00012334445667766666552 Q ss_pred cccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCceEEEEeCCC-C-eE Q lcl|Aclame:pro 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLF-QNA----SLAVLTQVIKLLIVTGNALLYRNSDE-A-TV 145 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~l~~~G~~~l~~~~~~-~-~~ 145 (510) || ++.-.... .. +..++..|. +-| .+.-....+.++..+||+.+++..+. + .. T Consensus 88 p~-~~~~~~~~------------~~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~ 147 (441) T protein:vir:94 88 PI-RVTVNGQI------------NY-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPM 147 (441) T ss_pred ce-eeecCccc------------cc-------cchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE Confidence 33 33211100 00 111222232 333 33445667777889999988876543 2 23 Q ss_pred EEEEe--ceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCC Q lcl|Aclame:pro 146 VAWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDG 223 (510) Q Consensus 146 ~~~pl--~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~ 223 (510) ..+|+ ..+.+..|.+|++--. ++..+. ++ T Consensus 148 ~L~~i~~~~v~v~~d~~g~~~~~-------------------------------------~~~~~~------------~~ 178 (441) T protein:vir:94 148 NLTFRKTSEIELKSDARGRLYYF-------------------------------------HQRIDS------------NG 178 (441) T ss_pred EEEEEcCceeEEEECCCccEEEE-------------------------------------EEEecc------------CC Confidence 44554 6677777777643110 000000 00 Q ss_pred eeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCc-cchhhh--- Q lcl|Aclame:pro 224 VRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKG-AVVDDY--- 299 (510) Q Consensus 224 ~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~-~~~~~~--- 299 (510) ... ...|..++ .+..|+...+| .||.||...+...+.......+.......-...|..++.-++. .+++.. T Consensus 179 ~~~--~~~~~~~d--vih~k~~~~dg-~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~ 253 (441) T protein:vir:94 179 NNI--ERNVKFED--MLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRA 253 (441) T ss_pred cee--EEEEcccc--EEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHH Confidence 000 01111111 24445444444 7999999998888887777777777776667777776643343 333322 Q ss_pred hcCC----Cc-------ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccCCCCCCCCHHHHHHHHH Q lcl|Aclame:pro 300 QDAE----MG-------DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRDAERVTAEEVRITAE 366 (510) Q Consensus 300 ~~~~----~G-------~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~vTAtEi~~r~~ 366 (510) +..- .| .+++++. ...++.+. ..+.+. .+.....+..|.++|-.. ++..+...-+.+|. .. T Consensus 254 r~~~~~~~~G~~nag~~~vl~~G~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~---~~ 327 (441) T protein:vir:94 254 REEFHKSFSGTKQAGKVVVLDESM-TFDQLEVD-TEVLKL-IRENKSSTREIAGVFGIPLHKFGIETANMSITDA---NL 327 (441) T ss_pred HHHHHHHhcCccccCcceecCCCc-eEEEccCC-hhHHHH-HHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHH---HH Confidence 2111 11 1222222 22333322 234443 344466677788888432 12122222232332 22 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEee--cHHHHHHHHHHHHHHH-----HH--HHHHh Q lcl|Aclame:pro 367 EAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIET--GLPALSRSAAVQSMLN-----AS--QVIAG 437 (510) Q Consensus 367 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs--~l~~l~r~~~~~~~~~-----~~--q~~~~ 437 (510) .....|-|.+.++..|+-.-|..+. ..-.++..... -.+...|+.-...+.. .- ...-. T Consensus 328 ~~~~tl~P~~~~ie~eln~kl~~~~------------~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~g 395 (441) T protein:vir:94 328 DYLSTLKPYITCVCAELNFKFNDEY------------VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDG 395 (441) T ss_pred HHHHHHHHHHHHHHHHHhhhccccc------------cCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhC Confidence 3444677777777777655443210 00011111000 0011222221111111 00 00111 Q ss_pred hcChHhHh---h-----cCCHHHH----------HHHHHHHcCCCHhh Q lcl|Aclame:pro 438 LAPIAQLD---P-----RISLPKM----------MDTIWAAFSVDTSQ 467 (510) Q Consensus 438 ~~~~~q~~---~-----~id~d~~----------~~~~a~~~Gvp~~~ 467 (510) +.+.+.-+ - .+..+.+ .+.-. .-|=. .. T Consensus 396 l~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~-kgGe~-~e 441 (441) T protein:vir:94 396 LAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKL-KGGEE-NE 441 (441) T ss_pred CCCCCCCCcceEeeccccccccccccccccccccccccc-CCCCC-CC Confidence 11111000 0 0000000 00000 00000 00 No 159 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=39.45 E-value=1 Score=20.54 Aligned_cols=366 Identities=11% Similarity=0.079 Sum_probs=137.5 Q ss_pred ChhHHHH------HHHHH-hccC-chHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccC Q lcl|Aclame:pro 1 MKSTAAM------LWEKL-RDGS-VEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~k~~~~~------r~~~l-kr~~-~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~ 72 (510) =|++-.+ .|.+- +|+. .-..|-...--++|......+..-+. ..-+=.++--.|++.+|+.+.+. T Consensus 15 ~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~al~~~~V~~cv~~Ia~~iA~l------ 87 (441) T protein:vir:79 15 SRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKD-IEAIRHSDIFTAVMMIASDLARM------ 87 (441) T ss_pred ccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccch-hhhhccHHHHHHHHHHHHhhccC------ Confidence 1222211 12221 1221 11111111111122211111100000 00012334445667766666552 Q ss_pred cccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCceEEEEeCCC-C-eE Q lcl|Aclame:pro 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLF-QNA----SLAVLTQVIKLLIVTGNALLYRNSDE-A-TV 145 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~l~~~G~~~l~~~~~~-~-~~ 145 (510) || ++.-.... .. +..++..|. +-| .+.-....+.++..+||+.+++..+. + .. T Consensus 88 p~-~~~~~~~~------------~~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~ 147 (441) T protein:vir:79 88 PI-RVTVNGQI------------NY-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPM 147 (441) T ss_pred ce-eeecCccc------------cc-------cchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE Confidence 33 33211100 00 111222232 333 33445667777889999988876543 2 23 Q ss_pred EEEEe--ceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCC Q lcl|Aclame:pro 146 VAWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDG 223 (510) Q Consensus 146 ~~~pl--~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~ 223 (510) ..+|+ ..+.+..|.+|++--. ++..+. ++ T Consensus 148 ~L~~i~~~~v~v~~d~~g~~~~~-------------------------------------~~~~~~------------~~ 178 (441) T protein:vir:79 148 NLTFRKTSEIELKSDARGRLYYF-------------------------------------HQRIDS------------NG 178 (441) T ss_pred EEEEEcCceeEEEECCCccEEEE-------------------------------------EEEecc------------CC Confidence 44554 6677777777643110 000000 00 Q ss_pred eeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCc-cchhhh--- Q lcl|Aclame:pro 224 VRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKG-AVVDDY--- 299 (510) Q Consensus 224 ~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~-~~~~~~--- 299 (510) ... ...|..++ .+..|+...+| .||.||...+...+.......+.......-...|..++.-++. .+++.. T Consensus 179 ~~~--~~~~~~~d--vih~k~~~~dg-~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~ 253 (441) T protein:vir:79 179 NNI--ERNVKFED--MLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRA 253 (441) T ss_pred cee--EEEEcccc--EEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHH Confidence 000 01111111 24445444444 7999999998888887777777777776667777776643343 333322 Q ss_pred hcCC----Cc-------ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccCCCCCCCCHHHHHHHHH Q lcl|Aclame:pro 300 QDAE----MG-------DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRDAERVTAEEVRITAE 366 (510) Q Consensus 300 ~~~~----~G-------~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~vTAtEi~~r~~ 366 (510) +..- .| .+++++. ...++.+. ..+.+. .+.....+..|.++|-.. ++..+...-+.+|. .. T Consensus 254 r~~~~~~~~G~~nag~~~vl~~G~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~---~~ 327 (441) T protein:vir:79 254 REEFHKSFSGTKQAGKVVVLDESM-TFDQLEVD-TEVLKL-IRENKSSTREIAGVFGIPLHKFGIETANMSITDA---NL 327 (441) T ss_pred HHHHHHHhcCccccCcceecCCCc-eEEEccCC-hhHHHH-HHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHH---HH Confidence 2111 11 1222222 22333322 234443 344466677788888432 12122222232332 22 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEee--cHHHHHHHHHHHHHHH-----HH--HHHHh Q lcl|Aclame:pro 367 EAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIET--GLPALSRSAAVQSMLN-----AS--QVIAG 437 (510) Q Consensus 367 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs--~l~~l~r~~~~~~~~~-----~~--q~~~~ 437 (510) .....|-|.+.++..|+-.-|..+. ..-.++..... -.+...|+.-...+.. .- ...-. T Consensus 328 ~~~~tl~P~~~~ie~eln~kl~~~~------------~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~g 395 (441) T protein:vir:79 328 DYLSTLKPYITCVCAELNFKFNDEY------------VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDG 395 (441) T ss_pred HHHHHHHHHHHHHHHHHhhhccccc------------cCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhC Confidence 3444677777777777655443210 00011111000 0011222221111111 00 00111 Q ss_pred hcChHhHh---h-----cCCHHHH----------HHHHHHHcCCCHhh Q lcl|Aclame:pro 438 LAPIAQLD---P-----RISLPKM----------MDTIWAAFSVDTSQ 467 (510) Q Consensus 438 ~~~~~q~~---~-----~id~d~~----------~~~~a~~~Gvp~~~ 467 (510) +.+.+.-+ - .+..+.+ .+.-. .-|=. .. T Consensus 396 l~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~-kgGe~-~e 441 (441) T protein:vir:79 396 LAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKL-KGGEE-NE 441 (441) T ss_pred CCCCCCCCcceEeeccccccccccccccccccccccccc-CCCCC-CC Confidence 11111000 0 0000000 00000 00000 00 No 160 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=37.72 E-value=1.1 Score=20.35 Aligned_cols=377 Identities=12% Similarity=0.031 Sum_probs=144.1 Q ss_pred ChhHHHHHHHHH-hccCchHH--HHHHHHhhcccccCCCCCCccccc--cccc-cchHHHHHHHHHHHHHHhhcCccCcc Q lcl|Aclame:pro 1 MKSTAAMLWEKL-RDGSVEQR--AIEFAKTTLPYLMVDPMSGSRGVV--EHDF-QSAGALLVNNLAAKLARSLFPTGIPF 74 (510) Q Consensus 1 ~k~~~~~r~~~l-kr~~~~~~--w~e~~~~~~P~~~~~~~~~~~~~~--~~~~-dstg~~a~~~Laa~l~~~ltpp~~~W 74 (510) |. +.++ ++.+++.+ |..+ +.-+.......+... .+.. .++--.|++.+|+.+.+. || T Consensus 1 m~------~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~l------p~ 63 (419) T protein:vir:57 1 MF------IPQFWKGRPSENRVNWQVV-----PGGMRSSSSQAGVIITPETALALSAVRACVTLLAESVAQL------PC 63 (419) T ss_pred Cc------chhhhccCCcccccccccc-----ccccccccccCCceechHHhhccHHHHHHHHHHHHhhccC------ce Confidence 22 2222 33344332 2211 111111111111100 0112 233344555555555432 44 Q ss_pred cccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCceEEEEeCCC-C-eEEE Q lcl|Aclame:pro 75 FRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLF-QNA----SLAVLTQVIKLLIVTGNALLYRNSDE-A-TVVA 147 (510) Q Consensus 75 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~l~~~G~~~l~~~~~~-~-~~~~ 147 (510) .-..-.+..-.+. + .+..+...|. +-| .+.-....+.++..+|++.+++..+. + .... T Consensus 64 ~~~~~~~~g~~~~---------~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L 128 (419) T protein:vir:57 64 VLYRRTENGGREI---------A------FDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITEL 128 (419) T ss_pred EEEEEcCCCceec---------c------ccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEE Confidence 3222111110000 0 0111222342 233 44445666778889999988876443 2 2455 Q ss_pred EEe--ceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCee Q lcl|Aclame:pro 148 WSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVR 225 (510) Q Consensus 148 ~pl--~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~ 225 (510) ||| ..+.+..+.+|.+ |.++ ...|.. T Consensus 129 ~pl~~~~v~v~~~~~g~~---~y~~-------------------------------------------------~~~~~~ 156 (419) T protein:vir:57 129 IPINPHKVIVLKGPDGMP---YYDI-------------------------------------------------PSIGEI 156 (419) T ss_pred EEEcCcceEEEECCCceE---EEEE-------------------------------------------------cCCceE Confidence 665 4455555554432 1000 000000 Q ss_pred eccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCC----Cccchhh--- Q lcl|Aclame:pro 226 VGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEA----KGAVVDD--- 298 (510) Q Consensus 226 ~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~----g~~~~~~--- 298 (510) +. .++ .+..|....+ ..||.||...+...+.....+.+.......-...|..++.-. ..+.++. T Consensus 157 ~~------~~~--vih~r~~~~d-~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~ 227 (419) T protein:vir:57 157 LP------MRM--VHHIKSFSLD-GYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDA 227 (419) T ss_pred Ec------hhh--EEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHH Confidence 00 011 1222322223 489999999999999988888888888777777776665321 1222222 Q ss_pred hhc--------CCC-c--ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccC--CCCCCCCHHHHHH Q lcl|Aclame:pro 299 YQD--------AEM-G--DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQ--RDAERVTAEEVRI 363 (510) Q Consensus 299 ~~~--------~~~-G--~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~--~~~~~vTAtEi~~ 363 (510) ++. ..+ | .+++++. .+.++.. +..+.+.+ +..+..+..|..+|-.. .+. ..+..-+++|... T Consensus 228 ~~~~~~~~~~g~~nag~~~vl~~g~-~~~~l~~-~~~d~q~~-e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~ 304 (419) T protein:vir:57 228 ILAKWTERYGGVRNAFSVGMLQEGM-TYKQLSQ-DNEKAQLL-QSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEHQGL 304 (419) T ss_pred HHHHHHHHhccccccccceecCCCc-eEEEcCC-ChhhHHHH-HHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHHHHH Confidence 111 001 1 1223322 2233332 22355433 34456667888888322 111 1122223333221 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHh Q lcl|Aclame:pro 364 TAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQ 443 (510) Q Consensus 364 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q 443 (510) .+...-|.|++.+.-..+.+..+.+.......+++- ++.+.|+ +......+.+.+-..+- T Consensus 305 --------------~f~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~i~fd--~~~ll~~-d~~~~~~~~~~~~~~G~--- 364 (419) T protein:vir:57 305 --------------QYVIYTMLAILKRHESAMMRDLLLPSERRDFYIEFN--VSSLLRG-DQKSRYESYALGRQWGW--- 364 (419) T ss_pred --------------HHHHHHHHHHHHHHHHHHHhhccCccccCCeEEEEe--chhhhcc-CHHHHHHHHHHHHhCCC--- Confidence 133444666666665555555554433333444442 1222332 11111222222111111 Q ss_pred HhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 444 LDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEE-QRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 444 ~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~-~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) +..+ ++-+.+|.|+ +..-|++-.-... ..++..... .+...++...++.+ T Consensus 365 ----~T~N----E~R~~~gl~p---~~ggD~~~~~~n~~~~~~~~~~~------~~~~~~~~~~~~~~ 415 (419) T protein:vir:57 365 ----LSVN----DIRRMENLTP---IPGGDKYLTPLNMVDSKALTGIG------KATPQQLKDIEAIL 415 (419) T ss_pred ----cCHH----HHHHHhCCCC---CCCcCeeeecccccccccccccc------CCCcccCcchhhhh Confidence 1111 1222344443 1222222100000 000000000 00011222223333 No 161 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=36.84 E-value=1.2 Score=20.25 Aligned_cols=432 Identities=10% Similarity=0.005 Sum_probs=174.3 Q ss_pred Chh------HHHHHHHHHhccCchHHHHHHHHhhcccccCCCCC------Ccccccccc--ccchHHHHHHHHHHHHHH- Q lcl|Aclame:pro 1 MKS------TAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPMS------GSRGVVEHD--FQSAGALLVNNLAAKLAR- 65 (510) Q Consensus 1 ~k~------~~~~r~~~lkr~~~~~~w~e~~~~~~P~~~~~~~~------~~~~~~~~~--~dstg~~a~~~Laa~l~~- 65 (510) .+. +....|+.-..++--..|. ..|.....+.. .-..+...+ -++.+..+++.+++.+++ T Consensus 19 ~~~~~~~~~~~~~~y~aa~~~r~~~~w~-----~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~ 93 (505) T protein:vir:96 19 WYRYVEPQKNAARAFEAARRDRLGKAWL-----RRASRLSADEEIYADLASLVQRAREQSINNPYAKRFYQLLKNNVIGP 93 (505) T ss_pred hhhhHHHHHHhhhhcccccCCCcccccc-----CCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhcCC Confidence 111 1112233222111111121 01211100000 001111112 477899999999999996 Q ss_pred -hhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE-eCCCC Q lcl|Aclame:pro 66 -SLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR-NSDEA 143 (510) Q Consensus 66 -~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~-~~~~~ 143 (510) +++|..++.......++.+.+ .-...-+.|.+.- -+..=.+.+||.....++...+.-|-+++-. ..+.. T Consensus 94 ~Gi~~~~~~~~~~~~~~~~~~~-----~ie~~w~~Wa~~~---~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~ 165 (505) T protein:vir:96 94 KGMTFQSRVKRRNGKPDDRANT-----LIEGNWQQWIKKG---NCDVTGRYHFVTLLHLWMETLARDGEVLVREHRGYPN 165 (505) T ss_pred CcceeeecCCcccccccHHHHH-----HHHHHHHHhcCCc---CcceeccCCHHHHHHHHHHHHhhCCceEEEEeecCCC Confidence 899988886654433433221 0011123332210 1122334679999999999999999875422 11111 Q ss_pred eEEEEEeceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeC- Q lcl|Aclame:pro 144 TVVAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEID- 222 (510) Q Consensus 144 ~~~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~- 222 (510) . +|+ +-.-+-++.|..... ...++. -.|+.-|+.+. .+.|. .||+... T Consensus 166 ~---~~~-----------------~lqliepd~l~~~~n--------~~~~~~-~~i~~GIe~d~-~Gr~~-aY~i~~~h 214 (505) T protein:vir:96 166 K---WGY-----------------ALQILECDRLDLNYN--------ADLQNG-NRIRMSIELDA-WERPV-AYHLLVNH 214 (505) T ss_pred C---cce-----------------EEEEechhhcCCCCC--------cccCCc-CeEEeceEECC-CCceE-EEEEeecC Confidence 0 000 001111222211110 000000 12334444322 22221 1221100 Q ss_pred -Ce-eecccc-ccccccCc--eEEEeee-ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCC-C-cc Q lcl|Aclame:pro 223 -GV-RVGETG-RWPIHLCP--YIVPTWN-LAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEA-K-GA 294 (510) Q Consensus 223 -~~-~~~~~~-~y~~~~~P--~~~~Rw~-~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~-g-~~ 294 (510) |. ...... .......| -|..-|. ..+|..-|.+...-+|..++.|.....+.+.++..++.....+..+ + +. T Consensus 215 Pgd~~~~~~~~~~~~~rvpa~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~ 294 (505) T protein:vir:96 215 PGDNSYCYHYAGQTYERVPADEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYD 294 (505) T ss_pred CCccccccccccccccccCHhHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCC Confidence 00 000000 00001122 2333333 4578899999999999999999999999999999988877666422 1 11 Q ss_pred chhh------hhcCCCcceecCCc-cccccccCCC-ccchHHHHHHHHHHHHHHHHHHh--hcccCCCCCCCCHHHHHHH Q lcl|Aclame:pro 295 VVDD------YQDAEMGDYVPGGA-EAVRAYERGD-YNKMAAIQQSLQAVVVRLNQAFM--YGANQRDAERVTAEEVRIT 364 (510) Q Consensus 295 ~~~~------~~~~~~G~~~~g~~-~~v~~~~~~~-~~~~~~~~~~i~~~~~~I~~af~--~~~~~~~~~~vTAtEi~~r 364 (510) .+.. ....++|.+..-.+ .+++.+..+. ..++. .-...+...|..++= +..+..|-..++=.-+++- T Consensus 295 ~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~---~f~~~~lr~iaaglgi~ye~lt~D~s~~nYSS~R~~ 371 (505) T protein:vir:96 295 QPPEDDQGEIVEEVEAGTYQLLPYGIRFKEHKIDHPHTNFG---AFVKSSLRGVAAGMGPAYNRLAHDLEGVNFSSLRSG 371 (505) T ss_pred CccccccCccccccCCceeeecCCCCeeeeeCCCCCCCCHH---HHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHH Confidence 1110 11122343322111 2344443332 23442 112222233333331 2234445455554445544 Q ss_pred HHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccc----eeeEEeec----HHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 365 AEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQ----HKPAIETG----LPALSRSAAVQSMLNASQVIA 436 (510) Q Consensus 365 ~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~----~~~~~vs~----l~~l~r~~~~~~~~~~~q~~~ 436 (510) ..|.....--.=..+..-|+.|+..+.+..+.-.|..++|... ++...+.+ ++++--++ .... .+. T Consensus 372 ~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~---a~~~---~i~ 445 (505) T protein:vir:96 372 ELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSK---AHSE---SIK 445 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHH---HHHH---HHH Confidence 4455544444444555667788888877766555555555432 22333221 22221111 0000 000 Q ss_pred hhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 437 GLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 437 ~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ++. .. ...++...|.++. ++-.++.++.+.....- +.. ......+..+- T Consensus 446 --~G~-------~t---~~~~~a~~G~D~~-------~v~~q~a~e~~~~~~~G-----l~~-~~~~~~~~~~~ 494 (505) T protein:vir:96 446 --NRT-------RS---RSSIIRAAGDDPE-------DVFDEIAWEEQLMRDKG-----VNP-TPPEQESKDAT 494 (505) T ss_pred --cCC-------CC---HHHHHHHcCCCHH-------HHHHHHHHHHHHHHHcC-----CCC-CCCCCCCCCCC Confidence 010 00 1112222454442 22222222211111000 000 00000000000 No 162 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=34.08 E-value=1.3 Score=19.93 Aligned_cols=252 Identities=12% Similarity=0.035 Sum_probs=110.1 Q ss_pred CccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHH-hc----CCHHHHHHHHHHHHhhCceEEEEeCCC- Q lcl|Aclame:pro 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLF-QN----ASLAVLTQVIKLLIVTGNALLYRNSDE- 142 (510) Q Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~~l~~~G~~~l~~~~~~- 142 (510) =++-||--.. .+ + +.+..| ...|. +- +.+.=+...+.++..+||+.+++..+. T Consensus 1 ia~l~~~~~~-~~---------~----~~~~~l-------~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~ 59 (278) T protein:vir:78 1 MASLPLKMYE-DY---------K----VVNTEV-------SDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY 59 (278) T ss_pred CccceeEEEe-cC---------c----ccccHH-------HHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCC Confidence 0011221110 00 0 011111 12232 22 344556777788899999988765432 Q ss_pred C-eEEEEEe--ceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEE Q lcl|Aclame:pro 143 A-TVVAWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYH 219 (510) Q Consensus 143 ~-~~~~~pl--~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~ 219 (510) + ....+|| ..+-+..+.+|.. +++.+. T Consensus 60 G~~~~l~~l~~~~v~v~~~~~~~~--~~y~~~------------------------------------------------ 89 (278) T protein:vir:78 60 HQPSKLFLLNPDVVEMLIENQSRE--LYYSIH------------------------------------------------ 89 (278) T ss_pred CcEEEEEEECCceeEEEEcCCCce--EEEEEE------------------------------------------------ Confidence 2 2445555 3444444444322 111110 Q ss_pred eeCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhh Q lcl|Aclame:pro 220 EIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDY 299 (510) Q Consensus 220 e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~ 299 (510) ..+|... .++.++ .+..|.....+..||.||...+...+...+...+..+.... ..|.+++..++.++++.. T Consensus 90 ~~~g~~~----~~~~~e--vih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~--~~~~~i~~~~~~l~~e~~ 161 (278) T protein:vir:78 90 AATGNKL----IVHNMD--MLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQ--KPDSFMLKYGSNVGKEKR 161 (278) T ss_pred cCCceEE----EEcccc--EEEECCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHhc--CCCcEEEEeCCCCCHHHH Confidence 0111110 011111 33334443456689999999999988888877766544333 235556555565554433 Q ss_pred hcC---------CCcc--eecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccC--CCCCCCCHHHHHHH Q lcl|Aclame:pro 300 QDA---------EMGD--YVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQ--RDAERVTAEEVRIT 364 (510) Q Consensus 300 ~~~---------~~G~--~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~--~~~~~vTAtEi~~r 364 (510) ... .+|. +++++. ++.++... ..+.+. .+..+...+.|-.+|=.. .+. .++..-|++|... T Consensus 162 ~~~~~~~~~~~~~~g~~~vl~~g~-~~~~l~~~-~~d~~~-~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~~- 237 (278) T protein:vir:78 162 QQVLEDFKQYYEENGGILFQEPGV-EIEPLPKK-YVSEDI-VASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNR- 237 (278) T ss_pred HHHHHHHHHHhccCCCceecCCCc-eEEEccCC-hhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH- Confidence 221 1121 222322 23333322 234443 444566778888888332 111 1222234444221 Q ss_pred HHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCc-cceeeEE-eecH Q lcl|Aclame:pro 365 AEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLIT-KQHKPAI-ETGL 416 (510) Q Consensus 365 ~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~-~~~~~~~-vs~l 416 (510) .+...-+.|++.+.-..|.+..+++..- ....+++ ++.| T Consensus 238 -------------~~~~~~l~P~~~~i~~~ln~~L~~~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 238 -------------FYLQHTLLPIVKQYEEEFNRKLLTKTDREKIGILNLTLNLI 278 (278) T ss_pred -------------HHHHHHHHHHHHHHHHHHHhhcCChhHhcCCceEEEecccC Confidence 2223335555555544444444333110 1122322 1223 No 163 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=34.08 E-value=1.3 Score=19.93 Aligned_cols=352 Identities=14% Similarity=0.048 Sum_probs=136.5 Q ss_pred ChhHHHHHHHHHh-----ccC-----------chHHHHHHHHhhcccccCCCCCCccccccccc-cchHHHHHHHHHHHH Q lcl|Aclame:pro 1 MKSTAAMLWEKLR-----DGS-----------VEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDF-QSAGALLVNNLAAKL 63 (510) Q Consensus 1 ~k~~~~~r~~~lk-----r~~-----------~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~-dstg~~a~~~Laa~l 63 (510) .-+.....|+++| +++ .-.-|++. +..+ .+.+..-+. .... .++--.|++.+|+.+ T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~---s~~g~~v~~--~~al~~~~V~~~i~~Ia~~i 74 (432) T protein:vir:10 2 PDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDL--GIII---SDTGAAVNA--DAIMRLDAVAACVKLVSQAI 74 (432) T ss_pred CCCcccchhhhhHhhcCCccccccccccccccCcchhhhh--cccc---cccCcccch--hhhhcchHHHHHHHHHHHhh Confidence 2222333333322 111 00011110 0000 001100000 0111 233334555555544 Q ss_pred HHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCceEEEE Q lcl|Aclame:pro 64 ARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLF-QNA----SLAVLTQVIKLLIVTGNALLYR 138 (510) Q Consensus 64 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~l~~~G~~~l~~ 138 (510) .+ + ||.-..-.++...+ .. +.-++..|. +-| .+.-.+..+.++..+||+.+++ T Consensus 75 a~-l-----p~~~y~~~~~g~~~---------~~-------~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~ 132 (432) T protein:vir:10 75 AA-M-----PLTMYMRTPDGRKE---------AV-------NHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRK 132 (432) T ss_pred hh-C-----ceeEEEecCCCccc---------cc-------ccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEE Confidence 43 2 55311111111100 01 112223342 333 3334566677888999998887 Q ss_pred eCCCCe-EEEEEe--ceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEE Q lcl|Aclame:pro 139 NSDEAT-VVAWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYA 215 (510) Q Consensus 139 ~~~~~~-~~~~pl--~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~ 215 (510) ..+.++ ...+|| ..+.+..|.+|++ +|+ + T Consensus 133 ~~~~g~~~~L~~l~~~~v~v~~~~~g~~--~y~-----------------------------------~----------- 164 (432) T protein:vir:10 133 VVTDGRIESLQYLANDRLTITTDTKGNT--AYR-----------------------------------Y----------- 164 (432) T ss_pred EecCCcEEEEEEEcCCceEEEEcCCCcE--EEE-----------------------------------E----------- Confidence 665443 334444 5566777766643 111 0 Q ss_pred EEEEeeCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccc Q lcl|Aclame:pro 216 EMYHEIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAV 295 (510) Q Consensus 216 sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~ 295 (510) ...+|.... +..++ .+..|+...+| .||.||...+...+.......+.......-...|-.++.-++.+. T Consensus 165 ---~~~~g~~~~----~~~~~--iih~~~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~ 234 (432) T protein:vir:10 165 ---RRTDGQMID----IPKQQ--IWKIMGYSLDG-ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLT 234 (432) T ss_pred ---EecCceEEE----EcCcc--EEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCC Confidence 001111100 00111 22233333334 799999999888887777776666665555556666665445555 Q ss_pred hhhhhc---CCCc-------ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccC-C-CCCCCCHHHH Q lcl|Aclame:pro 296 VDDYQD---AEMG-------DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQ-R-DAERVTAEEV 361 (510) Q Consensus 296 ~~~~~~---~~~G-------~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~-~-~~~~vTAtEi 361 (510) ++.... .-.| .+++++. +..++.+. ..+.+. .+..+..+..|.++|-.. .+. . .+..-+.+-+ T Consensus 235 ~e~~~~~~~~~~~~~nag~~~vl~~g~-~~~~l~~~-~~d~q~-le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~ 311 (432) T protein:vir:10 235 DDQYDSFAKKVSGSVEAGRAPLLEGGM-DVKSLGLN-PVDAQL-LQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGI 311 (432) T ss_pred HHHHHHHHHHHhhhhhCCCceecCCCc-eEEEccCC-hHHHHH-HHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchH Confidence 443321 1111 1223322 22333321 234554 344577788899998332 111 1 1111122333 Q ss_pred HHHHHH-HHHHhhhhHHHHHHHHHHHHHHH-------------------------HHHHHhhc------------CCCCC Q lcl|Aclame:pro 362 RITAEE-AENTLGGTYSLLAENLQSPLAYV-------------------------CLSEVDDA------------LLQGL 403 (510) Q Consensus 362 ~~r~~E-~~~~LGpv~~rl~~E~l~Pli~r-------------------------~~~il~~~------------~l~~~ 403 (510) .+.... ....|.|.+.++..++-.-|+.. .+..+-.. ++||+ T Consensus 312 e~~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi 391 (432) T protein:vir:10 312 ESQQLGFLSMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKL 391 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC Confidence 322223 23467788777777765433211 00011111 24444 Q ss_pred CccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcCh--HhHhhcCCHHHHHH Q lcl|Aclame:pro 404 ITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI--AQLDPRISLPKMMD 455 (510) Q Consensus 404 p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~--~q~~~~id~d~~~~ 455 (510) ++++.....-+...++... +.-..+ ..-...-+-++.-+ T Consensus 392 ~g~~~~~~~~~~~~pl~~~-------------~~~~~~~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 392 GGNAAVLTVQSAMVPLDSI-------------GLQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred CCCcceEeecCcccchhhh-------------cccCCCCCCCCCCCcccccccC Confidence 4332111111111111110 000000 00011111111111 No 164 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=33.88 E-value=1.3 Score=19.91 Aligned_cols=359 Identities=9% Similarity=0.034 Sum_probs=127.8 Q ss_pred cccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHH- Q lcl|Aclame:pro 29 LPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRK- 107 (510) Q Consensus 29 ~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~- 107 (510) ++- |...... ........+. +... ++. +..=. .++..... ....|..-.+.+... T Consensus 1 M~~-f~~~~~~---~~~~~~~~~~------~~~~----~~~-~~~~~--~v~~~~al-------~~~~V~~~v~~ia~~i 56 (397) T protein:vir:38 1 MPL-LKLNKSH---SQGFSLNDPD------WVNF----LTG-GEAQK--YVSADTAL-------KNSDIFSLIMQLSGDL 56 (397) T ss_pred Ccc-hhhhhcc---cCcccCCchh------hhhh----hcC-CcCCc--eechHHhh-------ccHHHHHHHHHHHHHH Confidence 221 1110000 0001111110 0000 000 00000 01111000 011111111111111 Q ss_pred -----------HHHHHHhc----CCHHHHHHHHHHHHhhCceEEEEeCCC--CeEEEEEe--ceEEEeeCCCCceeEEEE Q lcl|Aclame:pro 108 -----------ATQRLFQN----ASLAVLTQVIKLLIVTGNALLYRNSDE--ATVVAWSL--RSYAVRRDATGRWMDIVL 168 (510) Q Consensus 108 -----------~~~~l~~s----nf~~~~~~~~~~l~~~G~~~l~~~~~~--~~~~~~pl--~~~~v~~d~~G~v~~i~r 168 (510) ....+.+- ..+.-+..+..++..+|+|.+++..+. .....+|+ ..+.+..+.+|.. ++. T Consensus 57 a~~p~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~--~~y 134 (397) T protein:vir:38 57 AMVRYTSESDRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQDGSG--LIY 134 (397) T ss_pred hhCcccccccHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCce--EEE Confidence 11112222 234445667778888999988765443 22344554 4455555555422 111 Q ss_pred EEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccccccccCceEEEeeeecC Q lcl|Aclame:pro 169 KQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTWNLAP 248 (510) Q Consensus 169 ~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ 248 (510) ++.. ++........++.++ .+..|..... T Consensus 135 ~~~~-------------------------------------------------~~~~~~~~~~~~~~e--iih~~~~~~~ 163 (397) T protein:vir:38 135 NINF-------------------------------------------------DEPAIGYMENVPAAD--VIHIRLLSKN 163 (397) T ss_pred EEEe-------------------------------------------------ccccccceeEecCcc--EEEecCCCCC Confidence 1110 000000000111112 3444555556 Q ss_pred CCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhc----------CC-Cc--ceecCCcccc Q lcl|Aclame:pro 249 GEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD----------AE-MG--DYVPGGAEAV 315 (510) Q Consensus 249 ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~----------~~-~G--~~~~g~~~~v 315 (510) +..||.||...+...+.......+.......-...|..++.-++.+.++.... +. .| .+++++. .+ T Consensus 164 ~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~-~~ 242 (397) T protein:vir:38 164 GGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGGLLDAETRIARSKEISKQIHNSDGPVVIDALE-DY 242 (397) T ss_pred CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceecCCCc-eE Confidence 77899999999999999999888888887777777777765444444433211 11 11 1122221 22 Q ss_pred ccccCCCccchHHHHHHHHHHHHHHHHHHhhcc--cCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 316 RAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGA--NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLS 393 (510) Q Consensus 316 ~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~--~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~ 393 (510) .++... ..+.+ ..+..+..+..|..+|-... +.......+..| +...-....|-|++..+..+|-.-| T Consensus 243 ~~l~~~-~~d~~-~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~e--~~~~~~~~~l~P~~~~ie~~ln~~l------ 312 (397) T protein:vir:38 243 KPLEVK-GNIAS-LLNQVDWTRDQIAKVYGVPDSYLNGQGDQQSSIT--QISGQYAKSLNRYVQAIVGELNDKL------ 312 (397) T ss_pred EecCCC-hhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHH--HHHHHHHHHHHHHHHHHHHHHHHhc------ Confidence 233221 23444 34556778889999984431 221111112222 1122233445555555555543222 Q ss_pred HHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCH---hhccC Q lcl|Aclame:pro 394 EVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDT---SQFYK 470 (510) Q Consensus 394 il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~---~~i~~ 470 (510) ++. .++.+...---....|+.....+.. . + -+..+++-+ .+|.|+ ..+.. T Consensus 313 ------~~~---~~~~~~~~~~~d~~~~~~~~~~~~~-------~-G------~~t~nE~R~----~lg~~p~~~~d~~~ 365 (397) T protein:vir:38 313 ------HAN---ISANIRFAIDAMGDQYASTISSSVK-------G-G------TIAGNQARF----ILQNSGYLAKDLPD 365 (397) T ss_pred ------cCh---hcccccccccCCHHHHHHHHHHHHh-------C-C------CcCHHHHHH----HhCCCCCCCCcccc Confidence 221 1111111111122233322222111 0 1 122222222 223322 11100 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|Aclame:pro 471 SADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALA 508 (510) Q Consensus 471 s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~a 508 (510) .+........... .... ........+..+.+- T Consensus 366 -~~~~~~~~~~~~~----~~~g-~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 366 -PEKEPQQAIQLIQ----QEGG-ENDGNNSDERGSDPE 397 (397) T ss_pred -ccccccccccccc----cccC-CCCCCCCCCCCCCCC Confidence 0000000000000 0000 000000111111111 No 165 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=31.74 E-value=1.5 Score=19.66 Aligned_cols=361 Identities=11% Similarity=0.049 Sum_probs=135.6 Q ss_pred HHHHHHH-hccC-chHH-HHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccCCChh Q lcl|Aclame:pro 6 AMLWEKL-RDGS-VEQR-AIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELTDA 82 (510) Q Consensus 6 ~~r~~~l-kr~~-~~~~-w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~ 82 (510) -..|.+- ||+. .-.. +..... ++|......+..-+. ..-+-.++--.|++.+|+.+.+ -|| ++.-... T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~-~~al~~~~v~~cv~~Ia~~iA~------~p~-~~~~~~~ 71 (416) T protein:vir:45 1 MGIFYKNEKRDLQYNEDDLQMMVQ-TLPGFQGTKLRQYKD-IEAIRHSDIFTAVMMIASDLAR------MPI-RVTVNGQ 71 (416) T ss_pred CCcccccccccccCCCcchhHHHH-HhccccccCccccch-hhhhcchHHHHHHHHHHHhhcc------Cce-EEecCcc Confidence 1122222 2221 1111 111111 233211111100000 0001123334466666666654 243 3321111 Q ss_pred hhhhhccCchHHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCceEEEEeCCC-C-eEEEEEe--ceE Q lcl|Aclame:pro 83 IRREADSRDTDITEVTAALARVDRKATQRLF-QNA----SLAVLTQVIKLLIVTGNALLYRNSDE-A-TVVAWSL--RSY 153 (510) Q Consensus 83 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~l~~~G~~~l~~~~~~-~-~~~~~pl--~~~ 153 (510) . ..+ ..++..|. +-| .+.-....+.++..+||+.+++..+. + ....||+ ..+ T Consensus 72 ~------------~~~-------~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v 132 (416) T protein:vir:45 72 I------------NYS-------DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEI 132 (416) T ss_pred c------------ccc-------chHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcee Confidence 0 011 11222232 333 33445677778889999988876543 2 2344554 666 Q ss_pred EEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccccc Q lcl|Aclame:pro 154 AVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWP 233 (510) Q Consensus 154 ~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~ 233 (510) .+..|.+|++--.+. .+++........|+ T Consensus 133 ~v~~~~~g~~~~~~~---------------------------------------------------~~~~~~~~~~~~~~ 161 (416) T protein:vir:45 133 ELKSDARGRLYYFHQ---------------------------------------------------RIDSNGNNIERNVK 161 (416) T ss_pred EEEECCCccEEEEEE---------------------------------------------------EecCCCceeEEEEc Confidence 677777765321110 00100000001111 Q ss_pred cccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCC-ccchhh---hhc----CCCc Q lcl|Aclame:pro 234 IHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAK-GAVVDD---YQD----AEMG 305 (510) Q Consensus 234 ~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g-~~~~~~---~~~----~~~G 305 (510) .++ .+..|+...+ ..||.||...+...+...+...+.......-...|..++.-++ ..+++. ++. .-.| T Consensus 162 ~~e--vihir~~~~d-~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g 238 (416) T protein:vir:45 162 FED--MLDIKFYSLD-GINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSG 238 (416) T ss_pred ccc--EEEeccCCCC-CccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcC Confidence 112 2334544433 4799999999998888888887777777777777777664333 333332 111 1011 Q ss_pred -------ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccCCCCCCCCHHHHHHHHHHHHHHhhhhH Q lcl|Aclame:pro 306 -------DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRDAERVTAEEVRITAEEAENTLGGTY 376 (510) Q Consensus 306 -------~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~vTAtEi~~r~~E~~~~LGpv~ 376 (510) .+++++. +..++... ..+.+. .+.....+..|..+|-.. ++..+...-+.+|. .......|-|.+ T Consensus 239 ~~nag~~~vl~~g~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~---~~~~~~~l~P~~ 312 (416) T protein:vir:45 239 TKQAGKVVVLDESM-TFDQLEVD-TEVLKL-IRENKSSTREIAGVFGIPLHKFGIETANMSITDA---NLDYLSTLKPYI 312 (416) T ss_pred ccccCceeecCCCc-eeEeccCC-HHHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHH---HHHHHHHHHHHH Confidence 1122222 22233221 123333 344456677888888432 12112222122222 222344666777 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeec--HHHHHHHHHHHHHHH-----HHH--HHHhhcChHhHh-- Q lcl|Aclame:pro 377 SLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETG--LPALSRSAAVQSMLN-----ASQ--VIAGLAPIAQLD-- 445 (510) Q Consensus 377 ~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~--l~~l~r~~~~~~~~~-----~~q--~~~~~~~~~q~~-- 445 (510) ..+..|+-.-|..+ -..-.++...... .+...|+.-.+.+.. .-. ..-.+.+.+.-+ T Consensus 313 ~~ie~~ln~~l~~~------------~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~ 380 (416) T protein:vir:45 313 TCVCAELNFKFNDE------------YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGS 380 (416) T ss_pred HHHHHHHhhhcccc------------ccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc Confidence 77766665443221 1011111111000 011122211111111 000 011111111000 Q ss_pred ------hcCCHHHHHHH---------HHHHcCCCHhh Q lcl|Aclame:pro 446 ------PRISLPKMMDT---------IWAAFSVDTSQ 467 (510) Q Consensus 446 ------~~id~d~~~~~---------~a~~~Gvp~~~ 467 (510) ..+..|. ++. -...=|=.... T Consensus 381 ~~~~~~n~~~~~~-~~~~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:45 381 IHRVDLNHVNIEL-VDEYQMNKSRATDKKLKGGEENE 416 (416) T ss_pred eEeeccccccccc-ccccCcccccccccccCCCCCCC Confidence 0000000 000 00000000001 No 166 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=31.74 E-value=1.5 Score=19.66 Aligned_cols=361 Identities=11% Similarity=0.049 Sum_probs=135.6 Q ss_pred HHHHHHH-hccC-chHH-HHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccCcccccCCChh Q lcl|Aclame:pro 6 AMLWEKL-RDGS-VEQR-AIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELTDA 82 (510) Q Consensus 6 ~~r~~~l-kr~~-~~~~-w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~ 82 (510) -..|.+- ||+. .-.. +..... ++|......+..-+. ..-+-.++--.|++.+|+.+.+ -|| ++.-... T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~-~~al~~~~v~~cv~~Ia~~iA~------~p~-~~~~~~~ 71 (416) T protein:vir:81 1 MGIFYKNEKRDLQYNEDDLQMMVQ-TLPGFQGTKLRQYKD-IEAIRHSDIFTAVMMIASDLAR------MPI-RVTVNGQ 71 (416) T ss_pred CCcccccccccccCCCcchhHHHH-HhccccccCccccch-hhhhcchHHHHHHHHHHHhhcc------Cce-EEecCcc Confidence 1122222 2221 1111 111111 233211111100000 0001123334466666666654 243 3321111 Q ss_pred hhhhhccCchHHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCceEEEEeCCC-C-eEEEEEe--ceE Q lcl|Aclame:pro 83 IRREADSRDTDITEVTAALARVDRKATQRLF-QNA----SLAVLTQVIKLLIVTGNALLYRNSDE-A-TVVAWSL--RSY 153 (510) Q Consensus 83 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~l~~~G~~~l~~~~~~-~-~~~~~pl--~~~ 153 (510) . ..+ ..++..|. +-| .+.-....+.++..+||+.+++..+. + ....||+ ..+ T Consensus 72 ~------------~~~-------~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v 132 (416) T protein:vir:81 72 I------------NYS-------DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEI 132 (416) T ss_pred c------------ccc-------chHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcee Confidence 0 011 11222232 333 33445677778889999988876543 2 2344554 666 Q ss_pred EEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccccc Q lcl|Aclame:pro 154 AVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWP 233 (510) Q Consensus 154 ~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~ 233 (510) .+..|.+|++--.+. .+++........|+ T Consensus 133 ~v~~~~~g~~~~~~~---------------------------------------------------~~~~~~~~~~~~~~ 161 (416) T protein:vir:81 133 ELKSDARGRLYYFHQ---------------------------------------------------RIDSNGNNIERNVK 161 (416) T ss_pred EEEECCCccEEEEEE---------------------------------------------------EecCCCceeEEEEc Confidence 677777765321110 00100000001111 Q ss_pred cccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCC-ccchhh---hhc----CCCc Q lcl|Aclame:pro 234 IHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAK-GAVVDD---YQD----AEMG 305 (510) Q Consensus 234 ~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g-~~~~~~---~~~----~~~G 305 (510) .++ .+..|+...+ ..||.||...+...+...+...+.......-...|..++.-++ ..+++. ++. .-.| T Consensus 162 ~~e--vihir~~~~d-~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g 238 (416) T protein:vir:81 162 FED--MLDIKFYSLD-GINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSG 238 (416) T ss_pred ccc--EEEeccCCCC-CccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcC Confidence 112 2334544433 4799999999998888888887777777777777777664333 333332 111 1011 Q ss_pred -------ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccCCCCCCCCHHHHHHHHHHHHHHhhhhH Q lcl|Aclame:pro 306 -------DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRDAERVTAEEVRITAEEAENTLGGTY 376 (510) Q Consensus 306 -------~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~vTAtEi~~r~~E~~~~LGpv~ 376 (510) .+++++. +..++... ..+.+. .+.....+..|..+|-.. ++..+...-+.+|. .......|-|.+ T Consensus 239 ~~nag~~~vl~~g~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~---~~~~~~~l~P~~ 312 (416) T protein:vir:81 239 TKQAGKVVVLDESM-TFDQLEVD-TEVLKL-IRENKSSTREIAGVFGIPLHKFGIETANMSITDA---NLDYLSTLKPYI 312 (416) T ss_pred ccccCceeecCCCc-eeEeccCC-HHHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHH---HHHHHHHHHHHH Confidence 1122222 22233221 123333 344456677888888432 12112222122222 222344666777 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeec--HHHHHHHHHHHHHHH-----HHH--HHHhhcChHhHh-- Q lcl|Aclame:pro 377 SLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETG--LPALSRSAAVQSMLN-----ASQ--VIAGLAPIAQLD-- 445 (510) Q Consensus 377 ~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~--l~~l~r~~~~~~~~~-----~~q--~~~~~~~~~q~~-- 445 (510) ..+..|+-.-|..+ -..-.++...... .+...|+.-.+.+.. .-. ..-.+.+.+.-+ T Consensus 313 ~~ie~~ln~~l~~~------------~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~ 380 (416) T protein:vir:81 313 TCVCAELNFKFNDE------------YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGS 380 (416) T ss_pred HHHHHHHhhhcccc------------ccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc Confidence 77766665443221 1011111111000 011122211111111 000 011111111000 Q ss_pred ------hcCCHHHHHHH---------HHHHcCCCHhh Q lcl|Aclame:pro 446 ------PRISLPKMMDT---------IWAAFSVDTSQ 467 (510) Q Consensus 446 ------~~id~d~~~~~---------~a~~~Gvp~~~ 467 (510) ..+..|. ++. -...=|=.... T Consensus 381 ~~~~~~n~~~~~~-~~~~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:81 381 IHRVDLNHVNIEL-VDEYQMNKSRATDKKLKGGEENE 416 (416) T ss_pred eEeeccccccccc-ccccCcccccccccccCCCCCCC Confidence 0000000 000 00000000001 No 167 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=30.94 E-value=1.5 Score=19.56 Aligned_cols=387 Identities=9% Similarity=0.028 Sum_probs=159.6 Q ss_pred hHHHHHHHHhhcccccCCCCCCcccccccc-----ccchHHHHHHHHHHHHHHh----hcCccCcccccCCChhhhhhhc Q lcl|Aclame:pro 18 EQRAIEFAKTTLPYLMVDPMSGSRGVVEHD-----FQSAGALLVNNLAAKLARS----LFPTGIPFFRSELTDAIRREAD 88 (510) Q Consensus 18 ~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~-----~dstg~~a~~~Laa~l~~~----ltpp~~~WF~l~~~d~~~~~~~ 88 (510) ..+..-+....+ + ++..++..... ++-...-.-+-|+.+++.. +| +.|+.++-.+.. T Consensus 1 ~~~~D~~~n~~~-----g-g~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~---r~g~~i~~~~~~----- 66 (422) T protein:vir:10 1 MVKTDSYANIFL-----G-GSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETAL---AAGFHIDGIDDE----- 66 (422) T ss_pred CccchhhHHHHc-----C-CCCCccccCcccccCHHHHHHHHHhChhhHHHHhhhhHHHh---cCCccccCCCHH----- Confidence 222222222211 1 11111111111 1111222333444444443 55 578887533211 Q ss_pred cCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCCeEEEEEeceEEEeeCCCCcee--EE Q lcl|Aclame:pro 89 SRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRDATGRWM--DI 166 (510) Q Consensus 89 ~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~~~~~~~~~~pl~~~~v~~d~~G~v~--~i 166 (510) ..++ ..+++-++...+.++++.--.+|.+.+++.-+++.--.=||. ..|.+- .+ T Consensus 67 ------~~~~-----------~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~-------~~g~~~~l~v 122 (422) T protein:vir:10 67 ------PAFW-----------SRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVR-------EGAELETVRV 122 (422) T ss_pred ------HHHH-----------HHHHHhhHHHHHHHHHHhhccccceEEEEEecCCCCcccccc-------ccCceeeEEe Confidence 1111 123334788999999999999999887776533332223442 124332 23 Q ss_pred EEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccccccccCceEEEeeee Q lcl|Aclame:pro 167 VLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTWNL 246 (510) Q Consensus 167 ~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~ 246 (510) +-+..+++.... .+... ..+-+-+.|+.. ... .+.+ ..|| -.++....+. ..|+ +.+ T Consensus 123 ~d~~~i~~~~~~----~dp~s-----~~fg~P~~y~v~-~~~-~~~~-~~iH----~SRli~~~g~---~~p~----~~~ 179 (422) T protein:vir:10 123 YDRTQVKVQTRE----ENPRN-----ARFGEPLTYRIT-TNE-SDMF-YDVH----YSRIHIIDGE---RIPN----VMR 179 (422) T ss_pred eccccccchhcc----cCccc-----cccCcceEEEEe-cCC-CCcc-eeec----cceeEEeCCC---Cchh----hhc Confidence 334444432211 11110 111222333322 211 1111 1222 1122222222 2343 455 Q ss_pred cCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhhCCceeeC------CCCccchh-----hhh---cCCCcc-eecC Q lcl|Aclame:pro 247 APGEHYGRGHVED-YIGDFAKLSLLSEKLGLYELESLEVLNLVD------EAKGAVVD-----DYQ---DAEMGD-YVPG 310 (510) Q Consensus 247 ~~ge~YGrgp~~~-~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~------~~g~~~~~-----~~~---~~~~G~-~~~g 310 (510) ....-||+||... +++.++.....+....+...++.-..+-++ .++..... ... ...++. .+-+ T Consensus 180 ~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~ 259 (422) T protein:vir:10 180 RQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDA 259 (422) T ss_pred ccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEec Confidence 6677789999987 578888888887777766554443332222 11211100 000 011222 2223 Q ss_pred CccccccccCCCccchHHHHHHHHHHHHHHHHHHh--hc-ccCCCCCCCCHH--HHHHHHHHHHHHhhhhHHHHHHHHHH Q lcl|Aclame:pro 311 GAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFM--YG-ANQRDAERVTAE--EVRITAEEAENTLGGTYSLLAENLQS 385 (510) Q Consensus 311 ~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~--~~-~~~~~~~~vTAt--Ei~~r~~E~~~~LGpv~~rl~~E~l~ 385 (510) ..+.+..+.. ++.-+...+....+.|.-+.= +. ++-.....+.|| +-. ...---+..++...+. T Consensus 260 ~~e~~e~~~~----~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatgd~d~-------~~yyd~i~~~Qe~~l~ 328 (422) T protein:vir:10 260 ESEEYSVLNS----DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGVSSSQNTAL-------ETFHKLVDRKRNAELL 328 (422) T ss_pred CCcceEEEec----ccCChHHHHHHHHHHHHhhhCCCeeeeccCCcccccccchHHH-------HHHHHHHHHHHHHHHH Confidence 3334443322 333445667777777776661 11 222223335443 221 1111223445667889 Q ss_pred HHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHH---HcC Q lcl|Aclame:pro 386 PLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWA---AFS 462 (510) Q Consensus 386 Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~---~~G 462 (510) |++++++.++.+. +++.+++- +|..+.-..+++......+..+.+.+ ...++.+++-+.+.. ..| T Consensus 329 p~l~~l~~~i~~s-------~~~~~~f~-pL~~~sekekaei~~~~a~a~~~~~~----~g~i~~~e~r~~L~~~~~~~~ 396 (422) T protein:vir:10 329 PILEFLIPFIVNA-------EEWSVEFN-PLAQESSKDKAEILEKNVNSIAALIA----AGAMDIDEARDTLRTIAPEVK 396 (422) T ss_pred HHHHHHHHHhccc-------CCcEEEeC-CCCCCCHHHHHHHHHHHHHHHHHHHh----cCCCCHHHHHHHhhhhccccc Confidence 9999998887642 34554432 23332222222222222222222211 113566666655543 445 Q ss_pred CCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 463 VDTSQFYKSADELQAEAEEQRRQAAQAQAAQETL 496 (510) Q Consensus 463 vp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~ 496 (510) +... +.. ++......+. .. ..-++.- T Consensus 397 ~~~~-~~~--~~~~~~~~~~---~~--~~~~~~d 422 (422) T protein:vir:10 397 INDG-SVE--TEVTISETSN---DP--LEVPTDD 422 (422) T ss_pred CCCC-CCc--cccchhhcCC---CC--CCCCCCC Confidence 5432 322 2221111000 00 0000000 No 168 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=29.59 E-value=1.6 Score=19.40 Aligned_cols=423 Identities=10% Similarity=0.070 Sum_probs=143.3 Q ss_pred ChhHH--HHHHHHHhcc--CchHHH---------------HHHHH-----------hhcccccCCCCCCccc-------- Q lcl|Aclame:pro 1 MKSTA--AMLWEKLRDG--SVEQRA---------------IEFAK-----------TTLPYLMVDPMSGSRG-------- 42 (510) Q Consensus 1 ~k~~~--~~r~~~lkr~--~~~~~w---------------~e~~~-----------~~~P~~~~~~~~~~~~-------- 42 (510) |++.+ .+|+...++. .+..|- ..+.+ -.+|..--..+...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~~r~~~~~~~~l 80 (551) T protein:vir:80 1 MKNKLGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDL 80 (551) T ss_pred CchhhhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccccceecCcccccCccccChhHH Confidence 55432 2333321110 011100 01111 1111110011101000 Q ss_pred -cccccc--cchHHHHHHHHHHHHHHhhcCccC----cccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 43 -VVEHDF--QSAGALLVNNLAAKLARSLFPTGI----PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQN 115 (510) Q Consensus 43 -~~~~~~--dstg~~a~~~Laa~l~~~ltpp~~----~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s 115 (510) .+-+.| ..+--.|++..|.-+.+.-.+... .=|.+.+.+...+... .-......++ ..|.+- T Consensus 81 ~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~-------~~~~~~~~i~----~~l~~p 149 (551) T protein:vir:80 81 HGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTS-------HDEATIKRIE----SFIEKT 149 (551) T ss_pred HHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccCh-------hHHHHHHHHH----HHHHhc Confidence 011122 233346667766655533222110 1122333222111100 0111111122 223333 Q ss_pred C---------CHHHHHHHHHHHHhhCceEEEEeCC-CC-eEEEEEe--ceEEEeeCCCCceeEEEEEEEecHHHHhHHhh Q lcl|Aclame:pro 116 A---------SLAVLTQVIKLLIVTGNALLYRNSD-EA-TVVAWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDDVYK 182 (510) Q Consensus 116 n---------f~~~~~~~~~~l~~~G~~~l~~~~~-~~-~~~~~pl--~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~ 182 (510) | |..-+...+.|+..+||+.+++..+ .+ ....+|| ..+.+..+.+|.+.+-.++| T Consensus 150 n~~~~p~~~s~~~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~~~~~~~~y------------ 217 (551) T protein:vir:80 150 GVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRF------------ 217 (551) T ss_pred CCCCCCccchHHHHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCccccccCceEE------------ Confidence 3 3344555677888999998765443 22 3455666 45555666666432100000 Q ss_pred HHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccccccccCceEEEe-eee--cCCCccccchHHH Q lcl|Aclame:pro 183 QDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPT-WNL--APGEHYGRGHVED 259 (510) Q Consensus 183 ~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~~~~P~~~~R-w~~--~~ge~YGrgp~~~ 259 (510) ++..+|.... .|..++ .+..| |.. ..+.+||.||..- T Consensus 218 -----------------------------------~~~~~g~~~~---~~~~~e--iiH~~~n~~~~~~~~~~G~spi~~ 257 (551) T protein:vir:80 218 -----------------------------------VQVIDQKIVA---TFNARE--MAFAVRNPRSDIYATGYGYPELEI 257 (551) T ss_pred -----------------------------------EEEeCCcEEE---EEcccc--eEEecccCCCCcccccccccHHHH Confidence 0111111110 011111 12222 111 1235799999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhCCceee--CCCCccchhhh---hc----CCCc-----c--eecCCccccccccCCCc Q lcl|Aclame:pro 260 YIGDFAKLSLLSEKLGLYELESLEVLNLV--DEAKGAVVDDY---QD----AEMG-----D--YVPGGAEAVRAYERGDY 323 (510) Q Consensus 260 ~l~d~~~L~~l~~~~l~~~~~a~~~~~lv--~~~g~~~~~~~---~~----~~~G-----~--~~~g~~~~v~~~~~~~~ 323 (510) +...+.......+.......-...|..++ +.+..+.++.. +. .-.| . ++.+..-.+.++.+ +. T Consensus 258 a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~-~~ 336 (551) T protein:vir:80 258 ALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVNMTP-SA 336 (551) T ss_pred HHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCccccccCCCceEEEccC-Ch Confidence 99999888888887777776667776554 33333343322 11 1011 1 22221112223322 22 Q ss_pred cchHHHHHHHHHHHHHHHHHHhhc----ccCCC-------CCCCCHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 324 NKMAAIQQSLQAVVVRLNQAFMYG----ANQRD-------AERVTAEEVRITA-EEAENTLGGTYSLLAENLQSPLAYVC 391 (510) Q Consensus 324 ~~~~~~~~~i~~~~~~I~~af~~~----~~~~~-------~~~vTAtEi~~r~-~E~~~~LGpv~~rl~~E~l~Pli~r~ 391 (510) .+.+ ..+..+..+..|.++|-.. +...+ .+.+|-.=+.+.. .=....|.|.+.++..+|-.-| T Consensus 337 ~D~q-fle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L---- 411 (551) T protein:vir:80 337 RDME-FEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHI---- 411 (551) T ss_pred hHHH-HHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhh---- Confidence 3444 2344566778888998321 11111 1122221122221 2233456666666665543322 Q ss_pred HHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHH---HHHHHHHcCCCHhhc Q lcl|Aclame:pro 392 LSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKM---MDTIWAAFSVDTSQF 468 (510) Q Consensus 392 ~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~---~~~~a~~~Gvp~~~i 468 (510) ++.. ...+...+.. +....+.... .+...+. +.+.-+-++...++.+-. -|.+...+.+++ T Consensus 412 --------~~~~-~~~~~f~f~~-~~~~~~~~~~-~~~~~~~--~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~--- 475 (551) T protein:vir:80 412 --------VAEF-GDKYTFQFVG-GDIKSELESV-KILAEKA--KVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQR--- 475 (551) T ss_pred --------cccc-CCceEEEeec-cChhhHHHHH-HHHHHHh--cCCcCHHHHHHHhCCCCCCCCCceeeccccccc--- Confidence 2211 2234444432 2222222111 1111111 111112222222221110 011111111110 Q ss_pred cCCHHHHHHHH-HHHHHHHHHHHHHHH------HHHHHHHHhhcccCCC Q lcl|Aclame:pro 469 YKSADELQAEA-EEQRRQAAQAQAAQE------TLLEGASDMTNALAGV 510 (510) Q Consensus 469 ~~s~ee~~~~~-~~~~qqa~~~~~a~~------~~~~~a~~~~~~~ag~ 510 (510) ..+..+... +.+.+++...+...+ ...++-.....+..++ T Consensus 476 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~ 522 (551) T protein:vir:80 476 --IGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDI 522 (551) T ss_pred --ccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCccccCCCc Confidence 000000000 000000000000000 0000000001112222 No 169 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=28.89 E-value=1.7 Score=19.31 Aligned_cols=445 Identities=13% Similarity=0.039 Sum_probs=174.0 Q ss_pred ChhHH-HHHHHHHhc-cCchHHHHHHHHhhcccccCCCC-CCcc----ccccc--cccchHHHHHHHHHHHHHHh-hcCc Q lcl|Aclame:pro 1 MKSTA-AMLWEKLRD-GSVEQRAIEFAKTTLPYLMVDPM-SGSR----GVVEH--DFQSAGALLVNNLAAKLARS-LFPT 70 (510) Q Consensus 1 ~k~~~-~~r~~~lkr-~~~~~~w~e~~~~~~P~~~~~~~-~~~~----~~~~~--~~dstg~~a~~~Laa~l~~~-ltpp 70 (510) .+... ...|+.-.+ .+....|. -+..-.+.. .... .+... .-++.+..+++.+++.+++. ++|. T Consensus 19 ~~~~~~~~~y~gA~~~~r~~~~w~------~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~Gi~~~ 92 (553) T protein:vir:63 19 QSASLGGGGLEGASRLSRETVSWN------PSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGYQRDSIVGAQYRLN 92 (553) T ss_pred hhhhhhcccccccccCCCcccccc------cCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhccCCceee Confidence 11111 112222111 12222222 111111000 0011 11111 25788899999988888876 7776 Q ss_pred cCcccc-c-CCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHH----------HhcCCHHHHHHHHHHHHhhCceEEEE Q lcl|Aclame:pro 71 GIPFFR-S-ELTDAIRREADSRDTDITEVTAALARVDRKATQRL----------FQNASLAVLTQVIKLLIVTGNALLYR 138 (510) Q Consensus 71 ~~~WF~-l-~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l----------~~snf~~~~~~~~~~l~~~G~~~l~~ 138 (510) .+|=.+ | ..++ .+.+.|-..||+.-...- -..+||.....++...++-|-+++-. T Consensus 93 ~~~~~~~l~g~~~-------------~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~ 159 (553) T protein:vir:63 93 SMPDINVIPGATE-------------EWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATA 159 (553) T ss_pred eccchhhhcCCCH-------------HHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEEe Confidence 654322 2 1111 122334444444433322 34579999999999999999876533 Q ss_pred eCCCCeEEEEEeceEEEeeCCCCceeEEEEEEEecHHHHhHHhhH---HhhcccccCCCCceEEEEEEEEeecCCCeeEE Q lcl|Aclame:pro 139 NSDEATVVAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQ---DLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYA 215 (510) Q Consensus 139 ~~~~~~~~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~---~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~ 215 (510) ......-..||+.==.| .++.|...... ...+.+.+.+.+....-||.....++..+... T Consensus 160 ~~~~~~~~~~~~~lq~i-----------------e~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~ 222 (553) T protein:vir:63 160 EWDRAANRPYATCFQMV-----------------STDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMA 222 (553) T ss_pred eeccCCCCcccceEEEe-----------------chhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeeccCCCcccccc Confidence 21111000122110001 11111110000 00111222233333334443332222211000 Q ss_pred EEEEeeCCeeeccccccccccCceEEEeee-ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC-CCCc Q lcl|Aclame:pro 216 EMYHEIDGVRVGETGRWPIHLCPYIVPTWN-LAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD-EAKG 293 (510) Q Consensus 216 sv~~e~~~~~~~~~~~y~~~~~P~~~~Rw~-~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~-~~g~ 293 (510) .-. ..-..+......+ -|-|..-|. ..+|..-|.+...-+|..++.|+....+.+.++..++.....+. +++. T Consensus 223 ~~~--~~~~r~~~~~~v~---a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~~ 297 (553) T protein:vir:63 223 PDM--YKWKFVQQSKPWG---RRQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELPP 297 (553) T ss_pred ccc--cceeeeccccccC---hhHheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCh Confidence 000 0000111111111 123333333 35888999999999999999999999999999998888776654 2111 Q ss_pred cchh------------------------------hhhcCCCcceecCCc-cccccccCC-CccchHHHHHHHHHHHHHHH Q lcl|Aclame:pro 294 AVVD------------------------------DYQDAEMGDYVPGGA-EAVRAYERG-DYNKMAAIQQSLQAVVVRLN 341 (510) Q Consensus 294 ~~~~------------------------------~~~~~~~G~~~~g~~-~~v~~~~~~-~~~~~~~~~~~i~~~~~~I~ 341 (510) -... ......+|.+..-.+ .+++.+... ...+|. .-...+...|. T Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~---~F~~~~lr~ia 374 (553) T protein:vir:63 298 EFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGS---EFEASLNRHLA 374 (553) T ss_pred hhhhhhcccccccccccccccccccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCHH---HHHHHHHHHHH Confidence 0000 000111232221111 233333332 223443 22233344444 Q ss_pred HHHh--hcccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccc------------ Q lcl|Aclame:pro 342 QAFM--YGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQ------------ 407 (510) Q Consensus 342 ~af~--~~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~------------ 407 (510) .++= +..+..|-..++=.-+++-..|....+--.=..+...|..|+..+.+....-.|..++|... T Consensus 375 aglGi~Ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~ 454 (553) T protein:vir:63 375 SAFGMSYEEFTRDFSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKE 454 (553) T ss_pred hhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhh Confidence 4442 22344565556555555555555555544445566677888888887765544544454421 Q ss_pred --eeeEEeec----HHHHHHHHH-HHHHH----HHHHHHHhhcChHhHhhcCCHHHHHHHH------HHHcCCCHhhccC Q lcl|Aclame:pro 408 --HKPAIETG----LPALSRSAA-VQSML----NASQVIAGLAPIAQLDPRISLPKMMDTI------WAAFSVDTSQFYK 470 (510) Q Consensus 408 --~~~~~vs~----l~~l~r~~~-~~~~~----~~~q~~~~~~~~~q~~~~id~d~~~~~~------a~~~Gvp~~~i~~ 470 (510) ++.+.+.+ ++++--++. ...+. +..+.++..+ .|+++.++.+ ++.+|++...-.+ T Consensus 455 a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~G--------~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~ 526 (553) T protein:vir:63 455 ALSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIARLG--------GDFRKSFAQRAREDALLKKYGLTFNLSAK 526 (553) T ss_pred hhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhC--------CCHHHHHHHHHHHHHHHHHcCCCCCCCCc Confidence 12333221 333221111 01111 1111111111 2222222222 2223443211110 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 471 SADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 471 s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) ..- ..-..++...+ .-+.+.+ +.+.|= T Consensus 527 ~~~--~~~~~~~~~~~---------~~~~~~~--~~~~~e 553 (553) T protein:vir:63 527 RSL--GDGRDAATGIA---------EDPAAAQ--TSQQGE 553 (553) T ss_pred ccc--CCCcccCCCCC---------CCCCCCC--cccccC Confidence 000 00000000000 0000000 000000 No 170 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=28.26 E-value=1.8 Score=19.23 Aligned_cols=359 Identities=14% Similarity=0.043 Sum_probs=139.5 Q ss_pred Ch-hHHHHHHHHHh-----ccCchHH-HHHHHHhhcc----cccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 1 MK-STAAMLWEKLR-----DGSVEQR-AIEFAKTTLP----YLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFP 69 (510) Q Consensus 1 ~k-~~~~~r~~~lk-----r~~~~~~-w~e~~~~~~P----~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltp 69 (510) |- ++.-..|.++| ++++... |..+.-..-. ..+++++...=+...-.-.++--.|++.+|+.+.+. T Consensus 1 ~~~~~~mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~l--- 77 (432) T protein:vir:81 1 MPDEKKLGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAM--- 77 (432) T ss_pred CCchhhcchhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHHHHhhhhC--- Confidence 32 22334444432 1111100 1100000000 000111000000000012344445666666665543 Q ss_pred ccCcccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCceEEEEeCCCCe Q lcl|Aclame:pro 70 TGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLF-QNA----SLAVLTQVIKLLIVTGNALLYRNSDEAT 144 (510) Q Consensus 70 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~l~~~G~~~l~~~~~~~~ 144 (510) |+.-..-.++...+ .. +.-++..|. +-| .+.-....+.++...||+.+++..+.++ T Consensus 78 ---p~~~y~~~~~g~~~---------~~-------~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~~g~ 138 (432) T protein:vir:81 78 ---PLTMYMRTPDGRKE---------AV-------NHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTDGR 138 (432) T ss_pred ---ceeeEEecCCccee---------cc-------cchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecCCc Confidence 43211111111110 01 111222232 233 3344566677788899998876655443 Q ss_pred -EEEEEe--ceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEee Q lcl|Aclame:pro 145 -VVAWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEI 221 (510) Q Consensus 145 -~~~~pl--~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~ 221 (510) ...||| ..+-+..|.+|++ +|+ ++ .. T Consensus 139 ~~~L~~l~~~~v~v~~~~~g~~--~y~------------------------------------------------~~-~~ 167 (432) T protein:vir:81 139 IESLQYLANDRLTITTDPKGNT--AYR------------------------------------------------YR-RT 167 (432) T ss_pred EEEEEEEcCCceEEEECCCCcE--EEE------------------------------------------------EE-ec Confidence 334454 4555666655532 111 00 01 Q ss_pred CCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhc Q lcl|Aclame:pro 222 DGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQD 301 (510) Q Consensus 222 ~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~ 301 (510) +|.... +..++ .+..|+...+| .||.||...+...+.......+.......-...|-.++.-++.+.++.... T Consensus 168 ~g~~~~----~~~~~--iih~r~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~ 240 (432) T protein:vir:81 168 DGQMID----IPKQQ--IWKIMGYSLDG-ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDS 240 (432) T ss_pred CceEEE----Ecccc--EEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHH Confidence 111100 00011 22334333445 799999999888888887777776666665666766665555555544331 Q ss_pred CC---Cc-------ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccC-CC-CCCCCHHHHHHHHHH Q lcl|Aclame:pro 302 AE---MG-------DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQ-RD-AERVTAEEVRITAEE 367 (510) Q Consensus 302 ~~---~G-------~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~-~~-~~~vTAtEi~~r~~E 367 (510) .. .| .+++++. +..++.+. ..+.+. .+..+..+..|.++|-.. .+. .+ +..-|.+-+.+.... T Consensus 241 ~~~~~~~~~nag~~~vl~~g~-~~~~l~~~-~~d~q~-le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~ 317 (432) T protein:vir:81 241 FAKKVSGSVEAGRAPLLEGGM-DVKSLGLN-PVDAQL-LQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLG 317 (432) T ss_pred HHHHHhhhhcCCCceecCCCc-eEEEccCC-HHHHHH-HHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHH Confidence 11 11 1222222 22333322 234444 344567778898998321 111 11 111233334333333 Q ss_pred H-HHHhhhhHHHHHHHHHHHHHHH-------------------------HHHHHhhc------------CCCCCCcccee Q lcl|Aclame:pro 368 A-ENTLGGTYSLLAENLQSPLAYV-------------------------CLSEVDDA------------LLQGLITKQHK 409 (510) Q Consensus 368 ~-~~~LGpv~~rl~~E~l~Pli~r-------------------------~~~il~~~------------~l~~~p~~~~~ 409 (510) . ...|.|.+.++..|+-.-|+.. .+..+-+. ++||+++++.. T Consensus 318 f~~~tl~P~~~~ie~~l~~kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~g~~~~ 397 (432) T protein:vir:81 318 FLTMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAV 397 (432) T ss_pred HHHHHHHHHHHHHHHHHHhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcce Confidence 3 3467888888888775544311 01111111 23444433211 Q ss_pred eEEeecHHHHHHHHHHHHHHHHHHHHHhhcCh--HhHhhcCCHHHHHH Q lcl|Aclame:pro 410 PAIETGLPALSRSAAVQSMLNASQVIAGLAPI--AQLDPRISLPKMMD 455 (510) Q Consensus 410 ~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~--~q~~~~id~d~~~~ 455 (510) ...-+...++.... .-..+ ..-...=+-++.-+ T Consensus 398 ~~~~~~~~pl~~~~-------------~~~~~~~~~~~~n~~~~~~~~ 432 (432) T protein:vir:81 398 LTVQSAMVPLDSIG-------------LQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred EeecCcccchhhhc-------------cCCCCCCCCCCCCcccccccC Confidence 11111111111000 00000 00000000000000 No 171 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=27.55 E-value=1.8 Score=19.14 Aligned_cols=383 Identities=11% Similarity=0.038 Sum_probs=145.0 Q ss_pred HHHHHH-h-cc-CchHHHHHHHHhhcccccCCCCCCccccccccc-cchHHHHHHHHHHHHHHhhcCccCcccccCCChh Q lcl|Aclame:pro 7 MLWEKL-R-DG-SVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDF-QSAGALLVNNLAAKLARSLFPTGIPFFRSELTDA 82 (510) Q Consensus 7 ~r~~~l-k-r~-~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~-dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~ 82 (510) --|..| + ++ .-...+-+..+..-.......+..-+. .... .++--.|++.+|+.+.+ -||.-....+. T Consensus 1 ~~f~~~f~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~--~~~l~~~~v~~~i~~Ia~~iA~------~p~~~~~~~~~ 72 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTTPAELAEAIGLSYDTYTGKRISS--QRAMRLTAVYSCVRVLAESVGM------LPCSLYKISGT 72 (413) T ss_pred CccchhhccCccCCccchHHHHHhhhcCcccccCceech--hhhhccHHHHHHHHHHHHhhhh------CceEEEEecCC Confidence 333444 1 11 111122223332221111111110000 0111 23334455555554442 23322222221 Q ss_pred hhhhhccCchHHHHHHHHHHHHHHHHHHHHH-h----cCCHHHHHHHHHHHHhhCceEEEEeCCCCe-EEEEEe--ceEE Q lcl|Aclame:pro 83 IRREADSRDTDITEVTAALARVDRKATQRLF-Q----NASLAVLTQVIKLLIVTGNALLYRNSDEAT-VVAWSL--RSYA 154 (510) Q Consensus 83 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~----snf~~~~~~~~~~l~~~G~~~l~~~~~~~~-~~~~pl--~~~~ 154 (510) ...+ + .+..+...|. + -+.+.-+...+.++...||+.+|+..+.++ ...||| ..+- T Consensus 73 ~~~~----------~------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~g~~~~L~~l~~~~v~ 136 (413) T protein:vir:48 73 LKTR----------V------VDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKALGEVVELLPIDPGCVE 136 (413) T ss_pred ccee----------e------cccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeCCCcEEEEEEEcCceEE Confidence 1110 0 0111122232 2 244555677778888999999887665443 344554 3344 Q ss_pred EeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeecccccccc Q lcl|Aclame:pro 155 VRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPI 234 (510) Q Consensus 155 v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y~~ 234 (510) +..|.+|.+ +| .++.. +|... .|.. T Consensus 137 ~~~~~~~~~--~y------------------------------------------------~~~~~-~g~~~----~~~~ 161 (413) T protein:vir:48 137 PKLNSQWQP--VY------------------------------------------------QVTFP-DGSVD----VLTQ 161 (413) T ss_pred EEEcCCceE--EE------------------------------------------------EEEec-CceEE----EEcc Confidence 444444322 11 01100 01000 0000 Q ss_pred ccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCC-------Cc-- Q lcl|Aclame:pro 235 HLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE-------MG-- 305 (510) Q Consensus 235 ~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~-------~G-- 305 (510) ++ .+..|.... +..||.||...+...+.....+.+.......-...|..++.-++.+.++...... .| T Consensus 162 ~e--vih~~~~~~-d~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~~~~~~~~~~~~g~~ 238 (413) T protein:vir:48 162 DE--IWHVRTLTL-DGLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYERLKKDFEERHTGLG 238 (413) T ss_pred cc--EEEecCcCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhcCcc Confidence 11 222232222 3379999999999999988888888777777777777777655555554332111 11 Q ss_pred ---c--eecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccC-CC-CCCCCHHHHHHHHHHHHHHhhhhH Q lcl|Aclame:pro 306 ---D--YVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQ-RD-AERVTAEEVRITAEEAENTLGGTY 376 (510) Q Consensus 306 ---~--~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~-~~-~~~vTAtEi~~r~~E~~~~LGpv~ 376 (510) . +++++. ++.++... ..+.+. .+..+..+..|..+|-.. .+. .+ +..-++++.. T Consensus 239 n~g~~~vl~~g~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~-------------- 301 (413) T protein:vir:48 239 NAHRPMILEMGL-DWKSMALN-AEDSQF-LETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELG-------------- 301 (413) T ss_pred ccCcceecCCCc-eEEeccCC-hhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHH-------------- Confidence 1 122222 23333221 234443 355566777888888332 111 11 2222333322 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEeecHHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHH Q lcl|Aclame:pro 377 SLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDT 456 (510) Q Consensus 377 ~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs~l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~ 456 (510) ..+...-+.|++.+.-..|.+..+++.......+++- +..|.|+ +......+.+.+-..+- +..++ T Consensus 302 ~~f~~~~i~P~~~~ie~~l~~~L~~~~~~~~~~~~fd--~~~l~~~-d~~~~~~~~~~~~~~g~-------~T~NE---- 367 (413) T protein:vir:48 302 LGFINYSLVPYLTRIEQRINTGLVRESKQGKFYAKFN--AGALLRG-DMKSRFEAYATGINWGI-------YSPND---- 367 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccCccccCCeEEEEe--chhhhcc-CHHHHHHHHHHHHhCCC-------cCHHH---- Confidence 1233445566666655555555444433333334432 2333332 11111222221111111 11111 Q ss_pred HHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|Aclame:pro 457 IWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALA 508 (510) Q Consensus 457 ~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a~~~~~~~a 508 (510) +-+.+|.|+- ..-|+.-.-...........+.... .. .++..-+-+ T Consensus 368 ~R~~~g~~p~---~ggD~~~~~~n~~~~~~~~~~~~~~-~~--~~~~~~~~~ 413 (413) T protein:vir:48 368 CRDLEDMNPR---PGGDVYLTPMNMTTSPSAGDDNGKK-KE--SGDADKTAS 413 (413) T ss_pred HHHHhCCCCC---CCcceeeccccccccccccccCCCC-CC--CCCccccCC Confidence 2233455431 1111111000000000000000000 00 000000000 No 172 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=23.73 E-value=2.3 Score=18.63 Aligned_cols=395 Identities=11% Similarity=0.073 Sum_probs=149.7 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhc----ccc----cCCCCCCccccccccc------cchHHHHHHHHHHHHHHh Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTL----PYL----MVDPMSGSRGVVEHDF------QSAGALLVNNLAAKLARS 66 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~----P~~----~~~~~~~~~~~~~~~~------dstg~~a~~~Laa~l~~~ 66 (510) +++....+-..++ +.+..|.. |.+ +.......-.+...+| |++-.-++++....+.+ T Consensus 17 ~~~~~~~~~~~~~--------~~~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m~e~D~~i~s~l~~Rk~av~~- 87 (526) T protein:vir:99 17 LREPQTSRLAGLA--------KEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMSKRKRAILG- 87 (526) T ss_pred ccchhhhhhhhhh--------hhhcccCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHHHHHHHhC- Confidence 2222222222221 11222211 111 1111111111111222 66666666666665553 Q ss_pred hcCccCcccccCCC-hhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHh-cCCHHHHHHHHHHHHhhCceEEEE--eCCC Q lcl|Aclame:pro 67 LFPTGIPFFRSELT-DAIRREADSRDTDITEVTAALARVDRKATQRLFQ-NASLAVLTQVIKLLIVTGNALLYR--NSDE 142 (510) Q Consensus 67 ltpp~~~WF~l~~~-d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~-snf~~~~~~~~~~l~~~G~~~l~~--~~~~ 142 (510) .+|-= .++ ++.... ....+.++++ |.. .+|...+.+++ +.+.+|-++.-+ +.+. T Consensus 88 -----~~w~I-~p~~~~~~~~----~~~a~~v~~~-----------l~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~ 145 (526) T protein:vir:99 88 -----LDWAV-EPPRNASAAE----KADADYLHEL-----------LLDLEGLEDLLLDAL-DGIGHGYSCIELEWALQG 145 (526) T ss_pred -----CCceE-ecCCCCCHHH----HHHHHHHHHH-----------HhcccCHHHHHHHHH-HhhhhcceeEEEEEeecC Confidence 45642 332 211100 0111223333 333 36776666665 567778666332 1111 Q ss_pred CeE-----EEEEeceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEE Q lcl|Aclame:pro 143 ATV-----VAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEM 217 (510) Q Consensus 143 ~~~-----~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv 217 (510) +.+ ...|-..|.+..+..+. +..+++ T Consensus 146 g~~~~~~l~~r~~~~f~~~~~~~~~-----------------------------------------l~~~~~-------- 176 (526) T protein:vir:99 146 REWMPLAFHHRPQSWFQLNPEDQNE-----------------------------------------LRLRDN-------- 176 (526) T ss_pred CceeEEEeeeecccceeeccCCCcE-----------------------------------------EEecCC-------- Confidence 111 11111111111110000 000000 Q ss_pred EEeeCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC-CCCccch Q lcl|Aclame:pro 218 YHEIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD-EAKGAVV 296 (510) Q Consensus 218 ~~e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~-~~g~~~~ 296 (510) ..+|..+ ..+=|++.|....+|..||.|+...+..-..-=+...+..+..+++---|..+.. |.|.... T Consensus 177 --~~~g~~l--------~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ 246 (526) T protein:vir:99 177 --SPAGEAL--------QPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTADE 246 (526) T ss_pred --CCCceee--------cCCCeEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCCHH Confidence 0011111 1234899999999999999999999999888777788888888888666655543 2232222 Q ss_pred hh------hhc--CCCcceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhcccC---CCC--CCCCHHHHHH Q lcl|Aclame:pro 297 DD------YQD--AEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQ---RDA--ERVTAEEVRI 363 (510) Q Consensus 297 ~~------~~~--~~~G~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~---~~~--~~vTAtEi~~ 363 (510) +. +.. ...+.++|.+ ..|..++.+. ++-..-...++.+.+.|+++++-..+. .++ ..-...|+.. T Consensus 247 ek~~L~~av~~i~~d~~~iiP~~-~~ie~~ea~~-~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~ 324 (526) T protein:vir:99 247 EKATLLRAVTGLGHAAAGIIPET-MAIDFQQAAQ-GSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHN 324 (526) T ss_pred HHHHHHHHHHHHhhCcEEEecCC-ceeEEeecCC-CCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhHHHHHH Confidence 11 111 1224556654 2355555443 344555788999999999999754322 111 2223445543 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEE--eecHHHHHHHHHHHHHHHHHHHHHhhcCh Q lcl|Aclame:pro 364 TAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAI--ETGLPALSRSAAVQSMLNASQVIAGLAPI 441 (510) Q Consensus 364 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~--vs~l~~l~r~~~~~~~~~~~q~~~~~~~~ 441 (510) ...+- ..-.-...+...+..-||..++.+=. +-...+....++++ ....+.-.++. .++.+..+ |+ T Consensus 325 ~v~~d--i~~aDa~~i~~tln~~Li~~l~~~N~--~~~~~~~~~p~~~~~~~e~eDl~~~a~-------~~~~L~~~-G~ 392 (526) T protein:vir:99 325 EVRHD--LLASDARQLAATLSRDLLWPLLVLNR--PGSPDVRRAPRLVFDLREQADITSMAQ-------SIPALVNV-GL 392 (526) T ss_pred HHHHH--HHHHHHHHHHHHHHHHHHHHHHHhCC--CCcCCccccceEEeCCCCcccHHHHHH-------HHHHHHhC-CC Confidence 32221 11122222223333334444333211 11111111122222 12222112222 22222222 11 Q ss_pred HhHhhcCCHHHHHHHHHHHcCCCHhh----c----------------------------cCCHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 442 AQLDPRISLPKMMDTIWAAFSVDTSQ----F----------------------------YKSADELQAEAEEQRRQAAQA 489 (510) Q Consensus 442 ~q~~~~id~d~~~~~~a~~~Gvp~~~----i----------------------------~~s~ee~~~~~~~~~qqa~~~ 489 (510) +|+. +++.+.+|+|... + ..+.+.+........ +... T Consensus 393 -----~i~~----~~i~e~~Gip~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~l~~~~--~~~~ 461 (526) T protein:vir:99 393 -----EIPS----AWVYDKLGIPQPAKNEPVLRSAAQPAILSRQHGQRVAALATIVGPRYGDQQALDKALADLP--AKDM 461 (526) T ss_pred -----ccCH----HHHHHHhCCCCCCCcccccCCCCCCcccccccccccccccccccccCcchhhHHHHHHHHH--HHHH Confidence 1222 2344444554210 0 001111110000000 0000 Q ss_pred HHHHHHHHHH---HHHhhcccCCC Q lcl|Aclame:pro 490 QAAQETLLEG---ASDMTNALAGV 510 (510) Q Consensus 490 ~~a~~~~~~~---a~~~~~~~ag~ 510 (510) +.+..+.+.+ +-+.+....-. T Consensus 462 ~~~~~~~l~~i~~~l~~~~s~ee~ 485 (526) T protein:vir:99 462 QNQANDLLAPLLEAVNRGDSETEL 485 (526) T ss_pred HHHHHHHHHHHHHHHHhcCCHHHH Confidence 0000000111 11111111111 No 173 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=22.71 E-value=2.4 Score=18.49 Aligned_cols=190 Identities=11% Similarity=0.019 Sum_probs=78.1 Q ss_pred EEeecCCCeeEEEEEE--eeCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 204 VQRRKGTAMDYAEMYH--EIDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELES 281 (510) Q Consensus 204 v~~~~~~~~~~~sv~~--e~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a 281 (510) +.+..+++..|.-... ..+|+ ...|..++ .+..|.....+.+||.+|..-++..+..-+...+-....-.-. T Consensus 1 ~r~~~dg~~~y~~~~~~~~~~g~----~~~~~~~e--ilH~r~~~~~~~~~Glspi~~a~~~i~~~~aa~~~~~~~f~Ng 74 (219) T protein:vir:98 1 MRVCKDGNYKYLMKKSLYDTKSE----IYEYNKND--VIFIKLYDPMQQVYGSPDYVGGITSALLNSDATIFRRRYYSNG 74 (219) T ss_pred CceeecCeEEEEEecceecCCce----eEEecccc--EEEecCCCCCCCcceecHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 4444443321111000 11111 12222233 3444543334568999999988887776555544333333334 Q ss_pred hCCceee-CCCCccchhhhhcCC------Ccc------ee--cCCc-cccccccCCC-ccchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 282 LEVLNLV-DEAKGAVVDDYQDAE------MGD------YV--PGGA-EAVRAYERGD-YNKMAAIQQSLQAVVVRLNQAF 344 (510) Q Consensus 282 ~~~~~lv-~~~g~~~~~~~~~~~------~G~------~~--~g~~-~~v~~~~~~~-~~~~~~~~~~i~~~~~~I~~af 344 (510) ..|-.++ .+++.++++...... .|. ++ +|+. +.+...++.. ..+.|. .+.-+..+..|.++| T Consensus 75 ~~p~gil~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qf-le~rk~~~~eIa~~f 153 (219) T protein:vir:98 75 AHMGFILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEF-ANIKNISAQDVLTSH 153 (219) T ss_pred CCCceEEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCHHHHHH-HHHHHhhHHHHHHHh Confidence 5565544 355555554322111 010 11 2211 1222222221 224443 334455566788888 Q ss_pred hhc---ccCCCCCC---CCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEE----ee Q lcl|Aclame:pro 345 MYG---ANQRDAER---VTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAI----ET 414 (510) Q Consensus 345 ~~~---~~~~~~~~---vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~----vs 414 (510) -.. +...+..+ -++++... .=....|.|.+.++..++-. .-+ +|+ .++..+ .+ T Consensus 154 gVPp~~lG~~~~~~~~~sn~eq~~~--~f~~~tL~P~~~~ie~~ln~------------~~~--~~~-~~~~~F~~~~~~ 216 (219) T protein:vir:98 154 RFPPGLSGIIPVNTAGLGDPLKIRE--AYQADEVLPLQEIIAESINS------------DYE--IKS-ALKVNFKQPEKR 216 (219) T ss_pred CCCHHHcccccCCCCCccCHHHHHH--HHHHHHHHHHHHHHHHHhhh------------hhc--CCC-ccEEeecCcccc Confidence 322 11111112 24443332 34455667777776666531 111 111 112221 11 Q ss_pred cHH Q lcl|Aclame:pro 415 GLP 417 (510) Q Consensus 415 ~l~ 417 (510) -++ T Consensus 217 d~~ 219 (219) T protein:vir:98 217 DKN 219 (219) T ss_pred cCC Confidence 122 No 174 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=22.42 E-value=2.4 Score=18.45 Aligned_cols=358 Identities=13% Similarity=0.031 Sum_probs=134.4 Q ss_pred HHHHHHhccCchH---HHHHHHHhhcccccCCCCCCccccc--cccccchH-HHHHHHHHHHHHHhhcCccCcccccCCC Q lcl|Aclame:pro 7 MLWEKLRDGSVEQ---RAIEFAKTTLPYLMVDPMSGSRGVV--EHDFQSAG-ALLVNNLAAKLARSLFPTGIPFFRSELT 80 (510) Q Consensus 7 ~r~~~lkr~~~~~---~w~e~~~~~~P~~~~~~~~~~~~~~--~~~~dstg-~~a~~~Laa~l~~~ltpp~~~WF~l~~~ 80 (510) ..|.+..+.+.+. .+..+..+. .. ..+.... .+.+...+ -.|++.+|+.+.+ + ||--..-. T Consensus 1 m~f~~~~~~~~~~~~~~~~~~~~~~-----g~--~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~-l-----p~~~~~~~ 67 (409) T protein:vir:10 1 MLFRKGFKNQSQEISIDDKKILEWL-----GI--NPSETYVNGKSCLKQATVFGCIRILSDNISK-L-----PIKIYQKK 67 (409) T ss_pred CcccccccCcCCCCCCChHHHHHHh-----cC--CcCcceechhhhhccHHHHHHHHHHHHhhhh-C-----ceEEEEec Confidence 2222221111111 111222221 00 0011110 12233333 3444555444443 2 33211111 Q ss_pred hhhhhhhccCchHHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCceEEEEeCCCC--eEEEEEe--c Q lcl|Aclame:pro 81 DAIRREADSRDTDITEVTAALARVDRKATQRLF-QNA----SLAVLTQVIKLLIVTGNALLYRNSDEA--TVVAWSL--R 151 (510) Q Consensus 81 d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~l~~~G~~~l~~~~~~~--~~~~~pl--~ 151 (510) +.. .+ ++ +..+...|. +-| .+.-+...+.++..+||+.+++..+.. ....||+ . T Consensus 68 ~~~-~~----------~~------~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~ 130 (409) T protein:vir:10 68 DGI-KR----------VP------DHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSD 130 (409) T ss_pred CCe-ee----------cc------CchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCC Confidence 111 00 00 011122232 333 334456677788899999988754432 2344454 3 Q ss_pred eEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeeccccc Q lcl|Aclame:pro 152 SYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGR 231 (510) Q Consensus 152 ~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~ 231 (510) ..-+..|..|....- ..+ .|. + ....|... . T Consensus 131 ~V~v~~~~~~~~~~~-----------------------------~~~-~y~-~--------------~~~~g~~~----~ 161 (409) T protein:vir:10 131 GMKIFVDDTGLLNSE-----------------------------NNV-WYL-Y--------------TDDLGQRH----K 161 (409) T ss_pred ceEEEEcCCcccccc-----------------------------ceE-EEE-E--------------EeCCceeE----E Confidence 343444444432210 000 010 0 01111100 0 Q ss_pred cccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCC-------- Q lcl|Aclame:pro 232 WPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE-------- 303 (510) Q Consensus 232 y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~-------- 303 (510) |..++ .+..|.... +..||.||...+...+.....+.+.......-...|..++.-++.++++...... T Consensus 162 ~~~~e--vih~r~~~~-d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~ 238 (409) T protein:vir:10 162 FMSDE--ILHFKGLTA-DGLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAGDLNPEAEEVFKENFERMSS 238 (409) T ss_pred ecccc--EEEecCcCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCHHHHHHHHHHHHHHhc Confidence 11111 344444433 3489999999988888888888888777777777787776655555554332111 Q ss_pred ---C-c-c-eecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--cc--CCCCCCCCHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 304 ---M-G-D-YVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--AN--QRDAERVTAEEVRITAEEAENTLG 373 (510) Q Consensus 304 ---~-G-~-~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~--~~~~~~vTAtEi~~r~~E~~~~LG 373 (510) + | . +++++. .+.++... ..+.+. .+..+..+..|..+|-.. .+ ..++..-++++.... =....|. T Consensus 239 g~~n~~~~~vl~~g~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~~~--f~~~~l~ 313 (409) T protein:vir:10 239 GLKNAHRIAMLPIGY-KFEPISQK-LVDAQF-LENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQNRE--FYIDTLQ 313 (409) T ss_pred cccccCCceecCCCc-eEEEccCC-hhhHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHHHH--HHHHHHH Confidence 1 1 1 122222 33444332 345554 345567778898888332 11 112222244433221 1223344 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcCCCCCC-ccceeeEEe-ecH---HHHHHHHHHHHHHH-----HH--HHHHhhcCh Q lcl|Aclame:pro 374 GTYSLLAENLQSPLAYVCLSEVDDALLQGLI-TKQHKPAIE-TGL---PALSRSAAVQSMLN-----AS--QVIAGLAPI 441 (510) Q Consensus 374 pv~~rl~~E~l~Pli~r~~~il~~~~l~~~p-~~~~~~~~v-s~l---~~l~r~~~~~~~~~-----~~--q~~~~~~~~ 441 (510) |.+.++..++ .+..+++.. +....+++. ..+ +...|+.-...+.. .- ...-.+.+. T Consensus 314 P~~~~ie~~l------------n~kL~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~ 381 (409) T protein:vir:10 314 SILNMYELEI------------NYKLFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPL 381 (409) T ss_pred HHHHHHHHHH------------HHhhcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 5554444443 322222110 111222221 111 12222222222111 00 011111122 Q ss_pred HhHhh---cCC---HHHHHHHHHHHcCCC Q lcl|Aclame:pro 442 AQLDP---RIS---LPKMMDTIWAAFSVD 464 (510) Q Consensus 442 ~q~~~---~id---~d~~~~~~a~~~Gvp 464 (510) +.-+. ..| .+.+-+...++ |=- T Consensus 382 ~ggD~~~~~~n~~~~~~~~~~~~kg-Ge~ 409 (409) T protein:vir:10 382 EGGDVLLINGNMIPVKMAGEQYSKG-GEK 409 (409) T ss_pred CCcCeeeeccCccchhhcccccccc-CCC Confidence 21100 011 11111111111 111 No 175 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=21.90 E-value=2.5 Score=18.37 Aligned_cols=366 Identities=11% Similarity=0.070 Sum_probs=134.3 Q ss_pred ChhHHH------HHHHHH-hcc-CchHHHHHHHHhhcccccCCCCCCccccccccccchHHHHHHHHHHHHHHhhcCccC Q lcl|Aclame:pro 1 MKSTAA------MLWEKL-RDG-SVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGI 72 (510) Q Consensus 1 ~k~~~~------~r~~~l-kr~-~~~~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~ 72 (510) =|++.. ..|.+- +|+ .+-..|-...--.+|......+..-.. ..-.-.++--.|++.+|+.+.+. T Consensus 15 ~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~al~~~~V~acv~~Ia~~iA~l------ 87 (441) T protein:vir:98 15 SRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKD-IEAIRHSDIFTAVMMIASDLARM------ 87 (441) T ss_pred cccchhhhhhccccccccccccccCCCcchHHHHHHhhcccccCccccch-hhhhccHHHHHHHHHHHHhhccC------ Confidence 111111 111111 121 111111101111122211111110000 00011233344666666666542 Q ss_pred cccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCceEEEEeCCC-C-eE Q lcl|Aclame:pro 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLF-QNA----SLAVLTQVIKLLIVTGNALLYRNSDE-A-TV 145 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~l~~~G~~~l~~~~~~-~-~~ 145 (510) | +++.-.... .. +..++..|. +-| .+.-....+.++..+||+.+++..+. + .. T Consensus 88 p-l~~~~~~~~------------~~-------~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~ 147 (441) T protein:vir:98 88 P-IRVTVNGQI------------NY-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPM 147 (441) T ss_pred c-eEEecCCcc------------cc-------cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEE Confidence 3 233211100 00 111223332 333 33445667778889999988876443 2 34 Q ss_pred EEEEe--ceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCC Q lcl|Aclame:pro 146 VAWSL--RSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDG 223 (510) Q Consensus 146 ~~~pl--~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~ 223 (510) ..||| +.+.+..|.+|++--.+ . . +++ T Consensus 148 ~L~~i~~~~v~v~~~~~g~~~~~~--~-----------------------------------~--------------~~~ 176 (441) T protein:vir:98 148 NLTFRKTSEIELKLDARGRLYYFH--Q-----------------------------------R--------------IDS 176 (441) T ss_pred EEEEEcCceeEEEECCCCcEEEEE--E-----------------------------------E--------------ecc Confidence 45555 66677777777541110 0 0 000 Q ss_pred eeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCC-ccchhhh--- Q lcl|Aclame:pro 224 VRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAK-GAVVDDY--- 299 (510) Q Consensus 224 ~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g-~~~~~~~--- 299 (510) ........+..++ .+..|+...+| .||.||...+...+...+.+.+.......-...|..++.-++ +.+++.. T Consensus 177 ~~~~~~~~~~~~d--viHir~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~e~~~~~ 253 (441) T protein:vir:98 177 NGNNIERNVKFED--MLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRA 253 (441) T ss_pred CcceeeEEEcccc--EEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHH Confidence 0000000111111 23344443344 799999999988888888777777776666666766654333 3333322 Q ss_pred hcCC----Cc-------ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccCCCCCCCCHHHHHHHHH Q lcl|Aclame:pro 300 QDAE----MG-------DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRDAERVTAEEVRITAE 366 (510) Q Consensus 300 ~~~~----~G-------~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~vTAtEi~~r~~ 366 (510) +..- .| .+++++. +..++.+. ..+.+. .+..+..+..|.++|-.. .+..+...-+.+|. .. T Consensus 254 ~~~~~~~~~G~~nag~~~vl~~g~-~~~~l~~~-~~d~q~-~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~---~~ 327 (441) T protein:vir:98 254 REEFHKSFSGTKQAGKVVVLDESM-TFDQLEVD-TEVLKL-IRENKSSTREIAGVFGIPLHKFGIETANMSITDA---NL 327 (441) T ss_pred HHHHHHHhcCccccCcceecCCCc-eEEEccCC-hhHHHH-HHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHH---HH Confidence 1110 11 1112221 22333221 223333 344455667788888432 12122222233332 22 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCccceeeEEee--cHHHHHHHHHHHHHHH-----HH--HHHHh Q lcl|Aclame:pro 367 EAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIET--GLPALSRSAAVQSMLN-----AS--QVIAG 437 (510) Q Consensus 367 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~~~~~~~vs--~l~~l~r~~~~~~~~~-----~~--q~~~~ 437 (510) .....|-|.+.++..|+-.-|..+ ...-.++..... -.+...|+.-...+.. .- ..+-. T Consensus 328 ~y~~tl~P~~~~ie~~ln~~L~~~------------~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~~g 395 (441) T protein:vir:98 328 DYLSTLKPYITCVCAELNFKFNDE------------YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDG 395 (441) T ss_pred HHHHHHHHHHHHHHHHHHhhcccc------------ccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhC Confidence 334566777777777654433211 101011111000 0111222221111111 00 01111 Q ss_pred hcChHhHh-----h---cCCHHHH----------HHHHHHHcCCCHhh Q lcl|Aclame:pro 438 LAPIAQLD-----P---RISLPKM----------MDTIWAAFSVDTSQ 467 (510) Q Consensus 438 ~~~~~q~~-----~---~id~d~~----------~~~~a~~~Gvp~~~ 467 (510) +.+.+.-+ . .++.|.+ .+.-. .-|=. .. T Consensus 396 l~pi~gGd~~~~~~~~n~~~~~~~~~~q~~~~~~~~~~~-kgGe~-ne 441 (441) T protein:vir:98 396 LAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKL-KGGEE-NE 441 (441) T ss_pred CCCCCCCCcceEeeccccccccccccccccccccccccc-CCCCC-CC Confidence 11111000 0 0000000 00000 00000 01 No 176 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=21.61 E-value=2.6 Score=18.33 Aligned_cols=422 Identities=12% Similarity=0.026 Sum_probs=171.3 Q ss_pred ChhHHHHHHHHH--hccCchHHHHHHHHhhcccccCCC------CCCcccccccc--ccchHHHHHHHHHHHHHHh-hcC Q lcl|Aclame:pro 1 MKSTAAMLWEKL--RDGSVEQRAIEFAKTTLPYLMVDP------MSGSRGVVEHD--FQSAGALLVNNLAAKLARS-LFP 69 (510) Q Consensus 1 ~k~~~~~r~~~l--kr~~~~~~w~e~~~~~~P~~~~~~------~~~~~~~~~~~--~dstg~~a~~~Laa~l~~~-ltp 69 (510) +-......|..- .+.+-...|. |.....+ ...-..+...+ -++.+..+++.+++.+++. ++| T Consensus 13 ~~~~~~~~~~~~a~~~~~~~~~w~-------~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~Gi~~ 85 (530) T protein:vir:38 13 TSLREYAGYHGGGGGFGGQLRGWN-------PPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQDHIVGSFFRL 85 (530) T ss_pred cchHHHhhhhcccCCCCCcccccc-------cCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhCCCcee Confidence 111122222211 1222222222 2111000 00111111112 4778999999999888875 777 Q ss_pred ccCcc-cccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHH----------HhcCCHHHHHHHHHHHHhhCceEEEE Q lcl|Aclame:pro 70 TGIPF-FRSELTDAIRREADSRDTDITEVTAALARVDRKATQRL----------FQNASLAVLTQVIKLLIVTGNALLYR 138 (510) Q Consensus 70 p~~~W-F~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l----------~~snf~~~~~~~~~~l~~~G~~~l~~ 138 (510) ..+|= =.|..++.. .++|-..||+.-.... ...+||.....++...++-|-+++-. T Consensus 86 ~~~p~~~~l~~~~~~-------------~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~ 152 (530) T protein:vir:38 86 SYRPSWRYLGINEED-------------SRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQA 152 (530) T ss_pred eeccchhhcCCCHhH-------------HHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEe Confidence 66553 334333322 2233333443333222 23579999999999999999877543 Q ss_pred e--CCCCeEEEEEeceEEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEE Q lcl|Aclame:pro 139 N--SDEATVVAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAE 216 (510) Q Consensus 139 ~--~~~~~~~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~s 216 (510) . ++.+ ..||+.==.| -++.|..... .++.. .|+.-|+-+.. +.|.. T Consensus 153 ~~~~~~g--~~~~~~lq~i-----------------e~d~l~~~~~----------~~~~~-~i~~GIe~d~~-Gr~~a- 200 (530) T protein:vir:38 153 TWDSDST--RLFRTQFKMV-----------------SPKRVSNPNN----------IGDTR-NCRAGVKINDS-GAALG- 200 (530) T ss_pred eeccCCC--CccceEEEEe-----------------chhhcCCCCC----------CCCCC-eeEeeeEECCC-CceEE- Confidence 2 2211 1122110001 1111110000 00100 23334433221 22211 Q ss_pred EEEee---CC------eeeccccccccccCceEEEeeee-cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCce Q lcl|Aclame:pro 217 MYHEI---DG------VRVGETGRWPIHLCPYIVPTWNL-APGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLN 286 (510) Q Consensus 217 v~~e~---~~------~~~~~~~~y~~~~~P~~~~Rw~~-~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~ 286 (510) ||+.- .| ..+......+ -|-|+.-++. .+|..-|.+...-+|..++.|+....+.+.++..++.... T Consensus 201 Y~i~~~~~~~~~~~~~~~~~~~~~v~---a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~ 277 (530) T protein:vir:38 201 YYVSDDGYPGWMAQNWTYIPRELPGG---RPSFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAA 277 (530) T ss_pred EEEeeccCCCccccccceeeeeeccC---hhHeEeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhhee Confidence 11110 00 0111111111 1234444444 4799999999999999999999999999999988887766 Q ss_pred eeC-CCCccchhh-------------h---------------hcCCCcceecCCc-cccccccCCC-ccchHHHHHHHHH Q lcl|Aclame:pro 287 LVD-EAKGAVVDD-------------Y---------------QDAEMGDYVPGGA-EAVRAYERGD-YNKMAAIQQSLQA 335 (510) Q Consensus 287 lv~-~~g~~~~~~-------------~---------------~~~~~G~~~~g~~-~~v~~~~~~~-~~~~~~~~~~i~~ 335 (510) .+. +.+.-.... + ....+|.+..-.+ .+++....+. ..++. .-... T Consensus 278 fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~---~f~~~ 354 (530) T protein:vir:38 278 TIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSLNLQSAQDTDNGYS---TFEQS 354 (530) T ss_pred eeeccCCccccccccccCCcccccccccccchhhhhcccccceeccCceeeecCCCCeeeeeCCCCCCCCHH---HHHHH Confidence 653 211100000 0 0012233222111 2244333322 23443 22233 Q ss_pred HHHHHHHHHh--hcccCCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCcc------- Q lcl|Aclame:pro 336 VVVRLNQAFM--YGANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITK------- 406 (510) Q Consensus 336 ~~~~I~~af~--~~~~~~~~~~vTAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~~~~l~~~p~~------- 406 (510) +...|..++= +..+..|-..++=.-+++-..|.....--.=..+...|+.|+..+.+..+.-.|..++|.. T Consensus 355 ~lr~iaaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~ 434 (530) T protein:vir:38 355 LLRYIAAGLGVSYEQLSRNYSQMSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQE 434 (530) T ss_pred HHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchh Confidence 4444444441 2234445445554444444445554444434445556677777777776555555555431 Q ss_pred ----ceeeEEeec----HHHHHHHHHHHHHHHHHHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHH Q lcl|Aclame:pro 407 ----QHKPAIETG----LPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAE 478 (510) Q Consensus 407 ----~~~~~~vs~----l~~l~r~~~~~~~~~~~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~ 478 (510) .++.+.+.+ ++++--++.. ..-+. ++... ...++...|.+++ ++-.+ T Consensus 435 ~~~a~~~~~w~~p~~~~iDP~Ke~~a~---~~~i~-----~G~~s----------~~~~~a~~G~D~~-------~v~~q 489 (530) T protein:vir:38 435 ARTAWGNANWIGSGRMAIDGLKEVQEA---VMLIE-----AGLST----------YEKECAKRGDDYQ-------EIFAQ 489 (530) T ss_pred hHHhhhceeeecCCccccChHHHHHHH---HHHHH-----cCCCC----------HHHHHHHcCCCHH-------HHHHH Confidence 123444332 3443222111 11000 01100 0112223444432 22221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|Aclame:pro 479 AEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) Q Consensus 479 ~~~~~qqa~~~~~a~~~~~~~a~~~~~~~ag~ 510 (510) +.++.+.....- ... . ..-...+.+|+ T Consensus 490 ~a~e~~~~~~~G-l~~---~-~~~~~~~~~~~ 516 (530) T protein:vir:38 490 QVRESMERRAAG-LNP---P-AWAAAAFEAGV 516 (530) T ss_pred HHHHHHHHHHcC-CCC---C-CCcccccCCCC Confidence 211211111000 000 0 00001111122 No 177 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=21.50 E-value=2.6 Score=18.32 Aligned_cols=347 Identities=13% Similarity=-0.007 Sum_probs=137.0 Q ss_pred HHHHHHH-hccCchHHHHHHHHhhcccc-cCCCCCCcccccccc-ccchHHHHHHHHHHHHHHhhcCccCcccccCCChh Q lcl|Aclame:pro 6 AMLWEKL-RDGSVEQRAIEFAKTTLPYL-MVDPMSGSRGVVEHD-FQSAGALLVNNLAAKLARSLFPTGIPFFRSELTDA 82 (510) Q Consensus 6 ~~r~~~l-kr~~~~~~w~e~~~~~~P~~-~~~~~~~~~~~~~~~-~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~ 82 (510) =..|+++ ++..-.........+..|.. +.-.+..... ... -.++--.|++.+|+.+.+. ||.-....+. T Consensus 1 Mgl~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~--~~al~~~~v~~~v~~ia~~iA~l------p~~~~~~~~~ 72 (409) T protein:vir:84 1 MSLFTRIFSGPSEERTLTKISGIPSPAEDWAMHGDRPGA--NSAMTLGAFYACVTLLADTVASL------SIDAYRKKDN 72 (409) T ss_pred CchhhhhhcCCCcccccccccccccccchhhccCcccch--hhhhccHHHHHHHHHHHHhhhhC------ceEEEEecCC Confidence 2334433 21111111111111111110 0000000000 011 1234445566666666542 4443332222 Q ss_pred hhhhhccCchHHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCceEEEEe--CCCC-eEEEEEe--ce Q lcl|Aclame:pro 83 IRREADSRDTDITEVTAALARVDRKATQRLF-QNA----SLAVLTQVIKLLIVTGNALLYRN--SDEA-TVVAWSL--RS 152 (510) Q Consensus 83 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~l~~~G~~~l~~~--~~~~-~~~~~pl--~~ 152 (510) ...+ +..+...|. +-| .+.-+...+.++..+||+.+|+. +..+ ....||| .. T Consensus 73 ~~~~------------------~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~ 134 (409) T protein:vir:84 73 VRIP------------------VSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDC 134 (409) T ss_pred cccc------------------cchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCce Confidence 1100 011122232 333 33444556667889999987764 2222 2344554 22 Q ss_pred EEEeeCCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEeeCCeeecccccc Q lcl|Aclame:pro 153 YAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRW 232 (510) Q Consensus 153 ~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~~~~~~~~~~y 232 (510) +.|....++. +..+. ..+..+|+.+. T Consensus 135 v~v~~~~~~~------------------------------------------------~~~~~-~~~~~~g~~~~----- 160 (409) T protein:vir:84 135 IHVTDAKDED------------------------------------------------GDWIE-PVYRIDGKVVP----- 160 (409) T ss_pred eEEEEcCCCc------------------------------------------------ceEEE-EEecCCceEEc----- Confidence 2222221111 11011 11122232211 Q ss_pred ccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeCCCCccchhhhhcCC--------- Q lcl|Aclame:pro 233 PIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAE--------- 303 (510) Q Consensus 233 ~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~~~g~~~~~~~~~~~--------- 303 (510) .++ .+..|+....|..||.||...+...+.......+.......-...|..++.-++.+.++...... T Consensus 161 -~~d--vih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~n 237 (409) T protein:vir:84 161 -NHR--IMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDADLTPDQVKQTQKQWIQSHHN 237 (409) T ss_pred -hhh--EEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcc Confidence 111 45556666667789999999999888888888888887777777777776655666665433211 Q ss_pred Cc--ceecCCccccccccCCCccchHHHHHHHHHHHHHHHHHHhhc--ccC-CCCCCCCHHHHHHHHHH-HHHHhhhhHH Q lcl|Aclame:pro 304 MG--DYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQ-RDAERVTAEEVRITAEE-AENTLGGTYS 377 (510) Q Consensus 304 ~G--~~~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~-~~~~~vTAtEi~~r~~E-~~~~LGpv~~ 377 (510) .| .+++++.. +.++... ..+.+. .+..+..+..|.++|=.. .+. .++..-++.=+.+.... ....|.|.+. T Consensus 238 ~g~~~vl~~g~~-~~~~~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~~ 314 (409) T protein:vir:84 238 RRLPAVMSAGIK-WQSVSIT-PNESQF-LETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLLPWLR 314 (409) T ss_pred CCCeeecCCCce-EEEccCC-hhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHHH Confidence 11 12233222 2333221 234443 344456677888888321 111 11121222223222222 3455788888 Q ss_pred HHHHHHHHHHHHH---H-----------------HHHHhh------------cCCCCCCccce--eeEEeecHHHHHHHH Q lcl|Aclame:pro 378 LLAENLQSPLAYV---C-----------------LSEVDD------------ALLQGLITKQH--KPAIETGLPALSRSA 423 (510) Q Consensus 378 rl~~E~l~Pli~r---~-----------------~~il~~------------~~l~~~p~~~~--~~~~vs~l~~l~r~~ 423 (510) ++..+|-.-|... . +..+-+ -++||+|..+. .+...+++..+...+ T Consensus 315 ~ie~~l~~~L~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~ 394 (409) T protein:vir:84 315 CIEQALDTFLPRGQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIPEGDIHLQPMNFVPLGYVPPEE 394 (409) T ss_pred HHHHHHHHhccCCCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccccccccCCccc Confidence 8888764322000 0 000000 12333333221 011111111000000 Q ss_pred HHHHHHHHHHHHHhhcChHhHhhcCCHHH Q lcl|Aclame:pro 424 AVQSMLNASQVIAGLAPIAQLDPRISLPK 452 (510) Q Consensus 424 ~~~~~~~~~q~~~~~~~~~q~~~~id~d~ 452 (510) . +.-++-...-|..+ T Consensus 395 ~--------------~~~~~~~~~~~gn~ 409 (409) T protein:vir:84 395 P--------------AQEPQPNSATEGNK 409 (409) T ss_pred c--------------CcCCCCCCccCCCC Confidence 0 00000011112222 No 178 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=21.45 E-value=2.6 Score=18.31 Aligned_cols=422 Identities=11% Similarity=0.045 Sum_probs=167.9 Q ss_pred ChhHHHHHHHHHhccCchHHHHHHHHhhcccccCCC-----CCCccccccc--cccchHHHHHHHHHHHHHH-hhcCccC Q lcl|Aclame:pro 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDP-----MSGSRGVVEH--DFQSAGALLVNNLAAKLAR-SLFPTGI 72 (510) Q Consensus 1 ~k~~~~~r~~~lkr~~~~~~w~e~~~~~~P~~~~~~-----~~~~~~~~~~--~~dstg~~a~~~Laa~l~~-~ltpp~~ 72 (510) .+.+....|+.-.+.. +|+. .|...++. .+.-..+... .-++.+..+++.+.+.+++ +++|..+ T Consensus 16 ~~~~~~~~y~aa~~~~---~~~~-----~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~vVG~Gi~p~~~ 87 (495) T protein:vir:10 16 LVPVGASAYEGASGGH---RWQD-----IGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVAAAVGNGLTPRWR 87 (495) T ss_pred hhHHHhhhhhccccCc---ccCC-----CCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCCCcccccC Confidence 4444444565543222 1210 01111110 0000111111 2477889999988887764 5666544 Q ss_pred cccccCCChhhhhhhccCchHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--CCC--C----e Q lcl|Aclame:pro 73 PFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SDE--A----T 144 (510) Q Consensus 73 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~l~~~G~~~l~~~--~~~--~----~ 144 (510) + .++.+.+ .-...-+.|.+.| ..-.+.+||.....++..++..|-+++-.. +.. . . T Consensus 88 ~------~~~~~~~-----~ie~~w~~wa~~~-----D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~ 151 (495) T protein:vir:10 88 M------KEQELRQ-----ELQELWGDWVNEA-----DFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQ 151 (495) T ss_pred C------chHHHHH-----HHHHHHHHhhcCc-----ccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceE Confidence 3 2332221 1112233343221 233467899999999999999998764322 111 1 1 Q ss_pred EEEEEeceEEEee----CCCCceeEEEEEEEecHHHHhHHhhHHhhcccccCCCCceEEEEEEEEeecCCCeeEEEEEEe Q lcl|Aclame:pro 145 VVAWSLRSYAVRR----DATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHE 220 (510) Q Consensus 145 ~~~~pl~~~~v~~----d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e 220 (510) ++.+....+.... +++|+. .+.+.+.+.+.+..-||.....++....... . T Consensus 152 lqliepd~l~~~~~~~~~~~g~~----------------------i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~---~ 206 (495) T protein:vir:10 152 LQIIEPDMLASDIPDETLPSGGY----------------------VKGGIRFSNGGKRKAYCFYRNHPAESSLIGD---P 206 (495) T ss_pred EEEechhhcCCCCCCCCCCCCCE----------------------EEeceEECCCCceEEEEEeecCCCccccccc---c Confidence 2333211111111 111111 0112222333334445433333322110000 0 Q ss_pred eCCeeeccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceeeC-CCCccchh-- Q lcl|Aclame:pro 221 IDGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVD-EAKGAVVD-- 297 (510) Q Consensus 221 ~~~~~~~~~~~y~~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~l~~~~~a~~~~~lv~-~~g~~~~~-- 297 (510) ..-..+.. +-|..-|.+.+|..-|.+... .+-.++.|+....+.+.++..++.....+. +++--... T Consensus 207 ~~~~rvpA---------~~vlH~f~~r~gQ~RGis~la-~i~~l~~l~~y~dael~~a~i~A~~~~fi~~~~~~~~~~~~ 276 (495) T protein:vir:10 207 VDTVWIKA---------EHVLHVTVLTVRSDAGAPWFQ-LLLRLNELDQYEDAELVRKKTAALFAAFIQEATADSTGGPT 276 (495) T ss_pred cceeeech---------hheEeccccCCCcccCcchhH-HHHHHHHhhHHHHHHHHHHHHhhhheeeeecCCCccccccc Confidence 00011111 123344566789999998665 455799999999999999988887765553 22111000 Q ss_pred -----------hhhcCCCcceecCCc-cccccccCC-CccchHHHHHHHHHHHHHHHHHHh--hcccCCCCCCCCHHHHH Q lcl|Aclame:pro 298 -----------DYQDAEMGDYVPGGA-EAVRAYERG-DYNKMAAIQQSLQAVVVRLNQAFM--YGANQRDAERVTAEEVR 362 (510) Q Consensus 298 -----------~~~~~~~G~~~~g~~-~~v~~~~~~-~~~~~~~~~~~i~~~~~~I~~af~--~~~~~~~~~~vTAtEi~ 362 (510) .....++|.+..-.+ .++..+... ..+++. .-...+...|..++= +..+..|-..++=.=++ T Consensus 277 ~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~---~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R 353 (495) T protein:vir:10 277 IGQPKRSKGGKRITGLNPGTLQYLQPGQEVKFSNPADVGTTYE---PWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIR 353 (495) T ss_pred cCccccccCcccceecCCceeeecCCCCeeeeeCCCCCCCCHH---HHHHHHHHHHHhhcCCCHHHHhcccccccHHHHH Confidence 011122333322111 224433322 223443 222333334444442 22344565555544444 Q ss_pred HHHHHHHHHhhhhHH-HHHHHHHHHHHHHHHHHHhhcCCCCCCcc------ceeeEEeec----HHHHHHHHHHHHHHHH Q lcl|Aclame:pro 363 ITAEEAENTLGGTYS-LLAENLQSPLAYVCLSEVDDALLQGLITK------QHKPAIETG----LPALSRSAAVQSMLNA 431 (510) Q Consensus 363 ~r~~E~~~~LGpv~~-rl~~E~l~Pli~r~~~il~~~~l~~~p~~------~~~~~~vs~----l~~l~r~~~~~~~~~~ 431 (510) +-..|.....-..=. .+...|..|+..+.+..+--.|..++|+. ..+++.+.+ ++++--++. .. T Consensus 354 ~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A---~~-- 428 (495) T protein:vir:10 354 AGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLA---DL-- 428 (495) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHH---HH-- Confidence 444444444332222 23445777887777776654554444431 122232221 222211111 00 Q ss_pred HHHHHhhcChHhHhhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhhcccCCC Q lcl|Aclame:pro 432 SQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGA-SDMTNALAGV 510 (510) Q Consensus 432 ~q~~~~~~~~~q~~~~id~d~~~~~~a~~~Gvp~~~i~~s~ee~~~~~~~~~qqa~~~~~a~~~~~~~a-~~~~~~~ag~ 510 (510) ..+. ++.. ....++...|.+++ |+-.++.++.+.....-. .-...+.+ ...+..+..+ T Consensus 429 -~~i~--~G~~----------s~~~~~a~~G~D~~-------~v~~q~a~e~~~~~~~Gl-~~~~~p~~~~~~~~~~~~~ 487 (495) T protein:vir:10 429 -GDVR--AGFA----------PISDKQAERGYDME-------ELFDMISDANQLIDEYDL-RLDSDPRYVNGSGAEQKSV 487 (495) T ss_pred -HHHH--cCCC----------CHHHHHHHcCCCHH-------HHHHHHHHHHHHHHHcCC-CCCCCCCcCCCccCCCCCC Confidence 0000 0000 01112222344432 222222222211111100 00000000 0011111122 Done!