Query lcl|NC_015159.1_cdsid_YP_004251280.1 [gene=8] [protein=head-to-tail joining protein] [protein_id=YP_004251280.1] [location=19132..20730] Match_columns 532 No_of_seqs 119 out of 165 Neff 7.8 Searched_HMMs 1612 Date Thu Nov 7 13:02:51 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_37 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_37_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:99672 Length: 532 100.0 1E-186 8E-190 1039.7 59.0 532 1-532 1-532 (532) 2 protein:vir:94572 Length: 535 100.0 3E-176 2E-179 983.3 56.0 527 1-532 1-535 (535) 3 protein:vir:10447 Length: 536 100.0 3E-175 2E-178 977.7 57.5 528 1-532 1-535 (536) 4 protein:vir:2198 Length: 536 # 100.0 3E-175 2E-178 977.5 57.2 528 1-532 1-535 (536) 5 protein:vir:1538 Length: 535 # 100.0 2E-174 1E-177 973.5 56.5 528 1-532 1-535 (535) 6 protein:vir:3361 Length: 535 # 100.0 1E-173 6E-177 969.1 55.7 528 1-532 1-535 (535) 7 protein:vir:8883 Length: 543 # 100.0 6E-172 4E-175 959.2 57.7 530 1-532 1-537 (543) 8 protein:vir:94709 Length: 522 100.0 9E-170 6E-173 947.3 58.1 521 1-530 1-522 (522) 9 protein:vir:6322 Length: 510 # 100.0 3E-166 2E-169 928.2 58.0 506 10-529 1-510 (510) 10 protein:vir:78942 Length: 510 100.0 9E-166 6E-169 925.4 57.7 506 10-529 1-510 (510) 11 protein:vir:100039 Length: 522 100.0 6E-166 3E-169 926.6 54.2 510 12-532 1-522 (522) 12 protein:vir:78696 Length: 542 100.0 9E-164 6E-167 914.5 55.3 513 10-532 1-535 (542) 13 protein:vir:80211 Length: 514 100.0 5E-162 3E-165 905.1 55.9 503 14-524 1-514 (514) 14 protein:vir:98506 Length: 555 100.0 2E-161 1E-164 901.9 56.3 516 1-532 1-543 (555) 15 protein:vir:107404 Length: 555 100.0 2E-161 1E-164 901.9 56.3 516 1-532 1-543 (555) 16 protein:vir:107822 Length: 555 100.0 2E-161 1E-164 901.9 56.3 516 1-532 1-543 (555) 17 protein:vir:1785 Length: 555 # 100.0 2E-161 1E-164 901.8 56.0 510 10-532 1-533 (555) 18 protein:vir:7017 Length: 515 # 100.0 2E-161 1E-164 901.9 54.7 509 1-526 1-515 (515) 19 protein:vir:103765 Length: 549 100.0 1E-161 9E-165 902.4 53.7 516 1-532 1-547 (549) 20 protein:vir:96988 Length: 516 100.0 1E-161 8E-165 902.6 52.9 509 1-526 1-516 (516) 21 protein:vir:105641 Length: 516 100.0 4E-161 3E-164 899.7 54.4 509 1-526 1-516 (516) 22 protein:vir:103330 Length: 517 100.0 1E-159 9E-163 891.4 57.2 511 1-529 1-517 (517) 23 protein:vir:7321 Length: 556 # 100.0 4E-158 3E-161 883.2 54.6 515 1-532 1-556 (556) 24 protein:vir:102668 Length: 547 100.0 3E-157 2E-160 878.7 53.9 507 9-532 1-546 (547) 25 protein:vir:95315 Length: 559 100.0 5E-157 3E-160 877.4 54.7 514 1-532 1-557 (559) 26 protein:vir:94599 Length: 641 100.0 9.1E-88 5.6E-91 497.8 41.6 511 1-532 20-622 (641) 27 protein:vir:80165 Length: 651 100.0 5.5E-71 3.4E-74 405.8 46.3 515 1-532 3-641 (651) 28 protein:vir:95449 Length: 584 100.0 6.1E-40 3.8E-43 235.6 34.8 496 1-518 1-584 (584) 29 protein:vir:3139 Length: 599 # 100.0 6.1E-38 3.8E-41 224.6 35.9 503 1-532 1-595 (599) 30 protein:vir:8846 Length: 705 # 100.0 1.1E-31 6.8E-35 190.3 44.4 506 1-532 1-637 (705) 31 protein:vir:95821 Length: 763 99.9 3.1E-25 1.9E-28 155.0 42.0 510 1-532 15-706 (763) 32 protein:vir:93630 Length: 776 99.8 1.9E-18 1.2E-21 117.8 38.0 503 1-532 23-681 (776) 33 protein:vir:108295 Length: 711 99.7 1.1E-14 7E-18 97.0 41.8 516 1-532 1-679 (711) 34 protein:vir:10117 Length: 714 99.5 1.7E-12 1E-15 85.1 42.6 501 1-532 8-693 (714) 35 protein:vir:817 Length: 714 # 99.5 1.7E-12 1E-15 85.1 42.6 501 1-532 8-693 (714) 36 protein:vir:9950 Length: 714 # 99.5 1.7E-12 1E-15 85.1 42.6 501 1-532 8-693 (714) 37 protein:vir:3296 Length: 714 # 99.5 1.7E-12 1E-15 85.1 42.6 501 1-532 8-693 (714) 38 protein:vir:2764 Length: 714 # 99.5 1.7E-12 1E-15 85.1 42.6 501 1-532 8-693 (714) 39 protein:vir:105429 Length: 708 99.5 9.7E-14 6E-17 91.9 29.6 505 1-532 1-668 (708) 40 protein:vir:9263 Length: 725 # 99.5 3.2E-13 2E-16 89.1 29.1 503 1-532 1-662 (725) 41 protein:vir:77597 Length: 725 99.5 4.2E-12 2.6E-15 83.0 33.9 501 1-532 1-656 (725) 42 protein:vir:104437 Length: 714 99.5 1.1E-11 6.9E-15 80.6 39.9 501 1-532 1-693 (714) 43 protein:vir:100920 Length: 725 99.4 4.9E-12 3E-15 82.6 31.7 500 1-532 1-661 (725) 44 protein:vir:172 Length: 708 # 99.4 1E-11 6.5E-15 80.8 30.0 500 1-532 1-668 (708) 45 protein:vir:105520 Length: 706 99.3 4.6E-11 2.8E-14 77.3 29.6 503 1-532 1-664 (706) 46 protein:vir:105619 Length: 772 99.2 3.3E-10 2.1E-13 72.5 38.7 491 1-532 11-675 (772) 47 protein:vir:4223 Length: 486 # 99.2 4E-10 2.5E-13 72.1 38.1 448 1-532 8-479 (486) 48 protein:vir:3520 Length: 720 # 99.1 3.6E-10 2.3E-13 72.3 25.8 501 1-532 1-677 (720) 49 protein:vir:7768 Length: 484 # 99.1 2.2E-09 1.4E-12 68.1 38.3 453 1-532 1-478 (484) 50 protein:vir:104082 Length: 485 99.0 4.2E-09 2.6E-12 66.5 39.2 451 1-532 1-478 (485) 51 protein:vir:38 Length: 496 # N 99.0 5.1E-09 3.2E-12 66.1 29.0 437 1-531 18-496 (496) 52 protein:vir:2341 Length: 488 # 99.0 5.9E-09 3.7E-12 65.7 40.2 456 1-532 1-483 (488) 53 protein:vir:2427 Length: 485 # 99.0 9.8E-09 6.1E-12 64.5 38.2 447 1-532 5-479 (485) 54 protein:vir:4898 Length: 502 # 98.9 1.3E-08 8.3E-12 63.8 38.3 452 1-532 31-502 (502) 55 protein:vir:78227 Length: 480 98.9 2.1E-08 1.3E-11 62.7 37.1 437 1-532 1-466 (480) 56 protein:vir:3964 Length: 453 # 98.8 3.8E-08 2.4E-11 61.2 39.4 431 1-532 9-452 (453) 57 protein:vir:96494 Length: 501 98.8 6.4E-08 4E-11 60.0 39.8 445 1-532 30-495 (501) 58 protein:vir:99781 Length: 511 98.7 7.1E-08 4.4E-11 59.8 40.2 445 1-532 31-504 (511) 59 protein:vir:93747 Length: 472 98.7 7.5E-08 4.6E-11 59.7 36.7 439 1-532 11-471 (472) 60 protein:vir:2732 Length: 501 # 98.7 8E-08 4.9E-11 59.5 39.7 448 1-532 30-495 (501) 61 protein:vir:80680 Length: 441 98.7 9.2E-08 5.7E-11 59.2 39.3 421 1-532 1-440 (441) 62 protein:vir:9306 Length: 511 # 98.7 9.2E-08 5.7E-11 59.2 41.0 449 1-532 31-509 (511) 63 protein:vir:80959 Length: 499 98.7 9.2E-08 5.7E-11 59.1 30.8 440 1-531 18-499 (499) 64 protein:vir:96366 Length: 511 98.7 9.8E-08 6.1E-11 59.0 40.3 448 1-532 31-509 (511) 65 protein:vir:78805 Length: 511 98.7 9.8E-08 6.1E-11 59.0 40.3 448 1-532 31-509 (511) 66 protein:vir:7430 Length: 563 # 98.7 1E-07 6.3E-11 58.9 28.5 481 1-532 1-534 (563) 67 protein:vir:1587 Length: 508 # 98.7 1.1E-07 6.9E-11 58.7 33.9 442 1-531 20-508 (508) 68 protein:vir:106639 Length: 481 98.7 1.2E-07 7.7E-11 58.5 39.5 436 1-530 22-481 (481) 69 protein:vir:96240 Length: 511 98.7 1.3E-07 8.2E-11 58.3 41.5 448 1-532 31-509 (511) 70 protein:vir:97171 Length: 512 98.7 1.4E-07 8.8E-11 58.1 40.4 445 1-532 31-510 (512) 71 protein:vir:99072 Length: 479 98.6 1.8E-07 1.1E-10 57.6 39.1 444 1-532 1-473 (479) 72 protein:vir:9922 Length: 489 # 98.6 2.1E-07 1.3E-10 57.2 38.0 449 1-530 1-489 (489) 73 protein:vir:9871 Length: 429 # 98.6 2.1E-07 1.3E-10 57.2 38.3 422 9-531 1-429 (429) 74 protein:vir:99916 Length: 504 98.6 2.1E-07 1.3E-10 57.2 34.7 450 1-532 1-502 (504) 75 protein:vir:102950 Length: 471 98.6 2.2E-07 1.4E-10 57.0 31.7 426 9-532 1-467 (471) 76 protein:vir:78537 Length: 480 98.6 2.3E-07 1.4E-10 57.0 36.9 437 1-532 1-465 (480) 77 protein:vir:733 Length: 453 # 98.6 2.4E-07 1.5E-10 56.9 39.1 430 1-522 11-453 (453) 78 protein:vir:79043 Length: 479 98.6 2.5E-07 1.5E-10 56.8 36.5 442 1-532 1-478 (479) 79 protein:vir:3609 Length: 452 # 98.6 2.7E-07 1.7E-10 56.6 39.3 431 1-532 9-451 (452) 80 protein:vir:105889 Length: 474 98.5 3.2E-07 2E-10 56.2 37.8 440 1-532 7-470 (474) 81 protein:vir:94101 Length: 474 98.5 3.2E-07 2E-10 56.2 37.8 440 1-532 7-470 (474) 82 protein:vir:94805 Length: 492 98.5 3.2E-07 2E-10 56.2 37.1 436 1-532 35-491 (492) 83 protein:vir:98883 Length: 517 98.5 3.4E-07 2.1E-10 56.0 36.4 444 1-532 20-515 (517) 84 protein:vir:96266 Length: 474 98.5 3.6E-07 2.3E-10 55.9 37.2 433 1-532 18-472 (474) 85 protein:vir:95899 Length: 474 98.5 3.6E-07 2.3E-10 55.9 37.2 433 1-532 18-472 (474) 86 protein:vir:95113 Length: 474 98.5 3.9E-07 2.4E-10 55.7 34.3 437 1-532 1-471 (474) 87 protein:vir:5961 Length: 503 # 98.5 5.9E-07 3.7E-10 54.7 37.5 448 1-532 1-493 (503) 88 protein:vir:103951 Length: 511 98.4 6.3E-07 3.9E-10 54.6 41.7 448 1-532 31-509 (511) 89 protein:vir:80453 Length: 535 98.4 7.2E-07 4.4E-10 54.3 29.7 459 1-529 32-535 (535) 90 protein:vir:94546 Length: 506 98.4 7.6E-07 4.7E-10 54.1 39.7 434 1-532 16-496 (506) 91 protein:vir:97336 Length: 492 98.4 8.7E-07 5.4E-10 53.8 37.2 436 1-532 35-491 (492) 92 protein:vir:101494 Length: 527 98.4 9.2E-07 5.7E-10 53.7 27.3 466 1-532 1-516 (527) 93 protein:vir:1236 Length: 483 # 98.4 9.2E-07 5.7E-10 53.7 37.9 436 1-532 1-482 (483) 94 protein:vir:102239 Length: 527 98.4 9.4E-07 5.8E-10 53.6 27.2 466 1-532 1-516 (527) 95 protein:vir:9815 Length: 500 # 98.4 1.1E-06 6.7E-10 53.3 34.5 429 1-532 19-500 (500) 96 protein:vir:3028 Length: 500 # 98.4 1.1E-06 6.7E-10 53.3 34.5 429 1-532 19-500 (500) 97 protein:vir:4782 Length: 522 # 98.4 1.1E-06 6.8E-10 53.3 34.5 442 1-532 14-520 (522) 98 protein:vir:105461 Length: 470 98.3 1.3E-06 7.8E-10 52.9 37.5 430 9-532 1-469 (470) 99 protein:vir:345 Length: 663 # 98.2 2.2E-06 1.4E-09 51.6 31.5 488 1-532 1-635 (663) 100 protein:vir:2500 Length: 501 # 98.2 2.5E-06 1.5E-09 51.3 36.3 452 1-532 16-498 (501) 101 protein:vir:8184 Length: 474 # 98.2 2.7E-06 1.7E-09 51.1 34.1 431 1-526 1-474 (474) 102 protein:vir:94498 Length: 474 98.2 2.8E-06 1.7E-09 51.1 34.9 436 1-532 13-471 (474) 103 protein:vir:97447 Length: 474 98.2 2.8E-06 1.7E-09 51.1 34.9 436 1-532 13-471 (474) 104 protein:vir:98444 Length: 434 98.2 2.8E-06 1.7E-09 51.0 30.6 405 39-532 1-433 (434) 105 protein:vir:95806 Length: 440 98.2 3.3E-06 2E-09 50.7 35.4 406 17-531 1-440 (440) 106 protein:vir:79703 Length: 505 98.2 3.3E-06 2E-09 50.7 35.7 429 1-520 20-505 (505) 107 protein:vir:99522 Length: 470 98.1 4.7E-06 2.9E-09 49.8 40.7 435 1-532 19-469 (470) 108 protein:vir:94742 Length: 409 98.1 5.7E-06 3.5E-09 49.3 34.5 379 9-483 1-409 (409) 109 protein:vir:9751 Length: 422 # 98.0 8E-06 5E-09 48.5 35.0 392 9-503 1-422 (422) 110 protein:vir:102330 Length: 451 97.9 1.4E-05 8.9E-09 47.1 38.5 427 9-516 1-451 (451) 111 protein:vir:105292 Length: 478 97.8 1.6E-05 9.7E-09 46.9 37.7 434 1-532 1-478 (478) 112 protein:vir:9568 Length: 410 # 97.8 1.8E-05 1.1E-08 46.6 35.3 381 25-504 1-410 (410) 113 protein:vir:95149 Length: 501 97.8 2.1E-05 1.3E-08 46.3 30.4 449 1-531 1-501 (501) 114 protein:vir:1634 Length: 409 # 97.7 2.4E-05 1.5E-08 45.9 33.9 380 9-483 1-409 (409) 115 protein:vir:7987 Length: 456 # 97.7 3.1E-05 1.9E-08 45.3 36.8 431 1-521 1-456 (456) 116 protein:vir:107112 Length: 478 97.6 3.4E-05 2.1E-08 45.1 39.4 435 1-531 1-478 (478) 117 protein:vir:79538 Length: 502 97.5 4.7E-05 2.9E-08 44.3 27.9 446 1-532 1-502 (502) 118 protein:vir:106571 Length: 499 97.2 0.00015 9.2E-08 41.6 38.9 448 1-532 1-483 (499) 119 protein:vir:105819 Length: 456 97.1 0.00018 1.1E-07 41.1 36.1 433 1-521 1-456 (456) 120 protein:vir:102602 Length: 456 97.1 0.00018 1.1E-07 41.1 36.1 433 1-521 1-456 (456) 121 protein:vir:78907 Length: 518 97.0 0.00022 1.4E-07 40.6 36.8 443 1-528 1-518 (518) 122 protein:vir:96738 Length: 505 96.8 0.00034 2.1E-07 39.6 24.5 455 1-532 1-504 (505) 123 protein:vir:78393 Length: 489 96.3 0.00077 4.8E-07 37.7 28.7 429 1-532 3-482 (489) 124 protein:vir:96179 Length: 468 96.0 0.0012 7.3E-07 36.7 40.1 428 1-531 1-468 (468) 125 protein:vir:96839 Length: 474 96.0 0.0012 7.4E-07 36.6 36.0 426 1-532 1-468 (474) 126 protein:vir:94956 Length: 452 95.8 0.0015 9.4E-07 36.0 27.2 433 1-521 1-452 (452) 127 protein:vir:78083 Length: 537 95.5 0.0019 1.2E-06 35.5 39.2 460 1-532 1-511 (537) 128 protein:vir:80040 Length: 461 95.4 0.0021 1.3E-06 35.3 22.0 430 8-512 1-461 (461) 129 protein:vir:1266 Length: 416 # 95.2 0.0025 1.6E-06 34.8 20.2 365 16-483 1-416 (416) 130 protein:vir:95014 Length: 491 95.1 0.0028 1.7E-06 34.6 34.1 438 1-527 3-491 (491) 131 protein:vir:389 Length: 530 # 94.9 0.0032 2E-06 34.3 28.3 452 1-532 1-525 (530) 132 protein:vir:101647 Length: 460 94.5 0.0042 2.6E-06 33.6 20.2 404 1-526 1-460 (460) 133 protein:vir:97265 Length: 513 94.0 0.0056 3.5E-06 32.9 34.2 451 1-532 1-490 (513) 134 protein:vir:1023 Length: 392 # 93.8 0.0062 3.8E-06 32.7 26.9 329 38-461 1-392 (392) 135 protein:vir:3989 Length: 392 # 93.8 0.0062 3.8E-06 32.7 26.9 329 38-461 1-392 (392) 136 protein:vir:3843 Length: 397 # 93.7 0.0067 4.2E-06 32.5 22.5 368 44-531 1-397 (397) 137 protein:vir:7407 Length: 392 # 93.6 0.0069 4.3E-06 32.4 25.4 320 74-461 1-392 (392) 138 protein:vir:4854 Length: 386 # 93.6 0.0071 4.4E-06 32.4 19.5 369 1-518 1-386 (386) 139 protein:vir:4995 Length: 384 # 93.4 0.0075 4.7E-06 32.2 24.0 349 1-464 1-384 (384) 140 protein:vir:10321 Length: 495 93.3 0.0081 5E-06 32.0 27.7 447 1-532 1-495 (495) 141 protein:vir:95542 Length: 548 92.8 0.01 6.2E-06 31.5 27.2 477 1-532 1-526 (548) 142 protein:vir:78161 Length: 355 92.3 0.012 7.5E-06 31.1 14.6 303 180-532 1-339 (355) 143 protein:vir:3420 Length: 533 # 91.9 0.014 8.6E-06 30.8 30.3 448 1-532 1-529 (533) 144 protein:vir:81152 Length: 411 91.7 0.014 9E-06 30.7 22.7 370 1-481 1-411 (411) 145 protein:vir:107742 Length: 537 91.1 0.018 1.1E-05 30.2 17.1 443 1-532 47-536 (537) 146 protein:vir:4828 Length: 382 # 90.7 0.019 1.2E-05 30.0 23.2 351 1-468 1-382 (382) 147 protein:vir:100150 Length: 437 90.6 0.02 1.2E-05 29.9 20.8 383 1-491 1-437 (437) 148 protein:vir:80644 Length: 551 88.6 0.031 1.9E-05 28.8 23.3 433 1-532 30-524 (551) 149 protein:vir:5249 Length: 437 # 88.1 0.035 2.2E-05 28.6 16.1 400 29-532 1-430 (437) 150 protein:vir:96783 Length: 488 88.0 0.035 2.2E-05 28.6 35.0 415 1-481 14-488 (488) 151 protein:vir:93610 Length: 454 87.8 0.036 2.3E-05 28.5 21.0 398 1-500 1-454 (454) 152 protein:vir:81072 Length: 432 87.3 0.04 2.5E-05 28.3 18.9 381 1-486 1-432 (432) 153 protein:vir:78641 Length: 278 86.6 0.044 2.7E-05 28.0 15.8 259 80-437 1-278 (278) 154 protein:vir:4952 Length: 386 # 85.8 0.05 3.1E-05 27.7 24.9 343 1-465 1-386 (386) 155 protein:vir:1326 Length: 457 # 84.9 0.057 3.5E-05 27.4 17.4 414 1-532 1-445 (457) 156 protein:vir:189 Length: 424 # 83.3 0.07 4.3E-05 26.9 21.3 379 1-479 1-424 (424) 157 protein:vir:3153 Length: 467 # 83.0 0.072 4.5E-05 26.8 22.9 384 55-493 1-467 (467) 158 protein:vir:6240 Length: 457 # 82.7 0.074 4.6E-05 26.8 18.4 405 1-532 1-448 (457) 159 protein:vir:4698 Length: 251 # 82.5 0.076 4.7E-05 26.7 13.4 240 1-348 1-251 (251) 160 protein:vir:102080 Length: 429 79.9 0.1 6.2E-05 26.1 22.6 371 1-480 1-429 (429) 161 protein:vir:6382 Length: 553 # 78.9 0.11 6.8E-05 25.8 27.3 457 1-532 1-550 (553) 162 protein:vir:96068 Length: 765 76.8 0.13 8.2E-05 25.4 18.7 448 1-532 37-563 (765) 163 protein:vir:97060 Length: 432 76.4 0.14 8.4E-05 25.3 21.2 382 9-486 1-432 (432) 164 protein:vir:1884 Length: 424 # 75.5 0.15 9E-05 25.2 19.3 373 1-479 1-424 (424) 165 protein:vir:1785 Length: 555 # 75.4 0.15 9.1E-05 25.1 28.1 460 14-532 1-530 (555) 166 protein:vir:8418 Length: 409 # 75.2 0.15 9.3E-05 25.1 18.4 367 1-483 1-409 (409) 167 protein:vir:98853 Length: 219 75.0 0.15 9.4E-05 25.1 15.2 197 221-435 1-219 (219) 168 protein:vir:10362 Length: 432 73.4 0.17 0.00011 24.8 22.3 381 9-486 1-432 (432) 169 protein:vir:108215 Length: 469 72.7 0.18 0.00011 24.7 20.7 448 1-532 1-468 (469) 170 protein:vir:81095 Length: 416 71.8 0.19 0.00012 24.5 21.0 364 1-483 1-416 (416) 171 protein:vir:4598 Length: 416 # 71.8 0.19 0.00012 24.5 21.0 364 1-483 1-416 (416) 172 protein:vir:4194 Length: 540 # 67.8 0.25 0.00015 23.9 21.7 397 43-532 1-467 (540) 173 protein:vir:101648 Length: 518 67.0 0.26 0.00016 23.8 17.2 378 38-532 1-441 (518) 174 protein:vir:63755 Length: 547 60.4 0.37 0.00023 22.9 24.9 432 1-532 31-520 (547) 175 protein:vir:99312 Length: 563 59.7 0.39 0.00024 22.8 27.8 415 1-532 42-516 (563) 176 protein:vir:95599 Length: 563 59.7 0.39 0.00024 22.8 27.8 415 1-532 42-516 (563) 177 protein:vir:7853 Length: 518 # 55.9 0.47 0.00029 22.4 17.0 388 38-532 1-441 (518) 178 protein:vir:5737 Length: 419 # 55.6 0.48 0.0003 22.3 18.5 364 1-487 1-419 (419) 179 protein:vir:4337 Length: 434 # 53.7 0.52 0.00032 22.1 22.1 385 1-490 1-434 (434) 180 protein:vir:102727 Length: 945 51.8 0.57 0.00035 21.9 17.6 429 1-532 34-571 (945) 181 protein:vir:483 Length: 413 # 51.2 0.59 0.00036 21.9 21.6 380 16-532 1-404 (413) 182 protein:vir:100187 Length: 385 50.7 0.6 0.00037 21.8 20.1 336 44-483 1-385 (385) 183 protein:vir:94426 Length: 409 50.1 0.62 0.00038 21.7 19.6 368 1-484 1-409 (409) 184 protein:vir:9359 Length: 348 # 48.5 0.67 0.00041 21.5 17.6 307 80-484 1-348 (348) 185 protein:vir:9408 Length: 441 # 47.9 0.68 0.00042 21.5 21.1 378 1-483 1-441 (441) 186 protein:vir:79984 Length: 441 47.9 0.68 0.00042 21.5 21.1 378 1-483 1-441 (441) 187 protein:vir:1431 Length: 419 # 44.8 0.79 0.00049 21.1 22.9 359 16-478 1-419 (419) 188 protein:vir:102118 Length: 409 43.6 0.84 0.00052 21.0 24.1 347 42-481 1-409 (409) 189 protein:vir:98396 Length: 441 42.9 0.87 0.00054 20.9 22.9 374 1-483 1-441 (441) 190 protein:vir:99853 Length: 488 42.8 0.87 0.00054 20.9 23.7 415 4-532 1-457 (488) 191 protein:vir:96579 Length: 576 42.5 0.88 0.00055 20.9 23.4 433 1-532 18-520 (576) 192 protein:vir:3743 Length: 345 # 41.9 0.91 0.00056 20.8 21.1 315 35-439 1-345 (345) 193 protein:vir:2683 Length: 412 # 35.4 1.2 0.00076 20.1 22.6 368 1-484 1-412 (412) 194 protein:vir:78749 Length: 337 34.5 1.3 0.0008 20.0 22.0 310 1-434 1-337 (337) 195 protein:vir:3868 Length: 417 # 33.7 1.3 0.00083 19.9 17.7 390 15-532 1-416 (417) 196 protein:vir:96980 Length: 409 31.0 1.5 0.00095 19.6 22.6 358 21-483 1-409 (409) 197 protein:vir:99452 Length: 651 26.3 2 0.0012 19.0 18.9 480 1-532 1-573 (651) 198 protein:vir:1380 Length: 422 # 25.8 2 0.0013 18.9 26.8 351 40-482 1-422 (422) 199 protein:vir:80333 Length: 419 22.7 2.4 0.0015 18.5 24.9 359 9-478 1-419 (419) 200 protein:vir:80796 Length: 574 21.6 2.6 0.0016 18.3 23.8 427 1-532 1-547 (574) 201 protein:vir:100328 Length: 346 21.5 2.6 0.0016 18.3 24.2 295 1-442 26-346 (346) 202 protein:vir:107851 Length: 175 20.2 2.8 0.0017 18.1 7.8 116 1-126 1-175 (175) 203 protein:vir:99563 Length: 862 20.1 2.8 0.0018 18.1 16.8 453 1-532 66-590 (862) 204 protein:vir:9702 Length: 406 # 20.1 2.8 0.0018 18.1 21.4 358 1-482 1-406 (406) No 1 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=100.00 E-value=1.3e-186 Score=1039.68 Aligned_cols=532 Identities=100% Similarity=1.384 Sum_probs=523.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFP 80 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltp 80 (532) ||+++++++++++|++||+.||++|++|+++|+||++||+|+++++++++++++..++|||||++|+++|||||||+||| T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltp 80 (532) T protein:vir:99 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFP 80 (532) T ss_pred CcchhhccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCcchhhccccccchHHHHHHHHHHHHHHhhcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcc Q lcl|NC_015159. 81 VGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSN 160 (532) Q Consensus 81 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~ 160 (532) |++|||||.++|+++++....+.+.++|++||++||++|+++|++||||.++|++|+||++|||||||+++++.+.++++ T Consensus 81 p~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~ 160 (532) T protein:vir:99 81 VGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSN 160 (532) T ss_pred CCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEecccccccCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999888888999 Q ss_pred eEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcc Q lcl|NC_015159. 161 APKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEI 240 (532) Q Consensus 161 ~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~ 240 (532) +|++|||++|||++|++|+||+||||++|++++|++++++.+.++.++++|+++|+|||+|+|++++++|++||+++|+. T Consensus 161 ~f~~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~~~~~~~~g~~ 240 (532) T protein:vir:99 161 APKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEI 240 (532) T ss_pred ceEEEEcCeEEEeeCCCCCeeeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEEEEecCCCCeeEEEEeecCce Confidence 99999999999999999999999999999999999999999998888899999999999999999999999999999998 Q ss_pred cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCC Q lcl|NC_015159. 241 VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANT 320 (532) Q Consensus 241 ~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~ 320 (532) +.+.+++|+|++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||+|+|+|++++.++.++++ T Consensus 241 ~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~ 320 (532) T protein:vir:99 241 VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANT 320 (532) T ss_pred ecccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCceeccccccchhhhccCCC Confidence 88899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_015159. 321 GDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQEL 400 (532) Q Consensus 321 G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~ 400 (532) |++++|.++++++++++++++|+++++.|++++++|+++||++++.++++++||||||++|++|++++|||||+||++|| T Consensus 321 g~~v~g~~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~ 400 (532) T protein:vir:99 321 GDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQEL 400 (532) T ss_pred cceecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCC Q lcl|NC_015159. 401 QLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMD 480 (532) Q Consensus 401 l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~ 480 (532) |.|||+|+|++|+|+|+||++|++++++.+++||++|+|+|+++++++|++.|+|+.|+++|+||+|+++++||+++||| T Consensus 401 l~Pli~r~~~il~r~g~lP~~p~~~~~~~iv~~is~Laraq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~GV~ 480 (532) T protein:vir:99 401 QLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMD 480 (532) T ss_pred HHHHHHHHHHHHHhcCCCCCCChhhcccceeecchHHHHHHHHHHHHHHHHHHHhhcchhhhhCCHHHHHHHHHHHhCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 481 TTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 481 p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) |+.|+||+||++++++|+|++++++++++++++++++++++++++|.|++|| T Consensus 481 ~~~i~r~~ee~~~~~~q~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (532) T protein:vir:99 481 TTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGMPTQ 532 (532) T ss_pred hhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhHHhhcCCCCC Confidence 9999999999999999999999999999999999999999999999999999 No 2 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=100.00 E-value=2.5e-176 Score=983.27 Aligned_cols=527 Identities=60% Similarity=0.960 Sum_probs=494.9 Q ss_pred CCCC-CCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhc Q lcl|NC_015159. 1 MAEV-EKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALF 79 (532) Q Consensus 1 m~~~-~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 79 (532) ||-. ++++++++++++||++||++|++|+++|+||++||+|++++++++.++++..++|||||++|+++|||||||+|| T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 80 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGLNNLASKLMLALF 80 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCccccccCCcccccHHHHHHHHHHHHHhhhc Confidence 8766 899999999999999999999999999999999999999999999888889999999999999999999999999 Q ss_pred CCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCc Q lcl|NC_015159. 80 PVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQS 159 (532) Q Consensus 80 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~ 159 (532) |+ +|||||+++|..+++....+.+.+++++||++||++|+.+|++||||.++|++|+||++|||||+|++++ .+++ T Consensus 81 P~-~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~---~~~~ 156 (535) T protein:vir:94 81 PM-QTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPEP---EGTY 156 (535) T ss_pred CC-CCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeccC---cCcc Confidence 76 7999999999999998888899999999999999999999999999999999999999999999999875 3567 Q ss_pred ceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCc Q lcl|NC_015159. 160 NAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGE 239 (532) Q Consensus 160 ~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~ 239 (532) .+|++|||++|||++|++|+||+||||++|++++|++++++.+.++. +++++++|+|||+|+|++++|+|.+||+++|. T Consensus 157 ~~f~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~-~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~ 235 (535) T protein:vir:94 157 NPMKLYRLSSYVVQRDAFGTVLQIVTLDKTAYAALPEDVRNSMDSSQ-EHKGDEMIDVYTHIYLDEESGEYLKYEEIDGV 235 (535) T ss_pred cceEEEEcCeEEEeeCCCCCeEEEEeeeeccHHHhhHHHHHHHHhcc-ccCCCceeEEEEEEEeeCCCCcEEEEEEecCe Confidence 79999999999999999999999999999999999999999886544 56889999999999999999999999999999 Q ss_pred ccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCC Q lcl|NC_015159. 240 IVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKAN 319 (532) Q Consensus 240 ~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~ 319 (532) .+++.+++|||++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||+|+|+|+++++++.+++ T Consensus 236 ~~~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~ 315 (535) T protein:vir:94 236 EVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQVRRLTKAQ 315 (535) T ss_pred eeccccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccccchhhcccCC Confidence 98888999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHH Q lcl|NC_015159. 320 TGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQE 399 (532) Q Consensus 320 ~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E 399 (532) +|++++|.++++++++++++++|+.+.+.|++++++|+++||++++.++++++||||||++|++|++++|||||+||+.| T Consensus 316 ~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E 395 (535) T protein:vir:94 316 TGDFVSGRPEDISFLQLEKAADFSVARAVSEQIEGRLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQE 395 (535) T ss_pred CceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhHhhhccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhh-hcCHHHHHHHHHHhcC Q lcl|NC_015159. 400 LQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDD-DINLLDVKMRLANSLG 478 (532) Q Consensus 400 ~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d-~id~d~~~~~~a~~~G 478 (532) ||.|||+|+|++|+|+|+||++|+++++++|+++|++|+|+++++++++|++.++++.|+++| +||+|+++++|++++| T Consensus 396 lL~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~la~l~r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~G 475 (535) T protein:vir:94 396 LQLPMVRVLLKQLQATNQIPELPKEAVEPTISTGMEALGRGQDLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAIG 475 (535) T ss_pred HHHHHHHHHHHHHHhCCCCCCCChhhccceEeehHHHHHHHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhC Confidence 999999999999999999999999999999999999999999999999999999999999999 5999999999999999 Q ss_pred CCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHH------HhhcccccCCCCC Q lcl|NC_015159. 479 MDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAA------AAMMQQQAGLPTQ 532 (532) Q Consensus 479 v~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~g~~~~ 532 (532) ||++.|+||+||++++++|+++++++++++++++++.+..+ .....++.||.+- T Consensus 476 vp~~~i~rs~eev~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:94 476 IDTSGILKTPEEKQQEMAEAAQGTAMQNAAASAGAGAGTMATASPENMKAAAAQAGMAPN 535 (535) T ss_pred CChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccChHHHHHHHHHhccCCC Confidence 98889999999999999888887776666555555443211 1123356666666 No 3 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=100.00 E-value=2.6e-175 Score=977.68 Aligned_cols=528 Identities=59% Similarity=0.954 Sum_probs=491.0 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFP 80 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltp 80 (532) ||+ +|++.++++|++||++||++|++|+++|+||++||+|+++++++++++++..++|||||++|+++|||||||+||| T Consensus 1 m~~-~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 79 (536) T protein:vir:10 1 MAE-KRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALFP 79 (536) T ss_pred Ccc-hhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhhcC Confidence 999 8889999999999999999999999999999999999999999999888899999999999999999999999997 Q ss_pred CCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcc Q lcl|NC_015159. 81 VGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSN 160 (532) Q Consensus 81 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~ 160 (532) + +|||||.+.|+++++....+...+++++||+.||++++.+|++||||.++|++|+||++|||||+|++++. .++.. T Consensus 80 ~-~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~--~~~~~ 156 (536) T protein:vir:10 80 M-QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPE--GSNYN 156 (536) T ss_pred C-CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCC--CCcee Confidence 5 79999999999999988888889999999999999999999999999999999999999999999998764 34566 Q ss_pred eEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcc Q lcl|NC_015159. 161 APKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEI 240 (532) Q Consensus 161 ~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~ 240 (532) +|++|||++|||++|++|+||+||||+++|+++|+++|+........+++|+++|+|||+|+|++++++|.+|++++|+. T Consensus 157 ~~~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~~~e~~g~~ 236 (536) T protein:vir:10 157 PMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEASGEYLRYEEVEGME 236 (536) T ss_pred eEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecCCCcEEEEEeecCcc Confidence 79999999999999999999999999999999999999999887778889999999999999999999999999999999 Q ss_pred cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCC Q lcl|NC_015159. 241 VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANT 320 (532) Q Consensus 241 ~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~ 320 (532) +.+.+++|||++|||+++||++.+|++|||||++++|||+|+||.|+++++++++++++|||+|+|+|+++++++.++++ T Consensus 237 v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~ 316 (536) T protein:vir:10 237 VQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQT 316 (536) T ss_pred ccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCCC Confidence 98899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_015159. 321 GDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQEL 400 (532) Q Consensus 321 G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~ 400 (532) |++++|.++++++++++++++|+++++.|++++++|+++||++++.++++++||||||++|++|++++|||||+||+.|| T Consensus 317 g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~El 396 (536) T protein:vir:10 317 GDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQEL 396 (536) T ss_pred cceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhh-hcCHHHHHHHHHHhcCC Q lcl|NC_015159. 401 QLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDD-DINLLDVKMRLANSLGM 479 (532) Q Consensus 401 l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d-~id~d~~~~~~a~~~Gv 479 (532) |.|||+|+|++|+++|+||++|+++++++|+++|++|+|+++++++++|++.++++.|+++| .||+|+++++||+++|| T Consensus 397 l~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv 476 (536) T protein:vir:10 397 QLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGI 476 (536) T ss_pred HHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCC Confidence 99999999999999999999999999999999999999999999999999999999999998 49999999999999999 Q ss_pred CHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHH-HHHH-----hhcccccCCCCC Q lcl|NC_015159. 480 DTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGG-QAAA-----AMMQQQAGLPTQ 532 (532) Q Consensus 480 ~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~-~~~~-----~~~~~~~g~~~~ 532 (532) +|..++||+||++++|+|+++++++++++.+++...+ ++.. ..+.+++|+-+- T Consensus 477 ~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 535 (536) T protein:vir:10 477 DTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPG 535 (536) T ss_pred CchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhHHhhhhccccCCC Confidence 9999999999999999888776665554444333221 1111 111122232222 No 4 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=100.00 E-value=2.8e-175 Score=977.51 Aligned_cols=528 Identities=59% Similarity=0.955 Sum_probs=490.9 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFP 80 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltp 80 (532) ||+ +|++.++++|++||++||++|++|+++|+||++||+|+++++++++++++..++|||||++|+++|||||||+||| T Consensus 1 m~~-~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 79 (536) T protein:vir:21 1 MAE-KRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALFP 79 (536) T ss_pred Ccc-hhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcC Confidence 999 8889999999999999999999999999999999999999999999988899999999999999999999999997 Q ss_pred CCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcc Q lcl|NC_015159. 81 VGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSN 160 (532) Q Consensus 81 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~ 160 (532) + +|||||.+.|+++++....+...+++++||+.||++++.+|++||||.++|++|+||++|||||+|++++. .++.. T Consensus 80 ~-~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~--~~~~~ 156 (536) T protein:vir:21 80 M-QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPE--GSNYN 156 (536) T ss_pred C-CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCC--CCcee Confidence 6 79999999999999988888889999999999999999999999999999999999999999999998764 34566 Q ss_pred eEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcc Q lcl|NC_015159. 161 APKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEI 240 (532) Q Consensus 161 ~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~ 240 (532) +|++|||++|||++|++|+||+||||+++|+++|+++|+..+.....+++|+++|+|||+|+|++++++|.+|++++|.. T Consensus 157 ~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~ 236 (536) T protein:vir:21 157 PMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRYEEVEGME 236 (536) T ss_pred eEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecCCCcEEEEeccCCee Confidence 79999999999999999999999999999999999999999887777889999999999999999999999999999999 Q ss_pred cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCC Q lcl|NC_015159. 241 VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANT 320 (532) Q Consensus 241 ~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~ 320 (532) +.+.+++|+|++|||+++||++.+|++|||||++++|||+|+||.|+++++++++++++|||+|+|+|+++++++.++++ T Consensus 237 v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~ 316 (536) T protein:vir:21 237 VQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQT 316 (536) T ss_pred eccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCCC Confidence 88889999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_015159. 321 GDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQEL 400 (532) Q Consensus 321 G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~ 400 (532) |++++|.++++++++++++++|+++++.|++++++|+++||++++.++++++||||||++|++|++++|||||+||+.|| T Consensus 317 g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~El 396 (536) T protein:vir:21 317 GDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQEL 396 (536) T ss_pred cceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhh-hcCHHHHHHHHHHhcCC Q lcl|NC_015159. 401 QLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDD-DINLLDVKMRLANSLGM 479 (532) Q Consensus 401 l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d-~id~d~~~~~~a~~~Gv 479 (532) |.|||+|+|++|+++|+||++|+++++++|+++|++|+|+++++++++|++.++++.|+++| .||+|++++++|+++|| T Consensus 397 l~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv 476 (536) T protein:vir:21 397 QLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGI 476 (536) T ss_pred HHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCC Confidence 99999999999999999999999999999999999999999999999999999999999998 59999999999999999 Q ss_pred CHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHH-HHHH-----hhcccccCCCCC Q lcl|NC_015159. 480 DTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGG-QAAA-----AMMQQQAGLPTQ 532 (532) Q Consensus 480 ~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~-~~~~-----~~~~~~~g~~~~ 532 (532) +|..++||+||++++|+|+++++++++++.+++...+ ++.. ..+.+++|+-+- T Consensus 477 ~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 535 (536) T protein:vir:21 477 DTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPG 535 (536) T ss_pred ChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhHHhhhhccccCCC Confidence 9999999999999999888776665554444333221 1111 111222232222 No 5 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=100.00 E-value=1.5e-174 Score=973.47 Aligned_cols=528 Identities=60% Similarity=0.945 Sum_probs=498.8 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFP 80 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltp 80 (532) ||+++++++++++|++||+.|+++|++|+++|+||++||+|++++++++.++++..++|||||++|+++|||||||+||| T Consensus 1 m~~~~~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 80 (535) T protein:vir:15 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALFP 80 (535) T ss_pred CCccchhccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcC Confidence 99999999999999999999999999999999999999999999999888888889999999999999999999999997 Q ss_pred CCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcc Q lcl|NC_015159. 81 VGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSN 160 (532) Q Consensus 81 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~ 160 (532) + +|||||+++|..+++...++.+.++++.||++||++|+.+|++||||.++|++|+||++|||||+|++++ .++++ T Consensus 81 ~-~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~---~~~~~ 156 (535) T protein:vir:15 81 M-QSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEP---EGSYN 156 (535) T ss_pred C-CcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecC---CCCce Confidence 6 7999999999999999888889999999999999999999999999999999999999999999999864 46788 Q ss_pred eEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcc Q lcl|NC_015159. 161 APKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEI 240 (532) Q Consensus 161 ~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~ 240 (532) +|++|||++|||++|++|+||+||||+++|+++|+++|++.+.+...+++++++|+|||+|++++++++|.+|++++|.. T Consensus 157 ~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~ 236 (535) T protein:vir:15 157 PMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVE 236 (535) T ss_pred eeEEEEcCeeEEeeCCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEecCCCcEEEEEEeeCcc Confidence 99999999999999999999999999999999999999999988888899999999999999999999999999999999 Q ss_pred cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCC Q lcl|NC_015159. 241 VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANT 320 (532) Q Consensus 241 ~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~ 320 (532) +++.+++|||++|||+++||++.+|++||||||+++|||+|+||.|+++++++++++++|||+|+|+|++++.++.++++ T Consensus 237 ~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~l~~~~~ 316 (535) T protein:vir:15 237 IDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQT 316 (535) T ss_pred ccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccchhcccCCc Confidence 98889999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_015159. 321 GDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQEL 400 (532) Q Consensus 321 G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~ 400 (532) |.+++|.++++++++++++++|+.+++.|++++++|+++||++++.++++++||||||++|++|++++|||||+||++|| T Consensus 317 g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~El 396 (535) T protein:vir:15 317 GDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQEL 396 (535) T ss_pred eeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhh-hcCHHHHHHHHHHhcCC Q lcl|NC_015159. 401 QLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDD-DINLLDVKMRLANSLGM 479 (532) Q Consensus 401 l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d-~id~d~~~~~~a~~~Gv 479 (532) |.|||+|+|++|+++|+||++|+++++++|+|+|++++|++++++++.|++.++++.|+++| +||+|+++++|++++|| T Consensus 397 l~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv 476 (535) T protein:vir:15 397 QLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGI 476 (535) T ss_pred HHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCC Confidence 99999999999999999999999999999999999999999999999999999999999998 59999999999999999 Q ss_pred CHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHH------HHHhhcccccCCCCC Q lcl|NC_015159. 480 DTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQ------AAAAMMQQQAGLPTQ 532 (532) Q Consensus 480 ~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~g~~~~ 532 (532) |++.|+||+||++++++|+++++++++++.++++..+. .....+.++.|+++- T Consensus 477 p~~~i~~~~eev~~~~~q~~~~~~~~~~a~~~g~~~~~~~~~~p~~~~~~~~~~g~~~~ 535 (535) T protein:vir:15 477 DTSGILLTDEQKQALMMQDAAQTGIENAAATGGAGVGALATSSPEAMQGAAAQAGLDAT 535 (535) T ss_pred ChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccchhccChHHHHHHHhccCCCCC Confidence 88889999999999998887776666555554443221 222344556666666 No 6 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=100.00 E-value=9.6e-174 Score=969.13 Aligned_cols=528 Identities=60% Similarity=0.943 Sum_probs=496.6 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFP 80 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltp 80 (532) ||+++++++++++|++||+.|+++|++|+++|+||++||+|++++++++.++++..++|||||++|+++|||||||+||| T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 80 (535) T protein:vir:33 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALFP 80 (535) T ss_pred CChhhhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcC Confidence 99999999999999999999999999999999999999999999999988888889999999999999999999999998 Q ss_pred CCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcc Q lcl|NC_015159. 81 VGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSN 160 (532) Q Consensus 81 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~ 160 (532) + +|||||+++|.++++.+..+...++++.||++||++|+.+|++||||.++|++|+||++|||||+|++++ .++++ T Consensus 81 ~-~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~---~~~~~ 156 (535) T protein:vir:33 81 M-QSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEP---EGSYN 156 (535) T ss_pred C-CcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecC---CCCce Confidence 6 7999999999999999988899999999999999999999999999999999999999999999999875 45788 Q ss_pred eEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcc Q lcl|NC_015159. 161 APKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEI 240 (532) Q Consensus 161 ~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~ 240 (532) +|++|||++|||++|++|+||+||||+++|+++|+++|+....+...++++++++++||||++++++++|.++++++|.. T Consensus 157 ~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~ 236 (535) T protein:vir:33 157 PMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVE 236 (535) T ss_pred eeEEEEcCeeEEeeCCCCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEEEeeCCCCcEEEEEEEeCcc Confidence 99999999999999999999999999999999999999999888777889999999999999999999999999999999 Q ss_pred cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCC Q lcl|NC_015159. 241 VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANT 320 (532) Q Consensus 241 ~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~ 320 (532) +++.++.|||++|||+++||++.+|++||||||+++|||+|+||.|+++++++++++++|||+|+|+|++++.++.++++ T Consensus 237 ~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~~~~~~~ 316 (535) T protein:vir:33 237 IDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQT 316 (535) T ss_pred ccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCc Confidence 98999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_015159. 321 GDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQEL 400 (532) Q Consensus 321 G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~ 400 (532) |.+++|.++++++++++++++|+.+++.|++++++|+++||++++.++++++||||||++|++|++++|||||+||++|| T Consensus 317 g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~El 396 (535) T protein:vir:33 317 GDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQEL 396 (535) T ss_pred eeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhh-hcCHHHHHHHHHHhcCC Q lcl|NC_015159. 401 QLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDD-DINLLDVKMRLANSLGM 479 (532) Q Consensus 401 l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d-~id~d~~~~~~a~~~Gv 479 (532) |.|||+|+|++|+++|+||++|+++++++|+++|++++|++++++++.|++.++++.|+++| +||+|++++++++++|| T Consensus 397 l~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gv 476 (535) T protein:vir:33 397 QLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGI 476 (535) T ss_pred HHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCC Confidence 99999999999999999999999999999999999999999999999999999999999999 59999999999999999 Q ss_pred CHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHH------HHHHhhcccccCCCCC Q lcl|NC_015159. 480 DTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGG------QAAAAMMQQQAGLPTQ 532 (532) Q Consensus 480 ~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~g~~~~ 532 (532) |++.|+||+||+++.++|++++++++++++++++..+ ..+...+.+..|+-+- T Consensus 477 p~~~i~~~~ee~~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:33 477 DTSGILLTDEQKQALMMQDAAQTGVENAAAAGGAGVGALATSSPEAMQGAAAKAGLNAT 535 (535) T ss_pred CHhHhcCCHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhhcCChhHHHHHHhccCCCC Confidence 8888999999999999888777666665555444322 2222233334444444 No 7 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=100.00 E-value=6.3e-172 Score=959.16 Aligned_cols=530 Identities=57% Similarity=0.903 Sum_probs=489.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFP 80 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltp 80 (532) ||++++++.++++|++||++|+++|++|+++|+||++||+|++++++++.++++..++|||||++|+++|||||||+||| T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 80 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSSTDYTTPWQAVGARGLNNLSAKVMLALFP 80 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Confidence 99999999999999999999999999999999999999999999999888888888999999999999999999999998 Q ss_pred CCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcc Q lcl|NC_015159. 81 VGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSN 160 (532) Q Consensus 81 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~ 160 (532) + +|||||+++|..+.+...++.+.++|+.||++||++|+.+|++||||.++|++|+||++|||||+|++++.....+.. T Consensus 81 ~-~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~~~~~ 159 (543) T protein:vir:88 81 L-QSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDASSNSYN 159 (543) T ss_pred C-CcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCccccceec Confidence 6 799999999999998888888999999999999999999999999999999999999999999999998765545556 Q ss_pred eEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcc Q lcl|NC_015159. 161 APKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEI 240 (532) Q Consensus 161 ~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~ 240 (532) .|+.|||++|+|++|++|+||+||||+++++++|+++|++.+++ ..+++|+++|+|||+|+|++++++|+++++++|.. T Consensus 160 ~~~~~pl~~y~v~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~-~~~~~p~~~~~v~~~V~pr~~~~~~~~~~~~~~~~ 238 (543) T protein:vir:88 160 PMKLYTLHNHVVQRDAFGNVLQIVTLDKVAYAALPEDVRNSLSG-GQEYKPEQELEVYTHIYIDDESGDFLSYQEIEGVE 238 (543) T ss_pred ceEEeEcceEEEeeCCCCCeeeeeeeeeccHHHHhHHhhHHHHH-HhhcCCccceEEEEEEEeecCCCcccccccccCee Confidence 68889999999999999999999999999999999999988764 44678999999999999999999999999999998 Q ss_pred cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCC Q lcl|NC_015159. 241 VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANT 320 (532) Q Consensus 241 ~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~ 320 (532) +.+.++.|+|++|||+++||++.+|++||||||+++|||+|+||.|+++++++++++++|||+|+|+|++++.++.++++ T Consensus 239 v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~~~~~~~ 318 (543) T protein:vir:88 239 VDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVRRLVKAQT 318 (543) T ss_pred eecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCC Confidence 88888999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_015159. 321 GDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQEL 400 (532) Q Consensus 321 G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~ 400 (532) |.+++|.++++.++++++++||+.+++.|++++++|+++||++++.++++++||||||++|++|++++|||||+||+.|| T Consensus 319 g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~ 398 (543) T protein:vir:88 319 GDFVAGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLNSAVQRSGERVTAEEIRYVASELEDTLGGVYSILSQEL 398 (543) T ss_pred ceeecCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCCCcccHHHHHHHHHHHHHHHhHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcc-hhhhhcCHHHHHHHHHHhcCC Q lcl|NC_015159. 401 QLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAG-LQDDDINLLDVKMRLANSLGM 479 (532) Q Consensus 401 l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p-~~~d~id~d~~~~~~a~~~Gv 479 (532) |.|||+|+|++|+++|+||++|+++++++|+++|++|+|+++++++.+|++.++++.| +++|+||+|+++++|++++|| T Consensus 399 l~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv 478 (543) T protein:vir:88 399 QLPIVRVLLNQLQATQQIPNLPQEAVEPTVTTGAEALGRGQDLDKLTQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGI 478 (543) T ss_pred HHHHHHHHHHHHHhcCCCCCCchhceeeeEEecHHHHHHHHHHHHHHHHHHHHHhccchhhhccCCHHHHHHHHHHHhCC Confidence 9999999999999999999999999999999999999999999999999999988865 689999999999999999999 Q ss_pred CHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHH------HHHHHHhhcccccCCCCC Q lcl|NC_015159. 480 DTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAA------GGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 480 ~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~g~~~~ 532 (532) ||+.|+||++|++++++|+++++++++++++.++. .+..+...|.+++|+.|. T Consensus 479 ~~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 537 (543) T protein:vir:88 479 DTAGLLLTEAEKAQAQSQEMLKQGGLNAAAGIGSGVAAQATASPEAMESAMDTAGVQPG 537 (543) T ss_pred ChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhccChHHHHHHhhhcCCCCC Confidence 99999999999999988776555444333322221 112222333344555444 No 8 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=100.00 E-value=9.4e-170 Score=947.25 Aligned_cols=521 Identities=55% Similarity=0.905 Sum_probs=482.8 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFP 80 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltp 80 (532) |++ +++.++++|++||+.||++|++|+++|+||++||+|++++++++.++++..++|||||++|+++|||||||+||| T Consensus 1 ~~~--~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltP 78 (522) T protein:vir:94 1 MAE--REGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQAVGARCLNNLAAKLMLALFP 78 (522) T ss_pred Ccc--cchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhcCC Confidence 998 889999999999999999999999999999999999999999988888888999999999999999999999996 Q ss_pred CCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcc Q lcl|NC_015159. 81 VGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSN 160 (532) Q Consensus 81 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~ 160 (532) ++|||||.+.|..+.+...+.....++++||++||++|+++|++||||.++|++|+||++|||||+|++++ .+++++ T Consensus 79 -~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~--~~~~~~ 155 (522) T protein:vir:94 79 -QSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEP--EQGTYS 155 (522) T ss_pred -CCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeecc--CCCcee Confidence 67999999999888877777778889999999999999999999999999999999999999999999865 346777 Q ss_pred eEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcc Q lcl|NC_015159. 161 APKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEI 240 (532) Q Consensus 161 ~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~ 240 (532) +|++|||++|||++|++|+||+||||++|++++|++++++.+.+ .+++|+++|+|||+|+|++++ |.+|++++|.. T Consensus 156 ~~~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~--~~~~p~~~v~v~~~v~~~~~~--~~~~~~~~g~~ 231 (522) T protein:vir:94 156 PMRMYRLVSYVVQRDAFGNILQIVTIDKVAFSALPEDVKSQLNA--DDYEPDTELEVYTHIYRQDDE--YLRYEEVEGIE 231 (522) T ss_pred eEEEEEcceEEEeeCCCcCeEEEeeeeeccHHhcchHHHHHHhc--ccCCccceEEEEEEEEeeCCc--eeEEeeccCce Confidence 89999999999999999999999999999999999999998754 345789999999999998764 88899999999 Q ss_pred cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCC Q lcl|NC_015159. 241 VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANT 320 (532) Q Consensus 241 ~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~ 320 (532) +.+.+++|||++|||+++||++.+|++||||||+++|||+|+||.|+++++++++++++|||+|+|+|+++++++.++++ T Consensus 232 ~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~~~~~~ 311 (522) T protein:vir:94 232 VTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQPRRLNKAAT 311 (522) T ss_pred ecccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccchheeccCC Confidence 98999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_015159. 321 GDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQEL 400 (532) Q Consensus 321 G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~ 400 (532) |.+++|+++++++++++++++|+++++.|++++++|+++||++++.++++++||||||++|++|++++|||||+||+.|| T Consensus 312 g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~ 391 (522) T protein:vir:94 312 GEFVAGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNSAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQEL 391 (522) T ss_pred ceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhh-hcCHHHHHHHHHHhcCC Q lcl|NC_015159. 401 QLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDD-DINLLDVKMRLANSLGM 479 (532) Q Consensus 401 l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d-~id~d~~~~~~a~~~Gv 479 (532) |.|||+|+|++|+++|+||++|+++++++|+|+|++++|+++++++.+|++.++++.|++++ +||+|+++++|++++|| T Consensus 392 l~Pli~r~~~il~r~g~lP~~p~~~v~v~~~s~La~~qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv 471 (522) T protein:vir:94 392 QLPIVRVLMNQLQSAGMIPDLPKEAVEPTVSTGLEALGRGQDLEKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGI 471 (522) T ss_pred HHHHHHHHHHHHHhcCCCCCCCcccEEeeEecHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCC Confidence 99999999999999999999999999999999999999999999999999999999999875 89999999999999999 Q ss_pred CHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCC Q lcl|NC_015159. 480 DTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLP 530 (532) Q Consensus 480 ~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 530 (532) ||+.|+||++|++++++|+++++++++++.++++..++..+.-..+..+.+ T Consensus 472 ~~~~ivr~~ee~~~~~~q~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 522 (522) T protein:vir:94 472 DTAGLLLTQDEKIQRMAEQSSQQAVVQGASAAGANMGAAVGQGAGEDMAQA 522 (522) T ss_pred ChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccchhhhcC Confidence 999999999999999998877766666555554444332222222222222 No 9 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=100.00 E-value=2.8e-166 Score=928.24 Aligned_cols=506 Identities=31% Similarity=0.457 Sum_probs=465.6 Q ss_pred CHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCCccccC Q lcl|NC_015159. 10 AADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLN 89 (532) Q Consensus 10 ~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~ 89 (532) =++++++||++|| |++||++|+||++||+|+++++++++++++..++|||||++|+++||||||++||||++|||||+ T Consensus 1 mk~~~~~~~~~lk--R~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) T protein:vir:63 1 MKTTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) T ss_pred ChhHHHHHHHHHh--ccchHHHHHHHHHhhccccCCCCCCccccccCCCccchHHHHHHHHHHHHHhhhcCCCCcccccC Confidence 4668999999996 99999999999999999999999888888889999999999999999999999999999999999 Q ss_pred CChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecce Q lcl|NC_015159. 90 VSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHN 169 (532) Q Consensus 90 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~ 169 (532) ++|..+.+......+.++|++||++||+.++.+|++||||.++|++|+||++|||+|+|++++ +.+|++|||++ T Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~~~~------~~~~~~~pl~~ 152 (510) T protein:vir:63 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRDSD------AATVVAWSLRS 152 (510) T ss_pred CChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEcCC------CcEEEEEEcce Confidence 999999998888888999999999999999999999999999999999999999999998764 44799999999 Q ss_pred EEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCC-CCeEEEEEE-EcCccccccccc Q lcl|NC_015159. 170 FVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPE-AMVFRSYQE-IDGEIVAGTEGE 247 (532) Q Consensus 170 ~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~-~~~~~s~~~-~~~~~~~~~~~~ 247 (532) |||++|++|+||+||||+++++++|++++++...++..+++|+++|+|||+|+|+++ +|||.|+|+ ++|+.+ +.+++ T Consensus 153 y~v~~d~~G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~~-~~~~~ 231 (510) T protein:vir:63 153 YAVRRDATGRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRV-GKEGR 231 (510) T ss_pred eEEeeCCCcCeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEEEeecCCCceEEEEEEEecCcee-ccccc Confidence 999999999999999999999999999999999888888999999999999999755 799999876 577654 78899 Q ss_pred CccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCCceeecCc Q lcl|NC_015159. 248 YPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGDFVAGR 327 (532) Q Consensus 248 ~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~G~~v~g~ 327 (532) |||++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||+|+|+|+++++++.++++|++++|+ T Consensus 232 ~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~ 311 (510) T protein:vir:63 232 WPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGG 311 (510) T ss_pred cccccCceeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhhccCCCceeecCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHH Q lcl|NC_015159. 328 KQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKI 407 (532) Q Consensus 328 ~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 407 (532) ++++.+++++++++|+.+++.|++++++|+++||++ +.++++++||||||++|++|++++|||||+||++|||.|||+| T Consensus 312 ~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 390 (510) T protein:vir:63 312 AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYV 390 (510) T ss_pred cccceeeecCcccchHHHHHHHHHHHHHHHHHHHhh-cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999987 6789999999999999999999999999999999999999999 Q ss_pred HHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcch--hhhhcCHHHHHHHHHHhcCCCHhHcc Q lcl|NC_015159. 408 LLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGL--QDDDINLLDVKMRLANSLGMDTTGLI 485 (532) Q Consensus 408 ~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~--~~d~id~d~~~~~~a~~~Gv~p~~i~ 485 (532) +|++|++.|+ ||+|++.+++.+|+|+++|+|+|+++++.++.+.++.+.+. +.++||+|+++++||+++||||+.|+ T Consensus 391 ~~~il~r~gl-~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~aq~~~~id~d~~~~~~a~~~Gv~p~~iv 469 (510) T protein:vir:63 391 CLSEVDDALL-QGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFY 469 (510) T ss_pred HHHHHHhccC-CCCCchhcccceecchhHHHHHHHHHHHHHHHHHHHHhcCchhhhccCCHHHHHHHHHHHhCCChhHhc Confidence 9999999985 55566678899999999999999999999998888877654 35789999999999999999999999 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCC Q lcl|NC_015159. 486 LTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGL 529 (532) Q Consensus 486 ~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 529 (532) ||++|++++++|++++++++++++ +...++++++..+.+|+ T Consensus 470 rs~eev~a~~~~~~qq~~~~~~~~---~~~~~~a~~~~~~~~g~ 510 (510) T protein:vir:63 470 KSADELQAEAEQQRQQAAQAQAAQ---ETLLEGASDMTNALAGV 510 (510) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHhhcccccCC Confidence 999999998876554444333222 22345556777788898 No 10 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=100.00 E-value=9.1e-166 Score=925.40 Aligned_cols=506 Identities=31% Similarity=0.442 Sum_probs=466.3 Q ss_pred CHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCCccccC Q lcl|NC_015159. 10 AADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLN 89 (532) Q Consensus 10 ~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~ 89 (532) =++++++||++|| |++||++|+||++||+|+++++++++++++..++|||||++|+++||||||++||||++|||||+ T Consensus 1 mk~~~~~~~~~lk--r~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) T protein:vir:78 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) T ss_pred ChhHHHHHHHHHh--ccchHHHHHHHHHhhccccccCCCCcccccccCcccchHHHHHHHHHHHHHHhhcCCCCcccccC Confidence 4668999999996 99999999999999999999999888888888999999999999999999999999999999999 Q ss_pred CChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecce Q lcl|NC_015159. 90 VSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHN 169 (532) Q Consensus 90 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~ 169 (532) ++|..+.+......+.++|++||++||+.++.+|++||||.++|++|+||++|||+|+|++++. .+|++|||++ T Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~------~~~~~~pl~~ 152 (510) T protein:vir:78 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE------ATVVAWSLRS 152 (510) T ss_pred CChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEeCCC------CeEEEEEcce Confidence 9999999888888889999999999999999999999999999999999999999999997642 2699999999 Q ss_pred EEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCC-CCeEEEEEE-EcCccccccccc Q lcl|NC_015159. 170 FVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPE-AMVFRSYQE-IDGEIVAGTEGE 247 (532) Q Consensus 170 ~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~-~~~~~s~~~-~~~~~~~~~~~~ 247 (532) |||++|++|+||+||||+++|+++|+++|++...+...+++|+++|+|||+|+|+++ +|||.|+|+ ++|..+ +.+++ T Consensus 153 y~v~~d~~G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~i-~~~~~ 231 (510) T protein:vir:78 153 YAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRV-GETGR 231 (510) T ss_pred eEEeeCCCcCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEEEEeecCCCCcEEEEEEEecCeee-ccccc Confidence 999999999999999999999999999999999888888899999999999999765 789999776 577665 78899 Q ss_pred CccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCCceeecCc Q lcl|NC_015159. 248 YPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGDFVAGR 327 (532) Q Consensus 248 ~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~G~~v~g~ 327 (532) |||++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||+|+|+|+++|+++.++++|++++|+ T Consensus 232 ~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~l~~~~~g~~v~g~ 311 (510) T protein:vir:78 232 WPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGG 311 (510) T ss_pred cccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCCccccchhhhccCCCceeecCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHH Q lcl|NC_015159. 328 KQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKI 407 (532) Q Consensus 328 ~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 407 (532) ++++.+++++++++|+++++.|++++++|+++||++ +.++++++||||||++|++|++++|||||+||++|||.|||+| T Consensus 312 ~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 390 (510) T protein:vir:78 312 AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYV 390 (510) T ss_pred cccccccccCcccchHHHHHHHHHHHHHHHHHHhhc-cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999997 6789999999999999999999999999999999999999999 Q ss_pred HHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcc--hhhhhcCHHHHHHHHHHhcCCCHhHcc Q lcl|NC_015159. 408 LLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAG--LQDDDINLLDVKMRLANSLGMDTTGLI 485 (532) Q Consensus 408 ~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p--~~~d~id~d~~~~~~a~~~Gv~p~~i~ 485 (532) +|++|++.|++|+ |++.+++.+|+|+++|+|+|+++++.+|.+.++.+.| .+.+.||+|+++++|++++||||+.|+ T Consensus 391 ~~~il~r~gl~p~-p~~~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gv~p~~iv 469 (510) T protein:vir:78 391 CLSEVDDALLQGL-ITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFY 469 (510) T ss_pred HHHHHHhccCCCC-CcccccceeeecccHHHHHHHHHHHHHHHHHHHHhcChhhhhhcCCHHHHHHHHHHHhCCChhhhc Confidence 9999999986555 5567889999999999999999999999999988876 345789999999999999999999999 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCC Q lcl|NC_015159. 486 LTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGL 529 (532) Q Consensus 486 ~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 529 (532) ||+||++++++|+++|++++++++++ ....++++...+.|+ T Consensus 470 rs~eev~a~~~~~~~q~~~~~~~~~a---~~~~~~~~~~~~~g~ 510 (510) T protein:vir:78 470 KSADELQAEAEEQRRQAAQAQAAQET---LLEGASDMTNALAGV 510 (510) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHH---HHHhhhhhcccCCCC Confidence 99999999998776655544443322 233445667778888 No 11 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=100.00 E-value=5.6e-166 Score=926.57 Aligned_cols=510 Identities=35% Similarity=0.553 Sum_probs=465.9 Q ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCC--CcccccccccccchHHHHHHHHHHHHHHhhcCCCCCccccC Q lcl|NC_015159. 12 DGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSAT--ADGSTSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLN 89 (532) Q Consensus 12 ~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~--~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~ 89 (532) =++++||+.|+++|++|+++|+||++||+|+++++++ ..+.++..++|||||++|+++||||||++||||++|||||. T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 80 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPNHKSLTVPWQSVGAKCCVTLAAKLMLAVLPPQTSFFKLQ 80 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 2488999999999999999999999999999998764 34566788999999999999999999999999999999999 Q ss_pred CChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecce Q lcl|NC_015159. 90 VSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHN 169 (532) Q Consensus 90 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~ 169 (532) ++|.++.+. ..++.+++|++||+.||++++.+|++||||.++|++|+||++|||||+|++++ +|++|||++ T Consensus 81 ~~d~~l~~~-~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~--------~~~~~pl~~ 151 (522) T protein:vir:10 81 VRDDKLGEE-LDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFMGKD--------GLKTFPLTR 151 (522) T ss_pred CChHHHhhh-cChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEEcCC--------CceEEEcce Confidence 999998875 35667789999999999999999999999999999999999999999999764 378999999 Q ss_pred EEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHh--hcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCccccccccc Q lcl|NC_015159. 170 FVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEE--AQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGE 247 (532) Q Consensus 170 ~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~--~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~ 247 (532) |||++|++|+||+||||+++|+++|+++|+..... .+..++++++|+|||+|+|+++++.|.++++++|+.+++.+++ T Consensus 152 y~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~~~~~~s~ 231 (522) T protein:vir:10 152 YVINRDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKSSGRWVWHQEAFDKIIPDSRST 231 (522) T ss_pred EEEeeCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeeccCCceEEEEccCCccccccccc Confidence 99999999999999999999999999999876432 2345688999999999999999888999999999988888999 Q ss_pred CccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCCceeecCc Q lcl|NC_015159. 248 YPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGDFVAGR 327 (532) Q Consensus 248 ~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~G~~v~g~ 327 (532) |||++|||+++||++.+|++||||||+++|||+|+||.|+++++++++++++|||+|+|+|++++.++.++++|.+++|. T Consensus 232 ~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~l~~~~~~~~v~g~ 311 (522) T protein:vir:10 232 APKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPATIAKAGNGAIVQGR 311 (522) T ss_pred cccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeccccccccccccCCCCcceecCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHH Q lcl|NC_015159. 328 KQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKI 407 (532) Q Consensus 328 ~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 407 (532) ++++.+++++++++|+.+.+.|++++++|+++||+. .++++++||||||++|++|++++|||||+||+.|||.|||+| T Consensus 312 ~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~~--~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 389 (522) T protein:vir:10 312 PEDVAVIQVGKTADFSTAANMATAIEKRLLEAFLVM--NVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNR 389 (522) T ss_pred CccceeecccccccchHHHHHHHHHHHHHHHHHhhc--cCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999865 478999999999999999999999999999999999999999 Q ss_pred HHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhc-ch-hhhhcCHHHHHHHHHHhcCCCHhHcc Q lcl|NC_015159. 408 LLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLA-GL-QDDDINLLDVKMRLANSLGMDTTGLI 485 (532) Q Consensus 408 ~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~-p~-~~d~id~d~~~~~~a~~~Gv~p~~i~ 485 (532) +|++|+|+|+||++|++++++.+|+|+++|+|+|+++++++|++.++++. |+ ++|+||+|++++++|+++|||++.|+ T Consensus 390 ~~~il~r~g~lP~~p~~~~~~~~v~~is~Laraq~~~~l~~~~~~i~~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~~iv 469 (522) T protein:vir:10 390 TLLVLQRSNQIPKLPKDIVRPTIVAGVNALGRGQDRESLTAFVGTIAQTLGPEALMQYLNPLEAIKRLAAAQGIDVLNLV 469 (522) T ss_pred HHHHHHhcCCCCCCCccccccccccchhHHHHHHHHHHHHHHHHHHHHhhCchhhhhcCCHHHHHHHHHHHhCCChhhhc Confidence 99999999999999999999999999999999999999999999999874 44 46899999999999999999888999 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHH------hhcccccCCCCC Q lcl|NC_015159. 486 LTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAA------AMMQQQAGLPTQ 532 (532) Q Consensus 486 ~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~g~~~~ 532 (532) ||+||+++++|++|++++++++++++++.++..++ ..|++++....+ T Consensus 470 rt~eev~~~~q~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (522) T protein:vir:10 470 KTEQQLAEEQQAAQQQAAQQSLVDQAGQMTGSPLMDPTKNPQLMDEEQPPMEE 522 (522) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCccccHHHHHHhCCCCCC Confidence 99999999998887777766666555544433332 344444444444 No 12 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=100.00 E-value=8.9e-164 Score=914.49 Aligned_cols=513 Identities=32% Similarity=0.518 Sum_probs=460.0 Q ss_pred CHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCCccccC Q lcl|NC_015159. 10 AADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLN 89 (532) Q Consensus 10 ~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~ 89 (532) =++++++||++|+++|++|+++|+||++||+|++++++++.++++..++|||||++|+++|||||||+||||++|||||. T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 80 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLPYLLTEDGHASGGRLQQPYQSLGSKGVNALSSKLMLSLFPIQTSFFKLQ 80 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 35568899999999999999999999999999999999988888889999999999999999999999999999999999 Q ss_pred CChHHHhhhcc-ChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecc Q lcl|NC_015159. 90 VSELEVKQSIT-SPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLH 168 (532) Q Consensus 90 ~~d~~~~~~~~-~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~ 168 (532) ++|.++.+... +++.+++++.||++||++|+++|++||||.++|++|+||++|||||+|++++ +|++|||+ T Consensus 81 ~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~--------~~~~~pl~ 152 (542) T protein:vir:78 81 INDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFAGKK--------TLKVYPLD 152 (542) T ss_pred CCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEecCC--------CceEEecc Confidence 99999988644 5556788999999999999999999999999999999999999999999764 48999999 Q ss_pred eEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHH----HHhhcccCCCcceEEEEEEEEeeCCC---------CeEEEE-E Q lcl|NC_015159. 169 NFVVERDAYDNVLQIVTEDKIARAALPEDVRKS----LEEAQGDQNPSEEVTIYTHVYRDPEA---------MVFRSY-Q 234 (532) Q Consensus 169 ~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~----~~~~~~~~~~~~~v~i~~~v~~~~~~---------~~~~s~-~ 234 (532) +|||++|++|+||+||||+++|+++|+++|+.. ..+....++++.+++++|+|+|+.+. ++|.|+ + T Consensus 153 ~y~v~~d~~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~~~~~~~s~~~ 232 (542) T protein:vir:78 153 RYVIERDGDGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCKLVDGQHRWHQ 232 (542) T ss_pred eeEEeeCCCCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccccccCCCeEEEEE Confidence 999999999999999999999999999998853 23445667889999999999997653 455554 5 Q ss_pred EEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhh Q lcl|NC_015159. 235 EIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRR 314 (532) Q Consensus 235 ~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~ 314 (532) +++|+.+.+.+++|||++|||+++||++.+|++|||||++++|||+|+||.|+++.+++++++++|||+|+|||++++.+ T Consensus 233 e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~~~~~~ 312 (542) T protein:vir:78 233 ECDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSPSATTKPQS 312 (542) T ss_pred EeccccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhh Confidence 67888887888999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHH Q lcl|NC_015159. 315 VAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYS 394 (532) Q Consensus 315 ~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~ 394 (532) +.++++|+|++|.++++++++++++++|+++++.|++++++|+++||+++ .+|+++||||||++|++|++++|||||+ T Consensus 313 ~~~~~~g~iv~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~~~--~~d~~rvTAtEV~~r~~E~~~~LG~v~~ 390 (542) T protein:vir:78 313 LARAGTGAIIQGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFLILN--VRQSERTTATEVREVQMELDRQLSGIYG 390 (542) T ss_pred cccCCCceeecCCccceeeeecccccchhHHHHHHHHHHHHHHHHhcccc--cCCcccccHHHHHHHHHHHHHHhhHHHH Confidence 99999999999999999999999999999999999999999999999875 5899999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhh-cch-hhhhcCHHHHHHH Q lcl|NC_015159. 395 LLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKL-AGL-QDDDINLLDVKMR 472 (532) Q Consensus 395 rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~-~p~-~~d~id~d~~~~~ 472 (532) ||++|||.|+|+|+|++|+++|+||++|+++++++|+++|++++|+++++++..|++.++++ .|+ ++++||+|+++++ T Consensus 391 rl~~E~L~Pli~R~~~il~r~g~lP~~p~~lv~~~~~s~La~~~r~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~ 470 (542) T protein:vir:78 391 SLTVELLTPYLNRKLHLMQRSKQLPSLPKGLVMPTVVAGLGGVGRGEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKR 470 (542) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeeechHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999886 344 4689999999999 Q ss_pred HHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhccccc-----CCCCC Q lcl|NC_015159. 473 LANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQA-----GLPTQ 532 (532) Q Consensus 473 ~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----g~~~~ 532 (532) +++++|||++.|++|+||++++++|+|++++++.+.++++.+++..++....++. ..|.+ T Consensus 471 ~a~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al~~~a~~~a~~~~~~~~~~~~~a~~~~~~~~ 535 (542) T protein:vir:78 471 LAAASGIDTLNLVKSPETMANEAQQAQQQQMTASLMGQAGQLAKSPIGEKMMQQINAPGQEAPAG 535 (542) T ss_pred HHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhhcCCCCcCCCCC Confidence 9999999989999999999999888776666555544444433322222211111 11211 No 13 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=100.00 E-value=4.6e-162 Score=905.08 Aligned_cols=503 Identities=26% Similarity=0.357 Sum_probs=457.4 Q ss_pred HHHHHHHH--HHHhhhHHHHHHHHHHhhcccccCC--CCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCCccccC Q lcl|NC_015159. 14 AAAAYNRL--KNDRGAYETRAEDCATYTIPSVFPS--ATADGSTSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLN 89 (532) Q Consensus 14 ~~~r~~~l--k~~R~~~e~~w~e~~~~~~P~~~~~--~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~ 89 (532) .++|+..| |.+|++|+++|+||++||+|+++++ +++++..+..++|||||++|+++|||||||+||||++|||||+ T Consensus 1 m~~~~~~l~~k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 80 (514) T protein:vir:80 1 MRQQASAMWAEYRDSTAIRKAEDFAKFTIASLMVDPLDKTHQAEVVEYDFQSAGAFLVNNLTAKLALTLFPPGRPSFQIE 80 (514) T ss_pred CccchHHHHHHhhcchHHHHHHHHHHHhcccccCCCCCCcccccccccccchhHHHHHHHHHHHHHhhhcCCCCcccccc Confidence 55556666 6679999999999999999998754 4555566778999999999999999999999999999999999 Q ss_pred CChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecce Q lcl|NC_015159. 90 VSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHN 169 (532) Q Consensus 90 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~ 169 (532) ++|...+.....+.+..+|++||++||++|+.+|++||||.++|++|+||++|||||+|++++ +.+|++|||++ T Consensus 81 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~------~~~~~~~pl~~ 154 (514) T protein:vir:80 81 LDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYREPG------TGKMLVWTMQS 154 (514) T ss_pred cCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEecC------CCcEEEEEcCe Confidence 998777766667778899999999999999999999999999999999999999999999753 33689999999 Q ss_pred EEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCC-CCeEEEEE-EEcCccccccccc Q lcl|NC_015159. 170 FVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPE-AMVFRSYQ-EIDGEIVAGTEGE 247 (532) Q Consensus 170 ~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~-~~~~~s~~-~~~~~~~~~~~~~ 247 (532) |||++|++|+||+||||++|++++|+++++....+...+++++++|+|||||+|+++ +++|.|+| +++|..+ +.+++ T Consensus 155 y~v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~g~~i-~~es~ 233 (514) T protein:vir:80 155 YTVRRTSHGDPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVIEWQPTPNGKRCAVWHELEGKRV-GPESS 233 (514) T ss_pred EEEeeCCCcCeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEEEeecCCCCeEEEEEEeccceee-cccCc Confidence 999999999999999999999999999999999888888899999999999998754 67777765 6677765 78899 Q ss_pred CccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCCceeecCc Q lcl|NC_015159. 248 YPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGDFVAGR 327 (532) Q Consensus 248 ~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~G~~v~g~ 327 (532) |+|++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|||+|+|+|++++.++.++++|++++|. T Consensus 234 y~~~e~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~~~~~~l~~~~~g~~v~g~ 313 (514) T protein:vir:80 234 YPAHLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNLVDEAKGGAVDDYRDAETGDFVPGQ 313 (514) T ss_pred cccccCCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCcccccchhhhcccCCceeecCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHH Q lcl|NC_015159. 328 KQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKI 407 (532) Q Consensus 328 ~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 407 (532) ++++.+++++++++|+.+++.|++++++|+++||++... +++++||||||++|++|++++|||||+||++|||.|||+| T Consensus 314 ~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFml~~~~-rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r 392 (514) T protein:vir:80 314 VGSVASYERGDYNKIAQASASVESIVMRLNRAFMYTGQV-RDAERVTVEEIRTVAEEAENLLGGVYSLLAETLQAPLAYL 392 (514) T ss_pred CccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhccC-CCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999988654 8999999999999999999999999999999999999999 Q ss_pred HHHHHHh--cCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHH---hhcchhhhhcCHHHHHHHHHHhcCCCHh Q lcl|NC_015159. 408 LLKELQA--TSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMI---KLAGLQDDDINLLDVKMRLANSLGMDTT 482 (532) Q Consensus 408 ~~~il~r--~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~la---q~~p~~~d~id~d~~~~~~a~~~Gv~p~ 482 (532) +|++|++ .|.||++|+++++++|+++|++|+|+++++++..|++.++ ++.|+++|+||+|+++++||+++|||++ T Consensus 393 ~~~il~r~~~g~lP~~p~~l~~~~~vs~la~l~r~~~~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~ 472 (514) T protein:vir:80 393 TMYEASRGNGGMLLGIAQGVYRPSIITGIPALTRNIETANILRATQEASAIVPALVQLSKRFDPEKLVERIFANNSVDLS 472 (514) T ss_pred HHHHHhhhccCCCCCCCchhhcceeeecHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHhCCCHh Confidence 9999987 5999999999999999999999999999888887766654 5567789999999999999999999888 Q ss_pred HccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcc Q lcl|NC_015159. 483 GLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQ 524 (532) Q Consensus 483 ~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (532) .|++++|++++++++++++++++++.++..+.++++++-+.+ T Consensus 473 ~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (514) T protein:vir:80 473 TLSKDPDVVAAEAEQEAALAQQQLDVASGALAAETSAGVLTS 514 (514) T ss_pred hccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccCC Confidence 999999999999888888777777766666666665555554 No 14 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=100.00 E-value=1.8e-161 Score=901.86 Aligned_cols=516 Identities=16% Similarity=0.147 Sum_probs=448.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccccc---CCCCCcccccccccccchHHHHHHHHHHHHHHh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVF---PSATADGSTSYTTPWQSIGARGLNNLASKLMLA 77 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~---~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 77 (532) |++.+ ++++|++||+.|+++|++||++|+||++||+|++. ..+++.++++.+++|||||++|+++|||||||+ T Consensus 1 M~~~~----~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ 76 (555) T protein:vir:98 1 MAEQT----ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAG 76 (555) T ss_pred CCCcc----cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHh Confidence 99988 68999999999999999999999999999999964 445667778899999999999999999999999 Q ss_pred hcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccC Q lcl|NC_015159. 78 LFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEG 157 (532) Q Consensus 78 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~ 157 (532) ||||++|||||++.|+++.+. .++++||++||++|+++|++||||.++|++|+||++|||||+|++++. . T Consensus 77 ltpp~~~WF~l~~~d~~l~e~-------~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~---~ 146 (555) T protein:vir:98 77 MTSPARPWFRLTTSIPELDES-------AAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDF---D 146 (555) T ss_pred hcCCCCcccccccCcccccch-------HHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCC---C Confidence 999999999999998877653 579999999999999999999999999999999999999999998753 4 Q ss_pred CcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHH-----HHHhhcccCCCcceEEEEEEEEeeCC------ Q lcl|NC_015159. 158 QSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRK-----SLEEAQGDQNPSEEVTIYTHVYRDPE------ 226 (532) Q Consensus 158 ~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~-----~~~~~~~~~~~~~~v~i~~~v~~~~~------ 226 (532) ++++|++|||++|||++|+.|+||+||||++||++++.++|+. .+++...++.++++|+|||+|+|+.+ T Consensus 147 ~~~rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~ 226 (555) T protein:vir:98 147 AVVYHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKR 226 (555) T ss_pred ceEEEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCC Confidence 7889999999999999999999999999999999998776653 23333334445678999999998643 Q ss_pred ---CCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Q lcl|NC_015159. 227 ---AMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFF 303 (532) Q Consensus 227 ---~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~l 303 (532) +|||+|||++.+......++++||++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||+ T Consensus 227 ~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~ 306 (555) T protein:vir:98 227 DDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQ 306 (555) T ss_pred CccccceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 58999999987654445577889999999999999999999999999999999999999999999999999999999 Q ss_pred ecCccccChhhhccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhh---hcccCCCCCCCHHHHHH Q lcl|NC_015159. 304 VNPNGVTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLN---SAVQRGGDRVTAEEIRY 380 (532) Q Consensus 304 v~~~g~~~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~---~~~~~~~~~~TAtEi~~ 380 (532) ++++|.+++.++.+++.+.+.+|..++....++++.+||+.+.+.|++++++|+++||.+ ++.++++++||||||++ T Consensus 307 v~~~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~ 386 (555) T protein:vir:98 307 LPVSAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAE 386 (555) T ss_pred eccccccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHH Confidence 999999888888888888888888877666677888999999999999999999999977 67779999999999999 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee-ecchHHHHHHHHHH------HHHHHHHHH Q lcl|NC_015159. 381 VAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI-ATGLEALGRGHDLN------KLNVFIDYM 453 (532) Q Consensus 381 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~-v~~l~~l~raq~~~------~l~~~~~~l 453 (532) |++|++++|||||+||+.|||.|||+|+|++|+++|+||++|+++.+..+ |+|+++|+|+|+.+ +++++++.+ T Consensus 387 r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~l 466 (555) T protein:vir:98 387 RHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAV 466 (555) T ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999988777 78888888887654 455566777 Q ss_pred HhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 454 IKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 454 aq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) +|+.|+++|+||+|++++++++++|||+ .++||++|+++.|+|++++++++++++++.++.+. ++.+......-+-. T Consensus 467 aq~~P~vld~id~d~~~~~~a~~~Gvp~-~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~-~~~~~~~~~~~~~~ 543 (555) T protein:vir:98 467 AGIKPEVLDKFDADRWADTYADMLGIDP-ELIVPGNQVALIRKQRADQQQAAQQAALLNQGADT-AAKLGSVDTSKQNA 543 (555) T ss_pred hcCChhhhhcCCHHHHHHHHHHHhCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcccccCcchh Confidence 8999999999999999999999999975 79999999999999887766655544333333322 22222211111111 No 15 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=100.00 E-value=1.8e-161 Score=901.86 Aligned_cols=516 Identities=16% Similarity=0.147 Sum_probs=448.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccccc---CCCCCcccccccccccchHHHHHHHHHHHHHHh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVF---PSATADGSTSYTTPWQSIGARGLNNLASKLMLA 77 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~---~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 77 (532) |++.+ ++++|++||+.|+++|++||++|+||++||+|++. ..+++.++++.+++|||||++|+++|||||||+ T Consensus 1 M~~~~----~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ 76 (555) T protein:vir:10 1 MAEQT----ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAG 76 (555) T ss_pred CCCcc----cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHh Confidence 99988 68999999999999999999999999999999964 445667778899999999999999999999999 Q ss_pred hcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccC Q lcl|NC_015159. 78 LFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEG 157 (532) Q Consensus 78 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~ 157 (532) ||||++|||||++.|+++.+. .++++||++||++|+++|++||||.++|++|+||++|||||+|++++. . T Consensus 77 ltpp~~~WF~l~~~d~~l~e~-------~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~---~ 146 (555) T protein:vir:10 77 MTSPARPWFRLTTSIPELDES-------AAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDF---D 146 (555) T ss_pred hcCCCCcccccccCcccccch-------HHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCC---C Confidence 999999999999998877653 579999999999999999999999999999999999999999998753 4 Q ss_pred CcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHH-----HHHhhcccCCCcceEEEEEEEEeeCC------ Q lcl|NC_015159. 158 QSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRK-----SLEEAQGDQNPSEEVTIYTHVYRDPE------ 226 (532) Q Consensus 158 ~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~-----~~~~~~~~~~~~~~v~i~~~v~~~~~------ 226 (532) ++++|++|||++|||++|+.|+||+||||++||++++.++|+. .+++...++.++++|+|||+|+|+.+ T Consensus 147 ~~~rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~ 226 (555) T protein:vir:10 147 AVVYHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKR 226 (555) T ss_pred ceEEEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCC Confidence 7889999999999999999999999999999999998776653 23333334445678999999998643 Q ss_pred ---CCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Q lcl|NC_015159. 227 ---AMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFF 303 (532) Q Consensus 227 ---~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~l 303 (532) +|||+|||++.+......++++||++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||+ T Consensus 227 ~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~ 306 (555) T protein:vir:10 227 DDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQ 306 (555) T ss_pred CccccceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 58999999987654445577889999999999999999999999999999999999999999999999999999999 Q ss_pred ecCccccChhhhccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhh---hcccCCCCCCCHHHHHH Q lcl|NC_015159. 304 VNPNGVTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLN---SAVQRGGDRVTAEEIRY 380 (532) Q Consensus 304 v~~~g~~~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~---~~~~~~~~~~TAtEi~~ 380 (532) ++++|.+++.++.+++.+.+.+|..++....++++.+||+.+.+.|++++++|+++||.+ ++.++++++||||||++ T Consensus 307 v~~~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~ 386 (555) T protein:vir:10 307 LPVSAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAE 386 (555) T ss_pred eccccccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHH Confidence 999999888888888888888888877666677888999999999999999999999977 67779999999999999 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee-ecchHHHHHHHHHH------HHHHHHHHH Q lcl|NC_015159. 381 VAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI-ATGLEALGRGHDLN------KLNVFIDYM 453 (532) Q Consensus 381 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~-v~~l~~l~raq~~~------~l~~~~~~l 453 (532) |++|++++|||||+||+.|||.|||+|+|++|+++|+||++|+++.+..+ |+|+++|+|+|+.+ +++++++.+ T Consensus 387 r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~l 466 (555) T protein:vir:10 387 RHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAV 466 (555) T ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999988777 78888888887654 455566777 Q ss_pred HhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 454 IKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 454 aq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) +|+.|+++|+||+|++++++++++|||+ .++||++|+++.|+|++++++++++++++.++.+. ++.+......-+-. T Consensus 467 aq~~P~vld~id~d~~~~~~a~~~Gvp~-~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~-~~~~~~~~~~~~~~ 543 (555) T protein:vir:10 467 AGIKPEVLDKFDADRWADTYADMLGIDP-ELIVPGNQVALIRKQRADQQQAAQQAALLNQGADT-AAKLGSVDTSKQNA 543 (555) T ss_pred hcCChhhhhcCCHHHHHHHHHHHhCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcccccCcchh Confidence 8999999999999999999999999975 79999999999999887766655544333333322 22222211111111 No 16 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=100.00 E-value=1.8e-161 Score=901.86 Aligned_cols=516 Identities=16% Similarity=0.147 Sum_probs=448.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccccc---CCCCCcccccccccccchHHHHHHHHHHHHHHh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVF---PSATADGSTSYTTPWQSIGARGLNNLASKLMLA 77 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~---~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 77 (532) |++.+ ++++|++||+.|+++|++||++|+||++||+|++. ..+++.++++.+++|||||++|+++|||||||+ T Consensus 1 M~~~~----~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ 76 (555) T protein:vir:10 1 MAEQT----ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAG 76 (555) T ss_pred CCCcc----cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHh Confidence 99988 68999999999999999999999999999999964 445667778899999999999999999999999 Q ss_pred hcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccC Q lcl|NC_015159. 78 LFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEG 157 (532) Q Consensus 78 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~ 157 (532) ||||++|||||++.|+++.+. .++++||++||++|+++|++||||.++|++|+||++|||||+|++++. . T Consensus 77 ltpp~~~WF~l~~~d~~l~e~-------~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~---~ 146 (555) T protein:vir:10 77 MTSPARPWFRLTTSIPELDES-------AAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDF---D 146 (555) T ss_pred hcCCCCcccccccCcccccch-------HHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCC---C Confidence 999999999999998877653 579999999999999999999999999999999999999999998753 4 Q ss_pred CcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHH-----HHHhhcccCCCcceEEEEEEEEeeCC------ Q lcl|NC_015159. 158 QSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRK-----SLEEAQGDQNPSEEVTIYTHVYRDPE------ 226 (532) Q Consensus 158 ~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~-----~~~~~~~~~~~~~~v~i~~~v~~~~~------ 226 (532) ++++|++|||++|||++|+.|+||+||||++||++++.++|+. .+++...++.++++|+|||+|+|+.+ T Consensus 147 ~~~rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~ 226 (555) T protein:vir:10 147 AVVYHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKR 226 (555) T ss_pred ceEEEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCC Confidence 7889999999999999999999999999999999998776653 23333334445678999999998643 Q ss_pred ---CCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Q lcl|NC_015159. 227 ---AMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFF 303 (532) Q Consensus 227 ---~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~l 303 (532) +|||+|||++.+......++++||++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||+ T Consensus 227 ~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~ 306 (555) T protein:vir:10 227 DDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQ 306 (555) T ss_pred CccccceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 58999999987654445577889999999999999999999999999999999999999999999999999999999 Q ss_pred ecCccccChhhhccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhh---hcccCCCCCCCHHHHHH Q lcl|NC_015159. 304 VNPNGVTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLN---SAVQRGGDRVTAEEIRY 380 (532) Q Consensus 304 v~~~g~~~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~---~~~~~~~~~~TAtEi~~ 380 (532) ++++|.+++.++.+++.+.+.+|..++....++++.+||+.+.+.|++++++|+++||.+ ++.++++++||||||++ T Consensus 307 v~~~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~ 386 (555) T protein:vir:10 307 LPVSAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAE 386 (555) T ss_pred eccccccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHH Confidence 999999888888888888888888877666677888999999999999999999999977 67779999999999999 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee-ecchHHHHHHHHHH------HHHHHHHHH Q lcl|NC_015159. 381 VAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI-ATGLEALGRGHDLN------KLNVFIDYM 453 (532) Q Consensus 381 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~-v~~l~~l~raq~~~------~l~~~~~~l 453 (532) |++|++++|||||+||+.|||.|||+|+|++|+++|+||++|+++.+..+ |+|+++|+|+|+.+ +++++++.+ T Consensus 387 r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~l 466 (555) T protein:vir:10 387 RHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAV 466 (555) T ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999988777 78888888887654 455566777 Q ss_pred HhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 454 IKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 454 aq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) +|+.|+++|+||+|++++++++++|||+ .++||++|+++.|+|++++++++++++++.++.+. ++.+......-+-. T Consensus 467 aq~~P~vld~id~d~~~~~~a~~~Gvp~-~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~-~~~~~~~~~~~~~~ 543 (555) T protein:vir:10 467 AGIKPEVLDKFDADRWADTYADMLGIDP-ELIVPGNQVALIRKQRADQQQAAQQAALLNQGADT-AAKLGSVDTSKQNA 543 (555) T ss_pred hcCChhhhhcCCHHHHHHHHHHHhCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcccccCcchh Confidence 8999999999999999999999999975 79999999999999887766655544333333322 22222211111111 No 17 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=100.00 E-value=1.9e-161 Score=901.76 Aligned_cols=510 Identities=36% Similarity=0.539 Sum_probs=458.2 Q ss_pred CHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCCccccC Q lcl|NC_015159. 10 AADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLN 89 (532) Q Consensus 10 ~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~ 89 (532) =+++|++||+.|+++|++|+++|+||++||+|+++++++++++++..++|||||++|+++|||||||+||||++|||||. T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~ 80 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNTSFFKLQ 80 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 35679999999999999999999999999999999999998888899999999999999999999999999999999999 Q ss_pred CChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecce Q lcl|NC_015159. 90 VSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHN 169 (532) Q Consensus 90 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~ 169 (532) ++|+++++....++.+.+++.||++||++++.+|++||||.++|++|+||++|||||+|++++ ++++|||++ T Consensus 81 ~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~--------~~~~~pl~~ 152 (555) T protein:vir:17 81 INDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLYQGKK--------NLKLYPLDR 152 (555) T ss_pred cCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEecCC--------ceeEEEcCe Confidence 999999998888889999999999999999999999999999999999999999999998754 478999999 Q ss_pred EEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHH----hhcc-----------------cCCCcceEEEEEEEEeeCCCC Q lcl|NC_015159. 170 FVVERDAYDNVLQIVTEDKIARAALPEDVRKSLE----EAQG-----------------DQNPSEEVTIYTHVYRDPEAM 228 (532) Q Consensus 170 ~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~----~~~~-----------------~~~~~~~v~i~~~v~~~~~~~ 228 (532) |||++|++|+||+||||+++|+++|+++|++... +... +++++.++++|+++.++.+ T Consensus 153 y~v~~d~~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~~~~~~~-- 230 (555) T protein:vir:17 153 FVVSRDGEGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTYVCRKDG-- 230 (555) T ss_pred EEEeeCCCcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcceeEeecccccCC-- Confidence 9999999999999999999999999999986421 1111 2234455667766655333 Q ss_pred eEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCcc Q lcl|NC_015159. 229 VFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNG 308 (532) Q Consensus 229 ~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g 308 (532) .|.++++++|..+.+.+++|||++|||+++||++.+|++||||||+++|||+|+||.|+++++++++++++|||+|+|+| T Consensus 231 ~~~~~~e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~lv~~~g 310 (555) T protein:vir:17 231 QVKWHQECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVVFMVSPSA 310 (555) T ss_pred eeEEEEecCceeccccccccCcccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccc Confidence 36667788888887888999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccChhhhccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_015159. 309 VTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDT 388 (532) Q Consensus 309 ~~~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~ 388 (532) ++++.++.++++|.+++|.++++.+++.+++++|+.+++.|++++++|+++||+++ .+++++||||||++|++|++++ T Consensus 311 ~~~~~~l~~~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~~~--~~d~~r~TAtEV~~r~~E~~~~ 388 (555) T protein:vir:17 311 TTKPQNLALAANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLMLQ--VRQSERTTATEVQATVQELNEQ 388 (555) T ss_pred ccCcceeecCCCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhhcC--CCCcccchHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999864 5899999999999999999999 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhc--chhhhhcCH Q lcl|NC_015159. 389 LGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLA--GLQDDDINL 466 (532) Q Consensus 389 LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~--p~~~d~id~ 466 (532) |||||+||++|||.|||+|+|++|+|+|+||++|++++++++++++.+|+|+++++++++|++.++|+. |.++|+||+ T Consensus 389 LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~~~l~~l~r~~~~~~l~~~~~~laq~~~~p~~~d~id~ 468 (555) T protein:vir:17 389 IGGIYSNLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVVAGLWGVGRGQDKQQLMEFITTLAQTMGPEIAMKYINP 468 (555) T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhccceeehHHHHHHHHHHHHHHHHHHHHHhhcCchhHhhcCCH Confidence 999999999999999999999999999999999999999999999999999999999999999999995 779999999 Q ss_pred HHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 467 LDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 467 d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) |++++.|++++||||+.++||+||+++++|+++++++++++.+++++.++.+++..+. +.|++++ T Consensus 469 d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~~~qa~~~~~~~~~~~~~-~~~~~~~ 533 (555) T protein:vir:17 469 TEFIKRLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASLINQAGQLAKTPMAEQAM-QLIQQQQ 533 (555) T ss_pred HHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHH-hccccch Confidence 9999999999999999999999999999888877666665555555544332211100 1122222 No 18 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=100.00 E-value=1.8e-161 Score=901.90 Aligned_cols=509 Identities=28% Similarity=0.443 Sum_probs=463.0 Q ss_pred CCCC-CCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhc Q lcl|NC_015159. 1 MAEV-EKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALF 79 (532) Q Consensus 1 m~~~-~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 79 (532) |-.+ -+.|.++++|++||++||++|++|+++|+||++||+|++++++++ .+..+++|||||++|+++|||||||+|| T Consensus 1 ~~~~~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~~--~~~~~~~~dstg~~a~~~LAa~l~~~lt 78 (515) T protein:vir:70 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGD--NETSQNGWQGVGAQATNHLANKLAQVLF 78 (515) T ss_pred CcchhhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccccCCCCC--cccccccccchHHHHHHHHHHHHHHhhc Confidence 7666 788999999999999999999999999999999999999877654 3345689999999999999999999999 Q ss_pred CCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCc Q lcl|NC_015159. 80 PVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQS 159 (532) Q Consensus 80 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~ 159 (532) ||++|||||+++|..+++....+.+..++++||+.||+.++.+|++||||.++|++|+||++|||||+|++++. T Consensus 79 pp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~------ 152 (515) T protein:vir:70 79 PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG------ 152 (515) T ss_pred CCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEEeCCC------ Confidence 99999999999999988888888888999999999999999999999999999999999999999999997542 Q ss_pred ceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHh--hcccCCCcceEEEEEEEEeeCCCCeEEEEEEEc Q lcl|NC_015159. 160 NAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEE--AQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEID 237 (532) Q Consensus 160 ~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~--~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~ 237 (532) +|++|||++|||++|++|+||+||||+++++++|+++|++.... ...+.+++++|+|||+|+++++++ |..+++++ T Consensus 153 -~~~~~pl~~y~v~~d~~G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~-~~~~~e~d 230 (515) T protein:vir:70 153 -AMSAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGF-WKINQSAD 230 (515) T ss_pred -CeEEEEcCeEEEeeCCCcCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecCCCc-eEEEEecC Confidence 38999999999999999999999999999999999999976532 233567899999999999987653 66677778 Q ss_pred CcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhcc Q lcl|NC_015159. 238 GEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAK 317 (532) Q Consensus 238 ~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~ 317 (532) |+.+ +.++.|+|++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|||+|+|||++++.++.+ T Consensus 231 ~~~~-~~es~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~l~~ 309 (515) T protein:vir:70 231 DIPV-GKESRIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDHFVN 309 (515) T ss_pred ceee-ccccccccccCCceeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhhccc Confidence 7754 678899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHH Q lcl|NC_015159. 318 ANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLS 397 (532) Q Consensus 318 ~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~ 397 (532) +++|.+++|.++++.++++++++||+.+++.|++++++|+++||++.+.++++++||||||++|++|++++||||||||+ T Consensus 310 ~~~g~iv~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~srL~ 389 (515) T protein:vir:70 310 SGTGEVITGVAEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFA 389 (515) T ss_pred cCCceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHH---HhhcchhhhhcCHHHHHHHHH Q lcl|NC_015159. 398 QELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYM---IKLAGLQDDDINLLDVKMRLA 474 (532) Q Consensus 398 ~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~l---aq~~p~~~d~id~d~~~~~~a 474 (532) .|||.||+.|++ .|.+|++|.+++++++|+||++|+|+|+++++..|++.+ ++++|+++|++|+|+++++++ T Consensus 390 ~Ell~Pli~r~~-----~~~~p~~P~~~v~~~~vs~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~~~id~d~~~~~~a 464 (515) T protein:vir:70 390 MTMQTPIAMWGL-----QEAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVR 464 (515) T ss_pred HHHHHHHHHHHH-----HhhCCCCChhhcccceehhHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhhCCHHHHHHHHH Confidence 999999999864 577899999999999999999999999999888777665 588889999999999999999 Q ss_pred HhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccc Q lcl|NC_015159. 475 NSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQ 526 (532) Q Consensus 475 ~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (532) +.+|+ |..++||+||++++++|+++++++++++++++.+++..+...|.+- T Consensus 465 ~~~g~-p~~~~rs~eev~~~r~q~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) T protein:vir:70 465 GQISA-ELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) T ss_pred HHhCC-CccccCCHHHHHHHHHHHHHHHHHHHHHHhhhhhcccchhhhhccC Confidence 99998 5569999999999999888877777666666555544444444333 No 19 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=100.00 E-value=1.4e-161 Score=902.42 Aligned_cols=516 Identities=15% Similarity=0.142 Sum_probs=452.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCC------CCCcccccccccccchHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPS------ATADGSTSYTTPWQSIGARGLNNLASKL 74 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~------~~~~~~~~~~~~~dst~~~a~~~Laa~l 74 (532) |++.+ ..-+++|++||++|+++|++||++|+||++||+|++..+ ++..+.++++++|||||++|+++||||| T Consensus 1 m~~d~--~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAs~l 78 (549) T protein:vir:10 1 MTNDD--AKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFDSTAPLALRNFVAAM 78 (549) T ss_pred CCcch--HHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccccccccchHHHHHHHHHHHH Confidence 99864 224688999999999999999999999999999998543 3456677889999999999999999999 Q ss_pred HHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHH--HhcCChHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_015159. 75 MLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYM--ESNSFRPTLHAAIKQLLVAGNVLLYIPST 152 (532) Q Consensus 75 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l--~~snf~~~~~~~~~dl~~~G~~~~~v~~~ 152 (532) |++||||++|||||.++|+++.+. .+++.||++||+.++..+ ++||||.++|++|+||++|||||+|++++ T Consensus 79 ~~~ltpp~~~wF~l~~~~~~~~e~-------~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~ 151 (549) T protein:vir:10 79 DSMITPATQLWHRLKTGNDALNEI-------ASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHD 151 (549) T ss_pred HhhccCCCCccccccCCccchhhh-------hHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeec Confidence 999999999999999999877654 479999999999999865 58999999999999999999999999864 Q ss_pred ccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHH----HHhhcccCCCcceEEEEEEEEeeC--- Q lcl|NC_015159. 153 EQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKS----LEEAQGDQNPSEEVTIYTHVYRDP--- 225 (532) Q Consensus 153 ~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~----~~~~~~~~~~~~~v~i~~~v~~~~--- 225 (532) .+++++|++|||++|||++|++|+||+||||++||+++|.++|+.. ..+...+++|+++|+|||+|+|+. T Consensus 152 ---~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~~V~pr~~~~ 228 (549) T protein:vir:10 152 ---VGKGIVYRNVPMQRLWFAENNSGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADRD 228 (549) T ss_pred ---CCCeeEEEEEEcCeEEEeeCCCCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEEEeecCCCCC Confidence 4578899999999999999999999999999999999988766642 223444678899999999999864 Q ss_pred ------CCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015159. 226 ------EAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSK 299 (532) Q Consensus 226 ------~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~ 299 (532) .+|||.|||+++|.. ..++++||++|||+++||++.+|++||||||+++|||+|+||.|+++++++++++++ T Consensus 229 ~~~~~~~~~pf~sv~~e~~~~--~il~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~ 306 (549) T protein:vir:10 229 PRKLDGRNMQFASYWLDEGRD--RIVQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVD 306 (549) T ss_pred ccccccccCceEEEEEEecCC--EeeccCCcccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 368999999988774 345677889999999999999999999999999999999999999999999999999 Q ss_pred CceeecCccccChhhhccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhc-ccCCCCCCCHHHH Q lcl|NC_015159. 300 VLFFVNPNGVTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSA-VQRGGDRVTAEEI 378 (532) Q Consensus 300 p~~lv~~~g~~~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~-~~~~~~~~TAtEi 378 (532) |||+++++|++++.++.+++.+.+..|..++..+.|++++++|+.+++.|++++++|+++||.+.+ .++++++|||||| T Consensus 307 p~~~v~~~g~~~~~~l~pgg~~~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV 386 (549) T protein:vir:10 307 PPLLANEDGVLDGFDLRSGALNWGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEV 386 (549) T ss_pred CceeeccccccccceeccCCccccccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHH Confidence 999999999999999988877777767666677788888899999999999999999999999974 4579999999999 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccce--e-ecchHHHHHHHHHH------HHHHH Q lcl|NC_015159. 379 RYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPA--I-ATGLEALGRGHDLN------KLNVF 449 (532) Q Consensus 379 ~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~--~-v~~l~~l~raq~~~------~l~~~ 449 (532) ++|++|++++|||||+||++|||.|||+|+|++|+++|+||++|++++... + ++|+++|+|+|+.+ +++++ T Consensus 387 ~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i~yis~La~aq~~~~~~~i~~~~~~ 466 (549) T protein:vir:10 387 LQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQ 466 (549) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEEEeecHHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999999999999999986432 2 77888888877654 33455 Q ss_pred HHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCC Q lcl|NC_015159. 450 IDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGL 529 (532) Q Consensus 450 ~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 529 (532) ++.++|+.|+++|+||+|++++++++++|||+ .++||++|++++++++++|++++++.+++. .++.+++.+.+++++- T Consensus 467 ~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~-~~irs~eev~~~r~~~~~qqq~~~~~~~a~-~a~~~a~~~~~~~ta~ 544 (549) T protein:vir:10 467 LGIVSQFDPAAAKVPNGARIARLLADYGGVPV-EAMSTDEELQAQQAAEAQAAQMQQMLAAAP-VAAGAIKDLSDAQTAA 544 (549) T ss_pred HHHHhccChhHHhcCCHHHHHHHHHHhcCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhhhhhcCCC Confidence 66678889999999999999999999999976 699999999999998877777666655544 4444667788788876 Q ss_pred CCC Q lcl|NC_015159. 530 PTQ 532 (532) Q Consensus 530 ~~~ 532 (532) +|- T Consensus 545 ~~~ 547 (549) T protein:vir:10 545 QTA 547 (549) T ss_pred ccc Confidence 666 No 20 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=100.00 E-value=1.3e-161 Score=902.60 Aligned_cols=509 Identities=26% Similarity=0.420 Sum_probs=460.4 Q ss_pred CCCC--CCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_015159. 1 MAEV--EKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLAL 78 (532) Q Consensus 1 m~~~--~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 78 (532) |++- .+++..+++|++||++|+++|++||++|+||++||+|++++++++ +++.+++|||||++|+++|||||||+| T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~--~~~~~~~~dstg~~a~~~LAa~l~~~l 78 (516) T protein:vir:96 1 MKQSIDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGD--NETSQNGWQGVGAQATNHLANKLAQVL 78 (516) T ss_pred CcchhhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcccccCCCCC--ccccCCcccchHHHHHHHHHHHHHhhh Confidence 8877 677889999999999999999999999999999999999877654 345678999999999999999999999 Q ss_pred cCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCC Q lcl|NC_015159. 79 FPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQ 158 (532) Q Consensus 79 tpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~ 158 (532) |||++|||||+++|..++..+..+.+..++++||++||++|+.+|++||||.++|++|+||++|||||||+++++ T Consensus 79 tpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~----- 153 (516) T protein:vir:96 79 FPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSKG----- 153 (516) T ss_pred cCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCCC----- Confidence 999999999999999888888778888999999999999999999999999999999999999999999997643 Q ss_pred cceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHH--hhcccCCCcceEEEEEEEEeeCCCCeEEEEEEE Q lcl|NC_015159. 159 SNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLE--EAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEI 236 (532) Q Consensus 159 ~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~--~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~ 236 (532) +|++|||++|||++|++|+|++||||+++++++|++++.+... +...+++++++|+|||+|+|++++. |..++.+ T Consensus 154 --~~~~~pl~~y~v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~-~~~~~~~ 230 (516) T protein:vir:96 154 --AISAIPMHHYVVNRDTNGDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDGF-WELKQSA 230 (516) T ss_pred --CEEEEEcCeEEEeeCCCCCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCce-eEEEEEe Confidence 4899999999999999999999999999999999999977543 2344568899999999999988763 5555566 Q ss_pred cCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhc Q lcl|NC_015159. 237 DGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVA 316 (532) Q Consensus 237 ~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~ 316 (532) ++.. .+.+++|||++|||+++||++.+|++||||||+++|||+|+||.|+++++++++++++|||+|+|+|+++++++. T Consensus 231 d~~~-~~~es~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~ 309 (516) T protein:vir:96 231 DDIP-VGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFV 309 (516) T ss_pred Ccee-eccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhhhc Confidence 6665 478899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHH Q lcl|NC_015159. 317 KANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLL 396 (532) Q Consensus 317 ~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl 396 (532) ++++|.+++|.++++.++++++++||+.+++.|++++++|+++||++.+.++++++||||||++|++|++++|||||+|| T Consensus 310 ~~~~g~i~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~r~~~rvTAtEV~~r~~E~~~~LGpv~~rl 389 (516) T protein:vir:96 310 NSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLF 389 (516) T ss_pred cCCCceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhccCCCccccHHHHHHHHHHHHHHhhhHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHH---HhhcchhhhhcCHHHHHHHH Q lcl|NC_015159. 397 SQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYM---IKLAGLQDDDINLLDVKMRL 473 (532) Q Consensus 397 ~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~l---aq~~p~~~d~id~d~~~~~~ 473 (532) +.|||.|||+|++.++ .|++|+.++++++|+||++|+|+|+++++..|++.+ ++++|+++|+||+|+++++| T Consensus 390 ~~Ell~Pli~r~l~~~-----~p~lp~~~v~~~~vs~l~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~~ 464 (516) T protein:vir:96 390 ATTMQSPVAMWGLLEA-----GESFTSDLVDPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWV 464 (516) T ss_pred HHHHHHHHHHHHHHhc-----CCCCccccccceeechHHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHH Confidence 9999999999998765 388999999999999999999999998888877766 56679999999999999999 Q ss_pred HHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccc Q lcl|NC_015159. 474 ANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQ 526 (532) Q Consensus 474 a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (532) ++++|||+ .++||+||++++++|+++++++++++++++.+....+++..++- T Consensus 465 a~~~Gvp~-~~irs~eev~~~~~~~~~~q~~~~~a~~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:96 465 RGQISAEL-PFLKSAEEMAQEQEAQMQAQQAQMLEEGVAKAVPGVIQQELKEA 516 (516) T ss_pred HHHhCCCc-cccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhcccccC Confidence 99999976 59999999999999888877766666555554433333333322 No 21 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=100.00 E-value=4.4e-161 Score=899.72 Aligned_cols=509 Identities=26% Similarity=0.419 Sum_probs=458.7 Q ss_pred CCCC--CCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_015159. 1 MAEV--EKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLAL 78 (532) Q Consensus 1 m~~~--~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 78 (532) |.+. ++++..+++|++||++||++|++||++|+||++||+|++++++++. ++.+++|||||++|+++|||||||+| T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~--~~~~~~~dstg~~a~~~LAa~l~~~l 78 (516) T protein:vir:10 1 MKQSTDLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGDN--ETSQNGWQGVGAQATNHLANKLAQVL 78 (516) T ss_pred CCchhhHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccCCCCCc--ccccccccchHHHHHHHHHHHHHhhh Confidence 8877 7888899999999999999999999999999999999998876653 34568999999999999999999999 Q ss_pred cCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCC Q lcl|NC_015159. 79 FPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQ 158 (532) Q Consensus 79 tpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~ 158 (532) |||++|||||+++|..+++.+..+.+..++++||++||++++.+|++||||.++|++|+||++|||||+|+++++ T Consensus 79 tpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~----- 153 (516) T protein:vir:10 79 FPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSKG----- 153 (516) T ss_pred cCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCCC----- Confidence 999999999999999888888777888899999999999999999999999999999999999999999997643 Q ss_pred cceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHh--hcccCCCcceEEEEEEEEeeCCCCeEEEEEEE Q lcl|NC_015159. 159 SNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEE--AQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEI 236 (532) Q Consensus 159 ~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~--~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~ 236 (532) +|++|||++|||++|++|+||+||||+++++++|++++.+.... ...+++++.+|+|||+|++++++. |..++.+ T Consensus 154 --~~~~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~~~~~~-~~~~~~~ 230 (516) T protein:vir:10 154 --AISAIPMHHYVVNRDTNGDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKYLGEGF-WELKQSA 230 (516) T ss_pred --CeEEEEcCeEEEeeCCCCCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEecCCCc-eEEEEee Confidence 38999999999999999999999999999999999999875433 334567899999999999987653 6666667 Q ss_pred cCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhc Q lcl|NC_015159. 237 DGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVA 316 (532) Q Consensus 237 ~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~ 316 (532) ++.. .+.+++|||++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||+|+|+|++++.++. T Consensus 231 d~~~-~~~~s~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~ 309 (516) T protein:vir:10 231 DDIP-VGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFV 309 (516) T ss_pred Ccee-eccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhhhc Confidence 7765 478899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHH Q lcl|NC_015159. 317 KANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLL 396 (532) Q Consensus 317 ~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl 396 (532) ++++|.+++|.++++.++++++++||+.+++.|++++++|+++||++.+.++++++||||||++|++|++++|||||+|| T Consensus 310 ~~~~g~~~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl 389 (516) T protein:vir:10 310 NSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLF 389 (516) T ss_pred cCCCceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhhHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHH---HhhcchhhhhcCHHHHHHHH Q lcl|NC_015159. 397 SQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYM---IKLAGLQDDDINLLDVKMRL 473 (532) Q Consensus 397 ~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~l---aq~~p~~~d~id~d~~~~~~ 473 (532) ++|||.|||+|++. +++|++|++++++++|+||++|+|+|+++++..|++.+ +|++|+++|+||+|++++++ T Consensus 390 ~~Ell~Pli~r~~~-----~~~p~~P~~lv~~~~v~~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~~~~~ 464 (516) T protein:vir:10 390 ATTMQSPVAMWGLL-----EAGDSFTSDLVDPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWV 464 (516) T ss_pred HHHHHHHHHHHHHH-----hhCCCCChhhcCcceehhHHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHHHHHH Confidence 99999999999975 55799999999999999999999999999887776665 57788999999999999999 Q ss_pred HHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccc Q lcl|NC_015159. 474 ANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQ 526 (532) Q Consensus 474 a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (532) ++.+|||+ .++||++||+++++|++++++.+.++++++.+.+..++.-+.+- T Consensus 465 a~~~gvp~-~~irs~eev~~~r~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:10 465 RGQISAEL-PFLKSAEEMEQEQEAQMQAQQAQMLEEGVAKAVPGVIQQELKEA 516 (516) T ss_pred HHHhCCCh-hccCCHHHHHHHHHHHHHHHHHHHHHHHhhhcccchhhhhhhcC Confidence 99999965 69999999999999887766655554444443333222222222 No 22 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=100.00 E-value=1.4e-159 Score=891.40 Aligned_cols=511 Identities=28% Similarity=0.444 Sum_probs=458.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFP 80 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltp 80 (532) |. -+-+.++++|++||++||++|++|+++|+||++||+|+++++++++ ++.+++|||||++|+++|||||||+||| T Consensus 1 ~~--~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~--~~~~~~~dstg~~a~~~LAa~l~~~ltp 76 (517) T protein:vir:10 1 MD--MRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDD--LSSQNAWQDDGASATNFLSNKLSQVLFP 76 (517) T ss_pred Cc--ccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCC--ccccccccchHHHHHHHHHHHHHHhhcC Confidence 32 2345578899999999999999999999999999999998877643 3457899999999999999999999999 Q ss_pred CCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcc Q lcl|NC_015159. 81 VGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSN 160 (532) Q Consensus 81 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~ 160 (532) |++|||||+++|.++.+.+.+....++|+.||+.||++++.+|++||||.++|++|+||++|||||+|+++ ++. T Consensus 77 p~~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~------~~~ 150 (517) T protein:vir:10 77 AQRSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYHPD------KTS 150 (517) T ss_pred CCCccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEeC------CCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999853 345 Q ss_pred eEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHh--hcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcC Q lcl|NC_015159. 161 APKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEE--AQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDG 238 (532) Q Consensus 161 ~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~--~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~ 238 (532) +|++|||++|||++|++|+||+||||+++++++|+++|++.... ....++|+++|+|||+|+|+++++ |.++++++| T Consensus 151 ~~~~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~-~~~~~~~d~ 229 (517) T protein:vir:10 151 PIQAVPLHHYCVRRDNNGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRTKDGK-YLIRQSADD 229 (517) T ss_pred cEEEEEcCeEEEeeCCCcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEEeCCCc-eEEEEEeCc Confidence 79999999999999999999999999999999999999987543 334678999999999999988874 777888888 Q ss_pred cccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccC Q lcl|NC_015159. 239 EIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKA 318 (532) Q Consensus 239 ~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~ 318 (532) +.+ +.++.|+|++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||+|+|+|++++.++.++ T Consensus 230 ~~~-~~~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~l~~~ 308 (517) T protein:vir:10 230 VPV-GKESTVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQFVEG 308 (517) T ss_pred eee-ccccccccccCCeeeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhccCC Confidence 765 7788999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHH Q lcl|NC_015159. 319 NTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQ 398 (532) Q Consensus 319 ~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~ 398 (532) ++|++++|.++++.++++++++||+.+++.|++++++|+++||++.+.++++++||||||++|++|++++|||||+||++ T Consensus 309 ~~g~~~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ 388 (517) T protein:vir:10 309 GSGAVLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEAMTRRDAERVTAYEIQRDAMLVEQSLGGVYSLFAT 388 (517) T ss_pred CccccccCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHH Confidence 99999999999999999999999999999999999999999999998899999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHH---hhcchhhhhcCHHHHHHHHHH Q lcl|NC_015159. 399 ELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMI---KLAGLQDDDINLLDVKMRLAN 475 (532) Q Consensus 399 E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~la---q~~p~~~d~id~d~~~~~~a~ 475 (532) |||.|||+|+|++|.+. +|...++++++++|++|+|+++++++..|++.++ +++|.+.++||+|+++++||+ T Consensus 389 Ell~Pli~r~~~~l~~~-----l~~~~v~~~~~s~la~l~r~~~~~~i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~ 463 (517) T protein:vir:10 389 TFQGPLARWFMNGISSI-----LTSKNVSPTILTGIEALGRMAELDKLGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQG 463 (517) T ss_pred HHHHHHHHHHHHHhhhh-----cCCCCccceeeccHHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHH Confidence 99999999999998653 3445678999999999999999999998877765 454566778999999999999 Q ss_pred hcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHH-HHHHHhhcccccCC Q lcl|NC_015159. 476 SLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAG-GQAAAAMMQQQAGL 529 (532) Q Consensus 476 ~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~g~ 529 (532) ++|||+ .++||++|++++++++++++++++++++++.+. +...++...++.|+ T Consensus 464 ~~Gvp~-~~irs~~ev~~~~~~~~~~~~~~~~~~~ag~~~~~~~~~~~~~~~~~~ 517 (517) T protein:vir:10 464 QISANF-PFFKTQDELNAEAQAQQEQEATKYAAEQAGKAIPDMVKNGQINPQGGQ 517 (517) T ss_pred HhCCCh-hhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCCC Confidence 999965 799999999999988887776666554444433 33333444455555 No 23 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=100.00 E-value=4.5e-158 Score=883.24 Aligned_cols=515 Identities=15% Similarity=0.129 Sum_probs=435.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCC---CCcccccccccccchHHHHHHHHHHHHHHh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSA---TADGSTSYTTPWQSIGARGLNNLASKLMLA 77 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~---~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 77 (532) |++++ +++|++||++|+++|++||++|+||++||+|+++++. .++++++..++|||||++|+++|||||||+ T Consensus 1 m~~~~-----~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ 75 (556) T protein:vir:73 1 MAETE-----KERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMSG 75 (556) T ss_pred CChhh-----HHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcCccccchHHHHHHHHHHHHHHh Confidence 99977 6789999999999999999999999999999987654 345566788999999999999999999999 Q ss_pred hcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccC Q lcl|NC_015159. 78 LFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEG 157 (532) Q Consensus 78 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~ 157 (532) ||||++|||||+++|+++.+. .+|++||++||++|+++|++||||.++|++|+||++||||++|++++ .. T Consensus 76 ltpp~~~WF~l~~~d~~~~~~-------~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~---~~ 145 (556) T protein:vir:73 76 ITSPARPWFKLATPDPDMMDY-------GPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMED---DQ 145 (556) T ss_pred hcCCCCcccccccCcccccch-------HHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeec---CC Confidence 999999999999998876654 57999999999999999999999999999999999999999999865 35 Q ss_pred CcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHH-----HHHhhcccCCCcceEEEEEEEEeeC------- Q lcl|NC_015159. 158 QSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRK-----SLEEAQGDQNPSEEVTIYTHVYRDP------- 225 (532) Q Consensus 158 ~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~-----~~~~~~~~~~~~~~v~i~~~v~~~~------- 225 (532) ++++|++|||++|||++|+.|+||+||||+++|++++.++|+. .+++...++.++++|+|+|+|+|+. T Consensus 146 ~~~r~~~~~l~~~~~~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~V~pr~~~~~~~~ 225 (556) T protein:vir:73 146 DVIRTMPFPIGSYYLANSPRGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHCITPNVNRDSGKM 225 (556) T ss_pred ceEEEEEeecceeEEeeCCCCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEEEecccccccccc Confidence 7789999999999999999999999999999999998776653 3344444445567899999999954 Q ss_pred --CCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_015159. 226 --EAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRS-FVEEYLGDLKSLENLYEAIVKMSMISSKVLF 302 (532) Q Consensus 226 --~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~G-p~~~al~d~~~L~~l~~~~l~~~~~a~~p~~ 302 (532) .+|||.|+||+.+......++++||++|||+++||++.+|++|||| |++++|||+|+||.|+++++++++++++||| T Consensus 226 ~~~~~p~~s~~~~~~~~~~~vl~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~ 305 (556) T protein:vir:73 226 DSKNKPYRSVYFESGGDSDKLLRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPM 305 (556) T ss_pred CcccceEEEEEEEecCCCceecccCCcccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 3689999999865544445678899999999999999999999999 8999999999999999999999999999999 Q ss_pred eecCccccChhhhccCC-CceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhh---hcccCCCCCCCHHHH Q lcl|NC_015159. 303 FVNPNGVTQIRRVAKAN-TGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLN---SAVQRGGDRVTAEEI 378 (532) Q Consensus 303 lv~~~g~~~~~~~~~~~-~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~---~~~~~~~~~~TAtEi 378 (532) ++++++...+.++.+++ ++...++..+++.++..++ ++++.+.+.|++++++|+++||.+ ++.++++++|||||| T Consensus 306 ~v~~~~~~~~~~~~pgg~~~~~~~~~~~~i~p~~~~~-~d~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r~TAtEv 384 (556) T protein:vir:73 306 VAPTSLKNQRVSLLPGDVTYLDVISGQDGFKPAYLVN-PNTADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAV 384 (556) T ss_pred eccccccccceeeccCccccccCCCCccceeeecccc-ccHHHHHHHHHHHHHHHHHHhhcchhhhhccCCCCCccHHHH Confidence 99999877655554433 2223456666777765454 679999999999999999999977 456699999999999 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee-ecchHHHHHHHHH------HHHHHHHH Q lcl|NC_015159. 379 RYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI-ATGLEALGRGHDL------NKLNVFID 451 (532) Q Consensus 379 ~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~-v~~l~~l~raq~~------~~l~~~~~ 451 (532) ++|++|++++|||||+||++|||.|||+|+|++|+|+|+||++|+++.+..+ |+|+++|+|+|+. ++++++++ T Consensus 385 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aqk~~~~~~i~~~~~~~~ 464 (556) T protein:vir:73 385 IEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRIEYISVMAQAQKSIGLTSLSQTVGFIG 464 (556) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeecHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999999877666 6778888877764 46667788 Q ss_pred HHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHH------------HHH Q lcl|NC_015159. 452 YMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGG------------QAA 519 (532) Q Consensus 452 ~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~------------~~~ 519 (532) .++|+.|+++|+||+|++++++++++|||+ .++||++||++.|+|++++++++++.++..++.+ ..+ T Consensus 465 ~laq~~Pe~~d~id~d~~~~~~a~~~Gvp~-~~irs~eev~~~rq~r~~~qq~~~~~~~~~~a~~~~~~~~~~~~~~~~~ 543 (556) T protein:vir:73 465 QLAQFKPEALDKLDVDQAIDAFSEMSGVSP-TVIVPQEQVQGIREERAKQAQAAQAMAMGQAAAQGAKTLSETQTSDPSA 543 (556) T ss_pred HHhccChhhHhcCCHHHHHHHHHHHcCCCh-hhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCHHH Confidence 888999999999999999999999999965 7999999999998887666555444333332211 111 Q ss_pred HhhcccccCCCCC Q lcl|NC_015159. 520 AAMMQQQAGLPTQ 532 (532) Q Consensus 520 ~~~~~~~~g~~~~ 532 (532) ...+.+..|-|.| T Consensus 544 l~~~~~~~g~~~~ 556 (556) T protein:vir:73 544 LTAIANAAGAPQQ 556 (556) T ss_pred HHHHHHhhcCCCC Confidence 1112223344555 No 24 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=100.00 E-value=3e-157 Score=878.71 Aligned_cols=507 Identities=15% Similarity=0.150 Sum_probs=429.5 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCc------ccccccccccchHHHHHHHHHHHHHHhhcCCC Q lcl|NC_015159. 9 FAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATAD------GSTSYTTPWQSIGARGLNNLASKLMLALFPVG 82 (532) Q Consensus 9 ~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~------~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~ 82 (532) +++++|++||+.|+++|++||++|+||++||+|++++++++. +.++.+++|||||++|+++|||||||+||||+ T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~ltPp~ 80 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLTSPA 80 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 899999999999999999999999999999999997654432 23568899999999999999999999999999 Q ss_pred CCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcceE Q lcl|NC_015159. 83 SSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAP 162 (532) Q Consensus 83 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~ 162 (532) +|||||++.|.++.+. .++++||++||+.|+++|++||||.++|++|+||++||||++|++++. +..++++| T Consensus 81 ~~WF~l~~~d~~~~~~-------~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~-~~~~~~r~ 152 (547) T protein:vir:10 81 TKWFELAFRDKELNSD-------DECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDE-DEEGSVVF 152 (547) T ss_pred CcccccccCCccccch-------HHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCC-CCCCceeE Confidence 9999999998876653 579999999999999999999999999999999999999999998653 34578899 Q ss_pred EEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHH-----HHHhhcccCCC---cceEEEEEEEEeeC--------- Q lcl|NC_015159. 163 KLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRK-----SLEEAQGDQNP---SEEVTIYTHVYRDP--------- 225 (532) Q Consensus 163 ~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~-----~~~~~~~~~~~---~~~v~i~~~v~~~~--------- 225 (532) ++|||++|||++|++|+||+||||++||++++.++|+. .+++. .++++ +.++++||+|+|+. T Consensus 153 ~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~-~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~ 231 (547) T protein:vir:10 153 QSSPIQDSYFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKK-AKEASNQAALKQEVVMCVFTRYDKKQNRNAG 231 (547) T ss_pred EEeecceEEEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHH-HhcCCCcccceEEEEEEEeeccCCCCCcccc Confidence 99999999999999999999999999999998766553 22222 23333 44899999999864 Q ss_pred -----CCCeEEEEEEEc-CcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015159. 226 -----EAMVFRSYQEID-GEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSK 299 (532) Q Consensus 226 -----~~~~~~s~~~~~-~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~ 299 (532) ++|||+|+|++. |... .++++||++|||+++||++.+|++|||||++++|||+|+||.|+++++++++++++ T Consensus 232 ~~~~~~~~p~~s~~~e~~~~~~--~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~ 309 (547) T protein:vir:10 232 TVLAPTERPFGKKWILKEGAVQ--LGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVID 309 (547) T ss_pred ceeeccccceeEEEEEecCcee--eeecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 368999987654 4333 35577889999999999999999999999999999999999999999999999999 Q ss_pred CceeecCccccChhhhccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHH Q lcl|NC_015159. 300 VLFFVNPNGVTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIR 379 (532) Q Consensus 300 p~~lv~~~g~~~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~ 379 (532) |||+|+++|++++.++ .++|.++.|..+++.++ +++++|+.+++.|++++++|+++||.+.+.++++++||||||+ T Consensus 310 pp~~v~~~g~~~~~~~--~pgg~~~~~~~~~v~pl--~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~TAtEV~ 385 (547) T protein:vir:10 310 PAIMVTERGLISDIDL--GASGLTVVRDMESMKPF--ESRARFDVSSIQLTDLRSAVRRIYYVDQLQMKDSPAMTATEVQ 385 (547) T ss_pred Cceeccccccccccee--cCCeeeecCCcccceee--ecccchHHHHHHHHHHHHHHHHHhhhhhhhcCCCccccHHHHH Confidence 9999999999998664 34566677888888765 5668999999999999999999999999889999999999999 Q ss_pred HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccce----eecchHHHHHHHHHH------HHHHH Q lcl|NC_015159. 380 YVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPA----IATGLEALGRGHDLN------KLNVF 449 (532) Q Consensus 380 ~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~----~v~~l~~l~raq~~~------~l~~~ 449 (532) +|++|++++|||||+||++|||.|||+|+|++|+++|+||++|++++++. .|+|+++|+|+|+.+ +++++ T Consensus 386 ~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~ 465 (547) T protein:vir:10 386 VRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGS 465 (547) T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999987543 378999999998654 33445 Q ss_pred HHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCC Q lcl|NC_015159. 450 IDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGL 529 (532) Q Consensus 450 ~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 529 (532) ++.++|+.|+++|+||+|++++++++++|||+ .++||++|++++++|++++++++++++++ ++++.+++++....+.| T Consensus 466 v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~-~~irs~eev~~~r~qr~~~~q~~~qaa~~-~~~g~~m~~~~~~~a~~ 543 (547) T protein:vir:10 466 TAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQ-TLMRPKAKVTSIRKNRSQTQQKAEQAAIA-EAEGNAMEAQGKGQAAL 543 (547) T ss_pred HHHhhccChhhhhcCCHHHHHHHHHHHhCCCh-hccCCHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhhcCcccch Confidence 55567889999999999999999999999965 69999999999988876665544332222 33333333333222221 Q ss_pred CCC Q lcl|NC_015159. 530 PTQ 532 (532) Q Consensus 530 ~~~ 532 (532) -.- T Consensus 544 ~~~ 546 (547) T protein:vir:10 544 KEN 546 (547) T ss_pred hcc Confidence 111 No 25 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=100.00 E-value=5.1e-157 Score=877.44 Aligned_cols=514 Identities=14% Similarity=0.113 Sum_probs=432.1 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCC---CcccccccccccchHHHHHHHHHHHHHHh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSAT---ADGSTSYTTPWQSIGARGLNNLASKLMLA 77 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~---~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 77 (532) |++++ +++|++||+.|+++|++||++|+||++||+|+++++.+ +.++++..++|||||++|+++|||||||+ T Consensus 1 m~~~~-----~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ 75 (559) T protein:vir:95 1 MAETT-----KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSG 75 (559) T ss_pred CChhh-----HHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcccccccccccchHHHHHHHHHHHHHHh Confidence 99987 67899999999999999999999999999999987543 45667788999999999999999999999 Q ss_pred hcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccC Q lcl|NC_015159. 78 LFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEG 157 (532) Q Consensus 78 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~ 157 (532) ||||++|||||+++|+++.+. .++++||++||+.|+++|++||||.++|++|+||++|||||+|++++ .. T Consensus 76 ltpp~~~WF~l~~~d~~~~e~-------~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d---~~ 145 (559) T protein:vir:95 76 ITSPARPWFRLATPDPEMMDY-------GPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDD---DE 145 (559) T ss_pred hcCCCCcccccccCCccccch-------HHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecC---CC Confidence 999999999999998776653 57999999999999999999999999999999999999999999865 35 Q ss_pred CcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHH-----HHHhhcccCCCcceEEEEEEEEeeCC------ Q lcl|NC_015159. 158 QSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRK-----SLEEAQGDQNPSEEVTIYTHVYRDPE------ 226 (532) Q Consensus 158 ~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~-----~~~~~~~~~~~~~~v~i~~~v~~~~~------ 226 (532) ++++|++|||++|||++|++|+||+||||+++|++++.++|+. .+++...++.++++|+|||+|+|+.+ T Consensus 146 ~~~r~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~ 225 (559) T protein:vir:95 146 DIIRTMPFPIGSYYLANSPRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 (559) T ss_pred ceeEEEEeecCeEEEeeCCCCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEEEecccccccccc Confidence 6789999999999999999999999999999999998776653 23333334445678999999998543 Q ss_pred ---CCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_015159. 227 ---AMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRS-FVEEYLGDLKSLENLYEAIVKMSMISSKVLF 302 (532) Q Consensus 227 ---~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~G-p~~~al~d~~~L~~l~~~~l~~~~~a~~p~~ 302 (532) +|||.|+||+.+......++++||++|||+++||++.+|++|||| |++++|||+|+||.|+++.+++++++++||| T Consensus 226 ~~~~~pf~s~~~e~~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~ 305 (559) T protein:vir:95 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPM 305 (559) T ss_pred ccccceEEEEEEEecCCCceeeecCCcccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCce Confidence 689999999875544445677889999999999999999999999 8999999999999999999999999999999 Q ss_pred eecCccccChhhhccCCCceeecCc-cccccccccCCccchhHHHHHHHHHHHHHHHHHhhh---hcccCCCCCCCHHHH Q lcl|NC_015159. 303 FVNPNGVTQIRRVAKANTGDFVAGR-KQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLN---SAVQRGGDRVTAEEI 378 (532) Q Consensus 303 lv~~~g~~~~~~~~~~~~G~~v~g~-~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~---~~~~~~~~~~TAtEi 378 (532) ++++++.+++.++.+++.+.+..+. .+.+.+.... ..+++.+...|++++++|+++||.+ ++.++++++|||||| T Consensus 306 ~v~~~~~~~~~~l~pgg~~~~~~~~~~~~i~p~~~~-~~~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV 384 (559) T protein:vir:95 306 VAPTSLKNQRASLLPGDITYIDQITGQDGFRPAYLV-NPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAV 384 (559) T ss_pred eccccccccceeeeccceeeeCCCCCcccceeeccc-ccchHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHHH Confidence 9999999888777655443332222 2334444433 3578888999999999999999876 466799999999999 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee-ecchHHHHHHHHH------HHHHHHHH Q lcl|NC_015159. 379 RYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI-ATGLEALGRGHDL------NKLNVFID 451 (532) Q Consensus 379 ~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~-v~~l~~l~raq~~------~~l~~~~~ 451 (532) ++|++|++++|||||+||++|||.|||+|+|++|+|+|+||++|+++....+ |+|+++|+|+|+. ++++++++ T Consensus 385 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~~~ 464 (559) T protein:vir:95 385 IEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIG 464 (559) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceEEEeecHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999998866555 6777777777754 56677888 Q ss_pred HHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhccccc---- Q lcl|NC_015159. 452 YMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQA---- 527 (532) Q Consensus 452 ~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 527 (532) .|+|+.|+++|+||+|++++++++++|||+ .++||++|+++.|+|+++|+++++++++..++++.+ +.+...+. T Consensus 465 ~laq~~Pevld~id~d~~~~~~a~~~Gvp~-~~irs~~ev~~~rqqr~~~qq~~q~~~~~~~aa~~~-~~~~~~~~~~~~ 542 (559) T protein:vir:95 465 QLAQVKPEALDKLNVDQAIDAFADMSGVSP-TVIVPQEQVEQARQQRAQQQQQQQMMAMGMAAAQGV-KTLSEAKTSDPS 542 (559) T ss_pred HHhccChhhhhcCCHHHHHHHHHHHhCCch-hhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hccccccCCChh Confidence 889999999999999999999999999965 699999999999988877766655544444432221 11111111 Q ss_pred ----------CCCCC Q lcl|NC_015159. 528 ----------GLPTQ 532 (532) Q Consensus 528 ----------g~~~~ 532 (532) |..-| T Consensus 543 ~l~~~~~~~~~~~~~ 557 (559) T protein:vir:95 543 VLSAMANAVSGQGGQ 557 (559) T ss_pred HHHHHHHhhcCcccc Confidence 11111 No 26 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=100.00 E-value=9.1e-88 Score=497.77 Aligned_cols=511 Identities=16% Similarity=0.154 Sum_probs=361.9 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcc----------cccCCCCCcccccccccccchHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIP----------SVFPSATADGSTSYTTPWQSIGARGLNNL 70 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P----------~~~~~~~~~~~~~~~~~~dst~~~a~~~L 70 (532) |+..+ -+..|++||+.+|++|+.||.+|+||++|..+ .++...++.......++++++..+++++| T Consensus 20 ~~~~~----~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ki~~~~~~~~~~~l 95 (641) T protein:vir:94 20 LSTDR----IGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRHRINTGHTFEVVETL 95 (641) T ss_pred CCchh----HHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcccccccchhHHHHHHHH Confidence 44333 25569999999999999999999999976544 44444444343445589999999999999 Q ss_pred HHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeec Q lcl|NC_015159. 71 ASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIP 150 (532) Q Consensus 71 aa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~ 150 (532) +++||+++|| +++||++.+.+.+..+ ..++ ++..+...+++++|+..+++.+.+++.+|||++-++ T Consensus 96 ~s~Lm~~~~p-~~~wf~~~p~~~ed~~----------~A~~---~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv~~~ 161 (641) T protein:vir:94 96 VAYFKGATFP-SDDWFDLKGMVPELAD----------AARV---VKQLTKTKLEAASIRDIFETYVRNLVLYGVSTYRLG 161 (641) T ss_pred hhHHhhhhcC-CCceEEEecCCCChHH----------HHHH---HHHHHHHHHhhcchHHHHHHHHHHHhhcCceEEEee Confidence 9999999997 9999999887655432 1222 334566788999999999999999999999988665 Q ss_pred cccccc--------CCc---------------ceEEEEecceEEEeeCCCCCeE----EEEEEEeecHHHhhHH------ Q lcl|NC_015159. 151 STEQVE--------GQS---------------NAPKLYKLHNFVVERDAYDNVL----QIVTEDKIARAALPED------ 197 (532) Q Consensus 151 ~~~~~~--------~~~---------------~~~~~~pl~~~~v~~d~~G~vd----~i~rk~~~~~~~l~~~------ 197 (532) .+.... +++ -.++++|+..+-|-.|+.++++ ++||++++++.+|..+ T Consensus 162 w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~dps~~~~~~~f~~~r~t~~t~~~l~~eg~~~~d 241 (641) T protein:vir:94 162 WDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWLDTSGGKNTGTFVRLRHTREELHELVTSGYYDLD 241 (641) T ss_pred hhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhheeecCCCCcccccceehhhhHHHHHHHHhcCCCChh Confidence 331100 011 1234556655444455666654 4677778888777432 Q ss_pred -HHHHHHhhcccCCCcce----------EEEEEE-EEeeCCCCeEEEEE-EEcCcccccccccCccccCceEEEEeeecC Q lcl|NC_015159. 198 -VRKSLEEAQGDQNPSEE----------VTIYTH-VYRDPEAMVFRSYQ-EIDGEIVAGTEGEYPLDSCPWIPVRLIKMP 264 (532) Q Consensus 198 -~~~~~~~~~~~~~~~~~----------v~i~~~-v~~~~~~~~~~s~~-~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~ 264 (532) +.......+...+++.. .++|++ .-.+.++++|.+|| .+.|+.+....+...|+++||+++||.+.+ T Consensus 242 ~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~Pf~~~r~~~~~ 321 (641) T protein:vir:94 242 LTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVTTTLLPDR 321 (641) T ss_pred hcchhhcccccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeCCEEeecccccccCcCCeEEecceecC Confidence 11111111111112211 123221 12345677887765 446665544433333678999999999999 Q ss_pred CCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCCceeecCccccccccccCCccchhH Q lcl|NC_015159. 265 NEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQV 344 (532) Q Consensus 265 g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~ 344 (532) +++||+||++++|||+|+||.+++..++++.++++|+|+++++|+++++++...++|.+..+..+++.++..+ ..+|+. T Consensus 322 ~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l~~~PG~ii~~~~~~~v~pl~~~-~~~~~~ 400 (641) T protein:vir:94 322 DSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDVKAKPGAVFKVAQHGSLQPIDMG-RQDFVV 400 (641) T ss_pred CcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeeccccccccceeeccCCcceeeCCCCcceeecCC-ccccch Confidence 9999999999999999999999999999999999999999999999999987666566666777888776544 368999 Q ss_pred HHHHHHHHHHHHHHHHhhhhc----ccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc----- Q lcl|NC_015159. 345 AKATADDIEKRLSYAFMLNSA----VQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQAT----- 415 (532) Q Consensus 345 ~~~~i~~~~~rI~~af~~~~~----~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~----- 415 (532) .+..++.++.+|+++|+.+.+ ..+++++||||||+++.+|+..+||+++++|+.||+.||+.|+++++.+. T Consensus 401 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~ 480 (641) T protein:vir:94 401 TYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPE 480 (641) T ss_pred hHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchh Confidence 999999999999999986644 23677889999999999999999999999999999999999999998774 Q ss_pred ------------CCCCCCcccccccee-ecchH---HHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCC Q lcl|NC_015159. 416 ------------SKIPNLPKEAVEPAI-ATGLE---ALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGM 479 (532) Q Consensus 416 ------------g~lp~~p~~~~~~~~-v~~l~---~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv 479 (532) |.+|++|+++ +.++ +.+++ .+.+++++++++++++.+++ .|.++|++|+|.+++.+++..|+ T Consensus 481 i~R~~~~~~~~~~~~~~~p~~L-~~~~~iv~l~~~q~~~~~~~i~~l~~~~~~~a~-~P~v~d~~d~~~~~~~~~~~~g~ 558 (641) T protein:vir:94 481 TIRMYVPEEQMDGFFEVSPEYL-HYPYKFLALGANYVVERERMVTDLLQLLDISGR-VPQIGQSLDYALILEDLLRQMRF 558 (641) T ss_pred hhhhhchhhhcccCCCCCccce-eeeeeEeecchhHHHHHHHHHHHHHHHHHHhhc-ChhhhhcCCHHHHHHHHHHHhCC Confidence 4555555544 3332 23454 45577888999999998888 59999999999999999988664 Q ss_pred C-HhHccCCHHHHHHHHHHHHHHHHHH---HHHHhhhHHHHHHHHh-------hcccccCCCCC Q lcl|NC_015159. 480 D-TTGLILTQQDKQAKMAEASTAAGMV---TAGQQMGAAGGQAAAA-------MMQQQAGLPTQ 532 (532) Q Consensus 480 ~-p~~i~~s~ee~~~~~~q~~~~~~~~---~~~~~~~~~~~~~~~~-------~~~~~~g~~~~ 532 (532) + |..++|++|..++..++++++++++ ++++..+.+..++.++ .|.++.|++|. T Consensus 559 ~~p~~~ir~~~~~~~~~~~~~~~~q~~~~~~a~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 622 (641) T protein:vir:94 559 TDPMRYIKKAEAPPAAPPIAPAEPGALPPEMMNSVGGGLNDQAIAGMTPEDVSDLASRIGIDTS 622 (641) T ss_pred CCchhhccCccCchhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHHHHHHhhcCCch Confidence 2 7778888764433332222222211 1211112222222222 24455566655 No 27 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=100.00 E-value=5.5e-71 Score=405.79 Aligned_cols=515 Identities=14% Similarity=0.174 Sum_probs=365.8 Q ss_pred CCCCCCCc-----cCHHH----HHHHHHHHHHHhhhHHHHHHHHHHhhcccc-------cCCC---CCcccccccccccc Q lcl|NC_015159. 1 MAEVEKTG-----FAADG----AAAAYNRLKNDRGAYETRAEDCATYTIPSV-------FPSA---TADGSTSYTTPWQS 61 (532) Q Consensus 1 m~~~~~~~-----~~~~~----~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~-------~~~~---~~~~~~~~~~~~ds 61 (532) ||.+.... .+.+. +.++|+.+++.|+.|+++|++++++..+.. .... ..+......+++.+ T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~rs~~~~~ 82 (651) T protein:vir:80 3 LATTTTDKNRQTYDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRHKITTG 82 (651) T ss_pred ccccccchhhhhhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCCCCCCccccCh Confidence 66552222 23344 889999999999999999999998877741 1111 11222234578999 Q ss_pred hHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHh Q lcl|NC_015159. 62 IGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLV 141 (532) Q Consensus 62 t~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~ 141 (532) +-..+++++.+.|+..+|| +.+||++.+.+... ..+.+-.-|+..+...+++++|+..++.+++|+++ T Consensus 83 ~v~~~ve~~~~~l~~~~~~-~~~~~~~~p~~~~d-----------~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~ 150 (651) T protein:vir:80 83 KAFEAIETIHAYLMSATFP-NKNWFDVVPAKPGQ-----------DNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLI 150 (651) T ss_pred hHHHHHHHHHHHHHHhhcC-CCceeEeccCCchh-----------HHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcc Confidence 9999999999999999997 69999998853221 23444455677777889999999999999999999 Q ss_pred hCceeeeecccccc----------------------------cCCcceEEEEecceEEEeeCCCCCeEEEE-EEEeecHH Q lcl|NC_015159. 142 AGNVLLYIPSTEQV----------------------------EGQSNAPKLYKLHNFVVERDAYDNVLQIV-TEDKIARA 192 (532) Q Consensus 142 ~G~~~~~v~~~~~~----------------------------~~~~~~~~~~pl~~~~v~~d~~G~vd~i~-rk~~~~~~ 192 (532) +|||++-+.++... ..+..+++.+|+.+|++..++.+.-|+-| .+..++.. T Consensus 151 ~G~~i~kv~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~d~~~v~~~~~t~~ 230 (651) T protein:vir:80 151 TGNSVLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFIRKLTKTKA 230 (651) T ss_pred cCceEEEEeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeeecCCCcCccccceeeeeeeeHH Confidence 99999866543110 01235789999999999998877655533 23345555 Q ss_pred HhhHHH----------HHHHHhhc-------------------ccCCCcceEEEEEE-EEeeCCCCeEEEEEEEcC-ccc Q lcl|NC_015159. 193 ALPEDV----------RKSLEEAQ-------------------GDQNPSEEVTIYTH-VYRDPEAMVFRSYQEIDG-EIV 241 (532) Q Consensus 193 ~l~~~~----------~~~~~~~~-------------------~~~~~~~~v~i~~~-v~~~~~~~~~~s~~~~~~-~~~ 241 (532) ++-+.+ ...+.+.. ...++..+|+||+| ++.+.+++.+++++.+.+ +.+ T Consensus 231 ~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~v~~~g~~i 310 (651) T protein:vir:80 231 DILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIMGNEV 310 (651) T ss_pred HHHHHHhcccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeeccCCceEEEEEEEcCcEE Confidence 432111 01111100 01144578899987 556778888988887654 434 Q ss_pred ccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCCc Q lcl|NC_015159. 242 AGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTG 321 (532) Q Consensus 242 ~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~G 321 (532) ...+.....++|||+++||.+.+|+.||+||++.++|+++.||.+++..++++.++++|+|+|++||+++++++...++| T Consensus 311 l~~~~~~~~~~~Pf~~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l~~~pg~ 390 (651) T protein:vir:80 311 LRFEQNPYWCGRPFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDVYTEPGK 390 (651) T ss_pred ecccccCCCCCCCeeeecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHhhcCCCc Confidence 32222222368999999999999999999999999999999999999999999999999999999999999999876667 Q ss_pred eeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhccc----CCCCCCCHHHHHHHHHHHHHHhhhhHHHHH Q lcl|NC_015159. 322 DFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQ----RGGDRVTAEEIRYVAGELEDTLGGVYSLLS 397 (532) Q Consensus 322 ~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~----~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~ 397 (532) .++.+.++++.+++.+ ..+++.++..++.++++|++.|+...+.+ ++.+++|||||+.+++++..+||++|++|+ T Consensus 391 vi~~~~~~~~~~l~~~-~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~ 469 (651) T protein:vir:80 391 VFLVSDHGDLQPLANQ-SSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIE 469 (651) T ss_pred eEEecCCCCceeeccC-cccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHH Confidence 7778999988877644 35799999999999999999997755433 556779999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCCCCcc----------------cccccee-ecchHH---HHHHHHHHHHHHHHHHHHhhc Q lcl|NC_015159. 398 QELQLPLVKILLKELQATSKIPNLPK----------------EAVEPAI-ATGLEA---LGRGHDLNKLNVFIDYMIKLA 457 (532) Q Consensus 398 ~E~l~Pli~r~~~il~r~g~lp~~p~----------------~~~~~~~-v~~l~~---l~raq~~~~l~~~~~~laq~~ 457 (532) .||+.||+.|++.++++.+..|+++. ++++.++ +.++++ +.|.+.++++..+++.+++ . T Consensus 470 ~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~~~~q~~~~-~ 548 (651) T protein:vir:80 470 ETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRLTFIQAVAQ-V 548 (651) T ss_pred HHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHHHHHHhhcc-C Confidence 99999999999999999987775443 1222222 223443 4577888999999998877 5 Q ss_pred chhhhhcCHHHHHHHHHHhcCC-CHhHccCCHHHHHHHHHHH------HH---HHHHHHHHHhhhHHHHHHHHhhccccc Q lcl|NC_015159. 458 GLQDDDINLLDVKMRLANSLGM-DTTGLILTQQDKQAKMAEA------ST---AAGMVTAGQQMGAAGGQAAAAMMQQQA 527 (532) Q Consensus 458 p~~~d~id~d~~~~~~a~~~Gv-~p~~i~~s~ee~~~~~~q~------~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (532) |...+.+|+..+++.+++.+|+ .|..++..+++.+....+. +. +++.+++..++ .+.+..+++.++. T Consensus 549 p~~~~~~~~~~~~~~l~~~~g~~~~~~~l~~~~q~~~~~~~~~~~~q~~~~~~~a~~~~~~~~~---~~~~~~~~~~~~~ 625 (651) T protein:vir:80 549 PEMGQLVDYKRILVDLLQHWGFEEPEAYLKQQDQQAPANPQEALLSQAKDVGGQAMSNMLQNQL---QADGGTQMMSEMY 625 (651) T ss_pred CccchhhhHHHHHHHHHHHcCCCCcHHhcCCCccchhhhhhHHHHhhHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHH Confidence 7777888999999999999998 4666776654432211111 00 01111111111 1111112222222 Q ss_pred CCCC-----------C Q lcl|NC_015159. 528 GLPT-----------Q 532 (532) Q Consensus 528 g~~~-----------~ 532 (532) ++.. + T Consensus 626 ~~~~~~~~~~~~~~~~ 641 (651) T protein:vir:80 626 GTPNADQMQQELMATT 641 (651) T ss_pred HHHHHHHHHHHHHHHH Confidence 2211 1 No 28 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=100.00 E-value=6.1e-40 Score=235.57 Aligned_cols=496 Identities=13% Similarity=0.086 Sum_probs=325.0 Q ss_pred CCCCCC--Ccc-----CHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEK--TGF-----AADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASK 73 (532) Q Consensus 1 m~~~~~--~~~-----~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~ 73 (532) |+-+.+ ..+ .+..++++|+...+.|+.|+..|.|+++|..-+.....+........++|-+.....+.++.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~~~~~~~~~~r~~~~~~k~~~~~~~i~~~ 80 (584) T protein:vir:95 1 MSVKVAELNSLLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTTTTSNQGLPWKNSTTLPKLCQIRDNLHSN 80 (584) T ss_pred CCcchhhhhhhccccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhhhhhhcccccccccchhHHHHHHHHHHHH Confidence 665522 111 2455679999999999999999999999999987766665555555688989999999999999 Q ss_pred HHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccc Q lcl|NC_015159. 74 LMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTE 153 (532) Q Consensus 74 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~ 153 (532) ||+.+|| ++.||++....+..... ++ =..+++.+...|+.+||+.++..++++++++|+|.+=+.... T Consensus 81 l~~~~Fp-~~~w~~~v~~~~~~~~~---------~~--~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~ 148 (584) T protein:vir:95 81 YFSSLFP-NDDWLRWVGYGKGDSTK---------TK--AKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEA 148 (584) T ss_pred HHHhhcC-ccceeeeecCCCchhhH---------HH--HHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEee Confidence 9999999 79999999886543321 11 224667778889999999999999999999999988665433 Q ss_pred ccc----------CCcceEEEEecceEEEeeCCCCCeEEEE--EEEeecHHHhhHHH-------------HHHHHhh--- Q lcl|NC_015159. 154 QVE----------GQSNAPKLYKLHNFVVERDAYDNVLQIV--TEDKIARAALPEDV-------------RKSLEEA--- 205 (532) Q Consensus 154 ~~~----------~~~~~~~~~pl~~~~v~~d~~G~vd~i~--rk~~~~~~~l~~~~-------------~~~~~~~--- 205 (532) ... ....+++-++.-++++..++ +.++... +|..+|+.+|-... +....+. T Consensus 149 ~~~e~~e~~~v~~~~~prieriSP~d~~~Dpsa-~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~ 227 (584) T protein:vir:95 149 KYKEMTDGTLVPDYIGPRLVRISPLDIVFNPLA-TSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHL 227 (584) T ss_pred cceeeeccccccccccceEEeeChhheeecCCC-CCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCC Confidence 210 11245666666788888888 5555532 46678888874433 2211110 Q ss_pred -cccCC----C----------------cceEEEEEE--EEee---CCCCeEEEEE-EEcCcccccccccCccccCceEEE Q lcl|NC_015159. 206 -QGDQN----P----------------SEEVTIYTH--VYRD---PEAMVFRSYQ-EIDGEIVAGTEGEYPLDSCPWIPV 258 (532) Q Consensus 206 -~~~~~----~----------------~~~v~i~~~--v~~~---~~~~~~~s~~-~~~~~~~~~~~~~~g~~~~P~~~~ 258 (532) ....+ + ...|++++. ..++ .++..|..+. ++.+..+-.....++++++||++. T Consensus 228 ~~~~~~~~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iIR~~~np~~~~~~PF~~~ 307 (584) T protein:vir:95 228 GGYSVEDFDKAAGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPTWFGSAPIYHV 307 (584) T ss_pred CCCcccccccccccccccccccccccCCceeEEEeecccccccccCCCcccceEEEEeccEEEEeeecCCCCCCCCEEEE Confidence 00111 0 112443332 1222 2223333333 344444444566788899999999 Q ss_pred EeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCCceeecCccccccccccCC Q lcl|NC_015159. 259 RLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGDFVAGRKQDVEVFQLEK 338 (532) Q Consensus 259 Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~G~~v~g~~~~~~~~~~~~ 338 (532) .|.+...+.||.|+.+.++|-.+.||.+.+.++++..++++|++. .+++++++...+++.+..+..+++.+++. . T Consensus 308 ~~~p~~~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k----~~~~~~~~~~~pg~~~~~~~~~~~q~~~p-~ 382 (584) T protein:vir:95 308 GWRFRPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQPPLK----IIGEVEEFVWGPGAEIHLDQGGDVQEIAK-N 382 (584) T ss_pred cceeeeccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCccee----eccccchhcccCCceeecCCCCCcceecC-c Confidence 999999999999999999999999999999999999999999533 34556666655555567788888776653 3 Q ss_pred ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcC-- Q lcl|NC_015159. 339 YNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATS-- 416 (532) Q Consensus 339 ~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g-- 416 (532) ..++-.+...|+-+.....+ +..-....+|.+.++++.+--.+.+.+..+.++.+...+|-.|+++|++.+|++.| T Consensus 383 a~~~~s~~~~lq~~e~~me~--~sGvp~~~~G~~~~~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~ 460 (584) T protein:vir:95 383 VNYIINADNQIQMLEDRMEL--YAGAPREAMGIRTPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETATR 460 (584) T ss_pred hhhhhHHHHHHHHHHHHHHh--hhCCChhhcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 34555555666666655544 11111223444454555555556677788899999999999999999999998764 Q ss_pred --CCC----------------CCcccccccee--ecchH--HHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHH Q lcl|NC_015159. 417 --KIP----------------NLPKEAVEPAI--ATGLE--ALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLA 474 (532) Q Consensus 417 --~lp----------------~~p~~~~~~~~--v~~l~--~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a 474 (532) ..+ ++..++++.++ +.-.+ -+.|+|..+++.+|++. ++.+....+++-.++.+.++ T Consensus 461 nmd~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~--~~~~~i~p~~~~~~l~~~la 538 (584) T protein:vir:95 461 NMDGSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNS--QIGQMILPHTSGKALATFVD 538 (584) T ss_pred hccccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHh--hhhhhccccchHHHHHHHHH Confidence 112 23344554442 22222 25688999999999885 55555555667777777788 Q ss_pred HhcCCCHhHccCCHHHHHHH--HHHHHHHHHHHHHHHhhhHHHHHH Q lcl|NC_015159. 475 NSLGMDTTGLILTQQDKQAK--MAEASTAAGMVTAGQQMGAAGGQA 518 (532) Q Consensus 475 ~~~Gv~p~~i~~s~ee~~~~--~~q~~~~~~~~~~~~~~~~~~~~~ 518 (532) +-.+.|.-.|.+.+-.++.. .|+.+.++++...+++...+.++- T Consensus 539 dl~~~p~~~~~~~~~~~~~Q~~~q~~~~~~q~~~~~~~~~~~~~~~ 584 (584) T protein:vir:95 539 DVTGLQGYEIFRPNVAVAEQAETQSLVAQAQEDLQLQAQMPAEGAI 584 (584) T ss_pred HHhCCCcccccCCCcccchhHHHHhhhHHHHHHHHHHHhhhhccCC Confidence 88888665666654333322 222222222111222222222111 No 29 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=100.00 E-value=6.1e-38 Score=224.60 Aligned_cols=503 Identities=10% Similarity=0.057 Sum_probs=318.3 Q ss_pred CCCC-CCCc-----c--C---HHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHH Q lcl|NC_015159. 1 MAEV-EKTG-----F--A---ADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNN 69 (532) Q Consensus 1 m~~~-~~~~-----~--~---~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~ 69 (532) |+-- +.+. . + ...++.+|.++.+.|+..+..|.|+++|+.-...+.+++.+.....+++-+...+.+.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~~~tr~t~~~~~~w~~s~t~~k~~~~~~~ 80 (599) T protein:vir:31 1 MSTDIKTLQKMLEGRDDDRAFIDELVVLFTNMENARAQKDREDKELMDYIDATDTRKTSNSKLPFKNSTTINKLAHLHLM 80 (599) T ss_pred CccchHHHHHHhhccCchHHHHHHHHHHHHhhhhhhhhhhcccHHHHHHHhhhcccccccCCCCcccccchHHHHHHHHH Confidence 4422 1000 0 1 33478999999999999999999999999988777777777777778999999999999 Q ss_pred HHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeee Q lcl|NC_015159. 70 LASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYI 149 (532) Q Consensus 70 Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v 149 (532) |.+.+++++|| ++.||++..-+++.. .+.-=..+++.|...|+.|+|+.+....+.|++.+||++-=+ T Consensus 81 l~a~~~~~~fp-~~~w~d~~~~~~~~~-----------~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~ 148 (599) T protein:vir:31 81 ITTSYMEHLLP-NRNWVDFVGFDNDSV-----------NAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHT 148 (599) T ss_pred HHHHHHhhhcC-CccceEeeecCCchh-----------HHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEee Confidence 99999999999 999999988765532 122233567778899999999999999999999999996654 Q ss_pred cc--------cccc--cCCcceEEEEecceEEEeeCCCCCeEEEE--EEEeecHHHhhHHHHH---------HHHh---h Q lcl|NC_015159. 150 PS--------TEQV--EGQSNAPKLYKLHNFVVERDAYDNVLQIV--TEDKIARAALPEDVRK---------SLEE---A 205 (532) Q Consensus 150 ~~--------~~~~--~~~~~~~~~~pl~~~~v~~d~~G~vd~i~--rk~~~~~~~l~~~~~~---------~~~~---~ 205 (532) +- ++.. ...+.+++-+.+..+++..++ +.++.++ +|..+|..+|-..+.+ .... . T Consensus 149 ~~er~~~~~~d~~v~~~~~~P~~ervsP~Di~~Dp~A-~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~ 227 (599) T protein:vir:31 149 RHVKRMTVTAENQVIKNYSGTVTERLSPSDVFWDVTA-DSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLREE 227 (599) T ss_pred eEEEcceeecccccccccccceEEeecccceeeCCCC-CCCCcceeeeehhhhHHHHHHHhccCCccccchHHHHHHHhh Confidence 41 1100 011235666667788888887 5555543 5777787776432221 1100 0 Q ss_pred c-----cc--------------CCCcce---------EEEEEE--EEeeCCCCeEE---EEEEEcCccc-ccccccCccc Q lcl|NC_015159. 206 Q-----GD--------------QNPSEE---------VTIYTH--VYRDPEAMVFR---SYQEIDGEIV-AGTEGEYPLD 251 (532) Q Consensus 206 ~-----~~--------------~~~~~~---------v~i~~~--v~~~~~~~~~~---s~~~~~~~~~-~~~~~~~g~~ 251 (532) . .. ++...+ |++++. .+.+.++.++. ...+++++.+ ......++.+ T Consensus 228 ~~~~~~~~~d~~~~~~g~D~~~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g 307 (599) T protein:vir:31 228 RRTIREALADGYNGRRKFDSLHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDG 307 (599) T ss_pred ccCCCccccchhhhhhhccccccccccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCC Confidence 0 00 011112 222221 23344443333 3345555433 4555678888 Q ss_pred cCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCCceeecCccccc Q lcl|NC_015159. 252 SCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGDFVAGRKQDV 331 (532) Q Consensus 252 ~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~G~~v~g~~~~~ 331 (532) +.||++..|.+..++.||.||...++|.+..||.+.+..+.+...+++| +++-.|.+.+.++.+.++..+..++.+++ T Consensus 308 ~~Pyvv~~~~P~~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p--~l~~~~dl~~eD~~~~P~~v~~~~d~~~v 385 (599) T protein:vir:31 308 SQNLHIAVYEFQKDTLCPIGPLHRLTGMQYKLDKRENFREDLHDRFLHP--SLKKVGDVREKGMRGGPNHVFEVEETGDV 385 (599) T ss_pred CCCeEEEEeeeeccccCCCCCchhcchHHHHHHHHHHHhhhhhhhhhcc--cccccccccccCccCCCCcceeecCCCcc Confidence 9999999999999999999999999999999999999999999999999 44556668888887764444455777777 Q ss_pred cccccCCccchhHHHHHHHHHHHHHHHHH---hhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHH Q lcl|NC_015159. 332 EVFQLEKYNDFQVAKATADDIEKRLSYAF---MLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKIL 408 (532) Q Consensus 332 ~~~~~~~~~~~~~~~~~i~~~~~rI~~af---~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~ 408 (532) .+.. +..+...+...|+..+.+..+.= ...+..+..++ -||+||....++...........+..+|+.||++++ T Consensus 386 q~~~--p~s~~~~a~~~is~~e~~mee~sGvp~~~~G~~~ag~-~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l 462 (599) T protein:vir:31 386 QYMT--PPAEVLQPDNQLSITLQLMEDLSGAPKESIGQRTAGE-KTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDY 462 (599) T ss_pred cccc--CchhhhhHHHHHHHHHHHHHHhhccchhhcCCcccch-hhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHH Confidence 6543 22234344445555555443310 11111112223 499999999999999999999999999999999999 Q ss_pred HHHHHhc----CCCC------------CCcccccccee-ecchH---HHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHH Q lcl|NC_015159. 409 LKELQAT----SKIP------------NLPKEAVEPAI-ATGLE---ALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLD 468 (532) Q Consensus 409 ~~il~r~----g~lp------------~~p~~~~~~~~-v~~l~---~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~ 468 (532) +....+. |.+- .+..++++-++ +..++ -+.|++..+++.+|++ +++.+.....+.-.+ T Consensus 463 ~e~~~~f~D~~~tiri~~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~l~~il~--~~~~q~~~P~~~~k~ 540 (599) T protein:vir:31 463 LEQGRNHLDASDTIKTFNSELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAILG--GPLGAALAPHMSRTK 540 (599) T ss_pred HHHHHhhcccccceeeecccccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHHHHHHHhc--ccCCCccchhhHHHH Confidence 9887653 1111 12223333222 22222 2668889999999997 555444433333334 Q ss_pred HHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 469 VKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 469 ~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) +...++....+.--.|.+.. +..+.||.+.. +++....+.+...++++.-|-|+- T Consensus 541 l~~~l~~~~~l~~~~~~~~~--va~~eqq~~~~-------m~Q~~lq~~~~~~~~~~~~~~~~~ 595 (599) T protein:vir:31 541 LFNAVEYLGDLDAYGIFTFG--IGVQEDQQLAR-------MAQKSTQQTEETALTQEEVGGPTT 595 (599) T ss_pred HHHHHHHHHhccccccCCCc--hhHHHHHHHHH-------HHHHHHHHhHhhhhhhhhcCCCCc Confidence 44444442222222233322 21111111111 111111222334555555565555 No 30 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=100.00 E-value=1.1e-31 Score=190.30 Aligned_cols=506 Identities=13% Similarity=0.109 Sum_probs=285.9 Q ss_pred CCCCC-CCccCHHHHHHHHHHH----HHHhhh-HHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVE-KTGFAADGAAAAYNRL----KNDRGA-YETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKL 74 (532) Q Consensus 1 m~~~~-~~~~~~~~~~~r~~~l----k~~R~~-~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l 74 (532) |||.+ .-.++-+.+.+..+++ ++-+.. +...+.+-.+|.+-.-. ....+| ..+++.+.-...++.+.+.| T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~-~~~~~~---~s~~~~~~v~~~v~~~~~~l 76 (705) T protein:vir:88 1 MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPF-GNERPG---KSGIVSRDVQETVDWIMPSL 76 (705) T ss_pred CCcccccccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCC-CcccCC---CCccccHHHHHHHHHHHHHH Confidence 99994 4445555555444443 332221 11233333444332211 111222 34566777777899999999 Q ss_pred HHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHH-HHHhcCChHHHHHHHHHHHhhCceeeeecccc Q lcl|NC_015159. 75 MLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMN-YMESNSFRPTLHAAIKQLLVAGNVLLYIPSTE 153 (532) Q Consensus 75 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~-~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~ 153 (532) +..+|+ +.+||++.|-.+...+. . +.++..+.- ....++.+..++.++++.+..|+|++=+..+. T Consensus 77 ~~~~~~-~~~~~~~~p~~~~D~~~----------a---~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~ 142 (705) T protein:vir:88 77 MKVFTS-GGQVVKYEPDTAEDVEQ----------A---EQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEE 142 (705) T ss_pred HHhhcC-CCceEEEeeCChhHHHH----------H---HHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEecccc Confidence 999887 99999999864432211 1 122223322 34566677889999999999999977332211 Q ss_pred c---------------------------------------------ccCCcceEEEEecceEEEeeCCCCCeEE--EEEE Q lcl|NC_015159. 154 Q---------------------------------------------VEGQSNAPKLYKLHNFVVERDAYDNVLQ--IVTE 186 (532) Q Consensus 154 ~---------------------------------------------~~~~~~~~~~~pl~~~~v~~d~~G~vd~--i~rk 186 (532) . ...+.++++.||..+|++..++.+--|. ++++ T Consensus 143 ~~~~~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~~~d~~~~~~~ 222 (705) T protein:vir:88 143 VLKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHR 222 (705) T ss_pred ccchhhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCCcccCcEEEEE Confidence 0 0013467888999999999988765453 5678 Q ss_pred EeecHHHhhH-----HHHHHHH---------------hhc------------ccCCCcceEEEEEEEEe-eCCCCe---E Q lcl|NC_015159. 187 DKIARAALPE-----DVRKSLE---------------EAQ------------GDQNPSEEVTIYTHVYR-DPEAMV---F 230 (532) Q Consensus 187 ~~~~~~~l~~-----~~~~~~~---------------~~~------------~~~~~~~~v~i~~~v~~-~~~~~~---~ 230 (532) ..+|.++|-. +..+.+. +.. ..+.....|++|+|..+ +.++.. | T Consensus 223 ~~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~ 302 (705) T protein:vir:88 223 EKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISEL 302 (705) T ss_pred EeccHHHHHhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEecccCCcceee Confidence 8899888721 1111110 000 00112235777777544 323322 1 Q ss_pred EEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCcccc Q lcl|NC_015159. 231 RSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVT 310 (532) Q Consensus 231 ~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~ 310 (532) ..+.+. |..+.. .-+.+.+||++..+.+.++..||.|++....+-.+.+|.+.+..++++..+++|.++++ +|.+ T Consensus 303 ~~~~~~-g~~il~---~~~~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~-~g~v 377 (705) T protein:vir:88 303 RRILYV-GDYIIS---NEPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVL-DGQV 377 (705) T ss_pred EEEEEe-Cccccc---cccCCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceecc-cccc Confidence 122233 332222 22446799999999999999999999999999999999999999999999999999995 5667 Q ss_pred ChhhhccCCCceeecCc-cccccccccCCccchhHHHHHHHHHHHHHHHHHhh-hhcccCC----CCCCCHHHHHHHHHH Q lcl|NC_015159. 311 QIRRVAKANTGDFVAGR-KQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFML-NSAVQRG----GDRVTAEEIRYVAGE 384 (532) Q Consensus 311 ~~~~~~~~~~G~~v~g~-~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~----~~~~TAtEi~~r~~E 384 (532) ++.++.+..||.++.-. .+.+.+++.+ .--+.....++.+.+.|++..=. +.....+ ..+.||+.|....+. T Consensus 378 ~~~d~~~~~pg~vv~~~~~~~i~~~~~~--~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~ 455 (705) T protein:vir:88 378 NLEDLLTNEAAGIVRVKSMNSITPLETP--QLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTA 455 (705) T ss_pred CcccccccCCCeeEEecCCCccccccCC--cCcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHH Confidence 88888888888887633 3445555433 23344566677777777765411 1111111 235799999999999 Q ss_pred HHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCC----------C-cccc---ccceeecchHHHHHHHHHHHHHHHH Q lcl|NC_015159. 385 LEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPN----------L-PKEA---VEPAIATGLEALGRGHDLNKLNVFI 450 (532) Q Consensus 385 ~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~----------~-p~~~---~~~~~v~~l~~l~raq~~~~l~~~~ 450 (532) ....+.-....+...++.+++.+++.++....--|. + |.+. ..+.+.++++...+.+..+++...+ T Consensus 456 ~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll 535 (705) T protein:vir:88 456 AEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIW 535 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhccCCceEEeeccccchHHHHHHHHHHHH Confidence 999999989988888999999999999887542221 1 2221 2334445566666555554444444 Q ss_pred H---HHHhhcchhhhhc---CHHHHHHHHHHhcCC-CHhHccCCHHHHHHH---HHHHHHHHH------HHHH--HHhhh Q lcl|NC_015159. 451 D---YMIKLAGLQDDDI---NLLDVKMRLANSLGM-DTTGLILTQQDKQAK---MAEASTAAG------MVTA--GQQMG 512 (532) Q Consensus 451 ~---~laq~~p~~~d~i---d~d~~~~~~a~~~Gv-~p~~i~~s~ee~~~~---~~q~~~~~~------~~~~--~~~~~ 512 (532) + .+.+. |...+.+ +..++...+++..|+ .+..+...+...+++ .++.+...+ .+++ +.+.. T Consensus 536 ~~~q~l~~~-~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~ 614 (705) T protein:vir:88 536 EMAQAVVGG-GGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQS 614 (705) T ss_pred HHHHHhhcc-cchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHH Confidence 4 44332 3333333 344566666666665 233344332211111 110000000 0000 00000 Q ss_pred HHHHHHHHh-hccc--ccCCCCC Q lcl|NC_015159. 513 AAGGQAAAA-MMQQ--QAGLPTQ 532 (532) Q Consensus 513 ~~~~~~~~~-~~~~--~~g~~~~ 532 (532) ....+.+.. ..+. +.-+.++ T Consensus 615 e~~~~q~e~q~~q~E~q~~q~e~ 637 (705) T protein:vir:88 615 DALAKQAEAQMKQVEAQIRLAEI 637 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH Confidence 000000000 0000 0000000 No 31 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=99.94 E-value=3.1e-25 Score=154.95 Aligned_cols=510 Identities=10% Similarity=0.038 Sum_probs=259.7 Q ss_pred CCCCC-CCccCHHH---HHHHHHHHHHHhhhHH---HHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVE-KTGFAADG---AAAAYNRLKNDRGAYE---TRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASK 73 (532) Q Consensus 1 m~~~~-~~~~~~~~---~~~r~~~lk~~R~~~e---~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~ 73 (532) =+..+ +.=..... |++-++..++...... ..|-+++-|.. . .......| ..++....-.+.++.+-+. T Consensus 15 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~g---rs~vv~~~v~~~ve~~~~~ 89 (763) T protein:vir:95 15 SQATKLTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEG-K-AKPPKVKG---RSQVQPKLVRRQAEWRYSA 89 (763) T ss_pred cchhcCCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccc-c-CcccccCC---CccccCHHHHHHHHHHHHH Confidence 01111 11112222 2222222222222211 23444433331 1 11111222 3356777788899999999 Q ss_pred HHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHH-HHHHhcCChHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_015159. 74 LMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICM-NYMESNSFRPTLHAAIKQLLVAGNVLLYIPST 152 (532) Q Consensus 74 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~-~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~ 152 (532) |+-.+++ +..||++.+-.+...+.. + +.+..+. -....++-+..++.++++++..|||++=+-++ T Consensus 90 l~~~f~~-~~~~~~~~P~~~~D~~~A-------~------q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~ 155 (763) T protein:vir:95 90 LTEPFLG-SNKLFKVTPVTWEDVQGA-------R------QNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWN 155 (763) T ss_pred HHHhhcC-CCcEEEEecCCcchHHHH-------H------HHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeee Confidence 9998888 888999998865543221 1 1222222 24456667778999999999999997633221 Q ss_pred cc-------------------------------------------------------c--------------------cC Q lcl|NC_015159. 153 EQ-------------------------------------------------------V--------------------EG 157 (532) Q Consensus 153 ~~-------------------------------------------------------~--------------------~~ 157 (532) .. . .. T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k 235 (763) T protein:vir:95 156 REIRKEKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLA 235 (763) T ss_pred eeeeeeeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEec Confidence 00 0 00 Q ss_pred CcceEEEEecceEEEeeCCCCCeE---EEEEEEeecHHHhhH---------HHHHHHH--------------hhcccCCC Q lcl|NC_015159. 158 QSNAPKLYKLHNFVVERDAYDNVL---QIVTEDKIARAALPE---------DVRKSLE--------------EAQGDQNP 211 (532) Q Consensus 158 ~~~~~~~~pl~~~~v~~d~~G~vd---~i~rk~~~~~~~l~~---------~~~~~~~--------------~~~~~~~~ 211 (532) +..+++.+|+.+|+|..++.+.++ -++++..+|..+|-. ++..... ........ T Consensus 236 ~~p~ie~V~p~d~~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 315 (763) T protein:vir:95 236 NHPTVEMLNPENIIIDPSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPM 315 (763) T ss_pred CceEEEeecHHHheecCCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCCcc Confidence 123567789999999988766443 356788899888722 1111000 00111112 Q ss_pred cceEEEEEEEEe-eCCCC---eEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_015159. 212 SEEVTIYTHVYR-DPEAM---VFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLY 287 (532) Q Consensus 212 ~~~v~i~~~v~~-~~~~~---~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~ 287 (532) ..+|.||.+..+ +.++. .|..+.+..+..+......|+++++||++..+.+.++..||.|.++.+.+..+.+|.+. T Consensus 316 ~~~V~v~E~y~~~d~~gdg~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~~~~d~Qr~~N~~~ 395 (763) T protein:vir:95 316 RKRVVAYEYWGFWDIEGNGVLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAELLGDNQAVLGAVM 395 (763) T ss_pred cceEEEEEeeeeeccCCcceeEEEEEEEEcCeeeecccccccCCCcCEEEecceeecCcccCCchHHHhhHHHHHHHHHH Confidence 367888776543 33332 23333455655555555667778999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCceeecCccccChhhhccCCCceeec---Ccccc--ccccccCC-ccchhHHHHHHHHHHHHHHHHHh Q lcl|NC_015159. 288 EAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGDFVA---GRKQD--VEVFQLEK-YNDFQVAKATADDIEKRLSYAFM 361 (532) Q Consensus 288 ~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~G~~v~---g~~~~--~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~ 361 (532) +..+..+..+++|.|+++.+. ++..+....++|.++. |.... +.+.+.+. ...+....+.++...+.+.-.-- T Consensus 396 ~~~~d~l~~~~~~~~~v~~ga-v~~~d~~~~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~~~~~~e~~TGv~~ 474 (763) T protein:vir:95 396 RGMIDLLGRSANGQRGMPKGM-LDALNSRRYREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATLQNQEAESLTGVKA 474 (763) T ss_pred HHHHHHHHhhcCCcEEeeccc-ccchhhhcccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHHHHHHHHHhhCcch Confidence 999999999999999996554 5656655667777653 32222 12222221 12222233333322222211111 Q ss_pred hhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc----------C-CCCCCcccccc--c Q lcl|NC_015159. 362 LNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQAT----------S-KIPNLPKEAVE--P 428 (532) Q Consensus 362 ~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~----------g-~lp~~p~~~~~--~ 428 (532) .......++...||++|..+.+....++..++.++.. .+.+++++++.++.+. | ..-++..+... . T Consensus 475 ~~~G~~~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~-~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~ 553 (763) T protein:vir:95 475 FAGGVTGESYGDVAAGIRGVLDAASKREMAILRRLAK-GMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKREDLKGNF 553 (763) T ss_pred hhcCcCcccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHHHhcCCc Confidence 1111122334569999999999999999888877765 7899999999998874 1 11122222211 1 Q ss_pred eeecchHHH-HHHHHHHHHHHHHHHHHhhcchh---------hhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHH Q lcl|NC_015159. 429 AIATGLEAL-GRGHDLNKLNVFIDYMIKLAGLQ---------DDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEA 498 (532) Q Consensus 429 ~~v~~l~~l-~raq~~~~l~~~~~~laq~~p~~---------~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~ 498 (532) .++..+++. .+.+.++.+..+++.++...+.. ++..+..++++.+.....- |..+-.-..+.++++++. T Consensus 554 DV~V~~~~as~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~-~d~~~q~qaqle~~~~q~ 632 (763) T protein:vir:95 554 DLEVDISTAEVDNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQ-PDPVQEQLKQLAVEKAQL 632 (763) T ss_pred ceEEecccchHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCC-ccchhhhHHHHHHHHHHH Confidence 222223332 23445555655555554322221 1233333333333322221 111110000111110000 Q ss_pred HH---HHHHHHHHHhhhH--HHH-HHHHhhcccccCCC----------------------------------CC Q lcl|NC_015159. 499 ST---AAGMVTAGQQMGA--AGG-QAAAAMMQQQAGLP----------------------------------TQ 532 (532) Q Consensus 499 ~~---~~~~~~~~~~~~~--~~~-~~~~~~~~~~~g~~----------------------------------~~ 532 (532) ++ +++++.+..++.. ... .....+...+.++- ++ T Consensus 633 e~~~~~akaq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q~~~e~~~~~~~~eaq~~l~~~~a~~~~~~ea~~~~~ 706 (763) T protein:vir:95 633 ENEELRSKIRLNDAQAQKAMAERDNKNLDYLEQESGTKHARDLEKMKAQSQGNQQLEITKALTKPRKEGELPPN 706 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccChh Confidence 00 0000000000000 000 00000000000000 00 No 32 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.82 E-value=1.9e-18 Score=117.78 Aligned_cols=503 Identities=11% Similarity=0.041 Sum_probs=248.5 Q ss_pred CCCCCCCccC-----------HH---HHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCC-c-c---cccccccccc Q lcl|NC_015159. 1 MAEVEKTGFA-----------AD---GAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATA-D-G---STSYTTPWQS 61 (532) Q Consensus 1 m~~~~~~~~~-----------~~---~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~-~-~---~~~~~~~~ds 61 (532) |.+.+....+ ++ +|.++|..-.+.-..|...+.+-.+|.. +.-.+. . . .+..+.+.-. T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~---G~Qw~~~~~~~l~~~g~p~~~~N 99 (776) T protein:vir:93 23 SPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYD---NIQWSQDEIDELKERGQAPTVYN 99 (776) T ss_pred CCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhC---CCCCCHHHHHHHHhcCCceEEec Confidence 3444332221 11 2333444443333455555555556642 110110 0 0 0111122222 Q ss_pred hHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHh Q lcl|NC_015159. 62 IGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLV 141 (532) Q Consensus 62 t~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~ 141 (532) .-...++.+.+..-. +++=+++.+.+.... ++.+.| +..+......+++..+...++.+.++ T Consensus 100 ~i~~~i~~v~g~~~~-----nr~~~~~~p~~~~d~----------~~Ae~l---~~~~~~~~~~~~~~~~~~~af~d~~~ 161 (776) T protein:vir:93 100 VISQSVNWIIGSEKR-----GRSDFKVLPRRKDGG----------KAAERK---TALLKYLSDVNHTPFERSMAFEETTK 161 (776) T ss_pred chHHHHHHHHHHHHh-----CCcceEEecCChhHH----------HHHHHH---HHHHHHHHHhhcHHHHHHHHHHHhhh Confidence 223334433332222 566677766543211 222233 33444556788999999999999999 Q ss_pred hCceeeeecccccccCCcceEEEEecceEEEeeCCCC----CeEEEEEEEeecHHHhhHHHHH---HHHh----h----- Q lcl|NC_015159. 142 AGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYD----NVLQIVTEDKIARAALPEDVRK---SLEE----A----- 205 (532) Q Consensus 142 ~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G----~vd~i~rk~~~~~~~l~~~~~~---~~~~----~----- 205 (532) .|.|++=+--+....+..+..++++..++++..++.- ...-+|++.+++.+++-..+.. .+.+ . T Consensus 162 ~G~G~~~v~~d~~~~~~~~~~~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~ 241 (776) T protein:vir:93 162 AGIGWLESQVQDENDGEPIYAGAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWG 241 (776) T ss_pred cCcceEEEEeeccCCCCceEeeccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccc Confidence 9999864433333345566677788888888765532 1334677888998876322211 1100 0 Q ss_pred ------------------------cccCCCcceEEEEEEEEeeCC------------------------------C---- Q lcl|NC_015159. 206 ------------------------QGDQNPSEEVTIYTHVYRDPE------------------------------A---- 227 (532) Q Consensus 206 ------------------------~~~~~~~~~v~i~~~v~~~~~------------------------------~---- 227 (532) .......++|.|+.+.+++.. + T Consensus 242 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~ 321 (776) T protein:vir:93 242 TDDIDGDDAMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVL 321 (776) T ss_pred hhcccccccccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceee Confidence 000112246777776544210 0 Q ss_pred -----CeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_015159. 228 -----MVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLF 302 (532) Q Consensus 228 -----~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~ 302 (532) +....+++..+..+....+.|.++.|||++......+.+.||.|.+....+-.+.+|......+..+ ...++ T Consensus 322 ~~~~~~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l---~~~~~ 398 (776) T protein:vir:93 322 AVSPMMRMHCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYIL---STNKV 398 (776) T ss_pred hheeeeeeEEEEEecchhhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHhh---cCCce Confidence 1112233444444444445677789999999999999999999999999999999999888876643 45678 Q ss_pred eecCccccChhhhcc--CCCceeecCccccccccccCCccc-hhHHHHHHHHHHHHHHHH--HhhhhcccCCCCCCCHHH Q lcl|NC_015159. 303 FVNPNGVTQIRRVAK--ANTGDFVAGRKQDVEVFQLEKYND-FQVAKATADDIEKRLSYA--FMLNSAVQRGGDRVTAEE 377 (532) Q Consensus 303 lv~~~g~~~~~~~~~--~~~G~~v~g~~~~~~~~~~~~~~~-~~~~~~~i~~~~~rI~~a--f~~~~~~~~~~~~~TAtE 377 (532) ++..+.+-+.+.+.. +++|.++...++......+....+ .+.....++...+.|+.. ..-..+. ..+...+..- T Consensus 399 ~~~~gav~~~d~~~~~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G-~~~n~~Sg~a 477 (776) T protein:vir:93 399 LMEEGAVDDIDEFRREAARPDAVMTVKNGKLGAVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLG-RTTNAVSGVA 477 (776) T ss_pred eeccccccchHHHHHhcccCCceeeeCCccccccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhC-CCcchhhHHH Confidence 888777777776654 567777665544433332222222 233444455555555443 1211222 2334467888 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcC---CC----C--------CC----ccccc-----cceeecc Q lcl|NC_015159. 378 IRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATS---KI----P--------NL----PKEAV-----EPAIATG 433 (532) Q Consensus 378 i~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g---~l----p--------~~----p~~~~-----~~~~v~~ 433 (532) |..|.+.....|..++.++.. .+.=+.+.++.++.+.- .+ - .+ +..++ .+.+..+ T Consensus 478 i~~~~~~~~~~~~~~~dn~~~-~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~ 556 (776) T protein:vir:93 478 IQARQEQGSVATNKLFDNLRL-AFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEA 556 (776) T ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeec Confidence 999999999999999999877 44446666666665521 00 0 01 11111 1222222 Q ss_pred h-HHHHHHHHHHHHHHHHHHHHhhcchh-----------hhhcCHHHHHHHHHHhcCC-CHhHccCCHHHHHHHHHHHHH Q lcl|NC_015159. 434 L-EALGRGHDLNKLNVFIDYMIKLAGLQ-----------DDDINLLDVKMRLANSLGM-DTTGLILTQQDKQAKMAEAST 500 (532) Q Consensus 434 l-~~l~raq~~~~l~~~~~~laq~~p~~-----------~d~id~d~~~~~~a~~~Gv-~p~~i~~s~ee~~~~~~q~~~ 500 (532) . ++..|.+..+.++..++ ++.|.. ++.-+.++++..+-...+- +|..-...+++.++...+++. T Consensus 557 ~~~~s~r~~~~~~l~ql~~---~~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~~ 633 (776) T protein:vir:93 557 EWRATMRQAAVAELMEVIG---KMPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQQ 633 (776) T ss_pred ccchhHHHHHHHHHHHHHh---hcChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhHHHHHHHhhHH Confidence 2 23345555555554443 333322 2233566677666665552 122222222222211111110 Q ss_pred HHHH----------HHHHHhh--hHHHHHHHHhhc----ccccCCCCC Q lcl|NC_015159. 501 AAGM----------VTAGQQM--GAAGGQAAAAMM----QQQAGLPTQ 532 (532) Q Consensus 501 ~~~~----------~~~~~~~--~~~~~~~~~~~~----~~~~g~~~~ 532 (532) ++.+ +++..+. ..+....+.+.+ ..+++.-++ T Consensus 634 ~q~q~~~~~a~~~~~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~a~ 681 (776) T protein:vir:93 634 QQYNDALAIATLEEQQAKARKAAAEAQVAEAKAKHISRMAIREGVGAV 681 (776) T ss_pred HHHHHHHhhhhhhHhhHHHHHHHHHHHHHhhhhhhhhhcchhhhhhhh Confidence 0000 0000000 000000000000 000000000 No 33 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.72 E-value=1.1e-14 Score=97.04 Aligned_cols=516 Identities=13% Similarity=0.079 Sum_probs=237.0 Q ss_pred CCCC-CCC-------------ccCHHHHHHHHHHHHHH---hhhHHHHHHH----HHHhhcccccCCCCCcc------cc Q lcl|NC_015159. 1 MAEV-EKT-------------GFAADGAAAAYNRLKND---RGAYETRAED----CATYTIPSVFPSATADG------ST 53 (532) Q Consensus 1 m~~~-~~~-------------~~~~~~~~~r~~~lk~~---R~~~e~~w~e----~~~~~~P~~~~~~~~~~------~~ 53 (532) |++- ++. +.+.++..+.+.++++. -..+...|++ -.+|.. + ..-.+. .+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~---G-~Qw~~~~~~~l~~~ 76 (711) T protein:vir:10 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLG---G-EQWPSQVRTERELE 76 (711) T ss_pred CCcccccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhC---C-CCCCHHHHHHHHhc Confidence 6654 111 11222222233333322 2334444442 233331 1 111110 01 Q ss_pred ccccc-ccchHHHHHHHHHHHHHHhhcCCCCCccccCCCh------------HHHhhhccChhHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 54 SYTTP-WQSIGARGLNNLASKLMLALFPVGSSFFKLNVSE------------LEVKQSITSPEELTEIATGLAMVERICM 120 (532) Q Consensus 54 ~~~~~-~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d------------~~~~~~~~~~~~~~~v~~~L~~ve~~~~ 120 (532) ..+.+ |+=++ -.++...+..- .+++=+++.+.+ ............-.++.+.| +..+. T Consensus 77 g~p~~~~N~i~-~~v~~v~g~~~-----~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l---~~~~~ 147 (711) T protein:vir:10 77 QRPCLVNNVLP-TFVDQVLGDQR-----QNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVF---TGLIK 147 (711) T ss_pred CCCcEEEcchH-HHHHHHhhhHh-----hCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHH---HHHHH Confidence 11112 22222 23333322222 133333333321 11111111111111233333 33344 Q ss_pred HHHHhcCChHHHHHHHHHHHhhCceeeee--cc-cccccCCcceEEEEe-cceEEEeeCC---CCC-eEEEEEEEeecHH Q lcl|NC_015159. 121 NYMESNSFRPTLHAAIKQLLVAGNVLLYI--PS-TEQVEGQSNAPKLYK-LHNFVVERDA---YDN-VLQIVTEDKIARA 192 (532) Q Consensus 121 ~~l~~snf~~~~~~~~~dl~~~G~~~~~v--~~-~~~~~~~~~~~~~~p-l~~~~v~~d~---~G~-vd~i~rk~~~~~~ 192 (532) .....++...+...++.+.+..|.|++=| +. ++....+.+.++.++ ..++++..++ ++. ..-+|++.+++.+ T Consensus 148 ~~~~~~~~~~~~s~af~d~~~~G~G~~ev~~d~~~~d~~~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~ 227 (711) T protein:vir:10 148 NIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKE 227 (711) T ss_pred HHHHhcChhHHHHHHHHHhhhcCcceEEEEecccCCCCCCCCeEEeeecChhheeeCccccccChhhhcceeeeecCCHH Confidence 45567888889999999999999997522 21 111223455666664 5667775433 332 3447889999999 Q ss_pred HhhHHHHHHH----Hhhccc-CC---CcceEEEEEEEEeeCCC------------------------------------- Q lcl|NC_015159. 193 ALPEDVRKSL----EEAQGD-QN---PSEEVTIYTHVYRDPEA------------------------------------- 227 (532) Q Consensus 193 ~l~~~~~~~~----~~~~~~-~~---~~~~v~i~~~v~~~~~~------------------------------------- 227 (532) ++-..+.... ...... .+ ..+.|.+..+.++++.. T Consensus 228 ~~~~~yp~~a~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 307 (711) T protein:vir:10 228 KFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKV 307 (711) T ss_pred HHHHhCCchhhhhhhcccccccCcccCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhh Confidence 8755443322 111100 00 12344444333221100 Q ss_pred CeEEEE-EEEcCcccccccccCccccCceEEE--EeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee Q lcl|NC_015159. 228 MVFRSY-QEIDGEIVAGTEGEYPLDSCPWIPV--RLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFV 304 (532) Q Consensus 228 ~~~~s~-~~~~~~~~~~~~~~~g~~~~P~~~~--Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv 304 (532) +.++.+ +...|..+......|++..|||++. .+...++..++.|.+....+-.+.+|.+....+..+....+++|++ T Consensus 308 ~~~~v~~~~~~G~~~L~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~ 387 (711) T protein:vir:10 308 KTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIG 387 (711) T ss_pred ceeeEEEEEEecceeecCCCCCCCCcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceee Confidence 011111 1123333322334567788999865 3556788888888999999999999999999999999999999999 Q ss_pred cCccccChhhhc---cCCCceeecCcccc---ccccccCCccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHH Q lcl|NC_015159. 305 NPNGVTQIRRVA---KANTGDFVAGRKQD---VEVFQLEKYNDFQVAKATADDIEKRLSYAF-MLNSAVQRGGDRVTAEE 377 (532) Q Consensus 305 ~~~g~~~~~~~~---~~~~G~~v~g~~~~---~~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~~~TAtE 377 (532) .++.+-+.+... .+.||.++...++. ..+.+.+...--+.....++...+.|.+.- ..+.+....+..+|..- T Consensus 388 ~~gai~~~~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~a 467 (711) T protein:vir:10 388 SEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRA 467 (711) T ss_pred cCcccCChHHHHHhccccCCCeeEecccccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccchHHHH Confidence 888887766543 35677776543332 222222322333445555565666555532 11212223334578999 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcC---CC--------C----CCccc-------------cc--- Q lcl|NC_015159. 378 IRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATS---KI--------P----NLPKE-------------AV--- 426 (532) Q Consensus 378 i~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g---~l--------p----~~p~~-------------~~--- 426 (532) |..+.+.....|...+.++.. ...=+.+.++.++.+.- .+ + .+-.. ++ T Consensus 468 i~~~q~qg~~~l~~~~dn~~~-~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g 546 (711) T protein:vir:10 468 IIARQRQGDRGSFAFIDNLTK-SIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQ 546 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeecccee Confidence 999999999999998888876 44444455555443311 00 0 00000 00 Q ss_pred cceee---cchHHHHHHHHHHHHHHHHHHHHhhcch----h---hhhcCHHHHHHHHHHhcCCCHhHccCCHHH----HH Q lcl|NC_015159. 427 EPAIA---TGLEALGRGHDLNKLNVFIDYMIKLAGL----Q---DDDINLLDVKMRLANSLGMDTTGLILTQQD----KQ 492 (532) Q Consensus 427 ~~~~v---~~l~~l~raq~~~~l~~~~~~laq~~p~----~---~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee----~~ 492 (532) +..++ .+-.+-.|.+.++.++.+++.+-++.+. + +|.-+.++++..+....+- ........+ .+ T Consensus 547 ~~Dv~i~~~p~~~s~r~~~~~~l~ql~~~~p~~~~~~~~~il~~~d~p~~~el~e~lr~~~~~--~~~~~~~~~~~qq~~ 624 (711) T protein:vir:10 547 KYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPP--NVLSKDEREAIEEDM 624 (711) T ss_pred eeEEEEeeccCchhHHHHHHHHHHHHHhhcchhhhHHHHHHHHhcCCCCHHHHHHHHHhhcCc--ccCcchhhhHHHHHH Confidence 11222 2223444555555555544433221111 1 2344677777777766553 222222111 11 Q ss_pred HHHHHHHHHHHH----HHH-HHhhhHHHHHHHHhhcc---------cc-cCCCCC Q lcl|NC_015159. 493 AKMAEASTAAGM----VTA-GQQMGAAGGQAAAAMMQ---------QQ-AGLPTQ 532 (532) Q Consensus 493 ~~~~q~~~~~~~----~~~-~~~~~~~~~~~~~~~~~---------~~-~g~~~~ 532 (532) ++++++.++.+. .++ ..++.+....+-+..++ .+ .++..+ T Consensus 625 ~e~qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~~~~~ 679 (711) T protein:vir:10 625 PEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDM 679 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111000000 000 00000000000000000 00 000000 No 34 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.55 E-value=1.7e-12 Score=85.13 Aligned_cols=501 Identities=10% Similarity=0.056 Sum_probs=229.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHH----HHHHhhcccccCCCCCcc------ccccccc-ccchHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAE----DCATYTIPSVFPSATADG------STSYTTP-WQSIGARGLNN 69 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~----e~~~~~~P~~~~~~~~~~------~~~~~~~-~dst~~~a~~~ 69 (532) |+......+.++...+.+...+.+... .+.|+ +-.+|.. + ..-... .+..+.+ |+=++ -.++. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~R~~a~~d~~fy~---G-~Qw~~~~~~~l~~~g~p~~~~N~i~-~~v~~ 81 (714) T protein:vir:10 8 MATKNDNGATPRFSQRQLQALCSDIDS-QPKWRDAANKACAYYD---G-DQLPPEVLQVLKDRGQPMTIHNLIA-PTVDG 81 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhhc---C-CCCCHHHHHHHHhcCCCcEEeccHH-HHHHH Confidence 444432333444445555555555432 33455 4445442 1 111110 0111122 33332 23333 Q ss_pred HHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceee-- Q lcl|NC_015159. 70 LASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLL-- 147 (532) Q Consensus 70 Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~-- 147 (532) ..+..- .+++=+++.+.+.+.. -.++.+.| +..+......+++..+...++.+.+..|.|++ T Consensus 82 v~g~~~-----~nr~~~~v~p~~~~~~--------~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~ 145 (714) T protein:vir:10 82 VLGMEA-----KTRTDLVVMSDEPDDE--------TEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEV 145 (714) T ss_pred HHhHHH-----hCCcceEEecCCCCch--------hHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEe Confidence 222222 2555566665432111 01223333 33445556688888899999999988888873 Q ss_pred eecccccccCCcceEEEEecceEEEeeCCCC----CeEEEEEEEeecHHHhhHHHHH---HHHhhc-------------- Q lcl|NC_015159. 148 YIPSTEQVEGQSNAPKLYKLHNFVVERDAYD----NVLQIVTEDKIARAALPEDVRK---SLEEAQ-------------- 206 (532) Q Consensus 148 ~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G----~vd~i~rk~~~~~~~l~~~~~~---~~~~~~-------------- 206 (532) |++++ ..+..++++.+|..++++..++.- .-.-++++.+++.+++-..+.. .+.... T Consensus 146 ~~~~d--~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~ 223 (714) T protein:vir:10 146 RRNSD--PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEG 223 (714) T ss_pred ccccC--CCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccc Confidence 44433 446678999999999999765432 1224778899998887332221 010000 Q ss_pred ---------------c------cCCCcceEEEEEEEEeeC------------------CC-------------------C Q lcl|NC_015159. 207 ---------------G------DQNPSEEVTIYTHVYRDP------------------EA-------------------M 228 (532) Q Consensus 207 ---------------~------~~~~~~~v~i~~~v~~~~------------------~~-------------------~ 228 (532) . ......+|.|+.+.++.. ++ + T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~ 303 (714) T protein:vir:10 224 QPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVS 303 (714) T ss_pred cccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccc Confidence 0 000124566666644311 00 1 Q ss_pred eEEEEEEEcCcccccccccCccccCceEEEEeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecC Q lcl|NC_015159. 229 VFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIK--MPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNP 306 (532) Q Consensus 229 ~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~--~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~ 306 (532) .+..++++....+......|+...|||++.-... ..|..| |.+..+.+-.+.+|+..-..+.+ +..+-++ +.+ T Consensus 304 rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~-~~~ 378 (714) T protein:vir:10 304 RIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVI-MDE 378 (714) T ss_pred eEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCcee-eec Confidence 1222233333333223345666789998654333 567777 58888999999999754444432 3555555 445 Q ss_pred ccccChh-hhc--cCCCceeecCccc---cc---ccccc-CCccchhHHHHHHHHHHHHHHHHH--hhhhcccCCCCCCC Q lcl|NC_015159. 307 NGVTQIR-RVA--KANTGDFVAGRKQ---DV---EVFQL-EKYNDFQVAKATADDIEKRLSYAF--MLNSAVQRGGDRVT 374 (532) Q Consensus 307 ~g~~~~~-~~~--~~~~G~~v~g~~~---~~---~~~~~-~~~~~~~~~~~~i~~~~~rI~~af--~~~~~~~~~~~~~T 374 (532) +++...+ .+. .+.+|+++.-+++ .. .++.. +...-.+.....++...+.|++.- +-..+. ..+...+ T Consensus 379 ~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG-~~~na~S 457 (714) T protein:vir:10 379 DATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLG-QDSGATS 457 (714) T ss_pred CcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcC-CCccchh Confidence 5554432 221 2556666543322 11 11222 222223444455555555554431 111222 2333456 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHh-------------cCCCC--------CCcc-----cc--- Q lcl|NC_015159. 375 AEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQA-------------TSKIP--------NLPK-----EA--- 425 (532) Q Consensus 375 AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-------------~g~lp--------~~p~-----~~--- 425 (532) ..-|..|++...+.|+..+.+|..-+. =+.+.+++++.+ .+... +... .+ T Consensus 458 GvAi~~rq~qg~~~l~~~~Dnl~~~~~-~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~ 536 (714) T protein:vir:10 458 GVAISNLVEQGATTLAEINDNYQFACQ-QVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISR 536 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceeccccee Confidence 667999999999999998877765322 223333333322 11000 0000 00 Q ss_pred ccceee---cchHHHHHHHHHHHHHHHHHHHH----hhcch-hh---hhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHH Q lcl|NC_015159. 426 VEPAIA---TGLEALGRGHDLNKLNVFIDYMI----KLAGL-QD---DDINLLDVKMRLANSLGMDTTGLILTQQDKQAK 494 (532) Q Consensus 426 ~~~~~v---~~l~~l~raq~~~~l~~~~~~la----q~~p~-~~---d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~ 494 (532) .+..++ .+-++-.|.+.++.++.+++.+. .+.+. .+ |.-+.+++++.+-+.+|.....=-.++++.++. T Consensus 537 ~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~ 616 (714) T protein:vir:10 537 LNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVA 616 (714) T ss_pred eeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHH Confidence 111222 23345556666666666665431 11111 12 334678899999888886321111222221111 Q ss_pred HHHHHHHHHH-------------------HHHHH--------------------hhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 495 MAEASTAAGM-------------------VTAGQ--------------------QMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 495 ~~q~~~~~~~-------------------~~~~~--------------------~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .++++.++++ +++.+ ...+...+.++.+++..+++.-+ T Consensus 617 ~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~ 693 (714) T protein:vir:10 617 AQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQE 693 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhh Confidence 1111000000 00000 00000111111122222222222 No 35 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.55 E-value=1.7e-12 Score=85.13 Aligned_cols=501 Identities=10% Similarity=0.056 Sum_probs=229.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHH----HHHHhhcccccCCCCCcc------ccccccc-ccchHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAE----DCATYTIPSVFPSATADG------STSYTTP-WQSIGARGLNN 69 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~----e~~~~~~P~~~~~~~~~~------~~~~~~~-~dst~~~a~~~ 69 (532) |+......+.++...+.+...+.+... .+.|+ +-.+|.. + ..-... .+..+.+ |+=++ -.++. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~R~~a~~d~~fy~---G-~Qw~~~~~~~l~~~g~p~~~~N~i~-~~v~~ 81 (714) T protein:vir:81 8 MATKNDNGATPRFSQRQLQALCSDIDS-QPKWRDAANKACAYYD---G-DQLPPEVLQVLKDRGQPMTIHNLIA-PTVDG 81 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhhc---C-CCCCHHHHHHHHhcCCCcEEeccHH-HHHHH Confidence 444432333444445555555555432 33455 4445442 1 111110 0111122 33332 23333 Q ss_pred HHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceee-- Q lcl|NC_015159. 70 LASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLL-- 147 (532) Q Consensus 70 Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~-- 147 (532) ..+..- .+++=+++.+.+.+.. -.++.+.| +..+......+++..+...++.+.+..|.|++ T Consensus 82 v~g~~~-----~nr~~~~v~p~~~~~~--------~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~ 145 (714) T protein:vir:81 82 VLGMEA-----KTRTDLVVMSDEPDDE--------TEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEV 145 (714) T ss_pred HHhHHH-----hCCcceEEecCCCCch--------hHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEe Confidence 222222 2555566665432111 01223333 33445556688888899999999988888873 Q ss_pred eecccccccCCcceEEEEecceEEEeeCCCC----CeEEEEEEEeecHHHhhHHHHH---HHHhhc-------------- Q lcl|NC_015159. 148 YIPSTEQVEGQSNAPKLYKLHNFVVERDAYD----NVLQIVTEDKIARAALPEDVRK---SLEEAQ-------------- 206 (532) Q Consensus 148 ~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G----~vd~i~rk~~~~~~~l~~~~~~---~~~~~~-------------- 206 (532) |++++ ..+..++++.+|..++++..++.- .-.-++++.+++.+++-..+.. .+.... T Consensus 146 ~~~~d--~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~ 223 (714) T protein:vir:81 146 RRNSD--PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEG 223 (714) T ss_pred ccccC--CCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccc Confidence 44433 446678999999999999765432 1224778899998887332221 010000 Q ss_pred ---------------c------cCCCcceEEEEEEEEeeC------------------CC-------------------C Q lcl|NC_015159. 207 ---------------G------DQNPSEEVTIYTHVYRDP------------------EA-------------------M 228 (532) Q Consensus 207 ---------------~------~~~~~~~v~i~~~v~~~~------------------~~-------------------~ 228 (532) . ......+|.|+.+.++.. ++ + T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~ 303 (714) T protein:vir:81 224 QPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVS 303 (714) T ss_pred cccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccc Confidence 0 000124566666644311 00 1 Q ss_pred eEEEEEEEcCcccccccccCccccCceEEEEeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecC Q lcl|NC_015159. 229 VFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIK--MPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNP 306 (532) Q Consensus 229 ~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~--~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~ 306 (532) .+..++++....+......|+...|||++.-... ..|..| |.+..+.+-.+.+|+..-..+.+ +..+-++ +.+ T Consensus 304 rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~-~~~ 378 (714) T protein:vir:81 304 RIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVI-MDE 378 (714) T ss_pred eEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCcee-eec Confidence 1222233333333223345666789998654333 567777 58888999999999754444432 3555555 445 Q ss_pred ccccChh-hhc--cCCCceeecCccc---cc---ccccc-CCccchhHHHHHHHHHHHHHHHHH--hhhhcccCCCCCCC Q lcl|NC_015159. 307 NGVTQIR-RVA--KANTGDFVAGRKQ---DV---EVFQL-EKYNDFQVAKATADDIEKRLSYAF--MLNSAVQRGGDRVT 374 (532) Q Consensus 307 ~g~~~~~-~~~--~~~~G~~v~g~~~---~~---~~~~~-~~~~~~~~~~~~i~~~~~rI~~af--~~~~~~~~~~~~~T 374 (532) +++...+ .+. .+.+|+++.-+++ .. .++.. +...-.+.....++...+.|++.- +-..+. ..+...+ T Consensus 379 ~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG-~~~na~S 457 (714) T protein:vir:81 379 DATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLG-QDSGATS 457 (714) T ss_pred CcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcC-CCccchh Confidence 5554432 221 2556666543322 11 11222 222223444455555555554431 111222 2333456 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHh-------------cCCCC--------CCcc-----cc--- Q lcl|NC_015159. 375 AEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQA-------------TSKIP--------NLPK-----EA--- 425 (532) Q Consensus 375 AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-------------~g~lp--------~~p~-----~~--- 425 (532) ..-|..|++...+.|+..+.+|..-+. =+.+.+++++.+ .+... +... .+ T Consensus 458 GvAi~~rq~qg~~~l~~~~Dnl~~~~~-~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~ 536 (714) T protein:vir:81 458 GVAISNLVEQGATTLAEINDNYQFACQ-QVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISR 536 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceeccccee Confidence 667999999999999998877765322 223333333322 11000 0000 00 Q ss_pred ccceee---cchHHHHHHHHHHHHHHHHHHHH----hhcch-hh---hhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHH Q lcl|NC_015159. 426 VEPAIA---TGLEALGRGHDLNKLNVFIDYMI----KLAGL-QD---DDINLLDVKMRLANSLGMDTTGLILTQQDKQAK 494 (532) Q Consensus 426 ~~~~~v---~~l~~l~raq~~~~l~~~~~~la----q~~p~-~~---d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~ 494 (532) .+..++ .+-++-.|.+.++.++.+++.+. .+.+. .+ |.-+.+++++.+-+.+|.....=-.++++.++. T Consensus 537 ~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~ 616 (714) T protein:vir:81 537 LNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVA 616 (714) T ss_pred eeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHH Confidence 111222 23345556666666666665431 11111 12 334678899999888886321111222221111 Q ss_pred HHHHHHHHHH-------------------HHHHH--------------------hhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 495 MAEASTAAGM-------------------VTAGQ--------------------QMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 495 ~~q~~~~~~~-------------------~~~~~--------------------~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .++++.++++ +++.+ ...+...+.++.+++..+++.-+ T Consensus 617 ~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~ 693 (714) T protein:vir:81 617 AQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQE 693 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhh Confidence 1111000000 00000 00000111111122222222222 No 36 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.55 E-value=1.7e-12 Score=85.13 Aligned_cols=501 Identities=10% Similarity=0.056 Sum_probs=229.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHH----HHHHhhcccccCCCCCcc------ccccccc-ccchHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAE----DCATYTIPSVFPSATADG------STSYTTP-WQSIGARGLNN 69 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~----e~~~~~~P~~~~~~~~~~------~~~~~~~-~dst~~~a~~~ 69 (532) |+......+.++...+.+...+.+... .+.|+ +-.+|.. + ..-... .+..+.+ |+=++ -.++. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~R~~a~~d~~fy~---G-~Qw~~~~~~~l~~~g~p~~~~N~i~-~~v~~ 81 (714) T protein:vir:99 8 MATKNDNGATPRFSQRQLQALCSDIDS-QPKWRDAANKACAYYD---G-DQLPPEVLQVLKDRGQPMTIHNLIA-PTVDG 81 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhhc---C-CCCCHHHHHHHHhcCCCcEEeccHH-HHHHH Confidence 444432333444445555555555432 33455 4445442 1 111110 0111122 33332 23333 Q ss_pred HHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceee-- Q lcl|NC_015159. 70 LASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLL-- 147 (532) Q Consensus 70 Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~-- 147 (532) ..+..- .+++=+++.+.+.+.. -.++.+.| +..+......+++..+...++.+.+..|.|++ T Consensus 82 v~g~~~-----~nr~~~~v~p~~~~~~--------~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~ 145 (714) T protein:vir:99 82 VLGMEA-----KTRTDLVVMSDEPDDE--------TEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEV 145 (714) T ss_pred HHhHHH-----hCCcceEEecCCCCch--------hHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEe Confidence 222222 2555566665432111 01223333 33445556688888899999999988888873 Q ss_pred eecccccccCCcceEEEEecceEEEeeCCCC----CeEEEEEEEeecHHHhhHHHHH---HHHhhc-------------- Q lcl|NC_015159. 148 YIPSTEQVEGQSNAPKLYKLHNFVVERDAYD----NVLQIVTEDKIARAALPEDVRK---SLEEAQ-------------- 206 (532) Q Consensus 148 ~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G----~vd~i~rk~~~~~~~l~~~~~~---~~~~~~-------------- 206 (532) |++++ ..+..++++.+|..++++..++.- .-.-++++.+++.+++-..+.. .+.... T Consensus 146 ~~~~d--~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~ 223 (714) T protein:vir:99 146 RRNSD--PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEG 223 (714) T ss_pred ccccC--CCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccc Confidence 44433 446678999999999999765432 1224778899998887332221 010000 Q ss_pred ---------------c------cCCCcceEEEEEEEEeeC------------------CC-------------------C Q lcl|NC_015159. 207 ---------------G------DQNPSEEVTIYTHVYRDP------------------EA-------------------M 228 (532) Q Consensus 207 ---------------~------~~~~~~~v~i~~~v~~~~------------------~~-------------------~ 228 (532) . ......+|.|+.+.++.. ++ + T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~ 303 (714) T protein:vir:99 224 QPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVS 303 (714) T ss_pred cccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccc Confidence 0 000124566666644311 00 1 Q ss_pred eEEEEEEEcCcccccccccCccccCceEEEEeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecC Q lcl|NC_015159. 229 VFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIK--MPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNP 306 (532) Q Consensus 229 ~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~--~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~ 306 (532) .+..++++....+......|+...|||++.-... ..|..| |.+..+.+-.+.+|+..-..+.+ +..+-++ +.+ T Consensus 304 rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~-~~~ 378 (714) T protein:vir:99 304 RIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVI-MDE 378 (714) T ss_pred eEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCcee-eec Confidence 1222233333333223345666789998654333 567777 58888999999999754444432 3555555 445 Q ss_pred ccccChh-hhc--cCCCceeecCccc---cc---ccccc-CCccchhHHHHHHHHHHHHHHHHH--hhhhcccCCCCCCC Q lcl|NC_015159. 307 NGVTQIR-RVA--KANTGDFVAGRKQ---DV---EVFQL-EKYNDFQVAKATADDIEKRLSYAF--MLNSAVQRGGDRVT 374 (532) Q Consensus 307 ~g~~~~~-~~~--~~~~G~~v~g~~~---~~---~~~~~-~~~~~~~~~~~~i~~~~~rI~~af--~~~~~~~~~~~~~T 374 (532) +++...+ .+. .+.+|+++.-+++ .. .++.. +...-.+.....++...+.|++.- +-..+. ..+...+ T Consensus 379 ~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG-~~~na~S 457 (714) T protein:vir:99 379 DATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLG-QDSGATS 457 (714) T ss_pred CcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcC-CCccchh Confidence 5554432 221 2556666543322 11 11222 222223444455555555554431 111222 2333456 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHh-------------cCCCC--------CCcc-----cc--- Q lcl|NC_015159. 375 AEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQA-------------TSKIP--------NLPK-----EA--- 425 (532) Q Consensus 375 AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-------------~g~lp--------~~p~-----~~--- 425 (532) ..-|..|++...+.|+..+.+|..-+. =+.+.+++++.+ .+... +... .+ T Consensus 458 GvAi~~rq~qg~~~l~~~~Dnl~~~~~-~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~ 536 (714) T protein:vir:99 458 GVAISNLVEQGATTLAEINDNYQFACQ-QVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISR 536 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceeccccee Confidence 667999999999999998877765322 223333333322 11000 0000 00 Q ss_pred ccceee---cchHHHHHHHHHHHHHHHHHHHH----hhcch-hh---hhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHH Q lcl|NC_015159. 426 VEPAIA---TGLEALGRGHDLNKLNVFIDYMI----KLAGL-QD---DDINLLDVKMRLANSLGMDTTGLILTQQDKQAK 494 (532) Q Consensus 426 ~~~~~v---~~l~~l~raq~~~~l~~~~~~la----q~~p~-~~---d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~ 494 (532) .+..++ .+-++-.|.+.++.++.+++.+. .+.+. .+ |.-+.+++++.+-+.+|.....=-.++++.++. T Consensus 537 ~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~ 616 (714) T protein:vir:99 537 LNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVA 616 (714) T ss_pred eeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHH Confidence 111222 23345556666666666665431 11111 12 334678899999888886321111222221111 Q ss_pred HHHHHHHHHH-------------------HHHHH--------------------hhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 495 MAEASTAAGM-------------------VTAGQ--------------------QMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 495 ~~q~~~~~~~-------------------~~~~~--------------------~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .++++.++++ +++.+ ...+...+.++.+++..+++.-+ T Consensus 617 ~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~ 693 (714) T protein:vir:99 617 AQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQE 693 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhh Confidence 1111000000 00000 00000111111122222222222 No 37 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.55 E-value=1.7e-12 Score=85.13 Aligned_cols=501 Identities=10% Similarity=0.056 Sum_probs=229.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHH----HHHHhhcccccCCCCCcc------ccccccc-ccchHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAE----DCATYTIPSVFPSATADG------STSYTTP-WQSIGARGLNN 69 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~----e~~~~~~P~~~~~~~~~~------~~~~~~~-~dst~~~a~~~ 69 (532) |+......+.++...+.+...+.+... .+.|+ +-.+|.. + ..-... .+..+.+ |+=++ -.++. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~R~~a~~d~~fy~---G-~Qw~~~~~~~l~~~g~p~~~~N~i~-~~v~~ 81 (714) T protein:vir:32 8 MATKNDNGATPRFSQRQLQALCSDIDS-QPKWRDAANKACAYYD---G-DQLPPEVLQVLKDRGQPMTIHNLIA-PTVDG 81 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhhc---C-CCCCHHHHHHHHhcCCCcEEeccHH-HHHHH Confidence 444432333444445555555555432 33455 4445442 1 111110 0111122 33332 23333 Q ss_pred HHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceee-- Q lcl|NC_015159. 70 LASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLL-- 147 (532) Q Consensus 70 Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~-- 147 (532) ..+..- .+++=+++.+.+.+.. -.++.+.| +..+......+++..+...++.+.+..|.|++ T Consensus 82 v~g~~~-----~nr~~~~v~p~~~~~~--------~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~ 145 (714) T protein:vir:32 82 VLGMEA-----KTRTDLVVMSDEPDDE--------TEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEV 145 (714) T ss_pred HHhHHH-----hCCcceEEecCCCCch--------hHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEe Confidence 222222 2555566665432111 01223333 33445556688888899999999988888873 Q ss_pred eecccccccCCcceEEEEecceEEEeeCCCC----CeEEEEEEEeecHHHhhHHHHH---HHHhhc-------------- Q lcl|NC_015159. 148 YIPSTEQVEGQSNAPKLYKLHNFVVERDAYD----NVLQIVTEDKIARAALPEDVRK---SLEEAQ-------------- 206 (532) Q Consensus 148 ~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G----~vd~i~rk~~~~~~~l~~~~~~---~~~~~~-------------- 206 (532) |++++ ..+..++++.+|..++++..++.- .-.-++++.+++.+++-..+.. .+.... T Consensus 146 ~~~~d--~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~ 223 (714) T protein:vir:32 146 RRNSD--PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEG 223 (714) T ss_pred ccccC--CCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccc Confidence 44433 446678999999999999765432 1224778899998887332221 010000 Q ss_pred ---------------c------cCCCcceEEEEEEEEeeC------------------CC-------------------C Q lcl|NC_015159. 207 ---------------G------DQNPSEEVTIYTHVYRDP------------------EA-------------------M 228 (532) Q Consensus 207 ---------------~------~~~~~~~v~i~~~v~~~~------------------~~-------------------~ 228 (532) . ......+|.|+.+.++.. ++ + T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~ 303 (714) T protein:vir:32 224 QPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVS 303 (714) T ss_pred cccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccc Confidence 0 000124566666644311 00 1 Q ss_pred eEEEEEEEcCcccccccccCccccCceEEEEeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecC Q lcl|NC_015159. 229 VFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIK--MPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNP 306 (532) Q Consensus 229 ~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~--~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~ 306 (532) .+..++++....+......|+...|||++.-... ..|..| |.+..+.+-.+.+|+..-..+.+ +..+-++ +.+ T Consensus 304 rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~-~~~ 378 (714) T protein:vir:32 304 RIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVI-MDE 378 (714) T ss_pred eEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCcee-eec Confidence 1222233333333223345666789998654333 567777 58888999999999754444432 3555555 445 Q ss_pred ccccChh-hhc--cCCCceeecCccc---cc---ccccc-CCccchhHHHHHHHHHHHHHHHHH--hhhhcccCCCCCCC Q lcl|NC_015159. 307 NGVTQIR-RVA--KANTGDFVAGRKQ---DV---EVFQL-EKYNDFQVAKATADDIEKRLSYAF--MLNSAVQRGGDRVT 374 (532) Q Consensus 307 ~g~~~~~-~~~--~~~~G~~v~g~~~---~~---~~~~~-~~~~~~~~~~~~i~~~~~rI~~af--~~~~~~~~~~~~~T 374 (532) +++...+ .+. .+.+|+++.-+++ .. .++.. +...-.+.....++...+.|++.- +-..+. ..+...+ T Consensus 379 ~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG-~~~na~S 457 (714) T protein:vir:32 379 DATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLG-QDSGATS 457 (714) T ss_pred CcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcC-CCccchh Confidence 5554432 221 2556666543322 11 11222 222223444455555555554431 111222 2333456 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHh-------------cCCCC--------CCcc-----cc--- Q lcl|NC_015159. 375 AEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQA-------------TSKIP--------NLPK-----EA--- 425 (532) Q Consensus 375 AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-------------~g~lp--------~~p~-----~~--- 425 (532) ..-|..|++...+.|+..+.+|..-+. =+.+.+++++.+ .+... +... .+ T Consensus 458 GvAi~~rq~qg~~~l~~~~Dnl~~~~~-~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~ 536 (714) T protein:vir:32 458 GVAISNLVEQGATTLAEINDNYQFACQ-QVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISR 536 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceeccccee Confidence 667999999999999998877765322 223333333322 11000 0000 00 Q ss_pred ccceee---cchHHHHHHHHHHHHHHHHHHHH----hhcch-hh---hhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHH Q lcl|NC_015159. 426 VEPAIA---TGLEALGRGHDLNKLNVFIDYMI----KLAGL-QD---DDINLLDVKMRLANSLGMDTTGLILTQQDKQAK 494 (532) Q Consensus 426 ~~~~~v---~~l~~l~raq~~~~l~~~~~~la----q~~p~-~~---d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~ 494 (532) .+..++ .+-++-.|.+.++.++.+++.+. .+.+. .+ |.-+.+++++.+-+.+|.....=-.++++.++. T Consensus 537 ~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~ 616 (714) T protein:vir:32 537 LNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVA 616 (714) T ss_pred eeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHH Confidence 111222 23345556666666666665431 11111 12 334678899999888886321111222221111 Q ss_pred HHHHHHHHHH-------------------HHHHH--------------------hhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 495 MAEASTAAGM-------------------VTAGQ--------------------QMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 495 ~~q~~~~~~~-------------------~~~~~--------------------~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .++++.++++ +++.+ ...+...+.++.+++..+++.-+ T Consensus 617 ~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~ 693 (714) T protein:vir:32 617 AQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQE 693 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhh Confidence 1111000000 00000 00000111111122222222222 No 38 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.55 E-value=1.7e-12 Score=85.13 Aligned_cols=501 Identities=10% Similarity=0.056 Sum_probs=229.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHH----HHHHhhcccccCCCCCcc------ccccccc-ccchHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAE----DCATYTIPSVFPSATADG------STSYTTP-WQSIGARGLNN 69 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~----e~~~~~~P~~~~~~~~~~------~~~~~~~-~dst~~~a~~~ 69 (532) |+......+.++...+.+...+.+... .+.|+ +-.+|.. + ..-... .+..+.+ |+=++ -.++. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~R~~a~~d~~fy~---G-~Qw~~~~~~~l~~~g~p~~~~N~i~-~~v~~ 81 (714) T protein:vir:27 8 MATKNDNGATPRFSQRQLQALCSDIDS-QPKWRDAANKACAYYD---G-DQLPPEVLQVLKDRGQPMTIHNLIA-PTVDG 81 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhhc---C-CCCCHHHHHHHHhcCCCcEEeccHH-HHHHH Confidence 444432333444445555555555432 33455 4445442 1 111110 0111122 33332 23333 Q ss_pred HHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceee-- Q lcl|NC_015159. 70 LASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLL-- 147 (532) Q Consensus 70 Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~-- 147 (532) ..+..- .+++=+++.+.+.+.. -.++.+.| +..+......+++..+...++.+.+..|.|++ T Consensus 82 v~g~~~-----~nr~~~~v~p~~~~~~--------~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~ 145 (714) T protein:vir:27 82 VLGMEA-----KTRTDLVVMSDEPDDE--------TEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEV 145 (714) T ss_pred HHhHHH-----hCCcceEEecCCCCch--------hHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEe Confidence 222222 2555566665432111 01223333 33445556688888899999999988888873 Q ss_pred eecccccccCCcceEEEEecceEEEeeCCCC----CeEEEEEEEeecHHHhhHHHHH---HHHhhc-------------- Q lcl|NC_015159. 148 YIPSTEQVEGQSNAPKLYKLHNFVVERDAYD----NVLQIVTEDKIARAALPEDVRK---SLEEAQ-------------- 206 (532) Q Consensus 148 ~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G----~vd~i~rk~~~~~~~l~~~~~~---~~~~~~-------------- 206 (532) |++++ ..+..++++.+|..++++..++.- .-.-++++.+++.+++-..+.. .+.... T Consensus 146 ~~~~d--~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~ 223 (714) T protein:vir:27 146 RRNSD--PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEG 223 (714) T ss_pred ccccC--CCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccc Confidence 44433 446678999999999999765432 1224778899998887332221 010000 Q ss_pred ---------------c------cCCCcceEEEEEEEEeeC------------------CC-------------------C Q lcl|NC_015159. 207 ---------------G------DQNPSEEVTIYTHVYRDP------------------EA-------------------M 228 (532) Q Consensus 207 ---------------~------~~~~~~~v~i~~~v~~~~------------------~~-------------------~ 228 (532) . ......+|.|+.+.++.. ++ + T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~ 303 (714) T protein:vir:27 224 QPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVS 303 (714) T ss_pred cccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccc Confidence 0 000124566666644311 00 1 Q ss_pred eEEEEEEEcCcccccccccCccccCceEEEEeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecC Q lcl|NC_015159. 229 VFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIK--MPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNP 306 (532) Q Consensus 229 ~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~--~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~ 306 (532) .+..++++....+......|+...|||++.-... ..|..| |.+..+.+-.+.+|+..-..+.+ +..+-++ +.+ T Consensus 304 rv~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~-~~~ 378 (714) T protein:vir:27 304 RIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVI-MDE 378 (714) T ss_pred eEEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCcee-eec Confidence 1222233333333223345666789998654333 567777 58888999999999754444432 3555555 445 Q ss_pred ccccChh-hhc--cCCCceeecCccc---cc---ccccc-CCccchhHHHHHHHHHHHHHHHHH--hhhhcccCCCCCCC Q lcl|NC_015159. 307 NGVTQIR-RVA--KANTGDFVAGRKQ---DV---EVFQL-EKYNDFQVAKATADDIEKRLSYAF--MLNSAVQRGGDRVT 374 (532) Q Consensus 307 ~g~~~~~-~~~--~~~~G~~v~g~~~---~~---~~~~~-~~~~~~~~~~~~i~~~~~rI~~af--~~~~~~~~~~~~~T 374 (532) +++...+ .+. .+.+|+++.-+++ .. .++.. +...-.+.....++...+.|++.- +-..+. ..+...+ T Consensus 379 ~a~~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG-~~~na~S 457 (714) T protein:vir:27 379 DATQLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLG-QDSGATS 457 (714) T ss_pred CcccccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcC-CCccchh Confidence 5554432 221 2556666543322 11 11222 222223444455555555554431 111222 2333456 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHh-------------cCCCC--------CCcc-----cc--- Q lcl|NC_015159. 375 AEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQA-------------TSKIP--------NLPK-----EA--- 425 (532) Q Consensus 375 AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-------------~g~lp--------~~p~-----~~--- 425 (532) ..-|..|++...+.|+..+.+|..-+. =+.+.+++++.+ .+... +... .+ T Consensus 458 GvAi~~rq~qg~~~l~~~~Dnl~~~~~-~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~ 536 (714) T protein:vir:27 458 GVAISNLVEQGATTLAEINDNYQFACQ-QVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISR 536 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceeccccee Confidence 667999999999999998877765322 223333333322 11000 0000 00 Q ss_pred ccceee---cchHHHHHHHHHHHHHHHHHHHH----hhcch-hh---hhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHH Q lcl|NC_015159. 426 VEPAIA---TGLEALGRGHDLNKLNVFIDYMI----KLAGL-QD---DDINLLDVKMRLANSLGMDTTGLILTQQDKQAK 494 (532) Q Consensus 426 ~~~~~v---~~l~~l~raq~~~~l~~~~~~la----q~~p~-~~---d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~ 494 (532) .+..++ .+-++-.|.+.++.++.+++.+. .+.+. .+ |.-+.+++++.+-+.+|.....=-.++++.++. T Consensus 537 ~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~ 616 (714) T protein:vir:27 537 LNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVA 616 (714) T ss_pred eeEEEEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHH Confidence 111222 23345556666666666665431 11111 12 334678899999888886321111222221111 Q ss_pred HHHHHHHHHH-------------------HHHHH--------------------hhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 495 MAEASTAAGM-------------------VTAGQ--------------------QMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 495 ~~q~~~~~~~-------------------~~~~~--------------------~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .++++.++++ +++.+ ...+...+.++.+++..+++.-+ T Consensus 617 ~~~q~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~ 693 (714) T protein:vir:27 617 AQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQE 693 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhh Confidence 1111000000 00000 00000111111122222222222 No 39 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.54 E-value=9.7e-14 Score=91.94 Aligned_cols=505 Identities=13% Similarity=0.104 Sum_probs=231.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhc-ccccCCCCCcc----ccc---cccc-ccchHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTI-PSVFPSATADG----STS---YTTP-WQSIGARGLNNLA 71 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~-P~~~~~~~~~~----~~~---~~~~-~dst~~~a~~~La 71 (532) ||+..+.- .+.+..||....+..+.|...+.+=.+|.. +..-=+..... +.. .+.+ |+=++. .++... T Consensus 1 m~~~~~~~--~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~-~v~~v~ 77 (708) T protein:vir:10 1 MAETLEKK--HERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVAT-ELNRII 77 (708) T ss_pred CchhHHHH--HHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHH-HHHHHH Confidence 99986421 355677777776666666666655544432 21100000000 000 1111 333332 333333 Q ss_pred HHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecc Q lcl|NC_015159. 72 SKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPS 151 (532) Q Consensus 72 a~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~ 151 (532) +.-. .+++=+++.+.+..-. .++.+.| +..+......++...+...++.+.+..|-|++-+-. T Consensus 78 g~~~-----~nr~d~~v~P~~~~~d---------~~~Ae~l---~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~ 140 (708) T protein:vir:10 78 AEYR-----NNRITVKFRPGDREAS---------EELANKL---NGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTS 140 (708) T ss_pred HHHH-----hCCcceEEEcCCCCch---------HHHHHHH---HHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeee Confidence 2222 2555566666532211 1223333 333444556888888999999999999999764411 Q ss_pred ------cccccCCcceEEEE--ecceEEEeeCC---CCC-eEEEEEEEeecHHHhhHHHHHHHHh----hccc--CC--- Q lcl|NC_015159. 152 ------TEQVEGQSNAPKLY--KLHNFVVERDA---YDN-VLQIVTEDKIARAALPEDVRKSLEE----AQGD--QN--- 210 (532) Q Consensus 152 ------~~~~~~~~~~~~~~--pl~~~~v~~d~---~G~-vd~i~rk~~~~~~~l~~~~~~~~~~----~~~~--~~--- 210 (532) +......++.++++ |..++++.-++ ++. -.-+||..+++.+++-..+...... .... .+ T Consensus 141 d~~~e~d~~~~~~~i~i~~~~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~ 220 (708) T protein:vir:10 141 MLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWF 220 (708) T ss_pred ccccccCCCCCccccceEEeecchhhcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCcccccc Confidence 11112234444443 44566655332 221 2236778888988775444322111 0000 00 Q ss_pred CcceEEEEEE-----------EEeeCCC-------------------------------CeEEEE-EEEcCccccccccc Q lcl|NC_015159. 211 PSEEVTIYTH-----------VYRDPEA-------------------------------MVFRSY-QEIDGEIVAGTEGE 247 (532) Q Consensus 211 ~~~~v~i~~~-----------v~~~~~~-------------------------------~~~~s~-~~~~~~~~~~~~~~ 247 (532) ..+.+.|..+ +.+++.+ +.++.+ +...|..+....+. T Consensus 221 ~~d~v~v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~ 300 (708) T protein:vir:10 221 GADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRR 300 (708) T ss_pred CCCceEEEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCC Confidence 0011211111 1122110 112222 22234443333466 Q ss_pred CccccCceEEEEeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhcc-------- Q lcl|NC_015159. 248 YPLDSCPWIPVRLIK--MPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAK-------- 317 (532) Q Consensus 248 ~g~~~~P~~~~Rw~~--~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~-------- 317 (532) +++..|||++.-+.. .+|..++.|.+..+.+-.+.+|+..-..+.++..+-+.+++++++.+.....-.. T Consensus 301 ~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~ 380 (708) T protein:vir:10 301 IPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPA 380 (708) T ss_pred CCCCceeeEEEeeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchh Confidence 888999998774433 4677877899999999999999999999999988888888888776544322211 Q ss_pred --------CCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHH--hhhhcccCCCCCCCHHHHHHHHHHHHH Q lcl|NC_015159. 318 --------ANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAF--MLNSAVQRGGDRVTAEEIRYVAGELED 387 (532) Q Consensus 318 --------~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af--~~~~~~~~~~~~~TAtEi~~r~~E~~~ 387 (532) ..+|.++++... +.......--+.....++...+.|.+.. .-.++.+ .+ .++..-|..|.+.... T Consensus 381 ~~~~~~~~~~~G~~~~~~~~---~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~-~s-n~SG~aI~~rq~qg~~ 455 (708) T protein:vir:10 381 FLPLREVRDKSGNIIAGATP---AGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PS-NIAQETVNNLMNRADM 455 (708) T ss_pred hhccccccccccccccccCC---ccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccC-cc-chHHHHHHHHHHHHHH Confidence 112222221110 0001111111223344444445554442 2222322 22 3578889999999999 Q ss_pred HhhhhHHHHHH------HHHHHHHHHHH------HHHHhcCC----------CCCCccc-----cc---cceee---cch Q lcl|NC_015159. 388 TLGGVYSLLSQ------ELQLPLVKILL------KELQATSK----------IPNLPKE-----AV---EPAIA---TGL 434 (532) Q Consensus 388 ~LGpv~~rl~~------E~l~Pli~r~~------~il~r~g~----------lp~~p~~-----~~---~~~~v---~~l 434 (532) .|+..+.+|.. +++.-||...+ .|....|. .++-.+. ++ +..++ .+- T Consensus 456 ~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~ 535 (708) T protein:vir:10 456 ASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPS 535 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccC Confidence 99999988764 33333333222 22221221 1111111 11 11232 233 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcc-------hhh---hhcCHHHHHHHHHHhcCCCHhHccCCH-HHHHHHHHHHHHHHH Q lcl|NC_015159. 435 EALGRGHDLNKLNVFIDYMIKLAG-------LQD---DDINLLDVKMRLANSLGMDTTGLILTQ-QDKQAKMAEASTAAG 503 (532) Q Consensus 435 ~~l~raq~~~~l~~~~~~laq~~p-------~~~---d~id~d~~~~~~a~~~Gv~p~~i~~s~-ee~~~~~~q~~~~~~ 503 (532) .+-.|.+.++.++.+++.+....| .++ |--+.++++..+-..++. ....... +|.+++.++++++++ T Consensus 536 ~~s~r~~~~~~l~qll~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~~~--~~~~~~~~~ee~q~~~~~q~~~q 613 (708) T protein:vir:10 536 YTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLI--SGIAKPRNEKEQQIVQQAQMAAQ 613 (708) T ss_pred chhHHHHHHHHHHHHHHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHhhcc--cccccccchhhHHHHHHHHHHHH Confidence 445677777777777776543222 222 233556777777666553 1122221 121111111111100 Q ss_pred H------HHHHHh-------hhHHHH-------HHHHhhccccc--C----CCCC Q lcl|NC_015159. 504 M------VTAGQQ-------MGAAGG-------QAAAAMMQQQA--G----LPTQ 532 (532) Q Consensus 504 ~------~~~~~~-------~~~~~~-------~~~~~~~~~~~--g----~~~~ 532 (532) + .++.+. +..+.. .++....+... + .+.+ T Consensus 614 ~q~~~~~~e~qa~~~~~qAe~~ka~a~a~~~~~~a~q~~~~~~~a~~~a~q~~~~ 668 (708) T protein:vir:10 614 SQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQ 668 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 000000 000000 00000000000 0 0000 No 40 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=99.50 E-value=3.2e-13 Score=89.09 Aligned_cols=503 Identities=9% Similarity=0.013 Sum_probs=228.4 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccccc-ccc-cccchHHHHHHHHHHHHHHhh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTS-YTT-PWQSIGARGLNNLASKLMLAL 78 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~-~~~-~~dst~~~a~~~Laa~l~~~l 78 (532) ||..++ .-+++..||....+....|.....+=.+|..=. -=++......+ ..+ .|+-++. .++.+ .+.- T Consensus 1 m~d~~~---~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~-Qw~~~~~~~l~~q~rp~~N~i~~-~i~~v----~g~e 71 (725) T protein:vir:92 1 MADNEN---RLESILSRFDADWTASDEARREAKNDLFFSRIS-QWDDWLSQYTTLQYRGQFDVVRP-VVRKL----VSEM 71 (725) T ss_pred CCchHH---HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCC-CCCHHHHHHHHhcCCCcccchHH-HHHHH----HhhH Confidence 998743 466777777777776666666666666665411 00000000001 011 2333332 22222 2221 Q ss_pred cCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeee-----cccc Q lcl|NC_015159. 79 FPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYI-----PSTE 153 (532) Q Consensus 79 tpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v-----~~~~ 153 (532) - .+++=+++.+.++... ++.+.|+. .+......|+..-+...++.+.+..|.|++=| +++. T Consensus 72 ~-~nr~d~~v~P~~~~d~----------~~Ae~l~~---~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~ 137 (725) T protein:vir:92 72 R-QNPIDVLYRPKDGASP----------DAADVLMG---MYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSP 137 (725) T ss_pred H-hCCcceEEecCCccHH----------HHHHHHHH---HHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCC Confidence 1 1555566666543211 23333333 33344558899999999999999999996422 2221 Q ss_pred cccCCcceEEEEe----cceEEEeeCCC---CCeEE--EEEEEeecHHHh---hHHHHHHHHh--------hcccC-CCc Q lcl|NC_015159. 154 QVEGQSNAPKLYK----LHNFVVERDAY---DNVLQ--IVTEDKIARAAL---PEDVRKSLEE--------AQGDQ-NPS 212 (532) Q Consensus 154 ~~~~~~~~~~~~p----l~~~~v~~d~~---G~vd~--i~rk~~~~~~~l---~~~~~~~~~~--------~~~~~-~~~ 212 (532) . +..+..+..| +.++++..++. +. |. +||..+++.+.+ .+.+...... .+... ... T Consensus 138 ~--~~~~~i~~~~i~~~~~~V~~Dp~a~~~D~s-Dar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (725) T protein:vir:92 138 T--SNNQVIRREPIHSACSHVIWDSNSKLMDKS-DSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQ 214 (725) T ss_pred C--CCceeeEEeeccCChhhcccCchhhccChh-hHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCC Confidence 1 2334444444 44555554332 21 22 456677776533 2222211100 00000 012 Q ss_pred ceEEEEEEEEee-----------CCC-------------------------------CeEEEEEE-EcCcccccccccCc Q lcl|NC_015159. 213 EEVTIYTHVYRD-----------PEA-------------------------------MVFRSYQE-IDGEIVAGTEGEYP 249 (532) Q Consensus 213 ~~v~i~~~v~~~-----------~~~-------------------------------~~~~s~~~-~~~~~~~~~~~~~g 249 (532) +.|.|+.+.++. +.+ +.++.+.+ ..|..+......++ T Consensus 215 d~vrv~e~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~ 294 (725) T protein:vir:92 215 DTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIA 294 (725) T ss_pred CeEEEEEEEEEEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCC Confidence 345554443321 111 01122221 23443322233566 Q ss_pred cccCceEEEE--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCCce-ee-- Q lcl|NC_015159. 250 LDSCPWIPVR--LIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGD-FV-- 324 (532) Q Consensus 250 ~~~~P~~~~R--w~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~G~-~v-- 324 (532) .+.|||++.- ....+|..|+.|.+....+-.+.+|+..-..+..+..+.+.++++..+.+-..+.....+.+. ++ T Consensus 295 ~~~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 374 (725) T protein:vir:92 295 GEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLL 374 (725) T ss_pred CCceeeEEEEeeeeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCccceeec Confidence 6779998653 223689999999999999999999999999999998888888888776553322222111111 11 Q ss_pred ---cCcccccc--ccc-cCCccchhHHHHHHHHHHHHHHHHH--hhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHH Q lcl|NC_015159. 325 ---AGRKQDVE--VFQ-LEKYNDFQVAKATADDIEKRLSYAF--MLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLL 396 (532) Q Consensus 325 ---~g~~~~~~--~~~-~~~~~~~~~~~~~i~~~~~rI~~af--~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl 396 (532) +...+.+. ++. .....-.+.....++..++.|.+.- .-..+. ..+..++.--|..|++.....|+..+..| T Consensus 375 ~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG-~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl 453 (725) T protein:vir:92 375 NRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEA-VNGGQVAYDTVNQLNMRADLETYVFQDNL 453 (725) T ss_pred cccccccccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhc-cCchhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111111 111 1222223445566666666666653 222232 23334666778899999999999888766 Q ss_pred HH------HHHHHHHHHHH------HHHHhcCC-----CCC-Ccc----------cc---cccee-ecchHHHHHHHHHH Q lcl|NC_015159. 397 SQ------ELQLPLVKILL------KELQATSK-----IPN-LPK----------EA---VEPAI-ATGLEALGRGHDLN 444 (532) Q Consensus 397 ~~------E~l~Pli~r~~------~il~r~g~-----lp~-~p~----------~~---~~~~~-v~~l~~l~raq~~~ 444 (532) .. +.+.-||...+ .|+...|. |.+ .++ ++ +.+.+ +.+-.+-.|.+.++ T Consensus 454 ~~~~~~~g~~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~ 533 (725) T protein:vir:92 454 ATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRA 533 (725) T ss_pred HHHHHHHHHHHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHH Confidence 54 33344443332 22222221 101 000 01 11222 22333445666677 Q ss_pred HHHHHHHHHHhhcchh-------hhhcCH---HHHHHHHHHhcCCCHhHcc--CCHHHHHHHHHHHHHHHHHHHH--HHh Q lcl|NC_015159. 445 KLNVFIDYMIKLAGLQ-------DDDINL---LDVKMRLANSLGMDTTGLI--LTQQDKQAKMAEASTAAGMVTA--GQQ 510 (532) Q Consensus 445 ~l~~~~~~laq~~p~~-------~d~id~---d~~~~~~a~~~Gv~p~~i~--~s~ee~~~~~~q~~~~~~~~~~--~~~ 510 (532) .++.+++.+.+..|.. ++..|. +++++.+....+ +.... .++++.++..++++.+++++.+ .++ T Consensus 534 ~l~ql~~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~--~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~ 611 (725) T protein:vir:92 534 EILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLI--QMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQA 611 (725) T ss_pred HHHHHHHhcccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhc--hhccCCccchhhhHHHHHHHHHHHhhhHHHHHHH Confidence 7776666554433322 122333 445555544333 22111 1222222221111111100000 000 Q ss_pred h------hHHHHHHHHhh-----------------------cccccCCCCC Q lcl|NC_015159. 511 M------GAAGGQAAAAM-----------------------MQQQAGLPTQ 532 (532) Q Consensus 511 ~------~~~~~~~~~~~-----------------------~~~~~g~~~~ 532 (532) . .+...++.+.. +-.+..+..+ T Consensus 612 qa~~~~~qae~~kaqaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~ 662 (725) T protein:vir:92 612 QGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQ 662 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHH Confidence 0 00000000000 0000000000 No 41 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.48 E-value=4.2e-12 Score=82.97 Aligned_cols=501 Identities=10% Similarity=0.029 Sum_probs=224.0 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCc---cccc-ccc-cccchHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATAD---GSTS-YTT-PWQSIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~---~~~~-~~~-~~dst~~~a~~~Laa~l~ 75 (532) ||.-+. .-+++..||....+....|.....+=.+|.. + ..-.. ...+ ..+ .|+=++. .++.+.+.-- T Consensus 1 m~d~~~---~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~---G-~Qw~~~~~~~l~~q~rp~~N~i~~-~i~~v~g~~~ 72 (725) T protein:vir:77 1 MADNEN---RLESILSRFDADWTASDEARREAKNDLFFSR---V-SQWDDWLSQYTTLQYRGQFDVVRP-VVRKLVSEMR 72 (725) T ss_pred CCchHH---HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhC---C-CCCCHHHHHHHHhcCCCccccHHH-HHHHHHhhHH Confidence 887542 3556777777666666666555555555553 1 11111 0001 111 2322222 2333222221 Q ss_pred HhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeee-----c Q lcl|NC_015159. 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYI-----P 150 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v-----~ 150 (532) .+++=+++.+.++... ++.+.|+. .+......|++.-+-..++.+.+..|.|++=| + T Consensus 73 -----~nr~d~~v~P~~~~d~----------~~Ae~l~~---~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~ 134 (725) T protein:vir:77 73 -----QNPIDVLYRPKDGARP----------DAADVLMG---MYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYED 134 (725) T ss_pred -----hCCcceEEecCCccHH----------HHHHHHHH---HHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccC Confidence 2556566666543211 23333332 33344558889999999999999999997522 2 Q ss_pred ccccccCCcceEEEEe----cceEEEeeCCCCC--eEE--EEEEEeecHHHh---hHHHHHHHHhhc-----cc----CC Q lcl|NC_015159. 151 STEQVEGQSNAPKLYK----LHNFVVERDAYDN--VLQ--IVTEDKIARAAL---PEDVRKSLEEAQ-----GD----QN 210 (532) Q Consensus 151 ~~~~~~~~~~~~~~~p----l~~~~v~~d~~G~--vd~--i~rk~~~~~~~l---~~~~~~~~~~~~-----~~----~~ 210 (532) ++.. +..+..+.+| ..++++..++.-. -|. +||..+++.+.+ .+.......... .. -. T Consensus 135 ~d~~--~~~~~i~~~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (725) T protein:vir:77 135 QSPT--SNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWL 212 (725) T ss_pred CCCC--CCceeeEEeecccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhccccccccccccccc Confidence 2211 2333444443 4455665433210 122 567777887643 222221111000 00 00 Q ss_pred CcceEEEEEEEEeeCC-----------C-------------------------------CeEEEEEE-EcCccccccccc Q lcl|NC_015159. 211 PSEEVTIYTHVYRDPE-----------A-------------------------------MVFRSYQE-IDGEIVAGTEGE 247 (532) Q Consensus 211 ~~~~v~i~~~v~~~~~-----------~-------------------------------~~~~s~~~-~~~~~~~~~~~~ 247 (532) ..+.|.|+.+.++.+. + +.++.+.. ..|..+...... T Consensus 213 ~~d~vrv~E~~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~ 292 (725) T protein:vir:77 213 TQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQL 292 (725) T ss_pred CCCeeEEEEEEEEEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCc Confidence 1234444444332110 0 01122211 244433222345 Q ss_pred CccccCceEEEE--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCCce--e Q lcl|NC_015159. 248 YPLDSCPWIPVR--LIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGD--F 323 (532) Q Consensus 248 ~g~~~~P~~~~R--w~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~G~--~ 323 (532) |+.+.|||++.- ....+|..|+.|.+....+-.+.+|+.....+.....+.+.++++..+.+-..+......++. + T Consensus 293 ~~~~~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 372 (725) T protein:vir:77 293 IAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYY 372 (725) T ss_pred CCCCccceEEEeeeeeccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHhccCCcee Confidence 667789998543 335789999999999999999999999999999888888888888766443222221111111 1 Q ss_pred ----ecCccccc--cccccCCccch-hHHHHHHHHHHHHHHHHH--hhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHH Q lcl|NC_015159. 324 ----VAGRKQDV--EVFQLEKYNDF-QVAKATADDIEKRLSYAF--MLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYS 394 (532) Q Consensus 324 ----v~g~~~~~--~~~~~~~~~~~-~~~~~~i~~~~~rI~~af--~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~ 394 (532) +....+.+ +++......++ +.....++...+.|.+.- .-.++.. .+..++.--|..|++.....|...+. T Consensus 373 ~~~~~~~~~g~~~~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~-~~n~~SG~ai~~rq~qg~~~~~~~~D 451 (725) T protein:vir:77 373 LLNRTDENSGDLPTQPLAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAV-NGGQVAFDTVNQLNMRADLETYVFQD 451 (725) T ss_pred cccccccCCCcccccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCC-CchhhHHHHHHHHHHHHHHHHHHHHH Confidence 11112211 11111112233 344455556666665542 2222322 23235667788888888888888777 Q ss_pred HHHH------HHHHHHHHHHH------HHHHhcCCC-----CCCc-----c------cc---cccee-ecchHHHHHHHH Q lcl|NC_015159. 395 LLSQ------ELQLPLVKILL------KELQATSKI-----PNLP-----K------EA---VEPAI-ATGLEALGRGHD 442 (532) Q Consensus 395 rl~~------E~l~Pli~r~~------~il~r~g~l-----p~~p-----~------~~---~~~~~-v~~l~~l~raq~ 442 (532) +|.. +.+.-||...+ .|+...|.. .... + ++ ..+.+ +.+-.+-.|.+. T Consensus 452 nl~~~~~~~g~~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~ 531 (725) T protein:vir:77 452 NLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQN 531 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHH Confidence 7544 33334443322 222222211 0000 0 00 11222 222334556777 Q ss_pred HHHHHHHHHHHHhhcchhh-------hhcCH---HHHHHHHHHhcCCCHhHccC--CHHHHHHHHHHHHHHHH--HHHHH Q lcl|NC_015159. 443 LNKLNVFIDYMIKLAGLQD-------DDINL---LDVKMRLANSLGMDTTGLIL--TQQDKQAKMAEASTAAG--MVTAG 508 (532) Q Consensus 443 ~~~l~~~~~~laq~~p~~~-------d~id~---d~~~~~~a~~~Gv~p~~i~~--s~ee~~~~~~q~~~~~~--~~~~~ 508 (532) ++.++.+++.+....|... +..|. +++++.+..... +..... ++++.++..++++.+++ ..... T Consensus 532 ~~~l~qll~~~~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~--~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~ 609 (725) T protein:vir:77 532 RAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLI--QMGVKKPETPEEQQWLVEAQQAKQGQQDPAMV 609 (725) T ss_pred HHHHHHHHHhccccchhHHHHHHHhhccccchHHHHHHHHHHhhhh--hhhccCCCChhhHHHHHHHHHHHHHhHHHHHH Confidence 7777777766654433321 22343 444544444322 322222 22222111111111000 00000 Q ss_pred Hhhh-HHHH-----HHHHhh--------c---------ccccCCCCC Q lcl|NC_015159. 509 QQMG-AAGG-----QAAAAM--------M---------QQQAGLPTQ 532 (532) Q Consensus 509 ~~~~-~~~~-----~~~~~~--------~---------~~~~g~~~~ 532 (532) +++. ...+ .+.+.. . .+.+.+.+| T Consensus 610 q~q~~~~~~qa~~~kaq~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q 656 (725) T protein:vir:77 610 QAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNN 656 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 0000 000000 0 000000011 No 42 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.46 E-value=1.1e-11 Score=80.63 Aligned_cols=501 Identities=10% Similarity=0.056 Sum_probs=228.9 Q ss_pred CCCCCCC-cc------CHHHHHHHHHHHHHHhhhHHHHHH----HHHHhhcccccCCCCCcc------ccccccc-ccch Q lcl|NC_015159. 1 MAEVEKT-GF------AADGAAAAYNRLKNDRGAYETRAE----DCATYTIPSVFPSATADG------STSYTTP-WQSI 62 (532) Q Consensus 1 m~~~~~~-~~------~~~~~~~r~~~lk~~R~~~e~~w~----e~~~~~~P~~~~~~~~~~------~~~~~~~-~dst 62 (532) |++...+ +. .++.....|..+..++. +.+.|+ +-.+|.. + ..-... .+..+.+ |+=+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~r~~a~~d~~fy~---G-~Qw~~~~~~~l~~~g~p~~~~N~i 75 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPRFSQRQLLSLCSDID-SQPLWRDAANKACAYYD---G-DQLAPEVIQVLKDRGQPMTIHNLI 75 (714) T ss_pred CCcCcCcccCCCcchhhhhhhHHHHHHHHHHHh-hhHHHHHHHHHHHHhhc---C-CCCCHHHHHHHHhcCCCcEEeccH Confidence 7775322 22 12223344555555543 345565 4444442 1 111110 0111112 2322 Q ss_pred HHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhh Q lcl|NC_015159. 63 GARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVA 142 (532) Q Consensus 63 ~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~ 142 (532) + -.++...+ .-- .+++=+++.+.+..... .++.+ .++..+......++...+...++.+.+.. T Consensus 76 ~-~~v~~v~g----~~~-~nr~~~~v~pr~~~~~~--------~~~Ae---~l~~~~~~~~~~~~~~~~~s~af~~~~~~ 138 (714) T protein:vir:10 76 A-PTVDGVLG----MEA-KTRTDLIVMSDDPNDET--------EKLAE---AINAEFADACRLGNMNKARSDAYAEQIKA 138 (714) T ss_pred H-HHHHHHHH----HHH-hCCcceEEecCCCChhh--------HHHHH---HHHHHHHHHHHhhchhHHHHHHHHHhhhc Confidence 2 22332222 222 24555566664322110 11222 33445556667888888999999999988 Q ss_pred Cceee--eecccccccCCcceEEEEecceEEEeeCCCC---C-eEEEEEEEeecHHHhhHHHHH---HHHhhc------- Q lcl|NC_015159. 143 GNVLL--YIPSTEQVEGQSNAPKLYKLHNFVVERDAYD---N-VLQIVTEDKIARAALPEDVRK---SLEEAQ------- 206 (532) Q Consensus 143 G~~~~--~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G---~-vd~i~rk~~~~~~~l~~~~~~---~~~~~~------- 206 (532) |-|++ +++.+ ..++.++++.+|..++++..++.- . -.-++++.+++.+++...+.. .+.... T Consensus 139 G~G~~~~~~d~d--~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~ 216 (714) T protein:vir:10 139 GLSWVEVRRNSE--PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFV 216 (714) T ss_pred ccceEEeeeccC--CCCCCeEEEecChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcc Confidence 98877 55543 446778999999999999765432 1 223678888998876433321 111000 Q ss_pred ---------------------cc-------CCCcceEEEEEEEEeeCC---------C---------------------- Q lcl|NC_015159. 207 ---------------------GD-------QNPSEEVTIYTHVYRDPE---------A---------------------- 227 (532) Q Consensus 207 ---------------------~~-------~~~~~~v~i~~~v~~~~~---------~---------------------- 227 (532) .. .....+|.|+.+.++... | T Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~ 296 (714) T protein:vir:10 217 DTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQ 296 (714) T ss_pred cchhhhhhcccccccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccce Confidence 00 011245777766433110 0 Q ss_pred ------CeEEEEEEEcCcccccccccCccccCceEEEEeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015159. 228 ------MVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIK--MPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSK 299 (532) Q Consensus 228 ------~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~--~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~ 299 (532) +....+++.....+....+.|++..|||++.-... ..|..| |.+..+.+-.+.+|+..-..+.+ +..+ T Consensus 297 ~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~ 372 (714) T protein:vir:10 297 VKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAK 372 (714) T ss_pred ecccceeeEEEEEEecchhhhcCCCCCCCCceeeEEecceeeeccCccc--eehhhhhhHHHHHHHHHHHHHHH--HhCC Confidence 11112222222222233345677789997653333 455555 67888999999999765554442 3444 Q ss_pred CceeecCccccCh-hhhc--cCCCceeecCcc---cc---ccccccCCccc-hhHHHHHHHHHHHHHHHHH--hhhhccc Q lcl|NC_015159. 300 VLFFVNPNGVTQI-RRVA--KANTGDFVAGRK---QD---VEVFQLEKYND-FQVAKATADDIEKRLSYAF--MLNSAVQ 367 (532) Q Consensus 300 p~~lv~~~g~~~~-~~~~--~~~~G~~v~g~~---~~---~~~~~~~~~~~-~~~~~~~i~~~~~rI~~af--~~~~~~~ 367 (532) -+ ++.++++... +.+. .+.||.++.-++ ++ ...+....... .+.....++...+.|++.- +-.++. T Consensus 373 ~~-~~~~gav~~~d~~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG- 450 (714) T protein:vir:10 373 RV-IMDEDATQLSDNDLMEQLERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLG- 450 (714) T ss_pred ce-eeccccccccHHHHHHhccCCCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcC- Confidence 44 4445555442 2232 235555553221 11 11122222222 2344455555555555542 111222 Q ss_pred CCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-------------CCCC--C-Cc--------- Q lcl|NC_015159. 368 RGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQAT-------------SKIP--N-LP--------- 422 (532) Q Consensus 368 ~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~-------------g~lp--~-~p--------- 422 (532) ..+...+..-|..|++.....|+..+.+|.. ...=+.+.+++++.+. +... . ++ T Consensus 451 ~~~na~SGvAI~~r~~qg~~~l~~~~dnl~~-~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~ 529 (714) T protein:vir:10 451 QDSGATSGVAISNLVEQGATTLAEINDNYQF-ACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGE 529 (714) T ss_pred CCcchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCcc Confidence 2333456677999999999999999988776 3333344444444331 1000 0 00 Q ss_pred -cccc---cceee---cchHHHHHHHHHHHHHHHHHHHH----hhc-chhhhh---cCHHHHHHHHHHhcCCCHh-HccC Q lcl|NC_015159. 423 -KEAV---EPAIA---TGLEALGRGHDLNKLNVFIDYMI----KLA-GLQDDD---INLLDVKMRLANSLGMDTT-GLIL 486 (532) Q Consensus 423 -~~~~---~~~~v---~~l~~l~raq~~~~l~~~~~~la----q~~-p~~~d~---id~d~~~~~~a~~~Gv~p~-~i~~ 486 (532) ..++ +..++ .+-.+-.|.+.++.++++++.+. ++. +..++. -+.+++++.+.+.+|.+.. .-.. T Consensus 530 ~~nDi~~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~p~~~ei~~~ir~~~~~~~~~~~~~ 609 (714) T protein:vir:10 530 LTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMT 609 (714) T ss_pred ccccceeeeEEEEEeeccCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCcCHHHHHHHHHHHcCCCCCccccC Confidence 0001 11221 22334556666667666665431 111 112233 3567899999999886321 1221 Q ss_pred CHH-HHHHHHHHHHHHHH-----------------HHHHHHhh--------------------hHHHHHHHHhhcccccC Q lcl|NC_015159. 487 TQQ-DKQAKMAEASTAAG-----------------MVTAGQQM--------------------GAAGGQAAAAMMQQQAG 528 (532) Q Consensus 487 s~e-e~~~~~~q~~~~~~-----------------~~~~~~~~--------------------~~~~~~~~~~~~~~~~g 528 (532) .++ +.++.+++.+++++ ++++.+++ .+..++.++.+++..++ T Consensus 610 ~e~q~~q~~~~~~~~~q~~l~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~q~~~~~~~~a~~a~~l~~~~~ 689 (714) T protein:vir:10 610 PEEQEVAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQN 689 (714) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 111 11111111000000 00000000 00001111112222222 Q ss_pred CCCC Q lcl|NC_015159. 529 LPTQ 532 (532) Q Consensus 529 ~~~~ 532 (532) +.-+ T Consensus 690 ~~q~ 693 (714) T protein:vir:10 690 MEQE 693 (714) T ss_pred hhhh Confidence 2222 No 43 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=99.44 E-value=4.9e-12 Score=82.60 Aligned_cols=500 Identities=9% Similarity=0.029 Sum_probs=223.9 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCc---cccc-ccc-cccchHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATAD---GSTS-YTT-PWQSIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~---~~~~-~~~-~~dst~~~a~~~Laa~l~ 75 (532) ||..+. .-+++..||..-.+.-..|.....+=.+|.. + ..-.. ...+ ..+ .|+-++ -.++.+ . T Consensus 1 m~d~~~---~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~---G-~QW~~~~~~~l~~q~rp~~N~i~-~~v~~v----~ 68 (725) T protein:vir:10 1 MADNEN---RLESILSRFDADWTASDEARREAKNDLFFSR---V-SQWDDWLSQYTTLQYRGQFDVVR-PVVRKL----V 68 (725) T ss_pred CCchHH---HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhc---C-CCCCHHHHHHHHhcCCCcccchH-HHHHHH----H Confidence 998743 3556777776666655555555555555553 1 11110 0000 111 233333 222222 2 Q ss_pred HhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeee-----c Q lcl|NC_015159. 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYI-----P 150 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v-----~ 150 (532) +.-- .+++=+++.+.++... ++.+.|+. .+......+++.-+-..++.+.+..|.|++=| + T Consensus 69 g~e~-~nr~d~~v~p~~~~d~----------~~Ae~l~~---~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~ 134 (725) T protein:vir:10 69 SEMR-QNPIDVLYRPKDGASP----------DAADVLMG---MYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYED 134 (725) T ss_pred hhHH-hCCcceEEecCCcchH----------HHHHHHHH---HHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccC Confidence 2211 2555556665542211 23333332 33344557888888999999999999997532 2 Q ss_pred ccccccCCcceEEEE----ecceEEEeeCC---CCC-eEEEEEEEeecHHHhhHHHHHHHHhh----c-----cc---C- Q lcl|NC_015159. 151 STEQVEGQSNAPKLY----KLHNFVVERDA---YDN-VLQIVTEDKIARAALPEDVRKSLEEA----Q-----GD---Q- 209 (532) Q Consensus 151 ~~~~~~~~~~~~~~~----pl~~~~v~~d~---~G~-vd~i~rk~~~~~~~l~~~~~~~~~~~----~-----~~---~- 209 (532) ++.. +..+..+.+ |..++++..++ ++. -.-+||..+|+.+.+. ++...+... . .. . T Consensus 135 ~d~~--~~~~~i~~~~i~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~-~~~~~~~~~a~~~~~~~~~~~~~~~~ 211 (725) T protein:vir:10 135 QSPT--SNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWD-DFAEKYDLDADNIPSFQNPNDWVFPW 211 (725) T ss_pred CCCC--CCceeeeeeecccCHhHcccCchhhccChhhhhhhhhhccCCHHHHH-HHHHhCCCcccccccccccccccccc Confidence 2211 223333444 34556665433 221 2235677788865442 232222110 0 00 0 Q ss_pred CCcceEEEEEEEEee-----------CCC-------------------------------CeEEEEEE-EcCcccccccc Q lcl|NC_015159. 210 NPSEEVTIYTHVYRD-----------PEA-------------------------------MVFRSYQE-IDGEIVAGTEG 246 (532) Q Consensus 210 ~~~~~v~i~~~v~~~-----------~~~-------------------------------~~~~s~~~-~~~~~~~~~~~ 246 (532) ...+.|.|+.+.++. +.+ |.++.+.+ ..|..+..... T Consensus 212 ~~~~~vrv~E~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~ 291 (725) T protein:vir:10 212 LTQDTIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQ 291 (725) T ss_pred cCCCeEEEEEEEEEEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCC Confidence 012334444333221 100 11122222 23443322223 Q ss_pred cCccccCceEEEEee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCCce-e Q lcl|NC_015159. 247 EYPLDSCPWIPVRLI--KMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGD-F 323 (532) Q Consensus 247 ~~g~~~~P~~~~Rw~--~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~G~-~ 323 (532) .++.+.|||++.-.. ..+|..|+.|.+....+-.+.+|+.....+..+..+.+.++++..+.+-..+.....+.+. + T Consensus 292 ~~~~~~fP~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~ 371 (725) T protein:vir:10 292 LIAGEHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPY 371 (725) T ss_pred CCCCCceeEEEEEeeeeccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCcee Confidence 556677999865322 3689999999999999999999999999999998999999898766553333322222222 1 Q ss_pred ec-----Ccccccc--cc-ccCCccchhHHHHHHHHHHHHHHHHH--hhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhH Q lcl|NC_015159. 324 VA-----GRKQDVE--VF-QLEKYNDFQVAKATADDIEKRLSYAF--MLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVY 393 (532) Q Consensus 324 v~-----g~~~~~~--~~-~~~~~~~~~~~~~~i~~~~~rI~~af--~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~ 393 (532) +. ...+.+. ++ ......--+.....++...+.|.+.- ....+. ..+..++.--|..|++.....|...+ T Consensus 372 ~~~~~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG-~~~n~~SG~ai~~rq~qg~~~l~~~~ 450 (725) T protein:vir:10 372 YLLNRTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEA-VNGGQVAYDTVNQLNMRADLETYVFQ 450 (725) T ss_pred eecccccccCcccccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhC-cCchhhHHHHHHHHHHHHHHHHHHHH Confidence 11 1111111 11 11221222355566666677776653 222232 23334566778899999998998888 Q ss_pred HHHHHH------HHHHHHHHHH------HHHHhcCC-----CC-CCcc----------cc---cccee-ecchHHHHHHH Q lcl|NC_015159. 394 SLLSQE------LQLPLVKILL------KELQATSK-----IP-NLPK----------EA---VEPAI-ATGLEALGRGH 441 (532) Q Consensus 394 ~rl~~E------~l~Pli~r~~------~il~r~g~-----lp-~~p~----------~~---~~~~~-v~~l~~l~raq 441 (532) .+|..- .+.-||...+ .|+...|. |. +.++ ++ +.+.+ +.+-.+-.|.+ T Consensus 451 Dnl~~~~~~~g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~ 530 (725) T protein:vir:10 451 DNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQ 530 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHH Confidence 777652 3333333322 12222221 11 1110 01 11222 12223444666 Q ss_pred HHHHHHHHHHHHHhhcchhh-------hhc---CHHHHHHHHHHhcCCCHhHcc--CCHHHHHHHHHHHHHHHHH--HHH Q lcl|NC_015159. 442 DLNKLNVFIDYMIKLAGLQD-------DDI---NLLDVKMRLANSLGMDTTGLI--LTQQDKQAKMAEASTAAGM--VTA 507 (532) Q Consensus 442 ~~~~l~~~~~~laq~~p~~~-------d~i---d~d~~~~~~a~~~Gv~p~~i~--~s~ee~~~~~~q~~~~~~~--~~~ 507 (532) .++.++.+++.+....|... +.. ..+++++.+....+ +.... .++++.++..+++++++++ ++. T Consensus 531 ~~~~l~qll~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~--~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~ 608 (725) T protein:vir:10 531 NRSEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLI--QMGVKKPETPEEQQWLVEAQQAKQGQQDPAM 608 (725) T ss_pred HHHHHHHHHHhccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhh--hhccCCccccchhHHHHHHHHHHHhhhHHHH Confidence 66666666665544333221 122 33455555544433 21111 1222222111111111000 000 Q ss_pred HHhh------hHHHHHHHHhhcc---cc-------------------cCCCCC Q lcl|NC_015159. 508 GQQM------GAAGGQAAAAMMQ---QQ-------------------AGLPTQ 532 (532) Q Consensus 508 ~~~~------~~~~~~~~~~~~~---~~-------------------~g~~~~ 532 (532) .++. .+...++-+.... .. +...++ T Consensus 609 ~q~~~~~~~~qae~~ka~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~q~~~~q 661 (725) T protein:vir:10 609 VQAQGVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSK 661 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHH Confidence 0000 0000000000000 00 000000 No 44 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=99.39 E-value=1e-11 Score=80.80 Aligned_cols=500 Identities=13% Similarity=0.109 Sum_probs=225.9 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhh-cccccCCCCCcc-------ccc---ccc-cccchHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYT-IPSVFPSATADG-------STS---YTT-PWQSIGARGLN 68 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~-~P~~~~~~~~~~-------~~~---~~~-~~dst~~~a~~ 68 (532) ||+.... -.+.+..||....+.-+.|...|++=.+|. .+. ..-... ... .+. .|+-++.. ++ T Consensus 1 ma~~~~~--~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G---~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~-i~ 74 (708) T protein:vir:17 1 MAETLEK--KHERIMLRFDRAYSPQQEVREKCIEATRFARVPG---GQWEGATAAGTKLDEQFEKYPKFEINKVATE-LN 74 (708) T ss_pred CchhHHH--HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCC---CCCCHHHHHHHHhhhhhcCCCceEEcchHHH-HH Confidence 9988532 144556666665555555666665554421 111 111100 000 111 13333322 22 Q ss_pred HHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeee Q lcl|NC_015159. 69 NLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLY 148 (532) Q Consensus 69 ~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~ 148 (532) ... +.-- .+++=+++.+.+..-. .++.+.| +..+......++...+...++.+.++.|.|++= T Consensus 75 ~v~----g~e~-~nr~d~~v~p~~~~~d---------~~~Ae~l---~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~ 137 (708) T protein:vir:17 75 RII----AEYR-NNRITVKFRPGDREAS---------EELANKL---NGLFRADYEETDGGEACDNAFDDAATGGFGCFR 137 (708) T ss_pred HHH----hhHh-hCCcceEEecCCCcch---------HHHHHHH---HHHHHHHHHhcCchhHHhHHHHHhhhcccceee Confidence 222 2211 2555556665532210 1223333 333445556888999999999999999999762 Q ss_pred e-----cccc-cccCCcceEEEE--ecceEEEeeCC---CCCeEE--EEEEEeecHHHhhHHHHHHHH-----hhcccCC Q lcl|NC_015159. 149 I-----PSTE-QVEGQSNAPKLY--KLHNFVVERDA---YDNVLQ--IVTEDKIARAALPEDVRKSLE-----EAQGDQN 210 (532) Q Consensus 149 v-----~~~~-~~~~~~~~~~~~--pl~~~~v~~d~---~G~vd~--i~rk~~~~~~~l~~~~~~~~~-----~~~~~~~ 210 (532) + .+++ .....++.++++ |..++++..++ ++ -|. +||..+++.+++-..+..... ....... T Consensus 138 ~~~d~~~e~d~~~~~~~i~i~~~~~~~~~v~~Dp~a~~~D~-sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~ 216 (708) T protein:vir:17 138 LTSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDK-SDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWE 216 (708) T ss_pred eeecccccCCCCCCccccceEeeccchhheecCccccccCh-hhhhhhhhhccCCHHHHHHhCccccchhhhhhhhcccc Confidence 2 2211 122334444443 56677776544 32 233 678889998876444432110 0000000 Q ss_pred ----CcceEEEEEEEE-----------eeCC----------------------C---------CeEEEE-EEEcCccccc Q lcl|NC_015159. 211 ----PSEEVTIYTHVY-----------RDPE----------------------A---------MVFRSY-QEIDGEIVAG 243 (532) Q Consensus 211 ----~~~~v~i~~~v~-----------~~~~----------------------~---------~~~~s~-~~~~~~~~~~ 243 (532) ..+.|-|+.+.+ +++. + +.++++ +...|..+.. T Consensus 217 ~~~~~~d~vrv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~ 296 (708) T protein:vir:17 217 YDWFDADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLE 296 (708) T ss_pred ccccCCCeEEEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeeccccccc Confidence 012332222211 1110 0 112222 2234554433 Q ss_pred ccccCccccCceEEE---EeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhh----- Q lcl|NC_015159. 244 TEGEYPLDSCPWIPV---RLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRV----- 315 (532) Q Consensus 244 ~~~~~g~~~~P~~~~---Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~----- 315 (532) ..+.++++.|||++. ||. .+|..+-.|.+..+.+-.+.+|+.....+..+.++.+-+++++.+.+.....- T Consensus 297 ~~~~~p~~~fP~vP~~g~r~~-~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~ 375 (708) T protein:vir:17 297 KPRRIPGEHIPLIPVYGKRWF-IDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARN 375 (708) T ss_pred CCCCCCCCccceEEEeccccc-ccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcc Confidence 445678888999865 343 36666556899999999999999999999999999888888877543211110 Q ss_pred -----------ccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHH--hhhhcccCCCCCCCHHHHHHHH Q lcl|NC_015159. 316 -----------AKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAF--MLNSAVQRGGDRVTAEEIRYVA 382 (532) Q Consensus 316 -----------~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af--~~~~~~~~~~~~~TAtEi~~r~ 382 (532) .....|.+++|..-. ...+ ...--+.....++...+.|.+.- .-.++. +.+ .++.--|..|. T Consensus 376 ~~~~~~~~~~~~~~~~g~v~~~a~~~-~~~~--~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G-~~s-n~SG~Ai~~rq 450 (708) T protein:vir:17 376 KKRPAFLPLREVRDKYGNIIAGATPA-GYTQ--PAVMNQALAALLQQTSADIQEVTGGSQAMQQ-MPS-NIAQETVNNLM 450 (708) T ss_pred cchhhhhhhhccCCcccccccccCCc-ccCC--CccccHHHHHHHHHHHHHHHHhcCCChHHcc-Ccc-chHHHHHHHHH Confidence 011223233222111 1111 11111223333444444444431 111222 222 35666788888 Q ss_pred HHHHHHhhhhHHHHH------HHHHHHHHHHHHH------HHHhcCC----------CCCCccc-----ccc---ceee- Q lcl|NC_015159. 383 GELEDTLGGVYSLLS------QELQLPLVKILLK------ELQATSK----------IPNLPKE-----AVE---PAIA- 431 (532) Q Consensus 383 ~E~~~~LGpv~~rl~------~E~l~Pli~r~~~------il~r~g~----------lp~~p~~-----~~~---~~~v- 431 (532) +.....++..+.++. -+.+.-||...+. |+...|. +++.++. ++. ..++ T Consensus 451 ~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v 530 (708) T protein:vir:17 451 NRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTV 530 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEE Confidence 888888888887766 5566666655442 2222221 1122221 111 1221 Q ss_pred -c-chHHHHHHHHHHHHHHHHHHHHhhcch-------hhh---hcCHHHHHHHHHHhcCCCHhHccC--CHHHHHHHHHH Q lcl|NC_015159. 432 -T-GLEALGRGHDLNKLNVFIDYMIKLAGL-------QDD---DINLLDVKMRLANSLGMDTTGLIL--TQQDKQAKMAE 497 (532) Q Consensus 432 -~-~l~~l~raq~~~~l~~~~~~laq~~p~-------~~d---~id~d~~~~~~a~~~Gv~p~~i~~--s~ee~~~~~~q 497 (532) + +-.+-.|.+..+.++.+++.+....|. +++ .-+.++++..+...++. ..... .+++.++..++ T Consensus 531 ~~~p~~~t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~--~~~~~~~~~e~~q~~~q~ 608 (708) T protein:vir:17 531 DVGPSYTARRDATVSVLTNVLSSMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLI--SGIAKPRNEKEQQIVQQA 608 (708) T ss_pred ecccCchhHHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhc--cccccCcchhhHHHHHHH Confidence 1 223455666677777776665432221 222 33457777777766553 11222 22222211111 Q ss_pred HHHHHH-----HHHHHHh-------hhHHHHHHHHh---hc----ccc--cCC----CCC Q lcl|NC_015159. 498 ASTAAG-----MVTAGQQ-------MGAAGGQAAAA---MM----QQQ--AGL----PTQ 532 (532) Q Consensus 498 ~~~~~~-----~~~~~~~-------~~~~~~~~~~~---~~----~~~--~g~----~~~ 532 (532) ++.+++ +.++..+ +..+..++... .+ ... ++. ++| T Consensus 609 qq~~q~q~~~~~~eaqa~~~~~qAe~~ka~aea~~~q~~a~q~~~~~~~a~~~a~q~~~q 668 (708) T protein:vir:17 609 QMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQ 668 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111000 0000000 00000000000 00 000 000 000 No 45 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=99.32 E-value=4.6e-11 Score=77.28 Aligned_cols=503 Identities=14% Similarity=0.106 Sum_probs=212.0 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhc-ccccCCCCCccc---c----cccccccchHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTI-PSVFPSATADGS---T----SYTTPWQSIGARGLNNLAS 72 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~-P~~~~~~~~~~~---~----~~~~~~dst~~~a~~~Laa 72 (532) ||+..+.- .+.+..||..-.+..+.|...+++=.+|.. +..-=++..... + ..+.+.-..-.-.++...+ T Consensus 1 m~e~~~~~--~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g 78 (706) T protein:vir:10 1 MAESRQKQ--HERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRIIS 78 (706) T ss_pred CCcchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHHhh Confidence 99944322 334555665555554555555555445542 211000000000 0 0112222222333333222 Q ss_pred HHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeee--- Q lcl|NC_015159. 73 KLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYI--- 149 (532) Q Consensus 73 ~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v--- 149 (532) ..-- +++=+++.+.+..- .. ++.+.| +..+......++...+...++.+.+..|.|++=+ T Consensus 79 ----~~~~-nr~~~~v~P~~~~~------d~---~~Ae~l---~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d 141 (706) T protein:vir:10 79 ----EYRN-NRISVKFRPGDNAA------SE---ELANKL---NGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTS 141 (706) T ss_pred ----HHHh-CCCceEEecCCCCc------hH---HHHHHH---HHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeec Confidence 2222 44445555532110 11 122222 3344445568899999999999999999997533 Q ss_pred --ccccc-ccCCcceEEE--EecceEEEeeC---CCCC-eEEEEEEEeecHHHhhHHHHHH---HHhhcc--------cC Q lcl|NC_015159. 150 --PSTEQ-VEGQSNAPKL--YKLHNFVVERD---AYDN-VLQIVTEDKIARAALPEDVRKS---LEEAQG--------DQ 209 (532) Q Consensus 150 --~~~~~-~~~~~~~~~~--~pl~~~~v~~d---~~G~-vd~i~rk~~~~~~~l~~~~~~~---~~~~~~--------~~ 209 (532) .+.+. .....+.++. .|+.++++..+ .++. ..-+||..+|+.+++-..+... +..... .. T Consensus 142 ~~~~~d~~~~~~~i~i~~v~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~ 221 (706) T protein:vir:10 142 FVNEYDPMDERQRIAVEPIYDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTP 221 (706) T ss_pred cccccCCCCCCccceeeeeccchhceecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCC Confidence 11110 1112333332 35677777643 3333 2247788899988864433321 111000 00 Q ss_pred ------CCc----ceEEEEEEEEeeCC-------------------------------CCeEEEEE-EEcCccccccccc Q lcl|NC_015159. 210 ------NPS----EEVTIYTHVYRDPE-------------------------------AMVFRSYQ-EIDGEIVAGTEGE 247 (532) Q Consensus 210 ------~~~----~~v~i~~~v~~~~~-------------------------------~~~~~s~~-~~~~~~~~~~~~~ 247 (532) .++ ..++++ .+.+.- .+.++.+. ...|..+....+. T Consensus 222 d~~~~~eyy~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p 299 (706) T protein:vir:10 222 DVVYIAKYYEVRKESVDVI--SYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRR 299 (706) T ss_pred CcceecccccccceeEEEE--EeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCC Confidence 000 111111 111110 01122221 2234433333356 Q ss_pred CccccCceEEEEeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhc--------- Q lcl|NC_015159. 248 YPLDSCPWIPVRLIK--MPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVA--------- 316 (532) Q Consensus 248 ~g~~~~P~~~~Rw~~--~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~--------- 316 (532) |+.+.|||++.-... .++..+..|.+....+-.+.+|+.....+......-+-+..+..+.+-....-. T Consensus 300 ~~~~~~P~vP~~g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 379 (706) T protein:vir:10 300 IPGEHIPLIPVYGKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGLEQHWEGRNRKRPA 379 (706) T ss_pred CCCCccceEEEeeccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHHHHHhhhccccccc Confidence 777889998653222 356666778888999999999988777777665555544444433221111000 Q ss_pred -------cCCCceeecCccccccccccCCccch-hHHHHHHHHHHHHHHHH--HhhhhcccCCCCCCCHHHHHHHHHHHH Q lcl|NC_015159. 317 -------KANTGDFVAGRKQDVEVFQLEKYNDF-QVAKATADDIEKRLSYA--FMLNSAVQRGGDRVTAEEIRYVAGELE 386 (532) Q Consensus 317 -------~~~~G~~v~g~~~~~~~~~~~~~~~~-~~~~~~i~~~~~rI~~a--f~~~~~~~~~~~~~TAtEi~~r~~E~~ 386 (532) ...+|.+++... ..... . ...+ +.....++.-.+.|.+. .+-.++.+.+ .++.--|..|.+... T Consensus 380 ~l~~~~~~~~~g~i~~~~~-~~~~~--~-~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~s--n~SG~Ai~~rq~qg~ 453 (706) T protein:vir:10 380 FLPLRTVTDKTGNVVAPAN-VAGYT--Q-APVLNQALAALLQQTSADIQEVTGSSQAMQQMPS--NVARETVNSLLNRSD 453 (706) T ss_pred chhcccccCCCCccccccc-ccccC--C-CcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCcc--chHHHHHHHHHHHHH Confidence 011233322111 00111 1 1112 22334444555555544 2222233222 257778999999999 Q ss_pred HHhhhhHHHHHH------HHHHHHHHH------HHHHHHhcCCC--CCC------ccc-------cc---cceee---cc Q lcl|NC_015159. 387 DTLGGVYSLLSQ------ELQLPLVKI------LLKELQATSKI--PNL------PKE-------AV---EPAIA---TG 433 (532) Q Consensus 387 ~~LGpv~~rl~~------E~l~Pli~r------~~~il~r~g~l--p~~------p~~-------~~---~~~~v---~~ 433 (532) ..+...+..|.. +.+.-||.. +|.|....|.. +.+ |.. ++ +..++ .+ T Consensus 454 ~~~~~~~Dnl~~~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p 533 (706) T protein:vir:10 454 MASFIYLDNMAKSLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGP 533 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEeccc Confidence 999888865543 344444432 22233222211 000 000 11 11221 22 Q ss_pred hHHHHHHHHHHHHHHHHHHHHhhc-------chhhhhcC---HHHHHHHHHHhcCCCHhHccCCHHHH-HHHHH---HHH Q lcl|NC_015159. 434 LEALGRGHDLNKLNVFIDYMIKLA-------GLQDDDIN---LLDVKMRLANSLGMDTTGLILTQQDK-QAKMA---EAS 499 (532) Q Consensus 434 l~~l~raq~~~~l~~~~~~laq~~-------p~~~d~id---~d~~~~~~a~~~Gv~p~~i~~s~ee~-~~~~~---q~~ 499 (532) -.+-.|.+.++.++.+++.+.... +.+++..| .++++..+-..++ +....+..++. +++.+ |.+ T Consensus 534 ~~~t~r~~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~--~q~~~~~~~~~eq~~~~q~qq~q 611 (706) T protein:vir:10 534 SYSARRDATVNALTQLLQGMLPQDPMRPALMGIIIDNMEGEGLDDFKAFNRRQLL--TQGIVKPRNQQEQAIVQQAQQAQ 611 (706) T ss_pred CcchHHHHHHHHHHHHHHhcCCcchhhHHHHHHHHhhcCccchHHHHHHHHHhhc--ccCCccccchhHHHHHHHHHHHH Confidence 344557777777777777543222 22333443 3455666555444 22223222111 11111 111 Q ss_pred HHHHHH-----HHHHhhh------------HHHHHHHHhh---cccccCCCCC Q lcl|NC_015159. 500 TAAGMV-----TAGQQMG------------AAGGQAAAAM---MQQQAGLPTQ 532 (532) Q Consensus 500 ~~~~~~-----~~~~~~~------------~~~~~~~~~~---~~~~~g~~~~ 532 (532) ++++.. ++..... .....+..+. +.++.-.+.+ T Consensus 612 ~~q~~~~~~~~~aq~~~~qA~~~k~~a~~~q~~~~a~~a~~qa~~~~~~~~~~ 664 (706) T protein:vir:10 612 ATQPDPNMLLAQAQMVVAQAEAQKSQNETVQTQIKAFTAQQDAMESQANTVYK 664 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 100000 0000000 0000000000 0000000011 No 46 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.25 E-value=3.3e-10 Score=72.53 Aligned_cols=491 Identities=11% Similarity=0.034 Sum_probs=218.7 Q ss_pred CCCCCC---CccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcc------cccccccccchHHHHHHHHH Q lcl|NC_015159. 1 MAEVEK---TGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADG------STSYTTPWQSIGARGLNNLA 71 (532) Q Consensus 1 m~~~~~---~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~------~~~~~~~~dst~~~a~~~La 71 (532) |..... ..+..+ +..+|..-......|.....+-.+|.. + ..-... .+..+.+.-..-.-.++... T Consensus 11 ~~~~~~~~~~~~~~~-~~~~~~~~~~~q~~~r~~a~~d~~fy~---G-~QW~~~~~~~l~~~g~p~~~~N~i~~~v~~v~ 85 (772) T protein:vir:10 11 LNGLPPAGDTPLTVD-EYADINYEIEDQPAWRAVADKEMDYAD---G-NQLDTELLRRQQALGIPPAVEDLIGPALLSLQ 85 (772) T ss_pred hccCCcccccccCHH-HHHHHHHHHhccHHHHHHHHHHHHhhc---C-CCCCHHHHHHHHhcCCCcEEEcchHHHHHHHH Confidence 332211 111222 233444433333345544445555553 1 111111 01111222222222333332 Q ss_pred HHHHHhhcCCCCCccccCCChH-HHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeec Q lcl|NC_015159. 72 SKLMLALFPVGSSFFKLNVSEL-EVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIP 150 (532) Q Consensus 72 a~l~~~ltpp~~~WF~l~~~d~-~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~ 150 (532) +. -- .+++=+++.+.+. .. .++.+.| +..+......+++..+...++.+.+..|.|++-+. T Consensus 86 g~----~~-~nr~d~~v~Pr~~~~d----------~~~Ae~l---~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~ 147 (772) T protein:vir:10 86 GY----EA-VTRTDWRVTPNGDVGG----------QEVADAL---NYRLNTAERQSGADRACSEAFRPQIACGIGWVEVS 147 (772) T ss_pred HH----HH-hcCcceEEecCCCchH----------HHHHHHH---HHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEec Confidence 22 22 2555566665421 11 1233333 33444555688899999999999998898866444 Q ss_pred ccccccCCcceEEEEecceEEEeeCCCCCeEE---EEEEEeecHHHhhHHHHH---HHHhh------------------- Q lcl|NC_015159. 151 STEQVEGQSNAPKLYKLHNFVVERDAYDNVLQ---IVTEDKIARAALPEDVRK---SLEEA------------------- 205 (532) Q Consensus 151 ~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~---i~rk~~~~~~~l~~~~~~---~~~~~------------------- 205 (532) -+....+..++++.++..++++..+......+ +||..+|+.+++-..+.. .+... T Consensus 148 ~~~d~~~~~i~i~~v~p~~v~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (772) T protein:vir:10 148 RESDPFKFPYRCRPIRRDEIHWDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGG 227 (772) T ss_pred cccCCCCCCeEEEeeCcccceecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccccccccc Confidence 33334455688999999999998766544334 678888998875322221 11100 Q ss_pred --------------------cccCCCcceEEEEEEEEeeCC---------C----------------------------C Q lcl|NC_015159. 206 --------------------QGDQNPSEEVTIYTHVYRDPE---------A----------------------------M 228 (532) Q Consensus 206 --------------------~~~~~~~~~v~i~~~v~~~~~---------~----------------------------~ 228 (532) .......++|.|+++.++... + + T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~ 307 (772) T protein:vir:10 228 TSTGLHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVS 307 (772) T ss_pred cccccccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeee Confidence 000111367888876544321 1 1 Q ss_pred eEEEEEEEcCcccccccccCccccCceEEEEee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecC Q lcl|NC_015159. 229 VFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLI--KMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNP 306 (532) Q Consensus 229 ~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~--~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~ 306 (532) ....+++.....+....+.|++..|||++.-.. ...|..| |.+....+-.+.+|+..-..+.... ... .+. + T Consensus 308 rv~~~~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~--G~vr~~kd~Qr~~N~~~S~~~~~l~--~~~-~~~-~ 381 (772) T protein:vir:10 308 RVRRSYWLGPHCLHDGPTPYTHRHFPYVPFFGFREDATGIPY--GYVRGMKYAQDSLNSGVSKLRWGMS--VAR-VER-T 381 (772) T ss_pred EEEEEEEecceeeccCCCCCCCCccceEEEeeeEeccCCccc--chhhhhhhHHHHHHHHHHHHHHHHh--ccc-ccc-c Confidence 112233443333333345677788999865333 3566666 6888899999999986555544322 222 233 4 Q ss_pred ccccChhh--h--ccCCCceeecCccc---ccc-ccccCCccch-hHHHHHHHHHHHHHHHH--HhhhhcccCCCCCCCH Q lcl|NC_015159. 307 NGVTQIRR--V--AKANTGDFVAGRKQ---DVE-VFQLEKYNDF-QVAKATADDIEKRLSYA--FMLNSAVQRGGDRVTA 375 (532) Q Consensus 307 ~g~~~~~~--~--~~~~~G~~v~g~~~---~~~-~~~~~~~~~~-~~~~~~i~~~~~rI~~a--f~~~~~~~~~~~~~TA 375 (532) .|.++..+ + ..+.++.++.-+++ +.. .+.......+ ......++...+.|.+. .+-.++. ..+...+. T Consensus 382 ~gav~~~d~~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG-~~~na~SG 460 (772) T protein:vir:10 382 KGAVAMTDAQFRRQIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQG-RKGTATSG 460 (772) T ss_pred CCCccchhHHHHHhccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcC-CCcchhhH Confidence 44444321 1 23455555442222 111 1112222222 23344444554555553 1222222 23334677 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHh-------------cCC-------CCCCc-----cc-----c Q lcl|NC_015159. 376 EEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQA-------------TSK-------IPNLP-----KE-----A 425 (532) Q Consensus 376 tEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-------------~g~-------lp~~p-----~~-----~ 425 (532) .-|..|.+...+.|+..+.+|..-... +.+.+++++.+ .+. |.... ++ + T Consensus 461 vAi~~rq~qg~~~l~~~~Dnl~~~~~~-~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~ND 539 (772) T protein:vir:10 461 IQEQQQIEQSNQSIGRIMDNFRAGRTL-VGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSND 539 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceecc Confidence 789999999999999999877653322 22333333322 111 00000 00 1 Q ss_pred c---cceeec---chHHHHHHHHHHHHHHHHHHHHhhcchh--------hhh---cCHHHHHHHHHHhcCCCHhHccCCH Q lcl|NC_015159. 426 V---EPAIAT---GLEALGRGHDLNKLNVFIDYMIKLAGLQ--------DDD---INLLDVKMRLANSLGMDTTGLILTQ 488 (532) Q Consensus 426 ~---~~~~v~---~l~~l~raq~~~~l~~~~~~laq~~p~~--------~d~---id~d~~~~~~a~~~Gv~p~~i~~s~ 488 (532) + +..++. +-.+-.|.+.++.+++++ +.+.|.. ++. -+.+++++.+-+..+- .++ T Consensus 540 i~~g~yDv~i~~~p~~~t~r~~~~~~m~ql~---~~~~P~~~~~~~~~~le~~D~p~~~ei~~~ir~~~~~------~~p 610 (772) T protein:vir:10 540 LLRTRIKVALEDVPSTNSYRGQQLNAMSEAV---KSMPPQYQAAVLPFLVSLMDVPFKRDVVEAIRAVDQQ------QTP 610 (772) T ss_pred ceeeeEEEEeeccccchHHHHHHHHHHHHHH---hccChhHHHHHHHHHHhhcCCCChHHHHHHHHHHhcc------CCh Confidence 1 112221 222334455555555544 3444432 222 2445777777666553 122 Q ss_pred HHHHHHHHH-HHHHHHHHH------------------HHHhhhHHHHH---HHHhhcccccCCCCC Q lcl|NC_015159. 489 QDKQAKMAE-ASTAAGMVT------------------AGQQMGAAGGQ---AAAAMMQQQAGLPTQ 532 (532) Q Consensus 489 ee~~~~~~q-~~~~~~~~~------------------~~~~~~~~~~~---~~~~~~~~~~g~~~~ 532 (532) ++.++..++ .+++++.++ +...+..+.+. +....+ +-+.+++| T Consensus 611 eq~~~~~~q~~qq~~~~~~~el~~~q~~a~~~~~~A~a~~~~aqa~~~~~~a~~~a~-~aa~~~~q 675 (772) T protein:vir:10 611 EQIQQQIDQAVQDALAKAGNDIKLRELEIKERKADSEISGLNAKAVQIGVQAAFSAM-QAGAQIAQ 675 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hhhhhHHh Confidence 222222111 111000000 00000000000 000000 00111111 No 47 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.23 E-value=4e-10 Score=72.12 Aligned_cols=448 Identities=10% Similarity=0.078 Sum_probs=180.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCC---CCCcccccccccccchHHHHHHHHHHHHHHh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPS---ATADGSTSYTTPWQSIGARGLNNLASKLMLA 77 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~---~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 77 (532) |.+++-+..-.+.|.+.| .. ..++.+.+.+|..=..-.. ...+...+..+...+-+..++++++..| T Consensus 8 ~~e~~~~~~~~~~l~~~~---~~----~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l--- 77 (486) T protein:vir:42 8 MEEIEDPAVVREEMISAF---ED----ASKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQ--- 77 (486) T ss_pred CCCcccHHHHHHHHHHHH---HH----HHHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHHhhh--- Confidence 666653222222333333 22 2344445555543221100 0111111122334455666666666654 Q ss_pred hcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccc--- Q lcl|NC_015159. 78 LFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQ--- 154 (532) Q Consensus 78 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~--- 154 (532) +|.+ |++ ++.+. .. ..+.+.+..++|.....++.++..+||.+.++|..++. T Consensus 78 -~~~g---~~~--~~~~~------------~~-------~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~ 132 (486) T protein:vir:42 78 -AVEG---FRL--GDADE------------AD-------EELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLD 132 (486) T ss_pred -cccc---eec--CCCch------------hH-------HHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccc Confidence 3333 222 21110 00 11234556788999999999999999999888764431 Q ss_pred --ccCCcceEEEEecceEEEeeC-CCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEE Q lcl|NC_015159. 155 --VEGQSNAPKLYKLHNFVVERD-AYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFR 231 (532) Q Consensus 155 --~~~~~~~~~~~pl~~~~v~~d-~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~ 231 (532) ...+..++++++..+.++..| ..+++...+|.+.-. +.+....+++| . ++. -|. T Consensus 133 ~~~~~~~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~y----~-~~~-~~~ 189 (486) T protein:vir:42 133 LGWDQNVPIIRVEPPTRMHAEIDPRINRVSKAIRVAYDK-----------------EGNEIQAATLY----T-PME-TIG 189 (486) T ss_pred cccCCCeeEEEEecccceEEEEeCCCCCeEEEEEEEEec-----------------CCCeEEEEEEE----c-CCc-EEE Confidence 123334677777766554444 567777666554300 01111122222 1 111 011 Q ss_pred EEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeec---Cc Q lcl|NC_015159. 232 SYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEE-YLGDLKSLENLYEAIVKMSMISSKVLFFVN---PN 307 (532) Q Consensus 232 s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~-al~d~~~L~~l~~~~l~~~~~a~~p~~lv~---~~ 307 (532) +...+|.........++|..+|++.++.+...+..+|+|=.+. ..+-+..++...-.....++..+.|...+. ++ T Consensus 190 -~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~~ 268 (486) T protein:vir:42 190 -WFRADGEWAEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPE 268 (486) T ss_pred -EEecCCcEEeecceecCCCCceEEEeccccccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCcc Confidence 1111222222222346778999999999888999999997664 346567777776677777777777764442 11 Q ss_pred ccc----ChhhhccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhh-----cccCCCCCCCHHHH Q lcl|NC_015159. 308 GVT----QIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNS-----AVQRGGDRVTAEEI 378 (532) Q Consensus 308 g~~----~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-----~~~~~~~~~TAtEi 378 (532) .+. +...+....+|.+.....+++...++. .+++ ...++.++.-|.+...... +........++.-+ T Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~-~~~~---e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al 344 (486) T protein:vir:42 269 EIGVDSETGQTLFDAYLARILAFEDAEGKIQQFS-AAEL---ANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAI 344 (486) T ss_pred ccccccccccchhhhhhchhcccCCCCceEEeec-ccCH---HHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHH Confidence 000 000111112333222112233333332 2222 3444555554544322111 11011111233333 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceee-cchHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_015159. 379 RYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIA-TGLEALGRGHDLNKLNVFIDYMIKLA 457 (532) Q Consensus 379 ~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v-~~l~~l~raq~~~~l~~~~~~laq~~ 457 (532) .....- +... .++.+. .+.+-+.+++.++.+..-....+.+..++.++ ....+-..++.++.+..+++....+ T Consensus 345 ~~~~~~-l~~k---a~~~~~-~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~- 418 (486) T protein:vir:42 345 RAAESR-LIKK---VERKNL-MFGGAWEEAMRIAYRIMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGV- 418 (486) T ss_pred HHHHHH-HHHH---HHHHHH-HHHHHHHHHHHHHHHHhcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCC- Confidence 332221 1111 122222 23333444444443321112223343333332 1111222222223332222211111 Q ss_pred chhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhH-HHHHHHHhhcccccCCCCC Q lcl|NC_015159. 458 GLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGA-AGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 458 p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~~~ 532 (532) +.- +.+ ...+|+.+.. .+|++..++++...... ...+..++ ...++..+.-.+..+.|.. T Consensus 419 ------~s~-et~---~~~lg~~~d~----~~e~~~~~~e~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (486) T protein:vir:42 419 ------IPR-ERA---RIDMGYSVKE----REEMRRWDEEEAAMGLG-LLGTMVDADPTVPGSPSPTAPPKPQPAI 479 (486) T ss_pred ------CCH-HHH---HhcCCCChhH----HHHHHHHHHHHHHHHHH-HHHHhhcCCCCCCCCCCCCCCCCCCccc Confidence 111 222 2235653321 13333332322222111 11111111 0111111111122222221 No 48 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=99.13 E-value=3.6e-10 Score=72.33 Aligned_cols=501 Identities=13% Similarity=0.072 Sum_probs=204.4 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHH----Hhhc-ccccCCCCCccc-----cccccc---ccchHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCA----TYTI-PSVFPSATADGS-----TSYTTP---WQSIGARGL 67 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~----~~~~-P~~~~~~~~~~~-----~~~~~~---~dst~~~a~ 67 (532) ||+..+ +.+.+.++.++... .|.+.|+.-+ +|.. +..-=+...... ++..++ |+-++.. + T Consensus 1 ma~~~~-----~~l~~~~~~~~~~~-~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~-v 73 (720) T protein:vir:35 1 MAETLQ-----KRHEQIMRKFDRAH-SPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTE-L 73 (720) T ss_pred CchHHH-----HHHHHHHHHHHHHH-hhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHH-H Confidence 888752 23333333333332 2334444322 3321 211000000000 011111 3333332 2 Q ss_pred HHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceee Q lcl|NC_015159. 68 NNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLL 147 (532) Q Consensus 68 ~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~ 147 (532) +++.+.--- +++=+++.+.+..- . .++.+.|+ ..+......++...+...++.+.+..|.|+. T Consensus 74 ----~~v~g~~~~-nr~d~~v~P~~~~~------d---~~~Ae~l~---~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ 136 (720) T protein:vir:35 74 ----NRIISEYRH-NRITVKFRPGDKTA------S---EALANKLN---GLFRADYEETDGGEACDNAFDDGSTGGFGCF 136 (720) T ss_pred ----HHHHhHHHh-CCCceEEEcCCCcc------h---HHHHHHHH---HHHHHHHHhcCchHHHhHHHHHhhhccceeE Confidence 233333322 55556666653220 0 12233333 3344455678888889999999999999987 Q ss_pred eecc-----ccc-ccCCcceEEEE--ecceEEEeeCC---CCC-eEEEEEEEeecHHHhhHHHHHHHHh-------hccc Q lcl|NC_015159. 148 YIPS-----TEQ-VEGQSNAPKLY--KLHNFVVERDA---YDN-VLQIVTEDKIARAALPEDVRKSLEE-------AQGD 208 (532) Q Consensus 148 ~v~~-----~~~-~~~~~~~~~~~--pl~~~~v~~d~---~G~-vd~i~rk~~~~~~~l~~~~~~~~~~-------~~~~ 208 (532) -+-- .+. .....+.++.+ |..++++..++ ++. -.-+||..+++.+++-..+...... .+.. T Consensus 137 ~v~~d~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~ 216 (720) T protein:vir:35 137 RLTTNLVNALDPMDERQRICLEPIYDPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDY 216 (720) T ss_pred EeeecccccCCCCcccceeeEecccCchhheeecccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccc Confidence 4411 111 11223334433 44566665443 221 2225677778888765444432211 1000 Q ss_pred -CCCcceEEEEEEEEe-----------eCC----------C---------------------CeEEEEE-EEcCcccccc Q lcl|NC_015159. 209 -QNPSEEVTIYTHVYR-----------DPE----------A---------------------MVFRSYQ-EIDGEIVAGT 244 (532) Q Consensus 209 -~~~~~~v~i~~~v~~-----------~~~----------~---------------------~~~~s~~-~~~~~~~~~~ 244 (532) .-....|.|+++.++ ++. . +.+++++ .+.|..+... T Consensus 217 d~~~~~~v~i~E~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~ 296 (720) T protein:vir:35 217 DWYDVDVVYIAKYYEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEK 296 (720) T ss_pred cccCCCceEEEEeeEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhccc Confidence 001123334333211 110 0 1233332 2344443333 Q ss_pred cccCccccCceEEEEee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhcc----- Q lcl|NC_015159. 245 EGEYPLDSCPWIPVRLI--KMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAK----- 317 (532) Q Consensus 245 ~~~~g~~~~P~~~~Rw~--~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~----- 317 (532) ...++++.|||+++-.. ..+|..+..|.+..+.+-.+.+|+..-..+..+...-.-+.....+++-..+.-.. T Consensus 297 ~~~~p~~~fP~vP~~g~r~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~~ 376 (720) T protein:vir:35 297 AQRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQDTGSIPIVGKSQIKTLEKYWANRNKN 376 (720) T ss_pred CCCCCCCccceEEEEeeeeccCCCcccceeeecchhHHHHHHHHHHHHHHHHHcCCccccccCcchHHHHHHHhhccccc Confidence 35577888999865332 34777878899999999999999987777777655444344443333222121111 Q ss_pred -----------CCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHH--hhhhcccCCCCCCCHHHHHHHHHH Q lcl|NC_015159. 318 -----------ANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAF--MLNSAVQRGGDRVTAEEIRYVAGE 384 (532) Q Consensus 318 -----------~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af--~~~~~~~~~~~~~TAtEi~~r~~E 384 (532) ..+|.++.. ++.+...+ ...-.+.....++.-...|.++- +-.++.+. +. .+.--|..|.+. T Consensus 377 ~~~~l~~~~~~~~~G~~~~~-~~~~~~~~--~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~-sn-~SG~Ai~~rq~q 451 (720) T protein:vir:35 377 RPAFLPLNEIVDKQGNIIAP-PTPVGYTQ--PQPLNQAMAALLQQTGADIQEVTGSSQAMQPMP-SN-IAKETVNHLMHR 451 (720) T ss_pred cccccccccccccCcccccC-CCcccccC--CCCCchHHHHHHHHHHHHHHHHhCCChHHcCcc-cc-hHHHHHHHHHHH Confidence 112222211 11111111 11111122233333333343331 11122222 22 466778889999 Q ss_pred HHHHhhhhHHHHHH------HHHHHHHHHHHH------HHHhcCC-----C-----CCCccc-----cc---cceeec-- Q lcl|NC_015159. 385 LEDTLGGVYSLLSQ------ELQLPLVKILLK------ELQATSK-----I-----PNLPKE-----AV---EPAIAT-- 432 (532) Q Consensus 385 ~~~~LGpv~~rl~~------E~l~Pli~r~~~------il~r~g~-----l-----p~~p~~-----~~---~~~~v~-- 432 (532) ....+...+..+.. +.+.-||...+. |....|. + .+.++. ++ +..++. T Consensus 452 g~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~ 531 (720) T protein:vir:35 452 SDMSSFIYLDNMAKSLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDV 531 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEec Confidence 99999888877654 444444444332 2222221 1 111121 11 112222 Q ss_pred -chHHHHHHHHHHHHHHHHHHHHhh-------cchhhhhcCH---HHHHHHHHHhcCCCHhHccCC--HHHHHHHHHHHH Q lcl|NC_015159. 433 -GLEALGRGHDLNKLNVFIDYMIKL-------AGLQDDDINL---LDVKMRLANSLGMDTTGLILT--QQDKQAKMAEAS 499 (532) Q Consensus 433 -~l~~l~raq~~~~l~~~~~~laq~-------~p~~~d~id~---d~~~~~~a~~~Gv~p~~i~~s--~ee~~~~~~q~~ 499 (532) +-.+-.|.+.++.++++++.+..- .+.++...|+ ++++..+-..+. +...+.. .++.++..++++ T Consensus 532 ~p~~~s~req~~~~m~qll~~~~p~~~~~~~~~~~ile~~d~p~~~e~~erirk~~~--~~~~~~~~~~e~qq~~a~~qq 609 (720) T protein:vir:35 532 GPSYTARRDATVSVLTNLLAGMLPQDPMRQVLQGIILDNMEGEGLDEFKEYNRKQLL--TQGVVKPRNTEEEQMVAQMIQ 609 (720) T ss_pred ccCcccHHHHHHHHHHHHHHhcCCCchhHHHHHHHHHHhcCchhHHHHHHHHHhhcc--hhcccCccChhHHHHHHHHHH Confidence 222344556666666555543221 1122344444 455555544422 2222222 122111111100 Q ss_pred HHH----HHHHHHH---hhhHHH---------HHH------------HHhhcccccCCC-------CC Q lcl|NC_015159. 500 TAA----GMVTAGQ---QMGAAG---------GQA------------AAAMMQQQAGLP-------TQ 532 (532) Q Consensus 500 ~~~----~~~~~~~---~~~~~~---------~~~------------~~~~~~~~~g~~-------~~ 532 (532) .++ ++++++. ++.+.. .++ .+.+.+..+.++ .| T Consensus 610 ~~qq~~~e~~~aqa~l~qaqae~~kaqa~~~~~qa~a~~aqa~a~~~~a~~~~~~aq~~~~~q~~i~q 677 (720) T protein:vir:35 610 QAQQPNAELVAAQGVLMQGQAEVQKAKNEELAIQVKAFQAQTEARVAEAKMVQILASADSAKRAEIRE 677 (720) T ss_pred HHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000 0000000 000000 000 000000000000 00 No 49 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.10 E-value=2.2e-09 Score=68.07 Aligned_cols=453 Identities=11% Similarity=0.064 Sum_probs=182.0 Q ss_pred CCCC--CCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhh--cccccCCCCCcccccccccccchHHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEV--EKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYT--IPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLML 76 (532) Q Consensus 1 m~~~--~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~--~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~ 76 (532) |+-+ .-++++.+.+.++...+...+.....+++++|+=- ++.... ..+...+..++..+-+..++++++..|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~~~i~~~~~--~~~~~~~~~~~~~n~~~~ivd~~~~~l~- 77 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQDLGDNTAYYESERRPDAVGV--TVPQQMQKLLAHVGYPRLYIDAIAARQE- 77 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccc--ccchhHHhhhhhcCcHHHHHHHHHhhhc- Confidence 7766 33566777777766666655554444444444211 111110 0011111112233455556666665442 Q ss_pred hhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_015159. 77 ALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVE 156 (532) Q Consensus 77 ~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~ 156 (532) +-+ |+. ++.+ +. .+.+.+....++|.....++.++..+||.|.++|-.++... T Consensus 78 ---~~g---~~~--~~~~------------~~-------~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~ 130 (484) T protein:vir:77 78 ---LEG---FRL--GGAD------------KA-------DEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNI 130 (484) T ss_pred ---cCc---eec--CCcc------------hh-------HHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCc Confidence 222 222 2111 01 11233556778999999999999999999988776553211 Q ss_pred -----CCcceEEEEecceEEEeeCC-CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeE Q lcl|NC_015159. 157 -----GQSNAPKLYKLHNFVVERDA-YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVF 230 (532) Q Consensus 157 -----~~~~~~~~~pl~~~~v~~d~-~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~ 230 (532) ....++++++..+.++..|+ .+++...++.+.-. ..+.-..+++|+ ++. .+ T Consensus 131 ~~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~~a~~~~~~~-----------------~~~~~~~~~~y~-----~~~-~~ 187 (484) T protein:vir:77 131 DPGVDPEVPIIRVEPPTNLYAQIDPRTRQVMRAIRAIEDE-----------------EGNEVIGATLYL-----PNN-TV 187 (484) T ss_pred ccccccccceEEEeccceeEEEecCCCCceEEEEEEEEee-----------------cCCcEEEEEEEe-----cCe-EE Confidence 11235677776665544454 46666555443310 001112222221 110 01 Q ss_pred EEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeec---C Q lcl|NC_015159. 231 RSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEE-YLGDLKSLENLYEAIVKMSMISSKVLFFVN---P 306 (532) Q Consensus 231 ~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~-al~d~~~L~~l~~~~l~~~~~a~~p~~lv~---~ 306 (532) .. +-.+|.-........++..+|++.++.+...++.+|+|-..+ ..+-+..++...-.....++..+.|...+. + T Consensus 188 ~~-~~~~~~~~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~ 266 (484) T protein:vir:77 188 IW-NREDGQWVQVANVAHNLEMVPVIPIPNRTRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKG 266 (484) T ss_pred EE-EecCCceEeeccccCCCCCcceEEeccccccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCc Confidence 10 111111111112245678899999998888899999997664 445567777777777777777777664442 1 Q ss_pred cccc----ChhhhccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhh-----hcccCCCCCCCHHH Q lcl|NC_015159. 307 NGVT----QIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLN-----SAVQRGGDRVTAEE 377 (532) Q Consensus 307 ~g~~----~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~-----~~~~~~~~~~TAtE 377 (532) +-+. +...+....+|.+.....+++...++. .+.++ ..++.++.-|....... .+.......-++.- T Consensus 267 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~-~~~~e---~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~A 342 (484) T protein:vir:77 267 EELGVDPETGQTLFDAYLARILAFEDHESKAQQFS-AAELR---NFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEA 342 (484) T ss_pred chhcccccccchhhhhhhhhhcccCCCCceeEeec-CCChH---HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHH Confidence 1000 000111112233222222233333322 12222 34444444444322111 11101111123333 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceee-cchHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015159. 378 IRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIA-TGLEALGRGHDLNKLNVFIDYMIKL 456 (532) Q Consensus 378 i~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v-~~l~~l~raq~~~~l~~~~~~laq~ 456 (532) +.....-+... ..+.+..|- +-+.+++.++....-....+.+..++.++ ....+-..++.++.+.. +++. T Consensus 343 l~~~~~~l~~k----a~~k~~~f~-~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~k----l~~~ 413 (484) T protein:vir:77 343 IRSSESRLVKT----VERKNKIFG-GAWEQAMRVAYKVMNGGDIPPEYYRMESIWRDPSTPTYAAKADAATK----LYNN 413 (484) T ss_pred HHHHHHHHHHH----HHHHHHHHH-HHHHHHHHHHHHHhCCCCcccccccceEEecCCCCCCHHHHHHHHHH----HHhc Confidence 33222111111 122222222 22233333333221112233344444332 11112122222222222 2222 Q ss_pred cchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhh-cccccCCCCC Q lcl|NC_015159. 457 AGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAM-MQQQAGLPTQ 532 (532) Q Consensus 457 ~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~g~~~~ 532 (532) ...+ +.- +.+...+|+.+.. .+|++.+++++..++ ++.+.+..+.....+..+. -.+.+|.|.- T Consensus 414 g~gi---~s~----et~~~~l~~~~~~----~~e~~~~~~ee~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (484) T protein:vir:77 414 GQGV---IPK----ERARIDMGYSITE----REEMRKWDEEEQAQG-LGLMGTMFGTDPSGGGNPDNPETPEPQPNP 478 (484) T ss_pred cCCC---CCH----HHHHhcCCCChhH----HHHHHHHHHHHHHHH-HHHHhhhccccccCCCCCCCCCcccccCCC Confidence 1111 111 2223335653321 133333333322211 1111111111111111111 1111111111 No 50 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.04 E-value=4.2e-09 Score=66.50 Aligned_cols=451 Identities=10% Similarity=0.068 Sum_probs=184.1 Q ss_pred CCC----CCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCC---CCcccccccccccchHHHHHHHHHHH Q lcl|NC_015159. 1 MAE----VEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSA---TADGSTSYTTPWQSIGARGLNNLASK 73 (532) Q Consensus 1 m~~----~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~---~~~~~~~~~~~~dst~~~a~~~Laa~ 73 (532) |.- ++....+...+...+..+..++ ++.+++.+|..=..-... ..+...+..+...+-+..++++++.. T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~~~~----~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~ 76 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFEDST----QNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAER 76 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHH----HHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhh Confidence 322 2212222333333444444333 445555555443211111 11111112234456677777777776 Q ss_pred HHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccc Q lcl|NC_015159. 74 LMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTE 153 (532) Q Consensus 74 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~ 153 (532) | ++.+ |+.. .+.+ .. ..+.+.+.+++|.....++.++..+||.|.+++..++ T Consensus 77 l----~~~g---~~~~-~~~~-------------~~-------~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e 128 (485) T protein:vir:10 77 Q----AVEG---FRFG-DADE-------------AD-------EELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPD 128 (485) T ss_pred h----cccc---eecC-CCch-------------hH-------HHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCC Confidence 5 3322 2221 1111 11 1223455678999999999999999999988876553 Q ss_pred cc-----cCCcceEEEEecceEEEeeC-CCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCC Q lcl|NC_015159. 154 QV-----EGQSNAPKLYKLHNFVVERD-AYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEA 227 (532) Q Consensus 154 ~~-----~~~~~~~~~~pl~~~~v~~d-~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~ 227 (532) .. ..+..++++++..+.++..| ..+++...++...- ...+....+++|+ ++ T Consensus 129 ~~~~~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~y~-----~~- 185 (485) T protein:vir:10 129 PQIDLGWDPNTPIIRVEPPTRMYAEIDPRIGRVSKAIRVAYD-----------------AEGNEIQAATLYT-----PN- 185 (485) T ss_pred cccccccCCCeeEEEEEccceeEEEEcCCCCceeEEEEEEEe-----------------eCCCeEEEEEEEe-----CC- Confidence 21 12345678888777655555 45566655544320 0011112233332 11 Q ss_pred CeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeec- Q lcl|NC_015159. 228 MVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEE-YLGDLKSLENLYEAIVKMSMISSKVLFFVN- 305 (532) Q Consensus 228 ~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~-al~d~~~L~~l~~~~l~~~~~a~~p~~lv~- 305 (532) .-|.. ....+.-.......+++..+|++.+..+...+..||+|=.+. .++-+..++...-.+...++..+.|...+. T Consensus 186 ~~~~~-~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G 264 (485) T protein:vir:10 186 DIFGW-YRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFG 264 (485) T ss_pred eEEEE-EEcCCceEEeccccCCCCcccEEEeccccccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhc Confidence 11111 111222111122346778899999999999999999996654 345567778777777778888887764442 Q ss_pred --Cccc----cChhhhccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhh-----hcccCCCCCCC Q lcl|NC_015159. 306 --PNGV----TQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLN-----SAVQRGGDRVT 374 (532) Q Consensus 306 --~~g~----~~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~-----~~~~~~~~~~T 374 (532) ++.+ .+...+....+|.+......++...++. .++++ ..++.++.-|.+..... .+........+ T Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~d~k~~q~~-~~~~~---~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~S 340 (485) T protein:vir:10 265 IKPEEIGVDPETGQTLFDAYLARILAFEDAEGKIQQFS-AAELA---NFTNALDQIAKQVAAYTGLPPQYLSTAADNPAS 340 (485) T ss_pred CCcccccccccccchhhhhcccceeccCCCCceEEeec-ccchH---HHHHHHHHHHHHHhcccCCCHHHhccccCchhH Confidence 1100 0001112223344332222233333332 22333 34444444444332211 11111111123 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceee-cchHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 375 AEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIA-TGLEALGRGHDLNKLNVFIDYM 453 (532) Q Consensus 375 AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v-~~l~~l~raq~~~~l~~~~~~l 453 (532) +.-+......+... .++.+. .+.+-+.+++.++...-.....+.+..++.++ ....+-..++.++.+.. + T Consensus 341 g~Al~~~~~~l~~k----~~~k~~-~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~~~~~~ada~~k----l 411 (485) T protein:vir:10 341 AEAIRAAESRLIKK----VERKNS-IFGGAWEEAMRLAYRMMKGGDVPPDMLRMETVWRDPSTPTYAAKADAASK----L 411 (485) T ss_pred HHHHHHHHHHHHHH----HHHHHH-HHHHHHHHHHHHHHHHhCCCCCcccceeeeEEecCCCCCCHHHHHHHHHH----H Confidence 33333322222211 223222 22333344444433211112223333333332 11111111222222211 1 Q ss_pred HhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 454 IKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 454 aq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .+... -.+.-..+. +.+|+.+.. .++++.+++++..+.. .++..+.....+...+.-..+....|+. T Consensus 412 ~~ag~---~~~s~et~~----~~lg~~~~~----~~~~~~~~ee~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (485) T protein:vir:10 412 YNGGT---GVIPRERAR----KDMGYSIAE----REEMRRWDEEEAAMGL-GLIGTMVDPNPTVPGSPSPAPAPKPAAL 478 (485) T ss_pred Hhccc---cCCCHHHHH----HhCCCCHhH----HHHHHHHHHHHHHHHH-HHHHHhhccCCCCCCCCCccccccCcCC Confidence 11110 012222222 336764431 1333333222222211 1111111111111111122222222222 No 51 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=99.02 E-value=5.1e-09 Score=66.05 Aligned_cols=437 Identities=11% Similarity=0.064 Sum_probs=188.2 Q ss_pred CCCCCCCccCHHHHHH-HHHHHHHHhhhHHHHHHHHHHhhcccccC-CCCCccccc-ccccccchHHHHHHHHHHHHHHh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAA-AYNRLKNDRGAYETRAEDCATYTIPSVFP-SATADGSTS-YTTPWQSIGARGLNNLASKLMLA 77 (532) Q Consensus 1 m~~~~~~~~~~~~~~~-r~~~lk~~R~~~e~~w~e~~~~~~P~~~~-~~~~~~~~~-~~~~~dst~~~a~~~Laa~l~~~ 77 (532) |.+.-+ .+.. ..-.+-..+-.....|+++|+=--+-... .....+... ..++--+.+-..++.+|+-|.+- T Consensus 18 ~~~~~~------~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k~i~~~~a~~l~~~ 91 (496) T protein:vir:38 18 LLKALK------DVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPKVTAKYMSKLLFNE 91 (496) T ss_pred cchhhH------HHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHHHHHHHHhhhhhCC Confidence 222211 1110 00001111223344555554321111111 111111111 12233355666667666554332 Q ss_pred hcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccC Q lcl|NC_015159. 78 LFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEG 157 (532) Q Consensus 78 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~ 157 (532) . | .++.+|. +..++| .+.+..++|...+.++..+...+|.+.+++-.++ . T Consensus 92 p--~-----~i~~~d~-------------~~~e~l-------~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~---~ 141 (496) T protein:vir:38 92 K--V-----KINIDDK-------------AAEEFV-------LNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDG---N 141 (496) T ss_pred c--c-----eEeeCCh-------------HHHHHH-------HHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcC---C Confidence 1 1 1233331 233333 3466678899999999999999999998876543 4 Q ss_pred CcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEE- Q lcl|NC_015159. 158 QSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEI- 236 (532) Q Consensus 158 ~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~- 236 (532) +.+.+..++..+++--.+..|++..+.+....+. .++.+..++.+. .. +.+|...+.+ T Consensus 142 ~~~~i~~v~~~~~~P~~~~~~~~~~~~f~~~~~~----------------~~~~y~~le~h~---~~--~~~~~I~~~~y 200 (496) T protein:vir:38 142 KNVKVSFATADCMYPLSNDSENVDECVIANSFHK----------------NNKYYTLLEWNE---WQ--GDVYTVTTELY 200 (496) T ss_pred CcEEEEEEcccceEEEEecCCcEEEEEEEEEEEe----------------CCeEEEEEEEEE---Ee--CceEEEEEEEE Confidence 6678999999988754555677766554433321 111222222211 11 1112221110 Q ss_pred ---c----Ccc---------cccccccCccccCceEEEE----eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 237 ---D----GEI---------VAGTEGEYPLDSCPWIPVR----LIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMI 296 (532) Q Consensus 237 ---~----~~~---------~~~~~~~~g~~~~P~~~~R----w~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~ 296 (532) + |.. +.......++...||+..+ .+...++.||+|-..++++-+..|+..--......+. T Consensus 201 ~~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~ 280 (496) T protein:vir:38 201 QSDDPNELGTKVSLTLLFDDIEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL 280 (496) T ss_pred ecCCccccCccccccccccccccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhh Confidence 0 100 0011111233444554433 3446678999999999999999999887776665554 Q ss_pred HhcCceeecCccccChhhhccCCC--------cee--ecCcccc-ccccc-cCCccchhHHHHHHHHHHHHHHHHH--hh Q lcl|NC_015159. 297 SSKVLFFVNPNGVTQIRRVAKANT--------GDF--VAGRKQD-VEVFQ-LEKYNDFQVAKATADDIEKRLSYAF--ML 362 (532) Q Consensus 297 a~~p~~lv~~~g~~~~~~~~~~~~--------G~~--v~g~~~~-~~~~~-~~~~~~~~~~~~~i~~~~~rI~~af--~~ 362 (532) .++.+.++++ ++....-..+.+ ..+ +.+...+ ...+. +...-+...-...++.+.+.|...- -. T Consensus 281 -~~~~i~v~~~-~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~ 358 (496) T protein:vir:38 281 -GKKKVLVPSS-FVKTAVNLDGSTTQYFDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSA 358 (496) T ss_pred -cccceecchH-HhhccCCCCCccccCCCCccceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCCh Confidence 5777777543 221111001110 011 1111111 11111 1111111222333444444443221 11 Q ss_pred hhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCC--Cccccccceeec--chHHHH Q lcl|NC_015159. 363 NSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPN--LPKEAVEPAIAT--GLEALG 438 (532) Q Consensus 363 ~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~--~p~~~~~~~~v~--~l~~l~ 438 (532) ..+....+...||+||..+.+...+...- ..+.....|..++.-++.+....+.+.. .+...+.+.+-- +.+... T Consensus 359 ~~f~~~~~g~~tAtei~~~~~~l~~~~~~-~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~ 437 (496) T protein:vir:38 359 GTFTFDENGLKTATEVVSEKSETYQTKNS-HSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDT 437 (496) T ss_pred hhcCCCccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHH Confidence 12222233446999999888777776544 5566666777776666655432222211 122223333311 222222 Q ss_pred HHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHH Q lcl|NC_015159. 439 RGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQA 518 (532) Q Consensus 439 raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~ 518 (532) . ++.+...++ +.+ +....++ ....|+ |++|++++.++.+..++++ .+ . T Consensus 438 ~---~~~~~~~~~--~Gi-------iS~et~l---~~~~~~-------~d~ea~~el~ri~~E~~~~-----~~--~--- 485 (496) T protein:vir:38 438 T---INRYTNAKN--QGM-------IPLKIAL---QRAWNI-------TEAEADEWAEMLAKEKQAE-----MP--N--- 485 (496) T ss_pred H---HHHHHHHHh--cCC-------CCHHHHH---HhcCCC-------ChHHHHHHHHHHHHhhhcc-----Cc--c--- Confidence 2 222222221 111 2222222 233454 3445444433332222111 00 0 Q ss_pred HHhhcccccCCCC Q lcl|NC_015159. 519 AAAMMQQQAGLPT 531 (532) Q Consensus 519 ~~~~~~~~~g~~~ 531 (532) ..+....|-.+ T Consensus 486 --~d~~~~~~~~e 496 (496) T protein:vir:38 486 --NDMNGIFGEEE 496 (496) T ss_pred --ccccCCCCCCC Confidence 01111111111 No 52 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.01 E-value=5.9e-09 Score=65.70 Aligned_cols=456 Identities=12% Similarity=0.063 Sum_probs=191.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCC---CCcccccccccccchHHHHHHHHHHHHH-- Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSA---TADGSTSYTTPWQSIGARGLNNLASKLM-- 75 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~---~~~~~~~~~~~~dst~~~a~~~Laa~l~-- 75 (532) |++.+... +.+-+...+.++..++ ++++.+.+|..=...... ..+...+..++..+-+..++++++..|. T Consensus 1 ~~~~~~~d-~~~~i~~L~~~~~~~~----~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~~ 75 (488) T protein:vir:23 1 MAETESID-PEKLRDQLLDAFENKQ----NELKSSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTYVDAIAERQELE 75 (488) T ss_pred CCcccCCC-HHHHHHHHHHHHHHHH----HHHHHHHHHHhcccchhhcCcccchhhhhhhhhcchHHHHHHHHHHhhhcc Confidence 98887544 3443444444444443 455555555422210000 1111112334566777778888887664 Q ss_pred HhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccc- Q lcl|NC_015159. 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQ- 154 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~- 154 (532) +..+|....+=-....|. ++.. .+.+.+..++|.....++.++..++|.|.++|..++. T Consensus 76 Gf~~~~~~~~~~~~~~d~-------------~~~~-------~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~ 135 (488) T protein:vir:23 76 GFRIPSANGEEPESGGEN-------------DPAS-------ELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPE 135 (488) T ss_pred ceeccCCcccccccccch-------------hHHH-------HHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcc Confidence 222221111111111111 1222 2345677889999999999999999999888765431 Q ss_pred ----ccCCcceEEEEecceEEEee-CCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCe Q lcl|NC_015159. 155 ----VEGQSNAPKLYKLHNFVVER-DAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMV 229 (532) Q Consensus 155 ----~~~~~~~~~~~pl~~~~v~~-d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~ 229 (532) .+.+..++++++..+.++-. +..+++...++.+.- ........+++|+ + +. - T Consensus 136 ~~~~~~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~y~---~--~~-~ 192 (488) T protein:vir:23 136 VDFDVDPEVPLIRVEPPTALYAEVDPRTRKVLYAIRAIYG-----------------ADGNEIVSATLYL---P--DT-T 192 (488) T ss_pred cccCCCCCcceEEEeccceeEEEEecCCCceEEEEEEEEe-----------------cCCCcEEEEEEEe---c--Cc-E Confidence 22334467778777754444 456777765554430 0001111222221 1 11 0 Q ss_pred EEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeecC-- Q lcl|NC_015159. 230 FRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEE-YLGDLKSLENLYEAIVKMSMISSKVLFFVNP-- 306 (532) Q Consensus 230 ~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~-al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~-- 306 (532) +. +....|.........++|..+|++.++.+...+..+|+|=..+ .++-+..++...-.....++....|...+.. T Consensus 193 ~~-~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~ 271 (488) T protein:vir:23 193 MT-WLRAEGEWEAPTSTPHGLEMVPVIPISNRTRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAK 271 (488) T ss_pred EE-EEecCCceEeccccccCCCCcceEEeccccccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCC Confidence 10 1111222111122346788999999999988999999996654 3455677777777777777777777654421 Q ss_pred -ccc----cChhhhccCCCceeecCcc-ccccccccCCccchhHHHHHHHHHHHHHHHHHhhhh-----cccCCCCCCCH Q lcl|NC_015159. 307 -NGV----TQIRRVAKANTGDFVAGRK-QDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNS-----AVQRGGDRVTA 375 (532) Q Consensus 307 -~g~----~~~~~~~~~~~G~~v~g~~-~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-----~~~~~~~~~TA 375 (532) +.. .+...+.....|.+..... .++...+++. .++ ...++.++.-|.+.+.... +.......-++ T Consensus 272 ~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~q~~~-~~~---~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg 347 (488) T protein:vir:23 272 PEELGINAETGQRMFDAYMARILAFEGGEGAHAEQFSA-AEL---RNFVDALDALDRKAASYSGLPPQYLSSSSDNPASA 347 (488) T ss_pred cccccccccccchhhhhhhhhhccCCCCCCceeEecCC-CCh---HHHHHHHHHHHHHHhcccCCCHHHhccccCcchHH Confidence 100 0111122222333322111 1223333332 233 3444445544444322111 11111111133 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCccccccceee-cchHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 376 EEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQAT-SKIPNLPKEAVEPAIA-TGLEALGRGHDLNKLNVFIDYM 453 (532) Q Consensus 376 tEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~-g~lp~~p~~~~~~~~v-~~l~~l~raq~~~~l~~~~~~l 453 (532) .-+.....-+... .++.+.. +.+-+.+++.++... |. ...+.+..++.++ ....+-.-++.++.+..+++.. T Consensus 348 ~Al~~~~~~l~~k----~~~~~~~-f~~~l~~~~~l~~~~~~~-~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g 421 (488) T protein:vir:23 348 EAIKAAESRLVKK----VERKNKI-FGGAWEQAMRLAYKMVKG-GDIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANG 421 (488) T ss_pred HHHHHHHHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHhcC-CCcchhhccceEEecCCCCCCHHHHHHHHHHHHhcc Confidence 3333222222222 2333333 333345555554432 11 1233344343332 1111222223333332222211 Q ss_pred HhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 454 IKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 454 aq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) ..+ +.... +.+.+|..+. ..++++.+++++..+.. .+..+..+...+. ...--.+-++.+.+ T Consensus 422 ~~~-------~s~et----~~~~l~~~~d----~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 483 (488) T protein:vir:23 422 AGL-------IPRER----GWVDMGYTIV----EREQMRQWLEQDQKQGL-GLIGSLYGASTPE-GKPGEAPVGEPPAP 483 (488) T ss_pred ccc-------CCHHH----HHHhCCCCch----HHHHHHHHHHHHHHHHH-HHHHHHhccCCCc-ccCCCCCCCCCCCC Confidence 111 12222 2333454221 11333333222222111 1111111111111 11111122222222 No 53 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=98.96 E-value=9.8e-09 Score=64.48 Aligned_cols=447 Identities=11% Similarity=0.061 Sum_probs=182.0 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCC---CCCcccccccccccchHHHHHHHHHHHHHHh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPS---ATADGSTSYTTPWQSIGARGLNNLASKLMLA 77 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~---~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 77 (532) -...+..-.+...+....+.+..++ ++.+.+.+|..-..-.. ...+...+..++..+-+...++++++.| T Consensus 5 i~~~~~~~~~~~~~~~L~~~~~~~~----~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l--- 77 (485) T protein:vir:24 5 LPGQEEIADPAIARDEMVSAFEDQN----QNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAERQ--- 77 (485) T ss_pred CCCCCcccchHHHHHHHHHHHHHHH----HHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHHHhhhh--- Confidence 2222223334443333444444433 33344444433221100 0111111222344556667777766655 Q ss_pred hcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccc-- Q lcl|NC_015159. 78 LFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQV-- 155 (532) Q Consensus 78 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~-- 155 (532) ++. .|+ + ++.+ +.. ..+.+.+..++|.....++.++..+||.|.++|..++.. T Consensus 78 -~~~--g~~-~--~~~~------------~~~-------~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~ 132 (485) T protein:vir:24 78 -AVE--GFR-L--GDAD------------EAD-------EELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQID 132 (485) T ss_pred -ccC--cee-c--CCCc------------hhH-------HHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccc Confidence 332 222 2 2111 011 112344567789999999999999999999888655321 Q ss_pred ---cCCcceEEEEecceEEEeeC-CCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEE Q lcl|NC_015159. 156 ---EGQSNAPKLYKLHNFVVERD-AYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFR 231 (532) Q Consensus 156 ---~~~~~~~~~~pl~~~~v~~d-~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~ 231 (532) ..+..+++.++..+.++..| ..+++...++.+.-. .......+++|+ ++ .-|. T Consensus 133 ~~~~~~~~~i~~~~p~~~~~i~D~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~y~-----~~-~~~~ 189 (485) T protein:vir:24 133 LGWDPNVPLIRVEPPTRMYAEIDPRIGRPAKAIRVAYDA-----------------EGNEIQAATLYT-----PN-ETFG 189 (485) T ss_pred cccCCCcceEEEeccceeEEEeeCCcCceeEEEEEEEee-----------------cCCeEEEEEEEc-----CC-cEEE Confidence 12334677777777666665 456666555544310 011112233321 11 1111 Q ss_pred EEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHH-HHHHHHHHHHHHHHHHHHHHHhcCceeec---Cc Q lcl|NC_015159. 232 SYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEY-LGDLKSLENLYEAIVKMSMISSKVLFFVN---PN 307 (532) Q Consensus 232 s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~a-l~d~~~L~~l~~~~l~~~~~a~~p~~lv~---~~ 307 (532) +...+|..........+|..+|++.++.+...+..||+|-..+. .+-+..++...-.+...++....|...+. ++ T Consensus 190 -~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~ 268 (485) T protein:vir:24 190 -WFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPE 268 (485) T ss_pred -EEecCCceEeecccccCCCcccEEEeccCcccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCcc Confidence 11112222222223457788999999988888999999977653 45566777776677777777777765442 11 Q ss_pred ccc----ChhhhccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhh-----hcccCCCCCCCHHHH Q lcl|NC_015159. 308 GVT----QIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLN-----SAVQRGGDRVTAEEI 378 (532) Q Consensus 308 g~~----~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~-----~~~~~~~~~~TAtEi 378 (532) .+. +...+....+|.+.....+++...++. .+.++ ..++.++.-|.+..... .+........++.-+ T Consensus 269 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~-~~~~e---~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al 344 (485) T protein:vir:24 269 EIGVDPETGQTLFDAYLARILAFEDAEGKIQQFS-AAELA---NFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAI 344 (485) T ss_pred ccccccccccchhhhcccceeccCCCCceEEeec-ccchH---HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHH Confidence 100 001111223343322112233333322 12222 33444444443322111 111111111233333 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceee--c--chHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 379 RYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIA--T--GLEALGRGHDLNKLNVFIDYMI 454 (532) Q Consensus 379 ~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v--~--~l~~l~raq~~~~l~~~~~~la 454 (532) .. ....+.... ++.+. .+.+-+.+++.++.........+.+..++.++ . +.+-+..+..+.++ ++... T Consensus 345 ~~-~~~~l~~ka---~~~~~-~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl---~~~g~ 416 (485) T protein:vir:24 345 RA-AESRLIKKV---ERKNA-IFGGAWEEAMRLAYRLMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKL---YGNGQ 416 (485) T ss_pred HH-HHHHHHHHH---HHHHH-HHHHHHHHHHHHHHHHhcCCCCccccceeeEEecCCCCCCHHHHHHHHHHH---Hhccc Confidence 22 222222222 23332 23333444444443321111223333333332 1 22223333322222 22111 Q ss_pred hhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHH--HHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 455 KLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMV--TAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 455 q~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .+ +.... +.+.+|..+.. .+|++++++++..+.... .+..+..+. ......-.++.+.++. T Consensus 417 ~~-------~s~et----~~~~l~~~~d~----~~e~~~~~ee~~~~~~~~~~~~~~~~~~~--~~~~~~~e~~~~~~~~ 479 (485) T protein:vir:24 417 GV-------IPRER----ARKDMGYSIAE----REEMRRWDEEEAAMGLGLLGTMVDADPTV--PGSPNPTPAPKPQPAI 479 (485) T ss_pred cc-------CCHHH----HHhhCCCCHhH----HHHHHHHHHHHhhhhhhHHHhhcccCCCC--CCCCCCCCCCCCccCC Confidence 11 11222 22345663321 123333222222111111 111111111 1111222233333333 No 54 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=98.93 E-value=1.3e-08 Score=63.75 Aligned_cols=452 Identities=11% Similarity=-0.004 Sum_probs=199.3 Q ss_pred CCCCCCCc-cCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccc---cC-CCCCcccccccccccchHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTG-FAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV---FP-SATADGSTSYTTPWQSIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~~~-~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~---~~-~~~~~~~~~~~~~~dst~~~a~~~Laa~l~ 75 (532) |...+... .+.+.+.+..+.-+..+ .++++++.+|..... .. .......+...++..+.+...++..++.|+ T Consensus 31 ~~~~~~~~~~~~~~i~~~i~~h~~~~---~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~ 107 (502) T protein:vir:48 31 ADNLEELMVNNWELLKNFINHHKLRQ---APRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLA 107 (502) T ss_pred ccchhhhccccHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCccccccccccccccccceeecchHHHHHHHHhhhhc Confidence 43333222 22333444434333333 345566666654421 10 011111222345666677777777766554 Q ss_pred HhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_015159. 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQV 155 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~ 155 (532) +- | ++++..+.... ..+ ...+...+..++|....+++.+++.+||.|.+++..++ T Consensus 108 g~--p-----~~~~~~d~~~~---------~~~-------~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~de-- 162 (502) T protein:vir:48 108 GN--P-----IRVEYDDNEDN---------SQN-------DDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSE-- 162 (502) T ss_pred cc--C-----eeEecCCccch---------hHH-------HHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCC-- Confidence 32 1 12232221111 112 23344567788999999999999999999988776543 Q ss_pred cCCcceEEEEecceEEEeeC-C-CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEE Q lcl|NC_015159. 156 EGQSNAPKLYKLHNFVVERD-A-YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSY 233 (532) Q Consensus 156 ~~~~~~~~~~pl~~~~v~~d-~-~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~ 233 (532) .+.+++..++..+.++..| . .+++...+|.+..... .+....+++|+ ++ .-| + T Consensus 163 -dg~~~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~----------------~~~~~~~~iyt-----~~-~i~--~ 217 (502) T protein:vir:48 163 -YDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTL----------------QNAKDVVEIYT-----NQ-HIY--T 217 (502) T ss_pred -CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEEeec----------------CCcEEEEEEEe-----CC-eEE--E Confidence 3556788887766544333 3 5677766665442111 11122333432 11 111 1 Q ss_pred EEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChh Q lcl|NC_015159. 234 QEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIR 313 (532) Q Consensus 234 ~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~ 313 (532) +...+..........+|..+|++..+ ++..|.|-.+.+++-+..++.+.-......+....|.+.+.-.+..... T Consensus 218 ~~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~ 292 (502) T protein:vir:48 218 LDASDSFNEISVTPHAFGTVPITEFL-----NNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQG 292 (502) T ss_pred EEeCCceeeccceecCCCccceEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccc Confidence 11222222222334567789987664 3457999999999999999999888888888888888766432221111 Q ss_pred -hhccC-CCceeec-------CccccccccccCCccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHHHHH Q lcl|NC_015159. 314 -RVAKA-NTGDFVA-------GRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAF-MLNSAVQRGGDRVTAEEIRYVAG 383 (532) Q Consensus 314 -~~~~~-~~G~~v~-------g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~~~TAtEi~~r~~ 383 (532) ..... ..+.+.. |......+..+....+.+.....++.+.+.|...= ..+......+...|+..+..... T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~ 372 (502) T protein:vir:48 293 MQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLF 372 (502) T ss_pred cchhhhhhcceeeccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHH Confidence 11000 1122211 11222223233333455666666777777665421 11111111123356666554432 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhh Q lcl|NC_015159. 384 ELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDD 463 (532) Q Consensus 384 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~ 463 (532) .+........++-.+.+.-++.-++.++...+.........+++.+. +.-|-..+..++.+ +.++.+ T Consensus 373 -~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~-~~~p~d~~e~a~~~----~kl~g~------- 439 (502) T protein:vir:48 373 -GLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESRLKITFT-PNLPKSLYEQVSIL----NDLGGQ------- 439 (502) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeC-CCCCcCHHHHHHHH----HHHhcc------- Confidence 22222333334444444444444444444443322222223444442 21121122222222 122222 Q ss_pred cCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHH--HHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 464 INLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVT--AGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 464 id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) +.-..++ +.+| ++.+ ++|++...+++++...... ........++.........+....++ T Consensus 440 iS~et~l----~~l~-----~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~~ 502 (502) T protein:vir:48 440 VSQETAL----SLSG-----LVENPTEELDKINEESSKIDFKGYPSYFYDNVGKYTDEVKETHTDDFERVYE 502 (502) T ss_pred CcHHHHH----HhCC-----CCCCHHHHHHHHHHHHHhhhhhcccccccccccccCCCccCCCCcCcCCCCC Confidence 1122222 2233 2323 2344333322221111000 00001111111111112222222222 No 55 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=98.88 E-value=2.1e-08 Score=62.71 Aligned_cols=437 Identities=13% Similarity=0.074 Sum_probs=184.0 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCC---CCcccccccccccchHHHHHHHHHHHHHHh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSA---TADGSTSYTTPWQSIGARGLNNLASKLMLA 77 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~---~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 77 (532) |+=+ .+.+....+.+..++ ++...+.+|..-..-... ..+...+..++..+-+..+++.+++.| T Consensus 1 ~~t~------~~~i~~L~~~~~~~~----~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l--- 67 (480) T protein:vir:78 1 MTTY------HEHVERLQGLLARDL----PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL--- 67 (480) T ss_pred CCCH------HHHHHHHHHHHHHHH----HHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhh--- Confidence 5432 345555566554443 444455555433211111 111111222445566667777777665 Q ss_pred hcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccc---c Q lcl|NC_015159. 78 LFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTE---Q 154 (532) Q Consensus 78 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~---~ 154 (532) ++.+ |... .|.+ .. ..+.+.+..++|.....++.++..+||.|.++|...+ . T Consensus 68 -~~~g---~~~~-~d~~-------------~~-------~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~ 122 (480) T protein:vir:78 68 -DIEG---FRIS-EDSE-------------GL-------EELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESG 122 (480) T ss_pred -ccCc---eecC-CCch-------------hH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccC Confidence 2322 2221 1111 11 1223556778999999999999999999988776432 1 Q ss_pred ccCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEE---E-EeeCCCC Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTH---V-YRDPEAM 228 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~---v-~~~~~~~ 228 (532) ++.+..+++.++..+.++..|+ .+++...+|.+.-. + .......+++|+. + +....++ T Consensus 123 d~~g~~~i~~~~p~~~~~~~D~~~~~~~~~~i~~~~~~-~---------------~~~~~~~~~~y~~~~~~~~~~~~~~ 186 (480) T protein:vir:78 123 DPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR-D---------------DVAVPDRATLYLPDETVPLRRNGGL 186 (480) T ss_pred CCCCeeEEEEEcccceEEEEcCCCccceEEEEEEEEee-c---------------CCCceEEEEEEeCCeEEEEEecCCC Confidence 2345567888888887777775 57777666554310 0 0111123333321 0 0111111 Q ss_pred eEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeecCc Q lcl|NC_015159. 229 VFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEE-YLGDLKSLENLYEAIVKMSMISSKVLFFVNPN 307 (532) Q Consensus 229 ~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~-al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~ 307 (532) ..... .......+++..+|++.++.+...+..||+|=..+ ..+-+-.++...-.....++....|...+. T Consensus 187 ~~~~~-------~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~-- 257 (480) T protein:vir:78 187 NDQWV-------VDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-- 257 (480) T ss_pred ccccc-------cccccccCCCCCcceEEeecccccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhh-- Confidence 11111 11112245778899999999888899999997765 457777888877777778887777765442 Q ss_pred cccChhhh--------ccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhh-----hcccCCCCC-C Q lcl|NC_015159. 308 GVTQIRRV--------AKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLN-----SAVQRGGDR-V 373 (532) Q Consensus 308 g~~~~~~~--------~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~-----~~~~~~~~~-~ 373 (532) |.. +..+ .....|.+..-..+++...++.. ++++.. ++.++.-|.+.+... .+. ..... - T Consensus 258 G~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~---~~~l~~~i~~~~~~~~~p~~~~g-~~~~n~~ 331 (480) T protein:vir:78 258 GVT-TDELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELRNF---AEEMEVFRKEAASITGLPPQYLS-SSSENPA 331 (480) T ss_pred cCC-ccccccccccchhhhhhhhhccCCCCCceEEecCc-cCHHHH---HHHHHHHHHHHhcccCCChHHhc-cccCcch Confidence 111 1111 11112222221222333334332 234433 333444333322111 111 11111 1 Q ss_pred CHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCccccccceeec-chHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 374 TAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQAT-SKIPNLPKEAVEPAIAT-GLEALGRGHDLNKLNVFID 451 (532) Q Consensus 374 TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~-g~lp~~p~~~~~~~~v~-~l~~l~raq~~~~l~~~~~ 451 (532) ++.-+..+...+.. ...+.+. .+.+-+.+++.++... |. ..+.+..++.++- ...+-.-++.++.+...++ T Consensus 332 Sg~Alk~~~~~l~~----ka~~~~~-~f~~~l~~~~~l~~~~~g~--~~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~ 404 (480) T protein:vir:78 332 SAEAIIATDSRIVK----MAERKGR-IFGGAWERAMRIAMQIMGR--EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYA 404 (480) T ss_pred HHHHHHHHHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHHcCC--CccccceeeeEEecCCCCCCHHHHHHHHHHHHH Confidence 33323222211111 1233333 2233344444444331 21 1223333333221 1111122223333322222 Q ss_pred HHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCC Q lcl|NC_015159. 452 YMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPT 531 (532) Q Consensus 452 ~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 531 (532) ... + .+.... +...+|..+ ++++.+.+.+ +++......++.....+.+.+....+.+..++ T Consensus 405 ~g~---~----~~s~et----~~~~lg~~~-------d~~~~~~~~~-~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 465 (480) T protein:vir:78 405 NGQ---G----PIPKEQ----ARIDLGYTA-------TQREQMRDWD-KQETEDMIDTLYSTTKAQADATPKPTVTETKT 465 (480) T ss_pred hcc---c----cCCHHH----HHhcCCCCH-------hHHHHHHHHH-HHHHHHHHHHhhccccccCCCCCCCCCCCCCC Confidence 111 1 122222 223356533 2232222111 11111111111111011111111111122222 Q ss_pred C Q lcl|NC_015159. 532 Q 532 (532) Q Consensus 532 ~ 532 (532) + T Consensus 466 ~ 466 (480) T protein:vir:78 466 E 466 (480) T ss_pred c Confidence 2 No 56 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=98.81 E-value=3.8e-08 Score=61.25 Aligned_cols=431 Identities=10% Similarity=0.026 Sum_probs=195.6 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc---ccCCCCCcccccccccccchHHHHHHHHHHHHHHh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPS---VFPSATADGSTSYTTPWQSIGARGLNNLASKLMLA 77 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~---~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 77 (532) |-=++-..+..+.|.+..+.....+ .+++++.+|..-. ..... ....+...++..+.+...++.+++.|++ T Consensus 9 ~~~p~d~~~~~~~l~~~i~~~~~~~----~r~~~~~~yy~g~~~i~~~~~-~~~~~~~~ki~~n~~~~ivd~~~~~l~g- 82 (453) T protein:vir:39 9 MTFPKDEPITNEVVTKFMEKHRLEV----ARYEYLKNMYRGIMAIDAEPT-KDLWKPDNRLTVNFTKYIVDTFTGYFNG- 82 (453) T ss_pred eEcCCCCCCCHHHHHHHHHHHHHHH----HHHHHHHHHhhccCchhcCCC-ccccCccceeecchHHHHHHHHhhhhcc- Confidence 2222223345666666655554433 4555555554321 11111 1111233466667788888888877643 Q ss_pred hcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccC Q lcl|NC_015159. 78 LFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEG 157 (532) Q Consensus 78 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~ 157 (532) -| ++++..+.. + ...+.+.+..++|.....++.++..++|.|.+++..++ . T Consensus 83 -~~-----~~~~~~d~~-------------~-------~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~---~ 133 (453) T protein:vir:39 83 -IP-----VKKSHSDKE-------------T-------LSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNE---E 133 (453) T ss_pred -cC-----ceeccCChH-------------H-------HHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecC---C Confidence 11 222323211 1 22345667788999999999999999999999887653 3 Q ss_pred CcceEEEEecceEEEee-CCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEE Q lcl|NC_015159. 158 QSNAPKLYKLHNFVVER-DAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEI 236 (532) Q Consensus 158 ~~~~~~~~pl~~~~v~~-d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~ 236 (532) +.+++++++..+.++.. |..++....+.++... .+....+++|+ ++. ..++.. T Consensus 134 g~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~------------------~~~~~~~~~yt-----~~~---i~~~~~ 187 (453) T protein:vir:39 134 TQTNVIYNTPENMFMVYDDTIKQEPLFAVRYGYD------------------DDYKLYGEVYT-----KET---TYALNG 187 (453) T ss_pred CceEEEEEcccceEEEecCCCCCeEEEEEEEEEe------------------CCeEEEEEEEe-----CCe---EEEEEe Confidence 45677888876654443 4444433333333211 11112233332 111 011111 Q ss_pred cCccc-ccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhh Q lcl|NC_015159. 237 DGEIV-AGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRV 315 (532) Q Consensus 237 ~~~~~-~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~ 315 (532) ++... ......+++..+|++..+. +.+|+|=.+...+-+..++.+.-......+....|.+.+.. .....+.+ T Consensus 188 ~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g-~~~~~~~~ 261 (453) T protein:vir:39 188 TMGFYNMTEQAPNPFDDLPVVEFYF-----NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLG-AAVEEEDL 261 (453) T ss_pred cCCceeeecccccCCCceeEEEecC-----CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeec-CCCCchhh Confidence 11111 1122345677899987653 45799988999999999999999999999999988776642 12222222 Q ss_pred ccCCC-cee-ecCc---cccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhh Q lcl|NC_015159. 316 AKANT-GDF-VAGR---KQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLG 390 (532) Q Consensus 316 ~~~~~-G~~-v~g~---~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LG 390 (532) ..... +.+ +++. ..+..+..+....+.+.....++.++..|...-..-.+....-...|+.-+..+..-+... . T Consensus 262 ~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k-a 340 (453) T protein:vir:39 262 KNIRSNRVINYYGESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNL-A 340 (453) T ss_pred hhhhhcceeeecCCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccCChHHHHHHHHHHHHHH-H Confidence 22222 222 2211 1112222333334566677777777776644321111111111234555544333222222 2 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHH Q lcl|NC_015159. 391 GVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVK 470 (532) Q Consensus 391 pv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~ 470 (532) --..+.-.+.+..++.-+..++...|.- .....+++.+.-. .+-..++.++.+.. ++++ +....++ T Consensus 341 ~~~~~~~~~~l~~~~~li~~~~~~~~~~--~~~~~i~v~f~~~-~p~~~~~~a~~~~k----l~g~-------is~et~l 406 (453) T protein:vir:39 341 LSFQRKFQSSLNSRYKLYCELSTNVSNK--EAWKDIEYTFTRN-EPKDIKEQAETANI----LMGI-------TSQETAL 406 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCc--cccccceEEeCCC-CCcCHHHHHHHHHH----Hhcc-------CChHHHH Confidence 2233333334444444444444444421 1112233433211 11112222222222 2222 2223333 Q ss_pred HHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCC--C Q lcl|NC_015159. 471 MRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPT--Q 532 (532) Q Consensus 471 ~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~--~ 532 (532) ..+|. +.+ ++|++...++...... ....... .....++..++ | T Consensus 407 ----~~l~~-----v~D~~~E~~ri~~E~~~~~~--~~~~~~~--------~~~~~~~~~~~~~~ 452 (453) T protein:vir:39 407 ----SVISV-----IPDVQAEMEKIKKEEASTAI--FDKDKQP--------SEKGTDTVVPETNE 452 (453) T ss_pred ----HhCCC-----CCCHHHHHHHHHHHHHHHHH--HHHhccC--------CCCCCCCCCCCcCC Confidence 23332 222 3444333222221111 1110000 01111111111 1 No 57 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=98.75 E-value=6.4e-08 Score=60.01 Aligned_cols=445 Identities=11% Similarity=0.015 Sum_probs=198.0 Q ss_pred CCCCCCCccC-HHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccc---cC-CCCCcccccccccccchHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFA-ADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV---FP-SATADGSTSYTTPWQSIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~~~~~-~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~---~~-~~~~~~~~~~~~~~dst~~~a~~~Laa~l~ 75 (532) |.+.+....+ .+.+.+..+..+..+ .++++++.+|..... .. .......+...++..+.+...++..++.|+ T Consensus 30 ~~~~~~~~~~~~~~i~~~i~~~~~~~---~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~ 106 (501) T protein:vir:96 30 ADNLEELMVNNWELLKNFINHHKLRQ---APRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLA 106 (501) T ss_pred ccccccccCChHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCCcccCccccCccccccceeecchHHHHHHHHhhhhc Confidence 5555443333 333444444444433 245666666654421 11 111111223446777888888888877665 Q ss_pred HhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_015159. 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQV 155 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~ 155 (532) +- | +++...+... ..++. ..+...+..++|.....++.++..+||.|.+++..++ T Consensus 107 g~--p-----~~~~~~~~~~---------~~~~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~de-- 161 (501) T protein:vir:96 107 GN--P-----IRVEYDDNDD---------NSQND-------DAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSE-- 161 (501) T ss_pred cc--C-----eeEeeCCccc---------hhHHH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcC-- Confidence 32 1 1223222111 11233 3344567788999999999999999999988876543 Q ss_pred cCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEE Q lcl|NC_015159. 156 EGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSY 233 (532) Q Consensus 156 ~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~ 233 (532) ++.+++..++..+.++..|. .+++...+|.+..... .+....+++|+ ++. ++ T Consensus 162 -dg~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~----------------~~~~~~~~vyt-----~~~----i~ 215 (501) T protein:vir:96 162 -YDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTL----------------QSAKDVVEIYT-----DEH----IY 215 (501) T ss_pred -CCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeecC----------------CCcEEEEEEEc-----CCc----EE Confidence 35677888888776555554 4777766655432111 01112233321 121 12 Q ss_pred EEE-cCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccCh Q lcl|NC_015159. 234 QEI-DGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQI 312 (532) Q Consensus 234 ~~~-~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~ 312 (532) ++. .+..........++..+|++..+ ++..|+|-.+...+-+..++.+.-...........|.+.+.-...... T Consensus 216 ~~~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~ 290 (501) T protein:vir:96 216 TLDASDDFNEISVTTHAFGTVPITEYL-----NNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPK 290 (501) T ss_pred EEeeCCCceeccccccCCCccceEEec-----CCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCc Confidence 222 22211222234567788987653 456799999999999999999988888888888888876643211111 Q ss_pred h-hhcc-CCCceeec-------CccccccccccCCccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCCCCHHHHHHHH Q lcl|NC_015159. 313 R-RVAK-ANTGDFVA-------GRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM-LNSAVQRGGDRVTAEEIRYVA 382 (532) Q Consensus 313 ~-~~~~-~~~G~~v~-------g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~~~TAtEi~~r~ 382 (532) . .... ...+.+.. +...++.+..+....+.......++.+++.|...=. .+......+...|+..+..+. T Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~ 370 (501) T protein:vir:96 291 GMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKL 370 (501) T ss_pred ccchhhhhhcCeeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHH Confidence 0 0000 01122211 111112222222223344455556666665543211 111111112335666554433 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhh Q lcl|NC_015159. 383 GELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDD 462 (532) Q Consensus 383 ~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d 462 (532) .-.. ...-...+.-.+.+.-+++.++.++...+.........+++.+. +.-|-..++.++.+... +.+ T Consensus 371 ~~l~-~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~-~~~p~n~~e~ad~~~kl----~g~------ 438 (501) T protein:vir:96 371 FGLD-QDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFDESLLKITFT-PNLPKSLNEQVSILTGL----GGQ------ 438 (501) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccceEEeC-CCCCcCHHHHHHHHHHH----hcc------ Confidence 2222 22222333333334444444444444443322222233444442 21221222222222222 111 Q ss_pred hcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhccccc--CCCCC Q lcl|NC_015159. 463 DINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQA--GLPTQ 532 (532) Q Consensus 463 ~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--g~~~~ 532 (532) +....++.. ++ ++.+ ++|++...++++.+. ......+.... .+....+. +-+.+ T Consensus 439 -iS~et~~~~----l~-----~v~D~~~E~~ri~~E~~~~~-~~~~~~~~~~~-----~~~~~~~~~e~~~d~ 495 (501) T protein:vir:96 439 -VSQETALSL----SG-----LVESPNEELDKINKEMSEID-FKGYSNDFNEH-----VGKYTDEVKETHTDD 495 (501) T ss_pred -CchHHHHHh----CC-----CCCCHHHHHHHHHHHHHHhh-ccccccchhhc-----ccccCCcCCCCCCCc Confidence 222333322 22 2323 234433322221111 00000000000 00011111 11111 No 58 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=98.74 E-value=7.1e-08 Score=59.76 Aligned_cols=445 Identities=10% Similarity=-0.000 Sum_probs=200.5 Q ss_pred CCCCCC-CccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc-----ccCCCCCcccccccccccchHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEK-TGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPS-----VFPSATADGSTSYTTPWQSIGARGLNNLASKL 74 (532) Q Consensus 1 m~~~~~-~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l 74 (532) |...+. ...+.+.+.+.-+.....|. ++++++.+|..-. +.... ....+...++..+.+...+++.++.| T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~---~r~~~l~~Yy~g~~~i~~~~~~~-~~~~~~~~ki~~n~~k~Iv~~~~~yl 106 (511) T protein:vir:99 31 YDGTESDLLQNVNEVSKYIEHHMDYQR---PRLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVAHDYASYISDFINGYF 106 (511) T ss_pred cchhhhhhhccHHHHHHHHHHHHHhhH---HHHHHHHHHhcccCccccccCcc-cccccCcceeecchHHHHHHHHHhhh Confidence 544433 22244555555544444443 4555555555322 11111 11112234677777888888877666 Q ss_pred HHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 75 MLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 75 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) ++ -|+. ++.+|.. + ...+.+.+..++|.....++.++..+||.+.+++..++ T Consensus 107 ~g--~p~~-----~~~~d~~-------------~-------~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~de- 158 (511) T protein:vir:99 107 LG--NPIQ-----YQDDDKD-------------V-------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ- 158 (511) T ss_pred cc--cCce-----eecCchH-------------H-------HHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCC- Confidence 53 2222 2333221 1 22344566778899999999999999999988776542 Q ss_pred ccCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEE Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRS 232 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s 232 (532) ++.+++.+++..+.++..|. .+++...+|.+......- ...+.-..+++|+ ++. -|.. T Consensus 159 --d~~~~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~~~------------~~~~~~~~~~vyt-----~~~-i~~~ 218 (511) T protein:vir:99 159 --DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDK------------TDEDEVFTVDLFT-----SHG-VYRY 218 (511) T ss_pred --CCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccc------------CccceEEEEEEEe-----CCc-EEEE Confidence 35577888888776555544 367766666554311000 0001111222332 221 1111 Q ss_pred EEEEcCcc-----cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCc Q lcl|NC_015159. 233 YQEIDGEI-----VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPN 307 (532) Q Consensus 233 ~~~~~~~~-----~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~ 307 (532) .. ..+.. ........++..+|++.++- +..|.|-.+..++-+..++.+.-......+....|.+.+... T Consensus 219 ~~-~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~ 292 (511) T protein:vir:99 219 LT-SRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN 292 (511) T ss_pred Ee-cCCccccccccccccccCCCCccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccC Confidence 11 00000 01112345677899888764 357999999999999999998888888888888887666433 Q ss_pred cccChhhhccCCCceeec------------CccccccccccCCccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCCCC Q lcl|NC_015159. 308 GVTQIRRVAKANTGDFVA------------GRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM-LNSAVQRGGDRVT 374 (532) Q Consensus 308 g~~~~~~~~~~~~G~~v~------------g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~~~T 374 (532) +......+.....+..+. +..++..+..+....+.......++.+.+.|...=+ .+.....-+...| T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~S 372 (511) T protein:vir:99 293 LNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS 372 (511) T ss_pred cccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccch Confidence 333333222111111111 011112222233333455566667777766643211 1111111123346 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc--cccceeecchHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 375 AEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKE--AVEPAIATGLEALGRGHDLNKLNVFIDY 452 (532) Q Consensus 375 AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~--~~~~~~v~~l~~l~raq~~~~l~~~~~~ 452 (532) +..+..+.. .+........+.-.+.+.-+++-++.++...+... .+.+ .+++.+. +-.|-..++.++.+.. T Consensus 373 g~Alk~~~~-~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~-~~~~~~~i~i~f~-~~~p~n~~e~~~~~~k---- 445 (511) T protein:vir:99 373 GEAMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID-VSKDFNTVRYVYN-RNLPKSLIEELKAYID---- 445 (511) T ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc-cccccccceEEeC-CCCCcCHHHHHHHHHH---- Confidence 655554433 22333333444444444444444444444433222 1222 2333332 1112112222222221 Q ss_pred HHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCC Q lcl|NC_015159. 453 MIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPT 531 (532) Q Consensus 453 laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 531 (532) ++. .+....+++. ++ ++.+ ++|++.+.++++.+.. +... ....... ..-..+..-.+ T Consensus 446 l~G-------iiS~et~l~~----l~-----~v~D~~~E~~ri~~E~~~~~~--~~~~---~~~~~~~-~~~~~~~~~~~ 503 (511) T protein:vir:99 446 SGG-------KISQTTLMSL----FS-----FFQDPELEVKKIEEDEKESIK--KAQK---NMYQDPR-NINDDEQDDST 503 (511) T ss_pred Hhc-------cCCHHHHHHh----CC-----CCCCHHHHHHHHHHHHHHHHH--HHhh---cccccCC-CCCCCCCCCCC Confidence 111 1223333332 22 2323 3444443333322211 1111 1111111 11111111111 Q ss_pred C Q lcl|NC_015159. 532 Q 532 (532) Q Consensus 532 ~ 532 (532) + T Consensus 504 ~ 504 (511) T protein:vir:99 504 K 504 (511) T ss_pred c Confidence 1 No 59 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=98.73 E-value=7.5e-08 Score=59.66 Aligned_cols=439 Identities=10% Similarity=0.042 Sum_probs=190.9 Q ss_pred CCCC-CCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc-----ccCCC--CC--cccccccccccchHHHHHHHH Q lcl|NC_015159. 1 MAEV-EKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPS-----VFPSA--TA--DGSTSYTTPWQSIGARGLNNL 70 (532) Q Consensus 1 m~~~-~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~~--~~--~~~~~~~~~~dst~~~a~~~L 70 (532) |-+. -+...+.+.+......|.++-+.-..+++.+.+|..-. +.... .. ...+...++..+-+...++.+ T Consensus 11 ~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~ 90 (472) T protein:vir:93 11 IFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQK 90 (472) T ss_pred hhhceeeecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHHHHH Confidence 2222 11111222222333333222222335666666665442 11110 00 111223467778888889988 Q ss_pred HHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeec Q lcl|NC_015159. 71 ASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIP 150 (532) Q Consensus 71 aa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~ 150 (532) ++.|++ .| +++...|.. +.+.|. .+..++|-..++++.++..++|.|.+++. T Consensus 91 ~~~l~g--~~-----~~~~~~d~~-------------~~~~l~--------~~~~n~~~~~~~~~~~~~~~~G~~~~~v~ 142 (472) T protein:vir:93 91 VSYIVG--KP-----IAFKHTDDE-------------VVKRID--------EVLGNRFDDKLHSVLTGASNKGIEWLHPY 142 (472) T ss_pred hhhhcc--cC-----eeeccCChH-------------HHHHHH--------HHHhccHHHHHHHHHHHHhhcCeEEEEEE Confidence 887753 12 223333321 222221 22246888999999999999999988876 Q ss_pred ccccccCCcceEEEEecceEEEeeC--CCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEE-----EEe Q lcl|NC_015159. 151 STEQVEGQSNAPKLYKLHNFVVERD--AYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTH-----VYR 223 (532) Q Consensus 151 ~~~~~~~~~~~~~~~pl~~~~v~~d--~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~-----v~~ 223 (532) .++ ++.+++.+++..+.++..| ..+++...+|.+...- ...+++|+. ... T Consensus 143 ~d~---d~~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~--------------------~~~~~~~~~~~~~~~~~ 199 (472) T protein:vir:93 143 LDE---EGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN--------------------ETKVEYWDKVTVNYYVY 199 (472) T ss_pred ECC---CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeec--------------------ceeEEEEecCeEEEEEE Confidence 543 3456788888877555443 4677776666554210 112333321 111 Q ss_pred eCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Q lcl|NC_015159. 224 DPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFF 303 (532) Q Consensus 224 ~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~l 303 (532) +.+...+......++.. ......++..+|++.++. +.+|+|=.+...+-+..++.+.-......+....|.++ T Consensus 200 ~~~~~~~~~~~~~~~~~--~~~~~~~~~~vPvv~~~n-----n~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~ 272 (472) T protein:vir:93 200 ENGSLIPDYSNNLENSK--THFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYV 272 (472) T ss_pred ecCeeeecccccccccc--cccccCCCCCcceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeE Confidence 11111111111111111 122345678899987764 45899999999999999998888888888888888876 Q ss_pred ecCccccChhhhc-cCC-CceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcc-cCCCCCCCHHHHHH Q lcl|NC_015159. 304 VNPNGVTQIRRVA-KAN-TGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAV-QRGGDRVTAEEIRY 380 (532) Q Consensus 304 v~~~g~~~~~~~~-~~~-~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~-~~~~~~~TAtEi~~ 380 (532) +.--......... ... .+.+.....+++..+. ...+.......++.++..|.+.-..-.+. ...+...|+.-+.. T Consensus 273 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~ 350 (472) T protein:vir:93 273 LTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQ--VEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEF 350 (472) T ss_pred eecCCcccchhhHHHHhhccccccCCCCcceeEe--ecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHH Confidence 6421111111111 011 1233223334444332 22345556666777766665432211111 11122345544332 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceee-cchHHHHHHHHHHHHHHHHHHHHhhcch Q lcl|NC_015159. 381 VAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIA-TGLEALGRGHDLNKLNVFIDYMIKLAGL 459 (532) Q Consensus 381 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v-~~l~~l~raq~~~~l~~~~~~laq~~p~ 459 (532) ...-.. ... .+.+. .+...+.+++.++.+.- ..+.+...+.++ ++..|-..++.++.+.. ++.+ T Consensus 351 ~~~~l~-~ka---~~~~~-~~~~~l~~~~~li~~~~---~~~~~~~~i~v~f~~~~p~~~~~~~~~~~k----~~gi--- 415 (472) T protein:vir:93 351 LYTNLN-LKA---DKLAR-KAKVAIQELLWFVFEHF---DIKGEHKDVDISFNYNKVANTELQVQTAQQ----SMGI--- 415 (472) T ss_pred HHHHHH-HHH---HHHHH-HHHHHHHHHHHHHHHHh---CCCcccceeeEEeCCCCCCCHHHHHHHHHH----Hhcc--- Confidence 211111 111 22222 12222333333333211 112233333321 22222112222222222 1222 Q ss_pred hhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 460 QDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 460 ~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) +....++ ..++. +.+ ++|++...++++.+++..+ ....+....+...++.+..++ T Consensus 416 ----is~et~l----~~l~~-----~~d~~~E~~ri~~E~~~~~~~~~-----~~~~~~~d~~~~~~~~~~~~~ 471 (472) T protein:vir:93 416 ----VSHETVL----ENHPF-----VEDLQAELERIEQEQMEYNKQLP-----NLDDGGADGAQQQERSNNKES 471 (472) T ss_pred ----CchHHHH----HhCCC-----CCCHHHHHHHHHHHHHHHHHhcc-----CcCcccCCCCCCCCCCCcccC Confidence 1222222 22332 222 3444443333322222111 111111111222233333333 No 60 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=98.73 E-value=8e-08 Score=59.50 Aligned_cols=448 Identities=11% Similarity=0.002 Sum_probs=199.6 Q ss_pred CCCCCCC-ccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccc---cCCC-CCcccccccccccchHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKT-GFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV---FPSA-TADGSTSYTTPWQSIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~~-~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~---~~~~-~~~~~~~~~~~~dst~~~a~~~Laa~l~ 75 (532) +-..+.. ..+.+.+.+..+.-+..+ .++|+++.+|..... .... .....+...++..+.+...++..++.|+ T Consensus 30 ~~~~~~~~~~~~~~l~~~i~~~~~~~---~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~ 106 (501) T protein:vir:27 30 ADNLEELMVNNWELLKNFINHHKLRQ---APRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYLA 106 (501) T ss_pred cccccccccccHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCccccccCccCccccccceeccchHHHHHHHHhhhhc Confidence 4443332 223444555444444333 345666666655421 1111 1111223446777777888888777765 Q ss_pred HhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_015159. 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQV 155 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~ 155 (532) +- | +++...|.... ..+ ...+.+.+..++|.....++.+++.+||.+.+++..++ T Consensus 107 g~--p-----~~~~~~d~~~~---------~~~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~de-- 161 (501) T protein:vir:27 107 GN--P-----IRVEYDDNDNN---------SQN-------DDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNE-- 161 (501) T ss_pred cc--C-----eeEecCCccch---------HHH-------HHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCC-- Confidence 42 1 12333322111 112 23344567788999999999999999999988886643 Q ss_pred cCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEE Q lcl|NC_015159. 156 EGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSY 233 (532) Q Consensus 156 ~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~ 233 (532) ++.+++..++..+.++..|. .+++...+|.+..... .+....++||+ ++. -| + T Consensus 162 -d~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~----------------~~~~~~~~vyt-----~~~-v~--~ 216 (501) T protein:vir:27 162 -YDETRIKRLNPLETFVIYDNSLEDNSIAAVRYYNRGTL----------------QNAKDVVEIYT-----NEH-IY--T 216 (501) T ss_pred -CCceEEEEEccceeEEEecCCCCCceEEEEEEEEeeec----------------CCcEEEEEEEe-----CCe-EE--E Confidence 34567788877665444433 4666666555542111 11112233331 121 11 1 Q ss_pred EEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccC-h Q lcl|NC_015159. 234 QEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQ-I 312 (532) Q Consensus 234 ~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~-~ 312 (532) +...+..........++..+|++..+- +..|+|-.+..++-+..++.+.-...........|.+.+.-..... . T Consensus 217 ~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~ 291 (501) T protein:vir:27 217 LDASDDFNEISVTTHAFGTVPITEFLN-----NVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKG 291 (501) T ss_pred EEeCCceeeccccccCCCcccEEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcc Confidence 222222222222345677899887643 4679999999999999999999888888888888887654321111 1 Q ss_pred hhhcc-CCCceeec-------CccccccccccCCccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCCCCHHHHHHHHH Q lcl|NC_015159. 313 RRVAK-ANTGDFVA-------GRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM-LNSAVQRGGDRVTAEEIRYVAG 383 (532) Q Consensus 313 ~~~~~-~~~G~~v~-------g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~~~TAtEi~~r~~ 383 (532) +.... ...+.+.. |..+++.+..+....+.+.....++.+++.|...-. .+......+...|+..+..... T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~ 371 (501) T protein:vir:27 292 MQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKLF 371 (501) T ss_pred cchhhhhhcCceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHHHHHHHHH Confidence 11000 01122211 111222222222223444555666666666544211 1111111123345555443322 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhh Q lcl|NC_015159. 384 ELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDD 463 (532) Q Consensus 384 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~ 463 (532) - +....-...+.-.+.+.-++..++.++...+....+....+++.+. +.-|-..++.++.+.. ++.+ T Consensus 372 ~-l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~-~~~p~n~~e~ad~~~k----l~g~------- 438 (501) T protein:vir:27 372 G-LDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESLLKITFT-PNLPKSLNEQVSILTG----LGGQ------- 438 (501) T ss_pred H-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeC-CCCCcCHHHHHHHHHH----Hhcc------- Confidence 2 2223333444444455555555555554444322232233444442 2112112222222221 2221 Q ss_pred cCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 464 INLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 464 id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) +....++ ..++ ++.+ ++|++..+++++...... +++......+...-.....-+++ T Consensus 439 iS~et~l----~~l~-----~v~D~~~E~eri~~E~~e~~~~~----~~~~~~~~~~~~~d~~~~~~~d~ 495 (501) T protein:vir:27 439 VSQETAL----SLSG-----LVESPNEELDKINKEVSEIDFKG----YSNDFNEHVGKYTDEVKETHTDD 495 (501) T ss_pred CcHHHHH----HhCC-----CCCCHHHHHHHHHHHHHhhhHhh----hcCccccccccccCCCCCCcccc Confidence 2122222 2222 2222 344443333322111111 11110000000000000011111 No 61 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=98.71 E-value=9.2e-08 Score=59.17 Aligned_cols=421 Identities=9% Similarity=-0.025 Sum_probs=178.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhh--cccccCCCCCcccccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYT--IPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLAL 78 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~--~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 78 (532) |..++ .+.+....+.... +.++....+.+|+-- ++... ...+...+..++..+-+...++.++..| T Consensus 1 ~~~~~-----~~~i~~l~~~~~~-~~~r~~~l~~Yy~G~~~i~~~~--~~~~~~~~~~k~~~n~~~~ivd~~~~~l---- 68 (441) T protein:vir:80 1 MNSDE-----LALIEGMYDRIQR-LSSWHCCIEGYYEGSNRVRDLG--VAIPPELQRVQTVVSWPGIAVDALEERL---- 68 (441) T ss_pred CCccH-----HHHHHHHHHHHHH-HHHHHHHHHHHHhcCCcchhcC--cccchhhhhhhhhcchHHHHHHHHHhhh---- Confidence 44433 2333333333332 223333334444221 22211 1111111233566666677777776655 Q ss_pred cCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCC Q lcl|NC_015159. 79 FPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQ 158 (532) Q Consensus 79 tpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~ 158 (532) +|.+ | ..++. +.+ .+....++|....+++.++..+||.|.+++-.++ .+ T Consensus 69 ~~~g---~--~~~d~------------~~l-----------~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~---~g 117 (441) T protein:vir:80 69 DWLG---W--TNGDG------------YGL-----------DGVYAANRLATASCDVHLDALIFGLSFVAIIPHG---DG 117 (441) T ss_pred cccc---c--cCCCh------------HHH-----------HHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCC---CC Confidence 3332 2 12211 112 2344568999999999999999999988776543 34 Q ss_pred cceEEEEecceEEEeeC-CCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEc Q lcl|NC_015159. 159 SNAPKLYKLHNFVVERD-AYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEID 237 (532) Q Consensus 159 ~~~~~~~pl~~~~v~~d-~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~ 237 (532) ..+++.++..+.++..| ..+++...++++.-. .+....+++|. ++ ..+ +++.. T Consensus 118 ~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~------------------~~~~~~~~vy~-----~~-~~~--~~~~~ 171 (441) T protein:vir:80 118 TVSVRPQSPKNCTGKFSADGSRLDAGLVVQQTC------------------DPEVVEAELLL-----PD-VIV--QVERR 171 (441) T ss_pred ceEEEEEccceEEEEEeCCCCceeEEEEEEEEe------------------cCceEEEEEEe-----cC-eEE--EEEEc Confidence 55788888877665555 456666655554311 01112233331 11 101 11111 Q ss_pred C--cccccccccCccccCceEEEEeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccc-cChh Q lcl|NC_015159. 238 G--EIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVE-EYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGV-TQIR 313 (532) Q Consensus 238 ~--~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~-~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~-~~~~ 313 (532) + ..........+|..+|++.+.-+...++.||+|-.. +..+-+..++...-......+....|.+.+.--.. -... T Consensus 172 ~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~G~~~~~~~~ 251 (441) T protein:vir:80 172 GSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVTGVSADEFSQ 251 (441) T ss_pred CCcceeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeeecCCcccccc Confidence 1 111112234567889999988888889999998554 45676778888877888888888888655521000 0011 Q ss_pred hhccCCCceeec--Ccccc--ccccccCCccchhHHHHHHHHHHHHHHHHHhhh-----hcccCCCCC-CCHHHHHHHHH Q lcl|NC_015159. 314 RVAKANTGDFVA--GRKQD--VEVFQLEKYNDFQVAKATADDIEKRLSYAFMLN-----SAVQRGGDR-VTAEEIRYVAG 383 (532) Q Consensus 314 ~~~~~~~G~~v~--g~~~~--~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~-----~~~~~~~~~-~TAtEi~~r~~ 383 (532) .......|.+.. ++.+. +...++. .++++.-.. .++.-|...+... .+. ..+.. -++.-+..... T Consensus 252 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~-~~~~~~~~~---~l~~~i~~~~~~~~~p~~~~g-~~~~~~~Sg~Al~~~~~ 326 (441) T protein:vir:80 252 PGWVLSMASVWAVDKDDDGDTPNVGSFP-VNSPTPYSD---QMRLLAQLTAGEAAVPERYFG-FITSNPPSGEALAAEES 326 (441) T ss_pred chhhhcccccccCCCCCCCCcceeEecC-ccchHHHHH---HHHHHHHHHhcccCCCHHHhc-cCCCcchHHHHHHHHHH Confidence 111223344432 22111 2222222 233443333 3333333322111 111 11111 13333333222 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCccccccceee-cchHHHHHHHHHHHHHHHHHHHHhhcchhh Q lcl|NC_015159. 384 ELEDTLGGVYSLLSQELQLPLVKILLKELQAT-SKIPNLPKEAVEPAIA-TGLEALGRGHDLNKLNVFIDYMIKLAGLQD 461 (532) Q Consensus 384 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~-g~lp~~p~~~~~~~~v-~~l~~l~raq~~~~l~~~~~~laq~~p~~~ 461 (532) .+. -...+.+..|- +-+.+.+.++.+. |.....+.+..++.++ ++..+-..++.++.+.. +.+...... T Consensus 327 ~l~----~k~~~~~~~f~-~~l~~~~~l~~~~~~~~~~~~~~~~~i~~~f~~~~~~~~~e~ad~~~k----l~~~g~~~~ 397 (441) T protein:vir:80 327 RLV----KRAERRQTSFG-QGWLSVGFLAAKALDSRVDEADFFGDVGLRWRDASTPTRAATADAVTK----LVGAGILPA 397 (441) T ss_pred HHH----HHHHHHHHHHH-HHHHHHHHHHHHHhcCCCcccccceeeeEEeCCCCCcCHHHHHHHHHH----HHhcCcccc Confidence 222 22334333333 3344544444432 3333333333333321 11122122222222221 222111111 Q ss_pred hhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 462 DDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 462 d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .. ..+...+|.+ ++|++.+.+++++++ .+..+. .....+.+.| T Consensus 398 ---s~----~~~~~~l~~~-------~~e~~~~~~e~~e~~--~~~~~~------------~~~~~~~~~~ 440 (441) T protein:vir:80 398 ---DS----RTVLEMLGLD-------DVQVEAVMRHRAESS--DPLAVL------------AGAISRQTNE 440 (441) T ss_pred ---cH----HHHHHhCCCC-------HHHHHHHHHHHHHHH--HHHHHH------------hhhhhccccc Confidence 11 1122344542 344433222111111 111111 1112233333 No 62 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=98.71 E-value=9.2e-08 Score=59.17 Aligned_cols=449 Identities=9% Similarity=-0.007 Sum_probs=196.6 Q ss_pred CCCCCC-CccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc-----ccCCCCCcccccccccccchHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEK-TGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPS-----VFPSATADGSTSYTTPWQSIGARGLNNLASKL 74 (532) Q Consensus 1 m~~~~~-~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l 74 (532) |...+. ...+.+.+.+.-+.....|. ++++++.+|..-. +..... ...+...++..+.+...++..++.| T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~---~r~~~l~~Yy~g~~~il~~~~~~~-~~~~~~~ki~~n~~k~Iv~~~~~yl 106 (511) T protein:vir:93 31 YDGTESDLLQNVNEVSKYIEHHMDYQR---PRLKVLSDYYEGKTKNLVELTRRK-EEYMADNRVAHDYASYISDFINGYF 106 (511) T ss_pred ccchhhhhhccHHHHHHHHHHHHHhhH---HHHHHHHHHhcccCccccccCcCc-ccccCcceeecchHHHHHHHHhhhh Confidence 544322 22244555555555445444 3445555554332 111111 1112234676777777788777666 Q ss_pred HHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 75 MLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 75 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) ++ -|+ +++.++.. +. ..+.+.+..++|.....++.+++.+||.|.+++..++ T Consensus 107 ~g--~p~-----~~~~~d~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de- 158 (511) T protein:vir:93 107 LG--NPI-----QYQDDDKD-------------VL-------EVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ- 158 (511) T ss_pred cc--cCe-----eeccCChH-------------HH-------HHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCC- Confidence 43 221 12333221 12 2334556678899999999999999999988876543 Q ss_pred ccCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEE Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRS 232 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s 232 (532) ++.+++.+++..+.++..|. .+++...+|.+...... ....+.-..+++|+ ++. -|. T Consensus 159 --~~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~------------~~~~~~~~~~~iyt-----~~~-i~~- 217 (511) T protein:vir:93 159 --DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID------------KTDEDEVFTVDLFT-----SHG-VYR- 217 (511) T ss_pred --CCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc------------ccccceEEEEEEEe-----CCc-EEE- Confidence 35567788887665544443 46776655554421110 00001112233332 111 111 Q ss_pred EEEEcCcc------cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecC Q lcl|NC_015159. 233 YQEIDGEI------VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNP 306 (532) Q Consensus 233 ~~~~~~~~------~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~ 306 (532) +..++.. ........++..+|++.++- +..|.|=.+..++-+..++.+.-...........|.+.+.- T Consensus 218 -~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G 291 (511) T protein:vir:93 218 -YLTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKG 291 (511) T ss_pred -EEecCCCccccccccccccccCCCccceEEecC-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeec Confidence 1111100 01112345677899887653 45789989999999999998888888888888888776643 Q ss_pred ccccChhhhccCCCceee--------cC----ccccccccccCCccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCCC Q lcl|NC_015159. 307 NGVTQIRRVAKANTGDFV--------AG----RKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM-LNSAVQRGGDRV 373 (532) Q Consensus 307 ~g~~~~~~~~~~~~G~~v--------~g----~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~~~ 373 (532) ........+...+.+.+. .+ ..++..+..+....+.......++.+.+.|...-. .+.....-+... T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~ 371 (511) T protein:vir:93 292 NLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQ 371 (511) T ss_pred CcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccc Confidence 222333332222211111 10 11112222333334566666777777776654321 111111112334 Q ss_pred CHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc--cccceeecchHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 374 TAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKE--AVEPAIATGLEALGRGHDLNKLNVFID 451 (532) Q Consensus 374 TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~--~~~~~~v~~l~~l~raq~~~~l~~~~~ 451 (532) |+..+..... .+........+.-.+.+.-+++-++.++...+... .+.+ .+++.+ ++-.|-..++.++.+.. T Consensus 372 Sg~Al~~~~~-~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~-~~~d~~~i~~~f-~~~~p~n~~e~~~~~~k--- 445 (511) T protein:vir:93 372 SGEAMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSID-ANKDFNTVRYVY-NRNLPKSLIEELKAYID--- 445 (511) T ss_pred hHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc-cccccccceEEe-CCCCCCCHHHHHHHHHH--- Confidence 6555544332 22222333333334444444433344433333221 1222 234444 22122222222222221 Q ss_pred HHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCC Q lcl|NC_015159. 452 YMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLP 530 (532) Q Consensus 452 ~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 530 (532) ++.+ +....++.. ++ ++.+ ++|++...+++..+...++ ......+.+........+-.... T Consensus 446 -l~g~-------iS~et~~~~----l~-----~v~d~~~E~~ri~~E~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 507 (511) T protein:vir:93 446 -SGGK-------ISQTTLMSL----FS-----FFQDPELEVKKIEEDEKESIKKAQ-KGIYKDPRDINDDEQDDDTKDTV 507 (511) T ss_pred -Hhcc-------CchHHHHHh----CC-----CCCCHHHHHHHHHHHHHHHHHHHh-hhcccCCCCCCCCCCCCcccccc Confidence 1222 222333322 22 2222 3444443333322211111 10000111000000000001111 Q ss_pred CC Q lcl|NC_015159. 531 TQ 532 (532) Q Consensus 531 ~~ 532 (532) +| T Consensus 508 ~~ 509 (511) T protein:vir:93 508 DK 509 (511) T ss_pred cc Confidence 11 No 63 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.71 E-value=9.2e-08 Score=59.15 Aligned_cols=440 Identities=10% Similarity=0.058 Sum_probs=187.0 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccccc-CCCCCccc-ccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVF-PSATADGS-TSYTTPWQSIGARGLNNLASKLMLAL 78 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~-~~~~~~~~-~~~~~~~dst~~~a~~~Laa~l~~~l 78 (532) |.++-+...+...+ .+-.+.......|+.+|+=--|-.. +.....+. ....++--..+...++.+|+-|++- T Consensus 18 ~~~~~~~~~~~~~i-----~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~~iv~~~a~~l~~e- 91 (499) T protein:vir:80 18 LLKSLKDVTDHKKV-----NANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNLPKVTAKYMSKLLFNE- 91 (499) T ss_pred cccchhhhhcCCCC-----cCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecchHHHHHHHHHHhhhCC- Confidence 22221111110000 0111112334556666541111111 11111111 1122333456677777777655543 Q ss_pred cCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCC Q lcl|NC_015159. 79 FPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQ 158 (532) Q Consensus 79 tpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~ 158 (532) |+ .++++|. +..++|. +.+..++|...+.++..+...+|.+++.+-.++ .+ T Consensus 92 -p~-----~i~~~d~-------------~~~e~l~-------~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~---~~ 142 (499) T protein:vir:80 92 -KV-----KINIDDE-------------TAEEFVL-------NVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDG---NK 142 (499) T ss_pred -cc-----eEeeCCH-------------HHHHHHH-------HHHhhccHHHHHHHHHHHHhhcCcEEEEEEECC---CC Confidence 22 2333332 2333443 466678899999999999999999998776543 35 Q ss_pred cceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeC-CCCeEEEEEEE- Q lcl|NC_015159. 159 SNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDP-EAMVFRSYQEI- 236 (532) Q Consensus 159 ~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~-~~~~~~s~~~~- 236 (532) .+.+..++..+++--....|++..+.+....+.+ .+.+..++. | ++.. +...|...+.. T Consensus 143 ~~~i~~v~a~~~~Pi~~d~~~~~~~~f~~~~~~~----------------~~~y~~lE~-h--~~~~~~~~~y~I~n~~~ 203 (499) T protein:vir:80 143 NVKVSFATADCMYPLSNDSENVDECLIANSFHKN----------------NKYYKLLEW-N--EWKGEKEEVYTVTTELY 203 (499) T ss_pred cEEEEEEcCCceEEEEecCCCeEEEEEEEEEeec----------------CeEEEEEEE-E--EecccceeeEEEEEEEE Confidence 6788999998877433345778776654443321 111222221 1 1111 11112211100 Q ss_pred --c-----Ccccc---------cccccCccccCceEEEE----eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 237 --D-----GEIVA---------GTEGEYPLDSCPWIPVR----LIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMI 296 (532) Q Consensus 237 --~-----~~~~~---------~~~~~~g~~~~P~~~~R----w~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~ 296 (532) + |..+. ......+....||+.++ .++..++++|+|-...+.+-+..|+..--......+. T Consensus 204 ~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~ 283 (499) T protein:vir:80 204 QSDDPNELGGKVSLKLLFNDIEPVVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL 283 (499) T ss_pred eccCccccCcccchhhhccCcCCceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHh Confidence 0 11110 00011133445555544 3446688999999999999999999888777766544 Q ss_pred HhcCceeecCccccChh-hhccC-------CCce--eecCcccccc-ccc-cCCccchhHHHHHHHHHHHHHHHHH-h-h Q lcl|NC_015159. 297 SSKVLFFVNPNGVTQIR-RVAKA-------NTGD--FVAGRKQDVE-VFQ-LEKYNDFQVAKATADDIEKRLSYAF-M-L 362 (532) Q Consensus 297 a~~p~~lv~~~g~~~~~-~~~~~-------~~G~--~v~g~~~~~~-~~~-~~~~~~~~~~~~~i~~~~~rI~~af-~-~ 362 (532) .+..+.|+++.+ .+. +.... .... .+.+..++.. .+. +...-+...-...++.+.+.|.... + . T Consensus 284 -~~~~i~v~~~~l-~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~ 361 (499) T protein:vir:80 284 -GKKKVLVPSSFV-KTAVNLDGSTTQYFDSTDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSA 361 (499) T ss_pred -cccceecchhhh-hccCCCCCCcccCCCcccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCCh Confidence 566666654322 211 11000 0000 1112111111 011 1100011112233333333333221 1 1 Q ss_pred hhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCc--cccccceeec--chHHHH Q lcl|NC_015159. 363 NSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLP--KEAVEPAIAT--GLEALG 438 (532) Q Consensus 363 ~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p--~~~~~~~~v~--~l~~l~ 438 (532) ..+........|||||..+.+...+...-.-.. ...-|..|++-++.+..-.+.+...+ ...+.+.+-. ..+... T Consensus 362 ~~fg~~~~g~~TAtei~s~~~~l~~~~~~~~~~-~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~ 440 (499) T protein:vir:80 362 GTFTFDENGLKTATEVVSEKSETYQTKNSHSQL-IEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDT 440 (499) T ss_pred hhcCCCcccchhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHH Confidence 112222334469999998888877776553333 33345555555555443333332212 2233333311 122222 Q ss_pred HHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHH Q lcl|NC_015159. 439 RGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQA 518 (532) Q Consensus 439 raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~ 518 (532) .++.....++ +.+. ....+ ++...|+ |++|.+++.++.+..+.+ ..+ . T Consensus 441 ---~~~~~~~~~~--~Gi~-------S~et~---l~~~~~~-------~d~ea~~el~~i~~E~~~-----~~~--~--- 488 (499) T protein:vir:80 441 ---TINRYTTAKN--QGMI-------PLKIA---LQRAWNI-------TEAEADEWAEMLAKEKQA-----EIP--N--- 488 (499) T ss_pred ---HHHHHHHHHH--cCCC-------CHHHH---HhhcCCC-------ChHHHHHHHHHHHHHhhc-----CCC--C--- Confidence 2222222111 1111 12222 3344565 344444333332221110 000 0 Q ss_pred HHhhcccccCCCC Q lcl|NC_015159. 519 AAAMMQQQAGLPT 531 (532) Q Consensus 519 ~~~~~~~~~g~~~ 531 (532) .......|..+ T Consensus 489 --~d~~g~~ge~e 499 (499) T protein:vir:80 489 --NDMTGIFGEEE 499 (499) T ss_pred --CCccccCCCCC Confidence 00111222222 No 64 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=98.70 E-value=9.8e-08 Score=59.00 Aligned_cols=448 Identities=9% Similarity=-0.000 Sum_probs=199.0 Q ss_pred CCCCCC-CccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc---ccCCCC-CcccccccccccchHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEK-TGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPS---VFPSAT-ADGSTSYTTPWQSIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~-~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~---~~~~~~-~~~~~~~~~~~dst~~~a~~~Laa~l~ 75 (532) |...+. ...+.+.+.+..+.....+. ++++++.+|..-. ...... ....+...++..+.+...++..++.|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~---~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:96 31 YDGTESDLLQNVNEVSKYIEHHMDYQR---PRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred ccchhhhhhcCHHHHHHHHHHHHHhhh---HHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhc Confidence 443322 22345556565555555544 3445555554322 111111 111122346777778888888877665 Q ss_pred HhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_015159. 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQV 155 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~ 155 (532) + -|+. ++.++.. ....+...+..++|.....++.++..+||.+.+++..++ T Consensus 108 g--~p~~-----~~~~d~~--------------------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~-- 158 (511) T protein:vir:96 108 G--NPIQ-----YQDDDKD--------------------VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ-- 158 (511) T ss_pred c--cCce-----eecCchH--------------------HHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCC-- Confidence 3 1221 2233221 112344666778899999999999999999988776542 Q ss_pred cCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEE Q lcl|NC_015159. 156 EGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSY 233 (532) Q Consensus 156 ~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~ 233 (532) .+.+++.+++..+.++..|. .+++...+|.+..... . ....+....+++|+ ++. -|. T Consensus 159 -dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~----------~--~~~~~~~~~~~vyt-----~~~-i~~-- 217 (511) T protein:vir:96 159 -DDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPI----------D--KTDEDEVFTVDLFT-----SHG-VYR-- 217 (511) T ss_pred -CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeec----------c--ccccceEEEEEEEe-----CCc-EEE-- Confidence 34567888877665554443 4566655554432110 0 00001111222222 221 111 Q ss_pred EEEcCcc------cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCc Q lcl|NC_015159. 234 QEIDGEI------VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPN 307 (532) Q Consensus 234 ~~~~~~~------~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~ 307 (532) +..++.. ........++..+|++.++- +.+|+|=.+..++-+..++.+.-......+....|.+.+... T Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~ 292 (511) T protein:vir:96 218 YLTNRTNGLKLTPRENSFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN 292 (511) T ss_pred EEecCCCcccccccccccccCcCcccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecC Confidence 1111110 01112345677888877653 457999899999999999988888888888888887766543 Q ss_pred cccChhhhccCCCceee--------cC----ccccccccccCCccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCCCC Q lcl|NC_015159. 308 GVTQIRRVAKANTGDFV--------AG----RKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM-LNSAVQRGGDRVT 374 (532) Q Consensus 308 g~~~~~~~~~~~~G~~v--------~g----~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~~~T 374 (532) ...+...+.....+.++ .+ ..++..+..+....+.......++.+++.|...-. .+.....-+...| T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~S 372 (511) T protein:vir:96 293 LNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS 372 (511) T ss_pred ccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccH Confidence 33443333222211111 11 11112222233333455566667777666644211 1111111123346 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc--cccceeec--chHHHHHHHHHHHHHHHH Q lcl|NC_015159. 375 AEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKE--AVEPAIAT--GLEALGRGHDLNKLNVFI 450 (532) Q Consensus 375 AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~--~~~~~~v~--~l~~l~raq~~~~l~~~~ 450 (532) +..+..... .+........+.-.+.+.-++..++.++...+... .+.+ .+++.+.- +.+.+.. ++.+... T Consensus 373 g~Al~~~~~-~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~-~~~~~~~i~~~f~~~~p~n~~e~---~d~~~kl- 446 (511) T protein:vir:96 373 GEAMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID-ANKDFNTVRYVYNRNLPKSLIEE---LKAYIDS- 446 (511) T ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-cccccccceEEeCCCCCcCHHHH---HHHHHHH- Confidence 665544432 23333344445555555555555555554333221 1222 23444422 2232322 2222221 Q ss_pred HHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCC Q lcl|NC_015159. 451 DYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGL 529 (532) Q Consensus 451 ~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 529 (532) +.+ +....++.. ++ ++.+ ++|++...++++.+...+ +........+........+-..- T Consensus 447 ---~G~-------iS~et~l~~----l~-----~v~d~~~El~ri~~E~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 506 (511) T protein:vir:96 447 ---GGK-------ISQTTLMSL----FS-----FFQDPELEVKKIEEDEKESIKKA-QKGIYKDPRDINDDEQDDDTKDT 506 (511) T ss_pred ---hcc-------CChHHHHHh----CC-----CCCCHHHHHHHHHHHHHHHHHHH-hhccccCCCCCCCCCCCCCccCc Confidence 111 222333322 22 2222 344443333322221111 11111111111000101111111 Q ss_pred CCC Q lcl|NC_015159. 530 PTQ 532 (532) Q Consensus 530 ~~~ 532 (532) ++| T Consensus 507 ~~e 509 (511) T protein:vir:96 507 VDK 509 (511) T ss_pred ccc Confidence 111 No 65 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=98.70 E-value=9.8e-08 Score=59.00 Aligned_cols=448 Identities=9% Similarity=-0.000 Sum_probs=199.0 Q ss_pred CCCCCC-CccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc---ccCCCC-CcccccccccccchHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEK-TGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPS---VFPSAT-ADGSTSYTTPWQSIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~-~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~---~~~~~~-~~~~~~~~~~~dst~~~a~~~Laa~l~ 75 (532) |...+. ...+.+.+.+..+.....+. ++++++.+|..-. ...... ....+...++..+.+...++..++.|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~---~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:78 31 YDGTESDLLQNVNEVSKYIEHHMDYQR---PRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred ccchhhhhhcCHHHHHHHHHHHHHhhh---HHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhc Confidence 443322 22345556565555555544 3445555554322 111111 111122346777778888888877665 Q ss_pred HhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_015159. 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQV 155 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~ 155 (532) + -|+. ++.++.. ....+...+..++|.....++.++..+||.+.+++..++ T Consensus 108 g--~p~~-----~~~~d~~--------------------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~-- 158 (511) T protein:vir:78 108 G--NPIQ-----YQDDDKD--------------------VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ-- 158 (511) T ss_pred c--cCce-----eecCchH--------------------HHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCC-- Confidence 3 1221 2233221 112344666778899999999999999999988776542 Q ss_pred cCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEE Q lcl|NC_015159. 156 EGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSY 233 (532) Q Consensus 156 ~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~ 233 (532) .+.+++.+++..+.++..|. .+++...+|.+..... . ....+....+++|+ ++. -|. T Consensus 159 -dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~----------~--~~~~~~~~~~~vyt-----~~~-i~~-- 217 (511) T protein:vir:78 159 -DDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPI----------D--KTDEDEVFTVDLFT-----SHG-VYR-- 217 (511) T ss_pred -CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeec----------c--ccccceEEEEEEEe-----CCc-EEE-- Confidence 34567888877665554443 4566655554432110 0 00001111222222 221 111 Q ss_pred EEEcCcc------cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCc Q lcl|NC_015159. 234 QEIDGEI------VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPN 307 (532) Q Consensus 234 ~~~~~~~------~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~ 307 (532) +..++.. ........++..+|++.++- +.+|+|=.+..++-+..++.+.-......+....|.+.+... T Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~ 292 (511) T protein:vir:78 218 YLTNRTNGLKLTPRENSFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN 292 (511) T ss_pred EEecCCCcccccccccccccCcCcccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecC Confidence 1111110 01112345677888877653 457999899999999999988888888888888887766543 Q ss_pred cccChhhhccCCCceee--------cC----ccccccccccCCccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCCCC Q lcl|NC_015159. 308 GVTQIRRVAKANTGDFV--------AG----RKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM-LNSAVQRGGDRVT 374 (532) Q Consensus 308 g~~~~~~~~~~~~G~~v--------~g----~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~~~T 374 (532) ...+...+.....+.++ .+ ..++..+..+....+.......++.+++.|...-. .+.....-+...| T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~S 372 (511) T protein:vir:78 293 LNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS 372 (511) T ss_pred ccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccH Confidence 33443333222211111 11 11112222233333455566667777666644211 1111111123346 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc--cccceeec--chHHHHHHHHHHHHHHHH Q lcl|NC_015159. 375 AEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKE--AVEPAIAT--GLEALGRGHDLNKLNVFI 450 (532) Q Consensus 375 AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~--~~~~~~v~--~l~~l~raq~~~~l~~~~ 450 (532) +..+..... .+........+.-.+.+.-++..++.++...+... .+.+ .+++.+.- +.+.+.. ++.+... T Consensus 373 g~Al~~~~~-~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~-~~~~~~~i~~~f~~~~p~n~~e~---~d~~~kl- 446 (511) T protein:vir:78 373 GEAMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID-ANKDFNTVRYVYNRNLPKSLIEE---LKAYIDS- 446 (511) T ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-cccccccceEEeCCCCCcCHHHH---HHHHHHH- Confidence 665544432 23333344445555555555555555554333221 1222 23444422 2232322 2222221 Q ss_pred HHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCC Q lcl|NC_015159. 451 DYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGL 529 (532) Q Consensus 451 ~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 529 (532) +.+ +....++.. ++ ++.+ ++|++...++++.+...+ +........+........+-..- T Consensus 447 ---~G~-------iS~et~l~~----l~-----~v~d~~~El~ri~~E~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 506 (511) T protein:vir:78 447 ---GGK-------ISQTTLMSL----FS-----FFQDPELEVKKIEEDEKESIKKA-QKGIYKDPRDINDDEQDDDTKDT 506 (511) T ss_pred ---hcc-------CChHHHHHh----CC-----CCCCHHHHHHHHHHHHHHHHHHH-hhccccCCCCCCCCCCCCCccCc Confidence 111 222333322 22 2222 344443333322221111 11111111111000101111111 Q ss_pred CCC Q lcl|NC_015159. 530 PTQ 532 (532) Q Consensus 530 ~~~ 532 (532) ++| T Consensus 507 ~~e 509 (511) T protein:vir:78 507 VDK 509 (511) T ss_pred ccc Confidence 111 No 66 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=98.70 E-value=1e-07 Score=58.91 Aligned_cols=481 Identities=11% Similarity=0.033 Sum_probs=220.5 Q ss_pred CCCCCCCccCHHH----HHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCC-CCcccccccccccchHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADG----AAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSA-TADGSTSYTTPWQSIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~~~~~~~~----~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~-~~~~~~~~~~~~dst~~~a~~~Laa~l~ 75 (532) |+...+...+... ..+-|=...++| -..+++.+.+|..-.-..-. -.+|.. ...+++..|..-++++++-| T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~~~D~~--RlaaY~ly~d~y~n~~~el~~il~G~d-r~~~~~ps~r~~V~~~~~~L- 76 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVDENDKN--RVRAYDLYENIYLNSAETLKLVLRGDD-SVPILMPSGRKIVEAVHRFL- 76 (563) T ss_pred CCccccccCCCcccccccccccCCHHHHH--HHHHHHHHHHhhcCchhhhhhhcCCCc-eeeeccchHHHHHHHHHHhc- Confidence 8887766654332 112222222221 13344444444433211100 012221 23577888888888855444 Q ss_pred HhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_015159. 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQV 155 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~ 155 (532) +....|+ ++..+-.+ ..... +++.+...+++-|+.....++-.+.++.|-|++++-.|..+ T Consensus 77 ----g~~~~~~---Ve~~~~de-----~~~~a-------vq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K 137 (563) T protein:vir:74 77 ----GVGFDYL---VEPDMGDE-----GIRQS-------LNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNK 137 (563) T ss_pred ----CCCcEEe---cCccccCc-----chHHH-------HHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeecccc Confidence 3344444 22222111 11122 45566678889999999999999999999999999876433 Q ss_pred -cCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHH-Hhhc--ccCCCcce--EEEEEEEE-eeC--- Q lcl|NC_015159. 156 -EGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSL-EEAQ--GDQNPSEE--VTIYTHVY-RDP--- 225 (532) Q Consensus 156 -~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~-~~~~--~~~~~~~~--v~i~~~v~-~~~--- 225 (532) .+..++.+.|-.+.|+-..|++. |-.+|-..-...=.+|++..+.+ ...+ +..+++.. .++.|..+ +.- T Consensus 138 ~~g~R~rv~~vDP~~~fp~~dpd~-v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~w 216 (563) T protein:vir:74 138 KAGERISVDEVDPRQIFLIEDGST-VVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNW 216 (563) T ss_pred ccCCCceEeecCCceeeeccCCCC-cccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhccccc Confidence 34566777777788888777744 54454222111112344443322 1111 11122221 01111111 100 Q ss_pred CCCeEEEEE---EEcCcccc---cccc--cCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 226 EAMVFRSYQ---EIDGEIVA---GTEG--EYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMIS 297 (532) Q Consensus 226 ~~~~~~s~~---~~~~~~~~---~~~~--~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a 297 (532) +..+....- +.++.... .++. --++.-.||++++=...++++||+|-..+.+.-++.||.--...-..+... T Consensus 217 d~r~~~~~~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~t 296 (563) T protein:vir:74 217 DDRGAISDEQARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQ 296 (563) T ss_pred cccCccchhhhcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhc Confidence 111111111 11111110 0000 012335788887777789999999999999999999996555554444444 Q ss_pred hcCceeecCccccChhhh------ccCCCceeec-Ccccccc-ccccCCccchhHHHHHHHHHHHHHHHHHhhhhccc-- Q lcl|NC_015159. 298 SKVLFFVNPNGVTQIRRV------AKANTGDFVA-GRKQDVE-VFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQ-- 367 (532) Q Consensus 298 ~~p~~lv~~~g~~~~~~~------~~~~~G~~v~-g~~~~~~-~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~-- 367 (532) =.|+.... +....+.. .+-++|.++. +..+... ...+....+++.++..+.++..| +.+-.+-.+ T Consensus 297 G~pi~vl~--~~~p~d~~~g~~~~w~vgpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~er---al~~~s~tPav 371 (563) T protein:vir:74 297 GLGMYVTN--ASAPVDPNTGELTDWNIGPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEK---GIAEGSGTPEV 371 (563) T ss_pred CCCeEEec--cccccccccccccccccCCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHH---HHHhhccCcce Confidence 44554433 22211111 1124566543 2221212 23455566788888888877753 222111111 Q ss_pred ----CCCCCC---CHHH-----HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCC------Cccccccce Q lcl|NC_015159. 368 ----RGGDRV---TAEE-----IRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPN------LPKEAVEPA 429 (532) Q Consensus 368 ----~~~~~~---TAtE-----i~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~------~p~~~~~~~ 429 (532) .|..+. .|=| +-.+.+||+..|=.++-++..++..=++. .+.-++..|..|. +|... .+. T Consensus 372 A~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~-~~erl~~~g~~~~~~g~~~~~~~~-~v~ 449 (563) T protein:vir:74 372 AIGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLP-AYESDFQEQDGSRPFASADLLNEC-SVV 449 (563) T ss_pred eecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHH-HHHhHhhhhcccccccccccCCce-EEE Confidence 122221 2222 23344444443333344432222211111 1112233454443 22222 122 Q ss_pred ee-cchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 430 IA-TGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTA 507 (532) Q Consensus 430 ~v-~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~ 507 (532) ++ .+.-|.-+++-++++....+. . .+-...+++.+.++ |.+ ..+ ++|+++...++=..+..++| T Consensus 450 ivf~p~~P~d~~~vv~~~~tl~~a--G-------iiSretAv~~L~~~-g~~----~pdae~e~~~ie~~~i~~~~~a~a 515 (563) T protein:vir:74 450 CIFADPMPVNKTQVTQDTLLLQQA--H-------LILRKMAVAKLRSI-GWE----YPEVDDQGNALTDDDIADMLLAEA 515 (563) T ss_pred EEeCCCCCccHHHHHHHHHHHHHc--C-------chhHHHHHHHHHhC-CCC----CCcHHHHHhhcCHHHHHHHHHHHh Confidence 22 345667777777776655442 1 23466677777766 651 212 34444333222222222222 Q ss_pred HHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 508 GQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 508 ~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .+. ++ ....+...+|.+.| T Consensus 516 ~ad--~~----~~~~a~~~~g~~~~ 534 (563) T protein:vir:74 516 EAD--AS----LGLSAMDNGGAGEQ 534 (563) T ss_pred hcc--Cc----ccceecccCCCCcc Confidence 111 11 11222233444444 No 67 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=98.68 E-value=1.1e-07 Score=58.71 Aligned_cols=442 Identities=12% Similarity=0.090 Sum_probs=191.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccc-cccccchHHHHHHHHHHHHHHhhc Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSY-TTPWQSIGARGLNNLASKLMLALF 79 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~-~~~~dst~~~a~~~Laa~l~~~lt 79 (532) |.+.-+...+.+.+. +-.++..-...|+.+|+=-.|.+-.. ..++.+.. .++--..+...++.+|+-+.+-.. T Consensus 20 ~~~~~~~~~~~~~i~-----~~~~~~~ri~~~~~~y~g~~~~~~~~-~~~~~~~~~~~~sln~~~~i~~~~A~lv~~e~~ 93 (508) T protein:vir:15 20 VTGSLSKITDDPRIS-----IDPDEYVRIQTDLDYYSDKLQYIHYQ-ASDGIKKKRLKNTINMAKTAARRIASVVFNEKA 93 (508) T ss_pred cccchHHhhcccccc-----cCHHHHHHHHHHHHHhcCCCcccccc-cCCCCccccceeecchHHHHHHHHHhhhhCCCc Confidence 222211111111110 11111222344555554333322111 11121111 112224556666666666644321 Q ss_pred CCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCc Q lcl|NC_015159. 80 PVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQS 159 (532) Q Consensus 80 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~ 159 (532) . | ++++.+ ...+||+ +.+..++|+..+.+++.+..++|.+++-+-.+ ... T Consensus 94 ~-----i--~v~~~~------------~~~e~l~-------~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d----~~~ 143 (508) T protein:vir:15 94 E-----I--HVKDNN------------EADKFLN-------DVLEDNDFKNKFEEALEKGVALGGFAMRPYID----GNH 143 (508) T ss_pred e-----E--EeCCch------------HHHHHHH-------HHHHhccHHHHHHHHHHHHhhcCceEEEEEEe----CCe Confidence 1 1 121111 1233443 57778999999999999999999998754433 235 Q ss_pred ceEEEEecceEEE-eeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeC--CCCeEEEEEEE Q lcl|NC_015159. 160 NAPKLYKLHNFVV-ERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDP--EAMVFRSYQEI 236 (532) Q Consensus 160 ~~~~~~pl~~~~v-~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~--~~~~~~s~~~~ 236 (532) +.+..++...|+- ..|. |++.++.......... +.+-.+|+.++... ++.+|.+.+.+ T Consensus 144 ~~i~~v~ad~~~P~~~d~-~~~~~~af~~~~~~~~------------------~~~~~~yt~lE~h~~~~~~~~~I~n~l 204 (508) T protein:vir:15 144 IKIAWVRADQFYPLQSNT-NDISEAAIASRTQRTE------------------SNQTKYYTLLEFHQWQDNGSYQITNEL 204 (508) T ss_pred eEEEEEcCCeeEEEEEcC-CCeEEEEEEEEEEeec------------------CCCceEEEEEEEEEEecCcceEEEEEE Confidence 6788889888764 5565 4454443222211100 01111233322211 22233332221 Q ss_pred -c-------Ccccccc-----------cccCccccCceEEEEee----ecCCCccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 237 -D-------GEIVAGT-----------EGEYPLDSCPWIPVRLI----KMPNEDYGRSFVEEYLGDLKSLENLYEAIVKM 293 (532) Q Consensus 237 -~-------~~~~~~~-----------~~~~g~~~~P~~~~Rw~----~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~ 293 (532) . |..++.. ....|...-||..++.. ...++.||+|-...+.+.+..||..--....- T Consensus 205 y~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e 284 (508) T protein:vir:15 205 YKSDSPDIVGNQVPLSTLPVYKELAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWE 284 (508) T ss_pred EecCCchhcCcccchhhcccccCCCcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHH Confidence 1 2222110 00123333444444432 23368899999999999999999877777765 Q ss_pred HHHHhcCceeecCccccChhh--hccCCCc-e-ee--cCcccc-ccccccCCccchhHHHHHHHHHHHHHHHHH-h-hhh Q lcl|NC_015159. 294 SMISSKVLFFVNPNGVTQIRR--VAKANTG-D-FV--AGRKQD-VEVFQLEKYNDFQVAKATADDIEKRLSYAF-M-LNS 364 (532) Q Consensus 294 ~~~a~~p~~lv~~~g~~~~~~--~~~~~~G-~-~v--~g~~~~-~~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~-~~~ 364 (532) . ...++.+.|+++- ++.+. ...-.++ . +. .+..+. ..+..+...-....-.+.++.+.+.|.... + ... T Consensus 285 ~-~~~~~~i~v~~~~-l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~ 362 (508) T protein:vir:15 285 I-RLGQKHIAVQPGM-LRFDDEHKPTFDTEQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGT 362 (508) T ss_pred H-HhcccceeechHH-hcCCCCCccccCCCCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchh Confidence 5 6778887776542 22211 0000011 1 11 111111 011111111112223344444444444332 1 111 Q ss_pred cccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCC--------Cccccccceee--cch Q lcl|NC_015159. 365 AVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPN--------LPKEAVEPAIA--TGL 434 (532) Q Consensus 365 ~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~--------~p~~~~~~~~v--~~l 434 (532) +.......-|||||....+...+...- ..+.....|..|++-++.++.-.++... .+....++++. -++ T Consensus 363 f~~~~~~~~TAtei~s~~~~~~~t~~~-~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i 441 (508) T protein:vir:15 363 FSYSNDGVKTATEVVSNNSMTYQTRSS-YLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGV 441 (508) T ss_pred cccccCccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCC Confidence 111222335999999999888888776 4444555777777777776554433321 22233333332 111 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHH Q lcl|NC_015159. 435 EALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAA 514 (532) Q Consensus 435 ~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~ 514 (532) .+ -+.++++..+..+. +.+ +....++ .+..|+ |++|++++.++.+..+... .... T Consensus 442 ~~-d~~~~~~~~~~~v~--aGi-------~s~e~~i---~~~~g~-------~deea~~el~ri~~E~~~~-----~~~~ 496 (508) T protein:vir:15 442 FV-NKDKQLEEDAKVLA--IGA-------LSKQTFL---QRNYGM-------TDEQAAEELAKIQSEAPTD-----TFEG 496 (508) T ss_pred CC-CHHHHHHHHHHHHh--cCC-------CCHHHHH---HhcCCC-------ChHHHHHHHHHHHHhcccc-----Cccc Confidence 11 11222222222221 122 1223333 233566 3566655555443332110 0000 Q ss_pred HHHHHHhhcccccC-CCC Q lcl|NC_015159. 515 GGQAAAAMMQQQAG-LPT 531 (532) Q Consensus 515 ~~~~~~~~~~~~~g-~~~ 531 (532) .......| ..+ T Consensus 497 ------~~~~~~~g~~ge 508 (508) T protein:vir:15 497 ------GRSAILNGGDGE 508 (508) T ss_pred ------cccccCCCCCCC Confidence 11111111 222 No 68 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=98.67 E-value=1.2e-07 Score=58.46 Aligned_cols=436 Identities=11% Similarity=0.035 Sum_probs=192.1 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccc---cCCC---CCcccccccccccchHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV---FPSA---TADGSTSYTTPWQSIGARGLNNLASKL 74 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~---~~~~---~~~~~~~~~~~~dst~~~a~~~Laa~l 74 (532) |-+....-++.+.+.+..+..+.+| .++|+.+.+|....- .... .....+...++..+.+...++..++.| T Consensus 22 ~~~~~~~~~~~~~i~~~i~~~~~~~---~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l 98 (481) T protein:vir:10 22 VVSDLAELLKEENLRNFISRHQTEQ---VPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHNYAKYVSRFIVGYL 98 (481) T ss_pred eeecchhhcCHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCcccccCccccccccccccceeecchHHHHHHHHHhhh Confidence 3334345556666666565554443 456667777654421 1010 011112233566666677777777555 Q ss_pred HHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 75 MLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 75 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) .+ .|. .+...|... . ..+.+.+..++|.....++.++..++|.+.+++..++ T Consensus 99 ~g------~~~-~~~~~d~~~-------------~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~- 150 (481) T protein:vir:10 99 TG------NPI-TITHQDNQT-------------N-------DKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDF- 150 (481) T ss_pred cc------CCc-eEecCChhH-------------H-------HHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCC- Confidence 42 222 222222211 1 1334566778899999999999999999988775542 Q ss_pred ccCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEE Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRS 232 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s 232 (532) ++.+++++++..+.++..|. .+++...+|.++..-. ++.....+++| .++. .. T Consensus 151 --dg~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~---------------~~~~~~~~~~y-----~~~~---i~ 205 (481) T protein:vir:10 151 --EDRDTFKVLDPKSTFVVYDQTLDKKVVAGVRYFEKQDK---------------DKVPVQHVEVY-----TTDK---IY 205 (481) T ss_pred --CCeEEEEEEcccceEEEEcCCCCCceEEEEEEEEEeeC---------------CCceEEEEEEE-----ecCe---EE Confidence 35567888888776655554 3566665554442100 01111222232 2221 11 Q ss_pred EEEEcCccc-ccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccC Q lcl|NC_015159. 233 YQEIDGEIV-AGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQ 311 (532) Q Consensus 233 ~~~~~~~~~-~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~ 311 (532) ++...+... .......++..+|++..+- +.+|+|=.+...+-+..++.+.-......+....|.+.+....... T Consensus 206 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~ 280 (481) T protein:vir:10 206 YIEIKGGTYHRVEEVEHYYNDVPIIEYLN-----DQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLD 280 (481) T ss_pred EEEecCCceeecccccccCCceeEEEeec-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCC Confidence 122222211 1122344667889876543 4679998888999999999888888888888888887765322222 Q ss_pred hhhhccCCCce-ee-c--------CccccccccccCCccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCCCCHHHHHH Q lcl|NC_015159. 312 IRRVAKANTGD-FV-A--------GRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM-LNSAVQRGGDRVTAEEIRY 380 (532) Q Consensus 312 ~~~~~~~~~G~-~v-~--------g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~~~TAtEi~~ 380 (532) .++......+. +. + +...++..+ ....+.+.....++.++..|...-. .+......+...|+..+.. T Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~ 358 (481) T protein:vir:10 281 SEDAKAFRDANMIHLEPGTNANGSEGKAEVKYV--YKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESMKY 358 (481) T ss_pred ccchhhhhhccceeccccccccCCCCCcceeEE--eecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHH Confidence 22222211111 11 1 111222221 2222344455556666655533211 1111111122345544433 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCccc--cccceeecchHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_015159. 381 VAGELEDTLGGVYSLLSQELQLPLVKILLKELQAT-SKIPNLPKE--AVEPAIATGLEALGRGHDLNKLNVFIDYMIKLA 457 (532) Q Consensus 381 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~-g~lp~~p~~--~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~ 457 (532) +..-... ..++.+ ..+...+.+.+.++.+. +.....+.+ .+++.+. +..+-..++.++.+... +.+ T Consensus 359 ~~~~l~~----k~~~~~-~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~-~~~~~~~~~~a~~~~kl----~g~- 427 (481) T protein:vir:10 359 KLFGLEQ----VRAIKE-RLFKKGLMKRYKLLLNNVNLTGLKQHNYAELTITFT-PNLPKSMMESINAFNAL----SGG- 427 (481) T ss_pred HHHHHHH----HHHHHH-HHHHHHHHHHHHHHHHHHhccCCCccccceeeEEeC-CCCCcCHHHHHHHHHHH----hcc- Confidence 3222222 123322 23333345544544332 111111122 2333332 11122222222222221 111 Q ss_pred chhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCC Q lcl|NC_015159. 458 GLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLP 530 (532) Q Consensus 458 p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 530 (532) +....+++ .++. +.+ ++|++..+++...++.. ......+.+.........|.+ T Consensus 428 ------is~et~~~----~l~~-----i~d~~~E~~ri~~E~~~~~~~-----~~~~~~~~~~~~~~~~dd~~g 481 (481) T protein:vir:10 428 ------VSESTRLS----LLDF-----IDNPKEELEKMQEEEAQREKQ-----ADKRGYGEAFENHLNVDDSNG 481 (481) T ss_pred ------CChHHHHH----hCCC-----CCCHHHHHHHHHHHHHHHHhh-----hhhccCCccCCCCCCCCCCCC Confidence 22233332 2332 222 34444333332221111 111111111111111222222 No 69 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=98.66 E-value=1.3e-07 Score=58.31 Aligned_cols=448 Identities=9% Similarity=-0.021 Sum_probs=200.2 Q ss_pred CCCCCC-CccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc-----ccCCCCCcccccccccccchHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEK-TGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPS-----VFPSATADGSTSYTTPWQSIGARGLNNLASKL 74 (532) Q Consensus 1 m~~~~~-~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l 74 (532) |...+. ...+.+.+.+.-+.....|. ++++++.+|..-. +.... ....+...++..+.+...++..++.| T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~---~r~~~l~~Yy~g~~~i~~~~~~~-~~~~~~~~ki~~n~~k~Iv~~~~~yl 106 (511) T protein:vir:96 31 YDGTESDLLQNVNEVSKYIEHHMDYQR---PRLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVAHDYASYISDFINGYF 106 (511) T ss_pred cchhhhhhhccHHHHHHHHHHHHHhhH---HHHHHHHHHhcccCccccccCcC-cccccCcceeecchHHHHHHHHHhhh Confidence 544322 22244555555544444443 4555566555432 11111 11112234566677777777777655 Q ss_pred HHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 75 MLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 75 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) ++ -|+. ++..+.. + ...+.+.+..++|.....++.+++.+||.+.+++..++ T Consensus 107 ~g--~p~~-----~~~~~~~-------------~-------~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~de- 158 (511) T protein:vir:96 107 LG--NPIQ-----YQDDDKD-------------V-------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ- 158 (511) T ss_pred cc--CCce-----eecCchH-------------H-------HHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCC- Confidence 43 1211 2333221 1 12345667788999999999999999999988776542 Q ss_pred ccCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEE Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRS 232 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s 232 (532) ++.+++..++..+.++..|. .+++...+|.+...... ....-+++|+-...++. -|.. T Consensus 159 --d~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d-----------------~~~~~~~~~~~iyt~~~-i~~~ 218 (511) T protein:vir:96 159 --DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID-----------------KTDEDEVFTVDLFTSHG-VYRY 218 (511) T ss_pred --CCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc-----------------ccccceEEEEEEEeCCc-EEEE Confidence 35567777777665544443 46666666555432110 01111222221122221 1111 Q ss_pred EEEEcCc-----ccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCc Q lcl|NC_015159. 233 YQEIDGE-----IVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPN 307 (532) Q Consensus 233 ~~~~~~~-----~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~ 307 (532) ....+. .........++..+|++.++- +.+|+|-.+..++-+..++.+.-..........+|.+.+... T Consensus 219 -~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~ 292 (511) T protein:vir:96 219 -LTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN 292 (511) T ss_pred -EecCCCcccccccccccccccCCceeeEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecC Confidence 111111 011122345678899888664 457999999999999999998888888888888887766543 Q ss_pred cccChhhhccCCCceeec--------C----ccccccccccCCccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCCCC Q lcl|NC_015159. 308 GVTQIRRVAKANTGDFVA--------G----RKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM-LNSAVQRGGDRVT 374 (532) Q Consensus 308 g~~~~~~~~~~~~G~~v~--------g----~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~~~T 374 (532) +......+.....+..+. + ..++..+..+....+.+.....++.+.+.|...-. .+.....-+...| T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~S 372 (511) T protein:vir:96 293 LNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS 372 (511) T ss_pred ccCCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccch Confidence 333333332222221111 0 11112222233334556666777777776644321 1111111123456 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc--cccceee--cchHHHHHHHHHHHHHHHH Q lcl|NC_015159. 375 AEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKE--AVEPAIA--TGLEALGRGHDLNKLNVFI 450 (532) Q Consensus 375 AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~--~~~~~~v--~~l~~l~raq~~~~l~~~~ 450 (532) +..+.....- +........+.-.+.+.-+++-++.++...+... .+.+ .+++.+. .+.+.+..++ .+. T Consensus 373 g~Al~~~~~~-l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~-~~~d~~~i~~~f~~~~p~n~~e~~~---~~~--- 444 (511) T protein:vir:96 373 GEAMKYKLFG-LEQRTKTKEGLFTKGLRRRAKLLETILKNTWSID-ANKDFNTVRYVYNRNLPKSLIEELK---AYI--- 444 (511) T ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcc-cccccccceEEeCCCCCCCHHHHHH---HHH--- Confidence 6665544332 2222333344344444444333344433322211 1222 2344432 2223233322 221 Q ss_pred HHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCC Q lcl|NC_015159. 451 DYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGL 529 (532) Q Consensus 451 ~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 529 (532) .++.+ +....+++ .++ ++.+ ++|++...+++..+...+ +......+.+..-.....+.... T Consensus 445 -kl~G~-------iS~et~l~----~l~-----~v~D~~~E~~ri~~E~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 506 (511) T protein:vir:96 445 -DSGGK-------ISQTTLMS----LFS-----FFQDPELEVKKIEEDEKESIKKA-QKGIYKDPRDINDDEQDDDTKDT 506 (511) T ss_pred -HHhcc-------CChHHHHH----hCC-----CCCCHHHHHHHHHHHHHHHHHHH-hhccccCCCCCCCCCCCCccccc Confidence 11221 22233332 222 2222 344444433332221111 11111111111001111111111 Q ss_pred CCC Q lcl|NC_015159. 530 PTQ 532 (532) Q Consensus 530 ~~~ 532 (532) ++| T Consensus 507 ~~~ 509 (511) T protein:vir:96 507 VDK 509 (511) T ss_pred ccc Confidence 111 No 70 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=98.65 E-value=1.4e-07 Score=58.13 Aligned_cols=445 Identities=9% Similarity=-0.002 Sum_probs=197.7 Q ss_pred CCCCCCCcc-CHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc-----ccCCCCCcccccccccccchHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGF-AADGAAAAYNRLKNDRGAYETRAEDCATYTIPS-----VFPSATADGSTSYTTPWQSIGARGLNNLASKL 74 (532) Q Consensus 1 m~~~~~~~~-~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l 74 (532) |..++..-. +.+.+.+.-+.....|. ++++++.+|..-. +.... ....+...++..+.+...++.+++.| T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~---~r~~~l~~YY~g~~~i~~~~~~~-~~~~~~~~ki~~n~~k~Ivd~~~~yl 106 (512) T protein:vir:97 31 YDGTESDLLQNINEVSKYIEHHMDYQR---PRLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVAHDYASYISDFINGYF 106 (512) T ss_pred cCchhhhhhhhHHHHHHHHHHHHHhhH---HHHHHHHHHhcccCccccccCcc-cccccCcceeecchHHHHHHHHhhhh Confidence 555543322 23444444444444443 4555666665432 11111 11112234666777778888877766 Q ss_pred HHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 75 MLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 75 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) ++ -|+ +++..|.. +. ..+.+.+..++|.....++.+++.+||.+.+++..++ T Consensus 107 ~g--~p~-----~~~~~d~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~de- 158 (512) T protein:vir:97 107 LG--NPI-----QCQDDDKD-------------VL-------EAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ- 158 (512) T ss_pred cc--cCc-----eeccCChH-------------HH-------HHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCC- Confidence 54 121 22333221 11 2334566778899999999999999999988776543 Q ss_pred ccCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEE Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRS 232 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s 232 (532) ++.+++..++..+.++.-|. .+++...+|.+......- ...+.-..+++|+ ++. -|. T Consensus 159 --d~~~~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~------------~~~~~~~~~~vyt-----~~~-i~~- 217 (512) T protein:vir:97 159 --DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDK------------TDEDEVFTVDLFT-----SHG-VYR- 217 (512) T ss_pred --CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccc------------cccceEEEEEEEe-----CCc-EEE- Confidence 35567888887765554443 467776666554321100 0001112223322 221 111 Q ss_pred EEEEcC-cc-----cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecC Q lcl|NC_015159. 233 YQEIDG-EI-----VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNP 306 (532) Q Consensus 233 ~~~~~~-~~-----~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~ 306 (532) +..++ .. ........++..+|++.++ ++..|.|-.+..++-+..++.+.-......+....|.+.+.. T Consensus 218 -~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G 291 (512) T protein:vir:97 218 -YLTSRTNGLKLTPRENGFESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKG 291 (512) T ss_pred -EEecCCCcccccccccccccccCcccceEeec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeec Confidence 11111 10 0112234577888988754 346789989999999999998888888888888888876643 Q ss_pred ccccChhhhccCCCceeec-------------CccccccccccCCccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCC Q lcl|NC_015159. 307 NGVTQIRRVAKANTGDFVA-------------GRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM-LNSAVQRGGDR 372 (532) Q Consensus 307 ~g~~~~~~~~~~~~G~~v~-------------g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~~ 372 (532) ....+...+.....+..+. +..++..+..+....+.......++.++..|-..=+ .+.....-+.. T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn 371 (512) T protein:vir:97 292 NLNLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGT 371 (512) T ss_pred CccCCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCccccccc Confidence 3333333332222222211 011122222233334555566667777766643211 11111111233 Q ss_pred CCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc--cccceeec--chHHHHHHHHHHHHHH Q lcl|NC_015159. 373 VTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKE--AVEPAIAT--GLEALGRGHDLNKLNV 448 (532) Q Consensus 373 ~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~--~~~~~~v~--~l~~l~raq~~~~l~~ 448 (532) .|+..+..... .+........+.-.+.+.-++..++.++...+.... +.+ .+++.+.- +.+.+..++.+.++ T Consensus 372 ~Sg~Al~~~~~-~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~-~~d~~~i~~~f~~~~p~~~~e~~~~~~kl-- 447 (512) T protein:vir:97 372 QSGEAMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA-NKDFNTVRYVYNRNLPKSLIEELKAYIDS-- 447 (512) T ss_pred chHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccc-ccccccceEEeCCCCCcCHHHHHHHHHHH-- Confidence 46655543322 222223333333333443344444444433332221 122 23444422 22323333222222 Q ss_pred HHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhccccc Q lcl|NC_015159. 449 FIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQA 527 (532) Q Consensus 449 ~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (532) +.+ +....+++ .++. +.+ ++|++...++++.+.... ....+...+ +...-.++.+ T Consensus 448 -----~gi-------iS~et~~~----~l~~-----v~d~~~E~eri~~E~~~~~~~~--~~~~~~~~~-~~~~~~~~~~ 503 (512) T protein:vir:97 448 -----GGK-------ISQTTLMS----LFSF-----FQDPELEVKKIEEDEKESIKKA--QKGIYKDPR-DINDDEQDDD 503 (512) T ss_pred -----hcc-------CchHHHHH----hCCC-----CCCHHHHHHHHHHHHHHHHHHH--hhcccCCCC-CCCCCCCCCC Confidence 121 11222332 2332 222 344443333322211111 111100000 0000000000 Q ss_pred CCC--CC Q lcl|NC_015159. 528 GLP--TQ 532 (532) Q Consensus 528 g~~--~~ 532 (532) ... +| T Consensus 504 ~~~~~~~ 510 (512) T protein:vir:97 504 TKDTVDK 510 (512) T ss_pred ccccccc Confidence 000 00 No 71 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=98.62 E-value=1.8e-07 Score=57.61 Aligned_cols=444 Identities=9% Similarity=0.032 Sum_probs=182.9 Q ss_pred CCCCCCCccCHHHHHHHHH-HHHHHhhhHHHHHHHHHHhhcccccCCC---CCcc--ccccc-ccccchHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYN-RLKNDRGAYETRAEDCATYTIPSVFPSA---TADG--STSYT-TPWQSIGARGLNNLASK 73 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~-~lk~~R~~~e~~w~e~~~~~~P~~~~~~---~~~~--~~~~~-~~~dst~~~a~~~Laa~ 73 (532) |=+.-...++.+.+.+... +|-.+...-.++++.+.+|..=...... .... .+++. ++..+-+..+++++++. T Consensus 1 ~~~~p~~~l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~ 80 (479) T protein:vir:99 1 MIDLPDEDLSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQQ 80 (479) T ss_pred CccCCcccCChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHHhh Confidence 9888777888777665443 3333333344566666666544321111 1111 11111 12335556666666554 Q ss_pred HHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccc Q lcl|NC_015159. 74 LMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTE 153 (532) Q Consensus 74 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~ 153 (532) | +|.+ |+. +|.+.. +.+ .+.+..++|....++++++..+||.+.++|.+.. T Consensus 81 l----~~~g---f~~--~d~~~~---------~~~-----------~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~ 131 (479) T protein:vir:99 81 L----IVDG---YRK--TGTNEN---------AKG-----------WDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGI 131 (479) T ss_pred c----cccc---ccC--CCchhh---------HHH-----------HHHHHhcChhHHHHHHHHHHhhcCceEEEEecCC Confidence 3 4444 332 222211 112 2445667899999999999999999988876421 Q ss_pred --cccCCcceEEEEecceEE-EeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeE Q lcl|NC_015159. 154 --QVEGQSNAPKLYKLHNFV-VERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVF 230 (532) Q Consensus 154 --~~~~~~~~~~~~pl~~~~-v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~ 230 (532) .++.+..++++++..+.+ +-.|+......+|.. .. +....+.+|+. .+ + T Consensus 132 ~~~d~~g~~~i~~~~p~~~~~iydd~~~~~~~~~~~---~~------------------~~~~~~~~~~~----~~---~ 183 (479) T protein:vir:99 132 SPLDGTTVARIKCIDPRDAFAIWEDPYWDEWPKYLL---ER------------------QPNGQYWWWTE----ED---Y 183 (479) T ss_pred CCcCCCCceEEEEechhheEEEecCCcccceeeEEE---ee------------------cCceeEEEEec----ce---E Confidence 122344567777765544 434443322222211 11 11122222210 00 1 Q ss_pred EEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCcccc Q lcl|NC_015159. 231 RSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVT 310 (532) Q Consensus 231 ~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~ 310 (532) ..+....+..........++..+|++.++-+...+ .+|+|=.+..++-+..++...-.....++....|.+.+.-.... T Consensus 184 ~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~-~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~ 262 (479) T protein:vir:99 184 SIFEFKQGKFIYRETVSHDYGHIPFVRYVNVMDLR-GVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGLMLP 262 (479) T ss_pred EEEEecCCceeeccccccCCCCcceEEeecCCCcC-cCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCCCcc Confidence 11111111111111223456789999988776664 58999888999999999998888888888888887554311100 Q ss_pred ----ChhhhccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhccc---CCCCCCCHHHHHHHHH Q lcl|NC_015159. 311 ----QIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQ---RGGDRVTAEEIRYVAG 383 (532) Q Consensus 311 ----~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~---~~~~~~TAtEi~~r~~ 383 (532) ..........+.++.....++...++. .++++ ..++.++.-|...+....... ......++.-+..... T Consensus 263 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~-~~~~~---~~~~~l~~~i~~i~~~t~~p~~~~g~~~n~Sg~Al~~~~~ 338 (479) T protein:vir:99 263 EGANADQEKMRFAQESMLISQNEKASFGAIP-AAPLD---GLLNAYKESLLEFLALAQLPPHIAGQIVNVAADALAAGTR 338 (479) T ss_pred cccccchhccccccccceeecCCCceEEEec-ccchH---HHHHHHHHHHHHHhccCCCCHHHcccccchHHHHHHHHHH Confidence 000001111122222222333333333 22333 333333333333222111100 0112235544443322 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCccccccceee-cchHHHHHHHHHHHHHHHHHHHHhhcchhh Q lcl|NC_015159. 384 ELEDTLGGVYSLLSQELQLPLVKILLKELQA-TSKIPNLPKEAVEPAIA-TGLEALGRGHDLNKLNVFIDYMIKLAGLQD 461 (532) Q Consensus 384 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~g~lp~~p~~~~~~~~v-~~l~~l~raq~~~~l~~~~~~laq~~p~~~ 461 (532) -+... .++.+. .+.+-+.+++.++.. .|... +.+.+.+.++ ....+-..++.++.+.. +.+. + T Consensus 339 ~l~~k----a~~~~~-~f~~al~~~~~l~~~~~~~~~--~~~~~~i~~~w~~~~~~s~~~~ad~~~k----l~~a-g--- 403 (479) T protein:vir:99 339 QTMQK----LFEKQA-TWKASHNQTMRLVNKIEGRTE--EATDLDFTITWQDVTIQSLAQFADAWAK----MVES-L--- 403 (479) T ss_pred HHHHH----HHHHHH-HHHHHHHHHHHHHHHHcCCCc--cccceeeeEEecCCCCCCHHHHHHHHHH----HHhc-C--- Confidence 22222 222222 333334444444333 22211 1222333332 11111111222222222 1121 1 Q ss_pred hhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhh---HHHHH-H--HHhhcccccCC----CC Q lcl|NC_015159. 462 DDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMG---AAGGQ-A--AAAMMQQQAGL----PT 531 (532) Q Consensus 462 d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~---~~~~~-~--~~~~~~~~~g~----~~ 531 (532) .+-...++..+ .||++. +++.+++.++.+.+..+...+.. .++.+ + ..+.-.++++. |- T Consensus 404 -~is~et~l~~l---~gv~~~-------~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (479) T protein:vir:99 404 -KIPAEGVWDMI---PNLDQS-------TVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQANNKTGEPA 472 (479) T ss_pred -CCCHHHHHHhc---CCCCHH-------HHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCCCCCCCcchh Confidence 12223333222 477543 22222222211111111111111 11110 0 00111111111 11 Q ss_pred C Q lcl|NC_015159. 532 Q 532 (532) Q Consensus 532 ~ 532 (532) + T Consensus 473 ~ 473 (479) T protein:vir:99 473 S 473 (479) T ss_pred c Confidence 1 No 72 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=98.60 E-value=2.1e-07 Score=57.25 Aligned_cols=449 Identities=10% Similarity=0.036 Sum_probs=188.8 Q ss_pred CCCC------CCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc---ccCCCCCcccccccccccchHHHHHHHHH Q lcl|NC_015159. 1 MAEV------EKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPS---VFPSATADGSTSYTTPWQSIGARGLNNLA 71 (532) Q Consensus 1 m~~~------~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~---~~~~~~~~~~~~~~~~~dst~~~a~~~La 71 (532) |++. +...++.+.+....+..+.+|.. +++++.+|..=. ..........+...++..+-+...++..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~---r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~iv~~~~ 77 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQLKNYISRFKAEQLE---RLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYITVFEQ 77 (489) T ss_pred CCccceeeeCCCCCCCHHHHHHHHHHHHHHHHH---HHHHHHHHhcccCccccccccccccCCcceeecchHHHHHHHHh Confidence 5554 33344667787777777766644 445555543311 01111111112233576777788888887 Q ss_pred HHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecc Q lcl|NC_015159. 72 SKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPS 151 (532) Q Consensus 72 a~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~ 151 (532) +.|.+ -|+. ++..|.. +.++|. ..+...+|.....++.++..++|.+..++.. T Consensus 78 ~~l~g--~~~~-----~~~~d~~-------------~~~~l~-------~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~ 130 (489) T protein:vir:99 78 GYMLG--VPVE-----YKNENKD-------------LQAAID-------LMSVRNNEDYHNVKIKTDLSIYGRAYELLTV 130 (489) T ss_pred hhhcc--CCce-----eecCChh-------------HHHHHH-------HHHhhcChhHHHHHHHHHHhhCCeEEEEEee Confidence 66653 2222 2333221 333333 4456678888999999999999999765532 Q ss_pred cc-cccCCcceEEEEecceEEEeeCCC--CCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCC- Q lcl|NC_015159. 152 TE-QVEGQSNAPKLYKLHNFVVERDAY--DNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEA- 227 (532) Q Consensus 152 ~~-~~~~~~~~~~~~pl~~~~v~~d~~--G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~- 227 (532) .. .+..+.+++.+++..+++...|.. +++...+|.+...-. +......+++|+ ++. T Consensus 131 ~~~~d~~~~~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~---------------~~~~~~~~~~y~-----~~~i 190 (489) T protein:vir:99 131 EKIDDKKTEVKLYQLPAEQTFVIYDDTYQRNSLMAVHFYDIDYG---------------SGKRKQIIKAYT-----SDTI 190 (489) T ss_pred ccCcCCCcceEEEEEcccceEEEEcCCCCCceEEEEEEEEEecC---------------CCceEEEEEEEe-----CCcE Confidence 21 123455678888888766655543 445555544431100 001112233332 111 Q ss_pred CeEEEEEE-EcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecC Q lcl|NC_015159. 228 MVFRSYQE-IDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNP 306 (532) Q Consensus 228 ~~~~s~~~-~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~ 306 (532) ..|..... .++.. .......++..+|++..+. ...|+|-.+...+-+..++.+.-...........|.+.+.- T Consensus 191 ~~~~~~~~~~~~~~-~~~~~~~~~g~vPvv~~~n-----~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g 264 (489) T protein:vir:99 191 YTYEDYNLETKGMR-LKDYEGHFFKGVPVNEYAN-----NEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAG 264 (489) T ss_pred EEEEecCCCcccce-ecccccccCCceeEEEeec-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhcc Confidence 01111100 01111 1122345678899988764 35688888889999999999888888888888877766532 Q ss_pred ccccCh--hhh---ccC-CC-----------ceeecCccc------cccccccCCccchhHHHHHHHHHHHHHHHHHh-h Q lcl|NC_015159. 307 NGVTQI--RRV---AKA-NT-----------GDFVAGRKQ------DVEVFQLEKYNDFQVAKATADDIEKRLSYAFM-L 362 (532) Q Consensus 307 ~g~~~~--~~~---~~~-~~-----------G~~v~g~~~------~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~ 362 (532) ...... ..+ ... .+ +.++....+ +..+..+....+.......++.+.+.|-..-. . T Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 344 (489) T protein:vir:99 265 NAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTP 344 (489) T ss_pred CCcccccchhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCc Confidence 111000 000 000 00 111110000 01111222223344455556666555533211 1 Q ss_pred hhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcc--ccccceeecchHHHHHH Q lcl|NC_015159. 363 NSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPK--EAVEPAIATGLEALGRG 440 (532) Q Consensus 363 ~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~--~~~~~~~v~~l~~l~ra 440 (532) +......+...|+..+..+....... .-...+.-.+.+.-+++-++.++...+.--.... ..+++.+.-. .+-..+ T Consensus 345 ~~~~~~~~~n~Sg~Al~~~~~~l~~k-~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~-~p~d~~ 422 (489) T protein:vir:99 345 DTQDMKFSGVQSGESMKYKLMASDNY-REKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPN-LPQNDN 422 (489) T ss_pred ccccccccccchHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCC-CCcCHH Confidence 10101111234555544332211111 1112222223333333333333322221000111 1233333211 111122 Q ss_pred HHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHH Q lcl|NC_015159. 441 HDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAA 520 (532) Q Consensus 441 q~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~ 520 (532) +.++.+.... .+ +....++..+ -+| ++++++++.++.+.++...+...+ .... .... T Consensus 423 ~~~~~~~kl~----gi-------is~et~~~~l---~~v-------~~~d~~~E~~ri~~E~~~~~~~~~-~~~~-~~~~ 479 (489) T protein:vir:99 423 EIVTAAQNLY----GI-------VSDQTIFEIL---NTV-------TGVDAEAELKRLKEEADKKQSLPE-PRLV-GDAS 479 (489) T ss_pred HHHHHHHHHh----cc-------CCHHHHHHhc---CCC-------CchhHHHHHHHHHHHHHHHhcccc-cccc-CCCC Confidence 2223222221 11 2233333322 233 222333333332222111111110 0100 0011 Q ss_pred hhcccccCCC Q lcl|NC_015159. 521 AMMQQQAGLP 530 (532) Q Consensus 521 ~~~~~~~g~~ 530 (532) ..-++..+-| T Consensus 480 ~~~~~~~~~p 489 (489) T protein:vir:99 480 GQEEPTAEKP 489 (489) T ss_pred CCcCCCCCCC Confidence 1112222222 No 73 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=98.60 E-value=2.1e-07 Score=57.23 Aligned_cols=422 Identities=9% Similarity=0.027 Sum_probs=188.9 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCCcccc Q lcl|NC_015159. 9 FAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKL 88 (532) Q Consensus 9 ~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l 88 (532) ++.+.|.+..++++..+ +.....+++|+=--+-...... ...+...++-.+.+...++..++.|++ .| +.+ T Consensus 1 l~~~~l~~~i~~~~~~~-~r~~~l~~yy~g~~~il~~~~~-~~~~~~~ki~~n~~~~ivd~~~~~l~g--~~-----~~~ 71 (429) T protein:vir:98 1 MTKDLLSELIQKHRSFN-LSYSAYKQLYEGDHAILQQKQK-EQYKPDNRLVVNFAKYIVDTFNGYFIG--VP-----VQT 71 (429) T ss_pred CCHHHHHHHHHHHHHHH-HHHHHHHHHhcccccccccccc-ccCCCcceeecchHHHHHHHHhhhhcc--cC-----cee Confidence 78888888887776543 3333333443321111111111 111223466677777888888877754 12 223 Q ss_pred CCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecc Q lcl|NC_015159. 89 NVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLH 168 (532) Q Consensus 89 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~ 168 (532) +.++. ++.. .+...+..++|.....++.++..+||.|.+++..++ .+.+.+++++.. T Consensus 72 ~~~~~-------------~~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~---~g~~~~~~~~p~ 128 (429) T protein:vir:98 72 SHENK-------------QVSN-------YLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDE---NAEAGITYLTPL 128 (429) T ss_pred ecCCh-------------HHHH-------HHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecC---CCcEEEEEEccc Confidence 33321 1222 334556677899999999999999999988876543 355678888766 Q ss_pred eEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCccc-cccc Q lcl|NC_015159. 169 NFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIV-AGTE 245 (532) Q Consensus 169 ~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~-~~~~ 245 (532) +.++.-|. .+++...+|.+.- ++ .+.+.++.+.+.. .++.+.+... .... T Consensus 129 ~~~~v~dd~~~~~~~~~i~~~~~-------------------~~-----~~~~~~~~~~~~~---~~~~~~~~~~~~~~~ 181 (429) T protein:vir:98 129 EAFIVYDDSIRQKPLFAVRYFYN-------------------KG-----GVLEGSYSDASNI---TYFKDGEKGIEIGES 181 (429) T ss_pred ceEEEEeCCCCCceEEEEEEEEe-------------------cC-----ceEEEEEEeCceE---EEEEecCCceEeccc Confidence 65444333 3445544443320 00 1222233333321 1111221111 1122 Q ss_pred ccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCCceee- Q lcl|NC_015159. 246 GEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGDFV- 324 (532) Q Consensus 246 ~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~G~~v- 324 (532) ...++..+|++..+ ++.+|+|=.+...+-+..++.+.-......+....|.+.+... ....+.......+.++ T Consensus 182 ~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~-~~~~~~~~~~~~~~~~~ 255 (429) T protein:vir:98 182 EPHPFDGVPMIEYV-----ENEERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILGA-ELDDETLKSLRDTRIIN 255 (429) T ss_pred ccccCCccceEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC-CCCcchhhhHhhCceee Confidence 24567789987653 4568999999999999999999999999999999888776421 1122222222222222 Q ss_pred -cCcc-ccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHH Q lcl|NC_015159. 325 -AGRK-QDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQL 402 (532) Q Consensus 325 -~g~~-~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~ 402 (532) ++.. ....+..+....+.+.....++.+.+.|...-..-.+...+....|+.-+..+..-.. ...-...+.-.+.+. T Consensus 256 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~-~k~~~~~~~~~~~l~ 334 (429) T protein:vir:98 256 LKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVANISDESFGTASGIALRYRLQAMD-NLAKTKERKFMSGMN 334 (429) T ss_pred ccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccchHHHHHHHHHHHH-HHHHHHHHHHHHHHH Confidence 2111 1112222333345566666777777766543221111111212345554443221111 111112222222222 Q ss_pred HHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHh Q lcl|NC_015159. 403 PLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTT 482 (532) Q Consensus 403 Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~ 482 (532) -++..+..++...+. +.....+++.+ ++..+-.-++.++.+.. ++.+ +....++ +.+|. T Consensus 335 ~~~~li~~~~~~~~~--~~d~~~i~v~f-~~~~p~~~~~~a~~~~k----l~g~-------is~et~~----~~l~~--- 393 (429) T protein:vir:98 335 RRYKLIASYPTSKIG--PKDWIGIKYKF-TRNLPANLLEESQIAGN----LAGI-------VSEETQV----GVLSI--- 393 (429) T ss_pred HHHHHHHHHhccCCC--ccccccceEEe-CCCCCcCHHHHHHHHHH----Hhcc-------CchHHHH----HhCCC--- Confidence 233333333322221 11111233333 11122112222222222 1221 2223232 33342 Q ss_pred HccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCC Q lcl|NC_015159. 483 GLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPT 531 (532) Q Consensus 483 ~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 531 (532) +.+ ++|++..++++..... .+++...++...... + T Consensus 394 --v~d~~~E~~ri~~E~~~~~~-----~~~~~~~~~~~~~~~-------~ 429 (429) T protein:vir:98 394 --VENPQKEIERKNSDKSTLIS-----RQAGGLNGQNTTTIL-------E 429 (429) T ss_pred --CCCHHHHHHHHHHHHHHHHH-----HHHhhhcCCCCCCCC-------C Confidence 222 2344333332221111 111111111111111 1 No 74 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.60 E-value=2.1e-07 Score=57.18 Aligned_cols=450 Identities=11% Similarity=0.015 Sum_probs=187.6 Q ss_pred CCCCCCC---------ccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCC-C--CcccccccccccchHHHHHH Q lcl|NC_015159. 1 MAEVEKT---------GFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSA-T--ADGSTSYTTPWQSIGARGLN 68 (532) Q Consensus 1 m~~~~~~---------~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~-~--~~~~~~~~~~~dst~~~a~~ 68 (532) |-.+.+. +++.+. ....+.|..+...+.++.+++.+|..-...... + -+..-+.-+..-+-+..+++ T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e-~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~iVd 79 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDV-VDKVNGLYQQLVDRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKAVD 79 (504) T ss_pred CCccCCcccccccccCCCCHHH-HHHHHHHHHHHHHHhHHHHHHHHHHhccccchhccccccHHHHHHhhccCcHHHHHH Confidence 5544332 223222 222344444444445666677777543321111 1 11111111234455667777 Q ss_pred HHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeee Q lcl|NC_015159. 69 NLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLY 148 (532) Q Consensus 69 ~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~ 148 (532) +|+..|. .-+ |++ ++... .. ..+++....++|.....++.++..+||.+.++ T Consensus 80 ~~a~rl~----~~G---f~~--~d~~~------------~~-------~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~ 131 (504) T protein:vir:99 80 TLARRCN----LES---FVW--PDGDY------------GS-------IGGPDVWDENFFATKANNAMVSSLIHGPAFLI 131 (504) T ss_pred HHHhhhc----cce---eeC--CCCCh------------hh-------HHHHHHHHhcChhhHHHHHHHHHHhhCceeEE Confidence 7776542 211 222 21110 01 12335567788999999999999999999988 Q ss_pred ecccccccCCcceEEEEecceEEEeeCC-CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCC Q lcl|NC_015159. 149 IPSTEQVEGQSNAPKLYKLHNFVVERDA-YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEA 227 (532) Q Consensus 149 v~~~~~~~~~~~~~~~~pl~~~~v~~d~-~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~ 227 (532) |-.++. .....++++++..+..+..|+ .+++...++.... ........+++|. ++ T Consensus 132 v~~~~d-~~~~~~I~~~sP~~~~~iyD~~~~~~~~a~~~~~~-----------------d~~g~~~~~~~y~-----~~- 187 (504) T protein:vir:99 132 NTEGGA-GEPDSLIHVKSAMQATGEWNSRRNAMDSLLSITSR-----------------DAEGHPTGIALYE-----DG- 187 (504) T ss_pred EecCCC-CCceeEEEEeccceeEEEEeCCCCceeEEEEEEEe-----------------cCCCeEEEEEEEc-----CC- Confidence 865532 122346777777665444444 4554444332210 0001112233332 11 Q ss_pred CeEEEEEE-EcCc-ccccccccCccccCceEEEEeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHHhcCceee Q lcl|NC_015159. 228 MVFRSYQE-IDGE-IVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFV-EEYLGDLKSLENLYEAIVKMSMISSKVLFFV 304 (532) Q Consensus 228 ~~~~s~~~-~~~~-~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~-~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv 304 (532) .++++ .++. ........+++. +|++.+..+...++.||+|-. +..++-+..+|...-..+..+++.+.|...+ T Consensus 188 ---~~~~~~~~~~~~~~~~~~~~~~g-vPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i 263 (504) T protein:vir:99 188 ---VTVTADMDDDGDWHADVRTHKLG-VPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLIL 263 (504) T ss_pred ---cEEEEEEcCCceeeeccccCCCC-cceEEecccccCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhh Confidence 01111 1111 111111233443 899999888888999999954 3567888899988888888888888887444 Q ss_pred c---------CccccChhhhccCCCcee--ecCcccc-------ccccccCCccchhHHHHHHHHHHHHHHHHHhhhh-- Q lcl|NC_015159. 305 N---------PNGVTQIRRVAKANTGDF--VAGRKQD-------VEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNS-- 364 (532) Q Consensus 305 ~---------~~g~~~~~~~~~~~~G~~--v~g~~~~-------~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-- 364 (532) - .||. +........+.+ ++.+.+. +...++ ..++++. .++.++.-|........ T Consensus 264 ~G~~~~~~~~~d~~--~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~-~~~~l~~---~~~~l~~~i~~~a~~t~~P 337 (504) T protein:vir:99 264 LGADAKNFRNKDGS--MKPAWQIALARVFALPDDEDEPDAARARADVKQF-PASSPQP---HIEMLEQIAMMFSGETSIP 337 (504) T ss_pred ccCCcccccccccc--ccchhhhhhhhhhcCCCccccccccCccceeeec-CCCChHH---HHHHHHHHHHHHHhhhCCC Confidence 1 1111 111111112222 2222111 111111 1223332 23333333333222111 Q ss_pred ---cc-cCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCccccccceee-cchHHHH Q lcl|NC_015159. 365 ---AV-QRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQA-TSKIPNLPKEAVEPAIA-TGLEALG 438 (532) Q Consensus 365 ---~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~g~lp~~p~~~~~~~~v-~~l~~l~ 438 (532) +. ..+...-+|.-|.....-+.+ ...+.+.-|-.. +.+++.++.. .+.+...+.+..+++++ ....+-. T Consensus 338 ~~~lG~~~~~n~sSa~Ai~~~~~~L~~----ka~~k~~~f~~~-l~~~~rla~~~~~~~~~~~~~~~~~~v~w~d~~~~s 412 (504) T protein:vir:99 338 VESLGFSNRANPTSADAYIASREDLIA----EAEGATDDWSPA-FRRSMIRALAIKNGLDRIPPEWKTIDSKFRSPLYLS 412 (504) T ss_pred HHHhcccccccccHHHHHHHHHHHHHH----HHHHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccceeEecCCCccC Confidence 10 111112244433322222111 233333323222 3343333322 12344556666555442 1222222 Q ss_pred HHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhh------- Q lcl|NC_015159. 439 RGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQM------- 511 (532) Q Consensus 439 raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~------- 511 (532) .++.++.+..+ .+..+..+ ... +.+.+.+|+++..+-+-. +++++++.+....++..+. T Consensus 413 ~a~~aDa~~Kl----~~ag~~l~--~~~----~~l~~~lg~~~~ei~r~~----~e~~~~~~~~~~~~l~~~~~~~~~~~ 478 (504) T protein:vir:99 413 KAAQADAGAKM----LGAGPEWL--KET----EVGLELLGLTPQQAKRAL----AERRRASSVSIIEALNRRQQEAATAG 478 (504) T ss_pred HHHHHHHHHHH----Hhhccccc--cch----HHHHhhcCCCHHHHHHHH----HHHHHHhhHHHHHHHhcccCCCCCCC Confidence 33333332222 22211110 011 222344577554332111 1111111111111111111 Q ss_pred ---hHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 512 ---GAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 512 ---~~~~~~~~~~~~~~~~g~~~~ 532 (532) ..+.+..++..-...+|.|++ T Consensus 479 ~~~~~~~~e~a~~~~~~~~~~p~~ 502 (504) T protein:vir:99 479 EDQDQGAGEPPANEPPAALGRPTL 502 (504) T ss_pred CCCCcCCCCCCCCCCCccCCCccc Confidence 111222233334455566666 No 75 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=98.59 E-value=2.2e-07 Score=57.04 Aligned_cols=426 Identities=7% Similarity=0.020 Sum_probs=194.2 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc-----cc-CCC-----------CCcccccccccccchHHHHHHHHH Q lcl|NC_015159. 9 FAADGAAAAYNRLKNDRGAYETRAEDCATYTIPS-----VF-PSA-----------TADGSTSYTTPWQSIGARGLNNLA 71 (532) Q Consensus 9 ~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~-~~~-----------~~~~~~~~~~~~dst~~~a~~~La 71 (532) ++.+.+.+....+..+.+.-.+++.++.+|..=. +- ..+ .........++..+-+...++..+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 8888888888888777666666777777776432 00 000 000111233566666666666666 Q ss_pred HHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecc Q lcl|NC_015159. 72 SKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPS 151 (532) Q Consensus 72 a~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~ 151 (532) +.|.+ -|+. +...|.. +.+.|+ .+...+|.....++.++...+|.+.+++-. T Consensus 81 ~yl~G--~p~~-----~~~~~~~-------------~~~~l~--------~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~ 132 (471) T protein:vir:10 81 AYALT--YPPT-----FDVDDKK-------------VNDMIV--------DVLGDDYERISKQLCVNAGNAGIAWLHVWK 132 (471) T ss_pred hhhcc--cCce-----eccCChH-------------HHHHHH--------HHHhcCHHHHHHHHHHHHhhCCeEEEEEEe Confidence 55543 2322 3333221 222221 222468888999999999999999877654 Q ss_pred cccccCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEE------EEEe Q lcl|NC_015159. 152 TEQVEGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYT------HVYR 223 (532) Q Consensus 152 ~~~~~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~------~v~~ 223 (532) +. +.+.+++.+++..+.++--|. .+++...+|.+....... .+....+++|+ .+.. T Consensus 133 d~--~~g~~~~~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~--------------~~~~~~~~vy~~~~~~~y~~~ 196 (471) T protein:vir:10 133 DA--SDNSFRYACVDSKEVIPIYSKSLDKKSIGVLRVYSSIDETD--------------GKNYTVYEYWNDKECSFYRHE 196 (471) T ss_pred eC--CCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEeeccCC--------------CceeEEEEEEeCCcEEEEEec Confidence 42 234567888888775554443 456776666554221110 11122233332 1111 Q ss_pred eCCC-------CeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 224 DPEA-------MVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMI 296 (532) Q Consensus 224 ~~~~-------~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~ 296 (532) .... .++.......|.........++|..+|++..+. +.+|.|=.+...+-+-.++.+.-......+. T Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~ 271 (471) T protein:vir:10 197 KEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFKN-----NEIETNDLKPIKDLVDVYDKVFSGFVNDTDD 271 (471) T ss_pred CCcccccccccccccccccccccccccccccCCCCceeEEEecc-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHH Confidence 1100 001111111222222222245777899887755 4578888899999999999888888888899 Q ss_pred HhcCceeecCcc-ccChhhhccCC-Cceeec-C--ccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCC Q lcl|NC_015159. 297 SSKVLFFVNPNG-VTQIRRVAKAN-TGDFVA-G--RKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGD 371 (532) Q Consensus 297 a~~p~~lv~~~g-~~~~~~~~~~~-~G~~v~-g--~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~ 371 (532) ..+|.+.+.-.. -...+...... .+.+.. + ...+..+..+....+.+.....++.+++.|-+.-..-........ T Consensus 272 ~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~g 351 (471) T protein:vir:10 272 VQEVIFVLTNYGGQDKQEFLEDLKRYKMIKMDNDGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDKLG 351 (471) T ss_pred hhCceeeeecCCccccchhHHHhhcCCeEEecCCCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCccccc Confidence 999876664321 11122111111 222322 1 112222333334446677777777777777553221111111112 Q ss_pred CCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCccccccceeec--chHHHHHHHHHHHHHH Q lcl|NC_015159. 372 RVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQAT-SKIPNLPKEAVEPAIAT--GLEALGRGHDLNKLNV 448 (532) Q Consensus 372 ~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~-g~lp~~p~~~~~~~~v~--~l~~l~raq~~~~l~~ 448 (532) ..|+.-+..+..-.. --.++....| ...+.+++.++... |.. ....+++.+.. +.+.+..++-+.+ T Consensus 352 n~Sg~Alk~~~~~l~----~k~~~~~~~~-~~~l~~~~~li~~~~~~~---d~~~i~i~f~~~~p~n~~e~~~~~~k--- 420 (471) T protein:vir:10 352 NSSGVALKFLYSLLE----LKAGNMETQF-RSGYATLVKMILKHLGLS---DKLKIKQTWTRNSINNDTEMAQVVST--- 420 (471) T ss_pred CccHHHHHHHHHHHH----HHHHHHHHHH-HHHHHHHHHHHHHHhccC---CCceeEEEeCCCCCCCHHHHHHHHHH--- Confidence 334443332211111 1122222222 22223333333221 211 12234444322 2222222222111 Q ss_pred HHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhccccc Q lcl|NC_015159. 449 FIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQA 527 (532) Q Consensus 449 ~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (532) ++. .+....++. .++ ++.+ ++|++...++++.++. .+.... T Consensus 421 ----l~g-------~iS~et~~~----~~p-----~v~D~~~E~eri~~E~~~~~~------------------~~~~~~ 462 (471) T protein:vir:10 421 ----LAT-------ITSRENVAK----SNP-----IVEDWQDELRLQKAEQEGRSE------------------KLYDME 462 (471) T ss_pred ----Hhc-------cCchHHHHH----hCC-----CCCCHHHHHHHHHHHHHHHHh------------------cccccC Confidence 111 122333332 222 2223 2344333322221111 011111 Q ss_pred CCCCC Q lcl|NC_015159. 528 GLPTQ 532 (532) Q Consensus 528 g~~~~ 532 (532) |--++ T Consensus 463 ~~~~~ 467 (471) T protein:vir:10 463 EVEHE 467 (471) T ss_pred CCCCc Confidence 11111 No 76 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=98.59 E-value=2.3e-07 Score=57.01 Aligned_cols=437 Identities=12% Similarity=0.054 Sum_probs=184.0 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCC---CCcccccccccccchHHHHHHHHHHHHHHh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSA---TADGSTSYTTPWQSIGARGLNNLASKLMLA 77 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~---~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 77 (532) |+- ..+.+...++.+..+ .++...+.+|..-...... ..+...+..++..+-+..+++++++.| T Consensus 1 ~~t------~~d~i~~L~~~~~~~----~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l--- 67 (480) T protein:vir:78 1 MTT------YHEHVERLQGLLARD----LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL--- 67 (480) T ss_pred CCC------HHHHHHHHHHHHHHH----HHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhh--- Confidence 432 244555555555443 3455555555433211111 111111222345566677777777765 Q ss_pred hcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccc---c Q lcl|NC_015159. 78 LFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTE---Q 154 (532) Q Consensus 78 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~---~ 154 (532) ++.+ |+.. .|.+ .. ..+.+.+..++|.....++.++..+||.|.++|...+ . T Consensus 68 -~~~g---~~~~-~d~~-------------~~-------~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~ 122 (480) T protein:vir:78 68 -DIEG---FRIS-EDSE-------------GL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESG 122 (480) T ss_pred -ccCc---eecC-CCch-------------hH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccC Confidence 2322 2221 1111 11 1233456678999999999999999999988775422 1 Q ss_pred ccCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEE---EEe-eCCCC Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTH---VYR-DPEAM 228 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~---v~~-~~~~~ 228 (532) +..+..++++++..+.++..|+ .+++...+|.+.-. + ..+....+++|+. ++. ...++ T Consensus 123 d~~~~~~i~~~~p~~~~~i~D~~~~~~~~~~i~~~~~~-d---------------~~~~~~~~~~y~~~~~~~~~~~~~~ 186 (480) T protein:vir:78 123 DPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR-D---------------DVAVPDRATLYLPDETVPLRRNGGL 186 (480) T ss_pred CCCCeeEEEEEcccceEEEEcCCCccceEEEEEEEEee-c---------------CCcceEEEEEEeCCeEEEEEecCCC Confidence 2345567888888877776675 45666655544211 0 1111233444321 111 11111 Q ss_pred eEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeecCc Q lcl|NC_015159. 229 VFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEE-YLGDLKSLENLYEAIVKMSMISSKVLFFVNPN 307 (532) Q Consensus 229 ~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~-al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~ 307 (532) ..... .......+++..+|++.+..+...+..||+|=.++ ..+-+..++...-.....++..+.|...+. T Consensus 187 ~~~~~-------~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~-- 257 (480) T protein:vir:78 187 NDQWV-------VDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-- 257 (480) T ss_pred ccccc-------ccccccccCCCCcceEEeecccccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhh-- Confidence 11100 11112245778899999999988899999997765 457778888877777778887777765442 Q ss_pred cccChhhh--------ccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhh-----hcccCCCCCC- Q lcl|NC_015159. 308 GVTQIRRV--------AKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLN-----SAVQRGGDRV- 373 (532) Q Consensus 308 g~~~~~~~--------~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~-----~~~~~~~~~~- 373 (532) |.. ++.+ ....+|.+..-..+++...++. .++++. .++.++.-|...+... .+. ...... T Consensus 258 G~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~---~~~~l~~~i~~~~~~~~~p~~~fg-~~~~n~~ 331 (480) T protein:vir:78 258 GVT-TDELTNDGENTTLDIYYGRILTLASEAAKISEFK-AAELRN---FAEEMEVFRKEAASITGLPPQYLS-SSSENPA 331 (480) T ss_pred CCC-ccccccccccchhhhhhhhhccCCCCCceEEecC-ccCHHH---HHHHHHHHHHHHhcccCCCHHHhc-cccCchh Confidence 111 1111 1111222222122233333332 223333 3333444443332211 111 111111 Q ss_pred CHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceee-cchHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 374 TAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIA-TGLEALGRGHDLNKLNVFIDY 452 (532) Q Consensus 374 TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v-~~l~~l~raq~~~~l~~~~~~ 452 (532) ++.-+..+...+.. ..++.+..| .+-+.+++.++....- -..+.+..++.++ ....+-.-++.++.+...++. T Consensus 332 Sg~Al~~~~~~l~~----k~~~~~~~f-~~~l~~~~rl~~~~~~-~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~ 405 (480) T protein:vir:78 332 SAEAIIATDSRIVK----MAERKGRIF-GGAWERAMRIAMQIMG-REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYAN 405 (480) T ss_pred HHHHHHHHHHHHHH----HHHHHHHHH-HHHHHHHHHHHHHHcC-CCccccceeeeEEecCCCCCCHHHHHHHHHHHHHh Confidence 33323222222111 233444433 2333444444433210 0122233333332 111111122223322222221 Q ss_pred HHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 453 MIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 453 laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) . .+ .+.-. . +...+|+.+ ++++.+.+.+ ++... ....+..++....+.+.-++++|-.++ T Consensus 406 g---~~----~~s~e-t---~~~~lg~~~-------d~~~e~~~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 465 (480) T protein:vir:78 406 G---QG----PIPKE-Q---ARIDLGYTA-------TQREQMRDWD-KQETE-DMIDTLYSTTKAQADATPKPTVTETKT 465 (480) T ss_pred c---cc----CCCHH-H---HHhcCCCCH-------hHHHHHHHHH-HHHHH-HHHHHhhccccCCCccccCCCCCCCCC Confidence 1 11 11221 1 223356532 3332222111 11111 111111111111111111222221111 No 77 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=98.58 E-value=2.4e-07 Score=56.89 Aligned_cols=430 Identities=9% Similarity=-0.011 Sum_probs=194.8 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFP 80 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltp 80 (532) |.+. ..+..+.+.+..+.+.. |.+....++++|+-.-+-+..... ...+...++-.+.+...++..++.|.+- | T Consensus 11 ~~~~--~~~~~~~i~~~i~~~~~-~~~r~~~~~~yy~g~~~i~~~~~~-~~~~~~~ki~~n~~~~ivd~~~~~l~g~--~ 84 (453) T protein:vir:73 11 YSRD--EEITDKVVNDFMKKHQE-EVERYEYLGNMYKGIMEISSQKAK-DSWKPDNRLTNNFAKYIVDTFVGYFNGI--P 84 (453) T ss_pred cccc--ccCCHHHHHHHHHHHHH-HHHHHHHHHHHhccccchhcCCCC-CccCccceeecchHHHHHHHhhhhhccc--C Confidence 4433 34456667666666654 445555566666643222111111 1122344666778888888888776542 2 Q ss_pred CCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcc Q lcl|NC_015159. 81 VGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSN 160 (532) Q Consensus 81 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~ 160 (532) +++...+.. +.+ .+...+..++|.....++.++..+||.|.+++..++ .+.+ T Consensus 85 -----~~~~~~d~~-------------~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~---~~~~ 136 (453) T protein:vir:73 85 -----IKKTHDDKS-------------VLE-------AMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNE---STES 136 (453) T ss_pred -----ceeecCChH-------------HHH-------HHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCC---CCce Confidence 223332211 111 233456678899999999999999999988776543 3455 Q ss_pred eEEEEecce-EEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCc Q lcl|NC_015159. 161 APKLYKLHN-FVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGE 239 (532) Q Consensus 161 ~~~~~pl~~-~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~ 239 (532) ++.+++..+ |++-.|..++....+.++... .+....++||+. + . ..++..++. T Consensus 137 ~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~------------------~~~~~~~~vyt~-----~-~--i~~~~~~~~ 190 (453) T protein:vir:73 137 EVIYCSPLNVFMVYDDSIKQKPLFAVYYGFD------------------EEGNLSGTVYTL-----L-E--TISITGKAG 190 (453) T ss_pred EEEEEcccceEEEEeCCCCceeEEEEEEEEe------------------cCceEEEEEEeC-----C-e--EEEEEecCC Confidence 677776544 566666656655444444321 011233444431 1 1 111111111 Q ss_pred cc-ccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccC Q lcl|NC_015159. 240 IV-AGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKA 318 (532) Q Consensus 240 ~~-~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~ 318 (532) .. .......++..+|++.++ ++.+|+|-.+...+-+-.++.+.-......+....|.+.+.. .....+..... T Consensus 191 ~~~~~~~~~~~~g~vPvv~~~-----n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g-~~~~~~~~~~~ 264 (453) T protein:vir:73 191 EVKFGESTYNVYSDLPIVEYN-----FNEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLG-AEVDEEDAKNI 264 (453) T ss_pred ceEEccceeccCCceeEEEec-----CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeec-CCCCchhhhcc Confidence 11 111234567789988654 346789988889999999999888888888888888766631 11111222211 Q ss_pred CCceee------cC----ccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_015159. 319 NTGDFV------AG----RKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDT 388 (532) Q Consensus 319 ~~G~~v------~g----~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~ 388 (532) ..+..+ ++ ...+..+..+....+.......++.++..|...-..-.+........|+.-+..+..-+. . T Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~-~ 343 (453) T protein:vir:73 265 KDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAANISDENFGNSSGVALAYKLQAMS-N 343 (453) T ss_pred cccccccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccCccHHHHHHHHHHHH-H Confidence 111110 10 011111112222234455566677777766443211111111212345554433322111 1 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHH Q lcl|NC_015159. 389 LGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLD 468 (532) Q Consensus 389 LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~ 468 (532) ..--..+...+.+.-++..+..++...|. +..-..+++.+...+ +-..++.++.+.... .+ +.... T Consensus 344 ka~~~~~~~~~~l~~~~~li~~~~~~~~~--~~~~~~i~v~f~~~~-p~~~~~~a~~~~k~~----gi-------is~et 409 (453) T protein:vir:73 344 LALSFQRKFQSALNRRYSLWSSLSTNASN--KDAWKDIEYTFTRNE-PKDIKEQAETANILK----GI-------TSEET 409 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCC--ccccccceEEeCCCC-CCCHHHHHHHHHHHh----cc-------CcHHH Confidence 22222333333333344444444433332 111123344442222 111222222222211 11 22222 Q ss_pred HHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhh Q lcl|NC_015159. 469 VKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAM 522 (532) Q Consensus 469 ~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (532) ++ +.++. +.+ ++|++..+++++.+..+++- ..+....+.-..+ T Consensus 410 ~~----~~~~~-----~~d~~~E~~ri~~E~~~~~~~~~~--~~~~~~~~~~~~~ 453 (453) T protein:vir:73 410 AL----SVISV-----IPDVQAEMEKIKKKKLLQLSLTRT--SNLVRMKQMRGNL 453 (453) T ss_pred HH----HhCCC-----CCCHHHHHHHHHHHHHHHHHHHHh--ccCCcchhhhcCC Confidence 22 23332 222 34444443333322221111 1111111111111 No 78 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=98.58 E-value=2.5e-07 Score=56.82 Aligned_cols=442 Identities=10% Similarity=0.034 Sum_probs=201.8 Q ss_pred CCCC----------CCCccCHHHHHHHHHHHHHH-hhhHHHHHHHHHHhh--cccccCCC-CC-----cccccccccccc Q lcl|NC_015159. 1 MAEV----------EKTGFAADGAAAAYNRLKND-RGAYETRAEDCATYT--IPSVFPSA-TA-----DGSTSYTTPWQS 61 (532) Q Consensus 1 m~~~----------~~~~~~~~~~~~r~~~lk~~-R~~~e~~w~e~~~~~--~P~~~~~~-~~-----~~~~~~~~~~ds 61 (532) |-+. +-.......+.+..+.+..+ |-+...+++++++-- ++.+-... +. +..+...++-.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~ 80 (479) T protein:vir:79 1 MLNIYISETDLIKVQLKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINN 80 (479) T ss_pred CCCceecccceEeeccccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecc Confidence 1111 11112344566666666544 434444444444311 22221110 10 111223356677 Q ss_pred hHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHh Q lcl|NC_015159. 62 IGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLV 141 (532) Q Consensus 62 t~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~ 141 (532) .+...++..++.|++- |+ +++..+.. +.+.+ ..+..++|.....++.++..+ T Consensus 81 ~~~~Ivd~~~~~l~g~--p~-----~~~~~~~~-------------~~~~~--------~~~~~n~~~~~~~~~~~~~~~ 132 (479) T protein:vir:79 81 YHKLLVDQKVGYSVGN--PI-----VFNADDDN-------------LTKLL--------NDLLGEEFDDTITELYLNASN 132 (479) T ss_pred hHHHHHHHHHhhhhcC--Cc-----eeccCCHH-------------HHHHH--------HHHHhcCHHHHHHHHHHHHHh Confidence 7777788777776542 22 22333222 22222 344457899999999999999 Q ss_pred hCceeeeecccccccCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEE Q lcl|NC_015159. 142 AGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYT 219 (532) Q Consensus 142 ~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~ 219 (532) ||.+++++..++ ++.++++.++..+++...|. .+++...+|.+...-. +.+....+++|+ T Consensus 133 ~G~~~~~v~~d~---~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~~---------------~~~~~~~~e~y~ 194 (479) T protein:vir:79 133 KGVEWLHPYINR---KGEFKYVIIPAEEAIPIWDSKRQRELVAFIRFYYIEDI---------------DGNKIKRVEYYT 194 (479) T ss_pred cCeEEEEEEeCC---CCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEeec---------------CCceEEEEEEEe Confidence 999988776543 34567888887775555443 5667766665553210 011112333332 Q ss_pred E---EEeeCCCCeEEEE---------EEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_015159. 220 H---VYRDPEAMVFRSY---------QEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLY 287 (532) Q Consensus 220 ~---v~~~~~~~~~~s~---------~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~ 287 (532) . ++...++..+... ...+...........+|..+|++..+- +.+|+|-.+...+-+..++.+. T Consensus 195 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~g~sd~~~v~~liDa~d~~~ 269 (479) T protein:vir:79 195 ENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKN-----NEKCVSDLTFYKSLIDIYDNNI 269 (479) T ss_pred CCcEEEEEecCCcccccccccccccccccccccccccccccCCCcccEEEecC-----CCCCCcchhhhHHHHHHHHHHH Confidence 1 1111111111110 011111111122345778899987754 4679998899999999999888 Q ss_pred HHHHHHHHHHhcCceeecCccccChhh-hccCCCceeec-CccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_015159. 288 EAIVKMSMISSKVLFFVNPNGVTQIRR-VAKANTGDFVA-GRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSA 365 (532) Q Consensus 288 ~~~l~~~~~a~~p~~lv~~~g~~~~~~-~~~~~~G~~v~-g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~ 365 (532) -......+...+|.+++.........+ ......+.++. ...+++..+ ....+.......++.+++.|...-..-.+ T Consensus 270 S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~i~~~s~~p~~ 347 (479) T protein:vir:79 270 STLADNLDEIQEVIYVLKEYPGTSLQEFIDNIRYYKSIKVDGGGGVDKL--EINIPVEAKKELLDRLEKNIIIFGQGVNP 347 (479) T ss_pred HHHHHHHHHhhCceeeeecCCccccccchhhhhhccceecCCCCcceEE--eccCCHHHHHHHHHHHHHHHHHHhCcccc Confidence 888888898889887764321111111 11112222222 223334433 33346677778888888777554322122 Q ss_pred ccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHH Q lcl|NC_015159. 366 VQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNK 445 (532) Q Consensus 366 ~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~ 445 (532) ........|++.+..+..-. .....-..+.-.+.+.-+++.+..++...+.. ......+++.+...+ |-..+..++. T Consensus 348 ~~~~~gn~Sg~Ai~~~~~~l-~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~-~~~~~~i~i~f~~~~-p~~~~~~a~~ 424 (479) T protein:vir:79 348 ESQNTGDKSGVALKFLYSLL-DLKCSKTEKKFKKAIRELLWFVCEYLKISGNK-SYDYKTVQITFNHSM-IINEAEKIDM 424 (479) T ss_pred ccccccchhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-ccccccceEEeCCCC-CcCHHHHHHH Confidence 22222335666554432222 22223333333444444444444444333321 122223334332222 1122222222 Q ss_pred HHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcc Q lcl|NC_015159. 446 LNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQ 524 (532) Q Consensus 446 l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (532) +... +. .+....++. .++. +.+ ++|++..++++..+..... . .-. T Consensus 425 ~~kl----~g-------~iS~et~l~----~l~~-----v~d~~~E~~ri~~E~~~~~~~~~---~-----------~~~ 470 (479) T protein:vir:79 425 AAKS----TG-------IVSDETIVS----NHPW-----VEDVNDELERLKKQEDTQKEYDD---L-----------IPN 470 (479) T ss_pred HHHH----hc-------cCcHHHHHH----hCCC-----CCCHHHHHHHHHHHHHHHHHHHh---c-----------cCc Confidence 2221 11 122333332 2332 222 3444333332222111111 0 112 Q ss_pred cccCCCCC Q lcl|NC_015159. 525 QQAGLPTQ 532 (532) Q Consensus 525 ~~~g~~~~ 532 (532) ...|.+.| T Consensus 471 ~~~~~~~e 478 (479) T protein:vir:79 471 NQDGVIDE 478 (479) T ss_pred ccCCCcCc Confidence 23344444 No 79 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=98.56 E-value=2.7e-07 Score=56.56 Aligned_cols=431 Identities=11% Similarity=0.039 Sum_probs=198.9 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccC--CCCCcccccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFP--SATADGSTSYTTPWQSIGARGLNNLASKLMLAL 78 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~--~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 78 (532) +-=++-..++.+.+.+..+..+.++ ++.+.+.+|..-.--. .......+...++..+.+...++..++.|++- T Consensus 9 ~~~~~~~~~~~~~i~~~i~~~~~~~----~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~- 83 (452) T protein:vir:36 9 MTFSKDEPITVEVVTKFMEKHKLEV----ARYEYLKNMYLGIMAIDDEPAKDSWKPDNRLAVNFTKYIVDTFTGYFNGI- 83 (452) T ss_pred EEcCCccCCCHHHHHHHHHHHHHHH----HHHHHHHHHhccccccccCccccccCccceeecchHHHHHHHHhhhhccc- Confidence 1122223345666766666555433 4555666665543111 01111112234666777788888888776541 Q ss_pred cCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCC Q lcl|NC_015159. 79 FPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQ 158 (532) Q Consensus 79 tpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~ 158 (532) | +++...|.. + ...+.+.+..++|....+++.++..++|.+++++-.++ .+ T Consensus 84 -~-----~~~~~~d~~-------------~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~---~g 134 (452) T protein:vir:36 84 -P-----VKKSHSDKE-------------I-------LTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDE---DT 134 (452) T ss_pred -C-----ceeecCChh-------------H-------HHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecC---CC Confidence 1 223333321 1 12344566778999999999999999999988876543 35 Q ss_pred cceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEE Q lcl|NC_015159. 159 SNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEI 236 (532) Q Consensus 159 ~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~ 236 (532) .+++.+++..+.+...|. .+++...+|.+.- .+....+++|+. +. -| ++.. T Consensus 135 ~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~-------------------~~~~~~~~vyt~-----~~-i~--~~~~ 187 (452) T protein:vir:36 135 QTNVVYNSPENMFMVYDDTVKQEPLFAVRYGVD-------------------EDKKLQGEVYTL-----LE-TI--KISG 187 (452) T ss_pred eeEEEEEcccceEEEEcCCCCCceEEEEEEEEe-------------------cCceEEEEEEec-----Ce-EE--EEEE Confidence 567888887665544443 3445444443320 011123444421 11 01 1111 Q ss_pred cC-cccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhh Q lcl|NC_015159. 237 DG-EIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRV 315 (532) Q Consensus 237 ~~-~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~ 315 (532) ++ ..........++..+|++..+. +..|+|=.+...+-+..++.+.-......+....|.+.+.. .....+.. T Consensus 188 ~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g-~~~~~~~~ 261 (452) T protein:vir:36 188 ENDEISFGEGTYNPYPDLPVVEFYF-----NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLG-AAVEEEDL 261 (452) T ss_pred cCCceEEecceeccCCcccEEEecC-----CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeec-CCcCchhh Confidence 11 1111122234677899877654 34688888889999999999999999999899998877642 22233333 Q ss_pred ccCCCcee--ecCc--cccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhh Q lcl|NC_015159. 316 AKANTGDF--VAGR--KQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGG 391 (532) Q Consensus 316 ~~~~~G~~--v~g~--~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGp 391 (532) .....+.. ++.. ..+..+..+....+.......++.+++.|...-..-.+........|+..+..+..-.... .- T Consensus 262 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k-~~ 340 (452) T protein:vir:36 262 KNIRSNRVINYYADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNL-AL 340 (452) T ss_pred hhhhhcceEEecCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccccCcccccCCcHHHHHHHHHHHHHH-HH Confidence 33222222 2221 1111222233334556666677777776644321111111122345666554433222222 22 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeec--chHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHH Q lcl|NC_015159. 392 VYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIAT--GLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDV 469 (532) Q Consensus 392 v~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~--~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~ 469 (532) -..+.-...+..+++-++.++...|.- .....+++.+.. +.+.+..++ .+.. ++.+ +....+ T Consensus 341 ~~~~~~~~~l~~~~~li~~~~~~~~~~--~~~~~i~i~f~~~~p~d~~~~a~---~~~k----~~g~-------iS~et~ 404 (452) T protein:vir:36 341 SFQRKFQSSLNSRYKLFCELSTNVSNK--DSWKDIEYTFTRNEPKDIKEQAE---TANI----LMGI-------TSQETA 404 (452) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCc--cccccceEEeCCCCCcCHHHHHH---HHHH----Hhcc-------CChHHH Confidence 233333444444444455555444421 111233444322 222222222 2211 1221 222333 Q ss_pred HHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 470 KMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 470 ~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) + +.+|. +.+ ++|++...++++.++. ......+. .....+..+.-.+ T Consensus 405 ~----~~~~~-----~~d~~~E~~ri~~E~~~~~~--~~~~~~~~------~~~~~~~~~~~~~ 451 (452) T protein:vir:36 405 L----SVISV-----IPDVQAEMEKIKKEEASTAI--FDKDKQPS------EKGTDTVVSETNE 451 (452) T ss_pred H----HhCCC-----CCCHHHHHHHHHHHHHHHHH--HHhhccCC------CCcccccCccccC Confidence 3 33332 222 3444333332221111 11100000 0111111111111 No 80 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=98.54 E-value=3.2e-07 Score=56.22 Aligned_cols=440 Identities=12% Similarity=0.003 Sum_probs=203.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhc-----ccc-----cCCCC-----CcccccccccccchHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTI-----PSV-----FPSAT-----ADGSTSYTTPWQSIGAR 65 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~-----P~~-----~~~~~-----~~~~~~~~~~~dst~~~ 65 (532) +-+.+-..++.+.|.+..+.-+..|..+...++.+-.+.. ... ..... ....+...|+..+-+.. T Consensus 7 ~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ 86 (474) T protein:vir:10 7 IDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSE 86 (474) T ss_pred HhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHH Confidence 3444555567777777777766666655555544433221 110 00000 01111233566666666 Q ss_pred HHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCce Q lcl|NC_015159. 66 GLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNV 145 (532) Q Consensus 66 a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~ 145 (532) .+++.++.|++- |+. +...+.. ....++.++| .+.+..++|.....++.++..+||.| T Consensus 87 ivd~~~~yl~g~--pv~-----~~~~~~~--------~~~e~~~~~l-------~~~~~~n~~~~~~~~~~~~~~~~G~a 144 (474) T protein:vir:10 87 IVDTRVGYLHGV--PVT-----YDLDENA--------EKNEKLKKFI-------TNFAIRNSVDDEDSEIGKMAAICGYG 144 (474) T ss_pred HHHhHhhheecc--cee-----EeeCCCC--------cchHHHHHHH-------HHHHhhcCHhHHHHHHHHHHhhcCeE Confidence 666666555431 332 2222110 0112344444 35666788999999999999999999 Q ss_pred eeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeC Q lcl|NC_015159. 146 LLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDP 225 (532) Q Consensus 146 ~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~ 225 (532) .+++..++ .+.+++.+++..+.++-.|..+.....+|.+... +......+++.-..++ T Consensus 145 ~~~~~~d~---~~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~-------------------~~~~~~~~~~~~~y~~ 202 (474) T protein:vir:10 145 ARLAYIDT---NGDIRIKNIDPYNVIFVGDNILEPTYSLRYFYEK-------------------DDDNGTDYVYAEFYDN 202 (474) T ss_pred EEEEEeCC---CCeeEEEEEcccceEEEEcCCCceEEEEEEEEEe-------------------eCCCceEEEEEEEEcC Confidence 88875432 3456788888777555557777766555443311 0011112222222222 Q ss_pred CCCeEEEEEEEcCcc--cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Q lcl|NC_015159. 226 EAMVFRSYQEIDGEI--VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFF 303 (532) Q Consensus 226 ~~~~~~s~~~~~~~~--~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~l 303 (532) + . ..++..++.. ........++..+|++..+ ++.+|.|=.+...+-+..++.+.-......+....|.+. T Consensus 203 ~-~--~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~ 274 (474) T protein:vir:10 203 A-Y--YYVFRGEGIDALQEVGRYEHLFDYNPLFGVP-----NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLV 274 (474) T ss_pred c-e--EEEEeecCCCcccccccccCCCCccceEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhh Confidence 2 1 1122222211 1112234567778887654 456899999999999999999888888888988998877 Q ss_pred ecCccccChhhhccCC-Cceee-cCccccccccccCCccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCCCCHHHHHH Q lcl|NC_015159. 304 VNPNGVTQIRRVAKAN-TGDFV-AGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM-LNSAVQRGGDRVTAEEIRY 380 (532) Q Consensus 304 v~~~g~~~~~~~~~~~-~G~~v-~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~~~TAtEi~~ 380 (532) +..-+ +..+...... .|.+. .+..+++.. +....+.......++.+++.|...-. .+.....-+...|+..+.. T Consensus 275 i~g~~-~~~~~~~~~~~~~~i~~~~~~~~~~~--l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~ 351 (474) T protein:vir:10 275 LRGMG-MSEEMIQETQKSGAFELFDKDMDVKY--LTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKL 351 (474) T ss_pred hccCC-CCchhhhhhhhcceeEecCCCCceeE--EeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHH Confidence 64321 1222222222 24432 233333433 33334556667777777777755321 1111111223456665554 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCC-CCCccccccceee--cchHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_015159. 381 VAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKI-PNLPKEAVEPAIA--TGLEALGRGHDLNKLNVFIDYMIKLA 457 (532) Q Consensus 381 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~l-p~~p~~~~~~~~v--~~l~~l~raq~~~~l~~~~~~laq~~ 457 (532) +..-.. +......+.-.+.+.-+++-++.++...|.- .+..-..+++.+. .+.+.+..++-+.++. T Consensus 352 ~~~~l~-~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~---------- 420 (474) T protein:vir:10 352 KLMALE-NKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK---------- 420 (474) T ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh---------- Confidence 322222 2222233333333434444444444333211 1111122333332 2333333333222221 Q ss_pred chhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 458 GLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 458 p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) + .+....+++ .++. +.+ ++|++...++++..+. .. +... .+...+.+++ T Consensus 421 g----~iS~et~~~----~l~~-----v~d~~~E~eri~~E~~e~~~--~~----~~~~-------~~~~~~~~~~ 470 (474) T protein:vir:10 421 G----QVSERTRLG----QSQL-----VDDVDYELDEMEKESLEFND--KL----PDID-------EGDANDKSQN 470 (474) T ss_pred c----cCchHHHHH----hCCC-----CCCHHHHHHHHHHHHHHHHh--hc----cccc-------CCCcCCCCcc Confidence 1 122222332 2331 222 2344333322221111 10 0000 0011111111 No 81 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=98.54 E-value=3.2e-07 Score=56.22 Aligned_cols=440 Identities=12% Similarity=0.003 Sum_probs=203.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhc-----ccc-----cCCCC-----CcccccccccccchHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTI-----PSV-----FPSAT-----ADGSTSYTTPWQSIGAR 65 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~-----P~~-----~~~~~-----~~~~~~~~~~~dst~~~ 65 (532) +-+.+-..++.+.|.+..+.-+..|..+...++.+-.+.. ... ..... ....+...|+..+-+.. T Consensus 7 ~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ 86 (474) T protein:vir:94 7 IDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSE 86 (474) T ss_pred HhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHH Confidence 3444555567777777777766666655555544433221 110 00000 01111233566666666 Q ss_pred HHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCce Q lcl|NC_015159. 66 GLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNV 145 (532) Q Consensus 66 a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~ 145 (532) .+++.++.|++- |+. +...+.. ....++.++| .+.+..++|.....++.++..+||.| T Consensus 87 ivd~~~~yl~g~--pv~-----~~~~~~~--------~~~e~~~~~l-------~~~~~~n~~~~~~~~~~~~~~~~G~a 144 (474) T protein:vir:94 87 IVDTRVGYLHGV--PVT-----YDLDENA--------EKNEKLKKFI-------TNFAIRNSVDDEDSEIGKMAAICGYG 144 (474) T ss_pred HHHhHhhheecc--cee-----EeeCCCC--------cchHHHHHHH-------HHHHhhcCHhHHHHHHHHHHhhcCeE Confidence 666666555431 332 2222110 0112344444 35666788999999999999999999 Q ss_pred eeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeC Q lcl|NC_015159. 146 LLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDP 225 (532) Q Consensus 146 ~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~ 225 (532) .+++..++ .+.+++.+++..+.++-.|..+.....+|.+... +......+++.-..++ T Consensus 145 ~~~~~~d~---~~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~-------------------~~~~~~~~~~~~~y~~ 202 (474) T protein:vir:94 145 ARLAYIDT---NGDIRIKNIDPYNVIFVGDNILEPTYSLRYFYEK-------------------DDDNGTDYVYAEFYDN 202 (474) T ss_pred EEEEEeCC---CCeeEEEEEcccceEEEEcCCCceEEEEEEEEEe-------------------eCCCceEEEEEEEEcC Confidence 88875432 3456788888777555557777766555443311 0011112222222222 Q ss_pred CCCeEEEEEEEcCcc--cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Q lcl|NC_015159. 226 EAMVFRSYQEIDGEI--VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFF 303 (532) Q Consensus 226 ~~~~~~s~~~~~~~~--~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~l 303 (532) + . ..++..++.. ........++..+|++..+ ++.+|.|=.+...+-+..++.+.-......+....|.+. T Consensus 203 ~-~--~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~ 274 (474) T protein:vir:94 203 A-Y--YYVFRGEGIDALQEVGRYEHLFDYNPLFGVP-----NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLV 274 (474) T ss_pred c-e--EEEEeecCCCcccccccccCCCCccceEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhh Confidence 2 1 1122222211 1112234567778887654 456899999999999999999888888888988998877 Q ss_pred ecCccccChhhhccCC-Cceee-cCccccccccccCCccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCCCCHHHHHH Q lcl|NC_015159. 304 VNPNGVTQIRRVAKAN-TGDFV-AGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM-LNSAVQRGGDRVTAEEIRY 380 (532) Q Consensus 304 v~~~g~~~~~~~~~~~-~G~~v-~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~~~TAtEi~~ 380 (532) +..-+ +..+...... .|.+. .+..+++.. +....+.......++.+++.|...-. .+.....-+...|+..+.. T Consensus 275 i~g~~-~~~~~~~~~~~~~~i~~~~~~~~~~~--l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~ 351 (474) T protein:vir:94 275 LRGMG-MSEEMIQETQKSGAFELFDKDMDVKY--LTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKL 351 (474) T ss_pred hccCC-CCchhhhhhhhcceeEecCCCCceeE--EeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHH Confidence 64321 1222222222 24432 233333433 33334556667777777777755321 1111111223456665554 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCC-CCCccccccceee--cchHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_015159. 381 VAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKI-PNLPKEAVEPAIA--TGLEALGRGHDLNKLNVFIDYMIKLA 457 (532) Q Consensus 381 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~l-p~~p~~~~~~~~v--~~l~~l~raq~~~~l~~~~~~laq~~ 457 (532) +..-.. +......+.-.+.+.-+++-++.++...|.- .+..-..+++.+. .+.+.+..++-+.++. T Consensus 352 ~~~~l~-~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~---------- 420 (474) T protein:vir:94 352 KLMALE-NKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK---------- 420 (474) T ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh---------- Confidence 322222 2222233333333434444444444333211 1111122333332 2333333333222221 Q ss_pred chhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 458 GLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 458 p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) + .+....+++ .++. +.+ ++|++...++++..+. .. +... .+...+.+++ T Consensus 421 g----~iS~et~~~----~l~~-----v~d~~~E~eri~~E~~e~~~--~~----~~~~-------~~~~~~~~~~ 470 (474) T protein:vir:94 421 G----QVSERTRLG----QSQL-----VDDVDYELDEMEKESLEFND--KL----PDID-------EGDANDKSQN 470 (474) T ss_pred c----cCchHHHHH----hCCC-----CCCHHHHHHHHHHHHHHHHh--hc----cccc-------CCCcCCCCcc Confidence 1 122222332 2331 222 2344333322221111 10 0000 0011111111 No 82 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=98.54 E-value=3.2e-07 Score=56.21 Aligned_cols=436 Identities=9% Similarity=0.028 Sum_probs=186.3 Q ss_pred CCCCCCCcc-CHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc-----ccCC-C-C--CcccccccccccchHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGF-AADGAAAAYNRLKNDRGAYETRAEDCATYTIPS-----VFPS-A-T--ADGSTSYTTPWQSIGARGLNNL 70 (532) Q Consensus 1 m~~~~~~~~-~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~-~-~--~~~~~~~~~~~dst~~~a~~~L 70 (532) |........ ..+.+.+..+..+.+ .++++.+.+|..-. +-.. . . ....+...++..+-+...++++ T Consensus 35 ~~~~~~~~~~~~~~i~~~i~~~~~~----~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~ 110 (492) T protein:vir:94 35 IVRTNNKPETLEEMIVRYIKQHLEK----LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQK 110 (492) T ss_pred ccccCCchhhHHHHHHHHHHHHHHH----HHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHH Confidence 332222111 223333333444432 34555666654321 1111 0 0 0111223467778888888888 Q ss_pred HHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeec Q lcl|NC_015159. 71 ASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIP 150 (532) Q Consensus 71 aa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~ 150 (532) ++.|++ -| +.+...|.. +.+.|. .. ..++|-....++.++..++|.+.+++. T Consensus 111 ~~yl~G--~p-----~~~~~~d~~-------------~~~~l~-------~~-~~n~~~~~~~~~~~~a~~~G~a~~~v~ 162 (492) T protein:vir:94 111 VSYIVG--KP-----IAFKHTDDE-------------VVKRID-------EV-LGNRFDDKLHSVLTGASNKGIEWLHPY 162 (492) T ss_pred Hhhhcc--cC-----ceeccCchH-------------HHHHHH-------HH-HhccHHHHHHHHHHHHhhCCeEEEEEE Confidence 876643 12 122333321 222221 12 235788889999999999999988776 Q ss_pred ccccccCCcceEEEEecceEEEee--CCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEE---EEeeC Q lcl|NC_015159. 151 STEQVEGQSNAPKLYKLHNFVVER--DAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTH---VYRDP 225 (532) Q Consensus 151 ~~~~~~~~~~~~~~~pl~~~~v~~--d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~---v~~~~ 225 (532) .++ .+.+.+++++..+.++.. +..+++...+|.+... ....+++|+. .+... T Consensus 163 ~d~---dg~~~~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~--------------------~~~~~~~y~~~~v~~~~~ 219 (492) T protein:vir:94 163 LDE---EGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLE--------------------NETKVEYWDKVTVNYYVY 219 (492) T ss_pred ecC---CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeec--------------------cceeEEEEecCeEEEEEE Confidence 543 345677777776644433 3467787666655421 1122344431 11111 Q ss_pred CCCeEEEE--EEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Q lcl|NC_015159. 226 EAMVFRSY--QEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFF 303 (532) Q Consensus 226 ~~~~~~s~--~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~l 303 (532) ++..+... ...++.. ......++..+|++..+- +.+|.|=.+..++-+..++.+.-......+....|.++ T Consensus 220 ~~~~~~~~~~~~~~~~~--~~~~~~~~g~vPvv~~~n-----n~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv 292 (492) T protein:vir:94 220 ENGSLIPDYSNNLENSK--THFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYV 292 (492) T ss_pred ecCeeeecccccccccc--ccccccCCCccceEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceee Confidence 11111111 1111111 112335677889886654 45799999999999999999888888888888888866 Q ss_pred ecCccccChhhhc-cC-CCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcc-cCCCCCCCHHHHHH Q lcl|NC_015159. 304 VNPNGVTQIRRVA-KA-NTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAV-QRGGDRVTAEEIRY 380 (532) Q Consensus 304 v~~~g~~~~~~~~-~~-~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~-~~~~~~~TAtEi~~ 380 (532) +..-......... .. ..+.+.-+..+++..+ ....+.......++.++..|.+.-..-.+. ..-+...|+.-+.. T Consensus 293 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~ 370 (492) T protein:vir:94 293 LKNYDDQELPEFKRLLRYYGAIKVSDNGGVDTI--QVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEF 370 (492) T ss_pred eecCCcccchhhHHHHhhccceecCCCCcceeE--eccCCHHHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHH Confidence 5321111111111 11 1122322333444433 223355566677777777665532211111 11122335443332 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchh Q lcl|NC_015159. 381 VAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQ 460 (532) Q Consensus 381 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~ 460 (532) ...- +....-...+.-.+ -+.+++.++.+..-++. ....+++.+ ++-.|-..+..++.+.. ++.+ T Consensus 371 ~~~~-l~~k~~~k~~~f~~----~l~~~~~li~~~~~~~~-~~~~i~v~f-~~~~p~~~~e~~~~~~k----l~gi---- 435 (492) T protein:vir:94 371 LYTN-LNLKADKLARKAKV----AIQELLWFVFEHFDIKG-EHKDVDISF-NYNKVANTELQVQTAQQ----SMGI---- 435 (492) T ss_pred HHHH-HHHHHHHHHHHHHH----HHHHHHHHHHHHhcCCc-ccceeeEEe-cCCCCCCHHHHHHHHHH----Hhcc---- Confidence 2211 22222222222222 33444444333211111 112233332 11111111222222111 1111 Q ss_pred hhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 461 DDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 461 ~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) +....++ ..+|. +.+ ++|++...++++.+++..+ ....+....+...+..+..++ T Consensus 436 ---iS~et~~----~~l~~-----v~d~~~E~eri~~E~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~e~ 491 (492) T protein:vir:94 436 ---VSHETVL----ENHPF-----VEDLQAELERIEQEQMEYNKQLP-----NLDDGGADSAQQQERSNNKES 491 (492) T ss_pred ---CchHHHH----HhCCC-----CCCHHHHHHHHHHHHHHHHhhcc-----ccccccCCCCccccCCccccC Confidence 2122222 23332 222 3444444333322222111 111111111112222233333 No 83 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=98.53 E-value=3.4e-07 Score=56.04 Aligned_cols=444 Identities=10% Similarity=0.030 Sum_probs=184.1 Q ss_pred CCCCC------CCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVE------KTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKL 74 (532) Q Consensus 1 m~~~~------~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l 74 (532) |...+ +..++. ..+ ....+|+.+|+=-.|.+....+.....+..+.-=..+...++.+|+-| T Consensus 20 ~~~~~~~~~~~~i~~~~----~~~--------~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~~~~A~Ll 87 (517) T protein:vir:98 20 GQTLKSINDHEKINIDP----NEL--------ARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSADVLSGLV 87 (517) T ss_pred ccchhHhhcCCceecCH----HHH--------HHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHHHHhhhhh Confidence 21111 111111 111 223446666543333322111111111111111134455555555554 Q ss_pred HHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 75 MLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 75 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) .+-+.. +.++|....+... .......++|+ +.+..++|+..+.+++.+..+.|.+++=+-.+ T Consensus 88 ~~e~~~-------i~v~d~~~~~~~~--~~~~~~~e~l~-------~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d-- 149 (517) T protein:vir:98 88 FNEQCE-------VYVSDAKDEEKKD--NSFKTAHEFIQ-------HVFQHNKFIKNLSDYLEPTFALGGLTVRPYVD-- 149 (517) T ss_pred cCCcce-------EEecccccccccc--cchhHHHHHHH-------HHHHhccHHHHHHHHHHHHhhhCCEEEEEEEe-- Confidence 333322 2222222111110 01122445554 56788999999999999999999998743332 Q ss_pred ccCCcceEEEEecceEEE-eeCCCCCeEEEE-EEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEee---C---C Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVV-ERDAYDNVLQIV-TEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRD---P---E 226 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v-~~d~~G~vd~i~-rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~---~---~ 226 (532) +..+.+..++...|+- +-|..|.+..+| +++..+.+ .+-.+|+.++.. . . T Consensus 150 --~~~~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~--------------------~~~~~Yt~lE~H~~~~~~~~ 207 (517) T protein:vir:98 150 --NGEIEFSWALANAFYPLRSNSNGISEGVMKSVTTKVIG--------------------NKTVYYTLLEFHEWEKTEEG 207 (517) T ss_pred --CCeeEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeec--------------------CCceEEEEEEEEecCceecc Confidence 2456788888887765 566666554433 23322111 000112222111 0 0 Q ss_pred CCeEEEEEEE--------cCccccc---------ccccCccccCceEE----EEee-ecCCCccccchHHHHHHHHHHHH Q lcl|NC_015159. 227 AMVFRSYQEI--------DGEIVAG---------TEGEYPLDSCPWIP----VRLI-KMPNEDYGRSFVEEYLGDLKSLE 284 (532) Q Consensus 227 ~~~~~s~~~~--------~~~~~~~---------~~~~~g~~~~P~~~----~Rw~-~~~g~~YG~Gp~~~al~d~~~L~ 284 (532) ...|.+.+.. -|..++. ....-|. ..|.++ +-.+ ...+++||+|-...+++.++.|| T Consensus 208 ~~~y~I~n~ly~s~~~~~lG~~v~L~~~~e~l~~~~~~~g~-~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD 286 (517) T protein:vir:98 208 ESLYVITNELYKSDNEGEIGKRIPLEELYEGMQEKTYIQGL-SRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKIN 286 (517) T ss_pred CCcEEEEEEEEecCCCccccccccccccccCCCcceeECCC-CcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHH Confidence 0112221111 0111110 0000111 224221 1222 23378899999999999999999 Q ss_pred HHHHHHHHHHHHHhcCceeecCccccChhhhccCCCce--------e--ecCccccccccccCCccchhHHHHHHHHHHH Q lcl|NC_015159. 285 NLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANTGD--------F--VAGRKQDVEVFQLEKYNDFQVAKATADDIEK 354 (532) Q Consensus 285 ~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~G~--------~--v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 354 (532) ..--+...-.. ..+.++.||++.+-...+-....++. + +.+..+......+...-........++.+-+ T Consensus 287 ~~~s~~~~e~~-~g~~~i~vp~~~l~~~~~~~g~~~~~~~d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~ 365 (517) T protein:vir:98 287 DTYDQFWWEIK-MGQRTVFVSDVMLRTVPDESGMPPPQVFDPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALR 365 (517) T ss_pred HHHHHHHHHHH-hCCcceecChhhhccccCCCCcccCCCCCcccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHH Confidence 77777666444 46777777655431111110011111 1 1111111111111111111223344444444 Q ss_pred HHHHHH-h-hhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccc--cccee Q lcl|NC_015159. 355 RLSYAF-M-LNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEA--VEPAI 430 (532) Q Consensus 355 rI~~af-~-~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~--~~~~~ 430 (532) .|.... + ...+...+...-|||||..+.+...+...- +.+.....|.-|++-++.+..-.+++....... +.+.+ T Consensus 366 ~i~~~~Gls~~t~~~~~~~~kTATEi~s~~~~~~~t~~~-~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f 444 (517) T protein:vir:98 366 TLEMELKLSVGTFSFDGRSMKTATEIVSENDLTYRTRND-HVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDF 444 (517) T ss_pred HHHHHhCCCcccccccccccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEc Confidence 443321 1 111222222335999999999988888775 333344455555555444433333333222222 33332 Q ss_pred ecch--HHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 431 ATGL--EALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAG 508 (532) Q Consensus 431 v~~l--~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~ 508 (532) -.++ +.-+. ++..++.++ +.+ +....++. +.+|+ |++|.+++.++.+.... T Consensus 445 ~D~i~~D~~~~---~~~~~~~v~--aG~-------ms~~~~i~---~~~g~-------~eeeA~~e~~~i~~E~~----- 497 (517) T protein:vir:98 445 DDGVFQDRSAL---LRFYGQAKT--FGF-------IPTVEAIQ---RIFKV-------PKKTAEQWLEEIRKDQI----- 497 (517) T ss_pred CCCCCCCHHHH---HHHHHHHHh--cCC-------CCHHHHHH---HhCCC-------ChHHHHHHHHHHHHhcc----- Confidence 1222 22222 222222111 111 23344433 34675 45565544443322221 Q ss_pred HhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 509 QQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 509 ~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) ...+.. ....++++++.+ T Consensus 498 ~~~~~~------~~~~~~~~~~gd 515 (517) T protein:vir:98 498 ELDPVT------ISQRAQKRMFGD 515 (517) T ss_pred ccCCCC------ccccccCCCCCC Confidence 111111 112333344433 No 84 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=98.52 E-value=3.6e-07 Score=55.89 Aligned_cols=433 Identities=9% Similarity=-0.033 Sum_probs=193.0 Q ss_pred CCCCCCCc-cCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcc-----cccCCCCC----cccccccccccchHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTG-FAADGAAAAYNRLKNDRGAYETRAEDCATYTIP-----SVFPSATA----DGSTSYTTPWQSIGARGLNNL 70 (532) Q Consensus 1 m~~~~~~~-~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P-----~~~~~~~~----~~~~~~~~~~dst~~~a~~~L 70 (532) +...+... ...+-+.+..+..+. | .++.+++.+|..= .+-..... ...+...++..+-+...++.. T Consensus 18 ~~~~~~~~~~~~~~i~~~i~~~~~-~---~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~ 93 (474) T protein:vir:96 18 VEQMKPKVETQEEMIIRLINNHKQ-K---LKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQK 93 (474) T ss_pred hhhccccccchHHHHHHHHHHHHH-H---HHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhh Confidence 11111111 122223333333333 2 2344444444332 11111100 111223456667777777777 Q ss_pred HHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeec Q lcl|NC_015159. 71 ASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIP 150 (532) Q Consensus 71 aa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~ 150 (532) ++.|++ -|+ +++..+.. +.+.| ... ..++|...+.++.+++..+|.|.+++. T Consensus 94 ~~yl~g--~p~-----~~~~~~~~-------------~~~~l-------~~~-~~n~~~~~~~~l~~~~~~~G~~~~~~~ 145 (474) T protein:vir:96 94 VSYVAG--KPV-----TYAHDDDK-------------VLDVI-------HQV-LDTRWDNKLIDILTAASNKGIDWLQVY 145 (474) T ss_pred hhhhcc--cCc-----eeccCChH-------------HHHHH-------HHH-HhccHHHHHHHHHHHHhhCCeEEEEee Confidence 766654 222 23333321 11112 122 236899999999999999999988876 Q ss_pred ccccccCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEE---EEeeC Q lcl|NC_015159. 151 STEQVEGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTH---VYRDP 225 (532) Q Consensus 151 ~~~~~~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~---v~~~~ 225 (532) .++ .+.+++.+++..+.++..|. .+++...+|.+... ....+++|+. .+... T Consensus 146 ~d~---~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~--------------------~~~~~~vy~~~~i~~~~~ 202 (474) T protein:vir:96 146 INE---DGELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFN--------------------GETKVEYWTAETVTYYVY 202 (474) T ss_pred eCC---CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeec--------------------CeeEEEEEeCCeEEEEEE Confidence 543 34567888888776555443 57777777665421 1123455432 11111 Q ss_pred CCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec Q lcl|NC_015159. 226 EAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVN 305 (532) Q Consensus 226 ~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~ 305 (532) ++..+.......+..........++..+|++..+. +.+|.|=.+..++-+-.++.+--......+....|.+.+. T Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~ 277 (474) T protein:vir:96 203 ENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKN-----NPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILR 277 (474) T ss_pred cCCceeeccccccccccCcccccCCCccceEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc Confidence 11112111111111122223345778899987754 4679998899999999999888888888888888876654 Q ss_pred CccccChhhhc-cC-CCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhh-hhcccCCCCCCCHHHHHHHH Q lcl|NC_015159. 306 PNGVTQIRRVA-KA-NTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFML-NSAVQRGGDRVTAEEIRYVA 382 (532) Q Consensus 306 ~~g~~~~~~~~-~~-~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~~~~~TAtEi~~r~ 382 (532) .-+..+..... .. ..+.+..+..+++..+ ....+.......++.++..|...-.. +......+...|+..+..+- T Consensus 278 g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~ 355 (474) T protein:vir:96 278 GYEGEDLSEFMEGLKYYKAINVSSDGGVETI--QVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLY 355 (474) T ss_pred CCCcccccchhhhhhccceeeccCCCceeEE--eccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHH Confidence 32111111111 11 1123323333444433 23345566677777777766553221 11111112233444433221 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCccccccceee--cchHHHHHHHHHHHHHHHHHHHHhhcch Q lcl|NC_015159. 383 GELEDTLGGVYSLLSQELQLPLVKILLKELQAT-SKIPNLPKEAVEPAIA--TGLEALGRGHDLNKLNVFIDYMIKLAGL 459 (532) Q Consensus 383 ~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~-g~lp~~p~~~~~~~~v--~~l~~l~raq~~~~l~~~~~~laq~~p~ 459 (532) .- +........+ .+...+.+++.++.+. |.-+ ....+++.+. .+.+.+..++- +.+. + T Consensus 356 ~~-l~~k~~~~~~----~~~~~l~~~~~~i~~~~g~~~--d~~~i~i~f~~~~p~~~~e~a~~----------~~~~-g- 416 (474) T protein:vir:96 356 TN-LNLKANKLKN----KANVALQELMQFILDFNKIKL--DAKEIEITFNFNVMVNDLEQSQI----------GAQS-Q- 416 (474) T ss_pred HH-HHHHHHHHHH----HHHHHHHHHHHHHHHHhCCCc--ccceeeEEecCCCccCHHHHHHH----------HHHc-C- Confidence 11 1111111222 2222333334333332 2211 1223334332 22333332221 1111 1 Q ss_pred hhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 460 QDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 460 ~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .+.-..++. .++ ++.+ ++|++..++++...+ .+++...+.......++..+-.+| T Consensus 417 ---iiS~et~~~----~lp-----~v~D~~~E~eri~~E~~~~~------~~~~~~~~~~~~~~~~~~~~~~~e 472 (474) T protein:vir:96 417 ---YLSKETLVR----HHP-----WVDDPKAELERLDEEQLELN------KQLPNLDDGGADGAQQQQQSENNQ 472 (474) T ss_pred ---CCChHHHHH----hCC-----CCCCHHHHHHHHHHHHHHHH------hhccccccccCCCCCCcCCCCccc Confidence 122233332 222 2222 234433333322111 112222222233334444555555 No 85 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=98.52 E-value=3.6e-07 Score=55.89 Aligned_cols=433 Identities=9% Similarity=-0.033 Sum_probs=193.0 Q ss_pred CCCCCCCc-cCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcc-----cccCCCCC----cccccccccccchHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTG-FAADGAAAAYNRLKNDRGAYETRAEDCATYTIP-----SVFPSATA----DGSTSYTTPWQSIGARGLNNL 70 (532) Q Consensus 1 m~~~~~~~-~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P-----~~~~~~~~----~~~~~~~~~~dst~~~a~~~L 70 (532) +...+... ...+-+.+..+..+. | .++.+++.+|..= .+-..... ...+...++..+-+...++.. T Consensus 18 ~~~~~~~~~~~~~~i~~~i~~~~~-~---~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~ 93 (474) T protein:vir:95 18 VEQMKPKVETQEEMIIRLINNHKQ-K---LKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQK 93 (474) T ss_pred hhhccccccchHHHHHHHHHHHHH-H---HHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhh Confidence 11111111 122223333333333 2 2344444444332 11111100 111223456667777777777 Q ss_pred HHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeec Q lcl|NC_015159. 71 ASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIP 150 (532) Q Consensus 71 aa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~ 150 (532) ++.|++ -|+ +++..+.. +.+.| ... ..++|...+.++.+++..+|.|.+++. T Consensus 94 ~~yl~g--~p~-----~~~~~~~~-------------~~~~l-------~~~-~~n~~~~~~~~l~~~~~~~G~~~~~~~ 145 (474) T protein:vir:95 94 VSYVAG--KPV-----TYAHDDDK-------------VLDVI-------HQV-LDTRWDNKLIDILTAASNKGIDWLQVY 145 (474) T ss_pred hhhhcc--cCc-----eeccCChH-------------HHHHH-------HHH-HhccHHHHHHHHHHHHhhCCeEEEEee Confidence 766654 222 23333321 11112 122 236899999999999999999988876 Q ss_pred ccccccCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEE---EEeeC Q lcl|NC_015159. 151 STEQVEGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTH---VYRDP 225 (532) Q Consensus 151 ~~~~~~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~---v~~~~ 225 (532) .++ .+.+++.+++..+.++..|. .+++...+|.+... ....+++|+. .+... T Consensus 146 ~d~---~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~--------------------~~~~~~vy~~~~i~~~~~ 202 (474) T protein:vir:95 146 INE---DGELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFN--------------------GETKVEYWTAETVTYYVY 202 (474) T ss_pred eCC---CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeec--------------------CeeEEEEEeCCeEEEEEE Confidence 543 34567888888776555443 57777777665421 1123455432 11111 Q ss_pred CCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec Q lcl|NC_015159. 226 EAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVN 305 (532) Q Consensus 226 ~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~ 305 (532) ++..+.......+..........++..+|++..+. +.+|.|=.+..++-+-.++.+--......+....|.+.+. T Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~ 277 (474) T protein:vir:95 203 ENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKN-----NPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILR 277 (474) T ss_pred cCCceeeccccccccccCcccccCCCccceEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc Confidence 11112111111111122223345778899987754 4679998899999999999888888888888888876654 Q ss_pred CccccChhhhc-cC-CCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhh-hhcccCCCCCCCHHHHHHHH Q lcl|NC_015159. 306 PNGVTQIRRVA-KA-NTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFML-NSAVQRGGDRVTAEEIRYVA 382 (532) Q Consensus 306 ~~g~~~~~~~~-~~-~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~~~~~TAtEi~~r~ 382 (532) .-+..+..... .. ..+.+..+..+++..+ ....+.......++.++..|...-.. +......+...|+..+..+- T Consensus 278 g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~ 355 (474) T protein:vir:95 278 GYEGEDLSEFMEGLKYYKAINVSSDGGVETI--QVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLY 355 (474) T ss_pred CCCcccccchhhhhhccceeeccCCCceeEE--eccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHH Confidence 32111111111 11 1123323333444433 23345566677777777766553221 11111112233444433221 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCccccccceee--cchHHHHHHHHHHHHHHHHHHHHhhcch Q lcl|NC_015159. 383 GELEDTLGGVYSLLSQELQLPLVKILLKELQAT-SKIPNLPKEAVEPAIA--TGLEALGRGHDLNKLNVFIDYMIKLAGL 459 (532) Q Consensus 383 ~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~-g~lp~~p~~~~~~~~v--~~l~~l~raq~~~~l~~~~~~laq~~p~ 459 (532) .- +........+ .+...+.+++.++.+. |.-+ ....+++.+. .+.+.+..++- +.+. + T Consensus 356 ~~-l~~k~~~~~~----~~~~~l~~~~~~i~~~~g~~~--d~~~i~i~f~~~~p~~~~e~a~~----------~~~~-g- 416 (474) T protein:vir:95 356 TN-LNLKANKLKN----KANVALQELMQFILDFNKIKL--DAKEIEITFNFNVMVNDLEQSQI----------GAQS-Q- 416 (474) T ss_pred HH-HHHHHHHHHH----HHHHHHHHHHHHHHHHhCCCc--ccceeeEEecCCCccCHHHHHHH----------HHHc-C- Confidence 11 1111111222 2222333334333332 2211 1223334332 22333332221 1111 1 Q ss_pred hhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 460 QDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 460 ~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .+.-..++. .++ ++.+ ++|++..++++...+ .+++...+.......++..+-.+| T Consensus 417 ---iiS~et~~~----~lp-----~v~D~~~E~eri~~E~~~~~------~~~~~~~~~~~~~~~~~~~~~~~e 472 (474) T protein:vir:95 417 ---YLSKETLVR----HHP-----WVDDPKAELERLDEEQLELN------KQLPNLDDGGADGAQQQQQSENNQ 472 (474) T ss_pred ---CCChHHHHH----hCC-----CCCCHHHHHHHHHHHHHHHH------hhccccccccCCCCCCcCCCCccc Confidence 122233332 222 2222 234433333322111 112222222233334444555555 No 86 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=98.51 E-value=3.9e-07 Score=55.72 Aligned_cols=437 Identities=9% Similarity=-0.012 Sum_probs=191.9 Q ss_pred CCC------CCC------------CccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhh--cccccCCCCC----ccccccc Q lcl|NC_015159. 1 MAE------VEK------------TGFAADGAAAAYNRLKNDRGAYETRAEDCATYT--IPSVFPSATA----DGSTSYT 56 (532) Q Consensus 1 m~~------~~~------------~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~--~P~~~~~~~~----~~~~~~~ 56 (532) |-+ +++ ...+.+.+.+..+..+.. -.....+++++.-- +..+...... ...+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~-~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ 79 (474) T protein:vir:95 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIIRLIDDHRKQ-LDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDW 79 (474) T ss_pred CcceeecCCCCchhhHHHHhhhhccCChHHHHHHHHHHHHHH-HHHHHHHHHHhcccCchhccccccccccccccccccc Confidence 221 111 112345555555555443 33344555555421 2222111111 1112234 Q ss_pred ccccchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHH Q lcl|NC_015159. 57 TPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAI 136 (532) Q Consensus 57 ~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~ 136 (532) ++..+-+...++..++.|++ -|+ .+...|.. +.+.| ... ..+||...+.++. T Consensus 80 ki~~n~~~~Ivd~~~~~l~g--~p~-----~~~~~d~~-------------~~~~l-------~~~-~~n~~~~~~~e~~ 131 (474) T protein:vir:95 80 RITTNFHQNLVDQKVSYVAS--KPV-----TYSCEDES-------------VLKII-------HDV-LDTRWDNKLIDIL 131 (474) T ss_pred eeccchHHHHHHHHHhhhcc--CCc-----eeccCchH-------------HHHHH-------HHH-HhccHHHHHHHHH Confidence 66777777888888776654 221 23433322 11112 122 2368999999999 Q ss_pred HHHHhhCceeeeecccccccCCcceEEEEecceEEEeeC--CCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcce Q lcl|NC_015159. 137 KQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERD--AYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEE 214 (532) Q Consensus 137 ~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d--~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 214 (532) ++...+|.|.+++..++ ++.+++.+++..+.+...| ..|++..++|.+... .... T Consensus 132 ~~~~~~G~~~~~v~~d~---~~~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~--------------------~~~~ 188 (474) T protein:vir:95 132 TATSNKGIDWLQVYINE---NGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFN--------------------NEEK 188 (474) T ss_pred HHHhhcCcEEEEEEecC---CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEEc--------------------CeeE Confidence 99999999988876543 3456777777766544433 357777777665421 1123 Q ss_pred EEEEEE---EEeeCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 215 VTIYTH---VYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIV 291 (532) Q Consensus 215 v~i~~~---v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l 291 (532) +++|+. .+...++..|..................++..+|++.++. +.+|.|=.+...+-+..+|.+.-... T Consensus 189 ~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~ 263 (474) T protein:vir:95 189 VEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFKN-----NPEEVSDIWMYKSLIDAIDKRLSDAQ 263 (474) T ss_pred EEEEeCCeEEEEEEcCCccccccccCcccccccccccCCCccceEeecC-----CCCCCCcHHHHHHHHHHHHHHHHHHH Confidence 444422 1111111112222111111111222345677899988754 46799989999999999999888888 Q ss_pred HHHHHHhcCceeecCccccChhhh-ccCCCceee-cCccccccccccCCccchhHHHHHHHHHHHHHHHHHhh-hhcccC Q lcl|NC_015159. 292 KMSMISSKVLFFVNPNGVTQIRRV-AKANTGDFV-AGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFML-NSAVQR 368 (532) Q Consensus 292 ~~~~~a~~p~~lv~~~g~~~~~~~-~~~~~G~~v-~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~ 368 (532) ...+....|.+.+..-..-....+ .....+.++ ....+++..+ ....+.......++.+.+.|...-.. +..... T Consensus 264 ~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~ 341 (474) T protein:vir:95 264 NMFDESVELIYILKGYEGQDLEEFMRGLKYYKAINVDGDGGVETI--QVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDK 341 (474) T ss_pred HHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCCceeEE--eecCCHHHHHHHHHHHHHHHHHHhCCccccccc Confidence 888888998877653222222221 111122222 2333334333 33346666777777777777553221 111111 Q ss_pred CCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCccccccceeecchHHHHHHHHHHHHH Q lcl|NC_015159. 369 GGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQAT-SKIPNLPKEAVEPAIATGLEALGRGHDLNKLN 447 (532) Q Consensus 369 ~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~-g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~ 447 (532) .+...|+..+..+....... .-...+ .+..-+.+++.++.+. |. ......+.+.+ .+-.|-.-++.++.+ T Consensus 342 ~~~n~Sg~Alk~~~~~l~~k-~~~k~~----~~~~~l~~~~~li~~~~g~--~~d~~~i~v~f-~~~~p~d~~e~a~~~- 412 (474) T protein:vir:95 342 FGSAPSGIALKFLYGNLDLK-ANKLKN----KATVAIQELIGFIIDFNNL--KMDVKDIEISF-NFNRMMNDAEQSQII- 412 (474) T ss_pred ccccchHHHHHHHHHHHHHH-HHHHHH----HHHHHHHHHHHHHHHHhCC--CcccceeeEEe-ccCCCcCHHHHHHHH- Confidence 22334665544332222221 111222 2222333334333331 21 11112233333 111121112222211 Q ss_pred HHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccc Q lcl|NC_015159. 448 VFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQ 526 (532) Q Consensus 448 ~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (532) .+. + .+....++. .++ ++.+ ++|++...++++.+..... ..... ......+. T Consensus 413 ------~~~-g----~iS~et~i~----~l~-----~v~d~~~E~~ri~~E~~~~~~~~~------~~~~~-~~d~~~~~ 465 (474) T protein:vir:95 413 ------AQS-Q----YLSRETLVK----SSP-----LVDDYKAELERIEQEQMEYNKQLP------NLDDG-GADGAQQQ 465 (474) T ss_pred ------Hhc-C----CCchHHHHH----hCC-----CCCCHHHHHHHHHHHHHHHHhccc------ccccc-cCCCCcCC Confidence 111 1 122233332 222 1222 3444433333322222111 11110 01111111 Q ss_pred cCCCCC Q lcl|NC_015159. 527 AGLPTQ 532 (532) Q Consensus 527 ~g~~~~ 532 (532) ..-.++ T Consensus 466 ~~~~~~ 471 (474) T protein:vir:95 466 ERSNDK 471 (474) T ss_pred CCCccC Confidence 111111 No 87 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=98.45 E-value=5.9e-07 Score=54.72 Aligned_cols=448 Identities=9% Similarity=0.041 Sum_probs=198.9 Q ss_pred CCCCCCCcc--------------------CHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc-----ccC---CCCC--- Q lcl|NC_015159. 1 MAEVEKTGF--------------------AADGAAAAYNRLKNDRGAYETRAEDCATYTIPS-----VFP---SATA--- 49 (532) Q Consensus 1 m~~~~~~~~--------------------~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~---~~~~--- 49 (532) ||+.=+-+. +.+.+.+..+ ..| .++++.+.+|..-. +.. .... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~---~~~---~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~ 74 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLID---EHN---PEPLLKGVRYYMCENDIEKKRRTYYDAAGQQL 74 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHH---hhc---HHHHHHHHHHhccccchhhccchhcccccccc Confidence 554311110 1111111111 112 24455665555432 111 1100 Q ss_pred -cccccccccccchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_015159. 50 -DGSTSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSF 128 (532) Q Consensus 50 -~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf 128 (532) ...+...++-.+-+...++.+++.|.+- | ++++..|.. +.+.| +.+..++| T Consensus 75 ~~~~~~~~ri~~n~~~~ivd~~~~yl~g~------~-~~~~~~d~~-------------~~~~l--------~~~~~n~~ 126 (503) T protein:vir:59 75 VDDTKTNNRTSHAWHKLFVDQKTQYLVGE------P-VTFTSDNKT-------------LLEYV--------NELADDDF 126 (503) T ss_pred cccccccceeecchHHHHHHHHHhhhhcC------C-eeeccCcHH-------------HHHHH--------HHHHhcCH Confidence 1111233555666677777777665421 1 123333322 22233 23334789 Q ss_pred hHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhc Q lcl|NC_015159. 129 RPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQ 206 (532) Q Consensus 129 ~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~ 206 (532) -....++.++..++|.+++++..++ ++.++++.++..+++...|. .+++...+|.++..- T Consensus 127 ~~~~~~~~~~~~~~G~~~~~v~~d~---dg~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~--------------- 188 (503) T protein:vir:59 127 DDILNETVKNMSNKGIEYWHPFVDE---EGEFDYVIFPAEEMIVVYKDNTRRDILFALRYYSYKG--------------- 188 (503) T ss_pred HHHHHHHHHHHhhCCeEEEEEeecC---CCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEec--------------- Confidence 9999999999999999998886543 35567888888776555443 477777666554210 Q ss_pred ccCCCcceEEEEEEE---EeeCCCCeEEEEE-EEcC---cccccccccCccccCceEEEEeeecCCCccccchHHHHHHH Q lcl|NC_015159. 207 GDQNPSEEVTIYTHV---YRDPEAMVFRSYQ-EIDG---EIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGD 279 (532) Q Consensus 207 ~~~~~~~~v~i~~~v---~~~~~~~~~~s~~-~~~~---~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d 279 (532) .+++....+++|+.- +....+.-|..-. +... .........+++..+|++.++- +.+|.|=.+.+.+- T Consensus 189 ~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~n-----n~~~~sd~~~~~~l 263 (503) T protein:vir:59 189 IMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFKN-----NEEMVSDLKFYKDL 263 (503) T ss_pred CCCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCccceEEecC-----CCCCCcchhhhHHH Confidence 011112233343321 0110100010000 0000 0000111235677889887754 45799988999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCceeecCccccChhh-hccCCC-ceeecCccccccccccCCccchhHHHHHHHHHHHHHH Q lcl|NC_015159. 280 LKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRR-VAKANT-GDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLS 357 (532) Q Consensus 280 ~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~-~~~~~~-G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~ 357 (532) +..+|.+.-......+....|.+.+..-...+... ...... +.+..+..+++..+. ...+.......++.++..|. T Consensus 264 iDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~ 341 (503) T protein:vir:59 264 IDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTANLRYHSVIKVSGDGGVDTLR--AEIPVDSAAKELERIQDELY 341 (503) T ss_pred HHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhhhhhcccceeccCCCcceeEe--ccCCHHHHHHHHHHHHHHHH Confidence 99999988888888888888887764321111111 111121 233333334444433 23455666777777777776 Q ss_pred HHHhhhh-cccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHH Q lcl|NC_015159. 358 YAFMLNS-AVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEA 436 (532) Q Consensus 358 ~af~~~~-~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~ 436 (532) +.-.... .....+...|+..+..+..-.... .--..+.-.+.|.-+++.++.++...+.....+...+++.+..++ + T Consensus 342 ~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k-~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~-p 419 (503) T protein:vir:59 342 KSAQAVDNSPETIGGGATGPALENLYALLDLK-ANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTRTR-I 419 (503) T ss_pred HHhcccCCCcccccccccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccceeEEeCCCC-C Confidence 5432211 111223456777766544333333 233444445555555555555554444322222233444442221 2 Q ss_pred HHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHH Q lcl|NC_015159. 437 LGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAG 515 (532) Q Consensus 437 l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~ 515 (532) -..++.++.+...++. .+ +....++. .++ ++.+ ++|++...++++..+.+ ..... . T Consensus 420 ~d~~~~~~~~~kl~~~--Gi-------iS~et~l~----~l~-----~v~d~~~E~~ri~~E~~~~~~~--~~~~~---~ 476 (503) T protein:vir:59 420 QNDSEIVQSLVQGVTG--GI-------MSKETAVA----RNP-----FVQDPEEELARIEEEMNQYAEM--QGNLL---D 476 (503) T ss_pred CCHHHHHHHHHHHHhC--CC-------CchHHHHH----hCC-----CCCCHHHHHHHHHHHHHHHHhh--hcccc---C Confidence 2222233333222211 11 11222222 222 1222 34443333222211111 11111 1 Q ss_pred HHHHHhhcccccCCCCC Q lcl|NC_015159. 516 GQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 516 ~~~~~~~~~~~~g~~~~ 532 (532) ...+..-.++..+.+.+ T Consensus 477 ~~~~~~~~~~~~~~~~~ 493 (503) T protein:vir:59 477 DEGGDDDLEEDDPNAGA 493 (503) T ss_pred ccCCCCCCCcCCCCCCc Confidence 11111111111222211 No 88 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=98.44 E-value=6.3e-07 Score=54.56 Aligned_cols=448 Identities=9% Similarity=-0.005 Sum_probs=194.1 Q ss_pred CCCCCCCc-cCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccc---cCCCCCc-ccccccccccchHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTG-FAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV---FPSATAD-GSTSYTTPWQSIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~~~-~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~---~~~~~~~-~~~~~~~~~dst~~~a~~~Laa~l~ 75 (532) |...+... .+.+.+.+..+.....+. ++++++.+|..-.- ....... ..+...++..+.+...++..++.|+ T Consensus 31 ~~~~~~~~~~~~~~i~~~i~~~~~~~~---~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:10 31 YDGTESDLLQNVNEVSKCIEHHMDYQR---PRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred CchhhhhcccCHHHHHHHHHHHHHhhH---HHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhc Confidence 55443322 244556555555444443 45555555544321 1111111 1122345666777777777766554 Q ss_pred HhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_015159. 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQV 155 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~ 155 (532) + -|+ +++..+.. +. ..+.+.+..++|.....++.+++.+||.|..++..++ T Consensus 108 g--~p~-----~~~~~d~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~de-- 158 (511) T protein:vir:10 108 G--NPI-----QYQDDDKD-------------VL-------EAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQ-- 158 (511) T ss_pred c--cCc-----eeecCchH-------------HH-------HHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCC-- Confidence 3 121 12333221 11 2344566778899999999999999999988776542 Q ss_pred cCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEE Q lcl|NC_015159. 156 EGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSY 233 (532) Q Consensus 156 ~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~ 233 (532) .+.+++..++..+.++..|. .+++...+|.+......- ...+.-..+++|+ ++. -|. T Consensus 159 -dg~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~------------~~~~~~~~~~iyt-----~~~-i~~-- 217 (511) T protein:vir:10 159 -DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDK------------TDEDEVFTVDLFT-----SHG-VYR-- 217 (511) T ss_pred -CCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccc------------CccceEEEEEEEe-----CCc-EEE-- Confidence 34567777777665554443 356665555543211100 0001111222322 221 111 Q ss_pred EEEcCcc------cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCc Q lcl|NC_015159. 234 QEIDGEI------VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPN 307 (532) Q Consensus 234 ~~~~~~~------~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~ 307 (532) +..++.. ........++..+|++.++- +.+|.|-.+..++-+..++.+.-..........+|.+.+.-. T Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~n-----n~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~ 292 (511) T protein:vir:10 218 YLTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN 292 (511) T ss_pred EEecCCCcccccccccccccccCcceeEEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecc Confidence 1111110 01112345677889887653 357899899999999999988777788788888887665432 Q ss_pred cccChhhhccCCCceeec--------C----ccccccccccCCccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCCCC Q lcl|NC_015159. 308 GVTQIRRVAKANTGDFVA--------G----RKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM-LNSAVQRGGDRVT 374 (532) Q Consensus 308 g~~~~~~~~~~~~G~~v~--------g----~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~~~T 374 (532) ...+...+.....+.++. + ..++..+..+....+.......+..++..|...-. .+.....-+...| T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~S 372 (511) T protein:vir:10 293 LNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS 372 (511) T ss_pred ccCCchhhccchhccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccch Confidence 333333332222221111 1 11112222333334556666777777776644211 1111111123456 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc--cccceeec--chHHHHHHHHHHHHHHHH Q lcl|NC_015159. 375 AEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKE--AVEPAIAT--GLEALGRGHDLNKLNVFI 450 (532) Q Consensus 375 AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~--~~~~~~v~--~l~~l~raq~~~~l~~~~ 450 (532) +..+..+..-. ........++-.+.+.-++.-++.++...+... .+.+ .+++.+.- +.+.+..++ .+.... T Consensus 373 g~Al~~~~~~l-~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~-~~~d~~~i~i~f~~~~p~d~~~~~~---~~~kl~ 447 (511) T protein:vir:10 373 GEAMKYKLFGL-EQRTKTKEGLFTKGLRRRAKLLETILKNTRSID-ANKDFNTVRYVYNRNLPKSLIEELK---AYIDSG 447 (511) T ss_pred HHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcc-cccccceeeEEeCCCCCcCHHHHHH---HHHHHh Confidence 66655442222 222222233333333333333333333333211 1222 23343322 223233332 222221 Q ss_pred HHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCC Q lcl|NC_015159. 451 DYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGL 529 (532) Q Consensus 451 ~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 529 (532) + + +....++.. ++ ++.+ ++|++...++++.+...+ .........+........+..-. T Consensus 448 G----~-------iS~et~~~~----l~-----~v~d~~~E~~ri~~E~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 506 (511) T protein:vir:10 448 G----K-------ISQTTLMSL----FS-----FFQDPELEVKKIEEDEKESIKKA-QKGIYKDPRDINDDEQDDDTKDT 506 (511) T ss_pred c----c-------CcHHHHHHh----CC-----CCCCHHHHHHHHHHHHHHHHHHH-hhhcccCCCCCCCCCCCCcccCc Confidence 1 1 222323322 22 2222 344444333332221111 11110011100000000000111 Q ss_pred CCC Q lcl|NC_015159. 530 PTQ 532 (532) Q Consensus 530 ~~~ 532 (532) ++| T Consensus 507 ~~~ 509 (511) T protein:vir:10 507 VDK 509 (511) T ss_pred ccc Confidence 111 No 89 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=98.42 E-value=7.2e-07 Score=54.27 Aligned_cols=459 Identities=11% Similarity=0.053 Sum_probs=210.6 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccc-----cccc-ccccchHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGS-----TSYT-TPWQSIGARGLNNLASKL 74 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~-----~~~~-~~~dst~~~a~~~Laa~l 74 (532) |.+.++.--+-..+..+|+..++--.. ...|++...-.||.....+..+.+ .++. -.|-+.-.+.++.++..+ T Consensus 32 m~dV~~~hp~y~a~~~~W~~ird~~~G-~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~v 110 (535) T protein:vir:80 32 LPNVGYQRVEFGEMLPKWRKIMDCLSG-QEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDGMMGQV 110 (535) T ss_pred CCCCCcCCHHHHHHHHHHHHHHHHhcC-hHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHHHhchh Confidence 988866555666677777766665443 467777777778874333322211 1222 244444455555555444 Q ss_pred HHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 75 MLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 75 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) + - ..|.+. ++ ..++.+++.| -+...+++.-+..++.+...+|-+.++||-... T Consensus 111 f----r-k~p~~~--~p--------------~~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~ 163 (535) T protein:vir:80 111 F----S-RDPIRQ--LP--------------PALEAIVEDI------DGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNV 163 (535) T ss_pred h----c-CCccee--cc--------------HHHHHHHhcc------CCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCC Confidence 3 2 223442 22 1245455443 345667888888999999999999999973221 Q ss_pred cc----------CCcceEEEEecceE---EEe-eCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEE Q lcl|NC_015159. 155 VE----------GQSNAPKLYKLHNF---VVE-RDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTH 220 (532) Q Consensus 155 ~~----------~~~~~~~~~pl~~~---~v~-~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~ 220 (532) .. +..-.+..|+..+. -.. .|..+++.-+..++....+. +.| ..+.++.|.. T Consensus 164 ~~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~d--d~f------------~~~~~~q~Rv 229 (535) T protein:vir:80 164 GRPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQD--DGF------------ETTYVQQWRV 229 (535) T ss_pred CCcccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEecC--CCc------------ccceeEEEEE Confidence 10 11134666665443 222 23344555566666543221 111 2345556667 Q ss_pred EEeeCCCCeEEEEEE-EcCcc--cccc----cccCccccCceEEEEeeecCCCcccc--chHHHHHHHHHHHHHH---HH Q lcl|NC_015159. 221 VYRDPEAMVFRSYQE-IDGEI--VAGT----EGEYPLDSCPWIPVRLIKMPNEDYGR--SFVEEYLGDLKSLENL---YE 288 (532) Q Consensus 221 v~~~~~~~~~~s~~~-~~~~~--~~~~----~~~~g~~~~P~~~~Rw~~~~g~~YG~--Gp~~~al~d~~~L~~l---~~ 288 (532) +.++.++. |....| .++.. .... .-..|-+.+++|++.|.-..+..+.. .|. =|+..||.- +. T Consensus 230 L~~~~~G~-y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPL----l~LA~lni~Hy~~s 304 (535) T protein:vir:80 230 LQLNAEGN-YQVERWRRETQEEMYYSYSKHVPTDGNGNPFKEIPFQFIGPLDNNADIDHPPL----LDLCEVNIGHYRNS 304 (535) T ss_pred EEecCCce-EEEEEEEeecCCccccccceeecccCCCcccCeeEEEEeecCCCCCCCCccch----HHHHHHHHHHhhch Confidence 77766653 554333 22221 1110 00123356777777777555554443 343 244444432 22 Q ss_pred HHHH-HHHHHhcCceeecC------ccccChhhhccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHh Q lcl|NC_015159. 289 AIVK-MSMISSKVLFFVNP------NGVTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM 361 (532) Q Consensus 289 ~~l~-~~~~a~~p~~lv~~------~g~~~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~ 361 (532) +-.+ .+..+..|.+.+.. +.......+..+.+..+.-+..++...+++. +..+ ..+.++++++++++.= T Consensus 305 sd~~~il~~~~~P~l~i~G~~~~~~~~~~~~~~i~iG~~~~~~lP~~~~~~~~e~~-~~~~--a~~~l~~~e~qM~~lG- 380 (535) T protein:vir:80 305 ADYEEMAFVAGQPTAFFTGLTKDWVEDVFKDFKVHLGSRAIIPLPQGATAGILQIT-PNSV--PFEAMTHKESQMIAMG- 380 (535) T ss_pred hHHHHHHHHhcCceeeeecCchhhhhcCCCCcceEecCcccccCCCCCCcceeeec-cchh--HHHHHHHHHHHHHHHH- Confidence 2233 34444455443321 1122222233333333322223334444432 2222 2355777777776631 Q ss_pred hhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceee-cchHHHHHH Q lcl|NC_015159. 362 LNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIA-TGLEALGRG 440 (532) Q Consensus 362 ~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v-~~l~~l~ra 440 (532) ...+ .......||+|........-..|+.+...+.+-+ .-++..+...+ |..+ .++.+++.+- .++.+---+ T Consensus 381 a~ll-~~~~~~~Ta~~a~~~~~~~~S~L~~~a~~le~al-~~aL~~~A~w~---G~~~--~~~~~~i~~n~dF~~~~ld~ 453 (535) T protein:vir:80 381 ANLL-VKSGGNRTFGEAQQEEASEQSILSACTKNVSMAF-RKALRWANQFQ---TGIV--NDETVEYNLNTDFPAARLTP 453 (535) T ss_pred HHhh-ccCcccccHHHHHHHHHHHhHHHHHHHHHHHHHH-HHHHHHHHHHc---CCcc--CCCceEEEeccccccccCCH Confidence 1122 2334458999999999988889988888877643 22333333222 2211 2233322221 111111112 Q ss_pred HHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHH- Q lcl|NC_015159. 441 HDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAA- 519 (532) Q Consensus 441 q~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~- 519 (532) +.++.++...+ + ..|..+.++.++ ...||... -+..++|. .+.+.+............-.+.+|+.- T Consensus 454 ~~~~all~~~~---~------G~Is~et~~~~L-~r~gvl~~-~~~~eee~-~ri~~E~~~~~~~~g~~~d~~~~g~~~~ 521 (535) T protein:vir:80 454 NERAELILEWQ---Q------GAITFKEMRAGL-RRAGVASE-DDAKAETE-GKATVEFIAKTAAAGKVGDAASGGTNKA 521 (535) T ss_pred HHHHHHHHHHh---c------CCCCHHHHHHHH-HhCCCCCc-ccchHHHH-HHHHhhhhhccccCCCCCCCCCCCCCcC Confidence 33333333222 1 135556666665 44476322 22223322 322222111111100000001111100 Q ss_pred ----HhhcccccCC Q lcl|NC_015159. 520 ----AAMMQQQAGL 529 (532) Q Consensus 520 ----~~~~~~~~g~ 529 (532) +....+++|+ T Consensus 522 ~~~~~~~~~~~~~~ 535 (535) T protein:vir:80 522 KLNNGNGGGNQAGN 535 (535) T ss_pred cccCCccccccCCC Confidence 1223345555 No 90 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=98.41 E-value=7.6e-07 Score=54.14 Aligned_cols=434 Identities=8% Similarity=-0.000 Sum_probs=198.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccccc---CCC--CCcccccccccccchHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVF---PSA--TADGSTSYTTPWQSIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~---~~~--~~~~~~~~~~~~dst~~~a~~~Laa~l~ 75 (532) |... ..++.+.+.+..+..+.+| .++|+++.+|....-- ... .....+...++-.+.+...++..++.|+ T Consensus 16 ~~~~--~~l~~~~i~~li~~~~~~~---~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~ 90 (506) T protein:vir:94 16 QESL--ENLTPNKIMKFITHHFNYQ---RPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAKYIADFQTSYSV 90 (506) T ss_pred ccch--hcCCHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCccccccccccccccCCcceeecchHHHHHHHhhhhhc Confidence 4332 3455666766665555544 3567777777654321 110 0111122345666777777887777665 Q ss_pred HhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_015159. 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQV 155 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~ 155 (532) +- | +++...+.. .. ..+.+.+..++|.....++.++..++|.+.+++..++ T Consensus 91 G~--p-----~~~~~~d~~-------------~~-------~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~de-- 141 (506) T protein:vir:94 91 GN--P-----INVKLPDDG-------------SN-------SGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGE-- 141 (506) T ss_pred cc--C-----ceeecCcch-------------HH-------HHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecC-- Confidence 42 2 123333211 11 2234566778999999999999999999988877643 Q ss_pred cCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEE Q lcl|NC_015159. 156 EGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSY 233 (532) Q Consensus 156 ~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~ 233 (532) ++.+++.+++..+.++..|. .+++...+|.+..... ..+.......++.++.... .+ T Consensus 142 -d~~~~i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~---------------~~~~~~~~~~~~~~yt~~~-----~~ 200 (506) T protein:vir:94 142 -DNEEHLAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELV---------------DDNQVSTINYVPETWTADT-----YT 200 (506) T ss_pred -CCeeEEEEEcccceEEEecCCCCCceEEEEEEEeeeec---------------cCCceeEEEEEEEEEeCce-----EE Confidence 35667777787665554443 4566655554443211 0111112222222332111 11 Q ss_pred EEEc---CcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCcccc Q lcl|NC_015159. 234 QEID---GEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVT 310 (532) Q Consensus 234 ~~~~---~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~ 310 (532) ++.. +... ......++..+|++..+= +..|.|-.+...+-+-.++.+.-..+...+...+|.+++...... T Consensus 201 ~~~~~~~~~~~-~~~~~~~~g~vPvv~~~n-----~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~ 274 (506) T protein:vir:94 201 LYNPTPIMGKM-QVDTTKPITTFPVVEFKN-----SNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDT 274 (506) T ss_pred EeccccCccce-eccccccCCccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccc Confidence 1111 1111 112235677889876533 345788888888888888888888888777777776555321110 Q ss_pred Chh-------------------------hh--------ccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHH Q lcl|NC_015159. 311 QIR-------------------------RV--------AKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLS 357 (532) Q Consensus 311 ~~~-------------------------~~--------~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~ 357 (532) ... .+ .....+..+.+...+..+..+....+.+.....++.+.+.|- T Consensus 275 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~ 354 (506) T protein:vir:94 275 LFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIH 354 (506) T ss_pred cccchhccccccccccccccccccchhHHHhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHH Confidence 000 00 000001111111222223333344456667777777777664 Q ss_pred HHHh-hhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceee--cch Q lcl|NC_015159. 358 YAFM-LNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIA--TGL 434 (532) Q Consensus 358 ~af~-~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v--~~l 434 (532) ..-. .+......+...|+..+..+..-.... .-...+.-.+.+..+++-++.++...+.-..+....+++.+. .+- T Consensus 355 ~~s~~p~~~~~~~~~n~Sg~Aik~~~~~l~~k-~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~ 433 (506) T protein:vir:94 355 KFSHTPDLTDENFASNSSGVAMQYKVLGTVEL-ASTKRRMFERGLYARYQIISDIENSIHGDWTFDPQELTFTFRDNLPA 433 (506) T ss_pred HHhCccccccccccccchHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceEEeCCCCCc Confidence 4211 111111112345666655443332222 222334444455555555555543322211122222344432 222 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhH Q lcl|NC_015159. 435 EALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGA 513 (532) Q Consensus 435 ~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~ 513 (532) +.+..++-+.++ +.+ +....++.. ++ ++.+ ++|++...++++.+... .. T Consensus 434 d~~e~a~~~~kl-------~g~-------iS~et~~~~----lp-----~v~d~~~E~~ri~~E~~~~~~~------~~- 483 (506) T protein:vir:94 434 DNISQIKALVQA-------GAT-------LPQKYLYQQ----LP-----GVTNPQDIVDMMKEQSANGDYS------FD- 483 (506) T ss_pred CHHHHHHHHHHH-------hcc-------CChHHHHHh----CC-----CCCCHHHHHHHHHHHHHHHhhc------ch- Confidence 233333322221 211 223333332 22 2222 23443333332211110 00 Q ss_pred HHHHHHHhhcccccCCCCC Q lcl|NC_015159. 514 AGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 514 ~~~~~~~~~~~~~~g~~~~ 532 (532) ........+.+++ T Consensus 484 ------~~~~~~~~~~~~~ 496 (506) T protein:vir:94 484 ------QNGVISNDGQTNT 496 (506) T ss_pred ------hhcCCCcccCccc Confidence 0112223333333 No 91 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=98.39 E-value=8.7e-07 Score=53.81 Aligned_cols=436 Identities=10% Similarity=0.023 Sum_probs=188.3 Q ss_pred CCCCCCCcc-CHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc-----ccCCC--C--CcccccccccccchHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGF-AADGAAAAYNRLKNDRGAYETRAEDCATYTIPS-----VFPSA--T--ADGSTSYTTPWQSIGARGLNNL 70 (532) Q Consensus 1 m~~~~~~~~-~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~~--~--~~~~~~~~~~~dst~~~a~~~L 70 (532) |-......- ..+.+.+..+..+.+ ..+++.+.+|..-. +.... + ....+...++..+-+...++.. T Consensus 35 ~~~~~~~~~~~~~~i~~~i~~~~~~----~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~ 110 (492) T protein:vir:97 35 IVRTNNKPETLEEMIVRYIKQHLEK----LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQK 110 (492) T ss_pred cccCCCchhhHHHHHHHHHHHHHHH----HHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHHH Confidence 443322211 223333333444432 34555666664432 11100 0 0111223467778888888888 Q ss_pred HHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeec Q lcl|NC_015159. 71 ASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIP 150 (532) Q Consensus 71 aa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~ 150 (532) ++.|++ .| +++...|.. +.+.| ... ..++|-....++.+++.++|.|.+++. T Consensus 111 ~~yl~g--~p-----~~~~~~d~~-------------~~~~l-------~~~-~~n~~~~~~~~~~~~~~~~G~a~~~v~ 162 (492) T protein:vir:97 111 VSYIVG--KP-----IAFKHTDDE-------------VVKRI-------DEV-LGNRFDDKLHSVLTGASNKGIEWLHPY 162 (492) T ss_pred hhhhcc--cC-----ceeccCchH-------------HHHHH-------HHH-HhccHHHHHHHHHHHHhhcCeEEEEEE Confidence 876643 22 123333322 12222 122 246788899999999999999988776 Q ss_pred ccccccCCcceEEEEecceEEEeeC--CCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEE---EeeC Q lcl|NC_015159. 151 STEQVEGQSNAPKLYKLHNFVVERD--AYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHV---YRDP 225 (532) Q Consensus 151 ~~~~~~~~~~~~~~~pl~~~~v~~d--~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v---~~~~ 225 (532) .++ .+.+++++++..+.++..| ..+++...+|.+... ....+++|+.. +... T Consensus 163 ~d~---dg~~~~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~--------------------~~~~~~~y~~~~v~~~~~ 219 (492) T protein:vir:97 163 LDE---EGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLE--------------------NETKVEYWDKVTVNYYVY 219 (492) T ss_pred ecC---CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeec--------------------cceeEEEEecCeEEEEEE Confidence 542 3557788888877555443 467887777665421 01223333210 1111 Q ss_pred CCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec Q lcl|NC_015159. 226 EAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVN 305 (532) Q Consensus 226 ~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~ 305 (532) ++..+.....-............++..+|++..+. +.+|+|=.+..++-+..++.+.-...........|.+++. T Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~ 294 (492) T protein:vir:97 220 ENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLK 294 (492) T ss_pred ecCeeeecccccccccccccccCCCCCcceEEecC-----CCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeee Confidence 11111111000000111112345677889887654 3578998999999999999888888888888888876664 Q ss_pred CccccChhhh-ccCC-CceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhh-hhcccCCCCCCCHHHHHHHH Q lcl|NC_015159. 306 PNGVTQIRRV-AKAN-TGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFML-NSAVQRGGDRVTAEEIRYVA 382 (532) Q Consensus 306 ~~g~~~~~~~-~~~~-~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~~~~~TAtEi~~r~ 382 (532) .......... .... .+.+.-+..+++..+ ....+.......++.+++.|.+.-.. +.....-+...|+.-+.... T Consensus 295 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~ 372 (492) T protein:vir:97 295 NYDDQELPEFKRLLRYYGAIKVSDNGGVDTI--QVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLY 372 (492) T ss_pred cCCcccchhHHHHHhhccceecCCCCcceeE--eccCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHH Confidence 2111111111 1111 122222333344433 22335566667777777666543221 11111122234444333222 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccc--ccceeecchHHHHHHHHHHHHHHHHHHHHhhcchh Q lcl|NC_015159. 383 GELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEA--VEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQ 460 (532) Q Consensus 383 ~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~--~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~ 460 (532) .- +....-...+.- ...+.+++.++.+..- .+.+. +++.+ ++..|-..++.++.+.. ++.+ T Consensus 373 ~~-l~~ka~~~~~~f----~~~l~~~~~li~~~~~---~~~~~~~i~v~f-~~~~p~~~~e~a~~~~k----l~G~---- 435 (492) T protein:vir:97 373 TN-LNLKADKLARKA----KVAIQELLWFVFEHFD---IKGEHKDVDISF-NYNKVANTELQVQTAQQ----SMGI---- 435 (492) T ss_pred HH-HHHHHHHHHHHH----HHHHHHHHHHHHHHhc---CCcccceeeEEe-cCCCCCCHHHHHHHHHH----Hhcc---- Confidence 21 222222222222 2333444444333211 12233 33332 22112111222222111 1221 Q ss_pred hhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 461 DDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 461 ~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) +....++ +.++. +.+ ++|++...++.+.++...+. ...+.....--.+..+-.++ T Consensus 436 ---iS~et~l----~~l~~-----v~d~~~Eleri~~E~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~ 491 (492) T protein:vir:97 436 ---VSHETVL----ENHPF-----VEDLQAELERIEQEQTEYNKQLPN-----LDDGGADSAQQQERSNNKES 491 (492) T ss_pred ---CchHHHH----HhCCC-----CCCHHHHHHHHHHHHHHHHHhhhc-----cccCCCCCCccccccccccc Confidence 2222222 22332 222 34554443333322221111 11111111111111122222 No 92 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=98.38 E-value=9.2e-07 Score=53.68 Aligned_cols=466 Identities=11% Similarity=0.077 Sum_probs=212.3 Q ss_pred CCCCCCCccCHHHHH-------HHHHHHHHHhhhHHHHHHHHHHhhcccccCCCC--Ccc-cccccccccchHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAA-------AAYNRLKNDRGAYETRAEDCATYTIPSVFPSAT--ADG-STSYTTPWQSIGARGLNNL 70 (532) Q Consensus 1 m~~~~~~~~~~~~~~-------~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~--~~~-~~~~~~~~dst~~~a~~~L 70 (532) |+..+|+--..+-+. -+-......| ...++.+.+|..-.-..-.+ ..+ .+....++++.+...+ T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~R---l~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~--- 74 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKAR---LASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLI--- 74 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHH---HHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHhh--- Confidence 766554322211110 0000001111 34445555555442110000 001 1122356777774333 Q ss_pred HHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeec Q lcl|NC_015159. 71 ASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIP 150 (532) Q Consensus 71 aa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~ 150 (532) .+.+-.+.| +..|+- +.. . + +|+..+...+++.|++....++-.+.++.|-|++++- T Consensus 75 -~~~~~~~~~-g~~~~~----~~~-~----------e------~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~ 131 (527) T protein:vir:10 75 -EAKMRFLGQ-GLKWEF----SKK-D----------A------KVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLI 131 (527) T ss_pred -CCcceeecc-Cccccc----cch-h----------H------HHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEe Confidence 333333333 444431 100 0 0 2344445677889999999999999999999999998 Q ss_pred ccccc-cCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHH-----hhcccCCCcce------E--E Q lcl|NC_015159. 151 STEQV-EGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLE-----EAQGDQNPSEE------V--T 216 (532) Q Consensus 151 ~~~~~-~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~-----~~~~~~~~~~~------v--~ 216 (532) .|.+. ++..++.+.+-.+.|+..+|++| ++.+-+.+....=.+|++-.+.++ +-.+..+++.. + + T Consensus 132 wD~~k~~~~R~~v~~~DP~~~f~~ed~d~-~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt 210 (527) T protein:vir:10 132 GDDEKDEGSRLSLHEVDPSTYFPYEDPRY-PGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYT 210 (527) T ss_pred eccCCCcCCCceEeecCcceeeeeecCCC-CCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeee Confidence 77544 34567888888899988888866 343333222111124443332111 11111122221 1 1 Q ss_pred EEEEE--EeeC-CC-----CeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHH Q lcl|NC_015159. 217 IYTHV--YRDP-EA-----MVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYE 288 (532) Q Consensus 217 i~~~v--~~~~-~~-----~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~ 288 (532) ..|+- .++. +. --|+. .+++...+. .-.++.-.|++.++=...++++||+|=..+.+.-+..||.... T Consensus 211 ~~~w~lg~w~d~~e~p~~~~~~~~--~~~~~~l~~--lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~T 286 (527) T protein:vir:10 211 EELYEPGKWDDRPESPLEPDDIKK--LSTLTEEEP--LPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMT 286 (527) T ss_pred eceeeccccccccccccchhhhhh--hcCceeeec--ccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhh Confidence 11111 0110 00 00111 122332221 2234455888888778899999999999999999999998888 Q ss_pred HHHHHHHHHhcCceeecCccccChhhhcc-----CCCceeec-CccccccccccCCccchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_015159. 289 AIVKMSMISSKVLFFVNPNGVTQIRRVAK-----ANTGDFVA-GRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFML 362 (532) Q Consensus 289 ~~l~~~~~a~~p~~lv~~~g~~~~~~~~~-----~~~G~~v~-g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~ 362 (532) .....+...-.|+... +|+...+.... -+||.++. +..+... .+....++......+..+..+|...=-. T Consensus 287 d~s~is~~sG~Pi~~~--tg~~~vd~~G~~~~~~VgPG~iweL~e~ak~~--~v~~~~~la~~~~h~~~L~~~l~~vA~~ 362 (527) T protein:vir:10 287 DEDLIMVFGGLGFYAT--DSAPPRDSRGNMVPWTISPLGMVEHGQNNKIY--RVNGVASLEPSQTHMTKAEEAMQQTKGI 362 (527) T ss_pred HHHHHHHHhCCceeee--cccccccccCCcCccccCCceeEecCCCccee--eccchhhhHHHHHHHHHHHHHHHHhhcC Confidence 8888887777776544 34433221100 11344332 2222221 2233335555555555555555432100 Q ss_pred h--hcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHH-HHHHHHHHHH---------HHHhcCCCCCCcccccccee Q lcl|NC_015159. 363 N--SAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQEL-QLPLVKILLK---------ELQATSKIPNLPKEAVEPAI 430 (532) Q Consensus 363 ~--~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~-l~Pli~r~~~---------il~r~g~lp~~p~~~~~~~~ 430 (532) - .+...|..+ --+.+ .+...|+|++.|.+..- +.-.+.|-|. ....-+.-+-.+.-.+++. T Consensus 363 PavA~G~vD~s~-~~SG~-----ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~iv- 435 (527) T protein:vir:10 363 PDIAVGVVDAAV-AESGI-----ALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTIT- 435 (527) T ss_pred CeeeeccccCCc-CcHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEE- Confidence 0 011112222 11222 23445666666666542 2223332221 1111111000000012222 Q ss_pred ecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015159. 431 ATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQ 510 (532) Q Consensus 431 v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~ 510 (532) -.+.-|.-+++-++++....+. .+ +-...+++.+.+.-|+. -.++|++...+++.+++.+. +.+ T Consensus 436 f~p~lP~D~~avie~v~tL~~a--Gi-------~S~~tAv~~L~~~~g~e-----D~E~E~~~I~~era~~a~a~--a~A 499 (527) T protein:vir:10 436 FRDPKPVNSEKRFNQLLQLWEA--GL-------IPAKKLTEELSKIMGFE-----LTEEDFKQATEDKKTQGIAQ--AEA 499 (527) T ss_pred ecccCCCCHHHHHHHHHHHHHc--Cc-------hhHHHHHHHHHhccCCC-----ChHHHHHHHHHHHHHHhHHh--hhh Confidence 2344566677777766655442 11 34667788887777742 12345544433332222111 111 Q ss_pred hhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 511 MGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 511 ~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .+.-+ +.+...+|.+.+ T Consensus 500 ~~~~~-----a~~~~~~g~~~~ 516 (527) T protein:vir:10 500 ADPFG-----AQMAAEQGIPDE 516 (527) T ss_pred cCchh-----hhhccccCCCCC Confidence 11111 112223444444 No 93 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=98.38 E-value=9.2e-07 Score=53.67 Aligned_cols=436 Identities=11% Similarity=0.034 Sum_probs=190.8 Q ss_pred CCCC----------------------CCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc-----ccCCCC----C Q lcl|NC_015159. 1 MAEV----------------------EKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPS-----VFPSAT----A 49 (532) Q Consensus 1 m~~~----------------------~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~~~----~ 49 (532) ||+. -+...+.+........|-++-..-..+++.+.+|..-. +..... . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~ 80 (483) T protein:vir:12 1 MAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAV 80 (483) T ss_pred CccchhcCCceeecCcchhhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Confidence 3221 01111233333333333333223345566666665442 111100 0 Q ss_pred cccccccccccchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCCh Q lcl|NC_015159. 50 DGSTSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFR 129 (532) Q Consensus 50 ~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~ 129 (532) ...+...++..+-+...++..++.|++ .| +++...|.. +.+.| . .+...+|. T Consensus 81 ~~~~~~~ki~~n~~k~Ivd~~~~~l~G--~p-----~~~~~~d~~-------------~~~~l-------~-~~~~n~~~ 132 (483) T protein:vir:12 81 DPLKPDDRMITNFHANLVDQKVSYIVG--KP-----IAFKHTDDE-------------VVKRI-------D-EVLGNRFD 132 (483) T ss_pred cccccccccccchHHHHHHHHhhhhcc--cC-----ceeccCChH-------------HHHHH-------H-HHHhccHH Confidence 111223467778888888888877653 22 223333322 11112 1 22235788 Q ss_pred HHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecceEEEeeC--CCCCeEEEEEEEeecHHHhhHHHHHHHHhhcc Q lcl|NC_015159. 130 PTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERD--AYDNVLQIVTEDKIARAALPEDVRKSLEEAQG 207 (532) Q Consensus 130 ~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d--~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~ 207 (532) ....++.++..+||.+.+++..++ ++.+++++++..+.++.-| ..+++...+|.+... T Consensus 133 ~~~~~~~~~~~~~G~~y~~v~~d~---d~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~----------------- 192 (483) T protein:vir:12 133 DKLHSVLTGASNKGIEWLHPYLDE---EGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLE----------------- 192 (483) T ss_pred HHHHHHHHHHhhCCeEEEEEEEcC---CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEee----------------- Confidence 889999999999999987776542 3556788888877655444 457887766665421 Q ss_pred cCCCcceEEEEEE--E-EeeCCCCeEEE--EEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHH Q lcl|NC_015159. 208 DQNPSEEVTIYTH--V-YRDPEAMVFRS--YQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKS 282 (532) Q Consensus 208 ~~~~~~~v~i~~~--v-~~~~~~~~~~s--~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~ 282 (532) ....+++|+. | +...++..+.. ....++..+ .....++..+|++.++- +.+|.|=.+...+-+.. T Consensus 193 ---~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa 262 (483) T protein:vir:12 193 ---NETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKT--HFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDA 262 (483) T ss_pred ---cceEEEEEecCeEEEEEEeCCeeeeccccccccccc--ccccCCCCccceEEecC-----CCCCCCchhhHHHHHHH Confidence 0112333321 1 11111111111 111111111 12345677889887654 45799989999999999 Q ss_pred HHHHHHHHHHHHHHHhcCceeecCccccChhhhc-cCC-CceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 283 LENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVA-KAN-TGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAF 360 (532) Q Consensus 283 L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~-~~~-~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af 360 (532) ++.+.-......+....|.+++...+........ ..+ .+.+.....+++..+ ....+.......++.+++.|.+.- T Consensus 263 ~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~I~~~s 340 (483) T protein:vir:12 263 YNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTI--QVEVPVENSKKYLDELYQKIMLFG 340 (483) T ss_pred HHHHHHHHHHHHHHhcCceeeeecCCcccchhHHHhhhhccccccCCCCcceEE--eecCCHHHHHHHHHHHHHHHHHHh Confidence 9988888888888888998776432222211111 111 122322333444433 223355566667777776664432 Q ss_pred hh-hhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccc--ccceee--cchH Q lcl|NC_015159. 361 ML-NSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEA--VEPAIA--TGLE 435 (532) Q Consensus 361 ~~-~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~--~~~~~v--~~l~ 435 (532) .. +.....-+...|+.-+..+..-... .. .+.+. .+...+.+++.++.+.-- .+.+. +++.+. .+.+ T Consensus 341 ~~p~~~~~~~~~n~Sg~Al~~~~~~l~~-k~---~~~~~-~f~~~l~~~~~li~~~~~---~~~~~~~i~v~f~~~~p~~ 412 (483) T protein:vir:12 341 QAVDFSSDKFGSAPSGVALEFLYTNLNL-KA---DKLAR-KAKVAIQELLWFVFEHFD---IKGEHKDVDISFNYNKVAN 412 (483) T ss_pred CCCCCCccccccCcHHHHHHHHHHHHHH-HH---HHHHH-HHHHHHHHHHHHHHHHhc---CCCccceeeEEeCCCCCCC Confidence 11 1111111223455443322221111 11 22222 223333444444333211 12232 333331 2222 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHH Q lcl|NC_015159. 436 ALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAA 514 (532) Q Consensus 436 ~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~ 514 (532) ....+ +.+.. ++.+ +....++. .++. +.+ ++|++...++++.++.. +.... T Consensus 413 ~~~~a---~~~~k----l~Gi-------iS~et~~~----~~~~-----v~d~~~E~~ri~~E~~~~~~~-----~~~~~ 464 (483) T protein:vir:12 413 TELQV---QTAQQ----SMGI-------VSHETVLE----NHPF-----VEDLQAELERIEQEQMEYNKQ-----LPNLD 464 (483) T ss_pred HHHHH---HHHHH----Hhcc-------CchHHHHH----hCCC-----CCCHHHHHHHHHHHHHHHHhh-----ccccc Confidence 22222 22111 1221 22222222 2232 222 34444333333222111 11111 Q ss_pred HHHHHHhhcccccCCCCC Q lcl|NC_015159. 515 GGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 515 ~~~~~~~~~~~~~g~~~~ 532 (532) .+..-...-.++.+..++ T Consensus 465 ~~~~d~~~~~~~~~~~e~ 482 (483) T protein:vir:12 465 DGGADGAQQQERSNNKES 482 (483) T ss_pred ccccCCcccCCCCCcccC Confidence 111111122233344444 No 94 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=98.38 E-value=9.4e-07 Score=53.62 Aligned_cols=466 Identities=11% Similarity=0.076 Sum_probs=212.5 Q ss_pred CCCCCCCccCHHHHH-------HHHHHHHHHhhhHHHHHHHHHHhhcccccCCCC--Ccc-cccccccccchHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAA-------AAYNRLKNDRGAYETRAEDCATYTIPSVFPSAT--ADG-STSYTTPWQSIGARGLNNL 70 (532) Q Consensus 1 m~~~~~~~~~~~~~~-------~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~--~~~-~~~~~~~~dst~~~a~~~L 70 (532) |+..+|+--..+-+. -+-......| ...++.+.+|..-.-..-.+ ..+ .+....++++.+...+ T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~R---l~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~--- 74 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKAR---LASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLI--- 74 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHH---HHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHhh--- Confidence 766554322211110 0000001111 34445555555442110000 001 1122356777774333 Q ss_pred HHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeec Q lcl|NC_015159. 71 ASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIP 150 (532) Q Consensus 71 aa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~ 150 (532) .+.+-.+.| +..|+- +.. . + +|+..+...+++.|++....++-.+.++.|-|++++- T Consensus 75 -~~~~~~~~~-g~~~~~----~~~-~----------e------~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~ 131 (527) T protein:vir:10 75 -EAKMRFLGQ-GLKWEF----SKK-D----------A------KVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLI 131 (527) T ss_pred -CCcceeecc-Cccccc----cch-h----------H------HHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEe Confidence 333333333 444431 100 0 0 2444455677889999999999999999999999998 Q ss_pred ccccc-cCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHH-----hhcccCCCcce------E--E Q lcl|NC_015159. 151 STEQV-EGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLE-----EAQGDQNPSEE------V--T 216 (532) Q Consensus 151 ~~~~~-~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~-----~~~~~~~~~~~------v--~ 216 (532) .|.+. ++..++.+.+-.+.|+..+|++| ++.+-+.+....=.+|++-.+.++ +-.+..+++.. + + T Consensus 132 wD~~k~~~~R~~v~~~DP~~~f~~ed~d~-~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt 210 (527) T protein:vir:10 132 GDDEKDEGSRLSLHEVDPSTYFPYEDPRY-PGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYT 210 (527) T ss_pred eccCCCcCCCceEeecCcceeeeeecCCC-CCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeee Confidence 77544 34567888888899988888866 343333222111124443332111 11111122221 1 1 Q ss_pred EEEEE--EeeC-CC-----CeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHH Q lcl|NC_015159. 217 IYTHV--YRDP-EA-----MVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYE 288 (532) Q Consensus 217 i~~~v--~~~~-~~-----~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~ 288 (532) ..|+- .++. +. --|+. .+++...+. .-.++.-.|++.++=...++++||+|=..+.+.-+..||.... T Consensus 211 ~~~w~lg~w~d~~e~p~~~~~~~~--~~~~~~l~~--lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~T 286 (527) T protein:vir:10 211 EELYEPGKWDDRPESPLEPDDIKK--LSTLTEEEP--LPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMT 286 (527) T ss_pred eceeeccccccccccccchhhhhh--hcCceeeec--ccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhh Confidence 11111 0110 00 00111 122332221 2234455888888778899999999999999999999998888 Q ss_pred HHHHHHHHHhcCceeecCccccChhhhcc-----CCCceeec-CccccccccccCCccchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_015159. 289 AIVKMSMISSKVLFFVNPNGVTQIRRVAK-----ANTGDFVA-GRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFML 362 (532) Q Consensus 289 ~~l~~~~~a~~p~~lv~~~g~~~~~~~~~-----~~~G~~v~-g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~ 362 (532) .....+...-.|+... +|+...+.... -+||.++. +..+... .+....+++.....+..+..+|...=-. T Consensus 287 d~s~is~~sG~Pi~~~--tg~~~vd~~G~~~~~~VgPG~iweL~e~ak~~--~v~~~~~la~~~~h~~~L~~~l~~vA~~ 362 (527) T protein:vir:10 287 DEDLIMVFGGLGFYAT--DSAPPRDSRGNMVPWTISPLGMVEHGQNNKIY--RVNGVASLEPSQTHMNKAEEAMQQTKGI 362 (527) T ss_pred HHHHHHHHhCCceeee--cccccccccCCcCccccCCceeEecCCCccee--eccchhhhHHHHHHHHHHHHHHHHhhcC Confidence 8888887777776544 34433221100 11344332 2222221 2233335555555555555555432100 Q ss_pred h--hcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHH-HHHHHHHHHH---------HHHhcCCCCCCcccccccee Q lcl|NC_015159. 363 N--SAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQEL-QLPLVKILLK---------ELQATSKIPNLPKEAVEPAI 430 (532) Q Consensus 363 ~--~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~-l~Pli~r~~~---------il~r~g~lp~~p~~~~~~~~ 430 (532) - .+...|..+ --+.+ .+...|+|++.|.+..- +.-.+.|-|. ....-+.-+-.+.-.+++. T Consensus 363 PavA~G~vD~s~-~~SG~-----ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~iv- 435 (527) T protein:vir:10 363 PDIAVGVVDAAV-AESGI-----ALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTIT- 435 (527) T ss_pred CeeeeccccCCc-CcHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEE- Confidence 0 011112222 11222 23445666666666552 2223332221 1111111000000012222 Q ss_pred ecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015159. 431 ATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQ 510 (532) Q Consensus 431 v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~ 510 (532) -.+.-|.-+++-++++....+. .+ +-...+++.+.+.-|+. -.++|++...+++.+++.+. +.+ T Consensus 436 f~p~lP~D~~avie~v~tL~~a--Gi-------iS~etAv~~L~~~~g~e-----D~E~E~~~I~~era~~a~a~--a~a 499 (527) T protein:vir:10 436 FRDPKPVNNEKRFAQLLELWEA--GL-------IPAKKLTEELSKIMGFE-----LTEEDFRQATEDKKTQGIAQ--AEA 499 (527) T ss_pred ecccCCCCHHHHHHHHHHHHHc--Cc-------hhHHHHHHHHHhccCCC-----chHHHHHHHHHHHHHHhHHh--hhh Confidence 2344566677777766655442 11 34667888887777742 12345444333332222211 111 Q ss_pred hhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 511 MGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 511 ~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .+.-+ +.+...+|.+.+ T Consensus 500 ~~~~~-----a~~~~~~g~~~~ 516 (527) T protein:vir:10 500 ADPFG-----AQMAAEQGIPDE 516 (527) T ss_pred cCchh-----hhhccccCCCCC Confidence 11111 112223444444 No 95 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=98.36 E-value=1.1e-06 Score=53.28 Aligned_cols=429 Identities=13% Similarity=0.084 Sum_probs=180.0 Q ss_pred CCCC-------CCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccccc-ccccccchHHHHHHHHHH Q lcl|NC_015159. 1 MAEV-------EKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTS-YTTPWQSIGARGLNNLAS 72 (532) Q Consensus 1 m~~~-------~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~-~~~~~dst~~~a~~~Laa 72 (532) |.+. .+.+++. .++++ ...|+.+|+=-.|.+. ..+..+... ..+.--..+...++.+|+ T Consensus 19 ~~~~~~~~~~~~~i~~~~----~~~~~--------i~~~~~~Y~g~~~~~~-~~~~~~~~~~~~~~slnl~~~i~~~~A~ 85 (500) T protein:vir:98 19 TTQSLTNITDHPKIAISK----LEYDR--------ITTNLKYYKSDWDSVL-YLNTDGETKKRDLNHLPIARTAAKKIAS 85 (500) T ss_pred hcchhhhhhccccccCCH----HHHHH--------HHHHHHHhcCCCCCcc-cccCCCCcccCceeecchHHHHHHHHhh Confidence 2122 1122222 22333 3334444432112111 111111111 111112455556666665 Q ss_pred HHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_015159. 73 KLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPST 152 (532) Q Consensus 73 ~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~ 152 (532) -+.+-. | .+.++|. ...++| .+.+..++|+..+.+++.+..+.|.+++-+..+ T Consensus 86 lv~~e~--~-----~i~~~d~-------------~~~~~l-------~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d 138 (500) T protein:vir:98 86 LVFNEQ--A-----EIKVDDD-------------AANEFI-------SETLKNDRFNKNFERYLESCLALGGLAMRPYVD 138 (500) T ss_pred hhcCCc--c-----eEecCCh-------------HHHHHH-------HHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEe Confidence 443322 1 1233332 233344 356778899999999999999999998754433 Q ss_pred ccccCCcceEEEEecceEEE-eeCCCCCeEEEEEEEe-ecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEee--CCCC Q lcl|NC_015159. 153 EQVEGQSNAPKLYKLHNFVV-ERDAYDNVLQIVTEDK-IARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRD--PEAM 228 (532) Q Consensus 153 ~~~~~~~~~~~~~pl~~~~v-~~d~~G~vd~i~rk~~-~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~--~~~~ 228 (532) +..+.+..++...++- .-|..|.+...|.... .+. ..+.. +|+.++.. .++. T Consensus 139 ----~~~~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~--------------~~~~~------~yt~lE~h~~~~~~ 194 (500) T protein:vir:98 139 ----GDKVRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTI--------------NGKEV------YYTLIEFHEWQSSD 194 (500) T ss_pred ----CCceEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeee--------------cCCce------EEEEEEEEEEeCCc Confidence 2346688888888664 5566665554443221 111 00111 23333221 1222 Q ss_pred eEEEEEEE--------cCccccc---------ccccCccccCceEEEE----eeecCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_015159. 229 VFRSYQEI--------DGEIVAG---------TEGEYPLDSCPWIPVR----LIKMPNEDYGRSFVEEYLGDLKSLENLY 287 (532) Q Consensus 229 ~~~s~~~~--------~~~~~~~---------~~~~~g~~~~P~~~~R----w~~~~g~~YG~Gp~~~al~d~~~L~~l~ 287 (532) +|.+.+.. .|..+.. .-...|...-||..++ =+...+++||.|-...+.+.+..|+..- T Consensus 195 ~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~ 274 (500) T protein:vir:98 195 DYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTY 274 (500) T ss_pred eeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHH Confidence 33332211 1221110 0011222222222222 2334578899999999999999999988 Q ss_pred HHHHHHHHHHhcCceeecCccccChh-hhccCC---Cc-------eee--cCcccc-ccccccCCccchhHHHHHHHHHH Q lcl|NC_015159. 288 EAIVKMSMISSKVLFFVNPNGVTQIR-RVAKAN---TG-------DFV--AGRKQD-VEVFQLEKYNDFQVAKATADDIE 353 (532) Q Consensus 288 ~~~l~~~~~a~~p~~lv~~~g~~~~~-~~~~~~---~G-------~~v--~g~~~~-~~~~~~~~~~~~~~~~~~i~~~~ 353 (532) -+.....+. .+..+.|+++-+ ... +...+. +. .++ .+..++ .....+...-........++.+- T Consensus 275 s~~~~e~~~-g~~~i~v~~~~l-~~~~~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l 352 (500) T protein:vir:98 275 DEFMWEVKM-GQRRVAVPESLT-ALTVRTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGL 352 (500) T ss_pred HHHHHHHHh-CcceeeechHHh-cccCCCCCccccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHH Confidence 888775544 666767754322 211 000010 00 011 111111 01111111111122223333344 Q ss_pred HHHHHH--HhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccc--ccce Q lcl|NC_015159. 354 KRLSYA--FMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEA--VEPA 429 (532) Q Consensus 354 ~rI~~a--f~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~--~~~~ 429 (532) +.|... |=...+........|||||....+...+...-.-.. ...-|.-|++-++.+..-.++....+... +.+. T Consensus 353 ~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~-~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~ 431 (500) T protein:vir:98 353 SLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIVAL-VEQSLKELVISIFEIAKAYDLYQSEVPSMDNISIS 431 (500) T ss_pred HHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEE Confidence 434322 111112212223369999999988888887764333 34455666666655543222222111122 3333 Q ss_pred eecc--hHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 430 IATG--LEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTA 507 (532) Q Consensus 430 ~v~~--l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~ 507 (532) +--+ .+.-+ .++..+..++. .+ +....++ .+.+|+ |++|++++.++.+..+.. T Consensus 432 f~d~i~~d~~~---~~~~~~~~v~a--Gi-------~s~~~~i---~~~~g~-------~eeea~~~l~~i~~E~~~--- 486 (500) T protein:vir:98 432 LDDGVFTDRDA---ELDYWIKVVNA--GF-------GTREMAI---QKVLNV-------TEEKAQEIAAEINTGIVD--- 486 (500) T ss_pred eCCCCCCCHHH---HHHHHHHHHHc--CC-------CCHHHHH---HhcCCC-------CHHHHHHHHHHHHHhccc--- Confidence 3111 12222 22222222111 12 2223333 344565 455655554444332110 Q ss_pred HHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 508 GQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 508 ~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) . .....+++++.-| T Consensus 487 -----~------~~~~~~~~~~~g~ 500 (500) T protein:vir:98 487 -----E------INQQRTDTHLYGE 500 (500) T ss_pred -----c------CCCCCccccccCC Confidence 0 0111112222222 No 96 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=98.36 E-value=1.1e-06 Score=53.28 Aligned_cols=429 Identities=13% Similarity=0.084 Sum_probs=180.0 Q ss_pred CCCC-------CCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccccc-ccccccchHHHHHHHHHH Q lcl|NC_015159. 1 MAEV-------EKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTS-YTTPWQSIGARGLNNLAS 72 (532) Q Consensus 1 m~~~-------~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~-~~~~~dst~~~a~~~Laa 72 (532) |.+. .+.+++. .++++ ...|+.+|+=-.|.+. ..+..+... ..+.--..+...++.+|+ T Consensus 19 ~~~~~~~~~~~~~i~~~~----~~~~~--------i~~~~~~Y~g~~~~~~-~~~~~~~~~~~~~~slnl~~~i~~~~A~ 85 (500) T protein:vir:30 19 TTQSLTNITDHPKIAISK----LEYDR--------ITTNLKYYKSDWDSVL-YLNTDGETKKRDLNHLPIARTAAKKIAS 85 (500) T ss_pred hcchhhhhhccccccCCH----HHHHH--------HHHHHHHhcCCCCCcc-cccCCCCcccCceeecchHHHHHHHHhh Confidence 2122 1122222 22333 3334444432112111 111111111 111112455556666665 Q ss_pred HHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_015159. 73 KLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPST 152 (532) Q Consensus 73 ~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~ 152 (532) -+.+-. | .+.++|. ...++| .+.+..++|+..+.+++.+..+.|.+++-+..+ T Consensus 86 lv~~e~--~-----~i~~~d~-------------~~~~~l-------~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d 138 (500) T protein:vir:30 86 LVFNEQ--A-----EIKVDDD-------------AANEFI-------SETLKNDRFNKNFERYLESCLALGGLAMRPYVD 138 (500) T ss_pred hhcCCc--c-----eEecCCh-------------HHHHHH-------HHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEe Confidence 443322 1 1233332 233344 356778899999999999999999998754433 Q ss_pred ccccCCcceEEEEecceEEE-eeCCCCCeEEEEEEEe-ecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEee--CCCC Q lcl|NC_015159. 153 EQVEGQSNAPKLYKLHNFVV-ERDAYDNVLQIVTEDK-IARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRD--PEAM 228 (532) Q Consensus 153 ~~~~~~~~~~~~~pl~~~~v-~~d~~G~vd~i~rk~~-~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~--~~~~ 228 (532) +..+.+..++...++- .-|..|.+...|.... .+. ..+.. +|+.++.. .++. T Consensus 139 ----~~~~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~--------------~~~~~------~yt~lE~h~~~~~~ 194 (500) T protein:vir:30 139 ----GDKVRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTI--------------NGKEV------YYTLIEFHEWQSSD 194 (500) T ss_pred ----CCceEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeee--------------cCCce------EEEEEEEEEEeCCc Confidence 2346688888888664 5566665554443221 111 00111 23333221 1222 Q ss_pred eEEEEEEE--------cCccccc---------ccccCccccCceEEEE----eeecCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_015159. 229 VFRSYQEI--------DGEIVAG---------TEGEYPLDSCPWIPVR----LIKMPNEDYGRSFVEEYLGDLKSLENLY 287 (532) Q Consensus 229 ~~~s~~~~--------~~~~~~~---------~~~~~g~~~~P~~~~R----w~~~~g~~YG~Gp~~~al~d~~~L~~l~ 287 (532) +|.+.+.. .|..+.. .-...|...-||..++ =+...+++||.|-...+.+.+..|+..- T Consensus 195 ~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~ 274 (500) T protein:vir:30 195 DYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTY 274 (500) T ss_pred eeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHH Confidence 33332211 1221110 0011222222222222 2334578899999999999999999988 Q ss_pred HHHHHHHHHHhcCceeecCccccChh-hhccCC---Cc-------eee--cCcccc-ccccccCCccchhHHHHHHHHHH Q lcl|NC_015159. 288 EAIVKMSMISSKVLFFVNPNGVTQIR-RVAKAN---TG-------DFV--AGRKQD-VEVFQLEKYNDFQVAKATADDIE 353 (532) Q Consensus 288 ~~~l~~~~~a~~p~~lv~~~g~~~~~-~~~~~~---~G-------~~v--~g~~~~-~~~~~~~~~~~~~~~~~~i~~~~ 353 (532) -+.....+. .+..+.|+++-+ ... +...+. +. .++ .+..++ .....+...-........++.+- T Consensus 275 s~~~~e~~~-g~~~i~v~~~~l-~~~~~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l 352 (500) T protein:vir:30 275 DEFMWEVKM-GQRRVAVPESLT-ALTVRTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGL 352 (500) T ss_pred HHHHHHHHh-CcceeeechHHh-cccCCCCCccccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHH Confidence 888775544 666767754322 211 000010 00 011 111111 01111111111122223333344 Q ss_pred HHHHHH--HhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccc--ccce Q lcl|NC_015159. 354 KRLSYA--FMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEA--VEPA 429 (532) Q Consensus 354 ~rI~~a--f~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~--~~~~ 429 (532) +.|... |=...+........|||||....+...+...-.-.. ...-|.-|++-++.+..-.++....+... +.+. T Consensus 353 ~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~-~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~ 431 (500) T protein:vir:30 353 SLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIVAL-VEQSLKELVISIFEIAKAYDLYQSEVPSMDNISIS 431 (500) T ss_pred HHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEE Confidence 434322 111112212223369999999988888887764333 34455666666655543222222111122 3333 Q ss_pred eecc--hHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 430 IATG--LEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTA 507 (532) Q Consensus 430 ~v~~--l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~ 507 (532) +--+ .+.-+ .++..+..++. .+ +....++ .+.+|+ |++|++++.++.+..+.. T Consensus 432 f~d~i~~d~~~---~~~~~~~~v~a--Gi-------~s~~~~i---~~~~g~-------~eeea~~~l~~i~~E~~~--- 486 (500) T protein:vir:30 432 LDDGVFTDRDA---ELDYWIKVVNA--GF-------GTREMAI---QKVLNV-------TEEKAQEIAAEINTGIVD--- 486 (500) T ss_pred eCCCCCCCHHH---HHHHHHHHHHc--CC-------CCHHHHH---HhcCCC-------CHHHHHHHHHHHHHhccc--- Confidence 3111 12222 22222222111 12 2223333 344565 455655554444332110 Q ss_pred HHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 508 GQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 508 ~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) . .....+++++.-| T Consensus 487 -----~------~~~~~~~~~~~g~ 500 (500) T protein:vir:30 487 -----E------INQQRTDTHLYGE 500 (500) T ss_pred -----c------CCCCCccccccCC Confidence 0 0111112222222 No 97 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=98.36 E-value=1.1e-06 Score=53.27 Aligned_cols=442 Identities=13% Similarity=0.068 Sum_probs=184.8 Q ss_pred CC-----CCCCC---ccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccc-cccccccchHHHHHHHHH Q lcl|NC_015159. 1 MA-----EVEKT---GFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGST-SYTTPWQSIGARGLNNLA 71 (532) Q Consensus 1 m~-----~~~~~---~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~-~~~~~~dst~~~a~~~La 71 (532) |. +.... ......-..+|.+.+.-|.-|+..|.++..+ ...+.. ...++--..+...++.+| T Consensus 14 ~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~---------~~~~~~~~~~~~slnl~~~i~~~~A 84 (522) T protein:vir:47 14 GRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYK---------NTDGDIKSRPMNHLPIARTASKKIA 84 (522) T ss_pred HHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCccccccc---------ccCcchhcccceecchHHHHHHHHh Confidence 11 11100 0001111233333333333333333332211 111111 111222245556666666 Q ss_pred HHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecc Q lcl|NC_015159. 72 SKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPS 151 (532) Q Consensus 72 a~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~ 151 (532) +-+.+-.. .++++|. .+.++| .+.+..++|+..+.+++....+.|.+++-+-. T Consensus 85 ~lv~~e~~-------~i~v~d~-------------~~~~~l-------~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~ 137 (522) T protein:vir:47 85 SLVYNEQA-------TITTKNE-------------ILQKFL-------DDMLTNDRFNKNFERYLESCLALGGLAMRPYI 137 (522) T ss_pred hhhcCCcc-------eeecCCh-------------HHHHHH-------HHHHhhcchHHHHHHHHHHhhccCCEEEEEEE Confidence 55544322 1222321 233344 46777899999999999999999998774333 Q ss_pred cccccCCcceEEEEecceEEE-eeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEee------ Q lcl|NC_015159. 152 TEQVEGQSNAPKLYKLHNFVV-ERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRD------ 224 (532) Q Consensus 152 ~~~~~~~~~~~~~~pl~~~~v-~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~------ 224 (532) + .+.+++..++...|+- ..|..|.+..++...... ..++...+||.++.. T Consensus 138 d----~~~~~i~~v~ad~~~P~~~~~~~~~e~a~~~~~~~-------------------~~~~~~~~yt~lE~he~~~~~ 194 (522) T protein:vir:47 138 D----GDKVRVAFIQAPVFFPLESNTQDVSSAAILTKTIK-------------------SEGRKNVYYTLVEFHEWVTAD 194 (522) T ss_pred c----CCceEEEEEcCCceEEEEEcCCceEEEEEEEEEEe-------------------ecccceeEEEEEEEeeecccc Confidence 2 3567888899988774 678888665544322210 001111122222111 Q ss_pred -------CCCCeEEEEEEE-c-------Cccccc-----------ccccCccccCceE-E---EEeee-cCCCccccchH Q lcl|NC_015159. 225 -------PEAMVFRSYQEI-D-------GEIVAG-----------TEGEYPLDSCPWI-P---VRLIK-MPNEDYGRSFV 273 (532) Q Consensus 225 -------~~~~~~~s~~~~-~-------~~~~~~-----------~~~~~g~~~~P~~-~---~Rw~~-~~g~~YG~Gp~ 273 (532) ..+.+|.+.+.. . |..++. .....+. .-|.+ . +.++. ..++.||+|-. T Consensus 195 ~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~~~-~~Plf~y~~~~~~N~~~~~splG~S~~ 273 (522) T protein:vir:47 195 GQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYKNLEPVTVFENL-SRPLFTYLKTPGMNNKDINSPLGLSIF 273 (522) T ss_pred cccccccccCCceEEEEEEeecCCCcccCccccccccccccCCCCceEeCCC-CcceEEEecCCcccccccCCCcCCchh Confidence 011122221111 0 111100 0001122 12322 2 12333 44789999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccC-----------CCceeecCc--cccc-cccccCCc Q lcl|NC_015159. 274 EEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKA-----------NTGDFVAGR--KQDV-EVFQLEKY 339 (532) Q Consensus 274 ~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~-----------~~G~~v~g~--~~~~-~~~~~~~~ 339 (532) ..+.+.++.||..--+...-.. ..+-...|+++ +++...-..+ ....+++.. .++. ....+... T Consensus 274 ~~~~~~id~lD~~~s~~~~e~~-~g~~~i~v~~~-~l~~~~~~~~g~~~~~~~fd~~~~~f~~~~~~~~~~~~i~~~~~~ 351 (522) T protein:vir:47 274 DNAKTTIDFINRSYDEFMWEVR-MGQRRVIVPEH-LTQRQYQRPDGTIDFRPRFDVEQNVYMQIGGSSMDAGGITDLTSP 351 (522) T ss_pred hhhHHHHHHHHHHHHHHHHHHH-hccceeecchH-HhccCCCCCCcccccccccCcccceEeecCCCCCCCCcceeeccc Confidence 9999999999976666555433 33444455432 2221110000 001122111 1110 11111111 Q ss_pred cchhHHHHHHHHHHHHHHHHH--hhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_015159. 340 NDFQVAKATADDIEKRLSYAF--MLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSK 417 (532) Q Consensus 340 ~~~~~~~~~i~~~~~rI~~af--~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~ 417 (532) -+.....+.++.+-+.|.... =...+........|||||..+.+...+...-.-..+ ...|..|+.-++.++.-.++ T Consensus 352 ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi~s~~~~~~~t~~~~~~~~-~~al~~lv~~i~~l~~~~~~ 430 (522) T protein:vir:47 352 IRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEIVSENSDTYQMRSSIVALV-EQSIKELCVSMCELGKAVGV 430 (522) T ss_pred cChHHHHHHHHHHHHHHHHHhCCCccccCccccccccHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhhhh Confidence 122223334444444443321 111222223334699999999999999887744444 44667777777766644444 Q ss_pred CCCCccccccceeecc--hHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHH Q lcl|NC_015159. 418 IPNLPKEAVEPAIATG--LEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKM 495 (532) Q Consensus 418 lp~~p~~~~~~~~v~~--l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~ 495 (532) +-..+.+..++.+.-+ +.. .+...++..++.++ +.+ +....++. +.+|+ |++|.+++. T Consensus 431 ~~~~~~~~~~i~v~f~D~i~~-D~~~~~~~~~~~v~--aG~-------~s~e~~i~---~~~g~-------~eeea~~el 490 (522) T protein:vir:47 431 YSGEIPELDDISVNLDDGVFT-DRHAELDYWAKMVA--AGF-------STKKRAIG---KTLNI-------SGVEAEKEL 490 (522) T ss_pred ccCCCCCcceeEEEcCCCCCC-CHHHHHHHHHHHHh--cCC-------CCHHHHHH---hcCCC-------ChHHHHHHH Confidence 4333333333433222 111 11122222222111 111 22333333 34565 445555554 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 496 AEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 496 ~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) ++.+..+.. +.+... ...++..++..-.-+ T Consensus 491 ~ri~~E~~~-----~~~~~~--~~~~~~~~~~~~~d~ 520 (522) T protein:vir:47 491 NAINSELLP-----MNDAEL--AIYGMHDQNEEKADD 520 (522) T ss_pred HHHHHhhcc-----CCCCCC--CCCCCCCcccccCCC Confidence 443322111 111100 011111111111111 No 98 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=98.33 E-value=1.3e-06 Score=52.93 Aligned_cols=430 Identities=9% Similarity=0.043 Sum_probs=203.4 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccccc---CCC-----C-----CcccccccccccchHHHHHHHHHHHHH Q lcl|NC_015159. 9 FAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVF---PSA-----T-----ADGSTSYTTPWQSIGARGLNNLASKLM 75 (532) Q Consensus 9 ~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~---~~~-----~-----~~~~~~~~~~~dst~~~a~~~Laa~l~ 75 (532) ++.+.+.+.-+.+...++....+++.+.+|..-.-- ... . ..-.+...++-.+-+...++..++.|+ T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 80 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVA 80 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhhee Confidence 888888888888888888778888888888665310 000 0 011122345666666666666666554 Q ss_pred HhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_015159. 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQV 155 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~ 155 (532) + -|+ .+...+.... ..++. .+. .||...+.++.++...+|.+.+++-.+. T Consensus 81 G--~p~-----~~~~~d~~~~---------~~l~~-----------~~~-~~~~~~~~~l~~~~~~~G~a~~~~y~d~-- 130 (470) T protein:vir:10 81 S--VFP-----DIDVGKDADN---------KKIID-----------VLG-DDRALTLNGLLVDSSNAGRAWLHYWIDE-- 130 (470) T ss_pred c--cce-----eeecCchHHH---------HHHHH-----------HHh-hhHHHHHHHHHHHHhhcCeeEEEEEecC-- Confidence 4 222 2333332211 12332 232 4677788888899999999988766543 Q ss_pred cCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEE--E-EeeC--CCC Q lcl|NC_015159. 156 EGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTH--V-YRDP--EAM 228 (532) Q Consensus 156 ~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~--v-~~~~--~~~ 228 (532) .+.+++..++..+.++..|. .+++..++|.+...-.. .......+++|+. + +... .+. T Consensus 131 -~~~~~~~~~~p~~~~~v~d~~~~~~~~a~ir~y~~~~~~--------------~~~~~~~~e~yt~~~~~~~~~~~~~~ 195 (470) T protein:vir:10 131 -DGNFRYGIIQPDQITPIYATTLDNKLLGILRSYKQLDPD--------------SGKYFTVHEYWTDKEAQFFRTNATDS 195 (470) T ss_pred -CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeecC--------------CceEEEEEEEEcCCcEEEEEeecCcc Confidence 34567888887776555543 47777666555421110 0001122333320 1 1111 100 Q ss_pred -------eEEEEE---EEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015159. 229 -------VFRSYQ---EIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISS 298 (532) Q Consensus 229 -------~~~s~~---~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~ 298 (532) .+.++. ..++..+ .....+|..+|++.++= +.+|.|=.+...+-+..++.+.-.......... T Consensus 196 ~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~ 268 (470) T protein:vir:10 196 TVIEPYNIITSYDLSAGYETGQS--NTLKHNFGRVPFIEFSK-----NKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQ 268 (470) T ss_pred eeccccccccccccccccccccc--cccccCCCeeeEEEeec-----CCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhc Confidence 000100 0111111 11234667788876653 468999999999999999999999999999999 Q ss_pred cCceeecCccccChhhh-ccCC-Cceee-cCc--cccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCC Q lcl|NC_015159. 299 KVLFFVNPNGVTQIRRV-AKAN-TGDFV-AGR--KQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRV 373 (532) Q Consensus 299 ~p~~lv~~~g~~~~~~~-~~~~-~G~~v-~g~--~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~ 373 (532) +|.+++..-+..+.... .... .|.+. +.. .....+..+....+.......++.+++.|-+.-..-.+...+.... T Consensus 269 ~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~gn~ 348 (470) T protein:vir:10 269 TVILVLTNYGGADLHQFMNDLRKYKSIKINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDPANFESSNA 348 (470) T ss_pred CcceeeecCCccccchhhhhhhhcCeEeccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCCCccccccc Confidence 99988865333222222 1111 12332 221 1122233344445667777888888887765322111111222335 Q ss_pred CHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCccccccceee--cchHHHHHHHHHHHHHHHH Q lcl|NC_015159. 374 TAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQAT-SKIPNLPKEAVEPAIA--TGLEALGRGHDLNKLNVFI 450 (532) Q Consensus 374 TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~-g~lp~~p~~~~~~~~v--~~l~~l~raq~~~~l~~~~ 450 (532) |+..+..+..-.... ..+... .+.+.+.+++.++.+. |. -..+...+++.+. .+.+.+..++-+.. T Consensus 349 Sg~Alk~~~~~l~~k----~~~~~~-~~~~~l~~~~~~i~~~l~~-~~~d~~~i~i~f~~~~p~d~~e~~~~~~~----- 417 (470) T protein:vir:10 349 SGVAIKMLYSHLELK----AAKTQT-YFEHAINELVRAIMRYLNF-SDADKRHISQHWTRTKVEDSLTKAQIVST----- 417 (470) T ss_pred hHHHHHHHHHHHHHH----HHHHHH-HHHHHHHHHHHHHHHHhcc-cCcccceeeEEeccCCCCCHHHHHHHHHH----- Confidence 555554433222222 222222 2222334444433221 11 1122223444432 23333333222211 Q ss_pred HHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCC Q lcl|NC_015159. 451 DYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGL 529 (532) Q Consensus 451 ~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 529 (532) +++ .+.-..++. .++ ++.+ ++|+++..++++.++. .. +..+.. +..|. T Consensus 418 --~~g-------~iS~et~l~----~~p-----~v~D~~~E~eri~~E~~e~~~--~~-~~~~~~----------~~~~~ 466 (470) T protein:vir:10 418 --VAN-------YSSKEAVAK----ANP-----IVDDWQQELKDLAKDKEENDP--YS-NQADEL----------NGKGV 466 (470) T ss_pred --Hhc-------cCcHHHHHH----hCC-----CCCCHHHHHHHHHHHHHHHHH--hh-cccccc----------CCCCC Confidence 111 122333332 223 2333 2344333332222111 11 111111 11122 Q ss_pred CCC Q lcl|NC_015159. 530 PTQ 532 (532) Q Consensus 530 ~~~ 532 (532) -.+ T Consensus 467 dde 469 (470) T protein:vir:10 467 NDE 469 (470) T ss_pred CCC Confidence 222 No 99 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=98.24 E-value=2.2e-06 Score=51.59 Aligned_cols=488 Identities=12% Similarity=0.093 Sum_probs=202.7 Q ss_pred CCCC--CCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCC-CCcccccccccccchHHHHHHHHHHHHHHh Q lcl|NC_015159. 1 MAEV--EKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSA-TADGSTSYTTPWQSIGARGLNNLASKLMLA 77 (532) Q Consensus 1 m~~~--~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~-~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 77 (532) |++- .-....-+++.+||...-+.=..+...|++-.+.+--...... +.++..+. | |.|.|.++.. T Consensus 1 m~~~~~~~~~~tpe~la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~~~~~~~r---~--------nl~~sni~~i 69 (663) T protein:vir:34 1 MNESQPTDFADTPQGWAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERDSAHDAETR---W--------NLFSTNIQTQ 69 (663) T ss_pred CCccccccchhcchhHHHHHHHHHHHHHhccchHHHHHHHHHHHhhccccCCCccccc---c--------chhhhhHHHH Confidence 9882 2223345778889977655433344455544444433222111 11111111 1 4444444332 Q ss_pred h--------cCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHH--HhcCChHHHHHHHHHHHhhCceee Q lcl|NC_015159. 78 L--------FPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYM--ESNSFRPTLHAAIKQLLVAGNVLL 147 (532) Q Consensus 78 l--------tpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l--~~snf~~~~~~~~~dl~~~G~~~~ 147 (532) + -|.-+|=|. |.+. .-.+..-+.+|+.+...| +..+|+..+..+..+.+..|-|++ T Consensus 70 ~P~iYar~P~p~V~~rf~----d~d~----------~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~ 135 (663) T protein:vir:34 70 MASLYGQTPKVSVSRRFA----DADD----------DVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLC 135 (663) T ss_pred hhhhhcCCCcceeeeccc----Cccc----------chhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceE Confidence 2 111222222 1110 013444455666666666 457799999999999777777766 Q ss_pred eecc----------------cccc------------cCCcceEEEEecceEEEee-CCCCCeEEEEEEEeecHHHhhHHH Q lcl|NC_015159. 148 YIPS----------------TEQV------------EGQSNAPKLYKLHNFVVER-DAYDNVLQIVTEDKIARAALPEDV 198 (532) Q Consensus 148 ~v~~----------------~~~~------------~~~~~~~~~~pl~~~~v~~-d~~G~vd~i~rk~~~~~~~l~~~~ 198 (532) ||-= ..+. ...++.+..|.-.+|.+.- -.--.|+=|.++-.|+.+++-+.| T Consensus 136 ~v~Ye~~~~~~~~~~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~dfl~~pAr~W~ev~wva~r~~mtk~e~~~rf 215 (663) T protein:vir:34 136 RIRYEVEWEEVAGVDAILDEATGAELAAAVPPTQRKAYECVETDYLHWQDVLWSPARVWHEVRWLAFRNLLDMREFNARF 215 (663) T ss_pred EEEeecccchhccccccCCCccccchhcccccchhhcccceeeeeechhhcccchhhccccccceeeeccCCHHHHHHhh Confidence 6521 0000 0113344444444453322 112368888999999999998877 Q ss_pred HHHHHhhc--------------cc--CCCcceEEEEEEEEeeCCCCeEEEEEEEcCcccc-----cccccCccccCceEE Q lcl|NC_015159. 199 RKSLEEAQ--------------GD--QNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVA-----GTEGEYPLDSCPWIP 257 (532) Q Consensus 199 ~~~~~~~~--------------~~--~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~-----~~~~~~g~~~~P~~~ 257 (532) +....+.. .. .++..+..|+. ||-+..+ +.|+.++|.... ..++.-||--||+.. T Consensus 216 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~VwE-IWdK~~~---~V~w~~eg~~~~L~~~~p~lgl~~ffPcPrpl 291 (663) T protein:vir:34 216 DADGSRNLWASVPKVGKPKDGKDGQSCHPWDRAEVWE-IWDKGGR---KVDWYVEGYSAVLDTQPDPLGLESFFPCPKPL 291 (663) T ss_pred cCChhhhhhhhccCcCCccccCCCCCcchhcCcceeE-EEecCCc---EEEEEEcCcceecccCCCCCCCCCCCCCcccc Confidence 54332110 00 11223455433 4443343 223333554321 223455676789887 Q ss_pred EEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccCh-hhhccCCCceeecC-------ccc Q lcl|NC_015159. 258 VRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQI-RRVAKANTGDFVAG-------RKQ 329 (532) Q Consensus 258 ~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~-~~~~~~~~G~~v~g-------~~~ 329 (532) .-....++-+=+-..+ -+=.-++.+|.++..+-. ...+++|.++++.+...+. +.+.++..+.++|- ..+ T Consensus 292 ~~~~~~ds~ipvpd~~-~y~~~~~E~n~~t~Rin~-l~d~ikv~gvy~~~~g~~i~~~l~~a~~n~lvpV~~~~~~~~~g 369 (663) T protein:vir:34 292 LANWTTDKVVPRPDFV-LAQDLYKEIDLVSTRITL-LERAIRVVGVYDKSSGLTIGRLLSEAAQNDLIPVENWLTFADKG 369 (663) T ss_pred cceecCCCeecCCcHH-HHHHHHHHHHHHHHHHHH-HHhhhhhceeeccccchhHHHHHHHhhCCCceecchhhhhhhhc Confidence 7777666544444444 677778889988776655 7889999999974433222 33545555565552 122 Q ss_pred c----ccccccCCccchhHHHHHHHHHHHHHHHHHhh-hhc--ccCCC--CCCCHHHHHHHHHHHHHHhhhhHHHHHHH- Q lcl|NC_015159. 330 D----VEVFQLEKYNDFQVAKATADDIEKRLSYAFML-NSA--VQRGG--DRVTAEEIRYVAGELEDTLGGVYSLLSQE- 399 (532) Q Consensus 330 ~----~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~--~~~~~--~~~TAtEi~~r~~E~~~~LGpv~~rl~~E- 399 (532) + +.-+|+. ....+...+-+.+..|+...+. +.+ ..|++ .+-||||-.... +.++.-+...++| T Consensus 370 g~~k~I~~~pi~---~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rga~~a~ETatAQ~IKs----q~gS~RIqe~qdev 442 (663) T protein:vir:34 370 GLRGVVDWFPLE---PVVAALTSLRDYRRELVDALHQVTGMADIMRGASDPRETAMAQGVKA----KFGSIRLQRLQDEV 442 (663) T ss_pred Cccchhhcccch---hHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhcccCcchhhHHHHHHH----HHHhHHHHHHHHHH Confidence 2 2222222 1222223333445556555542 222 12332 223555433222 4444444444444 Q ss_pred --HHHHHHHHHHHHHHh-----------cCCCCC---Cc---c-------ccccceeecc--h--HHHHHHHHHHHHHH- Q lcl|NC_015159. 400 --LQLPLVKILLKELQA-----------TSKIPN---LP---K-------EAVEPAIATG--L--EALGRGHDLNKLNV- 448 (532) Q Consensus 400 --~l~Pli~r~~~il~r-----------~g~lp~---~p---~-------~~~~~~~v~~--l--~~l~raq~~~~l~~- 448 (532) |..-++.-.-.+|.. .+.||. +. . ..+.+.+.+. + +.++..+....++. T Consensus 443 qR~arDi~ql~AEIl~~~~~~etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~D~~~eK~~~~E~l~~ 522 (663) T protein:vir:34 443 ARFASDIQRLKAEVIAEHYDVASILAQANAEFTFDKELAPKAAELIKSRFSMYRVEVKPEAVSLQDFAALRNEKMEVLSG 522 (663) T ss_pred HHHHHHHHHHHHHHHHHhcCHHHHHHHhcCCCCcccchhHHHHHHhcCCCcceeeeeccCCCCcCChHHHHHHHHHHHHH Confidence 222222222222221 233441 10 0 1233333221 1 23444333332222 Q ss_pred ------HHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC--------HHHH--------------HHHH--HHH Q lcl|NC_015159. 449 ------FIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT--------QQDK--------------QAKM--AEA 498 (532) Q Consensus 449 ------~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s--------~ee~--------------~~~~--~q~ 498 (532) -++.+++..|.... =.-++++.-... ++...-+.+ .+++ ++++ +|+ T Consensus 523 i~~~~qq~~pl~~q~p~~~p--~l~Ellk~~~~~--f~~~~qie~ai~~~~~~~e~aa~~~~~~~pa~~~~~~k~~~~q~ 598 (663) T protein:vir:34 523 IASFMQGVAPLAQQVPGSAP--FLLQMLKWSVSG--LRGSSTIEGVLDKAIAAAEEAQKQAAQQSPAPQQPDPKVVAQAM 598 (663) T ss_pred HHHHHHHHHHHHHhhhhhHH--HHHHHHHHHhhc--CChhhhHHHHHHHHHhhhHHHhhccCCCCcccchhhHHHHHHHH Confidence 22223333332211 011122221111 111111111 1111 1111 111 Q ss_pred HHHHHHHHHHHhhhH---HHHH--HHHhhcccccCCCCC Q lcl|NC_015159. 499 STAAGMVTAGQQMGA---AGGQ--AAAAMMQQQAGLPTQ 532 (532) Q Consensus 499 ~~~~~~~~~~~~~~~---~~~~--~~~~~~~~~~g~~~~ 532 (532) ..+...+.++..+.. ..+. ......+..+ +.+ T Consensus 599 k~q~~~aeAq~e~q~~~~~~ql~~~~~~~k~~~~--a~~ 635 (663) T protein:vir:34 599 KGQQEMAKVQAEVQGDLLRIQAETQANETKERQQ--AEW 635 (663) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHH Confidence 000000000000000 0000 0000000000 111 No 100 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=98.22 E-value=2.5e-06 Score=51.30 Aligned_cols=452 Identities=11% Similarity=0.043 Sum_probs=176.5 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCC---C-Cccccc-ccccccchHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSA---T-ADGSTS-YTTPWQSIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~---~-~~~~~~-~~~~~dst~~~a~~~Laa~l~ 75 (532) +.-++- .++.+.+...-..|-.....-.++.+.+.+|..-...... + .+..+. ..+...+-+..+++.++..| T Consensus 16 ~~~p~~-~~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~l- 93 (501) T protein:vir:25 16 VEFPED-SMSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQNL- 93 (501) T ss_pred ccCCcc-cCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHhhh- Confidence 222211 1233333333333333333333555666666432211000 0 011111 11233455566666666544 Q ss_pred HhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_015159. 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQV 155 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~ 155 (532) ++.+ |++ +|.... + .+++....++|....+++.++..+||.|.++|-.++. T Consensus 94 ---~~~g---f~~--~d~~~~---------~-----------~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~- 144 (501) T protein:vir:25 94 ---SVVG---YRN--ALAKEN---------D-----------PAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDE- 144 (501) T ss_pred ---cccc---eec--CCccch---------H-----------HHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCC- Confidence 3433 433 221111 1 1234556788999999999999999999888866532 Q ss_pred cCCcceEEEEecce-EEEeeCCC--CCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEE--EEEEee------ Q lcl|NC_015159. 156 EGQSNAPKLYKLHN-FVVERDAY--DNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIY--THVYRD------ 224 (532) Q Consensus 156 ~~~~~~~~~~pl~~-~~v~~d~~--G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~--~~v~~~------ 224 (532) + ..+++++..+ +++-.|+. .++...++......+ .+....+++| ++++.- T Consensus 145 -~--~~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~----------------~~~~~~~~~y~~~~~~~~~~~~~~ 205 (501) T protein:vir:25 145 -G--PVFRTRSPRQILAVYADPSVDAWPQYALETWVAQKD----------------AKPHRRGVLYDDTYMYELDLGEVV 205 (501) T ss_pred -C--CeEEEeccccEEEEEecCCCCcceeEEEEEEeeccc----------------cCcceeEEEecCeeEEEEecCcee Confidence 2 2455666554 45555543 234444443321111 0111122222 112111 Q ss_pred ---CCCCeEEEEEEEc---CcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015159. 225 ---PEAMVFRSYQEID---GEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISS 298 (532) Q Consensus 225 ---~~~~~~~s~~~~~---~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~ 298 (532) .....|....... +..........+|..||++.+.=+. ..+.+|+|=.+..++-+..+|...-..+..++..+ T Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~-~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a 284 (501) T protein:vir:25 206 LGDAGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGR-DADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGA 284 (501) T ss_pred eeeccccccccccccccccccccccccccCCccceeeEeccCcc-ccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhc Confidence 1111111111111 1111122234567788998765443 33568999888888888999988888888888888 Q ss_pred cCceeecCccccChhhhccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHH-HhhhhcccCCCCCCCHHH Q lcl|NC_015159. 299 KVLFFVNPNGVTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYA-FMLNSAVQRGGDRVTAEE 377 (532) Q Consensus 299 ~p~~lv~~~g~~~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~a-f~~~~~~~~~~~~~TAtE 377 (532) .|...+.-- ..+..+......|.+..-..+++...++. .++++.-...++.+...|... ..-+..........++.- T Consensus 285 ~p~~~i~G~-~~~~~~~~~~~~~~i~~~~~~~~~~~q~~-~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~A 362 (501) T protein:vir:25 285 NPQRVISGW-TGSKAEVLKASALRVWTFEDPEVKAQAFP-PASVEPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAEA 362 (501) T ss_pred cHHHHHhCC-CCCccchhhhcccceeccCCCCceEEEec-ccChHHHHHHHHHHHHHHHhhcCCChhhhccccCChHHHH Confidence 876443211 11222222334454433222233333433 234544444444444433221 011111111112235554 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCccccccceee-cchHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015159. 378 IRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQA-TSKIPNLPKEAVEPAIA-TGLEALGRGHDLNKLNVFIDYMIK 455 (532) Q Consensus 378 i~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~g~lp~~p~~~~~~~~v-~~l~~l~raq~~~~l~~~~~~laq 455 (532) +.....-+.+. ..+.+..|-..| .+.+.++.. .|. .. +.+...++++ ....+-..++.++.+ ..+.+ T Consensus 363 l~~~~~~l~~k----a~~k~~~f~~~l-~~~~rl~~~~~~~-~~-~~~~~~i~v~w~~~~~~s~~~~ada~----~kl~~ 431 (501) T protein:vir:25 363 LAAAEANQQRK----LAAKRESFGESW-EQLLRLAAEMDDD-PD-TAADSGAEVLWRDTEARSFGAVVDGI----TKLAS 431 (501) T ss_pred HHHHHHHHHHH----HHHHHHHHHHHH-HHHHHHHHHHhCC-Cc-cccceeeeEEecCCCCCCHHHHHHHH----HHHHh Confidence 43332222221 223233222222 222222211 121 11 1223333332 111121222222222 22222 Q ss_pred hcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHH---HHHHHHhhhHHHHH---HHHhhcccccCC Q lcl|NC_015159. 456 LAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAG---MVTAGQQMGAAGGQ---AAAAMMQQQAGL 529 (532) Q Consensus 456 ~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~---~~~~~~~~~~~~~~---~~~~~~~~~~g~ 529 (532) + + |-...+ +....|+++. +++.++++++.+.. ..++....+.+... .....-.+.+|. T Consensus 432 ~-g-----is~et~---~~~~~g~~~~-------~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (501) T protein:vir:25 432 A-G-----IPIEHL---LSMVPGMTQQ-------TIQAIKDSLRGGEVKSLVDKLLSNEPAPVPPPPPQAAAQALNEGGV 495 (501) T ss_pred c-C-----CCHHHH---HHHcCCCCHH-------HHHHHHHHHHHHhHHHHHHHhhccCcCCCCCCCCCCCccccccccC Confidence 2 1 111222 2234687653 33222222222111 11111111111000 000000011111 Q ss_pred CCC Q lcl|NC_015159. 530 PTQ 532 (532) Q Consensus 530 ~~~ 532 (532) -.. T Consensus 496 ~~~ 498 (501) T protein:vir:25 496 NGN 498 (501) T ss_pred CCC Confidence 111 No 101 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=98.20 E-value=2.7e-06 Score=51.08 Aligned_cols=431 Identities=13% Similarity=0.078 Sum_probs=177.9 Q ss_pred CCCCCC---CccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCC---CcccccccccccchHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEK---TGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSAT---ADGSTSYTTPWQSIGARGLNNLASKL 74 (532) Q Consensus 1 m~~~~~---~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~---~~~~~~~~~~~dst~~~a~~~Laa~l 74 (532) |-.++. ++++.+. ..-.+.|..+.....++.+.+.+|..-......- -+..-+.-+..-+-+..+++.|+..| T Consensus 1 ~~~~~~~~~~gl~~~~-~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~rl 79 (474) T protein:vir:81 1 MIQQQTVRIPSLSNDE-NALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDALARRC 79 (474) T ss_pred CcCCCcCcCCCCChhH-HHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHHHHhhh Confidence 444422 2333221 1122333333333444555666664333211111 11110111234455666777776654 Q ss_pred HHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 75 MLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 75 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) .-- + |+. +|.+.. . ..+++...++++.....+++++..+||.+.++|-.++ T Consensus 80 ~~~----G---f~~--~d~~~~----------~---------~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~- 130 (474) T protein:vir:81 80 NLE----G---FVW--PDGDLD----------S---------LGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGE- 130 (474) T ss_pred ccc----c---eEC--CCCCcc----------c---------hHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCC- Confidence 411 1 222 221110 0 1134566789999999999999999999998886543 Q ss_pred ccCCcceEEEEecceEEEeeCCC-CCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCc-ceEEEEE--EE-EeeCCCCe Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVVERDAY-DNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPS-EEVTIYT--HV-YRDPEAMV 229 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v~~d~~-G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~-~~v~i~~--~v-~~~~~~~~ 229 (532) +.....++++++..+.++..|+. +++...++... ...+.+ ..+.+|. .+ +...++-. T Consensus 131 d~~~~~~i~~~sp~~~~~~~D~~~~~~~~al~~~~------------------~~~~g~~~~~~ly~~~~~~~~~~~~~~ 192 (474) T protein:vir:81 131 DDEPEALIHVKDASEATGEWNRRRRGLNNLLSIID------------------KDKEGKVLSLALYLDNETVTAQRDKAT 192 (474) T ss_pred CCCceeEEEEeccceEEEEEeCCCCcceeeeEEEE------------------EcCCCcEEEEEEEeCCcEEEEEEcCcc Confidence 22334567888877665555653 33332222111 001111 1222221 11 11111111 Q ss_pred EEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCcc Q lcl|NC_015159. 230 FRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFV-EEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNG 308 (532) Q Consensus 230 ~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~-~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g 308 (532) +.+.. +. ..+++. +|++++..+..-++.+|+|-. +..++-+..+|...-..+..++..+.|...+- | T Consensus 193 ~~w~~--~~-------~~~~~g-vPvV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~--G 260 (474) T protein:vir:81 193 LKWQV--DR-------DEHVYG-VPAQVLPYKPAPKRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLL--G 260 (474) T ss_pred ceeee--cc-------CCCCCC-cceEEecccccccCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheee--c Confidence 11111 11 123333 799999999888999999865 57788899999999999999999999985442 1 Q ss_pred ccC---------hhhhccCCCcee--ecCcccc-cccc---ccC--CccchhHHHHHHHHHHHHHHHHHhhhhcc----- Q lcl|NC_015159. 309 VTQ---------IRRVAKANTGDF--VAGRKQD-VEVF---QLE--KYNDFQVAKATADDIEKRLSYAFMLNSAV----- 366 (532) Q Consensus 309 ~~~---------~~~~~~~~~G~~--v~g~~~~-~~~~---~~~--~~~~~~~~~~~i~~~~~rI~~af~~~~~~----- 366 (532) ... +........|.+ ++.+.+. +... .++ ..++++. .++.++.-|.......... T Consensus 261 ~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~l~~---~~~~l~~~~~~~a~~t~iP~~~lG 337 (474) T protein:vir:81 261 ADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFPAASPDA---HWSDINGLAKLFAREASLPDTAVA 337 (474) T ss_pred CChhhcccccccccchhhhhHHHHhcCCCcccccccccccccccccCCCChhH---HHHHHHHHHHHHHhhhCCCHHHhc Confidence 111 111111112222 2222221 1100 011 1123332 2333333333322111110 Q ss_pred cCC-CCCCCHHHHHH-------HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeec-chHHH Q lcl|NC_015159. 367 QRG-GDRVTAEEIRY-------VAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIAT-GLEAL 437 (532) Q Consensus 367 ~~~-~~~~TAtEi~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~-~l~~l 437 (532) ... ...-+|.-|.. +++++.+.+|.-+.+ ++...+.+.... -...++.+..+++++= ..+.- T Consensus 338 ~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~--------~~rla~~i~~~~-~~~~~~~~~~~~~v~W~d~~~~ 408 (474) T protein:vir:81 338 ISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRK--------AFIRALAMKNKV-AIDEIPDEWKSIDAKWRDPRYL 408 (474) T ss_pred ccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHhCCC-CccccchhhccceeEecCCCcc Confidence 001 11123433332 333444444443322 222233322111 1234555655554421 11111 Q ss_pred HHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHH Q lcl|NC_015159. 438 GRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQ 517 (532) Q Consensus 438 ~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~ 517 (532) +.++.++.+.-.. +..+..+ ..++. .+.+|++ ++|++..++.+++++......... +.+.. T Consensus 409 s~a~~aDa~~Kl~----~a~~~~~----~~~~~---~~~lg~t-------~~~i~~~~~~~~~~~~~~~~~~l~-~~~~~ 469 (474) T protein:vir:81 409 SKSAQADAGMKQL----AAVPWLA----ETEVG---LELIGLT-------PQQARRAMADKRRVQGRGTLQALI-DRSNN 469 (474) T ss_pred CHHHHHHHHHHHH----hcccCCC----cHHHH---HhhcCCC-------HHHHHHHHHHHHHHhHHHHHHHHH-hcCCC Confidence 2222222222222 2211111 01112 2335764 445544333332222221111111 11100 Q ss_pred HHHhhcccc Q lcl|NC_015159. 518 AAAAMMQQQ 526 (532) Q Consensus 518 ~~~~~~~~~ 526 (532) ++.+ | T Consensus 470 ~~~a----q 474 (474) T protein:vir:81 470 GATA----Q 474 (474) T ss_pred CCCC----C Confidence 0000 1 No 102 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=98.20 E-value=2.8e-06 Score=51.05 Aligned_cols=436 Identities=9% Similarity=-0.001 Sum_probs=194.8 Q ss_pred CCCC------CCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhh--cccccCCCCC----cccccccccccchHHHHHH Q lcl|NC_015159. 1 MAEV------EKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYT--IPSVFPSATA----DGSTSYTTPWQSIGARGLN 68 (532) Q Consensus 1 m~~~------~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~--~P~~~~~~~~----~~~~~~~~~~dst~~~a~~ 68 (532) |++. ...-.+.+.+.+.++..+. |-....+++++|+-- +..+-..... .-.+...++..+-+...++ T Consensus 13 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd 91 (474) T protein:vir:94 13 YGEEVVEQLKPQFETQEEMIVRLIDDHRK-QLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVD 91 (474) T ss_pred hhhHHHHhhhhcccCHHHHHHHHHHHHHH-HHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHH Confidence 2110 0111234556666655554 445555666665421 1111111111 1112244677777888888 Q ss_pred HHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeee Q lcl|NC_015159. 69 NLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLY 148 (532) Q Consensus 69 ~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~ 148 (532) ..++.|++ -| +.+...|.. +.+.| ..+..+||...+.++.++...+|.|.++ T Consensus 92 ~~~~~l~g--~p-----~~~~~~d~~-------------~~~~l--------~~~~~n~~~~~~~e~~~~~~~~G~~~~~ 143 (474) T protein:vir:94 92 QKVSYVAS--KP-----VTYSCEDEN-------------VLKVI--------HDVLDTRWDNKLIDILTATSNKGIDWLQ 143 (474) T ss_pred HHHhhhhc--CC-----ceeccCcHH-------------HHHHH--------HHHHhccHHHHHHHHHHHHhhcCceEEE Confidence 88877754 22 223333322 22222 1223578999999999999999999887 Q ss_pred ecccccccCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEE---Ee Q lcl|NC_015159. 149 IPSTEQVEGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHV---YR 223 (532) Q Consensus 149 v~~~~~~~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v---~~ 223 (532) +..++ ++.+++.+++..+.++..|. .+++...+|.+... ....+++|+.- +. T Consensus 144 ~~~d~---~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~--------------------~~~~~~~yt~~~~~~y 200 (474) T protein:vir:94 144 VYINE---NGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFN--------------------NEEKVEFWTDTTVTYY 200 (474) T ss_pred EEecC---CCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEec--------------------CeEEEEEEeCCeEEEE Confidence 76543 34577888888776655554 57888777766421 01233444211 11 Q ss_pred eCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Q lcl|NC_015159. 224 DPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFF 303 (532) Q Consensus 224 ~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~l 303 (532) ..++..+......+...+.......++..+|++.++. +.+|.|=.+...+-+..+|.+.-......+....|.++ T Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv 275 (474) T protein:vir:94 201 VLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKN-----NPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYI 275 (474) T ss_pred EEcCCccccccccCcCcccccccccCCCccceEEecC-----CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceee Confidence 1111112222211111111112234677889887654 46899999999999999999988889889999998877 Q ss_pred ecCccccChhhhc-cCCCc-eeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcc-cCCCCCCCHHHHHH Q lcl|NC_015159. 304 VNPNGVTQIRRVA-KANTG-DFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAV-QRGGDRVTAEEIRY 380 (532) Q Consensus 304 v~~~g~~~~~~~~-~~~~G-~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~-~~~~~~~TAtEi~~ 380 (532) +..-..-....+. +...+ .+.....+++..+ ....+.......++.++..|.+.-..-.+. ...+...|+..+.. T Consensus 276 ~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~ 353 (474) T protein:vir:94 276 LKGYEGEDLEEFMRGLKYYKAINVDGDGGVETI--QVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKF 353 (474) T ss_pred eecCCcccchhhhhhhhccceeeccCCCceeEE--eecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHH Confidence 6532222212211 11122 2222233334333 333456666777777777665432211111 11223345544332 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccc--ccceeecchHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_015159. 381 VAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEA--VEPAIATGLEALGRGHDLNKLNVFIDYMIKLAG 458 (532) Q Consensus 381 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~--~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p 458 (532) +....... ..+ ....+...+.+++.++.+.. ....+. +++.+ ++-.|..-++.++.+ .+. + T Consensus 354 ~~~~l~~k----~~~-k~~~~~~~l~~~~~li~~~~---~~~~d~~~i~v~f-~~~~p~~~~e~a~~~-------~~~-g 416 (474) T protein:vir:94 354 LYGNLDLK----ANK-LKNKATVAIQELISFIIDFN---NLKTDVKDIEISF-NFNRMMNDAEQSQII-------AQS-Q 416 (474) T ss_pred HHHHHHHH----HHH-HHHHHHHHHHHHHHHHHHHh---CCCcccceeeEEe-ccCcccCHHHHHHHH-------HHc-C Confidence 22211111 111 11223333344444433311 112222 33333 222222122222221 111 1 Q ss_pred hhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 459 LQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 459 ~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .+....++.. ++ ++.+ ++|++...++++.++. . ..... ........+..+.... T Consensus 417 ----~iS~et~l~~----l~-----~v~D~~~E~eri~~E~~~~~~--~----~~~~~-~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:94 417 ----YLSRETLVKS----SP-----LVDDYKAELERIEQEQMEYNK--Q----LPNLD-DGGADGAQQQEGSNNK 471 (474) T ss_pred ----CCCHHHHHHh----CC-----CCCCHHHHHHHHHHHHHHHHh--h----ccccC-CCCCCCcccCCCCccc Confidence 1233333332 22 1222 2344333332221111 1 11111 1001111111111111 No 103 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=98.20 E-value=2.8e-06 Score=51.05 Aligned_cols=436 Identities=9% Similarity=-0.001 Sum_probs=194.8 Q ss_pred CCCC------CCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhh--cccccCCCCC----cccccccccccchHHHHHH Q lcl|NC_015159. 1 MAEV------EKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYT--IPSVFPSATA----DGSTSYTTPWQSIGARGLN 68 (532) Q Consensus 1 m~~~------~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~--~P~~~~~~~~----~~~~~~~~~~dst~~~a~~ 68 (532) |++. ...-.+.+.+.+.++..+. |-....+++++|+-- +..+-..... .-.+...++..+-+...++ T Consensus 13 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd 91 (474) T protein:vir:97 13 YGEEVVEQLKPQFETQEEMIVRLIDDHRK-QLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVD 91 (474) T ss_pred hhhHHHHhhhhcccCHHHHHHHHHHHHHH-HHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHH Confidence 2110 0111234556666655554 445555666665421 1111111111 1112244677777888888 Q ss_pred HHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeee Q lcl|NC_015159. 69 NLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLY 148 (532) Q Consensus 69 ~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~ 148 (532) ..++.|++ -| +.+...|.. +.+.| ..+..+||...+.++.++...+|.|.++ T Consensus 92 ~~~~~l~g--~p-----~~~~~~d~~-------------~~~~l--------~~~~~n~~~~~~~e~~~~~~~~G~~~~~ 143 (474) T protein:vir:97 92 QKVSYVAS--KP-----VTYSCEDEN-------------VLKVI--------HDVLDTRWDNKLIDILTATSNKGIDWLQ 143 (474) T ss_pred HHHhhhhc--CC-----ceeccCcHH-------------HHHHH--------HHHHhccHHHHHHHHHHHHhhcCceEEE Confidence 88877754 22 223333322 22222 1223578999999999999999999887 Q ss_pred ecccccccCCcceEEEEecceEEEeeCC--CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEE---Ee Q lcl|NC_015159. 149 IPSTEQVEGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHV---YR 223 (532) Q Consensus 149 v~~~~~~~~~~~~~~~~pl~~~~v~~d~--~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v---~~ 223 (532) +..++ ++.+++.+++..+.++..|. .+++...+|.+... ....+++|+.- +. T Consensus 144 ~~~d~---~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~--------------------~~~~~~~yt~~~~~~y 200 (474) T protein:vir:97 144 VYINE---NGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFN--------------------NEEKVEFWTDTTVTYY 200 (474) T ss_pred EEecC---CCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEec--------------------CeEEEEEEeCCeEEEE Confidence 76543 34577888888776655554 57888777766421 01233444211 11 Q ss_pred eCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Q lcl|NC_015159. 224 DPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFF 303 (532) Q Consensus 224 ~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~l 303 (532) ..++..+......+...+.......++..+|++.++. +.+|.|=.+...+-+..+|.+.-......+....|.++ T Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv 275 (474) T protein:vir:97 201 VLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKN-----NPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYI 275 (474) T ss_pred EEcCCccccccccCcCcccccccccCCCccceEEecC-----CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceee Confidence 1111112222211111111112234677889887654 46899999999999999999988889889999998877 Q ss_pred ecCccccChhhhc-cCCCc-eeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcc-cCCCCCCCHHHHHH Q lcl|NC_015159. 304 VNPNGVTQIRRVA-KANTG-DFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAV-QRGGDRVTAEEIRY 380 (532) Q Consensus 304 v~~~g~~~~~~~~-~~~~G-~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~-~~~~~~~TAtEi~~ 380 (532) +..-..-....+. +...+ .+.....+++..+ ....+.......++.++..|.+.-..-.+. ...+...|+..+.. T Consensus 276 ~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~ 353 (474) T protein:vir:97 276 LKGYEGEDLEEFMRGLKYYKAINVDGDGGVETI--QVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKF 353 (474) T ss_pred eecCCcccchhhhhhhhccceeeccCCCceeEE--eecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHH Confidence 6532222212211 11122 2222233334333 333456666777777777665432211111 11223345544332 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccc--ccceeecchHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_015159. 381 VAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEA--VEPAIATGLEALGRGHDLNKLNVFIDYMIKLAG 458 (532) Q Consensus 381 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~--~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p 458 (532) +....... ..+ ....+...+.+++.++.+.. ....+. +++.+ ++-.|..-++.++.+ .+. + T Consensus 354 ~~~~l~~k----~~~-k~~~~~~~l~~~~~li~~~~---~~~~d~~~i~v~f-~~~~p~~~~e~a~~~-------~~~-g 416 (474) T protein:vir:97 354 LYGNLDLK----ANK-LKNKATVAIQELISFIIDFN---NLKTDVKDIEISF-NFNRMMNDAEQSQII-------AQS-Q 416 (474) T ss_pred HHHHHHHH----HHH-HHHHHHHHHHHHHHHHHHHh---CCCcccceeeEEe-ccCcccCHHHHHHHH-------HHc-C Confidence 22211111 111 11223333344444433311 112222 33333 222222122222221 111 1 Q ss_pred hhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 459 LQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 459 ~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .+....++.. ++ ++.+ ++|++...++++.++. . ..... ........+..+.... T Consensus 417 ----~iS~et~l~~----l~-----~v~D~~~E~eri~~E~~~~~~--~----~~~~~-~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:97 417 ----YLSRETLVKS----SP-----LVDDYKAELERIEQEQMEYNK--Q----LPNLD-DGGADGAQQQEGSNNK 471 (474) T ss_pred ----CCCHHHHHHh----CC-----CCCCHHHHHHHHHHHHHHHHh--h----ccccC-CCCCCCcccCCCCccc Confidence 1233333332 22 1222 2344333332221111 1 11111 1001111111111111 No 104 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.19 E-value=2.8e-06 Score=51.03 Aligned_cols=405 Identities=11% Similarity=0.000 Sum_probs=168.5 Q ss_pred hcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHH Q lcl|NC_015159. 39 TIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERI 118 (532) Q Consensus 39 ~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~ 118 (532) .+|.-....-.. ...+...+-+..++++++..|. +.+ |+ .+|.+.. . . T Consensus 1 ~l~~~~~~~~~~---~~~~~v~n~~~~ivd~~~~~l~----~~g---f~--~~d~~~~---------~-----------~ 48 (434) T protein:vir:98 1 MLPKNAEQAFLD---FQRKARTNFCGLIANASVHRLL----ALG---VT--GPDGEPD---------T-----------R 48 (434) T ss_pred CCCCCccHHHHH---hhhhhhccchHHHHHHHHhhhc----cCc---ee--cCCCchH---------H-----------H Confidence 344322111111 1122344566777777777653 323 33 2222111 1 1 Q ss_pred HHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccc----ccCCcceEEEEecceEEEeeCC-CCCeEEEEEEEeecHHH Q lcl|NC_015159. 119 CMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQ----VEGQSNAPKLYKLHNFVVERDA-YDNVLQIVTEDKIARAA 193 (532) Q Consensus 119 ~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~----~~~~~~~~~~~pl~~~~v~~d~-~G~vd~i~rk~~~~~~~ 193 (532) +.+.+.+++|.....+++++..+||.+.++|..+.. +......+++++..+..+..|. .+++...++.+....+ T Consensus 49 ~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~- 127 (434) T protein:vir:98 49 ASRWWQANRLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDID- 127 (434) T ss_pred HHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccC- Confidence 224556789999999999999999999887754321 1112334777777665555554 4566555544331111 Q ss_pred hhHHHHHHHHhhcccCCCcceEEEEEEEEe----eCCCCe--EEEEEEEcCcccccccccCccccCceEEEEeeecCCCc Q lcl|NC_015159. 194 LPEDVRKSLEEAQGDQNPSEEVTIYTHVYR----DPEAMV--FRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNED 267 (532) Q Consensus 194 l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~----~~~~~~--~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~ 267 (532) +.....+.+++.++. ...+.. +.+.-++-... ......++|..+|++.+.-+...++ T Consensus 128 ---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~h~~g~vPvv~f~N~~~~~~- 190 (434) T protein:vir:98 128 ---------------GFGYARVFFDDTSFPYRTRERTGARLPWGPDSWVYTGT-ADSGDVHDLGGMQLVEFARMPDLGE- 190 (434) T ss_pred ---------------CceEEEEEEeCcEEEEEEeeccccccccccccceeccc-ccccccCCCCccceEEeccCCCcCc- Confidence 111222322222211 111111 11111111111 1112345788999998876666554 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCcc----------ccChhhhccCCCceeecCccccccccccC Q lcl|NC_015159. 268 YGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNG----------VTQIRRVAKANTGDFVAGRKQDVEVFQLE 337 (532) Q Consensus 268 YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g----------~~~~~~~~~~~~G~~v~g~~~~~~~~~~~ 337 (532) +|+|=.+..++.+..++...-..+..++..+.|...+.-.. ............|.+..-...++...+++ T Consensus 191 ~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~ 270 (434) T protein:vir:98 191 DPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKFAKRTDPATGMTVVDQPFVPSPSAVWASEGENTQFGQLD 270 (434) T ss_pred CCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccccchhhhhhhccccccccCCCCCceEEEec Confidence 79998899999999999999898889998888865542100 00000111111222211111223233322 Q ss_pred CccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc- Q lcl|NC_015159. 338 KYNDFQVAKATADDIEKRLSYAF-MLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQAT- 415 (532) Q Consensus 338 ~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~- 415 (532) .++++.....+..+.+.|...= +-+.....+..+.++.-+......+... ..+.+. .+.+-+.+.+.++.+. T Consensus 271 -~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k----~~~k~~-~f~~~l~~~~rl~~~~~ 344 (434) T protein:vir:98 271 -ATDLSGFLKEHASDVRDMLTISQTPTYLYATDLVNISADTIGALDILHVAK----VREHIA-SFSEGLESVLALAAAQA 344 (434) T ss_pred -CcchHHHHHHHHHHHHHHhcccCCCHHHhccccCChHHHHHHHHHHHHHHH----HHHHHH-HHHHHHHHHHHHHHHhc Confidence 2234333333333333332110 0000001122234555554433333222 222222 2222233444443332 Q ss_pred CCCCCCcccc--ccceeec--chHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHH Q lcl|NC_015159. 416 SKIPNLPKEA--VEPAIAT--GLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDK 491 (532) Q Consensus 416 g~lp~~p~~~--~~~~~v~--~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~ 491 (532) |. +.+. +++.+.. +-+.+..++-+.++. +. .+ | . + .+...+|.++ +|+ T Consensus 345 g~----~~~~~~~~v~w~~~~~~s~~~~ada~~kl~---~~--g~-~-------~-e---~~~~~lg~~~-------~e~ 396 (434) T protein:vir:98 345 GV----PEDYTEAEVRWANPAHVTMAVKADAATKLK---SI--GY-P-------L-D---VIAEELDESP-------ARV 396 (434) T ss_pred CC----ChhheeeeEEecCCCCCCHHHHHHHHHHHH---hc--CC-c-------H-H---HHHHhCCCCH-------HHH Confidence 22 2233 3333321 222233333222221 11 11 1 1 2 2334567643 444 Q ss_pred HHHHHHHHHHHHHHH-HHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 492 QAKMAEASTAAGMVT-AGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 492 ~~~~~q~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) +...+++..++...+ ...+.+.+ .++ ..+..|=+|- T Consensus 397 ~r~~~e~~~~~~~~~~~~~~~~~~----~~g-~~~~~~~~~d 433 (434) T protein:vir:98 397 RRIVAGAASQALLAASLLPAPGAP----SAG-NVPDSGGAVD 433 (434) T ss_pred HHHHHHHHHHHHHHHhhhccCCCC----CCC-CCCcccCCCC Confidence 433322222111111 11111111 011 1111222222 No 105 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=98.17 E-value=3.3e-06 Score=50.67 Aligned_cols=406 Identities=11% Similarity=0.035 Sum_probs=182.9 Q ss_pred HHHHHHHHhhhHHHHHHHHHHhhccc---ccCCC-CCcccccccccccchHHHHHHHHHHHHHHhhcCCCCCccccCCCh Q lcl|NC_015159. 17 AYNRLKNDRGAYETRAEDCATYTIPS---VFPSA-TADGSTSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSE 92 (532) Q Consensus 17 r~~~lk~~R~~~e~~w~e~~~~~~P~---~~~~~-~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d 92 (532) .....+++| .++++.+.+|..-. ..... .....+...++..+-+...+++.++.|++- |+. | ...+ T Consensus 1 ~~~~~~~~~---~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~--~~~--~---~~~~ 70 (440) T protein:vir:95 1 MLAAFLGSQ---KQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGN--PVS--I---GVME 70 (440) T ss_pred ChhhHHHHH---HHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheecc--Cce--E---eeCC Confidence 233333333 44556666664421 11111 111122344666677777777766665432 222 1 2222 Q ss_pred HHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecceEEE Q lcl|NC_015159. 93 LEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVV 172 (532) Q Consensus 93 ~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v 172 (532) .... +.. ..+.+.+..++|.....++.++..+||.+.+++..++ ++.+.++.++..+.++ T Consensus 71 ~~~~----------~~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~---~~~~~i~~~~p~~~~~ 130 (440) T protein:vir:95 71 GGSA----------DQL-------STIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDK---DKVDRVVLISPLEMFV 130 (440) T ss_pred CccH----------HHH-------HHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecC---CCceEEEEEcccceEE Confidence 1111 111 1234667788999999999999999999988876542 3456788888887777 Q ss_pred eeCCC--CCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEE--cCcccccccccC Q lcl|NC_015159. 173 ERDAY--DNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEI--DGEIVAGTEGEY 248 (532) Q Consensus 173 ~~d~~--G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~--~~~~~~~~~~~~ 248 (532) ..|+. +++...+|..... ....++||+. +. -+...... .+.......... T Consensus 131 ~~d~~~~~~~~~~i~~~~~~--------------------~~~~~~vyt~-----~~-~~~~~~~~~~~~~~~~~~~~~~ 184 (440) T protein:vir:95 131 IRDLTVEQNIIAAVHLPIYA--------------------DKVNMTVYTK-----DK-VITYKPYSNNSVRLVVDDVKKH 184 (440) T ss_pred EEcCCCCCceEEEEEEEEec--------------------CceEEEEEeC-----Ce-EEEEEEecCCccceeecceeec Confidence 66664 4566555544311 0122334321 10 01000000 011111112235 Q ss_pred ccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCc---cccChhhhccCC-Cceee Q lcl|NC_015159. 249 PLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPN---GVTQIRRVAKAN-TGDFV 324 (532) Q Consensus 249 g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~---g~~~~~~~~~~~-~G~~v 324 (532) ++..+|++.++. +.+|.|=.+...+-+..++.+.-......+....|.+++.-. ....++...... .+.+. T Consensus 185 ~~g~vPvv~~~n-----~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~ 259 (440) T protein:vir:95 185 SYNDVPVVEWWN-----NRFRMGDYESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLF 259 (440) T ss_pred cCceeeEEEeeC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccchhhhhhcccee Confidence 677899987664 457999999999999999999999999999988887665311 111222221111 11111 Q ss_pred -c--------CccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcc-cCCCCCCCHHHHHHH-------HHHHHH Q lcl|NC_015159. 325 -A--------GRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAV-QRGGDRVTAEEIRYV-------AGELED 387 (532) Q Consensus 325 -~--------g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~-~~~~~~~TAtEi~~r-------~~E~~~ 387 (532) + +..+++. .+....+.+.....++.++..|...-..-... ..-+...|+..+..+ ++++.. T Consensus 260 ~~~~~~~~~~~~~~~~~--~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~ 337 (440) T protein:vir:95 260 LKTGISTTGQQTTADAS--YIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQVRKDKET 337 (440) T ss_pred cccccccccCCCCccee--EEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHH Confidence 1 1112222 22233455666677777777664432110010 011234566655433 333333 Q ss_pred HhhhhHHHHHHHHHHHHHHHHHHHHHhc---CCCCCCccccccceeec--chHHHHHHHHHHHHHHHHHHHHhhcchhhh Q lcl|NC_015159. 388 TLGGVYSLLSQELQLPLVKILLKELQAT---SKIPNLPKEAVEPAIAT--GLEALGRGHDLNKLNVFIDYMIKLAGLQDD 462 (532) Q Consensus 388 ~LGpv~~rl~~E~l~Pli~r~~~il~r~---g~lp~~p~~~~~~~~v~--~l~~l~raq~~~~l~~~~~~laq~~p~~~d 462 (532) .++..+ .+++.++.+. ..-.......+++.+.- +.+.+..++-+.++ +.+ T Consensus 338 ~~~~~l------------~~~~~li~~~~~~~~~~~~~~~~v~i~f~~~~p~~~~~~ad~~~kl-------~g~------ 392 (440) T protein:vir:95 338 YFTKAL------------RRRYELISNIHKAINGPVIEANKLTFTFHPNIPQDVWTEIKAYIEA-------GGE------ 392 (440) T ss_pred HHHHHH------------HHHHHHHHHHHhhcCCcccccccceEEeCCCCCCCHHHHHHHHHHH-------hcc------ Confidence 333322 3333332221 00011222233444322 22223323222222 222 Q ss_pred hcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCC Q lcl|NC_015159. 463 DINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPT 531 (532) Q Consensus 463 ~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 531 (532) +....++..+ -+++ .++|++...+++...+. ...+..+ .....++-++ T Consensus 393 -iS~et~~~~l---~~~d------~~~E~~ri~~E~~~~~~--~~~~~~~---------~~~~~~~~~e 440 (440) T protein:vir:95 393 -ISQETLMENA---SFTD------YKTEHSRILKQGGSSDL--EIGQIVG---------DADVGQADTE 440 (440) T ss_pred -CcHHHHHHhC---CCCC------cHHHHHHHHHHHHHhhh--hHHhhcc---------CCCCCCcCCC Confidence 2223333322 1232 23455444333322111 1111111 1111112222 No 106 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=98.17 E-value=3.3e-06 Score=50.66 Aligned_cols=429 Identities=11% Similarity=0.053 Sum_probs=183.7 Q ss_pred CCCCC-------CCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccccc-ccccccchHHHHHHHHHH Q lcl|NC_015159. 1 MAEVE-------KTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTS-YTTPWQSIGARGLNNLAS 72 (532) Q Consensus 1 m~~~~-------~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~-~~~~~dst~~~a~~~Laa 72 (532) |.+.- +..++. .+|++ ...|+.+|.=--|...... ..+... ..++--..+...++.+|+ T Consensus 20 ~~~~~~~i~d~~~i~~~~----~~~~~--------i~~~~~~Y~g~~~~l~~~~-~~~~~~~~~~~slnl~~~i~~~~A~ 86 (505) T protein:vir:79 20 MTKSLGQIIDDPRINLPA----DEVER--------IARDKRYYMDDFKQVTHKN-SYGDTQKHELQSVNVTKLASAKLAS 86 (505) T ss_pred chhhhhhhhcccCCCCCH----HHHHH--------HHHHHHHhcCCCccccccc-cCCCccccceeecchHHHHHHHHHh Confidence 22221 111111 12222 2345555432122111111 111111 111112455666666666 Q ss_pred HHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_015159. 73 KLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPST 152 (532) Q Consensus 73 ~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~ 152 (532) -|.+-. | +++++|. +..++|+ +.+..++|+..+.+++.+..+.|.+++.+-.+ T Consensus 87 ll~~e~--~-----~i~~~d~-------------~~~e~l~-------~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D 139 (505) T protein:vir:79 87 LIFNEQ--C-----QVTVSDE-------------TANDFLD-------DVFQQNDFYTTFEEKLEEWIALGSGCVRPYVD 139 (505) T ss_pred hhcCCC--c-----eeecCCh-------------HHHHHHH-------HHHHhccHHHHHHHHHHHHhhcCCeEEEEEEe Confidence 544432 1 2333331 2334443 56778899999999999999999998854433 Q ss_pred ccccCCcceEEEEecceEE-EeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeC-CCCeE Q lcl|NC_015159. 153 EQVEGQSNAPKLYKLHNFV-VERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDP-EAMVF 230 (532) Q Consensus 153 ~~~~~~~~~~~~~pl~~~~-v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~-~~~~~ 230 (532) .+.+.+..++...++ +..|..+....+|..+. +. ..++ +-.+|+.++... ++.+| T Consensus 140 ----~~~~~i~~v~ad~~~P~~~d~~~~~~~a~~~~~-~~---------------~~~~---~~~~yt~lE~h~~~~~~~ 196 (505) T protein:vir:79 140 ----SGKIKLAWATADQVYPLQADTNQVNELAIASRT-TE---------------VENH---RTIYYTLLEFHQWDHGDY 196 (505) T ss_pred ----CCceEEEEEcCCeeEEEEEcCCCeEEEEEEEEE-EE---------------ecCC---cceEEEEEEEEEecCceE Confidence 245678888988866 45666554443333221 10 0011 111233332211 11223 Q ss_pred EEEEEE-c-------Cccccc--------cc---ccCccccCceEEEE---ee-ecCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_015159. 231 RSYQEI-D-------GEIVAG--------TE---GEYPLDSCPWIPVR---LI-KMPNEDYGRSFVEEYLGDLKSLENLY 287 (532) Q Consensus 231 ~s~~~~-~-------~~~~~~--------~~---~~~g~~~~P~~~~R---w~-~~~g~~YG~Gp~~~al~d~~~L~~l~ 287 (532) .+.+.. . |..++. .+ ...|...-+|..++ ++ ...++.+|+|-...+.+.+..||..- T Consensus 197 ~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~ 276 (505) T protein:vir:79 197 VITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTH 276 (505) T ss_pred EEEEEEEecCCCCccCcccchhhcccccccCcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHH Confidence 322211 0 111100 00 01122222333322 22 24467899999999999999999877 Q ss_pred HHHHHHHHHHhcCceeecCccccChhhhc------------cCCCceeec--CccccccccccCCccchhHHHHHHHHHH Q lcl|NC_015159. 288 EAIVKMSMISSKVLFFVNPNGVTQIRRVA------------KANTGDFVA--GRKQDVEVFQLEKYNDFQVAKATADDIE 353 (532) Q Consensus 288 ~~~l~~~~~a~~p~~lv~~~g~~~~~~~~------------~~~~G~~v~--g~~~~~~~~~~~~~~~~~~~~~~i~~~~ 353 (532) -+.....+ ..+....|+++ ++...... ....-.+.. +..+......+...-+...-...++.+- T Consensus 277 s~~~~e~~-~g~~~i~v~~~-~l~~~~~~~~~~~~~~~~~fd~~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l 354 (505) T protein:vir:79 277 DQFVDEVK-KGQRRLIVPAE-WLKTGSSYGGQASETHPPMFDPDETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFL 354 (505) T ss_pred HHHHHHHH-hcccceeechH-HhcccCCCCcccccccccCCCccceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHH Confidence 66666544 45555555432 22111000 000000111 1111111111111111122233344444 Q ss_pred HHHHHHH--hhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCC--------CCcc Q lcl|NC_015159. 354 KRLSYAF--MLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIP--------NLPK 423 (532) Q Consensus 354 ~rI~~af--~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp--------~~p~ 423 (532) ++|.... =...+...+....|||||..+.+...+...-.-..+ ...|..|++.++.+..-.+..+ +++. T Consensus 355 ~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l~~t~~~~~~~~-~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~ 433 (505) T protein:vir:79 355 REFENQTGLSQGTFTTSPSGIQTATEVVTNNSQTYQTRSSYITQV-EKTIKALTYAILELASVPSFYADGQARWTGDVDS 433 (505) T ss_pred HHHHHHhCCChhhcCCCccccchHHHHHHHHhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhcccccccccccCCCCc Confidence 4443321 111122223334699999999999988888754444 5577888888877765544322 2222 Q ss_pred ccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHH Q lcl|NC_015159. 424 EAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAG 503 (532) Q Consensus 424 ~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~ 503 (532) ..+.+.+-.++. .-+...++..++.++. .+ +....++ ....|+ |++|++++.++.+..+. T Consensus 434 ~~i~v~f~d~i~-~d~~~~~~~~~~~v~~--Gi-------~s~e~~l---~~~~~~-------~eeea~~el~ri~~E~~ 493 (505) T protein:vir:79 434 LDITINFNDGVF-VDQESKRAADLQAVQA--QV-------MPKKQFL---MRNYGL-------DEEEADEWLAQIDAENS 493 (505) T ss_pred eeEEEEeCCCCC-CCHHHHHHHHHHHHHc--CC-------CCHHHHH---HhcCCC-------ChHHHHHHHHHHHHhcc Confidence 223333322211 1112222222222211 11 1222222 334565 44555555444433221 Q ss_pred HHHHHHhhhHHHHHHHH Q lcl|NC_015159. 504 MVTAGQQMGAAGGQAAA 520 (532) Q Consensus 504 ~~~~~~~~~~~~~~~~~ 520 (532) .+.+...+.++- T Consensus 494 -----~~~p~~~~~gg~ 505 (505) T protein:vir:79 494 -----TAEPEFNQFGGD 505 (505) T ss_pred -----ccCCCchhccCC Confidence 111222211111 No 107 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=98.10 E-value=4.7e-06 Score=49.80 Aligned_cols=435 Identities=8% Similarity=-0.027 Sum_probs=194.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccC-CCCCcccccccccccchHHHHHHHHHHHHHHhhc Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFP-SATADGSTSYTTPWQSIGARGLNNLASKLMLALF 79 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~-~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 79 (532) |...+ .+..+.+.+..+..+..+. ++++.+.+|..-.--. ....+..+...++..+-+...++..++.|++- T Consensus 19 ~~~~~--~~~~~~i~~~i~~~~~~~~---~~~~~l~~Yy~g~~~i~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~-- 91 (470) T protein:vir:99 19 FPKGE--KLTSNELLGFIAYNETVLK---PRYRENMKLYLGKHKILTAPEKETGADNRIVVNSAKYVVDVYNGYFCGI-- 91 (470) T ss_pred eCCCC--CcCHHHHHHHHHHHHHhhH---HHHHHHHHHhccccccccCcccccCCcceeecchHHHHHHHHhhhhccC-- Confidence 54333 3445566666555554443 4555566655432100 00111122234566667777777777665432 Q ss_pred CCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCc Q lcl|NC_015159. 80 PVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQS 159 (532) Q Consensus 80 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~ 159 (532) | +++...+.. .. ...+.+.+..++|.....++.++..++|.+.+++..++ .+. T Consensus 92 p-----~~~~~~~d~------------~~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~---dg~ 144 (470) T protein:vir:99 92 E-----PKLALLNDS------------SK-------IDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQGE---DAR 144 (470) T ss_pred C-----eeEeeCCch------------hH-------HHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCC---CCe Confidence 2 112222110 00 11233556778999999999999999999988775543 345 Q ss_pred ceEEEEecceEEEeeCCCCC--eEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEc Q lcl|NC_015159. 160 NAPKLYKLHNFVVERDAYDN--VLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEID 237 (532) Q Consensus 160 ~~~~~~pl~~~~v~~d~~G~--vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~ 237 (532) +++..++..+.++..|..+. +...+|.+... . +.....|-.++.+.. -| ++... T Consensus 145 ~~i~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~-------------------~-~~~~~~~~~~~~~~~--~~--~~~~~ 200 (470) T protein:vir:99 145 PHLMYSSPNHAFIIYDDTVQRQPLAFVHYQIDN-------------------S-NNWTDAYGVIQYADK--FY--KFKGY 200 (470) T ss_pred EEEEEEccceeEEEEcCCCCcceEEEEEEEEEe-------------------c-CCeeEEEEEEEecCe--EE--EEEec Confidence 67888888887666666543 44444333311 0 111111112222111 11 11111 Q ss_pred C-c-cc-ccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccCh-- Q lcl|NC_015159. 238 G-E-IV-AGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQI-- 312 (532) Q Consensus 238 ~-~-~~-~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~-- 312 (532) + . .. .......++..+|++..+- +.+|+|=.+..++-+..++.+.-...........|.+.+.-.+.... T Consensus 201 ~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~ 275 (470) T protein:vir:99 201 DIEEDTNAAGYAINPYGLVPAVEFFE-----NEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDE 275 (470) T ss_pred ccccccccccccccCCCccceEeecC-----CCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccc Confidence 1 1 01 1122345677889876543 46899999999999999999988888889999998877653211110 Q ss_pred -hhhccCC-Ccee-ecCcc--ccccccccCCccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCCCCHHHHHHHHHHHH Q lcl|NC_015159. 313 -RRVAKAN-TGDF-VAGRK--QDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM-LNSAVQRGGDRVTAEEIRYVAGELE 386 (532) Q Consensus 313 -~~~~~~~-~G~~-v~g~~--~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~~~TAtEi~~r~~E~~ 386 (532) +.+.... .+.+ +++.. .+..+..+....+.......++.+.+.|...-. .+......+...|+..+..+..-.. T Consensus 276 g~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~ 355 (470) T protein:vir:99 276 GNPKFDFKNNRVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMK 355 (470) T ss_pred cchhhhhhhcceeeecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHH Confidence 1111111 1222 22111 111122233333455556666666666643211 1111111123356666554433222 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceee--cchHHHHHHHHHHHHHHHHHHHHhhcchhhhhc Q lcl|NC_015159. 387 DTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIA--TGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDI 464 (532) Q Consensus 387 ~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v--~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~i 464 (532) .. .--..+.-.+.+.-+++.++.++...+..+ .....+++.+. .+.+.+..++-+.++ +.+ + T Consensus 356 ~k-~~~~~~~~~~~l~~~~~li~~~~~~~~~~~-~~~~~i~v~f~~~~p~~~~e~a~~~~kl-------~gi-------i 419 (470) T protein:vir:99 356 NK-ADSKERKFDKSLMQLYRIVLATLFNNKQDQ-ELWSELDFKFTRNLPEDMASAIDNAKNA-------EGI-------V 419 (470) T ss_pred HH-HHHHHHHHHHHHHHHHHHHHHHHhccCCcc-cccccceEEeCCCCCcCHHHHHHHHHHH-------hcc-------C Confidence 22 222223333333333333334443333211 11122333332 233444444433332 111 1 Q ss_pred CHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 465 NLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 465 d~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) ....++..+ -+|+| ++|++...+++..+.+ .++. +.+ ........+-.++ T Consensus 420 s~et~l~~l---~~vd~------~~E~eri~~E~~~~~~-~~~~-~~~-------~~d~~~~d~~~ee 469 (470) T protein:vir:99 420 SKKTQLGMI---PDIEP------DAEMKQIAKEKADAIK-QTQQ-LSM-------PIDILKRDNNAEE 469 (470) T ss_pred CHHHHHHhC---CCCCH------HHHHHHHHHHHHHHHH-HHHh-hcC-------CCCcCCCCCCccC Confidence 122333222 23332 2444433333222111 1111 111 0111122222222 No 108 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=98.06 E-value=5.7e-06 Score=49.33 Aligned_cols=379 Identities=12% Similarity=0.021 Sum_probs=182.3 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccC-CCC--Cccc-ccccccccchHHHHHHHHHHHHHHhhcCCCCC Q lcl|NC_015159. 9 FAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFP-SAT--ADGS-TSYTTPWQSIGARGLNNLASKLMLALFPVGSS 84 (532) Q Consensus 9 ~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~-~~~--~~~~-~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~ 84 (532) ++.+.+....+++..++ ++.+.+.+|..=.... .-+ -+.. +..-+..-+-+..++++||..|. ..+ T Consensus 1 ~~~~~i~~L~~~~~~~~----~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~----~~G-- 70 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHK----RRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLV----FRE-- 70 (409) T ss_pred CCHHHHHHHHHHHHHHh----HHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcc----cCc-- Confidence 77777766666665544 3333444443221110 111 1111 11223444566667777666432 112 Q ss_pred ccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcceEEE Q lcl|NC_015159. 85 FFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKL 164 (532) Q Consensus 85 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~ 164 (532) |+ ..|. + +++...+++|.....++.++..+||.+.++|-+++ .+..++++ T Consensus 71 -f~--~~d~-------------~-----------l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~---dg~~~i~~ 120 (409) T protein:vir:94 71 -FE--NDDF-------------T-----------VNEIFEENNPDIFFDSAVLSSLIASCSFTYISKGE---NDAVRLQV 120 (409) T ss_pred -cc--CCch-------------H-----------HHHHHHhcChhHHHHHHHHHHHHhcceeEEEecCC---CCceEEEE Confidence 11 1110 1 23456778899999999999999999999887653 23457788 Q ss_pred EecceEEEeeCCC-CCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEE-cCcccc Q lcl|NC_015159. 165 YKLHNFVVERDAY-DNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEI-DGEIVA 242 (532) Q Consensus 165 ~pl~~~~v~~d~~-G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~-~~~~~~ 242 (532) ++..+.++..|+. +++...++... + .+........+| . ++ .++++. ++.... T Consensus 121 ~sp~~~~~i~D~~~~~~~~a~~~~~---~--------------d~~~~~~~~~~~----~-~~----~~~~~~~~~~~~~ 174 (409) T protein:vir:94 121 IEAVNATGIIDPITGLLTEGYAVLE---R--------------DENNNVVLEAHF----L-PD----RTDYYYRDSRNNI 174 (409) T ss_pred eccceEEEEEecCCCceeeeEEEEE---e--------------cCCCceEEEEEE----e-cC----cEEEEEecCceeE Confidence 8887766666663 45554443321 0 000001111111 1 11 111111 111111 Q ss_pred cccccCccccCceEEEEeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHHhcCceee---cCccccChhhhccC Q lcl|NC_015159. 243 GTEGEYPLDSCPWIPVRLIKMPNEDYGRSFV-EEYLGDLKSLENLYEAIVKMSMISSKVLFFV---NPNGVTQIRRVAKA 318 (532) Q Consensus 243 ~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~-~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv---~~~g~~~~~~~~~~ 318 (532) . -..++..+|++.+..+...++.||+|-. +..++-+..+|...-..+..++....|...+ .+|+. +.+.... T Consensus 175 ~--~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~--~~~~~~~ 250 (409) T protein:vir:94 175 S--IANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAE--PMETWKA 250 (409) T ss_pred e--eeCCCCCcceEEeccccccccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCc--ccchhhh Confidence 1 1245678999999999899999999965 5688889999999999999999999997444 33332 1111222 Q ss_pred CCceee--cCcccc--ccccccCCccchhHHHHHHHHHHHHHHHHHhhhhccc----CCCCC-CCHHHHHH-------HH Q lcl|NC_015159. 319 NTGDFV--AGRKQD--VEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQ----RGGDR-VTAEEIRY-------VA 382 (532) Q Consensus 319 ~~G~~v--~g~~~~--~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~----~~~~~-~TAtEi~~-------r~ 382 (532) ..+.+. |.+.++ +..-++ ..++++. .++.++.-|+.......+.. ..... -+|.-|.+ ++ T Consensus 251 ~~~~i~~~~~d~dg~~~~v~q~-~~~~l~~---~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a 326 (409) T protein:vir:94 251 TVSSMLQFTKDEDGDKPTLGQF-TQPSMSP---FTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAG 326 (409) T ss_pred hHHHhhcCCCCCCCCCceEEec-CCCChhH---HHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHH Confidence 234443 222221 222222 2345553 34444444433222111100 01111 23333332 22 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceee----cchHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_015159. 383 GELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIA----TGLEALGRGHDLNKLNVFIDYMIKLAG 458 (532) Q Consensus 383 ~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v----~~l~~l~raq~~~~l~~~~~~laq~~p 458 (532) +++.+.+|.-+. -++...+.++ +-.+..+.+..+++++ .+-+..+.|+.++.+.-.++ ..| T Consensus 327 ~~k~~~fg~~~~--------~~~rla~~i~---~~~~~~~~~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~----ag~ 391 (409) T protein:vir:94 327 RKAQRSLGAGLL--------NVAYLAACLR---DDAPYLREQFRKTKPKWEPLFEADASMLSLIGDGAIKLNQ----AIP 391 (409) T ss_pred HHHHHHHHHHHH--------HHHHHHHHHh---CCCCccccccccceEEeccCCCcchHHHHHHHHHHHHHHH----hcc Confidence 333333333222 2233333332 3345556665554442 23344444554444433333 333 Q ss_pred hhhhhcCHHHHHHHHHHhcCCCHhH Q lcl|NC_015159. 459 LQDDDINLLDVKMRLANSLGMDTTG 483 (532) Q Consensus 459 ~~~d~id~d~~~~~~a~~~Gv~p~~ 483 (532) ..+ +.+ .+.+.+|.+... T Consensus 392 ~~~---~~~----~~~~~lG~~~~d 409 (409) T protein:vir:94 392 EFI---NKD----TIRDLTGIEGGE 409 (409) T ss_pred ccc---chh----HHHHHcCCCCCC Confidence 222 122 233445665554 No 109 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=97.99 E-value=8e-06 Score=48.53 Aligned_cols=392 Identities=11% Similarity=0.045 Sum_probs=182.7 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCC-CC--cc-cccccccccchHHHHHHHHHHHHHHhhcCCCCC Q lcl|NC_015159. 9 FAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSA-TA--DG-STSYTTPWQSIGARGLNNLASKLMLALFPVGSS 84 (532) Q Consensus 9 ~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~-~~--~~-~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~ 84 (532) ++...+...+.++...+ ++.+.+.+|..-..-... +. +. -+.+.+..-+-+..++++|+..|.- -+ T Consensus 1 m~~~~i~~L~~~~~~~~----~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~----~G-- 70 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFK----TGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRIIF----RE-- 70 (422) T ss_pred CChHHHHHHHHHHHHHH----HHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhcccc----ce-- Confidence 66655665555555544 344555555544321111 11 11 1112223334556666666653311 11 Q ss_pred ccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcceEEE Q lcl|NC_015159. 85 FFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKL 164 (532) Q Consensus 85 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~ 164 (532) | ..+|. + +++.+.++++....+++.++..+||.+.++|-.++. .+..++++ T Consensus 71 -f--~~~d~-------------~-----------l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~--~~~p~i~~ 121 (422) T protein:vir:97 71 -F--TNDDF-------------N-----------AWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAE--DGLPKMQV 121 (422) T ss_pred -e--eCCch-------------h-----------HHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCC--CCeeEEEE Confidence 1 11211 1 234566799999999999999999999999865432 23346777 Q ss_pred EecceEEEeeCC-CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCccccc Q lcl|NC_015159. 165 YKLHNFVVERDA-YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAG 243 (532) Q Consensus 165 ~pl~~~~v~~d~-~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~ 243 (532) ++..+..+..|+ .+++...+++... +.+..... ..++++. ..++.-++..... T Consensus 122 ~sp~~~~~i~D~~~~~~~~a~~~~~~--------------------~~~~~~~~-~~~~~~~-----~~~~~~~~~~~~~ 175 (422) T protein:vir:97 122 IEASKATGILDPTTFLLTEGYAILES--------------------DSNGNPTL-EAYFTDK-----DIWYYPKKGKPYN 175 (422) T ss_pred echhhEEEEEeCCCCcceeeEEEEEe--------------------cCCCcEEE-EEEEcCc-----eEEEEcCCCcccc Confidence 777666555565 3333333322211 01111111 1112211 1111112211111 Q ss_pred ccccCccccCceEEEEeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHHhcCceee---cCccccChhhhccCC Q lcl|NC_015159. 244 TEGEYPLDSCPWIPVRLIKMPNEDYGRSFV-EEYLGDLKSLENLYEAIVKMSMISSKVLFFV---NPNGVTQIRRVAKAN 319 (532) Q Consensus 244 ~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~-~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv---~~~g~~~~~~~~~~~ 319 (532) . ..+++.+|++++..+...++.||+|-. +..++-+..++...-..+..++....|...+ .++|. +.+..... T Consensus 176 ~--~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~--~~~~~~~~ 251 (422) T protein:vir:97 176 I--KNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDPDAK--PMEKWRAT 251 (422) T ss_pred c--cCCCCCcceEEecccCCCccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccCcccc--cCchhhhh Confidence 1 345567999999999999999999976 5688999999999999999999999988444 22222 11111222 Q ss_pred Ccee--ecCcccc--ccccccCCccchhHHHHHHHHHHHHHHHHHhhh-----hcccCCCCCCCHHHHHH-------HHH Q lcl|NC_015159. 320 TGDF--VAGRKQD--VEVFQLEKYNDFQVAKATADDIEKRLSYAFMLN-----SAVQRGGDRVTAEEIRY-------VAG 383 (532) Q Consensus 320 ~G~~--v~g~~~~--~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~-----~~~~~~~~~~TAtEi~~-------r~~ 383 (532) .|.+ ++.+.++ +..-++ ..++++.- ++.++.-|....... .+......+.+|.-|.+ +.+ T Consensus 252 ~~~i~~~~~de~~~~~~v~q~-~~~~l~~~---~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~ 327 (422) T protein:vir:97 252 VSTLLEISKDEDGDKPTVGQF-TTASMAPF---MEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGR 327 (422) T ss_pred hhhhhccCCCCCCCcceeeec-CCCChhHH---HHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHH Confidence 3343 2322221 222222 23345533 333333333322111 11111111123433332 233 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCccccccceee----cchHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_015159. 384 ELEDTLGGVYSLLSQELQLPLVKILLKELQA-TSKIPNLPKEAVEPAIA----TGLEALGRGHDLNKLNVFIDYMIKLAG 458 (532) Q Consensus 384 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~g~lp~~p~~~~~~~~v----~~l~~l~raq~~~~l~~~~~~laq~~p 458 (532) ++.+.+|.-+. +++.++.. .|..+..+.+..++.+. .+.+....+|.++.+.-.++ ..| T Consensus 328 ~k~~~fg~~l~------------~~~rla~~~~~~~~~~~~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~----a~~ 391 (422) T protein:vir:97 328 KAQRSFSSGFL------------NVAYIAVCLRDEFPYLRNQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQ----AIP 391 (422) T ss_pred HHHHHHHHHHH------------HHHHHHHHHhcCCcccchhhccceEEEccCCCCChHHHHHHHHHHHHHHh----hcc Confidence 34444444333 33333222 24455566665554442 24455555555544443333 222 Q ss_pred hhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHH Q lcl|NC_015159. 459 LQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAG 503 (532) Q Consensus 459 ~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~ 503 (532) -. .+.+. +.+.+|++.. ++|. .+.+++.+.. T Consensus 392 ~~---~~~~~----~~~~lg~~~~-----~~~~--~~~~~~~~d~ 422 (422) T protein:vir:97 392 GF---MDADV----IRDLTGVKGA-----DKPI--PAITEVTTDG 422 (422) T ss_pred cc---ccHHH----HHHHcCCCch-----hHHH--HHHHhhhccC Confidence 11 22332 2233566331 2222 1111111111 No 110 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=97.86 E-value=1.4e-05 Score=47.13 Aligned_cols=427 Identities=9% Similarity=0.015 Sum_probs=186.7 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHhh--cccccCCC----CCcccccccccccchHHHHHHHHHHHHHHhhcCCC Q lcl|NC_015159. 9 FAADGAAAAYNRLKNDRGAYETRAEDCATYT--IPSVFPSA----TADGSTSYTTPWQSIGARGLNNLASKLMLALFPVG 82 (532) Q Consensus 9 ~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~--~P~~~~~~----~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~ 82 (532) ++.+.+.+..+..+.++ ......+++|.=- ++.+.... .....+...++..+-+...++..++.|++ T Consensus 1 l~~~~i~~~i~~~~~~~-~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G------ 73 (451) T protein:vir:10 1 MELEKIRAIISADAARR-QEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFT------ 73 (451) T ss_pred CCHHHHHHHHHHHHHHH-HHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheec------ Confidence 88999999888887644 3333334433210 11111110 01111122355566666666666654432 Q ss_pred CCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccccc-----C Q lcl|NC_015159. 83 SSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVE-----G 157 (532) Q Consensus 83 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~-----~ 157 (532) .| ..+..++.. +..+.+ ..+..++|.....++.++...+|.|.+++..++... . T Consensus 74 ~p-~~~~~~~~~------------~~~~~~--------~~~~~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~ 132 (451) T protein:vir:10 74 YP-VLFDIDNNK------------ELNEKV--------TDVLGNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTN 132 (451) T ss_pred cc-ceeecCCcH------------HHHHHH--------HHHhccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccc Confidence 11 112222211 111111 223357899999999999999999887665443211 2 Q ss_pred CcceEEEEecceE-EEeeC-CCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEE Q lcl|NC_015159. 158 QSNAPKLYKLHNF-VVERD-AYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQE 235 (532) Q Consensus 158 ~~~~~~~~pl~~~-~v~~d-~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~ 235 (532) +.+.+.+++..+. ++-.| ..+++...+|.+......- +....+...+..++-+..-..|..... T Consensus 133 ~~~~~~~i~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~--------------~~~~~~~~~~~e~yt~~~~~~~~~~~~ 198 (451) T protein:vir:10 133 QTFKYGVVNTEEIIPIYRNGIERELEAVIRYYIQLEDVK--------------GQIQKQAYTYVEFWTDKILDKYKFFGV 198 (451) T ss_pred cceeEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccc--------------ccccceEEEEEEEEeCCeEEEEEeccc Confidence 4556777766554 44333 3577877776664322211 000111111111222111111111000 Q ss_pred -EcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccC-hh Q lcl|NC_015159. 236 -IDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQ-IR 313 (532) Q Consensus 236 -~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~-~~ 313 (532) ..+..........+|..+|++..+. +.+|.|=.+...+-+..+|.+.-......+...+|.+.+.--+... .+ T Consensus 199 ~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~ 273 (451) T protein:vir:10 199 SCCGSQIEHITVQHRFNSVPFVEFSN-----NIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSE 273 (451) T ss_pred CccccccccccccCCCCeeeEEEecc-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchh Confidence 1111122222346788899887654 4568888899999999999988888888888898887664211111 12 Q ss_pred hhccCCC-ceeec-C----ccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHH Q lcl|NC_015159. 314 RVAKANT-GDFVA-G----RKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELED 387 (532) Q Consensus 314 ~~~~~~~-G~~v~-g----~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~ 387 (532) ....... +.++. + ..+++. .+....+.+.....++.++..|...-..-.+........|+.-+..+-.-... T Consensus 274 ~~~~~~~~~~i~~~~~~~~~~~~~~--~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~ 351 (451) T protein:vir:10 274 FLKELKRYKTIKTETDSEGDSGGLK--TMQIEIPTEARKIILEILKKQIYESGQGLQQDTENFGNASGVALKFFYRKLEL 351 (451) T ss_pred hHHHHhhCCeEEecCcCCccCCcce--EEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccccHHHHHHHHHHHHH Confidence 2222222 33222 1 112233 33344466777788888877775532211111111123444433332221111 Q ss_pred HhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCccccccceeec--chHHHHHHHHHHHHHHHHHHHHhhcchhhhhc Q lcl|NC_015159. 388 TLGGVYSLLSQELQLPLVKILLKELQAT-SKIPNLPKEAVEPAIAT--GLEALGRGHDLNKLNVFIDYMIKLAGLQDDDI 464 (532) Q Consensus 388 ~LGpv~~rl~~E~l~Pli~r~~~il~r~-g~lp~~p~~~~~~~~v~--~l~~l~raq~~~~l~~~~~~laq~~p~~~d~i 464 (532) . ..+.+.. +.+.+.+.+.++.+. |.. ....+++.+.. +.+.+..++-+.+ ++ + .+ T Consensus 352 k----~~~k~~~-f~~~l~~~~~li~~~~~~~---d~~~i~i~f~~~~p~n~~e~~~~~~k-------l~---g----~i 409 (451) T protein:vir:10 352 K----SGLLETE-FRTSFDKLIKAILYFLGVT---DYKKIQQTYTRNMMSNDLEDADIATK-------SV---G----II 409 (451) T ss_pred H----HHHHHHH-HHHHHHHHHHHHHHHhCCC---CccceeEEecCCCCCCHHHHHHHHHH-------Hh---c----cC Confidence 1 1222222 222334444443331 221 12233443322 2122222211111 11 1 12 Q ss_pred CHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHH Q lcl|NC_015159. 465 NLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGG 516 (532) Q Consensus 465 d~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~ 516 (532) .-..++.. ++ ++.+.++..++..++++ .+.++.....+.... T Consensus 410 S~et~~~~----~p-----~v~d~~~e~~~~~ee~~-~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 410 PTKIILRH----HP-----WVDDVEEAEKLYLEEKK-IQASKVSDDYNNFTE 451 (451) T ss_pred chHHHHHh----CC-----CCCCHHHHHHHHHHHHH-HHHHHHHhhcCCCCC Confidence 22333322 22 23343333222222211 111222222221111 No 111 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=97.84 E-value=1.6e-05 Score=46.94 Aligned_cols=434 Identities=10% Similarity=0.039 Sum_probs=188.5 Q ss_pred CCCCCCCc----------------c-CHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc-----ccCCC-C---Cccccc Q lcl|NC_015159. 1 MAEVEKTG----------------F-AADGAAAAYNRLKNDRGAYETRAEDCATYTIPS-----VFPSA-T---ADGSTS 54 (532) Q Consensus 1 m~~~~~~~----------------~-~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~~-~---~~~~~~ 54 (532) |+.+.++- - ..+.+.+.....+. -.++++.+.+|.... +.... . ....+. T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~----~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~ 76 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKE----NIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKP 76 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHH----HHHHHHHHHHHhcCCCchhccccccccccccccccc Confidence 77774411 1 12222222233332 234555666665442 11100 0 011112 Q ss_pred ccccccchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHH Q lcl|NC_015159. 55 YTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHA 134 (532) Q Consensus 55 ~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~ 134 (532) ..++..+-+...+++.++.|++ -|+ ++...+.. ..+.| ...+ ..+|.....+ T Consensus 77 ~~ki~~n~~~~ivd~~~~~l~g--~~~-----~~~~~~d~-------------~~~~l-------~~~~-~n~~~~~~~~ 128 (478) T protein:vir:10 77 DWRMYTNYHQNLVDQKVAYAVA--NPV-----TFGVDNDK-------------ALKQI-------QHTL-NHKWDDKLVD 128 (478) T ss_pred cceeccchHHHHHHHHHhhhcc--CCe-----eeecCChH-------------HHHHH-------HHHH-hcCHHHHHHH Confidence 3356667777788888776654 121 12333221 11111 1223 3588999999 Q ss_pred HHHHHHhhCceeeeecccccccCCcceEEEEecceEEEeeC--CCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCc Q lcl|NC_015159. 135 AIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERD--AYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPS 212 (532) Q Consensus 135 ~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d--~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~ 212 (532) +.++..++|.+.+++..++ .+.+++++++..+.+...| ..+++...+|.+... .. T Consensus 129 ~~~~~~~~G~~~~~~~~d~---~g~~~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~--------------------~~ 185 (478) T protein:vir:10 129 ILTAASNKGIEWVQPYVDE---EGEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELD--------------------GA 185 (478) T ss_pred HHHHHHhcCeEEEEEEecC---CCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEec--------------------Cc Confidence 9999999999988775543 3456777777766544433 467787777665421 11 Q ss_pred ceEEEEEE--E--EeeCCCCeEEEE-EEEcCcc--cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHH Q lcl|NC_015159. 213 EEVTIYTH--V--YRDPEAMVFRSY-QEIDGEI--VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLEN 285 (532) Q Consensus 213 ~~v~i~~~--v--~~~~~~~~~~s~-~~~~~~~--~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~ 285 (532) ..+++|+. | +...++...... ....+.. .......+++..+|++.++. +.+|+|=.....+-+..++. T Consensus 186 ~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----~~~g~sd~~~v~~liDa~~~ 260 (478) T protein:vir:10 186 ERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVPFIPFKN-----NPQEVSDLFMYKTIIDALDK 260 (478) T ss_pred eEEEEEeCCeEEEEEEcCCeeeccccccccccccceecccccccCCccceEEecc-----CCCCCCcHHHHHHHHHHHHH Confidence 22333321 1 111222111111 0001100 00112245677899887754 46899988899999999998 Q ss_pred HHHHHHHHHHHHhcCceeecCccccChhh-hccCC-Ccee-ecCcc-ccccccccCCccchhHHHHHHHHHHHHHHHHHh Q lcl|NC_015159. 286 LYEAIVKMSMISSKVLFFVNPNGVTQIRR-VAKAN-TGDF-VAGRK-QDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM 361 (532) Q Consensus 286 l~~~~l~~~~~a~~p~~lv~~~g~~~~~~-~~~~~-~G~~-v~g~~-~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~ 361 (532) +.-......+....|.+++..-+.-+... ..... .+.+ +++.. +++. .+....+.......++.+++.|...-. T Consensus 261 ~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~l~~~~~~~~~~~~~~~l~~~i~~~s~ 338 (478) T protein:vir:10 261 RLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLKYYKAISVAGESGSGVD--TIKVEVPIDSVKEYTKMLRDYIIEFGQ 338 (478) T ss_pred HHHHHHHHHHHhhCceeeeecCCccccchhhhhhhhcceEEecCCCCCcce--EEeecCChHHHHHHHHHHHHHHHHHhC Confidence 88888888888888876654221111111 11111 1222 22322 2333 233333556666777777766654321 Q ss_pred -hhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCccccccceee--cchHHH Q lcl|NC_015159. 362 -LNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQAT-SKIPNLPKEAVEPAIA--TGLEAL 437 (532) Q Consensus 362 -~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~-g~lp~~p~~~~~~~~v--~~l~~l 437 (532) .+......+...|+..+..+..-.. +......+. +...+.+++.++.+. |. ......+++.+. .+.+.+ T Consensus 339 ~p~~~~~~~~~n~Sg~Al~~~~~~l~-~k~~~~~~~----~~~~l~~~~~li~~~~g~--~~~~~~i~i~f~~~~p~d~~ 411 (478) T protein:vir:10 339 GVDFQQDKFGNSPSGIALKFMYSNLD-LKANKLKNK----TLTALQELLQYIIDFYRL--DVKVQDIEITFNFNVMVNEL 411 (478) T ss_pred ccccCccccccccHHHHHHHHHHHHH-HHHHHHHHH----HHHHHHHHHHHHHHHhCC--CcccccceEEecCCCCCCHH Confidence 1111111223456655544322221 112222222 223333333333321 11 111122333331 222222 Q ss_pred HHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHH Q lcl|NC_015159. 438 GRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGG 516 (532) Q Consensus 438 ~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~ 516 (532) ..++-+ +.++.+ +....+++ .+| ++.+ ++|++..+++...++ +.+ ..... T Consensus 412 e~a~~~-------~kl~g~-------iS~et~~~----~l~-----~v~D~~~E~~ri~~E~~~~~--~~~----~~~~~ 462 (478) T protein:vir:10 412 ENSQIA-------MNSTGL-------LSKETILS----NHA-----WVEDPVAEMERIEQENIELN--QQL----PDIEE 462 (478) T ss_pred HHHHHH-------HHHhCC-------CChHHHHH----hCC-----CCCCHHHHHHHHHHHHHHHH--hhc----ccccc Confidence 222211 122222 22333333 333 2223 234433333221111 111 11111 Q ss_pred HHHHhhcccccCCCCC Q lcl|NC_015159. 517 QAAAAMMQQQAGLPTQ 532 (532) Q Consensus 517 ~~~~~~~~~~~g~~~~ 532 (532) ......-.+.....++ T Consensus 463 ~~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 463 GLNGEQQRQSENNQPE 478 (478) T ss_pred ccCCCCCCCCCCCCCC Confidence 1111112222222222 No 112 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=97.81 E-value=1.8e-05 Score=46.62 Aligned_cols=381 Identities=12% Similarity=0.043 Sum_probs=173.8 Q ss_pred hhhHHHHHHHHHHhhcccccCCC-C--Cccc-ccccccccchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhcc Q lcl|NC_015159. 25 RGAYETRAEDCATYTIPSVFPSA-T--ADGS-TSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSIT 100 (532) Q Consensus 25 R~~~e~~w~e~~~~~~P~~~~~~-~--~~~~-~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~ 100 (532) =+-+.++-+.+.+|..-..-..+ + -+.. +...+..-+-+..++++||..|.- .+ | ...|. T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~----~G---f--~~~d~------- 64 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEAPTGITIPAHIRAKYQAVLGWAAKGVDSLADRLIF----RA---F--ANDDF------- 64 (410) T ss_pred CCcchhhHHHHHHHhcCCCCccccchhccHHHHhHHHhhcchhHHHHHHhHhhhcc----cc---c--cCCCc------- Confidence 12223333334444332211111 1 1111 111234456667777777665441 11 1 11111 Q ss_pred ChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecceEEEeeCC-CCC Q lcl|NC_015159. 101 SPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDA-YDN 179 (532) Q Consensus 101 ~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~-~G~ 179 (532) + +++...+++|.....++.++..+||.+.++|-+++ .+..++++++..+.++..|+ .++ T Consensus 65 ------~-----------l~~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~~~---d~~~~i~~~sP~~~~~i~Dp~~~~ 124 (410) T protein:vir:95 65 ------N-----------VTEIFDRNNPDIFFDSAILSALIGSCSFVYISKGE---DDEVRLQVIESSNATGVIDPITGL 124 (410) T ss_pred ------h-----------HHHHHhhcChHHHHHHHHHHHHHhCceeEEEecCC---CCceEEEEEcccceEEEEeCCCCc Confidence 1 23445679999999999999999999999886643 23457788877766555555 455 Q ss_pred eEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcccccccccCccccCceEEEE Q lcl|NC_015159. 180 VLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVR 259 (532) Q Consensus 180 vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~R 259 (532) +..-++...- .+......+.+|+ ++ .++++..+...... ..++..||++.+. T Consensus 125 ~~~al~~~~~-----------------~~~~~~~~~~~~~-----~~----~~~~~~~~~~~~~~--~~~~g~vPvV~f~ 176 (410) T protein:vir:95 125 LVEGYAVLAR-----------------DDYNRPTLEAYFE-----PN----ATHFIPKDGEPYSV--TNETGIPLLVPVI 176 (410) T ss_pred eEEEEEEEEe-----------------cCCCeEEEEEEEe-----CC----cEEEEeeCCccccc--cCCCCCcceEEec Confidence 5443432110 0001111223321 11 22222222211111 3456779999999 Q ss_pred eeecCCCccccch-HHHHHHHHHHHHHHHHHHHHHHHHHhcCceee---cCccccChhhhccCCCceeec--Ccccc--c Q lcl|NC_015159. 260 LIKMPNEDYGRSF-VEEYLGDLKSLENLYEAIVKMSMISSKVLFFV---NPNGVTQIRRVAKANTGDFVA--GRKQD--V 331 (532) Q Consensus 260 w~~~~g~~YG~Gp-~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv---~~~g~~~~~~~~~~~~G~~v~--g~~~~--~ 331 (532) .+...++.||+|= .+..++-+..++...-..+..++....|...+ .++|.. .+......|.+.. .+.++ + T Consensus 177 n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~--~~~~~~~~~~i~~~~~~~~~~~~ 254 (410) T protein:vir:95 177 HRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQKYILGLDPDAEP--MEKWKATVSSLLTISSSDKGVKP 254 (410) T ss_pred ccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeeccCCCCCc--CchhhhhhhhheeccCCCCCCcc Confidence 9999899999984 35688888999999999999999999987444 222221 1112222344332 22221 1 Q ss_pred cccccCCccchhHHHHHHHHHHHHHHHHHhhhhccc----CCCCC-CCHHHHHH-------HHHHHHHHhhhhHHHHHHH Q lcl|NC_015159. 332 EVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQ----RGGDR-VTAEEIRY-------VAGELEDTLGGVYSLLSQE 399 (532) Q Consensus 332 ~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~----~~~~~-~TAtEi~~-------r~~E~~~~LGpv~~rl~~E 399 (532) ..-++ ..++++.- ++.++.-|+.......+.. ..... -+|.-|.. +++++.+.+|.-+.+ T Consensus 255 ~v~q~-~~~~l~~~---~~~l~~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~---- 326 (410) T protein:vir:95 255 SVGQF-TTASMSPF---TEQLRTAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLN---- 326 (410) T ss_pred eEEec-CCCChHHH---HHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Confidence 22222 23455543 3334433333222111100 01111 23332222 233334444432222 Q ss_pred HHHHHHHHHHHHHHhcCCCCCCccccccceee-c---chHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHH Q lcl|NC_015159. 400 LQLPLVKILLKELQATSKIPNLPKEAVEPAIA-T---GLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLAN 475 (532) Q Consensus 400 ~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v-~---~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~ 475 (532) ++...+.++ +-.+..+.+..+..++ . ..+.-+.++.++.+.-. ++..|-+ .+.+ .+.+ T Consensus 327 ----~~rla~~i~---~~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~Kl----~~a~~g~---~~~~----~~~~ 388 (410) T protein:vir:95 327 ----VAYVAACLR---DEFRYTRSQFVRTAVKWEPLFEADANTMTMIGDGVVKL----NQALPGY---INAE----TIRD 388 (410) T ss_pred ----HHHHHHHHh---cCCCCcccccceeeEEeeecCCcchhhHHHHHHHHHHH----HHhccCC---ccHH----HHHH Confidence 223333333 3345555665555443 2 22222334444433332 2222211 1222 2334 Q ss_pred hcCCCHhHccCCHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 476 SLGMDTTGLILTQQDKQAKMAEASTAAGM 504 (532) Q Consensus 476 ~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~ 504 (532) .+|.++. ++...+.+++.+..+ T Consensus 389 ~lg~~~~-------~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 389 LTGIAGD-------MSAKPVVSEGGSNGE 410 (410) T ss_pred hcCCChH-------HHHHHHHHHHHhCCC Confidence 4676433 332222222222222 No 113 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=97.77 E-value=2.1e-05 Score=46.26 Aligned_cols=449 Identities=14% Similarity=0.086 Sum_probs=199.5 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcc-----cccccc-cccchHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADG-----STSYTT-PWQSIGARGLNNLASKL 74 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~-----~~~~~~-~~dst~~~a~~~Laa~l 74 (532) |.+.++.-.+-..+..+|+..++--.. ...|++...-.||.......... ..++.+ .|-+.-.+.++.+++.+ T Consensus 1 m~~V~~~hp~y~~~~~~W~~ird~~~G-~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G~v 79 (501) T protein:vir:95 1 MPNVSFIRPELGKLLPLYYLIRDAIAG-EPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVGQV 79 (501) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHhcC-hHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhhhh Confidence 998876665666666777665543321 35677777777886433221111 112222 34444445555544444 Q ss_pred HHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 75 MLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 75 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) +. -|| .++.+ ..++.+++.| -+...+++.-+..+..+...+|-+.++||-... T Consensus 80 f~--k~p-----~~~~p--------------~~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~ 132 (501) T protein:vir:95 80 FM--RDP-----VVKVP--------------ALLNPLVANA------TGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTT 132 (501) T ss_pred hc--CCc-----ceeCc--------------HHHHHHHhcc------CCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCC Confidence 32 111 22222 1244455443 345667888888999999999999999984321 Q ss_pred ccCC------------cceEEEEecceEE-Ee---eCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEE Q lcl|NC_015159. 155 VEGQ------------SNAPKLYKLHNFV-VE---RDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIY 218 (532) Q Consensus 155 ~~~~------------~~~~~~~pl~~~~-v~---~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~ 218 (532) ...+ ...+..|+..+.. +. .+...++.-+..++....+. ++|. .+.++.| T Consensus 133 ~~~~~~t~a~~~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d--~~f~------------~~~~~q~ 198 (501) T protein:vir:95 133 EAEGGASIADLEAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAAD--DGFE------------MKTSGQF 198 (501) T ss_pred CCcccccHHHHHhccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecC--CCcc------------cceeEEE Confidence 1000 1236666654431 12 23334455555555543221 1111 2456667 Q ss_pred EEEEeeCCCCeEEEEEEEcCcc-------------c---ccccccCccccCceEEEEeeecCCCcccc--chHHHHHHHH Q lcl|NC_015159. 219 THVYRDPEAMVFRSYQEIDGEI-------------V---AGTEGEYPLDSCPWIPVRLIKMPNEDYGR--SFVEEYLGDL 280 (532) Q Consensus 219 ~~v~~~~~~~~~~s~~~~~~~~-------------~---~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~--Gp~~~al~d~ 280 (532) ..+.++.++. |....|.++.. . ....-..|.+.+++|++.|.-..+...+. .|.. |+ T Consensus 199 RvL~~~~~g~-~~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl----~l 273 (501) T protein:vir:95 199 RVLRLDEEGY-YVHEIWREPQPTKADGSKIPKGNYQQYVVYKPTDAQGKRLTEIPFMFIGSENNDSNPDNPNFY----DL 273 (501) T ss_pred EEEeeCCCce-EEEEEEEecCCcccCcceecCCcccccceeeeeccCCCcCCeeeEEEEecCCCCCCCCccchH----HH Confidence 7777765543 33222221110 0 00111224467888888887665554443 3333 44 Q ss_pred HHHHHH---HHHHHH-HHHHHhcCceeecCccccC-------hhhhccCCCceeecCccccccccccCCccchhHHHHHH Q lcl|NC_015159. 281 KSLENL---YEAIVK-MSMISSKVLFFVNPNGVTQ-------IRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATA 349 (532) Q Consensus 281 ~~L~~l---~~~~l~-~~~~a~~p~~lv~~~g~~~-------~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i 349 (532) ..||.- ..+-.+ .+..+..|.+.+. |... ...+.-+.+..+.-+..++...++.. +..+ ....+ T Consensus 274 A~lni~hy~~ssd~~~~l~~~~~P~l~i~--G~~~~~~~~~~~~~i~~G~~~~~~lP~~~~~~~ie~~-~~~i--~~~~l 348 (501) T protein:vir:95 274 ASLNMAHYRNSADYEESCYIVGQPTPVLI--GLTEEWVTNVLKGSVNFGSRGGIPLPVGADAKLLQAS-ENTM--LKEAM 348 (501) T ss_pred HHHHHHHHhhhhHHHHHHHHcccceeeee--CCcccccccCCCCceeecccccccCCCCCceeEEecC-hhhH--HHHHH Confidence 444432 222233 3444445543332 1111 11111111111111122233333321 2233 35667 Q ss_pred HHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccce Q lcl|NC_015159. 350 DDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPA 429 (532) Q Consensus 350 ~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~ 429 (532) ++++++++++ .-.+........||++...+....-..|+-+...+++- +.-++..+...+ |+- ++.+++. T Consensus 349 ~~l~~~m~~~--Ga~ll~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~a-l~~~l~~~a~w~---g~~----~~~~~v~ 418 (501) T protein:vir:95 349 DTKERQMVAL--GAKLVEQKEVQRTATEAELEAASEGSTLSSATKNVSAA-FEWALKWAARWV---GQA----DSGVKFE 418 (501) T ss_pred HHHHHHHHHH--HHhhccCCccchhHHHHHHHHHHHhHHHHHHHHHHHHH-HHHHHHHHHHHc---CCC----CCceEEE Confidence 7777777663 11222344445899999999999999999988888764 333334433332 221 1222222 Q ss_pred eecchH-HHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 430 IATGLE-ALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAG 508 (532) Q Consensus 430 ~v~~l~-~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~ 508 (532) +-.-.. .--..+.++.++...+ ...|....+...+.+ .||..... ++|.++...... .+....+ T Consensus 419 i~~df~~~~~~~~~~~al~~~~~---------~G~is~~t~~~~L~~-~~v~~~~~---~~e~e~i~~~~~--~~~~~~~ 483 (501) T protein:vir:95 419 LNTDFDIARMTPDERRSLVEEWQ---------KGAITFEEMRTGLRK-AGVATEDD---SKAKEKIAKDTA--EAMALAT 483 (501) T ss_pred EecccccccCCHHHHHHHHHHHh---------CCCCcHHHHHHHHHh-CCCCChhH---HHHHHHHHhhhc--Ccccccc Confidence 211110 0001222322222211 122555555555544 47743211 122211111111 0000000 Q ss_pred HhhhHHHHHHHHhhcccccCCCC Q lcl|NC_015159. 509 QQMGAAGGQAAAAMMQQQAGLPT 531 (532) Q Consensus 509 ~~~~~~~~~~~~~~~~~~~g~~~ 531 (532) ...+...+.++.. -|..+ T Consensus 484 ~~~~~~~~~gg~~-----~~~~~ 501 (501) T protein:vir:95 484 PANVPGDGSGGDN-----VGNSE 501 (501) T ss_pred cCCCCCCCccccc-----ccCCC Confidence 0011111111122 23333 No 114 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=97.73 E-value=2.4e-05 Score=45.90 Aligned_cols=380 Identities=11% Similarity=0.005 Sum_probs=179.4 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcc--cccCCCCCccc-ccccccccchHHHHHHHHHHHHHHhhcCCCCCc Q lcl|NC_015159. 9 FAADGAAAAYNRLKNDRGAYETRAEDCATYTIP--SVFPSATADGS-TSYTTPWQSIGARGLNNLASKLMLALFPVGSSF 85 (532) Q Consensus 9 ~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P--~~~~~~~~~~~-~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~W 85 (532) ++.+.+.....++..++. ...+..++|+---+ .... .-+.. +..-+..-+-+..++++||..|. ..+ T Consensus 1 ~~~~~i~~L~~~~~~~~~-r~~~~~~yY~g~~~~~~~~~--~~p~~~~~~~~~v~nw~~~iVds~a~rl~----~~G--- 70 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHKR-RAEMRYEQYAMKHVDRFKGI--TIPQALSQQYRSILGWCAKGVDSLADRLV----FRE--- 70 (409) T ss_pred CCHHHHHHHHHHHHHHhH-HHHHHHHHHhccCchhhcch--hhhHHHHHHHhhhcChhHHHHHHhHhhcc----ccc--- Confidence 777777776666665442 22233334432211 1111 11111 11123444566677777766443 112 Q ss_pred cccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEE Q lcl|NC_015159. 86 FKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLY 165 (532) Q Consensus 86 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~ 165 (532) |+ ..|. + +++...+++|.....++.++..+||.+.++|-+++ .+..+++++ T Consensus 71 f~--~~d~-------------~-----------l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~---dg~~~i~~~ 121 (409) T protein:vir:16 71 FE--NDDF-------------T-----------VNEIFEENNPDIFFDSTVLSALIASCSFTYISKGE---NDAVRLQVI 121 (409) T ss_pred cc--Ccch-------------H-----------HHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCC---CCceEEEEE Confidence 11 1110 1 23456779999999999999999999999887653 234577788 Q ss_pred ecceEEEeeCC-CCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEE-cCccccc Q lcl|NC_015159. 166 KLHNFVVERDA-YDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEI-DGEIVAG 243 (532) Q Consensus 166 pl~~~~v~~d~-~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~-~~~~~~~ 243 (532) +..+.++..|+ .+++...++...- ..........+| .+ + ..+++. ++..... T Consensus 122 sP~~~~~i~D~~~~~~~~a~~~~~~-----------------d~~~~~~~~~~~---~~--~----~~~~~~~~~~~~~~ 175 (409) T protein:vir:16 122 EATNATGIIDPITGLLTEGYAVLER-----------------DENNNVVLEAHF---LP--D----RTDYYYRDSRNNIS 175 (409) T ss_pred cccceEEEeecccccceeeeEEEEe-----------------cCCCceEEEEEE---ec--C----cEEEEEecCccccc Confidence 77665555555 4555443332110 000111111121 11 1 111111 1111111 Q ss_pred ccccCccccCceEEEEeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHHhcCceeec---CccccChhhhccCC Q lcl|NC_015159. 244 TEGEYPLDSCPWIPVRLIKMPNEDYGRSFV-EEYLGDLKSLENLYEAIVKMSMISSKVLFFVN---PNGVTQIRRVAKAN 319 (532) Q Consensus 244 ~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~-~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~---~~g~~~~~~~~~~~ 319 (532) ...++..||++.+..+...++.||+|=. +..++-+..+|...-..+..++....|...+- +||. +.+..... T Consensus 176 --~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~--~~~~~~~~ 251 (409) T protein:vir:16 176 --IANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAE--PMETWKAT 251 (409) T ss_pred --eecCCCCcceEEecccccccccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCC--ccchhhhh Confidence 2346678999999999999999999844 56888889999999999999999999885441 2221 11111222 Q ss_pred Cceee--cCcccc--ccccccCCccchhHHHHHHHHHHHHHHHHHhhhhccc----CCCCC-CCHHHHHH-------HHH Q lcl|NC_015159. 320 TGDFV--AGRKQD--VEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQ----RGGDR-VTAEEIRY-------VAG 383 (532) Q Consensus 320 ~G~~v--~g~~~~--~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~----~~~~~-~TAtEi~~-------r~~ 383 (532) .|.+. |...++ +..-++ ..++++.- ++.++.-|+.......+.. ..... -+|.-|.+ +++ T Consensus 252 ~~~i~~~~~d~~g~~~~v~q~-~~~~l~~~---~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~ 327 (409) T protein:vir:16 252 VSSMLQFTKDEDGDKPTLGQF-TQPSMSPF---TEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGR 327 (409) T ss_pred hhHhhccCCCCCCCCceEEec-CCCChhHH---HHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHH Confidence 34432 322222 222222 23445533 4444444433222111100 11111 23333322 333 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceee----cchHHHHHHHHHHHHHHHHHHHHhhcch Q lcl|NC_015159. 384 ELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIA----TGLEALGRGHDLNKLNVFIDYMIKLAGL 459 (532) Q Consensus 384 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v----~~l~~l~raq~~~~l~~~~~~laq~~p~ 459 (532) ++.+.+|..+.+ ++...+.++ |-++..+.+..++.++ .+-+..+.|+.++.+.-.++ ..|. T Consensus 328 ~k~~~fg~~l~~--------~~rla~~~~---~~~~~~~~~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~----a~~~ 392 (409) T protein:vir:16 328 KAQRSLGAGLLN--------VAYLAACLR---DDVPYLREQFSKTKPKWEPLFEADASMLSLIGDGAIKLNQ----AIPE 392 (409) T ss_pred HHHHHHHHHHHH--------HHHHHHHHh---cCCCccchhhccceEEecCCCCcchhhHHHHHHHHHHHHh----hccc Confidence 444444443333 222223332 3345556665544432 12233333444443333322 2222 Q ss_pred hhhhcCHHHHHHHHHHhcCCCHhH Q lcl|NC_015159. 460 QDDDINLLDVKMRLANSLGMDTTG 483 (532) Q Consensus 460 ~~d~id~d~~~~~~a~~~Gv~p~~ 483 (532) .+ +. +.+.+.+|.+.+. T Consensus 393 ~~---~~----~v~~~~~g~~~~d 409 (409) T protein:vir:16 393 FI---NK----DTIRDLTGIKGAE 409 (409) T ss_pred cc---ch----hHHHHhccCCCCC Confidence 21 11 1223335654443 No 115 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=97.66 E-value=3.1e-05 Score=45.30 Aligned_cols=431 Identities=10% Similarity=0.016 Sum_probs=166.6 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccc-----cCCCCCcccccccccccchHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV-----FPSATADGSTSYTTPWQSIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~-----~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~ 75 (532) |...++ .+.+....+++.. ..++.+.+.+|..=.. .............++..+-+..+++.+++.|+ T Consensus 1 ~~~~t~----~~~~~~l~~~~~~----~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~ 72 (456) T protein:vir:79 1 MTASTP----AEWLPVLTKRIDD----GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII 72 (456) T ss_pred CCCCCH----HHHHHHHHHHHHH----HHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhc Confidence 554442 2223222333222 2334444444443321 11101111111223445666777777777664 Q ss_pred HhhcCCCCCccccCCC-hHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 76 LALFPVGSSFFKLNVS-ELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~-d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) +- + |++... |.+ + ...+.+.+.+++|.....++.++..+||.|.+++-.++ T Consensus 73 ~~------g-~~~~~~~d~~-------------~-------~~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~e- 124 (456) T protein:vir:79 73 PN------G-ITVGGSADSD-------------L-------ALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD- 124 (456) T ss_pred cC------C-eecCCCCCcc-------------H-------HHHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCC- Confidence 33 2 122211 111 1 11223455667899999999999999999988765543 Q ss_pred ccCCcceEEEEecceEEEeeCC-CC-CeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEE Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVVERDA-YD-NVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRS 232 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v~~d~-~G-~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s 232 (532) .+..++++++..+.++..|+ .+ ++...+|.+. ..+.- ... ...-.++..+..+...+...+...+.. T Consensus 125 --dg~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~-~~d~~-------~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 193 (456) T protein:vir:79 125 --DGTATITADSPETMVVSVDPLQPWRIRSAMRWWR-DLDAE-------SDF-AIVWSGDGWQKFARPCFVQSSSRRRLV 193 (456) T ss_pred --CCceEEEEeccceeEEEEcCCCCCceEEEEEEEE-ecCCc-------eeE-EEEEcCCceEEEEEEEEeeccccceee Confidence 34457788877766555554 33 3444444432 11100 000 000011122222211111111111111 Q ss_pred EEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCc----- Q lcl|NC_015159. 233 YQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPN----- 307 (532) Q Consensus 233 ~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~----- 307 (532) +. ..+..........++..+|++.++ +..|.|=.+..++-+-.++...-..+..++..+.|...+... T Consensus 194 ~~-~~~~~~~~~~~~~~~~~~pvv~~~------N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~ 266 (456) T protein:vir:79 194 TR-ISDSWVPVGDAVVTGSPPPVVVYQ------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLP 266 (456) T ss_pred ec-cCCceeecccccCCCCceeEEEec------CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccc Confidence 11 111111112223455667776542 467888788888777777766656566666666665333211 Q ss_pred -----c-ccChhhhccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcc----cCCCCCCCHHH Q lcl|NC_015159. 308 -----G-VTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAV----QRGGDRVTAEE 377 (532) Q Consensus 308 -----g-~~~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~----~~~~~~~TAtE 377 (532) | .++........+|.+... +++....++. ..+++.....++.+...| +...... ..+....++.- T Consensus 267 ~~d~~g~~i~~~~~~~~~~~~~~~~-~~~~~~~q~~-~~~~~~~~~~l~~~i~~i---~~~t~~p~~~~~~~~~N~Sg~A 341 (456) T protein:vir:79 267 KVDENGNAIDYASIFEAAPGALWEL-PPGVDIWESQ-TNDFTPMLSAIKEHIRQL---SSATKTPLPMLMPDSANQSAEG 341 (456) T ss_pred cccccccccchhhhhhhhccccccC-CCCcceeeec-ccChHHHHHHHHHHHHHH---HhhcCCChhHhcccccCcHHHH Confidence 1 011112122233333222 2222222322 233433333333333333 2211110 01222345554 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeec--chHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015159. 378 IRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIAT--GLEALGRGHDLNKLNVFIDYMIK 455 (532) Q Consensus 378 i~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~--~l~~l~raq~~~~l~~~~~~laq 455 (532) +......+.+. .++.+ ..+.+-+.+.+.++....-.+ +...+++.+.- +.+.+.+|+-+.++. +. . T Consensus 342 l~~~~~~l~~k----~~~~~-~~f~~~l~~~~~l~~~~~g~~--~~~~i~v~w~~~~~~s~~~~ada~~kl~---~~--G 409 (456) T protein:vir:79 342 AHNIEKGFLFK----CEDRL-SIAKIGLEAILVKALQIEGES--VEDTVDVSFESPDRVTLGEKYSAASLAK---AA--G 409 (456) T ss_pred HHHHHHHHHHH----HHHHH-HHHHHHHHHHHHHHHHhcCCC--ccccceEEeCCCCCcCHHHHHHHHHHHH---hc--C Confidence 44433333222 22222 233444555555554422112 11223333311 222233322222221 10 1 Q ss_pred hcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHh Q lcl|NC_015159. 456 LAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAA 521 (532) Q Consensus 456 ~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~ 521 (532) + + ..+.. ...+|+++..+- +.+.++..++..+....++ +.+.+.+. . T Consensus 410 ~-~-------~~~~~---~~~lg~~~~~i~----~~e~~r~~~e~~~~~~~~~-~~~~~~~~---~ 456 (456) T protein:vir:79 410 E-S-------WASIR---RNILNYNADQIK----QDDLDRAREQITLFAGNPV-QRPQEDGS---R 456 (456) T ss_pred C-C-------hHHHH---HhcCCCCHHHHH----HHHHHHHHHHHHHHhhhHh-hcCCCCCC---C Confidence 1 1 11111 234677654321 1111121121111111111 11111111 1 No 116 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=97.64 E-value=3.4e-05 Score=45.06 Aligned_cols=435 Identities=9% Similarity=0.018 Sum_probs=185.9 Q ss_pred CCCCCCCc-----------------cCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc-----ccCC--CCC--ccccc Q lcl|NC_015159. 1 MAEVEKTG-----------------FAADGAAAAYNRLKNDRGAYETRAEDCATYTIPS-----VFPS--ATA--DGSTS 54 (532) Q Consensus 1 m~~~~~~~-----------------~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~--~~~--~~~~~ 54 (532) |+...++- ...+.+.+..+..+.+ ..+++.+.+|..=. +... ... ...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~----~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~ 76 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKEN----IDNITMGERYYNHHPDILDAPFKRDVNGDYDETKP 76 (478) T ss_pred CccccccCCchhhhHHHHHhhhccCChHHHHHHHHHHHHHH----HHHHHHHHHHhcccccccccchhhhcccccccccc Confidence 66662211 1122233333333322 34455555553311 1000 000 11112 Q ss_pred ccccccchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHH Q lcl|NC_015159. 55 YTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHA 134 (532) Q Consensus 55 ~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~ 134 (532) ..++-.+.+...++..++.|++ -| +++...+.. +.+ .+...+ .++|.....+ T Consensus 77 ~~ki~~n~~k~ivd~~~~yl~g--~p-----~~~~~~~~~-------------~~~-------~l~~~~-~n~~~~~~~~ 128 (478) T protein:vir:10 77 DWRMYTNYHQNLVDQKVAYAVA--NP-----VTFGVDNDK-------------ALK-------QIQHTL-NHKWDDKLVD 128 (478) T ss_pred cceeccchHHHHHHHHhhhhcc--cC-----ceeecCChH-------------HHH-------HHHHHH-hccHHHHHHH Confidence 2355566777777777777665 22 223333322 111 112233 3688999999 Q ss_pred HHHHHHhhCceeeeecccccccCCcceEEEEecceEEEeeC--CCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCc Q lcl|NC_015159. 135 AIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERD--AYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPS 212 (532) Q Consensus 135 ~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d--~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~ 212 (532) +.++..++|.+.+++..++ ++.+++.+++..+.+...| ..|++...+|.+...- . T Consensus 129 ~~~~~~~~G~~~~~v~~d~---~~~~~~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~--------------------~ 185 (478) T protein:vir:10 129 ILTAASNKGIEWVQPYVDE---EGEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELDG--------------------A 185 (478) T ss_pred HHHHHhhCCeEEEEEEecC---CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeeeC--------------------c Confidence 9999999999988776543 3567788887776544333 3688777666554210 1 Q ss_pred ceEEEEEE----EEeeCCCCeEEEEEEE-cCc--ccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHH Q lcl|NC_015159. 213 EEVTIYTH----VYRDPEAMVFRSYQEI-DGE--IVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLEN 285 (532) Q Consensus 213 ~~v~i~~~----v~~~~~~~~~~s~~~~-~~~--~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~ 285 (532) ..+++|+. .+...++......... .+. ........+++..+|++..+. +.+|.|-.+...+-+..++. T Consensus 186 ~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~~~ 260 (478) T protein:vir:10 186 ERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFKN-----NPQEVSDLFMYKTIIDALDK 260 (478) T ss_pred eEEEEEeCCcEEEEEecCCeeeccccccccccccceecccccccCCcceEEEecc-----CCCCCCcHHHHHHHHHHHHH Confidence 22333321 1111222111111100 000 001112345778899988765 35799999999999999999 Q ss_pred HHHHHHHHHHHHhcCceeecCccccChhhh-cc-CCCcee-ecCc-cccccccccCCccchhHHHHHHHHHHHHHHHHHh Q lcl|NC_015159. 286 LYEAIVKMSMISSKVLFFVNPNGVTQIRRV-AK-ANTGDF-VAGR-KQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM 361 (532) Q Consensus 286 l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~-~~-~~~G~~-v~g~-~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~ 361 (532) +.-......+....|.+.+..-..-..... .. ...+.+ +.+. .+++..+ ....+.......++.+++.|...-. T Consensus 261 ~~S~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~I~~~s~ 338 (478) T protein:vir:10 261 RLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLKYYKAISVAGESGSGVDTI--KVEVPIDSVKEYTKMLRDYIIEFGQ 338 (478) T ss_pred HHHHHHHHHHHhhCcceeeecCCcccccchhhhhhhCceeEecCCCCCcceEE--eecCCHHHHHHHHHHHHHHHHHHhC Confidence 888888888888988766542111111111 11 112333 3332 2334333 3334566677777777776655322 Q ss_pred hhhcc-cCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCccccccceeecchHHHHH Q lcl|NC_015159. 362 LNSAV-QRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQAT-SKIPNLPKEAVEPAIATGLEALGR 439 (532) Q Consensus 362 ~~~~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~-g~lp~~p~~~~~~~~v~~l~~l~r 439 (532) .-.+. ...+...|+.-+..+..-.... ..+. ...+.+.+.+.+.++.+. |. ......+.+.+. +.-|-.. T Consensus 339 ~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k----~~~~-~~~~~~~l~~~~~li~~~~~~--~~d~~~i~i~f~-~~~p~~~ 410 (478) T protein:vir:10 339 GVDFQQDKFGNSPSGIALKFMYSNLDLK----ANKL-KNKTLTALQELLQYIIDFYRL--DVRVQDIEITFN-FNVMVNE 410 (478) T ss_pred CcCcCccccccchHHHHHHHHHHHHHHH----HHHH-HHHHHHHHHHHHHHHHHHhCC--CcccccceEEeC-CCCCCCH Confidence 11111 1112334555443322221111 1111 122233334434333321 11 111122333331 1111111 Q ss_pred HHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHH-HH Q lcl|NC_015159. 440 GHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAG-GQ 517 (532) Q Consensus 440 aq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~-~~ 517 (532) +..++. ++.++.+ +....++. .++ ++.+ ++|++...++....+. + ..... +. T Consensus 411 ~e~~~~----~~~~~g~-------iS~et~i~----~~~-----~v~d~~~E~~ri~~E~~~~~~--~----~~~~~~~~ 464 (478) T protein:vir:10 411 LENSQI----AMNSTGL-------LSKETILG----NHS-----WVQDPVAEMERIEQENIELNQ--Q----LPDIEEGL 464 (478) T ss_pred HHHHHH----HHHHhCC-------CChHHHHH----hCC-----CCCCHHHHHHHHHHHHHHHHH--h----ccccCCCC Confidence 111111 1111211 22232332 222 2222 2333333222221111 1 11110 01 Q ss_pred HHHhhcccccCCCC Q lcl|NC_015159. 518 AAAAMMQQQAGLPT 531 (532) Q Consensus 518 ~~~~~~~~~~g~~~ 531 (532) .-...-+...+.++ T Consensus 465 ~d~~~~~~~d~~~e 478 (478) T protein:vir:10 465 NDEQQRQSEDNQSE 478 (478) T ss_pred cccccccCcCCCCC Confidence 11112223333333 No 117 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=97.55 E-value=4.7e-05 Score=44.32 Aligned_cols=446 Identities=11% Similarity=0.072 Sum_probs=193.2 Q ss_pred CCCCCCCcc--CHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCC----CCc------cc----ccccc--cccch Q lcl|NC_015159. 1 MAEVEKTGF--AADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSA----TAD------GS----TSYTT--PWQSI 62 (532) Q Consensus 1 m~~~~~~~~--~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~----~~~------~~----~~~~~--~~dst 62 (532) |.=.+|.-. +.+...+|.. .+..++.|+. + -+.+.... .+. .. .+... .-++. T Consensus 1 mn~~dr~i~~~sP~~~~~R~~-ar~~~~~y~a-----a---~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~ 71 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLR-SRAVIQAYEA-----V---KTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDL 71 (502) T ss_pred CchHhhHHhhcChHHHHHHHh-hHHHHhhccc-----c---CcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChH Confidence 554443211 1222222221 1112222221 1 11111000 000 00 11112 34788 Q ss_pred HHHHHHHHHHHHHH--hhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHH Q lcl|NC_015159. 63 GARGLNNLASKLML--ALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLL 140 (532) Q Consensus 63 ~~~a~~~Laa~l~~--~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~ 140 (532) +..+++.+++.+++ ++++..++=..-...+.++.+. -...-+.|.+.| +.=.+.+||.....++..++ T Consensus 72 a~~av~~~~~nvVG~ggi~~~~~~~~~~~~~~~~~~~~-----ie~~w~~Wa~~~-----D~~g~~~f~~~q~l~~r~~~ 141 (502) T protein:vir:79 72 VIGVFDKLEERVVGKNGIIVEPHPVLRNGAIARDLAAE-----IRTRWSEWSVSP-----EVTGQFTRPMLERLMLRTWL 141 (502) T ss_pred HHHHHHHHHHhhccCCceeeeeccCCCChhHHHHHHHH-----HHHHHHHhhcCc-----CccccCCHHHHHHHHHHHHH Confidence 89999999999996 5666444411111111111110 011223333222 23346789999999999999 Q ss_pred hhCceeeeeccccc---ccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEE Q lcl|NC_015159. 141 VAGNVLLYIPSTEQ---VEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTI 217 (532) Q Consensus 141 ~~G~~~~~v~~~~~---~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i 217 (532) .-|-+++-+..... ..+.++.++. +-+..+.|+- ... ..-.| T Consensus 142 ~dGE~f~~~~~~~~~~~~~g~~~~l~l----------------------q~iepd~l~~--------~~~-----~~~~i 186 (502) T protein:vir:79 142 RDGEVFAQMVSGRINSLTPSAGVHFWL----------------------EALEPDFIPM--------TSD-----ESNRL 186 (502) T ss_pred hCCceEEEEeecccCccCCCcccceEE----------------------EEecchhcCC--------CCC-----CCCee Confidence 98988764422110 1112221111 1111111210 000 11136 Q ss_pred EEEEEeeCCCCeEEEEEEEcCcccccccccCccccCc---eEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 218 YTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCP---WIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMS 294 (532) Q Consensus 218 ~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P---~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~ 294 (532) ...|+.|..++|...+.. ..........++ ...| +++.-....+|..-|.+...-+|..++.|+.+..+.+.++ T Consensus 187 ~~GVe~d~~Gr~~aY~i~-~~hPgd~~~~~~--~rvpA~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a 263 (502) T protein:vir:79 187 NQGVFVDDWGRPEKYLVY-KSRPVSGRQMET--KEVDAERMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAA 263 (502) T ss_pred EeeeEECCCCceEEEEEe-ecCCCCCcccce--eEechhheEEeecccCCccccCCchHHHHHHHHHHHhHHHHHHHHHH Confidence 788999888887654433 221111111111 2233 3344444679999999999999999999999999999999 Q ss_pred HHHhcCceeecC-ccc-cC--------hhhhccCCCceeecC-ccc-cccccccC-CccchhHHHHHHHHHHHHHHHHH- Q lcl|NC_015159. 295 MISSKVLFFVNP-NGV-TQ--------IRRVAKANTGDFVAG-RKQ-DVEVFQLE-KYNDFQVAKATADDIEKRLSYAF- 360 (532) Q Consensus 295 ~~a~~p~~lv~~-~g~-~~--------~~~~~~~~~G~~v~g-~~~-~~~~~~~~-~~~~~~~~~~~i~~~~~rI~~af- 360 (532) ..++.....+.. ++- .. -.......||.+++. .++ ++...... ..++| ......+...|-.++ T Consensus 264 ~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~~L~pGe~i~~~~p~~p~~~~---~~f~~~~lr~iaaglG 340 (502) T protein:vir:79 264 RIAAALGMYIRKGDGQSYEPDGNGSKENERELTIQPGIIYDDLKPGEEIGMVKSDRPNPNL---ETFRNGQLRAVAAGSR 340 (502) T ss_pred HHhhhheeeeecCCCcccccccCCCCCccccccccCCccccccCCCceeeeeCCCCCCCCH---HHHHHHHHHHHHhhcC Confidence 998888866652 111 00 011122345655442 222 23322211 22233 233444444455553 Q ss_pred -hhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccc----ccceee---- Q lcl|NC_015159. 361 -MLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEA----VEPAIA---- 431 (532) Q Consensus 361 -~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~----~~~~~v---- 431 (532) -+..+. .|-.. +=.-++.-..|..+.+--.=..|...|+.|+..+++..+...|.||-+.... .....+ T Consensus 341 i~ye~lt-~D~s~-nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~ 418 (502) T protein:vir:79 341 LSFSSTA-RNYNG-TYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVM 418 (502) T ss_pred CCHHHHh-ccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCc Confidence 111221 22222 3334444444444444444445666799999999999999999998443211 122222 Q ss_pred cchHHHHHHHHH-HHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_015159. 432 TGLEALGRGHDL-NKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQ- 509 (532) Q Consensus 432 ~~l~~l~raq~~-~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~- 509 (532) ..|+|+--++.. ..+.+ . +. ....++...|.|+..++ +|.+.+++.. ..-...- T Consensus 419 ~~iDP~Ke~~a~~~~i~~------G--------l~---t~~~~~a~~G~D~~~v~---~q~a~e~~~~----~~~Gl~~~ 474 (502) T protein:vir:79 419 PWIDPVKEAEAWKIQIRG------G--------AA---TESDWVRAGGRNPDDVK---RRRKAEIDEN----RKLDLVFD 474 (502) T ss_pred cccChHHHHHHHHHHHHc------C--------CC---CHHHHHHHcCCCHHHHH---HHHHHHHHHH----HHcCCCCC Confidence 123444322111 00000 0 00 11123334466664333 2222221111 1110000 Q ss_pred ----hhhHHHHHHHHhhcccccC-CCCC Q lcl|NC_015159. 510 ----QMGAAGGQAAAAMMQQQAG-LPTQ 532 (532) Q Consensus 510 ----~~~~~~~~~~~~~~~~~~g-~~~~ 532 (532) .............-.++.+ .++| T Consensus 475 ~~~~~~~~~~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 475 TDPASDKGGSSAATKRQEPQHTDDQSEE 502 (502) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 0000000000011111111 1112 No 118 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=97.16 E-value=0.00015 Score=41.57 Aligned_cols=448 Identities=11% Similarity=0.024 Sum_probs=191.6 Q ss_pred CCCC-------CCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccc---cCCCCCcccccccccccchHHHHHHHH Q lcl|NC_015159. 1 MAEV-------EKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV---FPSATADGSTSYTTPWQSIGARGLNNL 70 (532) Q Consensus 1 m~~~-------~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~---~~~~~~~~~~~~~~~~dst~~~a~~~L 70 (532) ||=. +-..++.+.+.+..+.++.. .++++++.+|....- .+... .......++-.+-+...++.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~----~~~~~~l~~Yy~g~~~i~~~~~~-~~~~~~~ki~~n~~~~Iv~~~ 75 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPNIEAINYAIRELQNR----KKRLDKLSDYYNGKQEIEKHEFD-NATVEAANVMVNHAKYITDMN 75 (499) T ss_pred CccchhhhHHhhhhcCCHHHHHHHHHHHHHH----HHHHHHHHHHhccccchhcCCcC-cCCCCcceeecchHHHHHHHH Confidence 5432 11233455555555555443 344555555544431 11111 111223456566667777777 Q ss_pred HHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeec Q lcl|NC_015159. 71 ASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIP 150 (532) Q Consensus 71 aa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~ 150 (532) ++.|.+- |+ ++...+... ...+.+.+..++|.....++.++..+||.+.+++. T Consensus 76 ~~~l~g~--p~-----~~~~~~~~~--------------------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~ 128 (499) T protein:vir:10 76 VGFMTGN--PV-----KYVAEKGKN--------------------IDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLY 128 (499) T ss_pred hhhhccc--Cc-----eeecCChhH--------------------HHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEE Confidence 7655442 22 222322111 11233556677898999999999999999977765 Q ss_pred ccccccC--------------CcceEEEEecce-EEEeeCCCCCeEE-EEEEEeecHHHhhHHHHHHHHhhcccCCCcce Q lcl|NC_015159. 151 STEQVEG--------------QSNAPKLYKLHN-FVVERDAYDNVLQ-IVTEDKIARAALPEDVRKSLEEAQGDQNPSEE 214 (532) Q Consensus 151 ~~~~~~~--------------~~~~~~~~pl~~-~~v~~d~~G~vd~-i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 214 (532) .++.... ...++.+++..+ |.+.-|..++... .+|.+...-. ........ T Consensus 129 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~--------------~~~~~~~~ 194 (499) T protein:vir:10 129 LKKTDPISVRDELGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDL--------------EGNTNGYS 194 (499) T ss_pred ecccccccccccccccccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeec--------------CCCceEEE Confidence 4432110 123455555443 5554455444333 3333321100 00111122 Q ss_pred EEEEEEEEeeCCCC-eEEE--EEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 215 VTIYTHVYRDPEAM-VFRS--YQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIV 291 (532) Q Consensus 215 v~i~~~v~~~~~~~-~~~s--~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l 291 (532) +++|+ ++.. .|.. ..+..+..........++..+|++.++- +.+|.|=.+...+-+..++.+.-... T Consensus 195 ~~iyt-----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~~~~d~e~v~~liD~~~~~~S~~~ 264 (499) T protein:vir:10 195 ITVYM-----PQRIVEYRTKTTMEVSANDPIVYDGENLFGAVPIIEFRN-----NEERQGDFEQLISLIDAYNLLQTDRI 264 (499) T ss_pred EEEEe-----CCeEEEEEecCCccccCcceecccccCCCCccceEEecC-----CCCCCCchHhHHHHHHHHHHHHHHHH Confidence 33332 1110 0000 0001111111122345678899987654 46789988999999999999888888 Q ss_pred HHHHHHhcCceeecCcccc-ChhhhccCCCceee-cCccccccccccCCccchhHHHHHHHHHHHHHHHHHhh-hhcccC Q lcl|NC_015159. 292 KMSMISSKVLFFVNPNGVT-QIRRVAKANTGDFV-AGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFML-NSAVQR 368 (532) Q Consensus 292 ~~~~~a~~p~~lv~~~g~~-~~~~~~~~~~G~~v-~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~ 368 (532) ...+....|.+.+.....- ..........|.+. .+..++..+..+....+.......++.+.+.|.+.-.. +..... T Consensus 265 ~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~ 344 (499) T protein:vir:10 265 SDKEAFVDALLVTFGFGLGDDKDDIQRLKRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEK 344 (499) T ss_pred HHHHHhcCceeeeecCccccccchhhhhhhcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchh Confidence 8888999998777532221 11111112223332 22222223334444456677778888888877653211 111111 Q ss_pred CCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceee--cchHHHHHHHHHHHH Q lcl|NC_015159. 369 GGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIA--TGLEALGRGHDLNKL 446 (532) Q Consensus 369 ~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v--~~l~~l~raq~~~~l 446 (532) .+...|+..+..+..-.... ..-..+...+.+.-++.-++.++...|.- ..-..+++.+. .+.+.+..++-+.++ T Consensus 345 ~~gn~Sg~Al~~~~~~l~~k-~~~k~~~~~~~l~~~~~li~~~~~~~~~~--~d~~~i~i~f~~~~p~n~~e~~~~~~kl 421 (499) T protein:vir:10 345 FMGNVSGEAMKFKLFGLENL-LSIKQRYFFDGLRRRLKLIQTIVNIKGAN--DDASGCKISLVANIPSNLSDVVNNVKNA 421 (499) T ss_pred hcccchHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCc--cccccceEEeCCCCCCCHHHHHHHHHHH Confidence 12334666655433322222 12222322333333333333333223321 11122333332 222222222222222 Q ss_pred HHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccC-CHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhccc Q lcl|NC_015159. 447 NVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLIL-TQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQ 525 (532) Q Consensus 447 ~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~-s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (532) +. .+....+++. ++ ++. .++|++...++++......+. ............--.+ T Consensus 422 -------~g-------~iS~et~~~~----l~-----~v~d~~~E~~ri~~E~~~~~~~~~~--~~~~~~~~~~~~~~~~ 476 (499) T protein:vir:10 422 -------DG-------IIPRKYTYSW----LP-----DVDNPQDVIDEMNQQDAETIKKNQE--ALRGQDPDRLELEDKQ 476 (499) T ss_pred -------hc-------cCChHHHHHh----CC-----CCCCHHHHHHHHHHHHHHHHHHHHh--hhccCCCCCCCCCCCC Confidence 11 1223333332 22 122 234554443333222211111 1111110000011111 Q ss_pred ccCCCCC Q lcl|NC_015159. 526 QAGLPTQ 532 (532) Q Consensus 526 ~~g~~~~ 532 (532) ....+.. T Consensus 477 ~~~~~~~ 483 (499) T protein:vir:10 477 DDSSEND 483 (499) T ss_pred cccCCCC Confidence 1111111 No 119 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=97.09 E-value=0.00018 Score=41.15 Aligned_cols=433 Identities=11% Similarity=0.046 Sum_probs=168.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc-----ccCCCCCcccccccccccchHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPS-----VFPSATADGSTSYTTPWQSIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~ 75 (532) |...++ .+.+.+...++..++ ++.+.+.+|..=. ..+...........++..+-+...++.+++.|+ T Consensus 1 ~~~~t~----~~~~~~l~~~~~~~~----~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~ 72 (456) T protein:vir:10 1 MTASTP----AEWLPVLTKRIDDGM----SRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII 72 (456) T ss_pred CCCCCH----HHHHHHHHHHHHHHH----HHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhc Confidence 777662 344444444444433 3444444444322 111111111111334556666777777776654 Q ss_pred HhhcCCCCCccccCCC-hHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 76 LALFPVGSSFFKLNVS-ELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~-d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) +- ++. +... |.+. ...+++.+.++++.....++.++..+||.+.+++..++ T Consensus 73 ~~------~~~-~~~~~d~~~--------------------~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~- 124 (456) T protein:vir:10 73 PN------GIT-VGGSADSDL--------------------ALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD- 124 (456) T ss_pred cC------Cee-cCCCCCcch--------------------HHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCC- Confidence 22 222 2111 1110 01123445678888899999999999999988776543 Q ss_pred ccCCcceEEEEecceEEEeeCCC-C-CeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEE Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVVERDAY-D-NVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRS 232 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v~~d~~-G-~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s 232 (532) .+...+++++..+.++..|+. + ++...+|.++ ..+..+. . ...-.++..++.|..++...+. .... T Consensus 125 --~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~-~~d~~~~-----~---~~~~~~~~~~~~~~~~~~~~~~-~~~~ 192 (456) T protein:vir:10 125 --DGTATITADSPETMVVSVDPLQPWRIRAAMRWWR-DLDAESD-----F---AIVWSGDGWQKFARPCFVQSSS-RRRL 192 (456) T ss_pred --CCceEEEEEccceeEEEEcCCCCcceEEEEEEEE-ecCCcee-----E---EEEEeccceeEEEEEEEEeecc-ccee Confidence 344567788777765655543 3 3333443332 1110000 0 0000011111111111110010 0111 Q ss_pred EEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec------- Q lcl|NC_015159. 233 YQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVN------- 305 (532) Q Consensus 233 ~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~------- 305 (532) ..+..+.........+++..+|++.. .+..|.|-.+..++-+..++...-..+..++....|...+. T Consensus 193 ~~~~~~~~~~~~~~~~~~~~~pvv~~------~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~ 266 (456) T protein:vir:10 193 VTRISDSWVPVGDAVVTGSPPPVVVY------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLP 266 (456) T ss_pred eeecCCceeeccccCCCCCceeEEEe------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccc Confidence 11122222111122334455665443 23568898999999888888777666666666665543221 Q ss_pred ---Cccc-cChhhhccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHH Q lcl|NC_015159. 306 ---PNGV-TQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAF-MLNSAVQRGGDRVTAEEIRY 380 (532) Q Consensus 306 ---~~g~-~~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~~~TAtEi~~ 380 (532) .+|. .++.......+|.+.... .+....++. .++++.....++.+...|...= +-+.....+....|+.-|.. T Consensus 267 ~~d~~g~~~~~~~~~~~~~~~~~~~~-~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~ 344 (456) T protein:vir:10 267 NVDENGNAIDYASIFEAAPGALWELP-PGVDIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHN 344 (456) T ss_pred cccccccccchhhhhhhhccccccCC-CCcceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHH Confidence 1110 111122222333332221 222222322 2344443344444444332110 00000001222335543333 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCccccccceee--cchHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_015159. 381 VAGELEDTLGGVYSLLSQELQLPLVKILLKELQA-TSKIPNLPKEAVEPAIA--TGLEALGRGHDLNKLNVFIDYMIKLA 457 (532) Q Consensus 381 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~g~lp~~p~~~~~~~~v--~~l~~l~raq~~~~l~~~~~~laq~~ 457 (532) ...-+.+ ..++.+. .+.+-+.+.+.++.+ .|. .....+++.+- .+-+.+..+ +.+....+ +.+. T Consensus 345 ~~~~l~~----k~~~~~~-~f~~~l~~~~rl~~~~~g~---~~~~~~~v~w~~~~~~~~~~~a---da~~kl~~--~gi~ 411 (456) T protein:vir:10 345 IEKGFLF----KCEDRLS-IAKIGLEAILVKALQIEGE---SVEDTVDVSFESPDRVTLGEKY---SAASLAKA--AGES 411 (456) T ss_pred HHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHhcCC---CcccceeEEecCCCCcCHHHHH---HHHHHHHH--cCCC Confidence 2222221 1222222 333344555555443 221 12223333331 122222222 22221111 1111 Q ss_pred chhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHh Q lcl|NC_015159. 458 GLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAA 521 (532) Q Consensus 458 p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~ 521 (532) ..+.. ...+|+++..+- +.+.+|..+++.+ +++.++..+. ..+.. T Consensus 412 --------~~~~~---~~~lg~~~~~i~----~~e~er~~~e~~~---~~~~~~~~~~-~~~~~ 456 (456) T protein:vir:10 412 --------WASIR---RNILNYNADQIK----QDDLDRAREQITL---FAGNPVQRPQ-EDGSR 456 (456) T ss_pred --------hHHHH---HhhCCCCHHHHH----HHHHHHHHHHHHH---HhhhhhhcCC-CCCCC Confidence 11111 234677554221 1112222221111 1111111111 11111 No 120 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=97.09 E-value=0.00018 Score=41.15 Aligned_cols=433 Identities=11% Similarity=0.046 Sum_probs=168.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc-----ccCCCCCcccccccccccchHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPS-----VFPSATADGSTSYTTPWQSIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~ 75 (532) |...++ .+.+.+...++..++ ++.+.+.+|..=. ..+...........++..+-+...++.+++.|+ T Consensus 1 ~~~~t~----~~~~~~l~~~~~~~~----~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~ 72 (456) T protein:vir:10 1 MTASTP----AEWLPVLTKRIDDGM----SRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII 72 (456) T ss_pred CCCCCH----HHHHHHHHHHHHHHH----HHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhc Confidence 777662 344444444444433 3444444444322 111111111111334556666777777776654 Q ss_pred HhhcCCCCCccccCCC-hHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 76 LALFPVGSSFFKLNVS-ELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~-d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) +- ++. +... |.+. ...+++.+.++++.....++.++..+||.+.+++..++ T Consensus 73 ~~------~~~-~~~~~d~~~--------------------~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~- 124 (456) T protein:vir:10 73 PN------GIT-VGGSADSDL--------------------ALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD- 124 (456) T ss_pred cC------Cee-cCCCCCcch--------------------HHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCC- Confidence 22 222 2111 1110 01123445678888899999999999999988776543 Q ss_pred ccCCcceEEEEecceEEEeeCCC-C-CeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEE Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVVERDAY-D-NVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRS 232 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v~~d~~-G-~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s 232 (532) .+...+++++..+.++..|+. + ++...+|.++ ..+..+. . ...-.++..++.|..++...+. .... T Consensus 125 --~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~-~~d~~~~-----~---~~~~~~~~~~~~~~~~~~~~~~-~~~~ 192 (456) T protein:vir:10 125 --DGTATITADSPETMVVSVDPLQPWRIRAAMRWWR-DLDAESD-----F---AIVWSGDGWQKFARPCFVQSSS-RRRL 192 (456) T ss_pred --CCceEEEEEccceeEEEEcCCCCcceEEEEEEEE-ecCCcee-----E---EEEEeccceeEEEEEEEEeecc-ccee Confidence 344567788777765655543 3 3333443332 1110000 0 0000011111111111110010 0111 Q ss_pred EEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec------- Q lcl|NC_015159. 233 YQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVN------- 305 (532) Q Consensus 233 ~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~------- 305 (532) ..+..+.........+++..+|++.. .+..|.|-.+..++-+..++...-..+..++....|...+. T Consensus 193 ~~~~~~~~~~~~~~~~~~~~~pvv~~------~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~ 266 (456) T protein:vir:10 193 VTRISDSWVPVGDAVVTGSPPPVVVY------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLP 266 (456) T ss_pred eeecCCceeeccccCCCCCceeEEEe------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccc Confidence 11122222111122334455665443 23568898999999888888777666666666665543221 Q ss_pred ---Cccc-cChhhhccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHH Q lcl|NC_015159. 306 ---PNGV-TQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAF-MLNSAVQRGGDRVTAEEIRY 380 (532) Q Consensus 306 ---~~g~-~~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~~~TAtEi~~ 380 (532) .+|. .++.......+|.+.... .+....++. .++++.....++.+...|...= +-+.....+....|+.-|.. T Consensus 267 ~~d~~g~~~~~~~~~~~~~~~~~~~~-~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~ 344 (456) T protein:vir:10 267 NVDENGNAIDYASIFEAAPGALWELP-PGVDIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHN 344 (456) T ss_pred cccccccccchhhhhhhhccccccCC-CCcceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHH Confidence 1110 111122222333332221 222222322 2344443344444444332110 00000001222335543333 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCccccccceee--cchHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_015159. 381 VAGELEDTLGGVYSLLSQELQLPLVKILLKELQA-TSKIPNLPKEAVEPAIA--TGLEALGRGHDLNKLNVFIDYMIKLA 457 (532) Q Consensus 381 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~g~lp~~p~~~~~~~~v--~~l~~l~raq~~~~l~~~~~~laq~~ 457 (532) ...-+.+ ..++.+. .+.+-+.+.+.++.+ .|. .....+++.+- .+-+.+..+ +.+....+ +.+. T Consensus 345 ~~~~l~~----k~~~~~~-~f~~~l~~~~rl~~~~~g~---~~~~~~~v~w~~~~~~~~~~~a---da~~kl~~--~gi~ 411 (456) T protein:vir:10 345 IEKGFLF----KCEDRLS-IAKIGLEAILVKALQIEGE---SVEDTVDVSFESPDRVTLGEKY---SAASLAKA--AGES 411 (456) T ss_pred HHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHhcCC---CcccceeEEecCCCCcCHHHHH---HHHHHHHH--cCCC Confidence 2222221 1222222 333344555555443 221 12223333331 122222222 22221111 1111 Q ss_pred chhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHh Q lcl|NC_015159. 458 GLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAA 521 (532) Q Consensus 458 p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~ 521 (532) ..+.. ...+|+++..+- +.+.+|..+++.+ +++.++..+. ..+.. T Consensus 412 --------~~~~~---~~~lg~~~~~i~----~~e~er~~~e~~~---~~~~~~~~~~-~~~~~ 456 (456) T protein:vir:10 412 --------WASIR---RNILNYNADQIK----QDDLDRAREQITL---FAGNPVQRPQ-EDGSR 456 (456) T ss_pred --------hHHHH---HhhCCCCHHHHH----HHHHHHHHHHHHH---HhhhhhhcCC-CCCCC Confidence 11111 234677554221 1112222221111 1111111111 11111 No 121 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=96.99 E-value=0.00022 Score=40.61 Aligned_cols=443 Identities=9% Similarity=0.006 Sum_probs=185.4 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHH--hhhHHHHHHHHHHhhccc---c---cCCCCC--cccccccccccchHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKND--RGAYETRAEDCATYTIPS---V---FPSATA--DGSTSYTTPWQSIGARGLNNL 70 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~--R~~~e~~w~e~~~~~~P~---~---~~~~~~--~~~~~~~~~~dst~~~a~~~L 70 (532) |.=-. ..+...+.|=+-+.. .-.....+..++.=-.+. + ..+.+. ....+..++--+.+...++.+ T Consensus 1 ~~~~~----~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~~~ 76 (518) T protein:vir:78 1 MGVWS----VMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHDKLMNSGTGNEIVVVA 76 (518) T ss_pred Ccchh----hHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccccccccCChHHHHHHHH Confidence 33222 222333333211110 000111111111100000 0 001111 001112233334567777777 Q ss_pred HHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeec Q lcl|NC_015159. 71 ASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIP 150 (532) Q Consensus 71 aa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~ 150 (532) |+-|.+-... +.+...| ..+. ..++++| .+.|..++|+..+.+.+.+..+.|.+++-+. T Consensus 77 A~ll~~e~~~-----i~v~~~~--~~d~-------e~~~~~l-------~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~ 135 (518) T protein:vir:78 77 AEYISGKPLS-----IDVTGVN--GSKD-------ENLTKQL-------KEALRIDNFDSKSVKIVELAGGSGVSAVKIN 135 (518) T ss_pred HHhhcCCCce-----EEecCcc--ccCc-------HHHHHHH-------HHHHHhccHHHHHHHHHHHhhccCceEEEEE Confidence 7776554321 2322111 1110 1234444 4678889999999999999999999886322 Q ss_pred ccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeC----- Q lcl|NC_015159. 151 STEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDP----- 225 (532) Q Consensus 151 ~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~----- 225 (532) .+ ++.+.+..++...|+... .+|++..+.+-...... .+-.+|+.++++. T Consensus 136 ~d----~~~~~i~~v~ad~~~P~~-~~g~~~~~~f~~~~~~~--------------------~k~~~y~~lE~he~~~~~ 190 (518) T protein:vir:78 136 IL----NGRPSISVHSSSQFWIDF-KNNEPFRFNFFEEIPTS--------------------NKADIYYLVESREIKQWD 190 (518) T ss_pred EE----CCeeEEEEEcCCeeEEEe-ecCcEEEEEEEEEeecC--------------------CcceeEEEEEeecccccc Confidence 21 234678888888887754 35777666543332111 0111233332211 Q ss_pred ----CCCeEEEEE--------------------------EEcCcccccccccCccccCceEEEEeeec-----CCCcccc Q lcl|NC_015159. 226 ----EAMVFRSYQ--------------------------EIDGEIVAGTEGEYPLDSCPWIPVRLIKM-----PNEDYGR 270 (532) Q Consensus 226 ----~~~~~~s~~--------------------------~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~-----~g~~YG~ 270 (532) ...++.+.+ ..++...... .......|+++...+.. .+++||+ T Consensus 191 ~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~e~~~--~~tg~~~~~~~~~~n~~~N~~~~~splG~ 268 (518) T protein:vir:78 191 KEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDIQLNHS--VSIGLKSMGAYLINNSPSNTRYPHLNLGE 268 (518) T ss_pred ceeecccceeEEEEEeeecCcccccccccccccccccccccccCcccee--eccCCccceEEeeccccccccccCCCcCc Confidence 001111111 1111100000 01113467777655543 4677899 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCCC----------cee--ecCccc-cccc---c Q lcl|NC_015159. 271 SFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANT----------GDF--VAGRKQ-DVEV---F 334 (532) Q Consensus 271 Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~----------G~~--v~g~~~-~~~~---~ 334 (532) |-...+.+.++.||..--+...-.. ..++...|+++-+ .... ..++. -.+ +.+..+ +... + T Consensus 269 S~~~~~~~~id~lD~~~s~~~~e~~-~g~~~i~v~~~~l-~~~~-~~~~~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i 345 (518) T protein:vir:78 269 SDLSQCTNYLFAVDYFFTVYMREGE-KTKTKIAASERMF-RKKV-NKSTDKEEWSMNVDEDYFMQFKGTLDAGAKLNDMI 345 (518) T ss_pred chHhhhhHHHHHHHHHHHHHHHHHH-hCCceeeechhHh-ccCC-CCCCCccccccCCCCceEEEecCcCCCCCccccce Confidence 9999999999999998888877665 4888888865432 2111 01111 111 111111 0110 1 Q ss_pred c-cCCccchhHHHHHHHHHHHHHHHHH--hhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 335 Q-LEKYNDFQVAKATADDIEKRLSYAF--MLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKE 411 (532) Q Consensus 335 ~-~~~~~~~~~~~~~i~~~~~rI~~af--~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~i 411 (532) . +...-+...-...++.+-+.|.... =...+.. ++...|||||..+.+...+.+--.-..+. ..|.-|+.-++.+ T Consensus 346 ~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~-~~~~~TATei~s~~~~~~~t~~~~~~~~e-~al~~l~~~i~~l 423 (518) T protein:vir:78 346 QFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNL-GNREVKATEIWSLQDATVRKIEKKKRLIQ-NVYEQMLWDFLYL 423 (518) T ss_pred eeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCc-ccccccHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Confidence 1 1111111222233334433333221 1112222 34457999999998887666544332222 2333444444444 Q ss_pred HHhcCC--CCCCccccccce--eecc--hHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHcc Q lcl|NC_015159. 412 LQATSK--IPNLPKEAVEPA--IATG--LEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLI 485 (532) Q Consensus 412 l~r~g~--lp~~p~~~~~~~--~v~~--l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~ 485 (532) +..... ....+.+..++. +--+ .+...+++....+.+ +. .+....+++.+ ..|+ T Consensus 424 ~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~-----aG-------imS~e~~i~~~--~~~~------ 483 (518) T protein:vir:78 424 LTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNS-----AL-------AMSVEEKVKLI--HPKW------ 483 (518) T ss_pred HHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHh-----cC-------CCCHHHHHHHh--CCCC------ Confidence 332211 111222333333 3111 222333222211111 11 12234444432 1122 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccC Q lcl|NC_015159. 486 LTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAG 528 (532) Q Consensus 486 ~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 528 (532) +++|.+++.++.+.++.+. .++.+ ... ..+.+.+| T Consensus 484 -~deea~~e~~ri~~E~~~~----~~~~p--~~~-~g~~~~~g 518 (518) T protein:vir:78 484 -EDEEIQAEVKRIYLENAIG----EVPDP--EAI-GGMETKGG 518 (518) T ss_pred -CHHHHHHHHHHHHHHhccc----CCCCC--ccc-cCCCCCCC Confidence 5566665554443332211 01111 111 22444445 No 122 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=96.78 E-value=0.00034 Score=39.58 Aligned_cols=455 Identities=10% Similarity=0.014 Sum_probs=206.1 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc--cc-CC-----C-CC-cc---------cccccc--cc Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPS--VF-PS-----A-TA-DG---------STSYTT--PW 59 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~--~~-~~-----~-~~-~~---------~~~~~~--~~ 59 (532) |+.-.+..--++.+.+- .....+.... .....|--.. +. .. . .+ +. ..+... .- T Consensus 1 ~~r~~~~~~~~dr~i~~--~~~~~~~~~~---~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rN 75 (505) T protein:vir:96 1 MKRAEKKPSLAQRMVNW--AWYRYVEPQK---NAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSIN 75 (505) T ss_pred CCCCccccchhhcccch--hhhhhHHHHH---HhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhc Confidence 87773322222222220 0011111111 1111121111 11 00 0 00 00 011112 24 Q ss_pred cchHHHHHHHHHHHHHH--hhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHH Q lcl|NC_015159. 60 QSIGARGLNNLASKLML--ALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIK 137 (532) Q Consensus 60 dst~~~a~~~Laa~l~~--~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~ 137 (532) ++.+..+++.+++.+++ +++|..++..+....|+++.+.. ...-+.|.+. .-++.=.+.+||.....++. T Consensus 76 n~~a~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~i-----e~~w~~Wa~~---~~~D~~g~~~f~~lq~l~~r 147 (505) T protein:vir:96 76 NPYAKRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLI-----EGNWQQWIKK---GNCDVTGRYHFVTLLHLWME 147 (505) T ss_pred ChHHHHHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHH-----HHHHHHhcCC---cCcceeccCCHHHHHHHHHH Confidence 77889999999999995 89998888776655555544321 1122333321 11234446679999999999 Q ss_pred HHHhhCceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEE Q lcl|NC_015159. 138 QLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTI 217 (532) Q Consensus 138 dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i 217 (532) ..++-|=+++-... +..+.+-. +-+-+..+.|+.. .+....+ .-.| T Consensus 148 ~~~~dGE~f~~~~~-----------------------~~~~~~~~--~lqliepd~l~~~--------~n~~~~~-~~~i 193 (505) T protein:vir:96 148 TLARDGEVLVREHR-----------------------GYPNKWGY--ALQILECDRLDLN--------YNADLQN-GNRI 193 (505) T ss_pred HHhhCCceEEEEee-----------------------cCCCCcce--EEEEechhhcCCC--------CCcccCC-cCeE Confidence 98888876432110 11111111 1122223322211 0011111 1237 Q ss_pred EEEEEeeCCCCeEEEEEEEcCcccccc---cc-cCccccCce--EEEEee-ecCCCccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 218 YTHVYRDPEAMVFRSYQEIDGEIVAGT---EG-EYPLDSCPW--IPVRLI-KMPNEDYGRSFVEEYLGDLKSLENLYEAI 290 (532) Q Consensus 218 ~~~v~~~~~~~~~~s~~~~~~~~~~~~---~~-~~g~~~~P~--~~~Rw~-~~~g~~YG~Gp~~~al~d~~~L~~l~~~~ 290 (532) +..|+.|..+.|...+.. ........ .. ...+...|. +.+-|. ..+|..-|.+...-+|..++.|.....+. T Consensus 194 ~~GIe~d~~Gr~~aY~i~-~~hPgd~~~~~~~~~~~~~rvpa~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dae 272 (505) T protein:vir:96 194 RMSIELDAWERPVAYHLL-VNHPGDNSYCYHYAGQTYERVPADEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSE 272 (505) T ss_pred EeceEECCCCceEEEEEe-ecCCCccccccccccccccccCHhHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHH Confidence 788999998887654433 22211110 00 111233452 333344 45888999999999999999999999999 Q ss_pred HHHHHHHhcCceeecCc-c-ccCh------hhhccCCCceeecCcccc-ccccccC-CccchhHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 291 VKMSMISSKVLFFVNPN-G-VTQI------RRVAKANTGDFVAGRKQD-VEVFQLE-KYNDFQVAKATADDIEKRLSYAF 360 (532) Q Consensus 291 l~~~~~a~~p~~lv~~~-g-~~~~------~~~~~~~~G~~v~g~~~~-~~~~~~~-~~~~~~~~~~~i~~~~~rI~~af 360 (532) +.++..++.....+..+ + ...+ ......++|.|..-.++. +...... ..++|. .....+...|-.++ T Consensus 273 l~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~---~f~~~~lr~iaagl 349 (505) T protein:vir:96 273 MIAAELGAKKVGFYEQDPEAYDQPPEDDQGEIVEEVEAGTYQLLPYGIRFKEHKIDHPHTNFG---AFVKSSLRGVAAGM 349 (505) T ss_pred HHHHHHhhhheeeeecCCccCCCccccccCccccccCCceeeecCCCCeeeeeCCCCCCCCHH---HHHHHHHHHHHhhc Confidence 99999999988666532 1 1111 112223455544323322 3322222 122332 23333333344442 Q ss_pred --hhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc---cccceeec--- Q lcl|NC_015159. 361 --MLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKE---AVEPAIAT--- 432 (532) Q Consensus 361 --~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~---~~~~~~v~--- 432 (532) -+..+ ..|-..++=.-+++-..|..+.+--.=..+..-|+.|+..+++..+...|.||-+... ......+. T Consensus 350 gi~ye~l-t~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~ 428 (505) T protein:vir:96 350 GPAYNRL-AHDLEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGW 428 (505) T ss_pred CCCHHHH-hcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCc Confidence 11222 2354455555555555666665555555666779999999999999999999754322 12222221 Q ss_pred -chHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015159. 433 -GLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQM 511 (532) Q Consensus 433 -~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~ 511 (532) .|+|+-.++..... +. +. +. -...++...|.++..++ .++.++.+....... .. T Consensus 429 ~~iDP~Ke~~a~~~~---i~--~G----------~~-t~~~~~a~~G~D~~~v~-------~q~a~e~~~~~~~Gl--~~ 483 (505) T protein:vir:96 429 DWVDPAKDSKAHSES---IK--NR----------TR-SRSSIIRAAGDDPEDVF-------DEIAWEEQLMRDKGV--NP 483 (505) T ss_pred cccChHHHHHHHHHH---HH--cC----------CC-CHHHHHHHcCCCHHHHH-------HHHHHHHHHHHHcCC--CC Confidence 23343322211000 00 00 00 01122233466664332 222222111111111 00 Q ss_pred hHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 512 GAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 512 ~~~~~~~~~~~~~~~~g~~~~ 532 (532) ..+..........++...+.. T Consensus 484 ~~~~~~~~~~~~~~~~~~~~d 504 (505) T protein:vir:96 484 TPPEQESKDATTDEEDDSASD 504 (505) T ss_pred CCCCCCCCCCCCCCCCCCCCC Confidence 011111111111111111111 No 123 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=96.30 E-value=0.00077 Score=37.65 Aligned_cols=429 Identities=14% Similarity=0.117 Sum_probs=183.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccc---cCCC------CCcc----cccccc-cccchHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV---FPSA------TADG----STSYTT-PWQSIGARG 66 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~---~~~~------~~~~----~~~~~~-~~dst~~~a 66 (532) -++-++..++.+. ..|. ...++|+-+.+.+-=.+ .+.. ...+ ..++.+ .|-+.-.+. T Consensus 3 ~~~~~~~~V~~~h--p~y~-------a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~t 73 (489) T protein:vir:78 3 TENGQGSGVKTKH--REWL-------HYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRT 73 (489) T ss_pred cCCCccCCCCccC--HHHH-------HHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCChHHHH Confidence 2333554444332 2232 23355655555433321 0000 0000 111111 222322333 Q ss_pred HHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCcee Q lcl|NC_015159. 67 LNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVL 146 (532) Q Consensus 67 ~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~ 146 (532) + ..|++.+|- ..|++. ++ ..++.+++.| -+...+++.-+..++.+...+|-+. T Consensus 74 l----~~l~G~vfr-k~p~~~--~p--------------~~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~ 126 (489) T protein:vir:78 74 L----SGMVGSVMR-KEPEIN--IP--------------KELEYLLKNA------DGSGVGLIQHAQDTLMEIDSVGRGG 126 (489) T ss_pred H----HHHhchhhc-CCccee--cc--------------HHHHHHHhcc------CCCCCCHHHHHHHHHHHHHhcCeEE Confidence 3 444444443 445553 22 1245455544 4567778888889999999999999 Q ss_pred eeeccccccc---------CCcceEEEEecceE---EEe-eCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcc Q lcl|NC_015159. 147 LYIPSTEQVE---------GQSNAPKLYKLHNF---VVE-RDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSE 213 (532) Q Consensus 147 ~~v~~~~~~~---------~~~~~~~~~pl~~~---~v~-~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~ 213 (532) ++||-..... +..-.+..|+..+. -.. .|..+++.-+..+++...++=+..| ..+ T Consensus 127 ilVD~P~~~~~T~ade~~~~~rPy~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f------------~~~ 194 (489) T protein:vir:78 127 LLVDAPETGAATAAEQNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEF------------ETK 194 (489) T ss_pred EEEeeCCCCCcCHHHHHHhcCCcEEEEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCc------------cce Confidence 9998543210 11234666665554 222 2444456656666654332211111 235 Q ss_pred eEEEEEEEEeeCCCCeEEE--EEEE-cCccccc---ccccCccccCceEEEEeeecCCCcccc--chHHHHHHHHHHHHH Q lcl|NC_015159. 214 EVTIYTHVYRDPEAMVFRS--YQEI-DGEIVAG---TEGEYPLDSCPWIPVRLIKMPNEDYGR--SFVEEYLGDLKSLEN 285 (532) Q Consensus 214 ~v~i~~~v~~~~~~~~~~s--~~~~-~~~~~~~---~~~~~g~~~~P~~~~Rw~~~~g~~YG~--Gp~~~al~d~~~L~~ 285 (532) .++.|....++.++. |+. +... +|..... ..-..|-+.+++|++.|.-..+..+.. .|.. |+..||. T Consensus 195 ~~~q~RvL~~~~~g~-~~~~~~r~~~~g~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl----~LA~lni 269 (489) T protein:vir:78 195 YGEQYRVLDIDSDGN-YRQRLFRFDAEGGAQEDVVEIYPDLGESLRGVIPFTFIGATNNDATIDDAPLL----PLAELNI 269 (489) T ss_pred eEEEEEEEecCCCcc-eEEEEEEeecCCcccceeeEEeccCCCCccCeeeEEEEecCCCCCCCCcCchH----HHHHHHH Confidence 566666666665553 332 2222 1211100 000123346778888887666665544 3433 4444443 Q ss_pred ---HHHHHHHHHH-HHhcCceeecC-ccccChhhhccCCCceeecCcc--------ccccccccCCccchhHHHHHHHHH Q lcl|NC_015159. 286 ---LYEAIVKMSM-ISSKVLFFVNP-NGVTQIRRVAKANTGDFVAGRK--------QDVEVFQLEKYNDFQVAKATADDI 352 (532) Q Consensus 286 ---l~~~~l~~~~-~a~~p~~lv~~-~g~~~~~~~~~~~~G~~v~g~~--------~~~~~~~~~~~~~~~~~~~~i~~~ 352 (532) -+.+-.+.+- .+.-|.+.+.. +.. +...+..+.+..++-|.. ++...++.. + ...+.+.+.++ T Consensus 270 ~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~-~~~~~~~~~~~~i~~g~~~~~~lp~~~~~~~ie~~-~--~~~~r~~l~~l 345 (489) T protein:vir:78 270 GHYRNSADNEESSFVVGQPTLFIYPGENL-TPQAFKEANPNGIKFGSRRGHNLGYGGSAQLIQAG-E--NNLARQNMLDK 345 (489) T ss_pred HHhhhhhHHHHHHHHcccceeeeecCccC-CcccccccCccceeeCCcccccCCCCCCcceeccC-c--chHHHHHHHHH Confidence 2333344333 44444433321 111 111121122222222221 222222222 1 22346667777 Q ss_pred HHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc-c--ccce Q lcl|NC_015159. 353 EKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKE-A--VEPA 429 (532) Q Consensus 353 ~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~-~--~~~~ 429 (532) +.+..++ .-.+...+ .+.||++.+.+....-..|+.+...+++- +.-++..+-..+ |. + .+.+ . ++.+ T Consensus 346 e~qm~~l--Ga~l~~~~-~~~Ta~~~~~~~~~~~S~L~~~a~~~e~a-l~~~l~~~a~w~---G~-~-~~~~~~i~~n~d 416 (489) T protein:vir:78 346 EQQAIQI--GAQLITPT-QQITAQSARIQRGADTSVMATIARNVSQA-YTDALRWVAVML---GK-P-EDTEVEFRLNMD 416 (489) T ss_pred HHHHHHH--hhhhccCC-cchhHHHHHHHHHHhhHHHHHHHHHHHHH-HHHHHHHHHHHc---CC-C-CCCceEEEeecc Confidence 7766542 11122223 36899999999999999999988887763 333333333332 32 1 1111 1 2223 Q ss_pred eecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 430 IATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQ 509 (532) Q Consensus 430 ~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~ 509 (532) +. +..+ -++.++.++...+ + ..|....+.+++ ...||.. .+.++++.+.+.+. T Consensus 417 F~--~~~~-d~~~~~al~~~~~--~-------G~is~~t~~~~L-~~~gv~d----~~~e~~~~ei~~~~---------- 469 (489) T protein:vir:78 417 FF--LEPM-TAQDRAAWMADIN--A-------GLLPATAYYAAL-RKAGVTD----WTDADIKDAVADQP---------- 469 (489) T ss_pred cC--cccC-CHHHHHHHHHHHh--c-------CCCCHHHHHHHH-HhCCCCC----ccHHHHHHHHhhcC---------- Confidence 31 1112 1223333332211 1 123344444444 3345521 23333332222110 Q ss_pred hhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 510 QMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 510 ~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .+ ...+-+|..++ T Consensus 470 ---~~-------~~~~~~g~~~~ 482 (489) T protein:vir:78 470 ---LP-------VATEVQGEIPQ 482 (489) T ss_pred ---CC-------cccCCcccCCC Confidence 00 00111111111 No 124 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=95.98 E-value=0.0012 Score=36.65 Aligned_cols=428 Identities=9% Similarity=0.026 Sum_probs=186.3 Q ss_pred CCCCCCCc-----------------cCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcc--cccCCCCC----cccccccc Q lcl|NC_015159. 1 MAEVEKTG-----------------FAADGAAAAYNRLKNDRGAYETRAEDCATYTIP--SVFPSATA----DGSTSYTT 57 (532) Q Consensus 1 m~~~~~~~-----------------~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P--~~~~~~~~----~~~~~~~~ 57 (532) |+++..+. ...+.+.+..+.... |......|+++|+=.-+ .+-..... ...+...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~k 79 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITKHKE-NVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWR 79 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccccccccccccccccc Confidence 66652211 123444444444443 44455555666543211 11111110 11112335 Q ss_pred cccchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHH Q lcl|NC_015159. 58 PWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIK 137 (532) Q Consensus 58 ~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~ 137 (532) +..+-+...++..++.|++ -|+. ++.+|.. +.+.|. ..+ ..||...+.++.+ T Consensus 80 i~~n~~~~Iv~~~~~~l~g--~p~~-----~~~~d~~-------------~~~~l~-------~~~-~n~~~~~~~~~~~ 131 (468) T protein:vir:96 80 MYTNYHQNLVDQKVAYAVA--NPVT-----YGTEDEK-------------SLKTIQ-------EVL-NHKWDDKLVDILT 131 (468) T ss_pred cccchHHHHHHHHHhhhcc--CCce-----eccCChH-------------HHHHHH-------HHH-hcCHHHHHHHHHH Confidence 6666777777777766543 2222 2333322 222222 223 3588888999999 Q ss_pred HHHhhCceeeeecccccccCCcceEEEEecceEEEee--CCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceE Q lcl|NC_015159. 138 QLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVER--DAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEV 215 (532) Q Consensus 138 dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~--d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v 215 (532) +..++|.+++++..++ ++.+++.+++..+.+... +..|++...+|.+...- ...+ T Consensus 132 ~~~~~G~~~~~v~~d~---~~~~~i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~~~--------------------~~~~ 188 (468) T protein:vir:96 132 AASNKGVEWIQPYVDE---QGEFKTFRVPAEQAIPIWTNKERDELKAFIRLYELDG--------------------GERV 188 (468) T ss_pred HHhhcCeEEEEEEEcC---CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEecC--------------------ceEE Confidence 9999999987665442 345677777776644433 33577776666554211 1122 Q ss_pred EEEEE----EEeeCCCCeEEEEE--EEcCc--ccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_015159. 216 TIYTH----VYRDPEAMVFRSYQ--EIDGE--IVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLY 287 (532) Q Consensus 216 ~i~~~----v~~~~~~~~~~s~~--~~~~~--~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~ 287 (532) ++|+. .+...++. +.... ...+. ........+++..+|++.++ ++.+|.|=.+...+-+..++.+- T Consensus 189 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~ 262 (468) T protein:vir:96 189 EYWTANDVTFYELKDGQ-LIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPFK-----NNPQEVSDLFMYKTIIDAMDKRL 262 (468) T ss_pred EEEeCCeEEEEEEcCCc-eeecccccccccccceeeccccccCCcccEEEec-----CCCCCCCchHHHHHHHHHHHHHH Confidence 22221 01111111 11110 00000 00111224567788888664 35679998899999999999988 Q ss_pred HHHHHHHHHHhcCceeecCccccChhhhc-cCC-Ccee-ecCcc-ccccccccCCccchhHHHHHHHHHHHHHHHHHh-h Q lcl|NC_015159. 288 EAIVKMSMISSKVLFFVNPNGVTQIRRVA-KAN-TGDF-VAGRK-QDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM-L 362 (532) Q Consensus 288 ~~~l~~~~~a~~p~~lv~~~g~~~~~~~~-~~~-~G~~-v~g~~-~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~ 362 (532) -......+....|.+++..-..-....+. ... .+.+ +++.. +++.. +....+.+.....++.++..|.+.-. . T Consensus 263 S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~d~~~~~~~--l~~~~~~~~~~~~~~~l~~~I~~~s~~p 340 (468) T protein:vir:96 263 SDTQNTFDEATELIYVLKGYEGEDLEEFMYNLKYYKAINVDGDGSGGVDT--IQIDVPVQSAKEYLDMLRDYVIEFGQGV 340 (468) T ss_pred HHHHHHHHHhcCceeeeecCCccccchhhhhhhcCceEEecCCCCCcceE--EeecCChHHHHHHHHHHHHHHHHHhCcc Confidence 88888888889988776532111111111 111 2223 22322 22332 23333556666777777777655422 1 Q ss_pred hhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCccccccceeecchHHHHHHH Q lcl|NC_015159. 363 NSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQAT-SKIPNLPKEAVEPAIATGLEALGRGH 441 (532) Q Consensus 363 ~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~-g~lp~~p~~~~~~~~v~~l~~l~raq 441 (532) +......+...|+..+..+..-..... -...+ .+...+.+++.++.+. |. . .....+.+.+.-. -|-.-+. T Consensus 341 ~~~~~~~~~n~Sg~Alk~~~~~l~~k~-~~k~~----~~~~~l~~~~~li~~~~g~-~-~d~~~i~i~f~~~-~p~d~~e 412 (468) T protein:vir:96 341 DFQQDKFGNSPSGIALKFMYSNLDLKA-NKLKN----KTLTALQELLQYIIDFYKL-S-IKVQDVEITFNFN-VMVNELE 412 (468) T ss_pred cccccccccchHHHHHHHHHHHHHHHH-HHHHH----HHHHHHHHHHHHHHHHhCC-C-cccceeeEEecCC-CCcCHHH Confidence 111112223456665543322222111 11222 2233334444443332 21 1 1112233333111 1111112 Q ss_pred HHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHH Q lcl|NC_015159. 442 DLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAA 520 (532) Q Consensus 442 ~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~ 520 (532) .++.+ .+. ..+....+++. ++ ++.+ ++|++...+++++... .. . T Consensus 413 ~a~~~-------~~~-----g~iS~et~i~~----l~-----~v~D~~~E~~ri~~E~~~~~~---~~-----------~ 457 (468) T protein:vir:96 413 QSQIG-------VNS-----QYLSKETVVTN----HP-----WVDDPVAEMERIDQEELALPS---IE-----------E 457 (468) T ss_pred HHHHH-------Hhc-----CCCchHHHHHh----CC-----CCCCHHHHHHHHHHHHHHHHH---Hh-----------h Confidence 22211 111 12233333322 22 2222 2444333332211111 00 0 Q ss_pred hhcccccCCCC Q lcl|NC_015159. 521 AMMQQQAGLPT 531 (532) Q Consensus 521 ~~~~~~~g~~~ 531 (532) .+.....--|| T Consensus 458 ~~~~~~~~~~~ 468 (468) T protein:vir:96 458 GLNGKENNEPT 468 (468) T ss_pred ccCCCCCCCCC Confidence 12222333344 No 125 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=95.97 E-value=0.0012 Score=36.62 Aligned_cols=426 Identities=9% Similarity=0.037 Sum_probs=184.6 Q ss_pred CCCCCCCcc-----------------CHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcc-----cccCCCCCc----cccc Q lcl|NC_015159. 1 MAEVEKTGF-----------------AADGAAAAYNRLKNDRGAYETRAEDCATYTIP-----SVFPSATAD----GSTS 54 (532) Q Consensus 1 m~~~~~~~~-----------------~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P-----~~~~~~~~~----~~~~ 54 (532) |+.+..+.- ..+-+.+..+..+. |- .+.+.+.+|..- .+-...... ..+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~~---~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKP-KI---DDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKP 76 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHH-HH---HHHHHHHHHhccCCcchhccchhccccccccccc Confidence 666643321 12233334444332 22 233344444322 211110000 0112 Q ss_pred ccccccchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHH Q lcl|NC_015159. 55 YTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHA 134 (532) Q Consensus 55 ~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~ 134 (532) ..++..+-+...++..++.|++ -|+ +++.+|.... ..++.|+ ..||.....+ T Consensus 77 ~~ki~~n~~~~Ivd~~~~~l~g--~p~-----~~~~~d~~~~---------~~l~~~~------------~n~~~~~~~~ 128 (474) T protein:vir:96 77 DWRMFTNYHQNLVDQKVAYAVA--NPV-----TFSSDDDKSL---------KTIQEVL------------NHKWDDKLVD 128 (474) T ss_pred chhcccchHHHHHHhhhhhhcc--cCc-----eeecCchHHH---------HHHHHHH------------hcCHHHHHHH Confidence 3356666677777777766644 221 2233332211 1222222 3578888899 Q ss_pred HHHHHHhhCceeeeecccccccCCcceEEEEecceEEEeeC--CCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCc Q lcl|NC_015159. 135 AIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERD--AYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPS 212 (532) Q Consensus 135 ~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d--~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~ 212 (532) +.++..++|.+.+++..++ .+.+++.+++..++++..| ..+++...+|.++.. .. T Consensus 129 ~~~~~~~~G~~~~~~y~d~---~~~~~i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~--------------------~~ 185 (474) T protein:vir:96 129 ILTAASNKGIEWLQPYIDE---NGEFKTFRVPAEQAIPIWTNKERDTLKAFIRYYRLD--------------------GA 185 (474) T ss_pred HHHHHHhcCeeEEEEEecC---CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeec--------------------Cc Confidence 9999999999887665443 3456788888877665554 367777666665421 11 Q ss_pred ceEEEEEE--E--EeeCCCCeEE-EEEEEcCccc--ccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHH Q lcl|NC_015159. 213 EEVTIYTH--V--YRDPEAMVFR-SYQEIDGEIV--AGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLEN 285 (532) Q Consensus 213 ~~v~i~~~--v--~~~~~~~~~~-s~~~~~~~~~--~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~ 285 (532) ..+++|+. | +...++.... ..+...+... ......+++..+|++.++. +.+|+|=.+...+-+..++. T Consensus 186 ~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd~e~v~~liDa~d~ 260 (474) T protein:vir:96 186 ERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVPFIPFKN-----NPQEMSDLFMYKTIIDAMDK 260 (474) T ss_pred eEEEEEeCCeEEEEEecCCceeeccccccccccccccccccccCCCceeEEEecc-----CCCCCCcHHHHHHHHHHHHH Confidence 22333321 1 1111111010 0111111000 0112345778899988775 45799999999999999999 Q ss_pred HHHHHHHHHHHHhcCceeecCccccChhh-hccCCC-cee-ecCccccccccccCCccchhHHHHHHHHHHHHHHHHHh- Q lcl|NC_015159. 286 LYEAIVKMSMISSKVLFFVNPNGVTQIRR-VAKANT-GDF-VAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM- 361 (532) Q Consensus 286 l~~~~l~~~~~a~~p~~lv~~~g~~~~~~-~~~~~~-G~~-v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~- 361 (532) +.-......+....|.+.+...+.-.... ...... +.+ +++..+++..+ ....+.+.....++.+++.|-+.-. T Consensus 261 ~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~i~~~s~~ 338 (474) T protein:vir:96 261 RLSDTQNTFDESTELIYILKGYEGQDLDEFMRNLKYYKAINVDGDGSGVDTI--QIEVPVQSSKEYLDMLRDYVIEFGQG 338 (474) T ss_pred HHHHHHHHHHHhccceeeeecCCcccccchhhhhhcCceEEecCCCCceeEE--eecCChHHHHHHHHHHHHHHHHHhCC Confidence 88888888889898876654321111111 111112 222 23444444433 3334566667777777776654321 Q ss_pred hhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceee--cchHHHHH Q lcl|NC_015159. 362 LNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIA--TGLEALGR 439 (532) Q Consensus 362 ~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v--~~l~~l~r 439 (532) .+......+...|+.-+..+..- +.+-.....+...+.+.-++..++.++ |. . .....+.+.+. .+.+.+.. T Consensus 339 p~~~~~~~~~n~Sg~Al~~~~~~-l~~k~~~k~~~~~~~l~~~~~~i~~~~---~~-~-~~~~~i~i~f~~~~p~~~~e~ 412 (474) T protein:vir:96 339 VDFQQDKFGNSPSGIALKFMYSN-LDLKANKLKNKTLTALQELLQYIIDFY---KL-N-IKVQDVEITFNFNVMVNELEQ 412 (474) T ss_pred ccccccccccccHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHh---CC-C-cccceeeEEeccCCCcCHHHH Confidence 11111111223344443332111 111222233333333333333333332 21 1 11122333331 22222211 Q ss_pred HHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC-HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHH Q lcl|NC_015159. 440 GHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT-QQDKQAKMAEASTAAGMVTAGQQMGAAGGQA 518 (532) Q Consensus 440 aq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s-~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~ 518 (532) + + .+.+. + .+....++.. ++ ++.+ ++|++...++++... +. .+ T Consensus 413 ~---~-------~~~~a-g----~iS~et~~~~----~~-----~v~d~~~E~~ri~~E~~e~~--~~----~~------ 456 (474) T protein:vir:96 413 S---Q-------IGVQS-Q----YLSKETVVTN----HP-----WVDDPVAELERIEQDNIDFN--KQ----LP------ 456 (474) T ss_pred H---H-------HHHhc-C----CCchHHHHHh----CC-----CCCCHHHHHHHHHHHHHHHH--hc----cc------ Confidence 1 1 11111 1 2333333332 22 2222 233333322221111 11 11 Q ss_pred HHhhcccccCCCCC Q lcl|NC_015159. 519 AAAMMQQQAGLPTQ 532 (532) Q Consensus 519 ~~~~~~~~~g~~~~ 532 (532) .+..+..|..+. T Consensus 457 --~~~~~~~~~~~d 468 (474) T protein:vir:96 457 --PLEGDANGRAQD 468 (474) T ss_pred --ccccccccccCC Confidence 111111222222 No 126 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=95.75 E-value=0.0015 Score=36.04 Aligned_cols=433 Identities=10% Similarity=0.049 Sum_probs=187.8 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccccccc-ccccchHHHHHHHHHHHHHHhhc Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYT-TPWQSIGARGLNNLASKLMLALF 79 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~-~~~dst~~~a~~~Laa~l~~~lt 79 (532) |- .++.-.+-.....+|+.+++--.. ...+++...--||.....+...-..++. -.|-+.-.+.++.+++ .+| T Consensus 1 m~-V~~~hp~y~a~~~~W~~~rd~~~G-~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G----~vf 74 (452) T protein:vir:94 1 MP-IETKHPEYLAYENDWIDCRVASLG-QREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSG----MVL 74 (452) T ss_pred CC-CCCcCHHHHHHHHHHHHHHHHhcC-hHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhc----hhh Confidence 76 433333455566666555553322 2455555554566432221111122332 2444555555555544 443 Q ss_pred CCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCc Q lcl|NC_015159. 80 PVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQS 159 (532) Q Consensus 80 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~ 159 (532) . ..| .++.++ .+..+. .-....+.+.-+...+.+...+|-+.++||-+. .+.. T Consensus 75 ~-k~p--~~~~p~--------------~l~~~~--------~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~--~g~r 127 (452) T protein:vir:94 75 D-QPP--VITHPD--------------AMSKYF--------EDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPL--TGGD 127 (452) T ss_pred c-CCc--eecccH--------------HHHHHH--------hcccCCCHHHHHHHHHHHHHhcCeEEEEEeecc--CCCc Confidence 3 112 122221 122221 125677888888999999999999999998542 2333 Q ss_pred ceEEEEecceE-EEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEE-- Q lcl|NC_015159. 160 NAPKLYKLHNF-VVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEI-- 236 (532) Q Consensus 160 ~~~~~~pl~~~-~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~-- 236 (532) ..+..|+..+. =++.|..|+..-+..++....++-+++|.. +.++.|......+.+ |....|. T Consensus 128 Py~~~~~~~~Ii~W~~~~~g~l~~v~lre~~~~~d~~d~f~~------------~~~~~yRvL~l~~g~--~~v~~~~~~ 193 (452) T protein:vir:94 128 PYISVYTTENILNWEEDEDGRLLMVVLREFYTVRDTADRYVQ------------NIRVRYRCLELVDGL--LQITVHETQ 193 (452) T ss_pred eEEEEechhhhcCccccccCCeeEEEEEEEEEEecCCCcccc------------eeEEEEEEEEEeCCe--EEEEEEEcc Confidence 45666665443 234466676655555555444333333332 233333333322221 3332211 Q ss_pred cCccc---ccccccCccccCceEEEEeeecCCCcc--ccchHHHHHHHHHHHHH----HHHHHHHHHHHHhcCceeecCc Q lcl|NC_015159. 237 DGEIV---AGTEGEYPLDSCPWIPVRLIKMPNEDY--GRSFVEEYLGDLKSLEN----LYEAIVKMSMISSKVLFFVNPN 307 (532) Q Consensus 237 ~~~~~---~~~~~~~g~~~~P~~~~Rw~~~~g~~Y--G~Gp~~~al~d~~~L~~----l~~~~l~~~~~a~~p~~lv~~~ 307 (532) ++... ....-..+-+.+++|++.|-...+... |..|.. |+..||. .+-..-..+..+..|.+.+. T Consensus 194 ~~~~~~~~~~~~~~~~~~~l~~IP~v~~~~~~~~~~~~~pPLl----~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~-- 267 (452) T protein:vir:94 194 DGKVWELAKTSTIQNVGVTMDYIPFFCITPSGLSMTPAKPPMI----DIVDINYSHYRTSADLEHGRHFTGLPTPWIT-- 267 (452) T ss_pred CCceeeeccceeecCCCcccceeEEEEEcCCCCCCCCCccchH----HHHHHHHHHhcchhHHHHHHHHcccceeEee-- Confidence 11100 000001122345666666665554433 444533 4444442 22223334455555554443 Q ss_pred cccChhhhccCCCceeec-Cc-cccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHH-HHHHHH Q lcl|NC_015159. 308 GVTQIRRVAKANTGDFVA-GR-KQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEI-RYVAGE 384 (532) Q Consensus 308 g~~~~~~~~~~~~G~~v~-g~-~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi-~~r~~E 384 (532) |..+-..+.- +++.++. .. .++...++ .+++.+....+.|+++++.++++ ...+......+.|++|- ..+... T Consensus 268 g~~~~~~i~i-G~~~~~~lpe~~~~~~yie-~~g~~i~~~~~~l~~le~~m~~~--Ga~ll~~~~~~~~s~ea~~~~~~~ 343 (452) T protein:vir:94 268 GAESQSTMHI-GSTKAWVIPEVAAKVGFLE-FTGQGLQSLEKALSEKQAQLASL--SARLIDNSTRGSEATETVKLRYMS 343 (452) T ss_pred cCcCCCceEe-cccccccCCCCCCcceEEc-cCchhHHHHHHHHHHHHHHHHHH--HHHhhccCCCcchHHHHHHHHHHH Confidence 2223233332 3333322 22 22344444 34566888888999999887663 11223333333455554 445555 Q ss_pred HHHHhhhhHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCccccccceee-cch-HHHHHHHHHHHHHHHHHHHHhhcchhh Q lcl|NC_015159. 385 LEDTLGGVYSLLSQELQLPLVKILLKELQA-TSKIPNLPKEAVEPAIA-TGL-EALGRGHDLNKLNVFIDYMIKLAGLQD 461 (532) Q Consensus 385 ~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~g~lp~~p~~~~~~~~v-~~l-~~l~raq~~~~l~~~~~~laq~~p~~~ 461 (532) .-..|..+..++++-+ ++++.++.+ .|. . ..+++.+- .++ ..+ -.+.++.++... + . T Consensus 344 ~~s~L~~~a~~~e~al-----~~~l~~~a~w~g~-~----~~~~v~~n~dF~~~~~-~~~~~~al~~~~----~-----~ 403 (452) T protein:vir:94 344 ETASLKSVTRAVEALL-----NKAYSCIMDMESM-G----GTLNIKLNSAFLDSKL-TAAELKAWVEAY----L-----S 403 (452) T ss_pred hhHHHHHHHHHHHHHH-----HHHHHHHHHHcCC-C----CceEEEeccccccccC-CHHHHHHHHHHH----h-----c Confidence 5688888777776643 455555544 232 1 11222221 111 111 123333333221 1 1 Q ss_pred hhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHh Q lcl|NC_015159. 462 DDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAA 521 (532) Q Consensus 462 d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~ 521 (532) ..|....+..++-+ .||.. .++|......+. .++ +......+...+-.+ T Consensus 404 G~is~~t~~~~L~~-~gvl~-----~~~e~~~i~~E~--~~~---~~~~~~~~~~~~~~~ 452 (452) T protein:vir:94 404 GGISKEIYIHALKV-GKVLP-----PPGESMGVIPDP--PAP---EPSPSNTPPNPSSKA 452 (452) T ss_pred CCCcHHHHHHHHHh-CCCCC-----CccCHHHHHHHh--hcc---CcccCCCCCCCccCC Confidence 12444444544433 47622 222221111111 000 000011111000000 No 127 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=95.52 E-value=0.0019 Score=35.47 Aligned_cols=460 Identities=12% Similarity=0.015 Sum_probs=199.6 Q ss_pred CCCCCCCccCHHHHHHHHHHH-HHHh-hhHHHHHHHHHHhhcccc---cCC---CCCcc------cccccccccchHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRL-KNDR-GAYETRAEDCATYTIPSV---FPS---ATADG------STSYTTPWQSIGARG 66 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~l-k~~R-~~~e~~w~e~~~~~~P~~---~~~---~~~~~------~~~~~~~~dst~~~a 66 (532) |-.+. ..++.+.+...+... .... +...++.+.+.+|..-.- ... .+..+ .+...|+..+-+... T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~I 79 (537) T protein:vir:78 1 MTSPL-LNKPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTEL 79 (537) T ss_pred CCccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHH Confidence 65552 122444555444322 1111 122345556666644421 000 01111 112346777788888 Q ss_pred HHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCcee Q lcl|NC_015159. 67 LNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVL 146 (532) Q Consensus 67 ~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~ 146 (532) ++..++.|++- |+. ++..+... +++.. .+...+ ..+|.....++.+++..+|.+. T Consensus 80 vd~~~~yl~G~--Pv~-----~~~~d~~~----------~e~~~-------~l~~~~-~~~~~~~~~el~~~~s~~G~ay 134 (537) T protein:vir:78 80 VDQLAQYLLSN--GVE-----VKVKDEDN----------TQLDE-------ILQEYF-DEDFQATIDTLVTNASKKGFEG 134 (537) T ss_pred HHHHhhhhccc--Cce-----eecCcchh----------HHHHH-------HHHHHh-hccHHHHHHHHHHHHhhcCeeE Confidence 88888877654 332 22322111 11221 222233 4678888899999999999997 Q ss_pred eeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEE--E--E Q lcl|NC_015159. 147 LYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTH--V--Y 222 (532) Q Consensus 147 ~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~--v--~ 222 (532) +|+-.++ .+.+++..++..+.+.--|..|....++|-+......-.. ...+.-..+++|+. | + T Consensus 135 ~~~y~de---~~~~~~~~i~p~~~~pv~d~~~~~~~~~~~y~~~~~~~~~----------~~~~~~~~~evyt~~~i~~y 201 (537) T protein:vir:78 135 IFARTTS---EGKLKFQTVDGLTLIPVFDDYGVLKMIIRWYSEIRYSTKQ----------QSTETIWHADVWNEEAVCYY 201 (537) T ss_pred EEeeecC---CCceEEEEEccceeEEEEcCCCCceeEEEEEeeeeccccc----------cCcceEEEEEEEcCCcEEEE Confidence 6655432 3466788888777766667788888888776644321110 01111123333321 0 1 Q ss_pred eeCCC--------------CeEEEEEEEcCcc-------cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHH Q lcl|NC_015159. 223 RDPEA--------------MVFRSYQEIDGEI-------VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLK 281 (532) Q Consensus 223 ~~~~~--------------~~~~s~~~~~~~~-------~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~ 281 (532) ....+ .|...++..+... .......++|..+|++.++= +.+|.|=.++..+-+- T Consensus 202 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~n-----n~~~~sd~e~v~~LiD 276 (537) T protein:vir:78 202 IQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYN-----NKDGMSDVKRVKSIID 276 (537) T ss_pred EecCCcccccccccccccccccceeeeccccccccccccccccccccCCcceeEEEecc-----CccCCCchhhhHHHHH Confidence 10110 0111111111100 01112235678888876654 4578999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCceeecCccccC-hhhhccC-CCcee-ecCccccccccccCCccchhHHHHHHHHHHHHHHH Q lcl|NC_015159. 282 SLENLYEAIVKMSMISSKVLFFVNPNGVTQ-IRRVAKA-NTGDF-VAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSY 358 (532) Q Consensus 282 ~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~-~~~~~~~-~~G~~-v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~ 358 (532) .++.+.-......+...+|.+.+...+... .+..... ..|.+ +.+..+++..+ ....+.......++.+++.|-+ T Consensus 277 ayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l~~~~~i~v~~d~~~v~~l--~~~~~~~~~e~~ld~L~~~I~~ 354 (537) T protein:vir:78 277 DYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNIKAKKMIGVNGDNAGMEIQ--TVSIPYEARKAKMDIDVENIYR 354 (537) T ss_pred HHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHHHhhcCceeecCCCCceeEE--EecCCHHHHHHHHHHHHHHHHH Confidence 999999899999999999887775432222 1111111 12333 34444555443 3334667777778888887754 Q ss_pred HHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeec--chHH Q lcl|NC_015159. 359 AFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIAT--GLEA 436 (532) Q Consensus 359 af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~--~l~~ 436 (532) .-+............|..-+..+-.-+ .+-.....+...+.+.-++.-++.++...|. .......+++.+.- +.+- T Consensus 355 ~s~~~~~~~~~~gn~SGvAlk~~~~~l-~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~-~~~d~~~i~i~f~~~~P~n~ 432 (537) T protein:vir:78 355 SGMGFNSTAVGDGNVTNVVIKSRYTLL-AMKARKMETSLRKVLRWCADMVVSDIALRGL-GEYDSNDICFEIEPHVLANE 432 (537) T ss_pred hcCCCCCccccccCCcHHHHHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC-cccccceeeEEeccCCCCCH Confidence 322111222233334543333221111 1222333333333333333444444433332 11222234443322 2222 Q ss_pred HHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHH-HH------HHHHHH Q lcl|NC_015159. 437 LGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTA-AG------MVTAGQ 509 (532) Q Consensus 437 l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~-~~------~~~~~~ 509 (532) +..++-+.++. +. + .+....++ ..++ ++.+.++.+.+.++..+. .. .++... T Consensus 433 ~e~a~~~~~l~-------~~-g----iiS~eT~l----~~~p-----~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~ 491 (537) T protein:vir:78 433 LDIATTRKTEA-------ET-E----ALKIGNIM----TVAP-----RIGDDETLKLIAEELDLDYNELKDALAEQDAQS 491 (537) T ss_pred HHHHHHHHHHH-------hc-C----cchHHHHH----HhCC-----CCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccc Confidence 22222111111 10 0 01111111 1111 122222221111111110 00 000000 Q ss_pred hhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 510 QMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 510 ~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) ....+..+ .+.....+.+.+ T Consensus 492 ~~~~~~~~---~~~~~~~~~~~~ 511 (537) T protein:vir:78 492 LDVSPDVQ---AMLDGLPVNANQ 511 (537) T ss_pred cCcCcchh---hhcCCCCCCCCC Confidence 00001111 111111111111 No 128 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=95.42 E-value=0.0021 Score=35.26 Aligned_cols=430 Identities=12% Similarity=0.056 Sum_probs=178.6 Q ss_pred ccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccc----cccccccchHHHH-------HHHHHHHHHH Q lcl|NC_015159. 8 GFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGST----SYTTPWQSIGARG-------LNNLASKLML 76 (532) Q Consensus 8 ~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~----~~~~~~dst~~~a-------~~~Laa~l~~ 76 (532) =..++.+ +..+ ..........|.. ..+.....+... ..++..+-.-..+ +++.+..+.. T Consensus 1 ~~~~~~a-------~~~~--~~~~a~~~~~~~~-~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~iVd~~a~ 70 (461) T protein:vir:80 1 MYSIDKA-------KQAK--IDSKIVNRNDFMV-GHGKANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMNIVDIISE 70 (461) T ss_pred Cccchhh-------hhhh--hhhhhhhhhHHHh-hcCCcchhhhhhccccCcccccCHHHHHHHHHhCCccchhhccchH Confidence 1011111 1111 1222223333332 111111111100 0111111111111 1223333333 Q ss_pred hhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_015159. 77 ALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVE 156 (532) Q Consensus 77 ~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~ 156 (532) .+| +.|+.+.-.+++.. ..++.|+ .+-+....+.++++.--.||.|.+++.-.+... T Consensus 71 d~~---r~g~~i~~~~~~~~---------~~~~~~~-----------~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~ 127 (461) T protein:vir:80 71 DMV---RAGWSLKTDNKEMK---------KNIESKW-----------RKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNR 127 (461) T ss_pred Hhh---cCCeeeecCCHHHH---------HHHHHHH-----------HHhhHHHHHHHHHHhhcccccEEEEEEeecCCc Confidence 443 45777765433211 1233333 333577889999999999999988885432211 Q ss_pred CCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEE Q lcl|NC_015159. 157 GQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEI 236 (532) Q Consensus 157 ~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~ 236 (532) ......+.+..+.+ ..-....+|.+..++...+..+. ... .+.+-+.|+.... ...+. +.+ T Consensus 128 ~~~~~~~pl~~~~~-----~~~~~l~~~~~~~i~~~~~~~dp--------~sp-~fg~P~~y~i~~~-~~~~~----~~~ 188 (461) T protein:vir:80 128 EQADLSTAIDPKTI-----KSIPYINTFNTQKVTQLYLNQDM--------FSE-HFGEVEFFEVNRV-SQLGE----EIL 188 (461) T ss_pred cccCccCCcccccc-----cceeEEEeccccccchhhhcccC--------cCc-ccccceEEEEecc-ccccc----ccc Confidence 11111111111111 00111222323222222111100 000 0111122111110 11000 000 Q ss_pred cCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecC------ccc- Q lcl|NC_015159. 237 DGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNP------NGV- 309 (532) Q Consensus 237 ~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~------~g~- 309 (532) .+.. + ....-+|..+++...-...++..||+|..+..++.++..........+.+..+.-+.+-.+. +.. T Consensus 189 ~~~~--~-~~~~~iH~SRii~~~~~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~ 265 (461) T protein:vir:80 189 SGTT--A-STSEQIHRSRIIHEQGLRFEGETKGRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDDKA 265 (461) T ss_pred cccc--C-ccceEEccccEEEecCCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhhchHHH Confidence 1100 0 00112355566666666777888999999999999999998888887766666555544431 100 Q ss_pred --cChhhhccCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHH---hhhhcccCCCCCCCHHHHHHHHHH Q lcl|NC_015159. 310 --TQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAF---MLNSAVQRGGDRVTAEEIRYVAGE 384 (532) Q Consensus 310 --~~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af---~~~~~~~~~~~~~TAtEi~~r~~E 384 (532) ...-.......|.++-+..++...+. .+|.-+...+..+.+.|.-+- ..-.+.+.-+..=|.+ + T Consensus 266 ~~~~~~~~~~~~~g~~~~d~~e~~e~~~----~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge-------~ 334 (461) T protein:vir:80 266 NLTAMLDFMFRTEALAIIKGDEQLTKES----TNVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQ-------Y 334 (461) T ss_pred HHHHHHHHhcCCceEEEEcCCcceEEEe----cCcCCHHHHHHHHHHHHhhhhcCCeeeeecccCCccccch-------H Confidence 00111112233444444444443332 234445566666777766552 0001111112222322 2 Q ss_pred HHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc--CCCCCCccccccceee-cchHHHHHHHHHH---HHHHHHHHHHhhcc Q lcl|NC_015159. 385 LEDTLGGVYSLLSQELQLPLVKILLKELQAT--SKIPNLPKEAVEPAIA-TGLEALGRGHDLN---KLNVFIDYMIKLAG 458 (532) Q Consensus 385 ~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~--g~lp~~p~~~~~~~~v-~~l~~l~raq~~~---~l~~~~~~laq~~p 458 (532) -...+---+.+++...+.|.+++++.++.+. |.-|.++....+..+. .+|..+....+++ +....++.+.+. T Consensus 335 D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~-- 412 (461) T protein:vir:80 335 DVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVN-- 412 (461) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc-- Confidence 2233444456667778899999999988763 3334444433333332 3344444444433 333344433332 Q ss_pred hhhhhcCHHHHHHHHHHhcCCCHhHccCCHH-HHHH-HHHHHHHHHHHHHHHHhhh Q lcl|NC_015159. 459 LQDDDINLLDVKMRLANSLGMDTTGLILTQQ-DKQA-KMAEASTAAGMVTAGQQMG 512 (532) Q Consensus 459 ~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~e-e~~~-~~~q~~~~~~~~~~~~~~~ 512 (532) ..|+.+++.+.+....|+++.......+ |... ..+..+..+.. ...| T Consensus 413 ---g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e----~~~g 461 (461) T protein:vir:80 413 ---GVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVYDAYAKK----NADG 461 (461) T ss_pred ---CCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhcccccccc----CCCC Confidence 2488899888887777776554332221 1111 00000000000 0000 No 129 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=95.21 E-value=0.0025 Score=34.83 Aligned_cols=365 Identities=10% Similarity=0.034 Sum_probs=143.1 Q ss_pred HHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccccc--ccccccchHHHHHHHHHHHHHHhhcCCCCCccccCCChH Q lcl|NC_015159. 16 AAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTS--YTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSEL 93 (532) Q Consensus 16 ~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~--~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~ 93 (532) --|+.+-.+|+..-....-.........+...+..+..- ..-+-.++--.|++.+|+.+.+. ||--+...+. T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~l------~~~~~~~~~~ 74 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLNMFGGRKTASGERVSESNSLVQPDIFACVNVLSDDIAKL------PIHTYKRTDG 74 (416) T ss_pred CccchhcccccCccccCccchhHHHHhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhC------ceEEEEecCC Confidence 334443333432211100011111111111111111110 11123444445666666655432 4422222211 Q ss_pred HHhhhccChhHHHHHHHHHHHHHHHHHHHH-HhcC----ChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecc Q lcl|NC_015159. 94 EVKQSITSPEELTEIATGLAMVERICMNYM-ESNS----FRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLH 168 (532) Q Consensus 94 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~sn----f~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~ 168 (532) ...+. ....| +..| .+-| .+.=+...+.++..+|||.+|+..+. .+.....+||. T Consensus 75 ~~~~~---------~~~~l-------~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~----~G~~~~L~~l~ 134 (416) T protein:vir:12 75 GIERK---------PEHKS-------AHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGS----HGYPEALFPLR 134 (416) T ss_pred ccccc---------cccHH-------HHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC----CCcEEEEEEEC Confidence 11100 00011 1222 2222 33345566788889999988775321 22233444442 Q ss_pred --eEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcccccccc Q lcl|NC_015159. 169 --NFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEG 246 (532) Q Consensus 169 --~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~ 246 (532) ..-+..+.++. ..|. .+...|... T Consensus 135 ~~~v~v~~~~~~~------------------------------------------------~~~~-~~~~~g~~~----- 160 (416) T protein:vir:12 135 PDYTNAYVHPTTG------------------------------------------------MLWY-QTVLNGKAI----- 160 (416) T ss_pred CcceEEEEeCCCc------------------------------------------------EEEE-EEecCCeEE----- Confidence 22222222211 1111 111122211 Q ss_pred cCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhc---------- Q lcl|NC_015159. 247 EYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVA---------- 316 (532) Q Consensus 247 ~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~---------- 316 (532) .+...-+++.|+...+ ..||.||..-+...+.......+.......-...|..++.-++.++++... T Consensus 161 --~~~~~eiih~~~~~~~-~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~ 237 (416) T protein:vir:12 161 --ELYDYEVLHFKGLSTD-GIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPAFLDEKPKENVRKEWKRVN 237 (416) T ss_pred --EecCccEEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHh Confidence 1122345666665444 489999999999999998888888888888888888888766666666432 Q ss_pred cCCCceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--cc-cCCCCCCCHHHHHHHHHHHHHHhhhh Q lcl|NC_015159. 317 KANTGDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--AV-QRGGDRVTAEEIRYVAGELEDTLGGV 392 (532) Q Consensus 317 ~~~~G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~-~~~~~~~TAtEi~~r~~E~~~~LGpv 392 (532) .++.-.++++ +....++.. ..+.+.. +.....+..|-++|-.-. +. ..++..-++++... . T Consensus 238 ~~~~~~vl~~---g~~~~~l~~~~~d~q~~-e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~--~--------- 302 (416) T protein:vir:12 238 KVENIAIIDY---GLEYQSISMPLQEAQFV-ESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQSI--E--------- 302 (416) T ss_pred cCCCeeecCC---CceEEEccCChhhHHHH-HHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHH--H--------- Confidence 1211112222 223333332 3345543 445667788888884321 11 11222223333321 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCCCcc-cccccee-ec---chHHHHHHHHHHHHHHH----HHH---HHhhcch- Q lcl|NC_015159. 393 YSLLSQELQLPLVKILLKELQATSKIPNLPK-EAVEPAI-AT---GLEALGRGHDLNKLNVF----IDY---MIKLAGL- 459 (532) Q Consensus 393 ~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~-~~~~~~~-v~---~l~~l~raq~~~~l~~~----~~~---laq~~p~- 459 (532) +...-|.|++.+....+.+ .+|++... ....+.+ ++ ..+...|+.-.+.+... ... +-.+.|. T Consensus 303 ---f~~~~l~P~~~~ie~~l~~-~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ 378 (416) T protein:vir:12 303 ---YVRNTLQPWIVNFEQELNV-KLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIE 378 (416) T ss_pred ---HHHHHHHHHHHHHHHHHHH-hhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 2233445554444444322 23332221 1111222 11 11233333332222211 000 1111121 Q ss_pred hhhh-------cCHHHHHHHHH-------HhcC-CCHhH Q lcl|NC_015159. 460 QDDD-------INLLDVKMRLA-------NSLG-MDTTG 483 (532) Q Consensus 460 ~~d~-------id~d~~~~~~a-------~~~G-v~p~~ 483 (532) -.|. +-.|.+ ..+- ..-| =.-+. T Consensus 379 ggd~~~~~~n~~~~~~~-~~~~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 379 NGDKYISSLNYVFLDFL-EEYQRLKAGGAMKGGDNKNEG 416 (416) T ss_pred Ccceeeecccccccccc-chhhccccccccCCCCCcCCC Confidence 0111 111111 1110 0000 00111 No 130 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=95.09 E-value=0.0028 Score=34.60 Aligned_cols=438 Identities=14% Similarity=0.120 Sum_probs=183.1 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccccc---CC------CCCccc----cccc-ccccchHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVF---PS------ATADGS----TSYT-TPWQSIGARG 66 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~---~~------~~~~~~----~~~~-~~~dst~~~a 66 (532) -|+-++..++.+.. .|. ...++|+-+.+.+-=.+. +. ....+. .++. -.|-+.-.+. T Consensus 3 ~~~~~~~~V~~~hp--~y~-------a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~t 73 (491) T protein:vir:95 3 TANGQGSGVKTKHR--EWL-------HYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRT 73 (491) T ss_pred ccCCccCCCCccCH--HHH-------HHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHHH Confidence 34445555543322 232 233555555554432110 00 000111 1111 1333333344 Q ss_pred HHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCcee Q lcl|NC_015159. 67 LNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVL 146 (532) Q Consensus 67 ~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~ 146 (532) ++. |++.+|- ..|.+. ++ ..++.+++.| -....+++.-+..++.+...+|-+. T Consensus 74 l~~----l~G~vfr-k~p~~~--~p--------------~~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~ 126 (491) T protein:vir:95 74 LSG----MVGSVMR-KEPEIN--IP--------------KELEYLLKNA------DGSGVGLIQHAQDTLMEIDSVGRGG 126 (491) T ss_pred HHH----Hhchhhc-CCceee--cc--------------HHHHHHHhcc------CCCCCCHHHHHHHHHHHHHHcCeEE Confidence 444 4444433 334442 22 1244455544 4557778888889999999999999 Q ss_pred eeeccccccc---------CCcceEEEEecceE---EE-eeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcc Q lcl|NC_015159. 147 LYIPSTEQVE---------GQSNAPKLYKLHNF---VV-ERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSE 213 (532) Q Consensus 147 ~~v~~~~~~~---------~~~~~~~~~pl~~~---~v-~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~ 213 (532) ++||-..... +..-.+..|+..+. -. ..|..+++.-+..+++..+++=+.+|. .+ T Consensus 127 ilVD~P~~~~~T~Ade~~~~~rPy~~~~~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~------------~~ 194 (491) T protein:vir:95 127 LLVDAPETAAATAAEQNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFE------------TK 194 (491) T ss_pred EEEecCCCcccCHHHHHHhcCCcEEEEechhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcc------------cc Confidence 9998543210 11234666665554 22 235566677676677544433222222 34 Q ss_pred eEEEEEEEEeeCCCC-eEEEEEEE-cCccccccc---ccCccccCceEEEEeeecCCCccc--cchHHHHHHHHHHHHH- Q lcl|NC_015159. 214 EVTIYTHVYRDPEAM-VFRSYQEI-DGEIVAGTE---GEYPLDSCPWIPVRLIKMPNEDYG--RSFVEEYLGDLKSLEN- 285 (532) Q Consensus 214 ~v~i~~~v~~~~~~~-~~~s~~~~-~~~~~~~~~---~~~g~~~~P~~~~Rw~~~~g~~YG--~Gp~~~al~d~~~L~~- 285 (532) .++.|....++.++. .++.+... +|....... -..|-..+++|++.|.-..+..+. ..|.. |+..||. T Consensus 195 ~~~qyRvL~l~~~g~~~~~v~r~~~~g~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl----~LA~lni~ 270 (491) T protein:vir:95 195 YGEQYRVLDIDTDGNYRQRLFRFDAEGGAQEEVVEIYPDLGESLRGVIPFTFIGATNNDATIDDAPLL----PLAELNIG 270 (491) T ss_pred eEEEEEEEeecCCCceEEEEEEEcCCCcceeeeeeeeecCCCcccCeeEEEEEecCCCCCCCCcCchH----HHHHHHHH Confidence 556666666655443 12222211 111110000 012334566777776655555444 44533 4444443 Q ss_pred --HHHHHHHHHHH-HhcCceeecC-ccccChhhhccCCCceeecCc--------cccccccccCCccchhHHHHHHHHHH Q lcl|NC_015159. 286 --LYEAIVKMSMI-SSKVLFFVNP-NGVTQIRRVAKANTGDFVAGR--------KQDVEVFQLEKYNDFQVAKATADDIE 353 (532) Q Consensus 286 --l~~~~l~~~~~-a~~p~~lv~~-~g~~~~~~~~~~~~G~~v~g~--------~~~~~~~~~~~~~~~~~~~~~i~~~~ 353 (532) -+.+-.+.+-. +..|.+.+.. |.. ..+.+..+.+..++-|. .++...++.. +.. .+...+.+++ T Consensus 271 Hy~~ssd~~~~l~~~~~P~l~~~G~d~~-~~~~~~~~~~~~i~~g~~~~~~lP~~~~~~~ie~~-~~~--~~~~~l~~~e 346 (491) T protein:vir:95 271 HYRNSADNEESSFVVGQPTLFIYPGDNL-TPQSFKEANPNGIKFGSRCGHNLGYGGSAQLIQAG-ENN--LARQNMLDKE 346 (491) T ss_pred HhhhhhHHHHHHHHcccceeeeecCccc-CcchhhccCcceeEecCcCCcCCCCCCccceeecC-cch--HHHHHHHHHH Confidence 23333443333 4444433321 111 11111111122222222 2222333322 112 2466677777 Q ss_pred HHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc-c--cccee Q lcl|NC_015159. 354 KRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKE-A--VEPAI 430 (532) Q Consensus 354 ~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~-~--~~~~~ 430 (532) .+..++=. .+...+ .+.||++...+....-..|+.+...+++-+ .-++..+-..+ |. + .+.+ . ++.++ T Consensus 347 ~qm~~~Ga--~l~~~~-~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al-~~~l~~~a~w~---G~-~-~~~~v~i~~n~dF 417 (491) T protein:vir:95 347 QQAIQIGA--QLITPS-QQITAESARIQRGADTSVMATIARNVSQAY-TDALRWVAMML---GK-P-EDSEVEFQLNMDF 417 (491) T ss_pred HHHHHHHH--HhccCC-cchhHHHHHHHHHHhhHHHHHHHHHHHHHH-HHHHHHHHHHc---CC-C-CCCceEEEeeccc Confidence 66655311 222233 358999999999999999999888877643 33334433332 32 1 1111 1 22232 Q ss_pred -ecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 431 -ATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQ 509 (532) Q Consensus 431 -v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~ 509 (532) +..+. ++.++.++...+ ...|....+...+ ...||. + .+.|++..+.+.+.-. .. T Consensus 418 ~~~~~~----~~~~~all~~~~---------~G~is~~t~~~~L-~~~~vl-~---~~~e~~~~~ie~~~~~------~~ 473 (491) T protein:vir:95 418 FLQPMT----AQDRAAWMADIN---------AGLLPATAYYAAL-RKAGVT-D---WTDEDILNAIEDAPLP------SG 473 (491) T ss_pred ccccCC----HHHHHHHHHHHh---------cCCCCHHHHHHHH-HhCCCC-C---ccHHHHHHHHHhcCCC------CC Confidence 12222 223333332222 1123333333333 334663 1 2333332222211100 00 Q ss_pred hhhHHHHHHHHhhccccc Q lcl|NC_015159. 510 QMGAAGGQAAAAMMQQQA 527 (532) Q Consensus 510 ~~~~~~~~~~~~~~~~~~ 527 (532) ...+.++..+.+.-+++. T Consensus 474 ~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 474 AVTQVAGEIPQAAQQQQE 491 (491) T ss_pred ccccccccchhhhhhccC Confidence 000000000000000000 No 131 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=94.92 E-value=0.0032 Score=34.27 Aligned_cols=452 Identities=9% Similarity=0.012 Sum_probs=202.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCc-c----cccccc--cccchHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATAD-G----STSYTT--PWQSIGARGLNNLASK 73 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~-~----~~~~~~--~~dst~~~a~~~Laa~ 73 (532) |.-+..-+.+......++...-..-+ ++.+..+.|.-+.+..+.... . ..+... .-++.+..+++.+++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~a~---~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~n 77 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYHGGGG---GFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQDH 77 (530) T ss_pred CccceeecCccccchHHHhhhhcccC---CCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH Confidence 66665555554434333322211000 111222222111111111000 0 111222 3477889999999888 Q ss_pred HHHh-hcCCCCCccc-cCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHH----------HhcCChHHHHHHHHHHHh Q lcl|NC_015159. 74 LMLA-LFPVGSSFFK-LNVSELEVKQSITSPEELTEIATGLAMVERICMNYM----------ESNSFRPTLHAAIKQLLV 141 (532) Q Consensus 74 l~~~-ltpp~~~WF~-l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l----------~~snf~~~~~~~~~dl~~ 141 (532) +++. ++|..+|=++ |..++.. .+.|-..||+.-.... ...+||.....++..+++ T Consensus 78 vVG~Gi~~~~~p~~~~l~~~~~~-------------~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~ 144 (530) T protein:vir:38 78 IVGSFFRLSYRPSWRYLGINEED-------------SRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAF 144 (530) T ss_pred hhCCCceeeeccchhhcCCCHhH-------------HHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhh Confidence 8775 7776554333 3332211 2233333433333222 245799999999999999 Q ss_pred hCceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEE Q lcl|NC_015159. 142 AGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHV 221 (532) Q Consensus 142 ~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v 221 (532) -|-+++-+.... ..+..+.+ +-+-+-.+.|+-. .. .++.. .|+..| T Consensus 145 dGE~~~~~~~~~-~~g~~~~~----------------------~lq~ie~d~l~~~--------~~--~~~~~-~i~~GI 190 (530) T protein:vir:38 145 NGELCVQATWDS-DSTRLFRT----------------------QFKMVSPKRVSNP--------NN--IGDTR-NCRAGV 190 (530) T ss_pred CCceEEEeeecc-CCCCccce----------------------EEEEechhhcCCC--------CC--CCCCC-eeEeee Confidence 888766432110 11111111 1112222222110 00 11111 367889 Q ss_pred EeeCCCCeEEEEEEEcCcccc---cccccCccccCc--eEEEEeee-cCCCccccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 222 YRDPEAMVFRSYQEIDGEIVA---GTEGEYPLDSCP--WIPVRLIK-MPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSM 295 (532) Q Consensus 222 ~~~~~~~~~~s~~~~~~~~~~---~~~~~~g~~~~P--~~~~Rw~~-~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~ 295 (532) +.|..++|...+..-...... ...+.-.+...| -+++-++. .+|..-|.+..--+|..++.|+....+.+.++. T Consensus 191 e~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~ 270 (530) T protein:vir:38 191 KINDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGGRPSFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAI 270 (530) T ss_pred EECCCCceEEEEEeeccCCCccccccceeeeeeccChhHeEeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHH Confidence 999888876554432211100 000000011122 24444444 589999999999999999999999999999999 Q ss_pred HHhcCceeecCc-cccChh----------------------------hhccCCCceeecCcccc-ccccccC-CccchhH Q lcl|NC_015159. 296 ISSKVLFFVNPN-GVTQIR----------------------------RVAKANTGDFVAGRKQD-VEVFQLE-KYNDFQV 344 (532) Q Consensus 296 ~a~~p~~lv~~~-g~~~~~----------------------------~~~~~~~G~~v~g~~~~-~~~~~~~-~~~~~~~ 344 (532) .++.....+..+ +.-... .....++|.|..-.++. +...... ...+| T Consensus 271 i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~-- 348 (530) T protein:vir:38 271 VKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSLNLQSAQDTDNGY-- 348 (530) T ss_pred HhhhheeeeeccCCccccccccccCCcccccccccccchhhhhcccccceeccCceeeecCCCCeeeeeCCCCCCCCH-- Confidence 988888665421 100000 00112344443322222 3222211 12233 Q ss_pred HHHHHHHHHHHHHHHH--hhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCc Q lcl|NC_015159. 345 AKATADDIEKRLSYAF--MLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLP 422 (532) Q Consensus 345 ~~~~i~~~~~rI~~af--~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p 422 (532) ......+...|-.++ -+..+ ..|-..++=.-+++-..|..+.+--.=..+..-|+.|+..+.+..+...|.||-+. T Consensus 349 -~~f~~~~lr~iaaglGi~ye~l-t~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~ 426 (530) T protein:vir:38 349 -STFEQSLLRYIAAGLGVSYEQL-SRNYSQMSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPS 426 (530) T ss_pred -HHHHHHHHHHHHhhcCCCHHHH-hcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCC Confidence 233344455555554 12222 23444555555555556666665555566667788999999999999999998443 Q ss_pred cc----------cccceee----cchHHHHHHHHH-HHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC Q lcl|NC_015159. 423 KE----------AVEPAIA----TGLEALGRGHDL-NKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT 487 (532) Q Consensus 423 ~~----------~~~~~~v----~~l~~l~raq~~-~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s 487 (532) .. ......+ ..|+|+-.++.. ..+.+-+.. ...++...|.|+..++ T Consensus 427 ~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s-----------------~~~~~a~~G~D~~~v~-- 487 (530) T protein:vir:38 427 KARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLST-----------------YEKECAKRGDDYQEIF-- 487 (530) T ss_pred CCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCC-----------------HHHHHHHcCCCHHHHH-- Confidence 21 1223332 224554333221 111100000 1122233466664222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 488 QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 488 ~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .++.++.+......+ ..+.......+.-..+....++. T Consensus 488 -----~q~a~e~~~~~~~Gl--~~~~~~~~~~~~~~~~~~~~~~d 525 (530) T protein:vir:38 488 -----AQQVRESMERRAAGL--NPPAWAAAAFEAGVKKSNEEEQD 525 (530) T ss_pred -----HHHHHHHHHHHHcCC--CCCCCcccccCCCCCCCCCCCCC Confidence 222222111111100 00000000000000111111111 No 132 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=94.51 E-value=0.0042 Score=33.61 Aligned_cols=404 Identities=12% Similarity=0.092 Sum_probs=164.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccccccc--ccccchHHHHHHHHHHHHHHhh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYT--TPWQSIGARGLNNLASKLMLAL 78 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~--~~~dst~~~a~~~Laa~l~~~l 78 (532) ||+.-. .+.+ .++..-.++.+.| .+++-|...+.. ..+.+-.. -.-.++--.|++.+|+.+. T Consensus 1 ~~~~~~------~~~~---~~~~~~~~~~~~~---~~~~g~~~~~~~-~~~~~~~~~~a~~~~~v~~~v~~ia~~iA--- 64 (460) T protein:vir:10 1 MANRII------RALR---ELTGLDNKFNDAF---IKYIGQTFTKYD-NNGKTYLEQGYNINPDVYSCISQMAAKTV--- 64 (460) T ss_pred CchhHH------HHHh---hhhccCCCchHHH---HHhhccccCCCc-cchhhhhHHHHhcchHHHHHHHHHHHhhh--- Confidence 777631 2222 2222222333445 456666433211 22222122 2234455566777776653 Q ss_pred cCCCCCccccCCChHH-HhhhccC------------hhHHHHHHHHHHHHHHHHHHHHHhcC----ChHHHHHHHHHHHh Q lcl|NC_015159. 79 FPVGSSFFKLNVSELE-VKQSITS------------PEELTEIATGLAMVERICMNYMESNS----FRPTLHAAIKQLLV 141 (532) Q Consensus 79 tpp~~~WF~l~~~d~~-~~~~~~~------------~~~~~~v~~~L~~ve~~~~~~l~~sn----f~~~~~~~~~dl~~ 141 (532) +-||.-....... ..+.... ......+ ..+...+......+.+=| .+.-...++.++.. T Consensus 65 ---~lp~~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll 140 (460) T protein:vir:10 65 ---AVPYTIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRL-DTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRL 140 (460) T ss_pred ---hCceEEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchh-hhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhh Confidence 3344432211100 0000000 0000000 112222222333333333 34445666778889 Q ss_pred hCceeeeecccccccCCcce--EEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEE Q lcl|NC_015159. 142 AGNVLLYIPSTEQVEGQSNA--PKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYT 219 (532) Q Consensus 142 ~G~~~~~v~~~~~~~~~~~~--~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~ 219 (532) +|||..|+..+......+.. +..+|...+-+..+.+|.+-.. ++ T Consensus 141 ~Gnay~~i~r~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~--~~-------------------------------- 186 (460) T protein:vir:10 141 NGNCYFYLMSPDDGINAGVPSQMYVLPAHLIKIVLKDDINLLST--DS-------------------------------- 186 (460) T ss_pred cCCeEEEEEecCCCccCceeEEEEEEcCceEEEEEcCCCceeee--ee-------------------------------- Confidence 99999988654322222333 4444446666666666633210 00 Q ss_pred EEEeeCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeec-----CCCccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 220 HVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKM-----PNEDYGRSFVEEYLGDLKSLENLYEAIVKMS 294 (532) Q Consensus 220 ~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~-----~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~ 294 (532) ....+.+..+.... .+ ...=.++.|+... .+..||.||...+...+.......+...... T Consensus 187 ---------~~~~~~~~~~g~~~----~~--~~~evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f 251 (460) T protein:vir:10 187 ---------PIKSYMLIQGDQFI----EF--NEDEVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTM 251 (460) T ss_pred ---------eeeEEEEecCceeE----Ee--cccceEEEecCCCCcccccCccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 01111111111100 11 1112344454332 3557999999999999999888888888877 Q ss_pred HHHhcCceeecCccccChhhhccCC-----------C-ceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHh Q lcl|NC_015159. 295 MISSKVLFFVNPNGVTQIRRVAKAN-----------T-GDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFM 361 (532) Q Consensus 295 ~~a~~p~~lv~~~g~~~~~~~~~~~-----------~-G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~ 361 (532) .....|-+++..++.++++...... + |.++. -.++....++.. ..+.+ ..+..+..+..|-++|- T Consensus 252 ~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~v-l~~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fg 329 (460) T protein:vir:10 252 QNGGVFGFIHGGSTGLTQPQADSLKQRLTEMDKSPDRLSQIAG-ASGEIAFTKISLNTDELK-PFDYLKYDQKAICNALG 329 (460) T ss_pred hcCCCcceeeecCCCCCHHHHHHHHHHHHHHhcCccccCCcee-cCCCceEEEccCChhHHH-HHHHHHHHHHHHHHHhC Confidence 7777787888777777766442211 1 11111 122333334432 23333 24555677788888873 Q ss_pred hh--hcccCCCCCCCHHHHHHHHHH-HHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc-ccccee-ecchHH Q lcl|NC_015159. 362 LN--SAVQRGGDRVTAEEIRYVAGE-LEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKE-AVEPAI-ATGLEA 436 (532) Q Consensus 362 ~~--~~~~~~~~~~TAtEi~~r~~E-~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~-~~~~~~-v~~l~~ 436 (532) .- .+...++...|-.-+.+.... ....|.|...+++.+|-.- ++|+.... ...+.+ ...+.. T Consensus 330 VPp~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~k-------------l~~~~~~~~~~~i~~d~~~l~~ 396 (460) T protein:vir:10 330 WSDKLLNNNEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKK-------------FIKRFKGYENAVIEWDISELPE 396 (460) T ss_pred CCHHHhCCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHh-------------hcCcccccCCceEEeecchhhh Confidence 22 111122222222222222222 2224555555555444332 33332211 122232 122222 Q ss_pred HHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHh------------HccCCHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 437 LGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTT------------GLILTQQDKQAKMAEASTAAGM 504 (532) Q Consensus 437 l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~------------~i~~s~ee~~~~~~q~~~~~~~ 504 (532) +. .+...+..+++. -.+-.++ +-+.+|.||- .++..++.-+..... T Consensus 397 l~--~d~~~~~~~~~~---------g~~T~NE----~R~~~g~~pi~~~~gD~~~~~~n~~~~~~~~~~~~~~------- 454 (460) T protein:vir:10 397 MQ--TDMVAMASWLNT---------IPVTPNE----IRIAMKYETLNQDGMDIVFMPSNKVRIDDVSNNLIDS------- 454 (460) T ss_pred HH--HHHHHHHHHHhC---------CCCCHHH----HHHHhCCCCCCCCCCCeeeecccccchhhcccccCCC------- Confidence 22 122222222221 1122333 2233455442 111111000000000 Q ss_pred HHHHHhhhHHHHHHHHhhcccc Q lcl|NC_015159. 505 VTAGQQMGAAGGQAAAAMMQQQ 526 (532) Q Consensus 505 ~~~~~~~~~~~~~~~~~~~~~~ 526 (532) ...++| T Consensus 455 ----------------~~nq~~ 460 (460) T protein:vir:10 455 ----------------AFNQNQ 460 (460) T ss_pred ----------------cccCCC Confidence 000000 No 133 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=94.02 E-value=0.0056 Score=32.92 Aligned_cols=451 Identities=12% Similarity=0.077 Sum_probs=192.7 Q ss_pred CCCCCCCccCH-----HHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccccccc-ccccchHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAA-----DGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYT-TPWQSIGARGLNNLASKL 74 (532) Q Consensus 1 m~~~~~~~~~~-----~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~-~~~dst~~~a~~~Laa~l 74 (532) |++-++..++. .....+|+.+++--.. ....++...--||.....+...-..++. -.|-+.-.+.++.++..+ T Consensus 1 m~~~~~~~v~~~h~~y~a~~~~W~~ird~~~G-~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~v 79 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQMLPRWHVIETLLGG-TEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLDTLSGKP 79 (513) T ss_pred CCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcC-hHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhhhh Confidence 98877655542 3334444444443222 3444555555566433222111222332 356666677777777555 Q ss_pred HHhhcCCCCCccccCCChHHHhhhccChhHHHHHHH-HHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccc Q lcl|NC_015159. 75 MLALFPVGSSFFKLNVSELEVKQSITSPEELTEIAT-GLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTE 153 (532) Q Consensus 75 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~-~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~ 153 (532) +.- ||.. |..+ + ..+.. |++.| -+...+++.-+..++.+...+|-+.++||-.. T Consensus 80 f~k--~p~~-~~~~--p--------------~~~~~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~ 134 (513) T protein:vir:97 80 FSE--PIKL-NEDV--P--------------KAIEETILPDV------DLQGNNLDVFARQWFREGMAKALCHVLIDMPR 134 (513) T ss_pred hhc--Cccc-CcCc--h--------------HHHHHHHhhcc------CCCCCCHHHHHHHHHHHHHhcCeEEEEEecCC Confidence 442 3211 2111 1 12333 33332 34566788888889999999999999997432 Q ss_pred ccc---------------CCcceEEEEecceE---EEe-eCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcce Q lcl|NC_015159. 154 QVE---------------GQSNAPKLYKLHNF---VVE-RDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEE 214 (532) Q Consensus 154 ~~~---------------~~~~~~~~~pl~~~---~v~-~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 214 (532) ... +..-.+..|+..++ -.. .|..+.+.-+..++....+ +.|. ... T Consensus 135 ~~~~~~~~~~T~Ade~~~~~rPy~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~---Dgf~------------~~~ 199 (513) T protein:vir:97 135 PAPREDGQPRTLADDRREGLRPYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQ---DGFA------------EVC 199 (513) T ss_pred CCCccchhHHhHHHHHhhccCceEEEecHhhhcCcceeccCcceeeeeEEEEEEEeec---CCCc------------ceE Confidence 100 00123556665443 222 2444445545555544311 1111 112 Q ss_pred EEEEEEEEeeCCCCeEEEEEEEcCccccc---ccccCccccCceEEEEeeecCCCcc--ccchHHHHHHHHHHHHHH--- Q lcl|NC_015159. 215 VTIYTHVYRDPEAMVFRSYQEIDGEIVAG---TEGEYPLDSCPWIPVRLIKMPNEDY--GRSFVEEYLGDLKSLENL--- 286 (532) Q Consensus 215 v~i~~~v~~~~~~~~~~s~~~~~~~~~~~---~~~~~g~~~~P~~~~Rw~~~~g~~Y--G~Gp~~~al~d~~~L~~l--- 286 (532) ++.|... ++. -|+.+...++..... ..-..|-+.+++|++.|....+..+ |..|.. |+..||.- T Consensus 200 ~~q~rvL--~~g--~~~v~r~~~~~~~~~~e~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLl----~LA~ln~~hy~ 271 (513) T protein:vir:97 200 KRRIRVL--EPG--LVQLWEPVKKSNAQKEEWALADEWATGLNYVPLVTFYADRQGFMMGKPPLL----DLAHLNVAHWQ 271 (513) T ss_pred EEEEEEE--eCc--eEEEEEeecCCCccccceEEecCCCCcCCceeEEEEecCCCCCCCCccchH----HHHHHHHHHHh Confidence 2212111 122 144333222211000 0111122345666666665555444 344533 55555532 Q ss_pred HHHHHHHHHHHhcCceeecCccccC--hhhhccCCCceee--cCccccccccccCCccchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_015159. 287 YEAIVKMSMISSKVLFFVNPNGVTQ--IRRVAKANTGDFV--AGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFML 362 (532) Q Consensus 287 ~~~~l~~~~~a~~p~~lv~~~g~~~--~~~~~~~~~G~~v--~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~ 362 (532) ..+-++.+......|.++-. |... .+.+.-+ ++.++ |...++...++ ..+..+......+.+++++++++= . T Consensus 272 ~~Sd~~~il~~~~~P~l~~~-G~~~~~~~~i~iG-~~~~~~lpe~~~~~~yie-~~g~~i~~~~~~l~~le~qm~~~G-a 347 (513) T protein:vir:97 272 SASDQRHILTVSRFPILACS-GASGEDSDPVVVG-PNKVLYNPDPAGRFYYVE-HTGQAIAAGRTDLKDLEEQMAGYG-A 347 (513) T ss_pred hhhhHHHHHHhcccceeeee-cCCcCCCCceEee-ccccccCCCCCCcceeec-cCchhHHHHHHHHHHHHHHHHHHH-H Confidence 23333333333444434332 2211 1233333 33333 32233344444 235678888889999999887642 1 Q ss_pred hhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeec-chHHHHHHH Q lcl|NC_015159. 363 NSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIAT-GLEALGRGH 441 (532) Q Consensus 363 ~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~-~l~~l~raq 441 (532) . +........||++.+.+....-..|+.+...+++ .+.-++..+...| |. ..+..++.+-. +...---++ T Consensus 348 ~-ll~~~~~~~Ta~a~~~~~~~~~S~L~~~a~~le~-al~~~l~~~a~wl---g~----~~~~~~v~in~dF~~~~~~~~ 418 (513) T protein:vir:97 348 E-FLKRKTGGQTATARALDSAEATSDLSAMTGLFED-ALAQALDITADWL---RL----GPNGGTVELVKDYDLEEMDAP 418 (513) T ss_pred H-hhccCCccccHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHh---CC----CCCccEEEeccccCcccCCHH Confidence 1 2223344589999999999999999998877665 3333333333333 21 11222222211 111111122 Q ss_pred HHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHh Q lcl|NC_015159. 442 DLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAA 521 (532) Q Consensus 442 ~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~ 521 (532) .++.++...+ + + .|....+.+++-+ .||-+..+- .+++.+.++.+.+.+.... ......+. T Consensus 419 ~~~al~~a~~---~--G----~is~~t~~~~L~r-~gvl~~d~d-~~~~~e~~~~~~~~~~~~~--------~~d~~~~~ 479 (513) T protein:vir:97 419 GLQALQVARE---K--R----DISRKTYLNGLRL-RGVLPEDFD-EDEDWEELMEEISEAMGRA--------GLDLDPAQ 479 (513) T ss_pred HHHHHHHHHh---C--C----CCCHHHHHHHHHh-ccCCCccCC-HHHHHHHHHHhhhhccCCC--------CccccccC Confidence 2322222211 0 1 1333334444432 455222221 1222222222211111000 00000011 Q ss_pred hcccccCCCCC Q lcl|NC_015159. 522 MMQQQAGLPTQ 532 (532) Q Consensus 522 ~~~~~~g~~~~ 532 (532) -+.+++|..|. T Consensus 480 ~~~~~~~~~~~ 490 (513) T protein:vir:97 480 KNPPEGGEGEG 490 (513) T ss_pred CCCCCCCCCCC Confidence 12222222222 No 134 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=93.85 E-value=0.0062 Score=32.69 Aligned_cols=329 Identities=13% Similarity=0.057 Sum_probs=124.3 Q ss_pred hhcccccCCCCCccc---ccccccccchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHH Q lcl|NC_015159. 38 YTIPSVFPSATADGS---TSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAM 114 (532) Q Consensus 38 ~~~P~~~~~~~~~~~---~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ 114 (532) -+++- +........ ......+-+.+. ...+.+.++..+ ...++...... ...|.. T Consensus 1 m~m~~-f~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~----~~~v~~~~al~-------~~~v~~---- 58 (392) T protein:vir:10 1 MILPI-LNFINQTNDPPEVGSVQSYFPDGN------DAQIMESLLGDN----NEWVSARAALR-------NSDLFS---- 58 (392) T ss_pred Ccchh-hhhhhcccccccccccccccccCc------hhhhhhhhcCCC----CceechHHhhc-------cHHHHH---- Confidence 11111 110000000 000000000000 001111111100 00111110000 011111 Q ss_pred HHHHHHHH----------------HHhcCC----hHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecceEEEee Q lcl|NC_015159. 115 VERICMNY----------------MESNSF----RPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVER 174 (532) Q Consensus 115 ve~~~~~~----------------l~~snf----~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~ 174 (532) |=..+... +.+-|- +.=+..++.++..+|||++++..+. .+..+.+..++...+-+.. T Consensus 59 ~i~~ia~~ia~lp~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~--~g~~~~L~~l~~~~v~~~~ 136 (392) T protein:vir:10 59 IILQLSSDLAIVKINAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNA--NGADMKWEYLRPSQVNTYY 136 (392) T ss_pred HHHHHHHhhccCceeeccchhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECC--CCcEEEEEEEcCceeEEEE Confidence 11122222 222332 4444556678888999988765331 1223333333334444444 Q ss_pred CCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcccccccccCccccCc Q lcl|NC_015159. 175 DAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCP 254 (532) Q Consensus 175 d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P 254 (532) +.+|... |--++ ..+...... ..+ ..-- T Consensus 137 ~~~~~~~------------------------------------------------~y~~~-~~~~~~~~~-~~~--~~~e 164 (392) T protein:vir:10 137 FEYENGM------------------------------------------------YYNIT-FDDPKIEPI-LQA--PQSD 164 (392) T ss_pred cCCCceE------------------------------------------------EEEEE-ecCccccee-EEE--cccc Confidence 4333210 10011 111111000 011 1223 Q ss_pred eEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCcc-ccChh--------hhccCCC-ceee Q lcl|NC_015159. 255 WIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNG-VTQIR--------RVAKANT-GDFV 324 (532) Q Consensus 255 ~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g-~~~~~--------~~~~~~~-G~~v 324 (532) +++.|+...+|..||.||...+...+.....+.+.......-...|..++.-.+ ....+ .+....+ |.+ T Consensus 165 iih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~- 243 (392) T protein:vir:10 165 LIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGP- 243 (392) T ss_pred EEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCe- Confidence 566777777788999999999999999999999999988888888886654322 11111 1111111 111 Q ss_pred cCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHH Q lcl|NC_015159. 325 AGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLP 403 (532) Q Consensus 325 ~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~P 403 (532) .--.+++...++.. ..+.+. .+..+..+..|-.+|=.......+...-|..+ .+...-....|.|.+.++++++-.- T Consensus 244 ~vl~~g~~~~~l~~~~~d~~~-~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~-~~~~~f~~~~l~P~~~~ie~~l~~~ 321 (392) T protein:vir:10 244 VVLDDLEEFTALEIKSNVAQL-LSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSI-QQISGMYASALNRYLRPAISELEYK 321 (392) T ss_pred eecCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHh Confidence 11123334444442 234443 35666777888888733211111222222211 1122234456667666666666443 Q ss_pred HHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHH-----------HHH-------------HHHHhhc-- Q lcl|NC_015159. 404 LVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLN-----------VFI-------------DYMIKLA-- 457 (532) Q Consensus 404 li~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~-----------~~~-------------~~laq~~-- 457 (532) |+.. +.-+ .......+...++..+..+. ..+ ..+..+. T Consensus 322 L~~~-------------~~~d---~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~e~l~~~~~G 385 (392) T protein:vir:10 322 LSDH-------------ISVN---MRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTG 385 (392) T ss_pred cccc-------------cccc---chhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccchhcCCCCCCCC Confidence 3221 1000 00000011111111111110 000 1121111 Q ss_pred ---chhh Q lcl|NC_015159. 458 ---GLQD 461 (532) Q Consensus 458 ---p~~~ 461 (532) .+++ T Consensus 386 d~~~p~p 392 (392) T protein:vir:10 386 QSNEPVP 392 (392) T ss_pred CCCCCCC Confidence 1112 No 135 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=93.85 E-value=0.0062 Score=32.69 Aligned_cols=329 Identities=13% Similarity=0.057 Sum_probs=124.3 Q ss_pred hhcccccCCCCCccc---ccccccccchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHH Q lcl|NC_015159. 38 YTIPSVFPSATADGS---TSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAM 114 (532) Q Consensus 38 ~~~P~~~~~~~~~~~---~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ 114 (532) -+++- +........ ......+-+.+. ...+.+.++..+ ...++...... ...|.. T Consensus 1 m~m~~-f~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~----~~~v~~~~al~-------~~~v~~---- 58 (392) T protein:vir:39 1 MILPI-LNFINQTNDPPEVGSVQSYFPDGN------DAQIMESLLGDN----NEWVSARAALR-------NSDLFS---- 58 (392) T ss_pred Ccchh-hhhhhcccccccccccccccccCc------hhhhhhhhcCCC----CceechHHhhc-------cHHHHH---- Confidence 11111 110000000 000000000000 001111111100 00111110000 011111 Q ss_pred HHHHHHHH----------------HHhcCC----hHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecceEEEee Q lcl|NC_015159. 115 VERICMNY----------------MESNSF----RPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVER 174 (532) Q Consensus 115 ve~~~~~~----------------l~~snf----~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~ 174 (532) |=..+... +.+-|- +.=+..++.++..+|||++++..+. .+..+.+..++...+-+.. T Consensus 59 ~i~~ia~~ia~lp~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~--~g~~~~L~~l~~~~v~~~~ 136 (392) T protein:vir:39 59 IILQLSSDLAIVKINAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNA--NGADMKWEYLRPSQVNTYY 136 (392) T ss_pred HHHHHHHhhccCceeeccchhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECC--CCcEEEEEEEcCceeEEEE Confidence 11122222 222332 4444556678888999988765331 1223333333334444444 Q ss_pred CCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcccccccccCccccCc Q lcl|NC_015159. 175 DAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCP 254 (532) Q Consensus 175 d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P 254 (532) +.+|... |--++ ..+...... ..+ ..-- T Consensus 137 ~~~~~~~------------------------------------------------~y~~~-~~~~~~~~~-~~~--~~~e 164 (392) T protein:vir:39 137 FEYENGM------------------------------------------------YYNIT-FDDPKIEPI-LQA--PQSD 164 (392) T ss_pred cCCCceE------------------------------------------------EEEEE-ecCccccee-EEE--cccc Confidence 4333210 10011 111111000 011 1223 Q ss_pred eEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCcc-ccChh--------hhccCCC-ceee Q lcl|NC_015159. 255 WIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNG-VTQIR--------RVAKANT-GDFV 324 (532) Q Consensus 255 ~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g-~~~~~--------~~~~~~~-G~~v 324 (532) +++.|+...+|..||.||...+...+.....+.+.......-...|..++.-.+ ....+ .+....+ |.+ T Consensus 165 iih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~- 243 (392) T protein:vir:39 165 LIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGP- 243 (392) T ss_pred EEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCe- Confidence 566777777788999999999999999999999999988888888886654322 11111 1111111 111 Q ss_pred cCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHH Q lcl|NC_015159. 325 AGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLP 403 (532) Q Consensus 325 ~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~P 403 (532) .--.+++...++.. ..+.+. .+..+..+..|-.+|=.......+...-|..+ .+...-....|.|.+.++++++-.- T Consensus 244 ~vl~~g~~~~~l~~~~~d~~~-~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~-~~~~~f~~~~l~P~~~~ie~~l~~~ 321 (392) T protein:vir:39 244 VVLDDLEEFTALEIKSNVAQL-LSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSI-QQISGMYASALNRYLRPAISELEYK 321 (392) T ss_pred eecCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHh Confidence 11123334444442 234443 35666777888888733211111222222211 1122234456667666666666443 Q ss_pred HHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHH-----------HHH-------------HHHHhhc-- Q lcl|NC_015159. 404 LVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLN-----------VFI-------------DYMIKLA-- 457 (532) Q Consensus 404 li~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~-----------~~~-------------~~laq~~-- 457 (532) |+.. +.-+ .......+...++..+..+. ..+ ..+..+. T Consensus 322 L~~~-------------~~~d---~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~e~l~~~~~G 385 (392) T protein:vir:39 322 LSDH-------------ISVN---MRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTG 385 (392) T ss_pred cccc-------------cccc---chhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccchhcCCCCCCCC Confidence 3221 1000 00000011111111111110 000 1121111 Q ss_pred ---chhh Q lcl|NC_015159. 458 ---GLQD 461 (532) Q Consensus 458 ---p~~~ 461 (532) .+++ T Consensus 386 d~~~p~p 392 (392) T protein:vir:39 386 QSNEPVP 392 (392) T ss_pred CCCCCCC Confidence 1112 No 136 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=93.68 E-value=0.0067 Score=32.49 Aligned_cols=368 Identities=12% Similarity=0.052 Sum_probs=141.6 Q ss_pred cCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHH----- Q lcl|NC_015159. 44 FPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERI----- 118 (532) Q Consensus 44 ~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~----- 118 (532) +...+....+........+. +...+.++ .+..+ ++...... ...|..-.+.+... T Consensus 1 M~~f~~~~~~~~~~~~~~~~------~~~~~~~~---~~~~~----v~~~~al~-------~~~V~~~v~~ia~~ia~~p 60 (397) T protein:vir:38 1 MPLLKLNKSHSQGFSLNDPD------WVNFLTGG---EAQKY----VSADTALK-------NSDIFSLIMQLSGDLAMVR 60 (397) T ss_pred CcchhhhhcccCcccCCchh------hhhhhcCC---cCCce----echHHhhc-------cHHHHHHHHHHHHHHhhCc Confidence 33332211111111111111 11111100 00000 11110000 01111111111111 Q ss_pred -------HHHHHHhc----CChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEE Q lcl|NC_015159. 119 -------CMNYMESN----SFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTED 187 (532) Q Consensus 119 -------~~~~l~~s----nf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~ 187 (532) .+..+.+- ..+.-+..+..+|..+|||.+++..+. .+..+.+..+|...+-+..+.+|.. ++.++ T Consensus 61 ~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~--~g~~~~l~~l~~~~v~i~~~~~~~~--~~y~~ 136 (397) T protein:vir:38 61 YTSESDRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNT--NGVDLSWEYLRPSQVQPMLLQDGSG--LIYNI 136 (397) T ss_pred ccccccHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECC--CCcEEEEEEEcCceeEEEEcCCCce--EEEEE Confidence 11112222 234455667778888999988875432 2233445555556666665555432 11111 Q ss_pred eecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCc Q lcl|NC_015159. 188 KIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNED 267 (532) Q Consensus 188 ~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~ 267 (532) .+ .+.. .+....++.++ +++.|.....+.. T Consensus 137 ~~-----------------------------------------------~~~~-~~~~~~~~~~e--iih~~~~~~~~~~ 166 (397) T protein:vir:38 137 NF-----------------------------------------------DEPA-IGYMENVPAAD--VIHIRLLSKNGGK 166 (397) T ss_pred Ee-----------------------------------------------cccc-ccceeEecCcc--EEEecCCCCCCcc Confidence 10 0000 00000111111 4455555566778 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhcc----------CC-CceeecCcccccccccc Q lcl|NC_015159. 268 YGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAK----------AN-TGDFVAGRKQDVEVFQL 336 (532) Q Consensus 268 YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~----------~~-~G~~v~g~~~~~~~~~~ 336 (532) ||.||...+...+.......+.......-...|..++.-++.++.+.... +. .|.++. -.+++...++ T Consensus 167 ~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~v-l~~g~~~~~l 245 (397) T protein:vir:38 167 TGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGGLLDAETRIARSKEISKQIHNSDGPVV-IDALEDYKPL 245 (397) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCcee-cCCCceEEec Confidence 99999999999999999998888888888888887777555554443211 11 121111 1233444444 Q ss_pred CCc-cchhHHHHHHHHHHHHHHHHHhhhhcccCCCCC-CCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015159. 337 EKY-NDFQVAKATADDIEKRLSYAFMLNSAVQRGGDR-VTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQA 414 (532) Q Consensus 337 ~~~-~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~-~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r 414 (532) ... .+.+ ..+..+..+..|-.+|-.......+... .+..| +...-....|-|.+..+..+|-.- T Consensus 246 ~~~~~d~~-~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~e--~~~~~~~~~l~P~~~~ie~~ln~~----------- 311 (397) T protein:vir:38 246 EVKGNIAS-LLNQVDWTRDQIAKVYGVPDSYLNGQGDQQSSIT--QISGQYAKSLNRYVQAIVGELNDK----------- 311 (397) T ss_pred CCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHH--HHHHHHHHHHHHHHHHHHHHHHHh----------- Confidence 432 3343 3556677888898888443211111111 12222 111122234444444444443222 Q ss_pred cCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHH Q lcl|NC_015159. 415 TSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAK 494 (532) Q Consensus 415 ~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~ 494 (532) ++++.. +.......-+...|+..++.+. +. -.+..+++-+ .+|.+|- -.. +..... T Consensus 312 --l~~~~~---~~~~~~~~~d~~~~~~~~~~~~-------~~-----G~~t~nE~R~----~lg~~p~--~~~-d~~~~~ 367 (397) T protein:vir:38 312 --LHANIS---ANIRFAIDAMGDQYASTISSSV-------KG-----GTIAGNQARF----ILQNSGY--LAK-DLPDPE 367 (397) T ss_pred --ccChhc---ccccccccCCHHHHHHHHHHHH-------hC-----CCcCHHHHHH----HhCCCCC--CCC-cccccc Confidence 233211 1111111112333333222221 11 1234444332 2344331 111 100000 Q ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCC Q lcl|NC_015159. 495 MAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPT 531 (532) Q Consensus 495 ~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 531 (532) .. ...+.......++........++.+-|+ T Consensus 368 ~~-------~~~~~~~~~~~~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 368 KE-------PQQAIQLIQQEGGENDGNNSDERGSDPE 397 (397) T ss_pred cc-------ccccccccccccCCCCCCCCCCCCCCCC Confidence 00 0000001111112222222333344444 No 137 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=93.64 E-value=0.0069 Score=32.44 Aligned_cols=320 Identities=14% Similarity=0.105 Sum_probs=123.0 Q ss_pred HHHhhcC------------CCCCccccCCChHHHhhhc-------cChh---HHHHHHHHHHHHHHH------------H Q lcl|NC_015159. 74 LMLALFP------------VGSSFFKLNVSELEVKQSI-------TSPE---ELTEIATGLAMVERI------------C 119 (532) Q Consensus 74 l~~~ltp------------p~~~WF~l~~~d~~~~~~~-------~~~~---~~~~v~~~L~~ve~~------------~ 119 (532) |+.++|. .-..||.- ..++.+-... .... ....|..-.+.+... . T Consensus 1 m~m~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~~~~~ 79 (392) T protein:vir:74 1 MILPILNFINQTNDPPEAGSVQSYFPD-GNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN 79 (392) T ss_pred CcchhhhhhhcccCccccccccccccc-CchhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHhhccCceeeccchh Confidence 2222221 00011100 0000000000 0000 001122111111111 1 Q ss_pred HHHHHhcCC----hHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhh Q lcl|NC_015159. 120 MNYMESNSF----RPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALP 195 (532) Q Consensus 120 ~~~l~~snf----~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~ 195 (532) ...+.+-|- +.=+...+.++..+||++.|+..+. .+....+..++...+-+..+.+|.. T Consensus 80 ~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~--~G~~~~L~~i~~~~v~v~~~~~~~~--------------- 142 (392) T protein:vir:74 80 QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNA--NGADMKWEYLRPSQVNTYYFEYENG--------------- 142 (392) T ss_pred hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECC--CCcEEEEEEEcCceeEEEEcCCCce--------------- Confidence 111222332 4445556678888899887764331 1222233333334444444433321 Q ss_pred HHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHH Q lcl|NC_015159. 196 EDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEE 275 (532) Q Consensus 196 ~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~ 275 (532) .|-.++ ..+..... ...+. .--+++.|+...+|..||.||..- T Consensus 143 ---------------------------------~~y~~~-~~~~~~~~-~~~~~--~~evih~~~~~~~~~~~G~s~i~~ 185 (392) T protein:vir:74 143 ---------------------------------MYYNIT-FDDPKIEP-ILQAP--QSDLIHMKLLSIDGGKTGISPLYS 185 (392) T ss_pred ---------------------------------EEEEEE-ecCCccce-eEEEc--CccEEEecCCCCCCccccccHHHH Confidence 110111 11111000 00111 112456666667788899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCceeecC-ccccChh--------hhccCCC-ceeecCccccccccccCC-ccchhH Q lcl|NC_015159. 276 YLGDLKSLENLYEAIVKMSMISSKVLFFVNP-NGVTQIR--------RVAKANT-GDFVAGRKQDVEVFQLEK-YNDFQV 344 (532) Q Consensus 276 al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~-~g~~~~~--------~~~~~~~-G~~v~g~~~~~~~~~~~~-~~~~~~ 344 (532) +...+.......+.......-...|..++.- ++....+ .+....+ |.+ .--.++....++.. ..+.+. T Consensus 186 ~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~n~g~~-~vl~~g~~~~~l~~~~~d~q~ 264 (392) T protein:vir:74 186 LRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGP-VVLDDLEEFTALEIKSNVAQL 264 (392) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCe-eecCCCceEEEccCChhHHHH Confidence 9999999999999999988888888866542 2222211 1111111 111 11123333344442 334443 Q ss_pred HHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc Q lcl|NC_015159. 345 AKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKE 424 (532) Q Consensus 345 ~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~ 424 (532) .+..+..+..|-++|-.......+...-|.. +.+...-....|.|.+.++.+++-.-|+..+ +... .. T Consensus 265 -~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~-~e~~~~~~~~~l~p~~~~ie~~l~~~l~~~~-----~~~~-----~~ 332 (392) T protein:vir:74 265 -LSQTDWTSKQYAKVYGLPDSYIGGQGDQQSS-IQQISGMYASALNRYLRPAISELEYKLSDHI-----SVNM-----RP 332 (392) T ss_pred -HHHHHHHHHHHHHHhCCCHHHhCCCCCcccH-HHHHHHHHHHHHHHHHHHHHHHHHHhccchh-----cccc-----hh Confidence 4556677788888884322111122222221 1122223455677777777666644332110 0000 00 Q ss_pred ccccee---ecchHHHH------HHHHHHHHH---------HHHHHHHhh-----cchhh Q lcl|NC_015159. 425 AVEPAI---ATGLEALG------RGHDLNKLN---------VFIDYMIKL-----AGLQD 461 (532) Q Consensus 425 ~~~~~~---v~~l~~l~------raq~~~~l~---------~~~~~laq~-----~p~~~ 461 (532) .+.... ...++.|. +++-.+-+. ....++..+ ..+++ T Consensus 333 ~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pne~r~~enl~~~~~Gd~~~p~p 392 (392) T protein:vir:74 333 AIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQSNEPVP 392 (392) T ss_pred hhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCccccchhcCCCCCCCCCCCCCCC Confidence 000000 00111111 111100000 000112111 11233 No 138 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=93.58 E-value=0.0071 Score=32.37 Aligned_cols=369 Identities=12% Similarity=0.054 Sum_probs=143.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccccccc--ccccchHHHHHHHHHHHHHHhh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYT--TPWQSIGARGLNNLASKLMLAL 78 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~--~~~dst~~~a~~~Laa~l~~~l 78 (532) |.=- +.++..++.-...-.+...++.|...... ..+..-.. -.-.++--.|++.+|+.+.+. T Consensus 1 M~~f--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~- 64 (386) T protein:vir:48 1 MPIF--------------NITNLATESPPISQGGFFDITDPDFLSTL-NGSEWVSAESALRNSDLFSIINQLSNDLATV- 64 (386) T ss_pred Cccc--------------ccccccccccccccccccccccchhcccc-cCCceechhhhhcchHHHHHHHHHHHhhccC- Confidence 4322 22222222111111111122222111110 11111011 112344345666666655442 Q ss_pred cCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCCh----HHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 79 FPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFR----PTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 79 tpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~----~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) |+ ++ -+.. .. ..+.+-|.+ .-+..++.++...||+.+|+..+. T Consensus 65 -----p~-~~--~~~~-------------~~-----------~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~- 111 (386) T protein:vir:48 65 -----KL-TA--SRKQ-------------LQ-----------GIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNE- 111 (386) T ss_pred -----ce-ee--ccch-------------hH-----------HHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECC- Confidence 22 11 1100 11 123334433 333455678888999988876542 Q ss_pred ccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEE Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQ 234 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~ 234 (532) .+....+..+|...+-+.++.+|... +.+ + T Consensus 112 -~g~~~~L~~l~~~~v~v~~~~~~~~~--~y~-----------------------------------------------~ 141 (386) T protein:vir:48 112 -NGRDMKWEYLRPSQVSFNRLDNKDGI--YYN-----------------------------------------------I 141 (386) T ss_pred -CCcEEEEEEecCceeEEEEcCCCceE--EEE-----------------------------------------------E Confidence 23344555555566666665544211 111 1 Q ss_pred EEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhh Q lcl|NC_015159. 235 EIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRR 314 (532) Q Consensus 235 ~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~ 314 (532) ..++..... ...+. .--+++.|....++..||.||..-+...+.....+.+.......-...|..++..++.++.+. T Consensus 142 ~~~~~~~~~-~~~~~--~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~e~ 218 (386) T protein:vir:48 142 TFDDPRIPP-KQHVP--QGDVLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGLLDF 218 (386) T ss_pred EecCccccc-eeEec--CccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHH Confidence 111111000 00111 112455565556777999999999999999999999999998888888988887766666654 Q ss_pred hccC---------CCceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhhcccC-CCCCCCHHHHHHHHH Q lcl|NC_015159. 315 VAKA---------NTGDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNSAVQR-GGDRVTAEEIRYVAG 383 (532) Q Consensus 315 ~~~~---------~~G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~-~~~~~TAtEi~~r~~ 383 (532) .... ..|.++. -.+++...++.. ..+.+ ..+..+..++.|-.+|-....... .+..-+++|-.. . T Consensus 219 ~~~~~~~~~~~~~n~g~~~v-l~~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~e~~~~--~ 294 (386) T protein:vir:48 219 KTKLSRSRQAMKQMQGGPLV-LDDLEEFTPLEIKSNVSQ-LLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSL--D 294 (386) T ss_pred HHHHHHHHHHhhcCCCCcee-cCCCceEEEcCCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHH--H Confidence 3211 0111110 012233334432 22333 345566677788888743211111 111112222211 1 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhh Q lcl|NC_015159. 384 ELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDD 463 (532) Q Consensus 384 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~ 463 (532) -....|.|++..+.+++-.-| ++.+..+... . ...+...++..++.+.. .+. T Consensus 295 ~~~~~l~P~~~~ie~~l~~~l-------------~~~~~~~~~~-~--~~~d~~~~~~~~~~l~~------------~g~ 346 (386) T protein:vir:48 295 LYNKAVSRYLRPFLSELSQKL-------------SCDVDADILP-A--VDPTGSNSVSRINSMVK------------SGT 346 (386) T ss_pred HHHHHHHHHHHHHHHHHHHhh-------------cchhhcchhh-h--hccChHHHHHHHHHHHh------------CCC Confidence 233344555555554443322 2222111100 0 00111122222222211 011 Q ss_pred cCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHH Q lcl|NC_015159. 464 INLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQA 518 (532) Q Consensus 464 id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~ 518 (532) +..+++- .+...-|+++..+. . .+.. ..... .-|...++. T Consensus 347 ~t~nE~r-~~lg~~~~~~~~~~-~---~~~~------~~~~~----~gGd~~~~~ 386 (386) T protein:vir:48 347 LAQNQGL-YILQQAEILPKELP-E---GENP------NKTTL----KGGEINGED 386 (386) T ss_pred cCHHHHH-HHhhcCCCCCccch-h---hcCC------CCCcc----CCCCCCCCC Confidence 1222211 11111112111000 0 0000 00000 000000000 No 139 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=93.45 E-value=0.0075 Score=32.23 Aligned_cols=349 Identities=12% Similarity=0.073 Sum_probs=137.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccccc-ccccccchHH-HHHHHHHHHHHHhh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTS-YTTPWQSIGA-RGLNNLASKLMLAL 78 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~-~~~~~dst~~-~a~~~Laa~l~~~l 78 (532) |.=-++....+... ..+... +..+..|...... ..+..- ..+.....+. .|++.+|+.+ +.+ T Consensus 1 Mglf~~~~~~~~~~-----------~~~~~~---~~~~~~~~~~~~~-~~~~~v~~~~al~~~~V~~~i~~Ia~~i-a~l 64 (384) T protein:vir:49 1 MPIFNITNLATESP-----------PSNQDS---FFDITDPEFLDAL-NGSEWVSAETALKNSDLFSIISQLSNDL-ATA 64 (384) T ss_pred CccccccccCcccc-----------cccchh---hccccchhhcccc-cCCceechhhhhccHHHHHHHHHHHHHH-hhC Confidence 44322211111000 000111 1222233222111 111111 1112222333 3444444433 333 Q ss_pred cCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhc----CChHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 79 FPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESN----SFRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 79 tpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) || ++ .+.... ..+.+- +.+.=+...+.++...|||.+++..+. T Consensus 65 -----~~-~~--~~~~~~------------------------~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~- 111 (384) T protein:vir:49 65 -----KI-TT--SRKQLQ------------------------GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNE- 111 (384) T ss_pred -----ce-ee--ecchhh------------------------hhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECC- Confidence 22 11 111100 011122 233444566778888999988876432 Q ss_pred ccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEE Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQ 234 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~ 234 (532) .+....+..+|...+-+..+.++.. ++.+ + T Consensus 112 -~g~~~~L~~l~~~~v~v~~~~~~~~--~~y~-----------------------------------------------~ 141 (384) T protein:vir:49 112 -NGRDMKWEYLRPSQVSFNRLDNQNG--LYYN-----------------------------------------------I 141 (384) T ss_pred -CCcEEEEEEEcCceeEEEEcCCCce--EEEE-----------------------------------------------E Confidence 2233334444444454443332211 0100 0 Q ss_pred EEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhh Q lcl|NC_015159. 235 EIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRR 314 (532) Q Consensus 235 ~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~ 314 (532) ...+... +....+ ..-=+++.|+...++..||.||...+...+.......+.......-...|..++.-.+....+. T Consensus 142 ~~~~~~~-~~~~~~--~~~eVih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~ 218 (384) T protein:vir:49 142 TFDDPRI-PPKQHV--PQGDILHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDF 218 (384) T ss_pred EecCccc-cceeEe--cCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHH Confidence 0111000 000001 1112455666666778999999999999999999999988888888888887776555444322 Q ss_pred hc--------cCC-CceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--cccCCCCCCCHHHHHHHH Q lcl|NC_015159. 315 VA--------KAN-TGDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--AVQRGGDRVTAEEIRYVA 382 (532) Q Consensus 315 ~~--------~~~-~G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~~~~~TAtEi~~r~ 382 (532) .. ... .|.++. -.+++...++.. ..+.+ ..+..+..++.|-++|-.-. +...+...-|++.+.+.. T Consensus 219 ~~~~~~~~~~~~~n~~~~~v-l~~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~ 296 (384) T protein:vir:49 219 KTKQSRSRQAMKQMQGGPLV-LDDLEDFTPLEIKSNVAQ-LLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIY 296 (384) T ss_pred HHHHHHHHHhcccCCcccee-cCCCceEEEccCChhhHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHH Confidence 10 111 122111 112233334332 33444 34666778888988884321 111222334555444333 Q ss_pred HH-HHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHH------HHHHHHHHHH------- Q lcl|NC_015159. 383 GE-LEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALG------RGHDLNKLNV------- 448 (532) Q Consensus 383 ~E-~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~------raq~~~~l~~------- 448 (532) .. ....|-|+.++++.+|..-+.. .+.+....+... +...++.|- |++-.+.+.. T Consensus 297 ~~~i~~~l~pi~~~i~~~l~~~l~~---------~~~~~~~~~~~~--~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~ne 365 (384) T protein:vir:49 297 FKAVSRFLRPFVSELSKKLSCEVDA---------DILPAVDPTGSN--YIGLINSMVKTGTLAQNQGLYVLQQAEILPKD 365 (384) T ss_pred HHHHHHHHHHHHHHHHHHhchhhhh---------hhhhhhhccchH--HHHHHHHHhhcCcccHHHHHHHHhhCCCCChh Confidence 22 2334566666666665432210 011111000000 011112221 1111111110 Q ss_pred H--HHHHHhhc-chhhhhc Q lcl|NC_015159. 449 F--IDYMIKLA-GLQDDDI 464 (532) Q Consensus 449 ~--~~~laq~~-p~~~d~i 464 (532) . ...+..+. ++.-+.- T Consensus 366 ~r~~~~~~p~~gGd~~~~~ 384 (384) T protein:vir:49 366 LPEGETDSTLKGGETNEQY 384 (384) T ss_pred HHHHcCCCCCCCCCCCCCC Confidence 0 00111111 1111222 No 140 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=93.28 E-value=0.0081 Score=32.04 Aligned_cols=447 Identities=11% Similarity=0.027 Sum_probs=190.1 Q ss_pred CCCCCCCccC------HHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccc-----ccccc--cccchHHHHH Q lcl|NC_015159. 1 MAEVEKTGFA------ADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGS-----TSYTT--PWQSIGARGL 67 (532) Q Consensus 1 m~~~~~~~~~------~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~-----~~~~~--~~dst~~~a~ 67 (532) |.=+.+.-.+ +....+.|+....-| +|+- .|...++...... .+... .-++.+..++ T Consensus 1 m~~~~~~~~a~~~~~~~~~~~~~y~aa~~~~-----~~~~-----~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av 70 (495) T protein:vir:10 1 MNMTPSGYQSLASGLLVPVGASAYEGASGGH-----RWQD-----IGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAV 70 (495) T ss_pred CCcccccccccchhhhhHHHhhhhhccccCc-----ccCC-----CCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHH Confidence 6666542111 111111122221111 1110 0111111100000 11112 3477788889 Q ss_pred HHHHHHHHH-hhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCcee Q lcl|NC_015159. 68 NNLASKLML-ALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVL 146 (532) Q Consensus 68 ~~Laa~l~~-~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~ 146 (532) +.+++.+++ +++|..++ .++.+.+. -...-+.|.+.| ++-.+.+||.....++..+++-|-++ T Consensus 71 ~~~~~~vVG~Gi~p~~~~------~~~~~~~~-----ie~~w~~wa~~~-----D~~g~~~f~~lq~l~~r~~~~dGE~f 134 (495) T protein:vir:10 71 ATWVAAAVGNGLTPRWRM------KEQELRQE-----LQELWGDWVNEA-----DFDEVQSFYGLQALVVRTVINSGEAF 134 (495) T ss_pred HHHHHhhcCCCcccccCC------chHHHHHH-----HHHHHHHhhcCc-----ccccccCHHHHHHHHHHHHHhCCceE Confidence 988887754 56665443 34333321 112334444332 34457789999999999999988876 Q ss_pred eeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCC Q lcl|NC_015159. 147 LYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPE 226 (532) Q Consensus 147 ~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~ 226 (532) +-+.......+..+.+ +-+-+..+.|+-.... ...++. -.|...|+.|.. T Consensus 135 ~~~~~~~~~~g~~~~~----------------------~lqliepd~l~~~~~~-------~~~~~g-~~i~~GIe~d~~ 184 (495) T protein:vir:10 135 VIKKPRPLSEGLSVPL----------------------QLQIIEPDMLASDIPD-------ETLPSG-GYVKGGIRFSNG 184 (495) T ss_pred EEEeecccCCCCccce----------------------EEEEechhhcCCCCCC-------CCCCCC-CEEEeceEECCC Confidence 5322111001111111 1122222222110000 001112 236778888888 Q ss_pred CCeEEEEEEEcCcccccccccC--ccccCc--eEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_015159. 227 AMVFRSYQEIDGEIVAGTEGEY--PLDSCP--WIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLF 302 (532) Q Consensus 227 ~~~~~s~~~~~~~~~~~~~~~~--g~~~~P--~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~ 302 (532) ++|-..+... ........... .+...| -+++-|.+.+|..-|.+..- .+-.++.|+....+.+.++..++.... T Consensus 185 Gr~vaY~i~~-~hpgd~~~~~~~~~~~rvpA~~vlH~f~~r~gQ~RGis~la-~i~~l~~l~~y~dael~~a~i~A~~~~ 262 (495) T protein:vir:10 185 GKRKAYCFYR-NHPAESSLIGDPVDTVWIKAEHVLHVTVLTVRSDAGAPWFQ-LLLRLNELDQYEDAELVRKKTAALFAA 262 (495) T ss_pred CceEEEEEee-cCCCcccccccccceeeechhheEeccccCCCcccCcchhH-HHHHHHHhhHHHHHHHHHHHHhhhhee Confidence 8765544332 11111110000 112233 24444778899999998665 455799999999999999999988886 Q ss_pred eecC-ccccCh-------------hhhccCCCceeecCcccc-ccccccC-CccchhHHHHHHHHHHHHHHHHH-h-hhh Q lcl|NC_015159. 303 FVNP-NGVTQI-------------RRVAKANTGDFVAGRKQD-VEVFQLE-KYNDFQVAKATADDIEKRLSYAF-M-LNS 364 (532) Q Consensus 303 lv~~-~g~~~~-------------~~~~~~~~G~~v~g~~~~-~~~~~~~-~~~~~~~~~~~i~~~~~rI~~af-~-~~~ 364 (532) .+.. ++--.. ......++|.|..-.++. +...... ..++|. .....+...|-.++ + +.. T Consensus 263 fi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~---~f~~~~lr~iaaglGi~Ye~ 339 (495) T protein:vir:10 263 FIQEATADSTGGPTIGQPKRSKGGKRITGLNPGTLQYLQPGQEVKFSNPADVGTTYE---PWLRYQLLSIAKGYGITYEM 339 (495) T ss_pred eeecCCCccccccccCccccccCcccceecCCceeeecCCCCeeeeeCCCCCCCCHH---HHHHHHHHHHHhhcCCCHHH Confidence 6542 111000 011223455554433322 3322211 222332 33334444555554 1 222 Q ss_pred cccCCCCCCCHHHHHHHHHHHHHHhhhhHHH-HHHHHHHHHHHHHHHHHHhcCCCCCCccc-----cccceee----cch Q lcl|NC_015159. 365 AVQRGGDRVTAEEIRYVAGELEDTLGGVYSL-LSQELQLPLVKILLKELQATSKIPNLPKE-----AVEPAIA----TGL 434 (532) Q Consensus 365 ~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~r-l~~E~l~Pli~r~~~il~r~g~lp~~p~~-----~~~~~~v----~~l 434 (532) + ..|-..++=.=+++-..|..+.+-..=.+ +...|..|+..+++..+...|.|+.++-- ...+..+ ..+ T Consensus 340 l-tgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~v 418 (495) T protein:vir:10 340 L-TGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEV 418 (495) T ss_pred H-hcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCcccc Confidence 2 24555565555555555555554443222 45568889999999999999998754211 1122222 123 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHH-HhhhH Q lcl|NC_015159. 435 EALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAG-QQMGA 513 (532) Q Consensus 435 ~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~-~~~~~ 513 (532) +|+-.++..... +. +. + . -...++...|.|+. |+..++.++.+....-... ...+. T Consensus 419 DP~Ke~~A~~~~---i~--~G--------~--~-s~~~~~a~~G~D~~-------~v~~q~a~e~~~~~~~Gl~~~~~p~ 475 (495) T protein:vir:10 419 DPLKKHLADLGD---VR--AG--------F--A-PISDKQAERGYDME-------ELFDMISDANQLIDEYDLRLDSDPR 475 (495) T ss_pred ChHHHHHHHHHH---HH--cC--------C--C-CHHHHHHHcCCCHH-------HHHHHHHHHHHHHHHcCCCCCCCCC Confidence 343322211000 00 00 0 0 01122233466553 2322222222211111110 00000 Q ss_pred H-HHHHHHhhcccccCCCCC Q lcl|NC_015159. 514 A-GGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 514 ~-~~~~~~~~~~~~~g~~~~ 532 (532) . .+.++...-.+.+.-..| T Consensus 476 ~~~~~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 476 YVNGSGAEQKSVMEAALNNE 495 (495) T ss_pred cCCCccCCCCCCCCCCCCCC Confidence 0 000000011111111111 No 141 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=92.77 E-value=0.01 Score=31.54 Aligned_cols=477 Identities=12% Similarity=-0.011 Sum_probs=201.7 Q ss_pred CCCCCCCcc--CHHHHHHHHHHHHHHhhhHHHHHH-HHHHhhcccccCCCC-Cccc----ccccc--cccchHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGF--AADGAAAAYNRLKNDRGAYETRAE-DCATYTIPSVFPSAT-ADGS----TSYTT--PWQSIGARGLNNL 70 (532) Q Consensus 1 m~~~~~~~~--~~~~~~~r~~~lk~~R~~~e~~w~-e~~~~~~P~~~~~~~-~~~~----~~~~~--~~dst~~~a~~~L 70 (532) |.=.+|... +.....+|...- ..++.|+.--. .....--|.+..+.. .... .+... .-++.+..+++.+ T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar-~~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~ 79 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAR-EAIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRL 79 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhH-HHhccccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 655544332 222222331111 11122221100 000000000000000 0000 11111 2577889999999 Q ss_pred HHHHHH--hhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeee Q lcl|NC_015159. 71 ASKLML--ALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLY 148 (532) Q Consensus 71 aa~l~~--~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~ 148 (532) ++.+++ ++.+..++ +..++..-.+.. ..-...-+.|.+.| +.-.+.+||.....++...++-|-+++. T Consensus 80 ~~nvVG~~G~~i~p~~---l~~d~~~a~~l~--~~ie~~w~~Wa~~~-----D~~g~~~f~~lq~l~~R~~~~dGE~f~~ 149 (548) T protein:vir:95 80 EERVVGGSGIGVEPLP---LRLDGSVHAELA--MEIRSAWAEWSLSP-----ETSGELTRPQVERLMCRTWLRDGEGLAQ 149 (548) T ss_pred HHhccCccccceeeee---cCCCHHHHHHHH--HHHHHHHHHhhcCc-----cccccCCHHHHHHHHHHHHHhCCceEEE Confidence 999997 34443333 222221111110 00011123333222 2334678999999999999998887654 Q ss_pred ecccccc---cCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeC Q lcl|NC_015159. 149 IPSTEQV---EGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDP 225 (532) Q Consensus 149 v~~~~~~---~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~ 225 (532) +...... .+..+.++. +-+..+.|+-. ... +.. .|+..|+.|. T Consensus 150 ~~~~~~~~~~~g~~~~~~l----------------------qliepd~l~~~--------~~~--~~~--~i~~GIE~D~ 195 (548) T protein:vir:95 150 KLMGRVPNYTFATSVPFAL----------------------ELLEPDYLPFS--------YNN--LSK--GIVQGIERDT 195 (548) T ss_pred eeecccccccCCcccceEE----------------------EEechhhcCCC--------CCC--CCC--ceeeeeEECC Confidence 3221100 011111111 11222222100 000 011 3677888888 Q ss_pred CCCeEEEEEEEcCcccccc-c-ccCccccCce--EEEE-eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015159. 226 EAMVFRSYQEIDGEIVAGT-E-GEYPLDSCPW--IPVR-LIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKV 300 (532) Q Consensus 226 ~~~~~~s~~~~~~~~~~~~-~-~~~g~~~~P~--~~~R-w~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p 300 (532) .+.|...+.. ........ . +..-+...|- +++- ....+|..-|.+..--+|..++.|.....+.+.++..++.. T Consensus 196 ~Grp~aY~i~-~~hPgd~~~~~~~~~~~rvpA~~VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~ 274 (548) T protein:vir:95 196 WRRKRAYHLL-KDHPGNLQTLGGSLAVKRVEAERIIHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAAL 274 (548) T ss_pred CCceEEEEEe-ecCCCcccccccccceeeechhHheecccccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhh Confidence 8887654433 22211100 0 0001112222 2222 34568899999999999999999999999999999999988 Q ss_pred ceeecCc-cc--------cChhhhccCCCceeecC-cc-ccccccccC-CccchhHHHHHHHHHHHHHHHHH--hhhhcc Q lcl|NC_015159. 301 LFFVNPN-GV--------TQIRRVAKANTGDFVAG-RK-QDVEVFQLE-KYNDFQVAKATADDIEKRLSYAF--MLNSAV 366 (532) Q Consensus 301 ~~lv~~~-g~--------~~~~~~~~~~~G~~v~g-~~-~~~~~~~~~-~~~~~~~~~~~i~~~~~rI~~af--~~~~~~ 366 (532) ...+..+ +- -.........||.+++. .+ .++...... ..++| ......+...|..++ =+..+ T Consensus 275 a~fi~~~~~~~~~~~~~~~~~~~~~~~~pG~iv~~L~pGe~i~~~~p~~p~~~~---~~f~~~~lr~IAaglGipYe~l- 350 (548) T protein:vir:95 275 AMYIKKGNPDSYTVEPGKDRKNRTIPIAPGMVFDDLEPGEDVGMIESNRPNPFL---EGFRNGQLRMIGAGTRSTYSSV- 350 (548) T ss_pred eeeeecCCCccccCCCCcccccccccccCCccccccCCCceeeecCCCCCCCCH---HHHHHHHHHHHHhhcCCCHHHH- Confidence 8766531 10 01111122345665442 22 233333222 12233 233444445555554 11222 Q ss_pred cCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc----cccceee----cchHHHH Q lcl|NC_015159. 367 QRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKE----AVEPAIA----TGLEALG 438 (532) Q Consensus 367 ~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~----~~~~~~v----~~l~~l~ 438 (532) ..|-. .+=.=+++-..|..+.+...=..+...|+.|+..+++..+...|.|+-+... ......+ ..|+|+- T Consensus 351 tgD~s-~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~K 429 (548) T protein:vir:95 351 SRAYD-GTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMH 429 (548) T ss_pred hcccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHH Confidence 13332 2445555555555555555555566778999999999999999998743221 2233332 2356654 Q ss_pred HHHHH-HHHHHHHHHHHhhcchhhhhcCHHHHHHHHH------HhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_015159. 439 RGHDL-NKLNVFIDYMIKLAGLQDDDINLLDVKMRLA------NSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQM 511 (532) Q Consensus 439 raq~~-~~l~~~~~~laq~~p~~~d~id~d~~~~~~a------~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~ 511 (532) .++.. ..+.+=+.+..++.-+ .=.|++++++.++ +.+|++...--+........ +.. ...+........ T Consensus 430 ea~A~~~~i~~Gl~T~~~~~a~--~G~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~~~ 505 (548) T protein:vir:95 430 EANAWELLVKAGFADEAEVARA--RGRDPRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGM-DPV-EAVQKVYLGVGK 505 (548) T ss_pred HHHHHHHHHHcCCCCHHHHHHH--hCCCHHHHHHHHHHHHHHHHHcCCCCCCccccccccccc-CCC-Cchhhhcccccc Confidence 43322 1111111111111000 0134444444443 44555321111110000000 000 000001111111 Q ss_pred hHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 512 GAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 512 ~~~~~~~~~~~~~~~~g~~~~ 532 (532) +.++..+-.....--+|+|-- T Consensus 506 ~~~~~~~~~~~~~~~~~~~~~ 526 (548) T protein:vir:95 506 MLTADEARELVNRYGAGLPVP 526 (548) T ss_pred ccccchhHHhhccCCCCCcCC Confidence 112222222222233344322 No 142 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=92.27 E-value=0.012 Score=31.09 Aligned_cols=303 Identities=10% Similarity=0.063 Sum_probs=116.0 Q ss_pred eEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCccccccc-ccCccc--cC--- Q lcl|NC_015159. 180 VLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTE-GEYPLD--SC--- 253 (532) Q Consensus 180 vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~-~~~g~~--~~--- 253 (532) |-+++++.. +..+.+ ..+.+++-..-..+.+..++....... +..|.+ .. T Consensus 1 v~Eivw~~~-----------------------~g~~~~-~~l~~r~~~~~~~f~~~~~~~l~~~~~~~~~g~~~~~lp~~ 56 (355) T protein:vir:78 1 MFEQVYRIE-----------------------NGRARL-GKLAWRPPRTISRFDVAPDGGLVAIEQWGVFGKATVRIPVD 56 (355) T ss_pred CeEEEEEee-----------------------CCeEEE-eeeeecCccceeeeeeccCCceeEEEecCCCCCCcceeccC Confidence 222222211 000111 111111111000111222222211111 111111 12 Q ss_pred ceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc-eeecCccccC--h------------------ Q lcl|NC_015159. 254 PWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVL-FFVNPNGVTQ--I------------------ 312 (532) Q Consensus 254 P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~-~lv~~~g~~~--~------------------ 312 (532) =|+++|....+|+.||.|....+..-..--+...+.-+..+++-.-|. +..-+.|... . T Consensus 57 kfi~~~~~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~ 136 (355) T protein:vir:78 57 RLVVFVNEREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQ 136 (355) T ss_pred CEEEEEeCCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHH Confidence 278999999999999999999999988888888999999999876554 3322322110 0 Q ss_pred --hhhccCC-CceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCC---CCCCCHHHHHHHHHHHH Q lcl|NC_015159. 313 --RRVAKAN-TGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRG---GDRVTAEEIRYVAGELE 386 (532) Q Consensus 313 --~~~~~~~-~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~---~~~~TAtEi~~r~~E~~ 386 (532) ..+..+. .|.++|-+ ..+..+... +.-......|+.+.+.|+++++...+.... +......|++.... . T Consensus 137 ~~~~i~~g~~a~~iip~g-~~ie~~ea~--g~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~vh~~v~--~ 211 (355) T protein:vir:78 137 LAKEFRAGEAAGGYIPHG-ANFTLTGVQ--GKLPEMDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGDTFASFF--T 211 (355) T ss_pred HHHHhhCCcceeEeecCC-ceEEEeecC--CCcccHHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHHHHHHHH--H Confidence 0111111 24455533 345555432 222224468999999999999876554322 22234456543221 1 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCH Q lcl|NC_015159. 387 DTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINL 466 (532) Q Consensus 387 ~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~ 466 (532) .++-.-...+...|..-||..++.+- .|--.+.|. ..+ ..+.. +...+...++.+..+.- .+.. T Consensus 212 ~~~~aD~~~i~~~ln~~li~~l~~lN--~~~~~~~P~----~~~-~~~~~-----~~~~~a~~~~~l~~~G~----~~~~ 275 (355) T protein:vir:78 212 GSLNAVMKHIADVTQQHVVEDLVDQN--WGPEEPAPR----LVP-AQLGK-----EQPVTAEAIRALVECGA----FTAD 275 (355) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhc--CCCCCCCCE----EEe-cCcCh-----hHHHHHHHHHHHHhCCC----cccc Confidence 11112222222222233333333322 121122221 111 01111 11122333444443321 1223 Q ss_pred HHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHH-HHHHhhcc--cccCCCCC Q lcl|NC_015159. 467 LDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGG-QAAAAMMQ--QQAGLPTQ 532 (532) Q Consensus 467 d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~g~~~~ 532 (532) +....++.+.+|+|.. - ..+++++.-.+ .+...++.....+...+ .+.+.... .+.-+.++ T Consensus 276 ~~~~~~~~e~~gip~p-~-~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~a~~~~a~~~~~~~~~ 339 (355) T protein:vir:78 276 PELEKDLRARYGLPAP-A-ERDDGADAAAA---KAAGRRRAKRLPGQRQGAALPSRSPRADPPRRRGPL 339 (355) T ss_pred HHHHHHHHHHhCCCCC-C-CCCcccCCccc---cccccccccccCCccccccccccCCCCCChhhhHHH Confidence 4456677788998532 1 11112111000 00000000000000000 00000000 00000000 No 143 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=91.89 E-value=0.014 Score=30.78 Aligned_cols=448 Identities=9% Similarity=0.032 Sum_probs=195.6 Q ss_pred CCCCCCCcc---CHHHH---HHHHHHHHHHhhhHHHHHHHHHHhhccc-ccCCCCCc-cc----ccccc--cccchHHHH Q lcl|NC_015159. 1 MAEVEKTGF---AADGA---AAAYNRLKNDRGAYETRAEDCATYTIPS-VFPSATAD-GS----TSYTT--PWQSIGARG 66 (532) Q Consensus 1 m~~~~~~~~---~~~~~---~~r~~~lk~~R~~~e~~w~e~~~~~~P~-~~~~~~~~-~~----~~~~~--~~dst~~~a 66 (532) |..+-...+ ++-.. .+.|....+.+.. ..+.|- |. +..+.... .. .+... .-++.+..+ T Consensus 1 ~~~p~~~~~~~~~~~~~~~~~~~y~~~a~~~~~------~~~~w~-p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~a 73 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMTSLREYAGYHGGGSGFGG------QLRSWN-PPSESVDAALLPNFTRGNARADDLVRNNGYAANA 73 (533) T ss_pred CCCchhhhhhcccccchHHHHHhhhhccCCCCC------cccccc-cCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHH Confidence 665532211 11001 0111111111111 111111 11 11110000 00 11112 347788899 Q ss_pred HHHHHHHHHHh-hcCCCCCccc-cCCChHHHhhhccChhHHHHHHHHHHHHHHHHH----------HHHHhcCChHHHHH Q lcl|NC_015159. 67 LNNLASKLMLA-LFPVGSSFFK-LNVSELEVKQSITSPEELTEIATGLAMVERICM----------NYMESNSFRPTLHA 134 (532) Q Consensus 67 ~~~Laa~l~~~-ltpp~~~WF~-l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~----------~~l~~snf~~~~~~ 134 (532) ++.+++.+++. ++|..+|=.+ |...+.. .++|-..||+.-. +.=.+.+||..... T Consensus 74 v~~~~~nvVG~Gi~~~~~p~~~~lg~~~~~-------------~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l 140 (533) T protein:vir:34 74 IQLHQDHIVGSFFRLSHRPSWRYLGIGEEE-------------ARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIRE 140 (533) T ss_pred HHHHHHHhhCCCceeeeccchhhcCCChhH-------------HHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHH Confidence 99998888764 8877666433 3333222 2233333333322 23345689999999 Q ss_pred HHHHHHhhCceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcce Q lcl|NC_015159. 135 AIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEE 214 (532) Q Consensus 135 ~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 214 (532) ++..+++-|-+++-+... .+..+.+- ++-+-+..+.|+-. .. .++. T Consensus 141 ~~r~~~~dGE~f~~~~~~---------------------~~~g~~~~--~~lq~ie~d~l~~~--------~~--~~~~- 186 (533) T protein:vir:34 141 GVAMHAFNGELFVQATWD---------------------TSSSRLFR--TQFRMVSPKRISNP--------NN--TGDS- 186 (533) T ss_pred HHHHHHhCCceEEEeeec---------------------cCCCCccc--eEEEEechhhcCCC--------CC--CCCC- Confidence 999998888775532111 01111111 11122233333211 00 1111 Q ss_pred EEEEEEEEeeCCCCeEEEEEEEcCccc---ccccccCccccCc---eEEEEeeecCCCccccchHHHHHHHHHHHHHHHH Q lcl|NC_015159. 215 VTIYTHVYRDPEAMVFRSYQEIDGEIV---AGTEGEYPLDSCP---WIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYE 288 (532) Q Consensus 215 v~i~~~v~~~~~~~~~~s~~~~~~~~~---~~~~~~~g~~~~P---~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~ 288 (532) -.|+..|+.|..+.|...+........ ....+..-+...| +++.-....+|..-|.+..--+|..++.|+.... T Consensus 187 ~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~d 266 (533) T protein:vir:34 187 RNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQN 266 (533) T ss_pred CceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeeccChhHeeeeccccCCCcccCCchHHHHHHHHHHHHHHHH Confidence 136788999988887765554322110 0000000011122 2333344469999999999999999999999999 Q ss_pred HHHHHHHHHhcCceeecCc-cccCh-------------hhh---------------ccCCCceeecCcccc-ccccccC- Q lcl|NC_015159. 289 AIVKMSMISSKVLFFVNPN-GVTQI-------------RRV---------------AKANTGDFVAGRKQD-VEVFQLE- 337 (532) Q Consensus 289 ~~l~~~~~a~~p~~lv~~~-g~~~~-------------~~~---------------~~~~~G~~v~g~~~~-~~~~~~~- 337 (532) +.+.++..++.....+..+ +--.. ..+ ...++|.|..-.++. +...... T Consensus 267 ael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~ 346 (533) T protein:vir:34 267 TQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQD 346 (533) T ss_pred HHHHHHHHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhccCcceeeccCceeeecCCCCeeeecCCCC Confidence 9999999999888766522 10000 000 012344443322221 2222211 Q ss_pred CccchhHHHHHHHHHHHHHHHHH--hhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015159. 338 KYNDFQVAKATADDIEKRLSYAF--MLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQAT 415 (532) Q Consensus 338 ~~~~~~~~~~~i~~~~~rI~~af--~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~ 415 (532) ..++|. .....+...|-.++ -+.++ ..|-.+++=.-+++-..|..+.+--.=..|..-|+.|+..+++..+... T Consensus 347 p~~~~~---~f~~~~lr~iAaglGi~ye~l-t~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~ 422 (533) T protein:vir:34 347 TDNGYS---VFEQSLLRYIAAGLGVSYEQL-SRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVR 422 (533) T ss_pred CCCCHH---HHHHHHHHHHHhhcCCCHHHH-hhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 122332 33344444555554 12222 2354455655555555555555555555566678889999999999999 Q ss_pred CCCCCCcc---c-------cccceee----cchHHHHHHHHHH-HHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCC Q lcl|NC_015159. 416 SKIPNLPK---E-------AVEPAIA----TGLEALGRGHDLN-KLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMD 480 (532) Q Consensus 416 g~lp~~p~---~-------~~~~~~v----~~l~~l~raq~~~-~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~ 480 (532) |.||-+.. + ..+...+ ..|+|+-.++... .+.+-+.+ ...++...|.| T Consensus 423 G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s-----------------~~~~~a~~G~D 485 (533) T protein:vir:34 423 RVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLST-----------------YEKECAKRGDD 485 (533) T ss_pred CcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCC-----------------HHHHHHHcCCC Confidence 99874321 1 1222332 2344543332211 11100000 11222334555 Q ss_pred HhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhH-HHHHHHHhhcccccCCCCC Q lcl|NC_015159. 481 TTGLILTQQDKQAKMAEASTAAGMVTAGQQMGA-AGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 481 p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~~~ 532 (532) +...+ .++.++.+........ .+. +.....++..+......+. T Consensus 486 ~~ev~-------~q~a~e~~~~~~~gl~--~~~~~~~~~~s~~~~~~~~~~~~ 529 (533) T protein:vir:34 486 YQEIF-------AQQVRETMERRAAGLK--PPAWAAAAFESGLRQSTEEEKSD 529 (533) T ss_pred HHHHH-------HHHHHHHHHHHhcCCC--CCCCCCcCccCCCCCCCCCCccc Confidence 53222 2222111111110000 000 0000000111111111111 No 144 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=91.74 E-value=0.014 Score=30.67 Aligned_cols=370 Identities=11% Similarity=0.016 Sum_probs=148.8 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFP 80 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltp 80 (532) |.=-++ +.+.+ ..|+.-. ..+.|..+...+.+......-+-.++--.|++.+|+.+. T Consensus 1 MG~~~~-------~~~~~----~~~~~~~-------~~~~~~~~~~~g~~~~~~~~al~~~~V~~~v~~Ia~~iA----- 57 (411) T protein:vir:81 1 MGWWSR-------LTRFF----RPRNETV-------DMTNPLLLQWLGVDPDTPRNQLSEATYFACLKILSESLG----- 57 (411) T ss_pred CchHHH-------HHhhc----cCccccc-------ccchHHHHHHhcCcccChhhhhccHHHHHHHHHHHHhHh----- Confidence 331110 11111 1111000 000111110000000000001112222344554444433 Q ss_pred CCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_015159. 81 VGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLVAGNVLLYIPSTEQV 155 (532) Q Consensus 81 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~~~v~~~~~~ 155 (532) +-||--..-.+....+. .+..++..|+ +-| .+.=+...+.+|..+|||.+|+..+ T Consensus 58 -~lp~~~~~~~~~~~~~~----------------~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~--- 117 (411) T protein:vir:81 58 -KLPLKMYQKTERGIVKS----------------DREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS--- 117 (411) T ss_pred -hCceeEEEecCCceeee----------------cccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec--- Confidence 22443222111110000 0111122332 222 3344566677888999998887643 Q ss_pred cCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEE Q lcl|NC_015159. 156 EGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQE 235 (532) Q Consensus 156 ~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~ 235 (532) .++...+..+|...+.+..|..|.+.. ...-+|+.. +. T Consensus 118 ~g~~~~l~~l~~~~v~~~~~~~~~~~~------------------------------~~~~~~~~~------------~~ 155 (411) T protein:vir:81 118 GPQLQALWILPSQYVTIVVDDRGLLGE------------------------------KNAIWYRYN------------DP 155 (411) T ss_pred CCceEEEEEECCceEEEEEcCcccccc------------------------------cceEEEEEE------------ec Confidence 244555666666777666666663110 000011110 01 Q ss_pred EcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhh Q lcl|NC_015159. 236 IDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRV 315 (532) Q Consensus 236 ~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~ 315 (532) .+|... . +..--+++.|+....+..||.||..-+...+.......+.......-...|..++.-++.++++.. T Consensus 156 ~~g~~~-----~--~~~~eiih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~ 228 (411) T protein:vir:81 156 YDGKMY-----V--FRNDEILHFKTSVTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQEAR 228 (411) T ss_pred CCceEE-----E--EccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHH Confidence 111111 1 112225666765556678999999999999999999999888888888888877766666666543 Q ss_pred cc----------C-CC-ceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--c-ccCCCCCCCHHHHH Q lcl|NC_015159. 316 AK----------A-NT-GDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--A-VQRGGDRVTAEEIR 379 (532) Q Consensus 316 ~~----------~-~~-G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~-~~~~~~~~TAtEi~ 379 (532) .. + .+ |.+. --.+++...++.. ..+.+.. +..+..+..|-.+|-.-. + ...++..-++++.. T Consensus 229 ~~~~~~~~~~~~g~~n~g~~~-vl~~g~~~~~l~~~~~d~q~~-e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~ 306 (411) T protein:vir:81 229 DRLVKGFEQFANGSKNAGKII-PVPLGMKLVPLDIKLTDSQFF-ELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQN 306 (411) T ss_pred HHHHHHHHHHhcCccccCCce-ecCCCceEEEccCCHHHHHHH-HHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHH Confidence 11 1 11 1111 1122233333332 2344433 445667788888884321 1 11122222333221 Q ss_pred HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCc-ccccccee-ecc---hHHHHHHHHHHHHHH--H--H Q lcl|NC_015159. 380 YVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLP-KEAVEPAI-ATG---LEALGRGHDLNKLNV--F--I 450 (532) Q Consensus 380 ~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p-~~~~~~~~-v~~---l~~l~raq~~~~l~~--~--~ 450 (532) ..+...-|.|++.++...+.+ .+|++-. .....+.+ ++. .+...|+.-++.+.. + . T Consensus 307 --------------~~f~~~~l~P~~~~ie~~l~~-~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~ 371 (411) T protein:vir:81 307 --------------LAFYVDTLLYVLKQYEEEITY-KILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTP 371 (411) T ss_pred --------------HHHHHHHHHHHHHHHHHHHHh-hcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCH Confidence 122333455555555444422 2333211 11111222 111 122333333222221 0 0 Q ss_pred ---HHHHhhcchh-hh-------hcCHHHHHHHHHHhcCCCH Q lcl|NC_015159. 451 ---DYMIKLAGLQ-DD-------DINLLDVKMRLANSLGMDT 481 (532) Q Consensus 451 ---~~laq~~p~~-~d-------~id~d~~~~~~a~~~Gv~p 481 (532) -.+-.+.|.. .| .+-.+.+.+...+ |=+. T Consensus 372 NE~R~~~gl~p~~ggD~~~~~~n~~pl~~~~~~~~k--gGd~ 411 (411) T protein:vir:81 372 NEARDYLDMPADDYGNNLMANGNYIPLSMLGANYGK--GGDS 411 (411) T ss_pred HHHHHHhCCCCCCCCCeeeeccCccchhhhhhhhcc--CCCC Confidence 0111222211 11 2223333222221 2123 No 145 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=91.08 E-value=0.018 Score=30.20 Aligned_cols=443 Identities=10% Similarity=0.028 Sum_probs=165.7 Q ss_pred CCCCCCC-ccCHHHHHHHHHHHHHHh-----hhHHHHH-----HHHHHhhcccccCCCCCcccccccccc--cchHHHHH Q lcl|NC_015159. 1 MAEVEKT-GFAADGAAAAYNRLKNDR-----GAYETRA-----EDCATYTIPSVFPSATADGSTSYTTPW--QSIGARGL 67 (532) Q Consensus 1 m~~~~~~-~~~~~~~~~r~~~lk~~R-----~~~e~~w-----~e~~~~~~P~~~~~~~~~~~~~~~~~~--dst~~~a~ 67 (532) +...+++ +.-...+..--..+..++ ..+.... .-+..|..+..++ +. .+..+| +..+-.+| T Consensus 47 ~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~-~l~a~Y~~~~l~r~iV 120 (537) T protein:vir:10 47 MMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFI-----GH-QMCALIATHWLVNKAC 120 (537) T ss_pred cCCCCCccCcccccccccccchhccccccchhhhhhhccccccchhhhhccccCCc-----cH-HHHHHHHhCchhhhhh Confidence 1122111 100000000000000000 0000000 0001111111111 10 011111 23334444 Q ss_pred HHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceee Q lcl|NC_015159. 68 NNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLL 147 (532) Q Consensus 68 ~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~ 147 (532) ++.|..+ -+.|+.+...+..-.+. ..+ +.+...+.+-+++..+.+++..--.||.+++ T Consensus 121 d~~A~d~-------~r~~~~i~~~~~~~~~~-------~~~--------~~l~~~~~~l~~~~~l~~a~~~~rlyG~~~i 178 (537) T protein:vir:10 121 SQMPRDA-------MRKGYKIISDDGNELDP-------KDA--------KFIDRYDRAFNIKKHAIQFVRKGRIFGIRIA 178 (537) T ss_pred hhhhHHh-------hcCCceeecCCcccccH-------HHH--------HHHHHHHHHhhHHHHHHHHHHhcccccceEE Confidence 4444433 46888887654321111 112 2233455566788999999999888999888 Q ss_pred eeccccccc---CCcceEEEEecceEEEeeCCCCCeEEE--EEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEE Q lcl|NC_015159. 148 YIPSTEQVE---GQSNAPKLYKLHNFVVERDAYDNVLQI--VTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVY 222 (532) Q Consensus 148 ~v~~~~~~~---~~~~~~~~~pl~~~~v~~d~~G~vd~i--~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~ 222 (532) ++.-++.+. ..++....+. .|.+..+ +-+.+.+.. ............+ +.+-+.| T Consensus 179 ~i~v~~~D~~~~~~Pl~~~~i~----------kg~~k~l~vidp~~~~~~-----~~~~~~~dp~sp~-fg~P~~y---- 238 (537) T protein:vir:10 179 LFKVDSPDPYYYEKPFNIDGVM----------PGAYKGIVQIDPYWCAPL-----LDAQASSNPVSMH-FYEPTYW---- 238 (537) T ss_pred EEeecCcCCccccccccccccc----------ccceeEEEEechhhcccc-----cchhhhccCCccc-cCCceee---- Confidence 875432211 1111111111 1111111 111111110 0000000000111 0111111 Q ss_pred eeCCCCeEEEEEEEcCcccccccccCccc--cCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015159. 223 RDPEAMVFRSYQEIDGEIVAGTEGEYPLD--SCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKV 300 (532) Q Consensus 223 ~~~~~~~~~s~~~~~~~~~~~~~~~~g~~--~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p 300 (532) .+.|..++.. +.--|. ..|+ +.+....-||++..+.++..++..........+......-. T Consensus 239 ------------~v~g~~iH~S-Rli~f~g~~~p~----~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~ 301 (537) T protein:vir:10 239 ------------LINGKKYHRS-HLAIYINDEVVD----FLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQT 301 (537) T ss_pred ------------eecCeEecce-eEEEecCCCCch----hhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Confidence 1122222110 000111 1233 23334445799999999999999998888888877777766 Q ss_pred ceeecCccc-cChhhhc---------cCCCceeecCcc-ccccccccCCccchhHHHHHHHHHHHHHHHHHh--hhhccc Q lcl|NC_015159. 301 LFFVNPNGV-TQIRRVA---------KANTGDFVAGRK-QDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM--LNSAVQ 367 (532) Q Consensus 301 ~~lv~~~g~-~~~~~~~---------~~~~G~~v~g~~-~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~--~~~~~~ 367 (532) .+-++.... .+.+.+. ....|.++-+.. ..+..+. .+|..+...+....+.|.-++= ..-+.. T Consensus 302 v~k~~~~~~l~~~~~~~~r~~~~~~~r~n~g~~~id~e~e~~e~~~----~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G 377 (537) T protein:vir:10 302 VLKVDAAQVLANKQQFDETMSWWTATRDNYQVRVVDKDNEDVVQID----TTLNDLDKVIMNQYQLVCAIARTPAPKMLG 377 (537) T ss_pred eeeechHHhhcCHHHHHHHHHHHHhhcCCcceeEecCCCceeEEEe----ccCCCHHHHHHHHHHHHHhhhCCCceeecc Confidence 665532111 1122221 111233333332 3333222 2344455677777777777641 111111 Q ss_pred C--CCCCCCHH-HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHH Q lcl|NC_015159. 368 R--GGDRVTAE-EIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLN 444 (532) Q Consensus 368 ~--~~~~~TAt-Ei~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~ 444 (532) . .+-.=|.+ ++.. .---+..++.+ +.|++++++.++.+....+++ .+.+++ .+|-.+....+++ T Consensus 378 ~sp~GlnatGe~D~~~--------yyd~I~~~Qe~-l~p~l~~l~~ll~~~~~~~~~---~~~i~f-~pL~~~s~kEkAe 444 (537) T protein:vir:10 378 TVPTGFNSTGDYEEAS--------YHEECESTQDD-MRPLIDRHHQLVCRSHLRKRI---RVKVEF-PPMDAPKESERAD 444 (537) T ss_pred CCccccccchhHHHHH--------HHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCCc---ceEEEe-CCCCCCCHHHHHH Confidence 1 12111222 2222 22223444554 688899988888876654422 233332 2343333333333 Q ss_pred ---HHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC--HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHH Q lcl|NC_015159. 445 ---KLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT--QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAA 519 (532) Q Consensus 445 ---~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s--~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~ 519 (532) +.....+.+.+. ..|+.+++-+.+...-...-..+... .++.+....+.+ ..........+.+.+.+. T Consensus 445 i~~~~a~a~~~~~~~-----G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~~~~~--~~~~~~~~~~~~~~~~~~ 517 (537) T protein:vir:10 445 TFLKKMQAAKLAFEM-----GAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDIDVDDE--GKPVRIIEDQPAPSEMFG 517 (537) T ss_pred HHHHHHHHHHHHHHc-----CCCCHHHHHHHHhccCccccccccCCCChhhhhcccCCcc--CCcCCCCCCCCCccccCC Confidence 333333333332 25788888888876421111122211 111111000000 000000000111110000 Q ss_pred HhhcccccCCCC------C Q lcl|NC_015159. 520 AAMMQQQAGLPT------Q 532 (532) Q Consensus 520 ~~~~~~~~g~~~------~ 532 (532) +........-+. . T Consensus 518 ~~~~~~~~~~~~~~~a~~~ 536 (537) T protein:vir:10 518 ATSSGESANDPRDSGAAFE 536 (537) T ss_pred CCccccccCCCccCccccC Confidence 111101111110 1 No 146 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=90.73 E-value=0.019 Score=29.98 Aligned_cols=351 Identities=12% Similarity=0.048 Sum_probs=142.7 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccc-cccc-cchHHHHHHHHHHHHHHhh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSY-TTPW-QSIGARGLNNLASKLMLAL 78 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~-~~~~-dst~~~a~~~Laa~l~~~l 78 (532) |+=-++-. ..|..-.. ....++.+....... .+..-. .... .++--.|++.+|+.+.+ T Consensus 1 Mg~f~~~~--------------~~~~~~~~---~~~~~~~~~~~~~~~-~~~~v~~~~~l~~~~v~~~i~~ia~~ia~-- 60 (382) T protein:vir:48 1 MPIFNLAT--------------ESPPDNQG---GFFDVVDSDFLASLK-GNEWVSAETALRNSDLFSIINQLSNDLAT-- 60 (382) T ss_pred Cccccccc--------------cCCccccc---ccccchhhhcccccc-CCcccchHhhhccHHHHHHHHHHHHhhcc-- Confidence 55443211 11110000 111111111111100 010000 0111 22223445555554422 Q ss_pred cCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcC----ChHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 79 FPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNS----FRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 79 tpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~sn----f~~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) -||--...... ..+.+-| .+.=+..++.+|...|||.+++..+. T Consensus 61 ----~~~~~~~~~~~---------------------------~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~- 108 (382) T protein:vir:48 61 ----VKLITSRKKLQ---------------------------GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNE- 108 (382) T ss_pred ----Cceeeecchhh---------------------------hhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECC- Confidence 23321111100 0122222 34445566778889999988875432 Q ss_pred ccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEE Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQ 234 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~ 234 (532) .+....+..+|...+-+..+.+|... + ..+ T Consensus 109 -~G~~~~l~~i~~~~v~v~~~~~~~~~--~-----------------------------------------------y~~ 138 (382) T protein:vir:48 109 -NGRDMKWEYLRPSQVSFNRLDNKDGI--Y-----------------------------------------------YNI 138 (382) T ss_pred -CCcEEEEEEEcCceeEEEEcCCCCeE--E-----------------------------------------------EEE Confidence 23333444444555555555444211 0 011 Q ss_pred EEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhh Q lcl|NC_015159. 235 EIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRR 314 (532) Q Consensus 235 ~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~ 314 (532) ..++.... .... |..--+++.|+...++..||.||..-+...+...+...+.......-...|..++.-++.++.+. T Consensus 139 ~~~~~~~~-~~~~--~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~ 215 (382) T protein:vir:48 139 TFDDPRIP-PKQH--VPQNDVLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLDF 215 (382) T ss_pred EecCcccc-ceeE--EcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHH Confidence 11111100 0001 11223566777777788999999999999999999999999998888899988887666666643 Q ss_pred hcc-------C--CCce-eecCccccccccccC-CccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHH Q lcl|NC_015159. 315 VAK-------A--NTGD-FVAGRKQDVEVFQLE-KYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAG 383 (532) Q Consensus 315 ~~~-------~--~~G~-~v~g~~~~~~~~~~~-~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~ 383 (532) ... + ..|. ++-. +++...++. +..+.+. .+..+..+..|-.+|-...........-|..| ..... T Consensus 216 ~~~~~~~~~~~~~n~g~~~vl~--~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~-~~~~~ 291 (382) T protein:vir:48 216 KTKLSRSRQAMKQMQGGPLVLD--DLEDFTPLEIKSNVSQL-LKQADWTTGQFAKVYGIPDNVVGGQGDQQSSL-EMSSD 291 (382) T ss_pred HHHHHHHHHhhccCCCCeeEcC--CCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccHH-HHHHH Confidence 321 0 1122 2211 222333333 2234443 35567777888888843221111111112221 11233 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHH------HHHHHHHHH--HH-HHH- Q lcl|NC_015159. 384 ELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGR------GHDLNKLNV--FI-DYM- 453 (532) Q Consensus 384 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~r------aq~~~~l~~--~~-~~l- 453 (532) -....|-|.+.++..|+-.-|+.+.- ....+.+..+.. .+..-+..|.| ++-.+.+.. +. ..+ T Consensus 292 ~~~~~l~p~~~~i~~~l~~~l~~~~~-----~~~~~~~~~~~~--~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~ 364 (382) T protein:vir:48 292 LYSKAVSRYLRPFLSELSQKLSCDVD-----ADIFPAVDPTGS--NYISRINSLVKTGTLAQNQGLYILQQAEILPKELP 364 (382) T ss_pred HHHHHHHHHHHHHHHHHHHHhcChhh-----hhhhhhhccchh--HHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcchh Confidence 44555666666666665443322110 001111100000 00011122222 111111100 00 000 Q ss_pred --Hhhcchh--hhhcCHHH Q lcl|NC_015159. 454 --IKLAGLQ--DDDINLLD 468 (532) Q Consensus 454 --aq~~p~~--~d~id~d~ 468 (532) -...|.. -|. |-.+ T Consensus 365 ~~~~~~~~~~GGd~-~~~~ 382 (382) T protein:vir:48 365 NGENPNSTLKGGEE-DGQD 382 (382) T ss_pred hhhcCCCCCCCCCC-CCCC Confidence 1111211 111 1111 No 147 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=90.60 E-value=0.02 Score=29.90 Aligned_cols=383 Identities=10% Similarity=0.025 Sum_probs=148.4 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccc----cCCCCCccccc-ccccccchH-HHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV----FPSATADGSTS-YTTPWQSIG-ARGLNNLASKL 74 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~----~~~~~~~~~~~-~~~~~dst~-~~a~~~Laa~l 74 (532) |.+.+ .++++.+++....|... .++ ...|.. ....+..+..- ......... -.|++.+|+.+ T Consensus 1 ~~~~~---------~~~~~~~~~~~~~~~g~--~~s-~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~i 68 (437) T protein:vir:10 1 MKQGK---------QRALGRIKSSFLKWLGV--PIS-LTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETI 68 (437) T ss_pred CCcch---------hhhhhhhHHhhhhhcCC--ccc-CCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHH Confidence 66553 23344444433333211 000 000100 00001111100 011222222 33555555544 Q ss_pred HHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----CChHHHHHHHHHHHhhCceeeee Q lcl|NC_015159. 75 MLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SN----SFRPTLHAAIKQLLVAGNVLLYI 149 (532) Q Consensus 75 ~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~~~v 149 (532) .+ -||.-....+..-.+ .+ .+..+...|. +- +.+.=....+.+|..+||+.+++ T Consensus 69 a~------lp~~~~~~~~~g~~~---------~~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i 127 (437) T protein:vir:10 69 AT------LPLNLYQTKPDGTRV---------LA------KQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARK 127 (437) T ss_pred hh------CceeEEEEcCCCcee---------ec------cccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEE Confidence 32 245422221100000 00 0112223333 22 34444566678888999998887 Q ss_pred cccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCe Q lcl|NC_015159. 150 PSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMV 229 (532) Q Consensus 150 ~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~ 229 (532) ..+ .+....+..+|...+-+..+.+|.+- T Consensus 128 ~r~---~g~~~~L~~l~p~~v~i~~~~~g~~~------------------------------------------------ 156 (437) T protein:vir:10 128 LRS---AGVLIGLELMLPQRTTVKRLTSGALQ------------------------------------------------ 156 (437) T ss_pred Eec---CCcEEEEEEEcCcceEEEECCCCeEE------------------------------------------------ Confidence 543 13333344444455545444444221 Q ss_pred EEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccc Q lcl|NC_015159. 230 FRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGV 309 (532) Q Consensus 230 ~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~ 309 (532) -.++...|... .+. ..=+++.|....+ ..||.||..-+...+.....+.+.......-...|-.++.-++. T Consensus 157 -y~~~~~~g~~~-----~~~--~~dIih~r~~~~d-~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ 227 (437) T protein:vir:10 157 -YTYRNVDGTVS-----TLA--EDDVFHVRGFSLD-GLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQI 227 (437) T ss_pred -EEEEecCceEE-----EEc--cccEEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCC Confidence 01111111110 000 1113444443333 38999999999999988888888888888888888877776676 Q ss_pred cChhhhccCC-------C-----ce-eecCccccccccccC-CccchhHHHHHHHHHHHHHHHHHhhhh--cccCCCCCC Q lcl|NC_015159. 310 TQIRRVAKAN-------T-----GD-FVAGRKQDVEVFQLE-KYNDFQVAKATADDIEKRLSYAFMLNS--AVQRGGDRV 373 (532) Q Consensus 310 ~~~~~~~~~~-------~-----G~-~v~g~~~~~~~~~~~-~~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~~~~~ 373 (532) ++++...... . |. ++- .+++...++. +..+.+. .+..+..+..|-.+|-... +...+.... T Consensus 228 l~~e~~~~~~~~~~~~~~g~~nag~~~vl--~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~ 304 (437) T protein:vir:10 228 LQKEKRAEIRTDLAEQFGGAMQAGKTMVL--EAGMKYQAITMNPGDVQL-LETRAFNIEEICRWYRVPPFMVGHSEKSTS 304 (437) T ss_pred CCHHHHHHHHHHHHHHhcCccccCcceec--cCCceEEeccCChhhHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCccc Confidence 6766542211 0 11 111 1223333333 2234443 3444556677888884322 111122222 Q ss_pred CHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee-ecch---HHHHHHHHHHHHHHH Q lcl|NC_015159. 374 TAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI-ATGL---EALGRGHDLNKLNVF 449 (532) Q Consensus 374 TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~-v~~l---~~l~raq~~~~l~~~ 449 (532) +..-+.+.... +...-|.|++.++...|.+ .+|++-......+.+ ++.+ +.-.|+.-.+.+... T Consensus 305 ~~sn~e~~~~~-----------f~~~tl~P~~~~ie~~l~~-kll~~~e~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~ 372 (437) T protein:vir:10 305 WGTGIEQQTLG-----------FLTFTLRPWLTRIEQAARR-SLLRPGERDQFYAEFSVEGLLRADSAGRAAFYSTMTQN 372 (437) T ss_pred ccchHHHHHHH-----------HHHHHHHHHHHHHHHHHHh-hccCccccCceEEEEechhhhccCHHHHHHHHHHHHhC Confidence 22333333222 2233445555554444332 244432222222222 1111 223333333222210 Q ss_pred ----H---HHHHhhcchh-hh--------hcCHHHHHHHHHHh-------cCCCHhHccCCHHHH Q lcl|NC_015159. 450 ----I---DYMIKLAGLQ-DD--------DINLLDVKMRLANS-------LGMDTTGLILTQQDK 491 (532) Q Consensus 450 ----~---~~laq~~p~~-~d--------~id~d~~~~~~a~~-------~Gv~p~~i~~s~ee~ 491 (532) . -.+-.+.|.. -+ .+..+.+-...-.. .|-....=-+.+||. T Consensus 373 G~~T~NE~R~~~gl~pi~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 437 (437) T protein:vir:10 373 GLMTRDECRAKENLPPMGGNAAVLTVQSALLPIDKLGEHTTATAAQDALKAWLYQEEKTRATQER 437 (437) T ss_pred CCcCHHHHHHHhCCCCCCCCcceEeecCcccchhhccCcCCCcchhccccccCCCCCCCCccccC Confidence 0 0111222211 01 12222211100000 000011112223333 No 148 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=88.64 E-value=0.031 Score=28.85 Aligned_cols=433 Identities=11% Similarity=0.047 Sum_probs=143.2 Q ss_pred CCCCCCCccCHHHHH-HHHHHHHHHhhhHH--HHHHHHHHhhcccccCCCCCcccccccccc--cchHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAA-AAYNRLKNDRGAYE--TRAEDCATYTIPSVFPSATADGSTSYTTPW--QSIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~~~~~~~~~~-~r~~~lk~~R~~~e--~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~--dst~~~a~~~Laa~l~ 75 (532) =..........+++. .....-+..-+... ..+...+. ..|.-.++.. -+.+-+.| ..+.-.|++..|.-+. T Consensus 30 ~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~-~r~~~~~~~~---l~~~~~~~~~npiv~~~I~~ia~~IA 105 (551) T protein:vir:80 30 NYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFK-TKPSIRNNQD---LHGVLKKFGGNIILNAIINTRSNQVS 105 (551) T ss_pred ceeeecccccHHHHHHhhccCcceeecccccceecCcccc-cCccccChhH---HHHHHHHhhcCHHHHHHHHHHHHHHh Confidence 000000000111110 00000000000000 11111111 1111111100 01111222 2223455666665544 Q ss_pred HhhcCCC----CCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcC---------ChHHHHHHHHHHHhh Q lcl|NC_015159. 76 LALFPVG----SSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNS---------FRPTLHAAIKQLLVA 142 (532) Q Consensus 76 ~~ltpp~----~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~sn---------f~~~~~~~~~dl~~~ 142 (532) +.-.+.. -.=|.+.+.+.+-............++ ..|.+-| |..-+..++.|+.++ T Consensus 106 ~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~-----------~~l~~pn~~~~p~~~s~~~f~~~lv~dlll~ 174 (551) T protein:vir:80 106 MYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIE-----------SFIEKTGVDNDINRDSFSSFVKKIVRDTYMY 174 (551) T ss_pred hhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHH-----------HHHHhcCCCCCCccchHHHHHHHHHHHHHhc Confidence 3221110 011223322211111000001111122 2233333 334455667888899 Q ss_pred CceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEE Q lcl|NC_015159. 143 GNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVY 222 (532) Q Consensus 143 G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~ 222 (532) |||.+++..+. .+....+..++...+.+..+.+|.+..- T Consensus 175 Gnay~~i~rd~--~G~~~~L~~l~p~~V~v~~~~~g~~~~~--------------------------------------- 213 (551) T protein:vir:80 175 DQVNFEKVFNR--NQSMVRFVAKDPTTIFFATTADGKIPDN--------------------------------------- 213 (551) T ss_pred CCEEEEEEECC--CCcEEEEEEeCCceeEEEECCccccccC--------------------------------------- Confidence 99987765432 2333444444445666666666643210 Q ss_pred eeCCCCeEEEEEEEcCcccccccccCccccCceEEEEeee---cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015159. 223 RDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIK---MPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSK 299 (532) Q Consensus 223 ~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~---~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~ 299 (532) ++..++..+|.... .+..++ +++.|.+. ....+||.||..-+...+.......+.......-... T Consensus 214 ------~~~y~~~~~g~~~~----~~~~~e--iiH~~~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~ 281 (551) T protein:vir:80 214 ------GNRFVQVIDQKIVA----TFNARE--MAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGT 281 (551) T ss_pred ------ceEEEEEeCCcEEE----EEcccc--eEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCC Confidence 01111111221110 111111 22222221 2235799999999999999998888888888777788 Q ss_pred Cceee--cCccccChhhhc----------cC-CC-ce--eecCccccccccccC-CccchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_015159. 300 VLFFV--NPNGVTQIRRVA----------KA-NT-GD--FVAGRKQDVEVFQLE-KYNDFQVAKATADDIEKRLSYAFML 362 (532) Q Consensus 300 p~~lv--~~~g~~~~~~~~----------~~-~~-G~--~v~g~~~~~~~~~~~-~~~~~~~~~~~i~~~~~rI~~af~~ 362 (532) |..++ +.+..+..+.+. .+ .+ |. ++.+ +++...++. +..+.+ ..+..+..+..|-++|-. T Consensus 282 p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~vl~~--~g~~~~~l~~~~~D~q-fle~~~~~~~~Ia~aFgV 358 (551) T protein:vir:80 282 TRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVSA--EDVKFVNMTPSARDME-FEKWLNYLINVISALYGI 358 (551) T ss_pred cceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCccccccC--CCceEEEccCChhHHH-HHHHHHHHHHHHHHHhcC Confidence 88554 434334443321 11 11 11 2211 334444443 233444 334456677788888843 Q ss_pred hh--ccc-CC-------CCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeec Q lcl|NC_015159. 363 NS--AVQ-RG-------GDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIAT 432 (532) Q Consensus 363 ~~--~~~-~~-------~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~ 432 (532) -. +.. .+ ...+|-.=+.+.. ..+...-|.|++.++...|.+ .++|... ..+...+. T Consensus 359 Pp~~lG~~~~~~~~~~~~~s~t~sn~e~~~-----------~~f~~~tL~P~~~~ie~~ln~-~L~~~~~-~~~~f~f~- 424 (551) T protein:vir:80 359 DPAEINIPNNGGATGSKGGSLNEGNSAEKN-----------QASKNKGLQPLLGFIEDFINK-HIVAEFG-DKYTFQFV- 424 (551) T ss_pred CHHHcCcccccccccccccccchhhHHHHH-----------HHHHHHHHHHHHHHHHHHHHh-hhccccC-CceEEEee- Confidence 21 110 11 1112211111111 122333444544444443332 2344322 22333433 Q ss_pred chHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHh-----Hcc------CCHHHHHHHHHHHHHH Q lcl|NC_015159. 433 GLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTT-----GLI------LTQQDKQAKMAEASTA 501 (532) Q Consensus 433 ~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~-----~i~------~s~ee~~~~~~q~~~~ 501 (532) .+....++...+ +...+. +. .+-.++ +-+.+|.+|. .++ ...+..+....+.+.+ T Consensus 425 ~~~~~~~~~~~~-~~~~~~--~g-------~lT~NE----~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (551) T protein:vir:80 425 GGDIKSELESVK-ILAEKA--KV-------AMTVNE----VRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQ 490 (551) T ss_pred ccChhhHHHHHH-HHHHHh--cC-------CcCHHH----HHHHhCCCCCCCCCceeecccccccccccccccCcchhhh Confidence 222222222111 111110 00 012222 1123344331 011 0000000000000000 Q ss_pred H-HHHHHHHhhhHHHHHH--HHhhcccccCCCCC Q lcl|NC_015159. 502 A-GMVTAGQQMGAAGGQA--AAAMMQQQAGLPTQ 532 (532) Q Consensus 502 ~-~~~~~~~~~~~~~~~~--~~~~~~~~~g~~~~ 532 (532) + ...+..+..+...+.. .....++..|-.+. T Consensus 491 ~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~ 524 (551) T protein:vir:80 491 QSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGK 524 (551) T ss_pred hhccccccCcCCCCCCCCCCCCCCccccCCCccc Confidence 0 0000000000000000 00000000010000 No 149 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=88.06 E-value=0.035 Score=28.59 Aligned_cols=400 Identities=13% Similarity=0.102 Sum_probs=160.9 Q ss_pred HHHHHHHHHhhcccccCC-CCC---cccccccccccchHHHHHHHHHHHHHHhhcCCC---CCccccCCChHHHhhhccC Q lcl|NC_015159. 29 ETRAEDCATYTIPSVFPS-ATA---DGSTSYTTPWQSIGARGLNNLASKLMLALFPVG---SSFFKLNVSELEVKQSITS 101 (532) Q Consensus 29 e~~w~e~~~~~~P~~~~~-~~~---~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~---~~WF~l~~~d~~~~~~~~~ 101 (532) .....-+..++.- .+.. +.. .+.......++=.+..+.+-|+.++... |+. +.|+.+...|.... T Consensus 1 ~~~~D~~~~~~~~-~g~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~vd~--~a~d~~r~~~~i~~~d~~~~----- 72 (437) T protein:vir:52 1 MKFFDGIKSLALK-LGSKQEQTYYSPSLSLTDDLVQLEALWRDNWIANKVCIK--RPEDMVRNWREIYSNDLNSK----- 72 (437) T ss_pred CchhhhhHhHHhc-CCCccccceeecCccccccHHHHHHHHHhCchhhHHhhc--chHHhhcCCceEecCCCCHH----- Confidence 1111111111110 0000 000 0000000001111122233344444433 332 68888865321110 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecceEEEeeCCCCCeE Q lcl|NC_015159. 102 PEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVL 181 (532) Q Consensus 102 ~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd 181 (532) .+ +.+.+.+.+-++...+.++++.--.||.|++++.-+.... .-|+. ..|.+. T Consensus 73 -----~~--------~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~-------~~pl~-------~~~~~~ 125 (437) T protein:vir:52 73 -----QL--------DLFTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNT-------SAPLK-------PTERLK 125 (437) T ss_pred -----HH--------HHHHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCc-------ccccc-------cCCcee Confidence 11 1223445555788999999998888999999886543211 12221 123222 Q ss_pred EE--EEEEeecHHHhhHHHHHHHHhhcccCCC-cceEEEEEEEEeeCCCCeEEEEEEEcCcccccccccCccccCceEEE Q lcl|NC_015159. 182 QI--VTEDKIARAALPEDVRKSLEEAQGDQNP-SEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPV 258 (532) Q Consensus 182 ~i--~rk~~~~~~~l~~~~~~~~~~~~~~~~~-~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~ 258 (532) .+ +-+.+++ +...+..+-.+| +.+.+.|.. ...+..+.. .-.++..+.+ ...| T Consensus 126 ~~~v~~~~~v~---------~~~~~~~dp~s~~fg~p~~y~v---~~~~~~~~i----H~SRii~~~~----~~~~---- 181 (437) T protein:vir:52 126 RLIILPKWKIS---------PTGTKDDDVLSPNFGRYSEYSI---LGGSQSITV----HHSRLIILNA----NDAP---- 181 (437) T ss_pred EEEEechhhcc---------ccccccccccccccCcceEEEE---ecCCcceeE----ccceeEEecC----ccCC---- Confidence 11 1111111 001000000011 122222221 111111111 1111111111 0112 Q ss_pred EeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecC-ccccC---------hh-hh--ccCCCceeec Q lcl|NC_015159. 259 RLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNP-NGVTQ---------IR-RV--AKANTGDFVA 325 (532) Q Consensus 259 Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~-~g~~~---------~~-~~--~~~~~G~~v~ 325 (532) .....-||+|+.+..+..++..+.........+..+..+.+.++. ...+. .. .+ .....|.++- T Consensus 182 ---~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 258 (437) T protein:vir:52 182 ---LSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKIDIFKIAGLSDKIAAGMENEVASVISAVQEIKSATNSLLL 258 (437) T ss_pred ---CccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCCceecchHHHHhcCCcHHHHHHHHHHHHHhcCCCceEEE Confidence 233667899999999999999999888888777666555554431 00010 00 00 1111233333 Q ss_pred CccccccccccCCccchhHHHHHHHHHHHHHHHHHh--hhhcccCCCCCC-CHH-HHHHHHHHHHHHhhhhHHHHHHHHH Q lcl|NC_015159. 326 GRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM--LNSAVQRGGDRV-TAE-EIRYVAGELEDTLGGVYSLLSQELQ 401 (532) Q Consensus 326 g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~--~~~~~~~~~~~~-TAt-Ei~~r~~E~~~~LGpv~~rl~~E~l 401 (532) +..++...+. .+|.-+...+....+.|..++= ...+.......+ |.+ +++. .---+..++...+ T Consensus 259 d~~~~~e~~~----~~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D~~~--------yyd~i~~~Qe~~l 326 (437) T protein:vir:52 259 DAENEYDRKE----LTFTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGLASGDEDIQN--------YHEAIRRLQETRL 326 (437) T ss_pred cCCcceEEEe----cCcCCHHHHHHHHHHHHHHHhcCchhhhcCcCcccccccHHHHHH--------HHHHHHHHHHHHH Confidence 3333443333 2344455677777888887761 111111112223 211 2222 2222455667789 Q ss_pred HHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHH---HHHHHHHHhhcchhhhhcCHHHHHHHHHHhcC Q lcl|NC_015159. 402 LPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKL---NVFIDYMIKLAGLQDDDINLLDVKMRLANSLG 478 (532) Q Consensus 402 ~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l---~~~~~~laq~~p~~~d~id~d~~~~~~a~~~G 478 (532) .|+++|++.++.+....+ +|. .+.+++ .+|..+....+++.. ....+.+.+. ..++.+++.+.+.+. | T Consensus 327 ~p~le~l~~~i~~~~~g~-~~~-~~~~~f-~pL~~~s~kekae~~~~~a~a~~~~~~~-----g~i~~~e~r~~L~~~-g 397 (437) T protein:vir:52 327 RPIFEIIDPLICNELFGG-LPA-DWWFEF-VPLTTVKQEQQINMLNTFATAANTLIQN-----GVLNEYQIANELRES-G 397 (437) T ss_pred HHHHHHHHHHHHHHhcCC-CCC-cceEEe-CCcCCcCHHHHHHHHHHHHHHHHHHHhc-----CCCCHHHHHHHHHhc-C Confidence 999999999888764333 333 233333 244444444444432 2333333322 246778777766543 4 Q ss_pred CCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 479 MDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 479 v~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) +=+. + +++++....-..+ .. ....+.+...+++ T Consensus 398 ~~~~--i-~~~~~~~~~~~~~--------------~~----~~~~~~~~~~~~~ 430 (437) T protein:vir:52 398 LFAN--I-SAEHIEELKNADE--------------FA----GNFEEPEKMEGAQ 430 (437) T ss_pred CCCC--C-CccccccccCCCC--------------CC----CccCCCCCCCCCC Confidence 3221 1 1111111000000 00 0000000000111 No 150 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=88.05 E-value=0.035 Score=28.58 Aligned_cols=415 Identities=11% Similarity=0.089 Sum_probs=179.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCC----CCc---cc-cccc----------ccccch Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSA----TAD---GS-TSYT----------TPWQSI 62 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~----~~~---~~-~~~~----------~~~dst 62 (532) |. ++..--+-..+..+|+.. |......-+...+-.||.....+ ++. .. .+.. -.|-+. T Consensus 14 m~-V~~~hp~y~a~~~~W~~~---~d~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~~~rA~~~n~ 89 (488) T protein:vir:96 14 ML-TPIYHPDYLVNAPQWLRN---LDCVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLTWRLANYVNI 89 (488) T ss_pred ec-ccccCHHHHHHhhhhhHh---hhhhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhhhhccccCch Confidence 77 444444566666777654 33455555566666778643211 110 00 0111 123333 Q ss_pred HHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhh Q lcl|NC_015159. 63 GARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVA 142 (532) Q Consensus 63 ~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~ 142 (532) -.+.++. |++.+|- ..|=+ +.++ ..+++.+++.| -....+++.-+..+..+...+ T Consensus 90 ~~~tl~~----l~G~vfr-k~p~~--~~~~------------~~~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~ 144 (488) T protein:vir:96 90 VNPTMNA----ITGAVMR-REPEF--DTMD------------NPVLIGLRDNI------DGKGNGIDQECKQALNALQWG 144 (488) T ss_pred hHHHHHH----hcchhhc-cCcee--ccCC------------cHHHHHHHhcc------CCCCCCHHHHHHHHHHHHHhc Confidence 3444444 3333332 11111 1111 01245555544 456777888888999999999 Q ss_pred Cceeeeeccccccc--------CCcceEEEEecceE---EEee-CCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCC Q lcl|NC_015159. 143 GNVLLYIPSTEQVE--------GQSNAPKLYKLHNF---VVER-DAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQN 210 (532) Q Consensus 143 G~~~~~v~~~~~~~--------~~~~~~~~~pl~~~---~v~~-d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~ 210 (532) |-+.++||-..... +..-.+..|+..+. -..+ |....+.-+..++...... .. .. T Consensus 145 G~~~ilVD~P~~~~T~ade~~~~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D-----------~~-~~- 211 (488) T protein:vir:96 145 SRCGWLVRSHPESATMADWNKGKKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERD-----------GG-TY- 211 (488) T ss_pred CeEEEEEecCCCcCCHHHHHHhcCCcEEEEechhhhcCcceeccCCceeeEEEEEEEEEEecc-----------CC-Cc- Confidence 99999998543110 11134666665443 2222 2222344454455433210 00 00 Q ss_pred CcceEEEEEEEEeeCCCCeEEEEEEEcCccccc-ccccCccccCceEEEEeeecCCCcccc--chHHHHHHHHHHHHH-- Q lcl|NC_015159. 211 PSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAG-TEGEYPLDSCPWIPVRLIKMPNEDYGR--SFVEEYLGDLKSLEN-- 285 (532) Q Consensus 211 ~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~-~~~~~g~~~~P~~~~Rw~~~~g~~YG~--Gp~~~al~d~~~L~~-- 285 (532) ....+..++.+. ++ .|..+.+.++..... ..-..|-+.+++|++.|....+..+.. .|.. |+..||. T Consensus 212 ~~~~~~~~~~l~---~g-~~~v~~~~~~~~~~e~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLl----dLA~lnl~H 283 (488) T protein:vir:96 212 VSKQRLINHRLV---DG-LCEFQEVTDDEYSDEWTPVLINSKQSDTIPFFLASSQSNEWCIDSTPLT----SLAEISLSI 283 (488) T ss_pred ccceEEEEEEEE---Cc-EEEEEEEecCCcccceEeecCCCcccCeeEEEEEecCCCCCCCCCCchH----HHHHHHHHH Confidence 011121222111 22 355554433332211 111123345666777777666655544 4433 4444442 Q ss_pred -HHHHHHHHHHHHhcCc-eeecCccccChhhhccCCCceeecCc-------cccccccccCCccchhHHHHHHHHHHHHH Q lcl|NC_015159. 286 -LYEAIVKMSMISSKVL-FFVNPNGVTQIRRVAKANTGDFVAGR-------KQDVEVFQLEKYNDFQVAKATADDIEKRL 356 (532) Q Consensus 286 -l~~~~l~~~~~a~~p~-~lv~~~g~~~~~~~~~~~~G~~v~g~-------~~~~~~~~~~~~~~~~~~~~~i~~~~~rI 356 (532) -..+-++.+.....+| |....++.. ........++.+..+. .++.... +..+. ..+.+.+++++.++ T Consensus 284 y~~ssd~~~il~~~~~p~lv~~~~~~~-~~~~~~~~~~g~~~~~~~~~~~~~g~~~~~--e~~~~-~l~~~~l~~l~~qm 359 (488) T protein:vir:96 284 YVMNAYSNKAMILANEAKWMVDMGDMN-KTMASEMNPLGFTLAGRMPYYVKNGDVKVI--QAQFS-PETENKVEKLFEQA 359 (488) T ss_pred HhhhhHHHHHHHhcCCceeeeccCCCC-cccccccccceeeecccccccccCCceeec--CCchh-HHHHHHHHHHHHHH Confidence 2233334333344444 444333322 2211111112221111 1222222 21111 12466677777776 Q ss_pred HHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccc----ccceee- Q lcl|NC_015159. 357 SYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEA----VEPAIA- 431 (532) Q Consensus 357 ~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~----~~~~~v- 431 (532) .++= -.+...+ .+.||++...+....-..|+.+...+++- +.-++..+...+--.+ ....++. ++.+++ T Consensus 360 ~~~G--a~l~~~~-~~~Ta~~~~~~~~~~~S~L~~~a~~le~a-l~~~l~~~A~w~g~~~--~~~~~~~~~~~in~dF~~ 433 (488) T protein:vir:96 360 VKVG--ASLFTQQ-SNETATGAAIRSGSSTASMATLGNNVEDT-VRNMLRFIMRYFEGTN--LYVNPDELVFKLNRDYFD 433 (488) T ss_pred HHHh--HhhccCC-CcchHHHHHHHHHHhhHHHHHHHHHHHHH-HHHHHHHHHHHcCCCC--CCcCccceEEEeccCCCC Confidence 5531 1122233 34799999999999999999988887763 3333343333331111 0011111 222221 Q ss_pred cchHHHHHHHHHHHHHH-----------HHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCH Q lcl|NC_015159. 432 TGLEALGRGHDLNKLNV-----------FIDYMIKLAGLQDDDINLLDVKMRLANSLGMDT 481 (532) Q Consensus 432 ~~l~~l~raq~~~~l~~-----------~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p 481 (532) ..+. ++.++.++. +...|-. .+...+.+++++..+++.+ .|+.. T Consensus 434 ~~ld----~~~~~al~~~~~~G~Is~~t~~~~L~~-~gvl~~d~~~e~~~~~ie~-~g~~~ 488 (488) T protein:vir:96 434 VEVN----PQMLQVAYAAMMEGNLPQVSWFELLKR-ARVVRGDMSKEEFDEHIAE-LGFGM 488 (488) T ss_pred ccCC----HHHHHHHHHHHhcCCCCHHHHHHHHHh-CCcCCccCCHHHHHHHHhh-cCCCC Confidence 1121 122222222 2222222 1111234567777777763 23311 No 151 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=87.80 E-value=0.036 Score=28.47 Aligned_cols=398 Identities=11% Similarity=-0.004 Sum_probs=141.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccccc-ccccccchHHH-HHHHHHHHHHHhh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTS-YTTPWQSIGAR-GLNNLASKLMLAL 78 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~-~~~~~dst~~~-a~~~Laa~l~~~l 78 (532) |=..-+-+.. .. +..|..-...|-.+..++-=.... .+..|..- ..+.....+.. |++.+|..+ + T Consensus 1 ~~~~~~~~~~---~~------~~~~~~~~~~~~~~~~~~~~~~~g-~~~~g~~v~~~~al~~~~V~~~v~~Ia~~i-A-- 67 (454) T protein:vir:93 1 MWNLLRRTRK---NQ------KSGRDVREAGWTSLFQAVAEPFAG-AWQQGVKADPEAVLSFHAVFACISLISQDI-A-- 67 (454) T ss_pred CCCccccCcc---cc------cccccccchhhhhhhhhhhhhhcc-hhhcCcccChHHhhccHHHHHHHHHHHHhh-c-- Confidence 3322111000 00 000111123344433222110000 00111110 01122222333 333333333 2 Q ss_pred cCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcC----ChHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 79 FPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNS----FRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 79 tpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~sn----f~~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) +-||.-..-...... .++.. ..++..+.+=| .+.=...++.+|...|||+.|+..+. T Consensus 68 ---~lp~~~~~~~~~g~~---------~~~~~------~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~- 128 (454) T protein:vir:93 68 ---KMRLRLMQTDAQGIR---------RETRR------GDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNA- 128 (454) T ss_pred ---cCceEEEEeccCCcc---------chhhh------HHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECC- Confidence 335643321110000 01111 11122333333 33445566678888999988875432 Q ss_pred ccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEE Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQ 234 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~ 234 (532) .+....+..++...+-+..+.+|++- | ++. ...+.....++ T Consensus 129 -~G~~~~L~~i~~~~v~v~~~~~g~~~--y-~~~----------------------------------~~~~~~~~~~~- 169 (454) T protein:vir:93 129 -RGQIKELRILDWNRVEPLVADDGEVF--Y-RIT----------------------------------PDRNCGITEAV- 169 (454) T ss_pred -CCcEEEEEEEcCcceEEEEcCCCcEE--E-EEE----------------------------------eccccccceeE- Confidence 12223333344444544445544321 1 100 00000000000 Q ss_pred EEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhh Q lcl|NC_015159. 235 EIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRR 314 (532) Q Consensus 235 ~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~ 314 (532) .+ ..-=+++.|+....+..||.||...+...+.....+.+.......-...|..++.-++.++++. T Consensus 170 ------------~~--~~~eViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~ 235 (454) T protein:vir:93 170 ------------TV--PAREVIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPGSITEEN 235 (454) T ss_pred ------------Ee--cCcceEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHH Confidence 01 1112345555555667899999999999998888888888887777788887777666666654 Q ss_pred hccCC----------C-ce-eecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHH Q lcl|NC_015159. 315 VAKAN----------T-GD-FVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYV 381 (532) Q Consensus 315 ~~~~~----------~-G~-~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r 381 (532) ..... + |. .+ -.+++...++.. ..+.+. .+..+..+..|-++|-.-.....+...-|-.-+.+. T Consensus 236 ~~~~~~~~~~~~~g~n~g~~~v--l~~g~~~~~l~~~~~d~q~-le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~ 312 (454) T protein:vir:93 236 AKKLKSNWDSGYTGENAGKTAI--LSNGAKYNPTTFSPVDSQT-VEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEAL 312 (454) T ss_pred HHHHHHHHHHHhcccccCCcee--ccCCceEEEcccChhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHH Confidence 42211 1 11 11 112223333332 234443 344556677888887432111111112222111111 Q ss_pred H-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeec--chHHHHHHHHHHHHHHH----HHH-- Q lcl|NC_015159. 382 A-GELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIAT--GLEALGRGHDLNKLNVF----IDY-- 452 (532) Q Consensus 382 ~-~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~--~l~~l~raq~~~~l~~~----~~~-- 452 (532) . .=....|.|.+.++..++-.-| +++ .+..++.++.. ..+...|+.....+... ... T Consensus 313 ~~~f~~~~l~P~~~~ie~~ln~~L-------------~~~-~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R 378 (454) T protein:vir:93 313 EQQYYSQCLQTLIESIELLLDEAL-------------ETG-ENESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEAR 378 (454) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhh-------------cCC-CCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHH Confidence 1 1234456666666666553222 211 11112211110 11222333222222110 000 Q ss_pred -HHhhcch-hhh-------hcCHHHHHHHHHHh-----cCCCHhH------------ccCCH--HHHHHHHHHHHH Q lcl|NC_015159. 453 -MIKLAGL-QDD-------DINLLDVKMRLANS-----LGMDTTG------------LILTQ--QDKQAKMAEAST 500 (532) Q Consensus 453 -laq~~p~-~~d-------~id~d~~~~~~a~~-----~Gv~p~~------------i~~s~--ee~~~~~~q~~~ 500 (532) +-.+.|. -.| .+..+.+-+.-... .|.+.+. .-.++ ..-...+--.+. T Consensus 379 ~~~gl~pi~ggD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~e~~~d~~~~~~~~~~~~ 454 (454) T protein:vir:93 379 KRENLPPLAGGDALYLQQQNYSLEALSRRDAREDPFASSGKTASVPQAVAASDGNKAITETEHDAVKAMFRGILKK 454 (454) T ss_pred HHhCCCCCCCCCeeeeccCccchHhhhccCcccCCCCCCccCCCCCCCCCCCCCCCCccCCccchhhhhhhhhhcC Confidence 1111121 001 11111111100000 0110000 00000 000000000000 No 152 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=87.26 E-value=0.04 Score=28.25 Aligned_cols=381 Identities=10% Similarity=0.068 Sum_probs=140.7 Q ss_pred CCCCCCCccCHHHHHHHHHHHHH---HhhhHHH-HHHHHHHhhccc--ccCCCCCcccccc--cccccchHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKN---DRGAYET-RAEDCATYTIPS--VFPSATADGSTSY--TTPWQSIGARGLNNLAS 72 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~---~R~~~e~-~w~e~~~~~~P~--~~~~~~~~~~~~~--~~~~dst~~~a~~~Laa 72 (532) |+..+ .-..|+.+++ .++++.. .|..+.-.--.. .+...+..|..-. .-+-.++--.|++.+|+ T Consensus 1 ~~~~~--------~mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~ 72 (432) T protein:vir:81 1 MPDEK--------KLGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQ 72 (432) T ss_pred CCchh--------hcchhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHHHH Confidence 54443 3444544332 1121110 011000000000 0000000111000 01112333445555555 Q ss_pred HHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHhhCceee Q lcl|NC_015159. 73 KLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLVAGNVLL 147 (532) Q Consensus 73 ~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~~ 147 (532) .+.+. |+.-..-.+..-.+. . +.-++..|+ +-| .+.-...++.++...|||.. T Consensus 73 ~ia~l------p~~~y~~~~~g~~~~---------~-------~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv 130 (432) T protein:vir:81 73 AIAAM------PLTMYMRTPDGRKEA---------V-------NHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYV 130 (432) T ss_pred hhhhC------ceeeEEecCCcceec---------c-------cchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEE Confidence 55433 432111111000000 0 111222332 222 23345566678888999977 Q ss_pred eecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCC Q lcl|NC_015159. 148 YIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEA 227 (532) Q Consensus 148 ~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~ 227 (532) ++..+ .+....+..++...+-+..|.+|++. |+ T Consensus 131 ~i~~~---~g~~~~L~~l~~~~v~v~~~~~g~~~--y~------------------------------------------ 163 (432) T protein:vir:81 131 RKVVT---DGRIESLQYLANDRLTITTDPKGNTA--YR------------------------------------------ 163 (432) T ss_pred EEEec---CCcEEEEEEEcCCceEEEECCCCcEE--EE------------------------------------------ Confidence 76432 23333444444556666666655321 11 Q ss_pred CeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCc Q lcl|NC_015159. 228 MVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPN 307 (532) Q Consensus 228 ~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~ 307 (532) ++..+|... .+..+ =+++.|....+| .||.||...+...+.......+.......-...|-.++.-+ T Consensus 164 -----~~~~~g~~~-----~~~~~--~iih~r~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~ 230 (432) T protein:vir:81 164 -----YRRTDGQMI-----DIPKQ--QIWKIMGYSLDG-ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID 230 (432) T ss_pred -----EEecCceEE-----EEccc--cEEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC Confidence 011111100 00001 123344444455 79999999988888888877777777666677787666666 Q ss_pred cccChhhhccCC-------C-ceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--cccCC-CCCCCH Q lcl|NC_015159. 308 GVTQIRRVAKAN-------T-GDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--AVQRG-GDRVTA 375 (532) Q Consensus 308 g~~~~~~~~~~~-------~-G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~-~~~~TA 375 (532) +.++++...... + |.+.. -.++....++.. ..+.+. .+..+..+..|-++|-.-. +...+ +..-|. T Consensus 231 ~~l~~e~~~~~~~~~~~~~nag~~~v-l~~g~~~~~l~~~~~d~q~-le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~ 308 (432) T protein:vir:81 231 RFLTDDQYDSFAKKVSGSVEAGRAPL-LEGGMDVKSLGLNPVDAQL-LQSRQYSVESICRFFGVPPSMIGHSSAGTTSWG 308 (432) T ss_pred CCCCHHHHHHHHHHHhhhhcCCCcee-cCCCceEEEccCCHHHHHH-HHHHHHHHHHHHHHhCCCHHHcCCcCCcccccc Confidence 666665432111 1 11110 112222233332 234443 3445677778888884221 11111 111223 Q ss_pred HHHHHHHHHH-HHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee-ec---chHHHHHHHHHHHHHHH- Q lcl|NC_015159. 376 EEIRYVAGEL-EDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI-AT---GLEALGRGHDLNKLNVF- 449 (532) Q Consensus 376 tEi~~r~~E~-~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~-v~---~l~~l~raq~~~~l~~~- 449 (532) +-+.+..... ...|.|.+.+++.||-.-| |++-......+++ +. ..+...|+.-.+.+... T Consensus 309 sn~eq~~~~f~~~tl~P~~~~ie~~l~~kL-------------l~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G 375 (432) T protein:vir:81 309 SGIESQQLGFLTMTLSPWLRRIEQSIALNL-------------LSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNG 375 (432) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhhc-------------cCccccCceEEEeechhhhccCHHHHHHHHHHHHhCC Confidence 3333333332 2356666666655554433 2221111111121 00 11222233222222110 Q ss_pred ---HH---HHHhhcchh--hhh-------cCHHHHHHHHH----HhcC-CCHhHccC Q lcl|NC_015159. 450 ---ID---YMIKLAGLQ--DDD-------INLLDVKMRLA----NSLG-MDTTGLIL 486 (532) Q Consensus 450 ---~~---~laq~~p~~--~d~-------id~d~~~~~~a----~~~G-v~p~~i~~ 486 (532) .. .+-.+.|.. .+. +..+..-.... ..-+ -+..++-+ T Consensus 376 ~~t~NE~R~~~glpp~~g~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~n~~~~~~~~ 432 (432) T protein:vir:81 376 LMTRDEAREIEGLPKLGGNAAVLTVQSAMVPLDSIGLQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred CCCHHHHHHHhCCCCCCCCcceEeecCcccchhhhccCCCCCCCCCCCCcccccccC Confidence 00 001111110 000 11111100000 0000 01111222 No 153 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=86.62 E-value=0.044 Score=28.01 Aligned_cols=259 Identities=14% Similarity=0.077 Sum_probs=114.3 Q ss_pred CCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-----hcCChHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 80 PVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-----SNSFRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 80 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-----~snf~~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) =++-||--.. .+. ..+..| ...|. ..+.+.=+..++.++..+|||++++..+. T Consensus 1 ia~l~~~~~~-~~~-------------~~~~~l-------~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~- 58 (278) T protein:vir:78 1 MASLPLKMYE-DYK-------------VVNTEV-------SDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI- 58 (278) T ss_pred CccceeEEEe-cCc-------------ccccHH-------HHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECC- Confidence 0122332111 000 011111 12222 12244456677788999999988875431 Q ss_pred ccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEE Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQ 234 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~ 234 (532) .+....+..++...+-+..+.+|... ++.+ . T Consensus 59 -~G~~~~l~~l~~~~v~v~~~~~~~~~--~y~~----------------------------------------------~ 89 (278) T protein:vir:78 59 -YHQPSKLFLLNPDVVEMLIENQSREL--YYSI----------------------------------------------H 89 (278) T ss_pred -CCcEEEEEEECCceeEEEEcCCCceE--EEEE----------------------------------------------E Confidence 12223333333344444444443221 1111 0 Q ss_pred EEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhh Q lcl|NC_015159. 235 EIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRR 314 (532) Q Consensus 235 ~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~ 314 (532) ...|... .++ ..-++..|.....+..||.||...+...+...+...+..+... ...|..++..++.++.+. T Consensus 90 ~~~g~~~-----~~~--~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~--~~~~~~i~~~~~~l~~e~ 160 (278) T protein:vir:78 90 AATGNKL-----IVH--NMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEM--QKPDSFMLKYGSNVGKEK 160 (278) T ss_pred cCCceEE-----EEc--cccEEEECCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHh--cCCCcEEEEeCCCCCHHH Confidence 0011100 111 1123444544455678999999999988888777766654333 234556666666665554 Q ss_pred hcc---------CCCceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh-cc--cCCCCCCCHHHHHHH Q lcl|NC_015159. 315 VAK---------ANTGDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS-AV--QRGGDRVTAEEIRYV 381 (532) Q Consensus 315 ~~~---------~~~G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~--~~~~~~~TAtEi~~r 381 (532) ... ...|.++. -.+++...++.. ..+.+ ..+..+...+.|-.+|=... +. ..++..-|++|.. T Consensus 161 ~~~~~~~~~~~~~~~g~~~v-l~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~-- 236 (278) T protein:vir:78 161 RQQVLEDFKQYYEENGGILF-QEPGVEIEPLPKKYVSED-IVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELN-- 236 (278) T ss_pred HHHHHHHHHHHhccCCCcee-cCCCceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH-- Confidence 311 11222221 112233333332 23443 34455667778888873321 11 1122233444422 Q ss_pred HHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc-cccceeecchHHH Q lcl|NC_015159. 382 AGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKE-AVEPAIATGLEAL 437 (532) Q Consensus 382 ~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~-~~~~~~v~~l~~l 437 (532) ..+...-+.|++.+....+.+. +||+-... ...+.+ -++.| T Consensus 237 ------------~~~~~~~l~P~~~~i~~~ln~~-L~~~~e~~~g~~~~f--~~~~l 278 (278) T protein:vir:78 237 ------------RFYLQHTLLPIVKQYEEEFNRK-LLTKTDREKIGILNL--TLNLI 278 (278) T ss_pred ------------HHHHHHHHHHHHHHHHHHHHhh-cCChhHhcCCceEEE--ecccC Confidence 1333444666666665555433 55542211 122232 12233 No 154 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=85.75 E-value=0.05 Score=27.69 Aligned_cols=343 Identities=12% Similarity=0.054 Sum_probs=134.5 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhH---HHHHHHHHHhhcccccCCCCCccccc-ccccc-cchHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAY---ETRAEDCATYTIPSVFPSATADGSTS-YTTPW-QSIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~---e~~w~e~~~~~~P~~~~~~~~~~~~~-~~~~~-dst~~~a~~~Laa~l~ 75 (532) |+ .|+.++..+.+. ...|-+. ..+...... ..+..- ..... .++--.|++.+|+.+ T Consensus 1 M~--------------~f~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~-~~~~~v~~~~al~~~~v~~~i~~ia~~i- 61 (386) T protein:vir:49 1 MP--------------IFNITNLATESPPINQESFFDI---ADSDFLASL-NSSEWVSAENALKNSDLFSIISQLSNDL- 61 (386) T ss_pred Cc--------------hhhhhccCCCCcccchhhhhhh---hhccccccc-cCCceechhhhhccHHHHHHHHHHHHHh- Confidence 32 233333333221 1222222 222211111 111100 00111 233234444444433 Q ss_pred HhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhc----CChHHHHHHHHHHHhhCceeeeecc Q lcl|NC_015159. 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESN----SFRPTLHAAIKQLLVAGNVLLYIPS 151 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~G~~~~~v~~ 151 (532) +. +| + ++ .+.. .+. .+.+- +.+.=+..++.+|..+|||+.++.. T Consensus 62 a~-~p----~-~~--~~~~-------------~~~-----------l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r 109 (386) T protein:vir:49 62 AT-AK----I-TT--SRKQ-------------LQG-----------IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWR 109 (386) T ss_pred hh-Cc----e-ee--ccch-------------hhh-----------hhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEE Confidence 33 22 2 11 1111 010 11122 2344455667788899999888754 Q ss_pred cccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEE Q lcl|NC_015159. 152 TEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFR 231 (532) Q Consensus 152 ~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~ 231 (532) +. .+....+..+|...+-+..+.+|... + T Consensus 110 ~~--~g~~~~l~~i~~~~v~v~~~~~~~~~--~----------------------------------------------- 138 (386) T protein:vir:49 110 ND--NGRDMKWEYLRPSQVSFNRLDNQNGL--Y----------------------------------------------- 138 (386) T ss_pred CC--CCcEEEEEEecCceeEEEEcCCCceE--E----------------------------------------------- Confidence 32 23334444555555555544433211 0 Q ss_pred EEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccC Q lcl|NC_015159. 232 SYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQ 311 (532) Q Consensus 232 s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~ 311 (532) ..+...+... +....+ ...=+++.|+....+..||.||..-+...+.......+.......-...|..++.-++.++ T Consensus 139 y~~~~~~~~~-~~~~~~--~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~ 215 (386) T protein:vir:49 139 YNITFDDPHI-APKQHV--PQNDILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGL 215 (386) T ss_pred EEEEEcCccc-cceeEE--ccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCC Confidence 0011111000 000111 1122566677667788999999999999999999999988888888888887776555555 Q ss_pred hhhh----------ccCCCceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh-cccC-CCCCCCHHHH Q lcl|NC_015159. 312 IRRV----------AKANTGDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS-AVQR-GGDRVTAEEI 378 (532) Q Consensus 312 ~~~~----------~~~~~G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~-~~~~~TAtEi 378 (532) .+.. .....+.++- .+++...++.. ..+.+ ..+..+..+..|-.+|-.-. +... ....-+++.+ T Consensus 216 ~~~~~~~~~~~~~~~~n~g~~~vl--~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~ 292 (386) T protein:vir:49 216 LDFKTKVSRSRQAMKQMQGGPLVL--DDLEDFTPLEIKSNVAQ-LLSQADWTTGQFAKVYGIPESIVGGDGDQQSSLEMI 292 (386) T ss_pred hHHHHHHHHHHHHhccCCCCceec--CCCceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCccchHHHH Confidence 4321 1111111211 12233334432 22333 34556777888988884321 1111 2222233322 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHH-HHHHHhcCCCCCCccccccce---eecchHHHH------HHHHHHHHHH Q lcl|NC_015159. 379 RYVAGELEDTLGGVYSLLSQELQLPLVKIL-LKELQATSKIPNLPKEAVEPA---IATGLEALG------RGHDLNKLNV 448 (532) Q Consensus 379 ~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~-~~il~r~g~lp~~p~~~~~~~---~v~~l~~l~------raq~~~~l~~ 448 (532) . +-....+-|.+..+..++-.-|..++ |++ ..+.... +..-+..|. +++-.+.+.. T Consensus 293 ~---~~~~~~i~~~l~~i~~~~~~~l~~~~~~~~-----------~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~ 358 (386) T protein:vir:49 293 Y---NIYFKSVSRYLRPFVSEMSKKLSCEVDVDI-----------SPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQ 358 (386) T ss_pred H---HHHHHHHHHHHHHHHHHHHHHhcchhcccc-----------hhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhh Confidence 2 11223344444444333322221110 000 0000000 001111111 1111111100 Q ss_pred --HH-HHH---Hh-hcchh----hhhcC Q lcl|NC_015159. 449 --FI-DYM---IK-LAGLQ----DDDIN 465 (532) Q Consensus 449 --~~-~~l---aq-~~p~~----~d~id 465 (532) +. ..+ -. ..++. -+.=| T Consensus 359 ~~~~~~~~~~~~~~~~~~~~gGd~~~~~ 386 (386) T protein:vir:49 359 AEILPKELPDGKNPNRTSLKGGEINEQD 386 (386) T ss_pred CCCCCCcCcchhccCCCCCCCCCCCCCC Confidence 00 000 00 01111 11122 No 155 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=84.93 E-value=0.057 Score=27.42 Aligned_cols=414 Identities=11% Similarity=0.032 Sum_probs=147.7 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHH-HHH--HHHHhhcccccCCCCCcccccc-cccc-cchHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYET-RAE--DCATYTIPSVFPSATADGSTSY-TTPW-QSIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~-~w~--e~~~~~~P~~~~~~~~~~~~~~-~~~~-dst~~~a~~~Laa~l~ 75 (532) |+=-++ +.+|...-.. ...+. .|. +-+.+.+-. .+..|..-. .... .++--.|++.+|+.+. T Consensus 1 Mg~~~~-------l~~r~~~~~~--~~~~~~~~~~~~~~~~~~~~----~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA 67 (457) T protein:vir:13 1 MGFWSA-------LFGRGHSPAL--DGIEARAWEPYDPSIYNLGA----VAASGETVTPHDALQVSAVFASVRLLSETIA 67 (457) T ss_pred Cchhhh-------hhcccccccc--cccccccccccchHHHhhcc----cccCCceechHHhhccHHHHHHHHHHHHhhc Confidence 543331 1111111000 00000 010 000000000 000111000 1112 2233345555555544 Q ss_pred HhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhc----CChHHHHHHHHHHHhhCceeeeecc Q lcl|NC_015159. 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESN----SFRPTLHAAIKQLLVAGNVLLYIPS 151 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~G~~~~~v~~ 151 (532) + + ||--..-.+... .++. ...++..++.. +.+.-+..++.++..+||+.+++.. T Consensus 68 ~-l-----p~~~~~~~~~~~----------~~~~------~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~ 125 (457) T protein:vir:13 68 T-L-----PLSTYSKRGGSR----------KEIV------TPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRW 125 (457) T ss_pred c-C-----ceEEEEecCCcc----------cccc------cchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 3 2 332221111000 0111 11222344432 2344566677788889999888743 Q ss_pred cccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEE Q lcl|NC_015159. 152 TEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFR 231 (532) Q Consensus 152 ~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~ 231 (532) + .+..+.+..++...+-+..+..+... ...|. T Consensus 126 ~---~g~~~~l~~l~p~~v~v~~~~~~~~~---------------------------------------------~~~~~ 157 (457) T protein:vir:13 126 Q---GPNIVGLDVLDPTKIHVHMVMVDGLR---------------------------------------------RKVFE 157 (457) T ss_pred c---CCcEEEEEEEccCceEEEEecCCCcc---------------------------------------------ceeEE Confidence 2 12222222222233333332222100 00112 Q ss_pred EEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccC Q lcl|NC_015159. 232 SYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQ 311 (532) Q Consensus 232 s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~ 311 (532) .|....+.. ......+ ..--+++.|+....+..||.||...+...+.....+.+.......-...|..++.-++.++ T Consensus 158 ~y~~~~~~~-~~~~~~~--~~~diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls 234 (457) T protein:vir:13 158 AYDIDADGN-EVLLGWF--TPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMS 234 (457) T ss_pred EEEEecCCc-eeeEEee--CccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCC Confidence 221111110 0111111 1223566666667778899999999999999999998888888888888988887777777 Q ss_pred hhhhccCC-----------C-ce--eecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--cccCCCCCCC Q lcl|NC_015159. 312 IRRVAKAN-----------T-GD--FVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--AVQRGGDRVT 374 (532) Q Consensus 312 ~~~~~~~~-----------~-G~--~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~~~~~T 374 (532) ++...... + |. +++ +++...++.. ..+.+. .+..+..+..|-++|-.-. +...+....+ T Consensus 235 ~e~~~~~~~~~~~~~~g~~nag~~~vl~---~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~ 310 (457) T protein:vir:13 235 EEGLARAREAWRAANSGVDNAHRVALLT---EGAKFSKVAMSPDEAQF-LQTRQFQVPEIARIFGVPPHLISDATNSTSW 310 (457) T ss_pred HHHHHHHHHHHHHHhcCccccCcceecC---CCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCcccc Confidence 76542211 1 11 122 2222233322 234443 3444566778888884321 1111221122 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 375 AEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMI 454 (532) Q Consensus 375 AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~la 454 (532) ..-+.+.... +...-|.|++.++...+. ..++++.......+.+ .++.|-|. +......++..+. T Consensus 311 ~sn~eq~~~~-----------f~~~tl~P~~~~ie~~ln-~~L~~~~~~~~~~i~f--d~~~l~~~-D~~~r~~~~~~~~ 375 (457) T protein:vir:13 311 GSGLAEQNIA-----------FTMFSLRPWLERIEAGFN-RLLFAETADRFRFVKF--NLDEIKRG-APKERMELWSLGL 375 (457) T ss_pred cchHHHHHHH-----------HHHHHHHHHHHHHHHHHH-HhhcCccccCceeEEe--echhhhcc-CHHHHHHHHHHHH Confidence 2223322222 222234444444433332 2244433222222222 12333322 1111222222222 Q ss_pred hhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCH--HHHH-HHHHHHHHHHHHHHHHHhhhH--HHHHHHHhhcccccCC Q lcl|NC_015159. 455 KLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQ--QDKQ-AKMAEASTAAGMVTAGQQMGA--AGGQAAAAMMQQQAGL 529 (532) Q Consensus 455 q~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~--ee~~-~~~~q~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~g~ 529 (532) +. + .+..+++ -+..|.+|- ... ++.- ..-...-.++...+.+....+ +.... .+--.+..|- T Consensus 376 ~~-G----~~T~NE~----R~~~gl~Pi---~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~g~ 442 (457) T protein:vir:13 376 QN-G----IYSIDEV----RAAEDMTPL---PDGLGEKYRVPLNLGEVGEEPEPEPAPAPPAIEPPAEE-PDEEPEPEGK 442 (457) T ss_pred hC-C----CcCHHHH----HHHhCCCCC---CCCcccceeeccccccccccccccccCCCCCCCCCccc-cCCCCCCCCC Confidence 11 0 1223332 222454441 110 0000 000000000000000000000 00000 0000111111 Q ss_pred CCC Q lcl|NC_015159. 530 PTQ 532 (532) Q Consensus 530 ~~~ 532 (532) +.. T Consensus 443 ~d~ 445 (457) T protein:vir:13 443 PDD 445 (457) T ss_pred Ccc Confidence 111 No 156 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=83.28 E-value=0.07 Score=26.92 Aligned_cols=379 Identities=12% Similarity=0.031 Sum_probs=140.3 Q ss_pred CCCCCCCc--cCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccccc-c-ccccc-chHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTG--FAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTS-Y-TTPWQ-SIGARGLNNLASKLM 75 (532) Q Consensus 1 m~~~~~~~--~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~-~-~~~~d-st~~~a~~~Laa~l~ 75 (532) |-+++-+- ..+..+..|+..+...+..-.+ +....+-|... .+..+... . ..... ++--.|++.+|+.+ T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~f~~~~~~~~---~~~~~~~~~~~--~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~i- 74 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTP---NQGSQTGPVSA--HGYLGDSSINDERILQISTVWRCVSLISTLT- 74 (424) T ss_pred CCCCccccccCCCCchHHHHHhhccccccccc---cchhhcccccc--ccccccccccHHHhhccHHHHHHHHHHHHhh- Confidence 77775322 2233343443332212111000 01111222110 00001000 0 11122 22223444444444 Q ss_pred HhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHhhCceeeeec Q lcl|NC_015159. 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLVAGNVLLYIP 150 (532) Q Consensus 76 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~~~v~ 150 (532) + +-||.-.......-.+ ++ ..+.-+...|+ +-| .+.=...++.++..+|||.+++. T Consensus 75 A-----~lp~~vy~~~~~~~~~---------~~-----~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~ 135 (424) T protein:vir:18 75 A-----CLPLDVFETDQNDNRK---------KV-----DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD 135 (424) T ss_pred c-----cCceEEEEeccCCcee---------ee-----ccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEE Confidence 2 3354322211100000 00 01112233443 223 33345566788899999988875 Q ss_pred ccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeE Q lcl|NC_015159. 151 STEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVF 230 (532) Q Consensus 151 ~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~ 230 (532) .+ ..+.....+|+....|....++. .++.++ T Consensus 136 r~----~~G~~~~L~~l~~~~v~v~~~~~--~~~y~~------------------------------------------- 166 (424) T protein:vir:18 136 RN----SAGDVISLLPLQSANMDVKLVGK--KVVYRY------------------------------------------- 166 (424) T ss_pred EC----CCCcEEEEEEecCcceEEEEcCC--eEEEEE------------------------------------------- Confidence 32 12223444444322221111110 111101 Q ss_pred EEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec-Cccc Q lcl|NC_015159. 231 RSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVN-PNGV 309 (532) Q Consensus 231 ~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~-~~g~ 309 (532) ..+|... .+. .--+++.|+...+| .||.||...+...+.....+.+.......-...|..++. +++. T Consensus 167 ----~~~g~~~-----~~~--~~eVihir~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~ 234 (424) T protein:vir:18 167 ----QRDSEYA-----DFS--QKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV 234 (424) T ss_pred ----EeCCeEE-----Eec--cccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcC Confidence 1111110 111 11235556544344 899999999998888888888888888888888875554 3444 Q ss_pred cChhhhc----------cCCC-ceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh-cc-cCCCCCCCH Q lcl|NC_015159. 310 TQIRRVA----------KANT-GDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS-AV-QRGGDRVTA 375 (532) Q Consensus 310 ~~~~~~~----------~~~~-G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~-~~~~~~~TA 375 (532) ++.+... .+.+ |.+ .--.+++...++.. ..+.+. .+..+..+..|-++|=... +. ..+...-+. T Consensus 235 l~~e~~~~~~~~~~~~~~~~nag~~-~vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~ 312 (424) T protein:vir:18 235 LTEQQRSQVEENFKEIAGGPVKKRL-WILEAGFSTSAIGVTPQDAEM-MASRKFQVSELARFFGVPPHLVGDVEKSTSWG 312 (424) T ss_pred CCHHHHHHHHHHHHHHhCCcccCCc-eeccCCceEEecCCChhHHHH-HHHHHHhHHHHHHHhCCCHHHhCCCCCccccc Confidence 5544321 1111 111 11112333334432 234443 3455666778888884221 11 112111111 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee-ecch---HHHHHHHHHHHHHHH-- Q lcl|NC_015159. 376 EEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI-ATGL---EALGRGHDLNKLNVF-- 449 (532) Q Consensus 376 tEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~-v~~l---~~l~raq~~~~l~~~-- 449 (532) .-+.+.... +...-|.|++.++...+. ..+|++-......+.+ ...+ +...|+.-...+... T Consensus 313 sn~eq~~~~-----------f~~~tl~P~~~~ie~~ln-~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~ 380 (424) T protein:vir:18 313 SGIEQQNLG-----------FLQYTLQPYISRWENSIQ-RWLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGL 380 (424) T ss_pred ccHHHHHHH-----------HHHHHHHHHHHHHHHHHH-hhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCC Confidence 222222111 223344555555444432 2344432222222222 1111 223333333222210 Q ss_pred --HH---HHHhhcch-hhhh----cC---HHHHHHHH-HHhcCC Q lcl|NC_015159. 450 --ID---YMIKLAGL-QDDD----IN---LLDVKMRL-ANSLGM 479 (532) Q Consensus 450 --~~---~laq~~p~-~~d~----id---~d~~~~~~-a~~~Gv 479 (532) .. .+-.+.|. -.|. .| .+.+-+.. -...|. T Consensus 381 ~T~NE~R~~~gl~pi~ggD~~~~~~n~~~l~~~~~~~~~~~n~a 424 (424) T protein:vir:18 381 RTINEMRRTDNMPPLPGGDVAMRQAQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred cCHHHHHHHhCCCCCCCcCeeeeccCccchhhhhccCCccccCC Confidence 00 01122221 0111 11 11111000 011222 No 157 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=82.96 E-value=0.072 Score=26.83 Aligned_cols=384 Identities=10% Similarity=-0.032 Sum_probs=140.1 Q ss_pred ccccc--cchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHH--HHHH-HHHHhcCCh Q lcl|NC_015159. 55 YTTPW--QSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVE--RICM-NYMESNSFR 129 (532) Q Consensus 55 ~~~~~--dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve--~~~~-~~l~~snf~ 129 (532) +..+. +++.-.|++.+|..+.+ .||- +...+..... .........+..+|...+ ..+. ..+....+. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~------~p~~-i~~~~~~~~~-~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~ 72 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAG------FGIN-IIPHPEAEDP-DRDGEQYERVWDFWFGDDSNWQVGPMESERATAT 72 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhc------CCeE-EEEccCcccc-cchhhhhhhHHHHhhccCCCccccchhhHhhHHH Confidence 44433 35555677777776642 2332 2111100000 000011111222221110 0000 011223455 Q ss_pred HHHHHHHHHHHhhCceeeeecccccccCCcceEEEEec--ceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcc Q lcl|NC_015159. 130 PTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKL--HNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQG 207 (532) Q Consensus 130 ~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl--~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~ 207 (532) .-+..++.|+..+|||.+|+..+. .+-....+|| ...-+..|..+.+.. + T Consensus 73 ~~~~~~~~~l~l~Gn~~i~~~r~~----~G~~~~l~~l~~~~v~~~~d~~~~~~~-~----------------------- 124 (467) T protein:vir:31 73 NVLQTAWTDYEAIGWLTIEILTQT----DGTPTGLAYVPGHTIRKRMDERGFVQL-L----------------------- 124 (467) T ss_pred HHHHHHHHHHHhcCCeEEEEEECC----CCcEEEEEEeCCceeEeeeecceeEee-c----------------------- Confidence 566778889999999998876432 2223344444 333334443321110 0 Q ss_pred cCCCcceEEEEEEEEe-eCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHH Q lcl|NC_015159. 208 DQNPSEEVTIYTHVYR-DPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENL 286 (532) Q Consensus 208 ~~~~~~~v~i~~~v~~-~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l 286 (532) ......+.++...+. +..+..+..++....... +. ...+...=.++.|.....+..||.+|..-++..+...... T Consensus 125 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~ 200 (467) T protein:vir:31 125 -EEKEKYFGVAGDRYQTNGNGDLDPVFVDADDGST-GT--SVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAA 200 (467) T ss_pred -CCceeeEEeccccceeecccceeeeeeeeccccc-cc--eeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHH Confidence 000011111111111 112222222222222111 11 1112223356677766678899999999999888777777 Q ss_pred HHHHHHHHHHHhcCceeec-CccccChhhhccCC----------------------Cc---eeecCccc----ccccccc Q lcl|NC_015159. 287 YEAIVKMSMISSKVLFFVN-PNGVTQIRRVAKAN----------------------TG---DFVAGRKQ----DVEVFQL 336 (532) Q Consensus 287 ~~~~l~~~~~a~~p~~lv~-~~g~~~~~~~~~~~----------------------~G---~~v~g~~~----~~~~~~~ 336 (532) .+.......-...|..++. +++.++++.....+ ++ .++++... ++...++ T Consensus 201 ~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~l 280 (467) T protein:vir:31 201 QDYNIDFFENDGVPRIAIIVKGAELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPL 280 (467) T ss_pred HHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEec Confidence 7666666566667775543 45555554421111 00 01111110 0111111 Q ss_pred CC--ccchhHHHHHHHHHHHHHHHHHhhhh-cc--cCCCCC-CCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 337 EK--YNDFQVAKATADDIEKRLSYAFMLNS-AV--QRGGDR-VTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLK 410 (532) Q Consensus 337 ~~--~~~~~~~~~~i~~~~~rI~~af~~~~-~~--~~~~~~-~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~ 410 (532) .. ..+.+ ..+..+..+..|..+|-... +. ..++.. -.+++... .-....|.|.+.++..+|-.-|+.+... T Consensus 281 s~~~~~d~q-f~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~s~~e~~~~--~f~~~~l~P~~~~ie~~ln~~l~~~~~~ 357 (467) T protein:vir:31 281 TVGIDEEAS-FLEFRGRNEHDILKVHDVPPVIAGVVESGAFSTDAEEQRK--EFAEETIQPKQHDFGELLYELVHKQGLD 357 (467) T ss_pred cccChhhHH-HHHHHHHHHHHHHHHhCCCHHHcccCCCCCcccCHHHHHH--HHHHHHHHHHHHHHHHHHHHhhcchhhc Confidence 11 11211 23444556667888874321 11 111111 11232222 2234556777777777665555432211 Q ss_pred HHHhcCCCCCCccccccceeec--chHHHHHHHHHH-----------HHHHHHHH--------------H----Hhhcch Q lcl|NC_015159. 411 ELQATSKIPNLPKEAVEPAIAT--GLEALGRGHDLN-----------KLNVFIDY--------------M----IKLAGL 459 (532) Q Consensus 411 il~r~g~lp~~p~~~~~~~~v~--~l~~l~raq~~~-----------~l~~~~~~--------------l----aq~~p~ 459 (532) . .+..++..... ..+...|+.-.. .+...++. . ++..|. T Consensus 358 ~----------~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi~d~~~~~~~~~~~~~~~~~~~~ 427 (467) T protein:vir:31 358 A----------PDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDEFGFEPFPEEHVYGGETLVAEVTGGSGPG 427 (467) T ss_pred c----------CCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcccccCCcccccccccccCCC Confidence 0 00001111100 011111211111 11111100 0 000000 Q ss_pred -h-------hhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHH Q lcl|NC_015159. 460 -Q-------DDDINLLDVKMRLANSLGMDTTGLILTQQDKQA 493 (532) Q Consensus 460 -~-------~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~ 493 (532) . ...=..++.++.+... +.....+.--+.... T Consensus 428 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~ 467 (467) T protein:vir:31 428 GGIGDQIEQLVEDRADEIIDSYQAD--LETEQLIEIGANADS 467 (467) T ss_pred CcccCcCCCCCCCcccchHhhhhhc--cccchhhhhccccCC Confidence 0 0000122223333211 111111100000000 No 158 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=82.73 E-value=0.074 Score=26.77 Aligned_cols=405 Identities=13% Similarity=0.049 Sum_probs=145.9 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhh------hHHHHHHHHHHhhcccccCCCCCcccccc-ccccc-chHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRG------AYETRAEDCATYTIPSVFPSATADGSTSY-TTPWQ-SIGARGLNNLAS 72 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~------~~e~~w~e~~~~~~P~~~~~~~~~~~~~~-~~~~d-st~~~a~~~Laa 72 (532) |+=-+ .|..... .-...|..+.-.+.. .+. .+..|..-. ..... ++--.|++.+|. T Consensus 1 Mg~~~--------------~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~g~~v~~~~al~~~~v~~~i~~ia~ 64 (457) T protein:vir:62 1 MGFWS--------------ALFGRGHSPALDAAEGRAWEPYDPSIYN-LGA-TASSGERVTPHDALQVSAVFASVRLLSE 64 (457) T ss_pred Cchhh--------------hhhccccccccccccccccccchhhhhh-ccc-cccCCceechHHhhccHHHHHHHHHHHH Confidence 44322 2211100 000111111111000 010 011111100 01122 333345555555 Q ss_pred HHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHh----cCChHHHHHHHHHHHhhCceeee Q lcl|NC_015159. 73 KLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMES----NSFRPTLHAAIKQLLVAGNVLLY 148 (532) Q Consensus 73 ~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~----snf~~~~~~~~~dl~~~G~~~~~ 148 (532) .+.+ + ||.=..-.+..-. +++. ..+...+.+ -+.+.-+..++.++..+|||+++ T Consensus 65 ~iA~-l-----p~~~~~~~~~~~~----------~~~~------~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~ 122 (457) T protein:vir:62 65 TIAT-L-----PLSTYSKRGGTRK----------EIDT------PEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLA 122 (457) T ss_pred hHhh-C-----ceEEEEecCCccc----------cccc------hHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEE Confidence 4432 2 3321111110000 0100 011122222 23555667778888999999888 Q ss_pred ecccccccCCcceEEEEec--ceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCC Q lcl|NC_015159. 149 IPSTEQVEGQSNAPKLYKL--HNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPE 226 (532) Q Consensus 149 v~~~~~~~~~~~~~~~~pl--~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~ 226 (532) +..+ .++ ....+|| ..+.+.++..+... T Consensus 123 i~~~---~g~--~~~l~~l~p~~v~v~~~~~~~~~--------------------------------------------- 152 (457) T protein:vir:62 123 VRWA---GPN--IAGLDVLDPTKIHVHMVMVDGLR--------------------------------------------- 152 (457) T ss_pred EEeC---CCc--EEEEEEEcCcceEEEEeccCCcc--------------------------------------------- Confidence 7432 122 2333343 33433333222100 Q ss_pred CCeEEEEEEE-cCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec Q lcl|NC_015159. 227 AMVFRSYQEI-DGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVN 305 (532) Q Consensus 227 ~~~~~s~~~~-~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~ 305 (532) ...|..|.+. +|.. .....+..++ +++.|.....+..||.||..-+...+.....+.+.......-...|..++. T Consensus 153 ~~~~~~y~~~~~g~~--~~~~~~~~~e--iih~r~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~ 228 (457) T protein:vir:62 153 RKVFEAYDIDADGNE--VLLGWFTPRD--VLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVE 228 (457) T ss_pred ceeEEEEEEccCCce--eEEEeeCccc--eEEecCCCCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEE Confidence 0011112111 1111 1111111112 456666666677899999999999888888888888887777788887777 Q ss_pred CccccChhhhccCC-----------C-ceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--cccCCC Q lcl|NC_015159. 306 PNGVTQIRRVAKAN-----------T-GDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--AVQRGG 370 (532) Q Consensus 306 ~~g~~~~~~~~~~~-----------~-G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~~ 370 (532) -++.++++...... + |.+ .--.+++...++.. ..+.+. .+..+..+..|-++|-.-. +...+. T Consensus 229 ~~~~ls~e~~~~~~~~~~~~~~G~~nag~~-~vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~ 306 (457) T protein:vir:62 229 VPGTMSEEGLARAREAWRAANSGVDNAHRV-ALLTEGAKFSKVAMSPDEAQF-LQTRQFQVPEIARIFGVPPHLISDATN 306 (457) T ss_pred cCCCCCHHHHHHHHHHHHHHhcCccccCcc-eecCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCC Confidence 66776766542211 0 111 00112223333332 223443 3445567778888884321 111111 Q ss_pred CCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHH Q lcl|NC_015159. 371 DRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFI 450 (532) Q Consensus 371 ~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~ 450 (532) ...+..-+.+.... +...-|.|++.++...+. ..++++.......+.+ .++.|-|. +.+....++ T Consensus 307 ~~~~~sn~eq~~~~-----------f~~~~l~P~~~~ie~~ln-~~L~~~~~~~~~~i~f--d~~~l~~~-d~~~r~~~~ 371 (457) T protein:vir:62 307 STSWGSGLAEQNIA-----------FTMFSLRPWLERIEAGFN-RLLFAETADRFRFVKF--NLDEIKRG-APKERMELW 371 (457) T ss_pred cccccchHHHHHHH-----------HHHHHHHHHHHHHHHHHH-hhhcCccccCceEEEe--echhhhcc-CHHHHHHHH Confidence 11222222222222 222334555555444332 3344443333322333 23333332 111111222 Q ss_pred HHHHhhcchhhhhcCHHHHHHHHHHhcCCCHh------------HccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHH Q lcl|NC_015159. 451 DYMIKLAGLQDDDINLLDVKMRLANSLGMDTT------------GLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQA 518 (532) Q Consensus 451 ~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~------------~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~ 518 (532) ..+.+. + .+..++ +-+.+|.+|- .+....+..+.+ .+. +......+.... T Consensus 372 ~~~~~~-G----~~T~NE----~R~~~gl~pi~~g~~D~~~~~~n~~~~~~~~~~~-----~~~----~~~~~~~~~~~~ 433 (457) T protein:vir:62 372 SLGLQN-G----IYSIDE----VRAAEDMTPLPDGLGEKYRVPLNLGEIGEEPEPE-----PAP----APPAIDPPAEEP 433 (457) T ss_pred HHHHhC-C----CcCHHH----HHHHhCCCCCCCCCcceeeecccccccccccccc-----ccC----CCccCCCCccCC Confidence 221111 0 122222 2223444331 111111000000 000 000000000000 Q ss_pred H-HhhcccccCCCCC Q lcl|NC_015159. 519 A-AAMMQQQAGLPTQ 532 (532) Q Consensus 519 ~-~~~~~~~~g~~~~ 532 (532) + ..--.+..|-|.+ T Consensus 434 ~~~~~~~~~~~~~d~ 448 (457) T protein:vir:62 434 ADDEEPDNAEGDPDE 448 (457) T ss_pred CCCCCCCCCCCCCcc Confidence 0 0000111122211 No 159 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=82.54 E-value=0.076 Score=26.72 Aligned_cols=240 Identities=12% Similarity=0.056 Sum_probs=101.0 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhh-HHHHHHHHHHhhcccccCCCCCccccc-ccc-cccchHHHHHHHHHHHHHHh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGA-YETRAEDCATYTIPSVFPSATADGSTS-YTT-PWQSIGARGLNNLASKLMLA 77 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~-~e~~w~e~~~~~~P~~~~~~~~~~~~~-~~~-~~dst~~~a~~~Laa~l~~~ 77 (532) |.==.+ . .+|+. ....|..-.--+.|... +..+..- ... +-.++--.|++.+|+.+.+. T Consensus 1 MglF~~--------------~-~~r~~~~~~~~~~~~~~~~~~~~---~~~~~~v~~~~al~~~~v~~~i~~ia~~iA~l 62 (251) T protein:vir:46 1 MGIFYK--------------N-EKRDLQYNEDDLQMMVQTLPSFQ---GTKLRQYKDIEAIRHSDIFTAVMMIASDLARM 62 (251) T ss_pred CCcccc--------------c-cccccCCCccchhhhhhhhcccc---CcCcceechhhhhccHHHHHHHHHHHHhHhhC Confidence 542111 0 01110 11111000001122221 1111110 011 12333345566666655443 Q ss_pred hcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHH-HhcCCh----HHHHHHHHHHHhhCceeeeeccc Q lcl|NC_015159. 78 LFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYM-ESNSFR----PTLHAAIKQLLVAGNVLLYIPST 152 (532) Q Consensus 78 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~----~~~~~~~~dl~~~G~~~~~v~~~ 152 (532) ||.-.. .... . .+.-+...| .+-|-+ .-+.....++..+|||.+|+..+ T Consensus 63 ------p~~~~~-~~~~-~------------------~~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~ 116 (251) T protein:vir:46 63 ------PIRVTV-NGQI-N------------------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRD 116 (251) T ss_pred ------ceEEee-Cccc-c------------------ccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEC Confidence 343221 1100 0 011112223 233433 33445567888899998887543 Q ss_pred ccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEE Q lcl|NC_015159. 153 EQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRS 232 (532) Q Consensus 153 ~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s 232 (532) . .+....+..+|...+-+..|.+|++ +.. T Consensus 117 ~--~G~~~~L~~i~~~~v~v~~~~~g~~-------------------------------------------------~~~ 145 (251) T protein:vir:46 117 K--TGEPMNLTFRKTSEIELKSDARGRL-------------------------------------------------YYF 145 (251) T ss_pred C--CCcEEEEEEECCceEEEEECCCCcE-------------------------------------------------EEE Confidence 2 2333444444445666666665532 111 Q ss_pred EEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCcccc-C Q lcl|NC_015159. 233 YQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVT-Q 311 (532) Q Consensus 233 ~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~-~ 311 (532) ++..++.. .+....+..+ =+++.|+...+| .||.||...+...+...+...+.......-...|..++.-++.+ + T Consensus 146 ~~~~~~~~-~g~~~~~~~~--diiH~r~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~ 221 (251) T protein:vir:46 146 HQRIDSNG-NNIERNVKFE--DMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDN 221 (251) T ss_pred EEEeccCC-cceeEEECCc--cEEEecCcCCCC-eeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCCCC Confidence 11111110 0001111111 135555554444 79999999999999999999999988888888888666544433 3 Q ss_pred hhhhccCCC--ceeecCccccccccccCCccchhHHHHH Q lcl|NC_015159. 312 IRRVAKANT--GDFVAGRKQDVEVFQLEKYNDFQVAKAT 348 (532) Q Consensus 312 ~~~~~~~~~--G~~v~g~~~~~~~~~~~~~~~~~~~~~~ 348 (532) .+....... -..+.|. +..+.+. +++++ T Consensus 222 ~e~~~~~~~~~~~~~~g~-~n~g~~~--------~gm~~ 251 (251) T protein:vir:46 222 KKARDRAREEFPKVLVEL-NKLGKLS--------YSMNQ 251 (251) T ss_pred HHHHHHHHHHHHHHhcCc-ccccccc--------cccCC Confidence 332211110 0011111 1111111 12222 No 160 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=79.86 E-value=0.1 Score=26.06 Aligned_cols=371 Identities=11% Similarity=-0.031 Sum_probs=144.6 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHH---HHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYET---RAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLA 77 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~---~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 77 (532) |+-.++- |.-.+........ .-..+.++.- ...+.-. -.....+...+..+|-++-|.-++. T Consensus 1 M~~~~~~----------f~~~~r~~~~~~~~~~~~~~~~~~~g----~~~~~~~-v~~~~al~~~~v~~~i~~ia~~ia~ 65 (429) T protein:vir:10 1 MDSVKKF----------FNFEKRQTSQVIELNKDDEKLLEWLG----ISPSTIS-VKGKNALKVATVFACIKILSESVSK 65 (429) T ss_pred Cchhhhh----------hcccccCcccccccCCChHHHHHHhc----CCCCcce-echhhhhccHHHHHHHHHHHHhhcc Confidence 6555431 1100000001000 0011122211 1110000 0011233333333333333333333 Q ss_pred hcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----CChHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_015159. 78 LFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SN----SFRPTLHAAIKQLLVAGNVLLYIPST 152 (532) Q Consensus 78 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~~~v~~~ 152 (532) -||--..-.+....+. . +..+...|+ +- +.+.=...++.++..+||+.+++..+ T Consensus 66 -----l~~~~~~~~~~~~~~~---------~-------~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~ 124 (429) T protein:vir:10 66 -----LPLKIYQEDEYGIQRG---------T-------KHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFD 124 (429) T ss_pred -----CceEEEEecCCceeec---------c-------ccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEEC Confidence 2333221111110000 0 111223332 22 23444667778899999999887543 Q ss_pred ccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEE Q lcl|NC_015159. 153 EQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRS 232 (532) Q Consensus 153 ~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s 232 (532) . .+....+..+|...+.+..|..|.+..-++ . T Consensus 125 ~--~G~~~~L~~i~~~~v~v~~~~~~~~~~~~~----------------------------------------------~ 156 (429) T protein:vir:10 125 R--KGKVQALWPIDASKVTVYIDDVGLLNSKTK----------------------------------------------M 156 (429) T ss_pred C--CCcEEEEEEEcCceeEEEEcCcccccccce----------------------------------------------E Confidence 2 233334444445566666666554321111 1 Q ss_pred EEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccCh Q lcl|NC_015159. 233 YQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQI 312 (532) Q Consensus 233 ~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~ 312 (532) +|....... ... +..--+++.|.....+..||.||..-+...+.......+.......-...|..++.-++.+++ T Consensus 157 ~~~~~~~g~---~~~--~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~ 231 (429) T protein:vir:10 157 WYVVNTGGQ---QRV--LKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNE 231 (429) T ss_pred EEEEccCCe---EEE--EccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCH Confidence 111111000 001 112235666665566779999999999999999999999999988888889888776666665 Q ss_pred hhhccC-----------CC-ceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhhcc---cCCCCCCCHH Q lcl|NC_015159. 313 RRVAKA-----------NT-GDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNSAV---QRGGDRVTAE 376 (532) Q Consensus 313 ~~~~~~-----------~~-G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~---~~~~~~~TAt 376 (532) +..... .+ |.+. --.+++...++.. ..+.+. .+..+..++.|-.+|-.-... ..++..-+++ T Consensus 232 e~~~~~~~~~~~~~~g~~n~~~~~-vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e 309 (429) T protein:vir:10 232 DAKKVFRENFESMSSGLQNSHRIA-LMPVGYQFQPISLNMSDAQF-LENTELTIRQIATAFGIKMHQLNDLSKATLNNIE 309 (429) T ss_pred HHHHHHHHHHHHHhccccccCcee-ecCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHH Confidence 533111 10 1110 0112233334332 234443 344566678888888432111 1222222333 Q ss_pred HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcc-cccccee-ec---chHHHHHHHHHHHHHHH-- Q lcl|NC_015159. 377 EIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPK-EAVEPAI-AT---GLEALGRGHDLNKLNVF-- 449 (532) Q Consensus 377 Ei~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~-~~~~~~~-v~---~l~~l~raq~~~~l~~~-- 449 (532) |.... =....|-|.+..+++++-.- +|++... ....+.+ ++ ..+.-.|+...+.+... T Consensus 310 ~~~~~--f~~~~l~P~~~~ie~~ln~k-------------l~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~ 374 (429) T protein:vir:10 310 QQQQQ--FYTDTLQATLTMYEQEMTYK-------------LFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGF 374 (429) T ss_pred HHHHH--HHHHHHHHHHHHHHHHHHHh-------------hcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCC Confidence 32221 12233444444444443322 2222111 1111111 11 11222333322222210 Q ss_pred --H---HHHHhhcch-hhhh----cCH---HHH--------------HHHHHHhcCCC Q lcl|NC_015159. 450 --I---DYMIKLAGL-QDDD----INL---LDV--------------KMRLANSLGMD 480 (532) Q Consensus 450 --~---~~laq~~p~-~~d~----id~---d~~--------------~~~~a~~~Gv~ 480 (532) . -.+-.+.|. -.|. .|. |.+ -+.-.++ + T Consensus 375 ~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~d~~~~~~~k~g~~~~~~~~~~~e~---~ 429 (429) T protein:vir:10 375 LKPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNEG---N 429 (429) T ss_pred cCHHHHHHHhCCCCCCCcCeeeecccccchhhccccccCCCCCCCCCCCCCCCC---C Confidence 0 001111111 0110 010 000 0000011 1 No 161 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=78.88 E-value=0.11 Score=25.84 Aligned_cols=457 Identities=11% Similarity=0.040 Sum_probs=196.1 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHH-H--HHHHHhhcccccCCCCCc-cc----ccccc--cccchHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETR-A--EDCATYTIPSVFPSATAD-GS----TSYTT--PWQSIGARGLNNL 70 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~-w--~e~~~~~~P~~~~~~~~~-~~----~~~~~--~~dst~~~a~~~L 70 (532) |-+...-.+...+...-.......+..++.. + +..+.|.-+....+.... .. .+... .-++.+..+++.+ T Consensus 1 m~~~~~r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~~ 80 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGYQ 80 (553) T ss_pred CcchhhhhhcccccccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 5444211121111110001111111222211 1 122222122222111100 11 11112 3577888999988 Q ss_pred HHHHHHh-hcCCCCCccc-cCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHH----------HHhcCChHHHHHHHHH Q lcl|NC_015159. 71 ASKLMLA-LFPVGSSFFK-LNVSELEVKQSITSPEELTEIATGLAMVERICMNY----------MESNSFRPTLHAAIKQ 138 (532) Q Consensus 71 aa~l~~~-ltpp~~~WF~-l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~----------l~~snf~~~~~~~~~d 138 (532) ++.+++. ++|..+|=.+ |.-.+. .+.+.|-..||+.-... =-..+||.....++.. T Consensus 81 ~~nvVG~Gi~~~~~~~~~~l~g~~~------------~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~ 148 (553) T protein:vir:63 81 RDSIVGAQYRLNSMPDINVIPGATE------------EWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVG 148 (553) T ss_pred HHhhccCCceeeeccchhhhcCCCH------------HHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHH Confidence 8888775 7776554332 211111 12233444444443322 2355799888899999 Q ss_pred HHhhCceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEE Q lcl|NC_015159. 139 LLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIY 218 (532) Q Consensus 139 l~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~ 218 (532) +++-|-+++-+... +++.+.+-..+ +-+..+.|+-.. . .++ .-.|. T Consensus 149 ~~~dGE~~~~~~~~---------------------~~~~~~~~~~l--q~ie~drl~~~~--------~--~~~-~~~i~ 194 (553) T protein:vir:63 149 YVKTGEVLATAEWD---------------------RAANRPYATCF--QMVSTDRLSNPY--------Q--QLD-TPTLR 194 (553) T ss_pred HHhCCceEEEeeec---------------------cCCCCcccceE--EEechhhcCCCC--------C--CCC-CCeeE Confidence 88888775532211 11111111111 222222222110 0 111 12477 Q ss_pred EEEEeeCCCCeEEEEEEEcCccccccc---ccC------cccc--CceEEEEeee-cCCCccccchHHHHHHHHHHHHHH Q lcl|NC_015159. 219 THVYRDPEAMVFRSYQEIDGEIVAGTE---GEY------PLDS--CPWIPVRLIK-MPNEDYGRSFVEEYLGDLKSLENL 286 (532) Q Consensus 219 ~~v~~~~~~~~~~s~~~~~~~~~~~~~---~~~------g~~~--~P~~~~Rw~~-~~g~~YG~Gp~~~al~d~~~L~~l 286 (532) ..|+.|..++|...+.. .......+. ..+ .+.. -|-+++-|.. .+|..-|.+..--+|..++.|+.. T Consensus 195 ~GVE~d~~Gr~vaY~i~-~~hPgd~~~~~~~~~~~~r~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y 273 (553) T protein:vir:63 195 RGVQYDKRGRPQGYWIQ-VAHPGDLYQMAPDMYKWKFVQQSKPWGRRQVIHILEPREPDQSRGIADIVSGLKDMRMAKRF 273 (553) T ss_pred eeeEECCCCceEEEEee-ccCCCccccccccccceeeeccccccChhHheecccccCCCcccCCchHHHHHHHHHHHhHH Confidence 89999999887654432 222111100 000 0111 1223333433 688999999999999999999999 Q ss_pred HHHHHHHHHHHhcCceeecCcc-ccCh------------------------------hhhccCCCceeecCccc-ccccc Q lcl|NC_015159. 287 YEAIVKMSMISSKVLFFVNPNG-VTQI------------------------------RRVAKANTGDFVAGRKQ-DVEVF 334 (532) Q Consensus 287 ~~~~l~~~~~a~~p~~lv~~~g-~~~~------------------------------~~~~~~~~G~~v~g~~~-~~~~~ 334 (532) ..+.+.++..++.....+..+. .-.. ......+||.|+.-.++ ++... T Consensus 274 ~daeL~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~ 353 (553) T protein:vir:63 274 KEMSLQNAVINASYAAAIESELPPEFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLK 353 (553) T ss_pred HHHHHHHHHHhhhheeeeecCCChhhhhhhcccccccccccccccccccccccccccccceeecCceeeecCCCCeeeec Confidence 9999999999999886664221 0000 00111234444332222 23222 Q ss_pred ccC-CccchhHHHHHHHHHHHHHHHHH-h-hhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 335 QLE-KYNDFQVAKATADDIEKRLSYAF-M-LNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKE 411 (532) Q Consensus 335 ~~~-~~~~~~~~~~~i~~~~~rI~~af-~-~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~i 411 (532) ... ..++|. .....+...|..++ + +..+ ..|-..++=.-+++-..|..+.+--.=..|...|..|+..+++.. T Consensus 354 ~p~~p~~~~~---~F~~~~lr~iaaglGi~Ye~l-t~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ 429 (553) T protein:vir:63 354 PMGTPGGVGS---EFEASLNRHLASAFGMSYEEF-TRDFSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEE 429 (553) T ss_pred CCCCCCCCHH---HHHHHHHHHHHhhcCCCHHHH-hhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 211 223332 33344445555554 1 2222 245555666666666666666666555667778999999999999 Q ss_pred HHhcCCCCCCccc-------------cccceee----cchHHHHHHHHH-HHHHHHHHHHHhhcchhhhhcCHHHHHHHH Q lcl|NC_015159. 412 LQATSKIPNLPKE-------------AVEPAIA----TGLEALGRGHDL-NKLNVFIDYMIKLAGLQDDDINLLDVKMRL 473 (532) Q Consensus 412 l~r~g~lp~~p~~-------------~~~~~~v----~~l~~l~raq~~-~~l~~~~~~laq~~p~~~d~id~d~~~~~~ 473 (532) ..-.|.|+-++.. ...+..+ ..|+|+--++.. ..+.+-+.+ ...+ T Consensus 430 a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t-----------------~~~~ 492 (553) T protein:vir:63 430 AIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDAGLST-----------------YERE 492 (553) T ss_pred HHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHcCCCC-----------------HHHH Confidence 9999998744321 1112222 224444322211 111100000 1122 Q ss_pred HHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHH-H-hhhHHH----HHHHHhhcccccCCCCC Q lcl|NC_015159. 474 ANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAG-Q-QMGAAG----GQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 474 a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~-~-~~~~~~----~~~~~~~~~~~~g~~~~ 532 (532) +...|.|+..++ +|.+.+++.. ...... . ....+. .+.......+..+...| T Consensus 493 ~a~~G~D~~~v~---~q~a~e~~~~----~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 550 (553) T protein:vir:63 493 IARLGGDFRKSF---AQRAREDALL----KKYGLTFNLSAKRSLGDGRDAATGIAEDPAAAQTSQ 550 (553) T ss_pred HHHhCCCHHHHH---HHHHHHHHHH----HHcCCCCCCCCccccCCCcccCCCCCCCCCCCCccc Confidence 222355443222 1111111111 000000 0 000000 00000000001111111 No 162 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=76.81 E-value=0.13 Score=25.41 Aligned_cols=448 Identities=11% Similarity=0.031 Sum_probs=166.3 Q ss_pred CCCC----------CCCccCHH--------------HH-----HHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcc Q lcl|NC_015159. 1 MAEV----------EKTGFAAD--------------GA-----AAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADG 51 (532) Q Consensus 1 m~~~----------~~~~~~~~--------------~~-----~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~ 51 (532) |..+ ++..+++. .+ ...+..+-.-..+. ..-.-.+.|+.|..++. T Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~f~g----- 110 (765) T protein:vir:96 37 MIKLGKIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPY-VVPTMLQDWYNSQGFIG----- 110 (765) T ss_pred chhHHHHhhcccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhccCcc-chhhHHHhhhcccCCcc----- Confidence 1111 11110000 00 00000000000000 00001112222221111 Q ss_pred cccccccc--cchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCCh Q lcl|NC_015159. 52 STSYTTPW--QSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFR 129 (532) Q Consensus 52 ~~~~~~~~--dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~ 129 (532) . .+..+| +..+-.+|++.|-.+ -+.|+.+...+.+.. ++..++| .+.+.+-++. T Consensus 111 y-ql~alY~~~~l~rkiVd~pAeDa-------~R~g~~I~~~~~e~~---------~~~~~~l-------~~~~~rl~v~ 166 (765) T protein:vir:96 111 Y-QACAIISQHWLVDKACSMSGEDA-------ARNGWELKSDGRKLS---------DEQSALI-------ARRDMEFRVK 166 (765) T ss_pred H-HHHHHHHhCchhhhhhhcchHHh-------hcCCceeecCccccC---------HHHHHHH-------HHHHHHhhHH Confidence 0 011111 222333344443333 357888865432221 1122233 2344445788 Q ss_pred HHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEE--EEEEEeecHHHhhHHHHHHHHhhcc Q lcl|NC_015159. 130 PTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQ--IVTEDKIARAALPEDVRKSLEEAQG 207 (532) Q Consensus 130 ~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~--i~rk~~~~~~~l~~~~~~~~~~~~~ 207 (532) ..+.++++..-.||.+++++.-+..+. .. + .-||..-.|. .|.+.. ++-..+.+.. ++.+.... .. T Consensus 167 ~~l~ea~~~~RlyGga~i~i~i~~~D~-~~--l-~~PL~~~~I~---kg~~kgl~vldp~~~~~~-~v~e~~~D----p~ 234 (765) T protein:vir:96 167 DNLVELNRFKNVFGVRIALFVVESDDP-DY--Y-EKPFNPDGIA---PGSYKGISQIDPYWAMPQ-LTAESTAD----PS 234 (765) T ss_pred HHHHHHHHHhhhceeeEEEEEecccCc-ch--h-hccccccccc---cceeeEEEEechhhcccc-cchhcccc----cc Confidence 999999998888999988775432111 10 0 1133111111 111111 1111111110 00000000 00 Q ss_pred cCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_015159. 208 DQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLY 287 (532) Q Consensus 208 ~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~ 287 (532) ..+ +.+.+.| .+.++.++.. +.--|...| ++-+.+....-||++-.+.++..++...... T Consensus 235 sp~-fg~P~~y----------------~i~g~~IH~S-Rli~~~g~~--lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~ 294 (765) T protein:vir:96 235 AEH-FYEPDFW----------------IISGKKYHRS-HLVVVRGPQ--PPDILKPTYIFGGIPLTQRIYERVYAAERTA 294 (765) T ss_pred ccc-cCcceee----------------eecCceeccc-eEEEecCCC--chhhhccccCccCccHHHHHHHHHHHHHHHH Confidence 000 1111111 1222222211 111122223 1335556666779999998999998888887 Q ss_pred HHHHHHHHHHhcCceeecCcccc-Chhhhc-------c--CCCceeecCccccccccccCCccchhHHHHHHHHHHHHHH Q lcl|NC_015159. 288 EAIVKMSMISSKVLFFVNPNGVT-QIRRVA-------K--ANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLS 357 (532) Q Consensus 288 ~~~l~~~~~a~~p~~lv~~~g~~-~~~~~~-------~--~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~ 357 (532) ....+...++.-..+-++....+ +.+.+. . ...|.++-+..+++..+.. +|.-+...+....+.|. T Consensus 295 ~~~a~Ll~k~~~~v~k~~~~~~l~~~~~l~~r~~~~~~~r~n~g~~~id~ee~~e~~s~----~lsgl~d~l~~~~~~iA 370 (765) T protein:vir:96 295 NEAPLLAMSKRTSTIHVDVEKAIANEDAFNARLAFWIANRDNHGVKVIGIDETMEQFDT----NLSDFDSVIMNQYQLVA 370 (765) T ss_pred HHHHHHHHHhccceeeechHhhhccHHHHHHHHHHHHHhcCCceeEEecCCcceeEEec----ccCCHHHHHHHHHHHHH Confidence 77777666655555444322111 111111 1 1124444444444444432 34445566666677776 Q ss_pred HHHh--hhhcccC--CCCCCCHH-HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeec Q lcl|NC_015159. 358 YAFM--LNSAVQR--GGDRVTAE-EIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIAT 432 (532) Q Consensus 358 ~af~--~~~~~~~--~~~~~TAt-Ei~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~ 432 (532) -+.= ..-+... .+-.=|.. +++.-. --+..++...+.|.+++++.++.+.|.+|+ .+.+.+ . T Consensus 371 aas~IP~t~LfGqsp~GlnATGe~D~~nYy--------D~I~s~Qe~~l~p~le~L~~li~~s~~i~~----d~~i~F-n 437 (765) T protein:vir:96 371 AIAKTPATKLLGTSPKGFNATGEHETISYH--------EELESIQEHIFDPLLERHYLLLAKSESIDV----QLEIVW-N 437 (765) T ss_pred hhhCCCeeeeccCCcccccCcchHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHhcCCCC----cceEEe-C Confidence 6541 1111111 22222433 333222 234456677899999999999999876652 233333 2 Q ss_pred chHHHHHHHHHHH---HHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHH---------HHHHHH---H Q lcl|NC_015159. 433 GLEALGRGHDLNK---LNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQD---------KQAKMA---E 497 (532) Q Consensus 433 ~l~~l~raq~~~~---l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee---------~~~~~~---q 497 (532) +|..+....+++. .....+.+.+. ..|+.+++.+.+...-...-..+-..+.| .++... . T Consensus 438 pL~~~sekEkAei~~k~Aea~~~~~~~-----Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe~~~~~~~~~~~ 512 (765) T protein:vir:96 438 PVDSTTSQQQAELNNKKAATDEIYINS-----GVVSPDEVRERLRDDPRSGYNRLTDDQAETEPGMSPENLAELEKAGAQ 512 (765) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHhc-----CCCCHHHHHHHHhccccCCCCCCCccccccccCCCccccccccCCCcc Confidence 3444444444433 33333333222 24677777776653211000111111111 111000 0 Q ss_pred HHHH-HHHHHHHHhhhHHHHHHHHhhcccccCCCC---------------C Q lcl|NC_015159. 498 ASTA-AGMVTAGQQMGAAGGQAAAAMMQQQAGLPT---------------Q 532 (532) Q Consensus 498 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~---------------~ 532 (532) ...+ ....+...+.++..+........+.+.-|- + T Consensus 513 ~~~~~~e~~~~~a~p~~~eg~~~~~~~~p~~~~p~~~~~~~~~g~~~~~p~ 563 (765) T protein:vir:96 513 SAKAKGEAERAEAQAGAVEGAGDPVPAAPRGTKPLAKAAEEGAGEAATPPS 563 (765) T ss_pred cccccCccccccCCCCccCCCCcccccCCcccCCccccccccCccccCccc Confidence 0000 000000000000000000000000000000 0 No 163 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=76.41 E-value=0.14 Score=25.34 Aligned_cols=382 Identities=11% Similarity=0.047 Sum_probs=137.4 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHH--HHHHH----HhhcccccCCCCCcccccc-cccccchHHHHHHHHHHHHHHhhcCC Q lcl|NC_015159. 9 FAADGAAAAYNRLKNDRGAYETR--AEDCA----TYTIPSVFPSATADGSTSY-TTPWQSIGARGLNNLASKLMLALFPV 81 (532) Q Consensus 9 ~~~~~~~~r~~~lk~~R~~~e~~--w~e~~----~~~~P~~~~~~~~~~~~~~-~~~~dst~~~a~~~Laa~l~~~ltpp 81 (532) +.-+..-..|+.+++--.+.++. +..-. ..+.-..+...+..|..-. +......+..+|-++-|.-++.+ T Consensus 1 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~a~~~~aV~~~v~~Ia~~ia~l--- 77 (432) T protein:vir:97 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAVAAM--- 77 (432) T ss_pred CCCcccCchhhhhHhhcCCccccccccccccccCchhhhhhcccccccCcccchHhhhcchHHHHHHHHHHHhhccC--- Confidence 22233344444443332221110 00000 0000000000011111100 11222233333333333333333 Q ss_pred CCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_015159. 82 GSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLVAGNVLLYIPSTEQVE 156 (532) Q Consensus 82 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~~~v~~~~~~~ 156 (532) ||.-..-.+..-.+ ..+.-++..|+ +-| .+.=....+.++..+||+..++..+ . T Consensus 78 --p~~~y~~~~~g~~~----------------~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~---~ 136 (432) T protein:vir:97 78 --PLMMYMRTPDGRKE----------------AVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT---D 136 (432) T ss_pred --ceEEEEecCCCccc----------------ccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec---C Confidence 44322111100000 01111223332 222 3334455667888999998776432 2 Q ss_pred CCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEE Q lcl|NC_015159. 157 GQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEI 236 (532) Q Consensus 157 ~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~ 236 (532) ++...+..++...+.+..|.+|++ +|+ ++.. T Consensus 137 g~~~~L~~l~p~~v~v~~~~~g~~--~y~-----------------------------------------------~~~~ 167 (432) T protein:vir:97 137 GRIESLQYLANDRLTITTDTKGNT--AYR-----------------------------------------------YRRT 167 (432) T ss_pred CcEEEEEEEcCcceEEEEcCCCcE--EEE-----------------------------------------------EEec Confidence 333334444445555656666642 111 1111 Q ss_pred cCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhc Q lcl|NC_015159. 237 DGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVA 316 (532) Q Consensus 237 ~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~ 316 (532) +|... .+..++ +++.|....+| .||.||...+.-.+.......+.......-...|-.++.-++.++.+... T Consensus 168 ~g~~~-----~~~~~~--iih~r~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~ 239 (432) T protein:vir:97 168 DGQMI-----DIPRQQ--IWKIMGYSLDG-ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYD 239 (432) T ss_pred CceEE-----EEcccc--EEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCCCCCHHHHH Confidence 11100 001111 23344443445 79999999888777777777777777666667777666666666655432 Q ss_pred cC-------CC-ceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh-ccc-CC-CCCCCHHHHHHHHHH Q lcl|NC_015159. 317 KA-------NT-GDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS-AVQ-RG-GDRVTAEEIRYVAGE 384 (532) Q Consensus 317 ~~-------~~-G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~-~~-~~~~TAtEi~~r~~E 384 (532) .. .+ |.+ .--.+++...++.. ..+.+. .+.....+..|-++|-.-. +.. .+ +..-+..-+.+.... T Consensus 240 ~~~~~~~~~~nag~~-~vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~~~ 317 (432) T protein:vir:97 240 SFSKKVSGSVEAGRA-PLLEGGMDVKSLGLNPVDAQL-LQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLG 317 (432) T ss_pred HHHHHHhhhhcCCCc-eecCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHHHH Confidence 11 11 111 11112233333332 223332 3445666778888884321 111 11 111122222222222 Q ss_pred -HHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee-ec---chHHHHHHHHHHHHHHH----H---HH Q lcl|NC_015159. 385 -LEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI-AT---GLEALGRGHDLNKLNVF----I---DY 452 (532) Q Consensus 385 -~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~-v~---~l~~l~raq~~~~l~~~----~---~~ 452 (532) ....|.|.+.+++.++-. .+|++-......+++ .+ ..+...|+.-...+... . -. T Consensus 318 f~~~tl~P~~~~ie~~ln~-------------kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~ 384 (432) T protein:vir:97 318 FLTMTLSPWLRRIEQSIAL-------------NLLTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEARE 384 (432) T ss_pred HHHHHHHHHHHHHHHHHhh-------------hccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHH Confidence 122444544444444433 233322221111222 01 11233333322222210 0 00 Q ss_pred HHhhcchh--hh-------hcCHHHHHHHHHH--hcC---CCHhHccC Q lcl|NC_015159. 453 MIKLAGLQ--DD-------DINLLDVKMRLAN--SLG---MDTTGLIL 486 (532) Q Consensus 453 laq~~p~~--~d-------~id~d~~~~~~a~--~~G---v~p~~i~~ 486 (532) +-.+.|.. .+ .+..+.+-..... .-| -+...+-+ T Consensus 385 ~~glpp~~g~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:97 385 IEGLPKLGGNAAVLTVQSAMVPLDSIGLQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred HhCCCCCCCCcceEeecccccchhhhcccCCCCCCCCCCCcccccccC Confidence 11111110 00 0111111100000 000 11122333 No 164 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=75.54 E-value=0.15 Score=25.17 Aligned_cols=373 Identities=12% Similarity=0.035 Sum_probs=135.8 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccc---cCCC---CC-ccccccc-ccccchHH-HHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV---FPSA---TA-DGSTSYT-TPWQSIGA-RGLNNLA 71 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~---~~~~---~~-~~~~~~~-~~~dst~~-~a~~~La 71 (532) |-+++-+= +...=..-|+.+++ .|... +-.-|.. .... +. .+..-.. ......+. .|++.+| T Consensus 1 ~~~~~~~~-~~~~~~g~~~~~~~---~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia 71 (424) T protein:vir:18 1 MEEPKYTI-DLRTNNGWWARLQS---WFVGG-----RLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLIS 71 (424) T ss_pred CCCCcceE-eecCCCchHHHHHh---hhccc-----ccccccccccccccccccccccccccHHHhhccHHHHHHHHHHH Confidence 77765321 11111111222221 00000 0001110 0000 00 0110000 11122222 3444444 Q ss_pred HHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHhhCcee Q lcl|NC_015159. 72 SKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLVAGNVL 146 (532) Q Consensus 72 a~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~ 146 (532) +.+ .+-||--.......-.+ .+ ..+.-+...|+ +-| .+.=....+.+|..+||+. T Consensus 72 ~~i------A~lp~~~~~~~~~~~~~---------~~-----~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay 131 (424) T protein:vir:18 72 TLT------ACLPLDVFETDQNDNRK---------KV-----DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAY 131 (424) T ss_pred Hhh------ccCceEEEEeecCCcee---------ee-----ccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeE Confidence 433 23355322211110000 00 01112233443 334 3334556677889999998 Q ss_pred eeecccccccCCcceEEEEecc--eEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEee Q lcl|NC_015159. 147 LYIPSTEQVEGQSNAPKLYKLH--NFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRD 224 (532) Q Consensus 147 ~~v~~~~~~~~~~~~~~~~pl~--~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~ 224 (532) +++..+ ..+.....+|+. .+-+..+. |. ++ T Consensus 132 ~~i~r~----~~G~~~~L~pl~~~~V~v~~~~-~~---~~---------------------------------------- 163 (424) T protein:vir:18 132 ALVDRN----SAGDVISLLPLQSANMDVKLVG-KK---VV---------------------------------------- 163 (424) T ss_pred EEEEEC----CCCcEEEEEEecCcceEEEEcC-Ce---EE---------------------------------------- Confidence 887532 122233444442 22222221 11 00 Q ss_pred CCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee Q lcl|NC_015159. 225 PEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFV 304 (532) Q Consensus 225 ~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv 304 (532) ..|..+|... .+..+ -.++.|....+| .||.||...+...+.......+.......-...|..++ T Consensus 164 -------y~~~~~g~~~-----~~~~~--eIih~r~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil 228 (424) T protein:vir:18 164 -------YRYQRDSEYA-----DFSQK--EIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQIL 228 (424) T ss_pred -------EEEEeCCeEE-----Eeccc--cEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEE Confidence 0111122110 11111 234455443334 89999999998888888888888888888888887665 Q ss_pred c-CccccChhhhc----------cCCC-ceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--cccCC Q lcl|NC_015159. 305 N-PNGVTQIRRVA----------KANT-GDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--AVQRG 369 (532) Q Consensus 305 ~-~~g~~~~~~~~----------~~~~-G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~ 369 (532) . +++.+..+... .+.+ |.+ .--.+++...++.. ..+.|. .+..+..+..|-++|=.-. +...+ T Consensus 229 ~~~~~~l~~e~~~~~~~~~~~~~~g~nag~~-~vl~~g~~~~~l~~~~~d~q~-le~~~~~~~~Ia~~fgVPp~~lg~~~ 306 (424) T protein:vir:18 229 STGEKVLTEQQRSQVEENFKEIAGGPVKKRL-WILEAGFSTSAIGVTPQDAEM-MASRKFQVSELARFFGVPPHLVGDVE 306 (424) T ss_pred EeCCcCCCHHHHHHHHHHHHHHhCCcccCCc-eeccCCceEEecCCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCC Confidence 4 45555544321 1111 111 11112233333332 234444 3455666778888884321 11111 Q ss_pred CCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee-ecc---hHHHHHHHHHHH Q lcl|NC_015159. 370 GDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI-ATG---LEALGRGHDLNK 445 (532) Q Consensus 370 ~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~-v~~---l~~l~raq~~~~ 445 (532) ...-+.+-+.+... .+...-|.|++.++...+. ..+||+.......+.+ +.. .+...|+.-... T Consensus 307 ~~t~~~sn~eq~~~-----------~f~~~tl~P~~~~ie~~l~-~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~ 374 (424) T protein:vir:18 307 KSTSWGSGIEQQNL-----------GFLQYTLQPYISRWENSIQ-RWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKA 374 (424) T ss_pred CcccccccHHHHHH-----------HHHHHHHHHHHHHHHHHHH-hhcCCccccCCeEEEEechhhhccCHHHHHHHHHH Confidence 11111121211111 1223344555555444442 2344443222222222 111 122333332222 Q ss_pred HHHH-------HHHHHhhcch-hhhh-------cCHHHHHHHH-HHhcCC Q lcl|NC_015159. 446 LNVF-------IDYMIKLAGL-QDDD-------INLLDVKMRL-ANSLGM 479 (532) Q Consensus 446 l~~~-------~~~laq~~p~-~~d~-------id~d~~~~~~-a~~~Gv 479 (532) +... +-.+-.+.|. -.|. +-.+.+-... -...|. T Consensus 375 ~~~~G~~T~NE~R~~~gl~pi~gGD~~~~~~n~~~l~~~~~~~~p~~~ga 424 (424) T protein:vir:18 375 MGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred HHhCCCcCHHHHHHHhCCCCCCCcCeeeeccCccchHhhhccCCCccCCC Confidence 2210 0011122221 0111 1112211100 011122 No 165 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=75.39 E-value=0.15 Score=25.15 Aligned_cols=460 Identities=10% Similarity=-0.018 Sum_probs=180.3 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHH-HHHHHHHHHHhhcCCCCCccccCCCh Q lcl|NC_015159. 14 AAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARG-LNNLASKLMLALFPVGSSFFKLNVSE 92 (532) Q Consensus 14 ~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a-~~~Laa~l~~~ltpp~~~WF~l~~~d 92 (532) .++......+.-.+-...|+.-.+=|.=+..|.-.........+ ...+..++ .-.....|.++|++. |.++. T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~-~~~~~~dst~~~a~~~Laa~l~~~------ltpp~ 73 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQGG-YLPTPWQSVGSKGVNVLASKLMLS------LFPVN 73 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccc-cccccccccHHHHHHHHHHHHHHh------hcCCC Confidence 33332222222222234555555555444444322222211111 22233332 345677888888883 33332 Q ss_pred HHHhhhccChhH------HHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEe Q lcl|NC_015159. 93 LEVKQSITSPEE------LTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYK 166 (532) Q Consensus 93 ~~~~~~~~~~~~------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~p 166 (532) ..--.....+.+ ..+++.|++..-..+.+.+...-.++..+..+.++..- - +. T Consensus 74 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~--L-------------------~~ 132 (555) T protein:vir:17 74 TSFFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKH--L-------------------IV 132 (555) T ss_pred CcccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHH--H-------------------Hh Confidence 222222222222 22567777766555555555444444444444443221 0 01 Q ss_pred cceEEEeeCCCC-CeEEEEEEEeecHHH------hh-------HHHHHHHHhhcccCC--------CcceEEEEEEEEee Q lcl|NC_015159. 167 LHNFVVERDAYD-NVLQIVTEDKIARAA------LP-------EDVRKSLEEAQGDQN--------PSEEVTIYTHVYRD 224 (532) Q Consensus 167 l~~~~v~~d~~G-~vd~i~rk~~~~~~~------l~-------~~~~~~~~~~~~~~~--------~~~~v~i~~~v~~~ 224 (532) .|+.++-.+.++ ++-.+ ..+.+..+. +. ..+.+...+...... ++..+.++|++.++ T Consensus 133 ~G~a~ly~~~~~~~~~pl-~~y~v~~d~~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~ 211 (555) T protein:vir:17 133 TGNALLYQGKKNLKLYPL-DRFVVSRDGEGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGR 211 (555) T ss_pred HCeEEEEecCCceeEEEc-CeEEEeeCCCcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhccc Confidence 133222223332 22111 122222221 11 111112211111100 12234556666554 Q ss_pred CCC-CeEEEEEEE-----------cCcccccccccCccccCceEEEEeeecCCC-ccccchHHHH-HHHHHHHHHHHHHH Q lcl|NC_015159. 225 PEA-MVFRSYQEI-----------DGEIVAGTEGEYPLDSCPWIPVRLIKMPNE-DYGRSFVEEY-LGDLKSLENLYEAI 290 (532) Q Consensus 225 ~~~-~~~~s~~~~-----------~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~-~YG~Gp~~~a-l~d~~~L~~l~~~~ 290 (532) ..+ .....+|.+ .+....... ....+-||.-.=|...-=+ ..|--.|.-. ..-+-.+..|++. T Consensus 212 ~~~~~~~~~v~t~~~~~~~~~~~~~e~~~~~v~--~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l- 288 (555) T protein:vir:17 212 DKGKSNDALVYTYVCRKDGQVKWHQECDGKVIP--GSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEAL- 288 (555) T ss_pred ccCCCcceeEeecccccCCeeEEEEecCceecc--ccccccCcccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHH- Confidence 322 222222221 111100000 0023444443333332222 2233344432 2233333333333 Q ss_pred HHHHHHHhcCceeecCccccChhhhccCCCceeecCccccccccccCCccch---h----HHHHHHHHHHHHHHHHHhhh Q lcl|NC_015159. 291 VKMSMISSKVLFFVNPNGVTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDF---Q----VAKATADDIEKRLSYAFMLN 363 (532) Q Consensus 291 l~~~~~a~~p~~lv~~~g~~~~~~~~~~~~G~~v~g~~~~~~~~~~~~~~~~---~----~~~~~i~~~~~rI~~af~~~ 363 (532) .+...+++.- .++|=-+.+++.+ ..+-.+.+|..+.+.+ +...++ + .-...++...+.|+.. +-. T Consensus 289 ~~~~l~~~~~--~~~pp~lv~~~g~--~~~~~l~~~~~g~v~~---g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~-I~~ 360 (555) T protein:vir:17 289 SQAMVEGSAA--SAKVVFMVSPSAT--TKPQNLALAANGAIIQ---GRPDDVSVVQANKAADFRTVLEMIQKLEQR-ISD 360 (555) T ss_pred HHHHHHHHHH--HhCCceeeccccc--cCcceeecCCCceeec---CCcccceeeeccccchhhHHHHHHHHHHHH-HHH Confidence 3333333332 3444333444433 2345567776665532 222222 2 1234555666666543 444 Q ss_pred hcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHh------cCCCC---CCccccccceeecch Q lcl|NC_015159. 364 SAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQA------TSKIP---NLPKEAVEPAIATGL 434 (532) Q Consensus 364 ~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r------~g~lp---~~p~~~~~~~~v~~l 434 (532) .+.. ... .++.-+ -++|...+ ..--...|.|++.|+-..+.. .+++- .+|.--.+..-++++ T Consensus 361 aFm~-~~~-~d~~r~--TAtEV~~r-----~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~ 431 (555) T protein:vir:17 361 AFLM-LQV-RQSERT--TATEVQAT-----VQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVV 431 (555) T ss_pred HHhh-cCC-CCcccc--hHHHHHHH-----HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhcccee Confidence 5554 333 455443 23443322 112234677777776544332 11221 233322333346778 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCC--------HHH-HH--HHHHHHHHHHH Q lcl|NC_015159. 435 EALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILT--------QQD-KQ--AKMAEASTAAG 503 (532) Q Consensus 435 ~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s--------~ee-~~--~~~~q~~~~~~ 503 (532) +.+...++.+.+...++.+.++..... ...+++. +++..+++. +.. +. .+.++..++++ T Consensus 432 ~~l~~l~r~~~~~~l~~~~~~laq~~~----~p~~~d~------id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq~~~ 501 (555) T protein:vir:17 432 AGLWGVGRGQDKQQLMEFITTLAQTMG----PEIAMKY------INPTEFIKRLAAAQGIDTLQLINSPETMKQLGDQQK 501 (555) T ss_pred ehHHHHHHHHHHHHHHHHHHHHHhhcC----chhHhhc------CCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHH Confidence 888888988888888887777654331 1223322 233333321 111 11 12222222233 Q ss_pred HHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 504 MVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 504 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) +++++++..+.+++.+.+.+..++-.--+ T Consensus 502 ~~~~q~~~~~qa~~~~~~~~~~~~~~~~~ 530 (555) T protein:vir:17 502 QDMVQASLINQAGQLAKTPMAEQAMQLIQ 530 (555) T ss_pred HHHHHHHHHHHHHHHHhhhhhhhHHhccc Confidence 33333333334444444444332221111 No 166 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=75.16 E-value=0.15 Score=25.10 Aligned_cols=367 Identities=11% Similarity=0.018 Sum_probs=137.0 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccc-cccc-cchHHHHHHHHHHHHHHhh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSY-TTPW-QSIGARGLNNLASKLMLAL 78 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~-~~~~-dst~~~a~~~Laa~l~~~l 78 (532) |+=-+|. |..-+..|+. .....+..+ ....+..+..-. .... .++--.|++.+|+.+.+ T Consensus 1 Mgl~~~~----------f~~~~~~~~~-----~~~~~~~~~--~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~-- 61 (409) T protein:vir:84 1 MSLFTRI----------FSGPSEERTL-----TKISGIPSP--AEDWAMHGDRPGANSAMTLGAFYACVTLLADTVAS-- 61 (409) T ss_pred Cchhhhh----------hcCCCccccc-----ccccccccc--cchhhccCcccchhhhhccHHHHHHHHHHHHhhhh-- Confidence 5543321 1100111111 000111111 111111111100 1111 23334455555555543 Q ss_pred cCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHhhCceeeeecccc Q lcl|NC_015159. 79 FPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLVAGNVLLYIPSTE 153 (532) Q Consensus 79 tpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~~~v~~~~ 153 (532) -||.-....+..-.+ +.-+...|. +-| .+.-+...+.++..+||+.+|+.... T Consensus 62 ----lp~~~~~~~~~~~~~------------------~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~ 119 (409) T protein:vir:84 62 ----LSIDAYRKKDNVRIP------------------VSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARD 119 (409) T ss_pred ----CceEEEEecCCcccc------------------cchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEEC Confidence 244433322111000 011122232 233 33344555678889999988764221 Q ss_pred cccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEE Q lcl|NC_015159. 154 QVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSY 233 (532) Q Consensus 154 ~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~ 233 (532) ..+....+..++...+.|....++. +..+..+ T Consensus 120 -~~g~~~~L~~l~p~~v~v~~~~~~~-----------------------------------------------~~~~~~~ 151 (409) T protein:vir:84 120 -EANRPTAIMPIHPDCIHVTDAKDED-----------------------------------------------GDWIEPV 151 (409) T ss_pred -CCCceEEEEEEcCceeEEEEcCCCc-----------------------------------------------ceEEEEE Confidence 1122233333333333333222111 1001111 Q ss_pred EEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChh Q lcl|NC_015159. 234 QEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIR 313 (532) Q Consensus 234 ~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~ 313 (532) +..+|.. +. .--+++.|+....+..||.||...+...+.......+.......-...|..++.-++.++++ T Consensus 152 ~~~~g~~-------~~--~~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e 222 (409) T protein:vir:84 152 YRIDGKV-------VP--NHRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDADLTPD 222 (409) T ss_pred ecCCceE-------Ec--hhhEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHH Confidence 1112211 11 11245666666677789999999999888888888888888888888888777666666665 Q ss_pred hhccCC---------CceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--cccCCCCCCCHHHHHHH Q lcl|NC_015159. 314 RVAKAN---------TGDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--AVQRGGDRVTAEEIRYV 381 (532) Q Consensus 314 ~~~~~~---------~G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~~~~~TAtEi~~r 381 (532) ...... .|.+.. ..+++...++.. ..+.+. .+..+..+..|-++|-.-. +...+...-++.=+.+. T Consensus 223 ~~~~~~~~~~~~~~n~g~~~v-l~~g~~~~~~~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~ 300 (409) T protein:vir:84 223 QVKQTQKQWIQSHHNRRLPAV-MSAGIKWQSVSITPNESQF-LETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQ 300 (409) T ss_pred HHHHHHHHHHHHhccCCCeee-cCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHH Confidence 432111 111111 122233333332 234443 3444566778888873321 11112211222222222 Q ss_pred HHH-HHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceee--cchHHHHHHHHHHHHHHH----HH--- Q lcl|NC_015159. 382 AGE-LEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIA--TGLEALGRGHDLNKLNVF----ID--- 451 (532) Q Consensus 382 ~~E-~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v--~~l~~l~raq~~~~l~~~----~~--- 451 (532) ... ....|.|.+..++++|-.-| + .+..++.++. ...+...|+.-...+... .. T Consensus 301 ~~~f~~~~l~P~~~~ie~~l~~~L--------------~--~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R 364 (409) T protein:vir:84 301 GINFVRHTLLPWLRCIEQALDTFL--------------P--RGQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVR 364 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhc--------------c--CCCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHH Confidence 222 34457777777777653221 0 0111111110 001122222211111100 00 Q ss_pred HHHhhcch-hhhh----cCHHHH--HHHH------HHhcCCCHhH Q lcl|NC_015159. 452 YMIKLAGL-QDDD----INLLDV--KMRL------ANSLGMDTTG 483 (532) Q Consensus 452 ~laq~~p~-~~d~----id~d~~--~~~~------a~~~Gv~p~~ 483 (532) .+-.+.|. -.|. .|...+ .... -.....+-.+ T Consensus 365 ~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~gn~ 409 (409) T protein:vir:84 365 AWEDAPPIPEGDIHLQPMNFVPLGYVPPEEPAQEPQPNSATEGNK 409 (409) T ss_pred HHhCCCCCCCcceeeecccccccccCCccccCcCCCCCCccCCCC Confidence 00011110 0000 000000 0000 0000000111 No 167 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=75.03 E-value=0.15 Score=25.08 Aligned_cols=197 Identities=9% Similarity=-0.009 Sum_probs=83.7 Q ss_pred EEeeCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015159. 221 VYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKV 300 (532) Q Consensus 221 v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p 300 (532) ++...++. |..++........+....+.-++ .++.|.....+..||.+|..-++..+..-+...+-....-.-...| T Consensus 1 ~r~~~dg~-~~y~~~~~~~~~~g~~~~~~~~e--ilH~r~~~~~~~~~Glspi~~a~~~i~~~~aa~~~~~~~f~Ng~~p 77 (219) T protein:vir:98 1 MRVCKDGN-YKYLMKKSLYDTKSEIYEYNKND--VIFIKLYDPMQQVYGSPDYVGGITSALLNSDATIFRRRYYSNGAHM 77 (219) T ss_pred CceeecCe-EEEEEecceecCCceeEEecccc--EEEecCCCCCCCcceecHHHHHHHHHHHHHHHHHHHHHHHhcCCCC Confidence 44444443 22222111111111111222223 3555644444568999999988887776665555544444456677 Q ss_pred ceee-cCccccChhhhc----------cCCC-ce-ee--cCc-cccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015159. 301 LFFV-NPNGVTQIRRVA----------KANT-GD-FV--AGR-KQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLN 363 (532) Q Consensus 301 ~~lv-~~~g~~~~~~~~----------~~~~-G~-~v--~g~-~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~ 363 (532) -.++ .+++.++++... .+.+ +. ++ +|. .+++...++.. ..+.| -.+.-+..+..|-++|-.- T Consensus 78 ~gil~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~q-fle~rk~~~~eIa~~fgVP 156 (219) T protein:vir:98 78 GFILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDE-FANIKNISAQDVLTSHRFP 156 (219) T ss_pred ceEEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCHHHHH-HHHHHHhhHHHHHHHhCCC Confidence 7544 455556654321 1111 11 22 121 23444445432 23444 3334455566788888422 Q ss_pred h--cccCCCCC---CCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchH Q lcl|NC_015159. 364 S--AVQRGGDR---VTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLE 435 (532) Q Consensus 364 ~--~~~~~~~~---~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~ 435 (532) . +...+..+ -++++... .=....|.|.+.+++.++ -+.=++|+-..-.+.-...+.+. T Consensus 157 p~~lG~~~~~~~~~sn~eq~~~--~f~~~tL~P~~~~ie~~l------------n~~~~~~~~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 157 PGLSGIIPVNTAGLGDPLKIRE--AYQADEVLPLQEIIAESI------------NSDYEIKSALKVNFKQPEKRDKN 219 (219) T ss_pred HHHcccccCCCCCccCHHHHHH--HHHHHHHHHHHHHHHHHh------------hhhhcCCCccEEeecCcccccCC Confidence 1 11112222 23443322 333444555555555544 22212332111111111122222 No 168 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=73.45 E-value=0.17 Score=24.80 Aligned_cols=381 Identities=10% Similarity=0.048 Sum_probs=137.6 Q ss_pred cCHHHHHHHHHHHHHHhhhHHH--HHHHHHHhhccc----ccCCCCCcccccc-cccccchHHH-HHHHHHHHHHHhhcC Q lcl|NC_015159. 9 FAADGAAAAYNRLKNDRGAYET--RAEDCATYTIPS----VFPSATADGSTSY-TTPWQSIGAR-GLNNLASKLMLALFP 80 (532) Q Consensus 9 ~~~~~~~~r~~~lk~~R~~~e~--~w~e~~~~~~P~----~~~~~~~~~~~~~-~~~~dst~~~-a~~~Laa~l~~~ltp 80 (532) +.-+..-.+|+.+++--.+-.+ .+.....--.+. .+...+..|..-. .......+.. |++.+|+.+ +.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~Ia~~i-a~l-- 77 (432) T protein:vir:10 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAI-AAM-- 77 (432) T ss_pred CCCCcccchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHHHHHHHHhh-hhC-- Confidence 2222233334333332211110 000000000000 0000011111100 1122223333 444444433 333 Q ss_pred CCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_015159. 81 VGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLVAGNVLLYIPSTEQV 155 (532) Q Consensus 81 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~~~v~~~~~~ 155 (532) ||.-..-.+..-.+ ..+.-++..|+ +-| .+.=.+..+.++..+|||.+++..+ T Consensus 78 ---p~~~y~~~~~g~~~----------------~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~--- 135 (432) T protein:vir:10 78 ---PLTMYMRTPDGRKE----------------AVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT--- 135 (432) T ss_pred ---ceeEEEecCCCccc----------------ccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec--- Confidence 45321111100000 01112223332 233 3333556667888899998776432 Q ss_pred cCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEE Q lcl|NC_015159. 156 EGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQE 235 (532) Q Consensus 156 ~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~ 235 (532) .+....+..++...+-+..|.+|++ +|+ ++. T Consensus 136 ~g~~~~L~~l~~~~v~v~~~~~g~~--~y~-----------------------------------------------~~~ 166 (432) T protein:vir:10 136 DGRIESLQYLANDRLTITTDTKGNT--AYR-----------------------------------------------YRR 166 (432) T ss_pred CCcEEEEEEEcCCceEEEEcCCCcE--EEE-----------------------------------------------EEe Confidence 2334445555556677777776643 111 000 Q ss_pred EcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhh Q lcl|NC_015159. 236 IDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRV 315 (532) Q Consensus 236 ~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~ 315 (532) .+|... ++..++ +++.|....+| .||.||...+...+.......+.......-...|-.++.-++.++++.. T Consensus 167 ~~g~~~-----~~~~~~--iih~~~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~ 238 (432) T protein:vir:10 167 TDGQMI-----DIPKQQ--IWKIMGYSLDG-ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQY 238 (432) T ss_pred cCceEE-----EEcCcc--EEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHH Confidence 111100 000111 23333333344 7999999988888877777777766666666777777765666665543 Q ss_pred ccCC-------C-ceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--cccCC-CCCCCHHHHHHHHH Q lcl|NC_015159. 316 AKAN-------T-GDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--AVQRG-GDRVTAEEIRYVAG 383 (532) Q Consensus 316 ~~~~-------~-G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~-~~~~TAtEi~~r~~ 383 (532) .... + |.+. --.++....++.. ..+.+. .+..+..+..|-++|-.-. +...+ +..-+.+-+.+... T Consensus 239 ~~~~~~~~~~~nag~~~-vl~~g~~~~~l~~~~~d~q~-le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~ 316 (432) T protein:vir:10 239 DSFAKKVSGSVEAGRAP-LLEGGMDVKSLGLNPVDAQL-LQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQL 316 (432) T ss_pred HHHHHHHhhhhhCCCce-ecCCCceEEEccCChHHHHH-HHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHHH Confidence 2110 1 1110 0112222233332 234443 3456677788888884321 11111 11112222322222 Q ss_pred HH-HHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee-ecc---hHHHHHHHHHHHHHHH-------HH Q lcl|NC_015159. 384 EL-EDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI-ATG---LEALGRGHDLNKLNVF-------ID 451 (532) Q Consensus 384 E~-~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~-v~~---l~~l~raq~~~~l~~~-------~~ 451 (532) .. ...|.|.+.+++.++-.-| +++.......+++ ++. .+...|+.-.+.+... +- T Consensus 317 ~f~~~tl~P~~~~ie~~ln~kL-------------~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R 383 (432) T protein:vir:10 317 GFLSMTLSPWLRRIEQSIALNL-------------LSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAR 383 (432) T ss_pred HHHHHHHHHHHHHHHHHHHhhh-------------cCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHH Confidence 22 2355555555555544332 2221111111111 000 1222233222222110 00 Q ss_pred HHHhhcchh--hh-------hcCHHHHHHHHH----HhcC-CCHhHccC Q lcl|NC_015159. 452 YMIKLAGLQ--DD-------DINLLDVKMRLA----NSLG-MDTTGLIL 486 (532) Q Consensus 452 ~laq~~p~~--~d-------~id~d~~~~~~a----~~~G-v~p~~i~~ 486 (532) .+-.+.|.. .+ .+..+.+-.... ...+ -....+-+ T Consensus 384 ~~~glppi~g~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 384 EIEGLPKLGGNAAVLTVQSAMVPLDSIGLQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred HHhCCCCCCCCcceEeecCcccchhhhcccCCCCCCCCCCCcccccccC Confidence 001111110 00 011111100000 0000 01111222 No 169 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=72.67 E-value=0.18 Score=24.67 Aligned_cols=448 Identities=10% Similarity=0.039 Sum_probs=172.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccccccccc--ccchHHHHHHHHHHHHHHhh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTP--WQSIGARGLNNLASKLMLAL 78 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~--~dst~~~a~~~Laa~l~~~l 78 (532) |-+-++++.++-.+-.-..+ .. ...-+.|++ ++.+ |.+. ...+-+-...+ -|++-.-++++....+.+ T Consensus 1 ~~~~~~~~~p~~~~g~~~~~--~~-~~~~~~~~~-~e~~-~~lr---~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~-- 70 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGS--GV-VDGWTVWDP-FEQT-PELQ---WPQSVAVYSRMDNEDSRVTSLLEAISLPIRS-- 70 (469) T ss_pred CCCcccCCCCccchhhhhhc--cc-ccchhhccc-cccc-cccc---cccchHHHHHHHhhChHHHHHHHHHHHHHhc-- Confidence 55444444343222111100 00 011122222 0000 1100 00000001111 255556666666655443 Q ss_pred cCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHH------HHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_015159. 79 FPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMV------ERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPST 152 (532) Q Consensus 79 tpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~v------e~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~ 152 (532) .+|- +.+++.+ .+....+..+|... ...+...+.+..|...+.+.+.+.+.+|-++.=+... T Consensus 71 ----~~w~-v~p~~~~-------~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~ 138 (469) T protein:vir:10 71 ----TPWR-IRANGAS-------DEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYR 138 (469) T ss_pred ----CCce-EecCCCC-------HHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeee Confidence 3453 4443221 11112234443321 1122233446778888888888888889776532211 Q ss_pred ccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEE Q lcl|NC_015159. 153 EQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRS 232 (532) Q Consensus 153 ~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s 232 (532) . . .+..+|++. ..+..+ + |. ..+.+-. -+++.....++...+... ... T Consensus 139 ~--------------~----~~~~dG~~~--~~~l~~--r--p~---~~i~~~~--~~~~~~l~~~~~~~~~~~---~~~ 186 (469) T protein:vir:10 139 P--------------R----NQSPDGRFW--LRKLAP--R--PQ---WTISKFN--VAPDGGLESIEQIAPPAR---TRG 186 (469) T ss_pred c--------------c----cccCCCcee--eeeeee--c--Cc---ccceeee--eccCCceeeeeecCcccc---ccc Confidence 0 0 011123211 001000 0 00 0000000 000111111110000000 000 Q ss_pred EEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCc-cccC Q lcl|NC_015159. 233 YQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPN-GVTQ 311 (532) Q Consensus 233 ~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~-g~~~ 311 (532) ..+..+. +.......=|++.|+...+|+.||.|+...+..-..--+...+..+..+++---|..+..-+ +... T Consensus 187 ~~~~~~~------~~~~lp~~k~i~~~~~~~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~~a~~ 260 (469) T protein:vir:10 187 SLYVANI------APPEIPVNRLVVYTRNKRPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASSATDE 260 (469) T ss_pred ccccCCC------CccccccCcEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCCCCCH Confidence 0000000 00011112389999999999999999999999998888888999999999877777555532 2221 Q ss_pred ---------hhhhccCCC-ceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccC-CCCCCCHHHHHH Q lcl|NC_015159. 312 ---------IRRVAKANT-GDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQR-GGDRVTAEEIRY 380 (532) Q Consensus 312 ---------~~~~~~~~~-G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~-~~~~~TAtEi~~ 380 (532) ...+..+.. |.++|.+ ..+..+.. .++...-...|+.+.++|+++.+...+... ++......|++. T Consensus 261 ~ek~~l~~a~~~~~~g~~a~~iip~~-~~ie~~ea--~g~~~~~~~li~~~d~~Isk~iLG~tlTs~~~gGS~a~~~vh~ 337 (469) T protein:vir:10 261 DEVRKMAALARSVRGGINAGVGLAQG-QILELLGV--SGNLPDIRRAIEGHDRSIALSGLAHFLNLDGKGGSYALASVLE 337 (469) T ss_pred HHHHHHHHHHHHHhcCCceEEEccCC-ceEEEeec--CCCchHHHHHHHHHHHHHHHHHhcccccccCccchhhHHHHHH Confidence 111221222 3445532 34555442 344445678899999999999986544432 233333455544 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchh Q lcl|NC_015159. 381 VAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQ 460 (532) Q Consensus 381 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~ 460 (532) ...+. .+.--...+...+..-||.+++.+-+ |.-.+.|. ..+-+ +.. +.......++.+.++.-.. T Consensus 338 ev~~d--~~~sDa~~i~~tln~~li~~l~~lN~--g~~~~~P~----~~~~~-~e~-----~~~~~a~~i~~l~~~G~~~ 403 (469) T protein:vir:10 338 DPFTQ--AVHAYATSICRIANQHIIEDLVDINF--GVDTPAPV----LTFDP-IGS-----RQDLTAAAVKLLYDAGVFD 403 (469) T ss_pred HHHHH--HHHHHHHHHHHHHHHHHHHHHHHhcC--CCCCCccE----EEecC-CCC-----cHHHHHHHHHHHHhcCCcc Confidence 33322 22222233333333344444444321 21111111 11100 100 1111233444444443222 Q ss_pred hhhcCHHHHHHHHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 461 DDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 461 ~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) .+.+ ...++.+.+|+|+. ...+++....+..+...+........++..+..+...-..+.+.-++ T Consensus 404 ~~~~----~~~~~~e~~gip~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~d 468 (469) T protein:vir:10 404 DDPA----VKRAIRQRFNLPSE---LNDTPSAEPEEPAAVPNQSAAPARTRSSGNADARARAPKADQGVLFD 468 (469) T ss_pred Cccc----cHHHHHHHhCCCCC---CCCcccccchhcccCCCCCccccccCCCCCcccccccCCChHHhhcc Confidence 2222 34556678899643 12222211111110000000000000000000000000000111111 No 170 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=71.78 E-value=0.19 Score=24.52 Aligned_cols=364 Identities=12% Similarity=0.065 Sum_probs=138.4 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhh-HHH-HHHHHHHhhcccccCCCCCcccccc-cc-cccchHHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGA-YET-RAEDCATYTIPSVFPSATADGSTSY-TT-PWQSIGARGLNNLASKLML 76 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~-~e~-~w~e~~~~~~P~~~~~~~~~~~~~~-~~-~~dst~~~a~~~Laa~l~~ 76 (532) |.=-.+. .+|+. .-. .+..... ++|... +..+..-. .. +=.++--.|++.+|+.+.+ T Consensus 1 Mg~f~~~---------------~~r~~~~~~~~~~~~~~-~~~~~~---~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~ 61 (416) T protein:vir:81 1 MGIFYKN---------------EKRDLQYNEDDLQMMVQ-TLPGFQ---GTKLRQYKDIEAIRHSDIFTAVMMIASDLAR 61 (416) T ss_pred CCccccc---------------ccccccCCCcchhHHHH-Hhcccc---ccCccccchhhhhcchHHHHHHHHHHHhhcc Confidence 4432210 01111 000 0111111 122211 11111100 01 1123333466666655544 Q ss_pred hhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHhhCceeeeecc Q lcl|NC_015159. 77 ALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLVAGNVLLYIPS 151 (532) Q Consensus 77 ~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~~~v~~ 151 (532) -|| ++.-... .. . +..++..|+ +-| .+.-....+.++..+|||.+++.. T Consensus 62 ------~p~-~~~~~~~-~~-----------~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r 115 (416) T protein:vir:81 62 ------MPI-RVTVNGQ-IN-----------Y-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITR 115 (416) T ss_pred ------Cce-EEecCcc-cc-----------c-------cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEE Confidence 233 3321110 00 0 111222332 222 233456677888889999888754 Q ss_pred cccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEE Q lcl|NC_015159. 152 TEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFR 231 (532) Q Consensus 152 ~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~ 231 (532) +. .+....+..+|...+-+..|.+|++- . T Consensus 116 ~~--~G~~~~L~~i~~~~v~v~~~~~g~~~-------------------------------------------------~ 144 (416) T protein:vir:81 116 DK--TGEPMNLTFRKTSEIELKSDARGRLY-------------------------------------------------Y 144 (416) T ss_pred CC--CCcEEEEEEEcCceeEEEECCCccEE-------------------------------------------------E Confidence 32 23334444555566666666666432 1 Q ss_pred EEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCcccc- Q lcl|NC_015159. 232 SYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVT- 310 (532) Q Consensus 232 s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~- 310 (532) .++.+++... .....++.+ -+++.|+...+| .||.||...+...+.......+.......-...|..++.-++.+ T Consensus 145 ~~~~~~~~~~-~~~~~~~~~--evihir~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~ 220 (416) T protein:vir:81 145 FHQRIDSNGN-NIERNVKFE--DMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLD 220 (416) T ss_pred EEEEecCCCc-eeEEEEccc--cEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCC Confidence 1111111110 000111111 234556554444 79999999999888888888888888777778888766544433 Q ss_pred Chhhh-------ccCCC-----ceeecCccccccccccCCc-cchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCCCCHH Q lcl|NC_015159. 311 QIRRV-------AKANT-----GDFVAGRKQDVEVFQLEKY-NDFQVAKATADDIEKRLSYAFMLNS-AVQRGGDRVTAE 376 (532) Q Consensus 311 ~~~~~-------~~~~~-----G~~v~g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~~~TAt 376 (532) +.+.. ...-. |.+.. -.+++...++... .+.+ ..+.....+..|-.+|-.-. +...+....+.+ T Consensus 221 ~~~~~~~~~~~~~~~~~g~~nag~~~v-l~~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~ 298 (416) T protein:vir:81 221 NKKARDRAREEFHKSFSGTKQAGKVVV-LDESMTFDQLEVDTEVLK-LIRENKSSTREIAGVFGIPLHKFGIETANMSIT 298 (416) T ss_pred CHHHHHHHHHHHHHHhcCccccCceee-cCCCceeEeccCCHHHHH-HHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHH Confidence 33321 11001 11111 1122233333322 2233 23444556777888874321 111122222222 Q ss_pred HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceee--cchHHHHHHHHHHHHHHH----- Q lcl|NC_015159. 377 EIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIA--TGLEALGRGHDLNKLNVF----- 449 (532) Q Consensus 377 Ei~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v--~~l~~l~raq~~~~l~~~----- 449 (532) |. .......|-|.+..++.|+-.-| ++.-.+-.++.++. ...+...|+.-.+.+... T Consensus 299 ~~---~~~~~~~l~P~~~~ie~~ln~~l-------------~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~ 362 (416) T protein:vir:81 299 DA---NLDYLSTLKPYITCVCAELNFKF-------------NDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNI 362 (416) T ss_pred HH---HHHHHHHHHHHHHHHHHHHhhhc-------------cccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCH Confidence 22 11223344455555554444332 22211111221110 111233333322222210 Q ss_pred --HHHHHhhcchh---h-------hhcCHHHHHHHHH---------HhcCCCHhH Q lcl|NC_015159. 450 --IDYMIKLAGLQ---D-------DDINLLDVKMRLA---------NSLGMDTTG 483 (532) Q Consensus 450 --~~~laq~~p~~---~-------d~id~d~~~~~~a---------~~~Gv~p~~ 483 (532) +-.+-.+.|.. . ..+..| .++.+- ..-|=+-.+ T Consensus 363 NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~-~~~~~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:81 363 DEIRQRDGLAPIPGGNGSIHRVDLNHVNIE-LVDEYQMNKSRATDKKLKGGEENE 416 (416) T ss_pred HHHHHHhCCCCCCCCCcceEeecccccccc-cccccCcccccccccccCCCCCCC Confidence 00111222210 0 011111 111100 001112222 No 171 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=71.78 E-value=0.19 Score=24.52 Aligned_cols=364 Identities=12% Similarity=0.065 Sum_probs=138.4 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhh-HHH-HHHHHHHhhcccccCCCCCcccccc-cc-cccchHHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGA-YET-RAEDCATYTIPSVFPSATADGSTSY-TT-PWQSIGARGLNNLASKLML 76 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~-~e~-~w~e~~~~~~P~~~~~~~~~~~~~~-~~-~~dst~~~a~~~Laa~l~~ 76 (532) |.=-.+. .+|+. .-. .+..... ++|... +..+..-. .. +=.++--.|++.+|+.+.+ T Consensus 1 Mg~f~~~---------------~~r~~~~~~~~~~~~~~-~~~~~~---~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~ 61 (416) T protein:vir:45 1 MGIFYKN---------------EKRDLQYNEDDLQMMVQ-TLPGFQ---GTKLRQYKDIEAIRHSDIFTAVMMIASDLAR 61 (416) T ss_pred CCccccc---------------ccccccCCCcchhHHHH-Hhcccc---ccCccccchhhhhcchHHHHHHHHHHHhhcc Confidence 4432210 01111 000 0111111 122211 11111100 01 1123333466666655544 Q ss_pred hhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHhhCceeeeecc Q lcl|NC_015159. 77 ALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLVAGNVLLYIPS 151 (532) Q Consensus 77 ~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~~~v~~ 151 (532) -|| ++.-... .. . +..++..|+ +-| .+.-....+.++..+|||.+++.. T Consensus 62 ------~p~-~~~~~~~-~~-----------~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r 115 (416) T protein:vir:45 62 ------MPI-RVTVNGQ-IN-----------Y-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITR 115 (416) T ss_pred ------Cce-EEecCcc-cc-----------c-------cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEE Confidence 233 3321110 00 0 111222332 222 233456677888889999888754 Q ss_pred cccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEE Q lcl|NC_015159. 152 TEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFR 231 (532) Q Consensus 152 ~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~ 231 (532) +. .+....+..+|...+-+..|.+|++- . T Consensus 116 ~~--~G~~~~L~~i~~~~v~v~~~~~g~~~-------------------------------------------------~ 144 (416) T protein:vir:45 116 DK--TGEPMNLTFRKTSEIELKSDARGRLY-------------------------------------------------Y 144 (416) T ss_pred CC--CCcEEEEEEEcCceeEEEECCCccEE-------------------------------------------------E Confidence 32 23334444555566666666666432 1 Q ss_pred EEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCcccc- Q lcl|NC_015159. 232 SYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVT- 310 (532) Q Consensus 232 s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~- 310 (532) .++.+++... .....++.+ -+++.|+...+| .||.||...+...+.......+.......-...|..++.-++.+ T Consensus 145 ~~~~~~~~~~-~~~~~~~~~--evihir~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~ 220 (416) T protein:vir:45 145 FHQRIDSNGN-NIERNVKFE--DMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLD 220 (416) T ss_pred EEEEecCCCc-eeEEEEccc--cEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCC Confidence 1111111110 000111111 234556554444 79999999999888888888888888777778888766544433 Q ss_pred Chhhh-------ccCCC-----ceeecCccccccccccCCc-cchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCCCCHH Q lcl|NC_015159. 311 QIRRV-------AKANT-----GDFVAGRKQDVEVFQLEKY-NDFQVAKATADDIEKRLSYAFMLNS-AVQRGGDRVTAE 376 (532) Q Consensus 311 ~~~~~-------~~~~~-----G~~v~g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~~~TAt 376 (532) +.+.. ...-. |.+.. -.+++...++... .+.+ ..+.....+..|-.+|-.-. +...+....+.+ T Consensus 221 ~~~~~~~~~~~~~~~~~g~~nag~~~v-l~~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~ 298 (416) T protein:vir:45 221 NKKARDRAREEFHKSFSGTKQAGKVVV-LDESMTFDQLEVDTEVLK-LIRENKSSTREIAGVFGIPLHKFGIETANMSIT 298 (416) T ss_pred CHHHHHHHHHHHHHHhcCccccCceee-cCCCceeEeccCCHHHHH-HHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHH Confidence 33321 11001 11111 1122233333322 2233 23444556777888874321 111122222222 Q ss_pred HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceee--cchHHHHHHHHHHHHHHH----- Q lcl|NC_015159. 377 EIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIA--TGLEALGRGHDLNKLNVF----- 449 (532) Q Consensus 377 Ei~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v--~~l~~l~raq~~~~l~~~----- 449 (532) |. .......|-|.+..++.|+-.-| ++.-.+-.++.++. ...+...|+.-.+.+... T Consensus 299 ~~---~~~~~~~l~P~~~~ie~~ln~~l-------------~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~ 362 (416) T protein:vir:45 299 DA---NLDYLSTLKPYITCVCAELNFKF-------------NDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNI 362 (416) T ss_pred HH---HHHHHHHHHHHHHHHHHHHhhhc-------------cccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCH Confidence 22 11223344455555554444332 22211111221110 111233333322222210 Q ss_pred --HHHHHhhcchh---h-------hhcCHHHHHHHHH---------HhcCCCHhH Q lcl|NC_015159. 450 --IDYMIKLAGLQ---D-------DDINLLDVKMRLA---------NSLGMDTTG 483 (532) Q Consensus 450 --~~~laq~~p~~---~-------d~id~d~~~~~~a---------~~~Gv~p~~ 483 (532) +-.+-.+.|.. . ..+..| .++.+- ..-|=+-.+ T Consensus 363 NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~-~~~~~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:45 363 DEIRQRDGLAPIPGGNGSIHRVDLNHVNIE-LVDEYQMNKSRATDKKLKGGEENE 416 (416) T ss_pred HHHHHHhCCCCCCCCCcceEeecccccccc-cccccCcccccccccccCCCCCCC Confidence 00111222210 0 011111 111100 001112222 No 172 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=67.85 E-value=0.25 Score=23.92 Aligned_cols=397 Identities=13% Similarity=0.086 Sum_probs=139.9 Q ss_pred ccCCC-CCcccccccccccchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 43 VFPSA-TADGSTSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMN 121 (532) Q Consensus 43 ~~~~~-~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~ 121 (532) .|... +-..-.+...+...+-+.++ . .-....|+.--.+-..+.+. -....++..|=+.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~--------~--~~~~~~~~~pp~~~~~La~~-------~~~n~~v~scI~~ia~ 63 (540) T protein:vir:41 1 MFNYHLSIKSLEKYRAIKGDTDSQAL--------K--EDRFEEYVEPKVHPLVLLSL-------LQVNPYHASACSIKAN 63 (540) T ss_pred CCCcccChhhccchhhhhcccccccc--------c--cCCCCccccCCCCHHHHHHH-------HHhcHHHHHHHHHHHH Confidence 22111 11111122222222221111 0 00112333211111112111 1112233344444444 Q ss_pred HHHhc--------------------CChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEec--ceEEEeeCCCCC Q lcl|NC_015159. 122 YMESN--------------------SFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKL--HNFVVERDAYDN 179 (532) Q Consensus 122 ~l~~s--------------------nf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl--~~~~v~~d~~G~ 179 (532) .+... +++.-+...+.|+..+|||.+++..+. .+.....+|| ..+-+.+|..+- T Consensus 64 ~ia~~~~~i~~~~~~~~~~lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~----~G~~~~L~~i~~~~V~v~~~~~~~ 139 (540) T protein:vir:41 64 DILRTGYLIDGDDGGVEELLRACRPSFEFILLQALEDLQVFNYCTLEVVRDD----QGEPVRLDYIPAHTVRVHRDGSRY 139 (540) T ss_pred HHhcCCceEecCccchhhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECC----CCcEEEEEEeCCcceEEeEcCcee Confidence 44333 234556677788999999988875432 2223344444 333333322221 Q ss_pred eEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcccccccccCccccCceEEEE Q lcl|NC_015159. 180 VLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVR 259 (532) Q Consensus 180 vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~R 259 (532) +.. .+....+|...+ +.+ ..+....+. ....+...-.++.| T Consensus 140 ~~~----------------------------~d~~~~~~~~~~----~~~-~~~~~~~g~------~~~~~~~~eViHir 180 (540) T protein:vir:41 140 MQT----------------------------WDGIHVTYFKDY----RYE-GEVNPDNGE------DQDGVGANEIIFIH 180 (540) T ss_pred Eee----------------------------ecCceeeeeecc----ccc-ceeeccccc------cceeecccceEEec Confidence 000 000000000000 000 000001111 01112223456777 Q ss_pred eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhh-------------hc--------c- Q lcl|NC_015159. 260 LIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRR-------------VA--------K- 317 (532) Q Consensus 260 w~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~-------------~~--------~- 317 (532) +....+..||.+|..-++..+.......+.....-.-...|..++.-.|.+..+. +. . T Consensus 181 ~~~~~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~ 260 (540) T protein:vir:41 181 LPSPICSYYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYL 260 (540) T ss_pred CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccc Confidence 7766778999999999998888888777777776666677776554333322211 10 0 Q ss_pred -CCCce-ee-c---CccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhhccc--CCCC---CCCHHHHHHHHHHH Q lcl|NC_015159. 318 -ANTGD-FV-A---GRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNSAVQ--RGGD---RVTAEEIRYVAGEL 385 (532) Q Consensus 318 -~~~G~-~v-~---g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~--~~~~---~~TAtEi~~r~~E~ 385 (532) ...|. ++ . +..+++...++.. ..+.+ ..+..+..++.|-.+|-...... .+.. .=++++.... =. T Consensus 261 ~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~q-fle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~--f~ 337 (540) T protein:vir:41 261 KEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELS-FREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRT--YY 337 (540) T ss_pred cccccceEEEecCCCcccceeEEecccchhHHH-HHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHH--HH Confidence 01121 11 1 1234455555543 23444 44556777888999985332111 1111 1233433222 12 Q ss_pred HHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcC Q lcl|NC_015159. 386 EDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDIN 465 (532) Q Consensus 386 ~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id 465 (532) ...|.|...++ ...+.+ .+++..+. .+.+.+ ....+.|......+ ..+.+. -.+- T Consensus 338 ~~tL~P~~~~i------------e~~ln~-~L~~~~~~-~~~i~f--~~~~ll~~D~~~~~----~~lv~~-----G~lT 392 (540) T protein:vir:41 338 ESVVRPQQEIV------------SSVLTD-FIQLKLDP-GARFVF--NEEILMESEFVHNY----ALLVQC-----GVLT 392 (540) T ss_pred HHHHHHHHHHH------------HHHHHH-hhhhccCC-ceEEEe--cchhhcchHHHHHH----HHHHhC-----CCCC Confidence 23344444444 433332 23333322 233333 22233332211111 111110 0123 Q ss_pred HHHHHHHHHHhcCCCHhH--ccCC----HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhccc-------ccCCCCC Q lcl|NC_015159. 466 LLDVKMRLANSLGMDTTG--LILT----QQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQ-------QAGLPTQ 532 (532) Q Consensus 466 ~d~~~~~~a~~~Gv~p~~--i~~s----~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~g~~~~ 532 (532) .+++-+.+ .|++|.. ++.. ..++.. ++...+..+........+...+......++ ...+.+. T Consensus 393 ~NE~Re~L---~g~e~gdd~~l~p~n~~~~~~~~--~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (540) T protein:vir:41 393 PSEVREKL---FGLDGGPDMFMVPSSIGKSAMKR--QKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLEDKKKKIDEV 467 (540) T ss_pred HHHHHHHh---CcCcCCCcccccccccccccccc--cccccCCCCccccccccchhcccccCcccccccccccccccccc Confidence 33322111 3443321 1100 000100 000000000000000000000000000000 0000000 No 173 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=67.00 E-value=0.26 Score=23.80 Aligned_cols=378 Identities=10% Similarity=0.067 Sum_probs=129.8 Q ss_pred hhcccccCCCCCcc-cccccccccchHHHHHHHHHHHHHHhhcCCCCCccccCCCh-----HHHhhhccC----hhHHHH Q lcl|NC_015159. 38 YTIPSVFPSATADG-STSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSE-----LEVKQSITS----PEELTE 107 (532) Q Consensus 38 ~~~P~~~~~~~~~~-~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d-----~~~~~~~~~----~~~~~~ 107 (532) ..+.. +..-+.+. ... .+||+-...- ..+...... -..... T Consensus 1 ~~~~~-~~~~~~p~~~e~----------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~ 51 (518) T protein:vir:10 1 MLLAN-GQTLSAPAMAEL----------------------------SPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPW 51 (518) T ss_pred CcccC-ceeecCchhhhh----------------------------hhhhhcccccccccceecccccchhhHHHhhhHH Confidence 11110 00001110 000 1222111000 000000000 000001 Q ss_pred HHHHHHHHH-----------------------HHHHHHHHhcCC----hHHHHHHHHHHHhhCceeeeecccccccCCcc Q lcl|NC_015159. 108 IATGLAMVE-----------------------RICMNYMESNSF----RPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSN 160 (532) Q Consensus 108 v~~~L~~ve-----------------------~~~~~~l~~snf----~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~ 160 (532) |..-.+.+. ..++..+.+=|- +.-...++.+|..+||+++|+..+. .+... T Consensus 52 V~acV~~IA~~iA~lpl~l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~--~G~~~ 129 (518) T protein:vir:10 52 VRTVIAKRAQALARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK--SGTPE 129 (518) T ss_pred HHHHHHHHHHhhccCceEEEEEcCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECC--CCcEE Confidence 111111111 111222222232 2234555677888999988875432 12222 Q ss_pred eEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcc Q lcl|NC_015159. 161 APKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEI 240 (532) Q Consensus 161 ~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~ 240 (532) .+..++.+.+.+..|..+.. +..++...+.. T Consensus 130 ~L~~l~p~~v~v~~~~~~~~-------------------------------------------------~~y~~~~~~~~ 160 (518) T protein:vir:10 130 KLMPMHPSRVAIKRNSRTGR-------------------------------------------------YEYYFQAGAGV 160 (518) T ss_pred EEEEECCCceEEEEcCCCCE-------------------------------------------------EEEEEEecCCc Confidence 33333334444444332111 11111111100 Q ss_pred cccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCC- Q lcl|NC_015159. 241 VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKAN- 319 (532) Q Consensus 241 ~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~- 319 (532) ......+ ..-=+++.|+...+|-.||.||..-+...+.....+.+.......-...|..++.-++.++.+...... T Consensus 161 -~~~~~~~--~~~eViHir~~s~dg~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~k~ 237 (518) T protein:vir:10 161 -GTQLVSF--ADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLRE 237 (518) T ss_pred -cceEEEe--cCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHH Confidence 0000001 111245556555667679999999888888888888888888777788888777666666655431111 Q ss_pred ----------C-ceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHH Q lcl|NC_015159. 320 ----------T-GDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELED 387 (532) Q Consensus 320 ----------~-G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~ 387 (532) + |.+.. -.+++...++.. ..+.|. .+..+..+..|-++|-.........++-|-.=+.+.. T Consensus 238 ~~~~~~~G~~nag~v~v-L~~G~~~~~l~~s~~D~q~-le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~----- 310 (518) T protein:vir:10 238 QFDRAHSGSSNTGKTMV-VEEGMEPIPLQLTAVEMQF-IEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM----- 310 (518) T ss_pred HHHHHhcCccccCcceE-cCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHH----- Confidence 1 11111 112233333332 234443 3444566677888884321111111111221111111 Q ss_pred HhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHH Q lcl|NC_015159. 388 TLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLL 467 (532) Q Consensus 388 ~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d 467 (532) ..+...-+.|++.++...+.+ .++++.... ..+.+ -++.|-|. +.+....++..+-+. -.+..+ T Consensus 311 ------~~f~~~tL~P~l~~ie~~ln~-~L~~~~~~~-~~~~f--d~~~llr~-D~~~r~~~~~~~~~~-----G~lT~N 374 (518) T protein:vir:10 311 ------RAFYRDTMAIPIARIQSAMDK-YVGQYWVRK-NRMKF--DIDDVIQP-DWEAKSESTQKMVNS-----GVATPN 374 (518) T ss_pred ------HHHHHHHHHHHHHHHHHHHHH-hhcccccCC-ceEEE--echhhhcc-CHHHHHHHHHHHHhC-----CCcCHH Confidence 112334466666666655543 344443321 22222 12333221 111112222221111 012223 Q ss_pred HHHHHHHHhcCCCHhH-------ccCCH-HHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhh----cccccCCCC-C Q lcl|NC_015159. 468 DVKMRLANSLGMDTTG-------LILTQ-QDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAM----MQQQAGLPT-Q 532 (532) Q Consensus 468 ~~~~~~a~~~Gv~p~~-------i~~s~-ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~g~~~-~ 532 (532) ++ -+.+|.+|.. ++.+. ..+.+. .... .. .+..+.+...+.... -++.++.|. + T Consensus 375 E~----R~~~Gl~pie~~~gD~~~~~~n~~pl~~~--~~~~-~~----g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (518) T protein:vir:10 375 EG----REIMGLPRSDDPKADELYANSALQPLGAT--PDGA-VE----GEEAPAPKRPASTPVASLDQSPPTSVPGLS 441 (518) T ss_pred HH----HHHhCCCCCCCCCCCeeeecccceecccc--cccc-cC----CCCCCCCCCCCccccccccccccccCCCCC Confidence 21 1233443321 01000 000000 0000 00 000000000000000 000111110 0 No 174 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=60.43 E-value=0.37 Score=22.93 Aligned_cols=432 Identities=11% Similarity=0.063 Sum_probs=141.8 Q ss_pred CCCCCCCccCHHHHH--HHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccc--cchHHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAA--AAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPW--QSIGARGLNNLASKLML 76 (532) Q Consensus 1 m~~~~~~~~~~~~~~--~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~--dst~~~a~~~Laa~l~~ 76 (532) |.+.++.-+..-.-. .-|.+ ..-.+......+. +.|.--++.. -+.+-+.| ....-.|+++.|..+.+ T Consensus 31 ~~~~~~~~~~k~~~~~~~~~~~----~~~~~~~~~~g~~-~~~~~~~~~~---l~~l~~~~~~npiv~~~I~~~a~~ia~ 102 (547) T protein:vir:63 31 IQQREQEQISKAMNNKEVAYSQ----PVIGSMSANPGFK-TKPSIRNNQD---LHGVLKKFGGNIILNAIINTRSNQVSM 102 (547) T ss_pred hhhhhHHHHHHhhcccchhhhc----hhhheeecccccc-cCCccCChhH---HHHHHHHhhcCHHHHHHHHHHHHHHhh Confidence 333321110000000 00000 0000000001100 0111000000 00111111 22334455555544443 Q ss_pred hh-----cCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHh----cCChHHHHHHHHHHHhhCceee Q lcl|NC_015159. 77 AL-----FPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMES----NSFRPTLHAAIKQLLVAGNVLL 147 (532) Q Consensus 77 ~l-----tpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~----snf~~~~~~~~~dl~~~G~~~~ 147 (532) .. +..+-. |.+.+.+.+-............++.+|..+- ... .+|..-+...+.++..+||+++ T Consensus 103 ~~~~~~~~~~~~~-~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn------~~~~p~~~s~~~f~~~lv~d~ll~Gn~~~ 175 (547) T protein:vir:63 103 YCKPARHSEKGVG-FEVRLKDLDKKPTSHDEATIKRIESFIEKTG------VDNDINRDSFSSFVKKIVRDTYMYDQVNF 175 (547) T ss_pred hhhhhhhhccCCC-ceeEecccccccChhhHHHHHHHHHHHHhhC------CCCCCccchHHHHHHHHHHHHHhhCCEEE Confidence 22 221222 3333322211100000011112222221110 001 1233445556788889999988 Q ss_pred eecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCC Q lcl|NC_015159. 148 YIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEA 227 (532) Q Consensus 148 ~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~ 227 (532) ++..+. .+....+..++.....+..+.+|.+.. + T Consensus 176 ~i~rd~--~G~~~~L~~l~p~~V~~~~~~~g~~~~-------------------------------------------~- 209 (547) T protein:vir:63 176 EKVFNR--NQSMVRFVAKDPTTIFFATTADGKIPD-------------------------------------------N- 209 (547) T ss_pred EEEECC--CCcEEEEEEecCceeEEEECCcccccc-------------------------------------------C- Confidence 775432 233334444444455555555543211 0 Q ss_pred CeEEEEEEEcCcccccccccCccccCceEEEEeeecC---CCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee Q lcl|NC_015159. 228 MVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMP---NEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFV 304 (532) Q Consensus 228 ~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~---g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv 304 (532) .+..++..++.... .+..++ +++.|.+... ...||.+|...+...+.......+.......-...|..++ T Consensus 210 -~~~y~~~~~~~~~~----~~~~~e--iih~r~n~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL 282 (547) T protein:vir:63 210 -GNRFVQVIDQKIVA----TFNARE--MAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGIL 282 (547) T ss_pred -ceEEEEEcCCcEEE----Eecccc--EEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEE Confidence 01112222222110 111112 3334433222 2579999999999999988888888888777777777443 Q ss_pred --cCccccChhhh-------cc---C-CC-ce--eecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--c Q lcl|NC_015159. 305 --NPNGVTQIRRV-------AK---A-NT-GD--FVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--A 365 (532) Q Consensus 305 --~~~g~~~~~~~-------~~---~-~~-G~--~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~ 365 (532) +.+..++.+.+ .. + .+ |. ++.+ +++...++.. ..+.+ ..+..+..+..|-++|-.-. + T Consensus 283 ~~~~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~vl~~--~g~~~~~l~~~~~d~q-fle~~~~~~~~Ia~afgVPP~~l 359 (547) T protein:vir:63 283 QIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVSA--EDVKFVNMTPSARDME-FEKWLNYLINVISALYGIDPAEI 359 (547) T ss_pred EecCCCCCCHHHHHHHHHHHHHHhcCcccccccccccC--CCceEEEcCCChhHHH-HHHHHHHHHHHHHHHhCCCHHHc Confidence 43433444322 11 1 11 11 2211 2344444432 33444 33456667788989984321 1 Q ss_pred c-cCC-------CCCCCHHHHHHHHHH-HHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHH Q lcl|NC_015159. 366 V-QRG-------GDRVTAEEIRYVAGE-LEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEA 436 (532) Q Consensus 366 ~-~~~-------~~~~TAtEi~~r~~E-~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~ 436 (532) . ..+ ...+|-.=+.+.... ....|.|.+.+++.+|-.- +++... ..+...+. .+.. T Consensus 360 G~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~ie~~ln~~-------------L~~~~~-~~~~~~f~-~~~~ 424 (547) T protein:vir:63 360 NIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKH-------------IVAEFG-DKYTFQFV-GGDI 424 (547) T ss_pred CcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhh-------------cccccC-CceEEEee-cccc Confidence 1 111 111221112221111 2234445554444443222 333221 22333332 2222 Q ss_pred HHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHh-----Hcc------CCHHHHHHHHHHHH-HHHHH Q lcl|NC_015159. 437 LGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTT-----GLI------LTQQDKQAKMAEAS-TAAGM 504 (532) Q Consensus 437 l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~-----~i~------~s~ee~~~~~~q~~-~~~~~ 504 (532) ....... ++...+. + + .+-.++ +-+.+|.+|. .++ ...+..+...-+.+ ++... T Consensus 425 ~~~~~~~-~~~~~~~--~---g----~lT~NE----~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (547) T protein:vir:63 425 KSELESV-KILAEKA--K---V----AMTVNE----VRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNL 490 (547) T ss_pred ccHHHHH-HHHHHHh--C---C----CcCHHH----HHHHhCCCCCCCCCceeecccccccccccccccCCccccchhhc Confidence 2222111 1111110 0 0 122222 1223344331 011 00000000000000 00000 Q ss_pred HHHHHhhhHHHHH--HHHhhcccccCCCCC Q lcl|NC_015159. 505 VTAGQQMGAAGGQ--AAAAMMQQQAGLPTQ 532 (532) Q Consensus 505 ~~~~~~~~~~~~~--~~~~~~~~~~g~~~~ 532 (532) .+..++.+.+.+. ......++..|-.+. T Consensus 491 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (547) T protein:vir:63 491 QMLQEQTGNRVSTDVEDIPDGKDTTGDIGK 520 (547) T ss_pred cccccccCCCCCCCCCCCCCCcccCCCcCc Confidence 0011111100000 000000011111100 No 175 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=59.69 E-value=0.39 Score=22.84 Aligned_cols=415 Identities=10% Similarity=0.066 Sum_probs=137.7 Q ss_pred CCCC------CCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCC-c-ccccccccccchHHHHHHHHHH Q lcl|NC_015159. 1 MAEV------EKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATA-D-GSTSYTTPWQSIGARGLNNLAS 72 (532) Q Consensus 1 m~~~------~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~-~-~~~~~~~~~dst~~~a~~~Laa 72 (532) |.+. +++.+..+- +-.+- ..-+.++. -|....+.-+ . --+.... ..+...|+++.+. T Consensus 42 ~~~~~~~~~~~~~~a~~~~----~~~~~-------~~~~~~~~--~~~~~~~~~~l~~~l~~~~~--n~i~~~~I~t~~~ 106 (563) T protein:vir:99 42 EYQDLTKSLYGQQQAYAEP----FIEMM-------DTNPEFRD--KRSYMKNEHNLHDVLKKFGN--NPILNAIILTRSN 106 (563) T ss_pred hHHHHHhhhccCCCcchhh----hHhhh-------cccccccc--cccCCCCcccHHHHHHHhhc--chHHHHHHHHHHH Confidence 2221 111110111 11110 00011111 1111100000 0 0000000 1112222333332 Q ss_pred HHHHhhcCC-----CCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHH-----HhcCChHHHHHHHHHHHhh Q lcl|NC_015159. 73 KLMLALFPV-----GSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYM-----ESNSFRPTLHAAIKQLLVA 142 (532) Q Consensus 73 ~l~~~ltpp-----~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-----~~snf~~~~~~~~~dl~~~ 142 (532) -+...-.+. ...| .+.+-+.+... ... ++. -...+++.+.... ...+|..-+..++.++.++ T Consensus 107 ~vA~~~~~~~~~~~~~~~-~i~l~~~~~~~---~~~---~~~-~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~ 178 (563) T protein:vir:99 107 QVAMYCQPARYSEKGLGF-EVRLRDLDAEP---GRK---EKE-EMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIY 178 (563) T ss_pred HHHHHhhhhhhhcccccc-eeEEeecCCCc---chh---hhh-hhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhc Confidence 222211110 1111 11111111000 000 000 0111112221111 1234555666778899999 Q ss_pred CceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEE Q lcl|NC_015159. 143 GNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVY 222 (532) Q Consensus 143 G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~ 222 (532) |||.+|+-..-...+..+.+..++...+.+..+.+|.+-.-.+ T Consensus 179 Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~------------------------------------- 221 (563) T protein:vir:99 179 DQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGK------------------------------------- 221 (563) T ss_pred CCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccce------------------------------------- Confidence 9998875321111122333333334556666666654321111 Q ss_pred eeCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeec---CCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015159. 223 RDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKM---PNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSK 299 (532) Q Consensus 223 ~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~---~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~ 299 (532) ..++..+|..... +..++ .+.++.... ....||.+|..-+...+.....+.+.......-... T Consensus 222 --------~y~~~~~g~~~~~----~~~~e--vI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~ 287 (563) T protein:vir:99 222 --------RFVQVVDKRVVAS----FTSRE--LAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGT 287 (563) T ss_pred --------eEEEEeCCceeEE----ecCcc--eEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCC Confidence 1122222221100 00011 122222221 225799999999999999888888888888777788 Q ss_pred Cceee--cCccccChhhh-------c---cC-CC-ceeecCccccccccccCCc-cchhHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_015159. 300 VLFFV--NPNGVTQIRRV-------A---KA-NT-GDFVAGRKQDVEVFQLEKY-NDFQVAKATADDIEKRLSYAFMLNS 364 (532) Q Consensus 300 p~~lv--~~~g~~~~~~~-------~---~~-~~-G~~v~g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~ 364 (532) |..++ +.+..++.+.. . .+ .+ |.+..-..+++...++... .+.+ ..+..+..+..|-++|-... T Consensus 288 p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~q-fle~~~~~~~~Ia~afgVPp 366 (563) T protein:vir:99 288 TRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQ-FEKWLNYLINIISALYGIDP 366 (563) T ss_pred CceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHH-HHHHHHHHHHHHHHHhCCCH Confidence 88554 32322343322 1 11 11 1110011233444444432 2343 34566777888999984322 Q ss_pred cccCC---C-----------CCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee Q lcl|NC_015159. 365 AVQRG---G-----------DRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI 430 (532) Q Consensus 365 ~~~~~---~-----------~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~ 430 (532) ..... + .+-++++.. ..=....|.|.+.++..+|-.-|+. .. ...+...+ T Consensus 367 ~~lG~~~~~~~~~~~~~ss~~~sn~e~~~--~~f~~~tL~P~l~~ie~~ln~~L~~-------------~~-~~~~~~~f 430 (563) T protein:vir:99 367 AEIGFPNRGGATGSKGGSTLNEADPGKKQ--QQSQNKGLQPLLRFIEDLVNRHIIS-------------EY-GDKYTFQF 430 (563) T ss_pred HHccccccccccccccccchhhccHHHHH--HHHHHHHHHHHHHHHHHHHHhhhch-------------hc-ccccEEEe Confidence 11111 0 111222221 1223445666666666665444432 11 11122232 Q ss_pred ecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHh----------HccCCHHHHHHHHHHHHH Q lcl|NC_015159. 431 ATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTT----------GLILTQQDKQAKMAEAST 500 (532) Q Consensus 431 v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~----------~i~~s~ee~~~~~~q~~~ 500 (532) .. .+...|+...+ +...++ +.+ +-.++ +-+.+|.+|- .+....+ ..+. ... T Consensus 431 ~r-~D~~~~~e~~~-~~~~~~--~G~-------lT~NE----~R~~~gl~Pi~gGD~~~~~~~~~~~~~-~~~~---~~~ 491 (563) T protein:vir:99 431 VG-GDTKSATDKLN-ILKLET--QIF-------KTVNE----AREEQGKKPIEGGDIILDASFLQGTAQ-LQQD---KQY 491 (563) T ss_pred cc-CCHHHHHHHHH-HHHHhc--CCc-------cCHHH----HHHHhCCCCCCCcceeecccccccccc-cccc---cCC Confidence 21 23334443322 111111 000 11111 1111233221 0110000 0000 000 Q ss_pred HHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 501 AAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 501 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) ..+.++ ..............+.|++ T Consensus 492 ~~~~~~-------~~~~~~~~~~~~~~~~~~~ 516 (563) T protein:vir:99 492 NDGKQK-------ERLQMMMSLLEGDNDDSEE 516 (563) T ss_pred Cccccc-------hhhhhcccccCCCCCCCCC Confidence 000000 0000000111111111111 No 176 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=59.69 E-value=0.39 Score=22.84 Aligned_cols=415 Identities=10% Similarity=0.066 Sum_probs=137.7 Q ss_pred CCCC------CCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCC-c-ccccccccccchHHHHHHHHHH Q lcl|NC_015159. 1 MAEV------EKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATA-D-GSTSYTTPWQSIGARGLNNLAS 72 (532) Q Consensus 1 m~~~------~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~-~-~~~~~~~~~dst~~~a~~~Laa 72 (532) |.+. +++.+..+- +-.+- ..-+.++. -|....+.-+ . --+.... ..+...|+++.+. T Consensus 42 ~~~~~~~~~~~~~~a~~~~----~~~~~-------~~~~~~~~--~~~~~~~~~~l~~~l~~~~~--n~i~~~~I~t~~~ 106 (563) T protein:vir:95 42 EYQDLTKSLYGQQQAYAEP----FIEMM-------DTNPEFRD--KRSYMKNEHNLHDVLKKFGN--NPILNAIILTRSN 106 (563) T ss_pred hHHHHHhhhccCCCcchhh----hHhhh-------cccccccc--cccCCCCcccHHHHHHHhhc--chHHHHHHHHHHH Confidence 2221 111110111 11110 00011111 1111100000 0 0000000 1112222333332 Q ss_pred HHHHhhcCC-----CCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHH-----HhcCChHHHHHHHHHHHhh Q lcl|NC_015159. 73 KLMLALFPV-----GSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYM-----ESNSFRPTLHAAIKQLLVA 142 (532) Q Consensus 73 ~l~~~ltpp-----~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-----~~snf~~~~~~~~~dl~~~ 142 (532) -+...-.+. ...| .+.+-+.+... ... ++. -...+++.+.... ...+|..-+..++.++.++ T Consensus 107 ~vA~~~~~~~~~~~~~~~-~i~l~~~~~~~---~~~---~~~-~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~ 178 (563) T protein:vir:95 107 QVAMYCQPARYSEKGLGF-EVRLRDLDAEP---GRK---EKE-EMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIY 178 (563) T ss_pred HHHHHhhhhhhhcccccc-eeEEeecCCCc---chh---hhh-hhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhc Confidence 222211110 1111 11111111000 000 000 0111112221111 1234555666778899999 Q ss_pred CceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEE Q lcl|NC_015159. 143 GNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVY 222 (532) Q Consensus 143 G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~ 222 (532) |||.+|+-..-...+..+.+..++...+.+..+.+|.+-.-.+ T Consensus 179 Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~------------------------------------- 221 (563) T protein:vir:95 179 DQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGK------------------------------------- 221 (563) T ss_pred CCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccce------------------------------------- Confidence 9998875321111122333333334556666666654321111 Q ss_pred eeCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeec---CCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015159. 223 RDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKM---PNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSK 299 (532) Q Consensus 223 ~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~---~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~ 299 (532) ..++..+|..... +..++ .+.++.... ....||.+|..-+...+.....+.+.......-... T Consensus 222 --------~y~~~~~g~~~~~----~~~~e--vI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~ 287 (563) T protein:vir:95 222 --------RFVQVVDKRVVAS----FTSRE--LAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGT 287 (563) T ss_pred --------eEEEEeCCceeEE----ecCcc--eEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCC Confidence 1122222221100 00011 122222221 225799999999999999888888888888777788 Q ss_pred Cceee--cCccccChhhh-------c---cC-CC-ceeecCccccccccccCCc-cchhHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_015159. 300 VLFFV--NPNGVTQIRRV-------A---KA-NT-GDFVAGRKQDVEVFQLEKY-NDFQVAKATADDIEKRLSYAFMLNS 364 (532) Q Consensus 300 p~~lv--~~~g~~~~~~~-------~---~~-~~-G~~v~g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~ 364 (532) |..++ +.+..++.+.. . .+ .+ |.+..-..+++...++... .+.+ ..+..+..+..|-++|-... T Consensus 288 p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~q-fle~~~~~~~~Ia~afgVPp 366 (563) T protein:vir:95 288 TRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQ-FEKWLNYLINIISALYGIDP 366 (563) T ss_pred CceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHH-HHHHHHHHHHHHHHHhCCCH Confidence 88554 32322343322 1 11 11 1110011233444444432 2343 34566777888999984322 Q ss_pred cccCC---C-----------CCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee Q lcl|NC_015159. 365 AVQRG---G-----------DRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI 430 (532) Q Consensus 365 ~~~~~---~-----------~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~ 430 (532) ..... + .+-++++.. ..=....|.|.+.++..+|-.-|+. .. ...+...+ T Consensus 367 ~~lG~~~~~~~~~~~~~ss~~~sn~e~~~--~~f~~~tL~P~l~~ie~~ln~~L~~-------------~~-~~~~~~~f 430 (563) T protein:vir:95 367 AEIGFPNRGGATGSKGGSTLNEADPGKKQ--QQSQNKGLQPLLRFIEDLVNRHIIS-------------EY-GDKYTFQF 430 (563) T ss_pred HHccccccccccccccccchhhccHHHHH--HHHHHHHHHHHHHHHHHHHHhhhch-------------hc-ccccEEEe Confidence 11111 0 111222221 1223445666666666665444432 11 11122232 Q ss_pred ecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHh----------HccCCHHHHHHHHHHHHH Q lcl|NC_015159. 431 ATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTT----------GLILTQQDKQAKMAEAST 500 (532) Q Consensus 431 v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~----------~i~~s~ee~~~~~~q~~~ 500 (532) .. .+...|+...+ +...++ +.+ +-.++ +-+.+|.+|- .+....+ ..+. ... T Consensus 431 ~r-~D~~~~~e~~~-~~~~~~--~G~-------lT~NE----~R~~~gl~Pi~gGD~~~~~~~~~~~~~-~~~~---~~~ 491 (563) T protein:vir:95 431 VG-GDTKSATDKLN-ILKLET--QIF-------KTVNE----AREEQGKKPIEGGDIILDASFLQGTAQ-LQQD---KQY 491 (563) T ss_pred cc-CCHHHHHHHHH-HHHHhc--CCc-------cCHHH----HHHHhCCCCCCCcceeecccccccccc-cccc---cCC Confidence 21 23334443322 111111 000 11111 1111233221 0110000 0000 000 Q ss_pred HHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 501 AAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 501 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) ..+.++ ..............+.|++ T Consensus 492 ~~~~~~-------~~~~~~~~~~~~~~~~~~~ 516 (563) T protein:vir:95 492 NDGKQK-------ERLQMMMSLLEGDNDDSEE 516 (563) T ss_pred Cccccc-------hhhhhcccccCCCCCCCCC Confidence 000000 0000000111111111111 No 177 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=55.85 E-value=0.47 Score=22.38 Aligned_cols=388 Identities=11% Similarity=0.041 Sum_probs=131.2 Q ss_pred hhcccccCCCCCcc-ccccc---ccccchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHH Q lcl|NC_015159. 38 YTIPSVFPSATADG-STSYT---TPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLA 113 (532) Q Consensus 38 ~~~P~~~~~~~~~~-~~~~~---~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~ 113 (532) ..+.. +..-+.+. ..... ..|........ .+.....-|+.... ....|..-.+ T Consensus 1 ~~~~~-~~~~~~p~~~~~~~~~~~~~~~~~~~g~---------~~~~~~~~~~~~~~-------------~~~~V~acV~ 57 (518) T protein:vir:78 1 MLLAN-GQTLSAPAMAELSPQMQDSYYYAPAVGM---------QLERQFSLYGGIYK-------------NQPWVRTVIA 57 (518) T ss_pred CcccC-ceeeccchhhhhhhhhhhcccccceece---------ecccccchhhHHhh-------------hhHHHHHHHH Confidence 11110 00000110 00000 11100000000 00000000000000 0001111111 Q ss_pred HHH-----------------------HHHHHHHHhcCC----hHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEe Q lcl|NC_015159. 114 MVE-----------------------RICMNYMESNSF----RPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYK 166 (532) Q Consensus 114 ~ve-----------------------~~~~~~l~~snf----~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~p 166 (532) .+. ......+.+=|- +.=...++.+|...||+++|+..+. .+....+..++ T Consensus 58 ~IA~~iA~lp~~l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~--~G~~~~L~~l~ 135 (518) T protein:vir:78 58 KRAQALARLPVKCMFTSGDTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK--SGTPEKLMPMH 135 (518) T ss_pred HHHHhhccCceEEEEEcCCccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC--CCcEEEEEEEC Confidence 111 111222333332 2334566677888899988875431 12222233333 Q ss_pred cceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcccccccc Q lcl|NC_015159. 167 LHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEG 246 (532) Q Consensus 167 l~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~ 246 (532) .+.+.+..|.++. .+..++...+... ... T Consensus 136 p~~Vtv~~~~~~~-------------------------------------------------~~~y~~~~~~~~~-~~~- 164 (518) T protein:vir:78 136 PSRVAIKRNSRTG-------------------------------------------------RYEYYFQAGAGVG-TQL- 164 (518) T ss_pred CCceEEEEcCCCC-------------------------------------------------EEEEEEEecCCcc-cee- Confidence 3334333433211 1111111111100 000 Q ss_pred cCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCC------- Q lcl|NC_015159. 247 EYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKAN------- 319 (532) Q Consensus 247 ~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~------- 319 (532) ..|..-=+++.|+...+|..||.||..-+...+.......+.......-...|..++.-++.++++...... T Consensus 165 -~~~~~~eIiHir~~~~dg~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~ls~e~~~~~k~~~~~~~ 243 (518) T protein:vir:78 165 -VSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSPEAQQRLREQFDRAH 243 (518) T ss_pred -EEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHh Confidence 001112245566665667789999999988888888888888888777788888777766666666432111 Q ss_pred -----CceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhH Q lcl|NC_015159. 320 -----TGDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVY 393 (532) Q Consensus 320 -----~G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~~E~~~~LGpv~ 393 (532) .|.+.. -.+++...++.. ..+.+. .+..+..+..|-++|-.-.......+.-|-+=+.+... T Consensus 244 ~G~~nag~~~v-L~~G~~~~~l~~~~~d~q~-le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~~~---------- 311 (518) T protein:vir:78 244 AGSSNTGKTMV-VEEGMEPIPLQLTAVEMQF-IEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMR---------- 311 (518) T ss_pred cCcccCCceeE-cCCCceEEeccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHH---------- Confidence 111111 112233333332 234443 34445666778888843211111111112221111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHH Q lcl|NC_015159. 394 SLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRL 473 (532) Q Consensus 394 ~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~ 473 (532) .+...-+.|++.++...+.+ .++++.... ..+.+ .++.|-|. +.+....++..+-+.. .+..++ + T Consensus 312 -~f~~~tL~P~~~~ie~eln~-~L~~~~~~~-~~~~f--d~~~Llr~-D~~~r~~~~~~~~~~G-----~lT~NE----~ 376 (518) T protein:vir:78 312 -AFYRDTMAIPIARIQSAMDK-YVGQYWVRK-NRMKF--DIDDVIQP-DWEAKSESTQKMVNSG-----VATPNE----G 376 (518) T ss_pred -HHHHHHHHHHHHHHHHHHHH-hhcccccCc-ceEEe--echhhhcc-CHHHHHHHHHHHHhCC-----CcCHHH----H Confidence 12333466666665555543 344433321 12222 12333222 1111222222222110 122232 1 Q ss_pred HHhcCCCHhH-------ccCCH-HHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCC-C Q lcl|NC_015159. 474 ANSLGMDTTG-------LILTQ-QDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPT-Q 532 (532) Q Consensus 474 a~~~Gv~p~~-------i~~s~-ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-~ 532 (532) -+.+|.+|.. ++.+. ..+.+.. ... ..-+++...........+...-+..++.|. + T Consensus 377 R~~~gl~pie~~~gD~~~v~~n~~pl~~~~--~~~-~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (518) T protein:vir:78 377 REIMGLPRSDDPKADELYANSALQPLGATP--DGA-VEGEEAPAPKRPASTPVASLDQSPPASVPGLS 441 (518) T ss_pred HHHhCCCCCCCCCCceeeecccceeccccc--ccc-cCCCCCCCCCCCCcccccccccCccccCCCCC Confidence 2223443321 11000 0000000 000 000000000000000000000000111110 0 No 178 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=55.57 E-value=0.48 Score=22.35 Aligned_cols=364 Identities=10% Similarity=0.010 Sum_probs=129.4 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHH--HHHHHHHHhhcccccCCCCCccccc-cccccc-chHHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYE--TRAEDCATYTIPSVFPSATADGSTS-YTTPWQ-SIGARGLNNLASKLML 76 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e--~~w~e~~~~~~P~~~~~~~~~~~~~-~~~~~d-st~~~a~~~Laa~l~~ 76 (532) |. +.+.| ..| +++ ..|..+... .+ ...+..|..- ...... ++--.|++.+|+.+.+ T Consensus 1 m~-----------~~~~~----~~~-~~~~~~~~~~~~~~-~~---~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~ 60 (419) T protein:vir:57 1 MF-----------IPQFW----KGR-PSENRVNWQVVPGG-MR---SSSSQAGVIITPETALALSAVRACVTLLAESVAQ 60 (419) T ss_pred Cc-----------chhhh----ccC-Cccccccccccccc-cc---cccccCCceechHHhhccHHHHHHHHHHHHhhcc Confidence 21 11112 111 111 112211110 00 0001111110 011222 3333445555554443 Q ss_pred hhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----CChHHHHHHHHHHHhhCceeeeecc Q lcl|NC_015159. 77 ALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SN----SFRPTLHAAIKQLLVAGNVLLYIPS 151 (532) Q Consensus 77 ~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~~~v~~ 151 (532) | ||--+.-.+..-.+ .+ .+.-+...|+ +- +.+.-....+.+|..+||+++++.. T Consensus 61 -l-----p~~~~~~~~~g~~~---------~~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r 119 (419) T protein:vir:57 61 -L-----PCVLYRRTENGGRE---------IA------FDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDR 119 (419) T ss_pred -C-----ceEEEEEcCCCcee---------cc------ccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEE Confidence 2 44322211100000 00 0111222332 22 2344455667788899999888764 Q ss_pred cccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEE Q lcl|NC_015159. 152 TEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFR 231 (532) Q Consensus 152 ~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~ 231 (532) +. .+..+.+..++...+.+..+.+|.+ |.+. . . T Consensus 120 ~~--~G~~~~L~pl~~~~v~v~~~~~g~~---~y~~--~----------------------------------~------ 152 (419) T protein:vir:57 120 NG--RGDITELIPINPHKVIVLKGPDGMP---YYDI--P----------------------------------S------ 152 (419) T ss_pred CC--CCcEEEEEEEcCcceEEEECCCceE---EEEE--c----------------------------------C------ Confidence 32 2233334444445555555555532 1000 0 0 Q ss_pred EEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCcc--- Q lcl|NC_015159. 232 SYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNG--- 308 (532) Q Consensus 232 s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g--- 308 (532) .+.. ++.++ +++.|....+ ..||.||...+...+.....+.+.......-...|..++.-.+ T Consensus 153 -----~~~~-------~~~~~--vih~r~~~~d-~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~ 217 (419) T protein:vir:57 153 -----IGEI-------LPMRM--VHHIKSFSLD-GYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAK 217 (419) T ss_pred -----CceE-------Echhh--EEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCC Confidence 0000 00001 1223322223 4899999999999999999898888888888888886654321 Q ss_pred -ccChhhhc----------cC-CC-ceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--cc-cCCCC Q lcl|NC_015159. 309 -VTQIRRVA----------KA-NT-GDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--AV-QRGGD 371 (532) Q Consensus 309 -~~~~~~~~----------~~-~~-G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~-~~~~~ 371 (532) .++.+.+. .+ .+ |.+. --.+++...++.. ..+.+. .+..+..+..|-.+|-... +. ...+. T Consensus 218 ~~~~~e~~~~~~~~~~~~~~g~~nag~~~-vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t 295 (419) T protein:vir:57 218 AIASQAAVDAILAKWTERYGGVRNAFSVG-MLQEGMTYKQLSQDNEKAQL-LQSRQYTVNEVCRLYKVPPHMIQDLQKST 295 (419) T ss_pred cccCHHHHHHHHHHHHHHhccccccccce-ecCCCceEEEcCCChhhHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCc Confidence 12222110 01 11 1111 1122333334332 334442 3444566677888884321 11 11222 Q ss_pred CCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee-ecc---hHHHHHHHHHHHHH Q lcl|NC_015159. 372 RVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI-ATG---LEALGRGHDLNKLN 447 (532) Q Consensus 372 ~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~-v~~---l~~l~raq~~~~l~ 447 (532) .-+++|... .+...-|.|++.+....+.+. +|++-......+.+ +.. -+...|+.-.+.+. T Consensus 296 ~sn~e~~~~--------------~f~~~~l~P~~~~ie~~l~~~-ll~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~ 360 (419) T protein:vir:57 296 NNNIEHQGL--------------QYVIYTMLAILKRHESAMMRD-LLLPSERRDFYIEFNVSSLLRGDQKSRYESYALGR 360 (419) T ss_pred cccHHHHHH--------------HHHHHHHHHHHHHHHHHHHhh-ccCccccCCeEEEEechhhhccCHHHHHHHHHHHH Confidence 223333221 123334555555544443332 33222122222222 111 12223322221111 Q ss_pred HH----HH---HHHhhcch-hhh-------hcCHHHHH-------HHHHHhcCCCHhHccCC Q lcl|NC_015159. 448 VF----ID---YMIKLAGL-QDD-------DINLLDVK-------MRLANSLGMDTTGLILT 487 (532) Q Consensus 448 ~~----~~---~laq~~p~-~~d-------~id~d~~~-------~~~a~~~Gv~p~~i~~s 487 (532) .. .. .+-.+.|. -.| .++...+- +...+..++ ..-|- T Consensus 361 ~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~ 419 (419) T protein:vir:57 361 QWGWLSVNDIRRMENLTPIPGGDKYLTPLNMVDSKALTGIGKATPQQLKDIEAI---LCTRN 419 (419) T ss_pred hCCCcCHHHHHHHhCCCCCCCcCeeeeccccccccccccccCCCcccCcchhhh---hhccC Confidence 10 00 00011110 000 00000000 000000000 00011 No 179 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=53.66 E-value=0.52 Score=22.13 Aligned_cols=385 Identities=11% Similarity=-0.001 Sum_probs=144.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccc----cCCCCCccccc-cccccc-chHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV----FPSATADGSTS-YTTPWQ-SIGARGLNNLASKL 74 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~----~~~~~~~~~~~-~~~~~d-st~~~a~~~Laa~l 74 (532) |++--... .++... ..|+.....-......+.|.. +...+..+..- ...... ++--.|++.+|+.+ T Consensus 1 ~~~~l~~~------~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~~al~~~~V~~~i~~ia~~i 72 (434) T protein:vir:43 1 MSKSLGKV------LSSATS--APRSSLFGWGGKTIRLTDGAFWSQFLGRESSSGKKVTVDKAMKLSAVWACVRLISTSV 72 (434) T ss_pred Cccchhhh------hhhccc--ccchhhhcccccccccCchHHHHHHhcCCccCCceechhhhhccHHHHHHHHHHHHhh Confidence 88764221 111110 112211110001111111110 01111111110 011122 22234555555554 Q ss_pred HHhhcCCCCCccccCCC-hHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcCCh----HHHHHHHHHHHhhCceeee Q lcl|NC_015159. 75 MLALFPVGSSFFKLNVS-ELEVKQSITSPEELTEIATGLAMVERICMNYME-SNSFR----PTLHAAIKQLLVAGNVLLY 148 (532) Q Consensus 75 ~~~ltpp~~~WF~l~~~-d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~snf~----~~~~~~~~dl~~~G~~~~~ 148 (532) .+ -||.-..-. |.... ++ .+..++..|+ +-|-+ .=...++.+|...||+.+| T Consensus 73 a~------lp~~~~~~~~~g~~~----------~~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~ 130 (434) T protein:vir:43 73 AG------LPLGVYERKADGSRV----------DA------RSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAE 130 (434) T ss_pred hh------CceEEEEEcCCCccc----------cc------cccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEE Confidence 43 344322211 00000 00 0112223343 33433 3344557788899999888 Q ss_pred ecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCC Q lcl|NC_015159. 149 IPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAM 228 (532) Q Consensus 149 v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~ 228 (532) +..+ .+....+..++...+-+..|.+|.+- |+ T Consensus 131 i~~~---~G~~~~L~~l~p~~v~~~~~~~g~~~--y~------------------------------------------- 162 (434) T protein:vir:43 131 IRRA---AGRPAALDFLLPSRVDLECDENGRLK--YF------------------------------------------- 162 (434) T ss_pred EEeC---CCcEEEEEEEcCcceEEEEcCCCeEE--EE------------------------------------------- Confidence 7532 23333344444455555555555321 10 Q ss_pred eEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCcc Q lcl|NC_015159. 229 VFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNG 308 (532) Q Consensus 229 ~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g 308 (532) ++..+|... .+ ...-+++.|....+| .||.||...+...+.......+.......-...|..++.-++ T Consensus 163 ----~~~~~g~~~-----~~--~~~eVih~~~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~ 230 (434) T protein:vir:43 163 ----YTTKKGARR-----EI--ERTNMLHIPAFTLDG-RIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDR 230 (434) T ss_pred ----EEecCceEE-----EE--ccccEEEecCcCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCC Confidence 000011100 00 111223334333344 799999999998888888888888777777788887776666 Q ss_pred ccChhhhccC----------CC-ceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--cccCCCCCCC Q lcl|NC_015159. 309 VTQIRRVAKA----------NT-GDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--AVQRGGDRVT 374 (532) Q Consensus 309 ~~~~~~~~~~----------~~-G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~~~~~T 374 (532) .++++..... .+ |.+ .--.+++...++.. ..+.+. .+..+..+..|-++|-.-. +...+....+ T Consensus 231 ~l~~e~~~~~r~~~~~~~g~~nag~~-~vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~ 308 (434) T protein:vir:43 231 ILQPAQREEFREYVKSVSGAMNSGRS-PVLEQGITPETIGINPVDAQL-LETREHGVIEICRWFGVPPWMIGQTDKGSNW 308 (434) T ss_pred CCCHHHHHHHHHHHHHhcCccccCCc-cccCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCcCCccc Confidence 6666543111 11 111 11122333334332 334443 3445666778888874321 1111222222 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee-ecc---hHHHHHHHHHHHHHHH- Q lcl|NC_015159. 375 AEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI-ATG---LEALGRGHDLNKLNVF- 449 (532) Q Consensus 375 AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~-v~~---l~~l~raq~~~~l~~~- 449 (532) .+-+.+.... +...-|.|++.++-..|.+ .+|++-......+++ ++. .+...|+.-...+... T Consensus 309 ~s~~e~~~~~-----------f~~~~L~P~~~~ie~~ln~-kL~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G 376 (434) T protein:vir:43 309 GTGLEQQMLA-----------FLTFSISSITNQIQQCVNK-RLLTAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNG 376 (434) T ss_pred cchHHHHHHH-----------HHHHHHHHHHHHHHHHHHh-hcCChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCC Confidence 2222222111 2233344555444444332 234332211222222 111 1344444443333321 Q ss_pred ---HHH---HHhhcch-hhhh----cCHHHHHHHHHHhcCCC--HhHc-----cCCHHH Q lcl|NC_015159. 450 ---IDY---MIKLAGL-QDDD----INLLDVKMRLANSLGMD--TTGL-----ILTQQD 490 (532) Q Consensus 450 ---~~~---laq~~p~-~~d~----id~d~~~~~~a~~~Gv~--p~~i-----~~s~ee 490 (532) ... +-.+.|. -.|. .|.- -++.+.+..--+ .... -++++| T Consensus 377 ~~T~NE~R~~~gl~p~~ggD~~~~~~n~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 377 FMTRNEGRRKENLPELPGGDILTVQSNLV-PIDQLGQSNKSQAVRAALMNWFSQPEPQE 434 (434) T ss_pred CcCHHHHHHHhCCCCCCCCCeEeeccCcc-chhhhhccCCCcchhhhhhccCCCCCCCC Confidence 111 1122221 1121 1111 012222111100 0000 011222 No 180 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=51.77 E-value=0.57 Score=21.91 Aligned_cols=429 Identities=9% Similarity=0.006 Sum_probs=138.1 Q ss_pred CCCCC-----------------------------CCccCHHHHHHHHHHHHHHhhhHHHHHHHHHH---hhcccccCCCC Q lcl|NC_015159. 1 MAEVE-----------------------------KTGFAADGAAAAYNRLKNDRGAYETRAEDCAT---YTIPSVFPSAT 48 (532) Q Consensus 1 m~~~~-----------------------------~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~---~~~P~~~~~~~ 48 (532) .+.-+ -..+-.+++...| +.+..+|+.++-++.. ..+|....++. T Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~kk~~i~~pf---kkk~~~~~~d~f~~s~es~s~vtsls~pda 110 (945) T protein:vir:10 34 LSRGKDYPGFKPLLTYRALAWNSTVVYSIIIFRKNQVLKKEKIIVPY---NHQEPPFKFNLFEYSPESLMYLPSISDPDA 110 (945) T ss_pred hhcccCCCCcchhhhhhhhhccceeeeeeeeehhhhHHHhhcccccc---cccccchhhhhhhccCccceecccccCccc Confidence 00000 0000111222233 2233345543322210 01111111110 Q ss_pred Ccc--cccccccccchHHHHHHHHHHHHHHhhcCCCCCccccC-CChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015159. 49 ADG--STSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLN-VSELEVKQSITSPEELTEIATGLAMVERICMNYMES 125 (532) Q Consensus 49 ~~~--~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~-~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~ 125 (532) ..+ -......-+++--.|++.+|+.+.+. ||--.. ..+...............+..+|.. -...... T Consensus 111 f~~vnVs~~~AlknsaV~scI~~IA~sIAsL------PlklYrr~edG~~~~~~kk~~~~hpL~~LL~r----PNp~mT~ 180 (945) T protein:vir:10 111 FFLINLFRKYRFNNDSKLIKVSEIPKKLTSK------ELEIYKHIEDKHVNYYLKRIRDARNILEFLER----PDPYFSE 180 (945) T ss_pred eeeehhhhhhhhccHHHHHHHHHHHhhhccC------ceEEEEecccCcccccccccccchHHHHHHhC----CCcccCh Confidence 000 00111112233344556666555332 332110 0110000000000000112222210 0111111 Q ss_pred cCChH-HHHHHHHHHHhhCceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHh Q lcl|NC_015159. 126 NSFRP-TLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEE 204 (532) Q Consensus 126 snf~~-~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~ 204 (532) ..|.. -+...+.++..+||+.+++..+ ..+....+..++...+.+..+.+|..- T Consensus 181 ~eFwqsFl~~Lv~dLLL~GNAYieIiRd--~~G~ii~L~pLdPs~Vti~~ddDG~~~----------------------- 235 (945) T protein:vir:10 181 VNSWEYLLGMVLDDILTIDRGAIVKIRD--EQGNLVAITPVDGTTIKPILSEDTGIV----------------------- 235 (945) T ss_pred hHHHHHHHHHHHHHHhhcCCeEEEEEEC--CCCcEEEEEEECCcceEEEEcCCCcEE----------------------- Confidence 22222 2334568899999998876532 122223333333344444455444321 Q ss_pred hcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCc--cccchHHHHHHHHHH Q lcl|NC_015159. 205 AQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNED--YGRSFVEEYLGDLKS 282 (532) Q Consensus 205 ~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~--YG~Gp~~~al~d~~~ 282 (532) |..++..+|.... .+..++. ++..|+...+|.. ||.+|.+.+...+.. T Consensus 236 -------------------------y~Yv~~idG~~~~----~v~a~Dv-Ilhirn~s~DG~~~GyGlSPIeaa~~aI~~ 285 (945) T protein:vir:10 236 -------------------------VGYVQEVDGAIVA----HFDKRDV-VLFRQNLTPDVYMYGYSLPPIEILYKVILS 285 (945) T ss_pred -------------------------EEEEEecCCceEE----EecCCce-EEEeccCCCCcccccCCchHHHHHHHHHHH Confidence 1111222222111 0000010 2223333334433 789999988887777 Q ss_pred HHHHHHHHHHHHHH-HhcCceeec----------CccccChhhh----------ccCCC-ce-eecCccccccccccCC- Q lcl|NC_015159. 283 LENLYEAIVKMSMI-SSKVLFFVN----------PNGVTQIRRV----------AKANT-GD-FVAGRKQDVEVFQLEK- 338 (532) Q Consensus 283 L~~l~~~~l~~~~~-a~~p~~lv~----------~~g~~~~~~~----------~~~~~-G~-~v~g~~~~~~~~~~~~- 338 (532) .....+........ ...|..++. ..+.++++.. ..+.+ |. ++. .+++...++.. T Consensus 286 alAaek~aar~FskNGa~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG~NnG~piVL--deGmef~pLs~s 363 (945) T protein:vir:10 286 DIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMGDYTQVPILS--GGKFTWIDFKGK 363 (945) T ss_pred HHHHHHHHHHHHHhCCCccceEEEecCccccccccccccCHHHHHHHHHHHHHHhCCcccccceec--CCCceEEEccCC Confidence 77776666665433 345543332 1122232221 11111 21 121 23334444432 Q ss_pred ccchhHHHHHHHHHHHHHHHHHhhhh--cc-cCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015159. 339 YNDFQVAKATADDIEKRLSYAFMLNS--AV-QRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQAT 415 (532) Q Consensus 339 ~~~~~~~~~~i~~~~~rI~~af~~~~--~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~ 415 (532) ..+.+. .+..+..+..|-++|-.-. +. ..++..-++++. ...=....|.|...+++.+ +.+ T Consensus 364 ~~DaQf-LEsrkfs~eeIArAFGVPP~lLG~~e~st~SNiEqq--~~~Fv~~tL~Pil~~IEqe------------LNr- 427 (945) T protein:vir:10 364 RRDMQF-KELAEFVARKICAVYQVSPQDVGILEGSNKATAEVM--ASLTKAKGLEPLMATISKG------------FDE- 427 (945) T ss_pred hhHHHH-HHHHHHHHHHHHHHhCCCHHHcccCCCCCcchHHHH--HHHHHHHHHHHHHHHHHHH------------HHH- Confidence 234543 3556777788888884321 11 112222122221 1122223344555555444 322 Q ss_pred CCCCCCcccccccee--ecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhH---------- Q lcl|NC_015159. 416 SKIPNLPKEAVEPAI--ATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTG---------- 483 (532) Q Consensus 416 g~lp~~p~~~~~~~~--v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~---------- 483 (532) .+++...+..+...+ ....++..|+.-.+.+. +. . .+..++ +-+.+|.+|-. T Consensus 428 kLl~~~eg~~i~fdFd~ldl~D~ksraEal~kli---~s-G--------iLTiNE----vRe~lGLpPIeGGD~lli~~n 491 (945) T protein:vir:10 428 VVSEFRNEKDIKLWFKEDDLEKERDWWNIIQGQL---NT-G--------FRSINE----ARMEKGLEPVPWGDVPFSGLR 491 (945) T ss_pred hccccccCceeEEEecchhccCHHHHHHHHHHHH---hC-C--------CcCHHH----HHHHhCCCCCCCcceeeeccc Confidence 122222222333333 11222223332222111 11 0 111222 11112332210 Q ss_pred -ccCCHHHHHHH-----------------------------H-HHH-HHHHHHHHHHHhhhHHHHHHHHhhcccccCCCC Q lcl|NC_015159. 484 -LILTQQDKQAK-----------------------------M-AEA-STAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPT 531 (532) Q Consensus 484 -i~~s~ee~~~~-----------------------------~-~q~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 531 (532) +...++...+. . +.+ ..+...+....++.+......+++. +.+|+-- T Consensus 492 n~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dEns~~psE~kda~~e~~~~l~~~~~~~a~e~i~~~~-e~~~~~~ 570 (945) T protein:vir:10 492 NWKPEDEQAKAQQGAMPPQLAQAMADQPSQQGGGVDENSSVPSEQKNAGLEVLRNLFKSLDANASENLKQVI-ELTNDDN 570 (945) T ss_pred cccccccccccccCCCCcccccCCCCCCCCCCCCCCCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHH-hhcCCCc Confidence 00000000000 0 000 0011111222222222222222222 2222222 Q ss_pred C Q lcl|NC_015159. 532 Q 532 (532) Q Consensus 532 ~ 532 (532) - T Consensus 571 ~ 571 (945) T protein:vir:10 571 Y 571 (945) T ss_pred h Confidence 1 No 181 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=51.24 E-value=0.59 Score=21.85 Aligned_cols=380 Identities=10% Similarity=-0.025 Sum_probs=149.6 Q ss_pred HHHHHHHHHhhhH-HHHHHHHHHhhcccccCCCCCcccccc-cccc-cchHHHHHHHHHHHHHHhhcCCCCCccccCCCh Q lcl|NC_015159. 16 AAYNRLKNDRGAY-ETRAEDCATYTIPSVFPSATADGSTSY-TTPW-QSIGARGLNNLASKLMLALFPVGSSFFKLNVSE 92 (532) Q Consensus 16 ~r~~~lk~~R~~~-e~~w~e~~~~~~P~~~~~~~~~~~~~~-~~~~-dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d 92 (532) --|+.|..+|+.. ...+-+..+.+-.... +..|..-. +... .++--.|++.+|+.+. +-||.-....+ T Consensus 1 ~~f~~~f~r~~~~~~~~~~~~~~~~~~~~~---~~~g~~v~~~~~l~~~~v~~~i~~Ia~~iA------~~p~~~~~~~~ 71 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTTPAELAEAIGLSYD---TYTGKRISSQRAMRLTAVYSCVRVLAESVG------MLPCSLYKISG 71 (413) T ss_pred CccchhhccCccCCccchHHHHHhhhcCcc---cccCceechhhhhccHHHHHHHHHHHHhhh------hCceEEEEecC Confidence 2333443333321 1112222222211111 11111000 0111 2233344444444433 22332222111 Q ss_pred HHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-h----cCChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEec Q lcl|NC_015159. 93 LEVKQSITSPEELTEIATGLAMVERICMNYME-S----NSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKL 167 (532) Q Consensus 93 ~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~----snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl 167 (532) ....+ + .+..+...|+ + -+.+.-+...+.++...|||.+|+..+ .+....+..++. T Consensus 72 ~~~~~----------~------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~---~g~~~~L~~l~~ 132 (413) T protein:vir:48 72 TLKTR----------V------VDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA---LGEVVELLPIDP 132 (413) T ss_pred Cccee----------e------cccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC---CCcEEEEEEEcC Confidence 11000 0 0111122232 2 334455666778888999998887532 122222333333 Q ss_pred ceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCccccccccc Q lcl|NC_015159. 168 HNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGE 247 (532) Q Consensus 168 ~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~ 247 (532) ..+-+..|.+|.+ +| .... .++.. ..+-.+ T Consensus 133 ~~v~~~~~~~~~~--~y----------------------------------~~~~--~~g~~---~~~~~~--------- 162 (413) T protein:vir:48 133 GCVEPKLNSQWQP--VY----------------------------------QVTF--PDGSV---DVLTQD--------- 162 (413) T ss_pred ceEEEEEcCCceE--EE----------------------------------EEEe--cCceE---EEEccc--------- Confidence 4444444444322 11 0000 01100 000011 Q ss_pred CccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCC-------- Q lcl|NC_015159. 248 YPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKAN-------- 319 (532) Q Consensus 248 ~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~-------- 319 (532) =+++.|-...+ ..||.||...+...+.....+.+.......-...|..++.-++.++++...... T Consensus 163 ------evih~~~~~~d-~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~~~~~~~~~~~~ 235 (413) T protein:vir:48 163 ------EIWHVRTLTLD-GLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYERLKKDFEERHT 235 (413) T ss_pred ------cEEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhc Confidence 12223322223 379999999999999998888888888777778888777776666665431111 Q ss_pred ----CceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--cccC-CCCCCCHHHHHHHHHHHHHHhhh Q lcl|NC_015159. 320 ----TGDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--AVQR-GGDRVTAEEIRYVAGELEDTLGG 391 (532) Q Consensus 320 ----~G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~-~~~~~TAtEi~~r~~E~~~~LGp 391 (532) .|.+.. ..+++...++.. ..+.+ ..+..+..+..|-.+|-... +... ++..-++++.. T Consensus 236 g~~n~g~~~v-l~~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~------------ 301 (413) T protein:vir:48 236 GLGNAHRPMI-LEMGLDWKSMALNAEDSQ-FLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELG------------ 301 (413) T ss_pred CccccCccee-cCCCceEEeccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHH------------ Confidence 111111 112233334332 23444 34555666777888874321 1111 12222333322 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHH Q lcl|NC_015159. 392 VYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKM 471 (532) Q Consensus 392 v~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~ 471 (532) ..+...-+.|++.++-..|.+ .+|++.......+.+ .++.|-|. +.+....+++.+-+. ..+..+++- T Consensus 302 --~~f~~~~i~P~~~~ie~~l~~-~L~~~~~~~~~~~~f--d~~~l~~~-d~~~~~~~~~~~~~~-----g~~T~NE~R- 369 (413) T protein:vir:48 302 --LGFINYSLVPYLTRIEQRINT-GLVRESKQGKFYAKF--NAGALLRG-DMKSRFEAYATGINW-----GIYSPNDCR- 369 (413) T ss_pred --HHHHHHHHHHHHHHHHHHHHh-hccCccccCCeEEEE--echhhhcc-CHHHHHHHHHHHHhC-----CCcCHHHHH- Confidence 224455667777766555543 355543333332332 23444442 222222222222111 112333322 Q ss_pred HHHHhcCCCHhHccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 472 RLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 472 ~~a~~~Gv~p~~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) +.+|.+|- ..-|+.-.- . .........++.|.+++ T Consensus 370 ---~~~g~~p~---~ggD~~~~~---~-----------------n~~~~~~~~~~~~~~~~ 404 (413) T protein:vir:48 370 ---DLEDMNPR---PGGDVYLTP---M-----------------NMTTSPSAGDDNGKKKE 404 (413) T ss_pred ---HHhCCCCC---CCcceeecc---c-----------------cccccccccccCCCCCC Confidence 33566542 111111000 0 00000011112222222 No 182 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=50.71 E-value=0.6 Score=21.79 Aligned_cols=336 Identities=14% Similarity=0.087 Sum_probs=126.3 Q ss_pred cCCCCCccc-c-ccc-ccccchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHH---- Q lcl|NC_015159. 44 FPSATADGS-T-SYT-TPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVE---- 116 (532) Q Consensus 44 ~~~~~~~~~-~-~~~-~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve---- 116 (532) +...+..+. + ... ..+... ......+ .++. + .+ .++.....+ ...|..-.+.+. T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~-~~~~~~~----~~~~-~--~~----~v~~~~al~-------~~~v~~~i~~ia~~ia 61 (385) T protein:vir:10 1 MGLLTPRNFNKRKAKNMVYPSN-PAFFTTT----VGGM-Q--LS----YVSALSALQ-------NTNVYSVINRIASDVA 61 (385) T ss_pred Cccccchhcccccccccccccc-hhhhhhh----cccc-C--cc----ccCHHHhhc-------cHHHHHHHHHHHHHHh Confidence 322211110 0 011 111111 0100111 1100 0 00 011110000 011221111111 Q ss_pred --------HHHHHHHHhcCCh----HHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEE Q lcl|NC_015159. 117 --------RICMNYMESNSFR----PTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIV 184 (532) Q Consensus 117 --------~~~~~~l~~snf~----~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~ 184 (532) +.....+++-|-+ .=...++.+|..+|||.+++..+ ....+|+....|.... T Consensus 62 ~~p~~v~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~--------~~~~~p~~~~~v~~~~-------- 125 (385) T protein:vir:10 62 SAHFKTENTATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ--------NLEHIPNSDVQINYLP-------- 125 (385) T ss_pred hCceeeeccchhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC--------ceeEeecCCceEEEEE-------- Confidence 1122234443433 22344556788899998887432 2344554333222111 Q ss_pred EEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecC Q lcl|NC_015159. 185 TEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMP 264 (532) Q Consensus 185 rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~ 264 (532) +.++..| .++...+... ..+ ..--+++.|....+ T Consensus 126 ---------------------------------------~~~~~~~-~~~~~~~~~~----~~~--~~~eiihik~~~~~ 159 (385) T protein:vir:10 126 ---------------------------------------GNMGIVY-TVLESNDRPQ----MVL--RQDQMLHFRLMPDP 159 (385) T ss_pred ---------------------------------------cCCceEE-EEEEcCCceE----EEE--ccccEEEeccCCCC Confidence 1111111 0111111100 011 11123444433333 Q ss_pred C--CccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCcc-ccChhhhc----------cCCC-ceeecCcccc Q lcl|NC_015159. 265 N--EDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNG-VTQIRRVA----------KANT-GDFVAGRKQD 330 (532) Q Consensus 265 g--~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g-~~~~~~~~----------~~~~-G~~v~g~~~~ 330 (532) + ..||.||...+...+.......+.......-...|..++.-++ ..+.+... .+.+ |.+ .-..++ T Consensus 160 ~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~-~vl~~g 238 (385) T protein:vir:10 160 QYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRL-MVLPDG 238 (385) T ss_pred cccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCc-cccCCC Confidence 3 4689999999999999999999999998888888887776543 44443221 1111 111 111122 Q ss_pred ccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--cccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHH Q lcl|NC_015159. 331 VEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--AVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKI 407 (532) Q Consensus 331 ~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 407 (532) ....++.. ..+.+...+..+..+..|-++|-... +...+...-|.+.+-+........|.|.+.++.+++-.-| T Consensus 239 ~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~~~~~~l~P~~~~ie~~l~~~l--- 315 (385) T protein:vir:10 239 FDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATYLANLNSYVNPIVDELRLKM--- 315 (385) T ss_pred ceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHhh--- Confidence 33333332 23455444556667788888884321 1112222223332322233333455666666666553221 Q ss_pred HHHHHHhcCCCCCCccccccceeec--chHHHHHHHHHHHHHH-----------HHHHHHhhcchhhhhcCHHHHHHHHH Q lcl|NC_015159. 408 LLKELQATSKIPNLPKEAVEPAIAT--GLEALGRGHDLNKLNV-----------FIDYMIKLAGLQDDDINLLDVKMRLA 474 (532) Q Consensus 408 ~~~il~r~g~lp~~p~~~~~~~~v~--~l~~l~raq~~~~l~~-----------~~~~laq~~p~~~d~id~d~~~~~~a 474 (532) +.+ .++.++.. ..+.-.|+...+.+.. +++ +..++|.-.+.+......-. T Consensus 316 ----------~~~----~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g-~~p~p~~~~~~~~~~~~~~~-- 378 (385) T protein:vir:10 316 ----------NAP----DLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILT-RSGFLPDNLPEFKPLTTQVK-- 378 (385) T ss_pred ----------CCc----eEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhC-CCccCCCCCccccCcccccC-- Confidence 211 12222111 1123334433333322 111 11222222222111100000 Q ss_pred HhcCCCHhH Q lcl|NC_015159. 475 NSLGMDTTG 483 (532) Q Consensus 475 ~~~Gv~p~~ 483 (532) -|=.-.+ T Consensus 379 --~g~~~dn 385 (385) T protein:vir:10 379 --GGDEGDN 385 (385) T ss_pred --CCCCCCC Confidence 1111111 No 183 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=50.07 E-value=0.62 Score=21.72 Aligned_cols=368 Identities=11% Similarity=0.007 Sum_probs=123.4 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHH--HHHHhhcccccCCCCCcccccccccc-cchHHHHHHHHHHHHHHh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAE--DCATYTIPSVFPSATADGSTSYTTPW-QSIGARGLNNLASKLMLA 77 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~--e~~~~~~P~~~~~~~~~~~~~~~~~~-dst~~~a~~~Laa~l~~~ 77 (532) |++... .+|.. +.+...|. ....+.-++-....+..+-... ... .++--.|++.+|+.+.+ T Consensus 1 ~~~~~~--------~~~~k------~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~-~a~~~~~v~~~i~~Ia~~ia~- 64 (409) T protein:vir:94 1 MAKENI--------VTRIK------KKLIDNWIDQSASKLYDFSPWKNKSFWGVINN-TLETNETIFSAITKLSNSMAS- 64 (409) T ss_pred Cccccc--------chhhh------hHHhhhhhcCCcccccccccccCccccccchh-hhhccHHHHHHHHHHHHhhhh- Confidence 887753 22221 12222222 1111111110000000000000 111 12223444544444432 Q ss_pred hcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_015159. 78 LFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLVAGNVLLYIPST 152 (532) Q Consensus 78 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~~~v~~~ 152 (532) -||--..-.+ . .. ..+...|. +-| .+.=....+.++..+||+.+|+..+ T Consensus 65 -----lp~~~~~~~~-~-------------~~-------~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~ 118 (409) T protein:vir:94 65 -----LPLKMYEDYK-V-------------VN-------TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERD 118 (409) T ss_pred -----CceeEeeccc-c-------------cc-------hhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEC Confidence 3332211110 0 00 01112222 233 2223345567788899998876532 Q ss_pred ccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEE Q lcl|NC_015159. 153 EQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRS 232 (532) Q Consensus 153 ~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s 232 (532) . .+....+..+|.+.+-+..+.+|... +.+ T Consensus 119 ~--~G~~~~L~~l~~~~v~v~~~~~~~~~--~y~---------------------------------------------- 148 (409) T protein:vir:94 119 I--YHQPSKLFLLNPDVVEMLIENQSREL--YYS---------------------------------------------- 148 (409) T ss_pred C--CCcEEEEEEEcCceeEEEEeCCCcEE--EEE---------------------------------------------- Confidence 1 12222233333333333333322211 100 Q ss_pred EEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccCh Q lcl|NC_015159. 233 YQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQI 312 (532) Q Consensus 233 ~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~ 312 (532) ++...|... .+..+ =+++.|-....+..||.||..-+...+...+.+.+..+.. ....+.+++..++-+++ T Consensus 149 ~~~~~g~~~-----~~~~~--dvih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~--~~~~~~~i~~~~~~l~~ 219 (409) T protein:vir:94 149 IHAATGNKL-----IVHNM--DMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTE--MQKPDSFMLKYGSNVGK 219 (409) T ss_pred EEcCCceEE-----EEccc--cEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHh--cCCCCeeEEecCCCCCH Confidence 000011100 00000 1233332223456899999988777777666665543322 22233355555555555 Q ss_pred hhhcc---------CCCceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHH Q lcl|NC_015159. 313 RRVAK---------ANTGDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVA 382 (532) Q Consensus 313 ~~~~~---------~~~G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~~~TAtEi~~r~ 382 (532) +.... ..+|.+.. -.+++...++.. ..+.|.. +..+..+..|-++|-.-.....+...-|-.-+.+.. T Consensus 220 e~~~~~~~~~~~~~~~~g~~~v-l~~g~~~~~l~~~~~d~q~~-e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~ 297 (409) T protein:vir:94 220 EKRQQVLEDFKQYYEENGGILF-QEPGVEIEPLPKKYVSEDIV-ASENLTRERVANVFQLPSVFLNARSNTNFAKNEELN 297 (409) T ss_pred HHHHHHHHHHHHHhhcCCCeee-cCCCceEEEcCCChhHHHHH-HHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH Confidence 44321 11222211 112233334332 2344432 344445667878874322111122222333333322 Q ss_pred HH-HHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc-ccccee-ecc---hHHHHHHHHHHHHHHH----HHH Q lcl|NC_015159. 383 GE-LEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKE-AVEPAI-ATG---LEALGRGHDLNKLNVF----IDY 452 (532) Q Consensus 383 ~E-~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~-~~~~~~-v~~---l~~l~raq~~~~l~~~----~~~ 452 (532) .. ....|.|.+.++++|+-.-|+ |+.... ...+.+ +.. .+...|+.-.+.+... ... T Consensus 298 ~~f~~~~l~P~~~~ie~~ln~~Ll-------------~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE 364 (409) T protein:vir:94 298 RFYLQHTLLPIVKQYEEEFNRKLL-------------TKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTIND 364 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhC-------------CcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHH Confidence 22 233466666666666544433 221110 011111 111 1222333222222110 000 Q ss_pred ---HHhhcchh-hh-------hcCHHHHH--HHHHHhcCCCHhHc Q lcl|NC_015159. 453 ---MIKLAGLQ-DD-------DINLLDVK--MRLANSLGMDTTGL 484 (532) Q Consensus 453 ---laq~~p~~-~d-------~id~d~~~--~~~a~~~Gv~p~~i 484 (532) .-.+.|.. .| .+-.|... +.-..+=+.....= T Consensus 365 ~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~kGG~~n~~e~ 409 (409) T protein:vir:94 365 IREWEDLPPVEGGDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred HHHHhCCCCCCCcCeEeecccccccccchhhcccccCCCCCcCCC Confidence 00111110 01 00011100 00011100100001 No 184 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=48.47 E-value=0.67 Score=21.54 Aligned_cols=307 Identities=12% Similarity=0.023 Sum_probs=109.8 Q ss_pred CCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_015159. 80 PVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) Q Consensus 80 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~~~v~~~~~ 154 (532) =++-||.-.. .+. .+. .-+...|. +-| .+.=+...+.+|..+||+++++..+. T Consensus 1 ia~lp~~~~~-~~~-------------~~~-------~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~- 58 (348) T protein:vir:93 1 MASLPLKMYE-DYK-------------VVN-------TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI- 58 (348) T ss_pred CcccceEeEe-cCc-------------Ccc-------cHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECC- Confidence 0233443211 110 011 11223343 333 22334556678888999988875432 Q ss_pred ccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEE Q lcl|NC_015159. 155 VEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQ 234 (532) Q Consensus 155 ~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~ 234 (532) .+....+..+|.+.+-+..+.+|... +.+ ++ T Consensus 59 -~G~~~~L~~l~~~~v~~~~~~~~~~~--~y~----------------------------------------------~~ 89 (348) T protein:vir:93 59 -YHQPSKLFLLNPDVVEMLIENQSREL--YYS----------------------------------------------IH 89 (348) T ss_pred -CCcEEEEEEEcCCceEEEEeCCCcEE--EEE----------------------------------------------EE Confidence 12222232222333433333332211 000 11 Q ss_pred EEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhh Q lcl|NC_015159. 235 EIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRR 314 (532) Q Consensus 235 ~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~ 314 (532) ...|... .+.. .-++..|-....+..||.||.+.+...+...+...+..+.. ....+.+++..++.++++. T Consensus 90 ~~~g~~~-----~~~~--~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~--~~~~~~~i~~~~~~l~~e~ 160 (348) T protein:vir:93 90 AATGNKL-----IVHN--MDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTE--MQKPDSFMLKYGSNVSTEK 160 (348) T ss_pred cCCCeEE-----EEcc--ccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHh--cCCCceeEEecCCCCCHHH Confidence 1111110 0111 11344443334567899999988877777666665554332 2233445656666666643 Q ss_pred hcc---------CCCce-eecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh-ccc--CCCCCCCHHHHHH Q lcl|NC_015159. 315 VAK---------ANTGD-FVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS-AVQ--RGGDRVTAEEIRY 380 (532) Q Consensus 315 ~~~---------~~~G~-~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~--~~~~~~TAtEi~~ 380 (532) ... ..+|. ++-. ++....++.. ..+++ ..+..+..+..|-++|-.-. +.. .++..-+++|... T Consensus 161 ~~~~~~~~~~~~~n~~~~~vl~--~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~~ 237 (348) T protein:vir:93 161 RQQVLEDFKQYYEENGGILFQE--PGVEIEPLPKKYVSED-IVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNR 237 (348) T ss_pred HHHHHHHHHHHhhcCCCeeecC--CCceEEEcCCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH Confidence 211 11222 2211 2233333332 22333 22334456667877774321 111 1222223333211 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc-ccccee-ecch---HHHHHHHHHHHHHHH----HH Q lcl|NC_015159. 381 VAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKE-AVEPAI-ATGL---EALGRGHDLNKLNVF----ID 451 (532) Q Consensus 381 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~-~~~~~~-v~~l---~~l~raq~~~~l~~~----~~ 451 (532) .-....|.|...++.++|-.- +||+.... ...+.+ +..+ +...|+.-.+.+... .. T Consensus 238 --~~~~~~l~P~~~~ie~~l~~~-------------l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~N 302 (348) T protein:vir:93 238 --FYLQHTLLPIVKQYEEEFNRK-------------LLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTIN 302 (348) T ss_pred --HHHHHHHHHHHHHHHHHHHHh-------------hCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHH Confidence 222333444444444443332 34432211 111222 1111 233333322222210 00 Q ss_pred H---HHhhcchh-hh---------hcCHHHHHHHHHHhcCCCHhHc Q lcl|NC_015159. 452 Y---MIKLAGLQ-DD---------DINLLDVKMRLANSLGMDTTGL 484 (532) Q Consensus 452 ~---laq~~p~~-~d---------~id~d~~~~~~a~~~Gv~p~~i 484 (532) . .-.+.|.. .| .+|...-.+.-..+=+.....= T Consensus 303 E~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~gg~~n~~~~ 348 (348) T protein:vir:93 303 DIREWEDLPPVEGGDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 348 (348) T ss_pred HHHHHhCCCCCCCcCeEeecccccccccchhhcccccCCCCCcCCC Confidence 0 01111110 01 0111111111011101000000 No 185 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=47.95 E-value=0.68 Score=21.48 Aligned_cols=378 Identities=12% Similarity=0.057 Sum_probs=140.0 Q ss_pred CC--CC-------CCCccCHHHHH--HHHHHHHHHhhh-HHHHHHHHHHhhcccccCCCCCcccccc-cc-cccchHHHH Q lcl|NC_015159. 1 MA--EV-------EKTGFAADGAA--AAYNRLKNDRGA-YETRAEDCATYTIPSVFPSATADGSTSY-TT-PWQSIGARG 66 (532) Q Consensus 1 m~--~~-------~~~~~~~~~~~--~r~~~lk~~R~~-~e~~w~e~~~~~~P~~~~~~~~~~~~~~-~~-~~dst~~~a 66 (532) |- ++ +--...++..+ ..|..- .+|+. .-..|-...--++|.... ..+..-. .. +=.++--.| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lf~~~-e~R~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~al~~~~V~~c 76 (441) T protein:vir:94 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKN-EKRDLQYNEDDLQMMVQTLPGFQG---TKLRQYKDIEAIRHSDIFTA 76 (441) T ss_pred CccccCccccccccccccchhhhhcccccccc-ccccccCCCcchHHHHHHhcccCc---ccccccchhhhhccHHHHHH Confidence 11 00 00000111100 111111 11210 001111111111222111 1111000 01 112333446 Q ss_pred HHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHh Q lcl|NC_015159. 67 LNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLV 141 (532) Q Consensus 67 ~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~ 141 (532) ++.+|+.+.+. || ++--.. ... . +.-++..|+ +-| .+.-....+.++.. T Consensus 77 v~~Ia~~iA~l------p~-~~~~~~-~~~-----------~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll 130 (441) T protein:vir:94 77 VMMIASDLARM------PI-RVTVNG-QIN-----------Y-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALL 130 (441) T ss_pred HHHHHHhhccC------ce-eeecCc-ccc-----------c-------cchHHHHHhcccCcCCCHHHHHHHHHHHHhh Confidence 66666655542 33 332111 000 0 111222332 333 23445667788889 Q ss_pred hCceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEE Q lcl|NC_015159. 142 AGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHV 221 (532) Q Consensus 142 ~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v 221 (532) +|||.+++..+. .+....+..+|...+-+..|.+|++- T Consensus 131 ~Gnay~~i~r~~--~G~~~~L~~i~~~~v~v~~d~~g~~~---------------------------------------- 168 (441) T protein:vir:94 131 TSHGYIEITRDK--TGEPMNLTFRKTSEIELKSDARGRLY---------------------------------------- 168 (441) T ss_pred cCCeEEEEEECC--CCcEEEEEEEcCceeEEEECCCccEE---------------------------------------- Confidence 999988875431 23334455555566666777666421 Q ss_pred EeeCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_015159. 222 YRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVL 301 (532) Q Consensus 222 ~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~ 301 (532) ..++..++... .....+. .--+++.|+...+| .||.||.+.+...+.......+.......-...|. T Consensus 169 ---------~~~~~~~~~~~-~~~~~~~--~~dvih~k~~~~dg-~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~ 235 (441) T protein:vir:94 169 ---------YFHQRIDSNGN-NIERNVK--FEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAG 235 (441) T ss_pred ---------EEEEEeccCCc-eeEEEEc--cccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Confidence 11111111100 0001111 11235556554454 79999999998888888888888888777778888 Q ss_pred eeecCcccc-Chhhhc-------cCCC-----ceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh-cc Q lcl|NC_015159. 302 FFVNPNGVT-QIRRVA-------KANT-----GDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS-AV 366 (532) Q Consensus 302 ~lv~~~g~~-~~~~~~-------~~~~-----G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~ 366 (532) .++.-++.+ +.+... ..-. |.+.. -.+++...++.. ..+.|. .+.....+..|-++|-.-. +. T Consensus 236 gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~v-l~~G~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~l 313 (441) T protein:vir:94 236 GILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVV-LDESMTFDQLEVDTEVLKL-IRENKSSTREIAGVFGIPLHKF 313 (441) T ss_pred EEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCccee-cCCCceEEEccCChhHHHH-HHHHHHhHHHHHHHhCCCHHHc Confidence 766544443 333221 1001 11110 112223333332 223333 3445566677888884321 11 Q ss_pred cCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee--ecchHHHHHHHHHH Q lcl|NC_015159. 367 QRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI--ATGLEALGRGHDLN 444 (532) Q Consensus 367 ~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~--v~~l~~l~raq~~~ 444 (532) ..+....+.+|. .......|-|.+.+++.|+-.-| +++-.+-.++.+. +...+...|+.-.+ T Consensus 314 g~~~~~~s~~q~---~~~~~~tl~P~~~~ie~eln~kl-------------~~~~~~~~~~fd~~~llr~D~~~~~~~~~ 377 (441) T protein:vir:94 314 GIETANMSITDA---NLDYLSTLKPYITCVCAELNFKF-------------NDEYVNREFKFDTTEIRVVDEKTQAEIDK 377 (441) T ss_pred CCCCCCccHHHH---HHHHHHHHHHHHHHHHHHHhhhc-------------cccccCceEEeechhhhccCHHHHHHHHH Confidence 112222222322 11222344455555555444333 2221111111111 11113334443333 Q ss_pred HHHHH----HH---HHHhhcch---hh-------hhcCHHHHHHHHH----------HhcCCCHhH Q lcl|NC_015159. 445 KLNVF----ID---YMIKLAGL---QD-------DDINLLDVKMRLA----------NSLGMDTTG 483 (532) Q Consensus 445 ~l~~~----~~---~laq~~p~---~~-------d~id~d~~~~~~a----------~~~Gv~p~~ 483 (532) .+... .. .+-.+.|. .. ..+..+. ++.+- ..-| +-.+ T Consensus 378 ~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~-~~~~~~~~~~~~~~~~kgG-e~~e 441 (441) T protein:vir:94 378 INIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIEL-VDEYQMNKSRATDKKLKGG-EENE 441 (441) T ss_pred HHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccccccccc-ccccccccccccccccCCC-CCCC Confidence 22210 00 11122221 00 0111111 11000 0112 1111 No 186 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=47.95 E-value=0.68 Score=21.48 Aligned_cols=378 Identities=12% Similarity=0.057 Sum_probs=140.0 Q ss_pred CC--CC-------CCCccCHHHHH--HHHHHHHHHhhh-HHHHHHHHHHhhcccccCCCCCcccccc-cc-cccchHHHH Q lcl|NC_015159. 1 MA--EV-------EKTGFAADGAA--AAYNRLKNDRGA-YETRAEDCATYTIPSVFPSATADGSTSY-TT-PWQSIGARG 66 (532) Q Consensus 1 m~--~~-------~~~~~~~~~~~--~r~~~lk~~R~~-~e~~w~e~~~~~~P~~~~~~~~~~~~~~-~~-~~dst~~~a 66 (532) |- ++ +--...++..+ ..|..- .+|+. .-..|-...--++|.... ..+..-. .. +=.++--.| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lf~~~-e~R~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~al~~~~V~~c 76 (441) T protein:vir:79 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKN-EKRDLQYNEDDLQMMVQTLPGFQG---TKLRQYKDIEAIRHSDIFTA 76 (441) T ss_pred CccccCccccccccccccchhhhhcccccccc-ccccccCCCcchHHHHHHhcccCc---ccccccchhhhhccHHHHHH Confidence 11 00 00000111100 111111 11210 001111111111222111 1111000 01 112333446 Q ss_pred HHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHh Q lcl|NC_015159. 67 LNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLV 141 (532) Q Consensus 67 ~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~ 141 (532) ++.+|+.+.+. || ++--.. ... . +.-++..|+ +-| .+.-....+.++.. T Consensus 77 v~~Ia~~iA~l------p~-~~~~~~-~~~-----------~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll 130 (441) T protein:vir:79 77 VMMIASDLARM------PI-RVTVNG-QIN-----------Y-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALL 130 (441) T ss_pred HHHHHHhhccC------ce-eeecCc-ccc-----------c-------cchHHHHHhcccCcCCCHHHHHHHHHHHHhh Confidence 66666655542 33 332111 000 0 111222332 333 23445667788889 Q ss_pred hCceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEE Q lcl|NC_015159. 142 AGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHV 221 (532) Q Consensus 142 ~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v 221 (532) +|||.+++..+. .+....+..+|...+-+..|.+|++- T Consensus 131 ~Gnay~~i~r~~--~G~~~~L~~i~~~~v~v~~d~~g~~~---------------------------------------- 168 (441) T protein:vir:79 131 TSHGYIEITRDK--TGEPMNLTFRKTSEIELKSDARGRLY---------------------------------------- 168 (441) T ss_pred cCCeEEEEEECC--CCcEEEEEEEcCceeEEEECCCccEE---------------------------------------- Confidence 999988875431 23334455555566666777666421 Q ss_pred EeeCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_015159. 222 YRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVL 301 (532) Q Consensus 222 ~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~ 301 (532) ..++..++... .....+. .--+++.|+...+| .||.||.+.+...+.......+.......-...|. T Consensus 169 ---------~~~~~~~~~~~-~~~~~~~--~~dvih~k~~~~dg-~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~ 235 (441) T protein:vir:79 169 ---------YFHQRIDSNGN-NIERNVK--FEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAG 235 (441) T ss_pred ---------EEEEEeccCCc-eeEEEEc--cccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Confidence 11111111100 0001111 11235556554454 79999999998888888888888888777778888 Q ss_pred eeecCcccc-Chhhhc-------cCCC-----ceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh-cc Q lcl|NC_015159. 302 FFVNPNGVT-QIRRVA-------KANT-----GDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS-AV 366 (532) Q Consensus 302 ~lv~~~g~~-~~~~~~-------~~~~-----G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~ 366 (532) .++.-++.+ +.+... ..-. |.+.. -.+++...++.. ..+.|. .+.....+..|-++|-.-. +. T Consensus 236 gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~v-l~~G~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~l 313 (441) T protein:vir:79 236 GILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVV-LDESMTFDQLEVDTEVLKL-IRENKSSTREIAGVFGIPLHKF 313 (441) T ss_pred EEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCccee-cCCCceEEEccCChhHHHH-HHHHHHhHHHHHHHhCCCHHHc Confidence 766544443 333221 1001 11110 112223333332 223333 3445566677888884321 11 Q ss_pred cCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee--ecchHHHHHHHHHH Q lcl|NC_015159. 367 QRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI--ATGLEALGRGHDLN 444 (532) Q Consensus 367 ~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~--v~~l~~l~raq~~~ 444 (532) ..+....+.+|. .......|-|.+.+++.|+-.-| +++-.+-.++.+. +...+...|+.-.+ T Consensus 314 g~~~~~~s~~q~---~~~~~~tl~P~~~~ie~eln~kl-------------~~~~~~~~~~fd~~~llr~D~~~~~~~~~ 377 (441) T protein:vir:79 314 GIETANMSITDA---NLDYLSTLKPYITCVCAELNFKF-------------NDEYVNREFKFDTTEIRVVDEKTQAEIDK 377 (441) T ss_pred CCCCCCccHHHH---HHHHHHHHHHHHHHHHHHHhhhc-------------cccccCceEEeechhhhccCHHHHHHHHH Confidence 112222222322 11222344455555555444333 2221111111111 11113334443333 Q ss_pred HHHHH----HH---HHHhhcch---hh-------hhcCHHHHHHHHH----------HhcCCCHhH Q lcl|NC_015159. 445 KLNVF----ID---YMIKLAGL---QD-------DDINLLDVKMRLA----------NSLGMDTTG 483 (532) Q Consensus 445 ~l~~~----~~---~laq~~p~---~~-------d~id~d~~~~~~a----------~~~Gv~p~~ 483 (532) .+... .. .+-.+.|. .. ..+..+. ++.+- ..-| +-.+ T Consensus 378 ~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~-~~~~~~~~~~~~~~~~kgG-e~~e 441 (441) T protein:vir:79 378 INIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIEL-VDEYQMNKSRATDKKLKGG-EENE 441 (441) T ss_pred HHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccccccccc-ccccccccccccccccCCC-CCCC Confidence 22210 00 11122221 00 0111111 11000 0112 1111 No 187 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=44.78 E-value=0.79 Score=21.13 Aligned_cols=359 Identities=10% Similarity=0.010 Sum_probs=132.9 Q ss_pred HHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccc-ccccc-chHHHHHHHHHHHHHHhhcCCCCCccccCCChH Q lcl|NC_015159. 16 AAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSY-TTPWQ-SIGARGLNNLASKLMLALFPVGSSFFKLNVSEL 93 (532) Q Consensus 16 ~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~-~~~~d-st~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~ 93 (532) --|++....+.. +..+ ....|+.+..+...+..+..-. ..... ++--.|++.+|+.+. +-||.-..-.+. T Consensus 1 ~~~~r~~~~~~~-~~~~-~~~~~~~~~~g~~~s~~~~~vt~~~al~~~~v~~~v~~ia~~iA------~lp~~~~~~~~~ 72 (419) T protein:vir:14 1 MFFSRQLLSNLG-QTQM-SAGGWVSALLGSSRSDSGQVVTPASALALTVLQNCVTLLAESIA------QLPIELYERSGE 72 (419) T ss_pred Cccccccccccc-cccc-CcchhhHHhhcCCCccCCcccchHHhhccHHHHHHHHHHHHhhc------cCceEEEEecCC Confidence 111111000000 0000 0001111111111111111100 11222 222334444444332 334532222211 Q ss_pred HHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecc Q lcl|NC_015159. 94 EVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLH 168 (532) Q Consensus 94 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~ 168 (532) .-.+ + .+.-+...|. +-| .+.-....+.++..+||+++|+..+. .+....+..++.+ T Consensus 73 ~~~~----------~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~--~G~~~~l~pl~~~ 134 (419) T protein:vir:14 73 DRKP----------A------TDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDS--DGVIQGLYPLDNE 134 (419) T ss_pred cccc----------c------cccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC--CCcEEEEEEecCc Confidence 1110 0 0111122332 222 33334555788889999999886432 1233334444445 Q ss_pred eEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcccccccccC Q lcl|NC_015159. 169 NFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEY 248 (532) Q Consensus 169 ~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~ 248 (532) .+.+..|.+|++ +|+ +. +... . T Consensus 135 ~v~v~~~~~~~~--~y~-~~-------------------------------------------------~~~~------~ 156 (419) T protein:vir:14 135 AVTVMRGSDLKP--VYR-VR-------------------------------------------------GSDP------M 156 (419) T ss_pred eEEEEECCCceE--EEE-Ec-------------------------------------------------cCcc------c Confidence 566666665532 111 00 0000 0 Q ss_pred ccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCcccc----Chhhh---cc---- Q lcl|NC_015159. 249 PLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVT----QIRRV---AK---- 317 (532) Q Consensus 249 g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~----~~~~~---~~---- 317 (532) ..+ =+++.|+...+| .||.||..-+...+.....+.+.......-...|..++.-++.. +.+.. .. T Consensus 157 ~~~--~i~h~~~~~~dg-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~ 233 (419) T protein:vir:14 157 PQR--LVHHVRWMSING-YTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNA 233 (419) T ss_pred chh--heeEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHH Confidence 000 023344443444 89999999999999988888888888888888887666433222 22221 10 Q ss_pred ---C-CC-ceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhhccc---CCCCCCCHHHHHHHHHHHHHH Q lcl|NC_015159. 318 ---A-NT-GDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNSAVQ---RGGDRVTAEEIRYVAGELEDT 388 (532) Q Consensus 318 ---~-~~-G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~---~~~~~~TAtEi~~r~~E~~~~ 388 (532) + .+ |.+.. -.++....++.. ..+.+.+ +..+..+..|-++|-...... ..+..-+++|.. T Consensus 234 ~~~g~~nag~~~v-l~~g~~~~~l~~~~~d~q~~-e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~--------- 302 (419) T protein:vir:14 234 KFGGSGNAKKVAL-LQEGMTFRPLSMTNVDAALI-DALRLSALDIARIYKIPAHMVNELERATFSNIEHQS--------- 302 (419) T ss_pred HhcCccccCCcee-cCCCceEEEccCChhhHHHH-HHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHH--------- Confidence 1 11 11111 112233333332 2344433 334455678888874321111 112211222221 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee-ecch---HHHHHHHHHHHHHHH----HHH---HHhhc Q lcl|NC_015159. 389 LGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI-ATGL---EALGRGHDLNKLNVF----IDY---MIKLA 457 (532) Q Consensus 389 LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~-v~~l---~~l~raq~~~~l~~~----~~~---laq~~ 457 (532) ..+...-|.|++.+.-..+.+. +|++-......+.+ +..+ +...|+.-.+.+... ... +-.+. T Consensus 303 -----~~f~~~~L~P~~~~ie~~l~~k-ll~~~~~~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~ 376 (419) T protein:vir:14 303 -----LQFVIYTLLPWVKRHEQAKTRD-LLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMP 376 (419) T ss_pred -----HHHHHHHHHHHHHHHHHHHhhh-ccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 1233444556655554444332 44433322222332 1111 233333332222110 000 00101 Q ss_pred ch----------------hh------hhcCHHHHHHHHHHhcC Q lcl|NC_015159. 458 GL----------------QD------DDINLLDVKMRLANSLG 478 (532) Q Consensus 458 p~----------------~~------d~id~d~~~~~~a~~~G 478 (532) |. .. ..=....-.+++.+.+. T Consensus 377 p~~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~ 419 (419) T protein:vir:14 377 PVKGGDIYLSPMNMVDASKPQQLPVGKSEPTKAAIDEIGRILS 419 (419) T ss_pred CCCCcCeeeeccccccccccccccCCCCCCccccccchhcccC Confidence 10 00 00011111222222222 No 188 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=43.57 E-value=0.84 Score=21.00 Aligned_cols=347 Identities=9% Similarity=-0.034 Sum_probs=131.8 Q ss_pred cccCCCCCcccccccccccchHHHHHHHH----------------------HHHHHHhhcCCCCCccccCCChHHHhhhc Q lcl|NC_015159. 42 SVFPSATADGSTSYTTPWQSIGARGLNNL----------------------ASKLMLALFPVGSSFFKLNVSELEVKQSI 99 (532) Q Consensus 42 ~~~~~~~~~~~~~~~~~~dst~~~a~~~L----------------------aa~l~~~ltpp~~~WF~l~~~d~~~~~~~ 99 (532) +. +....+++.....++.- ....-+ |=.+.+..+. +-||--..-.+.. T Consensus 1 m~--f~~~~~~~~~~~~~~~~--~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia-~lp~~~~~~~~~~----- 70 (409) T protein:vir:10 1 ML--FRKGFKNQSQEISIDDK--KILEWLGINPSETYVNGKSCLKQATVFGCIRILSDNIS-KLPIKIYQKKDGI----- 70 (409) T ss_pred Cc--ccccccCcCCCCCCChH--HHHHHhcCCcCcceechhhhhccHHHHHHHHHHHHhhh-hCceEEEEecCCe----- Confidence 22 22221111111111110 000000 0011111121 1121100000000 Q ss_pred cChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecceEEEee Q lcl|NC_015159. 100 TSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVER 174 (532) Q Consensus 100 ~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~ 174 (532) ..+ .+..+...|. +-| .+.-+...+.++..+||+.+|+..+. .+....+..+|....-+.. T Consensus 71 ------~~~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~--~G~~~~L~~i~~~~V~v~~ 136 (409) T protein:vir:10 71 ------KRV------PDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKK--NGEIKGLYPLKSDGMKIFV 136 (409) T ss_pred ------eec------cCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC--CCcEEEEEEEcCCceEEEE Confidence 000 0111223333 223 33345566778888999988875432 1233333334434444444 Q ss_pred CCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcccccccccCccccCc Q lcl|NC_015159. 175 DAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCP 254 (532) Q Consensus 175 d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P 254 (532) |..|....- ..+ .|+.. ... |... .+ ...= T Consensus 137 ~~~~~~~~~-----------------------------~~~-~y~~~--~~~-----------g~~~-----~~--~~~e 166 (409) T protein:vir:10 137 DDTGLLNSE-----------------------------NNV-WYLYT--DDL-----------GQRH-----KF--MSDE 166 (409) T ss_pred cCCcccccc-----------------------------ceE-EEEEE--eCC-----------ceeE-----Ee--cccc Confidence 444432210 000 00000 001 1100 01 1122 Q ss_pred eEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccC-----------CCcee Q lcl|NC_015159. 255 WIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKA-----------NTGDF 323 (532) Q Consensus 255 ~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~-----------~~G~~ 323 (532) +++.|.... +..||.||...+...+.....+.+.......-...|..++.-++.++++..... .++.- T Consensus 167 vih~r~~~~-d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~ 245 (409) T protein:vir:10 167 ILHFKGLTA-DGLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAGDLNPEAEEVFKENFERMSSGLKNAHR 245 (409) T ss_pred EEEecCcCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCHHHHHHHHHHHHHHhccccccCC Confidence 355554433 348999999998888888888888888888888888877776666665443211 11110 Q ss_pred ecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--cc-cCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHH Q lcl|NC_015159. 324 VAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--AV-QRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQE 399 (532) Q Consensus 324 v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E 399 (532) +.--.++....++.. ..+.+. .+..+..+..|-.+|-.-. +. ..++..-++++... .+... T Consensus 246 ~~vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~~--------------~f~~~ 310 (409) T protein:vir:10 246 IAMLPIGYKFEPISQKLVDAQF-LENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQNR--------------EFYID 310 (409) T ss_pred ceecCCCceEEEccCChhhHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHHH--------------HHHHH Confidence 111112233333332 334554 3555667788888884321 11 11222333333221 13344 Q ss_pred HHHHHHHHHHHHHHhcCCCCCCc-ccccccee-ecc---hHHHHHHHHHHHHHHH----H---HHHHhhcch-hhhh--- Q lcl|NC_015159. 400 LQLPLVKILLKELQATSKIPNLP-KEAVEPAI-ATG---LEALGRGHDLNKLNVF----I---DYMIKLAGL-QDDD--- 463 (532) Q Consensus 400 ~l~Pli~r~~~il~r~g~lp~~p-~~~~~~~~-v~~---l~~l~raq~~~~l~~~----~---~~laq~~p~-~~d~--- 463 (532) -|.|++.++...+.+. +|++.. .....+.+ ... .+...|+.-.+.+... . -.+-.+.|. -.|. T Consensus 311 ~l~P~~~~ie~~ln~k-L~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~ggD~~~~ 389 (409) T protein:vir:10 311 TLQSILNMYELEINYK-LFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEGGDVLLI 389 (409) T ss_pred HHHHHHHHHHHHHHHh-hcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeee Confidence 4556666555544332 333211 11112222 111 1233343333322210 0 011122221 1121 Q ss_pred -cC---HHHHHHHHHHhcCCCH Q lcl|NC_015159. 464 -IN---LLDVKMRLANSLGMDT 481 (532) Q Consensus 464 -id---~d~~~~~~a~~~Gv~p 481 (532) .| .+.+-+....+ |= - T Consensus 390 ~~n~~~~~~~~~~~~kg-Ge-~ 409 (409) T protein:vir:10 390 NGNMIPVKMAGEQYSKG-GE-K 409 (409) T ss_pred ccCccchhhcccccccc-CC-C Confidence 11 11111111111 32 1 No 189 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=42.87 E-value=0.87 Score=20.92 Aligned_cols=374 Identities=11% Similarity=0.039 Sum_probs=138.9 Q ss_pred CC--CC-------CCCccCHHHHH--HHHHHHHHHhh-----hHHHHHHHHHHhhcccccCCCCCccccc-ccc-cccch Q lcl|NC_015159. 1 MA--EV-------EKTGFAADGAA--AAYNRLKNDRG-----AYETRAEDCATYTIPSVFPSATADGSTS-YTT-PWQSI 62 (532) Q Consensus 1 m~--~~-------~~~~~~~~~~~--~r~~~lk~~R~-----~~e~~w~e~~~~~~P~~~~~~~~~~~~~-~~~-~~dst 62 (532) |- +| +--+..+++.. ..|..- .+|+ .+...|-. .+|.... ..+..- ... +=.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~-e~r~~~~~~~~~~~~~~----~~~~~~~---~~~~~~~~~~al~~~~ 72 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKN-EKRDLQYNEDDLQMMVQ----TLPGFQG---TKLRQYKDIEAIRHSD 72 (441) T ss_pred CceecCccceeccccccchhhhhhcccccccc-ccccccCCCcchHHHHH----Hhhcccc---cCccccchhhhhccHH Confidence 21 11 00000111111 111100 0121 12222211 2232211 111110 001 12233 Q ss_pred HHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHH Q lcl|NC_015159. 63 GARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIK 137 (532) Q Consensus 63 ~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~ 137 (532) --.|++.+|+.+.+ + | |++.-.. ... . +.-++..|. +-| .+.-...++. T Consensus 73 V~acv~~Ia~~iA~-l-----p-l~~~~~~-~~~-----------~-------~~~~~~lL~~~PN~~~t~~~f~~~l~~ 126 (441) T protein:vir:98 73 IFTAVMMIASDLAR-M-----P-IRVTVNG-QIN-----------Y-------SDRIVNLLNTRPNPMYNGYIFKLVVFV 126 (441) T ss_pred HHHHHHHHHHhhcc-C-----c-eEEecCC-ccc-----------c-------cchHHHHHhcccccCCCHHHHHHHHHH Confidence 33456666655543 2 2 3332111 000 0 111223332 223 3344556678 Q ss_pred HHHhhCceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEE Q lcl|NC_015159. 138 QLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTI 217 (532) Q Consensus 138 dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i 217 (532) ++..+|||.+++..+. .+....+..+|.+.+.+..|.+|++-- T Consensus 127 ~lll~Gnay~~i~r~~--~G~~~~L~~i~~~~v~v~~~~~g~~~~----------------------------------- 169 (441) T protein:vir:98 127 SALLTSHGYIEITRDK--TGEPMNLTFRKTSEIELKLDARGRLYY----------------------------------- 169 (441) T ss_pred HHhhcCCeEEEEEEcC--CCcEEEEEEEcCceeEEEECCCCcEEE----------------------------------- Confidence 8889999988875432 233444555555677777777775311 Q ss_pred EEEEEeeCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 218 YTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMIS 297 (532) Q Consensus 218 ~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a 297 (532) .++.+++... +....+. .--+++.|+...+| .||.||...+...+...+.+.+.......-. T Consensus 170 --------------~~~~~~~~~~-~~~~~~~--~~dviHir~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng 231 (441) T protein:vir:98 170 --------------FHQRIDSNGN-NIERNVK--FEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNG 231 (441) T ss_pred --------------EEEEeccCcc-eeeEEEc--cccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 1111111100 0000111 11234555544444 7999999999888888888888877777777 Q ss_pred hcCceeecCccc-cChhhhccC-------CCc-----eeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015159. 298 SKVLFFVNPNGV-TQIRRVAKA-------NTG-----DFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLN 363 (532) Q Consensus 298 ~~p~~lv~~~g~-~~~~~~~~~-------~~G-----~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~ 363 (532) ..|..++.-++. .+.+..... -.| .+. --.+++...++.. ..+.+. .+.....+..|-++|-.- T Consensus 232 ~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~G~~nag~~~-vl~~g~~~~~l~~~~~d~q~-~e~r~~~~~~Ia~~fgVP 309 (441) T protein:vir:98 232 THAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVV-VLDESMTFDQLEVDTEVLKL-IRENKSSTREIAGVFGIP 309 (441) T ss_pred CCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCcce-ecCCCceEEEccCChhHHHH-HHHHHHhHHHHHHHhCCC Confidence 778766654433 333322110 011 111 0112223333332 223332 344455566788888432 Q ss_pred h-cccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee--ecchHHHHHH Q lcl|NC_015159. 364 S-AVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI--ATGLEALGRG 440 (532) Q Consensus 364 ~-~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~--v~~l~~l~ra 440 (532) . +...+....+.+|. .......|-|.+.+++.|+-.- ++++-.+-.++.+. +...+...|+ T Consensus 310 p~~lg~~~~~~s~~q~---~~~y~~tl~P~~~~ie~~ln~~-------------L~~~~~~~~~~fd~~~llr~d~~~~~ 373 (441) T protein:vir:98 310 LHKFGIETANMSITDA---NLDYLSTLKPYITCVCAELNFK-------------FNDEYVNREFKFDTTEIRVVDEKTQA 373 (441) T ss_pred HHHcCCCCCCccHHHH---HHHHHHHHHHHHHHHHHHHHhh-------------ccccccCceEEEechhhhccCHHHHH Confidence 1 11112222232332 1122234445555554444322 22221111111111 1111233333 Q ss_pred HHHHHHHHH----HH---HHHhhcchh----------hhhcCHHHHHHHHH----------HhcCCCHhH Q lcl|NC_015159. 441 HDLNKLNVF----ID---YMIKLAGLQ----------DDDINLLDVKMRLA----------NSLGMDTTG 483 (532) Q Consensus 441 q~~~~l~~~----~~---~laq~~p~~----------~d~id~d~~~~~~a----------~~~Gv~p~~ 483 (532) .-.+.+... .. .+-.+.|.. ...++.+. ++.+- ..-| +-.+ T Consensus 374 ~~~~~~~~~G~~T~NE~R~~~gl~pi~gGd~~~~~~~~n~~~~~~-~~~~q~~~~~~~~~~~kgG-e~ne 441 (441) T protein:vir:98 374 EIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIEL-VDEYQMNKSRATDKKLKGG-EENE 441 (441) T ss_pred HHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccccccccc-ccccccccccccccccCCC-CCCC Confidence 333222210 00 111222210 00111111 00000 0112 1112 No 190 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=42.85 E-value=0.87 Score=20.92 Aligned_cols=415 Identities=10% Similarity=0.063 Sum_probs=159.1 Q ss_pred CCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHH-HhhcccccCCC-CCccccc-cccc-ccchHHHHHHHHHHHHHHhhc Q lcl|NC_015159. 4 VEKTGFAADGAAAAYNRLKNDRGAYETRAEDCA-TYTIPSVFPSA-TADGSTS-YTTP-WQSIGARGLNNLASKLMLALF 79 (532) Q Consensus 4 ~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~-~~~~P~~~~~~-~~~~~~~-~~~~-~dst~~~a~~~Laa~l~~~lt 79 (532) .+|+.+..+- . -+..+...|+.+. .++.|....-. ...+.-+ ...+ -|++-.-++++....+. T Consensus 1 v~~~~l~~e~-a--------t~~~~~d~~~~~~~~l~~~~~~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk~av~---- 67 (488) T protein:vir:99 1 MEKPALGREI-A--------TSGDGRDITRPFISGLQVPNDSILQRRGGNDLRVYEEILSDAQVKTVWGQRQLAVV---- 67 (488) T ss_pred CCccchhHHH-H--------HHHhhhhhhccccCCCCCCChHHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHHh---- Confidence 4443332221 1 1222222232221 12222211000 0111100 0111 15555555555555544 Q ss_pred CCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCCc Q lcl|NC_015159. 80 PVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQS 159 (532) Q Consensus 80 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~ 159 (532) +.+|- +.+.+.... + .++ ...+.+.|++.+|...+.+++ +-+.+|-++.=+.. T Consensus 68 --~~~w~-i~p~~~~~~-------~-~~~-------ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w-------- 120 (488) T protein:vir:99 68 --SREWK-VEAGGDRPI-------D-QAA-------AEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIY-------- 120 (488) T ss_pred --cCCce-EEcCCCChH-------H-HHH-------HHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEE-------- Confidence 34664 333321100 0 111 222334556678887777766 55678877652211 Q ss_pred ceEEEEecceEEEeeCCCCCe--EEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEc Q lcl|NC_015159. 160 NAPKLYKLHNFVVERDAYDNV--LQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEID 237 (532) Q Consensus 160 ~~~~~~pl~~~~v~~d~~G~v--d~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~ 237 (532) ..+ .|.+ ..+..+- ..+ +..++++.. . +.-. T Consensus 121 -------------~~~-~g~~~~~~l~~r~------------------------~~~------f~~d~~~~l-~--~~~~ 153 (488) T protein:vir:99 121 -------------GRD-DRYITLEAIKVRN------------------------RRR------FRYDQDGGL-R--LLTP 153 (488) T ss_pred -------------eec-CCeeeEeeeeeec------------------------ccc------eeecCCCce-E--Eecc Confidence 111 1111 1111000 000 001111110 0 0000 Q ss_pred CcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCc--cccChh-- Q lcl|NC_015159. 238 GEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPN--GVTQIR-- 313 (532) Q Consensus 238 ~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~--g~~~~~-- 313 (532) +....+. ..+ ..+=|++.+....+|+.||.|....+..-..--+...+..+..+++---|..+..-+ +-..-+ T Consensus 154 ~~~~~g~--~lp-~~~~~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~~ek~ 230 (488) T protein:vir:99 154 NNMFEGE--PCP-APYFWHFSTGADNDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATPEDKA 230 (488) T ss_pred CCCCCcc--ccc-cCceEEEEeecCCCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCHHHHH Confidence 0000000 000 011268888899999999999999999988777888999999999887787666522 222211 Q ss_pred h----hc--cCCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHhhhhcccCC-CCCCCHHHHHHHHHHHH Q lcl|NC_015159. 314 R----VA--KANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRG-GDRVTAEEIRYVAGELE 386 (532) Q Consensus 314 ~----~~--~~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~-~~~~TAtEi~~r~~E~~ 386 (532) . +. ....+.++|.+ ..+..+.... +.-..-...++.+.++|+++++...+...+ +......|++.... . T Consensus 231 ~l~~av~~~~~~~~~viP~~-~~ie~~ea~~-~~~~~~~~li~~~d~~Isk~iLGqtlts~~~~Gs~a~~~vh~~v~--~ 306 (488) T protein:vir:99 231 KLLAALHAIQTDSAIIMPAG-MQAELLEAGR-SGTADYKTLHDTMDATIAKVGLGQVASTQGTPGRLGNDDLQADVR--L 306 (488) T ss_pred HHHHHHHHHhcCcEEEecCC-ceeEEeecCC-CChHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHHHH--H Confidence 1 11 12234455543 3466665433 333445678999999999998865554322 22344555554332 2 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCH Q lcl|NC_015159. 387 DTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINL 466 (532) Q Consensus 387 ~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~ 466 (532) .++-.-...+...+..=||..++.+-.-.... |. +..........-.++.- +..+.++.+- .++ T Consensus 307 d~~~aDa~~i~~tln~~li~~l~~~N~~~~~~---p~--~~~~~~e~edl~~~a~~-------~~~l~~~~G~---~i~- 370 (488) T protein:vir:99 307 DLVKADADLICESFNLGPARWLTEWNFPGAQP---PR--VYRVIEEPEDITAKAER-------DEKVFRMSGF---RPT- 370 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCcCCcCC---ce--eEecCCCcccHHHHHHH-------HHHHHhhcCC---CCC- Confidence 22222222233322233333333332111111 11 11111122122222222 2222222110 122 Q ss_pred HHHHHHHHHhcCCCHhHcc-----CC-----------HHHHHHHHHHHHH--HHH----HH---HHHHhhhHHHHHHHHh Q lcl|NC_015159. 467 LDVKMRLANSLGMDTTGLI-----LT-----------QQDKQAKMAEAST--AAG----MV---TAGQQMGAAGGQAAAA 521 (532) Q Consensus 467 d~~~~~~a~~~Gv~p~~i~-----~s-----------~ee~~~~~~q~~~--~~~----~~---~~~~~~~~~~~~~~~~ 521 (532) .+++.+.+|+|+...- .. .+..++...+... +.. .. .+.+...... ..... T Consensus 371 ---~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~s~e-e~~~~ 446 (488) T protein:vir:99 371 ---RGYVQETYGVEVESTQAEATAPTPSTEFAEGDQPSDPAAAMAPQLAEAMQPVVGNWTTQLRTLIEQASSLE-DLRER 446 (488) T ss_pred ---HHHHHHHcCCCCcccccccccCCCcccCCCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHH-HHHHH Confidence 2244455666432110 00 0011111111000 000 00 0111111110 00111 Q ss_pred hcccccCCCCC Q lcl|NC_015159. 522 MMQQQAGLPTQ 532 (532) Q Consensus 522 ~~~~~~g~~~~ 532 (532) +..-...|.+. T Consensus 447 L~~l~~~~d~~ 457 (488) T protein:vir:99 447 LLDLAPQLSLD 457 (488) T ss_pred HHHHhccCCHH Confidence 11111122222 No 191 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=42.47 E-value=0.88 Score=20.88 Aligned_cols=433 Identities=10% Similarity=0.079 Sum_probs=151.9 Q ss_pred CCCC-CCCccCHHHHHHHHHHHHHHhhhH--HHHHHHHHHhhcccccCCCCCccccccc-------------ccc-cc-h Q lcl|NC_015159. 1 MAEV-EKTGFAADGAAAAYNRLKNDRGAY--ETRAEDCATYTIPSVFPSATADGSTSYT-------------TPW-QS-I 62 (532) Q Consensus 1 m~~~-~~~~~~~~~~~~r~~~lk~~R~~~--e~~w~e~~~~~~P~~~~~~~~~~~~~~~-------------~~~-ds-t 62 (532) =+.+ +...+ .+.+..++..++..-... ...-++ ..|+-|.++.-.+.++..+.+ +.| ++ + T Consensus 18 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-~a~~~p~~~~~~~~~~~~~~p~~~~~~~~~~~~l~~~~~npi 95 (576) T protein:vir:96 18 YEDIIDTVPI-DDGLQANIRNIEEKSKELNKSLYGKQ-QAYAEPFLEVMDTNPEFRTKRSYMKNSDNLHDVLKQFGNNPI 95 (576) T ss_pred cccchhhhhc-ccChhHHHHHhhhhhhhhccccCCcc-chhhcceeeeeecCCCccccCcchhhhhhhHHHHHHhhcCHH Confidence 0111 00111 123445555553310000 001112 223344321111211111111 111 11 1 Q ss_pred HHHHHHHHHHHHHHhhcCCC----CCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-----hcCChHHHH Q lcl|NC_015159. 63 GARGLNNLASKLMLALFPVG----SSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-----SNSFRPTLH 133 (532) Q Consensus 63 ~~~a~~~Laa~l~~~ltpp~----~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-----~snf~~~~~ 133 (532) .-.|++.+|..+...-.|.. -.-|.+.+.+.+... ... +... +..+++.+...+. ..+|+.-+. T Consensus 96 v~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~---~~~---~~~~-~~~l~~~l~~~~~~~~p~~~t~~~f~~ 168 (576) T protein:vir:96 96 LNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEP---GKK---EKEE-IKRIENFILNTGRDKDIDRDSFQSFCR 168 (576) T ss_pred HHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCcc---chh---hhHh-hhhHHhhHhhccCCCCCccccHHHHHH Confidence 23456666655543221211 111222222211110 001 1111 1122223322222 235666777 Q ss_pred HHHHHHHhhCceeeeecccccccCCcceEEEEec--ceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCC Q lcl|NC_015159. 134 AAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKL--HNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNP 211 (532) Q Consensus 134 ~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl--~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~ 211 (532) .++.|+..+|||..|+..+. .+++.....+|| ..+.+..+.+|.+-.-.. T Consensus 169 ~lv~dlll~Gna~~~i~~~r--d~~g~~~~L~pl~p~~V~v~~~~dg~~~~~~~-------------------------- 220 (576) T protein:vir:96 169 KIVRDTYTYDQVNFEKVFNK--KNATTMDKFIAVDPSTIFYATDKNGKIIKGGK-------------------------- 220 (576) T ss_pred HHHHHHHhcCCeEEEEEEec--CCCCceEEEEEeCCceeEEEECCCCceeeeee-------------------------- Confidence 78889999999988865321 223333444444 556666666664321111 Q ss_pred cceEEEEEEEEeeCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecC---CCccccchHHHHHHHHHHHHHHHH Q lcl|NC_015159. 212 SEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMP---NEDYGRSFVEEYLGDLKSLENLYE 288 (532) Q Consensus 212 ~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~---g~~YG~Gp~~~al~d~~~L~~l~~ 288 (532) ..++..++.... .+..++ .+.++..... ...||.+|...+...+.....+.+ T Consensus 221 -------------------~~~~~~~~~~~~----~~~~~d--ii~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~ 275 (576) T protein:vir:96 221 -------------------RFVQVINKKVVA----SFTSRE--MAMGIRNPRTELSSSGYGLSEVEIAMKQFIAYNNTET 275 (576) T ss_pred -------------------EEEEecCCceEE----Eecccc--eEEEeecCCCCcccCcccccHHHHHHHHHHHHHHHHH Confidence 111111221110 000011 1222222222 256999999999988888888888 Q ss_pred HHHHHHHHHhcCceee--cCccccChhhhc----------cC-CC-ceeecCccccccccccCC-ccchhHHHHHHHHHH Q lcl|NC_015159. 289 AIVKMSMISSKVLFFV--NPNGVTQIRRVA----------KA-NT-GDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIE 353 (532) Q Consensus 289 ~~l~~~~~a~~p~~lv--~~~g~~~~~~~~----------~~-~~-G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~ 353 (532) .......-...|..++ +.+..++.+... .+ .+ |.+..-..+++...++.. ..+.+ ..+..+..+ T Consensus 276 ~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~ls~~~~d~q-fle~~~~~~ 354 (576) T protein:vir:96 276 FNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNMTPTANDMQ-FEKWLTYLI 354 (576) T ss_pred HHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEeccCChhhHH-HHHHHHHhH Confidence 8887777777777443 333334443221 11 11 111111223344444443 23343 344556778 Q ss_pred HHHHHHHhhhh--cccCCCC---------CC---CHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCC Q lcl|NC_015159. 354 KRLSYAFMLNS--AVQRGGD---------RV---TAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIP 419 (532) Q Consensus 354 ~rI~~af~~~~--~~~~~~~---------~~---TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp 419 (532) ..|-++|-... +...+.. .+ ++++... .=....|.|.+.+++.+|-.-|+ | T Consensus 355 ~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~--~f~~~tL~P~~~~ie~~ln~~Ll-------------~ 419 (576) T protein:vir:96 355 NIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQ--QSQNKGLQPLLRFIEDLINTHII-------------S 419 (576) T ss_pred HHHHHHhCCCHHHccccccccccccccccccccccHHHHHH--HHHHHHHHHHHHHHHHHHHhhhc-------------h Confidence 88888884321 1111111 11 2222211 22223455666666555544433 2 Q ss_pred CCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhHccCCHHHH-HHHHHHH Q lcl|NC_015159. 420 NLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDK-QAKMAEA 498 (532) Q Consensus 420 ~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~i~~s~ee~-~~~~~q~ 498 (532) .. ...+...+. ......++...+...... +. .+-.++ +-+.+|.+|- ..-|+. .....+. T Consensus 420 ~~-~~~~~~~f~-r~d~~~~~e~~~~~~~~~---~G-------~lT~NE----~R~~~gl~pi---egGD~~~~~~~~~~ 480 (576) T protein:vir:96 420 EY-SDKYVFQFV-GGDTKSELDKIKILQEEV---KT-------YKTVNE----ARKEKGLKPI---EGGDVLLDGSFIQS 480 (576) T ss_pred hc-cCceEEEec-cCCHHHHHHHHHHHHHHh---cC-------ccCHHH----HHHHhCCCCC---CCcceecccccccc Confidence 21 111222221 123333333222111110 00 112222 1222454431 110100 0000000 Q ss_pred HHHHHHHHHHHhhhHHHHHHHHhhcc------cccCCCC--C Q lcl|NC_015159. 499 STAAGMVTAGQQMGAAGGQAAAAMMQ------QQAGLPT--Q 532 (532) Q Consensus 499 ~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~g~~~--~ 532 (532) . .+..+. .+......+........ ...+.++ + T Consensus 481 ~-~~~~~~-~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~ 520 (576) T protein:vir:96 481 M-SLNTQK-EQYEDTKQKERFDMIQQFLNSPDDEEPQQESTE 520 (576) T ss_pred c-cccccC-CCCCCccccccccccccccCCCCCCCCCCCCCC Confidence 0 000000 00000000000000000 1111111 1 No 192 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=41.93 E-value=0.91 Score=20.82 Aligned_cols=315 Identities=10% Similarity=0.034 Sum_probs=119.5 Q ss_pred HHHhh-cccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHH Q lcl|NC_015159. 35 CATYT-IPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLA 113 (532) Q Consensus 35 ~~~~~-~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~ 113 (532) ..++. .|........+.+...=.-=++++...++ .+--+. -.+..|+.--++-..|.+.......+..+ |. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----y~~~~~-~~~~~~~epp~~~~~la~~~~~~~~h~~~---i~ 72 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLSEITASPALD----YVGIGF-DENYNCYLPPVNRHALAKLPHQNAQHGGI---LH 72 (345) T ss_pred CCccccccchhhhcCCCceEEEeecCCcccchhhc----ccceee-ecCCccccCCCCHHHHHHHhhcchhhcch---hh Confidence 00000 00000000001000000000233221111 111111 12445665444333333332211111110 00 Q ss_pred HHHHHHHHHHHhcCC-------hHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEE Q lcl|NC_015159. 114 MVERICMNYMESNSF-------RPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTE 186 (532) Q Consensus 114 ~ve~~~~~~l~~snf-------~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk 186 (532) + .+..-.++| ...+.++..|+.+||||.+++..+. .|-..+.+|+--.++.+..+|...-.++ T Consensus 73 -~----k~n~l~~~~~Pn~~~t~~~f~~~v~d~ll~Gnay~~i~rn~----~G~~~~L~pl~~~~vr~~~d~~~~~~~~- 142 (345) T protein:vir:37 73 -S----RANMVSATYEGGKALSKMEMRALCLNLIQFGDVGLLKVRNG----FGQVVRLVPLSSLYLRVHKDGGYSYLMK- 142 (345) T ss_pred -h----hhhHHhhccCCCCCCCHHHHHHHHHHHHhcCCeEEEEEECC----CCCEEEEEEecCceeEEeecCCeeEEEe- Confidence 0 001111122 2445567778889999988775331 2333444454333333222221110000 Q ss_pred EeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCCC Q lcl|NC_015159. 187 DKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNE 266 (532) Q Consensus 187 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~ 266 (532) .+....+.. .+.|..--++..|.....+. T Consensus 143 ---------------------------------------------~~~~~~~g~------~~~~~~~eViHir~~~~~~~ 171 (345) T protein:vir:37 143 ---------------------------------------------KSLYDTAQE------IYRYDAKDIIFIKLYDPMQQ 171 (345) T ss_pred ---------------------------------------------eeeeccCce------EEEEccccEEEEcCCCCCCC Confidence 000000000 00011112344443333467 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee-cCccccChhhhc-------c---CCCc--eee--cC-cccc Q lcl|NC_015159. 267 DYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFV-NPNGVTQIRRVA-------K---ANTG--DFV--AG-RKQD 330 (532) Q Consensus 267 ~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv-~~~g~~~~~~~~-------~---~~~G--~~v--~g-~~~~ 330 (532) .||.+|..-++-.+-.-+..++-..+.-.-...|..++ .++..++.++.. . ++++ -++ ++ ..++ T Consensus 172 ~~Gl~~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~Il~~t~~~l~~e~~~~lk~~~~~~~g~~n~~~~~i~~~~g~~~G 251 (345) T protein:vir:37 172 VYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEEIARKISESKGVGNFRSMFVNIAGGHPDG 251 (345) T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccc Confidence 89999988777666544444444444444456676554 355555554332 1 1111 112 22 2455 Q ss_pred ccccccCCcc-chhHHHHHHHHHHHHHHHHHhhh----hcccCCC-CCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHH Q lcl|NC_015159. 331 VEVFQLEKYN-DFQVAKATADDIEKRLSYAFMLN----SAVQRGG-DRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPL 404 (532) Q Consensus 331 ~~~~~~~~~~-~~~~~~~~i~~~~~rI~~af~~~----~~~~~~~-~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pl 404 (532) +...++.... +.+ -.+..+..++.|-.+|-.- .....+. ..-++++... .+...-+.|+ T Consensus 252 ~~~~pl~~~~~d~q-f~e~k~~~~~dI~~a~~VPp~liGi~~~~t~~~s~~e~~~~--------------~f~~~~l~P~ 316 (345) T protein:vir:37 252 LKVIPIGDTGTKDE-FANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYRE--------------VYHYDEVMPL 316 (345) T ss_pred eeEEEccCChhHHH-HHHHHHHhHHHHHHHhCCCHHHhccccCCCCCcccHHHHHH--------------HHHHHHHHHH Confidence 6666665433 343 4445566677788888322 1111111 1122333322 2233345666 Q ss_pred HHHHHHHHHhcCCCCCCccccccceeecchHHHHH Q lcl|NC_015159. 405 VKILLKELQATSKIPNLPKEAVEPAIATGLEALGR 439 (532) Q Consensus 405 i~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~r 439 (532) +.++...+.+ +|++++..+ +.+ ....|.| T Consensus 317 ~~~ie~~ln~---~~e~~~~~~-i~F--~~~~l~k 345 (345) T protein:vir:37 317 QEIIAETINQ---DPEIKNLLK-IKF--REQNFAK 345 (345) T ss_pred HHHHHHHhhh---hhccCCcce-EEE--CchhhcC Confidence 6666666654 344443221 111 1223333 No 193 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=35.43 E-value=1.2 Score=20.09 Aligned_cols=368 Identities=11% Similarity=0.004 Sum_probs=128.4 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHH--HHHhhcccccCCCCCcccccc-ccccc-chHHHHHHHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAED--CATYTIPSVFPSATADGSTSY-TTPWQ-SIGARGLNNLASKLML 76 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e--~~~~~~P~~~~~~~~~~~~~~-~~~~d-st~~~a~~~Laa~l~~ 76 (532) |+=..+.+ +.+|. +.++...|.. ....+.|.-. .+..+..-. ..... ++--.|++.+|+.+.+ T Consensus 1 m~~~~~~~-----~~~~~------~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~ 67 (412) T protein:vir:26 1 MNVIAKEN-----IVTRI------KKKLIDNWIDQSTSKLYDFSPW--KNRSFWGVINNTLETNETIFSAITKLSNSMAS 67 (412) T ss_pred Cccchhhh-----hhhhh------hhhHhhhhhccccccccccccc--CCccccccchhhhhccHHHHHHHHHHHHhHhh Confidence 66655433 33332 2344444421 1111111100 000111000 11222 3334455555555443 Q ss_pred hhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHhhCceeeeecc Q lcl|NC_015159. 77 ALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLVAGNVLLYIPS 151 (532) Q Consensus 77 ~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~~~v~~ 151 (532) -||.-..-.+ . .. ..+...|. +-| .+.-...++.+|..+|||.+|+.. T Consensus 68 ------lp~~~~~~~~-~-------------~~-------~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r 120 (412) T protein:vir:26 68 ------LPLKMYEDYK-V-------------VN-------TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIER 120 (412) T ss_pred ------CceeEeeccc-c-------------cc-------chHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEE Confidence 2443222111 0 00 11112222 233 233345667888899999887753 Q ss_pred cccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEE Q lcl|NC_015159. 152 TEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFR 231 (532) Q Consensus 152 ~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~ 231 (532) +. .+....+..+|.+..-+..+.++... +.++ T Consensus 121 ~~--~G~~~~L~~l~~~~v~v~~~~~~~~~--~y~~-------------------------------------------- 152 (412) T protein:vir:26 121 DI--YHQPSKLFLLNPDVVEMLIENQSREL--YYSI-------------------------------------------- 152 (412) T ss_pred CC--CCcEEEEEEEcCceeEEEEeCCCcEE--EEEE-------------------------------------------- Confidence 21 12233333333444444444333211 1000 Q ss_pred EEEEEcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccC Q lcl|NC_015159. 232 SYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQ 311 (532) Q Consensus 232 s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~ 311 (532) +...|... .+. .-=+++.|.....+..||.||..-+...+...+.+.+..+... ...+-+++..++.++ T Consensus 153 --~~~~g~~~-----~~~--~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~a~~~~~~~~~--~~~~~~i~~~~~~l~ 221 (412) T protein:vir:26 153 --HAATGNKL-----IVH--NMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEM--QKPDSFMLKYGSNVG 221 (412) T ss_pred --EcCCceEE-----EEc--cccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHhc--CCCCceEEecCCCCC Confidence 00001000 000 0112333333345678999999888777776666655543322 222334554555555 Q ss_pred hhhhcc---------CCCceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh-ccc--CCCCCCCHHHH Q lcl|NC_015159. 312 IRRVAK---------ANTGDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS-AVQ--RGGDRVTAEEI 378 (532) Q Consensus 312 ~~~~~~---------~~~G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~--~~~~~~TAtEi 378 (532) ++.... ...|.+.. -.+++...++.. ..+.+ ..+..+..+..|-++|-.-. +.. .++..-+++|. T Consensus 222 ~e~~~~~~~~~~~~~~~~g~~~v-l~~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~ 299 (412) T protein:vir:26 222 KEKRQQVLEDFKQYYEENGGILF-QEPGVEIEPLPKKYVSED-IVASENLTRERVANVFQLPSVFLNARSNTNFAKNEEL 299 (412) T ss_pred HHHHHHHHHHHHHHhhcCCCeee-cCCCceEEEcCCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH Confidence 553311 11222211 122333334432 23344 23344445677888884321 111 11111233322 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc-ccccee-ecc---hHHHHHHHHHHHHHHH---- Q lcl|NC_015159. 379 RYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKE-AVEPAI-ATG---LEALGRGHDLNKLNVF---- 449 (532) Q Consensus 379 ~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~-~~~~~~-v~~---l~~l~raq~~~~l~~~---- 449 (532) .. .=....|.|.+.++.++|-.- +|++.... ...+.+ +.. .+...|+.-++.+... T Consensus 300 ~~--~f~~~~l~P~~~~ie~~ln~k-------------Ll~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t 364 (412) T protein:vir:26 300 NR--FYLQHTLLPIVKQYEEEFNRK-------------LLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYT 364 (412) T ss_pred HH--HHHHHHHHHHHHHHHHHHHhh-------------cCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcC Confidence 21 111223555555555444333 33322111 111121 111 1233333333322211 Q ss_pred HHH---HHhhcch-hhhh-------cCHHHHHH--HHHHhcCCCHhHc Q lcl|NC_015159. 450 IDY---MIKLAGL-QDDD-------INLLDVKM--RLANSLGMDTTGL 484 (532) Q Consensus 450 ~~~---laq~~p~-~~d~-------id~d~~~~--~~a~~~Gv~p~~i 484 (532) ... +-.+.|. -.|. +-.|...+ ....+=+....+= T Consensus 365 ~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~gG~~n~~e~ 412 (412) T protein:vir:26 365 INDIREWEDLPPVEGGDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 412 (412) T ss_pred HHHHHHHhCCCCCCCcCeeeecccccccccchhhcccccCCCCCcCCC Confidence 000 1111121 0111 11111110 0111101110001 No 194 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=34.53 E-value=1.3 Score=19.99 Aligned_cols=310 Identities=10% Similarity=0.034 Sum_probs=125.9 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccccc-ccccccchHHHHHHHHHHHHHHhhc Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTS-YTTPWQSIGARGLNNLASKLMLALF 79 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~-~~~~~dst~~~a~~~Laa~l~~~lt 79 (532) |+|.+.......+. ++. + -+.+ +++..-- ...++| .+.... T Consensus 1 m~~~~~~~~~~~~~-----------~~~-----~--------~~~~-~~p~~~~~~~~~~~-------------~~~~~~ 42 (337) T protein:vir:78 1 MTKRQQQPAQAAAS-----------SPR-----P--------SVVF-SMPEAIDPTAWMTD-------------YTGVFY 42 (337) T ss_pred CCCcccCccccccc-----------Cce-----e--------EEEe-cCcccccCcchhHh-------------hhhhhh Confidence 88876532111100 000 0 0111 1110000 001111 122223 Q ss_pred CCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCC---hHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_015159. 80 PVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSF---RPTLHAAIKQLLVAGNVLLYIPSTEQVE 156 (532) Q Consensus 80 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf---~~~~~~~~~dl~~~G~~~~~v~~~~~~~ 156 (532) -.+-.|++-=++-..+.+.... ..+...+=. .......+.| +..+..+..|+.+||||.+++..+ T Consensus 43 ~~~~~~~~pP~~~~~La~l~~~-------~~~h~~~L~-~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~~rn---- 110 (337) T protein:vir:78 43 NPYGEYYQPPIDRKGLAKVARA-------NAHHGAILM-ARRNMVAGRFTNQRATITAFVHNYLQFGDGGLLKLRN---- 110 (337) T ss_pred ccCcceecCCCCHHHHHHHhhc-------chhhhhHHH-hhhccccccCcCcHHHHHHHHHHHHhhCCeEEEEEEC---- Confidence 3455666443333333332211 111110000 0011112223 346777888999999998876433 Q ss_pred CCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEE Q lcl|NC_015159. 157 GQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEI 236 (532) Q Consensus 157 ~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~ 236 (532) ..|-....+|+..-++.+..+|+. .| +. ..+ T Consensus 111 ~~G~~~~L~pl~~~~v~~~~d~~~--~~------------------------------------~~--~~~--------- 141 (337) T protein:vir:78 111 SFGQVVGLHPLSSVYLRRREDGCF--VY------------------------------------LQ--QGK--------- 141 (337) T ss_pred CCCcEEEEEEeCCceeEeeeCCeE--EE------------------------------------EE--cCC--------- Confidence 123334445554333443333311 00 00 000 Q ss_pred cCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee-cCccccChhhh Q lcl|NC_015159. 237 DGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFV-NPNGVTQIRRV 315 (532) Q Consensus 237 ~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv-~~~g~~~~~~~ 315 (532) ... .+. .--++..|.....+.+||.+|..-++..+-.-+..++-..+.-.-...|-.++ .+++.++.+.. T Consensus 142 --~~~-----~~~--~~eIiHik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~~~l~~e~~ 212 (337) T protein:vir:78 142 --PNL-----IYR--PDDVIWLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYATDPNMDDDTE 212 (337) T ss_pred --ceE-----EEC--CccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH Confidence 000 011 11234455443456799999998888877766666655555555556677554 35555555543 Q ss_pred cc----------CCCc--eee--cC-ccccccccccCCcc-chhHHHHHHHHHHHHHHHHHhhh--hcc-cCCCCCCC-- Q lcl|NC_015159. 316 AK----------ANTG--DFV--AG-RKQDVEVFQLEKYN-DFQVAKATADDIEKRLSYAFMLN--SAV-QRGGDRVT-- 374 (532) Q Consensus 316 ~~----------~~~G--~~v--~g-~~~~~~~~~~~~~~-~~~~~~~~i~~~~~rI~~af~~~--~~~-~~~~~~~T-- 374 (532) .. .+|+ .++ +| ..+++...++...+ +.+ -.+..+..++.|-.+|-.- .+. ..+...-| T Consensus 213 ~~lk~~~~~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~q-fle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~ 291 (337) T protein:vir:78 213 EEMKEMIANSKGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKDE-FAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLG 291 (337) T ss_pred HHHHHHHHHhcCcccccceEEEcCCCCccceeEEEcCCChhHHH-HHHHHHHhHHHHHHHhCCCHHHcccccCCCcCccc Confidence 11 1111 112 22 24556666665433 443 3445555666787887321 111 11222222 Q ss_pred -HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecch Q lcl|NC_015159. 375 -AEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGL 434 (532) Q Consensus 375 -AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l 434 (532) +++... .+...-|.|++.++...+.+.+ ||+..--.++.++-..+ T Consensus 292 n~e~~~~--------------~f~~~~L~P~~~~ie~~~n~~l-l~~~~~~~f~~~~~~~~ 337 (337) T protein:vir:78 292 DPEKYDA--------------TYARNEVLPLCELVQDAINSAG-LPRALWVTFRETIGAAV 337 (337) T ss_pred cHHHHHH--------------HHHHHHHHHHHHHHHHHHhhhc-CChhhceeccccccccC Confidence 333322 1233344455555554444332 22111111111111112 No 195 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=33.66 E-value=1.3 Score=19.88 Aligned_cols=390 Identities=11% Similarity=0.083 Sum_probs=143.5 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccccccccccc-chHHHHHHHHHHHHHHhhcCCCCCccccCCC-h Q lcl|NC_015159. 15 AAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQ-SIGARGLNNLASKLMLALFPVGSSFFKLNVS-E 92 (532) Q Consensus 15 ~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~d-st~~~a~~~Laa~l~~~ltpp~~~WF~l~~~-d 92 (532) =+.|..+- ....+.|...+ .-|..++..+ |.--.....+ ++--.|++.+|+.+.+ + ||--..-. | T Consensus 1 m~~~~~~~---~~~~~~~~~~~--~~~~~~~~~~--g~~~~~~Al~~~~V~~cv~~ia~~iA~-l-----p~~~~~~~~~ 67 (417) T protein:vir:38 1 MKLFRGLA---TEVDPHWADHL--LDSGVIPSFR--GGYLGISALRNSDVLTAVSIVSGDVSR-F-----PLVITDSSTD 67 (417) T ss_pred Cccccccc---cCCCccchhhh--cccccccccC--CceechhhcccHHHHHHHHHHHHhhcc-C-----eeEEEEcCCc Confidence 11121111 12234443322 1222222211 1000001122 3333456666665543 2 33211111 1 Q ss_pred HHHhhhccChhHHHHHHHHHHHHHHHHHHHH-HhcC----ChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEec Q lcl|NC_015159. 93 LEVKQSITSPEELTEIATGLAMVERICMNYM-ESNS----FRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKL 167 (532) Q Consensus 93 ~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~sn----f~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl 167 (532) ... +...+. ..| .+-| .+.=....+.+|..+|||..++..+. ..+....+..+|. T Consensus 68 ~~~--------~~~~~~-----------~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~-~g~~~~~l~~l~p 127 (417) T protein:vir:38 68 EVI--------DLANIE-----------YLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDP-ITNEPAMFEFYAP 127 (417) T ss_pred cee--------ccchHH-----------HHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-CCCEEEEEEEeCC Confidence 100 000111 112 1223 33344455778888999999886432 1122233444444 Q ss_pred ceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCccccccccc Q lcl|NC_015159. 168 HNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGE 247 (532) Q Consensus 168 ~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~ 247 (532) ..+.+..+..|++- | ++.. . .+... .. T Consensus 128 ~~v~v~~~~~~~~~--y-~~~~---------------------------------~-------------~~~~~----~~ 154 (417) T protein:vir:38 128 SQTQVDTSDPDNII--Y-RFTP---------------------------------Y-------------NSSMQ----KV 154 (417) T ss_pred ceEEEEEcCCCeEE--E-EEEE---------------------------------c-------------CCcEE----EE Confidence 55555555555331 1 1110 0 00000 00 Q ss_pred CccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccCC-------- Q lcl|NC_015159. 248 YPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKAN-------- 319 (532) Q Consensus 248 ~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~~-------- 319 (532) .+.++ +++.|....++ .||.||...+...+...+...+.......-...|-+++.-++.++++...... T Consensus 155 ~~~~d--viH~r~~~~d~-~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~l~~e~~~~~~~~~~~~~~ 231 (417) T protein:vir:38 155 CGFED--VIHWKFFSYDT-IMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKESRLSAEARQKIREDFERAQA 231 (417) T ss_pred ecCcc--eEEecCCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCHHHHHHHHHHHHHHhc Confidence 00011 24444443344 78999999999988888888888888777778888777777777665442211 Q ss_pred ---CceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCCCCHHHHHHHHHHHHHHhhhhHH Q lcl|NC_015159. 320 ---TGDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS-AVQRGGDRVTAEEIRYVAGELEDTLGGVYS 394 (532) Q Consensus 320 ---~G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~~~TAtEi~~r~~E~~~~LGpv~~ 394 (532) .|.++- ..++....++.. ..+.+. .+.....+..|-++|-.-. +....+..-+++|. ... T Consensus 232 g~n~g~~~v-l~~g~~~~~l~~~~~d~q~-le~~~~~~~~Ia~~fgVPp~~lg~~~~~s~~e~~---~~~---------- 296 (417) T protein:vir:38 232 GADAGSPII-VDATMDYQPLEVDTNVLNL-INSNNYSTAQIAKALRVPAYRLAQNSPNQSVKQL---ADD---------- 296 (417) T ss_pred ccccCCcee-ccCCceEEEccCCHHHHHH-HHHHHhhHHHHHHHhCCCHHHhCCCCcchhHHHH---HHH---------- Confidence 111100 012233333332 233442 3344555677888773221 11111111122222 111 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHH Q lcl|NC_015159. 395 LLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLA 474 (532) Q Consensus 395 rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a 474 (532) +...-|.|++.+....+.+ .+|++.....+.+.+ .++.+.+.. ...+...++ .-.+..+++ - T Consensus 297 -~~~~tl~P~~~~ie~~l~~-~Ll~~~~~~~~~~~f--d~~~l~~~~-~~~~~~~~~---------~G~~T~NE~----R 358 (417) T protein:vir:38 297 -YIRNDLPFYFEPITSEFEL-KLLDDAQRHQYCIGF--DTKSVNGLP-IADVNTAVN---------GGLWTGNEG----R 358 (417) T ss_pred -HHHHHHHHHHHHHHHHHHh-hhcChhhcccceEEe--chhhhhHHH-HHHHHHHHh---------CCCcCHHHH----H Confidence 2233455555554444322 244433222222332 223332221 111111111 112344443 3 Q ss_pred HhcCCCHhHccCCH--HHHHHH----HHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 475 NSLGMDTTGLILTQ--QDKQAK----MAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 475 ~~~Gv~p~~i~~s~--ee~~~~----~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) +.+|.+|- ... +....- .-....+.+.. +.....+|..-+.-.++..|--+- T Consensus 359 ~~~gl~pi---~~g~~d~~~~~~n~~~~d~~~~~~~~---~~~~~kgg~~~~~~~~~~~~~~~~ 416 (417) T protein:vir:38 359 AELGKKPL---KDPNMDRIQSTLNTVFLDQKEAYQAE---HAAELKGGDTNAKGNQNGSGTNAN 416 (417) T ss_pred HHhCCCCC---CCCCCCeeeecccccccccccccccc---cccccCCCCCCCCCCCcCCCCcCC Confidence 34566552 110 110000 00000000000 000000111100000001111111 No 196 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=31.00 E-value=1.5 Score=19.57 Aligned_cols=358 Identities=12% Similarity=0.047 Sum_probs=117.4 Q ss_pred HHHHhhhHHHHHHH--HHHhhcccccCC--CCCcccccc-----ccccc-chHHHHHHHHHHHHHHhhcCCCCCccccCC Q lcl|NC_015159. 21 LKNDRGAYETRAED--CATYTIPSVFPS--ATADGSTSY-----TTPWQ-SIGARGLNNLASKLMLALFPVGSSFFKLNV 90 (532) Q Consensus 21 lk~~R~~~e~~w~e--~~~~~~P~~~~~--~~~~~~~~~-----~~~~d-st~~~a~~~Laa~l~~~ltpp~~~WF~l~~ 90 (532) |+.++ ...+-+. +.++.-++-... .+.-..+.. ..... ++--.|++.+|+.+.+ -||--..- T Consensus 1 ~~~~~--~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~V~~ci~~ia~~ia~------lp~~~~~~ 72 (409) T protein:vir:96 1 MAKEN--IVTRIKKKLIDNWIDQSASKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMAS------LPLKMYED 72 (409) T ss_pred Ccccc--chhhhhhHHhhhhhccccccccccccccCccccccchhhHhhhHHHHHHHHHHHHhhhh------CceEEeec Confidence 33321 1111111 112222221110 000000000 11122 2222334444443332 23322211 Q ss_pred ChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEE Q lcl|NC_015159. 91 SELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLY 165 (532) Q Consensus 91 ~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~ 165 (532) .+ . .. ..+...|. +-| .+.-...++.+|..+||+.+|+..+. .+....+..+ T Consensus 73 ~~-~-------------~~-------~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~--~G~~~~L~~l 129 (409) T protein:vir:96 73 YK-V-------------VN-------TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI--YHQPSKLFLL 129 (409) T ss_pred cc-c-------------cc-------hhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECC--CCcEEEEEEE Confidence 10 0 00 11112232 233 22334566678888999988775331 1222222222 Q ss_pred ecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCccccccc Q lcl|NC_015159. 166 KLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTE 245 (532) Q Consensus 166 pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~ 245 (532) |.+.+-+..+.++... |-.++...|... T Consensus 130 ~~~~v~v~~~~~~~~~------------------------------------------------~y~~~~~~g~~~---- 157 (409) T protein:vir:96 130 NPDVVEMLIENQSREL------------------------------------------------YYSIHAATGNKL---- 157 (409) T ss_pred cCceeEEEEeCCCcEE------------------------------------------------EEEEEcCCceEE---- Confidence 3333333333222110 000111111100 Q ss_pred ccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhcc-------- Q lcl|NC_015159. 246 GEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAK-------- 317 (532) Q Consensus 246 ~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~-------- 317 (532) .+. ..-++..|-....+..||.||...+...+...+.+.+.... .....+.+++..++.++++.... T Consensus 158 -~~~--~~evih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~--~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~ 232 (409) T protein:vir:96 158 -IVH--NMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLT--EMQKPDSFMLKYGSNVSTEKRQQVLEDFKQY 232 (409) T ss_pred -EEc--cccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHHHH--hcCCCceeEEecCCCCCHHHHHHHHHHHHHH Confidence 011 11234444333456689999998777666666655554332 22223345666666666554421 Q ss_pred -CCCceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhhcccC---CCCCCCHHHHHHHHHHHHHHhhhh Q lcl|NC_015159. 318 -ANTGDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNSAVQR---GGDRVTAEEIRYVAGELEDTLGGV 392 (532) Q Consensus 318 -~~~G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~---~~~~~TAtEi~~r~~E~~~~LGpv 392 (532) ..+|.+.. -.++....++.. ..+.+.. +..+..+..|-++|-.-..... ++..=+++|.. ..=....|.|. T Consensus 233 ~~n~g~~~v-l~~g~~~~~l~~~~~d~q~~-e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~--~~f~~~~l~P~ 308 (409) T protein:vir:96 233 YEENGGILF-QEPGVEIEPLPKKYVSEDIV-ASENLTRERVANVFQLPSIFLNARSNTNFAKNEELN--RFYLQHTLLPI 308 (409) T ss_pred hhcCCCeee-cCCCceEEEcCCChhHHHHH-HHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH--HHHHHHHHHHH Confidence 11222211 112233333332 2344422 3344456678888743211111 11112333222 11222334454 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCCCccc-ccccee-ecch---HHHHHHHHHHHHHHH----HHH---HHhhcchh Q lcl|NC_015159. 393 YSLLSQELQLPLVKILLKELQATSKIPNLPKE-AVEPAI-ATGL---EALGRGHDLNKLNVF----IDY---MIKLAGLQ 460 (532) Q Consensus 393 ~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~-~~~~~~-v~~l---~~l~raq~~~~l~~~----~~~---laq~~p~~ 460 (532) +.++++|+-.- +||+.... ...+.+ +..+ +...|+.-.+.+... ... .-.+.|.. T Consensus 309 ~~~ie~~l~~~-------------Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~ 375 (409) T protein:vir:96 309 VKQYEEEFNRK-------------LLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVE 375 (409) T ss_pred HHHHHHHHHhh-------------cCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC Confidence 44444443333 33322111 111221 1111 222333222222110 000 01111110 Q ss_pred -hh-------hcCHHHHH--HHHHHhcCCC-HhH Q lcl|NC_015159. 461 -DD-------DINLLDVK--MRLANSLGMD-TTG 483 (532) Q Consensus 461 -~d-------~id~d~~~--~~~a~~~Gv~-p~~ 483 (532) .| .+-.+... +.-..+=+-. -+. T Consensus 376 ggD~~~~~~n~~~~~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:96 376 GGDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred CcceeeecccccccccchhhcccccCCCCCcCCC Confidence 01 01111110 0000000000 011 No 197 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=26.26 E-value=2 Score=18.97 Aligned_cols=480 Identities=11% Similarity=0.026 Sum_probs=161.1 Q ss_pred CCCCCCCccC----HH----HHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccccccccc--ccchHHHHHHHH Q lcl|NC_015159. 1 MAEVEKTGFA----AD----GAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTP--WQSIGARGLNNL 70 (532) Q Consensus 1 m~~~~~~~~~----~~----~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~--~dst~~~a~~~L 70 (532) |+.+|+.-.+ +. ++..|- =..-++--+.+|..-...+-|-. +..+ +..+ ..++...|++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~p~~-----~~~~--L~~~~e~~~~~~~~i~~~ 71 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADLAK--SPNSTQIPDHRIQSHNVGVNPPY-----NPDR--LAAFLELNETLATGIRKK 71 (651) T ss_pred CCCccceeeeeEEEeecccccccccc--cccccccchhhhcccCCCCCCCC-----CHHH--HHHHHhcChHHHHHHHHH Confidence 7766522110 00 011000 00001111223322222233311 1111 2222 244556778888 Q ss_pred HHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcC----ChHHHHHHHHHHHhhCcee Q lcl|NC_015159. 71 ASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNS----FRPTLHAAIKQLLVAGNVL 146 (532) Q Consensus 71 aa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~sn----f~~~~~~~~~dl~~~G~~~ 146 (532) +..+.+. .| .+.+.. +..........+..+..++..+-..........| +..-+...+.|+.++||+| T Consensus 72 ~~~iag~------g~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ 143 (651) T protein:vir:99 72 SRYEVGF------GF-DLVPAQ-GVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLA 143 (651) T ss_pred hhhhhcc------Cc-eeeecc-cCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHh Confidence 8877443 22 111100 0000111222233344444332222222223333 3345566778999999998 Q ss_pred eeecccccccCCcceEEEEecceEEEeeCCCCCeEE-EEEEEee--cHHHhhHHHH---HHHHhh----cccCCCcceEE Q lcl|NC_015159. 147 LYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQ-IVTEDKI--ARAALPEDVR---KSLEEA----QGDQNPSEEVT 216 (532) Q Consensus 147 ~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~-i~rk~~~--~~~~l~~~~~---~~~~~~----~~~~~~~~~v~ 216 (532) +=+-.++ .+....+..+|+..+.+..+... ++. .++.... ....-.-.+. ..+... ...++-+..+. T Consensus 144 ieiIrn~--~g~pv~L~~lp~~~~Rv~~~~~~-~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~ 220 (651) T protein:vir:99 144 LEMLTDI--EGRPVGLAYVPARTVRVRRPQNR-FDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQE 220 (651) T ss_pred hhhhhcC--ccchhhhhhcChhheeeeccccc-ccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeecccccee Confidence 7332221 12333444455555555444322 221 1111100 0000011111 111000 00011111111 Q ss_pred EEEE--------EEeeCCCCeEEEEE--EEcCcccccccccCccccCc---eEEEEeeecCCCccccchHHHHHHHHHHH Q lcl|NC_015159. 217 IYTH--------VYRDPEAMVFRSYQ--EIDGEIVAGTEGEYPLDSCP---WIPVRLIKMPNEDYGRSFVEEYLGDLKSL 283 (532) Q Consensus 217 i~~~--------v~~~~~~~~~~s~~--~~~~~~~~~~~~~~g~~~~P---~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L 283 (532) .+.. +.+......+...+ ...+... ..+..+...+| +++.|.....+..||.||.+-++..+... T Consensus 221 ~~~~~~~~~v~~~~~~d~~~~~~~~~~~~~~g~~~--~~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a 298 (651) T protein:vir:99 221 VVIDESGDEPTIRYREDEESEREPIFVDRETGDVT--TGDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISAD 298 (651) T ss_pred eeeccCCcceeEEeccCcceeeeeecccceeeeEE--EcCCCceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHH Confidence 1110 01101100000000 0000000 00000111223 46666665567789999999999999999 Q ss_pred HHHHHHHHHHHHHHhcCceeec-CccccChhhhccC---------CCce--eecCc--------cccccccccCCc--cc Q lcl|NC_015159. 284 ENLYEAIVKMSMISSKVLFFVN-PNGVTQIRRVAKA---------NTGD--FVAGR--------KQDVEVFQLEKY--ND 341 (532) Q Consensus 284 ~~l~~~~l~~~~~a~~p~~lv~-~~g~~~~~~~~~~---------~~G~--~v~g~--------~~~~~~~~~~~~--~~ 341 (532) ..+.+.......-...|..++. +++.++.+..... +.|. ++++. ..++...++... .+ T Consensus 299 ~~a~~~~~~~f~NG~~p~gil~~~~~~ls~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D 378 (651) T protein:vir:99 299 EAAKDYNRDFFDNDTIPRMVIKVTGGELSEESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEE 378 (651) T ss_pred HHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcCchhh Confidence 9888888887777778886665 5555555443211 1121 22221 113333343321 24 Q ss_pred hhHHHHHHHHHHHHHHHHHhhhh--c-ccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_015159. 342 FQVAKATADDIEKRLSYAFMLNS--A-VQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKI 418 (532) Q Consensus 342 ~~~~~~~i~~~~~rI~~af~~~~--~-~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~l 418 (532) .+. .+..+..+..|.++|-... + ...++..=|+++... .+...-|.|++.++...|.+. +| T Consensus 379 ~qf-le~r~~~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~~--------------~f~~~tL~P~~~~ie~eln~k-Ll 442 (651) T protein:vir:99 379 MDF-RQFREKNEHEIAKVLEVPPVKIGVTDSANRSNSDQQDK--------------DFALEVIQPEQHTFAEWLYQI-IH 442 (651) T ss_pred HHH-HHHHHHHHHHHHHHhCCCHHHhccCCCCCcccHHHHHH--------------HHHHHHHHHHHHHHHHHHHHh-hc Confidence 443 4456677888999984321 1 112222334443332 122333445554444443332 33 Q ss_pred CCCcc-ccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCCCHhH-------c------ Q lcl|NC_015159. 419 PNLPK-EAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTG-------L------ 484 (532) Q Consensus 419 p~~p~-~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv~p~~-------i------ 484 (532) ++-.. ...++.+..-...+-|. +......+++.+-+. + .+..+++ -+.+|.||-. + T Consensus 443 ~~~e~~~~~~i~~ef~~~~llr~-D~~~~~e~~~~~i~~-G----~~T~NE~----R~~lglppi~~~~gd~~l~~~~~~ 512 (651) T protein:vir:99 443 QQALGVTDWTIEYELRGADQPKQ-EAQLAEQRVRAMRLA-G----VGLVDEA----REELGLDPLGEPYGEMTLSEFEAE 512 (651) T ss_pred CccccccCceEEEEeccchhhhc-cHHHHHHHHHHHHhC-C----CcCHHHH----HHHhCCCCCCCccccccccccccc Confidence 32111 11112221111222221 111111111111110 0 0111111 1112322210 0 Q ss_pred ---------------cCCHHHHHHHHHHHHHHHHHH--HHHHhhhHHHHHHHHhhcccccCCCCC Q lcl|NC_015159. 485 ---------------ILTQQDKQAKMAEASTAAGMV--TAGQQMGAAGGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 485 ---------------~~s~ee~~~~~~q~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~g~~~~ 532 (532) ...++..+.+++..+..+.+. ..-...+... ....+..=.. .++ T Consensus 513 ~~g~~~~gge~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~~~~v~s-s~~~~~gyd~---~~~ 573 (651) T protein:vir:99 513 VAGDVAGGGETEAVHEPPEENKIGEREWDTVKSELTTKDPIEQMQFSS-SNLDEGLYDF---GEN 573 (651) T ss_pred cccccccCCCCcccccCccccccccchhhhhhhhhcccchhhhhhHHH-HHHHhhcCCC---ccc Confidence 000000000000000000000 0000000000 0000000000 000 No 198 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=25.76 E-value=2 Score=18.91 Aligned_cols=351 Identities=11% Similarity=-0.032 Sum_probs=137.8 Q ss_pred cccccCCCCCccc---ccc------------cccccch--------------HHHHHHHHHHHHHHhhcCCCCCccccCC Q lcl|NC_015159. 40 IPSVFPSATADGS---TSY------------TTPWQSI--------------GARGLNNLASKLMLALFPVGSSFFKLNV 90 (532) Q Consensus 40 ~P~~~~~~~~~~~---~~~------------~~~~dst--------------~~~a~~~Laa~l~~~ltpp~~~WF~l~~ 90 (532) +=-+....+.... .+. ...|+.. -..++..+...|.+.+- +-|+--..- T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA--~lp~~~~~~ 78 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKFGIKLNFSVRGKRALKENTVYVCTKIRAESIG--KLSLKIYKD 78 (422) T ss_pred CchhhhhhhccCCccchhhhhhhccccccCcchhhhhccccCCcccchhhhhccHHHHHHHHHHHHhhh--hCceEEEec Confidence 1100000111000 000 0111111 12235555555555552 344432221 Q ss_pred ChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEE Q lcl|NC_015159. 91 SELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLY 165 (532) Q Consensus 91 ~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~ 165 (532) .+. ++ +..+...|. +-| .+.-+..++.++..+|||.+++.-+. .+....+..+ T Consensus 79 ~~~--------------~~------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~--~G~~~~L~~i 136 (422) T protein:vir:13 79 KEE--------------YK------EHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDR--KGKIIGLYPI 136 (422) T ss_pred Ccc--------------cc------cchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC--CCcEEEEEEE Confidence 100 00 011122232 222 33456677788999999988875432 2344455555 Q ss_pred ecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCccccccc Q lcl|NC_015159. 166 KLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTE 245 (532) Q Consensus 166 pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~ 245 (532) +.+.+.+..|.+|..... .. +|-.+....|.. T Consensus 137 ~~~~v~~~~~~~~~~~~~-----------------------------~~--------------~~y~~~~~~g~~----- 168 (422) T protein:vir:13 137 NSDNVTKIIDDDNFLSSL-----------------------------SK--------------VWYVVTDKNGKE----- 168 (422) T ss_pred CCcceEEEEcCCcceecc-----------------------------ce--------------EEEEEEeCCCeE----- Confidence 556677777776643210 00 000000001110 Q ss_pred ccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhhccC------- Q lcl|NC_015159. 246 GEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKA------- 318 (532) Q Consensus 246 ~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~~~~------- 318 (532) +.+...-.++.+.....+..||.||...+...+.......+.......-...|..++.-++.++.+..... T Consensus 169 --~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~ 246 (422) T protein:vir:13 169 --HKLLPDEMLHFIGDITLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVGDLDEKAKKIFKKEFESM 246 (422) T ss_pred --EEEcccceEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHH Confidence 00111223444444445668999999999999999998888888888888888877765555555432111 Q ss_pred ----CC-ce-eecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--cc-cCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_015159. 319 ----NT-GD-FVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--AV-QRGGDRVTAEEIRYVAGELEDT 388 (532) Q Consensus 319 ----~~-G~-~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~-~~~~~~~TAtEi~~r~~E~~~~ 388 (532) .+ |. .+- .+++...++.. ..+.|. .+..+..+..|-++|=.-. +. ..++..-+++|. ...=.... T Consensus 247 ~~g~~n~~~~~vl--~~g~~~~~l~~~~~d~q~-le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~--~~~f~~~~ 321 (422) T protein:vir:13 247 SNGLENAHSISLL--PFGYQFQPISLSMADAQF-LENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQ--QKDFYVTT 321 (422) T ss_pred hcCccccCCceec--CCCceeeeccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH--HHHHHHHH Confidence 01 11 111 12223333332 234443 3444556677888873211 11 112222233332 12223334 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcc-cccccee-ecch---HHHHHHHHHHHHHHH----HHH---HHhh Q lcl|NC_015159. 389 LGGVYSLLSQELQLPLVKILLKELQATSKIPNLPK-EAVEPAI-ATGL---EALGRGHDLNKLNVF----IDY---MIKL 456 (532) Q Consensus 389 LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~-~~~~~~~-v~~l---~~l~raq~~~~l~~~----~~~---laq~ 456 (532) |-|.+.++..+|-.- +|++... ....+.+ ++.+ +...|+.-.+.+... ... +-.+ T Consensus 322 l~P~~~~ie~~l~~~-------------Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl 388 (422) T protein:vir:13 322 LQSSLTVYEQEIQDK-------------LFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENL 388 (422) T ss_pred HHHHHHHHHHHHHHh-------------hCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 445554444444333 3332221 1122222 1111 122232222222110 000 1111 Q ss_pred cch-hhh-------hcCHHHHHHHHHHhcCCCHh Q lcl|NC_015159. 457 AGL-QDD-------DINLLDVKMRLANSLGMDTT 482 (532) Q Consensus 457 ~p~-~~d-------~id~d~~~~~~a~~~Gv~p~ 482 (532) .|. -.| .+..|.+-..-...-+=... T Consensus 389 ~p~~ggD~~~~~~n~~~l~~~~~~~~~~g~~~g~ 422 (422) T protein:vir:13 389 PPVEGGDRLLVNGNMIPIEMAGEQYKKGGEKGGK 422 (422) T ss_pred CCCCCcCeeeeccCccchhhcccccccCCCcCCC Confidence 121 011 11112111111111000111 No 199 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=22.72 E-value=2.4 Score=18.49 Aligned_cols=359 Identities=9% Similarity=-0.038 Sum_probs=130.5 Q ss_pred cCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccc-ccccc-chHHHHHHHHHHHHHHhhcCCCCCcc Q lcl|NC_015159. 9 FAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSY-TTPWQ-SIGARGLNNLASKLMLALFPVGSSFF 86 (532) Q Consensus 9 ~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~-~~~~d-st~~~a~~~Laa~l~~~ltpp~~~WF 86 (532) += ..+++.+....+.+.-. .|+-..++...+..+..-. ..... ++--.|++.+|+.+.+ + ||- T Consensus 1 m~---~~~~~~~~~~~~~~~~~------~~~~~~~g~~~s~~~~~v~~~~al~~~~v~~cv~~ia~~ia~-l-----p~~ 65 (419) T protein:vir:80 1 MF---FSRQLLSNLGQTQPGSG------GWVSALLGSARSEAGQVVTPASALSLTVLQNCVTLLAESIAQ-L-----PVE 65 (419) T ss_pred CC---cccccccccCcCCCCcc------hhhHHhhcccccccCcccChHHhhccHHHHHHHHHHHHhhcc-C-----ceE Confidence 00 00000000000011100 0000111111111111000 11122 2333344555544432 2 442 Q ss_pred ccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcC----ChHHHHHHHHHHHhhCceeeeecccccccCCcce Q lcl|NC_015159. 87 KLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-SNS----FRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNA 161 (532) Q Consensus 87 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~ 161 (532) -..-.... .+ +++ +..+...|+ +-| .+.-....+.++..+|||++|+..+. .+.... T Consensus 66 ~~~~~~~~-~~---------~~~------~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~--~G~~~~ 127 (419) T protein:vir:80 66 LYERSGDD-RK---------PAT------DHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQ--DGVIQG 127 (419) T ss_pred EEEecCCC-cc---------ccc------ccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC--CCcEEE Confidence 11111110 00 000 111222332 222 33334566678889999988875432 123333 Q ss_pred EEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCccc Q lcl|NC_015159. 162 PKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIV 241 (532) Q Consensus 162 ~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~ 241 (532) +..+|...+-+..+.+|++. |.+.|... T Consensus 128 L~~i~~~~v~i~~~~~~~~~----------------------------------------------------y~~~~~~~ 155 (419) T protein:vir:80 128 LYPLDNEAVTVMKGPDLKPM----------------------------------------------------YRVAGADP 155 (419) T ss_pred EEEecCceEEEEECCCceEE----------------------------------------------------EEEcCccc Confidence 44444455555555444221 11111110 Q ss_pred ccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCc----cccChhhhc- Q lcl|NC_015159. 242 AGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPN----GVTQIRRVA- 316 (532) Q Consensus 242 ~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~----g~~~~~~~~- 316 (532) .+ .-=+++.|+...+| .||.||..-+...+.....+.+.......-...|..++.-. +..+.+... T Consensus 156 ------~~--~~~i~h~~~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~ 226 (419) T protein:vir:80 156 ------LP--QRLVHHVRWMSING-YTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDR 226 (419) T ss_pred ------cc--hhheEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHH Confidence 00 11145566665555 89999999999888888888888888777778887666422 122222211 Q ss_pred ---------cC-CC-ceeecCccccccccccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--cc-cCCCCCCCHHHHHHH Q lcl|NC_015159. 317 ---------KA-NT-GDFVAGRKQDVEVFQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--AV-QRGGDRVTAEEIRYV 381 (532) Q Consensus 317 ---------~~-~~-G~~v~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~-~~~~~~~TAtEi~~r 381 (532) .+ .+ |.+.. -.++....++.. ..+.+ ..+..+..++.|-.+|-... +. ..++..-+++|... T Consensus 227 ~~~~~~~~~~g~~n~g~~~v-l~~g~~~~~l~~s~~d~q-~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~- 303 (419) T protein:vir:80 227 ITDGWNAKFGGSGNAKKVAL-LQEGMKFKPLSMTNVDAA-LIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSL- 303 (419) T ss_pred HHHHHHHHhcCccccCCcee-cCCCceEEeccCChhhHH-HHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHH- Confidence 01 11 11111 122333344442 23444 23444556778888884321 11 11222223333221 Q ss_pred HHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee-ecch---HHHHHHHHHHHH----------- Q lcl|NC_015159. 382 AGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI-ATGL---EALGRGHDLNKL----------- 446 (532) Q Consensus 382 ~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~-v~~l---~~l~raq~~~~l----------- 446 (532) .=....|.|.+.+++++|-.- +|++-......+.+ ++.+ +...|+.-.+.+ T Consensus 304 -~f~~~~l~P~~~~ie~~l~~k-------------ll~~~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~ 369 (419) T protein:vir:80 304 -QFVIYTLLPWVKRHEQAKTRD-------------LLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDI 369 (419) T ss_pred -HHHHHHHHHHHHHHHHHHhhh-------------ccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHH Confidence 112223445444444443322 33322222222222 1111 122222221111 Q ss_pred HHHHH-----------------HHHhhcchhh-hhcCHHHHHHHHHHhcC Q lcl|NC_015159. 447 NVFID-----------------YMIKLAGLQD-DDINLLDVKMRLANSLG 478 (532) Q Consensus 447 ~~~~~-----------------~laq~~p~~~-d~id~d~~~~~~a~~~G 478 (532) ...++ ...+..|... +.=+....++++-..+. T Consensus 370 R~~~g~~p~~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 419 (419) T protein:vir:80 370 RRLENMPPVKGGDIYLSPMNMVDASKPQPIPMGKTEPTKAALDEIGRILS 419 (419) T ss_pred HHHhCCCCCCCcceeeeccccccccccccccCCCCCchhhhHHHHHhhcC Confidence 11110 0001111111 01122223333322222 No 200 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=21.61 E-value=2.6 Score=18.33 Aligned_cols=427 Identities=11% Similarity=0.105 Sum_probs=142.1 Q ss_pred CCCC--CCCccCHHHHHHHHHHHH------H-----------HhhhHHHHHHHH-HHhhcccccC-C----CCCcccc-- Q lcl|NC_015159. 1 MAEV--EKTGFAADGAAAAYNRLK------N-----------DRGAYETRAEDC-ATYTIPSVFP-S----ATADGST-- 53 (532) Q Consensus 1 m~~~--~~~~~~~~~~~~r~~~lk------~-----------~R~~~e~~w~e~-~~~~~P~~~~-~----~~~~~~~-- 53 (532) |-+- +..++...++..-++... . .++.....+... -.+.-|.... . .+.+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (574) T protein:vir:80 1 MPKWLDKALGIEKSSIEETRNMENYKMHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRN 80 (574) T ss_pred CcchhhhhhccchhhHHHHHhhhhhccccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCC Confidence 2211 111111111110000000 0 000011111110 0000010000 0 0000000 Q ss_pred --ccccc---c-cc-hHHHHHHHHHHHHHH-----hhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 54 --SYTTP---W-QS-IGARGLNNLASKLML-----ALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMN 121 (532) Q Consensus 54 --~~~~~---~-ds-t~~~a~~~Laa~l~~-----~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~ 121 (532) .++.+ | ++ +...+++.-++-+.+ .-+-.+-||. +...|.+-.. ...+..+ . ..++. T Consensus 81 ~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~-i~~kd~~~~~---~~~~~~~-~-------~~l~~ 148 (574) T protein:vir:80 81 SQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYE-IRLKDIEAEP---TSHDIAN-I-------KRIES 148 (574) T ss_pred cccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceE-EEEeccCCCc---cchhhhh-h-------hHHHH Confidence 01111 1 11 111222322222221 1133456664 2222111000 0000001 0 11223 Q ss_pred HHHh---------cCChHHHHHHHHHHHhhCceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHH Q lcl|NC_015159. 122 YMES---------NSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARA 192 (532) Q Consensus 122 ~l~~---------snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~ 192 (532) .|+. ..|..-+..++.++..+||+.+++..+. .+....+..++...+.+..|.+|.+.. T Consensus 149 ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~--~G~~~~L~pl~p~~V~v~~d~~~~~~~---------- 216 (574) T protein:vir:80 149 FLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKVFDK--DGNFIKFDTVDPTTIFLATNGEGKLIK---------- 216 (574) T ss_pred HHhccCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEECC--CCcEEEEEEEcCceeEEEEcCcccccc---------- Confidence 3332 2344556667788889999988765432 233333444444556666666553321 Q ss_pred HhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEEcCcccccccccCccccCceEEEEeeecCC---Cccc Q lcl|NC_015159. 193 ALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPN---EDYG 269 (532) Q Consensus 193 ~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g---~~YG 269 (532) ++ +..++..+|.... .+..+ =+++.|.+..++ ..|| T Consensus 217 ---------------------------------~~--~~y~~~~~g~~~~----~~~~~--eiih~~~~~~~~~~~~~~G 255 (574) T protein:vir:80 217 ---------------------------------NG--ERFVQVIDNRIVA----KFNER--ELAFAVRNPRADIEVGQYG 255 (574) T ss_pred ---------------------------------Cc--eEEEEEeCCceEE----EEccc--cEEEEeccCCCCccccccc Confidence 00 1112222222111 01111 134444333332 4699 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee--cCccccChhhhc----------cC-C-Cce--eecCccccccc Q lcl|NC_015159. 270 RSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFV--NPNGVTQIRRVA----------KA-N-TGD--FVAGRKQDVEV 333 (532) Q Consensus 270 ~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv--~~~g~~~~~~~~----------~~-~-~G~--~v~g~~~~~~~ 333 (532) .+|..-+...+.......+.......-...|..++ +.+..++.+.+. .+ . .|. ++.+ +++.. T Consensus 256 ~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~--~G~~~ 333 (574) T protein:vir:80 256 YPELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSA--EDVKF 333 (574) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecC--CCceE Confidence 99999999999888888888888877777888444 333334544321 11 1 121 2212 23444 Q ss_pred cccCC-ccchhHHHHHHHHHHHHHHHHHhhhh--cccCCCC--------CC---CHHHHHHHHHHHHHHhhhhHHHHHHH Q lcl|NC_015159. 334 FQLEK-YNDFQVAKATADDIEKRLSYAFMLNS--AVQRGGD--------RV---TAEEIRYVAGELEDTLGGVYSLLSQE 399 (532) Q Consensus 334 ~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~~~--------~~---TAtEi~~r~~E~~~~LGpv~~rl~~E 399 (532) .++.. ..+.+ ..+..+..+..|-++|-... +...+.. .+ |+++. .. .+... T Consensus 334 ~~l~~s~~D~q-fle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~--~~------------~f~~~ 398 (574) T protein:vir:80 334 VNMTPSANDMQ-FEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEK--MQ------------ASQNK 398 (574) T ss_pred EEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHH--HH------------HHHHH Confidence 44442 23444 33556667788888884321 1111111 11 12221 11 22333 Q ss_pred HHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCHHHHHHHHHHhcCC Q lcl|NC_015159. 400 LQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGM 479 (532) Q Consensus 400 ~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~Gv 479 (532) -|.|++.++-..+.+ .+|++.. ..+.+.+.. .+.+.++...+ +...++ + ..+..+++- +.+|. T Consensus 399 tL~P~~~~ie~~ln~-~Ll~~~~-~~~~~~f~~-~d~~~~~~~~~-~~~~~~--~-------G~lT~NE~R----~~lgl 461 (574) T protein:vir:80 399 GLQPLLRFIEDTVNT-YIVAEFG-EKYQFQFRG-GDLSAQLDKLK-IIEQEG--K-------VFRTVNEIR----HDKGL 461 (574) T ss_pred HHHHHHHHHHHHHHh-hhhhhcC-CceEEEecc-cchhhHHHHHH-HHHHHh--C-------CccCHHHHH----HHhCC Confidence 444444444333322 2333322 222333321 22233332221 111111 0 112222211 12344 Q ss_pred CHh----------HccCCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhcccccCCC------------------- Q lcl|NC_015159. 480 DTT----------GLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGLP------------------- 530 (532) Q Consensus 480 ~p~----------~i~~s~ee~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~------------------- 530 (532) +|- .+..-.+..+.. +...+ ...+...++....+.....++.=.| T Consensus 462 ~Pi~gGD~~~~~~n~~~~~~~~~~~----~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~~~~~ 535 (574) T protein:vir:80 462 EPIKGGDVILNGVHIQAIGQALQEE----QLEYQ--RSQDRLNRLLELSGGDVEQPEPEEPKDSQNDTDVSFQDEQQGLN 535 (574) T ss_pred CCCCCCCEeeeccceeecccccccc----cCCcc--chhccccccccccCCCCCCCCCCCCCCccccccchhhhhhhhhc Confidence 331 111110000000 00000 0000000000000000000000000 Q ss_pred ----------CC Q lcl|NC_015159. 531 ----------TQ 532 (532) Q Consensus 531 ----------~~ 532 (532) ++ T Consensus 536 ~~~~~~~~~~~~ 547 (574) T protein:vir:80 536 GKSKKVNGKVDD 547 (574) T ss_pred cchhhhcCCccc Confidence 00 No 201 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=21.45 E-value=2.6 Score=18.31 Aligned_cols=295 Identities=12% Similarity=0.059 Sum_probs=119.0 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCccccc--ccccccchHHHHHHHHHHHHHHhh Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTS--YTTPWQSIGARGLNNLASKLMLAL 78 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~--~~~~~dst~~~a~~~Laa~l~~~l 78 (532) +.++++.- +...+ + .+...|.+.-+|.-|-... .+=.+ ....+.+....+-++..+.+ + T Consensus 26 ~~~p~~~~-~~~~~---~--------~~~~~~~~~~~~~~pp~~~----~~la~l~~~~~~h~~~i~~k~n~l~~l---~ 86 (346) T protein:vir:10 26 FGDPIPVL-DRADI---L--------NYLECSAMYEKWYNPPMSF----DGLAKSLRSSTHHESAIITKANILLST---C 86 (346) T ss_pred cCCcceec-CchhH---H--------HHHHHhhcCCceEecCCCH----HHHHHHHHhhhhcchhhhhhhhhHHHH---H Confidence 55554311 11112 1 1122233333343332110 00000 00112222222222222111 1 Q ss_pred cCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHHhhCceeeeecccccccCC Q lcl|NC_015159. 79 FPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQ 158 (532) Q Consensus 79 tpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~~~~ 158 (532) --|+ ||. .+. .+.+++.|+.+||||.+++..+ .. T Consensus 87 ~~Pn-~~~-------------------------------------t~~----~f~~~~~d~ll~Gnay~~i~r~----~~ 120 (346) T protein:vir:10 87 EVDS-RYL-------------------------------------SRR----DLSSFVKDYLVFGNAYFEVVRN----RL 120 (346) T ss_pred hCCC-CCC-------------------------------------CHH----HHHHHHHHHHhcCCeEEEEEEc----CC Confidence 0112 111 111 2345566888999998876432 12 Q ss_pred cceEEEEecceEEEee--CCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEEE Q lcl|NC_015159. 159 SNAPKLYKLHNFVVER--DAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEI 236 (532) Q Consensus 159 ~~~~~~~pl~~~~v~~--d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~~ 236 (532) +-....+|+..-++.. +.++. + | .++.. T Consensus 121 G~~~~L~pl~~~~v~~~~~~~~~----~---------------------------------~-------------~~~~~ 150 (346) T protein:vir:10 121 GQVQRIESPLAKYVRKGLEAGQF----Y---------------------------------Y-------------VPQRF 150 (346) T ss_pred CcEEEEEEecCCceEEEEcCCeE----E---------------------------------E-------------EEEcc Confidence 2233444443322222 11110 0 0 00011 Q ss_pred cCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee-cCccccChhhh Q lcl|NC_015159. 237 DGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFV-NPNGVTQIRRV 315 (532) Q Consensus 237 ~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv-~~~g~~~~~~~ 315 (532) +|... .+. .--++..|.....+..||.+|...++..+...+..++-....-.-...|..++ -+|..++.++. T Consensus 151 ~g~~~-----~~~--~~dIih~r~~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d~~l~~e~~ 223 (346) T protein:vir:10 151 DHQEH-----EFA--KGSIYHLLEPDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMSDASQKQEDV 223 (346) T ss_pred CCeEE-----EEe--cccEEEecCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHH Confidence 11100 000 11134455444457799999999999888888877777777777777788654 35655665543 Q ss_pred ccC----------CC-ce-ee-c--CccccccccccCCc-cchhHHHHHHHHHHHHHHHHHhhh----hcccCCC-CCCC Q lcl|NC_015159. 316 AKA----------NT-GD-FV-A--GRKQDVEVFQLEKY-NDFQVAKATADDIEKRLSYAFMLN----SAVQRGG-DRVT 374 (532) Q Consensus 316 ~~~----------~~-G~-~v-~--g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~----~~~~~~~-~~~T 374 (532) ... +| |. ++ . |...++...|+... .+.+ ..+..+..++.|-.+|-.- .....++ ..-+ T Consensus 224 ~~i~~~~~~~~g~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~q-f~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~ 302 (346) T protein:vir:10 224 ENIRQQLKQSKGVGNFKNLFVHAPNGKKDGIQIIPIADVSAKDE-FFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGN 302 (346) T ss_pred HHHHHHHHHhcCccccCceeEecCCCCccceeEEecCCChhHHH-HHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCccc Confidence 211 11 11 12 2 22334455554432 2333 3334455577788887321 1111111 1223 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHHHHHH Q lcl|NC_015159. 375 AEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHD 442 (532) Q Consensus 375 AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~raq~ 442 (532) +++....- ....|.|...++. .+... | ..+.++. ....|.|+-+ T Consensus 303 ~e~~~~~f--~~~~l~P~~~~ie------------e~n~~---L---~~e~i~F----~~~~ll~~~~ 346 (346) T protein:vir:10 303 VADAAEVF--FITEIEPLQERLK------------EFNQW---L---GQEVIKF----KPSKLLQRTQ 346 (346) T ss_pred HHHHHHHH--HHHHHHHHHHHHH------------HHHhh---c---ccceeee----chhhhcccCC Confidence 44433322 2223455555553 22211 1 1121111 1233333333 No 202 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=20.20 E-value=2.8 Score=18.12 Aligned_cols=116 Identities=9% Similarity=-0.009 Sum_probs=57.1 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhccc---ccCCC----CCc---c------------------- Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPS---VFPSA----TAD---G------------------- 51 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~---~~~~~----~~~---~------------------- 51 (532) ||-.-..-++-+.+.+++++|...=..-.+.-+++.++..-. +|... ..+ . T Consensus 1 Ms~~i~i~~~~~~l~~~L~~l~~~~~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~~~~ 80 (175) T protein:vir:10 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred CceeEEEEecHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhhhhh Confidence 886422333556677777777543222334455555555443 22111 100 0 Q ss_pred -----cccccccccchHHHHHHH-----------------HHHHHHHhh--------cCCCCCccccCCChHHHhhhccC Q lcl|NC_015159. 52 -----STSYTTPWQSIGARGLNN-----------------LASKLMLAL--------FPVGSSFFKLNVSELEVKQSITS 101 (532) Q Consensus 52 -----~~~~~~~~dst~~~a~~~-----------------Laa~l~~~l--------tpp~~~WF~l~~~d~~~~~~~~~ 101 (532) .....++...||.. .+. .|+-..-|. .=|.|||+.++-.|... T Consensus 81 ~~~~~~~~~~~~L~~tG~L-~~Si~~~~~~~~v~vGtn~~YAaiHqfGg~~~~~~~v~iPaRpfLG~s~~d~~~------ 153 (175) T protein:vir:10 81 AAASRRKAGLMILQDSGQM-AASVSTDHDDNSAVIGSNKEYAAIHQFGGQAGRGLKVTIPARPWLPVTADGELQ------ 153 (175) T ss_pred hhhhhhccCCCcceechhh-hhhhheeecCCEEEEecChhhhhhhhcccccCCCCccccCCccccCCCcccccc------ Confidence 01122333333321 111 122222222 34799999998665421 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015159. 102 PEELTEIATGLAMVERICMNYMESN 126 (532) Q Consensus 102 ~~~~~~v~~~L~~ve~~~~~~l~~s 126 (532) ..+++.+|+.+.+.+...|.+- T Consensus 154 ---~e~~~~Il~~~~~~l~~~~~~~ 175 (175) T protein:vir:10 154 ---PEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred ---hHHHHHHHHHHHHHHHHHhccC Confidence 1346778888888887777766 No 203 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=20.09 E-value=2.8 Score=18.10 Aligned_cols=453 Identities=9% Similarity=-0.001 Sum_probs=160.0 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHH-----HHHHhhcccccCCCCCcc--cccc--------------cccc Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAE-----DCATYTIPSVFPSATADG--STSY--------------TTPW 59 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~-----e~~~~~~P~~~~~~~~~~--~~~~--------------~~~~ 59 (532) |.+.++ +....+.+---.+.+...+....+. -+..|..+.-...+.+.. ..-. ..+| T Consensus 66 ~~~~~~--~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f~gyql~alY 143 (862) T protein:vir:99 66 VEISDS--VNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGFIGHQACALI 143 (862) T ss_pred cccccc--ccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccccccchhccccccccCcccHHHHHHH Confidence 333331 1121111100000111111111111 133344443221111100 0000 0111 Q ss_pred --cchHHHHHHHHHHHHHHhhcCCCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHHhcCChHHHHHHHH Q lcl|NC_015159. 60 --QSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIK 137 (532) Q Consensus 60 --dst~~~a~~~Laa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~ 137 (532) +..+-.+|++.|-.+ -+.|+.+...+.... . .+ + ..+.+.+.+.+-+....+.++++ T Consensus 144 ~~~~larkiVd~pAeDa-------tR~g~~I~~~~d~~e-~--~~----e-------~~~~ie~~~~rL~v~~~l~eair 202 (862) T protein:vir:99 144 AQHWLVDKACSLAGEDA-------IRNGWHLKSLGEGEE-I--DE----E-------SLEKFKAIDVEFKVKENLIEFNR 202 (862) T ss_pred HhCchhhhhhhhhhHHH-------hhCCceEeecCcccc-c--CH----H-------HHHHHHHHHHHhhHHHHHHHHHH Confidence 222233344433333 357888875422111 0 00 1 11223345555677888889998 Q ss_pred HHHhhCceeeeecccccccCCcceEEEEecceEEEeeCCCCCeEEE--EEEEeecHHHhhHHHHHHHHhhcccCCCcceE Q lcl|NC_015159. 138 QLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQI--VTEDKIARAALPEDVRKSLEEAQGDQNPSEEV 215 (532) Q Consensus 138 dl~~~G~~~~~v~~~~~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i--~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v 215 (532) .--.||.+++++.-+..+. .. + .-||.- +.=..|.+-.| +-..+.+. + .+. .........+. .+. T Consensus 203 ~~RLyGga~ililv~~~D~-~~--L-sqPLn~---e~I~kG~lkgl~vlDp~w~~p--~--~v~-~~~~Dp~sp~y-GkP 269 (862) T protein:vir:99 203 FKNVFGIRVAIFVVDSEDP-DY--Y-EKPFNP---DGITPGSYRGISQIDPYWMMP--M--LTA-ESTADPSSQFF-YEP 269 (862) T ss_pred hcccccceEEEEEecCcCc-hh--h-hcCcCc---ccccccceeEEEEechhhhcc--c--ccc-ccccccccccc-CCc Confidence 7778998777653222111 00 0 112210 00011211111 11111110 0 000 00000000010 111 Q ss_pred EEEEEEEeeCCCCeEEEEEEEcCcccccccccCcc--ccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015159. 216 TIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEYPL--DSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKM 293 (532) Q Consensus 216 ~i~~~v~~~~~~~~~~s~~~~~~~~~~~~~~~~g~--~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~ 293 (532) +.| .+.|..++.. +.--| +..|+ +.+....-||+|..+.++..++..........+. T Consensus 270 ~~y----------------~I~g~~IH~S-Rliif~g~~vpd----~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~L 328 (862) T protein:vir:99 270 EFW----------------IISGQKYHRS-HLIIARGPQPAD----ILKPTYIFGGIPLVQRIYERVYAAERTANEAPLL 328 (862) T ss_pred eee----------------eecCeeeccc-eeEEecCCCchh----hhhccCCccCccHHHHHHHHHHHHHHHHHHHHHH Confidence 111 1122222110 00001 12233 2333444579999998988888888777777666 Q ss_pred HHHHhcCceeec-------CccccChhhh-cc--CCCceeecCccccccccccCCccchhHHHHHHHHHHHHHHHHHh-- Q lcl|NC_015159. 294 SMISSKVLFFVN-------PNGVTQIRRV-AK--ANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFM-- 361 (532) Q Consensus 294 ~~~a~~p~~lv~-------~~g~~~~~~~-~~--~~~G~~v~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-- 361 (532) +..+.-..+-++ ++++.....+ .. ...|.++-+..++...+. .+|.-+...+....+.|.-++= T Consensus 329 l~ka~l~v~ktd~l~~l~~ed~l~~r~~~~~~~rdN~Gi~liD~eEe~e~ls----~slSGL~dll~~~~q~IAaas~IP 404 (862) T protein:vir:99 329 AMNKRTTAIHTDTAKAIANEDKFIQRLMFWVRYRDNHAVKVLGTDETMEQFD----TSLADFDAVIMGQYQLVASIAKTP 404 (862) T ss_pred HHHhccceeechhHhhhccHHHHHHHHHHHHhccCcceeEEecCCCceeEEe----cccCChHHHHHHHHHHHHhhhCCC Confidence 555554443332 1222111111 11 112333333334443332 2344455667777777777751 Q ss_pred hh-hccc-CCCCCCCHH-HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccccccceeecchHHHH Q lcl|NC_015159. 362 LN-SAVQ-RGGDRVTAE-EIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALG 438 (532) Q Consensus 362 ~~-~~~~-~~~~~~TAt-Ei~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~v~~l~~l~ 438 (532) .. .+.+ ..+-.=|.. ++.. .---+..++...+.|+++|++.++....- +| ..+.+.+ .+|..+. T Consensus 405 ~tiLfGqspaGlnATGE~D~~n--------YyD~I~s~QE~~L~P~LerL~~li~~~lg---~~-~d~~ieF-npL~~~s 471 (862) T protein:vir:99 405 ATKLLGTAPKGFNSTGEFETIS--------YHEELESIQEHVYMPFLQRHYLISRLSLG---IQ-HEIDVVM-EPVASMT 471 (862) T ss_pred ceeecccCcccccCchHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHhcC---CC-CcceEEe-CCCCCCC Confidence 11 1111 123222444 3221 22223344566788999999888765421 22 2344443 3454444 Q ss_pred HHHHHHH---HHHHHHHHHhhcchhhhhcCHHHHHHHHHHh--cC---CCHhHcc----CCHHHHHHHHH----HHHHHH Q lcl|NC_015159. 439 RGHDLNK---LNVFIDYMIKLAGLQDDDINLLDVKMRLANS--LG---MDTTGLI----LTQQDKQAKMA----EASTAA 502 (532) Q Consensus 439 raq~~~~---l~~~~~~laq~~p~~~d~id~d~~~~~~a~~--~G---v~p~~i~----~s~ee~~~~~~----q~~~~~ 502 (532) ...+++. .....+.+.+ ...|+.+++.+.++.. .| ++...+- .++++..+-.. +.+.-. T Consensus 472 ekEkAEi~kk~Aea~~~lv~-----sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~a~~~ap~ 546 (862) T protein:vir:99 472 AQQQADLNKTKAEGGKVLID-----GGVISPDEERNRIRDDKRSGYNRLTKEDAEETPGASPENLAAYQKAGAAQETASA 546 (862) T ss_pred HHHHHHHHHHHHHHHHHHHh-----cCCCCHHHHHHHHHhcCCcCCCCCCcccccccCCCCcccccccccCCcccccccc Confidence 4444433 3333333322 1257788877776532 11 1111110 00111100000 000000 Q ss_pred HHHHHHHhhhHH--------------HHHHHHhhcccccCCCCC Q lcl|NC_015159. 503 GMVTAGQQMGAA--------------GGQAAAAMMQQQAGLPTQ 532 (532) Q Consensus 503 ~~~~~~~~~~~~--------------~~~~~~~~~~~~~g~~~~ 532 (532) ...+++.+...+ .|+..+...+..+-+|+- T Consensus 547 de~~aga~~~~~e~d~~~~p~~~~~~~g~~~~~t~~~~a~~p~~ 590 (862) T protein:vir:99 547 KETQAGAAVTTAEGDQPNVQMVPSMKPGQMVGPEVGITAPMPED 590 (862) T ss_pred cccccccCCccccCCcccccccCCCCCCCccccccccccCCCcc Confidence 000000000000 000000000001111110 No 204 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=20.08 E-value=2.8 Score=18.10 Aligned_cols=358 Identities=12% Similarity=0.033 Sum_probs=131.6 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHhhhHHHHHHHHHHhhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_015159. 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFP 80 (532) Q Consensus 1 m~~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltp 80 (532) |+== +..+..+.++...|..+.- ..+. .+..... -+-.++--.|++.+|+.+.+. T Consensus 1 m~~f--------------~~~~~~~~~~~~~~~~~~~-~~~~----~~~~~~~---Al~~~~V~~~i~~Ia~~iA~l--- 55 (406) T protein:vir:97 1 MSFF--------------QPLGTSKVSYDDYISSVLA-GDVS----QKYLGVS---ALKNSDILTATSIIAGDIARF--- 55 (406) T ss_pred Cccc--------------cccCCCCCCcchHHHHHhc-CCCC----cccccch---hhccHHHHHHHHHHHHhhhhC--- Confidence 3322 1112222233333433311 0000 0000000 111233234566666655542 Q ss_pred CCCCccccCCChHHHhhhccChhHHHHHHHHHHHHHHHHHHHHH-----hcCChHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_015159. 81 VGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYME-----SNSFRPTLHAAIKQLLVAGNVLLYIPSTEQV 155 (532) Q Consensus 81 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-----~snf~~~~~~~~~dl~~~G~~~~~v~~~~~~ 155 (532) ||-..... .+... +..+...|+ .-+.+.-....+.+|...|||.+|+..+. . T Consensus 56 ---p~~~~~~~-g~~~~------------------~~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~-~ 112 (406) T protein:vir:97 56 ---PLVKKDVN-GDIIH------------------DEDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDP-K 112 (406) T ss_pred ---eeEEEecC-ccccc------------------cchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC-C Confidence 34322211 11100 111223332 22344555667788889999999886431 1 Q ss_pred cCCcceEEEEecceEEEeeCCCCCeEEEEEEEeecHHHhhHHHHHHHHhhcccCCCcceEEEEEEEEeeCCCCeEEEEEE Q lcl|NC_015159. 156 EGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEEAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQE 235 (532) Q Consensus 156 ~~~~~~~~~~pl~~~~v~~d~~G~vd~i~rk~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~s~~~ 235 (532) .+....+..++.+.+-+..+..|++- | ++. .+ ..+.. + T Consensus 113 ~g~~~~L~~i~p~~v~v~~~~~~~~~--y-~~~---------------------------------~~-~~~~~---~-- 150 (406) T protein:vir:97 113 TNQALQFQFYRPSETTVEETDNHEIV--Y-TFT---------------------------------DM-LTAKQ---V-- 150 (406) T ss_pred CCeEEEEEEECCCeeEEEEcCCceEE--E-EEE---------------------------------ec-CCceE---E-- Confidence 12223333333455555555444321 1 110 00 00000 0 Q ss_pred EcCcccccccccCccccCceEEEEeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecCccccChhhh Q lcl|NC_015159. 236 IDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRV 315 (532) Q Consensus 236 ~~~~~~~~~~~~~g~~~~P~~~~Rw~~~~g~~YG~Gp~~~al~d~~~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~~ 315 (532) .+..++ +++.|....+| .||.||...+...+.....+.+.......-...|-++..+++.++.+.. T Consensus 151 -----------~~~~~e--vih~r~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~ 216 (406) T protein:vir:97 151 -----------KCFAHD--VIHWKFFSHDT-ILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMKGAQLSGDAR 216 (406) T ss_pred -----------EEcccc--EEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEecCCCCCHHHH Confidence 000011 23333332333 6799999988888888788888777766667777777667766666654 Q ss_pred ccC----------CC-ceeecCccccccccccC-CccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCCCCHHHHHHHH Q lcl|NC_015159. 316 AKA----------NT-GDFVAGRKQDVEVFQLE-KYNDFQVAKATADDIEKRLSYAFMLNS-AVQRGGDRVTAEEIRYVA 382 (532) Q Consensus 316 ~~~----------~~-G~~v~g~~~~~~~~~~~-~~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~~~TAtEi~~r~ 382 (532) ... .+ |.+.. -.++....++. +..+.+.. +..+..+..|-++|-.-. +....+..-+.+|. . T Consensus 217 ~~~~~~~~~~~~g~n~g~~~v-l~~g~~~~~l~~~~~d~q~l-e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~e~~---~ 291 (406) T protein:vir:97 217 QRARQEFEKMREGSVGGSPLV-FDSTMEYTPLEIDTNVLQLI-TSNNFSTAQIAKALRVPSYKLGVNSPNQSVAQL---M 291 (406) T ss_pred HHHHHHHHHHhcccccCceee-cCCCceEEEccCCHHHHHHH-HHHHhhHHHHHHHhCCCHHHcCCCCCcchHHHH---H Confidence 221 11 11110 11222333333 12233332 334444677777773221 11111111111221 1 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccccee-ecchHHHHHHHHHHHHHH--------HHHHH Q lcl|NC_015159. 383 GELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAI-ATGLEALGRGHDLNKLNV--------FIDYM 453 (532) Q Consensus 383 ~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lp~~p~~~~~~~~-v~~l~~l~raq~~~~l~~--------~~~~l 453 (532) .+ +...-|.|++.++...+.+. +|++-......+++ +..+. -.++..+.++.. +.. + T Consensus 292 ~~-----------f~~~~l~P~~~~ie~~l~~k-ll~~~~~~~~~i~fd~~~~~-~~~~~~~~~~~~~g~~T~NE~R~-~ 357 (406) T protein:vir:97 292 ED-----------YVTNDLPFYFDAITSELGLK-TLNDKDRRLYHIEFDTRSVT-GRNVDEIVKLVNNQILTPNQGLV-E 357 (406) T ss_pred HH-----------HHHHHHHHHHHHHHHHHhhh-hcChhhccceeEEEecCccc-hhhHHHHHHHHhCCCcCHHHHHH-H Confidence 11 12233444444443333221 33322112222222 11110 011111111111 000 1 Q ss_pred Hhhcc---hhhh-------hcCHHHHHHHHHHh----------cCC-CHh Q lcl|NC_015159. 454 IKLAG---LQDD-------DINLLDVKMRLANS----------LGM-DTT 482 (532) Q Consensus 454 aq~~p---~~~d-------~id~d~~~~~~a~~----------~Gv-~p~ 482 (532) ....| +..| .+..|. .+.+.+. .|= +-+ T Consensus 358 ~g~~p~~~~~gD~~~~~~n~~~~~~-~~~~~~~~~~~~~gg~~~~~~~~~ 406 (406) T protein:vir:97 358 LGKQKSTDPNMDRYQSSLNYVFLDK-KEEYQDKVGIKGKGGEVNAEEDKS 406 (406) T ss_pred hCCCCCCCCCCCeEeeccCccchhc-ccccccccccccCCCCCCCCCCCC Confidence 11122 1111 112221 1111110 110 112 Done!