Query lcl|NC_011045.1_cdsid_YP_002003970.1 [gene=8] [protein=gp8] [protein_id=YP_002003970.1] [location=18705..20315]
Match_columns 536
No_of_seqs 129 out of 166
Neff 7.9
Searched_HMMs 1612
Date Thu Nov 7 13:02:32 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_37 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_37_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:10447 Length: 536 100.0 1E-188 9E-192 1050.3 60.1 536 1-536 1-536 (536)
2 protein:vir:2198 Length: 536 # 100.0 2E-188 1E-191 1050.0 59.7 536 1-536 1-536 (536)
3 protein:vir:1538 Length: 535 # 100.0 1E-181 9E-185 1012.1 60.4 534 1-535 1-535 (535)
4 protein:vir:3361 Length: 535 # 100.0 8E-180 5E-183 1002.5 59.9 534 1-535 1-535 (535)
5 protein:vir:94572 Length: 535 100.0 4E-179 2E-182 998.8 59.2 533 1-535 1-535 (535)
6 protein:vir:8883 Length: 543 # 100.0 5E-177 3E-180 987.0 58.5 535 1-536 1-538 (543)
7 protein:vir:94709 Length: 522 100.0 2E-175 1E-178 978.2 58.9 521 1-526 1-522 (522)
8 protein:vir:99672 Length: 532 100.0 6E-170 4E-173 948.4 57.6 528 1-530 1-532 (532)
9 protein:vir:78696 Length: 542 100.0 3E-165 2E-168 922.7 57.5 518 9-536 1-535 (542)
10 protein:vir:78942 Length: 510 100.0 9E-163 6E-166 909.0 57.0 505 9-536 1-510 (510)
11 protein:vir:100039 Length: 522 100.0 1E-162 7E-166 908.4 54.4 510 11-535 1-522 (522)
12 protein:vir:6322 Length: 510 # 100.0 4E-162 3E-165 905.3 56.6 504 9-523 1-510 (510)
13 protein:vir:103330 Length: 517 100.0 2E-161 1E-164 902.0 56.8 509 1-533 1-517 (517)
14 protein:vir:1785 Length: 555 # 100.0 8E-160 5E-163 892.9 56.0 517 9-535 1-555 (555)
15 protein:vir:96988 Length: 516 100.0 2E-160 1E-163 895.7 52.8 507 1-535 1-516 (516)
16 protein:vir:80211 Length: 514 100.0 1E-159 6E-163 892.3 55.7 505 9-521 1-514 (514)
17 protein:vir:103765 Length: 549 100.0 2E-159 1E-162 890.6 54.2 513 1-536 1-548 (549)
18 protein:vir:107822 Length: 555
100.0 4E-159 2E-162 889.3 55.3 517 1-533 1-555 (555)
19 protein:vir:107404 Length: 555 100.0 4E-159 2E-162 889.3 55.3 517 1-533 1-555 (555)
20 protein:vir:98506 Length: 555 100.0 4E-159 2E-162 889.3 55.3 517 1-533 1-555 (555)
21 protein:vir:7017 Length: 515 # 100.0 8E-159 5E-162 887.4 54.3 507 1-526 1-515 (515)
22 protein:vir:105641 Length: 516 100.0 2E-157 1E-160 879.8 54.1 507 1-526 1-516 (516)
23 protein:vir:7321 Length: 556 # 100.0 2E-155 1E-158 868.9 56.3 516 1-534 1-556 (556)
24 protein:vir:95315 Length: 559 100.0 2E-154 9E-158 863.9 55.9 516 1-534 1-559 (559)
25 protein:vir:102668 Length: 547 100.0 1E-154 7E-158 864.6 55.1 505 8-527 1-547 (547)
26 protein:vir:94599 Length: 641 100.0 9.1E-90 5.7E-93 508.7 44.1 518 1-536 20-623 (641)
27 protein:vir:80165 Length: 651 100.0 2.3E-73 1.4E-76 418.8 46.2 521 1-536 3-631 (651)
28 protein:vir:95449 Length: 584 100.0 1.2E-43 7.4E-47 255.9 36.9 494 1-518 1-584 (584)
29 protein:vir:3139 Length: 599 # 100.0 3E-40 1.9E-43 237.3 37.7 502 1-533 1-599 (599)
30 protein:vir:8846 Length: 705 # 100.0 1.1E-33 6.6E-37 201.3 44.6 510 1-536 1-645 (705)
31 protein:vir:95821 Length: 763 100.0 1.2E-26 7.6E-30 162.6 42.8 514 1-536 1-707 (763)
32 protein:vir:93630 Length: 776 99.9 1E-20 6.3E-24 130.2 38.4 513 1-536 22-685 (776)
33 protein:vir:108295 Length: 711 99.8 1.6E-18 9.8E-22 118.2 40.7 523 1-536 1-691 (711)
34 protein:vir:817 Length: 714 # 99.8 1.1E-15 6.9E-19 102.6 41.1 513 1-536 8-696 (714)
35 protein:vir:9950 Length: 714 # 99.8 1.1E-15 6.9E-19 102.6 41.1 513 1-536 8-696 (714)
36 protein:vir:3296 Length: 714 # 99.8 1.1E-15 6.9E-19 102.6 41.1 513 1-536 8-696 (714)
37 protein:vir:2764 Length: 714 # 99.8 1.1E-15 6.9E-19 102.6 41.1 513 1-536 8-696 (714)
38 protein:vir:10117 Length: 714 99.8 1.1E-15 6.9E-19 102.6 41.1 513 1-536 8-696 (714)
39 protein:vir:100920 Length: 725 99.7 8E-16 4.9E-19 103.4 34.9 511 1-536 1-669 (725)
40 protein:vir:77597 Length: 725 99.7 9E-16 5.6E-19 103.1 35.2 512 1-536 1-669 (725)
41 protein:vir:9263 Length:
725 # 99.7 2.4E-15 1.5E-18 100.8 35.1 511 1-536 1-669 (725)
42 protein:vir:105429 Length: 708 99.7 4.4E-15 2.8E-18 99.3 35.3 512 1-536 1-684 (708)
43 protein:vir:104437 Length: 714 99.6 1.6E-13 1E-16 90.7 39.8 512 1-536 1-694 (714)
44 protein:vir:105520 Length: 706 99.6 8.9E-14 5.5E-17 92.1 36.1 510 1-536 1-666 (706)
45 protein:vir:105619 Length: 772 99.6 3.9E-13 2.4E-16 88.6 41.4 504 1-536 11-704 (772)
46 protein:vir:172 Length: 708 # 99.6 2.1E-14 1.3E-17 95.6 29.6 512 1-536 1-672 (708)
47 protein:vir:3520 Length: 720 # 99.5 2.6E-13 1.6E-16 89.6 28.8 509 1-536 1-675 (720)
48 protein:vir:2500 Length: 501 # 99.4 6.7E-11 4.1E-14 76.4 33.1 454 1-536 16-499 (501)
49 protein:vir:7768 Length: 484 # 99.3 1E-10 6.5E-14 75.3 39.0 451 1-535 1-484 (484)
50 protein:vir:99072 Length: 479 99.3 1.4E-10 8.8E-14 74.6 38.4 444 1-536 1-474 (479)
51 protein:vir:2427 Length: 485 # 99.3 1.7E-10 1.1E-13 74.1 40.0 444 1-536 1-479 (485)
52 protein:vir:5961 Length: 503 # 99.3 2.2E-10 1.3E-13 73.6 36.4 458 1-535 1-503 (503)
53 protein:vir:2341 Length: 488 # 99.3 2.4E-10 1.5E-13 73.3 41.4 457 1-536 1-487 (488)
54 protein:vir:78227 Length: 480 99.2 5.4E-10 3.3E-13 71.4 40.0 440 1-536 1-474 (480)
55 protein:vir:99916 Length: 504 99.2 6.3E-10 3.9E-13 71.0 35.6 450 1-536 1-502 (504)
56 protein:vir:106639 Length: 481 99.2 7.6E-10 4.7E-13 70.6 39.1 438 1-531 23-481 (481)
57 protein:vir:94805 Length: 492 99.2 1E-09 6.3E-13 69.9 36.9 437 1-532 37-492 (492)
58 protein:vir:104082 Length: 485 99.2 1.1E-09 6.6E-13 69.8 41.8 446 1-536 1-478 (485)
59 protein:vir:95113 Length: 474 99.2 1.1E-09 7.1E-13 69.6 35.7 440 1-528 20-474 (474)
60 protein:vir:4223 Length: 486 # 99.1 1.3E-09 7.8E-13 69.4 41.1 448 1-536 1-479 (486)
61 protein:vir:97336 Length: 492 99.1 1.3E-09 7.8E-13 69.4 37.6 437 1-533 35-492 (492)
62 protein:vir:80680 Length: 441 99.1 1.5E-09 9.6E-13 68.9 41.0 419 1-522 1-441 (441)
63 protein:vir:733 Length: 453 # 99.1 1.7E-09 1E-12 68.7 40.9 432 1-520 11-453 (453)
64 protein:vir:9306 Length: 511 #
99.1 1.7E-09 1.1E-12 68.6 40.1 450 1-536 31-508 (511)
65 protein:vir:93747 Length: 472 99.1 2.5E-09 1.5E-12 67.8 36.6 435 1-533 18-472 (472)
66 protein:vir:3964 Length: 453 # 99.1 2.6E-09 1.6E-12 67.6 39.6 435 1-531 11-453 (453)
67 protein:vir:78537 Length: 480 99.1 2.8E-09 1.7E-12 67.5 37.7 441 1-536 1-474 (480)
68 protein:vir:97447 Length: 474 99.1 2.8E-09 1.7E-12 67.5 36.7 441 1-532 13-474 (474)
69 protein:vir:94498 Length: 474 99.1 2.8E-09 1.7E-12 67.5 36.7 441 1-532 13-474 (474)
70 protein:vir:4898 Length: 502 # 99.1 3.7E-09 2.3E-12 66.8 39.7 452 1-534 31-502 (502)
71 protein:vir:97171 Length: 512 99.0 4.3E-09 2.6E-12 66.5 39.5 452 1-531 31-512 (512)
72 protein:vir:96366 Length: 511 99.0 5.4E-09 3.4E-12 65.9 39.2 450 1-536 31-508 (511)
73 protein:vir:78805 Length: 511 99.0 5.4E-09 3.4E-12 65.9 39.2 450 1-536 31-508 (511)
74 protein:vir:99781 Length: 511 99.0 7.3E-09 4.5E-12 65.2 39.3 453 1-532 31-511 (511)
75 protein:vir:1236 Length: 483 # 99.0 9.2E-09 5.7E-12 64.7 36.9 436 1-534 29-483 (483)
76 protein:vir:38 Length: 496 # N 98.9 1.2E-08 7.5E-12 64.0 33.8 440 1-533 1-496 (496)
77 protein:vir:96240 Length: 511 98.9 1.3E-08 8.1E-12 63.8 41.2 453 1-534 31-511 (511)
78 protein:vir:2732 Length: 501 # 98.9 1.5E-08 9.3E-12 63.5 40.8 451 1-536 30-500 (501)
79 protein:vir:96494 Length: 501 98.9 1.6E-08 1E-11 63.3 41.3 453 1-534 30-501 (501)
80 protein:vir:3609 Length: 452 # 98.9 1.8E-08 1.1E-11 63.1 40.1 432 1-534 11-452 (452)
81 protein:vir:79043 Length: 479 98.9 1.9E-08 1.1E-11 63.0 37.9 443 1-516 7-479 (479)
82 protein:vir:94546 Length: 506 98.9 2E-08 1.2E-11 62.8 32.9 444 1-536 16-505 (506)
83 protein:vir:105461 Length: 470 98.9 2.6E-08 1.6E-11 62.2 38.4 434 8-533 1-470 (470)
84 protein:vir:103951 Length: 511 98.8 3E-08 1.9E-11 61.8 40.6 453 1-534 31-511 (511)
85 protein:vir:105889 Length: 474 98.8 3E-08 1.9E-11 61.8 36.9 444 1-530 1-474 (474)
86 protein:vir:94101 Length: 474 98.8 3E-08 1.9E-11 61.8 36.9 444 1-530 1-474 (474)
87 protein:vir:102950 Length: 471 98.8
3.3E-08 2.1E-11 61.6 34.4 431 8-523 1-471 (471)
88 protein:vir:80959 Length: 499 98.8 3.4E-08 2.1E-11 61.5 30.0 441 1-534 1-499 (499)
89 protein:vir:98444 Length: 434 98.8 3.8E-08 2.4E-11 61.3 29.4 403 38-535 1-434 (434)
90 protein:vir:3028 Length: 500 # 98.8 4E-08 2.5E-11 61.1 31.3 446 1-532 1-500 (500)
91 protein:vir:9815 Length: 500 # 98.8 4E-08 2.5E-11 61.1 31.3 446 1-532 1-500 (500)
92 protein:vir:9871 Length: 429 # 98.8 4.8E-08 3E-11 60.7 40.0 421 8-528 1-429 (429)
93 protein:vir:101494 Length: 527 98.8 5.5E-08 3.4E-11 60.4 31.8 467 1-535 1-527 (527)
94 protein:vir:1587 Length: 508 # 98.8 5.7E-08 3.6E-11 60.3 32.0 445 1-521 1-508 (508)
95 protein:vir:102239 Length: 527 98.8 6.1E-08 3.8E-11 60.1 31.8 467 1-535 1-527 (527)
96 protein:vir:99522 Length: 470 98.8 6.4E-08 4E-11 60.0 40.2 437 1-533 19-470 (470)
97 protein:vir:95806 Length: 440 98.7 6.7E-08 4.2E-11 59.9 34.6 416 16-521 1-440 (440)
98 protein:vir:96266 Length: 474 98.7 7.4E-08 4.6E-11 59.7 35.8 437 1-530 1-474 (474)
99 protein:vir:95899 Length: 474 98.7 7.4E-08 4.6E-11 59.7 35.8 437 1-530 1-474 (474)
100 protein:vir:345 Length: 663 # 98.7 7.5E-08 4.7E-11 59.6 29.6 499 1-536 1-645 (663)
101 protein:vir:4782 Length: 522 # 98.7 1.1E-07 6.5E-11 58.8 29.7 449 1-531 14-522 (522)
102 protein:vir:80453 Length: 535 98.7 1.1E-07 7.1E-11 58.7 30.0 455 1-536 32-535 (535)
103 protein:vir:9922 Length: 489 # 98.6 1.7E-07 1E-10 57.8 35.2 442 1-534 1-489 (489)
104 protein:vir:7430 Length: 563 # 98.6 2.5E-07 1.5E-10 56.8 28.4 486 1-536 1-556 (563)
105 protein:vir:8184 Length: 474 # 98.6 2.8E-07 1.7E-10 56.5 35.1 416 1-521 12-474 (474)
106 protein:vir:7987 Length: 456 # 98.5 4.3E-07 2.7E-10 55.5 35.2 434 1-524 1-456 (456)
107 protein:vir:79703 Length: 505 98.5 4.9E-07 3.1E-10 55.2 37.3 435 1-528 1-505 (505)
108 protein:vir:106571 Length: 499 98.5 5E-07 3.1E-10 55.1 40.6 446 1-536 1-495 (499)
109 protein:vir:98883 Length: 517 98.5 5.2E-07 3.2E-10 55.0 36.3 457 1-524 1-517 (517)
110 protein:vir:105292 Length: 478 98.5
5.3E-07 3.3E-10 55.0 39.7 439 1-535 1-478 (478)
111 protein:vir:9751 Length: 422 # 98.4 9.5E-07 5.9E-10 53.6 35.5 389 8-500 1-422 (422)
112 protein:vir:94742 Length: 409 98.4 1E-06 6.2E-10 53.5 36.2 374 8-480 1-409 (409)
113 protein:vir:107112 Length: 478 98.3 1.3E-06 7.9E-10 52.9 36.1 436 1-532 1-478 (478)
114 protein:vir:105819 Length: 456 98.3 1.9E-06 1.1E-09 52.0 34.5 435 1-524 1-456 (456)
115 protein:vir:102602 Length: 456 98.3 1.9E-06 1.1E-09 52.0 34.5 435 1-524 1-456 (456)
116 protein:vir:9568 Length: 410 # 98.2 2.5E-06 1.6E-09 51.3 35.0 375 24-501 1-410 (410)
117 protein:vir:1634 Length: 409 # 98.2 3.2E-06 2E-09 50.7 35.6 377 8-480 1-409 (409)
118 protein:vir:78083 Length: 537 98.1 4.3E-06 2.7E-09 50.0 40.3 470 1-534 1-537 (537)
119 protein:vir:96179 Length: 468 98.1 5.3E-06 3.3E-09 49.5 38.9 429 1-535 1-468 (468)
120 protein:vir:94956 Length: 452 98.1 5.8E-06 3.6E-09 49.3 27.4 435 1-526 1-452 (452)
121 protein:vir:102330 Length: 451 98.0 6.8E-06 4.2E-09 48.9 38.6 422 8-515 1-451 (451)
122 protein:vir:95149 Length: 501 98.0 7.9E-06 4.9E-09 48.6 27.6 443 1-534 1-501 (501)
123 protein:vir:96839 Length: 474 98.0 9.2E-06 5.7E-09 48.2 37.8 433 1-528 1-474 (474)
124 protein:vir:80040 Length: 461 97.7 2.8E-05 1.7E-08 45.6 25.2 433 1-506 1-461 (461)
125 protein:vir:96783 Length: 488 97.6 3.6E-05 2.2E-08 45.0 32.4 412 1-476 14-488 (488)
126 protein:vir:97265 Length: 513 97.6 3.9E-05 2.4E-08 44.7 31.2 457 1-536 1-511 (513)
127 protein:vir:78907 Length: 518 97.5 4.9E-05 3.1E-08 44.2 37.6 447 4-531 1-518 (518)
128 protein:vir:95014 Length: 491 97.4 7.4E-05 4.6E-08 43.2 29.7 429 1-528 1-491 (491)
129 protein:vir:4995 Length: 384 # 97.4 8.5E-05 5.3E-08 42.9 23.4 363 14-460 1-384 (384)
130 protein:vir:3989 Length: 392 # 97.3 0.00011 6.9E-08 42.3 24.8 334 37-464 1-392 (392)
131 protein:vir:1023 Length: 392 # 97.3 0.00011 6.9E-08 42.3 24.8 334 37-464 1-392 (392)
132 protein:vir:1266 Length: 416 # 97.1 0.00017 1.1E-07 41.2 20.9 365 15-475 1-416 (416)
133 protein:vir:3843
Length: 397 # 97.1 0.00019 1.1E-07 41.0 19.2 372 39-535 1-397 (397)
134 protein:vir:78393 Length: 489 96.9 0.00025 1.6E-07 40.3 31.7 427 1-533 1-489 (489)
135 protein:vir:101647 Length: 460 96.7 0.00042 2.6E-07 39.1 20.1 401 1-525 1-460 (460)
136 protein:vir:81152 Length: 411 96.4 0.00062 3.8E-07 38.2 23.5 368 14-478 1-411 (411)
137 protein:vir:7407 Length: 392 # 96.4 0.00062 3.9E-07 38.2 25.6 333 37-457 1-392 (392)
138 protein:vir:78161 Length: 355 96.4 0.00069 4.3E-07 37.9 16.2 305 176-536 1-348 (355)
139 protein:vir:63755 Length: 547 96.3 0.00078 4.8E-07 37.6 21.2 441 1-536 11-538 (547)
140 protein:vir:4828 Length: 382 # 96.3 0.0008 5E-07 37.6 24.3 352 1-462 1-382 (382)
141 protein:vir:79538 Length: 502 96.2 0.00086 5.4E-07 37.4 28.0 446 1-534 11-502 (502)
142 protein:vir:4952 Length: 386 # 96.0 0.0011 7.1E-07 36.7 26.2 354 14-460 1-386 (386)
143 protein:vir:4854 Length: 386 # 95.9 0.0014 8.4E-07 36.3 24.6 353 14-460 1-386 (386)
144 protein:vir:96068 Length: 765 95.7 0.0016 9.8E-07 35.9 19.8 444 1-536 69-537 (765)
145 protein:vir:96579 Length: 576 95.6 0.0017 1.1E-06 35.7 21.8 442 1-536 1-538 (576)
146 protein:vir:5737 Length: 419 # 95.5 0.002 1.2E-06 35.4 20.9 364 15-484 1-419 (419)
147 protein:vir:96738 Length: 505 95.4 0.0021 1.3E-06 35.2 26.4 443 1-529 17-505 (505)
148 protein:vir:78641 Length: 278 95.2 0.0025 1.6E-06 34.8 18.7 258 74-430 1-278 (278)
149 protein:vir:100150 Length: 437 95.0 0.0029 1.8E-06 34.5 21.9 384 1-488 1-437 (437)
150 protein:vir:102727 Length: 945 95.0 0.0029 1.8E-06 34.5 23.7 439 1-536 64-541 (945)
151 protein:vir:3153 Length: 467 # 94.7 0.0036 2.2E-06 34.0 20.1 412 54-532 1-467 (467)
152 protein:vir:1326 Length: 457 # 94.7 0.0037 2.3E-06 33.9 17.4 415 14-536 1-453 (457)
153 protein:vir:4598 Length: 416 # 94.6 0.0039 2.4E-06 33.8 22.7 366 1-480 1-416 (416)
154 protein:vir:81095 Length: 416 94.6 0.0039 2.4E-06 33.8 22.7 366 1-480 1-416 (416)
155 protein:vir:8418 Length: 409 # 94.5 0.0044 2.7E-06 33.5 21.1 350 14-465 1-409 (409)
156
protein:vir:98396 Length: 441 94.4 0.0045 2.8E-06 33.5 22.0 352 29-480 1-441 (441)
157 protein:vir:97060 Length: 432 94.3 0.0049 3E-06 33.2 23.0 367 1-483 1-432 (432)
158 protein:vir:3420 Length: 533 # 94.1 0.0054 3.4E-06 33.0 33.2 454 1-532 1-533 (533)
159 protein:vir:81072 Length: 432 94.0 0.0056 3.5E-06 32.9 22.6 367 1-483 1-432 (432)
160 protein:vir:4698 Length: 251 # 94.0 0.0057 3.5E-06 32.9 14.7 242 1-344 1-251 (251)
161 protein:vir:10362 Length: 432 93.8 0.0065 4E-06 32.6 23.2 368 1-468 1-432 (432)
162 protein:vir:10321 Length: 495 93.5 0.0075 4.6E-06 32.2 27.8 429 1-536 1-491 (495)
163 protein:vir:9408 Length: 441 # 93.4 0.0076 4.7E-06 32.2 21.9 352 29-480 1-441 (441)
164 protein:vir:79984 Length: 441 93.4 0.0076 4.7E-06 32.2 21.9 352 29-480 1-441 (441)
165 protein:vir:1884 Length: 424 # 93.0 0.0091 5.6E-06 31.8 22.1 376 1-476 1-424 (424)
166 protein:vir:99563 Length: 862 93.0 0.0093 5.8E-06 31.7 14.3 459 1-536 66-575 (862)
167 protein:vir:9359 Length: 348 # 92.8 0.0098 6.1E-06 31.6 16.9 309 74-484 1-348 (348)
168 protein:vir:189 Length: 424 # 92.6 0.011 6.7E-06 31.4 22.0 379 1-476 1-424 (424)
169 protein:vir:1431 Length: 419 # 92.3 0.012 7.3E-06 31.1 22.9 360 1-475 1-419 (419)
170 protein:vir:99312 Length: 563 92.1 0.013 7.9E-06 31.0 27.7 428 1-536 42-548 (563)
171 protein:vir:95599 Length: 563 92.1 0.013 7.9E-06 31.0 27.7 428 1-536 42-548 (563)
172 protein:vir:93610 Length: 454 91.6 0.015 9.2E-06 30.6 26.8 382 16-484 1-454 (454)
173 protein:vir:80333 Length: 419 91.6 0.015 9.5E-06 30.5 25.0 370 26-514 1-419 (419)
174 protein:vir:389 Length: 530 # 91.4 0.016 9.8E-06 30.5 32.8 452 1-535 1-530 (530)
175 protein:vir:5249 Length: 437 # 91.4 0.016 1E-05 30.4 23.9 404 28-536 1-433 (437)
176 protein:vir:102080 Length: 429 90.9 0.019 1.2E-05 30.1 23.7 371 1-474 1-429 (429)
177 protein:vir:80644 Length: 551 90.4 0.021 1.3E-05 29.8 29.5 432 1-536 1-542 (551)
178 protein:vir:107742 Length: 537 90.2 0.022 1.4E-05 29.7 25.5 436 1-536 68-530 (537)
179 protein:vir:8317
Length: 409 # 89.8 0.024 1.5E-05 29.5 21.7 346 1-461 32-409 (409)
180 protein:vir:102855 Length: 432 89.1 0.028 1.8E-05 29.1 23.1 372 1-474 1-432 (432)
181 protein:vir:105002 Length: 432 89.1 0.028 1.8E-05 29.1 23.1 372 1-474 1-432 (432)
182 protein:vir:107605 Length: 432 89.1 0.028 1.8E-05 29.1 23.1 372 1-474 1-432 (432)
183 protein:vir:100249 Length: 431 88.9 0.029 1.8E-05 29.0 18.9 368 1-481 1-431 (431)
184 protein:vir:4194 Length: 540 # 88.8 0.03 1.9E-05 28.9 29.6 408 3-536 1-479 (540)
185 protein:vir:100882 Length: 383 88.5 0.032 2E-05 28.8 17.1 342 1-479 1-383 (383)
186 protein:vir:1150 Length: 350 # 88.3 0.033 2.1E-05 28.7 21.1 308 1-430 1-350 (350)
187 protein:vir:95542 Length: 548 88.1 0.034 2.1E-05 28.6 31.6 480 1-536 1-545 (548)
188 protein:vir:960 Length: 413 # 87.8 0.036 2.2E-05 28.5 19.5 367 1-478 4-413 (413)
189 protein:vir:6240 Length: 457 # 87.8 0.036 2.3E-05 28.5 21.9 407 14-536 1-441 (457)
190 protein:vir:104338 Length: 422 87.2 0.04 2.5E-05 28.2 20.7 390 28-522 1-422 (422)
191 protein:vir:3868 Length: 417 # 86.9 0.042 2.6E-05 28.1 16.3 386 14-535 1-417 (417)
192 protein:vir:1380 Length: 422 # 86.5 0.045 2.8E-05 27.9 23.5 374 14-481 1-422 (422)
193 protein:vir:102118 Length: 409 86.2 0.047 2.9E-05 27.9 25.4 350 41-478 1-409 (409)
194 protein:vir:78749 Length: 337 85.6 0.052 3.2E-05 27.6 23.5 308 1-430 1-337 (337)
195 protein:vir:107880 Length: 491 85.5 0.052 3.3E-05 27.6 32.3 414 1-536 1-467 (491)
196 protein:vir:108215 Length: 469 85.1 0.055 3.4E-05 27.5 23.4 448 1-535 1-469 (469)
197 protein:vir:96980 Length: 409 84.4 0.061 3.8E-05 27.2 21.6 362 14-484 1-409 (409)
198 protein:vir:100187 Length: 385 84.3 0.061 3.8E-05 27.2 17.1 351 1-480 1-385 (385)
199 protein:vir:483 Length: 413 # 83.9 0.065 4E-05 27.1 23.4 364 15-476 1-413 (413)
200 protein:vir:80796 Length: 574 81.4 0.086 5.3E-05 26.4 22.4 437 1-536 1-531 (574)
201 protein:vir:101648 Length: 518 80.3 0.096 6E-05 26.2 21.0 377 67-536 1-440 (518)
202 protein:vir:7853 Length: 518 # 80.1 0.098
6.1E-05 26.1 21.8 378 67-536 1-440 (518)
203 protein:vir:2683 Length: 412 # 80.0 0.099 6.2E-05 26.1 21.9 373 1-484 1-412 (412)
204 protein:vir:100328 Length: 346 79.4 0.11 6.5E-05 25.9 21.2 319 1-434 1-346 (346)
205 protein:vir:105064 Length: 421 75.4 0.15 9.1E-05 25.1 19.9 368 1-484 1-421 (421)
206 protein:vir:99452 Length: 651 75.0 0.15 9.4E-05 25.1 15.0 476 1-536 1-547 (651)
207 protein:vir:79063 Length: 491 74.3 0.16 0.0001 24.9 32.1 415 1-536 1-467 (491)
208 protein:vir:4337 Length: 434 # 71.3 0.2 0.00012 24.4 26.6 393 1-498 1-434 (434)
209 protein:vir:4156 Length: 542 # 70.8 0.2 0.00013 24.4 28.1 403 34-536 1-482 (542)
210 protein:vir:9702 Length: 406 # 70.6 0.21 0.00013 24.3 18.8 386 14-534 1-406 (406)
211 protein:vir:98853 Length: 219 69.1 0.23 0.00014 24.1 15.3 195 217-433 1-219 (219)
212 protein:vir:4454 Length: 414 # 63.9 0.31 0.00019 23.4 25.0 356 14-468 1-414 (414)
213 protein:vir:3780 Length: 345 # 63.8 0.31 0.00019 23.4 24.1 317 1-432 1-345 (345)
214 protein:vir:267 Length: 348 # 58.1 0.42 0.00026 22.6 25.1 320 1-442 1-348 (348)
215 protein:vir:94426 Length: 409 58.0 0.42 0.00026 22.6 22.2 361 20-484 1-409 (409)
216 protein:vir:2013 Length: 344 # 56.8 0.45 0.00028 22.5 20.5 315 1-422 1-344 (344)
217 protein:vir:78191 Length: 351 56.7 0.45 0.00028 22.5 23.1 315 8-436 1-351 (351)
218 protein:vir:103971 Length: 376 55.8 0.47 0.00029 22.4 22.9 331 1-436 1-376 (376)
219 protein:vir:81218 Length: 423 54.5 0.5 0.00031 22.2 17.6 375 14-481 1-423 (423)
220 protein:vir:3743 Length: 345 # 50.6 0.61 0.00038 21.8 22.5 304 39-432 1-345 (345)
221 protein:vir:6058 Length: 344 # 49.6 0.63 0.00039 21.7 22.9 316 1-422 1-344 (344)
222 protein:vir:6382 Length: 553 # 47.8 0.69 0.00043 21.5 30.4 459 1-532 2-553 (553)
223 protein:vir:79207 Length: 351 46.1 0.75 0.00046 21.3 23.8 315 8-436 1-351 (351)
224 protein:vir:99853 Length: 488 45.8 0.76 0.00047 21.2 30.5 420 8-536 1-454 (488)
225 protein:vir:79150 Length: 368 45.7 0.76 0.00047 21.2 19.5 330 8-437 1-368 (368)
226
protein:vir:1082 Length: 359 # 45.4 0.77 0.00048 21.2 23.1 316 39-457 1-359 (359) 227 protein:vir:4509 Length: 424 # 40.9 0.95 0.00059 20.7 20.2 362 1-465 1-424 (424) 228 protein:vir:107662 Length: 427 40.6 0.96 0.0006 20.7 25.6 396 1-531 1-427 (427) 229 protein:vir:94049 Length: 532 34.7 1.3 0.00079 20.0 26.7 464 1-536 1-518 (532) 230 protein:vir:79647 Length: 435 32.9 1.4 0.00086 19.8 25.0 397 1-523 5-435 (435) 231 protein:vir:93943 Length: 409 30.4 1.6 0.00098 19.5 22.0 362 15-484 1-409 (409) 232 protein:vir:95254 Length: 488 26.7 1.9 0.0012 19.0 22.9 444 1-536 1-482 (488) 233 protein:vir:98567 Length: 340 26.6 1.9 0.0012 19.0 22.6 303 1-421 1-340 (340) No 1 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=100.00 E-value=1.5e-188 Score=1050.35 Aligned_cols=536 Identities=99% Similarity=1.387 Sum_probs=527.2 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCC Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFPM 80 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~ 80 (536) ||++++++++++|++||+.|+++|++||++|+||++||+|+++++++++++++..++|||||++|+++|||||||+|||+ T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~ 80 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALFPM 80 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhhcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEE Q lcl|NC_011045. 
81 QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKL 160 (536) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~ 160 (536) +|||||.+.|+++++...+....+++++||+.||++++++|++||||.++|++|+||++|||||+|+++++++++.+|++ T Consensus 81 ~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~~~~ 160 (536) T protein:vir:10 81 QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKL 160 (536) T ss_pred CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEE Confidence 99999999999999998888899999999999999999999999999999999999999999999999999988889999 Q ss_pred EecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccccc Q lcl|NC_011045. 161 YRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGS 240 (536) Q Consensus 161 ~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~ 240 (536) |||++|||.+|++|+||+||||++||+++|+++|++++.+...+++++++|+|||||+|++++++|.+|++++|++++++ T Consensus 161 ~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~~~e~~g~~v~~~ 240 (536) T protein:vir:10 161 YRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEASGEYLRYEEVEGMEVQGS 240 (536) T ss_pred EEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecCCCcEEEEEeecCcccccc Confidence 99999999999999999999999999999999999999988888999999999999999999999999999999999999 Q ss_pred ccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCccee Q lcl|NC_011045. 
241 DGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFV 320 (536) Q Consensus 241 ~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~~~ 320 (536) +|+|+|++|||+++||++.+|++|||||++++|||+|+||.|+++++++++++++|||+|+|+|++++.++.++++|+++ T Consensus 241 ~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v 320 (536) T protein:vir:10 241 DGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFV 320 (536) T ss_pred ccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCCCccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHH Q lcl|NC_011045. 321 TGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPL 400 (536) Q Consensus 321 ~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pl 400 (536) +|+++++++++++++++|+.+++.|++++++|+++||++++.+++++|||||||++|++|++++|||||+||++|||.|| T Consensus 321 ~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pl 400 (536) T protein:vir:10 321 TGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPL 400 (536) T ss_pred cCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhh Q lcl|NC_011045. 
401 VRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSG 480 (536) Q Consensus 401 i~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~ 480 (536) |+|+|++|++.|+||++|+++++++|+|||++++|+++++++++|++.+++++|++++++||+|++++++++++||+|.+ T Consensus 401 i~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv~p~~ 480 (536) T protein:vir:10 401 VRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSG 480 (536) T ss_pred HHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCCchh Confidence 99999999999999999999999999999999999999999999999999999999998899999999999999999999 Q ss_pred ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCCCC Q lcl|NC_011045. 481 ILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) Q Consensus 481 i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~ 536 (536) ++||+|||+++|+|++++++++++++++++++++++.++|++|+++++++|+|||+ T Consensus 481 ~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:10 481 ILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) T ss_pred hcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhHHhhhhccccCCCC Confidence 99999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=100.00 E-value=1.7e-188 Score=1050.04 Aligned_cols=536 Identities=99% Similarity=1.390 Sum_probs=527.3 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCC Q lcl|NC_011045. 
1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFPM 80 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~ 80 (536) ||++++++++++|++||++||++|++||++|+||++||+|+++++++++++++..++|||||++|+++|||||||+|||+ T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~ 80 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALFPM 80 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEE Q lcl|NC_011045. 81 QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKL 160 (536) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~ 160 (536) +|||||.+.|+++++...+....+++++||+.||++++++|++||||.++|++|+||++|||||+|+++++++++.+|++ T Consensus 81 ~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~f~~ 160 (536) T protein:vir:21 81 QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKL 160 (536) T ss_pred CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEE Confidence 99999999999999998888899999999999999999999999999999999999999999999999999988889999 Q ss_pred EecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccccc Q lcl|NC_011045. 
161 YRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGS 240 (536) Q Consensus 161 ~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~ 240 (536) |||++|||.+|++|+||+|||||+||+++|+++|++++.+...+++++++|+|||+|+|+++++.|.+|++++|+.++++ T Consensus 161 ~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~v~~~ 240 (536) T protein:vir:21 161 YRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRYEEVEGMEVQGS 240 (536) T ss_pred EEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecCCCcEEEEeccCCeeeccc Confidence 99999999999999999999999999999999999999988888999999999999999999999999999999999999 Q ss_pred ccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCccee Q lcl|NC_011045. 241 DGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFV 320 (536) Q Consensus 241 ~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~~~ 320 (536) +|+|+|++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|||+|+|+|++++.++.++++|+++ T Consensus 241 ~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v 320 (536) T protein:vir:21 241 DGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFV 320 (536) T ss_pred cCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCCCccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHH Q lcl|NC_011045. 
321 TGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPL 400 (536) Q Consensus 321 ~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pl 400 (536) +|+++++++++++++++|+.+++.|++++++|+++||++++.+++++|||||||++|++|++++|||||+||++|||.|| T Consensus 321 ~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pl 400 (536) T protein:vir:21 321 TGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPL 400 (536) T ss_pred cCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhh Q lcl|NC_011045. 401 VRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSG 480 (536) Q Consensus 401 i~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~ 480 (536) |+|+|++|++.|+||++|+++++++|+|||++++|+++++++++|++.+++++|++++++||+|++++++++++||+|.+ T Consensus 401 i~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~ 480 (536) T protein:vir:21 401 VRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSG 480 (536) T ss_pred HHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCChhh Confidence 99999999999999999999999999999999999999999999999999999999998899999999999999999999 Q ss_pred ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCCCC Q lcl|NC_011045. 
481 ILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) Q Consensus 481 i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~ 536 (536) ++||+|||+++|+|++++++++++++++++++++++.++|++|+++++++|+|||+ T Consensus 481 ~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:21 481 ILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) T ss_pred hcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhHHhhhhccccCCCC Confidence 99999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=100.00 E-value=1.4e-181 Score=1012.06 Aligned_cols=534 Identities=84% Similarity=1.245 Sum_probs=518.1 Q ss_pred CCC-ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_011045. 1 MAE-KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFP 79 (536) Q Consensus 1 Ma~-~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 79 (536) ||+ +++++++++|++||+.|+++|++||++|+||++||+|++|++++++++++..++|||||++|+++|||||||+||| T Consensus 1 m~~~~~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 80 (535) T protein:vir:15 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALFP 80 (535) T ss_pred CCccchhccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcC Confidence 999 7788999999999999999999999999999999999999999988888899999999999999999999999999 Q ss_pred CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEE Q lcl|NC_011045. 
80 MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMK 159 (536) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~ 159 (536) ++|||||.+.|..++++..+....+++++||+.||++|+.+|++||||.++|++|+||++|||||+|++++.+ ++++|+ T Consensus 81 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~-~~~~f~ 159 (535) T protein:vir:15 81 MQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEG-SYNPMK 159 (535) T ss_pred CCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCC-CceeeE Confidence 9999999999999999888888999999999999999999999999999999999999999999999988764 567999 Q ss_pred EEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccc Q lcl|NC_011045. 160 LYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQG 239 (536) Q Consensus 160 ~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~ 239 (536) +|||++|||.+|++|+||+|||||+||+++|+++|+.++.++..+++++++|+|||||+|++++++|.+|++++|..+++ T Consensus 160 ~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~~~~ 239 (535) T protein:vir:15 160 LYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVEIDG 239 (535) T ss_pred EEEcCeeEEeeCCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEecCCCcEEEEEEeeCccccc Confidence 99999999999999999999999999999999999999998888899999999999999999999999999999999988 Q ss_pred cccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCcce Q lcl|NC_011045. 
240 SDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDF 319 (536) Q Consensus 240 ~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~~ 319 (536) .++.|+|++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|||+|+|+|++++.++.++++|+| T Consensus 240 ~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~l~~~~~g~~ 319 (535) T protein:vir:15 240 SDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGDF 319 (535) T ss_pred cccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccchhcccCCceee Confidence 88899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHH Q lcl|NC_011045. 320 VTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLP 399 (536) Q Consensus 320 ~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~P 399 (536) ++|+++++++++++++++|+.+++.|++++++|+++||++++.+++++|||||||++|++|++++|||||+||++|||.| T Consensus 320 v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~P 399 (535) T protein:vir:15 320 VPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLP 399 (535) T ss_pred ecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChh Q lcl|NC_011045. 400 LVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTS 479 (536) Q Consensus 400 li~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~ 479 (536) ||+|+|++|++.|+||++|+++++|+|+|||+++||++++++|++|++.+++++|++++++||+|++++++++++|||+. 
T Consensus 400 li~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~ 479 (535) T protein:vir:15 400 LVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTS 479 (535) T ss_pred HHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChh Confidence 99999999999999999999999999999999999999999999999999999999999899999999999999999778 Q ss_pred hccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCCC Q lcl|NC_011045. 480 GILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPG 535 (536) Q Consensus 480 ~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~ 535 (536) .|+||+||++++++|+++++++++++++++++++++++++|++++++.+++|++-- T Consensus 480 ~i~~~~eev~~~~~q~~~~~~~~~~a~~~g~~~~~~~~~~p~~~~~~~~~~g~~~~ 535 (535) T protein:vir:15 480 GILLTDEQKQALMMQDAAQTGIENAAATGGAGVGALATSSPEAMQGAAAQAGLDAT 535 (535) T ss_pred hhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccchhccChHHHHHHHhccCCCCC Confidence 89999999999999999999999999999999999999999999999999995544 No 4 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=100.00 E-value=7.7e-180 Score=1002.54 Aligned_cols=534 Identities=84% Similarity=1.246 Sum_probs=518.9 Q ss_pred CCC-ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_011045. 
1 MAE-KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFP 79 (536) Q Consensus 1 Ma~-~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 79 (536) ||+ +++++++++|++||+.|+++|++||++|+||++||+|+++++++++++++..++|||||++|+++|||||||+||| T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 80 (535) T protein:vir:33 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALFP 80 (535) T ss_pred CChhhhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcC Confidence 999 7889999999999999999999999999999999999999999998888999999999999999999999999999 Q ss_pred CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEE Q lcl|NC_011045. 80 MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMK 159 (536) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~ 159 (536) ++|||||.+.|+.++++..+....+++++||+.||++++.+|++||||.++|++|+||++|||||+|++++.+ ++++|+ T Consensus 81 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~-~~~~f~ 159 (535) T protein:vir:33 81 MQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEG-SYNPMK 159 (535) T ss_pred CCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCC-CceeeE Confidence 9999999999999999988888999999999999999999999999999999999999999999999988765 568999 Q ss_pred EEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccc Q lcl|NC_011045. 
160 LYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQG 239 (536) Q Consensus 160 ~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~ 239 (536) +|||++|||.+|++|+||+||||++||+++|+++|+.+..++..++++++++++||||+++++++.|.+|++++|..+.+ T Consensus 160 ~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~~ 239 (535) T protein:vir:33 160 LYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVEIDG 239 (535) T ss_pred EEEcCeeEEeeCCCCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEEEeeCCCCcEEEEEEEeCccccc Confidence 99999999999999999999999999999999999999988888899999999999999999999999999999999989 Q ss_pred cccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCcce Q lcl|NC_011045. 240 SDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDF 319 (536) Q Consensus 240 ~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~~ 319 (536) ++|.|+|++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|||+|+|+|++++.++.++++|+| T Consensus 240 ~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~~~~~~~g~~ 319 (535) T protein:vir:33 240 SDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQTGDF 319 (535) T ss_pred cccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHH Q lcl|NC_011045. 
320 VTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLP 399 (536) Q Consensus 320 ~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~P 399 (536) ++|+++++++++++++++|+.+++.|++++++|+++||++++.+++++|||||||++|++|++++|||||+||++|||.| T Consensus 320 v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~P 399 (535) T protein:vir:33 320 VPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLP 399 (535) T ss_pred ecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChh Q lcl|NC_011045. 400 LVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTS 479 (536) Q Consensus 400 li~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~ 479 (536) ||+|+|++|++.|+||++|+++++|+|+|||+++||++++++|++|++.+++++|++++++||+|++++++++++|||+. T Consensus 400 li~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~ 479 (535) T protein:vir:33 400 LVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTS 479 (535) T ss_pred HHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHh Confidence 99999999999999999999999999999999999999999999999999999999999899999999999999999778 Q ss_pred hccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCCC Q lcl|NC_011045. 
480 GILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPG 535 (536) Q Consensus 480 ~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~ 535 (536) .|+||+||++++++|++++++++++++++++++++++..+|++++++.+++|+--- T Consensus 480 ~i~~~~ee~~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:33 480 GILLTDEQKQALMMQDAAQTGVENAAAAGGAGVGALATSSPEAMQGAAAKAGLNAT 535 (535) T ss_pred HhcCCHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhhcCChhHHHHHHhccCCCC Confidence 89999999999999999999999999999999999999999999999999987666 No 5 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=100.00 E-value=3.7e-179 Score=998.79 Aligned_cols=533 Identities=80% Similarity=1.208 Sum_probs=514.7 Q ss_pred CCC--ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 1 MAE--KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALF 78 (536) Q Consensus 1 Ma~--~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 78 (536) ||. +++++++++|++||+.|+++|++||++|+||++||+|+++++++++++++..++|||||++|+++|||||||+|| T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 80 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGLNNLASKLMLALF 80 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCccccccCCcccccHHHHHHHHHHHHHhhhc Confidence 987 788999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeE Q lcl|NC_011045. 
79 PMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPM 158 (536) Q Consensus 79 P~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~ 158 (536) |++|||||.+.|..++++..++.+.+++++||++||++++.+|++||||.++|++|+||++|||||+|++++.++ +++| T Consensus 81 P~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~-~~~f 159 (535) T protein:vir:94 81 PMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPEPEGT-YNPM 159 (535) T ss_pred CCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeccCcCc-ccce Confidence 999999999999999998888889999999999999999999999999999999999999999999999887764 4689 Q ss_pred EEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccc Q lcl|NC_011045. 159 KLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQ 238 (536) Q Consensus 159 ~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~ 238 (536) ++|||++|||.+|++|+||+|||||++++++|+++|++++.++ .+++++++|+|||||+|++++++|.+|++++|+.+. T Consensus 160 ~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~-~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~~~ 238 (535) T protein:vir:94 160 KLYRLSSYVVQRDAFGTVLQIVTLDKTAYAALPEDVRNSMDSS-QEHKGDEMIDVYTHIYLDEESGEYLKYEEIDGVEVE 238 (535) T ss_pred EEEEcCeEEEeeCCCCCeEEEEeeeeccHHHhhHHHHHHHHhc-cccCCCceeEEEEEEEeeCCCCcEEEEEEecCeeec Confidence 9999999999999999999999999999999999999988654 357889999999999999999999999999999998 Q ss_pred ccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCcc Q lcl|NC_011045. 
239 GSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGD 318 (536) Q Consensus 239 ~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~ 318 (536) +.++.++|++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|||+|+|+|++++.++.++++|+ T Consensus 239 ~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~g~ 318 (535) T protein:vir:94 239 GTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQVRRLTKAQTGD 318 (535) T ss_pred cccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccccchhhcccCCCce Confidence 88889999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHH Q lcl|NC_011045. 319 FVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQL 398 (536) Q Consensus 319 ~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~ 398 (536) +++|+++++++++++++++|+.+++.|++++++|+++||++++.+++++|||||||++|++|++++|||||+||++|||. T Consensus 319 ~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~ 398 (535) T protein:vir:94 319 FVSGRPEDISFLQLEKAADFSVARAVSEQIEGRLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQL 398 (535) T ss_pred eecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhHhhhccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCCh Q lcl|NC_011045. 
399 PLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDT 478 (536) Q Consensus 399 Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p 478 (536) |+|+|+|++|+|.|+||++|++.++++|+|||++++|++++++|++|++.+++++|++++++||+|++++++++++|||+ T Consensus 399 Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~la~l~r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~ 478 (535) T protein:vir:94 399 PMVRVLLKQLQATNQIPELPKEAVEPTISTGMEALGRGQDLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAIGIDT 478 (535) T ss_pred HHHHHHHHHHHhCCCCCCCChhhccceEeehHHHHHHHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCCh Confidence 99999999999999999999999999999999999999999999999999999999999989999999999999999977 Q ss_pred hhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCCC Q lcl|NC_011045. 479 SGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPG 535 (536) Q Consensus 479 ~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~ 535 (536) +.|+||+||++++++|+++|+++++++++.++.+....+.+|+.+++++++.|++|- T Consensus 479 ~~i~rs~eev~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:94 479 SGILKTPEEKQQEMAEAAQGTAMQNAAASAGAGAGTMATASPENMKAAAAQAGMAPN 535 (535) T ss_pred hhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccChHHHHHHHHHhccCCC Confidence 899999999999999999999888888888888888889999999999999999999 No 6 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=100.00 E-value=5.3e-177 Score=986.98 Aligned_cols=535 Identities=69% Similarity=1.092 Sum_probs=512.8 Q ss_pred CCC-ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_011045. 
1 MAE-KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFP 79 (536) Q Consensus 1 Ma~-~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 79 (536) ||+ ++++.++++|++||++|+++|++||++|+||++||+|++++++++.++++..++|||||++|+++|||||||+||| T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 80 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSSTDYTTPWQAVGARGLNNLSAKVMLALFP 80 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Confidence 999 7888999999999999999999999999999999999999999888888888999999999999999999999999 Q ss_pred CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCc--eee Q lcl|NC_011045. 80 MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSN--YNP 157 (536) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~--~~~ 157 (536) ++|||||.+.|..+++...+....++++.||++||++++.+|++||||.++|++|+||++|||||+|++++.++. ..+ T Consensus 81 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~~~~~~ 160 (543) T protein:vir:88 81 LQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDASSNSYNP 160 (543) T ss_pred CCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCccccceecc Confidence 999999999999999888888899999999999999999999999999999999999999999999999887643 234 Q ss_pred EEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccc Q lcl|NC_011045. 158 MKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEV 237 (536) Q Consensus 158 ~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i 237 (536) |+.|||++|+|.+|++|+|++||||+++++++|+++|++++.. 
..+++++++|+|||+|+||.++++|.+|++++|+.+ T Consensus 161 ~~~~pl~~y~v~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~-~~~~~p~~~~~v~~~V~pr~~~~~~~~~~~~~~~~v 239 (543) T protein:vir:88 161 MKLYTLHNHVVQRDAFGNVLQIVTLDKVAYAALPEDVRNSLSG-GQEYKPEQELEVYTHIYIDDESGDFLSYQEIEGVEV 239 (543) T ss_pred eEEeEcceEEEeeCCCCCeeeeeeeeeccHHHHhHHhhHHHHH-HhhcCCccceEEEEEEEeecCCCcccccccccCeee Confidence 6779999999999999999999999999999999999998864 346788999999999999999999999999999999 Q ss_pred cccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCc Q lcl|NC_011045. 238 QGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTG 317 (536) Q Consensus 238 ~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g 317 (536) .+.+|.|+|++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|||+|+|+|++++.++.++++| T Consensus 240 ~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~~~~~~~g 319 (543) T protein:vir:88 240 DGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVRRLVKAQTG 319 (543) T ss_pred ecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCCc Confidence 88899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHH Q lcl|NC_011045. 
318 DFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQ 397 (536) Q Consensus 318 ~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l 397 (536) +|++|++++++++++++++||+.+++.|++++++|+++||++++.+++++|||||||++|++|++++|||||+||++||| T Consensus 320 ~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l 399 (543) T protein:vir:88 320 DFVAGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLNSAVQRSGERVTAEEIRYVASELEDTLGGVYSILSQELQ 399 (543) T ss_pred eeecCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCCCcccHHHHHHHHHHHHHHHhHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCC Q lcl|NC_011045. 398 LPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGID 477 (536) Q Consensus 398 ~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~ 477 (536) .|||+|+|++|++.|+||++|+++++|+|+|+|++++|++++++|.+|++.+++++|..+.++||+|++++++++++||| T Consensus 400 ~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~ 479 (543) T protein:vir:88 400 LPIVRVLLNQLQATQQIPNLPQEAVEPTVTTGAEALGRGQDLDKLTQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGID 479 (543) T ss_pred HHHHHHHHHHHHhcCCCCCCchhceeeeEEecHHHHHHHHHHHHHHHHHHHHHhccchhhhccCCHHHHHHHHHHHhCCC Confidence 99999999999999999999999999999999999999999999999999999998755556899999999999999999 Q ss_pred hhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCCCC Q lcl|NC_011045. 
478 TSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) Q Consensus 478 p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~ 536 (536) |+.|+||+||++++|+|+++|+++++++++.+++++++..++|++++++.+++|.||+= T Consensus 480 ~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p 538 (543) T protein:vir:88 480 TAGLLLTEAEKAQAQSQEMLKQGGLNAAAGIGSGVAAQATASPEAMESAMDTAGVQPGP 538 (543) T ss_pred hhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhccChHHHHHHhhhcCCCCCC Confidence 99999999999999999999999999999999999999999999999998888889887 No 7 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=100.00 E-value=2.1e-175 Score=978.22 Aligned_cols=521 Identities=67% Similarity=1.074 Sum_probs=496.0 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCC Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFPM 80 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~ 80 (536) ||+ +++.++++|++||+.|+++|++||++|+||++||+|+++++++++++++..++|||||++|+++||||||++|||+ T Consensus 1 ~~~-~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltP~ 79 (522) T protein:vir:94 1 MAE-REGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQAVGARCLNNLAAKLMLALFPQ 79 (522) T ss_pred Ccc-cchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhcCCC Confidence 999 9999999999999999999999999999999999999999999988888899999999999999999999999999 Q ss_pred CcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEE Q lcl|NC_011045. 
81 QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKL 160 (536) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~ 160 (536) +|||||.+.|..+++...+.+..+++++||++||++|+++|++||||.++|++|+||++|||||+|++++..+++++|++ T Consensus 80 ~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~~ 159 (522) T protein:vir:94 80 SPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPEQGTYSPMRM 159 (522) T ss_pred CcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccCCCceeeEEE Confidence 99999999999888888888889999999999999999999999999999999999999999999999988888889999 Q ss_pred EecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccccc Q lcl|NC_011045. 161 YRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGS 240 (536) Q Consensus 161 ~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~ 240 (536) |||++|||.+|++|+||+||||+++++++|+++|++.+.. ++++++++|+|||+|+|+. ++|++|++++|+.+.+. T Consensus 160 ~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~--~~~~p~~~v~v~~~v~~~~--~~~~~~~~~~g~~~~~~ 235 (522) T protein:vir:94 160 YRLVSYVVQRDAFGNILQIVTIDKVAFSALPEDVKSQLNA--DDYEPDTELEVYTHIYRQD--DEYLRYEEVEGIEVTGT 235 (522) T ss_pred EEcceEEEeeCCCcCeEEEeeeeeccHHhcchHHHHHHhc--ccCCccceEEEEEEEEeeC--CceeEEeeccCceeccc Confidence 9999999999999999999999999999999999988753 4567899999999999974 57999999999999999 Q ss_pred ccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCccee Q lcl|NC_011045. 
241 DGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFV 320 (536) Q Consensus 241 ~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~~~ 320 (536) +|.|+|++|||+++||++.+|++|||||++++|||+|+||.|+++++++++++++|||+|+|+|++++.++.++++|+|+ T Consensus 236 ~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~~~~~~g~~v 315 (522) T protein:vir:94 236 DGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQPRRLNKAATGEFV 315 (522) T ss_pred CCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccchheeccCCceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHH Q lcl|NC_011045. 321 TGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPL 400 (536) Q Consensus 321 ~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pl 400 (536) +|+++++++++++++++|+.+++.|++++++|+++||++++.++++++||||||++|++|++++|||||+||++|||.|| T Consensus 316 ~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pl 395 (522) T protein:vir:94 316 AGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNSAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPI 395 (522) T ss_pred cCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhh Q lcl|NC_011045. 401 VRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSG 480 (536) Q Consensus 401 i~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~ 480 (536) |+|+|++|++.|+||++|+++++|+|+|||+++||++++++|++|++.+++++|++++++||+|++++++++++||||+. 
T Consensus 396 i~r~~~il~r~g~lP~~p~~~v~v~~~s~La~~qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~ 475 (522) T protein:vir:94 396 VRVLMNQLQSAGMIPDLPKEAVEPTVSTGLEALGRGQDLEKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDTAG 475 (522) T ss_pred HHHHHHHHHhcCCCCCCCcccEEeeEecHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCChhh Confidence 99999999999999999999999999999999999999999999999999999999888899999999999999999999 Q ss_pred ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hhhcCcchHHhh Q lcl|NC_011045. 481 ILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAA-QATASPEAMAAA 526 (536) Q Consensus 481 i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~-~~~~~~~~~~~~ 526 (536) |+||++|++++++|+++++++++++.+.+++..+ .++..++.|++| T Consensus 476 ivr~~ee~~~~~~q~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 522 (522) T protein:vir:94 476 LLLTQDEKIQRMAEQSSQQAVVQGASAAGANMGAAVGQGAGEDMAQA 522 (522) T ss_pred ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccchhhhcC Confidence 9999999999999988887777776666655433 456667777776 No 8 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=100.00 E-value=5.7e-170 Score=948.44 Aligned_cols=528 Identities=58% Similarity=0.940 Sum_probs=485.8 Q ss_pred CCC-ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_011045. 
1 MAE-KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFP 79 (536) Q Consensus 1 Ma~-~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 79 (536) ||+ +++++++++|++||+.|+++|++||++|+||++||+|+++++++++++++..++|||||++|+++||||||++||| T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltp 80 (532) T protein:vir:99 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALFP 80 (532) T ss_pred CcchhhccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCcchhhccccccchHHHHHHHHHHHHHHhhcC Confidence 999 7889999999999999999999999999999999999999999999999999999999999999999999999997 Q ss_pred -CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCC--Ccee Q lcl|NC_011045. 80 -MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEG--SNYN 156 (536) Q Consensus 80 -~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~--~~~~ 156 (536) ++|||||.++|+.+++....+...++|++||++||++|+++|++||||.++|++|+||++|||||+|++++.+ .+.+ T Consensus 81 p~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~ 160 (532) T protein:vir:99 81 VGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSN 160 (532) T ss_pred CCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEecccccccCccc Confidence 5699999999999999888888999999999999999999999999999999999999999999999986543 4667 Q ss_pred eEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcc Q lcl|NC_011045. 157 PMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGME 236 (536) Q Consensus 157 ~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~ 236 (536) +|++|||++|||.+|++|+|++||||+++++++|+++|+.+..+...+++++++|+|||+|+|++++..|.+|++++|+. 
T Consensus 161 ~f~~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~~~~~~~~g~~ 240 (532) T protein:vir:99 161 APKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEI 240 (532) T ss_pred ceEEEEcCeEEEeeCCCCCeeeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEEEEecCCCCeeEEEEeecCce Confidence 89999999999999999999999999999999999999999887777889999999999999999998999999999999 Q ss_pred ccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCC Q lcl|NC_011045. 237 VQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQT 316 (536) Q Consensus 237 i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~ 316 (536) +++++|.|+|++|||+++||++.+||+|||||++++|||+|+||.|+++.+++++++++|||+|+|+|++++.++.++++ T Consensus 241 ~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~~ 320 (532) T protein:vir:99 241 VAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAKANT 320 (532) T ss_pred ecccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCceeccccccchhhhccCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_011045. 
317 GDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQEL 396 (536) Q Consensus 317 g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~ 396 (536) |++++|+++++++++++++++|+.+++.|++++++|+++||++++.+++++|||||||++|++|++++|||||+||++|| T Consensus 321 g~~v~g~~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~ 400 (532) T protein:vir:99 321 GDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQEL 400 (532) T ss_pred cceecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCC Q lcl|NC_011045. 397 QLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGI 476 (536) Q Consensus 397 l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv 476 (536) |.|||+|+|++|++.|+||++|++.+++.+++++..+.|+++++++.+|++.++++.|+.++ +||+|++++++++++|| T Consensus 401 l~Pli~r~~~il~r~g~lP~~p~~~~~~~iv~~is~Laraq~~~~l~~~~~~laq~~p~~~d-~id~d~~~~~~a~~~GV 479 (532) T protein:vir:99 401 QLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFIDYMIKLAGLQDD-DINLLDVKMRLANSLGM 479 (532) T ss_pred HHHHHHHHHHHHHhcCCCCCCChhhcccceeecchHHHHHHHHHHHHHHHHHHHhhcchhhh-hCCHHHHHHHHHHHhCC Confidence 99999999999999999999999887664444444444445678999999999999998765 69999999999999999 Q ss_pred ChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcC Q lcl|NC_011045. 477 DTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSV 530 (536) Q Consensus 477 ~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (536) ||..|+||+||++++++|++++++++ ++.+.++.++.++..++...+.+++.. 
T Consensus 480 ~~~~i~r~~ee~~~~~~q~~~~~~~~-~a~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (532) T protein:vir:99 480 DTTGLILTQQDKQAKMAEASTAAGMV-TAGQQMGAAGGQAAAAMMQQQAGMPTQ 532 (532) T ss_pred ChhhccCCHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcchhHHhhcCCCCC Confidence 99999999999999998776555433 444445555555555554444444443 No 9 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=100.00 E-value=2.8e-165 Score=922.73 Aligned_cols=518 Identities=32% Similarity=0.515 Sum_probs=469.9 Q ss_pred cHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC-CCcceecc Q lcl|NC_011045. 9 AEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFP-MQTWMRLT 87 (536) Q Consensus 9 ~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP-~~~Wf~l~ 87 (536) =+++|++||+.|+++|++||++|+||++||+|++++++++++.++..++|||||++|+++|||||||+||| ++|||||. T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 80 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLPYLLTEDGHASGGRLQQPYQSLGSKGVNALSSKLMLSLFPIQTSFFKLQ 80 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 28899999999999999999999999999999999999988888899999999999999999999999997 56999999 Q ss_pred CChhhhhhhcc-ChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceE Q lcl|NC_011045. 88 ISEYEAKQLLS-DPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSY 166 (536) Q Consensus 88 ~~d~~~~~~~~-~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~ 166 (536) ++|+.+++... 
+++..++++.||++||++++++|++||||.++|++|+||++|||||+|++++ +|++|||++| T Consensus 81 ~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~------~~~~~pl~~y 154 (542) T protein:vir:78 81 INDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFAGKK------TLKVYPLDRY 154 (542) T ss_pred CCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEecCC------CceEEeccee Confidence 99999988654 6667789999999999999999999999999999999999999999999764 3889999999 Q ss_pred EEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhc----cccCCCCceEEEEEEEEecCCC----------CceeEEEEe Q lcl|NC_011045. 167 VVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQ----GGEKKADETIDVYTHIYLDEDS----------GEYIRYEEV 232 (536) Q Consensus 167 ~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~----~~~~~~~~~~~v~~~v~p~~~~----------~~~~~~~~v 232 (536) ||.+|++|+||+|||||+||+++|+++|+.+..++ ...++++.+++|+|+|+|+.+. ..|.+|+++ T Consensus 155 ~v~~d~~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~~~~~~~s~~~e~ 234 (542) T protein:vir:78 155 VIERDGDGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCKLVDGQHRWHQEC 234 (542) T ss_pred EEeeCCCCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccccccCCCeEEEEEEe Confidence 99999999999999999999999999998765443 3456788999999999998752 357889999 Q ss_pred cCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhc Q lcl|NC_011045. 233 EGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLT 312 (536) Q Consensus 233 ~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~ 312 (536) +|+.+.+..+.++|++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|||+|+|+|++++.++. 
T Consensus 235 ~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~~~~~~~~ 314 (542) T protein:vir:78 235 DGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSPSATTKPQSLA 314 (542) T ss_pred ccccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcc Confidence 99998777778889999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHH Q lcl|NC_011045. 313 KAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSIL 392 (536) Q Consensus 313 ~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl 392 (536) ++++|+|++|+++++++++++++++|+.+++.|++++++|+++||+++ .++++|||||||++|++|++++|||||+|| T Consensus 315 ~~~~g~iv~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~~~--~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl 392 (542) T protein:vir:78 315 RAGTGAIIQGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFLILN--VRQSERTTATEVREVQMELDRQLSGIYGSL 392 (542) T ss_pred cCCCceeecCCccceeeeecccccchhHHHHHHHHHHHHHHHHhcccc--cCCcccccHHHHHHHHHHHHHHhhHHHHHH Confidence 999999999999999999999999999999999999999999999875 489999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHh-hcchhhhhcCCHHHHHHHHH Q lcl|NC_011045. 
393 SQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAA-LAPMRDDPDINLAMIKLRIA 471 (536) Q Consensus 393 ~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~-~~p~~~~~~id~d~~~~~~a 471 (536) ++|||.|+|+|+|++|++.|+||++|++.++|+|+|||++++|++++++|.+|++.+++ ++|++++++||+|+++++++ T Consensus 393 ~~E~L~Pli~R~~~il~r~g~lP~~p~~lv~~~~~s~La~~~r~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a 472 (542) T protein:vir:78 393 TVELLTPYLNRKLHLMQRSKQLPSLPKGLVMPTVVAGLGGVGRGEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLA 472 (542) T ss_pred HHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeeechHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999988 46888888999999999999 Q ss_pred HHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCCCC Q lcl|NC_011045. 472 NAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) Q Consensus 472 ~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~ 536 (536) +++|||++.|+||+||++++++|++++++++..+ ..++.++....+.+.+.+.+..++.||+= T Consensus 473 ~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al~--~~a~~~a~~~~~~~~~~~~~a~~~~~~~~ 535 (542) T protein:vir:78 473 AASGIDTLNLVKSPETMANEAQQAQQQQMTASLM--GQAGQLAKSPIGEKMMQQINAPGQEAPAG 535 (542) T ss_pred HHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHH--HhhhhccccccccchhhhcCCCCcCCCCC Confidence 9999987899999999999887766544333322 22233333333344555655566778865 No 10 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=100.00 E-value=9.1e-163 Score=908.96 Aligned_cols=505 Identities=29% Similarity=0.407 Sum_probs=457.0 Q ss_pred cHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCC-Ccceecc Q lcl|NC_011045. 
9 AEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFPM-QTWMRLT 87 (536) Q Consensus 9 ~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~-~~Wf~l~ 87 (536) -++++++||++|| |++||++|+||++||+|+++++++++++++..++|||||++|+++||||||++|||+ +|||||+ T Consensus 1 mk~~~~~~~~~lk--r~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) T protein:vir:78 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) T ss_pred ChhHHHHHHHHHh--ccchHHHHHHHHHhhccccccCCCCcccccccCcccchHHHHHHHHHHHHHHhhcCCCCcccccC Confidence 3899999999996 899999999999999999999999888888899999999999999999999999975 6999999 Q ss_pred CChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEE Q lcl|NC_011045. 88 ISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYV 167 (536) Q Consensus 88 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~ 167 (536) ++|..++++..+....++|++||++||++++.+|++||||.++|++|+||++||||++|++++.. +|++|||++|| T Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~----~~~~~pl~~y~ 154 (510) T protein:vir:78 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEA----TVVAWSLRSYA 154 (510) T ss_pred CChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEeCCCC----eEEEEEcceeE Confidence 99999988887788889999999999999999999999999999999999999999999987543 58999999999 Q ss_pred EeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCC--CceeEEEEecCccccccccccc Q lcl|NC_011045. 168 VQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDS--GEYIRYEEVEGMEVQGSDGTYP 245 (536) Q Consensus 168 v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~--~~~~~~~~v~g~~i~~~~~~~~ 245 (536) |.+|++|+||+||||++||+++|+++|+++..+...+++++++|+|||+|+|+++. 
..|.+|+++||+.+ +.+|.|+ T Consensus 155 v~~d~~G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~i-~~~~~~~ 233 (510) T protein:vir:78 155 VRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRV-GETGRWP 233 (510) T ss_pred EeeCCCcCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEEEEeecCCCCcEEEEEEEecCeee-ccccccc Confidence 99999999999999999999999999999999888889999999999999998653 34778899999988 6778899 Q ss_pred cccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCcceecCCcc Q lcl|NC_011045. 246 KEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPE 325 (536) Q Consensus 246 ~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~~~~g~~~ 325 (536) |++|||+++||++.+||+|||||++++|||+|+||.|+++.+++++++++|||+|+|+|++++.++.++++|++++|+++ T Consensus 234 ~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~l~~~~~g~~v~g~~~ 313 (510) T protein:vir:78 234 IHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAE 313 (510) T ss_pred cccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCCccccchhhhccCCCceeecCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 
326 DISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLL 405 (536) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~ 405 (536) ++++++++++++|+.+++.|++++++|+++||++ +.+++++|||||||++|++|++++|||||+||++|||.|||+|+| T Consensus 314 ~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~ 392 (510) T protein:vir:78 314 AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCL 392 (510) T ss_pred cccccccCcccchHHHHHHHHHHHHHHHHHHhhc-cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999997 667999999999999999999999999999999999999999999 Q ss_pred HHHHhcCCCCCCCCc--ceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccC Q lcl|NC_011045. 406 KQLQATQQIPELPKE--AVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILL 483 (536) Q Consensus 406 ~il~~~g~lp~~~~~--~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~r 483 (536) ++|++.|++|++|+. ...|+|+|+|+++|+.+.+.++.++++.+++++ ++++.||+|++++++++++||||+.|+| T Consensus 393 ~il~r~gl~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~~--q~~~~id~d~~~~~~a~~~Gv~p~~ivr 470 (510) T protein:vir:78 393 SEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIA--QLDPRISLPKMMDTIWAAFSVDTSQFYK 470 (510) T ss_pred HHHHhccCCCCCcccccceeeecccHHHHHHHHHHHHHHHHHHHHhcChh--hhhhcCCHHHHHHHHHHHhCCChhhhcC Confidence 999999977766653 245678888888888888777777777766554 3677899999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCCCC Q lcl|NC_011045. 484 TEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) Q Consensus 484 s~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~ 536 (536) |+|||+++++|+++|+++++++++.. 
+.++++.++.++|+ T Consensus 471 s~eev~a~~~~~~~q~~~~~~~~~a~-------------~~~~~~~~~~~~g~ 510 (510) T protein:vir:78 471 SADELQAEAEEQRRQAAQAQAAQETL-------------LEGASDMTNALAGV 510 (510) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHhhhhhcccCCCC Confidence 99999999988876555554444332 23333444445555 No 11 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=100.00 E-value=1.2e-162 Score=908.35 Aligned_cols=510 Identities=34% Similarity=0.546 Sum_probs=462.1 Q ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCC--CcccccccccccchHHHHHHHHHHHHHHhhcC-CCcceecc Q lcl|NC_011045. 11 EGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDS--DNASTDYVTPWQAVGARGLNNLASKLMLALFP-MQTWMRLT 87 (536) Q Consensus 11 ~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~--~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP-~~~Wf~l~ 87 (536) =++++||+.|+++|++||++|+||++||+|+++++++ +.+.++..++|||||++|+++||||||++||| ++|||||. T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 80 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPNHKSLTVPWQSVGAKCCVTLAAKLMLAVLPPQTSFFKLQ 80 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 2377899999999999999999999999999998875 34556788999999999999999999999997 56999999 Q ss_pred CChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEE Q lcl|NC_011045. 88 ISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYV 167 (536) Q Consensus 88 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~ 167 (536) ++|+.+++.. +++..+++++||+.||++++.+|++||||.++|++|+||++|||||+|++++. 
|++|||++|| T Consensus 81 ~~d~~l~~~~-~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~~------~~~~pl~~y~ 153 (522) T protein:vir:10 81 VRDDKLGEEL-DPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFMGKDG------LKTFPLTRYV 153 (522) T ss_pred CChHHHhhhc-ChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEEcCCC------ceEEEcceEE Confidence 9999888753 56677899999999999999999999999999999999999999999998753 7899999999 Q ss_pred EeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhcc--ccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccc Q lcl|NC_011045. 168 VQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQG--GEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYP 245 (536) Q Consensus 168 v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~--~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~ 245 (536) |.+|++|+||+||||++||+++|+++|+.+..+.. ..++++++|+|||||+|+++.++|.+|++++|+.+++.+|.++ T Consensus 154 v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~~~~~~s~~g 233 (522) T protein:vir:10 154 INRDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKSSGRWVWHQEAFDKIIPDSRSTAP 233 (522) T ss_pred EeeCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeeccCCceEEEEccCCccccccccccc Confidence 99999999999999999999999999998765543 3468899999999999999989999999999999988888889 Q ss_pred cccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCcceecCCcc Q lcl|NC_011045. 
246 KEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPE 325 (536) Q Consensus 246 ~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~~~~g~~~ 325 (536) |++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|||+|+|+|++++.++.++++|++++|.++ T Consensus 234 ~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~l~~~~~~~~v~g~~~ 313 (522) T protein:vir:10 234 KNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPATIAKAGNGAIVQGRPE 313 (522) T ss_pred cccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeccccccccccccCCCCcceecCCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 326 DISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLL 405 (536) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~ 405 (536) ++.++++++++||+.+++.|++++++|+++||+.+ ++++++||||||++|++|++++|||||+||++|||.|+|+|+| T Consensus 314 ~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~~~--~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~ 391 (522) T protein:vir:10 314 DVAVIQVGKTADFSTAANMATAIEKRLLEAFLVMN--VRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTL 391 (522) T ss_pred cceeecccccccchHHHHHHHHHHHHHHHHHhhcc--CCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999753 6899999999999999999999999999999999999999999 Q ss_pred HHHHhcCCCCCCCCcceE---EEEechHHHHHHHHHHHHHHHHHHHHHhh-cchhhhhcCCHHHHHHHHHHHcCCChhhc Q lcl|NC_011045. 
406 KQLQATQQIPELPKEAVE---PTISTGLEAIGRGQDLDKLERCVAAWAAL-APMRDDPDINLAMIKLRIANAIGIDTSGI 481 (536) Q Consensus 406 ~il~~~g~lp~~~~~~v~---v~~vs~La~a~r~~~~~~l~~~~~~~~~~-~p~~~~~~id~d~~~~~~a~~~Gv~p~~i 481 (536) ++|++.|+||++|.+.++ |+|+|+|+++| +++++.+|++.++++ +|++++++||+|++++++++++|||++.| T Consensus 392 ~il~r~g~lP~~p~~~~~~~~v~~is~Laraq---~~~~l~~~~~~i~~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~~i 468 (522) T protein:vir:10 392 LVLQRSNQIPKLPKDIVRPTIVAGVNALGRGQ---DRESLTAFVGTIAQTLGPEALMQYLNPLEAIKRLAAAQGIDVLNL 468 (522) T ss_pred HHHHhcCCCCCCCccccccccccchhHHHHHH---HHHHHHHHHHHHHHhhCchhhhhcCCHHHHHHHHHHHhCCChhhh Confidence 999999999999987654 56777777665 578999999999884 58888889999999999999999987999 Q ss_pred cCCHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHhhhcCcchHHhhhhcCCCCCC Q lcl|NC_011045. 482 LLTEEQKQQKMAQQSMQMGMDN---GAAALAQGMAAQATASPEAMAAAADSVGLQPG 535 (536) Q Consensus 482 ~rs~~ev~~~~~q~~~q~~~~~---~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~ 535 (536) +||+||+++++|++++++++++ ++++.+++++++++++|+++.+.+.. |+- T Consensus 469 vrt~eev~~~~q~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~---~~~ 522 (522) T protein:vir:10 469 VKTEQQLAEEQQAAQQQAAQQSLVDQAGQMTGSPLMDPTKNPQLMDEEQPP---MEE 522 (522) T ss_pred cCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCccccHHHHHHhCCC---CCC Confidence 9999999999887766655554 44556677788888998877664322 222 No 12 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=100.00 E-value=4.2e-162 Score=905.33 Aligned_cols=504 Identities=29% Similarity=0.401 Sum_probs=458.0 Q ss_pred cHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCC-Ccceecc Q lcl|NC_011045. 
9 AEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFPM-QTWMRLT 87 (536) Q Consensus 9 ~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~-~~Wf~l~ 87 (536) -+++|++||++|| |++||++|+||++||+|++++++++++.++..++|||||++|+++||||||++|||+ +|||||+ T Consensus 1 mk~~~~~~~~~lk--R~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) T protein:vir:63 1 MKTTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) T ss_pred ChhHHHHHHHHHh--ccchHHHHHHHHHhhccccCCCCCCccccccCCCccchHHHHHHHHHHHHHhhhcCCCCcccccC Confidence 4899999999996 999999999999999999999999888888899999999999999999999999974 6999999 Q ss_pred CChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEE Q lcl|NC_011045. 88 ISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYV 167 (536) Q Consensus 88 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~ 167 (536) ++|..+++...+....+++++||++||++++.+|++||||.++|++|+||++|||||+|++++. .+|++|||++|| T Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~~~~~----~~~~~~pl~~y~ 154 (510) T protein:vir:63 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRDSDA----ATVVAWSLRSYA 154 (510) T ss_pred CChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEcCCC----cEEEEEEcceeE Confidence 9999999888888889999999999999999999999999999999999999999999998643 369999999999 Q ss_pred EeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCC--ceeEEEEecCccccccccccc Q lcl|NC_011045. 
168 VQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSG--EYIRYEEVEGMEVQGSDGTYP 245 (536) Q Consensus 168 v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~--~~~~~~~v~g~~i~~~~~~~~ 245 (536) |.+|++|+||+||||++||+++|+++|+.+..+...+++++++|+|||+|+|+++.+ .|.+|++++|+.+ +.+|.|+ T Consensus 155 v~~d~~G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~~-~~~~~~~ 233 (510) T protein:vir:63 155 VRRDATGRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRV-GKEGRWP 233 (510) T ss_pred EeeCCCcCeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEEEeecCCCceEEEEEEEecCcee-ccccccc Confidence 999999999999999999999999999999888888899999999999999876532 3567888999887 5678888 Q ss_pred cccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCcceecCCcc Q lcl|NC_011045. 246 KEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPE 325 (536) Q Consensus 246 ~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~~~~g~~~ 325 (536) |++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|||+|+|+|++++.++.++++|++++|+++ T Consensus 234 ~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~ 313 (510) T protein:vir:63 234 IHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAE 313 (510) T ss_pred cccCceeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhhccCCCceeecCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 
326 DISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLL 405 (536) Q Consensus 326 ~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~ 405 (536) ++++++++++++|+.+++.|++++++|+++||++ +.+++++|||||||++|++|++++|||||+||++|||.|||+|+| T Consensus 314 ~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~ 392 (510) T protein:vir:63 314 AVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCL 392 (510) T ss_pred cceeeecCcccchHHHHHHHHHHHHHHHHHHHhh-cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999997 677999999999999999999999999999999999999999999 Q ss_pred HHHHhcCCCCCCCCcceE---EEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhcc Q lcl|NC_011045. 406 KQLQATQQIPELPKEAVE---PTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGIL 482 (536) Q Consensus 406 ~il~~~g~lp~~~~~~v~---v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~ 482 (536) ++|++.|++|++|+ .++ |+|+|+|+++|+.+++.++.++++.+++++ ++++.||+|++++++++++||||+.|+ T Consensus 393 ~il~r~gl~p~p~~-~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~a--q~~~~id~d~~~~~~a~~~Gv~p~~iv 469 (510) T protein:vir:63 393 SEVDDALLQGLITK-QHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIA--QLDPRISLPKMMDTIWAAFSVDTSQFY 469 (510) T ss_pred HHHHhccCCCCCch-hcccceecchhHHHHHHHHHHHHHHHHHHHHhcCch--hhhccCCHHHHHHHHHHHhCCChhHhc Confidence 99999986665554 444 467788888887777777777777766654 367789999999999999999999999 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchH Q lcl|NC_011045. 
483 LTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAM 523 (536) Q Consensus 483 rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~ 523 (536) ||+||++++++++++|+++++++++.+...+.+++..+-.| T Consensus 470 rs~eev~a~~~~~~qq~~~~~~~~~~~~~~a~~~~~~~~g~ 510 (510) T protein:vir:63 470 KSADELQAEAEQQRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC Confidence 99999999998877777766666666555555655555444 No 13 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=100.00 E-value=1.7e-161 Score=902.04 Aligned_cols=509 Identities=29% Similarity=0.459 Sum_probs=460.6 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC- Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFP- 79 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP- 79 (536) |-. +-+.++++|++||+.||++|++|+++|+||++||+|+++++++++ ++..++|||||++|+++||||||++||| T Consensus 1 ~~~-~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~--~~~~~~~dstg~~a~~~LAa~l~~~ltpp 77 (517) T protein:vir:10 1 MDM-RFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDD--LSSQNAWQDDGASATNFLSNKLSQVLFPA 77 (517) T ss_pred Ccc-cccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCC--ccccccccchHHHHHHHHHHHHHHhhcCC Confidence 543 666679999999999999999999999999999999999887654 3457899999999999999999999997 Q ss_pred CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEE Q lcl|NC_011045. 80 MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMK 159 (536) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~ 159 (536) ++|||||.++|+.+++...+.+..++|++||+.||++++.+|++||||.++|++|+||++|||||+|+++.. 
.+|+ T Consensus 78 ~~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~~----~~~~ 153 (517) T protein:vir:10 78 QRSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYHPDKT----SPIQ 153 (517) T ss_pred CCccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEeCCC----CcEE Confidence 569999999999999988889999999999999999999999999999999999999999999999986533 3689 Q ss_pred EEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhc--cccCCCCceEEEEEEEEecCCCCceeEEEEecCccc Q lcl|NC_011045. 160 LYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQ--GGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEV 237 (536) Q Consensus 160 ~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~--~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i 237 (536) +|||++|||.+|++|+|++||||+++|+++|+++|+....+. ...++++++|+|||+|+|+.+ ++|++|++++|+.+ T Consensus 154 ~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~-~~~~~~~~~d~~~~ 232 (517) T protein:vir:10 154 AVPLHHYCVRRDNNGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRTKD-GKYLIRQSADDVPV 232 (517) T ss_pred EEEcCeEEEeeCCCcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEEeCC-CceEEEEEeCceee Confidence 999999999999999999999999999999999999876543 345788999999999999875 47899999999987 Q ss_pred cccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCc Q lcl|NC_011045. 
238 QGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTG 317 (536) Q Consensus 238 ~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g 317 (536) +.+|.|+|++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|||+|+|+|++++.++.++++| T Consensus 233 -~~~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~l~~~~~g 311 (517) T protein:vir:10 233 -GKESTVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQFVEGGSG 311 (517) T ss_pred -ccccccccccCCeeeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhccCCCcc Confidence 5667788999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHH Q lcl|NC_011045. 318 DFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQ 397 (536) Q Consensus 318 ~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l 397 (536) ++++|+++++.++++++++||+.+++.|++++++|+++||++.+.+++++|||||||++|++|++++|||||+||++||| T Consensus 312 ~~~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell 391 (517) T protein:vir:10 312 AVLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEAMTRRDAERVTAYEIQRDAMLVEQSLGGVYSLFATTFQ 391 (517) T ss_pred ccccCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHH Confidence 99999999999999999999999999999999999999999998999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhc--chhhhhcCCHHHHHHHHHHHcC Q lcl|NC_011045. 398 LPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALA--PMRDDPDINLAMIKLRIANAIG 475 (536) Q Consensus 398 ~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~--p~~~~~~id~d~~~~~~a~~~G 475 (536) .|+|+|+|++|.+. 
+|...++|+|+|||++++|++++++|.+|++.+++++ |++++++||+|++++++++++| T Consensus 392 ~Pli~r~~~~l~~~-----l~~~~v~~~~~s~la~l~r~~~~~~i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~G 466 (517) T protein:vir:10 392 GPLARWFMNGISSI-----LTSKNVSPTILTGIEALGRMAELDKLGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQIS 466 (517) T ss_pred HHHHHHHHHHhhhh-----cCCCCccceeeccHHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHhC Confidence 99999999998654 4555789999999999999999999999999998864 6778889999999999999999 Q ss_pred CChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---hcCcchHHhhhhcCCCC Q lcl|NC_011045. 476 IDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQA---TASPEAMAAAADSVGLQ 533 (536) Q Consensus 476 v~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~---~~~~~~~~~~~~~~~~q 533 (536) | |.+++||++||+++++++++++++++++.++++++++++ +.+|++. | T Consensus 467 v-p~~~irs~~ev~~~~~~~~~~~~~~~~~~~ag~~~~~~~~~~~~~~~~~---------~ 517 (517) T protein:vir:10 467 A-NFPFFKTQDELNAEAQAQQEQEATKYAAEQAGKAIPDMVKNGQINPQGG---------Q 517 (517) T ss_pred C-ChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCC---------C Confidence 9 568999999999999988877777666554444433322 2222222 2 No 14 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=100.00 E-value=7.7e-160 Score=892.90 Aligned_cols=517 Identities=36% Similarity=0.551 Sum_probs=456.0 Q ss_pred cHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC-CCcceecc Q lcl|NC_011045. 9 AEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFP-MQTWMRLT 87 (536) Q Consensus 9 ~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP-~~~Wf~l~ 87 (536) =+++|++||+.|+++|++||++|+||++||+|++++++++++..+..++|||||++|+++|||||||+||| ++|||||. 
T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~ 80 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNTSFFKLQ 80 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 27889999999999999999999999999999999999999988999999999999999999999999997 56999999 Q ss_pred CChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEE Q lcl|NC_011045. 88 ISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYV 167 (536) Q Consensus 88 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~ 167 (536) +.|+.++++..+...++.+++||++||++++.+|++||||.++|++|+||++|||||+|++++. +++|||++|| T Consensus 81 ~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~~------~~~~pl~~y~ 154 (555) T protein:vir:17 81 INDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLYQGKKN------LKLYPLDRFV 154 (555) T ss_pred cCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEecCCc------eeEEEcCeEE Confidence 9999999988888899999999999999999999999999999999999999999999997643 7899999999 Q ss_pred EeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhcc---------------------ccCCCCceEEEEEEEEecCCCCce Q lcl|NC_011045. 168 VQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQG---------------------GEKKADETIDVYTHIYLDEDSGEY 226 (536) Q Consensus 168 v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~---------------------~~~~~~~~~~v~~~v~p~~~~~~~ 226 (536) |.+|++|+||+||||++||+++|+++|+++..+.. .+.+++.++++|+++.. 
..++| T Consensus 155 v~~d~~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~~~~--~~~~~ 232 (555) T protein:vir:17 155 VSRDGEGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTYVCR--KDGQV 232 (555) T ss_pred EeeCCCcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcceeEeecccc--cCCee Confidence 99999999999999999999999999997643211 12234556777877643 24578 Q ss_pred eEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccc Q lcl|NC_011045. 227 IRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGIT 306 (536) Q Consensus 227 ~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~ 306 (536) .+|++++|+.+.+..+.++|++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|||+|+|+|++ T Consensus 233 ~~~~e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~lv~~~g~~ 312 (555) T protein:vir:17 233 KWHQECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVVFMVSPSATT 312 (555) T ss_pred EEEEecCceeccccccccCcccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccc Confidence 89999999999877788889999999999999999999999999999999999999999999999999999999999999 Q ss_pred chhhhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhh Q lcl|NC_011045. 
307 QPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLG 386 (536) Q Consensus 307 ~~~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG 386 (536) ++.++.++++|+|++|+++++.+++++++++|+.+++.|++++++|+++||+++ .+++++||||||++|++|++++|| T Consensus 313 ~~~~l~~~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~~~--~~d~~r~TAtEV~~r~~E~~~~LG 390 (555) T protein:vir:17 313 KPQNLALAANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLMLQ--VRQSERTTATEVQATVQELNEQIG 390 (555) T ss_pred CcceeecCCCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhhcC--CCCcccchHHHHHHHHHHHHHHHh Confidence 999999999999999999999999999999999999999999999999999864 489999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhc-chhhhhcCCHHH Q lcl|NC_011045. 387 GVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALA-PMRDDPDINLAM 465 (536) Q Consensus 387 ~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~-p~~~~~~id~d~ 465 (536) |||+||++|||.|+|+|+|++|+|.|+||++|+++++++|++++++++|+++++++++|++.++++. |..+.++||+|+ T Consensus 391 pv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~~~l~~l~r~~~~~~l~~~~~~laq~~~~p~~~d~id~d~ 470 (555) T protein:vir:17 391 GIYSNLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVVAGLWGVGRGQDKQQLMEFITTLAQTMGPEIAMKYINPTE 470 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhccceeehHHHHHHHHHHHHHHHHHHHHHhhcCchhHhhcCCHHH Confidence 9999999999999999999999999999999999999999999999999999999999999999975 444556799999 Q ss_pred HHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHH-hhhc-----------CcchHHhhhhcC Q lcl|NC_011045. 466 IKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDN---GAAALAQGMAA-QATA-----------SPEAMAAAADSV 530 (536) Q Consensus 466 ~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~---~a~~~~~~~~~-~~~~-----------~~~~~~~~~~~~ 530 (536) ++++|++++||||+.|+||+||+++++|++++++++++ ++++.++.+++ ++.. .+.+++...-.. 
T Consensus 471 ~~~~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~~~qa~~~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~~~ 550 (555) T protein:vir:17 471 FIKRLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASLINQAGQLAKTPMAEQAMQLIQQQQEGAQDAGAAESETSSAE 550 (555) T ss_pred HHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHhccccchhhhhHHHHHHhhcCCcc Confidence 99999999999999999999999999887766555444 33333332211 1111 111111111112 Q ss_pred CCCCC Q lcl|NC_011045. 531 GLQPG 535 (536) Q Consensus 531 ~~q~~ 535 (536) |.=|| T Consensus 551 ~~~~~ 555 (555) T protein:vir:17 551 AQAGA 555 (555) T ss_pred cccCC Confidence 22222 No 15 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=100.00 E-value=2.3e-160 Score=895.74 Aligned_cols=507 Identities=28% Similarity=0.417 Sum_probs=460.9 Q ss_pred CCC---ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_011045. 1 MAE---KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLAL 77 (536) Q Consensus 1 Ma~---~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536) |++ .+++.++++|++||+.|+++|++||++|+||++||+|++++++++. ++.+++|||||++|+++||||||++| T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~--~~~~~~~dstg~~a~~~LAa~l~~~l 78 (516) T protein:vir:96 1 MKQSIDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGDN--ETSQNGWQGVGAQATNHLANKLAQVL 78 (516) T ss_pred CcchhhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcccccCCCCCc--cccCCcccchHHHHHHHHHHHHHhhh Confidence 766 6888999999999999999999999999999999999999877654 34568999999999999999999999 Q ss_pred cC-CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCcee Q lcl|NC_011045. 
78 FP-MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYN 156 (536) Q Consensus 78 tP-~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~ 156 (536) || ++|||||+++|..++.+.....+.+++++||++||++++.+|++||||.++|++|+||++|||||+|+++++ T Consensus 79 tpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~----- 153 (516) T protein:vir:96 79 FPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSKG----- 153 (516) T ss_pred cCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCCC----- Confidence 97 569999999998888877777788999999999999999999999999999999999999999999997654 Q ss_pred eEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhh--ccccCCCCceEEEEEEEEecCCCCceeEEEEecC Q lcl|NC_011045. 157 PMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEG--QGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEG 234 (536) Q Consensus 157 ~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~--~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g 234 (536) +|++|||++|||.+|++|+|++||||+++++++|+++|+..... ...+++++++|+|||+|+|++ ++.|.+|++++| T Consensus 154 ~~~~~pl~~y~v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~-~~~~~~~~~~d~ 232 (516) T protein:vir:96 154 AISAIPMHHYVVNRDTNGDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLG-DGFWELKQSADD 232 (516) T ss_pred CEEEEEcCeEEEeeCCCCCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeC-CceeEEEEEeCc Confidence 38999999999999999999999999999999999999765432 234568899999999999876 458999999999 Q ss_pred ccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccC Q lcl|NC_011045. 
235 MEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKA 314 (536) Q Consensus 235 ~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~ 314 (536) +.+ +.+|.|+|++|||+++||++.+||+||||||+++|||+|+||.|+++++++++++++|||+|+|+|++++.++.++ T Consensus 233 ~~~-~~es~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~~ 311 (516) T protein:vir:96 233 IPV-GKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVNS 311 (516) T ss_pred eee-ccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhhhccC Confidence 987 5667888999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHH Q lcl|NC_011045. 315 QTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQ 394 (536) Q Consensus 315 ~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~ 394 (536) ++|+|++|+++++.++++++++||+.++..|++++++|+++||++.+.+++++|||||||++|++|++.+|||||+||++ T Consensus 312 ~~g~i~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~r~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ 391 (516) T protein:vir:96 312 GTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAT 391 (516) T ss_pred CCceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhccCCCccccHHHHHHHHHHHHHHhhhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhc---chhhhhcCCHHHHHHHHH Q lcl|NC_011045. 395 ELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALA---PMRDDPDINLAMIKLRIA 471 (536) Q Consensus 395 E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~---p~~~~~~id~d~~~~~~a 471 (536) |||.|+|+|++.++ .|++|+.+++++|+|+|++++|++++++|.+|++.++++. 
|+++ ++||+|+++++++ T Consensus 392 Ell~Pli~r~l~~~-----~p~lp~~~v~~~~vs~l~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~-d~id~d~~~~~~a 465 (516) T protein:vir:96 392 TMQSPVAMWGLLEA-----GESFTSDLVDPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVL-AAVKWPDYMDWVR 465 (516) T ss_pred HHHHHHHHHHHHhc-----CCCCccccccceeechHHHHHHHHHHHHHHHHHHHHHHHhcCChhHH-hcCCHHHHHHHHH Confidence 99999999998775 3889999999999999999999999999999999987764 5555 5799999999999 Q ss_pred HHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCCC Q lcl|NC_011045. 472 NAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPG 535 (536) Q Consensus 472 ~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~ 535 (536) +++|| |..++||+|||+++++|+++++++++++.++++++..+++..+ |.| T Consensus 466 ~~~Gv-p~~~irs~eev~~~~~~~~~~q~~~~~a~~~~~~~~~~~~~~~------------~~~ 516 (516) T protein:vir:96 466 GQISA-ELPFLKSAEEMAQEQEAQMQAQQAQMLEEGVAKAVPGVIQQEL------------KEA 516 (516) T ss_pred HHhCC-CccccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhccc------------ccC Confidence 99999 5679999999999999999888888777766655543333222 222 No 16 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=100.00 E-value=9.9e-160 Score=892.30 Aligned_cols=505 Identities=27% Similarity=0.392 Sum_probs=449.2 Q ss_pred cHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccC--CCCCcccccccccccchHHHHHHHHHHHHHHhhcC-CCccee Q lcl|NC_011045. 9 AEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFP--KDSDNASTDYVTPWQAVGARGLNNLASKLMLALFP-MQTWMR 85 (536) Q Consensus 9 ~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~--~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP-~~~Wf~ 85 (536) =++++.+.|. |.+|++||++|+||++||+|+++. 
.+++++..+..++|||||++|+++||||||++||| ++|||| T Consensus 1 m~~~~~~l~~--k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 78 (514) T protein:vir:80 1 MRQQASAMWA--EYRDSTAIRKAEDFAKFTIASLMVDPLDKTHQAEVVEYDFQSAGAFLVNNLTAKLALTLFPPGRPSFQ 78 (514) T ss_pred CccchHHHHH--HhhcchHHHHHHHHHHHhcccccCCCCCCcccccccccccchhHHHHHHHHHHHHHhhhcCCCCcccc Confidence 1444445554 667999999999999999999874 45566667788999999999999999999999997 569999 Q ss_pred ccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecce Q lcl|NC_011045. 86 LTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSS 165 (536) Q Consensus 86 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~ 165 (536) |+++|...+....+..+.+++++||++||++++.+|++||||.++|++|+||++|||||+|++++.. +|++|||++ T Consensus 79 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~----~~~~~pl~~ 154 (514) T protein:vir:80 79 IELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYREPGTG----KMLVWTMQS 154 (514) T ss_pred cccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEecCCC----cEEEEEcCe Confidence 9999877766666777889999999999999999999999999999999999999999999987543 589999999 Q ss_pred EEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCC--CCceeEEEEecCccccccccc Q lcl|NC_011045. 166 YVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDED--SGEYIRYEEVEGMEVQGSDGT 243 (536) Q Consensus 166 ~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~--~~~~~~~~~v~g~~i~~~~~~ 243 (536) |||.+|++|+|++||||++||+++|+++|+.+..+...+++++++|+|||||+|+++ +.+|.+|++++|+.+ +.+|. 
T Consensus 155 y~v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~g~~i-~~es~ 233 (514) T protein:vir:80 155 YTVRRTSHGDPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVIEWQPTPNGKRCAVWHELEGKRV-GPESS 233 (514) T ss_pred EEEeeCCCcCeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEEEeecCCCCeEEEEEEeccceee-cccCc Confidence 999999999999999999999999999999998888788889999999999998754 446889999999998 56788 Q ss_pred cccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCcceecCC Q lcl|NC_011045. 244 YPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGR 323 (536) Q Consensus 244 ~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~~~~g~ 323 (536) |+|++|||+++||++.+||+|||||++++|||+|+||.|+++.+++++++++|||+|+|+|++++.++.++++|++++|+ T Consensus 234 y~~~e~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~~~~~~l~~~~~g~~v~g~ 313 (514) T protein:vir:80 234 YPAHLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNLVDEAKGGAVDDYRDAETGDFVPGQ 313 (514) T ss_pred cccccCCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCcccccchhhhcccCCceeecCC Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHH Q lcl|NC_011045. 324 PEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRV 403 (536) Q Consensus 324 ~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r 403 (536) ++++.+++++++++|+.+++.|++++++|+++||++... 
+++++||||||++|++|++++|||||+||++|||.|||+| T Consensus 314 ~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFml~~~~-rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r 392 (514) T protein:vir:80 314 VGSVASYERGDYNKIAQASASVESIVMRLNRAFMYTGQV-RDAERVTVEEIRTVAEEAENLLGGVYSLLAETLQAPLAYL 392 (514) T ss_pred CccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhccC-CCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999998765 8999999999999999999999999999999999999999 Q ss_pred HHHHHHh--cCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchh--hhhcCCHHHHHHHHHHHcCCChh Q lcl|NC_011045. 404 LLKQLQA--TQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMR--DDPDINLAMIKLRIANAIGIDTS 479 (536) Q Consensus 404 ~~~il~~--~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~--~~~~id~d~~~~~~a~~~Gv~p~ 479 (536) +|++|++ .|.||++|++.++++|+|+|++++|++++++|.+|++.+++++|.. ++++||+|++++++|+++|||++ T Consensus 393 ~~~il~r~~~g~lP~~p~~l~~~~~vs~la~l~r~~~~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~ 472 (514) T protein:vir:80 393 TMYEASRGNGGMLLGIAQGVYRPSIITGIPALTRNIETANILRATQEASAIVPALVQLSKRFDPEKLVERIFANNSVDLS 472 (514) T ss_pred HHHHHhhhccCCCCCCCchhhcceeeecHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHhCCCHh Confidence 9999987 4899999999999999999999999999999999999998887642 45679999999999999999777 Q ss_pred hccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcc Q lcl|NC_011045. 480 GILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPE 521 (536) Q Consensus 480 ~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~ 521 (536) .|++|+|++++.+++++++++++++.++..+....+.+.-|. 
T Consensus 473 ~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (514) T protein:vir:80 473 TLSKDPDVVAAEAEQEAALAQQQLDVASGALAAETSAGVLTS 514 (514) T ss_pred hccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccCC Confidence 899998888877766665555444444332222222222222 No 17 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=100.00 E-value=2e-159 Score=890.64 Aligned_cols=513 Identities=15% Similarity=0.162 Sum_probs=442.4 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccC------CCCCcccccccccccchHHHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFP------KDSDNASTDYVTPWQAVGARGLNNLASKLM 74 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~------~~~~~~~~~~~~~~dst~~~a~~~Laa~l~ 74 (536) |+.+...+ ++++++||+.|+++|++||++|+||++||+|++.. .+++.+.++..++|||||++|+++|||||| T Consensus 1 m~~d~~~~-~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAs~l~ 79 (549) T protein:vir:10 1 MTNDDAKI-LQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFDSTAPLALRNFVAAMD 79 (549) T ss_pred CCcchHHH-HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccccccccchHHHHHHHHHHHHH Confidence 98866544 78999999999999999999999999999998743 344566777899999999999999999999 Q ss_pred HhhcC-CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHH--HhccChHHHHHHHHHHHhhCcEEEEEecCC Q lcl|NC_011045. 75 LALFP-MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYI--ESNSYRVTLFEALKQLVVAGNVLLYLPEPE 151 (536) Q Consensus 75 ~~ltP-~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l--~~snf~~~~~~~~~dl~~~G~~~l~~~~~~ 151 (536) ++||| ++|||||.+.|+.+++. +++++||++||++++..+ ++||||.++|++|+||++|||||+|++++. 
T Consensus 80 ~~ltpp~~~wF~l~~~~~~~~e~-------~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~ 152 (549) T protein:vir:10 80 SMITPATQLWHRLKTGNDALNEI-------ASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDV 152 (549) T ss_pred hhccCCCCccccccCCccchhhh-------hHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeecC Confidence 99997 56999999999876653 479999999999999965 589999999999999999999999999876 Q ss_pred CCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhc----cccCCCCceEEEEEEEEecCC----- Q lcl|NC_011045. 152 GSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQ----GGEKKADETIDVYTHIYLDED----- 222 (536) Q Consensus 152 ~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~----~~~~~~~~~~~v~~~v~p~~~----- 222 (536) + ++++|++|||++|||.+|++|+||+|||||+||+++|+++|+.+.++. ..+++++++|+|||+|+||.+ T Consensus 153 ~-~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~~V~pr~~~~~~~ 231 (549) T protein:vir:10 153 G-KGIVYRNVPMQRLWFAENNSGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADRDPRK 231 (549) T ss_pred C-CeeEEEEEEcCeEEEeeCCCCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEEEeecCCCCCccc Confidence 6 568999999999999999999999999999999999999998764432 335678999999999999865 Q ss_pred -CCceeEEE----EecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_011045. 223 -SGEYIRYE----EVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVI 297 (536) Q Consensus 223 -~~~~~~~~----~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~ 297 (536) ++++++|. 
++++..++.++| |++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|| T Consensus 232 ~~~~~~pf~sv~~e~~~~~il~esg---~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~ 308 (549) T protein:vir:10 232 LDGRNMQFASYWLDEGRDRIVQNSG---FRTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPP 308 (549) T ss_pred cccccCceEEEEEEecCCEeeccCC---cccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Confidence 33445443 445666655554 4799999999999999999999999999999999999999999999999999 Q ss_pred eeeccccccchhhhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCCCCHHHHHH Q lcl|NC_011045. 298 GLVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGERVTAEEIRY 376 (536) Q Consensus 298 ~lv~~~g~~~~~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~r~TAtEi~~ 376 (536) |+|+++|++++.++.+++.+++..+..++..+.|++++++|+.+++.|++++++|+++||.+. .+++++++||||||++ T Consensus 309 ~~v~~~g~~~~~~l~pgg~~~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV~~ 388 (549) T protein:vir:10 309 LLANEDGVLDGFDLRSGALNWGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQ 388 (549) T ss_pred eeeccccccccceeccCCccccccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHHHH Confidence 999999999999999999888887777777888999999999999999999999999999997 4558999999999999 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCc------ceEEEEechHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 
377 VASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKE------AVEPTISTGLEAIGRGQDLDKLERCVAAWA 450 (536) Q Consensus 377 r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~------~v~v~~vs~La~a~r~~~~~~l~~~~~~~~ 450 (536) |++|++++|||||+||++|||.|+|+|+|++|++.|.||++|++ .++|+|||||+++||+.+++++++|++.++ T Consensus 389 r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i~yis~La~aq~~~~~~~i~~~~~~~~ 468 (549) T protein:vir:10 389 RAQEKGVLLAPTLGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQLG 468 (549) T ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEEEeecHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999998865 378999999999999999999999988765 Q ss_pred ---hhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHhhhcCcchHHh Q lcl|NC_011045. 451 ---ALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALA--QGMAAQATASPEAMAA 525 (536) Q Consensus 451 ---~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~--~~~~~~~~~~~~~~~~ 525 (536) |++|++++ +||+|++++++++++|| |.+++||++||+++|+++++|+|+++++++.. ++.+...+.+..+ T Consensus 469 ~laq~~Pe~ld-~id~d~~~~~~a~~~Gv-p~~~irs~eev~~~r~~~~~qqq~~~~~~~a~~a~~~a~~~~~~~ta--- 543 (549) T protein:vir:10 469 IVSQFDPAAAK-VPNGARIARLLADYGGV-PVEAMSTDEELQAQQAAEAQAAQMQQMLAAAPVAAGAIKDLSDAQTA--- 543 (549) T ss_pred HHhccChhHHh-cCCHHHHHHHHHHhcCC-CccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCC--- Confidence 46788765 69999999999999999 56899999999999998877666655433221 1222222221111 Q ss_pred hhhcCCCCCCC Q lcl|NC_011045. 
526 AADSVGLQPGI 536 (536) Q Consensus 526 ~~~~~~~q~~~ 536 (536) .|-|- T Consensus 544 ------~~~~~ 548 (549) T protein:vir:10 544 ------AQTAR 548 (549) T ss_pred ------CcccC Confidence 12222 No 18 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=100.00 E-value=3.6e-159 Score=889.26 Aligned_cols=517 Identities=19% Similarity=0.200 Sum_probs=445.9 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc---cCCCCCcccccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL---FPKDSDNASTDYVTPWQAVGARGLNNLASKLMLAL 77 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~---~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536) |++.. ++++|++||+.|+++|++||++|+||++||+|++ +..++++++++..++|||||++|+++||||||++| T Consensus 1 M~~~~---~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~l 77 (555) T protein:vir:10 1 MAEQT---ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGM 77 (555) T ss_pred CCCcc---cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhh Confidence 88766 4899999999999999999999999999999995 45567778888999999999999999999999999 Q ss_pred cC-CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCcee Q lcl|NC_011045. 
78 FP-MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYN 156 (536) Q Consensus 78 tP-~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~ 156 (536) || ++|||||.+.|+++++ .+++++||++||++++++|++||||.++|++|+||++|||||+|++++.+ +++ T Consensus 78 tpp~~~WF~l~~~d~~l~e-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~-~~~ 149 (555) T protein:vir:10 78 TSPARPWFRLTTSIPELDE-------SAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFD-AVV 149 (555) T ss_pred cCCCCcccccccCcccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCC-ceE Confidence 97 5699999999887654 35799999999999999999999999999999999999999999988776 568 Q ss_pred eEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhh-----ccccCCCCceEEEEEEEEecCC------CCc Q lcl|NC_011045. 157 PMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEG-----QGGEKKADETIDVYTHIYLDED------SGE 225 (536) Q Consensus 157 ~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~-----~~~~~~~~~~~~v~~~v~p~~~------~~~ 225 (536) +|++|||++|||.+|++|+||+|||||+||+++|+++|+.+.++ ...++.++++|+|+|+|+||.+ +++ T Consensus 150 rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~ 229 (555) T protein:vir:10 150 YHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDR 229 (555) T ss_pred EEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCcc Confidence 99999999999999999999999999999999999999865443 3344455778999999999765 345 Q ss_pred eeEEEEe------cCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCcee Q lcl|NC_011045. 
226 YIRYEEV------EGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGL 299 (536) Q Consensus 226 ~~~~~~v------~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~l 299 (536) +++|.|+ +|+.++.++| |++|||+++||++.+||+|||||++++|||+|+||.|+++.+++++++++|||+ T Consensus 230 ~~p~~s~~~~~~~d~~~vl~esg---y~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~ 306 (555) T protein:vir:10 230 NMAWKSVYFEPGADETRTLRESG---YRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQ 306 (555) T ss_pred ccceEEEEEEeccCCccccccCC---cccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 5555543 5666655554 479999999999999999999999999999999999999999999999999999 Q ss_pred eccccccchhhhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhh---hcccCCCCCCCHHHHHH Q lcl|NC_011045. 300 VNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN---SAVQRTGERVTAEEIRY 376 (536) Q Consensus 300 v~~~g~~~~~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~---~~~~~~~~r~TAtEi~~ 376 (536) |+++|.+++.++.|++.|++.+|..++....++++++||+.+.+.|++++++|+++||.| ++.++++++||||||++ T Consensus 307 v~~~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~ 386 (555) T protein:vir:10 307 LPVSAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAE 386 (555) T ss_pred eccccccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHH Confidence 999999999999999999999998888777778999999999999999999999999987 67779999999999999 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCc----ceEEEEechHHHHHHHHHHHHHHHHHHHHH-- Q lcl|NC_011045. 
377 VASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKE----AVEPTISTGLEAIGRGQDLDKLERCVAAWA-- 450 (536) Q Consensus 377 r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~----~v~v~~vs~La~a~r~~~~~~l~~~~~~~~-- 450 (536) |++|++++|||||+||++|||.|+|+|+|++|++.|+||++|++ .|+|+|+|||+++||+.++.+|.+|++.++ T Consensus 387 r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~l 466 (555) T protein:vir:10 387 RHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAV 466 (555) T ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999998864 489999999999999999999999887765 Q ss_pred -hhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHhhhcC-----cch Q lcl|NC_011045. 451 -ALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGM--AAQATAS-----PEA 522 (536) Q Consensus 451 -~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~--~~~~~~~-----~~~ 522 (536) |+.|++++ +||+|++++++++++|| |.+++||++||+++|+|+++|+|++++++.+.+++ ++..+.. ... T Consensus 467 aq~~P~vld-~id~d~~~~~~a~~~Gv-p~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~ 544 (555) T protein:vir:10 467 AGIKPEVLD-KFDADRWADTYADMLGI-DPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNAL 544 (555) T ss_pred hcCChhhhh-cCCHHHHHHHHHHHhCC-CccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhH Confidence 45677765 69999999999999999 56899999999999999887777766555433332 1111111 001 Q ss_pred HHhhhhcCCCC Q lcl|NC_011045. 
523 MAAAADSVGLQ 533 (536) Q Consensus 523 ~~~~~~~~~~q 533 (536) ......-+|-- T Consensus 545 ~~~~~~~~~~~ 555 (555) T protein:vir:10 545 TDVTRAFSGYT 555 (555) T ss_pred HHHHhhhccCC Confidence 11111111111 No 19 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=100.00 E-value=3.6e-159 Score=889.26 Aligned_cols=517 Identities=19% Similarity=0.200 Sum_probs=445.9 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc---cCCCCCcccccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL---FPKDSDNASTDYVTPWQAVGARGLNNLASKLMLAL 77 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~---~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536) |++.. ++++|++||+.|+++|++||++|+||++||+|++ +..++++++++..++|||||++|+++||||||++| T Consensus 1 M~~~~---~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~l 77 (555) T protein:vir:10 1 MAEQT---ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGM 77 (555) T ss_pred CCCcc---cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhh Confidence 88766 4899999999999999999999999999999995 45567778888999999999999999999999999 Q ss_pred cC-CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCcee Q lcl|NC_011045. 
78 FP-MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYN 156 (536) Q Consensus 78 tP-~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~ 156 (536) || ++|||||.+.|+++++ .+++++||++||++++++|++||||.++|++|+||++|||||+|++++.+ +++ T Consensus 78 tpp~~~WF~l~~~d~~l~e-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~-~~~ 149 (555) T protein:vir:10 78 TSPARPWFRLTTSIPELDE-------SAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFD-AVV 149 (555) T ss_pred cCCCCcccccccCcccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCC-ceE Confidence 97 5699999999887654 35799999999999999999999999999999999999999999988776 568 Q ss_pred eEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhh-----ccccCCCCceEEEEEEEEecCC------CCc Q lcl|NC_011045. 157 PMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEG-----QGGEKKADETIDVYTHIYLDED------SGE 225 (536) Q Consensus 157 ~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~-----~~~~~~~~~~~~v~~~v~p~~~------~~~ 225 (536) +|++|||++|||.+|++|+||+|||||+||+++|+++|+.+.++ ...++.++++|+|+|+|+||.+ +++ T Consensus 150 rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~ 229 (555) T protein:vir:10 150 YHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDR 229 (555) T ss_pred EEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCcc Confidence 99999999999999999999999999999999999999865443 3344455778999999999765 345 Q ss_pred eeEEEEe------cCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCcee Q lcl|NC_011045. 
226 YIRYEEV------EGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGL 299 (536) Q Consensus 226 ~~~~~~v------~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~l 299 (536) +++|.|+ +|+.++.++| |++|||+++||++.+||+|||||++++|||+|+||.|+++.+++++++++|||+ T Consensus 230 ~~p~~s~~~~~~~d~~~vl~esg---y~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~ 306 (555) T protein:vir:10 230 NMAWKSVYFEPGADETRTLRESG---YRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQ 306 (555) T ss_pred ccceEEEEEEeccCCccccccCC---cccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 5555543 5666655554 479999999999999999999999999999999999999999999999999999 Q ss_pred eccccccchhhhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhh---hcccCCCCCCCHHHHHH Q lcl|NC_011045. 300 VNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN---SAVQRTGERVTAEEIRY 376 (536) Q Consensus 300 v~~~g~~~~~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~---~~~~~~~~r~TAtEi~~ 376 (536) |+++|.+++.++.|++.|++.+|..++....++++++||+.+.+.|++++++|+++||.| ++.++++++||||||++ T Consensus 307 v~~~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~ 386 (555) T protein:vir:10 307 LPVSAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAE 386 (555) T ss_pred eccccccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHH Confidence 999999999999999999999998888777778999999999999999999999999987 67779999999999999 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCc----ceEEEEechHHHHHHHHHHHHHHHHHHHHH-- Q lcl|NC_011045. 
377 VASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKE----AVEPTISTGLEAIGRGQDLDKLERCVAAWA-- 450 (536) Q Consensus 377 r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~----~v~v~~vs~La~a~r~~~~~~l~~~~~~~~-- 450 (536) |++|++++|||||+||++|||.|+|+|+|++|++.|+||++|++ .|+|+|+|||+++||+.++.+|.+|++.++ T Consensus 387 r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~l 466 (555) T protein:vir:10 387 RHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAV 466 (555) T ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999998864 489999999999999999999999887765 Q ss_pred -hhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHhhhcC-----cch Q lcl|NC_011045. 451 -ALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGM--AAQATAS-----PEA 522 (536) Q Consensus 451 -~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~--~~~~~~~-----~~~ 522 (536) |+.|++++ +||+|++++++++++|| |.+++||++||+++|+|+++|+|++++++.+.+++ ++..+.. ... T Consensus 467 aq~~P~vld-~id~d~~~~~~a~~~Gv-p~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~ 544 (555) T protein:vir:10 467 AGIKPEVLD-KFDADRWADTYADMLGI-DPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNAL 544 (555) T ss_pred hcCChhhhh-cCCHHHHHHHHHHHhCC-CccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhH Confidence 45677765 69999999999999999 56899999999999999887777766555433332 1111111 001 Q ss_pred HHhhhhcCCCC Q lcl|NC_011045. 
523 MAAAADSVGLQ 533 (536) Q Consensus 523 ~~~~~~~~~~q 533 (536) ......-+|-- T Consensus 545 ~~~~~~~~~~~ 555 (555) T protein:vir:10 545 TDVTRAFSGYT 555 (555) T ss_pred HHHHhhhccCC Confidence 11111111111 No 20 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=100.00 E-value=3.6e-159 Score=889.26 Aligned_cols=517 Identities=19% Similarity=0.200 Sum_probs=445.9 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc---cCCCCCcccccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL---FPKDSDNASTDYVTPWQAVGARGLNNLASKLMLAL 77 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~---~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536) |++.. ++++|++||+.|+++|++||++|+||++||+|++ +..++++++++..++|||||++|+++||||||++| T Consensus 1 M~~~~---~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~l 77 (555) T protein:vir:98 1 MAEQT---ERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGM 77 (555) T ss_pred CCCcc---cHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhh Confidence 88766 4899999999999999999999999999999995 45567778888999999999999999999999999 Q ss_pred cC-CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCcee Q lcl|NC_011045. 
78 FP-MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYN 156 (536) Q Consensus 78 tP-~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~ 156 (536) || ++|||||.+.|+++++ .+++++||++||++++++|++||||.++|++|+||++|||||+|++++.+ +++ T Consensus 78 tpp~~~WF~l~~~d~~l~e-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~-~~~ 149 (555) T protein:vir:98 78 TSPARPWFRLTTSIPELDE-------SAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFD-AVV 149 (555) T ss_pred cCCCCcccccccCcccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCC-ceE Confidence 97 5699999999887654 35799999999999999999999999999999999999999999988776 568 Q ss_pred eEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhh-----ccccCCCCceEEEEEEEEecCC------CCc Q lcl|NC_011045. 157 PMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEG-----QGGEKKADETIDVYTHIYLDED------SGE 225 (536) Q Consensus 157 ~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~-----~~~~~~~~~~~~v~~~v~p~~~------~~~ 225 (536) +|++|||++|||.+|++|+||+|||||+||+++|+++|+.+.++ ...++.++++|+|+|+|+||.+ +++ T Consensus 150 rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~ 229 (555) T protein:vir:98 150 YHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDR 229 (555) T ss_pred EEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCcc Confidence 99999999999999999999999999999999999999865443 3344455778999999999765 345 Q ss_pred eeEEEEe------cCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCcee Q lcl|NC_011045. 
226 YIRYEEV------EGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGL 299 (536) Q Consensus 226 ~~~~~~v------~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~l 299 (536) +++|.|+ +|+.++.++| |++|||+++||++.+||+|||||++++|||+|+||.|+++.+++++++++|||+ T Consensus 230 ~~p~~s~~~~~~~d~~~vl~esg---y~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~ 306 (555) T protein:vir:98 230 NMAWKSVYFEPGADETRTLRESG---YRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQ 306 (555) T ss_pred ccceEEEEEEeccCCccccccCC---cccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 5555543 5666655554 479999999999999999999999999999999999999999999999999999 Q ss_pred eccccccchhhhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhh---hcccCCCCCCCHHHHHH Q lcl|NC_011045. 300 VNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN---SAVQRTGERVTAEEIRY 376 (536) Q Consensus 300 v~~~g~~~~~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~---~~~~~~~~r~TAtEi~~ 376 (536) |+++|.+++.++.|++.|++.+|..++....++++++||+.+.+.|++++++|+++||.| ++.++++++||||||++ T Consensus 307 v~~~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~ 386 (555) T protein:vir:98 307 LPVSAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAE 386 (555) T ss_pred eccccccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHH Confidence 999999999999999999999998888777778999999999999999999999999987 67779999999999999 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCc----ceEEEEechHHHHHHHHHHHHHHHHHHHHH-- Q lcl|NC_011045. 
377 VASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKE----AVEPTISTGLEAIGRGQDLDKLERCVAAWA-- 450 (536) Q Consensus 377 r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~----~v~v~~vs~La~a~r~~~~~~l~~~~~~~~-- 450 (536) |++|++++|||||+||++|||.|+|+|+|++|++.|+||++|++ .|+|+|+|||+++||+.++.+|.+|++.++ T Consensus 387 r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~l 466 (555) T protein:vir:98 387 RHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAV 466 (555) T ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999998864 489999999999999999999999887765 Q ss_pred -hhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHhhhcC-----cch Q lcl|NC_011045. 451 -ALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGM--AAQATAS-----PEA 522 (536) Q Consensus 451 -~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~--~~~~~~~-----~~~ 522 (536) |+.|++++ +||+|++++++++++|| |.+++||++||+++|+|+++|+|++++++.+.+++ ++..+.. ... T Consensus 467 aq~~P~vld-~id~d~~~~~~a~~~Gv-p~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~ 544 (555) T protein:vir:98 467 AGIKPEVLD-KFDADRWADTYADMLGI-DPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNAL 544 (555) T ss_pred hcCChhhhh-cCCHHHHHHHHHHHhCC-CccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhH Confidence 45677765 69999999999999999 56899999999999999887777766555433332 1111111 001 Q ss_pred HHhhhhcCCCC Q lcl|NC_011045. 
523 MAAAADSVGLQ 533 (536) Q Consensus 523 ~~~~~~~~~~q 533 (536) ......-+|-- T Consensus 545 ~~~~~~~~~~~ 555 (555) T protein:vir:98 545 TDVTRAFSGYT 555 (555) T ss_pred HHHHhhhccCC Confidence 11111111111 No 21 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=100.00 E-value=7.8e-159 Score=887.39 Aligned_cols=507 Identities=30% Similarity=0.453 Sum_probs=455.4 Q ss_pred CCC--ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 1 MAE--KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALF 78 (536) Q Consensus 1 Ma~--~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 78 (536) |-+ .+.+.++++|++||+.|+++|++||++|+||++||+|++|+++++.. ...++|||||++|+++||||||++|| T Consensus 1 ~~~~~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~~~~--~~~~~~dstg~~a~~~LAa~l~~~lt 78 (515) T protein:vir:70 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNE--TSQNGWQGVGAQATNHLANKLAQVLF 78 (515) T ss_pred CcchhhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccccCCCCCcc--cccccccchHHHHHHHHHHHHHHhhc Confidence 655 78999999999999999999999999999999999999998776543 45689999999999999999999999 Q ss_pred C-CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceee Q lcl|NC_011045. 
79 P-MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNP 157 (536) Q Consensus 79 P-~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~ 157 (536) | ++|||||+++|..++.+.......+++++||+.||++++.+|++||||.++|++|+||++|||||+|+++++ + T Consensus 79 pp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~-----~ 153 (515) T protein:vir:70 79 PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG-----A 153 (515) T ss_pred CCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEEeCCC-----C Confidence 7 569999999999888887777888999999999999999999999999999999999999999999997654 2 Q ss_pred EEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhc--cccCCCCceEEEEEEEEecCCCCceeEEEEecCc Q lcl|NC_011045. 158 MKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQ--GGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGM 235 (536) Q Consensus 158 ~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~--~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~ 235 (536) |++|||++|||.+|++|+||+||||++||+++|+++|+....+. ..+++++++|+|||+|+|+++ +.|++|++++|+ T Consensus 154 ~~~~pl~~y~v~~d~~G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~-~~~~~~~e~d~~ 232 (515) T protein:vir:70 154 MSAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGE-GFWKINQSADDI 232 (515) T ss_pred eEEEEcCeEEEeeCCCcCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecCC-CceEEEEecCce Confidence 88999999999999999999999999999999999999876543 335678999999999998864 579999999998 Q ss_pred cccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCC Q lcl|NC_011045. 
236 EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQ 315 (536) Q Consensus 236 ~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~ 315 (536) .+ +.+|.|+|++|||+++||++.+||+|||||++++|||+|+||.|+++.+++++++++|||+|+|+|++++.++.+++ T Consensus 233 ~~-~~es~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~l~~~~ 311 (515) T protein:vir:70 233 PV-GKESRIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDHFVNSG 311 (515) T ss_pred ee-ccccccccccCCceeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhhccccC Confidence 76 56678889999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHH Q lcl|NC_011045. 316 TGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQE 395 (536) Q Consensus 316 ~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E 395 (536) +|++++|.++++.++++++++||+.++..|++++++|+++||++.+.+++++|||||||++|++|++++|||||+||++| T Consensus 312 ~g~iv~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~srL~~E 391 (515) T protein:vir:70 312 TGEVITGVAEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMT 391 (515) T ss_pred CceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHh---hcchhhhhcCCHHHHHHHHHH Q lcl|NC_011045. 396 LQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAA---LAPMRDDPDINLAMIKLRIAN 472 (536) Q Consensus 396 ~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~---~~p~~~~~~id~d~~~~~~a~ 472 (536) ||.||+.|++ .+.+|++|.++++++|+|+|++++|++++++|.+|++.++. 
+.|+.+ ++||+|++++++++ T Consensus 392 ll~Pli~r~~-----~~~~p~~P~~~v~~~~vs~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~-~~id~d~~~~~~a~ 465 (515) T protein:vir:70 392 MQTPIAMWGL-----QEAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQ-RAIRWGDYMDWVRG 465 (515) T ss_pred HHHHHHHHHH-----HhhCCCCChhhcccceehhHHHHHHHHHHHHHHHHHHHHHHHhccChhHH-hhCCHHHHHHHHHH Confidence 9999999864 57889999999999999999999999999999998888763 444444 57999999999999 Q ss_pred HcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhh Q lcl|NC_011045. 473 AIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAA 526 (536) Q Consensus 473 ~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 526 (536) .+|+ |..++||+|||+++|+|+++++|+++.+++.++. ..+...+.|.++ T Consensus 466 ~~g~-p~~~~rs~eev~~~r~q~~~~~~~~~~~~~~~~a---~~~~~~~~~~~~ 515 (515) T protein:vir:70 466 QISA-ELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKA---VPGVIQQEMKEG 515 (515) T ss_pred HhCC-CccccCCHHHHHHHHHHHHHHHHHHHHHHhhhhh---cccchhhhhccC Confidence 9998 8889999999999998887766654433322211 112222344444 No 22 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=100.00 E-value=1.9e-157 Score=879.77 Aligned_cols=507 Identities=28% Similarity=0.418 Sum_probs=455.6 Q ss_pred CCC---ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_011045. 1 MAE---KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLAL 77 (536) Q Consensus 1 Ma~---~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536) |.+ .+++.++++|++||+.|+++|++||++|+||++||+|++++++++.. 
+.+++|||||++|+++||||||++| T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~--~~~~~~dstg~~a~~~LAa~l~~~l 78 (516) T protein:vir:10 1 MKQSTDLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGDNE--TSQNGWQGVGAQATNHLANKLAQVL 78 (516) T ss_pred CCchhhHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccCCCCCcc--cccccccchHHHHHHHHHHHHHhhh Confidence 665 68899999999999999999999999999999999999998876543 4568999999999999999999999 Q ss_pred cC-CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCcee Q lcl|NC_011045. 78 FP-MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYN 156 (536) Q Consensus 78 tP-~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~ 156 (536) || ++|||||+++|..++.+.......+++++||+.||++++.+|++||||.++|++|+||++|||||+|+++++ T Consensus 79 tpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~----- 153 (516) T protein:vir:10 79 FPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSKG----- 153 (516) T ss_pred cCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCCC----- Confidence 97 569999999999888877777788899999999999999999999999999999999999999999997654 Q ss_pred eEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhh--ccccCCCCceEEEEEEEEecCCCCceeEEEEecC Q lcl|NC_011045. 157 PMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEG--QGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEG 234 (536) Q Consensus 157 ~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~--~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g 234 (536) +|++|||++|||.+|++|+|+++|||+++++++|+++|+..... 
...+++++++|+|||||++++ ++.|.+|++++| T Consensus 154 ~~~~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~~~-~~~~~~~~~~d~ 232 (516) T protein:vir:10 154 AISAIPMHHYVVNRDTNGDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKYLG-EGFWELKQSADD 232 (516) T ss_pred CeEEEEcCeEEEeeCCCCCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEecC-CCceEEEEeeCc Confidence 28899999999999999999999999999999999999875432 234567899999999999875 468999999999 Q ss_pred ccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccC Q lcl|NC_011045. 235 MEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKA 314 (536) Q Consensus 235 ~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~ 314 (536) +.+ +.+|.|+|++|||+++||++.+||+||||||+++|||+|+||.|+++++++++++++|||+|+|+|++++.++.++ T Consensus 233 ~~~-~~~s~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~~ 311 (516) T protein:vir:10 233 IPV-GKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVNS 311 (516) T ss_pred eee-ccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhhhccC Confidence 977 5677888999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHH Q lcl|NC_011045. 
315 QTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQ 394 (536) Q Consensus 315 ~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~ 394 (536) ++|++++|.++++.++++++++||+.++..|++++++|+++||++.+.++++++||||||++|++|++++|||||+||++ T Consensus 312 ~~g~~~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ 391 (516) T protein:vir:10 312 GTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAT 391 (516) T ss_pred CCceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhc---chhhhhcCCHHHHHHHHH Q lcl|NC_011045. 395 ELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALA---PMRDDPDINLAMIKLRIA 471 (536) Q Consensus 395 E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~---p~~~~~~id~d~~~~~~a 471 (536) |||.|+|+|++. +.+|++|+++++++|+++|++++|++++++|.+|++.++++. |+++ ++||+|+++++++ T Consensus 392 Ell~Pli~r~~~-----~~~p~~P~~lv~~~~v~~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~-d~id~d~~~~~~a 465 (516) T protein:vir:10 392 TMQSPVAMWGLL-----EAGDSFTSDLVDPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVL-AAVKWPDYMDWVR 465 (516) T ss_pred HHHHHHHHHHHH-----hhCCCCChhhcCcceehhHHHHHHHHHHHHHHHHHHHHHHHhcCChHHH-hhcCHHHHHHHHH Confidence 999999999975 557999999999999999999999999999999988887643 4444 5799999999999 Q ss_pred HHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhh Q lcl|NC_011045. 
472 NAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAA 526 (536) Q Consensus 472 ~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 526 (536) +++|| |.+++||+|||+++|+|+++++|.++++.+.+++ +.+.-.+.+.++ T Consensus 466 ~~~gv-p~~~irs~eev~~~r~~~~~~q~~~~~~~~~~~~---~~~~~~~~~~~~ 516 (516) T protein:vir:10 466 GQISA-ELPFLKSAEEMEQEQEAQMQAQQAQMLEEGVAKA---VPGVIQQELKEA 516 (516) T ss_pred HHhCC-ChhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhc---ccchhhhhhhcC Confidence 99999 6789999999999999998777755543333221 122222334443 No 23 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=100.00 E-value=1.9e-155 Score=868.85 Aligned_cols=516 Identities=16% Similarity=0.164 Sum_probs=436.7 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCC---CcccccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDS---DNASTDYVTPWQAVGARGLNNLASKLMLAL 77 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~---~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536) |+++. ++++++||+.|+++|++||++|+||++||+|+++++.+ +.++++..++|||||++|+++||||||++| T Consensus 1 m~~~~----~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~l 76 (556) T protein:vir:73 1 MAETE----KERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMSGI 76 (556) T ss_pred CChhh----HHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcCccccchHHHHHHHHHHHHHHhh Confidence 99965 88999999999999999999999999999999876543 455567889999999999999999999999 Q ss_pred cC-CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCcee Q lcl|NC_011045. 
78 FP-MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYN 156 (536) Q Consensus 78 tP-~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~ 156 (536) || ++|||+|.+.|+.+.+ ..++++||++||++++++|++||||.++|++|+||++||||++|++++.+ +++ T Consensus 77 tpp~~~WF~l~~~d~~~~~-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~-~~~ 148 (556) T protein:vir:73 77 TSPARPWFKLATPDPDMMD-------YGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDDQ-DVI 148 (556) T ss_pred cCCCCcccccccCcccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecCC-ceE Confidence 97 5699999999886655 35799999999999999999999999999999999999999999998766 568 Q ss_pred eEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHh-----hhccccCCCCceEEEEEEEEecCC------CCc Q lcl|NC_011045. 157 PMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAV-----EGQGGEKKADETIDVYTHIYLDED------SGE 225 (536) Q Consensus 157 ~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~-----~~~~~~~~~~~~~~v~~~v~p~~~------~~~ 225 (536) +|++|||++|||.+|++|+||+|||||+||+++|+++|+.+. ++.+.+++++++|+|+|+|+||.+ +.+ T Consensus 149 r~~~~~l~~~~~~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~~ 228 (556) T protein:vir:73 149 RTMPFPIGSYYLANSPRGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHCITPNVNRDSGKMDSK 228 (556) T ss_pred EEEEeecceeEEeeCCCCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEEEeccccccccccCcc Confidence 999999999999999999999999999999999999998653 334445556789999999999765 345 Q ss_pred eeEEEEe------cCccccccccccccccCceEEEeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhCCce Q lcl|NC_011045. 
226 YIRYEEV------EGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRS-YIEEYLGDLRSLENLQEAIVKMSMISSKVIG 298 (536) Q Consensus 226 ~~~~~~v------~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrg-p~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~ 298 (536) +|||.|+ +++.++.++| |++|||+++||++.+||+|||| |++++|||+|+||.++++++++++++++||| T Consensus 229 ~~p~~s~~~~~~~~~~~vl~esg---~~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~ 305 (556) T protein:vir:73 229 NKPYRSVYFESGGDSDKLLRESG---FDEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPM 305 (556) T ss_pred cceEEEEEEEecCCCceecccCC---cccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 6766655 4556655544 4799999999999999999999 8999999999999999999999999999999 Q ss_pred eeccccccchhhhccCCCcc-eecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhh---cccCCCCCCCHHHH Q lcl|NC_011045. 299 LVNPAGITQPRRLTKAQTGD-FVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS---AVQRTGERVTAEEI 374 (536) Q Consensus 299 lv~~~g~~~~~~~~~~~~g~-~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~---~~~~~~~r~TAtEi 374 (536) ++++++...+.++.|++..+ ..++..+++.++..++ ++++.+.+.|++++++|+++||.|. +.++++++|||||| T Consensus 306 ~v~~~~~~~~~~~~pgg~~~~~~~~~~~~i~p~~~~~-~d~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r~TAtEv 384 (556) T protein:vir:73 306 VAPTSLKNQRVSLLPGDVTYLDVISGQDGFKPAYLVN-PNTADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAV 384 (556) T ss_pred eccccccccceeeccCccccccCCCCccceeeecccc-ccHHHHHHHHHHHHHHHHHHhhcchhhhhccCCCCCccHHHH Confidence 99999988777887766333 3455566666665443 6899999999999999999999873 56689999999999 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCc----ceEEEEechHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 
375 RYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKE----AVEPTISTGLEAIGRGQDLDKLERCVAAWA 450 (536) Q Consensus 375 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~----~v~v~~vs~La~a~r~~~~~~l~~~~~~~~ 450 (536) ++|++|++++|||||+||++|||.|+|+|+|++|+|.|+||++|++ +|+|+|+|||+++||..++++|.+|++.++ T Consensus 385 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aqk~~~~~~i~~~~~~~~ 464 (556) T protein:vir:73 385 IEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRIEYISVMAQAQKSIGLTSLSQTVGFIG 464 (556) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeecHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999998865 589999999999999999988888777654 Q ss_pred ---hhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHH---HHHH--HHh-hhcCcc Q lcl|NC_011045. 451 ---ALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAAL---AQGM--AAQ-ATASPE 521 (536) Q Consensus 451 ---~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~---~~~~--~~~-~~~~~~ 521 (536) |++|++++ +||+|++++++++++|| |.+++||++||+++|+|+++|+|.++++++. ++++ +++ .+.++. T Consensus 465 ~laq~~Pe~~d-~id~d~~~~~~a~~~Gv-p~~~irs~eev~~~rq~r~~~qq~~~~~~~~~~a~~~~~~~~~~~~~~~~ 542 (556) T protein:vir:73 465 QLAQFKPEALD-KLDVDQAIDAFSEMSGV-SPTVIVPQEQVQGIREERAKQAQAAQAMAMGQAAAQGAKTLSETQTSDPS 542 (556) T ss_pred HHhccChhhHh-cCCHHHHHHHHHHHcCC-ChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCHH Confidence 56788765 69999999999999999 5689999999999998876665554433322 2111 112 233455 Q ss_pred hHHhhhh-cCCCCC Q lcl|NC_011045. 522 AMAAAAD-SVGLQP 534 (536) Q Consensus 522 ~~~~~~~-~~~~q~ 534 (536) +++.... ++++|. 
T Consensus 543 ~l~~~~~~~g~~~~ 556 (556) T protein:vir:73 543 ALTAIANAAGAPQQ 556 (556) T ss_pred HHHHHHHhhcCCCC Confidence 5555444 444455 No 24 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=100.00 E-value=1.5e-154 Score=863.85 Aligned_cols=516 Identities=15% Similarity=0.154 Sum_probs=433.0 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCC---CcccccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDS---DNASTDYVTPWQAVGARGLNNLASKLMLAL 77 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~---~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536) |+++. ++++++||+.|+++|++||++|+||++||+|+++++.+ +.++++..++|||||++|+++||||||++| T Consensus 1 m~~~~----~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~l 76 (559) T protein:vir:95 1 MAETT----KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGI 76 (559) T ss_pred CChhh----HHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcccccccccccchHHHHHHHHHHHHHHhh Confidence 99977 89999999999999999999999999999999977543 455677889999999999999999999999 Q ss_pred cC-CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCcee Q lcl|NC_011045. 
78 FP-MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYN 156 (536) Q Consensus 78 tP-~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~ 156 (536) || ++|||||++.|+.+.+ ..++++||++||++++++|++||||.++|++|+||++|||||+|++++.++ ++ T Consensus 77 tpp~~~WF~l~~~d~~~~e-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~~-~~ 148 (559) T protein:vir:95 77 TSPARPWFRLATPDPEMMD-------YGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDDDED-II 148 (559) T ss_pred cCCCCcccccccCCccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecCCCc-ee Confidence 97 5699999999876554 357999999999999999999999999999999999999999999987764 68 Q ss_pred eEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhh-----ccccCCCCceEEEEEEEEecCC------CCc Q lcl|NC_011045. 157 PMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEG-----QGGEKKADETIDVYTHIYLDED------SGE 225 (536) Q Consensus 157 ~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~-----~~~~~~~~~~~~v~~~v~p~~~------~~~ 225 (536) +|++|||++|||.+|++|+||+|||||+||+++|+++|+.+..+ ...++.++++|+|+|+|+||.+ +.+ T Consensus 149 r~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~ 228 (559) T protein:vir:95 149 RTMPFPIGSYYLANSPRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSK 228 (559) T ss_pred EEEEeecCeEEEeeCCCCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEEEeccccccccccccc Confidence 99999999999999999999999999999999999999865433 3344555678999999999765 334 Q ss_pred eeEEEEe------cCccccccccccccccCceEEEeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHhCCce Q lcl|NC_011045. 226 YIRYEEV------EGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRS-YIEEYLGDLRSLENLQEAIVKMSMISSKVIG 298 (536) Q Consensus 226 ~~~~~~v------~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrg-p~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~ 298 (536) +++|.|+ +++.++.+ |. 
|++|||+++||++.+||+|||| |++++|||+|+||.|+++++++++++++||| T Consensus 229 ~~pf~s~~~e~~~~~~~~l~e-sg--~~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~ 305 (559) T protein:vir:95 229 NKPFKSVYYEVGGDNDKLLRE-SG--FDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPM 305 (559) T ss_pred cceEEEEEEEecCCCceeeec-CC--cccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCce Confidence 5666554 33455444 43 4799999999999999999999 8999999999999999999999999999999 Q ss_pred eeccccccchhhhccCCCcceecCCc-ccccccccccccchhHHHHHHHHHHHHHHHHHhhhh---cccCCCCCCCHHHH Q lcl|NC_011045. 299 LVNPAGITQPRRLTKAQTGDFVTGRP-EDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS---AVQRTGERVTAEEI 374 (536) Q Consensus 299 lv~~~g~~~~~~~~~~~~g~~~~g~~-~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~---~~~~~~~r~TAtEi 374 (536) ++++++.+++.++.|++.+++..+.. +.+.+.... ..+++.+...|++++++|+++||.|. +.++++++|||||| T Consensus 306 ~v~~~~~~~~~~l~pgg~~~~~~~~~~~~i~p~~~~-~~~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV 384 (559) T protein:vir:95 306 VAPTSLKNQRASLLPGDITYIDQITGQDGFRPAYLV-NPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAV 384 (559) T ss_pred eccccccccceeeeccceeeeCCCCCcccceeeccc-ccchHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHHH Confidence 99999999999988887776544433 334444333 35788889999999999999999874 66799999999999 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCc----ceEEEEechHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_011045. 
375 RYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKE----AVEPTISTGLEAIGRGQDLDKLERCVAAW- 449 (536) Q Consensus 375 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~----~v~v~~vs~La~a~r~~~~~~l~~~~~~~- 449 (536) ++|++|++++|||||+||++|||.|+|+|+|++|++.|+||++|++ +++|+|+|||+++||..++++|.+|++.+ T Consensus 385 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~~~ 464 (559) T protein:vir:95 385 IEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIG 464 (559) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceEEEeecHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999998865 58999999999999999988888877665 Q ss_pred --HhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HH---hhh-cCcc Q lcl|NC_011045. 450 --AALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGM--AA---QAT-ASPE 521 (536) Q Consensus 450 --~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~--~~---~~~-~~~~ 521 (536) +|++|++++ +||+|++++++++++|| |..++||++||+++|+|+++|+|+++++++..+.+ ++ ++. .++. T Consensus 465 ~laq~~Pevld-~id~d~~~~~~a~~~Gv-p~~~irs~~ev~~~rqqr~~~qq~~q~~~~~~~aa~~~~~~~~~~~~~~~ 542 (559) T protein:vir:95 465 QLAQVKPEALD-KLNVDQAIDAFADMSGV-SPTVIVPQEQVEQARQQRAQQQQQQQMMAMGMAAAQGVKTLSEAKTSDPS 542 (559) T ss_pred HHhccChhhhh-cCCHHHHHHHHHHHhCC-chhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCChh Confidence 456788765 69999999999999999 56899999999999998877766555444322211 11 222 2334 Q ss_pred hHHhhh----hcCCCCC Q lcl|NC_011045. 522 AMAAAA----DSVGLQP 534 (536) Q Consensus 522 ~~~~~~----~~~~~q~ 534 (536) +++... ..++-|. 
T Consensus 543 ~l~~~~~~~~~~~~~~~ 559 (559) T protein:vir:95 543 VLSAMANAVSGQGGQSQ 559 (559) T ss_pred HHHHHHHhhcCccccCC Confidence 443332 2233233 No 25 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=100.00 E-value=1.1e-154 Score=864.63 Aligned_cols=505 Identities=15% Similarity=0.159 Sum_probs=431.3 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCC------cccccccccccchHHHHHHHHHHHHHHhhcC-C Q lcl|NC_011045. 8 LAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSD------NASTDYVTPWQAVGARGLNNLASKLMLALFP-M 80 (536) Q Consensus 8 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~------~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP-~ 80 (536) |++++|++||+.|+++|++||++|+||++||+|+++++.++ ...++..++|||||++|+++||||||++||| + T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~ltPp~ 80 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLTSPA 80 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 99999999999999999999999999999999998764432 2235688999999999999999999999997 5 Q ss_pred CcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCC-CCceeeEE Q lcl|NC_011045. 81 QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPE-GSNYNPMK 159 (536) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~-~~~~~~~~ 159 (536) +|||||++.|.++.+ .+++++||++||++|+++|++||||.++|++|+||++||||++|++++. 
..++++|+ T Consensus 81 ~~WF~l~~~d~~~~~-------~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~ 153 (547) T protein:vir:10 81 TKWFELAFRDKELNS-------DDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQ 153 (547) T ss_pred CcccccccCCccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEE Confidence 699999999876654 3579999999999999999999999999999999999999999998764 34678999 Q ss_pred EEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhcc----ccCCC---CceEEEEEEEEecCCCC-------- Q lcl|NC_011045. 160 LYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQG----GEKKA---DETIDVYTHIYLDEDSG-------- 224 (536) Q Consensus 160 ~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~----~~~~~---~~~~~v~~~v~p~~~~~-------- 224 (536) +|||++|||.+|++|+||+|||||+||+++|.++|+.+.++.. .++++ ..++++||+|+|+.+.. T Consensus 154 ~~pl~~~~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~~~ 233 (547) T protein:vir:10 154 SSPIQDSYFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAGTV 233 (547) T ss_pred EeecceEEEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCccccce Confidence 9999999999999999999999999999999999987644321 22333 34899999999986521 Q ss_pred ---cee----EEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_011045. 225 ---EYI----RYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVI 297 (536) Q Consensus 225 ---~~~----~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~ 297 (536) +++ +|++++|..+++.+|. 
|++|||+++||++.+||+|||||++++|||+|+||.|+++++++++++++|| T Consensus 234 ~~~~~~p~~s~~~e~~~~~~~l~esg--~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp 311 (547) T protein:vir:10 234 LAPTERPFGKKWILKEGAVQLGEEGG--YYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPA 311 (547) T ss_pred eeccccceeEEEEEecCceeeeecCC--cccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Confidence 233 4556676544455554 3799999999999999999999999999999999999999999999999999 Q ss_pred eeeccccccchhhhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHH Q lcl|NC_011045. 298 GLVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYV 377 (536) Q Consensus 298 ~lv~~~g~~~~~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r 377 (536) |+|+++|++++.++.++ |.++.+..+++++ ++++++|+.+++.|++++++|+++||.+.+.++++++||||||++| T Consensus 312 ~~v~~~g~~~~~~~~pg--g~~~~~~~~~v~p--l~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~TAtEV~~r 387 (547) T protein:vir:10 312 IMVTERGLISDIDLGAS--GLTVVRDMESMKP--FESRARFDVSSIQLTDLRSAVRRIYYVDQLQMKDSPAMTATEVQVR 387 (547) T ss_pred eecccccccccceecCC--eeeecCCccccee--eecccchHHHHHHHHHHHHHHHHHhhhhhhhcCCCccccHHHHHHH Confidence 99999999999776543 4555677777765 5677899999999999999999999999988999999999999999 Q ss_pred HHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCc-------ceEEEEechHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 
378 ASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKE-------AVEPTISTGLEAIGRGQDLDKLERCVAAWA 450 (536) Q Consensus 378 ~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~-------~v~v~~vs~La~a~r~~~~~~l~~~~~~~~ 450 (536) ++|++++|||+|+||++|||.|+|+|+|++|++.|.||++|.+ +++|+|+|+|+++||..+++++.+|++.++ T Consensus 388 ~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v~ 467 (547) T protein:vir:10 388 YELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGSTA 467 (547) T ss_pred HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999998754 478999999999999999999999999876 Q ss_pred h---hcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHhhhcCcchHHh Q lcl|NC_011045. 451 A---LAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAAL--AQGMAAQATASPEAMAA 525 (536) Q Consensus 451 ~---~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~--~~~~~~~~~~~~~~~~~ 525 (536) + +.|++++ +||+|++++++++++|| |..++||++||+++|+|+++++|+++|++.+ ++.+++..+.+..+..+ T Consensus 468 ~laq~~P~vld-~id~d~~~~~~a~~~Gv-p~~~irs~eev~~~r~qr~~~~q~~~qaa~~~~~g~~m~~~~~~~a~~~~ 545 (547) T protein:vir:10 468 QLAEINPEVLD-IPDWDEMVRMLGSLLGA-PQTLMRPKAKVTSIRKNRSQTQQKAEQAAIAEAEGNAMEAQGKGQAALKE 545 (547) T ss_pred HhhccChhhhh-cCCHHHHHHHHHHHhCC-ChhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccchhc Confidence 5 4577664 69999999999999999 6789999999999999988877766655532 22222222222222233 Q ss_pred hh Q lcl|NC_011045. 526 AA 527 (536) Q Consensus 526 ~~ 527 (536) .. 
T Consensus 546 ~~ 547 (547) T protein:vir:10 546 NQ 547 (547) T ss_pred cC Confidence 21 No 26 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=100.00 E-value=9.1e-90 Score=508.73 Aligned_cols=518 Identities=15% Similarity=0.127 Sum_probs=373.8 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcc----------cccCCCCCcccccccccccchHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIP----------SLFPKDSDNASTDYVTPWQAVGARGLNNLA 70 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P----------~~~~~~~~~~~~~~~~~~dst~~~a~~~La 70 (536) |+++.. ...+.+||+.+++.|++||.+|+||++|+.+ ..+...++.......+++++++..++++|+ T Consensus 20 ~~~~~~---~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ki~~~~~~~~~~~l~ 96 (641) T protein:vir:94 20 LSTDRI---GGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRHRINTGHTFEVVETLV 96 (641) T ss_pred CCchhH---HHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcccccccchhHHHHHHHHh Confidence 555442 5678999999999999999999999977654 333444444444456899999999999999 Q ss_pred HHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecC Q lcl|NC_011045. 71 SKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEP 150 (536) Q Consensus 71 a~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~ 150 (536) ++||+++||+++||+|.+.+++..+ . .+ .++..+...+++++|+..+++.+.|++.+|||++.+... 
T Consensus 97 s~Lm~~~~p~~~wf~~~p~~~ed~~-------~--A~----~~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv~~~w~ 163 (641) T protein:vir:94 97 AYFKGATFPSDDWFDLKGMVPELAD-------A--AR----VVKQLTKTKLEAASIRDIFETYVRNLVLYGVSTYRLGWD 163 (641) T ss_pred hHHhhhhcCCCceEEEecCCCChHH-------H--HH----HHHHHHHHHHhhcchHHHHHHHHHHHhhcCceEEEeehh Confidence 9999999999999999887665322 1 11 123455567889999999999999999999998865421 Q ss_pred C------------CC-------------ceeeEEEEecceEEEeeCCCCCeE----EEEEeEeccHHHHHHH--HhHH-h Q lcl|NC_011045. 151 E------------GS-------------NYNPMKLYRLSSYVVQRDAFGNVL----QMVTRDQIAFGALPED--IRKA-V 198 (536) Q Consensus 151 ~------------~~-------------~~~~~~~~~l~~~~v~~d~~G~v~----~i~r~~~~t~~~l~~~--~~~~-~ 198 (536) . .+ ....+++.||..+.|-.|+.++++ ++||++++|+.+|..+ ++.+ + T Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~dps~~~~~~~f~~~r~t~~t~~~l~~eg~~~~d~v 243 (641) T protein:vir:94 164 TSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWLDTSGGKNTGTFVRLRHTREELHELVTSGYYDLDLT 243 (641) T ss_pred hHHHHhhhhhcccchhhcccccccceecccceeeEEecchhheeecCCCCcccccceehhhhHHHHHHHHhcCCCChhhc Confidence 0 00 011235566666666666666665 5678888888888755 3221 1 Q ss_pred ----hhccccCCCCc----------eEEEEEEEE-ecCCCC-ceeEEEEecCccccccccccccccCceEEEeeeecCCC Q lcl|NC_011045. 199 ----EGQGGEKKADE----------TIDVYTHIY-LDEDSG-EYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGE 262 (536) Q Consensus 199 ----~~~~~~~~~~~----------~~~v~~~v~-p~~~~~-~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge 262 (536) ...+...+++. ..++|++.. ...++. 
.|.+|..++|+.+++.+++..|+++||+++||.+.+++ T Consensus 244 ~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~Pf~~~r~~~~~~~ 323 (641) T protein:vir:94 244 QVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVTTTLLPDRDS 323 (641) T ss_pred chhhcccccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeCCEEeecccccccCcCCeEEecceecCCc Confidence 11111111221 112332211 112233 35567788999999988888789999999999999999 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCcceecCCcccccccccccccchhHHH Q lcl|NC_011045. 263 SYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAK 342 (536) Q Consensus 263 ~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~ 342 (536) +||+||+++++||+++||.+++.+++++.++++|+|+++++|++++.++..++.|.+..+..+++.++..+ ..+|+..+ T Consensus 324 ~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l~~~PG~ii~~~~~~~v~pl~~~-~~~~~~~~ 402 (641) T protein:vir:94 324 VYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDVKAKPGAVFKVAQHGSLQPIDMG-RQDFVVTY 402 (641) T ss_pred ccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeeccccccccceeeccCCcceeeCCCCcceeecCC-ccccchhH Confidence 99999999999999999999999999999999999999999999999876544445556667777766433 35899999 Q ss_pred HHHHHHHHHHHHHHhhhhc----ccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCC----- Q lcl|NC_011045. 343 AVSDAIEARLSFAFMLNSA----VQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQ----- 413 (536) Q Consensus 343 ~~i~~~~~rI~~af~~~~~----~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~----- 413 (536) ..++.++.+|+++|+.+.+ ..++++++|||||+++.+|+...||+++++|+.||+.||+.|+++++.+.+. 
T Consensus 403 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~ 482 (641) T protein:vir:94 403 QEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETI 482 (641) T ss_pred HHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhh Confidence 9999999999999986643 2367778999999999999999999999999999999999999999988532 Q ss_pred -----------CCCCCCcceEEEE-echHHHHHHHHHHHHHHHHHHHHHhhc--chhhhhcCCHHHHHHHHHHHcCCC-h Q lcl|NC_011045. 414 -----------IPELPKEAVEPTI-STGLEAIGRGQDLDKLERCVAAWAALA--PMRDDPDINLAMIKLRIANAIGID-T 478 (536) Q Consensus 414 -----------lp~~~~~~v~v~~-vs~La~a~r~~~~~~l~~~~~~~~~~~--p~~~~~~id~d~~~~~~a~~~Gv~-p 478 (536) ++++|.++++.++ +++|+++++...++++.++++++..++ |..++ ++|+|.+++.+++..|++ | T Consensus 483 R~~~~~~~~~~~~~~~p~~L~~~~~iv~l~~~q~~~~~~~i~~l~~~~~~~a~~P~v~d-~~d~~~~~~~~~~~~g~~~p 561 (641) T protein:vir:94 483 RMYVPEEQMDGFFEVSPEYLHYPYKFLALGANYVVERERMVTDLLQLLDISGRVPQIGQ-SLDYALILEDLLRQMRFTDP 561 (641) T ss_pred hhhchhhhcccCCCCCccceeeeeeEeecchhHHHHHHHHHHHHHHHHHHhhcChhhhh-cCCHHHHHHHHHHHhCCCCc Confidence 3334445555544 368999888877777777666654443 66555 699999999999875532 8 Q ss_pred hhccCCHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhhcC--cchHHhhhhcCCCCCCC Q lcl|NC_011045. 
479 SGILLTEEQKQQKMAQQSM--QMGMDNGAAALAQGMAAQATAS--PEAMAAAADSVGLQPGI 536 (536) Q Consensus 479 ~~i~rs~~ev~~~~~q~~~--q~~~~~~a~~~~~~~~~~~~~~--~~~~~~~~~~~~~q~~~ 536 (536) ..++|++|...+.++++++ |+++..++.+.++....++..+ ++.++...+..|+|++= T Consensus 562 ~~~ir~~~~~~~~~~~~~~~~q~~~~~~a~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 623 (641) T protein:vir:94 562 MRYIKKAEAPPAAPPIAPAEPGALPPEMMNSVGGGLNDQAIAGMTPEDVSDLASRIGIDTSD 623 (641) T ss_pred hhhccCccCchhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHHHHHHhhcCCchh Confidence 8899998754443332222 2222333333344343344333 66666666666666665 No 27 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=100.00 E-value=2.3e-73 Score=418.84 Aligned_cols=521 Identities=15% Similarity=0.159 Sum_probs=366.8 Q ss_pred CCC-----cccccc-HH----HHHHHHHHHHHHhhhHHHHHHHHHHHhcccc-------cCCC---CCcccccccccccc Q lcl|NC_011045. 1 MAE-----KRTGLA-EE----GAKSVYERLKNDRAPYETRAQNCAQYTIPSL-------FPKD---SDNASTDYVTPWQA 60 (536) Q Consensus 1 Ma~-----~~~~~~-~~----~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~-------~~~~---~~~~~~~~~~~~ds 60 (536) ||. +++++. .+ .+.++|+++++.|+.|+++|.+|+++..+.. .... ..+......+++.+ T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~rs~~~~~ 82 (651) T protein:vir:80 3 LATTTTDKNRQTYDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRHKITTG 82 (651) T ss_pred ccccccchhhhhhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCCCCCCccccCh Confidence 443 222222 22 3789999999999999999999998887741 1111 22222345678999 Q ss_pred hHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhh Q lcl|NC_011045. 61 VGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVA 140 (536) Q Consensus 61 t~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~ 140 (536) +...+++++.+.|+..+||+..||++.+..++. 
..+++-+-|+..+...+++++|+..++.+++|.+++ T Consensus 83 ~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~d-----------~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~ 151 (651) T protein:vir:80 83 KAFEAIETIHAYLMSATFPNKNWFDVVPAKPGQ-----------DNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLIT 151 (651) T ss_pred hHHHHHHHHHHHHHHhhcCCCceeEeccCCchh-----------HHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhccc Confidence 999999999999999999999999998854321 134455667778888889999999999999999999 Q ss_pred CcEEEEEecCCC------------------------------CceeeEEEEecceEEEeeCCCCCeEEEE-EeEeccHHH Q lcl|NC_011045. 141 GNVLLYLPEPEG------------------------------SNYNPMKLYRLSSYVVQRDAFGNVLQMV-TRDQIAFGA 189 (536) Q Consensus 141 G~~~l~~~~~~~------------------------------~~~~~~~~~~l~~~~v~~d~~G~v~~i~-r~~~~t~~~ 189 (536) |||++.+..+.. ++..+++.+|+.+|+|..++.+--|+-| .+..+|..+ T Consensus 152 G~~i~kv~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~d~~~v~~~~~t~~~ 231 (651) T protein:vir:80 152 GNSVLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFIRKLTKTKAD 231 (651) T ss_pred CceEEEEeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeeecCCCcCccccceeeeeeeeHHH Confidence 999885533211 1235789999999999999877555533 344566666 Q ss_pred HHHHHhH----------Hhhhcc-------------------ccCCCCceEEEEEEEEe-cCCCCcee-EEEEecCcccc Q lcl|NC_011045. 190 LPEDIRK----------AVEGQG-------------------GEKKADETIDVYTHIYL-DEDSGEYI-RYEEVEGMEVQ 238 (536) Q Consensus 190 l~~~~~~----------~~~~~~-------------------~~~~~~~~~~v~~~v~p-~~~~~~~~-~~~~v~g~~i~ 238 (536) +.+.... .+.+.. ...++...|+||+|..+ +.++..+. 
++....|+.++ T Consensus 232 l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~v~~~g~~il 311 (651) T protein:vir:80 232 ILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIMGNEVL 311 (651) T ss_pred HHHHHhcccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeeccCCceEEEEEEEcCcEEe Confidence 5432210 010000 01134568899988544 33444443 44455678887 Q ss_pred ccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCcc Q lcl|NC_011045. 239 GSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGD 318 (536) Q Consensus 239 ~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~ 318 (536) +.++..+.+++||+++||.+.+|+.||+||++.++|+++.||.+++++++++.++++|+|+|+++|++++.++...+.|. T Consensus 312 ~~~~~~~~~~~Pf~~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l~~~pg~v 391 (651) T protein:vir:80 312 RFEQNPYWCGRPFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDVYTEPGKV 391 (651) T ss_pred cccccCCCCCCCeeeecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHhhcCCCce Confidence 76655556899999999999999999999999999999999999999999999999999999999999999987655566 Q ss_pred eecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcc----cCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHH Q lcl|NC_011045. 319 FVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAV----QRTGERVTAEEIRYVASELEDTLGGVYSILSQ 394 (536) Q Consensus 319 ~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~----~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~ 394 (536) ++.++++++.+++.+ ..+++.++..++.++++|++.|..+... .+..+++|||||+.+++++..+||++|++|+. 
T Consensus 392 i~~~~~~~~~~l~~~-~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~ 470 (651) T protein:vir:80 392 FLVSDHGDLQPLANQ-SSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEE 470 (651) T ss_pred EEecCCCCceeeccC-cccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 678888888776544 3479999999999999999999775533 35567899999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCCCCCC----------------cceEE----EEechHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_011045. 395 ELQLPLVRVLLKQLQATQQIPELPK----------------EAVEP----TISTGLEAIGRGQDLDKLERCVAAWAALAP 454 (536) Q Consensus 395 E~l~Pli~r~~~il~~~g~lp~~~~----------------~~v~v----~~vs~La~a~r~~~~~~l~~~~~~~~~~~p 454 (536) ||+.||+.|+++++++.+..|+++. .++++ ..+++.+.++|...++++.++++.+++..+ T Consensus 471 e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~~~~q~~~~~p~ 550 (651) T protein:vir:80 471 TSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRLTFIQAVAQVPE 550 (651) T ss_pred HHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHHHHHHhhccCCc Confidence 9999999999999999987765432 23433 335666777888888898888887776433 Q ss_pred hhhhhcCCHHHHHHHHHHHcCC-ChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhhhcCcchHHhhhhcCCC Q lcl|NC_011045. 455 MRDDPDINLAMIKLRIANAIGI-DTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMA-AQATASPEAMAAAADSVGL 532 (536) Q Consensus 455 ~~~~~~id~d~~~~~~a~~~Gv-~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 532 (536) +...+|...+++.+++.+|+ +|..++..+++.+....+++.+++.+....+..+.++ .+..+. +......++..- T Consensus 551 --~~~~~~~~~~~~~l~~~~g~~~~~~~l~~~~q~~~~~~~~~~~~q~~~~~~~a~~~~~~~~~~~~-~~~~~~~~~~~~ 627 (651) T protein:vir:80 551 --MGQLVDYKRILVDLLQHWGFEEPEAYLKQQDQQAPANPQEALLSQAKDVGGQAMSNMLQNQLQAD-GGTQMMSEMYGT 627 (651) T ss_pred --cchhhhHHHHHHHHHHHcCCCCcHHhcCCCccchhhhhhHHHHhhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH Confidence 23458999999999999998 5666777765554333222111111111111000000 000000 000000111110 Q ss_pred CCCC Q lcl|NC_011045. 
533 QPGI 536 (536) Q Consensus 533 q~~~ 536 (536) |-+. T Consensus 628 ~~~~ 631 (651) T protein:vir:80 628 PNAD 631 (651) T ss_pred HHHH Confidence 0001 No 28 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=100.00 E-value=1.2e-43 Score=255.91 Aligned_cols=494 Identities=12% Similarity=0.097 Sum_probs=329.7 Q ss_pred CCCcc---cccc-----HHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKR---TGLA-----EEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASK 72 (536) Q Consensus 1 Ma~~~---~~~~-----~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~ 72 (536) |+-+. +-+. +..+.++|+.+.+.|++|+..|.|+++|..-+.-.+.++.......++|-+.....++++.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~~~~~~~~~~r~~~~~~k~~~~~~~i~~~ 80 (584) T protein:vir:95 1 MSVKVAELNSLLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTTTTSNQGLPWKNSTTLPKLCQIRDNLHSN 80 (584) T ss_pred CCcchhhhhhhccccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhhhhhhcccccccccchhHHHHHHHHHHHH Confidence 66422 1111 345689999999999999999999999998877666666666667789999999999999999 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCC Q lcl|NC_011045. 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEG 152 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~ 152 (536) ||+.+||++.||++....++..+ .++ =+.+++.+...|+.+||+.++...++|++++|+|.+-+..... 
T Consensus 81 l~~~~Fp~~~w~~~v~~~~~~~~---------~~~--~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~~ 149 (584) T protein:vir:95 81 YFSSLFPNDDWLRWVGYGKGDST---------KTK--AKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEAK 149 (584) T ss_pred HHHhhcCccceeeeecCCCchhh---------HHH--HHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEeec Confidence 99999999999999998774332 122 3456677778889999999999999999999999875543332 Q ss_pred C------------ceeeEEEEecceEEEeeCCCCCeEEEE--EeEeccHHHHHHHHhHH-------------hhhc---- Q lcl|NC_011045. 153 S------------NYNPMKLYRLSSYVVQRDAFGNVLQMV--TRDQIAFGALPEDIRKA-------------VEGQ---- 201 (536) Q Consensus 153 ~------------~~~~~~~~~l~~~~v~~d~~G~v~~i~--r~~~~t~~~l~~~~~~~-------------~~~~---- 201 (536) . ..-+++-+++.++++..++ +.+++.. +|..+|..+|.+..... ..+. T Consensus 150 ~~e~~e~~~v~~~~~prieriSP~d~~~Dpsa-~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~~ 228 (584) T protein:vir:95 150 YKEMTDGTLVPDYIGPRLVRISPLDIVFNPLA-TSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHLG 228 (584) T ss_pred ceeeeccccccccccceEEeeChhheeecCCC-CCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCCC Confidence 1 1135666777888888888 5666642 47778999997654221 1000 Q ss_pred cccCC-C-------------------CceEEEEEE--EEecC---CCCceeEEEEecCcccccc-ccccccccCceEEEe Q lcl|NC_011045. 202 GGEKK-A-------------------DETIDVYTH--IYLDE---DSGEYIRYEEVEGMEVQGS-DGTYPKEACPYIPIR 255 (536) Q Consensus 202 ~~~~~-~-------------------~~~~~v~~~--v~p~~---~~~~~~~~~~v~g~~i~~~-~~~~~~~~~P~~~~r 255 (536) ..+.+ . ...|++++. .++.+ +...|.....++|..+++. ...++++++||++.. T Consensus 229 ~~~~~~~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iIR~~~np~~~~~~PF~~~~ 308 (584) T protein:vir:95 229 GYSVEDFDKAAGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPTWFGSAPIYHVG 308 (584) T ss_pred CCcccccccccccccccccccccccCCceeEEEeecccccccccCCCcccceEEEEeccEEEEeeecCCCCCCCCEEEEc Confidence 00000 0 123554442 12222 1223444455677777754 456778999999999 Q ss_pred eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCcceecCCcccccccccccc Q lcl|NC_011045. 
256 MVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQ 335 (536) Q Consensus 256 w~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~~~~g~~~~~~~~~~~~~ 335 (536) |.+...+.||.|+.+.++|-++.||.+.+.+++++.++++|++. .++++.+...+..+.|..+.++++++++.. . T Consensus 309 ~~p~~~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k----~~~~~~~~~~~pg~~~~~~~~~~~q~~~p~-a 383 (584) T protein:vir:95 309 WRFRPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQPPLK----IIGEVEEFVWGPGAEIHLDQGGDVQEIAKN-V 383 (584) T ss_pred ceeeeccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCccee----eccccchhcccCCceeecCCCCCcceecCc-h Confidence 99999999999999999999999999999999999999999644 346667766544444667888887766532 2 Q ss_pred cchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcC--- Q lcl|NC_011045. 336 ADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQ--- 412 (536) Q Consensus 336 ~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g--- 412 (536) .++..+...|+-+++...+ +..-....++.+.++++.+--.+.+.+..+.++.+...+|-.|+++|++..|.+.| T Consensus 384 ~~~~s~~~~lq~~e~~me~--~sGvp~~~~G~~~~~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~n 461 (584) T protein:vir:95 384 NYIINADNQIQMLEDRMEL--YAGAPREAMGIRTPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRN 461 (584) T ss_pred hhhhHHHHHHHHHHHHHHh--hhCCChhhcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 3555566666666666554 22222223444555555555556666788899999999999999999999998764 Q ss_pred -CCC----------------CCCCcce----EEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHH Q lcl|NC_011045. 413 -QIP----------------ELPKEAV----EPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIA 471 (536) Q Consensus 413 -~lp----------------~~~~~~v----~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a 471 (536) ..+ ++..+++ ++....--+-+.|.+..+++.+|++. 
++++ .++++++-..+.+.++ T Consensus 462 md~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~--~~~~-~i~p~~~~~~l~~~la 538 (584) T protein:vir:95 462 MDGSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNS--QIGQ-MILPHTSGKALATFVD 538 (584) T ss_pred ccccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHh--hhhh-hccccchHHHHHHHHH Confidence 122 1222333 33333333336678889999999885 5666 4667788888888899 Q ss_pred HHcCCChhhccCCHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhc Q lcl|NC_011045. 472 NAIGIDTSGILLTEEQKQQKMAQQSMQMG-MDNGAAALAQGMAAQATA 518 (536) Q Consensus 472 ~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~-~~~~a~~~~~~~~~~~~~ 518 (536) +..+.+--.+.+.+-.+++. ++.+|+. ++|+...+.+.+.+.++. T Consensus 539 dl~~~p~~~~~~~~~~~~~Q--~~~q~~~~~~q~~~~~~~~~~~~~~~ 584 (584) T protein:vir:95 539 DVTGLQGYEIFRPNVAVAEQ--AETQSLVAQAQEDLQLQAQMPAEGAI 584 (584) T ss_pred HHhCCCcccccCCCcccchh--HHHHhhhHHHHHHHHHHHhhhhccCC Confidence 99998544566554444322 1111110 011111111111111111 No 29 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=100.00 E-value=3e-40 Score=237.27 Aligned_cols=502 Identities=12% Similarity=0.113 Sum_probs=317.1 Q ss_pred CCCcccccc------------HHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLA------------EEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNN 68 (536) Q Consensus 1 Ma~~~~~~~------------~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~ 68 (536) |+-+-..+. 
...++.+|.++.+.|+..+..|.|+++|+.-+--+..++.+.....+++-+.....+.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~~~tr~t~~~~~~w~~s~t~~k~~~~~~~ 80 (599) T protein:vir:31 1 MSTDIKTLQKMLEGRDDDRAFIDELVVLFTNMENARAQKDREDKELMDYIDATDTRKTSNSKLPFKNSTTINKLAHLHLM 80 (599) T ss_pred CccchHHHHHHhhccCchHHHHHHHHHHHHhhhhhhhhhhcccHHHHHHHhhhcccccccCCCCcccccchHHHHHHHHH Confidence 433211111 24578999999999999999999999998665555555555555667888899999999 Q ss_pred HHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEe Q lcl|NC_011045. 69 LASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLP 148 (536) Q Consensus 69 Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~ 148 (536) |++.+|+++||+..||++...+++.. .+..-+.+++.+...|+.|+|+.++...+.|++++|||+.-++ T Consensus 81 l~a~~~~~~fp~~~w~d~~~~~~~~~-----------~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~~ 149 (599) T protein:vir:31 81 ITTSYMEHLLPNRNWVDFVGFDNDSV-----------NAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHTR 149 (599) T ss_pred HHHHHHhhhcCCccceEeeecCCchh-----------HHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEeee Confidence 99999999999999999998876432 2233455677778889999999999999999999999976554 Q ss_pred cCCC----------Ccee--eEEEEecceEEEeeCCCCCeEEEE--EeEeccHHHHHHHHhH---------Hhh---hcc Q lcl|NC_011045. 149 EPEG----------SNYN--PMKLYRLSSYVVQRDAFGNVLQMV--TRDQIAFGALPEDIRK---------AVE---GQG 202 (536) Q Consensus 149 ~~~~----------~~~~--~~~~~~l~~~~v~~d~~G~v~~i~--r~~~~t~~~l~~~~~~---------~~~---~~~ 202 (536) .... +..+ +++.+.+.+++++.++ +.++.++ +|...|..+|...+.+ ++. ... 
T Consensus 150 ~er~~~~~~d~~v~~~~~~P~~ervsP~Di~~Dp~A-~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~~ 228 (599) T protein:vir:31 150 HVKRMTVTAENQVIKNYSGTVTERLSPSDVFWDVTA-DSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLREER 228 (599) T ss_pred EEEcceeecccccccccccceEEeecccceeeCCCC-CCCCcceeeeehhhhHHHHHHHhccCCccccchHHHHHHHhhc Confidence 1110 1111 4455666777777776 5666653 5888888888764432 110 011 Q ss_pred cc------C-------------CCCc---------eEEEEE--EEEecCCCCc-e--eEEEEecCcccccccc-cccccc Q lcl|NC_011045. 203 GE------K-------------KADE---------TIDVYT--HIYLDEDSGE-Y--IRYEEVEGMEVQGSDG-TYPKEA 248 (536) Q Consensus 203 ~~------~-------------~~~~---------~~~v~~--~v~p~~~~~~-~--~~~~~v~g~~i~~~~~-~~~~~~ 248 (536) .+ . +... .|++++ ..+++.+++. | .+...++++.+++.+. .++..+ T Consensus 229 ~~~~~~~~d~~~~~~g~D~~~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g~ 308 (599) T protein:vir:31 229 RTIREALADGYNGRRKFDSLHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDGS 308 (599) T ss_pred cCCCccccchhhhhhhccccccccccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCCC Confidence 10 0 1111 133322 1233443333 2 1333445555655543 344556 Q ss_pred CceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCcc-eecCCcccc Q lcl|NC_011045. 249 CPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGD-FVTGRPEDI 327 (536) Q Consensus 249 ~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~-~~~g~~~~~ 327 (536) .||++..|....++.||.||...++|.+..||.+.+.++++...+++|+ ++-.|.+.+.|+..+ ||. 
|..++.+++ T Consensus 309 ~Pyvv~~~~P~~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p~--l~~~~dl~~eD~~~~-P~~v~~~~d~~~v 385 (599) T protein:vir:31 309 QNLHIAVYEFQKDTLCPIGPLHRLTGMQYKLDKRENFREDLHDRFLHPS--LKKVGDVREKGMRGG-PNHVFEVEETGDV 385 (599) T ss_pred CCeEEEEeeeeccccCCCCCchhcchHHHHHHHHHHHhhhhhhhhhccc--ccccccccccCccCC-CCcceeecCCCcc Confidence 8999999999999999999999999999999999999999999999994 445666888888765 455 445777777 Q ss_pred cccccccccchhHHHHHHHHHHHHHHHHH---hhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHH Q lcl|NC_011045. 328 SFLQLEKQADFTVAKAVSDAIEARLSFAF---MLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVL 404 (536) Q Consensus 328 ~~~~~~~~~~~~~~~~~i~~~~~rI~~af---~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~ 404 (536) ++.+ .+.+..-+...|+..+.+..+.- .. +...+.+..-||+||+...++........+..+..+|+.||++++ T Consensus 386 q~~~--p~s~~~~a~~~is~~e~~mee~sGvp~~-~~G~~~ag~~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l 462 (599) T protein:vir:31 386 QYMT--PPAEVLQPDNQLSITLQLMEDLSGAPKE-SIGQRTAGEKTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDY 462 (599) T ss_pred cccc--CchhhhhHHHHHHHHHHHHHHhhccchh-hcCCcccchhhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHH Confidence 6442 22344445556666666554411 11 111122223499999999999999999999999999999999999 Q ss_pred HHHHHhcCCCCC----------------CCCcceE--EEEechHHH--HHHHHHHHHHHHHHHHHHhhcchhhhhcCCHH Q lcl|NC_011045. 405 LKQLQATQQIPE----------------LPKEAVE--PTISTGLEA--IGRGQDLDKLERCVAAWAALAPMRDDPDINLA 464 (536) Q Consensus 405 ~~il~~~g~lp~----------------~~~~~v~--v~~vs~La~--a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d 464 (536) +...++.-.=|+ +..++++ .+.++.-+. +.|.+.++++.+|++ +++++. +++++.-. 
T Consensus 463 ~e~~~~f~D~~~tiri~~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~l~~il~--~~~~q~-~~P~~~~k 539 (599) T protein:vir:31 463 LEQGRNHLDASDTIKTFNSELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAILG--GPLGAA-LAPHMSRT 539 (599) T ss_pred HHHHHhhcccccceeeecccccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHHHHHHHhc--ccCCCc-cchhhHHH Confidence 999887532222 1112221 223333333 678888999999997 555543 34455555 Q ss_pred HHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hhcCcchHHhhhhcCCCC Q lcl|NC_011045. 465 MIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQ-ATASPEAMAAAADSVGLQ 533 (536) Q Consensus 465 ~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~-~~~~~~~~~~~~~~~~~q 533 (536) ++...++....+.--.|.+..--+++ |+.+.+++|.+..|.....+.+ .-+.|- .|.+ | T Consensus 540 ~l~~~l~~~~~l~~~~~~~~~va~~e---qq~~~~m~Q~~lq~~~~~~~~~~~~~~~~-----~~~~--~ 599 (599) T protein:vir:31 540 KLFNAVEYLGDLDAYGIFTFGIGVQE---DQQLARMAQKSTQQTEETALTQEEVGGPT-----TDTG--Q 599 (599) T ss_pred HHHHHHHHHHhccccccCCCchhHHH---HHHHHHHHHHHHHHhHhhhhhhhhcCCCC-----cccC--C Confidence 55555555333322334443322222 1222222222222222222222 111111 1111 1 No 30 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=100.00 E-value=1.1e-33 Score=201.35 Aligned_cols=510 Identities=11% Similarity=0.055 Sum_probs=292.7 Q ss_pred CCCc--cccccHHHH----HHHHHHHHHHhh-hHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHH Q lcl|NC_011045. 1 MAEK--RTGLAEEGA----KSVYERLKNDRA-PYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKL 73 (536) Q Consensus 1 Ma~~--~~~~~~~~~----~~r~~~l~~~R~-~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l 73 (536) |||. ...++.+++ ..+++..++.+. .+...+.+-.+|.+-..... .. 
....+++.+.-...++.+.+.| T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~-~~---~~~s~~~~~~v~~~v~~~~~~l 76 (705) T protein:vir:88 1 MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGN-ER---PGKSGIVSRDVQETVDWIMPSL 76 (705) T ss_pred CCcccccccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCc-cc---CCCCccccHHHHHHHHHHHHHH Confidence 9983 445555543 344444444333 22223344445543221111 11 1235667777888999999999 Q ss_pred HHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHH-HHhccChHHHHHHHHHHHhhCcEEE---EEec Q lcl|NC_011045. 74 MLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNY-IESNSYRVTLFEALKQLVVAGNVLL---YLPE 149 (536) Q Consensus 74 ~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~-l~~snf~~~~~~~~~dl~~~G~~~l---~~~~ 149 (536) +..+|++.+||++.+..+...+ . -+.++..+.-. ...++.+..++.+++|.++.|+|++ |... T Consensus 77 ~~~~~~~~~~~~~~p~~~~D~~------~-------a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~ 143 (705) T protein:vir:88 77 MKVFTSGGQVVKYEPDTAEDVE------Q-------AEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEV 143 (705) T ss_pred HHhhcCCCceEEEeeCChhHHH------H-------HHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccc Confidence 9999999999999986543221 1 11222222222 3456678889999999999999987 3111 Q ss_pred ---------------------CC-----------------------CCceeeEEEEecceEEEeeCCCCCeEE--EEEeE Q lcl|NC_011045. 150 ---------------------PE-----------------------GSNYNPMKLYRLSSYVVQRDAFGNVLQ--MVTRD 183 (536) Q Consensus 150 ---------------------~~-----------------------~~~~~~~~~~~l~~~~v~~d~~G~v~~--i~r~~ 183 (536) +. ..+.++++.||..+|+|..++.+--|. +++++ T Consensus 144 ~~~~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~~~d~~~~~~~~ 223 (705) T protein:vir:88 144 LKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHRE 223 (705) T ss_pred cchhhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCCcccCcEEEEEE Confidence 00 013467888999999999998764443 56788 Q ss_pred eccHHHHHHHHhH-----Hhhhc------------c---------------ccCCCCceEEEEEEEEec--CCCCceeEE Q lcl|NC_011045. 
184 QIAFGALPEDIRK-----AVEGQ------------G---------------GEKKADETIDVYTHIYLD--EDSGEYIRY 229 (536) Q Consensus 184 ~~t~~~l~~~~~~-----~~~~~------------~---------------~~~~~~~~~~v~~~v~p~--~~~~~~~~~ 229 (536) .+|..+|...... .+... . ..+.....|++|+|..+- ..++...+| T Consensus 224 ~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~~ 303 (705) T protein:vir:88 224 KYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELR 303 (705) T ss_pred eccHHHHHhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEecccCCcceeeE Confidence 9999998432100 00000 0 001112357777776542 224455565 Q ss_pred EEe-cCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccch Q lcl|NC_011045. 230 EEV-EGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQP 308 (536) Q Consensus 230 ~~v-~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~ 308 (536) ..+ -|..++..+ +.+.+||++.++.+.++..||+|+++...+-++.+|.+.+.+++++..+++|.++++ +|..++ T Consensus 304 ~~~~~g~~il~~~---~~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~-~g~v~~ 379 (705) T protein:vir:88 304 RILYVGDYIISNE---PWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVL-DGQVNL 379 (705) T ss_pred EEEEeCccccccc---cCCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceecc-ccccCc Confidence 544 355554433 346899999999999999999999999999999999999999999999999999995 566678 Q ss_pred hhhccCCCcceecCCc-ccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhcccCC----CCCCCHHHHHHHHHHHH Q lcl|NC_011045. 309 RRLTKAQTGDFVTGRP-EDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRT----GERVTAEEIRYVASELE 382 (536) Q Consensus 309 ~~~~~~~~g~~~~g~~-~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~----~~r~TAtEi~~r~~E~~ 382 (536) .++....+|.++.-.. +.+.+++. +.-.+.+...++.+.+.|++..=. +.....+ ..+.||+.|....+... 
T Consensus 380 ~d~~~~~pg~vv~~~~~~~i~~~~~--~~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~ 457 (705) T protein:vir:88 380 EDLLTNEAAGIVRVKSMNSITPLET--PQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAE 457 (705) T ss_pred ccccccCCCeeEEecCCCccccccC--CcCcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHH Confidence 8888888888776443 44555443 333455666777788888775522 1111112 33679999999999999 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCC---CC-----------cceEEEEechHHHHH---HHHHHHHHHHH Q lcl|NC_011045. 383 DTLGGVYSILSQELQLPLVRVLLKQLQATQQIPEL---PK-----------EAVEPTISTGLEAIG---RGQDLDKLERC 445 (536) Q Consensus 383 ~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~---~~-----------~~v~v~~vs~La~a~---r~~~~~~l~~~ 445 (536) ..+..+...+...++.+++.+++.++.....-|.+ .+ ..+.+++.+++..+. +.+.+..+++. T Consensus 458 ~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll~~ 537 (705) T protein:vir:88 458 QQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEM 537 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhccCCceEEeeccccchHHHHHHHHHHHHHH Confidence 99999999999899999999999999886543321 11 123444445454444 44444444444 Q ss_pred HHHHHhhcchhhhhcC---CHHHHHHHHHHHcCCC-hhhccCCHHHHHHHHHHHHHH-----H---HHHH-------HHH Q lcl|NC_011045. 446 VAAWAALAPMRDDPDI---NLAMIKLRIANAIGID-TSGILLTEEQKQQKMAQQSMQ-----M---GMDN-------GAA 506 (536) Q Consensus 446 ~~~~~~~~p~~~~~~i---d~d~~~~~~a~~~Gv~-p~~i~rs~~ev~~~~~q~~~q-----~---~~~~-------~a~ 506 (536) .+.+.+..+ ..+.+ +...+...++...|+- +..++..+...++++++++.+ + ++++ ++. T Consensus 538 ~q~l~~~~~--~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e 615 (705) T protein:vir:88 538 AQAVVGGGG--LGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSD 615 (705) T ss_pred HHHhhcccc--hhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHH Confidence 433333211 11123 3445566666666541 223332222222111110000 0 0000 000 Q ss_pred HHHHHHHHhh-hcCcchHHhhhhcCCCCCCC Q lcl|NC_011045. 
507 ALAQGMAAQA-TASPEAMAAAADSVGLQPGI 536 (536) Q Consensus 507 ~~~~~~~~~~-~~~~~~~~~~~~~~~~q~~~ 536 (536) +....+-.+. ....++ .++.....=|.+. T Consensus 616 ~~~~q~e~q~~q~E~q~-~q~e~e~~~~~~~ 645 (705) T protein:vir:88 616 ALAKQAEAQMKQVEAQI-RLAEIELKKQEAV 645 (705) T ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Confidence 0000000000 000000 0000000000000 No 31 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=99.95 E-value=1.2e-26 Score=162.63 Aligned_cols=514 Identities=10% Similarity=0.028 Sum_probs=261.5 Q ss_pred CCCccccc-------------------cHHHHHHHHHHHHHHhhhHH---HHHHHHHHHhcccccCCCCCcccccccccc Q lcl|NC_011045. 1 MAEKRTGL-------------------AEEGAKSVYERLKNDRAPYE---TRAQNCAQYTIPSLFPKDSDNASTDYVTPW 58 (536) Q Consensus 1 Ma~~~~~~-------------------~~~~~~~r~~~l~~~R~~~e---~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~ 58 (536) |-+.-+++ .-..|++-++.+++...+.. ..|.+++-|.. ... -+......++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~----~~~~~grs~vv 75 (763) T protein:vir:95 1 MEQNTDSMVPLPDPSQATKLTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEG-KAK----PPKVKGRSQVQ 75 (763) T ss_pred CCcCccCcCCCccccchhcCCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccc-cCc----ccccCCCcccc Confidence 33222221 01223333333333332222 33545443331 111 11111245678 Q ss_pred cchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHH-HHHhccChHHHHHHHHHH Q lcl|NC_011045. 59 QAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMN-YIESNSYRVTLFEALKQL 137 (536) Q Consensus 59 dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~-~l~~snf~~~~~~~~~dl 137 (536) ...-.+.++.+-+.|+..++++..||++.+..++..+. .. 
..+..+.- ....++-+..++.+++++ T Consensus 76 ~~~v~~~ve~~~~~l~~~f~~~~~~~~~~P~~~~D~~~------A~-------q~t~~~n~~~~~~~~~~~~~~~~~~~~ 142 (763) T protein:vir:95 76 PKLVRRQAEWRYSALTEPFLGSNKLFKVTPVTWEDVQG------AR-------QNELVLNYQFRTKLNRVSFIDNYVRSV 142 (763) T ss_pred CHHHHHHHHHHHHHHHHhhcCCCcEEEEecCCcchHHH------HH-------HHHHHHHHHHhhcCchhhHHHHHHHHH Confidence 88888899999999999999999999999887653321 11 12222222 334566778899999999 Q ss_pred HhhCcEEEEEecCC----------------------------------------------------------C------- Q lcl|NC_011045. 138 VVAGNVLLYLPEPE----------------------------------------------------------G------- 152 (536) Q Consensus 138 ~~~G~~~l~~~~~~----------------------------------------------------------~------- 152 (536) ++.|||++=+-.+. + T Consensus 143 l~~~~gv~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (763) T protein:vir:95 143 VDDGTGIVRVGWNREIRKEKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQ 222 (763) T ss_pred hhcCcceEEEeeeeeeeeeeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeec Confidence 99999975221100 0 Q ss_pred ------------CceeeEEEEecceEEEeeCCCCCeE---EEEEeEeccHHHHHHH-HhH--------Hhhhc------- Q lcl|NC_011045. 153 ------------SNYNPMKLYRLSSYVVQRDAFGNVL---QMVTRDQIAFGALPED-IRK--------AVEGQ------- 201 (536) Q Consensus 153 ------------~~~~~~~~~~l~~~~v~~d~~G~v~---~i~r~~~~t~~~l~~~-~~~--------~~~~~------- 201 (536) ++..+++.||+.+|+|..++.+.++ -+|+++.+|..+|... ++. ...+. T Consensus 223 ~~~~~~~~~~~~k~~p~ie~V~p~d~~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~ 302 (763) T protein:vir:95 223 TGTTTTEVEVPLANHPTVEMLNPENIIIDPSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHA 302 (763) T ss_pred ccceeEEEEEEecCceEEEeecHHHheecCCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhcccccccccc Confidence 0112566799999999999876444 3578899999999542 111 00000 Q ss_pred -------cccCCCCceEEEEEEEEec--CCCCceeEEEEe-cCcccccc-ccccccccCceEEEeeeecCCCccccchHH Q lcl|NC_011045. 
202 -------GGEKKADETIDVYTHIYLD--EDSGEYIRYEEV-EGMEVQGS-DGTYPKEACPYIPIRMVRLDGESYGRSYIE 270 (536) Q Consensus 202 -------~~~~~~~~~~~v~~~v~p~--~~~~~~~~~~~v-~g~~i~~~-~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~ 270 (536) .......++|.|+.|..+- .+++...+|..+ -|..+++. ...|+.+.+||+++.+.+.++..||.|.++ T Consensus 303 ~~~~~~~~~~d~~~~~V~v~E~y~~~d~~gdg~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~ 382 (763) T protein:vir:95 303 TTTPQEFQISDPMRKRVVAYEYWGFWDIEGNGVLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAE 382 (763) T ss_pred ccchhhccCCCcccceEEEEEeeeeeccCCcceeEEEEEEEEcCeeeecccccccCCCcCEEEecceeecCcccCCchHH Confidence 0001113678887775532 223444444422 34444443 355666889999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCcceec---CCcccccccccc---cccchhHHHHH Q lcl|NC_011045. 271 EYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFVT---GRPEDISFLQLE---KQADFTVAKAV 344 (536) Q Consensus 271 ~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~~~~---g~~~~~~~~~~~---~~~~~~~~~~~ 344 (536) .+.+.++.+|++.+..++.+..+++|.|+++.+. .+..+.....+|.++. |......+.+.. .+..+..+.+. T Consensus 383 ~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~ga-v~~~d~~~~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~ 461 (763) T protein:vir:95 383 LLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGM-LDALNSRRYREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATL 461 (763) T ss_pred HhhHHHHHHHHHHHHHHHHHHhhcCCcEEeeccc-ccchhhhcccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHH Confidence 9999999999999999999999999999996554 5656655667777653 332221111111 12233333333 Q ss_pred HHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCC----------- Q lcl|NC_011045. 345 SDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQ----------- 413 (536) Q Consensus 345 i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~----------- 413 (536) ++...+.+.-.--.......+....||++|..+.+.....+..++.+|.. .+.+++.+++.++.+.-. 
T Consensus 462 ~~~~~e~~TGv~~~~~G~~~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~-~~k~l~~~~l~Li~q~~d~~rviRI~g~e 540 (763) T protein:vir:95 462 QNQEAESLTGVKAFAGGVTGESYGDVAAGIRGVLDAASKREMAILRRLAK-GMSEIGNKIIAMNAVFLAEHEVVRITNEE 540 (763) T ss_pred HHHHHHHhhCcchhhcCcCcccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEeCCc Confidence 33333333222112211122334569999999999999999988888876 689999999999988521 Q ss_pred CCCCCCc----ceEEEEechHHHHHHHHHHHHHHHHHHHHHh-hcchh-------hhhcCCHHHHHHHHHHHcCCChhhc Q lcl|NC_011045. 414 IPELPKE----AVEPTISTGLEAIGRGQDLDKLERCVAAWAA-LAPMR-------DDPDINLAMIKLRIANAIGIDTSGI 481 (536) Q Consensus 414 lp~~~~~----~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~-~~p~~-------~~~~id~d~~~~~~a~~~Gv~p~~i 481 (536) ..++... .+.|.+..+.+ ..+.+.++.+.++++.++. +.+.. ..+..+...+++.+.....- |..+ T Consensus 541 ~v~v~~~~~~~~~DV~V~~~~a-s~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~-~d~~ 618 (763) T protein:vir:95 541 FVTIKREDLKGNFDLEVDISTA-EVDNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQ-PDPV 618 (763) T ss_pred cccccHHHhcCCcceEEecccc-hHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCC-ccch Confidence 1122111 22333322222 2223333444444443322 11110 00122333333333322221 1111 Q ss_pred cCCHHHHHHHHHHHHHH-HHHHHH-HHHHHHHHHHhh-------hcCcchHHhhh------------------------- Q lcl|NC_011045. 482 LLTEEQKQQKMAQQSMQ-MGMDNG-AAALAQGMAAQA-------TASPEAMAAAA------------------------- 527 (536) Q Consensus 482 ~rs~~ev~~~~~q~~~q-~~~~~~-a~~~~~~~~~~~-------~~~~~~~~~~~------------------------- 527 (536) -.-..+.++.+++.+++ ++.+++ +.+.+....++. ...-+.+++.. T Consensus 619 ~q~qaqle~~~~q~e~~~~~akaq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q~~~e~~~~~~~~eaq~~l~~~~a~~~~~ 698 (763) T protein:vir:95 619 QEQLKQLAVEKAQLENEELRSKIRLNDAQAQKAMAERDNKNLDYLEQESGTKHARDLEKMKAQSQGNQQLEITKALTKPR 698 (763) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111111110000000 000000 000000000000 00000000000 Q ss_pred hcCCCCCCC Q lcl|NC_011045. 
528 DSVGLQPGI 536 (536) Q Consensus 528 ~~~~~q~~~ 536 (536) ..+..++.. T Consensus 699 ~ea~~~~~~ 707 (763) T protein:vir:95 699 KEGELPPNL 707 (763) T ss_pred HHhccChhH Confidence 000001111 No 32 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.87 E-value=1e-20 Score=130.21 Aligned_cols=513 Identities=11% Similarity=0.029 Sum_probs=260.6 Q ss_pred CCCcccc---------cc----H---HHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccc----ccccccccc Q lcl|NC_011045. 1 MAEKRTG---------LA----E---EGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS----TDYVTPWQA 60 (536) Q Consensus 1 Ma~~~~~---------~~----~---~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~----~~~~~~~ds 60 (536) |+.+... ++ . +.|.++|..-.+.-..|...+.+-.+|..-. ..+..... +..+.+.-+ T Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~--Qw~~~~~~~l~~~g~p~~~~N 99 (776) T protein:vir:93 22 LSPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNI--QWSQDEIDELKERGQAPTVYN 99 (776) T ss_pred CCCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCC--CCCHHHHHHHHhcCCceEEec Confidence 3222211 11 1 1333444444444456666666666776211 11111101 112233333 Q ss_pred hHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhh Q lcl|NC_011045. 61 VGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVA 140 (536) Q Consensus 61 t~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~ 140 (536) ...-.++.+.+..-. +++=+++.+.++.-. ++- +.++..+......+++..+...++.|.++. 
T Consensus 100 ~i~~~i~~v~g~~~~----nr~~~~~~p~~~~d~----------~~A---e~l~~~~~~~~~~~~~~~~~~~af~d~~~~ 162 (776) T protein:vir:93 100 VISQSVNWIIGSEKR----GRSDFKVLPRRKDGG----------KAA---ERKTALLKYLSDVNHTPFERSMAFEETTKA 162 (776) T ss_pred chHHHHHHHHHHHHh----CCcceEEecCChhHH----------HHH---HHHHHHHHHHHHhhcHHHHHHHHHHHhhhc Confidence 334444444332222 334455555432110 122 223334444556789999999999999999 Q ss_pred CcEEE--EEecCCCCceeeEEEEecceEEEeeCCCC----CeEEEEEeEeccHHHHHHHHhHHhhhc-------c----- Q lcl|NC_011045. 141 GNVLL--YLPEPEGSNYNPMKLYRLSSYVVQRDAFG----NVLQMVTRDQIAFGALPEDIRKAVEGQ-------G----- 202 (536) Q Consensus 141 G~~~l--~~~~~~~~~~~~~~~~~l~~~~v~~d~~G----~v~~i~r~~~~t~~~l~~~~~~~~~~~-------~----- 202 (536) |.|++ +++.+..+..+..+.++..+++++.++.- ...-+|++..+|.+++...|++..... + T Consensus 163 G~G~~~v~~d~~~~~~~~~~~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~ 242 (776) T protein:vir:93 163 GIGWLESQVQDENDGEPIYAGAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGT 242 (776) T ss_pred CcceEEEEeeccCCCCceEeeccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccch Confidence 99986 45666666777778888899999877642 233468889999999988776432110 0 Q ss_pred c------------------------cCCCCceEEEEEEEEecCC----------CCce---------------------- Q lcl|NC_011045. 203 G------------------------EKKADETIDVYTHIYLDED----------SGEY---------------------- 226 (536) Q Consensus 203 ~------------------------~~~~~~~~~v~~~v~p~~~----------~~~~---------------------- 226 (536) . .....++|.|+.+.+.++. +.+. T Consensus 243 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~ 322 (776) T protein:vir:93 243 DDIDGDDAMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLA 322 (776) T ss_pred hcccccccccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeeh Confidence 0 0011246777777554311 0000 Q ss_pred ------eEEEEecCcccccc-ccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCcee Q lcl|NC_011045. 
227 ------IRYEEVEGMEVQGS-DGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGL 299 (536) Q Consensus 227 ------~~~~~v~g~~i~~~-~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~l 299 (536) +.+..+-|..++.. .+.|+++.|||+++...+.+.+.||.|.+..+.+-++.+|.+...++..+ .+.+++ T Consensus 323 ~~~~~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l---~~~~~~ 399 (776) T protein:vir:93 323 VSPMMRMHCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYIL---STNKVL 399 (776) T ss_pred heeeeeeEEEEEecchhhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHhh---cCCcee Confidence 01122234444333 35577789999999999999999999999999999999999988877654 356688 Q ss_pred eccccccchhhhcc--CCCcceecCCcccccccccccccc-hhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHH Q lcl|NC_011045. 300 VNPAGITQPRRLTK--AQTGDFVTGRPEDISFLQLEKQAD-FTVAKAVSDAIEARLSFAF-MLNSAVQRTGERVTAEEIR 375 (536) Q Consensus 300 v~~~g~~~~~~~~~--~~~g~~~~g~~~~~~~~~~~~~~~-~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~TAtEi~ 375 (536) +..+++-+.+++.. +.+|.++..+++......+....+ .+...+.++...+.|+..- ..+.+....+..++..-|. T Consensus 400 ~~~gav~~~d~~~~~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~ 479 (776) T protein:vir:93 400 MEEGAVDDIDEFRREAARPDAVMTVKNGKLGAVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAVSGVAIQ 479 (776) T ss_pred eccccccchHHHHHhcccCCceeeeCCccccccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchhhHHHHH Confidence 87777777776654 567777765555433322222212 2344445555555555432 1122222344457888899 Q ss_pred HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCC---------------CCCCC----Cc-----ceEEEEe-chH Q lcl|NC_011045. 376 YVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQ---------------IPELP----KE-----AVEPTIS-TGL 430 (536) Q Consensus 376 ~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~---------------lp~~~----~~-----~v~v~~v-s~L 430 (536) .+.+.....|..++.++..-+ .=+.+.++.++.+.-- .-.|. .. .+.|.+. 
++- T Consensus 480 ~~~~~~~~~~~~~~dn~~~~~-~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~ 558 (776) T protein:vir:93 480 ARQEQGSVATNKLFDNLRLAF-QQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEAEW 558 (776) T ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeeccc Confidence 999999999999999997744 3366666666665310 00011 11 1234333 333 Q ss_pred HHHHHHHHHHHHHHHHHHHHh-hcc----hhh--hhcCCHHHHHHHHHHHcCC-ChhhccCCHHHHHHHHHHHHH-HHHH Q lcl|NC_011045. 431 EAIGRGQDLDKLERCVAAWAA-LAP----MRD--DPDINLAMIKLRIANAIGI-DTSGILLTEEQKQQKMAQQSM-QMGM 501 (536) Q Consensus 431 a~a~r~~~~~~l~~~~~~~~~-~~p----~~~--~~~id~d~~~~~~a~~~Gv-~p~~i~rs~~ev~~~~~q~~~-q~~~ 501 (536) ...+|.+.++.+++.++.+.. +.+ ..+ ++.-+.+++...+-...+- +|..-...+++.++.+.+++. +.++ T Consensus 559 ~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~~~q~q~ 638 (776) T protein:vir:93 559 RATMRQAAVAELMEVIGKMPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQQQQYND 638 (776) T ss_pred chhHHHHHHHHHHHHHhhcChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhHHHHHHHhhHHHHHHH Confidence 455577766666665543211 111 111 1112456677776666542 122222222222221111110 0000 Q ss_pred HHHHHH----HHH--HHHHhh-hcCcchH---Hhhhh--cCCCCCCC Q lcl|NC_011045. 502 DNGAAA----LAQ--GMAAQA-TASPEAM---AAAAD--SVGLQPGI 536 (536) Q Consensus 502 ~~~a~~----~~~--~~~~~~-~~~~~~~---~~~~~--~~~~q~~~ 536 (536) +.+..+ .+. ...+++ ....++. .++.. 
....|+++ T Consensus 639 ~~~~a~~~~~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~a~q~a~ 685 (776) T protein:vir:93 639 ALAIATLEEQQAKARKAAAEAQVAEAKAKHISRMAIREGVGAVKDAT 685 (776) T ss_pred HHhhhhhhHhhHHHHHHHHHHHHHhhhhhhhhhcchhhhhhhhhhhh Confidence 000000 000 000000 0000000 00000 00011111 No 33 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.84 E-value=1.6e-18 Score=118.17 Aligned_cols=523 Identities=12% Similarity=0.059 Sum_probs=260.4 Q ss_pred CCCcccc---------------ccHHHHHHHHHHHHHH---hhhHHHHHH----HHHHHhcccccCCCCCc----ccccc Q lcl|NC_011045. 1 MAEKRTG---------------LAEEGAKSVYERLKND---RAPYETRAQ----NCAQYTIPSLFPKDSDN----ASTDY 54 (536) Q Consensus 1 Ma~~~~~---------------~~~~~~~~r~~~l~~~---R~~~e~~w~----e~~~~~~P~~~~~~~~~----~~~~~ 54 (536) ||+++.- .+.++..+.+.++++. -..|...|+ +-.+|..- ...+... ..+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G--~Qw~~~~~~~l~~~g~ 78 (711) T protein:vir:10 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGG--EQWPSQVRTERELEQR 78 (711) T ss_pred CCcccccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCC--CCCCHHHHHHHHhcCC Confidence 8775421 1111223333333332 123333343 44445521 1111110 01112 Q ss_pred cccccchHHHHHHHHHHHHHHhhcCCCcceeccCCh------------hhhhhhccChhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 55 VTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISE------------YEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE 122 (536) Q Consensus 55 ~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d------------~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~ 122 (536) +.+.-+...-.++...+..-. +++=+++.+.+ ............-.++-+.| +..+..... 
T Consensus 79 p~~~~N~i~~~v~~v~g~~~~----nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l---~~~~~~~~~ 151 (711) T protein:vir:10 79 PCLVNNVLPTFVDQVLGDQRQ----NRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVF---TGLIKNIEY 151 (711) T ss_pred CcEEEcchHHHHHHHhhhHhh----CCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHH---HHHHHHHHH Confidence 223333333334333333221 12211222211 11111111111111222223 333444555 Q ss_pred hccChHHHHHHHHHHHhhCcEEE-----EEecCCCCceeeEEEEe-cceEEEeeCC---CC-CeEEEEEeEeccHHHHHH Q lcl|NC_011045. 123 SNSYRVTLFEALKQLVVAGNVLL-----YLPEPEGSNYNPMKLYR-LSSYVVQRDA---FG-NVLQMVTRDQIAFGALPE 192 (536) Q Consensus 123 ~snf~~~~~~~~~dl~~~G~~~l-----~~~~~~~~~~~~~~~~~-l~~~~v~~d~---~G-~v~~i~r~~~~t~~~l~~ 192 (536) .++...+...++.+.++.|.|++ |..++...+-+.++.++ ..+++++.++ ++ ...-+|++..|+.+++.. T Consensus 152 ~~~~~~~~s~af~d~~~~G~G~~ev~~d~~~~d~~~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~ 231 (711) T protein:vir:10 152 NCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKFKA 231 (711) T ss_pred hcChhHHHHHHHHHhhhcCcceEEEEecccCCCCCCCCeEEeeecChhheeeCccccccChhhhcceeeeecCCHHHHHH Confidence 78899999999999999999975 22233333445566664 6778886654 33 234478999999999999 Q ss_pred HHhHHhhhccc---cCC-----CCceEEEEEEEEecCC--------CC-----------------------------cee Q lcl|NC_011045. 193 DIRKAVEGQGG---EKK-----ADETIDVYTHIYLDED--------SG-----------------------------EYI 227 (536) Q Consensus 193 ~~~~~~~~~~~---~~~-----~~~~~~v~~~v~p~~~--------~~-----------------------------~~~ 227 (536) .|++....... ..+ ..++|.|..+.+.++. ++ .++ T Consensus 232 ~yp~~a~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 311 (711) T protein:vir:10 232 LYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFK 311 (711) T ss_pred hCCchhhhhhhcccccccCcccCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceee Confidence 99765322111 011 1244444443332210 00 111 Q ss_pred E-EEEecCccccccccccccccCceEEE--eeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccc Q lcl|NC_011045. 
228 R-YEEVEGMEVQGSDGTYPKEACPYIPI--RMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAG 304 (536) Q Consensus 228 ~-~~~v~g~~i~~~~~~~~~~~~P~~~~--rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g 304 (536) + +.-+.|..++...+.++++.+||+++ .+..+++..++.|.+..+.+-++.+|++....+.++....+++|++.++. T Consensus 312 v~~~~~~G~~~L~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~ga 391 (711) T protein:vir:10 312 TYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGN 391 (711) T ss_pred EEEEEEecceeecCCCCCCCCcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCcc Confidence 1 22245666665556677788999865 35567888888889999999999999999999999999999999998888 Q ss_pred ccchhhhc---cCCCcceecCCccc---ccccccccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHHH Q lcl|NC_011045. 305 ITQPRRLT---KAQTGDFVTGRPED---ISFLQLEKQADFTVAKAVSDAIEARLSFAF-MLNSAVQRTGERVTAEEIRYV 377 (536) Q Consensus 305 ~~~~~~~~---~~~~g~~~~g~~~~---~~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~TAtEi~~r 377 (536) +-+.++.. .+.+|.++..+++. ..+.+...+.-.+.....++...+.|.+.- ..+.+....+..+|+.-|..+ T Consensus 392 i~~~~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~ 471 (711) T protein:vir:10 392 VEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIAR 471 (711) T ss_pred cCChHHHHHhccccCCCeeEecccccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccchHHHHHHHH Confidence 87766643 25677776554432 223333333334555566666666665542 222223344456799999999 Q ss_pred HHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCC------------C---CCCC------------------cceEE Q lcl|NC_011045. 378 ASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQI------------P---ELPK------------------EAVEP 424 (536) Q Consensus 378 ~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~l------------p---~~~~------------------~~v~v 424 (536) ++.....|..++.++..- ..=+...++.++.+.--- + .+.. 
..+.| T Consensus 472 q~qg~~~l~~~~dn~~~~-~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv 550 (711) T protein:vir:10 472 QRQGDRGSFAFIDNLTKS-IRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDV 550 (711) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccceeeeEE Confidence 999999999999998864 333445555555442100 0 0110 01123 Q ss_pred EE-echHHHHHHHHHHHHHHHHHHHHHhhcchh---hh---hcCCHHHHHHHHHHHcCCChhhccCCHHHHHH----HHH Q lcl|NC_011045. 425 TI-STGLEAIGRGQDLDKLERCVAAWAALAPMR---DD---PDINLAMIKLRIANAIGIDTSGILLTEEQKQQ----KMA 493 (536) Q Consensus 425 ~~-vs~La~a~r~~~~~~l~~~~~~~~~~~p~~---~~---~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~----~~~ 493 (536) .+ ++|-...+|.+.+..+.++++.+.++.+.. +. +.-+.++++..+....+- ........+..+ .++ T Consensus 551 ~i~~~p~~~s~r~~~~~~l~ql~~~~p~~~~~~~~~il~~~d~p~~~el~e~lr~~~~~--~~~~~~~~~~~qq~~~e~q 628 (711) T protein:vir:10 551 VVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPP--NVLSKDEREAIEEDMPEQT 628 (711) T ss_pred EEeeccCchhHHHHHHHHHHHHHhhcchhhhHHHHHHHHhcCCCCHHHHHHHHHhhcCc--ccCcchhhhHHHHHHHHHH Confidence 32 455667778887777777766543322211 11 223678888888877653 222222211111 111 Q ss_pred HHHHHHHHHHHHHHH------HHHHHHhh---hcCc------------chHHhhhhcCCCCCCC Q lcl|NC_011045. 494 QQSMQMGMDNGAAAL------AQGMAAQA---TASP------------EAMAAAADSVGLQPGI 536 (536) Q Consensus 494 q~~~q~~~~~~a~~~------~~~~~~~~---~~~~------------~~~~~~~~~~~~q~~~ 536 (536) ++.++++.+++..+. +....+++ .... +.+.++. 
+.-+|++- T Consensus 629 q~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~~~~~aq~~-~~~~qq~~ 691 (711) T protein:vir:10 629 EPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGG-DVVYQQVR 691 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Confidence 110000100000000 00000000 0000 0011111 11122211 No 34 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.75 E-value=1.1e-15 Score=102.56 Aligned_cols=513 Identities=12% Similarity=0.065 Sum_probs=248.7 Q ss_pred CCCcc-ccccHHHHHHHHHHHHHHhh---hHHHHHHHHHHHhcccccCCCCCcc----cccccccccchHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKR-TGLAEEGAKSVYERLKNDRA---PYETRAQNCAQYTIPSLFPKDSDNA----STDYVTPWQAVGARGLNNLASK 72 (536) Q Consensus 1 Ma~~~-~~~~~~~~~~r~~~l~~~R~---~~e~~w~e~~~~~~P~~~~~~~~~~----~~~~~~~~dst~~~a~~~Laa~ 72 (536) |+.+. +.++.+..++.+..+..+.. .|.....+-.+|..- .-.+.... .+..+.+.-+...-.++...+. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G--~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~ 85 (714) T protein:vir:81 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDG--DQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGM 85 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcC--CCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhH Confidence 66643 23334455556666666543 333333366666631 11111110 0112223233333333333222 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEE--EEEecC Q lcl|NC_011045. 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVL--LYLPEP 150 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~--l~~~~~ 150 (536) . -=+++=+++.+.+..- ...++-+ .++..+......+++..+...++.+.++.|-|. 
+|++.+ T Consensus 86 ~----~~nr~~~~v~p~~~~~--------~~~~~Ae---~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d 150 (714) T protein:vir:81 86 E----AKTRTDLVVMSDEPDD--------ETEKLAE---AINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSD 150 (714) T ss_pred H----HhCCcceEEecCCCCc--------hhHHHHH---HHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccC Confidence 2 2233333444432110 0011222 223344455557899999999999999988887 577777 Q ss_pred CCCceeeEEEEecceEEEeeCCCC----CeEEEEEeEeccHHHHHHHHhHHhh---hc---cc----------------- Q lcl|NC_011045. 151 EGSNYNPMKLYRLSSYVVQRDAFG----NVLQMVTRDQIAFGALPEDIRKAVE---GQ---GG----------------- 203 (536) Q Consensus 151 ~~~~~~~~~~~~l~~~~v~~d~~G----~v~~i~r~~~~t~~~l~~~~~~~~~---~~---~~----------------- 203 (536) ..++.++++.+|..+++++.++.- .-.-+|++..++.+++...|++... .. +. T Consensus 151 ~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~ 230 (714) T protein:vir:81 151 PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMS 230 (714) T ss_pred CCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccccccccc Confidence 777888999999999999886532 2224788999999999888875221 00 00 Q ss_pred --------c-------CCCCceEEEEEEEEecC--------CCCce----------------------------eEEEEe Q lcl|NC_011045. 204 --------E-------KKADETIDVYTHIYLDE--------DSGEY----------------------------IRYEEV 232 (536) Q Consensus 204 --------~-------~~~~~~~~v~~~v~p~~--------~~~~~----------------------------~~~~~v 232 (536) . .....+|.|+.|.+... .++.. ++...+ T Consensus 231 ~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~ 310 (714) T protein:vir:81 231 AWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF 310 (714) T ss_pred chhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE Confidence 0 00124566666654321 11111 122334 Q ss_pred cCcccccc-ccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchh Q lcl|NC_011045. 
233 EGMEVQGS-DGTYPKEACPYIPIRMVR--LDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPR 309 (536) Q Consensus 233 ~g~~i~~~-~~~~~~~~~P~~~~rw~~--~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~ 309 (536) .|..++.. .+.|+...|||++.-... ..|..| |.+..+.+-++.+|+..-..+.+ +..+-+++ .++++.... T Consensus 311 ~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~~-~~~a~~~~d 385 (714) T protein:vir:81 311 VGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVIM-DEDATQLSD 385 (714) T ss_pred ecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCceee-ecCcccccH Confidence 56666543 345666789998764443 557777 68889999999999866555443 45666654 455554432 Q ss_pred -hhc--cCCCcceecCCcc---ccc---ccc-cccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHHHH Q lcl|NC_011045. 310 -RLT--KAQTGDFVTGRPE---DIS---FLQ-LEKQADFTVAKAVSDAIEARLSFAF-MLNSAVQRTGERVTAEEIRYVA 378 (536) Q Consensus 310 -~~~--~~~~g~~~~g~~~---~~~---~~~-~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~TAtEi~~r~ 378 (536) .+. .+.+|.++.-+++ ... .+. ...+.-.+.....++...+.|++.- ..+.+....+...+..-|..|+ T Consensus 386 ~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq 465 (714) T protein:vir:81 386 NDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLV 465 (714) T ss_pred HHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHH Confidence 222 2566776644332 111 111 1222223444455555555554432 1122222344456777799999 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-------------CCCC-----CCC--------Ccc-----eEEEE- Q lcl|NC_011045. 379 SELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-------------QQIP-----ELP--------KEA-----VEPTI- 426 (536) Q Consensus 379 ~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-------------g~lp-----~~~--------~~~-----v~v~~- 426 (536) +.....|+..+.+|..-+.. +.+.++.++.+. +... .+. 
..+ +.|.+ T Consensus 466 ~qg~~~l~~~~Dnl~~~~~~-~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~ 544 (714) T protein:vir:81 466 EQGATTLAEINDNYQFACQQ-VGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALA 544 (714) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEe Confidence 99999999999887765433 234444444331 1100 000 011 22333 Q ss_pred echHHHHHHHHHHHHHHHHHHHHHh----hcchhhhhcC---CHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHH Q lcl|NC_011045. 427 STGLEAIGRGQDLDKLERCVAAWAA----LAPMRDDPDI---NLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQM 499 (536) Q Consensus 427 vs~La~a~r~~~~~~l~~~~~~~~~----~~p~~~~~~i---d~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~ 499 (536) .+|-...+|.+.++.++++++.+.. +.+..+.... +.+++++.+-+.+|.....=-.++++.++..++++.++ T Consensus 545 ~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~ 624 (714) T protein:vir:81 545 PVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQ 624 (714) T ss_pred eccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHH Confidence 3455667788888888888765321 1122121223 56889999999888632111122222111111111111 Q ss_pred HHH--H---HHH------HHHHHHHHh----------------hhcCcchHHhhhhcC---CCCC-----CC Q lcl|NC_011045. 500 GMD--N---GAA------ALAQGMAAQ----------------ATASPEAMAAAADSV---GLQP-----GI 536 (536) Q Consensus 500 ~~~--~---~a~------~~~~~~~~~----------------~~~~~~~~~~~~~~~---~~q~-----~~ 536 (536) +++ + ..+ +.++.+.++ ......+..++..+. 
++|+ ++ T Consensus 625 ~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~ 696 (714) T protein:vir:81 625 QQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDV 696 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHH Confidence 000 0 000 000000000 000000111110000 0000 01 No 35 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.75 E-value=1.1e-15 Score=102.56 Aligned_cols=513 Identities=12% Similarity=0.065 Sum_probs=248.7 Q ss_pred CCCcc-ccccHHHHHHHHHHHHHHhh---hHHHHHHHHHHHhcccccCCCCCcc----cccccccccchHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKR-TGLAEEGAKSVYERLKNDRA---PYETRAQNCAQYTIPSLFPKDSDNA----STDYVTPWQAVGARGLNNLASK 72 (536) Q Consensus 1 Ma~~~-~~~~~~~~~~r~~~l~~~R~---~~e~~w~e~~~~~~P~~~~~~~~~~----~~~~~~~~dst~~~a~~~Laa~ 72 (536) |+.+. +.++.+..++.+..+..+.. .|.....+-.+|..- .-.+.... .+..+.+.-+...-.++...+. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G--~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~ 85 (714) T protein:vir:99 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDG--DQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGM 85 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcC--CCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhH Confidence 66643 23334455556666666543 333333366666631 11111110 0112223233333333333222 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEE--EEEecC Q lcl|NC_011045. 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVL--LYLPEP 150 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~--l~~~~~ 150 (536) . -=+++=+++.+.+..- ...++-+ .++..+......+++..+...++.+.++.|-|. 
+|++.+ T Consensus 86 ~----~~nr~~~~v~p~~~~~--------~~~~~Ae---~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d 150 (714) T protein:vir:99 86 E----AKTRTDLVVMSDEPDD--------ETEKLAE---AINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSD 150 (714) T ss_pred H----HhCCcceEEecCCCCc--------hhHHHHH---HHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccC Confidence 2 2233333444432110 0011222 223344455557899999999999999988887 577777 Q ss_pred CCCceeeEEEEecceEEEeeCCCC----CeEEEEEeEeccHHHHHHHHhHHhh---hc---cc----------------- Q lcl|NC_011045. 151 EGSNYNPMKLYRLSSYVVQRDAFG----NVLQMVTRDQIAFGALPEDIRKAVE---GQ---GG----------------- 203 (536) Q Consensus 151 ~~~~~~~~~~~~l~~~~v~~d~~G----~v~~i~r~~~~t~~~l~~~~~~~~~---~~---~~----------------- 203 (536) ..++.++++.+|..+++++.++.- .-.-+|++..++.+++...|++... .. +. T Consensus 151 ~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~ 230 (714) T protein:vir:99 151 PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMS 230 (714) T ss_pred CCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccccccccc Confidence 777888999999999999886532 2224788999999999888875221 00 00 Q ss_pred --------c-------CCCCceEEEEEEEEecC--------CCCce----------------------------eEEEEe Q lcl|NC_011045. 204 --------E-------KKADETIDVYTHIYLDE--------DSGEY----------------------------IRYEEV 232 (536) Q Consensus 204 --------~-------~~~~~~~~v~~~v~p~~--------~~~~~----------------------------~~~~~v 232 (536) . .....+|.|+.|.+... .++.. ++...+ T Consensus 231 ~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~ 310 (714) T protein:vir:99 231 AWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF 310 (714) T ss_pred chhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE Confidence 0 00124566666654321 11111 122334 Q ss_pred cCcccccc-ccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchh Q lcl|NC_011045. 
233 EGMEVQGS-DGTYPKEACPYIPIRMVR--LDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPR 309 (536) Q Consensus 233 ~g~~i~~~-~~~~~~~~~P~~~~rw~~--~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~ 309 (536) .|..++.. .+.|+...|||++.-... ..|..| |.+..+.+-++.+|+..-..+.+ +..+-+++ .++++.... T Consensus 311 ~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~~-~~~a~~~~d 385 (714) T protein:vir:99 311 VGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVIM-DEDATQLSD 385 (714) T ss_pred ecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCceee-ecCcccccH Confidence 56666543 345666789998764443 557777 68889999999999866555443 45666654 455554432 Q ss_pred -hhc--cCCCcceecCCcc---ccc---ccc-cccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHHHH Q lcl|NC_011045. 310 -RLT--KAQTGDFVTGRPE---DIS---FLQ-LEKQADFTVAKAVSDAIEARLSFAF-MLNSAVQRTGERVTAEEIRYVA 378 (536) Q Consensus 310 -~~~--~~~~g~~~~g~~~---~~~---~~~-~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~TAtEi~~r~ 378 (536) .+. .+.+|.++.-+++ ... .+. ...+.-.+.....++...+.|++.- ..+.+....+...+..-|..|+ T Consensus 386 ~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq 465 (714) T protein:vir:99 386 NDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLV 465 (714) T ss_pred HHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHH Confidence 222 2566776644332 111 111 1222223444455555555554432 1122222344456777799999 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-------------CCCC-----CCC--------Ccc-----eEEEE- Q lcl|NC_011045. 379 SELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-------------QQIP-----ELP--------KEA-----VEPTI- 426 (536) Q Consensus 379 ~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-------------g~lp-----~~~--------~~~-----v~v~~- 426 (536) +.....|+..+.+|..-+.. +.+.++.++.+. +... .+. 
..+ +.|.+ T Consensus 466 ~qg~~~l~~~~Dnl~~~~~~-~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~ 544 (714) T protein:vir:99 466 EQGATTLAEINDNYQFACQQ-VGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALA 544 (714) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEe Confidence 99999999999887765433 234444444331 1100 000 011 22333 Q ss_pred echHHHHHHHHHHHHHHHHHHHHHh----hcchhhhhcC---CHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHH Q lcl|NC_011045. 427 STGLEAIGRGQDLDKLERCVAAWAA----LAPMRDDPDI---NLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQM 499 (536) Q Consensus 427 vs~La~a~r~~~~~~l~~~~~~~~~----~~p~~~~~~i---d~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~ 499 (536) .+|-...+|.+.++.++++++.+.. +.+..+.... +.+++++.+-+.+|.....=-.++++.++..++++.++ T Consensus 545 ~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~ 624 (714) T protein:vir:99 545 PVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQ 624 (714) T ss_pred eccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHH Confidence 3455667788888888888765321 1122121223 56889999999888632111122222111111111111 Q ss_pred HHH--H---HHH------HHHHHHHHh----------------hhcCcchHHhhhhcC---CCCC-----CC Q lcl|NC_011045. 500 GMD--N---GAA------ALAQGMAAQ----------------ATASPEAMAAAADSV---GLQP-----GI 536 (536) Q Consensus 500 ~~~--~---~a~------~~~~~~~~~----------------~~~~~~~~~~~~~~~---~~q~-----~~ 536 (536) +++ + ..+ +.++.+.++ ......+..++..+. 
++|+ ++ T Consensus 625 ~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~ 696 (714) T protein:vir:99 625 QQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDV 696 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHH Confidence 000 0 000 000000000 000000111110000 0000 01 No 36 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.75 E-value=1.1e-15 Score=102.56 Aligned_cols=513 Identities=12% Similarity=0.065 Sum_probs=248.7 Q ss_pred CCCcc-ccccHHHHHHHHHHHHHHhh---hHHHHHHHHHHHhcccccCCCCCcc----cccccccccchHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKR-TGLAEEGAKSVYERLKNDRA---PYETRAQNCAQYTIPSLFPKDSDNA----STDYVTPWQAVGARGLNNLASK 72 (536) Q Consensus 1 Ma~~~-~~~~~~~~~~r~~~l~~~R~---~~e~~w~e~~~~~~P~~~~~~~~~~----~~~~~~~~dst~~~a~~~Laa~ 72 (536) |+.+. +.++.+..++.+..+..+.. .|.....+-.+|..- .-.+.... .+..+.+.-+...-.++...+. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G--~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~ 85 (714) T protein:vir:32 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDG--DQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGM 85 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcC--CCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhH Confidence 66643 23334455556666666543 333333366666631 11111110 0112223233333333333222 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEE--EEEecC Q lcl|NC_011045. 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVL--LYLPEP 150 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~--l~~~~~ 150 (536) . -=+++=+++.+.+..- ...++-+ .++..+......+++..+...++.+.++.|-|. 
+|++.+ T Consensus 86 ~----~~nr~~~~v~p~~~~~--------~~~~~Ae---~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d 150 (714) T protein:vir:32 86 E----AKTRTDLVVMSDEPDD--------ETEKLAE---AINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSD 150 (714) T ss_pred H----HhCCcceEEecCCCCc--------hhHHHHH---HHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccC Confidence 2 2233333444432110 0011222 223344455557899999999999999988887 577777 Q ss_pred CCCceeeEEEEecceEEEeeCCCC----CeEEEEEeEeccHHHHHHHHhHHhh---hc---cc----------------- Q lcl|NC_011045. 151 EGSNYNPMKLYRLSSYVVQRDAFG----NVLQMVTRDQIAFGALPEDIRKAVE---GQ---GG----------------- 203 (536) Q Consensus 151 ~~~~~~~~~~~~l~~~~v~~d~~G----~v~~i~r~~~~t~~~l~~~~~~~~~---~~---~~----------------- 203 (536) ..++.++++.+|..+++++.++.- .-.-+|++..++.+++...|++... .. +. T Consensus 151 ~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~ 230 (714) T protein:vir:32 151 PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMS 230 (714) T ss_pred CCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccccccccc Confidence 777888999999999999886532 2224788999999999888875221 00 00 Q ss_pred --------c-------CCCCceEEEEEEEEecC--------CCCce----------------------------eEEEEe Q lcl|NC_011045. 204 --------E-------KKADETIDVYTHIYLDE--------DSGEY----------------------------IRYEEV 232 (536) Q Consensus 204 --------~-------~~~~~~~~v~~~v~p~~--------~~~~~----------------------------~~~~~v 232 (536) . .....+|.|+.|.+... .++.. ++...+ T Consensus 231 ~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~ 310 (714) T protein:vir:32 231 AWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF 310 (714) T ss_pred chhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE Confidence 0 00124566666654321 11111 122334 Q ss_pred cCcccccc-ccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchh Q lcl|NC_011045. 
233 EGMEVQGS-DGTYPKEACPYIPIRMVR--LDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPR 309 (536) Q Consensus 233 ~g~~i~~~-~~~~~~~~~P~~~~rw~~--~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~ 309 (536) .|..++.. .+.|+...|||++.-... ..|..| |.+..+.+-++.+|+..-..+.+ +..+-+++ .++++.... T Consensus 311 ~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~~-~~~a~~~~d 385 (714) T protein:vir:32 311 VGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVIM-DEDATQLSD 385 (714) T ss_pred ecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCceee-ecCcccccH Confidence 56666543 345666789998764443 557777 68889999999999866555443 45666654 455554432 Q ss_pred -hhc--cCCCcceecCCcc---ccc---ccc-cccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHHHH Q lcl|NC_011045. 310 -RLT--KAQTGDFVTGRPE---DIS---FLQ-LEKQADFTVAKAVSDAIEARLSFAF-MLNSAVQRTGERVTAEEIRYVA 378 (536) Q Consensus 310 -~~~--~~~~g~~~~g~~~---~~~---~~~-~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~TAtEi~~r~ 378 (536) .+. .+.+|.++.-+++ ... .+. ...+.-.+.....++...+.|++.- ..+.+....+...+..-|..|+ T Consensus 386 ~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq 465 (714) T protein:vir:32 386 NDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLV 465 (714) T ss_pred HHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHH Confidence 222 2566776644332 111 111 1222223444455555555554432 1122222344456777799999 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-------------CCCC-----CCC--------Ccc-----eEEEE- Q lcl|NC_011045. 379 SELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-------------QQIP-----ELP--------KEA-----VEPTI- 426 (536) Q Consensus 379 ~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-------------g~lp-----~~~--------~~~-----v~v~~- 426 (536) +.....|+..+.+|..-+.. +.+.++.++.+. +... .+. 
..+ +.|.+ T Consensus 466 ~qg~~~l~~~~Dnl~~~~~~-~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~ 544 (714) T protein:vir:32 466 EQGATTLAEINDNYQFACQQ-VGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALA 544 (714) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEe Confidence 99999999999887765433 234444444331 1100 000 011 22333 Q ss_pred echHHHHHHHHHHHHHHHHHHHHHh----hcchhhhhcC---CHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHH Q lcl|NC_011045. 427 STGLEAIGRGQDLDKLERCVAAWAA----LAPMRDDPDI---NLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQM 499 (536) Q Consensus 427 vs~La~a~r~~~~~~l~~~~~~~~~----~~p~~~~~~i---d~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~ 499 (536) .+|-...+|.+.++.++++++.+.. +.+..+.... +.+++++.+-+.+|.....=-.++++.++..++++.++ T Consensus 545 ~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~ 624 (714) T protein:vir:32 545 PVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQ 624 (714) T ss_pred eccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHH Confidence 3455667788888888888765321 1122121223 56889999999888632111122222111111111111 Q ss_pred HHH--H---HHH------HHHHHHHHh----------------hhcCcchHHhhhhcC---CCCC-----CC Q lcl|NC_011045. 500 GMD--N---GAA------ALAQGMAAQ----------------ATASPEAMAAAADSV---GLQP-----GI 536 (536) Q Consensus 500 ~~~--~---~a~------~~~~~~~~~----------------~~~~~~~~~~~~~~~---~~q~-----~~ 536 (536) +++ + ..+ +.++.+.++ ......+..++..+. 
++|+ ++ T Consensus 625 ~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~ 696 (714) T protein:vir:32 625 QQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDV 696 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHH Confidence 000 0 000 000000000 000000111110000 0000 01 No 37 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.75 E-value=1.1e-15 Score=102.56 Aligned_cols=513 Identities=12% Similarity=0.065 Sum_probs=248.7 Q ss_pred CCCcc-ccccHHHHHHHHHHHHHHhh---hHHHHHHHHHHHhcccccCCCCCcc----cccccccccchHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKR-TGLAEEGAKSVYERLKNDRA---PYETRAQNCAQYTIPSLFPKDSDNA----STDYVTPWQAVGARGLNNLASK 72 (536) Q Consensus 1 Ma~~~-~~~~~~~~~~r~~~l~~~R~---~~e~~w~e~~~~~~P~~~~~~~~~~----~~~~~~~~dst~~~a~~~Laa~ 72 (536) |+.+. +.++.+..++.+..+..+.. .|.....+-.+|..- .-.+.... .+..+.+.-+...-.++...+. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G--~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~ 85 (714) T protein:vir:27 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDG--DQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGM 85 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcC--CCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhH Confidence 66643 23334455556666666543 333333366666631 11111110 0112223233333333333222 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEE--EEEecC Q lcl|NC_011045. 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVL--LYLPEP 150 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~--l~~~~~ 150 (536) . -=+++=+++.+.+..- ...++-+ .++..+......+++..+...++.+.++.|-|. 
+|++.+ T Consensus 86 ~----~~nr~~~~v~p~~~~~--------~~~~~Ae---~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d 150 (714) T protein:vir:27 86 E----AKTRTDLVVMSDEPDD--------ETEKLAE---AINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSD 150 (714) T ss_pred H----HhCCcceEEecCCCCc--------hhHHHHH---HHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccC Confidence 2 2233333444432110 0011222 223344455557899999999999999988887 577777 Q ss_pred CCCceeeEEEEecceEEEeeCCCC----CeEEEEEeEeccHHHHHHHHhHHhh---hc---cc----------------- Q lcl|NC_011045. 151 EGSNYNPMKLYRLSSYVVQRDAFG----NVLQMVTRDQIAFGALPEDIRKAVE---GQ---GG----------------- 203 (536) Q Consensus 151 ~~~~~~~~~~~~l~~~~v~~d~~G----~v~~i~r~~~~t~~~l~~~~~~~~~---~~---~~----------------- 203 (536) ..++.++++.+|..+++++.++.- .-.-+|++..++.+++...|++... .. +. T Consensus 151 ~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~ 230 (714) T protein:vir:27 151 PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMS 230 (714) T ss_pred CCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccccccccc Confidence 777888999999999999886532 2224788999999999888875221 00 00 Q ss_pred --------c-------CCCCceEEEEEEEEecC--------CCCce----------------------------eEEEEe Q lcl|NC_011045. 204 --------E-------KKADETIDVYTHIYLDE--------DSGEY----------------------------IRYEEV 232 (536) Q Consensus 204 --------~-------~~~~~~~~v~~~v~p~~--------~~~~~----------------------------~~~~~v 232 (536) . .....+|.|+.|.+... .++.. ++...+ T Consensus 231 ~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~ 310 (714) T protein:vir:27 231 AWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF 310 (714) T ss_pred chhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE Confidence 0 00124566666654321 11111 122334 Q ss_pred cCcccccc-ccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchh Q lcl|NC_011045. 
233 EGMEVQGS-DGTYPKEACPYIPIRMVR--LDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPR 309 (536) Q Consensus 233 ~g~~i~~~-~~~~~~~~~P~~~~rw~~--~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~ 309 (536) .|..++.. .+.|+...|||++.-... ..|..| |.+..+.+-++.+|+..-..+.+ +..+-+++ .++++.... T Consensus 311 ~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~~-~~~a~~~~d 385 (714) T protein:vir:27 311 VGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVIM-DEDATQLSD 385 (714) T ss_pred ecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCceee-ecCcccccH Confidence 56666543 345666789998764443 557777 68889999999999866555443 45666654 455554432 Q ss_pred -hhc--cCCCcceecCCcc---ccc---ccc-cccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHHHH Q lcl|NC_011045. 310 -RLT--KAQTGDFVTGRPE---DIS---FLQ-LEKQADFTVAKAVSDAIEARLSFAF-MLNSAVQRTGERVTAEEIRYVA 378 (536) Q Consensus 310 -~~~--~~~~g~~~~g~~~---~~~---~~~-~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~TAtEi~~r~ 378 (536) .+. .+.+|.++.-+++ ... .+. ...+.-.+.....++...+.|++.- ..+.+....+...+..-|..|+ T Consensus 386 ~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq 465 (714) T protein:vir:27 386 NDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLV 465 (714) T ss_pred HHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHH Confidence 222 2566776644332 111 111 1222223444455555555554432 1122222344456777799999 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-------------CCCC-----CCC--------Ccc-----eEEEE- Q lcl|NC_011045. 379 SELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-------------QQIP-----ELP--------KEA-----VEPTI- 426 (536) Q Consensus 379 ~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-------------g~lp-----~~~--------~~~-----v~v~~- 426 (536) +.....|+..+.+|..-+.. +.+.++.++.+. +... .+. 
..+ +.|.+ T Consensus 466 ~qg~~~l~~~~Dnl~~~~~~-~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~ 544 (714) T protein:vir:27 466 EQGATTLAEINDNYQFACQQ-VGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALA 544 (714) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEe Confidence 99999999999887765433 234444444331 1100 000 011 22333 Q ss_pred echHHHHHHHHHHHHHHHHHHHHHh----hcchhhhhcC---CHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHH Q lcl|NC_011045. 427 STGLEAIGRGQDLDKLERCVAAWAA----LAPMRDDPDI---NLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQM 499 (536) Q Consensus 427 vs~La~a~r~~~~~~l~~~~~~~~~----~~p~~~~~~i---d~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~ 499 (536) .+|-...+|.+.++.++++++.+.. +.+..+.... +.+++++.+-+.+|.....=-.++++.++..++++.++ T Consensus 545 ~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~ 624 (714) T protein:vir:27 545 PVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQ 624 (714) T ss_pred eccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHH Confidence 3455667788888888888765321 1122121223 56889999999888632111122222111111111111 Q ss_pred HHH--H---HHH------HHHHHHHHh----------------hhcCcchHHhhhhcC---CCCC-----CC Q lcl|NC_011045. 500 GMD--N---GAA------ALAQGMAAQ----------------ATASPEAMAAAADSV---GLQP-----GI 536 (536) Q Consensus 500 ~~~--~---~a~------~~~~~~~~~----------------~~~~~~~~~~~~~~~---~~q~-----~~ 536 (536) +++ + ..+ +.++.+.++ ......+..++..+. 
++|+ ++ T Consensus 625 ~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~ 696 (714) T protein:vir:27 625 QQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDV 696 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHH Confidence 000 0 000 000000000 000000111110000 0000 01 No 38 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.75 E-value=1.1e-15 Score=102.56 Aligned_cols=513 Identities=12% Similarity=0.065 Sum_probs=248.7 Q ss_pred CCCcc-ccccHHHHHHHHHHHHHHhh---hHHHHHHHHHHHhcccccCCCCCcc----cccccccccchHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKR-TGLAEEGAKSVYERLKNDRA---PYETRAQNCAQYTIPSLFPKDSDNA----STDYVTPWQAVGARGLNNLASK 72 (536) Q Consensus 1 Ma~~~-~~~~~~~~~~r~~~l~~~R~---~~e~~w~e~~~~~~P~~~~~~~~~~----~~~~~~~~dst~~~a~~~Laa~ 72 (536) |+.+. +.++.+..++.+..+..+.. .|.....+-.+|..- .-.+.... .+..+.+.-+...-.++...+. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G--~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~ 85 (714) T protein:vir:10 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDG--DQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLGM 85 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcC--CCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHhH Confidence 66643 23334455556666666543 333333366666631 11111110 0112223233333333333222 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEE--EEEecC Q lcl|NC_011045. 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVL--LYLPEP 150 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~--l~~~~~ 150 (536) . -=+++=+++.+.+..- ...++-+ .++..+......+++..+...++.+.++.|-|. 
+|++.+ T Consensus 86 ~----~~nr~~~~v~p~~~~~--------~~~~~Ae---~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d 150 (714) T protein:vir:10 86 E----AKTRTDLVVMSDEPDD--------ETEKLAE---AINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSD 150 (714) T ss_pred H----HhCCcceEEecCCCCc--------hhHHHHH---HHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccC Confidence 2 2233333444432110 0011222 223344455557899999999999999988887 577777 Q ss_pred CCCceeeEEEEecceEEEeeCCCC----CeEEEEEeEeccHHHHHHHHhHHhh---hc---cc----------------- Q lcl|NC_011045. 151 EGSNYNPMKLYRLSSYVVQRDAFG----NVLQMVTRDQIAFGALPEDIRKAVE---GQ---GG----------------- 203 (536) Q Consensus 151 ~~~~~~~~~~~~l~~~~v~~d~~G----~v~~i~r~~~~t~~~l~~~~~~~~~---~~---~~----------------- 203 (536) ..++.++++.+|..+++++.++.- .-.-+|++..++.+++...|++... .. +. T Consensus 151 ~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~ 230 (714) T protein:vir:10 151 PFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMS 230 (714) T ss_pred CCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccccccccc Confidence 777888999999999999886532 2224788999999999888875221 00 00 Q ss_pred --------c-------CCCCceEEEEEEEEecC--------CCCce----------------------------eEEEEe Q lcl|NC_011045. 204 --------E-------KKADETIDVYTHIYLDE--------DSGEY----------------------------IRYEEV 232 (536) Q Consensus 204 --------~-------~~~~~~~~v~~~v~p~~--------~~~~~----------------------------~~~~~v 232 (536) . .....+|.|+.|.+... .++.. ++...+ T Consensus 231 ~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~ 310 (714) T protein:vir:10 231 AWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWF 310 (714) T ss_pred chhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEE Confidence 0 00124566666654321 11111 122334 Q ss_pred cCcccccc-ccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchh Q lcl|NC_011045. 
233 EGMEVQGS-DGTYPKEACPYIPIRMVR--LDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPR 309 (536) Q Consensus 233 ~g~~i~~~-~~~~~~~~~P~~~~rw~~--~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~ 309 (536) .|..++.. .+.|+...|||++.-... ..|..| |.+..+.+-++.+|+..-..+.+ +..+-+++ .++++.... T Consensus 311 ~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~~~-~~~a~~~~d 385 (714) T protein:vir:10 311 VGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRVIM-DEDATQLSD 385 (714) T ss_pred ecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHHh--hcCCceee-ecCcccccH Confidence 56666543 345666789998764443 557777 68889999999999866555443 45666654 455554432 Q ss_pred -hhc--cCCCcceecCCcc---ccc---ccc-cccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHHHH Q lcl|NC_011045. 310 -RLT--KAQTGDFVTGRPE---DIS---FLQ-LEKQADFTVAKAVSDAIEARLSFAF-MLNSAVQRTGERVTAEEIRYVA 378 (536) Q Consensus 310 -~~~--~~~~g~~~~g~~~---~~~---~~~-~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~TAtEi~~r~ 378 (536) .+. .+.+|.++.-+++ ... .+. ...+.-.+.....++...+.|++.- ..+.+....+...+..-|..|+ T Consensus 386 ~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq 465 (714) T protein:vir:10 386 NDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLV 465 (714) T ss_pred HHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHH Confidence 222 2566776644332 111 111 1222223444455555555554432 1122222344456777799999 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-------------CCCC-----CCC--------Ccc-----eEEEE- Q lcl|NC_011045. 379 SELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-------------QQIP-----ELP--------KEA-----VEPTI- 426 (536) Q Consensus 379 ~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-------------g~lp-----~~~--------~~~-----v~v~~- 426 (536) +.....|+..+.+|..-+.. +.+.++.++.+. +... .+. 
..+ +.|.+ T Consensus 466 ~qg~~~l~~~~Dnl~~~~~~-~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~ 544 (714) T protein:vir:10 466 EQGATTLAEINDNYQFACQQ-VGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALA 544 (714) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEe Confidence 99999999999887765433 234444444331 1100 000 011 22333 Q ss_pred echHHHHHHHHHHHHHHHHHHHHHh----hcchhhhhcC---CHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHH Q lcl|NC_011045. 427 STGLEAIGRGQDLDKLERCVAAWAA----LAPMRDDPDI---NLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQM 499 (536) Q Consensus 427 vs~La~a~r~~~~~~l~~~~~~~~~----~~p~~~~~~i---d~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~ 499 (536) .+|-...+|.+.++.++++++.+.. +.+..+.... +.+++++.+-+.+|.....=-.++++.++..++++.++ T Consensus 545 ~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~ 624 (714) T protein:vir:10 545 PVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQ 624 (714) T ss_pred eccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHH Confidence 3455667788888888888765321 1122121223 56889999999888632111122222111111111111 Q ss_pred HHH--H---HHH------HHHHHHHHh----------------hhcCcchHHhhhhcC---CCCC-----CC Q lcl|NC_011045. 500 GMD--N---GAA------ALAQGMAAQ----------------ATASPEAMAAAADSV---GLQP-----GI 536 (536) Q Consensus 500 ~~~--~---~a~------~~~~~~~~~----------------~~~~~~~~~~~~~~~---~~q~-----~~ 536 (536) +++ + ..+ +.++.+.++ ......+..++..+. 
++|+ ++ T Consensus 625 ~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~~~~~~~~~~a~~a~~~~~~~~~~~~~~~ 696 (714) T protein:vir:10 625 QQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQDV 696 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhhhhHH Confidence 000 0 000 000000000 000000111110000 0000 01 No 39 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=99.71 E-value=8e-16 Score=103.36 Aligned_cols=511 Identities=11% Similarity=0.020 Sum_probs=243.7 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccc-cc-cccc-ccchHHHHHHHHHHHHHHhh Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS-TD-YVTP-WQAVGARGLNNLASKLMLAL 77 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~-~~-~~~~-~dst~~~a~~~Laa~l~~~l 77 (536) ||+++.. -+.|..+|....+.-+.|.....+=.+|..- .-.+..... .+ ..++ |+- ..-.++.+ .+.- T Consensus 1 m~d~~~~--~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G--~QW~~~~~~~l~~q~rp~~N~-i~~~v~~v----~g~e 71 (725) T protein:vir:10 1 MADNENR--LESILSRFDADWTASDEARREAKNDLFFSRV--SQWDDWLSQYTTLQYRGQFDV-VRPVVRKL----VSEM 71 (725) T ss_pred CCchHHH--HHHHHHHHHHHHHhhHHHHHHHHHHHHhhcC--CCCCHHHHHHHHhcCCCcccc-hHHHHHHH----HhhH Confidence 9996532 4566677777766666666666666667631 111111100 00 1111 211 12222222 2211 Q ss_pred cCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEE-----EEecCCC Q lcl|NC_011045. 78 FPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLL-----YLPEPEG 152 (536) Q Consensus 78 tP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l-----~~~~~~~ 152 (536) -=+++=+++.+.++.- . ++-+.|+ ..+......+++..+...++.+.++.|-|++ |.++++. 
T Consensus 72 ~~nr~d~~v~p~~~~d-------~---~~Ae~l~---~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~ 138 (725) T protein:vir:10 72 RQNPIDVLYRPKDGAS-------P---DAADVLM---GMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPT 138 (725) T ss_pred HhCCcceEEecCCcch-------H---HHHHHHH---HHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCC Confidence 1133433444443211 1 1222232 2333344578999999999999999999975 3334444 Q ss_pred CceeeEEEEe----cceEEEeeCCC-CC---eEEEEEeEeccHHH---HHHHHhHHhhh--ccc---cC----CCCceEE Q lcl|NC_011045. 153 SNYNPMKLYR----LSSYVVQRDAF-GN---VLQMVTRDQIAFGA---LPEDIRKAVEG--QGG---EK----KADETID 212 (536) Q Consensus 153 ~~~~~~~~~~----l~~~~v~~d~~-G~---v~~i~r~~~~t~~~---l~~~~~~~~~~--~~~---~~----~~~~~~~ 212 (536) ...+..+.++ ..+++++.++. .. -.-+||...|+... +++.++..... .+. .. ...+.|. T Consensus 139 ~~~~~i~~~~i~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vr 218 (725) T protein:vir:10 139 SNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQ 218 (725) T ss_pred CCceeeeeeecccCHhHcccCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEE Confidence 4444444443 45577776542 12 22356777787543 33344322111 000 00 0123444 Q ss_pred EEEEEEec-----------CCCCc-------------------------------eeEEEE-ecCccccccccccccccC Q lcl|NC_011045. 213 VYTHIYLD-----------EDSGE-------------------------------YIRYEE-VEGMEVQGSDGTYPKEAC 249 (536) Q Consensus 213 v~~~v~p~-----------~~~~~-------------------------------~~~~~~-v~g~~i~~~~~~~~~~~~ 249 (536) |+.+.+.+ +.++. +++|.. +.|..++...+.++.+.| T Consensus 219 v~E~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~f 298 (725) T protein:vir:10 219 IAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHI 298 (725) T ss_pred EEEEEEEEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCce Confidence 44433322 11111 122222 356666555556666789 Q ss_pred ceEEEeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCcc-ee-----c Q lcl|NC_011045. 
250 PYIPIRMV--RLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGD-FV-----T 321 (536) Q Consensus 250 P~~~~rw~--~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~-~~-----~ 321 (536) ||+++-.. ..+|..|+.|.+....+-++.+|++....+..+..+.+.++++..+.+-..........+. ++ . T Consensus 299 P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~~~~~ 378 (725) T protein:vir:10 299 PIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTD 378 (725) T ss_pred eEEEEEeeeeccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCceeeeccccc Confidence 99975323 3689999999999999999999999999999999999999998776554333322221121 11 1 Q ss_pred CCcccc--ccc-ccccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHH Q lcl|NC_011045. 322 GRPEDI--SFL-QLEKQADFTVAKAVSDAIEARLSFAF-MLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQ 397 (536) Q Consensus 322 g~~~~~--~~~-~~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l 397 (536) ...+.+ +++ ....+.-.+.....++...+.|.+.- ..+.+....+..++.--|..|++.....|..++.+|..-.. T Consensus 379 ~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~ 458 (725) T protein:vir:10 379 ENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMR 458 (725) T ss_pred ccCcccccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111211 111 11222223345566666666666543 12222223444566777999999999999988888776442 Q ss_pred HHHHHHHHHHHHh-------------cCCC--CCCCC-----------------cceEEEE-echHHHHHHHHHHHHHHH Q lcl|NC_011045. 398 LPLVRVLLKQLQA-------------TQQI--PELPK-----------------EAVEPTI-STGLEAIGRGQDLDKLER 444 (536) Q Consensus 398 ~Pli~r~~~il~~-------------~g~l--p~~~~-----------------~~v~v~~-vs~La~a~r~~~~~~l~~ 444 (536) . +-+.++++..+ .|.. ..+.. 
..+.|.+ ++|-...+|.+.+..+++ T Consensus 459 ~-~g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~q 537 (725) T protein:vir:10 459 R-DGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILE 537 (725) T ss_pred H-HHHHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHH Confidence 2 22333333222 2211 00110 0133433 456778889888888888 Q ss_pred HHHHHHhhcch---hhhh---cC---CHHHHHHHHHHHcCCChhhcc--CCHHHHHHHHHHHHHHHHHHH-----HHHHH Q lcl|NC_011045. 445 CVAAWAALAPM---RDDP---DI---NLAMIKLRIANAIGIDTSGIL--LTEEQKQQKMAQQSMQMGMDN-----GAAAL 508 (536) Q Consensus 445 ~~~~~~~~~p~---~~~~---~i---d~d~~~~~~a~~~Gv~p~~i~--rs~~ev~~~~~q~~~q~~~~~-----~a~~~ 508 (536) +++.+....|. .++. .. ..+++++.+....+ +.... .++++.+++.+++++++++++ +.+.. T Consensus 538 ll~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~--~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~~ 615 (725) T protein:vir:10 538 LLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLI--QMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVL 615 (725) T ss_pred HHHhccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhh--hhccCCccccchhHHHHHHHHHHHhhhHHHHHHHHHHH Confidence 88776544332 2222 22 23556666555432 11111 123332222221111111110 00000 Q ss_pred HH------HHHHh-------h-hcCcchH-------HhhhhcCCCCC-----CC Q lcl|NC_011045. 509 AQ------GMAAQ-------A-TASPEAM-------AAAADSVGLQP-----GI 536 (536) Q Consensus 509 ~~------~~~~~-------~-~~~~~~~-------~~~~~~~~~q~-----~~ 536 (536) +. .+.++ + ....++. 
+....+.+.|- ++ T Consensus 616 ~~~qae~~ka~aE~~k~~~~a~~~~~~a~~~a~~~~~~~~q~~~~q~~~~~~~~ 669 (725) T protein:vir:10 616 LQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFL 669 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHH Confidence 00 00000 0 0000000 00000111111 00 No 40 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.71 E-value=9e-16 Score=103.06 Aligned_cols=512 Identities=12% Similarity=0.037 Sum_probs=241.5 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccc-cc-cccc-ccchHHHHHHHHHHHHHHhh Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS-TD-YVTP-WQAVGARGLNNLASKLMLAL 77 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~-~~-~~~~-~dst~~~a~~~Laa~l~~~l 77 (536) ||+.+.. -+.|..+|....+....|.....+=.+|..- .-.+..... .+ ..++ |+-+ .-.++.+.+.- T Consensus 1 m~d~~~~--~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G--~Qw~~~~~~~l~~q~rp~~N~i-~~~i~~v~g~~---- 71 (725) T protein:vir:77 1 MADNENR--LESILSRFDADWTASDEARREAKNDLFFSRV--SQWDDWLSQYTTLQYRGQFDVV-RPVVRKLVSEM---- 71 (725) T ss_pred CCchHHH--HHHHHHHHHHHHHhhHHHHHHHHHHHHhhCC--CCCCHHHHHHHHhcCCCccccH-HHHHHHHHhhH---- Confidence 9996532 4567777777777777777777666677632 111111000 00 1111 2111 12222222211 Q ss_pred cCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEE-----EEecCCC Q lcl|NC_011045. 78 FPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLL-----YLPEPEG 152 (536) Q Consensus 78 tP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l-----~~~~~~~ 152 (536) -=+++=+++.+.++.- . ++-+.|+ ..+......|++..+...++.+.++.|.|++ |.++++. 
T Consensus 72 ~~nr~d~~v~P~~~~d-------~---~~Ae~l~---~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~ 138 (725) T protein:vir:77 72 RQNPIDVLYRPKDGAR-------P---DAADVLM---GMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPT 138 (725) T ss_pred HhCCcceEEecCCccH-------H---HHHHHHH---HHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCC Confidence 1133433444443211 1 1222232 2333344578999999999999999999975 3334444 Q ss_pred CceeeEEEEe----cceEEEeeCCCC-Ce-E--EEEEeEeccHHH---HHHHHhHHhhhccc-----cC----CCCceEE Q lcl|NC_011045. 153 SNYNPMKLYR----LSSYVVQRDAFG-NV-L--QMVTRDQIAFGA---LPEDIRKAVEGQGG-----EK----KADETID 212 (536) Q Consensus 153 ~~~~~~~~~~----l~~~~v~~d~~G-~v-~--~i~r~~~~t~~~---l~~~~~~~~~~~~~-----~~----~~~~~~~ 212 (536) ...+..+.++ ..+++++.++.- .. | -+||...|+.+. +.++++........ .. ...+.|. T Consensus 139 ~~~~~i~~~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vr 218 (725) T protein:vir:77 139 SNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQ 218 (725) T ss_pred CCceeeEEeecccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeE Confidence 4444445554 445677666431 11 1 256777788764 33444432211100 00 0123444 Q ss_pred EEEEEEecC-----------CCCc-------------------------------eeEE-EEecCccccccccccccccC Q lcl|NC_011045. 213 VYTHIYLDE-----------DSGE-------------------------------YIRY-EEVEGMEVQGSDGTYPKEAC 249 (536) Q Consensus 213 v~~~v~p~~-----------~~~~-------------------------------~~~~-~~v~g~~i~~~~~~~~~~~~ 249 (536) |+.+.++++ .++. +++| .-+.|..++...+.++.+.| T Consensus 219 v~E~~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~ 298 (725) T protein:vir:77 219 IAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHI 298 (725) T ss_pred EEEEEEEEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCcc Confidence 444433221 1111 1122 22356666555556777789 Q ss_pred ceEEEe--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccC--CCcce----ec Q lcl|NC_011045. 
250 PYIPIR--MVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKA--QTGDF----VT 321 (536) Q Consensus 250 P~~~~r--w~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~--~~g~~----~~ 321 (536) ||+++- ....+|..|+.|.+....+-++.+|++....+..+..+.+-++++..+.+-........ +..++ +. T Consensus 299 P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 378 (725) T protein:vir:77 299 PIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTD 378 (725) T ss_pred ceEEEeeeeeccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHhccCCceecccccc Confidence 999653 23478999999999999999999999999999999999999988876654333322211 11111 12 Q ss_pred CCcccc--cccccccccch-hHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHH Q lcl|NC_011045. 322 GRPEDI--SFLQLEKQADF-TVAKAVSDAIEARLSFAF-MLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQ 397 (536) Q Consensus 322 g~~~~~--~~~~~~~~~~~-~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l 397 (536) ...+.+ +++......++ +.....++...+.|.+.- ..+.+....+..++.--|..|++.....+...+.+|..-.. T Consensus 379 ~~~g~~~~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~ 458 (725) T protein:vir:77 379 ENSGDLPTQPLAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMR 458 (725) T ss_pred cCCCcccccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222221 11111112222 344455555555565542 22222223333566777889999999999888877655432 Q ss_pred ------HHHHHHHH------HHHHhcCCC--CCCC--------C---------cceEEEE-echHHHHHHHHHHHHHHHH Q lcl|NC_011045. 398 ------LPLVRVLL------KQLQATQQI--PELP--------K---------EAVEPTI-STGLEAIGRGQDLDKLERC 445 (536) Q Consensus 398 ------~Pli~r~~------~il~~~g~l--p~~~--------~---------~~v~v~~-vs~La~a~r~~~~~~l~~~ 445 (536) .-||...+ .|+...|.. ..+. 
| ..+.|.+ ++|-...+|.+.+..+.++ T Consensus 459 ~~g~~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql 538 (725) T protein:vir:77 459 RDGEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILEL 538 (725) T ss_pred HHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHH Confidence 22332222 222222211 0011 0 0133433 4567778899989999988 Q ss_pred HHHHHhhcch---hhhh---cCCH---HHHHHHHHHHcCCChhhcc--CCHHHHHHHHHHHHHHHHHHH-----HHHHHH Q lcl|NC_011045. 446 VAAWAALAPM---RDDP---DINL---AMIKLRIANAIGIDTSGIL--LTEEQKQQKMAQQSMQMGMDN-----GAAALA 509 (536) Q Consensus 446 ~~~~~~~~p~---~~~~---~id~---d~~~~~~a~~~Gv~p~~i~--rs~~ev~~~~~q~~~q~~~~~-----~a~~~~ 509 (536) ++.+..+.|. .++. ..|. +++.+.+..... +.... .++++.+...+++++++++++ +++..+ T Consensus 539 l~~~~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~--~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~~ 616 (725) T protein:vir:77 539 LGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLI--QMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLL 616 (725) T ss_pred HHhccccchhHHHHHHHhhccccchHHHHHHHHHHhhhh--hhhccCCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHHH Confidence 8876654443 2222 2343 445555544332 22222 222222211111111111110 000000 Q ss_pred HH------HHH-------hh-hcCcchHHhhh-------hcCCCC-----CCC Q lcl|NC_011045. 510 QG------MAA-------QA-TASPEAMAAAA-------DSVGLQ-----PGI 536 (536) Q Consensus 510 ~~------~~~-------~~-~~~~~~~~~~~-------~~~~~q-----~~~ 536 (536) .. ... ++ ....++..++. 
.....| .++ T Consensus 617 ~~qa~~~kaq~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~a~~~~~~ 669 (725) T protein:vir:77 617 QGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFL 669 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHH Confidence 00 000 00 00000000000 000001 111 No 41 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=99.69 E-value=2.4e-15 Score=100.77 Aligned_cols=511 Identities=12% Similarity=0.027 Sum_probs=246.5 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccc-cc-ccc-cccchHHHHHHHHHHHHHHhh Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS-TD-YVT-PWQAVGARGLNNLASKLMLAL 77 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~-~~-~~~-~~dst~~~a~~~Laa~l~~~l 77 (536) ||+++. .-+.|..+|....+....|.....+=.+|..- .-.+..... .+ ..+ .|+-++. .++. +.+.- T Consensus 1 m~d~~~--~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G--~Qw~~~~~~~l~~q~rp~~N~i~~-~i~~----v~g~e 71 (725) T protein:vir:92 1 MADNEN--RLESILSRFDADWTASDEARREAKNDLFFSRI--SQWDDWLSQYTTLQYRGQFDVVRP-VVRK----LVSEM 71 (725) T ss_pred CCchHH--HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcC--CCCCHHHHHHHHhcCCCcccchHH-HHHH----HHhhH Confidence 999654 24667788887777777777777777777742 111111000 00 111 1222221 2222 22211 Q ss_pred cCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEE-----EEecCCC Q lcl|NC_011045. 78 FPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLL-----YLPEPEG 152 (536) Q Consensus 78 tP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l-----~~~~~~~ 152 (536) -=+++=+++.+.++.- . ++-+.|+ ..+......|+...+...++.+.++.|.|++ |.++++. 
T Consensus 72 ~~nr~d~~v~P~~~~d-------~---~~Ae~l~---~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~ 138 (725) T protein:vir:92 72 RQNPIDVLYRPKDGAS-------P---DAADVLM---GMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPT 138 (725) T ss_pred HhCCcceEEecCCccH-------H---HHHHHHH---HHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCC Confidence 1133433444443211 1 1222232 2333344579999999999999999999975 2334444 Q ss_pred CceeeEEEEe----cceEEEeeCCCC-Ce-E--EEEEeEeccHH---HHHHHHhHHhhhc-----cccC----CCCceEE Q lcl|NC_011045. 153 SNYNPMKLYR----LSSYVVQRDAFG-NV-L--QMVTRDQIAFG---ALPEDIRKAVEGQ-----GGEK----KADETID 212 (536) Q Consensus 153 ~~~~~~~~~~----l~~~~v~~d~~G-~v-~--~i~r~~~~t~~---~l~~~~~~~~~~~-----~~~~----~~~~~~~ 212 (536) ...+..+++| +.+++++.++.- .. | -+||...|+.+ ++.++++...... .... ...+.|. T Consensus 139 ~~~~~i~~~~i~~~~~~V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vr 218 (725) T protein:vir:92 139 SNNQVIRREPIHSACSHVIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQ 218 (725) T ss_pred CCceeeEEeeccCChhhcccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEE Confidence 4445555554 445667666431 11 1 25666777765 3444554321110 0000 0124455 Q ss_pred EEEEEEec-----------CCCCc-------------------------------eeEEE-EecCccccccccccccccC Q lcl|NC_011045. 213 VYTHIYLD-----------EDSGE-------------------------------YIRYE-EVEGMEVQGSDGTYPKEAC 249 (536) Q Consensus 213 v~~~v~p~-----------~~~~~-------------------------------~~~~~-~v~g~~i~~~~~~~~~~~~ 249 (536) |+.+.+.+ +.++. +++|. -+.|..++...+.++.+.| T Consensus 219 v~e~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~ 298 (725) T protein:vir:92 219 IAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHI 298 (725) T ss_pred EEEEEEEEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCce Confidence 54443322 11111 12222 2356666555556666789 Q ss_pred ceEEEee--eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCcc-e-----ec Q lcl|NC_011045. 
250 PYIPIRM--VRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGD-F-----VT 321 (536) Q Consensus 250 P~~~~rw--~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~-~-----~~ 321 (536) ||+++-. ...+|..|+.|.+....+-++.+|+..-..+..+..+.+.++++..+.+-..........+. + +. T Consensus 299 P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 378 (725) T protein:vir:92 299 PIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTD 378 (725) T ss_pred eeEEEEeeeeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCccceeeccccc Confidence 9997532 24689999999999999999999999999999999999999998776554333222211111 1 11 Q ss_pred CCcccc--cccc-cccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHH Q lcl|NC_011045. 322 GRPEDI--SFLQ-LEKQADFTVAKAVSDAIEARLSFAF-MLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQ 397 (536) Q Consensus 322 g~~~~~--~~~~-~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l 397 (536) ...+.+ +++. .....-.+.....++...+.|.+.- ..+......+..++.--|..|++.....|+..+.+|..-.. T Consensus 379 ~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~ 458 (725) T protein:vir:92 379 ENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMR 458 (725) T ss_pred cccccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111211 1111 1222223345556666666666543 22222333444567778999999999999988877765432 Q ss_pred HHHHHHHHHHHHh-------------cCCC--CCCC--------C---------cceEEEE-echHHHHHHHHHHHHHHH Q lcl|NC_011045. 398 LPLVRVLLKQLQA-------------TQQI--PELP--------K---------EAVEPTI-STGLEAIGRGQDLDKLER 444 (536) Q Consensus 398 ~Pli~r~~~il~~-------------~g~l--p~~~--------~---------~~v~v~~-vs~La~a~r~~~~~~l~~ 444 (536) . +-+.++++..+ .|.. ..+. 
| ..+.|.+ ++|-...+|.+.+..+.+ T Consensus 459 ~-~g~~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~q 537 (725) T protein:vir:92 459 R-DGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILE 537 (725) T ss_pred H-HHHHHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHHH Confidence 2 22333333222 2210 0010 0 0133333 466778889998898888 Q ss_pred HHHHHHhhcch---hhhh---cCCH---HHHHHHHHHHcCCChhhcc--CCHHHHHHHHHHHHHHHHHHH-----HHHHH Q lcl|NC_011045. 445 CVAAWAALAPM---RDDP---DINL---AMIKLRIANAIGIDTSGIL--LTEEQKQQKMAQQSMQMGMDN-----GAAAL 508 (536) Q Consensus 445 ~~~~~~~~~p~---~~~~---~id~---d~~~~~~a~~~Gv~p~~i~--rs~~ev~~~~~q~~~q~~~~~-----~a~~~ 508 (536) +++.+.++.|. .++. ..|. +++.+.+....+ +.... .++++.++..+++++++++++ +.+.. T Consensus 538 l~~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~--~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~qa~~ 615 (725) T protein:vir:92 538 LLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLI--QMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVL 615 (725) T ss_pred HHHhcccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhc--hhccCCccchhhhHHHHHHHHHHHhhhHHHHHHHHHHH Confidence 88776554443 2222 2333 455555554432 22221 123332222222111111000 00000 Q ss_pred HHHHH------Hh-------h-hcCcchHHhhhhc-------CCCCC-----CC Q lcl|NC_011045. 509 AQGMA------AQ-------A-TASPEAMAAAADS-------VGLQP-----GI 536 (536) Q Consensus 509 ~~~~~------~~-------~-~~~~~~~~~~~~~-------~~~q~-----~~ 536 (536) +..-+ .+ + ....++...+... 
...|- ++ T Consensus 616 ~~~qae~~kaqaE~~k~q~~a~~~~~~a~~~aa~~~~~~~q~~~~q~~~~~~~~ 669 (725) T protein:vir:92 616 LQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNMDLSKQSEFREFL 669 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHH Confidence 00000 00 0 0000000000000 00000 00 No 42 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.68 E-value=4.4e-15 Score=99.27 Aligned_cols=512 Identities=14% Similarity=0.078 Sum_probs=247.0 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccc----c----cccccccchHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS----T----DYVTPWQAVGARGLNNLASK 72 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~----~----~~~~~~dst~~~a~~~Laa~ 72 (536) ||++-..+ -+.+..||....+..+.|...+.+=.+|..=.-.-.+..... + ..+.+.-+...-.++...+. T Consensus 1 m~~~~~~~-~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~ 79 (708) T protein:vir:10 1 MAETLEKK-HERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAE 79 (708) T ss_pred CchhHHHH-HHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHHHHHHHH Confidence 99975433 466778888887777777777766555542100111111000 0 11222223333333333332 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEe---- Q lcl|NC_011045. 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLP---- 148 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~---- 148 (536) -. 
=+++=+++.+.++.- + .++-+.| +..+......++...+...++.+.++.|-|++-+- T Consensus 80 ~~----~nr~d~~v~P~~~~~-----d----~~~Ae~l---~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~ 143 (708) T protein:vir:10 80 YR----NNRITVKFRPGDREA-----S----EELANKL---NGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLV 143 (708) T ss_pred HH----hCCcceEEEcCCCCc-----h----HHHHHHH---HHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccc Confidence 21 233433444443211 0 0122222 33344444578999999999999999999977441 Q ss_pred -cC---CCCceeeEE--EEecceEEEeeCC---CC-CeEEEEEeEeccHHHHHHHHhHHhhhccccC------C---CCc Q lcl|NC_011045. 149 -EP---EGSNYNPMK--LYRLSSYVVQRDA---FG-NVLQMVTRDQIAFGALPEDIRKAVEGQGGEK------K---ADE 209 (536) Q Consensus 149 -~~---~~~~~~~~~--~~~l~~~~v~~d~---~G-~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~------~---~~~ 209 (536) +. .....+.++ ..|..+++++.++ ++ .-.-+||...|+.+++...|++......... . ..+ T Consensus 144 ~e~d~~~~~~~i~i~~~~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d 223 (708) T protein:vir:10 144 NEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGAD 223 (708) T ss_pred cccCCCCCccccceEEeecchhhcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCC Confidence 11 111223333 3355667776553 22 1224678889999999999986432211000 0 001 Q ss_pred eEEEEEE-----------EEecCCCC-------------------------------ceeEE-EEecCcccccccccccc Q lcl|NC_011045. 
210 TIDVYTH-----------IYLDEDSG-------------------------------EYIRY-EEVEGMEVQGSDGTYPK 246 (536) Q Consensus 210 ~~~v~~~-----------v~p~~~~~-------------------------------~~~~~-~~v~g~~i~~~~~~~~~ 246 (536) .+-|..+ +++++.++ +++++ ..+.|..++...+.+++ T Consensus 224 ~v~v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p~ 303 (708) T protein:vir:10 224 VIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPG 303 (708) T ss_pred ceEEEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCCC Confidence 1211111 11222111 11122 12356666655677888 Q ss_pred ccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhc------------ Q lcl|NC_011045. 247 EACPYIPIRMVR--LDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLT------------ 312 (536) Q Consensus 247 ~~~P~~~~rw~~--~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~------------ 312 (536) +.|||+++-+.+ .+|..++.|.+..+.+-++.+|+..-..+.++.++-+.+++++++.+.....-. T Consensus 304 ~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~~~ 383 (708) T protein:vir:10 304 EHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLP 383 (708) T ss_pred CceeeEEEeeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhhhc Confidence 999999874443 477887889999999999999999999999999999999999888765442211 Q ss_pred ----cCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHHHHHHHHHHhhh Q lcl|NC_011045. 313 ----KAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAF-MLNSAVQRTGERVTAEEIRYVASELEDTLGG 387 (536) Q Consensus 313 ----~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~ 387 (536) ....|.++++... +...+.+.-.....+.++...+.|.+.. ..+.+..+.+ .++..-|..|++.....|+. 
T Consensus 384 ~~~~~~~~G~~~~~~~~---~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~s-n~SG~aI~~rq~qg~~~l~~ 459 (708) T protein:vir:10 384 LREVRDKSGNIIAGATP---AGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPS-NIAQETVNNLMNRADMASFI 459 (708) T ss_pred cccccccccccccccCC---ccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCcc-chHHHHHHHHHHHHHHHHHH Confidence 1112222221110 0001111111223344444444454442 1222222223 46888899999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHh-------------cCC----------CCCCCCc-----ce-----EEEE-echHHHH Q lcl|NC_011045. 388 VYSILSQELQLPLVRVLLKQLQA-------------TQQ----------IPELPKE-----AV-----EPTI-STGLEAI 433 (536) Q Consensus 388 v~~rl~~E~l~Pli~r~~~il~~-------------~g~----------lp~~~~~-----~v-----~v~~-vs~La~a 433 (536) .+.+|..-... +-+.++.+..+ .|. .++-.+. ++ .|.+ .+|-... T Consensus 460 ~~Dnl~~~~~~-~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s 538 (708) T protein:vir:10 460 YLDNMAKSLKR-AGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTA 538 (708) T ss_pred HHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchh Confidence 99988754322 22233333222 221 0111111 11 2322 3567778 Q ss_pred HHHHHHHHHHHHHHHHHhhcchh------hhhcC---CHHHHHHHHHHHcCCChhhccCC--HHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 434 GRGQDLDKLERCVAAWAALAPMR------DDPDI---NLAMIKLRIANAIGIDTSGILLT--EEQKQQKMAQQSMQMGMD 502 (536) Q Consensus 434 ~r~~~~~~l~~~~~~~~~~~p~~------~~~~i---d~d~~~~~~a~~~Gv~p~~i~rs--~~ev~~~~~q~~~q~~~~ 502 (536) +|.+.++.++++++.+....|.. +.+.. +.++++..+-..++. + ..... 
+++.++..++++++++++ T Consensus 539 ~r~~~~~~l~qll~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~~~-~-~~~~~~~~ee~q~~~~~q~~~q~q~ 616 (708) T protein:vir:10 539 RRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLI-S-GIAKPRNEKEQQIVQQAQMAAQSQP 616 (708) T ss_pred HHHHHHHHHHHHHHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHhhcc-c-ccccccchhhHHHHHHHHHHHHHHH Confidence 88888888888887654322211 11223 456788887776654 2 22222 222221111111111000 Q ss_pred HHHH-----------HHHHHHHHhhh----c-------CcchHHhhhhcCCCCC------------CC Q lcl|NC_011045. 503 NGAA-----------ALAQGMAAQAT----A-------SPEAMAAAADSVGLQP------------GI 536 (536) Q Consensus 503 ~~a~-----------~~~~~~~~~~~----~-------~~~~~~~~~~~~~~q~------------~~ 536 (536) +++. ++++.+.+++. . ..++.+++.-....++ ++ T Consensus 617 ~~~~~e~qa~~~~~qAe~~ka~a~a~~~~~~a~q~~~~~~~a~~~a~q~~~~a~~~~~~~~~~~~q~l 684 (708) T protein:vir:10 617 NPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLL 684 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 0000 00000000000 0 0000000000000011 11 No 43 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.64 E-value=1.6e-13 Score=90.72 Aligned_cols=512 Identities=12% Similarity=0.062 Sum_probs=246.1 Q ss_pred CCCcccc--cc------HHHHHHHHHHHHHHhhhHHHHHH----HHHHHhcccccCCCCCccc----ccccccccchHHH Q lcl|NC_011045. 1 MAEKRTG--LA------EEGAKSVYERLKNDRAPYETRAQ----NCAQYTIPSLFPKDSDNAS----TDYVTPWQAVGAR 64 (536) Q Consensus 1 Ma~~~~~--~~------~~~~~~r~~~l~~~R~~~e~~w~----e~~~~~~P~~~~~~~~~~~----~~~~~~~dst~~~ 64 (536) |++..+. +. ++.....|..+..++. +.+.|+ +-.+|..- .-.+..... 
+..+.+.-+...- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~r~~a~~d~~fy~G--~Qw~~~~~~~l~~~g~p~~~~N~i~~ 77 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPRFSQRQLLSLCSDID-SQPLWRDAANKACAYYDG--DQLAPEVIQVLKDRGQPMTIHNLIAP 77 (714) T ss_pred CCcCcCcccCCCcchhhhhhhHHHHHHHHHHHh-hhHHHHHHHHHHHHhhcC--CCCCHHHHHHHHhcCCCcEEeccHHH Confidence 7764333 11 1222344555555543 344554 55555521 111111000 1112222222222 Q ss_pred HHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEE Q lcl|NC_011045. 65 GLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVL 144 (536) Q Consensus 65 a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~ 144 (536) .++... +..-=+++=++..+.+..-. -.++ .+.++..+......++...+...++.+.++.|-|. T Consensus 78 ~v~~v~----g~~~~nr~~~~v~pr~~~~~--------~~~~---Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~ 142 (714) T protein:vir:10 78 TVDGVL----GMEAKTRTDLIVMSDDPNDE--------TEKL---AEAINAEFADACRLGNMNKARSDAYAEQIKAGLSW 142 (714) T ss_pred HHHHHH----HHHHhCCcceEEecCCCChh--------hHHH---HHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccce Confidence 333222 22222333235554332110 0011 22334445555667899999999999999999888 Q ss_pred E--EEecCCCCceeeEEEEecceEEEeeCCCC-C---eEEEEEeEeccHHHHHHHHhHHhh---hc---c---------- Q lcl|NC_011045. 145 L--YLPEPEGSNYNPMKLYRLSSYVVQRDAFG-N---VLQMVTRDQIAFGALPEDIRKAVE---GQ---G---------- 202 (536) Q Consensus 145 l--~~~~~~~~~~~~~~~~~l~~~~v~~d~~G-~---v~~i~r~~~~t~~~l~~~~~~~~~---~~---~---------- 202 (536) + +++.+..++.++++.++..+++++.++.- . -.-+|++..|+.+++...|++... .. 
+ T Consensus 143 ~~~~~d~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~ 222 (714) T protein:vir:10 143 VEVRRNSEPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTE 222 (714) T ss_pred EEeeeccCCCCCCeEEEecChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhh Confidence 7 77777777888999999999999887532 2 223678899999999888875211 00 0 Q ss_pred ----------c-----c-------CCCCceEEEEEEEEecC--------CCCce-------------------------- Q lcl|NC_011045. 203 ----------G-----E-------KKADETIDVYTHIYLDE--------DSGEY-------------------------- 226 (536) Q Consensus 203 ----------~-----~-------~~~~~~~~v~~~v~p~~--------~~~~~-------------------------- 226 (536) . . ....++|.|+.|.+... .++.. T Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~ 302 (714) T protein:vir:10 223 GQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRV 302 (714) T ss_pred hhcccccccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccce Confidence 0 0 01124576766644321 11111 Q ss_pred --eEEEEecCcccccc-ccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec Q lcl|NC_011045. 227 --IRYEEVEGMEVQGS-DGTYPKEACPYIPIRMVR--LDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN 301 (536) Q Consensus 227 --~~~~~v~g~~i~~~-~~~~~~~~~P~~~~rw~~--~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~ 301 (536) +.+..+.|..++.. .+.|++..|||++.-... ..|..| |.+..+.+-++.+|+..-..+.+ +..+-+ ++. T Consensus 303 ~rv~~~~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~-~~~ 377 (714) T protein:vir:10 303 SRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRV-IMD 377 (714) T ss_pred eeEEEEEEecchhhhcCCCCCCCCceeeEEecceeeeccCccc--eehhhhhhHHHHHHHHHHHHHHH--HhCCce-eec Confidence 11223456655533 345677789998764333 455555 68888999999999866665553 355554 444 Q ss_pred cccccch-hhhcc--CCCcceecCCcc---cc---ccccccccc-chhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCC Q lcl|NC_011045. 
302 PAGITQP-RRLTK--AQTGDFVTGRPE---DI---SFLQLEKQA-DFTVAKAVSDAIEARLSFAF-MLNSAVQRTGERVT 370 (536) Q Consensus 302 ~~g~~~~-~~~~~--~~~g~~~~g~~~---~~---~~~~~~~~~-~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~T 370 (536) ++++..- +.+.. +.+|.++.-+++ .. ..+...... -.+.....++...+.|.+.- ..+.+....+..++ T Consensus 378 ~gav~~~d~~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~S 457 (714) T protein:vir:10 378 EDATQLSDNDLMEQLERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATS 457 (714) T ss_pred cccccccHHHHHHhccCCCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcchhH Confidence 5555442 22322 456666543321 11 112222222 23344555555555555542 12222223444567 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-------------CCC--CC---CC--------Ccc--- Q lcl|NC_011045. 371 AEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-------------QQI--PE---LP--------KEA--- 421 (536) Q Consensus 371 AtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-------------g~l--p~---~~--------~~~--- 421 (536) ..-|..|++.....|+..+.+|..-.. =+.+.+++++.+. +.. +. +. ..+ T Consensus 458 GvAI~~r~~qg~~~l~~~~dnl~~~~~-~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~ 536 (714) T protein:vir:10 458 GVAISNLVEQGATTLAEINDNYQFACQ-QVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISR 536 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCcccccccee Confidence 777999999999999999999877442 2344455544331 100 00 00 011 Q ss_pred --eEEEE-echHHHHHHHHHHHHHHHHHHHHH----hhcchhhhhcC---CHHHHHHHHHHHcCCChh-hccCCHHHHHH Q lcl|NC_011045. 422 --VEPTI-STGLEAIGRGQDLDKLERCVAAWA----ALAPMRDDPDI---NLAMIKLRIANAIGIDTS-GILLTEEQKQQ 490 (536) Q Consensus 422 --v~v~~-vs~La~a~r~~~~~~l~~~~~~~~----~~~p~~~~~~i---d~d~~~~~~a~~~Gv~p~-~i~rs~~ev~~ 490 (536) +.|.+ +.|-...+|.+.++.++++++.+. ++.+..+.... +.+++++.+.+.+|.... 
.-...+++..+ T Consensus 537 ~~~dv~i~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~p~~~ei~~~ir~~~~~~~~~~~~~~e~q~~q 616 (714) T protein:vir:10 537 LNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVA 616 (714) T ss_pred eeEEEEEeeccCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCcCHHHHHHHHHHHcCCCCCccccCcchhHHH Confidence 23333 345567788888888888877542 11111111223 467899999999887321 11211221111 Q ss_pred HHHHH-HHHH---HHHHHHH------HHHHHHHHhh----hcCc------------chHHhhhhc---CCCCC---CC Q lcl|NC_011045. 491 KMAQQ-SMQM---GMDNGAA------ALAQGMAAQA----TASP------------EAMAAAADS---VGLQP---GI 536 (536) Q Consensus 491 ~~~q~-~~q~---~~~~~a~------~~~~~~~~~~----~~~~------------~~~~~~~~~---~~~q~---~~ 536 (536) ..+++ ++++ ++++..+ +.+..+.+++ ..+. .+..++..+ .++|+ -. T Consensus 617 ~~~~~~~~~q~~l~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~q~~~~~~~~a~~a~~l~~~~~~~q~~ 694 (714) T protein:vir:10 617 AQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITGVQNMEQEQ 694 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhH Confidence 11111 1100 0000000 0000000000 0000 000000000 00000 00 No 44 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=99.62 E-value=8.9e-14 Score=92.13 Aligned_cols=510 Identities=14% Similarity=0.081 Sum_probs=232.2 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccc--------ccccccccchHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS--------TDYVTPWQAVGARGLNNLASK 72 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~--------~~~~~~~dst~~~a~~~Laa~ 72 (536) ||++.... -+.+..+|....+..+.|...+++=.+|..-.-.-.+..... ...+.+.-+...-.++... 
T Consensus 1 m~e~~~~~-~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~-- 77 (706) T protein:vir:10 1 MAESRQKQ-HERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRII-- 77 (706) T ss_pred CCcchHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHHh-- Confidence 99976433 235556666665555566666655556652111111111110 0122333333333333332 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEE----- Q lcl|NC_011045. 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYL----- 147 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~----- 147 (536) +..-=+++=++..+.++.- + . ++-+ .++..+......++...+...++.+.++.|.|.+=+ T Consensus 78 --g~~~~nr~~~~v~P~~~~~-----d-~---~~Ae---~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~ 143 (706) T protein:vir:10 78 --SEYRNNRISVKFRPGDNAA-----S-E---ELAN---KLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFV 143 (706) T ss_pred --hHHHhCCCceEEecCCCCc-----h-H---HHHH---HHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccc Confidence 2222223323443322110 0 0 1111 223344444557899999999999999999997522 Q ss_pred ecC---CCCceeeEEE--EecceEEEeeCC---CCC-eEEEEEeEeccHHHHHHHHhHHhhh---ccc--------cC-- Q lcl|NC_011045. 148 PEP---EGSNYNPMKL--YRLSSYVVQRDA---FGN-VLQMVTRDQIAFGALPEDIRKAVEG---QGG--------EK-- 205 (536) Q Consensus 148 ~~~---~~~~~~~~~~--~~l~~~~v~~d~---~G~-v~~i~r~~~~t~~~l~~~~~~~~~~---~~~--------~~-- 205 (536) .+. .....+.++. .|+.+++++.++ ++. ..-+||...|+.+++...|++.... ... .. 
T Consensus 144 ~~~d~~~~~~~i~i~~v~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d~ 223 (706) T protein:vir:10 144 NEYDPMDERQRIAVEPIYDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPDV 223 (706) T ss_pred cccCCCCCCccceeeeeccchhceecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCCc Confidence 221 1122233332 356778887653 332 2247889999999999988764211 000 00 Q ss_pred ----CCCc-eEEEEEEE-EecCCC-------------------------------Ccee-EEEEecCccccccccccccc Q lcl|NC_011045. 206 ----KADE-TIDVYTHI-YLDEDS-------------------------------GEYI-RYEEVEGMEVQGSDGTYPKE 247 (536) Q Consensus 206 ----~~~~-~~~v~~~v-~p~~~~-------------------------------~~~~-~~~~v~g~~i~~~~~~~~~~ 247 (536) ..++ +....+.+ +.+... .+++ .+..+.|..++...+.|+.+ T Consensus 224 ~~~~eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~ 303 (706) T protein:vir:10 224 VYIAKYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGE 303 (706) T ss_pred ceecccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCCC Confidence 0011 11111111 111100 1112 22334677776555677788 Q ss_pred cCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhcc------------ Q lcl|NC_011045. 248 ACPYIPIRMVR--LDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTK------------ 313 (536) Q Consensus 248 ~~P~~~~rw~~--~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~------------ 313 (536) .|||+++-..+ +++..+..|.+..+.+-++.+|+....++..+...-+-+..+.++.+-....-.. T Consensus 304 ~~P~vP~~g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~l~~ 383 (706) T protein:vir:10 304 HIPLIPVYGKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGLEQHWEGRNRKRPAFLPL 383 (706) T ss_pred ccceEEEeeccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHHHHHhhhcccccccchhc Confidence 99999753322 3666777889999999999999988888887766655554554333211110000 Q ss_pred ----CCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHH-hhh-hcccCCCCCCCHHHHHHHHHHHHHHhhh Q lcl|NC_011045. 
314 ----AQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAF-MLN-SAVQRTGERVTAEEIRYVASELEDTLGG 387 (536) Q Consensus 314 ----~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~~~-~~~~~~~~r~TAtEi~~r~~E~~~~LG~ 387 (536) ...|.+++.... ... ...+.-.+.....++.-.+.|.+.- ..+ ++. +.+ .++.--|..|++.....+.. T Consensus 384 ~~~~~~~g~i~~~~~~-~~~--~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG-~~s-n~SG~Ai~~rq~qg~~~~~~ 458 (706) T protein:vir:10 384 RTVTDKTGNVVAPANV-AGY--TQAPVLNQALAALLQQTSADIQEVTGSSQAMQQ-MPS-NVARETVNSLLNRSDMASFI 458 (706) T ss_pred ccccCCCCcccccccc-ccc--CCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcC-Ccc-chHHHHHHHHHHHHHHHHHH Confidence 112332221110 000 1111111223334444444454432 122 222 222 35778899999999999999 Q ss_pred hHHHHHHHH------HHHHHH------HHHHHHHhcCC--CCCCC--------Cc-----c-----eEEEE-echHHHHH Q lcl|NC_011045. 388 VYSILSQEL------QLPLVR------VLLKQLQATQQ--IPELP--------KE-----A-----VEPTI-STGLEAIG 434 (536) Q Consensus 388 v~~rl~~E~------l~Pli~------r~~~il~~~g~--lp~~~--------~~-----~-----v~v~~-vs~La~a~ 434 (536) .+.+|..-. +.-||. |+|.|+...|. .+.+. |. + +.|.+ .+|-...+ T Consensus 459 ~~Dnl~~~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~ 538 (706) T protein:vir:10 459 YLDNMAKSLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSAR 538 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchH Confidence 887665443 333333 22223322221 11111 11 1 23333 25667788 Q ss_pred HHHHHHHHHHHHHHHHhhcchh------hhhcCC---HHHHHHHHHHHcCCChhhccCCHH-HHHHHH-HHHHHHHHHHH Q lcl|NC_011045. 435 RGQDLDKLERCVAAWAALAPMR------DDPDIN---LAMIKLRIANAIGIDTSGILLTEE-QKQQKM-AQQSMQMGMDN 503 (536) Q Consensus 435 r~~~~~~l~~~~~~~~~~~p~~------~~~~id---~d~~~~~~a~~~Gv~p~~i~rs~~-ev~~~~-~q~~~q~~~~~ 503 (536) |.+.++.+.++++.+....|.. +.+..| .++++..+-..++ +....+... +.+++. 
+++++++++++ T Consensus 539 r~~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~--~q~~~~~~~~~eq~~~~q~qq~q~~q~~ 616 (706) T protein:vir:10 539 RDATVNALTQLLQGMLPQDPMRPALMGIIIDNMEGEGLDDFKAFNRRQLL--TQGIVKPRNQQEQAIVQQAQQAQATQPD 616 (706) T ss_pred HHHHHHHHHHHHHhcCCcchhhHHHHHHHHhhcCccchHHHHHHHHHhhc--ccCCccccchhHHHHHHHHHHHHHHHHH Confidence 9999888888887553222211 112344 4556666655554 222333321 111111 11111111000 Q ss_pred -----HHH------HHHHHHHHh--------hhcCcchHHhhhhcCCCCCCC Q lcl|NC_011045. 504 -----GAA------ALAQGMAAQ--------ATASPEAMAAAADSVGLQPGI 536 (536) Q Consensus 504 -----~a~------~~~~~~~~~--------~~~~~~~~~~~~~~~~~q~~~ 536 (536) +.+ +.++.+.++ ..+..+++.+. +..++.+. T Consensus 617 ~~~~~~~aq~~~~qA~~~k~~a~~~q~~~~a~~a~~qa~~~~--~~~~~~~~ 666 (706) T protein:vir:10 617 PNMLLAQAQMVVAQAEAQKSQNETVQTQIKAFTAQQDAMESQ--ANTVYKLA 666 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHH Confidence 000 000000000 01111111111 11111111 No 45 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.61 E-value=3.9e-13 Score=88.62 Aligned_cols=504 Identities=13% Similarity=0.057 Sum_probs=241.6 Q ss_pred CCCc----cccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcc----cccccccccchHHHHHHHHHHH Q lcl|NC_011045. 1 MAEK----RTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNA----STDYVTPWQAVGARGLNNLASK 72 (536) Q Consensus 1 Ma~~----~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~----~~~~~~~~dst~~~a~~~Laa~ 72 (536) |... ...+..+. ..+|..-.+....|.....+-.+|..-. -.+...- .+..+.+.-+...-.++...+. 
T Consensus 11 ~~~~~~~~~~~~~~~~-~~~~~~~~~~q~~~r~~a~~d~~fy~G~--QW~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~ 87 (772) T protein:vir:10 11 LNGLPPAGDTPLTVDE-YADINYEIEDQPAWRAVADKEMDYADGN--QLDTELLRRQQALGIPPAVEDLIGPALLSLQGY 87 (772) T ss_pred hccCCcccccccCHHH-HHHHHHHHhccHHHHHHHHHHHHhhcCC--CCCHHHHHHHHhcCCCcEEEcchHHHHHHHHHH Confidence 2221 11222233 2344444444455666666666676311 1111110 1112223333333333333332 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEE--EEecC Q lcl|NC_011045. 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLL--YLPEP 150 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~~~~ 150 (536) .-=+++=+++.+.++.- + .++-+.| +..+......+++..+...++.+.++.|-|.+ +.+++ T Consensus 88 ----~~~nr~d~~v~Pr~~~~-----d----~~~Ae~l---~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d 151 (772) T protein:vir:10 88 ----EAVTRTDWRVTPNGDVG-----G----QEVADAL---NYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRESD 151 (772) T ss_pred ----HHhcCcceEEecCCCch-----H----HHHHHHH---HHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccccC Confidence 22233333444432100 0 1122223 33344445578999999999999999888765 55666 Q ss_pred CCCceeeEEEEecceEEEeeCCCCCeEE---EEEeEeccHHHHHHHHhHHhhhc---c---------------c------ Q lcl|NC_011045. 151 EGSNYNPMKLYRLSSYVVQRDAFGNVLQ---MVTRDQIAFGALPEDIRKAVEGQ---G---------------G------ 203 (536) Q Consensus 151 ~~~~~~~~~~~~l~~~~v~~d~~G~v~~---i~r~~~~t~~~l~~~~~~~~~~~---~---------------~------ 203 (536) ..+..++++.++..+++++.+......+ +||...|+.+++...|++..... . . 
T Consensus 152 ~~~~~i~i~~v~p~~v~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (772) T protein:vir:10 152 PFKFPYRCRPIRRDEIHWDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTG 231 (772) T ss_pred CCCCCeEEEeeCcccceecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccccccccccccc Confidence 6677888999999999999887555455 78889999999988886531100 0 0 Q ss_pred ------------------cCCCCceEEEEEEEEecCC--------CCc---------------------------eeE-E Q lcl|NC_011045. 204 ------------------EKKADETIDVYTHIYLDED--------SGE---------------------------YIR-Y 229 (536) Q Consensus 204 ------------------~~~~~~~~~v~~~v~p~~~--------~~~---------------------------~~~-~ 229 (536) .....++|.|+++.+.+.. +++ .++ + T Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~~ 311 (772) T protein:vir:10 232 LHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVSRVRR 311 (772) T ss_pred cccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeeeEEEE Confidence 0011267888887554321 111 111 2 Q ss_pred EEecCcccccc-ccccccccCceEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccc Q lcl|NC_011045. 230 EEVEGMEVQGS-DGTYPKEACPYIPIRMVR--LDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGIT 306 (536) Q Consensus 230 ~~v~g~~i~~~-~~~~~~~~~P~~~~rw~~--~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~ 306 (536) ..+.|..++.. .+.|++..|||++.-..+ ..|..| |.+....+-++.+|+..-..+..+. .+. .+ ...|.+ T Consensus 312 ~~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~--G~vr~~kd~Qr~~N~~~S~~~~~l~--~~~-~~-~~~gav 385 (772) T protein:vir:10 312 SYWLGPHCLHDGPTPYTHRHFPYVPFFGFREDATGIPY--GYVRGMKYAQDSLNSGVSKLRWGMS--VAR-VE-RTKGAV 385 (772) T ss_pred EEEecceeeccCCCCCCCCccceEEEeeeEeccCCccc--chhhhhhhHHHHHHHHHHHHHHHHh--ccc-cc-ccCCCc Confidence 23456666543 455777889999764333 556666 7999999999999997766665433 222 23 344444 Q ss_pred chhh--h--ccCCCcceecCCcccc----cccccccccch-hHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHH Q lcl|NC_011045. 
307 QPRR--L--TKAQTGDFVTGRPEDI----SFLQLEKQADF-TVAKAVSDAIEARLSFAF-MLNSAVQRTGERVTAEEIRY 376 (536) Q Consensus 307 ~~~~--~--~~~~~g~~~~g~~~~~----~~~~~~~~~~~-~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~TAtEi~~ 376 (536) +..+ + ..+.++.++.-+++.. ..+.......+ ......++...+.|.+.- ..+.+....+..++..-|.. T Consensus 386 ~~~d~~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~na~SGvAi~~ 465 (772) T protein:vir:10 386 AMTDAQFRRQIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKGTATSGIQEQQ 465 (772) T ss_pred cchhHHHHHhccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchhhHHHHHH Confidence 4321 2 2245566554443311 11111111222 233444444444454431 11222223444567788999 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-------------CCCCC----CC--------C-----cc----- Q lcl|NC_011045. 377 VASELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-------------QQIPE----LP--------K-----EA----- 421 (536) Q Consensus 377 r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-------------g~lp~----~~--------~-----~~----- 421 (536) |++.....|+..+.+|..-... +-+.++.++.+. +.-++ +. + .+ T Consensus 466 rq~qg~~~l~~~~Dnl~~~~~~-~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~ 544 (772) T protein:vir:10 466 QIEQSNQSIGRIMDNFRAGRTL-VGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTR 544 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceeccceeee Confidence 9999999999999887765433 234444443331 11110 00 1 11 Q ss_pred eEEEE-echHHHHHHHHHHHHHHHHHHHHHh-hcchhh---hhcC---CHHHHHHHHHHHcCCChhhccCCHHHHHHHHH Q lcl|NC_011045. 422 VEPTI-STGLEAIGRGQDLDKLERCVAAWAA-LAPMRD---DPDI---NLAMIKLRIANAIGIDTSGILLTEEQKQQKMA 493 (536) Q Consensus 422 v~v~~-vs~La~a~r~~~~~~l~~~~~~~~~-~~p~~~---~~~i---d~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~ 493 (536) +.|.+ ..|-...+|.+.++.++++++.+.. +.+..+ .+.. 
+.+++++.+-...+- .++++.++..+ T Consensus 545 yDv~i~~~p~~~t~r~~~~~~m~ql~~~~~P~~~~~~~~~~le~~D~p~~~ei~~~ir~~~~~------~~peq~~~~~~ 618 (772) T protein:vir:10 545 IKVALEDVPSTNSYRGQQLNAMSEAVKSMPPQYQAAVLPFLVSLMDVPFKRDVVEAIRAVDQQ------QTPEQIQQQID 618 (772) T ss_pred EEEEeeccccchHHHHHHHHHHHHHHhccChhHHHHHHHHHHhhcCCCChHHHHHHHHHHhcc------CChHHHHHHHH Confidence 12222 3566778888888888887654211 111111 1122 456788877776653 23333222211 Q ss_pred HHHHHHHHHHH--------H------HHHHHHHHHhh-----------hcCcchHHhhh----------hcCCC------ Q lcl|NC_011045. 494 QQSMQMGMDNG--------A------AALAQGMAAQA-----------TASPEAMAAAA----------DSVGL------ 532 (536) Q Consensus 494 q~~~q~~~~~~--------a------~~~~~~~~~~~-----------~~~~~~~~~~~----------~~~~~------ 532 (536) ++.+++.++++ . .+.+....+++ ..+++...+++ ...|. T Consensus 619 q~~qq~~~~~~~el~~~q~~a~~~~~~A~a~~~~aqa~~~~~~a~~~a~~aa~~~~q~~q~a~~ad~~l~~~g~~~~~~~ 698 (772) T protein:vir:10 619 QAVQDALAKAGNDIKLRELEIKERKADSEISGLNAKAVQIGVQAAFSAMQAGAQIAQMPMIAPIADAVMQSAGYQRPNPA 698 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhhhhhHHHHHHHHhccccccccc Confidence 11111000000 0 00000000000 00000000000 00110 Q ss_pred --CCCC Q lcl|NC_011045. 533 --QPGI 536 (536) Q Consensus 533 --q~~~ 536 (536) .|+. T Consensus 699 ~~~~~~ 704 (772) T protein:vir:10 699 GDDPNY 704 (772) T ss_pred ccCCCC Confidence 0011 No 46 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=99.59 E-value=2.1e-14 Score=95.60 Aligned_cols=512 Identities=14% Similarity=0.082 Sum_probs=239.1 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccc--------ccccccccchHHHHHHHHHHH Q lcl|NC_011045. 
1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS--------TDYVTPWQAVGARGLNNLASK 72 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~--------~~~~~~~dst~~~a~~~Laa~ 72 (536) ||++-..+ -+.+..||....+.-+.|...|++=.+|..=.-.-.+...-. ...+.+.-+...-.++.+.+. T Consensus 1 ma~~~~~~-~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v~g~ 79 (708) T protein:vir:17 1 MAETLEKK-HERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAE 79 (708) T ss_pred CchhHHHH-HHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHHHhh Confidence 99964322 344555555555555566666655544310000011110000 001222222222222222221 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEE-----EE Q lcl|NC_011045. 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLL-----YL 147 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l-----~~ 147 (536) --=+++=+++.+.++.- + .++-+.| +..+......++...+...++.+.++.|.|++ |+ T Consensus 80 ----e~~nr~d~~v~p~~~~~-----d----~~~Ae~l---~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~ 143 (708) T protein:vir:17 80 ----YRNNRITVKFRPGDREA-----S----EELANKL---NGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLV 143 (708) T ss_pred ----HhhCCcceEEecCCCcc-----h----HHHHHHH---HHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeeccc Confidence 11123333444442210 0 1122223 33344455578999999999999999999976 33 Q ss_pred ecC-----CCCceeeEEEEecceEEEeeCC---CCC-eEEEEEeEeccHHHHHHHHhHHhhhcc-----ccCC----CCc Q lcl|NC_011045. 148 PEP-----EGSNYNPMKLYRLSSYVVQRDA---FGN-VLQMVTRDQIAFGALPEDIRKAVEGQG-----GEKK----ADE 209 (536) Q Consensus 148 ~~~-----~~~~~~~~~~~~l~~~~v~~d~---~G~-v~~i~r~~~~t~~~l~~~~~~~~~~~~-----~~~~----~~~ 209 (536) .++ ..+-++..+..|..+++++.++ ++. -.-+||...|+.+++...|++...... .... 
..+ T Consensus 144 ~e~d~~~~~~~i~i~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~d 223 (708) T protein:vir:17 144 NEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDAD 223 (708) T ss_pred ccCCCCCCccccceEeeccchhheecCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCCC Confidence 222 1122222333466788888775 321 222688999999999999976532110 0000 013 Q ss_pred eEEEEEEEEe-----------cCCCC-------------------------------ceeEE-EEecCcccccccccccc Q lcl|NC_011045. 210 TIDVYTHIYL-----------DEDSG-------------------------------EYIRY-EEVEGMEVQGSDGTYPK 246 (536) Q Consensus 210 ~~~v~~~v~p-----------~~~~~-------------------------------~~~~~-~~v~g~~i~~~~~~~~~ 246 (536) +|-|..+.+. ++.++ ++++| .-+.|..++...+.+++ T Consensus 224 ~vrv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~~~~~p~ 303 (708) T protein:vir:17 224 VIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEKPRRIPG 303 (708) T ss_pred eEEEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccCCCCCCC Confidence 3333222211 11111 11122 22357777656666888 Q ss_pred ccCceEEE---eeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhh------------ Q lcl|NC_011045. 247 EACPYIPI---RMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRL------------ 311 (536) Q Consensus 247 ~~~P~~~~---rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~------------ 311 (536) +.|||+++ ||. .+|...-.|.+..+.+-++.+|+..-..++.+.++.+-+++++.+.+.....- T Consensus 304 ~~fP~vP~~g~r~~-~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~ 382 (708) T protein:vir:17 304 EHIPLIPVYGKRWF-IDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFL 382 (708) T ss_pred CccceEEEeccccc-ccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhh Confidence 89999876 444 36666556899999999999999999999999999999999887654322110 Q ss_pred ----ccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHHHHHHHHHHhh Q lcl|NC_011045. 
312 ----TKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAF-MLNSAVQRTGERVTAEEIRYVASELEDTLG 386 (536) Q Consensus 312 ----~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG 386 (536) ..+..|.++++.... . ....+.--+.....++...+.|.+.- ..+.+..+.+ .++.--|..|++.....++ T Consensus 383 ~~~~~~~~~g~v~~~a~~~-~--~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~s-n~SG~Ai~~rq~qg~~~~~ 458 (708) T protein:vir:17 383 PLREVRDKYGNIIAGATPA-G--YTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPS-NIAQETVNNLMNRADMASF 458 (708) T ss_pred hhhccCCcccccccccCCc-c--cCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCcc-chHHHHHHHHHHHHHHHHH Confidence 012223333222111 0 01111111233334444444444432 1121222222 3567778899999999999 Q ss_pred hhHHHHH------HHHHHHHHHHHH------HHHHhcCC----------CCCCCCc-----ce-----EEEE-echHHHH Q lcl|NC_011045. 387 GVYSILS------QELQLPLVRVLL------KQLQATQQ----------IPELPKE-----AV-----EPTI-STGLEAI 433 (536) Q Consensus 387 ~v~~rl~------~E~l~Pli~r~~------~il~~~g~----------lp~~~~~-----~v-----~v~~-vs~La~a 433 (536) ..+.++. -+++.-||...| .|+...|. ..+.++. ++ .|.+ ..|-... T Consensus 459 ~~~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t 538 (708) T protein:vir:17 459 IYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTA 538 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchh Confidence 8888776 344444555433 22222221 1112221 22 2333 3455678 Q ss_pred HHHHHHHHHHHHHHHHHhhcchh------hhhcC---CHHHHHHHHHHHcCCChhhccC--CHHHHHHHHHHHHHHHHH- Q lcl|NC_011045. 434 GRGQDLDKLERCVAAWAALAPMR------DDPDI---NLAMIKLRIANAIGIDTSGILL--TEEQKQQKMAQQSMQMGM- 501 (536) Q Consensus 434 ~r~~~~~~l~~~~~~~~~~~p~~------~~~~i---d~d~~~~~~a~~~Gv~p~~i~r--s~~ev~~~~~q~~~q~~~- 501 (536) +|.+..+.++++++.+....|.. +.++. +.++++..+...++. ..... 
++++.++..+++++++++ T Consensus 539 ~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~--~~~~~~~~~e~~q~~~q~qq~~q~q~ 616 (708) T protein:vir:17 539 RRDATVSVLTNVLSSMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLI--SGIAKPRNEKEQQIVQQAQMAAQSQP 616 (708) T ss_pred HHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhc--cccccCcchhhHHHHHHHHHHHHHHH Confidence 88888888888777654322211 11223 457788777776653 22222 222222221111111100 Q ss_pred --HH--HHH------HHHHHHHHhh-hcCcchHH----------hhhhcCCCCCCC Q lcl|NC_011045. 502 --DN--GAA------ALAQGMAAQA-TASPEAMA----------AAADSVGLQPGI 536 (536) Q Consensus 502 --~~--~a~------~~~~~~~~~~-~~~~~~~~----------~~~~~~~~q~~~ 536 (536) ++ +.+ +.+..+.++. ....++.+ ++.-....++.+ T Consensus 617 ~~~~~eaqa~~~~~qAe~~ka~aea~~~q~~a~q~~~~~~~a~~~a~q~~~q~~~~ 672 (708) T protein:vir:17 617 NPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNI 672 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 000 0000000000 00000000 000000011111 No 47 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=99.50 E-value=2.6e-13 Score=89.60 Aligned_cols=509 Identities=12% Similarity=0.063 Sum_probs=222.4 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCc-c-------ccccccc-ccchHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDN-A-------STDYVTP-WQAVGARGLNNLAS 71 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~-~-------~~~~~~~-~dst~~~a~~~Laa 71 (536) ||++-..+ -..+..+|....+..+.|.....+=.+|..-.-.-.+... . ....+.. |+-++ -.+ . 
T Consensus 1 ma~~~~~~-l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~-~~v----~ 74 (720) T protein:vir:35 1 MAETLQKR-HEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKIS-TEL----N 74 (720) T ss_pred CchHHHHH-HHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHH-HHH----H Confidence 98864211 1233344444444334444333333344311011111100 0 0011111 22222 222 3 Q ss_pred HHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEE---- Q lcl|NC_011045. 72 KLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYL---- 147 (536) Q Consensus 72 ~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~---- 147 (536) ++.+.--=+++=+++.+.+..- + .++-+.| +..+......++...+...++.+.++.|-|++-+ T Consensus 75 ~v~g~~~~nr~d~~v~P~~~~~-----d----~~~Ae~l---~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~ 142 (720) T protein:vir:35 75 RIISEYRHNRITVKFRPGDKTA-----S----EALANKL---NGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNL 142 (720) T ss_pred HHHhHHHhCCCceEEEcCCCcc-----h----HHHHHHH---HHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecc Confidence 3333332233433444442210 0 1122223 2333344457889999999999999999998743 Q ss_pred -ecCC---CCceeeEEEE--ecceEEEeeCCCC-C---eEEEEEeEeccHHHHHHHHhHHhhhccc---cC-----CCCc Q lcl|NC_011045. 148 -PEPE---GSNYNPMKLY--RLSSYVVQRDAFG-N---VLQMVTRDQIAFGALPEDIRKAVEGQGG---EK-----KADE 209 (536) Q Consensus 148 -~~~~---~~~~~~~~~~--~l~~~~v~~d~~G-~---v~~i~r~~~~t~~~l~~~~~~~~~~~~~---~~-----~~~~ 209 (536) .+.+ +...++++++ |..+++++.++.- . -.-+||...|+.+++...|++....... .. .... 
T Consensus 143 ~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~ 222 (720) T protein:vir:35 143 VNALDPMDERQRICLEPIYDPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVD 222 (720) T ss_pred cccCCCCcccceeeEecccCchhheeecccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccCCC Confidence 1111 2233445544 4557777665421 1 2235677789999999999865332111 11 1123 Q ss_pred eEEEEEEEEe-----------cCCC-------------------------------CceeEEE-EecCcccccccccccc Q lcl|NC_011045. 210 TIDVYTHIYL-----------DEDS-------------------------------GEYIRYE-EVEGMEVQGSDGTYPK 246 (536) Q Consensus 210 ~~~v~~~v~p-----------~~~~-------------------------------~~~~~~~-~v~g~~i~~~~~~~~~ 246 (536) .|.|.++.+. ++.+ .++++|. -+.|..++...+.+|+ T Consensus 223 ~v~i~E~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~ 302 (720) T protein:vir:35 223 VVYIAKYYEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPG 302 (720) T ss_pred ceEEEEeeEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCC Confidence 3444333221 1111 1122322 2356666555566777 Q ss_pred ccCceEEEeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccC---------- Q lcl|NC_011045. 247 EACPYIPIRMV--RLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKA---------- 314 (536) Q Consensus 247 ~~~P~~~~rw~--~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~---------- 314 (536) +.|||+++-.. ..+|..+..|.+..+.+-++.+|+..-..+..+...-.-+.....+++-....-... T Consensus 303 ~~fP~vP~~g~r~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~ 382 (720) T protein:vir:35 303 EHIPLIPVYGKRWFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQDTGSIPIVGKSQIKTLEKYWANRNKNRPAFLP 382 (720) T ss_pred CccceEEEEeeeeccCCCcccceeeecchhHHHHHHHHHHHHHHHHHcCCccccccCcchHHHHHHHhhccccccccccc Confidence 88999876322 347788888999999999999999888888887655444444433333222211111 Q ss_pred ------CCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHH-hhh-hcccCCCCCCCHHHHHHHHHHHHHHhh Q lcl|NC_011045. 
315 ------QTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAF-MLN-SAVQRTGERVTAEEIRYVASELEDTLG 386 (536) Q Consensus 315 ------~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~~~-~~~~~~~~r~TAtEi~~r~~E~~~~LG 386 (536) .+|.++.. ++.....+ ...-.+.....++.-...|.++- ..+ ++. ..+. ++.--|..|++.....+. T Consensus 383 ~~~~~~~~G~~~~~-~~~~~~~~--~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG-~~sn-~SG~Ai~~rq~qg~~~~~ 457 (720) T protein:vir:35 383 LNEIVDKQGNIIAP-PTPVGYTQ--PQPLNQAMAALLQQTGADIQEVTGSSQAMQP-MPSN-IAKETVNHLMHRSDMSSF 457 (720) T ss_pred cccccccCcccccC-CCcccccC--CCCCchHHHHHHHHHHHHHHHHhCCChHHcC-cccc-hHHHHHHHHHHHHHHHHH Confidence 12222111 11111111 11111222233333333343332 122 222 2332 467789999999999999 Q ss_pred hhHHHHHHHH------HHHHHHHHH------HHHHhcCC-----C-----CCCCCc-----c-----eEEEE-echHHHH Q lcl|NC_011045. 387 GVYSILSQEL------QLPLVRVLL------KQLQATQQ-----I-----PELPKE-----A-----VEPTI-STGLEAI 433 (536) Q Consensus 387 ~v~~rl~~E~------l~Pli~r~~------~il~~~g~-----l-----p~~~~~-----~-----v~v~~-vs~La~a 433 (536) ..+..+..-. +.-||...+ .|....|. + .+.++. + +.|.+ ++|-... T Consensus 458 ~~~Dnl~~~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s 537 (720) T protein:vir:35 458 IYLDNMAKSLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTA 537 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCccc Confidence 9888876543 333443333 22222221 0 112221 1 22333 3456677 Q ss_pred HHHHHHHHHHHHHHHHHhhcc------hhhhhcCCH---HHHHHHHHHHcCCChhhccCC--HHHHHHHHHHHHH--HHH Q lcl|NC_011045. 434 GRGQDLDKLERCVAAWAALAP------MRDDPDINL---AMIKLRIANAIGIDTSGILLT--EEQKQQKMAQQSM--QMG 500 (536) Q Consensus 434 ~r~~~~~~l~~~~~~~~~~~p------~~~~~~id~---d~~~~~~a~~~Gv~p~~i~rs--~~ev~~~~~q~~~--q~~ 500 (536) +|.+.++.++++++.+....+ ..+..+.|+ ++++..+...+. |...+.. 
.++.+++.+++++ +++ T Consensus 538 ~req~~~~m~qll~~~~p~~~~~~~~~~~ile~~d~p~~~e~~erirk~~~--~~~~~~~~~~e~qq~~a~~qq~~qq~~ 615 (720) T protein:vir:35 538 RRDATVSVLTNLLAGMLPQDPMRQVLQGIILDNMEGEGLDEFKEYNRKQLL--TQGVVKPRNTEEEQMVAQMIQQAQQPN 615 (720) T ss_pred HHHHHHHHHHHHHHhcCCCchhHHHHHHHHHHhcCchhHHHHHHHHHhhcc--hhcccCccChhHHHHHHHHHHHHHhHh Confidence 888888877777664322111 011223443 456655554432 2223322 2222121111111 110 Q ss_pred HHHHHH--------HHHHH---------HHHhhhcCc---------chHHhhhhcCCCCCCC Q lcl|NC_011045. 501 MDNGAA--------ALAQG---------MAAQATASP---------EAMAAAADSVGLQPGI 536 (536) Q Consensus 501 ~~~~a~--------~~~~~---------~~~~~~~~~---------~~~~~~~~~~~~q~~~ 536 (536) .+.+.. +.... +.+..+.+. +...+. ....|+.+ T Consensus 616 ~e~~~aqa~l~qaqae~~kaqa~~~~~qa~a~~aqa~a~~~~a~~~~~~aq~--~~~~q~~i 675 (720) T protein:vir:35 616 AELVAAQGVLMQGQAEVQKAKNEELAIQVKAFQAQTEARVAEAKMVQILASA--DSAKRAEI 675 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHH Confidence 000000 00000 000000000 000111 01112222 No 48 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=99.35 E-value=6.7e-11 Score=76.37 Aligned_cols=454 Identities=11% Similarity=0.019 Sum_probs=189.4 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCC-C---Cccccc-ccccccchHHHHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKD-S---DNASTD-YVTPWQAVGARGLNNLASKLML 75 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~-~---~~~~~~-~~~~~dst~~~a~~~Laa~l~~ 75 (536) +.-..+.++.+.+...-..|-.....-.++.+.+.+|..-...... . ....+. 
..+...+-+..+++.++..| T Consensus 16 ~~~p~~~~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~l-- 93 (501) T protein:vir:25 16 VEFPEDSMSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQNL-- 93 (501) T ss_pred ccCCcccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHhhh-- Confidence 3333344455555444444444444444566666667543211000 0 000111 11223345666666666544 Q ss_pred hhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCce Q lcl|NC_011045. 76 ALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) Q Consensus 76 ~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~ 155 (536) +|.+ |++ +|... .+.+. ....+++|....+++.++..+||.|.+++-.+..+.. T Consensus 94 --~~~g--f~~--~d~~~---------~~~l~-----------~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~~~ 147 (501) T protein:vir:25 94 --SVVG--YRN--ALAKE---------NDPAW-----------EMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEGPV 147 (501) T ss_pred --cccc--eec--CCccc---------hHHHH-----------HHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCCe Confidence 3432 333 22111 11122 3345688999999999999999999988876665433 Q ss_pred eeEEEEecceEE-EeeCCCC--CeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEE--EEEEecCCCCce---- Q lcl|NC_011045. 156 NPMKLYRLSSYV-VQRDAFG--NVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVY--THIYLDEDSGEY---- 226 (536) Q Consensus 156 ~~~~~~~l~~~~-v~~d~~G--~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~--~~v~p~~~~~~~---- 226 (536) +++++..+.. 
|-.|+.- ++...++.+....+ .+....+++| ++++.-..+..+ T Consensus 148 --i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~----------------~~~~~~~~~y~~~~~~~~~~~~~~~~~~ 209 (501) T protein:vir:25 148 --FRTRSPRQILAVYADPSVDAWPQYALETWVAQKD----------------AKPHRRGVLYDDTYMYELDLGEVVLGDA 209 (501) T ss_pred --EEEeccccEEEEEecCCCCcceeEEEEEEeeccc----------------cCcceeEEEecCeeEEEEecCceeeeec Confidence 5556655554 5455432 34443433321100 0011111221 111111111100 Q ss_pred ----eEEEEecCcc----ccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce Q lcl|NC_011045. 227 ----IRYEEVEGME----VQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIG 298 (536) Q Consensus 227 ----~~~~~v~g~~----i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~ 298 (536) ..+..+.+.. .....+.++|..||++.+.=+. ..+.+|+|=.+..++-+..+|...-.++..++..+.|.. T Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~-~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~ 288 (501) T protein:vir:25 210 GGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGR-DADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQR 288 (501) T ss_pred cccccccccccccccccccccccccCCccceeeEeccCcc-ccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHH Confidence 0011111111 1112344567788988765443 345689998888999999999999999999998888864 Q ss_pred eeccccccchhhhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhh----hcccCCCCCCCHHHH Q lcl|NC_011045. 299 LVNPAGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN----SAVQRTGERVTAEEI 374 (536) Q Consensus 299 lv~~~g~~~~~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~----~~~~~~~~r~TAtEi 374 (536) .+. .-..+..+......|.+.....+++.+.++. ..+++... +.++.-|....... 
..........++.-+ T Consensus 289 ~i~-G~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~-~~~~~~~~---~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al 363 (501) T protein:vir:25 289 VIS-GWTGSKAEVLKASALRVWTFEDPEVKAQAFP-PASVEPYN---LILEEMLQHVAMVAQISPAQVTGKMINVSAEAL 363 (501) T ss_pred HHh-CCCCCccchhhhcccceeccCCCCceEEEec-ccChHHHH---HHHHHHHHHHHhhcCCChhhhccccCChHHHHH Confidence 442 1111222333444555543333344444443 23454433 34444443322211 111011123355544 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCCCcceEEEEechHHH--HHHHHHHHHHHHHHHHHHh Q lcl|NC_011045. 375 RYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQA-TQQIPELPKEAVEPTISTGLEA--IGRGQDLDKLERCVAAWAA 451 (536) Q Consensus 375 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~-~g~lp~~~~~~v~v~~vs~La~--a~r~~~~~~l~~~~~~~~~ 451 (536) .....-+.+.. .+.+..|-.. +.+++.++.. .|.-.......+++.+..++.. ++.+..+.+|.+. . T Consensus 364 ~~~~~~l~~ka----~~k~~~f~~~-l~~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~-----g 433 (501) T protein:vir:25 364 AAAEANQQRKL----AAKRESFGES-WEQLLRLAAEMDDDPDTAADSGAEVLWRDTEARSFGAVVDGITKLASA-----G 433 (501) T ss_pred HHHHHHHHHHH----HHHHHHHHHH-HHHHHHHHHHHhCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhc-----C Confidence 43332222221 2222222221 2222332211 1211112223466666555432 2222222222111 1 Q ss_pred hcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhhhcCcchHHhhhhcC Q lcl|NC_011045. 452 LAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGM-AAQATASPEAMAAAADSV 530 (536) Q Consensus 452 ~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~-~~~~~~~~~~~~~~~~~~ 530 (536) +.+ .. .+....|+++ ++++++.++++.+.. ..+..+..++. .......++...+..+.+ T Consensus 434 is~---------et---~~~~~~g~~~-------~~ie~~~~~~~e~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (501) T protein:vir:25 434 IPI---------EH---LLSMVPGMTQ-------QTIQAIKDSLRGGEV-KSLVDKLLSNEPAPVPPPPPQAAAQALNEG 493 (501) T ss_pred CCH---------HH---HHHHcCCCCH-------HHHHHHHHHHHHHhH-HHHHHHhhccCcCCCCCCCCCCCccccccc Confidence 211 11 1334457644 444444433322211 11111111111 111111111121212222 Q ss_pred CCCCCC Q lcl|NC_011045. 
531 GLQPGI 536 (536) Q Consensus 531 ~~q~~~ 536 (536) +..|-= T Consensus 494 ~~~~~~ 499 (501) T protein:vir:25 494 GVNGNG 499 (501) T ss_pred cCCCCC Confidence 211111 No 49 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.33 E-value=1e-10 Score=75.29 Aligned_cols=451 Identities=13% Similarity=0.068 Sum_probs=193.3 Q ss_pred CCC---ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHh--cccccCCCCCcccccccccccchHHHHHHHHHHHHHH Q lcl|NC_011045. 1 MAE---KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYT--IPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLML 75 (536) Q Consensus 1 Ma~---~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~--~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~ 75 (536) |+. .-++++.+.+.++...+...+.+...+++++|+=- +|.... ......+..+...+-+..+|++++..|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~~~i~~~~~--~~~~~~~~~~~~~n~~~~ivd~~~~~l~- 77 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQDLGDNTAYYESERRPDAVGV--TVPQQMQKLLAHVGYPRLYIDAIAARQE- 77 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccc--ccchhHHhhhhhcCcHHHHHHHHHhhhc- Confidence 665 34667766666665665555544444444444211 111100 0011111112233456666666666542 Q ss_pred hhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCce Q lcl|NC_011045. 76 ALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) Q Consensus 76 ~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~ 155 (536) |.+ |+. ++. .+ .. ..+.+...+++|.....++.++..+||.|.++|..+..+.. 
T Consensus 78 ---~~g--~~~--~~~--------~~----~~-------~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~ 131 (484) T protein:vir:77 78 ---LEG--FRL--GGA--------DK----AD-------EQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNID 131 (484) T ss_pred ---cCc--eec--CCc--------ch----hH-------HHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcc Confidence 321 222 111 00 11 11233345689999999999999999999988776654322 Q ss_pred -------eeEEEEecceEEEeeCC-CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCcee Q lcl|NC_011045. 156 -------NPMKLYRLSSYVVQRDA-FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYI 227 (536) Q Consensus 156 -------~~~~~~~l~~~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~ 227 (536) .++++++..+.++..|+ .+++...++.+.-. .......+++|+ ++ ..+. T Consensus 132 ~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~~a~~~~~~~-----------------~~~~~~~~~~y~---~~---~~~~ 188 (484) T protein:vir:77 132 PGVDPEVPIIRVEPPTNLYAQIDPRTRQVMRAIRAIEDE-----------------EGNEVIGATLYL---PN---NTVI 188 (484) T ss_pred cccccccceEEEeccceeEEEecCCCCceEEEEEEEEee-----------------cCCcEEEEEEEe---cC---eEEE Confidence 24667777776666664 45665555544321 001111222221 11 0111 Q ss_pred EEEEecCccccccccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhCCceeecccccc Q lcl|NC_011045. 228 RYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEE-YLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGIT 306 (536) Q Consensus 228 ~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~-~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~ 306 (536) |...+|.-...++..+++..+|++.++.+...++.+|+|-.++ ..+-+..++...-.+...++..+.|...+. |.. 
T Consensus 189 -~~~~~~~~~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~--G~~ 265 (484) T protein:vir:77 189 -WNREDGQWVQVANVAHNLEMVPVIPIPNRTRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLF--GVK 265 (484) T ss_pred -EEecCCceEeeccccCCCCCcceEEeccccccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHh--CCC Confidence 1111121111122345678999999998888899999997764 557778888888888888888877775542 111 Q ss_pred ch---------hhhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhh-----hcccCCCCCCCHH Q lcl|NC_011045. 307 QP---------RRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN-----SAVQRTGERVTAE 372 (536) Q Consensus 307 ~~---------~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~-----~~~~~~~~r~TAt 372 (536) .. ........|.+.....+++.+.++.. +.+ ...++.++.-|.+..... .+.......-++. T Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~-~~~---e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~ 341 (484) T protein:vir:77 266 GEELGVDPETGQTLFDAYLARILAFEDHESKAQQFSA-AEL---RNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAE 341 (484) T ss_pred cchhcccccccchhhhhhhhhhcccCCCCceeEeecC-CCh---HHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHH Confidence 00 11112223333322223333333331 223 344555555554432111 1110111112343 Q ss_pred HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCC--cceEEEEechHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 373 EIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPK--EAVEPTISTGLEAIGRGQDLDKLERCVAAWA 450 (536) Q Consensus 373 Ei~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~--~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~ 450 (536) -+.....-+... ..+.+.. +.+-+.+++.++.........+. ..+++.+.-++..- ..+.++.+.. 
++ T Consensus 342 Al~~~~~~l~~k----a~~k~~~-f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~s-~~~~ad~~~k----l~ 411 (484) T protein:vir:77 342 AIRSSESRLVKT----VERKNKI-FGGAWEQAMRVAYKVMNGGDIPPEYYRMESIWRDPSTPT-YAAKADAATK----LY 411 (484) T ss_pred HHHHHHHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHhCCCCcccccccceEEecCCCCCC-HHHHHHHHHH----HH Confidence 333322211111 1222222 22223444444333211122222 34666765443221 1122222222 22 Q ss_pred hhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---hcCcchHHhhh Q lcl|NC_011045. 451 ALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQA---TASPEAMAAAA 527 (536) Q Consensus 451 ~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~---~~~~~~~~~~~ 527 (536) +.+... +.-.. +...+|+.+. ..+|++++++++..+.++.. .+ +.+...+. +...+..+..+ T Consensus 412 ~~g~gi----~s~et----~~~~l~~~~~----~~~e~~~~~~ee~~~~~~~~--~~-~~~~~~~~~~~~~~~~~~~~~~ 476 (484) T protein:vir:77 412 NNGQGV----IPKER----ARIDMGYSIT----EREEMRKWDEEEQAQGLGLM--GT-MFGTDPSGGGNPDNPETPEPQP 476 (484) T ss_pred hccCCC----CCHHH----HHhcCCCChh----HHHHHHHHHHHHHHHHHHHH--hh-hccccccCCCCCCCCCcccccC Confidence 222111 12222 3333454222 12344444333322211111 11 11111111 11122222334 Q ss_pred hcCCCCCC Q lcl|NC_011045. 528 DSVGLQPG 535 (536) Q Consensus 528 ~~~~~q~~ 535 (536) +++.-+-| T Consensus 477 ~~~~~~~~ 484 (484) T protein:vir:77 477 NPAEEAAA 484 (484) T ss_pred CCccccCC Confidence 44444444 No 50 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=99.31 E-value=1.4e-10 Score=74.59 Aligned_cols=444 Identities=12% Similarity=0.039 Sum_probs=191.1 Q ss_pred CCC-ccccccHHHHHHHHH-HHHHHhhhHHHHHHHHHHHhcccccCC---CCC--ccccccc-ccccchHHHHHHHHHHH Q lcl|NC_011045. 
1 MAE-KRTGLAEEGAKSVYE-RLKNDRAPYETRAQNCAQYTIPSLFPK---DSD--NASTDYV-TPWQAVGARGLNNLASK 72 (536) Q Consensus 1 Ma~-~~~~~~~~~~~~r~~-~l~~~R~~~e~~w~e~~~~~~P~~~~~---~~~--~~~~~~~-~~~dst~~~a~~~Laa~ 72 (536) |=+ ..+.|+.+.+++... +|-.....-.++++.+.+|..-..-.. ... ...+++. ++..+-+..+|+++++. T Consensus 1 ~~~~p~~~l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~ 80 (479) T protein:vir:99 1 MIDLPDEDLSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQQ 80 (479) T ss_pred CccCCcccCChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHHhh Confidence 877 566788777655433 344443344566667777764432110 001 0011111 12335566666666654 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecC-- Q lcl|NC_011045. 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEP-- 150 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~-- 150 (536) | +|.+ |+. .+... ...++ +.+..++|....+++.++..+||.+.+++... T Consensus 81 l----~~~g--f~~--~d~~~---------~~~~~-----------~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~ 132 (479) T protein:vir:99 81 L----IVDG--YRK--TGTNE---------NAKGW-----------DTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGIS 132 (479) T ss_pred c----cccc--ccC--CCchh---------hHHHH-----------HHHHhcChhHHHHHHHHHHhhcCceEEEEecCCC Confidence 4 3432 332 22211 11222 23345789999999999999999998887642 Q ss_pred --CCCceeeEEEEecceEEEee-CCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCcee Q lcl|NC_011045. 151 --EGSNYNPMKLYRLSSYVVQR-DAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYI 227 (536) Q Consensus 151 --~~~~~~~~~~~~l~~~~v~~-d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~ 227 (536) ++.+..++++++..+.++.- |+......+|. ++. +.+..+.+|+. ..+. 
T Consensus 133 ~~d~~g~~~i~~~~p~~~~~iydd~~~~~~~~~~---~~~------------------~~~~~~~~~~~-------~~~~ 184 (479) T protein:vir:99 133 PLDGTTVARIKCIDPRDAFAIWEDPYWDEWPKYL---LER------------------QPNGQYWWWTE-------EDYS 184 (479) T ss_pred CcCCCCceEEEEechhheEEEecCCcccceeeEE---Eee------------------cCceeEEEEec-------ceEE Confidence 33344567777766655443 33322222221 110 11111222110 0111 Q ss_pred EEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccc Q lcl|NC_011045. 228 RYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQ 307 (536) Q Consensus 228 ~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~ 307 (536) .|....|.-.......+++..+|++.++-+...+ .+|+|=.+..++.+-.++...-.+...++..+.|.+.+. |... T Consensus 185 ~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~-~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~--G~~~ 261 (479) T protein:vir:99 185 IFEFKQGKFIYRETVSHDYGHIPFVRYVNVMDLR-GVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWAT--GLML 261 (479) T ss_pred EEEecCCceeeccccccCCCCcceEEeecCCCcC-cCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhc--CCCc Confidence 1211122222222234456789999988776654 589999999999999999999999999999999875543 1111 Q ss_pred h------hhhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccC---CCCCCCHHHHHHHH Q lcl|NC_011045. 308 P------RRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQR---TGERVTAEEIRYVA 378 (536) Q Consensus 308 ~------~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~---~~~r~TAtEi~~r~ 378 (536) + ........+.++....+++.+.++. .+++ ...++.++.-|.+.+........ .....++.-+.... 
T Consensus 262 ~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~-~~~~---~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n~Sg~Al~~~~ 337 (479) T protein:vir:99 262 PEGANADQEKMRFAQESMLISQNEKASFGAIP-AAPL---DGLLNAYKESLLEFLALAQLPPHIAGQIVNVAADALAAGT 337 (479) T ss_pred ccccccchhccccccccceeecCCCceEEEec-ccch---HHHHHHHHHHHHHHhccCCCCHHHcccccchHHHHHHHHH Confidence 1 1111112222333233344444443 2233 34444444444333221111000 11123454444332 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhh Q lcl|NC_011045. 379 SELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-QQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRD 457 (536) Q Consensus 379 ~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~ 457 (536) .-+... .++.+. .+.+-+.+++.++.+. |...+.....+++.+-.+... ...+.++.+...++ .+ T Consensus 338 ~~l~~k----a~~~~~-~f~~al~~~~~l~~~~~~~~~~~~~~~i~~~w~~~~~~-s~~~~ad~~~kl~~----ag---- 403 (479) T protein:vir:99 338 RQTMQK----LFEKQA-TWKASHNQTMRLVNKIEGRTEEATDLDFTITWQDVTIQ-SLAQFADAWAKMVE----SL---- 403 (479) T ss_pred HHHHHH----HHHHHH-HHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCCC-CHHHHHHHHHHHHh----cC---- Confidence 222221 122222 2333445555544332 222222222355555332111 01112222222111 11 Q ss_pred hhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHh-hhcCcchHHhhhhcCCCCC Q lcl|NC_011045. 458 DPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGM--AAQ-ATASPEAMAAAADSVGLQP 534 (536) Q Consensus 458 ~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~--~~~-~~~~~~~~~~~~~~~~~q~ 534 (536) -+....++ ....||++ ++++.+++.++.+.+..+.+.+...+. +.+ .+..++.. ...++.+| T Consensus 404 --~is~et~l---~~l~gv~~-------~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 468 (479) T protein:vir:99 404 --KIPAEGVW---DMIPNLDQ-------STVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATN---MQQANNKT 468 (479) T ss_pred --CCCHHHHH---HhcCCCCH-------HHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCC---CCCCCCCC Confidence 02222222 22236643 444444333332222222222222111 111 11111000 11111222 Q ss_pred C----C Q lcl|NC_011045. 
535 G----I 536 (536) Q Consensus 535 ~----~ 536 (536) | | T Consensus 469 ~~~~~~ 474 (479) T protein:vir:99 469 GEPASL 474 (479) T ss_pred cchhcc Confidence 2 2 No 51 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.29 E-value=1.7e-10 Score=74.12 Aligned_cols=444 Identities=12% Similarity=0.071 Sum_probs=194.0 Q ss_pred CC-----CccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccC---CCCCcccccccccccchHHHHHHHHHHH Q lcl|NC_011045. 1 MA-----EKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFP---KDSDNASTDYVTPWQAVGARGLNNLASK 72 (536) Q Consensus 1 Ma-----~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~---~~~~~~~~~~~~~~dst~~~a~~~Laa~ 72 (536) |. .+...++...+....+.+... .++.+.+.+|..-..-. ....+...+..++..+-+...|+++++. T Consensus 1 ~~~~i~~~~~~~~~~~~~~~L~~~~~~~----~~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~ 76 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDEMVSAFEDQ----NQNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAER 76 (485) T ss_pred CCCCCCCCCcccchHHHHHHHHHHHHHH----HHHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHHHhhh Confidence 33 333344444444444444443 33444444554322110 0011111122234445667777777665 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCC Q lcl|NC_011045. 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEG 152 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~ 152 (536) | +|. .|+ +. +. ......++ +.+..++|.....+..++..+||.|.+++..+.. 
T Consensus 77 l----~~~-g~~-~~--~~--------~~~~~~l~-----------~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~ 129 (485) T protein:vir:24 77 Q----AVE-GFR-LG--DA--------DEADEELW-----------QWWQANNLDIEAPLGYTDAYVHGRSYITISRPDP 129 (485) T ss_pred h----ccC-cee-cC--CC--------chhHHHHH-----------HHHHhcChhHHHHHHHHHHhhcCceEEEEecCCc Confidence 5 332 232 22 11 00111222 2334578999999999999999999998866543 Q ss_pred C-------ceeeEEEEecceEEEeeCC-CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCC Q lcl|NC_011045. 153 S-------NYNPMKLYRLSSYVVQRDA-FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSG 224 (536) Q Consensus 153 ~-------~~~~~~~~~l~~~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~ 224 (536) . +..+++.++..+.++..|+ .+++...++++.-. .......+++|+ + . T Consensus 130 ~~~~~~~~~~~~i~~~~p~~~~~i~D~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~y~---~----~ 185 (485) T protein:vir:24 130 QIDLGWDPNVPLIRVEPPTRMYAEIDPRIGRPAKAIRVAYDA-----------------EGNEIQAATLYT---P----N 185 (485) T ss_pred ccccccCCCcceEEEeccceeEEEeeCCcCceeEEEEEEEee-----------------cCCeEEEEEEEc---C----C Confidence 2 2336777887787777774 46666555544210 011111222221 1 1 Q ss_pred ceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhCCceeec-- Q lcl|NC_011045. 225 EYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEE-YLGDLRSLENLQEAIVKMSMISSKVIGLVN-- 301 (536) Q Consensus 225 ~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~-~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~-- 301 (536) ....|...+|.-+......++|..+|++.++.+...+..||+|-..+ ..+-+..++...-.+...++..+.|...+. 
T Consensus 186 ~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~ 265 (485) T protein:vir:24 186 ETFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGI 265 (485) T ss_pred cEEEEEecCCceEeecccccCCCcccEEEeccCcccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccC Confidence 12222233333332333456688999999998888888999998775 456677888887788888888888875542 Q ss_pred -ccccc----chhhhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhh-----hcccCCCCCCCH Q lcl|NC_011045. 302 -PAGIT----QPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN-----SAVQRTGERVTA 371 (536) Q Consensus 302 -~~g~~----~~~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~-----~~~~~~~~r~TA 371 (536) ++... +...+....+|.+.....+++.+.++.. +.+ ...++.++.-|.+..... .+........++ T Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~~-~~~---e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg 341 (485) T protein:vir:24 266 KPEEIGVDPETGQTLFDAYLARILAFEDAEGKIQQFSA-AEL---ANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASA 341 (485) T ss_pred CccccccccccccchhhhcccceeccCCCCceEEeecc-cch---HHHHHHHHHHHHHHhcccCCCHHHhccccCcchHH Confidence 11100 0011112233443222223333333332 222 334445555454332211 111011111233 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc---CCCCCCCCcceEEEEechHHH--HHHHHHHHHHHHHH Q lcl|NC_011045. 372 EEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQAT---QQIPELPKEAVEPTISTGLEA--IGRGQDLDKLERCV 446 (536) Q Consensus 372 tEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~---g~lp~~~~~~v~v~~vs~La~--a~r~~~~~~l~~~~ 446 (536) .-+.. ....+.... ++.+.. +.+-+.+++.++..- ...+ .....++++|..++.. 
++.+..+.+| + T Consensus 342 ~Al~~-~~~~l~~ka---~~~~~~-f~~~l~~~~~l~~~~~~~~~~~-~d~~~i~v~f~~~~~~s~~~~ad~~~kl---~ 412 (485) T protein:vir:24 342 EAIRA-AESRLIKKV---ERKNAI-FGGAWEEAMRLAYRLMKGGDVP-PDMLRMETVWRDPSTPTYAAKADAATKL---Y 412 (485) T ss_pred HHHHH-HHHHHHHHH---HHHHHH-HHHHHHHHHHHHHHHhcCCCCc-cccceeeEEecCCCCCCHHHHHHHHHHH---H Confidence 33332 222222222 222222 333444555554332 1111 2223567778655532 2222222222 2 Q ss_pred HHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhhcCcchHHh Q lcl|NC_011045. 447 AAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGM-DNGAAALAQGMAAQATASPEAMAA 525 (536) Q Consensus 447 ~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~-~~~a~~~~~~~~~~~~~~~~~~~~ 525 (536) +. +. ..+....+ .+.+|.+ +++++++++.+.++..+ .+...+ + ......++.... T Consensus 413 ~~----g~----~~~s~et~----~~~l~~~-------~d~~~e~~~~~ee~~~~~~~~~~~----~-~~~~~~~~~~~~ 468 (485) T protein:vir:24 413 GN----GQ----GVIPRERA----RKDMGYS-------IAEREEMRRWDEEEAAMGLGLLGT----M-VDADPTVPGSPN 468 (485) T ss_pred hc----cc----ccCCHHHH----HhhCCCC-------HhHHHHHHHHHHHHhhhhhhHHHh----h-cccCCCCCCCCC Confidence 21 11 01222222 2345653 33443333322221111 111111 1 111111111111 Q ss_pred hhhcCCCCCCC Q lcl|NC_011045. 526 AADSVGLQPGI 536 (536) Q Consensus 526 ~~~~~~~q~~~ 536 (536) ...+..-|||- T Consensus 469 ~~e~~~~~~~~ 479 (485) T protein:vir:24 469 PTPAPKPQPAI 479 (485) T ss_pred CCCCCCCccCC Confidence 12233334444 No 52 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.28 E-value=2.2e-10 Score=73.57 Aligned_cols=458 Identities=11% Similarity=0.062 Sum_probs=211.5 Q ss_pred CCC---------------------ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc-----ccC---CCCC--- Q lcl|NC_011045. 
1 MAE---------------------KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPS-----LFP---KDSD--- 48 (536) Q Consensus 1 Ma~---------------------~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~-----~~~---~~~~--- 48 (536) ||+ .....+.+.+.+..+ ..| ..+++.+.+|..-. +.. .... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~---~~~---~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~ 74 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLID---EHN---PEPLLKGVRYYMCENDIEKKRRTYYDAAGQQL 74 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHH---hhc---HHHHHHHHHHhccccchhhccchhcccccccc Confidence 333 111111222222211 112 24566666666432 111 1100 Q ss_pred -cccccccccccchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccCh Q lcl|NC_011045. 49 -NASTDYVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYR 127 (536) Q Consensus 49 -~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~ 127 (536) ...+...++..+-+...++..++.| |... ++++..+.. +.+.+. .+..++|- T Consensus 75 ~~~~~~~~ri~~n~~~~ivd~~~~yl----~g~~--~~~~~~d~~-------------~~~~l~--------~~~~n~~~ 127 (503) T protein:vir:59 75 VDDTKTNNRTSHAWHKLFVDQKTQYL----VGEP--VTFTSDNKT-------------LLEYVN--------ELADDDFD 127 (503) T ss_pred cccccccceeecchHHHHHHHHHhhh----hcCC--eeeccCcHH-------------HHHHHH--------HHHhcCHH Confidence 0111123444555666666666655 3211 123333322 222222 12246899 Q ss_pred HHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccC Q lcl|NC_011045. 128 VTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEK 205 (536) Q Consensus 128 ~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~ 205 (536) ....++.++..++|.+++++..+.. +.++++.++..+++...|. .+++...+|.++.. 
..++ T Consensus 128 ~~~~~~~~~~~~~G~~~~~v~~d~d-g~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~---------------~~~~ 191 (503) T protein:vir:59 128 DILNETVKNMSNKGIEYWHPFVDEE-GEFDYVIFPAEEMIVVYKDNTRRDILFALRYYSYK---------------GIMG 191 (503) T ss_pred HHHHHHHHHHhhCCeEEEEEeecCC-CceEEEEEccceeEEEEeCCCCCceEEEEEEEEEe---------------cCCC Confidence 9999999999999999988876654 3467888888887777664 36677766655431 0111 Q ss_pred CCCceEEEEEEEE---ecCCCCceeE---EEEe-cCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHH Q lcl|NC_011045. 206 KADETIDVYTHIY---LDEDSGEYIR---YEEV-EGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRS 278 (536) Q Consensus 206 ~~~~~~~v~~~v~---p~~~~~~~~~---~~~v-~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~ 278 (536) +....+++|+.-. ....+..+.. +.+. ....+......++|..+|++.++- +.+|.|-.+.+.+.+.. T Consensus 192 ~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~n-----n~~~~sd~~~~~~liDa 266 (503) T protein:vir:59 192 EETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFKN-----NEEMVSDLKFYKDLIDN 266 (503) T ss_pred ceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCccceEEecC-----CCCCCcchhhhHHHHHH Confidence 1122333333210 0001111100 0000 000011122345567888887753 45799999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCceeeccccccchhhhc-cCCC-cceecCCcccccccccccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 279 LENLQEAIVKMSMISSKVIGLVNPAGITQPRRLT-KAQT-GDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAF 356 (536) Q Consensus 279 L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~-~~~~-g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af 356 (536) +|.+.-......+....|.+.+..-...+..+.. .... 
+.+.....+++..+ ....+.+.....++.++..|.+.- T Consensus 267 ~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~i~~~s 344 (503) T protein:vir:59 267 YDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTANLRYHSVIKVSGDGGVDTL--RAEIPVDSAAKELERIQDELYKSA 344 (503) T ss_pred HHHHHHHHHHHHHHhcCCeeEeecCCccccchhhhhhhcccceeccCCCcceeE--eccCCHHHHHHHHHHHHHHHHHHh Confidence 9999999999999999998776432222222211 1122 22322333444443 333456777778888777775533 Q ss_pred hhhh-cccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHH Q lcl|NC_011045. 357 MLNS-AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGR 435 (536) Q Consensus 357 ~~~~-~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r 435 (536) .... .....+...|+..+..+..-.... .....+.-.+.|.-+++.++.++...+.....+...|+|+|.-++..-. T Consensus 345 ~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k-~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~- 422 (503) T protein:vir:59 345 QAVDNSPETIGGGATGPALENLYALLDLK-ANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTRTRIQND- 422 (503) T ss_pred cccCCCcccccccccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccceeEEeCCCCCCCH- Confidence 2211 111223456777766543333322 3334444455555555555555554444333344568888866655322 Q ss_pred HHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011045. 436 GQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQ 515 (536) Q Consensus 436 ~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~ 515 (536) .+.++.+...++. .+ +....++.. ++. 
+-..++|++.+.++++..+++.+.......+...+ T Consensus 423 ~~~~~~~~kl~~~--Gi--------iS~et~l~~----l~~----v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 484 (503) T protein:vir:59 423 SEIVQSLVQGVTG--GI--------MSKETAVAR----NPF----VQDPEEELARIEEEMNQYAEMQGNLLDDEGGDDDL 484 (503) T ss_pred HHHHHHHHHHHhC--CC--------CchHHHHHh----CCC----CCCHHHHHHHHHHHHHHHHhhhccccCccCCCCCC Confidence 2223333222211 11 112222222 221 11235677666554443332222111000111111 Q ss_pred hhcCcchHHhhhhcCCCCCC Q lcl|NC_011045. 516 ATASPEAMAAAADSVGLQPG 535 (536) Q Consensus 516 ~~~~~~~~~~~~~~~~~q~~ 535 (536) ....+...+......| |-+ T Consensus 485 ~~~~~~~~~~~~~~~g-~~~ 503 (503) T protein:vir:59 485 EEDDPNAGAAESGGAG-QVS 503 (503) T ss_pred CcCCCCCCcccCCCCC-CcC Confidence 1111111111111222 222 No 53 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.27 E-value=2.4e-10 Score=73.29 Aligned_cols=457 Identities=13% Similarity=0.069 Sum_probs=203.7 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCC---CCCcccccccccccchHHHHHHHHHHHHHH-h Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPK---DSDNASTDYVTPWQAVGARGLNNLASKLML-A 76 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~---~~~~~~~~~~~~~dst~~~a~~~Laa~l~~-~ 76 (536) |++....=+.+.+...+..+...+ ++++.+.+|..-.-... ..-....+..++..+-+..+|++++..|.- + T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~~~----~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~~G 76 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFENKQ----NELKSSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTYVDAIAERQELEG 76 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHHHH----HHHHHHHHHHhcccchhhcCcccchhhhhhhhhcchHHHHHHHHHHhhhccc Confidence 998654333444445555554443 45555555643221000 010111122345567788888888887742 3 Q ss_pred hc-CCC-cceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCC--- Q lcl|NC_011045. 
77 LF-PMQ-TWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPE--- 151 (536) Q Consensus 77 lt-P~~-~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~--- 151 (536) ++ |.. .+---...+. .+.+. +.+.+.+++|.....++.++..+||.|.+++..+. T Consensus 77 f~~~~~~~~~~~~~~d~-------------~~~~~-------l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~ 136 (488) T protein:vir:23 77 FRIPSANGEEPESGGEN-------------DPASE-------LWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEV 136 (488) T ss_pred eeccCCcccccccccch-------------hHHHH-------HHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCccc Confidence 33 421 1211111111 11111 23345678999999999999999999988876532 Q ss_pred ----CCceeeEEEEecceEEEeeC-CCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCce Q lcl|NC_011045. 152 ----GSNYNPMKLYRLSSYVVQRD-AFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEY 226 (536) Q Consensus 152 ----~~~~~~~~~~~l~~~~v~~d-~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~ 226 (536) +.+..++++++..+.++..| ..+++...++.+.- .+ +..+..++...++ .. T Consensus 137 ~~~~~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~-----------------~~---~~~~~~~~~y~~~----~~ 192 (488) T protein:vir:23 137 DFDVDPEVPLIRVEPPTALYAEVDPRTRKVLYAIRAIYG-----------------AD---GNEIVSATLYLPD----TT 192 (488) T ss_pred ccCCCCCcceEEEeccceeEEEEecCCCceEEEEEEEEe-----------------cC---CCcEEEEEEEecC----cE Confidence 12233567777777666555 45666665554420 00 1112222222111 12 Q ss_pred eEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhCCceeec---c Q lcl|NC_011045. 
227 IRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEE-YLGDLRSLENLQEAIVKMSMISSKVIGLVN---P 302 (536) Q Consensus 227 ~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~-~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~---~ 302 (536) ..|...+|.-.......++|..+|+++++.+...+..+|+|=..+ .++-+..++...-.+...++....|...+- + T Consensus 193 ~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~ 272 (488) T protein:vir:23 193 MTWLRAEGEWEAPTSTPHGLEMVPVIPISNRTRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKP 272 (488) T ss_pred EEEEecCCceEeccccccCCCCcceEEeccccccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCc Confidence 222333333222233456688999999998888899999997764 456678888888888888888887765542 1 Q ss_pred ccc----cchhhhccCCCcceecCCc-ccccccccccccchhHHHHHHHHHHHHHHHHHhhhh-----cccCCCCCCCHH Q lcl|NC_011045. 303 AGI----TQPRRLTKAQTGDFVTGRP-EDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-----AVQRTGERVTAE 372 (536) Q Consensus 303 ~g~----~~~~~~~~~~~g~~~~g~~-~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-----~~~~~~~r~TAt 372 (536) +.. .+...+.+.+.|.+..... +++.+.+++. .+ +...++.++.-|.+.+.... +.......-++. T Consensus 273 ~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~q~~~-~~---~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~ 348 (488) T protein:vir:23 273 EELGINAETGQRMFDAYMARILAFEGGEGAHAEQFSA-AE---LRNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAE 348 (488) T ss_pred ccccccccccchhhhhhhhhhccCCCCCCceeEecCC-CC---hHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHH Confidence 100 1111222333344332222 2233334332 23 34455555555544332111 110111111333 Q ss_pred HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCC--cceEEEEechHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 373 EIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPK--EAVEPTISTGLEAIGRGQDLDKLERCVAAWA 450 (536) Q Consensus 373 Ei~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~--~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~ 450 (536) -+.....-+... . ++.+.. +.+-+.+++.++...-.....+. 
..++++|..+...- ..+.++.+...++ T Consensus 349 Al~~~~~~l~~k-~---~~~~~~-f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~f~~~~~~s-~~~~ada~~kl~~--- 419 (488) T protein:vir:23 349 AIKAAESRLVKK-V---ERKNKI-FGGAWEQAMRLAYKMVKGGDIPTEYYRMETVWRDPSTPT-YAAKADAAAKLFA--- 419 (488) T ss_pred HHHHHHHHHHHH-H---HHHHHH-HHHHHHHHHHHHHHHhcCCCcchhhccceEEecCCCCCC-HHHHHHHHHHHHh--- Confidence 333322222222 1 233333 33344555555443211112222 34677775544321 1222222222222 Q ss_pred hhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcC Q lcl|NC_011045. 451 ALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSV 530 (536) Q Consensus 451 ~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (536) .+.. .+.... +.+.+|..+. ..++++++.+++.++. ..+..+..+....+...+. ....+.. T Consensus 420 -~g~~----~~s~et----~~~~l~~~~d----~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~---~~~~~~~ 481 (488) T protein:vir:23 420 -NGAG----LIPRER----GWVDMGYTIV----EREQMRQWLEQDQKQG--LGLIGSLYGASTPEGKPGE---APVGEPP 481 (488) T ss_pred -cccc----cCCHHH----HHHhCCCCch----HHHHHHHHHHHHHHHH--HHHHHHHhccCCCcccCCC---CCCCCCC Confidence 1110 122222 2333443221 1233333333222111 1111122111111111111 1112233 Q ss_pred CCCCCC Q lcl|NC_011045. 531 GLQPGI 536 (536) Q Consensus 531 ~~q~~~ 536 (536) ...|+. T Consensus 482 ~~e~~~ 487 (488) T protein:vir:23 482 APEPDA 487 (488) T ss_pred CCCCCC Confidence 344555 No 54 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=99.21 E-value=5.4e-10 Score=71.41 Aligned_cols=440 Identities=14% Similarity=0.121 Sum_probs=195.1 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccC-C--CCCcccccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_011045. 
1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFP-K--DSDNASTDYVTPWQAVGARGLNNLASKLMLAL 77 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~-~--~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536) |.-. .+.+....+.+..+ ..+...+.+|..-..-. . ...+...+..++..+-+..+|+.+++.| T Consensus 1 ~~t~-----~~~i~~L~~~~~~~----~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l---- 67 (480) T protein:vir:78 1 MTTY-----HEHVERLQGLLARD----LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL---- 67 (480) T ss_pred CCCH-----HHHHHHHHHHHHHH----HHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhh---- Confidence 5443 34455555555443 33444555554332111 1 0111111122344555777777777765 Q ss_pred cCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecC-----CC Q lcl|NC_011045. 78 FPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEP-----EG 152 (536) Q Consensus 78 tP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~-----~~ 152 (536) +|.+ |...- |. +....+ ...+.+++|.....+++++..+||.|.++|... +. T Consensus 68 ~~~g--~~~~~-d~---------~~~~~l-----------~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~ 124 (480) T protein:vir:78 68 DIEG--FRISE-DS---------EGLEEL-----------WNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDP 124 (480) T ss_pred ccCc--eecCC-Cc---------hhHHHH-----------HHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCC Confidence 2322 22211 11 111122 233456899999999999999999998877542 23 Q ss_pred CceeeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEE Q lcl|NC_011045. 153 SNYNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYE 230 (536) Q Consensus 153 ~~~~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~ 230 (536) .+..+++.++..+.++..|+ .+++...+|.+.-. ........+++|+. +....|. 
T Consensus 125 ~g~~~i~~~~p~~~~~~~D~~~~~~~~~~i~~~~~~----------------~~~~~~~~~~~y~~-------~~~~~~~ 181 (480) T protein:vir:78 125 AGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR----------------DDVAVPDRATLYLP-------DETVPLR 181 (480) T ss_pred CCeeEEEEEcccceEEEEcCCCccceEEEEEEEEee----------------cCCCceEEEEEEeC-------CeEEEEE Confidence 34467888988888888885 46777666555311 01111123333321 1111121 Q ss_pred EecCc----cccccccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhCCceeeccccc Q lcl|NC_011045. 231 EVEGM----EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEE-YLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGI 305 (536) Q Consensus 231 ~v~g~----~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~-~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~ 305 (536) ...+. ....+...+++..+|+++++.+...+..||+|=.++ ..+-+-.++...-.+....+..+.|...+. |. T Consensus 182 ~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~--G~ 259 (480) T protein:vir:78 182 RNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GV 259 (480) T ss_pred ecCCCccccccccccccCCCCCcceEEeecccccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhh--cC Confidence 11111 111222345678999999998888899999998876 568888889888888888888888875552 21 Q ss_pred cchhhh--------ccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhh-----hcccCCCCC-CCH Q lcl|NC_011045. 306 TQPRRL--------TKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN-----SAVQRTGER-VTA 371 (536) Q Consensus 306 ~~~~~~--------~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~-----~~~~~~~~r-~TA 371 (536) . +..+ .....|.+..-..+++.+.++.. .+++ ..++.++.-|.+.+... .+. ..... 
-++ T Consensus 260 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~~~~~l~~~i~~~~~~~~~p~~~~g-~~~~n~~Sg 333 (480) T protein:vir:78 260 T-TDELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELR---NFAEEMEVFRKEAASITGLPPQYLS-SSSENPASA 333 (480) T ss_pred C-ccccccccccchhhhhhhhhccCCCCCceEEecCc-cCHH---HHHHHHHHHHHHHhcccCCChHHhc-cccCcchHH Confidence 1 1111 11122333222233444444442 2344 34444544444432211 111 11111 233 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCC--cceEEEEechHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 372 EEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-QQIPELPK--EAVEPTISTGLEAIGRGQDLDKLERCVAA 448 (536) Q Consensus 372 tEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-g~lp~~~~--~~v~v~~vs~La~a~r~~~~~~l~~~~~~ 448 (536) .-++.....+.. ...+.... +.+-+.+++.++... |. ..+. ..++++|.-+...- ..+.++.+.+.++ T Consensus 334 ~Alk~~~~~l~~----ka~~~~~~-f~~~l~~~~~l~~~~~g~--~~~~~~~~i~v~f~~~~~~s-~~~~ad~~~kl~~- 404 (480) T protein:vir:78 334 EAIIATDSRIVK----MAERKGRI-FGGAWERAMRIAMQIMGR--EVTEEYTRLETVWRDPSTPT-VAAKADAVSKLYA- 404 (480) T ss_pred HHHHHHHHHHHH----HHHHHHHH-HHHHHHHHHHHHHHHcCC--CccccceeeeEEecCCCCCC-HHHHHHHHHHHHH- Confidence 333322221111 11232222 333445555554432 21 1112 23566664433221 1122222222222 Q ss_pred HHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHhhhcCcchHHhh Q lcl|NC_011045. 449 WAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGM--AAQATASPEAMAAA 526 (536) Q Consensus 449 ~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~--~~~~~~~~~~~~~~ 526 (536) .+.. .+.... +...+|.. +++++++.+.++++.+ +...+..... .++.+..+++...+ T Consensus 405 ---~g~~----~~s~et----~~~~lg~~-------~d~~~~~~~~~~e~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 464 (480) T protein:vir:78 405 ---NGQG----PIPKEQ----ARIDLGYT-------ATQREQMRDWDKQETE--DMIDTLYSTTKAQADATPKPTVTETK 464 (480) T ss_pred ---hccc----cCCHHH----HHhcCCCC-------HhHHHHHHHHHHHHHH--HHHHHhhccccccCCCCCCCCCCCCC Confidence 1110 122222 23335553 3444444322211111 1111111111 11112222222222 Q ss_pred hhcCCCCCCC Q lcl|NC_011045. 
527 ADSVGLQPGI 536 (536) Q Consensus 527 ~~~~~~q~~~ 536 (536) ..+....-|. T Consensus 465 ~~~~~~~~~~ 474 (480) T protein:vir:78 465 TETQTSPSGF 474 (480) T ss_pred CccccccCCC Confidence 2222222233 No 55 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.20 E-value=6.3e-10 Score=71.02 Aligned_cols=450 Identities=10% Similarity=0.037 Sum_probs=194.0 Q ss_pred CCCcc----------ccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCC-CC--cccccccccccchHHHHHH Q lcl|NC_011045. 1 MAEKR----------TGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKD-SD--NASTDYVTPWQAVGARGLN 67 (536) Q Consensus 1 Ma~~~----------~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~-~~--~~~~~~~~~~dst~~~a~~ 67 (536) |.+.. .+|+.+. ....+.|......+.++.+++.+|..-...... +. +...+.-+..-+-+..+|+ T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e-~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~iVd 79 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDV-VDKVNGLYQQLVDRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKAVD 79 (504) T ss_pred CCccCCcccccccccCCCCHHH-HHHHHHHHHHHHHHhHHHHHHHHHHhccccchhccccccHHHHHHhhccCcHHHHHH Confidence 54432 3333333 233455555555556677777777644321111 11 1111111234455777777 Q ss_pred HHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEE Q lcl|NC_011045. 68 NLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYL 147 (536) Q Consensus 68 ~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~ 147 (536) +||..|. +.+ |++ ++.. 
.....+++ ...+++|.....++.++..+||.+.++| T Consensus 80 ~~a~rl~----~~G--f~~--~d~~--------~~~~~l~~-----------i~~~N~ld~~~~~~~~~a~iyG~af~~v 132 (504) T protein:vir:99 80 TLARRCN----LES--FVW--PDGD--------YGSIGGPD-----------VWDENFFATKANNAMVSSLIHGPAFLIN 132 (504) T ss_pred HHHhhhc----cce--eeC--CCCC--------hhhHHHHH-----------HHHhcChhhHHHHHHHHHHhhCceeEEE Confidence 7776542 211 222 2110 01112222 3345899999999999999999999988 Q ss_pred ecCCCCc-eeeEEEEecceEEEeeCC-CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCc Q lcl|NC_011045. 148 PEPEGSN-YNPMKLYRLSSYVVQRDA-FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGE 225 (536) Q Consensus 148 ~~~~~~~-~~~~~~~~l~~~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~ 225 (536) -.+..+. ..++++++..+..+..|+ .+++...++.... ........+++|. + +.. T Consensus 133 ~~~~d~~~~~~I~~~sP~~~~~iyD~~~~~~~~a~~~~~~-----------------d~~g~~~~~~~y~---~---~~~ 189 (504) T protein:vir:99 133 TEGGAGEPDSLIHVKSAMQATGEWNSRRNAMDSLLSITSR-----------------DAEGHPTGIALYE---D---GVT 189 (504) T ss_pred ecCCCCCceeEEEEeccceeEEEEeCCCCceeEEEEEEEe-----------------cCCCeEEEEEEEc---C---CcE Confidence 7655433 345677787777666664 4444433332210 0000111233321 1 000 Q ss_pred eeEEEEecCccccccccccccccCceEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccc Q lcl|NC_011045. 226 YIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYI-EEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAG 304 (536) Q Consensus 226 ~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~-~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g 304 (536) +.....-.|... .+...+++ .+|++++..+...++.||+|-. 
+..++-+..+|...-.++..+++.+.|...+- | T Consensus 190 ~~~~~~~~~~~~-~~~~~~~~-gvPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~--G 265 (504) T protein:vir:99 190 VTADMDDDGDWH-ADVRTHKL-GVPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILL--G 265 (504) T ss_pred EEEEEcCCceee-eccccCCC-CcceEEecccccCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--c Confidence 000000011111 11223444 3899999888888899999965 46789999999999999999999888875541 1 Q ss_pred cc---------chhhhccCCCccee--cCCccc-------ccccccccccchhHHHHHHHHHHHHHHHHHhhhhcc---- Q lcl|NC_011045. 305 IT---------QPRRLTKAQTGDFV--TGRPED-------ISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAV---- 362 (536) Q Consensus 305 ~~---------~~~~~~~~~~g~~~--~g~~~~-------~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~---- 362 (536) .. ++........+.+. +.+.++ +.+.++. .++++. .++.++.-|.......... T Consensus 266 ~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~-~~~l~~---~~~~l~~~i~~~a~~t~~P~~~l 341 (504) T protein:vir:99 266 ADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDAARARADVKQFP-ASSPQP---HIEMLEQIAMMFSGETSIPVESL 341 (504) T ss_pred CCccccccccccccchhhhhhhhhhcCCCccccccccCccceeeecC-CCChHH---HHHHHHHHHHHHHhhhCCCHHHh Confidence 11 11111112222221 221111 1111221 223443 3444444443322211110 Q ss_pred --cCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCCCc--ceEEEEechH--HHHHH Q lcl|NC_011045. 363 --QRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQA-TQQIPELPKE--AVEPTISTGL--EAIGR 435 (536) Q Consensus 363 --~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~-~g~lp~~~~~--~v~v~~vs~L--a~a~r 435 (536) ..+...-+|.-|.+...-+.+ ...+.+.-| ..-+.+++.++.. .+.....+.+ .+++.+.-+. ..++. 
T Consensus 342 G~~~~~n~sSa~Ai~~~~~~L~~----ka~~k~~~f-~~~l~~~~rla~~~~~~~~~~~~~~~~~~v~w~d~~~~s~a~~ 416 (504) T protein:vir:99 342 GFSNRANPTSADAYIASREDLIA----EAEGATDDW-SPAFRRSMIRALAIKNGLDRIPPEWKTIDSKFRSPLYLSKAAQ 416 (504) T ss_pred cccccccccHHHHHHHHHHHHHH----HHHHHHHHH-HHHHHHHHHHHHHHhcCCCccccccccceeEecCCCccCHHHH Confidence 011122244444332222222 122322222 2223333333322 1223333333 3555564333 23333 Q ss_pred HHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHH----- Q lcl|NC_011045. 436 GQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQ----- 510 (536) Q Consensus 436 ~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~----- 510 (536) +..+.+|. +.++..+ ...+. +.+.+|+++ +|++.+.++++.++... ...+.+. T Consensus 417 aDa~~Kl~-------~ag~~l~---~~~~~----l~~~lg~~~-------~ei~r~~~e~~~~~~~~-~~~~l~~~~~~~ 474 (504) T protein:vir:99 417 ADAGAKML-------GAGPEWL---KETEV----GLELLGLTP-------QQAKRALAERRRASSVS-IIEALNRRQQEA 474 (504) T ss_pred HHHHHHHH-------hhccccc---cchHH----HHhhcCCCH-------HHHHHHHHHHHHHhhHH-HHHHHhcccCCC Confidence 33333332 2222111 11222 334457644 44443333222111111 0011110 Q ss_pred --HHHHhhhcCcchHHhhhhcCCCCCCC Q lcl|NC_011045. 511 --GMAAQATASPEAMAAAADSVGLQPGI 536 (536) Q Consensus 511 --~~~~~~~~~~~~~~~~~~~~~~q~~~ 536 (536) +...+....+++.+..+.+++-+|.+ T Consensus 475 ~~~~~~~~~~~~e~a~~~~~~~~~~p~~ 502 (504) T protein:vir:99 475 ATAGEDQDQGAGEPPANEPPAALGRPTL 502 (504) T ss_pred CCCCCCCCcCCCCCCCCCCCccCCCccc Confidence 00001111223333344466667777 No 56 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=99.18 E-value=7.6e-10 Score=70.57 Aligned_cols=438 Identities=11% Similarity=0.056 Sum_probs=204.9 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc---cCCCC---CcccccccccccchHHHHHHHHHHHHH Q lcl|NC_011045. 
1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL---FPKDS---DNASTDYVTPWQAVGARGLNNLASKLM 74 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~---~~~~~---~~~~~~~~~~~dst~~~a~~~Laa~l~ 74 (536) |.+..+.++.+.+.+..+..+..| .++|+.+.+|....- ..... ....+...++..+.+...++..++.| T Consensus 23 ~~~~~~~~~~~~i~~~i~~~~~~~---~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l- 98 (481) T protein:vir:10 23 VSDLAELLKEENLRNFISRHQTEQ---VPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHNYAKYVSRFIVGYL- 98 (481) T ss_pred eecchhhcCHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCcccccCccccccccccccceeecchHHHHHHHHHhhh- Confidence 777777777888877777665544 456667777764421 11110 11112233455666777777766544 Q ss_pred HhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCc Q lcl|NC_011045. 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSN 154 (536) Q Consensus 75 ~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~ 154 (536) |...+ .+...+... . ..+.+.+.+++|.....++.++..++|.+.+++..+.. + T Consensus 99 ---~g~~~--~~~~~d~~~-------------~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~d-g 152 (481) T protein:vir:10 99 ---TGNPI--TITHQDNQT-------------N-------DKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFE-D 152 (481) T ss_pred ---ccCCc--eEecCChhH-------------H-------HHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCC-C Confidence 32112 222222211 1 12334455688999999999999999999887765554 3 Q ss_pred eeeEEEEecceEEEeeCCC--CCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEe Q lcl|NC_011045. 155 YNPMKLYRLSSYVVQRDAF--GNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEV 232 (536) Q Consensus 155 ~~~~~~~~l~~~~v~~d~~--G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v 232 (536) .++++.++..+.++..|.. +++...+|.++..- .++.....+++|+. .+. .+... 
T Consensus 153 ~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~---------------~~~~~~~~~~~y~~-------~~i-~~~~~ 209 (481) T protein:vir:10 153 RDTFKVLDPKSTFVVYDQTLDKKVVAGVRYFEKQD---------------KDKVPVQHVEVYTT-------DKI-YYIEI 209 (481) T ss_pred eEEEEEEcccceEEEEcCCCCCceEEEEEEEEEee---------------CCCceEEEEEEEec-------CeE-EEEEe Confidence 4578889988887777754 45666555544220 01111122333321 111 11112 Q ss_pred cCccc-cccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhh Q lcl|NC_011045. 233 EGMEV-QGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRL 311 (536) Q Consensus 233 ~g~~i-~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~ 311 (536) ++... ......+++..+|++.++- +.+|+|-.+...+-+..++.+.-......+....|.+.+.-....+.++. T Consensus 210 ~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~ 284 (481) T protein:vir:10 210 KGGTYHRVEEVEHYYNDVPIIEYLN-----DQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDA 284 (481) T ss_pred cCCceeecccccccCCceeEEEeec-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCccch Confidence 22111 1122344567889876543 46799999999999999999888888888888999887753222223222 Q ss_pred ccCCCcce-e-------cCCcccccccccccccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCCCCHHHHHHHHHHHH Q lcl|NC_011045. 312 TKAQTGDF-V-------TGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFM-LNSAVQRTGERVTAEEIRYVASELE 382 (536) Q Consensus 312 ~~~~~g~~-~-------~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~r~TAtEi~~r~~E~~ 382 (536) .....+.+ . .+..++..+..+....+.+.....++.++..|...-. 
.+......+...|+..+.....-+ T Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l- 363 (481) T protein:vir:10 285 KAFRDANMIHLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESMKYKLFGL- 363 (481) T ss_pred hhhhhccceeccccccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHH- Confidence 22111111 1 1111111122222223445556666666665533211 111111112233554443322222 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHHhc---CCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhh Q lcl|NC_011045. 383 DTLGGVYSILSQELQLPLVRVLLKQLQAT---QQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDP 459 (536) Q Consensus 383 ~~LG~v~~rl~~E~l~Pli~r~~~il~~~---g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~ 459 (536) .. ..++.+ ..+...+.+++.++.+. ....+.....+++.|.-++..- ..+.++.+... +.+ T Consensus 364 ~~---k~~~~~-~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~~~~-~~~~a~~~~kl----~g~------- 427 (481) T protein:vir:10 364 EQ---VRAIKE-RLFKKGLMKRYKLLLNNVNLTGLKQHNYAELTITFTPNLPKS-MMESINAFNAL----SGG------- 427 (481) T ss_pred HH---HHHHHH-HHHHHHHHHHHHHHHHHHhccCCCccccceeeEEeCCCCCcC-HHHHHHHHHHH----hcc------- Confidence 12 222222 22333444444444332 1112233345777775444321 12222222221 111 Q ss_pred cCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCC Q lcl|NC_011045. 460 DINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVG 531 (536) Q Consensus 460 ~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~ 531 (536) |....+++. ++. +-..++|++.+++++.+++...++. ...+... 
.....-+..| T Consensus 428 -is~et~~~~----l~~----i~d~~~E~~ri~~E~~~~~~~~~~~------~~~~~~~---~~~~~dd~~g 481 (481) T protein:vir:10 428 -VSESTRLSL----LDF----IDNPKEELEKMQEEEAQREKQADKR------GYGEAFE---NHLNVDDSNG 481 (481) T ss_pred -CChHHHHHh----CCC----CCCHHHHHHHHHHHHHHHHhhhhhc------cCCccCC---CCCCCCCCCC Confidence 222223332 222 1123567766665554332221111 0001100 1111112222 No 57 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=99.16 E-value=1e-09 Score=69.87 Aligned_cols=437 Identities=10% Similarity=0.030 Sum_probs=200.0 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc-----c-cCCCC---CcccccccccccchHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPS-----L-FPKDS---DNASTDYVTPWQAVGARGLNNLAS 71 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~-----~-~~~~~---~~~~~~~~~~~dst~~~a~~~Laa 71 (536) +-........+.+++..+..+.. ..+++.+.+|..-. + ..... ....+...++..+-+...+++.++ T Consensus 37 ~~~~~~~~~~~~i~~~i~~~~~~----~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~ 112 (492) T protein:vir:94 37 RTNNKPETLEEMIVRYIKQHLEK----LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVS 112 (492) T ss_pred ccCCchhhHHHHHHHHHHHHHHH----HHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHHh Confidence 33333344456666665665543 34455666665321 1 00000 011112235667778888888887 Q ss_pred HHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCC Q lcl|NC_011045. 72 KLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPE 151 (536) Q Consensus 72 ~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~ 151 (536) .|++ -| + .++..|.. ....++.|+ .++|-....++.++..+||.|.+++..+. 
T Consensus 113 yl~G--~p--~--~~~~~d~~---------~~~~l~~~~------------~n~~~~~~~~~~~~a~~~G~a~~~v~~d~ 165 (492) T protein:vir:94 113 YIVG--KP--I--AFKHTDDE---------VVKRIDEVL------------GNRFDDKLHSVLTGASNKGIEWLHPYLDE 165 (492) T ss_pred hhcc--cC--c--eeccCchH---------HHHHHHHHH------------hccHHHHHHHHHHHHhhCCeEEEEEEecC Confidence 6532 12 1 22333221 112233332 35788888999999999999988776655 Q ss_pred CCceeeEEEEecceEEEeeC--CCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEE---EecCCCCce Q lcl|NC_011045. 152 GSNYNPMKLYRLSSYVVQRD--AFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHI---YLDEDSGEY 226 (536) Q Consensus 152 ~~~~~~~~~~~l~~~~v~~d--~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v---~p~~~~~~~ 226 (536) .+ .++++.++..+.++..| ..+++...+|.+... ....+++|+-. +.+.+++.. T Consensus 166 dg-~~~~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~--------------------~~~~~~~y~~~~v~~~~~~~~~~ 224 (492) T protein:vir:94 166 EG-EFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLE--------------------NETKVEYWDKVTVNYYVYENGSL 224 (492) T ss_pred CC-ceEEEEEcccceEEEEcCCCCCceEEEEEEEeec--------------------cceeEEEEecCeEEEEEEecCee Confidence 43 35678888777655554 456776666655421 01123333210 011111111 Q ss_pred eEEE--EecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccc Q lcl|NC_011045. 227 IRYE--EVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAG 304 (536) Q Consensus 227 ~~~~--~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g 304 (536) .... ..++..+ ....+++..+|++.++- +.+|.|=.+..++.+..++.+.-.+....+....|.+++.--. 
T Consensus 225 ~~~~~~~~~~~~~--~~~~~~~g~vPvv~~~n-----n~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~ 297 (492) T protein:vir:94 225 IPDYSNNLENSKT--HFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYD 297 (492) T ss_pred eeccccccccccc--cccccCCCccceEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC Confidence 1111 1111111 22345567888876654 4579999999999999999998888889999999987663211 Q ss_pred ccchhhhcc--CCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCCCCHHHHHHHHHHH Q lcl|NC_011045. 305 ITQPRRLTK--AQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGERVTAEEIRYVASEL 381 (536) Q Consensus 305 ~~~~~~~~~--~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~r~TAtEi~~r~~E~ 381 (536) ..+..+... ...+.+.-+..+++..+ ....+.......++.++..|...-..-. ....-+...|+.-+.....- T Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~- 374 (492) T protein:vir:94 298 DQELPEFKRLLRYYGAIKVSDNGGVDTI--QVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTN- 374 (492) T ss_pred cccchhhHHHHhhccceecCCCCcceeE--eccCCHHHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHH- Confidence 111111111 11223322333444433 3344566677777777776654322111 11111223344433322211 Q ss_pred HHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcC Q lcl|NC_011045. 382 EDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDI 461 (536) Q Consensus 382 ~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~i 461 (536) +.... ++.+.. +...+.+++.++.+..-.+ ....+++|+|.-.+..- ..+.++.+.. 
+.+ .+ T Consensus 375 l~~k~---~~k~~~-f~~~l~~~~~li~~~~~~~-~~~~~i~v~f~~~~p~~-~~e~~~~~~k-------l~g-----ii 436 (492) T protein:vir:94 375 LNLKA---DKLARK-AKVAIQELLWFVFEHFDIK-GEHKDVDISFNYNKVAN-TELQVQTAQQ-------SMG-----IV 436 (492) T ss_pred HHHHH---HHHHHH-HHHHHHHHHHHHHHHhcCC-cccceeeEEecCCCCCC-HHHHHHHHHH-------Hhc-----cC Confidence 22222 222222 2223444444443322221 12235667665444321 1111121111 111 01 Q ss_pred CHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCC Q lcl|NC_011045. 462 NLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGL 532 (536) Q Consensus 462 d~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (536) ....++ ..+|..+ -.++|++++.+++++.++..+... . .....++.....++..+- T Consensus 437 S~et~~----~~l~~v~----d~~~E~eri~~E~~~~~~~~~~~~----~---~~~~~~~~~~~~~~~e~e 492 (492) T protein:vir:94 437 SHETVL----ENHPFVE----DLQAELERIEQEQMEYNKQLPNLD----D---GGADSAQQQERSNNKESE 492 (492) T ss_pred chHHHH----HhCCCCC----CHHHHHHHHHHHHHHHHhhccccc----c---ccCCCCccccCCccccCC Confidence 222222 2233211 235677777665543333221111 0 011111111122222221 No 58 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.16 E-value=1.1e-09 Score=69.77 Aligned_cols=446 Identities=10% Similarity=0.068 Sum_probs=191.7 Q ss_pred CCCcccccc-----HHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCC---CCCcccccccccccchHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLA-----EEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPK---DSDNASTDYVTPWQAVGARGLNNLASK 72 (536) Q Consensus 1 Ma~~~~~~~-----~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~---~~~~~~~~~~~~~dst~~~a~~~Laa~ 72 (536) |.=...++. ...+...+..+.. ..++.+++.+|..-..-.. ...+...+.-+...+-+..+|++++.. 
T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~~----~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~ 76 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFED----STQNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAER 76 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHH----HHHHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhh Confidence 544333332 2223333333333 3355566666664322111 011111112233446677777777776 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCC Q lcl|NC_011045. 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEG 152 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~ 152 (536) | +|.+ |+.. .+. +....++ +.+.+++|.....++.++..+||.|.+++..+.. T Consensus 77 l----~~~g--~~~~-~~~---------~~~~~~~-----------~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~ 129 (485) T protein:vir:10 77 Q----AVEG--FRFG-DAD---------EADEELW-----------QWWQANNLDIEAPLGYTDAYVHGRSYITISRPDP 129 (485) T ss_pred h----cccc--eecC-CCc---------hhHHHHH-----------HHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCc Confidence 5 3322 2221 111 1111222 2345689999999999999999999888765532 Q ss_pred C-------ceeeEEEEecceEEEeeCC-CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCC Q lcl|NC_011045. 153 S-------NYNPMKLYRLSSYVVQRDA-FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSG 224 (536) Q Consensus 153 ~-------~~~~~~~~~l~~~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~ 224 (536) . +..++++++.-+.++..|+ .+++...++.+.- ...+....+++|+- + T Consensus 130 ~~~~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~y~~------~- 185 (485) T protein:vir:10 130 QIDLGWDPNTPIIRVEPPTRMYAEIDPRIGRVSKAIRVAYD-----------------AEGNEIQAATLYTP------N- 185 (485) T ss_pred ccccccCCCeeEEEEEccceeEEEEcCCCCceeEEEEEEEe-----------------eCCCeEEEEEEEeC------C- Confidence 1 2335777887777777774 4556555554321 00111112222221 1 Q ss_pred ceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhCCceeec-- Q lcl|NC_011045. 
225 EYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEE-YLGDLRSLENLQEAIVKMSMISSKVIGLVN-- 301 (536) Q Consensus 225 ~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~-~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~-- 301 (536) ....|....|.-.......+++..+|++.+..+...+..||+|=.+. .++-+..++...-.+...++..+.|...+. T Consensus 186 ~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~ 265 (485) T protein:vir:10 186 DIFGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGI 265 (485) T ss_pred eEEEEEEcCCceEEeccccCCCCcccEEEeccccccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcC Confidence 11111222222111222346678999999999999999999997765 456678888888888888888888875542 Q ss_pred -cccc----cchhhhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhh-----hcccCCCCCCCH Q lcl|NC_011045. 302 -PAGI----TQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN-----SAVQRTGERVTA 371 (536) Q Consensus 302 -~~g~----~~~~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~-----~~~~~~~~r~TA 371 (536) ++.+ .+........+|.+.....+++.+.++.. ..++ ..++.++.-|.+..... .+........++ T Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~d~k~~q~~~-~~~~---~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg 341 (485) T protein:vir:10 266 KPEEIGVDPETGQTLFDAYLARILAFEDAEGKIQQFSA-AELA---NFTNALDQIAKQVAAYTGLPPQYLSTAADNPASA 341 (485) T ss_pred CcccccccccccchhhhhcccceeccCCCCceEEeecc-cchH---HHHHHHHHHHHHHhcccCCCHHHhccccCchhHH Confidence 1100 00111122234443332223444444432 2333 44455555554433211 111011111234 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCC--CcceEEEEechHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 372 EEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELP--KEAVEPTISTGLEAIGRGQDLDKLERCVAAW 449 (536) Q Consensus 372 tEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~--~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~ 449 (536) .-+......+... .++.+ +.+.+-+.+++.++...-.....+ ...++|+|..++.... 
++.++.+...++ T Consensus 342 ~Al~~~~~~l~~k----~~~k~-~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~~~-~~~ada~~kl~~-- 413 (485) T protein:vir:10 342 EAIRAAESRLIKK----VERKN-SIFGGAWEEAMRLAYRMMKGGDVPPDMLRMETVWRDPSTPTY-AAKADAASKLYN-- 413 (485) T ss_pred HHHHHHHHHHHHH----HHHHH-HHHHHHHHHHHHHHHHHhCCCCCcccceeeeEEecCCCCCCH-HHHHHHHHHHHh-- Confidence 3333322222221 12222 223334444444443321112222 2346777765553321 112222222111 Q ss_pred HhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhcCcchHHhhhh Q lcl|NC_011045. 450 AALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMG-MDNGAAALAQGMAAQATASPEAMAAAAD 528 (536) Q Consensus 450 ~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~-~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 528 (536) .+. ..+..+.+. +.+|+++ +++++++..++++++ ...+..+... ...+.+ .+...+ T Consensus 414 --ag~----~~~s~et~~----~~lg~~~-------~~~~~~~~~~ee~~~~~~~~~~~~~~----~~~~~~--~~~~~~ 470 (485) T protein:vir:10 414 --GGT----GVIPRERAR----KDMGYSI-------AEREEMRRWDEEEAAMGLGLIGTMVD----PNPTVP--GSPSPA 470 (485) T ss_pred --ccc----cCCCHHHHH----HhCCCCH-------hHHHHHHHHHHHHHHHHHHHHHHhhc----cCCCCC--CCCCcc Confidence 121 012222222 3356643 333333222211111 1111111111 100000 000011 Q ss_pred cCCCCCCC Q lcl|NC_011045. 529 SVGLQPGI 536 (536) Q Consensus 529 ~~~~q~~~ 536 (536) .++-+|+- T Consensus 471 ~~~~~~~~ 478 (485) T protein:vir:10 471 PAPKPAAL 478 (485) T ss_pred ccccCcCC Confidence 11122222 No 59 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=99.15 E-value=1.1e-09 Score=69.59 Aligned_cols=440 Identities=9% Similarity=0.002 Sum_probs=204.7 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHh--cccccCCC---CC-cccccccccccchHHHHHHHHHHHHH Q lcl|NC_011045. 
1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYT--IPSLFPKD---SD-NASTDYVTPWQAVGARGLNNLASKLM 74 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~--~P~~~~~~---~~-~~~~~~~~~~dst~~~a~~~Laa~l~ 74 (536) =..++..++++.+.+..+..+..+ +....++++++-- ++.+-... .. ...+...++..+-+...++..++.|+ T Consensus 20 ~~~~~~~~~~~~i~~~i~~~~~~~-~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~ 98 (474) T protein:vir:95 20 QLKPQFETQEEMIIRLIDDHRKQL-DKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRITTNFHQNLVDQKVSYVA 98 (474) T ss_pred hhhhccCChHHHHHHHHHHHHHHH-HHHHHHHHHhcccCchhccccccccccccccccccceeccchHHHHHHHHHhhhc Confidence 122334456677776666665443 3344555555321 11111111 11 11122335666777777777776553 Q ss_pred HhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCc Q lcl|NC_011045. 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSN 154 (536) Q Consensus 75 ~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~ 154 (536) + -|. .+...|... ...++.|+ .++|...+.++.++..++|.|.+++..+.. + T Consensus 99 g--~p~----~~~~~d~~~---------~~~l~~~~------------~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~-~ 150 (474) T protein:vir:95 99 S--KPV----TYSCEDESV---------LKIIHDVL------------DTRWDNKLIDILTATSNKGIDWLQVYINEN-G 150 (474) T ss_pred c--CCc----eeccCchHH---------HHHHHHHH------------hccHHHHHHHHHHHHhhcCcEEEEEEecCC-C Confidence 3 121 233333221 11233333 368999999999999999999888776554 3 Q ss_pred eeeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEE---EecCCCCceeEE Q lcl|NC_011045. 155 YNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHI---YLDEDSGEYIRY 229 (536) Q Consensus 155 ~~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v---~p~~~~~~~~~~ 229 (536) .+++.+++..+++...|. .|++..++|.+... ....+++|+.- +.+.+++.+... 
T Consensus 151 ~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~--------------------~~~~~~~y~~~~~~~~~~~~~~~~~~ 210 (474) T protein:vir:95 151 EMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFN--------------------NEEKVEFWTDTTVTYYVLENGGLIPD 210 (474) T ss_pred ceEEEEEcccceEEEEcCCCCCceEEEEEEEEEc--------------------CeeEEEEEeCCeEEEEEEcCCccccc Confidence 467788877776665553 56777777665421 11233443220 111122222221 Q ss_pred EEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchh Q lcl|NC_011045. 230 EEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPR 309 (536) Q Consensus 230 ~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~ 309 (536) .......+......+++..+|++.++. +.+|.|=.+...+.+..+|.+.-......+....|.+++..-...+.. T Consensus 211 ~~~~~~~~~~~~~~~~~g~iPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~ 285 (474) T protein:vir:95 211 YYYGANHIQSHFSNGNWGRVPFIAFKN-----NPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLE 285 (474) T ss_pred cccCcccccccccccCCCccceEeecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccch Confidence 111112222223445577899988754 467999999999999999999999999999999998776532222222 Q ss_pred hhcc-CCCccee-cCCcccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhcccCCCCCCCHHHHHHHHHHHHHHhh Q lcl|NC_011045. 310 RLTK-AQTGDFV-TGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRTGERVTAEEIRYVASELEDTLG 386 (536) Q Consensus 310 ~~~~-~~~g~~~-~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~~~r~TAtEi~~r~~E~~~~LG 386 (536) ++.. ...+.++ ....+++.. +....+.......++.+.+.|...-.. +......+...|+..+..+..-+... . 
T Consensus 286 ~~~~~~~~~~~i~~~~~~~~~~--l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k-~ 362 (474) T protein:vir:95 286 EFMRGLKYYKAINVDGDGGVET--IQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLK-A 362 (474) T ss_pred hhhhhhhccceeeccCCCceeE--EeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHH-H Confidence 2221 1222223 223334433 333456777777888887777543221 11111112234665544332222221 1 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHH Q lcl|NC_011045. 387 GVYSILSQELQLPLVRVLLKQLQAT-QQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAM 465 (536) Q Consensus 387 ~v~~rl~~E~l~Pli~r~~~il~~~-g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~ 465 (536) ....+. +...+.+++.++.+. |. ......++|+|.-.+..- ..+.++. +.+.+ .|.... T Consensus 363 ~~k~~~----~~~~l~~~~~li~~~~g~--~~d~~~i~v~f~~~~p~d-~~e~a~~-------~~~~g------~iS~et 422 (474) T protein:vir:95 363 NKLKNK----ATVAIQELIGFIIDFNNL--KMDVKDIEISFNFNRMMN-DAEQSQI-------IAQSQ------YLSRET 422 (474) T ss_pred HHHHHH----HHHHHHHHHHHHHHHhCC--CcccceeeEEeccCCCcC-HHHHHHH-------HHhcC------CCchHH Confidence 111222 233334444444332 21 122334556553332221 1111121 11111 122233 Q ss_pred HHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhh Q lcl|NC_011045. 466 IKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAAD 528 (536) Q Consensus 466 ~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 528 (536) ++.. ++. 
+--.++|++++.++++..++.++...........+ ..+.....++ T Consensus 423 ~i~~----l~~----v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~---~~~~~~~~~~ 474 (474) T protein:vir:95 423 LVKS----SPL----VDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQ---QERSNDKESE 474 (474) T ss_pred HHHh----CCC----CCCHHHHHHHHHHHHHHHHhcccccccccCCCCcC---CCCCccCCCC Confidence 3322 221 11235677776665544333222111100000000 0000111111 No 60 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.14 E-value=1.3e-09 Score=69.37 Aligned_cols=448 Identities=11% Similarity=0.064 Sum_probs=190.4 Q ss_pred CCCcccccc-----HHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCC---CCCcccccccccccchHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLA-----EEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPK---DSDNASTDYVTPWQAVGARGLNNLASK 72 (536) Q Consensus 1 Ma~~~~~~~-----~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~---~~~~~~~~~~~~~dst~~~a~~~Laa~ 72 (536) |.--..+++ .+-+.....++.. ..++.+.+.+|..-..-.. .......+..+...+-+..+|++++.. T Consensus 1 ~~~~~~~~~e~~~~~~~~~~l~~~~~~----~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~ 76 (486) T protein:vir:42 1 MTAPLPGMEEIEDPAVVREEMISAFED----ASKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAER 76 (486) T ss_pred CCCCCCCCCCcccHHHHHHHHHHHHHH----HHHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHHhh Confidence 444333332 2223333333333 2345555556653321100 001111111123445567777776665 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCC Q lcl|NC_011045. 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEG 152 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~ 152 (536) | +|.+ | ++. +. ......+ .+.+.+++|.....++.++..+||.+.++|..+.. 
T Consensus 77 l----~~~g-~-~~~--~~--------~~~~~~~-----------~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~ 129 (486) T protein:vir:42 77 Q----AVEG-F-RLG--DA--------DEADEEL-----------WQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDP 129 (486) T ss_pred h----cccc-e-ecC--CC--------chhHHHH-----------HHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCc Confidence 4 3422 2 221 11 0011112 23344588999999999999999999888754432 Q ss_pred -------CceeeEEEEecceEEEeeC-CCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCC Q lcl|NC_011045. 153 -------SNYNPMKLYRLSSYVVQRD-AFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSG 224 (536) Q Consensus 153 -------~~~~~~~~~~l~~~~v~~d-~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~ 224 (536) .+..++++++..+.++..| ..+++...+|.+.-. +.+....+++|+ + + T Consensus 130 ~~~~~~~~~~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~y~---~----~ 185 (486) T protein:vir:42 130 QLDLGWDQNVPIIRVEPPTRMHAEIDPRINRVSKAIRVAYDK-----------------EGNEIQAATLYT---P----M 185 (486) T ss_pred ccccccCCCeeEEEEecccceEEEEeCCCCCeEEEEEEEEec-----------------CCCeEEEEEEEc---C----C Confidence 2334677788777666666 566666666554310 011111122221 1 1 Q ss_pred ceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhCCceeec-- Q lcl|NC_011045. 225 EYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEE-YLGDLRSLENLQEAIVKMSMISSKVIGLVN-- 301 (536) Q Consensus 225 ~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~-~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~-- 301 (536) ....|...+|.-.....-.++|..+|++.++.+...+..||+|=.+. ..+-+..++...-.+...++..+.|...+. 
T Consensus 186 ~~~~~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~ 265 (486) T protein:vir:42 186 ETIGWFRADGEWAEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGI 265 (486) T ss_pred cEEEEEecCCcEEeecceecCCCCceEEEeccccccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcC Confidence 11111222232222223346678999999999888899999997775 457778888887788888888888775543 Q ss_pred -ccccc----chhhhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhh-----cccCCCCCCCH Q lcl|NC_011045. 302 -PAGIT----QPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-----AVQRTGERVTA 371 (536) Q Consensus 302 -~~g~~----~~~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-----~~~~~~~r~TA 371 (536) ++... +.........|.+.....+++.+.++. ..+ ....++.++.-|.+...... +........++ T Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~-~~~---~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg 341 (486) T protein:vir:42 266 KPEEIGVDSETGQTLFDAYLARILAFEDAEGKIQQFS-AAE---LANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASA 341 (486) T ss_pred CccccccccccccchhhhhhchhcccCCCCceEEeec-ccC---HHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHH Confidence 11100 000111122333322222333443433 223 33455555555544322111 11011111233 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCC--CcceEEEEechHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 372 EEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELP--KEAVEPTISTGLEAIGRGQDLDKLERCVAAW 449 (536) Q Consensus 372 tEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~--~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~ 449 (536) .-+.....-+. ... ++.+. .+.+-+.+++.++.+.......+ ...++++|..++..- ..+.++.+...++.. 
T Consensus 342 ~Al~~~~~~l~-~ka---~~~~~-~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~w~~~~~~s-~~~~ad~~~kl~~~~ 415 (486) T protein:vir:42 342 EAIRAAESRLI-KKV---ERKNL-MFGGAWEEAMRIAYRIMKGGDVPPDMLRMETVWRDPSTPT-YAAKADAATKLYGNG 415 (486) T ss_pred HHHHHHHHHHH-HHH---HHHHH-HHHHHHHHHHHHHHHHhcCCCccccceeeeEEecCCCCCC-HHHHHHHHHHHHhcc Confidence 33333222221 111 22222 23334455555443321111122 234677775554321 112222222222211 Q ss_pred HhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhc Q lcl|NC_011045. 450 AALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADS 529 (536) Q Consensus 450 ~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 529 (536) ..+ +.-+.+ ...+|+.+. ..+|++++.+++..+.+. ...+.. .....++........ T Consensus 416 ~g~--------~s~et~----~~~lg~~~d----~~~e~~~~~~e~~~~~~~--~~~~~~-----~~~~~~~~~~~~~~~ 472 (486) T protein:vir:42 416 QGV--------IPRERA----RIDMGYSVK----EREEMRRWDEEEAAMGLG--LLGTMV-----DADPTVPGSPSPTAP 472 (486) T ss_pred cCC--------CCHHHH----HhcCCCChh----HHHHHHHHHHHHHHHHHH--HHHHhh-----cCCCCCCCCCCCCCC Confidence 111 121112 233565322 123444443333222111 111111 111111111111111 Q ss_pred CCCCCCC Q lcl|NC_011045. 530 VGLQPGI 536 (536) Q Consensus 530 ~~~q~~~ 536 (536) ..-||+. T Consensus 473 ~~~~~~~ 479 (486) T protein:vir:42 473 PKPQPAI 479 (486) T ss_pred CCCCccc Confidence 2224444 No 61 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=99.14 E-value=1.3e-09 Score=69.37 Aligned_cols=437 Identities=9% Similarity=0.042 Sum_probs=201.2 Q ss_pred CC--CccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc--c-CCC----C--CcccccccccccchHHHHHHHH Q lcl|NC_011045. 1 MA--EKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL--F-PKD----S--DNASTDYVTPWQAVGARGLNNL 69 (536) Q Consensus 1 Ma--~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~--~-~~~----~--~~~~~~~~~~~dst~~~a~~~L 69 (536) |. .....++.+.+.+..+..+.. 
..+++.+.+|..-.- . ... + ....+...++..+-+...++.. T Consensus 35 ~~~~~~~~~~~~~~i~~~i~~~~~~----~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~ 110 (492) T protein:vir:97 35 IVRTNNKPETLEEMIVRYIKQHLEK----LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQK 110 (492) T ss_pred cccCCCchhhHHHHHHHHHHHHHHH----HHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHHH Confidence 33 233334455565555555543 345556666653321 0 000 0 0111223356677788888888 Q ss_pred HHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEec Q lcl|NC_011045. 70 ASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPE 149 (536) Q Consensus 70 aa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~ 149 (536) ++.|++ .| +.+...|.. ....++.|+ .++|-....++.+++.+||.|.+++.. T Consensus 111 ~~yl~g--~p----~~~~~~d~~---------~~~~l~~~~------------~n~~~~~~~~~~~~~~~~G~a~~~v~~ 163 (492) T protein:vir:97 111 VSYIVG--KP----IAFKHTDDE---------VVKRIDEVL------------GNRFDDKLHSVLTGASNKGIEWLHPYL 163 (492) T ss_pred hhhhcc--cC----ceeccCchH---------HHHHHHHHH------------hccHHHHHHHHHHHHhhcCeEEEEEEe Confidence 876532 12 123333321 111223222 367888999999999999999887766 Q ss_pred CCCCceeeEEEEecceEEEeeC--CCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEE---EecCCCC Q lcl|NC_011045. 150 PEGSNYNPMKLYRLSSYVVQRD--AFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHI---YLDEDSG 224 (536) Q Consensus 150 ~~~~~~~~~~~~~l~~~~v~~d--~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v---~p~~~~~ 224 (536) +.. +.+++++++..+.++..| ..+++...+|.+... ....+++|+-. 
+...+++ T Consensus 164 d~d-g~~~~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~--------------------~~~~~~~y~~~~v~~~~~~~~ 222 (492) T protein:vir:97 164 DEE-GEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLE--------------------NETKVEYWDKVTVNYYVYENG 222 (492) T ss_pred cCC-CceEEEEEcccceEEEEcCCCCCceEEEEEEEeec--------------------cceeEEEEecCeEEEEEEecC Confidence 554 346788888877777665 357777777665431 01123333210 0011122 Q ss_pred ceeEE--EEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecc Q lcl|NC_011045. 225 EYIRY--EEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNP 302 (536) Q Consensus 225 ~~~~~--~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~ 302 (536) ..... ....... .....+++..+|++.++. +.+|+|-.+..++.+..++.+.-.+....+....|.+++.- T Consensus 223 ~~~~~~~~~~~~~~--~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g 295 (492) T protein:vir:97 223 SLIPDYSNNLENSK--THFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKN 295 (492) T ss_pred eeeecccccccccc--cccccCCCCCcceEEecC-----CCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeec Confidence 22111 1111111 123345577889887754 35799999999999999999888888899999999876642 Q ss_pred ccccchhhhcc-CC-CcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhcccCCCCCCCHHHHHHHHH Q lcl|NC_011045. 303 AGITQPRRLTK-AQ-TGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRTGERVTAEEIRYVAS 379 (536) Q Consensus 303 ~g~~~~~~~~~-~~-~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~~~r~TAtEi~~r~~ 379 (536) ....+..+... .. .+.+.-...+++.. +....+.......++.+++.|...-.. +.....-+...|+.-+..... 
T Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~ 373 (492) T protein:vir:97 296 YDDQELPEFKRLLRYYGAIKVSDNGGVDT--IQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYT 373 (492) T ss_pred CCcccchhHHHHHhhccceecCCCCccee--EeccCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHH Confidence 11111112111 11 12222233334443 333345666777777777766543221 111111122334443332221 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhh Q lcl|NC_011045. 380 ELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDP 459 (536) Q Consensus 380 E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~ 459 (536) - +........+. +...+.+++.++.+..-.+ .....++|+|.-.+..- ..+.++.+... +.+ T Consensus 374 ~-l~~ka~~~~~~----f~~~l~~~~~li~~~~~~~-~~~~~i~v~f~~~~p~~-~~e~a~~~~kl----~G~------- 435 (492) T protein:vir:97 374 N-LNLKADKLARK----AKVAIQELLWFVFEHFDIK-GEHKDVDISFNYNKVAN-TELQVQTAQQS----MGI------- 435 (492) T ss_pred H-HHHHHHHHHHH----HHHHHHHHHHHHHHHhcCC-cccceeeEEecCCCCCC-HHHHHHHHHHH----hcc------- Confidence 1 22222222222 3334444444443332221 12345666664333221 11112222211 111 Q ss_pred cCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCC Q lcl|NC_011045. 460 DINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQ 533 (536) Q Consensus 460 ~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q 533 (536) |....++ ..++. +--.++|++++.++++..++..+... .... 
..++......+..+ | T Consensus 436 -iS~et~l----~~l~~----v~d~~~Eleri~~E~~~~~~~~~~~~----~~~~---~~~~~~~~~~~~~~-e 492 (492) T protein:vir:97 436 -VSHETVL----ENHPF----VEDLQAELERIEQEQTEYNKQLPNLD----DGGA---DSAQQQERSNNKES-E 492 (492) T ss_pred -CchHHHH----HhCCC----CCCHHHHHHHHHHHHHHHHHhhhccc----cCCC---CCCccccccccccc-C Confidence 2222222 22222 11235677777665543333222111 1000 11111111111111 1 No 62 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=99.13 E-value=1.5e-09 Score=68.89 Aligned_cols=419 Identities=11% Similarity=-0.020 Sum_probs=190.2 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccc---CCCCCcccccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLF---PKDSDNASTDYVTPWQAVGARGLNNLASKLMLAL 77 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~---~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536) |.++. .+.++...+.... ..++.+++.+|..-... .........+..++..+-+...|+.++..| T Consensus 1 ~~~~~----~~~i~~l~~~~~~----~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l---- 68 (441) T protein:vir:80 1 MNSDE----LALIEGMYDRIQR----LSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERL---- 68 (441) T ss_pred CCccH----HHHHHHHHHHHHH----HHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhh---- Confidence 55544 3334333333332 23344455555422110 011111111233455666777777666655 Q ss_pred cCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceee Q lcl|NC_011045. 78 FPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNP 157 (536) Q Consensus 78 tP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~ 157 (536) +|.+ | ..++. ..++ +....++|.....++.++..+||.|.+++..+..+. 
.+ T Consensus 69 ~~~g-~---~~~d~------------~~l~-----------~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~-~~ 120 (441) T protein:vir:80 69 DWLG-W---TNGDG------------YGLD-----------GVYAANRLATASCDVHLDALIFGLSFVAIIPHGDGT-VS 120 (441) T ss_pred cccc-c---cCCCh------------HHHH-----------HHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCc-eE Confidence 2321 1 12211 1122 223458999999999999999999988877665543 46 Q ss_pred EEEEecceEEEeeCC-CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEec-Cc Q lcl|NC_011045. 158 MKLYRLSSYVVQRDA-FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVE-GM 235 (536) Q Consensus 158 ~~~~~l~~~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~-g~ 235 (536) ++.++..++++..|+ .+++...++++... .+....+++|+ + +....|.+.+ +. T Consensus 121 i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~------------------~~~~~~~~vy~---~----~~~~~~~~~~~~~ 175 (441) T protein:vir:80 121 VRPQSPKNCTGKFSADGSRLDAGLVVQQTC------------------DPEVVEAELLL---P----DVIVQVERRGSRE 175 (441) T ss_pred EEEEccceEEEEEeCCCCceeEEEEEEEEe------------------cCceEEEEEEe---c----CeEEEEEEcCCcc Confidence 888888888776674 45666655554321 00111223321 1 1111122221 11 Q ss_pred cccccccccccccCceEEEeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccc---hhhh Q lcl|NC_011045. 236 EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIE-EYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQ---PRRL 311 (536) Q Consensus 236 ~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~-~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~---~~~~ 311 (536) -.......++|..+|++++.-+...++.||+|-.. ...+-+..++...-.+....+....|.+.+. |... ..+. 
T Consensus 176 ~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~--G~~~~~~~~~~ 253 (441) T protein:vir:80 176 WVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVT--GVSADEFSQPG 253 (441) T ss_pred eeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeee--cCCccccccch Confidence 11122344567899999988788888899999654 4677788888888888889998999876552 2111 1111 Q ss_pred ccCCCcceec--CCccc--ccccccccccchhHHHHHHHHHHHHHHHHHhhh-----hcccCCCCC-CCHHHHHHHHHHH Q lcl|NC_011045. 312 TKAQTGDFVT--GRPED--ISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN-----SAVQRTGER-VTAEEIRYVASEL 381 (536) Q Consensus 312 ~~~~~g~~~~--g~~~~--~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~-----~~~~~~~~r-~TAtEi~~r~~E~ 381 (536) .....|.+.. ++.++ +.+.++. .++++. .+..++.-|.+.+... .+. -.... -++.-+......+ T Consensus 254 ~~~~~~~i~~~~~~~~~~~~~~~~~~-~~~~~~---~~~~l~~~i~~~~~~~~~p~~~~g-~~~~~~~Sg~Al~~~~~~l 328 (441) T protein:vir:80 254 WVLSMASVWAVDKDDDGDTPNVGSFP-VNSPTP---YSDQMRLLAQLTAGEAAVPERYFG-FITSNPPSGEALAAEESRL 328 (441) T ss_pred hhhcccccccCCCCCCCCcceeEecC-ccchHH---HHHHHHHHHHHHhcccCCCHHHhc-cCCCcchHHHHHHHHHHHH Confidence 1223344433 22111 2222222 233443 3334444443322111 111 11111 1343333322222 Q ss_pred HHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCC--CcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhh Q lcl|NC_011045. 382 EDTLGGVYSILSQELQLPLVRVLLKQLQAT-QQIPELP--KEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDD 458 (536) Q Consensus 382 ~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-g~lp~~~--~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~ 458 (536) . -...+.+.. +.+-+.+++.++.+. |.....+ ...+++.|.-++..-. .+.++.+... .+.+.. 
T Consensus 329 ~----~k~~~~~~~-f~~~l~~~~~l~~~~~~~~~~~~~~~~~i~~~f~~~~~~~~-~e~ad~~~kl----~~~g~~--- 395 (441) T protein:vir:80 329 V----KRAERRQTS-FGQGWLSVGFLAAKALDSRVDEADFFGDVGLRWRDASTPTR-AATADAVTKL----VGAGIL--- 395 (441) T ss_pred H----HHHHHHHHH-HHHHHHHHHHHHHHHhcCCCcccccceeeeEEeCCCCCcCH-HHHHHHHHHH----HhcCcc--- Confidence 2 222333333 333445555544332 2222222 2456777766654321 2222222222 222210 Q ss_pred hcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcch Q lcl|NC_011045. 459 PDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEA 522 (536) Q Consensus 459 ~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~ 522 (536) .+.... +...+|. +++|++++.+++++++. +.++..+....+.. |. T Consensus 396 -~~s~~~----~~~~l~~-------~~~e~~~~~~e~~e~~~---~~~~~~~~~~~~~~---~~ 441 (441) T protein:vir:80 396 -PADSRT----VLEMLGL-------DDVQVEAVMRHRAESSD---PLAVLAGAISRQTN---EV 441 (441) T ss_pred -cccHHH----HHHhCCC-------CHHHHHHHHHHHHHHHH---HHHHHhhhhhcccc---cC Confidence 112222 2234444 35666655443332221 11111111111111 11 No 63 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=99.12 E-value=1.7e-09 Score=68.67 Aligned_cols=432 Identities=9% Similarity=-0.007 Sum_probs=203.7 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCC Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFPM 80 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~ 80 (536) |++. +.++.+.+.+..+.... |.+....++++|+-.-+-+... 
.....+...++..+.+...++..++.|.+- | T Consensus 11 ~~~~-~~~~~~~i~~~i~~~~~-~~~r~~~~~~yy~g~~~i~~~~-~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~--~- 84 (453) T protein:vir:73 11 YSRD-EEITDKVVNDFMKKHQE-EVERYEYLGNMYKGIMEISSQK-AKDSWKPDNRLTNNFAKYIVDTFVGYFNGI--P- 84 (453) T ss_pred cccc-ccCCHHHHHHHHHHHHH-HHHHHHHHHHHhccccchhcCC-CCCccCccceeecchHHHHHHHhhhhhccc--C- Confidence 6754 55677888777777754 4455556666665432211111 111122334566677888888777665331 2 Q ss_pred CcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEE Q lcl|NC_011045. 81 QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKL 160 (536) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~ 160 (536) + ++...+.. ....++. .+..++|.....++.++..+||.|.+++..+..+ .+++.. T Consensus 85 -~--~~~~~d~~---------~~~~l~~-----------~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~-~~~i~~ 140 (453) T protein:vir:73 85 -I--KKTHDDKS---------VLEAMQL-----------FDNLNDMEDEESELAKIACVYGRAYELMYQNEST-ESEVIY 140 (453) T ss_pred -c--eeecCChH---------HHHHHHH-----------HHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCC-ceEEEE Confidence 1 22222211 1112333 3445789999999999999999998877665543 346766 Q ss_pred EecceE-EEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccc Q lcl|NC_011045. 161 YRLSSY-VVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQG 239 (536) Q Consensus 161 ~~l~~~-~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~ 239 (536) ++..+. .+..|..++....+.++.... +....++||+. + ....|..-.+.-... 
T Consensus 141 ~~p~~~~~v~dd~~~~~~~~~i~~~~~~------------------~~~~~~~vyt~------~-~i~~~~~~~~~~~~~ 195 (453) T protein:vir:73 141 CSPLNVFMVYDDSIKQKPLFAVYYGFDE------------------EGNLSGTVYTL------L-ETISITGKAGEVKFG 195 (453) T ss_pred EcccceEEEEeCCCCceeEEEEEEEEec------------------CceEEEEEEeC------C-eEEEEEecCCceEEc Confidence 665554 444455566555544444220 11123344432 1 111111111111111 Q ss_pred cccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCcce Q lcl|NC_011045. 240 SDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDF 319 (536) Q Consensus 240 ~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~~ 319 (536) ....++|..+|++.++ .+.+|+|-.+...+-+-.++.+.-......+....|.+.+. +.....++......+.+ T Consensus 196 ~~~~~~~g~vPvv~~~-----n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~-g~~~~~~~~~~~~~~~~ 269 (453) T protein:vir:73 196 ESTYNVYSDLPIVEYN-----FNEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFL-GAEVDEEDAKNIKDNRL 269 (453) T ss_pred cceeccCCceeEEEec-----CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeee-cCCCCchhhhccccccc Confidence 2233557789988654 34679999999999999999999999999999999987763 11111112111111111 Q ss_pred ------ecC----CcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhH Q lcl|NC_011045. 320 ------VTG----RPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVY 389 (536) Q Consensus 320 ------~~g----~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~ 389 (536) .++ ...+..+..+....+.......++.++..|-..-..-.+........|+.-+..+..- ......-. 
T Consensus 270 ~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~-l~~ka~~~ 348 (453) T protein:vir:73 270 INFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAANISDENFGNSSGVALAYKLQA-MSNLALSF 348 (453) T ss_pred ccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccCccHHHHHHHHHH-HHHHHHHH Confidence 000 1111112223333355666677777777664422111111111123455554333221 11222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHH Q lcl|NC_011045. 390 SILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLR 469 (536) Q Consensus 390 ~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~ 469 (536) .+.-.+.+..++..+..++...|. ...-..++++|.-++..- ..+.++.+.... .+ +....++ T Consensus 349 ~~~~~~~l~~~~~li~~~~~~~~~--~~~~~~i~v~f~~~~p~~-~~~~a~~~~k~~----gi--------is~et~~-- 411 (453) T protein:vir:73 349 QRKFQSALNRRYSLWSSLSTNASN--KDAWKDIEYTFTRNEPKD-IKEQAETANILK----GI--------TSEETAL-- 411 (453) T ss_pred HHHHHHHHHHHHHHHHHHHhccCC--ccccccceEEeCCCCCCC-HHHHHHHHHHHh----cc--------CcHHHHH-- Confidence 233333333344434444433332 222345677775444321 112222221111 11 2222222 Q ss_pred HHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCc Q lcl|NC_011045. 470 IANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASP 520 (536) Q Consensus 470 ~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~ 520 (536) ..++. +-..++|++++.+++++++.+++... 
+ ....+..++- T Consensus 412 --~~~~~----~~d~~~E~~ri~~E~~~~~~~~~~~~--~-~~~~~~~~~~ 453 (453) T protein:vir:73 412 --SVISV----IPDVQAEMEKIKKKKLLQLSLTRTSN--L-VRMKQMRGNL 453 (453) T ss_pred --HhCCC----CCCHHHHHHHHHHHHHHHHHHHHhcc--C-CcchhhhcCC Confidence 22222 11236777777666554433332110 0 0001111111 No 64 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=99.12 E-value=1.7e-09 Score=68.60 Aligned_cols=450 Identities=10% Similarity=-0.001 Sum_probs=206.5 Q ss_pred CC--CccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc-----ccCCCCCcccccccccccchHHHHHHHHHHHH Q lcl|NC_011045. 1 MA--EKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPS-----LFPKDSDNASTDYVTPWQAVGARGLNNLASKL 73 (536) Q Consensus 1 Ma--~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~-----~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l 73 (536) |. ++.+..+.+.+.+..+.....|.+ +++++.+|..-. +.... ........++..+-+...++..++.| T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~---r~~~l~~Yy~g~~~il~~~~~~-~~~~~~~~ki~~n~~k~Iv~~~~~yl 106 (511) T protein:vir:93 31 YDGTESDLLQNVNEVSKYIEHHMDYQRP---RLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVAHDYASYISDFINGYF 106 (511) T ss_pred ccchhhhhhccHHHHHHHHHHHHHhhHH---HHHHHHHHhcccCccccccCcC-cccccCcceeecchHHHHHHHHhhhh Confidence 44 344444566666666666555544 444455554321 11111 11111224566666777777777555 Q ss_pred HHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCC Q lcl|NC_011045. 74 MLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) Q Consensus 74 ~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~ 153 (536) ++ -| + +++..+.. +. ..+...+..++|.....++.+++.+||.|.+++..+.. 
T Consensus 107 ~g--~p--~--~~~~~d~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~- 159 (511) T protein:vir:93 107 LG--NP--I--QYQDDDKD-------------VL-------EVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD- 159 (511) T ss_pred cc--cC--e--eeccCChH-------------HH-------HHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCC- Confidence 32 12 1 23333321 11 12333444588999999999999999999887766554 Q ss_pred ceeeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEE Q lcl|NC_011045. 154 NYNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEE 231 (536) Q Consensus 154 ~~~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~ 231 (536) +.++++.++..+.++..|. .+++...+|.+.....+ ....+.-..+++|+. + ....|.. T Consensus 160 ~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~------------~~~~~~~~~~~iyt~------~-~i~~~~~ 220 (511) T protein:vir:93 160 DETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID------------KTDEDEVFTVDLFTS------H-GVYRYLT 220 (511) T ss_pred CceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc------------ccccceEEEEEEEeC------C-cEEEEEe Confidence 3456788888777766664 36666655555432100 001111122333321 1 1111211 Q ss_pred ecCc-----cccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccc Q lcl|NC_011045. 232 VEGM-----EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGIT 306 (536) Q Consensus 232 v~g~-----~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~ 306 (536) -.+. ........+++..+|++.++- +..|.|-.+..++-+..++.+.-......+...+|.+.+.-.... 
T Consensus 221 ~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~ 295 (511) T protein:vir:93 221 SRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL 295 (511) T ss_pred cCCCccccccccccccccCCCccceEEecC-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCccc Confidence 1111 001122345577889887653 457899999999999999998888888899889998766432222 Q ss_pred chhhhccCCCccee--------c----CCcccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhcccCCCCCCCHHH Q lcl|NC_011045. 307 QPRRLTKAQTGDFV--------T----GRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRTGERVTAEE 373 (536) Q Consensus 307 ~~~~~~~~~~g~~~--------~----g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~~~r~TAtE 373 (536) ...++.....+.+. . +..++..+..+....+.......++.+...|...-.. +.....-+...|+.. T Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~A 375 (511) T protein:vir:93 296 DPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA 375 (511) T ss_pred CchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHH Confidence 32222211111111 1 1111222333444456677777777777777442211 111111123345555 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCC-CCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011045. 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIP-ELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAAL 452 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp-~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~ 452 (536) +...-. ..........+.-.+.+.-+++-++.++...+... ...-..+++.|.-.+..- ..+.++.+... 
+.+ T Consensus 376 l~~~~~-~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n-~~e~~~~~~kl----~g~ 449 (511) T protein:vir:93 376 MKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKS-LIEELKAYIDS----GGK 449 (511) T ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccceEEeCCCCCCC-HHHHHHHHHHH----hcc Confidence 544322 22233333333333334334333333333333221 112234677775433321 12222222211 111 Q ss_pred cchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCC Q lcl|NC_011045. 453 APMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGL 532 (536) Q Consensus 453 ~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (536) |....++.. ++. +-..++|++++.+++..++...+...... ..+.+. ...+...- T Consensus 450 --------iS~et~~~~----l~~----v~d~~~E~~ri~~E~~~~~~~~~~~~~~~-----~~~~~~----~~~~~~~~ 504 (511) T protein:vir:93 450 --------ISQTTLMSL----FSF----FQDPELEVKKIEEDEKESIKKAQKGIYKD-----PRDIND----DEQDDDTK 504 (511) T ss_pred --------CchHHHHHh----CCC----CCCHHHHHHHHHHHHHHHHHHHhhhcccC-----CCCCCC----CCCCCccc Confidence 222223322 221 11235677777665543322221111000 000000 00011111 Q ss_pred CCCC Q lcl|NC_011045. 533 QPGI 536 (536) Q Consensus 533 q~~~ 536 (536) ..+. T Consensus 505 ~~~~ 508 (511) T protein:vir:93 505 DTVD 508 (511) T ss_pred cccc Confidence 1111 No 65 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.09 E-value=2.5e-09 Score=67.77 Aligned_cols=435 Identities=10% Similarity=0.045 Sum_probs=199.5 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc-----ccCC-CCC---cccccccccccchHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPS-----LFPK-DSD---NASTDYVTPWQAVGARGLNNLAS 71 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~-----~~~~-~~~---~~~~~~~~~~dst~~~a~~~Laa 71 (536) |-...+ ...+.+.+..+..+.. ..+++.+.+|..-. +-.. ... 
...+...++..+-+...++.+++ T Consensus 18 ~~~~~~-~~~~~i~~~i~~~~~~----~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~ 92 (472) T protein:vir:93 18 TNNKPE-TLEEMIVRYIKQHLEK----LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVS 92 (472) T ss_pred ecCchh-hHHHHHHHHHHHHHHH----HHHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHHHHHhh Confidence 322221 2244454444544433 35566666665432 1000 000 11122335667788888888887 Q ss_pred HHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCC Q lcl|NC_011045. 72 KLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPE 151 (536) Q Consensus 72 ~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~ 151 (536) .|.+ -| +.+...+.. ....++.|+ .++|-..++++.++..++|.|.+++..+. T Consensus 93 ~l~g--~~----~~~~~~d~~---------~~~~l~~~~------------~n~~~~~~~~~~~~~~~~G~~~~~v~~d~ 145 (472) T protein:vir:93 93 YIVG--KP----IAFKHTDDE---------VVKRIDEVL------------GNRFDDKLHSVLTGASNKGIEWLHPYLDE 145 (472) T ss_pred hhcc--cC----eeeccCChH---------HHHHHHHHH------------hccHHHHHHHHHHHHhhcCeEEEEEEECC Confidence 6643 12 123333221 111233232 36788999999999999999988887665 Q ss_pred CCceeeEEEEecceEEEeeC--CCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEE---EecCCCCce Q lcl|NC_011045. 152 GSNYNPMKLYRLSSYVVQRD--AFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHI---YLDEDSGEY 226 (536) Q Consensus 152 ~~~~~~~~~~~l~~~~v~~d--~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v---~p~~~~~~~ 226 (536) .+ .+++.+++..+.++..| ..+++...+|.++..- ...+++|+-. +-+.+++.. 
T Consensus 146 d~-~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~--------------------~~~~~~~~~~~~~~~~~~~~~~ 204 (472) T protein:vir:93 146 EG-EFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN--------------------ETKVEYWDKVTVNYYVYENGSL 204 (472) T ss_pred CC-ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeec--------------------ceeEEEEecCeEEEEEEecCee Confidence 53 45788888877777665 3677766666554320 1122332210 001111111 Q ss_pred eE--EEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccc Q lcl|NC_011045. 227 IR--YEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAG 304 (536) Q Consensus 227 ~~--~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g 304 (536) .. ..+.++..+ ....+++..+|++.++. +.+|+|-.+...+.+..++.+.-.+....+....|.+++.--. T Consensus 205 ~~~~~~~~~~~~~--~~~~~~~~~vPvv~~~n-----n~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~ 277 (472) T protein:vir:93 205 IPDYSNNLENSKT--HFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD 277 (472) T ss_pred eeccccccccccc--ccccCCCCCcceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCC Confidence 11 111222222 23445677899987764 4589999999999999999998999999999999987764211 Q ss_pred ccchhhhcc-C-CCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCCCCHHHHHHHHHHH Q lcl|NC_011045. 305 ITQPRRLTK-A-QTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGERVTAEEIRYVASEL 381 (536) Q Consensus 305 ~~~~~~~~~-~-~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~r~TAtEi~~r~~E~ 381 (536) ..+..+... . ..+.+.....+++..+ ....+.......++.++..|...-..-. 
....-+...|+.-+.....-+ T Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l 355 (472) T protein:vir:93 278 DQELPEFKRLLRYYGAIKVSDNGGVDTI--QVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNL 355 (472) T ss_pred cccchhhHHHHhhccccccCCCCcceeE--eecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHH Confidence 111112111 1 1123322333444433 3334566667777777776644322111 111112234554433221111 Q ss_pred HHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhc Q lcl|NC_011045. 382 EDTLGGVYSILSQELQLPLVRVLLKQLQAT-QQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPD 460 (536) Q Consensus 382 ~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~ 460 (536) .... .+.+.. +...+.+++.++.+. |. + .....++|+|.-.+.. ...+.++.+... +++ T Consensus 356 -~~ka---~~~~~~-~~~~l~~~~~li~~~~~~-~-~~~~~i~v~f~~~~p~-~~~~~~~~~~k~----~gi-------- 415 (472) T protein:vir:93 356 -NLKA---DKLARK-AKVAIQELLWFVFEHFDI-K-GEHKDVDISFNYNKVA-NTELQVQTAQQS----MGI-------- 415 (472) T ss_pred -HHHH---HHHHHH-HHHHHHHHHHHHHHHhCC-C-cccceeeEEeCCCCCC-CHHHHHHHHHHH----hcc-------- Confidence 1111 222222 222334444443332 21 1 1223456655333322 111122222221 111 Q ss_pred CCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCC Q lcl|NC_011045. 461 INLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQ 533 (536) Q Consensus 461 id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q 533 (536) +....++ ..++. +--.++|++++.++++..++.++.. 
......+.......+...-| T Consensus 416 is~et~l----~~l~~----~~d~~~E~~ri~~E~~~~~~~~~~~--------~~~~~d~~~~~~~~~~~~~e 472 (472) T protein:vir:93 416 VSHETVL----ENHPF----VEDLQAELERIEQEQMEYNKQLPNL--------DDGGADGAQQQERSNNKESE 472 (472) T ss_pred CchHHHH----HhCCC----CCCHHHHHHHHHHHHHHHHHhccCc--------CcccCCCCCCCCCCCcccCC Confidence 1222222 22222 1123567777666554433332211 11111111111111112112 No 66 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=99.08 E-value=2.6e-09 Score=67.62 Aligned_cols=435 Identities=9% Similarity=0.023 Sum_probs=205.2 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc--ccCCCCCcccccccccccchHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPS--LFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALF 78 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~--~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 78 (536) |..+ +.+..+.+.+..+..... ..+++++.+|..-. ..........+...++..+.+...|+.+++.|++ - T Consensus 11 ~p~d-~~~~~~~l~~~i~~~~~~----~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g--~ 83 (453) T protein:vir:39 11 FPKD-EPITNEVVTKFMEKHRLE----VARYEYLKNMYRGIMAIDAEPTKDLWKPDNRLTVNFTKYIVDTFTGYFNG--I 83 (453) T ss_pred cCCC-CCCCHHHHHHHHHHHHHH----HHHHHHHHHHhhccCchhcCCCccccCccceeecchHHHHHHHHhhhhcc--c Confidence 6663 456677777766665543 34555556665431 1001111111223455667788888888776642 1 Q ss_pred CCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeE Q lcl|NC_011045. 79 PMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPM 158 (536) Q Consensus 79 P~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~ 158 (536) | +.++..+.. 
+ ...+.+.+..++|.....++.++..++|.|.+++..+..+ .+++ T Consensus 84 ~----~~~~~~d~~-------------~-------~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g-~~~i 138 (453) T protein:vir:39 84 P----VKKSHSDKE-------------T-------LSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEET-QTNV 138 (453) T ss_pred C----ceeccCChH-------------H-------HHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCC-ceEE Confidence 2 122222211 1 1234445666899999999999999999999988776553 4567 Q ss_pred EEEecceEEEeeC-CCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccc Q lcl|NC_011045. 159 KLYRLSSYVVQRD-AFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEV 237 (536) Q Consensus 159 ~~~~l~~~~v~~d-~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i 237 (536) +.++..+.+...| ..++....+.++... .+....+++|+. + +...|..-.+.-. T Consensus 139 ~~~~p~~~~~v~d~~~~~~~~~~ir~~~~------------------~~~~~~~~~yt~------~-~i~~~~~~~~~~~ 193 (453) T protein:vir:39 139 IYNTPENMFMVYDDTIKQEPLFAVRYGYD------------------DDYKLYGEVYTK------E-TTYALNGTMGFYN 193 (453) T ss_pred EEEcccceEEEecCCCCCeEEEEEEEEEe------------------CCeEEEEEEEeC------C-eEEEEEecCCcee Confidence 7888766666655 344444444333211 011122333321 1 1111111111111 Q ss_pred cccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCc Q lcl|NC_011045. 238 QGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTG 317 (536) Q Consensus 238 ~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g 317 (536) ......+++..+|++.++. 
+.+|+|-.+...+-+..++.+.-.....++....|.+.+.- .....+++.....+ T Consensus 194 ~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g-~~~~~~~~~~~~~~ 267 (453) T protein:vir:39 194 MTEQAPNPFDDLPVVEFYF-----NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLG-AAVEEEDLKNIRSN 267 (453) T ss_pred eecccccCCCceeEEEecC-----CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeec-CCCCchhhhhhhhc Confidence 1223345577899887654 45799999999999999999999999999999999877642 11222222221111 Q ss_pred -ce-ecCC---cccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHH Q lcl|NC_011045. 318 -DF-VTGR---PEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSIL 392 (536) Q Consensus 318 -~~-~~g~---~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl 392 (536) .+ +++. ..+..+..+....+.+.....++.++..|...-..-......-...|+.-+..+..-+. ....-..+. T Consensus 268 ~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~-~ka~~~~~~ 346 (453) T protein:vir:39 268 RVINYYGESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMS-NLALSFQRK 346 (453) T ss_pred ceeeecCCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccCChHHHHHHHHHHHH-HHHHHHHHH Confidence 21 1111 11222333444456777777788877766442211111111112345555443322222 222223333 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHH Q lcl|NC_011045. 393 SQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIAN 472 (536) Q Consensus 393 ~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~ 472 (536) -.+.+..++.-+..++...|. ......++|.|.-++..-. .+.++.+... +++ |....++ . 
T Consensus 347 ~~~~l~~~~~li~~~~~~~~~--~~~~~~i~v~f~~~~p~~~-~~~a~~~~kl----~g~--------is~et~l----~ 407 (453) T protein:vir:39 347 FQSSLNSRYKLYCELSTNVSN--KEAWKDIEYTFTRNEPKDI-KEQAETANIL----MGI--------TSQETAL----S 407 (453) T ss_pred HHHHHHHHHHHHHHHHhccCC--ccccccceEEeCCCCCcCH-HHHHHHHHHH----hcc--------CChHHHH----H Confidence 333344444444444444443 1223456777754443211 1122222211 111 2222232 2 Q ss_pred HcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCC Q lcl|NC_011045. 473 AIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVG 531 (536) Q Consensus 473 ~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~ 531 (536) .+|..+ -.++|++++.+++....+..+ . ......+....+..+... T Consensus 408 ~l~~v~----D~~~E~~ri~~E~~~~~~~~~--~-------~~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 408 VISVIP----DVQAEMEKIKKEEASTAIFDK--D-------KQPSEKGTDTVVPETNEE 453 (453) T ss_pred hCCCCC----CHHHHHHHHHHHHHHHHHHHH--h-------ccCCCCCCCCCCCCcCCC Confidence 233201 135666655544432211111 0 011111111111111111 No 67 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=99.08 E-value=2.8e-09 Score=67.51 Aligned_cols=441 Identities=14% Similarity=0.131 Sum_probs=196.2 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccc-CCCC--CcccccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLF-PKDS--DNASTDYVTPWQAVGARGLNNLASKLMLAL 77 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~-~~~~--~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536) |.-. .+.+...++.+... 
..+...+.+|..-.-- +..+ .+...+..++..+-+..+|+++++.| T Consensus 1 ~~t~-----~d~i~~L~~~~~~~----~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l---- 67 (480) T protein:vir:78 1 MTTY-----HEHVERLQGLLARD----LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL---- 67 (480) T ss_pred CCCH-----HHHHHHHHHHHHHH----HHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhh---- Confidence 4442 34455555555443 4455555666543211 1111 11111122344566777777777765 Q ss_pred cCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecC-----CC Q lcl|NC_011045. 78 FPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEP-----EG 152 (536) Q Consensus 78 tP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~-----~~ 152 (536) +|.+ | ... .+. +....++ ..+.+++|.....++.++..+||.|.++|..+ ++ T Consensus 68 ~~~g-~-~~~-~d~---------~~~~~l~-----------~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~ 124 (480) T protein:vir:78 68 DIEG-F-RIS-EDS---------EGLEELW-----------NWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDP 124 (480) T ss_pred ccCc-e-ecC-CCc---------hhHHHHH-----------HHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCC Confidence 3322 2 221 111 1111222 33456899999999999999999998877532 23 Q ss_pred CceeeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEE Q lcl|NC_011045. 153 SNYNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYE 230 (536) Q Consensus 153 ~~~~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~ 230 (536) .+..+++.++..+.++..|+ .+++...+|.+.-. .+.+....+++|+. + ....|. 
T Consensus 125 ~~~~~i~~~~p~~~~~i~D~~~~~~~~~~i~~~~~~----------------d~~~~~~~~~~y~~------~-~~~~~~ 181 (480) T protein:vir:78 125 AGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR----------------DDVAVPDRATLYLP------D-ETVPLR 181 (480) T ss_pred CCeeEEEEEcccceEEEEcCCCccceEEEEEEEEee----------------cCCcceEEEEEEeC------C-eEEEEE Confidence 34457888988888888886 45666655544211 01111223333321 1 111111 Q ss_pred EecCc----cccccccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhCCceeeccccc Q lcl|NC_011045. 231 EVEGM----EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEE-YLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGI 305 (536) Q Consensus 231 ~v~g~----~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~-~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~ 305 (536) ...+. ....+...++|..+|++++..+...+..||+|=.++ ..+-+..++...-.+...++..+.|.+.+. |. T Consensus 182 ~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~--G~ 259 (480) T protein:vir:78 182 RNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GV 259 (480) T ss_pred ecCCCcccccccccccccCCCCcceEEeecccccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhh--CC Confidence 11110 111222345678999999998888899999998775 468888889888888888888888875542 21 Q ss_pred cchh-------hhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhh----cccCCCCC-CCHHH Q lcl|NC_011045. 306 TQPR-------RLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS----AVQRTGER-VTAEE 373 (536) Q Consensus 306 ~~~~-------~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~----~~~~~~~r-~TAtE 373 (536) .... .......|.+..-..+++.+.++.. .+++ ..++.++.-|.+.+.... ....+... 
-++.- T Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~A 335 (480) T protein:vir:78 260 TTDELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELR---NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEA 335 (480) T ss_pred CccccccccccchhhhhhhhhccCCCCCceEEecCc-cCHH---HHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHH Confidence 1110 0011122333222233344444332 2343 334444444444322111 00011111 13333 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc--CCCCCCCCcceEEEEechHH--HHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQAT--QQIPELPKEAVEPTISTGLE--AIGRGQDLDKLERCVAAW 449 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~--g~lp~~~~~~v~v~~vs~La--~a~r~~~~~~l~~~~~~~ 449 (536) +......+. -..++.+..| .+-+.+++.++.+. +..+ .....++++|.-+.. .++.+..+.+| ++ T Consensus 336 l~~~~~~l~----~k~~~~~~~f-~~~l~~~~rl~~~~~~~~~~-~~~~~i~v~w~~~~~~s~~~~ad~~~kl---~~-- 404 (480) T protein:vir:78 336 IIATDSRIV----KMAERKGRIF-GGAWERAMRIAMQIMGREVT-EEYTRLETVWRDPSTPTVAAKADAVSKL---YA-- 404 (480) T ss_pred HHHHHHHHH----HHHHHHHHHH-HHHHHHHHHHHHHHcCCCcc-ccceeeeEEecCCCCCCHHHHHHHHHHH---HH-- Confidence 332222222 1223333333 33344455544432 1111 112346777754432 23322222222 21 Q ss_pred HhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--hhhcCcchHHhhh Q lcl|NC_011045. 450 AALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAA--QATASPEAMAAAA 527 (536) Q Consensus 450 ~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~--~~~~~~~~~~~~~ 527 (536) .+.. .+.-+. +...+|+ ++++++++.+.+.++. +....+....... +.+..|.+..... T Consensus 405 --~g~~----~~s~et----~~~~lg~-------~~d~~~e~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 465 (480) T protein:vir:78 405 --NGQG----PIPKEQ----ARIDLGY-------TATQREQMRDWDKQET--EDMIDTLYSTTKAQADATPKPTVTETKT 465 (480) T ss_pred --hccc----CCCHHH----HHhcCCC-------CHhHHHHHHHHHHHHH--HHHHHHhhccccCCCccccCCCCCCCCC Confidence 1110 122222 2334555 2444444432211111 1111111111111 1112222222222 Q ss_pred hcCCCCCCC Q lcl|NC_011045. 
528 DSVGLQPGI 536 (536) Q Consensus 528 ~~~~~q~~~ 536 (536) .+...+-|+ T Consensus 466 ~~~~~~~~~ 474 (480) T protein:vir:78 466 ETQTSPSGF 474 (480) T ss_pred ccCCCcccC Confidence 233333444 No 68 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=99.08 E-value=2.8e-09 Score=67.51 Aligned_cols=441 Identities=9% Similarity=-0.002 Sum_probs=204.4 Q ss_pred CCC-------ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHh--cccc-cC--CCC-CcccccccccccchHHHHHH Q lcl|NC_011045. 1 MAE-------KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYT--IPSL-FP--KDS-DNASTDYVTPWQAVGARGLN 67 (536) Q Consensus 1 Ma~-------~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~--~P~~-~~--~~~-~~~~~~~~~~~dst~~~a~~ 67 (536) |++ .+..+..+.+++.++..+. |-+...++.++|+-- +..+ .. ... ........++..+-+...++ T Consensus 13 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd 91 (474) T protein:vir:97 13 YGEEVVEQLKPQFETQEEMIVRLIDDHRK-QLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVD 91 (474) T ss_pred hhhHHHHhhhhcccCHHHHHHHHHHHHHH-HHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHH Confidence 321 1222445677777776654 445556666665421 1111 11 111 11112234566677778888 Q ss_pred HHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEE Q lcl|NC_011045. 68 NLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYL 147 (536) Q Consensus 68 ~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~ 147 (536) ..++.|++ -| +.+...+... 
...++.|+ .+||...+.++.++..++|.|.+++ T Consensus 92 ~~~~~l~g--~p----~~~~~~d~~~---------~~~l~~~~------------~n~~~~~~~e~~~~~~~~G~~~~~~ 144 (474) T protein:vir:97 92 QKVSYVAS--KP----VTYSCEDENV---------LKVIHDVL------------DTRWDNKLIDILTATSNKGIDWLQV 144 (474) T ss_pred HHHhhhhc--CC----ceeccCcHHH---------HHHHHHHH------------hccHHHHHHHHHHHHhhcCceEEEE Confidence 77766643 12 1233333211 11233332 4789999999999999999998877 Q ss_pred ecCCCCceeeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEE---EecCC Q lcl|NC_011045. 148 PEPEGSNYNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHI---YLDED 222 (536) Q Consensus 148 ~~~~~~~~~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v---~p~~~ 222 (536) ..+.. +.+++.+++..+.++..|. .+++...+|.++.. ....+++|+-- +.+.+ T Consensus 145 ~~d~~-~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~--------------------~~~~~~~yt~~~~~~y~~~ 203 (474) T protein:vir:97 145 YINEN-GEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFN--------------------NEEKVEFWTDTTVTYYVLE 203 (474) T ss_pred EecCC-CeeEEEEEcccceEEEEcCCCCCceEEEEEEEEec--------------------CeEEEEEEeCCeEEEEEEc Confidence 66554 4467888888888877764 57777777766521 01123333210 00111 Q ss_pred CCceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecc Q lcl|NC_011045. 223 SGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNP 302 (536) Q Consensus 223 ~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~ 302 (536) ++.+..-...+...+......+++..+|++.++. +.+|.|=.+...+.+..+|.+.-......+....|.+++.. 
T Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g 278 (474) T protein:vir:97 204 NGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKN-----NPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKG 278 (474) T ss_pred CCccccccccCcCcccccccccCCCccceEEecC-----CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec Confidence 1111111111111121222345567888887654 46899999999999999999999999999999999887653 Q ss_pred ccccchhhhcc-CCCccee-cCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCCCCHHHHHHHHH Q lcl|NC_011045. 303 AGITQPRRLTK-AQTGDFV-TGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGERVTAEEIRYVAS 379 (536) Q Consensus 303 ~g~~~~~~~~~-~~~g~~~-~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~r~TAtEi~~r~~ 379 (536) -......++.. ...+.++ ....+++.. +....+.......++.++..|-..-..-. .....+...|+..+..+.. T Consensus 279 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~--l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~ 356 (474) T protein:vir:97 279 YEGEDLEEFMRGLKYYKAINVDGDGGVET--IQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYG 356 (474) T ss_pred CCcccchhhhhhhhccceeeccCCCceeE--EeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHH Confidence 22222222222 1222222 223333433 33344667777777777776644321111 1111122345544332222 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhh Q lcl|NC_011045. 380 ELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDP 459 (536) Q Consensus 380 E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~ 459 (536) -+... . .+ ....+...+.+++.++.+..-.. .....++|.|.-.+..- ..+.++. 
+.+.+ T Consensus 357 ~l~~k-~---~~-k~~~~~~~l~~~~~li~~~~~~~-~d~~~i~v~f~~~~p~~-~~e~a~~-------~~~~g------ 416 (474) T protein:vir:97 357 NLDLK-A---NK-LKNKATVAIQELISFIIDFNNLK-TDVKDIEISFNFNRMMN-DAEQSQI-------IAQSQ------ 416 (474) T ss_pred HHHHH-H---HH-HHHHHHHHHHHHHHHHHHHhCCC-cccceeeEEeccCcccC-HHHHHHH-------HHHcC------ Confidence 11111 1 11 11223334444444443321111 12234666653322211 1111111 11111 Q ss_pred cCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCC Q lcl|NC_011045. 460 DINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGL 532 (536) Q Consensus 460 ~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (536) .+....++..+ -+| + -.++|++++.++++..++..... ...... .++......+..+- T Consensus 417 ~iS~et~l~~l---~~v-~----D~~~E~eri~~E~~~~~~~~~~~----~~~~~~---~~~~~~~~~~~~~e 474 (474) T protein:vir:97 417 YLSRETLVKSS---PLV-D----DYKAELERIEQEQMEYNKQLPNL----DDGGAD---GAQQQEGSNNKESE 474 (474) T ss_pred CCCHHHHHHhC---CCC-C----CHHHHHHHHHHHHHHHHhhcccc----CCCCCC---CcccCCCCcccccC Confidence 12333333321 122 1 12466666655544322211100 000000 00001110111111 No 69 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=99.08 E-value=2.8e-09 Score=67.51 Aligned_cols=441 Identities=9% Similarity=-0.002 Sum_probs=204.4 Q ss_pred CCC-------ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHh--cccc-cC--CCC-CcccccccccccchHHHHHH Q lcl|NC_011045. 1 MAE-------KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYT--IPSL-FP--KDS-DNASTDYVTPWQAVGARGLN 67 (536) Q Consensus 1 Ma~-------~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~--~P~~-~~--~~~-~~~~~~~~~~~dst~~~a~~ 67 (536) |++ .+..+..+.+++.++..+. |-+...++.++|+-- +..+ .. ... 
........++..+-+...++ T Consensus 13 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd 91 (474) T protein:vir:94 13 YGEEVVEQLKPQFETQEEMIVRLIDDHRK-QLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVD 91 (474) T ss_pred hhhHHHHhhhhcccCHHHHHHHHHHHHHH-HHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHH Confidence 321 1222445677777776654 445556666665421 1111 11 111 11112234566677778888 Q ss_pred HHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEE Q lcl|NC_011045. 68 NLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYL 147 (536) Q Consensus 68 ~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~ 147 (536) ..++.|++ -| +.+...+... ...++.|+ .+||...+.++.++..++|.|.+++ T Consensus 92 ~~~~~l~g--~p----~~~~~~d~~~---------~~~l~~~~------------~n~~~~~~~e~~~~~~~~G~~~~~~ 144 (474) T protein:vir:94 92 QKVSYVAS--KP----VTYSCEDENV---------LKVIHDVL------------DTRWDNKLIDILTATSNKGIDWLQV 144 (474) T ss_pred HHHhhhhc--CC----ceeccCcHHH---------HHHHHHHH------------hccHHHHHHHHHHHHhhcCceEEEE Confidence 77766643 12 1233333211 11233332 4789999999999999999998877 Q ss_pred ecCCCCceeeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEE---EecCC Q lcl|NC_011045. 148 PEPEGSNYNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHI---YLDED 222 (536) Q Consensus 148 ~~~~~~~~~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v---~p~~~ 222 (536) ..+.. +.+++.+++..+.++..|. .+++...+|.++.. 
....+++|+-- +.+.+ T Consensus 145 ~~d~~-~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~--------------------~~~~~~~yt~~~~~~y~~~ 203 (474) T protein:vir:94 145 YINEN-GEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFN--------------------NEEKVEFWTDTTVTYYVLE 203 (474) T ss_pred EecCC-CeeEEEEEcccceEEEEcCCCCCceEEEEEEEEec--------------------CeEEEEEEeCCeEEEEEEc Confidence 66554 4467888888888877764 57777777766521 01123333210 00111 Q ss_pred CCceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecc Q lcl|NC_011045. 223 SGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNP 302 (536) Q Consensus 223 ~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~ 302 (536) ++.+..-...+...+......+++..+|++.++. +.+|.|=.+...+.+..+|.+.-......+....|.+++.. T Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g 278 (474) T protein:vir:94 204 NGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKN-----NPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKG 278 (474) T ss_pred CCccccccccCcCcccccccccCCCccceEEecC-----CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec Confidence 1111111111111121222345567888887654 46899999999999999999999999999999999887653 Q ss_pred ccccchhhhcc-CCCccee-cCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCCCCHHHHHHHHH Q lcl|NC_011045. 303 AGITQPRRLTK-AQTGDFV-TGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGERVTAEEIRYVAS 379 (536) Q Consensus 303 ~g~~~~~~~~~-~~~g~~~-~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~r~TAtEi~~r~~ 379 (536) -......++.. ...+.++ ....+++.. +....+.......++.++..|-..-..-. .....+...|+..+..+.. 
T Consensus 279 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~--l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~ 356 (474) T protein:vir:94 279 YEGEDLEEFMRGLKYYKAINVDGDGGVET--IQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYG 356 (474) T ss_pred CCcccchhhhhhhhccceeeccCCCceeE--EeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHH Confidence 22222222222 1222222 223333433 33344667777777777776644321111 1111122345544332222 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhh Q lcl|NC_011045. 380 ELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDP 459 (536) Q Consensus 380 E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~ 459 (536) -+... . .+ ....+...+.+++.++.+..-.. .....++|.|.-.+..- ..+.++. +.+.+ T Consensus 357 ~l~~k-~---~~-k~~~~~~~l~~~~~li~~~~~~~-~d~~~i~v~f~~~~p~~-~~e~a~~-------~~~~g------ 416 (474) T protein:vir:94 357 NLDLK-A---NK-LKNKATVAIQELISFIIDFNNLK-TDVKDIEISFNFNRMMN-DAEQSQI-------IAQSQ------ 416 (474) T ss_pred HHHHH-H---HH-HHHHHHHHHHHHHHHHHHHhCCC-cccceeeEEeccCcccC-HHHHHHH-------HHHcC------ Confidence 11111 1 11 11223334444444443321111 12234666653322211 1111111 11111 Q ss_pred cCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCC Q lcl|NC_011045. 460 DINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGL 532 (536) Q Consensus 460 ~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (536) .+....++..+ -+| + -.++|++++.++++..++..... ...... 
.++......+..+- T Consensus 417 ~iS~et~l~~l---~~v-~----D~~~E~eri~~E~~~~~~~~~~~----~~~~~~---~~~~~~~~~~~~~e 474 (474) T protein:vir:94 417 YLSRETLVKSS---PLV-D----DYKAELERIEQEQMEYNKQLPNL----DDGGAD---GAQQQEGSNNKESE 474 (474) T ss_pred CCCHHHHHHhC---CCC-C----CHHHHHHHHHHHHHHHHhhcccc----CCCCCC---CcccCCCCcccccC Confidence 12333333321 122 1 12466666655544322211100 000000 00001110111111 No 70 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=99.05 E-value=3.7e-09 Score=66.83 Aligned_cols=452 Identities=11% Similarity=-0.001 Sum_probs=204.7 Q ss_pred CCCccccc---cHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc---cCCC-CCcccccccccccchHHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGL---AEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL---FPKD-SDNASTDYVTPWQAVGARGLNNLASKL 73 (536) Q Consensus 1 Ma~~~~~~---~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~---~~~~-~~~~~~~~~~~~dst~~~a~~~Laa~l 73 (536) |-. .+.+ ..+.+++..++-+..+ .++++++.+|....- .... .....+...++..+-+...++..++.| T Consensus 31 ~~~-~~~~~~~~~~~i~~~i~~h~~~~---~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl 106 (502) T protein:vir:48 31 ADN-LEELMVNNWELLKNFINHHKLRQ---APRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYL 106 (502) T ss_pred ccc-hhhhccccHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCccccccccccccccccceeecchHHHHHHHHhhhh Confidence 311 1111 1233444444333333 345566666655421 1111 111112233555566666666666544 Q ss_pred HHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCC Q lcl|NC_011045. 74 MLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) Q Consensus 74 ~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~ 153 (536) |...+ +++..+.... ..+. ..+...+..++|....+++.+++.+||.|.+++..+.. 
T Consensus 107 ----~g~p~--~~~~~d~~~~---------~~~~-------~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded- 163 (502) T protein:vir:48 107 ----AGNPI--RVEYDDNEDN---------SQND-------DAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEY- 163 (502) T ss_pred ----cccCe--eEecCCccch---------hHHH-------HHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCC- Confidence 33222 3333322111 1222 23334456689999999999999999999887766544 Q ss_pred ceeeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEE Q lcl|NC_011045. 154 NYNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEE 231 (536) Q Consensus 154 ~~~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~ 231 (536) +.++++.++..+.++..|. .+++...+|.+..... .+....+++|+. + +. ++.. T Consensus 164 g~~~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~----------------~~~~~~~~iyt~------~-~i-~~~~ 219 (502) T protein:vir:48 164 DETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTL----------------QNAKDVVEIYTN------Q-HI-YTLD 219 (502) T ss_pred CceEEEEEcccceEEEEcCCCCCceEEEEEEEEEeec----------------CCcEEEEEEEeC------C-eE-EEEE Confidence 3456788877766665553 4666666655442211 111223333321 1 11 1111 Q ss_pred ecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchh-h Q lcl|NC_011045. 232 VEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPR-R 310 (536) Q Consensus 232 v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~-~ 310 (536) .++.........+++..+|++.++- +..|.|-.+.+++-+..++.+.-.+....+....|.+.+.-....... . 
T Consensus 220 ~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~ 294 (502) T protein:vir:48 220 ASDSFNEISVTPHAFGTVPITEFLN-----NADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQ 294 (502) T ss_pred eCCceeeccceecCCCccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccc Confidence 2222121223345577889877642 457999999999999999999999999999999998776432221111 1 Q ss_pred hccC-CCccee-------cCCcccccccccccccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCCCCHHHHHHHHHHH Q lcl|NC_011045. 311 LTKA-QTGDFV-------TGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFM-LNSAVQRTGERVTAEEIRYVASEL 381 (536) Q Consensus 311 ~~~~-~~g~~~-------~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~r~TAtEi~~r~~E~ 381 (536) .... ..+.+. .+...+..+..+....+.+.....++.+.+.|...-. .+......+...|+..+..... . T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~-~ 373 (502) T protein:vir:48 295 ASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLF-G 373 (502) T ss_pred hhhhhhcceeeccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHH-H Confidence 1100 011111 1122222333344444566667777777777643211 1111111123456666554432 2 Q ss_pred HHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcC Q lcl|NC_011045. 382 EDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDI 461 (536) Q Consensus 382 ~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~i 461 (536) +........++-.+.+.-++.-++.++...+.........++++|.-.+..- ..+.++.+... 
+.+ | T Consensus 374 l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d-~~e~a~~~~kl----~g~--------i 440 (502) T protein:vir:48 374 LDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESRLKITFTPNLPKS-LYEQVSILNDL----GGQ--------V 440 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcC-HHHHHHHHHHH----hcc--------C Confidence 2233333334444444444444445554444444444456788875544332 12222222211 111 1 Q ss_pred CHHHHHHHHHHHcCCChhhccC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCC Q lcl|NC_011045. 462 NLAMIKLRIANAIGIDTSGILL-TEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQP 534 (536) Q Consensus 462 d~d~~~~~~a~~~Gv~p~~i~r-s~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~ 534 (536) .-+.+++. +|. +. .++|++.+.+++++. ........ ........ ...+.-+...+...+-. T Consensus 441 S~et~l~~----l~~-----v~D~~~E~~ri~~E~~~~-~~~~~~~~-~~~~~~~~-~d~~~e~~~~~~~~~~~ 502 (502) T protein:vir:48 441 SQETALSL----SGL-----VENPTEELDKINEESSKI-DFKGYPSY-FYDNVGKY-TDEVKETHTDDFERVYE 502 (502) T ss_pred cHHHHHHh----CCC-----CCCHHHHHHHHHHHHHhh-hhhccccc-cccccccc-CCCccCCCCcCcCCCCC Confidence 22222222 332 22 346666665543321 11110000 00000000 00000000001111111 No 71 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=99.04 E-value=4.3e-09 Score=66.48 Aligned_cols=452 Identities=10% Similarity=-0.009 Sum_probs=205.5 Q ss_pred CCCcc--ccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc---cCC-CCCcccccccccccchHHHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKR--TGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL---FPK-DSDNASTDYVTPWQAVGARGLNNLASKLM 74 (536) Q Consensus 1 Ma~~~--~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~---~~~-~~~~~~~~~~~~~dst~~~a~~~Laa~l~ 74 (536) |.... ...+.+.+.+.-++....| .++++++.+|..-.- ... 
......+...++..+.+...++.+++.|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~---~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~ 107 (512) T protein:vir:97 31 YDGTESDLLQNINEVSKYIEHHMDYQ---RPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (512) T ss_pred cCchhhhhhhhHHHHHHHHHHHHHhh---HHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhc Confidence 43311 1112344444444444444 345566666654321 111 11111122345666777777777776554 Q ss_pred HhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCc Q lcl|NC_011045. 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSN 154 (536) Q Consensus 75 ~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~ 154 (536) + -| ++++..+.. +. ..+...+..++|.....++.+++.+||.+.+++..+.. + T Consensus 108 g--~p----~~~~~~d~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded-~ 160 (512) T protein:vir:97 108 G--NP----IQCQDDDKD-------------VL-------EAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD-D 160 (512) T ss_pred c--cC----ceeccCChH-------------HH-------HHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCC-C Confidence 3 12 122333221 11 12333445588999999999999999999887766544 3 Q ss_pred eeeEEEEecceEEEeeCCC--CCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEe Q lcl|NC_011045. 155 YNPMKLYRLSSYVVQRDAF--GNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEV 232 (536) Q Consensus 155 ~~~~~~~~l~~~~v~~d~~--G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v 232 (536) .+++..++..+.++..|.. +++...+|.+++.... ....+.-..+++|+. + ....|... 
T Consensus 161 ~~~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~------------~~~~~~~~~~~vyt~------~-~i~~~~~~ 221 (512) T protein:vir:97 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID------------KTDEDEVFTVDLFTS------H-GVYRYLTS 221 (512) T ss_pred ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeecc------------ccccceEEEEEEEeC------C-cEEEEEec Confidence 4578888887777766643 5666666555432110 001111122333321 1 11112211 Q ss_pred cCc-----cccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccc Q lcl|NC_011045. 233 EGM-----EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQ 307 (536) Q Consensus 233 ~g~-----~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~ 307 (536) .+. ........+++..+|++.++ .+..|+|-.+..++-+..++.+.-......+....|.+.+.-....+ T Consensus 222 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~ 296 (512) T protein:vir:97 222 RTNGLKLTPRENGFESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD 296 (512) T ss_pred CCCcccccccccccccccCcccceEeec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCC Confidence 111 00112344567788888754 34679999999999999999988888888999999987764322233 Q ss_pred hhhhccCCCcceec-------------CCcccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhcccCCCCCCCHHH Q lcl|NC_011045. 308 PRRLTKAQTGDFVT-------------GRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRTGERVTAEE 373 (536) Q Consensus 308 ~~~~~~~~~g~~~~-------------g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~~~r~TAtE 373 (536) ..++.....+..+. +..++..+..+....+.......++.++..|-..-.. +.....-+...|+.. 
T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~A 376 (512) T protein:vir:97 297 PVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA 376 (512) T ss_pred chhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHH Confidence 33322211111110 1112222333444456666777777777766432111 111111123346655 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCC-CCCCcceEEEEechHHH--HHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIP-ELPKEAVEPTISTGLEA--IGRGQDLDKLERCVAAWA 450 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp-~~~~~~v~v~~vs~La~--a~r~~~~~~l~~~~~~~~ 450 (536) +..... .+........+.-.+.+.-++..++.++...+... +..-..+++.|.-++.. ++....+.++ . T Consensus 377 l~~~~~-~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl---~---- 448 (512) T protein:vir:97 377 MKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS---G---- 448 (512) T ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHH---h---- Confidence 543322 22223333333333333334444444443333322 12223567777544332 3222222221 1 Q ss_pred hhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcC Q lcl|NC_011045. 451 ALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSV 530 (536) Q Consensus 451 ~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (536) .+ +....++.. ++. +-..++|++++.++++.+++..+.......... .....+.......+.. T Consensus 449 gi--------iS~et~~~~----l~~----v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 511 (512) T protein:vir:97 449 GK--------ISQTTLMSL----FSF----FQDPELEVKKIEEDEKESIKKAQKGIYKDPRDI-NDDEQDDDTKDTVDKK 511 (512) T ss_pred cc--------CchHHHHHh----CCC----CCCHHHHHHHHHHHHHHHHHHHhhcccCCCCCC-CCCCCCCCcccccccc Confidence 11 122222222 221 112456777766655443322221110000000 0000000000000111 Q ss_pred C Q lcl|NC_011045. 
531 G 531 (536) Q Consensus 531 ~ 531 (536) . T Consensus 512 ~ 512 (512) T protein:vir:97 512 E 512 (512) T ss_pred C Confidence 0 No 72 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=99.02 E-value=5.4e-09 Score=65.90 Aligned_cols=450 Identities=10% Similarity=0.002 Sum_probs=209.3 Q ss_pred CC--CccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc---ccCCCC-CcccccccccccchHHHHHHHHHHHHH Q lcl|NC_011045. 1 MA--EKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPS---LFPKDS-DNASTDYVTPWQAVGARGLNNLASKLM 74 (536) Q Consensus 1 Ma--~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~---~~~~~~-~~~~~~~~~~~dst~~~a~~~Laa~l~ 74 (536) |. +..+.++.+.+.+..++....+.+ +++++.+|..-. ...... ....+...++..+.+...+++.++.|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~---r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:96 31 YDGTESDLLQNVNEVSKYIEHHMDYQRP---RLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred ccchhhhhhcCHHHHHHHHHHHHHhhhH---HHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhc Confidence 33 344444566666666666655544 445555554322 111111 111122345666778888887776553 Q ss_pred HhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCc Q lcl|NC_011045. 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSN 154 (536) Q Consensus 75 ~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~ 154 (536) + -| + +++..+.. + ...+...+..++|.....++.++..+||.+.+++..+.. 
+ T Consensus 108 g--~p--~--~~~~~d~~-------------~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~d-g 160 (511) T protein:vir:96 108 G--NP--I--QYQDDDKD-------------V-------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD-D 160 (511) T ss_pred c--cC--c--eeecCchH-------------H-------HHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCC-C Confidence 2 12 1 22333221 1 123344445578999999999999999999887766544 3 Q ss_pred eeeEEEEecceEEEeeCCC--CCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEe Q lcl|NC_011045. 155 YNPMKLYRLSSYVVQRDAF--GNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEV 232 (536) Q Consensus 155 ~~~~~~~~l~~~~v~~d~~--G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v 232 (536) .++++.++..+.++..|.. +++...+|.+..... .....+....+++|+. + ....|..- T Consensus 161 ~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~------------~~~~~~~~~~~~vyt~------~-~i~~~~~~ 221 (511) T protein:vir:96 161 ETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPI------------DKTDEDEVFTVDLFTS------H-GVYRYLTN 221 (511) T ss_pred ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeec------------cccccceEEEEEEEeC------C-cEEEEEec Confidence 4578888877777666643 455555554433210 0001111122333321 1 11111111 Q ss_pred cCcc-----ccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccc Q lcl|NC_011045. 233 EGME-----VQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQ 307 (536) Q Consensus 233 ~g~~-----i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~ 307 (536) .+.. 
.......+++..+|++.++- +.+|+|-.+..++-+..++.+.-......+....|.+.+.-....+ T Consensus 222 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~ 296 (511) T protein:vir:96 222 RTNGLKLTPRENSFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD 296 (511) T ss_pred CCCcccccccccccccCcCcccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCC Confidence 1110 01123345677888877653 4579999999999999999988888888888889987765433333 Q ss_pred hhhhccCCCccee--------cC----CcccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhcccCCCCCCCHHHH Q lcl|NC_011045. 308 PRRLTKAQTGDFV--------TG----RPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRTGERVTAEEI 374 (536) Q Consensus 308 ~~~~~~~~~g~~~--------~g----~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~~~r~TAtEi 374 (536) ..++.....+.++ .+ ..++..+..+....+.......++.+++.|...-.. +.....-+...|+..+ T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al 376 (511) T protein:vir:96 297 PVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAM 376 (511) T ss_pred chhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHH Confidence 3332211111111 11 111222333444445666667777777666432111 1111111223456555 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCC-CCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 375 RYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIP-ELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALA 453 (536) Q Consensus 375 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp-~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~ 453 (536) ..... .+........+.-.+.+.-+++-++.++...+... +..-..+++.|.-++..-. 
.+.++.+....+ .+ T Consensus 377 ~~~~~-~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~-~e~~d~~~kl~G---~i- 450 (511) T protein:vir:96 377 KYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSL-IEELKAYIDSGG---KI- 450 (511) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCH-HHHHHHHHHHhc---cC- Confidence 44332 33333444444445555555554555554433221 1222356777755444321 122222222111 12 Q ss_pred chhhhhcCCHHHHHHHHHHHcCCChhhcc-CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCC Q lcl|NC_011045. 454 PMRDDPDINLAMIKLRIANAIGIDTSGIL-LTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGL 532 (536) Q Consensus 454 p~~~~~~id~d~~~~~~a~~~Gv~p~~i~-rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (536) ....++.. ++ ++ -.++|++++.++++.++..++... .... .+.+. ..++...- T Consensus 451 --------S~et~l~~----l~-----~v~d~~~El~ri~~E~~~~~~~~~~~~--~~~~---~~~~~----~~~~~~~~ 504 (511) T protein:vir:96 451 --------SQTTLMSL----FS-----FFQDPELEVKKIEEDEKESIKKAQKGI--YKDP---RDIND----DEQDDDTK 504 (511) T ss_pred --------ChHHHHHh----CC-----CCCCHHHHHHHHHHHHHHHHHHHhhcc--ccCC---CCCCC----CCCCCCcc Confidence 22222222 22 12 235677666665543322222111 0000 00000 00001100 Q ss_pred CCCC Q lcl|NC_011045. 533 QPGI 536 (536) Q Consensus 533 q~~~ 536 (536) ..+. T Consensus 505 ~~~~ 508 (511) T protein:vir:96 505 DTVD 508 (511) T ss_pred Cccc Confidence 1111 No 73 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=99.02 E-value=5.4e-09 Score=65.90 Aligned_cols=450 Identities=10% Similarity=0.002 Sum_probs=209.3 Q ss_pred CC--CccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc---ccCCCC-CcccccccccccchHHHHHHHHHHHHH Q lcl|NC_011045. 1 MA--EKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPS---LFPKDS-DNASTDYVTPWQAVGARGLNNLASKLM 74 (536) Q Consensus 1 Ma--~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~---~~~~~~-~~~~~~~~~~~dst~~~a~~~Laa~l~ 74 (536) |. 
+..+.++.+.+.+..++....+.+ +++++.+|..-. ...... ....+...++..+.+...+++.++.|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~---r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:78 31 YDGTESDLLQNVNEVSKYIEHHMDYQRP---RLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred ccchhhhhhcCHHHHHHHHHHHHHhhhH---HHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhc Confidence 33 344444566666666666655544 445555554322 111111 111122345666778888887776553 Q ss_pred HhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCc Q lcl|NC_011045. 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSN 154 (536) Q Consensus 75 ~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~ 154 (536) + -| + +++..+.. + ...+...+..++|.....++.++..+||.+.+++..+.. + T Consensus 108 g--~p--~--~~~~~d~~-------------~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~d-g 160 (511) T protein:vir:78 108 G--NP--I--QYQDDDKD-------------V-------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD-D 160 (511) T ss_pred c--cC--c--eeecCchH-------------H-------HHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCC-C Confidence 2 12 1 22333221 1 123344445578999999999999999999887766544 3 Q ss_pred eeeEEEEecceEEEeeCCC--CCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEe Q lcl|NC_011045. 155 YNPMKLYRLSSYVVQRDAF--GNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEV 232 (536) Q Consensus 155 ~~~~~~~~l~~~~v~~d~~--G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v 232 (536) .++++.++..+.++..|.. +++...+|.+..... .....+....+++|+. 
+ ....|..- T Consensus 161 ~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~------------~~~~~~~~~~~~vyt~------~-~i~~~~~~ 221 (511) T protein:vir:78 161 ETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPI------------DKTDEDEVFTVDLFTS------H-GVYRYLTN 221 (511) T ss_pred ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeec------------cccccceEEEEEEEeC------C-cEEEEEec Confidence 4578888877777666643 455555554433210 0001111122333321 1 11111111 Q ss_pred cCcc-----ccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccc Q lcl|NC_011045. 233 EGME-----VQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQ 307 (536) Q Consensus 233 ~g~~-----i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~ 307 (536) .+.. .......+++..+|++.++- +.+|+|-.+..++-+..++.+.-......+....|.+.+.-....+ T Consensus 222 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~ 296 (511) T protein:vir:78 222 RTNGLKLTPRENSFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD 296 (511) T ss_pred CCCcccccccccccccCcCcccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCC Confidence 1110 01123345677888877653 4579999999999999999988888888888889987765433333 Q ss_pred hhhhccCCCccee--------cC----CcccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhcccCCCCCCCHHHH Q lcl|NC_011045. 308 PRRLTKAQTGDFV--------TG----RPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRTGERVTAEEI 374 (536) Q Consensus 308 ~~~~~~~~~g~~~--------~g----~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~~~r~TAtEi 374 (536) ..++.....+.++ .+ ..++..+..+....+.......++.+++.|...-.. 
+.....-+...|+..+ T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al 376 (511) T protein:vir:78 297 PVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAM 376 (511) T ss_pred chhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHH Confidence 3332211111111 11 111222333444445666667777777666432111 1111111223456555 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCC-CCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 375 RYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIP-ELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALA 453 (536) Q Consensus 375 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp-~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~ 453 (536) ..... .+........+.-.+.+.-+++-++.++...+... +..-..+++.|.-++..-. .+.++.+....+ .+ T Consensus 377 ~~~~~-~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~-~e~~d~~~kl~G---~i- 450 (511) T protein:vir:78 377 KYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSL-IEELKAYIDSGG---KI- 450 (511) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCH-HHHHHHHHHHhc---cC- Confidence 44332 33333444444445555555554555554433221 1222356777755444321 122222222111 12 Q ss_pred chhhhhcCCHHHHHHHHHHHcCCChhhcc-CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCC Q lcl|NC_011045. 454 PMRDDPDINLAMIKLRIANAIGIDTSGIL-LTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGL 532 (536) Q Consensus 454 p~~~~~~id~d~~~~~~a~~~Gv~p~~i~-rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (536) ....++.. ++ ++ -.++|++++.++++.++..++... .... .+.+. ..++...- T Consensus 451 --------S~et~l~~----l~-----~v~d~~~El~ri~~E~~~~~~~~~~~~--~~~~---~~~~~----~~~~~~~~ 504 (511) T protein:vir:78 451 --------SQTTLMSL----FS-----FFQDPELEVKKIEEDEKESIKKAQKGI--YKDP---RDIND----DEQDDDTK 504 (511) T ss_pred --------ChHHHHHh----CC-----CCCCHHHHHHHHHHHHHHHHHHHhhcc--ccCC---CCCCC----CCCCCCcc Confidence 22222222 22 12 235677666665543322222111 0000 00000 00001100 Q ss_pred CCCC Q lcl|NC_011045. 
533 QPGI 536 (536) Q Consensus 533 q~~~ 536 (536) ..+. T Consensus 505 ~~~~ 508 (511) T protein:vir:78 505 DTVD 508 (511) T ss_pred Cccc Confidence 1111 No 74 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=98.99 E-value=7.3e-09 Score=65.20 Aligned_cols=453 Identities=10% Similarity=-0.001 Sum_probs=210.5 Q ss_pred CC--CccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc-----ccCCCCCcccccccccccchHHHHHHHHHHHH Q lcl|NC_011045. 1 MA--EKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPS-----LFPKDSDNASTDYVTPWQAVGARGLNNLASKL 73 (536) Q Consensus 1 Ma--~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~-----~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l 73 (536) |. ++.+.++.+.+.+..+.....|.+ +++++.+|..-. ..... ....+...++..+.+...+++.++.| T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~---r~~~l~~Yy~g~~~i~~~~~~~-~~~~~~~~ki~~n~~k~Iv~~~~~yl 106 (511) T protein:vir:99 31 YDGTESDLLQNVNEVSKYIEHHMDYQRP---RLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVAHDYASYISDFINGYF 106 (511) T ss_pred cchhhhhhhccHHHHHHHHHHHHHhhHH---HHHHHHHHhcccCccccccCcc-cccccCcceeecchHHHHHHHHHhhh Confidence 44 344444566666666665555544 555555555331 11111 11112234566677777777776655 Q ss_pred HHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCC Q lcl|NC_011045. 74 MLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) Q Consensus 74 ~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~ 153 (536) ++ - | + +++..+.. + ...+...+..++|.....++.++..+||.|.+++..+.. 
T Consensus 107 ~g-~-p--~--~~~~~d~~-------------~-------~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded- 159 (511) T protein:vir:99 107 LG-N-P--I--QYQDDDKD-------------V-------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD- 159 (511) T ss_pred cc-c-C--c--eeecCchH-------------H-------HHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCC- Confidence 32 1 2 1 22333221 1 123334445578999999999999999999887766544 Q ss_pred ceeeEEEEecceEEEeeCCC--CCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEE Q lcl|NC_011045. 154 NYNPMKLYRLSSYVVQRDAF--GNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEE 231 (536) Q Consensus 154 ~~~~~~~~~l~~~~v~~d~~--G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~ 231 (536) +.++++.++..+.++..|.. +++...+|.+..... .....+....+++|+. +. ...|.. T Consensus 160 ~~~~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~------------~~~~~~~~~~~~vyt~------~~-i~~~~~ 220 (511) T protein:vir:99 160 DETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPI------------DKTDEDEVFTVDLFTS------HG-VYRYLT 220 (511) T ss_pred CceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeec------------ccCccceEEEEEEEeC------Cc-EEEEEe Confidence 34678888888887777653 566666665544210 0001111122333321 11 111111 Q ss_pred ecCc-----cccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccc Q lcl|NC_011045. 232 VEGM-----EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGIT 306 (536) Q Consensus 232 v~g~-----~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~ 306 (536) -.+. ........+++..+|++.++- +..|+|-.+..++.+..++.+.-......+....|.+.+.-.+.. 
T Consensus 221 ~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~ 295 (511) T protein:vir:99 221 SRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL 295 (511) T ss_pred cCCccccccccccccccCCCCccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCccc Confidence 1100 001122345577889887764 357999999999999999999999999999888888766432222 Q ss_pred chhhhccCCCccee--------c----CCcccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhcccCCCCCCCHHH Q lcl|NC_011045. 307 QPRRLTKAQTGDFV--------T----GRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRTGERVTAEE 373 (536) Q Consensus 307 ~~~~~~~~~~g~~~--------~----g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~~~r~TAtE 373 (536) +...+.....+..+ . +..++..+..+....+.......++.+++.|-..-+. +.....-+...|+.. T Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~A 375 (511) T protein:vir:99 296 DPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA 375 (511) T ss_pred CchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHH Confidence 32222111111111 0 1111222333444445666677777777766432211 111111123345555 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCC-CCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011045. 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPE-LPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAAL 452 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~-~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~ 452 (536) +..+.. .+........+.-.+.+.-+++-++.++...+.... ..-..+++.|.-.+..- ..+.++.+... 
..+ T Consensus 376 lk~~~~-~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n-~~e~~~~~~kl----~Gi 449 (511) T protein:vir:99 376 MKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKS-LIEELKAYIDS----GGK 449 (511) T ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcC-HHHHHHHHHHH----hcc Confidence 544433 233333344444444444444444444444443221 11124677765433321 11122222111 111 Q ss_pred cchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCC Q lcl|NC_011045. 453 APMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGL 532 (536) Q Consensus 453 ~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (536) |....++..+ -+| + ..++|++++.++++.++...+...-........... +...+.....+- T Consensus 450 --------iS~et~l~~l---~~v-~----D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~d~~e 511 (511) T protein:vir:99 450 --------ISQTTLMSLF---SFF-Q----DPELEVKKIEEDEKESIKKAQKNMYQDPRNINDDEQ--DDSTKDSIDKKE 511 (511) T ss_pred --------CCHHHHHHhC---CCC-C----CHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCCCC--CCCCcCcccccC Confidence 2222333321 122 1 235677777666554333222111000000000000 001111111111 No 75 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=98.97 E-value=9.2e-09 Score=64.65 Aligned_cols=436 Identities=10% Similarity=0.040 Sum_probs=199.3 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc--cC---C-CC---CcccccccccccchHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL--FP---K-DS---DNASTDYVTPWQAVGARGLNNLAS 71 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~--~~---~-~~---~~~~~~~~~~~dst~~~a~~~Laa 71 (536) +....+ ...+.+.+..++.+.. ..++..+.+|..-.- .. . .. 
....+...++..+-+...++..++ T Consensus 29 ~~~~~e-~~~~~i~~~i~~~~~~----~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~ 103 (483) T protein:vir:12 29 TNNKPE-TLEEMIVRYIKQHLEK----LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVS 103 (483) T ss_pred cCCchh-hHHHHHHHHHHHHHHH----HHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHhh Confidence 223221 2244555555555433 345566666654321 00 0 00 011122335667778888888877 Q ss_pred HHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCC Q lcl|NC_011045. 72 KLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPE 151 (536) Q Consensus 72 ~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~ 151 (536) .|++ .| +.+...|.. ....++.|+ .++|.....++.++..+||.|.+++..+. T Consensus 104 ~l~G--~p----~~~~~~d~~---------~~~~l~~~~------------~n~~~~~~~~~~~~~~~~G~~y~~v~~d~ 156 (483) T protein:vir:12 104 YIVG--KP----IAFKHTDDE---------VVKRIDEVL------------GNRFDDKLHSVLTGASNKGIEWLHPYLDE 156 (483) T ss_pred hhcc--cC----ceeccCChH---------HHHHHHHHH------------hccHHHHHHHHHHHHhhCCeEEEEEEEcC Confidence 6532 12 123333221 112233332 35788889999999999999988776655 Q ss_pred CCceeeEEEEecceEEEeeC--CCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEE--E-EecCCCCce Q lcl|NC_011045. 152 GSNYNPMKLYRLSSYVVQRD--AFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTH--I-YLDEDSGEY 226 (536) Q Consensus 152 ~~~~~~~~~~~l~~~~v~~d--~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~--v-~p~~~~~~~ 226 (536) . +.+++++++..+.++..| ..+++...+|.++.. ....+++|+- + +...+++.. 
T Consensus 157 d-~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~--------------------~~~~~~~y~~~~v~~~~~~~~~~ 215 (483) T protein:vir:12 157 E-GEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLE--------------------NETKVEYWDKVTVNYYVYENGSL 215 (483) T ss_pred C-CceEEEEEcccceEEEEcCCCCCceEEEEEEEEee--------------------cceEEEEEecCeEEEEEEeCCee Confidence 4 345788888888777665 457777766665431 0112333321 0 001112111 Q ss_pred eE--EEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccc Q lcl|NC_011045. 227 IR--YEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAG 304 (536) Q Consensus 227 ~~--~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g 304 (536) .. ....++..+ ....+++..+|++.++- +.+|+|-.+...+.+..+|.+.-......+....|.+++.-.. T Consensus 216 ~~~~~~~~~~~~~--~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~ 288 (483) T protein:vir:12 216 IPDYSNNLENSKT--HFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD 288 (483) T ss_pred eeccccccccccc--ccccCCCCccceEEecC-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC Confidence 11 111122222 22345567888887654 4579999999999999999998889999999999987764222 Q ss_pred ccchhhhcc-CC-CcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhcccCCCCCCCHHHHHHHHHHH Q lcl|NC_011045. 305 ITQPRRLTK-AQ-TGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRTGERVTAEEIRYVASEL 381 (536) Q Consensus 305 ~~~~~~~~~-~~-~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~~~r~TAtEi~~r~~E~ 381 (536) ..+...... .. .+.+.....+++..+ ....+.......++.+++.|...-.. 
+.....-+...|+..+..+..-+ T Consensus 289 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l 366 (483) T protein:vir:12 289 DQELPEFKRLLRYYGAIKVSDNGGVDTI--QVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNL 366 (483) T ss_pred cccchhHHHhhhhccccccCCCCcceEE--eecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHH Confidence 222222111 11 123322333444433 33445666777777777766442211 11111112234554433222111 Q ss_pred HHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcC Q lcl|NC_011045. 382 EDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDI 461 (536) Q Consensus 382 ~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~i 461 (536) . .......+. +...+.+++.++.+.--.+ .....++|+|.-.+..- ..+.++.+... +.+ + T Consensus 367 ~-~k~~~~~~~----f~~~l~~~~~li~~~~~~~-~~~~~i~v~f~~~~p~~-~~~~a~~~~kl----~Gi--------i 427 (483) T protein:vir:12 367 N-LKADKLARK----AKVAIQELLWFVFEHFDIK-GEHKDVDISFNYNKVAN-TELQVQTAQQS----MGI--------V 427 (483) T ss_pred H-HHHHHHHHH----HHHHHHHHHHHHHHHhcCC-CccceeeEEeCCCCCCC-HHHHHHHHHHH----hcc--------C Confidence 1 111222222 3333444444433321111 12345677665433321 11112222211 111 2 Q ss_pred CHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCC Q lcl|NC_011045. 462 NLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQP 534 (536) Q Consensus 462 d~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~ 534 (536) ....++. .++. +--.++|++++.++++..++..+... ....+ +.+...+.++.. |. 
T Consensus 428 S~et~~~----~~~~----v~d~~~E~~ri~~E~~~~~~~~~~~~----~~~~d---~~~~~~~~~~~e--~e 483 (483) T protein:vir:12 428 SHETVLE----NHPF----VEDLQAELERIEQEQMEYNKQLPNLD----DGGAD---GAQQQERSNNKE--SE 483 (483) T ss_pred chHHHHH----hCCC----CCCHHHHHHHHHHHHHHHHhhccccc----ccccC---CcccCCCCCccc--CC Confidence 2222222 2221 11235677776665543332221111 00000 000011111111 22 No 76 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.94 E-value=1.2e-08 Score=63.99 Aligned_cols=440 Identities=12% Similarity=0.073 Sum_probs=188.6 Q ss_pred CCCccccccHHHHHHHHHHH-----------------HHHhhhHHHHHHHHHHHhcccccCCC-CCccc-ccccccccch Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERL-----------------KNDRAPYETRAQNCAQYTIPSLFPKD-SDNAS-TDYVTPWQAV 61 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l-----------------~~~R~~~e~~w~e~~~~~~P~~~~~~-~~~~~-~~~~~~~dst 61 (536) |-+.- .+.++.-+.++ ...+......|+++|+=--+-..... ...+. ....++--+. T Consensus 1 m~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~ 76 (496) T protein:vir:38 1 MINQI----IAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNL 76 (496) T ss_pred ChhHH----HHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecch Confidence 22211 11111111111 11222334556555432112111111 11111 1122333355 Q ss_pred HHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhC Q lcl|NC_011045. 62 GARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAG 141 (536) Q Consensus 62 ~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G 141 (536) +...++.+|+-| |..-+ .++.++. 
+..++|.+ .+..++|...+.++..+...+| T Consensus 77 ~k~i~~~~a~~l----~~~p~--~i~~~d~-------------~~~e~l~~-------~~~~n~f~~~~~~~~~~a~~~G 130 (496) T protein:vir:38 77 PKVTAKYMSKLL----FNEKV--KINIDDK-------------AAEEFVLN-------VLKTNGFTKNMERYIEYGEAMG 130 (496) T ss_pred HHHHHHHHhhhh----hCCcc--eEeeCCh-------------HHHHHHHH-------HHhccCHHHHHHHHHHHHhhhC Confidence 666666666543 32112 2333332 12333333 3445889999999999999999 Q ss_pred cEEEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecC Q lcl|NC_011045. 142 NVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDE 221 (536) Q Consensus 142 ~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~ 221 (536) .+.+++..+.. +.+.+..++...++--.+..|++..+..-..++. +++.+..++.++. T Consensus 131 ~~~~~~~~D~~-~~~~i~~v~~~~~~P~~~~~~~~~~~~f~~~~~~----------------~~~~y~~le~h~~----- 188 (496) T protein:vir:38 131 GFVIKVYHDGN-KNVKVSFATADCMYPLSNDSENVDECVIANSFHK----------------NNKYYTLLEWNEW----- 188 (496) T ss_pred cEEEEEEEcCC-CcEEEEEEcccceEEEEecCCcEEEEEEEEEEEe----------------CCeEEEEEEEEEE----- Confidence 99987766554 3467899999998855555677765443222221 1122222222211 Q ss_pred CCCce----eEEEEec----Ccccc---------ccccccccccCceEEEe----eeecCCCccccchHHHHHHHHHHHH Q lcl|NC_011045. 222 DSGEY----IRYEEVE----GMEVQ---------GSDGTYPKEACPYIPIR----MVRLDGESYGRSYIEEYLGDLRSLE 280 (536) Q Consensus 222 ~~~~~----~~~~~v~----g~~i~---------~~~~~~~~~~~P~~~~r----w~~~~ge~YGrgp~~~~l~d~~~L~ 280 (536) +++++ ..|.+-+ |..+. 
.......+...||+.++ .+...++.||+|-..++++-+..|+ T Consensus 189 ~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld 268 (496) T protein:vir:38 189 QGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLD 268 (496) T ss_pred eCceEEEEEEEEecCCccccCccccccccccccccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHH Confidence 11222 1122211 11111 11111223344454432 3446678999999999999999999 Q ss_pred HHHHHHHHHHHHHhCCceeeccccccchhhhccCCC--------cce--ecCCc-cccc-ccccccccchhHHHHHHHHH Q lcl|NC_011045. 281 NLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQT--------GDF--VTGRP-EDIS-FLQLEKQADFTVAKAVSDAI 348 (536) Q Consensus 281 ~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~--------g~~--~~g~~-~~~~-~~~~~~~~~~~~~~~~i~~~ 348 (536) ..--......+. .++.+.|+++ ++....-..+.. ..+ +.+.. ++.. +..+...-+...-...++.+ T Consensus 269 ~~~s~~~~~~~~-~~~~i~v~~~-~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~ 346 (496) T protein:vir:38 269 LMFDSYYQEFKL-GKKKVLVPSS-FVKTAVNLDGSTTQYFDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAM 346 (496) T ss_pred HHHHHHHHHHhh-cccceecchH-HhhccCCCCCccccCCCCccceEEEeecCCCcccccceeeccccCHHHHHHHHHHH Confidence 887777776665 5777777533 222111001110 001 11111 1111 11111111112223333333 Q ss_pred HHHHHHHH-h-hhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCC--CCCCcceEE Q lcl|NC_011045. 349 EARLSFAF-M-LNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIP--ELPKEAVEP 424 (536) Q Consensus 349 ~~rI~~af-~-~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp--~~~~~~v~v 424 (536) .+.|.... 
+ ...+....+...||+||..+.+.......- ..+.....|..+++-++.+....+.+- ..++..+++ T Consensus 347 l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~-~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v 425 (496) T protein:vir:38 347 LRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNS-HSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITV 425 (496) T ss_pred HHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEE Confidence 33333221 0 111222233346999999888777776544 555555666677666665543222211 123445677 Q ss_pred EEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 425 TISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNG 504 (536) Q Consensus 425 ~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~ 504 (536) .|.-++..-. ...++++.+..+ .+ + +....+ +....|+ |++|++++.++.+..+ +++ T Consensus 426 ~f~d~i~~d~-~~~~~~~~~~~~----~G---i---iS~et~---l~~~~~~-------~d~ea~~el~ri~~E~--~~~ 482 (496) T protein:vir:38 426 DFDDSIAQDE-DTTINRYTNAKN----QG---M---IPLKIA---LQRAWNI-------TEAEADEWAEMLAKEK--QAE 482 (496) T ss_pred EeCCCCCCCH-HHHHHHHHHHHh----cC---C---CCHHHH---HHhcCCC-------ChHHHHHHHHHHHHhh--hcc Confidence 7644333211 122222222221 11 0 222222 2333444 3444433322211111 000 Q ss_pred HHHHHHHHHHhhhcCcchHHhhhhcCCCC Q lcl|NC_011045. 505 AAALAQGMAAQATASPEAMAAAADSVGLQ 533 (536) Q Consensus 505 a~~~~~~~~~~~~~~~~~~~~~~~~~~~q 533 (536) .+ +... .+. 
.+..| T Consensus 483 ~~--------~~d~----~~~---~~~~e 496 (496) T protein:vir:38 483 MP--------NNDM----NGI---FGEEE 496 (496) T ss_pred Cc--------cccc----cCC---CCCCC Confidence 00 0000 000 01111 No 77 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=98.93 E-value=1.3e-08 Score=63.80 Aligned_cols=453 Identities=11% Similarity=0.005 Sum_probs=208.2 Q ss_pred CC--CccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc-----ccCCCCCcccccccccccchHHHHHHHHHHHH Q lcl|NC_011045. 1 MA--EKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPS-----LFPKDSDNASTDYVTPWQAVGARGLNNLASKL 73 (536) Q Consensus 1 Ma--~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~-----~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l 73 (536) |. +..+.++.+.+.+..++....+.+ +++++.+|..-. +.... ....+...++..+.+...++..++.| T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~---r~~~l~~Yy~g~~~i~~~~~~~-~~~~~~~~ki~~n~~k~Iv~~~~~yl 106 (511) T protein:vir:96 31 YDGTESDLLQNVNEVSKYIEHHMDYQRP---RLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVAHDYASYISDFINGYF 106 (511) T ss_pred cchhhhhhhccHHHHHHHHHHHHHhhHH---HHHHHHHHhcccCccccccCcC-cccccCcceeecchHHHHHHHHHhhh Confidence 43 344445566666666655555543 555555555432 11111 11111223555666777777766544 Q ss_pred HHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCC Q lcl|NC_011045. 74 MLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) Q Consensus 74 ~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~ 153 (536) + ...+ +++..+.. + ...+...+..++|.....++.+++.+||.+.+++..+.. 
T Consensus 107 ~----g~p~--~~~~~~~~-------------~-------~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded- 159 (511) T protein:vir:96 107 L----GNPI--QYQDDDKD-------------V-------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD- 159 (511) T ss_pred c----cCCc--eeecCchH-------------H-------HHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCC- Confidence 3 2112 22333221 1 123444555689999999999999999999887766544 Q ss_pred ceeeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEE Q lcl|NC_011045. 154 NYNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEE 231 (536) Q Consensus 154 ~~~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~ 231 (536) +.++++.++..+.++..|. .+++...+|.+.....+ ....+.-..+++|+. + ....|.. T Consensus 160 ~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d------------~~~~~~~~~~~iyt~------~-~i~~~~~ 220 (511) T protein:vir:96 160 DETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID------------KTDEDEVFTVDLFTS------H-GVYRYLT 220 (511) T ss_pred CceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc------------ccccceEEEEEEEeC------C-cEEEEEe Confidence 3457888877777766654 35666666555432111 001111112233221 1 1112222 Q ss_pred ecCc-----cccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccc Q lcl|NC_011045. 232 VEGM-----EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGIT 306 (536) Q Consensus 232 v~g~-----~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~ 306 (536) -.+. ........+++..+|++.++- +.+|+|-.+..++-+..++.+.-......+...+|.+.+.-.... 
T Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~ 295 (511) T protein:vir:96 221 SRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL 295 (511) T ss_pred cCCCcccccccccccccccCCceeeEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccC Confidence 1111 011123455677889888764 457999999999999999999999999999999998776533333 Q ss_pred chhhhccCCCcceec--------C----CcccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhcccCCCCCCCHHH Q lcl|NC_011045. 307 QPRRLTKAQTGDFVT--------G----RPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRTGERVTAEE 373 (536) Q Consensus 307 ~~~~~~~~~~g~~~~--------g----~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~~~r~TAtE 373 (536) +..++.....+..+. + ..++..+..+....+.+.....++.+.+.|...-.. +.....-+...|+.. T Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~A 375 (511) T protein:vir:96 296 DPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA 375 (511) T ss_pred CchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHH Confidence 333322111111110 0 111222333444456667777777777776442211 111111123456666 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCC-CCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011045. 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIP-ELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAAL 452 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp-~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~ 452 (536) +..... ..........+.-.+.+.-+++-++.++...+... ...-..+++.|.-++..- ..+.++.+... 
+.+ T Consensus 376 l~~~~~-~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n-~~e~~~~~~kl----~G~ 449 (511) T protein:vir:96 376 MKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKS-LIEELKAYIDS----GGK 449 (511) T ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCCCCC-HHHHHHHHHHH----hcc Confidence 554433 22233333334334444434333333333332221 111235677775433321 11122222211 111 Q ss_pred cchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCC Q lcl|NC_011045. 453 APMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGL 532 (536) Q Consensus 453 ~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (536) +....++. .++... -.++|++++.+++..++...+... .... ......+.-.+......- T Consensus 450 --------iS~et~l~----~l~~v~----D~~~E~~ri~~E~~~~~~~~~~~~--~~~~--~~~~~~~~~~~~~~~~~~ 509 (511) T protein:vir:96 450 --------ISQTTLMS----LFSFFQ----DPELEVKKIEEDEKESIKKAQKGI--YKDP--RDINDDEQDDDTKDTVDK 509 (511) T ss_pred --------CChHHHHH----hCCCCC----CHHHHHHHHHHHHHHHHHHHhhcc--ccCC--CCCCCCCCCCcccccccc Confidence 22222332 222101 235677777665543322222111 0000 000000000000000000 Q ss_pred CC Q lcl|NC_011045. 533 QP 534 (536) Q Consensus 533 q~ 534 (536) .. T Consensus 510 ~~ 511 (511) T protein:vir:96 510 KE 511 (511) T ss_pred cC Confidence 00 No 78 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=98.91 E-value=1.5e-08 Score=63.47 Aligned_cols=451 Identities=11% Similarity=-0.000 Sum_probs=207.7 Q ss_pred CCCc--cccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc---cCCC-CCcccccccccccchHHHHHHHHHHHHH Q lcl|NC_011045. 1 MAEK--RTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL---FPKD-SDNASTDYVTPWQAVGARGLNNLASKLM 74 (536) Q Consensus 1 Ma~~--~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~---~~~~-~~~~~~~~~~~~dst~~~a~~~Laa~l~ 74 (536) +-.. 
-+..+.+.+++..+.-+..+ .++|+++.+|..... .... .....+...++..+.+...+++.++.|+ T Consensus 30 ~~~~~~~~~~~~~~l~~~i~~~~~~~---~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~ 106 (501) T protein:vir:27 30 ADNLEELMVNNWELLKNFINHHKLRQ---APRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYLA 106 (501) T ss_pred cccccccccccHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCccccccCccCccccccceeccchHHHHHHHHhhhhc Confidence 3221 11112334555444444333 345666666765421 1111 1111222345666777777777776654 Q ss_pred HhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCc Q lcl|NC_011045. 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSN 154 (536) Q Consensus 75 ~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~ 154 (536) + ..+ +++..+... ...+. ..+......++|.....++.++..+||.+.+++..+.. + T Consensus 107 g----~p~--~~~~~d~~~---------~~~~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded-~ 163 (501) T protein:vir:27 107 G----NPI--RVEYDDNDN---------NSQND-------DTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEY-D 163 (501) T ss_pred c----cCe--eEecCCccc---------hHHHH-------HHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCC-C Confidence 3 111 233332211 11222 23334455689999999999999999999988876654 3 Q ss_pred eeeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEe Q lcl|NC_011045. 155 YNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEV 232 (536) Q Consensus 155 ~~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v 232 (536) .++++.++..+.++..|. .+++...+|.+..... .+....+++|+. + ....| .. 
T Consensus 164 ~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~----------------~~~~~~~~vyt~------~-~v~~~-~~ 219 (501) T protein:vir:27 164 ETRIKRLNPLETFVIYDNSLEDNSIAAVRYYNRGTL----------------QNAKDVVEIYTN------E-HIYTL-DA 219 (501) T ss_pred ceEEEEEccceeEEEecCCCCCceEEEEEEEEeeec----------------CCcEEEEEEEeC------C-eEEEE-Ee Confidence 456788877776666554 3556555554432111 111122333321 1 11111 11 Q ss_pred cCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchh-hh Q lcl|NC_011045. 233 EGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPR-RL 311 (536) Q Consensus 233 ~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~-~~ 311 (536) .+.........+++..+|++.++- +..|+|-.+..++-+..++.+.-.+....+....|.+.+.-....... .. T Consensus 220 ~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~ 294 (501) T protein:vir:27 220 SDDFNEISVTTHAFGTVPITEFLN-----NVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQA 294 (501) T ss_pred CCceeeccccccCCCcccEEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccch Confidence 111111122345577899887653 457999999999999999999999999999999998776422111110 00 Q ss_pred ccC-CCcceec-------CCcccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhcccCCCCCCCHHHHHHHHHHHH Q lcl|NC_011045. 312 TKA-QTGDFVT-------GRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRTGERVTAEEIRYVASELE 382 (536) Q Consensus 312 ~~~-~~g~~~~-------g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~~~r~TAtEi~~r~~E~~ 382 (536) ... ..+.+.. +..++..+..+....+.+.....++.+++.|...-.. +......+...|+..+..... 
.+ T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~-~l 373 (501) T protein:vir:27 295 SDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKLF-GL 373 (501) T ss_pred hhhhhcCceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHHHHHHHHH-HH Confidence 000 1112211 1122222333333344555666677776666442211 111111123345555443322 22 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCC Q lcl|NC_011045. 383 DTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDIN 462 (536) Q Consensus 383 ~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id 462 (536) ........+.-.+.+.-+++.++.++...+....+....++|.|.-.+... ..+.++.+... +++ |. T Consensus 374 ~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n-~~e~ad~~~kl----~g~--------iS 440 (501) T protein:vir:27 374 DQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKS-LNEQVSILTGL----GGQ--------VS 440 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcC-HHHHHHHHHHH----hcc--------Cc Confidence 233344444444445555555555554455444454456788875544432 12222222221 111 12 Q ss_pred HHHHHHHHHHHcCCChhhccC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcch-HHhhhhcCCCCCCC Q lcl|NC_011045. 463 LAMIKLRIANAIGIDTSGILL-TEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEA-MAAAADSVGLQPGI 536 (536) Q Consensus 463 ~d~~~~~~a~~~Gv~p~~i~r-s~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~-~~~~~~~~~~q~~~ 536 (536) ...++. .++ ++. .++|+++++++++... ...++. .+-.+....... .....+.. .++. 
T Consensus 441 ~et~l~----~l~-----~v~D~~~E~eri~~E~~e~~-~~~~~~----~~~~~~~~~~d~~~~~~~d~~--e~~~ 500 (501) T protein:vir:27 441 QETALS----LSG-----LVESPNEELDKINKEVSEID-FKGYSN----DFNEHVGKYTDEVKETHTDDF--ERAY 500 (501) T ss_pred HHHHHH----hCC-----CCCCHHHHHHHHHHHHHhhh-HhhhcC----ccccccccccCCCCCCccccc--cccC Confidence 222222 222 122 3567766665543211 111111 000011111000 00111111 1222 No 79 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=98.91 E-value=1.6e-08 Score=63.31 Aligned_cols=453 Identities=11% Similarity=-0.005 Sum_probs=205.7 Q ss_pred CCC--ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc---cCCC-CCcccccccccccchHHHHHHHHHHHHH Q lcl|NC_011045. 1 MAE--KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL---FPKD-SDNASTDYVTPWQAVGARGLNNLASKLM 74 (536) Q Consensus 1 Ma~--~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~---~~~~-~~~~~~~~~~~~dst~~~a~~~Laa~l~ 74 (536) |-. +...-+.+.+++..+..+..+ .++++++.+|....- .... .........++..+.+...++..++.|+ T Consensus 30 ~~~~~~~~~~~~~~i~~~i~~~~~~~---~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~ 106 (501) T protein:vir:96 30 ADNLEELMVNNWELLKNFINHHKLRQ---APRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLA 106 (501) T ss_pred ccccccccCChHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCCcccCccccCccccccceeecchHHHHHHHHhhhhc Confidence 333 222222344555555444443 345666666664421 1111 1111222345667778888887776554 Q ss_pred HhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCc Q lcl|NC_011045. 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSN 154 (536) Q Consensus 75 ~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~ 154 (536) + ..+ ++...+.. ...++. +.+...+..++|.....++.++..+||.|.+++..+.. 
+ T Consensus 107 g----~p~--~~~~~~~~---------~~~~~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~ded-g 163 (501) T protein:vir:96 107 G----NPI--RVEYDDND---------DNSQND-------DAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEY-D 163 (501) T ss_pred c----cCe--eEeeCCcc---------chhHHH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCC-C Confidence 3 111 22332211 111222 33344555689999999999999999999888766544 3 Q ss_pred eeeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEe Q lcl|NC_011045. 155 YNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEV 232 (536) Q Consensus 155 ~~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v 232 (536) .++++.++..+.++..|. .|++...+|.+..... ......+++|+. + ....|. . T Consensus 164 ~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~----------------~~~~~~~~vyt~------~-~i~~~~-~ 219 (501) T protein:vir:96 164 ETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTL----------------QSAKDVVEIYTD------E-HIYTLD-A 219 (501) T ss_pred ceEEEEEccceeEEEEcCCCCCceEEEEEEEEeecC----------------CCcEEEEEEEcC------C-cEEEEe-e Confidence 467888888887777765 3666666655432111 001112233221 1 111111 1 Q ss_pred cCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchh-hh Q lcl|NC_011045. 233 EGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPR-RL 311 (536) Q Consensus 233 ~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~-~~ 311 (536) .+.........+++..+|++.++ .+..|+|-.+...+.+..++.+.-......+....|.+.+.-....... .. 
T Consensus 220 ~~~~~~~~~~~~~~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~ 294 (501) T protein:vir:96 220 SDDFNEISVTTHAFGTVPITEYL-----NNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQA 294 (501) T ss_pred CCCceeccccccCCCccceEEec-----CCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccch Confidence 11111112234557788987653 3567999999999999999999999999999999998776422111110 00 Q ss_pred ccC-CCcceec-------CCcccccccccccccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCCCCHHHHHHHHHHHH Q lcl|NC_011045. 312 TKA-QTGDFVT-------GRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFM-LNSAVQRTGERVTAEEIRYVASELE 382 (536) Q Consensus 312 ~~~-~~g~~~~-------g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~r~TAtEi~~r~~E~~ 382 (536) ... ..+.+.. +...+..+..+....+.......++.+++.|...-. .+......+...|+..+.....-+ T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l- 373 (501) T protein:vir:96 295 SDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKLFGL- 373 (501) T ss_pred hhhhhcCeeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHH- Confidence 000 1111111 111122222233333445556666666665543211 111111112344666554432222 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCC Q lcl|NC_011045. 383 DTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDIN 462 (536) Q Consensus 383 ~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id 462 (536) ........+.-.+-+.-+++.++.++...+.........++|+|.-++..- ..+.++.+....+ . |. 
T Consensus 374 ~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n-~~e~ad~~~kl~g---~---------iS 440 (501) T protein:vir:96 374 DQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKS-LNEQVSILTGLGG---Q---------VS 440 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccceEEeCCCCCcC-HHHHHHHHHHHhc---c---------Cc Confidence 222233333333334444444455554454444444456788876555432 1222222222211 1 22 Q ss_pred HHHHHHHHHHHcCCChhhccC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCC Q lcl|NC_011045. 463 LAMIKLRIANAIGIDTSGILL-TEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQP 534 (536) Q Consensus 463 ~d~~~~~~a~~~Gv~p~~i~r-s~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~ 534 (536) ...++.. ++ ++- .++|++++.++++.... .....+.............+....-.+.. =. T Consensus 441 ~et~~~~----l~-----~v~D~~~E~~ri~~E~~~~~~-~~~~~~~~~~~~~~~~~~~e~~~d~~e~~--~~ 501 (501) T protein:vir:96 441 QETALSL----SG-----LVESPNEELDKINKEMSEIDF-KGYSNDFNEHVGKYTDEVKETHTDDFERE--YE 501 (501) T ss_pred hHHHHHh----CC-----CCCCHHHHHHHHHHHHHHhhc-cccccchhhcccccCCcCCCCCCCccccc--cC Confidence 2223222 22 122 25666666544432110 00001000000000000000000000000 00 No 80 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=98.90 E-value=1.8e-08 Score=63.06 Aligned_cols=432 Identities=9% Similarity=0.019 Sum_probs=206.5 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc--cCCCCCcccccccccccchHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL--FPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALF 78 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~--~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 78 (536) +.+ .+.++.+.+.+..+..+.. 
.++++.+.+|..-.- .........+...++..+.+...+++.++.|++ T Consensus 11 ~~~-~~~~~~~~i~~~i~~~~~~----~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g--- 82 (452) T protein:vir:36 11 FSK-DEPITVEVVTKFMEKHKLE----VARYEYLKNMYLGIMAIDDEPAKDSWKPDNRLAVNFTKYIVDTFTGYFNG--- 82 (452) T ss_pred cCC-ccCCCHHHHHHHHHHHHHH----HHHHHHHHHHhccccccccCccccccCccceeecchHHHHHHHHhhhhcc--- Confidence 444 3455677777776665543 345566666665431 111111111223456666777777777766643 Q ss_pred CCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeE Q lcl|NC_011045. 79 PMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPM 158 (536) Q Consensus 79 P~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~ 158 (536) ..+ .+...+.. . ...+...+..++|....+++.++..+||.+++++..+.. +.+++ T Consensus 83 -~~~--~~~~~d~~-------------~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~-g~~~i 138 (452) T protein:vir:36 83 -IPV--KKSHSDKE-------------I-------LTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDED-TQTNV 138 (452) T ss_pred -cCc--eeecCChh-------------H-------HHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCC-CeeEE Confidence 212 23333321 1 112334555689999999999999999999888766554 34578 Q ss_pred EEEecceEEEeeCCC--CCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcc Q lcl|NC_011045. 159 KLYRLSSYVVQRDAF--GNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGME 236 (536) Q Consensus 159 ~~~~l~~~~v~~d~~--G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~ 236 (536) ..++..+.+...|.. +++...+|.+.-. +....+++|+. +..+. 
|....+.- T Consensus 139 ~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~-------------------~~~~~~~vyt~------~~i~~-~~~~~~~~ 192 (452) T protein:vir:36 139 VYNSPENMFMVYDDTVKQEPLFAVRYGVDE-------------------DKKLQGEVYTL------LETIK-ISGENDEI 192 (452) T ss_pred EEEcccceEEEEcCCCCCceEEEEEEEEec-------------------CceEEEEEEec------CeEEE-EEEcCCce Confidence 888877776666653 3444444433210 11123444432 11111 11111111 Q ss_pred ccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCC Q lcl|NC_011045. 237 VQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQT 316 (536) Q Consensus 237 i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~ 316 (536) .......+++..+|++.++. +..|+|-.+...+-+..++.+.-......+....|.+.+.- .....+....... T Consensus 193 ~~~~~~~~~~g~iPvv~~~n-----~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g-~~~~~~~~~~~~~ 266 (452) T protein:vir:36 193 SFGEGTYNPYPDLPVVEFYF-----NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLG-AAVEEEDLKNIRS 266 (452) T ss_pred EEecceeccCCcccEEEecC-----CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeec-CCcCchhhhhhhh Confidence 11223345567889877654 34689999999999999999999999999999999877742 2222233222222 Q ss_pred ccee--cCC--cccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHH Q lcl|NC_011045. 317 GDFV--TGR--PEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSIL 392 (536) Q Consensus 317 g~~~--~g~--~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl 392 (536) +..+ +.. ..+..+..+....+.......++.+++.|-..-..-..........|+..+..+..-+.. ...-..+. 
T Consensus 267 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~-k~~~~~~~ 345 (452) T protein:vir:36 267 NRVINYYADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSN-LALSFQRK 345 (452) T ss_pred cceEEecCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccccCcccccCCcHHHHHHHHHHHHH-HHHHHHHH Confidence 2221 111 111122233334456666777777777664322111111112224566655443322222 22223333 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHH--HHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHH Q lcl|NC_011045. 393 SQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLE--AIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRI 470 (536) Q Consensus 393 ~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La--~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~ 470 (536) -...+..+++-++.++...|.- .....++|+|.-++. .++.+. .+... +++ |....++ T Consensus 346 ~~~~l~~~~~li~~~~~~~~~~--~~~~~i~i~f~~~~p~d~~~~a~---~~~k~----~g~--------iS~et~~--- 405 (452) T protein:vir:36 346 FQSSLNSRYKLFCELSTNVSNK--DSWKDIEYTFTRNEPKDIKEQAE---TANIL----MGI--------TSQETAL--- 405 (452) T ss_pred HHHHHHHHHHHHHHHHhccCCc--cccccceEEeCCCCCcCHHHHHH---HHHHH----hcc--------CChHHHH--- Confidence 3334444444455555444432 223456777654443 222222 22111 111 2222222 Q ss_pred HHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCC Q lcl|NC_011045. 471 ANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQP 534 (536) Q Consensus 471 a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~ 534 (536) ..+|... -.++|++++.+++.+..+..+. .+....+.....-.+. |. 
T Consensus 406 -~~~~~~~----d~~~E~~ri~~E~~~~~~~~~~---------~~~~~~~~~~~~~~~~---~e 452 (452) T protein:vir:36 406 -SVISVIP----DVQAEMEKIKKEEASTAIFDKD---------KQPSEKGTDTVVSETN---EE 452 (452) T ss_pred -HhCCCCC----CHHHHHHHHHHHHHHHHHHHhh---------ccCCCCcccccCcccc---CC Confidence 2233211 1356776665544322111110 0000100000000001 11 No 81 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=98.89 E-value=1.9e-08 Score=62.98 Aligned_cols=443 Identities=10% Similarity=0.028 Sum_probs=206.5 Q ss_pred CCCcc-----ccccHHHHHHHHHHHHHH-hhhHHHHHHHHHHHh--ccccc-CCCCC-----cccccccccccchHHHHH Q lcl|NC_011045. 1 MAEKR-----TGLAEEGAKSVYERLKND-RAPYETRAQNCAQYT--IPSLF-PKDSD-----NASTDYVTPWQAVGARGL 66 (536) Q Consensus 1 Ma~~~-----~~~~~~~~~~r~~~l~~~-R~~~e~~w~e~~~~~--~P~~~-~~~~~-----~~~~~~~~~~dst~~~a~ 66 (536) |-... .......+.+..+.+... |-+...+++++++-- ++.+- ...+. ...+...++..+-+...+ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~Iv 86 (479) T protein:vir:79 7 SETDLIKVQLKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINNYHKLLV 86 (479) T ss_pred cccceEeeccccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecchHHHHH Confidence 22110 111233455555555443 333334444444311 12110 00110 111122355666677777 Q ss_pred HHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEE Q lcl|NC_011045. 67 NNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLY 146 (536) Q Consensus 67 ~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~ 146 (536) +..++.|++- | + +++..+.. 
+++.+ ..+..++|.....++.++..+||.++++ T Consensus 87 d~~~~~l~g~--p--~--~~~~~~~~-------------~~~~~--------~~~~~n~~~~~~~~~~~~~~~~G~~~~~ 139 (479) T protein:vir:79 87 DQKVGYSVGN--P--I--VFNADDDN-------------LTKLL--------NDLLGEEFDDTITELYLNASNKGVEWLH 139 (479) T ss_pred HHHHhhhhcC--C--c--eeccCCHH-------------HHHHH--------HHHHhcCHHHHHHHHHHHHHhcCeEEEE Confidence 7777666431 2 1 22333321 22222 2233478999999999999999999887 Q ss_pred EecCCCCceeeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEE---EEecC Q lcl|NC_011045. 147 LPEPEGSNYNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTH---IYLDE 221 (536) Q Consensus 147 ~~~~~~~~~~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~---v~p~~ 221 (536) +..+.+ +.++++.++.-+++...|. .+++...+|.+...-. +.+....+++|+- .+.+. T Consensus 140 v~~d~~-~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~~---------------~~~~~~~~e~y~~~~i~~~~~ 203 (479) T protein:vir:79 140 PYINRK-GEFKYVIIPAEEAIPIWDSKRQRELVAFIRFYYIEDI---------------DGNKIKRVEYYTENDITYFIE 203 (479) T ss_pred EEeCCC-CceEEEEEccceeEEEEeCCCCCceEEEEEEEEEeec---------------CCceEEEEEEEeCCcEEEEEe Confidence 766554 3467888888887777664 4566666665554310 0111122333211 00111 Q ss_pred CCCceeE---EE------EecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 222 DSGEYIR---YE------EVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMI 292 (536) Q Consensus 222 ~~~~~~~---~~------~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~ 292 (536) .++.+.. +. ..+..........++|..+|++.++- +.+|+|-.+...+-+..++.+.-......+. 
T Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~g~sd~~~v~~liDa~d~~~S~~~~~~~~ 278 (479) T protein:vir:79 204 RGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKN-----NEKCVSDLTFYKSLIDIYDNNISTLADNLDE 278 (479) T ss_pred cCCcccccccccccccccccccccccccccccCCCcccEEEecC-----CCCCCcchhhhHHHHHHHHHHHHHHHHHHHH Confidence 1111110 00 01111111223456678899987754 4679999999999999999998889999999 Q ss_pred HhCCceeeccccccchhhhc-cCCCcceec-CCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCC Q lcl|NC_011045. 293 SSKVIGLVNPAGITQPRRLT-KAQTGDFVT-GRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVT 370 (536) Q Consensus 293 a~~p~~lv~~~g~~~~~~~~-~~~~g~~~~-g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~T 370 (536) ..+|.+++.-.......+.. ....+.++. ...+++..+ ....+.......++.++..|...-..-....-.....| T Consensus 279 ~~~~~~v~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~S 356 (479) T protein:vir:79 279 IQEVIYVLKEYPGTSLQEFIDNIRYYKSIKVDGGGGVDKL--EINIPVEAKKELLDRLEKNIIIFGQGVNPESQNTGDKS 356 (479) T ss_pred hhCceeeeecCCccccccchhhhhhccceecCCCCcceEE--eccCCHHHHHHHHHHHHHHHHHHhCccccccccccchh Confidence 99998776432111111211 122223332 223344433 33456777788888888777543322111111222346 Q ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 371 AEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWA 450 (536) Q Consensus 371 AtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~ 450 (536) ++.+..+..-+ .....-..+.-.+.+..+++-++.++...+. ..+....++|+|.-.+..- ..+.++.+... 
T Consensus 357 g~Ai~~~~~~l-~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~-~~~~~~~i~i~f~~~~p~~-~~~~a~~~~kl----- 428 (479) T protein:vir:79 357 GVALKFLYSLL-DLKCSKTEKKFKKAIRELLWFVCEYLKISGN-KSYDYKTVQITFNHSMIIN-EAEKIDMAAKS----- 428 (479) T ss_pred HHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-CccccccceEEeCCCCCcC-HHHHHHHHHHH----- Confidence 65554432222 2222333333334444444444444433332 2234446777775554331 11222222221 Q ss_pred hhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011045. 451 ALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQA 516 (536) Q Consensus 451 ~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~ 516 (536) .+ .+....++.. ++. +--.++|++++.+++..+.+..+.......+...+. T Consensus 429 --~g-----~iS~et~l~~----l~~----v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~e~ 479 (479) T protein:vir:79 429 --TG-----IVSDETIVSN----HPW----VEDVNDELERLKKQEDTQKEYDDLIPNNQDGVIDET 479 (479) T ss_pred --hc-----cCcHHHHHHh----CCC----CCCHHHHHHHHHHHHHHHHHHHhccCcccCCCcCcC Confidence 11 1223333322 222 112356776666555433322222111111111111 No 82 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=98.89 E-value=2e-08 Score=62.81 Aligned_cols=444 Identities=11% Similarity=0.051 Sum_probs=210.1 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc---cCCCC--CcccccccccccchHHHHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL---FPKDS--DNASTDYVTPWQAVGARGLNNLASKLML 75 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~---~~~~~--~~~~~~~~~~~dst~~~a~~~Laa~l~~ 75 (536) | +....++.+.+.+..++.+..| .++|+++.+|....- ..... 
....+...++-.+.+...++..++.|++ T Consensus 16 ~-~~~~~l~~~~i~~li~~~~~~~---~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~G 91 (506) T protein:vir:94 16 Q-ESLENLTPNKIMKFITHHFNYQ---RPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAKYIADFQTSYSVG 91 (506) T ss_pred c-cchhcCCHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCccccccccccccccCCcceeecchHHHHHHHhhhhhcc Confidence 5 4466788888877777766655 446777777765432 11111 1111223455566677777777766543 Q ss_pred hhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCce Q lcl|NC_011045. 76 ALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) Q Consensus 76 ~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~ 155 (536) - | + .+...+.. .+ ..+...+..++|.....++.++..++|.+.+++..+.. +. T Consensus 92 ~--p--~--~~~~~d~~-------------~~-------~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded-~~ 144 (506) T protein:vir:94 92 N--P--I--NVKLPDDG-------------SN-------SGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGED-NE 144 (506) T ss_pred c--C--c--eeecCcch-------------HH-------HHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCC-Ce Confidence 1 2 1 22232211 11 12333445689999999999999999999987776554 44 Q ss_pred eeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEec Q lcl|NC_011045. 156 NPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVE 233 (536) Q Consensus 156 ~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~ 233 (536) +++..++..+.++..|. .+++...+|.+..... ..+.......++.+|.. ..+..|.... 
T Consensus 145 ~~i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~---------------~~~~~~~~~~~~~~yt~---~~~~~~~~~~ 206 (506) T protein:vir:94 145 EHLAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELV---------------DDNQVSTINYVPETWTA---DTYTLYNPTP 206 (506) T ss_pred eEEEEEcccceEEEecCCCCCceEEEEEEEeeeec---------------cCCceeEEEEEEEEEeC---ceEEEecccc Confidence 67788887777776664 4556555555543211 11111112222222211 1222222211 Q ss_pred -CccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchh--h Q lcl|NC_011045. 234 -GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPR--R 310 (536) Q Consensus 234 -g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~--~ 310 (536) +..+ .....+++..+|++.++= +..|.|-.+...+-+-.++.+.-..+...+...+|.+++.-....... + T Consensus 207 ~~~~~-~~~~~~~~g~vPvv~~~n-----~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~ 280 (506) T protein:vir:94 207 IMGKM-QVDTTKPITTFPVVEFKN-----SNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSD 280 (506) T ss_pred Cccce-eccccccCCccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchh Confidence 1122 223346677889877643 345888889999999999998888888888888887665321110000 0 Q ss_pred h-------------------------------ccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHh-h Q lcl|NC_011045. 311 L-------------------------------TKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFM-L 358 (536) Q Consensus 311 ~-------------------------------~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~ 358 (536) . .....+....+...+..+..+....+.+.....++.+.+.|-..-. . 
T Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p 360 (506) T protein:vir:94 281 MMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTP 360 (506) T ss_pred ccccccccccccccccccchhHHHhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCcc Confidence 0 0000011111112222233344445667777777777776643211 1 Q ss_pred hhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHH--HHHH Q lcl|NC_011045. 359 NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEA--IGRG 436 (536) Q Consensus 359 ~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~--a~r~ 436 (536) +......+...|+..+..+..-+.. -.....+.-.+.+..+++-++.++...+....+....++|.|.-++.. +..+ T Consensus 361 ~~~~~~~~~n~Sg~Aik~~~~~l~~-k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a 439 (506) T protein:vir:94 361 DLTDENFASNSSGVAMQYKVLGTVE-LASTKRRMFERGLYARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQI 439 (506) T ss_pred ccccccccccchHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHH Confidence 1111111234566665544332222 222333444444555555555555433332233344567776544432 2222 Q ss_pred HHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011045. 437 QDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQA 516 (536) Q Consensus 437 ~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~ 516 (536) .-+.++ +++ |....++..+ -+|+ -.++|++++.++++....... +. T Consensus 440 ~~~~kl-------~g~--------iS~et~~~~l---p~v~-----d~~~E~~ri~~E~~~~~~~~~-----------~~ 485 (506) T protein:vir:94 440 KALVQA-------GAT--------LPQKYLYQQL---PGVT-----NPQDIVDMMKEQSANGDYSFD-----------QN 485 (506) T ss_pred HHHHHH-------hcc--------CChHHHHHhC---CCCC-----CHHHHHHHHHHHHHHHhhcch-----------hh Confidence 221111 111 2223333321 1221 124566666554432211110 00 Q ss_pred hcCcch--HHhhhhcCCCCCCC Q lcl|NC_011045. 
517 TASPEA--MAAAADSVGLQPGI 536 (536) Q Consensus 517 ~~~~~~--~~~~~~~~~~q~~~ 536 (536) ...+.. ...+.+ .-..=| T Consensus 486 ~~~~~~~~~~~~~~--~~~~e~ 505 (506) T protein:vir:94 486 GVISNDGQTNTTAT--QTDEEV 505 (506) T ss_pred cCCCcccCcccccc--ccccCC Confidence 011100 111100 001122 No 83 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=98.86 E-value=2.6e-08 Score=62.16 Aligned_cols=434 Identities=9% Similarity=0.016 Sum_probs=211.0 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc---cCCC----C------CcccccccccccchHHHHHHHHHHHHH Q lcl|NC_011045. 8 LAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL---FPKD----S------DNASTDYVTPWQAVGARGLNNLASKLM 74 (536) Q Consensus 8 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~---~~~~----~------~~~~~~~~~~~dst~~~a~~~Laa~l~ 74 (536) |..+++++.-+.+...++....+++.+.+|..-.- .... . ....+...++..+-+...++..++.|+ T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 80 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVA 80 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhhee Confidence 88899998889899888888889999999876531 0100 0 001112234555555555555554433 Q ss_pred HhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCc Q lcl|NC_011045. 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSN 154 (536) Q Consensus 75 ~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~ 154 (536) + -| + .+...+.. ....+++|+. .+|...+.++.++..++|.|.+++..+.. 
+ T Consensus 81 G--~p--~--~~~~~d~~---------~~~~l~~~~~------------~~~~~~~~~l~~~~~~~G~a~~~~y~d~~-~ 132 (470) T protein:vir:10 81 S--VF--P--DIDVGKDA---------DNKKIIDVLG------------DDRALTLNGLLVDSSNAGRAWLHYWIDED-G 132 (470) T ss_pred c--cc--e--eeecCchH---------HHHHHHHHHh------------hhHHHHHHHHHHHHhhcCeeEEEEEecCC-C Confidence 2 12 1 23333321 1122333332 36777888888999999999887766554 3 Q ss_pred eeeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEE---EEecCCCCcee-- Q lcl|NC_011045. 155 YNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTH---IYLDEDSGEYI-- 227 (536) Q Consensus 155 ~~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~---v~p~~~~~~~~-- 227 (536) -+++..++..+.++..|. .|++..++|.+...-.+ .......+++|+. .+.+.++.... T Consensus 133 ~~~~~~~~p~~~~~v~d~~~~~~~~a~ir~y~~~~~~--------------~~~~~~~~e~yt~~~~~~~~~~~~~~~~~ 198 (470) T protein:vir:10 133 NFRYGIIQPDQITPIYATTLDNKLLGILRSYKQLDPD--------------SGKYFTVHEYWTDKEAQFFRTNATDSTVI 198 (470) T ss_pred ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeecC--------------CceEEEEEEEEcCCcEEEEEeecCcceec Confidence 467888888887777764 36776666555442000 0011112333220 00011111110 Q ss_pred -EE---EEe------cCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_011045. 228 -RY---EEV------EGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVI 297 (536) Q Consensus 228 -~~---~~v------~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~ 297 (536) ++ ... ++..+ ....++|..+|++.++= +.+|.|=.+...+.+..++.+.-......+...+|. 
T Consensus 199 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~ 271 (470) T protein:vir:10 199 EPYNIITSYDLSAGYETGQS--NTLKHNFGRVPFIEFSK-----NKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVI 271 (470) T ss_pred cccccccccccccccccccc--cccccCCCeeeEEEeec-----CCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcc Confidence 00 000 11111 11234466777776653 468999999999999999999999999999999999 Q ss_pred eeeccccccchhhhcc-CC-Cccee-cCC--cccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHH Q lcl|NC_011045. 298 GLVNPAGITQPRRLTK-AQ-TGDFV-TGR--PEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAE 372 (536) Q Consensus 298 ~lv~~~g~~~~~~~~~-~~-~g~~~-~g~--~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAt 372 (536) +++.-.+..+..++.. .. .|.+. +.. .....+..+....+.......++.+++.|-+.-..-.+..-.....|+. T Consensus 272 lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~gn~Sg~ 351 (470) T protein:vir:10 272 LVLTNYGGADLHQFMNDLRKYKSIKINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDPANFESSNASGV 351 (470) T ss_pred eeeecCCccccchhhhhhhhcCeEeccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCCCccccccchHH Confidence 8886433333222222 11 12222 211 1122333445555677788888888887754222111111122234555 Q ss_pred HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011045. 373 EIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAAL 452 (536) Q Consensus 373 Ei~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~ 452 (536) .+..+-.-+... ..+.... +.+.+.+++.++.+.=-+-..+...++++|.-++-.-.. 
+.+ +.+..+ T Consensus 352 Alk~~~~~l~~k----~~~~~~~-~~~~l~~~~~~i~~~l~~~~~d~~~i~i~f~~~~p~d~~-e~~-------~~~~~~ 418 (470) T protein:vir:10 352 AIKMLYSHLELK----AAKTQTY-FEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSL-TKA-------QIVSTV 418 (470) T ss_pred HHHHHHHHHHHH----HHHHHHH-HHHHHHHHHHHHHHHhcccCcccceeeEEeccCCCCCHH-HHH-------HHHHHH Confidence 554432222221 2222222 233344444444331111123344677777655443211 111 111111 Q ss_pred cchhhhhcCCHHHHHHHHHHHcCCChhhccC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCC Q lcl|NC_011045. 453 APMRDDPDINLAMIKLRIANAIGIDTSGILL-TEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVG 531 (536) Q Consensus 453 ~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~r-s~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~ 531 (536) ++ .+.-..++.. ++ ++. .++|++++.+++++.+...+... .....+ ..+ T Consensus 419 ~g-----~iS~et~l~~----~p-----~v~D~~~E~eri~~E~~e~~~~~~~~~--------~~~~~~-----~dd--- 468 (470) T protein:vir:10 419 AN-----YSSKEAVAKA----NP-----IVDDWQQELKDLAKDKEENDPYSNQAD--------ELNGKG-----VND--- 468 (470) T ss_pred hc-----cCcHHHHHHh----CC-----CCCCHHHHHHHHHHHHHHHHHhhcccc--------ccCCCC-----CCC--- Confidence 11 1222333322 22 122 35667666655443322221111 000000 011 Q ss_pred CC Q lcl|NC_011045. 532 LQ 533 (536) Q Consensus 532 ~q 533 (536) -| T Consensus 469 e~ 470 (470) T protein:vir:10 469 EQ 470 (470) T ss_pred CC Confidence 11 No 84 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=98.84 E-value=3e-08 Score=61.83 Aligned_cols=453 Identities=11% Similarity=0.002 Sum_probs=204.3 Q ss_pred CC--CccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc---ccCCCC-CcccccccccccchHHHHHHHHHHHHH Q lcl|NC_011045. 1 MA--EKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPS---LFPKDS-DNASTDYVTPWQAVGARGLNNLASKLM 74 (536) Q Consensus 1 Ma--~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~---~~~~~~-~~~~~~~~~~~dst~~~a~~~Laa~l~ 74 (536) |. ++.+..+.+.+.+..++....+.+ +++++.+|..-. ...... 
....+...++..+.+...++..++.|+ T Consensus 31 ~~~~~~~~~~~~~~i~~~i~~~~~~~~~---r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:10 31 YDGTESDLLQNVNEVSKCIEHHMDYQRP---RLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred CchhhhhcccCHHHHHHHHHHHHHhhHH---HHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhc Confidence 44 344545566676666665555433 445555555332 111111 111122345556667777776665543 Q ss_pred HhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCc Q lcl|NC_011045. 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSN 154 (536) Q Consensus 75 ~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~ 154 (536) + ..+ +++..+.. +. ..+...+..++|.....++.+++.+||.|..++..+.. + T Consensus 108 g----~p~--~~~~~d~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded-g 160 (511) T protein:vir:10 108 G----NPI--QYQDDDKD-------------VL-------EAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQD-D 160 (511) T ss_pred c----cCc--eeecCchH-------------HH-------HHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCC-C Confidence 2 111 22333221 11 22334445588999999999999999999887765543 3 Q ss_pred eeeEEEEecceEEEeeCCC--CCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEe Q lcl|NC_011045. 155 YNPMKLYRLSSYVVQRDAF--GNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEV 232 (536) Q Consensus 155 ~~~~~~~~l~~~~v~~d~~--G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v 232 (536) .++++.++..+.++..|.. +++...+|.+.....+ ....+.-..+++|+. 
+ ....|..- T Consensus 161 ~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d------------~~~~~~~~~~~iyt~------~-~i~~~~~~ 221 (511) T protein:vir:10 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID------------KTDEDEVFTVDLFTS------H-GVYRYLTS 221 (511) T ss_pred ceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc------------cCccceEEEEEEEeC------C-cEEEEEec Confidence 4577888777777766643 4565555555432110 001111122333321 1 11111111 Q ss_pred cCc-----cccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccc Q lcl|NC_011045. 233 EGM-----EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQ 307 (536) Q Consensus 233 ~g~-----~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~ 307 (536) .+. ........+++..+|++.++- +.+|.|-.+..++-+..++...-......+...+|.+.+.-....+ T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~vPvv~f~n-----n~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~ 296 (511) T protein:vir:10 222 RTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD 296 (511) T ss_pred CCCcccccccccccccccCcceeEEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCC Confidence 111 011123446677889887653 3579999999999999999888888888888888887654322223 Q ss_pred hhhhccCCCccee--------c----CCcccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhcccCCCCCCCHHHH Q lcl|NC_011045. 308 PRRLTKAQTGDFV--------T----GRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRTGERVTAEEI 374 (536) Q Consensus 308 ~~~~~~~~~g~~~--------~----g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~~~r~TAtEi 374 (536) ..++.....+.++ . +..++..+..+....+.......++.++..|...-.. 
+.....-+...|+..+ T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al 376 (511) T protein:vir:10 297 PVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAM 376 (511) T ss_pred chhhccchhccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHH Confidence 3222211111111 1 1111222333444456666777777777766432211 1111111234466665 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCC-CCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 375 RYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPE-LPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALA 453 (536) Q Consensus 375 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~-~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~ 453 (536) ...-. ..........++-.+.+.-++.-++.++...+.... ..-..+++.|.-++..-. .+.++.+....+ .+ T Consensus 377 ~~~~~-~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~-~~~~~~~~kl~G---~i- 450 (511) T protein:vir:10 377 KYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSL-IEELKAYIDSGG---KI- 450 (511) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccccceeeEEeCCCCCcCH-HHHHHHHHHHhc---cC- Confidence 54422 222222222333333333333333333333332211 112356777755443321 112222222211 12 Q ss_pred chhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcC-cchHHhhhhcCCC Q lcl|NC_011045. 454 PMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATAS-PEAMAAAADSVGL 532 (536) Q Consensus 454 p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 532 (536) ....++.. ++..+ ..++|++++.++++.+....+... .+ .....+ .+.-.+......- T Consensus 451 --------S~et~~~~----l~~v~----d~~~E~~ri~~E~~~~~~~~~~~~--~~---~~~~~~~~~~~~~~~~~~~~ 509 (511) T protein:vir:10 451 --------SQTTLMSL----FSFFQ----DPELEVKKIEEDEKESIKKAQKGI--YK---DPRDINDDEQDDDTKDTVDK 509 (511) T ss_pred --------cHHHHHHh----CCCCC----CHHHHHHHHHHHHHHHHHHHhhhc--cc---CCCCCCCCCCCCcccCcccc Confidence 22222222 22101 235677776665543322221111 00 000000 0000000000000 Q ss_pred CC Q lcl|NC_011045. 
533 QP 534 (536) Q Consensus 533 q~ 534 (536) .. T Consensus 510 ~~ 511 (511) T protein:vir:10 510 KE 511 (511) T ss_pred cC Confidence 00 No 85 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=98.84 E-value=3e-08 Score=61.80 Aligned_cols=444 Identities=10% Similarity=0.010 Sum_probs=208.2 Q ss_pred CC------C-ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhc--cccc--------CCC-CC----cccccccccc Q lcl|NC_011045. 1 MA------E-KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTI--PSLF--------PKD-SD----NASTDYVTPW 58 (536) Q Consensus 1 Ma------~-~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~--P~~~--------~~~-~~----~~~~~~~~~~ 58 (536) |. + +.+.++.+.+++..+.-+..|..+...++-+-.+.. +.+. ... +. ...+...++. T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 32 2 555677888888777777766665554443332221 1110 000 00 0011123455 Q ss_pred cchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHH Q lcl|NC_011045. 59 QAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLV 138 (536) Q Consensus 59 dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~ 138 (536) .+-+...+++.++.|++- |. .+...+. .+...+++++|. +.+..++|.....++.++.. 
T Consensus 81 ~n~~~~ivd~~~~yl~g~--pv----~~~~~~~--------~~~~e~~~~~l~-------~~~~~n~~~~~~~~~~~~~~ 139 (474) T protein:vir:10 81 NSFDSEIVDTRVGYLHGV--PV----TYDLDEN--------AEKNEKLKKFIT-------NFAIRNSVDDEDSEIGKMAA 139 (474) T ss_pred cchHHHHHHhHhhheecc--ce----eEeeCCC--------CcchHHHHHHHH-------HHHhhcCHhHHHHHHHHHHh Confidence 555666666665544321 32 2222211 011122333333 34445889999999999999 Q ss_pred hhCcEEEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEE Q lcl|NC_011045. 139 VAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIY 218 (536) Q Consensus 139 ~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~ 218 (536) +||.|.+++..+.. +.++++.++..+.++..|..+.....+|.+.... ...+.....+++|+ T Consensus 140 ~~G~a~~~~~~d~~-~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~--------------~~~~~~~~~~~~y~--- 201 (474) T protein:vir:10 140 ICGYGARLAYIDTN-GDIRIKNIDPYNVIFVGDNILEPTYSLRYFYEKD--------------DDNGTDYVYAEFYD--- 201 (474) T ss_pred hcCeEEEEEEeCCC-CeeEEEEEcccceEEEEcCCCceEEEEEEEEEee--------------CCCceEEEEEEEEc--- Confidence 99999988765544 3467888888887777787777655554443210 00011111122221 Q ss_pred ecCCCCceeEEEEecCcc--ccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_011045. 219 LDEDSGEYIRYEEVEGME--VQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKV 296 (536) Q Consensus 219 p~~~~~~~~~~~~v~g~~--i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p 296 (536) ...+..|. .++.. 
.......+++..+|++.++ .+.+|.|=.+...+-+..++.+.-......+....| T Consensus 202 ----~~~~~~~~-~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~ 271 (474) T protein:vir:10 202 ----NAYYYVFR-GEGIDALQEVGRYEHLFDYNPLFGVP-----NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLA 271 (474) T ss_pred ----CceEEEEe-ecCCCcccccccccCCCCccceEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 11222221 11111 1112234557788887654 456899999999999999999999999999999999 Q ss_pred ceeeccccccchhhhccCC-Cccee-cCCcccccccccccccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCCCCHHH Q lcl|NC_011045. 297 IGLVNPAGITQPRRLTKAQ-TGDFV-TGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFM-LNSAVQRTGERVTAEE 373 (536) Q Consensus 297 ~~lv~~~g~~~~~~~~~~~-~g~~~-~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~r~TAtE 373 (536) .+.+.-.+ +..+...+.. .|.+. .+..+++. .+....+.......++.+++.|...-. .+.....-+...|+.. T Consensus 272 ~l~i~g~~-~~~~~~~~~~~~~~i~~~~~~~~~~--~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A 348 (474) T protein:vir:10 272 YLVLRGMG-MSEEMIQETQKSGAFELFDKDMDVK--YLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIG 348 (474) T ss_pred hhhhccCC-CCchhhhhhhhcceeEecCCCCcee--EEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHH Confidence 87764221 1222222222 23332 23333333 344444566777778888777754221 1111111123446665 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCC-CCCCCcceEEEEechHH--HHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQI-PELPKEAVEPTISTGLE--AIGRGQDLDKLERCVAAWA 450 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~l-p~~~~~~v~v~~vs~La--~a~r~~~~~~l~~~~~~~~ 450 (536) +..+-.-+ .+......+.-.+.+.-+++-++.++...+.- .+..-.++++.|.-++. .+..++.+.++ . 
+ T Consensus 349 l~~~~~~l-~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl---~---g 421 (474) T protein:vir:10 349 MKLKLMAL-ENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL---K---G 421 (474) T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH---h---c Confidence 55432221 22222233333333333444444444333321 12222356777754433 23322222221 1 1 Q ss_pred hhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcC Q lcl|NC_011045. 451 ALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSV 530 (536) Q Consensus 451 ~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (536) . +....+++. ++..+ -.++|++.+.+++...++.... . ..+.....+...+.. T Consensus 422 ~---------iS~et~~~~----l~~v~----d~~~E~eri~~E~~e~~~~~~~---~-------~~~~~~~~~~~~~s~ 474 (474) T protein:vir:10 422 Q---------VSERTRLGQ----SQLVD----DVDYELDEMEKESLEFNDKLPD---I-------DEGDANDKSQNNQSE 474 (474) T ss_pred c---------CchHHHHHh----CCCCC----CHHHHHHHHHHHHHHHHhhccc---c-------cCCCcCCCCccccCC Confidence 1 122222222 22101 2346666665444322211100 0 000111111111111 No 86 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=98.84 E-value=3e-08 Score=61.80 Aligned_cols=444 Identities=10% Similarity=0.010 Sum_probs=208.2 Q ss_pred CC------C-ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhc--cccc--------CCC-CC----cccccccccc Q lcl|NC_011045. 1 MA------E-KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTI--PSLF--------PKD-SD----NASTDYVTPW 58 (536) Q Consensus 1 Ma------~-~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~--P~~~--------~~~-~~----~~~~~~~~~~ 58 (536) |. + +.+.++.+.+++..+.-+..|..+...++-+-.+.. +.+. ... +. ...+...++. 
T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 32 2 555677888888777777766665554443332221 1110 000 00 0011123455 Q ss_pred cchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHH Q lcl|NC_011045. 59 QAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLV 138 (536) Q Consensus 59 dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~ 138 (536) .+-+...+++.++.|++- |. .+...+. .+...+++++|. +.+..++|.....++.++.. T Consensus 81 ~n~~~~ivd~~~~yl~g~--pv----~~~~~~~--------~~~~e~~~~~l~-------~~~~~n~~~~~~~~~~~~~~ 139 (474) T protein:vir:94 81 NSFDSEIVDTRVGYLHGV--PV----TYDLDEN--------AEKNEKLKKFIT-------NFAIRNSVDDEDSEIGKMAA 139 (474) T ss_pred cchHHHHHHhHhhheecc--ce----eEeeCCC--------CcchHHHHHHHH-------HHHhhcCHhHHHHHHHHHHh Confidence 555666666665544321 32 2222211 011122333333 34445889999999999999 Q ss_pred hhCcEEEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEE Q lcl|NC_011045. 139 VAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIY 218 (536) Q Consensus 139 ~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~ 218 (536) +||.|.+++..+.. +.++++.++..+.++..|..+.....+|.+.... 
...+.....+++|+ T Consensus 140 ~~G~a~~~~~~d~~-~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~--------------~~~~~~~~~~~~y~--- 201 (474) T protein:vir:94 140 ICGYGARLAYIDTN-GDIRIKNIDPYNVIFVGDNILEPTYSLRYFYEKD--------------DDNGTDYVYAEFYD--- 201 (474) T ss_pred hcCeEEEEEEeCCC-CeeEEEEEcccceEEEEcCCCceEEEEEEEEEee--------------CCCceEEEEEEEEc--- Confidence 99999988765544 3467888888887777787777655554443210 00011111122221 Q ss_pred ecCCCCceeEEEEecCcc--ccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_011045. 219 LDEDSGEYIRYEEVEGME--VQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKV 296 (536) Q Consensus 219 p~~~~~~~~~~~~v~g~~--i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p 296 (536) ...+..|. .++.. .......+++..+|++.++ .+.+|.|=.+...+-+..++.+.-......+....| T Consensus 202 ----~~~~~~~~-~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~ 271 (474) T protein:vir:94 202 ----NAYYYVFR-GEGIDALQEVGRYEHLFDYNPLFGVP-----NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLA 271 (474) T ss_pred ----CceEEEEe-ecCCCcccccccccCCCCccceEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 11222221 11111 1112234557788887654 456899999999999999999999999999999999 Q ss_pred ceeeccccccchhhhccCC-Cccee-cCCcccccccccccccchhHHHHHHHHHHHHHHHHHh-hhhcccCCCCCCCHHH Q lcl|NC_011045. 297 IGLVNPAGITQPRRLTKAQ-TGDFV-TGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFM-LNSAVQRTGERVTAEE 373 (536) Q Consensus 297 ~~lv~~~g~~~~~~~~~~~-~g~~~-~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~-~~~~~~~~~~r~TAtE 373 (536) .+.+.-.+ +..+...+.. .|.+. .+..+++. .+....+.......++.+++.|...-. .+.....-+...|+.. 
T Consensus 272 ~l~i~g~~-~~~~~~~~~~~~~~i~~~~~~~~~~--~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A 348 (474) T protein:vir:94 272 YLVLRGMG-MSEEMIQETQKSGAFELFDKDMDVK--YLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIG 348 (474) T ss_pred hhhhccCC-CCchhhhhhhhcceeEecCCCCcee--EEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHH Confidence 87764221 1222222222 23332 23333333 344444566777778888777754221 1111111123446665 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCC-CCCCCcceEEEEechHH--HHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQI-PELPKEAVEPTISTGLE--AIGRGQDLDKLERCVAAWA 450 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~l-p~~~~~~v~v~~vs~La--~a~r~~~~~~l~~~~~~~~ 450 (536) +..+-.-+ .+......+.-.+.+.-+++-++.++...+.- .+..-.++++.|.-++. .+..++.+.++ . + T Consensus 349 l~~~~~~l-~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl---~---g 421 (474) T protein:vir:94 349 MKLKLMAL-ENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL---K---G 421 (474) T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH---h---c Confidence 55432221 22222233333333333444444444333321 12222356777754433 23322222221 1 1 Q ss_pred hhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcC Q lcl|NC_011045. 451 ALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSV 530 (536) Q Consensus 451 ~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (536) . +....+++. ++..+ -.++|++.+.+++...++.... . ..+.....+...+.. 
T Consensus 422 ~---------iS~et~~~~----l~~v~----d~~~E~eri~~E~~e~~~~~~~---~-------~~~~~~~~~~~~~s~ 474 (474) T protein:vir:94 422 Q---------VSERTRLGQ----SQLVD----DVDYELDEMEKESLEFNDKLPD---I-------DEGDANDKSQNNQSE 474 (474) T ss_pred c---------CchHHHHHh----CCCCC----CHHHHHHHHHHHHHHHHhhccc---c-------cCCCcCCCCccccCC Confidence 1 122222222 22101 2346666665444322211100 0 000111111111111 No 87 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=98.83 E-value=3.3e-08 Score=61.57 Aligned_cols=431 Identities=8% Similarity=-0.022 Sum_probs=201.7 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc------cCCCC-----------CcccccccccccchHHHHHHHHH Q lcl|NC_011045. 8 LAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL------FPKDS-----------DNASTDYVTPWQAVGARGLNNLA 70 (536) Q Consensus 8 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~------~~~~~-----------~~~~~~~~~~~dst~~~a~~~La 70 (536) |+.+.+.+....+....+...+++.++.+|..-.- ...+. +.......++..+-+...++..+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 88888888888887776666677778877774320 00000 00011122455555666666555 Q ss_pred HHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecC Q lcl|NC_011045. 71 SKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEP 150 (536) Q Consensus 71 a~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~ 150 (536) +.|.+ -|. .+...+... 
...++.| + .++|.....++.++..++|.|.+++-.+ T Consensus 81 ~yl~G--~p~----~~~~~~~~~---------~~~l~~~-----------~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d 133 (471) T protein:vir:10 81 AYALT--YPP----TFDVDDKKV---------NDMIVDV-----------L-GDDYERISKQLCVNAGNAGIAWLHVWKD 133 (471) T ss_pred hhhcc--cCc----eeccCChHH---------HHHHHHH-----------H-hcCHHHHHHHHHHHHhhCCeEEEEEEee Confidence 44432 131 223332211 1122222 2 3688889999999999999998776555 Q ss_pred CCCceeeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEE------EEEEecCC Q lcl|NC_011045. 151 EGSNYNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVY------THIYLDED 222 (536) Q Consensus 151 ~~~~~~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~------~~v~p~~~ 222 (536) ...+.+++.+++..+.++..|. .+++...+|.+...... ..+....+++| +.+.-.. T Consensus 134 ~~~g~~~~~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~--------------~~~~~~~~~vy~~~~~~~y~~~~~- 198 (471) T protein:vir:10 134 ASDNSFRYACVDSKEVIPIYSKSLDKKSIGVLRVYSSIDET--------------DGKNYTVYEYWNDKECSFYRHEKE- 198 (471) T ss_pred CCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEeeccC--------------CCceeEEEEEEeCCcEEEEEecCC- Confidence 4444567888888887777665 34566665555432110 11112223332 1111110 Q ss_pred CCceeE--------EEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011045. 223 SGEYIR--------YEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISS 294 (536) Q Consensus 223 ~~~~~~--------~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~ 294 (536) +..... ..-.+|.......-.++|..+|++.++. +.+|.|-.+...+-+-.++.+.-......+... 
T Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~ 273 (471) T protein:vir:10 199 KPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFKN-----NEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQ 273 (471) T ss_pred cccccccccccccccccccccccccccccCCCCceeEEEecc-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 000000 0011222222222345577888887755 457899999999999999999989999999999 Q ss_pred CCceeeccccccchhhhc-cCC-Cccee-cC--CcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCC Q lcl|NC_011045. 295 KVIGLVNPAGITQPRRLT-KAQ-TGDFV-TG--RPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERV 369 (536) Q Consensus 295 ~p~~lv~~~g~~~~~~~~-~~~-~g~~~-~g--~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~ 369 (536) +|.+.+.-.......+.. ... .+.+. ++ ...+..+..+....+.......++.+++.|-..-..-....-..... T Consensus 274 ~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~gn~ 353 (471) T protein:vir:10 274 EVIFVLTNYGGQDKQEFLEDLKRYKMIKMDNDGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDKLGNS 353 (471) T ss_pred CceeeeecCCccccchhHHHhhcCCeEEecCCCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCcccccCc Confidence 998766422111111111 111 22222 11 11222333344445677777888888777744321111111111233 Q ss_pred CHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 370 TAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-QQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAA 448 (536) Q Consensus 370 TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~ 448 (536) |+.-+..+..- |---..+.... +...+.+++.++... |. ....+++|+|.-.+-.-. .+.++.+.. 
T Consensus 354 Sg~Alk~~~~~----l~~k~~~~~~~-~~~~l~~~~~li~~~~~~---~d~~~i~i~f~~~~p~n~-~e~~~~~~k---- 420 (471) T protein:vir:10 354 SGVALKFLYSL----LELKAGNMETQ-FRSGYATLVKMILKHLGL---SDKLKIKQTWTRNSINND-TEMAQVVST---- 420 (471) T ss_pred cHHHHHHHHHH----HHHHHHHHHHH-HHHHHHHHHHHHHHHhcc---CCCceeEEEeCCCCCCCH-HHHHHHHHH---- Confidence 44333322111 11112222222 222334444443321 11 223457777754443211 111111111 Q ss_pred HHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchH Q lcl|NC_011045. 449 WAALAPMRDDPDINLAMIKLRIANAIGIDTSGILL-TEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAM 523 (536) Q Consensus 449 ~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~r-s~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~ 523 (536) ++. .|.-..++.. ++ .+. .++|++++.++++++++... . .......++.- T Consensus 421 l~g--------~iS~et~~~~----~p-----~v~D~~~E~eri~~E~~~~~~~~~----~----~~~~~~~~e~~ 471 (471) T protein:vir:10 421 LAT--------ITSRENVAKS----NP-----IVEDWQDELRLQKAEQEGRSEKLY----D----MEEVEHESEVE 471 (471) T ss_pred Hhc--------cCchHHHHHh----CC-----CCCCHHHHHHHHHHHHHHHHhccc----c----cCCCCCccccC Confidence 111 1222222222 21 122 25666666555443222111 0 01111111111 No 88 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.83 E-value=3.4e-08 Score=61.51 Aligned_cols=441 Identities=11% Similarity=0.074 Sum_probs=191.0 Q ss_pred CCCccccccHHHHHHHHHH-----------------HHHHhhhHHHHHHHHHHHhcccccCC--CCCcccccccccccch Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYER-----------------LKNDRAPYETRAQNCAQYTIPSLFPK--DSDNASTDYVTPWQAV 61 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~-----------------l~~~R~~~e~~w~e~~~~~~P~~~~~--~~~~~~~~~~~~~dst 61 (536) |-+.- ...++.-+.+ +..+.......|+++|+=--|..... ..........++--+. 
T Consensus 1 m~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~ 76 (499) T protein:vir:80 1 MINQI----IAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNL 76 (499) T ss_pred ChhHH----HHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecch Confidence 22211 1111111111 11122244556666653211211111 1111111223344456 Q ss_pred HHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhC Q lcl|NC_011045. 62 GARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAG 141 (536) Q Consensus 62 ~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G 141 (536) +...|+.+|+-|++- |. .+++++. +..++|.+ .+..++|...+.++..+...+| T Consensus 77 ~~~iv~~~a~~l~~e--p~----~i~~~d~-------------~~~e~l~~-------~~~~n~f~~~~~~~~~~a~~~G 130 (499) T protein:vir:80 77 PKVTAKYMSKLLFNE--KV----KINIDDE-------------TAEEFVLN-------VLKTNGFTKNMERYIEYGEAMG 130 (499) T ss_pred HHHHHHHHHHhhhCC--cc----eEeeCCH-------------HHHHHHHH-------HHhhccHHHHHHHHHHHHhhcC Confidence 777777777644321 32 2333332 22333333 4445789999999999999999 Q ss_pred cEEEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecC Q lcl|NC_011045. 142 NVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDE 221 (536) Q Consensus 142 ~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~ 221 (536) .+++.+-.+.. +.+++..++...++--....|++..+......+.+ ++.+..++.. ++.. T Consensus 131 ~~~~~~~~D~~-~~~~i~~v~a~~~~Pi~~d~~~~~~~~f~~~~~~~----------------~~~y~~lE~h---~~~~ 190 (499) T protein:vir:80 131 GFVIKVYHDGN-KNVKVSFATADCMYPLSNDSENVDECLIANSFHKN----------------NKYYKLLEWN---EWKG 190 (499) T ss_pred cEEEEEEECCC-CcEEEEEEcCCceEEEEecCCCeEEEEEEEEEeec----------------CeEEEEEEEE---Eecc Confidence 99987665544 34678999998887533445777665533333211 1122222211 1111 Q ss_pred C-CCce----eEEEEec----Ccccccc---------ccccccccCceEEEe----eeecCCCccccchHHHHHHHHHHH Q lcl|NC_011045. 
222 D-SGEY----IRYEEVE----GMEVQGS---------DGTYPKEACPYIPIR----MVRLDGESYGRSYIEEYLGDLRSL 279 (536) Q Consensus 222 ~-~~~~----~~~~~v~----g~~i~~~---------~~~~~~~~~P~~~~r----w~~~~ge~YGrgp~~~~l~d~~~L 279 (536) + .+.| ..|.+-+ |..+... .........||+.++ .++..++++|+|-...+.+.+..| T Consensus 191 ~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~l 270 (499) T protein:vir:80 191 EKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTL 270 (499) T ss_pred cceeeEEEEEEEEeccCccccCcccchhhhccCcCCceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHH Confidence 1 1111 1122111 2222111 011112344555543 344568899999999999999999 Q ss_pred HHHHHHHHHHHHHHhCCceeeccccccchh-hhccCCC--------cc--eecCCccccc--ccccccccchhHHHHHHH Q lcl|NC_011045. 280 ENLQEAIVKMSMISSKVIGLVNPAGITQPR-RLTKAQT--------GD--FVTGRPEDIS--FLQLEKQADFTVAKAVSD 346 (536) Q Consensus 280 ~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~-~~~~~~~--------g~--~~~g~~~~~~--~~~~~~~~~~~~~~~~i~ 346 (536) +..--......+. .+..+.|+++. +.+. +. .+.+ .. .+.+..++.+ +..+...-+...-...++ T Consensus 271 D~~~s~~~~e~~~-~~~~i~v~~~~-l~~~~~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~ 347 (499) T protein:vir:80 271 DLMFDSYYQEFKL-GKKKVLVPSSF-VKTAVNL-DGSTTQYFDSTDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESIN 347 (499) T ss_pred HHHHHHHHHHHHh-cccceecchhh-hhccCCC-CCCcccCCCcccceeeEeeccCCCCcCceeEecCcCChHHHHHHHH Confidence 9888888777765 56666664333 2221 11 0000 00 1122222111 111111111111223333 Q ss_pred HHHHHHHHHH-h-hhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCC--CCCCcce Q lcl|NC_011045. 347 AIEARLSFAF-M-LNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIP--ELPKEAV 422 (536) Q Consensus 347 ~~~~rI~~af-~-~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp--~~~~~~v 422 (536) .+.+.|.... + ...+........|||||....+.......-.-..+.. -|..|++-++.+..-.+... 
..+...+ T Consensus 348 ~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~~~~~~~~~-~l~~l~~~il~~~~~~~~~~~~~~~~~~v 426 (499) T protein:vir:80 348 AMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQ-GIKEMIVSILEVGKLIKAYDGDTVELDTI 426 (499) T ss_pred HHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhccccCCCCCccce Confidence 3333332211 1 0111222333469999998888887776655444433 35556665555544333322 1234567 Q ss_pred EEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 423 EPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMD 502 (536) Q Consensus 423 ~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~ 502 (536) .+.|--++..-. ...+++..+.++ .+ + +.... .++...|+ |++|.+++.++.+..++. T Consensus 427 ~v~f~d~i~~d~-~~~~~~~~~~~~----~G---i---~S~et---~l~~~~~~-------~d~ea~~el~~i~~E~~~- 484 (499) T protein:vir:80 427 TVDFDDSIAQDE-DTTINRYTTAKN----QG---M---IPLKI---ALQRAWNI-------TEAEADEWAEMLAKEKQA- 484 (499) T ss_pred EEEeCCCCCCCH-HHHHHHHHHHHH----cC---C---CCHHH---HHhhcCCC-------ChHHHHHHHHHHHHHhhc- Confidence 777754443321 122222222221 11 0 11111 23444555 444443332222111100 Q ss_pred HHHHHHHHHHHHhhhcCcchHHhhhhcCCCCC Q lcl|NC_011045. 503 NGAAALAQGMAAQATASPEAMAAAADSVGLQP 534 (536) Q Consensus 503 ~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~ 534 (536) . . . .+..++..++ .. T Consensus 485 -~----~----~----~~d~~g~~ge----~e 499 (499) T protein:vir:80 485 -E----I----P----NNDMTGIFGE----EE 499 (499) T ss_pred -C----C----C----CCCccccCCC----CC Confidence 0 0 0 0001111111 11 No 89 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.81 E-value=3.8e-08 Score=61.25 Aligned_cols=403 Identities=11% Similarity=0.007 Sum_probs=178.4 Q ss_pred hcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHH Q lcl|NC_011045. 
38 TIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERII 117 (536) Q Consensus 38 ~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~ 117 (536) ++|.-...+-.. ...+...+-+..+|++++..|. +.+ |+. .|... ...++ T Consensus 1 ~l~~~~~~~~~~---~~~~~v~n~~~~ivd~~~~~l~----~~g--f~~--~d~~~---------~~~~~---------- 50 (434) T protein:vir:98 1 MLPKNAEQAFLD---FQRKARTNFCGLIANASVHRLL----ALG--VTG--PDGEP---------DTRAS---------- 50 (434) T ss_pred CCCCCccHHHHH---hhhhhhccchHHHHHHHHhhhc----cCc--eec--CCCch---------HHHHH---------- Confidence 333322211111 1122234567788888877653 221 332 22211 11222 Q ss_pred HHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCC------ceeeEEEEecceEEEeeCC-CCCeEEEEEeEeccHHHH Q lcl|NC_011045. 118 MNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGS------NYNPMKLYRLSSYVVQRDA-FGNVLQMVTRDQIAFGAL 190 (536) Q Consensus 118 ~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~------~~~~~~~~~l~~~~v~~d~-~G~v~~i~r~~~~t~~~l 190 (536) +.+.+++|.....+++++..+||.+.+++..+..+ ....+++++..+..+..|. .+++...++.+.... T Consensus 51 -~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~--- 126 (434) T protein:vir:98 51 -RWWQANRLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDI--- 126 (434) T ss_pred -HHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEecc--- Confidence 23456899999999999999999999887644322 1234677887777777774 355555444433221 Q ss_pred HHHHhHHhhhccccCCCCceEEEEEEEEe---cC-CCCcee--EEEEecCccccccccccccccCceEEEeeeecCCCcc Q lcl|NC_011045. 191 PEDIRKAVEGQGGEKKADETIDVYTHIYL---DE-DSGEYI--RYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESY 264 (536) Q Consensus 191 ~~~~~~~~~~~~~~~~~~~~~~v~~~v~p---~~-~~~~~~--~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~Y 264 (536) ++.....+.+++.++. +. .+..+. .-.++.... 
......++|..+|++.+.-+...++ + T Consensus 127 -------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~h~~g~vPvv~f~N~~~~~~-~ 191 (434) T protein:vir:98 127 -------------DGFGYARVFFDDTSFPYRTRERTGARLPWGPDSWVYTGT-ADSGDVHDLGGMQLVEFARMPDLGE-D 191 (434) T ss_pred -------------CCceEEEEEEeCcEEEEEEeeccccccccccccceeccc-ccccccCCCCccceEEeccCCCcCc-C Confidence 1111122222222111 11 111111 101111111 1223446688999999876665554 7 Q ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccch------------hhhccCCCcceecCCccccccccc Q lcl|NC_011045. 265 GRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQP------------RRLTKAQTGDFVTGRPEDISFLQL 332 (536) Q Consensus 265 Grgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~------------~~~~~~~~g~~~~g~~~~~~~~~~ 332 (536) |+|=.+..++.+..++...-.++..++..+.|...+. |.... ........|.+..-..+++.+.++ T Consensus 192 g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~ 269 (434) T protein:vir:98 192 PEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIK--GHKFAKRTDPATGMTVVDQPFVPSPSAVWASEGENTQFGQL 269 (434) T ss_pred CcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCCcccccccccccchhhhhhhccccccccCCCCCceEEEe Confidence 9999999999999999999999999999988875552 11110 111112223322111223333333 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHhhhh----cccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 333 EKQADFTVAKAVSDAIEARLSFAFMLNS----AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQL 408 (536) Q Consensus 333 ~~~~~~~~~~~~i~~~~~rI~~af~~~~----~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il 408 (536) . .++++. .++.++.-|........ ....+....++.-+......+..... +.+. 
.+.+-+.+.+.++ T Consensus 270 ~-~~~~~~---~~~~l~~~i~~~~~~~~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~----~k~~-~f~~~l~~~~rl~ 340 (434) T protein:vir:98 270 D-ATDLSG---FLKEHASDVRDMLTISQTPTYLYATDLVNISADTIGALDILHVAKVR----EHIA-SFSEGLESVLALA 340 (434) T ss_pred c-CcchHH---HHHHHHHHHHHHhcccCCCHHHhccccCChHHHHHHHHHHHHHHHHH----HHHH-HHHHHHHHHHHHH Confidence 3 223433 44444444433222111 11112223456555544333332222 2221 1222334444444 Q ss_pred HhcCCCCCCCCcceEEEEechHHH--HHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHH Q lcl|NC_011045. 409 QATQQIPELPKEAVEPTISTGLEA--IGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEE 486 (536) Q Consensus 409 ~~~g~lp~~~~~~v~v~~vs~La~--a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ 486 (536) .+..-.+ .....+++.+..++.. +..++.+.+| .+++ +-.+ .+...+|. +++ T Consensus 341 ~~~~g~~-~~~~~~~v~w~~~~~~s~~~~ada~~kl-------~~~g-------~~~e----~~~~~lg~-------~~~ 394 (434) T protein:vir:98 341 AAQAGVP-EDYTEAEVRWANPAHVTMAVKADAATKL-------KSIG-------YPLD----VIAEELDE-------SPA 394 (434) T ss_pred HHhcCCC-hhheeeeEEecCCCCCCHHHHHHHHHHH-------HhcC-------CcHH----HHHHhCCC-------CHH Confidence 3322111 1223467777665543 2222222222 2221 1111 23344565 345 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCCC Q lcl|NC_011045. 487 QKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPG 535 (536) Q Consensus 487 ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~ 535 (536) |++++.+++.++....++.+. .++.+. 
.+.-++.++.=-| T Consensus 395 e~~r~~~e~~~~~~~~~~~~~--------~~~~~~-~g~~~~~~~~~dg 434 (434) T protein:vir:98 395 RVRRIVAGAASQALLAASLLP--------APGAPS-AGNVPDSGGAVDG 434 (434) T ss_pred HHHHHHHHHHHHHHHHHhhhc--------cCCCCC-CCCCCcccCCCCC Confidence 666555444332222211110 001110 1111222111112 No 90 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=98.81 E-value=4e-08 Score=61.15 Aligned_cols=446 Identities=13% Similarity=0.087 Sum_probs=181.8 Q ss_pred CCC--ccccccHH--------HHHHHHH----HHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHH Q lcl|NC_011045. 1 MAE--KRTGLAEE--------GAKSVYE----RLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGL 66 (536) Q Consensus 1 Ma~--~~~~~~~~--------~~~~r~~----~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~ 66 (536) |.. .-..+-++ ++.+..+ .+..++......|+.+|+=-.|.+.-....+......+.--+.+...| T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 111 00000000 0000000 011122223334444443221221111111111111111224455555 Q ss_pred HHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEE Q lcl|NC_011045. 67 NNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLY 146 (536) Q Consensus 67 ~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~ 146 (536) +.+|+ .+|--.+ .+++++.. 
..++|+ +.+..++|+..+.+++.+..+.|.+++- T Consensus 81 ~~~A~----lv~~e~~--~i~~~d~~-------------~~~~l~-------~il~~n~f~~~~~~~~e~a~a~G~~~~k 134 (500) T protein:vir:30 81 KKIAS----LVFNEQA--EIKVDDDA-------------ANEFIS-------ETLKNDRFNKNFERYLESCLALGGLAMR 134 (500) T ss_pred HHHhh----hhcCCcc--eEecCChH-------------HHHHHH-------HHHhhccHHHHHHHHHHHHhhcCCEEEE Confidence 55555 3442112 23333321 233333 3455689999999999999999999875 Q ss_pred EecCCCCceeeEEEEecceEEE-eeCCCCCeEEEE-EeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCC Q lcl|NC_011045. 147 LPEPEGSNYNPMKLYRLSSYVV-QRDAFGNVLQMV-TRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSG 224 (536) Q Consensus 147 ~~~~~~~~~~~~~~~~l~~~~v-~~d~~G~v~~i~-r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~ 224 (536) +..+.+ .+.+..++...++- .-|..|.+...| ++...+. ..+...+..++.++. . ++. T Consensus 135 ~~~d~~--~~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~--------------~~~~~~yt~lE~h~~---~-~~~ 194 (500) T protein:vir:30 135 PYVDGD--KVRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTI--------------NGKEVYYTLIEFHEW---Q-SSD 194 (500) T ss_pred EEEeCC--ceEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeee--------------cCCceEEEEEEEEEE---e-CCc Confidence 444332 34578888888775 455556554444 2221110 011122233332221 1 111 Q ss_pred c----eeEEEEe----cCccccccc--------cccccccCc-eEEE---ee-eecCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_011045. 225 E----YIRYEEV----EGMEVQGSD--------GTYPKEACP-YIPI---RM-VRLDGESYGRSYIEEYLGDLRSLENLQ 283 (536) Q Consensus 225 ~----~~~~~~v----~g~~i~~~~--------~~~~~~~~P-~~~~---rw-~~~~ge~YGrgp~~~~l~d~~~L~~l~ 283 (536) . |..|.+- -|..+...+ ..+..-..| |..+ -. 
+...+++||.|-...+.+.+..|+..- T Consensus 195 ~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~ 274 (500) T protein:vir:30 195 DYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTY 274 (500) T ss_pred eeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHH Confidence 1 2223221 133322111 100001223 2222 22 334478899999999999999999988 Q ss_pred HHHHHHHHHHhCCceeeccccccchhhhccC---CCcc-------e--ecCCccc-ccccccccccchhHHHHHHHHHHH Q lcl|NC_011045. 284 EAIVKMSMISSKVIGLVNPAGITQPRRLTKA---QTGD-------F--VTGRPED-ISFLQLEKQADFTVAKAVSDAIEA 350 (536) Q Consensus 284 ~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~---~~g~-------~--~~g~~~~-~~~~~~~~~~~~~~~~~~i~~~~~ 350 (536) -+.....+. .+..+.|+++-+....+...+ .+.. + +.+..++ ..+..+...-........++.+-+ T Consensus 275 s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~ 353 (500) T protein:vir:30 275 DEFMWEVKM-GQRRVAVPESLTALTVRTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLS 353 (500) T ss_pred HHHHHHHHh-CcceeeechHHhcccCCCCCccccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHH Confidence 888877765 666667754432111111111 0111 1 1111111 111111111112223333434444 Q ss_pred HHHHHH-h-hhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCC--CCCcceEEEE Q lcl|NC_011045. 351 RLSFAF-M-LNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPE--LPKEAVEPTI 426 (536) Q Consensus 351 rI~~af-~-~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~--~~~~~v~v~~ 426 (536) .|.... + ...+........|||||....+...+...-....+.. -|..|++-++.+..-.+.... 
.+...+.|++ T Consensus 354 ~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~~~-al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f 432 (500) T protein:vir:30 354 LFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIVALVEQ-SLKELVISIFEIAKAYDLYQSEVPSMDNISISL 432 (500) T ss_pred HHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEe Confidence 443221 1 1112212233459999999998888887765555443 355566666655432221111 1223467777 Q ss_pred echHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 427 STGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAA 506 (536) Q Consensus 427 vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~ 506 (536) --++.... ...++...+.++ + + + +....+ +.+.+|+ |++|.+++.++-+..+.. T Consensus 433 ~d~i~~d~-~~~~~~~~~~v~---a-G---i---~s~~~~---i~~~~g~-------~eeea~~~l~~i~~E~~~----- 486 (500) T protein:vir:30 433 DDGVFTDR-DAELDYWIKVVN---A-G---F---GTREMA---IQKVLNV-------TEEKAQEIAAEINTGIVD----- 486 (500) T ss_pred CCCCCCCH-HHHHHHHHHHHH---c-C---C---CCHHHH---HHhcCCC-------CHHHHHHHHHHHHHhccc----- Confidence 54433221 122222222221 1 1 0 222222 2344565 455554443332211000 Q ss_pred HHHHHHHHhhhcCcchHHhhhhcCCC Q lcl|NC_011045. 507 ALAQGMAAQATASPEAMAAAADSVGL 532 (536) Q Consensus 507 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (536) . .+. .....+..|- T Consensus 487 -~---------~~~--~~~~~~~~g~ 500 (500) T protein:vir:30 487 -E---------INQ--QRTDTHLYGE 500 (500) T ss_pred -c---------CCC--CCccccccCC Confidence 0 000 0000111111 No 91 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=98.81 E-value=4e-08 Score=61.15 Aligned_cols=446 Identities=13% Similarity=0.087 Sum_probs=181.8 Q ss_pred CCC--ccccccHH--------HHHHHHH----HHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHH Q lcl|NC_011045. 
1 MAE--KRTGLAEE--------GAKSVYE----RLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGL 66 (536) Q Consensus 1 Ma~--~~~~~~~~--------~~~~r~~----~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~ 66 (536) |.. .-..+-++ ++.+..+ .+..++......|+.+|+=-.|.+.-....+......+.--+.+...| T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 111 00000000 0000000 011122223334444443221221111111111111111224455555 Q ss_pred HHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEE Q lcl|NC_011045. 67 NNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLY 146 (536) Q Consensus 67 ~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~ 146 (536) +.+|+ .+|--.+ .+++++.. ..++|+ +.+..++|+..+.+++.+..+.|.+++- T Consensus 81 ~~~A~----lv~~e~~--~i~~~d~~-------------~~~~l~-------~il~~n~f~~~~~~~~e~a~a~G~~~~k 134 (500) T protein:vir:98 81 KKIAS----LVFNEQA--EIKVDDDA-------------ANEFIS-------ETLKNDRFNKNFERYLESCLALGGLAMR 134 (500) T ss_pred HHHhh----hhcCCcc--eEecCChH-------------HHHHHH-------HHHhhccHHHHHHHHHHHHhhcCCEEEE Confidence 55555 3442112 23333321 233333 3455689999999999999999999875 Q ss_pred EecCCCCceeeEEEEecceEEE-eeCCCCCeEEEE-EeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCC Q lcl|NC_011045. 147 LPEPEGSNYNPMKLYRLSSYVV-QRDAFGNVLQMV-TRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSG 224 (536) Q Consensus 147 ~~~~~~~~~~~~~~~~l~~~~v-~~d~~G~v~~i~-r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~ 224 (536) +..+.+ .+.+..++...++- .-|..|.+...| ++...+. ..+...+..++.++. . ++. 
T Consensus 135 ~~~d~~--~~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~--------------~~~~~~yt~lE~h~~---~-~~~ 194 (500) T protein:vir:98 135 PYVDGD--KVRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTI--------------NGKEVYYTLIEFHEW---Q-SSD 194 (500) T ss_pred EEEeCC--ceEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeee--------------cCCceEEEEEEEEEE---e-CCc Confidence 444332 34578888888775 455556554444 2221110 011122233332221 1 111 Q ss_pred c----eeEEEEe----cCccccccc--------cccccccCc-eEEE---ee-eecCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_011045. 225 E----YIRYEEV----EGMEVQGSD--------GTYPKEACP-YIPI---RM-VRLDGESYGRSYIEEYLGDLRSLENLQ 283 (536) Q Consensus 225 ~----~~~~~~v----~g~~i~~~~--------~~~~~~~~P-~~~~---rw-~~~~ge~YGrgp~~~~l~d~~~L~~l~ 283 (536) . |..|.+- -|..+...+ ..+..-..| |..+ -. +...+++||.|-...+.+.+..|+..- T Consensus 195 ~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~ 274 (500) T protein:vir:98 195 DYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTY 274 (500) T ss_pred eeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHH Confidence 1 2223221 133322111 100001223 2222 22 334478899999999999999999988 Q ss_pred HHHHHHHHHHhCCceeeccccccchhhhccC---CCcc-------e--ecCCccc-ccccccccccchhHHHHHHHHHHH Q lcl|NC_011045. 284 EAIVKMSMISSKVIGLVNPAGITQPRRLTKA---QTGD-------F--VTGRPED-ISFLQLEKQADFTVAKAVSDAIEA 350 (536) Q Consensus 284 ~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~---~~g~-------~--~~g~~~~-~~~~~~~~~~~~~~~~~~i~~~~~ 350 (536) -+.....+. .+..+.|+++-+....+...+ .+.. 
+ +.+..++ ..+..+...-........++.+-+ T Consensus 275 s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~ 353 (500) T protein:vir:98 275 DEFMWEVKM-GQRRVAVPESLTALTVRTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLS 353 (500) T ss_pred HHHHHHHHh-CcceeeechHHhcccCCCCCccccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHH Confidence 888877765 666667754432111111111 0111 1 1111111 111111111112223333434444 Q ss_pred HHHHHH-h-hhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCC--CCCcceEEEE Q lcl|NC_011045. 351 RLSFAF-M-LNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPE--LPKEAVEPTI 426 (536) Q Consensus 351 rI~~af-~-~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~--~~~~~v~v~~ 426 (536) .|.... + ...+........|||||....+...+...-....+.. -|..|++-++.+..-.+.... .+...+.|++ T Consensus 354 ~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~~~-al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f 432 (500) T protein:vir:98 354 LFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIVALVEQ-SLKELVISIFEIAKAYDLYQSEVPSMDNISISL 432 (500) T ss_pred HHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEe Confidence 443221 1 1112212233459999999998888887765555443 355566666655432221111 1223467777 Q ss_pred echHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 427 STGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAA 506 (536) Q Consensus 427 vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~ 506 (536) --++.... ...++...+.++ + + + +....+ +.+.+|+ |++|.+++.++-+..+.. 
T Consensus 433 ~d~i~~d~-~~~~~~~~~~v~---a-G---i---~s~~~~---i~~~~g~-------~eeea~~~l~~i~~E~~~----- 486 (500) T protein:vir:98 433 DDGVFTDR-DAELDYWIKVVN---A-G---F---GTREMA---IQKVLNV-------TEEKAQEIAAEINTGIVD----- 486 (500) T ss_pred CCCCCCCH-HHHHHHHHHHHH---c-C---C---CCHHHH---HHhcCCC-------CHHHHHHHHHHHHHhccc----- Confidence 54433221 122222222221 1 1 0 222222 2344565 455554443332211000 Q ss_pred HHHHHHHHhhhcCcchHHhhhhcCCC Q lcl|NC_011045. 507 ALAQGMAAQATASPEAMAAAADSVGL 532 (536) Q Consensus 507 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (536) . .+. .....+..|- T Consensus 487 -~---------~~~--~~~~~~~~g~ 500 (500) T protein:vir:98 487 -E---------INQ--QRTDTHLYGE 500 (500) T ss_pred -c---------CCC--CCccccccCC Confidence 0 000 0000111111 No 92 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=98.79 E-value=4.8e-08 Score=60.73 Aligned_cols=421 Identities=10% Similarity=0.021 Sum_probs=195.4 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc---ccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCcce Q lcl|NC_011045. 8 LAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPS---LFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFPMQTWM 84 (536) Q Consensus 8 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~---~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf 84 (536) |+.+.+.+..++++..+ .+++.+.+|..-. +.... ....+...++-.+.+...++..++.|++ -| + T Consensus 1 l~~~~l~~~i~~~~~~~----~r~~~l~~yy~g~~~il~~~~-~~~~~~~~ki~~n~~~~ivd~~~~~l~g--~~----~ 69 (429) T protein:vir:98 1 MTKDLLSELIQKHRSFN----LSYSAYKQLYEGDHAILQQKQ-KEQYKPDNRLVVNFAKYIVDTFNGYFIG--VP----V 69 (429) T ss_pred CCHHHHHHHHHHHHHHH----HHHHHHHHHhccccccccccc-cccCCCcceeecchHHHHHHHHhhhhcc--cC----c Confidence 88888888888876543 3344444444321 11111 1111223456666677777777766643 12 1 Q ss_pred eccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecc Q lcl|NC_011045. 
85 RLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLS 164 (536) Q Consensus 85 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~ 164 (536) .++..+. .+.+ .+...+..++|.....++.++..+||.|.+++..+.. +.++++.++.. T Consensus 70 ~~~~~~~-------------~~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~-g~~~~~~~~p~ 128 (429) T protein:vir:98 70 QTSHENK-------------QVSN-------YLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDEN-AEAGITYLTPL 128 (429) T ss_pred eeecCCh-------------HHHH-------HHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCC-CcEEEEEEccc Confidence 2233221 1222 2333344578999999999999999999988776554 44568888776 Q ss_pred eEEEeeCCC--CCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccccccc Q lcl|NC_011045. 165 SYVVQRDAF--GNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDG 242 (536) Q Consensus 165 ~~~v~~d~~--G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~ 242 (536) +.++..|.. +++...+|.+.- .+. +.+..+... .....|..-.+........ T Consensus 129 ~~~~v~dd~~~~~~~~~i~~~~~-------------------~~~-----~~~~~~~~~--~~~~~~~~~~~~~~~~~~~ 182 (429) T protein:vir:98 129 EAFIVYDDSIRQKPLFAVRYFYN-------------------KGG-----VLEGSYSDA--SNITYFKDGEKGIEIGESE 182 (429) T ss_pred ceEEEEeCCCCCceEEEEEEEEe-------------------cCc-----eEEEEEEeC--ceEEEEEecCCceEecccc Confidence 666655542 334444443321 000 111111111 1122222221111112233 Q ss_pred ccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCccee-- Q lcl|NC_011045. 243 TYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGDFV-- 320 (536) Q Consensus 243 ~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~~~-- 320 (536) .+++..+|++.++ ++.+|+|-.+...+-+..++.+.-......+....|.+.+... 
....+.......+.++ T Consensus 183 ~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~-~~~~~~~~~~~~~~~~~~ 256 (429) T protein:vir:98 183 PHPFDGVPMIEYV-----ENEERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILGA-ELDDETLKSLRDTRIINL 256 (429) T ss_pred cccCCccceEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC-CCCcchhhhHhhCceeec Confidence 4557788987653 3568999999999999999999999999999999998776422 1122222222222222 Q ss_pred cCCc-ccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHH Q lcl|NC_011045. 321 TGRP-EDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLP 399 (536) Q Consensus 321 ~g~~-~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~P 399 (536) ++.. .+..+..+....+.+.....++.+.+.|-..-..-..........|+..+..+..- .........+.-.+.+.- T Consensus 257 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Al~~~~~~-l~~k~~~~~~~~~~~l~~ 335 (429) T protein:vir:98 257 KDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVANISDESFGTASGIALRYRLQA-MDNLAKTKERKFMSGMNR 335 (429) T ss_pred cCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccchHHHHHHHHHH-HHHHHHHHHHHHHHHHHH Confidence 2111 11122233444466667777777777664432211111111123455544432111 111112222222222222 Q ss_pred HHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChh Q lcl|NC_011045. 400 LVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTS 479 (536) Q Consensus 400 li~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~ 479 (536) +++-+..++...+. +..-..++|+|.-++..- ..+.++.+... 
+.+ +..+.++ +.+|..+ T Consensus 336 ~~~li~~~~~~~~~--~~d~~~i~v~f~~~~p~~-~~~~a~~~~kl----~g~--------is~et~~----~~l~~v~- 395 (429) T protein:vir:98 336 RYKLIASYPTSKIG--PKDWIGIKYKFTRNLPAN-LLEESQIAGNL----AGI--------VSEETQV----GVLSIVE- 395 (429) T ss_pred HHHHHHHHhccCCC--ccccccceEEeCCCCCcC-HHHHHHHHHHH----hcc--------CchHHHH----HhCCCCC- Confidence 33333333322222 122234677765443321 11112222211 111 2222222 3333211 Q ss_pred hccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhh Q lcl|NC_011045. 480 GILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAAD 528 (536) Q Consensus 480 ~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 528 (536) ..++|++++.+++....+. ++. .+-.+ ....-.+ T Consensus 396 ---d~~~E~~ri~~E~~~~~~~--~~~----~~~~~------~~~~~~~ 429 (429) T protein:vir:98 396 ---NPQKEIERKNSDKSTLISR--QAG----GLNGQ------NTTTILE 429 (429) T ss_pred ---CHHHHHHHHHHHHHHHHHH--HHh----hhcCC------CCCCCCC Confidence 2356666665554422111 111 11111 1111111 No 93 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=98.77 E-value=5.5e-08 Score=60.36 Aligned_cols=467 Identities=13% Similarity=0.048 Sum_probs=217.7 Q ss_pred CCCcccccc-HHHH-------HHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCC--Ccc-cccccccccchHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLA-EEGA-------KSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDS--DNA-STDYVTPWQAVGARGLNNL 69 (536) Q Consensus 1 Ma~~~~~~~-~~~~-------~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~--~~~-~~~~~~~~dst~~~a~~~L 69 (536) |+..+.--. .+-+ -.+-......| +..++.+.+|..-.-..-.. 
..+ .+....++++.+...++ T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~R---l~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~~-- 75 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKAR---LASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLIE-- 75 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHH---HHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHhhC-- Confidence 544221000 0000 00001111111 34455555665432100000 001 11234567777743332 Q ss_pred HHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEec Q lcl|NC_011045. 70 ASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPE 149 (536) Q Consensus 70 aa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~ 149 (536) +-+-.+.|--.|+- . .. =++|+..+...+++.|++....++-.+.++.|-|++++-. T Consensus 76 --~~~~~~~~g~~~~~---~-~~-----------------~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~w 132 (527) T protein:vir:10 76 --AKMRFLGQGLKWEF---S-KK-----------------DAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIG 132 (527) T ss_pred --CcceeeccCccccc---c-ch-----------------hHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEee Confidence 22222234223321 0 00 0124445556677799999999999999999999998876 Q ss_pred CCCC---ceeeEEEEecceEEEeeCCCC--CeEEEEEeEeccHHHHHHHHhHHhh-----h----ccccCCC--CceE-- Q lcl|NC_011045. 150 PEGS---NYNPMKLYRLSSYVVQRDAFG--NVLQMVTRDQIAFGALPEDIRKAVE-----G----QGGEKKA--DETI-- 211 (536) Q Consensus 150 ~~~~---~~~~~~~~~l~~~~v~~d~~G--~v~~i~r~~~~t~~~l~~~~~~~~~-----~----~~~~~~~--~~~~-- 211 (536) +.++ +.++.+.+-.+.|+..+|++| .|..+|-. 
....++++-++..+ + ....+.+ .-.+ T Consensus 133 D~~k~~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~---~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~y 209 (527) T protein:vir:10 133 DDEKDEGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLV---DEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKY 209 (527) T ss_pred ccCCCcCCCceEeecCcceeeeeecCCCCCceeeEEEe---eeccCCccccccceehhhhhhhhhcCcccccccCcceee Confidence 6543 356788888899999999876 45555432 12222222221111 0 1111111 0111 Q ss_pred EEEEEEEecCCCCc-----eeEEE-EecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 212 DVYTHIYLDEDSGE-----YIRYE-EVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEA 285 (536) Q Consensus 212 ~v~~~v~p~~~~~~-----~~~~~-~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~ 285 (536) +..|+-.=+.++-. ..-|. .++|.++. ....+..-.|+++++=...++++||+|=..+.+.-+.+||..... T Consensus 210 t~~~w~lg~w~d~~e~p~~~~~~~~~~~~~~l~--~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td 287 (527) T protein:vir:10 210 TEELYEPGKWDDRPESPLEPDDIKKLSTLTEEE--PLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTD 287 (527) T ss_pred eeceeeccccccccccccchhhhhhhcCceeee--cccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhH Confidence 11111000000000 00011 12333332 223334567888777777889999999999999999999999999 Q ss_pred HHHHHHHHhCCceeeccccccchhhhc------cCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011045. 286 IVKMSMISSKVIGLVNPAGITQPRRLT------KAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN 359 (536) Q Consensus 286 ~~~~~~~a~~p~~lv~~~g~~~~~~~~------~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~ 359 (536) ....+...-.|+... +|+... +.. .-++|.++.-..+ ..+..+....++...+..+..+..+|... 
T Consensus 288 ~s~is~~sG~Pi~~~--tg~~~v-d~~G~~~~~~VgPG~iweL~e~-ak~~~v~~~~~la~~~~h~~~L~~~l~~v---- 359 (527) T protein:vir:10 288 EDLIMVFGGLGFYAT--DSAPPR-DSRGNMVPWTISPLGMVEHGQN-NKIYRVNGVASLEPSQTHMTKAEEAMQQT---- 359 (527) T ss_pred HHHHHHHhCCceeee--cccccc-cccCCcCccccCCceeEecCCC-cceeeccchhhhHHHHHHHHHHHHHHHHh---- Confidence 988898888887655 344322 211 0123443321111 12223333345565666666666555432 Q ss_pred hcccC------CCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHH-HHHHHHHHHH---------HHHhcCCCCCCCCcceE Q lcl|NC_011045. 360 SAVQR------TGERVTAEEIRYVASELEDTLGGVYSILSQEL-QLPLVRVLLK---------QLQATQQIPELPKEAVE 423 (536) Q Consensus 360 ~~~~~------~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~-l~Pli~r~~~---------il~~~g~lp~~~~~~v~ 423 (536) +-.++ +..+ --+.+ .+...|+|++.+.+..- +.-.+.|-|+ ....-+.-+..+.-.++ T Consensus 360 A~~PavA~G~vD~s~-~~SG~-----ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ 433 (527) T protein:vir:10 360 KGIPDIAVGVVDAAV-AESGI-----ALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVT 433 (527) T ss_pred hcCCeeeeccccCCc-CcHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceE Confidence 21111 2111 11222 23344556666555542 2222232211 11111111112222456 Q ss_pred EEEe--chHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 424 PTIS--TGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGM 501 (536) Q Consensus 424 v~~v--s~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~ 501 (536) +.|- .|..+.+. ++++....+ .--+....+++.+.++.|+ -..++|++++.+.+.++.-. T Consensus 434 ivf~p~lP~D~~av---ie~v~tL~~----------aGi~S~~tAv~~L~~~~g~-----eD~E~E~~~I~~era~~a~a 495 (527) T protein:vir:10 434 ITFRDPKPVNSEKR---FNQLLQLWE----------AGLIPAKKLTEELSKIMGF-----ELTEEDFKQATEDKKTQGIA 495 (527) T ss_pred EEecccCCCCHHHH---HHHHHHHHH----------cCchhHHHHHHHHHhccCC-----CChHHHHHHHHHHHHHHhHH Confidence 6553 44444332 222221111 1125667788888887775 24667887777665543322 Q ss_pred HHHHHHHHHHHHHhhhcCcchHHhhhh-cCCCCCC Q lcl|NC_011045. 
502 DNGAAALAQGMAAQATASPEAMAAAAD-SVGLQPG 535 (536) Q Consensus 502 ~~~a~~~~~~~~~~~~~~~~~~~~~~~-~~~~q~~ 535 (536) +..| .+...++++..+...-+..| +++.||- T Consensus 496 ~a~A---~~~~~a~~~~~~g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 496 QAEA---ADPFGAQMAAEQGIPDEEDDQALNGQPL 527 (527) T ss_pred hhhh---cCchhhhhccccCCCCCCcccccCCCCC Confidence 2222 22333344444333333344 6677777 No 94 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=98.77 E-value=5.7e-08 Score=60.29 Aligned_cols=445 Identities=13% Similarity=0.124 Sum_probs=190.2 Q ss_pred CCC--ccccccHHHHH--------------HHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccccc-ccccccchHH Q lcl|NC_011045. 1 MAE--KRTGLAEEGAK--------------SVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTD-YVTPWQAVGA 63 (536) Q Consensus 1 Ma~--~~~~~~~~~~~--------------~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~-~~~~~dst~~ 63 (536) |.- .-..+=++.+. .+. .+..++..-...|+.+|+=..|.+--.. ..+.++ ..++--+.+. T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~ri~~~~~~y~g~~~~~~~~~-~~~~~~~~~~~sln~~~ 78 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRI-SIDPDEYVRIQTDLDYYSDKLQYIHYQA-SDGIKKKRLKNTINMAK 78 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhccccc-ccCHHHHHHHHHHHHHhcCCCccccccc-CCCCccccceeecchHH Confidence 221 00000000000 000 1122222334455555544333221111 111111 1122224556 Q ss_pred HHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcE Q lcl|NC_011045. 64 RGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNV 143 (536) Q Consensus 64 ~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~ 143 (536) ..|+.+|+-+.+- | + .+++++.. 
...+||++ .+..++|+..+.+++.+..++|.+ T Consensus 79 ~i~~~~A~lv~~e--~--~--~i~v~~~~------------~~~e~l~~-------il~~n~f~~~~~~~~e~a~a~G~~ 133 (508) T protein:vir:15 79 TAARRIASVVFNE--K--A--EIHVKDNN------------EADKFLND-------VLEDNDFKNKFEEALEKGVALGGF 133 (508) T ss_pred HHHHHHHhhhhCC--C--c--eEEeCCch------------HHHHHHHH-------HHHhccHHHHHHHHHHHHhhcCce Confidence 6666666555322 2 2 12222211 12334433 555789999999999999999999 Q ss_pred EEEEecCCCCceeeEEEEecceEEE-eeCCCCCeEEE-E-EeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEec Q lcl|NC_011045. 144 LLYLPEPEGSNYNPMKLYRLSSYVV-QRDAFGNVLQM-V-TRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLD 220 (536) Q Consensus 144 ~l~~~~~~~~~~~~~~~~~l~~~~v-~~d~~G~v~~i-~-r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~ 220 (536) ++-+..+.+ .+.+..++...++- ..|. |++.++ | ++.+.+ ...+.+.+..++.++. T Consensus 134 ~~k~~~d~~--~~~i~~v~ad~~~P~~~d~-~~~~~~af~~~~~~~--------------~~~~~~~yt~lE~h~~---- 192 (508) T protein:vir:15 134 AMRPYIDGN--HIKIAWVRADQFYPLQSNT-NDISEAAIASRTQRT--------------ESNQTKYYTLLEFHQW---- 192 (508) T ss_pred EEEEEEeCC--eeEEEEEcCCeeEEEEEcC-CCeEEEEEEEEEEee--------------cCCCceEEEEEEEEEE---- Confidence 874443333 35678889888774 5554 545443 3 233221 0111222333332221 Q ss_pred CCCCce----eEEEEec----Cccccccc----------cc-cccccCceEEEee---e-ecCCCccccchHHHHHHHHH Q lcl|NC_011045. 221 EDSGEY----IRYEEVE----GMEVQGSD----------GT-YPKEACPYIPIRM---V-RLDGESYGRSYIEEYLGDLR 277 (536) Q Consensus 221 ~~~~~~----~~~~~v~----g~~i~~~~----------~~-~~~~~~P~~~~rw---~-~~~ge~YGrgp~~~~l~d~~ 277 (536) .+++.+ ..|.+-. |..+.... .. .+...-||+.++. + ...++.||+|-...+.+.+. 
T Consensus 193 ~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid 272 (508) T protein:vir:15 193 QDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLD 272 (508) T ss_pred ecCcceEEEEEEEecCCchhcCcccchhhcccccCCCcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHH Confidence 122222 2333211 33332111 00 1122233444443 2 23368899999999999999 Q ss_pred HHHHHHHHHHHHHHHHhCCceeeccccccchhh-hc---cCCCccee--cCCcc-cccccccccccchhHHHHHHHHHHH Q lcl|NC_011045. 278 SLENLQEAIVKMSMISSKVIGLVNPAGITQPRR-LT---KAQTGDFV--TGRPE-DISFLQLEKQADFTVAKAVSDAIEA 350 (536) Q Consensus 278 ~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~-~~---~~~~g~~~--~g~~~-~~~~~~~~~~~~~~~~~~~i~~~~~ 350 (536) .||..--....-. ...++.+.|+++ +++... .. +...-.+. .+..+ ...+..+...-+...-...++.+.+ T Consensus 273 ~lD~~~s~~~~e~-~~~~~~i~v~~~-~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~ 350 (508) T protein:vir:15 273 DINDTHDQFIWEI-RLGQKHIAVQPG-MLRFDDEHKPTFDTEQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIK 350 (508) T ss_pred HHHHHHHHHHHHH-HhcccceeechH-HhcCCCCCccccCCCCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHH Confidence 9998877777766 677888777544 332211 00 01111111 11111 1111111111122223444455555 Q ss_pred HHHHHHhhh--hcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCC--------C--C Q lcl|NC_011045. 351 RLSFAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPE--------L--P 418 (536) Q Consensus 351 rI~~af~~~--~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~--------~--~ 418 (536) .|....-+. .+......-.|||||....+...+...-....+. ..|..|++-++.++.-.+.... . 
+ T Consensus 351 ~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t~~~~~~~~~-~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~ 429 (508) T protein:vir:15 351 EFEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQTRSSYLTMVE-KAIDELCQSIFELANAGALFDDGKPLFTLDSASQ 429 (508) T ss_pred HHHHHhCCCchhcccccCccccHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhccccccccccccccccC Confidence 444332111 1111122335999999998888887776554444 4466677777776654433221 1 2 Q ss_pred CcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHH Q lcl|NC_011045. 419 KEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQ 498 (536) Q Consensus 419 ~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q 498 (536) ...++|.|--++..-. ...++...+.++ .+ + +....+ +....|+ |++|++++.++.+.. T Consensus 430 ~~~v~v~f~D~i~~d~-~~~~~~~~~~v~----aG---i---~s~e~~---i~~~~g~-------~deea~~el~ri~~E 488 (508) T protein:vir:15 430 PLDIECHFDDGVFVNK-DKQLEEDAKVLA----IG---A---LSKQTF---LQRNYGM-------TDEQAAEELAKIQSE 488 (508) T ss_pred CcceEEEeCCCCCCCH-HHHHHHHHHHHh----cC---C---CCHHHH---HHhcCCC-------ChHHHHHHHHHHHHh Confidence 2335666654433222 112222222221 11 1 122222 2334555 455554443332221 Q ss_pred HHHHHHHHHHHHHHHH-hhhcCcc Q lcl|NC_011045. 499 MGMDNGAAALAQGMAA-QATASPE 521 (536) Q Consensus 499 ~~~~~~a~~~~~~~~~-~~~~~~~ 521 (536) +.... ...+... ..+..++ T Consensus 489 ~~~~~----~~~~~~~~~~g~~ge 508 (508) T protein:vir:15 489 APTDT----FEGGRSAILNGGDGE 508 (508) T ss_pred ccccC----ccccccccCCCCCCC Confidence 11000 0000000 0111112 No 95 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=98.76 E-value=6.1e-08 Score=60.13 Aligned_cols=467 Identities=13% Similarity=0.041 Sum_probs=218.1 Q ss_pred CCCcccccc-HHHH-------HHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCC--Ccc-cccccccccchHHHHHHHH Q lcl|NC_011045. 
1 MAEKRTGLA-EEGA-------KSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDS--DNA-STDYVTPWQAVGARGLNNL 69 (536) Q Consensus 1 Ma~~~~~~~-~~~~-------~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~--~~~-~~~~~~~~dst~~~a~~~L 69 (536) |+..+.--. .+-+ -.+-......| +..++.+.+|..-.-..-.. ..+ .+....++++.+...++ T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~R---l~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~~-- 75 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKAR---LASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLIE-- 75 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHH---HHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHhhC-- Confidence 544221000 0000 00001111111 34455555665432100000 001 11234567777743332 Q ss_pred HHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEec Q lcl|NC_011045. 70 ASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPE 149 (536) Q Consensus 70 aa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~ 149 (536) +-+-.+.|--.|+- . .. =++|+..+...+++.|++....++-.+.++.|-|++++-. T Consensus 76 --~~~~~~~~g~~~~~---~-~~-----------------~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~w 132 (527) T protein:vir:10 76 --AKMRFLGQGLKWEF---S-KK-----------------DAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIG 132 (527) T ss_pred --CcceeeccCccccc---c-ch-----------------hHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEee Confidence 22222234223321 0 00 0134555566777899999999999999999999998876 Q ss_pred CCCC---ceeeEEEEecceEEEeeCCCC--CeEEEEEeEeccHHHHHHHHhHHhh-----h----ccccCCC--CceE-- Q lcl|NC_011045. 150 PEGS---NYNPMKLYRLSSYVVQRDAFG--NVLQMVTRDQIAFGALPEDIRKAVE-----G----QGGEKKA--DETI-- 211 (536) Q Consensus 150 ~~~~---~~~~~~~~~l~~~~v~~d~~G--~v~~i~r~~~~t~~~l~~~~~~~~~-----~----~~~~~~~--~~~~-- 211 (536) +.++ +.++.+.+-.+.|+..+|++| .|..+|-. 
....++++-++..+ + ....+.+ .-.+ T Consensus 133 D~~k~~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~---~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~y 209 (527) T protein:vir:10 133 DDEKDEGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLV---DEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKY 209 (527) T ss_pred ccCCCcCCCceEeecCcceeeeeecCCCCCceeeEEEe---eeccCCccccccceehhhhhhhhhcCcccccccCcceee Confidence 6543 356788888899999999876 45555432 12222222221111 0 1111111 0111 Q ss_pred EEEEEEEecCCCCc-----eeEEE-EecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 212 DVYTHIYLDEDSGE-----YIRYE-EVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEA 285 (536) Q Consensus 212 ~v~~~v~p~~~~~~-----~~~~~-~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~ 285 (536) +..|+-.=+.++-. ..-|. .++|.++. ....+..-.|+++++=...++++||+|=..+.+.-+.+||..... T Consensus 210 t~~~w~lg~w~d~~e~p~~~~~~~~~~~~~~l~--~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td 287 (527) T protein:vir:10 210 TEELYEPGKWDDRPESPLEPDDIKKLSTLTEEE--PLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTD 287 (527) T ss_pred eeceeeccccccccccccchhhhhhhcCceeee--cccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhH Confidence 11111000000000 00011 12333332 223334567888777777889999999999999999999999999 Q ss_pred HHHHHHHHhCCceeeccccccchhhhc------cCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011045. 286 IVKMSMISSKVIGLVNPAGITQPRRLT------KAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN 359 (536) Q Consensus 286 ~~~~~~~a~~p~~lv~~~g~~~~~~~~------~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~ 359 (536) ....+...-.|+... +|+... +.. .-++|.++.-..+ ..+..+....++...+..+..+..+|... 
T Consensus 288 ~s~is~~sG~Pi~~~--tg~~~v-d~~G~~~~~~VgPG~iweL~e~-ak~~~v~~~~~la~~~~h~~~L~~~l~~v---- 359 (527) T protein:vir:10 288 EDLIMVFGGLGFYAT--DSAPPR-DSRGNMVPWTISPLGMVEHGQN-NKIYRVNGVASLEPSQTHMNKAEEAMQQT---- 359 (527) T ss_pred HHHHHHHhCCceeee--cccccc-cccCCcCccccCCceeEecCCC-cceeeccchhhhHHHHHHHHHHHHHHHHh---- Confidence 988898888887655 344322 211 0123443321111 12223333345555666666666555432 Q ss_pred hcccC------CCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHH-HHHHHHHHHH---------HHHhcCCCCCCCCcceE Q lcl|NC_011045. 360 SAVQR------TGERVTAEEIRYVASELEDTLGGVYSILSQEL-QLPLVRVLLK---------QLQATQQIPELPKEAVE 423 (536) Q Consensus 360 ~~~~~------~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~-l~Pli~r~~~---------il~~~g~lp~~~~~~v~ 423 (536) +-.++ +..+ --+.+ .+...|+|++.+.+..- +.-.+.|-|+ ....-+.-+..+.-.++ T Consensus 360 A~~PavA~G~vD~s~-~~SG~-----ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ 433 (527) T protein:vir:10 360 KGIPDIAVGVVDAAV-AESGI-----ALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVT 433 (527) T ss_pred hcCCeeeeccccCCc-CcHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceE Confidence 21111 2111 11222 23344556666555542 2222332211 11111111112222456 Q ss_pred EEE--echHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 424 PTI--STGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGM 501 (536) Q Consensus 424 v~~--vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~ 501 (536) +.| ..|....+.-.+ +....+ .--+....+++.+.++.|+ -..++|++++.+.+.++.-. T Consensus 434 ivf~p~lP~D~~avie~---v~tL~~----------aGiiS~etAv~~L~~~~g~-----eD~E~E~~~I~~era~~a~a 495 (527) T protein:vir:10 434 ITFRDPKPVNNEKRFAQ---LLELWE----------AGLIPAKKLTEELSKIMGF-----ELTEEDFRQATEDKKTQGIA 495 (527) T ss_pred EEecccCCCCHHHHHHH---HHHHHH----------cCchhHHHHHHHHHhccCC-----CchHHHHHHHHHHHHHHhHH Confidence 655 344444332222 211111 1125667788888887775 24567777776666543332 Q ss_pred HHHHHHHHHHHHHhhhcCcchHHhhhh-cCCCCCC Q lcl|NC_011045. 
502 DNGAAALAQGMAAQATASPEAMAAAAD-SVGLQPG 535 (536) Q Consensus 502 ~~~a~~~~~~~~~~~~~~~~~~~~~~~-~~~~q~~ 535 (536) +..|. +...++++..+...-+..| +++.||- T Consensus 496 ~a~a~---~~~~a~~~~~~g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 496 QAEAA---DPFGAQMAAEQGIPDEEDDQALNGQPL 527 (527) T ss_pred hhhhc---CchhhhhccccCCCCCCcccccCCCCC Confidence 22222 2333344444333333344 6677777 No 96 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=98.75 E-value=6.4e-08 Score=60.02 Aligned_cols=437 Identities=7% Similarity=-0.007 Sum_probs=202.5 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccc-CCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLF-PKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFP 79 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~-~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 79 (536) |.. .+.++.+.+.+..+..+..+ .++++.+.+|..-.-- ........+...++..+-+...++..++.|++- | T Consensus 19 ~~~-~~~~~~~~i~~~i~~~~~~~---~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~--p 92 (470) T protein:vir:99 19 FPK-GEKLTSNELLGFIAYNETVL---KPRYRENMKLYLGKHKILTAPEKETGADNRIVVNSAKYVVDVYNGYFCGI--E 92 (470) T ss_pred eCC-CCCcCHHHHHHHHHHHHHhh---HHHHHHHHHHhccccccccCcccccCCcceeecchHHHHHHHHhhhhccC--C Confidence 665 44577777777766655544 3455566666643210 000111112233555566777777666655321 2 Q ss_pred CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEE Q lcl|NC_011045. 80 MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMK 159 (536) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~ 159 (536) + ++...+. ..... .+.+.+.+++|.....++.++..++|.+.+++..+.. +.+++. 
T Consensus 93 --~--~~~~~~d--------~~~~~-----------~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~d-g~~~i~ 148 (470) T protein:vir:99 93 --P--KLALLND--------SSKID-----------EIARWNRQENFFDTINEISKQCDIFGRSIASIYQGED-ARPHLM 148 (470) T ss_pred --e--eEeeCCc--------hhHHH-----------HHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCC-CeEEEE Confidence 1 1222211 00011 2233455689999999999999999999887765554 345788 Q ss_pred EEecceEEEeeCCCCC--eEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCc-c Q lcl|NC_011045. 160 LYRLSSYVVQRDAFGN--VLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGM-E 236 (536) Q Consensus 160 ~~~l~~~~v~~d~~G~--v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~-~ 236 (536) .++..++++..|..+. +...+|.++.. ........+++|+ . + .+..|...++. . T Consensus 149 ~~~p~~~~~i~d~~~~~~~~~~vr~~~~~----------------~~~~~~~~~~~~~----~--~-~~~~~~~~~~~~~ 205 (470) T protein:vir:99 149 YSSPNHAFIIYDDTVQRQPLAFVHYQIDN----------------SNNWTDAYGVIQY----A--D-KFYKFKGYDIEED 205 (470) T ss_pred EEccceeEEEEcCCCCcceEEEEEEEEEe----------------cCCeeEEEEEEEe----c--C-eEEEEEecccccc Confidence 8898888888876543 33333333321 0001111122221 1 1 11122221111 1 Q ss_pred c-cccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchh---hhc Q lcl|NC_011045. 237 V-QGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPR---RLT 312 (536) Q Consensus 237 i-~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~---~~~ 312 (536) . ......+++..+|++.++- +.+|+|-.+..++.+..++.+.-......+....|.+.+.-.+....+ ... 
T Consensus 206 ~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~~~ 280 (470) T protein:vir:99 206 TNAAGYAINPYGLVPAVEFFE-----NEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEGNPKF 280 (470) T ss_pred cccccccccCCCccceEeecC-----CCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccccchhh Confidence 1 1223346677889887643 468999999999999999999999999999999998877522111100 011 Q ss_pred cCC-Ccce-ecCCc--ccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhcccCCCCCCCHHHHHHHHHHHHHHhhh Q lcl|NC_011045. 313 KAQ-TGDF-VTGRP--EDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRTGERVTAEEIRYVASELEDTLGG 387 (536) Q Consensus 313 ~~~-~g~~-~~g~~--~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~~~r~TAtEi~~r~~E~~~~LG~ 387 (536) ... .+.+ +++.. .+..+..+....+.......++.+.+.|-..-.. +......+...|+..+..+..-+.. ... T Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~-k~~ 359 (470) T protein:vir:99 281 DFKNNRVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKN-KAD 359 (470) T ss_pred hhhhcceeeecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHH-HHH Confidence 111 1111 12111 1112223333445566666777776666432111 1111111234566665543222222 222 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHH--HHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHH Q lcl|NC_011045. 388 VYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLE--AIGRGQDLDKLERCVAAWAALAPMRDDPDINLAM 465 (536) Q Consensus 388 v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La--~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~ 465 (536) ...+.-.+.+.-+++-++.++...+.. +.....+++.|.-++. .++.++.+.++ . ++ |.... 
T Consensus 360 ~~~~~~~~~l~~~~~li~~~~~~~~~~-~~~~~~i~v~f~~~~p~~~~e~a~~~~kl---~----gi--------is~et 423 (470) T protein:vir:99 360 SKERKFDKSLMQLYRIVLATLFNNKQD-QELWSELDFKFTRNLPEDMASAIDNAKNA---E----GI--------VSKKT 423 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCc-ccccccceEEeCCCCCcCHHHHHHHHHHH---h----cc--------CCHHH Confidence 222223333333333333444333332 2233456777754433 23333222222 1 11 11222 Q ss_pred HHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCC Q lcl|NC_011045. 466 IKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQ 533 (536) Q Consensus 466 ~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q 533 (536) ++..+ -+| -.++|++++.+++...++..++.. .........| .++-| T Consensus 424 ~l~~l---~~v------d~~~E~eri~~E~~~~~~~~~~~~----~~~d~~~~d~--------~~ee~ 470 (470) T protein:vir:99 424 QLGMI---PDI------EPDAEMKQIAKEKADAIKQTQQLS----MPIDILKRDN--------NAEEE 470 (470) T ss_pred HHHhC---CCC------CHHHHHHHHHHHHHHHHHHHHhhc----CCCCcCCCCC--------CccCC Confidence 22221 122 234677666555443222221110 0101111111 11112 No 97 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=98.75 E-value=6.7e-08 Score=59.91 Aligned_cols=416 Identities=12% Similarity=0.051 Sum_probs=190.0 Q ss_pred HHHHHHHHhhhHHHHHHHHHHHhccc---ccCCC-CCcccccccccccchHHHHHHHHHHHHHHhhcCCCcceeccCChh Q lcl|NC_011045. 16 VYERLKNDRAPYETRAQNCAQYTIPS---LFPKD-SDNASTDYVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEY 91 (536) Q Consensus 16 r~~~l~~~R~~~e~~w~e~~~~~~P~---~~~~~-~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~ 91 (536) ......++| .++|+.+.+|..-. ..... .....+...++..+-+...+++.++.|++- |. ++...+. 
T Consensus 1 ~~~~~~~~~---~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~--~~----~~~~~~~ 71 (440) T protein:vir:95 1 MLAAFLGSQ---KQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGN--PV----SIGVMEG 71 (440) T ss_pred ChhhHHHHH---HHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheecc--Cc----eEeeCCC Confidence 223333333 34455555555321 11111 111122234566666777777766555221 21 1222221 Q ss_pred hhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeC Q lcl|NC_011045. 92 EAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRD 171 (536) Q Consensus 92 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d 171 (536) .. ++.+ ..+.+.+..++|.....++.++..+||.+.+++..+..+ .++++.++..+.++..| T Consensus 72 ~~-------------~~~~----~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~-~~~i~~~~p~~~~~~~d 133 (440) T protein:vir:95 72 GS-------------ADQL----STIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDK-VDRVVLISPLEMFVIRD 133 (440) T ss_pred cc-------------HHHH----HHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCC-ceEEEEEcccceEEEEc Confidence 11 1111 123345556899999999999999999999887765543 35688889888888888 Q ss_pred CCC--CeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEe---cCcccccccccccc Q lcl|NC_011045. 172 AFG--NVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEV---EGMEVQGSDGTYPK 246 (536) Q Consensus 172 ~~G--~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v---~g~~i~~~~~~~~~ 246 (536) +.+ ++...+|.+... ....++||+. ..+..|... ++.......-.+++ T Consensus 134 ~~~~~~~~~~i~~~~~~--------------------~~~~~~vyt~-------~~~~~~~~~~~~~~~~~~~~~~~~~~ 186 (440) T protein:vir:95 134 LTVEQNIIAAVHLPIYA--------------------DKVNMTVYTK-------DKVITYKPYSNNSVRLVVDDVKKHSY 186 (440) T ss_pred CCCCCceEEEEEEEEec--------------------CceEEEEEeC-------CeEEEEEEecCCccceeecceeeccC Confidence 654 455555443211 0112333321 111111111 11111112233557 Q ss_pred ccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccc---cccchhhhccCCC-cce-ec Q lcl|NC_011045. 
247 EACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPA---GITQPRRLTKAQT-GDF-VT 321 (536) Q Consensus 247 ~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~---g~~~~~~~~~~~~-g~~-~~ 321 (536) ..+|++.++. +.+|.|=.+...+.+..+|.+.-......+....|.+++.-. .....++..+... +.+ .+ T Consensus 187 g~vPvv~~~n-----~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~ 261 (440) T protein:vir:95 187 NDVPVVEWWN-----NRFRMGDYESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFLK 261 (440) T ss_pred ceeeEEEeeC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccchhhhhhccceecc Confidence 7899987754 457999999999999999999999999999999997665311 1111221111111 111 10 Q ss_pred ------CCcccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHH Q lcl|NC_011045. 322 ------GRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQ 394 (536) Q Consensus 322 ------g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~ 394 (536) +...+..+..+....+.......++.++..|...-.. +.....-+...|+..+..+..-+... .++.+. T Consensus 262 ~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k----~~~k~~ 337 (440) T protein:vir:95 262 TGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQV----RKDKET 337 (440) T ss_pred cccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHH----HHHHHH Confidence 0111112223333445666777777777766432211 10100112345666544332211111 112111 Q ss_pred HHHHHHHHHHHHHHHh---cCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHH Q lcl|NC_011045. 395 ELQLPLVRVLLKQLQA---TQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIA 471 (536) Q Consensus 395 E~l~Pli~r~~~il~~---~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a 471 (536) . +..-+.+++.++.+ ...-.......+++.|.-++..- ..+.++.+... +++ |....++.. 
T Consensus 338 ~-~~~~l~~~~~li~~~~~~~~~~~~~~~~v~i~f~~~~p~~-~~~~ad~~~kl----~g~--------iS~et~~~~-- 401 (440) T protein:vir:95 338 Y-FTKALRRRYELISNIHKAINGPVIEANKLTFTFHPNIPQD-VWTEIKAYIEA----GGE--------ISQETLMEN-- 401 (440) T ss_pred H-HHHHHHHHHHHHHHHHhhcCCcccccccceEEeCCCCCCC-HHHHHHHHHHH----hcc--------CcHHHHHHh-- Confidence 1 11222333333222 11112344455777775554322 11222222211 111 222333332 Q ss_pred HHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcc Q lcl|NC_011045. 472 NAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPE 521 (536) Q Consensus 472 ~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~ 521 (536) ++ ++-.++|++++.++++..+...+ +. .+.....+..++ T Consensus 402 --l~-----~~d~~~E~~ri~~E~~~~~~~~~---~~-~~~~~~~~~~~e 440 (440) T protein:vir:95 402 --AS-----FTDYKTEHSRILKQGGSSDLEIG---QI-VGDADVGQADTE 440 (440) T ss_pred --CC-----CCCcHHHHHHHHHHHHHhhhhHH---hh-ccCCCCCCcCCC Confidence 22 12235666666554432111111 00 011112222222 No 98 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=98.74 E-value=7.4e-08 Score=59.69 Aligned_cols=437 Identities=10% Similarity=-0.008 Sum_probs=195.9 Q ss_pred CCC-------------------ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc-----ccCCCCCc----ccc Q lcl|NC_011045. 1 MAE-------------------KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPS-----LFPKDSDN----AST 52 (536) Q Consensus 1 Ma~-------------------~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~-----~~~~~~~~----~~~ 52 (536) |.+ .......+.+.+..+..+.+ ..+..++.+|..-. +-...... 
..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~----~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:96 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQK----LKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTK 76 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHH----HHHHHHHHHHhcccCccccccchhhhcccccccc Confidence 221 11122234444444444432 33444555554321 11100000 011 Q ss_pred cccccccchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHH Q lcl|NC_011045. 53 DYVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFE 132 (536) Q Consensus 53 ~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~ 132 (536) ...++..+-+...++..++.|++ -|. +++..+.. ....++.|+ .++|.....+ T Consensus 77 ~~~ki~~n~~k~Iv~~~~~yl~g--~p~----~~~~~~~~---------~~~~l~~~~------------~n~~~~~~~~ 129 (474) T protein:vir:96 77 PDWRITTNFHQNLVDQKVSYVAG--KPV----TYAHDDDK---------VLDVIHQVL------------DTRWDNKLID 129 (474) T ss_pred cccccccchHHHHHHhhhhhhcc--cCc----eeccCChH---------HHHHHHHHH------------hccHHHHHHH Confidence 12345556666666666665543 121 23333221 112233333 3689999999 Q ss_pred HHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCce Q lcl|NC_011045. 133 ALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADET 210 (536) Q Consensus 133 ~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~ 210 (536) +.++..++|.|.+++..+.. +.+++..++..++++..|. .+++...+|.++.. .... T Consensus 130 l~~~~~~~G~~~~~~~~d~~-~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~--------------------~~~~ 188 (474) T protein:vir:96 130 ILTAASNKGIDWLQVYINED-GELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFN--------------------GETK 188 (474) T ss_pred HHHHHhhCCeEEEEeeeCCC-CceEEEEEcccceEEEEcCCCCCceEEEEEEEeec--------------------CeeE Confidence 99999999999987766554 4467888888888877664 47777766665421 0122 Q ss_pred EEEEEEE---EecCCCCceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 
211 IDVYTHI---YLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIV 287 (536) Q Consensus 211 ~~v~~~v---~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~ 287 (536) +++|+.. +...+++.........+.........+++..+|++.++. +.+|.|-.+..++.+-.++.+.-... T Consensus 189 ~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~~~~d~e~v~~liDa~d~~~S~~~ 263 (474) T protein:vir:96 189 VEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKN-----NPEEVSDIWMYKSFVDAIDKRLSDVQ 263 (474) T ss_pred EEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHH Confidence 3443321 001112221111111111122223345678899887754 46799999999999999999999999 Q ss_pred HHHHHHhCCceeeccccccchhhhcc-CC-CcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhcccC Q lcl|NC_011045. 288 KMSMISSKVIGLVNPAGITQPRRLTK-AQ-TGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQR 364 (536) Q Consensus 288 ~~~~~a~~p~~lv~~~g~~~~~~~~~-~~-~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~ 364 (536) ...+....|.+.+.--+..+..++.. .. .+.+.....+++.. +....+.......++.++..|-..-.. +..... T Consensus 264 ~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~--l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~ 341 (474) T protein:vir:96 264 NMFDESVELIYILRGYEGEDLSEFMEGLKYYKAINVSSDGGVET--IQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDK 341 (474) T ss_pred HHHHHhhcchhhhcCCCcccccchhhhhhccceeeccCCCceeE--EeccCCHHHHHHHHHHHHHHHHHHhCCcCccccc Confidence 99999999987654221222122111 11 12222233344443 333446677777788877776543211 111111 Q ss_pred CCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCCcceEEEEechHHHHHHHHHHHHHH Q lcl|NC_011045. 365 TGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-QQIPELPKEAVEPTISTGLEAIGRGQDLDKLE 443 (536) Q Consensus 365 ~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~ 443 (536) .+...|+..+..+-. ...+......+ .+...+.+++.++.+. |.- .....++++|.-.+..-. 
.+.++.+ T Consensus 342 ~~~n~Sg~Alk~~~~-~l~~k~~~~~~----~~~~~l~~~~~~i~~~~g~~--~d~~~i~i~f~~~~p~~~-~e~a~~~- 412 (474) T protein:vir:96 342 FGSATSGIALKFLYT-NLNLKANKLKN----KANVALQELMQFILDFNKIK--LDAKEIEITFNFNVMVND-LEQSQIG- 412 (474) T ss_pred cccccHHHHHHHHHH-HHHHHHHHHHH----HHHHHHHHHHHHHHHHhCCC--cccceeeEEecCCCccCH-HHHHHHH- Confidence 122334443332211 11111111222 2333334444444332 222 223456676654433211 1111111 Q ss_pred HHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchH Q lcl|NC_011045. 444 RCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAM 523 (536) Q Consensus 444 ~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~ 523 (536) .+.+ .+.-..++.. ++. +--.++|++++.+++...++..+... .+.. . ....+.. T Consensus 413 ------~~~g------iiS~et~~~~----lp~----v~D~~~E~eri~~E~~~~~~~~~~~~---~~~~-~-~~~~~~~ 467 (474) T protein:vir:96 413 ------AQSQ------YLSKETLVRH----HPW----VDDPKAELERLDEEQLELNKQLPNLD---DGGA-D-GAQQQQQ 467 (474) T ss_pred ------HHcC------CCChHHHHHh----CCC----CCCHHHHHHHHHHHHHHHHhhccccc---cccC-C-CCCCcCC Confidence 1111 1222223222 221 11235666666555443222111000 0000 0 0000011 Q ss_pred HhhhhcC Q lcl|NC_011045. 524 AAAADSV 530 (536) Q Consensus 524 ~~~~~~~ 530 (536) ..+.... T Consensus 468 ~~~~e~~ 474 (474) T protein:vir:96 468 SENNQSK 474 (474) T ss_pred CCccccC Confidence 1111111 No 99 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=98.74 E-value=7.4e-08 Score=59.69 Aligned_cols=437 Identities=10% Similarity=-0.008 Sum_probs=195.9 Q ss_pred CCC-------------------ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc-----ccCCCCCc----ccc Q lcl|NC_011045. 
1 MAE-------------------KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPS-----LFPKDSDN----AST 52 (536) Q Consensus 1 Ma~-------------------~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~-----~~~~~~~~----~~~ 52 (536) |.+ .......+.+.+..+..+.+ ..+..++.+|..-. +-...... ..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~----~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:95 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQK----LKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTK 76 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHH----HHHHHHHHHHhcccCccccccchhhhcccccccc Confidence 221 11122234444444444432 33444555554321 11100000 011 Q ss_pred cccccccchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHH Q lcl|NC_011045. 53 DYVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFE 132 (536) Q Consensus 53 ~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~ 132 (536) ...++..+-+...++..++.|++ -|. +++..+.. ....++.|+ .++|.....+ T Consensus 77 ~~~ki~~n~~k~Iv~~~~~yl~g--~p~----~~~~~~~~---------~~~~l~~~~------------~n~~~~~~~~ 129 (474) T protein:vir:95 77 PDWRITTNFHQNLVDQKVSYVAG--KPV----TYAHDDDK---------VLDVIHQVL------------DTRWDNKLID 129 (474) T ss_pred cccccccchHHHHHHhhhhhhcc--cCc----eeccCChH---------HHHHHHHHH------------hccHHHHHHH Confidence 12345556666666666665543 121 23333221 112233333 3689999999 Q ss_pred HHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCce Q lcl|NC_011045. 133 ALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADET 210 (536) Q Consensus 133 ~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~ 210 (536) +.++..++|.|.+++..+.. +.+++..++..++++..|. .+++...+|.++.. .... 
T Consensus 130 l~~~~~~~G~~~~~~~~d~~-~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~--------------------~~~~ 188 (474) T protein:vir:95 130 ILTAASNKGIDWLQVYINED-GELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFN--------------------GETK 188 (474) T ss_pred HHHHHhhCCeEEEEeeeCCC-CceEEEEEcccceEEEEcCCCCCceEEEEEEEeec--------------------CeeE Confidence 99999999999987766554 4467888888888877664 47777766665421 0122 Q ss_pred EEEEEEE---EecCCCCceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 211 IDVYTHI---YLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIV 287 (536) Q Consensus 211 ~~v~~~v---~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~ 287 (536) +++|+.. +...+++.........+.........+++..+|++.++. +.+|.|-.+..++.+-.++.+.-... T Consensus 189 ~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~~~~d~e~v~~liDa~d~~~S~~~ 263 (474) T protein:vir:95 189 VEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKN-----NPEEVSDIWMYKSFVDAIDKRLSDVQ 263 (474) T ss_pred EEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHH Confidence 3443321 001112221111111111122223345678899887754 46799999999999999999999999 Q ss_pred HHHHHHhCCceeeccccccchhhhcc-CC-CcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhcccC Q lcl|NC_011045. 288 KMSMISSKVIGLVNPAGITQPRRLTK-AQ-TGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQR 364 (536) Q Consensus 288 ~~~~~a~~p~~lv~~~g~~~~~~~~~-~~-~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~ 364 (536) ...+....|.+.+.--+..+..++.. .. .+.+.....+++.. +....+.......++.++..|-..-.. +..... 
T Consensus 264 ~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~--l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~ 341 (474) T protein:vir:95 264 NMFDESVELIYILRGYEGEDLSEFMEGLKYYKAINVSSDGGVET--IQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDK 341 (474) T ss_pred HHHHHhhcchhhhcCCCcccccchhhhhhccceeeccCCCceeE--EeccCCHHHHHHHHHHHHHHHHHHhCCcCccccc Confidence 99999999987654221222122111 11 12222233344443 333446677777788877776543211 111111 Q ss_pred CCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCCcceEEEEechHHHHHHHHHHHHHH Q lcl|NC_011045. 365 TGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-QQIPELPKEAVEPTISTGLEAIGRGQDLDKLE 443 (536) Q Consensus 365 ~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~ 443 (536) .+...|+..+..+-. ...+......+ .+...+.+++.++.+. |.- .....++++|.-.+..-. .+.++.+ T Consensus 342 ~~~n~Sg~Alk~~~~-~l~~k~~~~~~----~~~~~l~~~~~~i~~~~g~~--~d~~~i~i~f~~~~p~~~-~e~a~~~- 412 (474) T protein:vir:95 342 FGSATSGIALKFLYT-NLNLKANKLKN----KANVALQELMQFILDFNKIK--LDAKEIEITFNFNVMVND-LEQSQIG- 412 (474) T ss_pred cccccHHHHHHHHHH-HHHHHHHHHHH----HHHHHHHHHHHHHHHHhCCC--cccceeeEEecCCCccCH-HHHHHHH- Confidence 122334443332211 11111111222 2333334444444332 222 223456676654433211 1111111 Q ss_pred HHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchH Q lcl|NC_011045. 444 RCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAM 523 (536) Q Consensus 444 ~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~ 523 (536) .+.+ .+.-..++.. ++. +--.++|++++.+++...++..+... .+.. . ....+.. T Consensus 413 ------~~~g------iiS~et~~~~----lp~----v~D~~~E~eri~~E~~~~~~~~~~~~---~~~~-~-~~~~~~~ 467 (474) T protein:vir:95 413 ------AQSQ------YLSKETLVRH----HPW----VDDPKAELERLDEEQLELNKQLPNLD---DGGA-D-GAQQQQQ 467 (474) T ss_pred ------HHcC------CCChHHHHHh----CCC----CCCHHHHHHHHHHHHHHHHhhccccc---cccC-C-CCCCcCC Confidence 1111 1222223222 221 11235666666555443222111000 0000 0 0000011 Q ss_pred HhhhhcC Q lcl|NC_011045. 
524 AAAADSV 530 (536) Q Consensus 524 ~~~~~~~ 530 (536) ..+.... T Consensus 468 ~~~~e~~ 474 (474) T protein:vir:95 468 SENNQSK 474 (474) T ss_pred CCccccC Confidence 1111111 No 100 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=98.73 E-value=7.5e-08 Score=59.65 Aligned_cols=499 Identities=14% Similarity=0.108 Sum_probs=202.3 Q ss_pred CCCcc---ccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_011045. 1 MAEKR---TGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLAL 77 (536) Q Consensus 1 Ma~~~---~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536) |++.. -..+.+++.++|...-+.=..+...|++-.+.+-=.......+...+ .+.| |-|.|.++ +| T Consensus 1 m~~~~~~~~~~tpe~la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~~~~~--~~r~--------nl~~sni~-~i 69 (663) T protein:vir:34 1 MNESQPTDFADTPQGWAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERDSAHDA--ETRW--------NLFSTNIQ-TQ 69 (663) T ss_pred CCccccccchhcchhHHHHHHHHHHHHHhccchHHHHHHHHHHHhhccccCCCcc--cccc--------chhhhhHH-HH Confidence 98822 22234667778866544333344445444443322211111111111 1112 55555553 33 Q ss_pred cCC---C-c--ceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH--hccChHHHHHHHHHHHhhCcEEEEEe- Q lcl|NC_011045. 78 FPM---Q-T--WMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE--SNSYRVTLFEALKQLVVAGNVLLYLP- 148 (536) Q Consensus 78 tP~---~-~--Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~--~snf~~~~~~~~~dl~~~G~~~l~~~- 148 (536) .|+ + | =++=...|.. . 
.-.+...+.+||.+...|+ +.+|+..+..+..+.+..|-|++++- T Consensus 70 ~P~iYar~P~p~V~~rf~d~d-~---------~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Y 139 (663) T protein:vir:34 70 MASLYGQTPKVSVSRRFADAD-D---------DVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRY 139 (663) T ss_pred hhhhhcCCCcceeeecccCcc-c---------chhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEe Confidence 332 1 2 2211111110 0 0134455666776666664 47899999999999998887776532 Q ss_pred ---------------cCCCCc--------------eeeEEEEecceEEEeeC-CCCCeEEEEEeEeccHHHHHHHHhHHh Q lcl|NC_011045. 149 ---------------EPEGSN--------------YNPMKLYRLSSYVVQRD-AFGNVLQMVTRDQIAFGALPEDIRKAV 198 (536) Q Consensus 149 ---------------~~~~~~--------------~~~~~~~~l~~~~v~~d-~~G~v~~i~r~~~~t~~~l~~~~~~~~ 198 (536) ++.+.+ .+.+..|.-.+|.+..- .--.|+=|.++-.||-+++.+.|+.+. T Consensus 140 e~~~~~~~~~~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~dfl~~pAr~W~ev~wva~r~~mtk~e~~~rf~~~~ 219 (663) T protein:vir:34 140 EVEWEEVAGVDAILDEATGAELAAAVPPTQRKAYECVETDYLHWQDVLWSPARVWHEVRWLAFRNLLDMREFNARFDADG 219 (663) T ss_pred ecccchhccccccCCCccccchhcccccchhhcccceeeeeechhhcccchhhccccccceeeeccCCHHHHHHhhcCCh Confidence 111111 23444455455544332 112677788899999999999996544 Q ss_pred hhc--------cc--c------CCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccc-----cccCceEEEeee Q lcl|NC_011045. 199 EGQ--------GG--E------KKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYP-----KEACPYIPIRMV 257 (536) Q Consensus 199 ~~~--------~~--~------~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~-----~~~~P~~~~rw~ 257 (536) ... +. + .+...+..|+. || + ...+++|.-++|+.++...+..+ |=-||+...-.. 
T Consensus 220 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~VwE-IW-d--K~~~~V~w~~eg~~~~L~~~~p~lgl~~ffPcPrpl~~~~ 295 (663) T protein:vir:34 220 SRNLWASVPKVGKPKDGKDGQSCHPWDRAEVWE-IW-D--KGGRKVDWYVEGYSAVLDTQPDPLGLESFFPCPKPLLANW 295 (663) T ss_pred hhhhhhhccCcCCccccCCCCCcchhcCcceeE-EE-e--cCCcEEEEEEcCcceecccCCCCCCCCCCCCCccccccee Confidence 210 00 0 01112444443 33 2 23356666666655443332221 225688777666 Q ss_pred ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchh-hhccCCCcceecCC-------ccc--- Q lcl|NC_011045. 258 RLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPR-RLTKAQTGDFVTGR-------PED--- 326 (536) Q Consensus 258 ~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~-~~~~~~~g~~~~g~-------~~~--- 326 (536) ..++-+=+-..+ .+=.-++.||.++..+-. ...+++|.++++.+...... .+..+..+.++|.. .++ T Consensus 296 ~~ds~ipvpd~~-~y~~~~~E~n~~t~Rin~-l~d~ikv~gvy~~~~g~~i~~~l~~a~~n~lvpV~~~~~~~~~gg~~k 373 (663) T protein:vir:34 296 TTDKVVPRPDFV-LAQDLYKEIDLVSTRITL-LERAIRVVGVYDKSSGLTIGRLLSEAAQNDLIPVENWLTFADKGGLRG 373 (663) T ss_pred cCCCeecCCcHH-HHHHHHHHHHHHHHHHHH-HHhhhhhceeeccccchhHHHHHHHhhCCCceecchhhhhhhhcCccc Confidence 666544444455 788889999988876654 55689999999744332222 34445555555431 111 Q ss_pred -ccccccccccchhHHHHHHH---HHHHHHHHHHh-hhh--cccCCC--CCCCHHHHHHHHHHHHHHhhhhHHHHHHHHH Q lcl|NC_011045. 327 -ISFLQLEKQADFTVAKAVSD---AIEARLSFAFM-LNS--AVQRTG--ERVTAEEIRYVASELEDTLGGVYSILSQELQ 397 (536) Q Consensus 327 -~~~~~~~~~~~~~~~~~~i~---~~~~rI~~af~-~~~--~~~~~~--~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l 397 (536) +.-.| ++.+...|. +.+..|+...+ .++ -..|++ .+-||||-.... +.++.-+...+.|+. 
T Consensus 374 ~I~~~p------i~~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rga~~a~ETatAQ~IKs----q~gS~RIqe~qdevq 443 (663) T protein:vir:34 374 VVDWFP------LEPVVAALTSLRDYRRELVDALHQVTGMADIMRGASDPRETAMAQGVKA----KFGSIRLQRLQDEVA 443 (663) T ss_pred hhhccc------chhHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhcccCcchhhHHHHHHH----HHHhHHHHHHHHHHH Confidence 22222 222333333 33444554444 222 222332 233555433222 444444444444421 Q ss_pred ---HHHHHHHHHHHHh-----------cCCCCC----------CCC---cceEEEEe--chHH--HHHH----HHHHHHH Q lcl|NC_011045. 398 ---LPLVRVLLKQLQA-----------TQQIPE----------LPK---EAVEPTIS--TGLE--AIGR----GQDLDKL 442 (536) Q Consensus 398 ---~Pli~r~~~il~~-----------~g~lp~----------~~~---~~v~v~~v--s~La--~a~r----~~~~~~l 442 (536) .-++.-.-.+|-. .+.+|. |-. ..+++.+- |.+. .+.. ..-+..+ T Consensus 444 R~arDi~ql~AEIl~~~~~~etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~D~~~eK~~~~E~l~~i 523 (663) T protein:vir:34 444 RFASDIQRLKAEVIAEHYDVASILAQANAEFTFDKELAPKAAELIKSRFSMYRVEVKPEAVSLQDFAALRNEKMEVLSGI 523 (663) T ss_pred HHHHHHHHHHHHHHHHhcCHHHHHHHhcCCCCcccchhHHHHHHhcCCCcceeeeeccCCCCcCChHHHHHHHHHHHHHH Confidence 1122222222211 234442 101 22444443 3221 1112 2222222 Q ss_pred HHHHHHH---Hhhcchhhh--------------hcCCHHHHHHHHHHHcC-C--ChhhccCCHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 443 ERCVAAW---AALAPMRDD--------------PDINLAMIKLRIANAIG-I--DTSGILLTEEQKQQKMAQQSMQMGMD 502 (536) Q Consensus 443 ~~~~~~~---~~~~p~~~~--------------~~id~d~~~~~~a~~~G-v--~p~~i~rs~~ev~~~~~q~~~q~~~~ 502 (536) ..|++.+ ++..|+... ...+.+.+++.+.++.- + -|..=-..++.-+.....++++.+.. T Consensus 524 ~~~~qq~~pl~~q~p~~~p~l~Ellk~~~~~f~~~~qie~ai~~~~~~~e~aa~~~~~~~pa~~~~~~k~~~~q~k~q~~ 603 (663) T protein:vir:34 524 ASFMQGVAPLAQQVPGSAPFLLQMLKWSVSGLRGSSTIEGVLDKAIAAAEEAQKQAAQQSPAPQQPDPKVVAQAMKGQQE 603 (663) T ss_pred HHHHHHHHHHHHhhhhhHHHHHHHHHHHhhcCChhhhHHHHHHHHHhhhHHHhhccCCCCcccchhhHHHHHHHHHHHHH Confidence 3333322 233332100 00122222222222110 0 00000000000001110011111111 Q ss_pred HHHH-HHHHHHHH--hhhcCc---chHHhhhhcC--CCCCCC Q lcl|NC_011045. 
503 NGAA-ALAQGMAA--QATASP---EAMAAAADSV--GLQPGI 536 (536) Q Consensus 503 ~~a~-~~~~~~~~--~~~~~~---~~~~~~~~~~--~~q~~~ 536 (536) .+.+ .+++.-.. ++.... ..+.++..+. .-|=.| T Consensus 604 ~aeAq~e~q~~~~~~ql~~~~~~~k~~~~a~~~~~~a~q~~~ 645 (663) T protein:vir:34 604 MAKVQAEVQGDLLRIQAETQANETKERQQAEWNVREAAQKNL 645 (663) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH Confidence 0000 01111000 110000 1111111100 001111 No 101 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=98.69 E-value=1.1e-07 Score=58.83 Aligned_cols=449 Identities=11% Similarity=0.049 Sum_probs=187.1 Q ss_pred CC----Ccccc-----ccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccc-ccccccccchHHHHHHHHH Q lcl|NC_011045. 1 MA----EKRTG-----LAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS-TDYVTPWQAVGARGLNNLA 70 (536) Q Consensus 1 Ma----~~~~~-----~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~-~~~~~~~dst~~~a~~~La 70 (536) |. .+... .....-..+|.+.+..|.-|+..|.++..+. ..+. .+..++--+.+...|+.+| T Consensus 14 ~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~---------~~~~~~~~~~~slnl~~~i~~~~A 84 (522) T protein:vir:47 14 GRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKN---------TDGDIKSRPMNHLPIARTASKKIA 84 (522) T ss_pred HHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccc---------cCcchhcccceecchHHHHHHHHh Confidence 11 11100 0000012344444444444444444332111 1111 1111222244555555555 Q ss_pred HHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecC Q lcl|NC_011045. 71 SKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEP 150 (536) Q Consensus 71 a~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~ 150 (536) +- +|.-.+ .+++++. 
...++| .+.+..++|+..+.+++.+..+.|++++-+..+ T Consensus 85 ~l----v~~e~~--~i~v~d~-------------~~~~~l-------~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d 138 (522) T protein:vir:47 85 SL----VYNEQA--TITTKNE-------------ILQKFL-------DDMLTNDRFNKNFERYLESCLALGGLAMRPYID 138 (522) T ss_pred hh----hcCCcc--eeecCCh-------------HHHHHH-------HHHHhhcchHHHHHHHHHHhhccCCEEEEEEEc Confidence 44 342112 2223332 233344 345557999999999999999999988743333 Q ss_pred CCCceeeEEEEecceEEE-eeCCCCCeEEE-EEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEec-------C Q lcl|NC_011045. 151 EGSNYNPMKLYRLSSYVV-QRDAFGNVLQM-VTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLD-------E 221 (536) Q Consensus 151 ~~~~~~~~~~~~l~~~~v-~~d~~G~v~~i-~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~-------~ 221 (536) . +.+++..++...|+- ..|..|.+..+ |.+...+-.. ....+..++..+.+--. . T Consensus 139 ~--~~~~i~~v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~--------------~~~~yt~lE~he~~~~~~~~~~~~~ 202 (522) T protein:vir:47 139 G--DKVRVAFIQAPVFFPLESNTQDVSSAAILTKTIKSEGR--------------KNVYYTLVEFHEWVTADGQETGSTN 202 (522) T ss_pred C--CceEEEEEcCCceEEEEEcCCceEEEEEEEEEEeeccc--------------ceeEEEEEEEeeecccccccccccc Confidence 3 346788899988885 67777755443 3333321100 00011111111000000 0 Q ss_pred CCCc----eeEEEEec----Cccccccc----------cccccccCceE-E---Eeeee-cCCCccccchHHHHHHHHHH Q lcl|NC_011045. 222 DSGE----YIRYEEVE----GMEVQGSD----------GTYPKEACPYI-P---IRMVR-LDGESYGRSYIEEYLGDLRS 278 (536) Q Consensus 222 ~~~~----~~~~~~v~----g~~i~~~~----------~~~~~~~~P~~-~---~rw~~-~~ge~YGrgp~~~~l~d~~~ 278 (536) .++. |..|.+-. |.++.... -.++--..|.+ . +.++. ..++.||+|-...+.+.++. 
T Consensus 203 ~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~ 282 (522) T protein:vir:47 203 DKKYYRITNELYRSDVNDVLGQRVNLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDF 282 (522) T ss_pred cCCceEEEEEEeecCCCcccCccccccccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHH Confidence 0111 11222211 22221111 01101123422 2 23333 34789999999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCC-----------cceecCC--ccc-ccccccccccchhHHHHH Q lcl|NC_011045. 279 LENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQT-----------GDFVTGR--PED-ISFLQLEKQADFTVAKAV 344 (536) Q Consensus 279 L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~-----------g~~~~g~--~~~-~~~~~~~~~~~~~~~~~~ 344 (536) ||..--+...-....= -...|+.+ ++....-..+++ ..+.+.. .++ ..+..+...-+....... T Consensus 283 lD~~~s~~~~e~~~g~-~~i~v~~~-~l~~~~~~~~g~~~~~~~fd~~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~ 360 (522) T protein:vir:47 283 INRSYDEFMWEVRMGQ-RRVIVPEH-LTQRQYQRPDGTIDFRPRFDVEQNVYMQIGGSSMDAGGITDLTSPIRANDYILA 360 (522) T ss_pred HHHHHHHHHHHHHhcc-ceeecchH-HhccCCCCCCcccccccccCcccceEeecCCCCCCCCcceeeccccChHHHHHH Confidence 9987777666655433 34455333 222211111110 1122111 111 111112211122333344 Q ss_pred HHHHHHHHHHHH-h-hhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCC--CCCc Q lcl|NC_011045. 345 SDAIEARLSFAF-M-LNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPE--LPKE 420 (536) Q Consensus 345 i~~~~~rI~~af-~-~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~--~~~~ 420 (536) ++.+-+.|.... + ...+......-.|||||....+...+...-.-..+.. -|..|++-++.++.-.+.+-. +... 
T Consensus 361 ~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi~s~~~~~~~t~~~~~~~~~~-al~~lv~~i~~l~~~~~~~~~~~~~~~ 439 (522) T protein:vir:47 361 ISEGLKLFEMQIGVSSGMFTFDGQGMKTATEIVSENSDTYQMRSSIVALVEQ-SIKELCVSMCELGKAVGVYSGEIPELD 439 (522) T ss_pred HHHHHHHHHHHhCCCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhhhhccCCCCCcc Confidence 444444444321 1 1112222333469999999999999988776666554 356677777766643333221 2233 Q ss_pred ceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHH Q lcl|NC_011045. 421 AVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMG 500 (536) Q Consensus 421 ~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~ 500 (536) .+.|.+--++..-.. ..++...+.+ ++ + .+....++ ....|+ |++|.+++-++-++++ T Consensus 440 ~i~v~f~D~i~~D~~-~~~~~~~~~v---~a-G------~~s~e~~i---~~~~g~-------~eeea~~el~ri~~E~- 497 (522) T protein:vir:47 440 DISVNLDDGVFTDRH-AELDYWAKMV---AA-G------FSTKKRAI---GKTLNI-------SGVEAEKELNAINSEL- 497 (522) T ss_pred eeEEEcCCCCCCCHH-HHHHHHHHHH---hc-C------CCCHHHHH---HhcCCC-------ChHHHHHHHHHHHHhh- Confidence 466776554433221 2222222221 11 1 12222322 334555 4444433322221110 Q ss_pred HHHHHHHHHHHHHHhhhcCcchHHhhhhcCC Q lcl|NC_011045. 501 MDNGAAALAQGMAAQATASPEAMAAAADSVG 531 (536) Q Consensus 501 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~ 531 (536) ..+.... ..-..++.+ .+...|..| T Consensus 498 -~~~~~~~----~~~~~~~~~-~~~~~d~~~ 522 (522) T protein:vir:47 498 -LPMNDAE----LAIYGMHDQ-NEEKADDKG 522 (522) T ss_pred -ccCCCCC----CCCCCCCCc-ccccCCCCC Confidence 0000000 000011111 112223333 No 102 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=98.68 E-value=1.1e-07 Score=58.65 Aligned_cols=455 Identities=12% Similarity=0.096 Sum_probs=200.0 Q ss_pred CCC-ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcc-c----ccc-cccccchHHHHHHHHHHHH Q lcl|NC_011045. 
1 MAE-KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNA-S----TDY-VTPWQAVGARGLNNLASKL 73 (536) Q Consensus 1 Ma~-~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~-~----~~~-~~~~dst~~~a~~~Laa~l 73 (536) |++ +....+-..+..+|+..++--.- ...|++...-.||.....+.... . .++ .-.|-+.-.+.++.++ T Consensus 32 m~dV~~~hp~y~a~~~~W~~ird~~~G-~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~--- 107 (535) T protein:vir:80 32 LPNVGYQRVEFGEMLPKWRKIMDCLSG-QEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDGMM--- 107 (535) T ss_pred CCCCCcCCHHHHHHHHHHHHHHHHhcC-hHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHHHh--- Confidence 664 22222234455566555554332 35666666667777533322211 1 111 1234444455555444 Q ss_pred HHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCC Q lcl|NC_011045. 74 MLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) Q Consensus 74 ~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~ 153 (536) +.+|=..|.+.+ ++ .++.+++.|.. .-.+++.-+..++.+...+|-+.++||-+..+ T Consensus 108 -G~vfrk~p~~~~--p~--------------~l~~l~~d~D~------~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~ 164 (535) T protein:vir:80 108 -GQVFSRDPIRQL--PP--------------ALEAIVEDIDG------EGVSLDQQAKKALGYTMGFGRAAIFTDYPNVG 164 (535) T ss_pred -chhhcCCcceec--cH--------------HHHHHHhccCC------CCCCHHHHHHHHHHHHHhcCeEEEEEeecCCC Confidence 444411244432 21 24445555433 24578888999999999999999999855433 Q ss_pred cee------------eEEEEecceEE-Eee---CCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEE Q lcl|NC_011045. 154 NYN------------PMKLYRLSSYV-VQR---DAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHI 217 (536) Q Consensus 154 ~~~------------~~~~~~l~~~~-v~~---d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v 217 (536) ... .+..|+..+.. +.. |+.+++.-+..+++...+. ..|+ .+.++.|... 
T Consensus 165 ~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~d--d~f~------------~~~~~q~RvL 230 (535) T protein:vir:80 165 RPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQD--DGFE------------TTYVQQWRVL 230 (535) T ss_pred CcccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEecC--CCcc------------cceeEEEEEE Confidence 210 13444433322 222 2333455555566554221 2222 2344445445 Q ss_pred EecCCCCceeEEEE-ecC---------ccccccccccccccCceEEEeeeecCCCcc--ccchHHHHHHHHHHHHH---H Q lcl|NC_011045. 218 YLDEDSGEYIRYEE-VEG---------MEVQGSDGTYPKEACPYIPIRMVRLDGESY--GRSYIEEYLGDLRSLEN---L 282 (536) Q Consensus 218 ~p~~~~~~~~~~~~-v~g---------~~i~~~~~~~~~~~~P~~~~rw~~~~ge~Y--Grgp~~~~l~d~~~L~~---l 282 (536) .|..++ .|..+.+ .++ ..+.-.++.+ .+++|++.|.-..+..+ |..|.. |+..||. - T Consensus 231 ~~~~~G-~y~v~~~~~~~~~~~~~~~~~~~~~~~g~~---~l~~IPfv~~~~~~~~~~~~~pPLl----~LA~lni~Hy~ 302 (535) T protein:vir:80 231 QLNAEG-NYQVERWRRETQEEMYYSYSKHVPTDGNGN---PFKEIPFQFIGPLDNNADIDHPPLL----DLCEVNIGHYR 302 (535) T ss_pred EecCCc-eEEEEEEEeecCCccccccceeecccCCCc---ccCeeEEEEeecCCCCCCCCccchH----HHHHHHHHHhh Confidence 554333 3332221 111 1222223433 45667777765444444 444533 4444442 2 Q ss_pred HHHHHH-HHHHHhCCceeec-c-----ccccchhhhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHH Q lcl|NC_011045. 283 QEAIVK-MSMISSKVIGLVN-P-----AGITQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFA 355 (536) Q Consensus 283 ~~~~~~-~~~~a~~p~~lv~-~-----~g~~~~~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~a 355 (536) ..+-.+ .+..+..|.+.+. . +.......+.-|....+.-+..++...+++. +..+ ..+.++++++++.+. 
T Consensus 303 ~ssd~~~il~~~~~P~l~i~G~~~~~~~~~~~~~~i~iG~~~~~~lP~~~~~~~~e~~-~~~~--a~~~l~~~e~qM~~l 379 (535) T protein:vir:80 303 NSADYEEMAFVAGQPTAFFTGLTKDWVEDVFKDFKVHLGSRAIIPLPQGATAGILQIT-PNSV--PFEAMTHKESQMIAM 379 (535) T ss_pred chhHHHHHHHHhcCceeeeecCchhhhhcCCCCcceEecCcccccCCCCCCcceeeec-cchh--HHHHHHHHHHHHHHH Confidence 233333 4455555554432 0 1111111222222233322223333344332 2222 245677777777652 Q ss_pred HhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCCcceEEEEechHHHH- Q lcl|NC_011045. 356 FMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-QQIPELPKEAVEPTISTGLEAI- 433 (536) Q Consensus 356 f~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-g~lp~~~~~~v~v~~vs~La~a- 433 (536) .-.+........||+|.+.........|+.+..++++-+ .++|.++.+. |. .+.++.++|++-.-.... T Consensus 380 --Ga~ll~~~~~~~Ta~~a~~~~~~~~S~L~~~a~~le~al-----~~aL~~~A~w~G~--~~~~~~~~i~~n~dF~~~~ 450 (535) T protein:vir:80 380 --GANLLVKSGGNRTFGEAQQEEASEQSILSACTKNVSMAF-----RKALRWANQFQTG--IVNDETVEYNLNTDFPAAR 450 (535) T ss_pred --HHHhhccCcccccHHHHHHHHHHHhHHHHHHHHHHHHHH-----HHHHHHHHHHcCC--ccCCCceEEEecccccccc Confidence 111222334457999999998888888988888877663 3344444332 22 233444555433221111 Q ss_pred HHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 434 GRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMA 513 (536) Q Consensus 434 ~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~ 513 (536) -..++++.++...+ + -.|..+.+..++ ...||... -+..++|...+..+.+. .. ..++.. T Consensus 451 ld~~~~~all~~~~---~-------G~Is~et~~~~L-~r~gvl~~-~~~~eee~~ri~~E~~~----~~----~~~g~~ 510 (535) T protein:vir:80 451 LTPNERAELILEWQ---Q-------GAITFKEMRAGL-RRAGVASE-DDAKAETEGKATVEFIA----KT----AAAGKV 510 (535) T ss_pred CCHHHHHHHHHHHh---c-------CCCCHHHHHHHH-HhCCCCCc-ccchHHHHHHHHhhhhh----cc----ccCCCC Confidence 11223333333222 1 125556666665 44576322 22333433333222111 00 011111 Q ss_pred Hhh--hcCcchHHhhhhcCCCCCCC Q lcl|NC_011045. 
514 AQA--TASPEAMAAAADSVGLQPGI 536 (536) Q Consensus 514 ~~~--~~~~~~~~~~~~~~~~q~~~ 536 (536) ... ++.+.....-.+.++-|-|- T Consensus 511 ~d~~~~g~~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:80 511 GDAASGGTNKAKLNNGNGGGNQAGN 535 (535) T ss_pred CCCCCCCCCcCcccCCccccccCCC Confidence 111 11111111112222333333 No 103 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=98.63 E-value=1.7e-07 Score=57.76 Aligned_cols=442 Identities=11% Similarity=0.057 Sum_probs=194.8 Q ss_pred CCC-------ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc---ccCCCCCcccccccccccchHHHHHHHHH Q lcl|NC_011045. 1 MAE-------KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPS---LFPKDSDNASTDYVTPWQAVGARGLNNLA 70 (536) Q Consensus 1 Ma~-------~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~---~~~~~~~~~~~~~~~~~dst~~~a~~~La 70 (536) |++ +...++.+.+.+..++.+..|.+ +++.+.+|..-. ..............++..+-+...++..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~---r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~iv~~~~ 77 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQLKNYISRFKAEQLE---RLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYITVFEQ 77 (489) T ss_pred CCccceeeeCCCCCCCHHHHHHHHHHHHHHHHH---HHHHHHHHhcccCccccccccccccCCcceeecchHHHHHHHHh Confidence 544 23445667787887777766644 445555554321 11111111111223566677777777777 Q ss_pred HHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEe-- Q lcl|NC_011045. 71 SKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLP-- 148 (536) Q Consensus 71 a~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~-- 148 (536) +.|.+ -| + +++..+. .+.++|.. .+..++|.....++.++..++|.+.+++. 
T Consensus 78 ~~l~g--~~--~--~~~~~d~-------------~~~~~l~~-------~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~ 131 (489) T protein:vir:99 78 GYMLG--VP--V--EYKNENK-------------DLQAAIDL-------MSVRNNEDYHNVKIKTDLSIYGRAYELLTVE 131 (489) T ss_pred hhhcc--CC--c--eeecCCh-------------hHHHHHHH-------HHhhcChhHHHHHHHHHHhhCCeEEEEEeec Confidence 65542 12 1 2233322 13333333 34457898999999999999999976543 Q ss_pred c-CCCCceeeEEEEecceEEEeeCCC--CCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCc Q lcl|NC_011045. 149 E-PEGSNYNPMKLYRLSSYVVQRDAF--GNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGE 225 (536) Q Consensus 149 ~-~~~~~~~~~~~~~l~~~~v~~d~~--G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~ 225 (536) + .++.+.+++..++..+++...|.. +++...+|.+.... ........+++|+. + . T Consensus 132 ~~~d~~~~~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~---------------~~~~~~~~~~~y~~------~-~ 189 (489) T protein:vir:99 132 KIDDKKTEVKLYQLPAEQTFVIYDDTYQRNSLMAVHFYDIDY---------------GSGKRKQIIKAYTS------D-T 189 (489) T ss_pred cCcCCCcceEEEEEcccceEEEEcCCCCCceEEEEEEEEEec---------------CCCceEEEEEEEeC------C-c Confidence 2 133445678889988877777753 34555554443210 00111122333321 1 1 Q ss_pred eeEEE----EecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec Q lcl|NC_011045. 226 YIRYE----EVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN 301 (536) Q Consensus 226 ~~~~~----~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~ 301 (536) ...|. ...+..+. ....+++..+|++.++. ...|+|-.+...+-+..++.+.-.....+.....|.+.+. 
T Consensus 190 i~~~~~~~~~~~~~~~~-~~~~~~~g~vPvv~~~n-----~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~ 263 (489) T protein:vir:99 190 IYTYEDYNLETKGMRLK-DYEGHFFKGVPVNEYAN-----NEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIA 263 (489) T ss_pred EEEEEecCCCcccceec-ccccccCCceeEEEeec-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhc Confidence 11111 11222222 23345577899988764 3578999999999999999999999999988888886653 Q ss_pred ccccc-----chhhhccCCC------------cceecCCcc------cccccccccccchhHHHHHHHHHHHHHHHHHh- Q lcl|NC_011045. 302 PAGIT-----QPRRLTKAQT------------GDFVTGRPE------DISFLQLEKQADFTVAKAVSDAIEARLSFAFM- 357 (536) Q Consensus 302 ~~g~~-----~~~~~~~~~~------------g~~~~g~~~------~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~- 357 (536) -.... ...+...... +.++....+ +..+..+....+.......++.+.+.|-..-. T Consensus 264 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~ 343 (489) T protein:vir:99 264 GNAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFT 343 (489) T ss_pred cCCcccccchhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCC Confidence 11110 0000000000 111110000 11112233333445555566666555532111 Q ss_pred hhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCC--CCCCCcceEEEEechHH--HH Q lcl|NC_011045. 358 LNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQI--PELPKEAVEPTISTGLE--AI 433 (536) Q Consensus 358 ~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~l--p~~~~~~v~v~~vs~La--~a 433 (536) .+......+...|+..+..+...+... .....+.-.+.+.-+++-++.++...+.- ....-.+++|.|.-++. 
.+ T Consensus 344 p~~~~~~~~~n~Sg~Al~~~~~~l~~k-~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~ 422 (489) T protein:vir:99 344 PDTQDMKFSGVQSGESMKYKLMASDNY-REKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDN 422 (489) T ss_pred cccccccccccchHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHH Confidence 111100111234555544332111111 12222222233333333333333222211 11212346676643333 23 Q ss_pred HHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 434 GRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMA 513 (536) Q Consensus 434 ~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~ 513 (536) +.. +.+.... .+ +....++..+ -+|+.. ..++|++++++++...++.. T Consensus 423 ~~~---~~~~kl~----gi--------is~et~~~~l---~~v~~~---d~~~E~~ri~~E~~~~~~~~----------- 470 (489) T protein:vir:99 423 EIV---TAAQNLY----GI--------VSDQTIFEIL---NTVTGV---DAEAELKRLKEEADKKQSLP----------- 470 (489) T ss_pred HHH---HHHHHHh----cc--------CCHHHHHHhc---CCCCch---hHHHHHHHHHHHHHHHhccc----------- Confidence 222 2222211 11 2223333321 223211 23345555544433222111 Q ss_pred HhhhcCcchHHhhhhcCCCCC Q lcl|NC_011045. 514 AQATASPEAMAAAADSVGLQP 534 (536) Q Consensus 514 ~~~~~~~~~~~~~~~~~~~q~ 534 (536) +....+....+-.... -+| T Consensus 471 -~~~~~~~~~~~~~~~~-~~p 489 (489) T protein:vir:99 471 -EPRLVGDASGQEEPTA-EKP 489 (489) T ss_pred -cccccCCCCCCcCCCC-CCC Confidence 1111111111111111 134 No 104 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=98.58 E-value=2.5e-07 Score=56.79 Aligned_cols=486 Identities=11% Similarity=0.042 Sum_probs=222.6 Q ss_pred CCCc-ccccc-HH---HHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCC-CcccccccccccchHHHHHHHHHHHHH Q lcl|NC_011045. 
1 MAEK-RTGLA-EE---GAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDS-DNASTDYVTPWQAVGARGLNNLASKLM 74 (536) Q Consensus 1 Ma~~-~~~~~-~~---~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~-~~~~~~~~~~~dst~~~a~~~Laa~l~ 74 (536) |+.. ++.+- ++ ...+-|=...++| -..+++.+.+|..-.-..-.. -.+. +...++++.|.+-|+++++-|. T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~~~D~~--RlaaY~ly~d~y~n~~~el~~il~G~-dr~~~~~ps~r~~V~~~~~~Lg 77 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVDENDKN--RVRAYDLYENIYLNSAETLKLVLRGD-DSVPILMPSGRKIVEAVHRFLG 77 (563) T ss_pred CCccccccCCCcccccccccccCCHHHHH--HHHHHHHHHHhhcCchhhhhhhcCCC-ceeeeccchHHHHHHHHHHhcC Confidence 6552 21111 11 1111111111111 234444555554322110000 0111 2445688889888888664442 Q ss_pred HhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCC-- Q lcl|NC_011045. 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEG-- 152 (536) Q Consensus 75 ~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~-- 152 (536) ..+ .|+ ++...- +.. .++ .+++.+.....+-|+.....++-.+.++.|-|++++-.+.+ T Consensus 78 ~~~----~~~---Ve~~~~-----de~----~~~---avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~ 138 (563) T protein:vir:74 78 VGF----DYL---VEPDMG-----DEG----IRQ---SLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKK 138 (563) T ss_pred CCc----EEe---cCcccc-----Ccc----hHH---HHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccc Confidence 222 232 221110 111 111 14666667778899999999999999999999998876643 Q ss_pred -CceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhh-c-c-ccCCCCceEEEEEEEEecCC------ Q lcl|NC_011045. 153 -SNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEG-Q-G-GEKKADETIDVYTHIYLDED------ 222 (536) Q Consensus 153 -~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~-~-~-~~~~~~~~~~v~~~v~p~~~------ 222 (536) ++.++.+.|-.+.|+-..|++ .|-.+|-..-.....++++..+.+-+ . + ...+++. 
...+|+..+.+ T Consensus 139 ~g~R~rv~~vDP~~~fp~~dpd-~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg--~~~~~~~~dae~w~lg~ 215 (563) T protein:vir:74 139 AGERISVDEVDPRQIFLIEDGS-TVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEG--MFTGRISSELTHWTLGN 215 (563) T ss_pred cCCCceEeecCCceeeeccCCC-CcccceeeecccCCCCCcchhccceeeeeeeeeeCCCC--Cccceeeeccchhcccc Confidence 345677777778888877774 45555533222222233333322110 0 0 0011111 11112211100 Q ss_pred -CCce---eEE-EEecCccccc---ccc--ccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 223 -SGEY---IRY-EEVEGMEVQG---SDG--TYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMI 292 (536) Q Consensus 223 -~~~~---~~~-~~v~g~~i~~---~~~--~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~ 292 (536) +++- ..+ .+.++....+ ++. --+..-.||++++=...++++||+|-..+.+.-++.||.-....-..+.. T Consensus 216 wd~r~~~~~~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~ 295 (563) T protein:vir:74 216 WDDRGAISDEQARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVF 295 (563) T ss_pred ccccCccchhhhcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHh Confidence 0000 001 1112211111 111 00123467877666677899999999999999999999877766666666 Q ss_pred HhCCceeeccccccchh----hh--ccCCCcceec--CCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccC Q lcl|NC_011045. 293 SSKVIGLVNPAGITQPR----RL--TKAQTGDFVT--GRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQR 364 (536) Q Consensus 293 a~~p~~lv~~~g~~~~~----~~--~~~~~g~~~~--g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~ 364 (536) .=.|+...... ...+ +. .+-++|.++. 
++..+--+..+...++++.++..+.++..| +.+..+-.++ T Consensus 296 tG~pi~vl~~~--~p~d~~~g~~~~w~vgpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~er---al~~~s~tPa 370 (563) T protein:vir:74 296 QGLGMYVTNAS--APVDPNTGELTDWNIGPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEK---GIAEGSGTPE 370 (563) T ss_pred cCCCeEEeccc--cccccccccccccccCCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHH---HHHhhccCcc Confidence 65676554321 1111 10 1124666542 222222344566667888888888877653 2222111111 Q ss_pred ------CCCCC---CHHH-----HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCC------CCCcc-eE Q lcl|NC_011045. 365 ------TGERV---TAEE-----IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPE------LPKEA-VE 423 (536) Q Consensus 365 ------~~~r~---TAtE-----i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~------~~~~~-v~ 423 (536) +..+. .|=| +..+.+|++..|=.++.++-.++..=++ ..+.-++..|..|. +|... |. T Consensus 371 vA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL-~~~erl~~~g~~~~~~g~~~~~~~~~v~ 449 (563) T protein:vir:74 371 VAIGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWL-PAYESDFQEQDGSRPFASADLLNECSVV 449 (563) T ss_pred eeecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHH-HHHHhHhhhhcccccccccccCCceEEE Confidence 21111 2222 2344555555444444444433322111 12333444566554 33332 45 Q ss_pred EEE--echHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 424 PTI--STGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGM 501 (536) Q Consensus 424 v~~--vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~ 501 (536) +.| ..|....+.-++ +....+ ...|....+++.+.++ |. | +=--++|++++...+-.+|.. 
T Consensus 450 ivf~p~~P~d~~~vv~~---~~tl~~----------aGiiSretAv~~L~~~-g~-~--~pdae~e~~~ie~~~i~~~~~ 512 (563) T protein:vir:74 450 CIFADPMPVNKTQVTQD---TLLLQQ----------AHLILRKMAVAKLRSI-GW-E--YPEVDDQGNALTDDDIADMLL 512 (563) T ss_pred EEeCCCCCccHHHHHHH---HHHHHH----------cCchhHHHHHHHHHhC-CC-C--CCcHHHHHhhcCHHHHHHHHH Confidence 544 345555443222 211111 1235667777877776 65 2 112256666665554444333 Q ss_pred HHHHHHHHHHHHHhhhcCcchHHhhhhcC-----------CCCCCC Q lcl|NC_011045. 502 DNGAAALAQGMAAQATASPEAMAAAADSV-----------GLQPGI 536 (536) Q Consensus 502 ~~~a~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~q~~~ 536 (536) +++-+. .+...++.+++....++.|.. .+.|-+ T Consensus 513 a~a~ad--~~~~~~a~~~~g~~~~~~dd~g~p~~~~~~~~~~~~~~ 556 (563) T protein:vir:74 513 AEAEAD--ASLGLSAMDNGGAGEQQFDDQGNPIDQFGNPVEIPPDV 556 (563) T ss_pred HHhhcc--CcccceecccCCCCcccccccCCchhHcCCcccCCccc Confidence 332221 111122222222222333322 233333 No 105 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=98.56 E-value=2.8e-07 Score=56.54 Aligned_cols=416 Identities=12% Similarity=0.039 Sum_probs=181.7 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCC-C--CcccccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKD-S--DNASTDYVTPWQAVGARGLNNLASKLMLAL 77 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~-~--~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536) |.++. .+.+...+..+ ....++...+.+|..-...... 
+ -+..-+.-+..-+-+..+|+.||..|.- T Consensus 12 l~~~~----~~~~~~L~~~~----~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~rl~~-- 81 (474) T protein:vir:81 12 LSNDE----NALINGLLAQI----ENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDALARRCNL-- 81 (474) T ss_pred CChhH----HHHHHHHHHHH----HHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHHHHhhhcc-- Confidence 44433 23343333333 3344455566666533321111 1 0111011123455566777777765532 Q ss_pred cCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCC-cee Q lcl|NC_011045. 78 FPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGS-NYN 156 (536) Q Consensus 78 tP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~-~~~ 156 (536) . - |++ ++.... ...+ .+...++++.....+++++..+||.+.++|-.+..+ ... T Consensus 82 --~-G-f~~--~d~~~~--------~~~l-----------~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~ 136 (474) T protein:vir:81 82 --E-G-FVW--PDGDLD--------SLGG-----------TEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEA 136 (474) T ss_pred --c-c-eEC--CCCCcc--------chHH-----------HHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCcee Confidence 1 1 222 221100 0112 233456999999999999999999999988754433 234 Q ss_pred eEEEEecceEEEeeCCC-CCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCC-ceEEEEE--EE-EecCCCCceeEEEE Q lcl|NC_011045. 157 PMKLYRLSSYVVQRDAF-GNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKAD-ETIDVYT--HI-YLDEDSGEYIRYEE 231 (536) Q Consensus 157 ~~~~~~l~~~~v~~d~~-G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~-~~~~v~~--~v-~p~~~~~~~~~~~~ 231 (536) ++++++..+.++..|+. +++...+++... ..+.+ ..+.+|. 
.+ +...+++.+.|..+ T Consensus 137 ~i~~~sp~~~~~~~D~~~~~~~~al~~~~~------------------~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~ 198 (474) T protein:vir:81 137 LIHVKDASEATGEWNRRRRGLNNLLSIIDK------------------DKEGKVLSLALYLDNETVTAQRDKATLKWQVD 198 (474) T ss_pred EEEEeccceEEEEEeCCCCcceeeeEEEEE------------------cCCCcEEEEEEEeCCcEEEEEEcCccceeeec Confidence 67888887777777753 232222221100 01111 1112220 11 11112222222211 Q ss_pred ecCccccccccccccccCceEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccc--- Q lcl|NC_011045. 232 VEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYI-EEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQ--- 307 (536) Q Consensus 232 v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~-~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~--- 307 (536) ...+++ .+|++++..+..-++.+|+|-. +.+++-+..+|...-.++..++..+.|...+- |... T Consensus 199 ---------~~~~~~-gvPvV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~--G~~~~~~ 266 (474) T protein:vir:81 199 ---------RDEHVY-GVPAQVLPYKPAPKRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLL--GADESAL 266 (474) T ss_pred ---------cCCCCC-CcceEEecccccccCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheee--cCChhhc Confidence 112333 3799998888888899999966 67889999999999999999999999985542 2211 Q ss_pred ------hhhhccCCCccee--cCCcc-ccc------ccccccccchhHHHHHHHHHHHHHHHHHhhhhc-----ccCC-C Q lcl|NC_011045. 308 ------PRRLTKAQTGDFV--TGRPE-DIS------FLQLEKQADFTVAKAVSDAIEARLSFAFMLNSA-----VQRT-G 366 (536) Q Consensus 308 ------~~~~~~~~~g~~~--~g~~~-~~~------~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~-----~~~~-~ 366 (536) +.+......+.+. +.+.+ ++. +-++. .++++. .++.++.-|......... ...+ . 
T Consensus 267 ~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~-~a~l~~---~~~~l~~~~~~~a~~t~iP~~~lG~~~~~ 342 (474) T protein:vir:81 267 KNADGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFP-AASPDA---HWSDINGLAKLFAREASLPDTAVAISGLS 342 (474) T ss_pred ccccccccchhhhhHHHHhcCCCcccccccccccccccccC-CCChhH---HHHHHHHHHHHHHhhhCCCHHHhcccccc Confidence 1111112223332 22221 111 11111 223333 344444444332221111 0011 1 Q ss_pred CCCCHHHHHHH-------HHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcc--eEEEEechH--HHHHH Q lcl|NC_011045. 367 ERVTAEEIRYV-------ASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEA--VEPTISTGL--EAIGR 435 (536) Q Consensus 367 ~r~TAtEi~~r-------~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~--v~v~~vs~L--a~a~r 435 (536) ..-+|.-|.+. ++++.+.+|.-+.+ ++..++.+.... ....++.+. +++.+-.|. ..+++ T Consensus 343 np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~--------~~rla~~i~~~~-~~~~~~~~~~~~~v~W~d~~~~s~a~~ 413 (474) T protein:vir:81 343 NPTSAESYDASQYELIAEAEGAVDDFTPALRK--------AFIRALAMKNKV-AIDEIPDEWKSIDAKWRDPRYLSKSAQ 413 (474) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHhCCC-CccccchhhccceeEecCCCccCHHHH Confidence 12245444433 33333333333222 222233222111 122344443 455543332 22333 Q ss_pred HHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_011045. 436 GQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAA- 514 (536) Q Consensus 436 ~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~- 514 (536) +..+.++.+ .++-.+. . ++ +.+.+|+ |++|++.+.+.+..++.... ...+++ T Consensus 414 aDa~~Kl~~-------a~~~~~~----~-~~---~~~~lg~-------t~~~i~~~~~~~~~~~~~~~-----~~~l~~~ 466 (474) T protein:vir:81 414 ADAGMKQLA-------AVPWLAE----T-EV---GLELIGL-------TPQQARRAMADKRRVQGRGT-----LQALIDR 466 (474) T ss_pred HHHHHHHHh-------cccCCCc----H-HH---HHhhcCC-------CHHHHHHHHHHHHHHhHHHH-----HHHHHhc Confidence 333333322 2211111 1 11 2233465 45666555444332211111 111111 Q ss_pred -hhhcCcc Q lcl|NC_011045. 
515 -QATASPE 521 (536) Q Consensus 515 -~~~~~~~ 521 (536) +.++.+| T Consensus 467 ~~~~~~aq 474 (474) T protein:vir:81 467 SNNGATAQ 474 (474) T ss_pred CCCCCCCC Confidence 1122222 No 106 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.50 E-value=4.3e-07 Score=55.47 Aligned_cols=434 Identities=11% Similarity=0.031 Sum_probs=176.8 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccC-CCC---Cccccc-ccccccchHHHHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFP-KDS---DNASTD-YVTPWQAVGARGLNNLASKLML 75 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~-~~~---~~~~~~-~~~~~dst~~~a~~~Laa~l~~ 75 (536) |. ..+.+.+. +.|....+...++.+.+.+|..-.... .-+ ....+. ..++..+-+..+|+.+++.|++ T Consensus 1 ~~----~~t~~~~~---~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~ 73 (456) T protein:vir:79 1 MT----ASTPAEWL---PVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) T ss_pred CC----CCCHHHHH---HHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhcc Confidence 33 23333322 223333333344455555555332110 001 011111 1223445677777777776643 Q ss_pred hhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCce Q lcl|NC_011045. 76 ALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) Q Consensus 76 ~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~ 155 (536) - + |++...++ .+....++ +.+.+++|.....++.++..+||.|.+++..+..+. 
T Consensus 74 ~-----g-~~~~~~~d--------~~~~~~~~-----------~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~- 127 (456) T protein:vir:79 74 N-----G-ITVGGSAD--------SDLALRAR-----------RIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT- 127 (456) T ss_pred C-----C-eecCCCCC--------ccHHHHHH-----------HHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCCc- Confidence 2 1 12221111 01111222 334557899999999999999999988776655544 Q ss_pred eeEEEEecceEEEeeCC-CC-CeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEec Q lcl|NC_011045. 156 NPMKLYRLSSYVVQRDA-FG-NVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVE 233 (536) Q Consensus 156 ~~~~~~~l~~~~v~~d~-~G-~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~ 233 (536) .++++++..+.++..|+ .+ ++...+|.++-. .+... .......+.....+...+. ..+..+..+.... T Consensus 128 ~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~-----d~~~~----~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 197 (456) T protein:vir:79 128 ATITADSPETMVVSVDPLQPWRIRSAMRWWRDL-----DAESD----FAIVWSGDGWQKFARPCFV-QSSSRRRLVTRIS 197 (456) T ss_pred eEEEEeccceeEEEEcCCCCCceEEEEEEEEec-----CCcee----EEEEEcCCceEEEEEEEEe-eccccceeeeccC Confidence 46788887777777764 33 444455443210 00000 0000111122222111111 1111111111112 Q ss_pred CccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccc---------- Q lcl|NC_011045. 234 GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPA---------- 303 (536) Q Consensus 234 g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~---------- 303 (536) +.-.......+.+..+|++.++ +..|.|=.+..++-+-.++...-.+...++..+.|...+.-. 
T Consensus 198 ~~~~~~~~~~~~~~~~pvv~~~------N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~ 271 (456) T protein:vir:79 198 DSWVPVGDAVVTGSPPPVVVYQ------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDEN 271 (456) T ss_pred CceeecccccCCCCceeEEEec------CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCccccccccc Confidence 2211111223334566665542 467888888888888888877666666777777665444211 Q ss_pred c-ccchhhhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhc----ccCCCCCCCHHHHHHHH Q lcl|NC_011045. 304 G-ITQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSA----VQRTGERVTAEEIRYVA 378 (536) Q Consensus 304 g-~~~~~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~----~~~~~~r~TAtEi~~r~ 378 (536) | ..+..+.....+|.+... +++..+.++. ..+++... +.++.-|...+..... ..-+....++.-+.... T Consensus 272 g~~i~~~~~~~~~~~~~~~~-~~~~~~~q~~-~~~~~~~~---~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~ 346 (456) T protein:vir:79 272 GNAIDYASIFEAAPGALWEL-PPGVDIWESQ-TNDFTPML---SAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIE 346 (456) T ss_pred ccccchhhhhhhhccccccC-CCCcceeeec-ccChHHHH---HHHHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHH Confidence 1 011222222334443332 2233333332 23444333 3333333332222111 00112234555444433 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhh Q lcl|NC_011045. 379 SELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDD 458 (536) Q Consensus 379 ~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~ 458 (536) ..+... ..+.+ ..+.+-+.+++.++....-.+ +...+++.+..+.... ..+.++.+.... 
+.+ T Consensus 347 ~~l~~k----~~~~~-~~f~~~l~~~~~l~~~~~g~~--~~~~i~v~w~~~~~~s-~~~~ada~~kl~----~~G----- 409 (456) T protein:vir:79 347 KGFLFK----CEDRL-SIAKIGLEAILVKALQIEGES--VEDTVDVSFESPDRVT-LGEKYSAASLAK----AAG----- 409 (456) T ss_pred HHHHHH----HHHHH-HHHHHHHHHHHHHHHHhcCCC--ccccceEEeCCCCCcC-HHHHHHHHHHHH----hcC----- Confidence 333322 22222 334445566666655432222 2235777776554321 122222222211 111 Q ss_pred hcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHH Q lcl|NC_011045. 459 PDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMA 524 (536) Q Consensus 459 ~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~ 524 (536) +-..+. ....+|+++. ++++...++..+ +..+.++.+++- ..+++.. T Consensus 410 --~~~~~~---~~~~lg~~~~-------~i~~~e~~r~~~-----e~~~~~~~~~~~--~~~~~~~ 456 (456) T protein:vir:79 410 --ESWASI---RRNILNYNAD-------QIKQDDLDRARE-----QITLFAGNPVQR--PQEDGSR 456 (456) T ss_pred --CChHHH---HHhcCCCCHH-------HHHHHHHHHHHH-----HHHHHhhhHhhc--CCCCCCC Confidence 101111 2234576543 332211111111 111111222111 1111111 No 107 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=98.48 E-value=4.9e-07 Score=55.16 Aligned_cols=435 Identities=14% Similarity=0.089 Sum_probs=184.0 Q ss_pred CCCccccccHHHHHHHHHHH---------------------HHHhhhHHHHHHHHHHHhcccccCCCCCccccccccccc Q lcl|NC_011045. 
1 MAEKRTGLAEEGAKSVYERL---------------------KNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQ 59 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l---------------------~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~d 59 (536) |.- -+.++..+.++ -.++......|+.+|+=--|.+.............++-- T Consensus 1 m~~------~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~sl 74 (505) T protein:vir:79 1 MAF------WDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSV 74 (505) T ss_pred Cch------HHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeec Confidence 221 00111111110 011112223455544321121111111111111111112 Q ss_pred chHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHh Q lcl|NC_011045. 60 AVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVV 139 (536) Q Consensus 60 st~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~ 139 (536) +.+...|+.+|+ .+|.-.+ .+++++. +..++|+ +.+..++|+..+.+++.+..+ T Consensus 75 nl~~~i~~~~A~----ll~~e~~--~i~~~d~-------------~~~e~l~-------~i~~~n~f~~~~~~~~e~a~a 128 (505) T protein:vir:79 75 NVTKLASAKLAS----LIFNEQC--QVTVSDE-------------TANDFLD-------DVFQQNDFYTTFEEKLEEWIA 128 (505) T ss_pred chHHHHHHHHHh----hhcCCCc--eeecCCh-------------HHHHHHH-------HHHHhccHHHHHHHHHHHHhh Confidence 455555565555 4442112 2333332 2333443 345568899999999999999 Q ss_pred hCcEEEEEecCCCCceeeEEEEecceEE-EeeCCCCCeEEEE-EeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEE Q lcl|NC_011045. 140 AGNVLLYLPEPEGSNYNPMKLYRLSSYV-VQRDAFGNVLQMV-TRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHI 217 (536) Q Consensus 140 ~G~~~l~~~~~~~~~~~~~~~~~l~~~~-v~~d~~G~v~~i~-r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v 217 (536) +|.+++.+.-+. 
+.+++..++...++ +..|..+....+| ++++.+ ...+...+..++.++ T Consensus 129 ~G~~~~k~~~D~--~~~~i~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~--------------~~~~~~~yt~lE~h~-- 190 (505) T protein:vir:79 129 LGSGCVRPYVDS--GKIKLAWATADQVYPLQADTNQVNELAIASRTTEV--------------ENHRTIYYTLLEFHQ-- 190 (505) T ss_pred cCCeEEEEEEeC--CceEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEEe--------------cCCcceEEEEEEEEE-- Confidence 999988444333 23567888988876 4556544334333 222211 000111222222221 Q ss_pred EecCCCCce----eEEEEec----Cccccccc----------cc-cccccCceEEEe---ee-ecCCCccccchHHHHHH Q lcl|NC_011045. 218 YLDEDSGEY----IRYEEVE----GMEVQGSD----------GT-YPKEACPYIPIR---MV-RLDGESYGRSYIEEYLG 274 (536) Q Consensus 218 ~p~~~~~~~----~~~~~v~----g~~i~~~~----------~~-~~~~~~P~~~~r---w~-~~~ge~YGrgp~~~~l~ 274 (536) . +++++ ..|.+-. |..+.... .. .+...-+|..++ ++ ...++.+|+|-...+.+ T Consensus 191 -~--~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~ 267 (505) T protein:vir:79 191 -W--DHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYT 267 (505) T ss_pred -e--cCceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHH Confidence 1 12222 2222211 22211110 00 111122233322 22 23467899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhc------------cCCCcceec--CCcccccccccccccchhH Q lcl|NC_011045. 275 DLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLT------------KAQTGDFVT--GRPEDISFLQLEKQADFTV 340 (536) Q Consensus 275 d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~------------~~~~g~~~~--g~~~~~~~~~~~~~~~~~~ 340 (536) .+..||..--+.....+. .+....|+++ ++...... ....-.+.. +..+...+..+...-+... 
T Consensus 268 ~id~lD~~~s~~~~e~~~-g~~~i~v~~~-~l~~~~~~~~~~~~~~~~~fd~~~~~y~~~~~~~~~~~i~~~~~~ir~e~ 345 (505) T protein:vir:79 268 VIDAINRTHDQFVDEVKK-GQRRLIVPAE-WLKTGSSYGGQASETHPPMFDPDETVYQAMYGDASEVGFHDATSPIRVAD 345 (505) T ss_pred HHHHHHHHHHHHHHHHHh-cccceeechH-HhcccCCCCcccccccccCCCccceeeeeccCCCCCCceEEecccCCHHH Confidence 999999877777776654 4555555433 22111000 000000111 1111111211211111222 Q ss_pred HHHHHHHHHHHHHHHHhh--hhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCC---- Q lcl|NC_011045. 341 AKAVSDAIEARLSFAFML--NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQI---- 414 (536) Q Consensus 341 ~~~~i~~~~~rI~~af~~--~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~l---- 414 (536) ....++.+-++|....=+ ..+......-.|||||..+.+.......-.-..+ ...|..|++.++.+..-.+.. T Consensus 346 ~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l~~t~~~~~~~~-~~al~~li~~i~~~~~~~~~~~~g~ 424 (505) T protein:vir:79 346 YQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQTYQTRSSYITQV-EKTIKALTYAILELASVPSFYADGQ 424 (505) T ss_pred HHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhccccccc Confidence 333444444444322111 1122122334599999999999988888766665 445677888888776554432 Q ss_pred ----CCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHH Q lcl|NC_011045. 415 ----PELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQ 490 (536) Q Consensus 415 ----p~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~ 490 (536) ++++...+.|.|--++..-. ..+++...+.++. + + +.... .+....|+ |++|+++ T Consensus 425 ~~~~~~~~~~~i~v~f~d~i~~d~-~~~~~~~~~~v~~----G---i---~s~e~---~l~~~~~~-------~eeea~~ 483 (505) T protein:vir:79 425 ARWTGDVDSLDITINFNDGVFVDQ-ESKRAADLQAVQA----Q---V---MPKKQ---FLMRNYGL-------DEEEADE 483 (505) T ss_pred ccccCCCCceeEEEEeCCCCCCCH-HHHHHHHHHHHHc----C---C---CCHHH---HHHhcCCC-------ChHHHHH Confidence 23445567777765544322 1112222222211 1 1 12222 23344555 4454433 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhh Q lcl|NC_011045. 
491 KMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAAD 528 (536) Q Consensus 491 ~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 528 (536) +.++-+..+. . ..|+......+ T Consensus 484 el~ri~~E~~--~--------------~~p~~~~~gg~ 505 (505) T protein:vir:79 484 WLAQIDAENS--T--------------AEPEFNQFGGD 505 (505) T ss_pred HHHHHHHhcc--c--------------cCCCchhccCC Confidence 3222111100 0 11111111111 No 108 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=98.48 E-value=5e-07 Score=55.15 Aligned_cols=446 Identities=13% Similarity=0.077 Sum_probs=197.6 Q ss_pred CC--------CccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc--cCCCCCcccccccccccchHHHHHHHHH Q lcl|NC_011045. 1 MA--------EKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL--FPKDSDNASTDYVTPWQAVGARGLNNLA 70 (536) Q Consensus 1 Ma--------~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~--~~~~~~~~~~~~~~~~dst~~~a~~~La 70 (536) || +..+.++.+.+.+..+..+.. ..+++.+.+|....- .............++-.+-+...++..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~----~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~Iv~~~~ 76 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPNIEAINYAIRELQNR----KKRLDKLSDYYNGKQEIEKHEFDNATVEAANVMVNHAKYITDMNV 76 (499) T ss_pred CccchhhhHHhhhhcCCHHHHHHHHHHHHHH----HHHHHHHHHHhccccchhcCCcCcCCCCcceeecchHHHHHHHHh Confidence 44 234455566666666666443 345555666654421 0011111112234555556666666666 Q ss_pred HHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecC Q lcl|NC_011045. 71 SKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEP 150 (536) Q Consensus 71 a~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~ 150 (536) +.|.+- |. ++...+.. .... 
+.+.+..++|.....++.++..+||.+.+++..+ T Consensus 77 ~~l~g~--p~----~~~~~~~~---------~~~~-----------l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~ 130 (499) T protein:vir:10 77 GFMTGN--PV----KYVAEKGK---------NIDD-----------ILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLK 130 (499) T ss_pred hhhccc--Cc----eeecCChh---------HHHH-----------HHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEec Confidence 544321 31 22222211 1112 3334445788899999999999999998766544 Q ss_pred CCCc----------------eeeEEEEecceEEEee-CCCCCeEE-EEEeEeccHHHHHHHHhHHhhhccccCCCCceEE Q lcl|NC_011045. 151 EGSN----------------YNPMKLYRLSSYVVQR-DAFGNVLQ-MVTRDQIAFGALPEDIRKAVEGQGGEKKADETID 212 (536) Q Consensus 151 ~~~~----------------~~~~~~~~l~~~~v~~-d~~G~v~~-i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~ 212 (536) ..+. .+++..++..+.++.. |..++... .+|.+... ..........++ T Consensus 131 ~~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~--------------~~~~~~~~~~~~ 196 (499) T protein:vir:10 131 KTDPISVRDELGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKK--------------DLEGNTNGYSIT 196 (499) T ss_pred ccccccccccccccccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEe--------------ecCCCceEEEEE Confidence 4321 2345555544444443 33333333 33333221 000011112233 Q ss_pred EEEEEEecCCCCceeEEE-----Ee-cCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 213 VYTHIYLDEDSGEYIRYE-----EV-EGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAI 286 (536) Q Consensus 213 v~~~v~p~~~~~~~~~~~-----~v-~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~ 286 (536) +|+. . +...|. .. .+...+ ....+++..+|++.++- +.+|.|=.+...+.+..++.+.-.. 
T Consensus 197 iyt~---~----~i~~~~~~~~~~~~~~~~~~-~~~~~~~g~vPvv~~~n-----~~~~~~d~e~v~~liD~~~~~~S~~ 263 (499) T protein:vir:10 197 VYMP---Q----RIVEYRTKTTMEVSANDPIV-YDGENLFGAVPIIEFRN-----NEERQGDFEQLISLIDAYNLLQTDR 263 (499) T ss_pred EEeC---C----eEEEEEecCCccccCcceec-ccccCCCCccceEEecC-----CCCCCCchHhHHHHHHHHHHHHHHH Confidence 3321 0 011111 01 111111 23345577899987654 4679999999999999999999999 Q ss_pred HHHHHHHhCCceeeccccccc-hhhhccCCCccee-cCCcccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhccc Q lcl|NC_011045. 287 VKMSMISSKVIGLVNPAGITQ-PRRLTKAQTGDFV-TGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQ 363 (536) Q Consensus 287 ~~~~~~a~~p~~lv~~~g~~~-~~~~~~~~~g~~~-~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~ 363 (536) ....+....|.+++.-..... .........|.+. ....++..+..+....+.......++.+.+.|.+.-.. +.... T Consensus 264 ~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~ 343 (499) T protein:vir:10 264 ISDKEAFVDALLVTFGFGLGDDKDDIQRLKRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDE 343 (499) T ss_pred HHHHHHhcCceeeeecCccccccchhhhhhhcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCch Confidence 999999999988775322211 1111111223322 12222223444555556777788888888877552211 11111 Q ss_pred CCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHH Q lcl|NC_011045. 364 RTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLE 443 (536) Q Consensus 364 ~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~ 443 (536) .-+...|+..+..+..-+... ..-..+.-.+.+.-++.-++.++...|.- ..-..+++.|.-++..-. .+.++.+. 
T Consensus 344 ~~~gn~Sg~Al~~~~~~l~~k-~~~k~~~~~~~l~~~~~li~~~~~~~~~~--~d~~~i~i~f~~~~p~n~-~e~~~~~~ 419 (499) T protein:vir:10 344 KFMGNVSGEAMKFKLFGLENL-LSIKQRYFFDGLRRRLKLIQTIVNIKGAN--DDASGCKISLVANIPSNL-SDVVNNVK 419 (499) T ss_pred hhcccchHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCc--cccccceEEeCCCCCCCH-HHHHHHHH Confidence 112234666655433322222 22222222222333333333333223321 222356676654443211 12222221 Q ss_pred HHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchH Q lcl|NC_011045. 444 RCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAM 523 (536) Q Consensus 444 ~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~ 523 (536) .. +. .|....+++.+ -+| -..++|++++.++++..+...+... . +.+|..+ T Consensus 420 kl----~g--------~iS~et~~~~l---~~v-----~d~~~E~~ri~~E~~~~~~~~~~~~-~--------~~~~~~~ 470 (499) T protein:vir:10 420 NA----DG--------IIPRKYTYSWL---PDV-----DNPQDVIDEMNQQDAETIKKNQEAL-R--------GQDPDRL 470 (499) T ss_pred HH----hc--------cCChHHHHHhC---CCC-----CCHHHHHHHHHHHHHHHHHHHHhhh-c--------cCCCCCC Confidence 11 11 12223333221 122 1235677666655543322222111 0 0001111 Q ss_pred HhhhhcCCC------------CCCC Q lcl|NC_011045. 524 AAAADSVGL------------QPGI 536 (536) Q Consensus 524 ~~~~~~~~~------------q~~~ 536 (536) .......+- +||= T Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (499) T protein:vir:10 471 ELEDKQDDSSENDKEAGSNHNQSHR 495 (499) T ss_pred CCCCCCcccCCCCCCCccccccCCC Confidence 111111111 1221 No 109 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=98.47 E-value=5.2e-07 Score=55.02 Aligned_cols=457 Identities=9% Similarity=0.015 Sum_probs=187.3 Q ss_pred CCCccccccHHHHHHHHHHH--------------------HHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccc Q lcl|NC_011045. 
1 MAEKRTGLAEEGAKSVYERL--------------------KNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQA 60 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l--------------------~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~ds 60 (536) |.. -+.++.-|.++ -..-.....+|+.+|+=-.|.+....++....+..+.-=+ T Consensus 1 m~~------~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~ 74 (517) T protein:vir:98 1 MKV------IQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLN 74 (517) T ss_pred Cch------HHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecC Confidence 221 11111111111 1111123445666654333433222222212222222223 Q ss_pred hHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhh Q lcl|NC_011045. 61 VGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVA 140 (536) Q Consensus 61 t~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~ 140 (536) .+...++.+|+ .+|--.+ .+.+++....+.. ........++|++ .+..++|+..+.+++.+..+. T Consensus 75 ~~~~i~~~~A~----Ll~~e~~--~i~v~d~~~~~~~--~~~~~~~~e~l~~-------i~~~n~f~~~~~~~~e~a~a~ 139 (517) T protein:vir:98 75 LRKLSADVLSG----LVFNEQC--EVYVSDAKDEEKK--DNSFKTAHEFIQH-------VFQHNKFIKNLSDYLEPTFAL 139 (517) T ss_pred cHHHHHHHhhh----hhcCCcc--eEEeccccccccc--ccchhHHHHHHHH-------HHHhccHHHHHHHHHHHHhhh Confidence 44445555544 4442222 2222222111110 0111224455554 455789999999999999999 Q ss_pred CcEEEEEecCCCCceeeEEEEecceEEE-eeCCCCCeEEEE-EeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEE Q lcl|NC_011045. 141 GNVLLYLPEPEGSNYNPMKLYRLSSYVV-QRDAFGNVLQMV-TRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIY 218 (536) Q Consensus 141 G~~~l~~~~~~~~~~~~~~~~~l~~~~v-~~d~~G~v~~i~-r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~ 218 (536) |++++-+-.+. +.++...++...|+- ..|..|.+..+| .++..+.+ .+...+..++. 
-+ T Consensus 140 G~~a~k~~~d~--~~~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~--------------~~~~~Yt~lE~---H~ 200 (517) T protein:vir:98 140 GGLTVRPYVDN--GEIEFSWALANAFYPLRSNSNGISEGVMKSVTTKVIG--------------NKTVYYTLLEF---HE 200 (517) T ss_pred CCEEEEEEEeC--CeeEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeec--------------CCceEEEEEEE---Ee Confidence 99987433333 235677888877775 556666444433 23322210 01111111121 11 Q ss_pred ecC---CCCcee----EEEEe----cCccccccc--------cccccccCceEE----Eeeee-cCCCccccchHHHHHH Q lcl|NC_011045. 219 LDE---DSGEYI----RYEEV----EGMEVQGSD--------GTYPKEACPYIP----IRMVR-LDGESYGRSYIEEYLG 274 (536) Q Consensus 219 p~~---~~~~~~----~~~~v----~g~~i~~~~--------~~~~~~~~P~~~----~rw~~-~~ge~YGrgp~~~~l~ 274 (536) +.. .++.|. .|.+- -|.++.... -+...-..|.++ +-.+. ..++.||+|-...+++ T Consensus 201 ~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~ 280 (517) T protein:vir:98 201 WEKTEEGESLYVITNELYKSDNEGEIGKRIPLEELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVS 280 (517) T ss_pred cCceeccCCcEEEEEEEEecCCCccccccccccccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHH Confidence 111 112222 22221 123332110 010101234221 12233 3378999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCCcc--------e--ecCCcccccccccccccchhHHHHH Q lcl|NC_011045. 275 DLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQTGD--------F--VTGRPEDISFLQLEKQADFTVAKAV 344 (536) Q Consensus 275 d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~g~--------~--~~g~~~~~~~~~~~~~~~~~~~~~~ 344 (536) .++.||..--+.+.-.++ .+..+.|+++.+....+.....++. + +.+..++..+..+...-+....... 
T Consensus 281 ~~d~lD~~~s~~~~e~~~-g~~~i~vp~~~l~~~~~~~g~~~~~~~d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~ 359 (517) T protein:vir:98 281 TLKKINDTYDQFWWEIKM-GQRTVFVSDVMLRTVPDESGMPPPQVFDPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEA 359 (517) T ss_pred HHHHHHHHHHHHHHHHHh-CCcceecChhhhccccCCCCcccCCCCCcccceeeeccCCCCCCceeeeccccchHHHHHH Confidence 999999877777766655 6667777555432111110001111 1 1111121111111111112233444 Q ss_pred HHHHHHHHHHHH-h-hhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCC--CCCc Q lcl|NC_011045. 345 SDAIEARLSFAF-M-LNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPE--LPKE 420 (536) Q Consensus 345 i~~~~~rI~~af-~-~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~--~~~~ 420 (536) ++.+-+.|.... + ...+......-.|||||..+.+...+...-+-.. -...|.-+++-++.+..-.+++.. .+.. T Consensus 360 ~~~~L~~i~~~~Gls~~t~~~~~~~~kTATEi~s~~~~~~~t~~~~~~~-~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~ 438 (517) T protein:vir:98 360 INQALRTLEMELKLSVGTFSFDGRSMKTATEIVSENDLTYRTRNDHVYE-VEQFIKGLVISVLELAKTYKLFGGEIPSAE 438 (517) T ss_pred HHHHHHHHHHHhCCCcccccccccccccHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcCCCCCCCc Confidence 444444443221 1 1112212233359999999999888877754333 333455555555444333333322 2234 Q ss_pred ceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHH Q lcl|NC_011045. 421 AVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMG 500 (536) Q Consensus 421 ~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~ 500 (536) .+.|.|--++..-.. ..++...+ .+++ + .+....+ +...+|+ |++|.+++.++-+.... T Consensus 439 ~v~v~f~D~i~~D~~-~~~~~~~~---~v~a-G------~ms~~~~---i~~~~g~-------~eeeA~~e~~~i~~E~~ 497 (517) T protein:vir:98 439 HIGVDFDDGVFQDRS-ALLRFYGQ---AKTF-G------FIPTVEA---IQRIFKV-------PKKTAEQWLEEIRKDQI 497 (517) T ss_pred ceEEEcCCCCCCCHH-HHHHHHHH---HHhc-C------CCCHHHH---HHHhCCC-------ChHHHHHHHHHHHHhcc Confidence 577777555433221 11111122 1111 1 1222333 3334554 44444333222111111 Q ss_pred HHHHHHHHHHHHHHhhhcCcchHH Q lcl|NC_011045. 
501 MDNGAAALAQGMAAQATASPEAMA 524 (536) Q Consensus 501 ~~~~a~~~~~~~~~~~~~~~~~~~ 524 (536) ... . ......+....+.... T Consensus 498 ~~~--~--~~~~~~~~~~~~gd~e 517 (517) T protein:vir:98 498 ELD--P--VTISQRAQKRMFGDEE 517 (517) T ss_pred ccC--C--CCccccccCCCCCCCC Confidence 000 0 0001111111111111 No 110 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=98.47 E-value=5.3e-07 Score=54.99 Aligned_cols=439 Identities=10% Similarity=0.074 Sum_probs=196.3 Q ss_pred CCCccccccHHHHHHHHHHHHH--------------HhhhHHHHHHHHHHHhcccc-----cC-CCC---Cccccccccc Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKN--------------DRAPYETRAQNCAQYTIPSL-----FP-KDS---DNASTDYVTP 57 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~--------------~R~~~e~~w~e~~~~~~P~~-----~~-~~~---~~~~~~~~~~ 57 (536) |++=+=++++...+..++.++. .-....++++.+.+|....- .. ... ....+...++ T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki 80 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDWRM 80 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcccccccccccccccccccee Confidence 7774433443333322222211 11222445666666665421 00 000 0011112345 Q ss_pred ccchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHH Q lcl|NC_011045. 58 WQAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQL 137 (536) Q Consensus 58 ~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl 137 (536) ..+-+...+++.++.|++ - | + .+...+.. ....++.|+ ..+|.....++.++. 
T Consensus 81 ~~n~~~~ivd~~~~~l~g-~-~--~--~~~~~~d~---------~~~~l~~~~------------~n~~~~~~~~~~~~~ 133 (478) T protein:vir:10 81 YTNYHQNLVDQKVAYAVA-N-P--V--TFGVDNDK---------ALKQIQHTL------------NHKWDDKLVDILTAA 133 (478) T ss_pred ccchHHHHHHHHHhhhcc-C-C--e--eeecCChH---------HHHHHHHHH------------hcCHHHHHHHHHHHH Confidence 566677777777765542 1 2 1 22333221 111223222 358899999999999 Q ss_pred HhhCcEEEEEecCCCCceeeEEEEecceEEEeeC--CCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEE Q lcl|NC_011045. 138 VVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRD--AFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYT 215 (536) Q Consensus 138 ~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d--~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~ 215 (536) .++|.|.+++..+.. +.+++++++..+.+...| ..|++...+|.++..- .+.+++|+ T Consensus 134 ~~~G~~~~~~~~d~~-g~~~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~--------------------~~~~~~y~ 192 (478) T protein:vir:10 134 SNKGIEWVQPYVDEE-GEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELDG--------------------AERVEYWT 192 (478) T ss_pred HhcCeEEEEEEecCC-CeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecC--------------------ceEEEEEe Confidence 999999987766554 345677787777766655 3577777777664221 11233332 Q ss_pred E--E-EecCCCCceeE--EEEecCcc--ccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 216 H--I-YLDEDSGEYIR--YEEVEGME--VQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVK 288 (536) Q Consensus 216 ~--v-~p~~~~~~~~~--~~~v~g~~--i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~ 288 (536) . | +.+..++...+ +....+.. .......+++..+|++.++. +.+|+|=.+...+-+..++.+.-.+.. 
T Consensus 193 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----~~~g~sd~~~v~~liDa~~~~~S~~~~ 267 (478) T protein:vir:10 193 KDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVPFIPFKN-----NPQEVSDLFMYKTIIDALDKRLSDTQN 267 (478) T ss_pred CCeEEEEEEcCCeeeccccccccccccceecccccccCCccceEEecc-----CCCCCCcHHHHHHHHHHHHHHHHHHHH Confidence 1 0 00111111110 11111111 11112345567889887754 468999999999999999999999999 Q ss_pred HHHHHhCCceeeccccccchhhhc-cCC-Ccce-ecCCc-ccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhccc Q lcl|NC_011045. 289 MSMISSKVIGLVNPAGITQPRRLT-KAQ-TGDF-VTGRP-EDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSAVQ 363 (536) Q Consensus 289 ~~~~a~~p~~lv~~~g~~~~~~~~-~~~-~g~~-~~g~~-~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~ 363 (536) ..+....|.+++.--+..+..+.. ... .+.+ +++.. +++. .+....+.......++.+++.|-..-.. +.... T Consensus 268 ~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~ 345 (478) T protein:vir:10 268 TFDESVELIYILKGYEGEDMKDFMHNLKYYKAISVAGESGSGVD--TIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQD 345 (478) T ss_pred HHHHhhCceeeeecCCccccchhhhhhhhcceEEecCCCCCcce--EEeecCChHHHHHHHHHHHHHHHHHhCccccCcc Confidence 999999998765422111111111 111 1222 22222 3333 3333345666777777777666443211 11111 Q ss_pred CCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHH--HHHHHHHHHH Q lcl|NC_011045. 364 RTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLE--AIGRGQDLDK 441 (536) Q Consensus 364 ~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La--~a~r~~~~~~ 441 (536) ..+...|+..+..+..-+ .+......+. +...+.+++.++.+..-. ......++|+|.-.+. .+..+ +. 
T Consensus 346 ~~~~n~Sg~Al~~~~~~l-~~k~~~~~~~----~~~~l~~~~~li~~~~g~-~~~~~~i~i~f~~~~p~d~~e~a---~~ 416 (478) T protein:vir:10 346 KFGNSPSGIALKFMYSNL-DLKANKLKNK----TLTALQELLQYIIDFYRL-DVKVQDIEITFNFNVMVNELENS---QI 416 (478) T ss_pred ccccccHHHHHHHHHHHH-HHHHHHHHHH----HHHHHHHHHHHHHHHhCC-CcccccceEEecCCCCCCHHHHH---HH Confidence 112344665544432211 1122222222 233334444443332111 1233356676654333 22222 11 Q ss_pred HHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcc Q lcl|NC_011045. 442 LERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPE 521 (536) Q Consensus 442 l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~ 521 (536) ++.+++ .+....+++. +|. +--.++|++.+.+++...++.. . ..........+ T Consensus 417 -------~~kl~g-----~iS~et~~~~----l~~----v~D~~~E~~ri~~E~~~~~~~~---~----~~~~~~~~~~~ 469 (478) T protein:vir:10 417 -------AMNSTG-----LLSKETILSN----HAW----VEDPVAEMERIEQENIELNQQL---P----DIEEGLNGEQQ 469 (478) T ss_pred -------HHHHhC-----CCChHHHHHh----CCC----CCCHHHHHHHHHHHHHHHHhhc---c----ccccccCCCCC Confidence 111111 1223333333 332 1123466666655443222111 1 11111111111 Q ss_pred hHHhhhhcCCCCCC Q lcl|NC_011045. 522 AMAAAADSVGLQPG 535 (536) Q Consensus 522 ~~~~~~~~~~~q~~ 535 (536) .... .+ ||= T Consensus 470 ~~~~-~~----~~~ 478 (478) T protein:vir:10 470 RQSE-NN----QPE 478 (478) T ss_pred CCCC-CC----CCC Confidence 1111 11 111 No 111 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=98.38 E-value=9.5e-07 Score=53.60 Aligned_cols=389 Identities=13% Similarity=0.102 Sum_probs=183.9 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCC-CC---cccccccccccchHHHHHHHHHHHHHHhhcCCCcc Q lcl|NC_011045. 
8 LAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKD-SD---NASTDYVTPWQAVGARGLNNLASKLMLALFPMQTW 83 (536) Q Consensus 8 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~-~~---~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~W 83 (536) |+...+......+...+ ++.+.+.+|.....-... +. ..-+.+.+...+-+..+|++||..|. + T Consensus 1 m~~~~i~~L~~~~~~~~----~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~--------~ 68 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFK----TGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRII--------F 68 (422) T ss_pred CChHHHHHHHHHHHHHH----HHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhccc--------c Confidence 66777766666665543 344556666544321111 11 11111222333455666666655331 1 Q ss_pred eeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEec Q lcl|NC_011045. 84 MRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRL 163 (536) Q Consensus 84 f~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l 163 (536) =.+...|. .+ .+...++++.....++.++..+||.+.++|-.+...+..+++.++. T Consensus 69 ~Gf~~~d~-------------~l-----------~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p~i~~~sp 124 (422) T protein:vir:97 69 REFTNDDF-------------NA-----------WEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAEDGLPKMQVIEA 124 (422) T ss_pred ceeeCCch-------------hH-----------HHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCCCeeEEEEech Confidence 11222221 12 2344569999999999999999999999997554323346778888 Q ss_pred ceEEEeeCCC-CCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccccccc Q lcl|NC_011045. 164 SSYVVQRDAF-GNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDG 242 (536) Q Consensus 164 ~~~~v~~d~~-G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~ 242 (536) .+..+..|+. +++...++++... . +..... ..++++. ..++....|... 
.- T Consensus 125 ~~~~~i~D~~~~~~~~a~~~~~~~------------------~--~~~~~~-~~~~~~~----~~~~~~~~~~~~---~~ 176 (422) T protein:vir:97 125 SKATGILDPTTFLLTEGYAILESD------------------S--NGNPTL-EAYFTDK----DIWYYPKKGKPY---NI 176 (422) T ss_pred hhEEEEEeCCCCcceeeEEEEEec------------------C--CCcEEE-EEEEcCc----eEEEEcCCCccc---cc Confidence 7777777753 3333333322110 0 011111 1112211 011111111111 11 Q ss_pred ccccccCceEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHHhCCceee---ccccccchhhhccCCCcc Q lcl|NC_011045. 243 TYPKEACPYIPIRMVRLDGESYGRSYI-EEYLGDLRSLENLQEAIVKMSMISSKVIGLV---NPAGITQPRRLTKAQTGD 318 (536) Q Consensus 243 ~~~~~~~P~~~~rw~~~~ge~YGrgp~-~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv---~~~g~~~~~~~~~~~~g~ 318 (536) .+++..+|++++..+...++.||+|-. +..++-+..++...-.++..++..+.|...+ .++|. +.+......|. T Consensus 177 ~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~--~~~~~~~~~~~ 254 (422) T protein:vir:97 177 KNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDPDAK--PMEKWRATVST 254 (422) T ss_pred cCCCCCcceEEecccCCCccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccCcccc--cCchhhhhhhh Confidence 233567899999999899999999977 6789999999999999999999999998544 22221 11212223344 Q ss_pred ee--cCCccc--ccccccccccchhHHHHHHHHHHHHHHHHHhhh-----hcccCCCCCCCHHHHHHH-------HHHHH Q lcl|NC_011045. 319 FV--TGRPED--ISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN-----SAVQRTGERVTAEEIRYV-------ASELE 382 (536) Q Consensus 319 ~~--~g~~~~--~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~-----~~~~~~~~r~TAtEi~~r-------~~E~~ 382 (536) +. +.+.++ +.+-++. .++++. .++.++.-|....... .+........+|.-|.+. .+++. 
T Consensus 255 i~~~~~de~~~~~~v~q~~-~~~l~~---~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~ 330 (422) T protein:vir:97 255 LLEISKDEDGDKPTVGQFT-TASMAP---FMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQ 330 (422) T ss_pred hhccCCCCCCCcceeeecC-CCChhH---HHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHH Confidence 42 222221 2222232 345553 3444444443322111 111111112244444322 23333 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCCCc--ceEEEEe-----chHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_011045. 383 DTLGGVYSILSQELQLPLVRVLLKQLQA-TQQIPELPKE--AVEPTIS-----TGLEAIGRGQDLDKLERCVAAWAALAP 454 (536) Q Consensus 383 ~~LG~v~~rl~~E~l~Pli~r~~~il~~-~g~lp~~~~~--~v~v~~v-----s~La~a~r~~~~~~l~~~~~~~~~~~p 454 (536) +.+|.-+ .+++.++.. .|..+..+.. .+++.+. +....++.+..+.+|.+ .+| T Consensus 331 ~~fg~~l------------~~~~rla~~~~~~~~~~~~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~-------a~~ 391 (422) T protein:vir:97 331 RSFSSGF------------LNVAYIAVCLRDEFPYLRNQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQ-------AIP 391 (422) T ss_pred HHHHHHH------------HHHHHHHHHHhcCCcccchhhccceEEEccCCCCChHHHHHHHHHHHHHHh-------hcc Confidence 3333333 333333222 2334444433 3555543 33334554444444333 222 Q ss_pred hhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHH Q lcl|NC_011045. 455 MRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMG 500 (536) Q Consensus 455 ~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~ 500 (536) . ..+.+.+. +.+|++. .++|...+.+.++. . 
T Consensus 392 ~----~~~~~~~~----~~lg~~~-----~~~~~~~~~~~~~d--~ 422 (422) T protein:vir:97 392 G----FMDADVIR----DLTGVKG-----ADKPIPAITEVTTD--G 422 (422) T ss_pred c----cccHHHHH----HHcCCCc-----hhHHHHHHHhhhcc--C Confidence 1 12333333 3346632 23333222222111 1 No 112 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=98.37 E-value=1e-06 Score=53.49 Aligned_cols=374 Identities=15% Similarity=0.104 Sum_probs=184.2 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccC-CCC--Cc-ccccccccccchHHHHHHHHHHHHHHhhcCCCcc Q lcl|NC_011045. 8 LAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFP-KDS--DN-ASTDYVTPWQAVGARGLNNLASKLMLALFPMQTW 83 (536) Q Consensus 8 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~-~~~--~~-~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~W 83 (536) |+.+.+....+++...+ ++.+.+.+|..-..-. .-+ -+ .-+..-+..-+-+..+|+++|..|. +.+ T Consensus 1 ~~~~~i~~L~~~~~~~~----~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~----~~G-- 70 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHK----RRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLV----FRE-- 70 (409) T ss_pred CCHHHHHHHHHHHHHHh----HHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcc----cCc-- Confidence 77777766666665543 3344444454221110 001 11 1111223444566777777666542 111 Q ss_pred eeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEec Q lcl|NC_011045. 84 MRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRL 163 (536) Q Consensus 84 f~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l 163 (536) | ...|. .++ +...+++|.....++.++..+||.+.++|-.+..+.+ +++.++. 
T Consensus 71 f--~~~d~-------------~l~-----------~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~-~i~~~sp 123 (409) T protein:vir:94 71 F--ENDDF-------------TVN-----------EIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAV-RLQVIEA 123 (409) T ss_pred c--cCCch-------------HHH-----------HHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCce-EEEEecc Confidence 1 11111 122 3345689999999999999999999999887665544 6788888 Q ss_pred ceEEEeeCCC-CCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEE--EE-EecCCCCceeEEEEecCccccc Q lcl|NC_011045. 164 SSYVVQRDAF-GNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYT--HI-YLDEDSGEYIRYEEVEGMEVQG 239 (536) Q Consensus 164 ~~~~v~~d~~-G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~--~v-~p~~~~~~~~~~~~v~g~~i~~ 239 (536) .+.++..|+. +++...++...- .+........+|. .+ +...+++.|.. T Consensus 124 ~~~~~i~D~~~~~~~~a~~~~~~-----------------d~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------- 175 (409) T protein:vir:94 124 VNATGIIDPITGLLTEGYAVLER-----------------DENNNVVLEAHFLPDRTDYYYRDSRNNIS----------- 175 (409) T ss_pred ceEEEEEecCCCceeeeEEEEEe-----------------cCCCceEEEEEEecCcEEEEEecCceeEe----------- Confidence 8888888863 444444433210 0000001111111 00 00111222211 Q ss_pred cccccccccCceEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHHhCCceee---ccccccchhhhccCC Q lcl|NC_011045. 240 SDGTYPKEACPYIPIRMVRLDGESYGRSYI-EEYLGDLRSLENLQEAIVKMSMISSKVIGLV---NPAGITQPRRLTKAQ 315 (536) Q Consensus 240 ~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~-~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv---~~~g~~~~~~~~~~~ 315 (536) -.+++..+|++++..+...++.||+|-. +..++-+..+|...-.++..++..+.|...+ .+++. +.+..... 
T Consensus 176 --~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~--~~~~~~~~ 251 (409) T protein:vir:94 176 --IANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAE--PMETWKAT 251 (409) T ss_pred --eeCCCCCcceEEeccccccccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCc--ccchhhhh Confidence 1234568999999998888999999976 5688999999999999999999999997444 33332 22222333 Q ss_pred Ccceec--CCccc--ccccccccccchhHHHHHHHHHHHHHHHHHhhhh-----cccCCCCCCCHHHHHH-------HHH Q lcl|NC_011045. 316 TGDFVT--GRPED--ISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-----AVQRTGERVTAEEIRY-------VAS 379 (536) Q Consensus 316 ~g~~~~--g~~~~--~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-----~~~~~~~r~TAtEi~~-------r~~ 379 (536) .+.+.. ...++ +.+-++. .++++. .++.++.-|........ +.......-+|.-|.+ +++ T Consensus 252 ~~~i~~~~~d~dg~~~~v~q~~-~~~l~~---~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~ 327 (409) T protein:vir:94 252 VSSMLQFTKDEDGDKPTLGQFT-QPSMSP---FTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGR 327 (409) T ss_pred HHHhhcCCCCCCCCCceEEecC-CCChhH---HHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHH Confidence 444432 22221 2222332 345553 44555544443322211 1101111123433332 223 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCc--ceEEEEe-----chHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011045. 380 ELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKE--AVEPTIS-----TGLEAIGRGQDLDKLERCVAAWAAL 452 (536) Q Consensus 380 E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~--~v~v~~v-----s~La~a~r~~~~~~l~~~~~~~~~~ 452 (536) ++.+.+|.-+. -++..++.+. +..+..+.+ .+++.+. +.-..++.+..+.+|.+ . T Consensus 328 ~k~~~fg~~~~--------~~~rla~~i~---~~~~~~~~~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~-------a 389 (409) T protein:vir:94 328 KAQRSLGAGLL--------NVAYLAACLR---DDAPYLREQFRKTKPKWEPLFEADASMLSLIGDGAIKLNQ-------A 389 (409) T ss_pred HHHHHHHHHHH--------HHHHHHHHHh---CCCCccccccccceEEeccCCCcchHHHHHHHHHHHHHHH-------h Confidence 33333333222 2333333333 223333332 3555554 22223444444444433 2 Q ss_pred cchhhhhcCCHHHHHHHHHHHcCCChhh Q lcl|NC_011045. 
453 APMRDDPDINLAMIKLRIANAIGIDTSG 480 (536) Q Consensus 453 ~p~~~~~~id~d~~~~~~a~~~Gv~p~~ 480 (536) +|.. .+.+ .+.+.+|.+... T Consensus 390 g~~~----~~~~----~~~~~lG~~~~d 409 (409) T protein:vir:94 390 IPEF----INKD----TIRDLTGIEGGE 409 (409) T ss_pred cccc----cchh----HHHHHcCCCCCC Confidence 3311 1222 244456665444 No 113 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=98.33 E-value=1.3e-06 Score=52.91 Aligned_cols=436 Identities=10% Similarity=0.059 Sum_probs=193.7 Q ss_pred CCCccccccH------------------HHHHHHHHHHHHHhhhHHHHHHHHHHHhccc-----c-cCCCCC---ccccc Q lcl|NC_011045. 1 MAEKRTGLAE------------------EGAKSVYERLKNDRAPYETRAQNCAQYTIPS-----L-FPKDSD---NASTD 53 (536) Q Consensus 1 Ma~~~~~~~~------------------~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~-----~-~~~~~~---~~~~~ 53 (536) |++-+-..+. +.+.+..+..+. -..+++.+.+|..-. + ...... ..... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~----~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~ 76 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKE----NIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKP 76 (478) T ss_pred CccccccCCchhhhHHHHHhhhccCChHHHHHHHHHHHHH----HHHHHHHHHHHhcccccccccchhhhcccccccccc Confidence 6664333333 233333333332 234455555555321 1 000000 00111 Q ss_pred ccccccchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHH Q lcl|NC_011045. 54 YVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEA 133 (536) Q Consensus 54 ~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~ 133 (536) ..++..+.+...++..++.|++ -| + .+...+.. 
....++.|+ .++|.....++ T Consensus 77 ~~ki~~n~~k~ivd~~~~yl~g--~p--~--~~~~~~~~---------~~~~l~~~~------------~n~~~~~~~~~ 129 (478) T protein:vir:10 77 DWRMYTNYHQNLVDQKVAYAVA--NP--V--TFGVDNDK---------ALKQIQHTL------------NHKWDDKLVDI 129 (478) T ss_pred cceeccchHHHHHHHHhhhhcc--cC--c--eeecCChH---------HHHHHHHHH------------hccHHHHHHHH Confidence 2244555666677776665543 12 1 22333221 111233333 26888999999 Q ss_pred HHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeC--CCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceE Q lcl|NC_011045. 134 LKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRD--AFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETI 211 (536) Q Consensus 134 ~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d--~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~ 211 (536) .++..+||.+.+++..+.. +.+++..++..+.+...| ..|++...+|.+...- ...+ T Consensus 130 ~~~~~~~G~~~~~v~~d~~-~~~~~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~--------------------~~~~ 188 (478) T protein:vir:10 130 LTAASNKGIEWVQPYVDEE-GEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELDG--------------------AERV 188 (478) T ss_pred HHHHhhCCeEEEEEEecCC-CceEEEEEcccceEEEEcCCCCCceEEEEEEEeeeC--------------------ceEE Confidence 9999999999887765544 346788888777665554 3577777766655320 1123 Q ss_pred EEEEE--E-EecCCCCceeE--EEEecCc--cccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHH Q lcl|NC_011045. 212 DVYTH--I-YLDEDSGEYIR--YEEVEGM--EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQE 284 (536) Q Consensus 212 ~v~~~--v-~p~~~~~~~~~--~~~v~g~--~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~ 284 (536) ++|+. | +.+.+++.... .....+. ........+++..+|++.++. 
+.+|.|-.+...+.+..++.+.- T Consensus 189 ~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liDa~~~~~S 263 (478) T protein:vir:10 189 EYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFKN-----NPQEVSDLFMYKTIIDALDKRLS 263 (478) T ss_pred EEEeCCcEEEEEecCCeeeccccccccccccceecccccccCCcceEEEecc-----CCCCCCcHHHHHHHHHHHHHHHH Confidence 33221 0 00111111110 0000111 111223346678899888765 35799999999999999999999 Q ss_pred HHHHHHHHHhCCceeeccccccchhhhcc--CCCcce-ecCC-cccccccccccccchhHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_011045. 285 AIVKMSMISSKVIGLVNPAGITQPRRLTK--AQTGDF-VTGR-PEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS 360 (536) Q Consensus 285 ~~~~~~~~a~~p~~lv~~~g~~~~~~~~~--~~~g~~-~~g~-~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~ 360 (536) ......+....|.+.+.--..-+..+... ...+.+ +.+. .+++.. +....+.......++.+++.|-..-..-. T Consensus 264 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--l~~~~~~~~~~~~~~~l~~~I~~~s~~p~ 341 (478) T protein:vir:10 264 DTQNTFDESVELIYILKGYEGEDMKDFMHNLKYYKAISVAGESGSGVDT--IKVEVPIDSVKEYTKMLRDYIIEFGQGVD 341 (478) T ss_pred HHHHHHHHhhCcceeeecCCcccccchhhhhhhCceeEecCCCCCcceE--EeecCCHHHHHHHHHHHHHHHHHHhCCcC Confidence 99999999999987664211111111111 112232 3332 233433 33344667777778887777654322111 Q ss_pred cc-cCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCCcceEEEEechHHHHHHHHH Q lcl|NC_011045. 361 AV-QRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-QQIPELPKEAVEPTISTGLEAIGRGQD 438 (536) Q Consensus 361 ~~-~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-g~lp~~~~~~v~v~~vs~La~a~r~~~ 438 (536) .. ...+...|+..+..+..-+... . .+. ...+.+.+.+++.++.+. |. ......++|+|.-.+..-. .+. 
T Consensus 342 ~~~~~~~~n~Sg~Ai~~~~~~l~~k-~---~~~-~~~~~~~l~~~~~li~~~~~~--~~d~~~i~i~f~~~~p~~~-~e~ 413 (478) T protein:vir:10 342 FQQDKFGNSPSGIALKFMYSNLDLK-A---NKL-KNKTLTALQELLQYIIDFYRL--DVRVQDIEITFNFNVMVNE-LEN 413 (478) T ss_pred cCccccccchHHHHHHHHHHHHHHH-H---HHH-HHHHHHHHHHHHHHHHHHhCC--CcccccceEEeCCCCCCCH-HHH Confidence 11 1112234555443322221111 1 111 122333344444443332 21 2333356776644443211 111 Q ss_pred HHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_011045. 439 LDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATA 518 (536) Q Consensus 439 ~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~ 518 (536) ++.+.. ++.+ +....++. .++. +--.++|++++.+++.+.++.. ... .. +. T Consensus 414 ~~~~~~----~~g~--------iS~et~i~----~~~~----v~d~~~E~~ri~~E~~~~~~~~---~~~-~~-----~~ 464 (478) T protein:vir:10 414 SQIAMN----STGL--------LSKETILG----NHSW----VQDPVAEMERIEQENIELNQQL---PDI-EE-----GL 464 (478) T ss_pred HHHHHH----HhCC--------CChHHHHH----hCCC----CCCHHHHHHHHHHHHHHHHHhc---ccc-CC-----CC Confidence 222111 1111 22222222 2221 1123466666655543322111 000 00 00 Q ss_pred CcchHHhhhhcCCC Q lcl|NC_011045. 519 SPEAMAAAADSVGL 532 (536) Q Consensus 519 ~~~~~~~~~~~~~~ 532 (536) +-+...+..+.+.- T Consensus 465 ~d~~~~~~~d~~~e 478 (478) T protein:vir:10 465 NDEQQRQSEDNQSE 478 (478) T ss_pred cccccccCcCCCCC Confidence 10111111111111 No 114 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=98.27 E-value=1.9e-06 Score=52.01 Aligned_cols=435 Identities=10% Similarity=0.018 Sum_probs=173.3 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccc----CCCCCccccc-ccccccchHHHHHHHHHHHHHH Q lcl|NC_011045. 
1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLF----PKDSDNASTD-YVTPWQAVGARGLNNLASKLML 75 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~----~~~~~~~~~~-~~~~~dst~~~a~~~Laa~l~~ 75 (536) |..... .+.++....++... .++.+.+.+|..-.-. +.......+. ..++..+-+...|+.++..|++ T Consensus 1 ~~~~t~---~~~~~~l~~~~~~~----~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~ 73 (456) T protein:vir:10 1 MTASTP---AEWLPVLTKRIDDG----MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) T ss_pred CCCCCH---HHHHHHHHHHHHHH----HHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhcc Confidence 544221 23344444444333 3444555555532210 0011111111 2344455566677766665532 Q ss_pred hhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCce Q lcl|NC_011045. 76 ALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) Q Consensus 76 ~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~ 155 (536) - |+ ++...+. .+....++ +.+.++++.....++.+++.+||.+.+++..+..+. T Consensus 74 ~-----~~-~~~~~~d--------~~~~~~~~-----------~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~- 127 (456) T protein:vir:10 74 N-----GI-TVGGSAD--------SDLALRAR-----------RIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT- 127 (456) T ss_pred C-----Ce-ecCCCCC--------cchHHHHH-----------HHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCc- Confidence 1 22 1211110 00011222 334468899999999999999999988887665543 Q ss_pred eeEEEEecceEEEeeCCCC--CeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEec Q lcl|NC_011045. 156 NPMKLYRLSSYVVQRDAFG--NVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVE 233 (536) Q Consensus 156 ~~~~~~~l~~~~v~~d~~G--~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~ 233 (536) .++++++..+.++..|+.- ++...+|.++.. ..-+.. ......+.....+..++...+ .....+.... 
T Consensus 128 ~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~-----d~~~~~----~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 197 (456) T protein:vir:10 128 ATITADSPETMVVSVDPLQPWRIRAAMRWWRDL-----DAESDF----AIVWSGDGWQKFARPCFVQSS-SRRRLVTRIS 197 (456) T ss_pred eEEEEEccceeEEEEcCCCCcceEEEEEEEEec-----CCceeE----EEEEeccceeEEEEEEEEeec-ccceeeeecC Confidence 4678888888777777543 333444443310 000000 000011111111111111101 0111111112 Q ss_pred CccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec----------cc Q lcl|NC_011045. 234 GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN----------PA 303 (536) Q Consensus 234 g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~----------~~ 303 (536) |..+......+.+..+|++.+ .+..|.|-.+..++-+..++...-..+..++..+.|...+. .+ T Consensus 198 ~~~~~~~~~~~~~~~~pvv~~------~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~ 271 (456) T protein:vir:10 198 DSWVPVGDAVVTGSPPPVVVY------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDEN 271 (456) T ss_pred CceeeccccCCCCCceeEEEe------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccccccc Confidence 222222222333445555443 23578999999999999999877777777776666553331 11 Q ss_pred cc-cchhhhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHHHHHHH Q lcl|NC_011045. 304 GI-TQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAF-MLNSAVQRTGERVTAEEIRYVASEL 381 (536) Q Consensus 304 g~-~~~~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~TAtEi~~r~~E~ 381 (536) |. .++.+......|.+.... .+..+.++. 
.++++.....+..+...|...- +-+.....+....|+.-+.....-+ T Consensus 272 g~~~~~~~~~~~~~~~~~~~~-~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l 349 (456) T protein:vir:10 272 GNAIDYASIFEAAPGALWELP-PGVDIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGF 349 (456) T ss_pred ccccchhhhhhhhccccccCC-CCcceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHH Confidence 10 112222233344433222 222333332 2344443333433333332110 0001000112233554433332222 Q ss_pred HHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHH--HHHHHHHHHHHHHHHHHHhhcchhhhh Q lcl|NC_011045. 382 EDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEA--IGRGQDLDKLERCVAAWAALAPMRDDP 459 (536) Q Consensus 382 ~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~--a~r~~~~~~l~~~~~~~~~~~p~~~~~ 459 (536) .. ..++.+ ..+.+-+.+++.++.+..-. .....+++.+.-++.. ++.+..+.++ . +++ T Consensus 350 ~~----k~~~~~-~~f~~~l~~~~rl~~~~~g~--~~~~~~~v~w~~~~~~~~~~~ada~~kl---~----~~g------ 409 (456) T protein:vir:10 350 LF----KCEDRL-SIAKIGLEAILVKALQIEGE--SVEDTVDVSFESPDRVTLGEKYSAASLA---K----AAG------ 409 (456) T ss_pred HH----HHHHHH-HHHHHHHHHHHHHHHHhcCC--CcccceeEEecCCCCcCHHHHHHHHHHH---H----HcC------ Confidence 22 122222 23444455555555442211 2233577777555432 2222222221 1 111 Q ss_pred cCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHH Q lcl|NC_011045. 460 DINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMA 524 (536) Q Consensus 460 ~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~ 524 (536) +=..+. +...+|+++. ++++...++... +..+.++++++.+.-+ +.. 
T Consensus 410 -i~~~~~---~~~~lg~~~~-------~i~~~e~er~~~-----e~~~~~~~~~~~~~~~--~~~ 456 (456) T protein:vir:10 410 -ESWASI---RRNILNYNAD-------QIKQDDLDRARE-----QITLFAGNPVQRPQED--GSR 456 (456) T ss_pred -CChHHH---HHhhCCCCHH-------HHHHHHHHHHHH-----HHHHHhhhhhhcCCCC--CCC Confidence 101111 1234576543 332221111111 1111111221111111 111 No 115 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=98.27 E-value=1.9e-06 Score=52.01 Aligned_cols=435 Identities=10% Similarity=0.018 Sum_probs=173.3 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccc----CCCCCccccc-ccccccchHHHHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLF----PKDSDNASTD-YVTPWQAVGARGLNNLASKLML 75 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~----~~~~~~~~~~-~~~~~dst~~~a~~~Laa~l~~ 75 (536) |..... .+.++....++... .++.+.+.+|..-.-. +.......+. ..++..+-+...|+.++..|++ T Consensus 1 ~~~~t~---~~~~~~l~~~~~~~----~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~ 73 (456) T protein:vir:10 1 MTASTP---AEWLPVLTKRIDDG----MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) T ss_pred CCCCCH---HHHHHHHHHHHHHH----HHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhcc Confidence 544221 23344444444333 3444555555532210 0011111111 2344455566677766665532 Q ss_pred hhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCce Q lcl|NC_011045. 76 ALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) Q Consensus 76 ~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~ 155 (536) - |+ ++...+. .+....++ +.+.++++.....++.+++.+||.+.+++..+..+. 
T Consensus 74 ~-----~~-~~~~~~d--------~~~~~~~~-----------~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~- 127 (456) T protein:vir:10 74 N-----GI-TVGGSAD--------SDLALRAR-----------RIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT- 127 (456) T ss_pred C-----Ce-ecCCCCC--------cchHHHHH-----------HHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCc- Confidence 1 22 1211110 00011222 334468899999999999999999988887665543 Q ss_pred eeEEEEecceEEEeeCCCC--CeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEec Q lcl|NC_011045. 156 NPMKLYRLSSYVVQRDAFG--NVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVE 233 (536) Q Consensus 156 ~~~~~~~l~~~~v~~d~~G--~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~ 233 (536) .++++++..+.++..|+.- ++...+|.++.. ..-+.. ......+.....+..++...+ .....+.... T Consensus 128 ~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~-----d~~~~~----~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 197 (456) T protein:vir:10 128 ATITADSPETMVVSVDPLQPWRIRAAMRWWRDL-----DAESDF----AIVWSGDGWQKFARPCFVQSS-SRRRLVTRIS 197 (456) T ss_pred eEEEEEccceeEEEEcCCCCcceEEEEEEEEec-----CCceeE----EEEEeccceeEEEEEEEEeec-ccceeeeecC Confidence 4678888888777777543 333444443310 000000 000011111111111111101 0111111112 Q ss_pred CccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec----------cc Q lcl|NC_011045. 234 GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN----------PA 303 (536) Q Consensus 234 g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~----------~~ 303 (536) |..+......+.+..+|++.+ .+..|.|-.+..++-+..++...-..+..++..+.|...+. 
.+ T Consensus 198 ~~~~~~~~~~~~~~~~pvv~~------~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~ 271 (456) T protein:vir:10 198 DSWVPVGDAVVTGSPPPVVVY------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDEN 271 (456) T ss_pred CceeeccccCCCCCceeEEEe------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccccccc Confidence 222222222333445555443 23578999999999999999877777777776666553331 11 Q ss_pred cc-cchhhhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCCCCCHHHHHHHHHHH Q lcl|NC_011045. 304 GI-TQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAF-MLNSAVQRTGERVTAEEIRYVASEL 381 (536) Q Consensus 304 g~-~~~~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af-~~~~~~~~~~~r~TAtEi~~r~~E~ 381 (536) |. .++.+......|.+.... .+..+.++. .++++.....+..+...|...- +-+.....+....|+.-+.....-+ T Consensus 272 g~~~~~~~~~~~~~~~~~~~~-~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l 349 (456) T protein:vir:10 272 GNAIDYASIFEAAPGALWELP-PGVDIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGF 349 (456) T ss_pred ccccchhhhhhhhccccccCC-CCcceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHH Confidence 10 112222233344433222 222333332 2344443333433333332110 0001000112233554433332222 Q ss_pred HHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHH--HHHHHHHHHHHHHHHHHHhhcchhhhh Q lcl|NC_011045. 382 EDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEA--IGRGQDLDKLERCVAAWAALAPMRDDP 459 (536) Q Consensus 382 ~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~--a~r~~~~~~l~~~~~~~~~~~p~~~~~ 459 (536) .. ..++.+ ..+.+-+.+++.++.+..-. .....+++.+.-++.. ++.+..+.++ . 
+++ T Consensus 350 ~~----k~~~~~-~~f~~~l~~~~rl~~~~~g~--~~~~~~~v~w~~~~~~~~~~~ada~~kl---~----~~g------ 409 (456) T protein:vir:10 350 LF----KCEDRL-SIAKIGLEAILVKALQIEGE--SVEDTVDVSFESPDRVTLGEKYSAASLA---K----AAG------ 409 (456) T ss_pred HH----HHHHHH-HHHHHHHHHHHHHHHHhcCC--CcccceeEEecCCCCcCHHHHHHHHHHH---H----HcC------ Confidence 22 122222 23444455555555442211 2233577777555432 2222222221 1 111 Q ss_pred cCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHH Q lcl|NC_011045. 460 DINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMA 524 (536) Q Consensus 460 ~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~ 524 (536) +=..+. +...+|+++. ++++...++... +..+.++++++.+.-+ +.. T Consensus 410 -i~~~~~---~~~~lg~~~~-------~i~~~e~er~~~-----e~~~~~~~~~~~~~~~--~~~ 456 (456) T protein:vir:10 410 -ESWASI---RRNILNYNAD-------QIKQDDLDRARE-----QITLFAGNPVQRPQED--GSR 456 (456) T ss_pred -CChHHH---HHhhCCCCHH-------HHHHHHHHHHHH-----HHHHHhhhhhhcCCCC--CCC Confidence 101111 1234576543 332221111111 1111111221111111 111 No 116 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=98.21 E-value=2.5e-06 Score=51.30 Aligned_cols=375 Identities=13% Similarity=0.096 Sum_probs=173.6 Q ss_pred hhhHHHHHHHHHHHhcccccCCC-C--Ccc-cccccccccchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccC Q lcl|NC_011045. 24 RAPYETRAQNCAQYTIPSLFPKD-S--DNA-STDYVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSD 99 (536) Q Consensus 24 R~~~e~~w~e~~~~~~P~~~~~~-~--~~~-~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~ 99 (536) =+-+.++-+.+.+|..-..-... + -+. -+..-+...+-+..+|++||..|.- . .+...|. 
T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~----~----Gf~~~d~-------- 64 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEAPTGITIPAHIRAKYQAVLGWAAKGVDSLADRLIF----R----AFANDDF-------- 64 (410) T ss_pred CCcchhhHHHHHHHhcCCCCccccchhccHHHHhHHHhhcchhHHHHHHhHhhhcc----c----cccCCCc-------- Confidence 12223333344444432211111 1 011 1111234456677777777765531 1 1111111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCCC-CCeEE Q lcl|NC_011045. 100 PDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAF-GNVLQ 178 (536) Q Consensus 100 ~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~-G~v~~ 178 (536) .+. +...+++|.....++.++..+||.+.++|-.+..+.+ ++++++..+.++..|+. +++.. T Consensus 65 -----~l~-----------~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~-~i~~~sP~~~~~i~Dp~~~~~~~ 127 (410) T protein:vir:95 65 -----NVT-----------EIFDRNNPDIFFDSAILSALIGSCSFVYISKGEDDEV-RLQVIESSNATGVIDPITGLLVE 127 (410) T ss_pred -----hHH-----------HHHhhcChHHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEEeCCCCceEE Confidence 123 2344699999999999999999999999876655443 67888888777777763 44444 Q ss_pred EEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEE---EEEecCCCCceeEEEEecCccccccccccccccCceEEEe Q lcl|NC_011045. 179 MVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYT---HIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIR 255 (536) Q Consensus 179 i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~---~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~r 255 (536) -++...- .++.....+.+|+ .++.++++..| .-.+++..||++++. T Consensus 128 al~~~~~-----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~g~vPvV~f~ 176 (410) T protein:vir:95 128 GYAVLAR-----------------DDYNRPTLEAYFEPNATHFIPKDGEPY--------------SVTNETGIPLLVPVI 176 (410) T ss_pred EEEEEEe-----------------cCCCeEEEEEEEeCCcEEEEeeCCccc--------------cccCCCCCcceEEec Confidence 3332110 0011111222221 01111111111 112445689999999 Q ss_pred eeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHHhCCceee---ccccccchhhhccCCCcceec--CCccc--c Q lcl|NC_011045. 
256 MVRLDGESYGRSYI-EEYLGDLRSLENLQEAIVKMSMISSKVIGLV---NPAGITQPRRLTKAQTGDFVT--GRPED--I 327 (536) Q Consensus 256 w~~~~ge~YGrgp~-~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv---~~~g~~~~~~~~~~~~g~~~~--g~~~~--~ 327 (536) .+...++.||+|=. +..++-+..++...-.++..++....|...+ .+++. +.+......|.+.. .+.++ + T Consensus 177 n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~--~~~~~~~~~~~i~~~~~~~~~~~~ 254 (410) T protein:vir:95 177 HRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQKYILGLDPDAE--PMEKWKATVSSLLTISSSDKGVKP 254 (410) T ss_pred ccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeeccCCCCC--cCchhhhhhhhheeccCCCCCCcc Confidence 98888999999954 6788999999999999999999999997444 22222 11222233344432 22221 1 Q ss_pred cccccccccchhHHHHHHHHHHHHHHHHHhhhh-----cccCCCCCCCHHHHHH-------HHHHHHHHhhhhHHHHHHH Q lcl|NC_011045. 328 SFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-----AVQRTGERVTAEEIRY-------VASELEDTLGGVYSILSQE 395 (536) Q Consensus 328 ~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-----~~~~~~~r~TAtEi~~-------r~~E~~~~LG~v~~rl~~E 395 (536) .+-++ ..++++.. ++.++.-+........ +.......-+|.-|.+ +++++.+.+|.-+.+ T Consensus 255 ~v~q~-~~~~l~~~---~~~l~~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~---- 326 (410) T protein:vir:95 255 SVGQF-TTASMSPF---TEQLRTAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLN---- 326 (410) T ss_pred eEEec-CCCChHHH---HHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Confidence 22223 23456543 4444444433222211 1101111123333332 223333333332222 Q ss_pred HHHHHHHHHHHHHHhcCCCCCCCCcc--eEEEEe---chH--HHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHH Q lcl|NC_011045. 396 LQLPLVRVLLKQLQATQQIPELPKEA--VEPTIS---TGL--EAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKL 468 (536) Q Consensus 396 ~l~Pli~r~~~il~~~g~lp~~~~~~--v~v~~v---s~L--a~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~ 468 (536) ++..++.+. +..+..+.+. ++|..- -|- ..++.+..+.+|. ++.|-. 
++.+ T Consensus 327 ----~~rla~~i~---~~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~Kl~-------~a~~g~----~~~~---- 384 (410) T protein:vir:95 327 ----VAYVAACLR---DEFRYTRSQFVRTAVKWEPLFEADANTMTMIGDGVVKLN-------QALPGY----INAE---- 384 (410) T ss_pred ----HHHHHHHHh---cCCCCcccccceeeEEeeecCCcchhhHHHHHHHHHHHH-------HhccCC----ccHH---- Confidence 233333333 3334344333 344443 121 2233444433332 222211 1222 Q ss_pred HHHHHcCCChhhccCCHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 469 RIANAIGIDTSGILLTEEQKQQKMAQQSMQMGM 501 (536) Q Consensus 469 ~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~ 501 (536) .+.+.+|.++ ++.+.++.+++.++.+ T Consensus 385 ~~~~~lg~~~-------~~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 385 TIRDLTGIAG-------DMSAKPVVSEGGSNGE 410 (410) T ss_pred HHHHhcCCCh-------HHHHHHHHHHHHhCCC Confidence 2445567633 3332222222211111 No 117 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=98.17 E-value=3.2e-06 Score=50.73 Aligned_cols=377 Identities=14% Similarity=0.106 Sum_probs=181.6 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccC-CC--CCc-ccccccccccchHHHHHHHHHHHHHHhhcCCCcc Q lcl|NC_011045. 8 LAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFP-KD--SDN-ASTDYVTPWQAVGARGLNNLASKLMLALFPMQTW 83 (536) Q Consensus 8 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~-~~--~~~-~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~W 83 (536) |+.+.+..+.+++...+ ++.+.+.+|..-..-. .- .-+ .-+..-+..-+-+..+|+++|..|. +.+ T Consensus 1 ~~~~~i~~L~~~~~~~~----~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~----~~G-- 70 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHK----RRAEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLV----FRE-- 70 (409) T ss_pred CCHHHHHHHHHHHHHHh----HHHHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhcc----ccc-- Confidence 77777777766665544 3333444443221100 00 011 1111123344566777777766542 111 Q ss_pred eeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEec Q lcl|NC_011045. 
84 MRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRL 163 (536) Q Consensus 84 f~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l 163 (536) |+ ..|. .++ +...+++|.....++.++..+||.+.++|-.+..+. .++++++. T Consensus 71 f~--~~d~-------------~l~-----------~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~-~~i~~~sP 123 (409) T protein:vir:16 71 FE--NDDF-------------TVN-----------EIFEENNPDIFFDSTVLSALIASCSFTYISKGENDA-VRLQVIEA 123 (409) T ss_pred cc--Ccch-------------HHH-----------HHHHhcChhHHHHHHHHHHHHhCceeEEEecCCCCc-eEEEEEcc Confidence 11 1111 122 334569999999999999999999999988766554 36778887 Q ss_pred ceEEEeeCC-CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccccccc Q lcl|NC_011045. 164 SSYVVQRDA-FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDG 242 (536) Q Consensus 164 ~~~~v~~d~-~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~ 242 (536) .+.++..|+ .+++...++...- ..+.+ ...+....|+ ..+ ++....+.. ..- T Consensus 124 ~~~~~i~D~~~~~~~~a~~~~~~------------------d~~~~--~~~~~~~~~~---~~~-~~~~~~~~~---~~~ 176 (409) T protein:vir:16 124 TNATGIIDPITGLLTEGYAVLER------------------DENNN--VVLEAHFLPD---RTD-YYYRDSRNN---ISI 176 (409) T ss_pred cceEEEeecccccceeeeEEEEe------------------cCCCc--eEEEEEEecC---cEE-EEEecCccc---cce Confidence 777777775 3444443332110 00001 1111111121 111 111111110 111 Q ss_pred ccccccCceEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHHhCCceeec---cccccchhhhccCCCcc Q lcl|NC_011045. 243 TYPKEACPYIPIRMVRLDGESYGRSYI-EEYLGDLRSLENLQEAIVKMSMISSKVIGLVN---PAGITQPRRLTKAQTGD 318 (536) Q Consensus 243 ~~~~~~~P~~~~rw~~~~ge~YGrgp~-~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~---~~g~~~~~~~~~~~~g~ 318 (536) .+++..||++.+..+...++.||+|=. +..++-+..+|...-.++..++....|...+- ++| ++.+......|. 
T Consensus 177 ~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~--~~~~~~~~~~~~ 254 (409) T protein:vir:16 177 ANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDA--EPMETWKATVSS 254 (409) T ss_pred ecCCCCcceEEecccccccccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCC--CccchhhhhhhH Confidence 345678999999999888999999955 66889999999999999999999999985541 222 111222233344 Q ss_pred ee--cCCccc--ccccccccccchhHHHHHHHHHHHHHHHHHhhhh-----cccCCCCCCCHHHHHH-------HHHHHH Q lcl|NC_011045. 319 FV--TGRPED--ISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-----AVQRTGERVTAEEIRY-------VASELE 382 (536) Q Consensus 319 ~~--~g~~~~--~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-----~~~~~~~r~TAtEi~~-------r~~E~~ 382 (536) +. +...++ +.+-++. .++++. .++.++.-|........ +.......-+|.-|.+ +++++. T Consensus 255 i~~~~~d~~g~~~~v~q~~-~~~l~~---~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~ 330 (409) T protein:vir:16 255 MLQFTKDEDGDKPTLGQFT-QPSMSP---FTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQ 330 (409) T ss_pred hhccCCCCCCCCceEEecC-CCChhH---HHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHH Confidence 43 222221 2222332 345553 34455444443322211 1101111123433332 233333 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCc--ceEEEEechH-----HHHHHHHHHHHHHHHHHHHHhhcch Q lcl|NC_011045. 383 DTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKE--AVEPTISTGL-----EAIGRGQDLDKLERCVAAWAALAPM 455 (536) Q Consensus 383 ~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~--~v~v~~vs~L-----a~a~r~~~~~~l~~~~~~~~~~~p~ 455 (536) +.+|..+.+ ++..++.+. |..+..+.+ .+++.+--.. ..++.+..+.+|.+. +|. T Consensus 331 ~~fg~~l~~--------~~rla~~~~---~~~~~~~~~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a-------~~~ 392 (409) T protein:vir:16 331 RSLGAGLLN--------VAYLAACLR---DDVPYLREQFSKTKPKWEPLFEADASMLSLIGDGAIKLNQA-------IPE 392 (409) T ss_pred HHHHHHHHH--------HHHHHHHHh---cCCCccchhhccceEEecCCCCcchhhHHHHHHHHHHHHhh-------ccc Confidence 333333332 223333333 333444443 3455554221 234444444444432 221 Q ss_pred hhhhcCCHHHHHHHHHHHcCCChhh Q lcl|NC_011045. 
456 RDDPDINLAMIKLRIANAIGIDTSG 480 (536) Q Consensus 456 ~~~~~id~d~~~~~~a~~~Gv~p~~ 480 (536) . .+.+- +.+.+|.+.+. T Consensus 393 ~----~~~~v----~~~~~g~~~~d 409 (409) T protein:vir:16 393 F----INKDT----IRDLTGIKGAE 409 (409) T ss_pred c----cchhH----HHHhccCCCCC Confidence 1 12222 23344554333 No 118 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=98.11 E-value=4.3e-06 Score=50.00 Aligned_cols=470 Identities=10% Similarity=-0.004 Sum_probs=208.7 Q ss_pred CCCccccccHHHHHHHHHHH-HHHh-hhHHHHHHHHHHHhcccc---cCCC---CCcc------cccccccccchHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERL-KNDR-APYETRAQNCAQYTIPSL---FPKD---SDNA------STDYVTPWQAVGARGL 66 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l-~~~R-~~~e~~w~e~~~~~~P~~---~~~~---~~~~------~~~~~~~~dst~~~a~ 66 (536) |...-..+..+.+...+... .... +...++.+.+.+|..-.- .... +..+ .+...|+..+-+...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~Iv 80 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTELV 80 (537) T ss_pred CCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHHH Confidence 88777777776666555332 2111 222455566677764421 0100 0011 1112356666777777 Q ss_pred HHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEE Q lcl|NC_011045. 67 NNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLY 146 (536) Q Consensus 67 ~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~ 146 (536) +..++.|++. |. .++..+... 
.++ .+.+...+ ..+|.....++.+++.++|.|.+| T Consensus 81 d~~~~yl~G~--Pv----~~~~~d~~~----------~e~-------~~~l~~~~-~~~~~~~~~el~~~~s~~G~ay~~ 136 (537) T protein:vir:78 81 DQLAQYLLSN--GV----EVKVKDEDN----------TQL-------DEILQEYF-DEDFQATIDTLVTNASKKGFEGIF 136 (537) T ss_pred HHHhhhhccc--Cc----eeecCcchh----------HHH-------HHHHHHHh-hccHHHHHHHHHHHHhhcCeeEEE Confidence 7777766432 32 122222111 111 12222222 367778889999999999999776 Q ss_pred EecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEE----EecCC Q lcl|NC_011045. 147 LPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHI----YLDED 222 (536) Q Consensus 147 ~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v----~p~~~ 222 (536) +..+.. +.+++..++.-+.+...|..|....++|-+.....+-. ....+.-..+++|+-- +.... T Consensus 137 ~y~de~-~~~~~~~i~p~~~~pv~d~~~~~~~~~~~y~~~~~~~~----------~~~~~~~~~~evyt~~~i~~y~~~~ 205 (537) T protein:vir:78 137 ARTTSE-GKLKFQTVDGLTLIPVFDDYGVLKMIIRWYSEIRYSTK----------QQSTETIWHADVWNEEAVCYYIQDD 205 (537) T ss_pred eeecCC-CceEEEEEccceeEEEEcCCCCceeEEEEEeeeecccc----------ccCcceEEEEEEEcCCcEEEEEecC Confidence 554444 34678888888888888888888888877765532211 0011111233333210 11111 Q ss_pred CCce-------------eEEEEec--------CccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHH Q lcl|NC_011045. 223 SGEY-------------IRYEEVE--------GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLEN 281 (536) Q Consensus 223 ~~~~-------------~~~~~v~--------g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~ 281 (536) ++.+ .++.... +.......-.++|..+|++.++= +.+|.|=.++..+-+-.++. 
T Consensus 206 ~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~n-----n~~~~sd~e~v~~LiDayd~ 280 (537) T protein:vir:78 206 EGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYN-----NKDGMSDVKRVKSIIDDYDV 280 (537) T ss_pred CcccccccccccccccccceeeeccccccccccccccccccccCCcceeEEEecc-----CccCCCchhhhHHHHHHHHH Confidence 1100 0011000 00111122335677888876654 45789999999999999999 Q ss_pred HHHHHHHHHHHHhCCceeeccccccchhhhcc-C-CCcce-ecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_011045. 282 LQEAIVKMSMISSKVIGLVNPAGITQPRRLTK-A-QTGDF-VTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML 358 (536) Q Consensus 282 l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~-~-~~g~~-~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~ 358 (536) +.-......+...+|.+.+.-.+..+..++.. . ..+.+ +.+..+++.. +....+.......++.+++.|-+.-+. T Consensus 281 ~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l~~~~~i~v~~d~~~v~~--l~~~~~~~~~e~~ld~L~~~I~~~s~~ 358 (537) T protein:vir:78 281 MNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNIKAKKMIGVNGDNAGMEI--QTVSIPYEARKAKMDIDVENIYRSGMG 358 (537) T ss_pred HHHhhhhHHHHhcCceeeeecCCCccchhHHHHHhhcCceeecCCCCceeE--EEecCCHHHHHHHHHHHHHHHHHhcCC Confidence 99999999999999987775333322222211 1 12333 3344455543 344456777778888888777442221 Q ss_pred hhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHH Q lcl|NC_011045. 359 NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQD 438 (536) Q Consensus 359 ~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~ 438 (536) ...........|..-+..+-.-+ .+-.....+.-.+.+.-++.-++.++...|. .......++++|.-.+-.-. ... 
T Consensus 359 ~~~~~~~~gn~SGvAlk~~~~~l-~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~-~~~d~~~i~i~f~~~~P~n~-~e~ 435 (537) T protein:vir:78 359 FNSTAVGDGNVTNVVIKSRYTLL-AMKARKMETSLRKVLRWCADMVVSDIALRGL-GEYDSNDICFEIEPHVLANE-LDI 435 (537) T ss_pred CCCccccccCCcHHHHHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC-cccccceeeEEeccCCCCCH-HHH Confidence 11111222334443333221111 1222333333333333344444444433332 23444567887776554322 111 Q ss_pred HHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHH-----HHHHHHH----- Q lcl|NC_011045. 439 LDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGM-----DNGAAAL----- 508 (536) Q Consensus 439 ~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~-----~~~a~~~----- 508 (536) ++.+.... ..+.++.+. ++.. ++ ++-+.++.+.+.++..+...- ..+.++. T Consensus 436 a~~~~~l~-~~giiS~eT---------~l~~----~p-----~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~ 496 (537) T protein:vir:78 436 ATTRKTEA-ETEALKIGN---------IMTV----AP-----RIGDDETLKLIAEELDLDYNELKDALAEQDAQSLDVSP 496 (537) T ss_pred HHHHHHHH-hcCcchHHH---------HHHh----CC-----CCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccccCcCc Confidence 11111111 001111111 1111 11 111111111111111000000 0000000 Q ss_pred -HHHHHHhhhcC--cchHHhhhh------------cCCCCC Q lcl|NC_011045. 509 -AQGMAAQATAS--PEAMAAAAD------------SVGLQP 534 (536) Q Consensus 509 -~~~~~~~~~~~--~~~~~~~~~------------~~~~q~ 534 (536) ......-.+.+ +++.-..+. 
++++|- T Consensus 497 ~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:78 497 DVQAMLDGLPVNANQPPVDPNQPVADPNVVPPTDPNAVPQT 537 (537) T ss_pred chhhhcCCCCCCCCCCCCCccCCCCCCCCCCCCCCccCCCC Confidence 00000000000 000000000 011111 No 119 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=98.07 E-value=5.3e-06 Score=49.52 Aligned_cols=429 Identities=10% Similarity=0.038 Sum_probs=192.7 Q ss_pred CCCc------------------cccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcc--cccCCCCC----cccccccc Q lcl|NC_011045. 1 MAEK------------------RTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIP--SLFPKDSD----NASTDYVT 56 (536) Q Consensus 1 Ma~~------------------~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P--~~~~~~~~----~~~~~~~~ 56 (536) |++. ...++.+.+.+..+.... |.++...|+++|+=.-+ .+-..... ...+...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~k 79 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITKHKE-NVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWR 79 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccccccccccccccccc Confidence 5543 223445556555555554 44455555555543211 11111000 01111235 Q ss_pred cccchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHH Q lcl|NC_011045. 57 PWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQ 136 (536) Q Consensus 57 ~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~d 136 (536) +..+-+...++..++.|++ -| + .++..+... 
...++.|+ ..||...+.++.++ T Consensus 80 i~~n~~~~Iv~~~~~~l~g--~p--~--~~~~~d~~~---------~~~l~~~~------------~n~~~~~~~~~~~~ 132 (468) T protein:vir:96 80 MYTNYHQNLVDQKVAYAVA--NP--V--TYGTEDEKS---------LKTIQEVL------------NHKWDDKLVDILTA 132 (468) T ss_pred cccchHHHHHHHHHhhhcc--CC--c--eeccCChHH---------HHHHHHHH------------hcCHHHHHHHHHHH Confidence 5556666666666655432 12 1 223333211 11233332 25788889999999 Q ss_pred HHhhCcEEEEEecCCCCceeeEEEEecceEEEeeC--CCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEE Q lcl|NC_011045. 137 LVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRD--AFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVY 214 (536) Q Consensus 137 l~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d--~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~ 214 (536) ..+||.+++++..+.. +.+++.+++..+.+...| ..|++...+|.+...- ...+++| T Consensus 133 ~~~~G~~~~~v~~d~~-~~~~i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~~~--------------------~~~~~~~ 191 (468) T protein:vir:96 133 ASNKGVEWIQPYVDEQ-GEFKTFRVPAEQAIPIWTNKERDELKAFIRLYELDG--------------------GERVEYW 191 (468) T ss_pred HhhcCeEEEEEEEcCC-CceEEEEEcccceEEEEcCCCCCceEEEEEEEEecC--------------------ceEEEEE Confidence 9999999877665544 346778888777665544 3567776666654321 0112222 Q ss_pred EE----EEecCCCCcee--EEEEecCc--cccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 215 TH----IYLDEDSGEYI--RYEEVEGM--EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAI 286 (536) Q Consensus 215 ~~----v~p~~~~~~~~--~~~~v~g~--~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~ 286 (536) +. .+.. .++... ......+. ........+++..+|++.++ .+.+|.|-.+...+-+..++.+.-.. 
T Consensus 192 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~ 265 (468) T protein:vir:96 192 TANDVTFYEL-KDGQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPFK-----NNPQEVSDLFMYKTIIDAMDKRLSDT 265 (468) T ss_pred eCCeEEEEEE-cCCceeecccccccccccceeeccccccCCcccEEEec-----CCCCCCCchHHHHHHHHHHHHHHHHH Confidence 11 0011 111110 00111110 01111223556788888664 35679999999999999999999999 Q ss_pred HHHHHHHhCCceeeccccccchhhhcc-C-CCcce-ecCCc-ccccccccccccchhHHHHHHHHHHHHHHHHHhh-hhc Q lcl|NC_011045. 287 VKMSMISSKVIGLVNPAGITQPRRLTK-A-QTGDF-VTGRP-EDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-NSA 361 (536) Q Consensus 287 ~~~~~~a~~p~~lv~~~g~~~~~~~~~-~-~~g~~-~~g~~-~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~ 361 (536) ....+....|.+++.-...-+...+.. . ..+.+ +.+.. +++. .+....+.......++.++..|...-.. +.. T Consensus 266 ~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~d~~~~~~--~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~ 343 (468) T protein:vir:96 266 QNTFDEATELIYVLKGYEGEDLEEFMYNLKYYKAINVDGDGSGGVD--TIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQ 343 (468) T ss_pred HHHHHHhcCceeeeecCCccccchhhhhhhcCceEEecCCCCCcce--EEeecCChHHHHHHHHHHHHHHHHHhCccccc Confidence 999999999987765222211122111 1 12222 22322 2333 3333345666777777777766543221 111 Q ss_pred ccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHH Q lcl|NC_011045. 362 VQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDK 441 (536) Q Consensus 362 ~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~ 441 (536) ....+...|+..+..+..-+... .....+ .+...+.+++.++.+..-. ......+.|+|.-.+..-. .+.++. 
T Consensus 344 ~~~~~~n~Sg~Alk~~~~~l~~k-~~~k~~----~~~~~l~~~~~li~~~~g~-~~d~~~i~i~f~~~~p~d~-~e~a~~ 416 (468) T protein:vir:96 344 QDKFGNSPSGIALKFMYSNLDLK-ANKLKN----KTLTALQELLQYIIDFYKL-SIKVQDVEITFNFNVMVNE-LEQSQI 416 (468) T ss_pred ccccccchHHHHHHHHHHHHHHH-HHHHHH----HHHHHHHHHHHHHHHHhCC-CcccceeeEEecCCCCcCH-HHHHHH Confidence 11112344565544332221111 111222 2333344444444332111 1223356666654443221 111221 Q ss_pred HHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcc Q lcl|NC_011045. 442 LERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPE 521 (536) Q Consensus 442 l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~ 521 (536) + ...+ .+....++..+ -++ --.++|++++.+++++.++.+. ...+ T Consensus 417 ~----~~~g---------~iS~et~i~~l---~~v-----~D~~~E~~ri~~E~~~~~~~~~---~~~~----------- 461 (468) T protein:vir:96 417 G----VNSQ---------YLSKETVVTNH---PWV-----DDPVAEMERIDQEELALPSIEE---GLNG----------- 461 (468) T ss_pred H----HhcC---------CCchHHHHHhC---CCC-----CCHHHHHHHHHHHHHHHHHHhh---ccCC----------- Confidence 1 1111 12223333221 122 1125677666554432221111 0100 Q ss_pred hHHhhhhcCCCCCC Q lcl|NC_011045. 522 AMAAAADSVGLQPG 535 (536) Q Consensus 522 ~~~~~~~~~~~q~~ 535 (536) ...=+|- T Consensus 462 -------~~~~~~~ 468 (468) T protein:vir:96 462 -------KENNEPT 468 (468) T ss_pred -------CCCCCCC Confidence 0011111 No 120 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=98.05 E-value=5.8e-06 Score=49.28 Aligned_cols=435 Identities=9% Similarity=0.065 Sum_probs=196.6 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccccccc-ccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_011045. 
1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYV-TPWQAVGARGLNNLASKLMLALFP 79 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~-~~~dst~~~a~~~Laa~l~~~ltP 79 (536) |.=+....+=.....+|+.+++--.- ...+++...-.||..-..+...-..++. -.|-+...+.++. |++.+|- T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G-~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~----~~G~vf~ 75 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLG-QREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSA----LSGMVLD 75 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcC-hHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHH----Hhchhhc Confidence 77544444445566666666554322 3455555555566542221111112222 2344444444444 4444552 Q ss_pred CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEE Q lcl|NC_011045. 80 MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMK 159 (536) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~ 159 (536) -.|=+ +.++. +..+.. -..-.+.+.-+...+.+...+|-+.++||-+..+.-=.+. T Consensus 76 k~p~~--~~p~~--------------l~~~~~--------D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~ 131 (452) T protein:vir:94 76 QPPVI--THPDA--------------MSKYFE--------DQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYIS 131 (452) T ss_pred CCcee--cccHH--------------HHHHHh--------cccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEE Confidence 22322 22221 111110 2346788899999999999999999999977554322345 Q ss_pred EEecceE-EEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEE--EecC-- Q lcl|NC_011045. 160 LYRLSSY-VVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYE--EVEG-- 234 (536) Q Consensus 160 ~~~l~~~-~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~--~v~g-- 234 (536) .|+..+. -++.+..|+..-+..|++...++-.++|+.... +.|.+.... ++.|.++. 
..++ T Consensus 132 ~~~~~~Ii~W~~~~~g~l~~v~lre~~~~~d~~d~f~~~~~------------~~yRvL~l~--~g~~~v~~~~~~~~~~ 197 (452) T protein:vir:94 132 VYTTENILNWEEDEDGRLLMVVLREFYTVRDTADRYVQNIR------------VRYRCLELV--DGLLQITVHETQDGKV 197 (452) T ss_pred EechhhhcCccccccCCeeEEEEEEEEEEecCCCcccceeE------------EEEEEEEEe--CCeEEEEEEEccCCce Confidence 5553332 244466676665655666555444445554332 222211111 12222211 1111 Q ss_pred ----ccccccccccccccCceEEEeeeecCCC--ccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccch Q lcl|NC_011045. 235 ----MEVQGSDGTYPKEACPYIPIRMVRLDGE--SYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQP 308 (536) Q Consensus 235 ----~~i~~~~~~~~~~~~P~~~~rw~~~~ge--~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~ 308 (536) ....-..+.. .+++|++.|....+. ..|..|..+..--...+-..+-..-..+..+..|.+.+. |..+- T Consensus 198 ~~~~~~~~~~~~~~---~l~~IP~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~--g~~~~ 272 (452) T protein:vir:94 198 WELAKTSTIQNVGV---TMDYIPFFCITPSGLSMTPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWIT--GAESQ 272 (452) T ss_pred eeeccceeecCCCc---ccceeEEEEEcCCCCCCCCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEee--cCcCC Confidence 1111122333 445666666554443 335556443322222333334445555666666665443 33333 Q ss_pred hhhccCCCcceec-CC-cccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHH-HHHHHHHHHHh Q lcl|NC_011045. 309 RRLTKAQTGDFVT-GR-PEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEI-RYVASELEDTL 385 (536) Q Consensus 309 ~~~~~~~~g~~~~-g~-~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi-~~r~~E~~~~L 385 (536) ..+.- |++..+. .. ..+...++. 
++..+......++++++.+.++ ...+...+..+.|++|- ..+.......| T Consensus 273 ~~i~i-G~~~~~~lpe~~~~~~yie~-~g~~i~~~~~~l~~le~~m~~~--Ga~ll~~~~~~~~s~ea~~~~~~~~~s~L 348 (452) T protein:vir:94 273 STMHI-GSTKAWVIPEVAAKVGFLEF-TGQGLQSLEKALSEKQAQLASL--SARLIDNSTRGSEATETVKLRYMSETASL 348 (452) T ss_pred CceEe-cccccccCCCCCCcceEEcc-CchhHHHHHHHHHHHHHHHHHH--HHHhhccCCCcchHHHHHHHHHHHhhHHH Confidence 33332 4444332 22 223444442 3556788888899998888652 22233333434455554 44555556788 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCCcceEEEEechHH-HHHHHHHHHHHHHHHHHHHhhcchhhhhcCCH Q lcl|NC_011045. 386 GGVYSILSQELQLPLVRVLLKQLQAT-QQIPELPKEAVEPTISTGLE-AIGRGQDLDKLERCVAAWAALAPMRDDPDINL 463 (536) Q Consensus 386 G~v~~rl~~E~l~Pli~r~~~il~~~-g~lp~~~~~~v~v~~vs~La-~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~ 463 (536) ..+..++++-+ ++++.++.+. |. +..++|+|-.-.. +---.++++.++...+ ...|.. T Consensus 349 ~~~a~~~e~al-----~~~l~~~a~w~g~-----~~~~~v~~n~dF~~~~~~~~~~~al~~~~~----------~G~is~ 408 (452) T protein:vir:94 349 KSVTRAVEALL-----NKAYSCIMDMESM-----GGTLNIKLNSAFLDSKLTAAELKAWVEAYL----------SGGISK 408 (452) T ss_pred HHHHHHHHHHH-----HHHHHHHHHHcCC-----CCceEEEeccccccccCCHHHHHHHHHHHh----------cCCCcH Confidence 88888876653 4555555552 32 1234444332221 1111233333332211 112444 Q ss_pred HHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhh Q lcl|NC_011045. 464 AMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAA 526 (536) Q Consensus 464 d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 526 (536) ..+.+++-+ .||. ..++|...+..+... +++.. 
....+.++..+ T Consensus 409 ~t~~~~L~~-~gvl-----~~~~e~~~i~~E~~~------~~~~~-------~~~~~~~~~~~ 452 (452) T protein:vir:94 409 EIYIHALKV-GKVL-----PPPGESMGVIPDPPA------PEPSP-------SNTPPNPSSKA 452 (452) T ss_pred HHHHHHHHh-CCCC-----CCccCHHHHHHHhhc------cCccc-------CCCCCCCccCC Confidence 445554444 5662 233333333222111 11101 11111111121 No 121 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=98.02 E-value=6.8e-06 Score=48.90 Aligned_cols=422 Identities=9% Similarity=0.023 Sum_probs=191.2 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccc-----ccCCC--C--CcccccccccccchHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 8 LAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPS-----LFPKD--S--DNASTDYVTPWQAVGARGLNNLASKLMLALF 78 (536) Q Consensus 8 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~-----~~~~~--~--~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 78 (536) |+.+.+.+..+..+..+ +... .+.+|..-. +.... . ........++..+-+...++..++.|++- T Consensus 1 l~~~~i~~~i~~~~~~~-~r~~---~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~-- 74 (451) T protein:vir:10 1 MELEKIRAIISADAARR-QEIL---QAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTY-- 74 (451) T ss_pred CCHHHHHHHHHHHHHHH-HHHH---HHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheecc-- Confidence 99999999888888643 3343 444444331 11000 0 00111122444555666666666544220 Q ss_pred CCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCC------ Q lcl|NC_011045. 79 PMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEG------ 152 (536) Q Consensus 79 P~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~------ 152 (536) |. .+...+. ......++. +..++|.....++.++..++|.|.+++..+.. 
T Consensus 75 p~----~~~~~~~--------~~~~~~~~~------------~~~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~ 130 (451) T protein:vir:10 75 PV----LFDIDNN--------KELNEKVTD------------VLGNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQV 130 (451) T ss_pred cc----eeecCCc--------HHHHHHHHH------------HhccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccc Confidence 21 1222221 111111222 22478999999999999999998765543322 Q ss_pred -CceeeEEEEecceEEEeeC--CCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEE Q lcl|NC_011045. 153 -SNYNPMKLYRLSSYVVQRD--AFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRY 229 (536) Q Consensus 153 -~~~~~~~~~~l~~~~v~~d--~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~ 229 (536) .+.+++.+++.-+.++..| -.+++...+|.+......- ....++....+++|+. + ....| T Consensus 131 ~~~~~~~~~i~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~----------~~~~~~~~~~~e~yt~------~-~~~~~ 193 (451) T protein:vir:10 131 TNQTFKYGVVNTEEIIPIYRNGIERELEAVIRYYIQLEDVK----------GQIQKQAYTYVEFWTD------K-ILDKY 193 (451) T ss_pred cccceeEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccc----------ccccceEEEEEEEEeC------C-eEEEE Confidence 2456677776666555443 3567777776664332110 0000111112222221 1 12222 Q ss_pred EE----ecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccc Q lcl|NC_011045. 230 EE----VEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGI 305 (536) Q Consensus 230 ~~----v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~ 305 (536) .. ..+..+......++|..+|++.++. +.+|.|-.+...+-+..+|.+.-......+...+|.+.+.--+. 
T Consensus 194 ~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~ 268 (451) T protein:vir:10 194 KFFGVSCCGSQIEHITVQHRFNSVPFVEFSN-----NIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGG 268 (451) T ss_pred EecccCccccccccccccCCCCeeeEEEecc-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCc Confidence 21 1222233333456688899887654 45688999999999999999999999999999999877642111 Q ss_pred cchhhh-ccCCC-cceec-CC--cccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHH Q lcl|NC_011045. 306 TQPRRL-TKAQT-GDFVT-GR--PEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASE 380 (536) Q Consensus 306 ~~~~~~-~~~~~-g~~~~-g~--~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E 380 (536) ....+. ..... +.+.. .. ..+..+..+....+.+.....++.++..|-..-..-.+........|+.-+..+-.- T Consensus 269 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~ 348 (451) T protein:vir:10 269 EDTSEFLKELKRYKTIKTETDSEGDSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQQDTENFGNASGVALKFFYRK 348 (451) T ss_pred ccchhhHHHHhhCCeEEecCcCCccCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccccHHHHHHHHHH Confidence 111111 11111 22221 11 111223334444567778888888877775432211111111123344433332221 Q ss_pred HHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhh Q lcl|NC_011045. 381 LEDTLGGVYSILSQELQLPLVRVLLKQLQAT-QQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDP 459 (536) Q Consensus 381 ~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~ 459 (536) +... . .+.+.. +.+.+.+++.++.+. |. ..-.++++.|.-.+..-. .+.++.+.... +. 
T Consensus 349 l~~k-~---~~k~~~-f~~~l~~~~~li~~~~~~---~d~~~i~i~f~~~~p~n~-~e~~~~~~kl~---g~-------- 408 (451) T protein:vir:10 349 LELK-S---GLLETE-FRTSFDKLIKAILYFLGV---TDYKKIQQTYTRNMMSND-LEDADIATKSV---GI-------- 408 (451) T ss_pred HHHH-H---HHHHHH-HHHHHHHHHHHHHHHhCC---CCccceeEEecCCCCCCH-HHHHHHHHHHh---cc-------- Confidence 1111 1 222222 333344555444332 21 223467777765554321 11222221111 11 Q ss_pred cCCHHHHHHHHHHHcCCChhhccCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011045. 460 DINLAMIKLRIANAIGIDTSGILLTE-EQKQQKMAQQSMQMGMDNGAAALAQGMAAQ 515 (536) Q Consensus 460 ~id~d~~~~~~a~~~Gv~p~~i~rs~-~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~ 515 (536) |.-..++.. ++ ++.+. +|.+.+.++++.+.+ + ..+.... +.+ T Consensus 409 -iS~et~~~~----~p-----~v~d~~~e~~~~~ee~~~~~~--~-~~~~~~~-~~~ 451 (451) T protein:vir:10 409 -IPTKIILRH----HP-----WVDDVEEAEKLYLEEKKIQAS--K-VSDDYNN-FTE 451 (451) T ss_pred -CchHHHHHh----CC-----CCCCHHHHHHHHHHHHHHHHH--H-HHhhcCC-CCC Confidence 222222222 22 23333 333222222221111 1 1111111 111 No 122 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=97.99 E-value=7.9e-06 Score=48.56 Aligned_cols=443 Identities=14% Similarity=0.110 Sum_probs=192.3 Q ss_pred CCC-ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccc-----cccc-ccccchHHHHHHHHHHHH Q lcl|NC_011045. 1 MAE-KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS-----TDYV-TPWQAVGARGLNNLASKL 73 (536) Q Consensus 1 Ma~-~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~-----~~~~-~~~dst~~~a~~~Laa~l 73 (536) |++ +....+=..+..+|+..++--.- ...|++...-.||.....+....+ .++. 
-.|-+...+.++ .| T Consensus 1 m~~V~~~hp~y~~~~~~W~~ird~~~G-~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~----~l 75 (501) T protein:vir:95 1 MPNVSFIRPELGKLLPLYYLIRDAIAG-EPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLF----GL 75 (501) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHhcC-hHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHH----HH Confidence 986 33333345555666665554322 356777777778875433221111 1111 234444444444 44 Q ss_pred HHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCC Q lcl|NC_011045. 74 MLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) Q Consensus 74 ~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~ 153 (536) ++.+|=-.| .++.+ ..++.+++.|.. .-.+++.-+..++.+...+|-+.++||-+... T Consensus 76 ~G~vf~k~p--~~~~p--------------~~l~~l~~d~D~------~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~ 133 (501) T protein:vir:95 76 VGQVFMRDP--VVKVP--------------ALLNPLVANATG------SGINLTQLAKRAVSLNLAYSRAGLLVDYPTTE 133 (501) T ss_pred hhhhhcCCc--ceeCc--------------HHHHHHHhccCC------CCCCHHHHHHHHHHHHHhcCeEEEEEeecCCC Confidence 444441112 12222 124445555433 24578888999999999999999999865321 Q ss_pred c--ee------------eEEEEecceEE-EeeC---CCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEE Q lcl|NC_011045. 154 N--YN------------PMKLYRLSSYV-VQRD---AFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYT 215 (536) Q Consensus 154 ~--~~------------~~~~~~l~~~~-v~~d---~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~ 215 (536) + .. .+..|+..+.. +..+ ...++.-+..+++.+.++ .+|+. +.++.|. 
T Consensus 134 ~~~~~t~a~~~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d--~~f~~------------~~~~q~R 199 (501) T protein:vir:95 134 AEGGASIADLEAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAAD--DGFEM------------KTSGQFR 199 (501) T ss_pred CcccccHHHHHhccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecC--CCccc------------ceeEEEE Confidence 1 00 14444433321 1212 233444455555554222 23332 3344444 Q ss_pred EEEecCCCC-ceeEEEEe-----------cC------ccccccccccccccCceEEEeeeecCCCcccc--chHHHHHHH Q lcl|NC_011045. 216 HIYLDEDSG-EYIRYEEV-----------EG------MEVQGSDGTYPKEACPYIPIRMVRLDGESYGR--SYIEEYLGD 275 (536) Q Consensus 216 ~v~p~~~~~-~~~~~~~v-----------~g------~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGr--gp~~~~l~d 275 (536) ...+..++. .+.+|..- +| ...+..+|. ..+++|++.|.-..+...+. .|.. + T Consensus 200 vL~~~~~g~~~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~---~~l~~IPfv~~~~~~~~~~~~~pPLl----~ 272 (501) T protein:vir:95 200 VLRLDEEGYYVHEIWREPQPTKADGSKIPKGNYQQYVVYKPTDAQG---KRLTEIPFMFIGSENNDSNPDNPNFY----D 272 (501) T ss_pred EEeeCCCceEEEEEEEecCCcccCcceecCCcccccceeeeeccCC---CcCCeeeEEEEecCCCCCCCCccchH----H Confidence 444432221 12223221 11 111122232 46677888887655554433 4433 4 Q ss_pred HHHHHHH---HHH-HHHHHHHHhCCceeeccccccch-------hhhccCCCcc-eecCCcccccccccccccchhHHHH Q lcl|NC_011045. 276 LRSLENL---QEA-IVKMSMISSKVIGLVNPAGITQP-------RRLTKAQTGD-FVTGRPEDISFLQLEKQADFTVAKA 343 (536) Q Consensus 276 ~~~L~~l---~~~-~~~~~~~a~~p~~lv~~~g~~~~-------~~~~~~~~g~-~~~g~~~~~~~~~~~~~~~~~~~~~ 343 (536) +..||.- ..+ .-..+..+..|.+.+. |.... ..+.- |++. +.-...++...++.. +..+ ... 
T Consensus 273 lA~lni~hy~~ssd~~~~l~~~~~P~l~i~--G~~~~~~~~~~~~~i~~-G~~~~~~lP~~~~~~~ie~~-~~~i--~~~ 346 (501) T protein:vir:95 273 LASLNMAHYRNSADYEESCYIVGQPTPVLI--GLTEEWVTNVLKGSVNF-GSRGGIPLPVGADAKLLQAS-ENTM--LKE 346 (501) T ss_pred HHHHHHHHHhhhhHHHHHHHHcccceeeee--CCcccccccCCCCceee-cccccccCCCCCceeEEecC-hhhH--HHH Confidence 4344322 223 3334455555554332 22111 11111 1222 111122233333321 2223 356 Q ss_pred HHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceE Q lcl|NC_011045. 344 VSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVE 423 (536) Q Consensus 344 ~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~ 423 (536) .+++++++++++ .-.+........||++.+.+.......|+.+..++++-+.. ++..+-..+ |. +++.++ T Consensus 347 ~l~~l~~~m~~~--Ga~ll~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~-~l~~~a~w~---g~----~~~~~~ 416 (501) T protein:vir:95 347 AMDTKERQMVAL--GAKLVEQKEVQRTATEAELEAASEGSTLSSATKNVSAAFEW-ALKWAARWV---GQ----ADSGVK 416 (501) T ss_pred HHHHHHHHHHHH--HHhhccCCccchhHHHHHHHHHHHhHHHHHHHHHHHHHHHH-HHHHHHHHc---CC----CCCceE Confidence 677777777653 11233334445899999999999999999999998766333 333333332 22 123344 Q ss_pred EEEechHHHHH-HHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 424 PTISTGLEAIG-RGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMD 502 (536) Q Consensus 424 v~~vs~La~a~-r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~ 502 (536) |++-.-+.... -.++++.+....+ ...|..+.+.+.+.+ .||.+.. .++|.+++....+.+-. T Consensus 417 v~i~~df~~~~~~~~~~~al~~~~~----------~G~is~~t~~~~L~~-~~v~~~~---~~~e~e~i~~~~~~~~~-- 480 (501) T protein:vir:95 417 FELNTDFDIARMTPDERRSLVEEWQ----------KGAITFEEMRTGLRK-AGVATED---DSKAKEKIAKDTAEAMA-- 480 (501) T ss_pred EEEecccccccCCHHHHHHHHHHHh----------CCCCcHHHHHHHHHh-CCCCChh---HHHHHHHHHhhhcCccc-- Confidence 54322221111 1222333222221 112555556555544 5774321 12333333222111000 Q ss_pred HHHHHHHHHHHHhhhcCcchHHhhhhcCCCCC Q lcl|NC_011045. 
503 NGAAALAQGMAAQATASPEAMAAAADSVGLQP 534 (536) Q Consensus 503 ~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~ 534 (536) ..+ ++...+. ..++. ..|=+. T Consensus 481 --~~~----~~~~~~~--~~gg~---~~~~~~ 501 (501) T protein:vir:95 481 --LAT----PANVPGD--GSGGD---NVGNSE 501 (501) T ss_pred --ccc----cCCCCCC--Ccccc---cccCCC Confidence 000 0000000 01111 111111 No 123 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=97.96 E-value=9.2e-06 Score=48.18 Aligned_cols=433 Identities=10% Similarity=0.038 Sum_probs=193.2 Q ss_pred CCCc------------------cccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcc-----cccCC-CC--Ccc-ccc Q lcl|NC_011045. 1 MAEK------------------RTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIP-----SLFPK-DS--DNA-STD 53 (536) Q Consensus 1 Ma~~------------------~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P-----~~~~~-~~--~~~-~~~ 53 (536) |++= ......+.+.+..+..+.. ..+.+.+.+|..- .+-.. .+ ... .+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~----~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKPK----IDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKP 76 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHHH----HHHHHHHHHHhccCCcchhccchhccccccccccc Confidence 4431 1112234445555555432 2333444444322 11111 00 000 111 Q ss_pred ccccccchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHH Q lcl|NC_011045. 54 YVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEA 133 (536) Q Consensus 54 ~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~ 133 (536) ..++..+-+...++..++.|++ -| + .++..+.. 
....+++|+ ..||.....++ T Consensus 77 ~~ki~~n~~~~Ivd~~~~~l~g--~p--~--~~~~~d~~---------~~~~l~~~~------------~n~~~~~~~~~ 129 (474) T protein:vir:96 77 DWRMFTNYHQNLVDQKVAYAVA--NP--V--TFSSDDDK---------SLKTIQEVL------------NHKWDDKLVDI 129 (474) T ss_pred chhcccchHHHHHHhhhhhhcc--cC--c--eeecCchH---------HHHHHHHHH------------hcCHHHHHHHH Confidence 2345556666666666654433 12 1 22333321 122334333 25778888999 Q ss_pred HHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCC--CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceE Q lcl|NC_011045. 134 LKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETI 211 (536) Q Consensus 134 ~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~--~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~ 211 (536) .++..++|.+.+++..+.. +.+++..++..++++..|. .+++...+|.++.. ....+ T Consensus 130 ~~~~~~~G~~~~~~y~d~~-~~~~i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~--------------------~~~~~ 188 (474) T protein:vir:96 130 LTAASNKGIEWLQPYIDEN-GEFKTFRVPAEQAIPIWTNKERDTLKAFIRYYRLD--------------------GAERV 188 (474) T ss_pred HHHHHhcCeeEEEEEecCC-CceEEEEEcccceEEEEcCCCCCceEEEEEEEeec--------------------CceEE Confidence 9999999999877665544 3467888888888877764 56776666665421 11123 Q ss_pred EEEEE--E-EecCCCCceeE---EE--EecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_011045. 212 DVYTH--I-YLDEDSGEYIR---YE--EVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQ 283 (536) Q Consensus 212 ~v~~~--v-~p~~~~~~~~~---~~--~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~ 283 (536) ++|+. | +.+..++.... +. ......... ...+++..+|++.++. +.+|+|=.+...+.+..++.+. 
T Consensus 189 ~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~g~iPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~ 262 (474) T protein:vir:96 189 EYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVG-NKRVSWGRVPFIPFKN-----NPQEMSDLFMYKTIIDAMDKRL 262 (474) T ss_pred EEEeCCeEEEEEecCCceeecccccccccccccccc-ccccCCCceeEEEecc-----CCCCCCcHHHHHHHHHHHHHHH Confidence 33311 0 00111111111 00 001111111 2235578899988775 4679999999999999999999 Q ss_pred HHHHHHHHHHhCCceeeccccccchhhhc-cCCCcc-e-ecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhh-h Q lcl|NC_011045. 284 EAIVKMSMISSKVIGLVNPAGITQPRRLT-KAQTGD-F-VTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML-N 359 (536) Q Consensus 284 ~~~~~~~~~a~~p~~lv~~~g~~~~~~~~-~~~~g~-~-~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~ 359 (536) -......+....|.+.+.-.+.-+..+.. ....+. + +++..+++.. +....+.+.....++.+++.|-..-.. + T Consensus 263 S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~--l~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 340 (474) T protein:vir:96 263 SDTQNTFDESTELIYILKGYEGQDLDEFMRNLKYYKAINVDGDGSGVDT--IQIEVPVQSSKEYLDMLRDYVIEFGQGVD 340 (474) T ss_pred HHHHHHHHHhccceeeeecCCcccccchhhhhhcCceEEecCCCCceeE--EeecCChHHHHHHHHHHHHHHHHHhCCcc Confidence 99999999999998776422211222211 112222 2 2333444443 333456677777777777766442211 1 Q ss_pred hcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHH Q lcl|NC_011045. 360 SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDL 439 (536) Q Consensus 360 ~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~ 439 (536) ......+...|+..+..+..- ..+-.....+.-.+.+..++..++.++ |. ......++|+|.-.+..-. 
..-+ T Consensus 341 ~~~~~~~~n~Sg~Al~~~~~~-l~~k~~~k~~~~~~~l~~~~~~i~~~~---~~--~~~~~~i~i~f~~~~p~~~-~e~~ 413 (474) T protein:vir:96 341 FQQDKFGNSPSGIALKFMYSN-LDLKANKLKNKTLTALQELLQYIIDFY---KL--NIKVQDVEITFNFNVMVNE-LEQS 413 (474) T ss_pred ccccccccccHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHh---CC--CcccceeeEEeccCCCcCH-HHHH Confidence 111111223344443322111 112222233333333333333333332 21 1222345666643332211 1111 Q ss_pred HHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcC Q lcl|NC_011045. 440 DKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATAS 519 (536) Q Consensus 440 ~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~ 519 (536) + .+.+.+ .+.-..++..+ -+| --.++|++++.++++...+.. .....+.... T Consensus 414 ~-------~~~~ag------~iS~et~~~~~---~~v-----~d~~~E~~ri~~E~~e~~~~~-------~~~~~~~~~~ 465 (474) T protein:vir:96 414 Q-------IGVQSQ------YLSKETVVTNH---PWV-----DDPVAELERIEQDNIDFNKQL-------PPLEGDANGR 465 (474) T ss_pred H-------HHHhcC------CCchHHHHHhC---CCC-----CCHHHHHHHHHHHHHHHHhcc-------cccccccccc Confidence 1 111111 13333333321 112 112466666654443222111 1111111111 Q ss_pred cchHHhhhh Q lcl|NC_011045. 520 PEAMAAAAD 528 (536) Q Consensus 520 ~~~~~~~~~ 528 (536) .+.-..-++ T Consensus 466 ~~d~~~e~~ 474 (474) T protein:vir:96 466 AQDNESETN 474 (474) T ss_pred cCCCcccCC Confidence 111222223 No 124 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=97.69 E-value=2.8e-05 Score=45.57 Aligned_cols=433 Identities=8% Similarity=-0.003 Sum_probs=191.7 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHH-----HHHHHHhcccccCCCCCccccccccc--ccchHHHHHHHHHHHH Q lcl|NC_011045. 
1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRA-----QNCAQYTIPSLFPKDSDNASTDYVTP--WQAVGARGLNNLASKL 73 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w-----~e~~~~~~P~~~~~~~~~~~~~~~~~--~dst~~~a~~~Laa~l 73 (536) |++-+++...+-..++.. |..+.... ++-+.|..+ .....-+-.....+ .+..+..+|++.|..+ T Consensus 1 ~~~~~~a~~~~~~~~a~~-----~~~~~~~~g~~~~~d~~~~~~~---~~~~~~~~~~l~~lY~~~~l~r~iVd~~a~d~ 72 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKIVN-----RNDFMVGHGKANSRDKLTRQTP---GNGQKLDLKACENLYASNSIAMNIVDIISEDM 72 (461) T ss_pred Cccchhhhhhhhhhhhhh-----hhHHHhhcCCcchhhhhhcccc---CcccccCHHHHHHHHHhCCccchhhccchHHh Confidence 999777665544433422 11111100 000000000 00000000011111 1222333444444333 Q ss_pred HHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCC Q lcl|NC_011045. 74 MLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) Q Consensus 74 ~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~ 153 (536) + +.|+.+...++. ....++.|++ +-+....+.++++.--+||.|.+++.-.+.+ T Consensus 73 ~------r~g~~i~~~~~~---------~~~~~~~~~~-----------~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~ 126 (461) T protein:vir:80 73 V------RAGWSLKTDNKE---------MKKNIESKWR-----------KLKTKDRFQKLYADKRLYGDGFLSIGVVSSN 126 (461) T ss_pred h------cCCeeeecCCHH---------HHHHHHHHHH-----------HhhHHHHHHHHHHhhcccccEEEEEEeecCC Confidence 2 468888754432 2223333333 2467889999999999999998887543322 Q ss_pred ceeeEEEEecceEEEeeCCCC--CeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEE Q lcl|NC_011045. 154 NYNPMKLYRLSSYVVQRDAFG--NVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEE 231 (536) Q Consensus 154 ~~~~~~~~~l~~~~v~~d~~G--~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~ 231 (536) .-...-.-|| ....-+ ....+|.+..++..... .+..+ ...-.-+.|+.. . ...+..+. 
T Consensus 127 ~~~~~~~~pl-----~~~~~~~~~~l~~~~~~~i~~~~~~----~dp~s-----p~fg~P~~y~i~-~---~~~~~~~~- 187 (461) T protein:vir:80 127 REQADLSTAI-----DPKTIKSIPYINTFNTQKVTQLYLN----QDMFS-----EHFGEVEFFEVN-R---VSQLGEEI- 187 (461) T ss_pred ccccCccCCc-----ccccccceeEEEeccccccchhhhc----ccCcC-----cccccceEEEEe-c---cccccccc- Confidence 1000111111 111111 11223333333322211 11100 000111222211 0 01111111 Q ss_pred ecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccc------- Q lcl|NC_011045. 232 VEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAG------- 304 (536) Q Consensus 232 v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g------- 304 (536) +.+. .+.. ....+..+++++.-...++..||+|..+..++.++..+.......+.+..+.-+.+..+.-. T Consensus 188 ~~~~--~~~~-~~~iH~SRii~~~~~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~ 264 (461) T protein:vir:80 188 LSGT--TAST-SEQIHRSRIIHEQGLRFEGETKGRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDDK 264 (461) T ss_pred cccc--cCcc-ceEEccccEEEecCCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhhchHH Confidence 1100 0110 01124556666666667778899999999999999999999988887777776666554211 Q ss_pred --ccchhhhccCCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhh--h-hcccCCCCCCCHHHHHHHHH Q lcl|NC_011045. 305 --ITQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML--N-SAVQRTGERVTAEEIRYVAS 379 (536) Q Consensus 305 --~~~~~~~~~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~--~-~~~~~~~~r~TAtEi~~r~~ 379 (536) .....+......|..+-+..++...+. .++.-+...+....+.|.-+-=. . .+.+..+..=|. 
+ T Consensus 265 ~~~~~~~~~~~~~~g~~~~d~~e~~e~~~----~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asg-------e 333 (461) T protein:vir:80 265 ANLTAMLDFMFRTEALAIIKGDEQLTKES----TNVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGA-------Q 333 (461) T ss_pred HHHHHHHHHhcCCceEEEEcCCcceEEEe----cCcCCHHHHHHHHHHHHhhhhcCCeeeeecccCCccccc-------h Confidence 111112222233444434444433222 24445556666666766654310 0 011111212122 2 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc--CCCCCCC--CcceEEEEechH--HHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 380 ELEDTLGGVYSILSQELQLPLVRVLLKQLQAT--QQIPELP--KEAVEPTISTGL--EAIGRGQDLDKLERCVAAWAALA 453 (536) Q Consensus 380 E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~--g~lp~~~--~~~v~v~~vs~L--a~a~r~~~~~~l~~~~~~~~~~~ 453 (536) +-...+---+.+++...+.|.+++++.++.+. |..|.++ ...++++|-... ..-.++.-..+..+..+.+.+. T Consensus 334 ~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~- 412 (461) T protein:vir:80 334 YDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVN- 412 (461) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc- Confidence 22344555666777778999999999998763 3333333 245677764332 3333333333333333333221 Q ss_pred chhhhhcCCHHHHHHHHHHHcCCChhhccCC-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 454 PMRDDPDINLAMIKLRIANAIGIDTSGILLT-EEQKQQKMAQQSMQMGMDNGAA 506 (536) Q Consensus 454 p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs-~~ev~~~~~q~~~q~~~~~~a~ 506 (536) ..|+.+++.+.+...+|++|...+-. 
..|...+..+.....+.+...+ T Consensus 413 -----g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~g 461 (461) T protein:vir:80 413 -----GVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVYDAYAKKNADG 461 (461) T ss_pred -----CCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhccccccccCCCC Confidence 14788998888887788766544322 1222111110000000000000 No 125 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=97.62 E-value=3.6e-05 Score=44.95 Aligned_cols=412 Identities=10% Similarity=0.036 Sum_probs=181.0 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCc--c--c----cccc----------ccccchH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDN--A--S----TDYV----------TPWQAVG 62 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~--~--~----~~~~----------~~~dst~ 62 (536) |.=+....+=..+..+|+..+. .+...=+...+-.||.....+... . . .+.. -.|-+.- T Consensus 14 m~V~~~hp~y~a~~~~W~~~~d---~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~~~rA~~~n~~ 90 (488) T protein:vir:96 14 MLTPIYHPDYLVNAPQWLRNLD---CVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLTWRLANYVNIV 90 (488) T ss_pred ecccccCHHHHHHhhhhhHhhh---hhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhhhhccccCchh Confidence 7744444445566667765432 444445556666778653221110 0 0 0001 1133333 Q ss_pred HHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCc Q lcl|NC_011045. 63 ARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGN 142 (536) Q Consensus 63 ~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~ 142 (536) .+.++. |++.+|=-.| .++.+++ .+++.+++.|.. 
.-.+.+.-+...+.+...+|- T Consensus 91 ~~tl~~----l~G~vfrk~p--~~~~~~~------------~~l~~l~~d~D~------~G~~L~~f~~~~~~~~l~~G~ 146 (488) T protein:vir:96 91 NPTMNA----ITGAVMRREP--EFDTMDN------------PVLIGLRDNIDG------KGNGIDQECKQALNALQWGSR 146 (488) T ss_pred HHHHHH----hcchhhccCc--eeccCCc------------HHHHHHHhccCC------CCCCHHHHHHHHHHHHHhcCe Confidence 333333 4444441111 1112211 124555555533 256788889999999999999 Q ss_pred EEEEEecCCCC-----------ceeeEEEEecceEE---Eee-CCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCC Q lcl|NC_011045. 143 VLLYLPEPEGS-----------NYNPMKLYRLSSYV---VQR-DAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKA 207 (536) Q Consensus 143 ~~l~~~~~~~~-----------~~~~~~~~~l~~~~---v~~-d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~ 207 (536) +.++||-+... ++ .+..|+..+.. ..+ |+...+.-+..++++...+ .... . T Consensus 147 ~~ilVD~P~~~~T~ade~~~~~rP-y~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D------------~~~~-~ 212 (488) T protein:vir:96 147 CGWLVRSHPESATMADWNKGKKLP-TAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERD------------GGTY-V 212 (488) T ss_pred EEEEEecCCCcCCHHHHHHhcCCc-EEEEechhhhcCcceeccCCceeeEEEEEEEEEEecc------------CCCc-c Confidence 99999976432 12 24444433322 222 1222344344455443111 0000 1 Q ss_pred CceEEEEEEEEecCCCCceeEEEEecCcc----ccccccccccccCceEEEeeeecCCCcc--ccchHHHHHHHHHHHHH Q lcl|NC_011045. 208 DETIDVYTHIYLDEDSGEYIRYEEVEGME----VQGSDGTYPKEACPYIPIRMVRLDGESY--GRSYIEEYLGDLRSLEN 281 (536) Q Consensus 208 ~~~~~v~~~v~p~~~~~~~~~~~~v~g~~----i~~~~~~~~~~~~P~~~~rw~~~~ge~Y--Grgp~~~~l~d~~~L~~ 281 (536) ......++. -.++.|..|..-+|.. ++..+|.. .+++|++.|....+..+ |..|.. |+..||. 
T Consensus 213 ~~~~~~~~~----l~~g~~~v~~~~~~~~~~e~~~~~~g~~---~l~~IP~v~~~~~~~~~~~~~pPLl----dLA~lnl 281 (488) T protein:vir:96 213 SKQRLINHR----LVDGLCEFQEVTDDEYSDEWTPVLINSK---QSDTIPFFLASSQSNEWCIDSTPLT----SLAEISL 281 (488) T ss_pred cceEEEEEE----EECcEEEEEEEecCCcccceEeecCCCc---ccCeeEEEEEecCCCCCCCCCCchH----HHHHHHH Confidence 111111111 1245666665443322 22223443 45666666665555544 444543 4444432 Q ss_pred ---HHHHHHHHHHHHh-CCceeeccccccchhhhccCCCcceecCC-------cccccccccccccchhHHHHHHHHHHH Q lcl|NC_011045. 282 ---LQEAIVKMSMISS-KVIGLVNPAGITQPRRLTKAQTGDFVTGR-------PEDISFLQLEKQADFTVAKAVSDAIEA 350 (536) Q Consensus 282 ---l~~~~~~~~~~a~-~p~~lv~~~g~~~~~~~~~~~~g~~~~g~-------~~~~~~~~~~~~~~~~~~~~~i~~~~~ 350 (536) -..+-.+.+-..+ -|+|....++.... ...+..++.+..+. .++... ++.+++ ..+.+.++++++ T Consensus 282 ~Hy~~ssd~~~il~~~~~p~lv~~~~~~~~~-~~~~~~~~g~~~~~~~~~~~~~g~~~~--~e~~~~-~l~~~~l~~l~~ 357 (488) T protein:vir:96 282 SIYVMNAYSNKAMILANEAKWMVDMGDMNKT-MASEMNPLGFTLAGRMPYYVKNGDVKV--IQAQFS-PETENKVEKLFE 357 (488) T ss_pred HHHhhhhHHHHHHHhcCCceeeeccCCCCcc-cccccccceeeecccccccccCCceee--cCCchh-HHHHHHHHHHHH Confidence 2333334444344 44454433332221 11111111211111 122222 222222 124666777777 Q ss_pred HHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCC-CCCCcceEEEEec Q lcl|NC_011045. 351 RLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-QQIP-ELPKEAVEPTIST 428 (536) Q Consensus 351 rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-g~lp-~~~~~~v~v~~vs 428 (536) ++.++ .-.+.... .+.||++.+.+.......|+.+...+++-+ +++|.++.+- |.-. ......+++++-. 
T Consensus 358 qm~~~--Ga~l~~~~-~~~Ta~~~~~~~~~~~S~L~~~a~~le~al-----~~~l~~~A~w~g~~~~~~~~~~~~~~in~ 429 (488) T protein:vir:96 358 QAVKV--GASLFTQQ-SNETATGAAIRSGSSTASMATLGNNVEDTV-----RNMLRFIMRYFEGTNLYVNPDELVFKLNR 429 (488) T ss_pred HHHHH--hHhhccCC-CcchHHHHHHHHHHhhHHHHHHHHHHHHHH-----HHHHHHHHHHcCCCCCCcCccceEEEecc Confidence 66542 11222233 357999999999999999999988877653 3344444331 2211 1112223333221 Q ss_pred hHHHHH-HHHHHHHHHHHHH--------HHHhhc-chhhhhcCCHHHHHHHHHHH-cCC Q lcl|NC_011045. 429 GLEAIG-RGQDLDKLERCVA--------AWAALA-PMRDDPDINLAMIKLRIANA-IGI 476 (536) Q Consensus 429 ~La~a~-r~~~~~~l~~~~~--------~~~~~~-p~~~~~~id~d~~~~~~a~~-~Gv 476 (536) -...+. -.++++.++...+ ....+. ..++++.+++++..++|.+. +|+ T Consensus 430 dF~~~~ld~~~~~al~~~~~~G~Is~~t~~~~L~~~gvl~~d~~~e~~~~~ie~~g~~~ 488 (488) T protein:vir:96 430 DYFDVEVNPQMLQVAYAAMMEGNLPQVSWFELLKRARVVRGDMSKEEFDEHIAELGFGM 488 (488) T ss_pred CCCCccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCcCCccCCHHHHHHHHhhcCCCC Confidence 100000 1122222222211 111111 12343446777777777652 343 No 126 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=97.60 E-value=3.9e-05 Score=44.74 Aligned_cols=457 Identities=11% Similarity=0.077 Sum_probs=197.6 Q ss_pred CCCc-cccccHH-----HHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccc-cccccchHHHHHHHHHHHH Q lcl|NC_011045. 1 MAEK-RTGLAEE-----GAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDY-VTPWQAVGARGLNNLASKL 73 (536) Q Consensus 1 Ma~~-~~~~~~~-----~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~-~~~~dst~~~a~~~Laa~l 73 (536) |++. 
.+.++.+ .+..+|+.+++--.- ....++...-.||.....+...-..++ .-.|-+.-.+.++.++..+ T Consensus 1 m~~~~~~~v~~~h~~y~a~~~~W~~ird~~~G-~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~v 79 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQMLPRWHVIETLLGG-TEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLDTLSGKP 79 (513) T ss_pred CCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcC-hHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhhhh Confidence 9994 4555533 334455544443322 344455555556654322211112222 2346666666666666443 Q ss_pred HHhhcCCCcceeccCChhhhhhhccChhHHHHHHH-HHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCC Q lcl|NC_011045. 74 MLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDE-GLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEG 152 (536) Q Consensus 74 ~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~-~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~ 152 (536) +. - |..-|..+ + ..+.. +++.|.. .-.+++.-+..++.+.+.+|-+.++||-+.. T Consensus 80 f~-k-~p~~~~~~--p--------------~~~~~~l~~d~D~------~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~ 135 (513) T protein:vir:97 80 FS-E-PIKLNEDV--P--------------KAIEETILPDVDL------QGNNLDVFARQWFREGMAKALCHVLIDMPRP 135 (513) T ss_pred hh-c-CcccCcCc--h--------------HHHHHHHhhccCC------CCCCHHHHHHHHHHHHHhcCeEEEEEecCCC Confidence 22 1 32212211 1 12332 3344322 2457888889999999999999999986643 Q ss_pred Cc------------------eeeEEEEecceEE---Eee-CCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCce Q lcl|NC_011045. 153 SN------------------YNPMKLYRLSSYV---VQR-DAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADET 210 (536) Q Consensus 153 ~~------------------~~~~~~~~l~~~~---v~~-d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~ 210 (536) +. + .+..|+..+.. ..+ |..+.+.-+..+++...+ +.|+ .+. 
T Consensus 136 ~~~~~~~~~T~Ade~~~~~rP-y~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~---Dgf~------------~~~ 199 (513) T protein:vir:97 136 APREDGQPRTLADDRREGLRP-YWVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQ---DGFA------------EVC 199 (513) T ss_pred CCccchhHHhHHHHHhhccCc-eEEEecHhhhcCcceeccCcceeeeeEEEEEEEeec---CCCc------------ceE Confidence 21 2 14445543332 111 333344445445554421 1121 111 Q ss_pred EEEEEEEEecCCCCceeEEEEecCc------cccccccccccccCceEEEeeeecCCCcc--ccchHHHHHHHHHHHHH- Q lcl|NC_011045. 211 IDVYTHIYLDEDSGEYIRYEEVEGM------EVQGSDGTYPKEACPYIPIRMVRLDGESY--GRSYIEEYLGDLRSLEN- 281 (536) Q Consensus 211 ~~v~~~v~p~~~~~~~~~~~~v~g~------~i~~~~~~~~~~~~P~~~~rw~~~~ge~Y--Grgp~~~~l~d~~~L~~- 281 (536) ++-|. .-+.+.|.+|...++. -.+..++.. ..++|++.|....+..+ |..|.. ++..||. T Consensus 200 ~~q~r----vL~~g~~~v~r~~~~~~~~~~e~~~~~~g~~---~l~~IP~v~~~~~~~~~~~~~pPLl----~LA~ln~~ 268 (513) T protein:vir:97 200 KRRIR----VLEPGLVQLWEPVKKSNAQKEEWALADEWAT---GLNYVPLVTFYADRQGFMMGKPPLL----DLAHLNVA 268 (513) T ss_pred EEEEE----EEeCceEEEEEeecCCCccccceEEecCCCC---cCCceeEEEEecCCCCCCCCccchH----HHHHHHHH Confidence 12111 1124456666554321 122233433 45666666655444433 445543 4444443 Q ss_pred --HHHHHHHH-HHHHhCCceeeccccccc-hhhhccCCCcceec--CCcccccccccccccchhHHHHHHHHHHHHHHHH Q lcl|NC_011045. 282 --LQEAIVKM-SMISSKVIGLVNPAGITQ-PRRLTKAQTGDFVT--GRPEDISFLQLEKQADFTVAKAVSDAIEARLSFA 355 (536) Q Consensus 282 --l~~~~~~~-~~~a~~p~~lv~~~g~~~-~~~~~~~~~g~~~~--g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~a 355 (536) -..+-.+. +..+..|.+.+. |... ..+.+..|++.++. +..++...++.. 
+..+......+.++++.++++ T Consensus 269 hy~~~Sd~~~il~~~~~P~l~~~--G~~~~~~~~i~iG~~~~~~lpe~~~~~~yie~~-g~~i~~~~~~l~~le~qm~~~ 345 (513) T protein:vir:97 269 HWQSASDQRHILTVSRFPILACS--GASGEDSDPVVVGPNKVLYNPDPAGRFYYVEHT-GQAIAAGRTDLKDLEEQMAGY 345 (513) T ss_pred HHhhhhhHHHHHHhcccceeeee--cCCcCCCCceEeeccccccCCCCCCcceeeccC-chhHHHHHHHHHHHHHHHHHH Confidence 23333344 444444544432 3321 11212334444332 222334444332 456788888999999988764 Q ss_pred HhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCCcceEEEEechHHHHH Q lcl|NC_011045. 356 FMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-QQIPELPKEAVEPTISTGLEAIG 434 (536) Q Consensus 356 f~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-g~lp~~~~~~v~v~~vs~La~a~ 434 (536) = -.+........||++.+.+.......|+.+...+++-+ ++++.++.+. |. ..+.++|+|-.-..... T Consensus 346 G--a~ll~~~~~~~Ta~a~~~~~~~~~S~L~~~a~~le~al-----~~~l~~~a~wlg~----~~~~~~v~in~dF~~~~ 414 (513) T protein:vir:97 346 G--AEFLKRKTGGQTATARALDSAEATSDLSAMTGLFEDAL-----AQALDITADWLRL----GPNGGTVELVKDYDLEE 414 (513) T ss_pred H--HHhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHhCC----CCCccEEEeccccCccc Confidence 2 12222334458999999999999999999888866543 3344444332 21 11234454433222111 Q ss_pred H-HHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHH--HHHHHH-HH Q lcl|NC_011045. 435 R-GQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMD--NGAAAL-AQ 510 (536) Q Consensus 435 r-~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~--~~a~~~-~~ 510 (536) - .+.++.+++..+ ...|....+.+++-+ .||-+..+ ..+++.++++.+-+.+.... ....+. .. 
T Consensus 415 ~~~~~~~al~~a~~----------~G~is~~t~~~~L~r-~gvl~~d~-d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~ 482 (513) T protein:vir:97 415 MDAPGLQALQVARE----------KRDISRKTYLNGLRL-RGVLPEDF-DEDEDWEELMEEISEAMGRAGLDLDPAQKNP 482 (513) T ss_pred CCHHHHHHHHHHHh----------CCCCCHHHHHHHHHh-ccCCCccC-CHHHHHHHHHHhhhhccCCCCccccccCCCC Confidence 1 223333332221 112333344443333 44433222 11222222222211111000 000000 00 Q ss_pred HHHHhhh--cCcch--HHhhhhcCCCCCCC Q lcl|NC_011045. 511 GMAAQAT--ASPEA--MAAAADSVGLQPGI 536 (536) Q Consensus 511 ~~~~~~~--~~~~~--~~~~~~~~~~q~~~ 536 (536) +-..+.. .+.+. .+. -.++|--||= T Consensus 483 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 511 (513) T protein:vir:97 483 PEGGEGEGEGEGEGGEGGE-GGEGGGNPGG 511 (513) T ss_pred CCCCCCCCCCCCCCCCCCC-ccccCCCCCC Confidence 0000000 00000 001 1122223333 No 127 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=97.53 E-value=4.9e-05 Score=44.20 Aligned_cols=447 Identities=8% Similarity=-0.024 Sum_probs=183.4 Q ss_pred ccccccHHHHHHHHHHHHHH--hhhHHHHHHHHHHHhcc--------cccCCCCCcccccccccccchHHHHHHHHHHHH Q lcl|NC_011045. 4 KRTGLAEEGAKSVYERLKND--RAPYETRAQNCAQYTIP--------SLFPKDSDNASTDYVTPWQAVGARGLNNLASKL 73 (536) Q Consensus 4 ~~~~~~~~~~~~r~~~l~~~--R~~~e~~w~e~~~~~~P--------~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l 73 (536) ..-...-+.+.+.|=+-+.. .-.....|..++.=..+ .++-........+..++--+.+...++.+|+-| T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~~~A~ll 80 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHDKLMNSGTGNEIVVVAAEYI 80 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccccccccCChHHHHHHHHHHhh Confidence 11111122333333221110 00011111111100000 000000001111223444445677777777655 Q ss_pred HHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCC Q lcl|NC_011045. 
74 MLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) Q Consensus 74 ~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~ 153 (536) |--.+=+.+...+.. + ...++++| ...+..++|+..+.+.+.+..+.|++++-+.-+++ T Consensus 81 ----~~e~~~i~v~~~~~~------d---~e~~~~~l-------~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~- 139 (518) T protein:vir:78 81 ----SGKPLSIDVTGVNGS------K---DENLTKQL-------KEALRIDNFDSKSVKIVELAGGSGVSAVKINILNG- 139 (518) T ss_pred ----cCCCceEEecCcccc------C---cHHHHHHH-------HHHHHhccHHHHHHHHHHHhhccCceEEEEEEECC- Confidence 422121333222110 0 01233333 44566799999999999999999999873322232 Q ss_pred ceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCC---------CC Q lcl|NC_011045. 154 NYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDED---------SG 224 (536) Q Consensus 154 ~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~---------~~ 224 (536) .+.+..++...|+...+ +|++..+..-...... +++ .+|+.++.... .+ T Consensus 140 -~~~i~~v~ad~~~P~~~-~g~~~~~~f~~~~~~~--------------~k~------~~y~~lE~he~~~~~~~~~~~~ 197 (518) T protein:vir:78 140 -RPSISVHSSSQFWIDFK-NNEPFRFNFFEEIPTS--------------NKA------DIYYLVESREIKQWDKEGKKLS 197 (518) T ss_pred -eeEEEEEcCCeeEEEee-cCcEEEEEEEEEeecC--------------Ccc------eeEEEEEeeccccccceeeccc Confidence 35678888888887654 5776655322211110 001 12222222110 01 Q ss_pred ----ceeEEEEecCcccc-------------------ccc-cccccccCceEEEeeeec-----CCCccccchHHHHHHH Q lcl|NC_011045. 225 ----EYIRYEEVEGMEVQ-------------------GSD-GTYPKEACPYIPIRMVRL-----DGESYGRSYIEEYLGD 275 (536) Q Consensus 225 ----~~~~~~~v~g~~i~-------------------~~~-~~~~~~~~P~~~~rw~~~-----~ge~YGrgp~~~~l~d 275 (536) .|..|.+-.++.+. .++ ........||+++..+.. .++.||+|-...+.+. 
T Consensus 198 ~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~ 277 (518) T protein:vir:78 198 GGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNY 277 (518) T ss_pred ceeEEEEEeeecCcccccccccccccccccccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHH Confidence 11122211111110 000 001123457777655543 3678899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCCC----------cceec--CCcc-ccc---ccc-cccccch Q lcl|NC_011045. 276 LRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQT----------GDFVT--GRPE-DIS---FLQ-LEKQADF 338 (536) Q Consensus 276 ~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~~----------g~~~~--g~~~-~~~---~~~-~~~~~~~ 338 (536) ++.||..--+...-.+. .++.+.|+.+-+ .... ..++. ..++. +..+ +.. .++ +...-+. T Consensus 278 id~lD~~~s~~~~e~~~-g~~~i~v~~~~l-~~~~-~~~~~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~ 354 (518) T protein:vir:78 278 LFAVDYFFTVYMREGEK-TKTKIAASERMF-RKKV-NKSTDKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRD 354 (518) T ss_pred HHHHHHHHHHHHHHHHh-CCceeeechhHh-ccCC-CCCCCccccccCCCCceEEEecCcCCCCCccccceeeeecccCh Confidence 99999988888888765 888878854433 2211 11111 11111 1111 111 111 1111112 Q ss_pred hHHHHHHHHHHHHHHHHH-h-hhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcC---- Q lcl|NC_011045. 339 TVAKAVSDAIEARLSFAF-M-LNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQ---- 412 (536) Q Consensus 339 ~~~~~~i~~~~~rI~~af-~-~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g---- 412 (536) ..-...++.+-+.|.... + ...+. -++...|||||..+.+...+.+--.-..+..- |.-|+..++.++.... 
T Consensus 355 e~~~~~~~~~l~~~~~~~G~s~~tfg-~~~~~~TATei~s~~~~~~~t~~~~~~~~e~a-l~~l~~~i~~l~~~~~~~~~ 432 (518) T protein:vir:78 355 GSYRETMEYFAQKAVSKSGYNPATFN-LGNREVKATEIWSLQDATVRKIEKKKRLIQNV-YEQMLWDFLYLLTGGTNNKE 432 (518) T ss_pred HHHHHHHHHHHHHHHHhhCCChhhcC-cccccccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhcCccc Confidence 222333333333333221 1 11122 23445799999999988766664444443332 3345555555543321 Q ss_pred CCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHH Q lcl|NC_011045. 413 QIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKM 492 (536) Q Consensus 413 ~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~ 492 (536) ..+..+...++|+|--++..-.. ..++...+ .++ .+ .+....+++.+. .|+ |++|.+++- T Consensus 433 ~~~~~~~~~v~i~f~D~i~~D~~-~~~~~~~~---~v~-aG------imS~e~~i~~~~--~~~-------~deea~~e~ 492 (518) T protein:vir:78 433 KAIMRDEIRVIIEFPDPMSVNLN-ELSSTLNN---MNS-AL------AMSVEEKVKLIH--PKW-------EDEEIQAEV 492 (518) T ss_pred cccCCCceeEEEEeCCCCCCCHH-HHHHHHHH---HHh-cC------CCCHHHHHHHhC--CCC-------CHHHHHHHH Confidence 12223334567776544432211 11111111 111 11 123344444321 122 555554432 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCC Q lcl|NC_011045. 493 AQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVG 531 (536) Q Consensus 493 ~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~ 531 (536) ++-+.++.... . 
..|..++--+..+| T Consensus 493 ~ri~~E~~~~~--~-----------~~p~~~~g~~~~~g 518 (518) T protein:vir:78 493 KRIYLENAIGE--V-----------PDPEAIGGMETKGG 518 (518) T ss_pred HHHHHHhcccC--C-----------CCCccccCCCCCCC Confidence 22111111100 0 00111111111222 No 128 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=97.41 E-value=7.4e-05 Score=43.22 Aligned_cols=429 Identities=13% Similarity=0.131 Sum_probs=180.3 Q ss_pred CCC---ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccc--CC-------CCCcccc----ccc-ccccchHH Q lcl|NC_011045. 1 MAE---KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLF--PK-------DSDNAST----DYV-TPWQAVGA 63 (536) Q Consensus 1 Ma~---~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~--~~-------~~~~~~~----~~~-~~~dst~~ 63 (536) |-. ++..++.+.. .|. ...++|+-+.+.+--.+- .. ....+.. ++. -.|-+.- T Consensus 1 ~~~~~~~~~~V~~~hp--~y~-------a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~- 70 (491) T protein:vir:95 1 MLTANGQGSGVKTKHR--EWL-------HYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFT- 70 (491) T ss_pred CcccCCccCCCCccCH--HHH-------HHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChH- Confidence 433 4555544332 222 224456555555432110 00 0000000 111 1232333 Q ss_pred HHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcE Q lcl|NC_011045. 64 RGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNV 143 (536) Q Consensus 64 ~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~ 143 (536) ......|++.+|=-.|.+. .++ .++.+++.|.. 
.-.+++.-+...+.+...+|-+ T Consensus 71 ---~~tl~~l~G~vfrk~p~~~--~p~--------------~l~~l~~d~D~------~G~~L~~f~~~~~~~~l~~G~~ 125 (491) T protein:vir:95 71 ---RRTLSGMVGSVMRKEPEIN--IPK--------------ELEYLLKNADG------SGVGLIQHAQDTLMEIDSVGRG 125 (491) T ss_pred ---HHHHHHHhchhhcCCceee--ccH--------------HHHHHHhccCC------CCCCHHHHHHHHHHHHHHcCeE Confidence 3333344444442224442 221 24445555433 2567888899999999999999 Q ss_pred EEEEecCCCCc------------eeeEEEEecceEE---E-eeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCC Q lcl|NC_011045. 144 LLYLPEPEGSN------------YNPMKLYRLSSYV---V-QRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKA 207 (536) Q Consensus 144 ~l~~~~~~~~~------------~~~~~~~~l~~~~---v-~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~ 207 (536) .++||-+..+. + .+..|+..+.. . ..|+.+++.-+..+++..+++=...|+. T Consensus 126 ~ilVD~P~~~~~T~Ade~~~~~rP-y~~~~~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~----------- 193 (491) T protein:vir:95 126 GLLVDAPETAAATAAEQNAGLLNP-TIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFET----------- 193 (491) T ss_pred EEEEecCCCcccCHHHHHHhcCCc-EEEEechhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCccc----------- Confidence 99999764432 2 24555544432 1 1345556666666776544333333333 Q ss_pred CceEEEEEEEEecCCCC-ceeEEEEe-cC------ccccccccccccccCceEEEeeeecCCCcc--ccchHHHHHHHHH Q lcl|NC_011045. 208 DETIDVYTHIYLDEDSG-EYIRYEEV-EG------MEVQGSDGTYPKEACPYIPIRMVRLDGESY--GRSYIEEYLGDLR 277 (536) Q Consensus 208 ~~~~~v~~~v~p~~~~~-~~~~~~~v-~g------~~i~~~~~~~~~~~~P~~~~rw~~~~ge~Y--Grgp~~~~l~d~~ 277 (536) +.++.|....+..++. ++..|..- +| ..+...+|.. .+++|++.|.-..+..+ |..|.. |+. 
T Consensus 194 -~~~~qyRvL~l~~~g~~~~~v~r~~~~g~~~~~~~~~~~~~g~~---~l~~IPfv~~~~~~~~~~~~~pPLl----~LA 265 (491) T protein:vir:95 194 -KYGEQYRVLDIDTDGNYRQRLFRFDAEGGAQEEVVEIYPDLGES---LRGVIPFTFIGATNNDATIDDAPLL----PLA 265 (491) T ss_pred -ceEEEEEEEeecCCCceEEEEEEEcCCCcceeeeeeeeecCCCc---ccCeeEEEEEecCCCCCCCCcCchH----HHH Confidence 3344444444433321 22333321 11 1122234443 45666666665444444 445543 444 Q ss_pred HHHH---HHHHHHHHH-HHHhCCceeecc-ccccchhhhccCCCcceecCC--------cccccccccccccchhHHHHH Q lcl|NC_011045. 278 SLEN---LQEAIVKMS-MISSKVIGLVNP-AGITQPRRLTKAQTGDFVTGR--------PEDISFLQLEKQADFTVAKAV 344 (536) Q Consensus 278 ~L~~---l~~~~~~~~-~~a~~p~~lv~~-~g~~~~~~~~~~~~g~~~~g~--------~~~~~~~~~~~~~~~~~~~~~ 344 (536) .||. -+.+-.+.+ ..+..|.+.+.. +.. ....+..+.+..++-|. .++...++.. +. ..+... T Consensus 266 ~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~-~~~~~~~~~~~~i~~g~~~~~~lP~~~~~~~ie~~-~~--~~~~~~ 341 (491) T protein:vir:95 266 ELNIGHYRNSADNEESSFVVGQPTLFIYPGDNL-TPQSFKEANPNGIKFGSRCGHNLGYGGSAQLIQAG-EN--NLARQN 341 (491) T ss_pred HHHHHHhhhhhHHHHHHHHcccceeeeecCccc-CcchhhccCcceeEecCcCCcCCCCCCccceeecC-cc--hHHHHH Confidence 4432 333334444 444445443321 111 11111111222222221 2222333322 11 234666 Q ss_pred HHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCCcceE Q lcl|NC_011045. 345 SDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-QQIPELPKEAVE 423 (536) Q Consensus 345 i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-g~lp~~~~~~v~ 423 (536) +.+++.+...+= -.+...+ .+.||++...+...-...|+.+...+++-+- ++|.++-+. |. + . 
+..++ T Consensus 342 l~~~e~qm~~~G--a~l~~~~-~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~-----~~l~~~a~w~G~-~-~-~~~v~ 410 (491) T protein:vir:95 342 MLDKEQQAIQIG--AQLITPS-QQITAESARIQRGADTSVMATIARNVSQAYT-----DALRWVAMMLGK-P-E-DSEVE 410 (491) T ss_pred HHHHHHHHHHHH--HHhccCC-cchhHHHHHHHHHHhhHHHHHHHHHHHHHHH-----HHHHHHHHHcCC-C-C-CCceE Confidence 777777665421 1223233 3689999999999999999999998877643 333333332 32 1 1 22333 Q ss_pred E----EEe-chHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHH Q lcl|NC_011045. 424 P----TIS-TGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQ 498 (536) Q Consensus 424 v----~~v-s~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q 498 (536) + +|. .+++ .++++.++...+. ..|....+.+++ ...||. . ...+++..++..+.. T Consensus 411 i~~n~dF~~~~~~----~~~~~all~~~~~----------G~is~~t~~~~L-~~~~vl-~--~~~e~~~~~ie~~~~-- 470 (491) T protein:vir:95 411 FQLNMDFFLQPMT----AQDRAAWMADINA----------GLLPATAYYAAL-RKAGVT-D--WTDEDILNAIEDAPL-- 470 (491) T ss_pred EEeecccccccCC----HHHHHHHHHHHhc----------CCCCHHHHHHHH-HhCCCC-C--ccHHHHHHHHHhcCC-- Confidence 2 332 2232 2233333332220 123333444433 445662 2 112222222211110 Q ss_pred HHHHHHHHHHHHHHHHhhhcCcchHHhhhh Q lcl|NC_011045. 
499 MGMDNGAAALAQGMAAQATASPEAMAAAAD 528 (536) Q Consensus 499 ~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 528 (536) ...+.++.+ +.-|++.++-.+ T Consensus 471 --~~~~~~~~~-------~~~~~~~~~~~~ 491 (491) T protein:vir:95 471 --PSGAVTQVA-------GEIPQAAQQQQE 491 (491) T ss_pred --CCCcccccc-------ccchhhhhhccC Confidence 000000000 000111111000 No 129 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=97.36 E-value=8.5e-05 Score=42.90 Aligned_cols=363 Identities=12% Similarity=0.035 Sum_probs=148.6 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccc-cchHHHHHHHHHHHHHHhhcCCCcceeccCChhh Q lcl|NC_011045. 14 KSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPW-QAVGARGLNNLASKLMLALFPMQTWMRLTISEYE 92 (536) Q Consensus 14 ~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~-dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~ 92 (536) -..|+.+...|..-...-..+..+..|..+........-...+.. .++--.|++.+|+.+ +.+ | | ++ .+.. T Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~V~~~i~~Ia~~i-a~l-~---~-~~--~~~~ 72 (384) T protein:vir:49 1 MPIFNITNLATESPPSNQDSFFDITDPEFLDALNGSEWVSAETALKNSDLFSIISQLSNDL-ATA-K---I-TT--SRKQ 72 (384) T ss_pred CccccccccCcccccccchhhccccchhhcccccCCceechhhhhccHHHHHHHHHHHHHH-hhC-c---e-ee--ecch Confidence 122332222211100000112223333332211110000001112 233334555555544 333 2 1 11 1110 Q ss_pred hhhhccChhHHHHHHHHHHHHHHHHHHHHH-hccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeC Q lcl|NC_011045. 93 AKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRD 171 (536) Q Consensus 93 ~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d 171 (536) ... 
+...-+ .-+.+.=+...+.++...|||.+++..+..+.++.+..++...+-+..+ T Consensus 73 ~~~---------------------l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~ 131 (384) T protein:vir:49 73 LQG---------------------IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRL 131 (384) T ss_pred hhh---------------------hhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEc Confidence 000 000000 1134455566778888999999998877666666666666666655554 Q ss_pred CCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccccCce Q lcl|NC_011045. 172 AFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPY 251 (536) Q Consensus 172 ~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~ 251 (536) .++.. ++ |. +. ..+... +.... +..-=+ T Consensus 132 ~~~~~--~~---------------------------------y~-~~-------------~~~~~~-~~~~~--~~~~eV 159 (384) T protein:vir:49 132 DNQNG--LY---------------------------------YN-IT-------------FDDPRI-PPKQH--VPQGDI 159 (384) T ss_pred CCCce--EE---------------------------------EE-EE-------------ecCccc-cceeE--ecCccE Confidence 43211 11 10 10 000000 00000 001115 Q ss_pred EEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhc---------cCCCcceecC Q lcl|NC_011045. 252 IPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLT---------KAQTGDFVTG 322 (536) Q Consensus 252 ~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~---------~~~~g~~~~g 322 (536) ++.|+...++..||.||...+...+.......+.......-...|..++.-.+....+... ....|.++. 
T Consensus 160 ih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~~~~~v- 238 (384) T protein:vir:49 160 LHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQSRSRQAMKQMQGGPLV- 238 (384) T ss_pred EEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhcccCCcccee- Confidence 6666666678899999999999999999999999999888889998777655554432211 001122111 Q ss_pred Cccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh--cccCCCCCCCHHHHHHHHHH-HHHHhhhhHHHHHHHHHH Q lcl|NC_011045. 323 RPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS--AVQRTGERVTAEEIRYVASE-LEDTLGGVYSILSQELQL 398 (536) Q Consensus 323 ~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~~~r~TAtEi~~r~~E-~~~~LG~v~~rl~~E~l~ 398 (536) -.++....++.. +.+.+ ..+..+..++.|-++|-.-. +.......-|++.+.+.... ....|-|+.++++.+|.. T Consensus 239 l~~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~~l~pi~~~i~~~l~~ 317 (384) T protein:vir:49 239 LDDLEDFTPLEIKSNVAQ-LLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAVSRFLRPFVSELSKKLSC 317 (384) T ss_pred cCCCceEEEccCChhhHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhch Confidence 112223334442 33444 34666777788888884432 11122233455555443332 223466666666666533 Q ss_pred HHHHHHHHHHHhcCCCCCC-CCcceEEEEechHHHHHHHHHH----HHHHHHHHHHHhhc-chhhhhc Q lcl|NC_011045. 399 PLVRVLLKQLQATQQIPEL-PKEAVEPTISTGLEAIGRGQDL----DKLERCVAAWAALA-PMRDDPD 460 (536) Q Consensus 399 Pli~r~~~il~~~g~lp~~-~~~~v~v~~vs~La~a~r~~~~----~~l~~~~~~~~~~~-p~~~~~~ 460 (536) -+..-........+..... -..+++-...+........... ..+.... ....+. 
.+.-+.+ T Consensus 318 ~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~ne~r~~~-~~~p~~gGd~~~~~ 384 (384) T protein:vir:49 318 EVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILPKDLPEGE-TDSTLKGGETNEQY 384 (384) T ss_pred hhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCChhHHHHc-CCCCCCCCCCCCCC Confidence 2210000000000000000 0001111112221111110000 0001100 011111 1222233 No 130 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=97.27 E-value=0.00011 Score=42.27 Aligned_cols=334 Identities=12% Similarity=0.067 Sum_probs=133.8 Q ss_pred HhcccccCCCCC---cccccccccccchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHH-HHHHH Q lcl|NC_011045. 37 YTIPSLFPKDSD---NASTDYVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVD-EGLSM 112 (536) Q Consensus 37 ~~~P~~~~~~~~---~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~-~~L~~ 112 (536) .+++-+ ..-.. .........+-+.+ -...+.+.++.. + ...++... .++ ..... T Consensus 1 m~m~~f-~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~--~-~~~v~~~~------------al~~~~v~~ 58 (392) T protein:vir:39 1 MILPIL-NFINQTNDPPEVGSVQSYFPDG------NDAQIMESLLGD--N-NEWVSARA------------ALRNSDLFS 58 (392) T ss_pred Ccchhh-hhhhcccccccccccccccccC------chhhhhhhhcCC--C-CceechHH------------hhccHHHHH Confidence 111111 10000 00000000000000 000000000000 0 00011000 001 01112 Q ss_pred HHHHHHHHHHhc----------------c----ChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCC Q lcl|NC_011045. 113 VERIIMNYIESN----------------S----YRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDA 172 (536) Q Consensus 113 ve~~~~~~l~~s----------------n----f~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~ 172 (536) |-+.+...++.. | .+.=+...+.++..+|||++++..+..+.++.+..++...+.+..+. 
T Consensus 59 ~i~~ia~~ia~lp~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~ 138 (392) T protein:vir:39 59 IILQLSSDLAIVKINAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFE 138 (392) T ss_pred HHHHHHHhhccCceeeccchhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcC Confidence 222333332222 2 24555667779999999999987776666666666666666666655 Q ss_pred CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccccCceE Q lcl|NC_011045. 173 FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYI 252 (536) Q Consensus 173 ~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~ 252 (536) +|... + |+. ..++...... .. +..--++ T Consensus 139 ~~~~~--~---------------------------------y~~--------------~~~~~~~~~~-~~--~~~~eii 166 (392) T protein:vir:39 139 YENGM--Y---------------------------------YNI--------------TFDDPKIEPI-LQ--APQSDLI 166 (392) T ss_pred CCceE--E---------------------------------EEE--------------EecCccccee-EE--EccccEE Confidence 43210 1 110 1111000000 00 1111256 Q ss_pred EEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccc-ccchh--------hhccCC-CcceecC Q lcl|NC_011045. 253 PIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAG-ITQPR--------RLTKAQ-TGDFVTG 322 (536) Q Consensus 253 ~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g-~~~~~--------~~~~~~-~g~~~~g 322 (536) +.|+...+|..||.||...+...+.......+.......-...|.+++.-.+ ....+ .+.... .|.+ .. T Consensus 167 h~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~-~v 245 (392) T protein:vir:39 167 HMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGP-VV 245 (392) T ss_pred EecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCe-ee Confidence 6777777888999999999999999999999999999999999987664322 11111 111111 1111 11 Q ss_pred Cccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHH Q lcl|NC_011045. 
323 RPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLV 401 (536) Q Consensus 323 ~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli 401 (536) -+++....++.. +.+.+. .+..+..+..|-++|=......-+...-|..+ .+...=....|.|.+.++++|+-.-|+ T Consensus 246 l~~g~~~~~l~~~~~d~~~-~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~-~~~~~f~~~~l~P~~~~ie~~l~~~L~ 323 (392) T protein:vir:39 246 LDDLEEFTALEIKSNVAQL-LSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSI-QQISGMYASALNRYLRPAISELEYKLS 323 (392) T ss_pred cCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 123334444443 334443 35566677788888743321111222222211 122223445667777777776643322 Q ss_pred HHHHHHHHhcCCCCCCCCcceEEE-EechHHHHHHHHHHHHHH--------HHHHH-------------HHhhcchh-hh Q lcl|NC_011045. 402 RVLLKQLQATQQIPELPKEAVEPT-ISTGLEAIGRGQDLDKLE--------RCVAA-------------WAALAPMR-DD 458 (536) Q Consensus 402 ~r~~~il~~~g~lp~~~~~~v~v~-~vs~La~a~r~~~~~~l~--------~~~~~-------------~~~~~p~~-~~ 458 (536) . .+. -+++.. ..++...+. .+..+. ++-.. ...+.|.. .+ T Consensus 324 ~-------------~~~-~d~~~~~~~d~~~~~~---~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~e~l~~~~~Gd 386 (392) T protein:vir:39 324 D-------------HIS-VNMRPAIDPLGDNYLS---TISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQ 386 (392) T ss_pred c-------------ccc-ccchhhhccCHHHHHH---HHHHHHhCCCcCHHHHHHHHHhcCCCccccchhcCCCCCCCCC Confidence 1 110 001110 011111111 111110 00000 01122210 00 Q ss_pred hcCCHH Q lcl|NC_011045. 459 PDINLA 464 (536) Q Consensus 459 ~~id~d 464 (536) .+=... 
T Consensus 387 ~~~p~p 392 (392) T protein:vir:39 387 SNEPVP 392 (392) T ss_pred CCCCCC Confidence 000000 No 131 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=97.27 E-value=0.00011 Score=42.27 Aligned_cols=334 Identities=12% Similarity=0.067 Sum_probs=133.8 Q ss_pred HhcccccCCCCC---cccccccccccchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHH-HHHHH Q lcl|NC_011045. 37 YTIPSLFPKDSD---NASTDYVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVD-EGLSM 112 (536) Q Consensus 37 ~~~P~~~~~~~~---~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~-~~L~~ 112 (536) .+++-+ ..-.. .........+-+.+ -...+.+.++.. + ...++... .++ ..... T Consensus 1 m~m~~f-~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~--~-~~~v~~~~------------al~~~~v~~ 58 (392) T protein:vir:10 1 MILPIL-NFINQTNDPPEVGSVQSYFPDG------NDAQIMESLLGD--N-NEWVSARA------------ALRNSDLFS 58 (392) T ss_pred Ccchhh-hhhhcccccccccccccccccC------chhhhhhhhcCC--C-CceechHH------------hhccHHHHH Confidence 111111 10000 00000000000000 000000000000 0 00011000 001 01112 Q ss_pred HHHHHHHHHHhc----------------c----ChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCC Q lcl|NC_011045. 113 VERIIMNYIESN----------------S----YRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDA 172 (536) Q Consensus 113 ve~~~~~~l~~s----------------n----f~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~ 172 (536) |-+.+...++.. | .+.=+...+.++..+|||++++..+..+.++.+..++...+.+..+. 
T Consensus 59 ~i~~ia~~ia~lp~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~ 138 (392) T protein:vir:10 59 IILQLSSDLAIVKINAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFE 138 (392) T ss_pred HHHHHHHhhccCceeeccchhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcC Confidence 222333332222 2 24555667779999999999987776666666666666666666655 Q ss_pred CCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccccCceE Q lcl|NC_011045. 173 FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYI 252 (536) Q Consensus 173 ~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~ 252 (536) +|... + |+. ..++...... .. +..--++ T Consensus 139 ~~~~~--~---------------------------------y~~--------------~~~~~~~~~~-~~--~~~~eii 166 (392) T protein:vir:10 139 YENGM--Y---------------------------------YNI--------------TFDDPKIEPI-LQ--APQSDLI 166 (392) T ss_pred CCceE--E---------------------------------EEE--------------EecCccccee-EE--EccccEE Confidence 43210 1 110 1111000000 00 1111256 Q ss_pred EEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccc-ccchh--------hhccCC-CcceecC Q lcl|NC_011045. 253 PIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAG-ITQPR--------RLTKAQ-TGDFVTG 322 (536) Q Consensus 253 ~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g-~~~~~--------~~~~~~-~g~~~~g 322 (536) +.|+...+|..||.||...+...+.......+.......-...|.+++.-.+ ....+ .+.... .|.+ .. T Consensus 167 h~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~-~v 245 (392) T protein:vir:10 167 HMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGP-VV 245 (392) T ss_pred EecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCe-ee Confidence 6777777888999999999999999999999999999999999987664322 11111 111111 1111 11 Q ss_pred Cccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHH Q lcl|NC_011045. 
323 RPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLV 401 (536) Q Consensus 323 ~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli 401 (536) -+++....++.. +.+.+. .+..+..+..|-++|=......-+...-|..+ .+...=....|.|.+.++++|+-.-|+ T Consensus 246 l~~g~~~~~l~~~~~d~~~-~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~-~~~~~f~~~~l~P~~~~ie~~l~~~L~ 323 (392) T protein:vir:10 246 LDDLEEFTALEIKSNVAQL-LSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSI-QQISGMYASALNRYLRPAISELEYKLS 323 (392) T ss_pred cCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 123334444443 334443 35566677788888743321111222222211 122223445667777777776643322 Q ss_pred HHHHHHHHhcCCCCCCCCcceEEE-EechHHHHHHHHHHHHHH--------HHHHH-------------HHhhcchh-hh Q lcl|NC_011045. 402 RVLLKQLQATQQIPELPKEAVEPT-ISTGLEAIGRGQDLDKLE--------RCVAA-------------WAALAPMR-DD 458 (536) Q Consensus 402 ~r~~~il~~~g~lp~~~~~~v~v~-~vs~La~a~r~~~~~~l~--------~~~~~-------------~~~~~p~~-~~ 458 (536) . .+. -+++.. ..++...+. .+..+. ++-.. ...+.|.. .+ T Consensus 324 ~-------------~~~-~d~~~~~~~d~~~~~~---~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~e~l~~~~~Gd 386 (392) T protein:vir:10 324 D-------------HIS-VNMRPAIDPLGDNYLS---TISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQ 386 (392) T ss_pred c-------------ccc-ccchhhhccCHHHHHH---HHHHHHhCCCcCHHHHHHHHHhcCCCccccchhcCCCCCCCCC Confidence 1 110 001110 011111111 111110 00000 01122210 00 Q ss_pred hcCCHH Q lcl|NC_011045. 459 PDINLA 464 (536) Q Consensus 459 ~~id~d 464 (536) .+=... 
T Consensus 387 ~~~p~p 392 (392) T protein:vir:10 387 SNEPVP 392 (392) T ss_pred CCCCCC Confidence 000000 No 132 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=97.10 E-value=0.00017 Score=41.24 Aligned_cols=365 Identities=9% Similarity=0.023 Sum_probs=147.7 Q ss_pred HHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccc--cccccccchHHHHHHHHHHHHHHhhcCCCcceeccCChhh Q lcl|NC_011045. 15 SVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAST--DYVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYE 92 (536) Q Consensus 15 ~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~--~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~ 92 (536) ..|+++-..|+..-....-........+....+..+.. ...-+-.++--.|++.+|+.+.+. ||--....+.. T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~l-----~~~~~~~~~~~ 75 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLNMFGGRKTASGERVSESNSLVQPDIFACVNVLSDDIAKL-----PIHTYKRTDGG 75 (416) T ss_pred CccchhcccccCccccCccchhHHHHhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhC-----ceEEEEecCCc Confidence 33333333333221100000111111111111111110 011123445556777777766432 33211111111 Q ss_pred hhhhccChhHHHHHHHHHHHHHHHHHHHHH-h----ccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEE Q lcl|NC_011045. 93 AKQLLSDPDGLAKVDEGLSMVERIIMNYIE-S----NSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYV 167 (536) Q Consensus 93 ~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~----snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~ 167 (536) ..+. . +.-+...|. + -+.+.=+...+.++..+|||.+|+..+..+.+..+..++...+. 
T Consensus 76 ~~~~---------~-------~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~ 139 (416) T protein:vir:12 76 IERK---------P-------EHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDYTN 139 (416) T ss_pred cccc---------c-------ccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceE Confidence 1110 0 001111121 2 23445566778889999999999876665544444444433343 Q ss_pred EeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccc Q lcl|NC_011045. 168 VQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKE 247 (536) Q Consensus 168 v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~ 247 (536) +..+.++.. ++ +...++|..+ .+. T Consensus 140 v~~~~~~~~------------------------------------~~-------------~~~~~~g~~~-------~~~ 163 (416) T protein:vir:12 140 AYVHPTTGM------------------------------------LW-------------YQTVLNGKAI-------ELY 163 (416) T ss_pred EEEeCCCcE------------------------------------EE-------------EEEecCCeEE-------Eec Confidence 333332210 00 0011122211 111 Q ss_pred cCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhcc---------CCCcc Q lcl|NC_011045. 248 ACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTK---------AQTGD 318 (536) Q Consensus 248 ~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~---------~~~g~ 318 (536) ..-++++|+...++ .||.||..-+...+.......+.......-...|.+++.-++.++++.... ++.+. T Consensus 164 ~~eiih~~~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~ 242 (416) T protein:vir:12 164 DYEVLHFKGLSTDG-IHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPAFLDEKPKENVRKEWKRVNKVENI 242 (416) T ss_pred CccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhcCCCe Confidence 22356666654444 899999999999999999999998888888888988887666666553221 11111 Q ss_pred eecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-cc--cCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHH Q lcl|NC_011045. 
319 FVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AV--QRTGERVTAEEIRYVASELEDTLGGVYSILSQ 394 (536) Q Consensus 319 ~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~--~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~ 394 (536) .+ -.++....++.. +.+.+. .+........|-++|-.-. +. ..++..-++++... .=....|.|.+.++.+ T Consensus 243 ~v--l~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~--~f~~~~l~P~~~~ie~ 317 (416) T protein:vir:12 243 AI--IDYGLEYQSISMPLQEAQF-VESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQSI--EYVRNTLQPWIVNFEQ 317 (416) T ss_pred ee--cCCCceEEEccCChhhHHH-HHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHH--HHHHHHHHHHHHHHHH Confidence 11 122233334433 345554 3455666788888885422 11 11222223333221 1223345555555555 Q ss_pred HHHHHHHHHHHHHHHhcCCCCCCC-CcceEEEE-echHH---HHHHHHHHHHHHHH----HHHH---Hhhcch-hhhh-- Q lcl|NC_011045. 395 ELQLPLVRVLLKQLQATQQIPELP-KEAVEPTI-STGLE---AIGRGQDLDKLERC----VAAW---AALAPM-RDDP-- 459 (536) Q Consensus 395 E~l~Pli~r~~~il~~~g~lp~~~-~~~v~v~~-vs~La---~a~r~~~~~~l~~~----~~~~---~~~~p~-~~~~-- 459 (536) +|-.- ++++.. .....++| ++.|- ...|+.....+.+. .+.+ -.+.|. -.|. T Consensus 318 ~l~~~-------------l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ggd~~~ 384 (416) T protein:vir:12 318 ELNVK-------------LFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIENGDKYI 384 (416) T ss_pred HHHHh-------------hcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceee Confidence 54222 222221 11122332 22221 11111111111110 0001 011121 0110 Q ss_pred ----cCCHHHHHHH----------H--HHHcC Q lcl|NC_011045. 460 ----DINLAMIKLR----------I--ANAIG 475 (536) Q Consensus 460 ----~id~d~~~~~----------~--a~~~G 475 (536) .+-.|.+-+. 
= -..-| T Consensus 385 ~~~n~~~~~~~~~~~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 385 SSLNYVFLDFLEEYQRLKAGGAMKGGDNKNEG 416 (416) T ss_pred eccccccccccchhhccccccccCCCCCcCCC Confidence 0000100000 0 01124 No 133 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=97.07 E-value=0.00019 Score=41.04 Aligned_cols=372 Identities=11% Similarity=0.023 Sum_probs=142.9 Q ss_pred cccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHH------ Q lcl|NC_011045. 39 IPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSM------ 112 (536) Q Consensus 39 ~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~------ 112 (536) ++-+... ..+........+. +...++.+.. ...++...... .....+.|+..-.. T Consensus 1 M~~f~~~----~~~~~~~~~~~~~----------~~~~~~~~~~--~~~v~~~~al~---~~~V~~~v~~ia~~ia~~p~ 61 (397) T protein:vir:38 1 MPLLKLN----KSHSQGFSLNDPD----------WVNFLTGGEA--QKYVSADTALK---NSDIFSLIMQLSGDLAMVRY 61 (397) T ss_pred Ccchhhh----hcccCcccCCchh----------hhhhhcCCcC--CceechHHhhc---cHHHHHHHHHHHHHHhhCcc Confidence 2211100 0000001110000 0000000000 00001000000 00111111110000 Q ss_pred -HHH-HHHHHHHh----ccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEecc Q lcl|NC_011045. 113 -VER-IIMNYIES----NSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIA 186 (536) Q Consensus 113 -ve~-~~~~~l~~----snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t 186 (536) +++ .....+.+ .+.+.-+..+..++.++|||.+++..+..+.++.+..++...+.+..+.+|.. ++.++.+. 
T Consensus 62 ~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~--~~y~~~~~ 139 (397) T protein:vir:38 62 TSESDRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQDGSG--LIYNINFD 139 (397) T ss_pred cccccHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCce--EEEEEEec Confidence 001 11111211 23455667788899999999999887777777777777777777776666531 11111110 Q ss_pred HHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccccCceEEEeeeecCCCcccc Q lcl|NC_011045. 187 FGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGR 266 (536) Q Consensus 187 ~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGr 266 (536) ...++... .++.++ ++++|.....+..||. T Consensus 140 ---------------------------------~~~~~~~~---------------~~~~~e--iih~~~~~~~~~~~G~ 169 (397) T protein:vir:38 140 ---------------------------------EPAIGYME---------------NVPAAD--VIHIRLLSKNGGKTGI 169 (397) T ss_pred ---------------------------------ccccccee---------------EecCcc--EEEecCCCCCCccccc Confidence 00000000 000111 4555555566778999 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhc---------cC-C-CcceecCCcccccccccccc Q lcl|NC_011045. 267 SYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLT---------KA-Q-TGDFVTGRPEDISFLQLEKQ 335 (536) Q Consensus 267 gp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~---------~~-~-~g~~~~g~~~~~~~~~~~~~ 335 (536) ||...+...+.......+.......-...|.+++.-++..+.+... .+ . .|.++ .-.++....++... T Consensus 170 s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~-vl~~g~~~~~l~~~ 248 (397) T protein:vir:38 170 SPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGGLLDAETRIARSKEISKQIHNSDGPV-VIDALEDYKPLEVK 248 (397) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCce-ecCCCceEEecCCC Confidence 9999999999999999999998888888888777765555443211 11 1 11111 12233344455433 Q ss_pred -cchhHHHHHHHHHHHHHHHHHhhhhcccCCCCC-CCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_011045. 
336 -ADFTVAKAVSDAIEARLSFAFMLNSAVQRTGER-VTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQ 413 (536) Q Consensus 336 -~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r-~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~ 413 (536) .+.+ ..+..+..+..|-.+|-......-.... .+..| +...-....|-|.+..++++|-.- + T Consensus 249 ~~d~~-~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~e--~~~~~~~~~l~P~~~~ie~~ln~~-------------l 312 (397) T protein:vir:38 249 GNIAS-LLNQVDWTRDQIAKVYGVPDSYLNGQGDQQSSIT--QISGQYAKSLNRYVQAIVGELNDK-------------L 312 (397) T ss_pred hhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHH--HHHHHHHHHHHHHHHHHHHHHHHh-------------c Confidence 3444 4556677788888888543321111111 12222 112222334455555555553222 2 Q ss_pred CCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHH Q lcl|NC_011045. 414 IPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMA 493 (536) Q Consensus 414 lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~ 493 (536) +++. .+.+++. +. .+......++..+.+.+ .+.++++-. .+|.+|. . ..+..... T Consensus 313 ~~~~---~~~~~~~--~~-----~d~~~~~~~~~~~~~~G------~~t~nE~R~----~lg~~p~--~-~~d~~~~~-- 367 (397) T protein:vir:38 313 HANI---SANIRFA--ID-----AMGDQYASTISSSVKGG------TIAGNQARF----ILQNSGY--L-AKDLPDPE-- 367 (397) T ss_pred cChh---ccccccc--cc-----CCHHHHHHHHHHHHhCC------CcCHHHHHH----HhCCCCC--C-CCcccccc-- Confidence 2221 1222211 00 01111111111111111 134444333 2344331 0 00100000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCCC Q lcl|NC_011045. 494 QQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPG 535 (536) Q Consensus 494 q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~ 535 (536) .... ..... ....++..++. 
+..+.++- |= T Consensus 368 ~~~~--~~~~~------~~~~~g~~~~~---~~~e~~~~-~~ 397 (397) T protein:vir:38 368 KEPQ--QAIQL------IQQEGGENDGN---NSDERGSD-PE 397 (397) T ss_pred cccc--ccccc------cccccCCCCCC---CCCCCCCC-CC Confidence 0000 00000 00000000000 00000100 00 No 134 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=96.93 E-value=0.00025 Score=40.29 Aligned_cols=427 Identities=12% Similarity=0.127 Sum_probs=182.0 Q ss_pred CCC---ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc---cCCC------CCcccc----ccc-ccccchHH Q lcl|NC_011045. 1 MAE---KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL---FPKD------SDNAST----DYV-TPWQAVGA 63 (536) Q Consensus 1 Ma~---~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~---~~~~------~~~~~~----~~~-~~~dst~~ 63 (536) |-. ++..++.+.. .|. ...++|+-|.+.+--.. .+.. ...+.+ ++. -.|-+. T Consensus 1 ~~~~~~~~~~V~~~hp--~y~-------a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~-- 69 (489) T protein:vir:78 1 MLTENGQGSGVKTKHR--EWL-------HYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYNF-- 69 (489) T ss_pred CccCCCccCCCCccCH--HHH-------HHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCCh-- Confidence 543 4444544332 222 22445665555543311 0000 000000 111 112222 Q ss_pred HHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcE Q lcl|NC_011045. 64 RGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNV 143 (536) Q Consensus 64 ~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~ 143 (536) +......|++.+|=-.|++.+ ++ .++.+++.|.. 
.-.+++.-+...+.+...+|-+ T Consensus 70 --~~~tl~~l~G~vfrk~p~~~~--p~--------------~l~~l~~d~D~------~G~~L~~f~~~~~~~~l~~G~~ 125 (489) T protein:vir:78 70 --TRRTLSGMVGSVMRKEPEINI--PK--------------ELEYLLKNADG------SGVGLIQHAQDTLMEIDSVGRG 125 (489) T ss_pred --HHHHHHHHhchhhcCCcceec--cH--------------HHHHHHhccCC------CCCCHHHHHHHHHHHHHhcCeE Confidence 233344444555522356532 21 24455555533 2567888999999999999999 Q ss_pred EEEEecCCCCc------------eeeEEEEecceEE---Eee-CCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCC Q lcl|NC_011045. 144 LLYLPEPEGSN------------YNPMKLYRLSSYV---VQR-DAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKA 207 (536) Q Consensus 144 ~l~~~~~~~~~------------~~~~~~~~l~~~~---v~~-d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~ 207 (536) .++||-+..+. + .+..|+..+.. ..+ |+.+++.-+..+++...++=...|+ T Consensus 126 ~ilVD~P~~~~~T~ade~~~~~rP-y~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~------------ 192 (489) T protein:vir:78 126 GLLVDAPETGAATAAEQNAGLLNP-TIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFE------------ 192 (489) T ss_pred EEEEeeCCCCCcCHHHHHHhcCCc-EEEEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCcc------------ Confidence 99999765431 2 24555544432 222 3333555555566554322112222 Q ss_pred CceEEEEEEEEecCCCC-ceeEEEEe-cCc------cccccccccccccCceEEEeeeecCCCcccc--chHHHHHHHHH Q lcl|NC_011045. 208 DETIDVYTHIYLDEDSG-EYIRYEEV-EGM------EVQGSDGTYPKEACPYIPIRMVRLDGESYGR--SYIEEYLGDLR 277 (536) Q Consensus 208 ~~~~~v~~~v~p~~~~~-~~~~~~~v-~g~------~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGr--gp~~~~l~d~~ 277 (536) .+.++.|....+..++. ++..|... +|. .+...+|. ..+++|++.|.-..+..+.. .|.. |+. 
T Consensus 193 ~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~~~~~~~~~~g~---~~l~~IPfv~~~~~~~~~~~~~pPLl----~LA 265 (489) T protein:vir:78 193 TKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQEDVVEIYPDLGE---SLRGVIPFTFIGATNNDATIDDAPLL----PLA 265 (489) T ss_pred ceeEEEEEEEecCCCcceEEEEEEeecCCcccceeeEEeccCCC---CccCeeeEEEEecCCCCCCCCcCchH----HHH Confidence 33445454444433331 22334332 221 12122343 35677888777666555544 4543 444 Q ss_pred HHHH---HHHHHHHHH-HHHhCCceeecc-ccccchhhhccCCCcceecCC--------cccccccccccccchhHHHHH Q lcl|NC_011045. 278 SLEN---LQEAIVKMS-MISSKVIGLVNP-AGITQPRRLTKAQTGDFVTGR--------PEDISFLQLEKQADFTVAKAV 344 (536) Q Consensus 278 ~L~~---l~~~~~~~~-~~a~~p~~lv~~-~g~~~~~~~~~~~~g~~~~g~--------~~~~~~~~~~~~~~~~~~~~~ 344 (536) .||. -..+-.+.+ ..+..|.+.+.. +.. ....+..+....++-|. .++...++.. + ...+.+. T Consensus 266 ~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~-~~~~~~~~~~~~i~~g~~~~~~lp~~~~~~~ie~~-~--~~~~r~~ 341 (489) T protein:vir:78 266 ELNIGHYRNSADNEESSFVVGQPTLFIYPGENL-TPQAFKEANPNGIKFGSRRGHNLGYGGSAQLIQAG-E--NNLARQN 341 (489) T ss_pred HHHHHHhhhhhHHHHHHHHcccceeeeecCccC-CcccccccCccceeeCCcccccCCCCCCcceeccC-c--chHHHHH Confidence 4432 333334444 444445443321 111 11111111122222222 1222223222 1 2334666 Q ss_pred HHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCCcce- Q lcl|NC_011045. 345 SDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQAT-QQIPELPKEAV- 422 (536) Q Consensus 345 i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~-g~lp~~~~~~v- 422 (536) +.++++++.++ .-.+.. .+.+.||++.+.+...-...|+.+...+++-+ .++|.++-+. |. + . 
+..+ T Consensus 342 l~~le~qm~~l--Ga~l~~-~~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al-----~~~l~~~a~w~G~-~-~-~~~~~ 410 (489) T protein:vir:78 342 MLDKEQQAIQI--GAQLIT-PTQQITAQSARIQRGADTSVMATIARNVSQAY-----TDALRWVAVMLGK-P-E-DTEVE 410 (489) T ss_pred HHHHHHHHHHH--hhhhcc-CCcchhHHHHHHHHHHhhHHHHHHHHHHHHHH-----HHHHHHHHHHcCC-C-C-CCceE Confidence 77777666542 112222 23468999999999999999999988877653 3444444332 32 1 1 2222 Q ss_pred ---EEEEe-chHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHH Q lcl|NC_011045. 423 ---EPTIS-TGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQ 498 (536) Q Consensus 423 ---~v~~v-s~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q 498 (536) +.+|. .+++ .++++.++...+ ...|..+.+.+++ ...||.+ .+.++++..-+ T Consensus 411 i~~n~dF~~~~~d----~~~~~al~~~~~----------~G~is~~t~~~~L-~~~gv~d----~~~e~~~~ei~----- 466 (489) T protein:vir:78 411 FRLNMDFFLEPMT----AQDRAAWMADIN----------AGLLPATAYYAAL-RKAGVTD----WTDADIKDAVA----- 466 (489) T ss_pred EEeecccCcccCC----HHHHHHHHHHHh----------cCCCCHHHHHHHH-HhCCCCC----ccHHHHHHHHh----- Confidence 23342 2232 223333333222 1124444555544 3346621 23333322111 Q ss_pred HHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCC Q lcl|NC_011045. 499 MGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQ 533 (536) Q Consensus 499 ~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q 533 (536) .+..+.+ ....+.-|+-.++ +. 
| T Consensus 467 ----~~~~~~~---~~~~g~~~~~~q~---~~--~ 489 (489) T protein:vir:78 467 ----DQPLPVA---TEVQGEIPQSAQQ---QE--K 489 (489) T ss_pred ----hcCCCcc---cCCcccCCCCccc---cc--C Confidence 1110000 0000000111111 00 1 No 135 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=96.68 E-value=0.00042 Score=39.12 Aligned_cols=401 Identities=11% Similarity=0.076 Sum_probs=169.6 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccccccc--ccccchHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYV--TPWQAVGARGLNNLASKLMLALF 78 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~--~~~dst~~~a~~~Laa~l~~~lt 78 (536) ||+--+.+. .+++..-.++.+.| .+++-|...+.. ..+..-.. -.-.++--.|++.+|+.+.+. T Consensus 1 ~~~~~~~~~--------~~~~~~~~~~~~~~---~~~~g~~~~~~~-~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~l-- 66 (460) T protein:vir:10 1 MANRIIRAL--------RELTGLDNKFNDAF---IKYIGQTFTKYD-NNGKTYLEQGYNINPDVYSCISQMAAKTVAV-- 66 (460) T ss_pred CchhHHHHH--------hhhhccCCCchHHH---HHhhccccCCCc-cchhhhhHHHHhcchHHHHHHHHHHHhhhhC-- Confidence 777443221 22222223334445 356655443221 11111111 234455667778888776432 Q ss_pred CCCcceeccCChhh--------------hhhhccChhHHHHHHHHHHHHHHHHHHHHHhc----cChHHHHHHHHHHHhh Q lcl|NC_011045. 79 PMQTWMRLTISEYE--------------AKQLLSDPDGLAKVDEGLSMVERIIMNYIESN----SYRVTLFEALKQLVVA 140 (536) Q Consensus 79 P~~~Wf~l~~~d~~--------------~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~ 140 (536) ||.-....... +.... 
.......+ ..+...+......+.+= +.+.-+..++.++..+ T Consensus 67 ---p~~v~~~~~~g~~~~~~~~~~~~~~~~~~~-~~~~~~~~-~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~ 141 (460) T protein:vir:10 67 ---PYTIKVVKDTKAYQQLNNLNISTKGLYSFT-QSLQKNRL-DTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLN 141 (460) T ss_pred ---ceEEEeccCCccchhhhhhhhhhhhhHHHH-HHhhcchh-hhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhc Confidence 22211111000 00000 00000001 11222222333333332 4556667777899999 Q ss_pred CcEEEEEecCCC----CceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEE Q lcl|NC_011045. 141 GNVLLYLPEPEG----SNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTH 216 (536) Q Consensus 141 G~~~l~~~~~~~----~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~ 216 (536) |||.+|+..+.. +.+..+..++.+.+.+..+.+|.+-... + .++.. T Consensus 142 Gnay~~i~r~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~--~----------------------------~~~~~ 191 (460) T protein:vir:10 142 GNCYFYLMSPDDGINAGVPSQMYVLPAHLIKIVLKDDINLLSTD--S----------------------------PIKSY 191 (460) T ss_pred CCeEEEEEecCCCccCceeEEEEEEcCceEEEEEcCCCceeeee--e----------------------------eeeEE Confidence 999999876432 3344455566677777776665332110 0 01111 Q ss_pred EEecCCCCceeEEEEecCccccccccccccccCceEEEeeeec-----CCCccccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 217 IYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRL-----DGESYGRSYIEEYLGDLRSLENLQEAIVKMSM 291 (536) Q Consensus 217 v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~-----~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~ 291 (536) .++. ++.... + ...=.+++|+... .+..||.||...+...+.......+....... 
T Consensus 192 ~~~~--~g~~~~---~--------------~~~evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ 252 (460) T protein:vir:10 192 MLIQ--GDQFIE---F--------------NEDEVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQ 252 (460) T ss_pred EEec--CceeEE---e--------------cccceEEEecCCCCcccccCccccccHHHHHHHHHHHHHHHHHHHHHHHh Confidence 1110 111100 0 0111344454332 24579999999999999999888888888888 Q ss_pred HHhCCceeeccccccchhhhccCC------------CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_011045. 292 ISSKVIGLVNPAGITQPRRLTKAQ------------TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFML 358 (536) Q Consensus 292 ~a~~p~~lv~~~g~~~~~~~~~~~------------~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~ 358 (536) ....|-+++..++.++++...... -|.++. -.++....++.. +.+.+. .+..+..+..|-++|-. T Consensus 253 ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~v-l~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgV 330 (460) T protein:vir:10 253 NGGVFGFIHGGSTGLTQPQADSLKQRLTEMDKSPDRLSQIAG-ASGEIAFTKISLNTDELKP-FDYLKYDQKAICNALGW 330 (460) T ss_pred cCCCcceeeecCCCCCHHHHHHHHHHHHHHhcCccccCCcee-cCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCC Confidence 878887887777766655332110 011111 122233334443 234443 45556667888888743 Q ss_pred hh--cccCCCCCCCHHHHHHHHHH-HHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCC-CcceEEEE-echHHHH Q lcl|NC_011045. 359 NS--AVQRTGERVTAEEIRYVASE-LEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELP-KEAVEPTI-STGLEAI 433 (536) Q Consensus 359 ~~--~~~~~~~r~TAtEi~~r~~E-~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~-~~~v~v~~-vs~La~a 433 (536) -. +...++...|-.-+.+.... ....|.|...+++++|-.- ++|+.. .....++| .+.+... 
T Consensus 331 Pp~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~k-------------l~~~~~~~~~~~i~~d~~~l~~l 397 (460) T protein:vir:10 331 SDKLLNNNEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKK-------------FIKRFKGYENAVIEWDISELPEM 397 (460) T ss_pred CHHHhCCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHh-------------hcCcccccCCceEEeecchhhhH Confidence 21 11112212222222222222 2224556666665554332 333321 12334554 3444333 Q ss_pred HHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChh------------hccCCHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 434 GRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTS------------GILLTEEQKQQKMAQQSMQMGM 501 (536) Q Consensus 434 ~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~------------~i~rs~~ev~~~~~q~~~q~~~ 501 (536) + .+......+++. + .+.+++ +.+.+|.+|. .++..++ + .+ T Consensus 398 ~--~d~~~~~~~~~~----g------~~T~NE----~R~~~g~~pi~~~~gD~~~~~~n~~~~~~-~---~~-------- 449 (460) T protein:vir:10 398 Q--TDMVAMASWLNT----I------PVTPNE----IRIAMKYETLNQDGMDIVFMPSNKVRIDD-V---SN-------- 449 (460) T ss_pred H--HHHHHHHHHHhC----C------CCCHHH----HHHHhCCCCCCCCCCCeeeecccccchhh-c---cc-------- Confidence 2 122222222221 0 122222 3334455442 1111000 0 00 Q ss_pred HHHHHHHHHHHHHhhhcCcchHHh Q lcl|NC_011045. 502 DNGAAALAQGMAAQATASPEAMAA 525 (536) Q Consensus 502 ~~~a~~~~~~~~~~~~~~~~~~~~ 525 (536) +....++...+ T Consensus 450 -------------~~~~~~~nq~~ 460 (460) T protein:vir:10 450 -------------NLIDSAFNQNQ 460 (460) T ss_pred -------------ccCCCcccCCC Confidence 00000000000 No 136 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=96.45 E-value=0.00062 Score=38.19 Aligned_cols=368 Identities=11% Similarity=0.027 Sum_probs=154.0 Q ss_pred HHHHHHHHH---HhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCcceeccCCh Q lcl|NC_011045. 
14 KSVYERLKN---DRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISE 90 (536) Q Consensus 14 ~~r~~~l~~---~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d 90 (536) -.-|++++. .|+.-. ..+.|..+...+........-+-.++--.|++.+|+.+.+ + ||--....+ T Consensus 1 MG~~~~~~~~~~~~~~~~-------~~~~~~~~~~~g~~~~~~~~al~~~~V~~~v~~Ia~~iA~-l----p~~~~~~~~ 68 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNETV-------DMTNPLLLQWLGVDPDTPRNQLSEATYFACLKILSESLGK-L----PLKMYQKTE 68 (411) T ss_pred CchHHHHHhhccCccccc-------ccchHHHHHHhcCcccChhhhhccHHHHHHHHHHHHhHhh-C----ceeEEEecC Confidence 122222211 111000 0111111110010000000111123334456666655532 2 222111111 Q ss_pred hhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----cChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecce Q lcl|NC_011045. 91 YEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SN----SYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSS 165 (536) Q Consensus 91 ~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~ 165 (536) ....+. .+..+...|. +- +.+.=++..+.++..+|||.+++..+ ++.+..+..+|.+. T Consensus 69 ~~~~~~----------------~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~-~g~~~~l~~l~~~~ 131 (411) T protein:vir:81 69 RGIVKS----------------DREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS-GPQLQALWILPSQY 131 (411) T ss_pred Cceeee----------------cccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCceEEEEEECCce Confidence 100000 0111222222 22 34455677788889999999998766 45677777788888 Q ss_pred EEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccc Q lcl|NC_011045. 166 YVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYP 245 (536) Q Consensus 166 ~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~ 245 (536) +.+..|..|.+..- ..-+|+...+ ++|... . 
T Consensus 132 v~~~~~~~~~~~~~------------------------------~~~~~~~~~~------------~~g~~~-----~-- 162 (411) T protein:vir:81 132 VTIVVDDRGLLGEK------------------------------NAIWYRYNDP------------YDGKMY-----V-- 162 (411) T ss_pred EEEEEcCccccccc------------------------------ceEEEEEEec------------CCceEE-----E-- Confidence 87777776631100 0001111111 111111 0 Q ss_pred cccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhcc----------CC Q lcl|NC_011045. 246 KEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTK----------AQ 315 (536) Q Consensus 246 ~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~----------~~ 315 (536) +..--++++|+....+..||.||..-+...+.......+.......-...|..++.-++.++++.... +. T Consensus 163 ~~~~eiih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~ 242 (411) T protein:vir:81 163 FRNDEILHFKTSVTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQEARDRLVKGFEQFANGS 242 (411) T ss_pred EccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCc Confidence 11112566776555566899999999999999999999999988888888998776666555543211 10 Q ss_pred --CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhccc---CCCCCCCHHHHHHHHHHHHHHhhhhH Q lcl|NC_011045. 316 --TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQ---RTGERVTAEEIRYVASELEDTLGGVY 389 (536) Q Consensus 316 --~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~---~~~~r~TAtEi~~r~~E~~~~LG~v~ 389 (536) .|.+. --.++....++.. +.+.+.. +..+..+..|-.+|-...... .++..-++++.. T Consensus 243 ~n~g~~~-vl~~g~~~~~l~~~~~d~q~~-e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~-------------- 306 (411) T protein:vir:81 243 KNAGKII-PVPLGMKLVPLDIKLTDSQFF-ELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQN-------------- 306 (411) T ss_pred cccCCce-ecCCCceEEEccCCHHHHHHH-HHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHH-------------- Confidence 01111 1122233334432 2344433 455666788888885432111 122222333221 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCCC-CcceEEEE-echHHH---HHHHHHHHHHHH--HH--HHH---Hhhcch-h Q lcl|NC_011045. 
390 SILSQELQLPLVRVLLKQLQATQQIPELP-KEAVEPTI-STGLEA---IGRGQDLDKLER--CV--AAW---AALAPM-R 456 (536) Q Consensus 390 ~rl~~E~l~Pli~r~~~il~~~g~lp~~~-~~~v~v~~-vs~La~---a~r~~~~~~l~~--~~--~~~---~~~~p~-~ 456 (536) ..+...-+.|++.++-..+.+. ++++-. +....++| ++.|-. ..+...++.+.. ++ +.+ -.+.|. - T Consensus 307 ~~f~~~~l~P~~~~ie~~l~~~-ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~g 385 (411) T protein:vir:81 307 LAFYVDTLLYVLKQYEEEITYK-ILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPADDY 385 (411) T ss_pred HHHHHHHHHHHHHHHHHHHHhh-cCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 1222333555555444444322 333211 12223333 222211 111111111111 00 111 112221 1 Q ss_pred hh------hcCCHHHHHHHHHHHcCCCh Q lcl|NC_011045. 457 DD------PDINLAMIKLRIANAIGIDT 478 (536) Q Consensus 457 ~~------~~id~d~~~~~~a~~~Gv~p 478 (536) .| .++-.+.+-+.... -| +- T Consensus 386 gD~~~~~~n~~pl~~~~~~~~k-gG-d~ 411 (411) T protein:vir:81 386 GNNLMANGNYIPLSMLGANYGK-GG-DS 411 (411) T ss_pred CCeeeeccCccchhhhhhhhcc-CC-CC Confidence 11 11223333333221 22 22 No 137 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=96.44 E-value=0.00062 Score=38.16 Aligned_cols=333 Identities=13% Similarity=0.083 Sum_probs=131.9 Q ss_pred HhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCC--Cc-ceeccCChhhhhhhccChhH--HHHHHHHHH Q lcl|NC_011045. 37 YTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFPM--QT-WMRLTISEYEAKQLLSDPDG--LAKVDEGLS 111 (536) Q Consensus 37 ~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~--~~-Wf~l~~~d~~~~~~~~~~~~--~~~v~~~L~ 111 (536) ..++-+ .+-. +..+.. .+.-.+..++. .+ |+..-..+. ......... 
...|..-.+ T Consensus 1 m~m~~~-~~~~-----~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~--g~~v~~~~al~~~~v~~~v~ 61 (392) T protein:vir:74 1 MILPIL-NFIN-----QTNDPP-----------EAGSVQSYFPDGNDAQIMESLLGDN--NEWVSARAALRNSDLFSIIL 61 (392) T ss_pred Ccchhh-hhhh-----cccCcc-----------cccccccccccCchhhhhhhccCCC--CcccchhhhhcchHHHHHHH Confidence 000000 0000 000000 00000111110 01 111100000 000000000 011211111 Q ss_pred HHHHHH------------HHHHHhcc----ChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCCCCC Q lcl|NC_011045. 112 MVERII------------MNYIESNS----YRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGN 175 (536) Q Consensus 112 ~ve~~~------------~~~l~~sn----f~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~ 175 (536) .+...+ ...+.+-| .+.=+...+.++.++|||++++..+..+.++.+..++...+.+..+.+|. T Consensus 62 ~ia~~ia~lp~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~ 141 (392) T protein:vir:74 62 QLSSDLAIVKINAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYEN 141 (392) T ss_pred HHHHhhccCceeeccchhhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCc Confidence 111111 11122222 25556667779999999999987766666655565665666666555442 Q ss_pred eEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccccCceEEEe Q lcl|NC_011045. 176 VLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIR 255 (536) Q Consensus 176 v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~r 255 (536) . ++ |+ ...++...... .. +..--+++.+ T Consensus 142 ~--~~---------------------------------y~--------------~~~~~~~~~~~-~~--~~~~evih~~ 169 (392) T protein:vir:74 142 G--MY---------------------------------YN--------------ITFDDPKIEPI-LQ--APQSDLIHMK 169 (392) T ss_pred e--EE---------------------------------EE--------------EEecCCcccee-EE--EcCccEEEec Confidence 1 11 10 01111000000 00 0111155666 Q ss_pred eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecc-ccccchh----h----hccCC-CcceecCCcc Q lcl|NC_011045. 
256 MVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNP-AGITQPR----R----LTKAQ-TGDFVTGRPE 325 (536) Q Consensus 256 w~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~-~g~~~~~----~----~~~~~-~g~~~~g~~~ 325 (536) +...+|..||.||...+...+.......+.......-...|..++.- ++....+ . +.... .|.+ ..-.+ T Consensus 170 ~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~n~g~~-~vl~~ 248 (392) T protein:vir:74 170 LLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGP-VVLDD 248 (392) T ss_pred CCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCe-eecCC Confidence 66677889999999999999999999999999999999999877643 2222211 1 11111 1111 11123 Q ss_pred ccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHH Q lcl|NC_011045. 326 DISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVL 404 (536) Q Consensus 326 ~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~ 404 (536) +....++.. +.+.+. .+..+..+..|-++|-......-+...-|.. +.+...-....|.|.+..+.+++-.-|+.. T Consensus 249 g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~-~e~~~~~~~~~l~p~~~~ie~~l~~~l~~~- 325 (392) T protein:vir:74 249 LEEFTALEIKSNVAQL-LSQTDWTSKQYAKVYGLPDSYIGGQGDQQSS-IQQISGMYASALNRYLRPAISELEYKLSDH- 325 (392) T ss_pred CceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccH-HHHHHHHHHHHHHHHHHHHHHHHHHhccch- Confidence 333444443 334553 4556667788888885432111122222221 112223345567777777777764332211 Q ss_pred HHHHHhcCCCCCCCCcceEEEE-echHHHHH------HH-----HHHHHHH----------HHHHHHHhhc-----chhh Q lcl|NC_011045. 405 LKQLQATQQIPELPKEAVEPTI-STGLEAIG------RG-----QDLDKLE----------RCVAAWAALA-----PMRD 457 (536) Q Consensus 405 ~~il~~~g~lp~~~~~~v~v~~-vs~La~a~------r~-----~~~~~l~----------~~~~~~~~~~-----p~~~ 457 (536) +. -+++..+ .++..++. ++ .++-.+. +....+..+. 
..++ T Consensus 326 ------------~~-~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pne~r~~enl~~~~~Gd~~~p~p 392 (392) T protein:vir:74 326 ------------IS-VNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQSNEPVP 392 (392) T ss_pred ------------hc-ccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCccccchhcCCCCCCCCCCCCCCC Confidence 00 0000000 01111000 00 0000110 0001111111 0123 No 138 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=96.37 E-value=0.00069 Score=37.90 Aligned_cols=305 Identities=10% Similarity=0.063 Sum_probs=123.3 Q ss_pred eEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEE-ecCcccc-ccccccccc--cCc- Q lcl|NC_011045. 176 VLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEE-VEGMEVQ-GSDGTYPKE--ACP- 250 (536) Q Consensus 176 v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~-v~g~~i~-~~~~~~~~~--~~P- 250 (536) |-+++++.. +..+.+. .+.+|+.. ...+|.. -++..+. ...+....+ ..| T Consensus 1 v~Eivw~~~-----------------------~g~~~~~-~l~~r~~~-~~~~f~~~~~~~l~~~~~~~~~g~~~~~lp~ 55 (355) T protein:vir:78 1 MFEQVYRIE-----------------------NGRARLG-KLAWRPPR-TISRFDVAPDGGLVAIEQWGVFGKATVRIPV 55 (355) T ss_pred CeEEEEEee-----------------------CCeEEEe-eeeecCcc-ceeeeeeccCCceeEEEecCCCCCCcceecc Confidence 222221110 0000000 00011000 0001111 1121111 111111111 222 Q ss_pred --eEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc-eeeccccccc------------------hh Q lcl|NC_011045. 251 --YIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVI-GLVNPAGITQ------------------PR 309 (536) Q Consensus 251 --~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~-~lv~~~g~~~------------------~~ 309 (536) |++.|....+|+.||.|....+..-..--+...+..+..+++-..|. +..-+.+... .. 
T Consensus 56 ~kfi~~~~~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~ 135 (355) T protein:vir:78 56 DRLVVFVNEREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGL 135 (355) T ss_pred CCEEEEEeCCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHH Confidence 88899999999999999999999999988999999999999875553 3333332111 00 Q ss_pred hhc---cCC--CcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCC---CCHHHHHHHHHHH Q lcl|NC_011045. 310 RLT---KAQ--TGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGER---VTAEEIRYVASEL 381 (536) Q Consensus 310 ~~~---~~~--~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r---~TAtEi~~r~~E~ 381 (536) .++ ..+ .|.++|-. ..+.++... ++-......|+.+.+.|+++++...+....... ....|+.. +. T Consensus 136 ~~~~~i~~g~~a~~iip~g-~~ie~~ea~--g~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~vh~---~v 209 (355) T protein:vir:78 136 QLAKEFRAGEAAGGYIPHG-ANFTLTGVQ--GKLPEMDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGDTFA---SF 209 (355) T ss_pred HHHHHhhCCcceeEeecCC-ceEEEeecC--CCcccHHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHHHHH---HH Confidence 010 112 23344433 344444422 222224468899999999999876554322112 23345532 22 Q ss_pred HH-HhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhc Q lcl|NC_011045. 382 ED-TLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPD 460 (536) Q Consensus 382 ~~-~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~ 460 (536) .. .+-.-...|...|..-||..++.+. .|.-.+.| +++|-.. .. +...+...++.+..++- . 
T Consensus 210 ~~~~~~aD~~~i~~~ln~~li~~l~~lN--~~~~~~~P----~~~~~~~-~~-----~~~~~a~~~~~l~~~G~-----~ 272 (355) T protein:vir:78 210 FTGSLNAVMKHIADVTQQHVVEDLVDQN--WGPEEPAP----RLVPAQL-GK-----EQPVTAEAIRALVECGA-----F 272 (355) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCCCCCCC----EEEecCc-Ch-----hHHHHHHHHHHHHhCCC-----c Confidence 22 2222223333333333444444332 22212222 3333211 11 11123344444554442 1 Q ss_pred CCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHhh-hcCcchHHhhhhcCCC Q lcl|NC_011045. 461 INLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQG-------MAAQA-TASPEAMAAAADSVGL 532 (536) Q Consensus 461 id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~-------~~~~~-~~~~~~~~~~~~~~~~ 532 (536) +..+.+..++.+.+|+ |..-- .++++....+. ....++++...+ .+... +..+..+.. .-+..+ T Consensus 273 ~~~~~~~~~~~e~~gi-p~p~~-~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~a~~~~a~~~~~~~~-~~~~~~ 344 (355) T protein:vir:78 273 TADPELEKDLRARYGL-PAPAE-RDDGADAAAAK-----AAGRRRAKRLPGQRQGAALPSRSPRADPPRRRGP-LRRRPR 344 (355) T ss_pred cccHHHHHHHHHHhCC-CCCCC-CCcccCCcccc-----ccccccccccCCccccccccccCCCCCChhhhHH-HHHHhh Confidence 3445667788899999 43221 12222110000 000000000000 00000 001111111 112224 Q ss_pred CCCC Q lcl|NC_011045. 533 QPGI 536 (536) Q Consensus 533 q~~~ 536 (536) .||- T Consensus 345 ~~~~ 348 (355) T protein:vir:78 345 HPAH 348 (355) T ss_pred cccc Confidence 4444 No 139 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=96.29 E-value=0.00078 Score=37.63 Aligned_cols=441 Identities=9% Similarity=0.053 Sum_probs=160.4 Q ss_pred CC---C--ccccccHHH------H-HHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccc------------ccccc Q lcl|NC_011045. 1 MA---E--KRTGLAEEG------A-KSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS------------TDYVT 56 (536) Q Consensus 1 Ma---~--~~~~~~~~~------~-~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~------------~~~~~ 56 (536) |- . ++...+.+. . ....+..++.. 
=-.|..|.........+- +.+-+ T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~---------~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~~ 81 (547) T protein:vir:63 11 GVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNK---------EVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVLK 81 (547) T ss_pred cCCccccccccccccccchhhhhhhHHHHHHhhccc---------chhhhchhhheeecccccccCCccCChhHHHHHHH Confidence 11 0 111110000 0 01112222111 011344443221111110 01112 Q ss_pred cc--cchHHHHHHHHHHHHHHhhcCC-----CcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHH Q lcl|NC_011045. 57 PW--QAVGARGLNNLASKLMLALFPM-----QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVT 129 (536) Q Consensus 57 ~~--dst~~~a~~~Laa~l~~~ltP~-----~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~ 129 (536) .| ..+...|+++.|..+.+...|. ..=|.+.+.+..-.....+......++.+|..+--... =.+.+|..- T Consensus 82 ~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~--p~~~s~~~f 159 (547) T protein:vir:63 82 KFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDND--INRDSFSSF 159 (547) T ss_pred HhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCC--CccchHHHH Confidence 33 2334466677776665433343 22244444332211111111111122222221100000 000123445 Q ss_pred HHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCc Q lcl|NC_011045. 
130 LFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADE 209 (536) Q Consensus 130 ~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~ 209 (536) +...+.|+.++|||++++..+..+.++.+..++.....+..+.+|.+..- T Consensus 160 ~~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~------------------------------ 209 (547) T protein:vir:63 160 VKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDN------------------------------ 209 (547) T ss_pred HHHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCccccccC------------------------------ Confidence 56678888999999999887776667666666666666666666542210 Q ss_pred eEEEEEEEEecCCCCceeEEEEecCccccccccccccccCceEEEeeeecC---CCccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 210 TIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLD---GESYGRSYIEEYLGDLRSLENLQEAI 286 (536) Q Consensus 210 ~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~---ge~YGrgp~~~~l~d~~~L~~l~~~~ 286 (536) .+. +++.++|..... ++-++ ++++|.+... ...||.||...+...+.......+.. T Consensus 210 ~~~---------------y~~~~~~~~~~~----~~~~e--iih~r~n~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~ 268 (547) T protein:vir:63 210 GNR---------------FVQVIDQKIVAT----FNARE--MAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFN 268 (547) T ss_pred ceE---------------EEEEcCCcEEEE----ecccc--EEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHH Confidence 000 011111111100 00011 3333333222 24699999999999999998888888 Q ss_pred HHHHHHHhCCceee--ccccccchhhh---c----c---CC--Ccc--eecCCccccccccccc-ccchhHHHHHHHHHH Q lcl|NC_011045. 287 VKMSMISSKVIGLV--NPAGITQPRRL---T----K---AQ--TGD--FVTGRPEDISFLQLEK-QADFTVAKAVSDAIE 349 (536) Q Consensus 287 ~~~~~~a~~p~~lv--~~~g~~~~~~~---~----~---~~--~g~--~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~ 349 (536) .....-...|..++ +.+..++.+.. . . +. .|. ++. .+++...++.. ..+.+ ..+..+... 
T Consensus 269 ~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~vl~--~~g~~~~~l~~~~~d~q-fle~~~~~~ 345 (547) T protein:vir:63 269 DRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVS--AEDVKFVNMTPSARDME-FEKWLNYLI 345 (547) T ss_pred HHHHHcCCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCccccccccccc--CCCceEEEcCCChhHHH-HHHHHHHHH Confidence 88888878887554 33333333221 1 1 10 111 121 22334444432 33455 334556677 Q ss_pred HHHHHHHhhhhcc--cC-C-------CCCCCHHHHHHHHHH-HHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCC Q lcl|NC_011045. 350 ARLSFAFMLNSAV--QR-T-------GERVTAEEIRYVASE-LEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELP 418 (536) Q Consensus 350 ~rI~~af~~~~~~--~~-~-------~~r~TAtEi~~r~~E-~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~ 418 (536) ..|-++|-..... .. + ...+|-.-+.+.... ....|.|.+.+++.+|-.- ++++. T Consensus 346 ~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~ie~~ln~~-------------L~~~~- 411 (547) T protein:vir:63 346 NVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKH-------------IVAEF- 411 (547) T ss_pred HHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhh-------------ccccc- Confidence 8888888543211 01 1 111222112222222 2334555555555544222 33333 Q ss_pred CcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChh-----------hccCCHHH Q lcl|NC_011045. 419 KEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTS-----------GILLTEEQ 487 (536) Q Consensus 419 ~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~-----------~i~rs~~e 487 (536) +..+.++|....... .. +...+...+. . + .+..++ +-+.+|.+|. .+....+. T Consensus 412 ~~~~~~~f~~~~~~~-~~-~~~~~~~~~~---~-g------~lT~NE----~R~~~gl~P~~egGD~~~~~~~~~~~~~~ 475 (547) T protein:vir:63 412 GDKYTFQFVGGDIKS-EL-ESVKILAEKA---K-V------AMTVNE----VRKELNLPGDVIGGDIPLNGVIVQRIGQL 475 (547) T ss_pred CCceEEEeecccccc-HH-HHHHHHHHHh---C-C------CcCHHH----HHHHhCCCCCCCCCceeeccccccccccc Confidence 234666665432211 11 1111111111 0 0 111222 2223343331 00000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-HHhhhcC-------------cchHHhhhhcCCCCCCC Q lcl|NC_011045. 
488 KQQKMAQQSMQMGMDNGAAALAQGM-AAQATAS-------------PEAMAAAADSVGLQPGI 536 (536) Q Consensus 488 v~~~~~q~~~q~~~~~~a~~~~~~~-~~~~~~~-------------~~~~~~~~~~~~~q~~~ 536 (536) .+...-+-+.++...++.....++. ..+.... .+....-+..+|.|+.- T Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 538 (547) T protein:vir:63 476 MQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDNANAGKQGMK 538 (547) T ss_pred ccccCCccccchhhccccccccCCCCCCCCCCCCCCcccCCCcCccccccCccccchhhhhcC Confidence 0000000000000000000000000 0000000 00000111111112111 No 140 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=96.27 E-value=0.0008 Score=37.56 Aligned_cols=352 Identities=11% Similarity=0.039 Sum_probs=152.5 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccc-cchHHHHHHHHHHHHHHhhcC Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPW-QAVGARGLNNLASKLMLALFP 79 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~-dst~~~a~~~Laa~l~~~ltP 79 (536) |. .|+.+...|..-...| ..++.+...........-...... .++--.|++.+|+.+. .+ | T Consensus 1 Mg-------------~f~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia-~~-~ 62 (382) T protein:vir:48 1 MP-------------IFNLATESPPDNQGGF---FDVVDSDFLASLKGNEWVSAETALRNSDLFSIINQLSNDLA-TV-K 62 (382) T ss_pred Cc-------------cccccccCCccccccc---ccchhhhccccccCCcccchHhhhccHHHHHHHHHHHHhhc-cC-c Confidence 32 2222222221111111 111111111110000000000111 2333345566665552 22 2 Q ss_pred CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhc----cChHHHHHHHHHHHhhCcEEEEEecCCCCce Q lcl|NC_011045. 80 MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESN----SYRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~ 155 (536) | ++. +.... . 
.+.+- +.+.=+..++.+|..+|||++++..+..+.+ T Consensus 63 ---~-~~~--~~~~~---------------------~---L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~ 112 (382) T protein:vir:48 63 ---L-ITS--RKKLQ---------------------G---IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRD 112 (382) T ss_pred ---e-eee--cchhh---------------------h---hhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcE Confidence 2 111 11000 0 11122 3355566777788899999999987776667 Q ss_pred eeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCc Q lcl|NC_011045. 156 NPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGM 235 (536) Q Consensus 156 ~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~ 235 (536) +.+..++...+-+..+.+|... + |. ..+++. T Consensus 113 ~~l~~i~~~~v~v~~~~~~~~~--~---------------------------------y~--------------~~~~~~ 143 (382) T protein:vir:48 113 MKWEYLRPSQVSFNRLDNKDGI--Y---------------------------------YN--------------ITFDDP 143 (382) T ss_pred EEEEEEcCceeEEEEcCCCCeE--E---------------------------------EE--------------EEecCc Confidence 7777777777776666554211 0 00 011111 Q ss_pred cccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhcc-- Q lcl|NC_011045. 236 EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTK-- 313 (536) Q Consensus 236 ~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~-- 313 (536) .... ... +..--++++|+...++..||.||...+...+...+...+.......-...|.+++.-++..+.+.... 
T Consensus 144 ~~~~-~~~--~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~ 220 (382) T protein:vir:48 144 RIPP-KQH--VPQNDVLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLDFKTKLS 220 (382) T ss_pred cccc-eeE--EcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHH Confidence 1000 001 11223667777777788999999999999999999999999999999999998887666655543221 Q ss_pred -----C--CCcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_011045. 314 -----A--QTGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTL 385 (536) Q Consensus 314 -----~--~~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~L 385 (536) + ..|.++. -.++....++.. +.+.+. .+..+..+..|-++|-......-....-|..+ .....-....| T Consensus 221 ~~~~~~~~n~g~~~v-l~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~-~~~~~~~~~~l 297 (382) T protein:vir:48 221 RSRQAMKQMQGGPLV-LDDLEDFTPLEIKSNVSQL-LKQADWTTGQFAKVYGIPDNVVGGQGDQQSSL-EMSSDLYSKAV 297 (382) T ss_pred HHHHhhccCCCCeeE-cCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccHH-HHHHHHHHHHH Confidence 0 1122111 112223334432 334443 35566777888888854321111111112221 11233445566 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCC--C--------cceEEEEechHHHHHHHHH----HHHHHHHHHHHHh Q lcl|NC_011045. 386 GGVYSILSQELQLPLVRVLLKQLQATQQIPELP--K--------EAVEPTISTGLEAIGRGQD----LDKLERCVAAWAA 451 (536) Q Consensus 386 G~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~--~--------~~v~v~~vs~La~a~r~~~----~~~l~~~~~~~~~ 451 (536) -|.+..+.+|+-.-|+.+. .....+.+. + ++++-.+.++-........ ...+...-+.... T Consensus 298 ~p~~~~i~~~l~~~l~~~~-----~~~~~~~~~~~~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~~~~~~~ 372 (382) T protein:vir:48 298 SRYLRPFLSELSQKLSCDV-----DADIFPAVDPTGSNYISRINSLVKTGTLAQNQGLYILQQAEILPKELPNGENPNST 372 (382) T ss_pred HHHHHHHHHHHHHHhcChh-----hhhhhhhhccchhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcchhhhhcCCCC Confidence 6777777666533322111 001111111 0 0111112222111111000 0000111111111 Q ss_pred hc-chhhhhcCC Q lcl|NC_011045. 
452 LA-PMRDDPDIN 462 (536) Q Consensus 452 ~~-p~~~~~~id 462 (536) +. .+.-+. | T Consensus 373 ~~GGd~~~~--~ 382 (382) T protein:vir:48 373 LKGGEEDGQ--D 382 (382) T ss_pred CCCCCCCCC--C Confidence 11 111111 2 No 141 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=96.22 E-value=0.00086 Score=37.38 Aligned_cols=446 Identities=11% Similarity=0.042 Sum_probs=185.3 Q ss_pred CCCccccccH---HHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCc-cc----cccc--ccccchHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAE---EGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDN-AS----TDYV--TPWQAVGARGLNNLA 70 (536) Q Consensus 1 Ma~~~~~~~~---~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~-~~----~~~~--~~~dst~~~a~~~La 70 (536) .+= .-+..+ ......|+.....|. ..|. -+....+.... .. .+.. -..++.+..+++.++ T Consensus 11 ~sP-~~~~~R~~ar~~~~~y~aa~~~r~---~~~~------~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 80 (502) T protein:vir:79 11 FSP-GWKAARLRSRAVIQAYEAVKTTRT---HKAR------RENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLE 80 (502) T ss_pred cCh-HHHHHHHhhHHHHhhccccCcccc---cCCC------CCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 221 111111 112233444433321 1121 11111110000 00 0111 136788999999999 Q ss_pred HHHHH--hhcC-CCc-ceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEE Q lcl|NC_011045. 71 SKLML--ALFP-MQT-WMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLY 146 (536) Q Consensus 71 a~l~~--~ltP-~~~-Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~ 146 (536) +.+++ +++| +++ |-.+... 
.++++ .-...-+.|.+.| +.=.+.+||.....++...++-|-+++- T Consensus 81 ~nvVG~ggi~~~~~~~~~~~~~~-~~~~~-----~ie~~w~~Wa~~~-----D~~g~~~f~~~q~l~~r~~~~dGE~f~~ 149 (502) T protein:vir:79 81 ERVVGKNGIIVEPHPVLRNGAIA-RDLAA-----EIRTRWSEWSVSP-----EVTGQFTRPMLERLMLRTWLRDGEVFAQ 149 (502) T ss_pred HhhccCCceeeeeccCCCChhHH-HHHHH-----HHHHHHHHhhcCc-----CccccCCHHHHHHHHHHHHHhCCceEEE Confidence 99996 4555 343 2211111 11111 1112233343322 2223568999999999999999998775 Q ss_pred EecCCC-----Ccee--eEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEe Q lcl|NC_011045. 147 LPEPEG-----SNYN--PMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYL 219 (536) Q Consensus 147 ~~~~~~-----~~~~--~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p 219 (536) +..... +.++ .++.+....+ ... . .+ .-.|+..|+. T Consensus 150 ~~~~~~~~~~~g~~~~l~lq~iepd~l--~~~-~-----------------------------~~-----~~~i~~GVe~ 192 (502) T protein:vir:79 150 MVSGRINSLTPSAGVHFWLEALEPDFI--PMT-S-----------------------------DE-----SNRLNQGVFV 192 (502) T ss_pred EeecccCccCCCcccceEEEEecchhc--CCC-C-----------------------------CC-----CCeeEeeeEE Confidence 432221 1222 2232322211 100 0 00 1135566666 Q ss_pred cCCCCceeEEEEecCccccccccccccccCc---eEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_011045. 220 DEDSGEYIRYEEVEGMEVQGSDGTYPKEACP---YIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKV 296 (536) Q Consensus 220 ~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P---~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p 296 (536) +..|....+|- .. .+ ++..........| +++.-....+|..=|.+..-.+|..++.|+.+..+.+.++..++.. 
T Consensus 193 d~~Gr~~aY~i-~~-~h-Pgd~~~~~~~rvpA~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~ 269 (502) T protein:vir:79 193 DDWGRPEKYLV-YK-SR-PVSGRQMETKEVDAERMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAAL 269 (502) T ss_pred CCCCceEEEEE-ee-cC-CCCCcccceeEechhheEEeecccCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhh Confidence 55543322221 11 10 0110000011222 3444445678999999999999999999999999999999999988 Q ss_pred ceeeccc-cccc---------hhhhccCCCcceecC-Ccc-cccccccc-cccchhHHHHHHHHHHHHHHHHHh--hhhc Q lcl|NC_011045. 297 IGLVNPA-GITQ---------PRRLTKAQTGDFVTG-RPE-DISFLQLE-KQADFTVAKAVSDAIEARLSFAFM--LNSA 361 (536) Q Consensus 297 ~~lv~~~-g~~~---------~~~~~~~~~g~~~~g-~~~-~~~~~~~~-~~~~~~~~~~~i~~~~~rI~~af~--~~~~ 361 (536) ...+..+ +-.. -.+.....+|.++.. .++ ++.+..-. ..++|. .....+...|..++= +..+ T Consensus 270 ~~fi~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~~L~pGe~i~~~~p~~p~~~~~---~f~~~~lr~iaaglGi~ye~l 346 (502) T protein:vir:79 270 GMYIRKGDGQSYEPDGNGSKENERELTIQPGIIYDDLKPGEEIGMVKSDRPNPNLE---TFRNGQLRAVAAGSRLSFSST 346 (502) T ss_pred eeeeecCCCcccccccCCCCCccccccccCCccccccCCCceeeeeCCCCCCCCHH---HHHHHHHHHHHhhcCCCHHHH Confidence 8776522 1100 011122345665542 332 33332211 122332 333333344544431 1122 Q ss_pred ccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcc----eEEEEechH-HHHHHH Q lcl|NC_011045. 362 VQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEA----VEPTISTGL-EAIGRG 436 (536) Q Consensus 362 ~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~----v~v~~vs~L-a~a~r~ 436 (536) . .|-.. +=.-+++-..|..+.+--.=..|...|+.|+..+++......|.+|-+.... ++++++.|= ...=-. 
T Consensus 347 t-~D~s~-nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~ 424 (502) T protein:vir:79 347 A-RNYNG-TYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPV 424 (502) T ss_pred h-ccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChH Confidence 1 12222 3444455555555554444455666789999999999999999987433221 233333220 000000 Q ss_pred HHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011045. 437 QDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQA 516 (536) Q Consensus 437 ~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~ 516 (536) .+++.....+. .-+. ....++...|.||. |+-..++.+.+.++.....-.. .+..... T Consensus 425 Ke~~a~~~~i~-----------~Gl~---t~~~~~a~~G~D~~-------~v~~q~a~e~~~~~~~Gl~~~~-~~~~~~~ 482 (502) T protein:vir:79 425 KEAEAWKIQIR-----------GGAA---TESDWVRAGGRNPD-------DVKRRRKAEIDENRKLDLVFDT-DPASDKG 482 (502) T ss_pred HHHHHHHHHHH-----------cCCC---CHHHHHHHcCCCHH-------HHHHHHHHHHHHHHHcCCCCCC-CCCCCCC Confidence 00000000000 0000 00122233455443 2222222222111111100000 0000000 Q ss_pred --hcCcchHHhhhhcCCCCC Q lcl|NC_011045. 517 --TASPEAMAAAADSVGLQP 534 (536) Q Consensus 517 --~~~~~~~~~~~~~~~~q~ 534 (536) ....+..........-|. T Consensus 483 ~~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 483 GSSAATKRQEPQHTDDQSEE 502 (502) T ss_pred CCCCCCCCCCCCCCCCCCCC Confidence 000000000111111112 No 142 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=96.00 E-value=0.0011 Score=36.70 Aligned_cols=354 Identities=10% Similarity=0.030 Sum_probs=148.1 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccc-cchHHHHHHHHHHHHHHhhcCCCcceeccCChhh Q lcl|NC_011045. 
14 KSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPW-QAVGARGLNNLASKLMLALFPMQTWMRLTISEYE 92 (536) Q Consensus 14 ~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~-dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~ 92 (536) -+.|+.++..+.+....-.....+..+...........-...+.. .++--.|++.+|+.+ +. +|- + +.+.. T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~i-a~-~p~----~--~~~~~ 72 (386) T protein:vir:49 1 MPIFNITNLATESPPINQESFFDIADSDFLASLNSSEWVSAENALKNSDLFSIISQLSNDL-AT-AKI----T--TSRKQ 72 (386) T ss_pred CchhhhhccCCCCcccchhhhhhhhhccccccccCCceechhhhhccHHHHHHHHHHHHHh-hh-Cce----e--eccch Confidence 244555544443321111112222222222111110000000111 233334555555544 33 231 1 11111 Q ss_pred hhhhccChhHHHHHHHHHHHHHHHHHHHHHhc----cChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEE Q lcl|NC_011045. 93 AKQLLSDPDGLAKVDEGLSMVERIIMNYIESN----SYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVV 168 (536) Q Consensus 93 ~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v 168 (536) .. . .+.+- +.+.-+...+.++..+|||++++..+..+.++.+..++.+.+-+ T Consensus 73 ~~-------------~-----------l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v 128 (386) T protein:vir:49 73 LQ-------------G-----------IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSF 128 (386) T ss_pred hh-------------h-----------hhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEE Confidence 00 0 11112 34555667788899999999998877776777777777777766 Q ss_pred eeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccccccccccccc Q lcl|NC_011045. 169 QRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEA 248 (536) Q Consensus 169 ~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~ 248 (536) ..+.+|... +.++. ..+... +.... +.. 
T Consensus 129 ~~~~~~~~~--~y~~~-----------------------------------------------~~~~~~-~~~~~--~~~ 156 (386) T protein:vir:49 129 NRLDNQNGL--YYNIT-----------------------------------------------FDDPHI-APKQH--VPQ 156 (386) T ss_pred EEcCCCceE--EEEEE-----------------------------------------------EcCccc-cceeE--Ecc Confidence 666543211 10100 000000 00000 111 Q ss_pred CceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhh----------hccCCCcc Q lcl|NC_011045. 249 CPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRR----------LTKAQTGD 318 (536) Q Consensus 249 ~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~----------~~~~~~g~ 318 (536) .=++++|+...++..||.||..-+...+.......+.......-...|.+++.-++..+.+. ...+..+. T Consensus 157 ~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~g~~ 236 (386) T protein:vir:49 157 NDILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGP 236 (386) T ss_pred ccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHHHHHHHHHHHHhccCCCCc Confidence 12566677667788999999999999999999999999999998899997776555544422 11111122 Q ss_pred eecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-cccCC-CCCCCHHHHHHHHHHHHHHhhhhHHHHHHH Q lcl|NC_011045. 319 FVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQRT-GERVTAEEIRYVASELEDTLGGVYSILSQE 395 (536) Q Consensus 319 ~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~-~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E 395 (536) ++- .++....++.. +.+.+ ..+..+..+..|-.+|-... +...+ ...-+++.+.. 
-....+-|.+..+.++ T Consensus 237 ~vl--~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~---~~~~~i~~~l~~i~~~ 310 (386) T protein:vir:49 237 LVL--DDLEDFTPLEIKSNVAQ-LLSQADWTTGQFAKVYGIPESIVGGDGDQQSSLEMIYN---IYFKSVSRYLRPFVSE 310 (386) T ss_pred eec--CCCceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCccchHHHHHH---HHHHHHHHHHHHHHHH Confidence 221 22223334443 23444 34556777888888885432 21112 22223332221 2233444554444444 Q ss_pred HHHHHHHHHHHHHHhcCCCCCCCC--------cceEEEEechHHHHHHHHH--H--HHHHHHHHHHH-hh-cchhhhhc Q lcl|NC_011045. 396 LQLPLVRVLLKQLQATQQIPELPK--------EAVEPTISTGLEAIGRGQD--L--DKLERCVAAWA-AL-APMRDDPD 460 (536) Q Consensus 396 ~l~Pli~r~~~il~~~g~lp~~~~--------~~v~v~~vs~La~a~r~~~--~--~~l~~~~~~~~-~~-~p~~~~~~ 460 (536) |-.-|...+. .....+-.... ..++--+.|+-..-..... + ..+........ .+ +.+.=+++ T Consensus 311 ~~~~l~~~~~---~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~~~~~~~~~~~~~~~gGd~~~~~ 386 (386) T protein:vir:49 311 MSKKLSCEVD---VDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILPKELPDGKNPNRTSLKGGEINEQD 386 (386) T ss_pred HHHHhcchhc---ccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCCcCcchhccCCCCCCCCCCCCCC Confidence 3221111100 00000000000 0011111121111110000 0 00000000000 01 12222233 No 143 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=95.85 E-value=0.0014 Score=36.30 Aligned_cols=353 Identities=11% Similarity=0.040 Sum_probs=154.2 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccc-ccccccccchHHHHHHHHHHHHHHhhcCCCcceeccCChhh Q lcl|NC_011045. 14 KSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS-TDYVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYE 92 (536) Q Consensus 14 ~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~-~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~ 92 (536) -..|+.++..++.-...-.+...++.|........... ....-.-.++--.|++.+|+.+.+ + |-+ +.+.. 
T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~-~-p~~------~~~~~ 72 (386) T protein:vir:48 1 MPIFNITNLATESPPISQGGFFDITDPDFLSTLNGSEWVSAESALRNSDLFSIINQLSNDLAT-V-KLT------ASRKQ 72 (386) T ss_pred CcccccccccccccccccccccccccchhcccccCCceechhhhhcchHHHHHHHHHHHhhcc-C-cee------eccch Confidence 23444444433322222112222222222111110000 000011234444566767666644 2 311 11100 Q ss_pred hhhhccChhHHHHHHHHHHHHHHHHHHHHHhcc----ChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEE Q lcl|NC_011045. 93 AKQLLSDPDGLAKVDEGLSMVERIIMNYIESNS----YRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVV 168 (536) Q Consensus 93 ~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~sn----f~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v 168 (536) .. ..+.+-| .+.-+...+.++..+|||.+++..+..+.++.+..+|...+.+ T Consensus 73 -------------~~-----------~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v 128 (386) T protein:vir:48 73 -------------LQ-----------GIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSF 128 (386) T ss_pred -------------hH-----------HHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEE Confidence 01 1122333 3344556677888999999999887777777777777777777 Q ss_pred eeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccccccccccccc Q lcl|NC_011045. 169 QRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEA 248 (536) Q Consensus 169 ~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~ 248 (536) .++.+|.. ++.++ .+++..... ... +.. T Consensus 129 ~~~~~~~~--~~y~~-----------------------------------------------~~~~~~~~~-~~~--~~~ 156 (386) T protein:vir:48 129 NRLDNKDG--IYYNI-----------------------------------------------TFDDPRIPP-KQH--VPQ 156 (386) T ss_pred EEcCCCce--EEEEE-----------------------------------------------EecCccccc-eeE--ecC Confidence 76654421 11010 111110000 000 011 Q ss_pred CceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccC---------CCcce Q lcl|NC_011045. 
249 CPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKA---------QTGDF 319 (536) Q Consensus 249 ~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~---------~~g~~ 319 (536) --+++.|....++..||.||..-+...+.....+.+.......-...|..++..++.++.+....- ..|.+ T Consensus 157 ~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~e~~~~~~~~~~~~~~n~g~~ 236 (386) T protein:vir:48 157 GDVLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQGGP 236 (386) T ss_pred ccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHhhcCCCCc Confidence 124556666667789999999999999999999999999999998999988877766655432210 01111 Q ss_pred ecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHH Q lcl|NC_011045. 320 VTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQ 397 (536) Q Consensus 320 ~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l 397 (536) +. -.++....++.. +.+.+ ..+..+..++.|-.+|-... +....+..-+++|- ...-....|.|.+..+.++|- T Consensus 237 ~v-l~~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~e~~--~~~~~~~~l~P~~~~ie~~l~ 312 (386) T protein:vir:48 237 LV-LDDLEEFTPLEIKSNVSQ-LLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEM--SLDLYNKAVSRYLRPFLSELS 312 (386) T ss_pred ee-cCCCceEEEcCCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHH--HHHHHHHHHHHHHHHHHHHHH Confidence 11 112233344442 23344 34555666778888874332 11111212123322 112344567788888777764 Q ss_pred HHHHHHHHHHHHhcCCCCCCCCc----------ceEEEEechHHHHHHHHHHHHH----HHHHHHH--Hhh-cchhhhhc Q lcl|NC_011045. 398 LPLVRVLLKQLQATQQIPELPKE----------AVEPTISTGLEAIGRGQDLDKL----ERCVAAW--AAL-APMRDDPD 460 (536) Q Consensus 398 ~Pli~r~~~il~~~g~lp~~~~~----------~v~v~~vs~La~a~r~~~~~~l----~~~~~~~--~~~-~p~~~~~~ 460 (536) .-|+.++ .....+.+..+ .++-.+.|+-..-... +...+ ....... ..+ +.+.-... 
T Consensus 313 ~~l~~~~-----~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l-g~~~~~~~~~~~~~~~~~~~~~gGd~~~~~ 386 (386) T protein:vir:48 313 QKLSCDV-----DADILPAVDPTGSNSVSRINSMVKSGTLAQNQGLYIL-QQAEILPKELPEGENPNKTTLKGGEINGED 386 (386) T ss_pred Hhhcchh-----hcchhhhhccChHHHHHHHHHHHhCCCcCHHHHHHHh-hcCCCCCccchhhcCCCCCccCCCCCCCCC Confidence 4332211 00000000000 0111111111110000 00000 0000000 001 11111111 No 144 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=95.71 E-value=0.0016 Score=35.93 Aligned_cols=444 Identities=9% Similarity=0.005 Sum_probs=176.6 Q ss_pred CCCcccc--ccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccccccccc--ccchHHHHHHHHHHHHHHh Q lcl|NC_011045. 1 MAEKRTG--LAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTP--WQAVGARGLNNLASKLMLA 76 (536) Q Consensus 1 Ma~~~~~--~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~--~dst~~~a~~~Laa~l~~~ 76 (536) ||++... -..+.+ ..+..-+.+.. .-.-.+.|+.|..++.- .+..+ ....+..+|++.|-.++ T Consensus 69 ~a~ds~~~~~~~~~~----~~~~~~~~~~~-~~~~~~~~~~~~~f~gy------ql~alY~~~~l~rkiVd~pAeDa~-- 135 (765) T protein:vir:96 69 VAMDSAYGDGPTPAA----KAAAGGQNPYV-VPTMLQDWYNSQGFIGY------QACAIISQHWLVDKACSMSGEDAA-- 135 (765) T ss_pred eeccccccccccchH----HHhhhccCccc-hhhHHHhhhcccCCccH------HHHHHHHhCchhhhhhhcchHHhh-- Confidence 5553321 111222 11111111100 00112334434332221 01111 12233334444443332 Q ss_pred hcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCcee Q lcl|NC_011045. 77 LFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYN 156 (536) Q Consensus 77 ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~ 156 (536) +.|+.+...+.+... +..++|. +.+.+-++...+.++++..-.||.+++++.-+... .. 
T Consensus 136 ----R~g~~I~~~~~e~~~---------~~~~~l~-------~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~~~D-~~ 194 (765) T protein:vir:96 136 ----RNGWELKSDGRKLSD---------EQSALIA-------RRDMEFRVKDNLVELNRFKNVFGVRIALFVVESDD-PD 194 (765) T ss_pred ----cCCceeecCccccCH---------HHHHHHH-------HHHHHhhHHHHHHHHHHHhhhceeeEEEEEecccC-cc Confidence 479998764332211 1222232 33334578999999999999999988766432110 00 Q ss_pred eEEEEecceEEEeeCCCCCeEEE--EEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecC Q lcl|NC_011045. 157 PMKLYRLSSYVVQRDAFGNVLQM--VTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEG 234 (536) Q Consensus 157 ~~~~~~l~~~~v~~d~~G~v~~i--~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g 234 (536) .+ .-||..-.|. .|.+..| +.++..+. .++.+.-.+..+ .+. -..+.|. |.| T Consensus 195 ~l-~~PL~~~~I~---kg~~kgl~vldp~~~~~-~~v~e~~~Dp~s----p~f-g~P~~y~----------------i~g 248 (765) T protein:vir:96 195 YY-EKPFNPDGIA---PGSYKGISQIDPYWAMP-QLTAESTADPSA----EHF-YEPDFWI----------------ISG 248 (765) T ss_pred hh-hccccccccc---cceeeEEEEechhhccc-ccchhccccccc----ccc-Ccceeee----------------ecC Confidence 01 1233111111 1222111 11111111 001111111100 000 1112221 122 Q ss_pred ccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccc-------cc Q lcl|NC_011045. 235 MEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGI-------TQ 307 (536) Q Consensus 235 ~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~-------~~ 307 (536) +.|+... ...|...|. +-+.+....-||+|..+.++..++.....+....+.+.++.-..+-++.... .. 
T Consensus 249 ~~IH~SR-li~~~g~~l--pd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~~l~~~~~l~~ 325 (765) T protein:vir:96 249 KKYHRSH-LVVVRGPQP--PDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTSTIHVDVEKAIANEDAFNA 325 (765) T ss_pred ceeccce-EEEecCCCc--hhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeeechHhhhccHHHHHH Confidence 2222111 111222231 2245555556799999999999999998888888888777766665543322 11 Q ss_pred hhhhcc---CCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhh--hhcccCCCCCC--CHH-HHHHHHH Q lcl|NC_011045. 308 PRRLTK---AQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML--NSAVQRTGERV--TAE-EIRYVAS 379 (536) Q Consensus 308 ~~~~~~---~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~--~~~~~~~~~r~--TAt-Ei~~r~~ 379 (536) ...... +..|.++-+..++...+.. +|.-+...+....+.|.-+.=. .-+...+...+ |.+ ++ T Consensus 326 r~~~~~~~r~n~g~~~id~ee~~e~~s~----~lsgl~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~----- 396 (765) T protein:vir:96 326 RLAFWIANRDNHGVKVIGIDETMEQFDT----NLSDFDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHET----- 396 (765) T ss_pred HHHHHHHhcCCceeEEecCCcceeEEec----ccCCHHHHHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHH----- Confidence 111111 1123344444444443322 3445556666667777655411 11221222222 333 32 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEech--HHHHHHHHHHHHHHHHHHHHHhhcchhh Q lcl|NC_011045. 
380 ELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTG--LEAIGRGQDLDKLERCVAAWAALAPMRD 457 (536) Q Consensus 380 E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~--La~a~r~~~~~~l~~~~~~~~~~~p~~~ 457 (536) +..---+..++...+.|++++++.++.+.+.+|+ +++++|-.- +....|+.-..+..+..+.+.+.+ T Consensus 397 ---~nYyD~I~s~Qe~~l~p~le~L~~li~~s~~i~~----d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~G---- 465 (765) T protein:vir:96 397 ---ISYHEELESIQEHIFDPLLERHYLLLAKSESIDV----QLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSG---- 465 (765) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC----cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcC---- Confidence 2233345556777899999999999999876542 577776533 333333333333333333332221 Q ss_pred hhcCCHHHHHHHHHHH--cCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCCC Q lcl|NC_011045. 458 DPDINLAMIKLRIANA--IGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPG 535 (536) Q Consensus 458 ~~~id~d~~~~~~a~~--~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~ 535 (536) .|+.+++-+.+... +|. ..+-.++.|......-... .+........+......... ..+...+.|-||. T Consensus 466 --vis~dEvR~~L~~~~~~g~--~~l~d~~~e~~~~~~pe~~-~~~~~~~~~~~~~~~e~~~~----~a~p~~~eg~~~~ 536 (765) T protein:vir:96 466 --VVSPDEVRERLRDDPRSGY--NRLTDDQAETEPGMSPENL-AELEKAGAQSAKAKGEAERA----EAQAGAVEGAGDP 536 (765) T ss_pred --CCCHHHHHHHHhccccCCC--CCCCccccccccCCCcccc-ccccCCCcccccccCccccc----cCCCCccCCCCcc Confidence 36777777766542 121 1111111111000000000 00000000000000000000 0000001111111 Q ss_pred C Q lcl|NC_011045. 536 I 536 (536) Q Consensus 536 ~ 536 (536) . 
T Consensus 537 ~ 537 (765) T protein:vir:96 537 V 537 (765) T ss_pred c Confidence 1 No 145 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=95.62 E-value=0.0017 Score=35.72 Aligned_cols=442 Identities=10% Similarity=0.072 Sum_probs=159.0 Q ss_pred CCC------cccc---------cc---HHHHHHHHHHHHHHhhhH--HHHHHHHHHHhcccccCCCCCccccccc----- Q lcl|NC_011045. 1 MAE------KRTG---------LA---EEGAKSVYERLKNDRAPY--ETRAQNCAQYTIPSLFPKDSDNASTDYV----- 55 (536) Q Consensus 1 Ma~------~~~~---------~~---~~~~~~r~~~l~~~R~~~--e~~w~e~~~~~~P~~~~~~~~~~~~~~~----- 55 (536) |-- +|.. .+ .+.+..+|..++..=... .....+ -.|+-|..+.-.++.+-...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~a~~~p~~~~~~~~~~~~~~p~~~~~ 79 (576) T protein:vir:96 1 MVTRLADIFKRLRLGRDYEDIIDTVPIDDGLQANIRNIEEKSKELNKSLYGKQ-QAYAEPFLEVMDTNPEFRTKRSYMKN 79 (576) T ss_pred ChhhHHHHHHHHhccCccccchhhhhcccChhHHHHHhhhhhhhhccccCCcc-chhhcceeeeeecCCCccccCcchhh Confidence 110 0000 00 112333444443210000 001112 123344322112222211111 Q ss_pred --------ccc--cchHHHHHHHHHHHHHHhhcCC-----CcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 56 --------TPW--QAVGARGLNNLASKLMLALFPM-----QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNY 120 (536) Q Consensus 56 --------~~~--dst~~~a~~~Laa~l~~~ltP~-----~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~ 120 (536) +.| +.+.-.|++.+|..+...-.|. ..-|.+...+....... . +.. -+..+++.+... T Consensus 80 ~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~---~---~~~-~~~~l~~~l~~~ 152 (576) T protein:vir:96 80 SDNLHDVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGK---K---EKE-EIKRIENFILNT 152 (576) T ss_pred hhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccch---h---hhH-hhhhHHhhHhhc Confidence 111 1123356666666554322232 12223222222111100 0 111 112222222222 Q ss_pred HH-----hccChHHHHHHHHHHHhhCcEEEEEecCC--CCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHH Q lcl|NC_011045. 
121 IE-----SNSYRVTLFEALKQLVVAGNVLLYLPEPE--GSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPED 193 (536) Q Consensus 121 l~-----~snf~~~~~~~~~dl~~~G~~~l~~~~~~--~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~ 193 (536) +. ..+|+.-+..++.|+.++|||.+|+.... .+.++.+..++...+.+..+.+|.+-.-..++ T Consensus 153 ~~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg~~~~~~~~~---------- 222 (576) T protein:vir:96 153 GRDKDIDRDSFQSFCRKIVRDTYTYDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNGKIIKGGKRF---------- 222 (576) T ss_pred cCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCCceeeeeeEE---------- Confidence 21 13566677888999999999999876433 33455555566666777777766432211111 Q ss_pred HhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccccCceEEEeeeecC---CCccccchHH Q lcl|NC_011045. 194 IRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLD---GESYGRSYIE 270 (536) Q Consensus 194 ~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~---ge~YGrgp~~ 270 (536) ++.++|........ ++ .+.++..... ...||.+|.. T Consensus 223 -----------------------------------~~~~~~~~~~~~~~----~d--ii~~~~~~~~d~~~~~~G~Spi~ 261 (576) T protein:vir:96 223 -----------------------------------VQVINKKVVASFTS----RE--MAMGIRNPRTELSSSGYGLSEVE 261 (576) T ss_pred -----------------------------------EEecCCceEEEecc----cc--eEEEeecCCCCcccCcccccHHH Confidence 11111111110000 01 1222222222 2469999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCceeec--cccccchhhh-------cc--CC---CcceecCCccccccccccc-c Q lcl|NC_011045. 271 EYLGDLRSLENLQEAIVKMSMISSKVIGLVN--PAGITQPRRL-------TK--AQ---TGDFVTGRPEDISFLQLEK-Q 335 (536) Q Consensus 271 ~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~--~~g~~~~~~~-------~~--~~---~g~~~~g~~~~~~~~~~~~-~ 335 (536) .+...+.......+.......-...|..++. .+...+.+.. .. .+ .|.+.....+++...++.. 
+ T Consensus 262 ~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~ls~~~ 341 (576) T protein:vir:96 262 IAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNMTPTA 341 (576) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEeccCCh Confidence 9999998888888888888887788875543 3333333221 11 11 1111112233344445443 3 Q ss_pred cchhHHHHHHHHHHHHHHHHHhhhhccc--CCCC---------CCCHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHH Q lcl|NC_011045. 336 ADFTVAKAVSDAIEARLSFAFMLNSAVQ--RTGE---------RVTAEEIRYVA-SELEDTLGGVYSILSQELQLPLVRV 403 (536) Q Consensus 336 ~~~~~~~~~i~~~~~rI~~af~~~~~~~--~~~~---------r~TAtEi~~r~-~E~~~~LG~v~~rl~~E~l~Pli~r 403 (536) .+.+ ..+..+.....|-++|-...... .+.. .+|=.-+.+.. .=....|.|.+.+++.+|-.-|+ T Consensus 342 ~d~q-fle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll-- 418 (576) T protein:vir:96 342 NDMQ-FEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINTHII-- 418 (576) T ss_pred hhHH-HHHHHHHhHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhc-- Confidence 3444 34555667788888885432111 1111 11111111111 12223466666666666544332 Q ss_pred HHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccC Q lcl|NC_011045. 404 LLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILL 483 (536) Q Consensus 404 ~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~r 483 (536) |.. +..+.++|...-.. ....+.....++.. -.+..++ +-+.+|.+|.. 
T Consensus 419 -----------~~~-~~~~~~~f~r~d~~--------~~~e~~~~~~~~~~----G~lT~NE----~R~~~gl~pie--- 467 (576) T protein:vir:96 419 -----------SEY-SDKYVFQFVGGDTK--------SELDKIKILQEEVK----TYKTVNE----ARKEKGLKPIE--- 467 (576) T ss_pred -----------hhc-cCceEEEeccCCHH--------HHHHHHHHHHHHhc----CccCHHH----HHHHhCCCCCC--- Confidence 222 23456666433111 11111111111100 0122222 22234444321 Q ss_pred CHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHh-hhc-C---c--------chHHhhhhcCCCCCCC Q lcl|NC_011045. 484 TEEQK------QQKMAQQSMQMGMDNGAAALAQGMAAQ-ATA-S---P--------EAMAAAADSVGLQPGI 536 (536) Q Consensus 484 s~~ev------~~~~~q~~~q~~~~~~a~~~~~~~~~~-~~~-~---~--------~~~~~~~~~~~~q~~~ 536 (536) .-|+. ...-+..+ ..+...+..+.......+ ... . | ..+....+..+.-+-+ T Consensus 468 gGD~~~~~~~~~~~~~~~~-~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~g~~~~~~~~~~~~~ 538 (576) T protein:vir:96 468 GGDVLLDGSFIQSMSLNTQ-KEQYEDTKQKERFDMIQQFLNSPDDEEPQQESTEDKVDGRESNDPTKIDSPV 538 (576) T ss_pred Ccceecccccccccccccc-CCCCCCccccccccccccccCCCCCCCCCCCCCCCcccccccccCCCCCCcc Confidence 00100 00000000 000000000000000000 000 0 0 0000000110000001 No 146 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=95.49 E-value=0.002 Score=35.41 Aligned_cols=364 Identities=12% Similarity=0.056 Sum_probs=140.2 Q ss_pred HHHHHHHHHhhhHHH--HHHHHHHHhcccccCCCCCccccc--ccc-cccchHHHHHHHHHHHHHHhhcCCCcceeccCC Q lcl|NC_011045. 15 SVYERLKNDRAPYET--RAQNCAQYTIPSLFPKDSDNASTD--YVT-PWQAVGARGLNNLASKLMLALFPMQTWMRLTIS 89 (536) Q Consensus 15 ~r~~~l~~~R~~~e~--~w~e~~~~~~P~~~~~~~~~~~~~--~~~-~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~ 89 (536) ..+..+...| +++. .|..+ +.........+... ..+ +-.++--.|++.+|+.+.+ | ||--.... 
T Consensus 1 m~~~~~~~~~-~~~~~~~~~~~-----~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~-l----p~~~~~~~ 69 (419) T protein:vir:57 1 MFIPQFWKGR-PSENRVNWQVV-----PGGMRSSSSQAGVIITPETALALSAVRACVTLLAESVAQ-L----PCVLYRRT 69 (419) T ss_pred CcchhhhccC-Ccccccccccc-----ccccccccccCCceechHHhhccHHHHHHHHHHHHhhcc-C----ceEEEEEc Confidence 2333332222 2221 12110 00000000000000 011 1223334455666665533 2 33211111 Q ss_pred hhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-h----ccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecc Q lcl|NC_011045. 90 EYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-S----NSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLS 164 (536) Q Consensus 90 d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~----snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~ 164 (536) +..-.+ .+ .+.-+...|. + .+.+.-....+.++..+||+++++..+..+.++.+..++.. T Consensus 70 ~~g~~~---------~~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~ 134 (419) T protein:vir:57 70 ENGGRE---------IA------FDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPH 134 (419) T ss_pred CCCcee---------cc------ccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCc Confidence 110000 00 0111222232 1 24455567778899999999999987776666666666667 Q ss_pred eEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccccccccc Q lcl|NC_011045. 165 SYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTY 244 (536) Q Consensus 165 ~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~ 244 (536) .+.+..+.+|.+ |. .+. +. |..+. . T Consensus 135 ~v~v~~~~~g~~---~y--~~~-----------------------------------~~----------~~~~~-~---- 159 (419) T protein:vir:57 135 KVIVLKGPDGMP---YY--DIP-----------------------------------SI----------GEILP-M---- 159 (419) T ss_pred ceEEEECCCceE---EE--EEc-----------------------------------CC----------ceEEc-h---- Confidence 777766665532 10 000 00 00000 0 Q ss_pred ccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccc----ccchhh---hc----c Q lcl|NC_011045. 
245 PKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAG----ITQPRR---LT----K 313 (536) Q Consensus 245 ~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g----~~~~~~---~~----~ 313 (536) ++ +++.|....+| .||.||...+...+.....+.+.......-...|..++.-.+ ..+.+. +. . T Consensus 160 --~~--vih~r~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~ 234 (419) T protein:vir:57 160 --RM--VHHIKSFSLDG-YIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTE 234 (419) T ss_pred --hh--EEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHH Confidence 00 23333332344 899999999999999999999999998888888987764322 112111 11 0 Q ss_pred --CC--C-cceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-ccc--CCCCCCCHHHHHHHHHHHHHH Q lcl|NC_011045. 314 --AQ--T-GDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQ--RTGERVTAEEIRYVASELEDT 384 (536) Q Consensus 314 --~~--~-g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~--~~~~r~TAtEi~~r~~E~~~~ 384 (536) ++ + |.+. .-.++....++.. +.+.+. .+..+..+..|-.+|-... ++. ..+..-++++... T Consensus 235 ~~~g~~nag~~~-vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~-------- 304 (419) T protein:vir:57 235 RYGGVRNAFSVG-MLQEGMTYKQLSQDNEKAQL-LQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEHQGL-------- 304 (419) T ss_pred Hhccccccccce-ecCCCceEEEcCCChhhHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHHHHH-------- Confidence 00 0 1111 1122333334332 334443 3444556677888885432 111 1122223333221 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEE-echHHH---HHHHHHHHHHHH--H--HHHH---Hhhc Q lcl|NC_011045. 385 LGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTI-STGLEA---IGRGQDLDKLER--C--VAAW---AALA 453 (536) Q Consensus 385 LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~-vs~La~---a~r~~~~~~l~~--~--~~~~---~~~~ 453 (536) .+...-|.|++.++-..+.+. ++++-......|+| ++.|-+ ..|..-.+.+.+ + .+.+ -.+. 
T Consensus 305 ------~f~~~~l~P~~~~ie~~l~~~-ll~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~ 377 (419) T protein:vir:57 305 ------QYVIYTMLAILKRHESAMMRD-LLLPSERRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLT 377 (419) T ss_pred ------HHHHHHHHHHHHHHHHHHHhh-ccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 122333445444444433332 22221112333333 112211 111111111111 0 0000 0011 Q ss_pred ch-hhh------hcCCHH-------HHHHHHHHHcCCChhhccCC Q lcl|NC_011045. 454 PM-RDD------PDINLA-------MIKLRIANAIGIDTSGILLT 484 (536) Q Consensus 454 p~-~~~------~~id~d-------~~~~~~a~~~Gv~p~~i~rs 484 (536) |. -.| ..++.. ..-+...+..++ ...|. T Consensus 378 p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~ 419 (419) T protein:vir:57 378 PIPGGDKYLTPLNMVDSKALTGIGKATPQQLKDIEAI---LCTRN 419 (419) T ss_pred CCCCcCeeeeccccccccccccccCCCcccCcchhhh---hhccC Confidence 10 000 000000 000011111111 01111 No 147 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=95.41 E-value=0.0021 Score=35.24 Aligned_cols=443 Identities=13% Similarity=0.048 Sum_probs=192.9 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCC-c-------ccccccccccchHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSD-N-------ASTDYVTPWQAVGARGLNNLASK 72 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~-~-------~~~~~~~~~dst~~~a~~~Laa~ 72 (536) =+-.+.....+.....|+.....|. -..|. ..|.....+.. . ...+.--..++.+..+++.+++. 
T Consensus 17 ~~~~~~~~~~~~~~~~y~aa~~~r~--~~~w~-----~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~~n 89 (505) T protein:vir:96 17 WAWYRYVEPQKNAARAFEAARRDRL--GKAWL-----RRASRLSADEEIYADLASLVQRAREQSINNPYAKRFYQLLKNN 89 (505) T ss_pred hhhhhhHHHHHHhhhhcccccCCCc--ccccc-----CCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH Confidence 0000111112233344554443331 11121 11111111100 0 00011112577899999999999 Q ss_pred HHH--hhcC-CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEec Q lcl|NC_011045. 73 LML--ALFP-MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPE 149 (536) Q Consensus 73 l~~--~ltP-~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~ 149 (536) +++ +|+| +++.......++++++.. ...-+.|.+.. -.++=.+.+||.....++...++-|-+++-... T Consensus 90 vVG~~Gi~~~~~~~~~~~~~~~~~~~~i-----e~~w~~Wa~~~---~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~ 161 (505) T protein:vir:96 90 VIGPKGMTFQSRVKRRNGKPDDRANTLI-----EGNWQQWIKKG---NCDVTGRYHFVTLLHLWMETLARDGEVLVREHR 161 (505) T ss_pred hcCCCcceeeecCCcccccccHHHHHHH-----HHHHHHhcCCc---CcceeccCCHHHHHHHHHHHHhhCCceEEEEee Confidence 996 7888 456444433344333211 12233443311 012223468999999999999998887653222 Q ss_pred CCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEE Q lcl|NC_011045. 150 PEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRY 229 (536) Q Consensus 150 ~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~ 229 (536) ..+ +.+.+++ +.+..+.|... ...... 
..-.|+..|+.+..|....+| T Consensus 162 ~~~-~~~~~~l----------------------qliepd~l~~~--------~n~~~~-~~~~i~~GIe~d~~Gr~~aY~ 209 (505) T protein:vir:96 162 GYP-NKWGYAL----------------------QILECDRLDLN--------YNADLQ-NGNRIRMSIELDAWERPVAYH 209 (505) T ss_pred cCC-CCcceEE----------------------EEechhhcCCC--------CCcccC-CcCeEEeceEECCCCceEEEE Confidence 111 1111111 11111111110 000000 111356677666554332222 Q ss_pred EEe--cCccccccc-cccccccCc--eEEEee-eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccc Q lcl|NC_011045. 230 EEV--EGMEVQGSD-GTYPKEACP--YIPIRM-VRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPA 303 (536) Q Consensus 230 ~~v--~g~~i~~~~-~~~~~~~~P--~~~~rw-~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~ 303 (536) -.- .|....... ....+...| -+..-| ...+|..=|.+..-.+|..++.|.....+.+.++..++...+.+..+ T Consensus 210 i~~~hPgd~~~~~~~~~~~~~rvpa~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~ 289 (505) T protein:vir:96 210 LLVNHPGDNSYCYHYAGQTYERVPADEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQD 289 (505) T ss_pred EeecCCCccccccccccccccccCHhHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecC Confidence 111 121111000 001122334 122223 34578888999999999999999999999999999999988777533 Q ss_pred c--ccchh------hhccCCCcceecCCcc-ccccccccc-ccchhHHHHHHHHHHHHHHHHHh--hhhcccCCCCCCCH Q lcl|NC_011045. 304 G--ITQPR------RLTKAQTGDFVTGRPE-DISFLQLEK-QADFTVAKAVSDAIEARLSFAFM--LNSAVQRTGERVTA 371 (536) Q Consensus 304 g--~~~~~------~~~~~~~g~~~~g~~~-~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~--~~~~~~~~~~r~TA 371 (536) . ...+. .....++|.|..-.++ ++.+..-.. .++|. .....+...|..++= +..+. 
.|-..++= T Consensus 290 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~---~f~~~~lr~iaaglgi~ye~lt-~D~s~~nY 365 (505) T protein:vir:96 290 PEAYDQPPEDDQGEIVEEVEAGTYQLLPYGIRFKEHKIDHPHTNFG---AFVKSSLRGVAAGMGPAYNRLA-HDLEGVNF 365 (505) T ss_pred CccCCCccccccCccccccCCceeeecCCCCeeeeeCCCCCCCCHH---HHHHHHHHHHHhhcCCCHHHHh-cccccccH Confidence 1 11110 1222345555443443 233332221 23332 222223333333321 12222 34445565 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCc---ceEEEEechHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 372 EEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKE---AVEPTISTGLEAIGRGQDLDKLERCVAA 448 (536) Q Consensus 372 tEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~---~v~v~~vs~La~a~r~~~~~~l~~~~~~ 448 (536) .-+++-..|..+.+-..=..+..-|+.|+..+++..+...|.+|-+... .++++++.| T Consensus 366 SS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p------------------- 426 (505) T protein:vir:96 366 SSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPR------------------- 426 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccC------------------- Confidence 6666666666666665556677788999999999999999998744321 123333322 Q ss_pred HHhhcchhhhhcCCHHH----HHH----------HHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 449 WAALAPMRDDPDINLAM----IKL----------RIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAA 514 (536) Q Consensus 449 ~~~~~p~~~~~~id~d~----~~~----------~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~ 514 (536) .. ..||+-+ .+. .++...|.||. |+-+.++.+.+.++...-.... ..... T Consensus 427 ------~~--~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~D~~-------~v~~q~a~e~~~~~~~Gl~~~~-~~~~~ 490 (505) T protein:vir:96 427 ------GW--DWVDPAKDSKAHSESIKNRTRSRSSIIRAAGDDPE-------DVFDEIAWEEQLMRDKGVNPTP-PEQES 490 (505) T ss_pred ------Cc--cccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHH-------HHHHHHHHHHHHHHHcCCCCCC-CCCCC Confidence 11 1122211 111 12222344432 2222222222111111100000 00000 Q ss_pred hhhcCcchHHhhhhc Q lcl|NC_011045. 
515 QATASPEAMAAAADS 529 (536) Q Consensus 515 ~~~~~~~~~~~~~~~ 529 (536) ..+..++....++|. T Consensus 491 ~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 491 KDATTDEEDDSASDD 505 (505) T ss_pred CCCCCCCCCCCCCCC Confidence 000111111111111 No 148 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=95.21 E-value=0.0025 Score=34.82 Aligned_cols=258 Identities=10% Similarity=0.035 Sum_probs=121.5 Q ss_pred HHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-h----ccChHHHHHHHHHHHhhCcEEEEEe Q lcl|NC_011045. 74 MLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-S----NSYRVTLFEALKQLVVAGNVLLYLP 148 (536) Q Consensus 74 ~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~----snf~~~~~~~~~dl~~~G~~~l~~~ 148 (536) ++.+ ||--.. . +.. .+.. +...|. + .+.+.=+...+.++..+|||++++. T Consensus 1 ia~l----~~~~~~-~---------~~~----~~~~-------l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~ 55 (278) T protein:vir:78 1 MASL----PLKMYE-D---------YKV----VNTE-------VSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIE 55 (278) T ss_pred Cccc----eeEEEe-c---------Ccc----cccH-------HHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEE Confidence 1111 221110 0 000 1111 112222 1 2455567788889999999999988 Q ss_pred cCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeE Q lcl|NC_011045. 149 EPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIR 228 (536) Q Consensus 149 ~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~ 228 (536) .+..+.++.+..++...+-+..+.+|.. ++..+. . 
T Consensus 56 r~~~G~~~~l~~l~~~~v~v~~~~~~~~--~~y~~~-----------------------------------~-------- 90 (278) T protein:vir:78 56 RDIYHQPSKLFLLNPDVVEMLIENQSRE--LYYSIH-----------------------------------A-------- 90 (278) T ss_pred ECCCCcEEEEEEECCceeEEEEcCCCce--EEEEEE-----------------------------------c-------- Confidence 7666666666666666666666555432 111110 0 Q ss_pred EEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccch Q lcl|NC_011045. 229 YEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQP 308 (536) Q Consensus 229 ~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~ 308 (536) ..|..+ . +...-.+..|.....+..||.||...+...+...+...+..+.... ..|.+++..++..+. T Consensus 91 ---~~g~~~-----~--~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~--~~~~~i~~~~~~l~~ 158 (278) T protein:vir:78 91 ---ATGNKL-----I--VHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQ--KPDSFMLKYGSNVGK 158 (278) T ss_pred ---CCceEE-----E--EccccEEEECCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHhc--CCCcEEEEeCCCCCH Confidence 011100 0 0011134445444456689999999999988888877766543333 345566666655544 Q ss_pred hhhcc---------CCCcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-ccc--CCCCCCCHHHHH Q lcl|NC_011045. 309 RRLTK---------AQTGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQ--RTGERVTAEEIR 375 (536) Q Consensus 309 ~~~~~---------~~~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~--~~~~r~TAtEi~ 375 (536) +.... ...|.++. -.++....++.. ..+.+ ..+..+...+.|-.+|-... ++. .++..-|++|.. T Consensus 159 e~~~~~~~~~~~~~~~~g~~~v-l~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~ 236 (278) T protein:vir:78 159 EKRQQVLEDFKQYYEENGGILF-QEPGVEIEPLPKKYVSED-IVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELN 236 (278) T ss_pred HHHHHHHHHHHHHhccCCCcee-cCCCceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH Confidence 43221 11222221 122223333332 33444 34555666778888874432 111 122233444422 Q ss_pred HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCC-CcceEEEE-echH Q lcl|NC_011045. 
376 YVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELP-KEAVEPTI-STGL 430 (536) Q Consensus 376 ~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~-~~~v~v~~-vs~L 430 (536) ..+...-+.|++.++-..+.+. ++|+-. .....++| .+.| T Consensus 237 --------------~~~~~~~l~P~~~~i~~~ln~~-L~~~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 237 --------------RFYLQHTLLPIVKQYEEEFNRK-LLTKTDREKIGILNLTLNLI 278 (278) T ss_pred --------------HHHHHHHHHHHHHHHHHHHHhh-cCChhHhcCCceEEEecccC Confidence 1233334666666555554443 454421 11233443 3444 No 149 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=95.05 E-value=0.0029 Score=34.52 Aligned_cols=384 Identities=12% Similarity=0.081 Sum_probs=156.6 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc---cC-CCCCcccc-cccccc-cchHHHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL---FP-KDSDNAST-DYVTPW-QAVGARGLNNLASKLM 74 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~---~~-~~~~~~~~-~~~~~~-dst~~~a~~~Laa~l~ 74 (536) |-+. ..+++.+++.....|... .++ ...|.+ +. ..+..+.. ...... .++--.|++.+|+.+. T Consensus 1 ~~~~--------~~~~~~~~~~~~~~~~g~--~~s-~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia 69 (437) T protein:vir:10 1 MKQG--------KQRALGRIKSSFLKWLGV--PIS-LTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIA 69 (437) T ss_pred CCcc--------hhhhhhhhHHhhhhhcCC--ccc-CCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHh Confidence 5443 335556666654444321 000 000111 00 00000000 001111 2233346666666553 Q ss_pred HhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----cChHHHHHHHHHHHhhCcEEEEEec Q lcl|NC_011045. 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SN----SYRVTLFEALKQLVVAGNVLLYLPE 149 (536) Q Consensus 75 ~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~~~ 149 (536) + + ||.-....+..-.+. + .+..+...|. +- +.+.=.+..+.++..+||+.+++.. 
T Consensus 70 ~-l----p~~~~~~~~~g~~~~---------~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r 129 (437) T protein:vir:10 70 T-L----PLNLYQTKPDGTRVL---------A------KQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLR 129 (437) T ss_pred h-C----ceeEEEEcCCCceee---------c------cccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 3 2 343222111100000 0 0111222232 22 4555667778888999999999877 Q ss_pred CCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEE Q lcl|NC_011045. 150 PEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRY 229 (536) Q Consensus 150 ~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~ 229 (536) +. +.++.+..++...+.+..+.+|.+. |+ | T Consensus 130 ~~-g~~~~L~~l~p~~v~i~~~~~g~~~------------------------------------y~-------------~ 159 (437) T protein:vir:10 130 SA-GVLIGLELMLPQRTTVKRLTSGALQ------------------------------------YT-------------Y 159 (437) T ss_pred cC-CcEEEEEEEcCcceEEEECCCCeEE------------------------------------EE-------------E Confidence 64 5666666666666666555444321 00 1 Q ss_pred EEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchh Q lcl|NC_011045. 230 EEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPR 309 (536) Q Consensus 230 ~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~ 309 (536) +.+.|.... + ...=++++|....+| .||.||...+...+.....+.+.......-...|-.++.-++.++++ T Consensus 160 ~~~~g~~~~-----~--~~~dIih~r~~~~d~-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e 231 (437) T protein:vir:10 160 RNVDGTVST-----L--AEDDVFHVRGFSLDG-LMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQILQKE 231 (437) T ss_pred EecCceEEE-----E--ccccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHH Confidence 111111100 0 001134445443334 89999999999999998899988888888888898887766666655 Q ss_pred hhccCC-------Ccce----ecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCC--CCCHHHHH Q lcl|NC_011045. 
310 RLTKAQ-------TGDF----VTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGE--RVTAEEIR 375 (536) Q Consensus 310 ~~~~~~-------~g~~----~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~--r~TAtEi~ 375 (536) ...... .|.- +.--.++....++.. +.+.+. .+..+..+..|-.+|-......-..+ ..+..-+. T Consensus 232 ~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e 310 (437) T protein:vir:10 232 KRAEIRTDLAEQFGGAMQAGKTMVLEAGMKYQAITMNPGDVQL-LETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIE 310 (437) T ss_pred HHHHHHHHHHHHhcCccccCcceeccCCceEEeccCChhhHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHH Confidence 432110 0100 001112223334332 334443 34445556778888854321111222 22223343 Q ss_pred HHHHHH-HHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcc--eEEEEechHH--HHHHHHHHHHHHHH--H-- Q lcl|NC_011045. 376 YVASEL-EDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEA--VEPTISTGLE--AIGRGQDLDKLERC--V-- 446 (536) Q Consensus 376 ~r~~E~-~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~--v~v~~vs~La--~a~r~~~~~~l~~~--~-- 446 (536) +..... ...|.|.+.++.+| |.+. ++++-.... +++.+.+.+- ...|....+.+... + T Consensus 311 ~~~~~f~~~tl~P~~~~ie~~------------l~~k-ll~~~e~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~ 377 (437) T protein:vir:10 311 QQTLGFLTFTLRPWLTRIEQA------------ARRS-LLRPGERDQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTR 377 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHH------------HHhh-ccCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCH Confidence 333322 22344444444444 3222 333322222 3333322221 11222222222211 1 Q ss_pred HHH---Hhhcchh-hhh-------cCCHHHHHHHHHH--------HcCCChhhccCCHHHH Q lcl|NC_011045. 447 AAW---AALAPMR-DDP-------DINLAMIKLRIAN--------AIGIDTSGILLTEEQK 488 (536) Q Consensus 447 ~~~---~~~~p~~-~~~-------~id~d~~~~~~a~--------~~Gv~p~~i~rs~~ev 488 (536) +.+ -.+.|.. .+. .+..+.+-...-. +... +..=-+.+||. 
T Consensus 378 NE~R~~~gl~pi~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~e~ 437 (437) T protein:vir:10 378 DECRAKENLPPMGGNAAVLTVQSALLPIDKLGEHTTATAAQDALKAWLY-QEEKTRATQER 437 (437) T ss_pred HHHHHHhCCCCCCCCcceEeecCcccchhhccCcCCCcchhccccccCC-CCCCCCccccC Confidence 111 1122310 111 1122221111000 0001 11111223333 No 150 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=95.02 E-value=0.0029 Score=34.47 Aligned_cols=439 Identities=11% Similarity=0.032 Sum_probs=156.5 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHH---HHhcccccCCCCCcc--cccccccccchHHHHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCA---QYTIPSLFPKDSDNA--STDYVTPWQAVGARGLNNLASKLML 75 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~---~~~~P~~~~~~~~~~--~~~~~~~~dst~~~a~~~Laa~l~~ 75 (536) .=+++..+.++++...| +.+..+|+.++-++. -..+|+...++.... -......-+++--.|++.+|+.+.+ T Consensus 64 ~~~~~~~~kk~~i~~pf---kkk~~~~~~d~f~~s~es~s~vtsls~pdaf~~vnVs~~~AlknsaV~scI~~IA~sIAs 140 (945) T protein:vir:10 64 IFRKNQVLKKEKIIVPY---NHQEPPFKFNLFEYSPESLMYLPSISDPDAFFLINLFRKYRFNNDSKLIKVSEIPKKLTS 140 (945) T ss_pred eehhhhHHHhhcccccc---cccccchhhhhhhccCccceecccccCccceeeehhhhhhhhccHHHHHHHHHHHhhhcc Confidence 33344444455555444 334456666443221 011122211111000 0011112234444566666666633 Q ss_pred hhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChH-HHHHHHHHHHhhCcEEEEEecCCCCc Q lcl|NC_011045. 76 ALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRV-TLFEALKQLVVAGNVLLYLPEPEGSN 154 (536) Q Consensus 76 ~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~-~~~~~~~dl~~~G~~~l~~~~~~~~~ 154 (536) . |-+-+=+..-.. .............+..+|.. -........|.. -+...+.|+..+||+.+++..+..+. 
T Consensus 141 L--PlklYrr~edG~--~~~~~kk~~~~hpL~~LL~r----PNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~ 212 (945) T protein:vir:10 141 K--ELEIYKHIEDKH--VNYYLKRIRDARNILEFLER----PDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQGN 212 (945) T ss_pred C--ceEEEEecccCc--ccccccccccchHHHHHHhC----CCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCCc Confidence 2 211000111100 00000000001112222211 001111112222 23456689999999999988776666 Q ss_pred eeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecC Q lcl|NC_011045. 155 YNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEG 234 (536) Q Consensus 155 ~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g 234 (536) ++.+..++...+.+..+.+|.+-..|+ +.++| T Consensus 213 ii~L~pLdPs~Vti~~ddDG~~~y~Yv------------------------------------------------~~idG 244 (945) T protein:vir:10 213 LVAITPVDGTTIKPILSEDTGIVVGYV------------------------------------------------QEVDG 244 (945) T ss_pred EEEEEEECCcceEEEEcCCCcEEEEEE------------------------------------------------EecCC Confidence 666666666666666666654321110 11111 Q ss_pred ccccccccccccccCceEEEeeeecCCCc--cccchHHHHHHHHHHHHHHHHHHHHHHHH-HhCCceeecc--------- Q lcl|NC_011045. 235 MEVQGSDGTYPKEACPYIPIRMVRLDGES--YGRSYIEEYLGDLRSLENLQEAIVKMSMI-SSKVIGLVNP--------- 302 (536) Q Consensus 235 ~~i~~~~~~~~~~~~P~~~~rw~~~~ge~--YGrgp~~~~l~d~~~L~~l~~~~~~~~~~-a~~p~~lv~~--------- 302 (536) ........ ++. ++..|+...+|.. ||.+|.+.+...+.......+........ .+.|..++.- T Consensus 245 ~~~~~v~a----~Dv-Ilhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~d~k 319 (945) T protein:vir:10 245 AIVAHFDK----RDV-VLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGD 319 (945) T ss_pred ceEEEecC----Cce-EEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCccccccc Confidence 11100000 010 2233333334433 79999998888887777777766666543 3556544421 Q ss_pred -ccccchhhh----------ccCCCcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhccc---CCCC Q lcl|NC_011045. 
303 -AGITQPRRL----------TKAQTGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQ---RTGE 367 (536) Q Consensus 303 -~g~~~~~~~----------~~~~~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~---~~~~ 367 (536) .+.++++.. ..+.++....--.++....++.. ..+.+. .+..+..+.+|-++|-...... .++. T Consensus 320 ~~~~LseEq~erlKe~wee~~sG~NnG~piVLdeGmef~pLs~s~~DaQf-LEsrkfs~eeIArAFGVPP~lLG~~e~st 398 (945) T protein:vir:10 320 IYPQLSREQLESIQRQLQAIMMGDYTQVPILSGGKFTWIDFKGKRRDMQF-KELAEFVARKICAVYQVSPQDVGILEGSN 398 (945) T ss_pred cccccCHHHHHHHHHHHHHHhCCcccccceecCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHcccCCCCC Confidence 122222211 11111111101122333444433 335553 3556666778888885432111 1122 Q ss_pred CCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 368 RVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVA 447 (536) Q Consensus 368 r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~ 447 (536) .-++++. ...=....|.|...++++++ .+ .+++...+..+++++.... .. +. .....++. T Consensus 399 ~SNiEqq--~~~Fv~~tL~Pil~~IEqeL------------Nr-kLl~~~eg~~i~fdFd~ld-l~-D~---ksraEal~ 458 (945) T protein:vir:10 399 KATAEVM--ASLTKAKGLEPLMATISKGF------------DE-VVSEFRNEKDIKLWFKEDD-LE-KE---RDWWNIIQ 458 (945) T ss_pred cchHHHH--HHHHHHHHHHHHHHHHHHHH------------HH-hccccccCceeEEEecchh-cc-CH---HHHHHHHH Confidence 2222221 11222234555555555553 22 1222233455677664332 11 11 11111222 Q ss_pred HHHhhcchhhhhcCCHHHHHHHHHHHcCCChhh----ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcC-cch Q lcl|NC_011045. 448 AWAALAPMRDDPDINLAMIKLRIANAIGIDTSG----ILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATAS-PEA 522 (536) Q Consensus 448 ~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~----i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~-~~~ 522 (536) ...+.+ .+.+++ +-+.+|.+|-. ++.+...+.-..+... .+.-.+..+.......+.+.. ++. 
T Consensus 459 kli~sG------iLTiNE----vRe~lGLpPIeGGD~lli~~nn~~P~d~~~k--a~~ga~p~q~aq~~~dqp~~kGGe~ 526 (945) T protein:vir:10 459 GQLNTG------FRSINE----ARMEKGLEPVPWGDVPFSGLRNWKPEDEQAK--AQQGAMPPQLAQAMADQPSQQGGGV 526 (945) T ss_pred HHHhCC------CcCHHH----HHHHhCCCCCCCcceeeeccccccccccccc--cccCCCCcccccCCCCCCCCCCCCC Confidence 111111 123333 33345555521 1100000000000000 000000000000000010000 000 Q ss_pred HHhhhhcCCCC-CCC Q lcl|NC_011045. 523 MAAAADSVGLQ-PGI 536 (536) Q Consensus 523 ~~~~~~~~~~q-~~~ 536 (536) ........+.+ .++ T Consensus 527 dEns~~psE~kda~~ 541 (945) T protein:vir:10 527 DENSSVPSEQKNAGL 541 (945) T ss_pred CCCCCCCCcccchHH Confidence 00000000000 011 No 151 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=94.75 E-value=0.0036 Score=33.98 Aligned_cols=412 Identities=9% Similarity=-0.017 Sum_probs=148.8 Q ss_pred cccc--ccchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHH--HHHHH-HHHhccChH Q lcl|NC_011045. 54 YVTP--WQAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVE--RIIMN-YIESNSYRV 128 (536) Q Consensus 54 ~~~~--~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve--~~~~~-~l~~snf~~ 128 (536) +..+ .+++...|++.+|..+.+. | |- +...+..-.. .........+..+|-..+ ..+.. .+....+.. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~--p---~~-i~~~~~~~~~-~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~ 73 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGF--G---IN-IIPHPEAEDP-DRDGEQYERVWDFWFGDDSNWQVGPMESERATATN 73 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcC--C---eE-EEEccCcccc-cchhhhhhhHHHHhhccCCCccccchhhHhhHHHH Confidence 3333 3566667778888777432 3 21 1110000000 000000111111111110 00000 111234556 Q ss_pred HHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCC Q lcl|NC_011045. 
129 TLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKAD 208 (536) Q Consensus 129 ~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~ 208 (536) -+...+.|+..+|||.+++..+..+.++.+..++.....+..|..+.+...- ... T Consensus 74 ~~~~~~~~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~~~~~~~-------------------------~~~ 128 (467) T protein:vir:31 74 VLQTAWTDYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERGFVQLLE-------------------------EKE 128 (467) T ss_pred HHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecceeEeecC-------------------------Cce Confidence 6778889999999999999887776666555555555555554432111100 000 Q ss_pred ceEEEEEEEEe-cCCCCceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 209 ETIDVYTHIYL-DEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIV 287 (536) Q Consensus 209 ~~~~v~~~v~p-~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~ 287 (536) ..+.++...+. +..+..+..+...+.... +. ...+...=.+++|.....+..||.+|..-++..+.......+... T Consensus 129 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 205 (467) T protein:vir:31 129 KYFGVAGDRYQTNGNGDLDPVFVDADDGST-GT--SVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNI 205 (467) T ss_pred eeEEeccccceeecccceeeeeeeeccccc-cc--eeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 01111110000 111111222222221111 00 111112235677776667889999999999998888777777777 Q ss_pred HHHHHHhCCceeec-cccccchhhhccCC-------Cc------------------ceecCCcc--c--cccccccc--c Q lcl|NC_011045. 288 KMSMISSKVIGLVN-PAGITQPRRLTKAQ-------TG------------------DFVTGRPE--D--ISFLQLEK--Q 335 (536) Q Consensus 288 ~~~~~a~~p~~lv~-~~g~~~~~~~~~~~-------~g------------------~~~~g~~~--~--~~~~~~~~--~ 335 (536) ....-...|.+++. +++..+++...... .| .++++... . +...++.. . 
T Consensus 206 ~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~ 285 (467) T protein:vir:31 206 DFFENDGVPRIAIIVKGAELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGID 285 (467) T ss_pred HHHhccCCCceEEEecCcCCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccCh Confidence 66666777776553 44444443321100 00 01111110 0 11111111 1 Q ss_pred cchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCCCCHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_011045. 336 ADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGERVTAEEIRYVA-SELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQ 413 (536) Q Consensus 336 ~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~r~TAtEi~~r~-~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~ 413 (536) .+.+ ..+........|..+|-... ++......-+++-+.+.. .=....|.|.+.+++++|-.-++.+... T Consensus 286 ~d~q-f~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~------- 357 (467) T protein:vir:31 286 EEAS-FLEFRGRNEHDILKVHDVPPVIAGVVESGAFSTDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLD------- 357 (467) T ss_pred hhHH-HHHHHHHHHHHHHHHhCCCHHHcccCCCCCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhc------- Confidence 1222 23444455667888875432 111111111222221111 2233445565555555543333221110 Q ss_pred CCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHH--- Q lcl|NC_011045. 414 IPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQ--- 490 (536) Q Consensus 414 lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~--- 490 (536) ..+..+++.+...+ ..-....++.+...++ .+ .+..+++ -+.+|.+|- . ++++-. T Consensus 358 ---~~~~~i~f~~~~l~-~~d~~~~~~~~~~~~~----~G------~~T~NE~----R~~~Gl~pi---~-d~~~~~~~~ 415 (467) T protein:vir:31 358 ---APDWTIEFELAKPD-TKLQDVEIASQRVQAM----QG------LLTVNEL----RDEFGFEPF---P-EEHVYGGET 415 (467) T ss_pred ---cCCceEEEecchhh-ccCHHHHHHHHHHHHh----CC------CcCHHHH----HHHhCCCCC---C-cccccCCcc Confidence 11223444433333 2111111111111111 00 1122222 222344331 1 111000 Q ss_pred HHHHHH------------HHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCC Q lcl|NC_011045. 
491 KMAQQS------------MQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGL 532 (536) Q Consensus 491 ~~~q~~------------~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (536) ...... ..++....+.... ..+.......+.+.. .+++.- T Consensus 416 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~-~~~~~~ 467 (467) T protein:vir:31 416 LVAEVTGGSGPGGGIGDQIEQLVEDRADEII-DSYQADLETEQLIEI-GANADS 467 (467) T ss_pred cccccccccCCCCcccCcCCCCCCCcccchH-hhhhhccccchhhhh-ccccCC Confidence 000000 0000000000000 000000000000000 000000 No 152 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=94.72 E-value=0.0037 Score=33.94 Aligned_cols=415 Identities=12% Similarity=0.030 Sum_probs=154.8 Q ss_pred HHHHHHHHHHhhh-----HHH-HHH--HHHHHhcccccCCCCCccccc-ccccc-cchHHHHHHHHHHHHHHhhcCCCcc Q lcl|NC_011045. 14 KSVYERLKNDRAP-----YET-RAQ--NCAQYTIPSLFPKDSDNASTD-YVTPW-QAVGARGLNNLASKLMLALFPMQTW 83 (536) Q Consensus 14 ~~r~~~l~~~R~~-----~e~-~w~--e~~~~~~P~~~~~~~~~~~~~-~~~~~-dst~~~a~~~Laa~l~~~ltP~~~W 83 (536) -..|+.|+..-.. .+. .|. +-+-+. .+... ..+..- ..... .++--.|++.+|+.+.+ + |=. . T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~---~~~~~-~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~-l-p~~-~ 73 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYN---LGAVA-ASGETVTPHDALQVSAVFASVRLLSETIAT-L-PLS-T 73 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHh---hcccc-cCCceechHHhhccHHHHHHHHHHHHhhcc-C-ceE-E Confidence 1223333221110 000 010 000000 00000 000000 00111 23334466666666533 2 211 2 Q ss_pred eeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhc----cChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEE Q lcl|NC_011045. 84 MRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESN----SYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMK 159 (536) Q Consensus 84 f~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~ 159 (536) ++-..... . ++ ....++..++.. 
+.+.-+..++.++..+|||.+++..+ ++.++.+. T Consensus 74 ~~~~~~~~--~----------~~------~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~ 134 (457) T protein:vir:13 74 YSKRGGSR--K----------EI------VTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLD 134 (457) T ss_pred EEecCCcc--c----------cc------ccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEE Confidence 22111100 0 01 111223333332 34456777788888999999998544 45555554 Q ss_pred EEecceEEEeeCCCCCe-EEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccc Q lcl|NC_011045. 160 LYRLSSYVVQRDAFGNV-LQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQ 238 (536) Q Consensus 160 ~~~l~~~~v~~d~~G~v-~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~ 238 (536) .++...+.+..+..+.. ..+|+.+.++ ..|.... T Consensus 135 ~l~p~~v~v~~~~~~~~~~~~~~~y~~~---------------------------------------------~~~~~~~ 169 (457) T protein:vir:13 135 VLDPTKIHVHMVMVDGLRRKVFEAYDID---------------------------------------------ADGNEVL 169 (457) T ss_pred EEccCceEEEEecCCCccceeEEEEEEe---------------------------------------------cCCceee Confidence 44445555544433211 0111111111 1111111 Q ss_pred ccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCC--- Q lcl|NC_011045. 239 GSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQ--- 315 (536) Q Consensus 239 ~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~--- 315 (536) .. . |..--++++|+...++..||.||...+...+.....+.+.......-...|..++.-++.++++...... 
T Consensus 170 ~~--~--~~~~diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~ 245 (457) T protein:vir:13 170 LG--W--FTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMSEEGLARAREAW 245 (457) T ss_pred EE--e--eCccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCCHHHHHHHHHHH Confidence 00 0 1112356666666777889999999999999999999998888888889999888777776665432111 Q ss_pred -------C--cceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCC--CCHHHHHHHHHHH-H Q lcl|NC_011045. 316 -------T--GDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGER--VTAEEIRYVASEL-E 382 (536) Q Consensus 316 -------~--g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r--~TAtEi~~r~~E~-~ 382 (536) . |.+.. -.++....++.. +.+.+. .+..+..+..|-++|-.-....-..++ .+..-+.+..... . T Consensus 246 ~~~~~g~~nag~~~v-l~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~ 323 (457) T protein:vir:13 246 RAANSGVDNAHRVAL-LTEGAKFSKVAMSPDEAQF-LQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTM 323 (457) T ss_pred HHHhcCccccCccee-cCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHH Confidence 0 11110 112222333332 234443 344455677888888543211112222 1223333333332 3 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEe-chHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcC Q lcl|NC_011045. 383 DTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTIS-TGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDI 461 (536) Q Consensus 383 ~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~v-s~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~i 461 (536) ..|.|.+.+++++|-.- ++++.......++|. +.| .|. 
+......++..+.+.+ .+ T Consensus 324 ~tl~P~~~~ie~~ln~~-------------L~~~~~~~~~~i~fd~~~l---~~~-D~~~r~~~~~~~~~~G------~~ 380 (457) T protein:vir:13 324 FSLRPWLERIEAGFNRL-------------LFAETADRFRFVKFNLDEI---KRG-APKERMELWSLGLQNG------IY 380 (457) T ss_pred HHHHHHHHHHHHHHHHh-------------hcCccccCceeEEeechhh---hcc-CHHHHHHHHHHHHhCC------Cc Confidence 34555555555553322 333322222334442 222 111 2212222222211111 12 Q ss_pred CHHHHHHHHHHHcCCChhhccCCH--HHHH-HHHHHHHHHHHHHHHH---HHHHHHHHHhhhcCcchHHhhhhcCCCCCC Q lcl|NC_011045. 462 NLAMIKLRIANAIGIDTSGILLTE--EQKQ-QKMAQQSMQMGMDNGA---AALAQGMAAQATASPEAMAAAADSVGLQPG 535 (536) Q Consensus 462 d~d~~~~~~a~~~Gv~p~~i~rs~--~ev~-~~~~q~~~q~~~~~~a---~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~ 535 (536) .+++ +-+.+|.+|. ... |+.- ...-..-. .+...+. .........+....++..+.-.++.+..+- T Consensus 381 T~NE----~R~~~gl~Pi---~~g~~d~~~~~~n~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~~ 452 (457) T protein:vir:13 381 SIDE----VRAAEDMTPL---PDGLGEKYRVPLNLGEVG-EEPEPEPAPAPPAIEPPAEEPDEEPEPEGKPDDEGATEED 452 (457) T ss_pred CHHH----HHHHhCCCCC---CCCcccceeecccccccc-ccccccccCCCCCCCCCccccCCCCCCCCCCccccCCCCc Confidence 3333 2233455441 110 0000 00000000 0000000 000000000000000000000000000000 Q ss_pred C Q lcl|NC_011045. 536 I 536 (536) Q Consensus 536 ~ 536 (536) = T Consensus 453 ~ 453 (457) T protein:vir:13 453 D 453 (457) T ss_pred c Confidence 0 No 153 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=94.63 E-value=0.0039 Score=33.79 Aligned_cols=366 Identities=13% Similarity=0.128 Sum_probs=148.2 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhh-HHHH-HHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 
1 MAEKRTGLAEEGAKSVYERLKNDRAP-YETR-AQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALF 78 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~-~e~~-w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 78 (536) |.= |...+ .|+. .-.. +.... -++|......+..- +...-+-.++--.|++.+|+.+.+ . T Consensus 1 Mg~-------------f~~~~-~r~~~~~~~~~~~~~-~~~~~~~~~~~~~~-~~~~al~~~~v~~cv~~Ia~~iA~--~ 62 (416) T protein:vir:45 1 MGI-------------FYKNE-KRDLQYNEDDLQMMV-QTLPGFQGTKLRQY-KDIEAIRHSDIFTAVMMIASDLAR--M 62 (416) T ss_pred CCc-------------ccccc-cccccCCCcchhHHH-HHhccccccCcccc-chhhhhcchHHHHHHHHHHHhhcc--C Confidence 221 11111 1111 0000 11111 12222211111000 000001123333466777666643 2 Q ss_pred CCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----cChHHHHHHHHHHHhhCcEEEEEecCCCC Q lcl|NC_011045. 79 PMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SN----SYRVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) Q Consensus 79 P~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~~~~~~~ 153 (536) | | ++.-.... .. +.-++..|. += +.+.-....+.++..+|||.+++..+..+ T Consensus 63 p---~-~~~~~~~~------------~~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G 119 (416) T protein:vir:45 63 P---I-RVTVNGQI------------NY-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTG 119 (416) T ss_pred c---e-EEecCccc------------cc-------cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC Confidence 3 3 33211100 00 111222232 21 23455677788888999999999877766 Q ss_pred ceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEec Q lcl|NC_011045. 
154 NYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVE 233 (536) Q Consensus 154 ~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~ 233 (536) .+..+..+|...+.+..|.+|++--.| ..++ T Consensus 120 ~~~~L~~i~~~~v~v~~~~~g~~~~~~-------------------------------------------------~~~~ 150 (416) T protein:vir:45 120 EPMNLTFRKTSEIELKSDARGRLYYFH-------------------------------------------------QRID 150 (416) T ss_pred cEEEEEEEcCceeEEEECCCccEEEEE-------------------------------------------------EEec Confidence 777777777788888888877532111 1111 Q ss_pred CccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccc-chhh-- Q lcl|NC_011045. 234 GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGIT-QPRR-- 310 (536) Q Consensus 234 g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~-~~~~-- 310 (536) +..... ...++ .--++++|+...+| .||.||.+.+...+.......+.......-...|.+++.-++.. +.+. T Consensus 151 ~~~~~~-~~~~~--~~evihir~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~ 226 (416) T protein:vir:45 151 SNGNNI-ERNVK--FEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARD 226 (416) T ss_pred CCCcee-EEEEc--cccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHH Confidence 110000 00011 11235556554444 79999999999999888888888888888888888776544433 3321 Q ss_pred -hc----cCCC-----cceecCCcccccccccccc-cchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCCCCHHHHHHHH Q lcl|NC_011045. 311 -LT----KAQT-----GDFVTGRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGERVTAEEIRYVA 378 (536) Q Consensus 311 -~~----~~~~-----g~~~~g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~r~TAtEi~~r~ 378 (536) +. ..-. |.+.. -.++....++... .+.+ ..+.....+..|-.+|-... +...+....+.+|. . 
T Consensus 227 ~~~~~~~~~~~g~~nag~~~v-l~~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~---~ 301 (416) T protein:vir:45 227 RAREEFHKSFSGTKQAGKVVV-LDESMTFDQLEVDTEVLK-LIRENKSSTREIAGVFGIPLHKFGIETANMSITDA---N 301 (416) T ss_pred HHHHHHHHHhcCccccCceee-cCCCceeEeccCCHHHHH-HHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHH---H Confidence 11 1001 11111 1122223333322 2333 23444555677888885432 22112222222222 1 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHH--HHHHHHHHHHHHH--H--HHHH--- Q lcl|NC_011045. 379 SELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLE--AIGRGQDLDKLER--C--VAAW--- 449 (536) Q Consensus 379 ~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La--~a~r~~~~~~l~~--~--~~~~--- 449 (536) ......|-|.+..+++||-.-| +++-.+..+++.+...+. -..|..-.+.+.+ + .+.+ T Consensus 302 ~~~~~~l~P~~~~ie~~ln~~l-------------~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~ 368 (416) T protein:vir:45 302 LDYLSTLKPYITCVCAELNFKF-------------NDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQR 368 (416) T ss_pred HHHHHHHHHHHHHHHHHHhhhc-------------cccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 2233345555555555543322 222222233333222111 1111222222211 0 1111 Q ss_pred Hhhcch-hhh--------hcCCHHHHHHHH-------H--H-HcCCChhh Q lcl|NC_011045. 450 AALAPM-RDD--------PDINLAMIKLRI-------A--N-AIGIDTSG 480 (536) Q Consensus 450 ~~~~p~-~~~--------~~id~d~~~~~~-------a--~-~~Gv~p~~ 480 (536) -.+.|. -.+ .++..|. ++.+ . . .-| +-.. 
T Consensus 369 ~gl~p~~~gd~~~~~~~~n~~~~~~-~~~~~~~~~~~~~~~~kgG-e~n~ 416 (416) T protein:vir:45 369 DGLAPIPGGNGSIHRVDLNHVNIEL-VDEYQMNKSRATDKKLKGG-EENE 416 (416) T ss_pred hCCCCCCCCCcceEeeccccccccc-ccccCcccccccccccCCC-CCCC Confidence 112121 000 0011111 1100 0 0 011 1222 No 154 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=94.63 E-value=0.0039 Score=33.79 Aligned_cols=366 Identities=13% Similarity=0.128 Sum_probs=148.2 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhh-HHHH-HHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAP-YETR-AQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALF 78 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~-~e~~-w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 78 (536) |.= |...+ .|+. .-.. +.... -++|......+..- +...-+-.++--.|++.+|+.+.+ . T Consensus 1 Mg~-------------f~~~~-~r~~~~~~~~~~~~~-~~~~~~~~~~~~~~-~~~~al~~~~v~~cv~~Ia~~iA~--~ 62 (416) T protein:vir:81 1 MGI-------------FYKNE-KRDLQYNEDDLQMMV-QTLPGFQGTKLRQY-KDIEAIRHSDIFTAVMMIASDLAR--M 62 (416) T ss_pred CCc-------------ccccc-cccccCCCcchhHHH-HHhccccccCcccc-chhhhhcchHHHHHHHHHHHhhcc--C Confidence 221 11111 1111 0000 11111 12222211111000 000001123333466777666643 2 Q ss_pred CCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----cChHHHHHHHHHHHhhCcEEEEEecCCCC Q lcl|NC_011045. 79 PMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SN----SYRVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) Q Consensus 79 P~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~~~~~~~ 153 (536) | | ++.-.... .. +.-++..|. 
+= +.+.-....+.++..+|||.+++..+..+ T Consensus 63 p---~-~~~~~~~~------------~~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G 119 (416) T protein:vir:81 63 P---I-RVTVNGQI------------NY-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTG 119 (416) T ss_pred c---e-EEecCccc------------cc-------cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC Confidence 3 3 33211100 00 111222232 21 23455677788888999999999877766 Q ss_pred ceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEec Q lcl|NC_011045. 154 NYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVE 233 (536) Q Consensus 154 ~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~ 233 (536) .+..+..+|...+.+..|.+|++--.| ..++ T Consensus 120 ~~~~L~~i~~~~v~v~~~~~g~~~~~~-------------------------------------------------~~~~ 150 (416) T protein:vir:81 120 EPMNLTFRKTSEIELKSDARGRLYYFH-------------------------------------------------QRID 150 (416) T ss_pred cEEEEEEEcCceeEEEECCCccEEEEE-------------------------------------------------EEec Confidence 777777777788888888877532111 1111 Q ss_pred CccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccc-chhh-- Q lcl|NC_011045. 234 GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGIT-QPRR-- 310 (536) Q Consensus 234 g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~-~~~~-- 310 (536) +..... ...++ .--++++|+...+| .||.||.+.+...+.......+.......-...|.+++.-++.. +.+. 
T Consensus 151 ~~~~~~-~~~~~--~~evihir~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~ 226 (416) T protein:vir:81 151 SNGNNI-ERNVK--FEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARD 226 (416) T ss_pred CCCcee-EEEEc--cccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHH Confidence 110000 00011 11235556554444 79999999999999888888888888888888888776544433 3321 Q ss_pred -hc----cCCC-----cceecCCcccccccccccc-cchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCCCCHHHHHHHH Q lcl|NC_011045. 311 -LT----KAQT-----GDFVTGRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGERVTAEEIRYVA 378 (536) Q Consensus 311 -~~----~~~~-----g~~~~g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~r~TAtEi~~r~ 378 (536) +. ..-. |.+.. -.++....++... .+.+ ..+.....+..|-.+|-... +...+....+.+|. . T Consensus 227 ~~~~~~~~~~~g~~nag~~~v-l~~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~---~ 301 (416) T protein:vir:81 227 RAREEFHKSFSGTKQAGKVVV-LDESMTFDQLEVDTEVLK-LIRENKSSTREIAGVFGIPLHKFGIETANMSITDA---N 301 (416) T ss_pred HHHHHHHHHhcCccccCceee-cCCCceeEeccCCHHHHH-HHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHH---H Confidence 11 1001 11111 1122223333322 2333 23444555677888885432 22112222222222 1 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHH--HHHHHHHHHHHHH--H--HHHH--- Q lcl|NC_011045. 379 SELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLE--AIGRGQDLDKLER--C--VAAW--- 449 (536) Q Consensus 379 ~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La--~a~r~~~~~~l~~--~--~~~~--- 449 (536) ......|-|.+..+++||-.-| +++-.+..+++.+...+. 
-..|..-.+.+.+ + .+.+ T Consensus 302 ~~~~~~l~P~~~~ie~~ln~~l-------------~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~ 368 (416) T protein:vir:81 302 LDYLSTLKPYITCVCAELNFKF-------------NDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQR 368 (416) T ss_pred HHHHHHHHHHHHHHHHHHhhhc-------------cccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 2233345555555555543322 222222233333222111 1111222222211 0 1111 Q ss_pred Hhhcch-hhh--------hcCCHHHHHHHH-------H--H-HcCCChhh Q lcl|NC_011045. 450 AALAPM-RDD--------PDINLAMIKLRI-------A--N-AIGIDTSG 480 (536) Q Consensus 450 ~~~~p~-~~~--------~~id~d~~~~~~-------a--~-~~Gv~p~~ 480 (536) -.+.|. -.+ .++..|. ++.+ . . .-| +-.. T Consensus 369 ~gl~p~~~gd~~~~~~~~n~~~~~~-~~~~~~~~~~~~~~~~kgG-e~n~ 416 (416) T protein:vir:81 369 DGLAPIPGGNGSIHRVDLNHVNIEL-VDEYQMNKSRATDKKLKGG-EENE 416 (416) T ss_pred hCCCCCCCCCcceEeeccccccccc-ccccCcccccccccccCCC-CCCC Confidence 112121 000 0011111 1100 0 0 011 1222 No 155 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=94.45 E-value=0.0044 Score=33.52 Aligned_cols=350 Identities=12% Similarity=0.066 Sum_probs=142.9 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccc-cc-cccchHHHHHHHHHHHHHHhhcCCCcceeccCChh Q lcl|NC_011045. 14 KSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDY-VT-PWQAVGARGLNNLASKLMLALFPMQTWMRLTISEY 91 (536) Q Consensus 14 ~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~-~~-~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~ 91 (536) -..|+++...+..- ........+..+ ....+..+..-. .. .-.++--.|++.+|+.+.+ + ||.-....+. 
T Consensus 1 Mgl~~~~f~~~~~~-~~~~~~~~~~~~--~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~-l----p~~~~~~~~~ 72 (409) T protein:vir:84 1 MSLFTRIFSGPSEE-RTLTKISGIPSP--AEDWAMHGDRPGANSAMTLGAFYACVTLLADTVAS-L----SIDAYRKKDN 72 (409) T ss_pred CchhhhhhcCCCcc-cccccccccccc--cchhhccCcccchhhhhccHHHHHHHHHHHHhhhh-C----ceEEEEecCC Confidence 23333332221100 000000001111 000000000000 01 1123344566666666643 2 3332222111 Q ss_pred hhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----cChHHHHHHHHHHHhhCcEEEEEec-CCCCceeeEEEEecce Q lcl|NC_011045. 92 EAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SN----SYRVTLFEALKQLVVAGNVLLYLPE-PEGSNYNPMKLYRLSS 165 (536) Q Consensus 92 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~~~-~~~~~~~~~~~~~l~~ 165 (536) .- ++ +.-+...|. +- +.+.-+...+.++..+||+.+|+.. +..+.+..+..++.+. T Consensus 73 ~~------------~~------~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~ 134 (409) T protein:vir:84 73 VR------------IP------VSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDC 134 (409) T ss_pred cc------------cc------cchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCce Confidence 10 00 011122232 22 3445566677788899999998753 3444454455454444 Q ss_pred EEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeE-EEEecCcccccccccc Q lcl|NC_011045. 166 YVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIR-YEEVEGMEVQGSDGTY 244 (536) Q Consensus 166 ~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~-~~~v~g~~i~~~~~~~ 244 (536) +.|....++. ..+.. ...++|+.+. T Consensus 135 v~v~~~~~~~------------------------------------------------~~~~~~~~~~~g~~~~------ 160 (409) T protein:vir:84 135 IHVTDAKDED------------------------------------------------GDWIEPVYRIDGKVVP------ 160 (409) T ss_pred eEEEEcCCCc------------------------------------------------ceEEEEEecCCceEEc------ Confidence 4443332221 11100 1112222211 Q ss_pred ccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccC---------C Q lcl|NC_011045. 
245 PKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKA---------Q 315 (536) Q Consensus 245 ~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~---------~ 315 (536) .-=++++|+...+|..||.||...+...+.......+.......-...|..++.-++.++++..... + T Consensus 161 ---~~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~n 237 (409) T protein:vir:84 161 ---NHRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDADLTPDQVKQTQKQWIQSHHN 237 (409) T ss_pred ---hhhEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcc Confidence 1125666666677778999999999999999998888888888888888888766666655442210 1 Q ss_pred CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-ccc-CCCCCCCHHHHHHHHHH-HHHHhhhhHHH Q lcl|NC_011045. 316 TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQ-RTGERVTAEEIRYVASE-LEDTLGGVYSI 391 (536) Q Consensus 316 ~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~-~~~~r~TAtEi~~r~~E-~~~~LG~v~~r 391 (536) .|.+.. -.++....++.. +.+.+. .+..+..+..|-++|-... ++. .+...-++.-+.+.... ....|.|.+.. T Consensus 238 ~g~~~v-l~~g~~~~~~~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ 315 (409) T protein:vir:84 238 RRLPAV-MSAGIKWQSVSITPNESQF-LETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLLPWLRC 315 (409) T ss_pred CCCeee-cCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHHHH Confidence 111111 122233334433 234443 3444555677888874432 111 11111122223333322 24457788888 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHH---HHHHHHHHHHHH--H--HHHH---Hhhcc------- Q lcl|NC_011045. 392 LSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEA---IGRGQDLDKLER--C--VAAW---AALAP------- 454 (536) Q Consensus 392 l~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~---a~r~~~~~~l~~--~--~~~~---~~~~p------- 454 (536) ++++|-.-| ..+..+++.+ +.|-. 
..|...+..+.+ + .+.+ -.+.| T Consensus 316 ie~~l~~~L----------------~~g~~i~fd~-~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~ggD~~ 378 (409) T protein:vir:84 316 IEQALDTFL----------------PRGQFVKFNV-DGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIPEGDIH 378 (409) T ss_pred HHHHHHHhc----------------cCCCeEEEec-hhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccee Confidence 777753211 0111122221 11111 011111111100 0 0000 00001 Q ss_pred --------------h------hhhhcCCHHH Q lcl|NC_011045. 455 --------------M------RDDPDINLAM 465 (536) Q Consensus 455 --------------~------~~~~~id~d~ 465 (536) . .....-|..+ T Consensus 379 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~gn~ 409 (409) T protein:vir:84 379 LQPMNFVPLGYVPPEEPAQEPQPNSATEGNK 409 (409) T ss_pred eecccccccccCCccccCcCCCCCCccCCCC Confidence 0 0000112222 No 156 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=94.42 E-value=0.0045 Score=33.47 Aligned_cols=352 Identities=13% Similarity=0.145 Sum_probs=144.1 Q ss_pred HHHH--HHHHHhccc-c------------cCCCCCcc----ccc--------------ccccc-------cchHHHHHHH Q lcl|NC_011045. 29 TRAQ--NCAQYTIPS-L------------FPKDSDNA----STD--------------YVTPW-------QAVGARGLNN 68 (536) Q Consensus 29 ~~w~--e~~~~~~P~-~------------~~~~~~~~----~~~--------------~~~~~-------dst~~~a~~~ 68 (536) .||- +|| |..+. + |......+ ... .-..+ .++--.|++. T Consensus 1 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~acv~~ 79 (441) T protein:vir:98 1 MHWYNTDCY-FVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMM 79 (441) T ss_pred CceecCccc-eeccccccchhhhhhccccccccccccccCCCcchHHHHHHhhcccccCccccchhhhhccHHHHHHHHH Confidence 4442 333 11110 0 00000000 000 00001 1122234555 Q ss_pred HHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----cChHHHHHHHHHHHhhCcE Q lcl|NC_011045. 
69 LASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SN----SYRVTLFEALKQLVVAGNV 143 (536) Q Consensus 69 Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~ 143 (536) +|+.+.+ .| +++.-.... . .+.-++..|. +- +.+.-+...+.++..+||| T Consensus 80 Ia~~iA~--lp----l~~~~~~~~------------~-------~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna 134 (441) T protein:vir:98 80 IASDLAR--MP----IRVTVNGQI------------N-------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHG 134 (441) T ss_pred HHHhhcc--Cc----eEEecCCcc------------c-------ccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCe Confidence 5554432 23 222111000 0 0111223332 22 3445567778888999999 Q ss_pred EEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCC Q lcl|NC_011045. 144 LLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDS 223 (536) Q Consensus 144 ~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~ 223 (536) .+++..+..+.+..+..+|.+.+.+..|.+|++--.+ T Consensus 135 y~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~------------------------------------------- 171 (441) T protein:vir:98 135 YIEITRDKTGEPMNLTFRKTSEIELKLDARGRLYYFH------------------------------------------- 171 (441) T ss_pred EEEEEEcCCCcEEEEEEEcCceeEEEECCCCcEEEEE------------------------------------------- Confidence 9999877766777777788888888888888541111 Q ss_pred CceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccc Q lcl|NC_011045. 224 GEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPA 303 (536) Q Consensus 224 ~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~ 303 (536) ..+++..... ... 
+..--++++|+...+| .||.||...+...+...+...+.......-...|..++.-+ T Consensus 172 ------~~~~~~~~~~-~~~--~~~~dviHir~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~ 241 (441) T protein:vir:98 172 ------QRIDSNGNNI-ERN--VKFEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK 241 (441) T ss_pred ------EEeccCccee-eEE--EccccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC Confidence 1111100000 000 0111245556554455 79999999999888888888888888888888888776544 Q ss_pred cc-cchhhh---c----c--CC-C--cceecCCcccccccccccc-cchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCC Q lcl|NC_011045. 304 GI-TQPRRL---T----K--AQ-T--GDFVTGRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGER 368 (536) Q Consensus 304 g~-~~~~~~---~----~--~~-~--g~~~~g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~r 368 (536) +. .+.+.. . . ++ . |.+. --.++....++... .+.+. .+........|-++|-... ++..+... T Consensus 242 ~~~~~~e~~~~~~~~~~~~~~G~~nag~~~-vl~~g~~~~~l~~~~~d~q~-~e~r~~~~~~Ia~~fgVPp~~lg~~~~~ 319 (441) T protein:vir:98 242 GVLDNKKARDRAREEFHKSFSGTKQAGKVV-VLDESMTFDQLEVDTEVLKL-IRENKSSTREIAGVFGIPLHKFGIETAN 319 (441) T ss_pred CCCCCHHHHHHHHHHHHHHhcCccccCcce-ecCCCceEEEccCChhHHHH-HHHHHHhHHHHHHHhCCCHHHcCCCCCC Confidence 43 233321 1 1 11 0 1111 11122223333322 23332 3444555667888885432 22122222 Q ss_pred CCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHH---HHHHHHHHHHHH- Q lcl|NC_011045. 369 VTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEA---IGRGQDLDKLER- 444 (536) Q Consensus 369 ~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~---a~r~~~~~~l~~- 444 (536) .+.+|. .......|-|.+.++++||-.- ++++-.+..+++.. +.|-. 
..|..-.+.+.+ T Consensus 320 ~s~~q~---~~~y~~tl~P~~~~ie~~ln~~-------------L~~~~~~~~~~fd~-~~llr~d~~~~~~~~~~~~~~ 382 (441) T protein:vir:98 320 MSITDA---NLDYLSTLKPYITCVCAELNFK-------------FNDEYVNREFKFDT-TEIRVVDEKTQAEIDKINIDS 382 (441) T ss_pred ccHHHH---HHHHHHHHHHHHHHHHHHHHhh-------------ccccccCceEEEec-hhhhccCHHHHHHHHHHHHhC Confidence 232332 1222335555555555554322 22322222233322 22211 112222222211 Q ss_pred -H--HHHH---Hhhcch-hhh--------hcCCHHHH----------HHHHHHHcCCChhh Q lcl|NC_011045. 445 -C--VAAW---AALAPM-RDD--------PDINLAMI----------KLRIANAIGIDTSG 480 (536) Q Consensus 445 -~--~~~~---~~~~p~-~~~--------~~id~d~~----------~~~~a~~~Gv~p~~ 480 (536) + .+.+ -.+.|. -.+ .+++.+.+ .+.- ..-| +-.. T Consensus 383 G~~T~NE~R~~~gl~pi~gGd~~~~~~~~n~~~~~~~~~~q~~~~~~~~~~-~kgG-e~ne 441 (441) T protein:vir:98 383 GKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKK-LKGG-EENE 441 (441) T ss_pred CCcCHHHHHHHhCCCCCCCCCcceEeecccccccccccccccccccccccc-cCCC-CCCC Confidence 0 1111 112221 000 00111110 0000 0112 1222 No 157 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=94.26 E-value=0.0049 Score=33.24 Aligned_cols=367 Identities=13% Similarity=0.070 Sum_probs=145.0 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHH--HHHHHHHhcc----cccCCCCCcccccc-ccccc-chHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETR--AQNCAQYTIP----SLFPKDSDNASTDY-VTPWQ-AVGARGLNNLASK 72 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~--w~e~~~~~~P----~~~~~~~~~~~~~~-~~~~d-st~~~a~~~Laa~ 72 (536) |..+| ....|++++.-..+.++. +..-..-..+ .++...+..+..-. .+... ++--.|++.+|+. 
T Consensus 1 ~~~~~-------~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~a~~~~aV~~~v~~Ia~~ 73 (432) T protein:vir:97 1 MPDEK-------KLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQA 73 (432) T ss_pred CCCcc-------cCchhhhhHhhcCCccccccccccccccCchhhhhhcccccccCcccchHhhhcchHHHHHHHHHHHh Confidence 55544 334555554443322210 0000000000 00000001110000 11112 3333344555554 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----cChHHHHHHHHHHHhhCcEEEEE Q lcl|NC_011045. 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SN----SYRVTLFEALKQLVVAGNVLLYL 147 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~ 147 (536) + +.+ ||.-..-.+..-.+ ..+.-++..|+ +- +.+.=....+.++..+|||.+++ T Consensus 74 i-a~l----p~~~y~~~~~g~~~----------------~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~ 132 (432) T protein:vir:97 74 V-AAM----PLMMYMRTPDGRKE----------------AVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRK 132 (432) T ss_pred h-ccC----ceEEEEecCCCccc----------------ccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEE Confidence 4 333 33211111110000 01112222332 22 34445566777889999999888 Q ss_pred ecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCcee Q lcl|NC_011045. 148 PEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYI 227 (536) Q Consensus 148 ~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~ 227 (536) ..++ +.+..+..++...+.+..|.+|++ +|+ T Consensus 133 ~~~~-g~~~~L~~l~p~~v~v~~~~~g~~--~y~---------------------------------------------- 163 (432) T protein:vir:97 133 VVTD-GRIESLQYLANDRLTITTDTKGNT--AYR---------------------------------------------- 163 (432) T ss_pred EecC-CcEEEEEEEcCcceEEEEcCCCcE--EEE---------------------------------------------- Confidence 7654 456566666667777767766642 111 Q ss_pred EEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccc Q lcl|NC_011045. 
228 RYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQ 307 (536) Q Consensus 228 ~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~ 307 (536) |...+|.... ++-++ +++.|....+| .||.||...+...+.......+.......-...|-.++.-++.++ T Consensus 164 -~~~~~g~~~~-----~~~~~--iih~r~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~l~ 234 (432) T protein:vir:97 164 -YRRTDGQMID-----IPRQQ--IWKIMGYSLDG-ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLT 234 (432) T ss_pred -EEecCceEEE-----Ecccc--EEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCCCCC Confidence 1111111100 00011 34445444455 799999998888887777777777777777778877776666555 Q ss_pred hhhhccC-------CC-cceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCC---CCCHHHHH Q lcl|NC_011045. 308 PRRLTKA-------QT-GDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGE---RVTAEEIR 375 (536) Q Consensus 308 ~~~~~~~-------~~-g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~---r~TAtEi~ 375 (536) .+....- .+ |.+ .--.++....++.. +.+.+. .+.....+..|-++|-......-..+ .-+..-+. T Consensus 235 ~e~~~~~~~~~~~~~nag~~-~vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e 312 (432) T protein:vir:97 235 DDQYDSFSKKVSGSVEAGRA-PLLEGGMDVKSLGLNPVDAQL-LQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIE 312 (432) T ss_pred HHHHHHHHHHHhhhhcCCCc-eecCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHH Confidence 4432210 00 111 11112222333332 234443 34455667788888854321111111 11223333 Q ss_pred HHHHHH-HHHhhhhHHHHHHHHHHHHHHH-------------------------HHHHHHhcCC-----------CCCCC Q lcl|NC_011045. 376 YVASEL-EDTLGGVYSILSQELQLPLVRV-------------------------LLKQLQATQQ-----------IPELP 418 (536) Q Consensus 376 ~r~~E~-~~~LG~v~~rl~~E~l~Pli~r-------------------------~~~il~~~g~-----------lp~~~ 418 (536) +..... ...|.|.+.++++++-.-|+.. .+..+.+.|. 
+||++ T Consensus 313 ~~~~~f~~~tl~P~~~~ie~~ln~kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~ 392 (432) T protein:vir:97 313 SQQLGFLTMTLSPWLRRIEQSIALNLLTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLG 392 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC Confidence 333222 2366666666666654322211 1111111121 13333 Q ss_pred CcceEEEEe---chHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccC Q lcl|NC_011045. 419 KEAVEPTIS---TGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILL 483 (536) Q Consensus 419 ~~~v~v~~v---s~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~r 483 (536) +++..+... .||..+.+.. .-.|...+ -.-+...+-+ T Consensus 393 g~~~~~~~~~~~~pl~~~~~~~-------------~~~~~~~~---------------~~~~~~~~~~ 432 (432) T protein:vir:97 393 GNAAVLTVQSAMVPLDSIGLQA-------------SPEPASGL---------------GNQQQDKVSK 432 (432) T ss_pred CCcceEeecccccchhhhcccC-------------CCCCCCCC---------------CCcccccccC Confidence 322111100 0111110000 00000000 0001111111 No 158 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=94.08 E-value=0.0054 Score=32.99 Aligned_cols=454 Identities=10% Similarity=0.059 Sum_probs=190.7 Q ss_pred CCCcc-------ccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCC-Cc-cc----ccccc--cccchHHHH Q lcl|NC_011045. 1 MAEKR-------TGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDS-DN-AS----TDYVT--PWQAVGARG 65 (536) Q Consensus 1 Ma~~~-------~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~-~~-~~----~~~~~--~~dst~~~a 65 (536) |...- .++......+.|....+.+......|. |.....+. .. .. .+... 
..++.+..+ T Consensus 1 ~~~p~~~~~~~~~~~~~~~~~~~y~~~a~~~~~~~~~w~-------p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~a 73 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMTSLREYAGYHGGGSGFGGQLRSWN-------PPSESVDAALLPNFTRGNARADDLVRNNGYAANA 73 (533) T ss_pred CCCchhhhhhcccccchHHHHHhhhhccCCCCCcccccc-------cCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHH Confidence 54421 111111111112211111111111221 21111110 00 00 11111 367889999 Q ss_pred HHHHHHHHHH-hhcC-CCc-ceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHH----------HHHhccChHHHHH Q lcl|NC_011045. 66 LNNLASKLML-ALFP-MQT-WMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMN----------YIESNSYRVTLFE 132 (536) Q Consensus 66 ~~~Laa~l~~-~ltP-~~~-Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~----------~l~~snf~~~~~~ 132 (536) ++.+++.+++ +++| ++| |=.|...+.. .++|-+.+++.... .=.+.+||..... T Consensus 74 v~~~~~nvVG~Gi~~~~~p~~~~lg~~~~~-------------~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l 140 (533) T protein:vir:34 74 IQLHQDHIVGSFFRLSHRPSWRYLGIGEEE-------------ARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIRE 140 (533) T ss_pred HHHHHHHhhCCCceeeeccchhhcCCChhH-------------HHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHH Confidence 9999999988 4678 455 5445444332 23333344433322 2234689999999 Q ss_pred HHHHHHhhCcEEEEEecCCC-CceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceE Q lcl|NC_011045. 133 ALKQLVVAGNVLLYLPEPEG-SNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETI 211 (536) Q Consensus 133 ~~~dl~~~G~~~l~~~~~~~-~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~ 211 (536) ++..+++-|-+++-...... +..+.+++ +.+..+.|.... .. ++ .= T Consensus 141 ~~r~~~~dGE~f~~~~~~~~~g~~~~~~l----------------------q~ie~d~l~~~~--------~~--~~-~~ 187 (533) T protein:vir:34 141 GVAMHAFNGELFVQATWDTSSSRLFRTQF----------------------RMVSPKRISNPN--------NT--GD-SR 187 (533) T ss_pred HHHHHHhCCceEEEeeeccCCCCccceEE----------------------EEechhhcCCCC--------CC--CC-CC Confidence 99999999988664322111 11111111 111111111100 00 00 01 Q ss_pred EEEEEEEecCCCCceeEEEEec---CccccccccccccccCc---eEEEeeeecCCCccccchHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 
212 DVYTHIYLDEDSGEYIRYEEVE---GMEVQGSDGTYPKEACP---YIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEA 285 (536) Q Consensus 212 ~v~~~v~p~~~~~~~~~~~~v~---g~~i~~~~~~~~~~~~P---~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~ 285 (536) .|+..|+.+..|....+|-.-+ |........+..+...| +++.-....+|..=|.+..-.+|..++.|+....+ T Consensus 188 ~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~da 267 (533) T protein:vir:34 188 NCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNT 267 (533) T ss_pred ceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeeccChhHeeeeccccCCCcccCCchHHHHHHHHHHHHHHHHH Confidence 3556666655543322222111 11000000000011112 34444445689999999999999999999999999 Q ss_pred HHHHHHHHhCCceeeccc-cccchh-------------hh---------------ccCCCcceecCCcc-cccccccc-c Q lcl|NC_011045. 286 IVKMSMISSKVIGLVNPA-GITQPR-------------RL---------------TKAQTGDFVTGRPE-DISFLQLE-K 334 (536) Q Consensus 286 ~~~~~~~a~~p~~lv~~~-g~~~~~-------------~~---------------~~~~~g~~~~g~~~-~~~~~~~~-~ 334 (536) .+.++..++.....+..+ +-.... .+ ...++|.|..-.++ ++.+..-. . T Consensus 268 el~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p 347 (533) T protein:vir:34 268 QLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDT 347 (533) T ss_pred HHHHHHHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhccCcceeeccCceeeecCCCCeeeecCCCCC Confidence 999999999988776522 100000 00 01234554433333 23222211 1 Q ss_pred ccchhHHHHHHHHHHHHHHHHH--hhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011045. 335 QADFTVAKAVSDAIEARLSFAF--MLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQ 412 (536) Q Consensus 335 ~~~~~~~~~~i~~~~~rI~~af--~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g 412 (536) +++|. .....+...|..++ =+..+. 
.|-.+++=.-+++-..|..+.+-..=..|...|+.|+..+.+..+...| T Consensus 348 ~~~~~---~f~~~~lr~iAaglGi~ye~lt-~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G 423 (533) T protein:vir:34 348 DNGYS---VFEQSLLRYIAAGLGVSYEQLS-RNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRR 423 (533) T ss_pred CCCHH---HHHHHHHHHHHhhcCCCHHHHh-hhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC Confidence 22333 23333344444443 122222 3444556556666666666665555566677788999999999999999 Q ss_pred CCCCCCCc----------ceEEEEechHH-HHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhc Q lcl|NC_011045. 413 QIPELPKE----------AVEPTISTGLE-AIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGI 481 (536) Q Consensus 413 ~lp~~~~~----------~v~v~~vs~La-~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i 481 (536) .+|-+.+. .++++++.|=- ..=-..+++.....+.. .+.. ...++...|.||. T Consensus 424 ~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~-----------G~~s---~~~~~a~~G~D~~-- 487 (533) T protein:vir:34 424 VVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEA-----------GLST---YEKECAKRGDDYQ-- 487 (533) T ss_pred cccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHc-----------CCCC---HHHHHHHcCCCHH-- Confidence 98733221 12444444310 00000111111000000 0000 0122223454443 Q ss_pred cCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCC Q lcl|NC_011045. 
482 LLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGL 532 (536) Q Consensus 482 ~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (536) |+.+.++...+.++...-...............+.....+.++.+- T Consensus 488 -----ev~~q~a~e~~~~~~~gl~~~~~~~~~~~s~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:34 488 -----EIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQSTEEEKSDSRAA 533 (533) T ss_pred -----HHHHHHHHHHHHHHhcCCCCCCCCCcCccCCCCCCCCCCcccCCCC Confidence 2222222221111111000000000000000000001111111111 No 159 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=94.03 E-value=0.0056 Score=32.92 Aligned_cols=367 Identities=14% Similarity=0.099 Sum_probs=149.0 Q ss_pred CCCccccccHHHHHHHHHHHHHH---hhhHHH-HHHHHHHHhcc----cccCCCCCcccccccccccchHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKND---RAPYET-RAQNCAQYTIP----SLFPKDSDNASTDYVTPWQAVGARGLNNLASK 72 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~---R~~~e~-~w~e~~~~~~P----~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~ 72 (536) |+. ++.-..|++++.- ++++.. .|..+.-..-. ...+.++...=+...-+=.++--.|++.+|+. T Consensus 1 ~~~-------~~~mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~~ 73 (432) T protein:vir:81 1 MPD-------EKKLGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQA 73 (432) T ss_pred CCc-------hhhcchhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHHHHh Confidence 554 3344566655432 111111 01100000000 00000000000000011224444566666666 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----cChHHHHHHHHHHHhhCcEEEEE Q lcl|NC_011045. 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SN----SYRVTLFEALKQLVVAGNVLLYL 147 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~ 147 (536) +.+. |+.-..-.+..-.+ ..+.-++..|. 
+- +.+.-.+.++.++..+|||.+++ T Consensus 74 ia~l-----p~~~y~~~~~g~~~----------------~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i 132 (432) T protein:vir:81 74 IAAM-----PLTMYMRTPDGRKE----------------AVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRK 132 (432) T ss_pred hhhC-----ceeeEEecCCccee----------------cccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEE Confidence 6433 32211111100000 00111222232 22 23445667777889999999888 Q ss_pred ecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCcee Q lcl|NC_011045. 148 PEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYI 227 (536) Q Consensus 148 ~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~ 227 (536) ...+ +.+..+..++.+.+-+..|.+|++. |+ T Consensus 133 ~~~~-g~~~~L~~l~~~~v~v~~~~~g~~~--y~---------------------------------------------- 163 (432) T protein:vir:81 133 VVTD-GRIESLQYLANDRLTITTDPKGNTA--YR---------------------------------------------- 163 (432) T ss_pred EecC-CcEEEEEEEcCCceEEEECCCCcEE--EE---------------------------------------------- Confidence 7654 4566666666677777776666321 11 Q ss_pred EEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccc Q lcl|NC_011045. 228 RYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQ 307 (536) Q Consensus 228 ~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~ 307 (536) |...+|.... ++.+ =++++|....+| .||.||...+...+.......+.......-...|-.++.-++.++ T Consensus 164 -~~~~~g~~~~-----~~~~--~iih~r~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~ 234 (432) T protein:vir:81 164 -YRRTDGQMID-----IPKQ--QIWKIMGYSLDG-ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLT 234 (432) T ss_pred -EEecCceEEE-----Eccc--cEEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCC Confidence 0011111000 0000 123445444555 799999999988888888888777777777778877766666555 Q ss_pred hhhhc---cC----C-CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-ccc-CC-CCCCCHHHHH Q lcl|NC_011045. 
308 PRRLT---KA----Q-TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQ-RT-GERVTAEEIR 375 (536) Q Consensus 308 ~~~~~---~~----~-~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~-~~-~~r~TAtEi~ 375 (536) ++... .. . .|.+.. -.++....++.. +.+.+. .+..+..+..|-++|-.-. ++. .+ +..-|..-+. T Consensus 235 ~e~~~~~~~~~~~~~nag~~~v-l~~g~~~~~l~~~~~d~q~-le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~e 312 (432) T protein:vir:81 235 DDQYDSFAKKVSGSVEAGRAPL-LEGGMDVKSLGLNPVDAQL-LQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIE 312 (432) T ss_pred HHHHHHHHHHHhhhhcCCCcee-cCCCceEEEccCCHHHHHH-HHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHH Confidence 54322 10 0 011111 112222333332 234443 3445666778888884322 111 11 1112334444 Q ss_pred HHHHHH-HHHhhhhHHHHHHHHHHHHHHH-------------------------HHHHHHhcCC-----------CCCCC Q lcl|NC_011045. 376 YVASEL-EDTLGGVYSILSQELQLPLVRV-------------------------LLKQLQATQQ-----------IPELP 418 (536) Q Consensus 376 ~r~~E~-~~~LG~v~~rl~~E~l~Pli~r-------------------------~~~il~~~g~-----------lp~~~ 418 (536) +..... ...|.|.+.++++||-.-|+.+ .+..+.+.|. +||++ T Consensus 313 q~~~~f~~~tl~P~~~~ie~~l~~kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~ 392 (432) T protein:vir:81 313 SQQLGFLTMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLG 392 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC Confidence 433333 2367777777777764433211 1111222221 13333 Q ss_pred CcceEEEEec---hHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccC Q lcl|NC_011045. 419 KEAVEPTIST---GLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILL 483 (536) Q Consensus 419 ~~~v~v~~vs---~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~r 483 (536) +++..+...+ |+..+.. 
+-.|......-| -+-..+-+ T Consensus 393 g~~~~~~~~~~~~pl~~~~~---------------~~~~~~~~~~~n-------------~~~~~~~~ 432 (432) T protein:vir:81 393 GNAAVLTVQSAMVPLDSIGL---------------QASPEPASGLGN-------------QQQDKVSK 432 (432) T ss_pred CCcceEeecCcccchhhhcc---------------CCCCCCCCCCCC-------------cccccccC Confidence 3221110000 1111100 000000000000 00000111 No 160 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=94.00 E-value=0.0057 Score=32.89 Aligned_cols=242 Identities=13% Similarity=0.114 Sum_probs=111.5 Q ss_pred CCCccccccHHHHHHHHHHHHHHhh-hHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRA-PYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFP 79 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~-~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 79 (536) |.= |... ..|+ .....|..-.--+.|.+....+..-. ...-+-.++--.|++.+|+.+.+. | T Consensus 1 Mgl-------------F~~~-~~r~~~~~~~~~~~~~~~~~~~~~~~~~~v~-~~~al~~~~v~~~i~~ia~~iA~l--p 63 (251) T protein:vir:46 1 MGI-------------FYKN-EKRDLQYNEDDLQMMVQTLPSFQGTKLRQYK-DIEAIRHSDIFTAVMMIASDLARM--P 63 (251) T ss_pred CCc-------------cccc-cccccCCCccchhhhhhhhccccCcCcceec-hhhhhccHHHHHHHHHHHHhHhhC--c Confidence 331 1111 1111 11111110000112222211111000 001122344445667777766443 3 Q ss_pred CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcc----ChHHHHHHHHHHHhhCcEEEEEecCCCCc Q lcl|NC_011045. 80 MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SNS----YRVTLFEALKQLVVAGNVLLYLPEPEGSN 154 (536) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~~~~~~~~~ 154 (536) |.-..-. ... . +.-+...|. +-| .+.-+.....++..+|||.+|+..+..+. 
T Consensus 64 ---~~~~~~~-~~~------------~-------~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~ 120 (251) T protein:vir:46 64 ---IRVTVNG-QIN------------Y-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGE 120 (251) T ss_pred ---eEEeeCc-ccc------------c-------cchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCc Confidence 3221111 000 0 111122222 333 34455566778889999999998777667 Q ss_pred eeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecC Q lcl|NC_011045. 155 YNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEG 234 (536) Q Consensus 155 ~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g 234 (536) ++.+..++...+-+..|.+|++--. ++.+..+. ++.. ..+.. T Consensus 121 ~~~L~~i~~~~v~v~~~~~g~~~~~----------------------------------~~~~~~~~-~g~~---~~~~~ 162 (251) T protein:vir:46 121 PMNLTFRKTSEIELKSDARGRLYYF----------------------------------HQRIDSNG-NNIE---RNVKF 162 (251) T ss_pred EEEEEEECCceEEEEECCCCcEEEE----------------------------------EEEeccCC-ccee---EEECC Confidence 7777777778888877777733110 00000000 0000 00111 Q ss_pred ccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccc-chhhhc- Q lcl|NC_011045. 235 MEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGIT-QPRRLT- 312 (536) Q Consensus 235 ~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~-~~~~~~- 312 (536) + =++++|....+| .||.||...+...+...+...+.......-...|..++.-++.. +.+... T Consensus 163 ------------~--diiH~r~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~~e~~~~ 227 (251) T protein:vir:46 163 ------------E--DMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDR 227 (251) T ss_pred ------------c--cEEEecCcCCCC-eeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCCCCHHHHHH Confidence 1 135556554444 79999999999999999999999999999989998777554433 332211 Q ss_pred -cCCCcceecCCcccccccccccccchhHHHHH Q lcl|NC_011045. 
313 -KAQTGDFVTGRPEDISFLQLEKQADFTVAKAV 344 (536) Q Consensus 313 -~~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~ 344 (536) ....-....|..+ .+.+.. .+++ T Consensus 228 ~~~~~~~~~~g~~n-~g~~~~--------gm~~ 251 (251) T protein:vir:46 228 AREEFPKVLVELNK-LGKLSY--------SMNQ 251 (251) T ss_pred HHHHHHHHhcCccc-cccccc--------ccCC Confidence 1111111222211 111111 1111 No 161 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=93.76 E-value=0.0065 Score=32.59 Aligned_cols=368 Identities=12% Similarity=0.054 Sum_probs=147.4 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHH--HHHHHHHHhccc----ccCCCCCcccccc-cccc-cchHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYET--RAQNCAQYTIPS----LFPKDSDNASTDY-VTPW-QAVGARGLNNLASK 72 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~--~w~e~~~~~~P~----~~~~~~~~~~~~~-~~~~-dst~~~a~~~Laa~ 72 (536) |..+| .-..|+++++-..+-.+ .+.....-..+. .+...+..+..-. .... .++--.|++.+|+. T Consensus 1 ~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~Ia~~ 73 (432) T protein:vir:10 1 MPDEK-------KLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQA 73 (432) T ss_pred CCCCc-------ccchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHHHHHHHHh Confidence 55544 23444444333222111 110000000000 0000000010000 0111 23333455555555 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----cChHHHHHHHHHHHhhCcEEEEE Q lcl|NC_011045. 
73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SN----SYRVTLFEALKQLVVAGNVLLYL 147 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~ 147 (536) + +.+ ||.-..-.+..-.+ ..+.-++..|+ +- +.+.=.+..+.++..+|||.+++ T Consensus 74 i-a~l----p~~~y~~~~~g~~~----------------~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~ 132 (432) T protein:vir:10 74 I-AAM----PLTMYMRTPDGRKE----------------AVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRK 132 (432) T ss_pred h-hhC----ceeEEEecCCCccc----------------ccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEE Confidence 5 333 44211111100000 01122233332 22 34444666777888999999988 Q ss_pred ecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCcee Q lcl|NC_011045. 148 PEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYI 227 (536) Q Consensus 148 ~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~ 227 (536) ...+ +.+..+..++...+.+..|.+|++ +|+ .. .. +++.. T Consensus 133 ~~~~-g~~~~L~~l~~~~v~v~~~~~g~~--~y~----------------------------------~~--~~-~g~~~ 172 (432) T protein:vir:10 133 VVTD-GRIESLQYLANDRLTITTDTKGNT--AYR----------------------------------YR--RT-DGQMI 172 (432) T ss_pred EecC-CcEEEEEEEcCCceEEEEcCCCcE--EEE----------------------------------EE--ec-CceEE Confidence 7654 566666777777888877777743 111 00 00 11110 Q ss_pred EEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccc Q lcl|NC_011045. 228 RYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQ 307 (536) Q Consensus 228 ~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~ 307 (536) .+... 
+ ++++|....+| .||.||...+...+.......+.......-...|-.++.-++.++ T Consensus 173 ---~~~~~------------~--iih~~~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~ 234 (432) T protein:vir:10 173 ---DIPKQ------------Q--IWKIMGYSLDG-ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLT 234 (432) T ss_pred ---EEcCc------------c--EEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCC Confidence 11111 1 23344444444 799999999888888777777777776666777877776555555 Q ss_pred hhhhcc---CCCcce----ecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCC---CCCCHHHHHH Q lcl|NC_011045. 308 PRRLTK---AQTGDF----VTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRTG---ERVTAEEIRY 376 (536) Q Consensus 308 ~~~~~~---~~~g~~----~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~---~r~TAtEi~~ 376 (536) ++.... .-.|.. +.--.++....++.. +.+.+. .+..+..+..|-++|-.-....-.. ..-+..-+.+ T Consensus 235 ~e~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~-le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~ 313 (432) T protein:vir:10 235 DDQYDSFAKKVSGSVEAGRAPLLEGGMDVKSLGLNPVDAQL-LQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIES 313 (432) T ss_pred HHHHHHHHHHHhhhhhCCCceecCCCceEEEccCChHHHHH-HHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHH Confidence 443221 000110 001112222333332 334553 3455667778888884422111111 1122233333 Q ss_pred HHHHHH-HHhhhhHHHHHHHHHHHHHHH-------------------------HHHHHHhcCC-----------CCCCCC Q lcl|NC_011045. 377 VASELE-DTLGGVYSILSQELQLPLVRV-------------------------LLKQLQATQQ-----------IPELPK 419 (536) Q Consensus 377 r~~E~~-~~LG~v~~rl~~E~l~Pli~r-------------------------~~~il~~~g~-----------lp~~~~ 419 (536) ...... ..|.|.+.++++|+-.-|+.. .+..+...|. 
+||+++ T Consensus 314 ~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi~g 393 (432) T protein:vir:10 314 QQLGFLSMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGG 393 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Confidence 333322 366677777666654322211 1111122221 233333 Q ss_pred cceEEEEe---chHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHH Q lcl|NC_011045. 420 EAVEPTIS---TGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKL 468 (536) Q Consensus 420 ~~v~v~~v---s~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~ 468 (536) ++..+..- .||..+.+.. .-.|.....+-+-++..+ T Consensus 394 ~~~~~~~~~~~~pl~~~~~~~-------------~~~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 394 NAAVLTVQSAMVPLDSIGLQA-------------SPEPASGLGNQQQDKVSK 432 (432) T ss_pred CcceEeecCcccchhhhcccC-------------CCCCCCCCCCcccccccC Confidence 22111000 0111111000 000000000001111111 No 162 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=93.46 E-value=0.0075 Score=32.24 Aligned_cols=429 Identities=10% Similarity=0.020 Sum_probs=186.1 Q ss_pred CCCccccc-------cHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccc-----cccc--ccccchHHHHH Q lcl|NC_011045. 1 MAEKRTGL-------AEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS-----TDYV--TPWQAVGARGL 66 (536) Q Consensus 1 Ma~~~~~~-------~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~-----~~~~--~~~dst~~~a~ 66 (536) |--...++ .+......|+.....|. |+. .|...++...... .+.. 
-..++.+..++ T Consensus 1 m~~~~~~~~a~~~~~~~~~~~~~y~aa~~~~~-----~~~-----~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av 70 (495) T protein:vir:10 1 MNMTPSGYQSLASGLLVPVGASAYEGASGGHR-----WQD-----IGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAV 70 (495) T ss_pred CCcccccccccchhhhhHHHhhhhhccccCcc-----cCC-----CCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHH Confidence 43333333 23444455665554432 221 0111111111010 1111 13678899999 Q ss_pred HHHHHHHHH-hhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEE Q lcl|NC_011045. 67 NNLASKLML-ALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLL 145 (536) Q Consensus 67 ~~Laa~l~~-~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l 145 (536) +.+.+.+++ +++|.- .+.++.+++. -...-+.|.+.| ++-.+.+||.....++..+++-|-+++ T Consensus 71 ~~~~~~vVG~Gi~p~~-----~~~~~~~~~~-----ie~~w~~wa~~~-----D~~g~~~f~~lq~l~~r~~~~dGE~f~ 135 (495) T protein:vir:10 71 ATWVAAAVGNGLTPRW-----RMKEQELRQE-----LQELWGDWVNEA-----DFDEVQSFYGLQALVVRTVINSGEAFV 135 (495) T ss_pred HHHHHhhcCCCccccc-----CCchHHHHHH-----HHHHHHHhhcCc-----ccccccCHHHHHHHHHHHHHhCCceEE Confidence 999999877 556732 1233333321 123344554432 233457999999999999999998876 Q ss_pred EEecC--CCCce--eeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecC Q lcl|NC_011045. 146 YLPEP--EGSNY--NPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDE 221 (536) Q Consensus 146 ~~~~~--~~~~~--~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~ 221 (536) -+... ..+.. ..++.+....+....+ .......-.|+..|+.+. T Consensus 136 ~~~~~~~~~g~~~~~~lqliepd~l~~~~~--------------------------------~~~~~~g~~i~~GIe~d~ 183 (495) T protein:vir:10 136 IKKPRPLSEGLSVPLQLQIIEPDMLASDIP--------------------------------DETLPSGGYVKGGIRFSN 183 (495) T ss_pred EEeecccCCCCccceEEEEechhhcCCCCC--------------------------------CCCCCCCCEEEeceEECC Confidence 33221 11121 2223222211111000 000011112455555554 Q ss_pred CCCceeEEEE--ecCccccccccccccccCc--eEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_011045. 
222 DSGEYIRYEE--VEGMEVQGSDGTYPKEACP--YIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVI 297 (536) Q Consensus 222 ~~~~~~~~~~--v~g~~i~~~~~~~~~~~~P--~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~ 297 (536) .|..-.+|.. -.|.......+ ..+...| -++.-|...+|..=|.+..- .+-.++.|+.+..+.+.++..++... T Consensus 184 ~Gr~vaY~i~~~hpgd~~~~~~~-~~~~rvpA~~vlH~f~~r~gQ~RGis~la-~i~~l~~l~~y~dael~~a~i~A~~~ 261 (495) T protein:vir:10 184 GGKRKAYCFYRNHPAESSLIGDP-VDTVWIKAEHVLHVTVLTVRSDAGAPWFQ-LLLRLNELDQYEDAELVRKKTAALFA 261 (495) T ss_pred CCceEEEEEeecCCCcccccccc-cceeeechhheEeccccCCCcccCcchhH-HHHHHHHhhHHHHHHHHHHHHhhhhe Confidence 4322221111 11111110000 0011123 22333667789999998665 45579999999999999999999888 Q ss_pred eeecc-ccccchh-------------hhccCCCcceecCCccc-ccccccc-cccchhHHHHHHHHHHHHHHHHH-h-hh Q lcl|NC_011045. 298 GLVNP-AGITQPR-------------RLTKAQTGDFVTGRPED-ISFLQLE-KQADFTVAKAVSDAIEARLSFAF-M-LN 359 (536) Q Consensus 298 ~lv~~-~g~~~~~-------------~~~~~~~g~~~~g~~~~-~~~~~~~-~~~~~~~~~~~i~~~~~rI~~af-~-~~ 359 (536) ..+.. ++-.... .....++|.|..-.++. +.+..-. .+++|. .....+...|..++ + +. T Consensus 262 ~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~---~f~~~~lr~iaaglGi~Ye 338 (495) T protein:vir:10 262 AFIQEATADSTGGPTIGQPKRSKGGKRITGLNPGTLQYLQPGQEVKFSNPADVGTTYE---PWLRYQLLSIAKGYGITYE 338 (495) T ss_pred eeeecCCCccccccccCccccccCcccceecCCceeeecCCCCeeeeeCCCCCCCCHH---HHHHHHHHHHHhhcCCCHH Confidence 66642 1111000 11223455554434432 3332211 122333 22333334454444 1 22 Q ss_pred hcccCCCCCCCHHHHHHHHHHHHHHhhhhHH-HHHHHHHHHHHHHHHHHHHhcCCCCCCCCc-----ceEEEEechHHHH Q lcl|NC_011045. 360 SAVQRTGERVTAEEIRYVASELEDTLGGVYS-ILSQELQLPLVRVLLKQLQATQQIPELPKE-----AVEPTISTGLEAI 433 (536) Q Consensus 360 ~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~-rl~~E~l~Pli~r~~~il~~~g~lp~~~~~-----~v~v~~vs~La~a 433 (536) .+. .|-..++=.-+++-..|..+.+-..=. 
.+...|..|+..+.+..+...|.++.++-- .++++++.| T Consensus 339 ~lt-gD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p---- 413 (495) T protein:vir:10 339 MLT-GDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTP---- 413 (495) T ss_pred HHh-cccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccC---- Confidence 222 355555555556555555555544433 345678889999999999999988743210 022222222 Q ss_pred HHHHHHHHHHHHHHHHHhhcchhhhhcCCHHH----HHH----------HHHHHcCCChhhccCCHHHHHHHHHHHHHHH Q lcl|NC_011045. 434 GRGQDLDKLERCVAAWAALAPMRDDPDINLAM----IKL----------RIANAIGIDTSGILLTEEQKQQKMAQQSMQM 499 (536) Q Consensus 434 ~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~----~~~----------~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~ 499 (536) .. ..||+-+ .+. .++...|.|| +|+...++...+.+ T Consensus 414 ---------------------~~--~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~D~-------~~v~~q~a~e~~~~ 463 (495) T protein:vir:10 414 ---------------------RW--EEVDPLKKHLADLGDVRAGFAPISDKQAERGYDM-------EELFDMISDANQLI 463 (495) T ss_pred ---------------------Cc--cccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCH-------HHHHHHHHHHHHHH Confidence 11 0122211 111 1122234433 23322222222111 Q ss_pred HHHHHHHHHHHHHHHhhh-cCcchHHhhhhcCCCCCCC Q lcl|NC_011045. 500 GMDNGAAALAQGMAAQAT-ASPEAMAAAADSVGLQPGI 536 (536) Q Consensus 500 ~~~~~a~~~~~~~~~~~~-~~~~~~~~~~~~~~~q~~~ 536 (536) +.. ++.-... ...++.++. 
....+++- T Consensus 464 ~~~--------Gl~~~~~p~~~~~~~~~--~~~~~~~~ 491 (495) T protein:vir:10 464 DEY--------DLRLDSDPRYVNGSGAE--QKSVMEAA 491 (495) T ss_pred HHc--------CCCCCCCCCcCCCccCC--CCCCCCCC Confidence 111 1100000 000000110 11111222 No 163 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=93.41 E-value=0.0076 Score=32.19 Aligned_cols=352 Identities=13% Similarity=0.153 Sum_probs=146.1 Q ss_pred HHHH--HHHHHhcc-cc------------cCCCCCcc----cc--------------cccccc-------cchHHHHHHH Q lcl|NC_011045. 29 TRAQ--NCAQYTIP-SL------------FPKDSDNA----ST--------------DYVTPW-------QAVGARGLNN 68 (536) Q Consensus 29 ~~w~--e~~~~~~P-~~------------~~~~~~~~----~~--------------~~~~~~-------dst~~~a~~~ 68 (536) .||- +|| |.-| ++ |......+ .. ..-..+ .++--.|++. T Consensus 1 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~ 79 (441) T protein:vir:94 1 MHWYNTDCY-FVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMM 79 (441) T ss_pred CccccCccc-cccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHH Confidence 4442 343 1212 00 10000000 00 000001 1222245555 Q ss_pred HHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcc----ChHHHHHHHHHHHhhCcE Q lcl|NC_011045. 69 LASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SNS----YRVTLFEALKQLVVAGNV 143 (536) Q Consensus 69 Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~ 143 (536) +|+.+.+ .| |++.-.. ... .+.-++..|. 
+-| .+.-.+..+.++..+||| T Consensus 80 Ia~~iA~--lp----~~~~~~~-~~~------------------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna 134 (441) T protein:vir:94 80 IASDLAR--MP----IRVTVNG-QIN------------------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHG 134 (441) T ss_pred HHHhhcc--Cc----eeeecCc-ccc------------------ccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCe Confidence 5555533 23 2321110 000 0111222232 222 345567778888999999 Q ss_pred EEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCC Q lcl|NC_011045. 144 LLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDS 223 (536) Q Consensus 144 ~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~ 223 (536) .+++..+..+.++.+..++.+.+.+..|.+|++--.+ T Consensus 135 y~~i~r~~~G~~~~L~~i~~~~v~v~~d~~g~~~~~~------------------------------------------- 171 (441) T protein:vir:94 135 YIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFH------------------------------------------- 171 (441) T ss_pred EEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEE------------------------------------------- Confidence 9999877666677777777788888888877532111 Q ss_pred CceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccc Q lcl|NC_011045. 224 GEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPA 303 (536) Q Consensus 224 ~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~ 303 (536) ..+++... .....+ ..--++++|+...+| .||.||.+.+...+.......+.......-...|..++.-+ T Consensus 172 ------~~~~~~~~-~~~~~~--~~~dvih~k~~~~dg-~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 241 (441) T protein:vir:94 172 ------QRIDSNGN-NIERNV--KFEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK 241 (441) T ss_pred ------EEeccCCc-eeEEEE--ccccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC Confidence 11111000 000000 111245566655555 79999999999888888888888888888888888776544 Q ss_pred ccc-chhhh---c----cCCC-----cceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCC Q lcl|NC_011045. 
304 GIT-QPRRL---T----KAQT-----GDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGER 368 (536) Q Consensus 304 g~~-~~~~~---~----~~~~-----g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~r 368 (536) +.. +.+.. . ..-. |.+. --.++....++.. +.+.+. .+.....+..|-++|-.-. +....... T Consensus 242 ~~~~~~e~~e~~r~~~~~~~~G~~nag~~~-vl~~G~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 319 (441) T protein:vir:94 242 GVLDNKKARDRAREEFHKSFSGTKQAGKVV-VLDESMTFDQLEVDTEVLKL-IRENKSSTREIAGVFGIPLHKFGIETAN 319 (441) T ss_pred CCCCCHHHHHHHHHHHHHHhcCccccCcce-ecCCCceEEEccCChhHHHH-HHHHHHhHHHHHHHhCCCHHHcCCCCCC Confidence 443 33321 1 1001 1111 1112223333333 223443 3444556677888885432 22122222 Q ss_pred CCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHH---HHHHHHHHHHHH- Q lcl|NC_011045. 369 VTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEA---IGRGQDLDKLER- 444 (536) Q Consensus 369 ~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~---a~r~~~~~~l~~- 444 (536) .+.+|. .......|-|.+.++++||-.-| +++-.+..+++.. +.|-. ..|..-.+.+.+ T Consensus 320 ~s~~q~---~~~~~~tl~P~~~~ie~eln~kl-------------~~~~~~~~~~fd~-~~llr~D~~~~~~~~~~~i~~ 382 (441) T protein:vir:94 320 MSITDA---NLDYLSTLKPYITCVCAELNFKF-------------NDEYVNREFKFDT-TEIRVVDEKTQAEIDKINIDS 382 (441) T ss_pred ccHHHH---HHHHHHHHHHHHHHHHHHHhhhc-------------cccccCceEEeec-hhhhccCHHHHHHHHHHHHhC Confidence 222322 12233345555555555544332 2222222233322 22211 112222222221 Q ss_pred -H--HHHH---Hhhcch-hhh--------hcCCHHHHHHHH----------HHHcCCChhh Q lcl|NC_011045. 445 -C--VAAW---AALAPM-RDD--------PDINLAMIKLRI----------ANAIGIDTSG 480 (536) Q Consensus 445 -~--~~~~---~~~~p~-~~~--------~~id~d~~~~~~----------a~~~Gv~p~~ 480 (536) + .+.+ -.+.|. -.+ .++..+.+ +.+ .-.-| +-.. 
T Consensus 383 G~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~-~~~~~~~~~~~~~~~kgG-e~~e 441 (441) T protein:vir:94 383 GKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELV-DEYQMNKSRATDKKLKGG-EENE 441 (441) T ss_pred CCcCHHHHHHHhCCCCCCCCCcceEeecccccccccc-cccccccccccccccCCC-CCCC Confidence 1 1111 112221 000 00111110 000 00112 1222 No 164 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=93.41 E-value=0.0076 Score=32.19 Aligned_cols=352 Identities=13% Similarity=0.153 Sum_probs=146.1 Q ss_pred HHHH--HHHHHhcc-cc------------cCCCCCcc----cc--------------cccccc-------cchHHHHHHH Q lcl|NC_011045. 29 TRAQ--NCAQYTIP-SL------------FPKDSDNA----ST--------------DYVTPW-------QAVGARGLNN 68 (536) Q Consensus 29 ~~w~--e~~~~~~P-~~------------~~~~~~~~----~~--------------~~~~~~-------dst~~~a~~~ 68 (536) .||- +|| |.-| ++ |......+ .. ..-..+ .++--.|++. T Consensus 1 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~ 79 (441) T protein:vir:79 1 MHWYNTDCY-FVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMM 79 (441) T ss_pred CccccCccc-cccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHH Confidence 4442 343 1212 00 10000000 00 000001 1222245555 Q ss_pred HHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcc----ChHHHHHHHHHHHhhCcE Q lcl|NC_011045. 69 LASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SNS----YRVTLFEALKQLVVAGNV 143 (536) Q Consensus 69 Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~ 143 (536) +|+.+.+ .| |++.-.. ... .+.-++..|. 
+-| .+.-.+..+.++..+||| T Consensus 80 Ia~~iA~--lp----~~~~~~~-~~~------------------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna 134 (441) T protein:vir:79 80 IASDLAR--MP----IRVTVNG-QIN------------------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHG 134 (441) T ss_pred HHHhhcc--Cc----eeeecCc-ccc------------------ccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCe Confidence 5555533 23 2321110 000 0111222232 222 345567778888999999 Q ss_pred EEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCC Q lcl|NC_011045. 144 LLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDS 223 (536) Q Consensus 144 ~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~ 223 (536) .+++..+..+.++.+..++.+.+.+..|.+|++--.+ T Consensus 135 y~~i~r~~~G~~~~L~~i~~~~v~v~~d~~g~~~~~~------------------------------------------- 171 (441) T protein:vir:79 135 YIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFH------------------------------------------- 171 (441) T ss_pred EEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEE------------------------------------------- Confidence 9999877666677777777788888888877532111 Q ss_pred CceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccc Q lcl|NC_011045. 224 GEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPA 303 (536) Q Consensus 224 ~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~ 303 (536) ..+++... .....+ ..--++++|+...+| .||.||.+.+...+.......+.......-...|..++.-+ T Consensus 172 ------~~~~~~~~-~~~~~~--~~~dvih~k~~~~dg-~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 241 (441) T protein:vir:79 172 ------QRIDSNGN-NIERNV--KFEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK 241 (441) T ss_pred ------EEeccCCc-eeEEEE--ccccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC Confidence 11111000 000000 111245566655555 79999999999888888888888888888888888776544 Q ss_pred ccc-chhhh---c----cCCC-----cceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCC Q lcl|NC_011045. 
304 GIT-QPRRL---T----KAQT-----GDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGER 368 (536) Q Consensus 304 g~~-~~~~~---~----~~~~-----g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~r 368 (536) +.. +.+.. . ..-. |.+. --.++....++.. +.+.+. .+.....+..|-++|-.-. +....... T Consensus 242 ~~~~~~e~~e~~r~~~~~~~~G~~nag~~~-vl~~G~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 319 (441) T protein:vir:79 242 GVLDNKKARDRAREEFHKSFSGTKQAGKVV-VLDESMTFDQLEVDTEVLKL-IRENKSSTREIAGVFGIPLHKFGIETAN 319 (441) T ss_pred CCCCCHHHHHHHHHHHHHHhcCccccCcce-ecCCCceEEEccCChhHHHH-HHHHHHhHHHHHHHhCCCHHHcCCCCCC Confidence 443 33321 1 1001 1111 1112223333333 223443 3444556677888885432 22122222 Q ss_pred CCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHH---HHHHHHHHHHHH- Q lcl|NC_011045. 369 VTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEA---IGRGQDLDKLER- 444 (536) Q Consensus 369 ~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~---a~r~~~~~~l~~- 444 (536) .+.+|. .......|-|.+.++++||-.-| +++-.+..+++.. +.|-. ..|..-.+.+.+ T Consensus 320 ~s~~q~---~~~~~~tl~P~~~~ie~eln~kl-------------~~~~~~~~~~fd~-~~llr~D~~~~~~~~~~~i~~ 382 (441) T protein:vir:79 320 MSITDA---NLDYLSTLKPYITCVCAELNFKF-------------NDEYVNREFKFDT-TEIRVVDEKTQAEIDKINIDS 382 (441) T ss_pred ccHHHH---HHHHHHHHHHHHHHHHHHHhhhc-------------cccccCceEEeec-hhhhccCHHHHHHHHHHHHhC Confidence 222322 12233345555555555544332 2222222233322 22211 112222222221 Q ss_pred -H--HHHH---Hhhcch-hhh--------hcCCHHHHHHHH----------HHHcCCChhh Q lcl|NC_011045. 445 -C--VAAW---AALAPM-RDD--------PDINLAMIKLRI----------ANAIGIDTSG 480 (536) Q Consensus 445 -~--~~~~---~~~~p~-~~~--------~~id~d~~~~~~----------a~~~Gv~p~~ 480 (536) + .+.+ -.+.|. -.+ .++..+.+ +.+ .-.-| +-.. 
T Consensus 383 G~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~-~~~~~~~~~~~~~~~kgG-e~~e 441 (441) T protein:vir:79 383 GKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELV-DEYQMNKSRATDKKLKGG-EENE 441 (441) T ss_pred CCcCHHHHHHHhCCCCCCCCCcceEeecccccccccc-cccccccccccccccCCC-CCCC Confidence 1 1111 112221 000 00111110 000 00112 1222 No 165 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=93.03 E-value=0.0091 Score=31.78 Aligned_cols=376 Identities=11% Similarity=0.056 Sum_probs=144.4 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccc---C---CCCCccccc-c-cccc-cchHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLF---P---KDSDNASTD-Y-VTPW-QAVGARGLNNLAS 71 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~---~---~~~~~~~~~-~-~~~~-dst~~~a~~~Laa 71 (536) |-+..-.|+-..=..-|..+++. |...+ -.-|... . ..+..+... . .... .++--.|++.+|+ T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~---~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~ 72 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLQSW---FVGGR-----LVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLIST 72 (424) T ss_pred CCCCcceEeecCCCchHHHHHhh---hcccc-----cccccccccccccccccccccccccHHHhhccHHHHHHHHHHHH Confidence 66644444332222333333321 00000 0011100 0 000000000 0 0111 1222245555555 Q ss_pred HHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcc----ChHHHHHHHHHHHhhCcEEEE Q lcl|NC_011045. 
72 KLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SNS----YRVTLFEALKQLVVAGNVLLY 146 (536) Q Consensus 72 ~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~ 146 (536) .+ +.+ ||--.......-.+ .+ ..+.-+...|+ +-| .+.=....+.++..+||+.++ T Consensus 73 ~i-A~l----p~~~~~~~~~~~~~---------~~-----~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~ 133 (424) T protein:vir:18 73 LT-ACL----PLDVFETDQNDNRK---------KV-----DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYAL 133 (424) T ss_pred hh-ccC----ceEEEEeecCCcee---------ee-----ccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEE Confidence 55 222 33221111100000 00 00112233333 223 444566777899999999999 Q ss_pred EecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCce Q lcl|NC_011045. 147 LPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEY 226 (536) Q Consensus 147 ~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~ 226 (536) +..+..+.++.+..++...+-+..+. |. ++ |.. T Consensus 134 i~r~~~G~~~~L~pl~~~~V~v~~~~-~~---~~---------------------------------y~~---------- 166 (424) T protein:vir:18 134 VDRNSAGDVISLLPLQSANMDVKLVG-KK---VV---------------------------------YRY---------- 166 (424) T ss_pred EEECCCCcEEEEEEecCcceEEEEcC-Ce---EE---------------------------------EEE---------- Confidence 87666555544444444444333321 11 11 100 Q ss_pred eEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec-cccc Q lcl|NC_011045. 227 IRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN-PAGI 305 (536) Q Consensus 227 ~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~-~~g~ 305 (536) .++|... .+ ..--.+++|....+| .||.||...+...+.......+.......-...|..++. +++. 
T Consensus 167 ----~~~g~~~-----~~--~~~eIih~r~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~ 234 (424) T protein:vir:18 167 ----QRDSEYA-----DF--SQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV 234 (424) T ss_pred ----EeCCeEE-----Ee--ccccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcC Confidence 1111100 00 011234555443344 899999999999998888888888888888888886664 3444 Q ss_pred cchhhhc---------cCC--CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCC--H Q lcl|NC_011045. 306 TQPRRLT---------KAQ--TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVT--A 371 (536) Q Consensus 306 ~~~~~~~---------~~~--~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~T--A 371 (536) +..+... -++ .|.+ .--.++....++.. +.+.+. .+..+..+..|-++|-......-+.++-| . T Consensus 235 l~~e~~~~~~~~~~~~~~g~nag~~-~vl~~g~~~~~l~~~~~d~q~-le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~ 312 (424) T protein:vir:18 235 LTEQQRSQVEENFKEIAGGPVKKRL-WILEAGFSTSAIGVTPQDAEM-MASRKFQVSELARFFGVPPHLVGDVEKSTSWG 312 (424) T ss_pred CCHHHHHHHHHHHHHHhCCcccCCc-eeccCCceEEecCCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCccccc Confidence 4443211 011 1111 11122223334332 344554 34555666778888844321111111111 1 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEE-echHH---HHHHHHHHHHHHH--H Q lcl|NC_011045. 372 EEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTI-STGLE---AIGRGQDLDKLER--C 445 (536) Q Consensus 372 tEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~-vs~La---~a~r~~~~~~l~~--~ 445 (536) .-+.+.... 
+...-+.|++.++-..+.+ .++|+.....+.++| ++.|- ...|..-...+.+ + T Consensus 313 sn~eq~~~~-----------f~~~tl~P~~~~ie~~l~~-~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~ 380 (424) T protein:vir:18 313 SGIEQQNLG-----------FLQYTLQPYISRWENSIQR-WLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGL 380 (424) T ss_pred ccHHHHHHH-----------HHHHHHHHHHHHHHHHHHh-hcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCC Confidence 222222221 1122344444444443322 234433222333443 22221 1222222222221 1 Q ss_pred H--HHH---Hhhcch-hhhh------cCCHHHHHHHH-HHHcCC Q lcl|NC_011045. 446 V--AAW---AALAPM-RDDP------DINLAMIKLRI-ANAIGI 476 (536) Q Consensus 446 ~--~~~---~~~~p~-~~~~------~id~d~~~~~~-a~~~Gv 476 (536) + +.+ -.+.|. -.|. ++-.+.+-..- -...|. T Consensus 381 ~T~NE~R~~~gl~pi~gGD~~~~~~n~~~l~~~~~~~~p~~~ga 424 (424) T protein:vir:18 381 RTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred cCHHHHHHHhCCCCCCCcCeeeeccCccchHhhhccCCCccCCC Confidence 1 111 112231 1111 11122221110 011232 No 166 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=92.97 E-value=0.0093 Score=31.73 Aligned_cols=459 Identities=9% Similarity=-0.011 Sum_probs=169.0 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHH-----HHHHHhcccccCCCCCccc-cc-cc-----ccccch---HHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQ-----NCAQYTIPSLFPKDSDNAS-TD-YV-----TPWQAV---GARG 65 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~-----e~~~~~~P~~~~~~~~~~~-~~-~~-----~~~dst---~~~a 65 (536) |+++.. +....+...--...+...+....+. -+..+..+.....+.+... .+ .. ..|... 
+..+ T Consensus 66 ~~~~~~-~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f~gyql~alY~ 144 (862) T protein:vir:99 66 VEISDS-VNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGFIGHQACALIA 144 (862) T ss_pred cccccc-ccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccccccchhccccccccCcccHHHHHHHH Confidence 444332 2221111110111111111111111 1333443332222211110 00 00 011111 1112 Q ss_pred HHHHHHHHHHhhcCC----CcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhC Q lcl|NC_011045. 66 LNNLASKLMLALFPM----QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAG 141 (536) Q Consensus 66 ~~~Laa~l~~~ltP~----~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G 141 (536) .+-|+.++... |+ +.|+.|...+...+. +. ++ .+.+.+.+.+.+....+.++++.--.|| T Consensus 145 ~~~larkiVd~--pAeDatR~g~~I~~~~d~~e~---~~-------e~----~~~ie~~~~rL~v~~~l~eair~~RLyG 208 (862) T protein:vir:99 145 QHWLVDKACSL--AGEDAIRNGWHLKSLGEGEEI---DE-------ES----LEKFKAIDVEFKVKENLIEFNRFKNVFG 208 (862) T ss_pred hCchhhhhhhh--hhHHHhhCCceEeecCccccc---CH-------HH----HHHHHHHHHHhhHHHHHHHHHHhccccc Confidence 22233333222 32 479999864321110 00 11 1223344445678888999999888899 Q ss_pred cEEEEEe--cCCCCceeeEEEEecceEEEeeCCCCCeEEEE--EeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEE Q lcl|NC_011045. 142 NVLLYLP--EPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMV--TRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHI 217 (536) Q Consensus 142 ~~~l~~~--~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~--r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v 217 (536) .+++++. ..+... + .-||.- ..=..|.+-.|. -++..+. .++..+-.+.. ..+.. .-+.|. 
| T Consensus 209 ga~ililv~~~D~~~---L-sqPLn~---e~I~kG~lkgl~vlDp~w~~p-~~v~~~~~Dp~----sp~yG-kP~~y~-I 274 (862) T protein:vir:99 209 IRVAIFVVDSEDPDY---Y-EKPFNP---DGITPGSYRGISQIDPYWMMP-MLTAESTADPS----SQFFY-EPEFWI-I 274 (862) T ss_pred ceEEEEEecCcCchh---h-hcCcCc---ccccccceeEEEEechhhhcc-ccccccccccc----ccccC-Cceeee-e Confidence 8776543 222210 1 122210 000122222211 1111110 00001111100 00111 112221 1 Q ss_pred EecCCCCceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_011045. 218 YLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVI 297 (536) Q Consensus 218 ~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~ 297 (536) .+. .|.-.+++...| +..|+ +.+....-||+|..+.++..++..........+.+..+.-.. T Consensus 275 ----~g~------~IH~SRliif~g----~~vpd----~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~v 336 (862) T protein:vir:99 275 ----SGQ------KYHRSHLIIARG----PQPAD----ILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTTA 336 (862) T ss_pred ----cCe------eeccceeEEecC----CCchh----hhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccce Confidence 010 111122222222 12233 222333357999999999999998888888877777766665 Q ss_pred eeecc-------ccccchhhhccC---CCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhh--hhcccC- Q lcl|NC_011045. 298 GLVNP-------AGITQPRRLTKA---QTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML--NSAVQR- 364 (536) Q Consensus 298 ~lv~~-------~g~~~~~~~~~~---~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~--~~~~~~- 364 (536) +.++. +++.....+... ..|.++-+..++...+. .+|.-+...+....+.|.-++=. .-+... 
T Consensus 337 ~ktd~l~~l~~ed~l~~r~~~~~~~rdN~Gi~liD~eEe~e~ls----~slSGL~dll~~~~q~IAaas~IP~tiLfGqs 412 (862) T protein:vir:99 337 IHTDTAKAIANEDKFIQRLMFWVRYRDNHAVKVLGTDETMEQFD----TSLADFDAVIMGQYQLVASIAKTPATKLLGTA 412 (862) T ss_pred eechhHhhhccHHHHHHHHHHHHhccCcceeEEecCCCceeEEe----cccCChHHHHHHHHHHHHhhhCCCceeecccC Confidence 54432 222222222121 11233333334433332 24444556666667777766521 111111 Q ss_pred -CCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHH Q lcl|NC_011045. 365 -TGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLE 443 (536) Q Consensus 365 -~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~ 443 (536) .+-.=|.++= ....---+..++...+.|+++|++.++....- ++ .++.++| .||.+....+.++... T Consensus 413 paGlnATGE~D-------~~nYyD~I~s~QE~~L~P~LerL~~li~~~lg---~~-~d~~ieF-npL~~~sekEkAEi~k 480 (862) T protein:vir:99 413 PKGFNSTGEFE-------TISYHEELESIQEHVYMPFLQRHYLISRLSLG---IQ-HEIDVVM-EPVASMTAQQQADLNK 480 (862) T ss_pred cccccCchHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---CC-CcceEEe-CCCCCCCHHHHHHHHH Confidence 2222244321 12233334444566789999999988765421 23 3578887 4665544333333222 Q ss_pred HHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHH-----------HHHHHHHHHHH---HHHHHHHHH Q lcl|NC_011045. 444 RCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQ-----------QKMAQQSMQMG---MDNGAAALA 509 (536) Q Consensus 444 ~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~-----------~~~~q~~~q~~---~~~~a~~~~ 509 (536) ...+.+..+.. .-.|+.+++.+.++.. +..+..=+ +++++. +.......+.+ -..++++.+ T Consensus 481 k~Aea~~~lv~---sGvispdEvR~~L~~~-~~~g~~~l-~ded~E~d~~~~~e~~~~~e~~g~a~~~ap~de~~aga~~ 555 (862) T protein:vir:99 481 TKAEGGKVLID---GGVISPDEERNRIRDD-KRSGYNRL-TKEDAEETPGASPENLAAYQKAGAAQETASAKETQAGAAV 555 (862) T ss_pred HHHHHHHHHHh---cCCCCHHHHHHHHHhc-CCcCCCCC-CcccccccCCCCcccccccccCCcccccccccccccccCC Confidence 22222222211 1247888888877642 22110001 111111 10000000000 000000000 Q ss_pred HHHHHhhhcCcchHHhhhhcCCCCCCC Q lcl|NC_011045. 
510 QGMAAQATASPEAMAAAADSVGLQPGI 536 (536) Q Consensus 510 ~~~~~~~~~~~~~~~~~~~~~~~q~~~ 536 (536) .. .++... ..+.++.-.||- T Consensus 556 ~~--~e~d~~-----~~p~~~~~~~g~ 575 (862) T protein:vir:99 556 TT--AEGDQP-----NVQMVPSMKPGQ 575 (862) T ss_pred cc--ccCCcc-----cccccCCCCCCC Confidence 00 000000 000011111111 No 167 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=92.84 E-value=0.0098 Score=31.61 Aligned_cols=309 Identities=11% Similarity=0.038 Sum_probs=118.6 Q ss_pred HHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcc----ChHHHHHHHHHHHhhCcEEEEEe Q lcl|NC_011045. 74 MLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SNS----YRVTLFEALKQLVVAGNVLLYLP 148 (536) Q Consensus 74 ~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~~~ 148 (536) ++.| ||.-.. .+. .++ .-+...|. +-| .+.=+...+.++.++|||++++. T Consensus 1 ia~l----p~~~~~-~~~-------------~~~-------~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~ 55 (348) T protein:vir:93 1 MASL----PLKMYE-DYK-------------VVN-------TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIE 55 (348) T ss_pred Cccc----ceEeEe-cCc-------------Ccc-------cHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEE Confidence 1111 332111 000 011 11223333 223 33445677788889999999988 Q ss_pred cCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeE Q lcl|NC_011045. 149 EPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIR 228 (536) Q Consensus 149 ~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~ 228 (536) .+..+.+..+..+|.+.+.+..+.+|.... 
| + + T Consensus 56 r~~~G~~~~L~~l~~~~v~~~~~~~~~~~~-y-~----------------------------------~----------- 88 (348) T protein:vir:93 56 RDIYHQPSKLFLLNPDVVEMLIENQSRELY-Y-S----------------------------------I----------- 88 (348) T ss_pred ECCCCcEEEEEEEcCCceEEEEeCCCcEEE-E-E----------------------------------E----------- Confidence 766666655555555555555554332111 0 0 0 Q ss_pred EEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccch Q lcl|NC_011045. 229 YEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQP 308 (536) Q Consensus 229 ~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~ 308 (536) ....|..+ .+ ...-++.+|-....+..||.||.+.+...+...+...+..+.. ....|.+++..++..++ T Consensus 89 -~~~~g~~~-----~~--~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~--~~~~~~~i~~~~~~l~~ 158 (348) T protein:vir:93 89 -HAATGNKL-----IV--HNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTE--MQKPDSFMLKYGSNVST 158 (348) T ss_pred -EcCCCeEE-----EE--ccccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHh--cCCCceeEEecCCCCCH Confidence 00011110 00 0111344443334567899999998887777766666554332 22333455555555555 Q ss_pred hhhcc---------CCCcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-ccc--CCCCCCCHHHHH Q lcl|NC_011045. 309 RRLTK---------AQTGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQ--RTGERVTAEEIR 375 (536) Q Consensus 309 ~~~~~---------~~~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~--~~~~r~TAtEi~ 375 (536) +.... ...|.+.. -.++....++.. +.+.+ ..+..+.....|-++|-... ++. 
.++..-++++.+ T Consensus 159 e~~~~~~~~~~~~~~n~~~~~v-l~~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~ 236 (348) T protein:vir:93 159 EKRQQVLEDFKQYYEENGGILF-QEPGVEIEPLPKKYVSED-IVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELN 236 (348) T ss_pred HHHHHHHHHHHHHhhcCCCeee-cCCCceEEEcCCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH Confidence 32110 11222111 122223334432 23344 22344455667777775432 111 112222333321 Q ss_pred HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCC-CcceEEEE-echHHH---HHHHHHHHHHHHH----H Q lcl|NC_011045. 376 YVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELP-KEAVEPTI-STGLEA---IGRGQDLDKLERC----V 446 (536) Q Consensus 376 ~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~-~~~v~v~~-vs~La~---a~r~~~~~~l~~~----~ 446 (536) ..=....|.|...++.++|-.- ++|+.. .....++| ++.|-+ ..|+.-+..+.+- . T Consensus 237 --~~~~~~~l~P~~~~ie~~l~~~-------------l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~ 301 (348) T protein:vir:93 237 --RFYLQHTLLPIVKQYEEEFNRK-------------LLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTI 301 (348) T ss_pred --HHHHHHHHHHHHHHHHHHHHHh-------------hCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCH Confidence 1222334445555555444332 333221 11223333 222211 1222222222110 0 Q ss_pred HHH---Hhhcch-hhhh--------cCCHHHHHHHHHHHcCCChhhccCC Q lcl|NC_011045. 447 AAW---AALAPM-RDDP--------DINLAMIKLRIANAIGIDTSGILLT 484 (536) Q Consensus 447 ~~~---~~~~p~-~~~~--------~id~d~~~~~~a~~~Gv~p~~i~rs 484 (536) +.+ -.+.|. -.|. 
.+|.....+.-..+=+..- =.+ T Consensus 302 NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~gg~~n~---~~~ 348 (348) T protein:vir:93 302 NDIREWEDLPPVEGGDKPLISGDLYPIDTPLELRKSLKGGDKNV---NES 348 (348) T ss_pred HHHHHHhCCCCCCCcCeEeecccccccccchhhcccccCCCCCc---CCC Confidence 000 011110 0000 1222211111111111100 011 No 168 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=92.60 E-value=0.011 Score=31.38 Aligned_cols=379 Identities=11% Similarity=0.024 Sum_probs=143.1 Q ss_pred CCCccccc---cHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccccc-c-cccc-cchHHHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGL---AEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTD-Y-VTPW-QAVGARGLNNLASKLM 74 (536) Q Consensus 1 Ma~~~~~~---~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~-~-~~~~-dst~~~a~~~Laa~l~ 74 (536) |-+..--+ .+...-+++..+-..+..-. -+....+-|-. ..+..+... . .+.. .++--.|++.+|+.+ T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~f~~~~~~~---~~~~~~~~~~~--~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~i- 74 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVT---PNQGSQTGPVS--AHGYLGDSSINDERILQISTVWRCVSLISTLT- 74 (424) T ss_pred CCCCccccccCCCCchHHHHHhhcccccccc---ccchhhccccc--cccccccccccHHHhhccHHHHHHHHHHHHhh- Confidence 54422111 12222222222111111000 01111122211 000000000 0 0111 223334555555555 Q ss_pred HhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHh-c----cChHHHHHHHHHHHhhCcEEEEEec Q lcl|NC_011045. 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIES-N----SYRVTLFEALKQLVVAGNVLLYLPE 149 (536) Q Consensus 75 ~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~-s----nf~~~~~~~~~dl~~~G~~~l~~~~ 149 (536) +.+ ||--.......-.+ .+ ..+.-+...|+. - +.+.=....+.++..+|||.+++.. 
T Consensus 75 A~l----p~~vy~~~~~~~~~---------~~-----~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r 136 (424) T protein:vir:18 75 ACL----PLDVFETDQNDNRK---------KV-----DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR 136 (424) T ss_pred ccC----ceEEEEeccCCcee---------ee-----ccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEE Confidence 222 33211111100000 00 001122333332 1 3444466678899999999999876 Q ss_pred CCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEE Q lcl|NC_011045. 150 PEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRY 229 (536) Q Consensus 150 ~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~ 229 (536) +..+.++.+..++.....+..+. | .++.++ T Consensus 137 ~~~G~~~~L~~l~~~~v~v~~~~-~---~~~y~~---------------------------------------------- 166 (424) T protein:vir:18 137 NSAGDVISLLPLQSANMDVKLVG-K---KVVYRY---------------------------------------------- 166 (424) T ss_pred CCCCcEEEEEEecCcceEEEEcC-C---eEEEEE---------------------------------------------- Confidence 66555544443443333333221 1 111111 Q ss_pred EEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecc-ccccch Q lcl|NC_011045. 230 EEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNP-AGITQP 308 (536) Q Consensus 230 ~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~-~g~~~~ 308 (536) .++|... . |..--++++|+...+| .||.||...+...+.......+.......-...|..++.- ++.+.. T Consensus 167 -~~~g~~~-----~--~~~~eVihir~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~ 237 (424) T protein:vir:18 167 -QRDSEYA-----D--FSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE 237 (424) T ss_pred -EeCCeEE-----E--eccccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCH Confidence 0111110 0 0011245556554444 8999999999999988888888888888888888766653 444443 Q ss_pred hhhc---------cCCC--cceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-ccc-CCCCCCCHHHH Q lcl|NC_011045. 
309 RRLT---------KAQT--GDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQ-RTGERVTAEEI 374 (536) Q Consensus 309 ~~~~---------~~~~--g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~-~~~~r~TAtEi 374 (536) +... -++. |. +..-.++....++.. +.+.+. .+..+..+..|-++|-... ++. .+...-+..-+ T Consensus 238 e~~~~~~~~~~~~~~~~nag~-~~vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~ 315 (424) T protein:vir:18 238 QQRSQVEENFKEIAGGPVKKR-LWILEAGFSTSAIGVTPQDAEM-MASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGI 315 (424) T ss_pred HHHHHHHHHHHHHhCCcccCC-ceeccCCceEEecCCChhHHHH-HHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccH Confidence 3211 1111 11 111122333344432 234453 3455566778888884332 111 11111111222 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEE-echHHH---HHHHHHHHHHHHH----H Q lcl|NC_011045. 375 RYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTI-STGLEA---IGRGQDLDKLERC----V 446 (536) Q Consensus 375 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~-vs~La~---a~r~~~~~~l~~~----~ 446 (536) .+.... +...-+.|++.++-..+.+ .++++-....+.++| ++.|-. ..|..-...+.+. . T Consensus 316 eq~~~~-----------f~~~tl~P~~~~ie~~ln~-~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~ 383 (424) T protein:vir:18 316 EQQNLG-----------FLQYTLQPYISRWENSIQR-WLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTI 383 (424) T ss_pred HHHHHH-----------HHHHHHHHHHHHHHHHHHh-hcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCH Confidence 222221 2223344544444333322 234332222344444 233311 2222222222211 1 Q ss_pred HHH---Hhhcch-hhhhc---CC---HHHHHHH-HHHHcCC Q lcl|NC_011045. 447 AAW---AALAPM-RDDPD---IN---LAMIKLR-IANAIGI 476 (536) Q Consensus 447 ~~~---~~~~p~-~~~~~---id---~d~~~~~-~a~~~Gv 476 (536) +.+ -.+.|. -.|.. .| .+.+-+. --...|. 
T Consensus 384 NE~R~~~gl~pi~ggD~~~~~~n~~~l~~~~~~~~~~~n~a 424 (424) T protein:vir:18 384 NEMRRTDNMPPLPGGDVAMRQAQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred HHHHHHhCCCCCCCcCeeeeccCccchhhhhccCCccccCC Confidence 111 112221 11110 11 1211111 0111233 No 169 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=92.33 E-value=0.012 Score=31.15 Aligned_cols=360 Identities=11% Similarity=0.036 Sum_probs=143.8 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccccc-ccc-cccchHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTD-YVT-PWQAVGARGLNNLASKLMLALF 78 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~-~~~-~~dst~~~a~~~Laa~l~~~lt 78 (536) |-=+|.... .....+..-. .|-. +-++...+..+..- ..+ +=.++--.|++.+|+.+- .+ T Consensus 1 ~~~~r~~~~---------~~~~~~~~~~-~~~~------~~~g~~~s~~~~~vt~~~al~~~~v~~~v~~ia~~iA-~l- 62 (419) T protein:vir:14 1 MFFSRQLLS---------NLGQTQMSAG-GWVS------ALLGSSRSDSGQVVTPASALALTVLQNCVTLLAESIA-QL- 62 (419) T ss_pred Ccccccccc---------cccccccCcc-hhhH------HhhcCCCccCCcccchHHhhccHHHHHHHHHHHHhhc-cC- Confidence 322221110 0000000000 1111 10111111100000 011 123344455566666553 22 Q ss_pred CCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----cChHHHHHHHHHHHhhCcEEEEEecCCCC Q lcl|NC_011045. 79 PMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SN----SYRVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) Q Consensus 79 P~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~~~~~~~ 153 (536) ||.-....+..-.+ + .+.-+...|. 
+- +.+.-+...+.++..+||+++|+..+..+ T Consensus 63 ---p~~~~~~~~~~~~~----------~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G 123 (419) T protein:vir:14 63 ---PIELYERSGEDRKP----------A------TDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDG 123 (419) T ss_pred ---ceEEEEecCCcccc----------c------cccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC Confidence 33221111111000 0 0111222222 22 34444566688888999999999877666 Q ss_pred ceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEec Q lcl|NC_011045. 154 NYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVE 233 (536) Q Consensus 154 ~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~ 233 (536) .++.+..++.+.+.+..+.+|++ +|+ ++ T Consensus 124 ~~~~l~pl~~~~v~v~~~~~~~~--~y~---~~----------------------------------------------- 151 (419) T protein:vir:14 124 VIQGLYPLDNEAVTVMRGSDLKP--VYR---VR----------------------------------------------- 151 (419) T ss_pred cEEEEEEecCceEEEEECCCceE--EEE---Ec----------------------------------------------- Confidence 66666666667777777766532 111 00 Q ss_pred CccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccc----chh Q lcl|NC_011045. 234 GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGIT----QPR 309 (536) Q Consensus 234 g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~----~~~ 309 (536) +...... ++ +++.|+...+| .||.||..-+...+.......+.......-...|..++.-++.. +.+ T Consensus 152 ~~~~~~~------~~--i~h~~~~~~dg-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~ 222 (419) T protein:vir:14 152 GSDPMPQ------RL--VHHVRWMSING-YTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQA 222 (419) T ss_pred cCcccch------hh--eeEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHH Confidence 0000000 00 24444444444 89999999999999999999988888888888888776533322 222 Q ss_pred h---hcc------CC--C-cceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccC---CCCCCCHHH Q lcl|NC_011045. 
310 R---LTK------AQ--T-GDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQR---TGERVTAEE 373 (536) Q Consensus 310 ~---~~~------~~--~-g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~---~~~r~TAtE 373 (536) . +.. .+ + |.+.. -.++....++.. +.+.+.+ +..+..+..|-++|-......- .+..-++++ T Consensus 223 ~~~~~~~~~~~~~~g~~nag~~~v-l~~g~~~~~l~~~~~d~q~~-e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~ 300 (419) T protein:vir:14 223 SVDRITDGWNAKFGGSGNAKKVAL-LQEGMTFRPLSMTNVDAALI-DALRLSALDIARIYKIPAHMVNELERATFSNIEH 300 (419) T ss_pred HHHHHHHHHHHHhcCccccCCcee-cCCCceEEEccCChhhHHHH-HHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHH Confidence 1 111 01 1 11111 122223334432 3344433 3344556778788744321111 122212222 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEE-echHHH---HHHHHHHHHHHH--H-- Q lcl|NC_011045. 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTI-STGLEA---IGRGQDLDKLER--C-- 445 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~-vs~La~---a~r~~~~~~l~~--~-- 445 (536) .. . .+...-|.|++.++-..+.+. ++++-......++| ++.|-+ ..|....+.+.+ + T Consensus 301 ~~---~-----------~f~~~~L~P~~~~ie~~l~~k-ll~~~~~~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T 365 (419) T protein:vir:14 301 QS---L-----------QFVIYTLLPWVKRHEQAKTRD-LLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLS 365 (419) T ss_pred HH---H-----------HHHHHHHHHHHHHHHHHHhhh-ccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcC Confidence 21 1 133333555555444444332 33332222344444 222211 122222222211 0 Q ss_pred HHHHH---hhcch----------------h-----hhhcCCHHHHHHHHHHHcC Q lcl|NC_011045. 446 VAAWA---ALAPM----------------R-----DDPDINLAMIKLRIANAIG 475 (536) Q Consensus 446 ~~~~~---~~~p~----------------~-----~~~~id~d~~~~~~a~~~G 475 (536) .+.+- .+.|. . ..+.=..+.-.+++.+.+. 
T Consensus 366 ~NE~R~~~gl~p~~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~ 419 (419) T protein:vir:14 366 INDIRRLENMPPVKGGDIYLSPMNMVDASKPQQLPVGKSEPTKAAIDEIGRILS 419 (419) T ss_pred HHHHHHHhCCCCCCCcCeeeeccccccccccccccCCCCCCccccccchhcccC Confidence 00000 01010 0 0000011223333333332 No 170 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=92.14 E-value=0.013 Score=30.99 Aligned_cols=428 Identities=9% Similarity=0.065 Sum_probs=148.9 Q ss_pred CCC--------ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcc--cccccccccchHHHHHHHHH Q lcl|NC_011045. 1 MAE--------KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNA--STDYVTPWQAVGARGLNNLA 70 (536) Q Consensus 1 Ma~--------~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~--~~~~~~~~dst~~~a~~~La 70 (536) |-+ +..+-+ +-|..+-+ .-+.++ .-|+...+.-+-. -+.+.. ..+...|+++.+ T Consensus 42 ~~~~~~~~~~~~~~a~~-----~~~~~~~~-------~~~~~~--~~~~~~~~~~~l~~~l~~~~~--n~i~~~~I~t~~ 105 (563) T protein:vir:99 42 EYQDLTKSLYGQQQAYA-----EPFIEMMD-------TNPEFR--DKRSYMKNEHNLHDVLKKFGN--NPILNAIILTRS 105 (563) T ss_pred hHHHHHhhhccCCCcch-----hhhHhhhc-------cccccc--ccccCCCCcccHHHHHHHhhc--chHHHHHHHHHH Confidence 111 111100 01111111 111111 1111111110000 000010 222334444444 Q ss_pred HHHHHhhcC---CCcc--eeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-----hccChHHHHHHHHHHHhh Q lcl|NC_011045. 71 SKLMLALFP---MQTW--MRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-----SNSYRVTLFEALKQLVVA 140 (536) Q Consensus 71 a~l~~~ltP---~~~W--f~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-----~snf~~~~~~~~~dl~~~ 140 (536) .-+...--| +..= |.+.+.+........ +.. ....+++.+..... 
..+|..-+..++.|+.++ T Consensus 106 ~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~------~~~-~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~ 178 (563) T protein:vir:99 106 NQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRK------EKE-EMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIY 178 (563) T ss_pred HHHHHHhhhhhhhcccccceeEEeecCCCcchh------hhh-hhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhc Confidence 444322112 1111 111111111000000 000 11112222221111 134566677788899999 Q ss_pred CcEEEEEe--cCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEE Q lcl|NC_011045. 141 GNVLLYLP--EPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIY 218 (536) Q Consensus 141 G~~~l~~~--~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~ 218 (536) |||.+|+. .+..+.++.+..++...+.+..+.+|.+-.-.++| T Consensus 179 Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y----------------------------------- 223 (563) T protein:vir:99 179 DQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRF----------------------------------- 223 (563) T ss_pred CCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeE----------------------------------- Confidence 99998754 33344565666666677777777776542211111 Q ss_pred ecCCCCceeEEEEecCccccccccccccccCceEEEeeeec---CCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_011045. 219 LDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRL---DGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSK 295 (536) Q Consensus 219 p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~---~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~ 295 (536) ++.++|....... .++ .|.++.... ....||.||..-+...+.......+.......-... T Consensus 224 ----------~~~~~g~~~~~~~----~~e--vI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~ 287 (563) T protein:vir:99 224 ----------VQVVDKRVVASFT----SRE--LAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGT 287 (563) T ss_pred ----------EEEeCCceeEEec----Ccc--eEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCC Confidence 1112221111000 001 111221111 124699999999999999999999888888888888 Q ss_pred Cceeec--cccccchhh-------hcc--CC-C--cceecCCcccccccccccc-cchhHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_011045. 
296 VIGLVN--PAGITQPRR-------LTK--AQ-T--GDFVTGRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS 360 (536) Q Consensus 296 p~~lv~--~~g~~~~~~-------~~~--~~-~--g~~~~g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~ 360 (536) |..++. .+..++.+. +.. .+ . |.+..-..+++...++... .+.+ ..+..+..+..|-++|-... T Consensus 288 p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~q-fle~~~~~~~~Ia~afgVPp 366 (563) T protein:vir:99 288 TRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQ-FEKWLNYLINIISALYGIDP 366 (563) T ss_pred CceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHH-HHHHHHHHHHHHHHHhCCCH Confidence 885553 222223321 111 11 1 1111112333344454433 3444 34566677888989885432 Q ss_pred cccCCC--------------CCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEE Q lcl|NC_011045. 361 AVQRTG--------------ERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTI 426 (536) Q Consensus 361 ~~~~~~--------------~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~ 426 (536) ...-.. .+-++++... .=....|.|.+.+++.+|-.-|+ ++. +..+.++| T Consensus 367 ~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~--~f~~~tL~P~l~~ie~~ln~~L~-------------~~~-~~~~~~~f 430 (563) T protein:vir:99 367 AEIGFPNRGGATGSKGGSTLNEADPGKKQQ--QSQNKGLQPLLRFIEDLVNRHII-------------SEY-GDKYTFQF 430 (563) T ss_pred HHccccccccccccccccchhhccHHHHHH--HHHHHHHHHHHHHHHHHHHhhhc-------------hhc-ccccEEEe Confidence 111111 1112222211 22344566777776666544333 221 23456666 Q ss_pred echHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhh----------ccCCHHHHHHHHH-HH Q lcl|NC_011045. 427 STGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSG----------ILLTEEQKQQKMA-QQ 495 (536) Q Consensus 427 vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~----------i~rs~~ev~~~~~-q~ 495 (536) ..+=.. .+..-. ++...++ .- .+.+++ +-+.+|.+|-. +... .+..+... .. 
T Consensus 431 ~r~D~~-~~~e~~-~~~~~~~------~G----~lT~NE----~R~~~gl~Pi~gGD~~~~~~~~~~~-~~~~~~~~~~~ 493 (563) T protein:vir:99 431 VGGDTK-SATDKL-NILKLET------QI----FKTVNE----AREEQGKKPIEGGDIILDASFLQGT-AQLQQDKQYND 493 (563) T ss_pred ccCCHH-HHHHHH-HHHHHhc------CC----ccCHHH----HHHHhCCCCCCCcceeecccccccc-cccccccCCCc Confidence 433111 111111 1111110 00 011111 11122332210 0000 00000000 00 Q ss_pred HHHHHHHHHHHHHHHHHHH----hhhcCc-chHHhhhhcCCC---------CCCC Q lcl|NC_011045. 496 SMQMGMDNGAAALAQGMAA----QATASP-EAMAAAADSVGL---------QPGI 536 (536) Q Consensus 496 ~~q~~~~~~a~~~~~~~~~----~~~~~~-~~~~~~~~~~~~---------q~~~ 536 (536) +.++..-........+... +....+ ..-......+++ |-+. T Consensus 494 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 548 (563) T protein:vir:99 494 GKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSN 548 (563) T ss_pred cccchhhhhcccccCCCCCCCCCCCCCCCCCCccccccccccccccccccccCcc Confidence 0000000000000000000 000000 000000001111 1111 No 171 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=92.14 E-value=0.013 Score=30.99 Aligned_cols=428 Identities=9% Similarity=0.065 Sum_probs=148.9 Q ss_pred CCC--------ccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcc--cccccccccchHHHHHHHHH Q lcl|NC_011045. 1 MAE--------KRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNA--STDYVTPWQAVGARGLNNLA 70 (536) Q Consensus 1 Ma~--------~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~--~~~~~~~~dst~~~a~~~La 70 (536) |-+ +..+-+ +-|..+-+ .-+.++ .-|+...+.-+-. -+.+.. 
..+...|+++.+ T Consensus 42 ~~~~~~~~~~~~~~a~~-----~~~~~~~~-------~~~~~~--~~~~~~~~~~~l~~~l~~~~~--n~i~~~~I~t~~ 105 (563) T protein:vir:95 42 EYQDLTKSLYGQQQAYA-----EPFIEMMD-------TNPEFR--DKRSYMKNEHNLHDVLKKFGN--NPILNAIILTRS 105 (563) T ss_pred hHHHHHhhhccCCCcch-----hhhHhhhc-------cccccc--ccccCCCCcccHHHHHHHhhc--chHHHHHHHHHH Confidence 111 111100 01111111 111111 1111111110000 000010 222334444444 Q ss_pred HHHHHhhcC---CCcc--eeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-----hccChHHHHHHHHHHHhh Q lcl|NC_011045. 71 SKLMLALFP---MQTW--MRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-----SNSYRVTLFEALKQLVVA 140 (536) Q Consensus 71 a~l~~~ltP---~~~W--f~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-----~snf~~~~~~~~~dl~~~ 140 (536) .-+...--| +..= |.+.+.+........ +.. ....+++.+..... ..+|..-+..++.|+.++ T Consensus 106 ~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~------~~~-~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~ 178 (563) T protein:vir:95 106 NQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRK------EKE-EMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIY 178 (563) T ss_pred HHHHHHhhhhhhhcccccceeEEeecCCCcchh------hhh-hhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhc Confidence 444322112 1111 111111111000000 000 11112222221111 134566677788899999 Q ss_pred CcEEEEEe--cCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEE Q lcl|NC_011045. 141 GNVLLYLP--EPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIY 218 (536) Q Consensus 141 G~~~l~~~--~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~ 218 (536) |||.+|+. .+..+.++.+..++...+.+..+.+|.+-.-.++| T Consensus 179 Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y----------------------------------- 223 (563) T protein:vir:95 179 DQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRF----------------------------------- 223 (563) T ss_pred CCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeE----------------------------------- Confidence 99998754 33344565666666677777777776542211111 Q ss_pred ecCCCCceeEEEEecCccccccccccccccCceEEEeeeec---CCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_011045. 
219 LDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRL---DGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSK 295 (536) Q Consensus 219 p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~---~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~ 295 (536) ++.++|....... .++ .|.++.... ....||.||..-+...+.......+.......-... T Consensus 224 ----------~~~~~g~~~~~~~----~~e--vI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~ 287 (563) T protein:vir:95 224 ----------VQVVDKRVVASFT----SRE--LAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGT 287 (563) T ss_pred ----------EEEeCCceeEEec----Ccc--eEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCC Confidence 1112221111000 001 111221111 124699999999999999999999888888888888 Q ss_pred Cceeec--cccccchhh-------hcc--CC-C--cceecCCcccccccccccc-cchhHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_011045. 296 VIGLVN--PAGITQPRR-------LTK--AQ-T--GDFVTGRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS 360 (536) Q Consensus 296 p~~lv~--~~g~~~~~~-------~~~--~~-~--g~~~~g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~ 360 (536) |..++. .+..++.+. +.. .+ . |.+..-..+++...++... .+.+ ..+..+..+..|-++|-... T Consensus 288 p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~q-fle~~~~~~~~Ia~afgVPp 366 (563) T protein:vir:95 288 TRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQ-FEKWLNYLINIISALYGIDP 366 (563) T ss_pred CceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHH-HHHHHHHHHHHHHHHhCCCH Confidence 885553 222223321 111 11 1 1111112333344454433 3444 34566677888989885432 Q ss_pred cccCCC--------------CCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEE Q lcl|NC_011045. 361 AVQRTG--------------ERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTI 426 (536) Q Consensus 361 ~~~~~~--------------~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~ 426 (536) ...-.. .+-++++... .=....|.|.+.+++.+|-.-|+ ++. 
+..+.++| T Consensus 367 ~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~--~f~~~tL~P~l~~ie~~ln~~L~-------------~~~-~~~~~~~f 430 (563) T protein:vir:95 367 AEIGFPNRGGATGSKGGSTLNEADPGKKQQ--QSQNKGLQPLLRFIEDLVNRHII-------------SEY-GDKYTFQF 430 (563) T ss_pred HHccccccccccccccccchhhccHHHHHH--HHHHHHHHHHHHHHHHHHHhhhc-------------hhc-ccccEEEe Confidence 111111 1112222211 22344566777776666544333 221 23456666 Q ss_pred echHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhh----------ccCCHHHHHHHHH-HH Q lcl|NC_011045. 427 STGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSG----------ILLTEEQKQQKMA-QQ 495 (536) Q Consensus 427 vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~----------i~rs~~ev~~~~~-q~ 495 (536) ..+=.. .+..-. ++...++ .- .+.+++ +-+.+|.+|-. +... .+..+... .. T Consensus 431 ~r~D~~-~~~e~~-~~~~~~~------~G----~lT~NE----~R~~~gl~Pi~gGD~~~~~~~~~~~-~~~~~~~~~~~ 493 (563) T protein:vir:95 431 VGGDTK-SATDKL-NILKLET------QI----FKTVNE----AREEQGKKPIEGGDIILDASFLQGT-AQLQQDKQYND 493 (563) T ss_pred ccCCHH-HHHHHH-HHHHHhc------CC----ccCHHH----HHHHhCCCCCCCcceeecccccccc-cccccccCCCc Confidence 433111 111111 1111110 00 011111 11122332210 0000 00000000 00 Q ss_pred HHHHHHHHHHHHHHHHHHH----hhhcCc-chHHhhhhcCCC---------CCCC Q lcl|NC_011045. 496 SMQMGMDNGAAALAQGMAA----QATASP-EAMAAAADSVGL---------QPGI 536 (536) Q Consensus 496 ~~q~~~~~~a~~~~~~~~~----~~~~~~-~~~~~~~~~~~~---------q~~~ 536 (536) +.++..-........+... +....+ ..-......+++ |-+. 
T Consensus 494 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 548 (563) T protein:vir:95 494 GKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSN 548 (563) T ss_pred cccchhhhhcccccCCCCCCCCCCCCCCCCCCccccccccccccccccccccCcc Confidence 0000000000000000000 000000 000000001111 1111 No 172 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=91.64 E-value=0.015 Score=30.60 Aligned_cols=382 Identities=11% Similarity=0.024 Sum_probs=159.5 Q ss_pred HHHHHHHHhh-------hHHHHHHHHHHHhcccccCCCCCcccc-ccccccc-chHHHHHHHHHHHHHHhhcCCCcceec Q lcl|NC_011045. 16 VYERLKNDRA-------PYETRAQNCAQYTIPSLFPKDSDNAST-DYVTPWQ-AVGARGLNNLASKLMLALFPMQTWMRL 86 (536) Q Consensus 16 r~~~l~~~R~-------~~e~~w~e~~~~~~P~~~~~~~~~~~~-~~~~~~d-st~~~a~~~Laa~l~~~ltP~~~Wf~l 86 (536) .|+-++..|. .-.+.|..+..++-=...... ..+.. ...+... ++--.|++.+|..+- .| ||.-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~-~~g~~v~~~~al~~~~V~~~v~~Ia~~iA-~l----p~~~~ 74 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAW-QQGVKADPEAVLSFHAVFACISLISQDIA-KM----RLRLM 74 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchh-hcCcccChHHhhccHHHHHHHHHHHHhhc-cC----ceEEE Confidence 3333322111 112334443332210011000 00000 0011111 222334555555442 22 44322 Q ss_pred cCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhc----cChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEe Q lcl|NC_011045. 87 TISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESN----SYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYR 162 (536) Q Consensus 87 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~ 162 (536) .-...... 
.+++ ...++..+.+= +.+.=.+.++.++..+|||++++..+..+.+..+..++ T Consensus 75 ~~~~~g~~---------~~~~------~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~ 139 (454) T protein:vir:93 75 QTDAQGIR---------RETR------RGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILD 139 (454) T ss_pred EeccCCcc---------chhh------hHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEc Confidence 21111100 0111 11112223222 34456667777889999999998877666666666666 Q ss_pred cceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccccccc Q lcl|NC_011045. 163 LSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDG 242 (536) Q Consensus 163 l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~ 242 (536) ...+-+..+.+|++- | ++. .....+... .. T Consensus 140 ~~~v~v~~~~~g~~~--y-~~~----------------------------------~~~~~~~~~-------------~~ 169 (454) T protein:vir:93 140 WNRVEPLVADDGEVF--Y-RIT----------------------------------PDRNCGITE-------------AV 169 (454) T ss_pred CcceEEEEcCCCcEE--E-EEE----------------------------------eccccccce-------------eE Confidence 666666666655431 1 100 000000000 00 Q ss_pred ccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCC------- Q lcl|NC_011045. 243 TYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQ------- 315 (536) Q Consensus 243 ~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~------- 315 (536) .+ ..-=.+++|+....+..||.||...+...+.....+.+.......-...|..++.-++.++++...... 
T Consensus 170 ~~--~~~eViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~ 247 (454) T protein:vir:93 170 TV--PAREVIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPGSITEENAKKLKSNWDSGY 247 (454) T ss_pred Ee--cCcceEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHHHHHHHHHh Confidence 00 011145555555556789999999999999999999988888888888888777766655554332110 Q ss_pred ----CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHH-HHHHHhhhhH Q lcl|NC_011045. 316 ----TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVAS-ELEDTLGGVY 389 (536) Q Consensus 316 ----~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~-E~~~~LG~v~ 389 (536) .|.+. --.++....++.. +.+.+. .+..+..+..|-++|-.-....-....-|-.-+.+... =....|.|.+ T Consensus 248 ~g~n~g~~~-vl~~g~~~~~l~~~~~d~q~-le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~ 325 (454) T protein:vir:93 248 TGENAGKTA-ILSNGAKYNPTTFSPVDSQT-VEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQYYSQCLQTLI 325 (454) T ss_pred cccccCCce-eccCCceEEEcccChhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHHHHHHHHHHHH Confidence 11110 0112222333332 234443 34455666778888743321111122223222222222 2445788888 Q ss_pred HHHHHHHHHHHHH----------------------HHHHHHHhcCC-----------CCCCCCcceE-E-EEechHHHHH Q lcl|NC_011045. 390 SILSQELQLPLVR----------------------VLLKQLQATQQ-----------IPELPKEAVE-P-TISTGLEAIG 434 (536) Q Consensus 390 ~rl~~E~l~Pli~----------------------r~~~il~~~g~-----------lp~~~~~~v~-v-~~vs~La~a~ 434 (536) .+++.++-.-|+. ..+..+.+.|. +||+++.+.- + .-..++..+. T Consensus 326 ~~ie~~ln~~L~~~~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~~~~~~~~~~~~~ 405 (454) T protein:vir:93 326 ESIELLLDEALETGENESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALYLQQQNYSLEALS 405 (454) T ss_pred HHHHHHHHHhhcCCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccCccchHhhh Confidence 8888887443320 11222333332 2556654411 1 0011233322 Q ss_pred HHHHHHHHHHHHHHHHhhcchh---hhh--------cCCHHHHHHHHHHHcCCChhhccCC Q lcl|NC_011045. 
435 RGQDLDKLERCVAAWAALAPMR---DDP--------DINLAMIKLRIANAIGIDTSGILLT 484 (536) Q Consensus 435 r~~~~~~l~~~~~~~~~~~p~~---~~~--------~id~d~~~~~~a~~~Gv~p~~i~rs 484 (536) +..+-+.=..-.+.-++ .|+. .+. +-..+..++.+ ++- T Consensus 406 ~~~~~~~~~~~~~~~~~-~~~~~~~~d~~~~~~e~~~d~~~~~~~~~-----------~~~ 454 (454) T protein:vir:93 406 RRDAREDPFASSGKTAS-VPQAVAASDGNKAITETEHDAVKAMFRGI-----------LKK 454 (454) T ss_pred ccCcccCCCCCCccCCC-CCCCCCCCCCCCCccCCccchhhhhhhhh-----------hcC Confidence 21111000000000000 0100 010 00122233333 322 No 173 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=91.57 E-value=0.015 Score=30.54 Aligned_cols=370 Identities=11% Similarity=0.044 Sum_probs=144.6 Q ss_pred hHHHHHH-HHHHHhccc-------ccCCCCCcccccc--ccc-ccchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhh Q lcl|NC_011045. 26 PYETRAQ-NCAQYTIPS-------LFPKDSDNASTDY--VTP-WQAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAK 94 (536) Q Consensus 26 ~~e~~w~-e~~~~~~P~-------~~~~~~~~~~~~~--~~~-~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~ 94 (536) -|-.+|. .-..-.-|. .+....+.+.... .+. =.++--.|++.+|+.+-+ + ||--.......-. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~v~~~~al~~~~v~~cv~~ia~~ia~-l----p~~~~~~~~~~~~ 75 (419) T protein:vir:80 1 MFFSRQLLSNLGQTQPGSGGWVSALLGSARSEAGQVVTPASALSLTVLQNCVTLLAESIAQ-L----PVELYERSGDDRK 75 (419) T ss_pred CCcccccccccCcCCCCcchhhHHhhcccccccCcccChHHhhccHHHHHHHHHHHHhhcc-C----ceEEEEecCCCcc Confidence 1111110 000000000 0000000000000 011 123333455555555532 2 3321111111000 Q ss_pred hhccChhHHHHHHHHHHHHHHHHHHHHH-hc----cChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEe Q lcl|NC_011045. 
95 QLLSDPDGLAKVDEGLSMVERIIMNYIE-SN----SYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQ 169 (536) Q Consensus 95 ~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~ 169 (536) + + .+.-+...|. +- +.+.-.+..+.++.++|||++++..+..+.+..+..++.+.+.+. T Consensus 76 ~----------~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~ 139 (419) T protein:vir:80 76 P----------A------TDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVM 139 (419) T ss_pred c----------c------cccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEE Confidence 0 0 0111222232 22 234445677778899999999998776666665666666666666 Q ss_pred eCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccccC Q lcl|NC_011045. 170 RDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEAC 249 (536) Q Consensus 170 ~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~ 249 (536) .+.+|++. | .+.|...... + T Consensus 140 ~~~~~~~~--y--------------------------------------------------~~~~~~~~~~------~-- 159 (419) T protein:vir:80 140 KGPDLKPM--Y--------------------------------------------------RVAGADPLPQ------R-- 159 (419) T ss_pred ECCCceEE--E--------------------------------------------------EEcCccccch------h-- Confidence 66554320 1 1111111000 1 Q ss_pred ceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccc----ccchhhh---cc-------C- Q lcl|NC_011045. 250 PYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAG----ITQPRRL---TK-------A- 314 (536) Q Consensus 250 P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g----~~~~~~~---~~-------~- 314 (536) =+++.|+...+| .||.||..-+...+.......+.......-...|..++.-.+ ..+.+.. .. 
+ T Consensus 160 ~i~h~~~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 238 (419) T protein:vir:80 160 LVHHVRWMSING-YTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWNAKFGGS 238 (419) T ss_pred heEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCc Confidence 145666665555 899999999999999888888888888888888887764222 1122211 10 0 Q ss_pred CC-cceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-ccc--CCCCCCCHHHHHHHHHHHHHHhhhhH Q lcl|NC_011045. 315 QT-GDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQ--RTGERVTAEEIRYVASELEDTLGGVY 389 (536) Q Consensus 315 ~~-g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~--~~~~r~TAtEi~~r~~E~~~~LG~v~ 389 (536) .+ |.+. --.++....++.. +.+.+. .+..+..++.|-.+|-... ++. ..+..-++++... T Consensus 239 ~n~g~~~-vl~~g~~~~~l~~s~~d~q~-~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~------------- 303 (419) T protein:vir:80 239 GNAKKVA-LLQEGMKFKPLSMTNVDAAL-IDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSL------------- 303 (419) T ss_pred cccCCce-ecCCCceEEeccCChhhHHH-HHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHH------------- Confidence 01 1111 1122333444442 334443 3444555778888885432 111 1122223332221 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEE-echHHH---HHHHHHHHHHHH--H--HHHHHh---hcchhhh Q lcl|NC_011045. 390 SILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTI-STGLEA---IGRGQDLDKLER--C--VAAWAA---LAPMRDD 458 (536) Q Consensus 390 ~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~-vs~La~---a~r~~~~~~l~~--~--~~~~~~---~~p~~~~ 458 (536) .+...-+.|++.++-..+.+. ++++-....+.++| ++.|-. ..|......+.+ + .+.+-. +.|. 
T Consensus 304 -~f~~~~l~P~~~~ie~~l~~k-ll~~~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~--- 378 (419) T protein:vir:80 304 -QFVIYTLLPWVKRHEQAKTRD-LLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPV--- 378 (419) T ss_pred -HHHHHHHHHHHHHHHHHHhhh-ccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--- Confidence 123333555555444444332 34433233444555 233321 112222222111 0 011111 1110 Q ss_pred hcCCHHHHHHH--HHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 459 PDINLAMIKLR--IANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAA 514 (536) Q Consensus 459 ~~id~d~~~~~--~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~ 514 (536) + +.|...-- +...-...|...-.++++-+... +...+++ T Consensus 379 ~--gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~l~ 419 (419) T protein:vir:80 379 K--GGDIYLSPMNMVDASKPQPIPMGKTEPTKAALD---------------EIGRILS 419 (419) T ss_pred C--CcceeeeccccccccccccccCCCCCchhhhHH---------------HHHhhcC Confidence 0 12222110 11111111111112222111111 1111111 No 174 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=91.44 E-value=0.016 Score=30.46 Aligned_cols=452 Identities=10% Similarity=0.082 Sum_probs=192.6 Q ss_pred CCC-ccccccHHHHHHHHHHHH---HHhhhHHHHHHHHHHHhcccccCCCCCc-c----cccccc--cccchHHHHHHHH Q lcl|NC_011045. 1 MAE-KRTGLAEEGAKSVYERLK---NDRAPYETRAQNCAQYTIPSLFPKDSDN-A----STDYVT--PWQAVGARGLNNL 69 (536) Q Consensus 1 Ma~-~~~~~~~~~~~~r~~~l~---~~R~~~e~~w~e~~~~~~P~~~~~~~~~-~----~~~~~~--~~dst~~~a~~~L 69 (536) |-. .-.+++.+....++.... ..+. +..+.|.-+......... . ..+... 
..++.+..+++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~------~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~ 74 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYHGGGGGFG------GQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLH 74 (530) T ss_pred CccceeecCccccchHHHhhhhcccCCCC------CcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 432 111222222222222211 1111 112222111111111000 0 011111 3677899999999 Q ss_pred HHHHHHh-hcC-CCc-ceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHH----------HhccChHHHHHHHHH Q lcl|NC_011045. 70 ASKLMLA-LFP-MQT-WMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYI----------ESNSYRVTLFEALKQ 136 (536) Q Consensus 70 aa~l~~~-ltP-~~~-Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l----------~~snf~~~~~~~~~d 136 (536) ++.+++. ++| ++| |-.|...... .++|-+.|++...... ...+||.....++.. T Consensus 75 ~~nvVG~Gi~~~~~p~~~~l~~~~~~-------------~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~ 141 (530) T protein:vir:38 75 QDHIVGSFFRLSYRPSWRYLGINEED-------------SRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAM 141 (530) T ss_pred HHHhhCCCceeeeccchhhcCCCHhH-------------HHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHH Confidence 9999884 668 454 5555444322 2334444444443322 135899999999999 Q ss_pred HHhhCcEEEEEecC-CCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEE Q lcl|NC_011045. 137 LVVAGNVLLYLPEP-EGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYT 215 (536) Q Consensus 137 l~~~G~~~l~~~~~-~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~ 215 (536) .++-|-+++-+... ..+.++.+++--| -.+.|.... . .++ .-.|+. T Consensus 142 ~~~dGE~~~~~~~~~~~g~~~~~~lq~i----------------------e~d~l~~~~--------~--~~~-~~~i~~ 188 (530) T protein:vir:38 142 HAFNGELCVQATWDSDSTRLFRTQFKMV----------------------SPKRVSNPN--------N--IGD-TRNCRA 188 (530) T ss_pred HhhCCceEEEeeeccCCCCccceEEEEe----------------------chhhcCCCC--------C--CCC-CCeeEe Confidence 99999887744322 2222222221111 111111000 0 000 113566 Q ss_pred EEEecCCCCceeEEEEecCccccccccc----cc-cccCc---eEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 
216 HIYLDEDSGEYIRYEEVEGMEVQGSDGT----YP-KEACP---YIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIV 287 (536) Q Consensus 216 ~v~p~~~~~~~~~~~~v~g~~i~~~~~~----~~-~~~~P---~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~ 287 (536) .|+.+..|..-.+| -.... ..+..+. ++ +...| +++.-....+|..=|.+..-.+|..++.|+....+.+ T Consensus 189 GIe~d~~Gr~~aY~-i~~~~-~~~~~~~~~~~~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael 266 (530) T protein:vir:38 189 GVKINDSGAALGYY-VSDDG-YPGWMAQNWTYIPRELPGGRPSFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQL 266 (530) T ss_pred eeEECCCCceEEEE-Eeecc-CCCccccccceeeeeeccChhHeEeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHH Confidence 67666554332222 11110 0011100 00 11112 3333344568999999999999999999999999999 Q ss_pred HHHHHHhCCceeeccc-cccchh----------------------------hhccCCCcceecCCccc-ccccccc-ccc Q lcl|NC_011045. 288 KMSMISSKVIGLVNPA-GITQPR----------------------------RLTKAQTGDFVTGRPED-ISFLQLE-KQA 336 (536) Q Consensus 288 ~~~~~a~~p~~lv~~~-g~~~~~----------------------------~~~~~~~g~~~~g~~~~-~~~~~~~-~~~ 336 (536) .++..++.....+..+ +..... .....++|.|..-.++. +.+..-. .+. T Consensus 267 ~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~ 346 (530) T protein:vir:38 267 QSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSLNLQSAQDTDN 346 (530) T ss_pred HHHHHhhhheeeeeccCCccccccccccCCcccccccccccchhhhhcccccceeccCceeeecCCCCeeeeeCCCCCCC Confidence 9999999888666421 100000 00112355554433332 3322211 123 Q ss_pred chhHHHHHHHHHHHHHHHHHh--hhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_011045. 337 DFTVAKAVSDAIEARLSFAFM--LNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQI 414 (536) Q Consensus 337 ~~~~~~~~i~~~~~rI~~af~--~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~l 414 (536) +|. .....+...|..++= +..+. 
.|-..++=.-+++-..|..+.+--.=..+..-|+.|+..+.+..+...|.+ T Consensus 347 ~~~---~f~~~~lr~iaaglGi~ye~lt-~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i 422 (530) T protein:vir:38 347 GYS---TFEQSLLRYIAAGLGVSYEQLS-RNYSQMSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVV 422 (530) T ss_pred CHH---HHHHHHHHHHHhhcCCCHHHHh-cccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCc Confidence 332 233344445544441 12222 344455555566666666666666556677778899999999999999998 Q ss_pred CCCCCcc----------eEEEEechHH-HHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccC Q lcl|NC_011045. 415 PELPKEA----------VEPTISTGLE-AIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILL 483 (536) Q Consensus 415 p~~~~~~----------v~v~~vs~La-~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~r 483 (536) |-+.+.. ++++++.|=- ..=-..+++....-+.. -+.. ...++...|.||. T Consensus 423 ~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~-----------G~~s---~~~~~a~~G~D~~---- 484 (530) T protein:vir:38 423 TLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEA-----------GLST---YEKECAKRGDDYQ---- 484 (530) T ss_pred cCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHc-----------CCCC---HHHHHHHcCCCHH---- Confidence 8443221 3444444310 00000111111000000 0000 0112223454443 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhh-cCCCCCC Q lcl|NC_011045. 484 TEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAAD-SVGLQPG 535 (536) Q Consensus 484 s~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~-~~~~q~~ 535 (536) |+.+.++.....++....... 
...........+.......+ +.| | T Consensus 485 ---~v~~q~a~e~~~~~~~Gl~~~-~~~~~~~~~~~~~~~~~~~d~~~~---a 530 (530) T protein:vir:38 485 ---EIFAQQVRESMERRAAGLNPP-AWAAAAFEAGVKKSNEEEQDGARA---A 530 (530) T ss_pred ---HHHHHHHHHHHHHHHcCCCCC-CCcccccCCCCCCCCCCCCCCCCC---C Confidence 222222222211111100000 00000000000000000000 000 1 No 175 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=91.39 E-value=0.016 Score=30.42 Aligned_cols=404 Identities=13% Similarity=0.088 Sum_probs=169.3 Q ss_pred HHHHHHHHHHhcccccCCCCCcccc-cccccc--cchHHH-----HHHHHHHHHHHhhcCC----CcceeccCChhhhhh Q lcl|NC_011045. 28 ETRAQNCAQYTIPSLFPKDSDNAST-DYVTPW--QAVGAR-----GLNNLASKLMLALFPM----QTWMRLTISEYEAKQ 95 (536) Q Consensus 28 e~~w~e~~~~~~P~~~~~~~~~~~~-~~~~~~--dst~~~-----a~~~Laa~l~~~ltP~----~~Wf~l~~~d~~~~~ 95 (536) ...-.-+..++. .+ ++.... .....| .-.+.. .-+-|+.++... |+ +.|+.+...+... T Consensus 1 ~~~~D~~~~~~~-~~----g~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~vd~--~a~d~~r~~~~i~~~d~~~-- 71 (437) T protein:vir:52 1 MKFFDGIKSLAL-KL----GSKQEQTYYSPSLSLTDDLVQLEALWRDNWIANKVCIK--RPEDMVRNWREIYSNDLNS-- 71 (437) T ss_pred CchhhhhHhHHh-cC----CCccccceeecCccccccHHHHHHHHHhCchhhHHhhc--chHHhhcCCceEecCCCCH-- Confidence 111111111111 00 110000 000011 011111 122333333322 54 5799987543211 Q ss_pred hccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCCCCC Q lcl|NC_011045. 96 LLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGN 175 (536) Q Consensus 96 ~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~ 175 (536) +. + +.+.+.+.+-++...+.++++.--.||.|++++.-+.. .. .-|+. ..|. 
T Consensus 72 -----~~---~--------~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~-~~----~~pl~-------~~~~ 123 (437) T protein:vir:52 72 -----KQ---L--------DLFTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQ-NT----SAPLK-------PTER 123 (437) T ss_pred -----HH---H--------HHHHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCC-Cc----ccccc-------cCCc Confidence 11 1 12233444457899999999988899999998765432 11 12321 1232 Q ss_pred eEEE--EEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccccCceEE Q lcl|NC_011045. 176 VLQM--VTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIP 253 (536) Q Consensus 176 v~~i--~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~ 253 (536) +..+ +-+..++.. .-....+ .+..+-..+.|+. +..+....+ .-.+++...+. ..| T Consensus 124 ~~~~~v~~~~~v~~~-----~~~~~dp---~s~~fg~p~~y~v---~~~~~~~~i----H~SRii~~~~~----~~~--- 181 (437) T protein:vir:52 124 LKRLIILPKWKISPT-----GTKDDDV---LSPNFGRYSEYSI---LGGSQSITV----HHSRLIILNAN----DAP--- 181 (437) T ss_pred eeEEEEechhhcccc-----ccccccc---cccccCcceEEEE---ecCCcceeE----ccceeEEecCc----cCC--- Confidence 2221 111111110 0000000 0000111122221 111111111 11122222211 112 Q ss_pred EeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecc-cc---------ccchhhhc---cCCCccee Q lcl|NC_011045. 254 IRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNP-AG---------ITQPRRLT---KAQTGDFV 320 (536) Q Consensus 254 ~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~-~g---------~~~~~~~~---~~~~g~~~ 320 (536) .....-||+|+.+..+..++..+.......+.+..+..+.+.++. .. +....... 
.+..|.++ T Consensus 182 ----~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 257 (437) T protein:vir:52 182 ----LSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKIDIFKIAGLSDKIAAGMENEVASVISAVQEIKSATNSLL 257 (437) T ss_pred ----CccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCCceecchHHHHhcCCcHHHHHHHHHHHHHhcCCCceEE Confidence 133667899999999999999999999988888877767665531 00 11111111 12223333 Q ss_pred cCCcccccccccccccchhHHHHHHHHHHHHHHHHHhh--hhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHH Q lcl|NC_011045. 321 TGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML--NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQL 398 (536) Q Consensus 321 ~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~--~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~ 398 (536) -+..++...+. .+|.-+...+....+.|..++=. .-+...+.... |+ .++-.+..---+..++...+. T Consensus 258 ~d~~~~~e~~~----~~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Gl-as-----ge~D~~~yyd~i~~~Qe~~l~ 327 (437) T protein:vir:52 258 LDAENEYDRKE----LTFTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGL-AS-----GDEDIQNYHEAIRRLQETRLR 327 (437) T ss_pred EcCCcceEEEe----cCcCCHHHHHHHHHHHHHHHhcCchhhhcCcCcccc-cc-----cHHHHHHHHHHHHHHHHHHHH Confidence 33334433332 23444556677777888776621 11111222223 22 111122333345566777899 Q ss_pred HHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCCh Q lcl|NC_011045. 399 PLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDT 478 (536) Q Consensus 399 Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p 478 (536) |++++++.++.+....+ ++. +++++|- ||.+....+.++......+....+.. ...++++++.+.+.+. 
|+-+ T Consensus 328 p~le~l~~~i~~~~~g~-~~~-~~~~~f~-pL~~~s~kekae~~~~~a~a~~~~~~---~g~i~~~e~r~~L~~~-g~~~ 400 (437) T protein:vir:52 328 PIFEIIDPLICNELFGG-LPA-DWWFEFV-PLTTVKQEQQINMLNTFATAANTLIQ---NGVLNEYQIANELRES-GLFA 400 (437) T ss_pred HHHHHHHHHHHHHhcCC-CCC-cceEEeC-CcCCcCHHHHHHHHHHHHHHHHHHHh---cCCCCHHHHHHHHHhc-CCCC Confidence 99999999988764333 332 5777775 55444333333222222222222111 1147788887776553 4422 Q ss_pred hhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCCCC Q lcl|NC_011045. 479 SGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) Q Consensus 479 ~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~ 536 (536) .+ +++++...--. .+.. +...+.....+.||+- T Consensus 401 -~i--~~~~~~~~~~~-------------------~~~~---~~~~~~~~~~~~~~~~ 433 (437) T protein:vir:52 401 -NI--SAEHIEELKNA-------------------DEFA---GNFEEPEKMEGAQVQN 433 (437) T ss_pred -CC--CccccccccCC-------------------CCCC---CccCCCCCCCCCCCCC Confidence 11 22222111000 0000 0011111122222222 No 176 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=90.87 E-value=0.019 Score=30.07 Aligned_cols=371 Identities=12% Similarity=0.039 Sum_probs=150.5 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHH---HHHHHHHhcccccCCCCCccccccccccc-chHHHHHHHHHHHHHHh Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETR---AQNCAQYTIPSLFPKDSDNASTDYVTPWQ-AVGARGLNNLASKLMLA 76 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~---w~e~~~~~~P~~~~~~~~~~~~~~~~~~d-st~~~a~~~Laa~l~~~ 76 (536) |.- +++.|...+....+.... -..+.++. ........ -.....+. ++--.|++.+|+.+ +. 
T Consensus 1 M~~---------~~~~f~~~~r~~~~~~~~~~~~~~~~~~~----g~~~~~~~-v~~~~al~~~~v~~~i~~ia~~i-a~ 65 (429) T protein:vir:10 1 MDS---------VKKFFNFEKRQTSQVIELNKDDEKLLEWL----GISPSTIS-VKGKNALKVATVFACIKILSESV-SK 65 (429) T ss_pred Cch---------hhhhhcccccCcccccccCCChHHHHHHh----cCCCCcce-echhhhhccHHHHHHHHHHHHhh-cc Confidence 322 111111111111111110 01112221 11100000 00012222 33334455555544 32 Q ss_pred hcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHh-----ccChHHHHHHHHHHHhhCcEEEEEecCC Q lcl|NC_011045. 77 LFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIES-----NSYRVTLFEALKQLVVAGNVLLYLPEPE 151 (536) Q Consensus 77 ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~-----snf~~~~~~~~~dl~~~G~~~l~~~~~~ 151 (536) + ||--..-.+....+ ..+.-+...|+. -+.+.-+..++.++.++||+.+++..+. T Consensus 66 l----~~~~~~~~~~~~~~----------------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~ 125 (429) T protein:vir:10 66 L----PLKIYQEDEYGIQR----------------GTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDR 125 (429) T ss_pred C----ceEEEEecCCceee----------------ccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECC Confidence 2 33211111111000 001112222321 2344556778889999999999998777 Q ss_pred CCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEE Q lcl|NC_011045. 152 GSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEE 231 (536) Q Consensus 152 ~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~ 231 (536) .+.++.+..+|...+.+..|..|.+..-++ ++..+ +. ++..+. T Consensus 126 ~G~~~~L~~i~~~~v~v~~~~~~~~~~~~~-------------------------------~~~~~--~~-~g~~~~--- 168 (429) T protein:vir:10 126 KGKVQALWPIDASKVTVYIDDVGLLNSKTK-------------------------------MWYVV--NT-GGQQRV--- 168 (429) T ss_pred CCcEEEEEEEcCceeEEEEcCcccccccce-------------------------------EEEEE--cc-CCeEEE--- Confidence 666767777777777777777665432111 11000 00 111110 Q ss_pred ecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhh Q lcl|NC_011045. 
232 VEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRL 311 (536) Q Consensus 232 v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~ 311 (536) +..--++++|.....+..||.||...+...+.......+.......-...|.+++.-++.++++.. T Consensus 169 --------------~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~ 234 (429) T protein:vir:10 169 --------------LKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAK 234 (429) T ss_pred --------------EccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHH Confidence 111225666655556678999999999999999999999999999999999988776665554432 Q ss_pred cc---------CC---CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhccc---CCCCCCCHHHHH Q lcl|NC_011045. 312 TK---------AQ---TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQ---RTGERVTAEEIR 375 (536) Q Consensus 312 ~~---------~~---~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~---~~~~r~TAtEi~ 375 (536) .. ++ .|.+. --.++....++.. +.+.+. .+.....++.|-.+|-...... .++..-+++|.. T Consensus 235 ~~~~~~~~~~~~g~~n~~~~~-vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~ 312 (429) T protein:vir:10 235 KVFRENFESMSSGLQNSHRIA-LMPVGYQFQPISLNMSDAQF-LENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQ 312 (429) T ss_pred HHHHHHHHHHhccccccCcee-ecCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH Confidence 11 00 01111 1122233334432 344553 3445566778888885432111 122222333322 Q ss_pred HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCC-CcceEEEE-echHH---HHHHHHHHHHHHH--HH-- Q lcl|NC_011045. 376 YVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELP-KEAVEPTI-STGLE---AIGRGQDLDKLER--CV-- 446 (536) Q Consensus 376 ~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~-~~~v~v~~-vs~La---~a~r~~~~~~l~~--~~-- 446 (536) .. =....|-|.+..++++|-.-| +++.. ...+.++| ++.|- -..|....+.+.. 
++ T Consensus 313 ~~--f~~~~l~P~~~~ie~~ln~kl-------------~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~ 377 (429) T protein:vir:10 313 QQ--FYTDTLQATLTMYEQEMTYKL-------------FLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKP 377 (429) T ss_pred HH--HHHHHHHHHHHHHHHHHHHhh-------------cChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCH Confidence 21 123344555555555543222 22111 11122222 11111 0111111111111 00 Q ss_pred HHH---Hhhcch-hhhhc---CC---HHH--------------HHHHHHHHc Q lcl|NC_011045. 447 AAW---AALAPM-RDDPD---IN---LAM--------------IKLRIANAI 474 (536) Q Consensus 447 ~~~---~~~~p~-~~~~~---id---~d~--------------~~~~~a~~~ 474 (536) +.+ -.+.|. -.|.. .| .|. .-+.-.+.. T Consensus 378 NE~R~~~gl~p~~ggD~~~~~~n~~~~d~~~~~~~k~g~~~~~~~~~~~e~~ 429 (429) T protein:vir:10 378 NEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 429 (429) T ss_pred HHHHHHhCCCCCCCcCeeeecccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 000 001110 00000 00 000 000111111 No 177 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=90.36 E-value=0.021 Score=29.75 Aligned_cols=432 Identities=10% Similarity=0.079 Sum_probs=159.4 Q ss_pred CCCccccccHHHHHHHHHHHHH---HhhhHHHHHHH---------------HHHHhc--------ccccCCCCCcc--cc Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKN---DRAPYETRAQN---------------CAQYTI--------PSLFPKDSDNA--ST 52 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~---~R~~~e~~w~e---------------~~~~~~--------P~~~~~~~~~~--~~ 52 (536) |-+ +...|++++- ..+.+..|-+. +.++.. 
|......-..+ .+ T Consensus 1 ~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~~r 71 (551) T protein:vir:80 1 MKN---------KLGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTK 71 (551) T ss_pred Cch---------hhhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccccceecCcccccC Confidence 211 1122222221 11111111111 111111 11100000111 11 Q ss_pred cccc----------cc--cchHHHHHHHHHHHHHHhhcCCC-----cceeccCChhhhhhhccChhHHHHHHHHHHHHHH Q lcl|NC_011045. 53 DYVT----------PW--QAVGARGLNNLASKLMLALFPMQ-----TWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVER 115 (536) Q Consensus 53 ~~~~----------~~--dst~~~a~~~Laa~l~~~ltP~~-----~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~ 115 (536) ...+ .| ..+...|++..|.-+.+.-.+.. .=|.+.+.+..-.....+.. -...++ T Consensus 72 ~~~~~~~~l~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~-------~~~~i~- 143 (551) T protein:vir:80 72 PSIRNNQDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEA-------TIKRIE- 143 (551) T ss_pred ccccChhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHH-------HHHHHH- Confidence 1111 12 23334566777776654333321 12333333322111111111 111122 Q ss_pred HHHHHHHhcc---------ChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEecc Q lcl|NC_011045. 116 IIMNYIESNS---------YRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIA 186 (536) Q Consensus 116 ~~~~~l~~sn---------f~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t 186 (536) ..|.+-| |..-+...+.|+.++|||.+++..+..+.+..+..++.+.+.+..+.+|.+..-.++| T Consensus 144 ---~~l~~pn~~~~p~~~s~~~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~~~~~~~~y--- 217 (551) T protein:vir:80 144 ---SFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRF--- 217 (551) T ss_pred ---HHHHhcCCCCCCccchHHHHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCccccccCceEE--- Confidence 2233333 3445666788899999999988777777777777777777777777777532200000 Q ss_pred HHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccccCceEEEeeee---cCCCc Q lcl|NC_011045. 
187 FGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVR---LDGES 263 (536) Q Consensus 187 ~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~---~~ge~ 263 (536) ++..+|..... ++-++ .++++.+. ..+.+ T Consensus 218 ------------------------------------------~~~~~g~~~~~----~~~~e--iiH~~~n~~~~~~~~~ 249 (551) T protein:vir:80 218 ------------------------------------------VQVIDQKIVAT----FNARE--MAFAVRNPRSDIYATG 249 (551) T ss_pred ------------------------------------------EEEeCCcEEEE----Ecccc--eEEecccCCCCccccc Confidence 11111111100 00011 22222211 12347 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceee--ccccccchhhhc-------c---CCC--cc--eecCCcccc Q lcl|NC_011045. 264 YGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLV--NPAGITQPRRLT-------K---AQT--GD--FVTGRPEDI 327 (536) Q Consensus 264 YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv--~~~g~~~~~~~~-------~---~~~--g~--~~~g~~~~~ 327 (536) ||.||..-+...+.......+.......-...|.+++ +.+.....+... . +.. |. ++. .+++ T Consensus 250 ~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~vl~--~~g~ 327 (551) T protein:vir:80 250 YGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVS--AEDV 327 (551) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcccccc--CCCc Confidence 9999999999999999999888888888888888654 333333332211 1 111 11 121 2233 Q ss_pred ccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh--cccC-C-------CCCCCHHHHHHHH-HHHHHHhhhhHHHHHHH Q lcl|NC_011045. 328 SFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS--AVQR-T-------GERVTAEEIRYVA-SELEDTLGGVYSILSQE 395 (536) Q Consensus 328 ~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~-~-------~~r~TAtEi~~r~-~E~~~~LG~v~~rl~~E 395 (536) ...++.. ..+.+ ..+..+.....|-++|-.-. +... + ...+|-.-+.+.. 
.=....|.|.+.+++.+ T Consensus 328 ~~~~l~~~~~D~q-fle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ 406 (551) T protein:vir:80 328 KFVNMTPSARDME-FEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDF 406 (551) T ss_pred eEEEccCChhHHH-HHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 4444432 33455 34555667778888884321 1101 1 1112211122221 11223455555555444 Q ss_pred HHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcC Q lcl|NC_011045. 396 LQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIG 475 (536) Q Consensus 396 ~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~G 475 (536) |- + .++|+. +..+.++|......-. . +...+. ..+.. + .+.+++ +-+.+| T Consensus 407 ln------------~-~L~~~~-~~~~~f~f~~~~~~~~-~-~~~~~~---~~~~~-g------~lT~NE----~R~~~g 456 (551) T protein:vir:80 407 IN------------K-HIVAEF-GDKYTFQFVGGDIKSE-L-ESVKIL---AEKAK-V------AMTVNE----VRKELN 456 (551) T ss_pred HH------------h-hhcccc-CCceEEEeeccChhhH-H-HHHHHH---HHHhc-C------CcCHHH----HHHHhC Confidence 32 2 233433 3456777765432211 1 111111 11110 0 011111 222333 Q ss_pred CChh-----------hccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHhh----hcCcchHHhhhhcC Q lcl|NC_011045. 476 IDTS-----------GILLTEEQKQQKMAQQSMQMGMDNGAAALAQGM----------AAQA----TASPEAMAAAADSV 530 (536) Q Consensus 476 v~p~-----------~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~----------~~~~----~~~~~~~~~~~~~~ 530 (536) .+|. .+....+..+....+.+.++....+..+..+.. ..+. ....+...+-+... T Consensus 457 l~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~ 536 (551) T protein:vir:80 457 LPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDNANA 536 (551) T ss_pred CCCCCCCCceeecccccccccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCccccCCCccccccccCccccch Confidence 3321 000000000000000000000000000000000 0000 00000000111111 Q ss_pred CCCCCC Q lcl|NC_011045. 
531 GLQPGI 536 (536) Q Consensus 531 ~~q~~~ 536 (536) |.|+.- T Consensus 537 ~~~~~~ 542 (551) T protein:vir:80 537 GKQGMK 542 (551) T ss_pred hhhhcC Confidence 112211 No 178 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=90.19 E-value=0.022 Score=29.66 Aligned_cols=436 Identities=8% Similarity=0.004 Sum_probs=171.1 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccccccccc--ccchHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTP--WQAVGARGLNNLASKLMLALF 78 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~--~dst~~~a~~~Laa~l~~~lt 78 (536) ||...+.-.......+.. ..|.. -+..|..+..+++ . .+..+ .+..+..+|++.|..+ T Consensus 68 ~a~d~~~~~~~~~~~~~~---~~~~~------~~~~~~~~~~~~~--~----~l~a~Y~~~~l~r~iVd~~A~d~----- 127 (537) T protein:vir:10 68 MAMDGLDVEGGTFSAYAN---PNLSE------GLVLWYAQQAFIG--H----QMCALIATHWLVNKACSQMPRDA----- 127 (537) T ss_pred hhccccccchhhhhhhcc---ccccc------hhhhhccccCCcc--H----HHHHHHHhCchhhhhhhhhhHHh----- Confidence 332221111111110000 00000 0011111111111 0 01111 1233444444444433 Q ss_pred CCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecC--CCCcee Q lcl|NC_011045. 79 PMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEP--EGSNYN 156 (536) Q Consensus 79 P~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~--~~~~~~ 156 (536) .+.|+.+...+..-.+ . +..+.+.+.+.+-+++..+.++++..-.||.+++++.-+ ++. 
T Consensus 128 -~r~~~~i~~~~~~~~~----~-----------~~~~~l~~~~~~l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D~~--- 188 (537) T protein:vir:10 128 -MRKGYKIISDDGNELD----P-----------KDAKFIDRYDRAFNIKKHAIQFVRKGRIFGIRIALFKVDSPDPY--- 188 (537) T ss_pred -hcCCceeecCCccccc----H-----------HHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEeecCcCCc--- Confidence 2579998765431110 0 112233444455678999999999988899988776532 221 Q ss_pred eEEEEecceEEEeeCCCCCeEEE--EEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecC Q lcl|NC_011045. 157 PMKLYRLSSYVVQRDAFGNVLQM--VTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEG 234 (536) Q Consensus 157 ~~~~~~l~~~~v~~d~~G~v~~i--~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g 234 (536) .-.-||.--.| ..|.+..+ +-+...+. .++..+-.+..+ ..-++| +.|. | .+. .|.- T Consensus 189 -~~~~Pl~~~~i---~kg~~k~l~vidp~~~~~-~~~~~~~~dp~s-p~fg~P----~~y~-v----~g~------~iH~ 247 (537) T protein:vir:10 189 -YYEKPFNIDGV---MPGAYKGIVQIDPYWCAP-LLDAQASSNPVS-MHFYEP----TYWL-I----NGK------KYHR 247 (537) T ss_pred -ccccccccccc---cccceeEEEEechhhccc-ccchhhhccCCc-cccCCc----eeee-e----cCe------Eecc Confidence 01112211001 11222222 11111111 000111111000 000111 1111 1 011 0111 Q ss_pred ccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccc-cchhhhc- Q lcl|NC_011045. 235 MEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGI-TQPRRLT- 312 (536) Q Consensus 235 ~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~-~~~~~~~- 312 (536) .+++...| ...|+ +.+....-||++..+.++..++..........+...+..-..+.++.... .+.+.+. 
T Consensus 248 SRli~f~g----~~~p~----~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~~~~ 319 (537) T protein:vir:10 248 SHLAIYIN----DEVVD----FLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQFDE 319 (537) T ss_pred eeEEEecC----CCCch----hhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHHHHH Confidence 12222222 11233 22333345799999999999999999999999988888888776643221 1111111 Q ss_pred -----c---CCCcceecCCc-ccccccccccccchhHHHHHHHHHHHHHHHHHhh--hhcccCCCCCCCHHHHHHHHHHH Q lcl|NC_011045. 313 -----K---AQTGDFVTGRP-EDISFLQLEKQADFTVAKAVSDAIEARLSFAFML--NSAVQRTGERVTAEEIRYVASEL 381 (536) Q Consensus 313 -----~---~~~g~~~~g~~-~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~--~~~~~~~~~r~TAtEi~~r~~E~ 381 (536) . ...|.++-+.. +.+..+. .++..+...+....+.|.-++=. .-+...+....-|+ .++- T Consensus 320 r~~~~~~~r~n~g~~~id~e~e~~e~~~----~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~Glnat-----Ge~D 390 (537) T protein:vir:10 320 TMSWWTATRDNYQVRVVDKDNEDVVQID----TTLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNST-----GDYE 390 (537) T ss_pred HHHHHHhhcCCcceeEecCCCceeEEEe----ccCCCHHHHHHHHHHHHHhhhCCCceeeccCCccccccc-----hhHH Confidence 1 11233333332 3333222 24555566677777777766411 11111221222221 1112 Q ss_pred HHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHH---HHHHHHHHHHHHHHHHHHhhcchhhh Q lcl|NC_011045. 382 EDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEA---IGRGQDLDKLERCVAAWAALAPMRDD 458 (536) Q Consensus 382 ~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~---a~r~~~~~~l~~~~~~~~~~~p~~~~ 458 (536) ....--.+..++.+ +.|++++++.++++....+++ +++++|- ||.+ ..|+.-..+..+..+.+.+. 
T Consensus 391 ~~~yyd~I~~~Qe~-l~p~l~~l~~ll~~~~~~~~~---~~~i~f~-pL~~~s~kEkAei~~~~a~a~~~~~~~------ 459 (537) T protein:vir:10 391 EASYHEECESTQDD-MRPLIDRHHQLVCRSHLRKRI---RVKVEFP-PMDAPKESERADTFLKKMQAAKLAFEM------ 459 (537) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCCc---ceEEEeC-CCCCCCHHHHHHHHHHHHHHHHHHHHc------ Confidence 22233334555655 789999999998887655432 4677765 4433 33333223333333332221 Q ss_pred hcCCHHHHHHHHHHHcCCChhhccC--CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcc---hHHhhhhcCCCC Q lcl|NC_011045. 459 PDINLAMIKLRIANAIGIDTSGILL--TEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPE---AMAAAADSVGLQ 533 (536) Q Consensus 459 ~~id~d~~~~~~a~~~Gv~p~~i~r--s~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~---~~~~~~~~~~~q 533 (536) ..|+.+++-+++...-...-.++.. ++++.+....+.+. + . ........++. ......+.. .+ T Consensus 460 G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~~~~~~-----~--~----~~~~~~~~~~~~~~~~~~~~~~~-~~ 527 (537) T protein:vir:10 460 GAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDIDVDDEG-----K--P----VRIIEDQPAPSEMFGATSSGESA-ND 527 (537) T ss_pred CCCCHHHHHHHHhccCccccccccCCCChhhhhcccCCccC-----C--c----CCCCCCCCCccccCCCCcccccc-CC Confidence 1478888888887642110112211 12222111111000 0 0 00000000000 000000000 01 Q ss_pred CCC Q lcl|NC_011045. 534 PGI 536 (536) Q Consensus 534 ~~~ 536 (536) |+- T Consensus 528 ~~~ 530 (537) T protein:vir:10 528 PRD 530 (537) T ss_pred Ccc Confidence 111 No 179 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=89.83 E-value=0.024 Score=29.45 Aligned_cols=346 Identities=13% Similarity=0.082 Sum_probs=137.4 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHH-hcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_011045. 
1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQY-TIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFP 79 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~-~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP 79 (536) |++-+-.-... ...+.+|. ....+ ..|......+...-....-.-.++.-.|++.+|+.+-+ + T Consensus 32 ~~~~~~~~~~~---------~~~~~~~~----~~~~~~g~~~~~~~~~~~~~t~~~~~~~~~v~acV~~Ia~~iA~-l-- 95 (409) T protein:vir:83 32 MVEFRGPEEEP---------EARALPWI----RPTAWSGYPESWATPSWGSAQDKLRTLIDVAWACIDLNASVLSS-M-- 95 (409) T ss_pred eeeccCCCcch---------hhhhcccc----cccccccccccccccCccccchhhHhhhHHHHHHHHHHHHhhcc-C-- Confidence 33321111000 00011110 00000 01111111111000000111234455667777766533 2 Q ss_pred CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEE-EecCCCCceeeE Q lcl|NC_011045. 80 MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLY-LPEPEGSNYNPM 158 (536) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~-~~~~~~~~~~~~ 158 (536) |+....-... .+ ... ++-. ..=... .+.+.-+...+.+|+. ||+..+ +..+..+.++.+ T Consensus 96 --pl~~~~~~~~-~~----------~~~-~ll~--~~PN~~---~t~~~f~~~l~~~lll-Gnay~~~i~r~~~G~~~~L 155 (409) T protein:vir:83 96 --PIYRMRNGRI-ID----------SVA-WMSN--PDPEVY---TSWQEFAKQLFWDFQL-GEAFVLPMAHGSDGYPIRF 155 (409) T ss_pred --ceEEeeCCcc-cc----------chh-hhcc--cCCCCC---CCHHHHHHHHHHHHhh-CCcEEEEEEECCCCcEEEE Confidence 3332221100 00 000 0000 000000 1223334445667765 998765 334444444444 Q ss_pred EEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccc Q lcl|NC_011045. 159 KLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQ 238 (536) Q Consensus 159 ~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~ 238 (536) ..++.....+..+.+|.. ++.+.+... 
T Consensus 156 ~pl~p~~v~v~~~~~g~~----------------------------------------------------~y~~~~~~~- 182 (409) T protein:vir:83 156 RVVPPWLVNVELKKGARR----------------------------------------------------EYRIGGLNV- 182 (409) T ss_pred EEECCcceEEEEcCCceE----------------------------------------------------EEEEccccC- Confidence 444444444444433321 011111100 Q ss_pred ccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCC--- Q lcl|NC_011045. 239 GSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQ--- 315 (536) Q Consensus 239 ~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~--- 315 (536) .-.+++.|+....+..||.||.+.+...+...+...+.......-...|-.++.-++.++++...... T Consensus 183 ---------~~eiiHir~~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~nga~p~gil~~~~~ls~e~~~~~~~~~ 253 (409) T protein:vir:83 183 ---------TDEILHIRYQGNTADAHGHGPLESAAPRQVVIGLLQKYVQNLAETGGVPLYWLGVERRLSETEAVDLMDRW 253 (409) T ss_pred ---------ccceEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEeecCCCCCHHHHHHHHHHH Confidence 11356667666667789999999999888888888877777777778888777777766654432110 Q ss_pred -------Cc--ceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhh-cc--cCCCCCCCHHHHHHHHHHHH- Q lcl|NC_011045. 316 -------TG--DFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-AV--QRTGERVTAEEIRYVASELE- 382 (536) Q Consensus 316 -------~g--~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-~~--~~~~~r~TAtEi~~r~~E~~- 382 (536) .| .++.+......++.+ ++.++|.. +..+-....|-++|-.-. ++ ..+..+.|-.-+.+...... T Consensus 254 ~~~~~~nag~~~il~~g~~~~~~~~~-s~~d~q~l-e~r~~~~~eIa~~fgVPp~llg~~~~~~~~tysn~eq~~~~f~~ 331 (409) T protein:vir:83 254 IESRSKYAGHPALVTGGATLNQAKSM-SAQDLSLM-ELTQFNEARIAILLGVPPFLVGLPGATGSLTYSNIEQLFSFHDR 331 (409) T ss_pred HHhhCCccCccceecCCcccccccCC-CHHHHHHH-HHHHhhHHHHHHHhCCCHHHccCCCCccccccccHHHHHHHHHH Confidence 01 111111111111111 22344432 223334566778885432 11 12334445333333333333 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHH--HHHHHHHHHHHHHH--H--HHH---Hhhc Q lcl|NC_011045. 
383 DTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLE--AIGRGQDLDKLERC--V--AAW---AALA 453 (536) Q Consensus 383 ~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La--~a~r~~~~~~l~~~--~--~~~---~~~~ 453 (536) ..|.|.+.+++++|-.-| +++ ++-+++.+.+-|- -..|....+.+.+. + +.+ -.+. T Consensus 332 ~tL~P~~~~ie~~l~~~L-------------l~~--~~~~~f~~~~llr~d~~~r~~~~~~~~~~G~lT~NE~R~~~glp 396 (409) T protein:vir:83 332 SSLRPKATAVMAALDRWA-------------LPS--PQHLELNRDDYTRPSLVERATAYKIMIEAGVMEPNEARAMERLH 396 (409) T ss_pred HHHHHHHHHHHHHHHHhh-------------CCC--CcEEEeehhhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 366677777766654332 221 3334444322221 12233333322221 1 111 1232 Q ss_pred chh-hh----hcC Q lcl|NC_011045. 454 PMR-DD----PDI 461 (536) Q Consensus 454 p~~-~~----~~i 461 (536) |.. .| --+ T Consensus 397 p~~ggd~l~~~gv 409 (409) T protein:vir:83 397 SEAAAVRLSGGGV 409 (409) T ss_pred CCCCCcccCCCCC Confidence 321 11 013 No 180 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=89.09 E-value=0.028 Score=29.07 Aligned_cols=372 Identities=12% Similarity=0.056 Sum_probs=148.1 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHH----H-HHHHHHHHhcccccCCCCCcccccccccccchHH-HHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYE----T-RAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGA-RGLNNLASKLM 74 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e----~-~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~-~a~~~Laa~l~ 74 (536) |.= -+.+++.|.- ..|+.-. + .-..+..|. ....+... -...+.+..... 
.|++.+|+.+ T Consensus 1 M~~------~~r~~~~~~~--~~r~~~~~~~~~~~~~~~~~~~----g~~~~~~~-v~~~~al~~~~v~~~i~~ia~~i- 66 (432) T protein:vir:10 1 MKI------VDSVKKFFNF--EKRQTSQVIELNKDDEKLLEWL----GISPSTIS-VKGKNALKVATVFACIKILSESV- 66 (432) T ss_pred CCh------HHHHHHhcCc--cccCcccccccCCchHHHHHHh----CCCcCccc-cchhhhhccHHHHHHHHHHHHhh- Confidence 221 1112111110 0111100 0 001112221 11111000 001122333333 4455555544 Q ss_pred HhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-h----ccChHHHHHHHHHHHhhCcEEEEEec Q lcl|NC_011045. 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-S----NSYRVTLFEALKQLVVAGNVLLYLPE 149 (536) Q Consensus 75 ~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~----snf~~~~~~~~~dl~~~G~~~l~~~~ 149 (536) +.+ | |--....+....+ .. +.-+...|. + -+.+.-+..++.++..+||+.+++.. T Consensus 67 a~l-p---~~~~~~~~~~~~~---------~~-------~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r 126 (432) T protein:vir:10 67 SKL-P---LKIYQEDEYGIQR---------GT-------KHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEF 126 (432) T ss_pred ccC-c---eEEEEecCCceee---------cc-------ccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEE Confidence 322 2 2111111110000 00 111222222 1 23455567778888899999999987 Q ss_pred CCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEE Q lcl|NC_011045. 150 PEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRY 229 (536) Q Consensus 150 ~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~ 229 (536) +..+.+..+..++...+.+..|..|.+..-. .++..+ . .++..+.+ T Consensus 127 ~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~-------------------------------~~~y~~--~-~~g~~~~~ 172 (432) T protein:vir:10 127 DRKGKVQALWPIDASKVTVYIDDVGLLNSKT-------------------------------KMWYVV--N-TGGQQRVL 172 (432) T ss_pred CCCCcEEEEEEEcCceeEEEEcCcccccccc-------------------------------eEEEEE--e-cCCeEEEE Confidence 7766676666677677766666655432110 011000 0 01111110 Q ss_pred EEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchh Q lcl|NC_011045. 
230 EEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPR 309 (536) Q Consensus 230 ~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~ 309 (536) ...=++++|.....+..||.||...+...+.......+.......-...|.+++.-++.++++ T Consensus 173 -----------------~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e 235 (432) T protein:vir:10 173 -----------------KPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNED 235 (432) T ss_pred -----------------ccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHH Confidence 011245555544556689999999999999999999999999999999999887766655554 Q ss_pred hhccC---------C---CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhccc---CCCCCCCHHH Q lcl|NC_011045. 310 RLTKA---------Q---TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQ---RTGERVTAEE 373 (536) Q Consensus 310 ~~~~~---------~---~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~---~~~~r~TAtE 373 (536) ..... + .|.+. --+++....++.. +.+.+. .+..+..++.|-.+|-.-.... .++..-+++| T Consensus 236 ~~~~~~~~~~~~~~g~~n~~~~~-vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~ 313 (432) T protein:vir:10 236 AKKVFRENFESMSSGLQNSHRIA-LMPVGYQFQPISLNMSDAQF-LENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQ 313 (432) T ss_pred HHHHHHHHHHHHhcccccCCcce-ecCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH Confidence 32210 0 01111 1112223333332 334443 3445666788888885432111 1222223333 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCC-CcceEEEE-echHHH---HHHHHHHHHHHH--HH Q lcl|NC_011045. 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELP-KEAVEPTI-STGLEA---IGRGQDLDKLER--CV 446 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~-~~~v~v~~-vs~La~---a~r~~~~~~l~~--~~ 446 (536) .... =....|.|.+.+++++|-.-|+ ++.. ...+.++| ++.|-. ..|....+.+.. 
++ T Consensus 314 ~~~~--~~~~~l~P~~~~ie~~ln~kLl-------------~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~ 378 (432) T protein:vir:10 314 QQQQ--FYTDTLQATLTMYEQEMTYKLF-------------LDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFL 378 (432) T ss_pred HHHH--HHHHHHHHHHHHHHHHHHHhhc-------------ChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCc Confidence 2211 1233455666666655433222 1110 11122222 111110 111111111111 00 Q ss_pred --HHH---Hhhcch-hhhh------cCCHHHHH--------------HHHHHHc Q lcl|NC_011045. 447 --AAW---AALAPM-RDDP------DINLAMIK--------------LRIANAI 474 (536) Q Consensus 447 --~~~---~~~~p~-~~~~------~id~d~~~--------------~~~a~~~ 474 (536) +.+ -.+.|. -.|. ++-.|.+- +.--+.. T Consensus 379 t~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 379 KPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred CHHHHHHHhCCCCCCCCCeEeecccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 000 001110 0000 00000000 0000111 No 181 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=89.09 E-value=0.028 Score=29.07 Aligned_cols=372 Identities=12% Similarity=0.056 Sum_probs=148.1 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHH----H-HHHHHHHHhcccccCCCCCcccccccccccchHH-HHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYE----T-RAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGA-RGLNNLASKLM 74 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e----~-~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~-~a~~~Laa~l~ 74 (536) |.= -+.+++.|.- ..|+.-. + .-..+..|. ....+... -...+.+..... 
.|++.+|+.+ T Consensus 1 M~~------~~r~~~~~~~--~~r~~~~~~~~~~~~~~~~~~~----g~~~~~~~-v~~~~al~~~~v~~~i~~ia~~i- 66 (432) T protein:vir:10 1 MKI------VDSVKKFFNF--EKRQTSQVIELNKDDEKLLEWL----GISPSTIS-VKGKNALKVATVFACIKILSESV- 66 (432) T ss_pred CCh------HHHHHHhcCc--cccCcccccccCCchHHHHHHh----CCCcCccc-cchhhhhccHHHHHHHHHHHHhh- Confidence 221 1112111110 0111100 0 001112221 11111000 001122333333 4455555544 Q ss_pred HhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-h----ccChHHHHHHHHHHHhhCcEEEEEec Q lcl|NC_011045. 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-S----NSYRVTLFEALKQLVVAGNVLLYLPE 149 (536) Q Consensus 75 ~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~----snf~~~~~~~~~dl~~~G~~~l~~~~ 149 (536) +.+ | |--....+....+ .. +.-+...|. + -+.+.-+..++.++..+||+.+++.. T Consensus 67 a~l-p---~~~~~~~~~~~~~---------~~-------~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r 126 (432) T protein:vir:10 67 SKL-P---LKIYQEDEYGIQR---------GT-------KHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEF 126 (432) T ss_pred ccC-c---eEEEEecCCceee---------cc-------ccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEE Confidence 322 2 2111111110000 00 111222222 1 23455567778888899999999987 Q ss_pred CCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEE Q lcl|NC_011045. 150 PEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRY 229 (536) Q Consensus 150 ~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~ 229 (536) +..+.+..+..++...+.+..|..|.+..-. .++..+ . .++..+.+ T Consensus 127 ~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~-------------------------------~~~y~~--~-~~g~~~~~ 172 (432) T protein:vir:10 127 DRKGKVQALWPIDASKVTVYIDDVGLLNSKT-------------------------------KMWYVV--N-TGGQQRVL 172 (432) T ss_pred CCCCcEEEEEEEcCceeEEEEcCcccccccc-------------------------------eEEEEE--e-cCCeEEEE Confidence 7766676666677677766666655432110 011000 0 01111110 Q ss_pred EEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchh Q lcl|NC_011045. 
230 EEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPR 309 (536) Q Consensus 230 ~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~ 309 (536) ...=++++|.....+..||.||...+...+.......+.......-...|.+++.-++.++++ T Consensus 173 -----------------~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e 235 (432) T protein:vir:10 173 -----------------KPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNED 235 (432) T ss_pred -----------------ccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHH Confidence 011245555544556689999999999999999999999999999999999887766655554 Q ss_pred hhccC---------C---CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhccc---CCCCCCCHHH Q lcl|NC_011045. 310 RLTKA---------Q---TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQ---RTGERVTAEE 373 (536) Q Consensus 310 ~~~~~---------~---~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~---~~~~r~TAtE 373 (536) ..... + .|.+. --+++....++.. +.+.+. .+..+..++.|-.+|-.-.... .++..-+++| T Consensus 236 ~~~~~~~~~~~~~~g~~n~~~~~-vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~ 313 (432) T protein:vir:10 236 AKKVFRENFESMSSGLQNSHRIA-LMPVGYQFQPISLNMSDAQF-LENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQ 313 (432) T ss_pred HHHHHHHHHHHHhcccccCCcce-ecCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH Confidence 32210 0 01111 1112223333332 334443 3445666788888885432111 1222223333 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCC-CcceEEEE-echHHH---HHHHHHHHHHHH--HH Q lcl|NC_011045. 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELP-KEAVEPTI-STGLEA---IGRGQDLDKLER--CV 446 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~-~~~v~v~~-vs~La~---a~r~~~~~~l~~--~~ 446 (536) .... =....|.|.+.+++++|-.-|+ ++.. ...+.++| ++.|-. ..|....+.+.. 
++ T Consensus 314 ~~~~--~~~~~l~P~~~~ie~~ln~kLl-------------~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~ 378 (432) T protein:vir:10 314 QQQQ--FYTDTLQATLTMYEQEMTYKLF-------------LDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFL 378 (432) T ss_pred HHHH--HHHHHHHHHHHHHHHHHHHhhc-------------ChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCc Confidence 2211 1233455666666655433222 1110 11122222 111110 111111111111 00 Q ss_pred --HHH---Hhhcch-hhhh------cCCHHHHH--------------HHHHHHc Q lcl|NC_011045. 447 --AAW---AALAPM-RDDP------DINLAMIK--------------LRIANAI 474 (536) Q Consensus 447 --~~~---~~~~p~-~~~~------~id~d~~~--------------~~~a~~~ 474 (536) +.+ -.+.|. -.|. ++-.|.+- +.--+.. T Consensus 379 t~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 379 KPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred CHHHHHHHhCCCCCCCCCeEeecccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 000 001110 0000 00000000 0000111 No 182 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=89.09 E-value=0.028 Score=29.07 Aligned_cols=372 Identities=12% Similarity=0.056 Sum_probs=148.1 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHH----H-HHHHHHHHhcccccCCCCCcccccccccccchHH-HHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYE----T-RAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGA-RGLNNLASKLM 74 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e----~-~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~-~a~~~Laa~l~ 74 (536) |.= -+.+++.|.- ..|+.-. + .-..+..|. ....+... -...+.+..... 
.|++.+|+.+ T Consensus 1 M~~------~~r~~~~~~~--~~r~~~~~~~~~~~~~~~~~~~----g~~~~~~~-v~~~~al~~~~v~~~i~~ia~~i- 66 (432) T protein:vir:10 1 MKI------VDSVKKFFNF--EKRQTSQVIELNKDDEKLLEWL----GISPSTIS-VKGKNALKVATVFACIKILSESV- 66 (432) T ss_pred CCh------HHHHHHhcCc--cccCcccccccCCchHHHHHHh----CCCcCccc-cchhhhhccHHHHHHHHHHHHhh- Confidence 221 1112111110 0111100 0 001112221 11111000 001122333333 4455555544 Q ss_pred HhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-h----ccChHHHHHHHHHHHhhCcEEEEEec Q lcl|NC_011045. 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-S----NSYRVTLFEALKQLVVAGNVLLYLPE 149 (536) Q Consensus 75 ~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~----snf~~~~~~~~~dl~~~G~~~l~~~~ 149 (536) +.+ | |--....+....+ .. +.-+...|. + -+.+.-+..++.++..+||+.+++.. T Consensus 67 a~l-p---~~~~~~~~~~~~~---------~~-------~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r 126 (432) T protein:vir:10 67 SKL-P---LKIYQEDEYGIQR---------GT-------KHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEF 126 (432) T ss_pred ccC-c---eEEEEecCCceee---------cc-------ccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEE Confidence 322 2 2111111110000 00 111222222 1 23455567778888899999999987 Q ss_pred CCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEE Q lcl|NC_011045. 150 PEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRY 229 (536) Q Consensus 150 ~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~ 229 (536) +..+.+..+..++...+.+..|..|.+..-. .++..+ . .++..+.+ T Consensus 127 ~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~-------------------------------~~~y~~--~-~~g~~~~~ 172 (432) T protein:vir:10 127 DRKGKVQALWPIDASKVTVYIDDVGLLNSKT-------------------------------KMWYVV--N-TGGQQRVL 172 (432) T ss_pred CCCCcEEEEEEEcCceeEEEEcCcccccccc-------------------------------eEEEEE--e-cCCeEEEE Confidence 7766676666677677766666655432110 011000 0 01111110 Q ss_pred EEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchh Q lcl|NC_011045. 
230 EEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPR 309 (536) Q Consensus 230 ~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~ 309 (536) ...=++++|.....+..||.||...+...+.......+.......-...|.+++.-++.++++ T Consensus 173 -----------------~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e 235 (432) T protein:vir:10 173 -----------------KPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNED 235 (432) T ss_pred -----------------ccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHH Confidence 011245555544556689999999999999999999999999999999999887766655554 Q ss_pred hhccC---------C---CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhccc---CCCCCCCHHH Q lcl|NC_011045. 310 RLTKA---------Q---TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQ---RTGERVTAEE 373 (536) Q Consensus 310 ~~~~~---------~---~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~---~~~~r~TAtE 373 (536) ..... + .|.+. --+++....++.. +.+.+. .+..+..++.|-.+|-.-.... .++..-+++| T Consensus 236 ~~~~~~~~~~~~~~g~~n~~~~~-vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~ 313 (432) T protein:vir:10 236 AKKVFRENFESMSSGLQNSHRIA-LMPVGYQFQPISLNMSDAQF-LENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQ 313 (432) T ss_pred HHHHHHHHHHHHhcccccCCcce-ecCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH Confidence 32210 0 01111 1112223333332 334443 3445666788888885432111 1222223333 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCC-CcceEEEE-echHHH---HHHHHHHHHHHH--HH Q lcl|NC_011045. 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELP-KEAVEPTI-STGLEA---IGRGQDLDKLER--CV 446 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~-~~~v~v~~-vs~La~---a~r~~~~~~l~~--~~ 446 (536) .... =....|.|.+.+++++|-.-|+ ++.. ...+.++| ++.|-. ..|....+.+.. 
++ T Consensus 314 ~~~~--~~~~~l~P~~~~ie~~ln~kLl-------------~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~ 378 (432) T protein:vir:10 314 QQQQ--FYTDTLQATLTMYEQEMTYKLF-------------LDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFL 378 (432) T ss_pred HHHH--HHHHHHHHHHHHHHHHHHHhhc-------------ChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCc Confidence 2211 1233455666666655433222 1110 11122222 111110 111111111111 00 Q ss_pred --HHH---Hhhcch-hhhh------cCCHHHHH--------------HHHHHHc Q lcl|NC_011045. 447 --AAW---AALAPM-RDDP------DINLAMIK--------------LRIANAI 474 (536) Q Consensus 447 --~~~---~~~~p~-~~~~------~id~d~~~--------------~~~a~~~ 474 (536) +.+ -.+.|. -.|. ++-.|.+- +.--+.. T Consensus 379 t~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 379 KPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred CHHHHHHHhCCCCCCCCCeEeecccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 000 001110 0000 00000000 0000111 No 183 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=88.95 E-value=0.029 Score=29.00 Aligned_cols=368 Identities=10% Similarity=0.008 Sum_probs=142.7 Q ss_pred CCCccccccHHHHHHHHHHHHHHhh--------------------hHH-HHHH-----HHHHHhcccccCCCCCcccccc Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRA--------------------PYE-TRAQ-----NCAQYTIPSLFPKDSDNASTDY 54 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~--------------------~~e-~~w~-----e~~~~~~P~~~~~~~~~~~~~~ 54 (536) |- .|+.++...+ .|- ..|. .+.+|+- .....+. .-.. 
T Consensus 1 Mg-------------l~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~--~~~~~g~--~v~~ 63 (431) T protein:vir:10 1 MG-------------LFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIR--RGELNGG--TGRE 63 (431) T ss_pred Cc-------------chhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhc--cCccCcc--eech Confidence 22 1111111000 000 0000 0111110 0000000 0000 Q ss_pred cccc-cchHHHHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----cChH Q lcl|NC_011045. 55 VTPW-QAVGARGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SN----SYRV 128 (536) Q Consensus 55 ~~~~-dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~ 128 (536) .+-. .++--.|++.+|+.+- .+ |=. -++- .+.. .. ..+.-+...|. +- +.+. T Consensus 64 ~~al~~~~V~~ci~~Ia~~iA-~l-p~~-v~~~--~~~~-~~----------------~~~~~~~~lL~~~PN~~~t~~~ 121 (431) T protein:vir:10 64 TRALRNMAVLRCVTLISGTIG-ML-PMN-LISS--DDSK-QV----------------LTDDPAHRLLKYKPNDWQTPME 121 (431) T ss_pred hhhhccHHHHHHHHHHHHhhc-cC-ceE-EEEe--cCce-ee----------------eccchHHHHHhhccCCCCCHHH Confidence 1111 2334455566665553 22 311 1221 1100 00 00111222222 11 2333 Q ss_pred HHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCC Q lcl|NC_011045. 129 TLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKAD 208 (536) Q Consensus 129 ~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~ 208 (536) -...+..++...|||++++..+. +.++.+..++...+.+..+.+|.+- |+ T Consensus 122 f~~~l~~~lll~Gna~~~i~r~~-g~~~~L~pl~~~~v~~~~~~~~~~~--y~--------------------------- 171 (431) T protein:vir:10 122 FKSLMQLRALLDGESMARIVWSG-NRPIRLIPMDRGSAKGRLTSTWQIV--YD--------------------------- 171 (431) T ss_pred HHHHHHHHHhhcCCeEEEEEEcC-CceEEEEEEcCceeEEEEcCCCeEE--EE--------------------------- Confidence 45666788889999999987764 4555444444455555555554321 10 Q ss_pred ceEEEEEEEEecCCCCceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 
209 ETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVK 288 (536) Q Consensus 209 ~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~ 288 (536) .. ..++ ... .+...+ ++++|....+| .||.||...+...+.......+.... T Consensus 172 -------~~--~~~g-~~~---~~~~~d--------------ViHir~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~ 223 (431) T protein:vir:10 172 -------YT--TPTG-DKI---ELPARE--------------VFHLRDLSIDG-VSGVSRVKLSGNALELAEQAERAASR 223 (431) T ss_pred -------EE--eCCc-eEE---EEchhh--------------EEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHH Confidence 00 0011 100 011111 23344333334 89999999999999999999998888 Q ss_pred HHHHHhCCceeeccccccchhhhccC---------C---CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHH Q lcl|NC_011045. 289 MSMISSKVIGLVNPAGITQPRRLTKA---------Q---TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFA 355 (536) Q Consensus 289 ~~~~a~~p~~lv~~~g~~~~~~~~~~---------~---~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~a 355 (536) ...-...|-.++.-++.++++..... + .|.+. --.++....++.. +.+.+.. +..+..+..|-++ T Consensus 224 ~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g~~n~g~~~-vl~~g~~~~~l~~~~~d~q~l-e~r~~~~~~Ia~~ 301 (431) T protein:vir:10 224 TFRTGVMAGGAIEVPKELSDNAYGRMKASVQENHTGSENAGSWM-LLEEGATAKQFSNTAASAQQI-ENRNHQIEEVARM 301 (431) T ss_pred HHhccCCccEEEecCCCCCHHHHHHHHHHHHHHhcCccccCCce-ecCCCceEEEccCChhHHHHH-HHHHHhHHHHHHH Confidence 88888888877776666666543211 0 01111 1112223333332 2344433 3334445677777 Q ss_pred HhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEE-echHH--- Q lcl|NC_011045. 356 FMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTI-STGLE--- 431 (536) Q Consensus 356 f~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~-vs~La--- 431 (536) |-......-...+-|-.-+.+..... ...-|.|++.++-..+.+. 
++++-......++| ++.|- T Consensus 302 fgVPp~~lg~~~~~t~sn~eq~~~~f-----------~~~tL~P~~~~ie~~ln~~-Ll~~~~~~~~~~~fd~~~llr~d 369 (431) T protein:vir:10 302 YGVPRPLLMMDDTSWGSGIEQLAIFF-----------IQYGLSHWFVSWEQAAARA-FLPEKMLGQRQFKFNEGALLRGT 369 (431) T ss_pred hCCCHHHhCCCCCCccccHHHHHHHH-----------HHHHHHHHHHHHHHHHHhh-ccChhhcCCceEEEechhhhccC Confidence 74432111122222333222222222 2223555555444444322 33332112233333 22221 Q ss_pred HHHHHHHHHHHHH------HH--HHH---Hhhcc---hhhhhcCCHHHHHHHHHHHcCCChhhc Q lcl|NC_011045. 432 AIGRGQDLDKLER------CV--AAW---AALAP---MRDDPDINLAMIKLRIANAIGIDTSGI 481 (536) Q Consensus 432 ~a~r~~~~~~l~~------~~--~~~---~~~~p---~~~~~~id~d~~~~~~a~~~Gv~p~~i 481 (536) -..|....+.+.. |+ +.+ -.+.| ...|....+-... ...++.- +|+.- T Consensus 370 ~~~r~~~~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~~~gD~~~~p~n~~-~~~~~~~-~p~~~ 431 (431) T protein:vir:10 370 LNDQAAFFSKALGAGGQSPWMKQNEVREMLDLPRADDPVADQLRNPMTQK-QKGSGDE-PPATT 431 (431) T ss_pred HHHHHHHHHHHHhcccccCccCHHHHHHHhCCCCCCCccccceecccccc-cCCCCCC-CCCCC Confidence 1222222222211 01 111 11223 1233322221111 1222222 23322 No 184 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=88.77 E-value=0.03 Score=28.91 Aligned_cols=408 Identities=14% Similarity=0.083 Sum_probs=152.1 Q ss_pred CccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccccccccccc-----------chHHHHHHHHHH Q lcl|NC_011045. 3 EKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQ-----------AVGARGLNNLAS 71 (536) Q Consensus 3 ~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~d-----------st~~~a~~~Laa 71 (536) .=+..|+..++ .||...++. ..++ .. 
..+ ........+|+ ++...|++.+|+ T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~-----~~~~--------~~-~~~--~~~~~~~pp~~~~~La~~~~~n~~v~scI~~ia~ 63 (540) T protein:vir:41 1 MFNYHLSIKSL-EKYRAIKGD-----TDSQ--------AL-KED--RFEEYVEPKVHPLVLLSLLQVNPYHASACSIKAN 63 (540) T ss_pred CCCcccChhhc-cchhhhhcc-----cccc--------cc-ccC--CCCccccCCCCHHHHHHHHHhcHHHHHHHHHHHH Confidence 11222222222 122222221 0000 00 000 00001111121 222334444444 Q ss_pred HHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCC Q lcl|NC_011045. 72 KLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPE 151 (536) Q Consensus 72 ~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~ 151 (536) .+ +++ +| ++...+.. +..++-. ..-+++.-+...+.|+.++|||.+++..+. T Consensus 64 ~i-a~~----~~-~i~~~~~~-------------~~~~lpN---------~~~t~~~f~~~~v~dlll~Gnayv~i~r~~ 115 (540) T protein:vir:41 64 DI-LRT----GY-LIDGDDGG-------------VEELLRA---------CRPSFEFILLQALEDLQVFNYCTLEVVRDD 115 (540) T ss_pred HH-hcC----Cc-eEecCccc-------------hhhhccC---------CCCCHHHHHHHHHHHHHhcCCeEEEEEECC Confidence 44 221 33 33332221 1111110 112455666778889999999999988777 Q ss_pred CCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEE Q lcl|NC_011045. 152 GSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEE 231 (536) Q Consensus 152 ~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~ 231 (536) .+.+..+..++...+-+.++..+.+ . ..++ ....|.. T Consensus 116 ~G~~~~L~~i~~~~V~v~~~~~~~~--------------------------------------~----~~d~-~~~~~~~ 152 (540) T protein:vir:41 116 QGEPVRLDYIPAHTVRVHRDGSRYM--------------------------------------Q----TWDG-IHVTYFK 152 (540) T ss_pred CCcEEEEEEeCCcceEEeEcCceeE--------------------------------------e----eecC-ceeeeee Confidence 6666555555545444443333210 0 0000 0111111 Q ss_pred ecCc-c-cccccc--ccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccc Q lcl|NC_011045. 
232 VEGM-E-VQGSDG--TYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQ 307 (536) Q Consensus 232 v~g~-~-i~~~~~--~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~ 307 (536) .++. . +....+ .+.+...-.+++|+....+..||.||..-++..+.......+.....-.-...|.+++.-.|... T Consensus 153 ~~~~~~~~~~~~g~~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~ 232 (540) T protein:vir:41 153 DYRYEGEVNPDNGEDQDGVGANEIIFIHLPSPICSYYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFE 232 (540) T ss_pred cccccceeeccccccceeecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccC Confidence 1000 0 000001 11112233677777766677999999999999888888887777777777778876654333222 Q ss_pred hh-h------------hcc----------CCCcc-e-ec---CCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_011045. 308 PR-R------------LTK----------AQTGD-F-VT---GRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFML 358 (536) Q Consensus 308 ~~-~------------~~~----------~~~g~-~-~~---g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~ 358 (536) .+ . +.. ...|. + +. +..+++...++.. ..+.+ ..+..+..++.|-.+|-. T Consensus 233 ~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~q-fle~~~~~~~eIa~afgV 311 (540) T protein:vir:41 233 DEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELS-FREYAAEKKHDIAAAHMI 311 (540) T ss_pred chhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCcccceeEEecccchhHHH-HHHHHHHHHHHHHHHhCC Confidence 11 0 100 01111 1 11 1223445555543 33455 445566778889899864 Q ss_pred hhccc--CCC---CCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHH Q lcl|NC_011045. 359 NSAVQ--RTG---ERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAI 433 (536) Q Consensus 359 ~~~~~--~~~---~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a 433 (536) ..... .+. ..-++++.... =....|.|...++..+ +.+ .++++... .+.++|... .. 
T Consensus 312 Pp~~lG~~~~~~~n~sn~eq~~~~--f~~~tL~P~~~~ie~~------------ln~-~L~~~~~~-~~~i~f~~~--~l 373 (540) T protein:vir:41 312 DPYRLGITDVGPLGGNFAEVARRT--YYESVVRPQQEIVSSV------------LTD-FIQLKLDP-GARFVFNEE--IL 373 (540) T ss_pred CHHHcCcccCCCCCcccHHHHHHH--HHHHHHHHHHHHHHHH------------HHH-hhhhccCC-ceEEEecch--hh Confidence 32111 111 11233333222 1223445555555444 332 13333332 456666432 11 Q ss_pred HHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhh--ccCC----HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 434 GRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSG--ILLT----EEQKQQKMAQQSMQMGMDNGAAA 507 (536) Q Consensus 434 ~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~--i~rs----~~ev~~~~~q~~~q~~~~~~a~~ 507 (536) .+..-... +..+.+.+ .+.++++-.. ..|++|.. ++.+ ..++. .+... +...+. . T Consensus 374 l~~D~~~~----~~~lv~~G------~lT~NE~Re~---L~g~e~gdd~~l~p~n~~~~~~~---~~~~~--~~~~~~-~ 434 (540) T protein:vir:41 374 MESEFVHN----YALLVQCG------VLTPSEVREK---LFGLDGGPDMFMVPSSIGKSAMK---RQKRN--YEKNQI-N 434 (540) T ss_pred cchHHHHH----HHHHHhCC------CCCHHHHHHH---hCcCcCCCccccccccccccccc---ccccc--cCCCCc-c Confidence 22211111 11111111 2344443211 13554321 1110 00000 00000 000000 0 Q ss_pred HHHHHHH--hhhcCcchHHh-hhh--cCCC-----------CCCC Q lcl|NC_011045. 508 LAQGMAA--QATASPEAMAA-AAD--SVGL-----------QPGI 536 (536) Q Consensus 508 ~~~~~~~--~~~~~~~~~~~-~~~--~~~~-----------q~~~ 536 (536) +.....+ +...++..... 
..+ ...+ +++- T Consensus 435 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (540) T protein:vir:41 435 EIKRTYAKYKPRIQEIISSESPLEDKKKKIDEVLSDFRAEAYENG 479 (540) T ss_pred ccccccchhcccccCccccccccccccccccccccccCCccccch Confidence 0000000 00000000000 000 0000 1111 No 185 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=88.46 E-value=0.032 Score=28.77 Aligned_cols=342 Identities=15% Similarity=0.078 Sum_probs=135.2 Q ss_pred CCCccccccHHHHHHHHHHHHHHh---hhHHHHHHH-HHHHhcccccCCCC-CcccccccccccchHHHHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDR---APYETRAQN-CAQYTIPSLFPKDS-DNASTDYVTPWQAVGARGLNNLASKLML 75 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R---~~~e~~w~e-~~~~~~P~~~~~~~-~~~~~~~~~~~dst~~~a~~~Laa~l~~ 75 (536) |-- |+.+.-.| .++...|-+ ...+. .....+ ..+.. .-.-.++--.|++.+|+.+.+ T Consensus 1 Mg~-------------~~~~~~~k~~~~~~~~~~~~~~~~~~---~~~~~~~~v~~~--~~l~~~~v~~~i~~ia~~ia~ 62 (383) T protein:vir:10 1 MGL-------------LTPKNFSKRNAKNMVYPSNPAFFTTT---VGGMQLSYVSAL--SALQNTNVYSVINRIASDVSS 62 (383) T ss_pred CCc-------------ccccccccccccccccccchhhhhhh---ccCccccccchh--HhhcchHHHHHHHHHHHhhcc Confidence 332 11110001 111111110 01111 111000 00000 011123334455555554432 Q ss_pred hhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHh----ccChHHHHHHHHHHHhhCcEEEEEecCC Q lcl|NC_011045. 76 ALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIES----NSYRVTLFEALKQLVVAGNVLLYLPEPE 151 (536) Q Consensus 76 ~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~----snf~~~~~~~~~dl~~~G~~~l~~~~~~ 151 (536) + | | ++. +.. .. 
..|.+ -+.+.-+..++.++..+|||.+++..+ T Consensus 63 -~-~---~-~~~--~~~-------------~~-----------~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~- 109 (383) T protein:vir:10 63 -A-H---F-KTE--NTA-------------TL-----------NRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ- 109 (383) T ss_pred -C-c---e-eec--ccc-------------hh-----------hhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC- Confidence 2 3 2 211 100 00 01111 133444566778888899999987532 Q ss_pred CCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEE Q lcl|NC_011045. 152 GSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEE 231 (536) Q Consensus 152 ~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~ 231 (536) ....+|+....|....++ +. .+|. +... T Consensus 110 -----~~~~~p~~~~~v~~~~~~---------------------------------~~--~~~~-~~~~----------- 137 (383) T protein:vir:10 110 -----NLEHIPNSDVQINYLPGN---------------------------------MG--IVYT-VLES----------- 137 (383) T ss_pred -----ceeEeecCcceEEEEEcC---------------------------------Cc--eEEE-EEEc----------- Confidence 134455544433211100 00 0111 1000 Q ss_pred ecCccccccccccccccCceEEEeeeecCC--CccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccc-ccch Q lcl|NC_011045. 232 VEGMEVQGSDGTYPKEACPYIPIRMVRLDG--ESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAG-ITQP 308 (536) Q Consensus 232 v~g~~i~~~~~~~~~~~~P~~~~rw~~~~g--e~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g-~~~~ 308 (536) ..|..+ . +...-++++|....++ ..||.||..-+...+.......+.......-...|..++.-.+ ..+. T Consensus 138 ~~~~~~-----~--~~~~evih~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~~ 210 (383) T protein:vir:10 138 NDRPKM-----V--LRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDG 210 (383) T ss_pred CCceEE-----E--EcccceEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCH Confidence 011100 0 1122345555443333 3589999999999999999999999999888888886665544 3334 Q ss_pred hhhcc----------CC-CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh--cccCCCCCCCHHHH Q lcl|NC_011045. 
309 RRLTK----------AQ-TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS--AVQRTGERVTAEEI 374 (536) Q Consensus 309 ~~~~~----------~~-~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~~~r~TAtEi 374 (536) +.... +. .|.+ .--.++....++.. ..+.+...+..+..+..|-.+|-... +...+....|...+ T Consensus 211 e~~~~~~~~~~~~~~~~n~~~~-~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~ 289 (383) T protein:vir:10 211 KDLESAREEFEKANTGDNSGRL-MVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNI 289 (383) T ss_pred HHHHHHHHHHHHHhCccccCCc-cccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccH Confidence 32211 11 1111 11122233344443 23455444566777888888884422 11122233333333 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHH--HHHHHHHHHHHHHH----HHH Q lcl|NC_011045. 375 RYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLE--AIGRGQDLDKLERC----VAA 448 (536) Q Consensus 375 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La--~a~r~~~~~~l~~~----~~~ 448 (536) .+........|.|.+..+++++-.-+ .+..+++.+...+. ...++..+..+.+. .+. T Consensus 290 eq~~~~~~~~l~P~~~~ie~~l~~~l-----------------~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE 352 (383) T protein:vir:10 290 DQIKATYLANLNSYVNPIVDELRLKM-----------------NAPDLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQ 352 (383) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh-----------------CCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHH Confidence 33333344456666666666643211 11234444333221 11222222222210 111 Q ss_pred HH---hhcch----hhh--hcCCHHHHHHHHHHHcCCChh Q lcl|NC_011045. 449 WA---ALAPM----RDD--PDINLAMIKLRIANAIGIDTS 479 (536) Q Consensus 449 ~~---~~~p~----~~~--~~id~d~~~~~~a~~~Gv~p~ 479 (536) +- .+.|. ... ...+.- =|=+-. 
T Consensus 353 ~R~~lg~~p~~~~d~~~~~~~~~~~---------~gGd~e 383 (383) T protein:vir:10 353 AQFILTRSGFLPDNLPEFKPLTNET---------KGGDDK 383 (383) T ss_pred HHHHhCCCcccCCcccccCCCcccC---------CCCCCC Confidence 11 11111 000 001110 011111 No 186 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=88.25 E-value=0.033 Score=28.67 Aligned_cols=308 Identities=14% Similarity=0.078 Sum_probs=116.7 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhH------------HHH--HHHHHHHhcccccCCCCCcccccccccccchHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPY------------ETR--AQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGL 66 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~------------e~~--w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~ 66 (536) |++.+..-.+..+........+....- ++. ..++.+|+ -+-.+ + .-..-+++-.|- + T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~---~~~~~---~-~~~~pp~~~~~l--a 71 (350) T protein:vir:11 1 MSKRRSHRRQQPVTVQSAQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYL---ECWPN---G-RWYEPPLSMEGL--A 71 (350) T ss_pred CCccccCCCcCccccCCcchhhhccccccceEEEEeCCceeecCcchhhHHH---HHhhc---C-ccccCCCCHHHH--H Confidence 777543222211111111111100000 000 00011111 00000 0 001111111110 0 Q ss_pred HHH-HHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhcc---ChHHHHHHHHHHHhhCc Q lcl|NC_011045. 67 NNL-ASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNS---YRVTLFEALKQLVVAGN 142 (536) Q Consensus 67 ~~L-aa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~sn---f~~~~~~~~~dl~~~G~ 142 (536) +.+ ++..++... +|+.. 
.+...+ +-| -+..+.++..|+.+||| T Consensus 72 ~~~~~~~~h~~~l----~~k~n----------------------------~l~~~~-~Pn~~~t~~~f~~~v~d~ll~Gn 118 (350) T protein:vir:11 72 KSVGSSVYLQSGL----KFKRN----------------------------MLAKTF-IPHRLLSRATFEQFSLDWLTFGS 118 (350) T ss_pred HHHhhhhhhccch----hhhhh----------------------------hhhhcc-cCCCCCCHHHHHHHHHHHHhcCC Confidence 000 000110000 11100 000000 001 12234456678889999 Q ss_pred EEEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCC Q lcl|NC_011045. 143 VLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDED 222 (536) Q Consensus 143 ~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~ 222 (536) |.+++..+..+.++. .+|+...++.+..+|+. | | .+ ..+ T Consensus 119 ay~~~~rn~~G~~~~--L~~l~~~~vr~~~~~~~---~---------------------------------~-~~--~~~ 157 (350) T protein:vir:11 119 AYLEQPRSRLGTRMP--LQAPLAKYMRRGTDLET---F---------------------------------Y-QV--RSW 157 (350) T ss_pred eEEEEEEcCCCCEEE--EEEeCCceeEeeecCCe---E---------------------------------E-EE--eeC Confidence 999987776665544 44544445544322210 0 0 00 000 Q ss_pred CCceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec- Q lcl|NC_011045. 223 SGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN- 301 (536) Q Consensus 223 ~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~- 301 (536) + ... . |..--++.+|.....+.+||.+|..-++..+..-+..++-....-.-...|-+++. T Consensus 158 ~-~~~---~--------------~~~~eVihir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~~~gil~~ 219 (350) T protein:vir:11 158 K-DEH---E--------------FEKGSVIQLREADINQEIYGVPEWFCALQSALLNESATLFRRKYYNNGSHAGFILYM 219 (350) T ss_pred C-eEE---E--------------ECcccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEe Confidence 0 000 0 01111445554444567999999999888887777666666666666667775543 Q ss_pred cccccchhhhc-------c-CCCc----cee--c-CCcccccccccccc-cchhHHHHHHHHHHHHHHHHHhhhh----c Q lcl|NC_011045. 
302 PAGITQPRRLT-------K-AQTG----DFV--T-GRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS----A 361 (536) Q Consensus 302 ~~g~~~~~~~~-------~-~~~g----~~~--~-g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~----~ 361 (536) ++...+.++.. . .+.| -++ + |..+++...++... .+.+ ..+..+..++.|-.+|-.-. . T Consensus 220 ~~~~ls~e~~~~l~~~~~~~~G~~N~~~~~v~~~~g~~~g~~~~pl~~~~~d~q-f~e~k~~~~~eIa~a~~VPp~llGi 298 (350) T protein:vir:11 220 TDAAQNEEDIDALRTALKTAKGPGNFRNLFVYAPNGKKEGIQLIPVSEVAAKDE-FGSIKNISRDDQLAGLRVYPQLMGV 298 (350) T ss_pred cCCCCCHHHHHHHHHHHHHhcCccccCceeeecCCCCccceEEEEcCCChhHHH-HHHHHHHhHHHHHHHhCCCHHHhcc Confidence 34444443322 1 0111 122 1 22344556665533 3444 34555566677888884321 1 Q ss_pred ccCCC-CCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEE-EE-echH Q lcl|NC_011045. 362 VQRTG-ERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEP-TI-STGL 430 (536) Q Consensus 362 ~~~~~-~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v-~~-vs~L 430 (536) ...+. ..-++++....- ....|.|...+++ ++.. ++. + +.+++ +| ++.| T Consensus 299 ~~~~t~~~sn~e~~~~~f--~~~~L~P~~~~ie-~ln~----~l~---------~----~~~~F~~~~~~~l 350 (350) T protein:vir:11 299 VPQNAGGFGSISDAAAVW--ASLELAPMQTRLQ-QVNE----MIG---------E----EVVRFAQFDAPGL 350 (350) T ss_pred cCCCCCCcCCHHHHHHHH--HHHHHHHHHHHHH-HHHh----hcC---------c----cccccCcccccCC Confidence 11211 122344433221 1223455555554 2221 111 1 11111 11 2233 No 187 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=88.15 E-value=0.034 Score=28.63 Aligned_cols=480 Identities=12% Similarity=-0.021 Sum_probs=187.4 Q ss_pred CCC---ccccccHHHHHHHHHHHHHHhhhHHHHHH-HHHHHhcccccCCCC-Cccc----cccc--ccccchHHHHHHHH Q lcl|NC_011045. 
1 MAE---KRTGLAEEGAKSVYERLKNDRAPYETRAQ-NCAQYTIPSLFPKDS-DNAS----TDYV--TPWQAVGARGLNNL 69 (536) Q Consensus 1 Ma~---~~~~~~~~~~~~r~~~l~~~R~~~e~~w~-e~~~~~~P~~~~~~~-~~~~----~~~~--~~~dst~~~a~~~L 69 (536) |-= -...++.....+|.. .+..++.|+.--. .....--|.+..... .... .+.. -..++.+..+++.+ T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~-ar~~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~ 79 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLA-AREAIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRL 79 (548) T ss_pred CchHHhHhhhcchHHHHHHHH-hHHHhccccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 221 112222222222211 1111122221100 000000000100000 0000 0111 13678899999999 Q ss_pred HHHHHHh--hcCC-CcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEE Q lcl|NC_011045. 70 ASKLMLA--LFPM-QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLY 146 (536) Q Consensus 70 aa~l~~~--ltP~-~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~ 146 (536) ++.+++. +.+. ++ +........++. ......-+.|.+.| +.-.+.+||.....++...++-|-+++- T Consensus 80 ~~nvVG~~G~~i~p~~---l~~d~~~a~~l~--~~ie~~w~~Wa~~~-----D~~g~~~f~~lq~l~~R~~~~dGE~f~~ 149 (548) T protein:vir:95 80 EERVVGGSGIGVEPLP---LRLDGSVHAELA--MEIRSAWAEWSLSP-----ETSGELTRPQVERLMCRTWLRDGEGLAQ 149 (548) T ss_pred HHhccCccccceeeee---cCCCHHHHHHHH--HHHHHHHHHhhcCc-----cccccCCHHHHHHHHHHHHHhCCceEEE Confidence 9999973 4442 23 222211111110 01111223343322 1223568999999999999999988764 Q ss_pred EecCCC-----Ccee--eEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEe Q lcl|NC_011045. 147 LPEPEG-----SNYN--PMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYL 219 (536) Q Consensus 147 ~~~~~~-----~~~~--~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p 219 (536) ...... +..+ .++.+....+....+. .+. .|+..|+. 
T Consensus 150 ~~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~----------------------------------~~~--~i~~GIE~ 193 (548) T protein:vir:95 150 KLMGRVPNYTFATSVPFALELLEPDYLPFSYNN----------------------------------LSK--GIVQGIER 193 (548) T ss_pred eeecccccccCCcccceEEEEechhhcCCCCCC----------------------------------CCC--ceeeeeEE Confidence 432211 1111 2222222221100000 001 24455555 Q ss_pred cCCCCceeEEEEe--cCccccccccccccccCc---eEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011045. 220 DEDSGEYIRYEEV--EGMEVQGSDGTYPKEACP---YIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISS 294 (536) Q Consensus 220 ~~~~~~~~~~~~v--~g~~i~~~~~~~~~~~~P---~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~ 294 (536) +..+....+|..- .|..... .+...+...| +++.-....+|..=|.+..-.+|..++.|..+..+.+.++..++ T Consensus 194 D~~Grp~aY~i~~~hPgd~~~~-~~~~~~~rvpA~~VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A 272 (548) T protein:vir:95 194 DTWRRKRAYHLLKDHPGNLQTL-GGSLAVKRVEAERIIHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISA 272 (548) T ss_pred CCCCceEEEEEeecCCCccccc-ccccceeeechhHheecccccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhh Confidence 5443322222111 1110000 0000011122 23333445688999999999999999999999999999999999 Q ss_pred CCceeeccc-cc--------cchhhhccCCCcceecC-Ccc-cccccccc-cccchhHHHHHHHHHHHHHHHHHh--hhh Q lcl|NC_011045. 295 KVIGLVNPA-GI--------TQPRRLTKAQTGDFVTG-RPE-DISFLQLE-KQADFTVAKAVSDAIEARLSFAFM--LNS 360 (536) Q Consensus 295 ~p~~lv~~~-g~--------~~~~~~~~~~~g~~~~g-~~~-~~~~~~~~-~~~~~~~~~~~i~~~~~rI~~af~--~~~ 360 (536) ...+.+..+ +- ..........+|.+++. .++ ++.+..-. ..++|. .....+...|..++= +.. 
T Consensus 273 ~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~pG~iv~~L~pGe~i~~~~p~~p~~~~~---~f~~~~lr~IAaglGipYe~ 349 (548) T protein:vir:95 273 ALAMYIKKGNPDSYTVEPGKDRKNRTIPIAPGMVFDDLEPGEDVGMIESNRPNPFLE---GFRNGQLRMIGAGTRSTYSS 349 (548) T ss_pred hheeeeecCCCccccCCCCcccccccccccCCccccccCCCceeeecCCCCCCCCHH---HHHHHHHHHHHhhcCCCHHH Confidence 988776532 10 01111122345665543 332 33333222 122333 333333444555441 122 Q ss_pred cccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCC----cceEEEEechH----HH Q lcl|NC_011045. 361 AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPK----EAVEPTISTGL----EA 432 (536) Q Consensus 361 ~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~----~~v~v~~vs~L----a~ 432 (536) +. .|-. .+=.-+++-..|..+.+...=..+...|+.|+..+++......|.++-+.. ..+.++++.|= .- T Consensus 350 lt-gD~s-~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP 427 (548) T protein:vir:95 350 VS-RAYD-GTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINP 427 (548) T ss_pred Hh-cccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccCh Confidence 22 2322 255555666666666666555566778899999999999999999873322 12455555431 11 Q ss_pred HHHH-HHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHH------HHHcCCChhhccCCHHHHHHHHHHHHHHHHHH--- Q lcl|NC_011045. 433 IGRG-QDLDKLERCVAAWAALAPMRDDPDINLAMIKLRI------ANAIGIDTSGILLTEEQKQQKMAQQSMQMGMD--- 502 (536) Q Consensus 433 a~r~-~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~------a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~--- 502 (536) .... +.+..+..-+.+..++. ...-.|+++.++.+ ++.+|++....-+..-...........|.+.. 
T Consensus 428 ~Kea~A~~~~i~~Gl~T~~~~~---a~~G~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (548) T protein:vir:95 428 MHEANAWELLVKAGFADEAEVA---RARGRDPRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAVQKVYLGVG 504 (548) T ss_pred HHHHHHHHHHHHcCCCCHHHHH---HHhCCCHHHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCCCchhhhccccc Confidence 1110 11111111111111111 01112333333222 22234411100000000000000000000000 Q ss_pred -HHHHHHHHHHHHhhhcCcch-----HH-hhhhcCCCCCCC Q lcl|NC_011045. 503 -NGAAALAQGMAAQATASPEA-----MA-AAADSVGLQPGI 536 (536) Q Consensus 503 -~~a~~~~~~~~~~~~~~~~~-----~~-~~~~~~~~q~~~ 536 (536) ..++-++--.+..-.+.-+. .. ..+..+.=||.- T Consensus 505 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 545 (548) T protein:vir:95 505 KMLTADEARELVNRYGAGLPVPGPDFPNESNNGGADGQPSN 545 (548) T ss_pred cccccchhHHhhccCCCCCcCCCCCCCcccccCCCCCCCCC Confidence 00000000000000000000 00 000001112222 No 188 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=87.82 E-value=0.036 Score=28.48 Aligned_cols=367 Identities=9% Similarity=0.034 Sum_probs=153.0 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccc-CCCC---Ccccccccccc-cchHHHHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLF-PKDS---DNASTDYVTPW-QAVGARGLNNLASKLML 75 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~-~~~~---~~~~~~~~~~~-dst~~~a~~~Laa~l~~ 75 (536) |++++..+ ...-++..+++++..-..-.....|... .... .-......... 
.++--.|++.+|+.+.+ T Consensus 4 ~~~~~~~~-------~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cI~~ia~~ia~ 76 (413) T protein:vir:96 4 VSEIRKDK-------NLKFFNNKRSPTEESKAKDEIPKAPQVVMTLPNFFKELISDGYTKLSDSPEVRMAVDCIADLVSN 76 (413) T ss_pred cchhhhhh-------cCCccccCCCcchhhhhhccccccccccccchhhHhhhccchhHHHhhchHHHHHHHHHHHhhcc Confidence 66655322 1122222333333221111112222111 0000 00000011111 33444555666666532 Q ss_pred hhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-h----ccChHHHHHHHHHHHhhCcEEEEEecC Q lcl|NC_011045. 76 ALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-S----NSYRVTLFEALKQLVVAGNVLLYLPEP 150 (536) Q Consensus 76 ~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~----snf~~~~~~~~~dl~~~G~~~l~~~~~ 150 (536) + ||--....+.. .+ ..+ .-+...|. + -+.+.-++.++.+++.+|+|.+++..+ T Consensus 77 -~----~~~~~~~~~~~-~~---------~~~-------~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~ 134 (413) T protein:vir:96 77 -M----TIQLMQNGETG-DK---------RIK-------NDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQ 134 (413) T ss_pred -C----ceEEEEecCCC-cc---------ccc-------cHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEc Confidence 1 33211111100 00 011 11111222 1 345677788899999999999998876 Q ss_pred CCCc-eeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEE Q lcl|NC_011045. 151 EGSN-YNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRY 229 (536) Q Consensus 151 ~~~~-~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~ 229 (536) ..++ +..+..++.+.+-+..+..+ ++ +..+ T Consensus 135 ~~g~~~~~L~~l~~~~v~~~~~~~~----~~--y~~~------------------------------------------- 165 (413) T protein:vir:96 135 VSGDKIIGLTPISPYKVTFNVSDDD----LD--YSIT------------------------------------------- 165 (413) T ss_pred CCCCceEEEEEecCceeEEEEcCCe----EE--EEEe------------------------------------------- Confidence 5443 33444444444444333211 10 0000 Q ss_pred EEecCccccccccccccccCceEEEeeeecC-CCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccch Q lcl|NC_011045. 
230 EEVEGMEVQGSDGTYPKEACPYIPIRMVRLD-GESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQP 308 (536) Q Consensus 230 ~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~-ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~ 308 (536) ..+..+ ...-.+++|+.... +..||.||..-+...+.......+.......-...|..++.-++.+++ T Consensus 166 --~~~~~~---------~~~evih~k~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~ 234 (413) T protein:vir:96 166 --FDNKEY---------DPSTLLHFVLNPSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSDSDE 234 (413) T ss_pred --ecCcEE---------chhhEEEEeccCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCH Confidence 011100 01124556654333 446899999999999999999999999999999999887766665554 Q ss_pred hhhcc----------CC--Cc--ceecCCcccccc-cccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHH Q lcl|NC_011045. 309 RRLTK----------AQ--TG--DFVTGRPEDISF-LQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEE 373 (536) Q Consensus 309 ~~~~~----------~~--~g--~~~~g~~~~~~~-~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtE 373 (536) +.... +. .| .++++....... .++ +..+.+. .+..+..+..|-++|-.-....-.+ +..| T Consensus 235 e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~-~~~d~q~-~e~~~~~~~~Ia~~fgVP~~~lg~~---~~~~ 309 (413) T protein:vir:96 235 LSDEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQIKPL-TLNDLAI-NDAVTLDKKTVAGIFGVPAFLLGVG---TYNK 309 (413) T ss_pred HHHHHHHHHHHHHhcCccccCceeeecCCcccccccccC-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCC---cchH Confidence 43211 10 11 122222111111 111 1234443 3455566778888886433211111 2222 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHH--HHHHHHHHHHHHH--H--HH Q lcl|NC_011045. 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLE--AIGRGQDLDKLER--C--VA 447 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La--~a~r~~~~~~l~~--~--~~ 447 (536) -.. ..+...-+.|++.++-..+.+. ++++ +..+++.....+- ...++...+.+.. 
+ .+ T Consensus 310 ~~~-------------~~~~~~~l~P~~~~ie~~ln~~-ll~~--~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~N 373 (413) T protein:vir:96 310 DEF-------------NNFINTKIMSIAQVIQQTYNKL-IVEE--DMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRN 373 (413) T ss_pred HHH-------------HHHHHHHHHHHHHHHHHHHHHh-hCCC--CcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHH Confidence 111 1234445777777666665443 4442 3334443222221 1122222222221 1 11 Q ss_pred HH---Hhhcch-hhhh------cCCHHHHHHHHHHHcCCCh Q lcl|NC_011045. 448 AW---AALAPM-RDDP------DINLAMIKLRIANAIGIDT 478 (536) Q Consensus 448 ~~---~~~~p~-~~~~------~id~d~~~~~~a~~~Gv~p 478 (536) .+ -.+.|. -.|. ++..+.+-+.--..-| +. T Consensus 374 E~R~~~g~~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~-dt 413 (413) T protein:vir:96 374 EFRNWVGMPPDAEMDDLLVLENYLQQKDLVNQKKLIQD-ET 413 (413) T ss_pred HHHHHhCCCCCCCcceeeecccccchhhcccccCCCCC-CC Confidence 11 112231 1121 1223333221111122 22 No 189 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=87.80 E-value=0.036 Score=28.47 Aligned_cols=407 Identities=11% Similarity=0.034 Sum_probs=152.5 Q ss_pred HHHHHHHHHHhhh-----HH-HHHHHHHHHhcccccCCCCCcccccc-cccc-cchHHHHHHHHHHHHHHhhcCCCccee Q lcl|NC_011045. 14 KSVYERLKNDRAP-----YE-TRAQNCAQYTIPSLFPKDSDNASTDY-VTPW-QAVGARGLNNLASKLMLALFPMQTWMR 85 (536) Q Consensus 14 ~~r~~~l~~~R~~-----~e-~~w~e~~~~~~P~~~~~~~~~~~~~~-~~~~-dst~~~a~~~Laa~l~~~ltP~~~Wf~ 85 (536) -..|+.|...... -. ..|..+.-.. ..++.. ...+..-. .... .++--.|++.+|..+.+ + |=. 
.++ T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~-l-p~~-~~~ 75 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSI-YNLGAT-ASSGERVTPHDALQVSAVFASVRLLSETIAT-L-PLS-TYS 75 (457) T ss_pred Cchhhhhhccccccccccccccccccchhhh-hhcccc-ccCCceechHHhhccHHHHHHHHHHHHhHhh-C-ceE-EEE Confidence 2334433321110 00 1111110000 001100 00000000 0011 23334456666655532 2 211 222 Q ss_pred ccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHh----ccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEE Q lcl|NC_011045. 86 LTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIES----NSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLY 161 (536) Q Consensus 86 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~----snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~ 161 (536) -.-... . +++ ...+...+.+ -+.+.-+..++.++..+|||++++..+ .+++..+..+ T Consensus 76 ~~~~~~--~----------~~~------~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-~g~~~~l~~l 136 (457) T protein:vir:62 76 KRGGTR--K----------EID------TPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA-GPNIAGLDVL 136 (457) T ss_pred ecCCcc--c----------ccc------chHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEE Confidence 211100 0 010 0111112222 236667778888999999999998544 4455444444 Q ss_pred ecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEE-ecCcccccc Q lcl|NC_011045. 162 RLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEE-VEGMEVQGS 240 (536) Q Consensus 162 ~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~-v~g~~i~~~ 240 (536) +...+.+.++..+... .. + +..|.. .+|..... T Consensus 137 ~p~~v~v~~~~~~~~~------------------------------~~---~------------~~~y~~~~~g~~~~~- 170 (457) T protein:vir:62 137 DPTKIHVHMVMVDGLR------------------------------RK---V------------FEAYDIDADGNEVLL- 170 (457) T ss_pred cCcceEEEEeccCCcc------------------------------ce---e------------EEEEEEccCCceeEE- Confidence 4445555444332110 00 0 111111 11111100 Q ss_pred ccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccCC----- Q lcl|NC_011045. 
241 DGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQ----- 315 (536) Q Consensus 241 ~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~~----- 315 (536) ..++-++ +|++|....+|..||.||...+...+.......+.......-...|..++.-++.++++...... T Consensus 171 -~~~~~~e--iih~r~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~ 247 (457) T protein:vir:62 171 -GWFTPRD--VLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGTMSEEGLARAREAWRA 247 (457) T ss_pred -EeeCccc--eEEecCCCCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCCCCHHHHHHHHHHHHH Confidence 1111111 56666666677789999999999999998888888888888888888777766666655432111 Q ss_pred --C-----cceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCC--CCHHHHHHHHHHH-HHH Q lcl|NC_011045. 316 --T-----GDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGER--VTAEEIRYVASEL-EDT 384 (536) Q Consensus 316 --~-----g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r--~TAtEi~~r~~E~-~~~ 384 (536) . |.+. .-.++....++.. +.+.+. .+..+..+..|-++|-.-....-+..+ .+..-+.+..... ... T Consensus 248 ~~~G~~nag~~~-vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~~ 325 (457) T protein:vir:62 248 ANSGVDNAHRVA-LLTEGAKFSKVAMSPDEAQF-LQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFS 325 (457) T ss_pred HhcCccccCcce-ecCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHH Confidence 0 1111 1112223333332 234443 344455677888888543211111122 1222232222222 223 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEE-echHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCH Q lcl|NC_011045. 385 LGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTI-STGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINL 463 (536) Q Consensus 385 LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~-vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~ 463 (536) |.|.+.+++++|-. .++++.......++| ++.|-. . 
+......++..+.+.+ .+.+ T Consensus 326 l~P~~~~ie~~ln~-------------~L~~~~~~~~~~i~fd~~~l~~---~-d~~~r~~~~~~~~~~G------~~T~ 382 (457) T protein:vir:62 326 LRPWLERIEAGFNR-------------LLFAETADRFRFVKFNLDEIKR---G-APKERMELWSLGLQNG------IYSI 382 (457) T ss_pred HHHHHHHHHHHHHh-------------hhcCccccCceEEEeechhhhc---c-CHHHHHHHHHHHHhCC------CcCH Confidence 44444444444322 234433333344554 233322 1 1111122222211111 1122 Q ss_pred HHHHHHHHHHcCCChhhccCCH--HHHH-HHHHHHH-HHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCCCC Q lcl|NC_011045. 464 AMIKLRIANAIGIDTSGILLTE--EQKQ-QKMAQQS-MQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) Q Consensus 464 d~~~~~~a~~~Gv~p~~i~rs~--~ev~-~~~~q~~-~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~ 536 (536) + ++-+.+|.+|. ... |+.- ...-... .+-+.+.+. . + .+.+..+. ..+.+.||.- T Consensus 383 N----E~R~~~gl~pi---~~g~~D~~~~~~n~~~~~~~~~~~~~~---~-----~--~~~~~~~~-~~~~~~~~~~ 441 (457) T protein:vir:62 383 D----EVRAAEDMTPL---PDGLGEKYRVPLNLGEIGEEPEPEPAP---A-----P--PAIDPPAE-EPADDEEPDN 441 (457) T ss_pred H----HHHHHhCCCCC---CCCCcceeeeccccccccccccccccC---C-----C--ccCCCCcc-CCCCCCCCCC Confidence 2 23334444331 110 0000 0000000 000000000 0 0 00000000 0011112222 No 190 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=87.20 E-value=0.04 Score=28.23 Aligned_cols=390 Identities=12% Similarity=0.084 Sum_probs=167.6 Q ss_pred HHHHHHHHHHhcccccCCCCCcccccccccccchHHH-----HHHHHHHHHHHhhcCC----CcceeccCChhhhhhhcc Q lcl|NC_011045. 28 ETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGAR-----GLNNLASKLMLALFPM----QTWMRLTISEYEAKQLLS 98 (536) Q Consensus 28 e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~-----a~~~Laa~l~~~ltP~----~~Wf~l~~~d~~~~~~~~ 98 (536) ..+..-+....+ .++.+.+.......-+... .-+-|+.++... |+ +.|+.++..+.. 
T Consensus 1 ~~~~D~~~n~~~------gg~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~--~aed~~r~g~~i~~~~~~------ 66 (422) T protein:vir:10 1 MVKTDSYANIFL------GGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDT--IPETALAAGFHIDGIDDE------ 66 (422) T ss_pred CccchhhHHHHc------CCCCCccccCcccccCHHHHHHHHHhChhhHHHHhh--hhHHHhcCCccccCCCHH------ Confidence 111111111111 1111111111111111111 112222222221 22 579998643211 Q ss_pred ChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCCCCCeEE Q lcl|NC_011045. 99 DPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQ 178 (536) Q Consensus 99 ~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~ 178 (536) ..+ ...+++-++...+.++++.--+||.|++++.-++++.. .-|+. ..|.+-. T Consensus 67 -----~~~-----------~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~----~~Pl~-------~~g~~~~ 119 (422) T protein:vir:10 67 -----PAF-----------WSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRAL----TSPVR-------EGAELET 119 (422) T ss_pred -----HHH-----------HHHHHHhhHHHHHHHHHHhhccccceEEEEEecCCCCc----ccccc-------ccCceee Confidence 111 12233457889999999999999999998876544321 12332 2343322 Q ss_pred --EEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccccCceEEEee Q lcl|NC_011045. 179 --MVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRM 256 (536) Q Consensus 179 --i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw 256 (536) ++-+..+++.... .+. ....+-.-+.|+ |..+..+..+.+| -.+++...|. ..|+ + T Consensus 120 l~v~d~~~i~~~~~~----~dp-----~s~~fg~P~~y~-v~~~~~~~~~~iH----~SRli~~~g~----~~p~----~ 177 (422) T protein:vir:10 120 VRVYDRTQVKVQTRE----ENP-----RNARFGEPLTYR-ITTNESDMFYDVH----YSRIHIIDGE----RIPN----V 177 (422) T ss_pred EEeeccccccchhcc----cCc-----cccccCcceEEE-EecCCCCcceeec----cceeEEeCCC----Cchh----h Confidence 2333334432110 010 000011112222 2222111112221 1222222221 2233 4 Q ss_pred eecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhCCceeecc------cccc-----chhhhc---cCCCcce-e Q lcl|NC_011045. 
257 VRLDGESYGRSYIEE-YLGDLRSLENLQEAIVKMSMISSKVIGLVNP------AGIT-----QPRRLT---KAQTGDF-V 320 (536) Q Consensus 257 ~~~~ge~YGrgp~~~-~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~------~g~~-----~~~~~~---~~~~g~~-~ 320 (536) .+....-||+||... +++.++.+...+....+.+.++.-..+-++. ++.. ...... .+.+|.+ + T Consensus 178 ~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l 257 (422) T protein:vir:10 178 MRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGI 257 (422) T ss_pred hcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeE Confidence 455677789999987 6799999999999888888777766665542 1110 011111 1222332 2 Q ss_pred cCCcccccccccccccchhHHHHHHHHHHHHHHHHHhh--hhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHH Q lcl|NC_011045. 321 TGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML--NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQL 398 (536) Q Consensus 321 ~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~--~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~ 398 (536) -+..++...+. .++.-+...+....+.|.-+.=. .-+...+.....+| -++-....---+..++...+. T Consensus 258 ~~~~e~~e~~~----~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnat-----gd~d~~~yyd~i~~~Qe~~l~ 328 (422) T protein:vir:10 258 DAESEEYSVLN----SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGVSSS-----QNTALETFHKLVDRKRNAELL 328 (422) T ss_pred ecCCcceEEEe----cccCChHHHHHHHHHHHHhhhCCCeeeeccCCccccccc-----chHHHHHHHHHHHHHHHHHHH Confidence 22233333332 23444556677777777665421 11222334445443 111122333444556777899 Q ss_pred HHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHH---cC Q lcl|NC_011045. 399 PLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANA---IG 475 (536) Q Consensus 399 Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~---~G 475 (536) |++++++.++.+. +++.++|- ||-+....+.++.....++....+.. ...++.+++-+.+... 
.| T Consensus 329 p~l~~l~~~i~~s--------~~~~~~f~-pL~~~sekekaei~~~~a~a~~~~~~---~g~i~~~e~r~~L~~~~~~~~ 396 (422) T protein:vir:10 329 PILEFLIPFIVNA--------EEWSVEFN-PLAQESSKDKAEILEKNVNSIAALIA---AGAMDIDEARDTLRTIAPEVK 396 (422) T ss_pred HHHHHHHHHhccc--------CCcEEEeC-CCCCCCHHHHHHHHHHHHHHHHHHHh---cCCCCHHHHHHHhhhhccccc Confidence 9999999998764 36677765 54443333222222222222222211 1236777777766543 44 Q ss_pred CChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcch Q lcl|NC_011045. 476 IDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEA 522 (536) Q Consensus 476 v~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~ 522 (536) + ...+.. ++......+ ..+. ...++. T Consensus 397 ~-~~~~~~--~~~~~~~~~---------------~~~~---~~~~~d 422 (422) T protein:vir:10 397 I-NDGSVE--TEVTISETS---------------NDPL---EVPTDD 422 (422) T ss_pred C-CCCCCc--cccchhhcC---------------CCCC---CCCCCC Confidence 4 222322 222111000 0000 000000 No 191 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=86.89 E-value=0.042 Score=28.11 Aligned_cols=386 Identities=10% Similarity=0.053 Sum_probs=150.7 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCC--CcccccccccccchHHHHHHHHHHHHHHhhcCCCcceeccCChh Q lcl|NC_011045. 14 KSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDS--DNASTDYVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEY 91 (536) Q Consensus 14 ~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~--~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~ 91 (536) -+.|..+. ....+.|...+ .-|..++... ..+.+ -+=.++--.|++.+|+.+-+ + |-+ .++- ..+. 
T Consensus 1 m~~~~~~~---~~~~~~~~~~~--~~~~~~~~~~g~~~~~~---Al~~~~V~~cv~~ia~~iA~-l-p~~-~~~~-~~~~ 68 (417) T protein:vir:38 1 MKLFRGLA---TEVDPHWADHL--LDSGVIPSFRGGYLGIS---ALRNSDVLTAVSIVSGDVSR-F-PLV-ITDS-STDE 68 (417) T ss_pred Cccccccc---cCCCccchhhh--cccccccccCCceechh---hcccHHHHHHHHHHHHhhcc-C-eeE-EEEc-CCcc Confidence 12332211 12234443221 1222222211 11111 11123334566777776643 2 311 1121 1111 Q ss_pred hhhhhccChhHHHHHHHHHHHHHHHHHHHHH-h----ccChHHHHHHHHHHHhhCcEEEEEecCC-CCceeeEEEEecce Q lcl|NC_011045. 92 EAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-S----NSYRVTLFEALKQLVVAGNVLLYLPEPE-GSNYNPMKLYRLSS 165 (536) Q Consensus 92 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~----snf~~~~~~~~~dl~~~G~~~l~~~~~~-~~~~~~~~~~~l~~ 165 (536) ... ... +...|. + .+.+.-....+.++..+|||.+++..+. ++.+..+..+|... T Consensus 69 ~~~--------~~~-----------~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~ 129 (417) T protein:vir:38 69 VID--------LAN-----------IEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQ 129 (417) T ss_pred eec--------cch-----------HHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCce Confidence 000 001 111121 2 2344455666888899999999987654 33455555666666 Q ss_pred EEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccc Q lcl|NC_011045. 166 YVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYP 245 (536) Q Consensus 166 ~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~ 245 (536) +.+..+..|++. | ++.. + +++.... +. . T Consensus 130 v~v~~~~~~~~~--y-~~~~---------------------------------~--~~~~~~~---~~------~----- 157 (417) T protein:vir:38 130 TQVDTSDPDNII--Y-RFTP---------------------------------Y--NSSMQKV---CG------F----- 157 (417) T ss_pred EEEEEcCCCeEE--E-EEEE---------------------------------c--CCcEEEE---ec------C----- Confidence 666666555331 1 1111 0 0110000 00 0 Q ss_pred cccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccC----------C Q lcl|NC_011045. 
246 KEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKA----------Q 315 (536) Q Consensus 246 ~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~----------~ 315 (536) ++ ++++|....+| .||.||...+...+...+...+.......-...|-+++.-++.++++..... . T Consensus 158 -~d--viH~r~~~~d~-~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~l~~e~~~~~~~~~~~~~~g~ 233 (417) T protein:vir:38 158 -ED--VIHWKFFSYDT-IMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKESRLSAEARQKIREDFERAQAGA 233 (417) T ss_pred -cc--eEEecCCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCHHHHHHHHHHHHHHhccc Confidence 01 34445443344 7899999999999988888888888888888888887777776665433211 0 Q ss_pred -Cc-ceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCCCCHHHHHHHHHHHHHHhhhhHHH Q lcl|NC_011045. 316 -TG-DFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGERVTAEEIRYVASELEDTLGGVYSI 391 (536) Q Consensus 316 -~g-~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~r 391 (536) .| .++ -.++....++.. +.+.+. .+.....+..|-++|-... .+...+ |..-+.+.... T Consensus 234 n~g~~~v--l~~g~~~~~l~~~~~d~q~-le~~~~~~~~Ia~~fgVPp~~lg~~~---~~s~~e~~~~~----------- 296 (417) T protein:vir:38 234 DAGSPII--VDATMDYQPLEVDTNVLNL-INSNNYSTAQIAKALRVPAYRLAQNS---PNQSVKQLADD----------- 296 (417) T ss_pred ccCCcee--ccCCceEEEccCCHHHHHH-HHHHHhhHHHHHHHhCCCHHHhCCCC---cchhHHHHHHH----------- Confidence 01 111 112223333332 234443 3344445677877774322 111122 22222222222 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEe-chHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHH Q lcl|NC_011045. 392 LSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTIS-TGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRI 470 (536) Q Consensus 392 l~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~v-s~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~ 470 (536) +...-|.|++.++...+.+. 
++++.....+.++|- +.+..+.++ + +...++ .+ .+.++++ T Consensus 297 ~~~~tl~P~~~~ie~~l~~~-Ll~~~~~~~~~~~fd~~~l~~~~~~-~---~~~~~~----~G------~~T~NE~---- 357 (417) T protein:vir:38 297 YIRNDLPFYFEPITSEFELK-LLDDAQRHQYCIGFDTKSVNGLPIA-D---VNTAVN----GG------LWTGNEG---- 357 (417) T ss_pred HHHHHHHHHHHHHHHHHHhh-hcChhhcccceEEechhhhhHHHHH-H---HHHHHh----CC------CcCHHHH---- Confidence 11223444444443333222 344332233455553 223222211 1 111111 11 2333333 Q ss_pred HHHcCCChhhccCCH--HHHH------HHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCCC Q lcl|NC_011045. 471 ANAIGIDTSGILLTE--EQKQ------QKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPG 535 (536) Q Consensus 471 a~~~Gv~p~~i~rs~--~ev~------~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~ 535 (536) .+.+|.+|. ... +++. -+....+. +..+.....++. .+.+ ..+-++..+ +-+ T Consensus 358 R~~~gl~pi---~~g~~d~~~~~~n~~~~d~~~~~--~~~~~~~~kgg~------~~~~-~~~~~~~~~-~~~ 417 (417) T protein:vir:38 358 RAELGKKPL---KDPNMDRIQSTLNTVFLDQKEAY--QAEHAAELKGGD------TNAK-GNQNGSGTN-ANS 417 (417) T ss_pred HHHhCCCCC---CCCCCCeeeeccccccccccccc--ccccccccCCCC------CCCC-CCCcCCCCc-CCC Confidence 344566552 110 1100 00000000 000000000000 0001 000000000 000 No 192 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=86.46 E-value=0.045 Score=27.95 Aligned_cols=374 Identities=13% Similarity=0.023 Sum_probs=156.8 Q ss_pred HHHHHHHHHHhhhH-HHHHHHHHHHhcc----cccCCCCCccc-c-cccc-cccchHHHHHHHHHHHHHHhhcCCCccee Q lcl|NC_011045. 14 KSVYERLKNDRAPY-ETRAQNCAQYTIP----SLFPKDSDNAS-T-DYVT-PWQAVGARGLNNLASKLMLALFPMQTWMR 85 (536) Q Consensus 14 ~~r~~~l~~~R~~~-e~~w~e~~~~~~P----~~~~~~~~~~~-~-~~~~-~~dst~~~a~~~Laa~l~~~ltP~~~Wf~ 85 (536) -..|+.|...|.+- +.++......-++ .....-+.... . 
...+ +-.++--.|++.+|+.+.+ + | +-- T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA~-l-p---~~~ 75 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKFGIKLNFSVRGKRALKENTVYVCTKIRAESIGK-L-S---LKI 75 (422) T ss_pred CchhhhhhhccCCccchhhhhhhccccccCcchhhhhccccCCcccchhhhhccHHHHHHHHHHHHhhhh-C-c---eEE Confidence 23455554443322 2222111111111 00000010000 0 0011 1123334455666555543 2 3 211 Q ss_pred ccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-h----ccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEE Q lcl|NC_011045. 86 LTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-S----NSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKL 160 (536) Q Consensus 86 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~----snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~ 160 (536) ..-.+.. + +..+...|. + -+.+.-+..++.++.++|||.+++..+..+.++.+.. T Consensus 76 ~~~~~~~--------------~------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~ 135 (422) T protein:vir:13 76 YKDKEEY--------------K------EHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYP 135 (422) T ss_pred EecCccc--------------c------cchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEE Confidence 1111100 0 011112222 1 2345667778889999999999998777777777777 Q ss_pred EecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccccc Q lcl|NC_011045. 161 YRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGS 240 (536) Q Consensus 161 ~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~ 240 (536) ++.+.+.+..|.+|.....-+ + .|. +....|... T Consensus 136 i~~~~v~~~~~~~~~~~~~~~-----------------------------~-~y~-------------~~~~~g~~~--- 169 (422) T protein:vir:13 136 INSDNVTKIIDDDNFLSSLSK-----------------------------V-WYV-------------VTDKNGKEH--- 169 (422) T ss_pred ECCcceEEEEcCCcceeccce-----------------------------E-EEE-------------EEeCCCeEE--- Confidence 888888888887775321000 0 000 000011100 Q ss_pred ccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhcc------- Q lcl|NC_011045. 
241 DGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTK------- 313 (536) Q Consensus 241 ~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~------- 313 (536) .+...-.++.+.....+..||.||...+...+.......+.......-...|..++.-++.++.+.... T Consensus 170 ----~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~ 245 (422) T protein:vir:13 170 ----KLLPDEMLHFIGDITLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVGDLDEKAKKIFKKEFES 245 (422) T ss_pred ----EEcccceEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHH Confidence 011112344444445566899999999999999999999888888888888887776555544432211 Q ss_pred --CC---CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-ccc--CCCCCCCHHHHHHHHHHHHHH Q lcl|NC_011045. 314 --AQ---TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQ--RTGERVTAEEIRYVASELEDT 384 (536) Q Consensus 314 --~~---~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~--~~~~r~TAtEi~~r~~E~~~~ 384 (536) .+ .|.+ ..-.++....++.. +.+.+. .+..+..+..|-++|=.-. +.. .++..-+++|.. ..=.... T Consensus 246 ~~~g~~n~~~~-~vl~~g~~~~~l~~~~~d~q~-le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~~--~~f~~~~ 321 (422) T protein:vir:13 246 MSNGLENAHSI-SLLPFGYQFQPISLSMADAQF-LENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQQ--KDFYVTT 321 (422) T ss_pred HhcCccccCCc-eecCCCceeeeccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH--HHHHHHH Confidence 00 0111 11112223333332 234443 3444555677877774321 111 112222333321 1122334 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCC-CcceEEEE-echHHH---HHHHHHHHHHHHH----HHHHH---hh Q lcl|NC_011045. 385 LGGVYSILSQELQLPLVRVLLKQLQATQQIPELP-KEAVEPTI-STGLEA---IGRGQDLDKLERC----VAAWA---AL 452 (536) Q Consensus 385 LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~-~~~v~v~~-vs~La~---a~r~~~~~~l~~~----~~~~~---~~ 452 (536) |-|.+.+++++|-.- ++++.. ...+.++| ++.|-+ ..|..-.+.+.+. 
.+.+- .+ T Consensus 322 l~P~~~~ie~~l~~~-------------Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl 388 (422) T protein:vir:13 322 LQSSLTVYEQEIQDK-------------LFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENL 388 (422) T ss_pred HHHHHHHHHHHHHHh-------------hCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 444444444443322 333322 12344554 333322 1122222222210 11111 12 Q ss_pred cch-hhhh------cCCHHHHHHHHHHHcCCChhhc Q lcl|NC_011045. 453 APM-RDDP------DINLAMIKLRIANAIGIDTSGI 481 (536) Q Consensus 453 ~p~-~~~~------~id~d~~~~~~a~~~Gv~p~~i 481 (536) .|. -.|. .+..|.+-..- ..-| +..+= T Consensus 389 ~p~~ggD~~~~~~n~~~l~~~~~~~-~~~g-~~~g~ 422 (422) T protein:vir:13 389 PPVEGGDRLLVNGNMIPIEMAGEQY-KKGG-EKGGK 422 (422) T ss_pred CCCCCcCeeeeccCccchhhccccc-ccCC-CcCCC Confidence 221 1111 11223322211 1111 11111 No 193 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=86.22 E-value=0.047 Score=27.86 Aligned_cols=350 Identities=12% Similarity=0.028 Sum_probs=140.8 Q ss_pred cccCCCCCccccccc-c---cc---cchHH------------HHHHHHHHHHHHhhcCCCcceeccCChhhhhhhccChh Q lcl|NC_011045. 41 SLFPKDSDNASTDYV-T---PW---QAVGA------------RGLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPD 101 (536) Q Consensus 41 ~~~~~~~~~~~~~~~-~---~~---dst~~------------~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~ 101 (536) .+|..--.+...... . +. -.+.. .++..+.+.+.+.+ =+-||-=....+.. . 
T Consensus 1 m~f~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~i-a~lp~~~~~~~~~~-~------- 71 (409) T protein:vir:10 1 MLFRKGFKNQSQEISIDDKKILEWLGINPSETYVNGKSCLKQATVFGCIRILSDNI-SKLPIKIYQKKDGI-K------- 71 (409) T ss_pred CcccccccCcCCCCCCChHHHHHHhcCCcCcceechhhhhccHHHHHHHHHHHHhh-hhCceEEEEecCCe-e------- Confidence 333221111111000 0 00 00000 00111111111111 11122111000000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-hc----cChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCCCCCe Q lcl|NC_011045. 102 GLAKVDEGLSMVERIIMNYIE-SN----SYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNV 176 (536) Q Consensus 102 ~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v 176 (536) .+ .+..+...|. +- +.+.-+...+.++.++|||.+++..+..+.+..+..+|....-+..|.+|.. T Consensus 72 ---~~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~ 142 (409) T protein:vir:10 72 ---RV------PDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLL 142 (409) T ss_pred ---ec------cCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCccc Confidence 00 0111223333 22 3445566778889999999999987777766666666666665666655533 Q ss_pred EEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccccCceEEEee Q lcl|NC_011045. 177 LQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRM 256 (536) Q Consensus 177 ~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw 256 (536) ..- ..+ .|. + ....|... . +...=++++|. T Consensus 143 ~~~-----------------------------~~~-~y~-~------------~~~~g~~~-----~--~~~~evih~r~ 172 (409) T protein:vir:10 143 NSE-----------------------------NNV-WYL-Y------------TDDLGQRH-----K--FMSDEILHFKG 172 (409) T ss_pred ccc-----------------------------ceE-EEE-E------------EeCCceeE-----E--eccccEEEecC Confidence 210 000 000 0 00011100 0 00112455555 Q ss_pred eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhc----------cC-C-CcceecCCc Q lcl|NC_011045. 
257 VRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLT----------KA-Q-TGDFVTGRP 324 (536) Q Consensus 257 ~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~----------~~-~-~g~~~~g~~ 324 (536) ...+ ..||.||.+.+...+.......+.......-...|.+++.-++.++++... .+ . .|.+ .--. T Consensus 173 ~~~d-~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~-~vl~ 250 (409) T protein:vir:10 173 LTAD-GLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAGDLNPEAEEVFKENFERMSSGLKNAHRI-AMLP 250 (409) T ss_pred cCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCHHHHHHHHHHHHHHhccccccCCc-eecC Confidence 4333 489999999999999998888888888888888898887766655553322 11 0 1111 1112 Q ss_pred cccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-cc-c-CCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHH Q lcl|NC_011045. 325 EDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AV-Q-RTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPL 400 (536) Q Consensus 325 ~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~-~-~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pl 400 (536) ++....++.. +.+.+. .+..+.....|-++|-... +. . .++..-++++... .+.+.-+.|+ T Consensus 251 ~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~~--------------~f~~~~l~P~ 315 (409) T protein:vir:10 251 IGYKFEPISQKLVDAQF-LENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQNR--------------EFYIDTLQSI 315 (409) T ss_pred CCceEEEccCChhhHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHHH--------------HHHHHHHHHH Confidence 2223334333 345554 4555666778888885432 11 1 1222333333221 1233335566 Q ss_pred HHHHHHHHHhcCCCCCCC-CcceEEEEe-chHHH---HHHHHHHHHHHH--HH--HHH---Hhhcch-hhhhc---CC-- Q lcl|NC_011045. 401 VRVLLKQLQATQQIPELP-KEAVEPTIS-TGLEA---IGRGQDLDKLER--CV--AAW---AALAPM-RDDPD---IN-- 462 (536) Q Consensus 401 i~r~~~il~~~g~lp~~~-~~~v~v~~v-s~La~---a~r~~~~~~l~~--~~--~~~---~~~~p~-~~~~~---id-- 462 (536) +.++...+.+. ++++-. ...+.++|. +.|-. ..|....+++.+ ++ +.+ -.+.|. -.|.. 
.| T Consensus 316 ~~~ie~~ln~k-L~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~ggD~~~~~~n~~ 394 (409) T protein:vir:10 316 LNMYELEINYK-LFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEGGDVLLINGNMI 394 (409) T ss_pred HHHHHHHHHHh-hcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccCcc Confidence 55554444332 232211 122334332 22211 112222222211 01 111 112221 11111 12 Q ss_pred -HHHHHHHHHHHcCCCh Q lcl|NC_011045. 463 -LAMIKLRIANAIGIDT 478 (536) Q Consensus 463 -~d~~~~~~a~~~Gv~p 478 (536) .+.+-+....+ |= - T Consensus 395 ~~~~~~~~~~kg-Ge-~ 409 (409) T protein:vir:10 395 PVKMAGEQYSKG-GE-K 409 (409) T ss_pred chhhcccccccc-CC-C Confidence 11211111122 31 1 No 194 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=85.57 E-value=0.052 Score=27.63 Aligned_cols=308 Identities=12% Similarity=0.084 Sum_probs=130.6 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhc-C Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALF-P 79 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt-P 79 (536) |++.+......... ++-+ -|.+. ++.+...-..+-. .+...+ + T Consensus 1 m~~~~~~~~~~~~~----------~~~~-------------~~~~~------------~p~~~~~~~~~~~-~~~~~~~~ 44 (337) T protein:vir:78 1 MTKRQQQPAQAAAS----------SPRP-------------SVVFS------------MPEAIDPTAWMTD-YTGVFYNP 44 (337) T ss_pred CCCcccCccccccc----------Ccee-------------EEEec------------CcccccCcchhHh-hhhhhhcc Confidence 77654322111100 0000 01111 0000000000111 112222 4 Q ss_pred CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHH--HHhccC---hHHHHHHHHHHHhhCcEEEEEecCCCCc Q lcl|NC_011045. 
80 MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNY--IESNSY---RVTLFEALKQLVVAGNVLLYLPEPEGSN 154 (536) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~--l~~snf---~~~~~~~~~dl~~~G~~~l~~~~~~~~~ 154 (536) ...|+.--++-..|.++.. ...+... .+... ...+.| +..+..+..|+.+||||.+++..+..+. T Consensus 45 ~~~~~~pP~~~~~La~l~~-------~~~~h~~---~L~~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~~rn~~G~ 114 (337) T protein:vir:78 45 YGEYYQPPIDRKGLAKVAR-------ANAHHGA---ILMARRNMVAGRFTNQRATITAFVHNYLQFGDGGLLKLRNSFGQ 114 (337) T ss_pred CcceecCCCCHHHHHHHhh-------cchhhhh---HHHhhhccccccCcCcHHHHHHHHHHHHhhCCeEEEEEECCCCc Confidence 5667765554444444322 1111111 11111 111223 4567888899999999999987776666 Q ss_pred eeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecC Q lcl|NC_011045. 155 YNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEG 234 (536) Q Consensus 155 ~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g 234 (536) ++. .+|+..-++.+..+|+. .| + ..+ + T Consensus 115 ~~~--L~pl~~~~v~~~~d~~~--~~------------------------------------~--~~~-----------~ 141 (337) T protein:vir:78 115 VVG--LHPLSSVYLRRREDGCF--VY------------------------------------L--QQG-----------K 141 (337) T ss_pred EEE--EEEeCCceeEeeeCCeE--EE------------------------------------E--EcC-----------C Confidence 554 44444444444433321 00 0 000 0 Q ss_pred ccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec-cccccchhhhc- Q lcl|NC_011045. 235 MEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQPRRLT- 312 (536) Q Consensus 235 ~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~-~~g~~~~~~~~- 312 (536) ..+ . |..--++..|.....+..||.+|..-++..+-.-+..++-..+.-.-...|-.++. +++..+.++.. 
T Consensus 142 ~~~-~------~~~~eIiHik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~~~l~~e~~~~ 214 (337) T protein:vir:78 142 PNL-I------YRPDDVIWLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYATDPNMDDDTEEE 214 (337) T ss_pred ceE-E------ECCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHH Confidence 000 0 01112355554444567999999998888877766666666666666677776653 45444443321 Q ss_pred --------cC-CC--ccee--cC-Ccccccccccccc-cchhHHHHHHHHHHHHHHHHHhhhh-cc-c-CCCCCCC---H Q lcl|NC_011045. 313 --------KA-QT--GDFV--TG-RPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS-AV-Q-RTGERVT---A 371 (536) Q Consensus 313 --------~~-~~--g~~~--~g-~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~-~~-~-~~~~r~T---A 371 (536) .+ ++ +.++ ++ ..+++...++... .+.+ ..+..+-.++.|-.+|-.-. ++ . .+...-| + T Consensus 215 lk~~~~~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~q-fle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n~ 293 (337) T protein:vir:78 215 MKEMIANSKGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKDE-FAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLGDP 293 (337) T ss_pred HHHHHHHhcCcccccceEEEcCCCCccceeEEEcCCChhHHH-HHHHHHHhHHHHHHHhCCCHHHcccccCCCcCccccH Confidence 10 11 1112 22 2345566665543 3444 34455555667878774321 11 1 1222222 3 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechH Q lcl|NC_011045. 372 EEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGL 430 (536) Q Consensus 372 tEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~L 430 (536) ++... 
.+...-|.|++.++...+.+.+ +|+..-..++...-+.| T Consensus 294 e~~~~--------------~f~~~~L~P~~~~ie~~~n~~l-l~~~~~~~f~~~~~~~~ 337 (337) T protein:vir:78 294 EKYDA--------------TYARNEVLPLCELVQDAINSAG-LPRALWVTFRETIGAAV 337 (337) T ss_pred HHHHH--------------HHHHHHHHHHHHHHHHHHhhhc-CChhhceeccccccccC Confidence 33221 1222334555555555544332 22211111111111222 No 195 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=85.49 E-value=0.052 Score=27.61 Aligned_cols=414 Identities=10% Similarity=0.017 Sum_probs=162.3 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHH---Hh-cccccCCCCCcc-----cccccccc-----cchHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQ---YT-IPSLFPKDSDNA-----STDYVTPW-----QAVGARGL 66 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~---~~-~P~~~~~~~~~~-----~~~~~~~~-----dst~~~a~ 66 (536) |+.......-+=+. +..+ |. +.-++++. +. .|+....-.+.. ....-++| |++-.-++ T Consensus 1 m~~~i~~~~g~p~~--~~~~---~~---~~~~~ia~~~~~~~~~~~~~~~~~~~~iLr~~~~~~~~y~~m~~D~~i~s~l 72 (491) T protein:vir:10 1 MSKGLWVSPTEFVT--FGEP---DK---SLSSQIATRARSIDFFALGMYLPNPDPVLKALGKDIRVYRELRADAHVGGCV 72 (491) T ss_pred CCCceeCCCCCccC--cccC---Ch---HHHHHHHhhhcccccccccCCccchHHHHHhcCCCHHHHHHHhhChHHHHHH Confidence 55533332221111 1111 11 11123321 00 111110000000 00001112 33333333 Q ss_pred HHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEE Q lcl|NC_011045. 67 NNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLY 146 (536) Q Consensus 67 ~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~ 146 (536) ++....+.+ .+|- +.+.+. 
+......++ +.|.+.+|...+.+++ |.+.+|-++.- T Consensus 73 ~~Rk~av~~-----~~w~-i~~~~~-------~~~~~e~v~-----------e~l~~~~~~~~l~~~l-da~~~G~s~~E 127 (491) T protein:vir:10 73 RRRKAAVKA-----LEWG-LDRGKA-------KSRVAKSIA-----------DVFADLDLSRIVTEML-DAVLYGYQPME 127 (491) T ss_pred HHHHHHHhC-----CCcE-EecCCC-------CHHHHHHHH-----------HHHhcCCHHHHHHHHH-HhhhhcceeEE Confidence 444333322 2443 222221 111122333 3344567887777775 67778987753 Q ss_pred EecCCCCcee---eEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCC Q lcl|NC_011045. 147 LPEPEGSNYN---PMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDS 223 (536) Q Consensus 147 ~~~~~~~~~~---~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~ 223 (536) +.....++.+ .+..+|-..|. .|.+|++ .+. ..++ T Consensus 128 i~w~~~~g~~~~~~l~~r~~~~f~--~d~~~~l-----------------------------------~~~-----~~~~ 165 (491) T protein:vir:10 128 ITWGKVGNYIVPIDVVGKPADWFV--YDPENQL-----------------------------------RFR-----SKDH 165 (491) T ss_pred EEEeecCCeeEEEEeeeeccccee--eccCCce-----------------------------------EEe-----cCCC Confidence 2211111111 11111111111 1111111 000 0000 Q ss_pred CceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecc- Q lcl|NC_011045. 224 GEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNP- 302 (536) Q Consensus 224 ~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~- 302 (536) ..+|..+ ..+=|++.|+...+|+.||.|....+..-..--+...+..+..+++---|..+..- T Consensus 166 -------~~~g~~l---------~~~k~i~~~~~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~ 229 (491) T protein:vir:10 166 -------WMQGEEL---------PARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHP 229 (491) T ss_pred -------CCCccee---------cCCCEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecC Confidence 0112211 12228999999999999999999999999999999999999999998888766553 Q ss_pred ccccchh--hhc----c--CCCcceecCCcccccccccc-cccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHH Q lcl|NC_011045. 
303 AGITQPR--RLT----K--AQTGDFVTGRPEDISFLQLE-KQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEE 373 (536) Q Consensus 303 ~g~~~~~--~~~----~--~~~g~~~~g~~~~~~~~~~~-~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtE 373 (536) .+..+.+ .+. . +..+.++|. ..++.++... .+++.+.-...++.+.+.|+++++...+...++......| T Consensus 230 ~~a~~~ek~~l~~al~~~~~~a~~viP~-~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~~gs~a~~~ 308 (491) T protein:vir:10 230 RSASDGEKNLLLDCLEDMVQDAVAVVPD-DSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLGQNQTTEATSTRASAQ 308 (491) T ss_pred CCCCHHHHHHHHHHHHHHhcCcEEEecC-CceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhhhhcccCcccchhHHH Confidence 2222221 111 1 122334443 3445555543 3455666778899999999999987654433333334455 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALA 453 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~ 453 (536) |.....+.. +-.--..+...+ .-+|..++.+. .+. .+ ..+++|..+-... ......++.+..++ T Consensus 309 vh~~v~~di--~~~D~~~i~~tl-n~li~~l~~~N--~~~-~~----~p~f~~~~~~e~~------~~~a~~~~~L~~~G 372 (491) T protein:vir:10 309 AGLEVTDDI--RDGDKAVVSEAM-NMLIRWICDLN--FDG-AD----RPVFDMWEQEQVD------EIQAGRDQKLTQAG 372 (491) T ss_pred HHHHHHHHH--HHHHHHHHHHHH-HHHHHHHHHhc--CCC-CC----cceEEecCcCchh------HHHHHHHHHHHhCC Confidence 543332211 111122222222 22444433332 111 11 1234444332111 11112222333332 Q ss_pred chhhhhcCCHHHHHHHHHHHcCCChhhccC----------------------C----HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 454 PMRDDPDINLAMIKLRIANAIGIDTSGILL----------------------T----EEQKQQKMAQQSMQMGMDNGAAA 507 (536) Q Consensus 454 p~~~~~~id~d~~~~~~a~~~Gv~p~~i~r----------------------s----~~ev~~~~~q~~~q~~~~~~a~~ 507 (536) = .++. .++.+.+|+++...-. 
+ ++.+.++ ..+..+........+ T Consensus 373 ~-----~i~~----~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~-~~~~~~~~~~~~~~~ 442 (491) T protein:vir:10 373 A-----RFTP----AYFKRAYNLQDGDLDERPLPVSAVDTVGAASFAEFEAPDQDALDAALNTL-SARDLNADAQALVAP 442 (491) T ss_pred C-----cCCH----HHHHHHhCCCCCCcCccccccCCCCCcccccccccCCCCCCchHHHHHHH-HHHHHHHHHHHHHHH Confidence 1 1222 2444555553221100 0 0111111 011111111110011 Q ss_pred HHHHHHHhhhcCcchHHhhhhcCCCCCCC Q lcl|NC_011045. 508 LAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) Q Consensus 508 ~~~~~~~~~~~~~~~~~~~~~~~~~q~~~ 536 (536) .. ..+ +.+.+.+.... ....+.|.+ T Consensus 443 i~-~~l-~~~~s~~e~~~--~L~~l~~~~ 467 (491) T protein:vir:10 443 LL-KRI-ANGASADELLG--MLAELYPSL 467 (491) T ss_pred HH-HHH-HhcCCHHHHHH--HHHHHhhcC Confidence 10 001 11111111111 111233334 No 196 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=85.09 E-value=0.055 Score=27.47 Aligned_cols=448 Identities=10% Similarity=0.009 Sum_probs=174.7 Q ss_pred CCCccccccHH-HHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccccccccc--ccchHHHHHHHHHHHHHHhh Q lcl|NC_011045. 1 MAEKRTGLAEE-GAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTP--WQAVGARGLNNLASKLMLAL 77 (536) Q Consensus 1 Ma~~~~~~~~~-~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~--~dst~~~a~~~Laa~l~~~l 77 (536) |.+.-+.-+.. .+-.-.. ... .+.=+.|++ ++.+ |.+. ...+-.-...+ -|++-.-++++....+.+. T Consensus 1 ~~~~~~~~~p~~~~g~~~~--~~~-~~~~~~~~~-~e~~-~~lr---~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~- 71 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFG--SGV-VDGWTVWDP-FEQT-PELQ---WPQSVAVYSRMDNEDSRVTSLLEAISLPIRST- 71 (469) T ss_pred CCCcccCCCCccchhhhhh--ccc-ccchhhccc-cccc-cccc---cccchHHHHHHHhhChHHHHHHHHHHHHHhcC- Confidence 65533222211 1100000 000 000112221 0000 0000 00000001111 2555566666655554321 Q ss_pred cCCCcceeccCChhhhhhhccChhHHHHHHHHHHHH------HHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCC Q lcl|NC_011045. 
78 FPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMV------ERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPE 151 (536) Q Consensus 78 tP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~v------e~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~ 151 (536) +|- +.+.+. +.+....+.++|... ..-+...+.+..|...+.+.+.+.+.||-++.=+.-.. T Consensus 72 ----~w~-v~p~~~-------~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~ 139 (469) T protein:vir:10 72 ----PWR-IRANGA-------SDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRP 139 (469) T ss_pred ----Cce-EecCCC-------CHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeec Confidence 443 333321 111222334443321 11122333466788899999998888998776332211 Q ss_pred CCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEE Q lcl|NC_011045. 152 GSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEE 231 (536) Q Consensus 152 ~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~ 231 (536) .+ ...+|++ +..+..+.. ++.+.. -..+++.....++...+.. . T Consensus 140 ~~----------------~~~dG~~--~~~~l~~rp-------~~~i~~--~~~~~~~~l~~~~~~~~~~---------~ 183 (469) T protein:vir:10 140 RN----------------QSPDGRF--WLRKLAPRP-------QWTISK--FNVAPDGGLESIEQIAPPA---------R 183 (469) T ss_pred cc----------------ccCCCce--eeeeeeecC-------ccccee--eeeccCCceeeeeecCccc---------c Confidence 10 0011211 000000000 000000 0000011111111100000 0 Q ss_pred ecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecc-ccccch-- Q lcl|NC_011045. 232 VEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNP-AGITQP-- 308 (536) Q Consensus 232 v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~-~g~~~~-- 308 (536) +.+..-....+..+.-..=|++.|+...+|+.||.|+...+..-..--+...+..+..+++---|..+..- .+..+. 
T Consensus 184 ~~~~~~~~~~~~~~lp~~k~i~~~~~~~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~~a~~~ek 263 (469) T protein:vir:10 184 TRGSLYVANIAPPEIPVNRLVVYTRNKRPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASSATDEDEV 263 (469) T ss_pred cccccccCCCCccccccCcEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCCCCCHHHH Confidence 00000000001000011128999999999999999999999999888888999999999998878655442 222111 Q ss_pred -------hhhccCCC-cceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccC-CCCCCCHHHHHHHHH Q lcl|NC_011045. 309 -------RRLTKAQT-GDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQR-TGERVTAEEIRYVAS 379 (536) Q Consensus 309 -------~~~~~~~~-g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~-~~~r~TAtEi~~r~~ 379 (536) .++..+.. |.++|. ...+.++. ..++...-...|+.+.+.|+++++...+... ++..-...|+..... T Consensus 264 ~~l~~a~~~~~~g~~a~~iip~-~~~ie~~e--a~g~~~~~~~li~~~d~~Isk~iLG~tlTs~~~gGS~a~~~vh~ev~ 340 (469) T protein:vir:10 264 RKMAALARSVRGGINAGVGLAQ-GQILELLG--VSGNLPDIRRAIEGHDRSIALSGLAHFLNLDGKGGSYALASVLEDPF 340 (469) T ss_pred HHHHHHHHHHhcCCceEEEccC-CceEEEee--cCCCchHHHHHHHHHHHHHHHHHhcccccccCccchhhHHHHHHHHH Confidence 11111112 334442 23444443 3445556778899999999999986554432 222222344433222 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhh Q lcl|NC_011045. 380 ELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDP 459 (536) Q Consensus 380 E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~ 459 (536) +. .+..-...|...|..-||.+++.+.+ |.-.+. -+++|... .. +.......++.+.+++-. 
T Consensus 341 ~d--~~~sDa~~i~~tln~~li~~l~~lN~--g~~~~~----P~~~~~~~-e~-----~~~~~a~~i~~l~~~G~~---- 402 (469) T protein:vir:10 341 TQ--AVHAYATSICRIANQHIIEDLVDINF--GVDTPA----PVLTFDPI-GS-----RQDLTAAAVKLLYDAGVF---- 402 (469) T ss_pred HH--HHHHHHHHHHHHHHHHHHHHHHHhcC--CCCCCc----cEEEecCC-CC-----cHHHHHHHHHHHHhcCCc---- Confidence 21 22223333333343445555554432 211111 23444321 11 111123344444455421 Q ss_pred cCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCCC Q lcl|NC_011045. 460 DINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPG 535 (536) Q Consensus 460 ~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~ 535 (536) ++.+....++.+.+|+++. ...+.+....+..+ . ..+.++ ..+................+.+.|+-| T Consensus 403 -~~~~~~~~~~~e~~gip~~---~~~~~~~~~~~~~~--~-~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~l~da 469 (469) T protein:vir:10 403 -DDDPAVKRAIRQRFNLPSE---LNDTPSAEPEEPAA--V-PNQSAA--PARTRSSGNADARARAPKADQGVLFDA 469 (469) T ss_pred -cCccccHHHHHHHhCCCCC---CCCcccccchhccc--C-CCCCcc--ccccCCCCCcccccccCCChHHhhccC Confidence 1223345667889999432 22233221111100 0 000000 000000000111111122223334444 No 197 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=84.37 E-value=0.061 Score=27.24 Aligned_cols=362 Identities=10% Similarity=0.016 Sum_probs=128.0 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHhccccc---CCCCCccc--c--ccccccc-chHHHHHHHHHHHHHHhhcCCCccee Q lcl|NC_011045. 14 KSVYERLKNDRAPYETRAQNCAQYTIPSLF---PKDSDNAS--T--DYVTPWQ-AVGARGLNNLASKLMLALFPMQTWMR 85 (536) Q Consensus 14 ~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~---~~~~~~~~--~--~~~~~~d-st~~~a~~~Laa~l~~~ltP~~~Wf~ 85 (536) -+.=.-+...++.+...|. -++.. .+..-.+. . ....... 
++--.|++.+|+.+.+ + | | + T Consensus 1 ~~~~~~~~~~k~~~~~~~~------~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~V~~ci~~ia~~ia~-l-p---~-~ 68 (409) T protein:vir:96 1 MAKENIVTRIKKKLIDNWI------DQSASKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMAS-L-P---L-K 68 (409) T ss_pred CccccchhhhhhHHhhhhh------ccccccccccccccCccccccchhhHhhhHHHHHHHHHHHHhhhh-C-c---e-E Confidence 1111112222333344433 11110 00000000 0 0011112 2233445555555432 2 3 2 2 Q ss_pred ccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcc----ChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEE Q lcl|NC_011045. 86 LTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SNS----YRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKL 160 (536) Q Consensus 86 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~ 160 (536) +--.... .+ .-+...|. +-| .+.-....+.++..+|||.+++..+..+.+..+.. T Consensus 69 ~~~~~~~-------------~~-------~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~ 128 (409) T protein:vir:96 69 MYEDYKV-------------VN-------TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFL 128 (409) T ss_pred Eeecccc-------------cc-------hhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEE Confidence 2111100 00 01112222 222 33445677788899999999987766665555555 Q ss_pred EecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccccc Q lcl|NC_011045. 161 YRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGS 240 (536) Q Consensus 161 ~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~ 240 (536) +|.+.+-+..+.++... + | .+ ....|... T Consensus 129 l~~~~v~v~~~~~~~~~--~---------------------------------y-~~------------~~~~g~~~--- 157 (409) T protein:vir:96 129 LNPDVVEMLIENQSREL--Y---------------------------------Y-SI------------HAATGNKL--- 157 (409) T ss_pred EcCceeEEEEeCCCcEE--E---------------------------------E-EE------------EcCCceEE--- Confidence 55555544444332110 0 0 00 00011100 Q ss_pred ccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhcc------- Q lcl|NC_011045. 
241 DGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTK------- 313 (536) Q Consensus 241 ~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~------- 313 (536) .+ ...-.+++|-....+..||.||...+...+...+.+.+..... ....+.+++..++.++.+.... T Consensus 158 --~~--~~~evih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~--~~~~~~~i~~~~~~l~~e~~~~~~~~~~~ 231 (409) T protein:vir:96 158 --IV--HNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTE--MQKPDSFMLKYGSNVSTEKRQQVLEDFKQ 231 (409) T ss_pred --EE--ccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHh--cCCCceeEEecCCCCCHHHHHHHHHHHHH Confidence 00 0011334443234466899999988777766666665554332 2223345665555555443321 Q ss_pred --CCCcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccCC---CCCCCHHHHHHHHHHHHHHhhh Q lcl|NC_011045. 314 --AQTGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRT---GERVTAEEIRYVASELEDTLGG 387 (536) Q Consensus 314 --~~~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~---~~r~TAtEi~~r~~E~~~~LG~ 387 (536) ...|.+.. -.++....++.. +.+.+. .+..+.....|-++|-......-. +..-+++|.. ..=....|.| T Consensus 232 ~~~n~g~~~v-l~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~--~~f~~~~l~P 307 (409) T protein:vir:96 232 YYEENGGILF-QEPGVEIEPLPKKYVSEDI-VASENLTRERVANVFQLPSIFLNARSNTNFAKNEELN--RFYLQHTLLP 307 (409) T ss_pred HhhcCCCeee-cCCCceEEEcCCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH--HHHHHHHHHH Confidence 11222211 112223333332 234443 234444566788888543211111 1222333222 1122233555 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCC-CcceEEEE-echHH---HHHHHHHHHHHHHH----HHHHH---hhcch Q lcl|NC_011045. 388 VYSILSQELQLPLVRVLLKQLQATQQIPELP-KEAVEPTI-STGLE---AIGRGQDLDKLERC----VAAWA---ALAPM 455 (536) Q Consensus 388 v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~-~~~v~v~~-vs~La---~a~r~~~~~~l~~~----~~~~~---~~~p~ 455 (536) .+.++++|+-.- ++|+.. .....++| ++.|- ...|+..++.+.+. .+.+- .+.|. 
T Consensus 308 ~~~~ie~~l~~~-------------Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi 374 (409) T protein:vir:96 308 IVKQYEEEFNRK-------------LLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPV 374 (409) T ss_pred HHHHHHHHHHhh-------------cCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC Confidence 555555554332 333211 11123333 22221 11222222222110 01100 01110 Q ss_pred -hhh------h--cCCHHHHHHHHHHHcCCChhhccCC Q lcl|NC_011045. 456 -RDD------P--DINLAMIKLRIANAIGIDTSGILLT 484 (536) Q Consensus 456 -~~~------~--~id~d~~~~~~a~~~Gv~p~~i~rs 484 (536) -.| . .+|.....+.-..+=+- .-=.+ T Consensus 375 ~ggD~~~~~~n~~~~~~~~~~~~~~~gG~~---n~~e~ 409 (409) T protein:vir:96 375 EGGDKPLISGDLYPIDTPLELRKSLKGGDK---NVNES 409 (409) T ss_pred CCcceeeecccccccccchhhcccccCCCC---CcCCC Confidence 000 0 11111111111111000 00011 No 198 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=84.33 E-value=0.061 Score=27.23 Aligned_cols=351 Identities=14% Similarity=0.073 Sum_probs=134.7 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccc-cchHHHHHHHHHHHHHHhhcC Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPW-QAVGARGLNNLASKLMLALFP 79 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~-dst~~~a~~~Laa~l~~~ltP 79 (536) |.--. +.. ...+..+....+.......+....... ..... .+.. .++--.|++.+|+.+. 
.+ | T Consensus 1 Mg~~~----~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~v~~---~~al~~~~v~~~i~~ia~~ia-~~-p 64 (385) T protein:vir:10 1 MGLLT----PRN-----FNKRKAKNMVYPSNPAFFTTTVGGMQL--SYVSA---LSALQNTNVYSVINRIASDVA-SA-H 64 (385) T ss_pred Ccccc----chh-----cccccccccccccchhhhhhhccccCc--cccCH---HHhhccHHHHHHHHHHHHHHh-hC-c Confidence 44210 000 000011111111111111111111100 00111 1111 2333345555555553 32 3 Q ss_pred CCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhc----cChHHHHHHHHHHHhhCcEEEEEecCCCCce Q lcl|NC_011045. 80 MQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESN----SYRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~ 155 (536) | ++ .+.. ... .+.+- +.+.=...+..++..+|||.+++..+. T Consensus 65 ---~-~v--~~~~-------------~~~-----------ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~---- 110 (385) T protein:vir:10 65 ---F-KT--ENTA-------------TLN-----------RLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---- 110 (385) T ss_pred ---e-ee--eccc-------------hhh-----------hhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---- Confidence 1 22 1100 111 12222 233445556678889999999985321 Q ss_pred eeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCc Q lcl|NC_011045. 156 NPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGM 235 (536) Q Consensus 156 ~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~ 235 (536) ...+|+....|....++. -.+|.... .. + +. T Consensus 111 --~~~~p~~~~~v~~~~~~~-----------------------------------~~~~~~~~-~~-~----------~~ 141 (385) T protein:vir:10 111 --LEHIPNSDVQINYLPGNM-----------------------------------GIVYTVLE-SN-D----------RP 141 (385) T ss_pred --eeEeecCCceEEEEEcCC-----------------------------------ceEEEEEE-cC-C----------ce Confidence 345554433332211110 00111110 00 0 00 Q ss_pred cccccccccccccCceEEEeeeecCC--CccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccc-ccchhhhc Q lcl|NC_011045. 
236 EVQGSDGTYPKEACPYIPIRMVRLDG--ESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAG-ITQPRRLT 312 (536) Q Consensus 236 ~i~~~~~~~~~~~~P~~~~rw~~~~g--e~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g-~~~~~~~~ 312 (536) .. . +..--++++|....++ ..||.||...+...+.......+.......-...|.+++.-++ ..+.+... T Consensus 142 ~~-----~--~~~~eiihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~ 214 (385) T protein:vir:10 142 QM-----V--LRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLE 214 (385) T ss_pred EE-----E--EccccEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHH Confidence 00 0 0111234455333333 4689999999999999999999999999999999988777554 44443211 Q ss_pred ----------cCCCcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-cc-cCCCCCCCHHHHHHHHH Q lcl|NC_011045. 313 ----------KAQTGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AV-QRTGERVTAEEIRYVAS 379 (536) Q Consensus 313 ----------~~~~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~-~~~~~r~TAtEi~~r~~ 379 (536) .+.+..-+.--.++....++.. +.+.+...+..+.....|-++|-... ++ ..+...-|...+.+... T Consensus 215 ~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~ 294 (385) T protein:vir:10 215 SAREEFEKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKA 294 (385) T ss_pred HHHHHHHHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHH Confidence 1111110111122223333332 23455444555666788888885432 11 11222223333322233 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHH---HHHHHHHHHHHH--HH--HHHH-- Q lcl|NC_011045. 380 ELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEA---IGRGQDLDKLER--CV--AAWA-- 450 (536) Q Consensus 380 E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~---a~r~~~~~~l~~--~~--~~~~-- 450 (536) .....|.|.+.++.+++-.-+ + ...+++.+. 
+|-+ ..|...++.+.+ ++ +.+- T Consensus 295 ~~~~~l~P~~~~ie~~l~~~l-------------~----~~~~~f~~~-~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~ 356 (385) T protein:vir:10 295 TYLANLNSYVNPIVDELRLKM-------------N----APDLELDIK-DMLDVDDSALINQVSNLAKSGVLGAEQAQFI 356 (385) T ss_pred HHHHHHHHHHHHHHHHHHHhh-------------C----CceEEeech-hhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 334456666666666653211 1 123444322 2211 112222222222 11 1111 Q ss_pred ----hhcchhhhhcCCHHHHHHHHHHHcCCChhh Q lcl|NC_011045. 451 ----ALAPMRDDPDINLAMIKLRIANAIGIDTSG 480 (536) Q Consensus 451 ----~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~ 480 (536) .+.|+..+.....-..++ ..+. -.. T Consensus 357 ~g~~p~p~~~~~~~~~~~~~~~-~g~~----~dn 385 (385) T protein:vir:10 357 LTRSGFLPDNLPEFKPLTTQVK-GGDE----GDN 385 (385) T ss_pred hCCCccCCCCCccccCcccccC-CCCC----CCC Confidence 111222211100000000 0000 011 No 199 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=83.87 E-value=0.065 Score=27.09 Aligned_cols=364 Identities=11% Similarity=0.015 Sum_probs=144.7 Q ss_pred HHHHHHHHHhhhH-HHHHHHHHHHhcccccCCCCCc-ccccccccccchHHHHHHHHHHHHHHhhcCCCcceeccCChhh Q lcl|NC_011045. 15 SVYERLKNDRAPY-ETRAQNCAQYTIPSLFPKDSDN-ASTDYVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYE 92 (536) Q Consensus 15 ~r~~~l~~~R~~~-e~~w~e~~~~~~P~~~~~~~~~-~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~ 92 (536) .-|+.+...|+.. ...+-+..+.+-.......+.. +.... .=.++--.|++.+|+.+.+ + |- ..++-. +.. 
T Consensus 1 ~~f~~~f~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~~--l~~~~v~~~i~~Ia~~iA~-~-p~-~~~~~~--~~~ 73 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTTPAELAEAIGLSYDTYTGKRISSQRA--MRLTAVYSCVRVLAESVGM-L-PC-SLYKIS--GTL 73 (413) T ss_pred CccchhhccCccCCccchHHHHHhhhcCcccccCceechhhh--hccHHHHHHHHHHHHhhhh-C-ce-EEEEec--CCc Confidence 3344443333321 1112222222211111111100 00000 0123344555666665532 2 20 112221 111 Q ss_pred hhhhccChhHHHHHHHHHHHHHHHHHHHHH-h----ccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEE Q lcl|NC_011045. 93 AKQLLSDPDGLAKVDEGLSMVERIIMNYIE-S----NSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYV 167 (536) Q Consensus 93 ~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~----snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~ 167 (536) -.. + .+.-+...|+ + -+.+.-+...+.++...|||.+++..+. +.+..+..++.+.+. T Consensus 74 ~~~----------~------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~-g~~~~L~~l~~~~v~ 136 (413) T protein:vir:48 74 KTR----------V------VDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKAL-GEVVELLPIDPGCVE 136 (413) T ss_pred cee----------e------cccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeCC-CcEEEEEEEcCceEE Confidence 000 0 0111222222 1 3455667778889999999999986553 445445555555555 Q ss_pred EeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccc Q lcl|NC_011045. 168 VQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKE 247 (536) Q Consensus 168 v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~ 247 (536) +..|.+|.+ +| ....+ ++ .... +...+ T Consensus 137 ~~~~~~~~~--~y----------------------------------~~~~~--~g-~~~~---~~~~e----------- 163 (413) T protein:vir:48 137 PKLNSQWQP--VY----------------------------------QVTFP--DG-SVDV---LTQDE----------- 163 (413) T ss_pred EEEcCCceE--EE----------------------------------EEEec--Cc-eEEE---Ecccc----------- Confidence 555554432 11 00000 01 1100 01111 Q ss_pred cCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccC---------C--- Q lcl|NC_011045. 
248 ACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKA---------Q--- 315 (536) Q Consensus 248 ~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~---------~--- 315 (536) ++++|-...++ .||.||...+...+.......+.......-...|..++.-++..+++..... + T Consensus 164 ---vih~~~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~~~~~~~~~~~~g~~n 239 (413) T protein:vir:48 164 ---IWHVRTLTLDG-LVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYERLKKDFEERHTGLGN 239 (413) T ss_pred ---EEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhcCccc Confidence 23333222233 7999999999999999999988888888888888888877766555532210 0 Q ss_pred CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-ccc-C-CCCCCCHHHHHHHHHHHHHHhhhhHHH Q lcl|NC_011045. 316 TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQ-R-TGERVTAEEIRYVASELEDTLGGVYSI 391 (536) Q Consensus 316 ~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~-~-~~~r~TAtEi~~r~~E~~~~LG~v~~r 391 (536) .|.+.. -.++....++.. +.+.+ ..+..+..+..|-.+|-... +.. . ++..-++++.. .. T Consensus 240 ~g~~~v-l~~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~--------------~~ 303 (413) T protein:vir:48 240 AHRPMI-LEMGLDWKSMALNAEDSQ-FLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELG--------------LG 303 (413) T ss_pred cCccee-cCCCceEEeccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHH--------------HH Confidence 011111 122233344443 23444 34555566777888875432 111 1 12222333222 11 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEE-echHHH---HHHHHHHHHHHH--H--HHHH---Hhhcch-hhhh Q lcl|NC_011045. 392 LSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTI-STGLEA---IGRGQDLDKLER--C--VAAW---AALAPM-RDDP 459 (536) Q Consensus 392 l~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~-vs~La~---a~r~~~~~~l~~--~--~~~~---~~~~p~-~~~~ 459 (536) +...-+.|++.++-..+.+. ++++-....+.++| ++.|-. ..|....+.+.+ + .+.+ -.+.|. -.|. 
T Consensus 304 f~~~~i~P~~~~ie~~l~~~-L~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~ggD~ 382 (413) T protein:vir:48 304 FINYSLVPYLTRIEQRINTG-LVRESKQGKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDV 382 (413) T ss_pred HHHHHHHHHHHHHHHHHHhh-ccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcce Confidence 23334455555444444332 33332222333443 222211 111222222111 0 0000 011110 0000 Q ss_pred ------cCCHHHHHHH--------HHHHcCC Q lcl|NC_011045. 460 ------DINLAMIKLR--------IANAIGI 476 (536) Q Consensus 460 ------~id~d~~~~~--------~a~~~Gv 476 (536) +...+.+.+. -.+.... T Consensus 383 ~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~ 413 (413) T protein:vir:48 383 YLTPMNMTTSPSAGDDNGKKKESGDADKTAS 413 (413) T ss_pred eeccccccccccccccCCCCCCCCCccccCC Confidence 0000000000 0000000 No 200 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=81.39 E-value=0.086 Score=26.42 Aligned_cols=437 Identities=10% Similarity=0.076 Sum_probs=157.4 Q ss_pred CCC---ccccccHHHHHHHHHHHHH------Hh-----------hhHHHHHHHH-HHHhcccccCCCCCcc--ccccccc Q lcl|NC_011045. 1 MAE---KRTGLAEEGAKSVYERLKN------DR-----------APYETRAQNC-AQYTIPSLFPKDSDNA--STDYVTP 57 (536) Q Consensus 1 Ma~---~~~~~~~~~~~~r~~~l~~------~R-----------~~~e~~w~e~-~~~~~P~~~~~~~~~~--~~~~~~~ 57 (536) |.+ +..++.+.++..-+....= .. +.....+... -.+.-|......-..+ .+...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (574) T protein:vir:80 1 MPKWLDKALGIEKSSIEETRNMENYKMHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRN 80 (574) T ss_pred CcchhhhhhccchhhHHHHHhhhhhccccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCC Confidence 655 4444444433222111110 00 0011111110 0011110000000011 1111111 Q ss_pred ccch------------HHHHHHHHHHHHHHhh-----cCCC-cceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 
58 WQAV------------GARGLNNLASKLMLAL-----FPMQ-TWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMN 119 (536) Q Consensus 58 ~dst------------~~~a~~~Laa~l~~~l-----tP~~-~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~ 119 (536) +.++ ...|++.-++-+.+.. +.+. ||. +...+..-..... +.. .. ..++. T Consensus 81 ~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~-i~~kd~~~~~~~~------~~~-~~----~~l~~ 148 (574) T protein:vir:80 81 SQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYE-IRLKDIEAEPTSH------DIA-NI----KRIES 148 (574) T ss_pred cccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceE-EEEeccCCCccch------hhh-hh----hHHHH Confidence 1111 1233344344444322 1122 553 2111111000000 000 01 11223 Q ss_pred HHHh---------ccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHH Q lcl|NC_011045. 120 YIES---------NSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGAL 190 (536) Q Consensus 120 ~l~~---------snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l 190 (536) .|+. ..|..-+..++.|+.++||+.+++..+..+.++.+..++...+.+..|.+|.+..-- T Consensus 149 ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~G~~~~L~pl~p~~V~v~~d~~~~~~~~~---------- 218 (574) T protein:vir:80 149 FLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKVFDKDGNFIKFDTVDPTTIFLATNGEGKLIKNG---------- 218 (574) T ss_pred HHhccCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccccccCc---------- Confidence 3322 234456677888899999999988877777776666666677777777666332200 Q ss_pred HHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccccCceEEEeeeecC---CCccccc Q lcl|NC_011045. 191 PEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLD---GESYGRS 267 (536) Q Consensus 191 ~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~---ge~YGrg 267 (536) ..+|+.++|...... 
..-=++++|.+..+ ...||.| T Consensus 219 -----------------------------------~~y~~~~~g~~~~~~------~~~eiih~~~~~~~~~~~~~~G~s 257 (574) T protein:vir:80 219 -----------------------------------ERFVQVIDNRIVAKF------NERELAFAVRNPRADIEVGQYGYP 257 (574) T ss_pred -----------------------------------eEEEEEeCCceEEEE------ccccEEEEeccCCCCccccccccc Confidence 011122222111000 00113444433332 2469999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhCCceee--ccccccchhhhc----------cC--CCcc--eecCCcccccccc Q lcl|NC_011045. 268 YIEEYLGDLRSLENLQEAIVKMSMISSKVIGLV--NPAGITQPRRLT----------KA--QTGD--FVTGRPEDISFLQ 331 (536) Q Consensus 268 p~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv--~~~g~~~~~~~~----------~~--~~g~--~~~g~~~~~~~~~ 331 (536) |..-+...+.......+.......-...|..++ +.+..++.+... .+ ..|. ++. .+++...+ T Consensus 258 pi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~--~~G~~~~~ 335 (574) T protein:vir:80 258 ELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVS--AEDVKFVN 335 (574) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeec--CCCceEEE Confidence 999999999999988888888888888888554 333333433211 11 1111 221 22334444 Q ss_pred ccc-ccchhHHHHHHHHHHHHHHHHHhhhh--cccCCC--------CCC---CHHHHHHHHHHHHHHhhhhHHHHHHHHH Q lcl|NC_011045. 332 LEK-QADFTVAKAVSDAIEARLSFAFMLNS--AVQRTG--------ERV---TAEEIRYVASELEDTLGGVYSILSQELQ 397 (536) Q Consensus 332 ~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~~--------~r~---TAtEi~~r~~E~~~~LG~v~~rl~~E~l 397 (536) +.. ..+.+ ..+..+.....|-++|-... +...+. +.+ |+++. 
...-....|.|.+.+++.+|- T Consensus 336 l~~s~~D~q-fle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~--~~~f~~~tL~P~~~~ie~~ln 412 (574) T protein:vir:80 336 MTPSANDMQ-FEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEK--MQASQNKGLQPLLRFIEDTVN 412 (574) T ss_pred ccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHH--HHHHHHHHHHHHHHHHHHHHH Confidence 443 33444 33555666788888885432 111111 111 22221 112222344455555544443 Q ss_pred HHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCC Q lcl|NC_011045. 398 LPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGID 477 (536) Q Consensus 398 ~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~ 477 (536) .- ++++.. ..+.++|..+-- .+..+..++...+. + ..+.+++ +-+.+|.+ T Consensus 413 ~~-------------Ll~~~~-~~~~~~f~~~d~--~~~~~~~~~~~~~~--~--------G~lT~NE----~R~~lgl~ 462 (574) T protein:vir:80 413 TY-------------IVAEFG-EKYQFQFRGGDL--SAQLDKLKIIEQEG--K--------VFRTVNE----IRHDKGLE 462 (574) T ss_pred hh-------------hhhhcC-CceEEEecccch--hhHHHHHHHHHHHh--C--------CccCHHH----HHHHhCCC Confidence 22 333332 345566543211 11111111111111 0 0112222 22223333 Q ss_pred hh----------hccCCHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhhcCcchHHhhhhcCCCCCCC Q lcl|NC_011045. 478 TS----------GILLTEEQKQQKMAQQSMQMGMDNGA-AALAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) Q Consensus 478 p~----------~i~rs~~ev~~~~~q~~~q~~~~~~a-~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~ 536 (536) |- .+..-.+..+......+.+++-..+. .+..+.........|. 
....-+...+|.+- T Consensus 463 Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~-~~~~d~~~~~~~~~ 531 (574) T protein:vir:80 463 PIKGGDVILNGVHIQAIGQALQEEQLEYQRSQDRLNRLLELSGGDVEQPEPEEPK-DSQNDTDVSFQDEQ 531 (574) T ss_pred CCCCCCEeeeccceeecccccccccCCccchhccccccccccCCCCCCCCCCCCC-Cccccccchhhhhh Confidence 31 11100000000000000000000000 0000000000000000 00000011111111 No 201 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=80.27 E-value=0.096 Score=26.15 Aligned_cols=377 Identities=11% Similarity=0.022 Sum_probs=139.8 Q ss_pred HHHHHHHHHhhcCC----CcceeccCCh-hh----hhhhccChhHHHHHHHHHHHHHHHHHHHH---------------- Q lcl|NC_011045. 67 NNLASKLMLALFPM----QTWMRLTISE-YE----AKQLLSDPDGLAKVDEGLSMVERIIMNYI---------------- 121 (536) Q Consensus 67 ~~Laa~l~~~ltP~----~~Wf~l~~~d-~~----~~~~~~~~~~~~~v~~~L~~ve~~~~~~l---------------- 121 (536) --||+| ..-.+|. .|||+-...- +. +............-..+.-.|-+.+...+ T Consensus 1 ~~~~~~-~~~~~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~~~~~~~~ 79 (518) T protein:vir:10 1 MLLANG-QTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) T ss_pred CcccCc-eeecCchhhhhhhhhhcccccccccceecccccchhhHHHhhhHHHHHHHHHHHHhhccCceEEEEEcCCCce Confidence 111111 0111232 2333311100 00 00000000000000011122222222222 Q ss_pred -----------Hhc----cChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEecc Q lcl|NC_011045. 122 -----------ESN----SYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIA 186 (536) Q Consensus 122 -----------~~s----nf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t 186 (536) .+= +.+.-+...+.++.++||+++++..+..+.+..+..++.+.+.+..|..+....+ ++.. 
T Consensus 80 ~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~~~~~~~~y--~~~~- 156 (518) T protein:vir:10 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEY--YFQA- 156 (518) T ss_pred eccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEcCCCCEEEE--EEEe- Confidence 221 2233456677788899999999987776666556666666666665543211100 0000 Q ss_pred HHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccccCceEEEeeeecCCCcccc Q lcl|NC_011045. 187 FGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGR 266 (536) Q Consensus 187 ~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGr 266 (536) ....+.+. .. |..-=++++|+...+|-.||. T Consensus 157 --------------------------------~~~~~~~~---~~--------------~~~~eViHir~~s~dg~~~G~ 187 (518) T protein:vir:10 157 --------------------------------GAGVGTQL---VS--------------FADDEVVPIRFFNPDGLERGL 187 (518) T ss_pred --------------------------------cCCccceE---EE--------------ecCCcEEEecCCCCCcccccc Confidence 00000000 00 001114555665566777999 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccC---------C---CcceecCCccccccccccc Q lcl|NC_011045. 267 SYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKA---------Q---TGDFVTGRPEDISFLQLEK 334 (536) Q Consensus 267 gp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~---------~---~g~~~~g~~~~~~~~~~~~ 334 (536) ||..-+...+.......+.......-...|..++.-++.++.+..... + .|.+.. -.++....++.. T Consensus 188 spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~k~~~~~~~~G~~nag~v~v-L~~G~~~~~l~~ 266 (518) T protein:vir:10 188 SLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMV-VEEGMEPIPLQL 266 (518) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcCccccCcceE-cCCCceEEEccC Confidence 999998888888888888888888888888877766665555432110 0 011111 112222333332 Q ss_pred -ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_011045. 
335 -QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQ 413 (536) Q Consensus 335 -~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~ 413 (536) ..+.+. .+..+..+..|-++|-......-..++-|-.-+.+.... +...-+.|++.++-..+.+. + T Consensus 267 s~~D~q~-le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~~~-----------f~~~tL~P~l~~ie~~ln~~-L 333 (518) T protein:vir:10 267 TAVEMQF-IEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRA-----------FYRDTMAIPIARIQSAMDKY-V 333 (518) T ss_pred ChhHHHH-HHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHH-----------HHHHHHHHHHHHHHHHHHHh-h Confidence 234443 344455567788888543211111111122212111111 22333556655555555432 4 Q ss_pred CCCCCCcceEEEE-echHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhh-------ccCCH Q lcl|NC_011045. 414 IPELPKEAVEPTI-STGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSG-------ILLTE 485 (536) Q Consensus 414 lp~~~~~~v~v~~-vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~-------i~rs~ 485 (536) +++... ...++| ++.| .|. +......++..+-+.+ .+.+++ +-+.+|.+|.. ++.+. T Consensus 334 ~~~~~~-~~~~~fd~~~l---lr~-D~~~r~~~~~~~~~~G------~lT~NE----~R~~~Gl~pie~~~gD~~~~~~n 398 (518) T protein:vir:10 334 GQYWVR-KNRMKFDIDDV---IQP-DWEAKSESTQKMVNSG------VATPNE----GREIMGLPRSDDPKADELYANSA 398 (518) T ss_pred cccccC-CceEEEechhh---hcc-CHHHHHHHHHHHHhCC------CcCHHH----HHHHhCCCCCCCCCCCeeeeccc Confidence 444332 234444 2233 221 1111122222211111 123333 22344544421 01000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhc--CCCCCCC Q lcl|NC_011045. 486 EQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADS--VGLQPGI 536 (536) Q Consensus 486 ~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~--~~~q~~~ 536 (536) +..+.......... +++... ...+.. ..+.+ +. 
.+-.|.+ T Consensus 399 --~~pl~~~~~~~~~g-~~~~~~-----~~~~~~--~~~~~-~~~~~~~~~~~ 440 (518) T protein:vir:10 399 --LQPLGATPDGAVEG-EEAPAP-----KRPAST--PVASL-DQSPPTSVPGL 440 (518) T ss_pred --ceecccccccccCC-CCCCCC-----CCCCcc--ccccc-cccccccCCCC Confidence 00000000000000 000000 000000 00000 00 0001111 No 202 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=80.10 E-value=0.098 Score=26.11 Aligned_cols=378 Identities=11% Similarity=0.041 Sum_probs=140.1 Q ss_pred HHHHHHHHHhhcCC----CcceeccCCh-h----hhhhhccC----hhHHHHHHHHHHHHHH------------------ Q lcl|NC_011045. 67 NNLASKLMLALFPM----QTWMRLTISE-Y----EAKQLLSD----PDGLAKVDEGLSMVER------------------ 115 (536) Q Consensus 67 ~~Laa~l~~~ltP~----~~Wf~l~~~d-~----~~~~~~~~----~~~~~~v~~~L~~ve~------------------ 115 (536) --||+|= ...+|+ .+|++-...- + .+...... -.....|..-.+.+.. T Consensus 1 ~~~~~~~-~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~~~~~~~~ 79 (518) T protein:vir:78 1 MLLANGQ-TLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) T ss_pred CcccCce-eeccchhhhhhhhhhhcccccceeceecccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEEEEcCCccc Confidence 0111110 000121 1232210000 0 00000000 0000011111111111 Q ss_pred -----HHHHHHHhcc----ChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEecc Q lcl|NC_011045. 116 -----IIMNYIESNS----YRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIA 186 (536) Q Consensus 116 -----~~~~~l~~sn----f~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t 186 (536) .+...+.+=| .+.=+..++.++..+||+++++..+..+.++.+..++.+.+.+..|.++.... 
T Consensus 80 ~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~~~~~~~~~-------- 151 (518) T protein:vir:78 80 EEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYE-------- 151 (518) T ss_pred cccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEEcCCCCEEE-------- Confidence 1112222222 33345677788889999999998776666655555555655555554321100 Q ss_pred HHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccccCceEEEeeeecCCCcccc Q lcl|NC_011045. 187 FGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGR 266 (536) Q Consensus 187 ~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGr 266 (536) |+.......+.+.. . |..-=++++|+...+|..||. T Consensus 152 ---------------------------y~~~~~~~~~~~~~---~--------------~~~~eIiHir~~~~dg~~~G~ 187 (518) T protein:vir:78 152 ---------------------------YYFQAGAGVGTQLV---S--------------FADDEVVPIRFFNPDGLERGL 187 (518) T ss_pred ---------------------------EEEEecCCccceeE---E--------------ecCCcEEEecCCCCCcccccc Confidence 00000000000000 0 001124566666667778999 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccC---------C---CcceecCCccccccccccc Q lcl|NC_011045. 267 SYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKA---------Q---TGDFVTGRPEDISFLQLEK 334 (536) Q Consensus 267 gp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~---------~---~g~~~~g~~~~~~~~~~~~ 334 (536) ||..-+...+.......+.......-...|..++.-++.++++..... + .|.+.. -.++....++.. T Consensus 188 Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~ls~e~~~~~k~~~~~~~~G~~nag~~~v-L~~G~~~~~l~~ 266 (518) T protein:vir:78 188 SLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSPEAQQRLREQFDRAHAGSSNTGKTMV-VEEGMEPIPLQL 266 (518) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcCcccCCceeE-cCCCceEEeccC Confidence 999999988888888888888888888888877776666665543210 0 111111 112223333332 Q ss_pred -ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_011045. 
335 -QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQ 413 (536) Q Consensus 335 -~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~ 413 (536) +.+.+. .+..+..+..|-++|-......-..+.-|-.-+.+.... +...-+.|++.++-..+.+ .+ T Consensus 267 ~~~d~q~-le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~~~~-----------f~~~tL~P~~~~ie~eln~-~L 333 (518) T protein:vir:78 267 TAVEMQF-IEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRA-----------FYRDTMAIPIARIQSAMDK-YV 333 (518) T ss_pred ChhHHHH-HHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHH-----------HHHHHHHHHHHHHHHHHHH-hh Confidence 234554 344455567788888543211111112122222222111 2233355555555554433 23 Q ss_pred CCCCCCcceEEEE-echHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhh-------ccCCH Q lcl|NC_011045. 414 IPELPKEAVEPTI-STGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSG-------ILLTE 485 (536) Q Consensus 414 lp~~~~~~v~v~~-vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~-------i~rs~ 485 (536) +++... ...++| ++.| .|. +......++..+-+.+ .+.+++ +-+.+|.+|.. ++.+. T Consensus 334 ~~~~~~-~~~~~fd~~~L---lr~-D~~~r~~~~~~~~~~G------~lT~NE----~R~~~gl~pie~~~gD~~~v~~n 398 (518) T protein:vir:78 334 GQYWVR-KNRMKFDIDDV---IQP-DWEAKSESTQKMVNSG------VATPNE----GREIMGLPRSDDPKADELYANSA 398 (518) T ss_pred cccccC-cceEEeechhh---hcc-CHHHHHHHHHHHHhCC------CcCHHH----HHHHhCCCCCCCCCCceeeeccc Confidence 443322 223444 2333 221 2222222222222211 123333 22334554421 01000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcch-HHhhhhcCCCCCCC Q lcl|NC_011045. 486 EQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEA-MAAAADSVGLQPGI 536 (536) Q Consensus 486 ~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~-~~~~~~~~~~q~~~ 536 (536) +..+.......... +++... 
.+.+..+.+ ..+.+.++ .|.+ T Consensus 399 --~~pl~~~~~~~~~g-~~~~~~-----~~~~~~~~~~~~~~~~~~--~~~~ 440 (518) T protein:vir:78 399 --LQPLGATPDGAVEG-EEAPAP-----KRPASTPVASLDQSPPAS--VPGL 440 (518) T ss_pred --ceecccccccccCC-CCCCCC-----CCCCcccccccccCcccc--CCCC Confidence 00000000000000 000000 000000000 00000000 1111 No 203 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=79.96 E-value=0.099 Score=26.08 Aligned_cols=373 Identities=10% Similarity=0.027 Sum_probs=136.2 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHH--HHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQN--CAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALF 78 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e--~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 78 (536) |.=-. ++.+.+ ++ +.+....|.. ....+-|.-....+..+-....-.-.++--.|++.+|+.+.+ + T Consensus 1 m~~~~----~~~~~~---~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~-l- 68 (412) T protein:vir:26 1 MNVIA----KENIVT---RI---KKKLIDNWIDQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMAS-L- 68 (412) T ss_pred Cccch----hhhhhh---hh---hhhHhhhhhcccccccccccccCCccccccchhhhhccHHHHHHHHHHHHhHhh-C- Confidence 33211 112222 11 2344555521 111111110000000000011111223344566666665533 2 Q ss_pred CCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----cChHHHHHHHHHHHhhCcEEEEEecCCCC Q lcl|NC_011045. 79 PMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SN----SYRVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) Q Consensus 79 P~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~~~~~~~ 153 (536) ||.-..-.+ . .+ ..+...|. 
+- +.+.-+...+.++.++|||.+|+..+..+ T Consensus 69 ---p~~~~~~~~-~-------------~~-------~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G 124 (412) T protein:vir:26 69 ---PLKMYEDYK-V-------------VN-------TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYH 124 (412) T ss_pred ---ceeEeeccc-c-------------cc-------chHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCC Confidence 332111110 0 00 11111222 22 23444667788999999999998877666 Q ss_pred ceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEec Q lcl|NC_011045. 154 NYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVE 233 (536) Q Consensus 154 ~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~ 233 (536) .+..+..+|.+...+..+.++... + |+ +. ..++ .... +. T Consensus 125 ~~~~L~~l~~~~v~v~~~~~~~~~--~---------------------------------y~-~~-~~~g-~~~~---~~ 163 (412) T protein:vir:26 125 QPSKLFLLNPDVVEMLIENQSREL--Y---------------------------------YS-IH-AATG-NKLI---VH 163 (412) T ss_pred cEEEEEEEcCceeEEEEeCCCcEE--E---------------------------------EE-EE-cCCc-eEEE---Ec Confidence 666666666666666665443211 1 00 00 0000 0000 00 Q ss_pred CccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhc- Q lcl|NC_011045. 234 GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLT- 312 (536) Q Consensus 234 g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~- 312 (536) ..+ .+++|.....+..||.||..-+...+...+.+.+..+..... .+-+++..++.++.+... 
T Consensus 164 ~~e--------------vih~~~~~~~~~~~G~s~i~~~~~~i~~~~a~~~~~~~~~~~--~~~~i~~~~~~l~~e~~~~ 227 (412) T protein:vir:26 164 NMD--------------MLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQK--PDSFMLKYGSNVGKEKRQQ 227 (412) T ss_pred ccc--------------EEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHhcCC--CCceEEecCCCCCHHHHHH Confidence 011 233333334567899999988877777776666554333222 233444444444444221 Q ss_pred ------c--CCCcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHH-H Q lcl|NC_011045. 313 ------K--AQTGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASEL-E 382 (536) Q Consensus 313 ------~--~~~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~-~ 382 (536) . ...|.+. --.++....++.. +.+.+. .+..+..+..|-++|-......-....-|-.-+.+..... . T Consensus 228 ~~~~~~~~~~~~g~~~-vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~~~f~~ 305 (412) T protein:vir:26 228 VLEDFKQYYEENGGIL-FQEPGVEIEPLPKKYVSEDI-VASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQ 305 (412) T ss_pred HHHHHHHHhhcCCCee-ecCCCceEEEcCCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHH Confidence 0 1122221 1122333444432 334442 3444445677888885432111111121222222222222 2 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCC-cceEEEE-echHHH---HHHHHHHHHHHHH----HHHH-H-- Q lcl|NC_011045. 383 DTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPK-EAVEPTI-STGLEA---IGRGQDLDKLERC----VAAW-A-- 450 (536) Q Consensus 383 ~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~-~~v~v~~-vs~La~---a~r~~~~~~l~~~----~~~~-~-- 450 (536) ..|.|.+.+++++|-.-| +++... ....++| ++.|-. ..|...++.+... .+.+ . T Consensus 306 ~~l~P~~~~ie~~ln~kL-------------l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~ 372 (412) T protein:vir:26 306 HTLLPIVKQYEEEFNRKL-------------LTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWE 372 (412) T ss_pred HHHHHHHHHHHHHHHhhc-------------CCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Confidence 235565555555543322 222110 1122333 222211 1122222222110 0111 0 Q ss_pred hhcch-hhh------h--cCCHHHHHHHHHHHcCCChhhccCC Q lcl|NC_011045. 
451 ALAPM-RDD------P--DINLAMIKLRIANAIGIDTSGILLT 484 (536) Q Consensus 451 ~~~p~-~~~------~--~id~d~~~~~~a~~~Gv~p~~i~rs 484 (536) .+.|. -.| . .+|..........+=+.+ -=.+ T Consensus 373 gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~gG~~n---~~e~ 412 (412) T protein:vir:26 373 DLPPVEGGDKPLISGDLYPIDTPLELRKSLKGGDKN---VNES 412 (412) T ss_pred CCCCCCCcCeeeecccccccccchhhcccccCCCCC---cCCC Confidence 11110 000 0 122111111111111110 0111 No 204 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=79.38 E-value=0.11 Score=25.95 Aligned_cols=319 Identities=13% Similarity=0.052 Sum_probs=122.6 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhh-cC Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLAL-FP 79 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l-tP 79 (536) |++.+..-.... +.++.. +.. ..|.+.+-..--....++|- +...+ .-+.. .| T Consensus 1 m~~~~~~~~~~~-----~~~~~~-~~~-------------~~~~~~~p~~~~~~~~~~~~-----~~~~~--~~~~~~~p 54 (346) T protein:vir:10 1 MKKQLRKNLTQN-----DRLQPQ-AQT-------------EIFSFGDPIPVLDRADILNY-----LECSA--MYEKWYNP 54 (346) T ss_pred CCcccCCCCCcc-----cccccc-cCe-------------EEEecCCcceecCchhHHHH-----HHHhh--cCCceEec Confidence 888542211110 011000 000 01111100000000000000 00000 00001 12 Q ss_pred CCcceeccCChhhhhhhcc-ChhHHHHHHHHHHHHHHHHHHHHHhcc---ChHHHHHHHHHHHhhCcEEEEEecCCCCce Q lcl|NC_011045. 80 MQTWMRLTISEYEAKQLLS-DPDGLAKVDEGLSMVERIIMNYIESNS---YRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~-~~~~~~~v~~~L~~ve~~~~~~l~~sn---f~~~~~~~~~dl~~~G~~~l~~~~~~~~~~ 155 (536) +-++..| .++.. 
..-....+ ......+...+..-| -...+.+++.|+.+||||.+++..+..+.+ T Consensus 55 p~~~~~l-------a~l~~~~~~h~~~i----~~k~n~l~~l~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~i~r~~~G~~ 123 (346) T protein:vir:10 55 PMSFDGL-------AKSLRSSTHHESAI----ITKANILLSTCEVDSRYLSRRDLSSFVKDYLVFGNAYFEVVRNRLGQV 123 (346) T ss_pred CCCHHHH-------HHHHHhhhhcchhh----hhhhhhHHHHHhCCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCcE Confidence 1111111 00000 00000000 000001111111111 123455677789999999999877766666 Q ss_pred eeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCc Q lcl|NC_011045. 156 NPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGM 235 (536) Q Consensus 156 ~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~ 235 (536) +.+..+|....-+..+.++.. |. + ...+|. T Consensus 124 ~~L~pl~~~~v~~~~~~~~~~-------------------------------------~~-~------------~~~~g~ 153 (346) T protein:vir:10 124 QRIESPLAKYVRKGLEAGQFY-------------------------------------YV-P------------QRFDHQ 153 (346) T ss_pred EEEEEecCCceEEEEcCCeEE-------------------------------------EE-E------------EccCCe Confidence 544444433332222222110 00 0 000111 Q ss_pred cccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec-cccccchhhhcc- Q lcl|NC_011045. 236 EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQPRRLTK- 313 (536) Q Consensus 236 ~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~-~~g~~~~~~~~~- 313 (536) ... +..--++.+|.....+..||.+|...++..+...+..++-....-.-...|.+++. ++..++.++... 
T Consensus 154 ~~~-------~~~~dIih~r~~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d~~l~~e~~~~i 226 (346) T protein:vir:10 154 EHE-------FAKGSIYHLLEPDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMSDASQKQEDVENI 226 (346) T ss_pred EEE-------EecccEEEecCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHHH Confidence 000 00111455555544577999999999998888888888888777777888887653 455545443221 Q ss_pred --------C-CC-cc-e-ec--CCcccccccccccc-cchhHHHHHHHHHHHHHHHHHhhhh----cccCCC-CCCCHHH Q lcl|NC_011045. 314 --------A-QT-GD-F-VT--GRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS----AVQRTG-ERVTAEE 373 (536) Q Consensus 314 --------~-~~-g~-~-~~--g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~----~~~~~~-~r~TAtE 373 (536) + ++ |. + +. +..+++...++... .+.+ ..+..+..++.|-.+|-.-. ....+. ..-++++ T Consensus 227 ~~~~~~~~g~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~q-f~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~ 305 (346) T protein:vir:10 227 RQQLKQSKGVGNFKNLFVHAPNGKKDGIQIIPIADVSAKDE-FFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVAD 305 (346) T ss_pred HHHHHHhcCccccCceeEecCCCCccceeEEecCCChhHHH-HHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHH Confidence 1 11 11 1 12 12334455555432 3444 33344555677888884321 111111 1223444 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHH Q lcl|NC_011045. 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIG 434 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~ 434 (536) ....- ....|.|...++. .+... 
+..+.+++..-+.|...+ T Consensus 306 ~~~~f--~~~~l~P~~~~ie------------e~n~~------L~~e~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 306 AAEVF--FITEIEPLQERLK------------EFNQW------LGQEVIKFKPSKLLQRTQ 346 (346) T ss_pred HHHHH--HHHHHHHHHHHHH------------HHHhh------cccceeeechhhhcccCC Confidence 33222 1223445555553 22211 112233433233333222 No 205 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=75.39 E-value=0.15 Score=25.14 Aligned_cols=368 Identities=12% Similarity=0.025 Sum_probs=140.4 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCC Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFPM 80 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~ 80 (536) |--.+..-.++ ..+ +=.+.|..+.... +...+..+..-. ...-+=.++--.|++.+|+.+.+ + T Consensus 1 m~~~~~~~~~~------~~~-----s~~~~w~~~~~~~-~~~~~~~g~~vt-~~~al~~~~v~~~i~~Ia~~iA~-l--- 63 (421) T protein:vir:10 1 MFIPQMFEGKK------RSV-----SGGGFWEAMLGGV-RSSHSKAGVMIT-PETALALSAVRACVTLLAESVAQ-L--- 63 (421) T ss_pred CCCcchhcccc------ccc-----CcchhhHHHhhhh-ccCcccCCceec-hHHhhccHHHHHHHHHHHHhhcc-C--- Confidence 32211100000 000 1123344433222 111111110000 00011233444566777666643 2 Q ss_pred CcceeccCC-hhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcc----ChHHHHHHHHHHHhhCcEEEEEecCCCCc Q lcl|NC_011045. 81 QTWMRLTIS-EYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SNS----YRVTLFEALKQLVVAGNVLLYLPEPEGSN 154 (536) Q Consensus 81 ~~Wf~l~~~-d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~~~~~~~~~ 154 (536) ||--..-. +..... ++ +.-+...|. +-| .+.-.+..+.++..+|||.+++..+..+. 
T Consensus 64 -p~~~~~~~~~g~~~~----------~~------~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~ 126 (421) T protein:vir:10 64 -PVELYRRDKNGGRQR----------AT------DHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGY 126 (421) T ss_pred -ceEEEEEcCCCceee----------cc------cchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCc Confidence 44211111 100000 00 111222332 223 45555667788889999999998777666 Q ss_pred eeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecC Q lcl|NC_011045. 155 YNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEG 234 (536) Q Consensus 155 ~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g 234 (536) +..+..++.+.+.+..|.+|.+- |+ ++ .. | T Consensus 127 ~~~L~~l~~~~v~v~~~~~g~~~--y~---~~-----------------------------------~~----------g 156 (421) T protein:vir:10 127 PKELIPINPKKVIVLKGPDGMPY--YE---IP-----------------------------------EI----------G 156 (421) T ss_pred EEEEEEecCceEEEEECCCceEE--EE---Ec-----------------------------------CC----------C Confidence 65555555566666555554221 10 00 00 1 Q ss_pred ccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccc----cchhh Q lcl|NC_011045. 235 MEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGI----TQPRR 310 (536) Q Consensus 235 ~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~----~~~~~ 310 (536) ..++ .++ +++.|....+| .||.||...+...+.......+.......-...|..++.-++- .+.+. T Consensus 157 ~~~~-------~~e--iih~~~~~~d~-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~ 226 (421) T protein:vir:10 157 ETLP-------MRM--MHHVKVFSLDG-YIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEK 226 (421) T ss_pred cEEc-------hhh--EEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHH Confidence 1010 011 34445444444 7999999999999998888888888888888888876643321 12222 Q ss_pred h---c-------cC-C-CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-cccC--CCCCCCHHHH Q lcl|NC_011045. 
311 L---T-------KA-Q-TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQR--TGERVTAEEI 374 (536) Q Consensus 311 ~---~-------~~-~-~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~--~~~r~TAtEi 374 (536) . . .+ . .|.+.. -.++....++.. ..+.+. .+..+..+..|-++|-.-. +... .+..-++++. T Consensus 227 ~~~~~~~~~~~~~g~~n~~~~~v-l~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~ 304 (421) T protein:vir:10 227 IDQLLAKWTDRYSGINNMFSVAL-LQEGMSYKQMSQDNEKAQL-LQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNIEHQ 304 (421) T ss_pred HHHHHHHHHHHhcCccccCccee-cCCCceEEecCCChhHHHH-HHHHHHhHHHHHHHhCCCHHHcCCCcCCccccHHHH Confidence 1 1 11 0 111111 122233344433 234442 3444556677888885432 1111 1212222222 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEE-echHHH---HHHHHHHHHHHHH----H Q lcl|NC_011045. 375 RYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTI-STGLEA---IGRGQDLDKLERC----V 446 (536) Q Consensus 375 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~-vs~La~---a~r~~~~~~l~~~----~ 446 (536) .. .=....|.|.+.++++||-.- ++++-....+.++| ++.|-+ ..|..-.+.+.+. . T Consensus 305 ~~--~f~~~tl~P~~~~ie~~ln~k-------------L~~~~~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~ 369 (421) T protein:vir:10 305 GL--QFVMYTLLAWLKRHEGALQRD-------------LLLPSERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSV 369 (421) T ss_pred HH--HHHHHHHHHHHHHHHHHHhhh-------------ccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCH Confidence 11 112224555555555554322 22221111222332 111111 1111111111110 0 Q ss_pred HHH-H--hhcch----------h---hhhcC--CHHHHHHHHHHHcCCChhhcc-CC Q lcl|NC_011045. 447 AAW-A--ALAPM----------R---DDPDI--NLAMIKLRIANAIGIDTSGIL-LT 484 (536) Q Consensus 447 ~~~-~--~~~p~----------~---~~~~i--d~d~~~~~~a~~~Gv~p~~i~-rs 484 (536) +.+ . .+.|. . +.+.. +.+.....-++ -..+. 
+| T Consensus 370 NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~e-----~d~~~~~~ 421 (421) T protein:vir:10 370 NDIRRMENLPPIAGGDKYLTPLNMVDSAQIIPGDKKPTAQQMAE-----IDTILSRT 421 (421) T ss_pred HHHHHHhCCCCCCCcceeeeccccccccccccCCCCcccccCcc-----cccccccC Confidence 000 0 00000 0 00000 00001111111 00111 11 No 206 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=74.99 E-value=0.15 Score=25.07 Aligned_cols=476 Identities=10% Similarity=0.017 Sum_probs=171.4 Q ss_pred CCCccccccHHHHHHH-----HHHHHHH--hhhHHHHHHHHHHHhcccccCCCCCccccccccc--ccchHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSV-----YERLKND--RAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTP--WQAVGARGLNNLAS 71 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r-----~~~l~~~--R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~--~dst~~~a~~~Laa 71 (536) |+++.--+-+...+.- -...++. ++--+.+|..-...+-|-. +. ..+..+ ..++...|++.++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~-----~~--~~L~~~~e~~~~~~~~i~~~~~ 73 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADLAKSPNSTQIPDHRIQSHNVGVNPPY-----NP--DRLAAFLELNETLATGIRKKSR 73 (651) T ss_pred CCCccceeeeeEEEeecccccccccccccccccchhhhcccCCCCCCCC-----CH--HHHHHHHhcChHHHHHHHHHhh Confidence 6653311111100000 0000000 1111112211111111111 11 112222 45667778888888 Q ss_pred HHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhcc----ChHHHHHHHHHHHhhCcEEEEE Q lcl|NC_011045. 
72 KLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNS----YRVTLFEALKQLVVAGNVLLYL 147 (536) Q Consensus 72 ~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~sn----f~~~~~~~~~dl~~~G~~~l~~ 147 (536) .+.+.=+=-.|=+.....+ .+......+..++..+...........| +..-+...+.|+.++||+|+=+ T Consensus 74 ~iag~g~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~iei 146 (651) T protein:vir:99 74 YEVGFGFDLVPAQGVDGDD-------ASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEM 146 (651) T ss_pred hhhccCceeeecccCCCCc-------cchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhh Confidence 8854311001111111111 1222233444444443333333333333 4456667778999999999854 Q ss_pred ecCCCCceeeEEEEecceEEEeeCCCCCeEE-EEEeE------eccHHHHHHHHhHHhhhc----cccCCCCceEEEEE- Q lcl|NC_011045. 148 PEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQ-MVTRD------QIAFGALPEDIRKAVEGQ----GGEKKADETIDVYT- 215 (536) Q Consensus 148 ~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~-i~r~~------~~t~~~l~~~~~~~~~~~----~~~~~~~~~~~v~~- 215 (536) -.++.+.++.+..+|+..+.+..+... ++. .++.. .++.... ..|-..+... ...++.+..+..+. T Consensus 147 Irn~~g~pv~L~~lp~~~~Rv~~~~~~-~~~~~~~ll~~~pn~~~~~~~~-~~~~q~~~~~~~~~~~~g~~~~~~~~~~~ 224 (651) T protein:vir:99 147 LTDIEGRPVGLAYVPARTVRVRRPQNR-FDQPRHPEEGRYVDGDVADIAS-RGYVQIRNGNRRYFGEAGDRYRGQEVVID 224 (651) T ss_pred hhcCccchhhhhhcChhheeeeccccc-ccchhhhhhhcccccccchhHH-HHHHHHHhcCcceEEEeeccccceeeeec Confidence 444444454455556665555544322 221 11111 1111110 0011100000 00111111111111 Q ss_pred -------EEEecCCCCceeE--EEEecCccccccccccccccCc---eEEEeeeecCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_011045. 216 -------HIYLDEDSGEYIR--YEEVEGMEVQGSDGTYPKEACP---YIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQ 283 (536) Q Consensus 216 -------~v~p~~~~~~~~~--~~~v~g~~i~~~~~~~~~~~~P---~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~ 283 (536) ..++......+.+ ...+.|.-.....+ ....+| .|++|.....+..||.||.+-++..+.....+. 
T Consensus 225 ~~~~~v~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~--~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~ 302 (651) T protein:vir:99 225 ESGDEPTIRYREDEESEREPIFVDRETGDVTTGDAN--GLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADEAAK 302 (651) T ss_pred cCCcceeEEeccCcceeeeeecccceeeeEEEcCCC--ceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHH Confidence 1111100000000 00111110000000 011222 566776655577899999999999999999999 Q ss_pred HHHHHHHHHHhCCceeec-cccccchhhhcc---------CCCcc--eecCC--------ccccccccccc-c-cchhHH Q lcl|NC_011045. 284 EAIVKMSMISSKVIGLVN-PAGITQPRRLTK---------AQTGD--FVTGR--------PEDISFLQLEK-Q-ADFTVA 341 (536) Q Consensus 284 ~~~~~~~~~a~~p~~lv~-~~g~~~~~~~~~---------~~~g~--~~~g~--------~~~~~~~~~~~-~-~~~~~~ 341 (536) +.......-...|..++. +++.++.+.... .+.|. ++++. ..++...++.. . .+.+. T Consensus 303 ~~~~~~f~NG~~p~gil~~~~~~ls~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qf- 381 (651) T protein:vir:99 303 DYNRDFFDNDTIPRMVIKVTGGELSEESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDF- 381 (651) T ss_pred HHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcCchhhHHH- Confidence 988888888888887765 455444433211 11111 22221 11233334432 2 24443 Q ss_pred HHHHHHHHHHHHHHHhhhhcc---cCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCC-- Q lcl|NC_011045. 342 KAVSDAIEARLSFAFMLNSAV---QRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPE-- 416 (536) Q Consensus 342 ~~~i~~~~~rI~~af~~~~~~---~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~-- 416 (536) .+..+.....|.++|-..... ..++..-|+++... .+...-+.|++.++-..+.+. 
++++ T Consensus 382 le~r~~~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~~--------------~f~~~tL~P~~~~ie~eln~k-Ll~~~e 446 (651) T protein:vir:99 382 RQFREKNEHEIAKVLEVPPVKIGVTDSANRSNSDQQDK--------------DFALEVIQPEQHTFAEWLYQI-IHQQAL 446 (651) T ss_pred HHHHHHHHHHHHHHhCCCHHHhccCCCCCcccHHHHHH--------------HHHHHHHHHHHHHHHHHHHHh-hcCccc Confidence 455666778898998543211 11222334443332 122233444444444433332 3332 Q ss_pred -CCCcceEEEEe-chHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCH---HHHHHH Q lcl|NC_011045. 417 -LPKEAVEPTIS-TGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTE---EQKQQK 491 (536) Q Consensus 417 -~~~~~v~v~~v-s~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~---~ev~~~ 491 (536) ..+..+.++|. +.|-+. +......+++.+-+.+ .+.+++ +-+.+|.+|- ..+ .-+... T Consensus 447 ~~~~~~i~~ef~~~~llr~----D~~~~~e~~~~~i~~G------~~T~NE----~R~~lglppi---~~~~gd~~l~~~ 509 (651) T protein:vir:99 447 GVTDWTIEYELRGADQPKQ----EAQLAEQRVRAMRLAG------VGLVDE----AREELGLDPL---GEPYGEMTLSEF 509 (651) T ss_pred cccCceEEEEeccchhhhc----cHHHHHHHHHHHHhCC------CcCHHH----HHHHhCCCCC---CCcccccccccc Confidence 11223445543 233322 2222222222221111 122222 2233455441 100 000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHH--hhhhcCCCCCCC Q lcl|NC_011045. 492 MAQQSMQMGMDNGAAALAQGMAAQATASPEAMA--AAADSVGLQPGI 536 (536) Q Consensus 492 ~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~--~~~~~~~~q~~~ 536 (536) +. .... ..+++.-.+....+.... 
...++..+|-++ T Consensus 510 --~~----~~~g---~~~~gge~~~~~~~~~~~~~~~~e~~~~~~~~ 547 (651) T protein:vir:99 510 --EA----EVAG---DVAGGGETEAVHEPPEENKIGEREWDTVKSEL 547 (651) T ss_pred --cc----cccc---ccccCCCCcccccCccccccccchhhhhhhhh Confidence 00 0000 000000000000000000 000011111111 No 207 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=74.26 E-value=0.16 Score=24.94 Aligned_cols=415 Identities=10% Similarity=0.032 Sum_probs=161.9 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcc-cccCCCCCcccc--------cccccc-----cchHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIP-SLFPKDSDNAST--------DYVTPW-----QAVGARGL 66 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P-~~~~~~~~~~~~--------~~~~~~-----dst~~~a~ 66 (536) |+.-.....-+-.+ ....+.+ .-++++...-. ...+..+-...+ ..-.+| |++-.-++ T Consensus 1 ~~~~i~~~~g~~~~-----~~~~~~~---~~~~ia~~~~~~~~~~~~~~~p~~~~il~~~~~~~~~y~~m~~D~~i~s~l 72 (491) T protein:vir:79 1 MSKGLWVSPTEFVK-----FGEPDKS---LSSQIATRARSIDFFALGMYLPNPDPVLKALGKDIRVYRELRADAHVGGCV 72 (491) T ss_pred CCCeeeCCCCCccc-----ccccchh---HHHHHhhhccccccccccccCcchhHHHhhccCCHHHHHHHhhChHHHHHH Confidence 66644333222111 1111111 11233311100 000111100000 001122 33444444 Q ss_pred HHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEE Q lcl|NC_011045. 67 NNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLY 146 (536) Q Consensus 67 ~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~ 146 (536) ++....+.+ .+|- +.+++. 
+......++ +.|.+.+|...+.+++ +.+.+|-++.- T Consensus 73 ~~Rk~av~~-----~~w~-i~~~~~-------~~~~a~~i~-----------e~l~~~~~~~~i~~~l-da~~~G~s~~E 127 (491) T protein:vir:79 73 RRRKAAVKA-----LEWG-LDRGKA-------KSRVAKSIA-----------DVFADLDLSRIATEML-DAVLYGYQPME 127 (491) T ss_pred HHHHHHHhC-----CCcE-EecCCC-------CHHHHHHHH-----------HHHhcCCHHHHHHHHH-HhhhhcceeEE Confidence 444333332 2554 333221 111112233 3344567877776664 56678887753 Q ss_pred EecCCCCcee---eEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCC Q lcl|NC_011045. 147 LPEPEGSNYN---PMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDS 223 (536) Q Consensus 147 ~~~~~~~~~~---~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~ 223 (536) +......+.+ .+..+|-..|. .|.+|++ . +..+++ T Consensus 128 i~w~~~~g~~~~~~l~~r~~~~f~--~d~~~~l-----------------------------------~-----l~~~~~ 165 (491) T protein:vir:79 128 ITWGKVGNYIVPIDVVGKPADWFV--YDPENQL-----------------------------------R-----FRSKEH 165 (491) T ss_pred EEEeecCCeeeEEeeeeeccccee--eccCCce-----------------------------------E-----EeecCC Confidence 3211111111 11111111111 1111111 0 000000 Q ss_pred CceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecc- Q lcl|NC_011045. 224 GEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNP- 302 (536) Q Consensus 224 ~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~- 302 (536) ..+|..+ ..+=|++.++...+|+.||.|....+..-..--+...+..+..+++---|..+..- T Consensus 166 -------~~~g~~l---------p~~k~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~ 229 (491) T protein:vir:79 166 -------WVQGEEL---------PARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHP 229 (491) T ss_pred -------CCCceee---------cCCCeEEEEecCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecC Confidence 1122222 12238999999999999999999999999999999999999999998888644442 Q ss_pred ccccchh--hh----cc--CCCcceecCCcccccccccc-cccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHH Q lcl|NC_011045. 
303 AGITQPR--RL----TK--AQTGDFVTGRPEDISFLQLE-KQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEE 373 (536) Q Consensus 303 ~g~~~~~--~~----~~--~~~g~~~~g~~~~~~~~~~~-~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtE 373 (536) .+..+.+ .+ .. +..+.++|. ..++.+++.. .++..+.-...++.+.+.|+++++...+...++......| T Consensus 230 ~~a~~~ek~~l~~al~~~~~~a~~viP~-~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~~gs~a~~~ 308 (491) T protein:vir:79 230 RSASDAETNLLLDRLEDMVQDAVAVIPD-DSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALLGQNQTTEATSTRASAQ 308 (491) T ss_pred CCCCHHHHHHHHHHHHHHhcCeEEEecC-CceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHhhhhhccCcccchhhHH Confidence 2322221 11 11 122334443 3455565543 3456666778899999999999986554433333334455 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALA 453 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~ 453 (536) |.....+. .+-.--..+...| .-+|..++.+.. + +.+ ..++.+..+-.... .....++.+..++ T Consensus 309 vh~~v~~~--i~~~D~~~i~~tl-n~li~~l~~~N~--~---~~~--~p~f~~~e~ee~~~------~~a~~~~~L~~~G 372 (491) T protein:vir:79 309 AGLEVTDD--IRDGDKAIVVEAM-NMLIRWICDLNF--D---GAA--RPVFDMWEQEQVDE------IQAGRDEKLTRAG 372 (491) T ss_pred HHHHHHHH--HHHHHHHHHHHHH-HHHHHHHHHhcC--C---CCC--cceEeecCcCchhH------HHHHHHHHHHhCC Confidence 54332221 1112222222222 224444433321 1 111 12333333221111 0112222333332 Q ss_pred chhhhhcCCHHHHHHHHHHHcCCChhhccC----------------------CHHHHHHHH---HHHHHHHHHHHHHHHH Q lcl|NC_011045. 454 PMRDDPDINLAMIKLRIANAIGIDTSGILL----------------------TEEQKQQKM---AQQSMQMGMDNGAAAL 508 (536) Q Consensus 454 p~~~~~~id~d~~~~~~a~~~Gv~p~~i~r----------------------s~~ev~~~~---~q~~~q~~~~~~a~~~ 508 (536) = .|+. .++.+.+|+++...-. ++++....- .+++.+........+. 
T Consensus 373 ~-----~i~~----~~~~e~~Gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~i 443 (491) T protein:vir:79 373 A-----RFTP----AYFKRAYNLQDGDLDERPLPVSAVDAVGAASFAEFEAPDQDALDAALNALSARDLNADAQALVAPL 443 (491) T ss_pred C-----ccCH----HHHHHHhCCCCCCCCccccCcCcccccccccccccCCCCCcchHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 1222 2344555553211100 001111110 0111111111100111 Q ss_pred HHHHHHhhhcCcchHHhhhhcCCCCCCC Q lcl|NC_011045. 509 AQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) Q Consensus 509 ~~~~~~~~~~~~~~~~~~~~~~~~q~~~ 536 (536) . ..+ +.+.+.+.... .-..+.|.+ T Consensus 444 ~-~~l-~~~~s~~e~~~--~L~~l~~~~ 467 (491) T protein:vir:79 444 L-KRI-ANGASADELLG--MLAELYPSL 467 (491) T ss_pred H-HHH-HhcCCHHHHHH--HHHHHhhcC Confidence 0 011 11111111111 111233444 No 208 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=71.31 E-value=0.2 Score=24.45 Aligned_cols=393 Identities=9% Similarity=0.006 Sum_probs=150.0 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc----cCCCCCccccc-ccc-cccchHHHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL----FPKDSDNASTD-YVT-PWQAVGARGLNNLASKLM 74 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~----~~~~~~~~~~~-~~~-~~dst~~~a~~~Laa~l~ 74 (536) |++.-..+..+ ... ..|+.....-......+-|.. +......+..- ..+ +=.++--.|++.+|+.+- T Consensus 1 ~~~~l~~~~~~----~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~~al~~~~V~~~i~~ia~~ia 73 (434) T protein:vir:43 1 MSKSLGKVLSS----ATS---APRSSLFGWGGKTIRLTDGAFWSQFLGRESSSGKKVTVDKAMKLSAVWACVRLISTSVA 73 (434) T ss_pred Cccchhhhhhh----ccc---ccchhhhcccccccccCchHHHHHHhcCCccCCceechhhhhccHHHHHHHHHHHHhhh Confidence 87744222111 111 112211110001111111110 11111000000 011 112333356666666654 Q ss_pred HhhcCCCcceeccCC-hhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hcc----ChHHHHHHHHHHHhhCcEEEEEe Q lcl|NC_011045. 
75 LALFPMQTWMRLTIS-EYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SNS----YRVTLFEALKQLVVAGNVLLYLP 148 (536) Q Consensus 75 ~~ltP~~~Wf~l~~~-d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~~~ 148 (536) + + ||.-..-. +.... . ..+.-+...|. +-| .+.=....+.++..+||+.+++. T Consensus 74 ~-l----p~~~~~~~~~g~~~----------~------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~ 132 (434) T protein:vir:43 74 G-L----PLGVYERKADGSRV----------D------ARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIR 132 (434) T ss_pred h-C----ceEEEEEcCCCccc----------c------ccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEE Confidence 3 2 33211111 00000 0 01112223332 223 33445666788899999999986 Q ss_pred cCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeE Q lcl|NC_011045. 149 EPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIR 228 (536) Q Consensus 149 ~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~ 228 (536) .+ .+.++.+..++.+.+.+..|.+|.+--.+ T Consensus 133 ~~-~G~~~~L~~l~p~~v~~~~~~~g~~~y~~------------------------------------------------ 163 (434) T protein:vir:43 133 RA-AGRPAALDFLLPSRVDLECDENGRLKYFY------------------------------------------------ 163 (434) T ss_pred eC-CCcEEEEEEEcCcceEEEEcCCCeEEEEE------------------------------------------------ Confidence 55 45565566666667766666665321000 Q ss_pred EEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccch Q lcl|NC_011045. 229 YEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQP 308 (536) Q Consensus 229 ~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~ 308 (536) ...+|.... 
+...-+++.|....+| .||.||...+...+.......+.......-...|..++.-++.+++ T Consensus 164 -~~~~g~~~~-------~~~~eVih~~~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~ 234 (434) T protein:vir:43 164 -TTKKGARRE-------IERTNMLHIPAFTLDG-RIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRILQP 234 (434) T ss_pred -EecCceEEE-------EccccEEEecCcCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCCCCH Confidence 000111000 0011123344433345 7999999999999988888888888888778888877766665555 Q ss_pred hhhccC----------CC-cceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-ccc-CCCCCCCHHHH Q lcl|NC_011045. 309 RRLTKA----------QT-GDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQ-RTGERVTAEEI 374 (536) Q Consensus 309 ~~~~~~----------~~-g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~-~~~~r~TAtEi 374 (536) +..... .+ |. +.--.++....++.. +.+.+. .+..+.....|-++|-.-. ++. .+....+..-+ T Consensus 235 e~~~~~r~~~~~~~g~~nag~-~~vl~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~ 312 (434) T protein:vir:43 235 AQREEFREYVKSVSGAMNSGR-SPVLEQGITPETIGINPVDAQL-LETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGL 312 (434) T ss_pred HHHHHHHHHHHHhcCccccCC-ccccCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchH Confidence 432110 00 11 111122333444432 334553 3445566778888885432 111 12112222222 Q ss_pred HHHHHH-HHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEE-echHH---HHHHHHHHHHHHH--H-- Q lcl|NC_011045. 375 RYVASE-LEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTI-STGLE---AIGRGQDLDKLER--C-- 445 (536) Q Consensus 375 ~~r~~E-~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~-vs~La---~a~r~~~~~~l~~--~-- 445 (536) .+.... ....|.|.+.++++++-. . 
++++-....+.++| ++.|- ...|......+.+ + T Consensus 313 e~~~~~f~~~~L~P~~~~ie~~ln~------------k-L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T 379 (434) T protein:vir:43 313 EQQMLAFLTFSISSITNQIQQCVNK------------R-LLTAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMT 379 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh------------h-cCChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCCCcC Confidence 222222 122355555555544332 2 33332212233444 22221 1222222222221 1 Q ss_pred HHHHH---hhcch-hhhhc-CCHHH-HHHHHHHHcCCChhhccCCHHHHHHHHHHHHHH Q lcl|NC_011045. 446 VAAWA---ALAPM-RDDPD-INLAM-IKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQ 498 (536) Q Consensus 446 ~~~~~---~~~p~-~~~~~-id~d~-~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q 498 (536) .+.+- .+.|. -.|.. ++..- -++.+.+..- +. -. +..+..+..+.+.+. T Consensus 380 ~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~--~~-~~-~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 380 RNEGRRKENLPELPGGDILTVQSNLVPIDQLGQSNK--SQ-AV-RAALMNWFSQPEPQE 434 (434) T ss_pred HHHHHHHhCCCCCCCCCeEeeccCccchhhhhccCC--Cc-ch-hhhhhccCCCCCCCC Confidence 11111 12231 11211 11100 1112222110 10 00 001111000000000 No 209 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=70.78 E-value=0.2 Score=24.36 Aligned_cols=403 Identities=11% Similarity=0.049 Sum_probs=147.1 Q ss_pred HHHHhcccc--cCCCC----Ccc---------cccccccccchHH----------HHHHHHHHHHHHhhcCCCcceeccC Q lcl|NC_011045. 34 CAQYTIPSL--FPKDS----DNA---------STDYVTPWQAVGA----------RGLNNLASKLMLALFPMQTWMRLTI 88 (536) Q Consensus 34 ~~~~~~P~~--~~~~~----~~~---------~~~~~~~~dst~~----------~a~~~Laa~l~~~ltP~~~Wf~l~~ 88 (536) +..|-+--+ ..... ..+ ......+++..+. .+|-++-+.-++.+ || ++.. 
T Consensus 1 ~~~~~~~i~s~~~~~~i~~~~~~s~~~~~~~~~~~~~pp~~~~~la~l~~~n~~v~scI~~ia~~IA~l----~~-~~~~ 75 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKREEVESQALGETRFEEYVEPKVNPLVLLSLLQVNPYHASACSIKANDIIRT----GY-ILEG 75 (542) T ss_pred CccccccccccccchhhhhccccccccccccCCccccCCCCHHHHHHHHhhcHHHHHHHHHHHHHHhhC----ce-eeec Confidence 111111100 00000 000 0001111221111 11111111111111 22 2211 Q ss_pred ChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEE Q lcl|NC_011045. 89 SEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVV 168 (536) Q Consensus 89 ~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v 168 (536) .+. ..+...+.. ..-+++.-+...+.++.++|||.+++..+..+.+..+..++....-+ T Consensus 76 ~~~------------~~l~~~lpN---------~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v 134 (542) T protein:vir:41 76 DDE------------GVVDEFIRA---------CKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRV 134 (542) T ss_pred ccc------------hhhhhhcCC---------CCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEE Confidence 110 001111100 01345666778888999999999999877766776666666555555 Q ss_pred eeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccccccccccccc Q lcl|NC_011045. 169 QRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEA 248 (536) Q Consensus 169 ~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~ 248 (536) ..|..+.+.... ....+......+. +. +....|.. ...+ .. T Consensus 135 ~~d~~~~~~~~~--------------------------~~~~~~~~~y~~~------~~-~~~~~g~~----~~~~--~~ 175 (542) T protein:vir:41 135 HKDGSRYRQTWD--------------------------GVNITHFKDYRYE------GE-INPETGED----QDSV--GA 175 (542) T ss_pred EEcCCeeEeeec--------------------------CCcceeEEeeccc------cc-cccccccc----cccc--Cc Confidence 554332111000 0000000000000 00 00001110 0011 11 Q ss_pred CceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccc----------hh-------hh Q lcl|NC_011045. 
249 CPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQ----------PR-------RL 311 (536) Q Consensus 249 ~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~----------~~-------~~ 311 (536) .-.+++|+....+..||.||..-++..+.......+.......-...|.+++.-.|... ++ .+ T Consensus 176 ~eIiHir~~~~~~~~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~ 255 (542) T protein:vir:41 176 NELVFIHIPSPVCSYYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALI 255 (542) T ss_pred ccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHH Confidence 23577787776777999999999999888888777777777776777876654222211 10 01 Q ss_pred c---cC--CC-c-c-eec---CCcccccccccccc-cchhHHHHHHHHHHHHHHHHHhhhhccc--CCC---CCCCHHHH Q lcl|NC_011045. 312 T---KA--QT-G-D-FVT---GRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNSAVQ--RTG---ERVTAEEI 374 (536) Q Consensus 312 ~---~~--~~-g-~-~~~---g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~~~~--~~~---~r~TAtEi 374 (536) . .+ ++ | . ++. +..+++...++... .+.+ ..+..+..++.|-.+|-...... .+. .+-++++. T Consensus 256 ~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~q-fle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~ 334 (542) T protein:vir:41 256 EDNFKHLKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELS-FREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVT 334 (542) T ss_pred HHHHhhhhcccCceeEeeccCCcccceeEEEcCCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHH Confidence 0 01 01 1 1 221 12234455555432 3444 34455666788888885432111 111 11234433 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEec-hHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011045. 375 RYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTIST-GLEAIGRGQDLDKLERCVAAWAALA 453 (536) Q Consensus 375 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs-~La~a~r~~~~~~l~~~~~~~~~~~ 453 (536) . ..+...-+.|++.++-..+.+ .++++.. ..+.++|.. .+.+..+.. .+. .+.+. 
T Consensus 335 ~--------------~~f~~~tL~P~~~~ie~~ln~-~L~~~~~-~~~~~~f~~~~ll~~d~~~---~~~----~~v~~- 390 (542) T protein:vir:41 335 R--------------RTYYESVVRPQQNIISSILTD-FFQVKFN-PKTRFKFNDETLLESDSVR---NCA----LLVQS- 390 (542) T ss_pred H--------------HHHHHHHHHHHHHHHHHHHHh-hcccccC-CceEEEecchhhcchHHHH---HHH----HHHhC- Confidence 2 223444455655555444443 2344332 245666542 222222221 111 11111 Q ss_pred chhhhhcCCHHHHHHHHHHHcCCChhh--cc-CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hhhcCcch--H Q lcl|NC_011045. 454 PMRDDPDINLAMIKLRIANAIGIDTSG--IL-LTEEQKQQKMAQQSMQMGMDNGAAALAQGMAA-----QATASPEA--M 523 (536) Q Consensus 454 p~~~~~~id~d~~~~~~a~~~Gv~p~~--i~-rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~-----~~~~~~~~--~ 523 (536) ..+.++++-.. ..|++|.. ++ ...-..+++..+... ....+..+.....++ +....... . T Consensus 391 -----GilT~NE~Re~---L~g~~pgdd~~l~p~~~~~~~~~~~~~n--~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~~ 460 (542) T protein:vir:41 391 -----GVLTPAEARER---LFGLDGGPDIFMVPSKGAAKSVKRQERN--YEKNQIREIRKIYAKYRPRFNEIISSKLSAE 460 (542) T ss_pred -----CCCCHHHHHHh---hCCCCCCCccccccccccccccccCCcC--CCCCchhhhhhcccccCccccccccccccch Confidence 12344444211 23554422 11 000000001000000 000000000000000 00000000 0 Q ss_pred Hhhhh---------cCCCCCCC Q lcl|NC_011045. 524 AAAAD---------SVGLQPGI 536 (536) Q Consensus 524 ~~~~~---------~~~~q~~~ 536 (536) +...+ +...|-|- T Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~ 482 (542) T protein:vir:41 461 EKKKKIDESLAEFRAEAYEAGK 482 (542) T ss_pred hhcccccchhhhhHHhHHhcCc Confidence 00000 11111111 No 210 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=70.62 E-value=0.21 Score=24.34 Aligned_cols=386 Identities=10% Similarity=0.048 Sum_probs=144.7 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCcceeccCChhhh Q lcl|NC_011045. 
14 KSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEYEA 93 (536) Q Consensus 14 ~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~~~ 93 (536) -+-|+..+..+.++.+.|..+. ...+ ........ -+-.++--.|++.+|+.+.+ + | |-.... +... T Consensus 1 m~~f~~~~~~~~~~~~~~~~~~-~~~~----~~~~~~~~---Al~~~~V~~~i~~Ia~~iA~-l-p---~~~~~~-~g~~ 66 (406) T protein:vir:97 1 MSFFQPLGTSKVSYDDYISSVL-AGDV----SQKYLGVS---ALKNSDILTATSIIAGDIAR-F-P---LVKKDV-NGDI 66 (406) T ss_pred CccccccCCCCCCcchHHHHHh-cCCC----Ccccccch---hhccHHHHHHHHHHHHhhhh-C-e---eEEEec-Cccc Confidence 1223332333334444444331 0000 00000000 01123333466777776643 2 3 322111 1100 Q ss_pred hhhccChhHHHHHHHHHHHHHHHHHHHHH-h----ccChHHHHHHHHHHHhhCcEEEEEecCC-CCceeeEEEEecceEE Q lcl|NC_011045. 94 KQLLSDPDGLAKVDEGLSMVERIIMNYIE-S----NSYRVTLFEALKQLVVAGNVLLYLPEPE-GSNYNPMKLYRLSSYV 167 (536) Q Consensus 94 ~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~----snf~~~~~~~~~dl~~~G~~~l~~~~~~-~~~~~~~~~~~l~~~~ 167 (536) ++ +.-+...|. + -+.+.-+...+.+|...|||.+|+..+. .+.+..+..++.+.+- T Consensus 67 ------------~~------~~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~ 128 (406) T protein:vir:97 67 ------------IH------DEDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETT 128 (406) T ss_pred ------------cc------cchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeE Confidence 00 011222232 1 2456667778889999999999997653 3444444445556665 Q ss_pred EeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccc Q lcl|NC_011045. 168 VQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKE 247 (536) Q Consensus 168 v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~ 247 (536) +..+..|++- | ++. 
.+ .++....+ .- + T Consensus 129 v~~~~~~~~~--y-~~~---------------------------------~~--~~~~~~~~---~~------------~ 155 (406) T protein:vir:97 129 VEETDNHEIV--Y-TFT---------------------------------DM--LTAKQVKC---FA------------H 155 (406) T ss_pred EEEcCCceEE--E-EEE---------------------------------ec--CCceEEEE---cc------------c Confidence 5555544321 1 010 00 01111000 00 0 Q ss_pred cCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccC----------CC- Q lcl|NC_011045. 248 ACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKA----------QT- 316 (536) Q Consensus 248 ~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~----------~~- 316 (536) + ++++|....+| .||.||...+...+.....+.+.......-...|-++..+++.++.+..... .+ T Consensus 156 e--vih~r~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~n~ 232 (406) T protein:vir:97 156 D--VIHWKFFSHDT-ILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMKGAQLSGDARQRARQEFEKMREGSVG 232 (406) T ss_pred c--EEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEecCCCCCHHHHHHHHHHHHHHhccccc Confidence 1 23334333344 6799999988888888888888777777777777777666665555443211 11 Q ss_pred cceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHH Q lcl|NC_011045. 317 GDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQ 394 (536) Q Consensus 317 g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~ 394 (536) |.+. .-.++....++.. +.+.+.. +..+.....|-++|-.-. +....+..-+.+| ...+ +.. 
T Consensus 233 g~~~-vl~~g~~~~~l~~~~~d~q~l-e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~e~---~~~~-----------f~~ 296 (406) T protein:vir:97 233 GSPL-VFDSTMEYTPLEIDTNVLQLI-TSNNFSTAQIAKALRVPSYKLGVNSPNQSVAQ---LMED-----------YVT 296 (406) T ss_pred Ccee-ecCCCceEEEccCCHHHHHHH-HHHHhhHHHHHHHhCCCHHHcCCCCCcchHHH---HHHH-----------HHH Confidence 1111 1112222333332 2333432 333444667777774322 1111111111111 1111 122 Q ss_pred HHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEe-chHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHH Q lcl|NC_011045. 395 ELQLPLVRVLLKQLQATQQIPELPKEAVEPTIS-TGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANA 473 (536) Q Consensus 395 E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~v-s~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~ 473 (536) .-+.|++.++-..+.+. ++++-....+.++|- +.+... +++.+...++ . ..+.++++-.. T Consensus 297 ~~l~P~~~~ie~~l~~k-ll~~~~~~~~~i~fd~~~~~~~----~~~~~~~~~~----~------g~~T~NE~R~~---- 357 (406) T protein:vir:97 297 NDLPFYFDAITSELGLK-TLNDKDRRLYHIEFDTRSVTGR----NVDEIVKLVN----N------QILTPNQGLVE---- 357 (406) T ss_pred HHHHHHHHHHHHHHhhh-hcChhhccceeEEEecCccchh----hHHHHHHHHh----C------CCcCHHHHHHH---- Confidence 33455555444443332 333322223445552 222111 1111111111 0 11344444332 Q ss_pred cCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhcCCCCC Q lcl|NC_011045. 474 IGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQP 534 (536) Q Consensus 474 ~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~q~ 534 (536) +|.+|..- ..-|+.- + . .+.+.-....-.+....+...++..+...+ |. 
T Consensus 358 ~g~~p~~~-~~gD~~~-~----~-----~n~~~~~~~~~~~~~~~~~~~gg~~~~~~~-~~ 406 (406) T protein:vir:97 358 LGKQKSTD-PNMDRYQ-S----S-----LNYVFLDKKEEYQDKVGIKGKGGEVNAEED-KS 406 (406) T ss_pred hCCCCCCC-CCCCeEe-e----c-----cCccchhcccccccccccccCCCCCCCCCC-CC Confidence 34433100 0000000 0 0 000000000000000000111111111111 22 No 211 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=69.11 E-value=0.23 Score=24.11 Aligned_cols=195 Identities=10% Similarity=0.060 Sum_probs=81.6 Q ss_pred EEecCCCCceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_011045. 217 IYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKV 296 (536) Q Consensus 217 v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p 296 (536) +...++ +.+.+..........+..-.++-++ .+++|.....+..||.+|..-++..+..-+...+-....-.-...| T Consensus 1 ~r~~~d-g~~~y~~~~~~~~~~g~~~~~~~~e--ilH~r~~~~~~~~~Glspi~~a~~~i~~~~aa~~~~~~~f~Ng~~p 77 (219) T protein:vir:98 1 MRVCKD-GNYKYLMKKSLYDTKSEIYEYNKND--VIFIKLYDPMQQVYGSPDYVGGITSALLNSDATIFRRRYYSNGAHM 77 (219) T ss_pred Cceeec-CeEEEEEecceecCCceeEEecccc--EEEecCCCCCCCcceecHHHHHHHHHHHHHHHHHHHHHHHhcCCCC Confidence 332333 3332211111110101111111122 4556654444568999999988888876666655555555556777 Q ss_pred ceee-ccccccchhhhc---------cCC-C-cc-eec--CC-ccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011045. 297 IGLV-NPAGITQPRRLT---------KAQ-T-GD-FVT--GR-PEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLN 359 (536) Q Consensus 297 ~~lv-~~~g~~~~~~~~---------~~~-~-g~-~~~--g~-~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~ 359 (536) -.++ .+++.++++... .++ + +. ++. |. .+++...++.. ..+.|. .+.-+..+..|-++|-.. 
T Consensus 78 ~gil~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qf-le~rk~~~~eIa~~fgVP 156 (219) T protein:vir:98 78 GFILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEF-ANIKNISAQDVLTSHRFP 156 (219) T ss_pred ceEEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCHHHHHH-HHHHHhhHHHHHHHhCCC Confidence 7654 355544443221 011 1 11 221 11 22344444432 334553 334444567788888433 Q ss_pred h--cccCCCCC---CCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechH--HH Q lcl|NC_011045. 360 S--AVQRTGER---VTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGL--EA 432 (536) Q Consensus 360 ~--~~~~~~~r---~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~L--a~ 432 (536) . +...+..+ -++++... .=....|.|.+.+++++| .+.=++| . .++++|..+. .. T Consensus 157 p~~lG~~~~~~~~~sn~eq~~~--~f~~~tL~P~~~~ie~~l------------n~~~~~~---~-~~~~~F~~~~~~d~ 218 (219) T protein:vir:98 157 PGLSGIIPVNTAGLGDPLKIRE--AYQADEVLPLQEIIAESI------------NSDYEIK---S-ALKVNFKQPEKRDK 218 (219) T ss_pred HHHcccccCCCCCccCHHHHHH--HHHHHHHHHHHHHHHHHh------------hhhhcCC---C-ccEEeecCcccccC Confidence 2 11112222 23443222 223344555555555544 2221222 1 1233333221 11 Q ss_pred H Q lcl|NC_011045. 433 I 433 (536) Q Consensus 433 a 433 (536) . T Consensus 219 ~ 219 (219) T protein:vir:98 219 N 219 (219) T ss_pred C Confidence 1 No 212 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=63.91 E-value=0.31 Score=23.38 Aligned_cols=356 Identities=10% Similarity=0.013 Sum_probs=145.3 Q ss_pred HHHHHHHHHHhhhH-HHHHHHHHHHhcccccCCCCCc-ccccccccccchHHHHHHHHHHHHHHhhcCCCcceeccCChh Q lcl|NC_011045. 14 KSVYERLKNDRAPY-ETRAQNCAQYTIPSLFPKDSDN-ASTDYVTPWQAVGARGLNNLASKLMLALFPMQTWMRLTISEY 91 (536) Q Consensus 14 ~~r~~~l~~~R~~~-e~~w~e~~~~~~P~~~~~~~~~-~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~~d~ 91 (536) -..|+.+...|+.. 
...+.+..++.-..+....+.. +... -+-.++--.|++.+|+.+.+ + |- ..++...... T Consensus 1 Mg~f~~lf~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~--al~~~~v~~~i~~Ia~~ia~-~-p~-~~~~~~~~~~ 75 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPAELADAIGLSYDTYTGKQISSQR--AMRLTAVFSCVRVLAESVGM-L-PC-NLYHLNGSLK 75 (414) T ss_pred CchhhhhhccCccCcccchhhHhHhhccCccccCCceechhh--hhccHHHHHHHHHHHHHhcc-C-ce-EEEEecCCce Confidence 24444444433221 2223333333221111111110 0000 01133444566666665532 1 20 0122211100 Q ss_pred hhhhhccChhHHHHHHHHHHHHHHHHHHHHH-h----ccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceE Q lcl|NC_011045. 92 EAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-S----NSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSY 166 (536) Q Consensus 92 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~----snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~ 166 (536) . .. . ..-+...|. + -+.+.-+..+..++..+|||.+|+..+ .+.+..+..++.+.+ T Consensus 76 ~--~~---------~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v 136 (414) T protein:vir:44 76 Q--RA---------T-------GERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCV 136 (414) T ss_pred e--ec---------c-------cchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceE Confidence 0 00 0 011112222 1 244555677788888999999988655 455555555555666 Q ss_pred EEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccccccccccc Q lcl|NC_011045. 167 VVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPK 246 (536) Q Consensus 167 ~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~ 246 (536) .+..+..|++ +|....+ ++ ... .+. 
T Consensus 137 ~~~~~~~~~~------------------------------------~y~~~~~--~g-~~~---~~~------------- 161 (414) T protein:vir:44 137 VPKLNSSWEP------------------------------------VYQVTFP--DG-STD---VLS------------- 161 (414) T ss_pred EEEECCCCcE------------------------------------EEEEEec--Cc-eEE---EEc------------- Confidence 5555554432 1111111 01 000 011 Q ss_pred ccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhccC---------C-- Q lcl|NC_011045. 247 EACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKA---------Q-- 315 (536) Q Consensus 247 ~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~~---------~-- 315 (536) ..-++++|....++ .||.||..-+...+.....+.+.......-...|.+++.-++.++++..... + T Consensus 162 -~~evih~~~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~ 239 (414) T protein:vir:44 162 -QEDIWHVRTLTLDG-LVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKDFEERHTGLG 239 (414) T ss_pred -cccEEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCcc Confidence 11134444332333 7999999999999999998888888888888889887776666655532211 0 Q ss_pred -CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccC---CCCCCCHHHHHHHHHHHHHHhhhhHH Q lcl|NC_011045. 316 -TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQR---TGERVTAEEIRYVASELEDTLGGVYS 390 (536) Q Consensus 316 -~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~---~~~r~TAtEi~~r~~E~~~~LG~v~~ 390 (536) .|.+.. -+++....++.. +.+.+. .+..+..+..|-++|-....... .+..-++++.. . T Consensus 240 n~~~~~v-l~~g~~~~~l~~~~~d~~~-~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~---~----------- 303 (414) T protein:vir:44 240 NAHRPMI-LEMGLDWKSMALNAEDSQF-LETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELG---L----------- 303 (414) T ss_pred ccCccee-cCCCceEEEccCChHHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH---H----------- Confidence 011111 112223333332 234443 34455556778888854321111 12222333322 1 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEe-chHHH---HHHHHHHHHHH-----------HHHHH------- Q lcl|NC_011045. 
391 ILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTIS-TGLEA---IGRGQDLDKLE-----------RCVAA------- 448 (536) Q Consensus 391 rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~v-s~La~---a~r~~~~~~l~-----------~~~~~------- 448 (536) .+...-+.|++.++-..+.+ .++++.....+.++|. +.|-. ..|..-.+++. ..++. T Consensus 304 ~~~~~~l~P~~~~ie~~ln~-~L~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD 382 (414) T protein:vir:44 304 GFINYSLVPYLTRIEQRINT-GLVRKSKQGVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGD 382 (414) T ss_pred HHHHHHHHHHHHHHHHHHHh-hcCCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc Confidence 13333445555544433322 2333333223333331 12211 11111111111 11110 Q ss_pred -------HHhhc------chhhhhcCCHHHHHH Q lcl|NC_011045. 449 -------WAALA------PMRDDPDINLAMIKL 468 (536) Q Consensus 449 -------~~~~~------p~~~~~~id~d~~~~ 468 (536) ..... ...-+ .-|.|+-.. T Consensus 383 ~~~~~~n~~~~~~~~~~~~~~~~-~~~~d~~~~ 414 (414) T protein:vir:44 383 VYLTPMNMTTKPSDGSKAGKQKD-NANADETTS 414 (414) T ss_pred eecccccccccCCccccCCCCCC-CCCCCCCCC Confidence 00000 00000 122222222 No 213 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=63.81 E-value=0.31 Score=23.37 Aligned_cols=317 Identities=10% Similarity=0.071 Sum_probs=130.4 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhc-C Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALF-P 79 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt-P 79 (536) |-+.+..-...... .++- .+-.|+.| +.++. . +- ..++..+ . 
T Consensus 1 ~~~~~~~~~~~~~~---------~~~~-----~~~~f~~~------------------~~~~~---~-~~-~y~~~~~~~ 43 (345) T protein:vir:37 1 MKTNVKTDNKKGIV---------IAPI-----NDRTFSLN------------------EISAS---P-AL-DYVGIGFDE 43 (345) T ss_pred CCCCccccchhhcc---------cCcc-----eeEEeecC------------------Ccccc---c-ch-hhhhhhhcC Confidence 32222111000000 0000 00001111 11111 1 11 1222222 3 Q ss_pred CCcceeccCChhhhhhhccC-hhHHHHHHHHHHHHHHHHHHHHHhcc---ChHHHHHHHHHHHhhCcEEEEEecCCCCce Q lcl|NC_011045. 80 MQTWMRLTISEYEAKQLLSD-PDGLAKVDEGLSMVERIIMNYIESNS---YRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~~-~~~~~~v~~~L~~ve~~~~~~l~~sn---f~~~~~~~~~dl~~~G~~~l~~~~~~~~~~ 155 (536) ...|+.--++-..|.++... ....+.+. +.+-+.....+-| -+..+.++..|+.+||||.+++..+..+.+ T Consensus 44 ~~~~~epp~~~~~la~l~~~~~~h~~~i~-----~k~n~l~~~~~Pn~~lt~~~f~~~~~d~ll~Gnay~~~~rn~~G~~ 118 (345) T protein:vir:37 44 NYNCYLPPVNRHALAKLPHQNAQHGGILH-----SRANMVSSLYEGGKALSRMDMRALCLNLIQFGDVGLLKVRNGFGQV 118 (345) T ss_pred CccccCCCCCHHHHHHHhhccccccccee-----eechHHHhhccCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCcE Confidence 44666654444444433211 00001110 0000111111112 134456677899999999999887776666 Q ss_pred eeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCc Q lcl|NC_011045. 156 NPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGM 235 (536) Q Consensus 156 ~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~ 235 (536) +.+ +|+...++.+..+|...-.++.... ...|. T Consensus 119 ~~L--~pl~~~~vr~~~d~~~~~~~~~~~~---------------------------------------------~~~g~ 151 (345) T protein:vir:37 119 VRL--VPLSSLYLRVRKDGGYSYLMKKSLY---------------------------------------------DTAQE 151 (345) T ss_pred EEE--EEEcCceeEEEEeCCeeEEEEEeEe---------------------------------------------cCCce Confidence 544 4443334433222211111100000 00111 Q ss_pred cccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec-cccccchhhhc-- Q lcl|NC_011045. 
236 EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQPRRLT-- 312 (536) Q Consensus 236 ~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~-~~g~~~~~~~~-- 312 (536) .. . |..--++..|.....+..||.+|..-++..+..-+..++-....-.-...|.+++. ++...+.++.. T Consensus 152 ~~-~------~~~~dVihir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~d~~l~~e~~~~l 224 (345) T protein:vir:37 152 IY-R------YDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEEI 224 (345) T ss_pred EE-E------EccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEecCCCCCHHHHHHH Confidence 00 0 00111455554444567899999988888777666666555555555666776543 44444433321 Q ss_pred -----c-CCCc---c-ee--c-CCcccccccccccc-cchhHHHHHHHHHHHHHHHHHhhhh----cccCC-CCCCCHHH Q lcl|NC_011045. 313 -----K-AQTG---D-FV--T-GRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS----AVQRT-GERVTAEE 373 (536) Q Consensus 313 -----~-~~~g---~-~~--~-g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~----~~~~~-~~r~TAtE 373 (536) . .+.| . ++ + |..+++...|+... .+.+ ..+..+..++.|-.+|-.-. ....+ +..-++++ T Consensus 225 k~~~~~~~g~~n~~~~~i~~p~g~~~G~~~~pls~~~~d~q-f~e~k~~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~ 303 (345) T protein:vir:37 225 ARKISESKGVGNFRSMFVNIANGHPDGLKVIPIGDTGTKDE-FANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLK 303 (345) T ss_pred HHHHHHhcCcccccceEEEcCCCcccceEEEEccCChhHHH-HHHHHHHhHHHHHHHhCCCHHHhCccCCCCCCcccHHH Confidence 1 1111 1 11 2 22455566665543 3444 33444455777878874321 11111 11223443 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEec-hHHH Q lcl|NC_011045. 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTIST-GLEA 432 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs-~La~ 432 (536) ... .+...-+.|++.++...+.+ +|++++. ..++|.. 
-|++ T Consensus 304 ~~~--------------~f~~~~l~P~~~~ie~~ln~---~~~~~~~-~~i~F~~~~L~~ 345 (345) T protein:vir:37 304 YRE--------------VYHYDEVMPLQEIIAETINQ---DPEIKNL-LKIKFREQNFAK 345 (345) T ss_pred HHH--------------HHHHHHHHHHHHHHHHHhhh---hccCCCc-ceEEecchhhcC Confidence 322 12233356777766666654 3445443 2444432 2333 No 214 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=58.05 E-value=0.42 Score=22.64 Aligned_cols=320 Identities=8% Similarity=0.026 Sum_probs=125.3 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhc-C Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALF-P 79 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt-P 79 (536) |.+.-........ .++ ...|.+.++ +.....-..+ ...++... . T Consensus 1 ~~~~~~~~~~~~~----------~~~-------------~~~~~~~~~-----------p~~~~~~~~~-~~~~~~~~~~ 45 (348) T protein:vir:26 1 MTEQLIHSHTTDG----------TES-------------KSVYSFDPN-----------PEPVDTNSWM-TRYCELFYND 45 (348) T ss_pred CCccccchhhccc----------cCC-------------ceEEEecCC-----------CeeecCcchH-HHHHHHHhcC Confidence 3331110000000 000 001111100 1111100111 11111112 2 Q ss_pred CCcceeccCChhhhhhhccCh-hHHHHHHHHHHHHHHHHHHHHHhcc---ChHHHHHHHHHHHhhCcEEEEEecCCCCce Q lcl|NC_011045. 80 MQTWMRLTISEYEAKQLLSDP-DGLAKVDEGLSMVERIIMNYIESNS---YRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~~~-~~~~~v~~~L~~ve~~~~~~l~~sn---f~~~~~~~~~dl~~~G~~~l~~~~~~~~~~ 155 (536) ...|+.--++-..|.++.... -....+. 
-....+...+ .-| -+..+.++..|+.+||||.+++..+..+.+ T Consensus 46 ~~~~~epp~~~~~La~l~~~n~~h~~~i~----~k~N~l~~~~-~Pn~~~t~~~f~~~~~d~ll~Gnay~~~~rn~~G~~ 120 (348) T protein:vir:26 46 FDDYWEPPISLKGLAEIANANGYHGSLLK----ARANYVAGRF-MNGGGLPMYKMNSACWDYFGLGMSAFVKIRSYLKNV 120 (348) T ss_pred CCccccCCCCHHHHHHHHhhhhhhhhhHh----hhhhHHhhcc-cCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCcE Confidence 334544333333444332100 0001111 0001111111 111 135567788899999999999877766666 Q ss_pred eeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCc Q lcl|NC_011045. 156 NPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGM 235 (536) Q Consensus 156 ~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~ 235 (536) +.++.+|. .++.+..+|+ | | +...+|. T Consensus 121 ~~L~~l~~--~~v~~~~d~~----~---------------------------------~--------------~~~~~g~ 147 (348) T protein:vir:26 121 IALEPLPM--VHMRKRKNGD----F---------------------------------V--------------QLLRNNE 147 (348) T ss_pred EEEEEecC--ceeEeeecCc----E---------------------------------E--------------EEEecCe Confidence 55544443 3333332221 0 0 0001111 Q ss_pred cccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec-cccccchhhhc-- Q lcl|NC_011045. 236 EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQPRRLT-- 312 (536) Q Consensus 236 ~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~-~~g~~~~~~~~-- 312 (536) .+. |..--.+.+|.....+..||.+|..-++..+-.-+..+.-....-.-..+|-+++. ++...+.++.. 
T Consensus 148 ~~~-------f~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~~~~ls~e~~~~l 220 (348) T protein:vir:26 148 QKV-------FKAKDVIFIPQYDPQQQIYGLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFYATDPNLSEADEKAL 220 (348) T ss_pred EEE-------EcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHH Confidence 000 00111344454333467899999988887776666666555555555667776653 44434433221 Q ss_pred -----cC-CCc----cee--c-CCcccccccccccc-cchhHHHHHHHHHHHHHHHHHhhhh-c---ccCC-CCCCCHHH Q lcl|NC_011045. 313 -----KA-QTG----DFV--T-GRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS-A---VQRT-GERVTAEE 373 (536) Q Consensus 313 -----~~-~~g----~~~--~-g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~-~---~~~~-~~r~TAtE 373 (536) .. +.| .++ + |..+++...|+... .+.+ ..+..+-.++.|-.+|-.-. + ...+ +..-++++ T Consensus 221 k~~~~~~~G~~n~~~~~vl~~~g~~~Gi~~~pis~~~~d~q-f~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~ 299 (348) T protein:vir:26 221 KEKIASSKGIGNFRSMFVNIPNGKEKGIQLIPVGDIATKDE-FERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLK 299 (348) T ss_pred HHHHHHhcCcccccceeEEcCCCCccceeEEEccCChhHHH-HHHHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHH Confidence 10 111 112 2 22345566665533 3344 34445555777888884321 1 1111 11223443 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEE-echHHHHHHHHHHHHH Q lcl|NC_011045. 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTI-STGLEAIGRGQDLDKL 442 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~-vs~La~a~r~~~~~~l 442 (536) .... +...-+.|++.++...+.+. +.+++ .++++| .++... 
..+..++ T Consensus 300 ~~~~--------------f~~~~l~P~~~~ie~~ln~~---l~~~~-~~~~~fdl~~~~e---~~~~~a~ 348 (348) T protein:vir:26 300 VSQV--------------YDFYEVIPVCKRFMDAVNND---PEIPD-NLKLKFNLNPGVE---SANGSAV 348 (348) T ss_pred HHHH--------------HHHHHHHHHHHHHHHHHhhh---hCCCC-ccEEEEecCcccc---cchhhcC Confidence 3322 22222556655555544332 11222 233443 232111 1111111 No 215 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=57.99 E-value=0.42 Score=22.64 Aligned_cols=361 Identities=9% Similarity=0.014 Sum_probs=127.8 Q ss_pred HHHHh-hhHHHHHHHHHHHhcccccCCCCCccccccccccc----------chHHHHHHHHHHHHHHhhcCCCcceeccC Q lcl|NC_011045. 20 LKNDR-APYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQ----------AVGARGLNNLASKLMLALFPMQTWMRLTI 88 (536) Q Consensus 20 l~~~R-~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~d----------st~~~a~~~Laa~l~~~ltP~~~Wf~l~~ 88 (536) ++.++ -.|...+- +-++.-++.....+ -.....+.|. ++--.|++.+|+.+.+ .| | ++-- T Consensus 1 ~~~~~~~~~~k~~~-~~~~~~~~~~~~~~--~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~--lp---~-~~~~ 71 (409) T protein:vir:94 1 MAKENIVTRIKKKL-IDNWIDQSASKLYD--FSPWKNKSFWGVINNTLETNETIFSAITKLSNSMAS--LP---L-KMYE 71 (409) T ss_pred CcccccchhhhhHH-hhhhhcCCcccccc--cccccCccccccchhhhhccHHHHHHHHHHHHhhhh--Cc---e-eEee Confidence 33322 12222110 11222111111000 0000111222 2222344444444422 23 2 2210 Q ss_pred ChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----cChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEec Q lcl|NC_011045. 89 SEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SN----SYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRL 163 (536) Q Consensus 89 ~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l 163 (536) .... .+ ..+...|. +- +.+.=....+.++..+||+.+|+..+..+.+..+..+|. 
T Consensus 72 ~~~~-------------~~-------~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~ 131 (409) T protein:vir:94 72 DYKV-------------VN-------TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNP 131 (409) T ss_pred cccc-------------cc-------hhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcC Confidence 1000 00 01111222 22 233445666778888999999987666655555555555 Q ss_pred ceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccc Q lcl|NC_011045. 164 SSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGT 243 (536) Q Consensus 164 ~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~ 243 (536) +.+-+..+.+|... +.++. ...|..+. T Consensus 132 ~~v~v~~~~~~~~~--~y~~~----------------------------------------------~~~g~~~~----- 158 (409) T protein:vir:94 132 DVVEMLIENQSREL--YYSIH----------------------------------------------AATGNKLI----- 158 (409) T ss_pred ceeEEEEeCCCcEE--EEEEE----------------------------------------------cCCceEEE----- Confidence 55555554433211 10000 00111000 Q ss_pred cccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhcc---------C Q lcl|NC_011045. 244 YPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTK---------A 314 (536) Q Consensus 244 ~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~~---------~ 314 (536) ++.++ ++++|-....+..||.||..-+...+...+.+.+..+... ...+.+++..++.++++.... . T Consensus 159 ~~~~d--vih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~--~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~ 234 (409) T protein:vir:94 159 VHNMD--MLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEM--QKPDSFMLKYGSNVGKEKRQQVLEDFKQYYE 234 (409) T ss_pred Ecccc--EEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHhc--CCCCeeEEecCCCCCHHHHHHHHHHHHHHhh Confidence 00000 2333322234568999999888777777666655533322 223334544444444433220 1 Q ss_pred CCcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHH-HHHhhhhHHHH Q lcl|NC_011045. 
315 QTGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASEL-EDTLGGVYSIL 392 (536) Q Consensus 315 ~~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~-~~~LG~v~~rl 392 (536) ..|.+. --.++....++.. +.+.+.. +..+.....|-++|-.-....-....-|-.-+.+..... ...|.|.+.++ T Consensus 235 ~~g~~~-vl~~g~~~~~l~~~~~d~q~~-e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~i 312 (409) T protein:vir:94 235 ENGGIL-FQEPGVEIEPLPKKYVSEDIV-ASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQY 312 (409) T ss_pred cCCCee-ecCCCceEEEcCCChhHHHHH-HHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHH Confidence 122221 1122233444442 3344432 344445567778875432111122222333333333333 33566777777 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCCCCC-cceEEEE-echHH---HHHHHHHHHHHHH--H--HHHH-H--hhcch-hhh- Q lcl|NC_011045. 393 SQELQLPLVRVLLKQLQATQQIPELPK-EAVEPTI-STGLE---AIGRGQDLDKLER--C--VAAW-A--ALAPM-RDD- 458 (536) Q Consensus 393 ~~E~l~Pli~r~~~il~~~g~lp~~~~-~~v~v~~-vs~La---~a~r~~~~~~l~~--~--~~~~-~--~~~p~-~~~- 458 (536) ++|+-.-|+ |+... ....++| ++.|- -..|....+.+.+ + .+.+ . .+.|. ..| T Consensus 313 e~~ln~~Ll-------------~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~ 379 (409) T protein:vir:94 313 EEEFNRKLL-------------TKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDK 379 (409) T ss_pred HHHHHHhhC-------------CcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe Confidence 666544332 21110 1122222 22221 1111111121111 0 0000 0 01110 000 Q ss_pred -----h--cCCHHHHHHHHHHHcCCChhhccCC Q lcl|NC_011045. 459 -----P--DINLAMIKLRIANAIGIDTSGILLT 484 (536) Q Consensus 459 -----~--~id~d~~~~~~a~~~Gv~p~~i~rs 484 (536) . 
.+|.....+.-..+=+-+ -=.+ T Consensus 380 ~~~~~n~~~~~~~~~~~~~~kGG~~n---~~e~ 409 (409) T protein:vir:94 380 PLISGDLYPIDTPLELRKSLKGGDKN---VNES 409 (409) T ss_pred EeecccccccccchhhcccccCCCCC---cCCC Confidence 0 011111111111111000 0011 No 216 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=56.77 E-value=0.45 Score=22.49 Aligned_cols=315 Identities=12% Similarity=0.057 Sum_probs=120.9 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccc--cCCCCCccc-ccccccccchHHHHHHHHHHHHHHhh Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSL--FPKDSDNAS-TDYVTPWQAVGARGLNNLASKLMLAL 77 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~--~~~~~~~~~-~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536) |++.+..-.+...+ ...+ .-|.. |.+.+ +.. -....++| -.+|.. T Consensus 1 ~~~~~~~~~~~~~~-------~~~~------------~~~~~~~~~f~~-p~~v~~~~~~~~--~~~~~~---------- 48 (344) T protein:vir:20 1 MSKKKGKTPQPAAK-------TMTA------------SGPKMEAFTFGE-PVPVLDRRDILD--YVECIS---------- 48 (344) T ss_pred CCcccCCCCcchhh-------hhhc------------cCCceEEEEcCC-ceEecCcchhhh--hhhhhh---------- Confidence 88866432221111 0000 00000 11110 000 00000000 001110 Q ss_pred cCCCcceeccCChhhhhhhccChh-HHHHHHHHHHHHHHHHHHHHHhcc---ChHHHHHHHHHHHhhCcEEEEEecCCCC Q lcl|NC_011045. 78 FPMQTWMRLTISEYEAKQLLSDPD-GLAKVDEGLSMVERIIMNYIESNS---YRVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) Q Consensus 78 tP~~~Wf~l~~~d~~~~~~~~~~~-~~~~v~~~L~~ve~~~~~~l~~sn---f~~~~~~~~~dl~~~G~~~l~~~~~~~~ 153 (536) .+.|+.--++-..|.++..... ....+. 
-.-..+...+ +-| -...+..+..|+.+||||.+++..+..+ T Consensus 49 --~~~~~~pp~~~~~la~~~~a~~~h~~~i~----~k~n~l~~~~-~Pn~~lt~~~f~~~~~d~ll~Gnay~~i~rn~~G 121 (344) T protein:vir:20 49 --NGRWYEPPVSFTGLAKSLRAAVHHSSPIY----VKRNILASTF-IPHPWLSQQDFSRFVLDFLVFGNAFLEKRYSTTG 121 (344) T ss_pred --cCceecCCCCHHHHHHHHhhhhhhCccce----ehhhhHHHhc-cCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCC Confidence 1123222222222222110000 000000 0000000000 111 1234556677899999999998877766 Q ss_pred ceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEec Q lcl|NC_011045. 154 NYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVE 233 (536) Q Consensus 154 ~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~ 233 (536) .++.+. |+...++.+..+|.. |. .+ .. + T Consensus 122 ~~~~L~--pl~~~~vr~~~~~~~---~~----------------------------------~~--~~-----------~ 149 (344) T protein:vir:20 122 KVIRLE--TSPAKYTRRGVEEDV---YW----------------------------------WV--PS-----------F 149 (344) T ss_pred cEEEEE--EcCCceeEeeecCCE---EE----------------------------------EE--cc-----------C Confidence 665444 444444443222211 00 00 00 0 Q ss_pred CccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec-cccccchhhhc Q lcl|NC_011045. 234 GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQPRRLT 312 (536) Q Consensus 234 g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~-~~g~~~~~~~~ 312 (536) |..+ . |..--++.+|.....+.+||.+|..-++..+..-+..++-....-.-.+.|.+++. +++..+.++.. 
T Consensus 150 ~~~~-~------~~~~eIiHir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~~d~~l~~e~~~ 222 (344) T protein:vir:20 150 NEPT-A------FAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIE 222 (344) T ss_pred CeEE-E------EcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHH Confidence 1100 0 11112455554434567999999988888777666666666666666677776653 45444433221 Q ss_pred ---------cCC-Cc--cee--cC-Ccccccccccccc-cchhHHHHHHHHHHHHHHHHHhhhh----cccCC-CCCCCH Q lcl|NC_011045. 313 ---------KAQ-TG--DFV--TG-RPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS----AVQRT-GERVTA 371 (536) Q Consensus 313 ---------~~~-~g--~~~--~g-~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~----~~~~~-~~r~TA 371 (536) .++ +| -++ ++ ..+++...++... .+.+ ..+..+..++.|-.+|-.-. ....+ +..-++ T Consensus 223 ~ik~~~~~~~g~~n~r~l~l~~p~g~~~gi~~~pis~~~~d~q-f~e~k~~s~~eIa~af~VPp~llGi~~~~t~~~~n~ 301 (344) T protein:vir:20 223 MLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDD-FFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDI 301 (344) T ss_pred HHHHHHHHhcCCCCccceEEecCCCCccceeEEEcCCChhHHH-HHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccH Confidence 111 11 122 22 2345666665533 3444 44555666778888883321 11111 112234 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcce Q lcl|NC_011045. 372 EEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAV 422 (536) Q Consensus 372 tEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v 422 (536) ++....- ....|.|...++. ++. +++-....+. 
..+.+..++= T Consensus 302 e~~~~~f--~~~~l~P~~~~~e-~in----~~lg~~~i~F-~~~~l~~~d~ 344 (344) T protein:vir:20 302 EKVAKVF--VRNELIPLQDRIR-EIN----GWLGQEVIRF-KNYSLDTDND 344 (344) T ss_pred HHHHHHH--HHHHHHHHHHHHH-HHH----HhcCCccccc-CccccccCCC Confidence 4433321 2233456655554 211 1110000010 0112221110 No 217 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=56.66 E-value=0.45 Score=22.48 Aligned_cols=315 Identities=12% Similarity=0.048 Sum_probs=119.9 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCC----cccccccccc---cchHHHHHHHHHHHHHHhhcCC Q lcl|NC_011045. 8 LAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSD----NASTDYVTPW---QAVGARGLNNLASKLMLALFPM 80 (536) Q Consensus 8 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~----~~~~~~~~~~---dst~~~a~~~Laa~l~~~ltP~ 80 (536) |.+ .|.. -+.......+ .........| |+.+...-..+-. ++ .+.-. T Consensus 1 ~~~------------~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~-~~-~~~~~ 54 (351) T protein:vir:78 1 MSK------------RRSR------------APRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILD-YV-ECWSN 54 (351) T ss_pred CCC------------CCCC------------CCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhh-hh-hhhcc Confidence 110 0000 0000000000 0000000011 1111100000000 00 11112 Q ss_pred CcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccC-------hHHHHHHHHHHHhhCcEEEEEecCCCC Q lcl|NC_011045. 81 QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSY-------RVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf-------~~~~~~~~~dl~~~G~~~l~~~~~~~~ 153 (536) +.|+.--++-..|.++.. 
+..+...+-..-...+ .+.| ...+.+++.|+.+||||.+++..+..+ T Consensus 55 ~~~~~pp~~~~~la~~~~-------~~~~h~~~l~~k~n~l-~~~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~~~rn~~G 126 (351) T protein:vir:78 55 GEWFEPPVSFAGLAKSFR-------ASTHHSSALFFKANVL-ASTFRPHRWLSRHAFERWALDFLTFGNGYLERRRNMVG 126 (351) T ss_pred CceecCCCCHHHHHHHHh-------hhHhhhhhhhhhhhHH-hhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCC Confidence 345543333333333221 1111111111100111 1122 345667888999999999998877766 Q ss_pred ceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEec Q lcl|NC_011045. 154 NYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVE 233 (536) Q Consensus 154 ~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~ 233 (536) .++.+..+|....-+..+.+| | +|..-+ T Consensus 127 ~~~~L~pl~~~~v~~~~~~~~---------------------------------------------------~-~~~~~~ 154 (351) T protein:vir:78 127 GTLRLEPALAKYVRRKADFSG---------------------------------------------------F-VYVNGW 154 (351) T ss_pred CEEEEEEecCcceEEeeeCCe---------------------------------------------------E-EEEecC Confidence 665544444332221111110 0 000001 Q ss_pred CccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec-cccccchhhhc Q lcl|NC_011045. 234 GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQPRRLT 312 (536) Q Consensus 234 g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~-~~g~~~~~~~~ 312 (536) |... .|..--++.+|..-..+++||.+|..-++..+-.-+..+.-....-.-.++|.+++. +++..+.++.. 
T Consensus 155 ~~~~-------~~~~~eVihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pggIl~~~~~~ls~e~~~ 227 (351) T protein:vir:78 155 QERH-------EFAPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTDAAQKQDDVD 227 (351) T ss_pred CeEE-------EEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHH Confidence 1100 011112455554434567999999988887776666555555555555667776543 34433333221 Q ss_pred ---------cCC-C-c-cee--c-CCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh----cccCC-CCCCCH Q lcl|NC_011045. 313 ---------KAQ-T-G-DFV--T-GRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS----AVQRT-GERVTA 371 (536) Q Consensus 313 ---------~~~-~-g-~~~--~-g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~----~~~~~-~~r~TA 371 (536) .+. + | -++ + |..+++...|+.. ..+.+ ..+..+..++.|-.+|-.-. ....+ +..-++ T Consensus 228 ~lr~~~~~~~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~q-f~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~ 306 (351) T protein:vir:78 228 NMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDE-FFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTP 306 (351) T ss_pred HHHHHHHHhcCcccccceeeecCCCCccceeEEEcCCChhHHH-HHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccH Confidence 111 1 1 122 2 2234556666553 33444 33445555677878874322 11111 112234 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHH Q lcl|NC_011045. 372 EEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRG 436 (536) Q Consensus 372 tEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~ 436 (536) ++.... +...-+.|++.++..+... 
++.+.+++..-..+..-.++ T Consensus 307 e~~~~~--------------f~~~~l~P~~~~iee~n~~------l~~~~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 307 DTAARV--------------FGRNEIRPLQARFAELNDW------LGDEVVRFDDYEIPPAPVAA 351 (351) T ss_pred HHHHHH--------------HHHHHHHHHHHHHHHHHhh------cCccceecChhhhccccccC Confidence 443321 2222244544444333221 12223443333333332333 No 218 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=55.84 E-value=0.47 Score=22.38 Aligned_cols=331 Identities=12% Similarity=0.031 Sum_probs=126.2 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccc---CCCC----Ccc------cccccccccchHHHHHH Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLF---PKDS----DNA------STDYVTPWQAVGARGLN 67 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~---~~~~----~~~------~~~~~~~~dst~~~a~~ 67 (536) |.-.+. ..+ .+.+|-.+ -+-.=.+|.-- +.-. .+. +......|.--..+.| T Consensus 1 ~~~~~~----~~~------~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~fg~p~~v- 64 (376) T protein:vir:10 1 MPARDR----PRA------ARRRRHSF-----IFIHGVLRMSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPV- 64 (376) T ss_pred CCCCcc----chh------hhhhcccc-----hhhcccccchhccCCCcccchhhhhHhhhccCcceeEEEEcCCceec- Confidence 544331 111 12222111 11111223100 0000 000 0000001110000000 Q ss_pred HHHHH-HHHhh--cCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccC-------hHHHHHHHHHH Q lcl|NC_011045. 68 NLASK-LMLAL--FPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSY-------RVTLFEALKQL 137 (536) Q Consensus 68 ~Laa~-l~~~l--tP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf-------~~~~~~~~~dl 137 (536) |... +...+ .-.+.|++--++-..|.++.. 
+..+...+=......+ .+.| ...+.+++.|+ T Consensus 65 -~~~~~~~~~~~~~~~~~~~~pp~~~~~La~~~~-------~~~~h~s~l~~k~n~l-~~~~~Pnp~lT~~~f~~~v~d~ 135 (376) T protein:vir:10 65 -MNRAEILDYVECWSNGEWFEPPVSFAGLAKSFR-------ASTHHSSALFFKANVL-ASTFRPHRWLSRHAFERWALDF 135 (376) T ss_pred -cCcchhhhhhhhhhcCceecCCCCHHHHHHHHh-------hhHHhhhhHHHHhHHH-HhccCCCCCCCHHHHHHHHHHH Confidence 1111 11111 112345554444333333221 1111111111111111 1222 35567788899 Q ss_pred HhhCcEEEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEE Q lcl|NC_011045. 138 VVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHI 217 (536) Q Consensus 138 ~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v 217 (536) .+||||.+++..+..+.++.+..+| ..++.+..++. T Consensus 136 ll~Gnay~~~~rn~~G~~~~L~pl~--~~~vr~~~d~~------------------------------------------ 171 (376) T protein:vir:10 136 LTFGNGYLERRRNMVGGTLRLEPAL--AKYVRRKADFN------------------------------------------ 171 (376) T ss_pred HhcCCeEEEEEECCCCCEEEEEEeC--CcceEEEeeCC------------------------------------------ Confidence 9999999998777666555444433 33332211110 Q ss_pred EecCCCCceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_011045. 218 YLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVI 297 (536) Q Consensus 218 ~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~ 297 (536) .| +|....+..+ . 
|..--++.+|.....+..||.+|..-++..+-.-+..++-....-.-...|- T Consensus 172 -------~~-~~~~~~~~~~-~------~~~~eViHir~~~~~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~NGa~pg 236 (376) T protein:vir:10 172 -------GF-VYVNGWQERH-E------FEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAG 236 (376) T ss_pred -------eE-EEEEcCCeEE-E------EccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Confidence 00 0000011100 0 1111245555444456799999998888877766666655555555566777 Q ss_pred eeec-cccccchhhhc-------c-CCCc----cee--c-CCcccccccccccc-cchhHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_011045. 298 GLVN-PAGITQPRRLT-------K-AQTG----DFV--T-GRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS 360 (536) Q Consensus 298 ~lv~-~~g~~~~~~~~-------~-~~~g----~~~--~-g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~ 360 (536) .++. ++...+.++.. . .|.| -++ + |..+++...++... .+.+ ..+..+..++.|-.+|-.-. T Consensus 237 gIl~~~d~~l~~e~~~~lr~~~~~~~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~q-f~e~k~~~~~eIa~af~VPp 315 (376) T protein:vir:10 237 FILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDE-FFNIKNVTRDDLLAAHRVPP 315 (376) T ss_pred eEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEccCCHHHHH-HHHHHHHhHHHHHHHhCCCH Confidence 6543 34333433211 1 1111 112 2 22455666666543 4454 34455555677878874321 Q ss_pred ----cccCC-CCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHH Q lcl|NC_011045. 361 ----AVQRT-GERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGR 435 (536) Q Consensus 361 ----~~~~~-~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r 435 (536) ....+ +..-++++.... +...-|.|++.++..+... ++.+.+++..-.-+....+ T Consensus 316 ~llGi~~~~t~~~sn~eq~~~~--------------f~~~~L~Pl~~~ieeln~~------L~~~~~~F~~~~Llr~d~k 375 (376) T protein:vir:10 316 QLLGIVPSNSGGFGTPDTAARV--------------FGRNEIRPLQARFAELNDW------LGEEVVRFDDYEIPPAPVA 375 (376) T ss_pred HHhcccCCCCCCcccHHHHHHH--------------HHHHHHHHHHHHHHHHHhh------ccccccccChhHhhccccc Confidence 11111 122345444332 1222244444444332211 1122233322222222222 Q ss_pred H Q lcl|NC_011045. 
436 G 436 (536) Q Consensus 436 ~ 436 (536) + T Consensus 376 a 376 (376) T protein:vir:10 376 A 376 (376) T ss_pred C Confidence 2 No 219 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=54.53 E-value=0.5 Score=22.23 Aligned_cols=375 Identities=11% Similarity=0.036 Sum_probs=130.5 Q ss_pred HHHHHHHHHHhh---hHHHHHHHHHHHhcccccCCCCCcccccccccc--cchHHHHHHHHHHHHHHhhcCCCcceeccC Q lcl|NC_011045. 14 KSVYERLKNDRA---PYETRAQNCAQYTIPSLFPKDSDNASTDYVTPW--QAVGARGLNNLASKLMLALFPMQTWMRLTI 88 (536) Q Consensus 14 ~~r~~~l~~~R~---~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~--dst~~~a~~~Laa~l~~~ltP~~~Wf~l~~ 88 (536) -..++.|...++ .....| +.-+......+...+......+ .++--.|++.+|+.+-+ + |-.-|-+-.- T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~-l-p~~~~~~~~d 73 (423) T protein:vir:81 1 MGFLQKLGLAPSVVATPEPIE-----LVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVAS-L-QLQAFERVED 73 (423) T ss_pred CchhHhhccccccccCccccc-----cccccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhh-C-ceEEEEEecC Confidence 122222221111 111111 0000011111111111122222 22333566666666533 2 4111111111 Q ss_pred ChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhc----cChHHHHHHHHHHHhhCcEEEEEecCCCCc--eeeEEEEe Q lcl|NC_011045. 89 SEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESN----SYRVTLFEALKQLVVAGNVLLYLPEPEGSN--YNPMKLYR 162 (536) Q Consensus 89 ~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~G~~~l~~~~~~~~~--~~~~~~~~ 162 (536) .+. +. ++ +.-++..+.+= ..+.-+.....++..+|||.+++..+.++. 
.+.++.++ T Consensus 74 g~~--~~----------~~------~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~ 135 (423) T protein:vir:81 74 GGR--ER----------VR------EGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIP 135 (423) T ss_pred Cce--ee----------ec------cchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecc Confidence 110 00 00 01111222222 244445566678889999999987654322 11222222 Q ss_pred cceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccccccc Q lcl|NC_011045. 163 LSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDG 242 (536) Q Consensus 163 l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~ 242 (536) +..+.+....+|. ..+ .|.......+ +|..+. T Consensus 136 ~~~v~~~~~~~~~---------------------------------~~~-~Y~~~~~~~~----------~g~~~~---- 167 (423) T protein:vir:81 136 VSWVQRRAYKDGW---------------------------------GSL-DYIIIESGDN----------DGRSVK---- 167 (423) T ss_pred cceeeeeeccCCC---------------------------------cce-EEEEEEecCC----------CceEEE---- Confidence 2222211111100 000 0111111111 111110 Q ss_pred ccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccc-----ccchhh------- Q lcl|NC_011045. 243 TYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAG-----ITQPRR------- 310 (536) Q Consensus 243 ~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g-----~~~~~~------- 310 (536) +..-=+++.|....++..||.||...+...+.......+.......-...|-.++.-+. -++.+. T Consensus 168 ---~~~~evih~r~~~~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~ 244 (423) T protein:vir:81 168 ---VPGERVIHRHGYNPKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMAN 244 (423) T ss_pred ---EcccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHH Confidence 00111455565555666799999999999888888888888887777777876653211 112211 Q ss_pred hc---cCC---CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHHHHHHHHHHHH Q lcl|NC_011045. 
311 LT---KAQ---TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELED 383 (536) Q Consensus 311 ~~---~~~---~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEi~~r~~E~~~ 383 (536) +. .++ .|.+.. -.++....++.. ..+.+.+. ..+-.+..|-.+|-.-....-+.+.-|-.-+.+... T Consensus 245 ~~~~~~~~~~n~g~~~v-l~~g~~~~~l~~s~~d~q~~e-~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~~~~---- 318 (423) T protein:vir:81 245 LRASFSPKSSDVGGTLL-LEDGMKAENFHTTSKDEQTVE-TTKLSLQTVAQVYGINPTMVGQLDNANYSNVREFRK---- 318 (423) T ss_pred HHHHhccccccCCccee-cCCCceEEeccCChhhHHHHH-HHHhhHHHHHHHhCCCHHHhcCCCCCCcccHHHHHH---- Confidence 11 011 121111 112223333332 23444432 334445667777743221111111212111111111 Q ss_pred HhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCC--CcceEEEE-echHHH---HHHHHHHHHHH-H--HH--HHH--- Q lcl|NC_011045. 384 TLGGVYSILSQELQLPLVRVLLKQLQATQQIPELP--KEAVEPTI-STGLEA---IGRGQDLDKLE-R--CV--AAW--- 449 (536) Q Consensus 384 ~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~--~~~v~v~~-vs~La~---a~r~~~~~~l~-~--~~--~~~--- 449 (536) .+...-|.|++.++-..+.+ .++|+.. ...+.++| ++.|-+ ..|....+..+ + ++ +.+ T Consensus 319 -------~f~~~~L~P~~~~ie~~l~~-~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~ 390 (423) T protein:vir:81 319 -------ALYGDNLGSWIRIIQDVMNL-FLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAM 390 (423) T ss_pred -------HHHHHHHHHHHHHHHHHHhh-hhcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHH Confidence 12333355555544444333 2444432 23333444 222211 11222222111 1 11 111 Q ss_pred Hhhcch-hhhhcC---CHHHHHHHHHHHcCCChhhc Q lcl|NC_011045. 450 AALAPM-RDDPDI---NLAMIKLRIANAIGIDTSGI 481 (536) Q Consensus 450 ~~~~p~-~~~~~i---d~d~~~~~~a~~~Gv~p~~i 481 (536) -.+.|. -.|..+ |... 
.+ -.+.-| ++..- T Consensus 391 ~gl~p~~gGD~~~~p~n~~~-~~-~~~~~~-~~~~t 423 (423) T protein:vir:81 391 DNLPSIDGGDDLARPLNTEF-GD-SEDAPG-EEVET 423 (423) T ss_pred hCCCCCCCcceeeccccccc-Cc-cCCCCC-CCCCC Confidence 112221 111111 1100 00 001111 01111 No 220 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=50.59 E-value=0.61 Score=21.78 Aligned_cols=304 Identities=11% Similarity=0.058 Sum_probs=127.2 Q ss_pred cccccCCCC---Cc-ccccccc--cccchHHHHHHHHHHHHHHhhc----CCCcceeccCChhhhhhhccChh-HHHHHH Q lcl|NC_011045. 39 IPSLFPKDS---DN-ASTDYVT--PWQAVGARGLNNLASKLMLALF----PMQTWMRLTISEYEAKQLLSDPD-GLAKVD 107 (536) Q Consensus 39 ~P~~~~~~~---~~-~~~~~~~--~~dst~~~a~~~Laa~l~~~lt----P~~~Wf~l~~~d~~~~~~~~~~~-~~~~v~ 107 (536) +-....... .. +..+... -=++++ +.++..+. ....|+.--++-..|.++..... ..+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~y~~~~~~~~~~~~epp~~~~~la~~~~~~~~h~~~i- 71 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLSEITA--------SPALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGIL- 71 (345) T ss_pred CCccccccchhhhcCCCceEEEeecCCccc--------chhhcccceeeecCCccccCCCCHHHHHHHhhcchhhcchh- Confidence 111100000 00 0000000 002222 22333332 23467775555444544321111 01111 Q ss_pred HHHHHHHHHHHHHHHhccC-------hHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCCCCCeEEEE Q lcl|NC_011045. 
108 EGLSMVERIIMNYIESNSY-------RVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMV 180 (536) Q Consensus 108 ~~L~~ve~~~~~~l~~snf-------~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~ 180 (536) .-....+ .++| ...+.+...|+.+||||.+++..+..+.++.+ +|+...++.+..+|...-.+ T Consensus 72 ---~~k~n~l-----~~~~~Pn~~~t~~~f~~~v~d~ll~Gnay~~i~rn~~G~~~~L--~pl~~~~vr~~~d~~~~~~~ 141 (345) T protein:vir:37 72 ---HSRANMV-----SATYEGGKALSKMEMRALCLNLIQFGDVGLLKVRNGFGQVVRL--VPLSSLYLRVHKDGGYSYLM 141 (345) T ss_pred ---hhhhhHH-----hhccCCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCCEEEE--EEecCceeEEeecCCeeEEE Confidence 0000011 1222 34566777899999999999887776665544 44434444433222111111 Q ss_pred EeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccccCceEEEeeeecC Q lcl|NC_011045. 181 TRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLD 260 (536) Q Consensus 181 r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ 260 (536) +. ... ...|... . |..--++.+|..... T Consensus 142 ~~---------------------------------~~~------------~~~g~~~-----~--~~~~eViHir~~~~~ 169 (345) T protein:vir:37 142 KK---------------------------------SLY------------DTAQEIY-----R--YDAKDIIFIKLYDPM 169 (345) T ss_pred ee---------------------------------eee------------ccCceEE-----E--EccccEEEEcCCCCC Confidence 00 000 0001100 0 111124445533334 Q ss_pred CCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceee-ccccccchhhhc---------cCC-C--ccee--cC-Cc Q lcl|NC_011045. 261 GESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLV-NPAGITQPRRLT---------KAQ-T--GDFV--TG-RP 324 (536) Q Consensus 261 ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv-~~~g~~~~~~~~---------~~~-~--g~~~--~g-~~ 324 (536) +..||.+|..-++-.+-.-+..++-..+.-.-...|-+++ .++...+.++.. .++ + +-++ ++ .. 
T Consensus 170 ~~~~Gl~~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~Il~~t~~~l~~e~~~~lk~~~~~~~g~~n~~~~~i~~~~g~~ 249 (345) T protein:vir:37 170 QQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEEIARKISESKGVGNFRSMFVNIAGGHP 249 (345) T ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCc Confidence 6789999987777666554544444444444456777665 244444443222 111 1 1112 22 23 Q ss_pred ccccccccccc-cchhHHHHHHHHHHHHHHHHHhhhh----cccCCC-CCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHH Q lcl|NC_011045. 325 EDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS----AVQRTG-ERVTAEEIRYVASELEDTLGGVYSILSQELQL 398 (536) Q Consensus 325 ~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~----~~~~~~-~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~ 398 (536) +++...++... .+.+ ..+..+..++.|-.+|-.-. ....+. ..-++++... .+...-+. T Consensus 250 ~G~~~~pl~~~~~d~q-f~e~k~~~~~dI~~a~~VPp~liGi~~~~t~~~s~~e~~~~--------------~f~~~~l~ 314 (345) T protein:vir:37 250 DGLKVIPIGDTGTKDE-FANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYRE--------------VYHYDEVM 314 (345) T ss_pred cceeEEEccCChhHHH-HHHHHHHhHHHHHHHhCCCHHHhccccCCCCCcccHHHHHH--------------HHHHHHHH Confidence 45666665543 3444 44455666778888884321 111111 1122333222 12233356 Q ss_pred HHHHHHHHHHHhcCCCCCCCCcceEEEEec-hHHH Q lcl|NC_011045. 399 PLVRVLLKQLQATQQIPELPKEAVEPTIST-GLEA 432 (536) Q Consensus 399 Pli~r~~~il~~~g~lp~~~~~~v~v~~vs-~La~ 432 (536) |++.++...+.+ +|++++. ..++|.- -|.+ T Consensus 315 P~~~~ie~~ln~---~~e~~~~-~~i~F~~~~l~k 345 (345) T protein:vir:37 315 PLQEIIAETINQ---DPEIKNL-LKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHhhh---hhccCCc-ceEEECchhhcC Confidence 776666666654 3455442 3344331 2333 No 221 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=49.61 E-value=0.63 Score=21.67 Aligned_cols=316 Identities=13% Similarity=0.065 Sum_probs=122.2 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccc-ccccccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_011045. 
1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS-TDYVTPWQAVGARGLNNLASKLMLALFP 79 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~-~~~~~~~dst~~~a~~~Laa~l~~~ltP 79 (536) |++.+.+-.....++. .+. .. .. .-|.+.+ +.. -...+++ ..+.. .- T Consensus 1 m~~~~~~~~~~~~~~~---~~~--~~------~~------~~~~f~~-p~~v~~~~~~~-----~~~~~---------~~ 48 (344) T protein:vir:60 1 MSKKKGKTLQPAAKKM---TAS--AP------KM------EAFTFGE-PVPVLDRRDIL-----DYVEC---------IS 48 (344) T ss_pred CCcccCCCCCchHHhh---cCC--cC------cE------EEEEcCC-ceeecCCcchh-----HHHHh---------hh Confidence 7775543221111000 000 00 00 0011110 000 0000000 00000 00 Q ss_pred CCcceeccCChhhhhhhccChh-HHHHHHHHHHHHHHHHHHHHHhcc---ChHHHHHHHHHHHhhCcEEEEEecCCCCce Q lcl|NC_011045. 80 MQTWMRLTISEYEAKQLLSDPD-GLAKVDEGLSMVERIIMNYIESNS---YRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) Q Consensus 80 ~~~Wf~l~~~d~~~~~~~~~~~-~~~~v~~~L~~ve~~~~~~l~~sn---f~~~~~~~~~dl~~~G~~~l~~~~~~~~~~ 155 (536) .+.|+.--++-..|.++..... ....+. -....+...+ +-| -...+.....|+.+||||.+++..+..+.+ T Consensus 49 ~~~~~~pp~~~~~la~~~~a~~~h~~~i~----~k~n~l~~~~-~Pn~~~t~~~f~~~~~d~ll~Gnay~~i~rn~~G~~ 123 (344) T protein:vir:60 49 NGRWYEPPISFTGLAKSLRAAVHHSSPIY----VKRNILASTF-IPHPWLSQQDFSRFVLDFLVFGNAFLEKRYSTTGKV 123 (344) T ss_pred cCccccCCCCHHHHHHHHHhhhhhccchh----hhhhHHHhhc-cCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCcE Confidence 1123222222222221110000 000010 0000111111 111 123456677899999999999887776666 Q ss_pred eeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCc Q lcl|NC_011045. 156 NPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGM 235 (536) Q Consensus 156 ~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~ 235 (536) +.+. |+...++.+..+|.. | | .+ .. +|. 
T Consensus 124 ~~L~--~l~~~~vr~~~~~~~---~---------------------------------~-~v--~~-----------~~~ 151 (344) T protein:vir:60 124 IRLE--TSPAKYTRRGVEEDV---Y---------------------------------W-WV--PS-----------FNE 151 (344) T ss_pred EEEE--EcCcceEEEeecCCe---E---------------------------------E-EE--cc-----------CCe Confidence 5444 444444443222210 0 0 00 00 011 Q ss_pred cccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec-cccccchhhhc-- Q lcl|NC_011045. 236 EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQPRRLT-- 312 (536) Q Consensus 236 ~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~-~~g~~~~~~~~-- 312 (536) .+ .|..-.++.+|.....+.+||.+|..-++..+..-+..++-....-.-...|-.++. ++...+.++.. T Consensus 152 ~~-------~~~~~eIiHir~~~~~~~~yGlsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~~~ls~e~~~~i 224 (344) T protein:vir:60 152 PT-------AFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEML 224 (344) T ss_pred EE-------EEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHH Confidence 00 011112455554434577999999988888777766666656666666667776553 44434433221 Q ss_pred -----c-CCCcc----ee--cC-Ccccccccccccc-cchhHHHHHHHHHHHHHHHHHhhhh----cccCCC-CCCCHHH Q lcl|NC_011045. 313 -----K-AQTGD----FV--TG-RPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS----AVQRTG-ERVTAEE 373 (536) Q Consensus 313 -----~-~~~g~----~~--~g-~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~----~~~~~~-~r~TAtE 373 (536) . .+.|. ++ ++ ..+++...++... .+.+ ..+..+-.++.|-.+|-.-. ....+. .--++++ T Consensus 225 k~~~~~~~g~~~~r~~~l~~p~g~~~g~~~~pis~~~~d~q-f~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~e~ 303 (344) T protein:vir:60 225 RENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDD-FFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEK 303 (344) T ss_pred HHHHHHhcCCCCCcceEEecCCCCccceeEEEcCCChhHHH-HHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHH Confidence 1 11111 22 22 2344556665533 3444 44556666788888884321 111111 1224454 Q ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHH-HHhcCCCCCCCCcce Q lcl|NC_011045. 
374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQ-LQATQQIPELPKEAV 422 (536) Q Consensus 374 i~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~i-l~~~g~lp~~~~~~v 422 (536) ....- ....|.|...++. ++ ...+.. ..+. ..+.++.++= T Consensus 304 ~~~~f--~~~~L~Pl~~~~e-~l-----n~~lg~~~i~F-~~~~l~~~d~ 344 (344) T protein:vir:60 304 VAKVF--VRNELIPLQDRIR-EI-----NGWLGQEVIRF-KNYSLDTDNG 344 (344) T ss_pred HHHHH--HHHHHHHHHHHHH-HH-----HHhcCCccccc-CccccCCCCC Confidence 44322 2233556666664 21 111110 0010 1122322221 No 222 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=47.83 E-value=0.69 Score=21.47 Aligned_cols=459 Identities=12% Similarity=0.055 Sum_probs=188.1 Q ss_pred CCCccccc---cHHHHHHHHHHHHHHhhhHHHH-H--HHHHHHhcccccCCCCCc-cc------ccccccccchHHHHHH Q lcl|NC_011045. 1 MAEKRTGL---AEEGAKSVYERLKNDRAPYETR-A--QNCAQYTIPSLFPKDSDN-AS------TDYVTPWQAVGARGLN 67 (536) Q Consensus 1 Ma~~~~~~---~~~~~~~r~~~l~~~R~~~e~~-w--~e~~~~~~P~~~~~~~~~-~~------~~~~~~~dst~~~a~~ 67 (536) |-...-.+ +......+ ....+..|+.. + +..+.|.-+......... .. .+.--..++.+..+++ T Consensus 2 ~~~~~r~~~~~a~~~~~~~---~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~ 78 (553) T protein:vir:63 2 TKVTVRKLSEVTSGRPEQS---ASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVG 78 (553) T ss_pred cchhhhhhcccccccchhh---hhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 11100000 00000000 00111111110 0 011111111111111100 00 0111136788999999 Q ss_pred HHHHHHHHh-hcC-CCc-ceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHH----------HhccChHHHHHHH Q lcl|NC_011045. 68 NLASKLMLA-LFP-MQT-WMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYI----------ESNSYRVTLFEAL 134 (536) Q Consensus 68 ~Laa~l~~~-ltP-~~~-Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l----------~~snf~~~~~~~~ 134 (536) .+++.+++. ++| ++| |=.|...+.. 
..++|-+.|++.....- -..+||.....++ T Consensus 79 ~~~~nvVG~Gi~~~~~~~~~~l~g~~~~------------~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~ 146 (553) T protein:vir:63 79 YQRDSIVGAQYRLNSMPDINVIPGATEE------------WAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGV 146 (553) T ss_pred HHHHhhccCCceeeeccchhhhcCCCHH------------HHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHH Confidence 999999884 668 354 5444222221 23334444444433322 2458999999999 Q ss_pred HHHHhhCcEEEEEecCC-CCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEE Q lcl|NC_011045. 135 KQLVVAGNVLLYLPEPE-GSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDV 213 (536) Q Consensus 135 ~dl~~~G~~~l~~~~~~-~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v 213 (536) ...++-|-+++-..... .+..+.+++ +-+..+.|..... . + ..-.| T Consensus 147 r~~~~dGE~~~~~~~~~~~~~~~~~~l----------------------q~ie~drl~~~~~--------~--~-~~~~i 193 (553) T protein:vir:63 147 VGYVKTGEVLATAEWDRAANRPYATCF----------------------QMVSTDRLSNPYQ--------Q--L-DTPTL 193 (553) T ss_pred HHHHhCCceEEEeeeccCCCCcccceE----------------------EEechhhcCCCCC--------C--C-CCCee Confidence 99999998876432221 111111111 1111111111000 0 0 11146 Q ss_pred EEEEEecCCCCceeEEEEe--cCcccccccccc------cccc--CceEEEeee-ecCCCccccchHHHHHHHHHHHHHH Q lcl|NC_011045. 214 YTHIYLDEDSGEYIRYEEV--EGMEVQGSDGTY------PKEA--CPYIPIRMV-RLDGESYGRSYIEEYLGDLRSLENL 282 (536) Q Consensus 214 ~~~v~p~~~~~~~~~~~~v--~g~~i~~~~~~~------~~~~--~P~~~~rw~-~~~ge~YGrgp~~~~l~d~~~L~~l 282 (536) +..|+.+..|..-.+|..- .|.......+.. .+.. -|-++.-|. ..+|..=|.+..-.+|..++.|+.. 
T Consensus 194 ~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~~r~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y 273 (553) T protein:vir:63 194 RRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYKWKFVQQSKPWGRRQVIHILEPREPDQSRGIADIVSGLKDMRMAKRF 273 (553) T ss_pred EeeeEECCCCceEEEEeeccCCCccccccccccceeeeccccccChhHheecccccCCCcccCCchHHHHHHHHHHHhHH Confidence 6677766554332222110 111100000000 0011 122222222 3688899999999999999999999 Q ss_pred HHHHHHHHHHHhCCceeecccc-ccch------------------------------hhhccCCCcceecCCcc-ccccc Q lcl|NC_011045. 283 QEAIVKMSMISSKVIGLVNPAG-ITQP------------------------------RRLTKAQTGDFVTGRPE-DISFL 330 (536) Q Consensus 283 ~~~~~~~~~~a~~p~~lv~~~g-~~~~------------------------------~~~~~~~~g~~~~g~~~-~~~~~ 330 (536) ..+.+.++..++...+.+..+. .-.. ......++|.|+.-.++ ++.+. T Consensus 274 ~daeL~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~ 353 (553) T protein:vir:63 274 KEMSLQNAVINASYAAAIESELPPEFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLK 353 (553) T ss_pred HHHHHHHHHHhhhheeeeecCCChhhhhhhcccccccccccccccccccccccccccccceeecCceeeecCCCCeeeec Confidence 9999999999999987765221 0000 00112234444433333 23322 Q ss_pred ccc-cccchhHHHHHHHHHHHHHHHHH-h-hhhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 331 QLE-KQADFTVAKAVSDAIEARLSFAF-M-LNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQ 407 (536) Q Consensus 331 ~~~-~~~~~~~~~~~i~~~~~rI~~af-~-~~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~i 407 (536) .-. .+++|. .....+...|..++ + +..+. .|-..++=.-+++-..|..+.+-..=..|...|..|+..+.+.. 
T Consensus 354 ~p~~p~~~~~---~F~~~~lr~iaaglGi~Ye~lt-~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ 429 (553) T protein:vir:63 354 PMGTPGGVGS---EFEASLNRHLASAFGMSYEEFT-RDFSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEE 429 (553) T ss_pred CCCCCCCCHH---HHHHHHHHHHHhhcCCCHHHHh-hhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222 123333 22333344454444 1 22222 35456666666666667666666666667788899999999999 Q ss_pred HHhcCCCCCCCCc-------------ceEEEEechHHH-HHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHH Q lcl|NC_011045. 408 LQATQQIPELPKE-------------AVEPTISTGLEA-IGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANA 473 (536) Q Consensus 408 l~~~g~lp~~~~~-------------~v~v~~vs~La~-a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~ 473 (536) ....|.++-+.+. .++++++.|=-. .=-..+++.....+.. -+. ....++.. T Consensus 430 a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~-----------G~~---t~~~~~a~ 495 (553) T protein:vir:63 430 AIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDA-----------GLS---TYEREIAR 495 (553) T ss_pred HHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHc-----------CCC---CHHHHHHH Confidence 9999998743321 123343333100 0000011111100000 000 00112222 Q ss_pred cCCChhhccCCHHHHHHHHHHHHHHHHHHHHH----HH-HHHHHHHhhhcCcc-hHHhhhhcCCC Q lcl|NC_011045. 474 IGIDTSGILLTEEQKQQKMAQQSMQMGMDNGA----AA-LAQGMAAQATASPE-AMAAAADSVGL 532 (536) Q Consensus 474 ~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a----~~-~~~~~~~~~~~~~~-~~~~~~~~~~~ 532 (536) .|.||..++ ..++...+.++..... +. 
..+....+.+..++ +....+...|- T Consensus 496 ~G~D~~~v~-------~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 496 LGGDFRKSF-------AQRAREDALLKKYGLTFNLSAKRSLGDGRDAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred hCCCHHHHH-------HHHHHHHHHHHHcCCCCCCCCccccCCCcccCCCCCCCCCCCCcccccC Confidence 344433222 2222111111111000 00 00000000000000 00000000110 No 223 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=46.11 E-value=0.75 Score=21.28 Aligned_cols=315 Identities=12% Similarity=0.038 Sum_probs=118.2 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCC----cccccccccc---cchHHHHHHHHHHHHHHhhcCC Q lcl|NC_011045. 8 LAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSD----NASTDYVTPW---QAVGARGLNNLASKLMLALFPM 80 (536) Q Consensus 8 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~----~~~~~~~~~~---dst~~~a~~~Laa~l~~~ltP~ 80 (536) |.+ .|.. -+.......+ .........| |+.+...-..+-..+ .+.-. T Consensus 1 ~~~------------~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~--~~~~~ 54 (351) T protein:vir:79 1 MSK------------RRSR------------APRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYV--ECWSN 54 (351) T ss_pred CCC------------CCCC------------CCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhh--hhhhc Confidence 110 0000 0000000000 0000000011 111110000000000 01112 Q ss_pred CcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccC-------hHHHHHHHHHHHhhCcEEEEEecCCCC Q lcl|NC_011045. 81 QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSY-------RVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) Q Consensus 81 ~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf-------~~~~~~~~~dl~~~G~~~l~~~~~~~~ 153 (536) +.|+.--++-..|.++.. 
+..+...+=..-...| .+.| ...+.+...|+.+||||.+++..+..+ T Consensus 55 ~~~~~pp~~~~~la~~~~-------~~~~h~~~l~~k~n~l-~~~~~Pnp~~t~~~f~~~v~d~ll~Gnay~~~~r~~~G 126 (351) T protein:vir:79 55 GEWFEPPVSFAGLAKSFR-------ASTHHSSALFFKANVL-ASTFRPHRWLSRHAFERWALDFLTFGNGYLERRRNMVG 126 (351) T ss_pred CceecCCCCHHHHHHHHh-------hhHhhhhhhhhhhhHH-hhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCC Confidence 245543333333333221 1111211111101111 1122 334667788999999999998776666 Q ss_pred ceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEec Q lcl|NC_011045. 154 NYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVE 233 (536) Q Consensus 154 ~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~ 233 (536) .++.+ +|+..-++.+..++. .| ++.... T Consensus 127 ~~~~L--~~l~~~~v~~~~~~~-------------------------------------------------~~-~~~~~~ 154 (351) T protein:vir:79 127 GTLRL--EPALAKYVRRKADFS-------------------------------------------------GF-VYVNGW 154 (351) T ss_pred CEEEE--EEeCCcceeeeecCC-------------------------------------------------eE-EEEecC Confidence 55444 443333332211110 00 000011 Q ss_pred CccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec-cccccchhhhc Q lcl|NC_011045. 234 GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQPRRLT 312 (536) Q Consensus 234 g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~-~~g~~~~~~~~ 312 (536) |..+ . |..--.+..|..-..++.||.+|..-++..+-.-+..++-....-.-.+.|-+++. ++...+.++.. 
T Consensus 155 g~~~-~------~~~~eIihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~il~~~~~~ls~e~~~ 227 (351) T protein:vir:79 155 QERH-E------FEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTDAAQKQDDVD 227 (351) T ss_pred ceEE-E------EcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHH Confidence 1111 0 11112455554444577999999888777776655555555555555667776542 44433333221 Q ss_pred ---------cC-CC--ccee--c-CCcccccccccccc-cchhHHHHHHHHHHHHHHHHHhhhh----cccCC-CCCCCH Q lcl|NC_011045. 313 ---------KA-QT--GDFV--T-GRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS----AVQRT-GERVTA 371 (536) Q Consensus 313 ---------~~-~~--g~~~--~-g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~----~~~~~-~~r~TA 371 (536) .+ ++ +.++ + |..+++...|+... .+.+ ..+..+..++.|-.+|-.-. ....+ +..-++ T Consensus 228 ~lk~~~~~~~G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~e-f~e~k~~s~~eI~~a~~VPp~llGi~~~~t~~~~n~ 306 (351) T protein:vir:79 228 NMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDE-FFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTP 306 (351) T ss_pred HHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHH-HHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccH Confidence 11 01 1111 2 22345566665543 3444 34455555677888874321 11111 112344 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHH Q lcl|NC_011045. 372 EEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRG 436 (536) Q Consensus 372 tEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~ 436 (536) ++....- ....|.|...++ ..+... 
+..+.+++..-.-|....++ T Consensus 307 e~~~~~f--~~~~l~Pl~~~i------------e~ln~~------lg~~~~~F~~~~llr~d~~a 351 (351) T protein:vir:79 307 DTAARVF--GRNEIRPLQARF------------AELNDW------LGDEVVTFDDYEIPPAPVAA 351 (351) T ss_pred HHHHHHH--HHHHHHHHHHHH------------HHHHhh------cCcceeeeChhhhccccccC Confidence 4443321 112234444444 332211 12223343332323222222 No 224 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=45.75 E-value=0.76 Score=21.24 Aligned_cols=420 Identities=10% Similarity=0.046 Sum_probs=159.1 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHH-HHhcccccC-CCCCccccc-ccc-cccchHHHHHHHHHHHHHHhhcCCCcc Q lcl|NC_011045. 8 LAEEGAKSVYERLKNDRAPYETRAQNCA-QYTIPSLFP-KDSDNASTD-YVT-PWQAVGARGLNNLASKLMLALFPMQTW 83 (536) Q Consensus 8 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~-~~~~P~~~~-~~~~~~~~~-~~~-~~dst~~~a~~~Laa~l~~~ltP~~~W 83 (536) +.+..+...-. -+..+...|.... .++.|.... .....+.-+ ... .-|++-.-++++....+.+ .+| T Consensus 1 v~~~~l~~e~a----t~~~~~d~~~~~~~~l~~~~~~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk~av~~-----~~w 71 (488) T protein:vir:99 1 MEKPALGREIA----TSGDGRDITRPFISGLQVPNDSILQRRGGNDLRVYEEILSDAQVKTVWGQRQLAVVS-----REW 71 (488) T ss_pred CCccchhHHHH----HHHhhhhhhccccCCCCCCChHHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHHhc-----CCc Confidence 32222211111 1122222222111 122222100 000111100 000 1255555555555555543 255 Q ss_pred eeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceeeE-EEEe Q lcl|NC_011045. 84 MRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPM-KLYR 162 (536) Q Consensus 84 f~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~-~~~~ 162 (536) -= .+.+... .+......|+ +.|.+.+|...+.+++ +.+.||-++.=+......+.+.. 
+..+ T Consensus 72 ~i-~p~~~~~----~~~~~ae~v~-----------~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~ 134 (488) T protein:vir:99 72 KV-EAGGDRP----IDQAAAEHLE-----------QQLQRVGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKV 134 (488) T ss_pred eE-EcCCCCh----HHHHHHHHHH-----------HHHhCCCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeee Confidence 43 3322110 0111122233 3344568887777776 56678877653221111111111 0111 Q ss_pred cceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCcccccccc Q lcl|NC_011045. 163 LSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDG 242 (536) Q Consensus 163 l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~ 242 (536) ...-++..|.+|+. .+. ..++ ..+|..++. T Consensus 135 r~~~~f~~d~~~~l-----------------------------------~~~-----~~~~-------~~~g~~lp~--- 164 (488) T protein:vir:99 135 RNRRRFRYDQDGGL-----------------------------------RLL-----TPNN-------MFEGEPCPA--- 164 (488) T ss_pred ecccceeecCCCce-----------------------------------EEe-----ccCC-------CCCcccccc--- Confidence 11111111111110 000 0000 001111110 Q ss_pred ccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccc--cccch--hhh----cc- Q lcl|NC_011045. 243 TYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPA--GITQP--RRL----TK- 313 (536) Q Consensus 243 ~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~--g~~~~--~~~----~~- 313 (536) .+=|++.+....+|+.||.|....+..-..--+...+..+..+++---|..+..-+ +-.+. ..+ .. T Consensus 165 -----~~~~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~~ek~~l~~av~~~ 239 (488) T protein:vir:99 165 -----PYFWHFSTGADNDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATPEDKAKLLAALHAI 239 (488) T ss_pred -----CceEEEEeecCCCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCHHHHHHHHHHHHHH Confidence 11177888888899999999999999998888888999999999988887665422 22221 111 11 Q ss_pred -CCCcceecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCC-CCCHHHHHHHHHHHHHHhhhhHHH Q lcl|NC_011045. 
314 -AQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGE-RVTAEEIRYVASELEDTLGGVYSI 391 (536) Q Consensus 314 -~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~-r~TAtEi~~r~~E~~~~LG~v~~r 391 (536) +..+.++|.. ..+..+.... +.-..-...++.+.+.|+++++...+...++. .....|+..... ...+-.-... T Consensus 240 ~~~~~~viP~~-~~ie~~ea~~-~~~~~~~~li~~~d~~Isk~iLGqtlts~~~~Gs~a~~~vh~~v~--~d~~~aDa~~ 315 (488) T protein:vir:99 240 QTDSAIIMPAG-MQAELLEAGR-SGTADYKTLHDTMDATIAKVGLGQVASTQGTPGRLGNDDLQADVR--LDLVKADADL 315 (488) T ss_pred hcCcEEEecCC-ceeEEeecCC-CChHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHHHH--HHHHHHHHHH Confidence 1223344433 3455555432 33344577899999999999886654422222 233444443222 2222233333 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHH Q lcl|NC_011045. 392 LSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIA 471 (536) Q Consensus 392 l~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a 471 (536) +...|..-||..++.+..-....| .+.+.+..+- ++......+..+..++.. .|+. .++. T Consensus 316 i~~tln~~li~~l~~~N~~~~~~p-----~~~~~~~e~e-------dl~~~a~~~~~l~~~~G~----~i~~----~~i~ 375 (488) T protein:vir:99 316 ICESFNLGPARWLTEWNFPGAQPP-----RVYRVIEEPE-------DITAKAERDEKVFRMSGF----RPTR----GYVQ 375 (488) T ss_pred HHHHHHHHHHHHHHHhCcCCcCCc-----eeEecCCCcc-------cHHHHHHHHHHHHhhcCC----CCCH----HHHH Confidence 333333334444444432111111 1222222221 222222223333332111 1221 2445 Q ss_pred HHcCCChhhcc-----C-----------CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--hhhcCcchHHhhhhcCCCC Q lcl|NC_011045. 472 NAIGIDTSGIL-----L-----------TEEQKQQKMAQQSMQMGMDNGAAALAQGMAA--QATASPEAMAAAADSVGLQ 533 (536) Q Consensus 472 ~~~Gv~p~~i~-----r-----------s~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~--~~~~~~~~~~~~~~~~~~q 533 (536) +.+|+++...- . ..+...++..+.. ...+.+.......... +.+.+.+..... -..+. 
T Consensus 376 e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~i~~~l~~a~s~ee~~~~--L~~l~ 451 (488) T protein:vir:99 376 ETYGVEVESTQAEATAPTPSTEFAEGDQPSDPAAAMAPQLA--EAMQPVVGNWTTQLRTLIEQASSLEDLRER--LLDLA 451 (488) T ss_pred HHcCCCCcccccccccCCCcccCCCCCCCCCchHHHHHHHH--HHHHHHHHHHHHHHHHHHHhcCCHHHHHHH--HHHHh Confidence 55565432110 0 0000000000000 0000000011111100 111122111110 11122 Q ss_pred CCC Q lcl|NC_011045. 534 PGI 536 (536) Q Consensus 534 ~~~ 536 (536) |.+ T Consensus 452 ~~~ 454 (488) T protein:vir:99 452 PQL 454 (488) T ss_pred ccC Confidence 222 No 225 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=45.71 E-value=0.76 Score=21.24 Aligned_cols=330 Identities=14% Similarity=0.053 Sum_probs=125.4 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHH-HHhcccccCCCCCcccccccccc---cchHHHHHHHHHHHHHHhhcCCCcc Q lcl|NC_011045. 8 LAEEGAKSVYERLKNDRAPYETRAQNCA-QYTIPSLFPKDSDNASTDYVTPW---QAVGARGLNNLASKLMLALFPMQTW 83 (536) Q Consensus 8 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~-~~~~P~~~~~~~~~~~~~~~~~~---dst~~~a~~~Laa~l~~~ltP~~~W 83 (536) |+++. ....++..-+ .-+.+.-.+.+........-..| +++....-..+...+ ..+-.+.| T Consensus 1 m~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fg~p~~~~~~~~~~~~~--~~~~~~~~ 65 (368) T protein:vir:79 1 MSRNK-------------TRRAARAASAHVRTANTDAPTEHHTDRAAQAEVFSFGDPVEVLDRRELLDYV--ECMRMGQW 65 (368) T ss_pred CCccc-------------cccchhccCcccccccccCcchhhccccCceEEEEcCCceeecchhhHHHHH--HHHhccch Confidence 21111 0000000000 00000000000000000000011 111111111111111 11111235 Q ss_pred eeccCChhhhhhhccChh-HHHHHHHHHHHHHHHHHHHHHhcc---ChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEE Q lcl|NC_011045. 84 MRLTISEYEAKQLLSDPD-GLAKVDEGLSMVERIIMNYIESNS---YRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMK 159 (536) Q Consensus 84 f~l~~~d~~~~~~~~~~~-~~~~v~~~L~~ve~~~~~~l~~sn---f~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~ 159 (536) ++-.++-..+.++..... ....+. 
..+-+...+.+-| -...+.+.+.|+.+||||.+++..+..+.++ . T Consensus 66 ~~~pi~~~~la~~~~~~~~h~~~~~-----~~~n~l~l~~~Pn~~~t~~~f~~l~~d~ll~Gnay~~~~r~~~G~~~--~ 138 (368) T protein:vir:79 66 YEPPMPWDGLARSFRAAAHHSSAVY-----VKRNILVSTFIPHPLLSRATFERLVLDWQVFGNAYLERRENVLGGTI--R 138 (368) T ss_pred hccCcCHHHHHHHHhhccccchhhh-----hhcchhhhhcCCCcCCCHHHHHHHHHHHhhcCCeEEEEEEcCCCCEE--E Confidence 443333222322211000 000000 0111222222222 1234667788999999999998776655554 4 Q ss_pred EEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccc Q lcl|NC_011045. 160 LYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQG 239 (536) Q Consensus 160 ~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~ 239 (536) .+|+...++.+..+++. |..+ +. +|... . T Consensus 139 L~~l~~~~v~~~~~~~~-------------------------------------~~~~--~~-----------~~~~~-~ 167 (368) T protein:vir:79 139 LDTPLAKYVRRGLDLNT-------------------------------------YFFV--QN-----------WQQPY-T 167 (368) T ss_pred EEEeCcccceeeccCCE-------------------------------------EEEE--ec-----------CCeEE-E Confidence 44544444433322210 0000 00 01000 0 Q ss_pred cccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceee-ccccccchhhhc------ Q lcl|NC_011045. 240 SDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLV-NPAGITQPRRLT------ 312 (536) Q Consensus 240 ~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv-~~~g~~~~~~~~------ 312 (536) +..--++.+|..-..+..||.+|..-++..+-.-+..++-....-.-.+.|..++ -++...+.++.. 
T Consensus 168 ------~~~~dIihir~~~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~~~~l~~e~~~~lk~~~ 241 (368) T protein:vir:79 168 ------FAAGSVFHLQEPDINQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSHAGFILYMTDAAQKQEDVDTLREAM 241 (368) T ss_pred ------EccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHHHHHHH Confidence 0111245556444456789999999888888777777777777777778888665 344444443221 Q ss_pred ---cCC--Cc-cee--c-CCcccccccccccc-cchhHHHHHHHHHHHHHHHHHhhhh-ccc--CCCC--CCCHHHHHHH Q lcl|NC_011045. 313 ---KAQ--TG-DFV--T-GRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS-AVQ--RTGE--RVTAEEIRYV 377 (536) Q Consensus 313 ---~~~--~g-~~~--~-g~~~~~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~-~~~--~~~~--r~TAtEi~~r 377 (536) .+. -| .++ + |..+++...++... .+.+ ..+..+..++.|-.+|-.-. ++. .++. .-++++.... T Consensus 242 ~~~~G~~N~g~~~vl~~~g~~~g~~~~pls~~~~d~q-f~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~e~~~~~ 320 (368) T protein:vir:79 242 KSAKGPGNFRNLFMYAPNGKKDGIQLLPVSEVAAKDE-FWNIKNVTRDDQLAAHRVPPQLMGIIPNNTGGFGDVEKAAMV 320 (368) T ss_pred HHhcCCcccCceeEecCCCCccceeEEEcCCCHHHHH-HHHHHHHhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHH Confidence 111 11 111 2 22455566666543 3444 34555666777888884321 111 1111 1234443322 Q ss_pred HHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEE--------echHHHHHHHH Q lcl|NC_011045. 378 ASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTI--------STGLEAIGRGQ 437 (536) Q Consensus 378 ~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~--------vs~La~a~r~~ 437 (536) +...-+.|++.++..+... | ..+.+++.. 
-+.-..++|++ T Consensus 321 --------------f~~~~l~Pl~~~ie~ln~~---l---~~e~~rF~~~~l~~~D~~a~a~~~~rsa 368 (368) T protein:vir:79 321 --------------FARNEVKPLQDRLLAINDW---I---GDEVVRFAPYALGGHDQPAAAPGGQRSA 368 (368) T ss_pred --------------HHHHHHHHHHHHHHHHHhc---c---CcceeeechhHhhcccccccCCcccccC Confidence 1222244444444322211 1 111222211 01111233444 No 226 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=45.42 E-value=0.77 Score=21.20 Aligned_cols=316 Identities=12% Similarity=0.086 Sum_probs=118.1 Q ss_pred cccccCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCcceecc-CChhhhhhhccChhHHHHHHHHHHHHHHHH Q lcl|NC_011045. 39 IPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLALFPMQTWMRLT-ISEYEAKQLLSDPDGLAKVDEGLSMVERII 117 (536) Q Consensus 39 ~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~Wf~l~-~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~ 117 (536) +--+.++... +. .....| -..++..++|..-. ++.....+ . ..| -.|-+.+ T Consensus 1 M~~~~~f~~r-~~-~~~~~~---------------~~~~~~~~~~~~~~~v~~~~al~---~----~av----~~cv~~i 52 (359) T protein:vir:10 1 MSILNPFERR-SS-ITPNNY---------------YPFMVQNGSIVPNSLVDATEALK---N----SDL----YAVTSLI 52 (359) T ss_pred Ccccchhhcc-cc-CCCCcc---------------hhhhhccccccCCcccCHHHhhc---c----hHH----HHHHHHH Confidence 1111111100 00 000000 00000011111000 00000000 0 001 0111122 Q ss_pred H---------------HHHHhccChH----HHHHHHHHHHhhCcEEEEEecCCCCceeeEEEEecceEEEeeCCCCCeEE Q lcl|NC_011045. 118 M---------------NYIESNSYRV----TLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQ 178 (536) Q Consensus 118 ~---------------~~l~~snf~~----~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~ 178 (536) . ..+.+=|-+. 
=....+.++..+|||.+++..+..+.+..+..+|.....+..+..+ T Consensus 53 a~~ia~~p~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~---- 128 (359) T protein:vir:10 53 SSDIAGTRFIGNQVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDLTDDT---- 128 (359) T ss_pred HHhhhcCccccchHHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEEEcCCe---- Confidence 1 1122223222 2456666777889999998877666555555555555555443221 Q ss_pred EEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccccccccccCceEEEeeee Q lcl|NC_011045. 179 MVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVR 258 (536) Q Consensus 179 i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~ 258 (536) ++ |+ +... +++... .+...+|+... +....... T Consensus 129 ~~---------------------------------y~-~~~~-~~~~~~---~~~~~evih~~---------~~~~~~~~ 161 (359) T protein:vir:10 129 LT---------------------------------YE-VNQF-DDYPSA---KYNASEMIHVK---------IMAYGVDT 161 (359) T ss_pred EE---------------------------------EE-EEec-CCceEE---EEcccceEEec---------cCCCCCCc Confidence 11 11 1100 011110 01111111110 00001111 Q ss_pred cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec-cccccchhhhc---c------C-CC-cceecCCccc Q lcl|NC_011045. 259 LDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQPRRLT---K------A-QT-GDFVTGRPED 326 (536) Q Consensus 259 ~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~-~~g~~~~~~~~---~------~-~~-g~~~~g~~~~ 326 (536) . +..||.||...+...+.......+.......-...|..++. |.+.++.+... . + .+ |. 
+..-.++ T Consensus 162 ~-dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~n~g~-~~vl~~g 239 (359) T protein:vir:10 162 L-HNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRKEFEKANGGNNSGR-VMVLDQS 239 (359) T ss_pred c-CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCC-ceecCCC Confidence 2 33689999999999999888888888888888888887765 34444443221 0 1 10 11 1111222 Q ss_pred ccccccccc-cchhHHHHHHHHHHHHHHHHHhhhh-cc-cCCCCCCCHHHHHHHHHHH-HHHhhhhHHHHHHHHHHHHHH Q lcl|NC_011045. 327 ISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS-AV-QRTGERVTAEEIRYVASEL-EDTLGGVYSILSQELQLPLVR 402 (536) Q Consensus 327 ~~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~-~~-~~~~~r~TAtEi~~r~~E~-~~~LG~v~~rl~~E~l~Pli~ 402 (536) ....++... .+.+ ..+..+.....|-++|-... ++ ..+...-|...+.+...+. ...|.|..++|+..|...+ T Consensus 240 ~~~~~l~~~~~d~q-~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~l~p~~~~l~~~l~~~~-- 316 (359) T protein:vir:10 240 ADFSTVSINADVAN-YLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNALNRFIEPLISELRIKCDSSI-- 316 (359) T ss_pred cceeeecCCHHHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh-- Confidence 333444332 2333 23455555677888885432 11 1223334655554433221 2223333333333322110 Q ss_pred HHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHHHH--HH--HHH---Hhhcchhh Q lcl|NC_011045. 403 VLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLER--CV--AAW---AALAPMRD 457 (536) Q Consensus 403 r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l~~--~~--~~~---~~~~p~~~ 457 (536) .+... .-+++.+ .....++.++.+ ++ +.+ -.+.|. 
+ T Consensus 317 -------------~~~~~-~~~~~d~----~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv-~ 359 (359) T protein:vir:10 317 -------------GVDMS-PITDYSN----SVFKADILNWVKEGIIEPTEAKTLLESKGI-I 359 (359) T ss_pred -------------cccch-hhhhcCH----HHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-C Confidence 01100 0111211 111223333222 11 111 123342 2 No 227 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=40.87 E-value=0.95 Score=20.70 Aligned_cols=362 Identities=14% Similarity=0.030 Sum_probs=137.8 Q ss_pred CCCccc--cccHHHHHHHHHHHHHHhhh------HHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHH Q lcl|NC_011045. 1 MAEKRT--GLAEEGAKSVYERLKNDRAP------YETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASK 72 (536) Q Consensus 1 Ma~~~~--~~~~~~~~~r~~~l~~~R~~------~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~ 72 (536) |---=. -+=-+..+..|+.|...|+. +...|-+.. ....+...-+...-+=.++--.|++.+|+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~lf~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~vs~~~al~~~~v~~cv~~Ia~~ 73 (424) T protein:vir:45 1 MLYCWWAHWLWPEGGRVLLDALFRSKSLENPSTPITGDAVDTD-------GLFRADVYVSPETAMKLAAVYSCIYVLSSS 73 (424) T ss_pred CeeEeeeceecCcchhHHHHhhccccCCCCCccccchhhhhhh-------ccccCCceechHHhhccHHHHHHHHHHHHH Confidence 110000 00001122233333322210 011110000 000000000000001123344566667666 Q ss_pred HHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----cChHHHHHHHHHHHhhCcEEEEE Q lcl|NC_011045. 73 LMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SN----SYRVTLFEALKQLVVAGNVLLYL 147 (536) Q Consensus 73 l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~ 147 (536) +-+. ||-=..-.+....+ ++ +.-++..|. 
+- +.+.-....+.++..+||+++++ T Consensus 74 iA~l-----p~~v~~~~~~~~~~----------~~------~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i 132 (424) T protein:vir:45 74 LAQM-----PLHVMRRHKGKVEP----------AR------DHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWV 132 (424) T ss_pred HhhC-----ceEEEEecCCceee----------cc------cchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEE Confidence 6432 33111111111100 10 112222332 22 33444566778899999999999 Q ss_pred ecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCcee Q lcl|NC_011045. 148 PEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYI 227 (536) Q Consensus 148 ~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~ 227 (536) ..+..+.++.+..++.+.+.+..+. |++ +| .+... + T Consensus 133 ~r~~~G~~~~L~~l~~~~v~i~~~~-~~~------------------------------------~y-~~~~~--~---- 168 (424) T protein:vir:45 133 KRNRRGEVISLDCCMPWETTLMNTG-GRY------------------------------------TY-GLYNE--Y---- 168 (424) T ss_pred EEcCCCcEEEEEEecCceEEEEEcC-CeE------------------------------------EE-EEEec--C---- Confidence 8766666554444444444433221 100 01 01100 0 Q ss_pred EEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccc Q lcl|NC_011045. 228 RYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQ 307 (536) Q Consensus 228 ~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~ 307 (536) |...+ ..--++++|.... +..||.||.+.+...+.......+.......-...|..++.-++.++ T Consensus 169 ------~~~~~--------~~~eVih~r~~~~-d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~ 233 (424) T protein:vir:45 169 ------GAFAI--------SPDDMIHIRALGN-NQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGLN 233 (424) T ss_pred ------ceEEE--------CcccEEEecCcCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCC Confidence 10000 0112455554433 34799999999998888888888888888888888887776555544 Q ss_pred hhhhc-------c---CC---CcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCCCCHHH Q lcl|NC_011045. 
308 PRRLT-------K---AQ---TGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEE 373 (536) Q Consensus 308 ~~~~~-------~---~~---~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtE 373 (536) .+... . +. .|.+.. -.++....++.. +.+.+. .+...-.+..|-++|-.-....-....-|-.- T Consensus 234 ~e~~~~~~~~~~~~~~g~~~n~g~~~v-l~~g~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn 311 (424) T protein:vir:45 234 KESWGWLKDQWQKASQALRRQENKTML-LPADLDYKALTVSPVDAQI-IDMMKLNRSMIAGIFNIPAHMINDLEKATFSN 311 (424) T ss_pred HHHHHHHHHHHHHHhccccccCCceeE-cCCCceEEEccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc Confidence 43221 1 10 111111 112223333332 234443 34445556778888843321111111112222 Q ss_pred HHHHHHH-HHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCC-CcceEEEE-echHH---HHHHHHHHHHHHH--- Q lcl|NC_011045. 374 IRYVASE-LEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELP-KEAVEPTI-STGLE---AIGRGQDLDKLER--- 444 (536) Q Consensus 374 i~~r~~E-~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~-~~~v~v~~-vs~La---~a~r~~~~~~l~~--- 444 (536) +.+.... ....|.|.+.++++||-.-|+ ++.. .....++| ++.|- ...|....+.+.+ T Consensus 312 ~eq~~~~f~~~tL~P~~~~ie~~ln~kLl-------------~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~ 378 (424) T protein:vir:45 312 ISAQAIQFVRYTMMPWVTNWEQELNRRLF-------------TRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGW 378 (424) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcC-------------ChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCC Confidence 2222222 233566666666666543332 2110 01122222 11110 1111111111111 Q ss_pred --------HHHH------------------HHhhcchhhhhcCCHHH Q lcl|NC_011045. 445 --------CVAA------------------WAALAPMRDDPDINLAM 465 (536) Q Consensus 445 --------~~~~------------------~~~~~p~~~~~~id~d~ 465 (536) .++. 
.....|...++ =+.++ T Consensus 379 ~T~NE~R~~~gl~pi~ggD~~~~~~n~~~~~~~~~~~~~~~-~~~~~ 424 (424) T protein:vir:45 379 MSRNEARAFEDMNPVEGLDEMLVSVNAANPAGDFKPPKNDE-GKTNE 424 (424) T ss_pred cCHHHHHHHhCCCCCCCcceeeecccccccccccCCCCCCC-CCCCC Confidence 0000 00001111110 11222 No 228 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=40.60 E-value=0.96 Score=20.67 Aligned_cols=396 Identities=10% Similarity=0.051 Sum_probs=158.6 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccccccc-ccccchHHHHHHHHHHHHHHhhcC Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYV-TPWQAVGARGLNNLASKLMLALFP 79 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~-~~~dst~~~a~~~Laa~l~~~ltP 79 (536) |..-+ ++.+ .+ +. ....+++....... ..|+--..-+-+-|+.++.. .| T Consensus 1 ~~~~~----~d~~------------------~~---~~---~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd--~~ 50 (427) T protein:vir:10 1 MKIVK----HDGY------------------ND---IF---NGGADGSPKPFFMSDASYHVGSFYNDNATAKRIVD--VI 50 (427) T ss_pred CCccc----cchH------------------HH---Hh---hcCCCCcccCccccCchHHHHHHHHcCchhhhhhc--cc Confidence 11100 0000 00 00 00000100000000 00110011111222222211 13 Q ss_pred C----CcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCce Q lcl|NC_011045. 80 M----QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) Q Consensus 80 ~----~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~ 155 (536) + +.|+++...+. ... +.+.+.+-++...+.++++.--+||.+++++.-++.. . 
T Consensus 51 aed~~r~g~~i~g~~~-----------~~~-----------~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~-~ 107 (427) T protein:vir:10 51 PEEMVTAGFKMSGVKD-----------EKE-----------FKSLWDSYKLDSSLVDLLCWARLYGGAAMVAIIKDNR-M 107 (427) T ss_pred hHHhhcCCccccCccH-----------HHH-----------HHHHHHHhhHHHHHHHHHHhccccceeEEEEEecCCC-c Confidence 3 57999864321 011 2223334578899999999999999999987655442 1 Q ss_pred eeEEEEecceEEEeeCCCCCeEEE--EEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEec Q lcl|NC_011045. 156 NPMKLYRLSSYVVQRDAFGNVLQM--VTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVE 233 (536) Q Consensus 156 ~~~~~~~l~~~~v~~d~~G~v~~i--~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~ 233 (536) +. .|+ +..|.+-.+ +-+..+++... -.+ + .+..+-.-+.|+ |.++.......+| T Consensus 108 l~---~p~-------~~~g~l~~l~v~d~~~~~~~~~----~~d--p---~s~~fg~P~~y~-v~~~~~~~~~~iH---- 163 (427) T protein:vir:10 108 LT---SQA-------KPGAKLEGVRVYDRFAITVEKR----VTN--A---RSPRYGEPEIYK-VSPGDNMQPYLIH---- 163 (427) T ss_pred cc---ccc-------CCCcceeEEEEechhccccccc----ccC--c---cccccCcceEEE-EecCCCCcceEEc---- Confidence 11 111 233433332 22333333211 000 0 000011112332 2121111111111 Q ss_pred CccccccccccccccCceEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHHhCCceeecc------ccc- Q lcl|NC_011045. 234 GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEE-YLGDLRSLENLQEAIVKMSMISSKVIGLVNP------AGI- 305 (536) Q Consensus 234 g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~-~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~------~g~- 305 (536) -.+++...| .|+ +-+.+.....||+|+... .++.++..........+.+.++.-..+.++. ++. 
T Consensus 164 ~SRli~~~g------~~~--p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~v~k~~~l~~~~~~~~~ 235 (427) T protein:vir:10 164 HSRVFIADG------ERV--AQQARKQNQGWGASVLNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDA 235 (427) T ss_pred cccEEEecC------CCc--hhhhcccCCcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHHhcCccc Confidence 112222222 222 234566777899999876 4577888888888887777776665554431 111 Q ss_pred ----cchhhh---ccCCCcce-ecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhh--hhcccCCCCCCCHHHHH Q lcl|NC_011045. 306 ----TQPRRL---TKAQTGDF-VTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML--NSAVQRTGERVTAEEIR 375 (536) Q Consensus 306 ----~~~~~~---~~~~~g~~-~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~--~~~~~~~~~r~TAtEi~ 375 (536) ...... ..+.+|.+ +-+..++...+. .+|.-+...+....+.|.-+.=. .-+...+.....+|. T Consensus 236 ~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~----~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstg-- 309 (427) T protein:vir:10 236 QYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLN----SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQ-- 309 (427) T ss_pred hHHHHHHHHHHHHhcCcccceeeecCCCceeEEe----cccCChHHHHHHHHHHHHhhhCCCeeeeccCCccccccch-- Confidence 011111 11122332 223334333332 23444556667777777665421 112223333444431 Q ss_pred HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHH---HHHHHHHHHHhh Q lcl|NC_011045. 376 YVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLD---KLERCVAAWAAL 452 (536) Q Consensus 376 ~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~---~l~~~~~~~~~~ 452 (536) ++-....---+..++...+.|++++++.++.+. ++++++|- ||-+....+.++ +..+..+.+.+. 
T Consensus 310 ---d~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~~s--------~~~~~~f~-pL~~~s~kEkaei~~~~a~a~~~~~~~ 377 (427) T protein:vir:10 310 ---NTALETFYKLVDRKREEDYRPLLEFLLPFIVDE--------EEWSIEFE-PLSVPSKKEESEITKNNVESVTKAITE 377 (427) T ss_pred ---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC--------CCcEEEeC-CCCCCCHHHHHHHHHHHHHHHHHHHhc Confidence 111222333344456677999999999998765 35677765 444333222222 222222222221 Q ss_pred cchhhhhcCCHHHHHHHHHHH---cCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchHHhhhhc Q lcl|NC_011045. 453 APMRDDPDINLAMIKLRIANA---IGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADS 529 (536) Q Consensus 453 ~p~~~~~~id~d~~~~~~a~~---~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 529 (536) ..++++++-+.+... .|+.+.. --+.++..+. ...+.+...-++. T Consensus 378 ------gvi~~~e~r~~L~~~~~~~~~~~~~-~~~~e~~~~~-------------------------~e~~p~~~e~~~d 425 (427) T protein:vir:10 378 ------QIIDLEEARDTLRSIAPEFKLKDGN-NINIREPEET-------------------------TEPEPGLGEKLED 425 (427) T ss_pred ------CCCCHHHHHHHHHhhhccccCCCCc-cccccccchh-------------------------cCCCCCCCCCCCC Confidence 136677766655433 2221110 0011111100 0000000110111 Q ss_pred CC Q lcl|NC_011045. 530 VG 531 (536) Q Consensus 530 ~~ 531 (536) .. T Consensus 426 ~~ 427 (427) T protein:vir:10 426 EN 427 (427) T ss_pred CC Confidence 11 No 229 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=34.71 E-value=1.3 Score=20.01 Aligned_cols=464 Identities=13% Similarity=0.068 Sum_probs=173.8 Q ss_pred CCCccc----cccHHHHHHHHHHHHHHhhhHHHHH--H----HHHHHhcccc----cC----CCCC-ccc-cccc-cccc Q lcl|NC_011045. 1 MAEKRT----GLAEEGAKSVYERLKNDRAPYETRA--Q----NCAQYTIPSL----FP----KDSD-NAS-TDYV-TPWQ 59 (536) Q Consensus 1 Ma~~~~----~~~~~~~~~r~~~l~~~R~~~e~~w--~----e~~~~~~P~~----~~----~~~~-~~~-~~~~-~~~d 59 (536) ||++.- .++--++ +.-++.+.+|+.+-..- + +-..+ .|.. .. ..+- ++. ++.. ..+. 
T Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~ 78 (532) T protein:vir:94 1 MADTDPTPRPEITYATL-QQAQRVDAKRATHTSLGLATAHEIDPTAY-SPYERNAAQNAMAMDYGLQTGRNGRNALSFVE 78 (532) T ss_pred CCCCCCCCCcceehhhh-hhHhhhhhhhhhhhhhhhhhhhhhccccc-ccccccccccccccccccCccccccccccccc Confidence 988332 2222233 22244555554432211 1 11111 2211 10 0010 101 1100 1111 Q ss_pred chHH---HHHHHHH-HHHHHhh--cCC----CcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHH Q lcl|NC_011045. 60 AVGA---RGLNNLA-SKLMLAL--FPM----QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVT 129 (536) Q Consensus 60 st~~---~a~~~La-a~l~~~l--tP~----~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~ 129 (536) ++.. ..+.-++ +.|...+ .|+ +.|+.+...+..- .......+++. .+.+-++... T Consensus 79 ~~~~~~~~l~a~Y~~~~l~r~~Vd~~aed~~r~~~~i~~~~~~~--------~~~~~~~~i~~-------~~~~l~v~~~ 143 (532) T protein:vir:94 79 ATSWPGFPTLALLAQLPEYRTMHETPADECVRAWGKITCSSKDE--------LAADKATRITQ-------KLEQYNVRTL 143 (532) T ss_pred ccccchHHHHHHHHcCchhhhhhccchHHHhhCCceEeeCCccc--------cchHHHHHHHH-------HHHhhhHHHH Confidence 2111 1111111 1122222 243 4688886543210 01122233332 2333467889 Q ss_pred HHHHHHHHHhhCcEEEEEecCCCCceeeEE-EEecceEEEeeCCCCCeEEE--EEeEeccHHHHHHHHhHHhhhccccCC Q lcl|NC_011045. 130 LFEALKQLVVAGNVLLYLPEPEGSNYNPMK-LYRLSSYVVQRDAFGNVLQM--VTRDQIAFGALPEDIRKAVEGQGGEKK 206 (536) Q Consensus 130 ~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~-~~~l~~~~v~~d~~G~v~~i--~r~~~~t~~~l~~~~~~~~~~~~~~~~ 206 (536) +.++++.--+||.+++++.-...+....+. ...+..-.| ..|.+..+ +-+..++... .....+. +. 
T Consensus 144 l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I---~~g~~~~l~vld~~~v~p~~-----~~~~dp~---sp 212 (532) T protein:vir:94 144 VRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFV---QRGCLIGFATIEPMWLSPNA-----YNATDPT---LP 212 (532) T ss_pred HHHHHHhhhcccceEEEEEeccCCcccccccccccccccc---ccceeeEEEeechheecccc-----ccccccc---cc Confidence 999999988999998876543221100000 000111111 22222222 1122222110 0000000 00 Q ss_pred CCceEEEEEEEEecCCCCceeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 207 ADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAI 286 (536) Q Consensus 207 ~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~ 286 (536) .+..-+.|..+ .+. .|.-.+++...| ...|. +.+....-||++..+.++..++......... T Consensus 213 ~fg~P~~y~v~----~g~------~iH~SRli~f~g----~~~p~----~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~ 274 (532) T protein:vir:94 213 SFYKPDSWIAT----SGK------KIHSSRIHTVVG----RPVGD----MLKAAYSFRGVSISQLAMPYVDNWLRTRQSV 274 (532) T ss_pred ccCCceeEEEc----cCe------eeccceEEEecC----CCchh----hhccccccccccHHHHHHHHHHHHHHHHHHH Confidence 01111111110 010 111112222222 12222 2223333469999999999999999999888 Q ss_pred HHHHHHHhCCceeeccccccc---------hhhhc---cCCCcceec-CCcccccccccccccchhHHHHHHHHHHHHHH Q lcl|NC_011045. 287 VKMSMISSKVIGLVNPAGITQ---------PRRLT---KAQTGDFVT-GRPEDISFLQLEKQADFTVAKAVSDAIEARLS 353 (536) Q Consensus 287 ~~~~~~a~~p~~lv~~~g~~~---------~~~~~---~~~~g~~~~-g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~ 353 (536) .+.+..+.-..+.++-...+. ..... .+..|.++- ...+++..+. .+|.-+...+....+.|. 
T Consensus 275 ~~l~~~~~~~v~k~~~a~~ls~~~~~~~~~r~~~~~~~~~n~g~~~id~~~e~~e~~~----~~lsgl~~~l~~~~~~iA 350 (532) T protein:vir:94 275 SDTVKQFSMTNLATDMAQLLAPGGAQSLDARLQLFNLYRDNRNIGALDKGTEEIQQTN----TPLSGLDSLQAQSQEQMA 350 (532) T ss_pred HHHHHhcCCceeeechHHhhcchhHHHHHHHHHHHHhhcCCccceEEcCCCceeEEEe----cccCCHHHHHHHHHHHHH Confidence 887777666665543221111 11111 111233322 2233333332 234445556666667776 Q ss_pred HHHhh--hhcccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHH Q lcl|NC_011045. 354 FAFML--NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLE 431 (536) Q Consensus 354 ~af~~--~~~~~~~~~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La 431 (536) -+.=. .-+...+....-+| .++-....---+..++...+.|++++++.++.+....- ++ .++.++|-. |- T Consensus 351 aa~~IP~t~LfG~sp~Glnst-----Ge~D~~~yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g~-~~-~d~~~~f~p-L~ 422 (532) T protein:vir:94 351 AVSHIPLVKLLGITPNGLNAS-----SDGEIRVWYDFIAGYQATNLTPLMEWIIDLIQLSEYGQ-ID-PGLAWEWSP-LM 422 (532) T ss_pred hHhCCCeeeeecCCccccccc-----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CC-CCceEEeCC-CC Confidence 55411 11111222222221 11112223334455666778999999999998753222 22 247777653 44 Q ss_pred HHH---HHHHHHHHHHHHHHHHhhcchhhhhcCCHHHHHHHHHHH--cCCChhhccCCHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011045. 432 AIG---RGQDLDKLERCVAAWAALAPMRDDPDINLAMIKLRIANA--IGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAA 506 (536) Q Consensus 432 ~a~---r~~~~~~l~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~--~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~ 506 (536) +.. ++.-..+..+..+.+.+. ..|+.+++-+++... .|.. ..+.+.++......+....+....... 
T Consensus 423 ~~s~kEkAei~~~~a~a~~~~~~~------Gvi~~~Evr~~l~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (532) T protein:vir:94 423 ELDDKELAEVRQLNASTDSTLMEL------GVIDAKMVQQRLAADPTSGYA--GALGERDELDDVEEIAKQLMAAALNPP 494 (532) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhc------CCCCHHHHHHHHhcCCccccc--cccccccccccccchhhhhcccccCCC Confidence 332 222222222233222221 136777777766542 1111 122222222111111100000000000 Q ss_pred HHHHHHHHhhhcCcchHHhhhhcCCCCCCC Q lcl|NC_011045. 507 ALAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) Q Consensus 507 ~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~ 536 (536) ..+. .+..|++ ....|...-||+- T Consensus 495 --~~~~---~~~~~~~-~~~~d~~~~~~~~ 518 (532) T protein:vir:94 495 --ATAP---QTPNPQP-DSEDDQTDNQPDA 518 (532) T ss_pred --CCCC---CCCCCCC-CCCCCCCCCccCC Confidence 0000 0000100 1112222223332 No 230 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=32.90 E-value=1.4 Score=19.79 Aligned_cols=397 Identities=11% Similarity=0.070 Sum_probs=169.8 Q ss_pred CCCccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCccccc-ccc--cccchHH-HHHHHHHHHHHHh Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTD-YVT--PWQAVGA-RGLNNLASKLMLA 76 (536) Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~-~~~--~~dst~~-~a~~~Laa~l~~~ 76 (536) |+++...+... +.|.+ . ++ .+ ++...... ... .|..... -+.+-|+.++.. T Consensus 5 m~~~~~~~~~~------D~~~~---~----------~~--~~---~g~~~~~~~~~~~~~~~~l~~~Y~~~~l~~~~Vd- 59 (435) T protein:vir:79 5 MSDKVKAITKE------DGYNE---I----------FG--SK---DGTFRPNAFYMQRAAFKALSQFYEEDGMARRIVD- 59 (435) T ss_pred cccccccchhh------cchhh---h----------hc--cc---ccccccCcccCCcCCHHHHHHHHhcCchhhhhhc- Confidence 88865333211 11111 0 11 00 01000000 000 1111000 011222222211 Q ss_pred hcCC----CcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCC Q lcl|NC_011045. 
77 LFPM----QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEG 152 (536) Q Consensus 77 ltP~----~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~ 152 (536) .|+ +.|+.+...+.. .. +.+.+.+-+....+.++++.--+||.|++++...+. T Consensus 60 -~~aed~~r~g~~i~g~~~~-----------~~-----------~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~ 116 (435) T protein:vir:79 60 -VIPEEMVTPGFKVDGVKNE-----------KS-----------FKSRWDELRLNAKIIDALSWSRLFGGSAILAVVADN 116 (435) T ss_pred -cchHHhhcCCceecCCChH-----------HH-----------HHHHHHHhhHHHHHHHHHHhhhccccEEEEEEecCC Confidence 243 478888543210 11 222333446788999999999999999888876544 Q ss_pred CceeeEEEEecceEEEeeCCCCCeEEE--EEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEE Q lcl|NC_011045. 153 SNYNPMKLYRLSSYVVQRDAFGNVLQM--VTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYE 230 (536) Q Consensus 153 ~~~~~~~~~~l~~~~v~~d~~G~v~~i--~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~ 230 (536) ... .-|+. ..|.+..+ +-+..+++.... .+. ....+-..+.|+ |.+........+|. T Consensus 117 ~~~----~~Pl~-------~~g~i~~i~v~d~~~i~~~~~~----~dp-----~sp~fg~P~~y~-v~~~~~~~~~~iH~ 175 (435) T protein:vir:79 117 KML----KSPVK-------PGAQLEDIRVYDRYQITIHERE----TNA-----RSVRYGEPKLYK-ISPGGDIPEFFVHY 175 (435) T ss_pred CCc----ccccc-------cCCceeeEEeechhhccchhhc----cCC-----cccccCcceEEE-EecCCCCCceEEcc Confidence 321 12432 23544432 333444432210 010 000011123332 22221111222221 Q ss_pred EecCccccccccccccccCceEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHHhCCceeecc------c Q lcl|NC_011045. 231 EVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYI-EEYLGDLRSLENLQEAIVKMSMISSKVIGLVNP------A 303 (536) Q Consensus 231 ~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~-~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~------~ 303 (536) .+++. |+..|+ +-+.+..++-||.+|. +..++.++..........+.+.++.-..+.++. 
+ T Consensus 176 ----SRli~------~~g~~~--p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~ 243 (435) T protein:vir:79 176 ----SRICI------IDGERV--SNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDD 243 (435) T ss_pred ----eeEEE------ecCCcc--hhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcC Confidence 11222 222232 2345566778899987 567899999999999988888777766665532 1 Q ss_pred cccc-----hhh---hccCCCcce-ecCCcccccccccccccchhHHHHHHHHHHHHHHHHHhh--hhcccCCCCCCCHH Q lcl|NC_011045. 304 GITQ-----PRR---LTKAQTGDF-VTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML--NSAVQRTGERVTAE 372 (536) Q Consensus 304 g~~~-----~~~---~~~~~~g~~-~~g~~~~~~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~--~~~~~~~~~r~TAt 372 (536) +... +.. ...+.+|.+ +-+..++...+. .+|.-+...+....+.|.-++=. .-+...+.....+| T Consensus 244 ~~~~~~~~~r~~~~~~~~~~~~~~~i~~~~e~~e~~~----~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnst 319 (435) T protein:vir:79 244 EEGRYAARLRLAQVDDESGVGKAIGIDATDEEYEVLN----SDVSGVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSAS 319 (435) T ss_pred ccchHHHHHHHHHHHHhcCCCCceeEecCCcceEEEe----cccCCHHHHHHHHHHHHHhhhCCCeeeeccCCccccccc Confidence 1111 100 012223333 223333333332 24555566677777777766521 11222333444443 Q ss_pred HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHH---HHHHHHHHHHHHHHHHH Q lcl|NC_011045. 373 EIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEA---IGRGQDLDKLERCVAAW 449 (536) Q Consensus 373 Ei~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~---a~r~~~~~~l~~~~~~~ 449 (536) . ++-....---+..++...+.|+++|++.++.+. 
.++.++|- ||-+ ..|+.-..+..+..+.+ T Consensus 320 g-----d~d~~~yyd~i~~~Qe~~l~p~l~~l~~li~~s--------~d~~~~f~-pL~~~sekEkAei~~~~a~a~~~~ 385 (435) T protein:vir:79 320 Q-----NTALETFYKLIDRKRVEDYKPILEFLLPFMISE--------TEWSIEFE-PLSVPSDKDKAEIMAKNVESVVKL 385 (435) T ss_pred h-----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC--------CCCeEEeC-CCCCCCHHHHHHHHHHHHHHHHHH Confidence 1 111222333344456677899999999998765 25666653 3333 33332222222333322 Q ss_pred HhhcchhhhhcCCHHHHHHHHHH---HcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcchH Q lcl|NC_011045. 450 AALAPMRDDPDINLAMIKLRIAN---AIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAM 523 (536) Q Consensus 450 ~~~~p~~~~~~id~d~~~~~~a~---~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~~ 523 (536) .+. ..|+.+++-+.+.. ..|+.+.....- ++...... .+...+++.- T Consensus 386 ~~~------g~i~~~e~r~~L~~~~~~~~~~~~~~~~~-~~~~d~~~--------------------~~~~e~g~~~ 435 (435) T protein:vir:79 386 KAE------QAINLKETRDTLRSICPDLKIMDNDNIEL-PEPEDLDP--------------------EPGQEGGLNK 435 (435) T ss_pred Hhc------CCCCHHHHHHHHHHhccccCCCCcccccC-CccccCCC--------------------CCCCCCCCCC Confidence 221 13677777776643 345422111110 11000000 0000000000 No 231 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=30.36 E-value=1.6 Score=19.49 Aligned_cols=362 Identities=10% Similarity=0.023 Sum_probs=130.0 Q ss_pred HHHHHHHHH-hhhHHHHHHHHHHHhcccc--cCCCC---Ccccc-cccccccchHHH-HHHHHHHHHHHhhcCCCcceec Q lcl|NC_011045. 15 SVYERLKND-RAPYETRAQNCAQYTIPSL--FPKDS---DNAST-DYVTPWQAVGAR-GLNNLASKLMLALFPMQTWMRL 86 (536) Q Consensus 15 ~r~~~l~~~-R~~~e~~w~e~~~~~~P~~--~~~~~---~~~~~-~~~~~~dst~~~-a~~~Laa~l~~~ltP~~~Wf~l 86 (536) .+=+.+... .+.+.+.|..- |.. ..+.. ..... .........+.. |++.+|+.+ +.+ | |--. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~v~~~~~~~~~~V~~ci~~Ia~~i-a~l-p---~~~~ 70 (409) T protein:vir:93 1 MAKENIVTRIKKKLIDNWIDQ-----STSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSM-ASL-P---LKMY 70 (409) T ss_pred CCccchhhhhhhhhhhhhhcc-----ccccccccccccCccccccchhhhhccHHHHHHHHHHHHhh-hhC-c---eeEe Confidence 111222222 23344444321 111 10110 00000 011112222223 334444444 332 4 2111 Q ss_pred cCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHH-hc----cChHHHHHHHHHHHhhCcEEEEEecCCCCceeeEEEE Q lcl|NC_011045. 87 TISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SN----SYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLY 161 (536) Q Consensus 87 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~~~~~ 161 (536) .-. .. .+ ..+...|. +- +.+.-+..++.++..+||+.+|+..+..+.+..+..+ T Consensus 71 ~~~-~~-------------~~-------~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l 129 (409) T protein:vir:93 71 EDY-KV-------------VN-------TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLL 129 (409) T ss_pred ecc-cc-------------cc-------chHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEE Confidence 111 00 00 11122222 12 3444567778888899999999877666666666666 Q ss_pred ecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCccccccc Q lcl|NC_011045. 162 RLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGMEVQGSD 241 (536) Q Consensus 162 ~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~~i~~~~ 241 (536) |.+.+-+..+.+|. .++.++. ...|..+. T Consensus 130 ~~~~v~~~~~~~~~--~~~y~~~----------------------------------------------~~~g~~~~--- 158 (409) T protein:vir:93 130 NPDVVEMLIENQSR--ELYYSIH----------------------------------------------AATGNKLI--- 158 (409) T ss_pred cCceeEEEEeCCCc--EEEEEEE----------------------------------------------cCCceEEE--- Confidence 66666555554332 1111110 00011000 Q ss_pred cccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccchhhhc--------- Q lcl|NC_011045. 
242 GTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLT--------- 312 (536) Q Consensus 242 ~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv~~~g~~~~~~~~--------- 312 (536) ++ ..=++++|-....+..||.||..-+...+...+.+.+..+.... ..+-+++..++.++.+... T Consensus 159 --~~--~~eVih~r~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~--~~~~~i~~~~~~l~~e~~~~~~~~~~~~ 232 (409) T protein:vir:93 159 --VH--NMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQ--KPDSFMLKYGSNVGKEKRQQVLEDFKQY 232 (409) T ss_pred --Ec--cccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHhcC--CCCceEEecCCCCCHHHHHHHHHHHHHH Confidence 00 00123333222345689999998877777776666555433222 2223444444444443321 Q ss_pred cCCCcceecCCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh-ccc--CCCCCCCHHHHHHHHHHHHHHhhhh Q lcl|NC_011045. 313 KAQTGDFVTGRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS-AVQ--RTGERVTAEEIRYVASELEDTLGGV 388 (536) Q Consensus 313 ~~~~g~~~~g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~-~~~--~~~~r~TAtEi~~r~~E~~~~LG~v 388 (536) -...|.+. --.++....++.. +.+.+. .+..+..+..|-++|-... +.. .++..-+++|.. . T Consensus 233 ~~~~g~~~-vl~~g~~~~~l~~~~~d~q~-~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~---~--------- 298 (409) T protein:vir:93 233 YEENGGIL-FQEPGVEIEPLPKKYVSEDI-VASENLTRERVANVFQLPSVFLNARSNTNFAKNEELN---R--------- 298 (409) T ss_pred hhcCCCee-ecCCCceEEEcCCChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH---H--------- Confidence 01122221 1122233344432 334443 3344445667878885432 111 111122333322 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCC-cceEEEE-echHHH---HHHHHHHHHHHHH----HHHH-H--hhcch- Q lcl|NC_011045. 389 YSILSQELQLPLVRVLLKQLQATQQIPELPK-EAVEPTI-STGLEA---IGRGQDLDKLERC----VAAW-A--ALAPM- 455 (536) Q Consensus 389 ~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~-~~v~v~~-vs~La~---a~r~~~~~~l~~~----~~~~-~--~~~p~- 455 (536) .+...-+.|++.++-..+.+ .++++... ....++| ++.|-. ..|+.....+.+. .+.+ . .+.|. 
T Consensus 299 --~f~~~~l~P~~~~ie~~l~~-~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ 375 (409)
T protein:vir:93 299 --FYLQHTLLPIVKQYEEEFNR-KLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVE 375 (409)
T ss_pred --HHHHHHHHHHHHHHHHHHHh-hcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC
Confidence 12222344444443333322 23333211 1233443 233321 2222222222211 0111 1 11121

Q ss_pred hhh------h--cCCHHHHHHHHHHHcCCChhhccCC
Q lcl|NC_011045. 456 RDD------P--DINLAMIKLRIANAIGIDTSGILLT 484 (536)
Q Consensus 456 ~~~------~--~id~d~~~~~~a~~~Gv~p~~i~rs 484 (536)
-.| . .+|..........+=+-+ -=.+
T Consensus 376 ggD~~~~~~n~~~~~~~~~~~~~~~gG~~n---~~e~ 409 (409)
T protein:vir:93 376 GGDKPLISGDLYPIDTPLELRKSLKGGDKN---VNES 409 (409)
T ss_pred CcCeeeecccccccccchhhcccccCCCCC---cCCC
Confidence 000 0 122222111111211110 1111

No 232
>protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039
Probab=26.73 E-value=1.9 Score=19.04 Aligned_cols=444 Identities=11% Similarity=0.028 Sum_probs=164.5

Q ss_pred CCCc---cccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHHHHHHHHHHHHHHhh
Q lcl|NC_011045. 1 MAEK---RTGLAEEGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGARGLNNLASKLMLAL 77 (536)
Q Consensus 1 Ma~~---~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536)
||++ ..+++...+..-..... +...-..++ -..|-+-....-.--++..
-|++-.-++++....+++
T Consensus 1 ~~~~~~~~~gl~p~rl~~i~~~~~--~~~~~~~~~----~~~~~Lr~~~~~~ly~~m~--~D~hi~s~l~~Rk~av~~-- 70 (488)
T protein:vir:95 1 MADITETQESLPPFRMGEVGSLGL--KVKNGRIYE----EPRQALRFPESIKTFQLMM--RDPAVAASVNIIKMFVRK-- 70 (488)
T ss_pred CCCccccCCCCCHHHHHHHHHHhh--ccccchhhc----cchhhhcccchHHHHHHHh--hChHHHHHHHHHHHHHhc--
Confidence 9994 45666655544332211 111111111 1111110000000001112 256666666666655543

Q ss_pred cCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEEEEecCCCCceee
Q lcl|NC_011045. 78 FPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNP 157 (536)
Q Consensus 78 tP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~~~~~~~~~~~ 157 (536)
.+| ++.+.+..-++ ..+.+....++.+|... ...|...+.+++ |.+.+|-++.=+......+...
T Consensus 71 ---~~w-~v~p~~~~~~d-~~~~~~a~~v~~~l~~~---------~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~ 135 (488)
T protein:vir:95 71 ---VNW-RFVPPKGKEQD-PKMLERADFFNSLMDDM---------EHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKG 135 (488)
T ss_pred ---CCc-eEecCCCCchh-HHHHHHHHHHHHHHhcc---------CccHHHHHHHHH-Hhhcccceeeeeeeeccccccc
Confidence 255 44443211100 00001112233333221 123555555554 6777887765332211110000

Q ss_pred EEEEecceEEEeeCCCCC--eEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCceeEEEEecCc
Q lcl|NC_011045. 158 MKLYRLSSYVVQRDAFGN--VLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYIRYEEVEGM 235 (536)
Q Consensus 158 ~~~~~l~~~~v~~d~~G~--v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~~~~~~~v~g~ 235 (536)
.+ ...-.+|+ +..|..|-..+ ...|. .+.+....+ ..+++. .......+.
T Consensus 136 -------~~-~~~~~dg~~~~~~i~~Rpq~~----~~~f~---------~d~d~~l~~-----~~~~~~--~~~~~~~~~ 187 (488)
T protein:vir:95 136 -------KY-QSKFDDGLIGWAKLPIRNQST----LDKWY---------FDEDFRRVT-----GVRQNL--RNVSHIAGA 187 (488)
T ss_pred -------cc-cccccCCeeeeeeeeecCccc----cccee---------eccCCCcee-----eccccc--ccccccccc
Confidence 00 00001111 01111000000 00000 000000000 000000 000111111

Q ss_pred cccccccccccccCc---eEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeccc---cccc
Q lcl|NC_011045. 236 EVQGSDGTYPKEACP---YIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVI--GLVNPA---GITQ 307 (536)
Q Consensus 236 ~i~~~~~~~~~~~~P---~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~--~lv~~~---g~~~ 307 (536)
.-...... ..+.| |++.|....+|+.||.|+...+..-..-=+...+.-+..+++-.-|. ...++. +-..
T Consensus 188 ~~~~~~~~--~~~lP~~kfi~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~ 265 (488)
T protein:vir:95 188 INLGERPL--TRKLPRAKFMLFKYDDEYGNPEGRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAE 265 (488)
T ss_pred cccccccc--cccccccceEEEeecCCCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCCccc
Confidence 10011000 12344 78999999999999999999988888766777788888888643332 333221 1111

Q ss_pred hh--hhc----c--------CCCcceecCCcc-cc-----cccccccc-cchhHHHHHHHHHHHHHHHHHhhhhcccCCC
Q lcl|NC_011045. 308 PR--RLT----K--------AQTGDFVTGRPE-DI-----SFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNSAVQRTG 366 (536)
Q Consensus 308 ~~--~~~----~--------~~~g~~~~g~~~-~~-----~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~ 366 (536)
.+ .+. . ..-|.++|.... +. .+.-++..
+....-...|+-+.+.|+++.+...+...++
T Consensus 266 ~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~~~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~iLGqtLT~~~~ 345 (488)
T protein:vir:95 266 PEKKAFVQYCKTVVNDMIANDRAGLIWPRYIDPDTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAFMSDVLAMGQS 345 (488)
T ss_pred HHHHHHHHHHHHHHHHhhccchhheeeccccccccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHHhccccccccC
Confidence 10 011 0 012334443221 11 11112222 2233345678889999999998764443222

Q ss_pred --CCCCHHHHHHHH-HHHHHHhhhhH-HHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcceEEEEechHHHHHHHHHHHHH
Q lcl|NC_011045. 367 --ERVTAEEIRYVA-SELEDTLGGVY-SILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKL 442 (536)
Q Consensus 367 --~r~TAtEi~~r~-~E~~~~LG~v~-~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~v~v~~vs~La~a~r~~~~~~l 442 (536)
..-...||.... .++.....-.+ ..|+..++.||+. +- .|.-.+.| +++|... ...++..+
T Consensus 346 ~~Gs~Al~~vh~ev~~~i~~aDa~~i~~tln~~li~~l~~----~N--fg~~~~~P----~~~~~~~-----e~~Dl~~~ 410 (488)
T protein:vir:95 346 KYGSFSLADSKTSLLAMSVDILLKQIKNVINRDLVAQTYA----LN--MWDDEEHV----QITYDDI-----ETPDLEAI 410 (488)
T ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hc--CCCCCCcc----EEEecCc-----ChhhHHHH
Confidence 122233333221 11221111111 2333344444433 22 12111122 3444322 13344455

Q ss_pred HHHHHHHHhhcchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcch
Q lcl|NC_011045. 443 ERCVAAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEA 522 (536)
Q Consensus 443 ~~~~~~~~~~~p~~~~~~id~d~~~~~~a~~~Gv~p~~i~rs~~ev~~~~~q~~~q~~~~~~a~~~~~~~~~~~~~~~~~ 522 (536)
...++.+..++- .+..+.+.+++.+.+|+++. ...+++..... .++...++. ......
T Consensus 411 ae~~~~L~~~G~-----~i~~~~~~~~i~e~~gip~~---~~~e~~~~~~~---------~~~~~~~~~-----~~~~~~ 468 (488)
T protein:vir:95 411 GSYIQKTVAVGA-----LEVDKELSNKLREHIGLPPA---DESQPVSEKLS---------PNSQSRSGD-----GYKTAG 468 (488)
T ss_pred HHHHHHHHhCCC-----ccccHHHHHHHHHHhCCCCC---CCCccccccCC---------CCCCCCCCc-----ccCCCc
Confidence 556666665552 13445677889999999542 12222211100 000000000 000000

Q ss_pred HHhhhhcCCCCCCC
Q lcl|NC_011045.
523 MAAAADSVGLQPGI 536 (536)
Q Consensus 523 ~~~~~~~~~~q~~~ 536 (536)
.+.+.......|++
T Consensus 469 ~~~~~~~~~~~~~~ 482 (488)
T protein:vir:95 469 GEGTAKTPSAKDPST 482 (488)
T ss_pred ccCCcccccccchh
Confidence 01111111223333

No 233
>protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238
Probab=26.55 E-value=1.9 Score=19.01 Aligned_cols=303 Identities=13% Similarity=0.079 Sum_probs=113.0

Q ss_pred CCCccccccHHHHHHHHHHHHH----HhhhHHHHHHHHHHHhcccccCCCCCcccccccccccchHH-----------HH
Q lcl|NC_011045. 1 MAEKRTGLAEEGAKSVYERLKN----DRAPYETRAQNCAQYTIPSLFPKDSDNASTDYVTPWQAVGA-----------RG 65 (536)
Q Consensus 1 Ma~~~~~~~~~~~~~r~~~l~~----~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~~~dst~~-----------~a 65 (536)
|.++..........+.=+.... +-.+..+ ..++.+|+ -|-. + ..-.+.+++-.|- .+
T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~-~~~~~~~~---~~~~---~-~~~~~pp~~~~~la~l~~a~~~h~s~ 72 (340)
T protein:vir:98 1 MSKRKPRKAVAMTASAPQKMEAFTFGEPVPVLD-KRDILDYV---ECIS---N-GKWYEPPVSFSGLAKSLRSAVHHSSP 72 (340)
T ss_pred CCCCCCCccccccccCccceeEEEcCCceeecC-cchhhhhh---hhhh---c-CceecCCCCHHHHHHHHHhccccchh
Confidence 8875543321110000000000 0000000 00111111 0000 0 0011112211111 01

Q ss_pred HHHHHHHHHHhhcCCCcceeccCChhhhhhhccChhHHHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHhhCcEEE
Q lcl|NC_011045. 66 LNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLL 145 (536)
Q Consensus 66 ~~~Laa~l~~~ltP~~~Wf~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l 145 (536)
+..=++ +....+-++||..
+..+.++..|+.+||||.+
T Consensus 73 i~~k~n-~l~~~~~Pn~~lt-----------------------------------------~~~f~~~~~d~ll~Gnay~ 110 (340)
T protein:vir:98 73 IYVKRN-VLASTYIPHPLLS-----------------------------------------RQDFSRFALDYLVFGNAFL 110 (340)
T ss_pred hhhhhh-HHhhccCCCCCCC-----------------------------------------HHHHHHHHHHHHhcCCeEE
Confidence 111111 1122221123321 1123456678889999999

Q ss_pred EEecCCCCceeeEEEEecceEEEeeCCCCCeEEEEEeEeccHHHHHHHHhHHhhhccccCCCCceEEEEEEEEecCCCCc
Q lcl|NC_011045. 146 YLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGE 225 (536)
Q Consensus 146 ~~~~~~~~~~~~~~~~~l~~~~v~~d~~G~v~~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~~~v~~~v~p~~~~~~ 225 (536)
++..+..+.++. .+|+...++.+..+|+. |..+
T Consensus 111 ~~~rn~~G~~~~--L~pl~~~~vr~~~~~~~-------------------------------------~~~~-------- 143 (340)
T protein:vir:98 111 EQRHSVTGQLIK--LLTSPAKYTRRGVDDSV-------------------------------------FWFV-------- 143 (340)
T ss_pred EEEECCCCcEEE--EEEeCCceEEEcccCcE-------------------------------------EEEE--------
Confidence 887766655544 44444444443332221 0000

Q ss_pred eeEEEEecCccccccccccccccCceEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceee-cccc
Q lcl|NC_011045. 226 YIRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLV-NPAG 304 (536)
Q Consensus 226 ~~~~~~v~g~~i~~~~~~~~~~~~P~~~~rw~~~~ge~YGrgp~~~~l~d~~~L~~l~~~~~~~~~~a~~p~~lv-~~~g 304 (536)
..+|..+ .|..-=.+.+|.....+.+||.+|..-++-.+-.-+...+-....-.-...|-.++ -++.
T Consensus 144 -----~~~~~~~-------~~~~~eViHir~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~pg~il~~~~~ 211 (340)
T protein:vir:98 144 -----ENFTQPH-------EFAPDTVFHLLEPDINQEIYGLPEYLSALNSAWLNESATLFRRKYYQNGAHAGYIMYVTDP 211 (340)
T ss_pred -----ecCCeEE-------EEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCC
Confidence 0011100 01111145555333345699999998888777665555555555555556676543 2343

Q ss_pred ccchhhhc-------c-CCCcc----ee--c-CCccccccccccc-ccchhHHHHHHHHHHHHHHHHHhhhh----cccC
Q lcl|NC_011045.
305 ITQPRRLT-------K-AQTGD----FV--T-GRPEDISFLQLEK-QADFTVAKAVSDAIEARLSFAFMLNS----AVQR 364 (536)
Q Consensus 305 ~~~~~~~~-------~-~~~g~----~~--~-g~~~~~~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~----~~~~ 364 (536)
..+.++.. . .|+|. ++ + |..+++...++.. +.+.+ ..+..+..++.|-.+|-.-. ....
T Consensus 212 ~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~g~~~~pls~~~~d~q-f~e~k~~~~~eIa~a~~VPp~llGi~~~ 290 (340)
T protein:vir:98 212 AQSATDVESLRDAMRNSKGLGNFKNLFFYSPNGKPDGIKIVPLSEVATKDD-FFNIKKASAADLMDAHRVPFQLMGGKPE 290 (340)
T ss_pred CCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHH-HHHHHHhhHHHHHHHhCCCHHHhcccCC
Confidence 34433221 1 11121 12 2 2235566666654 34555 34455555777888884321 1111

Q ss_pred CC-CCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcc
Q lcl|NC_011045. 365 TG-ERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEA 421 (536)
Q Consensus 365 ~~-~r~TAtEi~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~~~g~lp~~~~~~ 421 (536)
+. ..-++++.... =....|.|...++. |+..-|... ..+.... .+...+
T Consensus 291 ~t~~~sn~e~~~~~--f~~~~l~Pl~~~ie-e~n~~L~~e----~~rF~~~-~l~~~d 340 (340)
T protein:vir:98 291 NIGSLGDVEKVAKV--FVRNELSPLQDRFR-EVNDWLGME----VIRFKEY-TLDNPE 340 (340)
T ss_pred CCCccccHHHHHHH--HHHHHHHHHHHHHH-HHHhccccc----ccccCcc-ccccCC
Confidence 11 11234433322 12223445555443 221111000 0010000 011111

Done!