Query lcl|NC_020414.1_cdsid_YP_007501012.1 [gene=I132_gp37] [protein=head portal protein] [protein_id=YP_007501012.1] [location=20311..21858] Match_columns 515 No_of_seqs 114 out of 157 Neff 7.6 Searched_HMMs 1612 Date Thu Nov 7 16:57:42 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_37 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_37_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:7017 Length: 515 # 100.0 4E-186 2E-189 1037.1 57.7 515 1-515 1-515 (515) 2 protein:vir:105641 Length: 516 100.0 9E-181 6E-184 1007.7 56.8 515 1-515 2-516 (516) 3 protein:vir:96988 Length: 516 100.0 1E-178 8E-182 995.7 54.5 515 1-515 1-516 (516) 4 protein:vir:103330 Length: 517 100.0 3E-175 2E-178 977.2 57.5 512 4-515 1-515 (517) 5 protein:vir:99672 Length: 532 100.0 2E-165 1E-168 923.6 54.9 509 1-515 1-528 (532) 6 protein:vir:2198 Length: 536 # 100.0 8E-165 5E-168 920.1 55.7 507 1-515 1-531 (536) 7 protein:vir:10447 Length: 536 100.0 1E-164 8E-168 919.2 55.9 507 1-515 1-531 (536) 8 protein:vir:78942 Length: 510 100.0 4E-164 2E-167 916.7 55.8 495 11-511 1-510 (510) 9 protein:vir:8883 Length: 543 # 100.0 3E-164 2E-167 917.5 54.3 508 1-515 1-527 (543) 10 protein:vir:94572 Length: 535 100.0 2E-163 1E-166 912.4 55.6 506 1-515 1-531 (535) 11 protein:vir:6322 Length: 510 # 100.0 7E-163 4E-166 909.5 55.4 494 11-511 1-510 (510) 12 protein:vir:100039 Length: 522 100.0 4E-163 3E-166 910.7 53.5 496 13-515 1-512 (522) 13 protein:vir:3361 Length: 535 # 100.0 1E-162 9E-166 907.9 55.3 508 1-515 1-525 (535) 14 protein:vir:78696 Length: 542 100.0 3E-162 2E-165 906.1 55.3 499 11-515 1-524 (542) 15 protein:vir:1538 Length: 535 # 100.0 4E-162 3E-165 905.3 55.9 507 1-515 1-522 (535) 16 protein:vir:80211 Length: 514 100.0 1E-161 9E-165 902.3 53.7 498 11-514 1-514 (514) 17 protein:vir:94709 Length: 522 100.0 4E-160 2E-163 894.7 54.3 503 1-515 1-522 (522) 18 protein:vir:103765 Length: 549 100.0 5E-160 3E-163 893.7 54.5 501 1-515 1-540 (549) 19 protein:vir:98506 Length: 555 100.0 3E-159 2E-162 889.5 55.7 499 6-515 1-536 (555) 20 protein:vir:107822 Length: 555 100.0 3E-159 2E-162 889.5 55.7 499 6-515 1-536 (555) 21 protein:vir:107404 Length: 555 100.0 3E-159 2E-162 889.5 55.7 499 6-515 1-536 (555) 22 protein:vir:1785 Length: 555 # 100.0 4E-159 2E-162 889.1 55.7 498 11-515 1-529 (555) 23 protein:vir:102668 Length: 547 100.0 3E-157 2E-160 878.6 54.6 493 10-515 1-546 (547) 24 protein:vir:7321 Length: 556 # 100.0 3E-156 2E-159 873.6 55.0 499 1-515 1-550 (556) 25 protein:vir:95315 Length: 559 100.0 7E-155 4E-158 865.7 54.9 499 1-515 1-546 (559) 26 protein:vir:94599 Length: 641 100.0 1.1E-87 6.6E-91 497.4 44.0 498 1-515 15-628 (641) 27 protein:vir:80165 Length: 651 100.0 1.4E-67 8.6E-71 387.1 44.8 500 1-515 3-633 (651) 28 protein:vir:95449 Length: 584 100.0 4.2E-36 2.6E-39 214.6 36.7 480 1-508 1-584 (584) 29 protein:vir:3139 Length: 599 # 100.0 1.6E-34 1E-37 205.8 34.5 489 1-513 11-599 (599) 30 protein:vir:8846 Length: 705 # 100.0 7E-28 4.4E-31 169.4 44.2 493 1-515 1-635 (705) 31 protein:vir:95821 Length: 763 99.9 5.2E-22 3.2E-25 137.3 43.5 494 1-515 2-672 (763) 32 protein:vir:93630 Length: 776 99.7 1.7E-14 1E-17 96.1 36.9 490 1-515 22-684 (776) 33 protein:vir:108295 Length: 711 99.5 3E-12 1.9E-15 83.7 43.0 502 1-515 1-670 (711) 34 protein:vir:105429 Length: 708 99.4 2.6E-11 1.6E-14 78.6 31.8 489 1-515 1-651 (708) 35 protein:vir:9263 Length: 725 # 99.3 1.9E-10 1.2E-13 73.8 33.2 487 1-515 1-643 (725) 36 protein:vir:100920 Length: 725 99.2 3.4E-10 2.1E-13 72.5 34.2 488 1-515 1-643 (725) 37 protein:vir:77597 Length: 725 99.2 7.6E-10 4.7E-13 70.6 35.3 487 1-515 1-657 (725) 38 protein:vir:172 Length: 708 # 99.1 1.6E-09 1E-12 68.8 33.7 484 1-515 1-669 (708) 39 protein:vir:2764 Length: 714 # 99.1 2.3E-09 1.4E-12 67.9 37.9 488 1-515 8-668 (714) 40 protein:vir:817 Length: 714 # 99.1 2.3E-09 1.4E-12 67.9 37.9 488 1-515 8-668 (714) 41 protein:vir:10117 Length: 714 99.1 2.3E-09 1.4E-12 67.9 37.9 488 1-515 8-668 (714) 42 protein:vir:3296 Length: 714 # 99.1 2.3E-09 1.4E-12 67.9 37.9 488 1-515 8-668 (714) 43 protein:vir:9950 Length: 714 # 99.1 2.3E-09 1.4E-12 67.9 37.9 488 1-515 8-668 (714) 44 protein:vir:3520 Length: 720 # 99.0 6.5E-09 4E-12 65.5 28.4 478 1-515 1-642 (720) 45 protein:vir:104437 Length: 714 98.9 1.2E-08 7.5E-12 64.0 36.2 489 1-515 1-686 (714) 46 protein:vir:105520 Length: 706 98.8 3.1E-08 1.9E-11 61.8 35.9 487 1-515 1-658 (706) 47 protein:vir:2341 Length: 488 # 98.8 3.5E-08 2.2E-11 61.4 36.7 432 1-515 1-478 (488) 48 protein:vir:97171 Length: 512 98.8 4.9E-08 3.1E-11 60.6 36.6 429 1-515 31-502 (512) 49 protein:vir:105619 Length: 772 98.8 5.7E-08 3.5E-11 60.3 36.1 476 1-515 1-669 (772) 50 protein:vir:7768 Length: 484 # 98.7 9.1E-08 5.6E-11 59.2 33.4 431 1-515 1-474 (484) 51 protein:vir:104082 Length: 485 98.6 1.6E-07 9.9E-11 57.8 34.4 417 1-515 8-473 (485) 52 protein:vir:2500 Length: 501 # 98.6 1.7E-07 1.1E-10 57.7 32.1 432 1-515 16-493 (501) 53 protein:vir:7430 Length: 563 # 98.4 6.2E-07 3.8E-10 54.6 25.6 462 1-515 1-540 (563) 54 protein:vir:4223 Length: 486 # 98.4 7.7E-07 4.8E-10 54.1 34.2 422 1-515 8-483 (486) 55 protein:vir:99916 Length: 504 98.3 1.3E-06 8.3E-10 52.8 34.3 425 1-515 9-504 (504) 56 protein:vir:3964 Length: 453 # 98.3 1.3E-06 8.3E-10 52.8 38.0 413 1-514 11-453 (453) 57 protein:vir:38 Length: 496 # N 98.2 2.1E-06 1.3E-09 51.7 34.4 430 1-513 13-496 (496) 58 protein:vir:96240 Length: 511 98.2 2.3E-06 1.4E-09 51.5 38.8 427 1-515 31-501 (511) 59 protein:vir:103951 Length: 511 98.2 3E-06 1.9E-09 50.9 38.5 427 1-515 31-501 (511) 60 protein:vir:80453 Length: 535 98.2 3.2E-06 2E-09 50.7 32.3 434 1-515 32-534 (535) 61 protein:vir:4898 Length: 502 # 98.1 3.8E-06 2.4E-09 50.3 35.8 427 1-515 31-492 (502) 62 protein:vir:80680 Length: 441 98.1 4E-06 2.5E-09 50.2 36.5 408 1-508 1-441 (441) 63 protein:vir:733 Length: 453 # 98.1 4.5E-06 2.8E-09 49.9 38.1 412 1-512 11-453 (453) 64 protein:vir:9306 Length: 511 # 98.1 4.8E-06 3E-09 49.8 38.5 425 1-515 31-503 (511) 65 protein:vir:3609 Length: 452 # 98.1 5.4E-06 3.4E-09 49.5 37.2 416 1-514 9-452 (452) 66 protein:vir:9871 Length: 429 # 98.0 6.4E-06 4E-09 49.1 38.7 406 10-510 1-429 (429) 67 protein:vir:96366 Length: 511 98.0 6.7E-06 4.2E-09 48.9 38.1 431 1-515 31-501 (511) 68 protein:vir:78805 Length: 511 98.0 6.7E-06 4.2E-09 48.9 38.1 431 1-515 31-501 (511) 69 protein:vir:345 Length: 663 # 98.0 7E-06 4.3E-09 48.9 30.5 474 1-515 1-632 (663) 70 protein:vir:96494 Length: 501 98.0 8.6E-06 5.3E-09 48.3 36.3 428 1-515 30-496 (501) 71 protein:vir:78227 Length: 480 97.9 9.9E-06 6.1E-09 48.0 35.1 414 8-515 1-467 (480) 72 protein:vir:1587 Length: 508 # 97.9 1E-05 6.2E-09 48.0 34.6 427 1-510 20-508 (508) 73 protein:vir:99072 Length: 479 97.9 1E-05 6.2E-09 48.0 38.3 416 1-515 1-463 (479) 74 protein:vir:94805 Length: 492 97.8 1.7E-05 1.1E-08 46.7 33.8 420 1-515 35-491 (492) 75 protein:vir:97336 Length: 492 97.8 1.8E-05 1.1E-08 46.7 35.9 415 1-515 35-485 (492) 76 protein:vir:2427 Length: 485 # 97.8 2.2E-05 1.4E-08 46.1 35.9 422 1-515 1-475 (485) 77 protein:vir:105889 Length: 474 97.7 2.3E-05 1.4E-08 46.0 30.8 425 1-515 3-473 (474) 78 protein:vir:94101 Length: 474 97.7 2.3E-05 1.4E-08 46.0 30.8 425 1-515 3-473 (474) 79 protein:vir:98444 Length: 434 97.7 2.4E-05 1.5E-08 45.9 31.3 387 40-515 1-429 (434) 80 protein:vir:2732 Length: 501 # 97.7 3.2E-05 2E-08 45.2 36.9 426 1-515 29-491 (501) 81 protein:vir:106639 Length: 481 97.6 3.7E-05 2.3E-08 44.8 39.3 421 1-515 22-481 (481) 82 protein:vir:95113 Length: 474 97.6 3.9E-05 2.4E-08 44.8 36.1 419 1-515 7-469 (474) 83 protein:vir:1236 Length: 483 # 97.6 3.9E-05 2.4E-08 44.7 35.0 417 1-515 22-478 (483) 84 protein:vir:101494 Length: 527 97.6 4.4E-05 2.7E-08 44.5 27.6 459 1-515 1-522 (527) 85 protein:vir:93747 Length: 472 97.6 4.4E-05 2.7E-08 44.5 36.3 419 1-515 15-467 (472) 86 protein:vir:102239 Length: 527 97.6 4.6E-05 2.9E-08 44.4 27.6 459 1-515 1-522 (527) 87 protein:vir:78537 Length: 480 97.5 5.1E-05 3.2E-08 44.1 34.8 412 6-515 1-467 (480) 88 protein:vir:78907 Length: 518 97.5 6E-05 3.7E-08 43.7 33.9 425 6-515 1-518 (518) 89 protein:vir:99781 Length: 511 97.4 6.5E-05 4.1E-08 43.5 40.2 425 1-515 31-501 (511) 90 protein:vir:9922 Length: 489 # 97.3 0.00011 7E-08 42.2 35.6 425 1-515 1-487 (489) 91 protein:vir:97265 Length: 513 97.2 0.00012 7.4E-08 42.1 29.5 449 1-515 1-511 (513) 92 protein:vir:96266 Length: 474 97.2 0.00014 8.4E-08 41.8 36.4 413 1-515 7-466 (474) 93 protein:vir:95899 Length: 474 97.2 0.00014 8.4E-08 41.8 36.4 413 1-515 7-466 (474) 94 protein:vir:106571 Length: 499 97.2 0.00015 9.3E-08 41.6 38.9 424 1-515 1-478 (499) 95 protein:vir:80959 Length: 499 97.1 0.00016 1E-07 41.4 34.8 425 1-513 1-499 (499) 96 protein:vir:94742 Length: 409 97.0 0.00021 1.3E-07 40.7 29.1 377 10-473 1-409 (409) 97 protein:vir:99522 Length: 470 97.0 0.00022 1.3E-07 40.7 38.3 419 1-512 1-470 (470) 98 protein:vir:1634 Length: 409 # 97.0 0.00024 1.5E-07 40.5 29.1 377 10-473 1-409 (409) 99 protein:vir:105461 Length: 470 96.9 0.00028 1.7E-07 40.1 38.1 415 10-514 1-470 (470) 100 protein:vir:9751 Length: 422 # 96.8 0.00035 2.2E-07 39.5 30.8 389 10-490 1-422 (422) 101 protein:vir:97447 Length: 474 96.4 0.00064 4E-07 38.1 36.6 417 1-515 7-467 (474) 102 protein:vir:94498 Length: 474 96.4 0.00064 4E-07 38.1 36.6 417 1-515 7-467 (474) 103 protein:vir:94546 Length: 506 96.3 0.00072 4.5E-07 37.8 37.4 429 1-515 13-501 (506) 104 protein:vir:4782 Length: 522 # 96.3 0.00078 4.8E-07 37.6 35.4 426 1-515 18-518 (522) 105 protein:vir:98883 Length: 517 96.3 0.00079 4.9E-07 37.6 39.3 434 6-515 1-511 (517) 106 protein:vir:102950 Length: 471 96.2 0.00092 5.7E-07 37.2 36.5 412 10-514 1-471 (471) 107 protein:vir:95806 Length: 440 96.1 0.001 6.2E-07 37.0 37.2 398 18-514 1-440 (440) 108 protein:vir:9568 Length: 410 # 96.1 0.001 6.4E-07 37.0 31.2 377 8-494 1-410 (410) 109 protein:vir:79703 Length: 505 96.0 0.0011 6.7E-07 36.8 38.5 420 6-511 1-505 (505) 110 protein:vir:95149 Length: 501 96.0 0.0011 6.8E-07 36.8 33.3 435 1-515 1-500 (501) 111 protein:vir:96179 Length: 468 95.6 0.0017 1.1E-06 35.7 36.4 412 1-515 1-467 (468) 112 protein:vir:8184 Length: 474 # 95.2 0.0026 1.6E-06 34.8 35.0 409 1-510 12-474 (474) 113 protein:vir:95014 Length: 491 95.2 0.0026 1.6E-06 34.8 29.9 422 1-515 3-485 (491) 114 protein:vir:105292 Length: 478 95.1 0.0027 1.7E-06 34.7 36.3 411 1-515 1-472 (478) 115 protein:vir:79043 Length: 479 95.0 0.0031 1.9E-06 34.4 37.7 425 1-515 6-479 (479) 116 protein:vir:102602 Length: 456 94.7 0.0038 2.4E-06 33.8 34.6 418 1-511 2-456 (456) 117 protein:vir:105819 Length: 456 94.7 0.0038 2.4E-06 33.8 34.6 418 1-511 2-456 (456) 118 protein:vir:78393 Length: 489 94.6 0.004 2.5E-06 33.7 30.0 421 1-515 2-483 (489) 119 protein:vir:9815 Length: 500 # 94.3 0.0048 3E-06 33.3 34.1 423 1-511 11-500 (500) 120 protein:vir:3028 Length: 500 # 94.3 0.0048 3E-06 33.3 34.1 423 1-511 11-500 (500) 121 protein:vir:4995 Length: 384 # 94.2 0.005 3.1E-06 33.2 19.1 356 6-473 1-384 (384) 122 protein:vir:107112 Length: 478 94.2 0.0052 3.2E-06 33.1 36.2 412 1-515 1-472 (478) 123 protein:vir:4828 Length: 382 # 93.9 0.006 3.7E-06 32.8 18.8 356 16-483 1-382 (382) 124 protein:vir:4698 Length: 251 # 92.4 0.0068 4.2E-06 32.5 9.8 236 6-369 1-251 (251) 125 protein:vir:1326 Length: 457 # 91.4 0.016 1E-05 30.4 19.3 400 16-515 1-441 (457) 126 protein:vir:3843 Length: 397 # 89.1 0.028 1.8E-05 29.1 19.4 367 6-515 1-393 (397) 127 protein:vir:5961 Length: 503 # 89.0 0.029 1.8E-05 29.0 36.6 433 1-515 1-500 (503) 128 protein:vir:101647 Length: 460 86.4 0.046 2.8E-05 27.9 22.3 407 1-513 1-460 (460) 129 protein:vir:7987 Length: 456 # 84.8 0.058 3.6E-05 27.4 37.7 417 1-509 1-456 (456) 130 protein:vir:96783 Length: 488 80.0 0.099 6.1E-05 26.1 34.0 407 1-469 14-488 (488) 131 protein:vir:98853 Length: 219 79.7 0.1 6.3E-05 26.0 13.7 193 215-422 1-219 (219) 132 protein:vir:1266 Length: 416 # 76.7 0.13 8.2E-05 25.4 20.9 387 6-515 1-416 (416) 133 protein:vir:4854 Length: 386 # 76.5 0.13 8.4E-05 25.4 20.0 361 16-508 1-386 (386) 134 protein:vir:6240 Length: 457 # 75.2 0.15 9.3E-05 25.1 16.4 413 16-515 1-454 (457) 135 protein:vir:3989 Length: 392 # 75.0 0.15 9.4E-05 25.1 23.8 324 39-456 1-392 (392) 136 protein:vir:1023 Length: 392 # 75.0 0.15 9.4E-05 25.1 23.8 324 39-456 1-392 (392) 137 protein:vir:94956 Length: 452 73.3 0.17 0.00011 24.8 30.2 419 1-515 1-449 (452) 138 protein:vir:7407 Length: 392 # 73.2 0.17 0.00011 24.8 23.3 313 73-456 1-392 (392) 139 protein:vir:78083 Length: 537 73.1 0.17 0.00011 24.7 39.2 438 1-515 1-504 (537) 140 protein:vir:81152 Length: 411 71.6 0.19 0.00012 24.5 18.3 371 6-512 1-411 (411) 141 protein:vir:3153 Length: 467 # 67.2 0.26 0.00016 23.8 20.0 404 54-505 1-467 (467) 142 protein:vir:4952 Length: 386 # 65.8 0.28 0.00017 23.6 27.2 359 16-515 1-386 (386) 143 protein:vir:4454 Length: 414 # 65.5 0.28 0.00018 23.6 15.1 375 16-515 1-407 (414) 144 protein:vir:102330 Length: 451 65.2 0.29 0.00018 23.5 36.3 406 10-510 1-451 (451) 145 protein:vir:80040 Length: 461 64.5 0.3 0.00019 23.5 23.0 421 1-515 1-461 (461) 146 protein:vir:78161 Length: 355 63.4 0.32 0.0002 23.3 19.7 291 172-515 1-355 (355) 147 protein:vir:7853 Length: 518 # 51.2 0.59 0.00037 21.8 16.2 384 39-515 1-436 (518) 148 protein:vir:78749 Length: 337 48.2 0.68 0.00042 21.5 25.1 298 45-413 1-337 (337) 149 protein:vir:101648 Length: 518 42.7 0.87 0.00054 20.9 16.0 379 39-515 1-436 (518) 150 protein:vir:96839 Length: 474 36.9 1.1 0.00071 20.3 35.5 416 1-515 1-473 (474) 151 protein:vir:96738 Length: 505 34.8 1.3 0.00079 20.0 24.8 447 1-515 1-501 (505) 152 protein:vir:4598 Length: 416 # 31.6 1.5 0.00092 19.6 17.2 377 6-514 1-416 (416) 153 protein:vir:81095 Length: 416 31.6 1.5 0.00092 19.6 17.2 377 6-514 1-416 (416) 154 protein:vir:5249 Length: 437 # 31.5 1.5 0.00092 19.6 19.5 400 30-514 1-437 (437) 155 protein:vir:78641 Length: 278 25.7 2 0.0013 18.9 17.9 257 79-424 1-278 (278) 156 protein:vir:102855 Length: 432 23.8 2.3 0.0014 18.6 19.1 389 6-515 1-428 (432) 157 protein:vir:107605 Length: 432 23.8 2.3 0.0014 18.6 19.1 389 6-515 1-428 (432) 158 protein:vir:105002 Length: 432 23.8 2.3 0.0014 18.6 19.1 389 6-515 1-428 (432) 159 protein:vir:102080 Length: 429 22.1 2.5 0.0015 18.4 19.8 385 16-515 1-425 (429) 160 protein:vir:100150 Length: 437 21.8 2.5 0.0016 18.4 25.1 389 11-515 1-433 (437) 161 protein:vir:79538 Length: 502 21.8 2.5 0.0016 18.4 25.7 436 1-514 1-502 (502) 162 protein:vir:483 Length: 413 # 21.5 2.6 0.0016 18.3 22.2 369 17-515 1-406 (413) 163 protein:vir:3780 Length: 345 # 21.3 2.6 0.0016 18.3 19.1 308 40-426 1-345 (345) No 1 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=100.00 E-value=3.8e-186 Score=1037.14 Aligned_cols=515 Identities=99% Similarity=1.445 Sum_probs=506.9 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCccccccccccHHHHHHHHHHHHHHhhcCC Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQGVGAQATNHLANKLAQVLFPA 80 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~s~ltpp 80 (515) |++|+||+|++|++|++||++||++|++|+++|+||++||+|++|++++++++++++|||||++|+++|||||||+|||| T Consensus 1 ~~~~~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp 80 (515) T protein:vir:70 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQGVGAQATNHLANKLAQVLFPA 80 (515) T ss_pred CcchhhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccccCCCCCcccccccccchHHHHHHHHHHHHHHhhcCC Confidence 99999999999999999999999999999999999999999999999998888899999999999999999999999999 Q ss_pred CCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCcEEEEEcc Q lcl|NC_020414. 81 QRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKGAMSAVPMH 160 (515) Q Consensus 81 ~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~~~r~~pl~ 160 (515) ++|||||+++|+..+.+++.+...+++++||+.||+.++.+|++||||.++|++|+||++|||||+|+|++++|++|||+ T Consensus 81 ~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~~~~~~pl~ 160 (515) T protein:vir:70 81 QRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKGAMSAVPMH 160 (515) T ss_pred CCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEEeCCCCeEEEEcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCc Q lcl|NC_020414. 161 HYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRI 240 (515) Q Consensus 161 ~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy 240 (515) +|||++|++|+||+|||||+||+++|+++||.+..+....++.+|+++|+|||||++++++||+||++++|++++++||| T Consensus 161 ~y~v~~d~~G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~~~e~d~~~~~~es~y 240 (515) T protein:vir:70 161 HYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKESRI 240 (515) T ss_pred eEEEeeCCCcCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecCCCceEEEEecCceeecccccc Confidence 99999999999999999999999999999998887777777889999999999999999999999999999999999999 Q ss_pred ccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCCcceecCCc Q lcl|NC_020414. 241 KAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGVE 320 (515) Q Consensus 241 ~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~~g~~ 320 (515) +.++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|||+++|+|++++.++.++++|.+++|.+ T Consensus 241 ~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~l~~~~~g~iv~g~~ 320 (515) T protein:vir:70 241 KSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGVA 320 (515) T ss_pred ccccCCceeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhhccccCCceeecCCc Confidence 98999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHH Q lcl|NC_020414. 321 EDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWG 400 (515) Q Consensus 321 ~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~ 400 (515) ++++++++++++||+.++..|++++++|+++||++++.++++++||||||++|++||+++||||||||+.|||.||++|+ T Consensus 321 ~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~srL~~Ell~Pli~r~ 400 (515) T protein:vir:70 321 EDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWG 400 (515) T ss_pred ccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHH Q lcl|NC_020414. 401 LQEAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEM 480 (515) Q Consensus 401 ~~~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev 480 (515) +++++|+||++++++++|+|+++|+|++++++|.+++++++.+++++|+++++||+|+++|++++.+|+|.+++||+||| T Consensus 401 ~~~~~p~~P~~~v~~~~vs~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~~~rs~eev 480 (515) T protein:vir:70 401 LQEAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEM 480 (515) T ss_pred HHhhCCCCChhhcccceehhHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhhCCHHHHHHHHHHHhCCCccccCCHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 481 QQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 481 ~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) +++|+|+++++|++++++++++|+++++++.|+|| T Consensus 481 ~~~r~q~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) T protein:vir:70 481 QQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhcccchhhhhccC Confidence 99999999999999999999999999999999999 No 2 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=100.00 E-value=8.9e-181 Score=1007.67 Aligned_cols=515 Identities=86% Similarity=1.333 Sum_probs=503.9 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCccccccccccHHHHHHHHHHHHHHhhcCC Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQGVGAQATNHLANKLAQVLFPA 80 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~s~ltpp 80 (515) =.+|++++|++|++|++||++||++|++||++|+||++||+|+++++++++++++++|||||++|+++|||||||+|||| T Consensus 2 ~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp 81 (516) T protein:vir:10 2 KQSTDLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGDNETSQNGWQGVGAQATNHLANKLAQVLFPA 81 (516) T ss_pred CchhhHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccCCCCCcccccccccchHHHHHHHHHHHHHhhhcCC Confidence 46799999999999999999999999999999999999999999999999888899999999999999999999999999 Q ss_pred CCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCcEEEEEcc Q lcl|NC_020414. 81 QRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKGAMSAVPMH 160 (515) Q Consensus 81 ~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~~~r~~pl~ 160 (515) ++|||||+++|..++.+++.+.+.+++++||++||++++.+|++||||.++|++|+||++|||||+|+|++++|++|||+ T Consensus 82 ~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~~~~~~pl~ 161 (516) T protein:vir:10 82 QRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSKGAISAIPMH 161 (516) T ss_pred CCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCCCCeEEEEcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCc Q lcl|NC_020414. 161 HYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRI 240 (515) Q Consensus 161 ~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy 240 (515) +|||++|++|+|++||||+++|+++|+++|+....+.++..+++|++++++||||++++++||++|+++|+++++++|+| T Consensus 162 ~y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~~~~~~~~~~~~~d~~~~~~~s~~ 241 (516) T protein:vir:10 162 HYVVNRDTNGDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKYLGEGFWELKQSADDIPVGKVSKI 241 (516) T ss_pred eEEEeeCCCCCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEecCCCceEEEEeeCceeecccccc Confidence 99999999999999999999999999999987766666677888999999999999999999999999999999999999 Q ss_pred ccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCCcceecCCc Q lcl|NC_020414. 241 KAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGVE 320 (515) Q Consensus 241 ~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~~g~~ 320 (515) +.++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|||+++|+|++++.++.++++|.+++|.+ T Consensus 242 ~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~~~~g~~~~g~~ 321 (516) T protein:vir:10 242 KSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVNSGTGEVVTGVE 321 (516) T ss_pred ccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhhhccCCCceeecCCc Confidence 88899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHH Q lcl|NC_020414. 321 EDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWG 400 (515) Q Consensus 321 ~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~ 400 (515) ++++++++++++||+.++..|++++++|+++||++.+.++++++||||||++|++||+++||||||||+.|||.|||+|+ T Consensus 322 ~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~ 401 (516) T protein:vir:10 322 EDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWG 401 (516) T ss_pred ccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHH Q lcl|NC_020414. 401 LQEAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEM 480 (515) Q Consensus 401 ~~~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev 480 (515) +.+++|++|++++++++|+||++|+|+|++++|.+++++|+++++++|+++|+||+|+++|++++++|||++++||+||| T Consensus 402 ~~~~~p~~P~~lv~~~~v~~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~~~~~a~~~gvp~~~irs~eev 481 (516) T protein:vir:10 402 LLEAGDSFTSDLVDPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEM 481 (516) T ss_pred HHhhCCCCChhhcCcceehhHHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHHHHHHHHHhCCChhccCCHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 481 QQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 481 ~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) +++|+|+++++|.+++++++++|.++++|++++|+ T Consensus 482 ~~~r~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:10 482 EQEQEAQMQAQQAQMLEEGVAKAVPGVIQQELKEA 516 (516) T ss_pred HHHHHHHHHHHHHHHHHHHhhhcccchhhhhhhcC Confidence 99999999999999999999999999999999999 No 3 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=100.00 E-value=1.3e-178 Score=995.73 Aligned_cols=515 Identities=86% Similarity=1.333 Sum_probs=501.3 Q ss_pred CCC-ccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCccccccccccHHHHHHHHHHHHHHhhcC Q lcl|NC_020414. 1 MQD-TILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQGVGAQATNHLANKLAQVLFP 79 (515) Q Consensus 1 ~~~-~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~s~ltp 79 (515) |.+ -.+++|++|++|++||++|+++|++||++|+||++||+|+++++++++++++++|||||++|+++|||||||+||| T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltp 80 (516) T protein:vir:96 1 MKQSIDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGDNETSQNGWQGVGAQATNHLANKLAQVLFP 80 (516) T ss_pred CcchhhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcccccCCCCCccccCCcccchHHHHHHHHHHHHHhhhcC Confidence 654 4678999999999999999999999999999999999999999999888889999999999999999999999999 Q ss_pred CCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCcEEEEEc Q lcl|NC_020414. 80 AQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKGAMSAVPM 159 (515) Q Consensus 80 p~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~~~r~~pl 159 (515) |++|||||+++|..++.+++.+.+.+++++||++||++|+.+|++||||.++|++|+||++|||||+|+|++++|++||| T Consensus 81 p~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~~~~~~~pl 160 (516) T protein:vir:96 81 AQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSKGAISAIPM 160 (516) T ss_pred CCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCCCCEEEEEc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCC Q lcl|NC_020414. 160 HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENR 239 (515) Q Consensus 160 ~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esg 239 (515) ++|||++|++|+|++||||+++++++|+++|+........+.+++++++|+|||||+|+++++|+||+++|+++++++|| T Consensus 161 ~~y~v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~d~~~~~~es~ 240 (516) T protein:vir:96 161 HHYVVNRDTNGDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDGFWELKQSADDIPVGKVSK 240 (516) T ss_pred CeEEEeeCCCCCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCceeEEEEEeCceeeccccc Confidence 99999999999999999999999999999998766555556678899999999999999999999999999999999999 Q ss_pred cccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCCcceecCC Q lcl|NC_020414. 240 IKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGV 319 (515) Q Consensus 240 y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~~g~ 319 (515) |+.++|||+++||++.+||+||||||+++|||+|+||.|+++++++++++++|||+++|+|++++.++.++++|.+++|+ T Consensus 241 ~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l~~~~~g~i~~g~ 320 (516) T protein:vir:96 241 IKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHFVNSGTGEVVTGV 320 (516) T ss_pred cccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhhhccCCCceeecCC Confidence 98889999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHH Q lcl|NC_020414. 320 EEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMW 399 (515) Q Consensus 320 ~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r 399 (515) +++++++++++++||+.++..|++++++|+++||++.+.++++++||||||++|++||+++||||||||+.|||.|||+| T Consensus 321 ~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~r~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r 400 (516) T protein:vir:96 321 EEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMW 400 (516) T ss_pred cccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhccCCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHH Q lcl|NC_020414. 400 GLQEAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEE 479 (515) Q Consensus 400 ~~~~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~ee 479 (515) ++...+|++|+.++++++|+||++|+|++++++|.+++++|+++++++|+++|+||+|++++++++++|||++++||+|| T Consensus 401 ~l~~~~p~lp~~~v~~~~vs~l~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~irs~ee 480 (516) T protein:vir:96 401 GLLEAGESFTSDLVDPVIITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEE 480 (516) T ss_pred HHHhcCCCCccccccceeechHHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHhCCCccccCCHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 480 MQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 480 v~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) |+++|+|+++++|+++++++++++.++.++++.||+ T Consensus 481 v~~~~~~~~~~q~~~~~a~~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:96 481 MAQEQEAQMQAQQAQMLEEGVAKAVPGVIQQELKEA 516 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhHHhhcccccC Confidence 999999999999999999999999999999999999 No 4 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=100.00 E-value=3.2e-175 Score=977.19 Aligned_cols=512 Identities=56% Similarity=0.952 Sum_probs=492.8 Q ss_pred ccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCccccccccccHHHHHHHHHHHHHHhhcCCCCC Q lcl|NC_020414. 4 TILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQGVGAQATNHLANKLAQVLFPAQRS 83 (515) Q Consensus 4 ~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~s~ltpp~~~ 83 (515) -+|++|+++++|++||++||++|++|+++|+||++||+|+++++++++.+.+++|||||++|+++|||||||+||||++| T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~ 80 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDDLSSQNAWQDDGASATNFLSNKLSQVLFPAQRS 80 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCCccccccccchHHHHHHHHHHHHHHhhcCCCCc Confidence 79999999999999999999999999999999999999999999998888899999999999999999999999999999 Q ss_pred ceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCC-CcEEEEEcceE Q lcl|NC_020414. 84 FFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSK-GAMSAVPMHHY 162 (515) Q Consensus 84 WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~-~~~r~~pl~~y 162 (515) ||||+++++.+++.+......++|+.||++||++++.+|++||||.++|++|+||++|||||+|+++. .+|++|||++| T Consensus 81 WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~~~~pl~~y 160 (517) T protein:vir:10 81 FFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYHPDKTSPIQAVPLHHY 160 (517) T ss_pred cccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEeCCCCcEEEEEcCeE Confidence 99999999999998888889999999999999999999999999999999999999999999999765 46999999999 Q ss_pred EEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCccc Q lcl|NC_020414. 163 VVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRIKA 242 (515) Q Consensus 163 ~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy~~ 242 (515) ||++|++|+|++||||+++|+++|+++|+.+..+.+..++++|+++|+|||||+|+++++|+||+++|+++++++|||++ T Consensus 161 ~v~~d~~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~d~~~~~~~s~y~~ 240 (517) T protein:vir:10 161 CVRRDNNGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRTKDGKYLIRQSADDVPVGKESTVTE 240 (517) T ss_pred EEeeCCCcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEEeCCCceEEEEEeCceeecccccccc Confidence 99999999999999999999999999999988777777788999999999999999999999999999999999999998 Q ss_pred ccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCCcceecCCccc Q lcl|NC_020414. 243 EKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGVEED 322 (515) Q Consensus 243 ~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~~g~~~~ 322 (515) ++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|||+++|+|++++.++.++++|++++|++++ T Consensus 241 ~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~l~~~~~g~~~~g~~~~ 320 (517) T protein:vir:10 241 DKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQFVEGGSGAVLHGVEGD 320 (517) T ss_pred ccCCeeeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhccCCCccccccCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 323 IHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ 402 (515) Q Consensus 323 v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~ 402 (515) +.++++++++||+.+++.|++++++|+++||++.+.++++++||||||++|++||+++||||||||+.|||.|||+|++. T Consensus 321 v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~ 400 (517) T protein:vir:10 321 IHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEAMTRRDAERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGPLARWFMN 400 (517) T ss_pred ceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHH Q lcl|NC_020414. 403 EAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQ 482 (515) Q Consensus 403 ~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~ 482 (515) ...+.+|...+++++++++++|+|++++++|.+++++++++++++|.++++||+|++++++++++|||+++|||++||++ T Consensus 401 ~l~~~l~~~~v~~~~~s~la~l~r~~~~~~i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~Gvp~~~irs~~ev~~ 480 (517) T protein:vir:10 401 GISSILTSKNVSPTILTGIEALGRMAELDKLGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQISANFPFFKTQDELNA 480 (517) T ss_pred HhhhhcCCCCccceeeccHHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHhCCChhhcCCHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhhhccchh--hhhhccC Q lcl|NC_020414. 483 EMAQQAQAQQEAMLNEGVAKAVPGVI--QQEMKEG 515 (515) Q Consensus 483 ~rq~~~~~~q~~~~~~~~~~a~~~~~--~~~~~~~ 515 (515) +|+++++++|+++++++++++++..+ ++++++| T Consensus 481 ~~~~~~~~~~~~~~~~~ag~~~~~~~~~~~~~~~~ 515 (517) T protein:vir:10 481 EAQAQQEQEATKYAAEQAGKAIPDMVKNGQINPQG 515 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCC Confidence 99999999988888888877776666 4555666 No 5 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=100.00 E-value=1.9e-165 Score=923.64 Aligned_cols=509 Identities=27% Similarity=0.440 Sum_probs=463.8 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCcc--ccccccccHHHHHHHHHHHHHHhhc Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNET--SQNGWQGVGAQATNHLANKLAQVLF 78 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~--~~~~~dst~~~a~~~Laa~l~s~lt 78 (515) ||+|+ +.|+.+++|++||++||++|++||++|+||++||+|+++++++++.. ..++|||||++|+++|||||||+|| T Consensus 1 m~~~~-~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~lt 79 (532) T protein:vir:99 1 MAEVE-KTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGARGLNNLASKLMLALF 79 (532) T ss_pred Ccchh-hccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCcchhhccccccchHHHHHHHHHHHHHHhhc Confidence 99999 89999999999999999999999999999999999999988776543 4689999999999999999999999 Q ss_pred CCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC------ Q lcl|NC_020414. 79 PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG------ 152 (515) Q Consensus 79 pp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~------ 152 (515) ||++|||||.++|+.+++....+...++|+.||++||++|+.+|++||||.++|++|+||++|||||+|++++. T Consensus 80 pp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~ 159 (532) T protein:vir:99 80 PVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQS 159 (532) T ss_pred CCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEecccccccCcc Confidence 99999999999999999988888899999999999999999999999999999999999999999999997532 Q ss_pred -cEEEEEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCC-eEEEEEeC Q lcl|NC_020414. 153 -AMSAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGF-WKINQSAD 230 (515) Q Consensus 153 -~~r~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~-~~~~~e~~ 230 (515) +|++|||++|||++|++|+|++||||+++++++|+++++.+.... ..+++|+++|+|||||+|++++. +++|++++ T Consensus 160 ~~f~~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~--~~~~~p~~~v~v~~~v~~~~~~~~~~~~~~~~ 237 (532) T protein:vir:99 160 NAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEDA--QGDQNPSEEVTIYTHVYRDPEAMVFRSYQEID 237 (532) T ss_pred cceEEEEcCeEEEeeCCCCCeeeEeeeeeecHHhcChHHHHHhhcc--ccccCCCcceEEEEEEEecCCCCeeEEEEeec Confidence 489999999999999999999999999999999999998775533 24568999999999999998875 77888888 Q ss_pred Cee-ecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccC Q lcl|NC_020414. 231 DIP-VGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVN 309 (515) Q Consensus 231 ~~~-i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~ 309 (515) |+. ++++|+|+.++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||+++|+|++++.++.+ T Consensus 238 g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~ 317 (532) T protein:vir:99 238 GEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVAK 317 (532) T ss_pred CceecccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCceeccccccchhhhcc Confidence 864 5788998778999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHH Q lcl|NC_020414. 310 SGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFA 389 (515) Q Consensus 310 ~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~ 389 (515) +++|++++|.+++++++++++++||+.++..|++++++|+++||++++.++|+++||||||++|++|++++||||||||+ T Consensus 318 ~~~g~~v~g~~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~ 397 (532) T protein:vir:99 318 ANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLS 397 (532) T ss_pred CCCcceecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCcccHHHHHHHHHHHHHHhhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHH-----hcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHH Q lcl|NC_020414. 390 MTMQTPIAMWGLQ-----EAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVR 464 (515) Q Consensus 390 ~E~l~Pli~r~~~-----~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a 464 (515) .|||.|||+|++. +.+|++|++++++.+|+|++||+|+|+++++.++++ .++++.|+++|+||+|++++.++ T Consensus 398 ~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~iv~~is~Laraq~~~~l~~~~~---~laq~~p~~~d~id~d~~~~~~a 474 (532) T protein:vir:99 398 QELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALGRGHDLNKLNVFID---YMIKLAGLQDDDINLLDVKMRLA 474 (532) T ss_pred HHHHHHHHHHHHHHHHhcCCCCCCChhhcccceeecchHHHHHHHHHHHHHHHH---HHHhhcchhhhhCCHHHHHHHHH Confidence 9999999999753 789999999999999999999999999887776655 45777888999999999999999 Q ss_pred HhcCC-chhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhc--cC Q lcl|NC_020414. 465 GQISA-ELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMK--EG 515 (515) Q Consensus 465 ~~~Gv-p~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~--~~ 515 (515) +++|| |+.++||+|||+++|||+++++++++..++++++++-+.+..++ -| T Consensus 475 ~~~GV~~~~i~r~~ee~~~~~~q~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 528 (532) T protein:vir:99 475 NSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAG 528 (532) T ss_pred HHhCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhHHhhcC Confidence 99999 46899999999999998877777666555555444322222221 12 No 6 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=100.00 E-value=8.3e-165 Score=920.14 Aligned_cols=507 Identities=29% Similarity=0.449 Sum_probs=460.4 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCcc--ccccccccHHHHHHHHHHHHHHhhc Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNET--SQNGWQGVGAQATNHLANKLAQVLF 78 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~--~~~~~dst~~~a~~~Laa~l~s~lt 78 (515) ||+ .++|+.+++|++||++||++|++||++|+||++||+|+++++++++.+ ..++|||||++|+++|||||||+|| T Consensus 1 m~~--~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 78 (536) T protein:vir:21 1 MAE--KRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALF 78 (536) T ss_pred Ccc--hhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhc Confidence 999 567999999999999999999999999999999999999988876654 3579999999999999999999999 Q ss_pred CCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC-----c Q lcl|NC_020414. 79 PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG-----A 153 (515) Q Consensus 79 pp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~-----~ 153 (515) |+ +|||||.++|+.+++....+...+++++||+.||++++.+|++||||.++|++|+||++|||+|+|++++. . T Consensus 79 P~-~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~ 157 (536) T protein:vir:21 79 PM-QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNP 157 (536) T ss_pred CC-CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCceee Confidence 86 69999999999999888888889999999999999999999999999999999999999999999997653 3 Q ss_pred EEEEEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCC-CCeEEEEEeCCe Q lcl|NC_020414. 154 MSAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGE-GFWKINQSADDI 232 (515) Q Consensus 154 ~r~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~-~~~~~~~e~~~~ 232 (515) |++|||++|||++|++|+||+|||||+||+++|+++||.++.+.. .+++++++|+|||+|+|+++ +.|+||++++|+ T Consensus 158 f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~--~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~ 235 (536) T protein:vir:21 158 MKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQG--GEKKADETIDVYTHIYLDEDSGEYLRYEEVEGM 235 (536) T ss_pred EEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccc--cccccccceeEEEEEEEecCCCcEEEEeccCCe Confidence 789999999999999999999999999999999999998876543 45688999999999999865 579999999999 Q ss_pred eecccCC-cccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCC Q lcl|NC_020414. 233 PVGKENR-IKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSG 311 (515) Q Consensus 233 ~i~~esg-y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~ 311 (515) +++.++| |+.++|||+++||++.+|++||||||+++|||+|+||.|+++++++++++++|||+++|+|+++|+++.+++ T Consensus 236 ~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~ 315 (536) T protein:vir:21 236 EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQ 315 (536) T ss_pred eeccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCC Confidence 9866555 456789999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHH Q lcl|NC_020414. 312 TGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMT 391 (515) Q Consensus 312 ~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E 391 (515) +|++++|.+++++++++++++||+.+++.|++++++|+++||++++.++++++||||||++|++|++++|||||+||+.| T Consensus 316 ~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E 395 (536) T protein:vir:21 316 TGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQE 395 (536) T ss_pred CcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHH-----hcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHh-cCCHHHHHHHHHH Q lcl|NC_020414. 392 MQTPIAMWGLQ-----EAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQR-AIRWGDYMDWVRG 465 (515) Q Consensus 392 ~l~Pli~r~~~-----~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d-~id~d~~~~~~a~ 465 (515) ||.|||+|++. +.+|++|++++++.+|++|++|+|+++++++.+|++.+ +++.|+++| +||+|++++++++ T Consensus 396 ll~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~l---a~~~Pe~ld~~id~d~~~~~~a~ 472 (536) T protein:vir:21 396 LQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAW---AALAPMRDDPDINLAMIKLRIAN 472 (536) T ss_pred HHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHH---HhhchhhhcccCCHHHHHHHHHH Confidence 99999999753 78999999999999999999999999888888776654 677899998 5999999999999 Q ss_pred hcCC-chhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-----c---hhhhhhccC Q lcl|NC_020414. 466 QISA-ELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVP-----G---VIQQEMKEG 515 (515) Q Consensus 466 ~~Gv-p~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~-----~---~~~~~~~~~ 515 (515) ++|| |++++||+|||+++|+|+++++|+++++..+++++. + +.+...++| T Consensus 473 ~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g 531 (536) T protein:vir:21 473 AIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVG 531 (536) T ss_pred HcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhHHhhhhccc Confidence 9999 789999999999999999887777766554433221 1 112222233 No 7 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=100.00 E-value=1.2e-164 Score=919.23 Aligned_cols=507 Identities=29% Similarity=0.445 Sum_probs=460.1 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCcc--ccccccccHHHHHHHHHHHHHHhhc Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNET--SQNGWQGVGAQATNHLANKLAQVLF 78 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~--~~~~~dst~~~a~~~Laa~l~s~lt 78 (515) ||+ .++|+.+++|++||++||++|++||++|+||++||+|+++++++++.+ ..++|||||++|+++|||||||+|| T Consensus 1 m~~--~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 78 (536) T protein:vir:10 1 MAE--KRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNLASKLMLALF 78 (536) T ss_pred Ccc--hhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhhc Confidence 999 567999999999999999999999999999999999999988876654 3579999999999999999999999 Q ss_pred CCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC-----c Q lcl|NC_020414. 79 PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG-----A 153 (515) Q Consensus 79 pp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~-----~ 153 (515) |+ +|||||.++|+.+++....+...+++++||+.||++++.+|++||||.++|++|+||++|||+|+|++++. . T Consensus 79 P~-~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~ 157 (536) T protein:vir:10 79 PM-QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNP 157 (536) T ss_pred CC-CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCceee Confidence 76 69999999999999888888889999999999999999999999999999999999999999999997653 3 Q ss_pred EEEEEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCC-CCeEEEEEeCCe Q lcl|NC_020414. 154 MSAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGE-GFWKINQSADDI 232 (515) Q Consensus 154 ~r~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~-~~~~~~~e~~~~ 232 (515) |++|||++|||++|++|+||+|||||+||+++|+++||.++.+.. .+++++++|+|||||+|+++ +.|++|++++|+ T Consensus 158 ~~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~--~~~~~~~~v~v~~~V~~~~~~~~~~~~~e~~g~ 235 (536) T protein:vir:10 158 MKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQG--GEKKADETIDVYTHIYLDEASGEYLRYEEVEGM 235 (536) T ss_pred EEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccc--cccCcccceEEEEEEEEecCCCcEEEEEeecCc Confidence 789999999999999999999999999999999999998866543 35688999999999999864 579999999999 Q ss_pred eecccCC-cccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCC Q lcl|NC_020414. 233 PVGKENR-IKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSG 311 (515) Q Consensus 233 ~i~~esg-y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~ 311 (515) +++.++| |+.++|||+++||++.+|++||||||+++|||+|+||.|+++++++++++++|||+++|+|+++|+++.+++ T Consensus 236 ~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~~ 315 (536) T protein:vir:10 236 EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLTKAQ 315 (536) T ss_pred cccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhccCC Confidence 9865544 456889999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHH Q lcl|NC_020414. 312 TGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMT 391 (515) Q Consensus 312 ~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E 391 (515) +|++++|.+++++++++++++||+.+++.|++++++|+++||++++.++++++||||||++|++|++++|||||+||+.| T Consensus 316 ~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E 395 (536) T protein:vir:10 316 TGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQE 395 (536) T ss_pred CcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHH-----hcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHh-cCCHHHHHHHHHH Q lcl|NC_020414. 392 MQTPIAMWGLQ-----EAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQR-AIRWGDYMDWVRG 465 (515) Q Consensus 392 ~l~Pli~r~~~-----~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d-~id~d~~~~~~a~ 465 (515) ||.|||+|++. +.+|++|++++++.+|++|++|+|+++++++.+|++. ++++.|+++| +||+|++++++++ T Consensus 396 ll~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~---la~~~P~~ld~~id~d~~~~~~a~ 472 (536) T protein:vir:10 396 LQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTA---WAALAPMRDDPDINLAMIKLRIAN 472 (536) T ss_pred HHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHH---HHhhchhhhcccCCHHHHHHHHHH Confidence 99999999753 7899999999999999999999999988888777654 4678899998 5999999999999 Q ss_pred hcCC-chhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-----c---hhhhhhccC Q lcl|NC_020414. 466 QISA-ELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVP-----G---VIQQEMKEG 515 (515) Q Consensus 466 ~~Gv-p~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~-----~---~~~~~~~~~ 515 (515) ++|| |++++||+|||+++|+||++++|+++++.++++++. + +.+...++| T Consensus 473 ~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g 531 (536) T protein:vir:10 473 AIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVG 531 (536) T ss_pred HcCCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhHHhhhhccc Confidence 9999 789999999999999999887777766654443221 1 112222233 No 8 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=100.00 E-value=3.5e-164 Score=916.70 Aligned_cols=495 Identities=30% Similarity=0.447 Sum_probs=451.4 Q ss_pred cHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCcc--ccccccccHHHHHHHHHHHHHHhhcCCCCCceecC Q lcl|NC_020414. 11 QRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNET--SQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVD 88 (515) Q Consensus 11 ~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~--~~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~ 88 (515) =|+++++||++|| |++||++|+||++||+|+++++++++.+ .+++|||||++|+++|||||||+||||++|||||+ T Consensus 1 mk~~~~~~~~~lk--r~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) T protein:vir:78 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) T ss_pred ChhHHHHHHHHHh--ccchHHHHHHHHHhhccccccCCCCcccccccCcccchHHHHHHHHHHHHHHhhcCCCCcccccC Confidence 5789999999995 9999999999999999999988776543 36799999999999999999999999999999999 Q ss_pred CChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC-cEEEEEcceEEEeeC Q lcl|NC_020414. 89 LTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG-AMSAVPMHHYVVNRD 167 (515) Q Consensus 89 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~-~~r~~pl~~y~v~~d 167 (515) ++|..++++++.+.+.+++++||++||++++.+|++||||.++|++|+||++|||+++|++++. .|++|||++|||++| T Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~pl~~y~v~~d 158 (510) T protein:vir:78 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDEATVVAWSLRSYAVRRD 158 (510) T ss_pred CChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEeCCCCeEEEEEcceeEEeeC Confidence 9999999988888899999999999999999999999999999999999999999999998765 499999999999999 Q ss_pred CCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCC---CCeEEEEEeCCeeecccCCccccc Q lcl|NC_020414. 168 TNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGE---GFWKINQSADDIPVGKENRIKAEK 244 (515) Q Consensus 168 ~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~---~~~~~~~e~~~~~i~~esgy~~~~ 244 (515) ++|+||+|||||+||+++|+++|+.+..+. ..+++|+++|+|||+|++++. +|||||+|+||++++++|+|+.++ T Consensus 159 ~~G~vd~i~rr~~~t~~~l~~~~~~~~~~~--~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~i~~~~~~~~~e 236 (510) T protein:vir:78 159 ATGRWMDIVLKQRYKSKDLDDVYKQDLMRA--GRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHL 236 (510) T ss_pred CCcCeeEEEeeeeccHHHHHHHhhHHhhhh--hhccCCCceEEEEEEEEeecCCCCcEEEEEEEecCeeecccccccccc Confidence 999999999999999999999999876633 356789999999999999764 589999999999999999998899 Q ss_pred CcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCCcceecCCccccc Q lcl|NC_020414. 245 LPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGVEEDIH 324 (515) Q Consensus 245 ~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~~g~~~~v~ 324 (515) |||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|||+|+|+|+++|+++.++++|.++||++++++ T Consensus 237 ~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~l~~~~~g~~v~g~~~~v~ 316 (510) T protein:vir:78 237 CPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVR 316 (510) T ss_pred CCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCCccccchhhhccCCCceeecCCccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH-- Q lcl|NC_020414. 325 IVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ-- 402 (515) Q Consensus 325 ~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~-- 402 (515) ++++++++||+++++.|++++++|+++||++ +.++++++||||||++|++|++++||||||||+.|||.|||+|++. T Consensus 317 ~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il 395 (510) T protein:vir:78 317 AYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEV 395 (510) T ss_pred ccccCcccchHHHHHHHHHHHHHHHHHHhhc-cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999986 8899999999999999999999999999999999999999999753 Q ss_pred --hcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCC-chhccCCHHH Q lcl|NC_020414. 403 --EAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISA-ELPFLKSEEE 479 (515) Q Consensus 403 --~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gv-p~~~irs~ee 479 (515) ..++++|++.+++.+|+|+++|+|+|+++++.+++++++.+.++ +++.+.||+|++++++++++|| |..++||+|| T Consensus 396 ~r~gl~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~-~q~~~~id~d~~~~~~a~~~Gv~p~~ivrs~ee 474 (510) T protein:vir:78 396 DDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI-AQLDPRISLPKMMDTIWAAFSVDTSQFYKSADE 474 (510) T ss_pred HhccCCCCCcccccceeeecccHHHHHHHHHHHHHHHHHHHHhcCh-hhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHH Confidence 34566677789999999999999999999999999999887764 5588899999999999999999 6789999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhh----hccchhhhh Q lcl|NC_020414. 480 MQQEMAQQAQAQQEAMLNEGVAK----AVPGVIQQE 511 (515) Q Consensus 480 v~~~rq~~~~~~q~~~~~~~~~~----a~~~~~~~~ 511 (515) |+++|+++++++++++.++++.. +.+++.+|. T Consensus 475 v~a~~~~~~~q~~~~~~~~~a~~~~~~~~~~~~~g~ 510 (510) T protein:vir:78 475 LQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccCCCC Confidence 99999987665555544333311 111122222 No 9 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=100.00 E-value=2.5e-164 Score=917.47 Aligned_cols=508 Identities=29% Similarity=0.466 Sum_probs=463.1 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCcc--ccccccccHHHHHHHHHHHHHHhhc Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNET--SQNGWQGVGAQATNHLANKLAQVLF 78 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~--~~~~~dst~~~a~~~Laa~l~s~lt 78 (515) ||+|+ +.|+.+++|++||++|+++|++|+++|+||++||+|+++++++++.+ ..++|||||++|+++|||||||+|| T Consensus 1 ~~~~~-~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 79 (543) T protein:vir:88 1 MAETK-REGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSSTDYTTPWQAVGARGLNNLSAKVMLALF 79 (543) T ss_pred Ccccc-cCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhc Confidence 99999 88999999999999999999999999999999999999988776544 3579999999999999999999999 Q ss_pred CCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCc----- Q lcl|NC_020414. 79 PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKGA----- 153 (515) Q Consensus 79 pp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~~----- 153 (515) |+ +|||||+++|..+++....+.+.++++.||++||++|+.+|++||||.++|++|+||++|||+|+|++++.+ T Consensus 80 P~-~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~~~~ 158 (543) T protein:vir:88 80 PL-QSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDASSNSY 158 (543) T ss_pred CC-CcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCcccccee Confidence 86 799999999999988877888899999999999999999999999999999999999999999999987642 Q ss_pred --EEEEEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCC-CeEEEEEeC Q lcl|NC_020414. 154 --MSAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEG-FWKINQSAD 230 (515) Q Consensus 154 --~r~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~-~~~~~~e~~ 230 (515) |+.|||++|+|++|++|+||+|||||++|+++|+++|++++. ...+++|+++|+|||+|+|++++ .|++|++++ T Consensus 159 ~~~~~~pl~~y~v~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~---~~~~~~p~~~~~v~~~V~pr~~~~~~~~~~~~~ 235 (543) T protein:vir:88 159 NPMKLYTLHNHVVQRDAFGNVLQIVTLDKVAYAALPEDVRNSLS---GGQEYKPEQELEVYTHIYIDDESGDFLSYQEIE 235 (543) T ss_pred cceEEeEcceEEEeeCCCCCeeeeeeeeeccHHHHhHHhhHHHH---HHhhcCCccceEEEEEEEeecCCCccccccccc Confidence 677999999999999999999999999999999999987653 23467889999999999998664 588999999 Q ss_pred Ceee-cccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccC Q lcl|NC_020414. 231 DIPV-GKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVN 309 (515) Q Consensus 231 ~~~i-~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~ 309 (515) |+.+ +.+|+|+.++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||+++|+|++++.++.+ T Consensus 236 ~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~~~~ 315 (543) T protein:vir:88 236 GVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVRRLVK 315 (543) T ss_pred CeeeecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhccc Confidence 9988 688889888999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHH Q lcl|NC_020414. 310 SGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFA 389 (515) Q Consensus 310 ~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~ 389 (515) +++|.+++|.++++.++++++++||+.+++.|++++++|+++||++++.++++++||||||++|++|++++|||||+||+ T Consensus 316 ~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~ 395 (543) T protein:vir:88 316 AQTGDFVAGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLNSAVQRSGERVTAEEIRYVASELEDTLGGVYSILS 395 (543) T ss_pred CCCceeecCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCCCcccHHHHHHHHHHHHHHHhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHH-----hcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHH Q lcl|NC_020414. 390 MTMQTPIAMWGLQ-----EAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVR 464 (515) Q Consensus 390 ~E~l~Pli~r~~~-----~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a 464 (515) .|||.|||+|++. +.+|++|++++++++|++|++|+|++++++|.+++++++.+++ |+++|+||+|+++++++ T Consensus 396 ~E~l~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~v~~~~~--p~vld~id~d~~~~~~a 473 (543) T protein:vir:88 396 QELQLPIVRVLLNQLQATQQIPNLPQEAVEPTVTTGAEALGRGQDLDKLTQFLNAVATVSQ--LNGDPDLNVNNIKLRLA 473 (543) T ss_pred HHHHHHHHHHHHHHHHhcCCCCCCchhceeeeEEecHHHHHHHHHHHHHHHHHHHHHhccc--hhhhccCCHHHHHHHHH Confidence 9999999999753 7899999999999999999999999999999999999998775 89999999999999999 Q ss_pred HhcCCc-hhccCCHHHHHHHHHHHHHHHHHH--HHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 465 GQISAE-LPFLKSEEEMQQEMAQQAQAQQEA--MLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 465 ~~~Gvp-~~~irs~eev~~~rq~~~~~~q~~--~~~~~~~~a~~~~~~~~~~~~ 515 (515) +++||| .+++||++||+++|+|+++++|.+ +.++++++|++...+.+.+++ T Consensus 474 ~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (543) T protein:vir:88 474 NAIGIDTAGLLLTEAEKAQAQSQEMLKQGGLNAAAGIGSGVAAQATASPEAMES 527 (543) T ss_pred HHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhccChHHHHH Confidence 999995 689999999999999876544433 223333444333333222222 No 10 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=100.00 E-value=2.1e-163 Score=912.45 Aligned_cols=506 Identities=29% Similarity=0.456 Sum_probs=462.7 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCcc--ccccccccHHHHHHHHHHHHHHhhc Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNET--SQNGWQGVGAQATNHLANKLAQVLF 78 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~--~~~~~dst~~~a~~~Laa~l~s~lt 78 (515) ||..-.+.|+.++++++||++||++|++||++|+||++||+|+++++++++.+ ..++|||||++|+++|||||||+|| T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 80 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGLNNLASKLMLALF 80 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCccccccCCcccccHHHHHHHHHHHHHhhhc Confidence 98877788999999999999999999999999999999999999988876554 3679999999999999999999999 Q ss_pred CCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC----cE Q lcl|NC_020414. 79 PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG----AM 154 (515) Q Consensus 79 pp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~----~~ 154 (515) |+ +|||||+++|..+++++..+.+.+++++||++||++|+.+|++||||.++|++|+||++|||+|+|++++. +| T Consensus 81 P~-~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f 159 (535) T protein:vir:94 81 PM-QTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPEPEGTYNPM 159 (535) T ss_pred CC-CCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeccCcCcccce Confidence 86 69999999999999988888999999999999999999999999999999999999999999999998753 48 Q ss_pred EEEEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCC-CCeEEEEEeCCee Q lcl|NC_020414. 155 SAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGE-GFWKINQSADDIP 233 (515) Q Consensus 155 r~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~-~~~~~~~e~~~~~ 233 (515) ++|||++|||++|++|+||+|||||++++++|+++|++++... .+++++++|+|||||+++++ ..|++|++++|.. T Consensus 160 ~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~---~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~ 236 (535) T protein:vir:94 160 KLYRLSSYVVQRDAFGTVLQIVTLDKTAYAALPEDVRNSMDSS---QEHKGDEMIDVYTHIYLDEESGEYLKYEEIDGVE 236 (535) T ss_pred EEEEcCeEEEeeCCCCCeEEEEeeeeccHHHhhHHHHHHHHhc---cccCCCceeEEEEEEEeeCCCCcEEEEEEecCee Confidence 9999999999999999999999999999999999999765432 24678999999999999865 4688899999987 Q ss_pred e---cccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCC Q lcl|NC_020414. 234 V---GKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNS 310 (515) Q Consensus 234 i---~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~ 310 (515) + ++++|| ++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|||+++|+|+++++++.++ T Consensus 237 ~~~~~~~~g~--~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~~~ 314 (535) T protein:vir:94 237 VEGTDASYPV--DACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQVRRLTKA 314 (535) T ss_pred eccccccCcc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccccchhhcccC Confidence 6 356677 77999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHH Q lcl|NC_020414. 311 GTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAM 390 (515) Q Consensus 311 ~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~ 390 (515) ++|++++|.+++++++++++++||+.++..|++++++|+++||++++.++++++||||||++|++|++++||||||||+. T Consensus 315 ~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ 394 (535) T protein:vir:94 315 QTGDFVSGRPEDISFLQLEKAADFSVARAVSEQIEGRLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQ 394 (535) T ss_pred CCceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhHhhhccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHH-----hcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHh-cCCHHHHHHHHH Q lcl|NC_020414. 391 TMQTPIAMWGLQ-----EAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQR-AIRWGDYMDWVR 464 (515) Q Consensus 391 E~l~Pli~r~~~-----~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d-~id~d~~~~~~a 464 (515) |||.|||+|++. +.+|++|++++++++|++|++|+|+++++++.+|+ +.++++.|+++| +||+|+++++++ T Consensus 395 ElL~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~la~l~r~~~~~~l~~~~---~~laq~~P~~ld~~id~d~~~~~~a 471 (535) T protein:vir:94 395 ELQLPMVRVLLKQLQATNQIPELPKEAVEPTISTGMEALGRGQDLDKLERCI---AAWSALAPMQGDPDINIATIKLRIA 471 (535) T ss_pred HHHHHHHHHHHHHHHhCCCCCCCChhhccceEeehHHHHHHHHHHHHHHHHH---HHHHhhChHHhhhcCCHHHHHHHHH Confidence 999999999753 78999999999999999999999998877777665 456788899998 599999999999 Q ss_pred HhcCCc-hhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchh--h------hhhccC Q lcl|NC_020414. 465 GQISAE-LPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVI--Q------QEMKEG 515 (515) Q Consensus 465 ~~~Gvp-~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~--~------~~~~~~ 515 (515) +++||| ..++||+|||+++|+|+++++|++++++++++++++.. . ...+.| T Consensus 472 ~~~Gvp~~~i~rs~eev~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g 531 (535) T protein:vir:94 472 NAIGIDTSGILKTPEEKQQEMAEAAQGTAMQNAAASAGAGAGTMATASPENMKAAAAQAG 531 (535) T ss_pred HHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccChHHHHHHHHHhc Confidence 999999 57999999999999999998888888777776665432 1 122222 No 11 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=100.00 E-value=7.1e-163 Score=909.53 Aligned_cols=494 Identities=32% Similarity=0.463 Sum_probs=450.3 Q ss_pred cHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCcc--ccccccccHHHHHHHHHHHHHHhhcCCCCCceecC Q lcl|NC_020414. 11 QRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNET--SQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVD 88 (515) Q Consensus 11 ~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~--~~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~ 88 (515) =|++|++||++|| |++||++|+||++||+|+++++++++.+ ..++|||||++|+++|||||||+||||++|||||+ T Consensus 1 mk~~~~~~~~~lk--R~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 78 (510) T protein:vir:63 1 MKTTAAMLWEKLR--DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSE 78 (510) T ss_pred ChhHHHHHHHHHh--ccchHHHHHHHHHhhccccCCCCCCccccccCCCccchHHHHHHHHHHHHHhhhcCCCCcccccC Confidence 5789999999995 9999999999999999999988875543 46899999999999999999999999999999999 Q ss_pred CChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCc-EEEEEcceEEEeeC Q lcl|NC_020414. 89 LTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKGA-MSAVPMHHYVVNRD 167 (515) Q Consensus 89 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~~-~r~~pl~~y~v~~d 167 (515) ++|..+++.+......+++++||++||+.++.+|++||||.++|++|+||++|||+|+|++++.. |++|||++|||++| T Consensus 79 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~~~~~~~~~~~pl~~y~v~~d 158 (510) T protein:vir:63 79 LTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRDSDAATVVAWSLRSYAVRRD 158 (510) T ss_pred CChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEcCCCcEEEEEEcceeEEeeC Confidence 99999999888888999999999999999999999999999999999999999999999998764 99999999999999 Q ss_pred CCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCC---CCeEEEEEeCCeeecccCCccccc Q lcl|NC_020414. 168 TNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGE---GFWKINQSADDIPVGKENRIKAEK 244 (515) Q Consensus 168 ~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~---~~~~~~~e~~~~~i~~esgy~~~~ 244 (515) ++|+||+||||++||+++|+++|+.+..+. ..+++++++|+|||+|+|+++ +|||||+|++|++++.+|+|+.++ T Consensus 159 ~~G~vd~i~rr~~~t~~~l~e~~~~~~~~~--~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~~~~~~~~~~~e 236 (510) T protein:vir:63 159 ATGRWMDIVLKQRYKSKDLDEEYKQDLMRA--GRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGKEGRWPIHL 236 (510) T ss_pred CCcCeeEEEeeeeccHHHHhHHhhhhhhcc--ccccCCCcceEEEEEEEeecCCCceEEEEEEEecCceecccccccccc Confidence 999999999999999999999999776543 356789999999999999765 489999999999999999998899 Q ss_pred CcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCCcceecCCccccc Q lcl|NC_020414. 245 LPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGVEEDIH 324 (515) Q Consensus 245 ~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~~g~~~~v~ 324 (515) |||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|||+|+|+|+++|+++.++++|++++|++++++ T Consensus 237 ~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~~~~~~~g~~v~g~~~~v~ 316 (510) T protein:vir:63 237 CPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQDAEMGDYVPGGAEAVR 316 (510) T ss_pred CceeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhhccCCCceeecCCcccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH-- Q lcl|NC_020414. 325 IVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ-- 402 (515) Q Consensus 325 ~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~-- 402 (515) ++++++++||+.+++.|++++++|+++||++ +.++++++||||||++|++|++++||||||||+.|||.|||+|++. T Consensus 317 ~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~-l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il 395 (510) T protein:vir:63 317 AYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-ANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEV 395 (510) T ss_pred eeecCcccchHHHHHHHHHHHHHHHHHHHhh-cccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999986 8899999999999999999999999999999999999999999753 Q ss_pred --hcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCC-chhccCCHHH Q lcl|NC_020414. 403 --EAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISA-ELPFLKSEEE 479 (515) Q Consensus 403 --~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gv-p~~~irs~ee 479 (515) ..++++|++.+++.+|+|+++|+|+|+++++.++.++++.+.++ +++.++||+|++++++++++|| |..|+||+|| T Consensus 396 ~r~gl~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~-aq~~~~id~d~~~~~~a~~~Gv~p~~ivrs~ee 474 (510) T protein:vir:63 396 DDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPI-AQLDPRISLPKMMDTIWAAFSVDTSQFYKSADE 474 (510) T ss_pred HhccCCCCCchhcccceecchhHHHHHHHHHHHHHHHHHHHHhcCc-hhhhccCCHHHHHHHHHHHhCCChhHhcCCHHH Confidence 34666777889999999999999999999999999999988775 5588999999999999999999 5789999999 Q ss_pred HHHHHHHHHHHHHHH-----HHHHHhhhhccchhhhh Q lcl|NC_020414. 480 MQQEMAQQAQAQQEA-----MLNEGVAKAVPGVIQQE 511 (515) Q Consensus 480 v~~~rq~~~~~~q~~-----~~~~~~~~a~~~~~~~~ 511 (515) |+++|++++++++++ .+++++++++...+| . T Consensus 475 v~a~~~~~~qq~~~~~~~~~~~~~~a~~~~~~~~g-~ 510 (510) T protein:vir:63 475 LQAEAEQQRQQAAQAQAAQETLLEGASDMTNALAG-V 510 (510) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccC-C Confidence 999987643322222 234444444333333 2 No 12 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=100.00 E-value=4.4e-163 Score=910.67 Aligned_cols=496 Identities=26% Similarity=0.444 Sum_probs=457.9 Q ss_pred HHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCC----ccccccccccHHHHHHHHHHHHHHhhcCCCCCceecC Q lcl|NC_020414. 13 SKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDN----ETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVD 88 (515) Q Consensus 13 ~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~----~~~~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~ 88 (515) =++++||++|+++|++|+++|+||++||+|++++++++. .+..++|||||++|+++|||||||+||||++|||||. T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~ 80 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPNHKSLTVPWQSVGAKCCVTLAAKLMLAVLPPQTSFFKLQ 80 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 358899999999999999999999999999998876432 3446899999999999999999999999999999999 Q ss_pred CChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCcEEEEEcceEEEeeCC Q lcl|NC_020414. 89 LTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKGAMSAVPMHHYVVNRDT 168 (515) Q Consensus 89 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~~~r~~pl~~y~v~~d~ 168 (515) ++++.+.+. ..+...+++++||++||++++.+|++||||.++|++|+||++|||||+|++++ +|++|||++|||++|+ T Consensus 81 ~~d~~l~~~-~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~-~~~~~pl~~y~v~~d~ 158 (522) T protein:vir:10 81 VRDDKLGEE-LDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFMGKD-GLKTFPLTRYVINRDG 158 (522) T ss_pred CChHHHhhh-cChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEEcCC-CceEEEcceEEEeeCC Confidence 999887664 23456788999999999999999999999999999999999999999999876 6999999999999999 Q ss_pred CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCC-CCeEEEEEeCCeee-cc--cCCccccc Q lcl|NC_020414. 169 NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGE-GFWKINQSADDIPV-GK--ENRIKAEK 244 (515) Q Consensus 169 ~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~-~~~~~~~e~~~~~i-~~--esgy~~~~ 244 (515) +|+||+|||||+||+++|+++||.+..+....++++++++|+|||||+|+++ ++++||++++|+.+ +. +||| ++ T Consensus 159 ~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~~~~~~s~~g~--~~ 236 (522) T protein:vir:10 159 DGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKSSGRWVWHQEAFDKIIPDSRSTAPK--NA 236 (522) T ss_pred CCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeeccCCceEEEEccCCcccccccccccc--cc Confidence 9999999999999999999999998877766677889999999999999866 57899999988754 44 5566 77 Q ss_pred CcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCCcceecCCccccc Q lcl|NC_020414. 245 LPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGVEEDIH 324 (515) Q Consensus 245 ~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~~g~~~~v~ 324 (515) |||+++||++.+||+||||||+++|||+|+||.|+++++++++++++|||+++|+|++++.++.++++|.+++|.++++. T Consensus 237 ~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~l~~~~~~~~v~g~~~~v~ 316 (522) T protein:vir:10 237 SPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPATIAKAGNGAIVQGRPEDVA 316 (522) T ss_pred CCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeccccccccccccCCCCcceecCCCccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH-- Q lcl|NC_020414. 325 IVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ-- 402 (515) Q Consensus 325 ~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~-- 402 (515) ++++++++||+.+++.|++++++|+++||+ +.++|+++||||||++|++|++++||||||||+.|||.|||+|++. T Consensus 317 ~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~--~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il 394 (522) T protein:vir:10 317 VIQVGKTADFSTAANMATAIEKRLLEAFLV--MNVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLLVL 394 (522) T ss_pred eecccccccchHHHHHHHHHHHHHHHHHhh--ccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999985 5689999999999999999999999999999999999999999754 Q ss_pred ---hcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCc-hhccCCHH Q lcl|NC_020414. 403 ---EAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAE-LPFLKSEE 478 (515) Q Consensus 403 ---~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp-~~~irs~e 478 (515) +.+|++|++++++.+|+|++||+|+|++++|.+|++.+++++ .||.++|+||+|++++.+++++||| +.++||+| T Consensus 395 ~r~g~lP~~p~~~~~~~~v~~is~Laraq~~~~l~~~~~~i~~~~-~p~~~~~~id~d~~~~~~a~~~Gvp~~~ivrt~e 473 (522) T protein:vir:10 395 QRSNQIPKLPKDIVRPTIVAGVNALGRGQDRESLTAFVGTIAQTL-GPEALMQYLNPLEAIKRLAAAQGIDVLNLVKTEQ 473 (522) T ss_pred HhcCCCCCCCccccccccccchhHHHHHHHHHHHHHHHHHHHHhh-CchhhhhcCCHHHHHHHHHHHhCCChhhhcCCHH Confidence 789999999999999999999999999999999998887665 3777889999999999999999999 68999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhccchhh--hhhccC Q lcl|NC_020414. 479 EMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQ--QEMKEG 515 (515) Q Consensus 479 ev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~--~~~~~~ 515 (515) ||+++||++++++|+++++++|++.++.+++ ++++++ T Consensus 474 ev~~~~q~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~~~ 512 (522) T protein:vir:10 474 QLAEEQQAAQQQAAQQSLVDQAGQMTGSPLMDPTKNPQL 512 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccccCccccHHH Confidence 9999999999999999999999988888775 455666 No 13 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=100.00 E-value=1.4e-162 Score=907.86 Aligned_cols=508 Identities=30% Similarity=0.451 Sum_probs=461.6 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCcc--ccccccccHHHHHHHHHHHHHHhhc Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNET--SQNGWQGVGAQATNHLANKLAQVLF 78 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~--~~~~~dst~~~a~~~Laa~l~s~lt 78 (515) ||+|+ +.|+.+++|++||++|+++|++|+++|+||++||+|+++++++++.+ ..++|||||++|+++|||||||+|| T Consensus 1 m~~~~-~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 79 (535) T protein:vir:33 1 MADSK-RTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALF 79 (535) T ss_pred CChhh-hhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhc Confidence 99999 88999999999999999999999999999999999999988876544 3679999999999999999999999 Q ss_pred CCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC----cE Q lcl|NC_020414. 79 PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG----AM 154 (515) Q Consensus 79 pp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~----~~ 154 (515) |+ +|||||+++|..+++++..+...++++.||++||++|+.+|++||||.++|++|+||++|||+|+|++++. .| T Consensus 80 P~-~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f 158 (535) T protein:vir:33 80 PM-QSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSYNPM 158 (535) T ss_pred CC-CcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCceee Confidence 86 79999999999999999899999999999999999999999999999999999999999999999997663 38 Q ss_pred EEEEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCC-CCeEEEEEeCCee Q lcl|NC_020414. 155 SAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGE-GFWKINQSADDIP 233 (515) Q Consensus 155 r~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~-~~~~~~~e~~~~~ 233 (515) ++|||++|||++|++|+||+|||||+||+++|+++|+.+..+.. .++++++++++||||+++++ +.|++|++++|.. T Consensus 159 ~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~--~~k~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~ 236 (535) T protein:vir:33 159 KLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSG--GEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVE 236 (535) T ss_pred EEEEcCeeEEeeCCCCCeeEEEeeEeecHHHHHHHhhhhhcccc--cccccccCCeEEEEEEeeCCCCcEEEEEEEeCcc Confidence 89999999999999999999999999999999999998766533 35678889999999999765 5688999999987 Q ss_pred e-cccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCC Q lcl|NC_020414. 234 V-GKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGT 312 (515) Q Consensus 234 i-~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~ 312 (515) + +.+|+|+.++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||+++|+|++++.++.++++ T Consensus 237 ~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~~~~~~~ 316 (535) T protein:vir:33 237 IDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQT 316 (535) T ss_pred ccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCCc Confidence 6 688888778999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_020414. 313 GEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTM 392 (515) Q Consensus 313 g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~ 392 (515) |.+++|.+++++++++++++||+.+++.|++++++|+++||++++.++++++||||||++|++|++++|||||+||+.|| T Consensus 317 g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~El 396 (535) T protein:vir:33 317 GDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQEL 396 (535) T ss_pred eeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHH-----hcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHh-cCCHHHHHHHHHHh Q lcl|NC_020414. 393 QTPIAMWGLQ-----EAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQR-AIRWGDYMDWVRGQ 466 (515) Q Consensus 393 l~Pli~r~~~-----~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d-~id~d~~~~~~a~~ 466 (515) |.|||+|++. +.+|++|+++++++++++|++++|++++++|.+|+ +.++++.|+++| +||+|+++++++++ T Consensus 397 l~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~---~~la~~~P~~~d~~id~d~~~~~~a~~ 473 (535) T protein:vir:33 397 QLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCI---SAWAALAPMQGDPDINLAVIKLRIANA 473 (535) T ss_pred HHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHH---HHHHhhChhhhhccCCHHHHHHHHHHH Confidence 9999999753 78999999998888888887777777766666655 556778889998 59999999999999 Q ss_pred cCCchh-ccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchh--hhhhccC Q lcl|NC_020414. 467 ISAELP-FLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVI--QQEMKEG 515 (515) Q Consensus 467 ~Gvp~~-~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~--~~~~~~~ 515 (515) +|||++ |+||+|||+++++|+++++|++++++++++.+.+.+ +-+.++| T Consensus 474 ~Gvp~~~i~~~~ee~~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 525 (535) T protein:vir:33 474 IGIDTSGILLTDEQKQALMMQDAAQTGVENAAAAGGAGVGALATSSPEAMQG 525 (535) T ss_pred cCCCHhHhcCCHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhhcCChhHHH Confidence 999975 999999999999998887777777666555443332 2233333 No 14 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=100.00 E-value=3e-162 Score=906.14 Aligned_cols=499 Identities=24% Similarity=0.418 Sum_probs=461.0 Q ss_pred cHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCcc--ccccccccHHHHHHHHHHHHHHhhcCCCCCceecC Q lcl|NC_020414. 11 QRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNET--SQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVD 88 (515) Q Consensus 11 ~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~--~~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~ 88 (515) =|+++++||++|+++|++||++|+||++||+|+++++++++.. ..++|||||++|+++|||||||+||||++|||||. T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF~l~ 80 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLPYLLTEDGHASGGRLQQPYQSLGSKGVNALSSKLMLSLFPIQTSFFKLQ 80 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 4678899999999999999999999999999999988876543 46899999999999999999999999999999999 Q ss_pred CChHHHhhhhcc-chhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCcEEEEEcceEEEeeC Q lcl|NC_020414. 89 LTAKGEKVLDDR-GLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKGAMSAVPMHHYVVNRD 167 (515) Q Consensus 89 ~~d~~~~~~~~~-~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~~~r~~pl~~y~v~~d 167 (515) ++|..+.+..+. +...++++.||++||++|+.+|++||||.++|++|+||++|||||+|++++ +|++|||++|+|++| T Consensus 81 ~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~-~~~~~pl~~y~v~~d 159 (542) T protein:vir:78 81 INDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFAGKK-TLKVYPLDRYVIERD 159 (542) T ss_pred CCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEecCC-CceEEecceeEEeeC Confidence 999998886553 445688999999999999999999999999999999999999999999875 699999999999999 Q ss_pred CCCCeeEEEEEEEecHHHHHHHhcccccch--hhhccCCCcccEEEEEEEEEcCC-----------CCeEEEEEeCCeee Q lcl|NC_020414. 168 TNGDLMDVILLQEKALRTFDPATRMAIEVG--MKGKKCKEDDNVKLYTHAQYAGE-----------GFWKINQSADDIPV 234 (515) Q Consensus 168 ~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~--~~~~~~~~~~~v~v~~~v~~~~~-----------~~~~~~~e~~~~~i 234 (515) ++|+||+|||||+||+++|+++||.+..+. +....++++..++++|+++|+++ ++||||++++|+++ T Consensus 160 ~~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~~~~~~~s~~~e~~g~~v 239 (542) T protein:vir:78 160 GDGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCKLVDGQHRWHQECDGKEI 239 (542) T ss_pred CCCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccccccCCCeEEEEEEeccccc Confidence 999999999999999999999999876554 33456788899999999998654 57999999999987 Q ss_pred ---cccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCC Q lcl|NC_020414. 235 ---GKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSG 311 (515) Q Consensus 235 ---~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~ 311 (515) +++||| ++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||+|+|+|++++.++.+++ T Consensus 240 ~~~~~e~g~--~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~~~~~~~~~~~ 317 (542) T protein:vir:78 240 KGSRSSSPL--KHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSPSATTKPQSLARAG 317 (542) T ss_pred ccccccccc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcccCC Confidence 789999 669999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHH Q lcl|NC_020414. 312 TGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMT 391 (515) Q Consensus 312 ~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E 391 (515) +|++++|++++++++++++++||+.+++.|++++++|+++||++ ..+|+++||||||++|++|++++||||||||+.| T Consensus 318 ~g~iv~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~~--~~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E 395 (542) T protein:vir:78 318 TGAIIQGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFLIL--NVRQSERTTATEVREVQMELDRQLSGIYGSLTVE 395 (542) T ss_pred CceeecCCccceeeeecccccchhHHHHHHHHHHHHHHHHhccc--ccCCcccccHHHHHHHHHHHHHHhhHHHHHHHHH Confidence 99999999999999999999999999999999999999999975 5789999999999999999999999999999999 Q ss_pred HHHHHHHHHH-----HhcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHh Q lcl|NC_020414. 392 MQTPIAMWGL-----QEAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQ 466 (515) Q Consensus 392 ~l~Pli~r~~-----~~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~ 466 (515) ||.|+|+|++ .+.+|++|++++++.++++|++|+|++++++|.+|++.|+++ ..||.++++||+|+++++++++ T Consensus 396 ~L~Pli~R~~~il~r~g~lP~~p~~lv~~~~~s~La~~~r~~~~~~l~~~~~~i~~~-~~p~~l~~~id~d~~~~~~a~~ 474 (542) T protein:vir:78 396 LLTPYLNRKLHLMQRSKQLPSLPKGLVMPTVVAGLGGVGRGEDRAALIEFMQTVGQA-MGPEALQQFIDPTEFLKRLAAA 474 (542) T ss_pred HHHHHHHHHHHHHHhcCCCCCCchhceeeeeechHHHHHHHHHHHHHHHHHHHHHHh-cCChhHHhcCCHHHHHHHHHHH Confidence 9999999975 478999999999999999999999999999999999999875 4588888999999999999999 Q ss_pred cCCch-hccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 467 ISAEL-PFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 467 ~Gvp~-~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) +|||+ .++||+||+++++++++++++++.++++++++++.++|+..++- T Consensus 475 ~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al~~~a~~~a~~~~~~~~~~~ 524 (542) T protein:vir:78 475 SGIDTLNLVKSPETMANEAQQAQQQQMTASLMGQAGQLAKSPIGEKMMQQ 524 (542) T ss_pred cCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhh Confidence 99995 69999999999999999999999999999988887776553322 No 15 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=100.00 E-value=4.2e-162 Score=905.29 Aligned_cols=507 Identities=30% Similarity=0.453 Sum_probs=458.5 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCcc--ccccccccHHHHHHHHHHHHHHhhc Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNET--SQNGWQGVGAQATNHLANKLAQVLF 78 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~--~~~~~dst~~~a~~~Laa~l~s~lt 78 (515) ||+|+ +.|+.+++|++||++|+++|++||++|+||++||+|++|++++++.+ ..++|||||++|+++|||||||+|| T Consensus 1 m~~~~-~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~lt 79 (535) T protein:vir:15 1 MADSK-RTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNLASKLMLALF 79 (535) T ss_pred CCccc-hhccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhc Confidence 99999 88999999999999999999999999999999999999988876544 3689999999999999999999999 Q ss_pred CCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC----cE Q lcl|NC_020414. 79 PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG----AM 154 (515) Q Consensus 79 pp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~----~~ 154 (515) |+ +|||||+++|..+++++..+...++++.||++||++|+.+|++||||.++|++|+||++|||+|+|++++. +| T Consensus 80 P~-~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f 158 (535) T protein:vir:15 80 PM-QSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSYNPM 158 (535) T ss_pred CC-CcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCceee Confidence 86 79999999999999988888999999999999999999999999999999999999999999999997653 38 Q ss_pred EEEEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCC-CCeEEEEEeCCee Q lcl|NC_020414. 155 SAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGE-GFWKINQSADDIP 233 (515) Q Consensus 155 r~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~-~~~~~~~e~~~~~ 233 (515) ++|||++|||++|++|+||+|||||+||+++|+++|+.++.+. ..+++++++|+|||||+++.+ ++|++|++++|.. T Consensus 159 ~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~--~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~ 236 (535) T protein:vir:15 159 KLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKA--GGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVE 236 (535) T ss_pred EEEEcCeeEEeeCCCCCeeEEEEeEeecHHHHHHHHhHhhhcc--ccccCCCCceeEEEEEEEecCCCcEEEEEEeeCcc Confidence 8999999999999999999999999999999999999876543 345678999999999999865 5799999999977 Q ss_pred e-cccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCC Q lcl|NC_020414. 234 V-GKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGT 312 (515) Q Consensus 234 i-~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~ 312 (515) + +.+|+|+.++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||+++|+|++++.++.++++ T Consensus 237 ~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~l~~~~~ 316 (535) T protein:vir:15 237 IDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLTKAQT 316 (535) T ss_pred ccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccchhcccCCc Confidence 6 677887778999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_020414. 313 GEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTM 392 (515) Q Consensus 313 g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~ 392 (515) |.+++|.+++++++++++++||+.+++.|++++++|+++||++++.++++++||||||++|++|++++|||||+||+.|| T Consensus 317 g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~El 396 (535) T protein:vir:15 317 GDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQEL 396 (535) T ss_pred eeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHH-----hcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHh-cCCHHHHHHHHHHh Q lcl|NC_020414. 393 QTPIAMWGLQ-----EAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQR-AIRWGDYMDWVRGQ 466 (515) Q Consensus 393 l~Pli~r~~~-----~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d-~id~d~~~~~~a~~ 466 (515) |.|||+|++. +.+|++|+++++++++++|++++|++++++|.+|+ +.++++.|++++ +||+|+++++++++ T Consensus 397 l~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~---~~la~~~P~~ld~~id~d~~~~~~a~~ 473 (535) T protein:vir:15 397 QLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCI---SAWAALAPMQGDPDINLAVIKLRIANA 473 (535) T ss_pred HHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHH---HHHHhcChhhhhccCCHHHHHHHHHHH Confidence 9999999753 78999999998888888887777777766666655 556778889888 59999999999999 Q ss_pred cCCchh-ccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 467 ISAELP-FLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 467 ~Gvp~~-~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) +|||++ |+||+|||+++++|+++++++++++.++++.+.... .++.|+ T Consensus 474 ~Gvp~~~i~~~~eev~~~~~q~~~~~~~~~~a~~~g~~~~~~~-~~~p~~ 522 (535) T protein:vir:15 474 IGIDTSGILLTDEQKQALMMQDAAQTGIENAAATGGAGVGALA-TSSPEA 522 (535) T ss_pred cCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccchh-ccChHH Confidence 999975 999999999999988777777766665554432221 111111 No 16 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=100.00 E-value=1.5e-161 Score=902.29 Aligned_cols=498 Identities=27% Similarity=0.381 Sum_probs=439.1 Q ss_pred cHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCC----CccccccccccHHHHHHHHHHHHHHhhcCCCCCcee Q lcl|NC_020414. 11 QRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGD----NETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFR 86 (515) Q Consensus 11 ~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~----~~~~~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFr 86 (515) =++++.+.|. |.+|++||++|+||++||+|+++.+..+ ..+..++|||||++|+++|||||||+||||++|||| T Consensus 1 m~~~~~~l~~--k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 78 (514) T protein:vir:80 1 MRQQASAMWA--EYRDSTAIRKAEDFAKFTIASLMVDPLDKTHQAEVVEYDFQSAGAFLVNNLTAKLALTLFPPGRPSFQ 78 (514) T ss_pred CccchHHHHH--HhhcchHHHHHHHHHHHhcccccCCCCCCcccccccccccchhHHHHHHHHHHHHHhhhcCCCCcccc Confidence 1223344444 6689999999999999999998754332 223468899999999999999999999999999999 Q ss_pred cCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCC-CcEEEEEcceEEEe Q lcl|NC_020414. 87 VDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSK-GAMSAVPMHHYVVN 165 (515) Q Consensus 87 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~-~~~r~~pl~~y~v~ 165 (515) |+++|...+.....+.+.+++++||++||++|+.+|++||||.++|++|+||++|||||+|++++ .+|++|||++|||+ T Consensus 79 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~pl~~y~v~ 158 (514) T protein:vir:80 79 IELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYREPGTGKMLVWTMQSYTVR 158 (514) T ss_pred cccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEecCCCcEEEEEcCeEEEe Confidence 99999888777778888999999999999999999999999999999999999999999999775 46999999999999 Q ss_pred eCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCC---CCeEEEEEeCCeeecccCCccc Q lcl|NC_020414. 166 RDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGE---GFWKINQSADDIPVGKENRIKA 242 (515) Q Consensus 166 ~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~---~~~~~~~e~~~~~i~~esgy~~ 242 (515) +|++|+||+||||++||+++|+++|+...... ..+++++++|+|||||+++++ +||+||++++|++++++|||+. T Consensus 159 ~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~--~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~g~~i~~es~y~~ 236 (514) T protein:vir:80 159 RTSHGDPAVVVLRQQMPFRELTPEIQADAQAK--QIAKRDSDKCDLYTVIEWQPTPNGKRCAVWHELEGKRVGPESSYPA 236 (514) T ss_pred eCCCcCeEEEEeeeeecHHHhhhhhhhhhhhh--hccCCCCCceEEEEEEEeecCCCCeEEEEEEeccceeecccCcccc Confidence 99999999999999999999999998765533 235678889999999999754 5799999999999999999998 Q ss_pred ccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCCcceecCCccc Q lcl|NC_020414. 243 EKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGVEED 322 (515) Q Consensus 243 ~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~~g~~~~ 322 (515) ++|||+++||++.+||+||||||+++|||+|+||+|++..+++++++++|||+++|+|++++.++.++++|++++|++++ T Consensus 237 ~e~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~~~~~~l~~~~~g~~v~g~~~~ 316 (514) T protein:vir:80 237 HLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNLVDEAKGGAVDDYRDAETGDFVPGQVGS 316 (514) T ss_pred ccCCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCcccccchhhhcccCCceeecCCCcc Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 323 IHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ 402 (515) Q Consensus 323 v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~ 402 (515) ++++++++++||+.+++.|++++++|+++||++. ..+++++||||||++|++||+++||||||||+.|||.|||+|++. T Consensus 317 v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFml~~-~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~ 395 (514) T protein:vir:80 317 VASYERGDYNKIAQASASVESIVMRLNRAFMYTG-QVRDAERVTVEEIRTVAEEAENLLGGVYSLLAETLQAPLAYLTMY 395 (514) T ss_pred ceeeecCcccchHHHHHHHHHHHHHHHHHHhhhc-cCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999764 458999999999999999999999999999999999999999742 Q ss_pred -------hcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchh-cc Q lcl|NC_020414. 403 -------EAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELP-FL 474 (515) Q Consensus 403 -------~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~-~i 474 (515) +.+|++|++++++++|+++++|+|++++++|.+++++++.+++++|+++|+||+|++++++|+++|||++ ++ T Consensus 396 il~r~~~g~lP~~p~~l~~~~~vs~la~l~r~~~~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~i~ 475 (514) T protein:vir:80 396 EASRGNGGMLLGIAQGVYRPSIITGIPALTRNIETANILRATQEASAIVPALVQLSKRFDPEKLVERIFANNSVDLSTLS 475 (514) T ss_pred HHhhhccCCCCCCCchhhcceeeecHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHhCCCHhhcc Confidence 5789999999999999999999999999999999999999999999999999999999999999999986 66 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhcc Q lcl|NC_020414. 475 KSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKE 514 (515) Q Consensus 475 rs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~ 514 (515) +++|++++.+|++++++|.++.++++.++++- -++...- T Consensus 476 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 514 (514) T protein:vir:80 476 KDPDVVAAEAEQEAALAQQQLDVASGALAAET-SAGVLTS 514 (514) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hccccCC Confidence 66666665555544444433333333322211 1111111 No 17 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=100.00 E-value=3.6e-160 Score=894.74 Aligned_cols=503 Identities=29% Similarity=0.464 Sum_probs=454.2 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCcc--ccccccccHHHHHHHHHHHHHHhhc Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNET--SQNGWQGVGAQATNHLANKLAQVLF 78 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~--~~~~~dst~~~a~~~Laa~l~s~lt 78 (515) ||+ +.|+++++|++||++||++|++|+++|+||++||+|+++++++++.+ ..++|||||++|+++|||||||+|| T Consensus 1 ~~~---~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~lt 77 (522) T protein:vir:94 1 MAE---REGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQAVGARCLNNLAAKLMLALF 77 (522) T ss_pred Ccc---cchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHhhcC Confidence 988 66999999999999999999999999999999999999988776544 3579999999999999999999999 Q ss_pred CCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC-----c Q lcl|NC_020414. 79 PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG-----A 153 (515) Q Consensus 79 pp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~-----~ 153 (515) | ++|||||.+++..++++...+...+++++||++||++|+.+|++||||.++|++|+||++||||++|++++. + T Consensus 78 P-~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~ 156 (522) T protein:vir:94 78 P-QSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPEQGTYSP 156 (522) T ss_pred C-CCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccCCCceee Confidence 7 679999999998888877778888999999999999999999999999999999999999999999986542 3 Q ss_pred EEEEEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCee Q lcl|NC_020414. 154 MSAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIP 233 (515) Q Consensus 154 ~r~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~ 233 (515) |++|||++|||++|++|+||+|||||++++++|+++|+.+.. .++++|+++|+|||+|+|++++ +++|++++|+. T Consensus 157 ~~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~----~~~~~p~~~v~v~~~v~~~~~~-~~~~~~~~g~~ 231 (522) T protein:vir:94 157 MRMYRLVSYVVQRDAFGNILQIVTIDKVAFSALPEDVKSQLN----ADDYEPDTELEVYTHIYRQDDE-YLRYEEVEGIE 231 (522) T ss_pred EEEEEcceEEEeeCCCcCeEEEeeeeeccHHhcchHHHHHHh----cccCCccceEEEEEEEEeeCCc-eeEEeeccCce Confidence 889999999999999999999999999999999999997642 2456789999999999998775 67888999987 Q ss_pred e-cccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCC Q lcl|NC_020414. 234 V-GKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGT 312 (515) Q Consensus 234 i-~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~ 312 (515) + +++|+|+.++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||+++|+|+++++++.++++ T Consensus 232 ~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~~~~~~ 311 (522) T protein:vir:94 232 VTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQPRRLNKAAT 311 (522) T ss_pred ecccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccchheeccCC Confidence 6 778877778999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_020414. 313 GEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTM 392 (515) Q Consensus 313 g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~ 392 (515) |.+++|.+++++++++++++||+.++..|++++++|+++||++++.++++++||||||++|++|++++|||||+||+.|| T Consensus 312 g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~ 391 (522) T protein:vir:94 312 GEFVAGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNSAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQEL 391 (522) T ss_pred ceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHH-----hcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHH-hcCCHHHHHHHHHHh Q lcl|NC_020414. 393 QTPIAMWGLQ-----EAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQ-RAIRWGDYMDWVRGQ 466 (515) Q Consensus 393 l~Pli~r~~~-----~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~-d~id~d~~~~~~a~~ 466 (515) |.|||+|++. +.+|++|+++++++++++|++++|++++++|.+|++.+ +++.|+++ ++||+|++++.++++ T Consensus 392 l~Pli~r~~~il~r~g~lP~~p~~~v~v~~~s~La~~qr~~~~~~l~~~~~~i---a~l~P~~~~~~id~d~~~~~~a~~ 468 (522) T protein:vir:94 392 QLPIVRVLMNQLQSAGMIPDLPKEAVEPTVSTGLEALGRGQDLEKLTQAVNMM---TGLQPLSQDPDINLPTLKLRLLNA 468 (522) T ss_pred HHHHHHHHHHHHHhcCCCCCCCcccEEeeEecHHHHHHHHHHHHHHHHHHHHH---HhccchhhhhcCCHHHHHHHHHHH Confidence 9999999753 78999999999999999999998888888888877655 56677876 589999999999999 Q ss_pred cCC-chhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhc----cchhhhhhccC Q lcl|NC_020414. 467 ISA-ELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAV----PGVIQQEMKEG 515 (515) Q Consensus 467 ~Gv-p~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~----~~~~~~~~~~~ 515 (515) +|| |+.++||++|++++|+|+++++++++++.++++.. +...+.+|.+| T Consensus 469 ~Gv~~~~ivr~~ee~~~~~~q~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 522 (522) T protein:vir:94 469 LGIDTAGLLLTQDEKIQRMAEQSSQQAVVQGASAAGANMGAAVGQGAGEDMAQA 522 (522) T ss_pred cCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccchhhhcC Confidence 999 56799999999999998777666665544443332 33334555555 No 18 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=100.00 E-value=5.5e-160 Score=893.71 Aligned_cols=501 Identities=13% Similarity=0.095 Sum_probs=440.4 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCC------CC--CCccccccccccHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNN------KG--DNETSQNGWQGVGAQATNHLANK 72 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~------~~--~~~~~~~~~dst~~~a~~~Laa~ 72 (515) |...-.++ +++|++||++|+++|++||++|+||++||+|++... ++ ++....++|||||++|+++|||| T Consensus 1 m~~d~~~~---~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAs~ 77 (549) T protein:vir:10 1 MTNDDAKI---LQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFDSTAPLALRNFVAA 77 (549) T ss_pred CCcchHHH---HHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccccccccchHHHHHHHHHHH Confidence 77654333 589999999999999999999999999999986321 11 22334689999999999999999 Q ss_pred HHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHH--HhcCCHHHHHHHHHHHHhhCceEEEEeC Q lcl|NC_020414. 73 LAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKAL--EQRQFRPAIVEVFKHLIVAGNCLLYKPS 150 (515) Q Consensus 73 l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l--~~snf~~~~~~~~~dl~~~G~~~l~~d~ 150 (515) ||++||||++|||||.++++.+.+ ..+++.||++||+.++..+ ++||||.++|++|+||++|||||+|+++ T Consensus 78 l~~~ltpp~~~wF~l~~~~~~~~e-------~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~ 150 (549) T protein:vir:10 78 MDSMITPATQLWHRLKTGNDALNE-------IASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEH 150 (549) T ss_pred HHhhccCCCCccccccCCccchhh-------hhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEee Confidence 999999999999999999877644 4689999999999999965 5899999999999999999999999987 Q ss_pred CC--c--EEEEEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccch--hhhccCCCcccEEEEEEEEEcCC---- Q lcl|NC_020414. 151 KG--A--MSAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVG--MKGKKCKEDDNVKLYTHAQYAGE---- 220 (515) Q Consensus 151 ~~--~--~r~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~--~~~~~~~~~~~v~v~~~v~~~~~---- 220 (515) +. + |++|||++|||++|++|+||+|||||+||++||+++||.+..+. +...+++|+++|+|||+|+|+.+ T Consensus 151 ~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~~V~pr~~~~~~ 230 (549) T protein:vir:10 151 DVGKGIVYRNVPMQRLWFAENNSGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADRDPR 230 (549) T ss_pred cCCCeeEEEEEEcCeEEEeeCCCCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEEEeecCCCCCcc Confidence 64 2 67899999999999999999999999999999999999875543 33456788999999999998643 Q ss_pred -------CCeEEEEEeCCeeecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|NC_020414. 221 -------GFWKINQSADDIPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIK 293 (515) Q Consensus 221 -------~~~~~~~e~~~~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~ 293 (515) +|.|||++.++.++++|||| ++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|| T Consensus 231 ~~~~~~~pf~sv~~e~~~~~il~esg~--~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~ 308 (549) T protein:vir:10 231 KLDGRNMQFASYWLDEGRDRIVQNSGF--RTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPP 308 (549) T ss_pred ccccccCceEEEEEEecCCEeeccCCc--ccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Confidence 46799999999999999999 569999999999999999999999999999999999999999999999999 Q ss_pred eeecCccccChhhccCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhh-ccCCCCCCCHHHHHH Q lcl|NC_020414. 294 YLIRPGSQTDVDHFVNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETM-TRRDAERVTAVEIQR 372 (515) Q Consensus 294 ~l~~~~g~~~~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l-~~~~~~~~TAtEi~~ 372 (515) |+++++|.+++.++.+++.+.+..|..++....+++++++|+.+++.|++++++|+++||.+.+ .++++++||||||++ T Consensus 309 ~~v~~~g~~~~~~l~pgg~~~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV~~ 388 (549) T protein:vir:10 309 LLANEDGVLDGFDLRSGALNWGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQ 388 (549) T ss_pred eeeccccccccceeccCCccccccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHHHH Confidence 9999999999999998888877766555555555667789999999999999999999999874 568999999999999 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHH-----HhcCCCCChhhccce---eeeehHHHHHHHHHH---HHHHHHHHHH Q lcl|NC_020414. 373 DALEIEQNMGGVYSLFAMTMQTPIAMWGL-----QEAGDSFTSELVDPV---IVTGIEALGRMAELD---KLANFAQYMS 441 (515) Q Consensus 373 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~-----~~~~~~~p~~~~~~~---~v~~l~~l~ra~~~~---~l~~~~~~v~ 441 (515) |++|++++|||||+||+.|||.|||+|++ .|.+|++|++++.+. .|+|++||+|+|+.+ ++.+++++++ T Consensus 389 r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i~yis~La~aq~~~~~~~i~~~~~~~~ 468 (549) T protein:vir:10 389 RAQEKGVLLAPTLGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQLG 468 (549) T ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEEEeecHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999975 478999999987543 488899988888865 4568888888 Q ss_pred HhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 442 LPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 442 ~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) ++++++|+++|+||+|++++++++++|||+++|||++||+++|+++++++|++++++++.++++ ++.++.++ T Consensus 469 ~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~~qqq~~~~~~~a~~a~~--~a~~~~~~ 540 (549) T protein:vir:10 469 IVSQFDPAAAKVPNGARIARLLADYGGVPVEAMSTDEELQAQQAAEAQAAQMQQMLAAAPVAAG--AIKDLSDA 540 (549) T ss_pred HHhccChhHHhcCCHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHhhhhh Confidence 9999999999999999999999999999999999999999999999999998888777766554 33333333 No 19 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=100.00 E-value=3.2e-159 Score=889.53 Aligned_cols=499 Identities=12% Similarity=0.105 Sum_probs=439.3 Q ss_pred ccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccc---cCCCCCC--ccccccccccHHHHHHHHHHHHHHhhcCC Q lcl|NC_020414. 6 LEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYL---MNNKGDN--ETSQNGWQGVGAQATNHLANKLAQVLFPA 80 (515) Q Consensus 6 ~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~---~~~~~~~--~~~~~~~dst~~~a~~~Laa~l~s~ltpp 80 (515) |....++++|++||++|+++|++||++|+||++||+|++ ++.++++ .+.+++|||||++|+++|||||||+|||| T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:98 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 566669999999999999999999999999999999984 3444433 33578999999999999999999999999 Q ss_pred CCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC--c--EEE Q lcl|NC_020414. 81 QRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG--A--MSA 156 (515) Q Consensus 81 ~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~--~--~r~ 156 (515) ++|||||++.|+++.+ ..++++||++||++|+.+|++||||.++|++|+||++|||||+|++++. + |++ T Consensus 81 ~~~WF~l~~~d~~l~e-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~ 153 (555) T protein:vir:98 81 ARPWFRLTTSIPELDE-------SAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHS 153 (555) T ss_pred CCcccccccCcccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEE Confidence 9999999999876643 5679999999999999999999999999999999999999999997653 3 557 Q ss_pred EEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchh--h-hccCCCcccEEEEEEEEEcCC-----------CC Q lcl|NC_020414. 157 VPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGM--K-GKKCKEDDNVKLYTHAQYAGE-----------GF 222 (515) Q Consensus 157 ~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~--~-~~~~~~~~~v~v~~~v~~~~~-----------~~ 222 (515) |||++|||++|++|+||+|||||+||+++|+++||.+..+.. . .++++++.+|+|||+|+|+.+ +| T Consensus 154 ~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~ 233 (555) T protein:vir:98 154 LTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAW 233 (555) T ss_pred eecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccce Confidence 999999999999999999999999999999999998765432 2 233444678999999998643 36 Q ss_pred eEEEEE--eCCeeecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCcc Q lcl|NC_020414. 223 WKINQS--ADDIPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGS 300 (515) Q Consensus 223 ~~~~~e--~~~~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g 300 (515) .|||++ +|++++++|||| ++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||++++++ T Consensus 234 ~s~~~~~~~d~~~vl~esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~ 311 (555) T protein:vir:98 234 KSVYFEPGADETRTLRESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSA 311 (555) T ss_pred EEEEEEeccCCccccccCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccc Confidence 678875 578899999999 5699999999999999999999999999999999999999999999999999999999 Q ss_pred ccChhhccCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH---hhccCCCCCCCHHHHHHHHHHH Q lcl|NC_020414. 301 QTDVDHFVNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME---TMTRRDAERVTAVEIQRDALEI 377 (515) Q Consensus 301 ~~~~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~---~l~~~~~~~~TAtEi~~r~~E~ 377 (515) .+++.++.+++.+.+.+|..++....++++++||+.+++.|++++++|+++||.+ ++.++++++||||||++|++|+ T Consensus 312 ~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~ 391 (555) T protein:vir:98 312 KNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEK 391 (555) T ss_pred ccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHH Confidence 9988888888888888888877666677888999999999999999999999876 6788999999999999999999 Q ss_pred HHHhhhhHHHHHHHHHHHHHHHHH-----HhcCCCCChhhcccee-eeehHHHHHHHHHH---HHHHHHHHHHHhhcCCh Q lcl|NC_020414. 378 EQNMGGVYSLFAMTMQTPIAMWGL-----QEAGDSFTSELVDPVI-VTGIEALGRMAELD---KLANFAQYMSLPQTWPE 448 (515) Q Consensus 378 ~~~LGpv~~rl~~E~l~Pli~r~~-----~~~~~~~p~~~~~~~~-v~~l~~l~ra~~~~---~l~~~~~~v~~~a~~~p 448 (515) +++|||||+||+.|||.|||+|++ .+.+|++|+++..+.+ |+|+++|+|+|+.. +|.+++++++.+++++| T Consensus 392 ~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P 471 (555) T protein:vir:98 392 LLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKP 471 (555) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCCh Confidence 999999999999999999999974 3688999998887765 88888888888865 57788889999999999 Q ss_pred HHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 449 PAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 449 ~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) +++|+||+|++++++++++|||+++|||++||+++|+||++++|++++++.+.++++ .++.+.++ T Consensus 472 ~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~--~~~~~~~~ 536 (555) T protein:vir:98 472 EVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGAD--TAAKLGSV 536 (555) T ss_pred hhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHhccc Confidence 999999999999999999999999999999999999999988888776655544431 11222222 No 20 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=100.00 E-value=3.2e-159 Score=889.53 Aligned_cols=499 Identities=12% Similarity=0.105 Sum_probs=439.3 Q ss_pred ccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccc---cCCCCCC--ccccccccccHHHHHHHHHHHHHHhhcCC Q lcl|NC_020414. 6 LEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYL---MNNKGDN--ETSQNGWQGVGAQATNHLANKLAQVLFPA 80 (515) Q Consensus 6 ~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~---~~~~~~~--~~~~~~~dst~~~a~~~Laa~l~s~ltpp 80 (515) |....++++|++||++|+++|++||++|+||++||+|++ ++.++++ .+.+++|||||++|+++|||||||+|||| T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 566669999999999999999999999999999999984 3444433 33578999999999999999999999999 Q ss_pred CCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC--c--EEE Q lcl|NC_020414. 81 QRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG--A--MSA 156 (515) Q Consensus 81 ~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~--~--~r~ 156 (515) ++|||||++.|+++.+ ..++++||++||++|+.+|++||||.++|++|+||++|||||+|++++. + |++ T Consensus 81 ~~~WF~l~~~d~~l~e-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~ 153 (555) T protein:vir:10 81 ARPWFRLTTSIPELDE-------SAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHS 153 (555) T ss_pred CCcccccccCcccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEE Confidence 9999999999876643 5679999999999999999999999999999999999999999997653 3 557 Q ss_pred EEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchh--h-hccCCCcccEEEEEEEEEcCC-----------CC Q lcl|NC_020414. 157 VPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGM--K-GKKCKEDDNVKLYTHAQYAGE-----------GF 222 (515) Q Consensus 157 ~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~--~-~~~~~~~~~v~v~~~v~~~~~-----------~~ 222 (515) |||++|||++|++|+||+|||||+||+++|+++||.+..+.. . .++++++.+|+|||+|+|+.+ +| T Consensus 154 ~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~ 233 (555) T protein:vir:10 154 LTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAW 233 (555) T ss_pred eecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccce Confidence 999999999999999999999999999999999998765432 2 233444678999999998643 36 Q ss_pred eEEEEE--eCCeeecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCcc Q lcl|NC_020414. 223 WKINQS--ADDIPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGS 300 (515) Q Consensus 223 ~~~~~e--~~~~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g 300 (515) .|||++ +|++++++|||| ++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||++++++ T Consensus 234 ~s~~~~~~~d~~~vl~esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~ 311 (555) T protein:vir:10 234 KSVYFEPGADETRTLRESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSA 311 (555) T ss_pred EEEEEEeccCCccccccCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccc Confidence 678875 578899999999 5699999999999999999999999999999999999999999999999999999999 Q ss_pred ccChhhccCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH---hhccCCCCCCCHHHHHHHHHHH Q lcl|NC_020414. 301 QTDVDHFVNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME---TMTRRDAERVTAVEIQRDALEI 377 (515) Q Consensus 301 ~~~~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~---~l~~~~~~~~TAtEi~~r~~E~ 377 (515) .+++.++.+++.+.+.+|..++....++++++||+.+++.|++++++|+++||.+ ++.++++++||||||++|++|+ T Consensus 312 ~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~ 391 (555) T protein:vir:10 312 KNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEK 391 (555) T ss_pred ccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHH Confidence 9988888888888888888877666677888999999999999999999999876 6788999999999999999999 Q ss_pred HHHhhhhHHHHHHHHHHHHHHHHH-----HhcCCCCChhhcccee-eeehHHHHHHHHHH---HHHHHHHHHHHhhcCCh Q lcl|NC_020414. 378 EQNMGGVYSLFAMTMQTPIAMWGL-----QEAGDSFTSELVDPVI-VTGIEALGRMAELD---KLANFAQYMSLPQTWPE 448 (515) Q Consensus 378 ~~~LGpv~~rl~~E~l~Pli~r~~-----~~~~~~~p~~~~~~~~-v~~l~~l~ra~~~~---~l~~~~~~v~~~a~~~p 448 (515) +++|||||+||+.|||.|||+|++ .+.+|++|+++..+.+ |+|+++|+|+|+.. +|.+++++++.+++++| T Consensus 392 ~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P 471 (555) T protein:vir:10 392 LLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKP 471 (555) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCCh Confidence 999999999999999999999974 3688999998887765 88888888888865 57788889999999999 Q ss_pred HHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 449 PAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 449 ~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) +++|+||+|++++++++++|||+++|||++||+++|+||++++|++++++.+.++++ .++.+.++ T Consensus 472 ~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~--~~~~~~~~ 536 (555) T protein:vir:10 472 EVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGAD--TAAKLGSV 536 (555) T ss_pred hhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHhccc Confidence 999999999999999999999999999999999999999988888776655544431 11222222 No 21 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=100.00 E-value=3.2e-159 Score=889.53 Aligned_cols=499 Identities=12% Similarity=0.105 Sum_probs=439.3 Q ss_pred ccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccc---cCCCCCC--ccccccccccHHHHHHHHHHHHHHhhcCC Q lcl|NC_020414. 6 LEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYL---MNNKGDN--ETSQNGWQGVGAQATNHLANKLAQVLFPA 80 (515) Q Consensus 6 ~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~---~~~~~~~--~~~~~~~dst~~~a~~~Laa~l~s~ltpp 80 (515) |....++++|++||++|+++|++||++|+||++||+|++ ++.++++ .+.+++|||||++|+++|||||||+|||| T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 566669999999999999999999999999999999984 3444433 33578999999999999999999999999 Q ss_pred CCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC--c--EEE Q lcl|NC_020414. 81 QRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG--A--MSA 156 (515) Q Consensus 81 ~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~--~--~r~ 156 (515) ++|||||++.|+++.+ ..++++||++||++|+.+|++||||.++|++|+||++|||||+|++++. + |++ T Consensus 81 ~~~WF~l~~~d~~l~e-------~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~ 153 (555) T protein:vir:10 81 ARPWFRLTTSIPELDE-------SAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHS 153 (555) T ss_pred CCcccccccCcccccc-------hHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEE Confidence 9999999999876643 5679999999999999999999999999999999999999999997653 3 557 Q ss_pred EEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchh--h-hccCCCcccEEEEEEEEEcCC-----------CC Q lcl|NC_020414. 157 VPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGM--K-GKKCKEDDNVKLYTHAQYAGE-----------GF 222 (515) Q Consensus 157 ~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~--~-~~~~~~~~~v~v~~~v~~~~~-----------~~ 222 (515) |||++|||++|++|+||+|||||+||+++|+++||.+..+.. . .++++++.+|+|||+|+|+.+ +| T Consensus 154 ~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~ 233 (555) T protein:vir:10 154 LTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAW 233 (555) T ss_pred eecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccce Confidence 999999999999999999999999999999999998765432 2 233444678999999998643 36 Q ss_pred eEEEEE--eCCeeecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCcc Q lcl|NC_020414. 223 WKINQS--ADDIPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGS 300 (515) Q Consensus 223 ~~~~~e--~~~~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g 300 (515) .|||++ +|++++++|||| ++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||++++++ T Consensus 234 ~s~~~~~~~d~~~vl~esgy--~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~ 311 (555) T protein:vir:10 234 KSVYFEPGADETRTLRESGY--RSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSA 311 (555) T ss_pred EEEEEEeccCCccccccCCc--ccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccc Confidence 678875 578899999999 5699999999999999999999999999999999999999999999999999999999 Q ss_pred ccChhhccCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH---hhccCCCCCCCHHHHHHHHHHH Q lcl|NC_020414. 301 QTDVDHFVNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME---TMTRRDAERVTAVEIQRDALEI 377 (515) Q Consensus 301 ~~~~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~---~l~~~~~~~~TAtEi~~r~~E~ 377 (515) .+++.++.+++.+.+.+|..++....++++++||+.+++.|++++++|+++||.+ ++.++++++||||||++|++|+ T Consensus 312 ~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~ 391 (555) T protein:vir:10 312 KNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEK 391 (555) T ss_pred ccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHH Confidence 9988888888888888888877666677888999999999999999999999876 6788999999999999999999 Q ss_pred HHHhhhhHHHHHHHHHHHHHHHHH-----HhcCCCCChhhcccee-eeehHHHHHHHHHH---HHHHHHHHHHHhhcCCh Q lcl|NC_020414. 378 EQNMGGVYSLFAMTMQTPIAMWGL-----QEAGDSFTSELVDPVI-VTGIEALGRMAELD---KLANFAQYMSLPQTWPE 448 (515) Q Consensus 378 ~~~LGpv~~rl~~E~l~Pli~r~~-----~~~~~~~p~~~~~~~~-v~~l~~l~ra~~~~---~l~~~~~~v~~~a~~~p 448 (515) +++|||||+||+.|||.|||+|++ .+.+|++|+++..+.+ |+|+++|+|+|+.. +|.+++++++.+++++| T Consensus 392 ~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P 471 (555) T protein:vir:10 392 LLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKP 471 (555) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCCh Confidence 999999999999999999999974 3688999998887765 88888888888865 57788889999999999 Q ss_pred HHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 449 PAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 449 ~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) +++|+||+|++++++++++|||+++|||++||+++|+||++++|++++++.+.++++ .++.+.++ T Consensus 472 ~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~--~~~~~~~~ 536 (555) T protein:vir:10 472 EVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGAD--TAAKLGSV 536 (555) T ss_pred hhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHhccc Confidence 999999999999999999999999999999999999999988888776655544431 11222222 No 22 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=100.00 E-value=3.8e-159 Score=889.11 Aligned_cols=498 Identities=26% Similarity=0.439 Sum_probs=451.4 Q ss_pred cHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCccc--cccccccHHHHHHHHHHHHHHhhcCCCCCceecC Q lcl|NC_020414. 11 QRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETS--QNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVD 88 (515) Q Consensus 11 ~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~--~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~ 88 (515) =|+++++||++|+++|++||++|+||++||+|+++++++++++. .++|||||++|+++|||||||+||||++|||||. T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF~l~ 80 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNTSFFKLQ 80 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCCCCcccccc Confidence 46789999999999999999999999999999999888766543 5799999999999999999999999999999999 Q ss_pred CChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCcEEEEEcceEEEeeCC Q lcl|NC_020414. 89 LTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKGAMSAVPMHHYVVNRDT 168 (515) Q Consensus 89 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~~~r~~pl~~y~v~~d~ 168 (515) ++|+++++.........++++||++||++++.+|++||||.++|++|+||++|||+|+|++++ ++++|||++|||++|+ T Consensus 81 ~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~-~~~~~pl~~y~v~~d~ 159 (555) T protein:vir:17 81 INDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLYQGKK-NLKLYPLDRFVVSRDG 159 (555) T ss_pred cCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEecCC-ceeEEEcCeEEEeeCC Confidence 999999887777788899999999999999999999999999999999999999999999865 6999999999999999 Q ss_pred CCCeeEEEEEEEecHHHHHHHhcccccchh--h-----------------hccCCCcccEEEEEEEEEcCCCCeEEEEEe Q lcl|NC_020414. 169 NGDLMDVILLQEKALRTFDPATRMAIEVGM--K-----------------GKKCKEDDNVKLYTHAQYAGEGFWKINQSA 229 (515) Q Consensus 169 ~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~--~-----------------~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~ 229 (515) +|+||+|||||+||+++|+++||.+..+.. . ..+..++.++++|+++.++ +++++||+++ T Consensus 160 ~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~~~~~-~~~~~~~~e~ 238 (555) T protein:vir:17 160 EGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTYVCRK-DGQVKWHQEC 238 (555) T ss_pred CcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcceeEeeccccc-CCeeEEEEec Confidence 999999999999999999999997543211 1 1233455678899988774 5689999999 Q ss_pred CCeee---cccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhh Q lcl|NC_020414. 230 DDIPV---GKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDH 306 (515) Q Consensus 230 ~~~~i---~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~ 306 (515) +|+.+ +++||| ++|||+++||++.+|++||||||+++|||+|+||.|+++.+++++++++|||+++|+|++++.+ T Consensus 239 ~~~~v~~~l~e~g~--~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~lv~~~g~~~~~~ 316 (555) T protein:vir:17 239 DGKVIPGSNSSAPY--THNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVVFMVSPSATTKPQN 316 (555) T ss_pred CceeccccccccCc--ccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccCcce Confidence 99987 689999 5799999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHH Q lcl|NC_020414. 307 FVNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYS 386 (515) Q Consensus 307 ~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~ 386 (515) +.++++|.+++|.+++++++++++++||+.+++.|++++++|+++||+ +..+|+++||||||++|++|++++|||||+ T Consensus 317 l~~~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~--~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~ 394 (555) T protein:vir:17 317 LALAANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLM--LQVRQSERTTATEVQATVQELNEQIGGIYS 394 (555) T ss_pred eecCCCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhh--cCCCCcccchHHHHHHHHHHHHHHHhHHHH Confidence 999999999999999999999999999999999999999999999997 457899999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHH-----HhcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHH Q lcl|NC_020414. 387 LFAMTMQTPIAMWGL-----QEAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMD 461 (515) Q Consensus 387 rl~~E~l~Pli~r~~-----~~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~ 461 (515) ||+.|||.|||+|++ .+.+|++|++++++.+++++.+|.|+++.+++.+|++.++++.+ +|+++|+||+|++++ T Consensus 395 rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~~~l~~l~r~~~~~~l~~~~~~laq~~~-~p~~~d~id~d~~~~ 473 (555) T protein:vir:17 395 NLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVVAGLWGVGRGQDKQQLMEFITTLAQTMG-PEIAMKYINPTEFIK 473 (555) T ss_pred HHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhccceeehHHHHHHHHHHHHHHHHHHHHHhhcC-chhHhhcCCHHHHHH Confidence 999999999999975 47899999999999999999999999999888888877766544 799999999999999 Q ss_pred HHHHhcCC-chhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchh-hhhhccC Q lcl|NC_020414. 462 WVRGQISA-ELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVI-QQEMKEG 515 (515) Q Consensus 462 ~~a~~~Gv-p~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~-~~~~~~~ 515 (515) .+++++|| |..++||+|||+++||++++++|+++.+++++++++.+. .+.++++ T Consensus 474 ~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~~~qa~~~~~~~~~~~~~~~~ 529 (555) T protein:vir:17 474 RLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASLINQAGQLAKTPMAEQAMQLI 529 (555) T ss_pred HHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHhcc Confidence 99999999 568999999999999999888888887777776655433 2333333 No 23 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=100.00 E-value=3.1e-157 Score=878.59 Aligned_cols=493 Identities=14% Similarity=0.174 Sum_probs=429.8 Q ss_pred ccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCC--------CccccccccccHHHHHHHHHHHHHHhhcCCC Q lcl|NC_020414. 10 GQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGD--------NETSQNGWQGVGAQATNHLANKLAQVLFPAQ 81 (515) Q Consensus 10 ~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~--------~~~~~~~~dst~~~a~~~Laa~l~s~ltpp~ 81 (515) +++++|++||+.|+++|++||++|+||++||+|+++...++ ..+..++|||||++|+++|||||||+||||+ T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~ltPp~ 80 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLTSPA 80 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 89999999999999999999999999999999998653322 1234679999999999999999999999999 Q ss_pred CCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCC----Cc--EE Q lcl|NC_020414. 82 RSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSK----GA--MS 155 (515) Q Consensus 82 ~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~----~~--~r 155 (515) +|||||++.|.++. +.+++++||++||+.|+.+|++||||.++|++|+||++|||+++|++++ ++ |+ T Consensus 81 ~~WF~l~~~d~~~~-------~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~ 153 (547) T protein:vir:10 81 TKWFELAFRDKELN-------SDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQ 153 (547) T ss_pred CcccccccCCcccc-------chHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEE Confidence 99999999887654 3568999999999999999999999999999999999999999998644 23 67 Q ss_pred EEEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchh--hhccCCCcc---cEEEEEEEEEcC----------- Q lcl|NC_020414. 156 AVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGM--KGKKCKEDD---NVKLYTHAQYAG----------- 219 (515) Q Consensus 156 ~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~--~~~~~~~~~---~v~v~~~v~~~~----------- 219 (515) +|||++|||++|++|+||+|||+|+||++||+++||.+..+.. +..+.++++ ++++||+|+++. T Consensus 154 ~~pl~~~~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~~~ 233 (547) T protein:vir:10 154 SSPIQDSYFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAGTV 233 (547) T ss_pred EeecceEEEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCccccce Confidence 8999999999999999999999999999999999998765532 233444444 799999999864 Q ss_pred -----CCCeEEEEEeCC-eeecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|NC_020414. 220 -----EGFWKINQSADD-IPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIK 293 (515) Q Consensus 220 -----~~~~~~~~e~~~-~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~ 293 (515) ++|.|+|++.+| +++++|||| ++|||+++||++.+||+||||||+++|||+|+||.|+++.+++++++++|| T Consensus 234 ~~~~~~p~~s~~~e~~~~~~~l~esg~--~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp 311 (547) T protein:vir:10 234 LAPTERPFGKKWILKEGAVQLGEEGGY--YEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPA 311 (547) T ss_pred eeccccceeEEEEEecCceeeeecCCc--ccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Confidence 345689999886 789999999 569999999999999999999999999999999999999999999999999 Q ss_pred eeecCccccChhhccCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHH Q lcl|NC_020414. 294 YLIRPGSQTDVDHFVNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRD 373 (515) Q Consensus 294 ~l~~~~g~~~~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r 373 (515) |+++|+|++++.++. +.|.++.|..++++|++ +++||+.+++.|++++++|+++||.+.+.++++++||||||++| T Consensus 312 ~~v~~~g~~~~~~~~--pgg~~~~~~~~~v~pl~--~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~TAtEV~~r 387 (547) T protein:vir:10 312 IMVTERGLISDIDLG--ASGLTVVRDMESMKPFE--SRARFDVSSIQLTDLRSAVRRIYYVDQLQMKDSPAMTATEVQVR 387 (547) T ss_pred eecccccccccceec--CCeeeecCCcccceeee--cccchHHHHHHHHHHHHHHHHHhhhhhhhcCCCccccHHHHHHH Confidence 999999999986654 45666778889999876 55799999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhhhHHHHHHHHHHHHHHHHH-----HhcCCCCChhhccc----eeeeehHHHHHHHHHH---HHHHHHHHHH Q lcl|NC_020414. 374 ALEIEQNMGGVYSLFAMTMQTPIAMWGL-----QEAGDSFTSELVDP----VIVTGIEALGRMAELD---KLANFAQYMS 441 (515) Q Consensus 374 ~~E~~~~LGpv~~rl~~E~l~Pli~r~~-----~~~~~~~p~~~~~~----~~v~~l~~l~ra~~~~---~l~~~~~~v~ 441 (515) ++|++++|||||+||+.|||.|||.|++ .+.+|++|++++++ ..|+|++||+|+|+.+ +|.+++++++ T Consensus 388 ~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v~ 467 (547) T protein:vir:10 388 YELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGSTA 467 (547) T ss_pred HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999974 37899999998754 3589999999998765 5668888888 Q ss_pred HhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchh----hh-hhccC Q lcl|NC_020414. 442 LPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVI----QQ-EMKEG 515 (515) Q Consensus 442 ~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~----~~-~~~~~ 515 (515) ++++++|+++|+||+|++++.+++++|||+++|||++||+++|+||++++|+++++..+.++-..+. |+ ..+|- T Consensus 468 ~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~qaa~~~~~g~~m~~~~~~~a~~~~~ 546 (547) T protein:vir:10 468 QLAEINPEVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVTSIRKNRSQTQQKAEQAAIAEAEGNAMEAQGKGQAALKEN 546 (547) T ss_pred HhhccChhhhhcCCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccchhcc Confidence 9999999999999999999999999999999999999999999999988887665544432211111 11 11222 No 24 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=100.00 E-value=2.6e-156 Score=873.57 Aligned_cols=499 Identities=9% Similarity=0.078 Sum_probs=431.3 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCC-----ccccccccccHHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDN-----ETSQNGWQGVGAQATNHLANKLAQ 75 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~-----~~~~~~~dst~~~a~~~Laa~l~s 75 (515) |++|+ +++|++||++|+++|++||++|+||++||+|+++++.++. .+..++|||||++|+++||||||| T Consensus 1 m~~~~------~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~ 74 (556) T protein:vir:73 1 MAETE------KERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMS 74 (556) T ss_pred CChhh------HHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcCccccchHHHHHHHHHHHHHH Confidence 99876 8899999999999999999999999999999987654432 224589999999999999999999 Q ss_pred hhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCC--Cc Q lcl|NC_020414. 76 VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSK--GA 153 (515) Q Consensus 76 ~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~--~~ 153 (515) +||||++|||||+++|+.+. +..++++||++||+.|+.+|++||||.++|++|+||++|||+++|++++ .+ T Consensus 75 ~ltpp~~~WF~l~~~d~~~~-------~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~ 147 (556) T protein:vir:73 75 GITSPARPWFKLATPDPDMM-------DYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDDQDV 147 (556) T ss_pred hhcCCCCcccccccCccccc-------chHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecCCce Confidence 99999999999999987654 3567999999999999999999999999999999999999999999765 33 Q ss_pred --EEEEEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchh---hhccCCCcccEEEEEEEEEcCC-------- Q lcl|NC_020414. 154 --MSAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGM---KGKKCKEDDNVKLYTHAQYAGE-------- 220 (515) Q Consensus 154 --~r~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~---~~~~~~~~~~v~v~~~v~~~~~-------- 220 (515) |++|||++|||++|++|+||+|||+|+||+++|+++||.+..+.. ...+++++.+|+++|+|+|+.+ T Consensus 148 ~r~~~~~l~~~~~~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~ 227 (556) T protein:vir:73 148 IRTMPFPIGSYYLANSPRGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHCITPNVNRDSGKMDS 227 (556) T ss_pred EEEEEeecceeEEeeCCCCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEEEeccccccccccCc Confidence 567999999999999999999999999999999999998754432 2344445678999999998643 Q ss_pred ---CCeEEEEEe--CCeeecccCCcccccCcEEEEeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHhccCce Q lcl|NC_020414. 221 ---GFWKINQSA--DDIPVGKENRIKAEKLPFIPLTWKRSYGEDWGRP-LVEDYSGDLFVIQFLSEAVARGAALMADIKY 294 (515) Q Consensus 221 ---~~~~~~~e~--~~~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrg-p~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~ 294 (515) +|.|||++. +++++++|||| ++|||+++||++.+|++|||| |++++|||+|+||.|+++.+++++++++||| T Consensus 228 ~~~p~~s~~~~~~~~~~~vl~esg~--~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~ 305 (556) T protein:vir:73 228 KNKPYRSVYFESGGDSDKLLRESGF--DEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPM 305 (556) T ss_pred ccceEEEEEEEecCCCceecccCCc--ccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 366788874 67899999999 569999999999999999999 8999999999999999999999999999999 Q ss_pred eecCccccChhhccCCC-CcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH---hhccCCCCCCCHHHH Q lcl|NC_020414. 295 LIRPGSQTDVDHFVNSG-TGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME---TMTRRDAERVTAVEI 370 (515) Q Consensus 295 l~~~~g~~~~~~~~~~~-~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~---~l~~~~~~~~TAtEi 370 (515) ++++++...+.++.+++ ++...+|..+++.|++.++ +|++.+.+.|++++++|+++||.+ ++.++++++|||||| T Consensus 306 ~v~~~~~~~~~~~~pgg~~~~~~~~~~~~i~p~~~~~-~d~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r~TAtEv 384 (556) T protein:vir:73 306 VAPTSLKNQRVSLLPGDVTYLDVISGQDGFKPAYLVN-PNTADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAV 384 (556) T ss_pred eccccccccceeeccCccccccCCCCccceeeecccc-ccHHHHHHHHHHHHHHHHHHhhcchhhhhccCCCCCccHHHH Confidence 99999877665555444 3444567778889987654 689999999999999999999876 567899999999999 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHH-----HhcCCCCChhhcccee-eeehHHHHHHHHHH---HHHHHHHHHH Q lcl|NC_020414. 371 QRDALEIEQNMGGVYSLFAMTMQTPIAMWGL-----QEAGDSFTSELVDPVI-VTGIEALGRMAELD---KLANFAQYMS 441 (515) Q Consensus 371 ~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~-----~~~~~~~p~~~~~~~~-v~~l~~l~ra~~~~---~l~~~~~~v~ 441 (515) ++|++|++++|||||+||+.|||.|||+|++ .+.+|++|+++....+ |+|+++|+|+|+.. +|.+++++++ T Consensus 385 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aqk~~~~~~i~~~~~~~~ 464 (556) T protein:vir:73 385 IEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRIEYISVMAQAQKSIGLTSLSQTVGFIG 464 (556) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeecHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999975 3788999988876655 78888888888765 5789999999 Q ss_pred HhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchh--h---------- Q lcl|NC_020414. 442 LPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVI--Q---------- 509 (515) Q Consensus 442 ~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~--~---------- 509 (515) .+++++|+++|+||+|++++.+++++|||+++|||++||+++||||++++|.+++++++++|++++- + T Consensus 465 ~laq~~Pe~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~r~~~qq~~~~~~~~~~a~~~~~~~~~~~~~~~~~l 544 (556) T protein:vir:73 465 QLAQFKPEALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQAQAAQAMAMGQAAAQGAKTLSETQTSDPSAL 544 (556) T ss_pred HHhccChhhHhcCCHHHHHHHHHHHcCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCHHHH Confidence 9999999999999999999999999999999999999999999999888888877766666643211 1 Q ss_pred hhhccC Q lcl|NC_020414. 510 QEMKEG 515 (515) Q Consensus 510 ~~~~~~ 515 (515) +.+..+ T Consensus 545 ~~~~~~ 550 (556) T protein:vir:73 545 TAIANA 550 (556) T ss_pred HHHHHh Confidence 111000 No 25 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=100.00 E-value=7e-155 Score=865.70 Aligned_cols=499 Identities=10% Similarity=0.104 Sum_probs=429.5 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCC-----ccccccccccHHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDN-----ETSQNGWQGVGAQATNHLANKLAQ 75 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~-----~~~~~~~dst~~~a~~~Laa~l~s 75 (515) |+++. +++|++||+.|+++|++||++|+||++||+|+++++.++. ....++|||||++|+++||||||| T Consensus 1 m~~~~------~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~ 74 (559) T protein:vir:95 1 MAETT------KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMS 74 (559) T ss_pred CChhh------HHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcccccccccccchHHHHHHHHHHHHHH Confidence 99886 8899999999999999999999999999999987654332 234578999999999999999999 Q ss_pred hhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCC--Cc Q lcl|NC_020414. 76 VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSK--GA 153 (515) Q Consensus 76 ~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~--~~ 153 (515) +||||++|||||+++|+.+. +..++++||++||+.|+.+|++||||.++|++|+||++|||+|+|++++ .+ T Consensus 75 ~ltpp~~~WF~l~~~d~~~~-------e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~~~ 147 (559) T protein:vir:95 75 GITSPARPWFRLATPDPEMM-------DYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDDDEDI 147 (559) T ss_pred hhcCCCCcccccccCCcccc-------chHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecCCCce Confidence 99999999999999887654 3568999999999999999999999999999999999999999999765 33 Q ss_pred --EEEEEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchh--hhccCCC-cccEEEEEEEEEcCC-------- Q lcl|NC_020414. 154 --MSAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGM--KGKKCKE-DDNVKLYTHAQYAGE-------- 220 (515) Q Consensus 154 --~r~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~--~~~~~~~-~~~v~v~~~v~~~~~-------- 220 (515) |++|||++|||++|++|+||+|||+|+||+++|+++||.+..+.. ...+.++ +.+|+|||+|+|+.+ T Consensus 148 ~r~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~ 227 (559) T protein:vir:95 148 IRTMPFPIGSYYLANSPRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDS 227 (559) T ss_pred eEEEEeecCeEEEeeCCCCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEEEecccccccccccc Confidence 567999999999999999999999999999999999998765432 2233344 557999999998643 Q ss_pred ---CCeEEEEEe--CCeeecccCCcccccCcEEEEeeeecCCCccccc-hHHHHHHHHHHHHHHHHHHHHHHHHhccCce Q lcl|NC_020414. 221 ---GFWKINQSA--DDIPVGKENRIKAEKLPFIPLTWKRSYGEDWGRP-LVEDYSGDLFVIQFLSEAVARGAALMADIKY 294 (515) Q Consensus 221 ---~~~~~~~e~--~~~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrg-p~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~ 294 (515) +|.|||++. +++++++|||| ++|||+++||++.+|++|||| |++++|||+|+||.|+++.+++++++++||| T Consensus 228 ~~~pf~s~~~e~~~~~~~~l~esg~--~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~ 305 (559) T protein:vir:95 228 KNKPFKSVYYEVGGDNDKLLRESGF--DEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPM 305 (559) T ss_pred ccceEEEEEEEecCCCceeeecCCc--ccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCce Confidence 367899887 55789999999 669999999999999999999 9999999999999999999999999999999 Q ss_pred eecCccccChhhccCCCCcceecCC-cccccccccCCccchHHHHHHHHHHHHHHHHHHHHH---hhccCCCCCCCHHHH Q lcl|NC_020414. 295 LIRPGSQTDVDHFVNSGTGEVITGV-EEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME---TMTRRDAERVTAVEI 370 (515) Q Consensus 295 l~~~~g~~~~~~~~~~~~g~~~~g~-~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~---~l~~~~~~~~TAtEi 370 (515) ++++++.+++.++.+++.+.+..+. .+.+.|.+..+ .+++.+...|++++++|+++||.+ ++.++++++|||||| T Consensus 306 ~v~~~~~~~~~~l~pgg~~~~~~~~~~~~i~p~~~~~-~~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV 384 (559) T protein:vir:95 306 VAPTSLKNQRASLLPGDITYIDQITGQDGFRPAYLVN-PSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAV 384 (559) T ss_pred eccccccccceeeeccceeeeCCCCCcccceeecccc-cchHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHHH Confidence 9999999888887766655443332 24566665443 688889999999999999999866 467899999999999 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHH-----HhcCCCCChhhccce-eeeehHHHHHHHHH---HHHHHHHHHHH Q lcl|NC_020414. 371 QRDALEIEQNMGGVYSLFAMTMQTPIAMWGL-----QEAGDSFTSELVDPV-IVTGIEALGRMAEL---DKLANFAQYMS 441 (515) Q Consensus 371 ~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~-----~~~~~~~p~~~~~~~-~v~~l~~l~ra~~~---~~l~~~~~~v~ 441 (515) ++|++|++++|||||+||+.|||.|||+|++ .+.+|++|+++.... .|+|+++|+|+|+. ++|.+++++++ T Consensus 385 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~~~ 464 (559) T protein:vir:95 385 IEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIG 464 (559) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceEEEeecHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999974 367899998875443 47888888887765 56889999999 Q ss_pred HhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchh-hhhhc------- Q lcl|NC_020414. 442 LPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVI-QQEMK------- 513 (515) Q Consensus 442 ~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~-~~~~~------- 513 (515) .+++++|+++|+||+|++++.+++++|||+++|||++||+++||||++++|++|+++++.+|++.+- ..+.+ T Consensus 465 ~laq~~Pevld~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~rqqr~~~qq~~q~~~~~~~aa~~~~~~~~~~~~~~~~l 544 (559) T protein:vir:95 465 QLAQVKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQMMAMGMAAAQGVKTLSEAKTSDPSVL 544 (559) T ss_pred HHhccChhhhhcCCHHHHHHHHHHHhCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCChhHH Confidence 9999999999999999999999999999999999999999999999999988888777766644321 11111 Q ss_pred cC Q lcl|NC_020414. 514 EG 515 (515) Q Consensus 514 ~~ 515 (515) |+ T Consensus 545 ~~ 546 (559) T protein:vir:95 545 SA 546 (559) T ss_pred HH Confidence 11 No 26 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=100.00 E-value=1.1e-87 Score=497.39 Aligned_cols=498 Identities=13% Similarity=0.113 Sum_probs=364.5 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcc----------cccCCCCC--CccccccccccHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLP----------YLMNNKGD--NETSQNGWQGVGAQATNH 68 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P----------~~~~~~~~--~~~~~~~~dst~~~a~~~ 68 (515) -+.+++-..+-...|.+||+.+++.|++||.+|+||++|..+ +.+...++ ...+.+++|++..++++. T Consensus 15 ~~~~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ki~~~~~~~~~~~ 94 (641) T protein:vir:94 15 SAKRKLSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRHRINTGHTFEVVET 94 (641) T ss_pred chhhcCCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcccccccchhHHHHHHH Confidence 222233223345679999999999999999999999977655 33222222 122347999999999999 Q ss_pred HHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE Q lcl|NC_020414. 69 LANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK 148 (515) Q Consensus 69 Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~ 148 (515) |+++||+++|| +++||++.+.++...+ +.++ ++..+...++.++|+..++..+.+.+.+||+++-+ T Consensus 95 l~s~Lm~~~~p-~~~wf~~~p~~~ed~~----------~A~~---~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv~~ 160 (641) T protein:vir:94 95 LVAYFKGATFP-SDDWFDLKGMVPELAD----------AARV---VKQLTKTKLEAASIRDIFETYVRNLVLYGVSTYRL 160 (641) T ss_pred HhhHHhhhhcC-CCceEEEecCCCChHH----------HHHH---HHHHHHHHHhhcchHHHHHHHHHHHhhcCceEEEe Confidence 99999999997 8999999987655321 1222 23456667889999999999999999999997744 Q ss_pred eC------------------------------CCcEEEEEcceEEEeeCCCCCee----EEEEEEEecHHHHHHH--hcc Q lcl|NC_020414. 149 PS------------------------------KGAMSAVPMHHYVVNRDTNGDLM----DVILLQEKALRTFDPA--TRM 192 (515) Q Consensus 149 d~------------------------------~~~~r~~pl~~y~v~~d~~G~vd----~i~r~~~~t~~ql~~~--~~~ 192 (515) +- ...++++||..|-|-.|+.++++ ++||++++|+.+|..+ |+. T Consensus 161 ~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~dps~~~~~~~f~~~r~t~~t~~~l~~eg~~~~ 240 (641) T protein:vir:94 161 GWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWLDTSGGKNTGTFVRLRHTREELHELVTSGYYDL 240 (641) T ss_pred ehhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhheeecCCCCcccccceehhhhHHHHHHHHhcCCCCh Confidence 21 01246678776666666666665 5688899999999877 544 Q ss_pred cccch---hhhccCCCcc-------------cEEEEEEEEEcCCCCeEEEEEeCCeeecccCCccc-ccCcEEEEeeeec Q lcl|NC_020414. 193 AIEVG---MKGKKCKEDD-------------NVKLYTHAQYAGEGFWKINQSADDIPVGKENRIKA-EKLPFIPLTWKRS 255 (515) Q Consensus 193 ~~~~~---~~~~~~~~~~-------------~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy~~-~~~P~~~~Rw~~~ 255 (515) +.... ..+....++. .+++|..+..++..+|+||++++|+++++++||.. ++|||+++||.+. T Consensus 241 d~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~Pf~~~r~~~~ 320 (641) T protein:vir:94 241 DLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVTTTLLPD 320 (641) T ss_pred hhcchhhcccccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeCCEEeecccccccCcCCeEEecceec Confidence 32211 1111111211 11223334445667899999999999999888743 4689999999999 Q ss_pred CCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCCcceecCCcccccccccCCccchH Q lcl|NC_020414. 256 YGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGVEEDIHIVQLGKYADLT 335 (515) Q Consensus 256 ~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~ 335 (515) ++++||+||++++|||+|+||.+++..++++.++++|||+++++|+++++++..++.|.+..+..+++.|+..+ ..+|+ T Consensus 321 ~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l~~~PG~ii~~~~~~~v~pl~~~-~~~~~ 399 (641) T protein:vir:94 321 RDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDVKAKPGAVFKVAQHGSLQPIDMG-RQDFV 399 (641) T ss_pred CCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeeccccccccceeeccCCcceeeCCCCcceeecCC-ccccc Confidence 99999999999999999999999999999999999999999999999999998777677777888889988654 46899 Q ss_pred HHHHHHHHHHHHHHHHHHHHhh----ccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH--------- Q lcl|NC_020414. 336 PISAVLEVYTRRIGVIFMMETM----TRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ--------- 402 (515) Q Consensus 336 ~~~~~i~~~~~rI~~afl~~~l----~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~--------- 402 (515) ..+..++.++.+|+++|+.+.+ ..+++++||||||+++.+|+...||+++++|+.||+.||+.+++. T Consensus 400 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p 479 (641) T protein:vir:94 400 VTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTP 479 (641) T ss_pred hhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccch Confidence 9999999999999999986544 337778899999999999999999999999999999999887532 Q ss_pred ------------hcCCCCChhhcccee-eeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhc-- Q lcl|NC_020414. 403 ------------EAGDSFTSELVDPVI-VTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQI-- 467 (515) Q Consensus 403 ------------~~~~~~p~~~~~~~~-v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~-- 467 (515) +.++++|++.++.++ +.+++..+++.+++++++++++++.+++ .|++++++|+|.+++.+++.. T Consensus 480 ~i~R~~~~~~~~~~~~~~~p~~L~~~~~iv~l~~~q~~~~~~~i~~l~~~~~~~a~-~P~v~d~~d~~~~~~~~~~~~g~ 558 (641) T protein:vir:94 480 ETIRMYVPEEQMDGFFEVSPEYLHYPYKFLALGANYVVERERMVTDLLQLLDISGR-VPQIGQSLDYALILEDLLRQMRF 558 (641) T ss_pred hhhhhhchhhhcccCCCCCccceeeeeeEeecchhHHHHHHHHHHHHHHHHHHhhc-ChhhhhcCCHHHHHHHHHHHhCC Confidence 123445555555443 4577777777778888888888877776 588999999999999999875 Q ss_pred CCchhccCCHHHHHHHHHHHHHHHHHHHHHHHh----hhhccchhhh-------------------hhccC Q lcl|NC_020414. 468 SAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGV----AKAVPGVIQQ-------------------EMKEG 515 (515) Q Consensus 468 Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~----~~a~~~~~~~-------------------~~~~~ 515 (515) |+|..++|+++...+-++++++.+ ++++.+++ +.+.+....+ -++|+ T Consensus 559 ~~p~~~ir~~~~~~~~~~~~~~~~-q~~~~~~a~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 628 (641) T protein:vir:94 559 TDPMRYIKKAEAPPAAPPIAPAEP-GALPPEMMNSVGGGLNDQAIAGMTPEDVSDLASRIGIDTSDVAPEA 628 (641) T ss_pred CCchhhccCccCchhHHHHHHHHH-HHHHHHHHHHHHhhhHHHHHHHhhHHHHHHHHHhhcCCchhhhHHH Confidence 578889999875433332222111 12222222 2122222111 11111 No 27 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=100.00 E-value=1.4e-67 Score=387.12 Aligned_cols=500 Identities=12% Similarity=0.101 Sum_probs=350.3 Q ss_pred CCCccccc----cc---c-HHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccc----------cCC--CCCCccccccccc Q lcl|NC_020414. 1 MQDTILEY----GG---Q-RSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYL----------MNN--KGDNETSQNGWQG 60 (515) Q Consensus 1 ~~~~~~~~----~~---~-~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~----------~~~--~~~~~~~~~~~ds 60 (515) ||.|-+.- -+ . ...+.++|+.+++.|+.|+.+|++++++..+.- ... .++...+.+++.+ T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~rs~~~~~ 82 (651) T protein:vir:80 3 LATTTTDKNRQTYDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRHKITTG 82 (651) T ss_pred ccccccchhhhhhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCCCCCCccccCh Confidence 44333221 11 1 355899999999999999999999998777731 111 1111233468999 Q ss_pred cHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHh Q lcl|NC_020414. 61 VGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIV 140 (515) Q Consensus 61 t~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~ 140 (515) +-..+++++.+.|+..+|| +.+||++.+.++.. ..+.+-+-|+..+...++.++|+...+.+++|.++ T Consensus 83 ~v~~~ve~~~~~l~~~~~~-~~~~~~~~p~~~~d-----------~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~ 150 (651) T protein:vir:80 83 KAFEAIETIHAYLMSATFP-NKNWFDVVPAKPGQ-----------DNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLI 150 (651) T ss_pred hHHHHHHHHHHHHHHhhcC-CCceeEeccCCchh-----------HHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcc Confidence 9999999999999999997 68999999854322 12334455666777788999999999999999999 Q ss_pred hCceEEEE--eCC---------------------------------CcEEEEEcceEEEeeCCCCCeeEEE-EEEEecHH Q lcl|NC_020414. 141 AGNCLLYK--PSK---------------------------------GAMSAVPMHHYVVNRDTNGDLMDVI-LLQEKALR 184 (515) Q Consensus 141 ~G~~~l~~--d~~---------------------------------~~~r~~pl~~y~v~~d~~G~vd~i~-r~~~~t~~ 184 (515) +|||++-+ |.. ..++.+|+.+|++..++.+.-|+-| .+..+|.. T Consensus 151 ~G~~i~kv~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~d~~~v~~~~~t~~ 230 (651) T protein:vir:80 151 TGNSVLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFIRKLTKTKA 230 (651) T ss_pred cCceEEEEeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeeecCCCcCccccceeeeeeeeHH Confidence 99998732 110 1256689999999999987666633 34456766 Q ss_pred HHHHHhc----ccccc----hh--------------hh-----ccCCCcccEEEEEEEEE---cCCCCeEEEEEeCCeee Q lcl|NC_020414. 185 TFDPATR----MAIEV----GM--------------KG-----KKCKEDDNVKLYTHAQY---AGEGFWKINQSADDIPV 234 (515) Q Consensus 185 ql~~~~~----~~~~~----~~--------------~~-----~~~~~~~~v~v~~~v~~---~~~~~~~~~~e~~~~~i 234 (515) ++.+... .+... .. .. ...++...|+||+|..+ ++++++++|+..+|+++ T Consensus 231 ~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~v~~~g~~i 310 (651) T protein:vir:80 231 DILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIMGNEV 310 (651) T ss_pred HHHHHHhcccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeeccCCceEEEEEEEcCcEE Confidence 6543221 00000 00 00 01234567888887432 45668999999999988 Q ss_pred cc--cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCC Q lcl|NC_020414. 235 GK--ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGT 312 (515) Q Consensus 235 ~~--esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~ 312 (515) ++ +.+|+ ++|||+++||.+.+|+.||+||++.++|+.+.||.+++..++++.++++|+|+++++|+++++++...+. T Consensus 311 l~~~~~~~~-~~~Pf~~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l~~~pg 389 (651) T protein:vir:80 311 LRFEQNPYW-CGRPFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDVYTEPG 389 (651) T ss_pred ecccccCCC-CCCCeeeecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHhhcCCC Confidence 75 66666 5799999999999999999999999999999999999999999999999999999999999999987777 Q ss_pred cceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhcc----CCCCCCCHHHHHHHHHHHHHHhhhhHHHH Q lcl|NC_020414. 313 GEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTR----RDAERVTAVEIQRDALEIEQNMGGVYSLF 388 (515) Q Consensus 313 g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~----~~~~~~TAtEi~~r~~E~~~~LGpv~~rl 388 (515) |.++.+.++++.+++.+ ..+++.++..|+.++++|++.|+...+.+ ++.+++|||||+.+++|+...||++|++| T Consensus 390 ~vi~~~~~~~~~~l~~~-~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l 468 (651) T protein:vir:80 390 KVFLVSDHGDLQPLANQ-SSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHI 468 (651) T ss_pred ceEEecCCCCceeeccC-cccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHH Confidence 87788999999988765 45899999999999999999997654333 55678999999999999999999999999 Q ss_pred HHHHHHHHHHHHHH-----hcCCCCCh----------------hhcccee-eeehHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_020414. 389 AMTMQTPIAMWGLQ-----EAGDSFTS----------------ELVDPVI-VTGIEALGRMAELDKLANFAQYMSLPQTW 446 (515) Q Consensus 389 ~~E~l~Pli~r~~~-----~~~~~~p~----------------~~~~~~~-v~~l~~l~ra~~~~~l~~~~~~v~~~a~~ 446 (515) +.||+.||+.|++. ...|+++. .++...+ +.++++.+...+.+.+++.+++++.+++ T Consensus 469 ~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~~~~q~~~~- 547 (651) T protein:vir:80 469 EETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRLTFIQAVAQ- 547 (651) T ss_pred HHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHHHHHHhhcc- Confidence 99999999988753 12222211 1222222 4456666555555555555555555555 Q ss_pred ChHHHhcCCHHHHHHHHHHhcCCc--hhccCCHHHHHHHHHHH-----HHH----HHHHHHHHHhhhhc------cchhh Q lcl|NC_020414. 447 PEPAQRAIRWGDYMDWVRGQISAE--LPFLKSEEEMQQEMAQQ-----AQA----QQEAMLNEGVAKAV------PGVIQ 509 (515) Q Consensus 447 ~p~~~d~id~d~~~~~~a~~~Gvp--~~~irs~eev~~~rq~~-----~~~----~q~~~~~~~~~~a~------~~~~~ 509 (515) .|.+.+.+|...+++.+++..|++ ..++..+++.+....+. ++. .+.++++.++.+++ +...+ T Consensus 548 ~p~~~~~~~~~~~~~~l~~~~g~~~~~~~l~~~~q~~~~~~~~~~~~q~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 627 (651) T protein:vir:80 548 VPEMGQLVDYKRILVDLLQHWGFEEPEAYLKQQDQQAPANPQEALLSQAKDVGGQAMSNMLQNQLQADGGTQMMSEMYGT 627 (651) T ss_pred CCccchhhhHHHHHHHHHHHcCCCCcHHhcCCCccchhhhhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 455778889999999999999985 45777765554322111 110 00011111111111 01111 Q ss_pred hhhccC Q lcl|NC_020414. 510 QEMKEG 515 (515) Q Consensus 510 ~~~~~~ 515 (515) +...|+ T Consensus 628 ~~~~~~ 633 (651) T protein:vir:80 628 PNADQM 633 (651) T ss_pred HHHHHH Confidence 111112 No 28 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=100.00 E-value=4.2e-36 Score=214.56 Aligned_cols=480 Identities=14% Similarity=0.117 Sum_probs=312.1 Q ss_pred CCCcc------ccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCC--CccccccccccHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTI------LEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGD--NETSQNGWQGVGAQATNHLANK 72 (515) Q Consensus 1 ~~~~~------~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~--~~~~~~~~dst~~~a~~~Laa~ 72 (515) |+-|. ......+..++++|+.+.+.|++++..|.|+++|..-+.....++ -....++|=+.....++++.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~~~~~~~~~~r~~~~~~k~~~~~~~i~~~ 80 (584) T protein:vir:95 1 MSVKVAELNSLLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTTTTSNQGLPWKNSTTLPKLCQIRDNLHSN 80 (584) T ss_pred CCcchhhhhhhccccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhhhhhhcccccccccchhHHHHHHHHHHHH Confidence 44332 112345688899999999999999999999999998865433332 1222357777788999999999 Q ss_pred HHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC Q lcl|NC_020414. 73 LAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG 152 (515) Q Consensus 73 l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~ 152 (515) ||+.+|| ++.||++....+.... .++ =+.+++.+...|+.+||+.++...++|++++|+|.+=+.... T Consensus 81 l~~~~Fp-~~~w~~~v~~~~~~~~---------~~~--~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~ 148 (584) T protein:vir:95 81 YFSSLFP-NDDWLRWVGYGKGDST---------KTK--AKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEA 148 (584) T ss_pred HHHhhcC-ccceeeeecCCCchhh---------HHH--HHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEee Confidence 9999999 7999999987654321 111 134566677778999999999999999999999976442211 Q ss_pred c---------------EEE--EEcceEEEeeCCCCCeeEEE--EEEEecHHHHHHHhccc--------ccch-------- Q lcl|NC_020414. 153 A---------------MSA--VPMHHYVVNRDTNGDLMDVI--LLQEKALRTFDPATRMA--------IEVG-------- 197 (515) Q Consensus 153 ~---------------~r~--~pl~~y~v~~d~~G~vd~i~--r~~~~t~~ql~~~~~~~--------~~~~-------- 197 (515) + -++ ++.-++++..++ +.++... ++..+|..+|.+...+. .... T Consensus 149 ~~~e~~e~~~v~~~~~prieriSP~d~~~Dpsa-~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~ 227 (584) T protein:vir:95 149 KYKEMTDGTLVPDYIGPRLVRISPLDIVFNPLA-TSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHL 227 (584) T ss_pred cceeeeccccccccccceEEeeChhheeecCCC-CCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCC Confidence 1 122 334577888888 5566532 46678999997765221 0000 Q ss_pred hhhccC--------CCc-----------ccEEEEEE--EEEc--CCCC--eEEEEEeCCeeecc--cCCcccccCcEEEE Q lcl|NC_020414. 198 MKGKKC--------KED-----------DNVKLYTH--AQYA--GEGF--WKINQSADDIPVGK--ENRIKAEKLPFIPL 250 (515) Q Consensus 198 ~~~~~~--------~~~-----------~~v~v~~~--v~~~--~~~~--~~~~~e~~~~~i~~--esgy~~~~~P~~~~ 250 (515) ..+... +.+ ..|+++.. ..++ .++- +.+..-.++.++++ +.-|+.+.+||++. T Consensus 228 ~~~~~~~~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iIR~~~np~~~~~~PF~~~ 307 (584) T protein:vir:95 228 GGYSVEDFDKAAGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPTWFGSAPIYHV 307 (584) T ss_pred CCCcccccccccccccccccccccccCCceeEEEeecccccccccCCCcccceEEEEeccEEEEeeecCCCCCCCCEEEE Confidence 000000 000 11333221 0111 1111 22223346666655 66799899999999 Q ss_pred eeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCCcceecCCcccccccccCC Q lcl|NC_020414. 251 TWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGVEEDIHIVQLGK 330 (515) Q Consensus 251 Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~~g~~~~v~~~~~~~ 330 (515) .|.......||.|+.+-++|-.+.||.+.+..+.++.++++|++. ++++++++..++.+.+.++..+++.++... T Consensus 308 ~~~p~~~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k----~~~~~~~~~~~pg~~~~~~~~~~~q~~~p~- 382 (584) T protein:vir:95 308 GWRFRPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQPPLK----IIGEVEEFVWGPGAEIHLDQGGDVQEIAKN- 382 (584) T ss_pred cceeeeccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCccee----eccccchhcccCCceeecCCCCCcceecCc- Confidence 999999999999999999999999999999999999999999643 357778887776666677888887776532 Q ss_pred ccchHHHHHHHHHHHHHHHHHH---HHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH----- Q lcl|NC_020414. 331 YADLTPISAVLEVYTRRIGVIF---MMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ----- 402 (515) Q Consensus 331 ~~~l~~~~~~i~~~~~rI~~af---l~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~----- 402 (515) ..++-.+...|+-+.....+-- .+..+.. .+..-||+++ +..+...+.++-+....|-.|++++++. T Consensus 383 a~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~-~~~~~TAtg~----s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~ 457 (584) T protein:vir:95 383 VNYIINADNQIQMLEDRMELYAGAPREAMGIR-TPGEKTAFEV----QQLGNAAGRIFQEKVTTFEVELLEPVLNAMLET 457 (584) T ss_pred hhhhhHHHHHHHHHHHHHHhhhCCChhhcccc-cchhhhHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2344444444444444443310 1111221 2223466665 6666677788888888887777776421 Q ss_pred h----c----------------CCCCChhhcccee-eeehHH---HHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHH Q lcl|NC_020414. 403 E----A----------------GDSFTSELVDPVI-VTGIEA---LGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGD 458 (515) Q Consensus 403 ~----~----------------~~~~p~~~~~~~~-v~~l~~---l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~ 458 (515) + . +-+++.++++.++ +..+.+ +.|+|..+++.++++ +++.+.++.+.+-.. T Consensus 458 ~~~nmd~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq-----~~~~~~i~p~~~~~~ 532 (584) T protein:vir:95 458 ATRNMDGSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFN-----SQIGQMILPHTSGKA 532 (584) T ss_pred HHhhccccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHH-----hhhhhhccccchHHH Confidence 0 0 1123445555542 333333 567787788777766 255566777888889 Q ss_pred HHHHHHHhcCCch-hccCCHHHHHHH-HHHHHHHHHHHHHHHHhhhhccchh Q lcl|NC_020414. 459 YMDWVRGQISAEL-PFLKSEEEMQQE-MAQQAQAQQEAMLNEGVAKAVPGVI 508 (515) Q Consensus 459 ~~~~~a~~~Gvp~-~~irs~eev~~~-rq~~~~~~q~~~~~~~~~~a~~~~~ 508 (515) +.+.+++..+.|. .+.+++-.|+.- ..|+..+++++.++.++..++.|++ T Consensus 533 l~~~ladl~~~p~~~~~~~~~~~~~Q~~~q~~~~~~q~~~~~~~~~~~~~~~ 584 (584) T protein:vir:95 533 LATFVDDVTGLQGYEIFRPNVAVAEQAETQSLVAQAQEDLQLQAQMPAEGAI 584 (584) T ss_pred HHHHHHHHhCCCcccccCCCcccchhHHHHhhhHHHHHHHHHHHhhhhccCC Confidence 9999999999997 466665555442 2333333444555666666777777 No 29 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=100.00 E-value=1.6e-34 Score=205.85 Aligned_cols=489 Identities=10% Similarity=0.014 Sum_probs=296.3 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCc-c-ccccccccHHHHHHHHHHHHHHhhc Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNE-T-SQNGWQGVGAQATNHLANKLAQVLF 78 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~-~-~~~~~dst~~~a~~~Laa~l~s~lt 78 (515) |-.|-.....=...++.+|+++.+.|+..+..|.|+++|+.-+....+++.. . ..+++-+...+.+.+|.+.+++++| T Consensus 11 ~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~~~tr~t~~~~~~w~~s~t~~k~~~~~~~l~a~~~~~~f 90 (599) T protein:vir:31 11 MLEGRDDDRAFIDELVVLFTNMENARAQKDREDKELMDYIDATDTRKTSNSKLPFKNSTTINKLAHLHLMITTSYMEHLL 90 (599) T ss_pred HhhccCchHHHHHHHHHHHHhhhhhhhhhhcccHHHHHHHhhhcccccccCCCCcccccchHHHHHHHHHHHHHHHhhhc Confidence 2222211111234589999999999999999999999998775443333321 1 1235556667999999999999999 Q ss_pred CCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCc----- Q lcl|NC_020414. 79 PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKGA----- 153 (515) Q Consensus 79 pp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~~----- 153 (515) | ++.||++..-++... .+..=+.+++.|...|+.|+|+.+....+.|++.+|||+.-++.... T Consensus 91 p-~~~w~d~~~~~~~~~-----------~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~~~er~~~~~~ 158 (599) T protein:vir:31 91 P-NRNWVDFVGFDNDSV-----------NAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHTRHVKRMTVTA 158 (599) T ss_pred C-CccceEeeecCCchh-----------HHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEeeeEEEcceeec Confidence 9 999999998765431 12222445667888899999999999999999999999766552211 Q ss_pred ------------EEEEEcceEEEeeCCCCCeeEEE--EEEEecHHHHHHHhcccc--------cch-----hh---hccC Q lcl|NC_020414. 154 ------------MSAVPMHHYVVNRDTNGDLMDVI--LLQEKALRTFDPATRMAI--------EVG-----MK---GKKC 203 (515) Q Consensus 154 ------------~r~~pl~~y~v~~d~~G~vd~i~--r~~~~t~~ql~~~~~~~~--------~~~-----~~---~~~~ 203 (515) ++-+..-.+++..++ +.++.++ +|..+|..+|-+...+.. ... +. ++.. T Consensus 159 d~~v~~~~~~P~~ervsP~Di~~Dp~A-~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~~~~~~~~~~d 237 (599) T protein:vir:31 159 ENQVIKNYSGTVTERLSPSDVFWDVTA-DSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLREERRTIREALAD 237 (599) T ss_pred ccccccccccceEEeecccceeeCCCC-CCCCcceeeeehhhhHHHHHHHhccCCccccchHHHHHHHhhccCCCccccc Confidence 222344678888888 5566543 688888888877554211 000 00 0000 Q ss_pred -------------CCcccEEEEEE-----------EEEcC--CCCe--EEEEEeCCeeecc--cCCcccccCcEEEEeee Q lcl|NC_020414. 204 -------------KEDDNVKLYTH-----------AQYAG--EGFW--KINQSADDIPVGK--ENRIKAEKLPFIPLTWK 253 (515) Q Consensus 204 -------------~~~~~v~v~~~-----------v~~~~--~~~~--~~~~e~~~~~i~~--esgy~~~~~P~~~~Rw~ 253 (515) ++..++.-||- ..++. ++-| .+-.-+|+..+++ .--|+.++.||++..|. T Consensus 238 ~~~~~~g~D~~~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyvv~~~~ 317 (599) T protein:vir:31 238 GYNGRRKFDSLHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDGSQNLHIAVYE 317 (599) T ss_pred hhhhhhhccccccccccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCCCCCeEEEEee Confidence 11111111110 01111 1111 1222235555544 33488777899999999 Q ss_pred ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCCcceecCCcccccccccCCccc Q lcl|NC_020414. 254 RSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGVEEDIHIVQLGKYAD 333 (515) Q Consensus 254 ~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~ 333 (515) ...++.||.||...++|-.-.||.+.+..+.+...+++| +++-.|.+.+.++.+.++..+..+..+++.++. +..+ T Consensus 318 P~~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p--~l~~~~dl~~eD~~~~P~~v~~~~d~~~vq~~~--p~s~ 393 (599) T protein:vir:31 318 FQKDTLCPIGPLHRLTGMQYKLDKRENFREDLHDRFLHP--SLKKVGDVREKGMRGGPNHVFEVEETGDVQYMT--PPAE 393 (599) T ss_pred eeccccCCCCCchhcchHHHHHHHHHHHhhhhhhhhhcc--cccccccccccCccCCCCcceeecCCCcccccc--Cchh Confidence 999999999999999999999999999999999999988 444566688888876654444445666665543 2223 Q ss_pred hHHHHHHHHHHHHHHHHHH---HHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH--hc---- Q lcl|NC_020414. 334 LTPISAVLEVYTRRIGVIF---MMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ--EA---- 404 (515) Q Consensus 334 l~~~~~~i~~~~~rI~~af---l~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~--~~---- 404 (515) ...+...|+..+.+..+.= .+.++.+..++ -||+||....++...........+..+++.||+++++. +. T Consensus 394 ~~~a~~~is~~e~~mee~sGvp~~~~G~~~ag~-~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~ 472 (599) T protein:vir:31 394 VLQPDNQLSITLQLMEDLSGAPKESIGQRTAGE-KTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDA 472 (599) T ss_pred hhhHHHHHHHHHHHHHHhhccchhhcCCcccch-hhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 3334444554444433311 22233333333 59999999999999999999999999999999998642 11 Q ss_pred ---------------CCCCChhhccce-eeeehHH---HHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHH Q lcl|NC_020414. 405 ---------------GDSFTSELVDPV-IVTGIEA---LGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRG 465 (515) Q Consensus 405 ---------------~~~~p~~~~~~~-~v~~l~~---l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~ 465 (515) +-++-.++++.. .+..+++ +.|++-.+++.++++ +++.+....++.-.++...++. T Consensus 473 ~~tiri~~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~l~~il~-----~~~~q~~~P~~~~k~l~~~l~~ 547 (599) T protein:vir:31 473 SDTIKTFNSELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAILG-----GPLGAALAPHMSRTKLFNAVEY 547 (599) T ss_pred ccceeeecccccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHHHHHHHhc-----ccCCCccchhhHHHHHHHHHHH Confidence 111223333332 2333332 668888888887776 4444445555555566666666 Q ss_pred hcCCch-hccCCHHHHHHHHHHHHHHHHHHHHH----HHhhhhccchhhhhhc Q lcl|NC_020414. 466 QISAEL-PFLKSEEEMQQEMAQQAQAQQEAMLN----EGVAKAVPGVIQQEMK 513 (515) Q Consensus 466 ~~Gvp~-~~irs~eev~~~rq~~~~~~q~~~~~----~~~~~a~~~~~~~~~~ 513 (515) ....-. .+.+..--|++- |+...++|.+..+ |.-+.-.+++..+.+| T Consensus 548 ~~~l~~~~~~~~~va~~eq-q~~~~m~Q~~lq~~~~~~~~~~~~~~~~~~~~~ 599 (599) T protein:vir:31 548 LGDLDAYGIFTFGIGVQED-QQLARMAQKSTQQTEETALTQEEVGGPTTDTGQ 599 (599) T ss_pred HHhccccccCCCchhHHHH-HHHHHHHHHHHHHhHhhhhhhhhcCCCCcccCC Confidence 443322 233332222111 1111111111111 1222223444444444 No 30 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=99.96 E-value=7e-28 Score=169.45 Aligned_cols=493 Identities=8% Similarity=0.004 Sum_probs=281.9 Q ss_pred CCCccccccccHHHHHHHHHHH----HHhhhh-HHHHHHHHHHhhcccccCCCCCCccccccccccHHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKF----SKKRSP-YLDRAKHFAKLTLPYLMNNKGDNETSQNGWQGVGAQATNHLANKLAQ 75 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~l----k~~R~~-~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~s 75 (515) ||+---....+..++.+.+.++ ++-... +...+.+-.+|.+-.... .......+++.+.-...++.+.+.|+. T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~--~~~~~~s~~~~~~v~~~v~~~~~~l~~ 78 (705) T protein:vir:88 1 MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFG--NERPGKSGIVSRDVQETVDWIMPSLMK 78 (705) T ss_pred CCcccccccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCC--cccCCCCccccHHHHHHHHHHHHHHHH Confidence 8877444445555555555444 332221 112333444444322111 112223456677777789999999999 Q ss_pred hhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHH-HHHhcCCHHHHHHHHHHHHhhCceEE---EEe-- Q lcl|NC_020414. 76 VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMK-ALEQRQFRPAIVEVFKHLIVAGNCLL---YKP-- 149 (515) Q Consensus 76 ~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~-~l~~snf~~~~~~~~~dl~~~G~~~l---~~d-- 149 (515) .+|+ +.+||++.|..+...+. . +.++..+.- ....++.+..++.+++|.+..|+|++ |.. T Consensus 79 ~~~~-~~~~~~~~p~~~~D~~~----------a---~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~ 144 (705) T protein:vir:88 79 VFTS-GGQVVKYEPDTAEDVEQ----------A---EQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVL 144 (705) T ss_pred hhcC-CCceEEEeeCChhHHHH----------H---HHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEecccccc Confidence 9887 99999999865443221 1 112222222 24566678999999999999999876 421 Q ss_pred ---------------------CC------------------------C--cEEEEEcceEEEeeCCCCCeeE--EEEEEE Q lcl|NC_020414. 150 ---------------------SK------------------------G--AMSAVPMHHYVVNRDTNGDLMD--VILLQE 180 (515) Q Consensus 150 ---------------------~~------------------------~--~~r~~pl~~y~v~~d~~G~vd~--i~r~~~ 180 (515) ++ + .++.||..+|++..++.+--|. +++++. T Consensus 145 ~~~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~~~d~~~~~~~~~ 224 (705) T protein:vir:88 145 KPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHREK 224 (705) T ss_pred chhhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCCcccCcEEEEEEe Confidence 10 0 1344677899999988775554 567889 Q ss_pred ecHHHHHHHh---------cccccch---h---hhc---------------cCCCcccEEEEEEEEE---cCCCCeEEEE Q lcl|NC_020414. 181 KALRTFDPAT---------RMAIEVG---M---KGK---------------KCKEDDNVKLYTHAQY---AGEGFWKINQ 227 (515) Q Consensus 181 ~t~~ql~~~~---------~~~~~~~---~---~~~---------------~~~~~~~v~v~~~v~~---~~~~~~~~~~ 227 (515) +|..+|...+ ..+.... . ... .......|++|.|..+ ++++.+++|. T Consensus 225 ~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~~~ 304 (705) T protein:vir:88 225 YTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRR 304 (705) T ss_pred ccHHHHHhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEecccCCcceeeEE Confidence 9999985432 1100000 0 000 0011224777776443 4466666554 Q ss_pred E-eCCeeecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhh Q lcl|NC_020414. 228 S-ADDIPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDH 306 (515) Q Consensus 228 e-~~~~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~ 306 (515) - ..|.++++...+ +.+||++.++.+.++..||+|++....+-.+.+|.+.+..+.++..+++|.++++ +|..++++ T Consensus 305 ~~~~g~~il~~~~~--~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~-~g~v~~~d 381 (705) T protein:vir:88 305 ILYVGDYIISNEPW--DCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVL-DGQVNLED 381 (705) T ss_pred EEEeCccccccccC--CCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceecc-ccccCccc Confidence 3 245667766555 4599999999999999999999999999999999999999999999999999995 56667877 Q ss_pred ccCCCCcceecC-CcccccccccCCccchHHHHHHHHHHHHHHHHHH-H--HHhhccCC--CCCCCHHHHHHHHHHHHHH Q lcl|NC_020414. 307 FVNSGTGEVITG-VEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIF-M--METMTRRD--AERVTAVEIQRDALEIEQN 380 (515) Q Consensus 307 ~~~~~~g~~~~g-~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~af-l--~~~l~~~~--~~~~TAtEi~~r~~E~~~~ 380 (515) +....+|.++.- ..+.+.+++.+. --+.+...++.+.+.|++.. + +.++...+ ....||+.|....+..... T Consensus 382 ~~~~~pg~vv~~~~~~~i~~~~~~~--~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r 459 (705) T protein:vir:88 382 LLTNEAAGIVRVKSMNSITPLETPQ--LSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQ 459 (705) T ss_pred ccccCCCeeEEecCCCccccccCCc--CcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHH Confidence 777777877653 234555654332 23345566777777777654 1 11121212 2358999999998888888 Q ss_pred hhhhHHHHHHHHHHHHHHHHHH---hc------------CCCCChhhc----cceeeeehHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 381 MGGVYSLFAMTMQTPIAMWGLQ---EA------------GDSFTSELV----DPVIVTGIEALGRMAELDKLANFAQYMS 441 (515) Q Consensus 381 LGpv~~rl~~E~l~Pli~r~~~---~~------------~~~~p~~~~----~~~~v~~l~~l~ra~~~~~l~~~~~~v~ 441 (515) +.-..-.+...++.+++.+++. .- .-.+.+.+. ++.+.+.++...+.+....+...++... T Consensus 460 ~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll~~~q 539 (705) T protein:vir:88 460 IDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQ 539 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhccCCceEEeeccccchHHHHHHHHHHHHHHHH Confidence 8887777777788888766532 11 111111222 2333456777777777777777777665 Q ss_pred HhhcCChHHHhcCCH---HHHHHHHHHhcCC--chhccCCHHHHHHHHHHHH----HHHHH--------H----HHHHHh Q lcl|NC_020414. 442 LPQTWPEPAQRAIRW---GDYMDWVRGQISA--ELPFLKSEEEMQQEMAQQA----QAQQE--------A----MLNEGV 500 (515) Q Consensus 442 ~~a~~~p~~~d~id~---d~~~~~~a~~~Gv--p~~~irs~eev~~~rq~~~----~~~q~--------~----~~~~~~ 500 (515) .+.+.+ ...+.++. .++...++...|+ +..++......+++..+.+ +.++. + +..++. T Consensus 540 ~l~~~~-~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~ 618 (705) T protein:vir:88 540 AVVGGG-GLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALA 618 (705) T ss_pred Hhhccc-chhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHH Confidence 555442 33333433 3455556665554 2334333222222111000 00000 0 000000 Q ss_pred hhhccchhhhhh--ccC Q lcl|NC_020414. 501 AKAVPGVIQQEM--KEG 515 (515) Q Consensus 501 ~~a~~~~~~~~~--~~~ 515 (515) .++......++. ++. T Consensus 619 ~q~e~q~~q~E~q~~q~ 635 (705) T protein:vir:88 619 KQAEAQMKQVEAQIRLA 635 (705) T ss_pred HHHHHHHHHHHHHHHHH Confidence 000000000000 000 No 31 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=99.91 E-value=5.2e-22 Score=137.26 Aligned_cols=494 Identities=10% Similarity=0.011 Sum_probs=250.1 Q ss_pred CCCcc---------cccc----cc---HHHHHHHHHHHHHhhhhHH---HHHHHHHHhhcccccCCCCCCcccccccccc Q lcl|NC_020414. 1 MQDTI---------LEYG----GQ---RSKIPKLWEKFSKKRSPYL---DRAKHFAKLTLPYLMNNKGDNETSQNGWQGV 61 (515) Q Consensus 1 ~~~~~---------~~~~----~~---~~~l~~r~~~lk~~R~~~e---~~w~e~~~~~~P~~~~~~~~~~~~~~~~dst 61 (515) =.+|+ ..+. ++ ...|.+-++..+....+.. ..|-+++-|. ....++. ...+..+.... T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~grs~vv~~~ 78 (763) T protein:vir:95 2 EQNTDSMVPLPDPSQATKLTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIE-GKAKPPK--VKGRSQVQPKL 78 (763) T ss_pred CcCccCcCCCccccchhcCCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhcc-ccCcccc--cCCCccccCHH Confidence 01111 1111 11 2233333333322222222 2354444333 1111111 12233567777 Q ss_pred HHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh Q lcl|NC_020414. 62 GAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVA 141 (515) Q Consensus 62 ~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~ 141 (515) -...++-+-+.|+-.+++ +..||++.|..+...+. ......++ -+-....++-+..++.++++.+.. T Consensus 79 v~~~ve~~~~~l~~~f~~-~~~~~~~~P~~~~D~~~------A~q~t~~~------n~~~~~~~~~~~~~~~~~~~~l~~ 145 (763) T protein:vir:95 79 VRRQAEWRYSALTEPFLG-SNKLFKVTPVTWEDVQG------ARQNELVL------NYQFRTKLNRVSFIDNYVRSVVDD 145 (763) T ss_pred HHHHHHHHHHHHHHhhcC-CCcEEEEecCCcchHHH------HHHHHHHH------HHHHhhcCchhhHHHHHHHHHhhc Confidence 788999999999999988 77899999877654321 11111111 112345677788899999999999 Q ss_pred CceEE--EEeCC--------------------------------------------------------C----------- Q lcl|NC_020414. 142 GNCLL--YKPSK--------------------------------------------------------G----------- 152 (515) Q Consensus 142 G~~~l--~~d~~--------------------------------------------------------~----------- 152 (515) ||+++ |-+.. . T Consensus 146 ~~gv~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (763) T protein:vir:95 146 GTGIVRVGWNREIRKEKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGT 225 (763) T ss_pred CcceEEEeeeeeeeeeeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccc Confidence 99864 22100 0 Q ss_pred -------------cEEEEEcceEEEeeCCCCCee---EEEEEEEecHHHHHHHh-cccc-c----ch-h-----h----- Q lcl|NC_020414. 153 -------------AMSAVPMHHYVVNRDTNGDLM---DVILLQEKALRTFDPAT-RMAI-E----VG-M-----K----- 199 (515) Q Consensus 153 -------------~~r~~pl~~y~v~~d~~G~vd---~i~r~~~~t~~ql~~~~-~~~~-~----~~-~-----~----- 199 (515) .++.+|..+|+|..++.+.++ -+++++.+|..+|...- +... . +. . . T Consensus 226 ~~~~~~~~~k~~p~ie~V~p~d~~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~ 305 (763) T protein:vir:95 226 TTTEVEVPLANHPTVEMLNPENIIIDPSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTT 305 (763) T ss_pred eeEEEEEEecCceEEEeecHHHheecCCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccc Confidence 012366778888888776444 35789999999996531 1110 0 00 0 0 Q ss_pred --h-cc-CCCcccEEEEEEEEE---cCCCCeEEEE-EeCCeeecc--cCCcccccCcEEEEeeeecCCCccccchHHHHH Q lcl|NC_020414. 200 --G-KK-CKEDDNVKLYTHAQY---AGEGFWKINQ-SADDIPVGK--ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYS 269 (515) Q Consensus 200 --~-~~-~~~~~~v~v~~~v~~---~~~~~~~~~~-e~~~~~i~~--esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l 269 (515) . .. .....+|.+|.|..+ ++++.+++|. -..|..+++ ++-|+.+.|||+++.+.+.++..||.|.++.+. T Consensus 306 ~~~~~~~d~~~~~V~v~E~y~~~d~~gdg~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~~~~ 385 (763) T protein:vir:95 306 PQEFQISDPMRKRVVAYEYWGFWDIEGNGVLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAELLG 385 (763) T ss_pred hhhccCCCcccceEEEEEeeeeeccCCcceeEEEEEEEEcCeeeecccccccCCCcCEEEecceeecCcccCCchHHHhh Confidence 0 00 111246777776543 4566666654 334445544 455666679999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCCccee---cCCccc--ccccccCC-ccchHHHHHHHHH Q lcl|NC_020414. 270 GDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVI---TGVEED--IHIVQLGK-YADLTPISAVLEV 343 (515) Q Consensus 270 ~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~---~g~~~~--v~~~~~~~-~~~l~~~~~~i~~ 343 (515) +..+.+|++.+..+..+..+++|.|+++.+. .+..+.....+|.++ +|.... +.+...+. .+.+..+.+.++. T Consensus 386 d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~ga-v~~~d~~~~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~~~~ 464 (763) T protein:vir:95 386 DNQAVLGAVMRGMIDLLGRSANGQRGMPKGM-LDALNSRRYREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATLQNQ 464 (763) T ss_pred HHHHHHHHHHHHHHHHHHhhcCCcEEeeccc-ccchhhhcccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHHHHH Confidence 9999999999999999999999999997655 454444444555554 333322 22222211 1233333333333 Q ss_pred HHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH---h----------cCC---C Q lcl|NC_020414. 344 YTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ---E----------AGD---S 407 (515) Q Consensus 344 ~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~---~----------~~~---~ 407 (515) ..+.+.-.--+......++...||++|..+.+.....+..++.++.. .+.+++++++. . .++ + T Consensus 465 ~~e~~TGv~~~~~G~~~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~-~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~ 543 (763) T protein:vir:95 465 EAESLTGVKAFAGGVTGESYGDVAAGIRGVLDAASKREMAILRRLAK-GMSEIGNKIIAMNAVFLAEHEVVRITNEEFVT 543 (763) T ss_pred HHHHhhCcchhhcCcCcccccchhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhCCCCcEEEEeCCcccc Confidence 33322212122222233444579999999888888888777766654 67888776532 1 111 2 Q ss_pred CChhhccce--eeeehHH-HHHHHHHHHHHHHHHHHHHhhcCChHH--------HhcCCHHHHHHHHHHhcCCchhccCC Q lcl|NC_020414. 408 FTSELVDPV--IVTGIEA-LGRMAELDKLANFAQYMSLPQTWPEPA--------QRAIRWGDYMDWVRGQISAELPFLKS 476 (515) Q Consensus 408 ~p~~~~~~~--~v~~l~~-l~ra~~~~~l~~~~~~v~~~a~~~p~~--------~d~id~d~~~~~~a~~~Gvp~~~irs 476 (515) +.+.+.... ++..+++ -.+.+....+..+++.++. .+++.+ ++..+...+++.+.....-|..+-.- T Consensus 544 v~~~~~~~~~DV~V~~~~as~~~q~~~~l~~ll~~l~~--~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~d~~~q~ 621 (763) T protein:vir:95 544 IKREDLKGNFDLEVDISTAEVDNQKSQDLGFMLQTIGP--NVDQQITLNILAEIADLKRMPKLAHDLRTWQPQPDPVQEQ 621 (763) T ss_pred ccHHHhcCCcceEEecccchHHHHHHHHHHHHHHHhcc--ccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCccchhhh Confidence 222222222 2222233 2234444455555554432 233332 23344444444433332222221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh-hhcc------------chhhhhhccC Q lcl|NC_020414. 477 EEEMQQEMAQQAQAQQEAMLNEGVA-KAVP------------GVIQQEMKEG 515 (515) Q Consensus 477 ~eev~~~rq~~~~~~q~~~~~~~~~-~a~~------------~~~~~~~~~~ 515 (515) ..+.++.+ ++++++++++.++.+. +|+. ...-..+++. T Consensus 622 qaqle~~~-~q~e~~~~~akaq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q~~ 672 (763) T protein:vir:95 622 LKQLAVEK-AQLENEELRSKIRLNDAQAQKAMAERDNKNLDYLEQESGTKHA 672 (763) T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111110 0000000000000000 0000 0000001111 No 32 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.67 E-value=1.7e-14 Score=96.10 Aligned_cols=490 Identities=12% Similarity=0.024 Sum_probs=233.0 Q ss_pred CCCccccc-------cc----cHH---HHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCC-Cc-----cccccccc Q lcl|NC_020414. 1 MQDTILEY-------GG----QRS---KIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGD-NE-----TSQNGWQG 60 (515) Q Consensus 1 ~~~~~~~~-------~~----~~~---~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~-~~-----~~~~~~ds 60 (515) |++...+. .. .++ +|.++|..-...-..|...+.+-.+|..-. ..+.. .. ......-+ T Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~--Qw~~~~~~~l~~~g~p~~~~N 99 (776) T protein:vir:93 22 LSPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNI--QWSQDEIDELKERGQAPTVYN 99 (776) T ss_pred CCCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCC--CCCHHHHHHHHhcCCceEEec Confidence 32222111 11 112 333444444333444555555555664211 11110 00 11112222 Q ss_pred cHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHh Q lcl|NC_020414. 61 VGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIV 140 (515) Q Consensus 61 t~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~ 140 (515) .-...++.+.+..-. +++=+++.+.++... ++.+. ++..+......+++..+...+|.+.+. T Consensus 100 ~i~~~i~~v~g~~~~-----nr~~~~~~p~~~~d~----------~~Ae~---l~~~~~~~~~~~~~~~~~~~af~d~~~ 161 (776) T protein:vir:93 100 VISQSVNWIIGSEKR-----GRSDFKVLPRRKDGG----------KAAER---KTALLKYLSDVNHTPFERSMAFEETTK 161 (776) T ss_pred chHHHHHHHHHHHHh-----CCcceEEecCChhHH----------HHHHH---HHHHHHHHHHhhcHHHHHHHHHHHhhh Confidence 223333333322222 566677777543321 12233 333455556788999999999999999 Q ss_pred hCceEE--EEeCCC---c--EEEEEcceEEEeeCCCC----CeeEEEEEEEecHHHHHHHhcccccchhh-----h---- Q lcl|NC_020414. 141 AGNCLL--YKPSKG---A--MSAVPMHHYVVNRDTNG----DLMDVILLQEKALRTFDPATRMAIEVGMK-----G---- 200 (515) Q Consensus 141 ~G~~~l--~~d~~~---~--~r~~pl~~y~v~~d~~G----~vd~i~r~~~~t~~ql~~~~~~~~~~~~~-----~---- 200 (515) .|.|++ +.|.+. . .+.++..++++..++.- ...-+|++.++|.+++...|++....-.. . T Consensus 162 ~G~G~~~v~~d~~~~~~~~~~~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~ 241 (776) T protein:vir:93 162 AGIGWLESQVQDENDGEPIYAGAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWG 241 (776) T ss_pred cCcceEEEEeeccCCCCceEeeccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccc Confidence 998875 344321 1 23456667777655532 23347889999999999998854221100 0 Q ss_pred -------------------------ccCCCcccEEEEEEEEEcCC------------------------------CC--- Q lcl|NC_020414. 201 -------------------------KKCKEDDNVKLYTHAQYAGE------------------------------GF--- 222 (515) Q Consensus 201 -------------------------~~~~~~~~v~v~~~v~~~~~------------------------------~~--- 222 (515) ........|.|+.+.++.+. |. T Consensus 242 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~ 321 (776) T protein:vir:93 242 TDDIDGDDAMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVL 321 (776) T ss_pred hhcccccccccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceee Confidence 00011235666666443210 10 Q ss_pred -----e--EEEEEeCCeeecc--cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|NC_020414. 223 -----W--KINQSADDIPVGK--ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIK 293 (515) Q Consensus 223 -----~--~~~~e~~~~~i~~--esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~ 293 (515) . .+++.. |.++++ .+-|+.+.|||++.-....+.+-||.|.+....+-.+.+|++....+.. ..+.+ T Consensus 322 ~~~~~~~v~~~~~~-g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~---l~~~~ 397 (776) T protein:vir:93 322 AVSPMMRMHCAIMT-TRDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYI---LSTNK 397 (776) T ss_pred hheeeeeeEEEEEe-cchhhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHh---hcCCc Confidence 0 122222 333443 3557767899999999999999999999999999999999888776654 34567 Q ss_pred eeecCccccChhhccC--CCCcceecCCcccccccccCCccch-HHHHHHHHHHHHHHHHHH--HHHhhccCCCCCCCHH Q lcl|NC_020414. 294 YLIRPGSQTDVDHFVN--SGTGEVITGVEEDIHIVQLGKYADL-TPISAVLEVYTRRIGVIF--MMETMTRRDAERVTAV 368 (515) Q Consensus 294 ~l~~~~g~~~~~~~~~--~~~g~~~~g~~~~v~~~~~~~~~~l-~~~~~~i~~~~~rI~~af--l~~~l~~~~~~~~TAt 368 (515) +++..+.+-+.+.+.. +.+|.++....+......+....++ +...+.++...+.|+..- .--++... +...+.. T Consensus 398 ~~~~~gav~~~d~~~~~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~-~n~~Sg~ 476 (776) T protein:vir:93 398 VLMEEGAVDDIDEFRREAARPDAVMTVKNGKLGAVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRT-TNAVSGV 476 (776) T ss_pred eeeccccccchHHHHHhcccCCceeeeCCccccccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCC-cchhhHH Confidence 8888777777776554 4566665544444333333222222 233444444555444431 11122322 2346788 Q ss_pred HHHHHHHHHHHHhhhhHHHHHHHH------HHHHHHHHHHh------cCCC-------C----Chhhc-----cceeeee Q lcl|NC_020414. 369 EIQRDALEIEQNMGGVYSLFAMTM------QTPIAMWGLQE------AGDS-------F----TSELV-----DPVIVTG 420 (515) Q Consensus 369 Ei~~r~~E~~~~LGpv~~rl~~E~------l~Pli~r~~~~------~~~~-------~----p~~~~-----~~~~v~~ 420 (515) -|..|.+.-...+..++.++..-+ +.-||...+.. .+.. + +..++ .+.+..+ T Consensus 477 ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~ 556 (776) T protein:vir:93 477 AIQARQEQGSVATNKLFDNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEA 556 (776) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeec Confidence 899999888888888888776533 22233222210 0110 0 11111 1112122 Q ss_pred -hHHHHHHHHHHHHHHHHHHHHH-h-hcCCh---HHHhcCCHHHHHHHHHHhcCCc--hhccCCHHHHHHHHHHH--HHH Q lcl|NC_020414. 421 -IEALGRMAELDKLANFAQYMSL-P-QTWPE---PAQRAIRWGDYMDWVRGQISAE--LPFLKSEEEMQQEMAQQ--AQA 490 (515) Q Consensus 421 -l~~l~ra~~~~~l~~~~~~v~~-~-a~~~p---~~~d~id~d~~~~~~a~~~Gvp--~~~irs~eev~~~rq~~--~~~ 490 (515) -++..|.+..+.+.++++.+.. + ..+.+ +.++.-+.+++.+.+-...+-+ ..--..+++.+....+. ++. T Consensus 557 ~~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~~~q~ 636 (776) T protein:vir:93 557 EWRATMRQAAVAELMEVIGKMPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQQQQY 636 (776) T ss_pred ccchhHHHHHHHHHHHHHhhcChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhHHHHHHHhhHHHHH Confidence 2344466665555544432210 0 00111 1233335566666666665532 12222222221111100 000 Q ss_pred HHHHHH----HHHhhhhccchh-----hhhh--------------ccC Q lcl|NC_020414. 491 QQEAML----NEGVAKAVPGVI-----QQEM--------------KEG 515 (515) Q Consensus 491 ~q~~~~----~~~~~~a~~~~~-----~~~~--------------~~~ 515 (515) ++.++. .+++..+...+- ++.. +++ T Consensus 637 q~~~~~a~~~~~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~a~q~a 684 (776) T protein:vir:93 637 NDALAIATLEEQQAKARKAAAEAQVAEAKAKHISRMAIREGVGAVKDA 684 (776) T ss_pred HHHHhhhhhhHhhHHHHHHHHHHHHHhhhhhhhhhcchhhhhhhhhhh Confidence 000000 000000000000 0000 000 No 33 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.52 E-value=3e-12 Score=83.73 Aligned_cols=502 Identities=11% Similarity=0.038 Sum_probs=227.3 Q ss_pred CCCcccccc-------------c---cHHHHHHHHHHHHHhhhhHHHHHHHH----HHhhcccccCCCCC-C-----ccc Q lcl|NC_020414. 1 MQDTILEYG-------------G---QRSKIPKLWEKFSKKRSPYLDRAKHF----AKLTLPYLMNNKGD-N-----ETS 54 (515) Q Consensus 1 ~~~~~~~~~-------------~---~~~~l~~r~~~lk~~R~~~e~~w~e~----~~~~~P~~~~~~~~-~-----~~~ 54 (515) ||+.--++- . +.+.+..++-..-.....+...|++- .+|..- ...+.. . ... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G--~Qw~~~~~~~l~~~g~ 78 (711) T protein:vir:10 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGG--EQWPSQVRTERELEQR 78 (711) T ss_pred CCcccccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCC--CCCCHHHHHHHHhcCC Confidence 433211110 1 22222222222222234445555433 334311 011110 0 001 Q ss_pred cc-cccccHHHHHHHHHHHHHHhhcCCCCCceecCCChH------------HHhhhhccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 55 QN-GWQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAK------------GEKVLDDRGLKKTQLATIFARVETTAMKA 121 (515) Q Consensus 55 ~~-~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~------------~~~~~~~~~~~~~~v~~~L~~ve~~~~~~ 121 (515) .. .|+=++.. ++...+..- .+++=+++.|.+. ........+..-.++.+.| +..+... T Consensus 79 p~~~~N~i~~~-v~~v~g~~~-----~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l---~~~~~~~ 149 (711) T protein:vir:10 79 PCLVNNVLPTF-VDQVLGDQR-----QNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVF---TGLIKNI 149 (711) T ss_pred CcEEEcchHHH-HHHHhhhHh-----hCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHH---HHHHHHH Confidence 11 13333332 222222221 1333333333210 0111111111112233333 3334455 Q ss_pred HHhcCCHHHHHHHHHHHHhhCceEE--EEeC---C---Cc--EEEEE-cceEEEeeCC---CCC-eeEEEEEEEecHHHH Q lcl|NC_020414. 122 LEQRQFRPAIVEVFKHLIVAGNCLL--YKPS---K---GA--MSAVP-MHHYVVNRDT---NGD-LMDVILLQEKALRTF 186 (515) Q Consensus 122 l~~snf~~~~~~~~~dl~~~G~~~l--~~d~---~---~~--~r~~p-l~~y~v~~d~---~G~-vd~i~r~~~~t~~ql 186 (515) ...++...+...+|.+.+..|.|++ +.|. + .- ++.++ ..++++..++ ++. ..-+|++.+|+.+++ T Consensus 150 ~~~~~~~~~~s~af~d~~~~G~G~~ev~~d~~~~d~~~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~ 229 (711) T protein:vir:10 150 EYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCLIDDTMSKEKF 229 (711) T ss_pred HHhcChhHHHHHHHHHhhhcCcceEEEEecccCCCCCCCCeEEeeecChhheeeCccccccChhhhcceeeeecCCHHHH Confidence 6678899999999999988888864 2221 1 11 33332 3456664433 332 444899999999999 Q ss_pred HHHhcccccchhhh-ccCC-----CcccEEEEEEEEEcC---------C-------------------C----------C Q lcl|NC_020414. 187 DPATRMAIEVGMKG-KKCK-----EDDNVKLYTHAQYAG---------E-------------------G----------F 222 (515) Q Consensus 187 ~~~~~~~~~~~~~~-~~~~-----~~~~v~v~~~v~~~~---------~-------------------~----------~ 222 (515) ...|+......... ...+ ..+.|.+....++.+ + + . T Consensus 230 ~~~yp~~a~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 309 (711) T protein:vir:10 230 KALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKT 309 (711) T ss_pred HHhCCchhhhhhhcccccccCcccCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhce Confidence 99998653221110 0011 113343332221110 0 0 0 Q ss_pred eEEEE-EeCCeeec-ccCCcccccCcEEEE--eeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecC Q lcl|NC_020414. 223 WKINQ-SADDIPVG-KENRIKAEKLPFIPL--TWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRP 298 (515) Q Consensus 223 ~~~~~-e~~~~~i~-~esgy~~~~~P~~~~--Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~ 298 (515) .++|. -..|.+++ ..+-|+...|||+++ .+...++..++.|.+....+-.+.+|++....+..+....+++|++.+ T Consensus 310 ~~v~~~~~~G~~~L~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~ 389 (711) T protein:vir:10 310 FKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSE 389 (711) T ss_pred eeEEEEEEecceeecCCCCCCCCcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecC Confidence 12222 12344444 334576666999865 355677888888899999999999999999999999999999999988 Q ss_pred ccccChhhcc---CCCCcceecCCcccc--cccccCCccc-hHHHHHHHHHHHHHHHHHH--HHHhhccCCCCCCCHHHH Q lcl|NC_020414. 299 GSQTDVDHFV---NSGTGEVITGVEEDI--HIVQLGKYAD-LTPISAVLEVYTRRIGVIF--MMETMTRRDAERVTAVEI 370 (515) Q Consensus 299 ~g~~~~~~~~---~~~~g~~~~g~~~~v--~~~~~~~~~~-l~~~~~~i~~~~~rI~~af--l~~~l~~~~~~~~TAtEi 370 (515) +.+-+.+... .+.+|.++.-+++.. .+++....+. -+.....++...+.|.+.- .-.++.... ...|..-| T Consensus 390 gai~~~~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~-n~~Sg~ai 468 (711) T protein:vir:10 390 GNVEGREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMG-NETSGRAI 468 (711) T ss_pred cccCChHHHHHhccccCCCeeEecccccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCc-cchHHHHH Confidence 8887766532 355666654333321 1222222222 2334455555555554431 111233333 34788999 Q ss_pred HHHHHHHHHHhhhhHHHHHHH------HHHHHHHHHHHh------cCCCCChh--------------------hc----- Q lcl|NC_020414. 371 QRDALEIEQNMGGVYSLFAMT------MQTPIAMWGLQE------AGDSFTSE--------------------LV----- 413 (515) Q Consensus 371 ~~r~~E~~~~LGpv~~rl~~E------~l~Pli~r~~~~------~~~~~p~~--------------------~~----- 413 (515) ..|.+.-...|...+.++..- ++.-||...+.. .+..-+.. ++ T Consensus 469 ~~~q~qg~~~l~~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~ 548 (711) T protein:vir:10 469 IARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKY 548 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccceeee Confidence 999999888888888777642 222333322210 01100000 01 Q ss_pred ccee-eeehHHHHHHHHHHHHHHHHHHHHHhhc-CChHHH---hcCCHHHHHHHHHHhcCCchhccCCHH----HHHHHH Q lcl|NC_020414. 414 DPVI-VTGIEALGRMAELDKLANFAQYMSLPQT-WPEPAQ---RAIRWGDYMDWVRGQISAELPFLKSEE----EMQQEM 484 (515) Q Consensus 414 ~~~~-v~~l~~l~ra~~~~~l~~~~~~v~~~a~-~~p~~~---d~id~d~~~~~~a~~~Gvp~~~irs~e----ev~~~r 484 (515) .+.+ +.+-.+-.|.+....+.++++.+..+.. +.+.++ |.-+.++++..+....+-+ ....... +.++.. T Consensus 549 Dv~i~~~p~~~s~r~~~~~~l~ql~~~~p~~~~~~~~~il~~~d~p~~~el~e~lr~~~~~~-~~~~~~~~~~qq~~~e~ 627 (711) T protein:vir:10 549 DVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPN-VLSKDEREAIEEDMPEQ 627 (711) T ss_pred EEEEeeccCchhHHHHHHHHHHHHHhhcchhhhHHHHHHHHhcCCCCHHHHHHHHHhhcCcc-cCcchhhhHHHHHHHHH Confidence 1111 2333455566655655555543322111 222233 4445677777777665543 2222111 111111 Q ss_pred HHHHHHHHHHHHHHHhh---------hhccc---hhhhhhccC Q lcl|NC_020414. 485 AQQAQAQQEAMLNEGVA---------KAVPG---VIQQEMKEG 515 (515) Q Consensus 485 q~~~~~~q~~~~~~~~~---------~a~~~---~~~~~~~~~ 515 (515) +++.++.|.++...++. +|... ..++..+.+ T Consensus 628 qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~qa~~e~~~~q 670 (711) T protein:vir:10 628 TEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQ 670 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111001000000000 00000 000000000 No 34 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.38 E-value=2.6e-11 Score=78.60 Aligned_cols=489 Identities=11% Similarity=0.035 Sum_probs=230.2 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhc-ccccCCCCCCc------c---ccc-cccccHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTL-PYLMNNKGDNE------T---SQN-GWQGVGAQATNHL 69 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~-P~~~~~~~~~~------~---~~~-~~dst~~~a~~~L 69 (515) ||++.... ..++..||.......+.|-..+.+=.+|.. +..-.+..... . ... .|+=++.. ++.. T Consensus 1 m~~~~~~~---~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~-v~~v 76 (708) T protein:vir:10 1 MAETLEKK---HERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATE-LNRI 76 (708) T ss_pred CchhHHHH---HHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHH-HHHH Confidence 99986432 466777777766555566555555444432 21111110000 0 111 13433332 3322 Q ss_pred HHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE- Q lcl|NC_020414. 70 ANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK- 148 (515) Q Consensus 70 aa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~- 148 (515) .+.-. .+++=+++.|.++.- + .++.+.| +..+......++...+...+|.+.+..|-|++-+ T Consensus 77 ~g~~~-----~nr~d~~v~P~~~~~--------d-~~~Ae~l---~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~ 139 (708) T protein:vir:10 77 IAEYR-----NNRITVKFRPGDREA--------S-EELANKL---NGLFRADYEETDGGEACDNAFDDAATGGFGCFRLT 139 (708) T ss_pred HHHHH-----hCCcceEEEcCCCCc--------h-HHHHHHH---HHHHHHHHHhcCchHHHHHHHHhhhhcccceeeee Confidence 22221 255666666654221 0 1223333 3334455567899999999999999988886532 Q ss_pred -e---CC------Cc--EEE--EEcceEEEeeCC---CCC-eeEEEEEEEecHHHHHHHhcccccchh--hh--ccCC-- Q lcl|NC_020414. 149 -P---SK------GA--MSA--VPMHHYVVNRDT---NGD-LMDVILLQEKALRTFDPATRMAIEVGM--KG--KKCK-- 204 (515) Q Consensus 149 -d---~~------~~--~r~--~pl~~y~v~~d~---~G~-vd~i~r~~~~t~~ql~~~~~~~~~~~~--~~--~~~~-- 204 (515) | +. .+ +++ .|..++++.-++ ++. -.-+||..+|+.+++...|++...... .. ..+. T Consensus 140 ~d~~~e~d~~~~~~~i~i~~~~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~ 219 (708) T protein:vir:10 140 SMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNW 219 (708) T ss_pred eccccccCCCCCccccceEEeecchhhcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccc Confidence 1 11 12 222 244455544332 221 234788999999999999986533210 00 0000 Q ss_pred -CcccEEEEEE-----------EEEcC----------C------------CC----------eEEEE-EeCCeeeccc-C Q lcl|NC_020414. 205 -EDDNVKLYTH-----------AQYAG----------E------------GF----------WKINQ-SADDIPVGKE-N 238 (515) Q Consensus 205 -~~~~v~v~~~-----------v~~~~----------~------------~~----------~~~~~-e~~~~~i~~e-s 238 (515) ..+.|.|... +.+++ + |+ ..||. ...|..++.+ + T Consensus 220 ~~~d~v~v~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~ 299 (708) T protein:vir:10 220 FGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPR 299 (708) T ss_pred cCCCceEEEEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCC Confidence 0011211111 11111 0 00 11222 2345555533 4 Q ss_pred CcccccCcEEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhc--------- Q lcl|NC_020414. 239 RIKAEKLPFIPLTWKR--SYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHF--------- 307 (515) Q Consensus 239 gy~~~~~P~~~~Rw~~--~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~--------- 307 (515) -|+...|||+++-+.. ..|..++.|.+....+-.+.+|+..-..+..+..+-+.+++++++.+.....- T Consensus 300 ~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~ 379 (708) T protein:vir:10 300 RIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRP 379 (708) T ss_pred CCCCCceeeEEEeeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccch Confidence 4666678988774433 36677778999999999999999998999988888888888877765433211 Q ss_pred -------cCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHH--HHHhhccCCCCCCCHHHHHHHHHHHH Q lcl|NC_020414. 308 -------VNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIF--MMETMTRRDAERVTAVEIQRDALEIE 378 (515) Q Consensus 308 -------~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~af--l~~~l~~~~~~~~TAtEi~~r~~E~~ 378 (515) ....+|.++++... +-.+....--+.....++...+.|.+.- .-.++.+.+ ..+..-|..|++.-. T Consensus 380 ~~~~~~~~~~~~G~~~~~~~~---~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~s--n~SG~aI~~rq~qg~ 454 (708) T protein:vir:10 380 AFLPLREVRDKSGNIIAGATP---AGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPS--NIAQETVNNLMNRAD 454 (708) T ss_pred hhhccccccccccccccccCC---ccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCcc--chHHHHHHHHHHHHH Confidence 11223333332211 1011111111223344444444454442 222334432 358888999999999 Q ss_pred HHhhhhHHHHHH------HHHHHHHHHHHHh------cCC--------------CC-Chh-----hccc---eee---ee Q lcl|NC_020414. 379 QNMGGVYSLFAM------TMQTPIAMWGLQE------AGD--------------SF-TSE-----LVDP---VIV---TG 420 (515) Q Consensus 379 ~~LGpv~~rl~~------E~l~Pli~r~~~~------~~~--------------~~-p~~-----~~~~---~~v---~~ 420 (515) ..+...+.+|.. +++.-||...+.. .++ ++ .+. ++.. .++ .+ T Consensus 455 ~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p 534 (708) T protein:vir:10 455 MASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGP 534 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEeccc Confidence 999999887663 3445555443321 010 01 110 1111 122 23 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHh----hcCChHHHhc---CCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 421 IEALGRMAELDKLANFAQYMSLP----QTWPEPAQRA---IRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQE 493 (515) Q Consensus 421 l~~l~ra~~~~~l~~~~~~v~~~----a~~~p~~~d~---id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~ 493 (515) -.+-.|.+..+.+.++++.+... +.+-+-+++. -+.++++..+-..++.+...--..++.+++.++.++++|+ T Consensus 535 ~~~s~r~~~~~~l~qll~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~ 614 (708) T protein:vir:10 535 SYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQS 614 (708) T ss_pred CchhHHHHHHHHHHHHHHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHH Confidence 34455666666666555544321 1111223333 3455677777666654332211222222222222222211 Q ss_pred H--HHHHHhhh--hccc-----hhhhh------hccC Q lcl|NC_020414. 494 A--MLNEGVAK--AVPG-----VIQQE------MKEG 515 (515) Q Consensus 494 ~--~~~~~~~~--a~~~-----~~~~~------~~~~ 515 (515) + ++++++.+ +... ..++. ..+. T Consensus 615 q~~~~~~e~qa~~~~~qAe~~ka~a~a~~~~~~a~q~ 651 (708) T protein:vir:10 615 QPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTA 651 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 00000000 0000 00000 0000 No 35 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=99.29 E-value=1.9e-10 Score=73.85 Aligned_cols=487 Identities=10% Similarity=0.026 Sum_probs=224.3 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCC-Ccc---ccc-cccccHHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGD-NET---SQN-GWQGVGAQATNHLANKLAQ 75 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~-~~~---~~~-~~dst~~~a~~~Laa~l~s 75 (515) ||+++.. -.++..||.........|-....+=.+|..-. ..+.. ... ..+ .|+-++. .++. +.+ T Consensus 1 m~d~~~~----~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~--Qw~~~~~~~l~~q~rp~~N~i~~-~i~~----v~g 69 (725) T protein:vir:92 1 MADNENR----LESILSRFDADWTASDEARREAKNDLFFSRIS--QWDDWLSQYTTLQYRGQFDVVRP-VVRK----LVS 69 (725) T ss_pred CCchHHH----HHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCC--CCCHHHHHHHHhcCCCcccchHH-HHHH----HHh Confidence 9998754 47778888877766666666655556665311 11110 000 011 2333332 2222 222 Q ss_pred hhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE-----EEeC Q lcl|NC_020414. 76 VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL-----YKPS 150 (515) Q Consensus 76 ~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l-----~~d~ 150 (515) .-- .+++=+++.|.++... ++.+.|.. .+......|+..-+-..+|.+.+..|.|.+ |.++ T Consensus 70 ~e~-~nr~d~~v~P~~~~d~----------~~Ae~l~~---~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~ 135 (725) T protein:vir:92 70 EMR-QNPIDVLYRPKDGASP----------DAADVLMG---MYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQ 135 (725) T ss_pred hHH-hCCcceEEecCCccHH----------HHHHHHHH---HHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCC Confidence 211 2555566666543221 23333332 344445689999999999999998888864 2222 Q ss_pred CC-----cEEEEE----cceEEEeeCCC---CC-eeEEEEEEEecHH---HHHHHhcccccchhhh---cc----CCCcc Q lcl|NC_020414. 151 KG-----AMSAVP----MHHYVVNRDTN---GD-LMDVILLQEKALR---TFDPATRMAIEVGMKG---KK----CKEDD 207 (515) Q Consensus 151 ~~-----~~r~~p----l~~y~v~~d~~---G~-vd~i~r~~~~t~~---ql~~~~~~~~~~~~~~---~~----~~~~~ 207 (515) +. .+++.| +.++++..++. +. -.-+||..+|+.. ++.++|+.+..+..-. .. ....+ T Consensus 136 d~~~~~~~i~~~~i~~~~~~V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 215 (725) T protein:vir:92 136 SPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQD 215 (725) T ss_pred CCCCCceeeEEeeccCChhhcccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCC Confidence 21 245544 34455544432 21 1125677788865 5566777533221110 00 01123 Q ss_pred cEEEEEEEEEc-----------C----------------------CCC----------eEEEEE-eCCeeecccCC-ccc Q lcl|NC_020414. 208 NVKLYTHAQYA-----------G----------------------EGF----------WKINQS-ADDIPVGKENR-IKA 242 (515) Q Consensus 208 ~v~v~~~v~~~-----------~----------------------~~~----------~~~~~e-~~~~~i~~esg-y~~ 242 (515) .|.|..+.++. + .|+ .++|.. ..|.+++.... |+. T Consensus 216 ~vrv~e~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~ 295 (725) T protein:vir:92 216 TIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAG 295 (725) T ss_pred eEEEEEEEEEEEEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCC Confidence 34443332211 1 010 123322 35556665433 444 Q ss_pred ccCcEEEEeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCCcc--e--- Q lcl|NC_020414. 243 EKLPFIPLTWK--RSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGTGE--V--- 315 (515) Q Consensus 243 ~~~P~~~~Rw~--~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~--~--- 315 (515) ..|||+++-.. ...|..|+.|.+....+-.+.+|+..-..+..+....+.++++..+-+-........+.+. + T Consensus 296 ~~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 375 (725) T protein:vir:92 296 EHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLN 375 (725) T ss_pred CceeeEEEEeeeeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCccceeecc Confidence 45899875322 3689999999999999999999999999998888888888888765442212111111111 1 Q ss_pred -ecCCccc--ccccccCCccch-HHHHHHHHHHHHHHHHHH--HHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHH Q lcl|NC_020414. 316 -ITGVEED--IHIVQLGKYADL-TPISAVLEVYTRRIGVIF--MMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFA 389 (515) Q Consensus 316 -~~g~~~~--v~~~~~~~~~~l-~~~~~~i~~~~~rI~~af--l~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~ 389 (515) ++...+. ..++..-..+.+ +.....++..++.|++.- ...++.+..+ .++.--|..|++.-...|...+..|. T Consensus 376 ~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n-~~SG~ai~~rq~qg~~~l~~~~Dnl~ 454 (725) T protein:vir:92 376 RTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGG-QVAYDTVNQLNMRADLETYVFQDNLA 454 (725) T ss_pred ccccccccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCch-hhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111111 112221112222 344556666666665552 1123344333 35666788999888888888876655 Q ss_pred ------HHHHHHHHHHHHHhc------CC-----------CCCh----h-----hc--ccee---eeehHHHHHHHHHHH Q lcl|NC_020414. 390 ------MTMQTPIAMWGLQEA------GD-----------SFTS----E-----LV--DPVI---VTGIEALGRMAELDK 432 (515) Q Consensus 390 ------~E~l~Pli~r~~~~~------~~-----------~~p~----~-----~~--~~~~---v~~l~~l~ra~~~~~ 432 (515) -+++.-||...+... ++ ..+. . ++ +-.+ +.+-.+-.|.+.... T Consensus 455 ~~~~~~g~~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~ 534 (725) T protein:vir:92 455 TAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAE 534 (725) T ss_pred HHHHHHHHHHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHH Confidence 344555555443210 00 0000 0 01 1111 223334445555555 Q ss_pred HHHHHHHHHHhhcCCh----HHHhcCCH---HHHHHHHHHhcCCchhc--cCCHHHHHHHHHHHHHHHHHHHHH----HH Q lcl|NC_020414. 433 LANFAQYMSLPQTWPE----PAQRAIRW---GDYMDWVRGQISAELPF--LKSEEEMQQEMAQQAQAQQEAMLN----EG 499 (515) Q Consensus 433 l~~~~~~v~~~a~~~p----~~~d~id~---d~~~~~~a~~~Gvp~~~--irs~eev~~~rq~~~~~~q~~~~~----~~ 499 (515) +.++++.+..++.... ..++..|. +++++.+....+. ... -.++++.+++.+++ +++++++.+ .+ T Consensus 535 l~ql~~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~-~~~~~~~~~e~~q~~~~~q-qa~~~q~~~e~~~~q 612 (725) T protein:vir:92 535 ILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQ-MGVKKPETPEEQQWLVEAQ-QAKQGQQDPAMVQAQ 612 (725) T ss_pred HHHHHHhcccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhch-hccCCccchhhhHHHHHHH-HHHHhhhHHHHHHHH Confidence 5554443332211111 12333343 3334444332222 111 11233322222211 111111000 00 Q ss_pred hh--hhccch---------hh-hh---hccC Q lcl|NC_020414. 500 VA--KAVPGV---------IQ-QE---MKEG 515 (515) Q Consensus 500 ~~--~a~~~~---------~~-~~---~~~~ 515 (515) +. ++.... +. +. ..+. T Consensus 613 a~~~~~qae~~kaqaE~~k~q~~a~~~~~~a 643 (725) T protein:vir:92 613 GVLLQGQAELAKAQNQTLSLQIDAAKVEAQN 643 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 000000 00 00 0000 No 36 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=99.25 E-value=3.4e-10 Score=72.52 Aligned_cols=488 Identities=10% Similarity=0.023 Sum_probs=223.0 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCC-Ccc---cccc-ccccHHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGD-NET---SQNG-WQGVGAQATNHLANKLAQ 75 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~-~~~---~~~~-~dst~~~a~~~Laa~l~s 75 (515) ||++++. -.++..||......-..|-....+=.+|..- ...+.. ... ..++ |+-++ ..++.+ .+ T Consensus 1 m~d~~~~----~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G--~QW~~~~~~~l~~q~rp~~N~i~-~~v~~v----~g 69 (725) T protein:vir:10 1 MADNENR----LESILSRFDADWTASDEARREAKNDLFFSRV--SQWDDWLSQYTTLQYRGQFDVVR-PVVRKL----VS 69 (725) T ss_pred CCchHHH----HHHHHHHHHHHHHhhHHHHHHHHHHHHhhcC--CCCCHHHHHHHHhcCCCcccchH-HHHHHH----Hh Confidence 9998754 3677777776665555555555555555531 111110 000 0122 33333 222222 22 Q ss_pred hhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE-----EEeC Q lcl|NC_020414. 76 VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL-----YKPS 150 (515) Q Consensus 76 ~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l-----~~d~ 150 (515) .-- .+++=+++.|.++... ++.+.|.. .+......++..-+-..+|.+.+..|.|++ |.++ T Consensus 70 ~e~-~nr~d~~v~p~~~~d~----------~~Ae~l~~---~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~ 135 (725) T protein:vir:10 70 EMR-QNPIDVLYRPKDGASP----------DAADVLMG---MYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQ 135 (725) T ss_pred hHH-hCCcceEEecCCcchH----------HHHHHHHH---HHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCC Confidence 211 2555566666543221 23333332 344445678999999999999998888874 3332 Q ss_pred CC-----cEEEEE----cceEEEeeCC---CCC-eeEEEEEEEecH---HHHHHHhcccccch---hhhcc----CCCcc Q lcl|NC_020414. 151 KG-----AMSAVP----MHHYVVNRDT---NGD-LMDVILLQEKAL---RTFDPATRMAIEVG---MKGKK----CKEDD 207 (515) Q Consensus 151 ~~-----~~r~~p----l~~y~v~~d~---~G~-vd~i~r~~~~t~---~ql~~~~~~~~~~~---~~~~~----~~~~~ 207 (515) +. .++.+| ..++++..++ ++. -.-+||..+|+- .++++.|+.+.... ..... ....+ T Consensus 136 d~~~~~~~i~~~~i~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 215 (725) T protein:vir:10 136 SPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQD 215 (725) T ss_pred CCCCCceeeeeeecccCHhHcccCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCC Confidence 21 134444 3445555443 221 223568888885 44556676543211 10100 11122 Q ss_pred cEEEEEEEEEc-----------C----------------------CCC----------eEEEEE-eCCeeecccCC-ccc Q lcl|NC_020414. 208 NVKLYTHAQYA-----------G----------------------EGF----------WKINQS-ADDIPVGKENR-IKA 242 (515) Q Consensus 208 ~v~v~~~v~~~-----------~----------------------~~~----------~~~~~e-~~~~~i~~esg-y~~ 242 (515) .|.|+.+.++. + .|+ .+||.. ..|.+++.... |+. T Consensus 216 ~vrv~E~~~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~ 295 (725) T protein:vir:10 216 TIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAG 295 (725) T ss_pred eEEEEEEEEEEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCC Confidence 34433332211 0 010 123322 35556665443 444 Q ss_pred ccCcEEEEeee--ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCCcce-e--- Q lcl|NC_020414. 243 EKLPFIPLTWK--RSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGTGEV-I--- 316 (515) Q Consensus 243 ~~~P~~~~Rw~--~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~-~--- 316 (515) ..|||+++-.. ...|..|+.|.+....+-.+.+|+.....+..+..+.+.++++..+.+-..+.....+.+.. + T Consensus 296 ~~fP~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~~ 375 (725) T protein:vir:10 296 EHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLN 375 (725) T ss_pred CceeEEEEEeeeeccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCceeeecc Confidence 45899875323 35889999999999999999999999999999888888888887654432222211111111 1 Q ss_pred --cCCccc--ccccccCCccch-HHHHHHHHHHHHHHHHHH--HHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHH Q lcl|NC_020414. 317 --TGVEED--IHIVQLGKYADL-TPISAVLEVYTRRIGVIF--MMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFA 389 (515) Q Consensus 317 --~g~~~~--v~~~~~~~~~~l-~~~~~~i~~~~~rI~~af--l~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~ 389 (515) +...+. ..++..-..+.+ +.....++..++.|++.- ...++.+.++ .++.--|..|++.-...|...+..|. T Consensus 376 ~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n-~~SG~ai~~rq~qg~~~l~~~~Dnl~ 454 (725) T protein:vir:10 376 RTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGG-QVAYDTVNQLNMRADLETYVFQDNLA 454 (725) T ss_pred cccccCcccccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCch-hhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111 112211112222 345556666666666553 1223344332 35666788888888888888877665 Q ss_pred H------HHHHHHHHHHHHhc------CCC-----------CCh-h--------hc--ccee---eeehHHHHHHHHHHH Q lcl|NC_020414. 390 M------TMQTPIAMWGLQEA------GDS-----------FTS-E--------LV--DPVI---VTGIEALGRMAELDK 432 (515) Q Consensus 390 ~------E~l~Pli~r~~~~~------~~~-----------~p~-~--------~~--~~~~---v~~l~~l~ra~~~~~ 432 (515) . +++.-||...+... ++. .+. . ++ +-.+ +.+-.+-.|.+.... T Consensus 455 ~~~~~~g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~ 534 (725) T protein:vir:10 455 TAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSE 534 (725) T ss_pred HHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHH Confidence 4 34555554443210 010 000 0 01 1111 223334445555555 Q ss_pred HHHHHHHHHHhhcCChHH----HhcCC---HHHHHHHHHHhcCCch-hccCCHHHHHHHHHHHHHHHHHHHHH----HHh Q lcl|NC_020414. 433 LANFAQYMSLPQTWPEPA----QRAIR---WGDYMDWVRGQISAEL-PFLKSEEEMQQEMAQQAQAQQEAMLN----EGV 500 (515) Q Consensus 433 l~~~~~~v~~~a~~~p~~----~d~id---~d~~~~~~a~~~Gvp~-~~irs~eev~~~rq~~~~~~q~~~~~----~~~ 500 (515) +.++++.+..++.....+ ++..| .+++++.+....+... .=-.++++.+++.++++ +++.++.+ .++ T Consensus 535 l~qll~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq-~~~~q~~~e~~q~~~ 613 (725) T protein:vir:10 535 ILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQ-AKQGQQDPAMVQAQG 613 (725) T ss_pred HHHHHHhccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHH-HHHhhhHHHHHHHHH Confidence 555555443322221112 22222 2344444443332211 01122333222221111 11110000 000 Q ss_pred --h---------hhccchhhhhh----ccC Q lcl|NC_020414. 501 --A---------KAVPGVIQQEM----KEG 515 (515) Q Consensus 501 --~---------~a~~~~~~~~~----~~~ 515 (515) . ++....+..+. .+. T Consensus 614 ~~~~~qae~~ka~aE~~k~~~~a~~~~~~a 643 (725) T protein:vir:10 614 VLLQGQAELAKAQNQTLSLQIDAAKVEAQN 643 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 00000000000 000 No 37 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.18 E-value=7.6e-10 Score=70.58 Aligned_cols=487 Identities=11% Similarity=0.050 Sum_probs=218.1 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCC-CCcc---cccc-ccccHHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKG-DNET---SQNG-WQGVGAQATNHLANKLAQ 75 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~-~~~~---~~~~-~dst~~~a~~~Laa~l~s 75 (515) ||+-++. -.+|..||.........|-....+=.+|..- ...+. .... ..++ |+=++. .++.+.+.-- T Consensus 1 m~d~~~~----~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G--~Qw~~~~~~~l~~q~rp~~N~i~~-~i~~v~g~~~- 72 (725) T protein:vir:77 1 MADNENR----LESILSRFDADWTASDEARREAKNDLFFSRV--SQWDDWLSQYTTLQYRGQFDVVRP-VVRKLVSEMR- 72 (725) T ss_pred CCchHHH----HHHHHHHHHHHHHhhHHHHHHHHHHHHhhCC--CCCCHHHHHHHHhcCCCccccHHH-HHHHHHhhHH- Confidence 9986533 4667777776665555555555555555431 01111 0000 0122 322222 2332222221 Q ss_pred hhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE-----EEeC Q lcl|NC_020414. 76 VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL-----YKPS 150 (515) Q Consensus 76 ~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l-----~~d~ 150 (515) .+++=+++.|.++... ++.+.|.. .+......|+..-+-..+|.+.+..|.|++ |.++ T Consensus 73 ----~nr~d~~v~P~~~~d~----------~~Ae~l~~---~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~ 135 (725) T protein:vir:77 73 ----QNPIDVLYRPKDGARP----------DAADVLMG---MYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQ 135 (725) T ss_pred ----hCCcceEEecCCccHH----------HHHHHHHH---HHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCC Confidence 2556666666553221 23333332 344445688999999999999998888864 2222 Q ss_pred CC-----cEEEEE----cceEEEeeCCC---CC-eeEEEEEEEecHH---HHHHHhcccccchhh---hc----cCCCcc Q lcl|NC_020414. 151 KG-----AMSAVP----MHHYVVNRDTN---GD-LMDVILLQEKALR---TFDPATRMAIEVGMK---GK----KCKEDD 207 (515) Q Consensus 151 ~~-----~~r~~p----l~~y~v~~d~~---G~-vd~i~r~~~~t~~---ql~~~~~~~~~~~~~---~~----~~~~~~ 207 (515) +. .++.+| ..++++..++. +. -.-+||..+++.+ ++.++++.+..+... .. .....+ T Consensus 136 d~~~~~~~i~~~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 215 (725) T protein:vir:77 136 SPTSNNQVIRREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQD 215 (725) T ss_pred CCCCCceeeEEeecccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCC Confidence 21 134444 34455544432 21 1126788888876 455566543222110 00 011123 Q ss_pred cEEEEEEEEEc-----------C----------------------CCC----------eEEEEE-eCCeeecccC-Cccc Q lcl|NC_020414. 208 NVKLYTHAQYA-----------G----------------------EGF----------WKINQS-ADDIPVGKEN-RIKA 242 (515) Q Consensus 208 ~v~v~~~v~~~-----------~----------------------~~~----------~~~~~e-~~~~~i~~es-gy~~ 242 (515) .|.|..+.++. + .|. .++|.. ..|.+++.+. -|+. T Consensus 216 ~vrv~E~~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~ 295 (725) T protein:vir:77 216 TIQIAEFYEVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAG 295 (725) T ss_pred eeEEEEEEEEEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCC Confidence 34333332211 0 011 123332 3666666553 3655 Q ss_pred ccCcEEEEe--eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCCc------c Q lcl|NC_020414. 243 EKLPFIPLT--WKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGTG------E 314 (515) Q Consensus 243 ~~~P~~~~R--w~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g------~ 314 (515) ..|||++.- .....|..|+.|.+....+-.+.+|+..-..+..+..+.+.++++..+-+-..+......++ . T Consensus 296 ~~~P~vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 375 (725) T protein:vir:77 296 EHIPIVPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLN 375 (725) T ss_pred CccceEEEeeeeeccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHhccCCceeccc Confidence 568998643 23578999999999999999999999999999888888888888776543222211111111 0 Q ss_pred eecCCccc--ccccccCCccch-HHHHHHHHHHHHHHHHHH--HHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHH Q lcl|NC_020414. 315 VITGVEED--IHIVQLGKYADL-TPISAVLEVYTRRIGVIF--MMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFA 389 (515) Q Consensus 315 ~~~g~~~~--v~~~~~~~~~~l-~~~~~~i~~~~~rI~~af--l~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~ 389 (515) .+....+. .+++..-...++ +.....++...+.|.+.- ...++....+ .++.--|..|++.-...+...+..|. T Consensus 376 ~~~~~~g~~~~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n-~~SG~ai~~rq~qg~~~~~~~~Dnl~ 454 (725) T protein:vir:77 376 RTDENSGDLPTQPLAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGG-QVAFDTVNQLNMRADLETYVFQDNLA 454 (725) T ss_pred ccccCCCcccccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCch-hhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 01111111 112211112233 233445555555555442 2223344333 25666788888888888887776654 Q ss_pred ------HHHHHHHHHHHHHh------cCCC----C----------C-hh--hcc-----cee---eeehHHHHHHHHHHH Q lcl|NC_020414. 390 ------MTMQTPIAMWGLQE------AGDS----F----------T-SE--LVD-----PVI---VTGIEALGRMAELDK 432 (515) Q Consensus 390 ------~E~l~Pli~r~~~~------~~~~----~----------p-~~--~~~-----~~~---v~~l~~l~ra~~~~~ 432 (515) -+++.-||...+.. .+.. . . +. .++ -.+ +.+-.+-.|.+.... T Consensus 455 ~~~~~~g~~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~ 534 (725) T protein:vir:77 455 TAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAE 534 (725) T ss_pred HHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHH Confidence 33455555443321 0000 0 0 00 001 111 222334446555555 Q ss_pred HHHHHHHHHHhhcCChH----HHhcCCH---HHHHHHHHHhcCCchhcc--CCHHHHHHHHHHHHHHHHHHHHH----HH Q lcl|NC_020414. 433 LANFAQYMSLPQTWPEP----AQRAIRW---GDYMDWVRGQISAELPFL--KSEEEMQQEMAQQAQAQQEAMLN----EG 499 (515) Q Consensus 433 l~~~~~~v~~~a~~~p~----~~d~id~---d~~~~~~a~~~Gvp~~~i--rs~eev~~~rq~~~~~~q~~~~~----~~ 499 (515) +.++++.+..++..... .++..|. +++++.+...... .... .++++-++ +++.++++++++.. ++ T Consensus 535 l~qll~~~~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~-~~~~q~~~~~e~q~-~~~~qq~~~~q~~~e~~q~q 612 (725) T protein:vir:77 535 ILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQ-MGVKKPETPEEQQW-LVEAQQAKQGQQDPAMVQAQ 612 (725) T ss_pred HHHHHHhccccchhHHHHHHHhhccccchHHHHHHHHHHhhhhh-hhccCCCChhhHHH-HHHHHHHHHHhHHHHHHHHH Confidence 55555443322221111 2233343 3333333332221 1111 12222111 11111111111000 00 Q ss_pred h--h---------hhccchh--------hhh----------hccC Q lcl|NC_020414. 500 V--A---------KAVPGVI--------QQE----------MKEG 515 (515) Q Consensus 500 ~--~---------~a~~~~~--------~~~----------~~~~ 515 (515) + . ++....+ +++ +.|. T Consensus 613 ~~~~~~qa~~~kaq~e~~k~q~~a~~~~~~a~~~aa~~~~~~~q~ 657 (725) T protein:vir:77 613 GVLLQGQAELAKAQNQTLSLQIDAAKVEAQNQLNAARIAEIFNNM 657 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 0 0000000 000 0000 No 38 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=99.12 E-value=1.6e-09 Score=68.76 Aligned_cols=484 Identities=11% Similarity=0.042 Sum_probs=224.5 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHh-hcccccCCCCC-Cc------cc-ccc---ccccHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKL-TLPYLMNNKGD-NE------TS-QNG---WQGVGAQATNH 68 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~-~~P~~~~~~~~-~~------~~-~~~---~dst~~~a~~~ 68 (515) ||++..+. .+++..||......-+.|...|++=.+| ..+.. ..+.. .. .. .++ |+-++...-.. T Consensus 1 ma~~~~~~---~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~-Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v 76 (708) T protein:vir:17 1 MAETLEKK---HERIMLRFDRAYSPQQEVREKCIEATRFARVPGG-QWEGATAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) T ss_pred CchhHHHH---HHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCC-CCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHH Confidence 99997543 5666667766655556666666665543 11111 11110 00 00 011 34333332222 Q ss_pred HHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE-- Q lcl|NC_020414. 69 LANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL-- 146 (515) Q Consensus 69 Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l-- 146 (515) +...- .+++=+++.|.++.- + .++.+.| +..+......++...+...+|.+.+..|.|++ T Consensus 77 ~g~e~------~nr~d~~v~p~~~~~--------d-~~~Ae~l---~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~ 138 (708) T protein:vir:17 77 IAEYR------NNRITVKFRPGDREA--------S-EELANKL---NGLFRADYEETDGGEACDNAFDDAATGGFGCFRL 138 (708) T ss_pred HhhHh------hCCcceEEecCCCcc--------h-HHHHHHH---HHHHHHHHHhcCchhHHhHHHHHhhhcccceeee Confidence 22111 255556666653211 1 1223333 33344556688999999999999999988865 Q ss_pred ---EEeCC------Cc--EEE--EEcceEEEeeCC---CCCeeE--EEEEEEecHHHHHHHhcccccchh-hh---ccC- Q lcl|NC_020414. 147 ---YKPSK------GA--MSA--VPMHHYVVNRDT---NGDLMD--VILLQEKALRTFDPATRMAIEVGM-KG---KKC- 203 (515) Q Consensus 147 ---~~d~~------~~--~r~--~pl~~y~v~~d~---~G~vd~--i~r~~~~t~~ql~~~~~~~~~~~~-~~---~~~- 203 (515) |++++ .+ ++. .|..++++.-++ ++ -|. +||...|+.+++...|+....... .. ... T Consensus 139 ~~d~~~e~d~~~~~~~i~i~~~~~~~~~v~~Dp~a~~~D~-sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~ 217 (708) T protein:vir:17 139 TSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDK-SDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEY 217 (708) T ss_pred eecccccCCCCCCccccceEeeccchhheecCccccccCh-hhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccc Confidence 33221 12 222 255677765554 32 233 689999999999999986542211 00 000 Q ss_pred --CCcccEEEEEEE--E---------EcC----------C------------CC----------eEEEEE-eCCeeeccc Q lcl|NC_020414. 204 --KEDDNVKLYTHA--Q---------YAG----------E------------GF----------WKINQS-ADDIPVGKE 237 (515) Q Consensus 204 --~~~~~v~v~~~v--~---------~~~----------~------------~~----------~~~~~e-~~~~~i~~e 237 (515) -..+.|.|.... + .++ + |+ +.||.. ..|..++.+ T Consensus 218 ~~~~~d~vrv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~ 297 (708) T protein:vir:17 218 DWFDADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEK 297 (708) T ss_pred cccCCCeEEEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccC Confidence 001223222211 0 111 0 00 122222 356656644 Q ss_pred -CCcccccCcEEEE---eeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhh------- Q lcl|NC_020414. 238 -NRIKAEKLPFIPL---TWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDH------- 306 (515) Q Consensus 238 -sgy~~~~~P~~~~---Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~------- 306 (515) +-|+...|||++. ||.+ .|...-.|.+..+.+-.+.+|+..-..+..+.+..+-+++++.+.+..... T Consensus 298 ~~~~p~~~fP~vP~~g~r~~~-d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~ 376 (708) T protein:vir:17 298 PRRIPGEHIPLIPVYGKRWFI-DDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNK 376 (708) T ss_pred CCCCCCCccceEEEecccccc-cCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhccc Confidence 3355566888776 4543 555645688999999999999999999998888888888887654321110 Q ss_pred ---------ccCCCCcceecCCcc--cccccccCCccchHHHHHHHHHHHHH--HHHHHHHHhhccCCCCCCCHHHHHHH Q lcl|NC_020414. 307 ---------FVNSGTGEVITGVEE--DIHIVQLGKYADLTPISAVLEVYTRR--IGVIFMMETMTRRDAERVTAVEIQRD 373 (515) Q Consensus 307 ---------~~~~~~g~~~~g~~~--~v~~~~~~~~~~l~~~~~~i~~~~~r--I~~afl~~~l~~~~~~~~TAtEi~~r 373 (515) -.....|.+++|..- -+.+.++. .+.++.++...+++++. |+.+ ++.+.+ .++..-|..| T Consensus 377 ~~~~~~~~~~~~~~~g~v~~~a~~~~~~~~~~~~-~~~~~llq~~~~~i~~~tGi~d~----~~G~~s--n~SG~Ai~~r 449 (708) T protein:vir:17 377 KRPAFLPLREVRDKYGNIIAGATPAGYTQPAVMN-QALAALLQQTSADIQEVTGGSQA----MQQMPS--NIAQETVNNL 449 (708) T ss_pred chhhhhhhhccCCcccccccccCCcccCCCcccc-HHHHHHHHHHHHHHHHhcCCChH----HccCcc--chHHHHHHHH Confidence 001223434333221 12222221 12233333333333332 2333 233322 3566678888 Q ss_pred HHHHHHHhhhhHHHHH------HHHHHHHHHHHHHhc------CC---------------CCChh-----hccc---eee Q lcl|NC_020414. 374 ALEIEQNMGGVYSLFA------MTMQTPIAMWGLQEA------GD---------------SFTSE-----LVDP---VIV 418 (515) Q Consensus 374 ~~E~~~~LGpv~~rl~------~E~l~Pli~r~~~~~------~~---------------~~p~~-----~~~~---~~v 418 (515) ++.-...+...+..+. -+++.-||...+... +. ..++. ++.. .++ T Consensus 450 q~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~ 529 (708) T protein:vir:17 450 MNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVT 529 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEE Confidence 8888888888777655 555666666554210 00 01111 1111 111 Q ss_pred ---eehHHHHHHHHHHHHHHHHHHHHHhhcCC----hHHHhc---CCHHHHHHHHHHhcCCchhc-cCCHHHHHHHHHHH Q lcl|NC_020414. 419 ---TGIEALGRMAELDKLANFAQYMSLPQTWP----EPAQRA---IRWGDYMDWVRGQISAELPF-LKSEEEMQQEMAQQ 487 (515) Q Consensus 419 ---~~l~~l~ra~~~~~l~~~~~~v~~~a~~~----p~~~d~---id~d~~~~~~a~~~Gvp~~~-irs~eev~~~rq~~ 487 (515) .+-.+-.|.+..+.+.++++.+....+.- +-+++. -+.++++..+...+..+... -..+++.++..++. T Consensus 530 v~~~p~~~t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~q 609 (708) T protein:vir:17 530 VDVGPSYTARRDATVSVLTNVLSSMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQ 609 (708) T ss_pred EecccCchhHHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhccccccCcchhhHHHHHHHH Confidence 22334556665666655554433211111 113333 34466777776666543211 11222222211111 Q ss_pred HHHHHHH-HHHH--Hhh---------hhc-------cchh-------------hhhhccC Q lcl|NC_020414. 488 AQAQQEA-MLNE--GVA---------KAV-------PGVI-------------QQEMKEG 515 (515) Q Consensus 488 ~~~~q~~-~~~~--~~~---------~a~-------~~~~-------------~~~~~~~ 515 (515) +.+++++ ++++ ++. ++. ..++ .+..++. T Consensus 610 q~~q~q~~~~~~eaqa~~~~~qAe~~ka~aea~~~q~~a~q~~~~~~~a~~~a~q~~~q~ 669 (708) T protein:vir:17 610 MAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQA 669 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1100000 0000 000 000 0000 0000000 No 39 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.09 E-value=2.3e-09 Score=67.94 Aligned_cols=488 Identities=10% Similarity=0.020 Sum_probs=226.8 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHH----HHHHhhcccccCCCC-CCc-----ccccc-ccccHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAK----HFAKLTLPYLMNNKG-DNE-----TSQNG-WQGVGAQATNHL 69 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~----e~~~~~~P~~~~~~~-~~~-----~~~~~-~dst~~~a~~~L 69 (515) |++.... .++++...+.+..+..+... .+.|+ +-.+|..- ...+. ... +.... |+-++.. ++.. T Consensus 8 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~R~~a~~d~~fy~G--~Qw~~~~~~~l~~~g~p~~~~N~i~~~-v~~v 82 (714) T protein:vir:27 8 MATKNDN-GATPRFSQRQLQALCSDIDS-QPKWRDAANKACAYYDG--DQLPPEVLQVLKDRGQPMTIHNLIAPT-VDGV 82 (714) T ss_pred ccCCCCc-chhHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhhcC--CCCCHHHHHHHHhcCCCcEEeccHHHH-HHHH Confidence 5554433 34555555566655555433 33455 44555421 01111 000 01111 3333332 2222 Q ss_pred HHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE--EE Q lcl|NC_020414. 70 ANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCL--LY 147 (515) Q Consensus 70 aa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~--l~ 147 (515) .+. -- .+++=+++.|.+...+ -.++.+.| +..+......+++..+...+|.+.+..|-|. +| T Consensus 83 ~g~----~~-~nr~~~~v~p~~~~~~--------~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~ 146 (714) T protein:vir:27 83 LGM----EA-KTRTDLVVMSDEPDDE--------TEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVR 146 (714) T ss_pred HhH----HH-hCCcceEEecCCCCch--------hHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEec Confidence 222 21 2555566666432111 01223333 3345556667899999999999998877765 45 Q ss_pred EeCCC-----cEEEEEcceEEEeeCCCC----CeeEEEEEEEecHHHHHHHhccccc--chhhh----------c----- Q lcl|NC_020414. 148 KPSKG-----AMSAVPMHHYVVNRDTNG----DLMDVILLQEKALRTFDPATRMAIE--VGMKG----------K----- 201 (515) Q Consensus 148 ~d~~~-----~~r~~pl~~y~v~~d~~G----~vd~i~r~~~~t~~ql~~~~~~~~~--~~~~~----------~----- 201 (515) .+.+. .++.+|..++++..++.. .-.-+|++.++|.+++...|++.+. ..... . T Consensus 147 ~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~ 226 (714) T protein:vir:27 147 RNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPS 226 (714) T ss_pred cccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccccc Confidence 54331 255677788888765432 1224789999999999999986321 00000 0 Q ss_pred -------------------cCCCcccEEEEEEEEEcC---------CCC---------------------------eEEE Q lcl|NC_020414. 202 -------------------KCKEDDNVKLYTHAQYAG---------EGF---------------------------WKIN 226 (515) Q Consensus 202 -------------------~~~~~~~v~v~~~v~~~~---------~~~---------------------------~~~~ 226 (515) ......+|.|+.+.+... +|. ..++ T Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~ 306 (714) T protein:vir:27 227 PLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIR 306 (714) T ss_pred ccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEE Confidence 000123466655533211 000 1111 Q ss_pred -EEeCCeeecc--cCCcccccCcEEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccc Q lcl|NC_020414. 227 -QSADDIPVGK--ENRIKAEKLPFIPLTWKR--SYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQ 301 (515) Q Consensus 227 -~e~~~~~i~~--esgy~~~~~P~~~~Rw~~--~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~ 301 (515) ..+.|.+++. .+-|+...|||++.-... ..|..| |.+..+.+-.+.+|+..-..+- .+..+-++ +.++++ T Consensus 307 ~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~--~l~~~~~~-~~~~a~ 381 (714) T protein:vir:27 307 EAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTW--LLQAKRVI-MDEDAT 381 (714) T ss_pred EEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHH--hhcCCcee-eecCcc Confidence 1134445554 356776679998764443 456666 5788888888999975444433 24556555 445555 Q ss_pred cChh-hcc--CCCCcceecCC---ccc---ccccccCCcc-chHHHHHHHHHHHHHHHHHH--HHHhhccCCCCCCCHHH Q lcl|NC_020414. 302 TDVD-HFV--NSGTGEVITGV---EED---IHIVQLGKYA-DLTPISAVLEVYTRRIGVIF--MMETMTRRDAERVTAVE 369 (515) Q Consensus 302 ~~~~-~~~--~~~~g~~~~g~---~~~---v~~~~~~~~~-~l~~~~~~i~~~~~rI~~af--l~~~l~~~~~~~~TAtE 369 (515) ...+ .+. .+.+|.++.-+ .+. ..+++..... -.+.....++...+.|++.- .-.++.+.. ...+..- T Consensus 382 ~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~-na~SGvA 460 (714) T protein:vir:27 382 QLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDS-GATSGVA 460 (714) T ss_pred cccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCc-cchhHHH Confidence 3322 221 23444443221 111 1122222222 22333444444445554431 111233332 2356667 Q ss_pred HHHHHHHHHHHhhhhHHHHHHH------HHHHHHHHHHH--------hcCCC----------CCh------hhc---cce Q lcl|NC_020414. 370 IQRDALEIEQNMGGVYSLFAMT------MQTPIAMWGLQ--------EAGDS----------FTS------ELV---DPV 416 (515) Q Consensus 370 i~~r~~E~~~~LGpv~~rl~~E------~l~Pli~r~~~--------~~~~~----------~p~------~~~---~~~ 416 (515) |..|++.-...|...+.+|..- ++.-||...+. +.... ++. -++ +.. T Consensus 461 i~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~D 540 (714) T protein:vir:27 461 ISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTH 540 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEE Confidence 9999999999998888666543 33334433321 11010 000 001 111 Q ss_pred e---eeehHHHHHHHHHHHHHHHHHHHHHh-hc-CChHHH---hcCCHHHHHHHHHHhcCCch--hccCCHHHHHHHHHH Q lcl|NC_020414. 417 I---VTGIEALGRMAELDKLANFAQYMSLP-QT-WPEPAQ---RAIRWGDYMDWVRGQISAEL--PFLKSEEEMQQEMAQ 486 (515) Q Consensus 417 ~---v~~l~~l~ra~~~~~l~~~~~~v~~~-a~-~~p~~~---d~id~d~~~~~~a~~~Gvp~--~~irs~eev~~~rq~ 486 (515) + +.+-++-.|.+..+.+.++++.+... .. ..+-++ |.-+.+++++.+-..+|.+. .....+++.++..++ T Consensus 541 v~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q 620 (714) T protein:vir:27 541 IALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQ 620 (714) T ss_pred EEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHH Confidence 1 23345666777777777777654322 11 222233 44456789999999888753 333333332222221 Q ss_pred HHHHHHHH-HHH-------------HHh-hhh-ccchhhhh---hccC Q lcl|NC_020414. 487 QAQAQQEA-MLN-------------EGV-AKA-VPGVIQQE---MKEG 515 (515) Q Consensus 487 ~~~~~q~~-~~~-------------~~~-~~a-~~~~~~~~---~~~~ 515 (515) ..+++|.+ ++. +.+ +++ ....-++. .++| T Consensus 621 ~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~ 668 (714) T protein:vir:27 621 ALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQG 668 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111110 000 000 000 00000000 0000 No 40 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.09 E-value=2.3e-09 Score=67.94 Aligned_cols=488 Identities=10% Similarity=0.020 Sum_probs=226.8 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHH----HHHHhhcccccCCCC-CCc-----ccccc-ccccHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAK----HFAKLTLPYLMNNKG-DNE-----TSQNG-WQGVGAQATNHL 69 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~----e~~~~~~P~~~~~~~-~~~-----~~~~~-~dst~~~a~~~L 69 (515) |++.... .++++...+.+..+..+... .+.|+ +-.+|..- ...+. ... +.... |+-++.. ++.. T Consensus 8 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~R~~a~~d~~fy~G--~Qw~~~~~~~l~~~g~p~~~~N~i~~~-v~~v 82 (714) T protein:vir:81 8 MATKNDN-GATPRFSQRQLQALCSDIDS-QPKWRDAANKACAYYDG--DQLPPEVLQVLKDRGQPMTIHNLIAPT-VDGV 82 (714) T ss_pred ccCCCCc-chhHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhhcC--CCCCHHHHHHHHhcCCCcEEeccHHHH-HHHH Confidence 5554433 34555555566655555433 33455 44555421 01111 000 01111 3333332 2222 Q ss_pred HHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE--EE Q lcl|NC_020414. 70 ANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCL--LY 147 (515) Q Consensus 70 aa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~--l~ 147 (515) .+. -- .+++=+++.|.+...+ -.++.+.| +..+......+++..+...+|.+.+..|-|. +| T Consensus 83 ~g~----~~-~nr~~~~v~p~~~~~~--------~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~ 146 (714) T protein:vir:81 83 LGM----EA-KTRTDLVVMSDEPDDE--------TEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVR 146 (714) T ss_pred HhH----HH-hCCcceEEecCCCCch--------hHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEec Confidence 222 21 2555566666432111 01223333 3345556667899999999999998877765 45 Q ss_pred EeCCC-----cEEEEEcceEEEeeCCCC----CeeEEEEEEEecHHHHHHHhccccc--chhhh----------c----- Q lcl|NC_020414. 148 KPSKG-----AMSAVPMHHYVVNRDTNG----DLMDVILLQEKALRTFDPATRMAIE--VGMKG----------K----- 201 (515) Q Consensus 148 ~d~~~-----~~r~~pl~~y~v~~d~~G----~vd~i~r~~~~t~~ql~~~~~~~~~--~~~~~----------~----- 201 (515) .+.+. .++.+|..++++..++.. .-.-+|++.++|.+++...|++.+. ..... . T Consensus 147 ~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~ 226 (714) T protein:vir:81 147 RNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPS 226 (714) T ss_pred cccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccccc Confidence 54331 255677788888765432 1224789999999999999986321 00000 0 Q ss_pred -------------------cCCCcccEEEEEEEEEcC---------CCC---------------------------eEEE Q lcl|NC_020414. 202 -------------------KCKEDDNVKLYTHAQYAG---------EGF---------------------------WKIN 226 (515) Q Consensus 202 -------------------~~~~~~~v~v~~~v~~~~---------~~~---------------------------~~~~ 226 (515) ......+|.|+.+.+... +|. ..++ T Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~ 306 (714) T protein:vir:81 227 PLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIR 306 (714) T ss_pred ccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEE Confidence 000123466655533211 000 1111 Q ss_pred -EEeCCeeecc--cCCcccccCcEEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccc Q lcl|NC_020414. 227 -QSADDIPVGK--ENRIKAEKLPFIPLTWKR--SYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQ 301 (515) Q Consensus 227 -~e~~~~~i~~--esgy~~~~~P~~~~Rw~~--~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~ 301 (515) ..+.|.+++. .+-|+...|||++.-... ..|..| |.+..+.+-.+.+|+..-..+- .+..+-++ +.++++ T Consensus 307 ~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~--~l~~~~~~-~~~~a~ 381 (714) T protein:vir:81 307 EAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTW--LLQAKRVI-MDEDAT 381 (714) T ss_pred EEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHH--hhcCCcee-eecCcc Confidence 1134445554 356776679998764443 456666 5788888888999975444433 24556555 445555 Q ss_pred cChh-hcc--CCCCcceecCC---ccc---ccccccCCcc-chHHHHHHHHHHHHHHHHHH--HHHhhccCCCCCCCHHH Q lcl|NC_020414. 302 TDVD-HFV--NSGTGEVITGV---EED---IHIVQLGKYA-DLTPISAVLEVYTRRIGVIF--MMETMTRRDAERVTAVE 369 (515) Q Consensus 302 ~~~~-~~~--~~~~g~~~~g~---~~~---v~~~~~~~~~-~l~~~~~~i~~~~~rI~~af--l~~~l~~~~~~~~TAtE 369 (515) ...+ .+. .+.+|.++.-+ .+. ..+++..... -.+.....++...+.|++.- .-.++.+.. ...+..- T Consensus 382 ~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~-na~SGvA 460 (714) T protein:vir:81 382 QLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDS-GATSGVA 460 (714) T ss_pred cccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCc-cchhHHH Confidence 3322 221 23444443221 111 1122222222 22333444444445554431 111233332 2356667 Q ss_pred HHHHHHHHHHHhhhhHHHHHHH------HHHHHHHHHHH--------hcCCC----------CCh------hhc---cce Q lcl|NC_020414. 370 IQRDALEIEQNMGGVYSLFAMT------MQTPIAMWGLQ--------EAGDS----------FTS------ELV---DPV 416 (515) Q Consensus 370 i~~r~~E~~~~LGpv~~rl~~E------~l~Pli~r~~~--------~~~~~----------~p~------~~~---~~~ 416 (515) |..|++.-...|...+.+|..- ++.-||...+. +.... ++. -++ +.. T Consensus 461 i~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~D 540 (714) T protein:vir:81 461 ISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTH 540 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEE Confidence 9999999999998888666543 33334433321 11010 000 001 111 Q ss_pred e---eeehHHHHHHHHHHHHHHHHHHHHHh-hc-CChHHH---hcCCHHHHHHHHHHhcCCch--hccCCHHHHHHHHHH Q lcl|NC_020414. 417 I---VTGIEALGRMAELDKLANFAQYMSLP-QT-WPEPAQ---RAIRWGDYMDWVRGQISAEL--PFLKSEEEMQQEMAQ 486 (515) Q Consensus 417 ~---v~~l~~l~ra~~~~~l~~~~~~v~~~-a~-~~p~~~---d~id~d~~~~~~a~~~Gvp~--~~irs~eev~~~rq~ 486 (515) + +.+-++-.|.+..+.+.++++.+... .. ..+-++ |.-+.+++++.+-..+|.+. .....+++.++..++ T Consensus 541 v~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q 620 (714) T protein:vir:81 541 IALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQ 620 (714) T ss_pred EEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHH Confidence 1 23345666777777777777654322 11 222233 44456789999999888753 333333332222221 Q ss_pred HHHHHHHH-HHH-------------HHh-hhh-ccchhhhh---hccC Q lcl|NC_020414. 487 QAQAQQEA-MLN-------------EGV-AKA-VPGVIQQE---MKEG 515 (515) Q Consensus 487 ~~~~~q~~-~~~-------------~~~-~~a-~~~~~~~~---~~~~ 515 (515) ..+++|.+ ++. +.+ +++ ....-++. .++| T Consensus 621 ~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~ 668 (714) T protein:vir:81 621 ALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQG 668 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111110 000 000 000 00000000 0000 No 41 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.09 E-value=2.3e-09 Score=67.94 Aligned_cols=488 Identities=10% Similarity=0.020 Sum_probs=226.8 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHH----HHHHhhcccccCCCC-CCc-----ccccc-ccccHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAK----HFAKLTLPYLMNNKG-DNE-----TSQNG-WQGVGAQATNHL 69 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~----e~~~~~~P~~~~~~~-~~~-----~~~~~-~dst~~~a~~~L 69 (515) |++.... .++++...+.+..+..+... .+.|+ +-.+|..- ...+. ... +.... |+-++.. ++.. T Consensus 8 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~R~~a~~d~~fy~G--~Qw~~~~~~~l~~~g~p~~~~N~i~~~-v~~v 82 (714) T protein:vir:10 8 MATKNDN-GATPRFSQRQLQALCSDIDS-QPKWRDAANKACAYYDG--DQLPPEVLQVLKDRGQPMTIHNLIAPT-VDGV 82 (714) T ss_pred ccCCCCc-chhHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhhcC--CCCCHHHHHHHHhcCCCcEEeccHHHH-HHHH Confidence 5554433 34555555566655555433 33455 44555421 01111 000 01111 3333332 2222 Q ss_pred HHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE--EE Q lcl|NC_020414. 70 ANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCL--LY 147 (515) Q Consensus 70 aa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~--l~ 147 (515) .+. -- .+++=+++.|.+...+ -.++.+.| +..+......+++..+...+|.+.+..|-|. +| T Consensus 83 ~g~----~~-~nr~~~~v~p~~~~~~--------~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~ 146 (714) T protein:vir:10 83 LGM----EA-KTRTDLVVMSDEPDDE--------TEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVR 146 (714) T ss_pred HhH----HH-hCCcceEEecCCCCch--------hHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEec Confidence 222 21 2555566666432111 01223333 3345556667899999999999998877765 45 Q ss_pred EeCCC-----cEEEEEcceEEEeeCCCC----CeeEEEEEEEecHHHHHHHhccccc--chhhh----------c----- Q lcl|NC_020414. 148 KPSKG-----AMSAVPMHHYVVNRDTNG----DLMDVILLQEKALRTFDPATRMAIE--VGMKG----------K----- 201 (515) Q Consensus 148 ~d~~~-----~~r~~pl~~y~v~~d~~G----~vd~i~r~~~~t~~ql~~~~~~~~~--~~~~~----------~----- 201 (515) .+.+. .++.+|..++++..++.. .-.-+|++.++|.+++...|++.+. ..... . T Consensus 147 ~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~ 226 (714) T protein:vir:10 147 RNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPS 226 (714) T ss_pred cccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccccc Confidence 54331 255677788888765432 1224789999999999999986321 00000 0 Q ss_pred -------------------cCCCcccEEEEEEEEEcC---------CCC---------------------------eEEE Q lcl|NC_020414. 202 -------------------KCKEDDNVKLYTHAQYAG---------EGF---------------------------WKIN 226 (515) Q Consensus 202 -------------------~~~~~~~v~v~~~v~~~~---------~~~---------------------------~~~~ 226 (515) ......+|.|+.+.+... +|. ..++ T Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~ 306 (714) T protein:vir:10 227 PLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIR 306 (714) T ss_pred ccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEE Confidence 000123466655533211 000 1111 Q ss_pred -EEeCCeeecc--cCCcccccCcEEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccc Q lcl|NC_020414. 227 -QSADDIPVGK--ENRIKAEKLPFIPLTWKR--SYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQ 301 (515) Q Consensus 227 -~e~~~~~i~~--esgy~~~~~P~~~~Rw~~--~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~ 301 (515) ..+.|.+++. .+-|+...|||++.-... ..|..| |.+..+.+-.+.+|+..-..+- .+..+-++ +.++++ T Consensus 307 ~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~--~l~~~~~~-~~~~a~ 381 (714) T protein:vir:10 307 EAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTW--LLQAKRVI-MDEDAT 381 (714) T ss_pred EEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHH--hhcCCcee-eecCcc Confidence 1134445554 356776679998764443 456666 5788888888999975444433 24556555 445555 Q ss_pred cChh-hcc--CCCCcceecCC---ccc---ccccccCCcc-chHHHHHHHHHHHHHHHHHH--HHHhhccCCCCCCCHHH Q lcl|NC_020414. 302 TDVD-HFV--NSGTGEVITGV---EED---IHIVQLGKYA-DLTPISAVLEVYTRRIGVIF--MMETMTRRDAERVTAVE 369 (515) Q Consensus 302 ~~~~-~~~--~~~~g~~~~g~---~~~---v~~~~~~~~~-~l~~~~~~i~~~~~rI~~af--l~~~l~~~~~~~~TAtE 369 (515) ...+ .+. .+.+|.++.-+ .+. ..+++..... -.+.....++...+.|++.- .-.++.+.. ...+..- T Consensus 382 ~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~-na~SGvA 460 (714) T protein:vir:10 382 QLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDS-GATSGVA 460 (714) T ss_pred cccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCc-cchhHHH Confidence 3322 221 23444443221 111 1122222222 22333444444445554431 111233332 2356667 Q ss_pred HHHHHHHHHHHhhhhHHHHHHH------HHHHHHHHHHH--------hcCCC----------CCh------hhc---cce Q lcl|NC_020414. 370 IQRDALEIEQNMGGVYSLFAMT------MQTPIAMWGLQ--------EAGDS----------FTS------ELV---DPV 416 (515) Q Consensus 370 i~~r~~E~~~~LGpv~~rl~~E------~l~Pli~r~~~--------~~~~~----------~p~------~~~---~~~ 416 (515) |..|++.-...|...+.+|..- ++.-||...+. +.... ++. -++ +.. T Consensus 461 i~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~D 540 (714) T protein:vir:10 461 ISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTH 540 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEE Confidence 9999999999998888666543 33334433321 11010 000 001 111 Q ss_pred e---eeehHHHHHHHHHHHHHHHHHHHHHh-hc-CChHHH---hcCCHHHHHHHHHHhcCCch--hccCCHHHHHHHHHH Q lcl|NC_020414. 417 I---VTGIEALGRMAELDKLANFAQYMSLP-QT-WPEPAQ---RAIRWGDYMDWVRGQISAEL--PFLKSEEEMQQEMAQ 486 (515) Q Consensus 417 ~---v~~l~~l~ra~~~~~l~~~~~~v~~~-a~-~~p~~~---d~id~d~~~~~~a~~~Gvp~--~~irs~eev~~~rq~ 486 (515) + +.+-++-.|.+..+.+.++++.+... .. ..+-++ |.-+.+++++.+-..+|.+. .....+++.++..++ T Consensus 541 v~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q 620 (714) T protein:vir:10 541 IALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQ 620 (714) T ss_pred EEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHH Confidence 1 23345666777777777777654322 11 222233 44456789999999888753 333333332222221 Q ss_pred HHHHHHHH-HHH-------------HHh-hhh-ccchhhhh---hccC Q lcl|NC_020414. 487 QAQAQQEA-MLN-------------EGV-AKA-VPGVIQQE---MKEG 515 (515) Q Consensus 487 ~~~~~q~~-~~~-------------~~~-~~a-~~~~~~~~---~~~~ 515 (515) ..+++|.+ ++. +.+ +++ ....-++. .++| T Consensus 621 ~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~ 668 (714) T protein:vir:10 621 ALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQG 668 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111110 000 000 000 00000000 0000 No 42 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.09 E-value=2.3e-09 Score=67.94 Aligned_cols=488 Identities=10% Similarity=0.020 Sum_probs=226.8 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHH----HHHHhhcccccCCCC-CCc-----ccccc-ccccHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAK----HFAKLTLPYLMNNKG-DNE-----TSQNG-WQGVGAQATNHL 69 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~----e~~~~~~P~~~~~~~-~~~-----~~~~~-~dst~~~a~~~L 69 (515) |++.... .++++...+.+..+..+... .+.|+ +-.+|..- ...+. ... +.... |+-++.. ++.. T Consensus 8 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~R~~a~~d~~fy~G--~Qw~~~~~~~l~~~g~p~~~~N~i~~~-v~~v 82 (714) T protein:vir:32 8 MATKNDN-GATPRFSQRQLQALCSDIDS-QPKWRDAANKACAYYDG--DQLPPEVLQVLKDRGQPMTIHNLIAPT-VDGV 82 (714) T ss_pred ccCCCCc-chhHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhhcC--CCCCHHHHHHHHhcCCCcEEeccHHHH-HHHH Confidence 5554433 34555555566655555433 33455 44555421 01111 000 01111 3333332 2222 Q ss_pred HHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE--EE Q lcl|NC_020414. 70 ANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCL--LY 147 (515) Q Consensus 70 aa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~--l~ 147 (515) .+. -- .+++=+++.|.+...+ -.++.+.| +..+......+++..+...+|.+.+..|-|. +| T Consensus 83 ~g~----~~-~nr~~~~v~p~~~~~~--------~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~ 146 (714) T protein:vir:32 83 LGM----EA-KTRTDLVVMSDEPDDE--------TEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVR 146 (714) T ss_pred HhH----HH-hCCcceEEecCCCCch--------hHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEec Confidence 222 21 2555566666432111 01223333 3345556667899999999999998877765 45 Q ss_pred EeCCC-----cEEEEEcceEEEeeCCCC----CeeEEEEEEEecHHHHHHHhccccc--chhhh----------c----- Q lcl|NC_020414. 148 KPSKG-----AMSAVPMHHYVVNRDTNG----DLMDVILLQEKALRTFDPATRMAIE--VGMKG----------K----- 201 (515) Q Consensus 148 ~d~~~-----~~r~~pl~~y~v~~d~~G----~vd~i~r~~~~t~~ql~~~~~~~~~--~~~~~----------~----- 201 (515) .+.+. .++.+|..++++..++.. .-.-+|++.++|.+++...|++.+. ..... . T Consensus 147 ~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~ 226 (714) T protein:vir:32 147 RNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPS 226 (714) T ss_pred cccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccccc Confidence 54331 255677788888765432 1224789999999999999986321 00000 0 Q ss_pred -------------------cCCCcccEEEEEEEEEcC---------CCC---------------------------eEEE Q lcl|NC_020414. 202 -------------------KCKEDDNVKLYTHAQYAG---------EGF---------------------------WKIN 226 (515) Q Consensus 202 -------------------~~~~~~~v~v~~~v~~~~---------~~~---------------------------~~~~ 226 (515) ......+|.|+.+.+... +|. ..++ T Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~ 306 (714) T protein:vir:32 227 PLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIR 306 (714) T ss_pred ccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEE Confidence 000123466655533211 000 1111 Q ss_pred -EEeCCeeecc--cCCcccccCcEEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccc Q lcl|NC_020414. 227 -QSADDIPVGK--ENRIKAEKLPFIPLTWKR--SYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQ 301 (515) Q Consensus 227 -~e~~~~~i~~--esgy~~~~~P~~~~Rw~~--~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~ 301 (515) ..+.|.+++. .+-|+...|||++.-... ..|..| |.+..+.+-.+.+|+..-..+- .+..+-++ +.++++ T Consensus 307 ~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~--~l~~~~~~-~~~~a~ 381 (714) T protein:vir:32 307 EAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTW--LLQAKRVI-MDEDAT 381 (714) T ss_pred EEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHH--hhcCCcee-eecCcc Confidence 1134445554 356776679998764443 456666 5788888888999975444433 24556555 445555 Q ss_pred cChh-hcc--CCCCcceecCC---ccc---ccccccCCcc-chHHHHHHHHHHHHHHHHHH--HHHhhccCCCCCCCHHH Q lcl|NC_020414. 302 TDVD-HFV--NSGTGEVITGV---EED---IHIVQLGKYA-DLTPISAVLEVYTRRIGVIF--MMETMTRRDAERVTAVE 369 (515) Q Consensus 302 ~~~~-~~~--~~~~g~~~~g~---~~~---v~~~~~~~~~-~l~~~~~~i~~~~~rI~~af--l~~~l~~~~~~~~TAtE 369 (515) ...+ .+. .+.+|.++.-+ .+. ..+++..... -.+.....++...+.|++.- .-.++.+.. ...+..- T Consensus 382 ~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~-na~SGvA 460 (714) T protein:vir:32 382 QLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDS-GATSGVA 460 (714) T ss_pred cccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCc-cchhHHH Confidence 3322 221 23444443221 111 1122222222 22333444444445554431 111233332 2356667 Q ss_pred HHHHHHHHHHHhhhhHHHHHHH------HHHHHHHHHHH--------hcCCC----------CCh------hhc---cce Q lcl|NC_020414. 370 IQRDALEIEQNMGGVYSLFAMT------MQTPIAMWGLQ--------EAGDS----------FTS------ELV---DPV 416 (515) Q Consensus 370 i~~r~~E~~~~LGpv~~rl~~E------~l~Pli~r~~~--------~~~~~----------~p~------~~~---~~~ 416 (515) |..|++.-...|...+.+|..- ++.-||...+. +.... ++. -++ +.. T Consensus 461 i~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~D 540 (714) T protein:vir:32 461 ISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTH 540 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEE Confidence 9999999999998888666543 33334433321 11010 000 001 111 Q ss_pred e---eeehHHHHHHHHHHHHHHHHHHHHHh-hc-CChHHH---hcCCHHHHHHHHHHhcCCch--hccCCHHHHHHHHHH Q lcl|NC_020414. 417 I---VTGIEALGRMAELDKLANFAQYMSLP-QT-WPEPAQ---RAIRWGDYMDWVRGQISAEL--PFLKSEEEMQQEMAQ 486 (515) Q Consensus 417 ~---v~~l~~l~ra~~~~~l~~~~~~v~~~-a~-~~p~~~---d~id~d~~~~~~a~~~Gvp~--~~irs~eev~~~rq~ 486 (515) + +.+-++-.|.+..+.+.++++.+... .. ..+-++ |.-+.+++++.+-..+|.+. .....+++.++..++ T Consensus 541 v~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q 620 (714) T protein:vir:32 541 IALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQ 620 (714) T ss_pred EEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHH Confidence 1 23345666777777777777654322 11 222233 44456789999999888753 333333332222221 Q ss_pred HHHHHHHH-HHH-------------HHh-hhh-ccchhhhh---hccC Q lcl|NC_020414. 487 QAQAQQEA-MLN-------------EGV-AKA-VPGVIQQE---MKEG 515 (515) Q Consensus 487 ~~~~~q~~-~~~-------------~~~-~~a-~~~~~~~~---~~~~ 515 (515) ..+++|.+ ++. +.+ +++ ....-++. .++| T Consensus 621 ~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~ 668 (714) T protein:vir:32 621 ALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQG 668 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111110 000 000 000 00000000 0000 No 43 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.09 E-value=2.3e-09 Score=67.94 Aligned_cols=488 Identities=10% Similarity=0.020 Sum_probs=226.8 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHH----HHHHhhcccccCCCC-CCc-----ccccc-ccccHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAK----HFAKLTLPYLMNNKG-DNE-----TSQNG-WQGVGAQATNHL 69 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~----e~~~~~~P~~~~~~~-~~~-----~~~~~-~dst~~~a~~~L 69 (515) |++.... .++++...+.+..+..+... .+.|+ +-.+|..- ...+. ... +.... |+-++.. ++.. T Consensus 8 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~R~~a~~d~~fy~G--~Qw~~~~~~~l~~~g~p~~~~N~i~~~-v~~v 82 (714) T protein:vir:99 8 MATKNDN-GATPRFSQRQLQALCSDIDS-QPKWRDAANKACAYYDG--DQLPPEVLQVLKDRGQPMTIHNLIAPT-VDGV 82 (714) T ss_pred ccCCCCc-chhHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhhcC--CCCCHHHHHHHHhcCCCcEEeccHHHH-HHHH Confidence 5554433 34555555566655555433 33455 44555421 01111 000 01111 3333332 2222 Q ss_pred HHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE--EE Q lcl|NC_020414. 70 ANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCL--LY 147 (515) Q Consensus 70 aa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~--l~ 147 (515) .+. -- .+++=+++.|.+...+ -.++.+.| +..+......+++..+...+|.+.+..|-|. +| T Consensus 83 ~g~----~~-~nr~~~~v~p~~~~~~--------~~~~Ae~l---~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~ 146 (714) T protein:vir:99 83 LGM----EA-KTRTDLVVMSDEPDDE--------TEKLAEAI---NAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVR 146 (714) T ss_pred HhH----HH-hCCcceEEecCCCCch--------hHHHHHHH---HHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEec Confidence 222 21 2555566666432111 01223333 3345556667899999999999998877765 45 Q ss_pred EeCCC-----cEEEEEcceEEEeeCCCC----CeeEEEEEEEecHHHHHHHhccccc--chhhh----------c----- Q lcl|NC_020414. 148 KPSKG-----AMSAVPMHHYVVNRDTNG----DLMDVILLQEKALRTFDPATRMAIE--VGMKG----------K----- 201 (515) Q Consensus 148 ~d~~~-----~~r~~pl~~y~v~~d~~G----~vd~i~r~~~~t~~ql~~~~~~~~~--~~~~~----------~----- 201 (515) .+.+. .++.+|..++++..++.. .-.-+|++.++|.+++...|++.+. ..... . T Consensus 147 ~~~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~ 226 (714) T protein:vir:99 147 RNSDPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPS 226 (714) T ss_pred cccCCCCCCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccccc Confidence 54331 255677788888765432 1224789999999999999986321 00000 0 Q ss_pred -------------------cCCCcccEEEEEEEEEcC---------CCC---------------------------eEEE Q lcl|NC_020414. 202 -------------------KCKEDDNVKLYTHAQYAG---------EGF---------------------------WKIN 226 (515) Q Consensus 202 -------------------~~~~~~~v~v~~~v~~~~---------~~~---------------------------~~~~ 226 (515) ......+|.|+.+.+... +|. ..++ T Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~ 306 (714) T protein:vir:99 227 PLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIR 306 (714) T ss_pred ccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEE Confidence 000123466655533211 000 1111 Q ss_pred -EEeCCeeecc--cCCcccccCcEEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccc Q lcl|NC_020414. 227 -QSADDIPVGK--ENRIKAEKLPFIPLTWKR--SYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQ 301 (515) Q Consensus 227 -~e~~~~~i~~--esgy~~~~~P~~~~Rw~~--~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~ 301 (515) ..+.|.+++. .+-|+...|||++.-... ..|..| |.+..+.+-.+.+|+..-..+- .+..+-++ +.++++ T Consensus 307 ~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~--~l~~~~~~-~~~~a~ 381 (714) T protein:vir:99 307 EAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTW--LLQAKRVI-MDEDAT 381 (714) T ss_pred EEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCcee--ehhhhchhHHHHHHHHHHHHHH--hhcCCcee-eecCcc Confidence 1134445554 356776679998764443 456666 5788888888999975444433 24556555 445555 Q ss_pred cChh-hcc--CCCCcceecCC---ccc---ccccccCCcc-chHHHHHHHHHHHHHHHHHH--HHHhhccCCCCCCCHHH Q lcl|NC_020414. 302 TDVD-HFV--NSGTGEVITGV---EED---IHIVQLGKYA-DLTPISAVLEVYTRRIGVIF--MMETMTRRDAERVTAVE 369 (515) Q Consensus 302 ~~~~-~~~--~~~~g~~~~g~---~~~---v~~~~~~~~~-~l~~~~~~i~~~~~rI~~af--l~~~l~~~~~~~~TAtE 369 (515) ...+ .+. .+.+|.++.-+ .+. ..+++..... -.+.....++...+.|++.- .-.++.+.. ...+..- T Consensus 382 ~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~-na~SGvA 460 (714) T protein:vir:99 382 QLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDS-GATSGVA 460 (714) T ss_pred cccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCc-cchhHHH Confidence 3322 221 23444443221 111 1122222222 22333444444445554431 111233332 2356667 Q ss_pred HHHHHHHHHHHhhhhHHHHHHH------HHHHHHHHHHH--------hcCCC----------CCh------hhc---cce Q lcl|NC_020414. 370 IQRDALEIEQNMGGVYSLFAMT------MQTPIAMWGLQ--------EAGDS----------FTS------ELV---DPV 416 (515) Q Consensus 370 i~~r~~E~~~~LGpv~~rl~~E------~l~Pli~r~~~--------~~~~~----------~p~------~~~---~~~ 416 (515) |..|++.-...|...+.+|..- ++.-||...+. +.... ++. -++ +.. T Consensus 461 i~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~D 540 (714) T protein:vir:99 461 ISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTH 540 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEE Confidence 9999999999998888666543 33334433321 11010 000 001 111 Q ss_pred e---eeehHHHHHHHHHHHHHHHHHHHHHh-hc-CChHHH---hcCCHHHHHHHHHHhcCCch--hccCCHHHHHHHHHH Q lcl|NC_020414. 417 I---VTGIEALGRMAELDKLANFAQYMSLP-QT-WPEPAQ---RAIRWGDYMDWVRGQISAEL--PFLKSEEEMQQEMAQ 486 (515) Q Consensus 417 ~---v~~l~~l~ra~~~~~l~~~~~~v~~~-a~-~~p~~~---d~id~d~~~~~~a~~~Gvp~--~~irs~eev~~~rq~ 486 (515) + +.+-++-.|.+..+.+.++++.+... .. ..+-++ |.-+.+++++.+-..+|.+. .....+++.++..++ T Consensus 541 v~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q 620 (714) T protein:vir:99 541 IALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQ 620 (714) T ss_pred EEEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHH Confidence 1 23345666777777777777654322 11 222233 44456789999999888753 333333332222221 Q ss_pred HHHHHHHH-HHH-------------HHh-hhh-ccchhhhh---hccC Q lcl|NC_020414. 487 QAQAQQEA-MLN-------------EGV-AKA-VPGVIQQE---MKEG 515 (515) Q Consensus 487 ~~~~~q~~-~~~-------------~~~-~~a-~~~~~~~~---~~~~ 515 (515) ..+++|.+ ++. +.+ +++ ....-++. .++| T Consensus 621 ~~~~~q~~lq~~~~~a~~~k~eae~~~a~a~a~~~~~~a~~~~~~~~~ 668 (714) T protein:vir:99 621 ALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQG 668 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111110 000 000 000 00000000 0000 No 44 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=99.00 E-value=6.5e-09 Score=65.48 Aligned_cols=478 Identities=11% Similarity=0.035 Sum_probs=201.5 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHH----Hhh-cccccCCCCCCc-------ccccc---ccccHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFA----KLT-LPYLMNNKGDNE-------TSQNG---WQGVGAQA 65 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~----~~~-~P~~~~~~~~~~-------~~~~~---~dst~~~a 65 (515) ||++.... ..++..||.... .|.+.|+.-+ +|. .+..-.+..... ...++ |+-++... T Consensus 1 ma~~~~~~---l~~~~~~~~~~~----~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v 73 (720) T protein:vir:35 1 MAETLQKR---HEQIMRKFDRAH----SPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTEL 73 (720) T ss_pred CchHHHHH---HHHHHHHHHHHH----hhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHH Confidence 99885211 233344444333 3334444322 332 121111110000 01122 34443322 Q ss_pred HHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE Q lcl|NC_020414. 66 TNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCL 145 (515) Q Consensus 66 ~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~ 145 (515) +++.+.-- .+++=+++.|.+..- + .++.+.| +..+......++...+...+|.+.+..|.|+ T Consensus 74 -----~~v~g~~~-~nr~d~~v~P~~~~~------d---~~~Ae~l---~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~ 135 (720) T protein:vir:35 74 -----NRIISEYR-HNRITVKFRPGDKTA------S---EALANKL---NGLFRADYEETDGGEACDNAFDDGSTGGFGC 135 (720) T ss_pred -----HHHHhHHH-hCCCceEEEcCCCcc------h---HHHHHHH---HHHHHHHHHhcCchHHHhHHHHHhhhcccee Confidence 23333322 255556666653220 0 1223333 2334445567889999999999999988887 Q ss_pred EEE-----eCCCc--------EEEE--EcceEEEeeCCC---CC-eeEEEEEEEecHHHHHHHhcccccchhh------h Q lcl|NC_020414. 146 LYK-----PSKGA--------MSAV--PMHHYVVNRDTN---GD-LMDVILLQEKALRTFDPATRMAIEVGMK------G 200 (515) Q Consensus 146 l~~-----d~~~~--------~r~~--pl~~y~v~~d~~---G~-vd~i~r~~~~t~~ql~~~~~~~~~~~~~------~ 200 (515) +-+ +..++ ++++ |..++++..++. +. -.-+||...|+.+++...|+++...... . T Consensus 136 ~~v~~d~~~~~d~~~~~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~ 215 (720) T protein:vir:35 136 FRLTTNLVNALDPMDERQRICLEPIYDPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWD 215 (720) T ss_pred EEeeecccccCCCCcccceeeEecccCchhheeecccccccChhhhhhhhhhcCCCHHHHHHhCCCcccccccccccccc Confidence 632 22111 2222 445666654442 21 2236788889999999999975432110 0 Q ss_pred ccCCCcccEEEEEEEEE-----------cC----------C------------C----------CeEEEEE-eCCeeecc Q lcl|NC_020414. 201 KKCKEDDNVKLYTHAQY-----------AG----------E------------G----------FWKINQS-ADDIPVGK 236 (515) Q Consensus 201 ~~~~~~~~v~v~~~v~~-----------~~----------~------------~----------~~~~~~e-~~~~~i~~ 236 (515) ........|.+..+.++ ++ + + .+.||.. +.|..++. T Consensus 216 ~d~~~~~~v~i~E~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~ 295 (720) T protein:vir:35 216 YDWYDVDVVYIAKYYEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLE 295 (720) T ss_pred ccccCCCceEEEEeeEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcc Confidence 01111223333332111 11 0 0 0123322 35555553 Q ss_pred c-CCcccccCcEEEE---eeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccC-------hh Q lcl|NC_020414. 237 E-NRIKAEKLPFIPL---TWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTD-------VD 305 (515) Q Consensus 237 e-sgy~~~~~P~~~~---Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~-------~~ 305 (515) + +-++...|||+++ ||. .+|..+..|.+....+-.+.+|+..-..+..+...-.-++...++++-. ++ T Consensus 296 ~~~~~p~~~fP~vP~~g~r~~-~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~a~~~~~~~~~~~a~~~ 374 (720) T protein:vir:35 296 KAQRIPGEHIPLIPVYGKRWF-IDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQDTGSIPIVGKSQIKTLEKYWANRN 374 (720) T ss_pred cCCCCCCCccceEEEEeeeec-cCCCcccceeeecchhHHHHHHHHHHHHHHHHHcCCccccccCcchHHHHHHHhhccc Confidence 3 3344455888765 444 3677777888888999999999877777776654433333333332111 11 Q ss_pred hc---------cCCCCcceecC--CcccccccccCCccchHHHHHHHHHHHHH--HHHHHHHHhhccCCCCCCCHHHHHH Q lcl|NC_020414. 306 HF---------VNSGTGEVITG--VEEDIHIVQLGKYADLTPISAVLEVYTRR--IGVIFMMETMTRRDAERVTAVEIQR 372 (515) Q Consensus 306 ~~---------~~~~~g~~~~g--~~~~v~~~~~~~~~~l~~~~~~i~~~~~r--I~~afl~~~l~~~~~~~~TAtEi~~ 372 (515) .. ....+|.+++. ...-..+.++. .+-++.++....++++. |..+ ++.+..+ .+..-|.. T Consensus 375 ~~~~~~l~~~~~~~~~G~~~~~~~~~~~~~~~~~~-~~~~~llq~~~~~i~~vsGi~~~----~lG~~sn--~SG~Ai~~ 447 (720) T protein:vir:35 375 KNRPAFLPLNEIVDKQGNIIAPPTPVGYTQPQPLN-QAMAALLQQTGADIQEVTGSSQA----MQPMPSN--IAKETVNH 447 (720) T ss_pred cccccccccccccccCcccccCCCcccccCCCCCc-hHHHHHHHHHHHHHHHHhCCChH----HcCcccc--hHHHHHHH Confidence 11 01123333221 11111112221 12233344444443332 2333 3344333 46668888 Q ss_pred HHHHHHHHhhhhHHHHH------HHHHHHHHHHHHHh------cC----C-----------CCChh-----hccc---ee Q lcl|NC_020414. 373 DALEIEQNMGGVYSLFA------MTMQTPIAMWGLQE------AG----D-----------SFTSE-----LVDP---VI 417 (515) Q Consensus 373 r~~E~~~~LGpv~~rl~------~E~l~Pli~r~~~~------~~----~-----------~~p~~-----~~~~---~~ 417 (515) |++.-...+...+..|. -+++.-||...+.. .+ + +.++. ++.. .+ T Consensus 448 rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv 527 (720) T protein:vir:35 448 LMHRSDMSSFIYLDNMAKSLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDV 527 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEE Confidence 88888888888776654 44555555544321 01 0 11111 1111 11 Q ss_pred ee---ehHHHHHHHHHHHHHHHHHHHHHhhcC-------ChHHHhcCCHH---HHHHHHHHhcCCchhccC--CHHHHHH Q lcl|NC_020414. 418 VT---GIEALGRMAELDKLANFAQYMSLPQTW-------PEPAQRAIRWG---DYMDWVRGQISAELPFLK--SEEEMQQ 482 (515) Q Consensus 418 v~---~l~~l~ra~~~~~l~~~~~~v~~~a~~-------~p~~~d~id~d---~~~~~~a~~~Gvp~~~ir--s~eev~~ 482 (515) +. +-.+-.|.+....+.+++ ..+.+. .+.++...|+. +++..+...+. |...+. ..++-++ T Consensus 528 ~v~~~p~~~s~req~~~~m~qll---~~~~p~~~~~~~~~~~ile~~d~p~~~e~~erirk~~~-~~~~~~~~~~e~qq~ 603 (720) T protein:vir:35 528 TVDVGPSYTARRDATVSVLTNLL---AGMLPQDPMRQVLQGIILDNMEGEGLDEFKEYNRKQLL-TQGVVKPRNTEEEQM 603 (720) T ss_pred EEecccCcccHHHHHHHHHHHHH---HhcCCCchhHHHHHHHHHHhcCchhHHHHHHHHHhhcc-hhcccCccChhHHHH Confidence 21 223334555555444444 333222 22345555554 44444433322 111122 1222222 Q ss_pred HHHHHHHHHHHHHHHHHh----hhhccch--hhhhhccC Q lcl|NC_020414. 483 EMAQQAQAQQEAMLNEGV----AKAVPGV--IQQEMKEG 515 (515) Q Consensus 483 ~rq~~~~~~q~~~~~~~~----~~a~~~~--~~~~~~~~ 515 (515) +.++.++++|++...+.+ .++.... .....++. T Consensus 604 ~a~~qq~~qq~~~e~~~aqa~l~qaqae~~kaqa~~~~~ 642 (720) T protein:vir:35 604 VAQMIQQAQQPNAELVAAQGVLMQGQAEVQKAKNEELAI 642 (720) T ss_pred HHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 211111111111000000 0000000 00000000 No 45 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=98.94 E-value=1.2e-08 Score=64.00 Aligned_cols=489 Identities=11% Similarity=0.029 Sum_probs=218.4 Q ss_pred CCCcccc------ccccHHHHHHHHHHHHHhhhhHHHHHH----HHHHhhcccccCCCC-CCc-----ccccc-ccccHH Q lcl|NC_020414. 1 MQDTILE------YGGQRSKIPKLWEKFSKKRSPYLDRAK----HFAKLTLPYLMNNKG-DNE-----TSQNG-WQGVGA 63 (515) Q Consensus 1 ~~~~~~~------~~~~~~~l~~r~~~lk~~R~~~e~~w~----e~~~~~~P~~~~~~~-~~~-----~~~~~-~dst~~ 63 (515) |.+-... .+-++......|..+..++. +.+.|+ +-.+|..- ...+. ... ..... |+-++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~r~~a~~d~~fy~G--~Qw~~~~~~~l~~~g~p~~~~N~i~~ 77 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPRFSQRQLLSLCSDID-SQPLWRDAANKACAYYDG--DQLAPEVIQVLKDRGQPMTIHNLIAP 77 (714) T ss_pred CCcCcCcccCCCcchhhhhhhHHHHHHHHHHHh-hhHHHHHHHHHHHHhhcC--CCCCHHHHHHHHhcCCCcEEeccHHH Confidence 3221111 01122223344444444432 345565 44445421 01110 000 00111 333332 Q ss_pred HHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCc Q lcl|NC_020414. 64 QATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGN 143 (515) Q Consensus 64 ~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~ 143 (515) .++... +.-- .+++=+++.|.+...+ -.++. +.++..+......++...+...+|.+.+..|- T Consensus 78 -~v~~v~----g~~~-~nr~~~~v~pr~~~~~--------~~~~A---e~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~ 140 (714) T protein:vir:10 78 -TVDGVL----GMEA-KTRTDLIVMSDDPNDE--------TEKLA---EAINAEFADACRLGNMNKARSDAYAEQIKAGL 140 (714) T ss_pred -HHHHHH----HHHH-hCCcceEEecCCCChh--------hHHHH---HHHHHHHHHHHHhhchhHHHHHHHHHhhhccc Confidence 222222 2222 2455556666432211 01122 23344555667788999999999999988877 Q ss_pred eEE--EEeCCC-----cEEEEEcceEEEeeCCCC----CeeEEEEEEEecHHHHHHHhcccccc--hhh----------- Q lcl|NC_020414. 144 CLL--YKPSKG-----AMSAVPMHHYVVNRDTNG----DLMDVILLQEKALRTFDPATRMAIEV--GMK----------- 199 (515) Q Consensus 144 ~~l--~~d~~~-----~~r~~pl~~y~v~~d~~G----~vd~i~r~~~~t~~ql~~~~~~~~~~--~~~----------- 199 (515) |.+ +.|.+. .++.+|..++++..++.- .-.-+|++.++|.+++...|+..... ... T Consensus 141 G~~~~~~d~d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~ 220 (714) T protein:vir:10 141 SWVEVRRNSEPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTV 220 (714) T ss_pred ceEEeeeccCCCCCCeEEEecChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchh Confidence 665 555432 245566678888765432 12236899999999999999853210 000 Q ss_pred -----------------hc------cCCCcccEEEEEEEEEcC---------CCC------------------------- Q lcl|NC_020414. 200 -----------------GK------KCKEDDNVKLYTHAQYAG---------EGF------------------------- 222 (515) Q Consensus 200 -----------------~~------~~~~~~~v~v~~~v~~~~---------~~~------------------------- 222 (515) .. ......+|.++.+.+..+ +|- T Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~ 300 (714) T protein:vir:10 221 TEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVG 300 (714) T ss_pred hhhhcccccccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceeccc Confidence 00 001123466666633211 110 Q ss_pred --eEEE-EEeCCeeeccc--CCcccccCcEEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCcee Q lcl|NC_020414. 223 --WKIN-QSADDIPVGKE--NRIKAEKLPFIPLTWKR--SYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYL 295 (515) Q Consensus 223 --~~~~-~e~~~~~i~~e--sgy~~~~~P~~~~Rw~~--~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l 295 (515) .++| .-+.|.+++.+ +-|+...|||++.-... ..|..| |.+....+-.+.+|+..-..+-+ +..+-+ + T Consensus 301 ~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~--G~vr~~~d~Qr~~N~~~s~~~~~--l~~~~~-~ 375 (714) T protein:vir:10 301 RVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY--GLISRAIPAQDEVNFRRIKLTWL--LQAKRV-I 375 (714) T ss_pred ceeeEEEEEEecchhhhcCCCCCCCCceeeEEecceeeeccCccc--eehhhhhhHHHHHHHHHHHHHHH--HhCCce-e Confidence 1121 12344555544 45666678887654333 445555 67777888888898755444332 345544 4 Q ss_pred ecCccccCh-hhcc--CCCCcceec---CC---cccccccccCCccch-HHHHHHHHHHHHHHHHHH--HHHhhccCCCC Q lcl|NC_020414. 296 IRPGSQTDV-DHFV--NSGTGEVIT---GV---EEDIHIVQLGKYADL-TPISAVLEVYTRRIGVIF--MMETMTRRDAE 363 (515) Q Consensus 296 ~~~~g~~~~-~~~~--~~~~g~~~~---g~---~~~v~~~~~~~~~~l-~~~~~~i~~~~~rI~~af--l~~~l~~~~~~ 363 (515) +.++++..- +.+. .+.+|.++. +. .+...++.......+ +.....++...+.|++.- .-.++.+. +. T Consensus 376 ~~~gav~~~d~~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~-~n 454 (714) T protein:vir:10 376 MDEDATQLSDNDLMEQLERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQD-SG 454 (714) T ss_pred eccccccccHHHHHHhccCCCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCC-cc Confidence 445554332 2221 133444432 11 111122333332222 234444555555555542 11122332 23 Q ss_pred CCCHHHHHHHHHHHHHHhhhhHHHHHHH------HHHHHHHHHHHh--------c-CCCCCh---------------hhc Q lcl|NC_020414. 364 RVTAVEIQRDALEIEQNMGGVYSLFAMT------MQTPIAMWGLQE--------A-GDSFTS---------------ELV 413 (515) Q Consensus 364 ~~TAtEi~~r~~E~~~~LGpv~~rl~~E------~l~Pli~r~~~~--------~-~~~~p~---------------~~~ 413 (515) ..+..-|..|++.-...|+..+.+|..- ++.-||...+.. . ...... -++ T Consensus 455 a~SGvAI~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi 534 (714) T protein:vir:10 455 ATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDI 534 (714) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccc Confidence 3577779999999999999888776653 233334332211 0 100000 001 Q ss_pred ---ccee---eeehHHHHHHHHHHHHHHHHHHHHHh-hc-CChHHHhcC---CHHHHHHHHHHhcCCch--hccCCHHHH Q lcl|NC_020414. 414 ---DPVI---VTGIEALGRMAELDKLANFAQYMSLP-QT-WPEPAQRAI---RWGDYMDWVRGQISAEL--PFLKSEEEM 480 (515) Q Consensus 414 ---~~~~---v~~l~~l~ra~~~~~l~~~~~~v~~~-a~-~~p~~~d~i---d~d~~~~~~a~~~Gvp~--~~irs~eev 480 (515) +..+ +.+-.+-.|.+..+.+.++++.+... +. ..+.+++.. +.+++++.+...+|.+. +-...+++. T Consensus 535 ~~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~p~~~ei~~~ir~~~~~~~~~~~~~~e~q~ 614 (714) T protein:vir:10 535 SRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQE 614 (714) T ss_pred eeeeEEEEEeeccCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCcCHHHHHHHHHHHcCCCCCccccCcchhH Confidence 1111 12334555666666666665543211 11 122233444 46789999999998753 333333322 Q ss_pred HHHHHHHHHHHHHH-HH-------------HHHh-hhhc-cchhhhh---------------------hccC Q lcl|NC_020414. 481 QQEMAQQAQAQQEA-ML-------------NEGV-AKAV-PGVIQQE---------------------MKEG 515 (515) Q Consensus 481 ~~~rq~~~~~~q~~-~~-------------~~~~-~~a~-~~~~~~~---------------------~~~~ 515 (515) ++-.++..+++|.+ ++ ++.+ ++|. ...-++. .+++ T Consensus 615 ~q~~~~~~~~~q~~l~~~e~~a~~~k~eaea~~~~aqa~~~~~~a~~~~~~~~~q~~~~~~~~a~~a~~l~~ 686 (714) T protein:vir:10 615 VAAQQQALQQQQAELQMREMAGRVAKLEADAARAHAAAQRDNASAQREVALTQGQRYVDALNQAHTAEIITG 686 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22111111111100 00 0000 0000 0000000 0000 No 46 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=98.84 E-value=3.1e-08 Score=61.75 Aligned_cols=487 Identities=13% Similarity=0.056 Sum_probs=205.6 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhc-ccccCCCCCCc---------ccccc-ccccHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTL-PYLMNNKGDNE---------TSQNG-WQGVGAQATNHL 69 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~-P~~~~~~~~~~---------~~~~~-~dst~~~a~~~L 69 (515) |++.... .-.++..||..-....+.|-..+++=.+|.. +..-.+..... ..... |+-++ ..++. T Consensus 1 m~e~~~~---~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~-~~v~~- 75 (706) T protein:vir:10 1 MAESRQK---QHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVA-TELNR- 75 (706) T ss_pred CCcchHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchH-HHHHH- Confidence 9884311 1233444444433333333333333344442 21110110000 11112 33332 23333 Q ss_pred HHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--- Q lcl|NC_020414. 70 ANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--- 146 (515) Q Consensus 70 aa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--- 146 (515) +.+..-- +++=+++.|.+... -.++.+.| +..+......++...+...+|.+.+..|.|.+ T Consensus 76 ---v~g~~~~-nr~~~~v~P~~~~~---------d~~~Ae~l---~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~ 139 (706) T protein:vir:10 76 ---IISEYRN-NRISVKFRPGDNAA---------SEELANKL---NGLFRADYEETDGGEACDNAFDDAATGGFGCFRLT 139 (706) T ss_pred ---HhhHHHh-CCCceEEecCCCCc---------hHHHHHHH---HHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEee Confidence 2222222 44445666532110 01122222 33444556688999999999999999888864 Q ss_pred --EEeCCC------cEEE----EEcceEEEeeC---CCCC-eeEEEEEEEecHHHHHHHhcccccchh--hhcc-----C Q lcl|NC_020414. 147 --YKPSKG------AMSA----VPMHHYVVNRD---TNGD-LMDVILLQEKALRTFDPATRMAIEVGM--KGKK-----C 203 (515) Q Consensus 147 --~~d~~~------~~r~----~pl~~y~v~~d---~~G~-vd~i~r~~~~t~~ql~~~~~~~~~~~~--~~~~-----~ 203 (515) |.+..+ .+.. .|+.++++.-+ .++. -.-+||...|+.+++...|++...... .... . T Consensus 140 ~d~~~~~d~~~~~~~i~i~~v~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~ 219 (706) T protein:vir:10 140 TSFVNEYDPMDERQRIAVEPIYDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWF 219 (706) T ss_pred eccccccCCCCCCccceeeeeccchhceecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhcccccccccc Confidence 222211 1221 26677776544 3443 234789999999999999986532210 0000 0 Q ss_pred CCc--ccEEEE-------EEEE-EcC----------------------CCC----------eEEEE-EeCCeeeccc-CC Q lcl|NC_020414. 204 KED--DNVKLY-------THAQ-YAG----------------------EGF----------WKINQ-SADDIPVGKE-NR 239 (515) Q Consensus 204 ~~~--~~v~v~-------~~v~-~~~----------------------~~~----------~~~~~-e~~~~~i~~e-sg 239 (515) .++ ...++| +.++ .++ .++ ..+|. ...|..++.+ +- T Consensus 220 ~~d~~~~~eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p 299 (706) T protein:vir:10 220 TPDVVYIAKYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRR 299 (706) T ss_pred CCCcceecccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCC Confidence 000 000001 1111 110 010 12222 2345555533 55 Q ss_pred cccccCcEEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccc-------cChhhc--- Q lcl|NC_020414. 240 IKAEKLPFIPLTWKR--SYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQ-------TDVDHF--- 307 (515) Q Consensus 240 y~~~~~P~~~~Rw~~--~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~-------~~~~~~--- 307 (515) |+...|||+++-..+ ..+.-...|.+....+-.+.+|+..-..+..+...-+-++.+.++.+ .++... T Consensus 300 ~~~~~~P~vP~~g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 379 (706) T protein:vir:10 300 IPGEHIPLIPVYGKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGLEQHWEGRNRKRPA 379 (706) T ss_pred CCCCccceEEEeeccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHHHHHhhhccccccc Confidence 766778888653222 25555667788888888999998777777665554443333332211 111100 Q ss_pred ------cCCCCcceecCCcccccccccCCccch-HHHHHHHHHHHHHHHHHH--HHHhhccCCCCCCCHHHHHHHHHHHH Q lcl|NC_020414. 308 ------VNSGTGEVITGVEEDIHIVQLGKYADL-TPISAVLEVYTRRIGVIF--MMETMTRRDAERVTAVEIQRDALEIE 378 (515) Q Consensus 308 ------~~~~~g~~~~g~~~~v~~~~~~~~~~l-~~~~~~i~~~~~rI~~af--l~~~l~~~~~~~~TAtEi~~r~~E~~ 378 (515) ....+|.+++... ....++ .+.+ +.....++.-.+.|.+.- .-.++.+.++ ++.--|..|++.-. T Consensus 380 ~l~~~~~~~~~g~i~~~~~-~~~~~~---~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn--~SG~Ai~~rq~qg~ 453 (706) T protein:vir:10 380 FLPLRTVTDKTGNVVAPAN-VAGYTQ---APVLNQALAALLQQTSADIQEVTGSSQAMQQMPSN--VARETVNSLLNRSD 453 (706) T ss_pred chhcccccCCCCccccccc-ccccCC---CcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccc--hHHHHHHHHHHHHH Confidence 0112343333211 111111 1112 122333444444444442 2223444332 57778899998888 Q ss_pred HHhhhhHHHHH------HHHHHHHHHHHHHh------cCC----CC---------C--hh-----hccc---eee---ee Q lcl|NC_020414. 379 QNMGGVYSLFA------MTMQTPIAMWGLQE------AGD----SF---------T--SE-----LVDP---VIV---TG 420 (515) Q Consensus 379 ~~LGpv~~rl~------~E~l~Pli~r~~~~------~~~----~~---------p--~~-----~~~~---~~v---~~ 420 (515) ..+...+..|. -+++.-||...+.. .+. +. | +. ++.. .++ .+ T Consensus 454 ~~~~~~~Dnl~~~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p 533 (706) T protein:vir:10 454 MASFIYLDNMAKSLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGP 533 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEeccc Confidence 88888876554 34455555443321 010 10 0 00 1111 111 22 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcCC----hHHHhcCCH---HHHHHHHHHhcCCchhccCCH-HHHHHHHHHHHHHHH Q lcl|NC_020414. 421 IEALGRMAELDKLANFAQYMSLPQTWP----EPAQRAIRW---GDYMDWVRGQISAELPFLKSE-EEMQQEMAQQAQAQQ 492 (515) Q Consensus 421 l~~l~ra~~~~~l~~~~~~v~~~a~~~----p~~~d~id~---d~~~~~~a~~~Gvp~~~irs~-eev~~~rq~~~~~~q 492 (515) -.+-.|.+..+.+.++++.+....++- +-+++..|+ +++++.+-..++. ....... ++.+++.++.+++++ T Consensus 534 ~~~t~r~~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~~-q~~~~~~~~~eq~~~~q~qq~q~ 612 (706) T protein:vir:10 534 SYSARRDATVNALTQLLQGMLPQDPMRPALMGIIIDNMEGEGLDDFKAFNRRQLLT-QGIVKPRNQQEQAIVQQAQQAQA 612 (706) T ss_pred CcchHHHHHHHHHHHHHHhcCCcchhhHHHHHHHHhhcCccchHHHHHHHHHhhcc-cCCccccchhHHHHHHHHHHHHH Confidence 334446666666555555332111111 223444443 3455555444442 2222322 222222222222221 Q ss_pred HHHHHH----Hhhhhcc-----ch--------------hhhhhccC Q lcl|NC_020414. 493 EAMLNE----GVAKAVP-----GV--------------IQQEMKEG 515 (515) Q Consensus 493 ~~~~~~----~~~~a~~-----~~--------------~~~~~~~~ 515 (515) +++..+ ++.++.. .. ..+.+++. T Consensus 613 ~q~~~~~~~~~aq~~~~qA~~~k~~a~~~q~~~~a~~a~~qa~~~~ 658 (706) T protein:vir:10 613 TQPDPNMLLAQAQMVVAQAEAQKSQNETVQTQIKAFTAQQDAMESQ 658 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111110 0000000 00 00111100 No 47 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=98.82 E-value=3.5e-08 Score=61.43 Aligned_cols=432 Identities=11% Similarity=0.035 Sum_probs=174.2 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc-----ccCCCCCCccccccccccHHHHHHHHHHHHH- Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY-----LMNNKGDNETSQNGWQGVGAQATNHLANKLA- 74 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~- 74 (515) |++++.-- +...+.....++..++ ++++.+.+|..-. ....-....+..++..+-+..++++++..|. T Consensus 1 ~~~~~~~d--~~~~i~~L~~~~~~~~----~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~ 74 (488) T protein:vir:23 1 MAETESID--PEKLRDQLLDAFENKQ----NELKSSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTYVDAIAERQEL 74 (488) T ss_pred CCcccCCC--HHHHHHHHHHHHHHHH----HHHHHHHHHHhcccchhhcCcccchhhhhhhhhcchHHHHHHHHHHhhhc Confidence 99888444 3333344444444333 4445555554221 1100000111123445666677777776553 Q ss_pred -HhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCC-- Q lcl|NC_020414. 75 -QVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSK-- 151 (515) Q Consensus 75 -s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~-- 151 (515) +..+|....+=-....+ .++.+. +...+..+||.....++.++..++|.+.+++... T Consensus 75 ~Gf~~~~~~~~~~~~~~d-------------~~~~~~-------l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~ 134 (488) T protein:vir:23 75 EGFRIPSANGEEPESGGE-------------NDPASE-------LWDWWQANNLDIEATLGHTDALIYGTAYITISMPDP 134 (488) T ss_pred cceeccCCcccccccccc-------------hhHHHH-------HHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCc Confidence 22222121111111111 111222 3345677899999999999999999997765321 Q ss_pred --------C--cEEEEEcce-EEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCC Q lcl|NC_020414. 152 --------G--AMSAVPMHH-YVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGE 220 (515) Q Consensus 152 --------~--~~r~~pl~~-y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~ 220 (515) . .+++++-.+ |++.-+..+++...++.+.- . . +..+..++ ...++ T Consensus 135 ~~~~~~~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~----------~---------~---~~~~~~~~--~y~~~ 190 (488) T protein:vir:23 135 EVDFDVDPEVPLIRVEPPTALYAEVDPRTRKVLYAIRAIYG----------A---------D---GNEIVSAT--LYLPD 190 (488) T ss_pred ccccCCCCCcceEEEeccceeEEEEecCCCceEEEEEEEEe----------c---------C---CCcEEEEE--EEecC Confidence 1 256665555 55554566777766655430 0 0 01111111 11122 Q ss_pred CCeEEEEEeCCee-ecccCCcccccCcEEEEeeeecCCCccccchHHHH-HHHHHHHHHHHHHHHHHHHHhccCceeec- Q lcl|NC_020414. 221 GFWKINQSADDIP-VGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDY-SGDLFVIQFLSEAVARGAALMADIKYLIR- 297 (515) Q Consensus 221 ~~~~~~~e~~~~~-i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~-l~d~k~L~~l~~~~~~~~~~a~~p~~l~~- 297 (515) . ..+|...++.- +.....+....+|++.++.+...++.+|+|=..+. .+-+..++...-.....++..+.|...+- T Consensus 191 ~-~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G 269 (488) T protein:vir:23 191 T-TMTWLRAEGEWEAPTSTPHGLEMVPVIPISNRTRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFG 269 (488) T ss_pred c-EEEEEecCCceEeccccccCCCCcceEEeccccccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhC Confidence 1 11222222322 22121122246999999999888999999955433 34455666666666666666666543321 Q ss_pred ---Cccc---cChhhccCCCCcceecCCcc-cccccccCCccchHHHHHHHHHHHHHHHHHHHH-----HhhccCCCCCC Q lcl|NC_020414. 298 ---PGSQ---TDVDHFVNSGTGEVITGVEE-DIHIVQLGKYADLTPISAVLEVYTRRIGVIFMM-----ETMTRRDAERV 365 (515) Q Consensus 298 ---~~g~---~~~~~~~~~~~g~~~~g~~~-~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~-----~~l~~~~~~~~ 365 (515) ++.. .+...+.....|.+.....+ ++...+++ .++++ ..++.++.-|...+.. ..+.......- T Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~q~~-~~~~~---~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~ 345 (488) T protein:vir:23 270 AKPEELGINAETGQRMFDAYMARILAFEGGEGAHAEQFS-AAELR---NFVDALDALDRKAASYSGLPPQYLSSSSDNPA 345 (488) T ss_pred CCcccccccccccchhhhhhhhhhccCCCCCCceeEecC-CCChH---HHHHHHHHHHHHHhcccCCCHHHhccccCcch Confidence 1100 01111112222322211111 12223332 22333 4444444444433211 01111111112 Q ss_pred CHHHHH-------HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccc--ee--eeehHHHHHHHHHHHHH Q lcl|NC_020414. 366 TAVEIQ-------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDP--VI--VTGIEALGRMAELDKLA 434 (515) Q Consensus 366 TAtEi~-------~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~--~~--v~~l~~l~ra~~~~~l~ 434 (515) ++.-++ .+++++...+|..+.++..-+ -.++. ....+.+..++ .+ ..+.+-++.+....+|. T Consensus 346 Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~-----~~~~~--~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~ 418 (488) T protein:vir:23 346 SAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLA-----YKMVK--GGDIPTEYYRMETVWRDPSTPTYAAKADAAAKLF 418 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhc--CCCcchhhccceEEecCCCCCCHHHHHHHHHHHH Confidence 343332 233444444444443333211 11112 12233332222 22 11223233333222221 Q ss_pred HHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhcc Q lcl|NC_020414. 435 NFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKE 514 (515) Q Consensus 435 ~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~ 514 (515) + .... .+..+. +.+.+|.... ..++++++.+++.+..+ .++.+..+.+....-.++... T Consensus 419 ---~---~g~~-------~~s~et----~~~~l~~~~d---~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 477 (488) T protein:vir:23 419 ---A---NGAG-------LIPRER----GWVDMGYTIV---EREQMRQWLEQDQKQGL-GLIGSLYGASTPEGKPGEAPV 477 (488) T ss_pred ---h---cccc-------cCCHHH----HHHhCCCCch---HHHHHHHHHHHHHHHHH-HHHHHHhccCCCcccCCCCCC Confidence 1 1011 111222 2222332110 12334443333222211 222333333332222333333 Q ss_pred C Q lcl|NC_020414. 515 G 515 (515) Q Consensus 515 ~ 515 (515) | T Consensus 478 ~ 478 (488) T protein:vir:23 478 G 478 (488) T ss_pred C Confidence 3 No 48 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=98.78 E-value=4.9e-08 Score=60.65 Aligned_cols=429 Identities=10% Similarity=0.038 Sum_probs=190.7 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc-----ccCCCC-CCccccccccccHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY-----LMNNKG-DNETSQNGWQGVGAQATNHLANKLA 74 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~~~-~~~~~~~~~dst~~~a~~~Laa~l~ 74 (515) |..++...--+.+.+.+.-+.....|.+ +++++.+|..-. ...... ......|+..+.+...++.+++-|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~---r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~ 107 (512) T protein:vir:97 31 YDGTESDLLQNINEVSKYIEHHMDYQRP---RLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (512) T ss_pred cCchhhhhhhhHHHHHHHHHHHHHhhHH---HHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhc Confidence 7777766655666666666655555544 445555554331 111111 1112235566777777777776654 Q ss_pred HhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC Q lcl|NC_020414. 75 QVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSKG 152 (515) Q Consensus 75 s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~~ 152 (515) + -|+ +++.+++. +.+ .+...+..++|.....++.++..++|.+.+ |.++++ T Consensus 108 g--~p~-----~~~~~d~~-------------~~~-------~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~ 160 (512) T protein:vir:97 108 G--NPI-----QCQDDDKD-------------VLE-------AIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (512) T ss_pred c--cCc-----eeccCChH-------------HHH-------HHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCC Confidence 4 121 22333321 111 233445668899999999999999998765 556665 Q ss_pred cEE--EEEc-ceEEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEE Q lcl|NC_020414. 153 AMS--AVPM-HHYVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQS 228 (515) Q Consensus 153 ~~r--~~pl-~~y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e 228 (515) .++ +++. .-|++.-|. .+++...+|.+++.... .....++++.-.+.++..+.+..+ T Consensus 161 ~~~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~-------------------~~~~~~~~~~~vyt~~~i~~~~~~ 221 (512) T protein:vir:97 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID-------------------KTDEDEVFTVDLFTSHGVYRYLTS 221 (512) T ss_pred ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeecc-------------------ccccceEEEEEEEeCCcEEEEEec Confidence 544 4544 445554333 36777666665432110 001111222212222322221111 Q ss_pred eCCe-----eecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccC Q lcl|NC_020414. 229 ADDI-----PVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTD 303 (515) Q Consensus 229 ~~~~-----~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~ 303 (515) -.+. ........+-+.+|++.++ ++..|+|=.+..++-+..++.+.-......+....|.+.+.-....+ T Consensus 222 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~ 296 (512) T protein:vir:97 222 RTNGLKLTPRENGFESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD 296 (512) T ss_pred CCCcccccccccccccccCcccceEeec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCC Confidence 1110 1111111112457877654 34678998899999999999887777777778777766543222223 Q ss_pred hhhccCCCCcceec-------------CCcccccccccCCccchHHHHHHHHHHHHHHHHHHH-HHhhccCCCCCCCHHH Q lcl|NC_020414. 304 VDHFVNSGTGEVIT-------------GVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-METMTRRDAERVTAVE 369 (515) Q Consensus 304 ~~~~~~~~~g~~~~-------------g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl-~~~l~~~~~~~~TAtE 369 (515) ...+.....+..+. +..+......+....+.......++.++..|-..-. .+.....-+...|+.. T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~A 376 (512) T protein:vir:97 297 PVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA 376 (512) T ss_pred chhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHH Confidence 33222222111111 001111111222334556666777777766633211 1100001112356666 Q ss_pred HHH-------HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCC-CCChh--hccceeee--ehHHHHHHHHHHHHHHHH Q lcl|NC_020414. 370 IQR-------DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGD-SFTSE--LVDPVIVT--GIEALGRMAELDKLANFA 437 (515) Q Consensus 370 i~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~-~~p~~--~~~~~~v~--~l~~l~ra~~~~~l~~~~ 437 (515) +.. ++.+++..++..+.++-. +|-.++..... ..+.+ .+.+.+-. +.+.+..++.+.++ . T Consensus 377 l~~~~~~l~~ka~~k~~~f~~~l~~~~~-----li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl---~ 448 (512) T protein:vir:97 377 MKYKLFGLEQRTKTKEGLFTKGLRRRAK-----LLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS---G 448 (512) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHH---h Confidence 643 445555555555444322 11112221111 11212 23333321 23333333322222 1 Q ss_pred HHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 438 QYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 438 ~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) |. ++ ...+++.+ -+++ -+++|++.+.+++++..+..+. ... ..+++..+.-.+. T Consensus 449 ---gi---iS--------~et~~~~l---~~v~----d~~~E~eri~~E~~~~~~~~~~--~~~-~~~~~~~~~~~~~ 502 (512) T protein:vir:97 449 ---GK---IS--------QTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQK--GIY-KDPRDINDDEQDD 502 (512) T ss_pred ---cc---Cc--------hHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHhh--ccc-CCCCCCCCCCCCC Confidence 11 11 12223222 1222 2356777666654433222211 111 1111111111111 No 49 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=98.77 E-value=5.7e-08 Score=60.31 Aligned_cols=476 Identities=12% Similarity=0.015 Sum_probs=211.5 Q ss_pred CCCcccccc------------ccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCC-CCc-----ccccc-cccc Q lcl|NC_020414. 1 MQDTILEYG------------GQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKG-DNE-----TSQNG-WQGV 61 (515) Q Consensus 1 ~~~~~~~~~------------~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~-~~~-----~~~~~-~dst 61 (515) |+-||.+-- .+... ..+|..-......|....++-.+|..-. ..+. ... ..... |+-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~q~~~r~~a~~d~~fy~G~--QW~~~~~~~l~~~g~p~~~~N~i 77 (772) T protein:vir:10 1 MQITENDRQYLNGLPPAGDTPLTVDE-YADINYEIEDQPAWRAVADKEMDYADGN--QLDTELLRRQQALGIPPAVEDLI 77 (772) T ss_pred CCcchhhHHhhccCCcccccccCHHH-HHHHHHHHhccHHHHHHHHHHHHhhcCC--CCCHHHHHHHHhcCCCcEEEcch Confidence 666663321 11122 2233333222233443344445554211 1111 000 01111 3333 Q ss_pred HHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh Q lcl|NC_020414. 62 GAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVA 141 (515) Q Consensus 62 ~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~ 141 (515) +. .++...+ .-- .+++=+++.+.++.- -.++.+.| +..+......+++..+...+|.+.+.. T Consensus 78 ~~-~v~~v~g----~~~-~nr~d~~v~Pr~~~~---------d~~~Ae~l---~~~~~~~~~~~~~~~~~s~Af~~~i~~ 139 (772) T protein:vir:10 78 GP-ALLSLQG----YEA-VTRTDWRVTPNGDVG---------GQEVADAL---NYRLNTAERQSGADRACSEAFRPQIAC 139 (772) T ss_pred HH-HHHHHHH----HHH-hcCcceEEecCCCch---------HHHHHHHH---HHHHHHHHHhcChHHHHHHHHHHhhhc Confidence 32 2332222 222 255556666642110 01233333 334555566889999999999998888 Q ss_pred CceEE--EEeCCC-----cEEEEEcceEEEeeCCCCCeeE---EEEEEEecHHHHHHHhcccccch-h--h--------- Q lcl|NC_020414. 142 GNCLL--YKPSKG-----AMSAVPMHHYVVNRDTNGDLMD---VILLQEKALRTFDPATRMAIEVG-M--K--------- 199 (515) Q Consensus 142 G~~~l--~~d~~~-----~~r~~pl~~y~v~~d~~G~vd~---i~r~~~~t~~ql~~~~~~~~~~~-~--~--------- 199 (515) |-|.+ +.+.+. .++.++..++++.-++.....+ +||...|+.+++...|++...-. . . T Consensus 140 G~Gw~e~~~~~d~~~~~i~i~~v~p~~v~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~ 219 (772) T protein:vir:10 140 GIGWVEVSRESDPFKFPYRCRPIRRDEIHWDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQP 219 (772) T ss_pred CceeEEeccccCCCCCCeEEEeeCcccceecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcc Confidence 87654 333221 2455677788887766544444 78999999999999998642100 0 0 Q ss_pred ----hc------------------------cCCCcccEEEEEEEEEcC---------CCC-------------------- Q lcl|NC_020414. 200 ----GK------------------------KCKEDDNVKLYTHAQYAG---------EGF-------------------- 222 (515) Q Consensus 200 ----~~------------------------~~~~~~~v~v~~~v~~~~---------~~~-------------------- 222 (515) .. .....++|+|+.+.++.+ +|. T Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~ 299 (772) T protein:vir:10 220 DLGMMEGGTSTGLHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRI 299 (772) T ss_pred cccccccccccccccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhccc Confidence 00 001125677777643321 110 Q ss_pred -------eEEE-EEeCCeeecc--cCCcccccCcEEEEeeee--cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020414. 223 -------WKIN-QSADDIPVGK--ENRIKAEKLPFIPLTWKR--SYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMA 290 (515) Q Consensus 223 -------~~~~-~e~~~~~i~~--esgy~~~~~P~~~~Rw~~--~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~ 290 (515) ..+| ..+.|.+++. .+-|+...|||++.-... ..|..| |.+....+-.+.+|+..-..+..+ +. T Consensus 300 ~~~~~~~~rv~~~~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~--G~vr~~kd~Qr~~N~~~S~~~~~l--~~ 375 (772) T protein:vir:10 300 SPKKVTVSRVRRSYWLGPHCLHDGPTPYTHRHFPYVPFFGFREDATGIPY--GYVRGMKYAQDSLNSGVSKLRWGM--SV 375 (772) T ss_pred chheeeeeEEEEEEEecceeeccCCCCCCCCccceEEEeeeEeccCCccc--chhhhhhhHHHHHHHHHHHHHHHH--hc Confidence 0111 1234556664 477887789998764333 455666 688888888888998655444432 22 Q ss_pred cCceeecCccccCh-hh-c--cCCCCcceecCCcc---c-ccccccCCccch-HHHHHHHHHHHHHHHHHH--HHHhhcc Q lcl|NC_020414. 291 DIKYLIRPGSQTDV-DH-F--VNSGTGEVITGVEE---D-IHIVQLGKYADL-TPISAVLEVYTRRIGVIF--MMETMTR 359 (515) Q Consensus 291 ~p~~l~~~~g~~~~-~~-~--~~~~~g~~~~g~~~---~-v~~~~~~~~~~l-~~~~~~i~~~~~rI~~af--l~~~l~~ 359 (515) +. .+. +.|.++. +. + ..+.++.++.-+++ . -.+++......+ ......++...+.|.+.- .-.++.+ T Consensus 376 ~~-~~~-~~gav~~~d~~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~ 453 (772) T protein:vir:10 376 AR-VER-TKGAVAMTDAQFRRQIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGR 453 (772) T ss_pred cc-ccc-cCCCccchhHHHHHhccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCC Confidence 22 233 3443332 11 1 22334444332222 1 112222222223 233444444445555431 2123344 Q ss_pred CCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHH------HHHHHHHHHHH-----------hcCCC------CC------h Q lcl|NC_020414. 360 RDAERVTAVEIQRDALEIEQNMGGVYSLFAMT------MQTPIAMWGLQ-----------EAGDS------FT------S 410 (515) Q Consensus 360 ~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E------~l~Pli~r~~~-----------~~~~~------~p------~ 410 (515) ... ..+..-|..|++.-...|...+.+|..- ++.-||...+. +..++ -+ + T Consensus 454 ~~n-a~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg 532 (772) T protein:vir:10 454 KGT-ATSGIQEQQQIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTG 532 (772) T ss_pred Ccc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceeccccc Confidence 333 4677889999999999999998776543 34444444331 11000 00 0 Q ss_pred h-----hccc---eee---eehHHHHHHHHHHHHHHHHHHHHHhhcCChHH--------HhcC---CHHHHHHHHHHhcC Q lcl|NC_020414. 411 E-----LVDP---VIV---TGIEALGRMAELDKLANFAQYMSLPQTWPEPA--------QRAI---RWGDYMDWVRGQIS 468 (515) Q Consensus 411 ~-----~~~~---~~v---~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~--------~d~i---d~d~~~~~~a~~~G 468 (515) . ++.. .++ .+-.+-.|.+..+.+. ++++ .++|++ ++.. +.+++++.+-...+ T Consensus 533 ~~~~~NDi~~g~yDv~i~~~p~~~t~r~~~~~~m~---ql~~---~~~P~~~~~~~~~~le~~D~p~~~ei~~~ir~~~~ 606 (772) T protein:vir:10 533 AAYLSNDLLRTRIKVALEDVPSTNSYRGQQLNAMS---EAVK---SMPPQYQAAVLPFLVSLMDVPFKRDVVEAIRAVDQ 606 (772) T ss_pred ccceeccceeeeEEEEeeccccchHHHHHHHHHHH---HHHh---ccChhHHHHHHHHHHhhcCCCChHHHHHHHHHHhc Confidence 0 1111 111 2223334444444444 4333 233432 2222 34567776666655 Q ss_pred CchhccCCHHHHHHHHHH-HHHHHHHHHH----HHHhhh-----h-ccchhhhhh----------ccC Q lcl|NC_020414. 469 AELPFLKSEEEMQQEMAQ-QAQAQQEAML----NEGVAK-----A-VPGVIQQEM----------KEG 515 (515) Q Consensus 469 vp~~~irs~eev~~~rq~-~~~~~q~~~~----~~~~~~-----a-~~~~~~~~~----------~~~ 515 (515) -+ +.++.++..++ .+++.+.++. .+..++ | +.-..++.. +++ T Consensus 607 ~~-----~peq~~~~~~q~~qq~~~~~~~el~~~q~~a~~~~~~A~a~~~~aqa~~~~~~a~~~a~~a 669 (772) T protein:vir:10 607 QQ-----TPEQIQQQIDQAVQDALAKAGNDIKLRELEIKERKADSEISGLNAKAVQIGVQAAFSAMQA 669 (772) T ss_pred cC-----ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 32 22332222211 1111110000 000000 0 000000000 000 No 50 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=98.71 E-value=9.1e-08 Score=59.20 Aligned_cols=431 Identities=12% Similarity=0.062 Sum_probs=166.6 Q ss_pred CCC-ccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhh--cccccCCCCCCccccccccccHHHHHHHHHHHHHHhh Q lcl|NC_020414. 1 MQD-TILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLT--LPYLMNNKGDNETSQNGWQGVGAQATNHLANKLAQVL 77 (515) Q Consensus 1 ~~~-~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~--~P~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~s~l 77 (515) |+. +.-...++-+.+.++...+...+.+...+++++|+=- +|.+...-...-...+..-+-+..+++.++..| T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l---- 76 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQDLGDNTAYYESERRPDAVGVTVPQQMQKLLAHVGYPRLYIDAIAARQ---- 76 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhHHhhhhhcCcHHHHHHHHHhhh---- Confidence 432 2222344656666666665555544444444444221 111100000000001122344455566555544 Q ss_pred cCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCC--- Q lcl|NC_020414. 78 FPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK--PSKG--- 152 (515) Q Consensus 78 tpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~--d~~~--- 152 (515) ++-+ |+.. .+.. ..+ .+......++|.....++.++..++|.+.+++ +++. T Consensus 77 ~~~g---~~~~-~~~~-------------~~~-------~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~ 132 (484) T protein:vir:77 77 ELEG---FRLG-GADK-------------ADE-------QLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDP 132 (484) T ss_pred ccCc---eecC-Ccch-------------hHH-------HHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCccc Confidence 2222 2221 1110 111 12334566899999999999999999987655 3332 Q ss_pred -------cEEEEEcce-EEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeE Q lcl|NC_020414. 153 -------AMSAVPMHH-YVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWK 224 (515) Q Consensus 153 -------~~r~~pl~~-y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~ 224 (515) .+++++-.+ |++.-+..+++...+|.+.-. .......+++|+ ++.-+. T Consensus 133 ~~~~~~~~i~~~~p~~~~~~~D~~~~~~~~a~~~~~~~-------------------~~~~~~~~~~y~-----~~~~~~ 188 (484) T protein:vir:77 133 GVDPEVPIIRVEPPTNLYAQIDPRTRQVMRAIRAIEDE-------------------EGNEVIGATLYL-----PNNTVI 188 (484) T ss_pred ccccccceEEEeccceeEEEecCCCCceEEEEEEEEee-------------------cCCcEEEEEEEe-----cCeEEE Confidence 256665544 445444456666655544321 000011122221 111111 Q ss_pred EEEEeCCe-eecc--cCCcccccCcEEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhccCceeec--- Q lcl|NC_020414. 225 INQSADDI-PVGK--ENRIKAEKLPFIPLTWKRSYGEDWGRPLVED-YSGDLFVIQFLSEAVARGAALMADIKYLIR--- 297 (515) Q Consensus 225 ~~~e~~~~-~i~~--esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~-~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~--- 297 (515) ++ ..++. .... +-+| ..||++.++.+...++.+|+|-... ..+-+..++...-.....++..+.|...+- T Consensus 189 ~~-~~~~~~~~~~~~~~~~--g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~ 265 (484) T protein:vir:77 189 WN-REDGQWVQVANVAHNL--EMVPVIPIPNRTRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVK 265 (484) T ss_pred EE-ecCCceEeeccccCCC--CCcceEEeccccccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCC Confidence 11 11121 1111 2334 4599999998888888999996654 334455666666666666666665543221 Q ss_pred Cccc-cChh---hccCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHH-----HhhccCCCCCCCHH Q lcl|NC_020414. 298 PGSQ-TDVD---HFVNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMM-----ETMTRRDAERVTAV 368 (515) Q Consensus 298 ~~g~-~~~~---~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~-----~~l~~~~~~~~TAt 368 (515) ++-. .... .+.....|.+.....++....++. .++++ .-++.++.-|...... ..+.......-++. T Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~-~~~~e---~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~ 341 (484) T protein:vir:77 266 GEELGVDPETGQTLFDAYLARILAFEDHESKAQQFS-AAELR---NFVDALDALDRKAAAYTGLPPYYLSFSSENPASAE 341 (484) T ss_pred cchhcccccccchhhhhhhhhhcccCCCCceeEeec-CCChH---HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHH Confidence 1100 0000 011111222222222233333332 22333 3344444444333210 01111101112344 Q ss_pred HHH-------HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceee-eehHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 369 EIQ-------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIV-TGIEALGRMAELDKLANFAQYM 440 (515) Q Consensus 369 Ei~-------~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v-~~l~~l~ra~~~~~l~~~~~~v 440 (515) -+. .+++++...+|..+.++.. ++-.+.. ....+....++.++ +...+-.-++.++.+.. T Consensus 342 Al~~~~~~l~~ka~~k~~~f~~~l~~~~~-----l~~~~~~--~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~k----- 409 (484) T protein:vir:77 342 AIRSSESRLVKTVERKNKIFGGAWEQAMR-----VAYKVMN--GGDIPPEYYRMESIWRDPSTPTYAAKADAATK----- 409 (484) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHhC--CCCcccccccceEEecCCCCCCHHHHHHHHHH----- Confidence 333 2446666666666655433 1111111 22233333222211 11111112222222222 Q ss_pred HHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhh---hhccchhhhhhccC Q lcl|NC_020414. 441 SLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVA---KAVPGVIQQEMKEG 515 (515) Q Consensus 441 ~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~---~a~~~~~~~~~~~~ 515 (515) +++.... .+.- +.+...+|.... ..++++++++++.... ++.+.++.+ +..+.+.+.+..++ T Consensus 410 --l~~~g~g---i~s~----et~~~~l~~~~~---~~~e~~~~~~ee~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 474 (484) T protein:vir:77 410 --LYNNGQG---VIPK----ERARIDMGYSIT---EREEMRKWDEEEQAQG-LGLMGTMFGTDPSGGGNPDNPETPEP 474 (484) T ss_pred --HHhccCC---CCCH----HHHHhcCCCChh---HHHHHHHHHHHHHHHH-HHHHhhhccccccCCCCCCCCCcccc Confidence 2221110 1111 122223343211 1233333333322111 111111111 11111111221222 No 51 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=98.64 E-value=1.6e-07 Score=57.84 Aligned_cols=417 Identities=11% Similarity=0.081 Sum_probs=165.6 Q ss_pred CCCccccccccHHHHH-HHHHHHHHhhhhHHHHHHHHHHhhcccccCC-CCC----CccccccccccHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIP-KLWEKFSKKRSPYLDRAKHFAKLTLPYLMNN-KGD----NETSQNGWQGVGAQATNHLANKLA 74 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~-~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~-~~~----~~~~~~~~dst~~~a~~~Laa~l~ 74 (515) |-+++ +-..+. ..+.++.. |. ++.+++.+|..-.--.. .+. .-+.-+..-+-+..++++++..| T Consensus 8 ~~~~~-----~~~~~~~~l~~~~~~-~~---~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l- 77 (485) T protein:vir:10 8 QEEIE-----DPAIARDEMVSAFED-ST---QNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAERQ- 77 (485) T ss_pred CCCCC-----CHHHHHHHHHHHHHH-HH---HHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhhh- Confidence 44444 233333 33333332 22 34455555543321100 010 00111223456666777766655 Q ss_pred HhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC-- Q lcl|NC_020414. 75 QVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG-- 152 (515) Q Consensus 75 s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~-- 152 (515) +|.+ |+.. .+.. ..+ .+...+..++|.....++.++..++|.|.+++-.+. T Consensus 78 ---~~~g---~~~~-~~~~-------------~~~-------~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~ 130 (485) T protein:vir:10 78 ---AVEG---FRFG-DADE-------------ADE-------ELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQ 130 (485) T ss_pred ---cccc---eecC-CCch-------------hHH-------HHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcc Confidence 3322 3221 1111 011 122335568999999999999999999977653221 Q ss_pred ----------cEEEEEcceEEEeeC-CCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCC Q lcl|NC_020414. 153 ----------AMSAVPMHHYVVNRD-TNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEG 221 (515) Q Consensus 153 ----------~~r~~pl~~y~v~~d-~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~ 221 (515) .+++++..+.++..| ..+++...++.+.- . ..+.-..+++|+ ++. T Consensus 131 ~~~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~~~~~~~~~-~------------------~~~~~~~~~~y~-----~~~ 186 (485) T protein:vir:10 131 IDLGWDPNTPIIRVEPPTRMYAEIDPRIGRVSKAIRVAYD-A------------------EGNEIQAATLYT-----PND 186 (485) T ss_pred cccccCCCeeEEEEEccceeEEEEcCCCCceeEEEEEEEe-e------------------CCCeEEEEEEEe-----CCe Confidence 256666555444444 45666655554320 0 001111122222 221 Q ss_pred CeEEEEEeCCeeec--ccCCcccccCcEEEEeeeecCCCccccchHHHH-HHHHHHHHHHHHHHHHHHHHhccCceeec- Q lcl|NC_020414. 222 FWKINQSADDIPVG--KENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDY-SGDLFVIQFLSEAVARGAALMADIKYLIR- 297 (515) Q Consensus 222 ~~~~~~e~~~~~i~--~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~-l~d~k~L~~l~~~~~~~~~~a~~p~~l~~- 297 (515) .+.++..-++-... .+-+| ..||++.+..+...+..||+|=.... .+-+..++...-.....++..+.|...+- T Consensus 187 ~~~~~~~~~~~~~~~~~~~~~--g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G 264 (485) T protein:vir:10 187 IFGWYRVENEWQEWFNNPHGL--GVVPVVPIPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFG 264 (485) T ss_pred EEEEEEcCCceEEeccccCCC--CcccEEEeccccccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhc Confidence 12211111111111 12334 45999999999999999999965543 34456677766666667777776643321 Q ss_pred ---Ccccc---ChhhccCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHH-----HhhccCCCCCCC Q lcl|NC_020414. 298 ---PGSQT---DVDHFVNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMM-----ETMTRRDAERVT 366 (515) Q Consensus 298 ---~~g~~---~~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~-----~~l~~~~~~~~T 366 (515) ++... +...+.....|.+..-..++....++. .++++ ..++.++.-|++.... ..+.......-+ T Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~d~k~~q~~-~~~~~---~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~S 340 (485) T protein:vir:10 265 IKPEEIGVDPETGQTLFDAYLARILAFEDAEGKIQQFS-AAELA---NFTNALDQIAKQVAAYTGLPPQYLSTAADNPAS 340 (485) T ss_pred CCcccccccccccchhhhhcccceeccCCCCceEEeec-ccchH---HHHHHHHHHHHHHhcccCCCHHHhccccCchhH Confidence 11000 001111122233322112233333433 23343 3344444444333211 011111111123 Q ss_pred HHHHH-------HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcc--cee--eeehHHHHHHHHHHHHHH Q lcl|NC_020414. 367 AVEIQ-------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVD--PVI--VTGIEALGRMAELDKLAN 435 (515) Q Consensus 367 AtEi~-------~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~--~~~--v~~l~~l~ra~~~~~l~~ 435 (515) +.-+. .+.+++...+++.+.++.. |+.. +.+ ....+.+..+ +.+ ..+-+.++.|+...+|. T Consensus 341 g~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~-----l~~~-~~~-~~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~- 412 (485) T protein:vir:10 341 AEAIRAAESRLIKKVERKNSIFGGAWEEAMR-----LAYR-MMK-GGDVPPDMLRMETVWRDPSTPTYAAKADAASKLY- 412 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHH-HhC-CCCCcccceeeeEEecCCCCCCHHHHHHHHHHHH- Confidence 43332 3335556666655544432 1111 122 2222323222 222 11222222222222221 Q ss_pred HHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHH-HHHHH-HHHHHHhhhhccchhhhhh- Q lcl|NC_020414. 436 FAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQA-QAQQE-AMLNEGVAKAVPGVIQQEM- 512 (515) Q Consensus 436 ~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~-~~~q~-~~~~~~~~~a~~~~~~~~~- 512 (515) + ... -.+..+.+. +.+|+. ++++++++..++ ++++. .++..+.+. .++.-++.. T Consensus 413 --~-------ag~---~~~s~et~~----~~lg~~------~~~~~~~~~~~ee~~~~~~~~~~~~~~~-~~~~~~~~~~ 469 (485) T protein:vir:10 413 --N-------GGT---GVIPRERAR----KDMGYS------IAEREEMRRWDEEEAAMGLGLIGTMVDP-NPTVPGSPSP 469 (485) T ss_pred --h-------ccc---cCCCHHHHH----HhCCCC------HhHHHHHHHHHHHHHHHHHHHHHHhhcc-CCCCCCCCCc Confidence 1 100 012222222 234543 334444332111 11111 122222221 111111111 Q ss_pred -ccC Q lcl|NC_020414. 513 -KEG 515 (515) Q Consensus 513 -~~~ 515 (515) +++ T Consensus 470 ~~~~ 473 (485) T protein:vir:10 470 APAP 473 (485) T ss_pred cccc Confidence 111 No 52 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=98.63 E-value=1.7e-07 Score=57.66 Aligned_cols=432 Identities=12% Similarity=0.053 Sum_probs=174.7 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccC-CCCCC------ccccccccccHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMN-NKGDN------ETSQNGWQGVGAQATNHLANKL 73 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~-~~~~~------~~~~~~~dst~~~a~~~Laa~l 73 (515) +.-++... +.+.+......|-.....-.++.+.+.+|..-.-.. .-+.+ ...++..-+-+..+++.++..| T Consensus 16 ~~~p~~~~--~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~l 93 (501) T protein:vir:25 16 VEFPEDSM--SREQLGALVADMWRLHISERQWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQNL 93 (501) T ss_pred ccCCcccC--ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHhhh Confidence 33333222 444444434433333333345556666664321100 00100 0011122345555566555543 Q ss_pred HHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCC Q lcl|NC_020414. 74 AQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK--PSK 151 (515) Q Consensus 74 ~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~--d~~ 151 (515) +|.+ |++. |.. ..+.+ ......++|....+++.++..+||.+.+++ +.+ T Consensus 94 ----~~~g---f~~~--d~~---------~~~~l-----------~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~ 144 (501) T protein:vir:25 94 ----SVVG---YRNA--LAK---------ENDPA-----------WEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDE 144 (501) T ss_pred ----cccc---eecC--Ccc---------chHHH-----------HHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCC Confidence 3433 4432 211 11112 233567889999999999999999987655 443 Q ss_pred C-cEEEEEc-ceEEEeeCCCC--CeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEE------EcCCC Q lcl|NC_020414. 152 G-AMSAVPM-HHYVVNRDTNG--DLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQ------YAGEG 221 (515) Q Consensus 152 ~-~~r~~pl-~~y~v~~d~~G--~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~------~~~~~ 221 (515) . .+++++- .-+++-.|+.. ++.-.+|.+....+ .+.. . ...-+....+|+... ....+ T Consensus 145 ~~~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~-------~~~~--~---~~~~y~~~~~~~~~~~~~~~~~~~~~ 212 (501) T protein:vir:25 145 GPVFRTRSPRQILAVYADPSVDAWPQYALETWVAQKD-------AKPH--R---RGVLYDDTYMYELDLGEVVLGDAGGG 212 (501) T ss_pred CCeEEEeccccEEEEEecCCCCcceeEEEEEEeeccc-------cCcc--e---eEEEecCeeEEEEecCceeeeecccc Confidence 3 4666654 44666666543 34444433321111 0000 0 000000111222110 00111 Q ss_pred CeEEEEEeCCee-------ecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCce Q lcl|NC_020414. 222 FWKINQSADDIP-------VGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKY 294 (515) Q Consensus 222 ~~~~~~e~~~~~-------i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~ 294 (515) .+.. ....+.. ....-+| ..||++.+.=+. ..+.+|+|=.+..++-+..++...-..+..++..+.|.. T Consensus 213 ~~~~-~~~~~~~~~~~~~~~~~~~~~--~~vPiv~f~N~~-~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~ 288 (501) T protein:vir:25 213 QATQ-QPVNVREVTDVIEHGATFEGK--PVCPVVRFVNGR-DADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQR 288 (501) T ss_pred cccc-ccccccccccccccccccCCc--cceeeEeccCcc-ccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHH Confidence 1110 0111111 1223344 347888755433 335689997777788888888877777777777777642 Q ss_pred eecCccccChhhccCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHH-HHH-HhhccCCCCCCCHHHHH- Q lcl|NC_020414. 295 LIRPGSQTDVDHFVNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVI-FMM-ETMTRRDAERVTAVEIQ- 371 (515) Q Consensus 295 l~~~~g~~~~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~a-fl~-~~l~~~~~~~~TAtEi~- 371 (515) .+. +-..+.........|.+..-..++....++. .++++.....++.+-..|... ..- ..+.. .....++.-+. T Consensus 289 ~i~-G~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~-~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~-~~~N~Sg~Al~~ 365 (501) T protein:vir:25 289 VIS-GWTGSKAEVLKASALRVWTFEDPEVKAQAFP-PASVEPYNLILEEMLQHVAMVAQISPAQVTG-KMINVSAEALAA 365 (501) T ss_pred HHh-CCCCCccchhhhcccceeccCCCCceEEEec-ccChHHHHHHHHHHHHHHHhhcCCChhhhcc-ccCChHHHHHHH Confidence 221 1111111111223333322111223333332 345554444444444444221 000 00110 01123555443 Q ss_pred ------HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee----eeehHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 372 ------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI----VTGIEALGRMAELDKLANFAQYMS 441 (515) Q Consensus 372 ------~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~----v~~l~~l~ra~~~~~l~~~~~~v~ 441 (515) .+++.|.+.+|..+.++-. .++.-.+..-+....++.+ ..+-+.++.|.-..+|. + T Consensus 366 ~~~~l~~ka~~k~~~f~~~l~~~~r--------l~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~---~--- 431 (501) T protein:vir:25 366 AEANQQRKLAAKRESFGESWEQLLR--------LAAEMDDDPDTAADSGAEVLWRDTEARSFGAVVDGITKLA---S--- 431 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHhCCCccccceeeeEEecCCCCCCHHHHHHHHHHHH---h--- Confidence 3445666666666655422 1121111111222223222 11223333333222222 1 Q ss_pred HhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhh-------ccchhhhhhcc Q lcl|NC_020414. 442 LPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKA-------VPGVIQQEMKE 514 (515) Q Consensus 442 ~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a-------~~~~~~~~~~~ 514 (515) ++++.+ .+ +....|++ ++++++++++++++....++.+.++.. .++...+...+ T Consensus 432 --~gis~e--------t~---~~~~~g~~------~~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (501) T protein:vir:25 432 --AGIPIE--------HL---LSMVPGMT------QQTIQAIKDSLRGGEVKSLVDKLLSNEPAPVPPPPPQAAAQALNE 492 (501) T ss_pred --cCCCHH--------HH---HHHcCCCC------HHHHHHHHHHHHHHhHHHHHHHhhccCcCCCCCCCCCCCcccccc Confidence 123322 12 22344654 566666665554443333333222211 11111122222 Q ss_pred C Q lcl|NC_020414. 515 G 515 (515) Q Consensus 515 ~ 515 (515) | T Consensus 493 ~ 493 (501) T protein:vir:25 493 G 493 (501) T ss_pred c Confidence 2 No 53 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=98.45 E-value=6.2e-07 Score=54.63 Aligned_cols=462 Identities=11% Similarity=0.094 Sum_probs=211.6 Q ss_pred CCCccccccccHHHH---HHHHHHHHHhhhhHHHHHHHHHHhhcccccC----CCCCCccccccccccHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKI---PKLWEKFSKKRSPYLDRAKHFAKLTLPYLMN----NKGDNETSQNGWQGVGAQATNHLANKL 73 (515) Q Consensus 1 ~~~~~~~~~~~~~~l---~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~----~~~~~~~~~~~~dst~~~a~~~Laa~l 73 (515) |+----|++..+..+ .+-|=....+ .-...++.+.+|..-.-.. ..++ .+..++++.|..-+++++.-| T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~~~D~--~RlaaY~ly~d~y~n~~~el~~il~G~--dr~~~~~ps~r~~V~~~~~~L 76 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVDENDK--NRVRAYDLYENIYLNSAETLKLVLRGD--DSVPILMPSGRKIVEAVHRFL 76 (563) T ss_pred CCccccccCCCcccccccccccCCHHHH--HHHHHHHHHHHhhcCchhhhhhhcCCC--ceeeeccchHHHHHHHHHHhc Confidence 666665666555522 2222111111 1344445555554432111 1122 233468888888888855433 Q ss_pred HHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCC Q lcl|NC_020414. 74 AQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK--PSK 151 (515) Q Consensus 74 ~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~--d~~ 151 (515) +....|+- .+++.+ .+. ... +++.+.....+-|+.....++-.+..+.|-+++++ |++ T Consensus 77 -----g~~~~~~V-e~~~~d------e~~-~~a-------vq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~ 136 (563) T protein:vir:74 77 -----GVGFDYLV-EPDMGD------EGI-RQS-------LNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPN 136 (563) T ss_pred -----CCCcEEec-CccccC------cch-HHH-------HHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccc Confidence 44445542 222211 011 111 45556667888999999999999999999998776 432 Q ss_pred ----CcEEEEEc--ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhcc-CCCccc--EEEEEEEE-E---- Q lcl|NC_020414. 152 ----GAMSAVPM--HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKK-CKEDDN--VKLYTHAQ-Y---- 217 (515) Q Consensus 152 ----~~~r~~pl--~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~-~~~~~~--v~v~~~v~-~---- 217 (515) .+.++.++ +.|+-..|+ +.|-.+|-..-.+...++......+-..++... .++... .++.+-++ + T Consensus 137 K~~g~R~rv~~vDP~~~fp~~dp-d~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~ 215 (563) T protein:vir:74 137 KKAGERISVDEVDPRQIFLIEDG-STVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGN 215 (563) T ss_pred cccCCCceEeecCCceeeeccCC-CCcccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhcccc Confidence 24666665 566666666 446555533333332233333322211111111 111111 11111111 1 Q ss_pred -cCCCCeEE--EEEeCCee----------ecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 218 -AGEGFWKI--NQSADDIP----------VGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVAR 284 (515) Q Consensus 218 -~~~~~~~~--~~e~~~~~----------i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~ 284 (515) +..++-+. --+.++.. +..--|+ .||++++=...++++||+|-..+.+.-|+.||.- .... T Consensus 216 wd~r~~~~~~~~~~~~~~~~~~~d~e~~~LP~pi~~----iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~--~Td~ 289 (563) T protein:vir:74 216 WDDRGAISDEQARRKEQVRSAQHDEEEEELPEPISQ----LPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQS--LTDE 289 (563) T ss_pred ccccCccchhhhcccchhhhhhhhchhhhccccccC----ccEEEcCCCCCcccccchhhHHHHHHHHHHHhhh--hhHH Confidence 22222111 11222211 1111223 5888877788899999999999999999999954 4444 Q ss_pred HHHHhccCceeecCccccChhhcc------CCCCcceec--CCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020414. 285 GAALMADIKYLIRPGSQTDVDHFV------NSGTGEVIT--GVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMET 356 (515) Q Consensus 285 ~~~~a~~p~~l~~~~g~~~~~~~~------~~~~g~~~~--g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~ 356 (515) ....+....-++.=++....+.-. +-++|.++. +....-....++..++++.++..|+++..|. .+-.. T Consensus 290 s~i~~~tG~pi~vl~~~~p~d~~~g~~~~w~vgpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~era---l~~~s 366 (563) T protein:vir:74 290 DATIVFQGLGMYVTNASAPVDPNTGELTDWNIGPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKG---IAEGS 366 (563) T ss_pred HHHHHhcCCCeEEeccccccccccccccccccCCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHHH---HHhhc Confidence 445555542222222222111100 012344422 1111122334555677888888888776532 11100 Q ss_pred ------hccCCCCCC---CHHHH-----HHHHHHHHHHhhhhHHHHHHHH---HHHHHHHH-HHhcCCCCChhhc----- Q lcl|NC_020414. 357 ------MTRRDAERV---TAVEI-----QRDALEIEQNMGGVYSLFAMTM---QTPIAMWG-LQEAGDSFTSELV----- 413 (515) Q Consensus 357 ------l~~~~~~~~---TAtEi-----~~r~~E~~~~LGpv~~rl~~E~---l~Pli~r~-~~~~~~~~p~~~~----- 413 (515) +...|..+. .|=|+ -.+.+||+..|=.++-++..++ +.|..+++ ..+.++..-+..- T Consensus 367 ~tPavA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~ 446 (563) T protein:vir:74 367 GTPEVAIGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNEC 446 (563) T ss_pred cCcceeecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCce Confidence 111122221 22222 3445555554444444443332 44555554 3455554322221 Q ss_pred ccee-eeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHH Q lcl|NC_020414. 414 DPVI-VTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQ 492 (515) Q Consensus 414 ~~~~-v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q 492 (515) .+++ -.+.-|.-+++-.+++..+.+ ...|....+++.+.++ |.|-. --++|++++...+-.. T Consensus 447 ~v~ivf~p~~P~d~~~vv~~~~tl~~------------aGiiSretAv~~L~~~-g~~~p--dae~e~~~ie~~~i~~-- 509 (563) T protein:vir:74 447 SVVCIFADPMPVNKTQVTQDTLLLQQ------------AHLILRKMAVAKLRSI-GWEYP--EVDDQGNALTDDDIAD-- 509 (563) T ss_pred EEEEEeCCCCCccHHHHHHHHHHHHH------------cCchhHHHHHHHHHhC-CCCCC--cHHHHHhhcCHHHHHH-- Confidence 1222 234556666666665553332 1234566777777766 54321 1134444443332222 Q ss_pred HHHHHHHhhhhccch--------hh--hhhccC Q lcl|NC_020414. 493 EAMLNEGVAKAVPGV--------IQ--QEMKEG 515 (515) Q Consensus 493 ~~~~~~~~~~a~~~~--------~~--~~~~~~ 515 (515) +++++|-+.++.+ ++ +.--|| T Consensus 510 --~~~a~a~ad~~~~~~a~~~~g~~~~~~dd~g 540 (563) T protein:vir:74 510 --MLLAEAEADASLGLSAMDNGGAGEQQFDDQG 540 (563) T ss_pred --HHHHHhhccCcccceecccCCCCcccccccC Confidence 1122222222222 11 111122 No 54 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=98.41 E-value=7.7e-07 Score=54.11 Aligned_cols=422 Identities=12% Similarity=0.047 Sum_probs=163.3 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccccc-CCCCCCc----cccccccccHHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLM-NNKGDNE----TSQNGWQGVGAQATNHLANKLAQ 75 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~-~~~~~~~----~~~~~~dst~~~a~~~Laa~l~s 75 (515) +.+++ +-..+..++-.-...+ .++.+.+.+|..-.-- ..-+.+. +..+...+-+..+++.++..| T Consensus 8 ~~e~~-----~~~~~~~~l~~~~~~~---~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l-- 77 (486) T protein:vir:42 8 MEEIE-----DPAVVREEMISAFEDA---SKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQ-- 77 (486) T ss_pred CCCcc-----cHHHHHHHHHHHHHHH---HHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHHhhh-- Confidence 66776 3333443332222223 2444445555432110 0001000 011223345556666665544 Q ss_pred hhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--CC-- Q lcl|NC_020414. 76 VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKP--SK-- 151 (515) Q Consensus 76 ~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d--~~-- 151 (515) +|.+ |++.-.+. ....+ ...+..++|.....++.++..++|.+.+++- +. T Consensus 78 --~~~g---~~~~~~~~----------~~~~~-----------~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~ 131 (486) T protein:vir:42 78 --AVEG---FRLGDADE----------ADEEL-----------WQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQL 131 (486) T ss_pred --cccc---eecCCCch----------hHHHH-----------HHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCccc Confidence 3433 22221111 01112 2334568899999999999999999877663 21 Q ss_pred ------C--cEEEEEcce-EEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCC Q lcl|NC_020414. 152 ------G--AMSAVPMHH-YVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGF 222 (515) Q Consensus 152 ------~--~~r~~pl~~-y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~ 222 (515) . .+++++-.+ |++.-+..+++...+|.+.- .+. +.-..+++| .++.. T Consensus 132 ~~~~~~~~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~------~~~-------------~~~~~~~~y-----~~~~~ 187 (486) T protein:vir:42 132 DLGWDQNVPIIRVEPPTRMHAEIDPRINRVSKAIRVAYD------KEG-------------NEIQAATLY-----TPMET 187 (486) T ss_pred ccccCCCeeEEEEecccceEEEEeCCCCCeEEEEEEEEe------cCC-------------CeEEEEEEE-----cCCcE Confidence 1 245565444 55554467777776665531 000 000112222 12211 Q ss_pred eEEEEEeCCee-ec--ccCCcccccCcEEEEeeeecCCCccccchHHHH-HHHHHHHHHHHHHHHHHHHHhccCceeec- Q lcl|NC_020414. 223 WKINQSADDIP-VG--KENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDY-SGDLFVIQFLSEAVARGAALMADIKYLIR- 297 (515) Q Consensus 223 ~~~~~e~~~~~-i~--~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~-l~d~k~L~~l~~~~~~~~~~a~~p~~l~~- 297 (515) +. |...++.. +. .+-+| ..+|++.++.+...+..+|+|=.... .+-+-.++...-.....++..+.|...+. T Consensus 188 ~~-~~~~~~~~~~~~~~~h~~--g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G 264 (486) T protein:vir:42 188 IG-WFRADGEWAEWFNVPHGL--GVVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFG 264 (486) T ss_pred EE-EEecCCcEEeecceecCC--CCceEEEeccccccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhc Confidence 11 11122221 11 12234 45999999998888999999966543 34455666665555566666665543321 Q ss_pred --CccccC----hhhccCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH-----hhccCCCCCCC Q lcl|NC_020414. 298 --PGSQTD----VDHFVNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME-----TMTRRDAERVT 366 (515) Q Consensus 298 --~~g~~~----~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~-----~l~~~~~~~~T 366 (515) ++.... .........|.+.....+++...++. .++++ .-++.++.-|......- .+.......-+ T Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~-~~~~e---~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~S 340 (486) T protein:vir:42 265 IKPEEIGVDSETGQTLFDAYLARILAFEDAEGKIQQFS-AAELA---NFTNALDQIAKQVAAYTGLPPQYLSTAADNPAS 340 (486) T ss_pred CCccccccccccccchhhhhhchhcccCCCCceEEeec-ccCHH---HHHHHHHHHHHHHhcccCCCHHHhccccCchhH Confidence 110000 00011111222221111233333332 22333 34444554444332110 11111111123 Q ss_pred HHHHHH-------HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeeeh--HHHHHHHHHHHHHHHH Q lcl|NC_020414. 367 AVEIQR-------DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTGI--EALGRMAELDKLANFA 437 (515) Q Consensus 367 AtEi~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~l--~~l~ra~~~~~l~~~~ 437 (515) +.-++. +++++...+++.+.++-. ++.++.. ....+.+..++. ++.- .+-..++.++.+..+. T Consensus 341 g~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~-----l~~~~~~--~~~~~~d~~~i~-v~w~~~~~~s~~~~ad~~~kl~ 412 (486) T protein:vir:42 341 AEAIRAAESRLIKKVERKNLMFGGAWEEAMR-----IAYRIMK--GGDVPPDMLRME-TVWRDPSTPTYAAKADAATKLY 412 (486) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHhc--CCCccccceeee-EEecCCCCCCHHHHHHHHHHHH Confidence 443332 335556666665555432 1212222 222233332222 2221 1222222223333222 Q ss_pred HHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhh---hccchh------ Q lcl|NC_020414. 438 QYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAK---AVPGVI------ 508 (515) Q Consensus 438 ~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~---a~~~~~------ 508 (515) + ....+ +. .+++ ...+|+... ..+|++++++++....+. .+.+..++ ..+... T Consensus 413 ~---~~~g~-------~s-~et~---~~~lg~~~d---~~~e~~~~~~e~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 474 (486) T protein:vir:42 413 G---NGQGV-------IP-RERA---RIDMGYSVK---EREEMRRWDEEEAAMGLG-LLGTMVDADPTVPGSPSPTAPPK 474 (486) T ss_pred h---cccCC-------CC-HHHH---HhcCCCChh---HHHHHHHHHHHHHHHHHH-HHHHhhcCCCCCCCCCCCCCCCC Confidence 2 11111 11 1111 123443211 113444443333222111 11111110 000011 Q ss_pred hhhh--ccC Q lcl|NC_020414. 509 QQEM--KEG 515 (515) Q Consensus 509 ~~~~--~~~ 515 (515) +++. +.| T Consensus 475 ~~~~~~~~~ 483 (486) T protein:vir:42 475 PQPAIESSG 483 (486) T ss_pred CCcccCCCC Confidence 1111 111 No 55 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.32 E-value=1.3e-06 Score=52.79 Aligned_cols=425 Identities=10% Similarity=0.023 Sum_probs=178.6 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCC-CCC--cccc--ccccccHHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNK-GDN--ETSQ--NGWQGVGAQATNHLANKLAQ 75 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~-~~~--~~~~--~~~dst~~~a~~~Laa~l~s 75 (515) -+.|--+.|++...... .+.|..+...+.++.+++.+|..-....+. +.. ...+ +..-+-+..+++.||..|. T Consensus 9 ~~~~~~~~~l~~~e~~~-i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~iVd~~a~rl~- 86 (504) T protein:vir:99 9 SKFTFRIPELNDDVVDK-VNGLYQQLVDRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKAVDTLARRCN- 86 (504) T ss_pred cccccccCCCCHHHHHH-HHHHHHHHHHHhHHHHHHHHHHhccccchhccccccHHHHHHhhccCcHHHHHHHHHhhhc- Confidence 55666666777665221 333333333444566667666543211111 110 0001 1233455666777766542 Q ss_pred hhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCC- Q lcl|NC_020414. 76 VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK--PSKG- 152 (515) Q Consensus 76 ~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~--d~~~- 152 (515) .-+ |++. +... .... +++....++|....+++.++..+||.+.+++ +.+. T Consensus 87 ---~~G---f~~~--d~~~--------~~~~-----------l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~ 139 (504) T protein:vir:99 87 ---LES---FVWP--DGDY--------GSIG-----------GPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGE 139 (504) T ss_pred ---cce---eeCC--CCCh--------hhHH-----------HHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCC Confidence 222 2222 1100 0111 2233566899999999999999999998766 3332 Q ss_pred ---cEEEEEcc-eEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEE Q lcl|NC_020414. 153 ---AMSAVPMH-HYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQS 228 (515) Q Consensus 153 ---~~r~~pl~-~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e 228 (515) .+++++-. -|++--+..+++...++..... .......+++|. ++ ..+++. T Consensus 140 ~~~~I~~~sP~~~~~iyD~~~~~~~~a~~~~~~d-------------------~~g~~~~~~~y~-----~~--~~~~~~ 193 (504) T protein:vir:99 140 PDSLIHVKSAMQATGEWNSRRNAMDSLLSITSRD-------------------AEGHPTGIALYE-----DG--VTVTAD 193 (504) T ss_pred ceeEEEEeccceeEEEEeCCCCceeEEEEEEEec-------------------CCCeEEEEEEEc-----CC--cEEEEE Confidence 25666544 4555444455555444322100 000011233332 11 112222 Q ss_pred eC--CeeecccCCcccccCcEEEEeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHhccCceee--------- Q lcl|NC_020414. 229 AD--DIPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVE-DYSGDLFVIQFLSEAVARGAALMADIKYLI--------- 296 (515) Q Consensus 229 ~~--~~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~-~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~--------- 296 (515) .+ +........+.. .+|++.+..+...++.||+|-.. ..++-+..++...-..+..+++.+.|-..+ T Consensus 194 ~~~~~~~~~~~~~~~~-gvPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~~~~ 272 (504) T protein:vir:99 194 MDDDGDWHADVRTHKL-GVPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGADAKNFR 272 (504) T ss_pred EcCCceeeeccccCCC-CcceEEecccccCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccCCccccc Confidence 21 111111112221 27999998888888999999543 556777888888878788888777764222 Q ss_pred cCccccChhhccCCCCcce--ecCCccc-------ccccccCCccchHHHHHHHHHHHHHHHHHHHHH-----hhc-cCC Q lcl|NC_020414. 297 RPGSQTDVDHFVNSGTGEV--ITGVEED-------IHIVQLGKYADLTPISAVLEVYTRRIGVIFMME-----TMT-RRD 361 (515) Q Consensus 297 ~~~g~~~~~~~~~~~~g~~--~~g~~~~-------v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~-----~l~-~~~ 361 (515) ..+|. +...-....+.+ ++...+. +..-++ ..++++... +.++.-|....+.. .|. ..+ T Consensus 273 ~~d~~--~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~-~~~~l~~~~---~~l~~~i~~~a~~t~~P~~~lG~~~~ 346 (504) T protein:vir:99 273 NKDGS--MKPAWQIALARVFALPDDEDEPDAARARADVKQF-PASSPQPHI---EMLEQIAMMFSGETSIPVESLGFSNR 346 (504) T ss_pred ccccc--ccchhhhhhhhhhcCCCccccccccCccceeeec-CCCChHHHH---HHHHHHHHHHHhhhCCCHHHhccccc Confidence 01111 111101111111 1211111 111122 123444333 33333332222111 111 111 Q ss_pred CCCCCHHHH-------HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee----eeehHHHHHHHHH Q lcl|NC_020414. 362 AERVTAVEI-------QRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI----VTGIEALGRMAEL 430 (515) Q Consensus 362 ~~~~TAtEi-------~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~----v~~l~~l~ra~~~ 430 (515) ...-+|.-| ..++++|.+.+|..+.++..- +. .+.+.....+.+..++.+ ..+.+.+++|... T Consensus 347 ~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rl-----a~-~~~~~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~ 420 (504) T protein:vir:99 347 ANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIR-----AL-AIKNGLDRIPPEWKTIDSKFRSPLYLSKAAQADAG 420 (504) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HH-HHhcCCCccccccccceeEecCCCccCHHHHHHHH Confidence 112244433 334456666666666554431 11 123334455555544332 2233333333333 Q ss_pred HHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhh---hcc-- Q lcl|NC_020414. 431 DKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAK---AVP-- 505 (515) Q Consensus 431 ~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~---a~~-- 505 (515) .+| .+.... .+.+ . +.+.+.+|+ +++|++.+.+.+++++....+.+.+.+ +.+ T Consensus 421 ~Kl---~~ag~~--l~~~-------~----~~l~~~lg~------~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~ 478 (504) T protein:vir:99 421 AKM---LGAGPE--WLKE-------T----EVGLELLGL------TPQQAKRALAERRRASSVSIIEALNRRQQEAATAG 478 (504) T ss_pred HHH---Hhhccc--cccc-------h----HHHHhhcCC------CHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCC Confidence 332 221000 0000 1 112234455 355666554443332222222221111 111 Q ss_pred ----------------chhhhhhccC Q lcl|NC_020414. 506 ----------------GVIQQEMKEG 515 (515) Q Consensus 506 ----------------~~~~~~~~~~ 515 (515) ...+.+.++| T Consensus 479 ~~~~~~~~e~a~~~~~~~~~~p~~~~ 504 (504) T protein:vir:99 479 EDQDQGAGEPPANEPPAALGRPTLVG 504 (504) T ss_pred CCCCcCCCCCCCCCCCccCCCcccCC Confidence 1112223333 No 56 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=98.32 E-value=1.3e-06 Score=52.77 Aligned_cols=413 Identities=9% Similarity=0.029 Sum_probs=186.9 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc-----ccCCCCCCccccccccccHHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY-----LMNNKGDNETSQNGWQGVGAQATNHLANKLAQ 75 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~s 75 (515) |-+.+ ..+.+.|.+..+..... ..+++++.+|..-. +-.. .......|+..+.+...++.+++-|++ T Consensus 11 ~p~d~---~~~~~~l~~~i~~~~~~----~~r~~~~~~yy~g~~~i~~~~~~-~~~~~~~ki~~n~~~~ivd~~~~~l~g 82 (453) T protein:vir:39 11 FPKDE---PITNEVVTKFMEKHRLE----VARYEYLKNMYRGIMAIDAEPTK-DLWKPDNRLTVNFTKYIVDTFTGYFNG 82 (453) T ss_pred cCCCC---CCCHHHHHHHHHHHHHH----HHHHHHHHHHhhccCchhcCCCc-cccCccceeecchHHHHHHHHhhhhcc Confidence 33322 23666666666655433 33455555554321 1111 111122345566777778777776643 Q ss_pred hhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCCc Q lcl|NC_020414. 76 VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK--PSKGA 153 (515) Q Consensus 76 ~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~--d~~~~ 153 (515) -| +.++..++. .. ..+...+..++|.....++.++..++|.+.+++ |.++. T Consensus 83 --~~-----~~~~~~d~~-------------~~-------~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~ 135 (453) T protein:vir:39 83 --IP-----VKKSHSDKE-------------TL-------SKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEETQ 135 (453) T ss_pred --cC-----ceeccCChH-------------HH-------HHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCCc Confidence 11 222322211 11 234455777899999999999999999987654 55544 Q ss_pred E--EEEEc-ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeC Q lcl|NC_020414. 154 M--SAVPM-HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSAD 230 (515) Q Consensus 154 ~--r~~pl-~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~ 230 (515) + ++++- .-|++.-|..++....+.++... .+....+++|+ ++. .+++..+ T Consensus 136 ~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~--------------------~~~~~~~~~yt-----~~~--i~~~~~~ 188 (453) T protein:vir:39 136 TNVIYNTPENMFMVYDDTIKQEPLFAVRYGYD--------------------DDYKLYGEVYT-----KET--TYALNGT 188 (453) T ss_pred eEEEEEcccceEEEecCCCCCeEEEEEEEEEe--------------------CCeEEEEEEEe-----CCe--EEEEEec Confidence 4 45544 34555555555544444443311 00011223322 221 1122222 Q ss_pred C--eeecc--cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhh Q lcl|NC_020414. 231 D--IPVGK--ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDH 306 (515) Q Consensus 231 ~--~~i~~--esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~ 306 (515) + -.+.. +-+| ..||++.++. +.+|+|=.+...+-+-.++.+.-.....++....|.+.+. +.....+. T Consensus 189 ~~~~~~~~~~~~~~--g~vPvv~~~n-----~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~-g~~~~~~~ 260 (453) T protein:vir:39 189 MGFYNMTEQAPNPF--DDLPVVEFYF-----NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFL-GAAVEEED 260 (453) T ss_pred CCceeeecccccCC--CceeEEEecC-----CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeee-cCCCCchh Confidence 1 12221 2234 3588877653 4579998888999999999988888888888888865543 22222222 Q ss_pred ccCCCC-cce-ecC-----CcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHH------- Q lcl|NC_020414. 307 FVNSGT-GEV-ITG-----VEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQR------- 372 (515) Q Consensus 307 ~~~~~~-g~~-~~g-----~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~------- 372 (515) +..... +.+ +++ ..+++..+. ...+.+.....++.++..|...-..-.+....-...|+..+.. T Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~~lt--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ 338 (453) T protein:vir:39 261 LKNIRSNRVINYYGESSEAKNVDVKFLE--KPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSN 338 (453) T ss_pred hhhhhhcceeeecCCCCCCCCCceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCCcccccccccCChHHHHHHHHHHHHH Confidence 221111 111 121 112233332 3345677777777777766443211011111112345555433 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceee--eehHHHHHHHHHHHHHHHHHHHHHhhcCChHH Q lcl|NC_020414. 373 DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIV--TGIEALGRMAELDKLANFAQYMSLPQTWPEPA 450 (515) Q Consensus 373 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v--~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~ 450 (515) ++.++...+|..+.++.. ++..++...+-..-...+.+.+- .+.+-++.+ +.+..+ ++ T Consensus 339 ka~~~~~~~~~~l~~~~~-----li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a---~~~~kl-------~g----- 398 (453) T protein:vir:39 339 LALSFQRKFQSSLNSRYK-----LYCELSTNVSNKEAWKDIEYTFTRNEPKDIKEQA---ETANIL-------MG----- 398 (453) T ss_pred HHHHHHHHHHHHHHHHHH-----HHHHHHhccCCccccccceEEeCCCCCcCHHHHH---HHHHHH-------hc----- Confidence 334444555554444322 11112222211111122333331 122222222 222222 11 Q ss_pred HhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhcc Q lcl|NC_020414. 451 QRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKE 514 (515) Q Consensus 451 ~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~ 514 (515) .+....++..+ -+++ -.++|++.+.++.++..+..+..++...-..+......+| T Consensus 399 --~is~et~l~~l---~~v~----D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 399 --ITSQETALSVI---SVIP----DVQAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVPETNEE 453 (453) T ss_pred --cCChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCcCCC Confidence 12222223222 1122 1356777766655544443332222222111122222222 No 57 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.24 E-value=2.1e-06 Score=51.66 Aligned_cols=430 Identities=12% Similarity=0.084 Sum_probs=172.5 Q ss_pred CCCccccccccHHHHH-HHHHHHHHhhhhHHHHHHHHHHhhccccc---CC-CCCCccccccccccHHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIP-KLWEKFSKKRSPYLDRAKHFAKLTLPYLM---NN-KGDNETSQNGWQGVGAQATNHLANKLAQ 75 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~-~r~~~lk~~R~~~e~~w~e~~~~~~P~~~---~~-~~~~~~~~~~~dst~~~a~~~Laa~l~s 75 (515) +....+.. +-+.+. ...-.+...+-.....|+++|+=--+-+. .. .+.....+++--+.+...++.+|+-|.+ T Consensus 13 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k~i~~~~a~~l~~ 90 (496) T protein:vir:38 13 MRRMGLLK--ALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPKVTAKYMSKLLFN 90 (496) T ss_pred HHHhccch--hhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHHHHHHHHhhhhhC Confidence 11111110 111111 01111112222345556666532111111 01 1111112223335566667766654432 Q ss_pred hhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCCc Q lcl|NC_020414. 76 VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSKGA 153 (515) Q Consensus 76 ~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~~~ 153 (515) -. |+ ++.++. ...++|. ..+..++|...+.++..+...+|.+.+ |.|.+.. T Consensus 91 ~p--~~-----i~~~d~-------------~~~e~l~-------~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~ 143 (496) T protein:vir:38 91 EK--VK-----INIDDK-------------AAEEFVL-------NVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKN 143 (496) T ss_pred Cc--ce-----EeeCCh-------------HHHHHHH-------HHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCc Confidence 21 11 232331 2233333 345668899999999999999999876 4565544 Q ss_pred --EEEEEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCC---eEEEEE Q lcl|NC_020414. 154 --MSAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGF---WKINQS 228 (515) Q Consensus 154 --~r~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~---~~~~~e 228 (515) +.++|-.+++--.+..|.+..+.....++.+ +.....++++ +..+..+ +.+|.. T Consensus 144 ~~i~~v~~~~~~P~~~~~~~~~~~~f~~~~~~~------------------~~~y~~le~h---~~~~~~~~I~~~~y~~ 202 (496) T protein:vir:38 144 VKVSFATADCMYPLSNDSENVDECVIANSFHKN------------------NKYYTLLEWN---EWQGDVYTVTTELYQS 202 (496) T ss_pred EEEEEEcccceEEEEecCCcEEEEEEEEEEEeC------------------CeEEEEEEEE---EEeCceEEEEEEEEec Confidence 5567877776444446777654433333210 1111111111 1111111 111211 Q ss_pred eCCeee-------------cccCCccc-ccCcEEEEe----eeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020414. 229 ADDIPV-------------GKENRIKA-EKLPFIPLT----WKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMA 290 (515) Q Consensus 229 ~~~~~i-------------~~esgy~~-~~~P~~~~R----w~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~ 290 (515) .++..+ ..+..|.. ...||+..+ .+...++.||+|-..++++-+..|+..--......+. . T Consensus 203 ~~~~~~g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~-~ 281 (496) T protein:vir:38 203 DDPNELGTKVSLTLLFDDIEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-G 281 (496) T ss_pred CCccccCccccccccccccccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhh-c Confidence 121111 11111211 123343332 3446678899999999999999998877666665544 5 Q ss_pred cCceeecCccccChhhccCCCC--------cce--ecCCccc-ccccccCCccch--HHHHHHHHHHHHHHHHHH-H-HH Q lcl|NC_020414. 291 DIKYLIRPGSQTDVDHFVNSGT--------GEV--ITGVEED-IHIVQLGKYADL--TPISAVLEVYTRRIGVIF-M-ME 355 (515) Q Consensus 291 ~p~~l~~~~g~~~~~~~~~~~~--------g~~--~~g~~~~-v~~~~~~~~~~l--~~~~~~i~~~~~rI~~af-l-~~ 355 (515) ++.+.++ +.++....-..+.. ..+ +.+...+ ...++. ...++ ..-...++.+.+.|...- + .. T Consensus 282 ~~~i~v~-~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~-~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~ 359 (496) T protein:vir:38 282 KKKVLVP-SSFVKTAVNLDGSTTQYFDSTDEAFFLYQGDQDDNGKAIKD-ISVEIRSTEFIESINAMLRIYAMQVGLSAG 359 (496) T ss_pred ccceecc-hHHhhccCCCCCccccCCCCccceEEEeecCCCccccccee-eccccCHHHHHHHHHHHHHHHHHhhCCChh Confidence 5565553 33332211000000 001 1111111 111110 01122 222333444434333221 0 11 Q ss_pred hhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH-----hc--CCCCChhhccceee--eehHHHHH Q lcl|NC_020414. 356 TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ-----EA--GDSFTSELVDPVIV--TGIEALGR 426 (515) Q Consensus 356 ~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~-----~~--~~~~p~~~~~~~~v--~~l~~l~r 426 (515) ++....+...||+||..+.+...+...- ..+.....+..+++-++. .. +...+...+.+.+- .+.+.... T Consensus 360 ~f~~~~~g~~tAtei~~~~~~l~~~~~~-~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~ 438 (496) T protein:vir:38 360 TFTFDENGLKTATEVVSEKSETYQTKNS-HSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTT 438 (496) T ss_pred hcCCCccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHH Confidence 2222233456999998877777665443 444444455555443321 11 11122222333321 12222222 Q ss_pred HHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_020414. 427 MAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPG 506 (515) Q Consensus 427 a~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~ 506 (515) ++...++ ++ +++ +....+ +....|+ |++|++++.++.++.+ .++ ...-..+ T Consensus 439 ~~~~~~~------~~--~Gi-------iS~et~---l~~~~~~------~d~ea~~el~ri~~E~-~~~----~~~~d~~ 489 (496) T protein:vir:38 439 INRYTNA------KN--QGM-------IPLKIA---LQRAWNI------TEAEADEWAEMLAKEK-QAE----MPNNDMN 489 (496) T ss_pred HHHHHHH------Hh--cCC-------CCHHHH---HHhcCCC------ChHHHHHHHHHHHHhh-hcc----Ccccccc Confidence 2222221 11 111 112222 2233343 4555544433222111 111 1111112 Q ss_pred hhhhhhc Q lcl|NC_020414. 507 VIQQEMK 513 (515) Q Consensus 507 ~~~~~~~ 513 (515) ..+++-+ T Consensus 490 ~~~~~~e 496 (496) T protein:vir:38 490 GIFGEEE 496 (496) T ss_pred CCCCCCC Confidence 3333333 No 58 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=98.23 E-value=2.3e-06 Score=51.54 Aligned_cols=427 Identities=10% Similarity=0.027 Sum_probs=189.1 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcc-----cccCCCCC-CccccccccccHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLP-----YLMNNKGD-NETSQNGWQGVGAQATNHLANKLA 74 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P-----~~~~~~~~-~~~~~~~~dst~~~a~~~Laa~l~ 74 (515) |-.++...-.+-+.+.+..+.....+.+ +++++.+|..- .......+ .....|+..+.+...++..++-|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~---r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:96 31 YDGTESDLLQNVNEVSKYIEHHMDYQRP---RLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred cchhhhhhhccHHHHHHHHHHHHHhhHH---HHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHHHHHHhhhc Confidence 4444444444555555555554444444 44555555432 11111111 112234556677777777765544 Q ss_pred HhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC Q lcl|NC_020414. 75 QVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSKG 152 (515) Q Consensus 75 s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~~ 152 (515) + -|+. ++.++.. +. ..+...+..++|.....++.++..++|.+.+ |.|+++ T Consensus 108 g--~p~~-----~~~~~~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~ 160 (511) T protein:vir:96 108 G--NPIQ-----YQDDDKD-------------VL-------EAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (511) T ss_pred c--CCce-----eecCchH-------------HH-------HHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCC Confidence 3 1211 2333221 11 2344556678999999999999999999865 556665 Q ss_pred cEE--EEEc-ceEEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEE Q lcl|NC_020414. 153 AMS--AVPM-HHYVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQS 228 (515) Q Consensus 153 ~~r--~~pl-~~y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e 228 (515) .++ +++. .-|++.-|. .+++...+|.+.....+ ....-++++.-.+.++..+.+... T Consensus 161 ~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d-------------------~~~~~~~~~~~iyt~~~i~~~~~~ 221 (511) T protein:vir:96 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID-------------------KTDEDEVFTVDLFTSHGVYRYLTS 221 (511) T ss_pred ceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc-------------------ccccceEEEEEEEeCCcEEEEEec Confidence 544 4443 445554443 46666666665432110 001112233322333332222111 Q ss_pred eCCe-----eecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccC Q lcl|NC_020414. 229 ADDI-----PVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTD 303 (515) Q Consensus 229 ~~~~-----~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~ 303 (515) -++. ........+-..+|++.++- +.+|+|=.+..++-+..++.+.-..........+|.+.+.-....+ T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~ 296 (511) T protein:vir:96 222 RTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD 296 (511) T ss_pred CCCcccccccccccccccCCceeeEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCC Confidence 1111 11111111113578877653 4579999999999999999888888888887777765543322233 Q ss_pred hhhccCCCCccee--------------cCCcccccccccCCccchHHHHHHHHHHHHHHHHHHH-HHhhccCCCCCCCHH Q lcl|NC_020414. 304 VDHFVNSGTGEVI--------------TGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-METMTRRDAERVTAV 368 (515) Q Consensus 304 ~~~~~~~~~g~~~--------------~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl-~~~l~~~~~~~~TAt 368 (515) ...+..-..+..+ .+...++..+ ....+.+.....++.+.+.|...-. .+.....-+...|+. T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~ 374 (511) T protein:vir:96 297 PVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYI--YKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGE 374 (511) T ss_pred chhhcccccccceecccccccccccccCCCCcceeEE--eecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHH Confidence 2222211111111 0111222222 2334556667777777776644321 110000111235666 Q ss_pred HHHH-------HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcC-CCCChh--hccceee--eehHHHHHHHHHHHHHHH Q lcl|NC_020414. 369 EIQR-------DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAG-DSFTSE--LVDPVIV--TGIEALGRMAELDKLANF 436 (515) Q Consensus 369 Ei~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~-~~~p~~--~~~~~~v--~~l~~l~ra~~~~~l~~~ 436 (515) .+.. ++.+++..++..+.++.. +|..++.... ...+.+ .+.+.+- .+.+.+..++. +..+ T Consensus 375 Al~~~~~~l~~k~~~k~~~~~~~l~~~~~-----li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~---~~kl 446 (511) T protein:vir:96 375 AMKYKLFGLEQRTKTKEGLFTKGLRRRAK-----LLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKA---YIDS 446 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHH---HHHH Confidence 6533 334444455544444321 1111222111 111222 2333332 22333333322 2211 Q ss_pred HHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 437 AQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 437 ~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) ++ .+....+++.+ -+++ -.++|++.+.++++......+ ......+....+.-.++ T Consensus 447 -------~G-------~iS~et~l~~l---~~v~----D~~~E~~ri~~E~~~~~~~~~---~~~~~~~~~~~~~~~~~ 501 (511) T protein:vir:96 447 -------GG-------KISQTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQ---KGIYKDPRDINDDEQDD 501 (511) T ss_pred -------hc-------cCChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHh---hccccCCCCCCCCCCCC Confidence 11 12223333322 1222 135677777665543222211 11112222222222222 No 59 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=98.18 E-value=3e-06 Score=50.87 Aligned_cols=427 Identities=10% Similarity=0.027 Sum_probs=187.8 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc-----ccCCCCC-CccccccccccHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY-----LMNNKGD-NETSQNGWQGVGAQATNHLANKLA 74 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~~~~-~~~~~~~~dst~~~a~~~Laa~l~ 74 (515) |-..+...-.+-+.+.+........+.+ +++++.+|..-. ......+ .....|+..+.+...++..++-|+ T Consensus 31 ~~~~~~~~~~~~~~i~~~i~~~~~~~~~---r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:10 31 YDGTESDLLQNVNEVSKCIEHHMDYQRP---RLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred CchhhhhcccCHHHHHHHHHHHHHhhHH---HHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhc Confidence 5444444434445566665555444443 444555554321 1111111 112234556666777776665554 Q ss_pred HhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC Q lcl|NC_020414. 75 QVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSKG 152 (515) Q Consensus 75 s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~~ 152 (515) + -|+ +++.++.. +. ..+...+..++|.....++.+++.++|.+.. |.+.++ T Consensus 108 g--~p~-----~~~~~d~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg 160 (511) T protein:vir:10 108 G--NPI-----QYQDDDKD-------------VL-------EAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQDD 160 (511) T ss_pred c--cCc-----eeecCchH-------------HH-------HHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCC Confidence 3 121 12333221 11 1234456678899999999999999999865 556665 Q ss_pred cEE--EEE-cceEEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEE Q lcl|NC_020414. 153 AMS--AVP-MHHYVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQS 228 (515) Q Consensus 153 ~~r--~~p-l~~y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e 228 (515) .++ +++ ..-|++--|. .+++...+|.+.....+ .....++++.-.+.++..+.+... T Consensus 161 ~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d-------------------~~~~~~~~~~~iyt~~~i~~~~~~ 221 (511) T protein:vir:10 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID-------------------KTDEDEVFTVDLFTSHGVYRYLTS 221 (511) T ss_pred ceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc-------------------cCccceEEEEEEEeCCcEEEEEec Confidence 544 443 3445554443 35666656555432110 001111222222233322222211 Q ss_pred eCCe-----eecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccC Q lcl|NC_020414. 229 ADDI-----PVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTD 303 (515) Q Consensus 229 ~~~~-----~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~ 303 (515) -++. ........+-..+|++.++- +.+|.|=.+..++-+..++.+.-..........+|.+.+.-....+ T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~vPvv~f~n-----n~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~ 296 (511) T protein:vir:10 222 RTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD 296 (511) T ss_pred CCCcccccccccccccccCcceeEEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCC Confidence 1111 11111111114578777653 4578998899999999999877777777777777765443222222 Q ss_pred hhhccCCCCccee--------c------CCcccccccccCCccchHHHHHHHHHHHHHHHHHHH-HHhhccCCCCCCCHH Q lcl|NC_020414. 304 VDHFVNSGTGEVI--------T------GVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-METMTRRDAERVTAV 368 (515) Q Consensus 304 ~~~~~~~~~g~~~--------~------g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl-~~~l~~~~~~~~TAt 368 (515) ...+..-..+.++ . +...++..+ ....+.......++.++..|...-. .+.....-+...|+. T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l--~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~ 374 (511) T protein:vir:10 297 PVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYI--YKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGE 374 (511) T ss_pred chhhccchhccceecccccccccccccCCCCcceeEE--eecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHH Confidence 2222211111111 1 111122222 2334556666777777776643311 000000011245777 Q ss_pred HHHHH-------HHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcC-CCCChh--hccceeee--ehHHHHHHHHHHHHHHH Q lcl|NC_020414. 369 EIQRD-------ALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAG-DSFTSE--LVDPVIVT--GIEALGRMAELDKLANF 436 (515) Q Consensus 369 Ei~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~-~~~p~~--~~~~~~v~--~l~~l~ra~~~~~l~~~ 436 (515) .+... ..+++..++..+.++-. +|..++.... ..-+.+ .+++.+-. +.+.+..++.+.++ T Consensus 375 Al~~~~~~l~~k~~~k~~~f~~~l~~~~~-----li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--- 446 (511) T protein:vir:10 375 AMKYKLFGLEQRTKTKEGLFTKGLRRRAK-----LLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--- 446 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHH--- Confidence 66544 56666666666555432 1111222111 111222 23333322 23333333222222 Q ss_pred HHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 437 AQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 437 ~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) . |. +....++..+ -+++ -.++|++.+.++++......+ ......++...+.-.++ T Consensus 447 ~---G~-----------iS~et~~~~l---~~v~----d~~~E~~ri~~E~~~~~~~~~---~~~~~~~~~~~~~~~~~ 501 (511) T protein:vir:10 447 G---GK-----------ISQTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQ---KGIYKDPRDINDDEQDD 501 (511) T ss_pred h---cc-----------CcHHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHh---hhcccCCCCCCCCCCCC Confidence 1 11 1122233222 1222 135677766665443222111 11112222232222222 No 60 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=98.17 E-value=3.2e-06 Score=50.72 Aligned_cols=434 Identities=11% Similarity=0.062 Sum_probs=201.3 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCc-------ccc-ccccccHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNE-------TSQ-NGWQGVGAQATNHLANK 72 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~-------~~~-~~~dst~~~a~~~Laa~ 72 (515) |.|-. .---.-..+..+|+.++.--.. ...|++...-.||..-..+..+. ... -.|-+.-.+.++.++ T Consensus 32 m~dV~-~~hp~y~a~~~~W~~ird~~~G-~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~-- 107 (535) T protein:vir:80 32 LPNVG-YQRVEFGEMLPKWRKIMDCLSG-QEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDGMM-- 107 (535) T ss_pred CCCCC-cCCHHHHHHHHHHHHHHHHhcC-hHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHHHh-- Confidence 66421 0001123444555544322222 35566666666776322221111 111 134444445555544 Q ss_pred HHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCC- Q lcl|NC_020414. 73 LAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSK- 151 (515) Q Consensus 73 l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~- 151 (515) +.+|- ..|.+.+ + ..++.+++.| -+...+++.-+..++.+...+|-+.+++|.. T Consensus 108 --G~vfr-k~p~~~~--p--------------~~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~ 162 (535) T protein:vir:80 108 --GQVFS-RDPIRQL--P--------------PALEAIVEDI------DGEGVSLDQQAKKALGYTMGFGRAAIFTDYPN 162 (535) T ss_pred --chhhc-CCcceec--c--------------HHHHHHHhcc------CCCCCCHHHHHHHHHHHHHhcCeEEEEEeecC Confidence 44442 2244432 1 1244455444 2345678888889999999999999999842 Q ss_pred Cc----------------EEEEEcce---EEEe-eCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEE Q lcl|NC_020414. 152 GA----------------MSAVPMHH---YVVN-RDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKL 211 (515) Q Consensus 152 ~~----------------~r~~pl~~---y~v~-~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v 211 (515) .+ +..|+-.+ +-.. .|..+++.-+..+++.+.+. ..|+ .+.++. T Consensus 163 ~~~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~d--d~f~--------------~~~~~q 226 (535) T protein:vir:80 163 VGRPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQD--DGFE--------------TTYVQQ 226 (535) T ss_pred CCCcccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEecC--CCcc--------------cceeEE Confidence 11 22333222 2222 23344455455555543221 2333 355667 Q ss_pred EEEEEEcCCCCeEEEEEe---CC-------eeec-ccCCcccccCcEEEEeeeecCCCcccc--chHHHHHHHHHHHHHH Q lcl|NC_020414. 212 YTHAQYAGEGFWKINQSA---DD-------IPVG-KENRIKAEKLPFIPLTWKRSYGEDWGR--PLVEDYSGDLFVIQFL 278 (515) Q Consensus 212 ~~~v~~~~~~~~~~~~e~---~~-------~~i~-~esgy~~~~~P~~~~Rw~~~~g~~YGr--gp~~~~l~d~k~L~~l 278 (515) |..+.++.+|.|.+.... ++ .++. ..++ +.+++|++.|.-..+..+.. .| |=|+..||.- T Consensus 227 ~RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~g~---~~l~~IPfv~~~~~~~~~~~~~pP----Ll~LA~lni~ 299 (535) T protein:vir:80 227 WRVLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTDGNG---NPFKEIPFQFIGPLDNNADIDHPP----LLDLCEVNIG 299 (535) T ss_pred EEEEEecCCceEEEEEEEeecCCccccccceeecccCCC---cccCeeEEEEeecCCCCCCCCccc----hHHHHHHHHH Confidence 888888877777654431 11 2232 2344 34788888887655555544 34 3345555422 Q ss_pred ---HHH-HHHHHHHhccCceeec-C-----ccccChhhccCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHH Q lcl|NC_020414. 279 ---SEA-VARGAALMADIKYLIR-P-----GSQTDVDHFVNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRI 348 (515) Q Consensus 279 ---~~~-~~~~~~~a~~p~~l~~-~-----~g~~~~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI 348 (515) ..+ .-..+..+..|...+. . +.......+..|.+..+.-+...+.+.+++. +..+. .+.++++++++ T Consensus 300 Hy~~ssd~~~il~~~~~P~l~i~G~~~~~~~~~~~~~~i~iG~~~~~~lP~~~~~~~~e~~-~~~~a--~~~l~~~e~qM 376 (535) T protein:vir:80 300 HYRNSADYEEMAFVAGQPTAFFTGLTKDWVEDVFKDFKVHLGSRAIIPLPQGATAGILQIT-PNSVP--FEAMTHKESQM 376 (535) T ss_pred HhhchhHHHHHHHHhcCceeeeecCchhhhhcCCCCcceEecCcccccCCCCCCcceeeec-cchhH--HHHHHHHHHHH Confidence 222 3333444555533221 1 1122222233333333322333344555543 23333 35677777777 Q ss_pred HHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhh----ccceee-eehHH Q lcl|NC_020414. 349 GVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSEL----VDPVIV-TGIEA 423 (515) Q Consensus 349 ~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~----~~~~~v-~~l~~ 423 (515) ++. -..++. ......||+|...+.+..-.+|.-+...+..-+-. ++.++-.=.+..+.++. ++..++ ..+ T Consensus 377 ~~l-Ga~ll~-~~~~~~Ta~~a~~~~~~~~S~L~~~a~~le~al~~-aL~~~A~w~G~~~~~~~~~i~~n~dF~~~~l-- 451 (535) T protein:vir:80 377 IAM-GANLLV-KSGGNRTFGEAQQEEASEQSILSACTKNVSMAFRK-ALRWANQFQTGIVNDETVEYNLNTDFPAARL-- 451 (535) T ss_pred HHH-HHHhhc-cCcccccHHHHHHHHHHHhHHHHHHHHHHHHHHHH-HHHHHHHHcCCccCCCceEEEeccccccccC-- Confidence 653 122233 33445899999988888888888887777766544 33333221222222222 222332 222 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_020414. 424 LGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKA 503 (515) Q Consensus 424 l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a 503 (515) +.+.++.++..+. ...|..+.+++.+ ...||...-+..++|...+..+.+. ....++.. T Consensus 452 -----d~~~~~all~~~~---------~G~Is~et~~~~L-~r~gvl~~~~~~eee~~ri~~E~~~------~~~~~g~~ 510 (535) T protein:vir:80 452 -----TPNERAELILEWQ---------QGAITFKEMRAGL-RRAGVASEDDAKAETEGKATVEFIA------KTAAAGKV 510 (535) T ss_pred -----CHHHHHHHHHHHh---------cCCCCHHHHHHHH-HhCCCCCcccchHHHHHHHHhhhhh------ccccCCCC Confidence 1122332222221 1235556666665 4456644323334443333222111 11112211 Q ss_pred ------------ccchhhhhhccC Q lcl|NC_020414. 504 ------------VPGVIQQEMKEG 515 (515) Q Consensus 504 ------------~~~~~~~~~~~~ 515 (515) ...+.|+.++.| T Consensus 511 ~d~~~~g~~~~~~~~~~~~~~~~~ 534 (535) T protein:vir:80 511 GDAASGGTNKAKLNNGNGGGNQAG 534 (535) T ss_pred CCCCCCCCCcCcccCCccccccCC Confidence 222445666666 No 61 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=98.14 E-value=3.8e-06 Score=50.29 Aligned_cols=427 Identities=10% Similarity=0.047 Sum_probs=178.2 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccc---cCC---CCCCccccccccccHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYL---MNN---KGDNETSQNGWQGVGAQATNHLANKLA 74 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~---~~~---~~~~~~~~~~~dst~~~a~~~Laa~l~ 74 (515) +-..+.-..-+.+.+.+..+.-+..+ .++++++.+|....- ... ........++..+-+...++..++-|+ T Consensus 31 ~~~~~~~~~~~~~~i~~~i~~h~~~~---~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~ 107 (502) T protein:vir:48 31 ADNLEELMVNNWELLKNFINHHKLRQ---APRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLA 107 (502) T ss_pred ccchhhhccccHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCccccccccccccccccceeecchHHHHHHHHhhhhc Confidence 22222112222233333333333333 334555666654421 110 011111224455556666666655443 Q ss_pred HhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEE--EeCCC Q lcl|NC_020414. 75 QVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLY--KPSKG 152 (515) Q Consensus 75 s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~--~d~~~ 152 (515) +- . ++++..+.. ....+.+ .+...+..++|....+++.+++.++|.+.++ .+.++ T Consensus 108 g~----p---~~~~~~d~~---------~~~~~~~-------~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg 164 (502) T protein:vir:48 108 GN----P---IRVEYDDNE---------DNSQNDD-------AIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYD 164 (502) T ss_pred cc----C---eeEecCCcc---------chhHHHH-------HHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCC Confidence 21 1 122322211 1112233 2344567789999999999999999998654 46655 Q ss_pred cE--EEEEc-ceEEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEE Q lcl|NC_020414. 153 AM--SAVPM-HHYVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQS 228 (515) Q Consensus 153 ~~--r~~pl-~~y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e 228 (515) .+ ++++. .-|++-.|. .+++...+|.+..... .+....+++|+ ++. .+++. T Consensus 165 ~~~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~------------------~~~~~~~~iyt-----~~~--i~~~~ 219 (502) T protein:vir:48 165 ETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTL------------------QNAKDVVEIYT-----NQH--IYTLD 219 (502) T ss_pred ceEEEEEcccceEEEEcCCCCCceEEEEEEEEEeec------------------CCcEEEEEEEe-----CCe--EEEEE Confidence 54 44543 445555444 4666666655442111 01112233332 221 22222 Q ss_pred eCC--eeecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccCh-h Q lcl|NC_020414. 229 ADD--IPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDV-D 305 (515) Q Consensus 229 ~~~--~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~-~ 305 (515) .++ ..+..+..+. ..+|++.++ ++..|.|-.+.+++-+..++.+.-......+....|.+.+.-...... . T Consensus 220 ~~~~~~~~~~~~~~~-g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~ 293 (502) T protein:vir:48 220 ASDSFNEISVTPHAF-GTVPITEFL-----NNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGM 293 (502) T ss_pred eCCceeeccceecCC-CccceEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccc Confidence 222 2222222221 357887654 345799999999999999998888888888888877655432211111 0 Q ss_pred hccCC-CCcceec-------CCcccccccccCCccchHHHHHHHHHHHHHHHHHHH-HHhhccCCCCCCCHHHHHHH--- Q lcl|NC_020414. 306 HFVNS-GTGEVIT-------GVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-METMTRRDAERVTAVEIQRD--- 373 (515) Q Consensus 306 ~~~~~-~~g~~~~-------g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl-~~~l~~~~~~~~TAtEi~~r--- 373 (515) ..... ..+.+.. |..+....-.+....+.+.....++.+.+.|...=. .+......+...|+..+... T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~ 373 (502) T protein:vir:48 294 QASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLFG 373 (502) T ss_pred chhhhhhcceeeccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHHH Confidence 00000 0111111 111122222222334456666677777776643211 11000011233577766533 Q ss_pred ----HHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCC--CCChhhccceeee--ehHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020414. 374 ----ALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGD--SFTSELVDPVIVT--GIEALGRMAELDKLANFAQYMSLPQT 445 (515) Q Consensus 374 ----~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~--~~p~~~~~~~~v~--~l~~l~ra~~~~~l~~~~~~v~~~a~ 445 (515) +.++...++..+.++.. +|-.++..... ......+.+.+-. +.+.+..++ .+.. +++ T Consensus 374 l~~k~~~~~~~~~~~l~~~~~-----li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~---~~~k-------l~g 438 (502) T protein:vir:48 374 LDQDRVDTQSQFTQGLKRRYR-----LAARIGSLVNEFKDFDESRLKITFTPNLPKSLYEQVS---ILND-------LGG 438 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHH-----HHHHHHhhcccccccccccceEEeCCCCCcCHHHHHH---HHHH-------Hhc Confidence 34444444444433322 11111222221 1111123333321 222222222 2221 111 Q ss_pred CChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 446 WPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 446 ~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) + |.-+.+++.+ -+++ -.++|++.+.+++.+...... ........+.+ .+...++ T Consensus 439 ~-------iS~et~l~~l---~~v~----D~~~E~~ri~~E~~~~~~~~~-~~~~~~~~~~~-~d~~~e~ 492 (502) T protein:vir:48 439 Q-------VSQETALSLS---GLVE----NPTEELDKINEESSKIDFKGY-PSYFYDNVGKY-TDEVKET 492 (502) T ss_pred c-------CcHHHHHHhC---CCCC----CHHHHHHHHHHHHHhhhhhcc-ccccccccccc-CCCccCC Confidence 1 1122233332 1121 124666666655433222111 11111111111 1111122 No 62 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=98.13 E-value=4e-06 Score=50.18 Aligned_cols=408 Identities=10% Similarity=0.017 Sum_probs=162.6 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhh--cccccCCCCCCccccccccccHHHHHHHHHHHHHHhhc Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLT--LPYLMNNKGDNETSQNGWQGVGAQATNHLANKLAQVLF 78 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~--~P~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~s~lt 78 (515) |-+++ +..+....+... .+.++....+.+|+-- ++.........-+.-++..+-+..+++.++..| + T Consensus 1 ~~~~~------~~~i~~l~~~~~-~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l----~ 69 (441) T protein:vir:80 1 MNSDE------LALIEGMYDRIQ-RLSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERL----D 69 (441) T ss_pred CCccH------HHHHHHHHHHHH-HHHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhh----c Confidence 65555 223333333332 2333344444444221 222111111110112344555556666665554 3 Q ss_pred CCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCCc--E Q lcl|NC_020414. 79 PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSKGA--M 154 (515) Q Consensus 79 pp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~~~--~ 154 (515) |.+ | ..++. .+++. ....++|.....++.++..++|.+.+ |.|.++. + T Consensus 70 ~~g---~--~~~d~------------~~l~~-----------i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i 121 (441) T protein:vir:80 70 WLG---W--TNGDG------------YGLDG-----------VYAANRLATASCDVHLDALIFGLSFVAIIPHGDGTVSV 121 (441) T ss_pred ccc---c--cCCCh------------HHHHH-----------HHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCceEE Confidence 333 2 11111 12222 34568999999999999999999865 4455543 5 Q ss_pred EEEEcceE-EEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCC-e Q lcl|NC_020414. 155 SAVPMHHY-VVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADD-I 232 (515) Q Consensus 155 r~~pl~~y-~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~-~ 232 (515) ++++..+. ++.-+..+++...++++.... +....+++|. ++..+. |...++ . T Consensus 122 ~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~--------------------~~~~~~~vy~-----~~~~~~-~~~~~~~~ 175 (441) T protein:vir:80 122 RPQSPKNCTGKFSADGSRLDAGLVVQQTCD--------------------PEVVEAELLL-----PDVIVQ-VERRGSRE 175 (441) T ss_pred EEEccceEEEEEeCCCCceeEEEEEEEEec--------------------CceEEEEEEe-----cCeEEE-EEEcCCcc Confidence 55655554 444345567776666554210 0011122321 111111 111111 1 Q ss_pred e-ecc--cCCcccccCcEEEEeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccC---hh Q lcl|NC_020414. 233 P-VGK--ENRIKAEKLPFIPLTWKRSYGEDWGRPLVE-DYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTD---VD 305 (515) Q Consensus 233 ~-i~~--esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~-~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~---~~ 305 (515) . ... .-+| +.+|++++.-+...++.||+|-.. +..+-+-.++...-......+....|...+- |... .. T Consensus 176 ~~~~~~~~~~~--g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~--G~~~~~~~~ 251 (441) T protein:vir:80 176 WVEVDRIPNVL--GAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVT--GVSADEFSQ 251 (441) T ss_pred eeeccccccCC--CceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeee--cCCcccccc Confidence 1 111 1233 459999988788888899999543 3556666777777777777777777754331 1100 01 Q ss_pred hccCCCCccee--cCCccc--ccccccCCccchHHHHHHHHHHHHHHHHHH-H-HHhhccCCCCCCCHHHHHH------- Q lcl|NC_020414. 306 HFVNSGTGEVI--TGVEED--IHIVQLGKYADLTPISAVLEVYTRRIGVIF-M-METMTRRDAERVTAVEIQR------- 372 (515) Q Consensus 306 ~~~~~~~g~~~--~g~~~~--v~~~~~~~~~~l~~~~~~i~~~~~rI~~af-l-~~~l~~~~~~~~TAtEi~~------- 372 (515) .......|.+. ++..+. +...++. .++++.....++.+...|...- + ...+.......-++.-+.. T Consensus 252 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~-~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~ 330 (441) T protein:vir:80 252 PGWVLSMASVWAVDKDDDGDTPNVGSFP-VNSPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVK 330 (441) T ss_pred chhhhcccccccCCCCCCCCcceeEecC-ccchHHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHH Confidence 11111223332 222111 2222222 2344433333333333222110 0 0111111111113443332 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhh--ccceeee--ehHHHHHHHHHHHHHHHHHHHHHhhcCCh Q lcl|NC_020414. 373 DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSEL--VDPVIVT--GIEALGRMAELDKLANFAQYMSLPQTWPE 448 (515) Q Consensus 373 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~--~~~~~v~--~l~~l~ra~~~~~l~~~~~~v~~~a~~~p 448 (515) ..+++...+++.+.++.. ++..+ .+.....+... +.+.+-. +.+.++.++.+.++. + ++..+ T Consensus 331 k~~~~~~~f~~~l~~~~~-----l~~~~-~~~~~~~~~~~~~i~~~f~~~~~~~~~e~ad~~~kl~---~-----~g~~~ 396 (441) T protein:vir:80 331 RAERRQTSFGQGWLSVGF-----LAAKA-LDSRVDEADFFGDVGLRWRDASTPTRAATADAVTKLV---G-----AGILP 396 (441) T ss_pred HHHHHHHHHHHHHHHHHH-----HHHHH-hcCCCcccccceeeeEEeCCCCCcCHHHHHHHHHHHH---h-----cCccc Confidence 234444444444433222 11111 12222222221 2222211 122222222222221 1 11111 Q ss_pred HHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHH-HHHHHHhhhhccchh Q lcl|NC_020414. 449 PAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQE-AMLNEGVAKAVPGVI 508 (515) Q Consensus 449 ~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~-~~~~~~~~~a~~~~~ 508 (515) + .... +...+|. +++|++++.+.+++++.. .++..+.. ..+.-+ T Consensus 397 -~----s~~~----~~~~l~~------~~~e~~~~~~e~~e~~~~~~~~~~~~~-~~~~~~ 441 (441) T protein:vir:80 397 -A----DSRT----VLEMLGL------DDVQVEAVMRHRAESSDPLAVLAGAIS-RQTNEV 441 (441) T ss_pred -c----cHHH----HHHhCCC------CHHHHHHHHHHHHHHHHHHHHHhhhhh-cccccC Confidence 1 1111 2233343 356666655544333221 22222222 122222 No 63 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=98.11 E-value=4.5e-06 Score=49.91 Aligned_cols=412 Identities=9% Similarity=0.033 Sum_probs=187.2 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCC-CCCCccccccccccHHHHHHHHHHHHHHhhcC Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNN-KGDNETSQNGWQGVGAQATNHLANKLAQVLFP 79 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~-~~~~~~~~~~~dst~~~a~~~Laa~l~s~ltp 79 (515) |... ..++.+.+.+..+++.. |.+....++++|+-.-+-+... ........|+-.+.+...++..++-|.+- | T Consensus 11 ~~~~---~~~~~~~i~~~i~~~~~-~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~--~ 84 (453) T protein:vir:73 11 YSRD---EEITDKVVNDFMKKHQE-EVERYEYLGNMYKGIMEISSQKAKDSWKPDNRLTNNFAKYIVDTFVGYFNGI--P 84 (453) T ss_pred cccc---ccCCHHHHHHHHHHHHH-HHHHHHHHHHHhccccchhcCCCCCccCccceeecchHHHHHHHhhhhhccc--C Confidence 3322 23466667776666654 4456666666666432211111 11111223455677777888777666431 2 Q ss_pred CCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCCcEE-- Q lcl|NC_020414. 80 AQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSKGAMS-- 155 (515) Q Consensus 80 p~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~~~~r-- 155 (515) +.+...+.. ..+ .+...+..++|.....++.++..++|.+.+ |.++++.++ T Consensus 85 -----~~~~~~d~~-------------~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~ 139 (453) T protein:vir:73 85 -----IKKTHDDKS-------------VLE-------AMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNESTESEVI 139 (453) T ss_pred -----ceeecCChH-------------HHH-------HHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCceEEE Confidence 122332211 122 223335568899999999999999999865 446555444 Q ss_pred EE-EcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCe-- Q lcl|NC_020414. 156 AV-PMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDI-- 232 (515) Q Consensus 156 ~~-pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~-- 232 (515) ++ |..-|++..|..++....+.++.... +....+++|+. + ..+++..++. T Consensus 140 ~~~p~~~~~v~dd~~~~~~~~~i~~~~~~--------------------~~~~~~~vyt~-----~--~i~~~~~~~~~~ 192 (453) T protein:vir:73 140 YCSPLNVFMVYDDSIKQKPLFAVYYGFDE--------------------EGNLSGTVYTL-----L--ETISITGKAGEV 192 (453) T ss_pred EEcccceEEEEeCCCCceeEEEEEEEEec--------------------CceEEEEEEeC-----C--eEEEEEecCCce Confidence 44 45667777777677655555544321 01122344431 2 1112222111 Q ss_pred eecc--cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCC Q lcl|NC_020414. 233 PVGK--ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNS 310 (515) Q Consensus 233 ~i~~--esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~ 310 (515) .+.. .-+| ..||++.++ ++.+|+|=.+...+-+-.++.+.-......+....|.+.+. +.....+..... T Consensus 193 ~~~~~~~~~~--g~vPvv~~~-----n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~-g~~~~~~~~~~~ 264 (453) T protein:vir:73 193 KFGESTYNVY--SDLPIVEYN-----FNEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFL-GAEVDEEDAKNI 264 (453) T ss_pred EEccceeccC--CceeEEEec-----CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeee-cCCCCchhhhcc Confidence 1221 2233 358887653 34679998888888888899888888888888888865542 111111111111 Q ss_pred CCcce------ecC------CcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHH------ Q lcl|NC_020414. 311 GTGEV------ITG------VEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQR------ 372 (515) Q Consensus 311 ~~g~~------~~g------~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~------ 372 (515) ..+.+ .++ ...+++.+. ...+.......++.++..|-..-..-.+........|+..+.. T Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~d~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~ 342 (453) T protein:vir:73 265 KDNRLINFFDKNSNGQGTNAAKVDVKFLD--KPDSDVQTENLLNRLERSIFQFTMAANISDENFGNSSGVALAYKLQAMS 342 (453) T ss_pred cccccccccccccccccccccCceeEEee--ecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccCccHHHHHHHHHHHH Confidence 11111 111 111232222 2234555667777777777443211011111112356655532 Q ss_pred -HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeee--ehHHHHHHHHHHHHHHHHHHHHHhhcCChH Q lcl|NC_020414. 373 -DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVT--GIEALGRMAELDKLANFAQYMSLPQTWPEP 449 (515) Q Consensus 373 -r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~--~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~ 449 (515) +++++...+|..+.++..- +..++...........+++.+-. +.+.++.++ .+.... | T Consensus 343 ~ka~~~~~~~~~~l~~~~~l-----i~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~---~~~k~~---g-------- 403 (453) T protein:vir:73 343 NLALSFQRKFQSALNRRYSL-----WSSLSTNASNKDAWKDIEYTFTRNEPKDIKEQAE---TANILK---G-------- 403 (453) T ss_pred HHHHHHHHHHHHHHHHHHHH-----HHHHHhccCCccccccceEEeCCCCCCCHHHHHH---HHHHHh---c-------- Confidence 2344444444444333221 11122222221111223333322 222232222 222111 1 Q ss_pred HHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhh Q lcl|NC_020414. 450 AQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEM 512 (515) Q Consensus 450 ~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~ 512 (515) .+..+.+++.+ -+++ -.++|++.+.++++++..+++- +.-.++..+-|.| T Consensus 404 ---iis~et~~~~~---~~~~----d~~~E~~ri~~E~~~~~~~~~~---~~~~~~~~~~~~~ 453 (453) T protein:vir:73 404 ---ITSEETALSVI---SVIP----DVQAEMEKIKKKKLLQLSLTRT---SNLVRMKQMRGNL 453 (453) T ss_pred ---cCcHHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHHh---ccCCcchhhhcCC Confidence 12222222222 1222 2356777776654433322221 1212222333333 No 64 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=98.09 E-value=4.8e-06 Score=49.76 Aligned_cols=425 Identities=10% Similarity=0.051 Sum_probs=186.1 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcc-----cccCCCCC-CccccccccccHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLP-----YLMNNKGD-NETSQNGWQGVGAQATNHLANKLA 74 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P-----~~~~~~~~-~~~~~~~~dst~~~a~~~Laa~l~ 74 (515) |-.++...-.+.+.+.+..+.....|.+.. +.+.+|..- .......+ .....|+..+.+...++..++-|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~---~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:93 31 YDGTESDLLQNVNEVSKYIEHHMDYQRPRL---KVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred ccchhhhhhccHHHHHHHHHHHHHhhHHHH---HHHHHHhcccCccccccCcCcccccCcceeecchHHHHHHHHhhhhc Confidence 555554444455556665555555554444 444444432 11111111 112235666777777777776554 Q ss_pred HhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC Q lcl|NC_020414. 75 QVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSKG 152 (515) Q Consensus 75 s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~~ 152 (515) + -|+ +++.+++. +.+ .+...+..++|.....++.++..++|.+.+ |.+.++ T Consensus 108 g--~p~-----~~~~~d~~-------------~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~ 160 (511) T protein:vir:93 108 G--NPI-----QYQDDDKD-------------VLE-------VIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (511) T ss_pred c--cCe-----eeccCChH-------------HHH-------HHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCC Confidence 3 121 22333321 122 233445668899999999999999999865 456555 Q ss_pred cE--EEEEc-ceEEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEE Q lcl|NC_020414. 153 AM--SAVPM-HHYVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQS 228 (515) Q Consensus 153 ~~--r~~pl-~~y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e 228 (515) .+ ++++. .-|++.-|. .+++...+|.+.....+ ....-.+++.-.+.++..+. |.. T Consensus 161 ~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~-------------------~~~~~~~~~~~iyt~~~i~~-~~~ 220 (511) T protein:vir:93 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID-------------------KTDEDEVFTVDLFTSHGVYR-YLT 220 (511) T ss_pred ceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecc-------------------ccccceEEEEEEEeCCcEEE-EEe Confidence 44 44544 445554433 46776666655432110 00001111211122222111 111 Q ss_pred eCCe-------eec-ccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCcc Q lcl|NC_020414. 229 ADDI-------PVG-KENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGS 300 (515) Q Consensus 229 ~~~~-------~i~-~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g 300 (515) .++. ... ..-+| +.+|++.++- +..|+|=.+..++-+..++.+.-......+....|.+.+.-.. T Consensus 221 ~~~~~~~~~~~~~~~~~~~~--g~vPvv~~~n-----n~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~ 293 (511) T protein:vir:93 221 SRTNGLKLTPRENGFESHSF--ERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNL 293 (511) T ss_pred cCCCccccccccccccccCC--CccceEEecC-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCc Confidence 1111 111 11233 3578877653 4578998899999999999887777777777777765543222 Q ss_pred ccChhhccCCCCccee--------------cCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHH-HhhccCCCCCC Q lcl|NC_020414. 301 QTDVDHFVNSGTGEVI--------------TGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMM-ETMTRRDAERV 365 (515) Q Consensus 301 ~~~~~~~~~~~~g~~~--------------~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~-~~l~~~~~~~~ 365 (515) ......+..-..+.+. .+...++..+ ....+.+.....++.+...|...-.. +.....-+... T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~ 371 (511) T protein:vir:93 294 NLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYI--YKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQ 371 (511) T ss_pred ccCchhhcccccccceecccccccccccccCCCCcceeEE--eecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccc Confidence 2222222211111111 1111222232 23345666677777777777433211 00000111235 Q ss_pred CHHHHHH-------HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCC-CCChhh--ccceee--eehHHHHHHHHHHHH Q lcl|NC_020414. 366 TAVEIQR-------DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGD-SFTSEL--VDPVIV--TGIEALGRMAELDKL 433 (515) Q Consensus 366 TAtEi~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~-~~p~~~--~~~~~v--~~l~~l~ra~~~~~l 433 (515) |+..+.. ++.+++..++..+.++-. +|-.++..... ..+... +.+.+- .+.+.+..++. + T Consensus 372 Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~-----li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~---~ 443 (511) T protein:vir:93 372 SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAK-----LLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKA---Y 443 (511) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHhccCcccccccccceEEeCCCCCCCHHHHHHH---H Confidence 6665543 334444444444433222 11112222221 112222 333332 23333333322 2 Q ss_pred HHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhh-hh Q lcl|NC_020414. 434 ANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQ-EM 512 (515) Q Consensus 434 ~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~-~~ 512 (515) ..+ +++ +....++..+ -+++ -+++|++.+.++++...... +.......++...+ .. T Consensus 444 ~kl-------~g~-------iS~et~~~~l---~~v~----d~~~E~~ri~~E~~~~~~~~--~~~~~~~~~~~~~~~~~ 500 (511) T protein:vir:93 444 IDS-------GGK-------ISQTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKA--QKGIYKDPRDINDDEQD 500 (511) T ss_pred HHH-------hcc-------CchHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHH--hhhcccCCCCCCCCCCC Confidence 211 111 2222233322 1222 23567777766554332211 11111112111111 11 Q ss_pred ccC Q lcl|NC_020414. 513 KEG 515 (515) Q Consensus 513 ~~~ 515 (515) .++ T Consensus 501 ~~~ 503 (511) T protein:vir:93 501 DDT 503 (511) T ss_pred Ccc Confidence 111 No 65 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=98.07 E-value=5.4e-06 Score=49.46 Aligned_cols=416 Identities=12% Similarity=0.074 Sum_probs=188.1 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccc--cCCC--CCCccccccccccHHHHHHHHHHHHHHh Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYL--MNNK--GDNETSQNGWQGVGAQATNHLANKLAQV 76 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~--~~~~--~~~~~~~~~~dst~~~a~~~Laa~l~s~ 76 (515) +--|. ...++.+.|.+..+..+.+ .++.+.+.+|..-.- .... .......|+..+.+...++..++-|++ T Consensus 9 ~~~~~-~~~~~~~~i~~~i~~~~~~----~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g- 82 (452) T protein:vir:36 9 MTFSK-DEPITVEVVTKFMEKHKLE----VARYEYLKNMYLGIMAIDDEPAKDSWKPDNRLAVNFTKYIVDTFTGYFNG- 82 (452) T ss_pred EEcCC-ccCCCHHHHHHHHHHHHHH----HHHHHHHHHHhccccccccCccccccCccceeecchHHHHHHHHhhhhcc- Confidence 11111 1123667777766665433 344556666655421 1111 111112345566777777777766653 Q ss_pred hcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEE--EeCCCcE Q lcl|NC_020414. 77 LFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLY--KPSKGAM 154 (515) Q Consensus 77 ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~--~d~~~~~ 154 (515) .| +.++..+.. .. ..+...+..++|....+++.++...+|.+.++ .|.++.+ T Consensus 83 -----~~-~~~~~~d~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~ 136 (452) T protein:vir:36 83 -----IP-VKKSHSDKE-------------IL-------TKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDEDTQT 136 (452) T ss_pred -----cC-ceeecCChh-------------HH-------HHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCCCee Confidence 11 223333321 11 12344566789999999999999999998754 4655544 Q ss_pred --EEEEcce-EEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeC Q lcl|NC_020414. 155 --SAVPMHH-YVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSAD 230 (515) Q Consensus 155 --r~~pl~~-y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~ 230 (515) ++++..+ |.+.-|. .+.+...+|.+.-. +....+++|+ ++..+.+....+ T Consensus 137 ~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~---------------------~~~~~~~vyt-----~~~i~~~~~~~~ 190 (452) T protein:vir:36 137 NVVYNSPENMFMVYDDTVKQEPLFAVRYGVDE---------------------DKKLQGEVYT-----LLETIKISGEND 190 (452) T ss_pred EEEEEcccceEEEEcCCCCCceEEEEEEEEec---------------------CceEEEEEEe-----cCeEEEEEEcCC Confidence 4454433 4443333 34444444443210 0112234443 221122222112 Q ss_pred Ceeecc--cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhcc Q lcl|NC_020414. 231 DIPVGK--ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFV 308 (515) Q Consensus 231 ~~~i~~--esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~ 308 (515) +-.+.. .-+| ..||++..+. +..|+|=.+...+-+-.++.+.-......+....|.+.+. +.....+... T Consensus 191 ~~~~~~~~~~~~--g~iPvv~~~n-----~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~-g~~~~~~~~~ 262 (452) T protein:vir:36 191 EISFGEGTYNPY--PDLPVVEFYF-----NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFL-GAAVEEEDLK 262 (452) T ss_pred ceEEecceeccC--CcccEEEecC-----CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEee-cCCcCchhhh Confidence 222221 2234 3478776644 3468888888888899999988888888888888866553 2222233322 Q ss_pred CCCCcce--ecC-Cc---ccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHH-------HH Q lcl|NC_020414. 309 NSGTGEV--ITG-VE---EDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRD-------AL 375 (515) Q Consensus 309 ~~~~g~~--~~g-~~---~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r-------~~ 375 (515) ....+.. ++. .. .++..+ ....+.......++.+++.|...-..-.+........|+..+..+ .. T Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~ 340 (452) T protein:vir:36 263 NIRSNRVINYYADGEGKNVDVKFL--EKPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLAL 340 (452) T ss_pred hhhhcceEEecCCCCccCCcceeE--eecCCHHHHHHHHHHHHHHHHHHhCccccCcccccCCcHHHHHHHHHHHHHHHH Confidence 2222211 111 11 123322 223456666677777777663332110111112124566665432 34 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceee--eehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhc Q lcl|NC_020414. 376 EIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIV--TGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRA 453 (515) Q Consensus 376 E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v--~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~ 453 (515) ++...++..+.++..= |-.++...+-......+++.+- .+.+.+..++- +... ++ . T Consensus 341 ~~~~~~~~~l~~~~~l-----i~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~a~~---~~k~-------~g-------~ 398 (452) T protein:vir:36 341 SFQRKFQSSLNSRYKL-----FCELSTNVSNKDSWKDIEYTFTRNEPKDIKEQAET---ANIL-------MG-------I 398 (452) T ss_pred HHHHHHHHHHHHHHHH-----HHHHHhccCCccccccceEEeCCCCCcCHHHHHHH---HHHH-------hc-------c Confidence 4444444444432221 1112222222211122333332 22233333322 2211 11 1 Q ss_pred CCHHHHHHHHHHhcC-CchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhcc Q lcl|NC_020414. 454 IRWGDYMDWVRGQIS-AELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKE 514 (515) Q Consensus 454 id~d~~~~~~a~~~G-vp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~ 514 (515) +....+++ .+| ++ -.++|++.+.++++++.+..+..++-..-.....++..+| T Consensus 399 iS~et~~~----~~~~~~----d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~e 452 (452) T protein:vir:36 399 TSQETALS----VISVIP----DVQAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVSETNEE 452 (452) T ss_pred CChHHHHH----hCCCCC----CHHHHHHHHHHHHHHHHHHHhhccCCCCcccccCccccCC Confidence 22222332 222 21 1357777777665544433332222222222222333333 No 66 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=98.03 E-value=6.4e-06 Score=49.06 Aligned_cols=406 Identities=10% Similarity=0.040 Sum_probs=181.1 Q ss_pred ccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCC-CCCccccccccccHHHHHHHHHHHHHHhhcCCCCCceecC Q lcl|NC_020414. 10 GQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNK-GDNETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVD 88 (515) Q Consensus 10 ~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~-~~~~~~~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~ 88 (515) +|.+.|.+..++++..+ +.....+++|+=--+-+.... .......++-.+.+...++..++-|++ .| +.++ T Consensus 1 l~~~~l~~~i~~~~~~~-~r~~~l~~yy~g~~~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g--~~-----~~~~ 72 (429) T protein:vir:98 1 MTKDLLSELIQKHRSFN-LSYSAYKQLYEGDHAILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYFIG--VP-----VQTS 72 (429) T ss_pred CCHHHHHHHHHHHHHHH-HHHHHHHHHhccccccccccccccCCCcceeecchHHHHHHHHhhhhcc--cC-----ceee Confidence 78999999888876433 333333444332111111111 111122345566777777777776654 12 2233 Q ss_pred CChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCCc--EEEEEc-ceEE Q lcl|NC_020414. 89 LTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK--PSKGA--MSAVPM-HHYV 163 (515) Q Consensus 89 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~--d~~~~--~r~~pl-~~y~ 163 (515) .+++ .+.+ .+...+..++|.....++.++..++|.+.+++ +.++. +++++- .-|+ T Consensus 73 ~~~~-------------~~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~~~~~~p~~~~~ 132 (429) T protein:vir:98 73 HENK-------------QVSN-------YLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDENAEAGITYLTPLEAFI 132 (429) T ss_pred cCCh-------------HHHH-------HHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEEEEEEcccceEE Confidence 3221 1222 23334556889999999999999999987544 55544 444533 3455 Q ss_pred EeeCCC-CCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeC-CeeecccCCcc Q lcl|NC_020414. 164 VNRDTN-GDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSAD-DIPVGKENRIK 241 (515) Q Consensus 164 v~~d~~-G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~-~~~i~~esgy~ 241 (515) +.-|.. +.+...+|.+.- .+ .+++.+....+. ..+|...+ +..+......+ T Consensus 133 v~dd~~~~~~~~~i~~~~~---------------------~~-----~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 185 (429) T protein:vir:98 133 VYDDSIRQKPLFAVRYFYN---------------------KG-----GVLEGSYSDASN-ITYFKDGEKGIEIGESEPHP 185 (429) T ss_pred EEeCCCCCceEEEEEEEEe---------------------cC-----ceEEEEEEeCce-EEEEEecCCceEeccccccc Confidence 544433 344444443310 00 012222233222 22222222 22222222111 Q ss_pred cccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCCccee--cCC Q lcl|NC_020414. 242 AEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVI--TGV 319 (515) Q Consensus 242 ~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~--~g~ 319 (515) -..||++..+ ++.+|+|=.+...+-+..++.+.-......+....|.+.+. +.....+.......+.++ ++. T Consensus 186 ~g~vPvv~~~-----n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~-g~~~~~~~~~~~~~~~~~~~~~~ 259 (429) T protein:vir:98 186 FDGVPMIEYV-----ENEERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKIL-GAELDDETLKSLRDTRIINLKDT 259 (429) T ss_pred CCccceEEec-----CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee-cCCCCcchhhhHhhCceeeccCC Confidence 2458887643 45689999999999999999998888888888888865543 211222222222112222 211 Q ss_pred ---cccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHH-------HHHHHHHHhhhhHHHHH Q lcl|NC_020414. 320 ---EEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQR-------DALEIEQNMGGVYSLFA 389 (515) Q Consensus 320 ---~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~-------r~~E~~~~LGpv~~rl~ 389 (515) ..++..+. ...+.+.....++.+.+.|...-..-.+...+....|+..+.. +.+++...+|..+.++. T Consensus 260 ~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 337 (429) T protein:vir:98 260 DAQQLTVEFLQ--KPDADATQEHLLDRLENLIFRTAMVANISDESFGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRY 337 (429) T ss_pred CCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCccccCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12233332 3345666677777777776443211011111112346655533 34455555555444432 Q ss_pred HHHHHHHHHHHHHhcCCCCChhhcccee--eeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhc Q lcl|NC_020414. 390 MTMQTPIAMWGLQEAGDSFTSELVDPVI--VTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQI 467 (515) Q Consensus 390 ~E~l~Pli~r~~~~~~~~~p~~~~~~~~--v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~ 467 (515) . ++-.++...........+.+.+ ..+.+.+..+ +.+..+ ++ .+..+.++ +.+ T Consensus 338 ~-----li~~~~~~~~~~~d~~~i~v~f~~~~p~~~~~~a---~~~~kl-------~g-------~is~et~~----~~l 391 (429) T protein:vir:98 338 K-----LIASYPTSKIGPKDWIGIKYKFTRNLPANLLEES---QIAGNL-------AG-------IVSEETQV----GVL 391 (429) T ss_pred H-----HHHHHhccCCCccccccceEEeCCCCCcCHHHHH---HHHHHH-------hc-------cCchHHHH----HhC Confidence 2 1111222111111111223322 1222222222 222221 11 12222223 222 Q ss_pred C-CchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhh Q lcl|NC_020414. 468 S-AELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQ 510 (515) Q Consensus 468 G-vp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~ 510 (515) | ++ -.++|++.+.+++.++.+.++ ............. T Consensus 392 ~~v~----d~~~E~~ri~~E~~~~~~~~~--~~~~~~~~~~~~~ 429 (429) T protein:vir:98 392 SIVE----NPQKEIERKNSDKSTLISRQA--GGLNGQNTTTILE 429 (429) T ss_pred CCCC----CHHHHHHHHHHHHHHHHHHHH--hhhcCCCCCCCCC Confidence 2 22 125666666665443322111 1111011111111 No 67 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=98.02 E-value=6.7e-06 Score=48.93 Aligned_cols=431 Identities=9% Similarity=0.027 Sum_probs=183.5 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcc-----cccCCCC-CCccccccccccHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLP-----YLMNNKG-DNETSQNGWQGVGAQATNHLANKLA 74 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P-----~~~~~~~-~~~~~~~~~dst~~~a~~~Laa~l~ 74 (515) +...+..+-.+-+.+.+..+.....+.+ +++++.+|..- ....... ......|+..+.+...++..++-|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~---r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:96 31 YDGTESDLLQNVNEVSKYIEHHMDYQRP---RLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred ccchhhhhhcCHHHHHHHHHHHHHhhhH---HHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhc Confidence 4444444444555566666655555544 44445555432 1111111 1112235666777778887776554 Q ss_pred HhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC Q lcl|NC_020414. 75 QVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSKG 152 (515) Q Consensus 75 s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~~ 152 (515) + -|+. ++.++.. . ...+...+..++|.....++.++...+|.+.+ |.|.++ T Consensus 108 g--~p~~-----~~~~d~~-------------~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg 160 (511) T protein:vir:96 108 G--NPIQ-----YQDDDKD-------------V-------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (511) T ss_pred c--cCce-----eecCchH-------------H-------HHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCC Confidence 3 1222 2332221 1 12344456668899999999999999999865 556665 Q ss_pred cEE--EEEc-ceEEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEE Q lcl|NC_020414. 153 AMS--AVPM-HHYVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQS 228 (515) Q Consensus 153 ~~r--~~pl-~~y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e 228 (515) .++ +++. .-|++.-|. .+++...+|.+..... +......+++.-.+.++..+.+... T Consensus 161 ~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~-------------------~~~~~~~~~~~~vyt~~~i~~~~~~ 221 (511) T protein:vir:96 161 ETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPI-------------------DKTDEDEVFTVDLFTSHGVYRYLTN 221 (511) T ss_pred ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeec-------------------cccccceEEEEEEEeCCcEEEEEec Confidence 444 4443 445554443 3566655555433210 0001111222222233322222111 Q ss_pred eCCe-e----ecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccC Q lcl|NC_020414. 229 ADDI-P----VGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTD 303 (515) Q Consensus 229 ~~~~-~----i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~ 303 (515) -.+. . .......+-+.+|++.++- +.+|+|=.+...+-+..++.+.-......+....|.+.+......+ T Consensus 222 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~ 296 (511) T protein:vir:96 222 RTNGLKLTPRENSFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD 296 (511) T ss_pred CCCcccccccccccccCcCcccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCC Confidence 1111 0 1111111123477776543 4579998999999999999887777777777777765553322233 Q ss_pred hhhccCCCCccee--------c------CCcccccccccCCccchHHHHHHHHHHHHHHHHHHH-HHhhccCCCCCCCHH Q lcl|NC_020414. 304 VDHFVNSGTGEVI--------T------GVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-METMTRRDAERVTAV 368 (515) Q Consensus 304 ~~~~~~~~~g~~~--------~------g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl-~~~l~~~~~~~~TAt 368 (515) ...+.....+.++ . +...++..+ ....+.......++.+.+.|...-. -+.....-+...|+. T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~ 374 (511) T protein:vir:96 297 PVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYI--YKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGE 374 (511) T ss_pred chhhcccccccceeccccceeccccccCCCCcceeEE--eecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHH Confidence 3332211111111 1 111122222 2233455566667776666643211 110000111234666 Q ss_pred HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHH---HHhcCC-CCChhh--ccceee--eehHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 369 EIQRDALEIEQNMGGVYSLFAMTMQTPIAMWG---LQEAGD-SFTSEL--VDPVIV--TGIEALGRMAELDKLANFAQYM 440 (515) Q Consensus 369 Ei~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~---~~~~~~-~~p~~~--~~~~~v--~~l~~l~ra~~~~~l~~~~~~v 440 (515) .+...-. .+........+.-.+.+.-+++.+ +..... .-+.+. +.+.+- .+.+.+..++ .+..+. T Consensus 375 Al~~~~~-~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d---~~~kl~--- 447 (511) T protein:vir:96 375 AMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELK---AYIDSG--- 447 (511) T ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHH---HHHHHh--- Confidence 6543221 111111222222233333333222 221111 112222 333332 2233333332 222211 Q ss_pred HHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 441 SLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 441 ~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) |. +....++..+ -+++ -.++|++.+.++++.+....+. ......+...+.-.++ T Consensus 448 G~-----------iS~et~l~~l---~~v~----d~~~El~ri~~E~~~~~~~~~~---~~~~~~~~~~~~~~~~ 501 (511) T protein:vir:96 448 GK-----------ISQTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQK---GIYKDPRDINDDEQDD 501 (511) T ss_pred cc-----------CChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHhh---ccccCCCCCCCCCCCC Confidence 11 2222233222 1222 1356666666654432222221 1111112211111111 No 68 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=98.02 E-value=6.7e-06 Score=48.93 Aligned_cols=431 Identities=9% Similarity=0.027 Sum_probs=183.5 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcc-----cccCCCC-CCccccccccccHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLP-----YLMNNKG-DNETSQNGWQGVGAQATNHLANKLA 74 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P-----~~~~~~~-~~~~~~~~~dst~~~a~~~Laa~l~ 74 (515) +...+..+-.+-+.+.+..+.....+.+ +++++.+|..- ....... ......|+..+.+...++..++-|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~---r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:78 31 YDGTESDLLQNVNEVSKYIEHHMDYQRP---RLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred ccchhhhhhcCHHHHHHHHHHHHHhhhH---HHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhc Confidence 4444444444555566666655555544 44445555432 1111111 1112235666777778887776554 Q ss_pred HhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC Q lcl|NC_020414. 75 QVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSKG 152 (515) Q Consensus 75 s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~~ 152 (515) + -|+. ++.++.. . ...+...+..++|.....++.++...+|.+.+ |.|.++ T Consensus 108 g--~p~~-----~~~~d~~-------------~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg 160 (511) T protein:vir:78 108 G--NPIQ-----YQDDDKD-------------V-------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (511) T ss_pred c--cCce-----eecCchH-------------H-------HHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCC Confidence 3 1222 2332221 1 12344456668899999999999999999865 556665 Q ss_pred cEE--EEEc-ceEEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEE Q lcl|NC_020414. 153 AMS--AVPM-HHYVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQS 228 (515) Q Consensus 153 ~~r--~~pl-~~y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e 228 (515) .++ +++. .-|++.-|. .+++...+|.+..... +......+++.-.+.++..+.+... T Consensus 161 ~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~-------------------~~~~~~~~~~~~vyt~~~i~~~~~~ 221 (511) T protein:vir:78 161 ETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPI-------------------DKTDEDEVFTVDLFTSHGVYRYLTN 221 (511) T ss_pred ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeec-------------------cccccceEEEEEEEeCCcEEEEEec Confidence 444 4443 445554443 3566655555433210 0001111222222233322222111 Q ss_pred eCCe-e----ecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccC Q lcl|NC_020414. 229 ADDI-P----VGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTD 303 (515) Q Consensus 229 ~~~~-~----i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~ 303 (515) -.+. . .......+-+.+|++.++- +.+|+|=.+...+-+..++.+.-......+....|.+.+......+ T Consensus 222 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~ 296 (511) T protein:vir:78 222 RTNGLKLTPRENSFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD 296 (511) T ss_pred CCCcccccccccccccCcCcccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCC Confidence 1111 0 1111111123477776543 4579998999999999999887777777777777765553322233 Q ss_pred hhhccCCCCccee--------c------CCcccccccccCCccchHHHHHHHHHHHHHHHHHHH-HHhhccCCCCCCCHH Q lcl|NC_020414. 304 VDHFVNSGTGEVI--------T------GVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-METMTRRDAERVTAV 368 (515) Q Consensus 304 ~~~~~~~~~g~~~--------~------g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl-~~~l~~~~~~~~TAt 368 (515) ...+.....+.++ . +...++..+ ....+.......++.+.+.|...-. -+.....-+...|+. T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~ 374 (511) T protein:vir:78 297 PVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYI--YKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGE 374 (511) T ss_pred chhhcccccccceeccccceeccccccCCCCcceeEE--eecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHH Confidence 3332211111111 1 111122222 2233455566667776666643211 110000111234666 Q ss_pred HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHH---HHhcCC-CCChhh--ccceee--eehHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 369 EIQRDALEIEQNMGGVYSLFAMTMQTPIAMWG---LQEAGD-SFTSEL--VDPVIV--TGIEALGRMAELDKLANFAQYM 440 (515) Q Consensus 369 Ei~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~---~~~~~~-~~p~~~--~~~~~v--~~l~~l~ra~~~~~l~~~~~~v 440 (515) .+...-. .+........+.-.+.+.-+++.+ +..... .-+.+. +.+.+- .+.+.+..++ .+..+. T Consensus 375 Al~~~~~-~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d---~~~kl~--- 447 (511) T protein:vir:78 375 AMKYKLF-GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELK---AYIDSG--- 447 (511) T ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHH---HHHHHh--- Confidence 6543221 111111222222233333333222 221111 112222 333332 2233333332 222211 Q ss_pred HHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 441 SLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 441 ~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) |. +....++..+ -+++ -.++|++.+.++++.+....+. ......+...+.-.++ T Consensus 448 G~-----------iS~et~l~~l---~~v~----d~~~El~ri~~E~~~~~~~~~~---~~~~~~~~~~~~~~~~ 501 (511) T protein:vir:78 448 GK-----------ISQTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQK---GIYKDPRDINDDEQDD 501 (511) T ss_pred cc-----------CChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHhh---ccccCCCCCCCCCCCC Confidence 11 2222233222 1222 1356666666654432222221 1111112211111111 No 69 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=98.02 E-value=7e-06 Score=48.85 Aligned_cols=474 Identities=12% Similarity=0.059 Sum_probs=196.9 Q ss_pred CCC---ccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCccccccccccHHHHHHHHHHHHHHhh Q lcl|NC_020414. 1 MQD---TILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQGVGAQATNHLANKLAQVL 77 (515) Q Consensus 1 ~~~---~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~s~l 77 (515) |.+ |+-+ -|=+++.++|.+.-+.-..+...|++-.+.+-=...+...+.....+.| |-|.|.++..+ T Consensus 1 m~~~~~~~~~--~tpe~la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~~~~~~~r~--------nl~~sni~~i~ 70 (663) T protein:vir:34 1 MNESQPTDFA--DTPQGWAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERDSAHDAETRW--------NLFSTNIQTQM 70 (663) T ss_pred CCccccccch--hcchhHHHHHHHHHHHHHhccchHHHHHHHHHHHhhccccCCCcccccc--------chhhhhHHHHh Confidence 665 5433 3668899999876554444555666655555443222222222222223 45555544332 Q ss_pred --------cCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHH--HhcCCHHHHHHHHHHHHhhCceEE- Q lcl|NC_020414. 78 --------FPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKAL--EQRQFRPAIVEVFKHLIVAGNCLL- 146 (515) Q Consensus 78 --------tpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l--~~snf~~~~~~~~~dl~~~G~~~l- 146 (515) -|.-+|=|.-. |.. -.+..-+.+|+.+...+ +..+|+..+..+..+.+..|-|++ T Consensus 71 P~iYar~P~p~V~~rf~d~--d~~------------~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~ 136 (663) T protein:vir:34 71 ASLYGQTPKVSVSRRFADA--DDD------------VARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCR 136 (663) T ss_pred hhhhcCCCcceeeecccCc--ccc------------hhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEE Confidence 11112222211 100 13444455566565566 447799999999999776555544 Q ss_pred --EEe-------------CCC-----------------cEEE--EEcceEEEee-CCCCCeeEEEEEEEecHHHHHHHhc Q lcl|NC_020414. 147 --YKP-------------SKG-----------------AMSA--VPMHHYVVNR-DTNGDLMDVILLQEKALRTFDPATR 191 (515) Q Consensus 147 --~~d-------------~~~-----------------~~r~--~pl~~y~v~~-d~~G~vd~i~r~~~~t~~ql~~~~~ 191 (515) |.. +.. ++.+ +.-.+|.+.- -..-.|+=|.++-.||-+++.+.|+ T Consensus 137 v~Ye~~~~~~~~~~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~dfl~~pAr~W~ev~wva~r~~mtk~e~~~rf~ 216 (663) T protein:vir:34 137 IRYEVEWEEVAGVDAILDEATGAELAAAVPPTQRKAYECVETDYLHWQDVLWSPARVWHEVRWLAFRNLLDMREFNARFD 216 (663) T ss_pred EEeecccchhccccccCCCccccchhcccccchhhcccceeeeeechhhcccchhhccccccceeeeccCCHHHHHHhhc Confidence 521 100 0111 1111121110 0113688888999999999999997 Q ss_pred ccccchhh---h-----c-c-----CCCcccEEEEEEEEEcCCCCeEEEEEeCCe--------eecccCCcccccCcEEE Q lcl|NC_020414. 192 MAIEVGMK---G-----K-K-----CKEDDNVKLYTHAQYAGEGFWKINQSADDI--------PVGKENRIKAEKLPFIP 249 (515) Q Consensus 192 ~~~~~~~~---~-----~-~-----~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~--------~i~~esgy~~~~~P~~~ 249 (515) .+...... . . + .+...+..|+.-.... -.+||.-++|. ..+...||- -||+.. T Consensus 217 ~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~VwEIWdK~---~~~V~w~~eg~~~~L~~~~p~lgl~~ff--PcPrpl 291 (663) T protein:vir:34 217 ADGSRNLWASVPKVGKPKDGKDGQSCHPWDRAEVWEIWDKG---GRKVDWYVEGYSAVLDTQPDPLGLESFF--PCPKPL 291 (663) T ss_pred CChhhhhhhhccCcCCccccCCCCCcchhcCcceeEEEecC---CcEEEEEEcCcceecccCCCCCCCCCCC--CCcccc Confidence 55421110 0 0 0 1111244444322222 12333333332 223344552 288887 Q ss_pred EeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccC-hhhccCCCCcceecCCc-------c Q lcl|NC_020414. 250 LTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTD-VDHFVNSGTGEVITGVE-------E 321 (515) Q Consensus 250 ~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~-~~~~~~~~~g~~~~g~~-------~ 321 (515) .-....++-+=+-..+ -|=.-++.+|.++..+-. ...+++|-++++.+...+ .+.+..+..+.++|-.. + T Consensus 292 ~~~~~~ds~ipvpd~~-~y~~~~~E~n~~t~Rin~-l~d~ikv~gvy~~~~g~~i~~~l~~a~~n~lvpV~~~~~~~~~g 369 (663) T protein:vir:34 292 LANWTTDKVVPRPDFV-LAQDLYKEIDLVSTRITL-LERAIRVVGVYDKSSGLTIGRLLSEAAQNDLIPVENWLTFADKG 369 (663) T ss_pred cceecCCCeecCCcHH-HHHHHHHHHHHHHHHHHH-HHhhhhhceeeccccchhHHHHHHHhhCCCceecchhhhhhhhc Confidence 7777666544333444 777888899988776655 578899999997544432 23354555566655321 1 Q ss_pred ----cccccccCCccchHHHHHHHHHHHHHHHHHHHHHhh---ccCCCC--CCCHHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_020414. 322 ----DIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETM---TRRDAE--RVTAVEIQRDALEIEQNMGGVYSLFAMTM 392 (515) Q Consensus 322 ----~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l---~~~~~~--~~TAtEi~~r~~E~~~~LGpv~~rl~~E~ 392 (515) .+.-+++. ....+...+-+.+..|+...+--++ .+|++. +-||||=. =|.+.++.-+...+.|+ T Consensus 370 g~~k~I~~~pi~---~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rga~~a~ETatAQ~----IKsq~gS~RIqe~qdev 442 (663) T protein:vir:34 370 GLRGVVDWFPLE---PVVAALTSLRDYRRELVDALHQVTGMADIMRGASDPRETAMAQG----VKAKFGSIRLQRLQDEV 442 (663) T ss_pred Cccchhhcccch---hHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhcccCcchhhHHHH----HHHHHHhHHHHHHHHHH Confidence 11111111 1222222233444556665542121 234432 23554421 12244444444444443 Q ss_pred HHHH---HHH-------------HHHhcCCCCCh---------hh----cc---cee-----eeehHHHH-HHHHH---H Q lcl|NC_020414. 393 QTPI---AMW-------------GLQEAGDSFTS---------EL----VD---PVI-----VTGIEALG-RMAEL---D 431 (515) Q Consensus 393 l~Pl---i~r-------------~~~~~~~~~p~---------~~----~~---~~~-----v~~l~~l~-ra~~~---~ 431 (515) ..-+ +.. +....+.++|. .+ ++ +.+ +.+ ..++ +.... . T Consensus 443 qR~arDi~ql~AEIl~~~~~~etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~-D~~~eK~~~~E~l~ 521 (663) T protein:vir:34 443 ARFASDIQRLKAEVIAEHYDVASILAQANAEFTFDKELAPKAAELIKSRFSMYRVEVKPEAVSLQ-DFAALRNEKMEVLS 521 (663) T ss_pred HHHHHHHHHHHHHHHHHhcCHHHHHHHhcCCCCcccchhHHHHHHhcCCCcceeeeeccCCCCcC-ChHHHHHHHHHHHH Confidence 2222 111 11223334441 11 11 111 111 1222 22222 2 Q ss_pred HHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchh-------ccCCHHHHHHH----------HHHHHHHHHHH Q lcl|NC_020414. 432 KLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELP-------FLKSEEEMQQE----------MAQQAQAQQEA 494 (515) Q Consensus 432 ~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~-------~irs~eev~~~----------rq~~~~~~q~~ 494 (515) .|..+++-++.+++.-|+.... .-+++......+-+-.. +.-..+++++- .+..++++|++ T Consensus 522 ~i~~~~qq~~pl~~q~p~~~p~--l~Ellk~~~~~f~~~~qie~ai~~~~~~~e~aa~~~~~~~pa~~~~~~k~~~~q~k 599 (663) T protein:vir:34 522 GIASFMQGVAPLAQQVPGSAPF--LLQMLKWSVSGLRGSSTIEGVLDKAIAAAEEAQKQAAQQSPAPQQPDPKVVAQAMK 599 (663) T ss_pred HHHHHHHHHHHHHHhhhhhHHH--HHHHHHHHhhcCChhhhHHHHHHHHHhhhHHHhhccCCCCcccchhhHHHHHHHHH Confidence 3444444454444444432221 11222222111111000 01111111110 00001111111 Q ss_pred HHHH--Hhhhhc--------cchh--hhhhccC Q lcl|NC_020414. 495 MLNE--GVAKAV--------PGVI--QQEMKEG 515 (515) Q Consensus 495 ~~~~--~~~~a~--------~~~~--~~~~~~~ 515 (515) ...+ .|+.++ ..+. +.+-++. T Consensus 600 ~q~~~aeAq~e~q~~~~~~ql~~~~~~~k~~~~ 632 (663) T protein:vir:34 600 GQQEMAKVQAEVQGDLLRIQAETQANETKERQQ 632 (663) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111 000000 0000 0000000 No 70 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=97.97 E-value=8.6e-06 Score=48.35 Aligned_cols=428 Identities=11% Similarity=0.070 Sum_probs=180.7 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc---ccCCC---CCCccccccccccHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY---LMNNK---GDNETSQNGWQGVGAQATNHLANKLA 74 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~---~~~~~---~~~~~~~~~~dst~~~a~~~Laa~l~ 74 (515) +-..+.-..-+...+.+..+..+..+. ++++++.+|.... ..... .......++..+.+...++..++-|+ T Consensus 30 ~~~~~~~~~~~~~~i~~~i~~~~~~~~---~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~ 106 (501) T protein:vir:96 30 ADNLEELMVNNWELLKNFINHHKLRQA---PRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLA 106 (501) T ss_pred ccccccccCChHHHHHHHHHHHHHHHH---HHHHHHHHHhcCCCCcccCccccCccccccceeecchHHHHHHHHhhhhc Confidence 333222222233334444444433332 3455666665442 11111 11112235566777777777776555 Q ss_pred HhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEE--EeCCC Q lcl|NC_020414. 75 QVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLY--KPSKG 152 (515) Q Consensus 75 s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~--~d~~~ 152 (515) +- |+ +++..+.. +...+.+ .+...+..++|.....++.++..++|.+.++ .+.++ T Consensus 107 g~--p~-----~~~~~~~~---------~~~~~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg 163 (501) T protein:vir:96 107 GN--PI-----RVEYDDND---------DNSQNDD-------AIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYD 163 (501) T ss_pred cc--Ce-----eEeeCCcc---------chhHHHH-------HHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCC Confidence 31 11 22322211 1122333 3444567789999999999999999998764 46555 Q ss_pred cE--EEEEcce-EEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEE Q lcl|NC_020414. 153 AM--SAVPMHH-YVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQS 228 (515) Q Consensus 153 ~~--r~~pl~~-y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e 228 (515) .+ ++++..+ |++.-+. .+++...+|.+..... ......+++| .++..+.+... T Consensus 164 ~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~------------------~~~~~~~~vy-----t~~~i~~~~~~ 220 (501) T protein:vir:96 164 ETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTL------------------QSAKDVVEIY-----TDEHIYTLDAS 220 (501) T ss_pred ceEEEEEccceeEEEEcCCCCCceEEEEEEEEeecC------------------CCcEEEEEEE-----cCCcEEEEeeC Confidence 44 4454444 4454333 4666666555432111 0001112222 22322221111 Q ss_pred eCCeeecccC-CcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccC---- Q lcl|NC_020414. 229 ADDIPVGKEN-RIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTD---- 303 (515) Q Consensus 229 ~~~~~i~~es-gy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~---- 303 (515) .+...+.... .| ..+|++.++ ++..|+|-.+...+-+..++.+.-......+....|.+.+.-..... T Consensus 221 ~~~~~~~~~~~~~--g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~ 293 (501) T protein:vir:96 221 DDFNEISVTTHAF--GTVPITEYL-----NNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQ 293 (501) T ss_pred CCceeccccccCC--CccceEEec-----CCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccc Confidence 1112222222 23 357876653 45689999999999999999888888888888888865542111111 Q ss_pred hhhccCCCCcceec-------CCcccccccccCCccchHHHHHHHHHHHHHHHHHHH-HHhhccCCCCCCCHHHHHHH-- Q lcl|NC_020414. 304 VDHFVNSGTGEVIT-------GVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-METMTRRDAERVTAVEIQRD-- 373 (515) Q Consensus 304 ~~~~~~~~~g~~~~-------g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl-~~~l~~~~~~~~TAtEi~~r-- 373 (515) ...+. ..+.+.. |...++.+-.+....+.......++.+++.|...=. .+......+...|+..+... T Consensus 294 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~ 371 (501) T protein:vir:96 294 ASDMK--RTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKLF 371 (501) T ss_pred hhhhh--hcCeeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHH Confidence 01111 1111111 111112221222223444555666666665533211 11000011233566665432 Q ss_pred -----HHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcC--CCCChhhccceee--eehHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020414. 374 -----ALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAG--DSFTSELVDPVIV--TGIEALGRMAELDKLANFAQYMSLPQ 444 (515) Q Consensus 374 -----~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~--~~~p~~~~~~~~v--~~l~~l~ra~~~~~l~~~~~~v~~~a 444 (515) +.++...++..+.++.. ++-.++.... .......+++.+- .+.+.+..++ .+..+. |. T Consensus 372 ~l~~ka~~~~~~~~~~l~~~~~-----li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad---~~~kl~---g~-- 438 (501) T protein:vir:96 372 GLDQDRVDTQSQFTKGLKRRYR-----LAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVS---ILTGLG---GQ-- 438 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHhcccccccccccceEEeCCCCCcCHHHHHH---HHHHHh---cc-- Confidence 34444555544443321 1111122111 1211122333331 2223232322 222211 11 Q ss_pred cCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHH---HhhhhccchhhhhhccC Q lcl|NC_020414. 445 TWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNE---GVAKAVPGVIQQEMKEG 515 (515) Q Consensus 445 ~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~---~~~~a~~~~~~~~~~~~ 515 (515) |....++..+ -+++ -.++|++.+.++++++.......+ ..+...+...-....+| T Consensus 439 ---------iS~et~~~~l---~~v~----D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~e~~~d~~ 496 (501) T protein:vir:96 439 ---------VSQETALSLS---GLVE----SPNEELDKINKEMSEIDFKGYSNDFNEHVGKYTDEVKETHTDDF 496 (501) T ss_pred ---------CchHHHHHhC---CCCC----CHHHHHHHHHHHHHHhhccccccchhhcccccCCcCCCCCCCcc Confidence 2223333322 1221 135677666665544322211111 11111111111111112 No 71 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=97.94 E-value=9.9e-06 Score=48.02 Aligned_cols=414 Identities=14% Similarity=0.078 Sum_probs=165.8 Q ss_pred ccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccccc-CCCCCC----ccccccccccHHHHHHHHHHHHHHhhcCCCC Q lcl|NC_020414. 8 YGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLM-NNKGDN----ETSQNGWQGVGAQATNHLANKLAQVLFPAQR 82 (515) Q Consensus 8 ~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~-~~~~~~----~~~~~~~dst~~~a~~~Laa~l~s~ltpp~~ 82 (515) ..-.++.+....+.+..++ .+...+.+|..-.-. ...+.. -...+...+-+..+++.+++.| ++.+ T Consensus 1 ~~t~~~~i~~L~~~~~~~~----~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l----~~~g- 71 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDL----PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL----DIEG- 71 (480) T ss_pred CCCHHHHHHHHHHHHHHHH----HHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhh----ccCc- Confidence 2224556666666664433 333444455332110 011100 0111233455566666666654 3322 Q ss_pred CceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--------CCCc- Q lcl|NC_020414. 83 SFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKP--------SKGA- 153 (515) Q Consensus 83 ~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d--------~~~~- 153 (515) |...- |.. ..+ .+...+..++|.....+++++..++|.+.+++- .++. T Consensus 72 --~~~~~-d~~-------------~~~-------~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~ 128 (480) T protein:vir:78 72 --FRISE-DSE-------------GLE-------ELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIP 128 (480) T ss_pred --eecCC-Cch-------------hHH-------HHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCee Confidence 22221 111 111 123345678999999999999999999876653 2222 Q ss_pred -EEEEEcceEEEeeCC--CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeC Q lcl|NC_020414. 154 -MSAVPMHHYVVNRDT--NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSAD 230 (515) Q Consensus 154 -~r~~pl~~y~v~~d~--~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~ 230 (515) +++++..+.++..|+ .+++...+|.+.-. + .......+++|+ ++ ...+|...+ T Consensus 129 ~i~~~~p~~~~~~~D~~~~~~~~~~i~~~~~~----------~--------~~~~~~~~~~y~-----~~-~~~~~~~~~ 184 (480) T protein:vir:78 129 LIRVESPLYMYAELDPRNTRRVTRAVRLYTTR----------D--------DVAVPDRATLYL-----PD-ETVPLRRNG 184 (480) T ss_pred EEEEEcccceEEEEcCCCccceEEEEEEEEee----------c--------CCCceEEEEEEe-----CC-eEEEEEecC Confidence 556666665556664 56777666555310 0 000112223332 11 111121111 Q ss_pred Cee----e-c--ccCCcccccCcEEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhccCceeecCcccc Q lcl|NC_020414. 231 DIP----V-G--KENRIKAEKLPFIPLTWKRSYGEDWGRPLVED-YSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQT 302 (515) Q Consensus 231 ~~~----i-~--~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~-~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~ 302 (515) +.. . . .+-+| ..||++.++.+...+..||+|=..+ ..+-+-.++...-.....++..+.|...+. |. T Consensus 185 ~~~~~~~~~~~~~~~~~--g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~--G~- 259 (480) T protein:vir:78 185 GLNDQWVVDGDVIKHGL--GVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GV- 259 (480) T ss_pred CCccccccccccccCCC--CCcceEEeecccccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhh--cC- Confidence 110 0 1 12334 3599999999888899999997665 346566677776666777776666643331 11 Q ss_pred Chhhcc--------CCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHH-----HhhccCCCCCCCHHH Q lcl|NC_020414. 303 DVDHFV--------NSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMM-----ETMTRRDAERVTAVE 369 (515) Q Consensus 303 ~~~~~~--------~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~-----~~l~~~~~~~~TAtE 369 (515) .++.+. ....|.+..-..++....+++ .++++.. ++.++.-|...+.. ..+.......-++.- T Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~---~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~A 335 (480) T protein:vir:78 260 TTDELTNDGENTTLDIYYGRILTLASEAAKISEFK-AAELRNF---AEEMEVFRKEAASITGLPPQYLSSSSENPASAEA 335 (480) T ss_pred CccccccccccchhhhhhhhhccCCCCCceEEecC-ccCHHHH---HHHHHHHHHHHhcccCCChHHhccccCcchHHHH Confidence 111110 111122211111233333443 2344433 33344433333211 011111111123433 Q ss_pred HHH-------HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeeehHHH--HHHHHHHHHHHHHHHH Q lcl|NC_020414. 370 IQR-------DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTGIEAL--GRMAELDKLANFAQYM 440 (515) Q Consensus 370 i~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~l~~l--~ra~~~~~l~~~~~~v 440 (515) ++. .++++...+++-+.++-. .++.-.+-..+.+..++. ++.-.+. .-++.++.+..+.+ T Consensus 336 lk~~~~~l~~ka~~~~~~f~~~l~~~~~--------l~~~~~g~~~~~~~~~i~-v~f~~~~~~s~~~~ad~~~kl~~-- 404 (480) T protein:vir:78 336 IIATDSRIVKMAERKGRIFGGAWERAMR--------IAMQIMGREVTEEYTRLE-TVWRDPSTPTVAAKADAVSKLYA-- 404 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHcCCCccccceeee-EEecCCCCCCHHHHHHHHHHHHH-- Confidence 322 223444444443333211 111111222222222222 2222211 12222333322222 Q ss_pred HHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHH--HHHHHH---Hhhhhccchhhhhh-cc Q lcl|NC_020414. 441 SLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQ--EAMLNE---GVAKAVPGVIQQEM-KE 514 (515) Q Consensus 441 ~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q--~~~~~~---~~~~a~~~~~~~~~-~~ 514 (515) .... .+..+. +...+|.. +++++.+.+.+++..+ ..++.. +.+.+.+...+++. .+ T Consensus 405 -~g~~-------~~s~et----~~~~lg~~------~d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 466 (480) T protein:vir:78 405 -NGQG-------PIPKEQ----ARIDLGYT------ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTE 466 (480) T ss_pred -hccc-------cCCHHH----HHhcCCCC------HhHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCCCCCCCCCc Confidence 1110 112221 22224432 4455554432221111 111111 11111111111111 11 Q ss_pred C Q lcl|NC_020414. 515 G 515 (515) Q Consensus 515 ~ 515 (515) . T Consensus 467 ~ 467 (480) T protein:vir:78 467 T 467 (480) T ss_pred c Confidence 1 No 72 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=97.94 E-value=1e-05 Score=47.99 Aligned_cols=427 Identities=11% Similarity=0.127 Sum_probs=176.7 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCcccccccc--ccHHHHHHHHHHHHHHhhc Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQ--GVGAQATNHLANKLAQVLF 78 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~d--st~~~a~~~Laa~l~s~lt 78 (515) |.+|.- .-+-..+. .+..++......|+.+|+=-.|.+-.....+.+..+... +.+...++.+|+-+.+-.. T Consensus 20 ~~~~~~-----~~~~~~~i-~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~~i~~~~A~lv~~e~~ 93 (508) T protein:vir:15 20 VTGSLS-----KITDDPRI-SIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAKTAARRIASVVFNEKA 93 (508) T ss_pred cccchH-----Hhhccccc-ccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchHHHHHHHHHhhhhCCCc Confidence 222210 00000111 112233344556666665544432111112222223333 4556667777666644321 Q ss_pred CCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC-cEE Q lcl|NC_020414. 79 PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSKG-AMS 155 (515) Q Consensus 79 pp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~~-~~r 155 (515) . |.+. +. +...++|. ..+..++|+..+.+++.+..++|.+++ |+|.+. .+. T Consensus 94 ~-----i~v~--~~------------~~~~e~l~-------~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~i~ 147 (508) T protein:vir:15 94 E-----IHVK--DN------------NEADKFLN-------DVLEDNDFKNKFEEALEKGVALGGFAMRPYIDGNHIKIA 147 (508) T ss_pred e-----EEeC--Cc------------hHHHHHHH-------HHHHhccHHHHHHHHHHHHhhcCceEEEEEEeCCeeEEE Confidence 1 2221 11 11233444 457779999999999999999999875 667543 266 Q ss_pred EEEcceEE-EeeCCCCCeeEEE-EEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEE---cCCCCeE----EE Q lcl|NC_020414. 156 AVPMHHYV-VNRDTNGDLMDVI-LLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQY---AGEGFWK----IN 226 (515) Q Consensus 156 ~~pl~~y~-v~~d~~G~vd~i~-r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~---~~~~~~~----~~ 226 (515) +++...++ +..|..+....+| ++++.+- +..-.+|+.++. .+++.+. +| T Consensus 148 ~v~ad~~~P~~~d~~~~~~~af~~~~~~~~----------------------~~~~~~yt~lE~h~~~~~~~~~I~n~ly 205 (508) T protein:vir:15 148 WVRADQFYPLQSNTNDISEAAIASRTQRTE----------------------SNQTKYYTLLEFHQWQDNGSYQITNELY 205 (508) T ss_pred EEcCCeeEEEEEcCCCeEEEEEEEEEEeec----------------------CCCceEEEEEEEEEEecCcceEEEEEEE Confidence 77777766 4556544333333 2222100 011123443332 2222222 22 Q ss_pred EEeC----Ceee-cc---------c----CCcccccCcEEEEeee----ecCCCccccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 227 QSAD----DIPV-GK---------E----NRIKAEKLPFIPLTWK----RSYGEDWGRPLVEDYSGDLFVIQFLSEAVAR 284 (515) Q Consensus 227 ~e~~----~~~i-~~---------e----sgy~~~~~P~~~~Rw~----~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~ 284 (515) ..-+ |.++ +. + .|+. .-||..++.. ...++.||+|-...+.+-+..||..--.... T Consensus 206 ~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~~--~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~ 283 (508) T protein:vir:15 206 KSDSPDIVGNQVPLSTLPVYKELAPQVTISGLQ--RPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIW 283 (508) T ss_pred ecCCchhcCcccchhhcccccCCCcceEecCCC--cceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHH Confidence 2211 2221 11 1 2321 2344444432 2336789999999999999999987777666 Q ss_pred HHHHhccCceeecCccccChhh--ccCCCCcc--e--ecCCccc---ccccccCCccchHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_020414. 285 GAALMADIKYLIRPGSQTDVDH--FVNSGTGE--V--ITGVEED---IHIVQLGKYADLTPISAVLEVYTRRIGVIFMM- 354 (515) Q Consensus 285 ~~~~a~~p~~l~~~~g~~~~~~--~~~~~~g~--~--~~g~~~~---v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~- 354 (515) -. ...++...+++ ++++.+. ...-..+. + +.+..++ +..++ ..-....-...++.+.+.|....-+ T Consensus 284 e~-~~~~~~i~v~~-~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~--~~ir~e~~~~~~~~~l~~~~~~~gls 359 (508) T protein:vir:15 284 EI-RLGQKHIAVQP-GMLRFDDEHKPTFDTEQNVYVGVLSDDNNGLGVKDMT--TPIRTVQYKDAIDHFIKEFEVQIGLS 359 (508) T ss_pred HH-Hhcccceeech-HHhcCCCCCccccCCCCeeEEeccCCCCCCCceeEee--cccChHHHHHHHHHHHHHHHHHhCCC Confidence 55 56666655543 3333221 00000111 1 1111111 11111 0001112233444444444333211 Q ss_pred -HhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH--h----cCC-------CCChhhccce--ee Q lcl|NC_020414. 355 -ETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ--E----AGD-------SFTSELVDPV--IV 418 (515) Q Consensus 355 -~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~--~----~~~-------~~p~~~~~~~--~v 418 (515) .++........|||||....+...+...- ..+.-..-+..|++.++. . ... ..+.....++ += T Consensus 360 ~~~f~~~~~~~~TAtei~s~~~~~~~t~~~-~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~ 438 (508) T protein:vir:15 360 TGTFSYSNDGVKTATEVVSNNSMTYQTRSS-YLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFD 438 (508) T ss_pred chhcccccCccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeC Confidence 11222222335999998888777776654 333333344455443321 1 111 1122222222 21 Q ss_pred eehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 419 TGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNE 498 (515) Q Consensus 419 ~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~ 498 (515) .++.+-.. +. +.+..+.++ +++ +....++ ....| -|++|++++.++.++.+....... T Consensus 439 D~i~~d~~-~~---~~~~~~~v~--aGi-------~s~e~~i---~~~~g------~~deea~~el~ri~~E~~~~~~~~ 496 (508) T protein:vir:15 439 DGVFVNKD-KQ---LEEDAKVLA--IGA-------LSKQTFL---QRNYG------MTDEQAAEELAKIQSEAPTDTFEG 496 (508) T ss_pred CCCCCCHH-HH---HHHHHHHHh--cCC-------CCHHHHH---HhcCC------CChHHHHHHHHHHHHhccccCccc Confidence 22211111 11 111112111 121 1122222 22334 346666665554333222111111 Q ss_pred Hhhhhccchhhh Q lcl|NC_020414. 499 GVAKAVPGVIQQ 510 (515) Q Consensus 499 ~~~~a~~~~~~~ 510 (515) +.....+|.-|. T Consensus 497 ~~~~~~~g~~ge 508 (508) T protein:vir:15 497 GRSAILNGGDGE 508 (508) T ss_pred cccccCCCCCCC Confidence 111112222222 No 73 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=97.94 E-value=1e-05 Score=47.99 Aligned_cols=416 Identities=11% Similarity=0.009 Sum_probs=169.0 Q ss_pred CCCccccccccHHHHHHHHH-HHHHhhhhHHHHHHHHHHhhccccc-CCCC--CCccc-----cccccccHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWE-KFSKKRSPYLDRAKHFAKLTLPYLM-NNKG--DNETS-----QNGWQGVGAQATNHLAN 71 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~-~lk~~R~~~e~~w~e~~~~~~P~~~-~~~~--~~~~~-----~~~~dst~~~a~~~Laa 71 (515) |-+.-. ..++.+.+.+... +|-.+.....++++.+.+|..-..- .... ..... .++.-+-+..+++.+++ T Consensus 1 ~~~~p~-~~l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~ 79 (479) T protein:vir:99 1 MIDLPD-EDLSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQ 79 (479) T ss_pred CccCCc-ccCChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHHh Confidence 777664 3677777765443 3333333444566666666533210 0000 00000 01123445555665555 Q ss_pred HHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe-- Q lcl|NC_020414. 72 KLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKP-- 149 (515) Q Consensus 72 ~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d-- 149 (515) .| +|.+ |+. .+... ...+. ..+..++|....++++++..++|.+.+++- T Consensus 80 ~l----~~~g---f~~--~d~~~---------~~~~~-----------~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~ 130 (479) T protein:vir:99 80 QL----IVDG---YRK--TGTNE---------NAKGW-----------DTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSG 130 (479) T ss_pred hc----cccc---ccC--CCchh---------hHHHH-----------HHHHhcChhHHHHHHHHHHhhcCceEEEEecC Confidence 33 4544 332 22111 11122 334567899999999999999999877653 Q ss_pred -----CCCc--EEEEEc-ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCC Q lcl|NC_020414. 150 -----SKGA--MSAVPM-HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEG 221 (515) Q Consensus 150 -----~~~~--~r~~pl-~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~ 221 (515) .+.. +++++- .-+++..|+......+|. ++.+ .+..+.+|+. . T Consensus 131 ~~~~d~~g~~~i~~~~p~~~~~iydd~~~~~~~~~~---~~~~--------------------~~~~~~~~~~-----~- 181 (479) T protein:vir:99 131 ISPLDGTTVARIKCIDPRDAFAIWEDPYWDEWPKYL---LERQ--------------------PNGQYWWWTE-----E- 181 (479) T ss_pred CCCcCCCCceEEEEechhheEEEecCCcccceeeEE---Eeec--------------------CceeEEEEec-----c- Confidence 2222 455543 335555454432222221 1110 1111222210 0 Q ss_pred CeEEEEEeCCe-ee--cccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecC Q lcl|NC_020414. 222 FWKINQSADDI-PV--GKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRP 298 (515) Q Consensus 222 ~~~~~~e~~~~-~i--~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~ 298 (515) .+.+|...++. .+ ...-+| ..||++.++-+...+ .+|+|=.+..++-+-.++...-.....++.-+.|.+.+.- T Consensus 182 ~~~~~~~~~~~~~~~~~~~h~~--g~vPvv~f~n~~~~~-~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G 258 (479) T protein:vir:99 182 DYSIFEFKQGKFIYRETVSHDY--GHIPFVRYVNVMDLR-GVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATG 258 (479) T ss_pred eEEEEEecCCceeeccccccCC--CCcceEEeecCCCcC-cCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcC Confidence 01111111111 11 112234 358999988776663 5899988888888888888888888888877777533321 Q ss_pred ----cc-ccChhhccCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhcc---CCCCCCCHHHH Q lcl|NC_020414. 299 ----GS-QTDVDHFVNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTR---RDAERVTAVEI 370 (515) Q Consensus 299 ----~g-~~~~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~---~~~~~~TAtEi 370 (515) +. ..+........ +.++.-..+++...+++ .++++. .++.++.-|...+....+.. ......++.-+ T Consensus 259 ~~~~~~~~~~~~~~~~~~-~~i~~~~~~~~~~~q~~-~~~~~~---~~~~l~~~i~~i~~~t~~p~~~~g~~~n~Sg~Al 333 (479) T protein:vir:99 259 LMLPEGANADQEKMRFAQ-ESMLISQNEKASFGAIP-AAPLDG---LLNAYKESLLEFLALAQLPPHIAGQIVNVAADAL 333 (479) T ss_pred CCcccccccchhcccccc-ccceeecCCCceEEEec-ccchHH---HHHHHHHHHHHHhccCCCCHHHcccccchHHHHH Confidence 00 01111111111 12222112233333333 233433 33333333333221100000 00112355444 Q ss_pred HHH-------HHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceee----eehHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 371 QRD-------ALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIV----TGIEALGRMAELDKLANFAQY 439 (515) Q Consensus 371 ~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v----~~l~~l~ra~~~~~l~~~~~~ 439 (515) ... ++.+.+.+++.+.++-. .++.-.+-..+.......+. ..-+..+.++...+|. + T Consensus 334 ~~~~~~l~~ka~~~~~~f~~al~~~~~--------l~~~~~~~~~~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~---~- 401 (479) T protein:vir:99 334 AAGTRQTMQKLFEKQATWKASHNQTMR--------LVNKIEGRTEEATDLDFTITWQDVTIQSLAQFADAWAKMV---E- 401 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHH---h- Confidence 332 34444444444443322 22221111112222222221 1122222222222211 1 Q ss_pred HHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhh----hh--ccchhhhhhc Q lcl|NC_020414. 440 MSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVA----KA--VPGVIQQEMK 513 (515) Q Consensus 440 v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~----~a--~~~~~~~~~~ 513 (515) +. .+..+.++.. ..|++ +++++.+++.++...+..++++... ++ .+++.++... T Consensus 402 ----ag-------~is~et~l~~---l~gv~------~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (479) T protein:vir:99 402 ----SL-------KIPAEGVWDM---IPNLD------QSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNM 461 (479) T ss_pred ----cC-------CCCHHHHHHh---cCCCC------HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCC Confidence 11 1222222222 23554 4555555443333222222222211 11 1111111111 Q ss_pred cC Q lcl|NC_020414. 514 EG 515 (515) Q Consensus 514 ~~ 515 (515) +. T Consensus 462 ~~ 463 (479) T protein:vir:99 462 QQ 463 (479) T ss_pred CC Confidence 11 No 74 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=97.81 E-value=1.7e-05 Score=46.70 Aligned_cols=420 Identities=10% Similarity=0.048 Sum_probs=178.1 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc-----c---cCCCC--CC-ccccccccccHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY-----L---MNNKG--DN-ETSQNGWQGVGAQATNHL 69 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~---~~~~~--~~-~~~~~~~dst~~~a~~~L 69 (515) |...+...-.+.+.|.+..+..+.++ .+++.+.+|..-. . ....+ +. ....|+..+-+...++.+ T Consensus 35 ~~~~~~~~~~~~~~i~~~i~~~~~~~----~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~ 110 (492) T protein:vir:94 35 IVRTNNKPETLEEMIVRYIKQHLEKL----PEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQK 110 (492) T ss_pred ccccCCchhhHHHHHHHHHHHHHHHH----HHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHH Confidence 33222222235566666556555433 3445555554321 0 00000 11 111245567777788887 Q ss_pred HHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEE-- Q lcl|NC_020414. 70 ANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLY-- 147 (515) Q Consensus 70 aa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~-- 147 (515) ++-|++ -| +.++..+.. +.+.|.. .+ .++|.....++.++..++|.+.++ T Consensus 111 ~~yl~G--~p-----~~~~~~d~~-------------~~~~l~~-------~~-~n~~~~~~~~~~~~a~~~G~a~~~v~ 162 (492) T protein:vir:94 111 VSYIVG--KP-----IAFKHTDDE-------------VVKRIDE-------VL-GNRFDDKLHSVLTGASNKGIEWLHPY 162 (492) T ss_pred Hhhhcc--cC-----ceeccCchH-------------HHHHHHH-------HH-hccHHHHHHHHHHHHhhCCeEEEEEE Confidence 766543 12 122332221 1222211 12 357888899999999999998654 Q ss_pred EeCCCcE--EEEEc-ceEEEee-CCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEE------EEE Q lcl|NC_020414. 148 KPSKGAM--SAVPM-HHYVVNR-DTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTH------AQY 217 (515) Q Consensus 148 ~d~~~~~--r~~pl-~~y~v~~-d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~------v~~ 217 (515) .|.++.+ ++++- .-|++.- +..+++...+|.+...- ...+++|+. +.. T Consensus 163 ~d~dg~~~~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~----------------------~~~~~~y~~~~v~~~~~~ 220 (492) T protein:vir:94 163 LDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN----------------------ETKVEYWDKVTVNYYVYE 220 (492) T ss_pred ecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecc----------------------ceeEEEEecCeEEEEEEe Confidence 4655544 44533 4455543 34677776666654210 112333331 111 Q ss_pred cCCCCeEEEEEeCCeeecc-cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceee Q lcl|NC_020414. 218 AGEGFWKINQSADDIPVGK-ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLI 296 (515) Q Consensus 218 ~~~~~~~~~~e~~~~~i~~-esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~ 296 (515) ...-...+-.+.++.++.. ..+| ..+|++..+- +.+|.|=.+..++-+..++.+.-......+....|.+.+ T Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~~--g~vPvv~~~n-----n~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~ 293 (492) T protein:vir:94 221 NGSLIPDYSNNLENSKTHFSTGSW--GKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL 293 (492) T ss_pred cCeeeeccccccccccccccccCC--CccceEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeee Confidence 1000000001111112211 1234 3478776643 457999899999999999988888888888888886554 Q ss_pred cCccccChhhcc-C-CCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhc-cCCCCCCCHHHHHH- Q lcl|NC_020414. 297 RPGSQTDVDHFV-N-SGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMT-RRDAERVTAVEIQR- 372 (515) Q Consensus 297 ~~~g~~~~~~~~-~-~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~-~~~~~~~TAtEi~~- 372 (515) .--...+..... . ...+.+.-+..+++..+. ...+.......++.++..|...-..-.+. ..-+...|+.-+.. T Consensus 294 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~ 371 (492) T protein:vir:94 294 KNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQ--VEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFL 371 (492) T ss_pred ecCCcccchhhHHHHhhccceecCCCCcceeEe--ccCCHHHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHH Confidence 211111111110 0 111222223333444433 23455666777777777665432210001 11112345544432 Q ss_pred ------HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee--eeehHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020414. 373 ------DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI--VTGIEALGRMAELDKLANFAQYMSLPQ 444 (515) Q Consensus 373 ------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~--v~~l~~l~ra~~~~~l~~~~~~v~~~a 444 (515) +..++...++..+.++ ++.+..-..-......+.+.+ ..+.+.+..+ +.+.. ++ T Consensus 372 ~~~l~~k~~~k~~~f~~~l~~~--------~~li~~~~~~~~~~~~i~v~f~~~~p~~~~e~~---~~~~k-------l~ 433 (492) T protein:vir:94 372 YTNLNLKADKLARKAKVAIQEL--------LWFVFEHFDIKGEHKDVDISFNYNKVANTELQV---QTAQQ-------SM 433 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhcCCcccceeeEEecCCCCCCHHHHH---HHHHH-------Hh Confidence 2344444444444332 222221111111111222222 1222222222 22211 11 Q ss_pred cCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccch-hhhhhccC Q lcl|NC_020414. 445 TWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGV-IQQEMKEG 515 (515) Q Consensus 445 ~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~-~~~~~~~~ 515 (515) + .+....++..+ -+++ -.++|++.+.++++++++..+.........+.. .+....++ T Consensus 434 g-------iiS~et~~~~l---~~v~----d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 491 (492) T protein:vir:94 434 G-------IVSHETVLENH---PFVE----DLQAELERIEQEQMEYNKQLPNLDDGGADSAQQQERSNNKES 491 (492) T ss_pred c-------cCchHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHhhccccccccCCCCccccCCccccC Confidence 1 11122222222 1222 235677777766554443322211111110000 01111111 No 75 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=97.81 E-value=1.8e-05 Score=46.66 Aligned_cols=415 Identities=10% Similarity=0.059 Sum_probs=178.3 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc--ccC------CCCCC---ccccccccccHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY--LMN------NKGDN---ETSQNGWQGVGAQATNHL 69 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~--~~~------~~~~~---~~~~~~~dst~~~a~~~L 69 (515) |.......-.+.+.|.+..+..+.++ .+++.+.+|..-. ... ..+.. ....|+..+-+...++.. T Consensus 35 ~~~~~~~~~~~~~~i~~~i~~~~~~~----~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~ 110 (492) T protein:vir:97 35 IVRTNNKPETLEEMIVRYIKQHLEKL----PEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQK 110 (492) T ss_pred cccCCCchhhHHHHHHHHHHHHHHHH----HHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHHH Confidence 33322222335666666666555433 4445555554331 000 00000 111245567777778877 Q ss_pred HHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--E Q lcl|NC_020414. 70 ANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--Y 147 (515) Q Consensus 70 aa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~ 147 (515) ++-|++ .| +.++..++. +.+.|. ..+ .++|.....++.++...+|.|.+ | T Consensus 111 ~~yl~g--~p-----~~~~~~d~~-------------~~~~l~-------~~~-~n~~~~~~~~~~~~~~~~G~a~~~v~ 162 (492) T protein:vir:97 111 VSYIVG--KP-----IAFKHTDDE-------------VVKRID-------EVL-GNRFDDKLHSVLTGASNKGIEWLHPY 162 (492) T ss_pred hhhhcc--cC-----ceeccCchH-------------HHHHHH-------HHH-hccHHHHHHHHHHHHhhcCeEEEEEE Confidence 766543 22 123333321 112221 122 36888999999999999999865 4 Q ss_pred EeCCCcE--EEEEcc-eEEEeeC-CCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEE------EEE Q lcl|NC_020414. 148 KPSKGAM--SAVPMH-HYVVNRD-TNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTH------AQY 217 (515) Q Consensus 148 ~d~~~~~--r~~pl~-~y~v~~d-~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~------v~~ 217 (515) .|.++.+ ++++.. -|++.-| ..+++...+|.+...- ...+++|+. +.. T Consensus 163 ~d~dg~~~~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~----------------------~~~~~~y~~~~v~~~~~~ 220 (492) T protein:vir:97 163 LDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN----------------------ETKVEYWDKVTVNYYVYE 220 (492) T ss_pred ecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecc----------------------ceeEEEEecCeEEEEEEe Confidence 5655544 445444 4555443 4677877777664210 112333321 111 Q ss_pred cCCCCeEEEEEeCCeeec-ccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceee Q lcl|NC_020414. 218 AGEGFWKINQSADDIPVG-KENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLI 296 (515) Q Consensus 218 ~~~~~~~~~~e~~~~~i~-~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~ 296 (515) ...-...+..+.+...+. ...+| ..+|++.++. +.+|+|=.+...+-+..++.+--......+....|.+.+ T Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~~--g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~ 293 (492) T protein:vir:97 221 NGSLIPDYSNNLENSKTHFSTGSW--GKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL 293 (492) T ss_pred cCeeeecccccccccccccccCCC--CCcceEEecC-----CCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeee Confidence 110000011111111111 12234 3478776654 357899888999999999988777777778888886554 Q ss_pred cCccccChhhcc-C-CCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHH-HhhccCCCCCCCHHHHHH- Q lcl|NC_020414. 297 RPGSQTDVDHFV-N-SGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMM-ETMTRRDAERVTAVEIQR- 372 (515) Q Consensus 297 ~~~g~~~~~~~~-~-~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~-~~l~~~~~~~~TAtEi~~- 372 (515) .-....+..... . ...+.+.-+..+++..+. ...+.......++.+++.|...-.. +.....-+...|+.-+.. T Consensus 294 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~ 371 (492) T protein:vir:97 294 KNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQ--VEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFL 371 (492) T ss_pred ecCCcccchhHHHHHhhccceecCCCCcceeEe--ccCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHH Confidence 211111111110 1 111222222233444433 2235566667777777666443211 000011112345544322 Q ss_pred ------HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee--eeehHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020414. 373 ------DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI--VTGIEALGRMAELDKLANFAQYMSLPQ 444 (515) Q Consensus 373 ------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~--v~~l~~l~ra~~~~~l~~~~~~v~~~a 444 (515) ++.++...++.-+.+ +++.+..-..-......+.+.+ ..+.+.+..++ .+... + T Consensus 372 ~~~l~~ka~~~~~~f~~~l~~--------~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~---~~~kl-------~ 433 (492) T protein:vir:97 372 YTNLNLKADKLARKAKVAIQE--------LLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQ---TAQQS-------M 433 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHhcCCcccceeeEEecCCCCCCHHHHHH---HHHHH-------h Confidence 223344444443333 2222221111111112223222 12222222222 22211 1 Q ss_pred cCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 445 TWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 445 ~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) + .+....+++.+ -+++ -.++|++.+.++.+++++..+.....+. ..+.-.++ T Consensus 434 G-------~iS~et~l~~l---~~v~----d~~~Eleri~~E~~~~~~~~~~~~~~~~-----~~~~~~~~ 485 (492) T protein:vir:97 434 G-------IVSHETVLENH---PFVE----DLQAELERIEQEQTEYNKQLPNLDDGGA-----DSAQQQER 485 (492) T ss_pred c-------cCchHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHhhhccccCCC-----CCCccccc Confidence 1 12222222222 1122 1356777777665544333222111111 11111111 No 76 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=97.75 E-value=2.2e-05 Score=46.09 Aligned_cols=422 Identities=13% Similarity=0.072 Sum_probs=158.4 Q ss_pred CCCccc---cccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcc-----cccCCCCCCccccccccccHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTIL---EYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLP-----YLMNNKGDNETSQNGWQGVGAQATNHLANK 72 (515) Q Consensus 1 ~~~~~~---~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P-----~~~~~~~~~~~~~~~~dst~~~a~~~Laa~ 72 (515) |.-+.+ ..-.+...+....+.+..++ ++.+.+.+|..- .+.......-+..+..-+-+..+++++++. T Consensus 1 ~~~~i~~~~~~~~~~~~~~~L~~~~~~~~----~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~ 76 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDEMVSAFEDQN----QNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAER 76 (485) T ss_pred CCCCCCCCCcccchHHHHHHHHHHHHHHH----HHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHHHhhh Confidence 221111 11123333333334443332 333334444322 111000000011123334555666666655 Q ss_pred HHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--C Q lcl|NC_020414. 73 LAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKP--S 150 (515) Q Consensus 73 l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d--~ 150 (515) | ++.+ | .+.-++.. ...++ ..+..++|.....++.++..++|.+.+++- . T Consensus 77 l----~~~g--~-~~~~~~~~----------~~~l~-----------~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~ 128 (485) T protein:vir:24 77 Q----AVEG--F-RLGDADEA----------DEELW-----------QWWQANNLDIEAPLGYTDAYVHGRSYITISRPD 128 (485) T ss_pred h----ccCc--e-ecCCCchh----------HHHHH-----------HHHHhcChhHHHHHHHHHHhhcCceEEEEecCC Confidence 4 3322 2 22211110 11122 334567899999999999999999987663 2 Q ss_pred CC----------cEEEEEcceEEEeeC-CCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcC Q lcl|NC_020414. 151 KG----------AMSAVPMHHYVVNRD-TNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAG 219 (515) Q Consensus 151 ~~----------~~r~~pl~~y~v~~d-~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~ 219 (515) +. .+++++-.+.++..| ..+++...++.+.-. .......+++| .+ T Consensus 129 ~~~~~~~~~~~~~i~~~~p~~~~~i~D~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~y-----~~ 184 (485) T protein:vir:24 129 PQIDLGWDPNVPLIRVEPPTRMYAEIDPRIGRPAKAIRVAYDA-------------------EGNEIQAATLY-----TP 184 (485) T ss_pred cccccccCCCcceEEEeccceeEEEeeCCcCceeEEEEEEEee-------------------cCCeEEEEEEE-----cC Confidence 21 245555455444444 456666655544310 00111112222 22 Q ss_pred CCCeEEEEEeCCeeec--c-cCCcccccCcEEEEeeeecCCCccccchHHHH-HHHHHHHHHHHHHHHHHHHHhccCcee Q lcl|NC_020414. 220 EGFWKINQSADDIPVG--K-ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDY-SGDLFVIQFLSEAVARGAALMADIKYL 295 (515) Q Consensus 220 ~~~~~~~~e~~~~~i~--~-esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~-l~d~k~L~~l~~~~~~~~~~a~~p~~l 295 (515) +. ...|...++.... . +-+| +.+|++.++.+...+..||+|-..+. .+-+..++...-.....++..+.|-.. T Consensus 185 ~~-~~~~~~~~~~~~~~~~~~h~~--g~vPvv~f~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~ 261 (485) T protein:vir:24 185 NE-TFGWFRAEGEWVEWFSDPHGL--GAVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRL 261 (485) T ss_pred Cc-EEEEEecCCceEeecccccCC--CcccEEEeccCcccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhh Confidence 21 1112222332211 1 2234 45999999988888889999976653 344555666655666666666666433 Q ss_pred ec---Ccccc-C---hhhccCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHH-----HhhccCCCC Q lcl|NC_020414. 296 IR---PGSQT-D---VDHFVNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMM-----ETMTRRDAE 363 (515) Q Consensus 296 ~~---~~g~~-~---~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~-----~~l~~~~~~ 363 (515) +. ++... . ...+....+|.+..-..+++...++. .++++ .-++.++.-|...... ..+.....- T Consensus 262 i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~-~~~~e---~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n 337 (485) T protein:vir:24 262 IFGIKPEEIGVDPETGQTLFDAYLARILAFEDAEGKIQQFS-AAELA---NFTNALDQIAKQVAAYTGLPPQYLSTAADN 337 (485) T ss_pred hccCCccccccccccccchhhhcccceeccCCCCceEEeec-ccchH---HHHHHHHHHHHHHhcccCCCHHHhccccCc Confidence 21 11000 0 00111112333222111233333332 22333 3334444444333211 011111111 Q ss_pred CCCHHHHHH-------HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcc--ceee--eehHHHHHHHHHHH Q lcl|NC_020414. 364 RVTAVEIQR-------DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVD--PVIV--TGIEALGRMAELDK 432 (515) Q Consensus 364 ~~TAtEi~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~--~~~v--~~l~~l~ra~~~~~ 432 (515) ..++.-+.. +++++.+.++..+.++-. ++..++ + ....+....+ +.+- .+.+-++.+....+ T Consensus 338 ~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~-----l~~~~~-~-~~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~k 410 (485) T protein:vir:24 338 PASAEAIRAAESRLIKKVERKNAIFGGAWEEAMR-----LAYRLM-K-GGDVPPDMLRMETVWRDPSTPTYAAKADAATK 410 (485) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHh-c-CCCCccccceeeEEecCCCCCCHHHHHHHHHH Confidence 123433322 234444444444444322 111111 1 1222222222 2221 12233333333333 Q ss_pred HHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHH-HHHHHHHHHHhhhhccc----h Q lcl|NC_020414. 433 LANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQ-AQQEAMLNEGVAKAVPG----V 507 (515) Q Consensus 433 l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~-~~q~~~~~~~~~~a~~~----~ 507 (515) |. + .... .+..+.+ .+.+|.. +++++++++.+.+ ..+..+.......+... . T Consensus 411 l~---~---~g~~-------~~s~et~----~~~l~~~------~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~ 467 (485) T protein:vir:24 411 LY---G---NGQG-------VIPRERA----RKDMGYS------IAEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSP 467 (485) T ss_pred HH---h---cccc-------cCCHHHH----HhhCCCC------HhHHHHHHHHHHHHhhhhhhHHHhhcccCCCCCCCC Confidence 22 1 1001 1111222 2334542 4444444332221 11222111111111110 0 Q ss_pred hhhhhccC Q lcl|NC_020414. 508 IQQEMKEG 515 (515) Q Consensus 508 ~~~~~~~~ 515 (515) .+.+.+.+ T Consensus 468 ~~~e~~~~ 475 (485) T protein:vir:24 468 NPTPAPKP 475 (485) T ss_pred CCCCCCCC Confidence 11111111 No 77 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=97.74 E-value=2.3e-05 Score=46.01 Aligned_cols=425 Identities=11% Similarity=0.053 Sum_probs=185.7 Q ss_pred CCCcc---ccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcc-----c-----ccCC-CC------CCccccccccc Q lcl|NC_020414. 1 MQDTI---LEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLP-----Y-----LMNN-KG------DNETSQNGWQG 60 (515) Q Consensus 1 ~~~~~---~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P-----~-----~~~~-~~------~~~~~~~~~ds 60 (515) |-++. ....++.+.|.+..+.-+..|..+...++.+-.+..- . +... .+ ......|+..+ T Consensus 3 ~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n 82 (474) T protein:vir:10 3 LYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNS 82 (474) T ss_pred hHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccc Confidence 22221 1224577777777777666666555544444332211 0 0000 00 01111244455 Q ss_pred cHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHh Q lcl|NC_020414. 61 VGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIV 140 (515) Q Consensus 61 t~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~ 140 (515) -+...++..++-|++- |+. +...+. .....+++++|. ..+..++|.....++.++..+ T Consensus 83 ~~~~ivd~~~~yl~g~--pv~-----~~~~~~--------~~~~e~~~~~l~-------~~~~~n~~~~~~~~~~~~~~~ 140 (474) T protein:vir:10 83 FDSEIVDTRVGYLHGV--PVT-----YDLDEN--------AEKNEKLKKFIT-------NFAIRNSVDDEDSEIGKMAAI 140 (474) T ss_pred hHHHHHHhHhhheecc--cee-----EeeCCC--------CcchHHHHHHHH-------HHHhhcCHhHHHHHHHHHHhh Confidence 5555566555544331 332 222111 011223444433 346668899999999999999 Q ss_pred hCceEEE--EeCCCcEE--EEEcce-EEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEE Q lcl|NC_020414. 141 AGNCLLY--KPSKGAMS--AVPMHH-YVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHA 215 (515) Q Consensus 141 ~G~~~l~--~d~~~~~r--~~pl~~-y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v 215 (515) +|.|.++ .+.++.++ +++..+ |++- |..+.....+|.+...- ......+++.- T Consensus 141 ~G~a~~~~~~d~~~~~~~~~i~p~~~~~v~-d~~~~~~~~i~~~~~~~---------------------~~~~~~~~~~~ 198 (474) T protein:vir:10 141 CGYGARLAYIDTNGDIRIKNIDPYNVIFVG-DNILEPTYSLRYFYEKD---------------------DDNGTDYVYAE 198 (474) T ss_pred cCeEEEEEEeCCCCeeEEEEEcccceEEEE-cCCCceEEEEEEEEEee---------------------CCCceEEEEEE Confidence 9998754 46665444 454444 5454 66666655454443210 01111222222 Q ss_pred EEcCCCCeEEEEEeCCe----eecc-cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020414. 216 QYAGEGFWKINQSADDI----PVGK-ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMA 290 (515) Q Consensus 216 ~~~~~~~~~~~~e~~~~----~i~~-esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~ 290 (515) .++++. .+++..++. .+.. .-+| ..+|++.++ ++.+|.|=.+...+-+..++.+.-......+... T Consensus 199 ~y~~~~--~~~~~~~~~~~~~~~~~~~~~~--g~vPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~ 269 (474) T protein:vir:10 199 FYDNAY--YYVFRGEGIDALQEVGRYEHLF--DYNPLFGVP-----NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTR 269 (474) T ss_pred EEcCce--EEEEeecCCCcccccccccCCC--CccceEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 233332 122222221 1221 1223 347777543 4678999999999999999998888888888888 Q ss_pred cCceeecCccccChhhccCCC-Cccee-cCCcccccccccCCccchHHHHHHHHHHHHHHHHHHH-HHhhccCCCCCCCH Q lcl|NC_020414. 291 DIKYLIRPGSQTDVDHFVNSG-TGEVI-TGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-METMTRRDAERVTA 367 (515) Q Consensus 291 ~p~~l~~~~g~~~~~~~~~~~-~g~~~-~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl-~~~l~~~~~~~~TA 367 (515) .|.+.+.-. -+..+...... .|.+. .+..+++..+. ...+.......++.+++.|...-. .+.....-+...|+ T Consensus 270 ~~~l~i~g~-~~~~~~~~~~~~~~~i~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg 346 (474) T protein:vir:10 270 LAYLVLRGM-GMSEEMIQETQKSGAFELFDKDMDVKYLT--KDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPI 346 (474) T ss_pred cchhhhccC-CCCchhhhhhhhcceeEecCCCCceeEEe--ccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchH Confidence 887655321 11222222222 23332 23233444333 334556667777777777744321 11000111234566 Q ss_pred HHHHHH-------HHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCCh-h--hccceee--eehHHHHHHHHHHHHHH Q lcl|NC_020414. 368 VEIQRD-------ALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTS-E--LVDPVIV--TGIEALGRMAELDKLAN 435 (515) Q Consensus 368 tEi~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~-~--~~~~~~v--~~l~~l~ra~~~~~l~~ 435 (515) ..+..+ ..++...++..+.++..- |-.++........+ . .+.+.+. .+.+.+..++-+.++. T Consensus 347 ~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~l-----i~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~- 420 (474) T protein:vir:10 347 IGMKLKLMALENKCMTFERKMTAMLRYQFKV-----ILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK- 420 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh- Confidence 666442 233333333333322221 11122222222111 1 2333332 2333333333322221 Q ss_pred HHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhc-c Q lcl|NC_020414. 436 FAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMK-E 514 (515) Q Consensus 436 ~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~-~ 514 (515) | .+....+++.+. +++ -.++|++.+.++.+++.+..... ..++.-++..+ + T Consensus 421 -----g-----------~iS~et~~~~l~---~v~----d~~~E~eri~~E~~e~~~~~~~~-----~~~~~~~~~~~~~ 472 (474) T protein:vir:10 421 -----G-----------QVSERTRLGQSQ---LVD----DVDYELDEMEKESLEFNDKLPDI-----DEGDANDKSQNNQ 472 (474) T ss_pred -----c-----------cCchHHHHHhCC---CCC----CHHHHHHHHHHHHHHHHhhcccc-----cCCCcCCCCcccc Confidence 1 112222222220 121 12456666655443333221110 00111111111 1 Q ss_pred C Q lcl|NC_020414. 515 G 515 (515) Q Consensus 515 ~ 515 (515) . T Consensus 473 s 473 (474) T protein:vir:10 473 S 473 (474) T ss_pred C Confidence 1 No 78 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=97.74 E-value=2.3e-05 Score=46.01 Aligned_cols=425 Identities=11% Similarity=0.053 Sum_probs=185.7 Q ss_pred CCCcc---ccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcc-----c-----ccCC-CC------CCccccccccc Q lcl|NC_020414. 1 MQDTI---LEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLP-----Y-----LMNN-KG------DNETSQNGWQG 60 (515) Q Consensus 1 ~~~~~---~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P-----~-----~~~~-~~------~~~~~~~~~ds 60 (515) |-++. ....++.+.|.+..+.-+..|..+...++.+-.+..- . +... .+ ......|+..+ T Consensus 3 ~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n 82 (474) T protein:vir:94 3 LYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNS 82 (474) T ss_pred hHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccc Confidence 22221 1224577777777777666666555544444332211 0 0000 00 01111244455 Q ss_pred cHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHh Q lcl|NC_020414. 61 VGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIV 140 (515) Q Consensus 61 t~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~ 140 (515) -+...++..++-|++- |+. +...+. .....+++++|. ..+..++|.....++.++..+ T Consensus 83 ~~~~ivd~~~~yl~g~--pv~-----~~~~~~--------~~~~e~~~~~l~-------~~~~~n~~~~~~~~~~~~~~~ 140 (474) T protein:vir:94 83 FDSEIVDTRVGYLHGV--PVT-----YDLDEN--------AEKNEKLKKFIT-------NFAIRNSVDDEDSEIGKMAAI 140 (474) T ss_pred hHHHHHHhHhhheecc--cee-----EeeCCC--------CcchHHHHHHHH-------HHHhhcCHhHHHHHHHHHHhh Confidence 5555566555544331 332 222111 011223444433 346668899999999999999 Q ss_pred hCceEEE--EeCCCcEE--EEEcce-EEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEE Q lcl|NC_020414. 141 AGNCLLY--KPSKGAMS--AVPMHH-YVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHA 215 (515) Q Consensus 141 ~G~~~l~--~d~~~~~r--~~pl~~-y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v 215 (515) +|.|.++ .+.++.++ +++..+ |++- |..+.....+|.+...- ......+++.- T Consensus 141 ~G~a~~~~~~d~~~~~~~~~i~p~~~~~v~-d~~~~~~~~i~~~~~~~---------------------~~~~~~~~~~~ 198 (474) T protein:vir:94 141 CGYGARLAYIDTNGDIRIKNIDPYNVIFVG-DNILEPTYSLRYFYEKD---------------------DDNGTDYVYAE 198 (474) T ss_pred cCeEEEEEEeCCCCeeEEEEEcccceEEEE-cCCCceEEEEEEEEEee---------------------CCCceEEEEEE Confidence 9998754 46665444 454444 5454 66666655454443210 01111222222 Q ss_pred EEcCCCCeEEEEEeCCe----eecc-cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020414. 216 QYAGEGFWKINQSADDI----PVGK-ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMA 290 (515) Q Consensus 216 ~~~~~~~~~~~~e~~~~----~i~~-esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~ 290 (515) .++++. .+++..++. .+.. .-+| ..+|++.++ ++.+|.|=.+...+-+..++.+.-......+... T Consensus 199 ~y~~~~--~~~~~~~~~~~~~~~~~~~~~~--g~vPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~ 269 (474) T protein:vir:94 199 FYDNAY--YYVFRGEGIDALQEVGRYEHLF--DYNPLFGVP-----NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTR 269 (474) T ss_pred EEcCce--EEEEeecCCCcccccccccCCC--CccceEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 233332 122222221 1221 1223 347777543 4678999999999999999998888888888888 Q ss_pred cCceeecCccccChhhccCCC-Cccee-cCCcccccccccCCccchHHHHHHHHHHHHHHHHHHH-HHhhccCCCCCCCH Q lcl|NC_020414. 291 DIKYLIRPGSQTDVDHFVNSG-TGEVI-TGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-METMTRRDAERVTA 367 (515) Q Consensus 291 ~p~~l~~~~g~~~~~~~~~~~-~g~~~-~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl-~~~l~~~~~~~~TA 367 (515) .|.+.+.-. -+..+...... .|.+. .+..+++..+. ...+.......++.+++.|...-. .+.....-+...|+ T Consensus 270 ~~~l~i~g~-~~~~~~~~~~~~~~~i~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg 346 (474) T protein:vir:94 270 LAYLVLRGM-GMSEEMIQETQKSGAFELFDKDMDVKYLT--KDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPI 346 (474) T ss_pred cchhhhccC-CCCchhhhhhhhcceeEecCCCCceeEEe--ccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchH Confidence 887655321 11222222222 23332 23233444333 334556667777777777744321 11000111234566 Q ss_pred HHHHHH-------HHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCCh-h--hccceee--eehHHHHHHHHHHHHHH Q lcl|NC_020414. 368 VEIQRD-------ALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTS-E--LVDPVIV--TGIEALGRMAELDKLAN 435 (515) Q Consensus 368 tEi~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~-~--~~~~~~v--~~l~~l~ra~~~~~l~~ 435 (515) ..+..+ ..++...++..+.++..- |-.++........+ . .+.+.+. .+.+.+..++-+.++. T Consensus 347 ~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~l-----i~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~- 420 (474) T protein:vir:94 347 IGMKLKLMALENKCMTFERKMTAMLRYQFKV-----ILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK- 420 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh- Confidence 666442 233333333333322221 11122222222111 1 2333332 2333333333322221 Q ss_pred HHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhc-c Q lcl|NC_020414. 436 FAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMK-E 514 (515) Q Consensus 436 ~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~-~ 514 (515) | .+....+++.+. +++ -.++|++.+.++.+++.+..... ..++.-++..+ + T Consensus 421 -----g-----------~iS~et~~~~l~---~v~----d~~~E~eri~~E~~e~~~~~~~~-----~~~~~~~~~~~~~ 472 (474) T protein:vir:94 421 -----G-----------QVSERTRLGQSQ---LVD----DVDYELDEMEKESLEFNDKLPDI-----DEGDANDKSQNNQ 472 (474) T ss_pred -----c-----------cCchHHHHHhCC---CCC----CHHHHHHHHHHHHHHHHhhcccc-----cCCCcCCCCcccc Confidence 1 112222222220 121 12456666655443333221110 00111111111 1 Q ss_pred C Q lcl|NC_020414. 515 G 515 (515) Q Consensus 515 ~ 515 (515) . T Consensus 473 s 473 (474) T protein:vir:94 473 S 473 (474) T ss_pred C Confidence 1 No 79 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=97.73 E-value=2.4e-05 Score=45.91 Aligned_cols=387 Identities=10% Similarity=0.001 Sum_probs=161.7 Q ss_pred hcccccCCCCCCccccccccccHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHH Q lcl|NC_020414. 40 TLPYLMNNKGDNETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAM 119 (515) Q Consensus 40 ~~P~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~ 119 (515) ++|.-.++.- +...++..-+-+..+++.++..|. +.+ |+. .|... ...+. T Consensus 1 ~l~~~~~~~~-~~~~~~~v~n~~~~ivd~~~~~l~----~~g---f~~--~d~~~---------~~~~~----------- 50 (434) T protein:vir:98 1 MLPKNAEQAF-LDFQRKARTNFCGLIANASVHRLL----ALG---VTG--PDGEP---------DTRAS----------- 50 (434) T ss_pred CCCCCccHHH-HHhhhhhhccchHHHHHHHHhhhc----cCc---eec--CCCch---------HHHHH----------- Confidence 3333111111 111122344667777777777553 333 432 22111 11122 Q ss_pred HHHHhcCCHHHHHHHHHHHHhhCceEEEEe--CCC---------cEEEEEcce-EEEeeCCCCCeeEEEEEEEecHHHHH Q lcl|NC_020414. 120 KALEQRQFRPAIVEVFKHLIVAGNCLLYKP--SKG---------AMSAVPMHH-YVVNRDTNGDLMDVILLQEKALRTFD 187 (515) Q Consensus 120 ~~l~~snf~~~~~~~~~dl~~~G~~~l~~d--~~~---------~~r~~pl~~-y~v~~d~~G~vd~i~r~~~~t~~ql~ 187 (515) +.+.+++|.....+++++..++|.+.+++. ++. .+++++-.+ +++.-+..+++...++.+....+ T Consensus 51 ~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~--- 127 (434) T protein:vir:98 51 RWWQANRLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDID--- 127 (434) T ss_pred HHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccC--- Confidence 335668999999999999999999877653 221 166675544 44444445666665554432211 Q ss_pred HHhcccccchhhhccCCCcccEEEEEEEEE----cCCC-CeE----EEEEeCCeeecccCCcccccCcEEEEeeeecCCC Q lcl|NC_020414. 188 PATRMAIEVGMKGKKCKEDDNVKLYTHAQY----AGEG-FWK----INQSADDIPVGKENRIKAEKLPFIPLTWKRSYGE 258 (515) Q Consensus 188 ~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~----~~~~-~~~----~~~e~~~~~i~~esgy~~~~~P~~~~Rw~~~~g~ 258 (515) +.....+.+++.++. .... .+. .++..+..+-...-+| ..||++.+.=+...++ T Consensus 128 ---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~--g~vPvv~f~N~~~~~~ 190 (434) T protein:vir:98 128 ---------------GFGYARVFFDDTSFPYRTRERTGARLPWGPDSWVYTGTADSGDVHDL--GGMQLVEFARMPDLGE 190 (434) T ss_pred ---------------CceEEEEEEeCcEEEEEEeeccccccccccccceecccccccccCCC--CccceEEeccCCCcCc Confidence 011111222111111 1110 000 0000000011112244 3589998876665544 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeec---C-cc------ccChhhccCCCCcceecCCccccccccc Q lcl|NC_020414. 259 DWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIR---P-GS------QTDVDHFVNSGTGEVITGVEEDIHIVQL 328 (515) Q Consensus 259 ~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~---~-~g------~~~~~~~~~~~~g~~~~g~~~~v~~~~~ 328 (515) +|+|=.+..++.+..++...-..+..++..+.|...+. + +. ............|.+..-..++....++ T Consensus 191 -~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~ 269 (434) T protein:vir:98 191 -DPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKFAKRTDPATGMTVVDQPFVPSPSAVWASEGENTQFGQL 269 (434) T ss_pred -CCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccccchhhhhhhccccccccCCCCCceEEEe Confidence 79998899999999999988888888888877743331 0 00 0000011111112211101122333333 Q ss_pred CCccchHHHHHHHHHHHHHHHHHH-H-HHhhccCCCCCCCHHHHHH-------HHHHHHHHhhhhHHHHHHHHHHHHHHH Q lcl|NC_020414. 329 GKYADLTPISAVLEVYTRRIGVIF-M-METMTRRDAERVTAVEIQR-------DALEIEQNMGGVYSLFAMTMQTPIAMW 399 (515) Q Consensus 329 ~~~~~l~~~~~~i~~~~~rI~~af-l-~~~l~~~~~~~~TAtEi~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r 399 (515) + .++++.....+..+.+.|...= + ...+. .+....++..+.. +.+.|.+.+|..+.++. +. T Consensus 270 ~-~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~-~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~--------rl 339 (434) T protein:vir:98 270 D-ATDLSGFLKEHASDVRDMLTISQTPTYLYA-TDLVNISADTIGALDILHVAKVREHIASFSEGLESVL--------AL 339 (434) T ss_pred c-CcchHHHHHHHHHHHHHHhcccCCCHHHhc-cccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HH Confidence 2 2344443333444333332110 0 00111 1112345555433 33444555554444332 22 Q ss_pred HHHhcCCCCChhhcccee--eeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCH Q lcl|NC_020414. 400 GLQEAGDSFTSELVDPVI--VTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSE 477 (515) Q Consensus 400 ~~~~~~~~~p~~~~~~~~--v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~ 477 (515) ++.-.+-......+++.+ ..+-+.++.|+-+.+|. + ++++.+ .+...+|. ++ T Consensus 340 ~~~~~g~~~~~~~~~v~w~~~~~~s~~~~ada~~kl~---~-----~g~~~e------------~~~~~lg~------~~ 393 (434) T protein:vir:98 340 AAAQAGVPEDYTEAEVRWANPAHVTMAVKADAATKLK---S-----IGYPLD------------VIAEELDE------SP 393 (434) T ss_pred HHHhcCCChhheeeeEEecCCCCCCHHHHHHHHHHHH---h-----cCCcHH------------HHHHhCCC------CH Confidence 222112111111122222 12222233333322221 1 122221 22344564 45 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 478 EEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 478 eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) +|++.+.+++.+.....+ ..+.++.....|+...+| T Consensus 394 ~e~~r~~~e~~~~~~~~~--~~~~~~~~~~~g~~~~~~ 429 (434) T protein:vir:98 394 ARVRRIVAGAASQALLAA--SLLPAPGAPSAGNVPDSG 429 (434) T ss_pred HHHHHHHHHHHHHHHHHH--hhhccCCCCCCCCCCccc Confidence 677766654332222221 112222222233333334 No 80 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=97.66 E-value=3.2e-05 Score=45.24 Aligned_cols=426 Identities=12% Similarity=0.060 Sum_probs=179.5 Q ss_pred CCCcccc-ccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccc---cCC---CCCCccccccccccHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILE-YGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYL---MNN---KGDNETSQNGWQGVGAQATNHLANKL 73 (515) Q Consensus 1 ~~~~~~~-~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~---~~~---~~~~~~~~~~~dst~~~a~~~Laa~l 73 (515) -.+.... ..-+.+.+.+..+.-+..+ .++|+++.+|....- ... ........|+..+.+...++..++-| T Consensus 29 ~~~~~~~~~~~~~~~l~~~i~~~~~~~---~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl 105 (501) T protein:vir:27 29 RADNLEELMVNNWELLKNFINHHKLRQ---APRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYL 105 (501) T ss_pred ccccccccccccHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCccccccCccCccccccceeccchHHHHHHHHhhhh Confidence 2222222 2233334444444333333 334556666654421 111 11111223455677777777776666 Q ss_pred HHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCC Q lcl|NC_020414. 74 AQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK--PSK 151 (515) Q Consensus 74 ~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~--d~~ 151 (515) ++- | +.++..+... ...+. ..+...+..++|.....++.++..++|.+.+++ +.+ T Consensus 106 ~g~------p-~~~~~~d~~~---------~~~~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded 162 (501) T protein:vir:27 106 AGN------P-IRVEYDDNDN---------NSQND-------DTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEY 162 (501) T ss_pred ccc------C-eeEecCCccc---------hHHHH-------HHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCC Confidence 432 1 1223222111 11222 234445677899999999999999999987654 555 Q ss_pred Cc--EEEEEc-ceEEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEE Q lcl|NC_020414. 152 GA--MSAVPM-HHYVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQ 227 (515) Q Consensus 152 ~~--~r~~pl-~~y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~ 227 (515) +. +++++. .-|++.-+. .+++...+|.+..... .+....+++| .++. .+++ T Consensus 163 ~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~------------------~~~~~~~~vy-----t~~~--v~~~ 217 (501) T protein:vir:27 163 DETRIKRLNPLETFVIYDNSLEDNSIAAVRYYNRGTL------------------QNAKDVVEIY-----TNEH--IYTL 217 (501) T ss_pred CceEEEEEccceeEEEecCCCCCceEEEEEEEEeeec------------------CCcEEEEEEE-----eCCe--EEEE Confidence 44 445544 445555444 3556555555432111 0111122232 2232 2222 Q ss_pred EeCC--eeeccc-CCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccC- Q lcl|NC_020414. 228 SADD--IPVGKE-NRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTD- 303 (515) Q Consensus 228 e~~~--~~i~~e-sgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~- 303 (515) ..++ ..+... -+| ..+|++.++ ++..|+|-.+..++-+..++.+.-......+....|.+.+.-....+ T Consensus 218 ~~~~~~~~~~~~~~~~--g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~ 290 (501) T protein:vir:27 218 DASDDFNEISVTTHAF--GTVPITEFL-----NNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPK 290 (501) T ss_pred EeCCceeeccccccCC--CcccEEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCc Confidence 2222 222222 233 358887764 34679999999999999999988888888888777765543111111 Q ss_pred hhhccC-CCCcceec-------CCcccccccccCCccchHHHHHHHHHHHHHHHHHHH-HHhhccCCCCCCCHHHHHHH- Q lcl|NC_020414. 304 VDHFVN-SGTGEVIT-------GVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-METMTRRDAERVTAVEIQRD- 373 (515) Q Consensus 304 ~~~~~~-~~~g~~~~-------g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl-~~~l~~~~~~~~TAtEi~~r- 373 (515) .+.... ...+.+.. |..+.+.+-.+....+.+.....++.+++.|...-. .+......+...|+..+... T Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~ 370 (501) T protein:vir:27 291 GMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKL 370 (501) T ss_pred ccchhhhhhcCceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHHHHHHHH Confidence 100000 01111211 111112221222223444556666666666543311 11000011223466555332 Q ss_pred ------HHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcC--CCCChhhccceee--eehHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020414. 374 ------ALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAG--DSFTSELVDPVIV--TGIEALGRMAELDKLANFAQYMSLP 443 (515) Q Consensus 374 ------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~--~~~p~~~~~~~~v--~~l~~l~ra~~~~~l~~~~~~v~~~ 443 (515) +.++++.++..+.++..= |-.++.... ..+....+.+.+- .+.+.+..++ .+..+ T Consensus 371 ~~l~~ka~~~~~~~~~~l~~~~~l-----i~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad---~~~kl------- 435 (501) T protein:vir:27 371 FGLDQDRVDTQSQFTQGLKRRYRL-----AARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVS---ILTGL------- 435 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHhhcccccccccccceEEeCCCCCcCHHHHHH---HHHHH------- Confidence 344444444444333221 111122111 1111122333331 2223233222 22221 Q ss_pred hcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 444 QTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 444 a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) +++ |....++..+ -+++ -.++|++.+.+++++.....+.. ... -..+..++...+. T Consensus 436 ~g~-------iS~et~l~~l---~~v~----D~~~E~eri~~E~~e~~~~~~~~-~~~-~~~~~~~d~~~~~ 491 (501) T protein:vir:27 436 GGQ-------VSQETALSLS---GLVE----SPNEELDKINKEVSEIDFKGYSN-DFN-EHVGKYTDEVKET 491 (501) T ss_pred hcc-------CcHHHHHHhC---CCCC----CHHHHHHHHHHHHHhhhHhhhcC-ccc-cccccccCCCCCC Confidence 111 2122222221 1222 13567777766544322221110 000 0001111111111 No 81 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=97.61 E-value=3.7e-05 Score=44.85 Aligned_cols=421 Identities=8% Similarity=0.057 Sum_probs=180.4 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc---ccCCC----C-CCccccccccccHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY---LMNNK----G-DNETSQNGWQGVGAQATNHLANK 72 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~---~~~~~----~-~~~~~~~~~dst~~~a~~~Laa~ 72 (515) |.++. ..-++.+.+.+..+..+.+ ..++|+.+.+|.... ..... . ......++..+.+...++..++- T Consensus 22 ~~~~~-~~~~~~~~i~~~i~~~~~~---~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~ 97 (481) T protein:vir:10 22 VVSDL-AELLKEENLRNFISRHQTE---QVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHNYAKYVSRFIVGY 97 (481) T ss_pred eeecc-hhhcCHHHHHHHHHHHHHH---HHHHHHHHHHHhcCCCcccccCccccccccccccceeecchHHHHHHHHHhh Confidence 22222 1223555566655554433 344566666665442 11111 0 11112244556666667666654 Q ss_pred HHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeC Q lcl|NC_020414. 73 LAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPS 150 (515) Q Consensus 73 l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~ 150 (515) |.+ .|. .++..+... .+ .+...+..++|.....++.++..++|.+.+ |.+. T Consensus 98 l~g------~~~-~~~~~d~~~-------------~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~ 150 (481) T protein:vir:10 98 LTG------NPI-TITHQDNQT-------------ND-------KIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDF 150 (481) T ss_pred hcc------CCc-eEecCChhH-------------HH-------HHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCC Confidence 432 222 222222211 11 233446678899999999999999999765 4466 Q ss_pred CCc--EEEEEcceEEEeeCC--CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEE Q lcl|NC_020414. 151 KGA--MSAVPMHHYVVNRDT--NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKIN 226 (515) Q Consensus 151 ~~~--~r~~pl~~y~v~~d~--~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~ 226 (515) ++. +++++-.+.++..|. .+++...+|.++..-. .+.....+++| .++. .++ T Consensus 151 dg~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~-----------------~~~~~~~~~~y-----~~~~--i~~ 206 (481) T protein:vir:10 151 EDRDTFKVLDPKSTFVVYDQTLDKKVVAGVRYFEKQDK-----------------DKVPVQHVEVY-----TTDK--IYY 206 (481) T ss_pred CCeEEEEEEcccceEEEEcCCCCCceEEEEEEEEEeeC-----------------CCceEEEEEEE-----ecCe--EEE Confidence 554 445655554443443 3556666655542100 00001112222 2221 122 Q ss_pred EEeCCe--eec-c-cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCcccc Q lcl|NC_020414. 227 QSADDI--PVG-K-ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQT 302 (515) Q Consensus 227 ~e~~~~--~i~-~-esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~ 302 (515) ++.++. ..+ + .-+| ..+|++..+- +.+|+|=.+...+-+..++.+.-......+....|.+.+...... T Consensus 207 ~~~~~~~~~~~~~~~~~~--g~vPvv~~~n-----~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~ 279 (481) T protein:vir:10 207 IEIKGGTYHRVEEVEHYY--NDVPIIEYLN-----DQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDL 279 (481) T ss_pred EEecCCceeecccccccC--CceeEEEeec-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCC Confidence 222221 112 1 1123 3478776542 467999888888888889888777777777777776655322222 Q ss_pred ChhhccCCCCcceec----------CCcccccccccCCccchHHHHHHHHHHHHHHHHHH-HHHhhccCCCCCCCHHHHH Q lcl|NC_020414. 303 DVDHFVNSGTGEVIT----------GVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIF-MMETMTRRDAERVTAVEIQ 371 (515) Q Consensus 303 ~~~~~~~~~~g~~~~----------g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~af-l~~~l~~~~~~~~TAtEi~ 371 (515) +.++......+..+. +...++..+. ...+.+.....++.++..|...- ..+......+...|+..+. T Consensus 280 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~ 357 (481) T protein:vir:10 280 DSEDAKAFRDANMIHLEPGTNANGSEGKAEVKYVY--KQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESMK 357 (481) T ss_pred CccchhhhhhccceeccccccccCCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHH Confidence 222221111111111 1112222222 12234445566666665553321 1110001111234655543 Q ss_pred HH-------HHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCC-CCChhhccceee--eehHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 372 RD-------ALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGD-SFTSELVDPVIV--TGIEALGRMAELDKLANFAQYMS 441 (515) Q Consensus 372 ~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~-~~p~~~~~~~~v--~~l~~l~ra~~~~~l~~~~~~v~ 441 (515) .+ .+.+...++..+.++-. ++-+++..... ......+.+.+- .+.+.+..++ .+..+. | T Consensus 358 ~~~~~l~~k~~~~~~~~~~~l~~~~~-----li~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~a~---~~~kl~---g 426 (481) T protein:vir:10 358 YKLFGLEQVRAIKERLFKKGLMKRYK-----LLLNNVNLTGLKQHNYAELTITFTPNLPKSMMESIN---AFNALS---G 426 (481) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHhccCCCccccceeeEEeCCCCCcCHHHHHH---HHHHHh---c Confidence 32 23333444444333221 11112211111 111112333331 1222233332 222211 1 Q ss_pred HhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 442 LPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 442 ~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) . +....+++.+ -+++ -.++|++.++++.+++... ......+.+.......+--|| T Consensus 427 ~-----------is~et~~~~l---~~i~----d~~~E~~ri~~E~~~~~~~-~~~~~~~~~~~~~~~~dd~~g 481 (481) T protein:vir:10 427 G-----------VSESTRLSLL---DFID----NPKEELEKMQEEEAQREKQ-ADKRGYGEAFENHLNVDDSNG 481 (481) T ss_pred c-----------CChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHhh-hhhccCCccCCCCCCCCCCCC Confidence 1 2222233322 1111 1356777777665444332 222233333333333344444 No 82 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=97.60 E-value=3.9e-05 Score=44.77 Aligned_cols=419 Identities=10% Similarity=0.089 Sum_probs=188.2 Q ss_pred CCCc-----------cccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhh--cccc---cCCCCC---Ccccccccccc Q lcl|NC_020414. 1 MQDT-----------ILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLT--LPYL---MNNKGD---NETSQNGWQGV 61 (515) Q Consensus 1 ~~~~-----------~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~--~P~~---~~~~~~---~~~~~~~~dst 61 (515) |-.| +.+...+++.|.+..+..+.+ -+....++++|.-- ++.+ ....+. +....++..+- T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~-~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki~~n~ 85 (474) T protein:vir:95 7 MPWDKPYGEEVVEQLKPQFETQEEMIIRLIDDHRKQ-LDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRITTNF 85 (474) T ss_pred cCCCCchhhHHHHhhhhccCChHHHHHHHHHHHHHH-HHHHHHHHHHhcccCchhccccccccccccccccccceeccch Confidence 2222 223345777777777766544 34445566665421 1111 111111 11112455677 Q ss_pred HHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh Q lcl|NC_020414. 62 GAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVA 141 (515) Q Consensus 62 ~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~ 141 (515) +...++..++-|.+ -|+ .+...+... ...++.| + .+||...+.++.++...+ T Consensus 86 ~~~Ivd~~~~~l~g--~p~-----~~~~~d~~~---------~~~l~~~-----------~-~n~~~~~~~e~~~~~~~~ 137 (474) T protein:vir:95 86 HQNLVDQKVSYVAS--KPV-----TYSCEDESV---------LKIIHDV-----------L-DTRWDNKLIDILTATSNK 137 (474) T ss_pred HHHHHHHHHhhhcc--CCc-----eeccCchHH---------HHHHHHH-----------H-hccHHHHHHHHHHHHhhc Confidence 77777777766543 221 234333221 1112222 2 368999999999999999 Q ss_pred CceEEE--EeCCCcEE--EEE-cceEEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEE Q lcl|NC_020414. 142 GNCLLY--KPSKGAMS--AVP-MHHYVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHA 215 (515) Q Consensus 142 G~~~l~--~d~~~~~r--~~p-l~~y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v 215 (515) |.+.++ .|.++.++ +++ ..-|.+..|. .|.+.-++|.+...- ...+++|+.- T Consensus 138 G~~~~~v~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~----------------------~~~~~~y~~~ 195 (474) T protein:vir:95 138 GIDWLQVYINENGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNN----------------------EEKVEFWTDT 195 (474) T ss_pred CcEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEEcC----------------------eeEEEEEeCC Confidence 998654 45555444 443 3445554443 577777776654210 1123333210 Q ss_pred ---EE-cCCCCeEEEE--EeCCeeec-ccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 216 ---QY-AGEGFWKINQ--SADDIPVG-KENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAAL 288 (515) Q Consensus 216 ---~~-~~~~~~~~~~--e~~~~~i~-~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~ 288 (515) ++ ...+.+.... ...+.... ..-+| ..+|++.++. +.+|.|=.+...+-+..+|.+--......+. T Consensus 196 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--g~iPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~ 268 (474) T protein:vir:95 196 TVTYYVLENGGLIPDYYYGANHIQSHFSNGNW--GRVPFIAFKN-----NPEEVSDIWMYKSLIDAIDKRLSDAQNMFDE 268 (474) T ss_pred eEEEEEEcCCccccccccCcccccccccccCC--CccceEeecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHH Confidence 01 1111111000 00111111 11223 3588887654 4679998899999999999888888888888 Q ss_pred hccCceeecCccccChhhcc-CCCCccee-cCCcccccccccCCccchHHHHHHHHHHHHHHHHHHH-HHhhccCCCCCC Q lcl|NC_020414. 289 MADIKYLIRPGSQTDVDHFV-NSGTGEVI-TGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-METMTRRDAERV 365 (515) Q Consensus 289 a~~p~~l~~~~g~~~~~~~~-~~~~g~~~-~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl-~~~l~~~~~~~~ 365 (515) ...|.+.+..-..-+...+. ....+.++ ....+++..+. ...+.......++.+.+.|...-. .+......+... T Consensus 269 ~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~ 346 (474) T protein:vir:95 269 SVELIYILKGYEGQDLEEFMRGLKYYKAINVDGDGGVETIQ--VEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAP 346 (474) T ss_pred hcCceeeeecCCcccchhhhhhhhccceeeccCCCceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccc Confidence 88887655321111112111 11112222 22233444443 234667777778887777754321 111101111234 Q ss_pred CHHHHH-------HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee--eeehHHHHHHHHHHHHHHH Q lcl|NC_020414. 366 TAVEIQ-------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI--VTGIEALGRMAELDKLANF 436 (515) Q Consensus 366 TAtEi~-------~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~--v~~l~~l~ra~~~~~l~~~ 436 (515) |+..+. .+++++...++..+.++-. .+..-.+-......+.+.+ ..+.+-+..++ T Consensus 347 Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~--------li~~~~g~~~d~~~i~v~f~~~~p~d~~e~a~-------- 410 (474) T protein:vir:95 347 SGIALKFLYGNLDLKANKLKNKATVAIQELIG--------FIIDFNNLKMDVKDIEISFNFNRMMNDAEQSQ-------- 410 (474) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHhCCCcccceeeEEeccCCCcCHHHHHH-------- Confidence 665543 3345555555555444332 2221111121122233332 12222222221 Q ss_pred HHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 437 AQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 437 ~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) .+... ..+....++..+ -+++ -.++|++.+.++++++.+..+.... .......+.-..+ T Consensus 411 --~~~~~--------g~iS~et~i~~l---~~v~----d~~~E~~ri~~E~~~~~~~~~~~~~---~~~d~~~~~~~~~ 469 (474) T protein:vir:95 411 --IIAQS--------QYLSRETLVKSS---PLVD----DYKAELERIEQEQMEYNKQLPNLDD---GGADGAQQQERSN 469 (474) T ss_pred --HHHhc--------CCCchHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHhccccccc---ccCCCCcCCCCCc Confidence 11111 122233333322 1221 1356777776655444332221111 1111111111111 No 83 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=97.60 E-value=3.9e-05 Score=44.74 Aligned_cols=417 Identities=10% Similarity=0.066 Sum_probs=177.3 Q ss_pred CCCccccccc----cHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc--ccCC------CCCC---ccccccccccHHHH Q lcl|NC_020414. 1 MQDTILEYGG----QRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY--LMNN------KGDN---ETSQNGWQGVGAQA 65 (515) Q Consensus 1 ~~~~~~~~~~----~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~--~~~~------~~~~---~~~~~~~dst~~~a 65 (515) |-+.--+... +.+.|.+..+.... ...+++.+.+|..-. .... .... ....|+..+-+... T Consensus 22 ~~~~~~~~~~~~e~~~~~i~~~i~~~~~----~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~I 97 (483) T protein:vir:12 22 IFDAIVRTNNKPETLEEMIVRYIKQHLE----KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANL 97 (483) T ss_pred hhhcccccCCchhhHHHHHHHHHHHHHH----HHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHH Confidence 1111112222 33444444444433 334566666665432 0000 0000 11124556777777 Q ss_pred HHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE Q lcl|NC_020414. 66 TNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCL 145 (515) Q Consensus 66 ~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~ 145 (515) ++..++-|++ .| +.++..+... ...++.| ...+|.....++.++..++|.+. T Consensus 98 vd~~~~~l~G--~p-----~~~~~~d~~~---------~~~l~~~------------~~n~~~~~~~~~~~~~~~~G~~y 149 (483) T protein:vir:12 98 VDQKVSYIVG--KP-----IAFKHTDDEV---------VKRIDEV------------LGNRFDDKLHSVLTGASNKGIEW 149 (483) T ss_pred HHHHhhhhcc--cC-----ceeccCChHH---------HHHHHHH------------HhccHHHHHHHHHHHHhhCCeEE Confidence 7777766543 12 2233333211 1112222 23678889999999999999986 Q ss_pred E--EEeCCCc--EEEEEcce-EEEee-CCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEE------ Q lcl|NC_020414. 146 L--YKPSKGA--MSAVPMHH-YVVNR-DTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYT------ 213 (515) Q Consensus 146 l--~~d~~~~--~r~~pl~~-y~v~~-d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~------ 213 (515) + |.|.++. +++++..+ |++.- +..+++...+|.++..- ...+++|+ T Consensus 150 ~~v~~d~d~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~----------------------~~~~~~y~~~~v~~ 207 (483) T protein:vir:12 150 LHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN----------------------ETKVEYWDKVTVNY 207 (483) T ss_pred EEEEEcCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeec----------------------ceEEEEEecCeEEE Confidence 4 5566655 44555444 44433 34577777666654310 01233332 Q ss_pred EEEEcCCCCeEEEEEeCCeeecc-cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_020414. 214 HAQYAGEGFWKINQSADDIPVGK-ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADI 292 (515) Q Consensus 214 ~v~~~~~~~~~~~~e~~~~~i~~-esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p 292 (515) .+.....-...+..+.+...+.. ..+| ..+|++.++- +.+|+|=.+...+-+..++.+--......+....| T Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~ 280 (483) T protein:vir:12 208 YVYENGSLIPDYSNNLENSKTHFSTGSW--GKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNEL 280 (483) T ss_pred EEEeCCeeeecccccccccccccccCCC--CccceEEecC-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCc Confidence 11111000001111112222211 2233 3478776653 45799988889999999998888888888888888 Q ss_pred ceeecCccccChhhcc-CC-CCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHH-HhhccCCCCCCCHHH Q lcl|NC_020414. 293 KYLIRPGSQTDVDHFV-NS-GTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMM-ETMTRRDAERVTAVE 369 (515) Q Consensus 293 ~~l~~~~g~~~~~~~~-~~-~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~-~~l~~~~~~~~TAtE 369 (515) .+.+.-.+.-+..... .. ..+.+.....+++..+. ...+.......++.+++.|...-.. +.....-+...|+.. T Consensus 281 ~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A 358 (483) T protein:vir:12 281 TYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQ--VEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA 358 (483) T ss_pred eeeeecCCcccchhHHHhhhhccccccCCCCcceEEe--ecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHH Confidence 7655321111111110 01 11222222333444443 2345566667777777666443211 000011112345555 Q ss_pred HH-------HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee--eeehHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 370 IQ-------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI--VTGIEALGRMAELDKLANFAQYM 440 (515) Q Consensus 370 i~-------~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~--v~~l~~l~ra~~~~~l~~~~~~v 440 (515) +. .++.++...++..+.++.. .+..-..-......+++.+ ..+.+.+..++ .+.. T Consensus 359 l~~~~~~l~~k~~~~~~~f~~~l~~~~~--------li~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~---~~~k----- 422 (483) T protein:vir:12 359 LEFLYTNLNLKADKLARKAKVAIQELLW--------FVFEHFDIKGEHKDVDISFNYNKVANTELQVQ---TAQQ----- 422 (483) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHhcCCCccceeeEEeCCCCCCCHHHHHH---HHHH----- Confidence 42 2334555555554444322 2111111111112223222 12222222222 2221 Q ss_pred HHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 441 SLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 441 ~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) ++++ +....++..+ -+++ -.++|++.+.++++++++..+. ......+..+++-+.+ T Consensus 423 --l~Gi-------iS~et~~~~~---~~v~----d~~~E~~ri~~E~~~~~~~~~~---~~~~~~d~~~~~~~~~ 478 (483) T protein:vir:12 423 --SMGI-------VSHETVLENH---PFVE----DLQAELERIEQEQMEYNKQLPN---LDDGGADGAQQQERSN 478 (483) T ss_pred --Hhcc-------CchHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHhhccc---ccccccCCcccCCCCC Confidence 1111 2222222222 1221 1356777766655443322211 1111111111111111 No 84 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=97.57 E-value=4.4e-05 Score=44.49 Aligned_cols=459 Identities=12% Similarity=0.074 Sum_probs=204.8 Q ss_pred CCCccccccccHHHHH------HHHHHHHHhhhhHHHHHHHHHHhhcccc----cCCCCCCcc-ccccccccHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIP------KLWEKFSKKRSPYLDRAKHFAKLTLPYL----MNNKGDNET-SQNGWQGVGAQATNHL 69 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~------~r~~~lk~~R~~~e~~w~e~~~~~~P~~----~~~~~~~~~-~~~~~dst~~~a~~~L 69 (515) |+-.--|+++++.--. .+-... ...-...++.+.+|..-.- ....++..+ ...++++.+...+ T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~---d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~--- 74 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDF---DKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLI--- 74 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHH---HHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHhh--- Confidence 5554444444432100 001111 1122344556666655421 112233322 3356788874433 Q ss_pred HHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE- Q lcl|NC_020414. 70 ANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK- 148 (515) Q Consensus 70 aa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~- 148 (515) .+-+-.+.| +..|+- +.. - ++|+..+...+++.|++....++-.+..+.|-+++++ T Consensus 75 -~~~~~~~~~-g~~~~~----~~~----------~-------e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~ 131 (527) T protein:vir:10 75 -EAKMRFLGQ-GLKWEF----SKK----------D-------AKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLI 131 (527) T ss_pred -CCcceeecc-Cccccc----cch----------h-------HHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEe Confidence 333333333 444521 110 0 1234444556778999999999999999999998766 Q ss_pred -eC-C---CcEEEEEc--ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchh--h-hccCCCcccE-----EEEE Q lcl|NC_020414. 149 -PS-K---GAMSAVPM--HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGM--K-GKKCKEDDNV-----KLYT 213 (515) Q Consensus 149 -d~-~---~~~r~~pl--~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~--~-~~~~~~~~~v-----~v~~ 213 (515) |+ + .+.++..+ +.|+..+|++| ++.+-+.+-.....++..-+....-.+ + ....++.... ..|+ T Consensus 132 wD~~k~~~~R~~v~~~DP~~~f~~ed~d~-~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt 210 (527) T protein:vir:10 132 GDDEKDEGSRLSLHEVDPSTYFPYEDPRY-PGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYT 210 (527) T ss_pred eccCCCcCCCceEeecCcceeeeeecCCC-CCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeee Confidence 33 2 13555444 78888888876 555544433222223322221111000 1 0111111111 1112 Q ss_pred EEE-----EcCC---C--CeEEEEEeCCeeecc-cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 214 HAQ-----YAGE---G--FWKINQSADDIPVGK-ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAV 282 (515) Q Consensus 214 ~v~-----~~~~---~--~~~~~~e~~~~~i~~-esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~ 282 (515) ... +++. + ..++-..+++.++-+ .-.| .-.|++.++=...++++||+|=..+.+.-+..||...-.. T Consensus 211 ~~~w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi--~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~ 288 (527) T protein:vir:10 211 EELYEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQI--TTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDE 288 (527) T ss_pred eceeeccccccccccccchhhhhhhcCceeeecccCCC--CccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHH Confidence 211 1110 1 112222334443321 1122 2268988888889999999999999999999999887777 Q ss_pred HHHHHHhccCceeecCccccChhhccCCCCcceecCCcc----cccccccCCccchHHHHHHHHHHHHHHHHHHHHH--h Q lcl|NC_020414. 283 ARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGVEE----DIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--T 356 (515) Q Consensus 283 ~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~~g~~~----~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~ 356 (515) ...+...-.|..-. +|+...+.--...+-.|-||..- .-....+....++...+.-+..+..+|...=-.- . T Consensus 289 s~is~~sG~Pi~~~--tg~~~vd~~G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA 366 (527) T protein:vir:10 289 DLIMVFGGLGFYAT--DSAPPRDSRGNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIA 366 (527) T ss_pred HHHHHHhCCceeee--cccccccccCCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeee Confidence 77777666665433 34433321101011112222111 1122223333455555566666665554331000 0 Q ss_pred hccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHH-HHHHHHHHH-H---------HhcCCCCChhh--cccee-eeehH Q lcl|NC_020414. 357 MTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMT-MQTPIAMWG-L---------QEAGDSFTSEL--VDPVI-VTGIE 422 (515) Q Consensus 357 l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E-~l~Pli~r~-~---------~~~~~~~p~~~--~~~~~-v~~l~ 422 (515) +...|..+ --+.+ -++..|+|++.|.+.. ++.-.+.|. . ...+--+.+.. ..+.+ -.+.- T Consensus 367 ~G~vD~s~-~~SG~-----ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~l 440 (527) T protein:vir:10 367 VGVVDAAV-AESGI-----ALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPK 440 (527) T ss_pred eccccCCc-CcHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccC Confidence 11112222 11222 1234455555555443 112222211 0 00011111111 11111 23344 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHH---H Q lcl|NC_020414. 423 ALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNE---G 499 (515) Q Consensus 423 ~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~---~ 499 (515) |.-+++-.+++..+.+ + --+....+++.+.++.| +--.+.|++++.+.++++.-....+. . T Consensus 441 P~D~~avie~v~tL~~-----a-------Gi~S~~tAv~~L~~~~g----~eD~E~E~~~I~~era~~a~a~a~A~~~~~ 504 (527) T protein:vir:10 441 PVNSEKRFNQLLQLWE-----A-------GLIPAKKLTEELSKIMG----FELTEEDFKQATEDKKTQGIAQAEAADPFG 504 (527) T ss_pred CCCHHHHHHHHHHHHH-----c-------CchhHHHHHHHHHhccC----CCChHHHHHHHHHHHHHHhHHhhhhcCchh Confidence 5555555555543332 1 23456777888888777 33456777787766554432222111 1 Q ss_pred hhhhccchh--hhhhccC Q lcl|NC_020414. 500 VAKAVPGVI--QQEMKEG 515 (515) Q Consensus 500 ~~~a~~~~~--~~~~~~~ 515 (515) |+++..+++ ++.-++| T Consensus 505 a~~~~~~g~~~~~~d~~~ 522 (527) T protein:vir:10 505 AQMAAEQGIPDEEDDQAL 522 (527) T ss_pred hhhccccCCCCCCccccc Confidence 111122222 2222233 No 85 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=97.57 E-value=4.4e-05 Score=44.46 Aligned_cols=419 Identities=10% Similarity=0.055 Sum_probs=179.6 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc-----ccCC---CCCC---ccccccccccHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY-----LMNN---KGDN---ETSQNGWQGVGAQATNHL 69 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~---~~~~---~~~~~~~dst~~~a~~~L 69 (515) -+.++-.--.+.+.|.+..+..+.+ ..+++.+.+|..-. +-.. .... ....|+..+-+...++.+ T Consensus 15 ~~~~~~~~~~~~~~i~~~i~~~~~~----~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~ 90 (472) T protein:vir:93 15 IVRTNNKPETLEEMIVRYIKQHLEK----LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQK 90 (472) T ss_pred eeeecCchhhHHHHHHHHHHHHHHH----HHHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHHHHH Confidence 2222211122444444444544433 34555666665331 1000 0000 111245567777888888 Q ss_pred HHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEE-- Q lcl|NC_020414. 70 ANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLY-- 147 (515) Q Consensus 70 aa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~-- 147 (515) ++-|.+ .| +.+...+.. +.+.|. ..+ .++|...+.++.++..++|.+.++ T Consensus 91 ~~~l~g--~~-----~~~~~~d~~-------------~~~~l~-------~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~ 142 (472) T protein:vir:93 91 VSYIVG--KP-----IAFKHTDDE-------------VVKRID-------EVL-GNRFDDKLHSVLTGASNKGIEWLHPY 142 (472) T ss_pred hhhhcc--cC-----eeeccCChH-------------HHHHHH-------HHH-hccHHHHHHHHHHHHhhcCeEEEEEE Confidence 876643 12 223333321 122221 122 368999999999999999998654 Q ss_pred EeCCCcE--EEEEcce-EEEee-CCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEE-----EEEc Q lcl|NC_020414. 148 KPSKGAM--SAVPMHH-YVVNR-DTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTH-----AQYA 218 (515) Q Consensus 148 ~d~~~~~--r~~pl~~-y~v~~-d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~-----v~~~ 218 (515) .|.++.+ ++++..+ |++.- +..+++...+|.++..- ...+++|+. .... T Consensus 143 ~d~d~~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~----------------------~~~~~~~~~~~~~~~~~~ 200 (472) T protein:vir:93 143 LDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN----------------------ETKVEYWDKVTVNYYVYE 200 (472) T ss_pred ECCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeec----------------------ceeEEEEecCeEEEEEEe Confidence 4555544 4454444 54433 34677777666654310 011233321 0111 Q ss_pred CCCC-eEEEEEeCCeeec-ccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceee Q lcl|NC_020414. 219 GEGF-WKINQSADDIPVG-KENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLI 296 (515) Q Consensus 219 ~~~~-~~~~~e~~~~~i~-~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~ 296 (515) .... ..+....+...+. ..-+| ..+|++.++. +.+|+|=.+...+-+-.++.+--......+....|.+.+ T Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~--~~vPvv~~~n-----n~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~ 273 (472) T protein:vir:93 201 NGSLIPDYSNNLENSKTHFSTGSW--GKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL 273 (472) T ss_pred cCeeeecccccccccccccccCCC--CCcceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEe Confidence 1100 0111111222221 22334 3588887764 458999999999999999988888888888888886555 Q ss_pred cCccccChhhcc-C-CCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhc-cCCCCCCCHHHHH-- Q lcl|NC_020414. 297 RPGSQTDVDHFV-N-SGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMT-RRDAERVTAVEIQ-- 371 (515) Q Consensus 297 ~~~g~~~~~~~~-~-~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~-~~~~~~~TAtEi~-- 371 (515) .-....+..... . ...+.+.....+++..+.. ..+.......++.++..|...-..-.+. ...+...|+.-+. T Consensus 274 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~ 351 (472) T protein:vir:93 274 TNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQV--EVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFL 351 (472) T ss_pred ecCCcccchhhHHHHhhccccccCCCCcceeEee--cCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHH Confidence 311111111110 0 1112222223334444432 2345556667777766664432110000 0111234555432 Q ss_pred -----HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_020414. 372 -----RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTW 446 (515) Q Consensus 372 -----~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~ 446 (515) .+++++...+|..+.++.. .+..-.+.......+.+.+ ++..|-..+..++.+... ++ T Consensus 352 ~~~l~~ka~~~~~~~~~~l~~~~~--------li~~~~~~~~~~~~i~v~f-~~~~p~~~~~~~~~~~k~-------~g- 414 (472) T protein:vir:93 352 YTNLNLKADKLARKAKVAIQELLW--------FVFEHFDIKGEHKDVDISF-NYNKVANTELQVQTAQQS-------MG- 414 (472) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHhCCCcccceeeEEe-CCCCCCCHHHHHHHHHHH-------hc- Confidence 2345555555555544322 1111111111111222222 222221122222222221 11 Q ss_pred ChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 447 PEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 447 ~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) .+....+++.+ -+++ -.++|++.+.++++++++..+..... ......+.-.++ T Consensus 415 ------iis~et~l~~l---~~~~----d~~~E~~ri~~E~~~~~~~~~~~~~~---~~d~~~~~~~~~ 467 (472) T protein:vir:93 415 ------IVSHETVLENH---PFVE----DLQAELERIEQEQMEYNKQLPNLDDG---GADGAQQQERSN 467 (472) T ss_pred ------cCchHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHhccCcCcc---cCCCCCCCCCCC Confidence 11122222222 1222 13567777666554433332211110 001111111111 No 86 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=97.55 E-value=4.6e-05 Score=44.36 Aligned_cols=459 Identities=12% Similarity=0.073 Sum_probs=205.0 Q ss_pred CCCccccccccHHHHH------HHHHHHHHhhhhHHHHHHHHHHhhcccc----cCCCCCCcc-ccccccccHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIP------KLWEKFSKKRSPYLDRAKHFAKLTLPYL----MNNKGDNET-SQNGWQGVGAQATNHL 69 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~------~r~~~lk~~R~~~e~~w~e~~~~~~P~~----~~~~~~~~~-~~~~~dst~~~a~~~L 69 (515) |+-.--|+++++.--. .+-... ...-...++.+.+|..-.- ....++..+ ...++++.+...+ T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~---d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~--- 74 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDF---DKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLI--- 74 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHH---HHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHhh--- Confidence 5554444444432100 001111 1122344556666655421 112233322 3356788874333 Q ss_pred HHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE- Q lcl|NC_020414. 70 ANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK- 148 (515) Q Consensus 70 aa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~- 148 (515) .+-+-.+.| +..|+- +.. - ++|+..+...+++.|++....++-.+..+.|-+++++ T Consensus 75 -~~~~~~~~~-g~~~~~----~~~----------~-------e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~ 131 (527) T protein:vir:10 75 -EAKMRFLGQ-GLKWEF----SKK----------D-------AKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLI 131 (527) T ss_pred -CCcceeecc-Cccccc----cch----------h-------HHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEe Confidence 333333333 444521 110 0 1234444556778999999999999999999998766 Q ss_pred -eC-C---CcEEEEEc--ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchh--h-hccCCCcccE-----EEEE Q lcl|NC_020414. 149 -PS-K---GAMSAVPM--HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGM--K-GKKCKEDDNV-----KLYT 213 (515) Q Consensus 149 -d~-~---~~~r~~pl--~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~--~-~~~~~~~~~v-----~v~~ 213 (515) |+ + .+.++..+ +.|+..+|++| ++.+-+.+-.....++..-+....-.+ + ....++.... ..|+ T Consensus 132 wD~~k~~~~R~~v~~~DP~~~f~~ed~d~-~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt 210 (527) T protein:vir:10 132 GDDEKDEGSRLSLHEVDPSTYFPYEDPRY-PGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYT 210 (527) T ss_pred eccCCCcCCCceEeecCcceeeeeecCCC-CCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeee Confidence 33 2 13555444 78888888876 555544433222223322221111000 1 0111111111 1112 Q ss_pred EEE-----EcCC---C--CeEEEEEeCCeeecc-cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 214 HAQ-----YAGE---G--FWKINQSADDIPVGK-ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAV 282 (515) Q Consensus 214 ~v~-----~~~~---~--~~~~~~e~~~~~i~~-esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~ 282 (515) ... +++. + ..++-..+++.++-+ .-.| .-.|++.++=...++++||+|=..+.+.-+..||...-.. T Consensus 211 ~~~w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi--~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~ 288 (527) T protein:vir:10 211 EELYEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQI--TTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDE 288 (527) T ss_pred eceeeccccccccccccchhhhhhhcCceeeecccCCC--CccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHH Confidence 211 1110 1 112222334443321 1122 2268988888889999999999999999999999887777 Q ss_pred HHHHHHhccCceeecCccccChhhccCCCCcceecCCcc----cccccccCCccchHHHHHHHHHHHHHHHHHHHHH--h Q lcl|NC_020414. 283 ARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVITGVEE----DIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--T 356 (515) Q Consensus 283 ~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~~g~~~----~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~ 356 (515) ...+...-.|..-. +|+...+.--...+-.|-||..- .-....+....++...+.-++.+..+|...=-.- . T Consensus 289 s~is~~sG~Pi~~~--tg~~~vd~~G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA 366 (527) T protein:vir:10 289 DLIMVFGGLGFYAT--DSAPPRDSRGNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIA 366 (527) T ss_pred HHHHHHhCCceeee--cccccccccCCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeee Confidence 77777766665433 34433321101011112222111 1122223333455555666666666554331000 0 Q ss_pred hccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHH-HHHHHHHHH-H---------HhcCCCCChhh--cccee-eeehH Q lcl|NC_020414. 357 MTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMT-MQTPIAMWG-L---------QEAGDSFTSEL--VDPVI-VTGIE 422 (515) Q Consensus 357 l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E-~l~Pli~r~-~---------~~~~~~~p~~~--~~~~~-v~~l~ 422 (515) +...|..+ --+.+ -++..|+|++.|.+.. ++.-.+.|. . ...+--+.+.. ..+.+ -.+.- T Consensus 367 ~G~vD~s~-~~SG~-----ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~l 440 (527) T protein:vir:10 367 VGVVDAAV-AESGI-----ALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPK 440 (527) T ss_pred eccccCCc-CcHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccC Confidence 11112222 11222 1234455555555443 122222211 0 00011111111 11111 23344 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHH---H Q lcl|NC_020414. 423 ALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNE---G 499 (515) Q Consensus 423 ~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~---~ 499 (515) |.-+++-.+++..+.+ + --+....+++.+.++.| +--.+.|++++.+.++++.-....+. . T Consensus 441 P~D~~avie~v~tL~~-----a-------GiiS~etAv~~L~~~~g----~eD~E~E~~~I~~era~~a~a~a~a~~~~~ 504 (527) T protein:vir:10 441 PVNNEKRFAQLLELWE-----A-------GLIPAKKLTEELSKIMG----FELTEEDFRQATEDKKTQGIAQAEAADPFG 504 (527) T ss_pred CCCHHHHHHHHHHHHH-----c-------CchhHHHHHHHHHhccC----CCchHHHHHHHHHHHHHHhHHhhhhcCchh Confidence 5555555555543332 1 23456777888888777 33456677777776654432222111 1 Q ss_pred hhhhccchh--hhhhccC Q lcl|NC_020414. 500 VAKAVPGVI--QQEMKEG 515 (515) Q Consensus 500 ~~~a~~~~~--~~~~~~~ 515 (515) |+++..+++ ++.-++| T Consensus 505 a~~~~~~g~~~~~~d~~~ 522 (527) T protein:vir:10 505 AQMAAEQGIPDEEDDQAL 522 (527) T ss_pred hhhccccCCCCCCccccc Confidence 111222222 2222333 No 87 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=97.52 E-value=5.1e-05 Score=44.10 Aligned_cols=412 Identities=14% Similarity=0.103 Sum_probs=163.6 Q ss_pred ccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccccc-CCCCCC--c--cccccccccHHHHHHHHHHHHHHhhcCC Q lcl|NC_020414. 6 LEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLM-NNKGDN--E--TSQNGWQGVGAQATNHLANKLAQVLFPA 80 (515) Q Consensus 6 ~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~-~~~~~~--~--~~~~~~dst~~~a~~~Laa~l~s~ltpp 80 (515) |-. ....+...++.+..+ ..+...+.+|..-.-- ...+.. . +..+...+-+..+++.+++.| ++. T Consensus 1 ~~t--~~d~i~~L~~~~~~~----~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l----~~~ 70 (480) T protein:vir:78 1 MTT--YHEHVERLQGLLARD----LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL----DIE 70 (480) T ss_pred CCC--HHHHHHHHHHHHHHH----HHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhh----ccC Confidence 222 455566666655433 3444455555433210 000100 0 011234455666677766655 333 Q ss_pred CCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--------CCC Q lcl|NC_020414. 81 QRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKP--------SKG 152 (515) Q Consensus 81 ~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d--------~~~ 152 (515) + |... .+.. ..+ .+...+..++|.....++.++...+|.+.+++- .+. T Consensus 71 g---~~~~-~d~~-------------~~~-------~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~ 126 (480) T protein:vir:78 71 G---FRIS-EDSE-------------GLE-------ELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAG 126 (480) T ss_pred c---eecC-CCch-------------hHH-------HHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCC Confidence 2 2222 1111 111 123345678999999999999999999876653 222 Q ss_pred c--EEEEEcceEEEeeCC--CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEE Q lcl|NC_020414. 153 A--MSAVPMHHYVVNRDT--NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQS 228 (515) Q Consensus 153 ~--~r~~pl~~y~v~~d~--~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e 228 (515) . +++++..+.++..|+ .+++...+|.+.-. . .......+++|+ ++ ....|.. T Consensus 127 ~~~i~~~~p~~~~~i~D~~~~~~~~~~i~~~~~~-d-----------------~~~~~~~~~~y~-----~~-~~~~~~~ 182 (480) T protein:vir:78 127 IPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR-D-----------------DVAVPDRATLYL-----PD-ETVPLRR 182 (480) T ss_pred eeEEEEEcccceEEEEcCCCccceEEEEEEEEee-c-----------------CCcceEEEEEEe-----CC-eEEEEEe Confidence 2 556666665555565 45666655554211 0 000112233332 11 1111111 Q ss_pred eCCe----ee-c--ccCCcccccCcEEEEeeeecCCCccccchHHH-HHHHHHHHHHHHHHHHHHHHHhccCceeecCcc Q lcl|NC_020414. 229 ADDI----PV-G--KENRIKAEKLPFIPLTWKRSYGEDWGRPLVED-YSGDLFVIQFLSEAVARGAALMADIKYLIRPGS 300 (515) Q Consensus 229 ~~~~----~i-~--~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~-~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g 300 (515) .++. .. . .+-+| ..+|++.+..+...+..||+|=..+ ..+-+-.++...-.....++..+.|...+. | T Consensus 183 ~~~~~~~~~~~~~~~~~~~--g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~--G 258 (480) T protein:vir:78 183 NGGLNDQWVVDGDVIKHGL--GVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--G 258 (480) T ss_pred cCCCcccccccccccccCC--CCcceEEeecccccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhh--C Confidence 1111 00 1 12234 3599999998888899999996654 346666677776666667776676643321 1 Q ss_pred ccChhhcc--------CCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHH-----HhhccCCCCC-CC Q lcl|NC_020414. 301 QTDVDHFV--------NSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMM-----ETMTRRDAER-VT 366 (515) Q Consensus 301 ~~~~~~~~--------~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~-----~~l~~~~~~~-~T 366 (515) . .++.+. ....|.+..-..+++...+++ .++++.. ++.++.-|...+.. ..+.. +... -+ T Consensus 259 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~---~~~l~~~i~~~~~~~~~p~~~fg~-~~~n~~S 332 (480) T protein:vir:78 259 V-TTDELTNDGENTTLDIYYGRILTLASEAAKISEFK-AAELRNF---AEEMEVFRKEAASITGLPPQYLSS-SSENPAS 332 (480) T ss_pred C-CccccccccccchhhhhhhhhccCCCCCceEEecC-ccCHHHH---HHHHHHHHHHHhcccCCCHHHhcc-ccCchhH Confidence 1 111110 011121111111223333332 2344433 33344434333211 01111 1111 13 Q ss_pred HHHHHH-------HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeeehHHH--HHHHHHHHHHHHH Q lcl|NC_020414. 367 AVEIQR-------DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTGIEAL--GRMAELDKLANFA 437 (515) Q Consensus 367 AtEi~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~l~~l--~ra~~~~~l~~~~ 437 (515) +.-+.. +++++...++..+.++-. .++.-.+-..+.+..++ .++.-.+. .-++.++.+..+. T Consensus 333 g~Al~~~~~~l~~k~~~~~~~f~~~l~~~~r--------l~~~~~~~~~~~~~~~i-~v~w~~~~~~s~~~~ad~~~kl~ 403 (480) T protein:vir:78 333 AEAIIATDSRIVKMAERKGRIFGGAWERAMR--------IAMQIMGREVTEEYTRL-ETVWRDPSTPTVAAKADAVSKLY 403 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHcCCCccccceee-eEEecCCCCCCHHHHHHHHHHHH Confidence 333322 233444444444333321 11111111222222222 22322221 1222233332222 Q ss_pred HHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhh------hhccc-hhhh Q lcl|NC_020414. 438 QYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVA------KAVPG-VIQQ 510 (515) Q Consensus 438 ~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~------~a~~~-~~~~ 510 (515) + ... ..+.-+. +...+|. ++++++.+.+.+.+.. .....+..+ .+.+. ..|+ T Consensus 404 ~---~g~-------~~~s~et----~~~~lg~------~~d~~~e~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 462 (480) T protein:vir:78 404 A---NGQ-------GPIPKEQ----ARIDLGY------TATQREQMRDWDKQET-EDMIDTLYSTTKAQADATPKPTVTE 462 (480) T ss_pred H---hcc-------cCCCHHH----HHhcCCC------CHhHHHHHHHHHHHHH-HHHHHHhhccccCCCccccCCCCCC Confidence 1 110 0111111 1223443 3555555543222111 111111111 11111 1111 Q ss_pred hhccC Q lcl|NC_020414. 511 EMKEG 515 (515) Q Consensus 511 ~~~~~ 515 (515) ...+. T Consensus 463 ~~~~~ 467 (480) T protein:vir:78 463 TKTET 467 (480) T ss_pred CCCcc Confidence 11111 No 88 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=97.47 E-value=6e-05 Score=43.73 Aligned_cols=425 Identities=12% Similarity=0.070 Sum_probs=180.9 Q ss_pred ccccccHHHHHHHHHHHHHhh--hhHHHHHHHHHHhhcc--------cccCCC--CCCccccccccccHHHHHHHHHHHH Q lcl|NC_020414. 6 LEYGGQRSKIPKLWEKFSKKR--SPYLDRAKHFAKLTLP--------YLMNNK--GDNETSQNGWQGVGAQATNHLANKL 73 (515) Q Consensus 6 ~~~~~~~~~l~~r~~~lk~~R--~~~e~~w~e~~~~~~P--------~~~~~~--~~~~~~~~~~dst~~~a~~~Laa~l 73 (515) |--+.+-++.++.|=+-+..- -.....+..++.=-.+ .++... ..+...+++--+.+...++.+|+-| T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~~~A~ll 80 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHDKLMNSGTGNEIVVVAAEYI 80 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccccccccCChHHHHHHHHHHhh Confidence 777777777777776433211 1122222222211111 011111 1111122332345677777777766 Q ss_pred HHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCC Q lcl|NC_020414. 74 AQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSK 151 (515) Q Consensus 74 ~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~ 151 (515) .+-.. + +.+...+. .+...++++|+ ..+..++|+..+.+.+.+..+.|.+++ |.+.. T Consensus 81 ~~e~~--~---i~v~~~~~---------~d~e~~~~~l~-------~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~ 139 (518) T protein:vir:78 81 SGKPL--S---IDVTGVNG---------SKDENLTKQLK-------EALRIDNFDSKSVKIVELAGGSGVSAVKINILNG 139 (518) T ss_pred cCCCc--e---EEecCccc---------cCcHHHHHHHH-------HHHHhccHHHHHHHHHHHhhccCceEEEEEEECC Confidence 54421 1 33322111 11123444444 457789999999999999999999875 56543 Q ss_pred C-cEEEEEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCC--------- Q lcl|NC_020414. 152 G-AMSAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEG--------- 221 (515) Q Consensus 152 ~-~~r~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~--------- 221 (515) . .+.+++-..|+... .+|++..+..-.+.... .+-.+|+.+++...+ T Consensus 140 ~~~i~~v~ad~~~P~~-~~g~~~~~~f~~~~~~~----------------------~k~~~y~~lE~he~~~~~~~~~~~ 196 (518) T protein:vir:78 140 RPSISVHSSSQFWIDF-KNNEPFRFNFFEEIPTS----------------------NKADIYYLVESREIKQWDKEGKKL 196 (518) T ss_pred eeEEEEEcCCeeEEEe-ecCcEEEEEEEEEeecC----------------------CcceeEEEEEeeccccccceeecc Confidence 2 35566666666543 34766554332222110 011234443332110 Q ss_pred -C----eEEEEEe----------------------CCee--ecccCCcccccCcEEEEeeeec-----CCCccccchHHH Q lcl|NC_020414. 222 -F----WKINQSA----------------------DDIP--VGKENRIKAEKLPFIPLTWKRS-----YGEDWGRPLVED 267 (515) Q Consensus 222 -~----~~~~~e~----------------------~~~~--i~~esgy~~~~~P~~~~Rw~~~-----~g~~YGrgp~~~ 267 (515) . +.+|... ++.. ..-..|. ..|+++...+.. .++.||+|-... T Consensus 197 ~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~e~~~~~tg~---~~~~~~~~~n~~~N~~~~~splG~S~~~~ 273 (518) T protein:vir:78 197 SGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDIQLNHSVSIGL---KSMGAYLINNSPSNTRYPHLNLGESDLSQ 273 (518) T ss_pred cceeEEEEEeeecCcccccccccccccccccccccccCccceeeccCC---ccceEEeeccccccccccCCCcCcchHhh Confidence 0 0111110 0100 0001121 257777766544 356779999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCCCc----------ce--ecCCcc---c----cccccc Q lcl|NC_020414. 268 YSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSGTG----------EV--ITGVEE---D----IHIVQL 328 (515) Q Consensus 268 ~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g----------~~--~~g~~~---~----v~~~~~ 328 (515) +.+-++.||..--++..-... .++...|++ .++.... ..++.+ .+ +.+..+ . +..++ T Consensus 274 ~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~-~~l~~~~-~~~~~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~- 349 (518) T protein:vir:78 274 CTNYLFAVDYFFTVYMREGEK-TKTKIAASE-RMFRKKV-NKSTDKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQ- 349 (518) T ss_pred hhHHHHHHHHHHHHHHHHHHh-CCceeeech-hHhccCC-CCCCCccccccCCCCceEEEecCcCCCCCccccceeeee- Confidence 999999999888777777654 777766643 3332211 011111 01 111111 0 11111 Q ss_pred CCccchH--HHHHHHHHHHHHHHHHH-H-HHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHH---HHHHHHHHHHHH Q lcl|NC_020414. 329 GKYADLT--PISAVLEVYTRRIGVIF-M-METMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFA---MTMQTPIAMWGL 401 (515) Q Consensus 329 ~~~~~l~--~~~~~i~~~~~rI~~af-l-~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~---~E~l~Pli~r~~ 401 (515) .++. .-...++.+-+.|.... + ..++.. ++...|||||..+.+...+.+--.-..+. .+++.-++. ++ T Consensus 350 ---~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~-~~~~~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~-l~ 424 (518) T protein:vir:78 350 ---GDFRDGSYRETMEYFAQKAVSKSGYNPATFNL-GNREVKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLY-LL 424 (518) T ss_pred ---cccChHHHHHHHHHHHHHHHHhhCCChhhcCc-ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH Confidence 1221 12223333333332221 0 112222 33458999999888876554432222221 122222211 12 Q ss_pred Hh----cCCCCChhhccc--eeeee--hHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhc Q lcl|NC_020414. 402 QE----AGDSFTSELVDP--VIVTG--IEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPF 473 (515) Q Consensus 402 ~~----~~~~~p~~~~~~--~~v~~--l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~ 473 (515) .. .+...+.....+ .+=.+ .+...+++.... .++ ++ .+..+.+++.+ ..| T Consensus 425 ~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~------~v~--aG-------imS~e~~i~~~--~~~----- 482 (518) T protein:vir:78 425 TGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNN------MNS--AL-------AMSVEEKVKLI--HPK----- 482 (518) T ss_pred HhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHH------HHh--cC-------CCCHHHHHHHh--CCC----- Confidence 21 111122222222 22111 222222222111 111 11 12334444433 112 Q ss_pred cCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchh-hhhhccC Q lcl|NC_020414. 474 LKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVI-QQEMKEG 515 (515) Q Consensus 474 irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~-~~~~~~~ 515 (515) .+++|++++.++-++.+ .++....+..+ |...++| T Consensus 483 -~~deea~~e~~ri~~E~------~~~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 483 -WEDEEIQAEVKRIYLEN------AIGEVPDPEAIGGMETKGG 518 (518) T ss_pred -CCHHHHHHHHHHHHHHh------cccCCCCCccccCCCCCCC Confidence 25566554433211111 11222333344 4444444 No 89 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=97.45 E-value=6.5e-05 Score=43.53 Aligned_cols=425 Identities=11% Similarity=0.058 Sum_probs=188.4 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcc-----cccCCCC-CCccccccccccHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLP-----YLMNNKG-DNETSQNGWQGVGAQATNHLANKLA 74 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P-----~~~~~~~-~~~~~~~~~dst~~~a~~~Laa~l~ 74 (515) |-.++..+-.+-+.+.+..+.....|.+ +++++.+|..- ....... ......|+..+.+...++..++-|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~---r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:99 31 YDGTESDLLQNVNEVSKYIEHHMDYQRP---RLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred cchhhhhhhccHHHHHHHHHHHHHhhHH---HHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHHhhhc Confidence 5555554444555566655555555544 44555555432 1111111 1112235667777777877776554 Q ss_pred HhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC Q lcl|NC_020414. 75 QVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSKG 152 (515) Q Consensus 75 s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~~ 152 (515) + -|+. ++.++.. +. ..+...+..++|.....++.++..++|.+.+ |.++++ T Consensus 108 g--~p~~-----~~~~d~~-------------~~-------~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~ 160 (511) T protein:vir:99 108 G--NPIQ-----YQDDDKD-------------VL-------EAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (511) T ss_pred c--cCce-----eecCchH-------------HH-------HHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCC Confidence 3 2222 2322221 11 2334456668899999999999999999865 456655 Q ss_pred cE--EEEEcce-EEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEE Q lcl|NC_020414. 153 AM--SAVPMHH-YVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQS 228 (515) Q Consensus 153 ~~--r~~pl~~-y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e 228 (515) .+ ++++..+ |++.-+. .+.+...+|.+..... +....-++++.-.+.++..+.+..+ T Consensus 161 ~~~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~-------------------~~~~~~~~~~~~vyt~~~i~~~~~~ 221 (511) T protein:vir:99 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPI-------------------DKTDEDEVFTVDLFTSHGVYRYLTS 221 (511) T ss_pred ceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeec-------------------ccCccceEEEEEEEeCCcEEEEEec Confidence 44 4454444 4444333 4677666666544210 0001111222222233322222211 Q ss_pred eCC------eeec-ccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccc Q lcl|NC_020414. 229 ADD------IPVG-KENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQ 301 (515) Q Consensus 229 ~~~------~~i~-~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~ 301 (515) -.+ ..+. ..-+| ..+|++.++- +..|+|-.+..++-+..++.+.-......+....|.+.+.-.+. T Consensus 222 ~~~~~~~~~~~~~~~~~~~--g~vPvv~~~n-----n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~ 294 (511) T protein:vir:99 222 RTNGLKLTPRENGFESHSF--ERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN 294 (511) T ss_pred CCccccccccccccccCCC--CccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcc Confidence 111 1111 12233 3578877654 35799999999999999998888888877777777655432222 Q ss_pred cChhhccCCCCccee--------------cCCcccccccccCCccchHHHHHHHHHHHHHHHHHHH-HHhhccCCCCCCC Q lcl|NC_020414. 302 TDVDHFVNSGTGEVI--------------TGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-METMTRRDAERVT 366 (515) Q Consensus 302 ~~~~~~~~~~~g~~~--------------~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl-~~~l~~~~~~~~T 366 (515) .+...+..-..+.++ .+...++..+. ...+.......++.+.+.|...-+ .+.....-+...| T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~--~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~S 372 (511) T protein:vir:99 295 LDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS 372 (511) T ss_pred cCchhhcccccccceecccccccccccccCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccch Confidence 222222111111111 11112233322 233455566777777766643211 1100001112356 Q ss_pred HHHHHHH-------HHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCC-CCChhh--ccceee--eehHHHHHHHHHHHHH Q lcl|NC_020414. 367 AVEIQRD-------ALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGD-SFTSEL--VDPVIV--TGIEALGRMAELDKLA 434 (515) Q Consensus 367 AtEi~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~-~~p~~~--~~~~~v--~~l~~l~ra~~~~~l~ 434 (515) +..+..+ +.++++.++..+.++-. +|..++..... ..+... +.+.+- .+.+.+..++. +. T Consensus 373 g~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~-----li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~---~~ 444 (511) T protein:vir:99 373 GEAMKYKLFGLEQRTKTKEGLFTKGLRRRAK-----LLETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKA---YI 444 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHH---HH Confidence 6655433 23333444433333221 11112222211 112222 233332 12222333222 22 Q ss_pred HHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhcc Q lcl|NC_020414. 435 NFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKE 514 (515) Q Consensus 435 ~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~ 514 (515) ... | .+....+++.+ -+++ -.++|++.+.++++.+.+..+. .....+........+ T Consensus 445 kl~---G-----------iiS~et~l~~l---~~v~----D~~~E~~ri~~E~~~~~~~~~~---~~~~~~~~~~~~~~~ 500 (511) T protein:vir:99 445 DSG---G-----------KISQTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQK---NMYQDPRNINDDEQD 500 (511) T ss_pred HHh---c-----------cCCHHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHhh---cccccCCCCCCCCCC Confidence 111 1 12223333332 1222 1367777777765543322221 111122222222222 Q ss_pred C Q lcl|NC_020414. 515 G 515 (515) Q Consensus 515 ~ 515 (515) + T Consensus 501 ~ 501 (511) T protein:vir:99 501 D 501 (511) T ss_pred C Confidence 2 No 90 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=97.26 E-value=0.00011 Score=42.23 Aligned_cols=425 Identities=11% Similarity=0.015 Sum_probs=175.1 Q ss_pred CCCcccc-----ccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhh--cccccCCCCCCccccccccccHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILE-----YGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLT--LPYLMNNKGDNETSQNGWQGVGAQATNHLANKL 73 (515) Q Consensus 1 ~~~~~~~-----~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~--~P~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l 73 (515) |.+++.. ..++-..+.+..+..+.+|.+....++++|+=- ++.+-..........|+..+-+...++..++-| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~iv~~~~~~l 80 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQLKNYISRFKAEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYITVFEQGYM 80 (489) T ss_pred CCccceeeeCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccCCcceeecchHHHHHHHHhhhh Confidence 5444321 223556777777777666544444444444311 111111111111223566677777777777666 Q ss_pred HHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEe-- Q lcl|NC_020414. 74 AQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKP-- 149 (515) Q Consensus 74 ~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d-- 149 (515) .+ -|+. ++.+++ .+.++|. ..+...+|.....++.++..++|.+.+ |+. T Consensus 81 ~g--~~~~-----~~~~d~-------------~~~~~l~-------~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~ 133 (489) T protein:vir:99 81 LG--VPVE-----YKNENK-------------DLQAAID-------LMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKI 133 (489) T ss_pred cc--CCce-----eecCCh-------------hHHHHHH-------HHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccC Confidence 53 1222 233332 1233333 335567899999999999999998864 432 Q ss_pred --CCCc--EEEEEcceEEEeeCC--CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCe Q lcl|NC_020414. 150 --SKGA--MSAVPMHHYVVNRDT--NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFW 223 (515) Q Consensus 150 --~~~~--~r~~pl~~y~v~~d~--~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~ 223 (515) .+.. +++++..+++...|. .+.+...+|.+... .+ .......+++|+ ++..+ T Consensus 134 ~d~~~~~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~-------~~----------~~~~~~~~~~y~-----~~~i~ 191 (489) T protein:vir:99 134 DDKKTEVKLYQLPAEQTFVIYDDTYQRNSLMAVHFYDID-------YG----------SGKRKQIIKAYT-----SDTIY 191 (489) T ss_pred cCCCcceEEEEEcccceEEEEcCCCCCceEEEEEEEEEe-------cC----------CCceEEEEEEEe-----CCcEE Confidence 2222 445655554444443 34455555544321 00 001111222322 22111 Q ss_pred EEEE---EeCCeeecc--cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecC Q lcl|NC_020414. 224 KINQ---SADDIPVGK--ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRP 298 (515) Q Consensus 224 ~~~~---e~~~~~i~~--esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~ 298 (515) .+.. ..++..+.. .-+| ..+|++.++. +..|+|-.....+-+-.++.+.-...........|.+.+.- T Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~--g~vPvv~~~n-----~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g 264 (489) T protein:vir:99 192 TYEDYNLETKGMRLKDYEGHFF--KGVPVNEYAN-----NEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAG 264 (489) T ss_pred EEEecCCCcccceecccccccC--CceeEEEeec-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhcc Confidence 1111 112222221 2334 4589887764 35688888888888888988888888777777766544421 Q ss_pred cccc--Chhh----ccCCCC-----------cceecCC--------cccccccccCCccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 299 GSQT--DVDH----FVNSGT-----------GEVITGV--------EEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM 353 (515) Q Consensus 299 ~g~~--~~~~----~~~~~~-----------g~~~~g~--------~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl 353 (515) .... .+.. ...... +.++... ..++.. +....+.......++.+...|-..-. T Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--l~~~~~~~~~~~~~~~l~~~i~~~s~ 342 (489) T protein:vir:99 265 NAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYF--LKKEYDTAGSEAYKNRLVADILRFTF 342 (489) T ss_pred CCcccccchhhhhhcccccccccccccccccceeeeeccccCccccccceee--eeecCChHHHHHHHHHHHHHHHHHhC Confidence 1000 0000 000000 1111100 111111 12223444455566666555532211 Q ss_pred -HHhhccCCCCCCCHHHHHH-------HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCC--Ch--hhccceee--e Q lcl|NC_020414. 354 -METMTRRDAERVTAVEIQR-------DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSF--TS--ELVDPVIV--T 419 (515) Q Consensus 354 -~~~l~~~~~~~~TAtEi~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~--p~--~~~~~~~v--~ 419 (515) .+......+...|+..+.. +.++++..++..+.++..= +..++....... .. ..+.+.+- . T Consensus 343 ~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~l-----i~~~~~~~~~~~~~~~~~~~i~v~f~~~~ 417 (489) T protein:vir:99 343 TPDTQDMKFSGVQSGESMKYKLMASDNYREKQERLFKKGLMRRLRL-----AANIWAIKGNEATTYSLVNDTSIVFTPNL 417 (489) T ss_pred CcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHhhcCCccccccccccceEEeCCCC Confidence 0000000112346665533 2566666666665554431 111121111111 11 12333331 1 Q ss_pred ehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 420 GIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEG 499 (515) Q Consensus 420 ~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~ 499 (515) +.+.++.+ +.+.++. | .+....+++.+ -+++.. ..++|++.+.+++.+.++. .++ T Consensus 418 p~d~~~~~---~~~~kl~---g-----------iis~et~~~~l---~~v~~~--d~~~E~~ri~~E~~~~~~~---~~~ 472 (489) T protein:vir:99 418 PQNDNEIV---TAAQNLY---G-----------IVSDQTIFEIL---NTVTGV--DAEAELKRLKEEADKKQSL---PEP 472 (489) T ss_pred CcCHHHHH---HHHHHHh---c-----------cCCHHHHHHhc---CCCCch--hHHHHHHHHHHHHHHHhcc---ccc Confidence 22222222 2222211 1 12233333322 122110 1234444444433222111 111 Q ss_pred hhhhccchhhhhh-ccC Q lcl|NC_020414. 500 VAKAVPGVIQQEM-KEG 515 (515) Q Consensus 500 ~~~a~~~~~~~~~-~~~ 515 (515) ...++.-++.- .+. T Consensus 473 --~~~~~~~~~~~~~~~ 487 (489) T protein:vir:99 473 --RLVGDASGQEEPTAE 487 (489) T ss_pred --cccCCCCCCcCCCCC Confidence 11111111111 111 No 91 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=97.24 E-value=0.00012 Score=42.09 Aligned_cols=449 Identities=12% Similarity=0.072 Sum_probs=189.9 Q ss_pred CCCccccc-cccHH---HHHHHHHHHHHhhhhH--HHHHHHHHHhhcccccCCC--CCCcccc-ccccccHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEY-GGQRS---KIPKLWEKFSKKRSPY--LDRAKHFAKLTLPYLMNNK--GDNETSQ-NGWQGVGAQATNHLAN 71 (515) Q Consensus 1 ~~~~~~~~-~~~~~---~l~~r~~~lk~~R~~~--e~~w~e~~~~~~P~~~~~~--~~~~~~~-~~~dst~~~a~~~Laa 71 (515) |++-+-.. ..... .+..+|+.+ |.-+ ....++...-.||.....+ ....+.+ -.|-+.-.+.++.++. T Consensus 1 m~~~~~~~v~~~h~~y~a~~~~W~~i---rd~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G 77 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQMLPRWHVI---ETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLDTLSG 77 (513) T ss_pred CCCCCCCCCCcCCHHHHHHHHHHHHH---HHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhh Confidence 77654221 11122 234444444 3333 3344555554555422111 1112222 2466666677777765 Q ss_pred HHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHH-HHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeC Q lcl|NC_020414. 72 KLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLAT-IFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPS 150 (515) Q Consensus 72 ~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~-~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~ 150 (515) .++.- ||. .|..+ .+.+.. +++.| -....+++.-+..++.+...+|-+.++||. T Consensus 78 ~vf~k--~p~-~~~~~----------------p~~~~~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~ 132 (513) T protein:vir:97 78 KPFSE--PIK-LNEDV----------------PKAIEETILPDV------DLQGNNLDVFARQWFREGMAKALCHVLIDM 132 (513) T ss_pred hhhhc--Ccc-cCcCc----------------hHHHHHHHhhcc------CCCCCCHHHHHHHHHHHHHhcCeEEEEEec Confidence 55442 332 12111 112333 33333 234567888888899999999999999974 Q ss_pred CC------------------c----EEEEEcce---EEEe-eCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCC Q lcl|NC_020414. 151 KG------------------A----MSAVPMHH---YVVN-RDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCK 204 (515) Q Consensus 151 ~~------------------~----~r~~pl~~---y~v~-~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~ 204 (515) .. + +..|+-.+ +-.. .|..+.+.-+..+++...+ ..|+. T Consensus 133 P~~~~~~~~~~~T~Ade~~~~~rPy~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~---Dgf~~------------ 197 (513) T protein:vir:97 133 PRPAPREDGQPRTLADDRREGLRPYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQ---DGFAE------------ 197 (513) T ss_pred CCCCCccchhHHhHHHHHhhccCceEEEecHhhhcCcceeccCcceeeeeEEEEEEEeec---CCCcc------------ Confidence 31 0 23343322 2222 2334444445445544422 12321 Q ss_pred CcccEEEEEEEEEcCCCCeEEEEEeCC-------eeecccCCcccccCcEEEEeeeecCCCcccc--chHHHHHHHHHHH Q lcl|NC_020414. 205 EDDNVKLYTHAQYAGEGFWKINQSADD-------IPVGKENRIKAEKLPFIPLTWKRSYGEDWGR--PLVEDYSGDLFVI 275 (515) Q Consensus 205 ~~~~v~v~~~v~~~~~~~~~~~~e~~~-------~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGr--gp~~~~l~d~k~L 275 (515) ..++.|.... .+.|.+|-...+ ..+..+++- .+++|++.|....+..+.. .|.. |+..| T Consensus 198 --~~~~q~rvL~---~g~~~v~r~~~~~~~~~~e~~~~~~g~~---~l~~IP~v~~~~~~~~~~~~~pPLl----~LA~l 265 (513) T protein:vir:97 198 --VCKRRIRVLE---PGLVQLWEPVKKSNAQKEEWALADEWAT---GLNYVPLVTFYADRQGFMMGKPPLL----DLAHL 265 (513) T ss_pred --eEEEEEEEEe---CceEEEEEeecCCCccccceEEecCCCC---cCCceeEEEEecCCCCCCCCccchH----HHHHH Confidence 1222222221 234555443221 234445553 4677888777666555433 4533 55555 Q ss_pred HHH---HHHHHHH-HHHhccCceeecCccccC--hhhccCCCCcce-ecCCcccccccccCCccchHHHHHHHHHHHHHH Q lcl|NC_020414. 276 QFL---SEAVARG-AALMADIKYLIRPGSQTD--VDHFVNSGTGEV-ITGVEEDIHIVQLGKYADLTPISAVLEVYTRRI 348 (515) Q Consensus 276 ~~l---~~~~~~~-~~~a~~p~~l~~~~g~~~--~~~~~~~~~g~~-~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI 348 (515) |.- ..+-++. +..+..|. ++-. |..+ .+.+.-+.+..+ .|+..+....++. .+..+......++++++.+ T Consensus 266 n~~hy~~~Sd~~~il~~~~~P~-l~~~-G~~~~~~~~i~iG~~~~~~lpe~~~~~~yie~-~g~~i~~~~~~l~~le~qm 342 (513) T protein:vir:97 266 NVAHWQSASDQRHILTVSRFPI-LACS-GASGEDSDPVVVGPNKVLYNPDPAGRFYYVEH-TGQAIAAGRTDLKDLEEQM 342 (513) T ss_pred HHHHHhhhhhHHHHHHhcccce-eeee-cCCcCCCCceEeeccccccCCCCCCcceeecc-CchhHHHHHHHHHHHHHHH Confidence 532 2222233 33444443 3322 2211 123333333322 3332334444443 3567888888999999988 Q ss_pred HHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH-hcCCCCChhhccceee-eeh----- Q lcl|NC_020414. 349 GVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ-EAGDSFTSELVDPVIV-TGI----- 421 (515) Q Consensus 349 ~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~-~~~~~~p~~~~~~~~v-~~l----- 421 (515) +++= ..++ .......||++.+.+.+..-..|.-+...+..-+..-|-..... +..+.-....++..+. ..+ T Consensus 343 ~~~G-a~ll-~~~~~~~Ta~a~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~wlg~~~~~~~v~in~dF~~~~~~~~~~ 420 (513) T protein:vir:97 343 AGYG-AEFL-KRKTGGQTATARALDSAEATSDLSAMTGLFEDALAQALDITADWLRLGPNGGTVELVKDYDLEEMDAPGL 420 (513) T ss_pred HHHH-HHhh-ccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCccEEEeccccCcccCCHHHH Confidence 7653 2222 33344589999999999999999988777766544333222211 2222211222333332 122 Q ss_pred HHHHHHHHHHHH--HHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_020414. 422 EALGRMAELDKL--ANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQE-AMLNE 498 (515) Q Consensus 422 ~~l~ra~~~~~l--~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~-~~~~~ 498 (515) .++.++.....| ..+...+..--.++|+..+...++++.+.+....|..---.....++. .+..+. ..-.. T Consensus 421 ~al~~a~~~G~is~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~------~~~~~~~~~~~~ 494 (513) T protein:vir:97 421 QALQVAREKRDISRKTYLNGLRLRGVLPEDFDEDEDWEELMEEISEAMGRAGLDLDPAQKNP------PEGGEGEGEGEG 494 (513) T ss_pred HHHHHHHhCCCCCHHHHHHHHHhccCCCccCCHHHHHHHHHHhhhhccCCCCccccccCCCC------CCCCCCCCCCCC Confidence 112222221111 112222222112233222111223333333322221100000000000 000000 00000 Q ss_pred HhhhhccchhhhhhccC Q lcl|NC_020414. 499 GVAKAVPGVIQQEMKEG 515 (515) Q Consensus 499 ~~~~a~~~~~~~~~~~~ 515 (515) ..+.-++++.|+.+.-| T Consensus 495 ~~~~~~~~~~~~~~~~~ 511 (513) T protein:vir:97 495 EGGEGGEGGEGGGNPGG 511 (513) T ss_pred CCCCCCCccccCCCCCC Confidence 11112333444444444 No 92 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=97.19 E-value=0.00014 Score=41.79 Aligned_cols=413 Identities=11% Similarity=0.090 Sum_probs=177.4 Q ss_pred CCCc-----------cccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcc-----cccCC---CCC---Cccccccc Q lcl|NC_020414. 1 MQDT-----------ILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLP-----YLMNN---KGD---NETSQNGW 58 (515) Q Consensus 1 ~~~~-----------~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P-----~~~~~---~~~---~~~~~~~~ 58 (515) |-.| +...-.+.+.|.+..+..+. | ..+.+++.+|..- .+-.. .+. +....++. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~---~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~ 82 (474) T protein:vir:96 7 MPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQ-K---LKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRIT 82 (474) T ss_pred CCCCCCCCcchhhhccccccchHHHHHHHHHHHHH-H---HHHHHHHHHHhcccCccccccchhhhcccccccccccccc Confidence 1000 11111244445555554443 2 2333444444322 11100 000 11112445 Q ss_pred cccHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHH Q lcl|NC_020414. 59 QGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHL 138 (515) Q Consensus 59 dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl 138 (515) .+-+...++..++-|++ -|+ .++..+... ...++.| + .++|...+.++.++. T Consensus 83 ~n~~k~Iv~~~~~yl~g--~p~-----~~~~~~~~~---------~~~l~~~-----------~-~n~~~~~~~~l~~~~ 134 (474) T protein:vir:96 83 TNFHQNLVDQKVSYVAG--KPV-----TYAHDDDKV---------LDVIHQV-----------L-DTRWDNKLIDILTAA 134 (474) T ss_pred cchHHHHHHhhhhhhcc--cCc-----eeccCChHH---------HHHHHHH-----------H-hccHHHHHHHHHHHH Confidence 56666666666665544 222 233333211 1112222 2 368999999999999 Q ss_pred HhhCceEE--EEeCCCcEE--EEEcce-EEEeeC-CCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEE Q lcl|NC_020414. 139 IVAGNCLL--YKPSKGAMS--AVPMHH-YVVNRD-TNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLY 212 (515) Q Consensus 139 ~~~G~~~l--~~d~~~~~r--~~pl~~-y~v~~d-~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~ 212 (515) ..+|.+.+ |.++++.++ +++..+ |++--| ..+.+.-.+|.++.. ....+++| T Consensus 135 ~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~----------------------~~~~~~vy 192 (474) T protein:vir:96 135 SNKGIDWLQVYINEDGELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFN----------------------GETKVEYW 192 (474) T ss_pred hhCCeEEEEeeeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeec----------------------CeeEEEEE Confidence 99999875 456665444 454444 545433 357777777665421 01123444 Q ss_pred EEE---E--EcCCCCeEEEEEeCCe-eec-ccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 213 THA---Q--YAGEGFWKINQSADDI-PVG-KENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARG 285 (515) Q Consensus 213 ~~v---~--~~~~~~~~~~~e~~~~-~i~-~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~ 285 (515) +.- + ....++.......+.. .+. ..-+| ..+|++.++. +.+|.|=.+..++-+-.++.+--..... T Consensus 193 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~vPvv~~~n-----n~~~~~d~e~v~~liDa~d~~~S~~~~~ 265 (474) T protein:vir:96 193 TAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSW--ERVPFIAFKN-----NPEEVSDIWMYKSFVDAIDKRLSDVQNM 265 (474) T ss_pred eCCeEEEEEEcCCceeeccccccccccCcccccCC--CccceEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHH Confidence 320 0 1111111111111111 111 12244 3478876653 4679998899999999999888888888 Q ss_pred HHHhccCceeecCccccChhhccC-C-CCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHH-HHhhccCCC Q lcl|NC_020414. 286 AALMADIKYLIRPGSQTDVDHFVN-S-GTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-METMTRRDA 362 (515) Q Consensus 286 ~~~a~~p~~l~~~~g~~~~~~~~~-~-~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl-~~~l~~~~~ 362 (515) .+....|.+.+.--+.-+...... . ..+.+..+..+++..+. ...+.......++.++..|...-. .+......+ T Consensus 266 ~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~ 343 (474) T protein:vir:96 266 FDESVELIYILRGYEGEDLSEFMEGLKYYKAINVSSDGGVETIQ--VEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFG 343 (474) T ss_pred HHHhhcchhhhcCCCcccccchhhhhhccceeeccCCCceeEEe--ccCCHHHHHHHHHHHHHHHHHHhCCcCccccccc Confidence 888888865442211111111111 1 11222223334444443 234566667777777776644321 111101112 Q ss_pred CCCCHHHHH-------HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee--eeehHHHHHHHHHHHH Q lcl|NC_020414. 363 ERVTAVEIQ-------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI--VTGIEALGRMAELDKL 433 (515) Q Consensus 363 ~~~TAtEi~-------~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~--v~~l~~l~ra~~~~~l 433 (515) ...|+..+. .++.+++..++..+.++-. .+..-.+-......+.+.+ ..+.+.+..++.+ T Consensus 344 ~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~--------~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~~--- 412 (474) T protein:vir:96 344 SATSGIALKFLYTNLNLKANKLKNKANVALQELMQ--------FILDFNKIKLDAKEIEITFNFNVMVNDLEQSQIG--- 412 (474) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHhCCCcccceeeEEecCCCccCHHHHHHHH--- Confidence 234554443 2334455555554444322 2222111122222333333 2223333333211 Q ss_pred HHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhc Q lcl|NC_020414. 434 ANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMK 513 (515) Q Consensus 434 ~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~ 513 (515) .. ++ .+.-..++..+ -+++ -.++|++.+.+++.++++..+ ...++...+.-+ T Consensus 413 -------~~-~g-------iiS~et~~~~l---p~v~----D~~~E~eri~~E~~~~~~~~~------~~~~~~~~~~~~ 464 (474) T protein:vir:96 413 -------AQ-SQ-------YLSKETLVRHH---PWVD----DPKAELERLDEEQLELNKQLP------NLDDGGADGAQQ 464 (474) T ss_pred -------HH-cC-------CCChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHhhcc------ccccccCCCCCC Confidence 11 11 12222222222 1111 135666666665543322211 111122222111 Q ss_pred cC Q lcl|NC_020414. 514 EG 515 (515) Q Consensus 514 ~~ 515 (515) ++ T Consensus 465 ~~ 466 (474) T protein:vir:96 465 QQ 466 (474) T ss_pred cC Confidence 11 No 93 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=97.19 E-value=0.00014 Score=41.79 Aligned_cols=413 Identities=11% Similarity=0.090 Sum_probs=177.4 Q ss_pred CCCc-----------cccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcc-----cccCC---CCC---Cccccccc Q lcl|NC_020414. 1 MQDT-----------ILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLP-----YLMNN---KGD---NETSQNGW 58 (515) Q Consensus 1 ~~~~-----------~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P-----~~~~~---~~~---~~~~~~~~ 58 (515) |-.| +...-.+.+.|.+..+..+. | ..+.+++.+|..- .+-.. .+. +....++. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~---~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~ 82 (474) T protein:vir:95 7 MPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQ-K---LKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRIT 82 (474) T ss_pred CCCCCCCCcchhhhccccccchHHHHHHHHHHHHH-H---HHHHHHHHHHhcccCccccccchhhhcccccccccccccc Confidence 1000 11111244445555554443 2 2333444444322 11100 000 11112445 Q ss_pred cccHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHH Q lcl|NC_020414. 59 QGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHL 138 (515) Q Consensus 59 dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl 138 (515) .+-+...++..++-|++ -|+ .++..+... ...++.| + .++|...+.++.++. T Consensus 83 ~n~~k~Iv~~~~~yl~g--~p~-----~~~~~~~~~---------~~~l~~~-----------~-~n~~~~~~~~l~~~~ 134 (474) T protein:vir:95 83 TNFHQNLVDQKVSYVAG--KPV-----TYAHDDDKV---------LDVIHQV-----------L-DTRWDNKLIDILTAA 134 (474) T ss_pred cchHHHHHHhhhhhhcc--cCc-----eeccCChHH---------HHHHHHH-----------H-hccHHHHHHHHHHHH Confidence 56666666666665544 222 233333211 1112222 2 368999999999999 Q ss_pred HhhCceEE--EEeCCCcEE--EEEcce-EEEeeC-CCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEE Q lcl|NC_020414. 139 IVAGNCLL--YKPSKGAMS--AVPMHH-YVVNRD-TNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLY 212 (515) Q Consensus 139 ~~~G~~~l--~~d~~~~~r--~~pl~~-y~v~~d-~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~ 212 (515) ..+|.+.+ |.++++.++ +++..+ |++--| ..+.+.-.+|.++.. ....+++| T Consensus 135 ~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~----------------------~~~~~~vy 192 (474) T protein:vir:95 135 SNKGIDWLQVYINEDGELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFN----------------------GETKVEYW 192 (474) T ss_pred hhCCeEEEEeeeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeec----------------------CeeEEEEE Confidence 99999875 456665444 454444 545433 357777777665421 01123444 Q ss_pred EEE---E--EcCCCCeEEEEEeCCe-eec-ccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 213 THA---Q--YAGEGFWKINQSADDI-PVG-KENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARG 285 (515) Q Consensus 213 ~~v---~--~~~~~~~~~~~e~~~~-~i~-~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~ 285 (515) +.- + ....++.......+.. .+. ..-+| ..+|++.++. +.+|.|=.+..++-+-.++.+--..... T Consensus 193 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~vPvv~~~n-----n~~~~~d~e~v~~liDa~d~~~S~~~~~ 265 (474) T protein:vir:95 193 TAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSW--ERVPFIAFKN-----NPEEVSDIWMYKSFVDAIDKRLSDVQNM 265 (474) T ss_pred eCCeEEEEEEcCCceeeccccccccccCcccccCC--CccceEEecC-----CCCCCCchHHHHHHHHHHHHHHHHHHHH Confidence 320 0 1111111111111111 111 12244 3478876653 4679998899999999999888888888 Q ss_pred HHHhccCceeecCccccChhhccC-C-CCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHH-HHhhccCCC Q lcl|NC_020414. 286 AALMADIKYLIRPGSQTDVDHFVN-S-GTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-METMTRRDA 362 (515) Q Consensus 286 ~~~a~~p~~l~~~~g~~~~~~~~~-~-~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl-~~~l~~~~~ 362 (515) .+....|.+.+.--+.-+...... . ..+.+..+..+++..+. ...+.......++.++..|...-. .+......+ T Consensus 266 ~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~ 343 (474) T protein:vir:95 266 FDESVELIYILRGYEGEDLSEFMEGLKYYKAINVSSDGGVETIQ--VEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFG 343 (474) T ss_pred HHHhhcchhhhcCCCcccccchhhhhhccceeeccCCCceeEEe--ccCCHHHHHHHHHHHHHHHHHHhCCcCccccccc Confidence 888888865442211111111111 1 11222223334444443 234566667777777776644321 111101112 Q ss_pred CCCCHHHHH-------HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee--eeehHHHHHHHHHHHH Q lcl|NC_020414. 363 ERVTAVEIQ-------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI--VTGIEALGRMAELDKL 433 (515) Q Consensus 363 ~~~TAtEi~-------~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~--v~~l~~l~ra~~~~~l 433 (515) ...|+..+. .++.+++..++..+.++-. .+..-.+-......+.+.+ ..+.+.+..++.+ T Consensus 344 ~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~--------~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~~--- 412 (474) T protein:vir:95 344 SATSGIALKFLYTNLNLKANKLKNKANVALQELMQ--------FILDFNKIKLDAKEIEITFNFNVMVNDLEQSQIG--- 412 (474) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHhCCCcccceeeEEecCCCccCHHHHHHHH--- Confidence 234554443 2334455555554444322 2222111122222333333 2223333333211 Q ss_pred HHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhc Q lcl|NC_020414. 434 ANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMK 513 (515) Q Consensus 434 ~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~ 513 (515) .. ++ .+.-..++..+ -+++ -.++|++.+.+++.++++..+ ...++...+.-+ T Consensus 413 -------~~-~g-------iiS~et~~~~l---p~v~----D~~~E~eri~~E~~~~~~~~~------~~~~~~~~~~~~ 464 (474) T protein:vir:95 413 -------AQ-SQ-------YLSKETLVRHH---PWVD----DPKAELERLDEEQLELNKQLP------NLDDGGADGAQQ 464 (474) T ss_pred -------HH-cC-------CCChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHhhcc------ccccccCCCCCC Confidence 11 11 12222222222 1111 135666666665543322211 111122222111 Q ss_pred cC Q lcl|NC_020414. 514 EG 515 (515) Q Consensus 514 ~~ 515 (515) ++ T Consensus 465 ~~ 466 (474) T protein:vir:95 465 QQ 466 (474) T ss_pred cC Confidence 11 No 94 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=97.15 E-value=0.00015 Score=41.55 Aligned_cols=424 Identities=12% Similarity=0.056 Sum_probs=183.0 Q ss_pred CCCcc------ccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc---ccCC-CCCCccccccccccHHHHHHHHH Q lcl|NC_020414. 1 MQDTI------LEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY---LMNN-KGDNETSQNGWQGVGAQATNHLA 70 (515) Q Consensus 1 ~~~~~------~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~---~~~~-~~~~~~~~~~~dst~~~a~~~La 70 (515) ||=-. .....+.+.|.+..+.++.+ .++++.+.+|.... +... ........|+-.+-+...++..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~----~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~Iv~~~~ 76 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPNIEAINYAIRELQNR----KKRLDKLSDYYNGKQEIEKHEFDNATVEAANVMVNHAKYITDMNV 76 (499) T ss_pred CccchhhhHHhhhhcCCHHHHHHHHHHHHHH----HHHHHHHHHHhccccchhcCCcCcCCCCcceeecchHHHHHHHHh Confidence 44211 11133444555555555432 34445555554442 1111 11112223455566666777776 Q ss_pred HHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EE Q lcl|NC_020414. 71 NKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YK 148 (515) Q Consensus 71 a~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~ 148 (515) +-|.+- |+ +++..+... .. .+...+..++|.....++.++...+|.+.+ |. T Consensus 77 ~~l~g~--p~-----~~~~~~~~~---------~~-----------~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~ 129 (499) T protein:vir:10 77 GFMTGN--PV-----KYVAEKGKN---------ID-----------DILEVFNQIDIHKHDIELEKDLSVFGYGYELLYL 129 (499) T ss_pred hhhccc--Cc-----eeecCChhH---------HH-----------HHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEe Confidence 655432 22 223322111 11 133345567899999999999999999865 44 Q ss_pred eCCCc-------------------EEEE-EcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCccc Q lcl|NC_020414. 149 PSKGA-------------------MSAV-PMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDN 208 (515) Q Consensus 149 d~~~~-------------------~r~~-pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~ 208 (515) +.++. ++++ |..-|.+.-|..++....+.++..+.. . ........ T Consensus 130 ~~~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~----------~-----~~~~~~~~ 194 (499) T protein:vir:10 130 KKTDPISVRDELGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKD----------L-----EGNTNGYS 194 (499) T ss_pred cccccccccccccccccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEee----------c-----CCCceEEE Confidence 54432 2233 344566655555544333333221100 0 00011112 Q ss_pred EEEEEEEEEcCCCCeEEEEE------eCCeeecc-cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 209 VKLYTHAQYAGEGFWKINQS------ADDIPVGK-ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEA 281 (515) Q Consensus 209 v~v~~~v~~~~~~~~~~~~e------~~~~~i~~-esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~ 281 (515) +++|+ ++..+.+... .++..+.. .-+| ..+|++.++- +.+|.|=.+...+-+..++.+--. T Consensus 195 ~~iyt-----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~--g~vPvv~~~n-----~~~~~~d~e~v~~liD~~~~~~S~ 262 (499) T protein:vir:10 195 ITVYM-----PQRIVEYRTKTTMEVSANDPIVYDGENLF--GAVPIIEFRN-----NEERQGDFEQLISLIDAYNLLQTD 262 (499) T ss_pred EEEEe-----CCeEEEEEecCCccccCcceecccccCCC--CccceEEecC-----CCCCCCchHhHHHHHHHHHHHHHH Confidence 22332 2211111100 11122222 1234 4588887654 467899888999999999988888 Q ss_pred HHHHHHHhccCceeecCccccC-hhhccCCCCcce-ec-CC-cccccccccCCccchHHHHHHHHHHHHHHHHHHHH-Hh Q lcl|NC_020414. 282 VARGAALMADIKYLIRPGSQTD-VDHFVNSGTGEV-IT-GV-EEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMM-ET 356 (515) Q Consensus 282 ~~~~~~~a~~p~~l~~~~g~~~-~~~~~~~~~g~~-~~-g~-~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~-~~ 356 (515) .....+....|.+.+.-..... .+.......|.+ .. +. ..+++.+ ....+.......++.+.+.|.+.-.. +. T Consensus 263 ~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l--~~~~~~~~~~~~~~~l~~~I~~~s~~p~~ 340 (499) T protein:vir:10 263 RISDKEAFVDALLVTFGFGLGDDKDDIQRLKRGAIEAPPREEGADIEWL--TKSFDETQVNLLSQSIENDIHKISYVPNM 340 (499) T ss_pred HHHHHHHhcCceeeeecCccccccchhhhhhhcceeccCCCCCCcceEE--eccCCHHHHHHHHHHHHHHHHHHhCcccC Confidence 8888888888876654221111 111111122222 11 11 1223333 33456677788888888877553211 10 Q ss_pred hccCCCCCCCHHHHH-------HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceee--eehHHHHHH Q lcl|NC_020414. 357 MTRRDAERVTAVEIQ-------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIV--TGIEALGRM 427 (515) Q Consensus 357 l~~~~~~~~TAtEi~-------~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v--~~l~~l~ra 427 (515) ....-+...|+..+. .+..++...++..+.++..=++ .++....-......+.+.+- .+.+.+..+ T Consensus 341 ~~~~~~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~-----~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~~ 415 (499) T protein:vir:10 341 NDEKFMGNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQ-----TIVNIKGANDDASGCKISLVANIPSNLSDVV 415 (499) T ss_pred CchhhcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhccCCccccccceEEeCCCCCCCHHHHH Confidence 000112234666653 3345555666655544433222 11221111111122333331 222222222 Q ss_pred HHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-c Q lcl|NC_020414. 428 AELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVP-G 506 (515) Q Consensus 428 ~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~-~ 506 (515) +.+.++ ++ .+....+++.+ -+++ -+++|++.+.+++++.....+ +......+ . T Consensus 416 ~~~~kl----------~g-------~iS~et~~~~l---~~v~----d~~~E~~ri~~E~~~~~~~~~--~~~~~~~~~~ 469 (499) T protein:vir:10 416 NNVKNA----------DG-------IIPRKYTYSWL---PDVD----NPQDVIDEMNQQDAETIKKNQ--EALRGQDPDR 469 (499) T ss_pred HHHHHH----------hc-------cCChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHH--hhhccCCCCC Confidence 222222 11 12223333222 1222 145677777665544322222 11111110 1 Q ss_pred hhhhhhccC Q lcl|NC_020414. 507 VIQQEMKEG 515 (515) Q Consensus 507 ~~~~~~~~~ 515 (515) +..+...++ T Consensus 470 ~~~~~~~~~ 478 (499) T protein:vir:10 470 LELEDKQDD 478 (499) T ss_pred CCCCCCCcc Confidence 101111111 No 95 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=97.12 E-value=0.00016 Score=41.37 Aligned_cols=425 Identities=13% Similarity=0.129 Sum_probs=170.4 Q ss_pred CCCcc--------ccccccHHHHHHHHH----HHHHhhhhHHHHHHHHHHhhccccc---CCC-CCCccccccccccHHH Q lcl|NC_020414. 1 MQDTI--------LEYGGQRSKIPKLWE----KFSKKRSPYLDRAKHFAKLTLPYLM---NNK-GDNETSQNGWQGVGAQ 64 (515) Q Consensus 1 ~~~~~--------~~~~~~~~~l~~r~~----~lk~~R~~~e~~w~e~~~~~~P~~~---~~~-~~~~~~~~~~dst~~~ 64 (515) |=+-- -+.+ -++++..... .+..+.......|+.+|.=--|-+. ... +.....+++--+.+.. T Consensus 1 m~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~~ 79 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMG-LLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNLPKV 79 (499) T ss_pred ChhHHHHHHHHHHHHhc-cccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecchHHH Confidence 10000 0000 0001111100 0111222445566666542112111 111 1111122333466667 Q ss_pred HHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCce Q lcl|NC_020414. 65 ATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNC 144 (515) Q Consensus 65 a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~ 144 (515) .++.+|+-|++- |++ ++.++. +..++|. ..+..++|...+.++..+....|.+ T Consensus 80 iv~~~a~~l~~e--p~~-----i~~~d~-------------~~~e~l~-------~~~~~n~f~~~~~~~~~~a~~~G~~ 132 (499) T protein:vir:80 80 TAKYMSKLLFNE--KVK-----INIDDE-------------TAEEFVL-------NVLKTNGFTKNMERYIEYGEAMGGF 132 (499) T ss_pred HHHHHHHhhhCC--cce-----EeeCCH-------------HHHHHHH-------HHHhhccHHHHHHHHHHHHhhcCcE Confidence 777777655443 222 333332 2333433 3455678999999999999999998 Q ss_pred EE--EEeCCCc--EEEEEcceEE-EeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEE--- Q lcl|NC_020414. 145 LL--YKPSKGA--MSAVPMHHYV-VNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQ--- 216 (515) Q Consensus 145 ~l--~~d~~~~--~r~~pl~~y~-v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~--- 216 (515) ++ |.|.+.. +..++-.+++ +..| .|++..+.....++.+ ++ .|+.++ T Consensus 133 ~~~~~~D~~~~~~i~~v~a~~~~Pi~~d-~~~~~~~~f~~~~~~~------------------~~------~y~~lE~h~ 187 (499) T protein:vir:80 133 VIKVYHDGNKNVKVSFATADCMYPLSND-SENVDECLIANSFHKN------------------NK------YYKLLEWNE 187 (499) T ss_pred EEEEEECCCCcEEEEEEcCCceEEEEec-CCCeEEEEEEEEEeec------------------Ce------EEEEEEEEE Confidence 76 5666554 5567777765 4555 5777665544443321 00 122221 Q ss_pred EcCCC--CeE----EEEEeC----Ceee-c-------cc----CCcccccCcEEEEe----eeecCCCccccchHHHHHH Q lcl|NC_020414. 217 YAGEG--FWK----INQSAD----DIPV-G-------KE----NRIKAEKLPFIPLT----WKRSYGEDWGRPLVEDYSG 270 (515) Q Consensus 217 ~~~~~--~~~----~~~e~~----~~~i-~-------~e----sgy~~~~~P~~~~R----w~~~~g~~YGrgp~~~~l~ 270 (515) +.+.. .+. +|...+ |..+ + .+ .|+ ...||+.++ .++..++.+|+|-...+.+ T Consensus 188 ~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~--~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~ 265 (499) T protein:vir:80 188 WKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPSL--TRPTFIYIKPNIANNKNLTSPLGISVYANALD 265 (499) T ss_pred ecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCceeecCC--CccceEeecCCccccccCCCccCCchHhhHHH Confidence 11111 111 111111 1111 0 01 122 224555443 3445688899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhccCceeecCccccChh-hccCC-------CCc--ceecCCccccc-ccccCCccch--HHH Q lcl|NC_020414. 271 DLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVD-HFVNS-------GTG--EVITGVEEDIH-IVQLGKYADL--TPI 337 (515) Q Consensus 271 d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~-~~~~~-------~~g--~~~~g~~~~v~-~~~~~~~~~l--~~~ 337 (515) -+..|+..--......+. .+..+.++ ..++.+. +.... ... ..+.+..++.+ .++. -..++ ..- T Consensus 266 lid~lD~~~s~~~~e~~~-~~~~i~v~-~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~-~~~~ir~e~~ 342 (499) T protein:vir:80 266 TLKTLDLMFDSYYQEFKL-GKKKVLVP-SSFVKTAVNLDGSTTQYFDSTDEAFFLYQGEQDDNGKAIKD-ISVEIRSTEF 342 (499) T ss_pred HHHHHHHHHHHHHHHHHh-cccceecc-hhhhhccCCCCCCcccCCCcccceeeEeeccCCCCcCceeE-ecCcCChHHH Confidence 999999887777666544 45555553 2233221 11000 000 01122211111 1110 01122 112 Q ss_pred HHHHHHHHHHHHHHH-H-HHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH--h---c--CCCC Q lcl|NC_020414. 338 SAVLEVYTRRIGVIF-M-METMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ--E---A--GDSF 408 (515) Q Consensus 338 ~~~i~~~~~rI~~af-l-~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~--~---~--~~~~ 408 (515) ...++.+.+.|.... + ..++........|||||..+.+...+...-.-..+ ..-|..|++-++. . . +... T Consensus 343 ~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~~~~~~~-~~~l~~l~~~il~~~~~~~~~~~~~~ 421 (499) T protein:vir:80 343 IESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLI-EQGIKEMIVSILEVGKLIKAYDGDTV 421 (499) T ss_pred HHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhccccCCCC Confidence 233333333332221 0 11122223344699999988877777655422222 2233444433321 1 1 1112 Q ss_pred Chhhccceeee--ehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHH Q lcl|NC_020414. 409 TSELVDPVIVT--GIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQ 486 (515) Q Consensus 409 p~~~~~~~~v~--~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~ 486 (515) +...+.+.+=. ..+..+.++. ..+.++ +++ +.... .++...|+ |++|++++.++ T Consensus 422 ~~~~v~v~f~d~i~~d~~~~~~~------~~~~~~--~Gi-------~S~et---~l~~~~~~------~d~ea~~el~~ 477 (499) T protein:vir:80 422 ELDTITVDFDDSIAQDEDTTINR------YTTAKN--QGM-------IPLKI---ALQRAWNI------TEAEADEWAEM 477 (499) T ss_pred CccceEEEeCCCCCCCHHHHHHH------HHHHHH--cCC-------CCHHH---HHhhcCCC------ChHHHHHHHHH Confidence 22223333311 1122222111 111111 111 11111 23344454 45555444332 Q ss_pred HHHHHHHHHHHHHhhhhccchhhhhhc Q lcl|NC_020414. 487 QAQAQQEAMLNEGVAKAVPGVIQQEMK 513 (515) Q Consensus 487 ~~~~~q~~~~~~~~~~a~~~~~~~~~~ 513 (515) .++.+ .+. ...-..+++.|+-. T Consensus 478 i~~E~-~~~----~~~~d~~g~~ge~e 499 (499) T protein:vir:80 478 LAKEK-QAE----IPNNDMTGIFGEEE 499 (499) T ss_pred HHHHh-hcC----CCCCCccccCCCCC Confidence 22111 100 00001112222222 No 96 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=97.01 E-value=0.00021 Score=40.73 Aligned_cols=377 Identities=11% Similarity=0.021 Sum_probs=173.1 Q ss_pred ccHHHHHHHHHHHHHhhhhHHHHHHHHHHhh--cccccCCCCCC-ccccccccccHHHHHHHHHHHHHHhhcCCCCCcee Q lcl|NC_020414. 10 GQRSKIPKLWEKFSKKRSPYLDRAKHFAKLT--LPYLMNNKGDN-ETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFR 86 (515) Q Consensus 10 ~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~--~P~~~~~~~~~-~~~~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFr 86 (515) .+...|....+++..++ +......++|+-- +|.+...-... ...-+..-+-+..++++||..|. ..+ |+ T Consensus 1 ~~~~~i~~L~~~~~~~~-~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~----~~G---f~ 72 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHK-RRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLV----FRE---FE 72 (409) T ss_pred CCHHHHHHHHHHHHHHh-HHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcc----cCc---cc Confidence 57777777777775544 3333333333321 11111100000 00002233455566666655432 122 11 Q ss_pred cCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEe--CCC--cEEEEEcceE Q lcl|NC_020414. 87 VDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKP--SKG--AMSAVPMHHY 162 (515) Q Consensus 87 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d--~~~--~~r~~pl~~y 162 (515) ..| .++ +....+++|.....++.++..++|.+.+++- ++. .+++++-.+. T Consensus 73 --~~d-------------~~l-----------~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~~i~~~sp~~~ 126 (409) T protein:vir:94 73 --NDD-------------FTV-----------NEIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAVRLQVIEAVNA 126 (409) T ss_pred --CCc-------------hHH-----------HHHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCceEEEEeccceE Confidence 111 112 2345678999999999999999999987763 332 3666766554 Q ss_pred EEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCcc Q lcl|NC_020414. 163 VVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRIK 241 (515) Q Consensus 163 ~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy~ 241 (515) ++..|+ .+++...++...- + .........+| .++..+.++.. ++.......+| T Consensus 127 ~~i~D~~~~~~~~a~~~~~~-----------d--------~~~~~~~~~~~-----~~~~~~~~~~~-~~~~~~~~n~~- 180 (409) T protein:vir:94 127 TGIIDPITGLLTEGYAVLER-----------D--------ENNNVVLEAHF-----LPDRTDYYYRD-SRNNISIANPT- 180 (409) T ss_pred EEEEecCCCceeeeEEEEEe-----------c--------CCCceEEEEEE-----ecCcEEEEEec-CceeEeeeCCC- Confidence 444455 3555544443210 0 00000011111 11111111111 11111123355 Q ss_pred cccCcEEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHhccCceee---cCccccChhhccCCCCccee- Q lcl|NC_020414. 242 AEKLPFIPLTWKRSYGEDWGRPLV-EDYSGDLFVIQFLSEAVARGAALMADIKYLI---RPGSQTDVDHFVNSGTGEVI- 316 (515) Q Consensus 242 ~~~~P~~~~Rw~~~~g~~YGrgp~-~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~---~~~g~~~~~~~~~~~~g~~~- 316 (515) ..||++.+..+...++.||+|-. +..++-+..+|+..-..+..++..+.|-..+ .+++.. .+.+ ....+.+. T Consensus 181 -g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~-~~~~-~~~~~~i~~ 257 (409) T protein:vir:94 181 -GHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEP-METW-KATVSSMLQ 257 (409) T ss_pred -CCcceEEeccccccccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCcc-cchh-hhhHHHhhc Confidence 35999999999889999999965 4577778889999888899999988884322 333311 1111 11223332 Q ss_pred -cCCccc--ccccccCCccchHHHHHHHHHHHHHHHHHHHHH-----hhccCCCCCCCHHHH-------HHHHHHHHHHh Q lcl|NC_020414. 317 -TGVEED--IHIVQLGKYADLTPISAVLEVYTRRIGVIFMME-----TMTRRDAERVTAVEI-------QRDALEIEQNM 381 (515) Q Consensus 317 -~g~~~~--v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~-----~l~~~~~~~~TAtEi-------~~r~~E~~~~L 381 (515) |...++ +..-++ ..++++.. ++.++.-|+...+.. .+.....-.-+|.-| ..++++|.+.+ T Consensus 258 ~~~d~dg~~~~v~q~-~~~~l~~~---~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~f 333 (409) T protein:vir:94 258 FTKDEDGDKPTLGQF-TQPSMSPF---TEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSL 333 (409) T ss_pred CCCCCCCCCceEEec-CCCChhHH---HHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 222111 222233 23455543 334444333332211 111101111234333 23556777777 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee----eeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHH Q lcl|NC_020414. 382 GGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI----VTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWG 457 (515) Q Consensus 382 Gpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~----v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d 457 (515) |.-+.++.. |+. .+.+..+..+.+..+..+ +..-+....|+.++.+..+.+. .|.. .+. T Consensus 334 g~~~~~~~r-----la~-~i~~~~~~~~~~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~a-------g~~~---~~~- 396 (409) T protein:vir:94 334 GAGLLNVAY-----LAA-CLRDDAPYLREQFRKTKPKWEPLFEADASMLSLIGDGAIKLNQA-------IPEF---INK- 396 (409) T ss_pred HHHHHHHHH-----HHH-HHhCCCCccccccccceEEeccCCCcchHHHHHHHHHHHHHHHh-------cccc---cch- Confidence 766665543 111 123444455554444332 2233455555555555544441 1111 111 Q ss_pred HHHHHHHHhcCCchhc Q lcl|NC_020414. 458 DYMDWVRGQISAELPF 473 (515) Q Consensus 458 ~~~~~~a~~~Gvp~~~ 473 (515) +.+.+.+|.+..= T Consensus 397 ---~~~~~~lG~~~~d 409 (409) T protein:vir:94 397 ---DTIRDLTGIEGGE 409 (409) T ss_pred ---hHHHHHcCCCCCC Confidence 1233344443211 No 97 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=97.00 E-value=0.00022 Score=40.67 Aligned_cols=419 Identities=11% Similarity=0.051 Sum_probs=177.7 Q ss_pred CCCc-----------------cccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc--ccCCC-CCCccccccccc Q lcl|NC_020414. 1 MQDT-----------------ILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY--LMNNK-GDNETSQNGWQG 60 (515) Q Consensus 1 ~~~~-----------------~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~--~~~~~-~~~~~~~~~~ds 60 (515) |+++ +. .++-+.+.+..+..+..+.+ +++.+.+|..-. ..... .......|+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~i~~~i~~~~~~~~~---~~~~l~~Yy~g~~~i~~~~~~~~~~~~ki~~n 75 (470) T protein:vir:99 1 MKDINYGRDKVTGNSSFIFPKGE--KLTSNELLGFIAYNETVLKP---RYRENMKLYLGKHKILTAPEKETGADNRIVVN 75 (470) T ss_pred CccccCCcccccCCceEEeCCCC--CcCHHHHHHHHHHHHHhhHH---HHHHHHHHhccccccccCcccccCCcceeecc Confidence 3322 22 23556677766655444433 445555555431 00010 111112245556 Q ss_pred cHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHh Q lcl|NC_020414. 61 VGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIV 140 (515) Q Consensus 61 t~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~ 140 (515) -+...++..++-|++- |+ +++..+.. ...+ .+...+..++|.....++.++..+ T Consensus 76 ~~~~Ivd~~~~~l~g~--p~-----~~~~~~d~------------~~~~-------~l~~~~~~n~~~~~~~~~~~~~~~ 129 (470) T protein:vir:99 76 SAKYVVDVYNGYFCGI--EP-----KLALLNDS------------SKID-------EIARWNRQENFFDTINEISKQCDI 129 (470) T ss_pred hHHHHHHHHhhhhccC--Ce-----eEeeCCch------------hHHH-------HHHHHHHhcCHhHHHHHHHHHHHh Confidence 6666676666655432 21 12222210 0011 233445678999999999999999 Q ss_pred hCceEE--EEeCCCcE--EEEEcceEEEeeCCCCC--eeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEE Q lcl|NC_020414. 141 AGNCLL--YKPSKGAM--SAVPMHHYVVNRDTNGD--LMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTH 214 (515) Q Consensus 141 ~G~~~l--~~d~~~~~--r~~pl~~y~v~~d~~G~--vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~ 214 (515) +|.+.+ |.+.++.+ ++++..+.++..|..+. +...+|.++.. .+.....|.. T Consensus 130 ~G~~~~~v~~d~dg~~~i~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~----------------------~~~~~~~~~~ 187 (470) T protein:vir:99 130 FGRSIASIYQGEDARPHLMYSSPNHAFIIYDDTVQRQPLAFVHYQIDN----------------------SNNWTDAYGV 187 (470) T ss_pred cCeeEEEEEeCCCCeEEEEEEccceeEEEEcCCCCcceEEEEEEEEEe----------------------cCCeeEEEEE Confidence 999765 44665544 45555555444454432 33334333321 0011112222 Q ss_pred EEEcCCCCeEEEEEeCCe--eec-c-cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020414. 215 AQYAGEGFWKINQSADDI--PVG-K-ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMA 290 (515) Q Consensus 215 v~~~~~~~~~~~~e~~~~--~i~-~-esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~ 290 (515) ++. ++..+.+...-.+. ... . .-+| ..+|++..+ ++.+|+|=.+..++-+..++.+.-......+... T Consensus 188 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~--g~vPvv~~~-----n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~ 259 (470) T protein:vir:99 188 IQY-ADKFYKFKGYDIEEDTNAAGYAINPY--GLVPAVEFF-----ENEERQGIFDSIKTLINALDKVISQKANQVEYFD 259 (470) T ss_pred EEe-cCeEEEEEecccccccccccccccCC--CccceEeec-----CCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhc Confidence 222 22222211111111 111 1 1223 347877654 3568999999999999999988888888888888 Q ss_pred cCceeecCccccCh---hhccCCC-Ccce-ecC----CcccccccccCCccchHHHHHHHHHHHHHHHHHHH-HHhhccC Q lcl|NC_020414. 291 DIKYLIRPGSQTDV---DHFVNSG-TGEV-ITG----VEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-METMTRR 360 (515) Q Consensus 291 ~p~~l~~~~g~~~~---~~~~~~~-~g~~-~~g----~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl-~~~l~~~ 360 (515) .|.+.+.-.+.-.. +.+.... .+.+ +++ ...++..+. ...+.......++.+.+.|-..-. .+..... T Consensus 260 ~~~~~i~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~ 337 (470) T protein:vir:99 260 NAYMYMIGFKLPEDDEGNPKFDFKNNRVLYVSQLDPDTNPQIGFIA--KPDADQMQENLIQHLTDFIFMMAMVPNIQDKN 337 (470) T ss_pred CceeeeecCCcccccccchhhhhhhcceeeecCCCCCCCCcceEEe--ecCChHHHHHHHHHHHHHHHHHhCCccccccc Confidence 88765532111000 0011111 1111 111 112233332 223455556666666666533211 0100011 Q ss_pred CCCCCCHHHHHHH-------HHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCC-CCChhhcccee--eeehHHHHHHHHH Q lcl|NC_020414. 361 DAERVTAVEIQRD-------ALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGD-SFTSELVDPVI--VTGIEALGRMAEL 430 (515) Q Consensus 361 ~~~~~TAtEi~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~-~~p~~~~~~~~--v~~l~~l~ra~~~ 430 (515) .+...|+..+..+ ++++...++..+.++.. ++..++..... ......+.+.+ ..+.+.+..++.+ T Consensus 338 ~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~-----li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~ 412 (470) T protein:vir:99 338 FAGNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYR-----IVLATLFNNKQDQELWSELDFKFTRNLPEDMASAIDNA 412 (470) T ss_pred cccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHhccCCcccccccceEEeCCCCCcCHHHHHHHH Confidence 1233577766543 34444444444433222 11111211111 11112233333 2233444444433 Q ss_pred HHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHH-Hhhhhccchhh Q lcl|NC_020414. 431 DKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNE-GVAKAVPGVIQ 509 (515) Q Consensus 431 ~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~-~~~~a~~~~~~ 509 (515) .++. |. ++ ...++..+ -++ -.++|++.+.++++.+.+..+... ....+..++.+ T Consensus 413 ~kl~------gi---is--------~et~l~~l---~~v-----d~~~E~eri~~E~~~~~~~~~~~~~~~d~~~~d~~~ 467 (470) T protein:vir:99 413 KNAE------GI---VS--------KKTQLGMI---PDI-----EPDAEMKQIAKEKADAIKQTQQLSMPIDILKRDNNA 467 (470) T ss_pred HHHh------cc---CC--------HHHHHHhC---CCC-----CHHHHHHHHHHHHHHHHHHHHhhcCCCCcCCCCCCc Confidence 3321 11 11 12222222 122 235677777665543332221111 11111111111 Q ss_pred hhh Q lcl|NC_020414. 510 QEM 512 (515) Q Consensus 510 ~~~ 512 (515) ++= T Consensus 468 ee~ 470 (470) T protein:vir:99 468 EEE 470 (470) T ss_pred cCC Confidence 111 No 98 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=96.96 E-value=0.00024 Score=40.47 Aligned_cols=377 Identities=10% Similarity=0.020 Sum_probs=171.8 Q ss_pred ccHHHHHHHHHHHHHhhhhHHHHHHHHHHhh--cccccCCCCCC-ccccccccccHHHHHHHHHHHHHHhhcCCCCCcee Q lcl|NC_020414. 10 GQRSKIPKLWEKFSKKRSPYLDRAKHFAKLT--LPYLMNNKGDN-ETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFR 86 (515) Q Consensus 10 ~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~--~P~~~~~~~~~-~~~~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFr 86 (515) .+...|....+++..++ +......++|+-- +|.+...-... ...-+..-+-+..++++||..|. ..+ |+ T Consensus 1 ~~~~~i~~L~~~~~~~~-~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~----~~G---f~ 72 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHK-RRAEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLV----FRE---FE 72 (409) T ss_pred CCHHHHHHHHHHHHHHh-HHHHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhcc----ccc---cc Confidence 46677777777775444 3333333344321 11111100000 00011233455566666655442 112 21 Q ss_pred cCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeC--CC--cEEEEEcce- Q lcl|NC_020414. 87 VDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPS--KG--AMSAVPMHH- 161 (515) Q Consensus 87 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~--~~--~~r~~pl~~- 161 (515) ..| .++ ++...+++|.....++.++..++|.+.+++-+ +. .+++++..+ T Consensus 73 --~~d-------------~~l-----------~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~~i~~~sP~~~ 126 (409) T protein:vir:16 73 --NDD-------------FTV-----------NEIFEENNPDIFFDSTVLSALIASCSFTYISKGENDAVRLQVIEATNA 126 (409) T ss_pred --Ccc-------------hHH-----------HHHHHhcChhHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccce Confidence 111 112 23456799999999999999999999887632 22 366665544 Q ss_pred EEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCcc Q lcl|NC_020414. 162 YVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRIK 241 (515) Q Consensus 162 y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy~ 241 (515) +++--+..+++...++...- + .........+| .++..+. +...++......-++ T Consensus 127 ~~i~D~~~~~~~~a~~~~~~-----------d--------~~~~~~~~~~~-----~~~~~~~-~~~~~~~~~~~~~~~- 180 (409) T protein:vir:16 127 TGIIDPITGLLTEGYAVLER-----------D--------ENNNVVLEAHF-----LPDRTDY-YYRDSRNNISIANPT- 180 (409) T ss_pred EEEeecccccceeeeEEEEe-----------c--------CCCceEEEEEE-----ecCcEEE-EEecCccccceecCC- Confidence 54443334555444432110 0 00000111111 1111111 111122111223345 Q ss_pred cccCcEEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHhccCceee---cCccccChhhccCCCCcce-- Q lcl|NC_020414. 242 AEKLPFIPLTWKRSYGEDWGRPLV-EDYSGDLFVIQFLSEAVARGAALMADIKYLI---RPGSQTDVDHFVNSGTGEV-- 315 (515) Q Consensus 242 ~~~~P~~~~Rw~~~~g~~YGrgp~-~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~---~~~g~~~~~~~~~~~~g~~-- 315 (515) ..||++.+..+...++.||+|=. +..++-+..++...-..+..++..+.|-..+ .+++.. .+.+ ....|.+ T Consensus 181 -g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~-~~~~-~~~~~~i~~ 257 (409) T protein:vir:16 181 -GNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEP-METW-KATVSSMLQ 257 (409) T ss_pred -CCcceEEecccccccccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCCc-cchh-hhhhhHhhc Confidence 35999999999898999999844 4577778889988888888889888885332 122211 1111 1122333 Q ss_pred ecCCccc--ccccccCCccchHHHHHHHHHHHHHHHHHHHHH-----hhccCCCCCCCHHHH-------HHHHHHHHHHh Q lcl|NC_020414. 316 ITGVEED--IHIVQLGKYADLTPISAVLEVYTRRIGVIFMME-----TMTRRDAERVTAVEI-------QRDALEIEQNM 381 (515) Q Consensus 316 ~~g~~~~--v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~-----~l~~~~~~~~TAtEi-------~~r~~E~~~~L 381 (515) +|...++ +..-++ ..++++.. ++.++.-|+...+.. .+.....-.-+|.-| ..++++|.+.+ T Consensus 258 ~~~d~~g~~~~v~q~-~~~~l~~~---~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~f 333 (409) T protein:vir:16 258 FTKDEDGDKPTLGQF-TQPSMSPF---TEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSL 333 (409) T ss_pred cCCCCCCCCceEEec-CCCChhHH---HHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222211 222233 23455543 333443333322211 111111111233333 33567777888 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee----eeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHH Q lcl|NC_020414. 382 GGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI----VTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWG 457 (515) Q Consensus 382 Gpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~----v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d 457 (515) |..+.++.. |+. .+.+..+..+.+..+... +..-+....|+.++.+..+.+.. |-. .+. T Consensus 334 g~~l~~~~r-----la~-~~~~~~~~~~~~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~-------~~~---~~~- 396 (409) T protein:vir:16 334 GAGLLNVAY-----LAA-CLRDDVPYLREQFSKTKPKWEPLFEADASMLSLIGDGAIKLNQAI-------PEF---INK- 396 (409) T ss_pred HHHHHHHHH-----HHH-HHhcCCCccchhhccceEEecCCCCcchhhHHHHHHHHHHHHhhc-------ccc---cch- Confidence 877776554 111 123444555655544332 12333334455555555444311 111 011 Q ss_pred HHHHHHHHhcCCchhc Q lcl|NC_020414. 458 DYMDWVRGQISAELPF 473 (515) Q Consensus 458 ~~~~~~a~~~Gvp~~~ 473 (515) +.+.+.+|....= T Consensus 397 ---~v~~~~~g~~~~d 409 (409) T protein:vir:16 397 ---DTIRDLTGIKGAE 409 (409) T ss_pred ---hHHHHhccCCCCC Confidence 1222333443211 No 99 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=96.88 E-value=0.00028 Score=40.05 Aligned_cols=415 Identities=9% Similarity=0.066 Sum_probs=189.1 Q ss_pred ccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccc--cCC-------CCC------CccccccccccHHHHHHHHHHHHH Q lcl|NC_020414. 10 GQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYL--MNN-------KGD------NETSQNGWQGVGAQATNHLANKLA 74 (515) Q Consensus 10 ~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~--~~~-------~~~------~~~~~~~~dst~~~a~~~Laa~l~ 74 (515) ++.+++.+.-+.+...++....+++.+.+|..-.- ... .+. +....|+-.+-+...++..++-|+ T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 80 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVA 80 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhhee Confidence 78888888888888888888888999998865520 000 000 011123445555555555554443 Q ss_pred HhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC Q lcl|NC_020414. 75 QVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSKG 152 (515) Q Consensus 75 s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~~ 152 (515) + -|+ .++..+... ...+++| +. .||...+.++.++...+|.+.+ |.|.+. T Consensus 81 G--~p~-----~~~~~d~~~---------~~~l~~~-----------~~-~~~~~~~~~l~~~~~~~G~a~~~~y~d~~~ 132 (470) T protein:vir:10 81 S--VFP-----DIDVGKDAD---------NKKIIDV-----------LG-DDRALTLNGLLVDSSNAGRAWLHYWIDEDG 132 (470) T ss_pred c--cce-----eeecCchHH---------HHHHHHH-----------Hh-hhHHHHHHHHHHHHhhcCeeEEEEEecCCC Confidence 3 121 233333211 1123333 22 4677888888899999998764 667766 Q ss_pred cEEE--EEcce-EEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEE---EE--EcCCCCe Q lcl|NC_020414. 153 AMSA--VPMHH-YVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTH---AQ--YAGEGFW 223 (515) Q Consensus 153 ~~r~--~pl~~-y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~---v~--~~~~~~~ 223 (515) .+++ ++..+ |.+--|. .+++..++|.+...-.+ + ......+++|+. .+ ....+.. T Consensus 133 ~~~~~~~~p~~~~~v~d~~~~~~~~a~ir~y~~~~~~-----~-----------~~~~~~~e~yt~~~~~~~~~~~~~~~ 196 (470) T protein:vir:10 133 NFRYGIIQPDQITPIYATTLDNKLLGILRSYKQLDPD-----S-----------GKYFTVHEYWTDKEAQFFRTNATDST 196 (470) T ss_pred ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeecC-----C-----------ceEEEEEEEEcCCcEEEEEeecCcce Confidence 5554 44444 4443333 46776666665442110 0 000011222220 00 0000000 Q ss_pred ---------E--EEEEeCCeeec-ccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_020414. 224 ---------K--INQSADDIPVG-KENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMAD 291 (515) Q Consensus 224 ---------~--~~~e~~~~~i~-~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~ 291 (515) . .....++..+. ..-+| ..+|++.++ ++.+|.|=.+...+-+-.++.+.-......+...+ T Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~ 269 (470) T protein:vir:10 197 VIEPYNIITSYDLSAGYETGQSNTLKHNF--GRVPFIEFS-----KNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQT 269 (470) T ss_pred eccccccccccccccccccccccccccCC--CeeeEEEee-----cCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 0 00000111111 11223 236666554 24689999999999999999999999999999999 Q ss_pred CceeecCccccChhhc-cCCCC-cce-ecC----CcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCC Q lcl|NC_020414. 292 IKYLIRPGSQTDVDHF-VNSGT-GEV-ITG----VEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAER 364 (515) Q Consensus 292 p~~l~~~~g~~~~~~~-~~~~~-g~~-~~g----~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~ 364 (515) |.+.+.-.+..+...+ ..... +.+ ++. ...++..+ ....+.......++.+++.|-+.-..-.+...+... T Consensus 270 ~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l--t~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~gn 347 (470) T protein:vir:10 270 VILVLTNYGGADLHQFMNDLRKYKSIKINNTGNGDNSGVDKL--QIDIPVEARDDALKITRKNIFLFGQGIDPANFESSN 347 (470) T ss_pred cceeeecCCccccchhhhhhhhcCeEeccCCCCCcCceeEEE--eecCChHHHHHHHHHHHHHHHHHhCCCCCCcccccc Confidence 9877653222221111 11111 222 221 11223333 334566777888888888775432111111112233 Q ss_pred CCHHHHHHH-------HHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee--eeehHHHHHHHHHHHHHH Q lcl|NC_020414. 365 VTAVEIQRD-------ALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI--VTGIEALGRMAELDKLAN 435 (515) Q Consensus 365 ~TAtEi~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~--v~~l~~l~ra~~~~~l~~ 435 (515) .|+..+..+ +.+++..++..+.++..= |-..+... ..+...+.+.+ ..+.+.+..++-+.. T Consensus 348 ~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~-----i~~~l~~~--~~d~~~i~i~f~~~~p~d~~e~~~~~~~--- 417 (470) T protein:vir:10 348 ASGVAIKMLYSHLELKAAKTQTYFEHAINELVRA-----IMRYLNFS--DADKRHISQHWTRTKVEDSLTKAQIVST--- 417 (470) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHhccc--CcccceeeEEeccCCCCCHHHHHHHHHH--- Confidence 566655332 344444444444333221 11111111 11222233333 223333333322111 Q ss_pred HHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccC-CHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhcc Q lcl|NC_020414. 436 FAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLK-SEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKE 514 (515) Q Consensus 436 ~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~ir-s~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~ 514 (515) +++ .+.-..++..+ + ++. .++|++++.++++++++.. +......+++..++ | T Consensus 418 -------~~g-------~iS~et~l~~~----p----~v~D~~~E~eri~~E~~e~~~~~---~~~~~~~~~~~dde--~ 470 (470) T protein:vir:10 418 -------VAN-------YSSKEAVAKAN----P----IVDDWQQELKDLAKDKEENDPYS---NQADELNGKGVNDE--Q 470 (470) T ss_pred -------Hhc-------cCcHHHHHHhC----C----CCCCHHHHHHHHHHHHHHHHHhh---ccccccCCCCCCCC--C Confidence 111 22233333322 2 122 3566666665544332211 11111111111111 1 No 100 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=96.77 E-value=0.00035 Score=39.54 Aligned_cols=389 Identities=11% Similarity=0.050 Sum_probs=174.8 Q ss_pred ccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccC-CCCCC--cccc---ccccccHHHHHHHHHHHHHHhhcCCCCC Q lcl|NC_020414. 10 GQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMN-NKGDN--ETSQ---NGWQGVGAQATNHLANKLAQVLFPAQRS 83 (515) Q Consensus 10 ~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~-~~~~~--~~~~---~~~dst~~~a~~~Laa~l~s~ltpp~~~ 83 (515) -+...+...+.++..++. +.+.+.+|.....-. .-+.. ...+ +..-+-+..+++.||..+. .-+ T Consensus 1 m~~~~i~~L~~~~~~~~~----r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~----~~G-- 70 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKT----GVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRII----FRE-- 70 (422) T ss_pred CChHHHHHHHHHHHHHHH----HHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhccc----cce-- Confidence 355556666666654433 444555554332110 00110 0111 1122334455555554321 111 Q ss_pred ceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCCc---EEEEE Q lcl|NC_020414. 84 FFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK--PSKGA---MSAVP 158 (515) Q Consensus 84 WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~--d~~~~---~r~~p 158 (515) | ...|. ++ +....++++....+++.++..++|.+.+++ +++.+ +++++ T Consensus 71 -f--~~~d~-------------~l-----------~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p~i~~~s 123 (422) T protein:vir:97 71 -F--TNDDF-------------NA-----------WEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAEDGLPKMQVIE 123 (422) T ss_pred -e--eCCch-------------hH-----------HHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCCCeeEEEEec Confidence 1 11111 11 234567999999999999999999998877 43222 55665 Q ss_pred cceEEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeeccc Q lcl|NC_020414. 159 MHHYVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKE 237 (515) Q Consensus 159 l~~y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~e 237 (515) -.+.++..|+ .+++...++++... .+..... ..++++ . -.++...++...-.+ T Consensus 124 p~~~~~i~D~~~~~~~~a~~~~~~~----------------------~~~~~~~-~~~~~~-~--~~~~~~~~~~~~~~~ 177 (422) T protein:vir:97 124 ASKATGILDPTTFLLTEGYAILESD----------------------SNGNPTL-EAYFTD-K--DIWYYPKKGKPYNIK 177 (422) T ss_pred hhhEEEEEeCCCCcceeeEEEEEec----------------------CCCcEEE-EEEEcC-c--eEEEEcCCCcccccc Confidence 5443333354 34333333322110 0011111 111111 1 111222222222223 Q ss_pred CCcccccCcEEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHhccCceee---cCccccChhhccCCCCc Q lcl|NC_020414. 238 NRIKAEKLPFIPLTWKRSYGEDWGRPLV-EDYSGDLFVIQFLSEAVARGAALMADIKYLI---RPGSQTDVDHFVNSGTG 313 (515) Q Consensus 238 sgy~~~~~P~~~~Rw~~~~g~~YGrgp~-~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~---~~~g~~~~~~~~~~~~g 313 (515) -++ +.+|++++..+...++.||+|-. +..++-+..++...-..+..++..+.|-..+ .+++... +.-....+ T Consensus 178 ~~~--g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~--~~~~~~~~ 253 (422) T protein:vir:97 178 NPT--GHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDPDAKPM--EKWRATVS 253 (422) T ss_pred CCC--CCcceEEecccCCCccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccCcccccC--chhhhhhh Confidence 344 34899999999999999999976 5678888889999888888889888885322 2332211 11111223 Q ss_pred ce--ecCCccc--ccccccCCccchHHHHHHHHHHHHHHHHHH-H-HHhhccCCCCCCCHHHH-------HHHHHHHHHH Q lcl|NC_020414. 314 EV--ITGVEED--IHIVQLGKYADLTPISAVLEVYTRRIGVIF-M-METMTRRDAERVTAVEI-------QRDALEIEQN 380 (515) Q Consensus 314 ~~--~~g~~~~--v~~~~~~~~~~l~~~~~~i~~~~~rI~~af-l-~~~l~~~~~~~~TAtEi-------~~r~~E~~~~ 380 (515) .+ ++...+. +..-++ ..++++.....++.+...|...= + ...+.....-.-+|.-| ..+.++|.+. T Consensus 254 ~i~~~~~de~~~~~~v~q~-~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~ 332 (422) T protein:vir:97 254 TLLEISKDEDGDKPTVGQF-TTASMAPFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRS 332 (422) T ss_pred hhhccCCCCCCCcceeeec-CCCChhHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHH Confidence 33 2222211 222222 23455543333333333332110 0 00111111111234333 3345777777 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee----eeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCH Q lcl|NC_020414. 381 MGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI----VTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRW 456 (515) Q Consensus 381 LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~----v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~ 456 (515) +|..+.++..= + ..+.+..+..+....+... +.+.+....|+.++.+..+.+. .|. ..+. T Consensus 333 fg~~l~~~~rl-----a-~~~~~~~~~~~~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a-------~~~---~~~~ 396 (422) T protein:vir:97 333 FSSGFLNVAYI-----A-VCLRDEFPYLRNQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQA-------IPG---FMDA 396 (422) T ss_pred HHHHHHHHHHH-----H-HHHhcCCcccchhhccceEEEccCCCCChHHHHHHHHHHHHHHhh-------ccc---cccH Confidence 78777765431 1 1233444444444333332 1245666666666665544432 111 1223 Q ss_pred HHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHH Q lcl|NC_020414. 457 GDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQA 490 (515) Q Consensus 457 d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~ 490 (515) +.+ .+.+|... ++++...+-++++.. T Consensus 397 ~~~----~~~lg~~~----~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 397 DVI----RDLTGVKG----ADKPIPAITEVTTDG 422 (422) T ss_pred HHH----HHHcCCCc----hhHHHHHHHhhhccC Confidence 322 23345422 133322222221111 No 101 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=96.42 E-value=0.00064 Score=38.09 Aligned_cols=417 Identities=9% Similarity=0.083 Sum_probs=184.8 Q ss_pred CCCc-----------cccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhh--cccc---cCCCCC---Ccccccccccc Q lcl|NC_020414. 1 MQDT-----------ILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLT--LPYL---MNNKGD---NETSQNGWQGV 61 (515) Q Consensus 1 ~~~~-----------~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~--~P~~---~~~~~~---~~~~~~~~dst 61 (515) |--| +.+.-.+.+.|.+..+..+. |-+...+++++|+-- +..+ ....+. +....++..+- T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~ 85 (474) T protein:vir:97 7 MPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRK-QLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNF 85 (474) T ss_pred ccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHH-HHHHHHHHHHHhccccchhcccchhccccccccccCcceeecch Confidence 1000 01111356677777766654 445666666666421 1111 111111 11122455677 Q ss_pred HHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh Q lcl|NC_020414. 62 GAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVA 141 (515) Q Consensus 62 ~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~ 141 (515) +...++..++-|++ -| +.++..++. ..+.|. . +..+||...+.++.++...+ T Consensus 86 ~k~Ivd~~~~~l~g--~p-----~~~~~~d~~-------------~~~~l~-------~-~~~n~~~~~~~e~~~~~~~~ 137 (474) T protein:vir:97 86 HQNLVDQKVSYVAS--KP-----VTYSCEDEN-------------VLKVIH-------D-VLDTRWDNKLIDILTATSNK 137 (474) T ss_pred HHHHHHHHHhhhhc--CC-----ceeccCcHH-------------HHHHHH-------H-HHhccHHHHHHHHHHHHhhc Confidence 77777777766654 12 223433322 122221 1 22478999999999999999 Q ss_pred CceEE--EEeCCCc--EEEEEcceEEEeeCC--CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEE Q lcl|NC_020414. 142 GNCLL--YKPSKGA--MSAVPMHHYVVNRDT--NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHA 215 (515) Q Consensus 142 G~~~l--~~d~~~~--~r~~pl~~y~v~~d~--~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v 215 (515) |.+.+ |.|.+.. +++++..+.++..|. .+++.-++|.++..- ...+++|+.- T Consensus 138 G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~----------------------~~~~~~yt~~ 195 (474) T protein:vir:97 138 GIDWLQVYINENGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNN----------------------EEKVEFWTDT 195 (474) T ss_pred CceEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecC----------------------eEEEEEEeCC Confidence 99865 4555544 445655554433343 578887777765210 0122333210 Q ss_pred -----EEcCCCC-eEEEEEeCCeee-cccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 216 -----QYAGEGF-WKINQSADDIPV-GKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAAL 288 (515) Q Consensus 216 -----~~~~~~~-~~~~~e~~~~~i-~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~ 288 (515) .....++ .......++... ....+| ..+|++.++. +.+|.|=.....+-+-.+|.+--......+. T Consensus 196 ~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~--g~vPvv~~~n-----n~~g~sd~e~v~~liDa~n~~~s~~~~~~~~ 268 (474) T protein:vir:97 196 TVTYYVLENGGLIPDYYYGANHVQSHFSNGNW--GRVPFIAFKN-----NPEEVSDIWMYKSIIDAIDKRLSDAQNMFDE 268 (474) T ss_pred eEEEEEEcCCccccccccCcCcccccccccCC--CccceEEecC-----CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0011111 011111111111 112234 3478877654 4689998899999999999988888888888 Q ss_pred hccCceeecCccccChhhc-cCCCCccee-cCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHH-HhhccCCCCCC Q lcl|NC_020414. 289 MADIKYLIRPGSQTDVDHF-VNSGTGEVI-TGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMM-ETMTRRDAERV 365 (515) Q Consensus 289 a~~p~~l~~~~g~~~~~~~-~~~~~g~~~-~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~-~~l~~~~~~~~ 365 (515) ...|.+.+..-..-+...+ .+...+.++ ....+++..+. ...+.......++.++..|...-.. +......+... T Consensus 269 ~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~ 346 (474) T protein:vir:97 269 SVELIYILKGYEGEDLEEFMRGLKYYKAINVDGDGGVETIQ--VEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAP 346 (474) T ss_pred hcCceeeeecCCcccchhhhhhhhccceeeccCCCceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCccccCcccccccc Confidence 8888766542211111111 111112222 22223344433 2345666677777777766443211 10001111234 Q ss_pred CHHHHH-------HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee--eeehHHHHHHHHHHHHHHH Q lcl|NC_020414. 366 TAVEIQ-------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI--VTGIEALGRMAELDKLANF 436 (515) Q Consensus 366 TAtEi~-------~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~--v~~l~~l~ra~~~~~l~~~ 436 (515) |+..+. .++.++...++..+.++-. .+..-.+-......+.+.+ ..+.+-+..++ . T Consensus 347 Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~--------li~~~~~~~~d~~~i~v~f~~~~p~~~~e~a~---~---- 411 (474) T protein:vir:97 347 SGIALKFLYGNLDLKANKLKNKATVAIQELIS--------FIIDFNNLKTDVKDIEISFNFNRMMNDAEQSQ---I---- 411 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHhCCCcccceeeEEeccCcccCHHHHHH---H---- Confidence 665442 3445555555555555332 1111111111112233332 11222222221 1 Q ss_pred HHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 437 AQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 437 ~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) +.... .+....++..+ -+++ -.++|++.+.++++++++.... . ...+..++...++ T Consensus 412 ---~~~~g--------~iS~et~l~~l---~~v~----D~~~E~eri~~E~~~~~~~~~~---~--~~~~~~~~~~~~~ 467 (474) T protein:vir:97 412 ---IAQSQ--------YLSRETLVKSS---PLVD----DYKAELERIEQEQMEYNKQLPN---L--DDGGADGAQQQEG 467 (474) T ss_pred ---HHHcC--------CCCHHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHhhccc---c--CCCCCCCcccCCC Confidence 11111 12233333322 1221 1246666666654433222111 0 1111111111222 No 102 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=96.42 E-value=0.00064 Score=38.09 Aligned_cols=417 Identities=9% Similarity=0.083 Sum_probs=184.8 Q ss_pred CCCc-----------cccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhh--cccc---cCCCCC---Ccccccccccc Q lcl|NC_020414. 1 MQDT-----------ILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLT--LPYL---MNNKGD---NETSQNGWQGV 61 (515) Q Consensus 1 ~~~~-----------~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~--~P~~---~~~~~~---~~~~~~~~dst 61 (515) |--| +.+.-.+.+.|.+..+..+. |-+...+++++|+-- +..+ ....+. +....++..+- T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~ 85 (474) T protein:vir:94 7 MPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRK-QLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNF 85 (474) T ss_pred ccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHH-HHHHHHHHHHHhccccchhcccchhccccccccccCcceeecch Confidence 1000 01111356677777766654 445666666666421 1111 111111 11122455677 Q ss_pred HHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh Q lcl|NC_020414. 62 GAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVA 141 (515) Q Consensus 62 ~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~ 141 (515) +...++..++-|++ -| +.++..++. ..+.|. . +..+||...+.++.++...+ T Consensus 86 ~k~Ivd~~~~~l~g--~p-----~~~~~~d~~-------------~~~~l~-------~-~~~n~~~~~~~e~~~~~~~~ 137 (474) T protein:vir:94 86 HQNLVDQKVSYVAS--KP-----VTYSCEDEN-------------VLKVIH-------D-VLDTRWDNKLIDILTATSNK 137 (474) T ss_pred HHHHHHHHHhhhhc--CC-----ceeccCcHH-------------HHHHHH-------H-HHhccHHHHHHHHHHHHhhc Confidence 77777777766654 12 223433322 122221 1 22478999999999999999 Q ss_pred CceEE--EEeCCCc--EEEEEcceEEEeeCC--CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEE Q lcl|NC_020414. 142 GNCLL--YKPSKGA--MSAVPMHHYVVNRDT--NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHA 215 (515) Q Consensus 142 G~~~l--~~d~~~~--~r~~pl~~y~v~~d~--~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v 215 (515) |.+.+ |.|.+.. +++++..+.++..|. .+++.-++|.++..- ...+++|+.- T Consensus 138 G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~----------------------~~~~~~yt~~ 195 (474) T protein:vir:94 138 GIDWLQVYINENGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNN----------------------EEKVEFWTDT 195 (474) T ss_pred CceEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecC----------------------eEEEEEEeCC Confidence 99865 4555544 445655554433343 578887777765210 0122333210 Q ss_pred -----EEcCCCC-eEEEEEeCCeee-cccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 216 -----QYAGEGF-WKINQSADDIPV-GKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAAL 288 (515) Q Consensus 216 -----~~~~~~~-~~~~~e~~~~~i-~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~ 288 (515) .....++ .......++... ....+| ..+|++.++. +.+|.|=.....+-+-.+|.+--......+. T Consensus 196 ~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~--g~vPvv~~~n-----n~~g~sd~e~v~~liDa~n~~~s~~~~~~~~ 268 (474) T protein:vir:94 196 TVTYYVLENGGLIPDYYYGANHVQSHFSNGNW--GRVPFIAFKN-----NPEEVSDIWMYKSIIDAIDKRLSDAQNMFDE 268 (474) T ss_pred eEEEEEEcCCccccccccCcCcccccccccCC--CccceEEecC-----CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0011111 011111111111 112234 3478877654 4689998899999999999988888888888 Q ss_pred hccCceeecCccccChhhc-cCCCCccee-cCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHH-HhhccCCCCCC Q lcl|NC_020414. 289 MADIKYLIRPGSQTDVDHF-VNSGTGEVI-TGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMM-ETMTRRDAERV 365 (515) Q Consensus 289 a~~p~~l~~~~g~~~~~~~-~~~~~g~~~-~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~-~~l~~~~~~~~ 365 (515) ...|.+.+..-..-+...+ .+...+.++ ....+++..+. ...+.......++.++..|...-.. +......+... T Consensus 269 ~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~ 346 (474) T protein:vir:94 269 SVELIYILKGYEGEDLEEFMRGLKYYKAINVDGDGGVETIQ--VEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAP 346 (474) T ss_pred hcCceeeeecCCcccchhhhhhhhccceeeccCCCceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCccccCcccccccc Confidence 8888766542211111111 111112222 22223344433 2345666677777777766443211 10001111234 Q ss_pred CHHHHH-------HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee--eeehHHHHHHHHHHHHHHH Q lcl|NC_020414. 366 TAVEIQ-------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI--VTGIEALGRMAELDKLANF 436 (515) Q Consensus 366 TAtEi~-------~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~--v~~l~~l~ra~~~~~l~~~ 436 (515) |+..+. .++.++...++..+.++-. .+..-.+-......+.+.+ ..+.+-+..++ . T Consensus 347 Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~--------li~~~~~~~~d~~~i~v~f~~~~p~~~~e~a~---~---- 411 (474) T protein:vir:94 347 SGIALKFLYGNLDLKANKLKNKATVAIQELIS--------FIIDFNNLKTDVKDIEISFNFNRMMNDAEQSQ---I---- 411 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHhCCCcccceeeEEeccCcccCHHHHHH---H---- Confidence 665442 3445555555555555332 1111111111112233332 11222222221 1 Q ss_pred HHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 437 AQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 437 ~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) +.... .+....++..+ -+++ -.++|++.+.++++++++.... . ...+..++...++ T Consensus 412 ---~~~~g--------~iS~et~l~~l---~~v~----D~~~E~eri~~E~~~~~~~~~~---~--~~~~~~~~~~~~~ 467 (474) T protein:vir:94 412 ---IAQSQ--------YLSRETLVKSS---PLVD----DYKAELERIEQEQMEYNKQLPN---L--DDGGADGAQQQEG 467 (474) T ss_pred ---HHHcC--------CCCHHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHhhccc---c--CCCCCCCcccCCC Confidence 11111 12233333322 1221 1246666666654433222111 0 1111111111222 No 103 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=96.34 E-value=0.00072 Score=37.80 Aligned_cols=429 Identities=9% Similarity=0.036 Sum_probs=182.9 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccc---cCCC----CCCccccccccccHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYL---MNNK----GDNETSQNGWQGVGAQATNHLANKL 73 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~---~~~~----~~~~~~~~~~dst~~~a~~~Laa~l 73 (515) +.-.+..-.++-+.+.+..+..+.+|. ++|+++.+|....- .... .......|+-.+.+...++..++-| T Consensus 13 ~~~~~~~~~l~~~~i~~li~~~~~~~~---~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l 89 (506) T protein:vir:94 13 LIYQESLENLTPNKIMKFITHHFNYQR---PRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAKYIADFQTSYS 89 (506) T ss_pred eecccchhcCCHHHHHHHHHHHHHHHH---HHHHHHHHHhcCCCccccccccccccccCCcceeecchHHHHHHHhhhhh Confidence 222333445677778777766655554 45667777765421 1110 1111223455667777777777666 Q ss_pred HHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCC Q lcl|NC_020414. 74 AQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSK 151 (515) Q Consensus 74 ~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~ 151 (515) ++- | +.++..++. .++ .+...+..++|.....++.++...+|.+.+ |.+++ T Consensus 90 ~G~--p-----~~~~~~d~~-------------~~~-------~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded 142 (506) T protein:vir:94 90 VGN--P-----INVKLPDDG-------------SNS-------GFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGED 142 (506) T ss_pred ccc--C-----ceeecCcch-------------HHH-------HHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCC Confidence 542 2 122332221 111 233445678999999999999999999875 45555 Q ss_pred CcEE--EEEc-ceEEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEE Q lcl|NC_020414. 152 GAMS--AVPM-HHYVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQ 227 (515) Q Consensus 152 ~~~r--~~pl-~~y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~ 227 (515) +.++ +++- .-|++.-|. .+.+.-.+|.+...-. ..+....+..|..+ +.+...+.+.. T Consensus 143 ~~~~i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~-----------------~~~~~~~~~~~~~~-yt~~~~~~~~~ 204 (506) T protein:vir:94 143 NEEHLAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELV-----------------DDNQVSTINYVPET-WTADTYTLYNP 204 (506) T ss_pred CeeEEEEEcccceEEEecCCCCCceEEEEEEEeeeec-----------------cCCceeEEEEEEEE-EeCceEEEecc Confidence 5444 4444 445554443 3566555555543211 00111111122222 22222221111 Q ss_pred EeCC-eeeccc-CCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccCh- Q lcl|NC_020414. 228 SADD-IPVGKE-NRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDV- 304 (515) Q Consensus 228 e~~~-~~i~~e-sgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~- 304 (515) ...+ .....+ -+| ..+|++..+= +..|.|-.+...+-+-.++.+.-..+...+...+|.+.+--...... T Consensus 205 ~~~~~~~~~~~~~~~--g~vPvv~~~n-----~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~ 277 (506) T protein:vir:94 205 TPIMGKMQVDTTKPI--TTFPVVEFKN-----SNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFE 277 (506) T ss_pred ccCccceeccccccC--CccceEEecC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCcccccc Confidence 1111 122222 233 3578876532 34577877888888888887777777766666665544321100000 Q ss_pred ------------------------h---hcc-----CCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 305 ------------------------D---HFV-----NSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIF 352 (515) Q Consensus 305 ------------------------~---~~~-----~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~af 352 (515) . .+. ....+....|......+-.+....+.+.....++.+...|-..- T Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s 357 (506) T protein:vir:94 278 GSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFS 357 (506) T ss_pred chhccccccccccccccccccchhHHHhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHh Confidence 0 000 00001111111111222223333456667777777777664321 Q ss_pred H-HHhhccCCCCCCCHHHHHH-------HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCCh--hhcccee--eee Q lcl|NC_020414. 353 M-METMTRRDAERVTAVEIQR-------DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTS--ELVDPVI--VTG 420 (515) Q Consensus 353 l-~~~l~~~~~~~~TAtEi~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~--~~~~~~~--v~~ 420 (515) . .+......+...|+..+.. +..++...++..+.++-. +|..++......... ..+++.+ ..+ T Consensus 358 ~~p~~~~~~~~~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~-----li~~~~~~~~~~~~~d~~~i~i~f~~~~p 432 (506) T protein:vir:94 358 HTPDLTDENFASNSSGVAMQYKVLGTVELASTKRRMFERGLYARYQ-----IISDIENSIHGDWTFDPQELTFTFRDNLP 432 (506) T ss_pred CccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHhcCCccccccccceEEeCCCCC Confidence 1 1100001123456665543 334555555544443322 111122211111111 1233333 222 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020414. 421 IEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGV 500 (515) Q Consensus 421 l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~ 500 (515) -+.+..++-+.++ ++ .+....++..+ -+++ -.++|++.+.++++++.. ...+.. T Consensus 433 ~d~~e~a~~~~kl----------~g-------~iS~et~~~~l---p~v~----d~~~E~~ri~~E~~~~~~--~~~~~~ 486 (506) T protein:vir:94 433 ADNISQIKALVQA----------GA-------TLPQKYLYQQL---PGVT----NPQDIVDMMKEQSANGDY--SFDQNG 486 (506) T ss_pred cCHHHHHHHHHHH----------hc-------cCChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHhh--cchhhc Confidence 3333333322221 11 12223333322 1222 124666666655433221 111111 Q ss_pred hhhccchhhhhhccC Q lcl|NC_020414. 501 AKAVPGVIQQEMKEG 515 (515) Q Consensus 501 ~~a~~~~~~~~~~~~ 515 (515) .....+...+...+. T Consensus 487 ~~~~~~~~~~~~~~~ 501 (506) T protein:vir:94 487 VISNDGQTNTTATQT 501 (506) T ss_pred CCCcccCcccccccc Confidence 111111111111111 No 104 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=96.29 E-value=0.00078 Score=37.63 Aligned_cols=426 Identities=10% Similarity=0.089 Sum_probs=165.4 Q ss_pred CCCccccc---cccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCcccccccc--ccHHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEY---GGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQ--GVGAQATNHLANKLAQ 75 (515) Q Consensus 1 ~~~~~~~~---~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~d--st~~~a~~~Laa~l~s 75 (515) +....... ...-.--..+|.+.+.-|.-|+..|.++..+. .++...++... +.+...++.+|+-+.+ T Consensus 18 ~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~--------~~~~~~~~~~~slnl~~~i~~~~A~lv~~ 89 (522) T protein:vir:47 18 MQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKN--------TDGDIKSRPMNHLPIARTASKKIASLVYN 89 (522) T ss_pred hhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccc--------cCcchhcccceecchHHHHHHHHhhhhcC Confidence 11110000 00000011223322222333333333322111 11111112223 4455666666655544 Q ss_pred hhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC- Q lcl|NC_020414. 76 VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSKG- 152 (515) Q Consensus 76 ~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~~- 152 (515) -.. .++++++ .++++| ...+..++|+..+.+++....+.|.+++ |.|.+. T Consensus 90 e~~-------~i~v~d~-------------~~~~~l-------~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~~~ 142 (522) T protein:vir:47 90 EQA-------TITTKNE-------------ILQKFL-------DDMLTNDRFNKNFERYLESCLALGGLAMRPYIDGDKV 142 (522) T ss_pred Ccc-------eeecCCh-------------HHHHHH-------HHHHhhcchHHHHHHHHHHhhccCCEEEEEEEcCCce Confidence 321 1222221 234444 3457779999999999999999998765 666432 Q ss_pred cEEEEEcceEE-EeeCCCCCeeEEE-EEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcC----------- Q lcl|NC_020414. 153 AMSAVPMHHYV-VNRDTNGDLMDVI-LLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAG----------- 219 (515) Q Consensus 153 ~~r~~pl~~y~-v~~d~~G~vd~i~-r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~----------- 219 (515) .+.+++-..++ +..|..|.+..++ .+...+-. ....+||.++... T Consensus 143 ~i~~v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~----------------------~~~~~yt~lE~he~~~~~~~~~~~ 200 (522) T protein:vir:47 143 RVAFIQAPVFFPLESNTQDVSSAAILTKTIKSEG----------------------RKNVYYTLVEFHEWVTADGQETGS 200 (522) T ss_pred EEEEEcCCceEEEEEcCCceEEEEEEEEEEeecc----------------------cceeEEEEEEEeeecccccccccc Confidence 25567777766 4677777665443 33221111 1112233222210 Q ss_pred ---CCCeE----EEEEeC----Ceee-c---------ccC-CcccccCcEEE----Eeeee-cCCCccccchHHHHHHHH Q lcl|NC_020414. 220 ---EGFWK----INQSAD----DIPV-G---------KEN-RIKAEKLPFIP----LTWKR-SYGEDWGRPLVEDYSGDL 272 (515) Q Consensus 220 ---~~~~~----~~~e~~----~~~i-~---------~es-gy~~~~~P~~~----~Rw~~-~~g~~YGrgp~~~~l~d~ 272 (515) .+.+. +|...+ |.++ + .+. .+..-.-|.++ +.++. ..++.||+|-...+.+.+ T Consensus 201 ~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~i 280 (522) T protein:vir:47 201 TNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTI 280 (522) T ss_pred cccCCceEEEEEEeecCCCcccCccccccccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHH Confidence 00011 111111 1110 1 111 11100124322 12333 337889999999999999 Q ss_pred HHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCC-----------CcceecCC-----cccccccccCCccchHH Q lcl|NC_020414. 273 FVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSG-----------TGEVITGV-----EEDIHIVQLGKYADLTP 336 (515) Q Consensus 273 k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~-----------~g~~~~g~-----~~~v~~~~~~~~~~l~~ 336 (515) +.||..--++..-.. ..+-...|+ ..++....-..++ ...+++.. .+.+..++.. -.... T Consensus 281 d~lD~~~s~~~~e~~-~g~~~i~v~-~~~l~~~~~~~~g~~~~~~~fd~~~~~f~~~~~~~~~~~~i~~~~~~--ir~e~ 356 (522) T protein:vir:47 281 DFINRSYDEFMWEVR-MGQRRVIVP-EHLTQRQYQRPDGTIDFRPRFDVEQNVYMQIGGSSMDAGGITDLTSP--IRAND 356 (522) T ss_pred HHHHHHHHHHHHHHH-hccceeecc-hHHhccCCCCCCcccccccccCcccceEeecCCCCCCCCcceeeccc--cChHH Confidence 999976655555433 333333332 2222221100000 01111111 1112221111 11122 Q ss_pred HHHHHHHHHHHHHHHHHH--HhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHH---H--hcCCCCC Q lcl|NC_020414. 337 ISAVLEVYTRRIGVIFMM--ETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGL---Q--EAGDSFT 409 (515) Q Consensus 337 ~~~~i~~~~~rI~~afl~--~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~---~--~~~~~~p 409 (515) ....++.+-+.|....=+ .++........|||||..+.+...+...-.-..+. .-|..|++-++ . +..-..+ T Consensus 357 ~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi~s~~~~~~~t~~~~~~~~~-~al~~lv~~i~~l~~~~~~~~~~~ 435 (522) T protein:vir:47 357 YILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEIVSENSDTYQMRSSIVALVE-QSIKELCVSMCELGKAVGVYSGEI 435 (522) T ss_pred HHHHHHHHHHHHHHHhCCCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhhccCCC Confidence 233344444444332211 12222233346999999999888887766433333 33444554432 1 1111111 Q ss_pred hhh--ccceeeeeh--HHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHH Q lcl|NC_020414. 410 SEL--VDPVIVTGI--EALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMA 485 (515) Q Consensus 410 ~~~--~~~~~v~~l--~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq 485 (515) ... +.+.+=.++ +.-+..+. ..+.++ ++ .+....++ ....|+ |++|++++.+ T Consensus 436 ~~~~~i~v~f~D~i~~D~~~~~~~------~~~~v~--aG-------~~s~e~~i---~~~~g~------~eeea~~el~ 491 (522) T protein:vir:47 436 PELDDISVNLDDGVFTDRHAELDY------WAKMVA--AG-------FSTKKRAI---GKTLNI------SGVEAEKELN 491 (522) T ss_pred CCcceeEEEcCCCCCCCHHHHHHH------HHHHHh--cC-------CCCHHHHH---HhcCCC------ChHHHHHHHH Confidence 122 222222222 11111111 111111 11 12222322 233443 4555544443 Q ss_pred HHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 486 QQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 486 ~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) +-++ ++.++ .....-..++..+.-++| T Consensus 492 ri~~-E~~~~--~~~~~~~~~~~~~~~~~~ 518 (522) T protein:vir:47 492 AINS-ELLPM--NDAELAIYGMHDQNEEKA 518 (522) T ss_pred HHHH-hhccC--CCCCCCCCCCCCcccccC Confidence 2211 11110 000001111222222333 No 105 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=96.28 E-value=0.00079 Score=37.58 Aligned_cols=434 Identities=9% Similarity=0.071 Sum_probs=172.7 Q ss_pred ccccccHH-------------HHHHHHH----HHHHhhhhHHHHHHHHHHhhcccccCCCCCCcccccccc--ccHHHHH Q lcl|NC_020414. 6 LEYGGQRS-------------KIPKLWE----KFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQ--GVGAQAT 66 (515) Q Consensus 6 ~~~~~~~~-------------~l~~r~~----~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~d--st~~~a~ 66 (515) |-.=-..+ .+.+-.. .+...-.....+|+.+|+=-.|-+....+++...++..- +.+..++ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~ 80 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSA 80 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHH Confidence 10000000 0000000 000111123344666654333333222222222222222 3444555 Q ss_pred HHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE Q lcl|NC_020414. 67 NHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL 146 (515) Q Consensus 67 ~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l 146 (515) +.+|+-|.+-+.. + ++++....+. ........+++|++ .+..++|+..+.+++.+..+.|.+++ T Consensus 81 ~~~A~Ll~~e~~~-----i--~v~d~~~~~~--~~~~~~~~~e~l~~-------i~~~n~f~~~~~~~~e~a~a~G~~a~ 144 (517) T protein:vir:98 81 DVLSGLVFNEQCE-----V--YVSDAKDEEK--KDNSFKTAHEFIQH-------VFQHNKFIKNLSDYLEPTFALGGLTV 144 (517) T ss_pred HHhhhhhcCCcce-----E--Eecccccccc--cccchhHHHHHHHH-------HHHhccHHHHHHHHHHHHhhhCCEEE Confidence 5555554333222 2 2222211100 00112224555554 47788999999999999999999875 Q ss_pred --EEeCCCc-EEEEEcceEEE-eeCCCCCeeEEE-EEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcC-- Q lcl|NC_020414. 147 --YKPSKGA-MSAVPMHHYVV-NRDTNGDLMDVI-LLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAG-- 219 (515) Q Consensus 147 --~~d~~~~-~r~~pl~~y~v-~~d~~G~vd~i~-r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~-- 219 (515) |.|.... +.+++-..|+- .-|..|.+..+| .++..+.+ ..-.+|+.+++.. T Consensus 145 k~~~d~~~~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~----------------------~~~~~Yt~lE~H~~~ 202 (517) T protein:vir:98 145 RPYVDNGEIEFSWALANAFYPLRSNSNGISEGVMKSVTTKVIG----------------------NKTVYYTLLEFHEWE 202 (517) T ss_pred EEEEeCCeeEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeec----------------------CCceEEEEEEEEecC Confidence 6775443 66677666553 667777555443 33332211 0011333333211 Q ss_pred -----CCCeEE----EEEeC----Ceee--------cccCCc-ccccCcEEEE----eee-ecCCCccccchHHHHHHHH Q lcl|NC_020414. 220 -----EGFWKI----NQSAD----DIPV--------GKENRI-KAEKLPFIPL----TWK-RSYGEDWGRPLVEDYSGDL 272 (515) Q Consensus 220 -----~~~~~~----~~e~~----~~~i--------~~esgy-~~~~~P~~~~----Rw~-~~~g~~YGrgp~~~~l~d~ 272 (515) ++.+.+ |...+ |.++ +.+..| ..-..|.++. -.+ ...++.||+|-...+++.+ T Consensus 203 ~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~ 282 (517) T protein:vir:98 203 KTEEGESLYVITNELYKSDNEGEIGKRIPLEELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTL 282 (517) T ss_pred ceeccCCcEEEEEEEEecCCCccccccccccccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHH Confidence 122221 21111 1111 111111 1001242221 222 2336789999999999999 Q ss_pred HHHHHHHHHHHHHHHHhccCceeecCccccChh-hccCCCCcc--------e--ecCCcccccccccCCccch--HHHHH Q lcl|NC_020414. 273 FVIQFLSEAVARGAALMADIKYLIRPGSQTDVD-HFVNSGTGE--------V--ITGVEEDIHIVQLGKYADL--TPISA 339 (515) Q Consensus 273 k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~-~~~~~~~g~--------~--~~g~~~~v~~~~~~~~~~l--~~~~~ 339 (515) +.||..--+...-... .+..+.++++ ++.+. +......+. + +.+..+.-....+ .+++ ..... T Consensus 283 d~lD~~~s~~~~e~~~-g~~~i~vp~~-~l~~~~~~~g~~~~~~~d~~~~~y~~~~~~~~~~~i~~~--~~~iR~e~~~~ 358 (517) T protein:vir:98 283 KKINDTYDQFWWEIKM-GQRTVFVSDV-MLRTVPDESGMPPPQVFDPDVNVYKSIRMGTDEEFVKDV--THDIRTEQYKE 358 (517) T ss_pred HHHHHHHHHHHHHHHh-CCcceecChh-hhccccCCCCcccCCCCCcccceeeeccCCCCCCceeee--ccccchHHHHH Confidence 9999777666664444 5666555433 33221 100001110 0 1111111100000 1122 12333 Q ss_pred HHHHHHHHHHHHH-H-HHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH-----hcCC-CC-Ch Q lcl|NC_020414. 340 VLEVYTRRIGVIF-M-METMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ-----EAGD-SF-TS 410 (515) Q Consensus 340 ~i~~~~~rI~~af-l-~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~-----~~~~-~~-p~ 410 (515) .++.+-+.|.... + ..++......-.|||||..+.+...+...- +.+.-...|.-|++-++. +... .+ +. T Consensus 359 ~~~~~L~~i~~~~Gls~~t~~~~~~~~kTATEi~s~~~~~~~t~~~-~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~ 437 (517) T protein:vir:98 359 AINQALRTLEMELKLSVGTFSFDGRSMKTATEIVSENDLTYRTRND-HVYEVEQFIKGLVISVLELAKTYKLFGGEIPSA 437 (517) T ss_pred HHHHHHHHHHHHhCCCcccccccccccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCC Confidence 3444444343222 1 112222223336999999998888877665 333333444444444321 1111 11 22 Q ss_pred hhccceeeeehH--HHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHH Q lcl|NC_020414. 411 ELVDPVIVTGIE--ALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQA 488 (515) Q Consensus 411 ~~~~~~~v~~l~--~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~ 488 (515) ..+.+.+=.++. .-+.+.. .. +.++ ++ .+....++ ...+| -|++|.+++.++-+ T Consensus 438 ~~v~v~f~D~i~~D~~~~~~~---~~---~~v~--aG-------~ms~~~~i---~~~~g------~~eeeA~~e~~~i~ 493 (517) T protein:vir:98 438 EHIGVDFDDGVFQDRSALLRF---YG---QAKT--FG-------FIPTVEAI---QRIFK------VPKKTAEQWLEEIR 493 (517) T ss_pred cceEEEcCCCCCCCHHHHHHH---HH---HHHh--cC-------CCCHHHHH---HHhCC------CChHHHHHHHHHHH Confidence 223333222221 1121111 11 1111 11 12333333 33345 25666554443221 Q ss_pred HHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 489 QAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 489 ~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) ..+ +.+.+....+...++ T Consensus 494 ~E~---------~~~~~~~~~~~~~~~ 511 (517) T protein:vir:98 494 KDQ---------IELDPVTISQRAQKR 511 (517) T ss_pred Hhc---------cccCCCCccccccCC Confidence 111 111222222222222 No 106 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=96.17 E-value=0.00092 Score=37.23 Aligned_cols=412 Identities=10% Similarity=0.061 Sum_probs=177.9 Q ss_pred ccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc-----c-cC--CCCC-----------CccccccccccHHHHHHHHH Q lcl|NC_020414. 10 GQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY-----L-MN--NKGD-----------NETSQNGWQGVGAQATNHLA 70 (515) Q Consensus 10 ~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~-~~--~~~~-----------~~~~~~~~dst~~~a~~~La 70 (515) .+.+.+.+....+..+.+....++.++.+|..-. + .. ..+. +....|+..+-+-..++..+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 5777777777777766555566777777776431 0 00 0000 00111344455555555555 Q ss_pred HHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EE Q lcl|NC_020414. 71 NKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YK 148 (515) Q Consensus 71 a~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~ 148 (515) +-|.+ -|+. ++..+.. ..+.|.. .+ .++|.....++.++...+|.+.+ |. T Consensus 81 ~yl~G--~p~~-----~~~~~~~-------------~~~~l~~-------~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~ 132 (471) T protein:vir:10 81 AYALT--YPPT-----FDVDDKK-------------VNDMIVD-------VL-GDDYERISKQLCVNAGNAGIAWLHVWK 132 (471) T ss_pred hhhcc--cCce-----eccCChH-------------HHHHHHH-------HH-hcCHHHHHHHHHHHHhhCCeEEEEEEe Confidence 44433 2222 2333221 2222221 12 37899999999999999999864 55 Q ss_pred eCC-Cc--EEEEEcce-EEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEE------EEEE Q lcl|NC_020414. 149 PSK-GA--MSAVPMHH-YVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYT------HAQY 217 (515) Q Consensus 149 d~~-~~--~r~~pl~~-y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~------~v~~ 217 (515) +.+ +. +++++-.+ |.+--+. .+++...+|.|...... .......+++|+ ...- T Consensus 133 d~~~g~~~~~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~----------------~~~~~~~~~vy~~~~~~~y~~~ 196 (471) T protein:vir:10 133 DASDNSFRYACVDSKEVIPIYSKSLDKKSIGVLRVYSSIDET----------------DGKNYTVYEYWNDKECSFYRHE 196 (471) T ss_pred eCCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEeeccC----------------CCceeEEEEEEeCCcEEEEEec Confidence 643 33 44454444 4443333 45666666666432210 001111122222 1111 Q ss_pred cCCCCeEEE--------EEeCCee--ecc-cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 218 AGEGFWKIN--------QSADDIP--VGK-ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGA 286 (515) Q Consensus 218 ~~~~~~~~~--------~e~~~~~--i~~-esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~ 286 (515) .......+. ...++.. ... .-+| ..+|++.++. +.+|.|=.+...+-+-.++.+.-...... T Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--g~iPvv~~~n-----~~~~~sd~e~v~~liDa~d~~~S~~~~~~ 269 (471) T protein:vir:10 197 KEKPLEELETFQAISLIDTMNGDRSSDNSFKHDF--GLVPFIPFKN-----NEIETNDLKPIKDLVDVYDKVFSGFVNDT 269 (471) T ss_pred CCcccccccccccccccccccccccccccccCCC--CceeEEEecc-----CCCCCCchHHHHHHHHHHHHHHHHHHHHH Confidence 111011100 0011111 111 2234 3478776654 45788888899999999998888888888 Q ss_pred HHhccCceeecCc-cccChhhccCCC-Ccce-ecCC----cccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_020414. 287 ALMADIKYLIRPG-SQTDVDHFVNSG-TGEV-ITGV----EEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTR 359 (515) Q Consensus 287 ~~a~~p~~l~~~~-g~~~~~~~~~~~-~g~~-~~g~----~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~ 359 (515) +...+|.+.+.-. +....+...... .+.+ .++. ..++..+ ....+.+.....++.+++.|-..-..-.+.. T Consensus 270 ~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~ 347 (471) T protein:vir:10 270 DDVQEVIFVLTNYGGQDKQEFLEDLKRYKMIKMDNDGMGDQSGVTTI--AIDIPTEARNLILERTKKQIFISGQGVNPET 347 (471) T ss_pred HHhhCceeeeecCCccccchhHHHhhcCCeEEecCCCCccCccceEE--eecCChHHHHHHHHHHHHHHHHHhCCcCCCc Confidence 8888886554321 111111111111 1222 1111 1123333 2334667777778877777744321100111 Q ss_pred CCCCCCCHHHHHHH-------HHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceee--eehHHHHHHHHH Q lcl|NC_020414. 360 RDAERVTAVEIQRD-------ALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIV--TGIEALGRMAEL 430 (515) Q Consensus 360 ~~~~~~TAtEi~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v--~~l~~l~ra~~~ 430 (515) ......|+.-+..+ +.++...++..+.++..=++ ..+ +.. ....+.+.+- .+.+.+..++-+ T Consensus 348 ~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~-----~~~-~~~---d~~~i~i~f~~~~p~n~~e~~~~~ 418 (471) T protein:vir:10 348 DKLGNSSGVALKFLYSLLELKAGNMETQFRSGYATLVKMIL-----KHL-GLS---DKLKIKQTWTRNSINNDTEMAQVV 418 (471) T ss_pred ccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHh-ccC---CCceeEEEeCCCCCCCHHHHHHHH Confidence 11123455544332 44555555555444332111 111 111 1122333332 222222222221 Q ss_pred HHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhh Q lcl|NC_020414. 431 DKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQ 510 (515) Q Consensus 431 ~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~ 510 (515) .+ +++ .+.-..++..+ -+++ -.++|++.+.++++++++... -.++.-.+ T Consensus 419 ~k----------l~g-------~iS~et~~~~~---p~v~----D~~~E~eri~~E~~~~~~~~~-------~~~~~~~~ 467 (471) T protein:vir:10 419 ST----------LAT-------ITSRENVAKSN---PIVE----DWQDELRLQKAEQEGRSEKLY-------DMEEVEHE 467 (471) T ss_pred HH----------Hhc-------cCchHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHhccc-------ccCCCCCc Confidence 11 111 12222222222 1221 125666666554433221111 01111111 Q ss_pred hhcc Q lcl|NC_020414. 511 EMKE 514 (515) Q Consensus 511 ~~~~ 514 (515) .=-| T Consensus 468 ~e~~ 471 (471) T protein:vir:10 468 SEVE 471 (471) T ss_pred cccC Confidence 1111 No 107 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=96.11 E-value=0.001 Score=37.04 Aligned_cols=398 Identities=11% Similarity=0.066 Sum_probs=177.9 Q ss_pred HHHHHHHhhhhHHHHHHHHHHhhccc---ccCCC---CCCccccccccccHHHHHHHHHHHHHHhhcCCCCCceecCCCh Q lcl|NC_020414. 18 LWEKFSKKRSPYLDRAKHFAKLTLPY---LMNNK---GDNETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTA 91 (515) Q Consensus 18 r~~~lk~~R~~~e~~w~e~~~~~~P~---~~~~~---~~~~~~~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d 91 (515) .....+++| .++++.+.+|..-. ..... .......++..+-+...+++.++-|++- |+. | ...+ T Consensus 1 ~~~~~~~~~---~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~--~~~--~---~~~~ 70 (440) T protein:vir:95 1 MLAAFLGSQ---KQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGN--PVS--I---GVME 70 (440) T ss_pred ChhhHHHHH---HHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheecc--Cce--E---eeCC Confidence 223333333 33455555554321 11111 1111223455666666676666555331 222 2 2222 Q ss_pred HHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCCc--EEEEEcceEEEeeC Q lcl|NC_020414. 92 KGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK--PSKGA--MSAVPMHHYVVNRD 167 (515) Q Consensus 92 ~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~--d~~~~--~r~~pl~~y~v~~d 167 (515) ... .+..+ .+...+..++|.....++.++..++|.+.+++ |.++. +++++-.+.++..| T Consensus 71 ~~~----------~~~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~i~~~~p~~~~~~~d 133 (440) T protein:vir:95 71 GGS----------ADQLS-------TIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVDRVVLISPLEMFVIRD 133 (440) T ss_pred Ccc----------HHHHH-------HHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEc Confidence 111 11111 23355677899999999999999999987654 65544 45565555555555 Q ss_pred CC--CCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeC----Ceeec--ccCC Q lcl|NC_020414. 168 TN--GDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSAD----DIPVG--KENR 239 (515) Q Consensus 168 ~~--G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~----~~~i~--~esg 239 (515) +. +++...+|.+... ....+++|+ ++.... |.... +-... ..-+ T Consensus 134 ~~~~~~~~~~i~~~~~~----------------------~~~~~~vyt-----~~~~~~-~~~~~~~~~~~~~~~~~~~~ 185 (440) T protein:vir:95 134 LTVEQNIIAAVHLPIYA----------------------DKVNMTVYT-----KDKVIT-YKPYSNNSVRLVVDDVKKHS 185 (440) T ss_pred CCCCCceEEEEEEEEec----------------------CceEEEEEe-----CCeEEE-EEEecCCccceeecceeecc Confidence 54 4565555544211 011233332 111111 11110 01111 1123 Q ss_pred cccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecC---ccccChhhccCCCC-cce Q lcl|NC_020414. 240 IKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRP---GSQTDVDHFVNSGT-GEV 315 (515) Q Consensus 240 y~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~---~g~~~~~~~~~~~~-g~~ 315 (515) | ..||++.++. +.+|+|=.+...+-+..++.+.-......+....|.+.+.- ......+....... +.+ T Consensus 186 ~--g~vPvv~~~n-----~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~ 258 (440) T protein:vir:95 186 Y--NDVPVVEWWN-----NRFRMGDYESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANML 258 (440) T ss_pred C--ceeeEEEeeC-----CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccchhhhhhccce Confidence 4 3488887654 45799999999999999999988888888888888654321 01111221111110 111 Q ss_pred -ec--------CCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhcc-CCCCCCCHHHHHH-------HHHHHH Q lcl|NC_020414. 316 -IT--------GVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTR-RDAERVTAVEIQR-------DALEIE 378 (515) Q Consensus 316 -~~--------g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~-~~~~~~TAtEi~~-------r~~E~~ 378 (515) .+ +...++..+. ...+.+.....++.++..|...-..-.+.. .-+...|+..+.. ++++++ T Consensus 259 ~~~~~~~~~~~~~~~~~~~lt--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~ 336 (440) T protein:vir:95 259 FLKTGISTTGQQTTADASYIY--KQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQVRKDKE 336 (440) T ss_pred ecccccccccCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHH Confidence 10 1112233332 224566667777777776644321000000 0112457776533 356666 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHh-cCCCCChhhccceee--eehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCC Q lcl|NC_020414. 379 QNMGGVYSLFAMTMQTPIAMWGLQE-AGDSFTSELVDPVIV--TGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIR 455 (515) Q Consensus 379 ~~LGpv~~rl~~E~l~Pli~r~~~~-~~~~~p~~~~~~~~v--~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id 455 (515) ..+|..+.++-. ++..++.. .........+.+.+- .+.+-+..++.+.++ .+ .+. T Consensus 337 ~~~~~~l~~~~~-----li~~~~~~~~~~~~~~~~v~i~f~~~~p~~~~~~ad~~~kl---~g--------------~iS 394 (440) T protein:vir:95 337 TYFTKALRRRYE-----LISNIHKAINGPVIEANKLTFTFHPNIPQDVWTEIKAYIEA---GG--------------EIS 394 (440) T ss_pred HHHHHHHHHHHH-----HHHHHHhhcCCcccccccceEEeCCCCCCCHHHHHHHHHHH---hc--------------cCc Confidence 666666554433 12222221 122222223333332 223333333322222 11 122 Q ss_pred HHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhcc Q lcl|NC_020414. 456 WGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKE 514 (515) Q Consensus 456 ~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~ 514 (515) ...+++.+ -+++ .++|++.+.+++.+++ ....+.. ...-.++...| T Consensus 395 ~et~~~~l---~~~d-----~~~E~~ri~~E~~~~~--~~~~~~~---~~~~~~~~~~e 440 (440) T protein:vir:95 395 QETLMENA---SFTD-----YKTEHSRILKQGGSSD--LEIGQIV---GDADVGQADTE 440 (440) T ss_pred HHHHHHhC---CCCC-----cHHHHHHHHHHHHHhh--hhHHhhc---cCCCCCCcCCC Confidence 23333322 1222 3556666665433222 1112222 22223333444 No 108 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=96.09 E-value=0.001 Score=36.97 Aligned_cols=377 Identities=11% Similarity=0.056 Sum_probs=161.5 Q ss_pred ccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc-----ccCCCC--CCccccccccccHHHHHHHHHHHHHHhhcCC Q lcl|NC_020414. 8 YGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY-----LMNNKG--DNETSQNGWQGVGAQATNHLANKLAQVLFPA 80 (515) Q Consensus 8 ~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~~~--~~~~~~~~~dst~~~a~~~Laa~l~s~ltpp 80 (515) +. -+.++-+.+.+|..-. +...-. -+. .-+..-+-+..+++.||..|. .. T Consensus 1 l~------------------~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~-~~~~v~nw~~~~Vds~a~rl~----~~ 57 (410) T protein:vir:95 1 MN------------------LYQSRVNLRYKHYAMQHYEAPTGITIPAHIRA-KYQAVLGWAAKGVDSLADRLI----FR 57 (410) T ss_pred CC------------------cchhhHHHHHHHhcCCCCccccchhccHHHHh-HHHhhcchhHHHHHHhHhhhc----cc Confidence 11 1222222333333221 110000 000 012234555566666665443 11 Q ss_pred CCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eCCC--cEEE Q lcl|NC_020414. 81 QRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK--PSKG--AMSA 156 (515) Q Consensus 81 ~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~--d~~~--~~r~ 156 (515) + | ...|. ++. ....+++|.....++.++..++|.+.+++ +++. .+++ T Consensus 58 G---f--~~~d~-------------~l~-----------~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~~i~~ 108 (410) T protein:vir:95 58 A---F--ANDDF-------------NVT-----------EIFDRNNPDIFFDSAILSALIGSCSFVYISKGEDDEVRLQV 108 (410) T ss_pred c---c--cCCCc-------------hHH-----------HHHhhcChHHHHHHHHHHHHHhCceeEEEecCCCCceEEEE Confidence 2 1 11111 122 33557999999999999999999998877 3333 3666 Q ss_pred EEcce-EEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeec Q lcl|NC_020414. 157 VPMHH-YVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVG 235 (515) Q Consensus 157 ~pl~~-y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~ 235 (515) ++-.+ +++--+..+++..-++...- ........+.+|+ ++ ..+++.-++..-. T Consensus 109 ~sP~~~~~i~Dp~~~~~~~al~~~~~-------------------~~~~~~~~~~~~~-----~~--~~~~~~~~~~~~~ 162 (410) T protein:vir:95 109 IESSNATGVIDPITGLLVEGYAVLAR-------------------DDYNRPTLEAYFE-----PN--ATHFIPKDGEPYS 162 (410) T ss_pred EcccceEEEEeCCCCceEEEEEEEEe-------------------cCCCeEEEEEEEe-----CC--cEEEEeeCCcccc Confidence 65544 44443334555544432110 0000011222222 11 1222222221111 Q ss_pred ccCCcccccCcEEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHhccCceee---cCccccChhhccCCC Q lcl|NC_020414. 236 KENRIKAEKLPFIPLTWKRSYGEDWGRPLV-EDYSGDLFVIQFLSEAVARGAALMADIKYLI---RPGSQTDVDHFVNSG 311 (515) Q Consensus 236 ~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~-~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~---~~~g~~~~~~~~~~~ 311 (515) .+-++ ..||++++..+...++.||+|=. +..++-+..+++..-..+..++..+.|-..+ .+++... +.-... T Consensus 163 ~~~~~--g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~--~~~~~~ 238 (410) T protein:vir:95 163 VTNET--GIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQKYILGLDPDAEPM--EKWKAT 238 (410) T ss_pred ccCCC--CCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeeccCCCCCcC--chhhhh Confidence 12344 35999999999888999999943 5677778889998888888889988884322 2332211 111112 Q ss_pred Ccceec--CCccc--ccccccCCccchHHHHHHHHHHHHHHHHHH-H-HHhhccCCCCCCCHHHH-------HHHHHHHH Q lcl|NC_020414. 312 TGEVIT--GVEED--IHIVQLGKYADLTPISAVLEVYTRRIGVIF-M-METMTRRDAERVTAVEI-------QRDALEIE 378 (515) Q Consensus 312 ~g~~~~--g~~~~--v~~~~~~~~~~l~~~~~~i~~~~~rI~~af-l-~~~l~~~~~~~~TAtEi-------~~r~~E~~ 378 (515) .+.+.. ...++ ...-++ ..++++.....+..+...|...= + ...+.....-.-+|.-| ..++++|. T Consensus 239 ~~~i~~~~~~~~~~~~~v~q~-~~~~l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~ 317 (410) T protein:vir:95 239 VSSLLTISSSDKGVKPSVGQF-TTASMSPFTEQLRTAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQ 317 (410) T ss_pred hhhheeccCCCCCCcceEEec-CCCChHHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHH Confidence 233322 22111 122233 24566644333333333332110 0 00111101111233333 23456777 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceee-e---ehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcC Q lcl|NC_020414. 379 QNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIV-T---GIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAI 454 (515) Q Consensus 379 ~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v-~---~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~i 454 (515) ..+|.-+.++.. |+ ..+.+..+..+.+..+..++ . ..+.-..|+.++.+..+.+. .|- .+ T Consensus 318 ~~fg~~l~~~~r-----la-~~i~~~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~Kl~~a-------~~g---~~ 381 (410) T protein:vir:95 318 RSLGAGLLNVAY-----VA-ACLRDEFRYTRSQFVRTAVKWEPLFEADANTMTMIGDGVVKLNQA-------LPG---YI 381 (410) T ss_pred HHHHHHHHHHHH-----HH-HHHhcCCCCcccccceeeEEeeecCCcchhhHHHHHHHHHHHHHh-------ccC---Cc Confidence 777776665543 11 11234444445554444332 2 22222334444443332221 111 11 Q ss_pred CHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 455 RWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEA 494 (515) Q Consensus 455 d~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~ 494 (515) +.+ .+.+.+|. |+++++.++.+.+ .++.+ T Consensus 382 ~~~----~~~~~lg~------~~~~~~~~~~~e~-~~~g~ 410 (410) T protein:vir:95 382 NAE----TIRDLTGI------AGDMSAKPVVSEG-GSNGE 410 (410) T ss_pred cHH----HHHHhcCC------ChHHHHHHHHHHH-HhCCC Confidence 122 23334454 3333332222111 11111 No 109 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=96.04 E-value=0.0011 Score=36.84 Aligned_cols=420 Identities=12% Similarity=0.118 Sum_probs=170.5 Q ss_pred ccccccHHHHHHHHH-------H-----------HHHhhhhHHHHHHHHHHhhcccccCCCCCCcccccccc--ccHHHH Q lcl|NC_020414. 6 LEYGGQRSKIPKLWE-------K-----------FSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQ--GVGAQA 65 (515) Q Consensus 6 ~~~~~~~~~l~~r~~-------~-----------lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~d--st~~~a 65 (515) |-+=-..+.+.+|+- . +..+.......|+.+|.=--|-+.....++....+... +.+... T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i 80 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVTKLA 80 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecchHHHH Confidence 111001111111100 0 00111122344555543222211111111111222233 455566 Q ss_pred HHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE Q lcl|NC_020414. 66 TNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCL 145 (515) Q Consensus 66 ~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~ 145 (515) ++.+|+-+.+- || .+++++. ...++|+ +.+..++|+..+.+++.+..+.|.++ T Consensus 81 ~~~~A~ll~~e--~~-----~i~~~d~-------------~~~e~l~-------~i~~~n~f~~~~~~~~e~a~a~G~~~ 133 (505) T protein:vir:79 81 SAKLASLIFNE--QC-----QVTVSDE-------------TANDFLD-------DVFQQNDFYTTFEEKLEEWIALGSGC 133 (505) T ss_pred HHHHHhhhcCC--Cc-----eeecCCh-------------HHHHHHH-------HHHHhccHHHHHHHHHHHHhhcCCeE Confidence 66666644443 22 1333331 2344444 34667899999999999999999987 Q ss_pred E--EEeCCC-cEEEEEcceEE-EeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEE---c Q lcl|NC_020414. 146 L--YKPSKG-AMSAVPMHHYV-VNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQY---A 218 (515) Q Consensus 146 l--~~d~~~-~~r~~pl~~y~-v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~---~ 218 (515) + |+|.+. .+.+++-..++ +..|..+....+|..+.... .+ ..-.+|+.+++ + T Consensus 134 ~k~~~D~~~~~i~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~------------------~~---~~~~~yt~lE~h~~~ 192 (505) T protein:vir:79 134 VRPYVDSGKIKLAWATADQVYPLQADTNQVNELAIASRTTEV------------------EN---HRTIYYTLLEFHQWD 192 (505) T ss_pred EEEEEeCCceEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEEe------------------cC---CcceEEEEEEEEEec Confidence 5 667543 36677877765 55666554444443221100 00 11123444433 2 Q ss_pred CCCCe---EEEEEeC----Ceee-c----------cc---CCcccccCcEEEEe---ee-ecCCCccccchHHHHHHHHH Q lcl|NC_020414. 219 GEGFW---KINQSAD----DIPV-G----------KE---NRIKAEKLPFIPLT---WK-RSYGEDWGRPLVEDYSGDLF 273 (515) Q Consensus 219 ~~~~~---~~~~e~~----~~~i-~----------~e---sgy~~~~~P~~~~R---w~-~~~g~~YGrgp~~~~l~d~k 273 (515) +..+. .+|..-+ |..+ + .+ .|+. .-+|..++ ++ ...++.+|+|-...+.+-+. T Consensus 193 ~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~--~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id 270 (505) T protein:vir:79 193 HGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLK--HPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVID 270 (505) T ss_pred CceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCCC--cceEEEecCCcccccccCCccCCchhhhhHHHHH Confidence 22111 1222211 1111 1 11 2221 12233322 22 23466799999999999999 Q ss_pred HHHHHHHHHHHHHHHhccCceeecCccccChhhccCCC----------Ccc--eec--CCcccccccccCCccch--HHH Q lcl|NC_020414. 274 VIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSG----------TGE--VIT--GVEEDIHIVQLGKYADL--TPI 337 (515) Q Consensus 274 ~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~----------~g~--~~~--g~~~~v~~~~~~~~~~l--~~~ 337 (515) .|+..--+.....+. .+....+++ .++....-..+. ... +.. +..+....-.+ ..++ ... T Consensus 271 ~lD~~~s~~~~e~~~-g~~~i~v~~-~~l~~~~~~~~~~~~~~~~~fd~~~~~y~~~~~~~~~~~i~~~--~~~ir~e~~ 346 (505) T protein:vir:79 271 AINRTHDQFVDEVKK-GQRRLIVPA-EWLKTGSSYGGQASETHPPMFDPDETVYQAMYGDASEVGFHDA--TSPIRVADY 346 (505) T ss_pred HHHHHHHHHHHHHHh-cccceeech-HHhcccCCCCcccccccccCCCccceeeeeccCCCCCCceEEe--cccCCHHHH Confidence 999876666665543 344433322 222211100000 000 001 11111111011 1122 222 Q ss_pred HHHHHHHHHHHHHHHHH--HhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH--hc--------- Q lcl|NC_020414. 338 SAVLEVYTRRIGVIFMM--ETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ--EA--------- 404 (515) Q Consensus 338 ~~~i~~~~~rI~~afl~--~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~--~~--------- 404 (515) ...++.+-++|....=+ .++........|||||..+.+...+...-.-..+ ...|..|++.++. .. T Consensus 347 ~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l~~t~~~~~~~~-~~al~~li~~i~~~~~~~~~~~~g~~ 425 (505) T protein:vir:79 347 QATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQTYQTRSSYITQV-EKTIKALTYAILELASVPSFYADGQA 425 (505) T ss_pred HHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhcccccccc Confidence 33344444433322111 1222223334699999998888888777543333 4455666665432 11 Q ss_pred --CCCCChhhccceeeeeh--HHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHH Q lcl|NC_020414. 405 --GDSFTSELVDPVIVTGI--EALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEM 480 (515) Q Consensus 405 --~~~~p~~~~~~~~v~~l--~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev 480 (515) .+.++...+.+.+=.++ +.-+.. +. ..+.++ +++ +....+ +....| -|++|+ T Consensus 426 ~~~~~~~~~~i~v~f~d~i~~d~~~~~---~~---~~~~v~--~Gi-------~s~e~~---l~~~~~------~~eeea 481 (505) T protein:vir:79 426 RWTGDVDSLDITINFNDGVFVDQESKR---AA---DLQAVQ--AQV-------MPKKQF---LMRNYG------LDEEEA 481 (505) T ss_pred cccCCCCceeEEEEeCCCCCCCHHHHH---HH---HHHHHH--cCC-------CCHHHH---HHhcCC------CChHHH Confidence 12222222333222222 111111 11 111111 111 112222 222334 345665 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhccchhhhh Q lcl|NC_020414. 481 QQEMAQQAQAQQEAMLNEGVAKAVPGVIQQE 511 (515) Q Consensus 481 ~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~ 511 (515) +++.++-++ ++. .+.-.....||+ T Consensus 482 ~~el~ri~~-E~~------~~~p~~~~~gg~ 505 (505) T protein:vir:79 482 DEWLAQIDA-ENS------TAEPEFNQFGGD 505 (505) T ss_pred HHHHHHHHH-hcc------ccCCCchhccCC Confidence 554432211 111 111223344555 No 110 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=96.04 E-value=0.0011 Score=36.82 Aligned_cols=435 Identities=11% Similarity=0.060 Sum_probs=190.0 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHH--HHHHHHHHhhcccccCCC--CCCc-----cccc-cccccHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYL--DRAKHFAKLTLPYLMNNK--GDNE-----TSQN-GWQGVGAQATNHLA 70 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e--~~w~e~~~~~~P~~~~~~--~~~~-----~~~~-~~dst~~~a~~~La 70 (515) |.|-.-+ --.=..+..+|+.+ |.-+. ..|++...-.||.....+ ..+. ...+ .|-+...+.++. T Consensus 1 m~~V~~~-hp~y~~~~~~W~~i---rd~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~-- 74 (501) T protein:vir:95 1 MPNVSFI-RPELGKLLPLYYLI---RDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFG-- 74 (501) T ss_pred CCCCCCC-CHHHHHHHHHHHHH---HHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHH-- Confidence 7752211 01123344444443 54453 456666666777532221 1111 1111 244444444444 Q ss_pred HHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeC Q lcl|NC_020414. 71 NKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPS 150 (515) Q Consensus 71 a~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~ 150 (515) |++.+|- .+-.++.+ ..++.+++.| -+...+.+.-+..++.+...+|-+.+++|. T Consensus 75 --l~G~vf~---k~p~~~~p--------------~~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~ 129 (501) T protein:vir:95 75 --LVGQVFM---RDPVVKVP--------------ALLNPLVANA------TGSGINLTQLAKRAVSLNLAYSRAGLLVDY 129 (501) T ss_pred --Hhhhhhc---CCcceeCc--------------HHHHHHHhcc------CCCCCCHHHHHHHHHHHHHhcCeEEEEEee Confidence 4444442 11122221 2244455444 234567888888999999999999999984 Q ss_pred CC----c---------------EEEEEcceE---EEe-eCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcc Q lcl|NC_020414. 151 KG----A---------------MSAVPMHHY---VVN-RDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDD 207 (515) Q Consensus 151 ~~----~---------------~r~~pl~~y---~v~-~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~ 207 (515) .. + +..|+-.+. -.. .|...++.-+..+++.+.+. .+|+ .+ T Consensus 130 P~~~~~~~~t~a~~~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d--~~f~--------------~~ 193 (501) T protein:vir:95 130 PTTEAEGGASIADLEAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAAD--DGFE--------------MK 193 (501) T ss_pred cCCCCcccccHHHHHhccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecC--CCcc--------------cc Confidence 31 1 223332221 111 22333444444444444321 2333 24 Q ss_pred cEEEEEEEEEcCCCCeE--EEEEeCC-----------------eee-cccCCcccccCcEEEEeeeecCCCccccchHHH Q lcl|NC_020414. 208 NVKLYTHAQYAGEGFWK--INQSADD-----------------IPV-GKENRIKAEKLPFIPLTWKRSYGEDWGRPLVED 267 (515) Q Consensus 208 ~v~v~~~v~~~~~~~~~--~~~e~~~-----------------~~i-~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~ 267 (515) .++.|..+.++.+|.+. +|.+-+. ... ...|+ +.+++|++.|.-..+...+.| .- T Consensus 194 ~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~---~~l~~IPfv~~~~~~~~~~~~--~p 268 (501) T protein:vir:95 194 TSGQFRVLRLDEEGYYVHEIWREPQPTKADGSKIPKGNYQQYVVYKPTDAQG---KRLTEIPFMFIGSENNDSNPD--NP 268 (501) T ss_pred eeEEEEEEeeCCCceEEEEEEEecCCcccCcceecCCcccccceeeeeccCC---CcCCeeeEEEEecCCCCCCCC--cc Confidence 56677777777666543 3322110 111 22333 458899999976666654443 11 Q ss_pred HHHHHHHHHHH---H-HHHHHHHHHhccCceee-cCcc----ccChhhccCCCCcceecCCcccccccccCCccchHHHH Q lcl|NC_020414. 268 YSGDLFVIQFL---S-EAVARGAALMADIKYLI-RPGS----QTDVDHFVNSGTGEVITGVEEDIHIVQLGKYADLTPIS 338 (515) Q Consensus 268 ~l~d~k~L~~l---~-~~~~~~~~~a~~p~~l~-~~~g----~~~~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~ 338 (515) .|=|+..||.- . -..-..+..+..|...+ ..+. ..+...+.-+.+..+.....++...++.. +..+ .. T Consensus 269 PLl~lA~lni~hy~~ssd~~~~l~~~~~P~l~i~G~~~~~~~~~~~~~i~~G~~~~~~lP~~~~~~~ie~~-~~~i--~~ 345 (501) T protein:vir:95 269 NFYDLASLNMAHYRNSADYEESCYIVGQPTPVLIGLTEEWVTNVLKGSVNFGSRGGIPLPVGADAKLLQAS-ENTM--LK 345 (501) T ss_pred chHHHHHHHHHHHhhhhHHHHHHHHcccceeeeeCCcccccccCCCCceeecccccccCCCCCceeEEecC-hhhH--HH Confidence 22244444422 1 22333444455553222 1110 01111122222222211222333444432 2233 35 Q ss_pred HHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH-hcCCCCChhhcccee Q lcl|NC_020414. 339 AVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ-EAGDSFTSELVDPVI 417 (515) Q Consensus 339 ~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~-~~~~~~p~~~~~~~~ 417 (515) ..|++++.+++++= . .+........||++...+....-..|.-+...+..-+..-|-..+.. +..+.=....++..+ T Consensus 346 ~~l~~l~~~m~~~G-a-~ll~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~w~g~~~~~~~v~i~~df 423 (501) T protein:vir:95 346 EAMDTKERQMVALG-A-KLVEQKEVQRTATEAELEAASEGSTLSSATKNVSAAFEWALKWAARWVGQADSGVKFELNTDF 423 (501) T ss_pred HHHHHHHHHHHHHH-H-hhccCCccchhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEeccc Confidence 66777777776642 1 22334444589999999999999999998888876654432222211 222211111133333 Q ss_pred e-eehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 418 V-TGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAML 496 (515) Q Consensus 418 v-~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~ 496 (515) . ..+.+ +.++.++... -...|..+.+.+.+.+ .||+.. ..+++.++++....++..... T Consensus 424 ~~~~~~~-------~~~~al~~~~---------~~G~is~~t~~~~L~~-~~v~~~--~~~~e~e~i~~~~~~~~~~~~- 483 (501) T protein:vir:95 424 DIARMTP-------DERRSLVEEW---------QKGAITFEEMRTGLRK-AGVATE--DDSKAKEKIAKDTAEAMALAT- 483 (501) T ss_pred ccccCCH-------HHHHHHHHHH---------hCCCCcHHHHHHHHHh-CCCCCh--hHHHHHHHHHhhhcCcccccc- Confidence 2 12211 1122222211 1123445555555533 466542 113333444332221111111 Q ss_pred HHHhhhhccchhhhhh-ccC Q lcl|NC_020414. 497 NEGVAKAVPGVIQQEM-KEG 515 (515) Q Consensus 497 ~~~~~~a~~~~~~~~~-~~~ 515 (515) .+. .-+...|+++ --+ T Consensus 484 --~~~-~~~~~~gg~~~~~~ 500 (501) T protein:vir:95 484 --PAN-VPGDGSGGDNVGNS 500 (501) T ss_pred --cCC-CCCCCcccccccCC Confidence 111 1111223333 111 No 111 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=95.63 E-value=0.0017 Score=35.75 Aligned_cols=412 Identities=11% Similarity=0.096 Sum_probs=181.9 Q ss_pred CCCc----------------cccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcc--cccCCCC---C---Cccccc Q lcl|NC_020414. 1 MQDT----------------ILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLP--YLMNNKG---D---NETSQN 56 (515) Q Consensus 1 ~~~~----------------~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P--~~~~~~~---~---~~~~~~ 56 (515) |+++ +...-++.+.|.+..+.... |.+....|+++|+=.-+ ....... . .....+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~k 79 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITKHKE-NVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWR 79 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccccccccccccccccc Confidence 4443 11223456666666665554 44556666666643321 1111100 0 011124 Q ss_pred cccccHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHH Q lcl|NC_020414. 57 GWQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFK 136 (515) Q Consensus 57 ~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~ 136 (515) +..+-+...++..++-|.+ -|+. ++.++.. ..+.|.+ .+ ..||...+.++.+ T Consensus 80 i~~n~~~~Iv~~~~~~l~g--~p~~-----~~~~d~~-------------~~~~l~~-------~~-~n~~~~~~~~~~~ 131 (468) T protein:vir:96 80 MYTNYHQNLVDQKVAYAVA--NPVT-----YGTEDEK-------------SLKTIQE-------VL-NHKWDDKLVDILT 131 (468) T ss_pred cccchHHHHHHHHHhhhcc--CCce-----eccCChH-------------HHHHHHH-------HH-hcCHHHHHHHHHH Confidence 5556666666666655543 2222 2333321 2222222 22 3688889999999 Q ss_pred HHHhhCceEE--EEeCCCcEEE--EEc-ceEEEeeC-CCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEE Q lcl|NC_020414. 137 HLIVAGNCLL--YKPSKGAMSA--VPM-HHYVVNRD-TNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVK 210 (515) Q Consensus 137 dl~~~G~~~l--~~d~~~~~r~--~pl-~~y~v~~d-~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~ 210 (515) +..++|.+.+ |.|.++.+++ ++. .-|.+-.| ..|.+..++|.++..-. ..++ T Consensus 132 ~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~~~~----------------------~~~~ 189 (468) T protein:vir:96 132 AASNKGVEWIQPYVDEQGEFKTFRVPAEQAIPIWTNKERDELKAFIRLYELDGG----------------------ERVE 189 (468) T ss_pred HHhhcCeEEEEEEEcCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEecCc----------------------eEEE Confidence 9999999874 5666655543 433 33555433 35777766666643210 1122 Q ss_pred EEE------EEEEcCCCCeEEEEEe----CCeeecc-cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHH Q lcl|NC_020414. 211 LYT------HAQYAGEGFWKINQSA----DDIPVGK-ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLS 279 (515) Q Consensus 211 v~~------~v~~~~~~~~~~~~e~----~~~~i~~-esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~ 279 (515) +|+ ...........+.... ++..+.. .-+| ..+|++.++ ++.+|.|=.+...+-+..++.+- T Consensus 190 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~iPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~ 262 (468) T protein:vir:96 190 YWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSW--NRVPFIPFK-----NNPQEVSDLFMYKTIIDAMDKRL 262 (468) T ss_pred EEeCCeEEEEEEcCCceeecccccccccccceeeccccccC--CcccEEEec-----CCCCCCCchHHHHHHHHHHHHHH Confidence 221 1111111000000000 1111111 1223 347777653 35679998889999999999888 Q ss_pred HHHHHHHHHhccCceeecCccccChhhccC-C-CCcce-ecCCc-ccccccccCCccchHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_020414. 280 EAVARGAALMADIKYLIRPGSQTDVDHFVN-S-GTGEV-ITGVE-EDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM-M 354 (515) Q Consensus 280 ~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~-~-~~g~~-~~g~~-~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl-~ 354 (515) -......+....|.+.+.....-+...+.. . ..+.+ +++.. +++..+. ...+.+.....++.++..|...-. . T Consensus 263 S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~d~~~~~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p 340 (468) T protein:vir:96 263 SDTQNTFDEATELIYVLKGYEGEDLEEFMYNLKYYKAINVDGDGSGGVDTIQ--IDVPVQSAKEYLDMLRDYVIEFGQGV 340 (468) T ss_pred HHHHHHHHHhcCceeeeecCCccccchhhhhhhcCceEEecCCCCCcceEEe--ecCChHHHHHHHHHHHHHHHHHhCcc Confidence 888888888888876553211111111111 1 11222 22322 2333333 223556667777777776654421 1 Q ss_pred HhhccCCCCCCCHHHHHH-------HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceee--eehHHHH Q lcl|NC_020414. 355 ETMTRRDAERVTAVEIQR-------DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIV--TGIEALG 425 (515) Q Consensus 355 ~~l~~~~~~~~TAtEi~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v--~~l~~l~ 425 (515) +......+...|+..+.. .+.++...++..+.++- +.++.-.+-......+.+.+- .+.+.+. T Consensus 341 ~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~--------~li~~~~g~~~d~~~i~i~f~~~~p~d~~e 412 (468) T protein:vir:96 341 DFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELL--------QYIIDFYKLSIKVQDVEITFNFNVMVNELE 412 (468) T ss_pred cccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHhCCCcccceeeEEecCCCCcCHHH Confidence 111111123456665532 23455555555444432 222221111211122333221 1222221 Q ss_pred HHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_020414. 426 RMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVP 505 (515) Q Consensus 426 ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~ 505 (515) .++ .+... ..+.-..+++.+ -+++ -.++|++.+.++++++.+.+ . T Consensus 413 ~a~----------~~~~~--------g~iS~et~i~~l---~~v~----D~~~E~~ri~~E~~~~~~~~----------~ 457 (468) T protein:vir:96 413 QSQ----------IGVNS--------QYLSKETVVTNH---PWVD----DPVAEMERIDQEELALPSIE----------E 457 (468) T ss_pred HHH----------HHHhc--------CCCchHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHh----------h Confidence 111 11111 122223333322 1221 12466666655443332211 2 Q ss_pred chhhhhhccC Q lcl|NC_020414. 506 GVIQQEMKEG 515 (515) Q Consensus 506 ~~~~~~~~~~ 515 (515) ...|+...+. T Consensus 458 ~~~~~~~~~~ 467 (468) T protein:vir:96 458 GLNGKENNEP 467 (468) T ss_pred ccCCCCCCCC Confidence 2334444444 No 112 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=95.20 E-value=0.0026 Score=34.80 Aligned_cols=409 Identities=10% Similarity=0.030 Sum_probs=176.9 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCC-CCCc--ccc--ccccccHHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNK-GDNE--TSQ--NGWQGVGAQATNHLANKLAQ 75 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~-~~~~--~~~--~~~dst~~~a~~~Laa~l~s 75 (515) |-+++. ..+...+..+..+ ..+.+.+.+|..-...... +... ..+ +..-+-+..+++.||..|. T Consensus 12 l~~~~~------~~~~~L~~~~~~~----~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~rl~- 80 (474) T protein:vir:81 12 LSNDEN------ALINGLLAQIENL----RWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDALARRCN- 80 (474) T ss_pred CChhHH------HHHHHHHHHHHHH----hhHHHHHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHHHHhhhc- Confidence 555542 2344444443333 3344455555432211000 1000 000 1233445566666666443 Q ss_pred hhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeC--CC- Q lcl|NC_020414. 76 VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPS--KG- 152 (515) Q Consensus 76 ~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~--~~- 152 (515) +-+ |++. +... +... +++...+++|.....+++++..+||.+.+++-. ++ T Consensus 81 ---~~G---f~~~--d~~~--------~~~~-----------l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~ 133 (474) T protein:vir:81 81 ---LEG---FVWP--DGDL--------DSLG-----------GTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDE 133 (474) T ss_pred ---ccc---eECC--CCCc--------cchH-----------HHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCC Confidence 111 2222 2110 0011 234456799999999999999999999887732 22 Q ss_pred ---cEEEEEcceEEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEE--EEE--EcCCCCeE Q lcl|NC_020414. 153 ---AMSAVPMHHYVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYT--HAQ--YAGEGFWK 224 (515) Q Consensus 153 ---~~r~~pl~~y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~--~v~--~~~~~~~~ 224 (515) .+++++-.+.++..|+ .+++...++.... + ..++ -....+|. .++ ..+++-+. T Consensus 134 ~~~~i~~~sp~~~~~~~D~~~~~~~~al~~~~~-----------~-------~~g~-~~~~~ly~~~~~~~~~~~~~~~~ 194 (474) T protein:vir:81 134 PEALIHVKDASEATGEWNRRRRGLNNLLSIIDK-----------D-------KEGK-VLSLALYLDNETVTAQRDKATLK 194 (474) T ss_pred ceeEEEEeccceEEEEEeCCCCcceeeeEEEEE-----------c-------CCCc-EEEEEEEeCCcEEEEEEcCccce Confidence 2666665443333344 3333322222100 0 0000 01112221 000 11111122 Q ss_pred EEEEeCCeeecccCCcccccCcEEEEeeeecCCCccccchH-HHHHHHHHHHHHHHHHHHHHHHHhccCceeecCcccc- Q lcl|NC_020414. 225 INQSADDIPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLV-EDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQT- 302 (515) Q Consensus 225 ~~~e~~~~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~-~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~- 302 (515) |..+. .+-++ .+|++++..+..-++.+|+|-. +..++-+..+|+..-..+..++..+.|-..+- |.. T Consensus 195 w~~~~------~~~~~---gvPvV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~--G~~~ 263 (474) T protein:vir:81 195 WQVDR------DEHVY---GVPAQVLPYKPAPKRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLL--GADE 263 (474) T ss_pred eeecc------CCCCC---CcceEEecccccccCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheee--cCCh Confidence 22111 12233 2799999988888999999965 56778888899998888889999888853331 111 Q ss_pred --------ChhhccCCCCcce--ecCCccc-c------cccccCCccchHHHHHHHHHHHHHHHHHHHHH-----hhccC Q lcl|NC_020414. 303 --------DVDHFVNSGTGEV--ITGVEED-I------HIVQLGKYADLTPISAVLEVYTRRIGVIFMME-----TMTRR 360 (515) Q Consensus 303 --------~~~~~~~~~~g~~--~~g~~~~-v------~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~-----~l~~~ 360 (515) ++...-....+.+ ++...+. + +.-++. .++++.- ++.++.-|....... .|... T Consensus 264 ~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~-~a~l~~~---~~~l~~~~~~~a~~t~iP~~~lG~~ 339 (474) T protein:vir:81 264 SALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFP-AASPDAH---WSDINGLAKLFAREASLPDTAVAIS 339 (474) T ss_pred hhcccccccccchhhhhHHHHhcCCCcccccccccccccccccC-CCChhHH---HHHHHHHHHHHHhhhCCCHHHhccc Confidence 1111111111222 2222221 1 111111 2334432 333333333322111 11111 Q ss_pred C-CCCCCHHHH-------HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhc--CCCCChhhcccee----eeehHHHHH Q lcl|NC_020414. 361 D-AERVTAVEI-------QRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEA--GDSFTSELVDPVI----VTGIEALGR 426 (515) Q Consensus 361 ~-~~~~TAtEi-------~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~--~~~~p~~~~~~~~----v~~l~~l~r 426 (515) + ...-+|.-| ..++++|...+|.-+.++.. |+ ..+.+. ..+.+.+..+... ....+.+++ T Consensus 340 ~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~r-----la-~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~ 413 (474) T protein:vir:81 340 GLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFI-----RA-LAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQ 413 (474) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HH-HHHhCCCCccccchhhccceeEecCCCccCHHHH Confidence 1 111234433 33556777777776665543 11 112222 2345555544432 233444444 Q ss_pred HHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_020414. 427 MAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPG 506 (515) Q Consensus 427 a~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~ 506 (515) |..+.++.+. ...+++. .+ ..+.+|+ |++|++.+++.+.+++-...+.+......++ T Consensus 414 aDa~~Kl~~a------~~~~~~~--------~~---~~~~lg~------t~~~i~~~~~~~~~~~~~~~~~~l~~~~~~~ 470 (474) T protein:vir:81 414 ADAGMKQLAA------VPWLAET--------EV---GLELIGL------TPQQARRAMADKRRVQGRGTLQALIDRSNNG 470 (474) T ss_pred HHHHHHHHhc------ccCCCcH--------HH---HHhhcCC------CHHHHHHHHHHHHHHhHHHHHHHHHhcCCCC Confidence 4444333321 1112211 11 1123354 4778877665544443333334333333333 Q ss_pred hhhh Q lcl|NC_020414. 507 VIQQ 510 (515) Q Consensus 507 ~~~~ 510 (515) +-+| T Consensus 471 ~~aq 474 (474) T protein:vir:81 471 ATAQ 474 (474) T ss_pred CCCC Confidence 3344 No 113 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=95.20 E-value=0.0026 Score=34.80 Aligned_cols=422 Identities=9% Similarity=0.043 Sum_probs=184.5 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccccc--CCC-----CCCcccccccc-----ccHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLM--NNK-----GDNETSQNGWQ-----GVGAQATNH 68 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~--~~~-----~~~~~~~~~~d-----st~~~a~~~ 68 (515) -|+ +.+.+++... -.|.. ..++|+-|.+.+--.+. -.. ......+..|. ++=.-+.+. T Consensus 3 ~~~-~~~~~V~~~h--p~y~a-------~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~ 72 (491) T protein:vir:95 3 TAN-GQGSGVKTKH--REWLH-------YAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRR 72 (491) T ss_pred ccC-CccCCCCccC--HHHHH-------HHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHH Confidence 222 3333333322 22222 23456666655432110 000 00001011111 111112333 Q ss_pred HHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE Q lcl|NC_020414. 69 LANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK 148 (515) Q Consensus 69 Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~ 148 (515) ....|++.+|- ..|.+. .++ .++.+++.| -....+.+.-+...+.+...+|-+.+++ T Consensus 73 tl~~l~G~vfr-k~p~~~--~p~--------------~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~ilV 129 (491) T protein:vir:95 73 TLSGMVGSVMR-KEPEIN--IPK--------------ELEYLLKNA------DGSGVGLIQHAQDTLMEIDSVGRGGLLV 129 (491) T ss_pred HHHHHhchhhc-CCceee--ccH--------------HHHHHHhcc------CCCCCCHHHHHHHHHHHHHHcCeEEEEE Confidence 33444444443 334442 211 234455544 2446678888899999999999999999 Q ss_pred eCCC------------c----EEEEEcceE---EE-eeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCccc Q lcl|NC_020414. 149 PSKG------------A----MSAVPMHHY---VV-NRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDN 208 (515) Q Consensus 149 d~~~------------~----~r~~pl~~y---~v-~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~ 208 (515) |... + +..|+-.+. -. ..|..+++.-+..+++..+++=..+|+ .+. T Consensus 130 D~P~~~~~T~Ade~~~~~rPy~~~~~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~--------------~~~ 195 (491) T protein:vir:95 130 DAPETAAATAAEQNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFE--------------TKY 195 (491) T ss_pred ecCCCcccCHHHHHHhcCCcEEEEechhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcc--------------cce Confidence 8532 1 334443332 12 235555666666666544433333333 355 Q ss_pred EEEEEEEEEcCCCCeEE--EEE-eCCe-------eecccCCcccccCcEEEEeeeecCCCcccc--chHHHHHHHHHHHH Q lcl|NC_020414. 209 VKLYTHAQYAGEGFWKI--NQS-ADDI-------PVGKENRIKAEKLPFIPLTWKRSYGEDWGR--PLVEDYSGDLFVIQ 276 (515) Q Consensus 209 v~v~~~v~~~~~~~~~~--~~e-~~~~-------~i~~esgy~~~~~P~~~~Rw~~~~g~~YGr--gp~~~~l~d~k~L~ 276 (515) ++.|..+.+..++.+.+ |.. .++. .+..+|+ +.+++|++.|....+..+.. .|.. |+..|| T Consensus 196 ~~qyRvL~l~~~g~~~~~v~r~~~~g~~~~~~~~~~~~~g~---~~l~~IPfv~~~~~~~~~~~~~pPLl----~LA~ln 268 (491) T protein:vir:95 196 GEQYRVLDIDTDGNYRQRLFRFDAEGGAQEEVVEIYPDLGE---SLRGVIPFTFIGATNNDATIDDAPLL----PLAELN 268 (491) T ss_pred EEEEEEEeecCCCceEEEEEEEcCCCcceeeeeeeeecCCC---cccCeeEEEEEecCCCCCCCCcCchH----HHHHHH Confidence 67777777766665543 321 1221 1223444 34778888887666665544 4533 555554 Q ss_pred HH---HHHHHH-HHHHhccCceee-cCcc-------ccChhhccCCC-CcceecCCcccccccccCCccchHHHHHHHHH Q lcl|NC_020414. 277 FL---SEAVAR-GAALMADIKYLI-RPGS-------QTDVDHFVNSG-TGEVITGVEEDIHIVQLGKYADLTPISAVLEV 343 (515) Q Consensus 277 ~l---~~~~~~-~~~~a~~p~~l~-~~~g-------~~~~~~~~~~~-~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~ 343 (515) .- ..+-.+ .+..+..|.+.+ .-+. .+++..+.-+. .+...| ..++.+.++.+ +..+ +...|.+ T Consensus 269 i~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~~~~~~~~~~~~~i~~g~~~~~~lP-~~~~~~~ie~~-~~~~--~~~~l~~ 344 (491) T protein:vir:95 269 IGHYRNSADNEESSFVVGQPTLFIYPGDNLTPQSFKEANPNGIKFGSRCGHNLG-YGGSAQLIQAG-ENNL--ARQNMLD 344 (491) T ss_pred HHHhhhhhHHHHHHHHcccceeeeecCcccCcchhhccCcceeEecCcCCcCCC-CCCccceeecC-cchH--HHHHHHH Confidence 32 222233 333444443332 2111 11122222111 111112 22333444433 1222 4666777 Q ss_pred HHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChh-h--cccee-ee Q lcl|NC_020414. 344 YTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSE-L--VDPVI-VT 419 (515) Q Consensus 344 ~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~-~--~~~~~-v~ 419 (515) ++.+.+.+= ..++ ..+ ...||++...+...--..|..+...+..-+-.-|-..+.. .+...+.+ . ++..+ +. T Consensus 345 ~e~qm~~~G-a~l~-~~~-~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w-~G~~~~~~v~i~~n~dF~~~ 420 (491) T protein:vir:95 345 KEQQAIQIG-AQLI-TPS-QQITAESARIQRGADTSVMATIARNVSQAYTDALRWVAMM-LGKPEDSEVEFQLNMDFFLQ 420 (491) T ss_pred HHHHHHHHH-HHhc-cCC-cchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH-cCCCCCCceEEEeecccccc Confidence 777665541 1222 233 3589999999999999999998888877665554333333 22222222 1 23333 22 Q ss_pred ehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 420 GIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEG 499 (515) Q Consensus 420 ~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~ 499 (515) .+. ++. +..++.... ...|....+.+.+ ...||+.. ..+++..++.++ + T Consensus 421 ~~~----~~~---~~all~~~~---------~G~is~~t~~~~L-~~~~vl~~--~~e~~~~~ie~~------------~ 469 (491) T protein:vir:95 421 PMT----AQD---RAAWMADIN---------AGLLPATAYYAAL-RKAGVTDW--TDEDILNAIEDA------------P 469 (491) T ss_pred cCC----HHH---HHHHHHHHh---------cCCCCHHHHHHHH-HhCCCCCc--cHHHHHHHHHhc------------C Confidence 232 112 222222111 1233333333333 44566521 122222222221 1 Q ss_pred hhhhccchhhhhhccC Q lcl|NC_020414. 500 VAKAVPGVIQQEMKEG 515 (515) Q Consensus 500 ~~~a~~~~~~~~~~~~ 515 (515) ..-.+...+++++.|+ T Consensus 470 ~~~~~~~~~~~~~~~~ 485 (491) T protein:vir:95 470 LPSGAVTQVAGEIPQA 485 (491) T ss_pred CCCCccccccccchhh Confidence 1111111222233222 No 114 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=95.13 E-value=0.0027 Score=34.67 Aligned_cols=411 Identities=10% Similarity=0.095 Sum_probs=177.9 Q ss_pred CCCccccccccH------------------HHHHHHHHHHHHhhhhHHHHHHHHHHhhccc-----ccCC-CC-----CC Q lcl|NC_020414. 1 MQDTILEYGGQR------------------SKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY-----LMNN-KG-----DN 51 (515) Q Consensus 1 ~~~~~~~~~~~~------------------~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~-~~-----~~ 51 (515) |++-- .++++ +.|.+.....+ ...++++.+.+|.... +... .. .. T Consensus 1 ~~~~~--~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~----~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~ 74 (478) T protein:vir:10 1 MISIN--WPWDKPYHEQVVEQIKPKYETQEEMILRLVREHK----ENIDNITMGERYYNHHPDILDAPPKRDVNGDYDET 74 (478) T ss_pred Ccccc--CCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHH----HHHHHHHHHHHHhcCCCchhccccccccccccccc Confidence 55431 22222 22222233332 2234455566665432 1000 00 00 Q ss_pred ccccccccccHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHH Q lcl|NC_020414. 52 ETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAI 131 (515) Q Consensus 52 ~~~~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~ 131 (515) ....|+..+-+...++..++-|++ -|+. ++..+... ...+.+ .+ ..+|.... T Consensus 75 ~~~~ki~~n~~~~ivd~~~~~l~g--~~~~-----~~~~~d~~---------~~~l~~-----------~~-~n~~~~~~ 126 (478) T protein:vir:10 75 KPDWRMYTNYHQNLVDQKVAYAVA--NPVT-----FGVDNDKA---------LKQIQH-----------TL-NHKWDDKL 126 (478) T ss_pred cccceeccchHHHHHHHHHhhhcc--CCee-----eecCChHH---------HHHHHH-----------HH-hcCHHHHH Confidence 111234556666777777766553 1211 23332211 111222 22 36899999 Q ss_pred HHHHHHHHhhCceEE--EEeCCCcEE--EEEc-ceEEEee-CCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCC Q lcl|NC_020414. 132 VEVFKHLIVAGNCLL--YKPSKGAMS--AVPM-HHYVVNR-DTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKE 205 (515) Q Consensus 132 ~~~~~dl~~~G~~~l--~~d~~~~~r--~~pl-~~y~v~~-d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~ 205 (515) .++.++..++|.+.+ |.|.++.++ +++- .-|.+-- +..|.+...+|.++..- T Consensus 127 ~~~~~~~~~~G~~~~~~~~d~~g~~~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~---------------------- 184 (478) T protein:vir:10 127 VDILTAASNKGIEWVQPYVDEEGEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELDG---------------------- 184 (478) T ss_pred HHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecC---------------------- Confidence 999999999999875 556665554 3433 4455543 34677877777664211 Q ss_pred cccEEEEEE------EEEcCCCCeEEEEEeCCe----eec-ccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHH Q lcl|NC_020414. 206 DDNVKLYTH------AQYAGEGFWKINQSADDI----PVG-KENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFV 274 (515) Q Consensus 206 ~~~v~v~~~------v~~~~~~~~~~~~e~~~~----~i~-~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~ 274 (515) .+.+++|+. ..-.....+.......+. ... ..-+| ..+|++.++. +.+|+|=.....+-+.. T Consensus 185 ~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~vPvv~~~n-----~~~g~sd~~~v~~liDa 257 (478) T protein:vir:10 185 AERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMSW--GRVPFIPFKN-----NPQEVSDLFMYKTIIDA 257 (478) T ss_pred ceEEEEEeCCeEEEEEEcCCeeeccccccccccccceecccccccC--CccceEEecc-----CCCCCCcHHHHHHHHHH Confidence 011333321 111111111111111111 011 11233 3478877654 46899988889999999 Q ss_pred HHHHHHHHHHHHHHhccCceeecCccccChhhc-cCCC-Ccce-ecCCc-ccccccccCCccchHHHHHHHHHHHHHHHH Q lcl|NC_020414. 275 IQFLSEAVARGAALMADIKYLIRPGSQTDVDHF-VNSG-TGEV-ITGVE-EDIHIVQLGKYADLTPISAVLEVYTRRIGV 350 (515) Q Consensus 275 L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~-~~~~-~g~~-~~g~~-~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~ 350 (515) ++.+.-......+....|.+.+.--+.-+.... .... .+.+ +++.. +++..+. ...+.......++.++..|-. T Consensus 258 ~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~ 335 (478) T protein:vir:10 258 LDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLKYYKAISVAGESGSGVDTIK--VEVPIDSVKEYTKMLRDYIIE 335 (478) T ss_pred HHHHHHHHHHHHHHhhCceeeeecCCccccchhhhhhhhcceEEecCCCCCcceEEe--ecCChHHHHHHHHHHHHHHHH Confidence 998888888888888888655421111111111 0111 1222 33322 2333332 233566666777777766644 Q ss_pred HHH-HHhhccCCCCCCCHHHHHHH-------HHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceee--ee Q lcl|NC_020414. 351 IFM-METMTRRDAERVTAVEIQRD-------ALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIV--TG 420 (515) Q Consensus 351 afl-~~~l~~~~~~~~TAtEi~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v--~~ 420 (515) .-. .+......+...|+..+..+ +.++...++..+.+ ++..++.-.+-......+.+.+- .+ T Consensus 336 ~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~--------~~~li~~~~g~~~~~~~i~i~f~~~~p 407 (478) T protein:vir:10 336 FGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQE--------LLQYIIDFYRLDVKVQDIEITFNFNVM 407 (478) T ss_pred HhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHhCCCcccccceEEecCCCC Confidence 321 11111111234566655332 34444444444443 23222222222222222333331 22 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcC-CchhccCCHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 421 IEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQIS-AELPFLKSEEEMQQEMAQQAQAQQEAMLNEG 499 (515) Q Consensus 421 l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~G-vp~~~irs~eev~~~rq~~~~~~q~~~~~~~ 499 (515) .+.+..|+-+ ..++++ +....+++.+ | ++ -.++|++.+.++..++++. T Consensus 408 ~d~~e~a~~~----------~kl~g~-------iS~et~~~~l----~~v~----D~~~E~~ri~~E~~~~~~~------ 456 (478) T protein:vir:10 408 VNELENSQIA----------MNSTGL-------LSKETILSNH----AWVE----DPVAEMERIEQENIELNQQ------ 456 (478) T ss_pred CCHHHHHHHH----------HHHhCC-------CChHHHHHhC----CCCC----CHHHHHHHHHHHHHHHHhh------ Confidence 2333333221 111221 2233333333 2 11 1346677776654433322 Q ss_pred hhhhccchhhhhhccC Q lcl|NC_020414. 500 VAKAVPGVIQQEMKEG 515 (515) Q Consensus 500 ~~~a~~~~~~~~~~~~ 515 (515) ......+..++...+. T Consensus 457 ~~~~~~~~~~~~~~~~ 472 (478) T protein:vir:10 457 LPDIEEGLNGEQQRQS 472 (478) T ss_pred ccccccccCCCCCCCC Confidence 1111222223333333 No 115 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=94.97 E-value=0.0031 Score=34.38 Aligned_cols=425 Identities=13% Similarity=0.126 Sum_probs=181.5 Q ss_pred CCCccc---c-ccccHHHHHHHHHHHHHh-hhhHHHHHHHHHHhh--cccccC-CCCCC-------ccccccccccHHHH Q lcl|NC_020414. 1 MQDTIL---E-YGGQRSKIPKLWEKFSKK-RSPYLDRAKHFAKLT--LPYLMN-NKGDN-------ETSQNGWQGVGAQA 65 (515) Q Consensus 1 ~~~~~~---~-~~~~~~~l~~r~~~lk~~-R~~~e~~w~e~~~~~--~P~~~~-~~~~~-------~~~~~~~dst~~~a 65 (515) +-++++ . .+.+-..+.+..+.+..+ |-+...+++++|+-- ++.+-. ..++. ....|+-.+-+... T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~I 85 (479) T protein:vir:79 6 ISETDLIKVQLKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINNYHKLL 85 (479) T ss_pred ecccceEeeccccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecchHHHH Confidence 112211 1 111233455555555333 434444444444311 121100 00100 11123445566667 Q ss_pred HHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceE Q lcl|NC_020414. 66 TNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCL 145 (515) Q Consensus 66 ~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~ 145 (515) ++..++-|++- |+ +++.+++. +++.+ ..+..++|.....++.++..++|.++ T Consensus 86 vd~~~~~l~g~--p~-----~~~~~~~~-------------~~~~~--------~~~~~n~~~~~~~~~~~~~~~~G~~~ 137 (479) T protein:vir:79 86 VDQKVGYSVGN--PI-----VFNADDDN-------------LTKLL--------NDLLGEEFDDTITELYLNASNKGVEW 137 (479) T ss_pred HHHHHhhhhcC--Cc-----eeccCCHH-------------HHHHH--------HHHHhcCHHHHHHHHHHHHHhcCeEE Confidence 77766666542 22 22333322 22222 23344789999999999999999976 Q ss_pred E--EEeCCCcE--EEEEcce-EEEee-CCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEE----- Q lcl|NC_020414. 146 L--YKPSKGAM--SAVPMHH-YVVNR-DTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTH----- 214 (515) Q Consensus 146 l--~~d~~~~~--r~~pl~~-y~v~~-d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~----- 214 (515) + |.|.++.+ ++++-.+ |.+-- ...+++...+|.++..-. .++....+++|+. T Consensus 138 ~~v~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~~-----------------~~~~~~~~e~y~~~~i~~ 200 (479) T protein:vir:79 138 LHPYINRKGEFKYVIIPAEEAIPIWDSKRQRELVAFIRFYYIEDI-----------------DGNKIKRVEYYTENDITY 200 (479) T ss_pred EEEEeCCCCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEeec-----------------CCceEEEEEEEeCCcEEE Confidence 5 45655544 4454444 44432 235667766666554210 0001112222221 Q ss_pred EEEcCCCCeE-E---------EEEeCCeee-cccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 215 AQYAGEGFWK-I---------NQSADDIPV-GKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVA 283 (515) Q Consensus 215 v~~~~~~~~~-~---------~~e~~~~~i-~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~ 283 (515) ......++.. . ....+...+ ...-+| ..+|++..+- +.+|+|=.+...+-+..++.+--... T Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~vPvv~~~n-----n~~g~sd~~~v~~liDa~d~~~S~~~ 273 (479) T protein:vir:79 201 FIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGW--GKVPFIPFKN-----NEKCVSDLTFYKSLIDIYDNNISTLA 273 (479) T ss_pred EEecCCcccccccccccccccccccccccccccccCC--CcccEEEecC-----CCCCCcchhhhHHHHHHHHHHHHHHH Confidence 0111111110 0 000111111 112234 3478887654 46799988889999999998888888 Q ss_pred HHHHHhccCceeecCccccChhh-ccCCCCcceec-CCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCC Q lcl|NC_020414. 284 RGAALMADIKYLIRPGSQTDVDH-FVNSGTGEVIT-GVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRD 361 (515) Q Consensus 284 ~~~~~a~~p~~l~~~~g~~~~~~-~~~~~~g~~~~-g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~ 361 (515) ...+...+|.+.+.......... ......+.++. ...+++..+. ...+.......++.++..|...-..-.+.... T Consensus 274 ~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~ 351 (479) T protein:vir:79 274 DNLDEIQEVIYVLKEYPGTSLQEFIDNIRYYKSIKVDGGGGVDKLE--INIPVEAKKELLDRLEKNIIIFGQGVNPESQN 351 (479) T ss_pred HHHHHhhCceeeeecCCccccccchhhhhhccceecCCCCcceEEe--ccCCHHHHHHHHHHHHHHHHHHhCcccccccc Confidence 88888888876553211111111 11111122222 2223444443 33466777888888887775543211112222 Q ss_pred CCCCCHHHHHH-------HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhc-CCCCChhhccceeeee--hHHHHHHHHHH Q lcl|NC_020414. 362 AERVTAVEIQR-------DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEA-GDSFTSELVDPVIVTG--IEALGRMAELD 431 (515) Q Consensus 362 ~~~~TAtEi~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~-~~~~p~~~~~~~~v~~--l~~l~ra~~~~ 431 (515) ....|++.+.. .+.+++..++..+.++..- +..++... ........+++.+-.. .+-+..+ + T Consensus 352 ~gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~l-----i~~~~~~~~~~~~~~~~i~i~f~~~~p~~~~~~a---~ 423 (479) T protein:vir:79 352 TGDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWF-----VCEYLKISGNKSYDYKTVQITFNHSMIINEAEKI---D 423 (479) T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHhccCCCccccccceEEeCCCCCcCHHHHH---H Confidence 23356666644 3444555544444443221 11111111 1122222233333222 2222222 2 Q ss_pred HHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhh Q lcl|NC_020414. 432 KLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQE 511 (515) Q Consensus 432 ~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~ 511 (515) .+..+ ++ .+....++..+ -+++ -.++|++.+.+++.++.+.... . ++...+. T Consensus 424 ~~~kl-------~g-------~iS~et~l~~l---~~v~----d~~~E~~ri~~E~~~~~~~~~~---~----~~~~~~~ 475 (479) T protein:vir:79 424 MAAKS-------TG-------IVSDETIVSNH---PWVE----DVNDELERLKKQEDTQKEYDDL---I----PNNQDGV 475 (479) T ss_pred HHHHH-------hc-------cCcHHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHhc---c----CcccCCC Confidence 22211 11 12233333322 1121 1356666665544433322221 1 1111222 Q ss_pred hccC Q lcl|NC_020414. 512 MKEG 515 (515) Q Consensus 512 ~~~~ 515 (515) +-|. T Consensus 476 ~~e~ 479 (479) T protein:vir:79 476 IDET 479 (479) T ss_pred cCcC Confidence 2222 No 116 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=94.66 E-value=0.0038 Score=33.85 Aligned_cols=418 Identities=12% Similarity=0.060 Sum_probs=151.6 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc-----ccCCCCC--CccccccccccHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY-----LMNNKGD--NETSQNGWQGVGAQATNHLANKL 73 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~~~~--~~~~~~~~dst~~~a~~~Laa~l 73 (515) -+.|. -+.+.....++.. ..++.+.+.+|..-. +...... +....++..+-+..+++.+++.| T Consensus 2 ~~~t~------~~~~~~l~~~~~~----~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l 71 (456) T protein:vir:10 2 TASTP------AEWLPVLTKRIDD----GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI 71 (456) T ss_pred CCCCH------HHHHHHHHHHHHH----HHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhh Confidence 22232 1223333333322 234444454444321 1000000 01112234455556666666554 Q ss_pred HHhhcCCCCCceecCCC-hHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eC Q lcl|NC_020414. 74 AQVLFPAQRSFFRVDLT-AKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK--PS 150 (515) Q Consensus 74 ~s~ltpp~~~WFrl~~~-d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~--d~ 150 (515) ++ .++ ++... |... ... +.+.+.+++|.....++.++..++|.+.+++ +. T Consensus 72 ~~------~~~-~~~~~~d~~~---------~~~-----------~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~ 124 (456) T protein:vir:10 72 IP------NGI-TVGGSADSDL---------ALR-----------ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD 124 (456) T ss_pred cc------CCe-ecCCCCCcch---------HHH-----------HHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCC Confidence 32 122 22211 1110 011 2233556889999999999999999986544 44 Q ss_pred CC--cEEEEEcceEEEeeC-CCCC-eeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEE Q lcl|NC_020414. 151 KG--AMSAVPMHHYVVNRD-TNGD-LMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKIN 226 (515) Q Consensus 151 ~~--~~r~~pl~~y~v~~d-~~G~-vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~ 226 (515) +. .+++++..+.++..| ..++ +...+|.++. . ...+ .......++...+.|..+........... T Consensus 125 ~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~-~----d~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 193 (456) T protein:vir:10 125 DGTATITADSPETMVVSVDPLQPWRIRAAMRWWRD-L----DAES------DFAIVWSGDGWQKFARPCFVQSSSRRRLV 193 (456) T ss_pred CCceEEEEEccceeEEEEcCCCCcceEEEEEEEEe-c----CCce------eEEEEEeccceeEEEEEEEEeecccceee Confidence 43 356665555444444 3433 3333333331 0 0000 00001111222222222211111001111 Q ss_pred EEeCCee-ecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceee--------- Q lcl|NC_020414. 227 QSADDIP-VGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLI--------- 296 (515) Q Consensus 227 ~e~~~~~-i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~--------- 296 (515) ...++.. ...+..+....+|++.. .+..|.|-.+..++-+-.++...-..+..++..+.|-..+ T Consensus 194 ~~~~~~~~~~~~~~~~~~~~pvv~~------~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~ 267 (456) T protein:vir:10 194 TRISDSWVPVGDAVVTGSPPPVVVY------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPN 267 (456) T ss_pred eecCCceeeccccCCCCCceeEEEe------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccc Confidence 1112221 11111111123555432 2356889888888888888866655555555554432111 Q ss_pred -cCccc-cChhhccCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHH-H-HHhhccCCCCCCCHHHHH- Q lcl|NC_020414. 297 -RPGSQ-TDVDHFVNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIF-M-METMTRRDAERVTAVEIQ- 371 (515) Q Consensus 297 -~~~g~-~~~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~af-l-~~~l~~~~~~~~TAtEi~- 371 (515) +.+|. .++........|.+.... ++....++. .++++.....++.+...|...= + ...+.. +....|+.-+. T Consensus 268 ~d~~g~~~~~~~~~~~~~~~~~~~~-~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~-~~~N~Sg~Ai~~ 344 (456) T protein:vir:10 268 VDENGNAIDYASIFEAAPGALWELP-PGVDIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMP-DSANQSAEGAHN 344 (456) T ss_pred ccccccccchhhhhhhhccccccCC-CCcceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcc-cccChHHHHHHH Confidence 11111 111111122233332221 222222332 2445544444444444432110 0 000110 11223555442 Q ss_pred ------HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee--eeehHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020414. 372 ------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI--VTGIEALGRMAELDKLANFAQYMSLP 443 (515) Q Consensus 372 ------~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~--v~~l~~l~ra~~~~~l~~~~~~v~~~ 443 (515) .+++++.+.+|+-+.++.. .++.-.+ ......+++.+ ..+-+.++.|+-+.++. + T Consensus 345 ~~~~l~~k~~~~~~~f~~~l~~~~r--------l~~~~~g-~~~~~~~~v~w~~~~~~~~~~~ada~~kl~---~----- 407 (456) T protein:vir:10 345 IEKGFLFKCEDRLSIAKIGLEAILV--------KALQIEG-ESVEDTVDVSFESPDRVTLGEKYSAASLAK---A----- 407 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHhcC-CCcccceeEEecCCCCcCHHHHHHHHHHHH---H----- Confidence 2335555555555544332 1111111 11112233322 12223233332222221 1 Q ss_pred hcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhh Q lcl|NC_020414. 444 QTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQE 511 (515) Q Consensus 444 a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~ 511 (515) ++++.. ... ...+|+. +++++++..+|...++.++....+. .+...|-. T Consensus 408 ~gi~~~--------~~~---~~~lg~~------~~~i~~~e~er~~~e~~~~~~~~~~--~~~~~~~~ 456 (456) T protein:vir:10 408 AGESWA--------SIR---RNILNYN------ADQIKQDDLDRAREQITLFAGNPVQ--RPQEDGSR 456 (456) T ss_pred cCCChH--------HHH---HhhCCCC------HHHHHHHHHHHHHHHHHHHhhhhhh--cCCCCCCC Confidence 122221 111 2244553 3444333333322222221111111 11222222 No 117 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=94.66 E-value=0.0038 Score=33.85 Aligned_cols=418 Identities=12% Similarity=0.060 Sum_probs=151.6 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc-----ccCCCCC--CccccccccccHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY-----LMNNKGD--NETSQNGWQGVGAQATNHLANKL 73 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~~~~~--~~~~~~~~dst~~~a~~~Laa~l 73 (515) -+.|. -+.+.....++.. ..++.+.+.+|..-. +...... +....++..+-+..+++.+++.| T Consensus 2 ~~~t~------~~~~~~l~~~~~~----~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l 71 (456) T protein:vir:10 2 TASTP------AEWLPVLTKRIDD----GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI 71 (456) T ss_pred CCCCH------HHHHHHHHHHHHH----HHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhh Confidence 22232 1223333333322 234444454444321 1000000 01112234455556666666554 Q ss_pred HHhhcCCCCCceecCCC-hHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE--eC Q lcl|NC_020414. 74 AQVLFPAQRSFFRVDLT-AKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK--PS 150 (515) Q Consensus 74 ~s~ltpp~~~WFrl~~~-d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~--d~ 150 (515) ++ .++ ++... |... ... +.+.+.+++|.....++.++..++|.+.+++ +. T Consensus 72 ~~------~~~-~~~~~~d~~~---------~~~-----------~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~ 124 (456) T protein:vir:10 72 IP------NGI-TVGGSADSDL---------ALR-----------ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD 124 (456) T ss_pred cc------CCe-ecCCCCCcch---------HHH-----------HHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCC Confidence 32 122 22211 1110 011 2233556889999999999999999986544 44 Q ss_pred CC--cEEEEEcceEEEeeC-CCCC-eeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEE Q lcl|NC_020414. 151 KG--AMSAVPMHHYVVNRD-TNGD-LMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKIN 226 (515) Q Consensus 151 ~~--~~r~~pl~~y~v~~d-~~G~-vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~ 226 (515) +. .+++++..+.++..| ..++ +...+|.++. . ...+ .......++...+.|..+........... T Consensus 125 ~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~-~----d~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 193 (456) T protein:vir:10 125 DGTATITADSPETMVVSVDPLQPWRIRAAMRWWRD-L----DAES------DFAIVWSGDGWQKFARPCFVQSSSRRRLV 193 (456) T ss_pred CCceEEEEEccceeEEEEcCCCCcceEEEEEEEEe-c----CCce------eEEEEEeccceeEEEEEEEEeecccceee Confidence 43 356665555444444 3433 3333333331 0 0000 00001111222222222211111001111 Q ss_pred EEeCCee-ecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceee--------- Q lcl|NC_020414. 227 QSADDIP-VGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLI--------- 296 (515) Q Consensus 227 ~e~~~~~-i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~--------- 296 (515) ...++.. ...+..+....+|++.. .+..|.|-.+..++-+-.++...-..+..++..+.|-..+ T Consensus 194 ~~~~~~~~~~~~~~~~~~~~pvv~~------~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~ 267 (456) T protein:vir:10 194 TRISDSWVPVGDAVVTGSPPPVVVY------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPN 267 (456) T ss_pred eecCCceeeccccCCCCCceeEEEe------cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccc Confidence 1112221 11111111123555432 2356889888888888888866655555555554432111 Q ss_pred -cCccc-cChhhccCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHH-H-HHhhccCCCCCCCHHHHH- Q lcl|NC_020414. 297 -RPGSQ-TDVDHFVNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIF-M-METMTRRDAERVTAVEIQ- 371 (515) Q Consensus 297 -~~~g~-~~~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~af-l-~~~l~~~~~~~~TAtEi~- 371 (515) +.+|. .++........|.+.... ++....++. .++++.....++.+...|...= + ...+.. +....|+.-+. T Consensus 268 ~d~~g~~~~~~~~~~~~~~~~~~~~-~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~-~~~N~Sg~Ai~~ 344 (456) T protein:vir:10 268 VDENGNAIDYASIFEAAPGALWELP-PGVDIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMP-DSANQSAEGAHN 344 (456) T ss_pred ccccccccchhhhhhhhccccccCC-CCcceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcc-cccChHHHHHHH Confidence 11111 111111122233332221 222222332 2445544444444444432110 0 000110 11223555442 Q ss_pred ------HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee--eeehHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020414. 372 ------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI--VTGIEALGRMAELDKLANFAQYMSLP 443 (515) Q Consensus 372 ------~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~--v~~l~~l~ra~~~~~l~~~~~~v~~~ 443 (515) .+++++.+.+|+-+.++.. .++.-.+ ......+++.+ ..+-+.++.|+-+.++. + T Consensus 345 ~~~~l~~k~~~~~~~f~~~l~~~~r--------l~~~~~g-~~~~~~~~v~w~~~~~~~~~~~ada~~kl~---~----- 407 (456) T protein:vir:10 345 IEKGFLFKCEDRLSIAKIGLEAILV--------KALQIEG-ESVEDTVDVSFESPDRVTLGEKYSAASLAK---A----- 407 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHhcC-CCcccceeEEecCCCCcCHHHHHHHHHHHH---H----- Confidence 2335555555555544332 1111111 11112233322 12223233332222221 1 Q ss_pred hcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhh Q lcl|NC_020414. 444 QTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQE 511 (515) Q Consensus 444 a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~ 511 (515) ++++.. ... ...+|+. +++++++..+|...++.++....+. .+...|-. T Consensus 408 ~gi~~~--------~~~---~~~lg~~------~~~i~~~e~er~~~e~~~~~~~~~~--~~~~~~~~ 456 (456) T protein:vir:10 408 AGESWA--------SIR---RNILNYN------ADQIKQDDLDRAREQITLFAGNPVQ--RPQEDGSR 456 (456) T ss_pred cCCChH--------HHH---HhhCCCC------HHHHHHHHHHHHHHHHHHHhhhhhh--cCCCCCCC Confidence 122221 111 2244553 3444333333322222221111111 11222222 No 118 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=94.58 E-value=0.004 Score=33.72 Aligned_cols=421 Identities=9% Similarity=0.061 Sum_probs=187.1 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccccc--CCCC-----CCcccccccc-----ccHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLM--NNKG-----DNETSQNGWQ-----GVGAQATNH 68 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~--~~~~-----~~~~~~~~~d-----st~~~a~~~ 68 (515) ....+.+.+++... -.|.. ..++|+-|.+.+--... -... .....+..|. ++=.-+.+. T Consensus 2 ~~~~~~~~~V~~~h--p~y~a-------~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~ 72 (489) T protein:vir:78 2 LTENGQGSGVKTKH--REWLH-------YAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRR 72 (489) T ss_pred ccCCCccCCCCccC--HHHHH-------HHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCChHHH Confidence 22223344333322 22332 23456666655433110 0000 0000011111 000112333 Q ss_pred HHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEE Q lcl|NC_020414. 69 LANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK 148 (515) Q Consensus 69 Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~ 148 (515) ....|++.+|- ..|++.+ ++ .++.+++.| -....+.+.-+...+.+...+|-+.+++ T Consensus 73 tl~~l~G~vfr-k~p~~~~--p~--------------~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~~ilV 129 (489) T protein:vir:78 73 TLSGMVGSVMR-KEPEINI--PK--------------ELEYLLKNA------DGSGVGLIQHAQDTLMEIDSVGRGGLLV 129 (489) T ss_pred HHHHHhchhhc-CCcceec--cH--------------HHHHHHhcc------CCCCCCHHHHHHHHHHHHHhcCeEEEEE Confidence 44445555554 4456532 21 234455544 2446678888899999999999999999 Q ss_pred eCCCc----------------EEEEEcceE---EEee-CCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCccc Q lcl|NC_020414. 149 PSKGA----------------MSAVPMHHY---VVNR-DTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDN 208 (515) Q Consensus 149 d~~~~----------------~r~~pl~~y---~v~~-d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~ 208 (515) |.... +..|+-.+. -..+ |..+++.-+..+++...++=...|+ .+. T Consensus 130 D~P~~~~~T~ade~~~~~rPy~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~--------------~~~ 195 (489) T protein:vir:78 130 DAPETGAATAAEQNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFE--------------TKY 195 (489) T ss_pred eeCCCCCcCHHHHHHhcCCcEEEEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCcc--------------cee Confidence 85311 334443332 2222 3344555555555444332222232 355 Q ss_pred EEEEEEEEEcCCCCeEE--EEE-eCCe------eec-ccCCcccccCcEEEEeeeecCCCcccc--chHHHHHHHHHHHH Q lcl|NC_020414. 209 VKLYTHAQYAGEGFWKI--NQS-ADDI------PVG-KENRIKAEKLPFIPLTWKRSYGEDWGR--PLVEDYSGDLFVIQ 276 (515) Q Consensus 209 v~v~~~v~~~~~~~~~~--~~e-~~~~------~i~-~esgy~~~~~P~~~~Rw~~~~g~~YGr--gp~~~~l~d~k~L~ 276 (515) ++.|....+..++.+.+ |.+ .+|. .+. .+|+ +.+++|++.|....+..+.. .|.. |+..|| T Consensus 196 ~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~~~~~~~~~~g~---~~l~~IPfv~~~~~~~~~~~~~pPLl----~LA~ln 268 (489) T protein:vir:78 196 GEQYRVLDIDSDGNYRQRLFRFDAEGGAQEDVVEIYPDLGE---SLRGVIPFTFIGATNNDATIDDAPLL----PLAELN 268 (489) T ss_pred EEEEEEEecCCCcceEEEEEEeecCCcccceeeEEeccCCC---CccCeeeEEEEecCCCCCCCCcCchH----HHHHHH Confidence 67777777776665532 322 2222 122 3444 35788999988777666654 3533 555554 Q ss_pred HH---HHH-HHHHHHHhccCceeec-CccccChhhccCCCCcceecCC--------cccccccccCCccchHHHHHHHHH Q lcl|NC_020414. 277 FL---SEA-VARGAALMADIKYLIR-PGSQTDVDHFVNSGTGEVITGV--------EEDIHIVQLGKYADLTPISAVLEV 343 (515) Q Consensus 277 ~l---~~~-~~~~~~~a~~p~~l~~-~~g~~~~~~~~~~~~g~~~~g~--------~~~v~~~~~~~~~~l~~~~~~i~~ 343 (515) .- ..+ .-..+..+..|.+.+. .+. .+...+..+....++-|. .++.+.++.. .. ..+.+.|.+ T Consensus 269 i~Hy~~ssd~~~~l~~~~~P~l~i~G~d~-~~~~~~~~~~~~~i~~g~~~~~~lp~~~~~~~ie~~-~~--~~~r~~l~~ 344 (489) T protein:vir:78 269 IGHYRNSADNEESSFVVGQPTLFIYPGEN-LTPQAFKEANPNGIKFGSRRGHNLGYGGSAQLIQAG-EN--NLARQNMLD 344 (489) T ss_pred HHHhhhhhHHHHHHHHcccceeeeecCcc-CCcccccccCccceeeCCcccccCCCCCCcceeccC-cc--hHHHHHHHH Confidence 32 222 3333344444533332 111 111111111122222222 2223333332 12 234666777 Q ss_pred HHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCCh-hh--ccceee-e Q lcl|NC_020414. 344 YTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTS-EL--VDPVIV-T 419 (515) Q Consensus 344 ~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~-~~--~~~~~v-~ 419 (515) ++.+..++= ..++ .. +...||++...+...--..|..+...+..-+-.-|-..+.. .+...+. .. ++..+. . T Consensus 345 le~qm~~lG-a~l~-~~-~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w-~G~~~~~~~~i~~n~dF~~~ 420 (489) T protein:vir:78 345 KEQQAIQIG-AQLI-TP-TQQITAQSARIQRGADTSVMATIARNVSQAYTDALRWVAVM-LGKPEDTEVEFRLNMDFFLE 420 (489) T ss_pred HHHHHHHHh-hhhc-cC-CcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH-cCCCCCCceEEEeecccCcc Confidence 777665531 1122 22 33689999999999999999988888877665544333333 2222221 11 233332 2 Q ss_pred ehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 420 GIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEG 499 (515) Q Consensus 420 ~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~ 499 (515) .+. .+.+..++..+. ...|..+.+.+.+ ...||.. .+.++++...+. ++ T Consensus 421 ~~d-------~~~~~al~~~~~---------~G~is~~t~~~~L-~~~gv~d---~~~e~~~~ei~~-----------~~ 469 (489) T protein:vir:78 421 PMT-------AQDRAAWMADIN---------AGLLPATAYYAAL-RKAGVTD---WTDADIKDAVAD-----------QP 469 (489) T ss_pred cCC-------HHHHHHHHHHHh---------cCCCCHHHHHHHH-HhCCCCC---ccHHHHHHHHhh-----------cC Confidence 222 122222222111 1234344444444 3345532 233333322221 11 Q ss_pred hhhhccchhhhhhccC Q lcl|NC_020414. 500 VAKAVPGVIQQEMKEG 515 (515) Q Consensus 500 ~~~a~~~~~~~~~~~~ 515 (515) .+.+.-++++..++ T Consensus 470 --~~~~~~~~g~~~~~ 483 (489) T protein:vir:78 470 --LPVATEVQGEIPQS 483 (489) T ss_pred --CCcccCCcccCCCC Confidence 11222233333333 No 119 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=94.29 E-value=0.0048 Score=33.28 Aligned_cols=423 Identities=12% Similarity=0.138 Sum_probs=167.3 Q ss_pred CCCccccccccHHHHHHHHH----HHHHhhhhHHHHHHHHHHhhcccccCCCCCCcccccccc--ccHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWE----KFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQ--GVGAQATNHLANKLA 74 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~----~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~d--st~~~a~~~Laa~l~ 74 (515) +-.....+ -.+.+.+.-+ .+..++......|+.+|+=-.|.+.-....+....+... +.+...++.+|+-+. T Consensus 11 ~~~~~~~~--~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~A~lv~ 88 (500) T protein:vir:98 11 VTRSKYVM--TTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAAKKIASLVF 88 (500) T ss_pred HHHHHHHh--hcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHHHHHhhhhc Confidence 00000000 0001111000 012223333445555554222221111111111112222 445556666665443 Q ss_pred HhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC Q lcl|NC_020414. 75 QVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSKG 152 (515) Q Consensus 75 s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~~ 152 (515) +-. | .++++++ ...++|. ..+..++|+..+.+++.+..+.|.+++ |.|.+. T Consensus 89 ~e~--~-----~i~~~d~-------------~~~~~l~-------~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~ 141 (500) T protein:vir:98 89 NEQ--A-----EIKVDDD-------------AANEFIS-------ETLKNDRFNKNFERYLESCLALGGLAMRPYVDGDK 141 (500) T ss_pred CCc--c-----eEecCCh-------------HHHHHHH-------HHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCc Confidence 322 1 1233332 2334443 446778999999999999999998875 666544 Q ss_pred -cEEEEEcceEEE-eeCCCCCeeEEEEEE-EecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEc----CCCC--- Q lcl|NC_020414. 153 -AMSAVPMHHYVV-NRDTNGDLMDVILLQ-EKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYA----GEGF--- 222 (515) Q Consensus 153 -~~r~~pl~~y~v-~~d~~G~vd~i~r~~-~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~----~~~~--- 222 (515) .+.+++...++- .-|..|.+..+|... ..+.. .+. .+|+.++.. .+++ T Consensus 142 ~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~----------------~~~------~~yt~lE~h~~~~~~~~~I~ 199 (500) T protein:vir:98 142 VRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTIN----------------GKE------VYYTLIEFHEWQSSDDYVIS 199 (500) T ss_pred eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeec----------------CCc------eEEEEEEEEEEeCCceeEEE Confidence 266777777554 556666555544221 11110 000 133333321 1111 Q ss_pred eEEEEEeC----Ceee-c--------cc---CCcccccCc-EEEEe----eeecCCCccccchHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 223 WKINQSAD----DIPV-G--------KE---NRIKAEKLP-FIPLT----WKRSYGEDWGRPLVEDYSGDLFVIQFLSEA 281 (515) Q Consensus 223 ~~~~~e~~----~~~i-~--------~e---sgy~~~~~P-~~~~R----w~~~~g~~YGrgp~~~~l~d~k~L~~l~~~ 281 (515) ..+|..-+ |..+ + .+ .|+ ..| |..++ =+...++.||.|-...+.+-+..|+..--+ T Consensus 200 n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~---~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~ 276 (500) T protein:vir:98 200 NELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDV---TRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDE 276 (500) T ss_pred EEEEecccccccCcccccccccCCcCcceEeccC---CCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHH Confidence 11222111 1111 0 11 222 123 22221 233447789999999999999999988877 Q ss_pred HHHHHHHhccCceeecCccccChhhccCCCCccee---------------cCCcccccccccCCccch--HHHHHHHHHH Q lcl|NC_020414. 282 VARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVI---------------TGVEEDIHIVQLGKYADL--TPISAVLEVY 344 (515) Q Consensus 282 ~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~---------------~g~~~~v~~~~~~~~~~l--~~~~~~i~~~ 344 (515) .....+. .+....++ +.++.... .+..|... .+..++-..++. -..++ ......++.+ T Consensus 277 ~~~e~~~-g~~~i~v~-~~~l~~~~--~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~-~~~~ir~e~~~~~l~~~ 351 (500) T protein:vir:98 277 FMWEVKM-GQRRVAVP-ESLTALTV--RTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQD-LTTPIRADDYIKAINEG 351 (500) T ss_pred HHHHHHh-Ccceeeec-hHHhcccC--CCCCccccCCcccCCCcceEEEcCCCCCcCcceeE-eccccChHHHHHHHHHH Confidence 7766544 55554553 33332221 00111111 111111000110 01122 1122333333 Q ss_pred HHHHHHHHHH--HhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH-----hcCC-CCC-hhhccc Q lcl|NC_020414. 345 TRRIGVIFMM--ETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ-----EAGD-SFT-SELVDP 415 (515) Q Consensus 345 ~~rI~~afl~--~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~-----~~~~-~~p-~~~~~~ 415 (515) -+.|....=+ .++........|||||..+.+...+...-.-..+ ..-+.-|++-++. .... .+| ...+.+ T Consensus 352 l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~-~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v 430 (500) T protein:vir:98 352 LSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIVALV-EQSLKELVISIFEIAKAYDLYQSEVPSMDNISI 430 (500) T ss_pred HHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEE Confidence 3333222101 1122222234699999888877777665533222 3334444433321 1111 111 112332 Q ss_pred eeeeeh--HHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 416 VIVTGI--EALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQE 493 (515) Q Consensus 416 ~~v~~l--~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~ 493 (515) .+=.++ +.-+.+ +.+. +.++ +++ +....++ .+..|+ |++|++++.++.++.+ . T Consensus 431 ~f~d~i~~d~~~~~---~~~~---~~v~--aGi-------~s~~~~i---~~~~g~------~eeea~~~l~~i~~E~-~ 485 (500) T protein:vir:98 431 SLDDGVFTDRDAEL---DYWI---KVVN--AGF-------GTREMAI---QKVLNV------TEEKAQEIAAEINTGI-V 485 (500) T ss_pred EeCCCCCCCHHHHH---HHHH---HHHH--cCC-------CCHHHHH---HhcCCC------CHHHHHHHHHHHHHhc-c Confidence 221111 111111 1111 1111 111 2222222 334454 5667666654432211 1 Q ss_pred HHHHHHhhhhccchhhhh Q lcl|NC_020414. 494 AMLNEGVAKAVPGVIQQE 511 (515) Q Consensus 494 ~~~~~~~~~a~~~~~~~~ 511 (515) .+.-..-....+.|+ T Consensus 486 ---~~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 486 ---DEINQQRTDTHLYGE 500 (500) T ss_pred ---ccCCCCCccccccCC Confidence 111011111222222 No 120 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=94.29 E-value=0.0048 Score=33.28 Aligned_cols=423 Identities=12% Similarity=0.138 Sum_probs=167.3 Q ss_pred CCCccccccccHHHHHHHHH----HHHHhhhhHHHHHHHHHHhhcccccCCCCCCcccccccc--ccHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWE----KFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQ--GVGAQATNHLANKLA 74 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~----~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~d--st~~~a~~~Laa~l~ 74 (515) +-.....+ -.+.+.+.-+ .+..++......|+.+|+=-.|.+.-....+....+... +.+...++.+|+-+. T Consensus 11 ~~~~~~~~--~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~A~lv~ 88 (500) T protein:vir:30 11 VTRSKYVM--TTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAAKKIASLVF 88 (500) T ss_pred HHHHHHHh--hcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHHHHHhhhhc Confidence 00000000 0001111000 012223333445555554222221111111111112222 445556666665443 Q ss_pred HhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCCC Q lcl|NC_020414. 75 QVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSKG 152 (515) Q Consensus 75 s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~~ 152 (515) +-. | .++++++ ...++|. ..+..++|+..+.+++.+..+.|.+++ |.|.+. T Consensus 89 ~e~--~-----~i~~~d~-------------~~~~~l~-------~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~ 141 (500) T protein:vir:30 89 NEQ--A-----EIKVDDD-------------AANEFIS-------ETLKNDRFNKNFERYLESCLALGGLAMRPYVDGDK 141 (500) T ss_pred CCc--c-----eEecCCh-------------HHHHHHH-------HHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCc Confidence 322 1 1233332 2334443 446778999999999999999998875 666544 Q ss_pred -cEEEEEcceEEE-eeCCCCCeeEEEEEE-EecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEc----CCCC--- Q lcl|NC_020414. 153 -AMSAVPMHHYVV-NRDTNGDLMDVILLQ-EKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYA----GEGF--- 222 (515) Q Consensus 153 -~~r~~pl~~y~v-~~d~~G~vd~i~r~~-~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~----~~~~--- 222 (515) .+.+++...++- .-|..|.+..+|... ..+.. .+. .+|+.++.. .+++ T Consensus 142 ~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~----------------~~~------~~yt~lE~h~~~~~~~~~I~ 199 (500) T protein:vir:30 142 VRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTIN----------------GKE------VYYTLIEFHEWQSSDDYVIS 199 (500) T ss_pred eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeec----------------CCc------eEEEEEEEEEEeCCceeEEE Confidence 266777777554 556666555544221 11110 000 133333321 1111 Q ss_pred eEEEEEeC----Ceee-c--------cc---CCcccccCc-EEEEe----eeecCCCccccchHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 223 WKINQSAD----DIPV-G--------KE---NRIKAEKLP-FIPLT----WKRSYGEDWGRPLVEDYSGDLFVIQFLSEA 281 (515) Q Consensus 223 ~~~~~e~~----~~~i-~--------~e---sgy~~~~~P-~~~~R----w~~~~g~~YGrgp~~~~l~d~k~L~~l~~~ 281 (515) ..+|..-+ |..+ + .+ .|+ ..| |..++ =+...++.||.|-...+.+-+..|+..--+ T Consensus 200 n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~---~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~ 276 (500) T protein:vir:30 200 NELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDV---TRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDE 276 (500) T ss_pred EEEEecccccccCcccccccccCCcCcceEeccC---CCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHH Confidence 11222111 1111 0 11 222 123 22221 233447789999999999999999988877 Q ss_pred HHHHHHHhccCceeecCccccChhhccCCCCccee---------------cCCcccccccccCCccch--HHHHHHHHHH Q lcl|NC_020414. 282 VARGAALMADIKYLIRPGSQTDVDHFVNSGTGEVI---------------TGVEEDIHIVQLGKYADL--TPISAVLEVY 344 (515) Q Consensus 282 ~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~---------------~g~~~~v~~~~~~~~~~l--~~~~~~i~~~ 344 (515) .....+. .+....++ +.++.... .+..|... .+..++-..++. -..++ ......++.+ T Consensus 277 ~~~e~~~-g~~~i~v~-~~~l~~~~--~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~-~~~~ir~e~~~~~l~~~ 351 (500) T protein:vir:30 277 FMWEVKM-GQRRVAVP-ESLTALTV--RTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQD-LTTPIRADDYIKAINEG 351 (500) T ss_pred HHHHHHh-Ccceeeec-hHHhcccC--CCCCccccCCcccCCCcceEEEcCCCCCcCcceeE-eccccChHHHHHHHHHH Confidence 7766544 55554553 33332221 00111111 111111000110 01122 1122333333 Q ss_pred HHHHHHHHHH--HhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH-----hcCC-CCC-hhhccc Q lcl|NC_020414. 345 TRRIGVIFMM--ETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ-----EAGD-SFT-SELVDP 415 (515) Q Consensus 345 ~~rI~~afl~--~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~-----~~~~-~~p-~~~~~~ 415 (515) -+.|....=+ .++........|||||..+.+...+...-.-..+ ..-+.-|++-++. .... .+| ...+.+ T Consensus 352 l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~-~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v 430 (500) T protein:vir:30 352 LSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIVALV-EQSLKELVISIFEIAKAYDLYQSEVPSMDNISI 430 (500) T ss_pred HHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEE Confidence 3333222101 1122222234699999888877777665533222 3334444433321 1111 111 112332 Q ss_pred eeeeeh--HHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 416 VIVTGI--EALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQE 493 (515) Q Consensus 416 ~~v~~l--~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~ 493 (515) .+=.++ +.-+.+ +.+. +.++ +++ +....++ .+..|+ |++|++++.++.++.+ . T Consensus 431 ~f~d~i~~d~~~~~---~~~~---~~v~--aGi-------~s~~~~i---~~~~g~------~eeea~~~l~~i~~E~-~ 485 (500) T protein:vir:30 431 SLDDGVFTDRDAEL---DYWI---KVVN--AGF-------GTREMAI---QKVLNV------TEEKAQEIAAEINTGI-V 485 (500) T ss_pred EeCCCCCCCHHHHH---HHHH---HHHH--cCC-------CCHHHHH---HhcCCC------CHHHHHHHHHHHHHhc-c Confidence 221111 111111 1111 1111 111 2222222 334454 5667666654432211 1 Q ss_pred HHHHHHhhhhccchhhhh Q lcl|NC_020414. 494 AMLNEGVAKAVPGVIQQE 511 (515) Q Consensus 494 ~~~~~~~~~a~~~~~~~~ 511 (515) .+.-..-....+.|+ T Consensus 486 ---~~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 486 ---DEINQQRTDTHLYGE 500 (500) T ss_pred ---ccCCCCCccccccCC Confidence 111011111222222 No 121 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=94.22 E-value=0.005 Score=33.19 Aligned_cols=356 Identities=9% Similarity=0.061 Sum_probs=139.4 Q ss_pred ccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCc--cccccccccHH-HHHHHHHHHHHHhhcCCCC Q lcl|NC_020414. 6 LEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNE--TSQNGWQGVGA-QATNHLANKLAQVLFPAQR 82 (515) Q Consensus 6 ~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~--~~~~~~dst~~-~a~~~Laa~l~s~ltpp~~ 82 (515) |.+ |+.+...+..-...-..+..+..|..+.....+. ...+.....+. .|++.+|+.+ +.+ T Consensus 1 Mgl----------f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~V~~~i~~Ia~~i-a~l----- 64 (384) T protein:vir:49 1 MPI----------FNITNLATESPPSNQDSFFDITDPEFLDALNGSEWVSAETALKNSDLFSIISQLSNDL-ATA----- 64 (384) T ss_pred Ccc----------ccccccCcccccccchhhccccchhhcccccCCceechhhhhccHHHHHHHHHHHHHH-hhC----- Confidence 443 2222222211111111222333443332221111 11122333334 3444444433 333 Q ss_pred CceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhc----CCHHHHHHHHHHHHhhCceEEEEeCCC---cEE Q lcl|NC_020414. 83 SFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQR----QFRPAIVEVFKHLIVAGNCLLYKPSKG---AMS 155 (515) Q Consensus 83 ~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~G~~~l~~d~~~---~~r 155 (515) ||- +. +.... . .+.+- +.+.=....+.++...||+.+++..+. ... T Consensus 65 ~~~-~~--~~~~~-------------~-----------l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~ 117 (384) T protein:vir:49 65 KIT-TS--RKQLQ-------------G-----------IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMK 117 (384) T ss_pred cee-ee--cchhh-------------h-----------hhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEE Confidence 331 11 11100 0 11122 234445566677888999998875432 234 Q ss_pred EEEc--ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCee Q lcl|NC_020414. 156 AVPM--HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIP 233 (515) Q Consensus 156 ~~pl--~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~ 233 (515) .+|| ..+-+..+.++.. ++.. +..++.. T Consensus 118 L~~l~~~~v~v~~~~~~~~--------------------------------------~~y~------------~~~~~~~ 147 (384) T protein:vir:49 118 WEYLRPSQVSFNRLDNQNG--------------------------------------LYYN------------ITFDDPR 147 (384) T ss_pred EEEEcCceeEEEEcCCCce--------------------------------------EEEE------------EEecCcc Confidence 4555 3333333322210 1111 1111111 Q ss_pred ecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhcc----- Q lcl|NC_020414. 234 VGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFV----- 308 (515) Q Consensus 234 i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~----- 308 (515) ......|+.++ +++.|+....+..||.||..-+...+.......+.......-...|..++.-++....+... T Consensus 148 ~~~~~~~~~~e--Vih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~ 225 (384) T protein:vir:49 148 IPPKQHVPQGD--ILHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQSRS 225 (384) T ss_pred ccceeEecCcc--EEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHH Confidence 11111111122 56677766778899999999999999999988888888888888887766544444432110 Q ss_pred ---CC-CCcce--ecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhccCCCCCCCHHHHHHHHHH-HHH Q lcl|NC_020414. 309 ---NS-GTGEV--ITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTRRDAERVTAVEIQRDALE-IEQ 379 (515) Q Consensus 309 ---~~-~~g~~--~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~~~~~~~TAtEi~~r~~E-~~~ 379 (515) .+ ..|.+ +++.. ++.++.. +..+.+. .+..+..++.|-++|=.- .+...++..-|++.+.+.... ... T Consensus 226 ~~~~~~n~~~~~vl~~g~-~~~~l~~-~~~d~q~-~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~ 302 (384) T protein:vir:49 226 RQAMKQMQGGPLVLDDLE-DFTPLEI-KSNVAQL-LSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAVSR 302 (384) T ss_pred HHhcccCCccceecCCCc-eEEEccC-ChhhHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHHHH Confidence 00 11211 21111 2233322 2234443 456677788898888221 122223333455554332222 222 Q ss_pred HhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccce-eeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHH Q lcl|NC_020414. 380 NMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPV-IVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGD 458 (515) Q Consensus 380 ~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~-~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~ 458 (515) .|-|+.+++..+|..-+..-+. +....... +.--++.|.|+-- .....+...++...-.+.++....+... T Consensus 303 ~l~pi~~~i~~~l~~~l~~~~~-------~~~~~~~~~~~~~~~~l~~~~~-~t~~e~~~~l~~~g~~~ne~r~~~~~~p 374 (384) T protein:vir:49 303 FLRPFVSELSKKLSCEVDADIL-------PAVDPTGSNYIGLINSMVKTGT-LAQNQGLYVLQQAEILPKDLPEGETDST 374 (384) T ss_pred HHHHHHHHHHHHhchhhhhhhh-------hhhhccchHHHHHHHHHhhcCc-ccHHHHHHHHhhCCCCChhHHHHcCCCC Confidence 3455555555554322110000 00000000 1111222222211 1111111111111111222222222111 Q ss_pred HHHHHHHhcCCc-hhc Q lcl|NC_020414. 459 YMDWVRGQISAE-LPF 473 (515) Q Consensus 459 ~~~~~a~~~Gvp-~~~ 473 (515) + --|.. ..+ T Consensus 375 ~------~gGd~~~~~ 384 (384) T protein:vir:49 375 L------KGGETNEQY 384 (384) T ss_pred C------CCCCCCCCC Confidence 0 01111 123 No 122 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=94.18 E-value=0.0052 Score=33.13 Aligned_cols=412 Identities=10% Similarity=0.093 Sum_probs=174.6 Q ss_pred CCCcccccc------------------ccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc-----c-cCCCC-----CC Q lcl|NC_020414. 1 MQDTILEYG------------------GQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY-----L-MNNKG-----DN 51 (515) Q Consensus 1 ~~~~~~~~~------------------~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~-~~~~~-----~~ 51 (515) |++-- .. .+.+.|.+..+..+.+ ..+++.+.+|..-. + ..... .+ T Consensus 1 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~----~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~ 74 (478) T protein:vir:10 1 MISIN--WPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKEN----IDNITMGERYYNHHPDILDAPFKRDVNGDYDET 74 (478) T ss_pred Ccccc--ccCCchhhhHHHHHhhhccCChHHHHHHHHHHHHHH----HHHHHHHHHHhcccccccccchhhhcccccccc Confidence 44320 11 1334444444444332 23444555553211 0 00000 00 Q ss_pred ccccccccccHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHH Q lcl|NC_020414. 52 ETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAI 131 (515) Q Consensus 52 ~~~~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~ 131 (515) ....|+-.+.+...++..++-|++ -| +.+..++... ...++. .+ .++|.... T Consensus 75 ~~~~ki~~n~~k~ivd~~~~yl~g--~p-----~~~~~~~~~~---------~~~l~~-----------~~-~n~~~~~~ 126 (478) T protein:vir:10 75 KPDWRMYTNYHQNLVDQKVAYAVA--NP-----VTFGVDNDKA---------LKQIQH-----------TL-NHKWDDKL 126 (478) T ss_pred cccceeccchHHHHHHHHhhhhcc--cC-----ceeecCChHH---------HHHHHH-----------HH-hccHHHHH Confidence 111134455566666666666654 22 2233333211 111222 22 36899999 Q ss_pred HHHHHHHHhhCceEE--EEeCCCcEEE--EEc-ceEEEeeC-CCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCC Q lcl|NC_020414. 132 VEVFKHLIVAGNCLL--YKPSKGAMSA--VPM-HHYVVNRD-TNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKE 205 (515) Q Consensus 132 ~~~~~dl~~~G~~~l--~~d~~~~~r~--~pl-~~y~v~~d-~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~ 205 (515) .++.++...+|.+.+ |.|.++.+++ ++. .-|.+..| ..|++...+|.+...- T Consensus 127 ~~~~~~~~~~G~~~~~v~~d~~~~~~~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~---------------------- 184 (478) T protein:vir:10 127 VDILTAASNKGIEWVQPYVDEEGEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELDG---------------------- 184 (478) T ss_pred HHHHHHHhhCCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeeeC---------------------- Confidence 999999999999865 4566665554 433 33555443 4678777776665321 Q ss_pred cccEEEEEE------EEEcCCCCeEEEEEeCCe---e-ecc-cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHH Q lcl|NC_020414. 206 DDNVKLYTH------AQYAGEGFWKINQSADDI---P-VGK-ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFV 274 (515) Q Consensus 206 ~~~v~v~~~------v~~~~~~~~~~~~e~~~~---~-i~~-esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~ 274 (515) ...+++|+. ..-.....+.......+. . ... .-+| ..+|++.++. +.+|.|-.+...+-+-. T Consensus 185 ~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--g~vPvv~~~n-----~~~g~sd~e~v~~liDa 257 (478) T protein:vir:10 185 AERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSW--GRVPFIPFKN-----NPQEVSDLFMYKTIIDA 257 (478) T ss_pred ceEEEEEeCCcEEEEEecCCeeeccccccccccccceecccccccC--CcceEEEecc-----CCCCCCcHHHHHHHHHH Confidence 011222221 110101001111111110 1 111 1223 3578887765 45799999999999999 Q ss_pred HHHHHHHHHHHHHHhccCceeecCccccChhhccC--CCCcce-ecCCc-ccccccccCCccchHHHHHHHHHHHHHHHH Q lcl|NC_020414. 275 IQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVN--SGTGEV-ITGVE-EDIHIVQLGKYADLTPISAVLEVYTRRIGV 350 (515) Q Consensus 275 L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~--~~~g~~-~~g~~-~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~ 350 (515) ++.+.-......+....|.+.+.--..-+...... ...+.+ +++.. +++..+. ...+.......++.+++.|.. T Consensus 258 ~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~I~~ 335 (478) T protein:vir:10 258 LDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLKYYKAISVAGESGSGVDTIK--VEVPIDSVKEYTKMLRDYIIE 335 (478) T ss_pred HHHHHHHHHHHHHHhhCcceeeecCCcccccchhhhhhhCceeEecCCCCCcceEEe--ecCCHHHHHHHHHHHHHHHHH Confidence 99888888888888888865542111111111100 111222 33322 3344433 234566677777777776644 Q ss_pred HHHH-HhhccCCCCCCCHHHHH-------HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceee--ee Q lcl|NC_020414. 351 IFMM-ETMTRRDAERVTAVEIQ-------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIV--TG 420 (515) Q Consensus 351 afl~-~~l~~~~~~~~TAtEi~-------~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v--~~ 420 (515) .-.. +......+...|+..+. ....++...++..+.++-. .++.-.+.......+.+.+. .+ T Consensus 336 ~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~--------li~~~~~~~~d~~~i~i~f~~~~p 407 (478) T protein:vir:10 336 FGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQ--------YIIDFYRLDVRVQDIEITFNFNVM 407 (478) T ss_pred HhCCcCcCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHhCCCcccccceEEeCCCCC Confidence 3211 10001111234555543 2345555555555544322 22221122222222333331 12 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020414. 421 IEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGV 500 (515) Q Consensus 421 l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~ 500 (515) .+.+..++. +..+++ .+....++..+ -+++ -.++|++.+.++..++.+...... T Consensus 408 ~~~~e~~~~----------~~~~~g-------~iS~et~i~~~---~~v~----d~~~E~~ri~~E~~~~~~~~~~~~-- 461 (478) T protein:vir:10 408 VNELENSQI----------AMNSTG-------LLSKETILGNH---SWVQ----DPVAEMERIEQENIELNQQLPDIE-- 461 (478) T ss_pred CCHHHHHHH----------HHHHhC-------CCChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHhccccC-- Confidence 222222211 111111 12222222222 1121 134666666655444322211000 Q ss_pred hhhccchhhhhhccC Q lcl|NC_020414. 501 AKAVPGVIQQEMKEG 515 (515) Q Consensus 501 ~~a~~~~~~~~~~~~ 515 (515) .+....+..++ T Consensus 462 ----~~~~d~~~~~~ 472 (478) T protein:vir:10 462 ----EGLNDEQQRQS 472 (478) T ss_pred ----CCCcccccccC Confidence 01111111122 No 123 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=93.91 E-value=0.006 Score=32.77 Aligned_cols=356 Identities=10% Similarity=0.076 Sum_probs=138.9 Q ss_pred HHHHHHHHHhhhhHHHHHHHHHHhhcccccCCC-CCCccc-cccccccHH-HHHHHHHHHHHHhhcCCCCCceecCCChH Q lcl|NC_020414. 16 PKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNK-GDNETS-QNGWQGVGA-QATNHLANKLAQVLFPAQRSFFRVDLTAK 92 (515) Q Consensus 16 ~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~-~~~~~~-~~~~dst~~-~a~~~Laa~l~s~ltpp~~~WFrl~~~d~ 92 (515) -+.|+.+...+..-... ...++.+...... ++..-. .......+. .|++.+|+.+ +. -||--...... T Consensus 1 Mg~f~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~i-a~-----~~~~~~~~~~~ 71 (382) T protein:vir:48 1 MPIFNLATESPPDNQGG---FFDVVDSDFLASLKGNEWVSAETALRNSDLFSIINQLSNDL-AT-----VKLITSRKKLQ 71 (382) T ss_pred CccccccccCCcccccc---cccchhhhccccccCCcccchHhhhccHHHHHHHHHHHHhh-cc-----Cceeeecchhh Confidence 22233333222211111 1111212111111 111111 111222233 3444444444 22 24321111110 Q ss_pred HHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcC----CHHHHHHHHHHHHhhCceEEEEeCCC---cEEEEEc--ceEE Q lcl|NC_020414. 93 GEKVLDDRGLKKTQLATIFARVETTAMKALEQRQ----FRPAIVEVFKHLIVAGNCLLYKPSKG---AMSAVPM--HHYV 163 (515) Q Consensus 93 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~sn----f~~~~~~~~~dl~~~G~~~l~~d~~~---~~r~~pl--~~y~ 163 (515) . .+.+-| .+.=+..++.+|...||+.+++..+. ....+|| ..+- T Consensus 72 ~---------------------------L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~ 124 (382) T protein:vir:48 72 G---------------------------IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVS 124 (382) T ss_pred h---------------------------hhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeE Confidence 0 111222 34555566777888999999885432 1345555 3444 Q ss_pred EeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCcccc Q lcl|NC_020414. 164 VNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRIKAE 243 (515) Q Consensus 164 v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy~~~ 243 (515) +..+..|... .+.+..++...+....|+.+ T Consensus 125 v~~~~~~~~~--------------------------------------------------~y~~~~~~~~~~~~~~~~~~ 154 (382) T protein:vir:48 125 FNRLDNKDGI--------------------------------------------------YYNITFDDPRIPPKQHVPQN 154 (382) T ss_pred EEEcCCCCeE--------------------------------------------------EEEEEecCccccceeEEcCc Confidence 4444433210 11112222211111122222 Q ss_pred cCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccC---------CCCcc Q lcl|NC_020414. 244 KLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVN---------SGTGE 314 (515) Q Consensus 244 ~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~---------~~~g~ 314 (515) -+++.|+....+..||.||..-+...+...+...+.......-...|.+++.-++.++++.... ...|. T Consensus 155 --evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~n~g~ 232 (382) T protein:vir:48 155 --DVLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQGG 232 (382) T ss_pred --cEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhhccCCCC Confidence 2577787777788999999999999999999999999998888888988776666555533211 01122 Q ss_pred e--ecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHH Q lcl|NC_020414. 315 V--ITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAM 390 (515) Q Consensus 315 ~--~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~ 390 (515) + +++. -++.++.. +..+.+. .+..+..+..|-++|-.. .+...+ .-|..| .....-....|-|.+.++.. T Consensus 233 ~~vl~~g-~~~~~l~~-~~~d~q~-~e~~~~~~~~Ia~afgVp~~~lg~~~--~~~~~~-~~~~~~~~~~l~p~~~~i~~ 306 (382) T protein:vir:48 233 PLVLDDL-EDFTPLEI-KSNVSQL-LKQADWTTGQFAKVYGIPDNVVGGQG--DQQSSL-EMSSDLYSKAVSRYLRPFLS 306 (382) T ss_pred eeEcCCC-ceEEEccC-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCC--CcccHH-HHHHHHHHHHHHHHHHHHHH Confidence 1 2211 12333322 2234443 355677778888888321 111111 112221 11223344555565555555 Q ss_pred HHHHHHHHHHHHhcCCCCChhhccce-eeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCC Q lcl|NC_020414. 391 TMQTPIAMWGLQEAGDSFTSELVDPV-IVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISA 469 (515) Q Consensus 391 E~l~Pli~r~~~~~~~~~p~~~~~~~-~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gv 469 (515) |+-.-|..+.-......+- .... +..-+..|.|+- .-....+...++...-.+.++... +. .. T Consensus 307 ~l~~~l~~~~~~~~~~~~~---~~~~~~~~~~~~l~~~g-~~t~~e~r~~l~~~g~~~~~~~~~---~~---------~~ 370 (382) T protein:vir:48 307 ELSQKLSCDVDADIFPAVD---PTGSNYISRINSLVKTG-TLAQNQGLYILQQAEILPKELPNG---EN---------PN 370 (382) T ss_pred HHHHHhcChhhhhhhhhhc---cchhHHHHHHHHHhhcC-ccCHHHHHHHHhhCCCCCcchhhh---hc---------CC Confidence 5433222111000000000 0000 000011111110 000111111111100011111000 00 00 Q ss_pred chhccCCHHHHHHH Q lcl|NC_020414. 470 ELPFLKSEEEMQQE 483 (515) Q Consensus 470 p~~~irs~eev~~~ 483 (515) |+ +.--|+=++. T Consensus 371 ~~--~~GGd~~~~~ 382 (382) T protein:vir:48 371 ST--LKGGEEDGQD 382 (382) T ss_pred CC--CCCCCCCCCC Confidence 10 1110110000 No 124 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=92.43 E-value=0.0068 Score=32.46 Aligned_cols=236 Identities=11% Similarity=0.061 Sum_probs=95.0 Q ss_pred ccccccHHHHHHHHHHHHHhhh-hHHHHHHHHHHhhcccccCCCCCCcccccccc-ccHHHHHHHHHHHHHHhhcCCCCC Q lcl|NC_020414. 6 LEYGGQRSKIPKLWEKFSKKRS-PYLDRAKHFAKLTLPYLMNNKGDNETSQNGWQ-GVGAQATNHLANKLAQVLFPAQRS 83 (515) Q Consensus 6 ~~~~~~~~~l~~r~~~lk~~R~-~~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~d-st~~~a~~~Laa~l~s~ltpp~~~ 83 (515) |-+ |.... +|+ .....|..-.--+.|++....+..-....... ++--.|++.+|+.+.+. | T Consensus 1 Mgl----------F~~~~-~r~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~iA~l------p 63 (251) T protein:vir:46 1 MGI----------FYKNE-KRDLQYNEDDLQMMVQTLPSFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARM------P 63 (251) T ss_pred CCc----------ccccc-ccccCCCccchhhhhhhhccccCcCcceechhhhhccHHHHHHHHHHHHhHhhC------c Confidence 222 22111 221 11111111111122332222111111111122 22234555555555433 4 Q ss_pred ceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHH-HhcCCHH----HHHHHHHHHHhhCceEEEEeCCC---cEE Q lcl|NC_020414. 84 FFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKAL-EQRQFRP----AIVEVFKHLIVAGNCLLYKPSKG---AMS 155 (515) Q Consensus 84 WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~snf~~----~~~~~~~dl~~~G~~~l~~d~~~---~~r 155 (515) |.-.. ..... . ++-+...| .+-|-+. =+.....++..+|||.+|+..+. ... T Consensus 64 ~~~~~-~~~~~--------~-----------~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~ 123 (251) T protein:vir:46 64 IRVTV-NGQIN--------Y-----------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMN 123 (251) T ss_pred eEEee-Ccccc--------c-----------cchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEE Confidence 43222 11100 0 11112223 2344433 34455667778899999885432 234 Q ss_pred EEEc--ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCee Q lcl|NC_020414. 156 AVPM--HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIP 233 (515) Q Consensus 156 ~~pl--~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~ 233 (515) .+|| ...-+..|.+|++- | .|+.+..+..+... T Consensus 124 L~~i~~~~v~v~~~~~g~~~--~----------------------------------~~~~~~~~~~g~~~--------- 158 (251) T protein:vir:46 124 LTFRKTSEIELKSDARGRLY--Y----------------------------------FHQRIDSNGNNIER--------- 158 (251) T ss_pred EEEECCceEEEEECCCCcEE--E----------------------------------EEEEeccCCcceeE--------- Confidence 5555 45556666666321 0 01111111111110 Q ss_pred ecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCcccc-ChhhccCCCC Q lcl|NC_020414. 234 VGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQT-DVDHFVNSGT 312 (515) Q Consensus 234 i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~-~~~~~~~~~~ 312 (515) .|+.++ +++.|....+| .||.||...+...+...+...+.......-...|..++.-++.+ +.+ T Consensus 159 -----~~~~~d--iiH~r~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~~e------- 223 (251) T protein:vir:46 159 -----NVKFED--MLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKK------- 223 (251) T ss_pred -----EECCcc--EEEecCcCCCC-eeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCCCCHH------- Confidence 111111 35556554444 79999999999999998888887777766666665444322211 111 Q ss_pred cceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCC--CCHHH Q lcl|NC_020414. 313 GEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAER--VTAVE 369 (515) Q Consensus 313 g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~--~TAtE 369 (515) ..+.+++++++.+- ...++.+ +--+| T Consensus 224 ---------------------------~~~~~~~~~~~~~~----g~~n~g~~~~gm~~ 251 (251) T protein:vir:46 224 ---------------------------ARDRAREEFPKVLV----ELNKLGKLSYSMNQ 251 (251) T ss_pred ---------------------------HHHHHHHHHHHHhc----CcccccccccccCC Confidence 11222232322221 0001100 01111 No 125 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=91.39 E-value=0.016 Score=30.42 Aligned_cols=400 Identities=10% Similarity=0.030 Sum_probs=149.4 Q ss_pred HHHHHHHHHhhhhH-----HH-HHH--HHHHhhcccccCCCCCCccccccccccHH-HHHHHHHHHHHHhhcCCCCCcee Q lcl|NC_020414. 16 PKLWEKFSKKRSPY-----LD-RAK--HFAKLTLPYLMNNKGDNETSQNGWQGVGA-QATNHLANKLAQVLFPAQRSFFR 86 (515) Q Consensus 16 ~~r~~~lk~~R~~~-----e~-~w~--e~~~~~~P~~~~~~~~~~~~~~~~dst~~-~a~~~Laa~l~s~ltpp~~~WFr 86 (515) -..|+.|+..-+.. +. .|. +.+.+.+-. ....+..=.........+. .|++.+|+.+.+ + ||-- T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~-l-----p~~~ 73 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYNLGA-VAASGETVTPHDALQVSAVFASVRLLSETIAT-L-----PLST 73 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHhhcc-cccCCceechHHhhccHHHHHHHHHHHHhhcc-C-----ceEE Confidence 22233333221111 11 010 000000000 0000100001112222233 355555555433 2 3322 Q ss_pred cCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhc----CCHHHHHHHHHHHHhhCceEEEEeCCCc--EEEEEc- Q lcl|NC_020414. 87 VDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQR----QFRPAIVEVFKHLIVAGNCLLYKPSKGA--MSAVPM- 159 (515) Q Consensus 87 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~G~~~l~~d~~~~--~r~~pl- 159 (515) ..-.+... .++. ...++..++.. +.+.-+..++.++...||+.+++..+.+ ...+|| T Consensus 74 ~~~~~~~~----------~~~~------~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~~g~~~~l~~l~ 137 (457) T protein:vir:13 74 YSKRGGSR----------KEIV------TPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQGPNIVGLDVLD 137 (457) T ss_pred EEecCCcc----------cccc------cchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEEEEEc Confidence 21111000 0111 11222334432 2445566677788889999988744332 334454 Q ss_pred -ceEEEeeCCCCCe-eEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeeccc Q lcl|NC_020414. 160 -HHYVVNRDTNGDL-MDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKE 237 (515) Q Consensus 160 -~~y~v~~d~~G~v-d~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~e 237 (515) ..+.+..+..+.. ..+|+.|. +..++..... T Consensus 138 p~~v~v~~~~~~~~~~~~~~~y~----------------------------------------------~~~~~~~~~~- 170 (457) T protein:vir:13 138 PTKIHVHMVMVDGLRRKVFEAYD----------------------------------------------IDADGNEVLL- 170 (457) T ss_pred cCceEEEEecCCCccceeEEEEE----------------------------------------------EecCCceeeE- Confidence 2333333222210 01111111 1111111100 Q ss_pred CCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCC------ Q lcl|NC_020414. 238 NRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSG------ 311 (515) Q Consensus 238 sgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~------ 311 (515) ..|..+ -+++.|+....+..||.||...+...+.....+.+.......-...|..++.-++.++++...... T Consensus 171 ~~~~~~--diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~ 248 (457) T protein:vir:13 171 GWFTPR--DVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMSEEGLARAREAWRAA 248 (457) T ss_pred EeeCcc--ceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCCHHHHHHHHHHHHHH Confidence 012111 256777776777889999999999999999988888888888888888877777766665422111 Q ss_pred -----C-c--ceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhccCCCCCCCHHHHHHHHHH-HHHH Q lcl|NC_020414. 312 -----T-G--EVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTRRDAERVTAVEIQRDALE-IEQN 380 (515) Q Consensus 312 -----~-g--~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~~~~~~~TAtEi~~r~~E-~~~~ 380 (515) + | .++++.. +..++.. +..+.+. .+..+..+..|-++|-.- ++...+....+..-+.+.... .... T Consensus 249 ~~g~~nag~~~vl~~g~-~~~~l~~-~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~t 325 (457) T protein:vir:13 249 NSGVDNAHRVALLTEGA-KFSKVAM-SPDEAQF-LQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFS 325 (457) T ss_pred hcCccccCcceecCCCc-eEEEccC-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHH Confidence 0 1 1122111 2223222 1234443 334456677788888321 122222222222333333222 2345 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeee-hHHHHHHHHHHHHHHHHHHHHHhhcC-ChHHHhcCCHHH Q lcl|NC_020414. 381 MGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTG-IEALGRMAELDKLANFAQYMSLPQTW-PEPAQRAIRWGD 458 (515) Q Consensus 381 LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~-l~~l~ra~~~~~l~~~~~~v~~~a~~-~p~~~d~id~d~ 458 (515) |.|.+.++.++|-.=|+ ++. + .+..++.+ ++.|.|.--.+........++ ...+ +-++...++.+. T Consensus 326 l~P~~~~ie~~ln~~L~--------~~~--~-~~~~~i~fd~~~l~~~D~~~r~~~~~~~~~-~G~~T~NE~R~~~gl~P 393 (457) T protein:vir:13 326 LRPWLERIEAGFNRLLF--------AET--A-DRFRFVKFNLDEIKRGAPKERMELWSLGLQ-NGIYSIDEVRAAEDMTP 393 (457) T ss_pred HHHHHHHHHHHHHHhhc--------Ccc--c-cCceeEEeechhhhccCHHHHHHHHHHHHh-CCCcCHHHHHHHhCCCC Confidence 66666666665543322 111 1 11112222 222222211111111111111 0111 122333333322 Q ss_pred HHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccch----hhhhhccC Q lcl|NC_020414. 459 YMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGV----IQQEMKEG 515 (515) Q Consensus 459 ~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~----~~~~~~~~ 515 (515) +=.-.++.+=+|..+..-.+..+.. ......+.... ..+.-++| T Consensus 394 i~~g~~d~~~~~~n~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~g 441 (457) T protein:vir:13 394 LPDGLGEKYRVPLNLGEVGEEPEPE-------------PAPAPPAIEPPAEEPDEEPEPEG 441 (457) T ss_pred CCCCcccceeecccccccccccccc-------------ccCCCCCCCCCccccCCCCCCCC Confidence 2121222222333222211111000 00011111111 11111122 No 126 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=89.10 E-value=0.028 Score=29.07 Aligned_cols=367 Identities=8% Similarity=0.017 Sum_probs=141.6 Q ss_pred ccccccHHHHHHHHHHHHHhhhh---HHHHHHHHHHhhcccccCCCCCCccccccc-cccHHHHHHHHHHHHHHhhcCCC Q lcl|NC_020414. 6 LEYGGQRSKIPKLWEKFSKKRSP---YLDRAKHFAKLTLPYLMNNKGDNETSQNGW-QGVGAQATNHLANKLAQVLFPAQ 81 (515) Q Consensus 6 ~~~~~~~~~l~~r~~~lk~~R~~---~e~~w~e~~~~~~P~~~~~~~~~~~~~~~~-dst~~~a~~~Laa~l~s~ltpp~ 81 (515) |-+ |+.++...+. -...|-+. ...+ ..+..-...... .++--.|++.+|+.+.+ T Consensus 1 M~~----------f~~~~~~~~~~~~~~~~~~~~---~~~~---~~~~~v~~~~al~~~~V~~~v~~ia~~ia~------ 58 (397) T protein:vir:38 1 MPL----------LKLNKSHSQGFSLNDPDWVNF---LTGG---EAQKYVSADTALKNSDIFSLIMQLSGDLAM------ 58 (397) T ss_pred Ccc----------hhhhhcccCcccCCchhhhhh---hcCC---cCCceechHHhhccHHHHHHHHHHHHHHhh------ Confidence 433 2222221111 11222211 1000 000000001112 23333466666655532 Q ss_pred CCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCC---cEEEEE Q lcl|NC_020414. 82 RSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG---AMSAVP 158 (515) Q Consensus 82 ~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~---~~r~~p 158 (515) -|| ...++ ....++..- -.--..+.-+..+..++..+|||.+++..+. ....+| T Consensus 59 ~p~---~~~~~-------------~~~~l~~~P-------N~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~ 115 (397) T protein:vir:38 59 VRY---TSESD-------------RSQSIISNP-------SVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEY 115 (397) T ss_pred Ccc---ccccc-------------HHHHHHhcC-------CCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEE Confidence 233 11111 112221110 0112344556677778888999998875432 234555 Q ss_pred c--ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecc Q lcl|NC_020414. 159 M--HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGK 236 (515) Q Consensus 159 l--~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~ 236 (515) + ..+-+..+.+|.. ++.++.. ++...+. T Consensus 116 l~~~~v~i~~~~~~~~--~~y~~~~------------------------------------------------~~~~~~~ 145 (397) T protein:vir:38 116 LRPSQVQPMLLQDGSG--LIYNINF------------------------------------------------DEPAIGY 145 (397) T ss_pred EcCceeEEEEcCCCce--EEEEEEe------------------------------------------------ccccccc Confidence 5 4555555555431 1111111 0000000 Q ss_pred cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhcc-------- Q lcl|NC_020414. 237 ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFV-------- 308 (515) Q Consensus 237 esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~-------- 308 (515) .-.|+.++ +++.|.....+..||.||..-+...+.......+.......-...|..++.-++.++++... T Consensus 146 ~~~~~~~e--iih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~ 223 (397) T protein:vir:38 146 MENVPAAD--VIHIRLLSKNGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGGLLDAETRIARSKEI 223 (397) T ss_pred eeEecCcc--EEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHH Confidence 00111112 45566665667789999999999999999988888888888788887776655544433211 Q ss_pred --CCC-CcceecCCcccccccccCC-ccchHHHHHHHHHHHHHHHHHHHHH--hhccCCCCCCCHHHHHHHHHHHHHHhh Q lcl|NC_020414. 309 --NSG-TGEVITGVEEDIHIVQLGK-YADLTPISAVLEVYTRRIGVIFMME--TMTRRDAERVTAVEIQRDALEIEQNMG 382 (515) Q Consensus 309 --~~~-~g~~~~g~~~~v~~~~~~~-~~~l~~~~~~i~~~~~rI~~afl~~--~l~~~~~~~~TAtEi~~r~~E~~~~LG 382 (515) .+. .|.++. ..+.+...++.. ..+.+ ..+..+..+..|-.+|-.. .+....+. .+..| +...-....|- T Consensus 224 ~~~~~n~~~~~v-l~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~afgVp~~~lg~~~~~-~~~~e--~~~~~~~~~l~ 298 (397) T protein:vir:38 224 SKQIHNSDGPVV-IDALEDYKPLEVKGNIAS-LLNQVDWTRDQIAKVYGVPDSYLNGQGDQ-QSSIT--QISGQYAKSLN 298 (397) T ss_pred HhcccccCCcee-cCCCceEEecCCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCc-ccHHH--HHHHHHHHHHH Confidence 111 111111 112222222332 23444 3455677888898888321 12211111 12222 11222334555 Q ss_pred hhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHH Q lcl|NC_020414. 383 GVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDW 462 (515) Q Consensus 383 pv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~ 462 (515) |.+..+..||-.-|+ + ....+....---....|+.....+. + . ..+..+++-+ T Consensus 299 P~~~~ie~~ln~~l~--------~---~~~~~~~~~~~~d~~~~~~~~~~~~---~-----~-------G~~t~nE~R~- 351 (397) T protein:vir:38 299 RYVQAIVGELNDKLH--------A---NISANIRFAIDAMGDQYASTISSSV---K-----G-------GTIAGNQARF- 351 (397) T ss_pred HHHHHHHHHHHHhcc--------C---hhcccccccccCCHHHHHHHHHHHH---h-----C-------CCcCHHHHHH- Confidence 655555555432222 1 1111111111112233332222211 1 0 1123333222 Q ss_pred HHHhcCCch---hccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 463 VRGQISAEL---PFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 463 ~a~~~Gvp~---~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) .+|.|+ .=+...+. ............+... -..+...++ T Consensus 352 ---~lg~~p~~~~d~~~~~~---------~~~~~~~~~~~~~g~~--~~~~~~e~~ 393 (397) T protein:vir:38 352 ---ILQNSGYLAKDLPDPEK---------EPQQAIQLIQQEGGEN--DGNNSDERG 393 (397) T ss_pred ---HhCCCCCCCCccccccc---------cccccccccccccCCC--CCCCCCCCC Confidence 233332 10000000 0000000000000000 001111122 No 127 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=89.00 E-value=0.029 Score=29.02 Aligned_cols=433 Identities=10% Similarity=0.036 Sum_probs=171.7 Q ss_pred CCC-------------ccccc------cccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhccc-----ccC---CCCCC-- Q lcl|NC_020414. 1 MQD-------------TILEY------GGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY-----LMN---NKGDN-- 51 (515) Q Consensus 1 ~~~-------------~~~~~------~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~-----~~~---~~~~~-- 51 (515) ||+ ..-+. ..+...+.+..+ ..|. ++++.+.+|..-. +.. ..... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~---~~~~---~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~ 74 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLID---EHNP---EPLLKGVRYYMCENDIEKKRRTYYDAAGQQL 74 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHH---hhcH---HHHHHHHHHhccccchhhccchhcccccccc Confidence 332 11000 001222222222 1222 4455666665431 110 00000 Q ss_pred ----ccccccccccHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_020414. 52 ----ETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQF 127 (515) Q Consensus 52 ----~~~~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf 127 (515) ....|+-.+-+...++.+++-|. . .| +.++..++. +.+.+. .+..++| T Consensus 75 ~~~~~~~~ri~~n~~~~ivd~~~~yl~----g--~~-~~~~~~d~~-------------~~~~l~--------~~~~n~~ 126 (503) T protein:vir:59 75 VDDTKTNNRTSHAWHKLFVDQKTQYLV----G--EP-VTFTSDNKT-------------LLEYVN--------ELADDDF 126 (503) T ss_pred cccccccceeecchHHHHHHHHHhhhh----c--CC-eeeccCcHH-------------HHHHHH--------HHHhcCH Confidence 01113334455556666665543 2 11 223433322 222222 2234789 Q ss_pred HHHHHHHHHHHHhhCceEEE--EeCCCc--EEEEEcce-EEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhc Q lcl|NC_020414. 128 RPAIVEVFKHLIVAGNCLLY--KPSKGA--MSAVPMHH-YVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGK 201 (515) Q Consensus 128 ~~~~~~~~~dl~~~G~~~l~--~d~~~~--~r~~pl~~-y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~ 201 (515) .....++.++...+|.++++ .|.++. +++++-.+ |++.-|. .+.+..++|.++..- . T Consensus 127 ~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~-------~---------- 189 (503) T protein:vir:59 127 DDILNETVKNMSNKGIEYWHPFVDEEGEFDYVIFPAEEMIVVYKDNTRRDILFALRYYSYKG-------I---------- 189 (503) T ss_pred HHHHHHHHHHHhhCCeEEEEEeecCCCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEec-------C---------- Confidence 99999999999999998765 455544 44555444 4444443 477777766665310 0 Q ss_pred cCCCcccEEEEEEE-----EEcCCCCeEE---EEEeC---Ceee-cccCCcccccCcEEEEeeeecCCCccccchHHHHH Q lcl|NC_020414. 202 KCKEDDNVKLYTHA-----QYAGEGFWKI---NQSAD---DIPV-GKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYS 269 (515) Q Consensus 202 ~~~~~~~v~v~~~v-----~~~~~~~~~~---~~e~~---~~~i-~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l 269 (515) .+.....+++|+.- .....+ +.. +.+.. .... ...-+| ..+|++.++- +.+|.|=...+. T Consensus 190 ~~~~~~~~evy~~~~i~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~--~~vPiv~~~n-----n~~~~sd~~~~~ 261 (503) T protein:vir:59 190 MGEETQKAELYTDTHVYYYEKIDGV-YQMDYSYGENNPRPHMTKGGQAIGW--GRVPIIPFKN-----NEEMVSDLKFYK 261 (503) T ss_pred CCceEEEEEEEeCCcEEEEEEcCCc-ccccccccccccccceeecceeccC--CccceEEecC-----CCCCCcchhhhH Confidence 00111223333220 000010 000 00000 0000 011223 3478776653 457999888899 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCceeecCccccChhh-ccCCC-CcceecCCcccccccccCCccchHHHHHHHHHHHHH Q lcl|NC_020414. 270 GDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDH-FVNSG-TGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRR 347 (515) Q Consensus 270 ~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~-~~~~~-~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~r 347 (515) +-+..++.+.-......+....|.+.+..-..-+... ..... .+.+.....+++..+.. ..+.+.....++.++.. T Consensus 262 ~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~ 339 (503) T protein:vir:59 262 DLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTANLRYHSVIKVSGDGGVDTLRA--EIPVDSAAKELERIQDE 339 (503) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhhhhhcccceeccCCCcceeEec--cCCHHHHHHHHHHHHHH Confidence 9999999888888888888888876553211111111 11111 12222233334555442 23556667777777777 Q ss_pred HHHHHHHHhhc-cCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHH---HhcCCC--CChhhccceeeeeh Q lcl|NC_020414. 348 IGVIFMMETMT-RRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGL---QEAGDS--FTSELVDPVIVTGI 421 (515) Q Consensus 348 I~~afl~~~l~-~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~---~~~~~~--~p~~~~~~~~v~~l 421 (515) |...-..-.+. ...+...|+..+..+..-..... -...+.-.+.|.-+++.++ ...... .+...+.+.+-.++ T Consensus 340 i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~-~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~ 418 (503) T protein:vir:59 340 LYKSAQAVDNSPETIGGGATGPALENLYALLDLKA-NMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTRTR 418 (503) T ss_pred HHHHhcccCCCcccccccccHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhccCcccccccceeEEeCCCC Confidence 75543211111 11234467777654322221111 1122222222333332222 211111 11222343332221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020414. 422 EALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVA 501 (515) Q Consensus 422 ~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~ 501 (515) |-..++.++.+..+.+ +.+ +....++..+ -+++ -.++|++.+.+++++..+...-..... T Consensus 419 -p~d~~~~~~~~~kl~~-----~Gi-------iS~et~l~~l---~~v~----d~~~E~~ri~~E~~~~~~~~~~~~~~~ 478 (503) T protein:vir:59 419 -IQNDSEIVQSLVQGVT-----GGI-------MSKETAVARN---PFVQ----DPEEELARIEEEMNQYAEMQGNLLDDE 478 (503) T ss_pred -CCCHHHHHHHHHHHHh-----CCC-------CchHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHhhhccccCcc Confidence 1112222222222211 111 1112222221 1121 124666666543332221111000000 Q ss_pred ------hhccchh--hhhhccC Q lcl|NC_020414. 502 ------KAVPGVI--QQEMKEG 515 (515) Q Consensus 502 ------~a~~~~~--~~~~~~~ 515 (515) ....+.. ++....| T Consensus 479 ~~~~~~~~~~~~~~~~~~~~~g 500 (503) T protein:vir:59 479 GGDDDLEEDDPNAGAAESGGAG 500 (503) T ss_pred CCCCCCCcCCCCCCcccCCCCC Confidence 0000000 1111111 No 128 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=86.41 E-value=0.046 Score=27.93 Aligned_cols=407 Identities=10% Similarity=0.094 Sum_probs=158.1 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCc-ccccc--ccccHHHHHHHHHHHHHHhh Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNE-TSQNG--WQGVGAQATNHLANKLAQVL 77 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~-~~~~~--~dst~~~a~~~Laa~l~s~l 77 (515) ||+--.++ ...++..-.++.+.| .+++-|.......... -+... ..++--.|++.+|+.+. T Consensus 1 ~~~~~~~~----------~~~~~~~~~~~~~~~---~~~~g~~~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA--- 64 (460) T protein:vir:10 1 MANRIIRA----------LRELTGLDNKFNDAF---IKYIGQTFTKYDNNGKTYLEQGYNINPDVYSCISQMAAKTV--- 64 (460) T ss_pred CchhHHHH----------HhhhhccCCCchHHH---HHhhccccCCCccchhhhhHHHHhcchHHHHHHHHHHHhhh--- Confidence 76655322 122222233334445 4666665433221111 11221 23344456666666653 Q ss_pred cCCCCCceecCCChHH-Hhhhhcc------------chhHHHHHHHHHHHHHHHHHHHHhcC----CHHHHHHHHHHHHh Q lcl|NC_020414. 78 FPAQRSFFRVDLTAKG-EKVLDDR------------GLKKTQLATIFARVETTAMKALEQRQ----FRPAIVEVFKHLIV 140 (515) Q Consensus 78 tpp~~~WFrl~~~d~~-~~~~~~~------------~~~~~~v~~~L~~ve~~~~~~l~~sn----f~~~~~~~~~dl~~ 140 (515) +-||.-....... ..+.... ......+ ..+...+......+.+=| .+.-...++.++.. T Consensus 65 ---~lp~~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll 140 (460) T protein:vir:10 65 ---AVPYTIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRL-DTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRL 140 (460) T ss_pred ---hCceEEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchh-hhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhh Confidence 3455433221110 0000000 0000001 111222222333333333 44555666677888 Q ss_pred hCceEEEEeCCC-----c--EEEEEc--ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEE Q lcl|NC_020414. 141 AGNCLLYKPSKG-----A--MSAVPM--HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKL 211 (515) Q Consensus 141 ~G~~~l~~d~~~-----~--~r~~pl--~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v 211 (515) +|||.+|+.... + ...||| +.+-+..+.+|.+-. +++. + T Consensus 141 ~Gnay~~i~r~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~--~~~~------------------------------~ 188 (460) T protein:vir:10 141 NGNCYFYLMSPDDGINAGVPSQMYVLPAHLIKIVLKDDINLLS--TDSP------------------------------I 188 (460) T ss_pred cCCeEEEEEecCCCccCceeEEEEEEcCceEEEEEcCCCceee--eeee------------------------------e Confidence 999998875421 1 234554 455566665553321 1000 0 Q ss_pred EEEEEEcCCCCeEEEEEeCCeeecccCCcccccCcEEEEeeeec-----CCCccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 212 YTHAQYAGEGFWKINQSADDIPVGKENRIKAEKLPFIPLTWKRS-----YGEDWGRPLVEDYSGDLFVIQFLSEAVARGA 286 (515) Q Consensus 212 ~~~v~~~~~~~~~~~~e~~~~~i~~esgy~~~~~P~~~~Rw~~~-----~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~ 286 (515) ..+.+..+|... .|+.+ -++++|+... .+..||.||..-+...+.......+...... T Consensus 189 -----------~~~~~~~~g~~~----~~~~~--evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f 251 (460) T protein:vir:10 189 -----------KSYMLIQGDQFI----EFNED--EVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTM 251 (460) T ss_pred -----------eEEEEecCceeE----Eeccc--ceEEEecCCCCcccccCccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 000011111110 01111 1355554333 3567999999999999999888888888777 Q ss_pred HHhccCceeecCccccChhhccCCC------------Ccce--ecCCcccccccccCCccchHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 287 ALMADIKYLIRPGSQTDVDHFVNSG------------TGEV--ITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIF 352 (515) Q Consensus 287 ~~a~~p~~l~~~~g~~~~~~~~~~~------------~g~~--~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~af 352 (515) .....|-+++..++.++++...... .|.+ +++. -+..++.. +..+.+. .+..+..+..|-++| T Consensus 252 ~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g-~~~~~l~~-~~~d~q~-~e~~~~~~~~Ia~~f 328 (460) T protein:vir:10 252 QNGGVFGFIHGGSTGLTQPQADSLKQRLTEMDKSPDRLSQIAGASGE-IAFTKISL-NTDELKP-FDYLKYDQKAICNAL 328 (460) T ss_pred hcCCCcceeeecCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCC-ceEEEccC-ChhHHHH-HHHHHHHHHHHHHHh Confidence 7777777777777766655322111 0111 1111 12222221 1233333 455566778888888 Q ss_pred HH--HhhccCCCCCCCHHHHHHHHHH-HHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeee-hHHHHHH- Q lcl|NC_020414. 353 MM--ETMTRRDAERVTAVEIQRDALE-IEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTG-IEALGRM- 427 (515) Q Consensus 353 l~--~~l~~~~~~~~TAtEi~~r~~E-~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~-l~~l~ra- 427 (515) =. ..+...++...|-.-+.+.... ....|.|...++..||-.-|+ ++ .+...-.++.+ .+.+..- T Consensus 329 gVPp~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~--------~~--~~~~~~~~i~~d~~~l~~l~ 398 (460) T protein:vir:10 329 GWSDKLLNNNEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKKFI--------KR--FKGYENAVIEWDISELPEMQ 398 (460) T ss_pred CCCHHHhCCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--------Cc--ccccCCceEEeecchhhhHH Confidence 21 1222222222222222222222 233566666666666544332 11 11111111221 1122111 Q ss_pred HHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccch Q lcl|NC_020414. 428 AELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGV 507 (515) Q Consensus 428 ~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~ 507 (515) .+...+.. ++..-.--+-++...++.+.+-+.-++.+=+|..++..+ .+. ++ +.++. T Consensus 399 ~d~~~~~~---~~~~g~~T~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~-~~~---~~----------------~~~~~ 455 (460) T protein:vir:10 399 TDMVAMAS---WLNTIPVTPNEIRIAMKYETLNQDGMDIVFMPSNKVRID-DVS---NN----------------LIDSA 455 (460) T ss_pred HHHHHHHH---HHhCCCCCHHHHHHHhCCCCCCCCCCCeeeecccccchh-hcc---cc----------------cCCCc Confidence 12222221 121100001223333322222111122222222222111 100 00 00000 Q ss_pred hhhhhc Q lcl|NC_020414. 508 IQQEMK 513 (515) Q Consensus 508 ~~~~~~ 513 (515) -..+| T Consensus 456 -~nq~~ 460 (460) T protein:vir:10 456 -FNQNQ 460 (460) T ss_pred -ccCCC Confidence 00000 No 129 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=84.77 E-value=0.058 Score=27.37 Aligned_cols=417 Identities=10% Similarity=0.032 Sum_probs=148.6 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccc-cCCCCC------CccccccccccHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYL-MNNKGD------NETSQNGWQGVGAQATNHLANKL 73 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~-~~~~~~------~~~~~~~~dst~~~a~~~Laa~l 73 (515) |-+ .|-..+.+ .|..+.....++.+.+.+|..-.- ....+. +....+...+-+..+++.+++.| T Consensus 1 ~~~------~t~~~~~~---~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l 71 (456) T protein:vir:79 1 MTA------STPAEWLP---VLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI 71 (456) T ss_pred CCC------CCHHHHHH---HHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhh Confidence 221 12222222 222222223334445555543211 000010 01111233456666666666655 Q ss_pred HHhhcCCCCCceecCCCh-HHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEE--EeC Q lcl|NC_020414. 74 AQVLFPAQRSFFRVDLTA-KGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLY--KPS 150 (515) Q Consensus 74 ~s~ltpp~~~WFrl~~~d-~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~--~d~ 150 (515) ++- + |++...+ ... ...+ .+.+.+++|.....++.++...+|.|.++ .+. T Consensus 72 ~~~------g-~~~~~~~d~~~---------~~~~-----------~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~e 124 (456) T protein:vir:79 72 IPN------G-ITVGGSADSDL---------ALRA-----------RRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD 124 (456) T ss_pred ccC------C-eecCCCCCccH---------HHHH-----------HHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCC Confidence 332 2 2222111 111 1112 23455678999999999999999998654 455 Q ss_pred CCc--EEEEEcce-EEEeeCCCCC-eeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEE Q lcl|NC_020414. 151 KGA--MSAVPMHH-YVVNRDTNGD-LMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKIN 226 (515) Q Consensus 151 ~~~--~r~~pl~~-y~v~~d~~G~-vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~ 226 (515) ++. +++++..+ +++.-+..++ +...+|.+. ... +.. ....-..++..+.++...+...+..+... T Consensus 125 dg~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~-~~d----~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 193 (456) T protein:vir:79 125 DGTATITADSPETMVVSVDPLQPWRIRSAMRWWR-DLD----AES------DFAIVWSGDGWQKFARPCFVQSSSRRRLV 193 (456) T ss_pred CCceEEEEeccceeEEEEcCCCCCceEEEEEEEE-ecC----Cce------eEEEEEcCCceEEEEEEEEeeccccceee Confidence 543 55554444 4444443443 444444432 110 000 00001122222222222211111111111 Q ss_pred EEeCCe-eecc--cCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeec------ Q lcl|NC_020414. 227 QSADDI-PVGK--ENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIR------ 297 (515) Q Consensus 227 ~e~~~~-~i~~--esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~------ 297 (515) ...++. .... +.++ ..+|++.++ +..|.|=.+..++-+-.++...-.....++..+.|...+. T Consensus 194 ~~~~~~~~~~~~~~~~~--~~~pvv~~~------N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~ 265 (456) T protein:vir:79 194 TRISDSWVPVGDAVVTG--SPPPVVVYQ------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRL 265 (456) T ss_pred eccCCceeecccccCCC--CceeEEEec------CCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCccc Confidence 111111 1111 1222 346665542 4678887777777666666554444445555555432221 Q ss_pred ----Cccc-cChhhccCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHH-H-HHhhccCCCCCCCHHHH Q lcl|NC_020414. 298 ----PGSQ-TDVDHFVNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIF-M-METMTRRDAERVTAVEI 370 (515) Q Consensus 298 ----~~g~-~~~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~af-l-~~~l~~~~~~~~TAtEi 370 (515) ..|- .+.........|.+.... ++....++. ..+++.....++.+...|...= + ...+. .+....++.-+ T Consensus 266 ~~~d~~g~~i~~~~~~~~~~~~~~~~~-~~~~~~q~~-~~~~~~~~~~l~~~i~~i~~~t~~p~~~~~-~~~~N~Sg~Al 342 (456) T protein:vir:79 266 PKVDENGNAIDYASIFEAAPGALWELP-PGVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLM-PDSANQSAEGA 342 (456) T ss_pred ccccccccccchhhhhhhhccccccCC-CCcceeeec-ccChHHHHHHHHHHHHHHHhhcCCChhHhc-ccccCcHHHHH Confidence 1110 111111112233332222 222222322 2344443333333333332110 0 00011 01123455544 Q ss_pred HH-------HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee--eeehHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 371 QR-------DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI--VTGIEALGRMAELDKLANFAQYMS 441 (515) Q Consensus 371 ~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~--v~~l~~l~ra~~~~~l~~~~~~v~ 441 (515) .. +.+.++..+++.+.++..=++ .+.+. .....+++.+ ..+.+.+++|+-..++. + T Consensus 343 ~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~------~~~g~---~~~~~i~v~w~~~~~~s~~~~ada~~kl~---~--- 407 (456) T protein:vir:79 343 HNIEKGFLFKCEDRLSIAKIGLEAILVKAL------QIEGE---SVEDTVDVSFESPDRVTLGEKYSAASLAK---A--- 407 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HhcCC---CccccceEEeCCCCCcCHHHHHHHHHHHH---h--- Confidence 33 345555666665554433110 12221 1112233322 12223333333222221 1 Q ss_pred HhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhh Q lcl|NC_020414. 442 LPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQ 509 (515) Q Consensus 442 ~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~ 509 (515) ++++.. .. ....+|+.. +++++.-.+|..+++.++....++....++.- T Consensus 408 --~G~~~~--------~~---~~~~lg~~~------~~i~~~e~~r~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:79 408 --AGESWA--------SI---RRNILNYNA------DQIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred --cCCChH--------HH---HHhcCCCCH------HHHHHHHHHHHHHHHHHHhhhHhhcCCCCCCC Confidence 122221 11 123345533 33332222222222111111111111111111 No 130 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=80.02 E-value=0.099 Score=26.10 Aligned_cols=407 Identities=12% Similarity=0.098 Sum_probs=175.4 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCC--CCCc--------cccccccc----cHH--H Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNK--GDNE--------TSQNGWQG----VGA--Q 64 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~--~~~~--------~~~~~~ds----t~~--~ 64 (515) |.=+.-.. .=..+..+|+.. |......-+...+-.||.....+ ..+. +..+.|+. .+. - T Consensus 14 m~V~~~hp--~y~a~~~~W~~~---~d~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~~~rA~~~n 88 (488) T protein:vir:96 14 MLTPIYHP--DYLVNAPQWLRN---LDCVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLTWRLANYVN 88 (488) T ss_pred ecccccCH--HHHHHhhhhhHh---hhhhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhhhhccccCc Confidence 66333111 233444455433 44455555555566677642211 1111 11111211 111 2 Q ss_pred HHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCce Q lcl|NC_020414. 65 ATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNC 144 (515) Q Consensus 65 a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~ 144 (515) +.+...+.|++.+|- ..|=+.+ +. ...++.+++.| -....+.+.-+...+.+...+|-+ T Consensus 89 ~~~~tl~~l~G~vfr-k~p~~~~-~~-------------~~~l~~l~~d~------D~~G~~L~~f~~~~~~~~l~~G~~ 147 (488) T protein:vir:96 89 IVNPTMNAITGAVMR-REPEFDT-MD-------------NPVLIGLRDNI------DGKGNGIDQECKQALNALQWGSRC 147 (488) T ss_pred hhHHHHHHhcchhhc-cCceecc-CC-------------cHHHHHHHhcc------CCCCCCHHHHHHHHHHHHHhcCeE Confidence 233333344444442 1111111 11 12345555554 244677888889999999999999 Q ss_pred EEEEeCCC-----------c----EEEEEcce---EEEee-CCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCC Q lcl|NC_020414. 145 LLYKPSKG-----------A----MSAVPMHH---YVVNR-DTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKE 205 (515) Q Consensus 145 ~l~~d~~~-----------~----~r~~pl~~---y~v~~-d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~ 205 (515) .++||... + +..|+-.+ +-..+ |....+.-+..+++.... +..+ T Consensus 148 ~ilVD~P~~~~T~ade~~~~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~----------------D~~~- 210 (488) T protein:vir:96 148 GWLVRSHPESATMADWNKGKKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQER----------------DGGT- 210 (488) T ss_pred EEEEecCCCcCCHHHHHHhcCCcEEEEechhhhcCcceeccCCceeeEEEEEEEEEEec----------------cCCC- Confidence 99998542 1 22333322 22222 222233334344433211 0001 Q ss_pred cccEEEEEEEEEcCCCCeEEEEEeCCee----ec-ccCCcccccCcEEEEeeeecCCCccccc--hHHHHHHHHHHHHHH Q lcl|NC_020414. 206 DDNVKLYTHAQYAGEGFWKINQSADDIP----VG-KENRIKAEKLPFIPLTWKRSYGEDWGRP--LVEDYSGDLFVIQFL 278 (515) Q Consensus 206 ~~~v~v~~~v~~~~~~~~~~~~e~~~~~----i~-~esgy~~~~~P~~~~Rw~~~~g~~YGrg--p~~~~l~d~k~L~~l 278 (515) ....+.+..... .+|.|.++...++.. +. .+|+ +.+++|++.|....+..+..| |.. |+..||.- T Consensus 211 ~~~~~~~~~~~l-~~g~~~v~~~~~~~~~~e~~~~~~g~---~~l~~IP~v~~~~~~~~~~~~~pPLl----dLA~lnl~ 282 (488) T protein:vir:96 211 YVSKQRLINHRL-VDGLCEFQEVTDDEYSDEWTPVLINS---KQSDTIPFFLASSQSNEWCIDSTPLT----SLAEISLS 282 (488) T ss_pred cccceEEEEEEE-ECcEEEEEEEecCCcccceEeecCCC---cccCeeEEEEEecCCCCCCCCCCchH----HHHHHHHH Confidence 111112111112 245677766544331 22 2344 347788888887766665444 533 44444422 Q ss_pred ---HHHHHHHH-HHhccCceeecCccccChhhccCCCCcceecCC-------cccccccccCCccchHHHHHHHHHHHHH Q lcl|NC_020414. 279 ---SEAVARGA-ALMADIKYLIRPGSQTDVDHFVNSGTGEVITGV-------EEDIHIVQLGKYADLTPISAVLEVYTRR 347 (515) Q Consensus 279 ---~~~~~~~~-~~a~~p~~l~~~~g~~~~~~~~~~~~g~~~~g~-------~~~v~~~~~~~~~~l~~~~~~i~~~~~r 347 (515) ..+-++.+ ..+--|+|....++. .++.......+.+..|. .++...++.+ .+.+ +.+.+++++.+ T Consensus 283 Hy~~ssd~~~il~~~~~p~lv~~~~~~-~~~~~~~~~~~g~~~~~~~~~~~~~g~~~~~e~~-~~~l--~~~~l~~l~~q 358 (488) T protein:vir:96 283 IYVMNAYSNKAMILANEAKWMVDMGDM-NKTMASEMNPLGFTLAGRMPYYVKNGDVKVIQAQ-FSPE--TENKVEKLFEQ 358 (488) T ss_pred HHhhhhHHHHHHHhcCCceeeeccCCC-CcccccccccceeeecccccccccCCceeecCCc-hhHH--HHHHHHHHHHH Confidence 22222333 334445554433332 22211111112222221 1223222221 1122 36667777777 Q ss_pred HHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH-h-----cCCCCChhhccceee-ee Q lcl|NC_020414. 348 IGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQ-E-----AGDSFTSELVDPVIV-TG 420 (515) Q Consensus 348 I~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~-~-----~~~~~p~~~~~~~~v-~~ 420 (515) +.++= ..++ ... ...||++...+...--..|..+...+..-+..-|-..... | ..+.-+...++..++ .. T Consensus 359 m~~~G-a~l~-~~~-~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~A~w~g~~~~~~~~~~~~~~in~dF~~~~ 435 (488) T protein:vir:96 359 AVKVG-ASLF-TQQ-SNETATGAAIRSGSSTASMATLGNNVEDTVRNMLRFIMRYFEGTNLYVNPDELVFKLNRDYFDVE 435 (488) T ss_pred HHHHh-Hhhc-cCC-CcchHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCcCccceEEEeccCCCCcc Confidence 65531 1122 233 3479999999999999999988887776654433222211 2 112212222333332 12 Q ss_pred h-----HHHHHHHHHHHH--HHHHHHHHHhhcCChHHHhcCCHHHHHHHHHH-hcCC Q lcl|NC_020414. 421 I-----EALGRMAELDKL--ANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRG-QISA 469 (515) Q Consensus 421 l-----~~l~ra~~~~~l--~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~-~~Gv 469 (515) + .++-.+-....| ..+...+..-..+. +.+++++..+.+++ .+|+ T Consensus 436 ld~~~~~al~~~~~~G~Is~~t~~~~L~~~gvl~----~d~~~e~~~~~ie~~g~~~ 488 (488) T protein:vir:96 436 VNPQMLQVAYAAMMEGNLPQVSWFELLKRARVVR----GDMSKEEFDEHIAELGFGM 488 (488) T ss_pred CCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCcCC----ccCCHHHHHHHHhhcCCCC Confidence 2 222222222111 22223332211122 33466666666653 2233 No 131 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=79.70 E-value=0.1 Score=26.02 Aligned_cols=193 Identities=8% Similarity=0.019 Sum_probs=84.7 Q ss_pred EEEcCCCCeEEEEEeCCeee-cccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|NC_020414. 215 AQYAGEGFWKINQSADDIPV-GKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIK 293 (515) Q Consensus 215 v~~~~~~~~~~~~e~~~~~i-~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~ 293 (515) ++...+|.+++.+....... +..-.|..++ +++.|.....+..||.+|..-++..+..-+...+-....-.-...|. T Consensus 1 ~r~~~dg~~~y~~~~~~~~~~g~~~~~~~~e--ilH~r~~~~~~~~~Glspi~~a~~~i~~~~aa~~~~~~~f~Ng~~p~ 78 (219) T protein:vir:98 1 MRVCKDGNYKYLMKKSLYDTKSEIYEYNKND--VIFIKLYDPMQQVYGSPDYVGGITSALLNSDATIFRRRYYSNGAHMG 78 (219) T ss_pred CceeecCeEEEEEecceecCCceeEEecccc--EEEecCCCCCCCcceecHHHHHHHHHHHHHHHHHHHHHHHhcCCCCc Confidence 55555665544333221111 1122233333 46666544446689999999888887765555544444444456666 Q ss_pred eee-cCccccChhhcc----------CCCCc--cee--cCC-cccccccccC-CccchHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_020414. 294 YLI-RPGSQTDVDHFV----------NSGTG--EVI--TGV-EEDIHIVQLG-KYADLTPISAVLEVYTRRIGVIFMME- 355 (515) Q Consensus 294 ~l~-~~~g~~~~~~~~----------~~~~g--~~~--~g~-~~~v~~~~~~-~~~~l~~~~~~i~~~~~rI~~afl~~- 355 (515) .++ .+++.++++... .+.++ .++ +|. .+.+...++. +..+.|. .+.-+-.+..|-++|-.. T Consensus 79 gil~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qf-le~rk~~~~eIa~~fgVPp 157 (219) T protein:vir:98 79 FILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEF-ANIKNISAQDVLTSHRFPP 157 (219) T ss_pred eEEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCHHHHHH-HHHHHhhHHHHHHHhCCCH Confidence 543 455555553211 11110 012 121 1222222222 1234443 334444566688888321 Q ss_pred -hhccCCCCC---CCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhh-c--cceeeeehH Q lcl|NC_020414. 356 -TMTRRDAER---VTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSEL-V--DPVIVTGIE 422 (515) Q Consensus 356 -~l~~~~~~~---~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~-~--~~~~v~~l~ 422 (515) ++...+..+ -++++. ...=....|.|.+.++..+|-.= ..+|+.+ + +-...+-.. T Consensus 158 ~~lG~~~~~~~~~sn~eq~--~~~f~~~tL~P~~~~ie~~ln~~----------~~~~~~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 158 GLSGIIPVNTAGLGDPLKI--REAYQADEVLPLQEIIAESINSD----------YEIKSALKVNFKQPEKRDKN 219 (219) T ss_pred HHcccccCCCCCccCHHHH--HHHHHHHHHHHHHHHHHHHhhhh----------hcCCCccEEeecCcccccCC Confidence 122222222 244433 22445566777777777766321 1112111 1 000011111 No 132 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=76.72 E-value=0.13 Score=25.40 Aligned_cols=387 Identities=10% Similarity=0.032 Sum_probs=146.6 Q ss_pred ccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCcc---ccc-cccccHHHHHHHHHHHHHHhhcCCC Q lcl|NC_020414. 6 LEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNET---SQN-GWQGVGAQATNHLANKLAQVLFPAQ 81 (515) Q Consensus 6 ~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~~---~~~-~~dst~~~a~~~Laa~l~s~ltpp~ 81 (515) |- |+++-.+|+..-....-.............+.... ... .-.++--.|++.+|+.+.+ T Consensus 1 m~-----------~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~------ 63 (416) T protein:vir:12 1 ML-----------LERMFEKRSGSSDHEDGFNNILLNMFGGRKTASGERVSESNSLVQPDIFACVNVLSDDIAK------ 63 (416) T ss_pred Cc-----------cchhcccccCccccCccchhHHHHhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhh------ Confidence 33 33333334332111000111111111111110000 011 1223333456655555542 Q ss_pred CCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHH-HhcC----CHHHHHHHHHHHHhhCceEEEEeCCC-c-- Q lcl|NC_020414. 82 RSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKAL-EQRQ----FRPAIVEVFKHLIVAGNCLLYKPSKG-A-- 153 (515) Q Consensus 82 ~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l-~~sn----f~~~~~~~~~dl~~~G~~~l~~d~~~-~-- 153 (515) -||--....+....+. ....| +..| .+-| .+.=+...+.++..+||+.+|+..+. + T Consensus 64 l~~~~~~~~~~~~~~~---------~~~~l-------~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~ 127 (416) T protein:vir:12 64 LPIHTYKRTDGGIERK---------PEHKS-------AHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYP 127 (416) T ss_pred CceEEEEecCCccccc---------cccHH-------HHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcE Confidence 2442222221111100 00111 1112 2222 33445666778888999998875432 2 Q ss_pred EEEEEcce--EEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCC Q lcl|NC_020414. 154 MSAVPMHH--YVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADD 231 (515) Q Consensus 154 ~r~~pl~~--y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~ 231 (515) ...|||.. .-+..+..+.. +| +.+..+| T Consensus 128 ~~L~~l~~~~v~v~~~~~~~~--------------------------------------~~------------~~~~~~g 157 (416) T protein:vir:12 128 EALFPLRPDYTNAYVHPTTGM--------------------------------------LW------------YQTVLNG 157 (416) T ss_pred EEEEEECCcceEEEEeCCCcE--------------------------------------EE------------EEEecCC Confidence 34566632 22222222210 00 1111222 Q ss_pred eeecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhcc--- Q lcl|NC_020414. 232 IPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFV--- 308 (515) Q Consensus 232 ~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~--- 308 (515) ..+ .|+. .-+++.|+...+ ..||.||..-+...+.......+.......-...|..++.-++.++++... T Consensus 158 ~~~----~~~~--~eiih~~~~~~~-~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~ 230 (416) T protein:vir:12 158 KAI----ELYD--YEVLHFKGLSTD-GIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPAFLDEKPKENVR 230 (416) T ss_pred eEE----EecC--ccEEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHH Confidence 211 1222 235667765444 489999999999999998888888888888888887777766666655321 Q ss_pred -------CCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHH--Hhhc-cCCCCCCCHHHHHHHHHHHH Q lcl|NC_020414. 309 -------NSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMM--ETMT-RRDAERVTAVEIQRDALEIE 378 (515) Q Consensus 309 -------~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~--~~l~-~~~~~~~TAtEi~~r~~E~~ 378 (515) .++.-.++++.. +..++.. +..+.+.+ +..+..+..|-++|-. ..+. ..++..-++++.. ..=.. T Consensus 231 ~~~~~~~~~~~~~vl~~g~-~~~~l~~-~~~d~q~~-e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~--~~f~~ 305 (416) T protein:vir:12 231 KEWKRVNKVENIAIIDYGL-EYQSISM-PLQEAQFV-ESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQS--IEYVR 305 (416) T ss_pred HHHHHHhcCCCeeecCCCc-eEEEccC-ChhhHHHH-HHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHH--HHHHH Confidence 222222233322 2333332 23455543 4456667788888832 1122 1122222333332 12223 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeee-hHHHHHHHHHHHHHHHHHHHHHhhcC-ChHHHhcCCH Q lcl|NC_020414. 379 QNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTG-IEALGRMAELDKLANFAQYMSLPQTW-PEPAQRAIRW 456 (515) Q Consensus 379 ~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~-l~~l~ra~~~~~l~~~~~~v~~~a~~-~p~~~d~id~ 456 (515) ..|.|...++.+||-.-|+ ++ .+.....++.. ++.|.|+-..+.....-..+.. .-+ +-++...++. T Consensus 306 ~~l~P~~~~ie~~l~~~l~--------~~--~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~-G~~T~NE~R~~~gl 374 (416) T protein:vir:12 306 NTLQPWIVNFEQELNVKLF--------LD--HDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHET-GVLNKDEIRELLER 374 (416) T ss_pred HHHHHHHHHHHHHHHHhhc--------Cc--hhhcCCceEEeechhhhccCHHHHHHHHHHHHhC-CCcCHHHHHHHhCC Confidence 3455555555554432222 11 11111112222 2223222111111111111110 001 1122222222 Q ss_pred HHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 457 GDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 457 d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) +.+ +. ++.+-+|..++.- +.+. + . +.. .++.+.. -|+..-+| T Consensus 375 ~Pi-~g-gd~~~~~~n~~~~-~~~~---~-----~---~~~-~~~~~~~--gge~~~~g 416 (416) T protein:vir:12 375 NPI-EN-GDKYISSLNYVFL-DFLE---E-----Y---QRL-KAGGAMK--GGDNKNEG 416 (416) T ss_pred CCC-CC-cceeeeccccccc-cccc---h-----h---hcc-ccccccC--CCCCcCCC Confidence 111 00 1111111111111 0000 0 0 000 1111111 14444555 No 133 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=76.51 E-value=0.13 Score=25.36 Aligned_cols=361 Identities=11% Similarity=0.082 Sum_probs=141.7 Q ss_pred HHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCCCCc-ccc-ccc-cccHHHHHHHHHHHHHHhhcCCCCCceecCCChH Q lcl|NC_020414. 16 PKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNE-TSQ-NGW-QGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAK 92 (515) Q Consensus 16 ~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~~~~-~~~-~~~-dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~ 92 (515) -+.|+.++..++.-...-.+...++.|........+. -+. ... .++--.|++.+|+.+.+. |+ ++. +. T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~------p~-~~~--~~ 71 (386) T protein:vir:48 1 MPIFNITNLATESPPISQGGFFDITDPDFLSTLNGSEWVSAESALRNSDLFSIINQLSNDLATV------KL-TAS--RK 71 (386) T ss_pred CcccccccccccccccccccccccccchhcccccCCceechhhhhcchHHHHHHHHHHHhhccC------ce-eec--cc Confidence 3345555544443332222222233232221111111 111 111 233334566666555442 22 111 10 Q ss_pred HHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHH----HHHHHHHHHHhhCceEEEEeCCCc---EEEEEc--ceEE Q lcl|NC_020414. 93 GEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRP----AIVEVFKHLIVAGNCLLYKPSKGA---MSAVPM--HHYV 163 (515) Q Consensus 93 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~----~~~~~~~dl~~~G~~~l~~d~~~~---~r~~pl--~~y~ 163 (515) . . ...+.+-|.+. -+..++.++...||+.+++..+.. ...+|+ ..+. T Consensus 72 ~-------------~-----------~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~ 127 (386) T protein:vir:48 72 Q-------------L-----------QGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVS 127 (386) T ss_pred h-------------h-----------HHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeE Confidence 0 0 11233444433 344556677788999988755422 334444 4454 Q ss_pred EeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCcccc Q lcl|NC_020414. 164 VNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRIKAE 243 (515) Q Consensus 164 v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy~~~ 243 (515) +.++..|... .+++..++........|+.+ T Consensus 128 v~~~~~~~~~--------------------------------------------------~y~~~~~~~~~~~~~~~~~~ 157 (386) T protein:vir:48 128 FNRLDNKDGI--------------------------------------------------YYNITFDDPRIPPKQHVPQG 157 (386) T ss_pred EEEcCCCceE--------------------------------------------------EEEEEecCccccceeEecCc Confidence 5554433210 11111111111111122222 Q ss_pred cCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCC---------CCcc Q lcl|NC_020414. 244 KLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNS---------GTGE 314 (515) Q Consensus 244 ~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~---------~~g~ 314 (515) -+++.|.....+..||.||..-+...+.....+.+.......-...|..++..++.++.+....- ..|. T Consensus 158 --evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~e~~~~~~~~~~~~~~n~g~ 235 (386) T protein:vir:48 158 --DVLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQGG 235 (386) T ss_pred --cEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHhhcCCCC Confidence 24666666667779999999999999999999999999888888888877776666655432110 0111 Q ss_pred e--ecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHH Q lcl|NC_020414. 315 V--ITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAM 390 (515) Q Consensus 315 ~--~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~ 390 (515) + +++. -++.++.. +..+.+. .+..+..++.|-.+|=.. .+.. .+..-+++|- ...-....|.|.+..+.. T Consensus 236 ~~vl~~g-~~~~~l~~-~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~-~~~~~~~e~~--~~~~~~~~l~P~~~~ie~ 309 (386) T protein:vir:48 236 PLVLDDL-EEFTPLEI-KSNVSQL-LKQADWTTGQFAKVYGIPENVVGG-QGDQQSSLEM--SLDLYNKAVSRYLRPFLS 309 (386) T ss_pred ceecCCC-ceEEEcCC-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCC-CCCcccHHHH--HHHHHHHHHHHHHHHHHH Confidence 1 1111 12222221 1233332 455566677888887221 1111 1111122222 112234445666666655 Q ss_pred HHHHHHHHHHHHhcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCc Q lcl|NC_020414. 391 TMQTPIAMWGLQEAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAE 470 (515) Q Consensus 391 E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp 470 (515) ||-.-|..++ ..+.....-.+...++.....+.+ .+.+..+++ +.....-|++ T Consensus 310 ~l~~~l~~~~-----------~~~~~~~~~~d~~~~~~~~~~l~~---------------~g~~t~nE~-r~~lg~~~~~ 362 (386) T protein:vir:48 310 ELSQKLSCDV-----------DADILPAVDPTGSNSVSRINSMVK---------------SGTLAQNQG-LYILQQAEIL 362 (386) T ss_pred HHHHhhcchh-----------hcchhhhhccChHHHHHHHHHHHh---------------CCCcCHHHH-HHHhhcCCCC Confidence 5533332111 000000000111122222211110 011111111 1111111111 Q ss_pred hhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchh Q lcl|NC_020414. 471 LPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVI 508 (515) Q Consensus 471 ~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~ 508 (515) ..-++.-+. . -..-.++.. ..+-. T Consensus 363 ~~~~~~~~~---~---------~~~~~~gGd--~~~~~ 386 (386) T protein:vir:48 363 PKELPEGEN---P---------NKTTLKGGE--INGED 386 (386) T ss_pred CccchhhcC---C---------CCCccCCCC--CCCCC Confidence 110000000 0 000000000 00000 No 134 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=75.18 E-value=0.15 Score=25.11 Aligned_cols=413 Identities=10% Similarity=0.011 Sum_probs=148.7 Q ss_pred HHHHHHHHHhhhh------HHHHHHHHHHhhcccccCCCCCCc-cccccccccHH-HHHHHHHHHHHHhhcCCCCCceec Q lcl|NC_020414. 16 PKLWEKFSKKRSP------YLDRAKHFAKLTLPYLMNNKGDNE-TSQNGWQGVGA-QATNHLANKLAQVLFPAQRSFFRV 87 (515) Q Consensus 16 ~~r~~~lk~~R~~------~e~~w~e~~~~~~P~~~~~~~~~~-~~~~~~dst~~-~a~~~Laa~l~s~ltpp~~~WFrl 87 (515) -..|+.|..+... -...|..+.-.+........++.. ....-....+. .|++.+|..+. .+ ||.-. T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA-~l-----p~~~~ 74 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPHDALQVSAVFASVRLLSETIA-TL-----PLSTY 74 (457) T ss_pred CchhhhhhccccccccccccccccccchhhhhhccccccCCceechHHhhccHHHHHHHHHHHHhHh-hC-----ceEEE Confidence 3344444332111 001111111111110000000000 00011222223 35555554443 22 44211 Q ss_pred CCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHh----cCCHHHHHHHHHHHHhhCceEEEEeCCCc--EEEEEc-- Q lcl|NC_020414. 88 DLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQ----RQFRPAIVEVFKHLIVAGNCLLYKPSKGA--MSAVPM-- 159 (515) Q Consensus 88 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~----snf~~~~~~~~~dl~~~G~~~l~~d~~~~--~r~~pl-- 159 (515) ...+... .+++. ..+...+.+ -+.+.-+..+..++...||+.+++....+ ...+|| T Consensus 75 ~~~~~~~----------~~~~~------~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~~g~~~~l~~l~p 138 (457) T protein:vir:62 75 SKRGGTR----------KEIDT------PEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWAGPNIAGLDVLDP 138 (457) T ss_pred EecCCcc----------ccccc------hHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCcEEEEEEEcC Confidence 1111100 01110 011111222 23566677778888889999988854332 334555 Q ss_pred ceEEEeeCCCCCe-eEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccC Q lcl|NC_020414. 160 HHYVVNRDTNGDL-MDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKEN 238 (515) Q Consensus 160 ~~y~v~~d~~G~v-d~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~es 238 (515) ..+.+.++..+.. ..+|+ .+.+..++.... -. T Consensus 139 ~~v~v~~~~~~~~~~~~~~----------------------------------------------~y~~~~~g~~~~-~~ 171 (457) T protein:vir:62 139 TKIHVHMVMVDGLRRKVFE----------------------------------------------AYDIDADGNEVL-LG 171 (457) T ss_pred cceEEEEeccCCccceeEE----------------------------------------------EEEEccCCceeE-EE Confidence 2333333322211 01111 111111111110 01 Q ss_pred CcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCC------- Q lcl|NC_020414. 239 RIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSG------- 311 (515) Q Consensus 239 gy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~------- 311 (515) .|..++ +++.|.....|..||.||..-+...+.....+.+.......-...|..++.-++.++++...... T Consensus 172 ~~~~~e--iih~r~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~ 249 (457) T protein:vir:62 172 WFTPRD--VLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGTMSEEGLARAREAWRAAN 249 (457) T ss_pred eeCccc--eEEecCCCCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCCCCHHHHHHHHHHHHHHh Confidence 111122 46667666667789999999999999888888888888777777887766666666655432111 Q ss_pred ----C-c--ceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhccCCCCCCCHHHHHHHHHH-HHHHh Q lcl|NC_020414. 312 ----T-G--EVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTRRDAERVTAVEIQRDALE-IEQNM 381 (515) Q Consensus 312 ----~-g--~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~~~~~~~TAtEi~~r~~E-~~~~L 381 (515) + | .++++.. +..++.+ +..|.+. .+..+..+..|-++|-.- ++...+....+..-+.+.... ....| T Consensus 250 ~G~~nag~~~vl~~g~-~~~~l~~-~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~~l 326 (457) T protein:vir:62 250 SGVDNAHRVALLTEGA-KFSKVAM-SPDEAQF-LQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFSL 326 (457) T ss_pred cCccccCcceecCCCc-eEEEccC-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHHH Confidence 0 1 1112111 1222221 1234443 344456677788888321 122222222222333222222 23345 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeee-hHHHHHHHHHHHHHHHHHHHHHhhcC-ChHHHhcCCHHHH Q lcl|NC_020414. 382 GGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTG-IEALGRMAELDKLANFAQYMSLPQTW-PEPAQRAIRWGDY 459 (515) Q Consensus 382 Gpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~-l~~l~ra~~~~~l~~~~~~v~~~a~~-~p~~~d~id~d~~ 459 (515) .|.+.++..+|-.=|+ ++.. .+..++.+ ++.|-|+--.+........++. ..+ +-++...++.+.+ T Consensus 327 ~P~~~~ie~~ln~~L~--------~~~~---~~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~-G~~T~NE~R~~~gl~pi 394 (457) T protein:vir:62 327 RPWLERIEAGFNRLLF--------AETA---DRFRFVKFNLDEIKRGAPKERMELWSLGLQN-GIYSIDEVRAAEDMTPL 394 (457) T ss_pred HHHHHHHHHHHHhhhc--------Cccc---cCceEEEeechhhhccCHHHHHHHHHHHHhC-CCcCHHHHHHHhCCCCC Confidence 6666666555433222 1111 11122322 2233332111111111111110 111 1223333333222 Q ss_pred HHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhh-----hccchhhhhhccC Q lcl|NC_020414. 460 MDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAK-----AVPGVIQQEMKEG 515 (515) Q Consensus 460 ~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~-----a~~~~~~~~~~~~ 515 (515) =.-.++.+-+|..+....++.+..-+....+.+ ....+.+.. ..+.+-..+-++| T Consensus 395 ~~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~d~~~~~~~ 454 (457) T protein:vir:62 395 PDGLGEKYRVPLNLGEIGEEPEPEPAPAPPAID-PPAEEPADDEEPDNAEGDPDEGETEDD 454 (457) T ss_pred CCCCcceeeeccccccccccccccccCCCccCC-CCccCCCCCCCCCCCCCCCcccccccc Confidence 111222222333322211111100000000000 000000000 0011111222222 No 135 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=74.99 E-value=0.15 Score=25.07 Aligned_cols=324 Identities=9% Similarity=0.046 Sum_probs=121.7 Q ss_pred hhcccccCCC-CCCcc--c--cccccccHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHH Q lcl|NC_020414. 39 LTLPYLMNNK-GDNET--S--QNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFAR 113 (515) Q Consensus 39 ~~~P~~~~~~-~~~~~--~--~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ 113 (515) -+++. +..- ..+.. . ...+-+.+. ...+.+.++..+ ...++.... +....|. . T Consensus 1 m~m~~-f~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~----~~~v~~~~a-------l~~~~v~----~ 58 (392) T protein:vir:39 1 MILPI-LNFINQTNDPPEVGSVQSYFPDGN------DAQIMESLLGDN----NEWVSARAA-------LRNSDLF----S 58 (392) T ss_pred Ccchh-hhhhhcccccccccccccccccCc------hhhhhhhhcCCC----CceechHHh-------hccHHHH----H Confidence 11221 1110 00000 0 000100000 000111111100 001111000 0011111 1 Q ss_pred HHHHHHHH----------------HHhcCC----HHHHHHHHHHHHhhCceEEEEeCCC--c-EEEEEc--ceEEEeeCC Q lcl|NC_020414. 114 VETTAMKA----------------LEQRQF----RPAIVEVFKHLIVAGNCLLYKPSKG--A-MSAVPM--HHYVVNRDT 168 (515) Q Consensus 114 ve~~~~~~----------------l~~snf----~~~~~~~~~dl~~~G~~~l~~d~~~--~-~r~~pl--~~y~v~~d~ 168 (515) |-..+... +.+-|- +.=+..++.++..+||+.+++..+. . ...+|+ .+.-+..+. T Consensus 59 ~i~~ia~~ia~lp~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~ 138 (392) T protein:vir:39 59 IILQLSSDLAIVKINAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFE 138 (392) T ss_pred HHHHHHHhhccCceeeccchhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcC Confidence 22222222 222232 4444556667778899888774332 1 344554 333333333 Q ss_pred CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCcccccCcEE Q lcl|NC_020414. 169 NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRIKAEKLPFI 248 (515) Q Consensus 169 ~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy~~~~~P~~ 248 (515) +|.. ..+.+..++......-.|+.++ ++ T Consensus 139 ~~~~--------------------------------------------------~~y~~~~~~~~~~~~~~~~~~e--ii 166 (392) T protein:vir:39 139 YENG--------------------------------------------------MYYNITFDDPKIEPILQAPQSD--LI 166 (392) T ss_pred CCce--------------------------------------------------EEEEEEecCcccceeEEEcccc--EE Confidence 2211 1111111111111111232222 57 Q ss_pred EEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCcc-ccChh--------hccCCCC-cc--ee Q lcl|NC_020414. 249 PLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGS-QTDVD--------HFVNSGT-GE--VI 316 (515) Q Consensus 249 ~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g-~~~~~--------~~~~~~~-g~--~~ 316 (515) +.|+...+|..||.||..-+...+.....+.+.......-...|..++.-.+ ....+ .+....+ |. ++ T Consensus 167 h~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl 246 (392) T protein:vir:39 167 HMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVL 246 (392) T ss_pred EecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeec Confidence 7788777788999999999999999999999988888888888876654222 21111 1111111 11 12 Q ss_pred cCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHH Q lcl|NC_020414. 317 TGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQT 394 (515) Q Consensus 317 ~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~ 394 (515) ++.. +..++.. +..+.+. .+..+..+..|-++|=.. .+. +...-|..+ .+...=....|-|.+.++.+|+-. T Consensus 247 ~~g~-~~~~l~~-~~~d~~~-~e~~~~~~~~Ia~~fgVpp~~lg--~~~~~~~~~-~~~~~f~~~~l~P~~~~ie~~l~~ 320 (392) T protein:vir:39 247 DDLE-EFTALEI-KSNVAQL-LSQTDWTSKQYAKVYGLPDSYIG--GQGDQQSSI-QQISGMYASALNRYLRPAISELEY 320 (392) T ss_pred CCCc-eEEEccC-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHhC--CCCCcccHH-HHHHHHHHHHHHHHHHHHHHHHHH Confidence 2211 2223322 2234443 355666777888888221 121 112222211 112223445677777777776643 Q ss_pred HHHHHHHHhcCCCCChhhccceeeeehHHHHHHHHHHHH-----------H-------------HHHHHHHHhhcCChHH Q lcl|NC_020414. 395 PIAMWGLQEAGDSFTSELVDPVIVTGIEALGRMAELDKL-----------A-------------NFAQYMSLPQTWPEPA 450 (515) Q Consensus 395 Pli~r~~~~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l-----------~-------------~~~~~v~~~a~~~p~~ 450 (515) =|...+ .++...+-..+...++....++ . +....+..+...+. T Consensus 321 ~L~~~~-----------~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~e~l~~~~~Gd~-- 387 (392) T protein:vir:39 321 KLSDHI-----------SVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQS-- 387 (392) T ss_pred hccccc-----------cccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccchhcCCCCCCCCCC-- Confidence 332110 0000000000111111111110 0 00011222221110 Q ss_pred HhcCCH Q lcl|NC_020414. 451 QRAIRW 456 (515) Q Consensus 451 ~d~id~ 456 (515) ..--+ T Consensus 388 -~~p~p 392 (392) T protein:vir:39 388 -NEPVP 392 (392) T ss_pred -CCCCC Confidence 00000 No 136 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=74.99 E-value=0.15 Score=25.07 Aligned_cols=324 Identities=9% Similarity=0.046 Sum_probs=121.7 Q ss_pred hhcccccCCC-CCCcc--c--cccccccHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHH Q lcl|NC_020414. 39 LTLPYLMNNK-GDNET--S--QNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFAR 113 (515) Q Consensus 39 ~~~P~~~~~~-~~~~~--~--~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ 113 (515) -+++. +..- ..+.. . ...+-+.+. ...+.+.++..+ ...++.... +....|. . T Consensus 1 m~m~~-f~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~----~~~v~~~~a-------l~~~~v~----~ 58 (392) T protein:vir:10 1 MILPI-LNFINQTNDPPEVGSVQSYFPDGN------DAQIMESLLGDN----NEWVSARAA-------LRNSDLF----S 58 (392) T ss_pred Ccchh-hhhhhcccccccccccccccccCc------hhhhhhhhcCCC----CceechHHh-------hccHHHH----H Confidence 11221 1110 00000 0 000100000 000111111100 001111000 0011111 1 Q ss_pred HHHHHHHH----------------HHhcCC----HHHHHHHHHHHHhhCceEEEEeCCC--c-EEEEEc--ceEEEeeCC Q lcl|NC_020414. 114 VETTAMKA----------------LEQRQF----RPAIVEVFKHLIVAGNCLLYKPSKG--A-MSAVPM--HHYVVNRDT 168 (515) Q Consensus 114 ve~~~~~~----------------l~~snf----~~~~~~~~~dl~~~G~~~l~~d~~~--~-~r~~pl--~~y~v~~d~ 168 (515) |-..+... +.+-|- +.=+..++.++..+||+.+++..+. . ...+|+ .+.-+..+. T Consensus 59 ~i~~ia~~ia~lp~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~ 138 (392) T protein:vir:10 59 IILQLSSDLAIVKINAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFE 138 (392) T ss_pred HHHHHHHhhccCceeeccchhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcC Confidence 22222222 222232 4444556667778899888774332 1 344554 333333333 Q ss_pred CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCcccccCcEE Q lcl|NC_020414. 169 NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRIKAEKLPFI 248 (515) Q Consensus 169 ~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy~~~~~P~~ 248 (515) +|.. ..+.+..++......-.|+.++ ++ T Consensus 139 ~~~~--------------------------------------------------~~y~~~~~~~~~~~~~~~~~~e--ii 166 (392) T protein:vir:10 139 YENG--------------------------------------------------MYYNITFDDPKIEPILQAPQSD--LI 166 (392) T ss_pred CCce--------------------------------------------------EEEEEEecCcccceeEEEcccc--EE Confidence 2211 1111111111111111232222 57 Q ss_pred EEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCcc-ccChh--------hccCCCC-cc--ee Q lcl|NC_020414. 249 PLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGS-QTDVD--------HFVNSGT-GE--VI 316 (515) Q Consensus 249 ~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g-~~~~~--------~~~~~~~-g~--~~ 316 (515) +.|+...+|..||.||..-+...+.....+.+.......-...|..++.-.+ ....+ .+....+ |. ++ T Consensus 167 h~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl 246 (392) T protein:vir:10 167 HMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVL 246 (392) T ss_pred EecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeec Confidence 7788777788999999999999999999999988888888888876654222 21111 1111111 11 12 Q ss_pred cCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHH Q lcl|NC_020414. 317 TGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQT 394 (515) Q Consensus 317 ~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~ 394 (515) ++.. +..++.. +..+.+. .+..+..+..|-++|=.. .+. +...-|..+ .+...=....|-|.+.++.+|+-. T Consensus 247 ~~g~-~~~~l~~-~~~d~~~-~e~~~~~~~~Ia~~fgVpp~~lg--~~~~~~~~~-~~~~~f~~~~l~P~~~~ie~~l~~ 320 (392) T protein:vir:10 247 DDLE-EFTALEI-KSNVAQL-LSQTDWTSKQYAKVYGLPDSYIG--GQGDQQSSI-QQISGMYASALNRYLRPAISELEY 320 (392) T ss_pred CCCc-eEEEccC-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHhC--CCCCcccHH-HHHHHHHHHHHHHHHHHHHHHHHH Confidence 2211 2223322 2234443 355666777888888221 121 112222211 112223445677777777776643 Q ss_pred HHHHHHHHhcCCCCChhhccceeeeehHHHHHHHHHHHH-----------H-------------HHHHHHHHhhcCChHH Q lcl|NC_020414. 395 PIAMWGLQEAGDSFTSELVDPVIVTGIEALGRMAELDKL-----------A-------------NFAQYMSLPQTWPEPA 450 (515) Q Consensus 395 Pli~r~~~~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l-----------~-------------~~~~~v~~~a~~~p~~ 450 (515) =|...+ .++...+-..+...++....++ . +....+..+...+. T Consensus 321 ~L~~~~-----------~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~e~l~~~~~Gd~-- 387 (392) T protein:vir:10 321 KLSDHI-----------SVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQS-- 387 (392) T ss_pred hccccc-----------cccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccchhcCCCCCCCCCC-- Confidence 332110 0000000000111111111110 0 00011222221110 Q ss_pred HhcCCH Q lcl|NC_020414. 451 QRAIRW 456 (515) Q Consensus 451 ~d~id~ 456 (515) ..--+ T Consensus 388 -~~p~p 392 (392) T protein:vir:10 388 -NEPVP 392 (392) T ss_pred -CCCCC Confidence 00000 No 137 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=73.35 E-value=0.17 Score=24.78 Aligned_cols=419 Identities=12% Similarity=0.107 Sum_probs=179.6 Q ss_pred CCCccccccccHHHHHHHHHHHHHhhhhH--HHHHHHHHHhhcccccCCC--CCCcccc-ccccccHHHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPY--LDRAKHFAKLTLPYLMNNK--GDNETSQ-NGWQGVGAQATNHLANKLAQ 75 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk~~R~~~--e~~w~e~~~~~~P~~~~~~--~~~~~~~-~~~dst~~~a~~~Laa~l~s 75 (515) |.=+--.. .=.....+|+.+ |.-+ ...+++...-.||..-..+ ....+.. -.|-+...+.++.+ ++ T Consensus 1 m~V~~~hp--~y~a~~~~W~~~---rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~----~G 71 (452) T protein:vir:94 1 MPIETKHP--EYLAYENDWIDC---RVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSAL----SG 71 (452) T ss_pred CCCCCcCH--HHHHHHHHHHHH---HHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHH----hc Confidence 54222111 223333344433 3222 2344444444455321111 1111111 13444444455444 44 Q ss_pred hhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCC-Cc- Q lcl|NC_020414. 76 VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSK-GA- 153 (515) Q Consensus 76 ~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~-~~- 153 (515) .+|. ..| .++.++. +..+.. -....+.+.-+...+.+...+|-+.+++|.. .+ T Consensus 72 ~vf~-k~p--~~~~p~~--------------l~~~~~--------D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~ 126 (452) T protein:vir:94 72 MVLD-QPP--VITHPDA--------------MSKYFE--------DQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGG 126 (452) T ss_pred hhhc-CCc--eecccHH--------------HHHHHh--------cccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCC Confidence 4443 212 2232221 122211 2457788888999999999999999999854 33 Q ss_pred ---EEEEEcce-EEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEE- Q lcl|NC_020414. 154 ---MSAVPMHH-YVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQS- 228 (515) Q Consensus 154 ---~r~~pl~~-y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e- 228 (515) +..|+-.+ .=...|..|+..-+..+++..+++-..+|+. +.++.|.+.... +|.|.++.. T Consensus 127 rPy~~~~~~~~Ii~W~~~~~g~l~~v~lre~~~~~d~~d~f~~--------------~~~~~yRvL~l~-~g~~~v~~~~ 191 (452) T protein:vir:94 127 DPYISVYTTENILNWEEDEDGRLLMVVLREFYTVRDTADRYVQ--------------NIRVRYRCLELV-DGLLQITVHE 191 (452) T ss_pred ceEEEEechhhhcCccccccCCeeEEEEEEEEEEecCCCcccc--------------eeEEEEEEEEEe-CCeEEEEEEE Confidence 33444333 1233455666655555666555554555653 233344444333 233443321 Q ss_pred -eCCe-------eecccCCcccccCcEEEEeeeecCCCcc--ccchHHHHHHHHHHHHH----HHHHHHHHHHHhccCce Q lcl|NC_020414. 229 -ADDI-------PVGKENRIKAEKLPFIPLTWKRSYGEDW--GRPLVEDYSGDLFVIQF----LSEAVARGAALMADIKY 294 (515) Q Consensus 229 -~~~~-------~i~~esgy~~~~~P~~~~Rw~~~~g~~Y--Grgp~~~~l~d~k~L~~----l~~~~~~~~~~a~~p~~ 294 (515) .++. .....+| +.+++|++.|-...+... |..| |=|+..||. .+-..-..+..+..|.. T Consensus 192 ~~~~~~~~~~~~~~~~~~~---~~l~~IP~v~~~~~~~~~~~~~pP----Ll~LA~ln~~hy~~~sd~~~~l~~~~~P~l 264 (452) T protein:vir:94 192 TQDGKVWELAKTSTIQNVG---VTMDYIPFFCITPSGLSMTPAKPP----MIDIVDINYSHYRTSADLEHGRHFTGLPTP 264 (452) T ss_pred ccCCceeeeccceeecCCC---cccceeEEEEEcCCCCCCCCCccc----hHHHHHHHHHHhcchhHHHHHHHHccccee Confidence 1111 1223344 246778888776666544 4445 335544442 23334444555555643 Q ss_pred eecCccccChhhccCCCCcce-ecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHH-HHH Q lcl|NC_020414. 295 LIRPGSQTDVDHFVNSGTGEV-ITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVE-IQR 372 (515) Q Consensus 295 l~~~~g~~~~~~~~~~~~g~~-~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtE-i~~ 372 (515) .+. |..+-..+.-+.+..+ .|........++. .+..+......|+++++.++++= . .+........|++| ... T Consensus 265 ~~~--g~~~~~~i~iG~~~~~~lpe~~~~~~yie~-~g~~i~~~~~~l~~le~~m~~~G-a-~ll~~~~~~~~s~ea~~~ 339 (452) T protein:vir:94 265 WIT--GAESQSTMHIGSTKAWVIPEVAAKVGFLEF-TGQGLQSLEKALSEKQAQLASLS-A-RLIDNSTRGSEATETVKL 339 (452) T ss_pred Eee--cCcCCCceEecccccccCCCCCCcceEEcc-CchhHHHHHHHHHHHHHHHHHHH-H-HhhccCCCcchHHHHHHH Confidence 332 2222223333332222 2221223444443 35567888888988888886641 1 22233333344554 445 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCC-hhhccceee-eehHHHHHHHHHHHHHHHHHHHHHhhcCChHH Q lcl|NC_020414. 373 DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFT-SELVDPVIV-TGIEALGRMAELDKLANFAQYMSLPQTWPEPA 450 (515) Q Consensus 373 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p-~~~~~~~~v-~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~ 450 (515) +....-..|.-+..++..-+. =++.++..=.+.... ...++..++ ..+.+ +.+..+ ++.+ . T Consensus 340 ~~~~~~s~L~~~a~~~e~al~-~~l~~~a~w~g~~~~~~v~~n~dF~~~~~~~----~~~~al---~~~~---------~ 402 (452) T protein:vir:94 340 RYMSETASLKSVTRAVEALLN-KAYSCIMDMESMGGTLNIKLNSAFLDSKLTA----AELKAW---VEAY---------L 402 (452) T ss_pred HHHHhhHHHHHHHHHHHHHHH-HHHHHHHHHcCCCCceEEEeccccccccCCH----HHHHHH---HHHH---------h Confidence 555555778887777766553 233332221111211 111223332 22221 222222 2211 1 Q ss_pred HhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 451 QRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 451 ~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) ...|....+.+.+-+ .||+ ..++|.+.+..++.. +.+ .+.+.+..-| T Consensus 403 ~G~is~~t~~~~L~~-~gvl----~~~~e~~~i~~E~~~--------~~~-----~~~~~~~~~~ 449 (452) T protein:vir:94 403 SGGISKEIYIHALKV-GKVL----PPPGESMGVIPDPPA--------PEP-----SPSNTPPNPS 449 (452) T ss_pred cCCCcHHHHHHHHHh-CCCC----CCccCHHHHHHHhhc--------cCc-----ccCCCCCCCc Confidence 123434444444433 4553 223333332222110 011 1111111111 No 138 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=73.17 E-value=0.17 Score=24.75 Aligned_cols=313 Identities=9% Similarity=0.043 Sum_probs=119.3 Q ss_pred HHHhhcC------------CCCCceecCCChHHHhh---------hh-ccchhHHHHHHHHHHHHHHH------------ Q lcl|NC_020414. 73 LAQVLFP------------AQRSFFRVDLTAKGEKV---------LD-DRGLKKTQLATIFARVETTA------------ 118 (515) Q Consensus 73 l~s~ltp------------p~~~WFrl~~~d~~~~~---------~~-~~~~~~~~v~~~L~~ve~~~------------ 118 (515) |+.++|. .-..||.-. .++.+-. .. ..-+....|..-.+.+...+ T Consensus 1 m~m~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~~~~~ 79 (392) T protein:vir:74 1 MILPILNFINQTNDPPEAGSVQSYFPDG-NDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN 79 (392) T ss_pred CcchhhhhhhcccCcccccccccccccC-chhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHhhccCceeeccchh Confidence 3333321 000111000 0000000 00 00000111111111111111 Q ss_pred HHHHHhcCC----HHHHHHHHHHHHhhCceEEEEeCCC--c-EEEEEc--ceEEEeeCCCCCeeEEEEEEEecHHHHHHH Q lcl|NC_020414. 119 MKALEQRQF----RPAIVEVFKHLIVAGNCLLYKPSKG--A-MSAVPM--HHYVVNRDTNGDLMDVILLQEKALRTFDPA 189 (515) Q Consensus 119 ~~~l~~snf----~~~~~~~~~dl~~~G~~~l~~d~~~--~-~r~~pl--~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~ 189 (515) ...+.+-|- +.=+...+.++..+||+.+++..+. . ...+|| ..+-+..+.+|.. T Consensus 80 ~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~----------------- 142 (392) T protein:vir:74 80 QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENG----------------- 142 (392) T ss_pred hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCce----------------- Confidence 111222222 4444555667777888877764322 1 234444 3333333333311 Q ss_pred hcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCcccccCcEEEEeeeecCCCccccchHHHHH Q lcl|NC_020414. 190 TRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYS 269 (515) Q Consensus 190 ~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l 269 (515) ..+.+..++......-.|+.++ +++.|+...+|..||.||..-+. T Consensus 143 ---------------------------------~~y~~~~~~~~~~~~~~~~~~e--vih~~~~~~~~~~~G~s~i~~~~ 187 (392) T protein:vir:74 143 ---------------------------------MYYNITFDDPKIEPILQAPQSD--LIHMKLLSIDGGKTGISPLYSLR 187 (392) T ss_pred ---------------------------------EEEEEEecCCccceeEEEcCcc--EEEecCCCCCCccccccHHHHHH Confidence 1111111111111111222222 56667766778889999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCceeec-CccccChh--------hccCCCC-cc--eecCCcccccccccCCccchHHH Q lcl|NC_020414. 270 GDLFVIQFLSEAVARGAALMADIKYLIR-PGSQTDVD--------HFVNSGT-GE--VITGVEEDIHIVQLGKYADLTPI 337 (515) Q Consensus 270 ~d~k~L~~l~~~~~~~~~~a~~p~~l~~-~~g~~~~~--------~~~~~~~-g~--~~~g~~~~v~~~~~~~~~~l~~~ 337 (515) ..+.......+.......-...|..++. +++....+ .+..+.+ |. ++++.. .+.++.. +..+.+. T Consensus 188 ~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~-~~~~l~~-~~~d~q~- 264 (392) T protein:vir:74 188 RESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLE-EFTALEI-KSNVAQL- 264 (392) T ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecCCCc-eEEEccC-ChhHHHH- Confidence 9999999999999988888888876654 22222211 1211111 11 122211 2233322 2334443 Q ss_pred HHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee Q lcl|NC_020414. 338 SAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI 417 (515) Q Consensus 338 ~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~ 417 (515) .+..+..+..|-++|-...-..-+...-|.. +.+..+-....|.|.+.++.+|+-.=|...+ .++... T Consensus 265 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~-~e~~~~~~~~~l~p~~~~ie~~l~~~l~~~~-----------~~~~~~ 332 (392) T protein:vir:74 265 LSQTDWTSKQYAKVYGLPDSYIGGQGDQQSS-IQQISGMYASALNRYLRPAISELEYKLSDHI-----------SVNMRP 332 (392) T ss_pred HHHHHHHHHHHHHHhCCCHHHhCCCCCcccH-HHHHHHHHHHHHHHHHHHHHHHHHHhccchh-----------cccchh Confidence 4556667778888883211111111222221 1122223445667777776666533222110 000000 Q ss_pred eeehHHHHHHHHHHHHH------------------------HHHHHHHHhhcCChHHHhcCCH Q lcl|NC_020414. 418 VTGIEALGRMAELDKLA------------------------NFAQYMSLPQTWPEPAQRAIRW 456 (515) Q Consensus 418 v~~l~~l~ra~~~~~l~------------------------~~~~~v~~~a~~~p~~~d~id~ 456 (515) .--.+...++....++. +....+..+...+. ..--+ T Consensus 333 ~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pne~r~~enl~~~~~Gd~---~~p~p 392 (392) T protein:vir:74 333 AIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPENTNKKTTGQS---NEPVP 392 (392) T ss_pred hhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCccccchhcCCCCCCCCCC---CCCCC Confidence 00001111111111110 00011222222211 00000 No 139 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=73.15 E-value=0.17 Score=24.75 Aligned_cols=438 Identities=15% Similarity=0.061 Sum_probs=182.4 Q ss_pred CCCccccccccHHHHHHHHHHHH-Hhh-hhHHHHHHHHHHhhcccc--c----CCCCCC--------ccccccccccHHH Q lcl|NC_020414. 1 MQDTILEYGGQRSKIPKLWEKFS-KKR-SPYLDRAKHFAKLTLPYL--M----NNKGDN--------ETSQNGWQGVGAQ 64 (515) Q Consensus 1 ~~~~~~~~~~~~~~l~~r~~~lk-~~R-~~~e~~w~e~~~~~~P~~--~----~~~~~~--------~~~~~~~dst~~~ 64 (515) |-+|. +.-..+.+...+...- ... +....+.+.+.+|..-.- . ...++. ...-|+..+-+.. T Consensus 1 ~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~ 78 (537) T protein:vir:78 1 MTSPL--LNKPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTE 78 (537) T ss_pred CCccc--ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHH Confidence 76665 3335555555443321 111 222345556666654420 0 000100 0112456667777 Q ss_pred HHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCce Q lcl|NC_020414. 65 ATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNC 144 (515) Q Consensus 65 a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~ 144 (515) .++..++-|++. |+. ++..+.. ..++.+ .+...+ ..+|.....++.+++..+|.+ T Consensus 79 Ivd~~~~yl~G~--Pv~-----~~~~d~~----------~~e~~~-------~l~~~~-~~~~~~~~~el~~~~s~~G~a 133 (537) T protein:vir:78 79 LVDQLAQYLLSN--GVE-----VKVKDED----------NTQLDE-------ILQEYF-DEDFQATIDTLVTNASKKGFE 133 (537) T ss_pred HHHHHhhhhccc--Cce-----eecCcch----------hHHHHH-------HHHHHh-hccHHHHHHHHHHHHhhcCee Confidence 777777776553 332 2222211 111222 222223 467888889999999999998 Q ss_pred E--EEEeCCCcEEE--EEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEE------ Q lcl|NC_020414. 145 L--LYKPSKGAMSA--VPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTH------ 214 (515) Q Consensus 145 ~--l~~d~~~~~r~--~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~------ 214 (515) . +|.|+++.+++ ++-.+.+.--|..|....++|.+.....+-.... ...-..+++|+. T Consensus 134 y~~~y~de~~~~~~~~i~p~~~~pv~d~~~~~~~~~~~y~~~~~~~~~~~------------~~~~~~~evyt~~~i~~y 201 (537) T protein:vir:78 134 GIFARTTSEGKLKFQTVDGLTLIPVFDDYGVLKMIIRWYSEIRYSTKQQS------------TETIWHADVWNEEAVCYY 201 (537) T ss_pred EEEeeecCCCceEEEEEccceeEEEEcCCCCceeEEEEEeeeeccccccC------------cceEEEEEEEcCCcEEEE Confidence 5 56777776654 4434544445667888888887776533211110 011112222221 Q ss_pred EEEcCCCCe-------------EEEEEeCC----------eee-cccCCcccccCcEEEEeeeecCCCccccchHHHHHH Q lcl|NC_020414. 215 AQYAGEGFW-------------KINQSADD----------IPV-GKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSG 270 (515) Q Consensus 215 v~~~~~~~~-------------~~~~e~~~----------~~i-~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~ 270 (515) ... ..+.. .-++...+ ... ...-+| ..+|++.++= +.+|.|=.++..+ T Consensus 202 ~~~-~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~--g~iPvv~f~n-----n~~~~sd~e~v~~ 273 (537) T protein:vir:78 202 IQD-DEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSY--SKFPFQLLYN-----NKDGMSDVKRVKS 273 (537) T ss_pred Eec-CCcccccccccccccccccceeeeccccccccccccccccccccCC--cceeEEEecc-----CccCCCchhhhHH Confidence 111 01000 00111000 000 111233 3477766554 4678999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhccCceeecCccccChhhc-cCC-CCcce-ecCCcccccccccCCccchHHHHHHHHHHHHH Q lcl|NC_020414. 271 DLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHF-VNS-GTGEV-ITGVEEDIHIVQLGKYADLTPISAVLEVYTRR 347 (515) Q Consensus 271 d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~-~~~-~~g~~-~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~r 347 (515) -+-.++.+.-......+...+|.+.+.-.+..+...+ ... ..+.+ +.|..+++..+. ...+.......++.+++. T Consensus 274 LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l~~~~~i~v~~d~~~v~~l~--~~~~~~~~e~~ld~L~~~ 351 (537) T protein:vir:78 274 IIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNIKAKKMIGVNGDNAGMEIQT--VSIPYEARKAKMDIDVEN 351 (537) T ss_pred HHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHHHhhcCceeecCCCCceeEEE--ecCCHHHHHHHHHHHHHH Confidence 9999999888888888888888765542222221111 111 12333 344445555544 345667777888888887 Q ss_pred HHHHHHHHhhccCCCCCCCHHHHH-------HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcC-CCCChhhccceeee Q lcl|NC_020414. 348 IGVIFMMETMTRRDAERVTAVEIQ-------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAG-DSFTSELVDPVIVT 419 (515) Q Consensus 348 I~~afl~~~l~~~~~~~~TAtEi~-------~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~-~~~p~~~~~~~~v~ 419 (515) |-+.-+............|...+. .++.++++.++..+.++..-++ .++.-.. .......+.+.+.. T Consensus 352 I~~~s~~~~~~~~~~gn~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~-----~~~~~~~~~~~d~~~i~i~f~~ 426 (537) T protein:vir:78 352 IYRSGMGFNSTAVGDGNVTNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVV-----SDIALRGLGEYDSNDICFEIEP 426 (537) T ss_pred HHHhcCCCCCccccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH-----HHHhhcCCcccccceeeEEecc Confidence 754322111222233334554332 2344555555544444332111 1111111 11122223333322 Q ss_pred --ehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHH--HHHHHH Q lcl|NC_020414. 420 --GIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQ--AQQEAM 495 (515) Q Consensus 420 --~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~--~~q~~~ 495 (515) +.+-+..++-+.++ .+...+.. ..++.. ++ ++-+.++.+.+.++..+ ...... T Consensus 427 ~~P~n~~e~a~~~~~l-------~~~giiS~--------eT~l~~----~p----~vdd~e~ek~~~ee~~~~~~~~~~~ 483 (537) T protein:vir:78 427 HVLANELDIATTRKTE-------AETEALKI--------GNIMTV----AP----RIGDDETLKLIAEELDLDYNELKDA 483 (537) T ss_pred CCCCCHHHHHHHHHHH-------HhcCcchH--------HHHHHh----CC----CCCCHHHHHHHHHHHHhhhhhhhhh Confidence 22222222211111 10001111 111111 11 11111111111111000 000000 Q ss_pred HHHHhhhhc-cchhhhhhccC Q lcl|NC_020414. 496 LNEGVAKAV-PGVIQQEMKEG 515 (515) Q Consensus 496 ~~~~~~~a~-~~~~~~~~~~~ 515 (515) ..++..+.. ...-.+++.+| T Consensus 484 ~~~~~~~~~~~~~~~~~~~~~ 504 (537) T protein:vir:78 484 LAEQDAQSLDVSPDVQAMLDG 504 (537) T ss_pred hhhhcccccCcCcchhhhcCC Confidence 000000000 00001111111 No 140 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=71.60 E-value=0.19 Score=24.49 Aligned_cols=371 Identities=10% Similarity=0.056 Sum_probs=142.5 Q ss_pred ccccccHHHHHHHHHHHHH---hhhhHHHHHHHHHHhhcccccCCCCCCccc-cccccccHH-HHHHHHHHHHHHhhcCC Q lcl|NC_020414. 6 LEYGGQRSKIPKLWEKFSK---KRSPYLDRAKHFAKLTLPYLMNNKGDNETS-QNGWQGVGA-QATNHLANKLAQVLFPA 80 (515) Q Consensus 6 ~~~~~~~~~l~~r~~~lk~---~R~~~e~~w~e~~~~~~P~~~~~~~~~~~~-~~~~dst~~-~a~~~Laa~l~s~ltpp 80 (515) |- -|+.++. .|++-. ..+.|..+...+....+ ..-....+. .|++.+|+.+. T Consensus 1 MG----------~~~~~~~~~~~~~~~~-------~~~~~~~~~~~g~~~~~~~~al~~~~V~~~v~~Ia~~iA------ 57 (411) T protein:vir:81 1 MG----------WWSRLTRFFRPRNETV-------DMTNPLLLQWLGVDPDTPRNQLSEATYFACLKILSESLG------ 57 (411) T ss_pred Cc----------hHHHHHhhccCccccc-------ccchHHHHHHhcCcccChhhhhccHHHHHHHHHHHHhHh------ Confidence 22 1222211 111111 11112111111111111 111222223 34444444333 Q ss_pred CCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHH-hc----CCHHHHHHHHHHHHhhCceEEEEeCCCc-- Q lcl|NC_020414. 81 QRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALE-QR----QFRPAIVEVFKHLIVAGNCLLYKPSKGA-- 153 (515) Q Consensus 81 ~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~d~~~~-- 153 (515) +-||--..-.+....+ .. +..+...|+ +- +.+.=+...+.+|...||+.+++..+.+ T Consensus 58 ~lp~~~~~~~~~~~~~-----~~-----------~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~g~~ 121 (411) T protein:vir:81 58 KLPLKMYQKTERGIVK-----SD-----------REELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYSGPQL 121 (411) T ss_pred hCceeEEEecCCceee-----ec-----------ccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCce Confidence 2344322211111000 00 011122222 22 3344566667778889999988765432 Q ss_pred EEEEEc--ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCC Q lcl|NC_020414. 154 MSAVPM--HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADD 231 (515) Q Consensus 154 ~r~~pl--~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~ 231 (515) ...+|+ +.+.+..|..|.+.. ....+|+ +....+| T Consensus 122 ~~l~~l~~~~v~~~~~~~~~~~~--------------------------------~~~~~~~-----------~~~~~~g 158 (411) T protein:vir:81 122 QALWILPSQYVTIVVDDRGLLGE--------------------------------KNAIWYR-----------YNDPYDG 158 (411) T ss_pred EEEEEECCceEEEEEcCcccccc--------------------------------cceEEEE-----------EEecCCc Confidence 234444 555555565553110 0000111 1111122 Q ss_pred eeecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccC-- Q lcl|NC_020414. 232 IPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVN-- 309 (515) Q Consensus 232 ~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~-- 309 (515) ..+ .|+.+ -+++.|+....+..||.||..-+...+.......+.......-...|..++.-++.++++.... T Consensus 159 ~~~----~~~~~--eiih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~ 232 (411) T protein:vir:81 159 KMY----VFRND--EILHFKTSVTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQEARDRLV 232 (411) T ss_pred eEE----EEccc--cEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHH Confidence 211 12222 2577777666677899999999999999999998888888888888887766556555543211 Q ss_pred --------C-CC-cce--ecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhcc-CCCCCCCHHHHHHHH Q lcl|NC_020414. 310 --------S-GT-GEV--ITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTR-RDAERVTAVEIQRDA 374 (515) Q Consensus 310 --------~-~~-g~~--~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~-~~~~~~TAtEi~~r~ 374 (515) + .+ |.+ +++.. ++.++... ..+.+.+ +..+..+..|-.+|-.. .+.. .++..=++++.. . T Consensus 233 ~~~~~~~~g~~n~g~~~vl~~g~-~~~~l~~~-~~d~q~~-e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~--~ 307 (411) T protein:vir:81 233 KGFEQFANGSKNAGKIIPVPLGM-KLVPLDIK-LTDSQFF-ELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQN--L 307 (411) T ss_pred HHHHHHhcCccccCCceecCCCc-eEEEccCC-HHHHHHH-HHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHH--H Confidence 1 11 111 22211 23333221 2344433 44566678888888321 1221 222222333221 1 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhc-cceeeee-hHHHHHHHHHHHHHHHHHHHHHhhcCChHHHh Q lcl|NC_020414. 375 LEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELV-DPVIVTG-IEALGRMAELDKLANFAQYMSLPQTWPEPAQR 452 (515) Q Consensus 375 ~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~-~~~~v~~-l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d 452 (515) .=....|.|.+.++..+|-.-|+ +.... ...++.+ ++.|.|.--.+........++. . T Consensus 308 ~f~~~~l~P~~~~ie~~l~~~ll-----------~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~--g------- 367 (411) T protein:vir:81 308 AFYVDTLLYVLKQYEEEITYKIL-----------SNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQN--G------- 367 (411) T ss_pred HHHHHHHHHHHHHHHHHHHhhcC-----------ChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhC--C------- Confidence 11223344444444444322221 11111 1112332 2333332111111111111110 1 Q ss_pred cCCHHHHHHHHHHhcCCchh-----ccCCHHH--HHHHHHHHHHHHHHHHHHHHhhhhccchhhhhh Q lcl|NC_020414. 453 AIRWGDYMDWVRGQISAELP-----FLKSEEE--MQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEM 512 (515) Q Consensus 453 ~id~d~~~~~~a~~~Gvp~~-----~irs~ee--v~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~ 512 (515) .+.++++ -+.+|.|+- ++.+..- ++.+-+ + ..-||++ T Consensus 368 ~~t~NE~----R~~~gl~p~~ggD~~~~~~n~~pl~~~~~-------------~------~~kgGd~ 411 (411) T protein:vir:81 368 IMTPNEA----RDYLDMPADDYGNNLMANGNYIPLSMLGA-------------N------YGKGGDS 411 (411) T ss_pred CcCHHHH----HHHhCCCCCCCCCeeeeccCccchhhhhh-------------h------hccCCCC Confidence 1122222 122344321 1111100 000000 0 0012222 No 141 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=67.21 E-value=0.26 Score=23.83 Aligned_cols=404 Identities=9% Similarity=-0.066 Sum_probs=143.1 Q ss_pred ccccc--cccHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHH--HHHHHH-HHhcCCH Q lcl|NC_020414. 54 SQNGW--QGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVE--TTAMKA-LEQRQFR 128 (515) Q Consensus 54 ~~~~~--dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve--~~~~~~-l~~snf~ 128 (515) .+.+- +++.-.|++.+|..+. +-||- +......... .........+..+|...+ ..+..+ +....+. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia------~~p~~-i~~~~~~~~~-~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~ 72 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVA------GFGIN-IIPHPEAEDP-DRDGEQYERVWDFWFGDDSNWQVGPMESERATAT 72 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhh------cCCeE-EEEccCcccc-cchhhhhhhHHHHhhccCCCccccchhhHhhHHH Confidence 43332 3444467777777664 22332 2111100000 000001111111111110 000000 1122345 Q ss_pred HHHHHHHHHHHhhCceEEEEeCCC---cEEEEEcceEEE--eeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccC Q lcl|NC_020414. 129 PAIVEVFKHLIVAGNCLLYKPSKG---AMSAVPMHHYVV--NRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKC 203 (515) Q Consensus 129 ~~~~~~~~dl~~~G~~~l~~d~~~---~~r~~pl~~y~v--~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~ 203 (515) .-+..++.|+..+|||.+++-.+. .+..+||..-.| ..|..+.+.. .. T Consensus 73 ~~~~~~~~~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~~~~~---------------------------~~ 125 (467) T protein:vir:31 73 NVLQTAWTDYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERGFVQL---------------------------LE 125 (467) T ss_pred HHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecceeEee---------------------------cC Confidence 566778888889999998874332 245666643222 2222211100 00 Q ss_pred CCcccEEEEEEEEE-c-CCCCeEEEEEeCCeeecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 204 KEDDNVKLYTHAQY-A-GEGFWKINQSADDIPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEA 281 (515) Q Consensus 204 ~~~~~v~v~~~v~~-~-~~~~~~~~~e~~~~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~ 281 (515) .....+.+|...+. + .......+.+.........-.++.+ =+++.|.....+..||.+|..-++..+.......+- T Consensus 126 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~ 203 (467) T protein:vir:31 126 EKEKYFGVAGDRYQTNGNGDLDPVFVDADDGSTGTSVSNPAN--ELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDY 203 (467) T ss_pred CceeeEEeccccceeecccceeeeeeeeccccccceeEeccc--cEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHH Confidence 00001111111000 0 0011111222211111111122222 257777776778899999999999988777777766 Q ss_pred HHHHHHHhccCceeec-CccccChhhccCC-------------------------CCcceecCCcc----cccccccC-- Q lcl|NC_020414. 282 VARGAALMADIKYLIR-PGSQTDVDHFVNS-------------------------GTGEVITGVEE----DIHIVQLG-- 329 (515) Q Consensus 282 ~~~~~~~a~~p~~l~~-~~g~~~~~~~~~~-------------------------~~g~~~~g~~~----~v~~~~~~-- 329 (515) ......-...|..++. +++.++++..... +.-.++++... .+....+. T Consensus 204 ~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~ 283 (467) T protein:vir:31 204 NIDFFENDGVPRIAIIVKGAELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVG 283 (467) T ss_pred HHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEecccc Confidence 6665555666765543 4555554432110 00011111110 00000000 Q ss_pred CccchHHHHHHHHHHHHHHHHHHHHH--hhcc-CCCCCCC-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_020414. 330 KYADLTPISAVLEVYTRRIGVIFMME--TMTR-RDAERVT-AVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAG 405 (515) Q Consensus 330 ~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~-~~~~~~T-AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~ 405 (515) ...|.+ ..+..+..+..|.++|-.. .+.. .++..-| +++.. ..=....|.|.+.++..+|-.-|+...... T Consensus 284 ~~~d~q-f~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~s~~e~~~--~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~-- 358 (467) T protein:vir:31 284 IDEEAS-FLEFRGRNEHDILKVHDVPPVIAGVVESGAFSTDAEEQR--KEFAEETIQPKQHDFGELLYELVHKQGLDA-- 358 (467) T ss_pred ChhhHH-HHHHHHHHHHHHHHHhCCCHHHcccCCCCCcccCHHHHH--HHHHHHHHHHHHHHHHHHHHHhhcchhhcc-- Confidence 111222 1344455666788887321 1211 1221112 22221 122344566666666665544433111100 Q ss_pred CCCChhhccc--eeeeehHHHHHHHHHHHHHH----HHHHHHHhhcCChHHHh-cCCHHHHHHHHHHhcC--Cchhc--- Q lcl|NC_020414. 406 DSFTSELVDP--VIVTGIEALGRMAELDKLAN----FAQYMSLPQTWPEPAQR-AIRWGDYMDWVRGQIS--AELPF--- 473 (515) Q Consensus 406 ~~~p~~~~~~--~~v~~l~~l~ra~~~~~l~~----~~~~v~~~a~~~p~~~d-~id~d~~~~~~a~~~G--vp~~~--- 473 (515) .+.-++. ..+-..+...|+.-...+.+ +.+-+-.+-..+|- -+ .+....... +...| .|..- T Consensus 359 ---~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi-~d~~~~~~~~~~--~~~~~~~~~~~~~~~ 432 (467) T protein:vir:31 359 ---PDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDEFGFEPF-PEEHVYGGETLV--AEVTGGSGPGGGIGD 432 (467) T ss_pred ---CCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CcccccCCcccc--cccccccCCCCcccC Confidence 0001111 11222333344333222111 01111111122221 00 000000000 00001 01000 Q ss_pred ---cCCHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_020414. 474 ---LKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVP 505 (515) Q Consensus 474 ---irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~ 505 (515) =-.+++.+..-..-+....++++.+...+|.. T Consensus 433 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (467) T protein:vir:31 433 QIEQLVEDRADEIIDSYQADLETEQLIEIGANADS 467 (467) T ss_pred cCCCCCCCcccchHhhhhhccccchhhhhccccCC Confidence 00011111111111122233444444444444 No 142 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=65.76 E-value=0.28 Score=23.63 Aligned_cols=359 Identities=9% Similarity=0.061 Sum_probs=135.8 Q ss_pred HHHHHHHHHhhhhHHHHHHHHHHhhcccccCCCC-CCc-cccccccccHH-HHHHHHHHHHHHhhcCCCCCceecCCChH Q lcl|NC_020414. 16 PKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKG-DNE-TSQNGWQGVGA-QATNHLANKLAQVLFPAQRSFFRVDLTAK 92 (515) Q Consensus 16 ~~r~~~lk~~R~~~e~~w~e~~~~~~P~~~~~~~-~~~-~~~~~~dst~~-~a~~~Laa~l~s~ltpp~~~WFrl~~~d~ 92 (515) -+.|+.++..+++....-.....+..+....... +.. .........+. .|++.+|+.+ +.+ |+. +. +. T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~i-a~~-----p~~-~~--~~ 71 (386) T protein:vir:49 1 MPIFNITNLATESPPINQESFFDIADSDFLASLNSSEWVSAENALKNSDLFSIISQLSNDL-ATA-----KIT-TS--RK 71 (386) T ss_pred CchhhhhccCCCCcccchhhhhhhhhccccccccCCceechhhhhccHHHHHHHHHHHHHh-hhC-----cee-ec--cc Confidence 3335555555544322212222233222221111 111 11112222233 3444444433 332 221 11 11 Q ss_pred HHhhhhccchhHHHHHHHHHHHHHHHHHHHHhc----CCHHHHHHHHHHHHhhCceEEEEeCCCc---EEEEEc--ceEE Q lcl|NC_020414. 93 GEKVLDDRGLKKTQLATIFARVETTAMKALEQR----QFRPAIVEVFKHLIVAGNCLLYKPSKGA---MSAVPM--HHYV 163 (515) Q Consensus 93 ~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~dl~~~G~~~l~~d~~~~---~r~~pl--~~y~ 163 (515) .. +.. +.+- +.+.=+..++.++...||+.+++..+.. ...+|+ +.+- T Consensus 72 ~~-------------~~l-----------~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~ 127 (386) T protein:vir:49 72 QL-------------QGI-----------VDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVS 127 (386) T ss_pred hh-------------hhh-----------hhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeE Confidence 10 111 1122 3344556667788889999988754321 334444 3444 Q ss_pred EeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCcccc Q lcl|NC_020414. 164 VNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRIKAE 243 (515) Q Consensus 164 v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy~~~ 243 (515) +..+.+|... + | .+..++........|+.+ T Consensus 128 v~~~~~~~~~--~-----------------------------------y-------------~~~~~~~~~~~~~~~~~~ 157 (386) T protein:vir:49 128 FNRLDNQNGL--Y-----------------------------------Y-------------NITFDDPHIAPKQHVPQN 157 (386) T ss_pred EEEcCCCceE--E-----------------------------------E-------------EEEEcCccccceeEEccc Confidence 4444332211 0 0 011111111111112112 Q ss_pred cCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhh----------ccCCCCc Q lcl|NC_020414. 244 KLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDH----------FVNSGTG 313 (515) Q Consensus 244 ~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~----------~~~~~~g 313 (515) -+++.|+....+..||.||..-+...+.......+.......-...|..++.-++..+.+. ... ..| T Consensus 158 --evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~-n~g 234 (386) T protein:vir:49 158 --DILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQ-MQG 234 (386) T ss_pred --cEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHHHHHHHHHHHHhcc-CCC Confidence 2577777777788999999999999999999998888888888888887665444444421 111 112 Q ss_pred ce--ecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhccCCCCCCCHHHHHHHHHHHHHHhhhhHHHHH Q lcl|NC_020414. 314 EV--ITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTRRDAERVTAVEIQRDALEIEQNMGGVYSLFA 389 (515) Q Consensus 314 ~~--~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~~~~~~~TAtEi~~r~~E~~~~LGpv~~rl~ 389 (515) .+ +++.. ++.++.. +..+.+ ..+..+..+..|-++|-.. .+.......-+++.+. +-....+-|.+..+. T Consensus 235 ~~~vl~~g~-~~~~l~~-~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~---~~~~~~i~~~l~~i~ 308 (386) T protein:vir:49 235 GPLVLDDLE-DFTPLEI-KSNVAQ-LLSQADWTTGQFAKVYGIPESIVGGDGDQQSSLEMIY---NIYFKSVSRYLRPFV 308 (386) T ss_pred CceecCCCc-eEEEccC-ChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCccchHHHHH---HHHHHHHHHHHHHHH Confidence 21 22211 2333322 123334 2445677788888888321 1221122222332221 111222333333333 Q ss_pred HHHHHHHHHHHHHhcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCC Q lcl|NC_020414. 390 MTMQTPIAMWGLQEAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISA 469 (515) Q Consensus 390 ~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gv 469 (515) .+|-. ..+.. ..++.......+...++...+.+. . .+.+..+++-..+ ...|+ T Consensus 309 ~~~~~--------~l~~~---~~~~~~~~~~~d~~~~~~~~~~l~----------~-----~g~~t~nE~r~~l-~~~~~ 361 (386) T protein:vir:49 309 SEMSK--------KLSCE---VDVDISPAVDPTGSNYISLINSMV----------K-----SGTLAQNQGLYIL-QQAEI 361 (386) T ss_pred HHHHH--------Hhcch---hcccchhhhccCHHHHHHHHHHHH----------h-----CCCcCHHHHHHHH-hhCCC Confidence 22211 11111 000100000001111111111110 0 0122233322221 12222 Q ss_pred chhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhh-hhhccC Q lcl|NC_020414. 470 ELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQ-QEMKEG 515 (515) Q Consensus 470 p~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~-~~~~~~ 515 (515) .+.-++..+... .....| +.+.|= T Consensus 362 ~~~~~~~~~~~~----------------------~~~~~gGd~~~~~ 386 (386) T protein:vir:49 362 LPKELPDGKNPN----------------------RTSLKGGEINEQD 386 (386) T ss_pred CCCcCcchhccC----------------------CCCCCCCCCCCCC Confidence 111111111000 000000 000000 No 143 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=65.51 E-value=0.28 Score=23.59 Aligned_cols=375 Identities=11% Similarity=-0.001 Sum_probs=139.8 Q ss_pred HHHHHHHHHhhhhH-HHHHHHHHHhhcccccCCCCCCccccccccccHH-HHHHHHHHHHHHhhcCCCCCceecCCChHH Q lcl|NC_020414. 16 PKLWEKFSKKRSPY-LDRAKHFAKLTLPYLMNNKGDNETSQNGWQGVGA-QATNHLANKLAQVLFPAQRSFFRVDLTAKG 93 (515) Q Consensus 16 ~~r~~~lk~~R~~~-e~~w~e~~~~~~P~~~~~~~~~~~~~~~~dst~~-~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~ 93 (515) -..|+.|..+|++. ...+.+..++.-.......+..=.........+. .|++.+|+.+. +-||--....+.. T Consensus 1 Mg~f~~lf~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~Ia~~ia------~~p~~~~~~~~~~ 74 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPAELADAIGLSYDTYTGKQISSQRAMRLTAVFSCVRVLAESVG------MLPCNLYHLNGSL 74 (414) T ss_pred CchhhhhhccCccCcccchhhHhHhhccCccccCCceechhhhhccHHHHHHHHHHHHHhc------cCceEEEEecCCc Confidence 34455554444332 2233333333322111111110000111122223 34544444432 3343221111110 Q ss_pred HhhhhccchhHHHHHHHHHHHHHHHHHHHH-hc----CCHHHHHHHHHHHHhhCceEEEEeCCCc--EEEEEc--ceEEE Q lcl|NC_020414. 94 EKVLDDRGLKKTQLATIFARVETTAMKALE-QR----QFRPAIVEVFKHLIVAGNCLLYKPSKGA--MSAVPM--HHYVV 164 (515) Q Consensus 94 ~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~d~~~~--~r~~pl--~~y~v 164 (515) ... ..... +...|. +- +.+.=+..+..++...||+.+|+..+.+ ...+|| +.+.+ T Consensus 75 ~~~---------~~~~~-------~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~~g~~~~L~~l~~~~v~~ 138 (414) T protein:vir:44 75 KQR---------ATGER-------LHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKAFGEVAELLPVDPGCVVP 138 (414) T ss_pred eee---------cccch-------HHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCcEEEEEEEcCceEEE Confidence 000 00111 111222 22 3444556667777789999887744333 445666 33444 Q ss_pred eeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCccccc Q lcl|NC_020414. 165 NRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRIKAEK 244 (515) Q Consensus 165 ~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy~~~~ 244 (515) ..+..|++ +|..... ++.. .+ |..+ T Consensus 139 ~~~~~~~~--------------------------------------~y~~~~~--~g~~-~~-------------~~~~- 163 (414) T protein:vir:44 139 KLNSSWEP--------------------------------------VYQVTFP--DGST-DV-------------LSQE- 163 (414) T ss_pred EECCCCcE--------------------------------------EEEEEec--CceE-EE-------------Eccc- Confidence 44444322 1111111 1100 00 1111 Q ss_pred CcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCC-----------CC- Q lcl|NC_020414. 245 LPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNS-----------GT- 312 (515) Q Consensus 245 ~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~-----------~~- 312 (515) -+++.|.... +..||.||..-+...+.....+.+.......-...|..++.-++.++++....- .+ T Consensus 164 -evih~~~~~~-d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~ 241 (414) T protein:vir:44 164 -DIWHVRTLTL-DGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKDFEERHTGLGNA 241 (414) T ss_pred -cEEEecCCCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCcccc Confidence 1344443322 347999999999988988888888888877778888777766666655432111 00 Q ss_pred cc--eecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhcc-CCCCCCCHHHHHHHHHHHHHHhhhhHHH Q lcl|NC_020414. 313 GE--VITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTR-RDAERVTAVEIQRDALEIEQNMGGVYSL 387 (515) Q Consensus 313 g~--~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~-~~~~~~TAtEi~~r~~E~~~~LGpv~~r 387 (515) |. ++++.. +..++.. +..+.+. .+..+-.+..|-++|-.. .+.. .++..-+++|.. .. T Consensus 242 ~~~~vl~~g~-~~~~l~~-~~~d~~~-~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~--------------~~ 304 (414) T protein:vir:44 242 HRPMILEMGL-DWKSMAL-NAEDSQF-LETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELG--------------LG 304 (414) T ss_pred CcceecCCCc-eEEEccC-ChHHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH--------------HH Confidence 11 111111 2233322 1234443 334455567788888321 1221 122223333332 22 Q ss_pred HHHHHHHHHHHHH---HHhcCCCCChhhccceeeee-hHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHH Q lcl|NC_020414. 388 FAMTMQTPIAMWG---LQEAGDSFTSELVDPVIVTG-IEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWV 463 (515) Q Consensus 388 l~~E~l~Pli~r~---~~~~~~~~p~~~~~~~~v~~-l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~ 463 (515) +...-+.|++.++ +...+....+. ...++.. ++.|.|.-........-..++ .. .+.+++ + T Consensus 305 ~~~~~l~P~~~~ie~~ln~~L~~~~~~--~~~~i~fd~~~ll~~d~~~~~~~~~~~~~--~G-------~~t~NE----~ 369 (414) T protein:vir:44 305 FINYSLVPYLTRIEQRINTGLVRKSKQ--GVFYAKFNAGALLRGDMKSRFEAYATGIN--WG-------IYSPND----C 369 (414) T ss_pred HHHHHHHHHHHHHHHHHHhhcCCcccc--CceEEEEechhhhccCHHHHHHHHHHHHh--CC-------CcCHHH----H Confidence 3444556665554 22222211111 1223332 233333211111111111111 00 112222 1 Q ss_pred HHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 464 RGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 464 a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) -+.+|.|+- ---|+ ...... ..... +.+...+++-..+ T Consensus 370 R~~~gl~p~--~ggD~---~~~~~n------~~~~~---~~~~~~~~~~~~~ 407 (414) T protein:vir:44 370 RDLEDMNPR--PGGDV---YLTPMN------MTTKP---SDGSKAGKQKDNA 407 (414) T ss_pred HHHhCCCCC--CCcce---eccccc------ccccC---CccccCCCCCCCC Confidence 223454431 00000 000000 00000 0000111111111 No 144 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=65.18 E-value=0.29 Score=23.55 Aligned_cols=406 Identities=10% Similarity=0.029 Sum_probs=170.0 Q ss_pred ccHHHHHHHHHHHHHhhhhHHHHHHHHHHhh--cccccCCCCC---Cc---cccccccccHHHHHHHHHHHHHHhhcCCC Q lcl|NC_020414. 10 GQRSKIPKLWEKFSKKRSPYLDRAKHFAKLT--LPYLMNNKGD---NE---TSQNGWQGVGAQATNHLANKLAQVLFPAQ 81 (515) Q Consensus 10 ~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~--~P~~~~~~~~---~~---~~~~~~dst~~~a~~~Laa~l~s~ltpp~ 81 (515) +|.+.|.+..+..+.++ +.....+++|.=- ++.+-..... .. ..-|+..+-+...++..++-|++ T Consensus 1 l~~~~i~~~i~~~~~~~-~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G------ 73 (451) T protein:vir:10 1 MELEKIRAIISADAARR-QEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFT------ 73 (451) T ss_pred CCHHHHHHHHHHHHHHH-HHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheec------ Confidence 79999999998887644 4444444443210 0011000000 00 01133345555555555543322 Q ss_pred CCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEE--EEeCC-------- Q lcl|NC_020414. 82 RSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPSK-------- 151 (515) Q Consensus 82 ~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l--~~d~~-------- 151 (515) .| ..++.++.. +..+.+ ..+..++|.....++.++...+|.|.+ |.|++ T Consensus 74 ~p-~~~~~~~~~------------~~~~~~--------~~~~~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~ 132 (451) T protein:vir:10 74 YP-VLFDIDNNK------------ELNEKV--------TDVLGNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTN 132 (451) T ss_pred cc-ceeecCCcH------------HHHHHH--------HHHhccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccc Confidence 11 112222211 111111 112347899999999999999998764 55543 Q ss_pred Cc--EEEE-EcceEEEeeCC-CCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEE Q lcl|NC_020414. 152 GA--MSAV-PMHHYVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQ 227 (515) Q Consensus 152 ~~--~r~~-pl~~y~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~ 227 (515) .. ++++ |..-|++-.|. .+.+.-.+|.+......- +. .....+++.-.+.++..+.+.. T Consensus 133 ~~~~~~~i~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~----------------~~-~~~~~~~~~e~yt~~~~~~~~~ 195 (451) T protein:vir:10 133 QTFKYGVVNTEEIIPIYRNGIERELEAVIRYYIQLEDVK----------------GQ-IQKQAYTYVEFWTDKILDKYKF 195 (451) T ss_pred cceeEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccc----------------cc-ccceEEEEEEEEeCCeEEEEEe Confidence 22 3444 33445554443 567777776664322210 00 0111111111223333222211 Q ss_pred Ee---CCeeec---ccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccc Q lcl|NC_020414. 228 SA---DDIPVG---KENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQ 301 (515) Q Consensus 228 e~---~~~~i~---~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~ 301 (515) .- .+..+. .+-+| ..||++.++. +.+|.|=.+...+-+..++.+.-......+...+|.+.+.--+. T Consensus 196 ~~~~~~~~~~~~~~~~~~~--g~vPvv~~~n-----n~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~ 268 (451) T protein:vir:10 196 FGVSCCGSQIEHITVQHRF--NSVPFVEFSN-----NIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGG 268 (451) T ss_pred cccCccccccccccccCCC--CeeeEEEecc-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCc Confidence 11 111111 12244 3578776543 45688888889999999998888888888888888655531111 Q ss_pred cC-hhhccCCCC-cce-ecC----CcccccccccCCccchHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCHHHHHH-- Q lcl|NC_020414. 302 TD-VDHFVNSGT-GEV-ITG----VEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQR-- 372 (515) Q Consensus 302 ~~-~~~~~~~~~-g~~-~~g----~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~~~~TAtEi~~-- 372 (515) .. .+....... +.+ +++ ..+++..+. ...+.+.....++.+...|-..-..-.+........|+.-+.. T Consensus 269 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~ 346 (451) T protein:vir:10 269 EDTSEFLKELKRYKTIKTETDSEGDSGGLKTMQ--IEIPTEARKIILEILKKQIYESGQGLQQDTENFGNASGVALKFFY 346 (451) T ss_pred ccchhhHHHHhhCCeEEecCcCCccCCcceEEe--ecCCHHHHHHHHHHHHHHHHHHhCcccccccccccccHHHHHHHH Confidence 11 111111111 222 111 122333333 3346677778888887777544211001111112345544422 Q ss_pred -----HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceee--eehHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020414. 373 -----DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIV--TGIEALGRMAELDKLANFAQYMSLPQT 445 (515) Q Consensus 373 -----r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v--~~l~~l~ra~~~~~l~~~~~~v~~~a~ 445 (515) ++.+++..++..+.++-. .++.-.. ......+.+.+- .+.+.+..++-+. .+++ T Consensus 347 ~~l~~k~~~k~~~f~~~l~~~~~--------li~~~~~-~~d~~~i~i~f~~~~p~n~~e~~~~~~----------kl~g 407 (451) T protein:vir:10 347 RKLELKSGLLETEFRTSFDKLIK--------AILYFLG-VTDYKKIQQTYTRNMMSNDLEDADIAT----------KSVG 407 (451) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH--------HHHHHhC-CCCccceeEEecCCCCCCHHHHHHHHH----------HHhc Confidence 334444444444433221 1111111 111122333321 1222222222111 1111 Q ss_pred CChHHHhcCCHHHHHHHHHHhcCCchhccCCHH-HHHHHHHHHHHHHHHHHHHHHhhhhccchhhh Q lcl|NC_020414. 446 WPEPAQRAIRWGDYMDWVRGQISAELPFLKSEE-EMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQ 510 (515) Q Consensus 446 ~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~e-ev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~ 510 (515) .+.-..++..+ .++-+.+ +.+.+.++++ .+.+++++..+. .++ T Consensus 408 -------~iS~et~~~~~--------p~v~d~~~e~~~~~ee~~--~~~~~~~~~~~~-----~~~ 451 (451) T protein:vir:10 408 -------IIPTKIILRHH--------PWVDDVEEAEKLYLEEKK--IQASKVSDDYNN-----FTE 451 (451) T ss_pred -------cCchHHHHHhC--------CCCCCHHHHHHHHHHHHH--HHHHHHHhhcCC-----CCC Confidence 12222222221 1222322 2222222211 112222222221 222 No 145 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=64.48 E-value=0.3 Score=23.46 Aligned_cols=421 Identities=10% Similarity=0.026 Sum_probs=164.0 Q ss_pred CCCccccc--cccHHHHHHHHHHHHHhhhhHHHHHHHHHHh---hcccccCCCCCCccccccc--cccHHHHHHHHHHHH Q lcl|NC_020414. 1 MQDTILEY--GGQRSKIPKLWEKFSKKRSPYLDRAKHFAKL---TLPYLMNNKGDNETSQNGW--QGVGAQATNHLANKL 73 (515) Q Consensus 1 ~~~~~~~~--~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~---~~P~~~~~~~~~~~~~~~~--dst~~~a~~~Laa~l 73 (515) |+.-.... .++++. ..+ ..+-. +.. ...-++-+.| ..|.+. +-......| +..+-.++++.|..+ T Consensus 1 ~~~~~~a~~~~~~~~a-~~~-~~~~~-~~g-~~~~~d~~~~~~~~~~~~~----~~~~l~~lY~~~~l~r~iVd~~a~d~ 72 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKI-VNR-NDFMV-GHG-KANSRDKLTRQTPGNGQKL----DLKACENLYASNSIAMNIVDIISEDM 72 (461) T ss_pred Cccchhhhhhhhhhhh-hhh-hHHHh-hcC-CcchhhhhhccccCccccc----CHHHHHHHHHhCCccchhhccchHHh Confidence 66544433 111111 111 11100 000 0000111111 001000 000011122 233334455544443 Q ss_pred HHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCc Q lcl|NC_020414. 74 AQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKGA 153 (515) Q Consensus 74 ~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~~ 153 (515) | +.|+.+.-.++.. ...++.+ +.+-+....+.+++..--.||.+.+++.-++. T Consensus 73 ----~---r~g~~i~~~~~~~---------~~~~~~~-----------~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~ 125 (461) T protein:vir:80 73 ----V---RAGWSLKTDNKEM---------KKNIESK-----------WRKLKTKDRFQKLYADKRLYGDGFLSIGVVSS 125 (461) T ss_pred ----h---cCCeeeecCCHHH---------HHHHHHH-----------HHHhhHHHHHHHHHHhhcccccEEEEEEeecC Confidence 3 3677776543221 1122333 23346788899999998899998877632111 Q ss_pred -----EEEEEcceEEEeeCCCCC--eeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEE Q lcl|NC_020414. 154 -----MSAVPMHHYVVNRDTNGD--LMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKIN 226 (515) Q Consensus 154 -----~r~~pl~~y~v~~d~~G~--vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~ 226 (515) .-.-||. ...-+. ...+|.+..++..... .+..+ ...+.| +.|....... +..+ T Consensus 126 ~~~~~~~~~pl~-----~~~~~~~~~l~~~~~~~i~~~~~~----~dp~s---p~fg~P----~~y~i~~~~~---~~~~ 186 (461) T protein:vir:80 126 NREQADLSTAID-----PKTIKSIPYINTFNTQKVTQLYLN----QDMFS---EHFGEV----EFFEVNRVSQ---LGEE 186 (461) T ss_pred CccccCccCCcc-----cccccceeEEEeccccccchhhhc----ccCcC---cccccc----eEEEEecccc---cccc Confidence 1111221 111111 1222333333322211 11100 011111 1222111110 0000 Q ss_pred EEeCCeeecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecC------cc Q lcl|NC_020414. 227 QSADDIPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRP------GS 300 (515) Q Consensus 227 ~e~~~~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~------~g 300 (515) ...+ . ....+..-|..+++...=...++..||+|-.+..++.++..+.......+.+..+.-+.+-.+. +. T Consensus 187 ~~~~--~-~~~~~~~iH~SRii~~~~~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~ 263 (461) T protein:vir:80 187 ILSG--T-TASTSEQIHRSRIIHEQGLRFEGETKGRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDD 263 (461) T ss_pred cccc--c-cCccceEEccccEEEecCCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhhchH Confidence 0000 0 0111111144566666666777888999999999999999988888777666555444433321 10 Q ss_pred ---ccChhhccCCCCcceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHH---HHHhhccCCCCCCCHH-HHHHH Q lcl|NC_020414. 301 ---QTDVDHFVNSGTGEVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIF---MMETMTRRDAERVTAV-EIQRD 373 (515) Q Consensus 301 ---~~~~~~~~~~~~g~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~af---l~~~l~~~~~~~~TAt-Ei~~r 373 (515) ....-.......|..+-+..+++..+. .+|.-+...++.+.+.|.-+- ..-.+.+..+..=|.+ ++. T Consensus 264 ~~~~~~~~~~~~~~~g~~~~d~~e~~e~~~----~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge~D~~-- 337 (461) T protein:vir:80 264 KANLTAMLDFMFRTEALAIIKGDEQLTKES----TNVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQYDVM-- 337 (461) T ss_pred HHHHHHHHHHhcCCceEEEEcCCcceEEEe----cCcCCHHHHHHHHHHHHhhhhcCCeeeeecccCCccccchHHHH-- Confidence 011111111233444444444443332 234445566667777776653 1111222223322333 232 Q ss_pred HHHHHHHhhhhHHHHHHHHHHHHHHHHH-------HhcCCCCChhhcccee-eeeh---HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 374 ALEIEQNMGGVYSLFAMTMQTPIAMWGL-------QEAGDSFTSELVDPVI-VTGI---EALGRMAELDKLANFAQYMSL 442 (515) Q Consensus 374 ~~E~~~~LGpv~~rl~~E~l~Pli~r~~-------~~~~~~~p~~~~~~~~-v~~l---~~l~ra~~~~~l~~~~~~v~~ 442 (515) .+---+.+++.-.+.|.+++++ .+..+.+.+...+..+ -.+| +.-.+|.-......+.+. T Consensus 338 ------~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~--- 408 (461) T protein:vir:80 338 ------NYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQI--- 408 (461) T ss_pred ------HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHH--- Confidence 1222233444444555554432 2333443333222222 1233 333343333333333332 Q ss_pred hhcCChHHHhcCCHHHHHHHHHHhcCCc-hh-ccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 443 PQTWPEPAQRAIRWGDYMDWVRGQISAE-LP-FLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 443 ~a~~~p~~~d~id~d~~~~~~a~~~Gvp-~~-~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) +.+. ..|+.+++.+.+...+|.+ .+ +--...+...+..+.-+.. .++.-+| T Consensus 409 ~~~~-----g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~e~~~g 461 (461) T protein:vir:80 409 YIVN-----GVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVYDAY-----------------AKKNADG 461 (461) T ss_pred HHhc-----CCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhccccc-----------------cccCCCC Confidence 2111 2578888888777777753 32 2222222222211110000 0011111 No 146 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=63.37 E-value=0.32 Score=23.31 Aligned_cols=291 Identities=7% Similarity=-0.006 Sum_probs=117.5 Q ss_pred eeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcC-CCCeEEEEEeCCeee-cc---cCCccccc-- Q lcl|NC_020414. 172 LMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAG-EGFWKINQSADDIPV-GK---ENRIKAEK-- 244 (515) Q Consensus 172 vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~-~~~~~~~~e~~~~~i-~~---esgy~~~~-- 244 (515) |-++++++. +..+.+ ..+.+++ ..+..+.+..++..+ ++ ..|.-... T Consensus 1 v~Eivw~~~-------------------------~g~~~~-~~l~~r~~~~~~~f~~~~~~~l~~~~~~~~~g~~~~~lp 54 (355) T protein:vir:78 1 MFEQVYRIE-------------------------NGRARL-GKLAWRPPRTISRFDVAPDGGLVAIEQWGVFGKATVRIP 54 (355) T ss_pred CeEEEEEee-------------------------CCeEEE-eeeeecCccceeeeeeccCCceeEEEecCCCCCCcceec Confidence 222222110 011111 1122222 223233333333321 11 11211111 Q ss_pred -CcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccC-ceeecCccccC--h---------------- Q lcl|NC_020414. 245 -LPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADI-KYLIRPGSQTD--V---------------- 304 (515) Q Consensus 245 -~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p-~~l~~~~g~~~--~---------------- 304 (515) +=|++.|....+|+.||.|+...+..-..--+...+..+..+++-..| |+..-|.+... . T Consensus 55 ~~kfi~~~~~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l 134 (355) T protein:vir:78 55 VDRLVVFVNEREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEG 134 (355) T ss_pred cCCEEEEEeCCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHH Confidence 228999999999999999999999998888888899999999886555 44433332111 0 Q ss_pred ----hhccCC-CCcceecCCcccccccccC-CccchHHHHHHHHHHHHHHHHHHHHHhhccCCC---CCCCHHHHHHHH- Q lcl|NC_020414. 305 ----DHFVNS-GTGEVITGVEEDIHIVQLG-KYADLTPISAVLEVYTRRIGVIFMMETMTRRDA---ERVTAVEIQRDA- 374 (515) Q Consensus 305 ----~~~~~~-~~g~~~~g~~~~v~~~~~~-~~~~l~~~~~~i~~~~~rI~~afl~~~l~~~~~---~~~TAtEi~~r~- 374 (515) ..+..+ ..|.++|-.. .+..++.. ...+ ....|+-+.+.|+.++|-.+|..... ......|++... T Consensus 135 ~~~~~~i~~g~~a~~iip~g~-~ie~~ea~g~~~~---~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~vh~~v~ 210 (355) T protein:vir:78 135 LQLAKEFRAGEAAGGYIPHGA-NFTLTGVQGKLPE---MDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGDTFASFF 210 (355) T ss_pred HHHHHHhhCCcceeEeecCCc-eEEEeecCCCccc---HHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHHHHHHHH Confidence 111111 1345566433 34444332 2233 34678999999999999877765332 223445553321 Q ss_pred HHHHHHhhhhH-HHHHHHHHHHHHHHHHHhcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhc Q lcl|NC_020414. 375 LEIEQNMGGVY-SLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRA 453 (515) Q Consensus 375 ~E~~~~LGpv~-~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~ 453 (515) ..+...-.-.+ +.|+..++.||+..-+... ...|. +....+.... ... ++.+ ..+..+. -. T Consensus 211 ~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~-~~~P~--~~~~~~~~~~-~~~---a~~~-------~~l~~~G----~~ 272 (355) T protein:vir:78 211 TGSLNAVMKHIADVTQQHVVEDLVDQNWGPE-EPAPR--LVPAQLGKEQ-PVT---AEAI-------RALVECG----AF 272 (355) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-CCCCE--EEecCcChhH-HHH---HHHH-------HHHHhCC----Cc Confidence 22222111111 2233334444433221111 11111 1111111100 111 1111 1221111 11 Q ss_pred CCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHH--------------------------HHHHHHHHHHhhhhccch Q lcl|NC_020414. 454 IRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQ--------------------------AQQEAMLNEGVAKAVPGV 507 (515) Q Consensus 454 id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~--------------------------~~q~~~~~~~~~~a~~~~ 507 (515) +..+....++.+.+|+|..--.. +++..-.+.... .+...+.....-.|.... T Consensus 273 ~~~~~~~~~~~e~~gip~p~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~a~~~~~~~~~~~~~~~~~~~~~ 351 (355) T protein:vir:78 273 TADPELEKDLRARYGLPAPAERD-DGADAAAAKAAGRRRAKRLPGQRQGAALPSRSPRADPPRRRGPLRRRPRHPAHRRC 351 (355) T ss_pred cccHHHHHHHHHHhCCCCCCCCC-cccCCccccccccccccccCCccccccccccCCCCCChhhhHHHHHHhhccccCCC Confidence 22344566778889987532211 111100000000 000000000110010000 Q ss_pred hhhhhccC Q lcl|NC_020414. 508 IQQEMKEG 515 (515) Q Consensus 508 ~~~~~~~~ 515 (515) .+. | T Consensus 352 ~~~----~ 355 (355) T protein:vir:78 352 APD----G 355 (355) T ss_pred CCC----C Confidence 000 0 No 147 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=51.18 E-value=0.59 Score=21.85 Aligned_cols=384 Identities=11% Similarity=0.051 Sum_probs=121.1 Q ss_pred hhcccccCCCCCCccccccccccHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHH---- Q lcl|NC_020414. 39 LTLPYLMNNKGDNETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARV---- 114 (515) Q Consensus 39 ~~~P~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~v---- 114 (515) ..+.. .+.-.+.. + ++ +++-+....+-......+++..-...... -+....|..-.+.+ T Consensus 1 ~~~~~---~~~~~~p~--~-----~~----~~~~~~~~~~~~~~~g~~~~~~~~~~~~~---~~~~~~V~acV~~IA~~i 63 (518) T protein:vir:78 1 MLLAN---GQTLSAPA--M-----AE----LSPQMQDSYYYAPAVGMQLERQFSLYGGI---YKNQPWVRTVIAKRAQAL 63 (518) T ss_pred CcccC---ceeeccch--h-----hh----hhhhhhhcccccceeceecccccchhhHH---hhhhHHHHHHHHHHHHhh Confidence 11110 00000000 0 00 00000000000000000000000000000 00000111111100 Q ss_pred -------------------HHHHHHHHHhcCCH----HHHHHHHHHHHhhCceEEEEeCCC---cEEEEEc--ceEEEee Q lcl|NC_020414. 115 -------------------ETTAMKALEQRQFR----PAIVEVFKHLIVAGNCLLYKPSKG---AMSAVPM--HHYVVNR 166 (515) Q Consensus 115 -------------------e~~~~~~l~~snf~----~~~~~~~~dl~~~G~~~l~~d~~~---~~r~~pl--~~y~v~~ 166 (515) ...+...+.+=|-+ .=+..++.+|...||+.+++..+. ....||| +.+.+.. T Consensus 64 A~lp~~l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~ 143 (518) T protein:vir:78 64 ARLPVKCMFTSGDTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKR 143 (518) T ss_pred ccCceEEEEEcCCccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEE Confidence 01112223333322 334556667777899988875432 1344555 3333334 Q ss_pred CCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCcccccCc Q lcl|NC_020414. 167 DTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRIKAEKLP 246 (515) Q Consensus 167 d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy~~~~~P 246 (515) |.++. ...+++...+..-.+...|+.++ T Consensus 144 ~~~~~--------------------------------------------------~~~y~~~~~~~~~~~~~~~~~~e-- 171 (518) T protein:vir:78 144 NSRTG--------------------------------------------------RYEYYFQAGAGVGTQLVSFADDE-- 171 (518) T ss_pred cCCCC--------------------------------------------------EEEEEEEecCCccceeEEecCCc-- Confidence 33211 11111111110000001122122 Q ss_pred EEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccC----------C-C-Ccc Q lcl|NC_020414. 247 FIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVN----------S-G-TGE 314 (515) Q Consensus 247 ~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~----------~-~-~g~ 314 (515) +++.|+...+|..||.||..-+...+.......+.......-...|..++.-++.++++.... + . .|. T Consensus 172 IiHir~~~~dg~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~ls~e~~~~~k~~~~~~~~G~~nag~ 251 (518) T protein:vir:78 172 VVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSPEAQQRLREQFDRAHAGSSNTGK 251 (518) T ss_pred EEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcCcccCCc Confidence 466676666677799999999888888888888887777777778877766666666554211 0 0 111 Q ss_pred e--ecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhccCCCCCCCHHHHHHHHHH-HHHHhhhhHHHHH Q lcl|NC_020414. 315 V--ITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTRRDAERVTAVEIQRDALE-IEQNMGGVYSLFA 389 (515) Q Consensus 315 ~--~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~~~~~~~TAtEi~~r~~E-~~~~LGpv~~rl~ 389 (515) + +++.. ...++.. +..+.+. .+..+..+..|-++|-.- .+...+.. |-.=+.+.... ....|.|.+.++. T Consensus 252 ~~vL~~G~-~~~~l~~-~~~d~q~-le~r~~~~~eIa~afgVPp~~lg~~~~s--t~sn~e~~~~~f~~~tL~P~~~~ie 326 (518) T protein:vir:78 252 TMVVEEGM-EPIPLQL-TAVEMQF-IEARQLNREEVCGVYDIAPPIVHILDRA--TFSNISAQMRAFYRDTMAIPIARIQ 326 (518) T ss_pred eeEcCCCc-eEEeccC-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHhccCCCC--CchhHHHHHHHHHHHHHHHHHHHHH Confidence 1 11111 2223322 1234444 344455667788888321 12221211 22212111111 2234555555555 Q ss_pred HHHHHHHHHHHHHhcCCCCChhhccceeeee-hHHHHHHHHHHHHHHHHHHHHHhhcC-ChHHHhcCCHHHHHHHHHHhc Q lcl|NC_020414. 390 MTMQTPIAMWGLQEAGDSFTSELVDPVIVTG-IEALGRMAELDKLANFAQYMSLPQTW-PEPAQRAIRWGDYMDWVRGQI 467 (515) Q Consensus 390 ~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~-l~~l~ra~~~~~l~~~~~~v~~~a~~-~p~~~d~id~d~~~~~~a~~~ 467 (515) .+|-.-|+ +.... ..++++ ++.|-|.--.+........++ ..-+ +-++...++.+.+=+.-++.+ T Consensus 327 ~eln~~L~--------~~~~~----~~~~~fd~~~Llr~D~~~r~~~~~~~~~-~G~lT~NE~R~~~gl~pie~~~gD~~ 393 (518) T protein:vir:78 327 SAMDKYVG--------QYWVR----KNRMKFDIDDVIQPDWEAKSESTQKMVN-SGVATPNEGREIMGLPRSDDPKADEL 393 (518) T ss_pred HHHHHhhc--------ccccC----cceEEeechhhhccCHHHHHHHHHHHHh-CCCcCHHHHHHHhCCCCCCCCCCcee Confidence 55433221 11111 111111 122222111111111111111 0001 112222222111100001111 Q ss_pred CCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchh-hhhhccC Q lcl|NC_020414. 468 SAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVI-QQEMKEG 515 (515) Q Consensus 468 Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~-~~~~~~~ 515 (515) -++..+..- +.. -++...-+ +.-..+...+.+..- ++...++ T Consensus 394 ~v~~n~~pl-~~~---~~~~~~g~--~~~~~~~~~~~~~~~~~~~~~~~ 436 (518) T protein:vir:78 394 YANSALQPL-GAT---PDGAVEGE--EAPAPKRPASTPVASLDQSPPAS 436 (518) T ss_pred eecccceec-ccc---cccccCCC--CCCCCCCCCcccccccccCcccc Confidence 111111100 000 00000000 000000000000000 0111111 No 148 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=48.21 E-value=0.68 Score=21.51 Aligned_cols=298 Identities=11% Similarity=0.080 Sum_probs=127.8 Q ss_pred cCCCCCCccc------cccc---cccHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHH- Q lcl|NC_020414. 45 MNNKGDNETS------QNGW---QGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARV- 114 (515) Q Consensus 45 ~~~~~~~~~~------~~~~---dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~v- 114 (515) +.....+... ...| |+.+...-..+-. .+....-.+-.|++-=.+-..+.++.... .+...+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~-~~~~~~~~~~~~~~pP~~~~~La~l~~~~-------~~h~~~L 72 (337) T protein:vir:78 1 MTKRQQQPAQAAASSPRPSVVFSMPEAIDPTAWMTD-YTGVFYNPYGEYYQPPIDRKGLAKVARAN-------AHHGAIL 72 (337) T ss_pred CCCcccCcccccccCceeEEEecCcccccCcchhHh-hhhhhhccCcceecCCCCHHHHHHHhhcc-------hhhhhHH Confidence 2211111000 0011 2222211111111 22223334556765444333343322111 111000 Q ss_pred HHHHHHHHHhcCC---HHHHHHHHHHHHhhCceEEEEeCC---CcEEEEEcceEEEeeCCCCCeeEEEEEEEecHHHHHH Q lcl|NC_020414. 115 ETTAMKALEQRQF---RPAIVEVFKHLIVAGNCLLYKPSK---GAMSAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDP 188 (515) Q Consensus 115 e~~~~~~l~~snf---~~~~~~~~~dl~~~G~~~l~~d~~---~~~r~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~ 188 (515) ..+ .....+.| +..+..+..|+.++|||.+++..+ ..+..+||..-++.+..+|+. . T Consensus 73 ~~k--~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~~rn~~G~~~~L~pl~~~~v~~~~d~~~--~------------- 135 (337) T protein:vir:78 73 MAR--RNMVAGRFTNQRATITAFVHNYLQFGDGGLLKLRNSFGQVVGLHPLSSVYLRRREDGCF--V------------- 135 (337) T ss_pred Hhh--hccccccCcCcHHHHHHHHHHHHhhCCeEEEEEECCCCcEEEEEEeCCceeEeeeCCeE--E------------- Confidence 000 01112223 357788889999999998887433 125566765444444433321 0 Q ss_pred HhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCcccccCcEEEEeeeecCCCccccchHHHH Q lcl|NC_020414. 189 ATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDY 268 (515) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~ 268 (515) |+..++... .|+.++ ++..|.....+.+||.+|..-+ T Consensus 136 -------------------------------------~~~~~~~~~----~~~~~e--IiHik~~~~~~~~~Gls~~~~a 172 (337) T protein:vir:78 136 -------------------------------------YLQQGKPNL----IYRPDD--VIWLAQYDPEQQVYGMPDYLGG 172 (337) T ss_pred -------------------------------------EEEcCCceE----EECCcc--EEEECCCCCCCCcccccHHHHH Confidence 000111110 122222 4566654445779999999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCceee-cCccccChhhcc----------CCCCc--ce--ecCC-cccccccccCC-c Q lcl|NC_020414. 269 SGDLFVIQFLSEAVARGAALMADIKYLI-RPGSQTDVDHFV----------NSGTG--EV--ITGV-EEDIHIVQLGK-Y 331 (515) Q Consensus 269 l~d~k~L~~l~~~~~~~~~~a~~p~~l~-~~~g~~~~~~~~----------~~~~g--~~--~~g~-~~~v~~~~~~~-~ 331 (515) +..+-.-+..++-..+.-.-...|-.++ .+++.++++... .++++ .+ .||. .+.+...++.. . T Consensus 173 ~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~~~l~~e~~~~lk~~~~~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~ 252 (337) T protein:vir:78 173 LQSALLNQDATLFRRRYFLNGAHMGFIFYATDPNMDDDTEEEMKEMIANSKGVGNFRSMFVNIPDGKPDGIKLIPVGDIA 252 (337) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHhcCcccccceEEEcCCCCccceeEEEcCCCh Confidence 8888766655555555555556665554 355445544211 11111 11 2222 33344444443 2 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHH---hhccCCCCCCC---HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_020414. 332 ADLTPISAVLEVYTRRIGVIFMME---TMTRRDAERVT---AVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAG 405 (515) Q Consensus 332 ~~l~~~~~~i~~~~~rI~~afl~~---~l~~~~~~~~T---AtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~ 405 (515) .+.+ ..+..+-.++.|-++|=.- .....+...-| +++... .=....|.|...++.+++-..++........ T Consensus 253 ~d~q-fle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n~e~~~~--~f~~~~L~P~~~~ie~~~n~~ll~~~~~~~f 329 (337) T protein:vir:78 253 TKDE-FAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLGDPEKYDA--TYARNEVLPLCELVQDAINSAGLPRALWVTF 329 (337) T ss_pred hHHH-HHHHHHHhHHHHHHHhCCCHHHcccccCCCcCccccHHHHHH--HHHHHHHHHHHHHHHHHHhhhcCChhhceec Confidence 3444 2444555666788887321 12222322223 333322 2234556677777777664433322222222 Q ss_pred CCCChhhc Q lcl|NC_020414. 406 DSFTSELV 413 (515) Q Consensus 406 ~~~p~~~~ 413 (515) ...+..++ T Consensus 330 ~~~~~~~~ 337 (337) T protein:vir:78 330 RETIGAAV 337 (337) T ss_pred cccccccC Confidence 22223333 No 149 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=42.73 E-value=0.87 Score=20.91 Aligned_cols=379 Identities=11% Similarity=0.029 Sum_probs=118.8 Q ss_pred hhccc---ccCCC-CCCc-cccccccccHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHH Q lcl|NC_020414. 39 LTLPY---LMNNK-GDNE-TSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFAR 113 (515) Q Consensus 39 ~~~P~---~~~~~-~~~~-~~~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ 113 (515) ..+.. +..+. .+.. .....|....+.. ..++....-|..+. .....|..-.+. T Consensus 1 ~~~~~~~~~~~p~~~e~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~a-------------~~~~~V~acV~~ 58 (518) T protein:vir:10 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVG---------MQLERQFSLYGGIY-------------KNQPWVRTVIAK 58 (518) T ss_pred CcccCceeecCchhhhhhhhhhcccccccccc---------eecccccchhhHHH-------------hhhHHHHHHHHH Confidence 11110 00000 0000 0000110000000 00000000000000 000011111111 Q ss_pred -----------------------HHHHHHHHHHhcCC----HHHHHHHHHHHHhhCceEEEEeCCC---cEEEEEc--ce Q lcl|NC_020414. 114 -----------------------VETTAMKALEQRQF----RPAIVEVFKHLIVAGNCLLYKPSKG---AMSAVPM--HH 161 (515) Q Consensus 114 -----------------------ve~~~~~~l~~snf----~~~~~~~~~dl~~~G~~~l~~d~~~---~~r~~pl--~~ 161 (515) ....++..+.+=|- +.=...++.++..+||+.+++..+. ....||| +. T Consensus 59 IA~~iA~lpl~l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~ 138 (518) T protein:vir:10 59 RAQALARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSR 138 (518) T ss_pred HHHhhccCceEEEEEcCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCc Confidence 11112222333332 2334455567777899988875432 1344555 33 Q ss_pred EEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCcc Q lcl|NC_020414. 162 YVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRIK 241 (515) Q Consensus 162 y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy~ 241 (515) +.+..|..+.. ..+++...+..-.+...|+ T Consensus 139 v~v~~~~~~~~--------------------------------------------------~~y~~~~~~~~~~~~~~~~ 168 (518) T protein:vir:10 139 VAIKRNSRTGR--------------------------------------------------YEYYFQAGAGVGTQLVSFA 168 (518) T ss_pred eEEEEcCCCCE--------------------------------------------------EEEEEEecCCccceEEEec Confidence 44444332111 1111111000000000111 Q ss_pred cccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCC---------- Q lcl|NC_020414. 242 AEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSG---------- 311 (515) Q Consensus 242 ~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~---------- 311 (515) .+ =+++.|+....|-.||.||..-+...+.....+.+.......-...|..++.-++.++++...... T Consensus 169 ~~--eViHir~~s~dg~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~k~~~~~~~~G~ 246 (518) T protein:vir:10 169 DD--EVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGS 246 (518) T ss_pred CC--cEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcCc Confidence 11 146666665667679999999888888888888888777777777887766655555544321100 Q ss_pred -C-cce--ecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhccCC-CCCCCHHHHHHHHHHHHHHhhhh Q lcl|NC_020414. 312 -T-GEV--ITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTRRD-AERVTAVEIQRDALEIEQNMGGV 384 (515) Q Consensus 312 -~-g~~--~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~~~-~~~~TAtEi~~r~~E~~~~LGpv 384 (515) + |.+ +++. -...++.+ +..|.+. .+..+..+..|-++|-.. ++...+ +..-++++.. ..=....|.|. T Consensus 247 ~nag~v~vL~~G-~~~~~l~~-s~~D~q~-le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~--~~f~~~tL~P~ 321 (518) T protein:vir:10 247 SNTGKTMVVEEG-MEPIPLQL-TAVEMQF-IEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM--RAFYRDTMAIP 321 (518) T ss_pred cccCcceEcCCC-ceEEEccC-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHH--HHHHHHHHHHH Confidence 0 111 1111 11222222 1234443 344455667788888321 122222 2111222211 11122334555 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeee-hHHHHHHHHHHHHHHHHHHHHHhhcC-ChHHHhcCCHHHHHHH Q lcl|NC_020414. 385 YSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTG-IEALGRMAELDKLANFAQYMSLPQTW-PEPAQRAIRWGDYMDW 462 (515) Q Consensus 385 ~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~-l~~l~ra~~~~~l~~~~~~v~~~a~~-~p~~~d~id~d~~~~~ 462 (515) +.++..+|-.-|+ +... ...++.+ ++.|-|.--.+........++. .-+ +-++...++.+.+=+. T Consensus 322 l~~ie~~ln~~L~--------~~~~----~~~~~~fd~~~llr~D~~~r~~~~~~~~~~-G~lT~NE~R~~~Gl~pie~~ 388 (518) T protein:vir:10 322 IARIQSAMDKYVG--------QYWV----RKNRMKFDIDDVIQPDWEAKSESTQKMVNS-GVATPNEGREIMGLPRSDDP 388 (518) T ss_pred HHHHHHHHHHhhc--------cccc----CCceEEEechhhhccCHHHHHHHHHHHHhC-CCcCHHHHHHHhCCCCCCCC Confidence 5555554433221 1111 1112221 2222221111111111111110 001 1122222221111000 Q ss_pred HHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchh-hhhhccC Q lcl|NC_020414. 463 VRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVI-QQEMKEG 515 (515) Q Consensus 463 ~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~-~~~~~~~ 515 (515) -++.+-++..+. +-+.. -++...-+ +.-..+...+.+..- .+...++ T Consensus 389 ~gD~~~~~~n~~-pl~~~---~~~~~~g~--~~~~~~~~~~~~~~~~~~~~~~~ 436 (518) T protein:vir:10 389 KADELYANSALQ-PLGAT---PDGAVEGE--EAPAPKRPASTPVASLDQSPPTS 436 (518) T ss_pred CCCeeeecccce-ecccc---cccccCCC--CCCCCCCCCcccccccccccccc Confidence 011110111100 00000 00000000 000000000000000 0111111 No 150 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=36.95 E-value=1.1 Score=20.26 Aligned_cols=416 Identities=11% Similarity=0.083 Sum_probs=173.8 Q ss_pred CCCcc----------------ccccccHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcc-----ccc---CCCCCC---cc Q lcl|NC_020414. 1 MQDTI----------------LEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLP-----YLM---NNKGDN---ET 53 (515) Q Consensus 1 ~~~~~----------------~~~~~~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P-----~~~---~~~~~~---~~ 53 (515) |++-. ...-...+.+.+..+..+. |-+.. +.+.+|..- .+- ...... .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~~~~~---~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKP-KIDDI---TVGERYYNHDPDVLRLAPKLDNKGEIDPLKP 76 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHH-HHHHH---HHHHHHhccCCcchhccchhccccccccccc Confidence 43321 0011134444455554443 22333 344444322 111 110110 11 Q ss_pred ccccccccHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHH Q lcl|NC_020414. 54 SQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVE 133 (515) Q Consensus 54 ~~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~ 133 (515) ..|+..+-+...++..++-|++ -|+ .++..+... ...+++|+ ..||.....+ T Consensus 77 ~~ki~~n~~~~Ivd~~~~~l~g--~p~-----~~~~~d~~~---------~~~l~~~~------------~n~~~~~~~~ 128 (474) T protein:vir:96 77 DWRMFTNYHQNLVDQKVAYAVA--NPV-----TFSSDDDKS---------LKTIQEVL------------NHKWDDKLVD 128 (474) T ss_pred chhcccchHHHHHHhhhhhhcc--cCc-----eeecCchHH---------HHHHHHHH------------hcCHHHHHHH Confidence 1234455566666666655544 121 223333221 11233332 3578888999 Q ss_pred HHHHHHhhCceEE--EEeCCCcEE--EEEcce-EEEeeC-CCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcc Q lcl|NC_020414. 134 VFKHLIVAGNCLL--YKPSKGAMS--AVPMHH-YVVNRD-TNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDD 207 (515) Q Consensus 134 ~~~dl~~~G~~~l--~~d~~~~~r--~~pl~~-y~v~~d-~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~ 207 (515) +.++...+|.+.+ |.|.+..++ +++..+ |.+--| ..+.+...+|.++.. ... T Consensus 129 ~~~~~~~~G~~~~~~y~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~----------------------~~~ 186 (474) T protein:vir:96 129 ILTAASNKGIEWLQPYIDENGEFKTFRVPAEQAIPIWTNKERDTLKAFIRYYRLD----------------------GAE 186 (474) T ss_pred HHHHHHhcCeeEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeec----------------------Cce Confidence 9999999998764 566665544 454444 444333 367776666665421 011 Q ss_pred cEEEEE------EEEEcCCCCeEEEE----EeCCeeec-ccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHH Q lcl|NC_020414. 208 NVKLYT------HAQYAGEGFWKINQ----SADDIPVG-KENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQ 276 (515) Q Consensus 208 ~v~v~~------~v~~~~~~~~~~~~----e~~~~~i~-~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~ 276 (515) .+++|+ ....+......... ...+..+. ..-+| ..+|++.++. +.+|+|=.+...+-+-.++ T Consensus 187 ~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--g~iPvv~~~n-----n~~g~sd~e~v~~liDa~d 259 (474) T protein:vir:96 187 RVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSW--GRVPFIPFKN-----NPQEMSDLFMYKTIIDAMD 259 (474) T ss_pred EEEEEeCCeEEEEEecCCceeeccccccccccccccccccccCC--CceeEEEecc-----CCCCCCcHHHHHHHHHHHH Confidence 123332 11111110000000 00111122 22334 4588887765 4579998899999999999 Q ss_pred HHHHHHHHHHHHhccCceeecCccccChhhc-cCCCC-cce-ecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_020414. 277 FLSEAVARGAALMADIKYLIRPGSQTDVDHF-VNSGT-GEV-ITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFM 353 (515) Q Consensus 277 ~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~-~~~~~-g~~-~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl 353 (515) .+.-......+....|.+.+.-.+.-+.... ..... +.+ +++..+++..+. ...+.+.....++.+++.|-..-. T Consensus 260 ~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~ 337 (474) T protein:vir:96 260 KRLSDTQNTFDESTELIYILKGYEGQDLDEFMRNLKYYKAINVDGDGSGVDTIQ--IEVPVQSSKEYLDMLRDYVIEFGQ 337 (474) T ss_pred HHHHHHHHHHHHhccceeeeecCCcccccchhhhhhcCceEEecCCCCceeEEe--ecCChHHHHHHHHHHHHHHHHHhC Confidence 8888888888888888655432111111111 11111 222 334444455443 334666677777777766644321 Q ss_pred -HHhhccCCCCCCCHHHHHH-------HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhcccee--eeehHH Q lcl|NC_020414. 354 -METMTRRDAERVTAVEIQR-------DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI--VTGIEA 423 (515) Q Consensus 354 -~~~l~~~~~~~~TAtEi~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~--v~~l~~ 423 (515) .+......+...|+..+.. .+.++...++..+.+ +++.++.-.+.......+.+.+ ..+.+. T Consensus 338 ~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~--------~~~~i~~~~~~~~~~~~i~i~f~~~~p~~~ 409 (474) T protein:vir:96 338 GVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQE--------LLQYIIDFYKLNIKVQDVEITFNFNVMVNE 409 (474) T ss_pred CccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHhCCCcccceeeEEeccCCCcCH Confidence 1110011112345554432 223444444444333 3333332222222112233332 122222 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_020414. 424 LGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKA 503 (515) Q Consensus 424 l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a 503 (515) +..++ .+.+ + ..+.-..++..+ -+++ -.++|++.+.+++++..+... ..-+ - T Consensus 410 ~e~~~----------~~~~-a-------g~iS~et~~~~~---~~v~----d~~~E~~ri~~E~~e~~~~~~--~~~~-~ 461 (474) T protein:vir:96 410 LEQSQ----------IGVQ-S-------QYLSKETVVTNH---PWVD----DPVAELERIEQDNIDFNKQLP--PLEG-D 461 (474) T ss_pred HHHHH----------HHHh-c-------CCCchHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHhccc--cccc-c Confidence 22111 1111 1 123334444332 1221 124566666554433222110 0000 0 Q ss_pred ccchhhhhhccC Q lcl|NC_020414. 504 VPGVIQQEMKEG 515 (515) Q Consensus 504 ~~~~~~~~~~~~ 515 (515) ..++..++-++- T Consensus 462 ~~~~~~d~~~e~ 473 (474) T protein:vir:96 462 ANGRAQDNESET 473 (474) T ss_pred cccccCCCcccC Confidence 111111111111 No 151 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=34.82 E-value=1.3 Score=20.02 Aligned_cols=447 Identities=9% Similarity=-0.002 Sum_probs=175.5 Q ss_pred CCCcccccccc-HH---HHHHHHHHHHHhhhhHHH--HHHHHHHhh-cccccCCCCC--------Ccccccc--ccccHH Q lcl|NC_020414. 1 MQDTILEYGGQ-RS---KIPKLWEKFSKKRSPYLD--RAKHFAKLT-LPYLMNNKGD--------NETSQNG--WQGVGA 63 (515) Q Consensus 1 ~~~~~~~~~~~-~~---~l~~r~~~lk~~R~~~e~--~w~e~~~~~-~P~~~~~~~~--------~~~~~~~--~dst~~ 63 (515) |+.-..+..+- |- ....+....+..++.|+. .-+....|. .|.....+.. +.+++.+ -++.+. T Consensus 1 ~~r~~~~~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~ 80 (505) T protein:vir:96 1 MKRAEKKPSLAQRMVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAK 80 (505) T ss_pred CCCCccccchhhcccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHH Confidence 44322222100 00 011111112222222321 001111111 1211111100 0111223 367888 Q ss_pred HHHHHHHHHHHH--hhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh Q lcl|NC_020414. 64 QATNHLANKLAQ--VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVA 141 (515) Q Consensus 64 ~a~~~Laa~l~s--~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~ 141 (515) .+++.+++.+++ +++|..++..+....++.+++. ....-+.|.+. .-+.+-.+.+||.....++...+.- T Consensus 81 ~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~-----ie~~w~~Wa~~---~~~D~~g~~~f~~lq~l~~r~~~~d 152 (505) T protein:vir:96 81 RFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTL-----IEGNWQQWIKK---GNCDVTGRYHFVTLLHLWMETLARD 152 (505) T ss_pred HHHHHHHHHhcCCCcceeeecCCcccccccHHHHHH-----HHHHHHHhcCC---cCcceeccCCHHHHHHHHHHHHhhC Confidence 999999999995 8999988876655444443321 11222334221 1123344568999999999999999 Q ss_pred CceEEEEeCCCcEEEEEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCC Q lcl|NC_020414. 142 GNCLLYKPSKGAMSAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEG 221 (515) Q Consensus 142 G~~~l~~d~~~~~r~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~ 221 (515) |-+++-.-.. ..+.+- ++-.-+-.+.|..... ...+ ..-.|..-|+.+..| T Consensus 153 GE~f~~~~~~----------------~~~~~~--~~lqliepd~l~~~~n----------~~~~-~~~~i~~GIe~d~~G 203 (505) T protein:vir:96 153 GEVLVREHRG----------------YPNKWG--YALQILECDRLDLNYN----------ADLQ-NGNRIRMSIELDAWE 203 (505) T ss_pred CceEEEEeec----------------CCCCcc--eEEEEechhhcCCCCC----------cccC-CcCeEEeceEECCCC Confidence 9876422111 011111 1111122222211110 0001 112366778888777 Q ss_pred CeEEEEEeCC---ee-ec--ccCCcccccCc--EEEEee-eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_020414. 222 FWKINQSADD---IP-VG--KENRIKAEKLP--FIPLTW-KRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADI 292 (515) Q Consensus 222 ~~~~~~e~~~---~~-i~--~esgy~~~~~P--~~~~Rw-~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p 292 (515) ...-|+-.+. .. .. ..+..+ ...| -+..-| ...+|..=|.+..--+|..++.|+....+.+.++..++.. T Consensus 204 r~~aY~i~~~hPgd~~~~~~~~~~~~-~rvpa~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~ 282 (505) T protein:vir:96 204 RPVAYHLLVNHPGDNSYCYHYAGQTY-ERVPADEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKK 282 (505) T ss_pred ceEEEEEeecCCCccccccccccccc-cccCHhHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhh Confidence 6554443221 11 11 111111 1234 223333 3458888899999999999999999999999999999887 Q ss_pred ceeecCc-c-ccChhh------ccCCCCcceecCCcc-cccccccC-CccchHHHHHHHHHHHHHHHHHH--HHHhhccC Q lcl|NC_020414. 293 KYLIRPG-S-QTDVDH------FVNSGTGEVITGVEE-DIHIVQLG-KYADLTPISAVLEVYTRRIGVIF--MMETMTRR 360 (515) Q Consensus 293 ~~l~~~~-g-~~~~~~------~~~~~~g~~~~g~~~-~v~~~~~~-~~~~l~~~~~~i~~~~~rI~~af--l~~~l~~~ 360 (515) ...+..+ + ...+.. ....++|.+..-.++ ++.....+ ..++|.. -...+...|..++ =+..+ .. T Consensus 283 a~fi~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~---f~~~~lr~iaaglgi~ye~l-t~ 358 (505) T protein:vir:96 283 VGFYEQDPEAYDQPPEDDQGEIVEEVEAGTYQLLPYGIRFKEHKIDHPHTNFGA---FVKSSLRGVAAGMGPAYNRL-AH 358 (505) T ss_pred eeeeecCCccCCCccccccCccccccCCceeeecCCCCeeeeeCCCCCCCCHHH---HHHHHHHHHHhhcCCCHHHH-hc Confidence 7665422 1 111110 111223433222222 23333322 1234431 1122222233332 01111 12 Q ss_pred CCCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHH-----HhcCCCCChh---hccceee----eehHHHHHHH Q lcl|NC_020414. 361 DAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGL-----QEAGDSFTSE---LVDPVIV----TGIEALGRMA 428 (515) Q Consensus 361 ~~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~-----~~~~~~~p~~---~~~~~~v----~~l~~l~ra~ 428 (515) |-..++=.=++.-..|.-..+--.=..+..-|+.|+..+.+ .|.++.+... ..+...+ ..++|+-.++ T Consensus 359 D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~ 438 (505) T protein:vir:96 359 DLEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSK 438 (505) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHH Confidence 33334433333333333333332223444556777666543 3555443211 1122221 2234443332 Q ss_pred HHHH-HHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchhccCCHHHHHHHHHHHHHHHHHHHH-HHHhhhhccc Q lcl|NC_020414. 429 ELDK-LANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAML-NEGVAKAVPG 506 (515) Q Consensus 429 ~~~~-l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~-~~~~~~a~~~ 506 (515) .... |..-+. .. ..+....|.+ .++|-..+....+...+.=+ .......... T Consensus 439 a~~~~i~~G~~------t~--------------~~~~a~~G~D------~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~ 492 (505) T protein:vir:96 439 AHSESIKNRTR------SR--------------SSIIRAAGDD------PEDVFDEIAWEEQLMRDKGVNPTPPEQESKD 492 (505) T ss_pred HHHHHHHcCCC------CH--------------HHHHHHcCCC------HHHHHHHHHHHHHHHHHcCCCCCCCCCCCCC Confidence 2211 110000 00 0011111221 12222211111111100000 0000000000 Q ss_pred hhhhhhccC Q lcl|NC_020414. 507 VIQQEMKEG 515 (515) Q Consensus 507 ~~~~~~~~~ 515 (515) +..++-.++ T Consensus 493 ~~~~~~~~~ 501 (505) T protein:vir:96 493 ATTDEEDDS 501 (505) T ss_pred CCCCCCCCC Confidence 001111111 No 152 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=31.59 E-value=1.5 Score=19.64 Aligned_cols=377 Identities=11% Similarity=0.047 Sum_probs=137.3 Q ss_pred ccccccHHHHHHHHHHHHHhhhh-HHHH-HHHHHHhhcccccCCCCCCcccccccc-ccHHHHHHHHHHHHHHhhcCCCC Q lcl|NC_020414. 6 LEYGGQRSKIPKLWEKFSKKRSP-YLDR-AKHFAKLTLPYLMNNKGDNETSQNGWQ-GVGAQATNHLANKLAQVLFPAQR 82 (515) Q Consensus 6 ~~~~~~~~~l~~r~~~lk~~R~~-~e~~-w~e~~~~~~P~~~~~~~~~~~~~~~~d-st~~~a~~~Laa~l~s~ltpp~~ 82 (515) |-+ |...+ +|+. .-.. |..... ++|......+..-....-.. ++--.|++.+|+.+.+ - T Consensus 1 Mg~----------f~~~~-~r~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~------~ 62 (416) T protein:vir:45 1 MGI----------FYKNE-KRDLQYNEDDLQMMVQ-TLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLAR------M 62 (416) T ss_pred CCc----------ccccc-cccccCCCcchhHHHH-HhccccccCccccchhhhhcchHHHHHHHHHHHhhcc------C Confidence 222 22111 2211 1111 112111 23321111111111111112 2222356666555543 2 Q ss_pred CceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCceEEEEeCCC---cE Q lcl|NC_020414. 83 SFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALE-QRQ----FRPAIVEVFKHLIVAGNCLLYKPSKG---AM 154 (515) Q Consensus 83 ~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~~d~~~---~~ 154 (515) || ++.-.... .... .++..|+ +=| .+.-...++.++..+||+.+++..+. .. T Consensus 63 p~-~~~~~~~~------------~~~~-------~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~ 122 (416) T protein:vir:45 63 PI-RVTVNGQI------------NYSD-------RIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPM 122 (416) T ss_pred ce-EEecCccc------------cccc-------hHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE Confidence 43 44321110 0111 1222232 222 33445667777888999998875432 23 Q ss_pred EEEEc--ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCe Q lcl|NC_020414. 155 SAVPM--HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDI 232 (515) Q Consensus 155 r~~pl--~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~ 232 (515) ..||| ..+.+..|.+|++--.|. .+++. T Consensus 123 ~L~~i~~~~v~v~~~~~g~~~~~~~--------------------------------------------------~~~~~ 152 (416) T protein:vir:45 123 NLTFRKTSEIELKSDARGRLYYFHQ--------------------------------------------------RIDSN 152 (416) T ss_pred EEEEEcCceeEEEECCCccEEEEEE--------------------------------------------------EecCC Confidence 45555 566667777765321110 00000 Q ss_pred eecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccc-cChhh---cc Q lcl|NC_020414. 233 PVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQ-TDVDH---FV 308 (515) Q Consensus 233 ~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~-~~~~~---~~ 308 (515) -......|+.+ -+++.|+...+ ..||.||..-+...+.......+.......-...|..++.-++. .+++. +. T Consensus 153 ~~~~~~~~~~~--evihir~~~~d-~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~ 229 (416) T protein:vir:45 153 GNNIERNVKFE--DMLDIKFYSLD-GINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAR 229 (416) T ss_pred CceeEEEEccc--cEEEeccCCCC-CccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHH Confidence 00000011111 23555655444 48999999999988888888888877777777777766543333 33321 11 Q ss_pred C----CCC-----cc--eecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhccCCCCCCCHHHHHHHHH Q lcl|NC_020414. 309 N----SGT-----GE--VITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTRRDAERVTAVEIQRDAL 375 (515) Q Consensus 309 ~----~~~-----g~--~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~~~~~~~TAtEi~~r~~ 375 (515) . .-. |. ++++.. +..++... ..+.+. .+.....+..|-.+|-.- .+.. +...-+.+|. .. T Consensus 230 ~~~~~~~~g~~nag~~~vl~~g~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~-~~~~~~~~~~---~~ 302 (416) T protein:vir:45 230 EEFHKSFSGTKQAGKVVVLDESM-TFDQLEVD-TEVLKL-IRENKSSTREIAGVFGIPLHKFGI-ETANMSITDA---NL 302 (416) T ss_pred HHHHHHhcCccccCceeecCCCc-eeEeccCC-HHHHHH-HHHHHHHHHHHHHHhCCCHHHcCC-CCCCccHHHH---HH Confidence 1 001 11 111111 22222211 122332 344455667788888321 1221 2222222222 12 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeee-hHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcC Q lcl|NC_020414. 376 EIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTG-IEALGRMAELDKLANFAQYMSLPQTWPEPAQRAI 454 (515) Q Consensus 376 E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~-l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~i 454 (515) .....|-|....+..||-.-|. +.-.. .++++ ++.|.|.-...........++ .. .+ T Consensus 303 ~~~~~l~P~~~~ie~~ln~~l~--------~~~~~-----~~~~f~~~~l~~~D~~~~~~~~~~~~~--~G-------~~ 360 (416) T protein:vir:45 303 DYLSTLKPYITCVCAELNFKFN--------DEYVN-----REFKFDTTEIRVVDEKTQAEIDKINID--SG-------KM 360 (416) T ss_pred HHHHHHHHHHHHHHHHHhhhcc--------ccccC-----ceEEEechhhhccCHHHHHHHHHHHHh--CC-------Cc Confidence 2333455555555555443321 11111 11221 222322211111111111111 00 12 Q ss_pred CHHHHHHHHHHhcCCchh-------ccCCHHHHHH-HHHHHHHHHHHHHHHHHhhhhccchhhhhhcc Q lcl|NC_020414. 455 RWGDYMDWVRGQISAELP-------FLKSEEEMQQ-EMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKE 514 (515) Q Consensus 455 d~d~~~~~~a~~~Gvp~~-------~irs~eev~~-~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~ 514 (515) ..++ +-+.+|.|+- +..+-.-+.. +..+ .+ ..........+-||+--| T Consensus 361 T~NE----~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~-------~~-~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:45 361 NIDE----IRQRDGLAPIPGGNGSIHRVDLNHVNIELVDE-------YQ-MNKSRATDKKLKGGEENE 416 (416) T ss_pred CHHH----HHHHhCCCCCCCCCcceEeecccccccccccc-------cC-cccccccccccCCCCCCC Confidence 2222 2223344330 0000000000 0000 00 001111111122222222 No 153 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=31.59 E-value=1.5 Score=19.64 Aligned_cols=377 Identities=11% Similarity=0.047 Sum_probs=137.3 Q ss_pred ccccccHHHHHHHHHHHHHhhhh-HHHH-HHHHHHhhcccccCCCCCCcccccccc-ccHHHHHHHHHHHHHHhhcCCCC Q lcl|NC_020414. 6 LEYGGQRSKIPKLWEKFSKKRSP-YLDR-AKHFAKLTLPYLMNNKGDNETSQNGWQ-GVGAQATNHLANKLAQVLFPAQR 82 (515) Q Consensus 6 ~~~~~~~~~l~~r~~~lk~~R~~-~e~~-w~e~~~~~~P~~~~~~~~~~~~~~~~d-st~~~a~~~Laa~l~s~ltpp~~ 82 (515) |-+ |...+ +|+. .-.. |..... ++|......+..-....-.. ++--.|++.+|+.+.+ - T Consensus 1 Mg~----------f~~~~-~r~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~------~ 62 (416) T protein:vir:81 1 MGI----------FYKNE-KRDLQYNEDDLQMMVQ-TLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLAR------M 62 (416) T ss_pred CCc----------ccccc-cccccCCCcchhHHHH-HhccccccCccccchhhhhcchHHHHHHHHHHHhhcc------C Confidence 222 22111 2211 1111 112111 23321111111111111112 2222356666555543 2 Q ss_pred CceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHH-hcC----CHHHHHHHHHHHHhhCceEEEEeCCC---cE Q lcl|NC_020414. 83 SFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALE-QRQ----FRPAIVEVFKHLIVAGNCLLYKPSKG---AM 154 (515) Q Consensus 83 ~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~dl~~~G~~~l~~d~~~---~~ 154 (515) || ++.-.... .... .++..|+ +=| .+.-...++.++..+||+.+++..+. .. T Consensus 63 p~-~~~~~~~~------------~~~~-------~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~ 122 (416) T protein:vir:81 63 PI-RVTVNGQI------------NYSD-------RIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPM 122 (416) T ss_pred ce-EEecCccc------------cccc-------hHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE Confidence 43 44321110 0111 1222232 222 33445667777888999998875432 23 Q ss_pred EEEEc--ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCe Q lcl|NC_020414. 155 SAVPM--HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDI 232 (515) Q Consensus 155 r~~pl--~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~ 232 (515) ..||| ..+.+..|.+|++--.|. .+++. T Consensus 123 ~L~~i~~~~v~v~~~~~g~~~~~~~--------------------------------------------------~~~~~ 152 (416) T protein:vir:81 123 NLTFRKTSEIELKSDARGRLYYFHQ--------------------------------------------------RIDSN 152 (416) T ss_pred EEEEEcCceeEEEECCCccEEEEEE--------------------------------------------------EecCC Confidence 45555 566667777765321110 00000 Q ss_pred eecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccc-cChhh---cc Q lcl|NC_020414. 233 PVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQ-TDVDH---FV 308 (515) Q Consensus 233 ~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~-~~~~~---~~ 308 (515) -......|+.+ -+++.|+...+ ..||.||..-+...+.......+.......-...|..++.-++. .+++. +. T Consensus 153 ~~~~~~~~~~~--evihir~~~~d-~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~ 229 (416) T protein:vir:81 153 GNNIERNVKFE--DMLDIKFYSLD-GINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAR 229 (416) T ss_pred CceeEEEEccc--cEEEeccCCCC-CccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHH Confidence 00000011111 23555655444 48999999999988888888888877777777777766543333 33321 11 Q ss_pred C----CCC-----cc--eecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhccCCCCCCCHHHHHHHHH Q lcl|NC_020414. 309 N----SGT-----GE--VITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTRRDAERVTAVEIQRDAL 375 (515) Q Consensus 309 ~----~~~-----g~--~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~~~~~~~TAtEi~~r~~ 375 (515) . .-. |. ++++.. +..++... ..+.+. .+.....+..|-.+|-.- .+.. +...-+.+|. .. T Consensus 230 ~~~~~~~~g~~nag~~~vl~~g~-~~~~l~~~-~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~-~~~~~~~~~~---~~ 302 (416) T protein:vir:81 230 EEFHKSFSGTKQAGKVVVLDESM-TFDQLEVD-TEVLKL-IRENKSSTREIAGVFGIPLHKFGI-ETANMSITDA---NL 302 (416) T ss_pred HHHHHHhcCccccCceeecCCCc-eeEeccCC-HHHHHH-HHHHHHHHHHHHHHhCCCHHHcCC-CCCCccHHHH---HH Confidence 1 001 11 111111 22222211 122332 344455667788888321 1221 2222222222 12 Q ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeee-hHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcC Q lcl|NC_020414. 376 EIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTG-IEALGRMAELDKLANFAQYMSLPQTWPEPAQRAI 454 (515) Q Consensus 376 E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~-l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~i 454 (515) .....|-|....+..||-.-|. +.-.. .++++ ++.|.|.-...........++ .. .+ T Consensus 303 ~~~~~l~P~~~~ie~~ln~~l~--------~~~~~-----~~~~f~~~~l~~~D~~~~~~~~~~~~~--~G-------~~ 360 (416) T protein:vir:81 303 DYLSTLKPYITCVCAELNFKFN--------DEYVN-----REFKFDTTEIRVVDEKTQAEIDKINID--SG-------KM 360 (416) T ss_pred HHHHHHHHHHHHHHHHHhhhcc--------ccccC-----ceEEEechhhhccCHHHHHHHHHHHHh--CC-------Cc Confidence 2333455555555555443321 11111 11221 222322211111111111111 00 12 Q ss_pred CHHHHHHHHHHhcCCchh-------ccCCHHHHHH-HHHHHHHHHHHHHHHHHhhhhccchhhhhhcc Q lcl|NC_020414. 455 RWGDYMDWVRGQISAELP-------FLKSEEEMQQ-EMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKE 514 (515) Q Consensus 455 d~d~~~~~~a~~~Gvp~~-------~irs~eev~~-~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~ 514 (515) ..++ +-+.+|.|+- +..+-.-+.. +..+ .+ ..........+-||+--| T Consensus 361 T~NE----~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~-------~~-~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:81 361 NIDE----IRQRDGLAPIPGGNGSIHRVDLNHVNIELVDE-------YQ-MNKSRATDKKLKGGEENE 416 (416) T ss_pred CHHH----HHHHhCCCCCCCCCcceEeecccccccccccc-------cC-cccccccccccCCCCCCC Confidence 2222 2223344330 0000000000 0000 00 001111111122222222 No 154 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=31.54 E-value=1.5 Score=19.63 Aligned_cols=400 Identities=13% Similarity=0.051 Sum_probs=152.0 Q ss_pred HHHHHHHHHhhcccccCCCCCCcccccc-----c-------cccHHHHHHHHHHHHHHhhcCCC---CCceecCCChHHH Q lcl|NC_020414. 30 LDRAKHFAKLTLPYLMNNKGDNETSQNG-----W-------QGVGAQATNHLANKLAQVLFPAQ---RSFFRVDLTAKGE 94 (515) Q Consensus 30 e~~w~e~~~~~~P~~~~~~~~~~~~~~~-----~-------dst~~~a~~~Laa~l~s~ltpp~---~~WFrl~~~d~~~ 94 (515) ...-.-+..++. +-+....+. | +=.+....+-|+.++... |+. +.|+.+...+.. T Consensus 1 ~~~~D~~~~~~~-------~~g~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~vd~--~a~d~~r~~~~i~~~d~~- 70 (437) T protein:vir:52 1 MKFFDGIKSLAL-------KLGSKQEQTYYSPSLSLTDDLVQLEALWRDNWIANKVCIK--RPEDMVRNWREIYSNDLN- 70 (437) T ss_pred CchhhhhHhHHh-------cCCCccccceeecCccccccHHHHHHHHHhCchhhHHhhc--chHHhhcCCceEecCCCC- Confidence 111111111111 101111111 1 111223344455555554 333 688888653311 Q ss_pred hhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEEEeCCCcEEEEEcceEEEeeCCCCCeeE Q lcl|NC_020414. 95 KVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSKGAMSAVPMHHYVVNRDTNGDLMD 174 (515) Q Consensus 95 ~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~~~r~~pl~~y~v~~d~~G~vd~ 174 (515) ...++ .+.+.+.+-++...+.++++.--.||.+++++..+..--.-|+. ..|.+.. T Consensus 71 ---------~~~~~--------~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~-------~~~~~~~ 126 (437) T protein:vir:52 71 ---------SKQLD--------LFTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLK-------PTERLKR 126 (437) T ss_pred ---------HHHHH--------HHHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCcccccc-------cCCceeE Confidence 11111 23344555678899999999888899998887544321122331 1233322 Q ss_pred EEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCcccccCcEEEEeeee Q lcl|NC_020414. 175 VILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRIKAEKLPFIPLTWKR 254 (515) Q Consensus 175 i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy~~~~~P~~~~Rw~~ 254 (515) + +. +...++....-.+. +.. ...+.+...|... . ......|| -.+++.-.|. ..| . T Consensus 127 ~-~v--~~~~~v~~~~~~~~-dp~----s~~fg~p~~y~v~-~-~~~~~~iH----~SRii~~~~~---~~~-------~ 182 (437) T protein:vir:52 127 L-II--LPKWKISPTGTKDD-DVL----SPNFGRYSEYSIL-G-GSQSITVH----HSRLIILNAN---DAP-------L 182 (437) T ss_pred E-EE--echhhccccccccc-ccc----ccccCcceEEEEe-c-CCcceeEc----cceeEEecCc---cCC-------C Confidence 1 11 11111100000000 000 0001111122211 0 00001111 1122221111 112 2 Q ss_pred cCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecC-cccc---------C-hhhc--cCCCCcceecCCcc Q lcl|NC_020414. 255 SYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRP-GSQT---------D-VDHF--VNSGTGEVITGVEE 321 (515) Q Consensus 255 ~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~-~g~~---------~-~~~~--~~~~~g~~~~g~~~ 321 (515) ..+.-||+|+.+-.+..++..+.......+.+..+....+-++- ...+ . .+.+ ..+..|.++-+..+ T Consensus 183 ~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 262 (437) T protein:vir:52 183 SDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKIDIFKIAGLSDKIAAGMENEVASVISAVQEIKSATNSLLLDAEN 262 (437) T ss_pred ccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCCceecchHHHHhcCCcHHHHHHHHHHHHHhcCCCceEEEcCCc Confidence 33667899999999999999998888887776665544443320 0000 0 0000 01122333333334 Q ss_pred cccccccCCccchHHHHHHHHHHHHHHHHHH---HHHhhccCCCCCC-CHH-HHHHHHHHHHHHhhhhHHHHHHHHHHHH Q lcl|NC_020414. 322 DIHIVQLGKYADLTPISAVLEVYTRRIGVIF---MMETMTRRDAERV-TAV-EIQRDALEIEQNMGGVYSLFAMTMQTPI 396 (515) Q Consensus 322 ~v~~~~~~~~~~l~~~~~~i~~~~~rI~~af---l~~~l~~~~~~~~-TAt-Ei~~r~~E~~~~LGpv~~rl~~E~l~Pl 396 (515) ++..+.. +|.-+...+....+.|..++ ..-.+.+..+ .+ |.+ +++.-.. -+..++...+.|+ T Consensus 263 ~~e~~~~----~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~-Glasge~D~~~yyd--------~i~~~Qe~~l~p~ 329 (437) T protein:vir:52 263 EYDRKEL----TFTGLKDLLTEFRNAVAGAADMPVTILFGQSVS-GLASGDEDIQNYHE--------AIRRLQETRLRPI 329 (437) T ss_pred ceEEEec----CcCCHHHHHHHHHHHHHHHhcCchhhhcCcCcc-cccccHHHHHHHHH--------HHHHHHHHHHHHH Confidence 4444332 34445566777788888776 2112233222 23 212 3222111 1333444456666 Q ss_pred HHHH----HHhcCCCCChhhccceeeeehHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHhcCCchh Q lcl|NC_020414. 397 AMWG----LQEAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELP 472 (515) Q Consensus 397 i~r~----~~~~~~~~p~~~~~~~~v~~l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a~~~Gvp~~ 472 (515) ++++ +...+..+|+ .+...+ .+|..+.-..+++......+.+..+.+. ..++++++.+.+.+....+ . T Consensus 330 le~l~~~i~~~~~g~~~~-~~~~~f-~pL~~~s~kekae~~~~~a~a~~~~~~~-----g~i~~~e~r~~L~~~g~~~-~ 401 (437) T protein:vir:52 330 FEIIDPLICNELFGGLPA-DWWFEF-VPLTTVKQEQQINMLNTFATAANTLIQN-----GVLNEYQIANELRESGLFA-N 401 (437) T ss_pred HHHHHHHHHHHhcCCCCC-cceEEe-CCcCCcCHHHHHHHHHHHHHHHHHHHhc-----CCCCHHHHHHHHHhcCCCC-C Confidence 6553 2333344443 233322 1333333333332222222222222221 2466777777665532222 1 Q ss_pred ccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhcc Q lcl|NC_020414. 473 FLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKE 514 (515) Q Consensus 473 ~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~ 514 (515) + ++++++..........+.++ .....+...+....| T Consensus 402 i--~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 402 I--SAEHIEELKNADEFAGNFEE----PEKMEGAQVQNSEDQ 437 (437) T ss_pred C--CccccccccCCCCCCCccCC----CCCCCCCCCCCCCCC Confidence 1 22222221110000000000 000000001111111 No 155 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=25.72 E-value=2 Score=18.90 Aligned_cols=257 Identities=8% Similarity=0.011 Sum_probs=107.2 Q ss_pred CCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHH-HHhcCCHHHHHHHHHHHHhhCceEEEEeCCC-c--E Q lcl|NC_020414. 79 PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKA-LEQRQFRPAIVEVFKHLIVAGNCLLYKPSKG-A--M 154 (515) Q Consensus 79 pp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~-l~~snf~~~~~~~~~dl~~~G~~~l~~d~~~-~--~ 154 (515) =++-||--..- ++ ..+..|. ..+... -...+.+.=+..++.++...|||++++..+. + . T Consensus 1 ia~l~~~~~~~-~~-------------~~~~~l~---~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~ 63 (278) T protein:vir:78 1 MASLPLKMYED-YK-------------VVNTEVS---DLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPS 63 (278) T ss_pred CccceeEEEec-Cc-------------ccccHHH---HHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEE Confidence 01233321110 00 0111111 111100 0112344556777788889999988875332 2 3 Q ss_pred EEEEc--ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCe Q lcl|NC_020414. 155 SAVPM--HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDI 232 (515) Q Consensus 155 r~~pl--~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~ 232 (515) ..+|| ..+-+..+.+|.. +++.+. ..+|. T Consensus 64 ~l~~l~~~~v~v~~~~~~~~--~~y~~~-----------------------------------------------~~~g~ 94 (278) T protein:vir:78 64 KLFLLNPDVVEMLIENQSRE--LYYSIH-----------------------------------------------AATGN 94 (278) T ss_pred EEEEECCceeEEEEcCCCce--EEEEEE-----------------------------------------------cCCce Confidence 45555 3344444443321 111110 00111 Q ss_pred eecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccC--- Q lcl|NC_020414. 233 PVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVN--- 309 (515) Q Consensus 233 ~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~--- 309 (515) .+ .|+.+ -+++.|.....+..||.||..-+...+...+...+..+... ...|..++..++.++.+.... T Consensus 95 ~~----~~~~~--evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~--~~~~~~i~~~~~~l~~e~~~~~~~ 166 (278) T protein:vir:78 95 KL----IVHNM--DMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEM--QKPDSFMLKYGSNVGKEKRQQVLE 166 (278) T ss_pred EE----EEccc--cEEEECCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHh--cCCCcEEEEeCCCCCHHHHHHHHH Confidence 10 11111 24555655556778999999999988888777766544322 334556665555554433211 Q ss_pred ------CCCcce--ecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhcc-CCCCCCCHHHHHHHHHHHH Q lcl|NC_020414. 310 ------SGTGEV--ITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTR-RDAERVTAVEIQRDALEIE 378 (515) Q Consensus 310 ------~~~g~~--~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~-~~~~~~TAtEi~~r~~E~~ 378 (515) ...|.+ +++.. ++.++.. +..+.+. .+..+...+.|-.+|=.. .+.. .++..-|++|... .=.. T Consensus 167 ~~~~~~~~~g~~~vl~~g~-~~~~l~~-~~~d~~~-~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~~--~~~~ 241 (278) T protein:vir:78 167 DFKQYYEENGGILFQEPGV-EIEPLPK-KYVSEDI-VASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNR--FYLQ 241 (278) T ss_pred HHHHHhccCCCceecCCCc-eEEEccC-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH--HHHH Confidence 112222 22211 2233322 1234443 444566777888888221 1222 2222334444321 1112 Q ss_pred HHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeee-hHHH Q lcl|NC_020414. 379 QNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTG-IEAL 424 (515) Q Consensus 379 ~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~-l~~l 424 (515) ..|-|...++..+|-.=|+ + +.+.....++.+ ++.| T Consensus 242 ~~l~P~~~~i~~~ln~~L~--------~--~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 242 HTLLPIVKQYEEEFNRKLL--------T--KTDREKIGILNLTLNLI 278 (278) T ss_pred HHHHHHHHHHHHHHHhhcC--------C--hhHhcCCceEEEecccC Confidence 2344444444444322211 1 122222222221 1112 No 156 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=23.80 E-value=2.3 Score=18.64 Aligned_cols=389 Identities=11% Similarity=0.002 Sum_probs=144.9 Q ss_pred ccccccHHHHHHHHHHHHHhhhhHHHH-----HHHHHHhhcccccCCCCCCccccccccccHHH-HHHHHHHHHHHhhcC Q lcl|NC_020414. 6 LEYGGQRSKIPKLWEKFSKKRSPYLDR-----AKHFAKLTLPYLMNNKGDNETSQNGWQGVGAQ-ATNHLANKLAQVLFP 79 (515) Q Consensus 6 ~~~~~~~~~l~~r~~~lk~~R~~~e~~-----w~e~~~~~~P~~~~~~~~~~~~~~~~dst~~~-a~~~Laa~l~s~ltp 79 (515) |-+ ...+.+.|. ..+|+.-... -..+..|.-. ...+-.-..+..+...+.. |++.+|+.+ +. T Consensus 1 M~~---~~r~~~~~~--~~~r~~~~~~~~~~~~~~~~~~~g~---~~~~~~v~~~~al~~~~v~~~i~~ia~~i-a~--- 68 (432) T protein:vir:10 1 MKI---VDSVKKFFN--FEKRQTSQVIELNKDDEKLLEWLGI---SPSTISVKGKNALKVATVFACIKILSESV-SK--- 68 (432) T ss_pred CCh---HHHHHHhcC--ccccCcccccccCCchHHHHHHhCC---CcCccccchhhhhccHHHHHHHHHHHHhh-cc--- Confidence 222 111111111 0122211100 0111222100 0001011112234444443 444444333 33 Q ss_pred CCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHH-hc----CCHHHHHHHHHHHHhhCceEEEEeCCC-- Q lcl|NC_020414. 80 AQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALE-QR----QFRPAIVEVFKHLIVAGNCLLYKPSKG-- 152 (515) Q Consensus 80 p~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~d~~~-- 152 (515) -||--....+....+ ..+. .+...|+ +- +.+.-+..++.++..+||+.+++..+. T Consensus 69 --lp~~~~~~~~~~~~~---------~~~~-------~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G 130 (432) T protein:vir:10 69 --LPLKIYQEDEYGIQR---------GTKH-------YLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKG 130 (432) T ss_pred --CceEEEEecCCceee---------cccc-------HHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC Confidence 244211111110000 0111 1122222 22 344456667777888999999875432 Q ss_pred -cEEEEEc--ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEe Q lcl|NC_020414. 153 -AMSAVPM--HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSA 229 (515) Q Consensus 153 -~~r~~pl--~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~ 229 (515) ....||| +...+..|..|.+..-. ...+++.. T Consensus 131 ~~~~L~~i~~~~v~v~~d~~~~~~~~~---------------------------------------------~~~y~~~~ 165 (432) T protein:vir:10 131 KVQALWPIDASKVTVYIDDVGLLNSKT---------------------------------------------KMWYVVNT 165 (432) T ss_pred cEEEEEEEcCceeEEEEcCcccccccc---------------------------------------------eEEEEEec Confidence 1334555 34444444444321100 01111112 Q ss_pred CCeeecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccC Q lcl|NC_020414. 230 DDIPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVN 309 (515) Q Consensus 230 ~~~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~ 309 (515) +|... .|+.+ -+++.|.....+..||.||..-+...+.......+.......-...|..++.-++.++++.... T Consensus 166 ~g~~~----~~~~~--eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~ 239 (432) T protein:vir:10 166 GGQQR----VLKPE--EILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKV 239 (432) T ss_pred CCeEE----EEccc--cEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHH Confidence 22111 12111 2566666556677899999999999999999999988888888888887776666555543211 Q ss_pred CC------------Ccc--eecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhcc-CCCCCCCHHHHHH Q lcl|NC_020414. 310 SG------------TGE--VITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTR-RDAERVTAVEIQR 372 (515) Q Consensus 310 ~~------------~g~--~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~-~~~~~~TAtEi~~ 372 (515) .. .|. ++++.. +..++.. +..+.+. .+..+..++.|-.+|-.. .+.. .++..-+++|... T Consensus 240 ~~~~~~~~~~g~~n~~~~~vl~~g~-~~~~l~~-~~~d~q~-~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~ 316 (432) T protein:vir:10 240 FRENFESMSSGLQNSHRIALMPVGY-QFQPISL-NMSDAQF-LENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQ 316 (432) T ss_pred HHHHHHHHhcccccCCcceecCCCc-eEEEccC-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH Confidence 00 011 122111 2223322 1234443 344566678888888321 1211 1222223333211 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeee-hHHHHHHHHHHHHHHHHHHHHHhhcCChHHH Q lcl|NC_020414. 373 DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTG-IEALGRMAELDKLANFAQYMSLPQTWPEPAQ 451 (515) Q Consensus 373 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~-l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~ 451 (515) .=....|-|...++.++|-.-|+ ++ .+.-...++.. ++.|.|.--.+........++. . T Consensus 317 --~~~~~~l~P~~~~ie~~ln~kLl--------~~--~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~--G------ 376 (432) T protein:vir:10 317 --QFYTDTLQATLTMYEQEMTYKLF--------LD--SELDKGFYSKFNVDAILRADIKTRYEAYRTGIQG--G------ 376 (432) T ss_pred --HHHHHHHHHHHHHHHHHHHHhhc--------Ch--hhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhC--C------ Confidence 11223344444444444432222 11 11111122222 2233332111111111111110 0 Q ss_pred hcCCHHHHHHHHHHhcCCch-----hccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 452 RAIRWGDYMDWVRGQISAEL-----PFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 452 d~id~d~~~~~~a~~~Gvp~-----~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) .+.++++ -+.+|.|+ .++.+..-+ -+.. ..+ +....+...++..++| T Consensus 377 -~~t~NE~----R~~~g~~pi~ggD~~~~~~n~~-~~~~-~~~----------~~~k~~~~~~~~~~~~ 428 (432) T protein:vir:10 377 -FLKPNEA----RSKEDLPPEAGGDRLLVNGNML-PIDM-AGQ----------AYLKGGDTNGEVSKEG 428 (432) T ss_pred -CcCHHHH----HHHhCCCCCCCCCeEeeccccc-chhh-ccc----------cccCCCCCCCCCCCCC Confidence 1122222 12234332 011111000 0000 000 0001112223333344 No 157 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=23.80 E-value=2.3 Score=18.64 Aligned_cols=389 Identities=11% Similarity=0.002 Sum_probs=144.9 Q ss_pred ccccccHHHHHHHHHHHHHhhhhHHHH-----HHHHHHhhcccccCCCCCCccccccccccHHH-HHHHHHHHHHHhhcC Q lcl|NC_020414. 6 LEYGGQRSKIPKLWEKFSKKRSPYLDR-----AKHFAKLTLPYLMNNKGDNETSQNGWQGVGAQ-ATNHLANKLAQVLFP 79 (515) Q Consensus 6 ~~~~~~~~~l~~r~~~lk~~R~~~e~~-----w~e~~~~~~P~~~~~~~~~~~~~~~~dst~~~-a~~~Laa~l~s~ltp 79 (515) |-+ ...+.+.|. ..+|+.-... -..+..|.-. ...+-.-..+..+...+.. |++.+|+.+ +. T Consensus 1 M~~---~~r~~~~~~--~~~r~~~~~~~~~~~~~~~~~~~g~---~~~~~~v~~~~al~~~~v~~~i~~ia~~i-a~--- 68 (432) T protein:vir:10 1 MKI---VDSVKKFFN--FEKRQTSQVIELNKDDEKLLEWLGI---SPSTISVKGKNALKVATVFACIKILSESV-SK--- 68 (432) T ss_pred CCh---HHHHHHhcC--ccccCcccccccCCchHHHHHHhCC---CcCccccchhhhhccHHHHHHHHHHHHhh-cc--- Confidence 222 111111111 0122211100 0111222100 0001011112234444443 444444333 33 Q ss_pred CCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHH-hc----CCHHHHHHHHHHHHhhCceEEEEeCCC-- Q lcl|NC_020414. 80 AQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALE-QR----QFRPAIVEVFKHLIVAGNCLLYKPSKG-- 152 (515) Q Consensus 80 p~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~d~~~-- 152 (515) -||--....+....+ ..+. .+...|+ +- +.+.-+..++.++..+||+.+++..+. T Consensus 69 --lp~~~~~~~~~~~~~---------~~~~-------~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G 130 (432) T protein:vir:10 69 --LPLKIYQEDEYGIQR---------GTKH-------YLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKG 130 (432) T ss_pred --CceEEEEecCCceee---------cccc-------HHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC Confidence 244211111110000 0111 1122222 22 344456667777888999999875432 Q ss_pred -cEEEEEc--ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEe Q lcl|NC_020414. 153 -AMSAVPM--HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSA 229 (515) Q Consensus 153 -~~r~~pl--~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~ 229 (515) ....||| +...+..|..|.+..-. ...+++.. T Consensus 131 ~~~~L~~i~~~~v~v~~d~~~~~~~~~---------------------------------------------~~~y~~~~ 165 (432) T protein:vir:10 131 KVQALWPIDASKVTVYIDDVGLLNSKT---------------------------------------------KMWYVVNT 165 (432) T ss_pred cEEEEEEEcCceeEEEEcCcccccccc---------------------------------------------eEEEEEec Confidence 1334555 34444444444321100 01111112 Q ss_pred CCeeecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccC Q lcl|NC_020414. 230 DDIPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVN 309 (515) Q Consensus 230 ~~~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~ 309 (515) +|... .|+.+ -+++.|.....+..||.||..-+...+.......+.......-...|..++.-++.++++.... T Consensus 166 ~g~~~----~~~~~--eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~ 239 (432) T protein:vir:10 166 GGQQR----VLKPE--EILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKV 239 (432) T ss_pred CCeEE----EEccc--cEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHH Confidence 22111 12111 2566666556677899999999999999999999988888888888887776666555543211 Q ss_pred CC------------Ccc--eecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhcc-CCCCCCCHHHHHH Q lcl|NC_020414. 310 SG------------TGE--VITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTR-RDAERVTAVEIQR 372 (515) Q Consensus 310 ~~------------~g~--~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~-~~~~~~TAtEi~~ 372 (515) .. .|. ++++.. +..++.. +..+.+. .+..+..++.|-.+|-.. .+.. .++..-+++|... T Consensus 240 ~~~~~~~~~~g~~n~~~~~vl~~g~-~~~~l~~-~~~d~q~-~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~ 316 (432) T protein:vir:10 240 FRENFESMSSGLQNSHRIALMPVGY-QFQPISL-NMSDAQF-LENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQ 316 (432) T ss_pred HHHHHHHHhcccccCCcceecCCCc-eEEEccC-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH Confidence 00 011 122111 2223322 1234443 344566678888888321 1211 1222223333211 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeee-hHHHHHHHHHHHHHHHHHHHHHhhcCChHHH Q lcl|NC_020414. 373 DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTG-IEALGRMAELDKLANFAQYMSLPQTWPEPAQ 451 (515) Q Consensus 373 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~-l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~ 451 (515) .=....|-|...++.++|-.-|+ ++ .+.-...++.. ++.|.|.--.+........++. . T Consensus 317 --~~~~~~l~P~~~~ie~~ln~kLl--------~~--~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~--G------ 376 (432) T protein:vir:10 317 --QFYTDTLQATLTMYEQEMTYKLF--------LD--SELDKGFYSKFNVDAILRADIKTRYEAYRTGIQG--G------ 376 (432) T ss_pred --HHHHHHHHHHHHHHHHHHHHhhc--------Ch--hhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhC--C------ Confidence 11223344444444444432222 11 11111122222 2233332111111111111110 0 Q ss_pred hcCCHHHHHHHHHHhcCCch-----hccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 452 RAIRWGDYMDWVRGQISAEL-----PFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 452 d~id~d~~~~~~a~~~Gvp~-----~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) .+.++++ -+.+|.|+ .++.+..-+ -+.. ..+ +....+...++..++| T Consensus 377 -~~t~NE~----R~~~g~~pi~ggD~~~~~~n~~-~~~~-~~~----------~~~k~~~~~~~~~~~~ 428 (432) T protein:vir:10 377 -FLKPNEA----RSKEDLPPEAGGDRLLVNGNML-PIDM-AGQ----------AYLKGGDTNGEVSKEG 428 (432) T ss_pred -CcCHHHH----HHHhCCCCCCCCCeEeeccccc-chhh-ccc----------cccCCCCCCCCCCCCC Confidence 1122222 12234332 011111000 0000 000 0001112223333344 No 158 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=23.80 E-value=2.3 Score=18.64 Aligned_cols=389 Identities=11% Similarity=0.002 Sum_probs=144.9 Q ss_pred ccccccHHHHHHHHHHHHHhhhhHHHH-----HHHHHHhhcccccCCCCCCccccccccccHHH-HHHHHHHHHHHhhcC Q lcl|NC_020414. 6 LEYGGQRSKIPKLWEKFSKKRSPYLDR-----AKHFAKLTLPYLMNNKGDNETSQNGWQGVGAQ-ATNHLANKLAQVLFP 79 (515) Q Consensus 6 ~~~~~~~~~l~~r~~~lk~~R~~~e~~-----w~e~~~~~~P~~~~~~~~~~~~~~~~dst~~~-a~~~Laa~l~s~ltp 79 (515) |-+ ...+.+.|. ..+|+.-... -..+..|.-. ...+-.-..+..+...+.. |++.+|+.+ +. T Consensus 1 M~~---~~r~~~~~~--~~~r~~~~~~~~~~~~~~~~~~~g~---~~~~~~v~~~~al~~~~v~~~i~~ia~~i-a~--- 68 (432) T protein:vir:10 1 MKI---VDSVKKFFN--FEKRQTSQVIELNKDDEKLLEWLGI---SPSTISVKGKNALKVATVFACIKILSESV-SK--- 68 (432) T ss_pred CCh---HHHHHHhcC--ccccCcccccccCCchHHHHHHhCC---CcCccccchhhhhccHHHHHHHHHHHHhh-cc--- Confidence 222 111111111 0122211100 0111222100 0001011112234444443 444444333 33 Q ss_pred CCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHH-hc----CCHHHHHHHHHHHHhhCceEEEEeCCC-- Q lcl|NC_020414. 80 AQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALE-QR----QFRPAIVEVFKHLIVAGNCLLYKPSKG-- 152 (515) Q Consensus 80 p~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~d~~~-- 152 (515) -||--....+....+ ..+. .+...|+ +- +.+.-+..++.++..+||+.+++..+. T Consensus 69 --lp~~~~~~~~~~~~~---------~~~~-------~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G 130 (432) T protein:vir:10 69 --LPLKIYQEDEYGIQR---------GTKH-------YLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKG 130 (432) T ss_pred --CceEEEEecCCceee---------cccc-------HHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC Confidence 244211111110000 0111 1122222 22 344456667777888999999875432 Q ss_pred -cEEEEEc--ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEe Q lcl|NC_020414. 153 -AMSAVPM--HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSA 229 (515) Q Consensus 153 -~~r~~pl--~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~ 229 (515) ....||| +...+..|..|.+..-. ...+++.. T Consensus 131 ~~~~L~~i~~~~v~v~~d~~~~~~~~~---------------------------------------------~~~y~~~~ 165 (432) T protein:vir:10 131 KVQALWPIDASKVTVYIDDVGLLNSKT---------------------------------------------KMWYVVNT 165 (432) T ss_pred cEEEEEEEcCceeEEEEcCcccccccc---------------------------------------------eEEEEEec Confidence 1334555 34444444444321100 01111112 Q ss_pred CCeeecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccC Q lcl|NC_020414. 230 DDIPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVN 309 (515) Q Consensus 230 ~~~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~ 309 (515) +|... .|+.+ -+++.|.....+..||.||..-+...+.......+.......-...|..++.-++.++++.... T Consensus 166 ~g~~~----~~~~~--eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~ 239 (432) T protein:vir:10 166 GGQQR----VLKPE--EILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKV 239 (432) T ss_pred CCeEE----EEccc--cEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHH Confidence 22111 12111 2566666556677899999999999999999999988888888888887776666555543211 Q ss_pred CC------------Ccc--eecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhcc-CCCCCCCHHHHHH Q lcl|NC_020414. 310 SG------------TGE--VITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTR-RDAERVTAVEIQR 372 (515) Q Consensus 310 ~~------------~g~--~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~-~~~~~~TAtEi~~ 372 (515) .. .|. ++++.. +..++.. +..+.+. .+..+..++.|-.+|-.. .+.. .++..-+++|... T Consensus 240 ~~~~~~~~~~g~~n~~~~~vl~~g~-~~~~l~~-~~~d~q~-~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~ 316 (432) T protein:vir:10 240 FRENFESMSSGLQNSHRIALMPVGY-QFQPISL-NMSDAQF-LENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQ 316 (432) T ss_pred HHHHHHHHhcccccCCcceecCCCc-eEEEccC-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH Confidence 00 011 122111 2223322 1234443 344566678888888321 1211 1222223333211 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeee-hHHHHHHHHHHHHHHHHHHHHHhhcCChHHH Q lcl|NC_020414. 373 DALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTG-IEALGRMAELDKLANFAQYMSLPQTWPEPAQ 451 (515) Q Consensus 373 r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~-l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~ 451 (515) .=....|-|...++.++|-.-|+ ++ .+.-...++.. ++.|.|.--.+........++. . T Consensus 317 --~~~~~~l~P~~~~ie~~ln~kLl--------~~--~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~--G------ 376 (432) T protein:vir:10 317 --QFYTDTLQATLTMYEQEMTYKLF--------LD--SELDKGFYSKFNVDAILRADIKTRYEAYRTGIQG--G------ 376 (432) T ss_pred --HHHHHHHHHHHHHHHHHHHHhhc--------Ch--hhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhC--C------ Confidence 11223344444444444432222 11 11111122222 2233332111111111111110 0 Q ss_pred hcCCHHHHHHHHHHhcCCch-----hccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 452 RAIRWGDYMDWVRGQISAEL-----PFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 452 d~id~d~~~~~~a~~~Gvp~-----~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) .+.++++ -+.+|.|+ .++.+..-+ -+.. ..+ +....+...++..++| T Consensus 377 -~~t~NE~----R~~~g~~pi~ggD~~~~~~n~~-~~~~-~~~----------~~~k~~~~~~~~~~~~ 428 (432) T protein:vir:10 377 -FLKPNEA----RSKEDLPPEAGGDRLLVNGNML-PIDM-AGQ----------AYLKGGDTNGEVSKEG 428 (432) T ss_pred -CcCHHHH----HHHhCCCCCCCCCeEeeccccc-chhh-ccc----------cccCCCCCCCCCCCCC Confidence 1122222 12234332 011111000 0000 000 0001112223333344 No 159 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=22.11 E-value=2.5 Score=18.40 Aligned_cols=385 Identities=11% Similarity=0.007 Sum_probs=144.9 Q ss_pred HHHHHHHHH--hhhh--HHH---HHHHHHHhhcccccCCCCCCccccccccccHHHHHHHHHHHHHHhhcCCCCCceecC Q lcl|NC_020414. 16 PKLWEKFSK--KRSP--YLD---RAKHFAKLTLPYLMNNKGDNETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVD 88 (515) Q Consensus 16 ~~r~~~lk~--~R~~--~e~---~w~e~~~~~~P~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~s~ltpp~~~WFrl~ 88 (515) -+.++.+.+ +|.. ... .-..+.++.-. ...+-.-.....+...+..+|-++-|.-++. -||--.. T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~---~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~-----l~~~~~~ 72 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGI---SPSTISVKGKNALKVATVFACIKILSESVSK-----LPLKIYQ 72 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHHHHHhcC---CCCcceechhhhhccHHHHHHHHHHHHhhcc-----CceEEEE Confidence 122222221 1211 110 00111222100 0001000111234444444333333333333 2443222 Q ss_pred CChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHH-hc----CCHHHHHHHHHHHHhhCceEEEEeCCC---cEEEEEc- Q lcl|NC_020414. 89 LTAKGEKVLDDRGLKKTQLATIFARVETTAMKALE-QR----QFRPAIVEVFKHLIVAGNCLLYKPSKG---AMSAVPM- 159 (515) Q Consensus 89 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~d~~~---~~r~~pl- 159 (515) -.+....+ ..+. .+...|+ +- +.+.=+..++.++..+||+.+++..+. ....||+ T Consensus 73 ~~~~~~~~---------~~~~-------~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~ 136 (429) T protein:vir:10 73 EDEYGIQR---------GTKH-------YLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPID 136 (429) T ss_pred ecCCceee---------cccc-------HHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEc Confidence 11111000 0111 1222232 12 344456677888888999999875432 1344555 Q ss_pred -ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccC Q lcl|NC_020414. 160 -HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKEN 238 (515) Q Consensus 160 -~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~es 238 (515) ..+.+..|..|.+..-++ +| +.+..+|... T Consensus 137 ~~~v~v~~~~~~~~~~~~~---------------------------------~~------------~~~~~~g~~~---- 167 (429) T protein:vir:10 137 ASKVTVYIDDVGLLNSKTK---------------------------------MW------------YVVNTGGQQR---- 167 (429) T ss_pred CceeEEEEcCcccccccce---------------------------------EE------------EEEccCCeEE---- Confidence 455555565554321111 11 1111111110 Q ss_pred CcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCC-------- Q lcl|NC_020414. 239 RIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNS-------- 310 (515) Q Consensus 239 gy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~-------- 310 (515) .|+. --+++.|.....+..||.||..-+...+.......+.......-...|..++.-++.++++..... T Consensus 168 ~~~~--~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~ 245 (429) T protein:vir:10 168 VLKP--EEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMS 245 (429) T ss_pred EEcc--ccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHh Confidence 1211 135677766666779999999999999999999999988888888888877665555554422110 Q ss_pred ---C-Cc--ceecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhc-cCCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_020414. 311 ---G-TG--EVITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMT-RRDAERVTAVEIQRDALEIEQNM 381 (515) Q Consensus 311 ---~-~g--~~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~-~~~~~~~TAtEi~~r~~E~~~~L 381 (515) . .| .++++.. .+.++.+ +..+.+. .+..+..++.|-.+|-.. .+. ..++..-+++|.... =....| T Consensus 246 ~g~~n~~~~~vl~~g~-~~~~l~~-~~~d~q~-~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~~~--f~~~~l 320 (429) T protein:vir:10 246 SGLQNSHRIALMPVGY-QFQPISL-NMSDAQF-LENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQ--FYTDTL 320 (429) T ss_pred ccccccCceeecCCCc-eEEEccC-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH--HHHHHH Confidence 0 01 1122111 2233322 2234553 344456677888888321 121 122222343433111 122233 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeee-hHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHH Q lcl|NC_020414. 382 GGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTG-IEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYM 460 (515) Q Consensus 382 Gpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~-l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~ 460 (515) -|....+.++|-.-|+ ++ .+.-...+++. ++.|.|.--.+.....-..++ .. .+-++++ T Consensus 321 ~P~~~~ie~~ln~kl~--------~~--~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~--~G-------~~T~NE~- 380 (429) T protein:vir:10 321 QATLTMYEQEMTYKLF--------LD--SELDKGFYSKFNVDAILRADIKTRYEAYRTGIQ--GG-------FLKPNEA- 380 (429) T ss_pred HHHHHHHHHHHHHhhc--------Ch--hhcCCCcEEEeechhhhcCCHHHHHHHHHHHHh--CC-------CcCHHHH- Confidence 3444444333322221 11 11111112222 223322211111111111111 00 1122222 Q ss_pred HHHHHhcCCch-----hccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 461 DWVRGQISAEL-----PFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 461 ~~~a~~~Gvp~-----~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) -+.+|.|+ .++.+..-+ -+ + ..- .....-+.-.++.-++| T Consensus 381 ---R~~~gl~p~~ggD~~~~~~n~~-~~-d----~~~------~~~~k~g~~~~~~~~~~ 425 (429) T protein:vir:10 381 ---RSKEDLPPEAGGDRLLVNGNML-PI-D----MAG------QAYLKGGDTNGEVSKEG 425 (429) T ss_pred ---HHHhCCCCCCCcCeeeeccccc-ch-h----hcc------ccccCCCCCCCCCCCCC Confidence 12234432 111111000 00 0 000 00000111223333333 No 160 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=21.84 E-value=2.5 Score=18.37 Aligned_cols=389 Identities=11% Similarity=0.062 Sum_probs=141.5 Q ss_pred cHHHHHHHHHHHHHhhhhHHHHHHHHHHhhcccc-------cCCCCCCccccccccccHH-HHHHHHHHHHHHhhcCCCC Q lcl|NC_020414. 11 QRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYL-------MNNKGDNETSQNGWQGVGA-QATNHLANKLAQVLFPAQR 82 (515) Q Consensus 11 ~~~~l~~r~~~lk~~R~~~e~~w~e~~~~~~P~~-------~~~~~~~~~~~~~~dst~~-~a~~~Laa~l~s~ltpp~~ 82 (515) =++...+++.++++...+|... .++ ...|.. ....+..-.........+. .|++.+|+.+.+ - T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~--~~s-~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~------l 71 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGV--PIS-LTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIAT------L 71 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCC--ccc-CCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhh------C Confidence 1222334444444433333211 000 000100 0011100001112222333 355555554432 2 Q ss_pred CceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHH-hc----CCHHHHHHHHHHHHhhCceEEEEeCCCc--EE Q lcl|NC_020414. 83 SFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALE-QR----QFRPAIVEVFKHLIVAGNCLLYKPSKGA--MS 155 (515) Q Consensus 83 ~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~dl~~~G~~~l~~d~~~~--~r 155 (515) ||.-....+..-.. .+ .+..+...|. +- +.+.=....+.++...||+.+++..+.+ .. T Consensus 72 p~~~~~~~~~g~~~---------~~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~g~~~~ 136 (437) T protein:vir:10 72 PLNLYQTKPDGTRV---------LA------KQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSAGVLIG 136 (437) T ss_pred ceeEEEEcCCCcee---------ec------cccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEE Confidence 55422211110000 00 0111222233 23 3444566667777889999988765443 33 Q ss_pred EEEc--ceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCee Q lcl|NC_020414. 156 AVPM--HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIP 233 (515) Q Consensus 156 ~~pl--~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~ 233 (515) .||| ..+.+..+.+|.+- |. |...+|.. T Consensus 137 L~~l~p~~v~i~~~~~g~~~--------------------------------------y~------------~~~~~g~~ 166 (437) T protein:vir:10 137 LELMLPQRTTVKRLTSGALQ--------------------------------------YT------------YRNVDGTV 166 (437) T ss_pred EEEEcCcceEEEECCCCeEE--------------------------------------EE------------EEecCceE Confidence 4555 33444443333211 10 11111211 Q ss_pred ecccCCcccccCcEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCC-- Q lcl|NC_020414. 234 VGKENRIKAEKLPFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSG-- 311 (515) Q Consensus 234 i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~-- 311 (515) . .|..+ =+++.|....+ ..||.||..-+...+.....+.+.......-...|-.++.-++.++++...... T Consensus 167 ~----~~~~~--dIih~r~~~~d-~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~ 239 (437) T protein:vir:10 167 S----TLAED--DVFHVRGFSLD-GLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQILQKEKRAEIRTD 239 (437) T ss_pred E----EEccc--cEEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHH Confidence 0 01111 13555544333 489999999999999888888888888878888887777666666655432111 Q ss_pred -----C-----cc--eecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhccCCCCCCCHHHHHHHHHH- Q lcl|NC_020414. 312 -----T-----GE--VITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTRRDAERVTAVEIQRDALE- 376 (515) Q Consensus 312 -----~-----g~--~~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~~~~~~~TAtEi~~r~~E- 376 (515) . |. ++++.. +..++.. +..+.+. .+..+-.+..|-.+|-.. .+...+....+..-+.+.... T Consensus 240 ~~~~~~g~~nag~~~vl~~g~-~~~~l~~-~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f 316 (437) T protein:vir:10 240 LAEQFGGAMQAGKTMVLEAGM-KYQAITM-NPGDVQL-LETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLGF 316 (437) T ss_pred HHHHhcCccccCcceeccCCc-eEEeccC-ChhhHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHH Confidence 0 11 111111 1222221 2234443 333455567788888321 222222222333334333322 Q ss_pred HHHHhhhhHHHHHHHHHHHHHHHHHHhcCCCCChhhccceeeee-hHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCC Q lcl|NC_020414. 377 IEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTG-IEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIR 455 (515) Q Consensus 377 ~~~~LGpv~~rl~~E~l~Pli~r~~~~~~~~~p~~~~~~~~v~~-l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id 455 (515) ....|.|.+.++..||-.-|+ + +.+. .-.+++. ++.|.|+--.+........+.. ..+. T Consensus 317 ~~~tl~P~~~~ie~~l~~kll--------~--~~e~-~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~---------G~~T 376 (437) T protein:vir:10 317 LTFTLRPWLTRIEQAARRSLL--------R--PGER-DQFYAEFSVEGLLRADSAGRAAFYSTMTQN---------GLMT 376 (437) T ss_pred HHHHHHHHHHHHHHHHHhhcc--------C--cccc-CceEEEEechhhhccCHHHHHHHHHHHHhC---------CCcC Confidence 233355555555544432221 1 1111 1112221 2333332111111111111110 0111 Q ss_pred HHHHHHHHHHhcCCch-h----cc-CCH--HHHHHHHHHHHHHHHHHHHHHHhhhhccchhhh-hhccC Q lcl|NC_020414. 456 WGDYMDWVRGQISAEL-P----FL-KSE--EEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQ-EMKEG 515 (515) Q Consensus 456 ~d~~~~~~a~~~Gvp~-~----~i-rs~--eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~-~~~~~ 515 (515) ++++ -+.+|.|+ . ++ .+. .-+...-++ ....++..+-.++.++ +...| T Consensus 377 ~NE~----R~~~gl~pi~gg~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~ 433 (437) T protein:vir:10 377 RDEC----RAKENLPPMGGNAAVLTVQSALLPIDKLGEH--------TTATAAQDALKAWLYQEEKTRA 433 (437) T ss_pred HHHH----HHHhCCCCCCCCcceEeecCcccchhhccCc--------CCCcchhccccccCCCCCCCCc Confidence 2221 12233322 0 00 000 000000000 0000000000000000 11111 No 161 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=21.80 E-value=2.5 Score=18.36 Aligned_cols=436 Identities=10% Similarity=0.005 Sum_probs=161.3 Q ss_pred CCCcccccc-ccHHHHHHHHHHHHHhhhhHHHHHH-HHHHhhcccccCCCC---CC----cccccc--ccccHHHHHHHH Q lcl|NC_020414. 1 MQDTILEYG-GQRSKIPKLWEKFSKKRSPYLDRAK-HFAKLTLPYLMNNKG---DN----ETSQNG--WQGVGAQATNHL 69 (515) Q Consensus 1 ~~~~~~~~~-~~~~~l~~r~~~lk~~R~~~e~~w~-e~~~~~~P~~~~~~~---~~----~~~~~~--~dst~~~a~~~L 69 (515) |-==+---| +.-+...+|.. .+..+..|+.--. .-..+--+....+.. +. .+++.+ -++.+..+++.+ T Consensus 1 mn~~dr~i~~~sP~~~~~R~~-ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~ 79 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLR-SRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKL 79 (502) T ss_pred CchHhhHHhhcChHHHHHHHh-hHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 110000000 00011111111 1111222211000 000000000000000 00 011112 367888999999 Q ss_pred HHHHHH--hhcCCCCCceecCCChHHHhhhhccchhHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCceEEE Q lcl|NC_020414. 70 ANKLAQ--VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLY 147 (515) Q Consensus 70 aa~l~s--~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~dl~~~G~~~l~ 147 (515) ++.+++ +++|..++=..-...+..++ +.....-+.|.+.| .+-.+.+||.....++..++.-|-+++. T Consensus 80 ~~nvVG~ggi~~~~~~~~~~~~~~~~~~-----~~ie~~w~~Wa~~~-----D~~g~~~f~~~q~l~~r~~~~dGE~f~~ 149 (502) T protein:vir:79 80 EERVVGKNGIIVEPHPVLRNGAIARDLA-----AEIRTRWSEWSVSP-----EVTGQFTRPMLERLMLRTWLRDGEVFAQ 149 (502) T ss_pred HHhhccCCceeeeeccCCCChhHHHHHH-----HHHHHHHHHhhcCc-----CccccCCHHHHHHHHHHHHHhCCceEEE Confidence 999996 56665444111100011111 01112223333222 2334678999999999999999998764 Q ss_pred E--eCCCcE---EEEEcceEEEeeCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCC Q lcl|NC_020414. 148 K--PSKGAM---SAVPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGF 222 (515) Q Consensus 148 ~--d~~~~~---r~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~ 222 (515) . ++.... ..||+.==.|. .+.|+... . +.-.|..-|+.+..|. T Consensus 150 ~~~~~~~~~~~g~~~~l~lq~ie-----------------pd~l~~~~-----------~----~~~~i~~GVe~d~~Gr 197 (502) T protein:vir:79 150 MVSGRINSLTPSAGVHFWLEALE-----------------PDFIPMTS-----------D----ESNRLNQGVFVDDWGR 197 (502) T ss_pred EeecccCccCCCcccceEEEEec-----------------chhcCCCC-----------C----CCCeeEeeeEECCCCc Confidence 3 221110 01121100010 11110000 0 0012555667766665 Q ss_pred eEEEEEeCCeeecccCCcccccCc---EEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeec-C Q lcl|NC_020414. 223 WKINQSADDIPVGKENRIKAEKLP---FIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIR-P 298 (515) Q Consensus 223 ~~~~~e~~~~~i~~esgy~~~~~P---~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~-~ 298 (515) ..-|+-.. .|.+.-++..-...| +++.-....+|..=|.+..--+|..++.|+.+..+.+.++..++.....+. + T Consensus 198 ~~aY~i~~-~hPgd~~~~~~~rvpA~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~ 276 (502) T protein:vir:79 198 PEKYLVYK-SRPVSGRQMETKEVDAERMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKG 276 (502) T ss_pred eEEEEEee-cCCCCCcccceeEechhheEEeecccCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecC Confidence 44333221 111110000001122 455555567899999999999999999999999999999999888766554 2 Q ss_pred ccccC-hh--------hccCCCCcceecC-Cc-ccccccccC-CccchHHHHHHHHHHHHHHHHHH-H-HHhhccCCCCC Q lcl|NC_020414. 299 GSQTD-VD--------HFVNSGTGEVITG-VE-EDIHIVQLG-KYADLTPISAVLEVYTRRIGVIF-M-METMTRRDAER 364 (515) Q Consensus 299 ~g~~~-~~--------~~~~~~~g~~~~g-~~-~~v~~~~~~-~~~~l~~~~~~i~~~~~rI~~af-l-~~~l~~~~~~~ 364 (515) ++-.. +. ....-.+|.+++. .+ .++....-. ..++|. .-...+...|..++ + +..+. .|-.. T Consensus 277 ~~~~~~~~~~~~~~~~~~~~l~pG~i~~~L~pGe~i~~~~p~~p~~~~~---~f~~~~lr~iaaglGi~ye~lt-~D~s~ 352 (502) T protein:vir:79 277 DGQSYEPDGNGSKENERELTIQPGIIYDDLKPGEEIGMVKSDRPNPNLE---TFRNGQLRAVAAGSRLSFSSTA-RNYNG 352 (502) T ss_pred CCcccccccCCCCCccccccccCCccccccCCCceeeeeCCCCCCCCHH---HHHHHHHHHHHhhcCCCHHHHh-ccccc Confidence 21100 00 0111224444332 22 233333222 223443 22233333344443 0 11111 12111 Q ss_pred CCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHH-----HhcCCCCChh----hcccee----eeehHHHHHHHHHH Q lcl|NC_020414. 365 VTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGL-----QEAGDSFTSE----LVDPVI----VTGIEALGRMAELD 431 (515) Q Consensus 365 ~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~-----~~~~~~~p~~----~~~~~~----v~~l~~l~ra~~~~ 431 (515) +=.-++.-..|.-..+--.=..|...|+.|+..+.+ .|.++.|... ..+... ...++|+--++... T Consensus 353 -nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~ 431 (502) T protein:vir:79 353 -TYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWK 431 (502) T ss_pred -hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHH Confidence 222222222333222222222344467777766643 3555533211 112221 12345544333221 Q ss_pred -HHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHH------HhcCCchhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_020414. 432 -KLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVR------GQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAV 504 (515) Q Consensus 432 -~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a------~~~Gvp~~~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~ 504 (515) .|..-+.....+ -...=.|+++.++.++ +.+|++... ..+.... T Consensus 432 ~~i~~Gl~t~~~~-----~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~------------------------~~~~~~~ 482 (502) T protein:vir:79 432 IQIRGGAATESDW-----VRAGGRNPDDVKRRRKAEIDENRKLDLVFDT------------------------DPASDKG 482 (502) T ss_pred HHHHcCCCCHHHH-----HHHcCCCHHHHHHHHHHHHHHHHHcCCCCCC------------------------CCCCCCC Confidence 111000000000 0000112222221111 111221110 0000000 Q ss_pred cch----------hhhhhcc Q lcl|NC_020414. 505 PGV----------IQQEMKE 514 (515) Q Consensus 505 ~~~----------~~~~~~~ 514 (515) ... -+++.+| T Consensus 483 ~~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 483 GSSAATKRQEPQHTDDQSEE 502 (502) T ss_pred CCCCCCCCCCCCCCCCCCCC Confidence 000 0111111 No 162 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=21.47 E-value=2.6 Score=18.31 Aligned_cols=369 Identities=12% Similarity=0.017 Sum_probs=138.7 Q ss_pred HHHHHHHHhhhhHH-HHHHHHHHhhcccccCCCCCCccccccccccHH-HHHHHHHHHHHHhhcCCCCCceecCCChHHH Q lcl|NC_020414. 17 KLWEKFSKKRSPYL-DRAKHFAKLTLPYLMNNKGDNETSQNGWQGVGA-QATNHLANKLAQVLFPAQRSFFRVDLTAKGE 94 (515) Q Consensus 17 ~r~~~lk~~R~~~e-~~w~e~~~~~~P~~~~~~~~~~~~~~~~dst~~-~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~ 94 (515) --|..+..+|+... ..+-+..+.+-.......+..-.........+. .|++.+|+.+. +-||--....+... T Consensus 1 ~~f~~~f~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~~l~~~~v~~~i~~Ia~~iA------~~p~~~~~~~~~~~ 74 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTTPAELAEAIGLSYDTYTGKRISSQRAMRLTAVYSCVRVLAESVG------MLPCSLYKISGTLK 74 (413) T ss_pred CccchhhccCccCCccchHHHHHhhhcCcccccCceechhhhhccHHHHHHHHHHHHhhh------hCceEEEEecCCcc Confidence 33344433333221 122233332222111111111011111222233 34444444332 22332222211110 Q ss_pred hhhhccchhHHHHHHHHHHHHHHHHHHHH-h----cCCHHHHHHHHHHHHhhCceEEEEeCCCc--EEEEEc--ceEEEe Q lcl|NC_020414. 95 KVLDDRGLKKTQLATIFARVETTAMKALE-Q----RQFRPAIVEVFKHLIVAGNCLLYKPSKGA--MSAVPM--HHYVVN 165 (515) Q Consensus 95 ~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~----snf~~~~~~~~~dl~~~G~~~l~~d~~~~--~r~~pl--~~y~v~ 165 (515) . .+ .+..+...|+ + -+.+.-+...+.++...||+.+++..+.+ ...||| ..+-+. T Consensus 75 ~----------~~------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~g~~~~L~~l~~~~v~~~ 138 (413) T protein:vir:48 75 T----------RV------VDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKALGEVVELLPIDPGCVEPK 138 (413) T ss_pred e----------ee------cccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeCCCcEEEEEEEcCceEEEE Confidence 0 00 0111112222 2 24455566777788889999988754433 334554 333344 Q ss_pred eCCCCCeeEEEEEEEecHHHHHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCcccccC Q lcl|NC_020414. 166 RDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRIKAEKL 245 (515) Q Consensus 166 ~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy~~~~~ 245 (515) .|..|.+ +| ....+ .+.. .. |+.++ T Consensus 139 ~~~~~~~--~y------------------------------------~~~~~--~g~~---------~~-----~~~~e- 163 (413) T protein:vir:48 139 LNSQWQP--VY------------------------------------QVTFP--DGSV---------DV-----LTQDE- 163 (413) T ss_pred EcCCceE--EE------------------------------------EEEec--CceE---------EE-----Ecccc- Confidence 4443321 11 00000 1100 00 11111 Q ss_pred cEEEEeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCceeecCccccChhhccCCC------------Cc Q lcl|NC_020414. 246 PFIPLTWKRSYGEDWGRPLVEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGSQTDVDHFVNSG------------TG 313 (515) Q Consensus 246 P~~~~Rw~~~~g~~YGrgp~~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~~~~g~~~~~~~~~~~------------~g 313 (515) +++.|-... +..||.||...+...+.....+.+.......-...|..++.-++.++++...... .| T Consensus 164 -vih~~~~~~-d~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~~~~~~~~~~~~g~~n~g 241 (413) T protein:vir:48 164 -IWHVRTLTL-DGLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYERLKKDFEERHTGLGNAH 241 (413) T ss_pred -EEEecCcCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhcCccccC Confidence 233443322 3479999999999999988888888888777778887777666655554321100 11 Q ss_pred ce--ecCCcccccccccCCccchHHHHHHHHHHHHHHHHHHHHH--hhccCC-CCCCCHHHHHHHHHHHHHHhhhhHHHH Q lcl|NC_020414. 314 EV--ITGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTRRD-AERVTAVEIQRDALEIEQNMGGVYSLF 388 (515) Q Consensus 314 ~~--~~g~~~~v~~~~~~~~~~l~~~~~~i~~~~~rI~~afl~~--~l~~~~-~~~~TAtEi~~r~~E~~~~LGpv~~rl 388 (515) .+ +++.. ++.++.. +..+.+. .+..+..+..|-.+|-.. .+...+ +..-++++.. ..+ T Consensus 242 ~~~vl~~g~-~~~~l~~-~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~--------------~~f 304 (413) T protein:vir:48 242 RPMILEMGL-DWKSMAL-NAEDSQF-LETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELG--------------LGF 304 (413) T ss_pred cceecCCCc-eEEeccC-ChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHH--------------HHH Confidence 11 11111 2233322 1234442 345566677788887321 122222 2222333332 123 Q ss_pred HHHHHHHHHHHH---HHhcCCCCChhhccceeeee-hHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHH Q lcl|NC_020414. 389 AMTMQTPIAMWG---LQEAGDSFTSELVDPVIVTG-IEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVR 464 (515) Q Consensus 389 ~~E~l~Pli~r~---~~~~~~~~p~~~~~~~~v~~-l~~l~ra~~~~~l~~~~~~v~~~a~~~p~~~d~id~d~~~~~~a 464 (515) ...-+.|++.++ +...+.. +.+ ....+++. ++.|.|+--.......-..++. ..+.++++ - T Consensus 305 ~~~~i~P~~~~ie~~l~~~L~~-~~~-~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~---------g~~T~NE~----R 369 (413) T protein:vir:48 305 INYSLVPYLTRIEQRINTGLVR-ESK-QGKFYAKFNAGALLRGDMKSRFEAYATGINW---------GIYSPNDC----R 369 (413) T ss_pred HHHHHHHHHHHHHHHHHhhccC-ccc-cCCeEEEEechhhhccCHHHHHHHHHHHHhC---------CCcCHHHH----H Confidence 344455555544 2222221 111 11223332 3334332111111111111110 01112221 1 Q ss_pred HhcCCchh-----ccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhccC Q lcl|NC_020414. 465 GQISAELP-----FLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) Q Consensus 465 ~~~Gvp~~-----~irs~eev~~~rq~~~~~~q~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) +.+|.|+- ++.+..-+ .+. .+++-.+.+...| T Consensus 370 ~~~g~~p~~ggD~~~~~~n~~------------------~~~-~~~~~~~~~~~~~ 406 (413) T protein:vir:48 370 DLEDMNPRPGGDVYLTPMNMT------------------TSP-SAGDDNGKKKESG 406 (413) T ss_pred HHhCCCCCCCcceeecccccc------------------ccc-cccccCCCCCCCC Confidence 23444321 11111000 000 0111111111222 No 163 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=21.31 E-value=2.6 Score=18.29 Aligned_cols=308 Identities=9% Similarity=0.065 Sum_probs=120.8 Q ss_pred hcccccCCCCC--Cccccc--c--c-cccHHHHHHHHHHHHHHhhcCCCCCceecCCChHHHhhhhccchhHHH-HHHHH Q lcl|NC_020414. 40 TLPYLMNNKGD--NETSQN--G--W-QGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQ-LATIF 111 (515) Q Consensus 40 ~~P~~~~~~~~--~~~~~~--~--~-dst~~~a~~~Laa~l~s~ltpp~~~WFrl~~~d~~~~~~~~~~~~~~~-v~~~L 111 (515) .-..+...... .....+ - | |.++...+ ..++..+-.+-.|++--.+-..+.++......... +. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~-----~y~~~~~~~~~~~~epp~~~~~la~l~~~~~~h~~~i~--- 72 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLNEISASPAL-----DYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGILH--- 72 (345) T ss_pred CCCCccccchhhcccCcceeEEeecCCcccccch-----hhhhhhhcCCccccCCCCCHHHHHHHhhccccccccee--- Confidence 11111111110 000001 1 1 33333211 23333333455677644444444443221111100 10 Q ss_pred HHHHHHHHHHHHhcC---CHHHHHHHHHHHHhhCceEEEEeCCC---cEEEEEcceEEEeeCCCCCeeEEEEEEEecHHH Q lcl|NC_020414. 112 ARVETTAMKALEQRQ---FRPAIVEVFKHLIVAGNCLLYKPSKG---AMSAVPMHHYVVNRDTNGDLMDVILLQEKALRT 185 (515) Q Consensus 112 ~~ve~~~~~~l~~sn---f~~~~~~~~~dl~~~G~~~l~~d~~~---~~r~~pl~~y~v~~d~~G~vd~i~r~~~~t~~q 185 (515) +.+.+.....+-| -...+.++..|+.++|||.+++..+. .+..+||...++.+..+|...-.++... T Consensus 73 --~k~n~l~~~~~Pn~~lt~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~d~~~~~~~~~~~----- 145 (345) T protein:vir:37 73 --SRANMVSSLYEGGKALSRMDMRALCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVRKDGGYSYLMKKSL----- 145 (345) T ss_pred --eechHHHhhccCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEEeCCeeEEEEEeE----- Confidence 0011111111112 13345667788889999998875432 2556776543343322221111110000 Q ss_pred HHHHhcccccchhhhccCCCcccEEEEEEEEEcCCCCeEEEEEeCCeeecccCCcccccCcEEEEeeeecCCCccccchH Q lcl|NC_020414. 186 FDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKENRIKAEKLPFIPLTWKRSYGEDWGRPLV 265 (515) Q Consensus 186 l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~e~~~~~i~~esgy~~~~~P~~~~Rw~~~~g~~YGrgp~ 265 (515) ...++... .|+.++ ++..|.....+..||.+|. T Consensus 146 -----------------------------------------~~~~g~~~----~~~~~d--Vihir~~~~~~~~~Gls~~ 178 (345) T protein:vir:37 146 -----------------------------------------YDTAQEIY----RYDAKD--IIFIKLYDPMQQVYGSPDY 178 (345) T ss_pred -----------------------------------------ecCCceEE----EEcccc--EEEecCCCCCCCcccccHH Confidence 00011110 122222 4556644445678999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCceee-cCccccChhhc-------c---CCCCc-c-e--ecC-CcccccccccC Q lcl|NC_020414. 266 EDYSGDLFVIQFLSEAVARGAALMADIKYLI-RPGSQTDVDHF-------V---NSGTG-E-V--ITG-VEEDIHIVQLG 329 (515) Q Consensus 266 ~~~l~d~k~L~~l~~~~~~~~~~a~~p~~l~-~~~g~~~~~~~-------~---~~~~g-~-~--~~g-~~~~v~~~~~~ 329 (515) .-++..+-.-+..++-..+.-.-...|..++ .++..++.++. . ..+++ . + .|+ ..+.+...+++ T Consensus 179 ~~a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~d~~l~~e~~~~lk~~~~~~~g~~n~~~~~i~~p~g~~~G~~~~pls 258 (345) T protein:vir:37 179 VGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEEIARKISESKGVGNFRSMFVNIANGHPDGLKVIPIG 258 (345) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCcceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceEEEcCCCcccceEEEEcc Confidence 9888887666555555555444455565443 34444443321 1 11111 1 1 222 23344433333 Q ss_pred C-ccchHHHHHHHHHHHHHHHHHHHHH--hhc-cCC--CCCCCHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_020414. 330 K-YADLTPISAVLEVYTRRIGVIFMME--TMT-RRD--AERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQE 403 (515) Q Consensus 330 ~-~~~l~~~~~~i~~~~~rI~~afl~~--~l~-~~~--~~~~TAtEi~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~~ 403 (515) . ..+.+. .+..+-.++.|-.+|=.- .+. ..+ +..-++++... .=....|.|...++.+++- . T Consensus 259 ~~~~d~qf-~e~k~~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~~~~--~f~~~~l~P~~~~ie~~ln---------~ 326 (345) T protein:vir:37 259 DTGTKDEF-ANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYRE--VYHYDEVMPLQEIIAETIN---------Q 326 (345) T ss_pred CChhHHHH-HHHHHHhHHHHHHHhCCCHHHhCccCCCCCCcccHHHHHH--HHHHHHHHHHHHHHHHHhh---------h Confidence 2 234442 334455677788888211 111 111 12223444322 2222334454444444331 1 Q ss_pred cCCCCChhhccceeeeeh-HHHHH Q lcl|NC_020414. 404 AGDSFTSELVDPVIVTGI-EALGR 426 (515) Q Consensus 404 ~~~~~p~~~~~~~~v~~l-~~l~r 426 (515) .+++++.. .+.+. .-|+| T Consensus 327 -~~~~~~~~----~i~F~~~~L~~ 345 (345) T protein:vir:37 327 -DPEIKNLL----KIKFREQNFAK 345 (345) T ss_pred -hccCCCcc----eEEecchhhcC Confidence 23333221 11211 11222 Done!