Query lcl|NC_018086.1_cdsid_YP_006488738.1 [gene=efb11] [protein=phage portal protein] [protein_id=YP_006488738.1] [location=7619..9154] Match_columns 511 No_of_seqs 146 out of 531 Neff 9.6 Searched_HMMs 1612 Date Thu Nov 7 13:10:24 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_11 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_11_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:106571 Length: 499 100.0 2.6E-99 2E-102 561.0 54.4 478 1-504 1-499 (499) 2 protein:vir:3964 Length: 453 # 100.0 1.9E-97 1E-100 550.9 51.8 446 12-478 1-453 (453) 3 protein:vir:5961 Length: 503 # 100.0 9.2E-97 6E-100 547.1 52.9 486 1-497 1-503 (503) 4 protein:vir:3609 Length: 452 # 100.0 5.9E-97 4E-100 548.1 51.8 446 12-478 1-452 (452) 5 protein:vir:1236 Length: 483 # 100.0 5.8E-97 4E-100 548.2 50.3 466 1-494 1-483 (483) 6 protein:vir:9306 Length: 511 # 100.0 6.4E-96 3.9E-99 542.5 52.8 479 1-494 13-511 (511) 7 protein:vir:94805 Length: 492 100.0 1.5E-96 1E-99 545.9 48.6 466 1-494 10-492 (492) 8 protein:vir:102330 Length: 451 100.0 4.3E-96 2.7E-99 543.4 50.9 427 28-467 1-451 (451) 9 protein:vir:103951 Length: 511 100.0 1.2E-95 7.4E-99 541.0 53.3 479 1-490 13-511 (511) 10 protein:vir:79043 Length: 479 100.0 5.9E-96 3.7E-99 542.7 51.3 461 8-479 1-479 (479) 11 protein:vir:97171 Length: 512 100.0 1.1E-95 6.6E-99 541.3 52.6 477 2-494 1-512 (512) 12 protein:vir:105461 Length: 470 100.0 6.5E-96 4E-99 542.4 51.1 441 28-478 1-470 (470) 13 protein:vir:96366 Length: 511 100.0 1.2E-95 7.2E-99 541.0 52.4 476 1-490 13-511 (511) 14 protein:vir:78805 Length: 511 100.0 1.2E-95 7.2E-99 541.0 52.4 476 1-490 13-511 (511) 15 protein:vir:99781 Length: 511 100.0 1E-95 6.2E-99 541.4 51.9 479 1-490 13-511 (511) 16 protein:vir:97336 Length: 492 100.0 6.1E-96 3.8E-99 542.6 50.2 466 1-494 10-492 (492) 17 protein:vir:96240 Length: 511 100.0 2.6E-95 1.6E-98 539.1 53.1 479 1-490 13-511 (511) 18 protein:vir:2732 Length: 501 # 100.0 2.4E-95 1.5E-98 539.4 51.9 472 1-492 1-501 (501) 19 protein:vir:4898 Length: 502 # 100.0 3E-95 1.8E-98 538.8 52.2 473 1-505 1-502 (502) 20 protein:vir:102950 Length: 471 100.0 2.3E-95 1.4E-98 539.4 50.6 438 28-475 1-471 (471) 21 protein:vir:94498 Length: 474 100.0 6.6E-95 4.1E-98 536.9 51.3 463 1-485 5-474 (474) 22 protein:vir:97447 Length: 474 100.0 6.6E-95 4.1E-98 536.9 51.3 463 1-485 5-474 (474) 23 protein:vir:733 Length: 453 # 100.0 9.8E-95 6E-98 536.0 49.9 441 1-473 1-453 (453) 24 protein:vir:93747 Length: 472 100.0 9.7E-95 6E-98 536.0 49.7 462 1-494 1-472 (472) 25 protein:vir:95899 Length: 474 100.0 1.5E-94 9.3E-98 535.0 50.1 461 1-488 5-474 (474) 26 protein:vir:96266 Length: 474 100.0 1.5E-94 9.3E-98 535.0 50.1 461 1-488 5-474 (474) 27 protein:vir:9871 Length: 429 # 100.0 2.7E-94 1.7E-97 533.5 51.2 424 28-472 1-429 (429) 28 protein:vir:96494 Length: 501 100.0 3.3E-94 2E-97 533.1 51.1 472 1-505 1-501 (501) 29 protein:vir:105292 Length: 478 100.0 2.9E-93 1.8E-96 527.9 51.5 464 2-484 1-478 (478) 30 protein:vir:107112 Length: 478 100.0 3.3E-93 2E-96 527.6 51.6 463 2-484 1-478 (478) 31 protein:vir:105889 Length: 474 100.0 3.5E-93 2.2E-96 527.5 50.7 449 14-483 1-474 (474) 32 protein:vir:94101 Length: 474 100.0 3.5E-93 2.2E-96 527.5 50.7 449 14-483 1-474 (474) 33 protein:vir:95113 Length: 474 100.0 4.7E-93 2.9E-96 526.8 51.3 462 1-485 5-474 (474) 34 protein:vir:99522 Length: 470 100.0 1.3E-92 7.8E-96 524.4 52.6 457 1-478 1-470 (470) 35 protein:vir:78083 Length: 537 100.0 8.8E-93 5.5E-96 525.3 51.5 486 14-511 1-530 (537) 36 protein:vir:94546 Length: 506 100.0 6E-93 3.8E-96 526.2 49.9 465 1-507 1-506 (506) 37 protein:vir:96839 Length: 474 100.0 2.7E-92 1.7E-95 522.6 50.3 462 2-480 1-474 (474) 38 protein:vir:106639 Length: 481 100.0 1.6E-91 1E-94 518.3 51.9 459 1-484 1-481 (481) 39 protein:vir:96179 Length: 468 100.0 1.7E-91 1E-94 518.3 50.6 455 2-480 1-468 (468) 40 protein:vir:9922 Length: 489 # 100.0 1.2E-90 7.5E-94 513.6 50.2 455 8-483 1-489 (489) 41 protein:vir:95806 Length: 440 100.0 1.8E-90 1.1E-93 512.6 48.6 422 36-473 1-440 (440) 42 protein:vir:4223 Length: 486 # 100.0 8.2E-85 5.1E-88 481.6 45.0 457 16-497 1-486 (486) 43 protein:vir:2427 Length: 485 # 100.0 2.2E-84 1.4E-87 479.2 45.2 457 14-498 1-485 (485) 44 protein:vir:7768 Length: 484 # 100.0 8.9E-83 5.5E-86 470.4 43.7 454 14-491 1-484 (484) 45 protein:vir:78537 Length: 480 100.0 5.5E-82 3.4E-85 466.1 47.4 455 14-504 1-480 (480) 46 protein:vir:104082 Length: 485 100.0 8.1E-82 5E-85 465.2 46.3 456 16-498 1-485 (485) 47 protein:vir:78227 Length: 480 100.0 1.5E-81 9.1E-85 463.7 46.9 454 27-504 1-480 (480) 48 protein:vir:2341 Length: 488 # 100.0 2E-81 1.2E-84 463.1 46.4 448 22-499 1-488 (488) 49 protein:vir:80680 Length: 441 100.0 1.8E-79 1.1E-82 452.3 44.2 425 25-479 1-441 (441) 50 protein:vir:2500 Length: 501 # 100.0 2.7E-79 1.7E-82 451.3 44.7 473 1-498 1-501 (501) 51 protein:vir:99072 Length: 479 100.0 1.8E-76 1.1E-79 435.9 45.0 449 20-505 1-479 (479) 52 protein:vir:102602 Length: 456 100.0 2.4E-76 1.5E-79 435.2 43.2 432 25-477 1-456 (456) 53 protein:vir:105819 Length: 456 100.0 2.4E-76 1.5E-79 435.2 43.2 432 25-477 1-456 (456) 54 protein:vir:99916 Length: 504 100.0 5.7E-76 3.5E-79 433.1 43.2 470 2-503 1-504 (504) 55 protein:vir:7987 Length: 456 # 100.0 3.1E-75 1.9E-78 429.1 44.4 432 25-477 1-456 (456) 56 protein:vir:98444 Length: 434 100.0 7.6E-70 4.7E-73 399.5 39.8 408 61-487 1-434 (434) 57 protein:vir:8184 Length: 474 # 100.0 8.1E-70 5E-73 399.4 39.7 441 13-478 1-474 (474) 58 protein:vir:9751 Length: 422 # 100.0 2.7E-70 1.7E-73 402.0 37.0 399 28-454 1-422 (422) 59 protein:vir:94742 Length: 409 100.0 3.9E-70 2.4E-73 401.1 37.5 387 28-441 1-409 (409) 60 protein:vir:9568 Length: 410 # 100.0 6.9E-70 4.3E-73 399.7 35.8 388 40-456 1-410 (410) 61 protein:vir:1634 Length: 409 # 100.0 6.5E-69 4E-72 394.4 37.0 387 28-441 1-409 (409) 62 protein:vir:38 Length: 496 # N 100.0 1.2E-59 7.7E-63 343.5 44.8 453 1-473 1-496 (496) 63 protein:vir:80959 Length: 499 100.0 1.3E-56 7.9E-60 327.0 45.4 453 1-473 1-499 (499) 64 protein:vir:1587 Length: 508 # 100.0 2.8E-53 1.7E-56 308.7 44.2 458 3-474 1-508 (508) 65 protein:vir:79703 Length: 505 100.0 8.4E-53 5.2E-56 306.1 45.0 452 3-468 1-505 (505) 66 protein:vir:3028 Length: 500 # 100.0 5.9E-49 3.7E-52 285.0 46.3 455 3-471 1-500 (500) 67 protein:vir:9815 Length: 500 # 100.0 5.9E-49 3.7E-52 285.0 46.3 455 3-471 1-500 (500) 68 protein:vir:4782 Length: 522 # 100.0 7.2E-48 4.5E-51 279.0 42.4 466 3-484 1-522 (522) 69 protein:vir:101494 Length: 527 100.0 7.7E-49 4.8E-52 284.4 36.5 476 1-489 1-527 (527) 70 protein:vir:102239 Length: 527 100.0 8.6E-49 5.4E-52 284.1 36.5 476 1-489 1-527 (527) 71 protein:vir:78907 Length: 518 100.0 8.7E-44 5.4E-47 256.7 42.4 450 3-470 1-518 (518) 72 protein:vir:98883 Length: 517 100.0 1.7E-42 1.1E-45 249.6 43.3 459 3-477 1-517 (517) 73 protein:vir:7430 Length: 563 # 100.0 2.6E-42 1.6E-45 248.5 39.2 486 1-508 1-563 (563) 74 protein:vir:97265 Length: 513 100.0 9.1E-32 5.6E-35 190.8 38.3 459 25-503 1-513 (513) 75 protein:vir:94956 Length: 452 100.0 1.4E-31 8.8E-35 189.7 35.0 423 28-490 1-452 (452) 76 protein:vir:95149 Length: 501 100.0 4.6E-28 2.8E-31 170.5 35.0 438 28-499 1-501 (501) 77 protein:vir:80453 Length: 535 100.0 7.4E-27 4.6E-30 163.8 38.6 474 1-502 1-535 (535) 78 protein:vir:78393 Length: 489 99.9 2.5E-26 1.6E-29 160.9 36.8 438 13-493 1-489 (489) 79 protein:vir:95014 Length: 491 99.9 6.2E-26 3.8E-29 158.8 36.6 438 13-481 1-491 (491) 80 protein:vir:96783 Length: 488 99.9 6.1E-25 3.8E-28 153.3 30.5 422 18-464 1-488 (488) 81 protein:vir:93630 Length: 776 99.8 1.3E-18 8.1E-22 118.6 33.7 499 5-511 1-735 (776) 82 protein:vir:108295 Length: 711 99.7 6.2E-16 3.9E-19 103.9 36.2 487 1-511 1-658 (711) 83 protein:vir:80040 Length: 461 99.7 1.6E-17 1E-20 112.6 27.5 420 1-496 1-461 (461) 84 protein:vir:105619 Length: 772 99.7 2.2E-15 1.3E-18 101.0 38.5 499 3-511 1-741 (772) 85 protein:vir:9950 Length: 714 # 99.7 2E-15 1.3E-18 101.1 37.4 474 1-511 1-642 (714) 86 protein:vir:3296 Length: 714 # 99.7 2E-15 1.3E-18 101.1 37.4 474 1-511 1-642 (714) 87 protein:vir:817 Length: 714 # 99.7 2E-15 1.3E-18 101.1 37.4 474 1-511 1-642 (714) 88 protein:vir:2764 Length: 714 # 99.7 2E-15 1.3E-18 101.1 37.4 474 1-511 1-642 (714) 89 protein:vir:10117 Length: 714 99.7 2E-15 1.3E-18 101.1 37.4 474 1-511 1-642 (714) 90 protein:vir:104437 Length: 714 99.7 8.4E-16 5.2E-19 103.2 33.0 477 1-511 1-642 (714) 91 protein:vir:79538 Length: 502 99.7 2.5E-14 1.5E-17 95.2 40.0 426 28-506 1-502 (502) 92 protein:vir:8846 Length: 705 # 99.6 2.5E-14 1.6E-17 95.1 33.4 463 1-511 1-615 (705) 93 protein:vir:80165 Length: 651 99.6 3.8E-14 2.3E-17 94.2 32.1 479 1-511 1-641 (651) 94 protein:vir:6382 Length: 553 # 99.6 7.8E-13 4.8E-16 87.0 36.1 443 28-501 1-553 (553) 95 protein:vir:96738 Length: 505 99.5 1.6E-12 9.8E-16 85.3 38.0 434 1-506 1-505 (505) 96 protein:vir:5249 Length: 437 # 99.5 5.2E-13 3.2E-16 87.9 34.1 399 34-500 1-437 (437) 97 protein:vir:107742 Length: 537 99.5 1.9E-13 1.2E-16 90.3 29.5 440 1-511 46-536 (537) 98 protein:vir:95449 Length: 584 99.5 2.2E-12 1.3E-15 84.5 34.8 444 1-472 1-584 (584) 99 protein:vir:3420 Length: 533 # 99.5 5.7E-12 3.5E-15 82.2 39.1 427 1-490 1-533 (533) 100 protein:vir:94049 Length: 532 99.5 1E-12 6.4E-16 86.3 29.5 435 1-498 33-532 (532) 101 protein:vir:100920 Length: 725 99.5 6.6E-14 4.1E-17 92.8 22.4 475 23-511 1-642 (725) 102 protein:vir:77597 Length: 725 99.5 3.2E-13 2E-16 89.1 26.0 474 14-511 1-635 (725) 103 protein:vir:389 Length: 530 # 99.4 1.3E-11 8.3E-15 80.2 39.6 427 28-490 1-530 (530) 104 protein:vir:105429 Length: 708 99.4 2.1E-13 1.3E-16 90.1 23.5 485 25-511 1-670 (708) 105 protein:vir:10321 Length: 495 99.4 1.8E-11 1.1E-14 79.5 36.1 432 26-511 1-494 (495) 106 protein:vir:9263 Length: 725 # 99.4 2.8E-13 1.7E-16 89.4 21.4 475 23-511 1-642 (725) 107 protein:vir:96068 Length: 765 99.4 1.1E-11 7.1E-15 80.6 30.2 450 1-511 40-566 (765) 108 protein:vir:95542 Length: 548 99.4 6.1E-11 3.8E-14 76.6 39.6 449 28-507 1-548 (548) 109 protein:vir:79647 Length: 435 99.4 1.7E-11 1E-14 79.7 29.4 399 14-499 1-435 (435) 110 protein:vir:105520 Length: 706 99.4 1.5E-12 9.3E-16 85.4 23.5 468 25-511 1-653 (706) 111 protein:vir:104338 Length: 422 99.3 8E-11 5E-14 75.9 32.4 385 44-497 1-422 (422) 112 protein:vir:172 Length: 708 # 99.3 2.6E-12 1.6E-15 84.1 24.1 480 1-511 1-654 (708) 113 protein:vir:3520 Length: 720 # 99.3 6.3E-11 3.9E-14 76.5 31.5 462 25-511 1-638 (720) 114 protein:vir:99563 Length: 862 99.3 1.5E-10 9.4E-14 74.4 32.9 431 1-511 91-593 (862) 115 protein:vir:8883 Length: 543 # 99.3 3.2E-10 2E-13 72.7 37.3 460 1-496 1-543 (543) 116 protein:vir:3139 Length: 599 # 99.2 8.5E-11 5.3E-14 75.8 27.2 456 1-490 1-599 (599) 117 protein:vir:94709 Length: 522 99.2 4E-10 2.5E-13 72.1 37.3 449 22-497 1-522 (522) 118 protein:vir:1538 Length: 535 # 99.2 4.7E-10 2.9E-13 71.7 39.7 452 1-500 1-535 (535) 119 protein:vir:10447 Length: 536 99.2 5E-10 3.1E-13 71.6 38.5 456 14-496 1-536 (536) 120 protein:vir:2198 Length: 536 # 99.2 5.1E-10 3.2E-13 71.5 38.6 456 14-496 1-536 (536) 121 protein:vir:102668 Length: 547 99.2 6.9E-10 4.3E-13 70.8 40.5 443 28-503 1-547 (547) 122 protein:vir:98506 Length: 555 99.2 7.6E-10 4.7E-13 70.6 37.5 456 24-489 1-555 (555) 123 protein:vir:107404 Length: 555 99.2 7.6E-10 4.7E-13 70.6 37.5 456 24-489 1-555 (555) 124 protein:vir:107822 Length: 555 99.2 7.6E-10 4.7E-13 70.6 37.5 456 24-489 1-555 (555) 125 protein:vir:107662 Length: 427 99.2 9E-10 5.6E-13 70.2 30.9 393 3-502 1-427 (427) 126 protein:vir:95315 Length: 559 99.2 9.9E-10 6.2E-13 69.9 38.3 463 25-500 1-559 (559) 127 protein:vir:80644 Length: 551 99.2 8.4E-10 5.2E-13 70.3 28.7 466 1-510 3-551 (551) 128 protein:vir:7321 Length: 556 # 99.1 1.9E-09 1.2E-12 68.5 34.7 460 25-493 1-556 (556) 129 protein:vir:63755 Length: 547 99.1 2.1E-09 1.3E-12 68.2 32.3 465 3-510 1-547 (547) 130 protein:vir:3361 Length: 535 # 99.1 2.5E-09 1.5E-12 67.8 38.8 452 1-500 1-535 (535) 131 protein:vir:103765 Length: 549 99.1 2.5E-09 1.6E-12 67.7 37.2 446 25-495 1-549 (549) 132 protein:vir:78696 Length: 542 98.9 1.8E-08 1.1E-11 63.1 39.2 450 25-500 1-542 (542) 133 protein:vir:3153 Length: 467 # 98.9 2.3E-08 1.4E-11 62.4 29.3 399 71-511 1-466 (467) 134 protein:vir:94572 Length: 535 98.8 3.1E-08 1.9E-11 61.8 35.9 454 3-495 1-535 (535) 135 protein:vir:102727 Length: 945 98.8 3.4E-08 2.1E-11 61.5 35.0 423 1-511 60-539 (945) 136 protein:vir:1785 Length: 555 # 98.8 3.6E-08 2.3E-11 61.4 34.3 452 28-511 1-555 (555) 137 protein:vir:7853 Length: 518 # 98.8 4.2E-08 2.6E-11 61.0 30.9 425 1-511 1-455 (518) 138 protein:vir:100039 Length: 522 98.7 7.9E-08 4.9E-11 59.5 35.5 442 28-498 1-522 (522) 139 protein:vir:99672 Length: 532 98.7 8.3E-08 5.1E-11 59.4 36.4 447 14-501 1-532 (532) 140 protein:vir:94599 Length: 641 98.7 8.5E-08 5.2E-11 59.4 32.0 479 1-497 1-641 (641) 141 protein:vir:101648 Length: 518 98.7 1E-07 6.2E-11 59.0 30.2 424 21-511 1-455 (518) 142 protein:vir:95821 Length: 763 98.7 1.1E-07 6.6E-11 58.8 33.2 469 3-511 1-649 (763) 143 protein:vir:79772 Length: 648 98.6 2.4E-07 1.5E-10 56.9 35.5 439 1-511 32-515 (648) 144 protein:vir:102080 Length: 429 98.5 3.5E-07 2.2E-10 56.0 25.7 400 3-488 1-429 (429) 145 protein:vir:1326 Length: 457 # 98.5 4.2E-07 2.6E-10 55.5 33.8 417 34-511 1-456 (457) 146 protein:vir:4156 Length: 542 # 98.5 5.8E-07 3.6E-10 54.8 26.6 431 13-511 1-474 (542) 147 protein:vir:1380 Length: 422 # 98.4 6.4E-07 3.9E-10 54.6 29.5 389 34-502 1-422 (422) 148 protein:vir:8418 Length: 409 # 98.4 6.6E-07 4.1E-10 54.5 29.3 383 3-493 1-409 (409) 149 protein:vir:3843 Length: 397 # 98.4 8.1E-07 5E-10 54.0 31.6 379 3-506 1-397 (397) 150 protein:vir:6240 Length: 457 # 98.4 9.5E-07 5.9E-10 53.6 32.5 414 34-511 1-452 (457) 151 protein:vir:81152 Length: 411 98.4 1E-06 6.4E-10 53.4 30.5 382 14-486 1-411 (411) 152 protein:vir:105002 Length: 432 98.4 1E-06 6.5E-10 53.4 31.2 397 14-488 1-432 (432) 153 protein:vir:102855 Length: 432 98.4 1E-06 6.5E-10 53.4 31.2 397 14-488 1-432 (432) 154 protein:vir:107605 Length: 432 98.4 1E-06 6.5E-10 53.4 31.2 397 14-488 1-432 (432) 155 protein:vir:93610 Length: 454 98.4 1.1E-06 6.9E-10 53.2 32.5 410 33-511 1-447 (454) 156 protein:vir:80796 Length: 574 98.3 1.2E-06 7.3E-10 53.1 32.4 465 1-511 1-552 (574) 157 protein:vir:1266 Length: 416 # 98.3 1.2E-06 7.3E-10 53.1 29.1 388 3-491 1-416 (416) 158 protein:vir:9359 Length: 348 # 98.3 1.2E-06 7.5E-10 53.0 31.4 326 92-491 1-348 (348) 159 protein:vir:100691 Length: 535 98.3 1.5E-06 9.6E-10 52.4 34.8 454 3-510 1-535 (535) 160 protein:vir:4952 Length: 386 # 98.3 1.7E-06 1E-09 52.3 30.0 369 3-477 1-386 (386) 161 protein:vir:78589 Length: 695 98.2 2.2E-06 1.3E-09 51.6 24.9 452 1-511 46-572 (695) 162 protein:vir:96988 Length: 516 98.2 2.4E-06 1.5E-09 51.4 36.2 426 1-482 1-516 (516) 163 protein:vir:103860 Length: 528 98.1 4.5E-06 2.8E-09 49.9 39.5 410 1-511 1-452 (528) 164 protein:vir:4194 Length: 540 # 98.1 4.9E-06 3E-09 49.7 27.6 424 13-511 1-470 (540) 165 protein:vir:3648 Length: 695 # 98.1 5.3E-06 3.3E-09 49.5 25.2 446 1-511 38-572 (695) 166 protein:vir:81095 Length: 416 98.1 5.4E-06 3.4E-09 49.4 30.7 391 3-488 1-416 (416) 167 protein:vir:4598 Length: 416 # 98.1 5.4E-06 3.4E-09 49.4 30.7 391 3-488 1-416 (416) 168 protein:vir:5737 Length: 419 # 98.0 6.1E-06 3.8E-09 49.2 30.3 387 3-502 1-419 (419) 169 protein:vir:101541 Length: 694 98.0 6.1E-06 3.8E-09 49.2 25.3 449 1-511 33-571 (694) 170 protein:vir:4454 Length: 414 # 98.0 6.5E-06 4.1E-09 49.0 35.8 382 34-510 1-414 (414) 171 protein:vir:7017 Length: 515 # 98.0 7E-06 4.3E-09 48.9 39.5 430 20-479 1-515 (515) 172 protein:vir:103330 Length: 517 98.0 8.3E-06 5.1E-09 48.4 36.5 428 23-498 1-517 (517) 173 protein:vir:100150 Length: 437 98.0 8.9E-06 5.5E-09 48.3 35.3 395 28-507 1-437 (437) 174 protein:vir:95599 Length: 563 98.0 9.4E-06 5.8E-09 48.1 30.0 459 1-511 1-561 (563) 175 protein:vir:99312 Length: 563 98.0 9.4E-06 5.8E-09 48.1 30.0 459 1-511 1-561 (563) 176 protein:vir:99232 Length: 526 97.9 1.3E-05 7.9E-09 47.4 40.1 408 1-511 1-450 (526) 177 protein:vir:94426 Length: 409 97.9 1.5E-05 9E-09 47.1 31.4 382 28-491 1-409 (409) 178 protein:vir:96980 Length: 409 97.8 1.6E-05 9.8E-09 46.9 31.5 385 28-491 1-409 (409) 179 protein:vir:107880 Length: 491 97.8 1.7E-05 1.1E-08 46.7 36.8 393 1-511 1-421 (491) 180 protein:vir:105641 Length: 516 97.8 1.9E-05 1.2E-08 46.5 36.4 426 1-479 1-516 (516) 181 protein:vir:3989 Length: 392 # 97.8 2E-05 1.2E-08 46.4 33.1 374 1-494 1-392 (392) 182 protein:vir:1023 Length: 392 # 97.8 2E-05 1.2E-08 46.4 33.1 374 1-494 1-392 (392) 183 protein:vir:102118 Length: 409 97.8 2E-05 1.2E-08 46.3 34.6 382 22-486 1-409 (409) 184 protein:vir:78641 Length: 278 97.8 2.1E-05 1.3E-08 46.2 26.3 261 92-408 1-278 (278) 185 protein:vir:99853 Length: 488 97.7 2.4E-05 1.5E-08 45.9 34.7 381 28-511 1-413 (488) 186 protein:vir:2683 Length: 412 # 97.7 2.4E-05 1.5E-08 45.9 33.2 383 34-491 1-412 (412) 187 protein:vir:6322 Length: 510 # 97.7 2.7E-05 1.7E-08 45.6 36.2 428 29-476 1-510 (510) 188 protein:vir:78942 Length: 510 97.7 2.7E-05 1.7E-08 45.6 38.4 427 29-482 1-510 (510) 189 protein:vir:106716 Length: 698 97.7 2.8E-05 1.7E-08 45.6 23.3 447 1-511 46-574 (698) 190 protein:vir:105782 Length: 449 97.7 3.2E-05 2E-08 45.2 26.2 403 28-487 1-449 (449) 191 protein:vir:79233 Length: 526 97.6 3.9E-05 2.4E-08 44.8 39.4 406 1-511 1-447 (526) 192 protein:vir:7407 Length: 392 # 97.6 4.1E-05 2.6E-08 44.6 33.6 374 1-485 1-392 (392) 193 protein:vir:4854 Length: 386 # 97.6 4.5E-05 2.8E-08 44.4 31.3 367 34-477 1-386 (386) 194 protein:vir:99452 Length: 651 97.5 4.8E-05 2.9E-08 44.3 27.6 463 1-511 1-557 (651) 195 protein:vir:4828 Length: 382 # 97.5 4.8E-05 3E-08 44.3 32.6 361 34-477 1-382 (382) 196 protein:vir:4995 Length: 384 # 97.5 5.6E-05 3.4E-08 43.9 25.8 362 34-476 1-384 (384) 197 protein:vir:483 Length: 413 # 97.4 6.6E-05 4.1E-08 43.5 35.0 385 31-510 1-413 (413) 198 protein:vir:100882 Length: 383 97.4 6.7E-05 4.2E-08 43.5 29.9 359 34-499 1-383 (383) 199 protein:vir:93943 Length: 409 97.4 7.1E-05 4.4E-08 43.3 33.7 386 23-491 1-409 (409) 200 protein:vir:79063 Length: 491 97.4 8.3E-05 5.2E-08 43.0 36.9 393 1-511 1-421 (491) 201 protein:vir:79984 Length: 441 97.4 8.7E-05 5.4E-08 42.9 34.5 400 11-501 1-441 (441) 202 protein:vir:9408 Length: 441 # 97.4 8.7E-05 5.4E-08 42.9 34.5 400 11-501 1-441 (441) 203 protein:vir:96579 Length: 576 97.3 0.00011 7.1E-08 42.2 31.8 441 1-511 54-535 (576) 204 protein:vir:1431 Length: 419 # 97.2 0.00013 8.2E-08 41.9 28.5 383 35-500 1-419 (419) 205 protein:vir:189 Length: 424 # 97.2 0.00013 8.3E-08 41.8 27.1 382 23-484 1-424 (424) 206 protein:vir:98396 Length: 441 97.1 0.00016 9.7E-08 41.4 33.8 400 11-501 1-441 (441) 207 protein:vir:4337 Length: 434 # 97.1 0.00016 9.8E-08 41.4 31.2 405 1-509 1-434 (434) 208 protein:vir:105064 Length: 421 97.1 0.00017 1E-07 41.3 30.5 389 3-511 1-420 (421) 209 protein:vir:1884 Length: 424 # 97.1 0.00018 1.1E-07 41.1 30.0 381 23-484 1-424 (424) 210 protein:vir:1986 Length: 512 # 97.0 0.00023 1.5E-07 40.5 38.6 410 1-511 1-449 (512) 211 protein:vir:95378 Length: 406 96.9 0.00027 1.7E-07 40.1 25.1 378 3-511 1-406 (406) 212 protein:vir:10362 Length: 432 96.8 0.00031 1.9E-07 39.8 30.9 387 28-501 1-432 (432) 213 protein:vir:104500 Length: 537 96.8 0.00033 2E-07 39.7 29.3 440 1-502 1-537 (537) 214 protein:vir:4509 Length: 424 # 96.8 0.00035 2.2E-07 39.5 31.9 379 21-493 1-424 (424) 215 protein:vir:81072 Length: 432 96.6 0.00046 2.9E-07 38.9 32.9 387 28-501 1-432 (432) 216 protein:vir:9702 Length: 406 # 96.5 0.00055 3.4E-07 38.5 29.3 381 34-502 1-406 (406) 217 protein:vir:100187 Length: 385 96.4 0.00064 4E-07 38.1 31.2 362 34-476 1-385 (385) 218 protein:vir:960 Length: 413 # 96.4 0.00067 4.2E-07 38.0 29.0 364 3-486 1-413 (413) 219 protein:vir:3868 Length: 417 # 96.3 0.00079 4.9E-07 37.6 29.6 385 34-499 1-417 (417) 220 protein:vir:103219 Length: 201 96.2 0.00046 2.9E-07 38.9 12.3 186 275-486 1-201 (201) 221 protein:vir:81218 Length: 423 96.1 0.0011 6.6E-07 36.9 31.0 372 34-510 1-423 (423) 222 protein:vir:1082 Length: 359 # 96.0 0.0011 7E-07 36.7 27.3 338 3-440 1-359 (359) 223 protein:vir:80333 Length: 419 96.0 0.0012 7.4E-07 36.6 32.2 373 35-511 1-412 (419) 224 protein:vir:97060 Length: 432 95.9 0.0013 8.3E-07 36.3 31.4 387 28-506 1-432 (432) 225 protein:vir:80211 Length: 514 95.8 0.0014 8.7E-07 36.2 37.2 430 28-474 1-514 (514) 226 protein:vir:77981 Length: 448 95.0 0.003 1.8E-06 34.4 33.6 399 1-511 1-440 (448) 227 protein:vir:104892 Length: 558 94.9 0.0031 1.9E-06 34.3 27.0 463 1-510 5-558 (558) 228 protein:vir:101647 Length: 460 94.9 0.0033 2.1E-06 34.2 33.2 396 1-488 1-460 (460) 229 protein:vir:100249 Length: 431 94.8 0.0035 2.2E-06 34.0 34.1 376 34-510 1-431 (431) 230 protein:vir:6210 Length: 394 # 94.6 0.004 2.5E-06 33.7 26.2 366 3-502 1-394 (394) 231 protein:vir:106999 Length: 564 94.4 0.0044 2.7E-06 33.5 23.8 468 1-509 5-564 (564) 232 protein:vir:98816 Length: 446 94.3 0.0048 3E-06 33.3 26.9 393 1-444 1-446 (446) 233 protein:vir:78161 Length: 355 94.2 0.0051 3.1E-06 33.2 28.7 302 135-511 1-354 (355) 234 protein:vir:104259 Length: 403 93.6 0.0069 4.3E-06 32.4 30.3 370 1-485 1-403 (403) 235 protein:vir:108215 Length: 469 93.3 0.0082 5.1E-06 32.0 38.5 412 28-511 1-468 (469) 236 protein:vir:101289 Length: 395 92.8 0.0099 6.1E-06 31.6 27.9 367 34-511 1-395 (395) 237 protein:vir:100650 Length: 395 92.8 0.0099 6.1E-06 31.6 27.9 367 34-511 1-395 (395) 238 protein:vir:9507 Length: 395 # 92.8 0.0099 6.1E-06 31.6 27.9 367 34-511 1-395 (395) 239 protein:vir:79511 Length: 448 92.8 0.01 6.2E-06 31.5 33.1 397 1-511 1-440 (448) 240 protein:vir:8100 Length: 466 # 92.0 0.013 8.2E-06 30.9 33.1 418 3-502 1-466 (466) 241 protein:vir:103177 Length: 533 91.6 0.015 9.3E-06 30.6 26.8 447 1-510 5-533 (533) 242 protein:vir:80134 Length: 403 91.4 0.016 9.8E-06 30.5 28.3 371 34-494 1-403 (403) 243 protein:vir:94666 Length: 723 91.1 0.017 1.1E-05 30.2 33.7 394 50-511 1-441 (723) 244 protein:vir:95254 Length: 488 89.6 0.025 1.6E-05 29.3 35.5 434 14-508 1-488 (488) 245 protein:vir:8317 Length: 409 # 89.5 0.026 1.6E-05 29.3 28.4 349 34-479 1-409 (409) 246 protein:vir:5839 Length: 533 # 88.6 0.031 2E-05 28.8 24.9 439 3-511 1-525 (533) 247 protein:vir:95965 Length: 385 79.0 0.11 6.7E-05 25.9 22.6 348 34-485 1-385 (385) 248 protein:vir:4698 Length: 251 # 76.9 0.13 8.1E-05 25.4 14.7 241 3-316 1-251 (251) 249 protein:vir:345 Length: 663 # 75.9 0.14 8.8E-05 25.2 27.6 462 1-511 1-603 (663) 250 protein:vir:78191 Length: 351 70.2 0.21 0.00013 24.3 22.6 301 40-415 1-351 (351) 251 protein:vir:103458 Length: 524 69.3 0.23 0.00014 24.1 21.5 411 1-473 27-524 (524) 252 protein:vir:7208 Length: 524 # 69.3 0.23 0.00014 24.1 21.4 411 1-474 27-524 (524) 253 protein:vir:1150 Length: 350 # 64.7 0.3 0.00018 23.5 22.3 294 37-411 1-350 (350) 254 protein:vir:6896 Length: 523 # 60.3 0.38 0.00023 22.9 22.5 411 1-473 9-523 (523) 255 protein:vir:1661 Length: 378 # 58.9 0.4 0.00025 22.7 20.8 335 44-494 1-378 (378) 256 protein:vir:100598 Length: 516 54.5 0.5 0.00031 22.2 23.8 406 1-473 29-516 (516) 257 protein:vir:98567 Length: 340 52.3 0.56 0.00035 22.0 19.1 291 55-412 1-340 (340) 258 protein:vir:6058 Length: 344 # 50.4 0.61 0.00038 21.8 19.7 287 37-412 1-344 (344) 259 protein:vir:267 Length: 348 # 49.1 0.65 0.0004 21.6 22.8 298 55-418 1-348 (348) 260 protein:vir:103971 Length: 376 45.8 0.76 0.00047 21.2 21.9 310 28-415 1-376 (376) 261 protein:vir:101189 Length: 516 44.0 0.82 0.00051 21.1 22.8 400 1-473 37-516 (516) 262 protein:vir:101806 Length: 516 44.0 0.82 0.00051 21.1 22.8 400 1-473 37-516 (516) 263 protein:vir:79207 Length: 351 43.9 0.83 0.00051 21.0 21.9 301 40-421 1-351 (351) 264 protein:vir:108049 Length: 524 43.2 0.86 0.00053 21.0 22.6 411 1-474 31-524 (524) 265 protein:vir:93867 Length: 378 42.0 0.9 0.00056 20.8 20.1 334 34-494 1-378 (378) 266 protein:vir:100328 Length: 346 41.4 0.93 0.00058 20.8 23.9 289 55-413 1-346 (346) 267 protein:vir:81017 Length: 521 40.4 0.97 0.0006 20.6 25.8 411 1-472 12-521 (521) 268 protein:vir:5691 Length: 344 # 40.0 0.99 0.00061 20.6 21.6 281 61-413 1-344 (344) 269 protein:vir:94002 Length: 378 38.0 1.1 0.00068 20.4 23.3 335 34-494 1-378 (378) 270 protein:vir:94869 Length: 378 30.4 1.6 0.00098 19.5 20.9 335 34-494 1-378 (378) 271 protein:vir:3780 Length: 345 # 28.7 1.7 0.0011 19.3 26.6 290 61-409 1-345 (345) 272 protein:vir:4089 Length: 395 # 26.4 1.9 0.0012 19.0 29.8 366 34-493 1-395 (395) 273 protein:vir:5665 Length: 511 # 25.1 2.1 0.0013 18.8 23.7 397 1-474 31-511 (511) 274 protein:vir:106282 Length: 521 24.3 2.2 0.0014 18.7 23.4 409 1-473 1-521 (521) 275 protein:vir:98853 Length: 219 23.8 2.3 0.0014 18.7 15.2 194 173-412 1-219 (219) 276 protein:vir:2013 Length: 344 # 20.6 2.7 0.0017 18.2 21.1 276 55-413 1-344 (344) No 1 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=100.00 E-value=2.6e-99 Score=561.03 Aligned_cols=478 Identities=19% Similarity=0.315 Sum_probs=399.4 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccch Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNF 80 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~ 80 (511) ||+.|.+.+- -+.++++++.|.+++++|..++++|+++++||.|+|++++++ .....++++|+++|| T Consensus 1 ~~~~~~~~~~------------~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~-~~~~~~~~~ki~~n~ 67 (499) T protein:vir:10 1 MAVVIDKDLL------------DDVNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQEIEKHE-FDNATVEAANVMVNH 67 (499) T ss_pred CccchhhhHH------------hhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCC-cCcCCCCcceeecch Confidence 8887776632 223466788899999999999999999999999999987654 345678899999999 Q ss_pred HHHHHHHHHhhhhccCceecC-chhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCC-------------- Q lcl|NC_018086. 81 PKLLVDTSTAYLAGEPITESG-DEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNK-------------- 145 (511) Q Consensus 81 ~k~ivd~~~~~l~g~~~~~~~-d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g-------------- 145 (511) +++||++.++|+||+|++++. +++..+.++++|+.|+|+..+.+++++++++|+||+++|.+++| T Consensus 68 ~~~Iv~~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~ 147 (499) T protein:vir:10 68 AKYITDMNVGFMTGNPVKYVAEKGKNIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLT 147 (499) T ss_pred HHHHHHHHhhhhcccCceeecCChhHHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccc Confidence 999999999999999999975 45667889999999999999999999999999999999999877 Q ss_pred ---ceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccc Q lcl|NC_018086. 146 ---KHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDY 222 (511) Q Consensus 146 ---~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 222 (511) .++++.++|++++++|++....++++++|+|...+..+++.++++++||++.+++|.....+.. ...+... T Consensus 148 ~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~------~~~~~~~ 221 (499) T protein:vir:10 148 PNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEV------SANDPIV 221 (499) T ss_pred cccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccc------cCcceec Confidence 3678999999999999999999999999999888777788889999999999999987665432 2234456 Q ss_pred cceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeec Q lcl|NC_018086. 223 EVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTD 302 (511) Q Consensus 223 ~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~ 302 (511) ...+|+||.||||+|+|++.|+|+|+++++|||+||.++|++++.++++++|+++++|+..+++.+....++.+.++.++ T Consensus 222 ~~~~~~~g~vPvv~~~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~ 301 (499) T protein:vir:10 222 YDGENLFGAVPIIEFRNNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDKDDIQRLKRGAIEAPP 301 (499) T ss_pred ccccCCCCccceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccchhhhhhhcceeccC Confidence 77899999999999999999999999999999999999999999999999999999999888777777777766666654 Q ss_pred --CCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 303 --EDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYE 379 (511) Q Consensus 303 --~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~ 379 (511) ++++++|++++.+.+++++++++|.+.|+.+|++|+++++.+ |++||+||++++++|.+|+.++++.|+.+|+++++ T Consensus 302 ~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~ 381 (499) T protein:vir:10 302 REEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKFMGNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLK 381 (499) T ss_pred CCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhhcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 567899999999999999999999999999999999998876 68999999999999999999999999999999999 Q ss_pred HHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 380 LVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIAL 459 (511) Q Consensus 380 li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~ 459 (511) +|+.+++..+ ..++..+++|+|++++|.|.++.++++++++|++|+||++++||+++|+++|++||++|+++.++... T Consensus 382 li~~~~~~~~--~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~ 459 (499) T protein:vir:10 382 LIQTIVNIKG--ANDDASGCKISLVANIPSNLSDVVNNVKNADGIIPRKYTYSWLPDVDNPQDVIDEMNQQDAETIKKNQ 459 (499) T ss_pred HHHHHHhccC--CccccccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHH Confidence 9999988665 35567789999999999999999999999999999999999999999999999999999988777665 Q ss_pred hhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccC Q lcl|NC_018086. 460 QNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAI 504 (511) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (511) ..+....+..+..+...+++....++. ..+...+ .+.+++ T Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~----~~~~~~ 499 (499) T protein:vir:10 460 EALRGQDPDRLELEDKQDDSSENDKEA-GSNHNQS----HRTRAV 499 (499) T ss_pred hhhccCCCCCCCCCCCCcccCCCCCCC-ccccccC----CCCCCC Confidence 554333222221111111111111111 1111111 112222 No 2 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=100.00 E-value=1.9e-97 Score=550.87 Aligned_cols=446 Identities=25% Similarity=0.367 Sum_probs=389.5 Q ss_pred cccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhh Q lcl|NC_018086. 12 DIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAY 91 (511) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~ 91 (511) -+.+.+..++...++.++.+.|.+++++|..+++||+++++||+|+|+++.++. ....++++|+++||+++||++.++| T Consensus 1 ~~~~~~~~~~~p~d~~~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~-~~~~~~~~ki~~n~~~~ivd~~~~~ 79 (453) T protein:vir:39 1 MKYKPPKLMTFPKDEPITNEVVTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPT-KDLWKPDNRLTVNFTKYIVDTFTGY 79 (453) T ss_pred CeecCCcceEcCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhhccCchhcCCC-ccccCccceeecchHHHHHHHHhhh Confidence 456777888888889999999999999999999999999999999999876653 5667889999999999999999999 Q ss_pred hhccCceecCc-hhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceE Q lcl|NC_018086. 92 LAGEPITESGD-EKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPV 170 (511) Q Consensus 92 l~g~~~~~~~d-~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~ 170 (511) ++|+|++++++ ++..+.|+++|+.|+|+.++.+++++++++|+||++||.+++|++++++++|.+++++|++...+++. T Consensus 80 l~g~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~ 159 (453) T protein:vir:39 80 FNGIPVKKSHSDKETLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEETQTNVIYNTPENMFMVYDDTIKQEPL 159 (453) T ss_pred hcccCceeccCChHHHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEecCCCCCeEE Confidence 99999999754 55678899999999999999999999999999999999999999999999999999999998888899 Q ss_pred EEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHH Q lcl|NC_018086. 171 AAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQ 250 (511) Q Consensus 171 ~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v 250 (511) +++++|.. +....++++|+++.+++|....+ .|...+..+|+||.||||+|+|+++|+|+|+++ T Consensus 160 ~~ir~~~~-----~~~~~~~~~yt~~~i~~~~~~~~-----------~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v 223 (453) T protein:vir:39 160 FAVRYGYD-----DDYKLYGEVYTKETTYALNGTMG-----------FYNMTEQAPNPFDDLPVVEFYFNEERMSIFESV 223 (453) T ss_pred EEEEEEEe-----CCeEEEEEEEeCCeEEEEEecCC-----------ceeeecccccCCCceeEEEecCCCCCCcchhhh Confidence 99998753 33467899999999999976544 344567789999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeec------CCCceeeeecCCCHHHHHHHHH Q lcl|NC_018086. 251 LSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTD------EDGMVKFITKDVNDKHIENIKN 324 (511) Q Consensus 251 ~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~------~~~~~~~~~~~~~~~~~~~~~~ 324 (511) ++|||+||+++|++++.++++++|+++++|...+. +...+++..+++.++ ++++++|++++.+.++++++++ T Consensus 224 ~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~ 301 (453) T protein:vir:39 224 ISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEE--EDLKNIRSNRVINYYGESSEAKNVDVKFLEKPDSDSQTENLLD 301 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHhhCceeeeecCCCCc--hhhhhhhhcceeeecCCCCCCCCCceeEEeecCCHHHHHHHHH Confidence 99999999999999999999999999999986653 344556666776654 4678999999999999999999 Q ss_pred HHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeC Q lcl|NC_018086. 325 RAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFV 404 (511) Q Consensus 325 ~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~ 404 (511) +|.+.|+.+|++|+++++.+||+||+||++++++|..||+++++.|+.+|++++++|+++++..+. ..+..+++|+|+ T Consensus 302 ~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~--~~~~~~i~v~f~ 379 (453) T protein:vir:39 302 RLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSN--KEAWKDIEYTFT 379 (453) T ss_pred HHHHHHHHHhCCcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC--ccccccceEEeC Confidence 999999999999999999999999999999999999999999999999999999999999886554 456678999999 Q ss_pred CCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccc Q lcl|NC_018086. 405 RNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANK 478 (511) Q Consensus 405 ~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (511) +++|.|.++.|+++++++|++|+||+++++|+++|+++|++||++|+++.............+..+..+...+| T Consensus 380 ~~~p~~~~~~a~~~~kl~g~is~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 380 RNEPKDIKEQAETANILMGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVPETNEE 453 (453) T ss_pred CCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCcCCC Confidence 99999999999999999999999999999999999999999999998877665433332222222222222222 No 3 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=100.00 E-value=9.2e-97 Score=547.09 Aligned_cols=486 Identities=19% Similarity=0.237 Sum_probs=400.4 Q ss_pred CCCccchhhcccccCchhhHhhhhcc--CCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc---------Ccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRR--NFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF---------DDT 69 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~---------~~~ 69 (511) || -|+....+.++.....+....+. +.+.+.|.++++.| ++++|+++++||.|+|++..+... ... T Consensus 1 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~ 77 (503) T protein:vir:59 1 MA-DIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLIDEH--NPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDD 77 (503) T ss_pred Cc-ccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHHhh--cHHHHHHHHHHhccccchhhccchhccccccccccc Confidence 44 46666667777666665554443 44566777888877 457899999999999998765432 345 Q ss_pred ccccceeccchHHHHHHHHHhhhhccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEE Q lcl|NC_018086. 70 NKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRF 149 (511) Q Consensus 70 ~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i 149 (511) .++++|+++||+++||++.++|++|+|++++++++....+++.|..|+|+..+.+++++++++|+||+++|.+++|++++ T Consensus 78 ~~~~~ri~~n~~~~ivd~~~~yl~g~~~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i 157 (503) T protein:vir:59 78 TKTNNRTSHAWHKLFVDQKTQYLVGEPVTFTSDNKTLLEYVNELADDDFDDILNETVKNMSNKGIEYWHPFVDEEGEFDY 157 (503) T ss_pred ccccceeecchHHHHHHHHHhhhhcCCeeeccCcHHHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEEeecCCCceEE Confidence 57788999999999999999999999999998887776666777789999999999999999999999999999999999 Q ss_pred EEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccc---ccccccccccee Q lcl|NC_018086. 150 KAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIP---EELEIKDYEVHP 226 (511) Q Consensus 150 ~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~---~~~~~~~~~~~~ 226 (511) ++++|.+++++|++...+++.++||+|.... ..++.+.++++|+++.+++|....+++...... ....+......+ T Consensus 158 ~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~-~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (503) T protein:vir:59 158 VIFPAEEMIVVYKDNTRRDILFALRYYSYKG-IMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQA 236 (503) T ss_pred EEEccceeEEEEeCCCCCceEEEEEEEEEec-CCCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeeccee Confidence 9999999999999998899999999987654 456677889999999999999887766433221 122234456789 Q ss_pred ccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCCCc Q lcl|NC_018086. 227 NLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDEDGM 306 (511) Q Consensus 227 ~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~ 306 (511) |+|++||||+|+|+++|+|+|+++++|||+||+++|++++.++++++|+++++|.++++.+++..+++..+++.++++++ T Consensus 237 ~~~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (503) T protein:vir:59 237 IGWGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTANLRYHSVIKVSGDGG 316 (503) T ss_pred ccCCccceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhhhhhcccceeccCCCc Confidence 99999999999999999999999999999999999999999999999999999999888888899999999999999999 Q ss_pred eeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 307 VKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYL 385 (511) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~ 385 (511) ++|++++++.++++.+++.|++.|+.+|++|+++++.+ |++||+||++++++|.++|+++++.|+.+|++++++|+.++ T Consensus 317 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~ 396 (503) T protein:vir:59 317 VDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETIGGGATGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYL 396 (503) T ss_pred ceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999988765 57999999999999999999999999999999999999999 Q ss_pred HhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_018086. 386 EFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFK 463 (511) Q Consensus 386 ~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~ 463 (511) +..+........+++|+|++++|.|.++.+++++++ +|++|+||++++||+++||++|++||++|+++..+....... T Consensus 397 ~~~~~~~~~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~ 476 (503) T protein:vir:59 397 RNTGKGDFNPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAVARNPFVQDPEEELARIEEEMNQYAEMQGNLLD 476 (503) T ss_pred HhccCcccccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhccccC Confidence 887665545567799999999999999999999998 589999999999999999999999999998876664322211 Q ss_pred ccccCCCCCCccccccCCCCCCccccccCCCCcc Q lcl|NC_018086. 464 QTSAVQGASTAAANKLDKNPANTSTITTTDPVAA 497 (511) Q Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (511) . .++.+...++....++.+..++| +.+ T Consensus 477 ~----~~~~~~~~~~~~~~~~~~~~~~g---~~~ 503 (503) T protein:vir:59 477 D----EGGDDDLEEDDPNAGAAESGGAG---QVS 503 (503) T ss_pred c----cCCCCCCCcCCCCCCcccCCCCC---CcC Confidence 1 11111111110000000111111 000 No 4 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=100.00 E-value=5.9e-97 Score=548.15 Aligned_cols=446 Identities=26% Similarity=0.369 Sum_probs=388.3 Q ss_pred cccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhh Q lcl|NC_018086. 12 DIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAY 91 (511) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~ 91 (511) -...++..+.+..++.++.+.|.+++++|..+++||+++++||+|+|+++.++. ..+.++++|+++||+++||++.++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~-~~~~~~~~ki~~n~~~~ivd~~~~~ 79 (452) T protein:vir:36 1 MKYKPPKLMTFSKDEPITVEVVTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPA-KDSWKPDNRLAVNFTKYIVDTFTGY 79 (452) T ss_pred CcccCceeEEcCCccCCCHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcc-ccccCccceeecchHHHHHHHHhhh Confidence 456788888889999999999999999999999999999999999999876553 5667889999999999999999999 Q ss_pred hhccCceecCc-hhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceE Q lcl|NC_018086. 92 LAGEPITESGD-EKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPV 170 (511) Q Consensus 92 l~g~~~~~~~d-~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~ 170 (511) ++|+|++++++ ++..+.|+++|+.|+|+..+.+++++++++|+||+++|.+++|++++++++|.+++++||+...++++ T Consensus 80 l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~ 159 (452) T protein:vir:36 80 FNGIPVKKSHSDKEILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDEDTQTNVVYNSPENMFMVYDDTVKQEPL 159 (452) T ss_pred hcccCceeecCChhHHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceE Confidence 99999999755 45678899999999999999999999999999999999999999999999999999999999888999 Q ss_pred EEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHH Q lcl|NC_018086. 171 AAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQ 250 (511) Q Consensus 171 ~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v 250 (511) +++|+|... ....++++||++.+++|....++ +......+|+||.||||+|+|+++|+|+|+++ T Consensus 160 ~~i~~~~~~-----~~~~~~~vyt~~~i~~~~~~~~~-----------~~~~~~~~~~~g~iPvv~~~n~~~g~sd~e~v 223 (452) T protein:vir:36 160 FAVRYGVDE-----DKKLQGEVYTLLETIKISGENDE-----------ISFGEGTYNPYPDLPVVEFYFNEERMSIFESV 223 (452) T ss_pred EEEEEEEec-----CceEEEEEEecCeEEEEEEcCCc-----------eEEecceeccCCcccEEEecCCCCCCcchHHH Confidence 999988632 23567899999999999766543 34556789999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCCC-----ceeeeecCCCHHHHHHHHHH Q lcl|NC_018086. 251 LSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDEDG-----MVKFITKDVNDKHIENIKNR 325 (511) Q Consensus 251 ~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~-----~~~~~~~~~~~~~~~~~~~~ 325 (511) ++|+|+||+++|++++.++++++|+++++|...+. +...+++.++++.++.++ +++|++++.+.+++++++++ T Consensus 224 ~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 301 (452) T protein:vir:36 224 ISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEE--EDLKNIRSNRVINYYADGEGKNVDVKFLEKPDSDSQTENLLDR 301 (452) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcCc--hhhhhhhhcceEEecCCCCccCCcceeEeecCCHHHHHHHHHH Confidence 99999999999999999999999999999986543 455666667888876643 68999999999999999999 Q ss_pred HHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCC Q lcl|NC_018086. 326 AKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVR 405 (511) Q Consensus 326 l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~ 405 (511) |.+.|+.+|++|+++++.+||+||+||++++++|..||+++++.|+.+|++++++|+.+++..+. ..+..+++|.|++ T Consensus 302 l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~--~~~~~~i~i~f~~ 379 (452) T protein:vir:36 302 LTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSN--KDSWKDIEYTFTR 379 (452) T ss_pred HHHHHHHHhCccccCcccccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC--ccccccceEEeCC Confidence 99999999999999999999999999999999999999999999999999999999999887654 3456689999999 Q ss_pred CCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccc Q lcl|NC_018086. 406 NLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANK 478 (511) Q Consensus 406 ~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (511) ++|.|.++.++++++++|++|.||+++++|+++|+++|++||++|+++..+..........+..+..+...+| T Consensus 380 ~~p~d~~~~a~~~~k~~g~iS~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~e 452 (452) T protein:vir:36 380 NEPKDIKEQAETANILMGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVSETNEE 452 (452) T ss_pred CCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccCCCCcccccCccccCC Confidence 9999999999999999999999999999999999999999999998776554332222211111111111111 No 5 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=100.00 E-value=5.8e-97 Score=548.20 Aligned_cols=466 Identities=17% Similarity=0.196 Sum_probs=386.6 Q ss_pred CCCccchhhcccccCchhhHhh-----h----hccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc----- Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKH-----F----IRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF----- 66 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~-----~----~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~----- 66 (511) ||++.-+- |.++.+..+-.. + .+.+...+.|.+++++|..++++|+++.+||+|+|+++.++.+ T Consensus 1 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~ 78 (483) T protein:vir:12 1 MAQALIKG--GNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATG 78 (483) T ss_pred CccchhcC--CceeecCcchhhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccc Confidence 88886543 555543333221 1 1223445678889999999999999999999999998766533 Q ss_pred -CccccccceeccchHHHHHHHHHhhhhccCceecCchhh-HHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCC Q lcl|NC_018086. 67 -DDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDEKT-IKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRN 144 (511) Q Consensus 67 -~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~-~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~ 144 (511) ..+.++++|+++||+++||++.++|++|+|++++++++. .+.++++| .|+++..+.+++++++++|+||+++|.+++ T Consensus 79 ~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~~~d~~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~y~~v~~d~d 157 (483) T protein:vir:12 79 AVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEE 157 (483) T ss_pred cccccccccccccchHHHHHHHHhhhhcccCceeccCChHHHHHHHHHH-hccHHHHHHHHHHHHhhCCeEEEEEEEcCC Confidence 456678899999999999999999999999999876654 45566555 588999999999999999999999999999 Q ss_pred CceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccc Q lcl|NC_018086. 145 KKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEV 224 (511) Q Consensus 145 g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (511) |++++++++|.+++++||+...+++++++|+|.... ..++++|+++.+++|....+..... ......+..... T Consensus 158 ~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~------~~~~~~y~~~~v~~~~~~~~~~~~~-~~~~~~~~~~~~ 230 (483) T protein:vir:12 158 GEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN------ETKVEYWDKVTVNYYVYENGSLIPD-YSNNLENSKTHF 230 (483) T ss_pred CceEEEEEcccceEEEEcCCCCCceEEEEEEEEeec------ceEEEEEecCeEEEEEEeCCeeeec-cccccccccccc Confidence 999999999999999999988899999999987542 2357999999999998776554332 222334556678 Q ss_pred eeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCC Q lcl|NC_018086. 225 HPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDED 304 (511) Q Consensus 225 ~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~ 304 (511) .+|+||.||||+|+|+++|+|+|+++++|+|+||.++|++++.++++++|+++++|...+..+++...++..+++.++++ T Consensus 231 ~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~ 310 (483) T protein:vir:12 231 STGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDN 310 (483) T ss_pred ccCCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHHHhhhhccccccCCC Confidence 89999999999999999999999999999999999999999999999999999999988888888888888899999999 Q ss_pred CceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 305 GMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCS 383 (511) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 383 (511) ++++|++++.+.+.+++++++|.++|+.+|++|+++++.+ +++||+||++++.+|..||.++++.|+.+|++++++|++ T Consensus 311 ~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~ 390 (483) T protein:vir:12 311 GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFE 390 (483) T ss_pred CcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999887 579999999999999999999999999999999999999 Q ss_pred HHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_018086. 384 YLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFK 463 (511) Q Consensus 384 ~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~ 463 (511) +++.. .+..+++|+|++++|.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++.++.... .. T Consensus 391 ~~~~~-----~~~~~i~v~f~~~~p~~~~~~a~~~~kl~GiiS~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~-~~ 464 (483) T protein:vir:12 391 HFDIK-----GEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPN-LD 464 (483) T ss_pred HhcCC-----CccceeeEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccc-cc Confidence 87643 2456789999999999999999999999999999999999999999999999999998876553211 10 Q ss_pred ccccCCCCCCccccccCCCCCCccccccCCC Q lcl|NC_018086. 464 QTSAVQGASTAAANKLDKNPANTSTITTTDP 494 (511) Q Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (511) + .+......+++++ ..+++ T Consensus 465 ---~----~~~d~~~~~~~~~-----~~e~e 483 (483) T protein:vir:12 465 ---D----GGADGAQQQERSN-----NKESE 483 (483) T ss_pred ---c----cccCCcccCCCCC-----cccCC Confidence 0 0000000011110 00001 No 6 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=100.00 E-value=6.4e-96 Score=542.48 Aligned_cols=479 Identities=20% Similarity=0.297 Sum_probs=379.8 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCcccccC-CcCccccccceecc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHS-RSSSAYGVLYDYYKGNHIAIQSR-TFDDTNKPNSKIVH 78 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~~~~~~~yY~G~~~~~~~~-~~~~~~~~~~ri~~ 78 (511) .-.-|++-++-..+....-.....+...+.++|.+++++|. .++++|+++++||.|+|+++++. ......++++|+++ T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~ 92 (511) T protein:vir:93 13 LRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAH 92 (511) T ss_pred hhhhhhhhhhhhhCCcccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcceeec Confidence 11111111111111111111111223446788999999986 56789999999999999987654 34566788999999 Q ss_pred chHHHHHHHHHhhhhccCceecCc-hhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccce Q lcl|NC_018086. 79 NFPKLLVDTSTAYLAGEPITESGD-EKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNC 157 (511) Q Consensus 79 n~~k~ivd~~~~~l~g~~~~~~~d-~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~ 157 (511) ||+++||++.++|++|+|++++++ ++..+.|+++|+.|+|+.++.++++++++||+||++||.+++|++++++++|+++ T Consensus 93 n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~~i~~~~p~~~ 172 (511) T protein:vir:93 93 DYASYISDFINGYFLGNPIQYQDDDKDVLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMST 172 (511) T ss_pred chHHHHHHHHhhhhcccCeeeccCChHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccee Confidence 999999999999999999999754 4566789999999999999999999999999999999999999999999999999 Q ss_pred EEEecCCCCCceEEEEEEEEEeecC--CcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceE Q lcl|NC_018086. 158 LIAYSADLDEEPVAAIYYNTVISDI--TGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVL 235 (511) Q Consensus 158 ~~v~d~~~~~~~~~~v~~~~~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 235 (511) +++||+...++++++||+|...... +.+.+.++++||++.+++|...+.+... .........+|+||.|||| T Consensus 173 ~~vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~------~~~~~~~~~~~~~g~vPvv 246 (511) T protein:vir:93 173 FVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLK------LTPRENGFESHSFERMPIT 246 (511) T ss_pred EEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccc------cccccccccccCCCccceE Confidence 9999999888999999999765433 3456788999999999999876654322 2234456789999999999 Q ss_pred eecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCcee-------------eec Q lcl|NC_018086. 236 EIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVI-------------VTD 302 (511) Q Consensus 236 ~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i-------------~~~ 302 (511) +|+|+++|+|+|+++++|||+||.++|++++.++++++|+++++|.......+. ......+++ ... T Consensus 247 ~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (511) T protein:vir:93 247 EFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEV-RKQKEANVLFLEPTVYADSEGRETE 325 (511) T ss_pred EecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchhh-cccccccceecccccccccccccCC Confidence 999999999999999999999999999999999999999999999765443322 222222222 234 Q ss_pred CCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 303 EDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELV 381 (511) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li 381 (511) ++++++|++++.+.++++.++++|.+.|+.+|++|+++++++ ||+||+||+++++++..||.++++.|+.+|++++++| T Consensus 326 ~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li 405 (511) T protein:vir:93 326 GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLL 405 (511) T ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 578899999999999999999999999999999999999877 6899999999999999999999999999999999999 Q ss_pred HHHHHhcCCC-ccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 382 CSYLEFMNKA-KDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQ 460 (511) Q Consensus 382 ~~~~~~~~~~-~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~ 460 (511) +++++..+.. ...+..++++.|++++|.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++.++.... T Consensus 406 ~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~ 485 (511) T protein:vir:93 406 ETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQK 485 (511) T ss_pred HHHHHhccCcccccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhh Confidence 9998876543 345667899999999999999999999999999999999999999999999999999998877765544 Q ss_pred hccccccCCCCCCccccccCCCCCCccccccCCC Q lcl|NC_018086. 461 NFKQTSAVQGASTAAANKLDKNPANTSTITTTDP 494 (511) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (511) ............+ ++++..+... +.+ T Consensus 486 ~~~~~~~~~~~~~-----~~~~~~~~~~---~~~ 511 (511) T protein:vir:93 486 GIYKDPRDINDDE-----QDDDTKDTVD---KKE 511 (511) T ss_pred hcccCCCCCCCCC-----CCCccccccc---ccC Confidence 3322111111100 0000000000 000 No 7 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=100.00 E-value=1.5e-96 Score=545.87 Aligned_cols=466 Identities=17% Similarity=0.198 Sum_probs=382.9 Q ss_pred CCCccchhhcccccCchh-----hHhhh----hccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc----- Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNI-----RRKHF----IRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF----- 66 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~-----~~~~~----~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~----- 66 (511) .|++.-+ -|.++.++. .+..+ .+.....+.|.+++.+|..++++|+++.+||.|+|++..++.+ T Consensus 10 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~ 87 (492) T protein:vir:94 10 VAQALIK--GGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATG 87 (492) T ss_pred HHHHHhc--CCceeecCccchhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccc Confidence 2222211 234442221 22211 2233446678889999999999999999999999998766543 Q ss_pred -CccccccceeccchHHHHHHHHHhhhhccCceecCchhh-HHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCC Q lcl|NC_018086. 67 -DDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDEKT-IKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRN 144 (511) Q Consensus 67 -~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~-~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~ 144 (511) ....++++|+++||+++||++.++|++|+|++++++++. .+.|+++| .|+|+..+.++++++++||+||+++|.+++ T Consensus 88 ~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~~~~~d~~~~~~l~~~~-~n~~~~~~~~~~~~a~~~G~a~~~v~~d~d 166 (492) T protein:vir:94 88 AVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEE 166 (492) T ss_pred cccccccccccccchHHHHHHHHHhhhcccCceeccCchHHHHHHHHHH-hccHHHHHHHHHHHHhhCCeEEEEEEecCC Confidence 345678899999999999999999999999999876654 45555555 589999999999999999999999999999 Q ss_pred CceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccc Q lcl|NC_018086. 145 KKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEV 224 (511) Q Consensus 145 g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (511) |++++++++|.+++++||+...+++++++|+|...+ ..++++|+++.+++|....++..... .....+..... T Consensus 167 g~~~~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~------~~~~~~y~~~~v~~~~~~~~~~~~~~-~~~~~~~~~~~ 239 (492) T protein:vir:94 167 GEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN------ETKVEYWDKVTVNYYVYENGSLIPDY-SNNLENSKTHF 239 (492) T ss_pred CceEEEEEcccceEEEEcCCCCCceEEEEEEEeecc------ceeEEEEecCeEEEEEEecCeeeecc-ccccccccccc Confidence 999999999999999999988889999999987542 23579999999999987766554333 22334556778 Q ss_pred eeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCC Q lcl|NC_018086. 225 HPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDED 304 (511) Q Consensus 225 ~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~ 304 (511) .+|+||.||||+|+|+++|+|+|+++++|+|+||+++|++++.++++++|+++++|++.++..++...++..+++.++++ T Consensus 240 ~~~~~g~vPvv~~~nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~ 319 (492) T protein:vir:94 240 STGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRLLRYYGAIKVSDN 319 (492) T ss_pred cccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhHHHHhhccceecCCC Confidence 89999999999999999999999999999999999999999999999999999999988888888888888999999999 Q ss_pred CceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 305 GMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCS 383 (511) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 383 (511) ++++|++++.+.+++++++++|.++|+.+|++|+++++.+ |++||+||++++++|..||+++++.|+.+|++++++|+. T Consensus 320 ~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~ 399 (492) T protein:vir:94 320 GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFE 399 (492) T ss_pred CcceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999887 479999999999999999999999999999999999999 Q ss_pred HHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_018086. 384 YLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFK 463 (511) Q Consensus 384 ~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~ 463 (511) +++... +..+++|.|++++|.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++.++... .+. T Consensus 400 ~~~~~~-----~~~~i~v~f~~~~p~~~~e~~~~~~kl~giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~-~~~ 473 (492) T protein:vir:94 400 HFDIKG-----EHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLP-NLD 473 (492) T ss_pred HhcCCc-----ccceeeEEecCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcc-ccc Confidence 876532 45678999999999999999999999999999999999999999999999999999877665421 111 Q ss_pred ccccCCCCCCccccccCCCCCCccccccCCC Q lcl|NC_018086. 464 QTSAVQGASTAAANKLDKNPANTSTITTTDP 494 (511) Q Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (511) . .+..++ ... +.+.+.+++ T Consensus 474 ~-~~~~~~--~~~---------~~~~~~e~e 492 (492) T protein:vir:94 474 D-GGADSA--QQQ---------ERSNNKESE 492 (492) T ss_pred c-ccCCCC--ccc---------cCCccccCC Confidence 1 000000 000 000000111 No 8 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=100.00 E-value=4.3e-96 Score=543.41 Aligned_cols=427 Identities=19% Similarity=0.280 Sum_probs=381.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCC------cCccccccceeccchHHHHHHHHHhhhhccCceecC Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRT------FDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESG 101 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~------~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~ 101 (511) ++.+.|.+++++|..++++|+++++||.|+|+++.+.. .....++++|+++||+++||++.++|+||+|+++++ T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~~ 80 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTYPVLFDI 80 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheecccceeec Confidence 89999999999999999999999999999999887643 234556888999999999999999999999999974 Q ss_pred -chhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCC--------CceEEEEEcccceEEEecCCCCCceEEE Q lcl|NC_018086. 102 -DEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRN--------KKHRFKAVSPMNCLIAYSADLDEEPVAA 172 (511) Q Consensus 102 -d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~--------g~~~i~~~~p~~~~~v~d~~~~~~~~~~ 172 (511) +++....+++.|..|+++.++.+++++++++|+||+++|.+++ |++++..++|.+++++|++...+++.++ T Consensus 81 ~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vydd~~~~~~~~~ 160 (451) T protein:vir:10 81 DNNKELNEKVTDVLGNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYRNGIERELEAV 160 (451) T ss_pred CCcHHHHHHHHHHhccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEcCCCCCceEEE Confidence 4444455566666799999999999999999999999999875 7889999999999999999988999999 Q ss_pred EEEEEEeecCCc----ceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhH Q lcl|NC_018086. 173 IYYNTVISDITG----HQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFE 248 (511) Q Consensus 173 v~~~~~~~~~~~----~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~ 248 (511) ||+|....+..+ +.+.++++||++.+++|........ ....+....+|+||+||||+|+|++.|.|+|+ T Consensus 161 ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~g~vPvv~~~nn~~~~~d~e 233 (451) T protein:vir:10 161 IRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCC-------GSQIEHITVQHRFNSVPFVEFSNNIKKQSDLS 233 (451) T ss_pred EEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCcc-------ccccccccccCCCCeeeEEEeccCCCCCCchh Confidence 999987776554 4567899999999999986654332 23445677899999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecC-----CCceeeeecCCCHHHHHHHH Q lcl|NC_018086. 249 AQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDE-----DGMVKFITKDVNDKHIENIK 323 (511) Q Consensus 249 ~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~-----~~~~~~~~~~~~~~~~~~~~ 323 (511) +|++|||+||.++|++++.++++++|+++++|+.+++.+++..+++..+++.++. +++++|++++.+.++++.++ T Consensus 234 ~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 313 (451) T protein:vir:10 234 KYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKELKRYKTIKTETDSEGDSGGLKTMQIEIPTEARKIIL 313 (451) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHHHhhCCeEEecCcCCccCCcceEEeecCCHHHHHHHH Confidence 9999999999999999999999999999999999888888899999999888763 57899999999999999999 Q ss_pred HHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEe Q lcl|NC_018086. 324 NRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVF 403 (511) Q Consensus 324 ~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f 403 (511) ++|.++|+.+|++|+++++.+||+||+||++++++|.+||+++++.|+.+|++++++|+.+++.. +..+++|+| T Consensus 314 ~~l~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~------d~~~i~i~f 387 (451) T protein:vir:10 314 EILKKQIYESGQGLQQDTENFGNASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLGVT------DYKKIQQTY 387 (451) T ss_pred HHHHHHHHHHhCcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC------CccceeEEe Confidence 99999999999999999999999999999999999999999999999999999999999987642 456789999 Q ss_pred CCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_018086. 404 VRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSA 467 (511) Q Consensus 404 ~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 467 (511) ++++|.|.++.++++++++|++|+||+++++|+++|+++|++++++|++++.+...+.++.... T Consensus 388 ~~~~p~n~~e~~~~~~kl~g~iS~et~~~~~p~v~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 388 TRNMMSNDLEDADIATKSVGIIPTKIILRHHPWVDDVEEAEKLYLEEKKIQASKVSDDYNNFTE 451 (451) T ss_pred cCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCCC Confidence 9999999999999999999999999999999999999999999998888877776666655333 No 9 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=100.00 E-value=1.2e-95 Score=540.98 Aligned_cols=479 Identities=20% Similarity=0.302 Sum_probs=380.9 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCcccccC-CcCccccccceecc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSR-SSSAYGVLYDYYKGNHIAIQSR-TFDDTNKPNSKIVH 78 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~yY~G~~~~~~~~-~~~~~~~~~~ri~~ 78 (511) .-.-|++-++-..+..........+...+.++|.+++.+|.. +++||+++++||.|+|+++++. ......++++|+++ T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~ 92 (511) T protein:vir:10 13 LRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAH 92 (511) T ss_pred hhhhhhhhhhhhhcCCccCchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeec Confidence 111111111111111111111112234567889999999865 5799999999999999987654 44567788999999 Q ss_pred chHHHHHHHHHhhhhccCceecCc-hhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccce Q lcl|NC_018086. 79 NFPKLLVDTSTAYLAGEPITESGD-EKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNC 157 (511) Q Consensus 79 n~~k~ivd~~~~~l~g~~~~~~~d-~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~ 157 (511) ||+++||++.++|++|+|++++++ ++..+.|+++|+.|+|+.++.++++++++||+||+++|.+++|+++++++||.++ T Consensus 93 n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~~i~~~~p~~~ 172 (511) T protein:vir:10 93 DYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQDDETRLYKSDAMST 172 (511) T ss_pred chHHHHHHHHhhhhcccCceeecCchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccee Confidence 999999999999999999999765 4567889999999999999999999999999999999999999999999999999 Q ss_pred EEEecCCCCCceEEEEEEEEEeecC--CcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceE Q lcl|NC_018086. 158 LIAYSADLDEEPVAAIYYNTVISDI--TGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVL 235 (511) Q Consensus 158 ~~v~d~~~~~~~~~~v~~~~~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 235 (511) +++|++...++++++||+|...... +.+.+.++++||++.+++|....+++.. .........+|+||.|||| T Consensus 173 ~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~------~~~~~~~~~~~~~~~vPvv 246 (511) T protein:vir:10 173 FVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLK------LTPRENGFESHSFERMPIT 246 (511) T ss_pred EEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCCccc------ccccccccccccCcceeEE Confidence 9999999888999999999765443 3356778999999999999877654422 2234567789999999999 Q ss_pred eecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceee-------------ec Q lcl|NC_018086. 236 EIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIV-------------TD 302 (511) Q Consensus 236 ~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~-------------~~ 302 (511) +|+|+.+|+|+|+++++|||+||.++|++++.++++++|+++++|.......+ .......+++. .. T Consensus 247 ~f~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (511) T protein:vir:10 247 EFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-VRKQKEANVLFLEPTVYADSEGRETE 325 (511) T ss_pred EecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchh-hccchhccceecccccccccccccCC Confidence 99999999999999999999999999999999999999999999975544332 22222233332 24 Q ss_pred CCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 303 EDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELV 381 (511) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li 381 (511) ++++++|++++.+.+++++++++|.++|+.+|++|+++++++ ||+||+||+++++++.+||.++++.|+.+|++++++| T Consensus 326 ~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li 405 (511) T protein:vir:10 326 GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLL 405 (511) T ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 467899999999999999999999999999999999999877 6899999999999999999999999999999999999 Q ss_pred HHHHHhcCCC-ccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 382 CSYLEFMNKA-KDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQ 460 (511) Q Consensus 382 ~~~~~~~~~~-~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~ 460 (511) +.+++..+.. ...+..+++|+|++++|.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++..+.... T Consensus 406 ~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~G~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~ 485 (511) T protein:vir:10 406 ETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQK 485 (511) T ss_pred HHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhh Confidence 9998876543 345667899999999999999999999999999999999999999999999999999998887776544 Q ss_pred hccccccCCCCCCccccccCCCCCCccccc Q lcl|NC_018086. 461 NFKQTSAVQGASTAAANKLDKNPANTSTIT 490 (511) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (511) .........+..+.+ ++....+.... T Consensus 486 ~~~~~~~~~~~~~~~----~~~~~~~~~~~ 511 (511) T protein:vir:10 486 GIYKDPRDINDDEQD----DDTKDTVDKKE 511 (511) T ss_pred hcccCCCCCCCCCCC----CcccCcccccC Confidence 332221111111110 00000000000 No 10 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=100.00 E-value=5.9e-96 Score=542.65 Aligned_cols=461 Identities=22% Similarity=0.283 Sum_probs=394.6 Q ss_pred hhcccccCchhhHhhhhccCCCHHHHHHHHHHHH--HHHHHHHHHHHHhcCCCcccccCCc--------Cccccccceec Q lcl|NC_018086. 8 INAGDIITTNIRRKHFIRRNFDLRELITLAEMHS--RSSSAYGVLYDYYKGNHIAIQSRTF--------DDTNKPNSKIV 77 (511) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~--~~~~~~~~~~~yY~G~~~~~~~~~~--------~~~~~~~~ri~ 77 (511) +|+.+++........+.. .+...+.++++.+. .++++|+++++||+|+|++++++.. .+..++++|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~ 78 (479) T protein:vir:79 1 MLNIYISETDLIKVQLKK--ESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAI 78 (479) T ss_pred CCCceecccceEeecccc--CChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceee Confidence 666666666554444332 23344556666553 3568899999999999998766432 34557888999 Q ss_pred cchHHHHHHHHHhhhhccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccce Q lcl|NC_018086. 78 HNFPKLLVDTSTAYLAGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNC 157 (511) Q Consensus 78 ~n~~k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~ 157 (511) +||+++||++.++|++|+|++++++++..+.+++.|..|+|+..+.++++.++++|+||+++|.+++|+++++++||.++ T Consensus 79 ~~~~~~Ivd~~~~~l~g~p~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~ 158 (479) T protein:vir:79 79 NNYHKLLVDQKVGYSVGNPIVFNADDDNLTKLLNDLLGEEFDDTITELYLNASNKGVEWLHPYINRKGEFKYVIIPAEEA 158 (479) T ss_pred cchHHHHHHHHHhhhhcCCceeccCCHHHHHHHHHHHhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccee Confidence 99999999999999999999999988888788888888999999999999999999999999999999999999999999 Q ss_pred EEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccc--------cccccccccceeccC Q lcl|NC_018086. 158 LIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIP--------EELEIKDYEVHPNLL 229 (511) Q Consensus 158 ~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~ 229 (511) +++||+....++.+++|+|...+ .+++.+.++++|+++.+++|....+++...... ....+......+|+| T Consensus 159 ~~v~d~~~~~~~~~~ir~y~~~~-~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (479) T protein:vir:79 159 IPIWDSKRQRELVAFIRFYYIED-IDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGW 237 (479) T ss_pred EEEEeCCCCCceEEEEEEEEEee-cCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCC Confidence 99999988888999999998765 445677889999999999998877655322211 123345567789999 Q ss_pred CccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCCCceee Q lcl|NC_018086. 230 QKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDEDGMVKF 309 (511) Q Consensus 230 g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~~~~ 309 (511) |+||||+|+|+++|+|+|+++++|+|+||.++|++++.++++++|+++++|.+.+..+++..+++..+++.++++++++| T Consensus 238 ~~vPvv~~~nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~ 317 (479) T protein:vir:79 238 GKVPFIPFKNNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFIDNIRYYKSIKVDGGGGVDK 317 (479) T ss_pred CcccEEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccchhhhhhccceecCCCCcceE Confidence 99999999999999999999999999999999999999999999999999998877778888888899999999999999 Q ss_pred eecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_018086. 310 ITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMN 389 (511) Q Consensus 310 ~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~ 389 (511) ++++.+.+++++++++|++.|+.+|++|+++++.+|++||+||++++++|..+|..+++.|+.+|++++++|+.+++..+ T Consensus 318 l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~ 397 (479) T protein:vir:79 318 LEINIPVEAKKELLDRLEKNIIIFGQGVNPESQNTGDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISG 397 (479) T ss_pred EeccCCHHHHHHHHHHHHHHHHHHhCccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999988764 Q ss_pred CCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCC Q lcl|NC_018086. 390 KAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQ 469 (511) Q Consensus 390 ~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 469 (511) . .+++..+++|.|++++|.|+++.|+++++++|++|.||+++++|+++|+++|++||++|+++..+......+... T Consensus 398 ~-~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl~g~iS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~--- 473 (479) T protein:vir:79 398 N-KSYDYKTVQITFNHSMIINEAEKIDMAAKSTGIVSDETIVSNHPWVEDVNDELERLKKQEDTQKEYDDLIPNNQD--- 473 (479) T ss_pred C-CccccccceEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhccCcccC--- Confidence 3 466778899999999999999999999999999999999999999999999999999998876654332211111 Q ss_pred CCCCcccccc Q lcl|NC_018086. 470 GASTAAANKL 479 (511) Q Consensus 470 ~~~~~~~~~~ 479 (511) ...++. T Consensus 474 ----~~~~e~ 479 (479) T protein:vir:79 474 ----GVIDET 479 (479) T ss_pred ----CCcCcC Confidence 111111 No 11 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=100.00 E-value=1.1e-95 Score=541.27 Aligned_cols=477 Identities=21% Similarity=0.320 Sum_probs=378.4 Q ss_pred CCccchh-hcccccCchhhHhhhhc-------------cCCCHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCcccccC-C Q lcl|NC_018086. 2 AIPNGQI-NAGDIITTNIRRKHFIR-------------RNFDLRELITLAEMHSR-SSSAYGVLYDYYKGNHIAIQSR-T 65 (511) Q Consensus 2 ~~~~~~~-~~~~~~~~~~~~~~~~~-------------~~~~~~~l~~~~~~~~~-~~~~~~~~~~yY~G~~~~~~~~-~ 65 (511) -++++.- -.+++ ..+.++..-.+ ...+.++|.+++++|.. ++++|+++++||.|+|+++++. . T Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~ 79 (512) T protein:vir:97 1 MLKANEFETDTDL-RENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR 79 (512) T ss_pred CccceeccCceee-eeCceeeeccccccccccCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCc Confidence 1222210 00111 11111111111 12235778889999864 5789999999999999987654 4 Q ss_pred cCccccccceeccchHHHHHHHHHhhhhccCceecCch-hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCC Q lcl|NC_018086. 66 FDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDE-KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRN 144 (511) Q Consensus 66 ~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~-~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~ 144 (511) ...+.++++|+++||+++||++.++|++|+|+++++++ +..+.|+++|+.|+|+.++.++++++++||+||+++|.+++ T Consensus 80 ~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded 159 (512) T protein:vir:97 80 RKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD 159 (512) T ss_pred ccccccCcceeecchHHHHHHHHhhhhcccCceeccCChHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCC Confidence 45677899999999999999999999999999997554 56688999999999999999999999999999999999999 Q ss_pred CceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecC--CcceEEEEEEEcCCcEEEEEEccCcccccccccccccccc Q lcl|NC_018086. 145 KKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDI--TGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDY 222 (511) Q Consensus 145 g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 222 (511) |++++++++|++++++||+...++++++||+|...... ..+.+.++++||++.+++|.....+... ..+... T Consensus 160 ~~~~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~------~~~~~~ 233 (512) T protein:vir:97 160 DETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLK------LTPREN 233 (512) T ss_pred CceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccc------cccccc Confidence 99999999999999999999988999999999865443 3356788999999999999876654321 223456 Q ss_pred cceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCcee--- Q lcl|NC_018086. 223 EVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVI--- 299 (511) Q Consensus 223 ~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i--- 299 (511) ...+|+||.||||+|+|+++|+|+|+++++|||+||.++|++++.++++++|+++++|....+..+.. .....+++ T Consensus 234 ~~~~~~~g~vPvv~~~nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~-~~~~~~~~~~~ 312 (512) T protein:vir:97 234 GFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVR-KQKEANVLFLE 312 (512) T ss_pred ccccccCcccceEeecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhhh-hhhhccccccc Confidence 77899999999999999999999999999999999999999999999999999999997654433322 22222222 Q ss_pred -----------eecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 300 -----------VTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKE 367 (511) Q Consensus 300 -----------~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~ 367 (511) ...++++++|++++.+.+++++++++|.+.|+.+|++|+++++++ ||+||+||++++++|.+||++++ T Consensus 313 ~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~ka~~k~ 392 (512) T protein:vir:97 313 PTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKE 392 (512) T ss_pred ccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHH Confidence 234567899999999999999999999999999999999999887 58999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCC-ccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHH Q lcl|NC_018086. 368 SKFRKVLAKRYELVCSYLEFMNKA-KDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEK 446 (511) Q Consensus 368 ~~~~~~l~~~~~li~~~~~~~~~~-~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~r 446 (511) +.|+.+|++++++|++++...+.. ...+..+++++|++++|.|.++.++++++++|++|.||++++||+++|+++|++| T Consensus 393 ~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl~giiS~et~~~~l~~v~d~~~E~er 472 (512) T protein:vir:97 393 GLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKK 472 (512) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHH Confidence 999999999999999998866543 3456678999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCC Q lcl|NC_018086. 447 ADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDP 494 (511) Q Consensus 447 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (511) |++|+++.++................ +++++..+.. .+.+ T Consensus 473 i~~E~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~---~~~~ 512 (512) T protein:vir:97 473 IEEDEKESIKKAQKGIYKDPRDINDD-----EQDDDTKDTV---DKKE 512 (512) T ss_pred HHHHHHHHHHHHhhcccCCCCCCCCC-----CCCCCccccc---cccC Confidence 99999887776543322211111110 0000000000 0000 No 12 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=100.00 E-value=6.5e-96 Score=542.45 Aligned_cols=441 Identities=17% Similarity=0.222 Sum_probs=379.4 Q ss_pred CCHHHHHH----HHHHHHHHHHHHHHHHHHhcCCCcccccCCc----------CccccccceeccchHHHHHHHHHhhhh Q lcl|NC_018086. 28 FDLRELIT----LAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF----------DDTNKPNSKIVHNFPKLLVDTSTAYLA 93 (511) Q Consensus 28 ~~~~~l~~----~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~----------~~~~~~~~ri~~n~~k~ivd~~~~~l~ 93 (511) ++.+.|.+ ++..|+.++++|+++++||.|+|+|+.+... .+..++++|+++||+++||++.++|+| T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 80 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVA 80 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhhee Confidence 44444444 4556778889999999999999998766432 234578899999999999999999999 Q ss_pred ccCceecC-chhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEE Q lcl|NC_018086. 94 GEPITESG-DEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAA 172 (511) Q Consensus 94 g~~~~~~~-d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~ 172 (511) |+|++++. +++..+.++++|. +++...+.+++++++++|+||+++|.+++|++++++++|.+++|+|++.+.+++.++ T Consensus 81 G~p~~~~~~d~~~~~~l~~~~~-~~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~~~~~~~p~~~~~v~d~~~~~~~~a~ 159 (470) T protein:vir:10 81 SVFPDIDVGKDADNKKIIDVLG-DDRALTLNGLLVDSSNAGRAWLHYWIDEDGNFRYGIIQPDQITPIYATTLDNKLLGI 159 (470) T ss_pred ccceeeecCchHHHHHHHHHHh-hhHHHHHHHHHHHHhhcCeeEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 99999974 4556788889887 467778889999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccc---------cccccccccceeccCCccceEeecCCccc Q lcl|NC_018086. 173 IYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIP---------EELEIKDYEVHPNLLQKFPVLEIIANEER 243 (511) Q Consensus 173 v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~g~iPvv~~~n~~~g 243 (511) ||+|...+..+++.+.++++||++.+++|.....+....... ........+..+|+||+||||+|+||++| T Consensus 160 ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g 239 (470) T protein:vir:10 160 LRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSKNKYR 239 (470) T ss_pred EEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccCCCeeeEEEeecCCCC Confidence 999998888878888899999999999998776554332221 11233446678999999999999999999 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecC-----CCceeeeecCCCHHH Q lcl|NC_018086. 244 LGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDE-----DGMVKFITKDVNDKH 318 (511) Q Consensus 244 ~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~-----~~~~~~~~~~~~~~~ 318 (511) +|+|+++++|||+||.++|++++.++++++|+++++|+.+++.+++..+++..+++.+++ +++++|++++.+.++ T Consensus 240 ~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~lt~~~~~~~ 319 (470) T protein:vir:10 240 LPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMNDLRKYKSIKINNTGNGDNSGVDKLQIDIPVEA 319 (470) T ss_pred CCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhhhhhhcCeEeccCCCCCcCceeEEEeecCChHH Confidence 999999999999999999999999999999999999999888888889999889888865 467899999999999 Q ss_pred HHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccc Q lcl|NC_018086. 319 IENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYE 398 (511) Q Consensus 319 ~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~ 398 (511) ++.++++|.++|+.+|++|++++..+||+||+||++++++|.+||+++++.|+++|++++++|+++++.. ..+..+ T Consensus 320 ~~~~~~~L~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~~----~~d~~~ 395 (470) T protein:vir:10 320 RDDALKITRKNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS----DADKRH 395 (470) T ss_pred HHHHHHHHHHHHHHHhCCCCCCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc----Ccccce Confidence 9999999999999999999999999999999999999999999999999999999999999999988653 346678 Q ss_pred eeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccc Q lcl|NC_018086. 399 VTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANK 478 (511) Q Consensus 399 i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (511) ++|+|++++|.|+++.++++++++|++|+||+++++|+++||++|++||++|+++..+... .+.+ .++.+...++ T Consensus 396 i~i~f~~~~p~d~~e~~~~~~~~~g~iS~et~l~~~p~v~D~~~E~eri~~E~~e~~~~~~-~~~~----~~~~~~dde~ 470 (470) T protein:vir:10 396 ISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDKEENDPYSN-QADE----LNGKGVNDEQ 470 (470) T ss_pred eeEEeccCCCCCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhc-cccc----cCCCCCCCCC Confidence 9999999999999999999999999999999999999999999999999999888655432 1111 1111111111 No 13 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=100.00 E-value=1.2e-95 Score=541.04 Aligned_cols=476 Identities=21% Similarity=0.310 Sum_probs=380.3 Q ss_pred CCCccc---hhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCcccccC-CcCccccccce Q lcl|NC_018086. 1 MAIPNG---QINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSR-SSSAYGVLYDYYKGNHIAIQSR-TFDDTNKPNSK 75 (511) Q Consensus 1 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~yY~G~~~~~~~~-~~~~~~~~~~r 75 (511) .-.-|+ ...+|.... ......+...+.++|.+++++|.. ++++|+++++||.|+|+++++. ......++++| T Consensus 13 ~~~~~~~~~~~~~n~~~~---~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~k 89 (511) T protein:vir:96 13 LRGNINYLFNDEANVVYT---YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR 89 (511) T ss_pred hhhhhhhhhhhhhCCccc---ccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcce Confidence 111111 111111111 111122234467889999999874 5689999999999999987554 44567788999 Q ss_pred eccchHHHHHHHHHhhhhccCceecCch-hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcc Q lcl|NC_018086. 76 IVHNFPKLLVDTSTAYLAGEPITESGDE-KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSP 154 (511) Q Consensus 76 i~~n~~k~ivd~~~~~l~g~~~~~~~d~-~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p 154 (511) +++||+++||++.++|++|+|+++++++ +..+.|+++|+.|+++.++.++++++++||+||+++|.+++|++++++++| T Consensus 90 i~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p 169 (511) T protein:vir:96 90 VAHDYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDA 169 (511) T ss_pred eecchHHHHHHHHhhhhcccCceeecCchHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcc Confidence 9999999999999999999999997654 567889999999999999999999999999999999999999999999999 Q ss_pred cceEEEecCCCCCceEEEEEEEEEeecC--CcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCcc Q lcl|NC_018086. 155 MNCLIAYSADLDEEPVAAIYYNTVISDI--TGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKF 232 (511) Q Consensus 155 ~~~~~v~d~~~~~~~~~~v~~~~~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~i 232 (511) ++++++|++...++++++||+|...... +.+.+.++++||++.+++|....+++.. .........+|+||.| T Consensus 170 ~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~------~~~~~~~~~~~~~g~v 243 (511) T protein:vir:96 170 MSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLK------LTPRENSFESHSFERM 243 (511) T ss_pred cceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccc------ccccccccccCcCccc Confidence 9999999999888999999999865433 3356778999999999999877655432 2234567789999999 Q ss_pred ceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceee------------ Q lcl|NC_018086. 233 PVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIV------------ 300 (511) Q Consensus 233 Pvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~------------ 300 (511) |||+|+|+.+|+|+|+++++|||+||.++|++++.++++++|+++++|.......+ .......+++. T Consensus 244 Pvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 322 (511) T protein:vir:96 244 PITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-VRKQKEANVLFLEPTVYVDAEGR 322 (511) T ss_pred ceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchh-hcccccccceeccccceeccccc Confidence 99999999999999999999999999999999999999999999999975443332 22222222222 Q ss_pred -ecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 301 -TDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRY 378 (511) Q Consensus 301 -~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~ 378 (511) ..++++++|++++.+.+++++++++|.+.|+.+|++|+++++++ ||+||+||++++++|..||.++++.|+.+|++++ T Consensus 323 ~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~ 402 (511) T protein:vir:96 323 ETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRA 402 (511) T ss_pred cCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23467899999999999999999999999999999999999887 6899999999999999999999999999999999 Q ss_pred HHHHHHHHhcCCC-ccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 379 ELVCSYLEFMNKA-KDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADI 457 (511) Q Consensus 379 ~li~~~~~~~~~~-~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~ 457 (511) ++|+.+++..+.. ...+..+++++|++++|.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++..+. T Consensus 403 ~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~ 482 (511) T protein:vir:96 403 KLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKK 482 (511) T ss_pred HHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Confidence 9999998876543 345667899999999999999999999999999999999999999999999999999998887776 Q ss_pred HHhhccccccCCCCCCccccccCCCCCCccccc Q lcl|NC_018086. 458 ALQNFKQTSAVQGASTAAANKLDKNPANTSTIT 490 (511) Q Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (511) +..............+. .++......... T Consensus 483 ~~~~~~~~~~~~~~~~~----~~~~~~~~~e~~ 511 (511) T protein:vir:96 483 AQKGIYKDPRDINDDEQ----DDDTKDTVDKKE 511 (511) T ss_pred HhhccccCCCCCCCCCC----CCCccCcccccC Confidence 54433222211111110 000000000000 No 14 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=100.00 E-value=1.2e-95 Score=541.04 Aligned_cols=476 Identities=21% Similarity=0.310 Sum_probs=380.3 Q ss_pred CCCccc---hhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCcccccC-CcCccccccce Q lcl|NC_018086. 1 MAIPNG---QINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSR-SSSAYGVLYDYYKGNHIAIQSR-TFDDTNKPNSK 75 (511) Q Consensus 1 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~yY~G~~~~~~~~-~~~~~~~~~~r 75 (511) .-.-|+ ...+|.... ......+...+.++|.+++++|.. ++++|+++++||.|+|+++++. ......++++| T Consensus 13 ~~~~~~~~~~~~~n~~~~---~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~k 89 (511) T protein:vir:78 13 LRGNINYLFNDEANVVYT---YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR 89 (511) T ss_pred hhhhhhhhhhhhhCCccc---ccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcce Confidence 111111 111111111 111122234467889999999874 5689999999999999987554 44567788999 Q ss_pred eccchHHHHHHHHHhhhhccCceecCch-hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcc Q lcl|NC_018086. 76 IVHNFPKLLVDTSTAYLAGEPITESGDE-KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSP 154 (511) Q Consensus 76 i~~n~~k~ivd~~~~~l~g~~~~~~~d~-~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p 154 (511) +++||+++||++.++|++|+|+++++++ +..+.|+++|+.|+++.++.++++++++||+||+++|.+++|++++++++| T Consensus 90 i~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p 169 (511) T protein:vir:78 90 VAHDYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDA 169 (511) T ss_pred eecchHHHHHHHHhhhhcccCceeecCchHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcc Confidence 9999999999999999999999997654 567889999999999999999999999999999999999999999999999 Q ss_pred cceEEEecCCCCCceEEEEEEEEEeecC--CcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCcc Q lcl|NC_018086. 155 MNCLIAYSADLDEEPVAAIYYNTVISDI--TGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKF 232 (511) Q Consensus 155 ~~~~~v~d~~~~~~~~~~v~~~~~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~i 232 (511) ++++++|++...++++++||+|...... +.+.+.++++||++.+++|....+++.. .........+|+||.| T Consensus 170 ~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~------~~~~~~~~~~~~~g~v 243 (511) T protein:vir:78 170 MSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLK------LTPRENSFESHSFERM 243 (511) T ss_pred cceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccc------ccccccccccCcCccc Confidence 9999999999888999999999865433 3356778999999999999877655432 2234567789999999 Q ss_pred ceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceee------------ Q lcl|NC_018086. 233 PVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIV------------ 300 (511) Q Consensus 233 Pvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~------------ 300 (511) |||+|+|+.+|+|+|+++++|||+||.++|++++.++++++|+++++|.......+ .......+++. T Consensus 244 Pvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 322 (511) T protein:vir:78 244 PITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-VRKQKEANVLFLEPTVYVDAEGR 322 (511) T ss_pred ceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchh-hcccccccceeccccceeccccc Confidence 99999999999999999999999999999999999999999999999975443332 22222222222 Q ss_pred -ecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 301 -TDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRY 378 (511) Q Consensus 301 -~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~ 378 (511) ..++++++|++++.+.+++++++++|.+.|+.+|++|+++++++ ||+||+||++++++|..||.++++.|+.+|++++ T Consensus 323 ~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~ 402 (511) T protein:vir:78 323 ETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRA 402 (511) T ss_pred cCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23467899999999999999999999999999999999999887 6899999999999999999999999999999999 Q ss_pred HHHHHHHHhcCCC-ccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 379 ELVCSYLEFMNKA-KDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADI 457 (511) Q Consensus 379 ~li~~~~~~~~~~-~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~ 457 (511) ++|+.+++..+.. ...+..+++++|++++|.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++..+. T Consensus 403 ~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~ 482 (511) T protein:vir:78 403 KLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKK 482 (511) T ss_pred HHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Confidence 9999998876543 345667899999999999999999999999999999999999999999999999999998887776 Q ss_pred HHhhccccccCCCCCCccccccCCCCCCccccc Q lcl|NC_018086. 458 ALQNFKQTSAVQGASTAAANKLDKNPANTSTIT 490 (511) Q Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (511) +..............+. .++......... T Consensus 483 ~~~~~~~~~~~~~~~~~----~~~~~~~~~e~~ 511 (511) T protein:vir:78 483 AQKGIYKDPRDINDDEQ----DDDTKDTVDKKE 511 (511) T ss_pred HhhccccCCCCCCCCCC----CCCccCcccccC Confidence 54433222211111110 000000000000 No 15 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=100.00 E-value=1e-95 Score=541.39 Aligned_cols=479 Identities=20% Similarity=0.303 Sum_probs=381.1 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCcccccC-CcCccccccceecc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSR-SSSAYGVLYDYYKGNHIAIQSR-TFDDTNKPNSKIVH 78 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~yY~G~~~~~~~~-~~~~~~~~~~ri~~ 78 (511) .-.-|++-++-..+....-.....+...+.++|.+++++|.. ++++|+++++||.|+|+++++. ......++++|+++ T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~ 92 (511) T protein:vir:99 13 LRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAH 92 (511) T ss_pred hhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeec Confidence 111111111111111111111112234467788999999864 5789999999999999987654 44566788999999 Q ss_pred chHHHHHHHHHhhhhccCceecCc-hhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccce Q lcl|NC_018086. 79 NFPKLLVDTSTAYLAGEPITESGD-EKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNC 157 (511) Q Consensus 79 n~~k~ivd~~~~~l~g~~~~~~~d-~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~ 157 (511) ||+++||++.++|++|+|++++++ ++..+.|+++|+.|+|+.++.+++++++++|+||+++|.+++|++++++++|.++ T Consensus 93 n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~ 172 (511) T protein:vir:99 93 DYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMST 172 (511) T ss_pred chHHHHHHHHHhhhcccCceeecCchHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccee Confidence 999999999999999999999755 4567889999999999999999999999999999999999999999999999999 Q ss_pred EEEecCCCCCceEEEEEEEEEeecC--CcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceE Q lcl|NC_018086. 158 LIAYSADLDEEPVAAIYYNTVISDI--TGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVL 235 (511) Q Consensus 158 ~~v~d~~~~~~~~~~v~~~~~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 235 (511) +++||+...++++++||+|...... +.+.+.++++|+++.+++|.....+.. ..........+|+||.|||| T Consensus 173 ~~vyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~------~~~~~~~~~~~~~~g~vPvv 246 (511) T protein:vir:99 173 FVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGL------KLTPRENGFESHSFERMPIT 246 (511) T ss_pred EEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCccc------cccccccccccCCCCccceE Confidence 9999999888999999999765433 335677899999999999987665432 12234567789999999999 Q ss_pred eecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceee-------------ec Q lcl|NC_018086. 236 EIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIV-------------TD 302 (511) Q Consensus 236 ~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~-------------~~ 302 (511) +|+|+++|+|+|+++++|||+||.++|++++.++++++|+++++|....+..+ ...+...+++. .. T Consensus 247 ~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (511) T protein:vir:99 247 EFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-VRKQKEANVLFLEPTVYADSEGRETE 325 (511) T ss_pred EecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchh-hcccccccceecccccccccccccCC Confidence 99999999999999999999999999999999999999999999975544332 22222222222 34 Q ss_pred CCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 303 EDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELV 381 (511) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li 381 (511) ++++++|++++.+.+++++++++|.+.|+.+|++|+++++++ ||+||+||++++++|..||.++++.|+.+|++++++| T Consensus 326 ~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li 405 (511) T protein:vir:99 326 GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLL 405 (511) T ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 567899999999999999999999999999999999999877 6899999999999999999999999999999999999 Q ss_pred HHHHHhcCCC-ccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 382 CSYLEFMNKA-KDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQ 460 (511) Q Consensus 382 ~~~~~~~~~~-~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~ 460 (511) +++++..+.. ...+..+++|+|++++|.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++.++..+. T Consensus 406 ~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl~GiiS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~ 485 (511) T protein:vir:99 406 ETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQK 485 (511) T ss_pred HHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhh Confidence 9999876543 345566899999999999999999999999999999999999999999999999999999887776654 Q ss_pred hccccccCCCCCCccccccCCCCCCccccc Q lcl|NC_018086. 461 NFKQTSAVQGASTAAANKLDKNPANTSTIT 490 (511) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (511) .........+..+.+.++ ..+..... T Consensus 486 ~~~~~~~~~~~~~~~~~~----~~~~d~~e 511 (511) T protein:vir:99 486 NMYQDPRNINDDEQDDST----KDSIDKKE 511 (511) T ss_pred cccccCCCCCCCCCCCCC----cCcccccC Confidence 433222221111111111 11111100 No 16 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=100.00 E-value=6.1e-96 Score=542.60 Aligned_cols=466 Identities=17% Similarity=0.195 Sum_probs=382.1 Q ss_pred CCCccchhhcccccCc-----hhhHhhh----hccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc----- Q lcl|NC_018086. 1 MAIPNGQINAGDIITT-----NIRRKHF----IRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF----- 66 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~-----~~~~~~~----~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~----- 66 (511) .|++.-+ -|.++.+ ...+..+ .+..+..+.|.+++.+|..++++|+++.+||+|+|++..++.+ T Consensus 10 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~ 87 (492) T protein:vir:97 10 VAQALIK--GGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATG 87 (492) T ss_pred HHHHHhc--CCceeeccchhhhhHhhhcccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccc Confidence 2222111 1333322 1222211 1234556678889999999999999999999999998766543 Q ss_pred -CccccccceeccchHHHHHHHHHhhhhccCceecCchhhH-HHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCC Q lcl|NC_018086. 67 -DDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDEKTI-KAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRN 144 (511) Q Consensus 67 -~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~-~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~ 144 (511) ..+.++++|+++||+++||++.++|++|+|++++++++.. +.++++ ..|+++..+.++++++++||+||+++|.+++ T Consensus 88 ~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~l~~~-~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~d 166 (492) T protein:vir:97 88 AVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEV-LGNRFDDKLHSVLTGASNKGIEWLHPYLDEE 166 (492) T ss_pred cccccccccccccchHHHHHHHHhhhhcccCceeccCchHHHHHHHHH-HhccHHHHHHHHHHHHhhcCeEEEEEEecCC Confidence 3556788999999999999999999999999998766544 555555 4689999999999999999999999999999 Q ss_pred CceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccc Q lcl|NC_018086. 145 KKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEV 224 (511) Q Consensus 145 g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (511) |++++++++|.+++++||+...+++.+++|+|.... ..++++|+++.+++|....+...... .....+..... T Consensus 167 g~~~~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~------~~~~~~y~~~~v~~~~~~~~~~~~~~-~~~~~~~~~~~ 239 (492) T protein:vir:97 167 GEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN------ETKVEYWDKVTVNYYVYENGSLIPDY-SNNLENSKTHF 239 (492) T ss_pred CceEEEEEcccceEEEEcCCCCCceEEEEEEEeecc------ceeEEEEecCeEEEEEEecCeeeecc-ccccccccccc Confidence 999999999999999999988889999999997542 23578999999999987766543222 22334556778 Q ss_pred eeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCC Q lcl|NC_018086. 225 HPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDED 304 (511) Q Consensus 225 ~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~ 304 (511) .+|+||.||||+|+|+++|+|+|+++++|+|+||+++|++++.++++++|+++++|++.++.+++..+++..+++.++++ T Consensus 240 ~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 319 (492) T protein:vir:97 240 STGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRLLRYYGAIKVSDN 319 (492) T ss_pred ccCCCCCcceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhHHHHHhhccceecCCC Confidence 89999999999999999999999999999999999999999999999999999999988888888888999999999999 Q ss_pred CceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 305 GMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCS 383 (511) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 383 (511) ++++|++++.+.+++++++++|+++|+.+|++|+++++.+ +++||+||++++++|..||.++++.|+.+|++++++|+. T Consensus 320 ~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~ 399 (492) T protein:vir:97 320 GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFE 399 (492) T ss_pred CcceeEeccCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999887 579999999999999999999999999999999999999 Q ss_pred HHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_018086. 384 YLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFK 463 (511) Q Consensus 384 ~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~ 463 (511) +++.. .+..+++|+|++++|.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++..+.... .. T Consensus 400 ~~~~~-----~~~~~i~v~f~~~~p~~~~e~a~~~~kl~G~iS~et~l~~l~~v~d~~~Eleri~~E~~~~~~~~~~-~~ 473 (492) T protein:vir:97 400 HFDIK-----GEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQTEYNKQLPN-LD 473 (492) T ss_pred HhcCC-----cccceeeEEecCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhc-cc Confidence 87643 2456789999999999999999999999999999999999999999999999999998866553221 11 Q ss_pred ccccCCCCCCccccccCCCCCCccccccCCC Q lcl|NC_018086. 464 QTSAVQGASTAAANKLDKNPANTSTITTTDP 494 (511) Q Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (511) +...+.+. .++.++.... + T Consensus 474 ------~~~~~~~~-~~~~~~~~~~-----e 492 (492) T protein:vir:97 474 ------DGGADSAQ-QQERSNNKES-----E 492 (492) T ss_pred ------cCCCCCCc-cccccccccc-----C Confidence 00000000 0111100000 0 No 17 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=100.00 E-value=2.6e-95 Score=539.13 Aligned_cols=479 Identities=20% Similarity=0.295 Sum_probs=380.2 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCcccccC-CcCccccccceecc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSR-SSSAYGVLYDYYKGNHIAIQSR-TFDDTNKPNSKIVH 78 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~yY~G~~~~~~~~-~~~~~~~~~~ri~~ 78 (511) .-.-|++-++-..+....-.....+...+.++|.+++++|.. +++||+++++||.|+|+++++. ......++++|+++ T Consensus 13 ~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~ 92 (511) T protein:vir:96 13 LRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAH 92 (511) T ss_pred hhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcceeec Confidence 111111111111111111111112234567889999999864 6789999999999999987654 34566788999999 Q ss_pred chHHHHHHHHHhhhhccCceecCc-hhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccce Q lcl|NC_018086. 79 NFPKLLVDTSTAYLAGEPITESGD-EKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNC 157 (511) Q Consensus 79 n~~k~ivd~~~~~l~g~~~~~~~d-~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~ 157 (511) ||+++||++.++|++|+|++++++ ++..+.|+++|+.|+|+.++.+++++++++|+||+++|.+++|++++++++|.++ T Consensus 93 n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~ 172 (511) T protein:vir:96 93 DYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMST 172 (511) T ss_pred chHHHHHHHHHhhhccCCceeecCchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccee Confidence 999999999999999999999755 4567889999999999999999999999999999999999999999999999999 Q ss_pred EEEecCCCCCceEEEEEEEEEeecC--CcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceE Q lcl|NC_018086. 158 LIAYSADLDEEPVAAIYYNTVISDI--TGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVL 235 (511) Q Consensus 158 ~~v~d~~~~~~~~~~v~~~~~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 235 (511) +++|++....+++++||+|...... ..+.+.++++||++.+++|.....++.. .........+|+||.|||| T Consensus 173 ~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~------~~~~~~~~~~~~~~~vPvv 246 (511) T protein:vir:96 173 FVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLK------LTPRENGFESHSFERMPIT 246 (511) T ss_pred EEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccc------ccccccccccccCCceeeE Confidence 9999999888999999999765433 3356778999999999999877655432 2334567789999999999 Q ss_pred eecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceee-------------ec Q lcl|NC_018086. 236 EIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIV-------------TD 302 (511) Q Consensus 236 ~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~-------------~~ 302 (511) +|+|+++|+|+|+++++|||+||.++|++++.++++++|+++++|.......+.. .....+++. .. T Consensus 247 ~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 325 (511) T protein:vir:96 247 EFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVR-KQKEANVLFLEPTVYADSEGRETE 325 (511) T ss_pred EecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhc-ccccccceecccccccccccccCC Confidence 9999999999999999999999999999999999999999999997654433322 222222222 34 Q ss_pred CCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 303 EDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELV 381 (511) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li 381 (511) ++++++|++++.+.+++++++++|.+.|+.+|++|+++++++ |++||+||+++++++.++|.++++.|+.+|++++++| T Consensus 326 ~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li 405 (511) T protein:vir:96 326 GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLL 405 (511) T ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 467899999999999999999999999999999999999887 5899999999999999999999999999999999999 Q ss_pred HHHHHhcCCC-ccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 382 CSYLEFMNKA-KDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQ 460 (511) Q Consensus 382 ~~~~~~~~~~-~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~ 460 (511) +.+++..+.. ...+..+++++|++++|.|.++.++++++++|++|+||+++++|+++||++|++||++|+++..+.... T Consensus 406 ~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~G~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~ 485 (511) T protein:vir:96 406 ETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQK 485 (511) T ss_pred HHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhh Confidence 9998876533 345667899999999999999999999999999999999999999999999999999998877666544 Q ss_pred hccccccCCCCCCccccccCCCCCCccccc Q lcl|NC_018086. 461 NFKQTSAVQGASTAAANKLDKNPANTSTIT 490 (511) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (511) .........+..+.+ ++.......+. T Consensus 486 ~~~~~~~~~~~~~~~----~~~~~~~~~~~ 511 (511) T protein:vir:96 486 GIYKDPRDINDDEQD----DDTKDTVDKKE 511 (511) T ss_pred ccccCCCCCCCCCCC----CcccccccccC Confidence 332211111110000 00000000000 No 18 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=100.00 E-value=2.4e-95 Score=539.37 Aligned_cols=472 Identities=22% Similarity=0.332 Sum_probs=383.7 Q ss_pred CCCccchhhcccc------c--CchhhH--hhhhc-cCCCHHHHHHHHHHHHHH-HHHHHHHHHHhcCCCc-ccccCCcC Q lcl|NC_018086. 1 MAIPNGQINAGDI------I--TTNIRR--KHFIR-RNFDLRELITLAEMHSRS-SSAYGVLYDYYKGNHI-AIQSRTFD 67 (511) Q Consensus 1 ~~~~~~~~~~~~~------~--~~~~~~--~~~~~-~~~~~~~l~~~~~~~~~~-~~~~~~~~~yY~G~~~-~~~~~~~~ 67 (511) |-+-.+=.=.+.. . +.+..+ ..+.+ ...+.+.|.++|.+|..+ ++||+++.+||.|+|+ +..+.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~~ 80 (501) T protein:vir:27 1 MEQTLFTDSTGQDLVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFGRRK 80 (501) T ss_pred CCceeEEeccchhhhhhcccChhHHHhhccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCccC Confidence 5544441111111 0 001111 11222 223356688999999754 6899999999999864 55556667 Q ss_pred ccccccceeccchHHHHHHHHHhhhhccCceecCch-----hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeC Q lcl|NC_018086. 68 DTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDE-----KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWID 142 (511) Q Consensus 68 ~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~-----~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~ 142 (511) +..++++|+++||+++||++.++|++|+|+++++++ ...+.++++|+.|+|+..+.+++++++++|+||+++|.+ T Consensus 81 ~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d 160 (501) T protein:vir:27 81 DREMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNDNNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRN 160 (501) T ss_pred ccccccceeccchHHHHHHHHhhhhcccCeeEecCCccchHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeC Confidence 788899999999999999999999999999987653 245668899999999999999999999999999999999 Q ss_pred CCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccc Q lcl|NC_018086. 143 RNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDY 222 (511) Q Consensus 143 ~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 222 (511) ++|++++++++|.+++++||+...++++++||+|......+ .+.++++||++.+++|...++ +.+. T Consensus 161 ed~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~--~~~~~~vyt~~~v~~~~~~~~------------~~~~ 226 (501) T protein:vir:27 161 EYDETRIKRLNPLETFVIYDNSLEDNSIAAVRYYNRGTLQN--AKDVVEIYTNEHIYTLDASDD------------FNEI 226 (501) T ss_pred CCCceEEEEEccceeEEEecCCCCCceEEEEEEEEeeecCC--cEEEEEEEeCCeEEEEEeCCc------------eeec Confidence 99999999999999999999998889999999998665433 467889999999999875533 3456 Q ss_pred cceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeec Q lcl|NC_018086. 223 EVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTD 302 (511) Q Consensus 223 ~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~ 302 (511) ...+|+||.||||+|+|+++|+|+|+++++|||+||+++|++++.++++++|+++++|...+..+++...++..+++.+. T Consensus 227 ~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~ 306 (501) T protein:vir:27 227 SVTTHAFGTVPITEFLNNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLK 306 (501) T ss_pred cccccCCCcccEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhhhhcCceeec Confidence 77899999999999999999999999999999999999999999999999999999998887777777788877777654 Q ss_pred C---------CCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 303 E---------DGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRK 372 (511) Q Consensus 303 ~---------~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~ 372 (511) . +++++|++++.+.+++++++++|.++|+.+|++|+++++++ ||+||+||++++++|.+||..+++.|+. T Consensus 307 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~ 386 (501) T protein:vir:27 307 PPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTQ 386 (501) T ss_pred ccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3 45789999999999999999999999999999999999887 6899999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHH Q lcl|NC_018086. 373 VLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQ 452 (511) Q Consensus 373 ~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~ 452 (511) +|++++++|+++++..+...+++..+++|+|++++|.|.++.++++++++|++|++|+++++|+++||++|++||++|++ T Consensus 387 ~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl~g~iS~et~l~~l~~v~D~~~E~eri~~E~~ 466 (501) T protein:vir:27 387 GLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLGGQVSQETALSLSGLVESPNEELDKINKEVS 466 (501) T ss_pred HHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHH Confidence 99999999999999887777788888999999999999999999999999999999999999999999999999999876 Q ss_pred HHHHH-HHhhccccccCCCCCCccccccCCCCCCccccccC Q lcl|NC_018086. 453 KRADI-ALQNFKQTSAVQGASTAAANKLDKNPANTSTITTT 492 (511) Q Consensus 453 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (511) ..... ....++...+.. .+++....+.+...++. T Consensus 467 e~~~~~~~~~~~~~~~~~------~d~~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 467 EIDFKGYSNDFNEHVGKY------TDEVKETHTDDFERAYE 501 (501) T ss_pred hhhHhhhcCccccccccc------cCCCCCCccccccccCC Confidence 53322 222222211111 11111111111111111 No 19 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=100.00 E-value=3e-95 Score=538.83 Aligned_cols=473 Identities=22% Similarity=0.337 Sum_probs=379.8 Q ss_pred CC-Cccc------hhhcccccCc----hhhHhhhhc-cCCCHHHHHHHHHHHHHH-HHHHHHHHHHhcCCC-cccccCCc Q lcl|NC_018086. 1 MA-IPNG------QINAGDIITT----NIRRKHFIR-RNFDLRELITLAEMHSRS-SSAYGVLYDYYKGNH-IAIQSRTF 66 (511) Q Consensus 1 ~~-~~~~------~~~~~~~~~~----~~~~~~~~~-~~~~~~~l~~~~~~~~~~-~~~~~~~~~yY~G~~-~~~~~~~~ 66 (511) |+ ..-| ..+-+..... ......+.+ ...+.+.|.++|++|..+ ++||+++.+||.|+| .+..++.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~~ 80 (502) T protein:vir:48 1 MMEQTLFTDSTGQDLVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRR 80 (502) T ss_pred CceeEEEEecchhHHHhhcccChhHHhhhcccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccc Confidence 21 1111 0000000000 001111111 223356688899999754 689999999999985 56666666 Q ss_pred CccccccceeccchHHHHHHHHHhhhhccCceecCch-----hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeee Q lcl|NC_018086. 67 DDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDE-----KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWI 141 (511) Q Consensus 67 ~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~-----~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~ 141 (511) .+..++++|+++||+++||++.++|++|+|+++++++ ...+.|+++|+.|+|+.++.++++++++||+||+++|. T Consensus 81 ~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ 160 (502) T protein:vir:48 81 KDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNEDNSQNDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYR 160 (502) T ss_pred cccccccceeecchHHHHHHHHhhhhcccCeeEecCCccchhHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEe Confidence 7778899999999999999999999999999997643 24556899999999999999999999999999999999 Q ss_pred CCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccccccccccc Q lcl|NC_018086. 142 DRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKD 221 (511) Q Consensus 142 ~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 221 (511) +++|++++++++|.+++++|++...++++++||+|......+ .+.++++||++.+++|...++ +.. T Consensus 161 dedg~~~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~--~~~~~~iyt~~~i~~~~~~~~------------~~~ 226 (502) T protein:vir:48 161 SEYDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQN--AKDVVEIYTNQHIYTLDASDS------------FNE 226 (502) T ss_pred CCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEEeecCC--cEEEEEEEeCCeEEEEEeCCc------------eee Confidence 999999999999999999999988889999999998765443 456789999999999875432 345 Q ss_pred ccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeee Q lcl|NC_018086. 222 YEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVT 301 (511) Q Consensus 222 ~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~ 301 (511) ....+|+||.||||+|+|+++|+|+|+++++|||+||+++|++++.++++++|+++++|.......+....++..+++.+ T Consensus 227 ~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~ 306 (502) T protein:vir:48 227 ISVTPHAFGTVPITEFLNNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQASDMKRTRLMQL 306 (502) T ss_pred ccceecCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchhhhhhcceeec Confidence 67789999999999999999999999999999999999999999999999999999999877666667777777777765 Q ss_pred c---------CCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 302 D---------EDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFR 371 (511) Q Consensus 302 ~---------~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~ 371 (511) . ++++++|++++.+.+++++++++|.++|+.+|++|+++++++ ||+||+||++++++|.+|+.++++.|+ T Consensus 307 ~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~ 386 (502) T protein:vir:48 307 KPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLFGLDQDRVDTQSQFT 386 (502) T ss_pred cccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHHHHHHHHHHHHHHHH Confidence 3 356899999999999999999999999999999999999887 689999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHH Q lcl|NC_018086. 372 KVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQR 451 (511) Q Consensus 372 ~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~ 451 (511) .+|++++++|+++++..+...+++..+++|+|++++|.|.++.++++++++|++|++|+++++|+++|+++|++||++|+ T Consensus 387 ~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~l~~l~~v~D~~~E~~ri~~E~ 466 (502) T protein:vir:48 387 QGLKRRYRLAARIGSLVNEFKDFDESRLKITFTPNLPKSLYEQVSILNDLGGQVSQETALSLSGLVENPTEELDKINEES 466 (502) T ss_pred HHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHH Confidence 99999999999999988777778888899999999999999999999999999999999999999999999999999987 Q ss_pred HHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCC Q lcl|NC_018086. 452 QKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQ 505 (511) Q Consensus 452 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (511) ++................+.++. .+ +++. .+++-.+ T Consensus 467 ~~~~~~~~~~~~~~~~~~~~d~~-~e----~~~~-------------~~~~~~~ 502 (502) T protein:vir:48 467 SKIDFKGYPSYFYDNVGKYTDEV-KE----THTD-------------DFERVYE 502 (502) T ss_pred HhhhhhcccccccccccccCCCc-cC----CCCc-------------CcCCCCC Confidence 65332222211111111111111 00 1100 1111111 No 20 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=100.00 E-value=2.3e-95 Score=539.43 Aligned_cols=438 Identities=19% Similarity=0.265 Sum_probs=378.2 Q ss_pred CCH----HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc--------------CccccccceeccchHHHHHHHHH Q lcl|NC_018086. 28 FDL----RELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF--------------DDTNKPNSKIVHNFPKLLVDTST 89 (511) Q Consensus 28 ~~~----~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~--------------~~~~~~~~ri~~n~~k~ivd~~~ 89 (511) ++. +.|.+++.+|..++++|+++++||.|+|+++.+... ....++++|+++||+++||++.+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 333 446667778888999999999999999998865432 12345788999999999999999 Q ss_pred hhhhccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC-CCceEEEEEcccceEEEecCCCCCc Q lcl|NC_018086. 90 AYLAGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR-NKKHRFKAVSPMNCLIAYSADLDEE 168 (511) Q Consensus 90 ~~l~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~-~g~~~i~~~~p~~~~~v~d~~~~~~ 168 (511) +|++|+|++++++++......+.|..|+|+..+.++++.++++|+||+++|.++ +|+++++.++|.+++++|++....+ T Consensus 81 ~yl~G~p~~~~~~~~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~~~~~~~~p~~~~~i~d~~~~~~ 160 (471) T protein:vir:10 81 AYALTYPPTFDVDDKKVNDMIVDVLGDDYERISKQLCVNAGNAGIAWLHVWKDASDNSFRYACVDSKEVIPIYSKSLDKK 160 (471) T ss_pred hhhcccCceeccCChHHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEEEeeCCCCeeEEEEEcccceEEEEcCCCCCc Confidence 999999999998776554444444569999999999999999999999999985 6999999999999999999998889 Q ss_pred eEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccc---------ccccccccccceeccCCccceEeecC Q lcl|NC_018086. 169 PVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREI---------PEELEIKDYEVHPNLLQKFPVLEIIA 239 (511) Q Consensus 169 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~g~iPvv~~~n 239 (511) +++++|+|......+++.+.++++|+++.+++|....++...... .....+......+|+||+||||+|+| T Consensus 161 ~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n 240 (471) T protein:vir:10 161 SIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFKN 240 (471) T ss_pred eEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCCceeEEEecc Confidence 999999999888888889999999999999999877654322211 11234566778899999999999999 Q ss_pred CcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCC-----CceeeeecCC Q lcl|NC_018086. 240 NEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDED-----GMVKFITKDV 314 (511) Q Consensus 240 ~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~-----~~~~~~~~~~ 314 (511) +..|+|+|+++++|||+||.++|++++.++++++|+++++|++++..+++..+++.++++.++++ ++++|++++. T Consensus 241 ~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~ 320 (471) T protein:vir:10 241 NEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLEDLKRYKMIKMDNDGMGDQSGVTTIAIDI 320 (471) T ss_pred CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhHHHhhcCCeEEecCCCCccCccceEEeecC Confidence 99999999999999999999999999999999999999999988888888899999999988643 5899999999 Q ss_pred CHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc Q lcl|NC_018086. 315 NDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDL 394 (511) Q Consensus 315 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~ 394 (511) +.++++.++++|.++|+.+|++|+++++.+||+||+||++++++|.+||.++++.|+.+|++++++|+.+++.. T Consensus 321 ~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~------ 394 (471) T protein:vir:10 321 PTEARNLILERTKKQIFISGQGVNPETDKLGNSSGVALKFLYSLLELKAGNMETQFRSGYATLVKMILKHLGLS------ 394 (471) T ss_pred ChHHHHHHHHHHHHHHHHHhCCcCCCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC------ Confidence 99999999999999999999999999999999999999999999999999999999999999999999988643 Q ss_pred cccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCc Q lcl|NC_018086. 395 KPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTA 474 (511) Q Consensus 395 ~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 474 (511) +..+++|.|++++|.|+++.++++++++|++|+||+++++|+++|+++|++||++|+++..+.. ....+..+..+. T Consensus 395 d~~~i~i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~~p~v~D~~~E~eri~~E~~~~~~~~----~~~~~~~~~~e~ 470 (471) T protein:vir:10 395 DKLKIKQTWTRNSINNDTEMAQVVSTLATITSRENVAKSNPIVEDWQDELRLQKAEQEGRSEKL----YDMEEVEHESEV 470 (471) T ss_pred CCceeEEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcc----cccCCCCCcccc Confidence 3457899999999999999999999999999999999999999999999999999987765432 121222111111 Q ss_pred c Q lcl|NC_018086. 475 A 475 (511) Q Consensus 475 ~ 475 (511) + T Consensus 471 ~ 471 (471) T protein:vir:10 471 E 471 (471) T ss_pred C Confidence 1 No 21 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=100.00 E-value=6.6e-95 Score=536.92 Aligned_cols=463 Identities=17% Similarity=0.202 Sum_probs=383.5 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc------Cccccccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF------DDTNKPNS 74 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~------~~~~~~~~ 74 (511) +-|++++..+.. -..+...+..++.+.|.+++..|..++++|+++.+||.|+|+++.+... .+..++++ T Consensus 5 ~~~~~~~~~~~~-----~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ 79 (474) T protein:vir:94 5 IRMPWDKPYGEE-----VVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDW 79 (474) T ss_pred ccccCCCchhhH-----HHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcc Confidence 333333332111 1222223345667889999999999999999999999999998765432 35667889 Q ss_pred eeccchHHHHHHHHHhhhhccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcc Q lcl|NC_018086. 75 KIVHNFPKLLVDTSTAYLAGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSP 154 (511) Q Consensus 75 ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p 154 (511) |+++||+++||++.++|+||+|++++++++....+.+.|.+|+|+..+.+++++++++|+||+++|.+++|++++++++| T Consensus 80 ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p 159 (474) T protein:vir:94 80 RITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPA 159 (474) T ss_pred eeecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcc Confidence 99999999999999999999999999887766555555667899999999999999999999999999999999999999 Q ss_pred cceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccce Q lcl|NC_018086. 155 MNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPV 234 (511) Q Consensus 155 ~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 234 (511) .+++++||+...+++.+++|+|... ...++++||++.+++|...+++.... ......+......+|++|+||| T Consensus 160 ~~~~~v~d~~~~~~~~~~ir~~~~~------~~~~~~~yt~~~~~~y~~~~~~~~~~-~~~~~~~~~~~~~~~~~g~vPv 232 (474) T protein:vir:94 160 EQAIPIWVDKEREELKSFIRYYKFN------NEEKVEFWTDTTVTYYVLENGGLIPD-YYYGANHVQSHFSNGNWGRVPF 232 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEEec------CeEEEEEEeCCeEEEEEEcCCccccc-cccCcCcccccccccCCCccce Confidence 9999999998889999999998743 23478999999999998877654332 2223344556778999999999 Q ss_pred EeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCCCceeeeecCC Q lcl|NC_018086. 235 LEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDEDGMVKFITKDV 314 (511) Q Consensus 235 v~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 314 (511) |+|+|+++|+|+|++|++|||+||+++|++++.++++++|+++++|+++++.+++..+++..+++.++++++++|++++. T Consensus 233 v~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~ 312 (474) T protein:vir:94 233 IAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKAINVDGDGGVETIQVEV 312 (474) T ss_pred EEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCCceeEEeecC Confidence 99999999999999999999999999999999999999999999999988888888888899999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc Q lcl|NC_018086. 315 NDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKD 393 (511) Q Consensus 315 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~ 393 (511) +.+++++++++|.+.|+.+|++|+++++++ |++||+||+++++++.+||.++++.|+.+|++++++|+++++.. T Consensus 313 ~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~----- 387 (474) T protein:vir:94 313 PVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLK----- 387 (474) T ss_pred CHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC----- Confidence 999999999999999999999999998877 57999999999999999999999999999999999999887542 Q ss_pred ccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCC Q lcl|NC_018086. 394 LKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGAST 473 (511) Q Consensus 394 ~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (511) .+..+++|+|++++|.|+++.|++++++ |++|++|+++++|+++|+++|++||++|+++.++... .... .+..+ + T Consensus 388 ~d~~~i~v~f~~~~p~~~~e~a~~~~~~-g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~-~~~~-~~~~~--~ 462 (474) T protein:vir:94 388 TDVKDIEISFNFNRMMNDAEQSQIIAQS-QYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLP-NLDD-GGADG--A 462 (474) T ss_pred cccceeeEEeccCcccCHHHHHHHHHHc-CCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcc-ccCC-CCCCC--c Confidence 4566789999999999999999999886 8999999999999999999999999999877654321 1110 01100 0 Q ss_pred ccccccCCCCCC Q lcl|NC_018086. 474 AAANKLDKNPAN 485 (511) Q Consensus 474 ~~~~~~~~~~~~ 485 (511) .+.+++++..+. T Consensus 463 ~~~~~~~~~~~e 474 (474) T protein:vir:94 463 QQQEGSNNKESE 474 (474) T ss_pred ccCCCCcccccC Confidence 001111111111 No 22 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=100.00 E-value=6.6e-95 Score=536.92 Aligned_cols=463 Identities=17% Similarity=0.202 Sum_probs=383.5 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc------Cccccccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF------DDTNKPNS 74 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~------~~~~~~~~ 74 (511) +-|++++..+.. -..+...+..++.+.|.+++..|..++++|+++.+||.|+|+++.+... .+..++++ T Consensus 5 ~~~~~~~~~~~~-----~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ 79 (474) T protein:vir:97 5 IRMPWDKPYGEE-----VVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDW 79 (474) T ss_pred ccccCCCchhhH-----HHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcc Confidence 333333332111 1222223345667889999999999999999999999999998765432 35667889 Q ss_pred eeccchHHHHHHHHHhhhhccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcc Q lcl|NC_018086. 75 KIVHNFPKLLVDTSTAYLAGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSP 154 (511) Q Consensus 75 ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p 154 (511) |+++||+++||++.++|+||+|++++++++....+.+.|.+|+|+..+.+++++++++|+||+++|.+++|++++++++| T Consensus 80 ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p 159 (474) T protein:vir:97 80 RITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPA 159 (474) T ss_pred eeecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcc Confidence 99999999999999999999999999887766555555667899999999999999999999999999999999999999 Q ss_pred cceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccce Q lcl|NC_018086. 155 MNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPV 234 (511) Q Consensus 155 ~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 234 (511) .+++++||+...+++.+++|+|... ...++++||++.+++|...+++.... ......+......+|++|+||| T Consensus 160 ~~~~~v~d~~~~~~~~~~ir~~~~~------~~~~~~~yt~~~~~~y~~~~~~~~~~-~~~~~~~~~~~~~~~~~g~vPv 232 (474) T protein:vir:97 160 EQAIPIWVDKEREELKSFIRYYKFN------NEEKVEFWTDTTVTYYVLENGGLIPD-YYYGANHVQSHFSNGNWGRVPF 232 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEEec------CeEEEEEEeCCeEEEEEEcCCccccc-cccCcCcccccccccCCCccce Confidence 9999999998889999999998743 23478999999999998877654332 2223344556778999999999 Q ss_pred EeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCCCceeeeecCC Q lcl|NC_018086. 235 LEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDEDGMVKFITKDV 314 (511) Q Consensus 235 v~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 314 (511) |+|+|+++|+|+|++|++|||+||+++|++++.++++++|+++++|+++++.+++..+++..+++.++++++++|++++. T Consensus 233 v~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~ 312 (474) T protein:vir:97 233 IAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKAINVDGDGGVETIQVEV 312 (474) T ss_pred EEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCCceeEEeecC Confidence 99999999999999999999999999999999999999999999999988888888888899999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc Q lcl|NC_018086. 315 NDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKD 393 (511) Q Consensus 315 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~ 393 (511) +.+++++++++|.+.|+.+|++|+++++++ |++||+||+++++++.+||.++++.|+.+|++++++|+++++.. T Consensus 313 ~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~----- 387 (474) T protein:vir:97 313 PVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLK----- 387 (474) T ss_pred CHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC----- Confidence 999999999999999999999999998877 57999999999999999999999999999999999999887542 Q ss_pred ccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCC Q lcl|NC_018086. 394 LKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGAST 473 (511) Q Consensus 394 ~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (511) .+..+++|+|++++|.|+++.|++++++ |++|++|+++++|+++|+++|++||++|+++.++... .... .+..+ + T Consensus 388 ~d~~~i~v~f~~~~p~~~~e~a~~~~~~-g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~-~~~~-~~~~~--~ 462 (474) T protein:vir:97 388 TDVKDIEISFNFNRMMNDAEQSQIIAQS-QYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLP-NLDD-GGADG--A 462 (474) T ss_pred cccceeeEEeccCcccCHHHHHHHHHHc-CCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcc-ccCC-CCCCC--c Confidence 4566789999999999999999999886 8999999999999999999999999999877654321 1110 01100 0 Q ss_pred ccccccCCCCCC Q lcl|NC_018086. 474 AAANKLDKNPAN 485 (511) Q Consensus 474 ~~~~~~~~~~~~ 485 (511) .+.+++++..+. T Consensus 463 ~~~~~~~~~~~e 474 (474) T protein:vir:97 463 QQQEGSNNKESE 474 (474) T ss_pred ccCCCCcccccC Confidence 001111111111 No 23 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=100.00 E-value=9.8e-95 Score=535.98 Aligned_cols=441 Identities=25% Similarity=0.367 Sum_probs=376.8 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccch Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNF 80 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~ 80 (511) |-| .+..-+....+++++.+.|.+++++|..++++|+++.+||+|+|++.++. ...+.++++|+++|| T Consensus 1 ~~~-----------~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~-~~~~~~~~~ki~~n~ 68 (453) T protein:vir:73 1 MNL-----------KPIKLMTYSRDEEITDKVVNDFMKKHQEEVERYEYLGNMYKGIMEISSQK-AKDSWKPDNRLTNNF 68 (453) T ss_pred Ccc-----------ccceeeeccccccCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCC-CCCccCccceeecch Confidence 211 22222333456788889999999999999999999999999999987644 456778999999999 Q ss_pred HHHHHHHHHhhhhccCceecCch-hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEE Q lcl|NC_018086. 81 PKLLVDTSTAYLAGEPITESGDE-KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLI 159 (511) Q Consensus 81 ~k~ivd~~~~~l~g~~~~~~~d~-~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~ 159 (511) +++||++.++|++|+|+++++++ +..+.++++|+.|+|+..+.++++++++||+||+++|.+++|.+++++++|.++++ T Consensus 69 ~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~ 148 (453) T protein:vir:73 69 AKYIVDTFVGYFNGIPIKKTHDDKSVLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNESTESEVIYCSPLNVFM 148 (453) T ss_pred HHHHHHHhhhhhcccCceeecCChHHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEE Confidence 99999999999999999998655 45678999999999999999999999999999999999999999999999999999 Q ss_pred EecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecC Q lcl|NC_018086. 160 AYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIA 239 (511) Q Consensus 160 v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n 239 (511) +|++..++.++++++++... ....++++||++.+++|..... .+...+..+|+||.||||+|+| T Consensus 149 v~dd~~~~~~~~~i~~~~~~-----~~~~~~~vyt~~~i~~~~~~~~-----------~~~~~~~~~~~~g~vPvv~~~n 212 (453) T protein:vir:73 149 VYDDSIKQKPLFAVYYGFDE-----EGNLSGTVYTLLETISITGKAG-----------EVKFGESTYNVYSDLPIVEYNF 212 (453) T ss_pred EEeCCCCceeEEEEEEEEec-----CceEEEEEEeCCeEEEEEecCC-----------ceEEccceeccCCceeEEEecC Confidence 99999888899998877532 2345789999999999886543 3445677899999999999999 Q ss_pred CcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceee-----------ecCCCcee Q lcl|NC_018086. 240 NEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIV-----------TDEDGMVK 308 (511) Q Consensus 240 ~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~-----------~~~~~~~~ 308 (511) +++|+|+|+++++|+|+||+++|++++.++++++|+++++|...++ +...+++..+++. .+.+++++ T Consensus 213 ~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 290 (453) T protein:vir:73 213 NEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGAEVDE--EDAKNIKDNRLINFFDKNSNGQGTNAAKVDVK 290 (453) T ss_pred CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCc--hhhhcccccccccccccccccccccccCceeE Confidence 9999999999999999999999999999999999999999986553 2333343333333 23467799 Q ss_pred eeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_018086. 309 FITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFM 388 (511) Q Consensus 309 ~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~ 388 (511) |++++.+.+++++++++|.+.|+.+|++|+++++.+||+||+||++++++|..||+++++.|+.+|++++++|+.+++.. T Consensus 291 ~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~ 370 (453) T protein:vir:73 291 FLDKPDSDVQTENLLNRLERSIFQFTMAANISDENFGNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNA 370 (453) T ss_pred EeeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999988765 Q ss_pred CCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccC Q lcl|NC_018086. 389 NKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAV 468 (511) Q Consensus 389 ~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 468 (511) +. ..+..+++|+|++++|.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++.++........+... T Consensus 371 ~~--~~~~~~i~v~f~~~~p~~~~~~a~~~~k~~giis~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 448 (453) T protein:vir:73 371 SN--KDAWKDIEYTFTRNEPKDIKEQAETANILKGITSEETALSVISVIPDVQAEMEKIKKKKLLQLSLTRTSNLVRMKQ 448 (453) T ss_pred CC--ccccccceEEeCCCCCCCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCcchh Confidence 44 4456689999999999999999999999999999999999999999999999999999998877655433222221 Q ss_pred CCCCC Q lcl|NC_018086. 469 QGAST 473 (511) Q Consensus 469 ~~~~~ 473 (511) .-++- T Consensus 449 ~~~~~ 453 (453) T protein:vir:73 449 MRGNL 453 (453) T ss_pred hhcCC Confidence 11111 No 24 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=100.00 E-value=9.7e-95 Score=536.01 Aligned_cols=462 Identities=17% Similarity=0.195 Sum_probs=377.5 Q ss_pred CCCccc--hhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc------Cccccc Q lcl|NC_018086. 1 MAIPNG--QINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF------DDTNKP 72 (511) Q Consensus 1 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~------~~~~~~ 72 (511) |--.+. ..+.+++... ..+.+...+.|.+++++|..++++|+++++||+|+|+++.++.+ ..+.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~------~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~ 74 (472) T protein:vir:93 1 MYPSQPTQTEIFDAIVRT------NNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKP 74 (472) T ss_pred CCCCCCcchhhhhceeee------cCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhcccccccccc Confidence 211111 1111222111 11223445678889999999999999999999999998765543 345677 Q ss_pred cceeccchHHHHHHHHHhhhhccCceecCchhhH-HHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEE Q lcl|NC_018086. 73 NSKIVHNFPKLLVDTSTAYLAGEPITESGDEKTI-KAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKA 151 (511) Q Consensus 73 ~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~-~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~ 151 (511) ++|+++||+++||++.++|++|+|++++++++.. +.++++| .|+|+..+.++++++++||+||++||.+++|++++++ T Consensus 75 ~~ri~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~~i~~ 153 (472) T protein:vir:93 75 DDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFR 153 (472) T ss_pred ccccccchHHHHHHHHhhhhcccCeeeccCChHHHHHHHHHH-hccHHHHHHHHHHHHhhcCeEEEEEEECCCCceEEEE Confidence 8899999999999999999999999998666544 5555554 6899999999999999999999999999999999999 Q ss_pred EcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCc Q lcl|NC_018086. 152 VSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQK 231 (511) Q Consensus 152 ~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 231 (511) +||.+++++||+...+++.+++|+|..... .++++|+++.+++|....+...... .....+......+|+||. T Consensus 154 ~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 226 (472) T protein:vir:93 154 VPAEQGIPIWTDKEHEELEAFIRMYKLENE------TKVEYWDKVTVNYYVYENGSLIPDY-SNNLENSKTHFSTGSWGK 226 (472) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEeecc------eeEEEEecCeEEEEEEecCeeeecc-cccccccccccccCCCCC Confidence 999999999999888889999999875432 2578999999999887765443322 223345566788999999 Q ss_pred cceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCCCceeeee Q lcl|NC_018086. 232 FPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDEDGMVKFIT 311 (511) Q Consensus 232 iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 311 (511) ||||+|+|+++|+|+|++|++|+|+||+++|++++.++++++|+++++|.+.++.+++...++..+++.++++++++|++ T Consensus 227 vPvv~~~nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 306 (472) T protein:vir:93 227 IPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQ 306 (472) T ss_pred cceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccchhhHHHHhhccccccCCCCcceeEe Confidence 99999999999999999999999999999999999999999999999999888777888888888999999999999999 Q ss_pred cCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_018086. 312 KDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNK 390 (511) Q Consensus 312 ~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~ 390 (511) ++++.+++++++++|+++|+.+|++|+++++.+ +|+||+||++++.+|..||+++++.|+.+|++++++|+.+++.. T Consensus 307 ~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~-- 384 (472) T protein:vir:93 307 VEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK-- 384 (472) T ss_pred ecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-- Confidence 999999999999999999999999999999887 57999999999999999999999999999999999999887643 Q ss_pred CccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCC Q lcl|NC_018086. 391 AKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQG 470 (511) Q Consensus 391 ~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 470 (511) .+..+++|+|++++|.|.++.++++++++|++|++|+++++|+++|+++|++||++|+++.+..... ... .+.. T Consensus 385 ---~~~~~i~v~f~~~~p~~~~~~~~~~~k~~giis~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~-~~~-~~~d- 458 (472) T protein:vir:93 385 ---GEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPN-LDD-GGAD- 458 (472) T ss_pred ---cccceeeEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccC-cCc-ccCC- Confidence 3456789999999999999999999999999999999999999999999999999998776554211 110 0000 Q ss_pred CCCccccccCCCCCCccccccCCC Q lcl|NC_018086. 471 ASTAAANKLDKNPANTSTITTTDP 494 (511) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~ 494 (511) +.++++.++.. +++ T Consensus 459 -----~~~~~~~~~~~-----~~e 472 (472) T protein:vir:93 459 -----GAQQQERSNNK-----ESE 472 (472) T ss_pred -----CCCCCCCCCcc-----cCC Confidence 00001111000 000 No 25 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=100.00 E-value=1.5e-94 Score=534.96 Aligned_cols=461 Identities=18% Similarity=0.232 Sum_probs=381.8 Q ss_pred CCCccchhhcccccCchhhHhhhhc-cCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc------Ccccccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIR-RNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF------DDTNKPN 73 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~------~~~~~~~ 73 (511) +-++++.=....+ +..++. .....+.|.+++++|..++++++++++||.|+|++..++.. ..+.+++ T Consensus 5 ~~~~~~~~~~~~~------~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~ 78 (474) T protein:vir:95 5 IRMPWDKPYGEEV------VEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPD 78 (474) T ss_pred ccCCCCCCCCcch------hhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccc Confidence 3444443322222 222333 23556678889999999999999999999999998876533 2456788 Q ss_pred ceeccchHHHHHHHHHhhhhccCceecCchhh-HHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEE Q lcl|NC_018086. 74 SKIVHNFPKLLVDTSTAYLAGEPITESGDEKT-IKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAV 152 (511) Q Consensus 74 ~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~-~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~ 152 (511) +|+++||+++||++.++|++|+|++++++++. .+.++++ ..|+++..+.+++++++++|+||+++|.+++|+++++++ T Consensus 79 ~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~-~~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~ 157 (474) T protein:vir:95 79 WRITTNFHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQV-LDTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRV 157 (474) T ss_pred cccccchHHHHHHhhhhhhcccCceeccCChHHHHHHHHH-HhccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEE Confidence 89999999999999999999999999876654 4555555 468999999999999999999999999999999999999 Q ss_pred cccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCcc Q lcl|NC_018086. 153 SPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKF 232 (511) Q Consensus 153 ~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~i 232 (511) ||++++++|++....++++++|+|... ...++++|+++.+++|....++... .......+......+|+||.| T Consensus 158 ~p~~~~~v~d~~~~~~~~a~ir~~~~~------~~~~~~vy~~~~i~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v 230 (474) T protein:vir:95 158 PAEQAIPIWTDKEREQLNAFIRIFTFN------GETKVEYWTAETVTYYVYENGGLIP-DFYYGDEHIQTHFSTGSWERV 230 (474) T ss_pred cccceEEEEcCCCCCceEEEEEEEeec------CeeEEEEEeCCeEEEEEEcCCceee-ccccccccccCcccccCCCcc Confidence 999999999999889999999998642 2357899999999999887765432 223334456677889999999 Q ss_pred ceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCCCceeeeec Q lcl|NC_018086. 233 PVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDEDGMVKFITK 312 (511) Q Consensus 233 Pvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 312 (511) |||+|+|+++|.|+|+++++|+|+||.++|++++.+++|++|+++++|+.+++.+++..+++..+++.++++++++|+++ T Consensus 231 Pvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~ 310 (474) T protein:vir:95 231 PFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEGLKYYKAINVSSDGGVETIQV 310 (474) T ss_pred ceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhhhhccceeeccCCCceeEEec Confidence 99999999999999999999999999999999999999999999999998887788888889999999999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHHhCcccccccccc-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_018086. 313 DVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKA 391 (511) Q Consensus 313 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~ 391 (511) +.+.+++++++++|.++|+.+|++|++++++++ ++||+||+++++++.+||.++++.|+.+|++++++|+++++. T Consensus 311 ~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~---- 386 (474) T protein:vir:95 311 EVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKI---- 386 (474) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC---- Confidence 999999999999999999999999999988774 799999999999999999999999999999999999988753 Q ss_pred ccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCC Q lcl|NC_018086. 392 KDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGA 471 (511) Q Consensus 392 ~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 471 (511) .++..+++++|++++|.|.++.++++++ +|++|+||+++++|+++|+++|++||++|+++.++... .. .+. T Consensus 387 -~~d~~~i~i~f~~~~p~~~~e~a~~~~~-~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~~~-~~------~~~ 457 (474) T protein:vir:95 387 -KLDAKEIEITFNFNVMVNDLEQSQIGAQ-SQYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQLP-NL------DDG 457 (474) T ss_pred -CcccceeeEEecCCCccCHHHHHHHHHH-cCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcc-cc------ccc Confidence 3456789999999999999999999877 49999999999999999999999999999887654321 11 111 Q ss_pred CCccccccCCCCCCccc Q lcl|NC_018086. 472 STAAANKLDKNPANTST 488 (511) Q Consensus 472 ~~~~~~~~~~~~~~~~~ 488 (511) ....+.+.++..+.... T Consensus 458 ~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 458 GADGAQQQQQSENNQSK 474 (474) T ss_pred cCCCCCCcCCCCccccC Confidence 11111111111111101 No 26 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=100.00 E-value=1.5e-94 Score=534.96 Aligned_cols=461 Identities=18% Similarity=0.232 Sum_probs=381.8 Q ss_pred CCCccchhhcccccCchhhHhhhhc-cCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc------Ccccccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIR-RNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF------DDTNKPN 73 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~------~~~~~~~ 73 (511) +-++++.=....+ +..++. .....+.|.+++++|..++++++++++||.|+|++..++.. ..+.+++ T Consensus 5 ~~~~~~~~~~~~~------~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~ 78 (474) T protein:vir:96 5 IRMPWDKPYGEEV------VEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPD 78 (474) T ss_pred ccCCCCCCCCcch------hhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccc Confidence 3444443322222 222333 23556678889999999999999999999999998876533 2456788 Q ss_pred ceeccchHHHHHHHHHhhhhccCceecCchhh-HHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEE Q lcl|NC_018086. 74 SKIVHNFPKLLVDTSTAYLAGEPITESGDEKT-IKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAV 152 (511) Q Consensus 74 ~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~-~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~ 152 (511) +|+++||+++||++.++|++|+|++++++++. .+.++++ ..|+++..+.+++++++++|+||+++|.+++|+++++++ T Consensus 79 ~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~-~~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~ 157 (474) T protein:vir:96 79 WRITTNFHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQV-LDTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRV 157 (474) T ss_pred cccccchHHHHHHhhhhhhcccCceeccCChHHHHHHHHH-HhccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEE Confidence 89999999999999999999999999876654 4555555 468999999999999999999999999999999999999 Q ss_pred cccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCcc Q lcl|NC_018086. 153 SPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKF 232 (511) Q Consensus 153 ~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~i 232 (511) ||++++++|++....++++++|+|... ...++++|+++.+++|....++... .......+......+|+||.| T Consensus 158 ~p~~~~~v~d~~~~~~~~a~ir~~~~~------~~~~~~vy~~~~i~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v 230 (474) T protein:vir:96 158 PAEQAIPIWTDKEREQLNAFIRIFTFN------GETKVEYWTAETVTYYVYENGGLIP-DFYYGDEHIQTHFSTGSWERV 230 (474) T ss_pred cccceEEEEcCCCCCceEEEEEEEeec------CeeEEEEEeCCeEEEEEEcCCceee-ccccccccccCcccccCCCcc Confidence 999999999999889999999998642 2357899999999999887765432 223334456677889999999 Q ss_pred ceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCCCceeeeec Q lcl|NC_018086. 233 PVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDEDGMVKFITK 312 (511) Q Consensus 233 Pvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 312 (511) |||+|+|+++|.|+|+++++|+|+||.++|++++.+++|++|+++++|+.+++.+++..+++..+++.++++++++|+++ T Consensus 231 Pvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~ 310 (474) T protein:vir:96 231 PFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEGLKYYKAINVSSDGGVETIQV 310 (474) T ss_pred ceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhhhhccceeeccCCCceeEEec Confidence 99999999999999999999999999999999999999999999999998887788888889999999999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHHhCcccccccccc-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_018086. 313 DVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKA 391 (511) Q Consensus 313 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~ 391 (511) +.+.+++++++++|.++|+.+|++|++++++++ ++||+||+++++++.+||.++++.|+.+|++++++|+++++. T Consensus 311 ~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~---- 386 (474) T protein:vir:96 311 EVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKI---- 386 (474) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC---- Confidence 999999999999999999999999999988774 799999999999999999999999999999999999988753 Q ss_pred ccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCC Q lcl|NC_018086. 392 KDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGA 471 (511) Q Consensus 392 ~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 471 (511) .++..+++++|++++|.|.++.++++++ +|++|+||+++++|+++|+++|++||++|+++.++... .. .+. T Consensus 387 -~~d~~~i~i~f~~~~p~~~~e~a~~~~~-~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~~~-~~------~~~ 457 (474) T protein:vir:96 387 -KLDAKEIEITFNFNVMVNDLEQSQIGAQ-SQYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQLP-NL------DDG 457 (474) T ss_pred -CcccceeeEEecCCCccCHHHHHHHHHH-cCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcc-cc------ccc Confidence 3456789999999999999999999877 49999999999999999999999999999887654321 11 111 Q ss_pred CCccccccCCCCCCccc Q lcl|NC_018086. 472 STAAANKLDKNPANTST 488 (511) Q Consensus 472 ~~~~~~~~~~~~~~~~~ 488 (511) ....+.+.++..+.... T Consensus 458 ~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 458 GADGAQQQQQSENNQSK 474 (474) T ss_pred cCCCCCCcCCCCccccC Confidence 11111111111111101 No 27 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=100.00 E-value=2.7e-94 Score=533.53 Aligned_cols=424 Identities=25% Similarity=0.407 Sum_probs=376.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCceecCchh-hH Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDEK-TI 106 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~-~~ 106 (511) ++.+.|.+++++|..++++|+++++||+|+|+++.+. .....++++|+++||+++||++.++|++|+|++++++++ .. T Consensus 1 l~~~~l~~~i~~~~~~~~r~~~l~~yy~g~~~il~~~-~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~ 79 (429) T protein:vir:98 1 MTKDLLSELIQKHRSFNLSYSAYKQLYEGDHAILQQK-QKEQYKPDNRLVVNFAKYIVDTFNGYFIGVPVQTSHENKQVS 79 (429) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc-ccccCCCcceeecchHHHHHHHHhhhhcccCceeecCChHHH Confidence 8999999999999999999999999999999987554 356678899999999999999999999999999987654 56 Q ss_pred HHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcce Q lcl|NC_018086. 107 KAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQ 186 (511) Q Consensus 107 ~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~ 186 (511) +.++++|+.|+|+..+.+++++++++|+||+++|.+++|++++++++|.+++++||+....++++++|+|... .. T Consensus 80 ~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~~~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~-----~~ 154 (429) T protein:vir:98 80 NYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDENAEAGITYLTPLEAFIVYDDSIRQKPLFAVRYFYNK-----GG 154 (429) T ss_pred HHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEEEEEEcccceEEEEeCCCCCceEEEEEEEEec-----Cc Confidence 7899999999999999999999999999999999999999999999999999999998888999999988532 24 Q ss_pred EEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 187 IRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVN 266 (511) Q Consensus 187 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~ 266 (511) ..+.++|+.+.+++|.....+ +...+..+|+||+||||+|+|+++|+|+|+++++|+|+||+++|++++ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liD~~d~~~s~~~~ 223 (429) T protein:vir:98 155 VLEGSYSDASNITYFKDGEKG-----------IEIGESEPHPFDGVPMIEYVENEERQSLLASVVTLINAFNKAISEKAN 223 (429) T ss_pred eEEEEEEeCceEEEEEecCCc-----------eEecccccccCCccceEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHH Confidence 567789999998888765433 445677899999999999999999999999999999999999999999 Q ss_pred HHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCC----CceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccc Q lcl|NC_018086. 267 DIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDED----GMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSK 342 (511) Q Consensus 267 ~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~----~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~ 342 (511) .++++++|+++++|.+.+. ++..++...+++.++++ ++++|++++.+.++++++++.|.++|+.+|++|+++++ T Consensus 224 ~~~~~~~p~~~i~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~ 301 (429) T protein:vir:98 224 DVEYFADAYLKILGAELDD--ETLKSLRDTRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVANISDE 301 (429) T ss_pred HHHHhcCceeeeecCCCCc--chhhhHhhCceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCcc Confidence 9999999999999987653 45667778888888643 47899999999999999999999999999999999999 Q ss_pred cccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHh Q lcl|NC_018086. 343 DFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR 422 (511) Q Consensus 343 ~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~ 422 (511) ++||+||+||++++++|..|+.++++.|+.+|++++++|+.+++..+. ..+..+++|.|++++|.|.++.|+++++++ T Consensus 302 ~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~--~~d~~~i~v~f~~~~p~~~~~~a~~~~kl~ 379 (429) T protein:vir:98 302 SFGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIG--PKDWIGIKYKFTRNLPANLLEESQIAGNLA 379 (429) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC--ccccccceEEeCCCCCcCHHHHHHHHHHHh Confidence 999999999999999999999999999999999999999999876554 456678999999999999999999999999 Q ss_pred ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCC Q lcl|NC_018086. 423 DMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGAS 472 (511) Q Consensus 423 g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 472 (511) |++|+||+++++|+++|+++|++||++|+++..+.....++.+......+ T Consensus 380 g~is~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 429 (429) T protein:vir:98 380 GIVSEETQVGVLSIVENPQKEIERKNSDKSTLISRQAGGLNGQNTTTILE 429 (429) T ss_pred ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhcCCCCCCCCC Confidence 99999999999999999999999999998877665444333322222111 No 28 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=100.00 E-value=3.3e-94 Score=533.11 Aligned_cols=472 Identities=23% Similarity=0.331 Sum_probs=380.4 Q ss_pred CCCccc-----hh-hcccccCchhhHh----hhhccC-CCHHHHHHHHHHHHHH-HHHHHHHHHHhcCCC-cccccCCcC Q lcl|NC_018086. 1 MAIPNG-----QI-NAGDIITTNIRRK----HFIRRN-FDLRELITLAEMHSRS-SSAYGVLYDYYKGNH-IAIQSRTFD 67 (511) Q Consensus 1 ~~~~~~-----~~-~~~~~~~~~~~~~----~~~~~~-~~~~~l~~~~~~~~~~-~~~~~~~~~yY~G~~-~~~~~~~~~ 67 (511) |-...+ +. +.+......-+.. .+.+.. .+.+.|.+++.+|..+ .++|+++.+||.|+| .+..+.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~~~ 80 (501) T protein:vir:96 1 MEQTLFTDSTGQERVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRK 80 (501) T ss_pred CceeeeeecccceeccccccchhHHhhhcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCccccC Confidence 444333 11 1111111111111 111111 2235588899999865 579999999999985 566666666 Q ss_pred ccccccceeccchHHHHHHHHHhhhhccCceecCch-----hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeC Q lcl|NC_018086. 68 DTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDE-----KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWID 142 (511) Q Consensus 68 ~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~-----~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~ 142 (511) ...++++|+++||+++||++.++|++|+|+++++++ ...+.|+++|+.|+|+..+.++++++++||+||+++|.+ T Consensus 81 ~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d 160 (501) T protein:vir:96 81 DNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNDDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRS 160 (501) T ss_pred ccccccceeecchHHHHHHHHhhhhcccCeeEeeCCccchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEc Confidence 778899999999999999999999999999997543 345668999999999999999999999999999999999 Q ss_pred CCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccc Q lcl|NC_018086. 143 RNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDY 222 (511) Q Consensus 143 ~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 222 (511) ++|++++++++|.+++++||+...++++++|++|......+ .+.++++||++.+++|....+ +.+. T Consensus 161 edg~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~--~~~~~~vyt~~~i~~~~~~~~------------~~~~ 226 (501) T protein:vir:96 161 EYDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQS--AKDVVEIYTDEHIYTLDASDD------------FNEI 226 (501) T ss_pred CCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeecCCC--cEEEEEEEcCCcEEEEeeCCC------------ceec Confidence 99999999999999999999998889999999997655433 456789999999999975433 3455 Q ss_pred cceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeec Q lcl|NC_018086. 223 EVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTD 302 (511) Q Consensus 223 ~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~ 302 (511) ...+|+||.||||+|+|+++|+|+|+++++|+|+||+++|++++.++++++|+++++|......+++...++..+++.+. T Consensus 227 ~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~~~~~~~~~ 306 (501) T protein:vir:96 227 SVTTHAFGTVPITEYLNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLK 306 (501) T ss_pred cccccCCCccceEEecCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchhhhhhcCeeeec Confidence 67899999999999999999999999999999999999999999999999999999999887777777888877777664 Q ss_pred ---------CCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 303 ---------EDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRK 372 (511) Q Consensus 303 ---------~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~ 372 (511) .+++++|++++.+.+++++++++|+++|+.+|++|+++++++ ||+||+||++++++|..||.++++.|+. T Consensus 307 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~ 386 (501) T protein:vir:96 307 PPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTK 386 (501) T ss_pred ccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 245789999999999999999999999999999999999877 6799999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHH Q lcl|NC_018086. 373 VLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQ 452 (511) Q Consensus 373 ~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~ 452 (511) +|++++++|+.+++..+....++..+++|+|++++|.|.++.++++++++|++|++|+++++|+++||++|++||++|++ T Consensus 387 ~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~ 466 (501) T protein:vir:96 387 GLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLGGQVSQETALSLSGLVESPNEELDKINKEMS 466 (501) T ss_pred HHHHHHHHHHHHHHhcccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHH Confidence 99999999999999887777788888999999999999999999999999999999999999999999999999999877 Q ss_pred HHHHHH-HhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCC Q lcl|NC_018086. 453 KRADIA-LQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQ 505 (511) Q Consensus 453 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (511) ...... ...+.+.. +++.++.. +..+..+++. .+ T Consensus 467 ~~~~~~~~~~~~~~~---------~~~~~~~~-e~~~d~~e~~---------~~ 501 (501) T protein:vir:96 467 EIDFKGYSNDFNEHV---------GKYTDEVK-ETHTDDFERE---------YE 501 (501) T ss_pred Hhhccccccchhhcc---------cccCCcCC-CCCCCccccc---------cC Confidence 543211 11111111 11111100 0000000000 00 No 29 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=100.00 E-value=2.9e-93 Score=527.91 Aligned_cols=464 Identities=16% Similarity=0.189 Sum_probs=381.4 Q ss_pred CCccchhhcccccCchhhHhhh-hccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc------Cccccccc Q lcl|NC_018086. 2 AIPNGQINAGDIITTNIRRKHF-IRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF------DDTNKPNS 74 (511) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~------~~~~~~~~ 74 (511) -+.|+|=+.--+.+ +.+..+ .+.+.+.+.|.+++.+|..++++|+++++||+|+|++..++.. ....++++ T Consensus 1 ~~~~~~~~~~~~~~--e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ 78 (478) T protein:vir:10 1 MISINWPWDKPYHE--QVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDW 78 (478) T ss_pred CccccCCCCchhHH--HHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccccccccccccccccc Confidence 23444431111111 112222 2234667889999999999999999999999999998765432 34567888 Q ss_pred eeccchHHHHHHHHHhhhhccCceecCchh-hHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEc Q lcl|NC_018086. 75 KIVHNFPKLLVDTSTAYLAGEPITESGDEK-TIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVS 153 (511) Q Consensus 75 ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~-~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~ 153 (511) |+++||+++||++.++|+||+|++++++++ ..+.|+++++ |+++..+.+++++++++|+||+++|.+++|++++++++ T Consensus 79 ki~~n~~~~ivd~~~~~l~g~~~~~~~~~d~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~~~~~~~ 157 (478) T protein:vir:10 79 RMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFRVP 157 (478) T ss_pred eeccchHHHHHHHHHhhhccCCeeeecCChHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEc Confidence 999999999999999999999999976654 4556777764 89999999999999999999999999999999999999 Q ss_pred ccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccc---cccccccccccceeccCC Q lcl|NC_018086. 154 PMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYRE---IPEELEIKDYEVHPNLLQ 230 (511) Q Consensus 154 p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~g 230 (511) |.+++++|++...+++.+++|+|... ...++++|+++.+++|.+..+...... ......+......+|++| T Consensus 158 p~~~~~i~d~~~~~~~~~~v~~~~~~------~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (478) T protein:vir:10 158 AEQAVPIWTNKERDELQAFIRVYELD------GAERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMSWG 231 (478) T ss_pred ccceEEEEcCCCCCceEEEEEEEEec------CceEEEEEeCCeEEEEEEcCCeeeccccccccccccceecccccccCC Confidence 99999999998888899999998643 234689999999999988765543222 222334455677899999 Q ss_pred ccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeec--CCCcee Q lcl|NC_018086. 231 KFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTD--EDGMVK 308 (511) Q Consensus 231 ~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~--~~~~~~ 308 (511) .||||+|+|+++|+|+|+++++|||+||.++|++++.++++++|+++++|+++++..++..+++..+++.++ ++++++ T Consensus 232 ~vPvv~~~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (478) T protein:vir:10 232 RVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLKYYKAISVAGESGSGVD 311 (478) T ss_pred ccceEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhhhhhhhcceEEecCCCCCcce Confidence 999999999999999999999999999999999999999999999999999988878888888888888876 568899 Q ss_pred eeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 309 FITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEF 387 (511) Q Consensus 309 ~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 387 (511) |++++.+.++++.++++|++.|+.+|++|+++++++ ||+||+||+++++.|.+||+++++.|+.+|++++++|+++++. T Consensus 312 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g~ 391 (478) T protein:vir:10 312 TIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRL 391 (478) T ss_pred EEeecCChHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 999999999999999999999999999999999887 5899999999999999999999999999999999999988742 Q ss_pred cCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_018086. 388 MNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSA 467 (511) Q Consensus 388 ~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 467 (511) .++..+++|+|++++|.|.++.|+++++++|++|++|+++++|+++|+++|++||++|+++..+.... ... + T Consensus 392 -----~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~-~~~--~ 463 (478) T protein:vir:10 392 -----DVKVQDIEITFNFNVMVNELENSQIAMNSTGLLSKETILSNHAWVEDPVAEMERIEQENIELNQQLPD-IEE--G 463 (478) T ss_pred -----CcccccceEEecCCCCCCHHHHHHHHHHHhCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccc-ccc--c Confidence 45667899999999999999999999999999999999999999999999999999998765553211 111 1 Q ss_pred CCCCCCccccccCCCCC Q lcl|NC_018086. 468 VQGASTAAANKLDKNPA 484 (511) Q Consensus 468 ~~~~~~~~~~~~~~~~~ 484 (511) ..+ +.+.++.+.+++ T Consensus 464 ~~~--~~~~~~~~~~~~ 478 (478) T protein:vir:10 464 LNG--EQQRQSENNQPE 478 (478) T ss_pred cCC--CCCCCCCCCCCC Confidence 111 111111111111 No 30 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=100.00 E-value=3.3e-93 Score=527.61 Aligned_cols=463 Identities=16% Similarity=0.182 Sum_probs=382.9 Q ss_pred CCccchhhcccccCchhhHhhhhcc-CCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc------Cccccccc Q lcl|NC_018086. 2 AIPNGQINAGDIITTNIRRKHFIRR-NFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF------DDTNKPNS 74 (511) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~------~~~~~~~~ 74 (511) -+.|+|-+.--+.+. .++.+..+ ....+.|.+++++|..++++|+++.+||.|+|+++.++.. ....++++ T Consensus 1 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ 78 (478) T protein:vir:10 1 MISINWPWDKPYHEQ--VVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDW 78 (478) T ss_pred CccccccCCchhhhH--HHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhcccccccccccc Confidence 235555432222222 23333443 4667778899999999999999999999999998765433 34567889 Q ss_pred eeccchHHHHHHHHHhhhhccCceecCch-hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEc Q lcl|NC_018086. 75 KIVHNFPKLLVDTSTAYLAGEPITESGDE-KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVS 153 (511) Q Consensus 75 ri~~n~~k~ivd~~~~~l~g~~~~~~~d~-~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~ 153 (511) |+++||+++||++.++|+||+|+++++++ +..+.++++| .|+++..+.+++++++++|+||++||.+++|++++++++ T Consensus 79 ki~~n~~k~ivd~~~~yl~g~p~~~~~~~~~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~~~~~~ 157 (478) T protein:vir:10 79 RMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTL-NHKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFRVP 157 (478) T ss_pred eeccchHHHHHHHHhhhhcccCceeecCChHHHHHHHHHH-hccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEEc Confidence 99999999999999999999999997655 4556677776 489999999999999999999999999999999999999 Q ss_pred ccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccc---cccccccccccceeccCC Q lcl|NC_018086. 154 PMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYRE---IPEELEIKDYEVHPNLLQ 230 (511) Q Consensus 154 p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~g 230 (511) |.+++++|++...+++.+++|+|.... ..++++|+++.+++|....+...... ......+......+|+|| T Consensus 158 p~~~~~v~d~~~~~~~~~~ir~~~~~~------~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 231 (478) T protein:vir:10 158 AEQAVPIWTNKERDELQAFIRVYELDG------AERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWG 231 (478) T ss_pred ccceEEEEcCCCCCceEEEEEEEeeeC------ceEEEEEeCCcEEEEEecCCeeeccccccccccccceecccccccCC Confidence 999999999988888999999986432 34689999999999987765433221 112233445667899999 Q ss_pred ccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeec--CCCcee Q lcl|NC_018086. 231 KFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTD--EDGMVK 308 (511) Q Consensus 231 ~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~--~~~~~~ 308 (511) +||||+|+|++.|+|+|+++++|+|+||.++|++++.++++++|+++++|+++++..++..+++..+++.++ ++++++ T Consensus 232 ~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (478) T protein:vir:10 232 RVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLKYYKAISVAGESGSGVD 311 (478) T ss_pred cceEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccccchhhhhhhCceeEecCCCCCcce Confidence 999999999999999999999999999999999999999999999999999988878888888888888775 468899 Q ss_pred eeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 309 FITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEF 387 (511) Q Consensus 309 ~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 387 (511) |++++++.++++.++++|.+.|+.+|++|+++++++ |++||+||++++++|.+||.++++.|+.+|++++++|+++++. T Consensus 312 ~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 391 (478) T protein:vir:10 312 TIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRL 391 (478) T ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 999999999999999999999999999999998887 5799999999999999999999999999999999999988753 Q ss_pred cCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_018086. 388 MNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSA 467 (511) Q Consensus 388 ~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 467 (511) .++..+++|+|++++|.|.++.++++++++|++|++|+++++|+++|+++|++||++|+++......... T Consensus 392 -----~~d~~~i~i~f~~~~p~~~~e~~~~~~~~~g~iS~et~i~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~----- 461 (478) T protein:vir:10 392 -----DVRVQDIEITFNFNVMVNELENSQIAMNSTGLLSKETILGNHSWVQDPVAEMERIEQENIELNQQLPDIE----- 461 (478) T ss_pred -----CcccccceEEeCCCCCCCHHHHHHHHHHHhCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccccC----- Confidence 4566789999999999999999999999999999999999999999999999999999887654322111 Q ss_pred CCCCCCccccc-cCCCCC Q lcl|NC_018086. 468 VQGASTAAANK-LDKNPA 484 (511) Q Consensus 468 ~~~~~~~~~~~-~~~~~~ 484 (511) .+..+....+ .+++++ T Consensus 462 -~~~~d~~~~~~~d~~~e 478 (478) T protein:vir:10 462 -EGLNDEQQRQSEDNQSE 478 (478) T ss_pred -CCCcccccccCcCCCCC Confidence 1111111111 111111 No 31 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=100.00 E-value=3.5e-93 Score=527.46 Aligned_cols=449 Identities=25% Similarity=0.322 Sum_probs=383.7 Q ss_pred cCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc---cccC-------------CcCccccccceec Q lcl|NC_018086. 14 ITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIA---IQSR-------------TFDDTNKPNSKIV 77 (511) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~---~~~~-------------~~~~~~~~~~ri~ 77 (511) ++...-+.++.+++++.+.|.++|+.|..++.++.++.+||+|.++. ..++ ......++++|++ T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 66666677888889999999999999999999999999999997542 2221 1234556888999 Q ss_pred cchHHHHHHHHHhhhhccCceecCc------hhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEE Q lcl|NC_018086. 78 HNFPKLLVDTSTAYLAGEPITESGD------EKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKA 151 (511) Q Consensus 78 ~n~~k~ivd~~~~~l~g~~~~~~~d------~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~ 151 (511) +||+++||++.++|++|+|++++++ ++..+.|+++|+.|+++.++.+++++++++|+||+++|.+++|++++++ T Consensus 81 ~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~~~~ 160 (474) T protein:vir:10 81 NSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDIRIKN 160 (474) T ss_pred cchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCeeEEEE Confidence 9999999999999999999998653 3455778999999999999999999999999999999999999999999 Q ss_pred EcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCc Q lcl|NC_018086. 152 VSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQK 231 (511) Q Consensus 152 ~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 231 (511) ++|++++++||+.. +++++|++|...++.++..++++++||++.+++|...+. ..+...+..+|+||. T Consensus 161 i~p~~~~~v~d~~~--~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~----------~~~~~~~~~~~~~g~ 228 (474) T protein:vir:10 161 IDPYNVIFVGDNIL--EPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGI----------DALQEVGRYEHLFDY 228 (474) T ss_pred EcccceEEEEcCCC--ceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCC----------CcccccccccCCCCc Confidence 99999999998644 578999999988888888889999999999999876532 235567788999999 Q ss_pred cceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeee-cCCCceeee Q lcl|NC_018086. 232 FPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVT-DEDGMVKFI 310 (511) Q Consensus 232 iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~-~~~~~~~~~ 310 (511) ||||+|+|+++|+|+|+++++|+|+||.++|++++.++++++|+++++|++.++ +...+++..+++.+ +++++++|+ T Consensus 229 vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~--~~~~~~~~~~~i~~~~~~~~~~~l 306 (474) T protein:vir:10 229 NPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSE--EMIQETQKSGAFELFDKDMDVKYL 306 (474) T ss_pred cceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCc--hhhhhhhhcceeEecCCCCceeEE Confidence 999999999999999999999999999999999999999999999999987654 44556666676655 678999999 Q ss_pred ecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_018086. 311 TKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMN 389 (511) Q Consensus 311 ~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~ 389 (511) +++.+.+++++++++|.+.|+.+|++|+++++.+ ||+||+||++++++|.+||.++++.|+.+|++++++|+.+++..+ T Consensus 307 ~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~ 386 (474) T protein:vir:10 307 TKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKG 386 (474) T ss_pred eccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 9999999999999999999999999999998876 689999999999999999999999999999999999999998765 Q ss_pred CC-ccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccC Q lcl|NC_018086. 390 KA-KDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAV 468 (511) Q Consensus 390 ~~-~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 468 (511) .. .+.+..++++.|++++|.|.++.|+++++++|++|++|+++++|+++|+++|++||++|+++..+...+ ..+ T Consensus 387 ~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~----~~~- 461 (474) T protein:vir:10 387 YNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLKGQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPD----IDE- 461 (474) T ss_pred CCCCccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccc----ccC- Confidence 43 456677899999999999999999999999999999999999999999999999999998765543211 111 Q ss_pred CCCCCccccccCCCC Q lcl|NC_018086. 469 QGASTAAANKLDKNP 483 (511) Q Consensus 469 ~~~~~~~~~~~~~~~ 483 (511) +......+.+++. T Consensus 462 --~~~~~~~~~~~s~ 474 (474) T protein:vir:10 462 --GDANDKSQNNQSE 474 (474) T ss_pred --CCcCCCCccccCC Confidence 1111010000000 No 32 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=100.00 E-value=3.5e-93 Score=527.46 Aligned_cols=449 Identities=25% Similarity=0.322 Sum_probs=383.7 Q ss_pred cCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc---cccC-------------CcCccccccceec Q lcl|NC_018086. 14 ITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIA---IQSR-------------TFDDTNKPNSKIV 77 (511) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~---~~~~-------------~~~~~~~~~~ri~ 77 (511) ++...-+.++.+++++.+.|.++|+.|..++.++.++.+||+|.++. ..++ ......++++|++ T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 66666677888889999999999999999999999999999997542 2221 1234556888999 Q ss_pred cchHHHHHHHHHhhhhccCceecCc------hhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEE Q lcl|NC_018086. 78 HNFPKLLVDTSTAYLAGEPITESGD------EKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKA 151 (511) Q Consensus 78 ~n~~k~ivd~~~~~l~g~~~~~~~d------~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~ 151 (511) +||+++||++.++|++|+|++++++ ++..+.|+++|+.|+++.++.+++++++++|+||+++|.+++|++++++ T Consensus 81 ~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~~~~ 160 (474) T protein:vir:94 81 NSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDIRIKN 160 (474) T ss_pred cchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCeeEEEE Confidence 9999999999999999999998653 3455778999999999999999999999999999999999999999999 Q ss_pred EcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCc Q lcl|NC_018086. 152 VSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQK 231 (511) Q Consensus 152 ~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 231 (511) ++|++++++||+.. +++++|++|...++.++..++++++||++.+++|...+. ..+...+..+|+||. T Consensus 161 i~p~~~~~v~d~~~--~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~----------~~~~~~~~~~~~~g~ 228 (474) T protein:vir:94 161 IDPYNVIFVGDNIL--EPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGI----------DALQEVGRYEHLFDY 228 (474) T ss_pred EcccceEEEEcCCC--ceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCC----------CcccccccccCCCCc Confidence 99999999998644 578999999988888888889999999999999876532 235567788999999 Q ss_pred cceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeee-cCCCceeee Q lcl|NC_018086. 232 FPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVT-DEDGMVKFI 310 (511) Q Consensus 232 iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~-~~~~~~~~~ 310 (511) ||||+|+|+++|+|+|+++++|+|+||.++|++++.++++++|+++++|++.++ +...+++..+++.+ +++++++|+ T Consensus 229 vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~--~~~~~~~~~~~i~~~~~~~~~~~l 306 (474) T protein:vir:94 229 NPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSE--EMIQETQKSGAFELFDKDMDVKYL 306 (474) T ss_pred cceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCc--hhhhhhhhcceeEecCCCCceeEE Confidence 999999999999999999999999999999999999999999999999987654 44556666676655 678999999 Q ss_pred ecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_018086. 311 TKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMN 389 (511) Q Consensus 311 ~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~ 389 (511) +++.+.+++++++++|.+.|+.+|++|+++++.+ ||+||+||++++++|.+||.++++.|+.+|++++++|+.+++..+ T Consensus 307 ~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~ 386 (474) T protein:vir:94 307 TKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKG 386 (474) T ss_pred eccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 9999999999999999999999999999998876 689999999999999999999999999999999999999998765 Q ss_pred CC-ccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccC Q lcl|NC_018086. 390 KA-KDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAV 468 (511) Q Consensus 390 ~~-~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 468 (511) .. .+.+..++++.|++++|.|.++.|+++++++|++|++|+++++|+++|+++|++||++|+++..+...+ ..+ T Consensus 387 ~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~----~~~- 461 (474) T protein:vir:94 387 YNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLKGQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPD----IDE- 461 (474) T ss_pred CCCCccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccc----ccC- Confidence 43 456677899999999999999999999999999999999999999999999999999998765543211 111 Q ss_pred CCCCCccccccCCCC Q lcl|NC_018086. 469 QGASTAAANKLDKNP 483 (511) Q Consensus 469 ~~~~~~~~~~~~~~~ 483 (511) +......+.+++. T Consensus 462 --~~~~~~~~~~~s~ 474 (474) T protein:vir:94 462 --GDANDKSQNNQSE 474 (474) T ss_pred --CCcCCCCccccCC Confidence 1111010000000 No 33 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=100.00 E-value=4.7e-93 Score=526.76 Aligned_cols=462 Identities=17% Similarity=0.198 Sum_probs=383.7 Q ss_pred CCCccchhhcccccCchhhHhhhhc-cCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc------Ccccccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIR-RNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF------DDTNKPN 73 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~------~~~~~~~ 73 (511) +-|++++- .+.+.+..+.+ ..++.+.|.+++++|..++++|+++.+||.|+|++.++... .+..+++ T Consensus 5 ~~~~~~~~------~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~ 78 (474) T protein:vir:95 5 IRMPWDKP------YGEEVVEQLKPQFETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPD 78 (474) T ss_pred eecCCCCc------hhhHHHHhhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhcccccccccccccccccc Confidence 33344333 22223333333 45778889999999999999999999999999998765443 3456778 Q ss_pred ceeccchHHHHHHHHHhhhhccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEc Q lcl|NC_018086. 74 SKIVHNFPKLLVDTSTAYLAGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVS 153 (511) Q Consensus 74 ~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~ 153 (511) +|+++||+++||++.++|+||+|++++++++....+.+.|.+|+++..+.+++++++++|+||+++|.+++|++++++++ T Consensus 79 ~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~~i~~~~ 158 (474) T protein:vir:95 79 WRITTNFHQNLVDQKVSYVASKPVTYSCEDESVLKIIHDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVP 158 (474) T ss_pred ceeccchHHHHHHHHHhhhccCCceeccCchHHHHHHHHHHhccHHHHHHHHHHHHhhcCcEEEEEEecCCCceEEEEEc Confidence 89999999999999999999999999887766555555555688999999999999999999999999999999999999 Q ss_pred ccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccc Q lcl|NC_018086. 154 PMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFP 233 (511) Q Consensus 154 p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 233 (511) |.+++++|++...+++.+++++|.... ..++++|+++.+++|....+++...... ..........+|++|.|| T Consensus 159 p~~~~~v~d~~~~~~~~~~i~~~~~~~------~~~~~~y~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~iP 231 (474) T protein:vir:95 159 AEQAIPIWVDKEREELKSFIRYYKFNN------EEKVEFWTDTTVTYYVLENGGLIPDYYY-GANHIQSHFSNGNWGRVP 231 (474) T ss_pred ccceEEEEcCCCCCceEEEEEEEEEcC------eeEEEEEeCCeEEEEEEcCCcccccccc-CcccccccccccCCCccc Confidence 999999999988888999999986432 2368999999999999887665433222 223345667899999999 Q ss_pred eEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCCCceeeeecC Q lcl|NC_018086. 234 VLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDEDGMVKFITKD 313 (511) Q Consensus 234 vv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 313 (511) ||+|+|++.|+|+|+++++|||+||+++|++++.++++++|+++++|+++++.+++..++...+++.++++++++|++++ T Consensus 232 vv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~ 311 (474) T protein:vir:95 232 FIAFKNNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMRGLKYYKAINVDGDGGVETIQVE 311 (474) T ss_pred eEeecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCCCceeEEeec Confidence 99999999999999999999999999999999999999999999999998887888888888999999999999999999 Q ss_pred CCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_018086. 314 VNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAK 392 (511) Q Consensus 314 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~ 392 (511) .+.++++.++++|.++|+.+|++|+++++++ |++||+||++++++|..||.++++.|+.+|++++++|+.+++. T Consensus 312 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~----- 386 (474) T protein:vir:95 312 VPVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNL----- 386 (474) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----- Confidence 9999999999999999999999999998877 5799999999999999999999999999999999999988754 Q ss_pred cccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCC Q lcl|NC_018086. 393 DLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGAS 472 (511) Q Consensus 393 ~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 472 (511) ..+..+++|+|++++|.|+++.|++++++ |++|++|++.++|+++|+++|++||++|+++....... ... .+. .. T Consensus 387 ~~d~~~i~v~f~~~~p~d~~e~a~~~~~~-g~iS~et~i~~l~~v~d~~~E~~ri~~E~~~~~~~~~~-~~~-~~~--d~ 461 (474) T protein:vir:95 387 KMDVKDIEISFNFNRMMNDAEQSQIIAQS-QYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPN-LDD-GGA--DG 461 (474) T ss_pred CcccceeeEEeccCCCcCHHHHHHHHHhc-CCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccc-ccc-ccC--CC Confidence 35667899999999999999999999886 89999999999999999999999999998776553211 111 000 00 Q ss_pred CccccccCCCCCC Q lcl|NC_018086. 473 TAAANKLDKNPAN 485 (511) Q Consensus 473 ~~~~~~~~~~~~~ 485 (511) ..+.++.++..+. T Consensus 462 ~~~~~~~~~~~~~ 474 (474) T protein:vir:95 462 AQQQERSNDKESE 474 (474) T ss_pred CcCCCCCccCCCC Confidence 0111111111100 No 34 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=100.00 E-value=1.3e-92 Score=524.43 Aligned_cols=457 Identities=23% Similarity=0.301 Sum_probs=383.8 Q ss_pred CCCccchhhccc--ccCchhhHhhhhccCCCHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCcccccCCcCccccccceec Q lcl|NC_018086. 1 MAIPNGQINAGD--IITTNIRRKHFIRRNFDLRELITLAEMHSR-SSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIV 77 (511) Q Consensus 1 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~ 77 (511) |+. |++. ..+.+..+....+++++.++|.+++++|.. ++++|+++++||+|+|++.++. ..+.++++|++ T Consensus 1 ~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~~--~~~~~~~~ki~ 73 (470) T protein:vir:99 1 MKD-----INYGRDKVTGNSSFIFPKGEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKILTAP--EKETGADNRIV 73 (470) T ss_pred Ccc-----ccCCcccccCCceEEeCCCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccccccCc--ccccCCcceee Confidence 432 2222 235555555667788999999999999875 4589999999999999987654 34577899999 Q ss_pred cchHHHHHHHHHhhhhccCceecCc--hhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEccc Q lcl|NC_018086. 78 HNFPKLLVDTSTAYLAGEPITESGD--EKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPM 155 (511) Q Consensus 78 ~n~~k~ivd~~~~~l~g~~~~~~~d--~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~ 155 (511) +||+++||++.++|++|+|++++++ ++..+.++++|+.|+|+.++.+++++++++|+||+++|.+++|++++++++|. T Consensus 74 ~n~~~~Ivd~~~~~l~g~p~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~ 153 (470) T protein:vir:99 74 VNSAKYVVDVYNGYFCGIEPKLALLNDSSKIDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDARPHLMYSSPN 153 (470) T ss_pred cchHHHHHHHHhhhhccCCeeEeeCCchhHHHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeEEEEEEccc Confidence 9999999999999999999998653 35567899999999999999999999999999999999999999999999999 Q ss_pred ceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceE Q lcl|NC_018086. 156 NCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVL 235 (511) Q Consensus 156 ~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 235 (511) +++++||+....++.+++|+|... .++....++++|+++.+++|.....++ .....+..+|+||.|||| T Consensus 154 ~~~~i~d~~~~~~~~~~vr~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~g~vPvv 222 (470) T protein:vir:99 154 HAFIIYDDTVQRQPLAFVHYQIDN--SNNWTDAYGVIQYADKFYKFKGYDIEE---------DTNAAGYAINPYGLVPAV 222 (470) T ss_pred eeEEEEcCCCCcceEEEEEEEEEe--cCCeeEEEEEEEecCeEEEEEeccccc---------ccccccccccCCCccceE Confidence 999999999888899999998754 345567788999999999887654332 233456778999999999 Q ss_pred eecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCcc--chhhhhhhhCceeeec-----CCCcee Q lcl|NC_018086. 236 EIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSAD--SDSISNMKNDRVIVTD-----EDGMVK 308 (511) Q Consensus 236 ~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~--~~~~~~~~~~~~i~~~-----~~~~~~ 308 (511) +|+|+++|+|+|+++++|||+||+++|++++.++++++|+++++|+..+.+ .++...+...+++.++ ++++++ T Consensus 223 ~~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (470) T protein:vir:99 223 EFFENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEGNPKFDFKNNRVLYVSQLDPDTNPQIG 302 (470) T ss_pred eecCCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccccchhhhhhhcceeeecCCCCCCCCcce Confidence 999999999999999999999999999999999999999999999865443 3456667777777764 467899 Q ss_pred eeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 309 FITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEF 387 (511) Q Consensus 309 ~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 387 (511) |++++.+.+.+++++++|.+.|+.+|++|+++++.+ |++||+||++++++|..|++++++.|+.+|++++++|+.+++. T Consensus 303 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 382 (470) T protein:vir:99 303 FIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFN 382 (470) T ss_pred EEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 999999999999999999999999999999998887 6899999999999999999999999999999999999999876 Q ss_pred cCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_018086. 388 MNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSA 467 (511) Q Consensus 388 ~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 467 (511) .+. ..++..+++++|++++|.|.++.++++++++|++|.||++.+||++ |+++|++||++|+++..+........... T Consensus 383 ~~~-~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~giis~et~l~~l~~v-d~~~E~eri~~E~~~~~~~~~~~~~~~d~ 460 (470) T protein:vir:99 383 NKQ-DQELWSELDFKFTRNLPEDMASAIDNAKNAEGIVSKKTQLGMIPDI-EPDAEMKQIAKEKADAIKQTQQLSMPIDI 460 (470) T ss_pred cCC-cccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhCCCC-CHHHHHHHHHHHHHHHHHHHHhhcCCCCc Confidence 554 3556778999999999999999999999999999999999999998 79999999999987766544322211111 Q ss_pred CCCCCCccccc Q lcl|NC_018086. 468 VQGASTAAANK 478 (511) Q Consensus 468 ~~~~~~~~~~~ 478 (511) . +.++...++ T Consensus 461 ~-~~d~~~ee~ 470 (470) T protein:vir:99 461 L-KRDNNAEEE 470 (470) T ss_pred C-CCCCCccCC Confidence 1 011110000 No 35 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=100.00 E-value=8.8e-93 Score=525.26 Aligned_cols=486 Identities=16% Similarity=0.140 Sum_probs=368.2 Q ss_pred cCchhhHhhhhccCCCHH-HHHHHHHHH--HHHHHHHHHHHHHhcCCCcccccCCc---------CccccccceeccchH Q lcl|NC_018086. 14 ITTNIRRKHFIRRNFDLR-ELITLAEMH--SRSSSAYGVLYDYYKGNHIAIQSRTF---------DDTNKPNSKIVHNFP 81 (511) Q Consensus 14 ~~~~~~~~~~~~~~~~~~-~l~~~~~~~--~~~~~~~~~~~~yY~G~~~~~~~~~~---------~~~~~~~~ri~~n~~ 81 (511) ++.+ ++...-.... .|.+.+..| ..++++|.++++||.|+|+|+.++.. ....++++|+++||+ T Consensus 1 ~~~~----~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~ 76 (537) T protein:vir:78 1 MTSP----LLNKPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFF 76 (537) T ss_pred CCcc----cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchH Confidence 2221 1111111122 233444444 36779999999999999999866532 355678899999999 Q ss_pred HHHHHHHHhhhhccCceecCchhhHH----HHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccce Q lcl|NC_018086. 82 KLLVDTSTAYLAGEPITESGDEKTIK----AMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNC 157 (511) Q Consensus 82 k~ivd~~~~~l~g~~~~~~~d~~~~~----~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~ 157 (511) ++||++.++|++|+|++++++++..+ .+.++ ..|+++..+.+++++++++|+||+++|.+++|+++++.++|.++ T Consensus 77 k~Ivd~~~~yl~G~Pv~~~~~d~~~~e~~~~l~~~-~~~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~~~~~i~p~~~ 155 (537) T protein:vir:78 77 TELVDQLAQYLLSNGVEVKVKDEDNTQLDEILQEY-FDEDFQATIDTLVTNASKKGFEGIFARTTSEGKLKFQTVDGLTL 155 (537) T ss_pred HHHHHHHhhhhcccCceeecCcchhHHHHHHHHHH-hhccHHHHHHHHHHHHhhcCeeEEEeeecCCCceEEEEEcccee Confidence 99999999999999999987654333 34444 35899999999999999999999999999999999999999999 Q ss_pred EEEecCCCCCceEEEEEEEEEeec----CCcceEEEEEEEcCCcEEEEEEccCccccccc-------------------- Q lcl|NC_018086. 158 LIAYSADLDEEPVAAIYYNTVISD----ITGHQIRTYEVYTEDLIYKFSTDDEREVYREI-------------------- 213 (511) Q Consensus 158 ~~v~d~~~~~~~~~~v~~~~~~~~----~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-------------------- 213 (511) ||+||+.. ++.+++++|..... .++..++++++||++.+++|....++...... T Consensus 156 ~pv~d~~~--~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~ 233 (537) T protein:vir:78 156 IPVFDDYG--VLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEEST 233 (537) T ss_pred EEEEcCCC--CceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeeccccc Confidence 99999754 57778887765432 34467889999999999999877665422110 Q ss_pred -ccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhh Q lcl|NC_018086. 214 -PEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISN 292 (511) Q Consensus 214 -~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~ 292 (511) ............+|+||+||||+|+||.+|+|+|+++++|||+||.++|++++.+++|++|+++++|+.++..+++..+ T Consensus 234 ~~~~~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~ 313 (537) T protein:vir:78 234 DADFEDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQN 313 (537) T ss_pred cccccccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHH Confidence 0112234556788999999999999999999999999999999999999999999999999999999988887888889 Q ss_pred hhhCceeeecC-CCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 293 MKNDRVIVTDE-DGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFR 371 (511) Q Consensus 293 ~~~~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~ 371 (511) ++..+++.+++ +++++|++++.+.++++.++++|++.||.+|++|+.+...+||+||+||++++++|.+||..+++.|+ T Consensus 314 l~~~~~i~v~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~~~gn~SGvAlk~~~~~l~~ka~~ke~~f~ 393 (537) T protein:vir:78 314 IKAKKMIGVNGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNSTAVGDGNVTNVVIKSRYTLLAMKARKMETSLR 393 (537) T ss_pred HhhcCceeecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCCccccccCCcHHHHHHHHhhHHHHHHHHHHHHH Confidence 99999998874 68899999999999999999999999999999999988888999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_018086. 372 KVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR--DMLPDETIINQFPWITDARQEVEKADA 449 (511) Q Consensus 372 ~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~--g~~s~et~~~~l~~v~d~~~E~~ri~~ 449 (511) .+|++++++|+.+++..+. ..++..++.++|++++|.|+++.|++++++. |++|++|+++++|+++|++.| +++++ T Consensus 394 ~~l~~~~~~i~~~~~~~~~-~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT~l~~~p~vdd~e~e-k~~~e 471 (537) T protein:vir:78 394 KVLRWCADMVVSDIALRGL-GEYDSNDICFEIEPHVLANELDIATTRKTEAETEALKIGNIMTVAPRIGDDETL-KLIAE 471 (537) T ss_pred HHHHHHHHHHHHHHhhcCC-cccccceeeEEeccCCCCCHHHHHHHHHHHHhcCcchHHHHHHhCCCCCCHHHH-HHHHH Confidence 9999999999999987654 4667889999999999999999999999874 899999999999999998533 34444 Q ss_pred HHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 450 QRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 450 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) |.+...+...+...++..+.+......+...+..+..+.+. |++........-.-|.|| T Consensus 472 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~d~~~~~~~~~~~~~~~ 530 (537) T protein:vir:78 472 ELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQP---PVDPNQPVADPNVVPPTD 530 (537) T ss_pred HHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCC---CCCccCCCCCCCCCCCCC Confidence 43333333222222222222111111111111111111111 111111111111122222 No 36 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=100.00 E-value=6e-93 Score=526.16 Aligned_cols=465 Identities=23% Similarity=0.315 Sum_probs=379.0 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCcccccC--CcCccccccceec Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSR-SSSAYGVLYDYYKGNHIAIQSR--TFDDTNKPNSKIV 77 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~yY~G~~~~~~~~--~~~~~~~~~~ri~ 77 (511) |-.- |.+.+..... + .-+.++++.+.|.+++++|.. ++++|+++.+||+|+|+++..+ ......++++|++ T Consensus 1 ~~~~----~~~~~~~~~~-~-~~~~~~l~~~~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~ 74 (506) T protein:vir:94 1 MDYD----LTEHKQANLI-Y-QESLENLTPNKIMKFITHHFNYQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRAT 74 (506) T ss_pred CCcc----hhhhhcceee-c-ccchhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccCCcceee Confidence 3333 3333333321 1 123467888999999999854 6789999999999999875433 3456678899999 Q ss_pred cchHHHHHHHHHhhhhccCceecCch-hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccc Q lcl|NC_018086. 78 HNFPKLLVDTSTAYLAGEPITESGDE-KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMN 156 (511) Q Consensus 78 ~n~~k~ivd~~~~~l~g~~~~~~~d~-~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~ 156 (511) +||+++||++.++|++|+|+++++++ ...+.|+++|+.|+++..+.+++++++++|+||+++|.+++|+++++++||.+ T Consensus 75 ~n~~~~Iv~~~~~~l~G~p~~~~~~d~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~~i~~~~p~~ 154 (506) T protein:vir:94 75 HSFAKYIADFQTSYSVGNPINVKLPDDGSNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGEDNEEHLAKLDPLD 154 (506) T ss_pred cchHHHHHHHhhhhhcccCceeecCcchHHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccc Confidence 99999999999999999999997654 56678999999999999999999999999999999999999999999999999 Q ss_pred eEEEecCCCCCceEEEEEEEEEeecCCcc---eEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccc Q lcl|NC_018086. 157 CLIAYSADLDEEPVAAIYYNTVISDITGH---QIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFP 233 (511) Q Consensus 157 ~~~v~d~~~~~~~~~~v~~~~~~~~~~~~---~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 233 (511) ++++||+...++++++||+|......++. ...++++|+++.+++|.....+ +......+|+||.|| T Consensus 155 ~~~v~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~-----------~~~~~~~~~~~g~vP 223 (506) T protein:vir:94 155 TFVIYSTDVDPKPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIM-----------GKMQVDTTKPITTFP 223 (506) T ss_pred eEEEecCCCCCceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCc-----------cceeccccccCCccc Confidence 99999999888899999999766544332 4467889999998888654332 344556789999999 Q ss_pred eEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCc------------------------cchh Q lcl|NC_018086. 234 VLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSA------------------------DSDS 289 (511) Q Consensus 234 vv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~------------------------~~~~ 289 (511) ||+|+|++.|.|+|+++++|+|+||+++|++++.++++++|+++++|..... .... T Consensus 224 vv~~~n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 303 (506) T protein:vir:94 224 VVEFKNSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLEL 303 (506) T ss_pred eEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHH Confidence 9999999999999999999999999999999999999999999999964221 1133 Q ss_pred hhhhhhCceeeecC---------CCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHH Q lcl|NC_018086. 290 ISNMKNDRVIVTDE---------DGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPL 359 (511) Q Consensus 290 ~~~~~~~~~i~~~~---------~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l 359 (511) ...++.++++.+++ +++++|++++.+.+++++++++|.+.|+.+|++|+++++++ ||+||+||++++++| T Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Aik~~~~~l 383 (506) T protein:vir:94 304 IKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFASNSSGVAMQYKVLGT 383 (506) T ss_pred HhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccchHHHHHHHHHHH Confidence 34455667777664 35789999999999999999999999999999999998876 689999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCC Q lcl|NC_018086. 360 ENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITD 439 (511) Q Consensus 360 ~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d 439 (511) .+||.++++.|+.+|++++++|+.+++..+...+++..+++|.|++++|.|.++.|+++++++|++|++|+++++|+++| T Consensus 384 ~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~lp~v~d 463 (506) T protein:vir:94 384 VELASTKRRMFERGLYARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQAGATLPQKYLYQQLPGVTN 463 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCC Confidence 99999999999999999999999999987777778888899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCC Q lcl|NC_018086. 440 ARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKK 507 (511) Q Consensus 440 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (511) +++|++||++|++...+... .... .+..+..+...++. ..++ | T Consensus 464 ~~~E~~ri~~E~~~~~~~~~-~~~~-~~~~~~~~~~~~~~------------~~e~-----------~ 506 (506) T protein:vir:94 464 PQDIVDMMKEQSANGDYSFD-QNGV-ISNDGQTNTTATQT------------DEEV-----------R 506 (506) T ss_pred HHHHHHHHHHHHHHHhhcch-hhcC-CCcccCcccccccc------------ccCC-----------C Confidence 99999999999876544321 1100 00111101000000 0000 0 No 37 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=100.00 E-value=2.7e-92 Score=522.61 Aligned_cols=462 Identities=18% Similarity=0.197 Sum_probs=378.8 Q ss_pred CCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc------Cccccccce Q lcl|NC_018086. 2 AIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF------DDTNKPNSK 75 (511) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~------~~~~~~~~r 75 (511) -|.|+|=+.--+......... +...+..+.|.+++++|..++++|+++.+||.|+|++..+... ..+.++++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~k 79 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIK-PKYETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWR 79 (474) T ss_pred CeeeccCCCchhhhhHHHHhh-hccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhcccccccccccchh Confidence 234554433333333222111 2234566778889999999999999999999999998766533 345578899 Q ss_pred eccchHHHHHHHHHhhhhccCceecCch-hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcc Q lcl|NC_018086. 76 IVHNFPKLLVDTSTAYLAGEPITESGDE-KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSP 154 (511) Q Consensus 76 i~~n~~k~ivd~~~~~l~g~~~~~~~d~-~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p 154 (511) +++||+++||++.++|+||+|+++++++ +..+.++++|. |++...+.+++++++++|+||+++|.+++|+++++.++| T Consensus 80 i~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~i~~~~p 158 (474) T protein:vir:96 80 MFTNYHQNLVDQKVAYAVANPVTFSSDDDKSLKTIQEVLN-HKWDDKLVDILTAASNKGIEWLQPYIDENGEFKTFRVPA 158 (474) T ss_pred cccchHHHHHHhhhhhhcccCceeecCchHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeeEEEEEecCCCceEEEEEcc Confidence 9999999999999999999999998654 55566777765 788899999999999999999999999999999999999 Q ss_pred cceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccc---cccccccccceeccCCc Q lcl|NC_018086. 155 MNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIP---EELEIKDYEVHPNLLQK 231 (511) Q Consensus 155 ~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~g~ 231 (511) .+++|+|++....++.+++|+|.... ..++++||++.+++|...++........ ...........+|+||+ T Consensus 159 ~~~~~v~d~~~~~~~~~~vr~~~~~~------~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 232 (474) T protein:vir:96 159 EQAIPIWTNKERDTLKAFIRYYRLDG------AERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGR 232 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEeecC------ceEEEEEeCCeEEEEEecCCceeeccccccccccccccccccccCCCc Confidence 99999999988888999999997432 3367999999999998776654332211 12233445678999999 Q ss_pred cceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecC-CCceeee Q lcl|NC_018086. 232 FPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDE-DGMVKFI 310 (511) Q Consensus 232 iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~~ 310 (511) ||||+|+|+++|+|+|+++++|+|+||.++|++++.++++++|+++++|+.+++.+++..++..++++.+++ +++++|+ T Consensus 233 iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l 312 (474) T protein:vir:96 233 VPFIPFKNNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDEFMRNLKYYKAINVDGDGSGVDTI 312 (474) T ss_pred eeEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccccchhhhhhcCceEEecCCCCceeEE Confidence 999999999999999999999999999999999999999999999999998887778888888999999885 5789999 Q ss_pred ecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_018086. 311 TKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMN 389 (511) Q Consensus 311 ~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~ 389 (511) +++.+.++++.++++|.++|+.+|++|++++++++ ++||+||++++++|.+||.++++.|+.+|++++++|+.+++. T Consensus 313 ~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~-- 390 (474) T protein:vir:96 313 QIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKL-- 390 (474) T ss_pred eecCChHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-- Confidence 99999999999999999999999999999998875 799999999999999999999999999999999999988743 Q ss_pred CCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCC Q lcl|NC_018086. 390 KAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQ 469 (511) Q Consensus 390 ~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 469 (511) .++..+++|+|++++|.|+++.++++.+ +|++|++|++.++|+++|+++|++||++|+++..+......++ .. T Consensus 391 ---~~~~~~i~i~f~~~~p~~~~e~~~~~~~-ag~iS~et~~~~~~~v~d~~~E~~ri~~E~~e~~~~~~~~~~~---~~ 463 (474) T protein:vir:96 391 ---NIKVQDVEITFNFNVMVNELEQSQIGVQ-SQYLSKETVVTNHPWVDDPVAELERIEQDNIDFNKQLPPLEGD---AN 463 (474) T ss_pred ---CcccceeeEEeccCCCcCHHHHHHHHHh-cCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccccccc---cc Confidence 4566788999999999999999998765 5999999999999999999999999999987655432111111 11 Q ss_pred CCCCccccccC Q lcl|NC_018086. 470 GASTAAANKLD 480 (511) Q Consensus 470 ~~~~~~~~~~~ 480 (511) +...+...+.+ T Consensus 464 ~~~~d~~~e~~ 474 (474) T protein:vir:96 464 GRAQDNESETN 474 (474) T ss_pred cccCCCcccCC Confidence 11000000000 No 38 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=100.00 E-value=1.6e-91 Score=518.33 Aligned_cols=459 Identities=25% Similarity=0.355 Sum_probs=380.4 Q ss_pred CCC-------ccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCccccc---CCcCcc Q lcl|NC_018086. 1 MAI-------PNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHS-RSSSAYGVLYDYYKGNHIAIQS---RTFDDT 69 (511) Q Consensus 1 ~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~~~~~~~yY~G~~~~~~~---~~~~~~ 69 (511) |.. .|++.++|++.-.+ . ..+.++.+.|.+++++|. .++++|+++++||+|+|++... ...... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~----~-~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~ 75 (481) T protein:vir:10 1 MTVYTINNINTKFSPLANDDFVVS----D-LAELLKEENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYG 75 (481) T ss_pred CeeEeeehhchhcccccCceeeee----c-chhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCcccccccc Confidence 221 23455555544111 1 235678888999999985 6679999999999999876433 233456 Q ss_pred ccccceeccchHHHHHHHHHhhhhccCceecCc-hhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceE Q lcl|NC_018086. 70 NKPNSKIVHNFPKLLVDTSTAYLAGEPITESGD-EKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHR 148 (511) Q Consensus 70 ~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d-~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~ 148 (511) .++++|+++||+++||++.++|++|+|++++++ ++..+.++++|++|+|+..+.+++++++++|+||+++|.+++|+++ T Consensus 76 ~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~~ 155 (481) T protein:vir:10 76 DKADHRAVHNYAKYVSRFIVGYLTGNPITITHQDNQTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFEDRDT 155 (481) T ss_pred ccccceeecchHHHHHHHHHhhhccCCceEecCChhHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCeEE Confidence 778899999999999999999999999999754 5667789999999999999999999999999999999999999999 Q ss_pred EEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceecc Q lcl|NC_018086. 149 FKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNL 228 (511) Q Consensus 149 i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (511) +++++|++++++||+....++++++++|...+ .++..+.++++|+++.+++|...+++ |..++..+|+ T Consensus 156 i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~-~~~~~~~~~~~y~~~~i~~~~~~~~~-----------~~~~~~~~~~ 223 (481) T protein:vir:10 156 FKVLDPKSTFVVYDQTLDKKVVAGVRYFEKQD-KDKVPVQHVEVYTTDKIYYIEIKGGT-----------YHRVEEVEHY 223 (481) T ss_pred EEEEcccceEEEEcCCCCCceEEEEEEEEEee-CCCceEEEEEEEecCeEEEEEecCCc-----------eeeccccccc Confidence 99999999999999988888999999987554 44567788999999999999876543 4456788999 Q ss_pred CCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeee------- Q lcl|NC_018086. 229 LQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVT------- 301 (511) Q Consensus 229 ~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~------- 301 (511) ||.||||+|+|+.+|+|+|+++++|||+||+++|++++.++++++|+++++|....+. +....+...+++.+ T Consensus 224 ~g~vPvv~~~n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 302 (481) T protein:vir:10 224 YNDVPIIEYLNDQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDS-EDAKAFRDANMIHLEPGTNAN 302 (481) T ss_pred CCceeEEEeecCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCc-cchhhhhhccceecccccccc Confidence 9999999999999999999999999999999999999999999999999999654333 23444444444433 Q ss_pred --cCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 302 --DEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRY 378 (511) Q Consensus 302 --~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~ 378 (511) .++++++|++++.+.+++++++++|++.|+.+|++|+++++.+ +|+||+||++++++|..||+++++.|+.+|++++ T Consensus 303 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 382 (481) T protein:vir:10 303 GSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESMKYKLFGLEQVRAIKERLFKKGLMKRY 382 (481) T ss_pred CCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2356889999999999999999999999999999999998877 6899999999999999999999999999999999 Q ss_pred HHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 379 ELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIA 458 (511) Q Consensus 379 ~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~ 458 (511) ++++.+++..+. ..++..+++++|++++|+|.++.++++++++|++|++|++++||+++|+++|++||++|+++..+.. T Consensus 383 ~li~~~~~~~~~-~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl~g~is~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~ 461 (481) T protein:vir:10 383 KLLLNNVNLTGL-KQHNYAELTITFTPNLPKSMMESINAFNALSGGVSESTRLSLLDFIDNPKEELEKMQEEEAQREKQA 461 (481) T ss_pred HHHHHHHhccCC-CccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhh Confidence 999999887553 4567788999999999999999999999999999999999999999999999999999988776643 Q ss_pred HhhccccccCCCCCCccccccCCCCC Q lcl|NC_018086. 459 LQNFKQTSAVQGASTAAANKLDKNPA 484 (511) Q Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (511) ....-.+. ....++++++.| T Consensus 462 ~~~~~~~~------~~~~~~~dd~~g 481 (481) T protein:vir:10 462 DKRGYGEA------FENHLNVDDSNG 481 (481) T ss_pred hhccCCcc------CCCCCCCCCCCC Confidence 32211111 111222222222 No 39 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=100.00 E-value=1.7e-91 Score=518.28 Aligned_cols=455 Identities=18% Similarity=0.203 Sum_probs=380.4 Q ss_pred CCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc------Cccccccce Q lcl|NC_018086. 2 AIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF------DDTNKPNSK 75 (511) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~------~~~~~~~~r 75 (511) -+.|++.+.-.+.+..++... ++..++.+.|.++++.|..++++|+++++||.|+|++++++.. .++.++++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~k 79 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIK-PQYETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWR 79 (468) T ss_pred CccccCCcCceeehheeeccc-ccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccc Confidence 223444432223332222222 3456778889999999999999999999999999998766533 345678899 Q ss_pred eccchHHHHHHHHHhhhhccCceecCchh-hHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcc Q lcl|NC_018086. 76 IVHNFPKLLVDTSTAYLAGEPITESGDEK-TIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSP 154 (511) Q Consensus 76 i~~n~~k~ivd~~~~~l~g~~~~~~~d~~-~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p 154 (511) +++||++.||++.++|+||+|++++++++ ..+.++++|+ |+++..+.+++++++++|+||++||.+++|++++++++| T Consensus 80 i~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p 158 (468) T protein:vir:96 80 MYTNYHQNLVDQKVAYAVANPVTYGTEDEKSLKTIQEVLN-HKWDDKLVDILTAASNKGVEWIQPYVDEQGEFKTFRVPA 158 (468) T ss_pred cccchHHHHHHHHHhhhccCCceeccCChHHHHHHHHHHh-cCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEcc Confidence 99999999999999999999999986654 5567777775 789999999999999999999999999999999999999 Q ss_pred cceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccc---ccccccccccceeccCCc Q lcl|NC_018086. 155 MNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREI---PEELEIKDYEVHPNLLQK 231 (511) Q Consensus 155 ~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~g~ 231 (511) .+++++|++...+++.+++|+|.... ..++++|+++.+++|...++....... .....+......+|+||+ T Consensus 159 ~~~~~v~~~~~~~~~~~~ir~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (468) T protein:vir:96 159 EQAIPIWTNKERDELKAFIRLYELDG------GERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSWNR 232 (468) T ss_pred cceEEEEcCCCCCceEEEEEEEEecC------ceEEEEEeCCeEEEEEEcCCceeecccccccccccceeeccccccCCc Confidence 99999999988888999999986532 246799999999999877665432221 122234556778999999 Q ss_pred cceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecC--CCceee Q lcl|NC_018086. 232 FPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDE--DGMVKF 309 (511) Q Consensus 232 iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~--~~~~~~ 309 (511) ||||+|+|++.|+|+|+++++|+|+||.++|++++.++++++|+++++|+.+++.+++..+++.++++.+++ +++++| T Consensus 233 iPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~d~~~~~~~ 312 (468) T protein:vir:96 233 VPFIPFKNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDLEEFMYNLKYYKAINVDGDGSGGVDT 312 (468) T ss_pred ccEEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccchhhhhhhcCceEEecCCCCCcceE Confidence 999999999999999999999999999999999999999999999999999888888888888899998874 467899 Q ss_pred eecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_018086. 310 ITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFM 388 (511) Q Consensus 310 ~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~ 388 (511) ++++++.++++.++++|.++|+.+|++|++++++++ ++||+||+++++++.+||.++++.|+.+|++++++|+++++. T Consensus 313 l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~- 391 (468) T protein:vir:96 313 IQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKL- 391 (468) T ss_pred EeecCChHHHHHHHHHHHHHHHHHhCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC- Confidence 999999999999999999999999999999988774 799999999999999999999999999999999999988643 Q ss_pred CCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccC Q lcl|NC_018086. 389 NKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAV 468 (511) Q Consensus 389 ~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 468 (511) .++..+++|+|++++|.|.++.|++++++ |++|+||+++++|+++||++|++||++|+++..+.. +.++. T Consensus 392 ----~~d~~~i~i~f~~~~p~d~~e~a~~~~~~-g~iS~et~i~~l~~v~D~~~E~~ri~~E~~~~~~~~-~~~~~---- 461 (468) T protein:vir:96 392 ----SIKVQDVEITFNFNVMVNELEQSQIGVNS-QYLSKETVVTNHPWVDDPVAEMERIDQEELALPSIE-EGLNG---- 461 (468) T ss_pred ----CcccceeeEEecCCCCcCHHHHHHHHHhc-CCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHh-hccCC---- Confidence 35667899999999999999999998765 999999999999999999999999999987765532 11111 Q ss_pred CCCCCccccccC Q lcl|NC_018086. 469 QGASTAAANKLD 480 (511) Q Consensus 469 ~~~~~~~~~~~~ 480 (511) ...++|. T Consensus 462 -----~~~~~~~ 468 (468) T protein:vir:96 462 -----KENNEPT 468 (468) T ss_pred -----CCCCCCC Confidence 1111111 No 40 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=100.00 E-value=1.2e-90 Score=513.56 Aligned_cols=455 Identities=20% Similarity=0.303 Sum_probs=374.9 Q ss_pred hhcccccCchhhHhhhhccCCCHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHH Q lcl|NC_018086. 8 INAGDIITTNIRRKHFIRRNFDLRELITLAEMHS-RSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVD 86 (511) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd 86 (511) ++.+++... -.+.+++.+.+.+++++|. .++++|+++++||.|+|++.+++......++++|+++||+++||+ T Consensus 1 ~~~~~~~~~------~~~~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~iv~ 74 (489) T protein:vir:99 1 MLQEDFEAI------DYESKLWIDQLKNYISRFKAEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYITV 74 (489) T ss_pred CCccceeee------CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccCCcceeecchHHHHHH Confidence 555555444 3346678899999999986 567999999999999999998888888888999999999999999 Q ss_pred HHHhhhhccCceecCch-hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeee----CCCCceEEEEEcccceEEEe Q lcl|NC_018086. 87 TSTAYLAGEPITESGDE-KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWI----DRNKKHRFKAVSPMNCLIAY 161 (511) Q Consensus 87 ~~~~~l~g~~~~~~~d~-~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~----~~~g~~~i~~~~p~~~~~v~ 161 (511) +.++|+||+|+++++++ +..+.++++|+.|+|+..+.+++++++++|+||+++|. +++|++++++++|.+++++| T Consensus 75 ~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~~~i~~~~p~~~~~v~ 154 (489) T protein:vir:99 75 FEQGYMLGVPVEYKNENKDLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTEVKLYQLPAEQTFVIY 154 (489) T ss_pred HHhhhhccCCceeecCChhHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcceEEEEEcccceEEEE Confidence 99999999999997654 56678899999999999999999999999999999986 56788999999999999999 Q ss_pred cCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCc Q lcl|NC_018086. 162 SADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANE 241 (511) Q Consensus 162 d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~ 241 (511) ++....+++++|++|... +..++.+.++++|+++.+++|......+ ..+......+|+||.||||+|+|++ T Consensus 155 dd~~~~~~~~~i~~~~~~-~~~~~~~~~~~~y~~~~i~~~~~~~~~~--------~~~~~~~~~~~~~g~vPvv~~~n~~ 225 (489) T protein:vir:99 155 DDTYQRNSLMAVHFYDID-YGSGKRKQIIKAYTSDTIYTYEDYNLET--------KGMRLKDYEGHFFKGVPVNEYANNE 225 (489) T ss_pred cCCCCCceEEEEEEEEEe-cCCCceEEEEEEEeCCcEEEEEecCCCc--------ccceecccccccCCceeEEEeecCC Confidence 998888899999988754 4455677889999999999988654321 2345567889999999999999999 Q ss_pred ccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccch--hh--------------hhhhhCceeeecC-- Q lcl|NC_018086. 242 ERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSD--SI--------------SNMKNDRVIVTDE-- 303 (511) Q Consensus 242 ~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~--~~--------------~~~~~~~~i~~~~-- 303 (511) .|+|+|+++++|+|+||.++|++++.++++++|+++++|........ .. ...+..+++.+.+ T Consensus 226 ~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (489) T protein:vir:99 226 ERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNP 305 (489) T ss_pred CCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccccchhhhhhcccccccccccccccccceeeeecccc Confidence 99999999999999999999999999999999999999975433211 11 1112234444433 Q ss_pred -----CCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 304 -----DGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKR 377 (511) Q Consensus 304 -----~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~ 377 (511) +.+++|++++.+.+++++++++|.+.|+.+|++|+++++++ ||+||+||+++++++.+||.++++.|+.+|+++ T Consensus 306 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~ 385 (489) T protein:vir:99 306 NPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKFSGVQSGESMKYKLMASDNYREKQERLFKKGLMRR 385 (489) T ss_pred CccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 34789999999999999999999999999999999988776 689999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCC--ccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCC--CHHHHHHHHHHHHHH Q lcl|NC_018086. 378 YELVCSYLEFMNKA--KDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWIT--DARQEVEKADAQRQK 453 (511) Q Consensus 378 ~~li~~~~~~~~~~--~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~--d~~~E~~ri~~E~~~ 453 (511) +++|+.+++..+.. ......+++|+|++++|.|.++.++++++++|++|+||+++++|+++ |+++|++||++|++. T Consensus 386 ~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~giis~et~~~~l~~v~~~d~~~E~~ri~~E~~~ 465 (489) T protein:vir:99 386 LRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLYGIVSDQTIFEILNTVTGVDAEAELKRLKEEADK 465 (489) T ss_pred HHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhcCCCCchhHHHHHHHHHHHHHH Confidence 99999998876543 23345679999999999999999999999999999999999999997 788999999998766 Q ss_pred HHHHHHhhccccccCCCCCCccccccCCCC Q lcl|NC_018086. 454 RADIALQNFKQTSAVQGASTAAANKLDKNP 483 (511) Q Consensus 454 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 483 (511) ..... .. ...+....+.++..++| T Consensus 466 ~~~~~--~~----~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 466 KQSLP--EP----RLVGDASGQEEPTAEKP 489 (489) T ss_pred Hhccc--cc----cccCCCCCCcCCCCCCC Confidence 54421 11 11111111111111112 No 41 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=100.00 E-value=1.8e-90 Score=512.58 Aligned_cols=422 Identities=22% Similarity=0.329 Sum_probs=356.8 Q ss_pred HHH-HHHHHHHHHHHHHHHhcCCCccccc-CCcCccccccceeccchHHHHHHHHHhhhhccCceecCc----hhhHHHH Q lcl|NC_018086. 36 LAE-MHSRSSSAYGVLYDYYKGNHIAIQS-RTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGD----EKTIKAM 109 (511) Q Consensus 36 ~~~-~~~~~~~~~~~~~~yY~G~~~~~~~-~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d----~~~~~~l 109 (511) ++. .+..+++||+++++||+|+|+++.. ......+++++|+++||+++||++.++|+||+|++++.+ ++..+.+ T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~~~~~~l 80 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGVMEGGSADQLSTI 80 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheeccCceEeeCCCccHHHHHHH Confidence 333 3356778999999999999997654 455677889999999999999999999999999998643 3456678 Q ss_pred HHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEE Q lcl|NC_018086. 110 QPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRT 189 (511) Q Consensus 110 ~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~ 189 (511) +++|+.|+|+.++.+++++++++|+||+++|.+++|+++++.++|++++++||+...+++++++++|...+ ..+ T Consensus 81 ~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~~~------~~~ 154 (440) T protein:vir:95 81 KDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVDRVVLISPLEMFVIRDLTVEQNIIAAVHLPIYAD------KVN 154 (440) T ss_pred HHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEecC------ceE Confidence 99999999999999999999999999999999999999999999999999999998888999999886432 246 Q ss_pred EEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 190 YEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIA 269 (511) Q Consensus 190 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~ 269 (511) +++||++.+++|.....+ ...+...+..+|+||.||||+|+|+++|+|+|+++++|||+||+++|++++.++ T Consensus 155 ~~vyt~~~~~~~~~~~~~--------~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~lida~~~~~s~~~~~~~ 226 (440) T protein:vir:95 155 MTVYTKDKVITYKPYSNN--------SVRLVVDDVKKHSYNDVPVVEWWNNRFRMGDYESEISLIDAYDAGQSDTANYMS 226 (440) T ss_pred EEEEeCCeEEEEEEecCC--------ccceeecceeeccCceeeEEEeeCCCCCCCchhhhHHHHHHHHHHHHHHHHHHH Confidence 789999999998865432 234566788999999999999999999999999999999999999999999999 Q ss_pred HhcCceeEeecCCCC--ccchhhhhhhhCceeee---------cCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccc Q lcl|NC_018086. 270 YWNDAYLWLQGFDLS--ADSDSISNMKNDRVIVT---------DEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPD 338 (511) Q Consensus 270 ~~~~p~l~~~G~~~~--~~~~~~~~~~~~~~i~~---------~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 338 (511) +|++|+++++|.... ..++....++..+++.+ .++++++|++++.+.+++++++++|.++|+.+|++|+ T Consensus 227 ~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~ 306 (440) T protein:vir:95 227 DLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFLKTGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPN 306 (440) T ss_pred HhhcceeeeecccccCCCCccchhhhhhccceecccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcc Confidence 999999999996432 23455556665555443 3467899999999999999999999999999999999 Q ss_pred cccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHH Q lcl|NC_018086. 339 LVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADM 417 (511) Q Consensus 339 ~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~ 417 (511) ++++.+ ||+||+||++++++|..||+++++.|+.+|++++++|+.+++.... ..++..+++|.|++++|+|.++.|++ T Consensus 307 ~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~-~~~~~~~v~i~f~~~~p~~~~~~ad~ 385 (440) T protein:vir:95 307 LDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAING-PVIEANKLTFTFHPNIPQDVWTEIKA 385 (440) T ss_pred cccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC-cccccccceEEeCCCCCCCHHHHHHH Confidence 999886 5899999999999999999999999999999999999999887653 45677789999999999999999999 Q ss_pred HHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCC Q lcl|NC_018086. 418 AVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGAST 473 (511) Q Consensus 418 ~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (511) +++++|++|+||+++++|+++ +++|++||++|+++......+..+...+..+..+ T Consensus 386 ~~kl~g~iS~et~~~~l~~~d-~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 386 YIEAGGEISQETLMENASFTD-YKTEHSRILKQGGSSDLEIGQIVGDADVGQADTE 440 (440) T ss_pred HHHHhccCcHHHHHHhCCCCC-cHHHHHHHHHHHHHhhhhHHhhccCCCCCCcCCC Confidence 999999999999999999985 4678999999887655443222221111111111 No 42 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=100.00 E-value=8.2e-85 Score=481.58 Aligned_cols=457 Identities=14% Similarity=0.077 Sum_probs=362.1 Q ss_pred chhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhcc Q lcl|NC_018086. 16 TNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGE 95 (511) Q Consensus 16 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~ 95 (511) ....+..+.+.+.....+.+|+.+|..+++|++++.+||+|+|++.+.+...++...+.++++||+++||++.++|+++. T Consensus 1 ~~~~~~~~~e~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l~~~ 80 (486) T protein:vir:42 1 MTAPLPGMEEIEDPAVVREEMISAFEDASKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQAVE 80 (486) T ss_pred CCCCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHHhhhccc Confidence 22333344444454555667999999999999999999999999887766666555677889999999999999999999 Q ss_pred CceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC--------CCceEEEEEcccceEEEecCCCCC Q lcl|NC_018086. 96 PITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR--------NKKHRFKAVSPMNCLIAYSADLDE 167 (511) Q Consensus 96 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~--------~g~~~i~~~~p~~~~~v~d~~~~~ 167 (511) +++..++++..+.++++|+.|+|+.++.++++++++|||||++||.++ ++.++++++||.+++++||+.. + T Consensus 81 g~~~~~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i~d~~~-~ 159 (486) T protein:vir:42 81 GFRLGDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHAEIDPRI-N 159 (486) T ss_pred ceecCCCchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEecccceEEEEeCCC-C Confidence 999887778888899999999999999999999999999999999865 4567899999999999999875 4 Q ss_pred ceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCc-----c Q lcl|NC_018086. 168 EPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANE-----E 242 (511) Q Consensus 168 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~ 242 (511) ++.+++++|.. .+++.+.++++|+++.+++|....+++ ......+|+||.||||+|+|+. + T Consensus 160 ~~~~~~~~~~~---~~~~~~~~~~~y~~~~~~~~~~~~~~~-----------~~~~~~~h~~g~vPvv~~~n~~~~~~~~ 225 (486) T protein:vir:42 160 RVSKAIRVAYD---KEGNEIQAATLYTPMETIGWFRADGEW-----------AEWFNVPHGLGVVPVVPLPNRTRLSDLY 225 (486) T ss_pred CeEEEEEEEEe---cCCCeEEEEEEEcCCcEEEEEecCCcE-----------EeecceecCCCCceEEEeccccccCCCC Confidence 68889988753 345667789999999999998765543 3456778999999999999974 5 Q ss_pred cCchhHH-HHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccch------hhhhhhhCceeeecCCCceeeeecCCC Q lcl|NC_018086. 243 RLGDFEA-QLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSD------SISNMKNDRVIVTDEDGMVKFITKDVN 315 (511) Q Consensus 243 g~s~~~~-v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~------~~~~~~~~~~i~~~~~~~~~~~~~~~~ 315 (511) |.|+|+. |++|||+||+++|++++.++++++|+++++|.+.+.... .......++++. .+++++++.++ + T Consensus 226 G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~q~--~ 302 (486) T protein:vir:42 226 GTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDSETGQTLFDAYLARILA-FEDAEGKIQQF--S 302 (486) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCccccccccccccchhhhhhchhcc-cCCCCceEEee--c Confidence 8999985 999999999999999999999999999999986543221 111112233443 44567787554 4 Q ss_pred HHHHHHHHHHHHHHHHHHhCccccccccccC-----ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_018086. 316 DKHIENIKNRAKLDIFSLSQTPDLVSKDFTA-----ASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNK 390 (511) Q Consensus 316 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-----~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~ 390 (511) ..+++++++.|+.+|++++++|++++..||. +||+||++++.+|..||+++++.|+.+|++++++++.+++..+. T Consensus 303 ~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~ 382 (486) T protein:vir:42 303 AAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKGGDV 382 (486) T ss_pred ccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Confidence 5678899999999999999999888766642 69999999999999999999999999999999999887664332 Q ss_pred CccccccceeEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|NC_018086. 391 AKDLKPYEVTPVFVRNLPQSYAELADMAVKLR----DMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTS 466 (511) Q Consensus 391 ~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~----g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~ 466 (511) ..+..++++.|+++.|+|.++.+++++|++ |++|++|++.++|+++|+.+|++|+++|+.+.....+..++... T Consensus 383 --~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~ 460 (486) T protein:vir:42 383 --PPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVDAD 460 (486) T ss_pred --cccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 345568999999999999999999999985 78999999999999999999999999888777666666555444 Q ss_pred cCCCCCCccccccCCCCCCccccccCCCCcc Q lcl|NC_018086. 467 AVQGASTAAANKLDKNPANTSTITTTDPVAA 497 (511) Q Consensus 467 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (511) ...++.++..+++.++++. .++++.+ T Consensus 461 ~~~~~~~~~~~~~~~~~~~-----~~~~~~~ 486 (486) T protein:vir:42 461 PTVPGSPSPTAPPKPQPAI-----ESSGGDA 486 (486) T ss_pred CCCCCCCCCCCCCCCCccc-----CCCCCCC Confidence 3333333333333222221 1111111 No 43 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=100.00 E-value=2.2e-84 Score=479.18 Aligned_cols=457 Identities=13% Similarity=0.091 Sum_probs=361.2 Q ss_pred cCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhh Q lcl|NC_018086. 14 ITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLA 93 (511) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~ 93 (511) ++. .+..+.+..++...+..|+++|..+++|++++.+||+|+|++.+.+...++..+++++++||+++||++.++|++ T Consensus 1 ~~~--~i~~~~~~~~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~ 78 (485) T protein:vir:24 1 MTA--PLPGQEEIADPAIARDEMVSAFEDQNQNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAERQA 78 (485) T ss_pred CCC--CCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHHHhhhhc Confidence 111 122233444555556779999999999999999999999998877776667777889999999999999999999 Q ss_pred ccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCC--------CceEEEEEcccceEEEecCCC Q lcl|NC_018086. 94 GEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRN--------KKHRFKAVSPMNCLIAYSADL 165 (511) Q Consensus 94 g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~--------g~~~i~~~~p~~~~~v~d~~~ 165 (511) ++|++...+++..+.++++|+.|+|+.++.++++++++||+||++||.+++ +.++|+.+||.+++++||+.. T Consensus 79 ~~g~~~~~~~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~i~D~~~ 158 (485) T protein:vir:24 79 VEGFRLGDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMYAEIDPRI 158 (485) T ss_pred cCceecCCCchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeEEEeeCCc Confidence 999998887888889999999999999999999999999999999998864 567899999999999999887 Q ss_pred CCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCc---- Q lcl|NC_018086. 166 DEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANE---- 241 (511) Q Consensus 166 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~---- 241 (511) + ++.+++++|.. ..+..+.++++|+++.+++|....+. |......+|+||.||||+|+|++ T Consensus 159 ~-~~~~~~~~~~~---~~~~~~~~~~~y~~~~~~~~~~~~~~-----------~~~~~~~~h~~g~vPvv~f~n~~~~~~ 223 (485) T protein:vir:24 159 G-RPAKAIRVAYD---AEGNEIQAATLYTPNETFGWFRAEGE-----------WVEWFSDPHGLGAVPVVPLPNRTRLSD 223 (485) T ss_pred C-ceeEEEEEEEe---ecCCeEEEEEEEcCCcEEEEEecCCc-----------eEeecccccCCCcccEEEeccCcccCC Confidence 4 57777776643 23556778899999999999876544 33455678999999999999874 Q ss_pred -ccCchhH-HHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccch----hhhhhh-hCceeeecCCCceeeeecCC Q lcl|NC_018086. 242 -ERLGDFE-AQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSD----SISNMK-NDRVIVTDEDGMVKFITKDV 314 (511) Q Consensus 242 -~g~s~~~-~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~----~~~~~~-~~~~i~~~~~~~~~~~~~~~ 314 (511) +|+|+|+ .|++|||+||+++|++++.++++++|+++++|.+.+.... ....+. ..+.+...+++++++.+ . T Consensus 224 ~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q--~ 301 (485) T protein:vir:24 224 LYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILAFEDAEGKIQQ--F 301 (485) T ss_pred cCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccccccccccchhhhcccceeccCCCCceEEe--e Confidence 6899998 5999999999999999999999999999999986543221 111111 12233344456777654 4 Q ss_pred CHHHHHHHHHHHHHHHHHHhCcccccccccc-----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_018086. 315 NDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-----AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMN 389 (511) Q Consensus 315 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~ 389 (511) +.+++++++++|+..|++++++|++++..|| ++||+||++++.+|..||+++++.|+.+|++++++++.+.+..+ T Consensus 302 ~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~ 381 (485) T protein:vir:24 302 SAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKGGD 381 (485) T ss_pred cccchHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 5577889999999999999999988877664 26999999999999999999999999999999999988765433 Q ss_pred CCccccccceeEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_018086. 390 KAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR----DMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQT 465 (511) Q Consensus 390 ~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~----g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~ 465 (511) ...+..+++++|+++.|+|.++.+++++|+. |++|++|+++++|+++|+.+|++++++|+.+.....+..+... T Consensus 382 --~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~ 459 (485) T protein:vir:24 382 --VPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLLGTMVDA 459 (485) T ss_pred --CccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHHHHHhhhhhhHHHhhccc Confidence 3456678999999999999999999999984 5799999999999999888899999888776665555655554 Q ss_pred ccCCCCCCccccccCCCCCCccccccCCCCccc Q lcl|NC_018086. 466 SAVQGASTAAANKLDKNPANTSTITTTDPVAAK 498 (511) Q Consensus 466 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (511) ....++.++..++++++++..+ +.+. T Consensus 460 ~~~~~~~~~~~e~~~~~~~~~~-------~~~a 485 (485) T protein:vir:24 460 DPTVPGSPNPTPAPKPQPAIEG-------GDSA 485 (485) T ss_pred CCCCCCCCCCCCCCCCccCCCC-------CCCC Confidence 4444333333333333332221 1111 No 44 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=100.00 E-value=8.9e-83 Score=470.41 Aligned_cols=454 Identities=14% Similarity=0.104 Sum_probs=354.7 Q ss_pred cCchhhHhhhhccCCCHHHHHH-HHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhh Q lcl|NC_018086. 14 ITTNIRRKHFIRRNFDLRELIT-LAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYL 92 (511) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l 92 (511) +.+ -...+++++++.+++ ++..+..+.++++++.+||+|+|++.+.+...++...+.++++||+++||+++++|+ T Consensus 1 ~~~----~~~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l 76 (484) T protein:vir:77 1 MTS----PLQKQENVDPEKAREEMLNLFTERTQDLGDNTAYYESERRPDAVGVTVPQQMQKLLAHVGYPRLYIDAIAARQ 76 (484) T ss_pred CCC----cccccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhHHhhhhhcCcHHHHHHHHHhhh Confidence 222 123446777776665 666667778899999999999999877666666666667788999999999999999 Q ss_pred hccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCc--------eEEEEEcccceEEEecCC Q lcl|NC_018086. 93 AGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKK--------HRFKAVSPMNCLIAYSAD 164 (511) Q Consensus 93 ~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~--------~~i~~~~p~~~~~v~d~~ 164 (511) +++|++..++++..+.++++|++|+|+.++.++++++++||+||++||.+++|. ++|+.+||.+++++||+. T Consensus 77 ~~~g~~~~~~~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~D~~ 156 (484) T protein:vir:77 77 ELEGFRLGGADKADEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNLYAQIDPR 156 (484) T ss_pred ccCceecCCcchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEeccceeEEEecCC Confidence 999999888788888999999999999999999999999999999999998774 568999999999999987 Q ss_pred CCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCc--- Q lcl|NC_018086. 165 LDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANE--- 241 (511) Q Consensus 165 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~--- 241 (511) . +++.+++++|... .+..+.++++|+++.+++|....+. |...+..+|+||+||||+|.|+. T Consensus 157 ~-~~~~~a~~~~~~~---~~~~~~~~~~y~~~~~~~~~~~~~~-----------~~~~~~~~~~~g~vPvv~f~N~~~~~ 221 (484) T protein:vir:77 157 T-RQVMRAIRAIEDE---EGNEVIGATLYLPNNTVIWNREDGQ-----------WVQVANVAHNLEMVPVIPIPNRTRLS 221 (484) T ss_pred C-CceEEEEEEEEee---cCCcEEEEEEEecCeEEEEEecCCc-----------eEeeccccCCCCCcceEEeccccccC Confidence 5 5788999887643 2445677899999999998766543 44556789999999999999874 Q ss_pred --ccCchhHH-HHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchh----hhhhh--hCceeeecCCCceeeeec Q lcl|NC_018086. 242 --ERLGDFEA-QLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDS----ISNMK--NDRVIVTDEDGMVKFITK 312 (511) Q Consensus 242 --~g~s~~~~-v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~----~~~~~--~~~~i~~~~~~~~~~~~~ 312 (511) +|+|+|++ |++|+|+||+++|++++.++++++|+++++|.+.+..... ...+. .++++.+ +++++++.+ T Consensus 222 ~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~q- 299 (484) T protein:vir:77 222 DLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGVDPETGQTLFDAYLARILAF-EDHESKAQQ- 299 (484) T ss_pred ccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhcccccccchhhhhhhhhhccc-CCCCceeEe- Confidence 68999984 9999999999999999999999999999999865542211 11111 2234444 445677654 Q ss_pred CCCHHHHHHHHHHHHHHHHHHhCcccccccccc-----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 313 DVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-----AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEF 387 (511) Q Consensus 313 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 387 (511) ++..+.+++++.|+.+|+++++++++++..|| ++||+||++++.+|..|++++++.|+.+|++++++++.+.+. T Consensus 300 -~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~ 378 (484) T protein:vir:77 300 -FSAAELRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVMNG 378 (484) T ss_pred -ecCCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 44566788999999999999988887766554 269999999999999999999999999999999999887653 Q ss_pred cCCCccccccceeEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_018086. 388 MNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR----DMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFK 463 (511) Q Consensus 388 ~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~----g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~ 463 (511) . ....+..++++.|+++.|+|.++.+++++|++ |++|.+|++.++|+++|+.+||+++++|+.+.....+.... T Consensus 379 ~--~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~~~~~ 456 (484) T protein:vir:77 379 G--DIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEEQAQGLGLMGTMF 456 (484) T ss_pred C--CcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHhhhc Confidence 2 22345667999999999999999999999984 48999999999999999999999999887776655555444 Q ss_pred ccccCCCCCCccccccCCCCCCcccccc Q lcl|NC_018086. 464 QTSAVQGASTAAANKLDKNPANTSTITT 491 (511) Q Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (511) ...+..++.+...+++++.+......++ T Consensus 457 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (484) T protein:vir:77 457 GTDPSGGGNPDNPETPEPQPNPAEEAAA 484 (484) T ss_pred cccccCCCCCCCCCcccccCCCccccCC Confidence 4333333333333222222211111111 No 45 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=100.00 E-value=5.5e-82 Score=466.06 Aligned_cols=455 Identities=12% Similarity=0.068 Sum_probs=347.4 Q ss_pred cCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhh Q lcl|NC_018086. 14 ITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLA 93 (511) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~ 93 (511) +++ ..+.|..|+.+|..+++|++++++||+|+|++.+.....++...++|+++||+++||++.++|++ T Consensus 1 ~~t------------~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~ 68 (480) T protein:vir:78 1 MTT------------YHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD 68 (480) T ss_pred CCC------------HHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhhc Confidence 111 12346678899999999999999999999998777666666667889999999999999999999 Q ss_pred ccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeee------CCCCceEEEEEcccceEEEecCCCCC Q lcl|NC_018086. 94 GEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWI------DRNKKHRFKAVSPMNCLIAYSADLDE 167 (511) Q Consensus 94 g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~------~~~g~~~i~~~~p~~~~~v~d~~~~~ 167 (511) +++++...|++..+.++++|+.|+|+.++.++++++++||+||++||. +++|.++++++||.+++++||+...+ T Consensus 69 ~~g~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~~p~~~~~i~D~~~~~ 148 (480) T protein:vir:78 69 IEGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTR 148 (480) T ss_pred cCceecCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEEcccceEEEEcCCCcc Confidence 999998888888899999999999999999999999999999999986 45788999999999999999999889 Q ss_pred ceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCc-----c Q lcl|NC_018086. 168 EPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANE-----E 242 (511) Q Consensus 168 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~ 242 (511) ++.+++++|...++ ...+.++++|+++.+++|...+..... .....+..+|+||.||||+|+|+. + T Consensus 149 ~~~~~i~~~~~~d~--~~~~~~~~~y~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~ 219 (480) T protein:vir:78 149 RVTRAVRLYTTRDD--VAVPDRATLYLPDETVPLRRNGGLNDQ-------WVVDGDVIKHGLGVVPVVPLTNDPRLGNRY 219 (480) T ss_pred ceEEEEEEEEeecC--CcceEEEEEEeCCeEEEEEecCCCccc-------ccccccccccCCCCcceEEeecccccCCcc Confidence 99999999865543 345678999999999999876543211 112356789999999999999874 5 Q ss_pred cCchhHH-HHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchh--h--hhhhhCceeeecCCCceeeeecCCCHH Q lcl|NC_018086. 243 RLGDFEA-QLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDS--I--SNMKNDRVIVTDEDGMVKFITKDVNDK 317 (511) Q Consensus 243 g~s~~~~-v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~--~--~~~~~~~~i~~~~~~~~~~~~~~~~~~ 317 (511) |+|+|+. |++|+|+||+++|++++.+++|++|+++++|.+.+..... . .....++++. .+++++++.+++ .+ T Consensus 220 G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~--~~ 296 (480) T protein:vir:78 220 GRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT-LASEAAKISEFK--AA 296 (480) T ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhCCCccccccccccchhhhhhhhhcc-CCCCCceEEecC--cc Confidence 8999985 9999999999999999999999999999999875432211 1 1111233443 345678886654 34 Q ss_pred HHHHHHHHHHHHHHHHhCccccccccccC-----ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_018086. 318 HIENIKNRAKLDIFSLSQTPDLVSKDFTA-----ASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAK 392 (511) Q Consensus 318 ~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-----~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~ 392 (511) ..+++++.++.+|+.+++++++++..||+ +||+||++++.+|..||+++++.|+.+|++++++++.+++. .. T Consensus 297 ~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~~---~~ 373 (480) T protein:vir:78 297 ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR---EV 373 (480) T ss_pred CHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC---Cc Confidence 56778888888888888877776655542 69999999999999999999999999999999999887652 23 Q ss_pred cccccceeEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccC Q lcl|NC_018086. 393 DLKPYEVTPVFVRNLPQSYAELADMAVKLR----DMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAV 468 (511) Q Consensus 393 ~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~----g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 468 (511) ..+...++++|+++.|+|.++.+++++|+. |++|++|+++++|+++|+.+|++++++++.+....++.......+. T Consensus 374 ~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (480) T protein:vir:78 374 TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQAD 453 (480) T ss_pred cccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccCCCc Confidence 345667999999999999999999999874 4689999999999999988888877666555443332221111110 Q ss_pred CCCCCccccccCCCCCCccccccCCCCccccccccC Q lcl|NC_018086. 469 QGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAI 504 (511) Q Consensus 469 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (511) . .++.+.+. .+..+++.+++....++. T Consensus 454 ~--------~~~~~~~~-~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 454 A--------TPKPTVTE-TKTETQTSPSGFNRTKTR 480 (480) T ss_pred c--------ccCCCCCC-CCCccCCCcccCCCcCCC Confidence 0 00111110 011111122222222222 No 46 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=100.00 E-value=8.1e-82 Score=465.16 Aligned_cols=456 Identities=14% Similarity=0.088 Sum_probs=349.5 Q ss_pred chhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhcc Q lcl|NC_018086. 16 TNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGE 95 (511) Q Consensus 16 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~ 95 (511) -.+++....+.......+..|+.+|..+++||+++++||+|+|++.+.+...++...++++++||+++||++.++|++++ T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~ 80 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFEDSTQNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAERQAVE 80 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhhhccc Confidence 22333333333343444556888999999999999999999999877666666666677888999999999999999999 Q ss_pred CceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC--------CCceEEEEEcccceEEEecCCCCC Q lcl|NC_018086. 96 PITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR--------NKKHRFKAVSPMNCLIAYSADLDE 167 (511) Q Consensus 96 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~--------~g~~~i~~~~p~~~~~v~d~~~~~ 167 (511) +++..++++..+.++++|+.|+|+.++.++++++++||+||++||.++ ++.++|++++|.+++++||+..+ T Consensus 81 g~~~~~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~~~D~~~~- 159 (485) T protein:vir:10 81 GFRFGDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPTRMYAEIDPRIG- 159 (485) T ss_pred ceecCCCchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEccceeEEEEcCCCC- Confidence 999877888888999999999999999999999999999999999875 46788999999999999988765 Q ss_pred ceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCc-----c Q lcl|NC_018086. 168 EPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANE-----E 242 (511) Q Consensus 168 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~ 242 (511) ++.++++++.. ..+..++++++|+++.+++|....++ |......+|+||+||||+|+|+. + T Consensus 160 ~~~~~~~~~~~---~~~~~~~~~~~y~~~~~~~~~~~~~~-----------~~~~~~~~~~~g~vPvv~~~n~~~~~~~~ 225 (485) T protein:vir:10 160 RVSKAIRVAYD---AEGNEIQAATLYTPNDIFGWYRVENE-----------WQEWFNNPHGLGVVPVVPIPNRTRLSDLY 225 (485) T ss_pred ceeEEEEEEEe---eCCCeEEEEEEEeCCeEEEEEEcCCc-----------eEEeccccCCCCcccEEEeccccccCCCC Confidence 46677776653 23456778899999999999876544 33455678999999999999974 5 Q ss_pred cCchhHH-HHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccch----hh--hhhhhCceeeecCCCceeeeecCCC Q lcl|NC_018086. 243 RLGDFEA-QLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSD----SI--SNMKNDRVIVTDEDGMVKFITKDVN 315 (511) Q Consensus 243 g~s~~~~-v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~----~~--~~~~~~~~i~~~~~~~~~~~~~~~~ 315 (511) |+|+|++ |++|||+||+++|++++.++++++|+++++|.+.+.... .. .....++++. .+++++++.+. + T Consensus 226 G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~-~~~~d~k~~q~--~ 302 (485) T protein:vir:10 226 GTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILA-FEDAEGKIQQF--S 302 (485) T ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCcccccccccccchhhhhcccceec-cCCCCceEEee--c Confidence 8999985 999999999999999999999999999999986543211 11 1112233344 45667887654 4 Q ss_pred HHHHHHHHHHHHHHHHHHhCcccccccccc-----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_018086. 316 DKHIENIKNRAKLDIFSLSQTPDLVSKDFT-----AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNK 390 (511) Q Consensus 316 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~ 390 (511) ...++++++.|+.+|++++.+|++++..|| ++||+||++++.+|..|++++++.|+.+|++++++++.+.+.. T Consensus 303 ~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~~-- 380 (485) T protein:vir:10 303 AAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMMKGG-- 380 (485) T ss_pred ccchHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-- Confidence 566788999999999999888877765554 3699999999999999999999999999999999988865432 Q ss_pred CccccccceeEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|NC_018086. 391 AKDLKPYEVTPVFVRNLPQSYAELADMAVKLR----DMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTS 466 (511) Q Consensus 391 ~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~----g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~ 466 (511) ....+..+++|.|+++.|+|.++.+++++|+. |++|++|+++++|+++++.++++++++|+.+.....++.+.... T Consensus 381 ~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~ 460 (485) T protein:vir:10 381 DVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLIGTMVDPN 460 (485) T ss_pred CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 23445678999999999999999999999984 48999999999999988888898888776665554444443322 Q ss_pred cCCCCCCccccccCCCCCCccccccCCCCccc Q lcl|NC_018086. 467 AVQGASTAAANKLDKNPANTSTITTTDPVAAK 498 (511) Q Consensus 467 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (511) +..++.+... +++ .+.+.++++.+. T Consensus 461 ~~~~~~~~~~----~~~---~~~~~~~~~~~~ 485 (485) T protein:vir:10 461 PTVPGSPSPA----PAP---KPAALESGGDAA 485 (485) T ss_pred CCCCCCCCcc----ccc---cCcCCCCCCCCC Confidence 2111111111 011 111111222222 No 47 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=100.00 E-value=1.5e-81 Score=463.73 Aligned_cols=454 Identities=12% Similarity=0.068 Sum_probs=346.1 Q ss_pred CCCHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCceecCchhh Q lcl|NC_018086. 27 NFDLRE-LITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDEKT 105 (511) Q Consensus 27 ~~~~~~-l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~ 105 (511) =.+..+ |..|++.|..++++++++.+||+|+|++.+.....++...++|+++||+++||++.++|+++++++.++|++. T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~d~~~ 80 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhhccCceecCCCchh Confidence 122333 5668899999999999999999999998777666666667889999999999999999999999998888888 Q ss_pred HHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeee------CCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEe Q lcl|NC_018086. 106 IKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWI------DRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVI 179 (511) Q Consensus 106 ~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~------~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~ 179 (511) .+.++++|+.|+|+.++.++++++++||+||++||. +++|.++++++||.+++++||+...+++.+++++|... T Consensus 81 ~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~~~~~D~~~~~~~~~~i~~~~~~ 160 (480) T protein:vir:78 81 LEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) T ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEEcccceEEEEcCCCccceEEEEEEEEee Confidence 999999999999999999999999999999999996 45788999999999999999999889999999998654 Q ss_pred ecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCc-----ccCchhHH-HHHH Q lcl|NC_018086. 180 SDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANE-----ERLGDFEA-QLSL 253 (511) Q Consensus 180 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~g~s~~~~-v~~l 253 (511) ++ ...+.++++|+++.+++|....+.... .....+..+|+||+||||+|+|+. +|+|+|+. |++| T Consensus 161 ~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l 231 (480) T protein:vir:78 161 DD--VAVPDRATLYLPDETVPLRRNGGLNDQ-------WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKV 231 (480) T ss_pred cC--CCceEEEEEEeCCeEEEEEecCCCccc-------cccccccccCCCCCcceEEeecccccCCccCcccchhhHHHH Confidence 43 345678899999999999876543211 112346779999999999999873 68999986 9999 Q ss_pred HHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhh--h--hhhhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHH Q lcl|NC_018086. 254 IDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSI--S--NMKNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLD 329 (511) Q Consensus 254 ~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~--~--~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 329 (511) +|+||+++|++++.++++++|+++++|.+.+...... . ....+.++. .+++++++.+++. ...+++++.++.+ T Consensus 232 ~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~--~~~~~~~~~l~~~ 308 (480) T protein:vir:78 232 TDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT-LASEAAKISEFKA--AELRNFAEEMEVF 308 (480) T ss_pred HHHHHHHHHHHHHHHHhhcchhhhhhcCCccccccccccchhhhhhhhhcc-CCCCCceEEecCc--cCHHHHHHHHHHH Confidence 9999999999999999999999999998765432211 1 111223333 3466788866553 4577788888888 Q ss_pred HHHHhCcccccccccc-----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeC Q lcl|NC_018086. 330 IFSLSQTPDLVSKDFT-----AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFV 404 (511) Q Consensus 330 i~~~s~~p~~~~~~~~-----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~ 404 (511) |+++++++++++..|| ++||+||++++.+|..||+++++.|+.+|++++++++.+++. ....+...+++.|+ T Consensus 309 i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~g~---~~~~~~~~i~v~f~ 385 (480) T protein:vir:78 309 RKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR---EVTEEYTRLETVWR 385 (480) T ss_pred HHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC---CccccceeeeEEec Confidence 8888877776665553 369999999999999999999999999999999999887653 22345567899999 Q ss_pred CCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccC Q lcl|NC_018086. 405 RNLPQSYAELADMAVKLR----DMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLD 480 (511) Q Consensus 405 ~~~p~d~~e~a~~~~~~~----g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (511) ++.++|.++.+++++|+. |++|.+|+++++|+++|+.++++++++|+.+.....+....+..+.....+..++. T Consensus 386 ~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 463 (480) T protein:vir:78 386 DPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTET-- 463 (480) T ss_pred CCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCCCCCCC-- Confidence 999999999999999873 47999999999999988887777765554443322222221111111111111111 Q ss_pred CCCCCccccccCCCCccccccccC Q lcl|NC_018086. 481 KNPANTSTITTTDPVAAKEQEKAI 504 (511) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~ 504 (511) ++.+++..++-...+|. T Consensus 464 -------~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 464 -------KTETQTSPSGFNRTKTR 480 (480) T ss_pred -------CCccccccCCCCcccCC Confidence 11111122222222222 No 48 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=100.00 E-value=2e-81 Score=463.06 Aligned_cols=448 Identities=14% Similarity=0.088 Sum_probs=348.1 Q ss_pred hhhccCCCHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCceec Q lcl|NC_018086. 22 HFIRRNFDLRE-LITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITES 100 (511) Q Consensus 22 ~~~~~~~~~~~-l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~ 100 (511) .-..++++... |.+|+.+|..+++|++++.+||+|+|++.+.....+....++|+++|||++||+++++|++.+|+++. T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~~Gf~~~ 80 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFENKQNELKSSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTYVDAIAERQELEGFRIP 80 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhcCcccchhhhhhhhhcchHHHHHHHHHHhhhccceecc Confidence 22334566555 45688889999999999999999999988777666666678899999999999999998887776642 Q ss_pred ----------CchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeC--------CCCceEEEEEcccceEEEec Q lcl|NC_018086. 101 ----------GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWID--------RNKKHRFKAVSPMNCLIAYS 162 (511) Q Consensus 101 ----------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~--------~~g~~~i~~~~p~~~~~v~d 162 (511) .+++..+.++++|+.|+|+..+.++++++++||+||++|+.+ +++.++|++++|.+++++|| T Consensus 81 ~~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~d 160 (488) T protein:vir:23 81 SANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPLIRVEPPTALYAEVD 160 (488) T ss_pred CCcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcceEEEeccceeEEEEe Confidence 355677889999999999999999999999999999999874 45678999999999999999 Q ss_pred CCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCc- Q lcl|NC_018086. 163 ADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANE- 241 (511) Q Consensus 163 ~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~- 241 (511) +..+ ++.+++++|... ++..++++++|+++.+++|....++ |...+..+|+||+||||+|+|+. T Consensus 161 ~~~~-~~~~~~~~~~~~---~~~~~~~~~~y~~~~~~~~~~~~~~-----------~~~~~~~~h~~g~vPvv~f~n~~~ 225 (488) T protein:vir:23 161 PRTR-KVLYAIRAIYGA---DGNEIVSATLYLPDTTMTWLRAEGE-----------WEAPTSTPHGLEMVPVIPISNRTR 225 (488) T ss_pred cCCC-ceEEEEEEEEec---CCCcEEEEEEEecCcEEEEEecCCc-----------eEeccccccCCCCcceEEeccccc Confidence 7654 578888877532 3455778899999999999866543 34456789999999999999875 Q ss_pred ----ccCchhH-HHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchh------hhhhhhCceeeecCCCceeee Q lcl|NC_018086. 242 ----ERLGDFE-AQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDS------ISNMKNDRVIVTDEDGMVKFI 310 (511) Q Consensus 242 ----~g~s~~~-~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~------~~~~~~~~~i~~~~~~~~~~~ 310 (511) +|+|+|+ .|++|+|+||+++|++++.++++++|+++++|...+..... ......++++.+++++++++. T Consensus 226 ~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~ 305 (488) T protein:vir:23 226 LSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAETGQRMFDAYMARILAFEGGEGAHAE 305 (488) T ss_pred cCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCcccccccccccchhhhhhhhhhccCCCCCCceeE Confidence 6899998 48999999999999999999999999999999866543211 112233456777777788876 Q ss_pred ecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc-----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 311 TKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-----AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYL 385 (511) Q Consensus 311 ~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~ 385 (511) +++ ..+++++++.|+.+|+++++++++++..|| ++||+||++++++|..||+++++.|+.+|++++++++.++ T Consensus 306 q~~--~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~ 383 (488) T protein:vir:23 306 QFS--AAELRNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMV 383 (488) T ss_pred ecC--CCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 544 456788889999999988888877765553 3699999999999999999999999999999999999887 Q ss_pred HhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018086. 386 EFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR----DMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQN 461 (511) Q Consensus 386 ~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~----g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~ 461 (511) +.... ..+..++++.|+++.|+|.++.+++++|++ |++|++|++.+||+++|+.+|++++++++++.....+.. T Consensus 384 ~~~~~--~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~ 461 (488) T protein:vir:23 384 KGGDI--PTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVEREQMRQWLEQDQKQGLGLIGS 461 (488) T ss_pred cCCCc--chhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHHHHHHHHHHHHHHHHHHHHHH Confidence 64332 345567999999999999999999999984 479999999999999999999999877766554444443 Q ss_pred ccccccCCCCCCccccccCCCCCCccccccCCCCcccc Q lcl|NC_018086. 462 FKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKE 499 (511) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (511) +.......+...+.. +++...|..+.+ T Consensus 462 ~~~~~~~~~~~~~~~-----------~~~~~~~e~~~a 488 (488) T protein:vir:23 462 LYGASTPEGKPGEAP-----------VGEPPAPEPDAA 488 (488) T ss_pred HhccCCCcccCCCCC-----------CCCCCCCCCCCC Confidence 322221111111110 111111111111 No 49 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=100.00 E-value=1.8e-79 Score=452.32 Aligned_cols=425 Identities=16% Similarity=0.126 Sum_probs=334.6 Q ss_pred ccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCceecCchh Q lcl|NC_018086. 25 RRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDEK 104 (511) Q Consensus 25 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~ 104 (511) -++.+.+.|.+|+.+|..+++||+++.+||+|+|.+.+.+...++..+++|+++|||++||++.++|+++++++... T Consensus 1 ~~~~~~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~g~~~~d--- 77 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRLSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLDWLGWTNGD--- 77 (441) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhccccccCCC--- Confidence 22233445777899999999999999999999999877777777777789999999999999999999988886432 Q ss_pred hHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCc Q lcl|NC_018086. 105 TIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITG 184 (511) Q Consensus 105 ~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~ 184 (511) .+.++++|+.|+|+.++.++++++++||+||++||.+++|.+++++++|.+++++||+..+....+.++++. .. T Consensus 78 -~~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~-~~---- 151 (441) T protein:vir:80 78 -GYGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPHGDGTVSVRPQSPKNCTGKFSADGSRLDAGLVVQQT-CD---- 151 (441) T ss_pred -hHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCceEEEEEccceEEEEEeCCCCceeEEEEEEEE-ec---- Confidence 245899999999999999999999999999999999999999999999999999999887665555544443 21 Q ss_pred ceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCc-----ccCchhHH-HHHHHHHHH Q lcl|NC_018086. 185 HQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANE-----ERLGDFEA-QLSLIDAYN 258 (511) Q Consensus 185 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~g~s~~~~-v~~l~d~~~ 258 (511) ....++++|+++.+++|...+.+ .|...+..+|+||+||||+|.|++ +|+|+|+. |++|||+|| T Consensus 152 ~~~~~~~vy~~~~~~~~~~~~~~----------~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~ 221 (441) T protein:vir:80 152 PEVVEAELLLPDVIVQVERRGSR----------EWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAV 221 (441) T ss_pred CceEEEEEEecCeEEEEEEcCCc----------ceeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHH Confidence 23456889999999998765543 345567889999999999999875 58999975 999999999 Q ss_pred HHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCCCceeee-ecCCCHHHHHHHHHHHHHHHHHHhCcc Q lcl|NC_018086. 259 LAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDEDGMVKFI-TKDVNDKHIENIKNRAKLDIFSLSQTP 337 (511) Q Consensus 259 ~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~i~~~s~~p 337 (511) +++|++++.++++++|+++++|...++..........++++.++++++.+.+ .++.+.+.++++++.|+.+|+.++.++ T Consensus 222 ~~~s~~~~~~~~~~~~~~~i~G~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~ 301 (441) T protein:vir:80 222 RTLLGQSVNRDFYAYPQRWVTGVSADEFSQPGWVLSMASVWAVDKDDDGDTPNVGSFPVNSPTPYSDQMRLLAQLTAGEA 301 (441) T ss_pred HHHHHHHHHHHhhcCceeeeecCCccccccchhhhcccccccCCCCCCCCcceeEecCccchHHHHHHHHHHHHHHhccc Confidence 9999999999999999999999887766655666666778888766543322 244556678888899999899988887 Q ss_pred cccccccc-----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHH Q lcl|NC_018086. 338 DLVSKDFT-----AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYA 412 (511) Q Consensus 338 ~~~~~~~~-----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~ 412 (511) ++++..|| ++||+||++++.+|..||+++++.|+.+|++++++++.+++..... .....+++++|+++.|+|.+ T Consensus 302 ~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~-~~~~~~i~~~f~~~~~~~~~ 380 (441) T protein:vir:80 302 AVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDE-ADFFGDVGLRWRDASTPTRA 380 (441) T ss_pred CCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-cccceeeeEEeCCCCCcCHH Confidence 77655443 2599999999999999999999999999999999999988765433 33457899999999999999 Q ss_pred HHHHHHHHHh--cc--CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCcccccc Q lcl|NC_018086. 413 ELADMAVKLR--DM--LPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKL 479 (511) Q Consensus 413 e~a~~~~~~~--g~--~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (511) +.+++++++. |+ +|+++++.++|++++ |++|+++|+++..+......+. .+...++. T Consensus 381 e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~---e~~~~~~e~~e~~~~~~~~~~~-------~~~~~~~~ 441 (441) T protein:vir:80 381 ATADAVTKLVGAGILPADSRTVLEMLGLDDV---QVEAVMRHRAESSDPLAVLAGA-------ISRQTNEV 441 (441) T ss_pred HHHHHHHHHHhcCcccccHHHHHHhCCCCHH---HHHHHHHHHHHHHHHHHHHhhh-------hhcccccC Confidence 9999999985 43 588999999998764 4445555444333321111111 11111111 No 50 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=100.00 E-value=2.7e-79 Score=451.29 Aligned_cols=473 Identities=15% Similarity=0.122 Sum_probs=337.4 Q ss_pred CCCccc---hhhcccccCchhhHhhhhccCCCHH----HHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc-Cccccc Q lcl|NC_018086. 1 MAIPNG---QINAGDIITTNIRRKHFIRRNFDLR----ELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF-DDTNKP 72 (511) Q Consensus 1 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~----~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~-~~~~~~ 72 (511) |--+.+ .+..++|. +.+..++.+ .+.+++..|..+++|++++++||+|+|++...... .+..+. T Consensus 1 ~~~~~~~~~~~~~~~~~--------~p~~~~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~ 72 (501) T protein:vir:25 1 MTVPVDVIADAPAADVE--------FPEDSMSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGRPEVPEGASDEVKE 72 (501) T ss_pred CcccchhhhccCccccc--------CCcccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccCChhhhh Confidence 443322 33333332 223344433 35567778888999999999999999987554433 333344 Q ss_pred c-ceeccchHHHHHHHHHhhhhccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEE Q lcl|NC_018086. 73 N-SKIVHNFPKLLVDTSTAYLAGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKA 151 (511) Q Consensus 73 ~-~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~ 151 (511) . .++++|||++||+++++|++.++++.. |++..+.++++|+.|+|+.+..++++++++|||||++||.+++| +++++ T Consensus 73 ~~~~~v~n~~~~ivd~~a~~l~~~gf~~~-d~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~-~~i~~ 150 (501) T protein:vir:25 73 LAKLSVKNVLSLVRDSFAQNLSVVGYRNA-LAKENDPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEG-PVFRT 150 (501) T ss_pred hHhhhhcChHHHHHHHHHhhhcccceecC-CccchHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCC-CeEEE Confidence 3 456789999999999999999998854 44456778999999999999999999999999999999999887 68999 Q ss_pred EcccceEEEecC-CCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccc-----ccc-----cccccccc Q lcl|NC_018086. 152 VSPMNCLIAYSA-DLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREV-----YRE-----IPEELEIK 220 (511) Q Consensus 152 ~~p~~~~~v~d~-~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~-----~~~-----~~~~~~~~ 220 (511) +||++++++|++ ..++.+++++++|....+ .....++++|++..+++|........ .+. ........ T Consensus 151 ~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (501) T protein:vir:25 151 RSPRQILAVYADPSVDAWPQYALETWVAQKD--AKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVI 228 (501) T ss_pred eccccEEEEEecCCCCcceeEEEEEEeeccc--cCcceeEEEecCeeEEEEecCceeeeecccccccccccccccccccc Confidence 999999999965 556678999998875543 33455788999998888754321100 000 00111222 Q ss_pred cccceeccCCccceEeecCC----cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhC Q lcl|NC_018086. 221 DYEVHPNLLQKFPVLEIIAN----EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKND 296 (511) Q Consensus 221 ~~~~~~~~~g~iPvv~~~n~----~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~ 296 (511) .....+|+||.||||+|+|+ ++|+|+|++|++|+|+||+++|++++.++++++|+++++|.+.+.... ..+..+ T Consensus 229 ~~~~~~~~~~~vPiv~f~N~~~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~--~~~~~~ 306 (501) T protein:vir:25 229 EHGATFEGKPVCPVVRFVNGRDADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVISGWTGSKAEV--LKASAL 306 (501) T ss_pred ccccccCCccceeeEeccCccccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhCCCCCccch--hhhccc Confidence 34567899999999999994 568999999999999999999999999999999999999997765443 344455 Q ss_pred ceeeecCCCceeeeecCC-CHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 297 RVIVTDEDGMVKFITKDV-NDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVL 374 (511) Q Consensus 297 ~~i~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l 374 (511) +++.++ ++++++.+++. +.+.+..+++.+..+|+..|++|+..++.+ +|+||+||++++.+|.+++++|++.|+.+| T Consensus 307 ~i~~~~-~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~l~~ka~~k~~~f~~~l 385 (501) T protein:vir:25 307 RVWTFE-DPEVKAQAFPPASVEPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAEALAAAEANQQRKLAAKRESFGESW 385 (501) T ss_pred ceeccC-CCCceEEEecccChHHHHHHHHHHHHHHHhhcCCChhhhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 666654 55677766543 446666777777777777788888777754 689999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhcc-CChHHHHHhCCCCCCHHHHHHHHHHHHHH Q lcl|NC_018086. 375 AKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDM-LPDETIINQFPWITDARQEVEKADAQRQK 453 (511) Q Consensus 375 ~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~-~s~et~~~~l~~v~d~~~E~~ri~~E~~~ 453 (511) ++++++++.+.+. .......++++.|+++.|+|.++.|++++|+.|+ +|.++++.+++++++++ ++++++++++ T Consensus 386 ~~~~rl~~~~~~~---~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gis~et~~~~~~g~~~~~--ie~~~~~~~e 460 (501) T protein:vir:25 386 EQLLRLAAEMDDD---PDTAADSGAEVLWRDTEARSFGAVVDGITKLASAGIPIEHLLSMVPGMTQQT--IQAIKDSLRG 460 (501) T ss_pred HHHHHHHHHHhCC---CccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCHHHHHHHcCCCCHHH--HHHHHHHHHH Confidence 9999998877653 2234556799999999999999999999999876 89999999999998654 4455444333 Q ss_pred HHHH-HHhhccccccCCCCCCccccccCCCCCCccccccCCCCccc Q lcl|NC_018086. 454 RADI-ALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAK 498 (511) Q Consensus 454 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (511) +... .+..... ....+..+..+ +..+.+.+.++.++.++. T Consensus 461 ~~~~~~~~~~~~-~~~~~~~~~~~----~~~~~~~~~~~~~~~~g~ 501 (501) T protein:vir:25 461 GEVKSLVDKLLS-NEPAPVPPPPP----QAAAQALNEGGVNGNGGA 501 (501) T ss_pred HhHHHHHHHhhc-cCcCCCCCCCC----CCCccccccccCCCCCCC Confidence 2222 1221111 11111111111 111111111111222222 No 51 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=100.00 E-value=1.8e-76 Score=435.90 Aligned_cols=449 Identities=13% Similarity=0.112 Sum_probs=319.3 Q ss_pred HhhhhccCCCHHHHH-----HHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCc---cccccceeccchHHHHHHHHHhh Q lcl|NC_018086. 20 RKHFIRRNFDLRELI-----TLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDD---TNKPNSKIVHNFPKLLVDTSTAY 91 (511) Q Consensus 20 ~~~~~~~~~~~~~l~-----~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~---~~~~~~ri~~n~~k~ivd~~~~~ 91 (511) -..+.+++++..++. .|+.+|..+++||+++++||+|+|.+........ ..+..+++++|||++||+++++| T Consensus 1 ~~~~p~~~l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~ 80 (479) T protein:vir:99 1 MIDLPDEDLSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQQ 80 (479) T ss_pred CccCCcccCChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHHhh Confidence 334455667766543 4667899999999999999999999876543221 12233456789999999999999 Q ss_pred hhccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeee-----CCCCceEEEEEcccceEEEecCCCC Q lcl|NC_018086. 92 LAGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWI-----DRNKKHRFKAVSPMNCLIAYSADLD 166 (511) Q Consensus 92 l~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~-----~~~g~~~i~~~~p~~~~~v~d~~~~ 166 (511) +++++++.. +++..+.++++|+.|+|+.++.++++++++||+||++||. +++|.++++++||++++++|++... T Consensus 81 l~~~gf~~~-d~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~~i~~~~p~~~~~iydd~~~ 159 (479) T protein:vir:99 81 LIVDGYRKT-GTNENAKGWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVARIKCIDPRDAFAIWEDPYW 159 (479) T ss_pred cccccccCC-CchhhHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCceEEEEechhheEEEecCCcc Confidence 999998854 4455677899999999999999999999999999999995 5678899999999999999988765 Q ss_pred CceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCC----cc Q lcl|NC_018086. 167 EEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIAN----EE 242 (511) Q Consensus 167 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~----~~ 242 (511) ... .++.+. .+. ...+.+|+...++.|....+ .|...+..+|+||.||||+|.|+ ++ T Consensus 160 ~~~--~~~~~~--~~~----~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~h~~g~vPvv~f~n~~~~~~~ 220 (479) T protein:vir:99 160 DEW--PKYLLE--RQP----NGQYWWWTEEDYSIFEFKQG-----------KFIYRETVSHDYGHIPFVRYVNVMDLRGV 220 (479) T ss_pred cce--eeEEEe--ecC----ceeEEEEecceEEEEEecCC-----------ceeeccccccCCCCcceEEeecCCCcCcC Confidence 432 222111 111 12345778877766665443 34455778999999999999998 57 Q ss_pred cCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhh---hhhhhCceeeecCCCceeeeecCCCHHHH Q lcl|NC_018086. 243 RLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSI---SNMKNDRVIVTDEDGMVKFITKDVNDKHI 319 (511) Q Consensus 243 g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~---~~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 319 (511) |+|+|+++++|+|+||+++|++++.++++++|+++++|....+..... ..+...+++.. +++++++.+.+ ...+ T Consensus 221 g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~i~~~-~~~~~~~~q~~--~~~~ 297 (479) T protein:vir:99 221 CYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGLMLPEGANADQEKMRFAQESMLIS-QNEKASFGAIP--AAPL 297 (479) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCCCcccccccchhccccccccceee-cCCCceEEEec--ccch Confidence 999999999999999999999999999999999999998654433222 23334556654 45677776543 4556 Q ss_pred HHHHHHHHHHHHHHhCc---cccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc Q lcl|NC_018086. 320 ENIKNRAKLDIFSLSQT---PDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKP 396 (511) Q Consensus 320 ~~~~~~l~~~i~~~s~~---p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~ 396 (511) +++++.|+..|++++++ |...++..+|+||+||++++.+|..+++.+++.|+.+|++++++++.+.+.. ..... T Consensus 298 ~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~---~~~~~ 374 (479) T protein:vir:99 298 DGLLNAYKESLLEFLALAQLPPHIAGQIVNVAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGRT---EEATD 374 (479) T ss_pred HHHHHHHHHHHHHHhccCCCCHHHcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCC---ccccc Confidence 77777777766666655 4445555678999999999999999999999999999999999998876432 23445 Q ss_pred cceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHH--HHHhhc---cccccCC Q lcl|NC_018086. 397 YEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRAD--IALQNF---KQTSAVQ 469 (511) Q Consensus 397 ~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~--~~~~~~---~~~~~~~ 469 (511) .+++++|+++.++|.++.+++++|+ +|++|++|++.+||++++++ +++++++++.... .....+ .....+. T Consensus 375 ~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~is~et~l~~l~gv~~~~--~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (479) T protein:vir:99 375 LDFTITWQDVTIQSLAQFADAWAKMVESLKIPAEGVWDMIPNLDQST--VNGWKEIYDREGDFGKYMRKLQNGPDPAEQR 452 (479) T ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHH--HHHHHHHHHHHHHHHHHHHHHhcccCccccc Confidence 6789999999999999999999998 47899999999999998754 3444433332211 111111 1111111 Q ss_pred CCCCccccccCCCCCCccccccCCCCccccccccCC Q lcl|NC_018086. 470 GASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQ 505 (511) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (511) +..+ ++...++++.+++.+..-.+|-- T Consensus 453 ~~~~---------~~~~~~~~~~~~~~~~~~~~~~~ 479 (479) T protein:vir:99 453 GGPN---------GATNMQQANNKTGEPASLNKSGA 479 (479) T ss_pred CCCC---------CCCCCCCCCCCCcchhccCCCCC Confidence 1111 11111111111111111111110 No 52 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=100.00 E-value=2.4e-76 Score=435.17 Aligned_cols=432 Identities=13% Similarity=0.027 Sum_probs=323.1 Q ss_pred ccCCCHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcC-ccccc-cceeccchHHHHHHHHHhhhhccCceecC Q lcl|NC_018086. 25 RRNFDLRE-LITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFD-DTNKP-NSKIVHNFPKLLVDTSTAYLAGEPITESG 101 (511) Q Consensus 25 ~~~~~~~~-l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~-~~~~~-~~ri~~n~~k~ivd~~~~~l~g~~~~~~~ 101 (511) .++.++.+ +.+|+.+|..+++|++++++||+|+|++.+..... +..+. ++|+++||+++||++.++|++|+|+++.+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~ 80 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 66677755 56688999999999999999999999886544433 22333 57899999999999999999999999865 Q ss_pred c--hhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEe Q lcl|NC_018086. 102 D--EKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVI 179 (511) Q Consensus 102 d--~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~ 179 (511) + .+..+.++++|+.|+|+..+.++++++++||+||++||.+++|.+++++++|.+++++||+...+.+.+++++|... T Consensus 81 ~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~ 160 (456) T protein:vir:10 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRAAMRWWRDL 160 (456) T ss_pred CCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCCcceEEEEEEEEec Confidence 3 45567899999999999999999999999999999999999999999999999999999999999999999998643 Q ss_pred ecCCcceEEEEEEEcCCcEEEEEEc-cC---cccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHH Q lcl|NC_018086. 180 SDITGHQIRTYEVYTEDLIYKFSTD-DE---REVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLID 255 (511) Q Consensus 180 ~~~~~~~~~~~~~~~~~~i~~~~~~-~~---~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d 255 (511) ++... +..+|....+.++... .. ............|......+|++|.|||++| +|++|+|+|+++++|+| T Consensus 161 ---d~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~N~~g~gd~e~vi~liD 235 (456) T protein:vir:10 161 ---DAESD-FAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDIIN 235 (456) T ss_pred ---CCcee-EEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEe-cCCCCCchhhhhHHHHH Confidence 23333 3345555444443221 00 0001111123344555667899999999887 56789999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCceeEeecCCCCcc-----chhh-----hhhhhCceeeecCCCceeeeecCCCHHHHHHHHHH Q lcl|NC_018086. 256 AYNLAVSDSVNDIAYWNDAYLWLQGFDLSAD-----SDSI-----SNMKNDRVIVTDEDGMVKFITKDVNDKHIENIKNR 325 (511) Q Consensus 256 ~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~-----~~~~-----~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (511) +||+++|++++.++++++|+++++|.+.+.. +... .....+.++.+++++++..+. +.+.+.+...++. T Consensus 236 a~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~~~q~~-~~~~~~~~~~l~~ 314 (456) T protein:vir:10 236 RINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVDIWESQ-ANDFTPMLSAIKE 314 (456) T ss_pred HHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhccccccCCCCcceEEec-ccChhHHHHHHHH Confidence 9999999999999999999999999754321 1111 111223456666666665442 3445566666666 Q ss_pred HHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeC Q lcl|NC_018086. 326 AKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFV 404 (511) Q Consensus 326 l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~ 404 (511) +...|+..+++|...++.+ +|+||+||++++.+|.+|++.+++.|+.+|++++++++.+.+ . .+..++++.|+ T Consensus 315 ~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g---~---~~~~~~~v~w~ 388 (456) T protein:vir:10 315 HIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEG---E---SVEDTVDVSFE 388 (456) T ss_pred HHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---C---CcccceeEEec Confidence 6666667777777766654 589999999999999999999999999999999999876543 1 23346899999 Q ss_pred CCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCcccc Q lcl|NC_018086. 405 RNLPQSYAELADMAVKLR--DMLPDETIINQFPWITD--ARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAAN 477 (511) Q Consensus 405 ~~~p~d~~e~a~~~~~~~--g~~s~et~~~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (511) ++.|+|.++.|++++|+. |++|.++++.++|++++ .++|++|+++|+.+........ ++ +.++. T Consensus 389 ~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~-~~--------~~~~~ 456 (456) T protein:vir:10 389 SPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQR-PQ--------EDGSR 456 (456) T ss_pred CCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhc-CC--------CCCCC Confidence 999999999999999985 78999999999998754 3356777766655432211110 00 00000 No 53 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=100.00 E-value=2.4e-76 Score=435.17 Aligned_cols=432 Identities=13% Similarity=0.027 Sum_probs=323.1 Q ss_pred ccCCCHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcC-ccccc-cceeccchHHHHHHHHHhhhhccCceecC Q lcl|NC_018086. 25 RRNFDLRE-LITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFD-DTNKP-NSKIVHNFPKLLVDTSTAYLAGEPITESG 101 (511) Q Consensus 25 ~~~~~~~~-l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~-~~~~~-~~ri~~n~~k~ivd~~~~~l~g~~~~~~~ 101 (511) .++.++.+ +.+|+.+|..+++|++++++||+|+|++.+..... +..+. ++|+++||+++||++.++|++|+|+++.+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~ 80 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 66677755 56688999999999999999999999886544433 22333 57899999999999999999999999865 Q ss_pred c--hhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEe Q lcl|NC_018086. 102 D--EKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVI 179 (511) Q Consensus 102 d--~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~ 179 (511) + .+..+.++++|+.|+|+..+.++++++++||+||++||.+++|.+++++++|.+++++||+...+.+.+++++|... T Consensus 81 ~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~ 160 (456) T protein:vir:10 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRAAMRWWRDL 160 (456) T ss_pred CCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCCcceEEEEEEEEec Confidence 3 45567899999999999999999999999999999999999999999999999999999999999999999998643 Q ss_pred ecCCcceEEEEEEEcCCcEEEEEEc-cC---cccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHH Q lcl|NC_018086. 180 SDITGHQIRTYEVYTEDLIYKFSTD-DE---REVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLID 255 (511) Q Consensus 180 ~~~~~~~~~~~~~~~~~~i~~~~~~-~~---~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d 255 (511) ++... +..+|....+.++... .. ............|......+|++|.|||++| +|++|+|+|+++++|+| T Consensus 161 ---d~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~N~~g~gd~e~vi~liD 235 (456) T protein:vir:10 161 ---DAESD-FAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDIIN 235 (456) T ss_pred ---CCcee-EEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEe-cCCCCCchhhhhHHHHH Confidence 23333 3345555444443221 00 0001111123344555667899999999887 56789999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCceeEeecCCCCcc-----chhh-----hhhhhCceeeecCCCceeeeecCCCHHHHHHHHHH Q lcl|NC_018086. 256 AYNLAVSDSVNDIAYWNDAYLWLQGFDLSAD-----SDSI-----SNMKNDRVIVTDEDGMVKFITKDVNDKHIENIKNR 325 (511) Q Consensus 256 ~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~-----~~~~-----~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (511) +||+++|++++.++++++|+++++|.+.+.. +... .....+.++.+++++++..+. +.+.+.+...++. T Consensus 236 a~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~~~q~~-~~~~~~~~~~l~~ 314 (456) T protein:vir:10 236 RINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVDIWESQ-ANDFTPMLSAIKE 314 (456) T ss_pred HHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhccccccCCCCcceEEec-ccChhHHHHHHHH Confidence 9999999999999999999999999754321 1111 111223456666666665442 3445566666666 Q ss_pred HHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeC Q lcl|NC_018086. 326 AKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFV 404 (511) Q Consensus 326 l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~ 404 (511) +...|+..+++|...++.+ +|+||+||++++.+|.+|++.+++.|+.+|++++++++.+.+ . .+..++++.|+ T Consensus 315 ~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g---~---~~~~~~~v~w~ 388 (456) T protein:vir:10 315 HIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEG---E---SVEDTVDVSFE 388 (456) T ss_pred HHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---C---CcccceeEEec Confidence 6666667777777766654 589999999999999999999999999999999999876543 1 23346899999 Q ss_pred CCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCcccc Q lcl|NC_018086. 405 RNLPQSYAELADMAVKLR--DMLPDETIINQFPWITD--ARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAAN 477 (511) Q Consensus 405 ~~~p~d~~e~a~~~~~~~--g~~s~et~~~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (511) ++.|+|.++.|++++|+. |++|.++++.++|++++ .++|++|+++|+.+........ ++ +.++. T Consensus 389 ~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~-~~--------~~~~~ 456 (456) T protein:vir:10 389 SPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQR-PQ--------EDGSR 456 (456) T ss_pred CCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhc-CC--------CCCCC Confidence 999999999999999985 78999999999998754 3356777766655432211110 00 00000 No 54 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=100.00 E-value=5.7e-76 Score=433.11 Aligned_cols=470 Identities=11% Similarity=0.045 Sum_probs=336.0 Q ss_pred CCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchH Q lcl|NC_018086. 2 AIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFP 81 (511) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~ 81 (511) --+|+.-.+.......+ + ...+...|..|+.+|..++++++++.+||+|+|.+.+.+...++...++++++||+ T Consensus 1 ~~~~~~~~~~~~~~~~~----l--~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~ 74 (504) T protein:vir:99 1 MTEETTSASKFTFRIPE----L--NDDVVDKVNGLYQQLVDRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWS 74 (504) T ss_pred CCccCCcccccccccCC----C--CHHHHHHHHHHHHHHHHHhHHHHHHHHHHhccccchhccccccHHHHHHhhccCcH Confidence 23455444333332211 1 11123447778889999999999999999999998776665555556778899999 Q ss_pred HHHHHHHHhhhhccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCc--eEEEEEcccceEE Q lcl|NC_018086. 82 KLLVDTSTAYLAGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKK--HRFKAVSPMNCLI 159 (511) Q Consensus 82 k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~--~~i~~~~p~~~~~ 159 (511) ++||+++++++..+|++...+++....++++|+.|+|+.+..++++++++|||||++||.+++|+ +.|+++||+++++ T Consensus 75 ~~iVd~~a~rl~~~Gf~~~d~~~~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~sP~~~~~ 154 (504) T protein:vir:99 75 AKAVDTLARRCNLESFVWPDGDYGSIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKSAMQATG 154 (504) T ss_pred HHHHHHHHhhhccceeeCCCCChhhHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceeEE Confidence 99999999999999999877777788899999999999999999999999999999999998876 5688999999999 Q ss_pred EecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecC Q lcl|NC_018086. 160 AYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIA 239 (511) Q Consensus 160 v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n 239 (511) +||+.. +++.++++++... .+ .....+++|+++.+++|.....+. ...+..+|++| ||||+|.| T Consensus 155 iyD~~~-~~~~~a~~~~~~d--~~-g~~~~~~~y~~~~~~~~~~~~~~~-----------~~~~~~~~~~g-vPvV~~~n 218 (504) T protein:vir:99 155 EWNSRR-NAMDSLLSITSRD--AE-GHPTGIALYEDGVTVTADMDDDGD-----------WHADVRTHKLG-VPVEVLPY 218 (504) T ss_pred EEeCCC-CceeEEEEEEEec--CC-CeEEEEEEEcCCcEEEEEEcCCce-----------eeeccccCCCC-cceEEecc Confidence 999865 5577777766542 23 345668999999999987654321 12456789998 89999998 Q ss_pred C-----cccCchhH-HHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccc----h--hhhhhhhCceeeecCCCc- Q lcl|NC_018086. 240 N-----EERLGDFE-AQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADS----D--SISNMKNDRVIVTDEDGM- 306 (511) Q Consensus 240 ~-----~~g~s~~~-~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~----~--~~~~~~~~~~i~~~~~~~- 306 (511) + .+|+|+|+ .|++|+|++|++++++.+.++||++|+++++|...++.. . .......++++.++++++ T Consensus 219 ~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~ 298 (504) T protein:vir:99 219 KPREDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGADAKNFRNKDGSMKPAWQIALARVFALPDDEDE 298 (504) T ss_pred cccCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccCCccccccccccccchhhhhhhhhhcCCCcccc Confidence 6 46899986 799999999999999999999999999999998764321 1 122233356777776543 Q ss_pred -------eeeeecCCCHHHHHHHHHHHHHHHHHH---hCccccccccc---cCccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 307 -------VKFITKDVNDKHIENIKNRAKLDIFSL---SQTPDLVSKDF---TAASGQALKAATQPLENKSAVKESKFRKV 373 (511) Q Consensus 307 -------~~~~~~~~~~~~~~~~~~~l~~~i~~~---s~~p~~~~~~~---~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 373 (511) +++ ++++..++++|++.|+.+|+.+ |++|...++.. +|+||+||++++.+|..|+++|++.|+.+ T Consensus 299 ~~~~~~~~~~--~q~~~~~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~ 376 (504) T protein:vir:99 299 PDAARARADV--KQFPASSPQPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPA 376 (504) T ss_pred ccccCcccee--eecCCCChHHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 444 3445555666777776666666 56665555432 46899999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhcc-----CChHHHHHhCCCCCCHHHHHHHHH Q lcl|NC_018086. 374 LAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDM-----LPDETIINQFPWITDARQEVEKAD 448 (511) Q Consensus 374 l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~-----~s~et~~~~l~~v~d~~~E~~ri~ 448 (511) |++++++++.+.+..+. ...+..++++.|+++.++|.++.|++++|+.+. .+.++++.++|+. ..|++|++ T Consensus 377 l~~~~rla~~~~~~~~~-~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~---~~ei~r~~ 452 (504) T protein:vir:99 377 FRRSMIRALAIKNGLDR-IPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLT---PQQAKRAL 452 (504) T ss_pred HHHHHHHHHHHhcCCCc-cccccccceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcCCC---HHHHHHHH Confidence 99999998887764432 234556789999999999999999999998652 3458899999875 34566776 Q ss_pred HHHHHHHH-HHHhhccccccCCCCCCccccccCCCCCCccccccCCCCcccccccc Q lcl|NC_018086. 449 AQRQKRAD-IALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKA 503 (511) Q Consensus 449 ~E~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (511) ++++.... ..++.+..+....+..+...+++...+ +.++....++.+..-+ T Consensus 453 ~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~e~----a~~~~~~~~~~p~~~~ 504 (504) T protein:vir:99 453 AERRRASSVSIIEALNRRQQEAATAGEDQDQGAGEP----PANEPPAALGRPTLVG 504 (504) T ss_pred HHHHHHhhHHHHHHHhcccCCCCCCCCCCCcCCCCC----CCCCCCccCCCcccCC Confidence 65543332 223333332222222211111111111 1111111111111111 No 55 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=100.00 E-value=3.1e-75 Score=429.06 Aligned_cols=432 Identities=13% Similarity=0.017 Sum_probs=323.1 Q ss_pred ccCCCHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcC-ccccc-cceeccchHHHHHHHHHhhhhccCceecC Q lcl|NC_018086. 25 RRNFDLREL-ITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFD-DTNKP-NSKIVHNFPKLLVDTSTAYLAGEPITESG 101 (511) Q Consensus 25 ~~~~~~~~l-~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~-~~~~~-~~ri~~n~~k~ivd~~~~~l~g~~~~~~~ 101 (511) ..+.++.++ ..|+.+|..++++++++.+||+|+|++.+..... +..+. ++++++||+++||++.++|++|+|++++. T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~ 80 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 556677654 4588889999999999999999999987654433 33333 44678999999999999999999999865 Q ss_pred c--hhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEe Q lcl|NC_018086. 102 D--EKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVI 179 (511) Q Consensus 102 d--~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~ 179 (511) + .+..+.++++|++|+|+.++.++++++++||+||+++|.+++|.+++++++|.+++++||+...+.+.+++++|... T Consensus 81 ~~d~~~~~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~ 160 (456) T protein:vir:79 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDL 160 (456) T ss_pred CCCccHHHHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEeccceeEEEEcCCCCCceEEEEEEEEec Confidence 3 34567899999999999999999999999999999999999999999999999999999999999999999998643 Q ss_pred ecCCcceEEEEEEEcCCcEEEEEEccC----cccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHH Q lcl|NC_018086. 180 SDITGHQIRTYEVYTEDLIYKFSTDDE----REVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLID 255 (511) Q Consensus 180 ~~~~~~~~~~~~~~~~~~i~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d 255 (511) + + ...+..+|+.+.++++..... ............+......+|+++.||||+|. |++|.|+|+++++||| T Consensus 161 d---~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~-N~~~~gd~e~v~~liD 235 (456) T protein:vir:79 161 D---A-ESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ-NPDGMGEVEPHIDIIN 235 (456) T ss_pred C---C-ceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEEEec-CCCCCchhhhhHHHHH Confidence 2 2 334556788877776543211 01111122334455566789999999999984 6789999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCceeEeecCCCCcc-----chhh-----hhhhhCceeeecCCCceeeeecCCCHHHHHHHHHH Q lcl|NC_018086. 256 AYNLAVSDSVNDIAYWNDAYLWLQGFDLSAD-----SDSI-----SNMKNDRVIVTDEDGMVKFITKDVNDKHIENIKNR 325 (511) Q Consensus 256 ~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~-----~~~~-----~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (511) +||+++|++++.++++++|++++.|...... +... .....+.++.+++++++..+ .+.+.+.+...++. T Consensus 236 ~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~~~~~~~~~~~~~~~~~q~-~~~~~~~~~~~l~~ 314 (456) T protein:vir:79 236 RINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWES-QTNDFTPMLSAIKE 314 (456) T ss_pred HHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhhhhccccccCCCCcceeee-cccChHHHHHHHHH Confidence 9999999999999999999999999754321 1111 11112345556666655432 23344555555555 Q ss_pred HHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeC Q lcl|NC_018086. 326 AKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFV 404 (511) Q Consensus 326 l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~ 404 (511) +..+|+..+++|...++.+ +|+||+||++++.+|.+||+.+++.|+++|++++++++.+.+. .+..++++.|+ T Consensus 315 ~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g~------~~~~~i~v~w~ 388 (456) T protein:vir:79 315 HIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE------SVEDTVDVSFE 388 (456) T ss_pred HHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC------CccccceEEeC Confidence 5556666666666666543 5899999999999999999999999999999999998766431 23356899999 Q ss_pred CCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCcccc Q lcl|NC_018086. 405 RNLPQSYAELADMAVKLR--DMLPDETIINQFPWITD--ARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAAN 477 (511) Q Consensus 405 ~~~p~d~~e~a~~~~~~~--g~~s~et~~~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (511) ++.|+|.++.|++++|+. |++|.++++..++++++ .++|++|+++|..+...... +.++++.+. T Consensus 389 ~~~~~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~~~i~~~e~~r~~~e~~~~~~~~~---------~~~~~~~~~ 456 (456) T protein:vir:79 389 SPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPV---------QRPQEDGSR 456 (456) T ss_pred CCCCcCHHHHHHHHHHHHhcCCChHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhhhHh---------hcCCCCCCC Confidence 999999999999999984 78999999999988654 34566666666444322111 111111111 No 56 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=100.00 E-value=7.6e-70 Score=399.53 Aligned_cols=408 Identities=13% Similarity=0.114 Sum_probs=294.9 Q ss_pred cccCCcCccccccce-eccchHHHHHHHHHhhhhccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEe Q lcl|NC_018086. 61 IQSRTFDDTNKPNSK-IVHNFPKLLVDTSTAYLAGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIH 139 (511) Q Consensus 61 ~~~~~~~~~~~~~~r-i~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v 139 (511) .-.+...+..+..+| +++|||++||+++++++++++++. .|.+..+.++++|++|+|+.++.++++++++|||||++| T Consensus 1 ~l~~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~~-~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v 79 (434) T protein:vir:98 1 MLPKNAEQAFLDFQRKARTNFCGLIANASVHRLLALGVTG-PDGEPDTRASRWWQANRLDSRQKLVWRMAMAQSAGYMLV 79 (434) T ss_pred CCCCCccHHHHHhhhhhhccchHHHHHHHHhhhccCceec-CCCchHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEE Confidence 112333344455454 578999999999999999999885 455677889999999999999999999999999999999 Q ss_pred eeCCCC-------ceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccc-c Q lcl|NC_018086. 140 WIDRNK-------KHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVY-R 211 (511) Q Consensus 140 ~~~~~g-------~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~-~ 211 (511) |.++++ .+.|+++||.+++++||+..+ ++.++|++|.... ++.....+.+++...++++......... . T Consensus 80 ~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~-~~~~ai~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (434) T protein:vir:98 80 GAHPTRTEDNGRPSPLITMEHPSECIVEYDPETG-EPLVGLKVWHNDI--DGFGYARVFFDDTSFPYRTRERTGARLPWG 156 (434) T ss_pred ecCCCcccccCCceeEEEEeccceeEEEEeCCCC-ceEEEEEEEEecc--CCceEEEEEEeCcEEEEEEeeccccccccc Confidence 987643 467999999999999998765 5889998886433 3334333334444444443332221111 1 Q ss_pred ccccccccccccceeccCCccceEeecCC----cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccc Q lcl|NC_018086. 212 EIPEELEIKDYEVHPNLLQKFPVLEIIAN----EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADS 287 (511) Q Consensus 212 ~~~~~~~~~~~~~~~~~~g~iPvv~~~n~----~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~ 287 (511) ..............+|+||+||||+|.|| .+|+|+|+++++|||+||+++|++++.++++++|+++++|.+.+... T Consensus 157 ~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~ 236 (434) T protein:vir:98 157 PDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGEDPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKFAKRT 236 (434) T ss_pred cccceecccccccccCCCCccceEEeccCCCcCcCCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccc Confidence 11111222344677899999999999998 67999999999999999999999999999999999999998765443 Q ss_pred hhhhh--------hhhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc----cCccHHHHHHH Q lcl|NC_018086. 288 DSISN--------MKNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF----TAASGQALKAA 355 (511) Q Consensus 288 ~~~~~--------~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~----~~~Sg~Ai~~~ 355 (511) +.... ......+.+.+++++++.+. +....+++++.|+.+|+.++..++++...| +|+||+||+++ T Consensus 237 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~--~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~~n~Sg~Al~~~ 314 (434) T protein:vir:98 237 DPATGMTVVDQPFVPSPSAVWASEGENTQFGQL--DATDLSGFLKEHASDVRDMLTISQTPTYLYATDLVNISADTIGAL 314 (434) T ss_pred ccccccchhhhhhhccccccccCCCCCceEEEe--cCcchHHHHHHHHHHHHHHhcccCCCHHHhccccCChHHHHHHHH Confidence 22111 11222344455667777554 445667777777777777777766655444 47999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhcc-CChHHHHHhC Q lcl|NC_018086. 356 TQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDM-LPDETIINQF 434 (511) Q Consensus 356 ~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~-~s~et~~~~l 434 (511) +.+|..|++++++.|+.+|++++++++.+.+. ..+..++++.|+++.|+|.++.+++++|+.++ +|.+++++++ T Consensus 315 ~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~-----~~~~~~~~v~w~~~~~~s~~~~ada~~kl~~~g~~~e~~~~~l 389 (434) T protein:vir:98 315 DILHVAKVREHIASFSEGLESVLALAAAQAGV-----PEDYTEAEVRWANPAHVTMAVKADAATKLKSIGYPLDVIAEEL 389 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-----ChhheeeeEEecCCCCCCHHHHHHHHHHHHhcCCcHHHHHHhC Confidence 99999999999999999999999998765322 34556799999999999999999999999886 8999999999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCcc Q lcl|NC_018086. 435 PWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTS 487 (511) Q Consensus 435 ~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 487 (511) |+.+ +|++|+++|+++..........+.. ....++.++++.+.+| T Consensus 390 g~~~---~e~~r~~~e~~~~~~~~~~~~~~~~-----~~~~g~~~~~~~~~dg 434 (434) T protein:vir:98 390 DESP---ARVRRIVAGAASQALLAASLLPAPG-----APSAGNVPDSGGAVDG 434 (434) T ss_pred CCCH---HHHHHHHHHHHHHHHHHHhhhccCC-----CCCCCCCCcccCCCCC Confidence 9853 6788888776654443322211111 1122222222222222 No 57 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=100.00 E-value=8.1e-70 Score=399.36 Aligned_cols=441 Identities=12% Similarity=0.044 Sum_probs=322.1 Q ss_pred ccCchh-hHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhh Q lcl|NC_018086. 13 IITTNI-RRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAY 91 (511) Q Consensus 13 ~~~~~~-~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~ 91 (511) ++.... ++.-+.+ .+...+..|+.++..++++++++.+||+|+|.+.+.+...++...++++++||++++|++++++ T Consensus 1 ~~~~~~~~~~gl~~--~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~r 78 (474) T protein:vir:81 1 MIQQQTVRIPSLSN--DENALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDALARR 78 (474) T ss_pred CcCCCcCcCCCCCh--hHHHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHHHHhh Confidence 221111 1111111 1223466788899999999999999999999987776665555556788999999999999999 Q ss_pred hhccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCc--eEEEEEcccceEEEecCCCCCce Q lcl|NC_018086. 92 LAGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKK--HRFKAVSPMNCLIAYSADLDEEP 169 (511) Q Consensus 92 l~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~--~~i~~~~p~~~~~v~d~~~~~~~ 169 (511) +..+||+..+++.....++++|+.|+|+....++++++++|||||++|+.+++|+ ++|+++||++++++||+..+ .+ T Consensus 79 l~~~Gf~~~d~~~~~~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~~~~D~~~~-~~ 157 (474) T protein:vir:81 79 CNLEGFVWPDGDLDSLGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEATGEWNRRRR-GL 157 (474) T ss_pred hcccceECCCCCccchHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEEEEEeCCCC-cc Confidence 9999999766666667899999999999999999999999999999999977665 77999999999999998765 56 Q ss_pred EEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCc-----ccC Q lcl|NC_018086. 170 VAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANE-----ERL 244 (511) Q Consensus 170 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~g~ 244 (511) .+++.++... .+| ....+++|+++.++++....+++. ...+..+|++| ||||+|.|++ +|+ T Consensus 158 ~~al~~~~~~--~~g-~~~~~~ly~~~~~~~~~~~~~~~~----------w~~~~~~~~~g-vPvV~~~n~~~~~~~~G~ 223 (474) T protein:vir:81 158 NNLLSIIDKD--KEG-KVLSLALYLDNETVTAQRDKATLK----------WQVDRDEHVYG-VPAQVLPYKPAPKRPFGQ 223 (474) T ss_pred eeeeEEEEEc--CCC-cEEEEEEEeCCcEEEEEEcCccce----------eeeccCCCCCC-cceEEecccccccCcCCc Confidence 6666655432 233 345678999999998876554321 12466789998 7999999864 799 Q ss_pred chh-HHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccc----hhhhhh--hhCceeeecCCCceeee------e Q lcl|NC_018086. 245 GDF-EAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADS----DSISNM--KNDRVIVTDEDGMVKFI------T 311 (511) Q Consensus 245 s~~-~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~----~~~~~~--~~~~~i~~~~~~~~~~~------~ 311 (511) |+| ++|++|+|++|++++++...++|+++|+++++|.+.+... .....+ .-.+++.++++.+++.. . T Consensus 224 s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~ 303 (474) T protein:vir:81 224 SRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARADV 303 (474) T ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHHHHhcCCCcccccccccccccc Confidence 998 5899999999999999999999999999999998765422 111112 22456777766543221 2 Q ss_pred cCCCHHHHHHHHHHHHHHHHHHh---Cccccccc--cccC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 312 KDVNDKHIENIKNRAKLDIFSLS---QTPDLVSK--DFTA-ASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYL 385 (511) Q Consensus 312 ~~~~~~~~~~~~~~l~~~i~~~s---~~p~~~~~--~~~~-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~ 385 (511) ++++.+++++|++.|+.+++.++ ++|...++ .+.| +||+||++++.+|..|++++++.|+.+|++++++++.+. T Consensus 304 ~q~~~a~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~ 383 (474) T protein:vir:81 304 KQFPAASPDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMK 383 (474) T ss_pred cccCCCChhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 45566667777777776666665 66655554 2344 799999999999999999999999999999999998876 Q ss_pred HhcCCC-ccccccceeEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_018086. 386 EFMNKA-KDLKPYEVTPVFVRNLPQSYAELADMAVKLR----DMLPDETIINQFPWITDARQEVEKADAQRQKRA-DIAL 459 (511) Q Consensus 386 ~~~~~~-~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~----g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~-~~~~ 459 (511) +..... .......+++.|.++..++.++.+++++|+. |+.+.++++.++++. +.++++++.+++... ...+ T Consensus 384 ~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t---~~~i~~~~~~~~~~~~~~~~ 460 (474) T protein:vir:81 384 NKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLT---PQQARRAMADKRRVQGRGTL 460 (474) T ss_pred CCCCccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCC---HHHHHHHHHHHHHHhHHHHH Confidence 543322 1234567899999999999999999999985 346677888888875 345666665543322 2223 Q ss_pred hhccccccCCCCCCccccc Q lcl|NC_018086. 460 QNFKQTSAVQGASTAAANK 478 (511) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~ 478 (511) ...... +..++.. + T Consensus 461 ~~l~~~-~~~~~~a----q 474 (474) T protein:vir:81 461 QALIDR-SNNGATA----Q 474 (474) T ss_pred HHHHhc-CCCCCCC----C Confidence 222111 1111100 0 No 58 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=100.00 E-value=2.7e-70 Score=402.01 Aligned_cols=399 Identities=11% Similarity=0.045 Sum_probs=312.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcC-ccccccceeccchHHHHHHHHHhhhhccCceecCchhhH Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFD-DTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDEKTI 106 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~-~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~ 106 (511) |+...|..|..++..++++++++.+||+|+|.+.+..... +..+..+++++|||+++|+++++++..+|++.. | T Consensus 1 m~~~~i~~L~~~~~~~~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~~Gf~~~-d---- 75 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRIIFREFTND-D---- 75 (422) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhccccceeeCC-c---- Confidence 7888888899999999999999999999999987655443 334455678889999999999999998998742 2 Q ss_pred HHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC-CCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcc Q lcl|NC_018086. 107 KAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR-NKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGH 185 (511) Q Consensus 107 ~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~-~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 185 (511) ..++++|+.|+|+....+++++|++|||||++|+.++ +|.++|+++||.+++++||+.++ ++.+++.+|... ..+. T Consensus 76 ~~l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p~i~~~sp~~~~~i~D~~~~-~~~~a~~~~~~~--~~~~ 152 (422) T protein:vir:97 76 FNAWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAEDGLPKMQVIEASKATGILDPTTF-LLTEGYAILESD--SNGN 152 (422) T ss_pred hhHHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCCCeeEEEEechhhEEEEEeCCCC-cceeeEEEEEec--CCCc Confidence 2479999999999999999999999999999999985 68899999999999999988765 466666665432 2333 Q ss_pred eEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCC-----cccCchh-HHHHHHHHHHHH Q lcl|NC_018086. 186 QIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIAN-----EERLGDF-EAQLSLIDAYNL 259 (511) Q Consensus 186 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~-----~~g~s~~-~~v~~l~d~~~~ 259 (511) ...+.+|++..++++.. .+. ....+|++|.||||+|.|+ .+|+|+| +.|++|+|++|+ T Consensus 153 -~~~~~~~~~~~~~~~~~-~~~--------------~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r 216 (422) T protein:vir:97 153 -PTLEAYFTDKDIWYYPK-KGK--------------PYNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKR 216 (422) T ss_pred -EEEEEEEcCceEEEEcC-CCc--------------cccccCCCCCcceEEecccCCCccccCccccchhHHHHHHHHHH Confidence 23344555555544432 221 1235899999999999986 4689999 579999999999 Q ss_pred HHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCCCc---eeeeecCCCHHHHHHHHHHHHHHHHHHhCc Q lcl|NC_018086. 260 AVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDEDGM---VKFITKDVNDKHIENIKNRAKLDIFSLSQT 336 (511) Q Consensus 260 ~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~---~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~ 336 (511) +++++...++|+++|+++++|.+.+..........-++++.++++.+ +++ ++++.+++++|++.|+.+++.+++. T Consensus 217 ~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~de~~~~~~v--~q~~~~~l~~~~~~l~~~~~~~a~~ 294 (422) T protein:vir:97 217 TLERAEVTAEFYSFPQKYVLGMDPDAKPMEKWRATVSTLLEISKDEDGDKPTV--GQFTTASMAPFMEHLKMYASLFAGG 294 (422) T ss_pred HHHHHHHHHHHhcchhhhhcccCcccccCchhhhhhhhhhccCCCCCCCccee--eecCCCChhHHHHHHHHHHHHHhcc Confidence 99999999999999999999997655444444444457888876544 444 4555666677777777777777655 Q ss_pred ccccccccc-----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcC- Q lcl|NC_018086. 337 PDLVSKDFT-----AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQS- 410 (511) Q Consensus 337 p~~~~~~~~-----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d- 410 (511) ++++...|| ++||+||++++.+|..|+++|++.|+.+|++++++++.+.+.... ......+++++|+++.|.+ T Consensus 295 s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~-~~~~~~~~~~~w~p~~~~~~ 373 (422) T protein:vir:97 295 SGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPY-LRNQFMDTVIKWEPLFEADA 373 (422) T ss_pred cCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc-cchhhccceEEEccCCCCCh Confidence 555444333 379999999999999999999999999999999998887665432 2345667999999888887 Q ss_pred --HHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_018086. 411 --YAELADMAVKLR----DMLPDETIINQFPWITDARQEVEKADAQRQKR 454 (511) Q Consensus 411 --~~e~a~~~~~~~----g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~ 454 (511) .++.|++++|+. |+++.++++++||+ ++++.|..++++++..- T Consensus 374 ~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~-~~~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 374 NMLTLVGDGAIKLNQAIPGFMDADVIRDLTGV-KGADKPIPAITEVTTDG 422 (422) T ss_pred HHHHHHHHHHHHHHhhccccccHHHHHHHcCC-CchhHHHHHHHhhhccC Confidence 677888899885 57889999999998 67788888877653322 No 59 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=100.00 E-value=3.9e-70 Score=401.12 Aligned_cols=387 Identities=11% Similarity=0.027 Sum_probs=311.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCcc-ccccceeccchHHHHHHHHHhhhhccCceecCchhhH Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDT-NKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDEKTI 106 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~-~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~ 106 (511) |+.+.|..|..++..++++++++.+||+|+|.+.+.....++ .+.++|+++|||+++|+++++++..+|++. ++ T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~--~d--- 75 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHKRRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFEN--DD--- 75 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcccCcccC--Cc--- Confidence 888889999999999999999999999999988766554443 345778999999999999999999899862 32 Q ss_pred HHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcce Q lcl|NC_018086. 107 KAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQ 186 (511) Q Consensus 107 ~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~ 186 (511) ..++++|+.|+|+....+++++|++|||||++|+.+++|+++|+++||.+++++||+.. +++.++++++... .. .. T Consensus 76 ~~l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~~i~~~sp~~~~~i~D~~~-~~~~~a~~~~~~d--~~-~~ 151 (409) T protein:vir:94 76 FTVNEIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAVRLQVIEAVNATGIIDPIT-GLLTEGYAVLERD--EN-NN 151 (409) T ss_pred hHHHHHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCceEEEEeccceEEEEEecCC-CceeeeEEEEEec--CC-Cc Confidence 35899999999999999999999999999999999999999999999999999998865 5688888877532 22 23 Q ss_pred EEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCC-----cccCchh-HHHHHHHHHHHHH Q lcl|NC_018086. 187 IRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIAN-----EERLGDF-EAQLSLIDAYNLA 260 (511) Q Consensus 187 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~-----~~g~s~~-~~v~~l~d~~~~~ 260 (511) .....+|+++.++++....+.+ ...+|++|.||||+|.|+ .+|+|+| +.|++|+|++|++ T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~--------------~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~ 217 (409) T protein:vir:94 152 VVLEAHFLPDRTDYYYRDSRNN--------------ISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRT 217 (409) T ss_pred eEEEEEEecCcEEEEEecCcee--------------EeeeCCCCCcceEEeccccccccccCccccchhHHHHHHHHHHH Confidence 4456789999999987665433 245899999999999986 4689999 5799999999999 Q ss_pred HHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCCCc---eeeeecCCCHHHHHHHHHHHHHHHHHHhCcc Q lcl|NC_018086. 261 VSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDEDGM---VKFITKDVNDKHIENIKNRAKLDIFSLSQTP 337 (511) Q Consensus 261 ~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~---~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p 337 (511) ++++.+.++|+++|+++++|.+.+...........++++.++++.+ +++. +++..++++|++.|+.+++.+++.+ T Consensus 218 ~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~d~dg~~~~v~--q~~~~~l~~~~~~l~~~~~~~a~~t 295 (409) T protein:vir:94 218 LERADVTAEFYSFPQKYVTGLSDDAEPMETWKATVSSMLQFTKDEDGDKPTLG--QFTQPSMSPFTEQLRTAAAGFAGET 295 (409) T ss_pred HHHHHHHHHHhcChhheeEecCCCCcccchhhhhHHHhhcCCCCCCCCCceEE--ecCCCChhHHHHHHHHHHHHHhhhc Confidence 9999999999999999999987654433333333467888875533 4543 4455556677777777777777655 Q ss_pred cccccccc-----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcC-- Q lcl|NC_018086. 338 DLVSKDFT-----AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQS-- 410 (511) Q Consensus 338 ~~~~~~~~-----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d-- 410 (511) +++...|| ++||+||++++.+|..++++|++.|+.+|++++++++.+.+.... ...+..++++.|.|..|.+ T Consensus 296 ~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~-~~~~~~~~~v~W~p~~~~~~~ 374 (409) T protein:vir:94 296 GLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPY-LREQFRKTKPKWEPLFEADAS 374 (409) T ss_pred CCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc-cccccccceEEeccCCCcchH Confidence 55544433 379999999999999999999999999999999998887665432 2345567999999777666 Q ss_pred -HHHHHHHHHHHhc----cCChHHHHHhCCCCCCHH Q lcl|NC_018086. 411 -YAELADMAVKLRD----MLPDETIINQFPWITDAR 441 (511) Q Consensus 411 -~~e~a~~~~~~~g----~~s~et~~~~l~~v~d~~ 441 (511) .++.||+++|+.+ +.+.++++.++|+.++ + T Consensus 375 ~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~-d 409 (409) T protein:vir:94 375 MLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGG-E 409 (409) T ss_pred HHHHHHHHHHHHHHhcccccchhHHHHHcCCCCC-C Confidence 5778899999964 4567999999999642 2 No 60 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=100.00 E-value=6.9e-70 Score=399.75 Aligned_cols=388 Identities=10% Similarity=0.007 Sum_probs=302.7 Q ss_pred HHHHHHHHHHHHHHhcCCCcccccCCcCc-cccccceeccchHHHHHHHHHhhhhccCceecCchhhHHHHHHHHhccCh Q lcl|NC_018086. 40 HSRSSSAYGVLYDYYKGNHIAIQSRTFDD-TNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDEKTIKAMQPVFKENYV 118 (511) Q Consensus 40 ~~~~~~~~~~~~~yY~G~~~~~~~~~~~~-~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~~~~n~~ 118 (511) +.-+.+|++++.+||+|+|.+.+.....+ ..+.++|+++||++++|+++++++..+|++. ++ ..++++|+.|+| T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~~--~d---~~l~~i~~~N~l 75 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEAPTGITIPAHIRAKYQAVLGWAAKGVDSLADRLIFRAFAN--DD---FNVTEIFDRNNP 75 (410) T ss_pred CCcchhhHHHHHHHhcCCCCccccchhccHHHHhHHHhhcchhHHHHHHhHhhhccccccC--CC---chHHHHHhhcCh Confidence 23335778899999999999876655443 3445678899999999999999999999863 22 248999999999 Q ss_pred hHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcE Q lcl|NC_018086. 119 TDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLI 198 (511) Q Consensus 119 ~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i 198 (511) +....+++++|++|||||++|+.+++|.++|+++||.+++++||+.+ +++.++++++... .+.....+.+|+++.+ T Consensus 76 d~~~~~~~~~al~~G~sf~~v~~~~d~~~~i~~~sP~~~~~i~Dp~~-~~~~~al~~~~~~---~~~~~~~~~~~~~~~~ 151 (410) T protein:vir:95 76 DIFFDSAILSALIGSCSFVYISKGEDDEVRLQVIESSNATGVIDPIT-GLLVEGYAVLARD---DYNRPTLEAYFEPNAT 151 (410) T ss_pred HHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEEeCCC-CceEEEEEEEEec---CCCeEEEEEEEeCCcE Confidence 99999999999999999999999999999999999999999999855 6788998876532 2334567789999999 Q ss_pred EEEEEccCcccccccccccccccccceeccCCccceEeecCC-----cccCchh-HHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_018086. 199 YKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIAN-----EERLGDF-EAQLSLIDAYNLAVSDSVNDIAYWN 272 (511) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~-----~~g~s~~-~~v~~l~d~~~~~~s~~~~~~~~~~ 272 (511) +++...++. ...+|++|.||||+|.|+ ++|+|+| +.|++|+|++|++++++...++|++ T Consensus 152 ~~~~~~~~~---------------~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a 216 (410) T protein:vir:95 152 HFIPKDGEP---------------YSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYS 216 (410) T ss_pred EEEeeCCcc---------------ccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhc Confidence 988765432 235799999999999985 4689998 5699999999999999999999999 Q ss_pred CceeEeecCCCCccchhhhhhhhCceeeecCCCc---eeeeecCCCHHHHHHHHHHHHHHHHHHhCc---ccccccccc- Q lcl|NC_018086. 273 DAYLWLQGFDLSADSDSISNMKNDRVIVTDEDGM---VKFITKDVNDKHIENIKNRAKLDIFSLSQT---PDLVSKDFT- 345 (511) Q Consensus 273 ~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~---~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~---p~~~~~~~~- 345 (511) +|+++++|.+.+.+.........++++.++++.+ +++. +++..++++|++.|+.+++.+++. |...++..+ T Consensus 217 ~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~~~~~~~~~v~--q~~~~~l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~~ 294 (410) T protein:vir:95 217 WPQKYILGLDPDAEPMEKWKATVSSLLTISSSDKGVKPSVG--QFTTASMSPFTEQLRTAAAGFAGEMGLTLDDLGFVSD 294 (410) T ss_pred chhheeeccCCCCCcCchhhhhhhhheeccCCCCCCcceEE--ecCCCChHHHHHHHHHHHHHHhhhcCCCHHHhccccC Confidence 9999999997665554444455567888876543 4543 444555566666666666666654 544444322 Q ss_pred -CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeC---CCCCcCHHHHHHHHHHH Q lcl|NC_018086. 346 -AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFV---RNLPQSYAELADMAVKL 421 (511) Q Consensus 346 -~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~---~~~p~d~~e~a~~~~~~ 421 (511) ++||+||++++.+|..|+++|++.|+.+|++++++++.+.+.... ......++++.|. ++..++.++.+|+++|+ T Consensus 295 NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~-~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~Kl 373 (410) T protein:vir:95 295 NPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRY-TRSQFVRTAVKWEPLFEADANTMTMIGDGVVKL 373 (410) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-cccccceeeEEeeecCCcchhhHHHHHHHHHHH Confidence 379999999999999999999999999999999998887654332 2345567899998 55556889999999998 Q ss_pred h----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 422 R----DMLPDETIINQFPWITDARQEVEKADAQRQKRAD 456 (511) Q Consensus 422 ~----g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~ 456 (511) . |+++.++++.+|||+++ ++..++.+|+++.-+ T Consensus 374 ~~a~~g~~~~~~~~~~lg~~~~--~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 374 NQALPGYINAETIRDLTGIAGD--MSAKPVVSEGGSNGE 410 (410) T ss_pred HHhccCCccHHHHHHhcCCChH--HHHHHHHHHHHhCCC Confidence 4 67899999999999753 233333344433222 No 61 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=100.00 E-value=6.5e-69 Score=394.41 Aligned_cols=387 Identities=11% Similarity=0.034 Sum_probs=309.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCcc-ccccceeccchHHHHHHHHHhhhhccCceecCchhhH Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDT-NKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDEKTI 106 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~-~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~ 106 (511) |+...|..|..++..++++++++.+||+|+|.+.+.....++ .+.++|+++||++++|+++++++..++++. ++ T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~--~d--- 75 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHKRRAEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFEN--DD--- 75 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhcccccccC--cc--- Confidence 888889999999999999999999999999988665554443 345678899999999999999999899862 32 Q ss_pred HHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcce Q lcl|NC_018086. 107 KAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQ 186 (511) Q Consensus 107 ~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~ 186 (511) ..++++|+.|+|+....+++++|++|||||++|+.+++|+++|+++||.+++++||+.. +++.+++++|... ..+ . T Consensus 76 ~~l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~~i~~~sP~~~~~i~D~~~-~~~~~a~~~~~~d--~~~-~ 151 (409) T protein:vir:16 76 FTVNEIFEENNPDIFFDSTVLSALIASCSFTYISKGENDAVRLQVIEATNATGIIDPIT-GLLTEGYAVLERD--ENN-N 151 (409) T ss_pred hHHHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEeeccc-ccceeeeEEEEec--CCC-c Confidence 35899999999999999999999999999999999999999999999999999998865 5677888776532 223 3 Q ss_pred EEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCC-----cccCchhH-HHHHHHHHHHHH Q lcl|NC_018086. 187 IRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIAN-----EERLGDFE-AQLSLIDAYNLA 260 (511) Q Consensus 187 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~-----~~g~s~~~-~v~~l~d~~~~~ 260 (511) .....+|+++.++++....+.+ ...+|++|.||||+|.|+ ++|+|+|. .|++|+|++|++ T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~ 217 (409) T protein:vir:16 152 VVLEAHFLPDRTDYYYRDSRNN--------------ISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRT 217 (409) T ss_pred eEEEEEEecCcEEEEEecCccc--------------cceecCCCCcceEEecccccccccCCccccchhHHHHHHHHHHH Confidence 3456789999998887655432 346899999999999986 47999994 699999999999 Q ss_pred HHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCCCc---eeeeecCCCHHHHHHHHHHHHHHHHHHhCcc Q lcl|NC_018086. 261 VSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDEDGM---VKFITKDVNDKHIENIKNRAKLDIFSLSQTP 337 (511) Q Consensus 261 ~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~---~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p 337 (511) ++++...++|+++|+++++|.+.+...........++++.++++.+ +++ ++++..++++|++.++.+++.+++.+ T Consensus 218 ~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~d~~g~~~~v--~q~~~~~l~~~~~~l~~~~~~~a~~s 295 (409) T protein:vir:16 218 LERADVTAEFYSFPQKYVTGLSDDAEPMETWKATVSSMLQFTKDEDGDKPTL--GQFTQPSMSPFTEQLRTAAAGFAGET 295 (409) T ss_pred HHHHHHHHHHhcChhheeEecCCCCCccchhhhhhhHhhccCCCCCCCCceE--EecCCCChhHHHHHHHHHHHHHhhhc Confidence 9999999999999999999997654433333444467888875533 444 34555666677777777666666555 Q ss_pred cccccccc----C-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcC-- Q lcl|NC_018086. 338 DLVSKDFT----A-ASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQS-- 410 (511) Q Consensus 338 ~~~~~~~~----~-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d-- 410 (511) +++...|| | +||+||++++.+|..|+++|++.|+.+|++++++++.+.+..... .....++++.|.++.+++ T Consensus 296 ~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~-~~~~~~~~v~W~~~~~~~~~ 374 (409) T protein:vir:16 296 GLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYL-REQFSKTKPKWEPLFEADAS 374 (409) T ss_pred CCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-chhhccceEEecCCCCcchh Confidence 55444332 3 799999999999999999999999999999999988876654322 233467899999877555 Q ss_pred -HHHHHHHHHHHhc----cCChHHHHHhCCCCCCHH Q lcl|NC_018086. 411 -YAELADMAVKLRD----MLPDETIINQFPWITDAR 441 (511) Q Consensus 411 -~~e~a~~~~~~~g----~~s~et~~~~l~~v~d~~ 441 (511) .++.||+++|+.+ +...++++.++|+..+ + T Consensus 375 s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~-d 409 (409) T protein:vir:16 375 MLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGA-E 409 (409) T ss_pred hHHHHHHHHHHHHhhcccccchhHHHHhccCCCC-C Confidence 7889999999964 3456889999998642 2 No 62 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=100.00 E-value=1.2e-59 Score=343.53 Aligned_cols=453 Identities=11% Similarity=0.047 Sum_probs=309.0 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHH-----HHHHHHHHHHHHHHHhcCCCcccccCCcC--cccccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAE-----MHSRSSSAYGVLYDYYKGNHIAIQSRTFD--DTNKPN 73 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-----~~~~~~~~~~~~~~yY~G~~~~~~~~~~~--~~~~~~ 73 (511) |=-.|...+++-+. .-+....+...+. -+..++.++.++++||+|+|++.++.... ...+.. T Consensus 1 m~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~ 69 (496) T protein:vir:38 1 MINQIIAGVKGVMR-----------RMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNR 69 (496) T ss_pred ChhHHHHHHHHHHH-----------HhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCcccc Confidence 22222222111111 1111122222222 13456678889999999999987654433 233445 Q ss_pred ceeccchHHHHHHHHHhhhhccCceecCch-hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEE Q lcl|NC_018086. 74 SKIVHNFPKLLVDTSTAYLAGEPITESGDE-KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAV 152 (511) Q Consensus 74 ~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~-~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~ 152 (511) +++++|||+.||+..++|++|+|++++.++ ...+.|+++++.|+|...+.+++..++++|.+|+.+|.|++|++++.++ T Consensus 70 ~~~~~n~~k~i~~~~a~~l~~~p~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v 149 (496) T protein:vir:38 70 RQLSMNLPKVTAKYMSKLLFNEKVKINIDDKAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFA 149 (496) T ss_pred ceeecchHHHHHHHHhhhhhCCcceEeeCChHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcEEEEEE Confidence 788999999999999999999999997555 5667789999999999999999999999999999999999999999999 Q ss_pred cccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCC----cEEE--EEEccCccccccccc--ccccccccc Q lcl|NC_018086. 153 SPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTED----LIYK--FSTDDEREVYREIPE--ELEIKDYEV 224 (511) Q Consensus 153 ~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~----~i~~--~~~~~~~~~~~~~~~--~~~~~~~~~ 224 (511) +|.+++|+|++..+-...++++.+. .+++.+++++.|+.. .|.+ |............+. ......... T Consensus 150 ~~~~~~P~~~~~~~~~~~~f~~~~~----~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~ 225 (496) T protein:vir:38 150 TADCMYPLSNDSENVDECVIANSFH----KNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVV 225 (496) T ss_pred cccceEEEEecCCcEEEEEEEEEEE----eCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccccccccccccce Confidence 9999999998765433334443332 346677777777532 1211 222111100000000 001112234 Q ss_pred eeccCCccceEeecCC---------cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEe-------ecCCCCccch Q lcl|NC_018086. 225 HPNLLQKFPVLEIIAN---------EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWL-------QGFDLSADSD 288 (511) Q Consensus 225 ~~~~~g~iPvv~~~n~---------~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~-------~G~~~~~~~~ 288 (511) ..+++.++|+++|+++ +.|+|+|+++++++|+||.++|++++.++....++.+- .+..+..... T Consensus 226 ~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~ 305 (496) T protein:vir:38 226 PLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTTQY 305 (496) T ss_pred eecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhcccceecchHHhhccCCCCCccccC Confidence 5577889999998763 46899999999999999999999999999876666651 1111111112 Q ss_pred hhhhhhhCceeeecCCC---ceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccc--cccCccHHHHHHHHHHHHHHH Q lcl|NC_018086. 289 SISNMKNDRVIVTDEDG---MVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSK--DFTAASGQALKAATQPLENKS 363 (511) Q Consensus 289 ~~~~~~~~~~i~~~~~~---~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~~~~~Sg~Ai~~~~~~l~~k~ 363 (511) +..+.....++....++ .++.++.++..+.+...++.+.+.|...++.+...++ ..|..||.+++++++.|.+++ T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~~ 385 (496) T protein:vir:38 306 FDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTK 385 (496) T ss_pred CCCccceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHHHH Confidence 22222223333333222 3455556777888888888888888888887765443 445679999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc--CCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCC Q lcl|NC_018086. 364 AVKESKFRKVLAKRYELVCSYLEFM--NKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITD 439 (511) Q Consensus 364 ~~~~~~~~~~l~~~~~li~~~~~~~--~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d 439 (511) ..+++.|+.+|++++++|+.+.... ..+..+...++++.|++++|.|.++.+++++++ +|++|.+|++..+++++| T Consensus 386 ~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~~~~~~d 465 (496) T protein:vir:38 386 NSHSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQRAWNITE 465 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCh Confidence 9999999999999999988765421 223345566799999999999999999999887 599999999999999987 Q ss_pred HH--HHHHHHHHHHHHHHHHHHhhccccccCCCCCC Q lcl|NC_018086. 440 AR--QEVEKADAQRQKRADIALQNFKQTSAVQGAST 473 (511) Q Consensus 440 ~~--~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (511) ++ +|++|+++|+++.++ ..+..+..|..+ T Consensus 466 ~ea~~el~ri~~E~~~~~~-----~~d~~~~~~~~e 496 (496) T protein:vir:38 466 AEADEWAEMLAKEKQAEMP-----NNDMNGIFGEEE 496 (496) T ss_pred HHHHHHHHHHHHhhhccCc-----cccccCCCCCCC Confidence 55 578888887655432 111111111111 No 63 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=100.00 E-value=1.3e-56 Score=327.03 Aligned_cols=453 Identities=11% Similarity=0.050 Sum_probs=307.7 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHH-----HHHHHHHHHHHHHHHHHhcCCCcccccCCcC--cccccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITL-----AEMHSRSSSAYGVLYDYYKGNHIAIQSRTFD--DTNKPN 73 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-----~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~--~~~~~~ 73 (511) |=-.|.+.+++-+ ..-.-...|... +.-+.....++.++++||.|+|+..+..... ...+.+ T Consensus 1 m~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~ 69 (499) T protein:vir:80 1 MINQIIAGVKGVM-----------RRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNR 69 (499) T ss_pred ChhHHHHHHHHHH-----------HHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCcccc Confidence 1111111111111 000000011111 1123455577888899999999876654333 334557 Q ss_pred ceeccchHHHHHHHHHhhhhccCceecCch-hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEE Q lcl|NC_018086. 74 SKIVHNFPKLLVDTSTAYLAGEPITESGDE-KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAV 152 (511) Q Consensus 74 ~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~-~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~ 152 (511) +++++|+++.||++.|+|++|+|++++.++ ...+.|+++++.|+|...+.+++..|+++|.+|+.+|.|++|++++.++ T Consensus 70 ~~~s~n~~~~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v 149 (499) T protein:vir:80 70 RQLSMNLPKVTAKYMSKLLFNEKVKINIDDETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFA 149 (499) T ss_pred ceeecchHHHHHHHHHHhhhCCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcEEEEEE Confidence 789999999999999999999999987554 5667789999999999999999999999999999999999999999999 Q ss_pred cccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcC--CcEEEE-------EEccCcccccccc--ccccccc Q lcl|NC_018086. 153 SPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTE--DLIYKF-------STDDEREVYREIP--EELEIKD 221 (511) Q Consensus 153 ~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~--~~i~~~-------~~~~~~~~~~~~~--~~~~~~~ 221 (511) +|.+++|+|.+..+....++++.+.. +++.+++++.|+- .....| ............+ ....... T Consensus 150 ~a~~~~Pi~~d~~~~~~~~f~~~~~~----~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~ 225 (499) T protein:vir:80 150 TADCMYPLSNDSENVDECLIANSFHK----NNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIE 225 (499) T ss_pred cCCceEEEEecCCCeEEEEEEEEEee----cCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcC Confidence 99999999877644333444443332 3455556665432 111111 1111110000000 0011112 Q ss_pred ccceeccCCccceEeecCC---------cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeE-------eecCCCCc Q lcl|NC_018086. 222 YEVHPNLLQKFPVLEIIAN---------EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLW-------LQGFDLSA 285 (511) Q Consensus 222 ~~~~~~~~g~iPvv~~~n~---------~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~-------~~G~~~~~ 285 (511) .....++++++|+++|+++ +.|+|+|+++++|+|+||+.+|++++.++....++.+ ..+.++.. T Consensus 226 ~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~g~~ 305 (499) T protein:vir:80 226 PVVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGST 305 (499) T ss_pred CceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhcccceecchhhhhccCCCCCCc Confidence 2334467899999999864 4589999999999999999999999999998888877 22222222 Q ss_pred cchhhhhhhhCceeeecCC-C--ceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccc--cccCccHHHHHHHHHHHH Q lcl|NC_018086. 286 DSDSISNMKNDRVIVTDED-G--MVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSK--DFTAASGQALKAATQPLE 360 (511) Q Consensus 286 ~~~~~~~~~~~~~i~~~~~-~--~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~~~~~Sg~Ai~~~~~~l~ 360 (511) ...+..+...+.++.+..+ + .++.++.++..+++...++.+.+.|...++.+...++ ..|..||.+++++++.+. T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~ 385 (499) T protein:vir:80 306 TQYFDSTDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETY 385 (499) T ss_pred ccCCCcccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHH Confidence 2233333444445443322 2 3556667788899988899888888888887754443 445679999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCC Q lcl|NC_018086. 361 NKSAVKESKFRKVLAKRYELVCSYLEFMN--KAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPW 436 (511) Q Consensus 361 ~k~~~~~~~~~~~l~~~~~li~~~~~~~~--~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~ 436 (511) .++..+++.|+.+|++++++|+.+....+ .+......+++|.|++.++.|..+.++++.++ +|++|.+|++.++++ T Consensus 386 ~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~ 465 (499) T protein:vir:80 386 QTKNSHSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIALQRAWN 465 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHhhcCC Confidence 99999999999999999999987655322 12234556899999999999999999998887 599999999999999 Q ss_pred CCCHH--HHHHHHHHHHHHHHHHHHhhccccccCCCCCC Q lcl|NC_018086. 437 ITDAR--QEVEKADAQRQKRADIALQNFKQTSAVQGAST 473 (511) Q Consensus 437 v~d~~--~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (511) ++|.+ +|++|+++|+....+ ..+..+..|..+ T Consensus 466 ~~d~ea~~el~~i~~E~~~~~~-----~~d~~g~~ge~e 499 (499) T protein:vir:80 466 ITEAEADEWAEMLAKEKQAEIP-----NNDMTGIFGEEE 499 (499) T ss_pred CChHHHHHHHHHHHHHhhcCCC-----CCCccccCCCCC Confidence 88755 567787777544221 112222222222 No 64 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=100.00 E-value=2.8e-53 Score=308.74 Aligned_cols=458 Identities=11% Similarity=0.014 Sum_probs=309.4 Q ss_pred Cccchhhcccc------cCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCcccccccee Q lcl|NC_018086. 3 IPNGQINAGDI------ITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKI 76 (511) Q Consensus 3 ~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri 76 (511) |++++=+++=+ +.....++.+.+. +.|.--...+.+++++++||.|+++..+............++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~--------~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~ 72 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDD--------PRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKN 72 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcc--------cccccCHHHHHHHHHHHHHhcCCCcccccccCCCCcccccee Confidence 44443333221 1111111111110 011112345678889999999999876544433333445578 Q ss_pred ccchHHHHHHHHHhhhhccCceecC--chhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcc Q lcl|NC_018086. 77 VHNFPKLLVDTSTAYLAGEPITESG--DEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSP 154 (511) Q Consensus 77 ~~n~~k~ivd~~~~~l~g~~~~~~~--d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p 154 (511) ++|+++.|++.+|+++|++|++++. ++...+.|++++++|+|...+.+++..++++|.+++.+|.+. ++++|.+++| T Consensus 73 sln~~~~i~~~~A~lv~~e~~~i~v~~~~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~i~~v~a 151 (508) T protein:vir:15 73 TINMAKTAARRIASVVFNEKAEIHVKDNNEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDG-NHIKIAWVRA 151 (508) T ss_pred ecchHHHHHHHHHhhhhCCCceEEeCCchHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeC-CeeEEEEEcC Confidence 8999999999999999999988764 344556799999999999999999999999999999999985 6799999999 Q ss_pred cceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEc-----CCcEEEEEEccCc-----ccc-cc-cccccccccc Q lcl|NC_018086. 155 MNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYT-----EDLIYKFSTDDER-----EVY-RE-IPEELEIKDY 222 (511) Q Consensus 155 ~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~-----~~~i~~~~~~~~~-----~~~-~~-~~~~~~~~~~ 222 (511) ..++|+..+..+....++++.+...+...++.+++++.|+ +..|.+....... ... .. .+....+ .. T Consensus 152 d~~~P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l-~~ 230 (508) T protein:vir:15 152 DQFYPLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKEL-AP 230 (508) T ss_pred CeeEEEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCC-Cc Confidence 9999975554443344444544444444566677788876 3333333222211 111 00 0111111 22 Q ss_pred cceeccCCccceEeecCC---------cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEe---ecCCCCccchhh Q lcl|NC_018086. 223 EVHPNLLQKFPVLEIIAN---------EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWL---QGFDLSADSDSI 290 (511) Q Consensus 223 ~~~~~~~g~iPvv~~~n~---------~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~---~G~~~~~~~~~~ 290 (511) ....+++.++|+++|+++ +.|+|+|+++++++|++|.++|++++.++....++.+. ...+.+....+. T Consensus 231 ~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~d~~~~~~~~ 310 (508) T protein:vir:15 231 QVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRLGQKHIAVQPGMLRFDDEHKPTFD 310 (508) T ss_pred ceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcCCCCCccccC Confidence 345578888999998762 46999999999999999999999999998777777773 333322221121 Q ss_pred hhhhhCceeeecCC--CceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccc--cccCccHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 291 SNMKNDRVIVTDED--GMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSK--DFTAASGQALKAATQPLENKSAVK 366 (511) Q Consensus 291 ~~~~~~~~i~~~~~--~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~~~~~Sg~Ai~~~~~~l~~k~~~~ 366 (511) .+...+..+...++ ..++.+++++..+.+...++.+.+.|...++.+...++ ..+..||++++++.+.+..++..+ T Consensus 311 ~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t~~~~ 390 (508) T protein:vir:15 311 TEQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQTRSSY 390 (508) T ss_pred CCCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHHHHHHHHHH Confidence 22122233333333 34667778888999999999999998888887755443 335579999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCC----------ccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC Q lcl|NC_018086. 367 ESKFRKVLAKRYELVCSYLEFMNKA----------KDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQF 434 (511) Q Consensus 367 ~~~~~~~l~~~~~li~~~~~~~~~~----------~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l 434 (511) ++.|+.+|++++++|+.++...+.. ......+++|.|.+.++.|..+.++..+++ +|++|++++++++ T Consensus 391 ~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi~s~e~~i~~~ 470 (508) T protein:vir:15 391 LTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGALSKQTFLQRN 470 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhc Confidence 9999999999999998876542211 112345689999999999999999888876 5999999999999 Q ss_pred CCCCC--HHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCc Q lcl|NC_018086. 435 PWITD--ARQEVEKADAQRQKRADIALQNFKQTSAVQGASTA 474 (511) Q Consensus 435 ~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 474 (511) +++++ +++|++||++|+.+... ..++.....|.++. T Consensus 471 ~g~~deea~~el~ri~~E~~~~~~----~~~~~~~~~g~~ge 508 (508) T protein:vir:15 471 YGMTDEQAAEELAKIQSEAPTDTF----EGGRSAILNGGDGE 508 (508) T ss_pred CCCChHHHHHHHHHHHHhccccCc----cccccccCCCCCCC Confidence 88876 45688888888543211 11111111111111 No 65 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=100.00 E-value=8.4e-53 Score=306.07 Aligned_cols=452 Identities=9% Similarity=0.005 Sum_probs=311.8 Q ss_pred CccchhhcccccC------chhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCcccccccee Q lcl|NC_018086. 3 IPNGQINAGDIIT------TNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKI 76 (511) Q Consensus 3 ~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri 76 (511) |+++.-+++=|.. ....+..+.+.. .+.--.....+++++++||.|+++.++........+..+++ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~--------~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~ 72 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDP--------RINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQ 72 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhccc--------CCCCCHHHHHHHHHHHHHhcCCCccccccccCCCcccccee Confidence 5555443332211 111111111100 01111234466778889999999877655555555566788 Q ss_pred ccchHHHHHHHHHhhhhccCceecCc-hhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEccc Q lcl|NC_018086. 77 VHNFPKLLVDTSTAYLAGEPITESGD-EKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPM 155 (511) Q Consensus 77 ~~n~~k~ivd~~~~~l~g~~~~~~~d-~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~ 155 (511) ++|+++.|++.+|+++|++|++++.+ +...+.|.+++++|+|...+.+++..++.+|.+++.+|.+. |+++|.+++|. T Consensus 73 slnl~~~i~~~~A~ll~~e~~~i~~~d~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D~-~~~~i~~v~ad 151 (505) T protein:vir:79 73 SVNVTKLASAKLASLIFNEQCQVTVSDETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVDS-GKIKLAWATAD 151 (505) T ss_pred ecchHHHHHHHHHhhhcCCCceeecCChHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEeC-CceEEEEEcCC Confidence 89999999999999999999998754 55677899999999999999999999999999999999984 78999999999 Q ss_pred ceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEc----CCcEEEEEEcc-Cccccc------ccccccccccccc Q lcl|NC_018086. 156 NCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYT----EDLIYKFSTDD-EREVYR------EIPEELEIKDYEV 224 (511) Q Consensus 156 ~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~----~~~i~~~~~~~-~~~~~~------~~~~~~~~~~~~~ 224 (511) .++|++.+..+....+++..|.......+..+++++.|+ +..|.+..... ....+. ..+.. .....+. T Consensus 152 ~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~-~~l~~~~ 230 (505) T protein:vir:79 152 QVYPLQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQY-EGLEPQV 230 (505) T ss_pred eeEEEEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccc-cccCcce Confidence 999997666666566767666665554455666788886 33333222211 110100 11111 1112234 Q ss_pred eeccCCccceEeecCC---------cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecC-------CCCccch Q lcl|NC_018086. 225 HPNLLQKFPVLEIIAN---------EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGF-------DLSADSD 288 (511) Q Consensus 225 ~~~~~g~iPvv~~~n~---------~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~-------~~~~~~~ 288 (511) ..+++++.++++|+++ +.|+|+|+++++++|++|.++|++++.++....++.+-..+ .+..... T Consensus 231 ~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~~~ 310 (505) T protein:vir:79 231 KITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYGGQASET 310 (505) T ss_pred eecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccCCCCcccccc Confidence 4567888888888752 36899999999999999999999999999988887772211 0111110 Q ss_pred ----hhhhhhhCceeeecC-CCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccc--cccCccHHHHHHHHHHHHH Q lcl|NC_018086. 289 ----SISNMKNDRVIVTDE-DGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSK--DFTAASGQALKAATQPLEN 361 (511) Q Consensus 289 ----~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~~~~~Sg~Ai~~~~~~l~~ 361 (511) +..+...+..+..++ ++.++.++.++..+++...++.+.+.|...++.+...++ ..+..||++++++.+.+.. T Consensus 311 ~~~~fd~~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l~~ 390 (505) T protein:vir:79 311 HPPMFDPDETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQTYQ 390 (505) T ss_pred cccCCCccceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhHHHH Confidence 111111122222222 334566777788899999999999999988887654443 4456799999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCC--------CccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHH Q lcl|NC_018086. 362 KSAVKESKFRKVLAKRYELVCSYLEFMNK--------AKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETII 431 (511) Q Consensus 362 k~~~~~~~~~~~l~~~~~li~~~~~~~~~--------~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~ 431 (511) +++.+++.|+.+|+++++.|+.+....+. .......+++|.|.+.++.|..+.++...++ +|++|.++++ T Consensus 391 t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~Gi~s~e~~l 470 (505) T protein:vir:79 391 TRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQAQVMPKKQFL 470 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHH Confidence 99999999999999999999887654221 2234456799999999999999999888776 5899999999 Q ss_pred HhCCCCCC--HHHHHHHHHHHHHHHHHHHHhhccccccC Q lcl|NC_018086. 432 NQFPWITD--ARQEVEKADAQRQKRADIALQNFKQTSAV 468 (511) Q Consensus 432 ~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 468 (511) .++++++| +++|++||++|+...++. +++.++. T Consensus 471 ~~~~~~~eeea~~el~ri~~E~~~~~p~----~~~~gg~ 505 (505) T protein:vir:79 471 MRNYGLDEEEADEWLAQIDAENSTAEPE----FNQFGGD 505 (505) T ss_pred HhcCCCChHHHHHHHHHHHHhccccCCC----chhccCC Confidence 99999877 567899998886542222 1111111 No 66 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=100.00 E-value=5.9e-49 Score=284.99 Aligned_cols=455 Identities=8% Similarity=-0.027 Sum_probs=295.6 Q ss_pred CccchhhcccccCc-----hhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceec Q lcl|NC_018086. 3 IPNGQINAGDIITT-----NIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIV 77 (511) Q Consensus 3 ~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~ 77 (511) |++++-+++=+-.. ..+++.+.+. ..+.--.+++.+|+++++||+|+++........+..+..++++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~--------~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~s 72 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDH--------PKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNH 72 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhcc--------ccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceee Confidence 66665444332110 0011111110 0011113566788999999999987654444445555667889 Q ss_pred cchHHHHHHHHHhhhhccCceecCch-hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccc Q lcl|NC_018086. 78 HNFPKLLVDTSTAYLAGEPITESGDE-KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMN 156 (511) Q Consensus 78 ~n~~k~ivd~~~~~l~g~~~~~~~d~-~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~ 156 (511) +|+++.|++.+|+++|++|++++.++ ...+.|+++++.|+|...+.+++..++..|.+|+.+|.+. ++++|.+++|.. T Consensus 73 lnl~~~i~~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~I~~v~ad~ 151 (500) T protein:vir:30 73 LPIARTAAKKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKVRVAFVQAPV 151 (500) T ss_pred cchHHHHHHHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCe Confidence 99999999999999999999987655 5667899999999999999999999999999999999985 679999999999 Q ss_pred eEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEc--CC---cEEEEEEccC-----ccccccccccccccccccee Q lcl|NC_018086. 157 CLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYT--ED---LIYKFSTDDE-----REVYREIPEELEIKDYEVHP 226 (511) Q Consensus 157 ~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~--~~---~i~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~ 226 (511) ++|+..+..+....++++.+....+..+..+++++.|+ .+ .|.+...... +....... ........... T Consensus 152 ~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~-~~~~l~~~~~~ 230 (500) T protein:vir:30 152 FLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSE-VYKDLKDEAKV 230 (500) T ss_pred eEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCccccccc-ccCCcCcceEe Confidence 99987666554334443333333333344556777765 22 2322222211 11110111 11111233445 Q ss_pred ccCCccceEeecCC---------cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecC--------CCCccchh Q lcl|NC_018086. 227 NLLQKFPVLEIIAN---------EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGF--------DLSADSDS 289 (511) Q Consensus 227 ~~~g~iPvv~~~n~---------~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~--------~~~~~~~~ 289 (511) +++.++|+++|+++ +.|.|+|+++++++|++|..+|++++.++....++.+-..+ ++.....+ T Consensus 231 ~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~ 310 (500) T protein:vir:30 231 TDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVVPRP 310 (500) T ss_pred ccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccccCCc Confidence 67788888887642 46999999999999999999999999999877777663222 11111122 Q ss_pred hhhhhhC--ceeeecCC--CceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccc--ccccCccHHHHHHHHHHHHHHH Q lcl|NC_018086. 290 ISNMKND--RVIVTDED--GMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVS--KDFTAASGQALKAATQPLENKS 363 (511) Q Consensus 290 ~~~~~~~--~~i~~~~~--~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~--~~~~~~Sg~Ai~~~~~~l~~k~ 363 (511) ..+.... ..+...++ ..++.+++++..+.+...++.+.+.|...++.+...+ ...+..||++++++++.+..++ T Consensus 311 ~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~ 390 (500) T protein:vir:30 311 RFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMR 390 (500) T ss_pred ccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHH Confidence 2222222 22222222 2356666777788888878777777776666554333 3345679999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc--CCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCC Q lcl|NC_018086. 364 AVKESKFRKVLAKRYELVCSYLEFM--NKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITD 439 (511) Q Consensus 364 ~~~~~~~~~~l~~~~~li~~~~~~~--~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d 439 (511) +.+++.|+.+|++++++|+.+.... .........+++|.|++.++.|..+.++..+++ +|++|.++++.++.++++ T Consensus 391 ~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g~~e 470 (500) T protein:vir:30 391 NSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLNVTE 470 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCCH Confidence 9999999999999999998765432 112222345789999999999999999888876 589999999988765554 Q ss_pred --HHHHHHHHHHHHHHHHHHHHhhccccccCCCC Q lcl|NC_018086. 440 --ARQEVEKADAQRQKRADIALQNFKQTSAVQGA 471 (511) Q Consensus 440 --~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 471 (511) ++++++++++|+.. ..+.+....+.-|. T Consensus 471 eea~~~l~~i~~E~~~----~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 471 EKAQEIAAEINTGIVD----EINQQRTDTHLYGE 500 (500) T ss_pred HHHHHHHHHHHHhccc----cCCCCCccccccCC Confidence 23445666554211 00011111111111 No 67 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=100.00 E-value=5.9e-49 Score=284.99 Aligned_cols=455 Identities=8% Similarity=-0.027 Sum_probs=295.6 Q ss_pred CccchhhcccccCc-----hhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceec Q lcl|NC_018086. 3 IPNGQINAGDIITT-----NIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIV 77 (511) Q Consensus 3 ~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~ 77 (511) |++++-+++=+-.. ..+++.+.+. ..+.--.+++.+|+++++||+|+++........+..+..++++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~--------~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~s 72 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDH--------PKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNH 72 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhcc--------ccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceee Confidence 66665444332110 0011111110 0011113566788999999999987654444445555667889 Q ss_pred cchHHHHHHHHHhhhhccCceecCch-hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccc Q lcl|NC_018086. 78 HNFPKLLVDTSTAYLAGEPITESGDE-KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMN 156 (511) Q Consensus 78 ~n~~k~ivd~~~~~l~g~~~~~~~d~-~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~ 156 (511) +|+++.|++.+|+++|++|++++.++ ...+.|+++++.|+|...+.+++..++..|.+|+.+|.+. ++++|.+++|.. T Consensus 73 lnl~~~i~~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~I~~v~ad~ 151 (500) T protein:vir:98 73 LPIARTAAKKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKVRVAFVQAPV 151 (500) T ss_pred cchHHHHHHHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCe Confidence 99999999999999999999987655 5667899999999999999999999999999999999985 679999999999 Q ss_pred eEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEc--CC---cEEEEEEccC-----ccccccccccccccccccee Q lcl|NC_018086. 157 CLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYT--ED---LIYKFSTDDE-----REVYREIPEELEIKDYEVHP 226 (511) Q Consensus 157 ~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~--~~---~i~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~ 226 (511) ++|+..+..+....++++.+....+..+..+++++.|+ .+ .|.+...... +....... ........... T Consensus 152 ~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~-~~~~l~~~~~~ 230 (500) T protein:vir:98 152 FLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSE-VYKDLKDEAKV 230 (500) T ss_pred eEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCccccccc-ccCCcCcceEe Confidence 99987666554334443333333333344556777765 22 2322222211 11110111 11111233445 Q ss_pred ccCCccceEeecCC---------cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecC--------CCCccchh Q lcl|NC_018086. 227 NLLQKFPVLEIIAN---------EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGF--------DLSADSDS 289 (511) Q Consensus 227 ~~~g~iPvv~~~n~---------~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~--------~~~~~~~~ 289 (511) +++.++|+++|+++ +.|.|+|+++++++|++|..+|++++.++....++.+-..+ ++.....+ T Consensus 231 ~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~ 310 (500) T protein:vir:98 231 TDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVVPRP 310 (500) T ss_pred ccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccccCCc Confidence 67788888887642 46999999999999999999999999999877777663222 11111122 Q ss_pred hhhhhhC--ceeeecCC--CceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccc--ccccCccHHHHHHHHHHHHHHH Q lcl|NC_018086. 290 ISNMKND--RVIVTDED--GMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVS--KDFTAASGQALKAATQPLENKS 363 (511) Q Consensus 290 ~~~~~~~--~~i~~~~~--~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~--~~~~~~Sg~Ai~~~~~~l~~k~ 363 (511) ..+.... ..+...++ ..++.+++++..+.+...++.+.+.|...++.+...+ ...+..||++++++++.+..++ T Consensus 311 ~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~ 390 (500) T protein:vir:98 311 RFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMR 390 (500) T ss_pred ccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHH Confidence 2222222 22222222 2356666777788888878777777776666554333 3345679999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc--CCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCC Q lcl|NC_018086. 364 AVKESKFRKVLAKRYELVCSYLEFM--NKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITD 439 (511) Q Consensus 364 ~~~~~~~~~~l~~~~~li~~~~~~~--~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d 439 (511) +.+++.|+.+|++++++|+.+.... .........+++|.|++.++.|..+.++..+++ +|++|.++++.++.++++ T Consensus 391 ~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g~~e 470 (500) T protein:vir:98 391 NSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLNVTE 470 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCCH Confidence 9999999999999999998765432 112222345789999999999999999888876 589999999988765554 Q ss_pred --HHHHHHHHHHHHHHHHHHHHhhccccccCCCC Q lcl|NC_018086. 440 --ARQEVEKADAQRQKRADIALQNFKQTSAVQGA 471 (511) Q Consensus 440 --~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 471 (511) ++++++++++|+.. ..+.+....+.-|. T Consensus 471 eea~~~l~~i~~E~~~----~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 471 EKAQEIAAEINTGIVD----EINQQRTDTHLYGE 500 (500) T ss_pred HHHHHHHHHHHHhccc----cCCCCCccccccCC Confidence 23445666554211 00011111111111 No 68 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=100.00 E-value=7.2e-48 Score=279.02 Aligned_cols=466 Identities=10% Similarity=0.014 Sum_probs=298.1 Q ss_pred CccchhhcccccCchhhHhhhhccCCCHHHHH--HHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccch Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRNFDLRELI--TLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNF 80 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~ 80 (511) |+|.+-+++=+-.-. .+ +..+++ ..|. ..+.-+.++..+|.+++.||+|+++.........+...++++++|+ T Consensus 1 m~~~~~~k~~~~k~~-~~--~~~~~~--~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl 75 (522) T protein:vir:47 1 MSLFQKVKDFFSRGR-YY--MQTSNL--NSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPI 75 (522) T ss_pred CchHHHHHHHHHHHH-HH--hhcccc--hhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecch Confidence 444433322221110 00 000000 0000 0122355677888999999999987654444444455567889999 Q ss_pred HHHHHHHHHhhhhccCceecCch-hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEE Q lcl|NC_018086. 81 PKLLVDTSTAYLAGEPITESGDE-KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLI 159 (511) Q Consensus 81 ~k~ivd~~~~~l~g~~~~~~~d~-~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~ 159 (511) ++.|++..|+++|++|++++.++ ...+.|++++++|+|...+.+++..++..|.+++.+|++ .+++++.+++|...+| T Consensus 76 ~~~i~~~~A~lv~~e~~~i~v~d~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d-~~~~~i~~v~ad~~~P 154 (522) T protein:vir:47 76 ARTASKKIASLVYNEQATITTKNEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYID-GDKVRVAFIQAPVFFP 154 (522) T ss_pred HHHHHHHHhhhhcCCcceeecCChHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEc-CCceEEEEEcCCceEE Confidence 99999999999999999987554 566789999999999999999999999999999999997 4789999999999999 Q ss_pred EecCCCCCceEEEEEEEEEee-cCCcceEEEEEEEc----------------CCcEEE--EEEccC---cc-c-cccccc Q lcl|NC_018086. 160 AYSADLDEEPVAAIYYNTVIS-DITGHQIRTYEVYT----------------EDLIYK--FSTDDE---RE-V-YREIPE 215 (511) Q Consensus 160 v~d~~~~~~~~~~v~~~~~~~-~~~~~~~~~~~~~~----------------~~~i~~--~~~~~~---~~-~-~~~~~~ 215 (511) +..+.... ..+++-+..... +..+..++.++.|+ +..|.. |..... +. + +...+. T Consensus 155 ~~~~~~~~-~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e 233 (522) T protein:vir:47 155 LESNTQDV-SSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDK 233 (522) T ss_pred EEEcCCce-EEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCcccccccccc Confidence 86555433 333333222222 22222233455442 122221 111110 10 0 011111 Q ss_pred ccccccccceeccCCccceEeecCC---------cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecC----- Q lcl|NC_018086. 216 ELEIKDYEVHPNLLQKFPVLEIIAN---------EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGF----- 281 (511) Q Consensus 216 ~~~~~~~~~~~~~~g~iPvv~~~n~---------~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~----- 281 (511) ...+ ......+++.++++++|+++ +.|+|+|++.++++|++|.++|++++.++....++.+-..+ T Consensus 234 ~~~l-~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~ 312 (522) T protein:vir:47 234 YKNL-EPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLTQRQY 312 (522) T ss_pred ccCC-CCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchHHhccCC Confidence 1111 22334567788888888763 46999999999999999999999999999988887772221 Q ss_pred ---CCCccchhhhhh--hhCceeeec--CCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccc--cccCccHHHH Q lcl|NC_018086. 282 ---DLSADSDSISNM--KNDRVIVTD--EDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSK--DFTAASGQAL 352 (511) Q Consensus 282 ---~~~~~~~~~~~~--~~~~~i~~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~~~~~Sg~Ai 352 (511) .+........+. .-+..+... ++.+++.+++++..+.+...++.+.+.|...++.....++ ..+..||+++ T Consensus 313 ~~~~g~~~~~~~fd~~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi 392 (522) T protein:vir:47 313 QRPDGTIDFRPRFDVEQNVYMQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEI 392 (522) T ss_pred CCCCcccccccccCcccceEeecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccccccHHHH Confidence 111111111111 112223322 2345677778888888888888888877777766543333 3345789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChH Q lcl|NC_018086. 353 KAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMN--KAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDE 428 (511) Q Consensus 353 ~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~--~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~e 428 (511) ++..+.+..+++.+++.|+.+|+++++.|+.+....+ ........+++|.|.+.++.|..+.++..+++ +|++|.+ T Consensus 393 ~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~aG~~s~e 472 (522) T protein:vir:47 393 VSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVAAGFSTKK 472 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHHHhcCCCCHH Confidence 9999999999999999999999999999987765322 12233456799999999999999998888886 5999999 Q ss_pred HHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCC Q lcl|NC_018086. 429 TIINQFPWITD--ARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPA 484 (511) Q Consensus 429 t~~~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (511) +++.+++++++ +++|++||++|+.+..+. .....|..+ ..++.++..| T Consensus 473 ~~i~~~~g~~eeea~~el~ri~~E~~~~~~~-------~~~~~~~~~-~~~~~~d~~~ 522 (522) T protein:vir:47 473 RAIGKTLNISGVEAEKELNAINSELLPMNDA-------ELAIYGMHD-QNEEKADDKG 522 (522) T ss_pred HHHHhcCCCChHHHHHHHHHHHHhhccCCCC-------CCCCCCCCC-cccccCCCCC Confidence 99998877765 456888888774432110 000111111 1111111111 No 69 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=100.00 E-value=7.7e-49 Score=284.37 Aligned_cols=476 Identities=12% Similarity=0.101 Sum_probs=312.2 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHH-HHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMH-SRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHN 79 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n 79 (511) |+. ++ -+|-.+ ..+ --.+.+| -..+..+ ++|+..|+.|.+||.+.+..+..........-..++.++ T Consensus 1 ~~~--~~---~~~~~~-~~~-~~g~~~~-----p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~p 68 (527) T protein:vir:10 1 MGQ--DK---RQYGST-QQL-RAGEANF-----PNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVP 68 (527) T ss_pred CCc--cc---cccCCC-cCc-CCccccC-----cccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeeh Confidence 332 21 111111 111 0011111 1113333 567899999999999987654432222222223457788 Q ss_pred hHHHHHHHHHhhhhccCceec---CchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC----CCceEEEEE Q lcl|NC_018086. 80 FPKLLVDTSTAYLAGEPITES---GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR----NKKHRFKAV 152 (511) Q Consensus 80 ~~k~ivd~~~~~l~g~~~~~~---~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~----~g~~~i~~~ 152 (511) -.+++|+....|+ +.|..+. .+++..+.+..+++.|++..++.+..+++.+.|.+.+++-+|+ .++++++.+ T Consensus 69 s~~~~~~~~~~~~-~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~ 147 (527) T protein:vir:10 69 NGEKLIEAKMRFL-GQGLKWEFSKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEV 147 (527) T ss_pred hhHHhhCCcceee-ccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeec Confidence 8888888776654 4554432 3455667788889999999999999999999998766555543 247999999 Q ss_pred cccceEEEecCCCCCceEEE--EEEEEEeecCCcceE-EEE-----EEE-----cCCcEEEEE---EccCcccc-c-cc- Q lcl|NC_018086. 153 SPMNCLIAYSADLDEEPVAA--IYYNTVISDITGHQI-RTY-----EVY-----TEDLIYKFS---TDDEREVY-R-EI- 213 (511) Q Consensus 153 ~p~~~~~v~d~~~~~~~~~~--v~~~~~~~~~~~~~~-~~~-----~~~-----~~~~i~~~~---~~~~~~~~-~-~~- 213 (511) ||...|++.|+...+.+... +.-|...++.....+ -++ ++- +...-+.|. +.-+.|.- . .+ T Consensus 148 DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~ 227 (527) T protein:vir:10 148 DPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPL 227 (527) T ss_pred CcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeecccccccccccc Confidence 99999999887665544443 222332222211111 000 000 001111110 11111110 0 00 Q ss_pred -----ccccccccccceeccCCccceEeecCC-----cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCC Q lcl|NC_018086. 214 -----PEELEIKDYEVHPNLLQKFPVLEIIAN-----EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDL 283 (511) Q Consensus 214 -----~~~~~~~~~~~~~~~~g~iPvv~~~n~-----~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~ 283 (511) .......+.+..++++++||||+|+|- .+|+|+++++++++|++|+++|+.+.++++...|+.+++|... T Consensus 228 ~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~ 307 (527) T protein:vir:10 228 EPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPP 307 (527) T ss_pred chhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeeccccc Confidence 001233456788999999999999763 4799999999999999999999999999999999999999865 Q ss_pred Cccchh--hhhhhhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc---cCccHHHHHHHHHH Q lcl|NC_018086. 284 SADSDS--ISNMKNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF---TAASGQALKAATQP 358 (511) Q Consensus 284 ~~~~~~--~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~---~~~Sg~Ai~~~~~~ 358 (511) -+.... ...+..+.+|.+++++++..+......+.++.|++.|.+.|+..|++|...++.. .+.||.||+..+++ T Consensus 308 vd~~G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~P 387 (527) T protein:vir:10 308 RDSRGNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSA 387 (527) T ss_pred ccccCCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHH Confidence 432211 1234456788899999998877666889999999999999999999999999843 36799999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHh---cCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHH Q lcl|NC_018086. 359 LENKSAVKESKFRKVLAKRYE-LVCSYLEF---MNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIIN 432 (511) Q Consensus 359 l~~k~~~~~~~~~~~l~~~~~-li~~~~~~---~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~ 432 (511) |.+++.+++..++-..++..+ ++..++.. ...........+.|.|.+++|.|.++.++.++++ +|++|.+|++. T Consensus 388 Llar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGi~S~~tAv~ 467 (527) T protein:vir:10 388 ILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKRFNQLLQLWEAGLIPAKKLTE 467 (527) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHH Confidence 999999999999988876544 33333221 2222223445789999999999999999999888 58999999988 Q ss_pred hC---CCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCcccc Q lcl|NC_018086. 433 QF---PWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTI 489 (511) Q Consensus 433 ~l---~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (511) +| ++++|+++|+++|.+++..++....+.......++++..-..++..++.++.-+- T Consensus 468 ~L~~~~g~eD~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 468 ELSKIMGFELTEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALNGQPL 527 (527) T ss_pred HHHhccCCCChHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccCCCCC Confidence 77 7799999999999988776665544333333323222222222222222222221 No 70 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=100.00 E-value=8.6e-49 Score=284.08 Aligned_cols=476 Identities=13% Similarity=0.101 Sum_probs=312.4 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHH-HHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMH-SRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHN 79 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n 79 (511) |+. ++ -+|-.+ ..+ --.+.+| -..+..+ ++|+..|+.|.+||.+.+..+..........-..++.++ T Consensus 1 ~~~--~~---~~~~~~-~~~-~~g~~~~-----p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~p 68 (527) T protein:vir:10 1 MGQ--DK---RQYGST-QQL-RAGEANF-----PNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVP 68 (527) T ss_pred CCc--cc---cccCCC-cCc-CCccccC-----cccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeeh Confidence 332 21 111111 111 0011111 1113333 567899999999999987654432222222223457788 Q ss_pred hHHHHHHHHHhhhhccCceec---CchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC----CCceEEEEE Q lcl|NC_018086. 80 FPKLLVDTSTAYLAGEPITES---GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR----NKKHRFKAV 152 (511) Q Consensus 80 ~~k~ivd~~~~~l~g~~~~~~---~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~----~g~~~i~~~ 152 (511) -.+++|+....|+ +.|..+. .+++..+.+..+++.|++..++.+..+++.+.|.+.+++-+|+ .++++++.+ T Consensus 69 s~~~~~~~~~~~~-~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~ 147 (527) T protein:vir:10 69 NGEKLIEAKMRFL-GQGLKWEFSKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEV 147 (527) T ss_pred hhHHhhCCcceee-ccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeec Confidence 8888888776654 4554432 3455667788889999999999999999999998766555543 247999999 Q ss_pred cccceEEEecCCCCCceEEE--EEEEEEeecCCcceE-EEE-----EEE-----cCCcEEEEE---EccCcccc-c-cc- Q lcl|NC_018086. 153 SPMNCLIAYSADLDEEPVAA--IYYNTVISDITGHQI-RTY-----EVY-----TEDLIYKFS---TDDEREVY-R-EI- 213 (511) Q Consensus 153 ~p~~~~~v~d~~~~~~~~~~--v~~~~~~~~~~~~~~-~~~-----~~~-----~~~~i~~~~---~~~~~~~~-~-~~- 213 (511) ||...|++.|+...+.+... +.-|...++.....+ -++ ++- +...-+.|. +.-+.|.- . .+ T Consensus 148 DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~ 227 (527) T protein:vir:10 148 DPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPL 227 (527) T ss_pred CcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeecccccccccccc Confidence 99999999887665544443 222332222211111 000 000 001111110 11111110 0 00 Q ss_pred -----ccccccccccceeccCCccceEeecCC-----cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCC Q lcl|NC_018086. 214 -----PEELEIKDYEVHPNLLQKFPVLEIIAN-----EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDL 283 (511) Q Consensus 214 -----~~~~~~~~~~~~~~~~g~iPvv~~~n~-----~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~ 283 (511) .......+.+..++++++||||+|+|- .+|+|+++++++++|++|+++|+.+.++++...|+.+++|... T Consensus 228 ~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~ 307 (527) T protein:vir:10 228 EPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPP 307 (527) T ss_pred chhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeeccccc Confidence 001233456788999999999999763 4799999999999999999999999999999999999999865 Q ss_pred Cccchh--hhhhhhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc---cCccHHHHHHHHHH Q lcl|NC_018086. 284 SADSDS--ISNMKNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF---TAASGQALKAATQP 358 (511) Q Consensus 284 ~~~~~~--~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~---~~~Sg~Ai~~~~~~ 358 (511) -+.... ...+..+.+|.+++++++..+......+.++.|++.|.+.|+..|++|...++.. .+.||.||+..+++ T Consensus 308 vd~~G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~P 387 (527) T protein:vir:10 308 RDSRGNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSA 387 (527) T ss_pred ccccCCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHH Confidence 432211 1234456788899999998877667889999999999999999999999999843 36799999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHh---cCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHH Q lcl|NC_018086. 359 LENKSAVKESKFRKVLAKRYE-LVCSYLEF---MNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIIN 432 (511) Q Consensus 359 l~~k~~~~~~~~~~~l~~~~~-li~~~~~~---~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~ 432 (511) |.+++.+++..++-..++..+ ++..++.. ...........+.|.|.+++|.|.++.++.++++ +|++|.+|++. T Consensus 388 Llar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGiiS~etAv~ 467 (527) T protein:vir:10 388 ILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKRFAQLLELWEAGLIPAKKLTE 467 (527) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHH Confidence 999999999999988876544 33333221 2222223445789999999999999999999887 58999999988 Q ss_pred hC---CCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCcccc Q lcl|NC_018086. 433 QF---PWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTI 489 (511) Q Consensus 433 ~l---~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (511) +| ++++|+++|+++|.+++..++....+.......++++..-..++..++.++.-+- T Consensus 468 ~L~~~~g~eD~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 468 ELSKIMGFELTEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALNGQPL 527 (527) T ss_pred HHHhccCCCchHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccCCCCC Confidence 77 7799999999999998877666544333333333222222222222222222221 No 71 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=100.00 E-value=8.7e-44 Score=256.66 Aligned_cols=450 Identities=11% Similarity=0.004 Sum_probs=277.8 Q ss_pred CccchhhcccccCchhhHhhhhccCC---CHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccc Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRNF---DLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHN 79 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n 79 (511) |+|....+..| +.+-.... +.+.+......+......|.+ ++|.+. ++.....++...+++++| T Consensus 1 ~~~~~~~~~~i-------~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~----~w~~~~~~~~~~~~~~~~ 67 (518) T protein:vir:78 1 MGVWSVMTRFI-------KGWLNGKPNGSEPELIPKYLPLVPDNQKEWSK--DSYLTS----LWAQGYVPTVHDKLMNSG 67 (518) T ss_pred CcchhhHHHHH-------HHhhcCCCCccchhccHHHhhhcccchhhhhh--hhhhhh----hcccCCCCccccccccCC Confidence 44544332221 11111111 112222222222222222221 222222 112222345556789999 Q ss_pred hHHHHHHHHHhhhhccCceecC-------chhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEE Q lcl|NC_018086. 80 FPKLLVDTSTAYLAGEPITESG-------DEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAV 152 (511) Q Consensus 80 ~~k~ivd~~~~~l~g~~~~~~~-------d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~ 152 (511) +++.|++..|+++|+++++++. ++...+.|++++++|+|...+.+++..++..|.+++.+|.+ +|++++.++ T Consensus 68 l~~~i~~~~A~ll~~e~~~i~v~~~~~~d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d-~~~~~i~~v 146 (518) T protein:vir:78 68 TGNEIVVVAAEYISGKPLSIDVTGVNGSKDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINIL-NGRPSISVH 146 (518) T ss_pred hHHHHHHHHHHhhcCCCceEEecCccccCcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEE-CCeeEEEEE Confidence 9999999999999999987742 34456789999999999999999999999999999988886 488999999 Q ss_pred cccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCC------------cEEEEE--EccCcccccc-cc--- Q lcl|NC_018086. 153 SPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTED------------LIYKFS--TDDEREVYRE-IP--- 214 (511) Q Consensus 153 ~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~------------~i~~~~--~~~~~~~~~~-~~--- 214 (511) +|..++|+|.+.. ...++..........+..+++++.|..+ .|.+.. ...+..+... .+ T Consensus 147 ~ad~~~P~~~~g~---~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~ 223 (518) T protein:vir:78 147 SSSQFWIDFKNNE---PFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPE 223 (518) T ss_pred cCCeeEEEeecCc---EEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCccccccccccccc Confidence 9999999997643 3333322222222223344556655432 222211 1111110000 00 Q ss_pred ------cccccccccceeccCCccceEee-cC----C-----cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEe Q lcl|NC_018086. 215 ------EELEIKDYEVHPNLLQKFPVLEI-IA----N-----EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWL 278 (511) Q Consensus 215 ------~~~~~~~~~~~~~~~g~iPvv~~-~n----~-----~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~ 278 (511) ......+....+++ ...|+++| +| + +.|.|+|+++++++|++|.++|++++.++....++.+. T Consensus 224 ~l~~~~~~~~~~e~~~~~tg-~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~ 302 (518) T protein:vir:78 224 QITSYLHTNDIQLNHSVSIG-LKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEKTKTKIAAS 302 (518) T ss_pred ccccccccccCccceeeccC-CccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHhCCceeeec Confidence 00011111112233 34566655 32 1 34999999999999999999999999999877777764 Q ss_pred ecCC-----CCcc---chhhhhhhhCceeeecCC--Cc----eeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccccc- Q lcl|NC_018086. 279 QGFD-----LSAD---SDSISNMKNDRVIVTDED--GM----VKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKD- 343 (511) Q Consensus 279 ~G~~-----~~~~---~~~~~~~~~~~~i~~~~~--~~----~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~- 343 (511) ..+- .... ..+..+...+..+....+ ++ ++.+++++..+.+...++.+.+.|...++.+...++. T Consensus 303 ~~~l~~~~~~~~~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~ 382 (518) T protein:vir:78 303 ERMFRKKVNKSTDKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLG 382 (518) T ss_pred hhHhccCCCCCCCccccccCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcc Confidence 3221 1100 011111122333433322 22 4556778888988888898888888888876555542 Q ss_pred ccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC----ccccccceeEEeCCCCCcCHHHHHHHHH Q lcl|NC_018086. 344 FTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKA----KDLKPYEVTPVFVRNLPQSYAELADMAV 419 (511) Q Consensus 344 ~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~----~~~~~~~i~i~f~~~~p~d~~e~a~~~~ 419 (511) .+..||+++++..+.+.+++..++..++.+|+++++.|+.++...... ......+++|.|++.++.|..+.++..+ T Consensus 383 ~~~~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~ 462 (518) T protein:vir:78 383 NREVKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLN 462 (518) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHH Confidence 246799999999999999999999999999999999999877653221 2234457999999999999999999888 Q ss_pred HH--hccCChHHHHHh-CCCCCC--HHHHHHHHHHHHHHHHHHHHhhccccccCCC Q lcl|NC_018086. 420 KL--RDMLPDETIINQ-FPWITD--ARQEVEKADAQRQKRADIALQNFKQTSAVQG 470 (511) Q Consensus 420 ~~--~g~~s~et~~~~-l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 470 (511) ++ +|++|.++++++ ++.++| +++|++||++|+........+.+..+....| T Consensus 463 ~~v~aGimS~e~~i~~~~~~~~deea~~e~~ri~~E~~~~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 463 NMNSALAMSVEEKVKLIHPKWEDEEIQAEVKRIYLENAIGEVPDPEAIGGMETKGG 518 (518) T ss_pred HHHhcCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHHhcccCCCCCccccCCCCCCC Confidence 75 589999999876 456665 4678899988854422110000000000000 No 72 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=100.00 E-value=1.7e-42 Score=249.57 Aligned_cols=459 Identities=10% Similarity=-0.037 Sum_probs=294.6 Q ss_pred CccchhhcccccCc-----hhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceec Q lcl|NC_018086. 3 IPNGQINAGDIITT-----NIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIV 77 (511) Q Consensus 3 ~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~ 77 (511) |.|.+-+++=+-.. ..+++.+.+. . -+.--.....++.++++||+|+++..+........+...+++ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~-~-------~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~s 72 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDH-E-------KINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMT 72 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcC-C-------ceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceee Confidence 55555444432110 0111111110 0 011112334566677899999998665443333344556788 Q ss_pred cchHHHHHHHHHhhhhccCceecCch------------hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCC Q lcl|NC_018086. 78 HNFPKLLVDTSTAYLAGEPITESGDE------------KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNK 145 (511) Q Consensus 78 ~n~~k~ivd~~~~~l~g~~~~~~~d~------------~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g 145 (511) +|+++.|+..+++++|+++++++.++ ...+.|.++++.|+|...+.+++..++..|.+++.+|++. | T Consensus 73 l~~~~~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~-~ 151 (517) T protein:vir:98 73 LNLRKLSADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVDN-G 151 (517) T ss_pred cCcHHHHHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEeC-C Confidence 99999999999999999998886443 2456789999999999999999999999999999999985 6 Q ss_pred ceEEEEEcccceEEEecCCCCCceEEEEEEE-EEeecCCcceEEEEEEEcCCcE--------EE---EEEccCc---ccc Q lcl|NC_018086. 146 KHRFKAVSPMNCLIAYSADLDEEPVAAIYYN-TVISDITGHQIRTYEVYTEDLI--------YK---FSTDDER---EVY 210 (511) Q Consensus 146 ~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~-~~~~~~~~~~~~~~~~~~~~~i--------~~---~~~~~~~---~~~ 210 (511) +++|.+++|..++|+-.+.. +...+++.++ ....+..+..+++++.|..+.+ ++ |.+.... ... T Consensus 152 ~~~I~~v~ad~~~Pl~~~~~-~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v 230 (517) T protein:vir:98 152 EIEFSWALANAFYPLRSNSN-GISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRI 230 (517) T ss_pred eeEEEEEcCCeeEEEEecCC-CeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCccccccc Confidence 79999999999999644443 3344444332 2233333445667888876542 11 2211111 111 Q ss_pred cccccccccccccceeccCCccceEeecCC---------cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecC Q lcl|NC_018086. 211 REIPEELEIKDYEVHPNLLQKFPVLEIIAN---------EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGF 281 (511) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~---------~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~ 281 (511) ........ ........++.+.++++|+++ +.|+|+|+++++++|++|..+|+++..++....++.+-..+ T Consensus 231 ~L~~~~e~-l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~~i~vp~~~ 309 (517) T protein:vir:98 231 PLEELYEG-MQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQRTVFVSDVM 309 (517) T ss_pred cccccccC-CCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCcceecChhh Confidence 00111111 112334466777667777652 46999999999999999999999999999988877763332 Q ss_pred C---CCccchhh---hhh--hhCceeeecC-CCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccccc--ccCccHH Q lcl|NC_018086. 282 D---LSADSDSI---SNM--KNDRVIVTDE-DGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKD--FTAASGQ 350 (511) Q Consensus 282 ~---~~~~~~~~---~~~--~~~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~~~~Sg~ 350 (511) - .+...... .+. ..+..+..+. +..++..++++..+.+...++.+.+.|...++.+...++. .+..||+ T Consensus 310 l~~~~~~~g~~~~~~~d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~kTAT 389 (517) T protein:vir:98 310 LRTVPDESGMPPPQVFDPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMKTAT 389 (517) T ss_pred hccccCCCCcccCCCCCcccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccccccHH Confidence 1 11111000 011 1122333322 2234555567778889999999999999999887655543 3456899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCC Q lcl|NC_018086. 351 ALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFM--NKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLP 426 (511) Q Consensus 351 Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~--~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s 426 (511) ++++..+.+.++++++++.|+.+|++++++|+.+.... .........+++|.|.+.++.|..+.++...++ +|++| T Consensus 390 Ei~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG~ms 469 (517) T protein:vir:98 390 EIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRFYGQAKTFGFIP 469 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHHHHHHHhcCCCC Confidence 99999999999999999999999999999988765432 122223445799999999999999999988876 58999 Q ss_pred hHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCcccc Q lcl|NC_018086. 427 DETIINQFPWITD--ARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAAN 477 (511) Q Consensus 427 ~et~~~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (511) .++++.++.++++ +++|++||++|..... ++.......+..+.+.+ T Consensus 470 ~~~~i~~~~g~~eeeA~~e~~~i~~E~~~~~-----~~~~~~~~~~~~~gd~e 517 (517) T protein:vir:98 470 TVEAIQRIFKVPKKTAEQWLEEIRKDQIELD-----PVTISQRAQKRMFGDEE 517 (517) T ss_pred HHHHHHHhCCCChHHHHHHHHHHHHhccccC-----CCCccccccCCCCCCCC Confidence 9999888755554 3556777776653211 11110100000000000 No 73 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=100.00 E-value=2.6e-42 Score=248.54 Aligned_cols=486 Identities=13% Similarity=0.119 Sum_probs=294.3 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccch Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNF 80 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~ 80 (511) |+ -+. -++... ...|.+..-+. + -...++|+.+|+.|.+||.|++..+..-..... ..-+..+. T Consensus 1 m~--~~~---~q~~p~---~~~fp~~~a~w---V--~~~D~~RlaaY~ly~d~y~n~~~el~~il~G~d---r~~~~~ps 64 (563) T protein:vir:74 1 MP--YNH---KQYDPA---KPFLRGGDDNI---V--DENDKNRVRAYDLYENIYLNSAETLKLVLRGDD---SVPILMPS 64 (563) T ss_pred CC--ccc---cccCCC---ccccccccccc---C--CHHHHHHHHHHHHHHHhhcCchhhhhhhcCCCc---eeeeccch Confidence 32 221 112211 12222222221 1 123356889999999999999865432222211 12344567 Q ss_pred HHHHHHHHHhhhhccCceecC-----chh----hHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC----CCce Q lcl|NC_018086. 81 PKLLVDTSTAYLAGEPITESG-----DEK----TIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR----NKKH 147 (511) Q Consensus 81 ~k~ivd~~~~~l~g~~~~~~~-----d~~----~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~----~g~~ 147 (511) .+++|++.+.| +|.|+.++. |+. ....|.++++++++..++.++.++|.+.|.+.+++-+|+ .+++ T Consensus 65 ~r~~V~~~~~~-Lg~~~~~~Ve~~~~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~R~ 143 (563) T protein:vir:74 65 GRKIVEAVHRF-LGVGFDYLVEPDMGDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGERI 143 (563) T ss_pred HHHHHHHHHHh-cCCCcEEecCccccCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccccCCCc Confidence 88999996655 599999842 222 234567888999999999999999999998766555443 3589 Q ss_pred EEEEEcccceEEEecCCCCCce--EEEEEEEEEeecCCcceEEEEE--EE---cCCcEE-EEEEccCccccc-------- Q lcl|NC_018086. 148 RFKAVSPMNCLIAYSADLDEEP--VAAIYYNTVISDITGHQIRTYE--VY---TEDLIY-KFSTDDEREVYR-------- 211 (511) Q Consensus 148 ~i~~~~p~~~~~v~d~~~~~~~--~~~v~~~~~~~~~~~~~~~~~~--~~---~~~~i~-~~~~~~~~~~~~-------- 211 (511) ++..+||...|++-|++..... +..+.-|...++..... -++. .| .+.... .+.+..+.|.+. T Consensus 144 rv~~vDP~~~fp~~dpd~v~g~~~v~v~~~~~~pdd~~~~~-~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r~~~ 222 (563) T protein:vir:74 144 SVDEVDPRQIFLIEDGSTVVGFHMVDIVQDFRSPDDPSKKL-ARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDDRGAI 222 (563) T ss_pred eEeecCCceeeeccCCCCcccceeeecccCCCCCcchhccc-eeeeeeeeeeCCCCCccceeeeccchhccccccccCcc Confidence 9999999999985444332111 11111222222211111 1111 01 011111 111111112110 Q ss_pred ---------ccccccccccccceeccCCccceEeecCC-----cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeE Q lcl|NC_018086. 212 ---------EIPEELEIKDYEVHPNLLQKFPVLEIIAN-----EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLW 277 (511) Q Consensus 212 ---------~~~~~~~~~~~~~~~~~~g~iPvv~~~n~-----~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~ 277 (511) ..-......+.+..|+++++||+|.|+|- .+|+|++++++++++++|+++|+.+.++.+...|+.+ T Consensus 223 ~~~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~v 302 (563) T protein:vir:74 223 SDEQARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYV 302 (563) T ss_pred chhhhcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEE Confidence 00001112244566999999999998763 4799999999999999999999999999999999999 Q ss_pred eecCCCCc---cchhhhhhhhCceeeecCCCce---eeeecCCCHHHHHHHHHHHHH-HHHHHhCccccccccc--c-Cc Q lcl|NC_018086. 278 LQGFDLSA---DSDSISNMKNDRVIVTDEDGMV---KFITKDVNDKHIENIKNRAKL-DIFSLSQTPDLVSKDF--T-AA 347 (511) Q Consensus 278 ~~G~~~~~---~~~~~~~~~~~~~i~~~~~~~~---~~~~~~~~~~~~~~~~~~l~~-~i~~~s~~p~~~~~~~--~-~~ 347 (511) +.|....+ .+-..+++..+.++.++++..+ ..+....+.+.++.|++.|.. .|+.+|++|...++.. + .. T Consensus 303 l~~~~p~d~~~g~~~~w~vgpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD~~~~~ 382 (563) T protein:vir:74 303 TNASAPVDPNTGELTDWNIGPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVDVTSAE 382 (563) T ss_pred eccccccccccccccccccCCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeeccccccccc Confidence 99764332 1112244566778888877554 344444566889999998876 8899999999999843 3 57 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHh---cC------CCcccc-ccceeEEeCCCCCcCHHH Q lcl|NC_018086. 348 SGQALKAATQPLENKSAVKESKFRKVLAK----RYELVCSYLEF---MN------KAKDLK-PYEVTPVFVRNLPQSYAE 413 (511) Q Consensus 348 Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~----~~~li~~~~~~---~~------~~~~~~-~~~i~i~f~~~~p~d~~e 413 (511) ||.||+..+.+|.+++++|+..+..++++ .+++++.+... .+ +..++. ...++|.|.+.+|.|.++ T Consensus 383 SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~p~~P~d~~~ 462 (563) T protein:vir:74 383 SGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFADPMPVNKTQ 462 (563) T ss_pred chhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeCCCCCccHHH Confidence 99999999999999999999977777776 34343322221 11 112222 234789999999999999 Q ss_pred HHHHHHHH--hccCChHHHHHhC---CCC-CCHHHHHHHHHHHHHHHHHHH--H--hhccccccCCCCCCccccccCCCC Q lcl|NC_018086. 414 LADMAVKL--RDMLPDETIINQF---PWI-TDARQEVEKADAQRQKRADIA--L--QNFKQTSAVQGASTAAANKLDKNP 483 (511) Q Consensus 414 ~a~~~~~~--~g~~s~et~~~~l---~~v-~d~~~E~~ri~~E~~~~~~~~--~--~~~~~~~~~~~~~~~~~~~~~~~~ 483 (511) .++.++.+ +|++|++|++.+| +|. +|++.|+++|+.++-..+..+ . ..+.-+....++.++ ++.+++ T Consensus 463 vv~~~~tl~~aGiiSretAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~~a~~~~g~~~--~~~dd~- 539 (563) T protein:vir:74 463 VTQDTLLLQQAHLILRKMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADASLGLSAMDNGGAGE--QQFDDQ- 539 (563) T ss_pred HHHHHHHHHHcCchhHHHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccceecccCCCCc--cccccc- Confidence 99988777 5899999998887 664 477888888877655443222 1 111111111111111 111111 Q ss_pred CCccccccCCCCccccccccCCCCC Q lcl|NC_018086. 484 ANTSTITTTDPVAAKEQEKAIQKKP 508 (511) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (511) |+.-. +..+|+---+.-++++-.| T Consensus 540 g~p~~-~~~~~~~~~~~~~~~~~~~ 563 (563) T protein:vir:74 540 GNPID-QFGNPVEIPPDVTQVPLSP 563 (563) T ss_pred CCchh-HcCCcccCCccccccCCCC Confidence 11111 1122333333444444444 No 74 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=100.00 E-value=9.1e-32 Score=190.76 Aligned_cols=459 Identities=11% Similarity=0.060 Sum_probs=284.5 Q ss_pred ccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc-----CCcCccccccc-----e-eccchHHHHHHHHHhhhh Q lcl|NC_018086. 25 RRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQS-----RTFDDTNKPNS-----K-IVHNFPKLLVDTSTAYLA 93 (511) Q Consensus 25 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~-----~~~~~~~~~~~-----r-i~~n~~k~ivd~~~~~l~ 93 (511) ..+-+++.+...+..|....++|+++++.|.|...+... ++........+ | +-.|+++.+++.+++++| T Consensus 1 m~~~~~~~v~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vf 80 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLDTLSGKPF 80 (513) T ss_pred CCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhhhhh Confidence 222222233445667788889999999999997544321 11111111111 1 337999999999999999 Q ss_pred ccCceecCc--hhhHHHH-HHH-HhccChhHHHHHHHHHHhhCCeEEEEeeeCCCC------------------ceEEEE Q lcl|NC_018086. 94 GEPITESGD--EKTIKAM-QPV-FKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNK------------------KHRFKA 151 (511) Q Consensus 94 g~~~~~~~d--~~~~~~l-~~~-~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g------------------~~~i~~ 151 (511) .+|+++..+ ....+.+ .++ .+.++++.+++.+.+.++.+|+++++|..+..+ +|.+.. T Consensus 81 ~k~p~~~~~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~rPy~~~ 160 (513) T protein:vir:97 81 SEPIKLNEDVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDRREGLRPYWVM 160 (513) T ss_pred hcCcccCcCchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHHhhccCceEEE Confidence 999987532 2222222 232 345789999999999999999999999765422 488999 Q ss_pred EcccceEEEecCC-CCCceEEEEEEEEEeecCC---cceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceec Q lcl|NC_018086. 152 VSPMNCLIAYSAD-LDEEPVAAIYYNTVISDIT---GHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPN 227 (511) Q Consensus 152 ~~p~~~~~v~d~~-~~~~~~~~v~~~~~~~~~~---~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (511) ++|.+++----.. .....+..|++.....+.+ .+.+..+.+++++.+..|+....+.. ....|.......| T Consensus 161 ~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~~-----~~~e~~~~~~g~~ 235 (513) T protein:vir:97 161 IKPECLLFARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSNA-----QKEEWALADEWAT 235 (513) T ss_pred ecHhhhcCcceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCCc-----cccceEEecCCCC Confidence 9999875432111 2223344444433322222 34555667888887766654333221 1234566667788 Q ss_pred cCCccceEeecCCc----ccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecC Q lcl|NC_018086. 228 LLQKFPVLEIIANE----ERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDE 303 (511) Q Consensus 228 ~~g~iPvv~~~n~~----~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~ 303 (511) +++.||||++.... .+.+.+.++..|..++.+..|++..++.+.++|++++.|.+.+... . ..+....++.+|+ T Consensus 236 ~l~~IP~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~~~~~~~-~-i~iG~~~~~~lpe 313 (513) T protein:vir:97 236 GLNYVPLVTFYADRQGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACSGASGEDSD-P-VVVGPNKVLYNPD 313 (513) T ss_pred cCCceeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeeecCCcCCCC-c-eEeeccccccCCC Confidence 99999999987532 3567789999999999999999999999999999999997654321 1 1234456788886 Q ss_pred -CCceeeeecCCCH-HHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 304 -DGMVKFITKDVND-KHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELV 381 (511) Q Consensus 304 -~~~~~~~~~~~~~-~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li 381 (511) +++++|++.+.+. +..+..++.+++.|......+ .....++.||++.+...+...+....+...+..++.++++++ T Consensus 314 ~~~~~~yie~~g~~i~~~~~~l~~le~qm~~~Ga~l--l~~~~~~~Ta~a~~~~~~~~~S~L~~~a~~le~al~~~l~~~ 391 (513) T protein:vir:97 314 PAGRFYYVEHTGQAIAAGRTDLKDLEEQMAGYGAEF--LKRKTGGQTATARALDSAEATSDLSAMTGLFEDALAQALDIT 391 (513) T ss_pred CCCcceeeccCchhHHHHHHHHHHHHHHHHHHHHHh--hccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7889999988654 667888999999987776543 223446799999999999999999999999999999999999 Q ss_pred HHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC---CCCC---CHHHHHHHHHHHHHH Q lcl|NC_018086. 382 CSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQF---PWIT---DARQEVEKADAQRQK 453 (511) Q Consensus 382 ~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l---~~v~---d~~~E~~ri~~E~~~ 453 (511) +.+++...... ..++.-.|..... ..+.++++.++ .|.+|.+|.+..| +.+. |.++++++++++-+. T Consensus 392 a~wlg~~~~~~---~v~in~dF~~~~~--~~~~~~al~~a~~~G~is~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~ 466 (513) T protein:vir:97 392 ADWLRLGPNGG---TVELVKDYDLEEM--DAPGLQALQVAREKRDISRKTYLNGLRLRGVLPEDFDEDEDWEELMEEISE 466 (513) T ss_pred HHHhCCCCCcc---EEEeccccCcccC--CHHHHHHHHHHHhCCCCCHHHHHHHHHhccCCCccCCHHHHHHHHHHhhhh Confidence 99986422111 1122223322211 24566666665 5889999987765 2321 345555555554332 Q ss_pred HHHHH---HhhccccccCCCCCCccccccCCCCCCccccccCCCCcccccccc Q lcl|NC_018086. 454 RADIA---LQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKA 503 (511) Q Consensus 454 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (511) ..-.. ....++..+..+.... +....++..+ .--.+|+++.+.. T Consensus 467 ~~~~~~~d~~~~~~~~~~~~~~~~---~~~~~~~~~~---~~~~~~~~~~~~~ 513 (513) T protein:vir:97 467 AMGRAGLDLDPAQKNPPEGGEGEG---EGEGEGGEGG---EGGEGGGNPGGES 513 (513) T ss_pred ccCCCCccccccCCCCCCCCCCCC---CCCCCCCCCC---CccccCCCCCCCC Confidence 22111 1111111111101000 1111111111 1112222222211 No 75 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=100.00 E-value=1.4e-31 Score=189.71 Aligned_cols=423 Identities=9% Similarity=0.006 Sum_probs=271.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccC---CcCccc--cccc--e----eccchHHHHHHHHHhhhhccC Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSR---TFDDTN--KPNS--K----IVHNFPKLLVDTSTAYLAGEP 96 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~---~~~~~~--~~~~--r----i~~n~~k~ivd~~~~~l~g~~ 96 (511) |+ +...+..|....++|+++++.|.|...+...+ .++.+. ...+ | +-.|+++.+++.+++++|.+| T Consensus 1 m~---V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~vf~k~ 77 (452) T protein:vir:94 1 MP---IETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMVLDQP 77 (452) T ss_pred CC---CCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchhhcCC Confidence 22 22345677888899999999999976543221 111111 1111 2 237999999999999999999 Q ss_pred ceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCC-ceEEEEEcccceEEEecCCCCCceE-EEEE Q lcl|NC_018086. 97 ITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNK-KHRFKAVSPMNCLIAYSADLDEEPV-AAIY 174 (511) Q Consensus 97 ~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g-~~~i~~~~p~~~~~v~d~~~~~~~~-~~v~ 174 (511) +++...+.. ..+..-...++++.+...+.+.++.+|+++++|..+..| +|.+..++|.+++- |.-...+.+. ..+| T Consensus 78 p~~~~p~~l-~~~~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii~-W~~~~~g~l~~v~lr 155 (452) T protein:vir:94 78 PVITHPDAM-SKYFEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYTTENILN-WEEDEDGRLLMVVLR 155 (452) T ss_pred ceecccHHH-HHHHhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEechhhhcC-ccccccCCeeEEEEE Confidence 998665443 233223567899999999999999999999999887654 79999999999864 4422223232 2333 Q ss_pred EEEEeecC-Cc---ceEEEEEEEc--CCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCc----ccC Q lcl|NC_018086. 175 YNTVISDI-TG---HQIRTYEVYT--EDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANE----ERL 244 (511) Q Consensus 175 ~~~~~~~~-~~---~~~~~~~~~~--~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~----~g~ 244 (511) ......+. ++ +.+..+.+++ ++.+..+++.......+. ...+..+....|+++.||+|.+.... .+. T Consensus 156 e~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~---~~~~~~~~~~~~~l~~IP~v~~~~~~~~~~~~~ 232 (452) T protein:vir:94 156 EFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWE---LAKTSTIQNVGVTMDYIPFFCITPSGLSMTPAK 232 (452) T ss_pred EEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceee---eccceeecCCCcccceeEEEEEcCCCCCCCCCc Confidence 32222221 11 2333344443 544333222211111111 11223445567899999999887543 356 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecC-CCceeeeecCCCH-HHHHHH Q lcl|NC_018086. 245 GDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDE-DGMVKFITKDVND-KHIENI 322 (511) Q Consensus 245 s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~-~~~~~~ 322 (511) +.+.++..+.-++.+..|++..++...++|++++.|.+... ...+....+|.+++ +++++|++.+.+. +..+.. T Consensus 233 pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~~~~----~i~iG~~~~~~lpe~~~~~~yie~~g~~i~~~~~~ 308 (452) T protein:vir:94 233 PPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWITGAESQS----TMHIGSTKAWVIPEVAAKVGFLEFTGQGLQSLEKA 308 (452) T ss_pred cchHHHHHHHHHHhcchhHHHHHHHHcccceeEeecCcCCC----ceEecccccccCCCCCCcceEEccCchhHHHHHHH Confidence 77899999999999999999999999999999999975322 22345567889996 8899999987654 677888 Q ss_pred HHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEE Q lcl|NC_018086. 323 KNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPV 402 (511) Q Consensus 323 ~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~ 402 (511) ++.+++.|.....-. +.....++.|++|.......-.+....+...+..++.+++++++.+++... ...|++. T Consensus 309 l~~le~~m~~~Ga~l-l~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l~~~a~w~g~~~------~~~v~~n 381 (452) T protein:vir:94 309 LSEKQAQLASLSARL-IDNSTRGSEATETVKLRYMSETASLKSVTRAVEALLNKAYSCIMDMESMGG------TLNIKLN 381 (452) T ss_pred HHHHHHHHHHHHHHh-hccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCC------ceEEEec Confidence 999999887766422 223334567888877766666677777777778888999999888876421 1123222 Q ss_pred eCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC--CCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccc Q lcl|NC_018086. 403 FVRNLPQSYAELADMAVKL--RDMLPDETIINQF--PWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANK 478 (511) Q Consensus 403 f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l--~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (511) -.-..+.-..+.++++.++ .|.+|.+|++..| +.+.|++.|.+++..|.++.... . T Consensus 382 ~dF~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~~e~~~i~~E~~~~~~~-----------------~--- 441 (452) T protein:vir:94 382 SAFLDSKLTAAELKAWVEAYLSGGISKEIYIHALKVGKVLPPPGESMGVIPDPPAPEPS-----------------P--- 441 (452) T ss_pred cccccccCCHHHHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCccCHHHHHHHhhccCcc-----------------c--- Confidence 1112222234566666665 5899999998877 45667888888888774442100 0 Q ss_pred cCCCCCCccccc Q lcl|NC_018086. 479 LDKNPANTSTIT 490 (511) Q Consensus 479 ~~~~~~~~~~~~ 490 (511) ...|++.+... T Consensus 442 -~~~~~~~~~~~ 452 (452) T protein:vir:94 442 -SNTPPNPSSKA 452 (452) T ss_pred -CCCCCCCccCC Confidence 00111111100 No 76 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=99.95 E-value=4.6e-28 Score=170.46 Aligned_cols=438 Identities=10% Similarity=0.036 Sum_probs=262.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc-----cCCcCc-----cccccc-----e-eccchHHHHHHHHHhh Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQ-----SRTFDD-----TNKPNS-----K-IVHNFPKLLVDTSTAY 91 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~-----~~~~~~-----~~~~~~-----r-i~~n~~k~ivd~~~~~ 91 (511) |- .+...+..|....++|++.++.+.|...+.. .++... ..+..+ | +-.|+++.+++.++++ T Consensus 1 m~--~V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G~ 78 (501) T protein:vir:95 1 MP--NVSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVGQ 78 (501) T ss_pred CC--CCCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhhh Confidence 10 1223345678888999999999999865432 111110 001111 1 2379999999999999 Q ss_pred hhccCceecCchhhHHHHHHH-HhccChhHHHHHHHHHHhhCCeEEEEeeeCCCC---------------ceEEEEEccc Q lcl|NC_018086. 92 LAGEPITESGDEKTIKAMQPV-FKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNK---------------KHRFKAVSPM 155 (511) Q Consensus 92 l~g~~~~~~~d~~~~~~l~~~-~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g---------------~~~i~~~~p~ 155 (511) +|.+++++..+......+.++ ...++++.+++.+++.++.+|+++++|..+..+ +|.+..++|. T Consensus 79 vf~k~p~~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~~rPy~~~~~~~ 158 (501) T protein:vir:95 79 VFMRDPVVKVPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGRIRPTLYVYSPT 158 (501) T ss_pred hhcCCcceeCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhccCCcEEEEecHh Confidence 999999885333322222222 344689999999999999999999999765321 4889999999 Q ss_pred ceEEEecCCC-CCceEEEEEEEEEe-ecCC---cceEEEEEEEcC--CcEE--E-EEEccCccc------cccccccccc Q lcl|NC_018086. 156 NCLIAYSADL-DEEPVAAIYYNTVI-SDIT---GHQIRTYEVYTE--DLIY--K-FSTDDEREV------YREIPEELEI 219 (511) Q Consensus 156 ~~~~v~d~~~-~~~~~~~v~~~~~~-~~~~---~~~~~~~~~~~~--~~i~--~-~~~~~~~~~------~~~~~~~~~~ 219 (511) +++---...+ ....+..|++.... ..++ .+.+..+.+.+. +..+ + |+....++. .........+ T Consensus 159 ~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~~~~~~~~~~~~~~ 238 (501) T protein:vir:95 159 EIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADGSKIPKGNYQQYVVY 238 (501) T ss_pred hhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccCcceecCCccccccee Confidence 8754221222 22233333333222 2221 133333333332 2222 2 221111110 0111112233 Q ss_pred ccccceeccCCccceEeecCCc----ccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccc---hhhhh Q lcl|NC_018086. 220 KDYEVHPNLLQKFPVLEIIANE----ERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADS---DSISN 292 (511) Q Consensus 220 ~~~~~~~~~~g~iPvv~~~n~~----~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~---~~~~~ 292 (511) .......|+++.||+|.+.... .+.+.+.++..+.-++.+..|++...+.+.++|++|++|.+.+... +.... T Consensus 239 ~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~~~l~~~~~P~l~i~G~~~~~~~~~~~~~i~ 318 (501) T protein:vir:95 239 KPTDAQGKRLTEIPFMFIGSENNDSNPDNPNFYDLASLNMAHYRNSADYEESCYIVGQPTPVLIGLTEEWVTNVLKGSVN 318 (501) T ss_pred eeeccCCCcCCeeeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhHHHHHHHHcccceeeeeCCcccccccCCCCcee Confidence 4444556899999999875432 2456788888888899899999999999999999999998654322 11223 Q ss_pred hhhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 293 MKNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRK 372 (511) Q Consensus 293 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~ 372 (511) +....++.+|++++++|++.+.+.- .+..++.+.+.|...... +...+.++.||++.+.......+........+.. T Consensus 319 ~G~~~~~~lP~~~~~~~ie~~~~~i-~~~~l~~l~~~m~~~Ga~--ll~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~ 395 (501) T protein:vir:95 319 FGSRGGIPLPVGADAKLLQASENTM-LKEAMDTKERQMVALGAK--LVEQKEVQRTATEAELEAASEGSTLSSATKNVSA 395 (501) T ss_pred ecccccccCCCCCceeEEecChhhH-HHHHHHHHHHHHHHHHHh--hccCCccchhHHHHHHHHHHHhHHHHHHHHHHHH Confidence 3445678899999999998765443 366688888888777533 3345556789999999999889999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCccccccceeEEeCCCC-CcC-HHHHHHHHHHH--hccCChHHHHHhC---CCCC-CHHHHH Q lcl|NC_018086. 373 VLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNL-PQS-YAELADMAVKL--RDMLPDETIINQF---PWIT-DARQEV 444 (511) Q Consensus 373 ~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~-p~d-~~e~a~~~~~~--~g~~s~et~~~~l---~~v~-d~~~E~ 444 (511) ++.+++++++.+++.... .++|..++.. ... ..+.++++.++ .|.+|.+|++..| +.++ +.+.|. T Consensus 396 al~~~l~~~a~w~g~~~~-------~~~v~i~~df~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~~v~~~~~~~e~ 468 (501) T protein:vir:95 396 AFEWALKWAARWVGQADS-------GVKFELNTDFDIARMTPDERRSLVEEWQKGAITFEEMRTGLRKAGVATEDDSKAK 468 (501) T ss_pred HHHHHHHHHHHHcCCCCC-------ceEEEEecccccccCCHHHHHHHHHHHhCCCCcHHHHHHHHHhCCCCChhHHHHH Confidence 999999999998764321 1222222222 222 35556666665 5889999996655 4332 334555 Q ss_pred HHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCcccc Q lcl|NC_018086. 445 EKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKE 499 (511) Q Consensus 445 ~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (511) ++|+.+....... ......+.+...++. ++..+ T Consensus 469 e~i~~~~~~~~~~------~~~~~~~~~~~gg~~----------------~~~~~ 501 (501) T protein:vir:95 469 EKIAKDTAEAMAL------ATPANVPGDGSGGDN----------------VGNSE 501 (501) T ss_pred HHHHhhhcCcccc------cccCCCCCCCccccc----------------ccCCC Confidence 5555442211100 000000000000000 11111 No 77 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=99.95 E-value=7.4e-27 Score=163.83 Aligned_cols=474 Identities=7% Similarity=-0.020 Sum_probs=268.3 Q ss_pred CCCcc---chhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccC---CcCcc----- Q lcl|NC_018086. 1 MAIPN---GQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSR---TFDDT----- 69 (511) Q Consensus 1 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~---~~~~~----- 69 (511) ||-.- ++--.....-++.--- -..-..+.-.+...+..|....++|++.++.|.|...+...+ .+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~ 79 (535) T protein:vir:80 1 MARKRTTIRRDVQSKVLIPPQAPP-TSGLGPSLPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRD 79 (535) T ss_pred CCcchhhhhhhhhhhcccCCCCcC-CCCCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCC Confidence 54432 2221222221111100 000111112244556678888999999999999975543221 11111 Q ss_pred --ccccc-----e-eccchHHHHHHHHHhhhhccCceecCchhhHHHHHHH-HhccChhHHHHHHHHHHhhCCeEEEEee Q lcl|NC_018086. 70 --NKPNS-----K-IVHNFPKLLVDTSTAYLAGEPITESGDEKTIKAMQPV-FKENYVTDVNSEEVKLSGIFGHCFEIHW 140 (511) Q Consensus 70 --~~~~~-----r-i~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~-~~~n~~~~~~~~~~~~a~~~G~~~~~v~ 140 (511) ....+ | +-.|+++.+|+.+++++|.+++++...+.....+.++ ...++++.+++.+++.++.+|+++++|. T Consensus 80 ~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD 159 (535) T protein:vir:80 80 EEQRRRYETYLQRAIFYNVTARTLDGMMGQVFSRDPIRQLPPALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTD 159 (535) T ss_pred cCCHHHHHHHHhhccCCChhHHHHHHHhchhhcCCcceeccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEe Confidence 11111 1 3379999999999999999998875433322222222 3346899999999999999999999997 Q ss_pred eCCCC-------------ceEEEEEcccceEEEecCCCC-CceEEEEEEEEE-eecC---CcceEEEEEEEcCC--cEEE Q lcl|NC_018086. 141 IDRNK-------------KHRFKAVSPMNCLIAYSADLD-EEPVAAIYYNTV-ISDI---TGHQIRTYEVYTED--LIYK 200 (511) Q Consensus 141 ~~~~g-------------~~~i~~~~p~~~~~v~d~~~~-~~~~~~v~~~~~-~~~~---~~~~~~~~~~~~~~--~i~~ 200 (511) .+..+ +|.+..++|.+++---.+.+. ...+..|++... ...+ +.+.+..+.++..+ ..++ T Consensus 160 ~P~~~~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~ 239 (535) T protein:vir:80 160 YPNVGRPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGNYQ 239 (535) T ss_pred ecCCCCcccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCceEE Confidence 76544 388999999987553222221 223333433222 2222 22444444455442 2222 Q ss_pred ---EEEccCcccccccccccccccccceeccCCccceEeecCC--c--ccCchhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_018086. 201 ---FSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIAN--E--ERLGDFEAQLSLIDAYNLAVSDSVNDIAYWND 273 (511) Q Consensus 201 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~--~--~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~ 273 (511) |.....+.... ....+..+....|+++.||||++... . .+.+.+.++..+.-++.+..|++.+++.+.++ T Consensus 240 v~~~~~~~~~~~~~---~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~ 316 (535) T protein:vir:80 240 VERWRRETQEEMYY---SYSKHVPTDGNGNPFKEIPFQFIGPLDNNADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQ 316 (535) T ss_pred EEEEEeecCCcccc---ccceeecccCCCcccCeeEEEEeecCCCCCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcC Confidence 22211111110 11122334456689999999988643 2 25567889999999999999999999999999 Q ss_pred ceeEeecCCCCccch----hhhhhhhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccH Q lcl|NC_018086. 274 AYLWLQGFDLSADSD----SISNMKNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASG 349 (511) Q Consensus 274 p~l~~~G~~~~~~~~----~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg 349 (511) |++|+.|.+.....+ ..-.+....+|.++++++++|+..+.+.-.. ..++.+.+.|......+ .....++.++ T Consensus 317 P~l~i~G~~~~~~~~~~~~~~i~iG~~~~~~lP~~~~~~~~e~~~~~~a~-~~l~~~e~qM~~lGa~l--l~~~~~~~Ta 393 (535) T protein:vir:80 317 PTAFFTGLTKDWVEDVFKDFKVHLGSRAIIPLPQGATAGILQITPNSVPF-EAMTHKESQMIAMGANL--LVKSGGNRTF 393 (535) T ss_pred ceeeeecCchhhhhcCCCCcceEecCcccccCCCCCCcceeeeccchhHH-HHHHHHHHHHHHHHHHh--hccCcccccH Confidence 999999986432111 1122445578889999999999877665444 45777777777765433 1233456666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeC-CCCCcC-HHHHHHHHHHH--hccC Q lcl|NC_018086. 350 QALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFV-RNLPQS-YAELADMAVKL--RDML 425 (511) Q Consensus 350 ~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~-~~~p~d-~~e~a~~~~~~--~g~~ 425 (511) .+.+...+...+........+..++.+++++++.+++.... ...+.|..+ +....+ ..+.++++.++ .|.+ T Consensus 394 ~~a~~~~~~~~S~L~~~a~~le~al~~aL~~~A~w~G~~~~-----~~~~~i~~n~dF~~~~ld~~~~~all~~~~~G~I 468 (535) T protein:vir:80 394 GEAQQEEASEQSILSACTKNVSMAFRKALRWANQFQTGIVN-----DETVEYNLNTDFPAARLTPNERAELILEWQQGAI 468 (535) T ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCccC-----CCceEEEeccccccccCCHHHHHHHHHHHhcCCC Confidence 66566666667778888888899999999999988764211 112222221 112222 24456666665 5889 Q ss_pred ChHHHHHhC---CCCC---CHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCcccc Q lcl|NC_018086. 426 PDETIINQF---PWIT---DARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKE 499 (511) Q Consensus 426 s~et~~~~l---~~v~---d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (511) |.+|++..| +.++ +.++|+.|++.|...... ..|...+.+..+.... ..+.+ .++++- T Consensus 469 s~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~-----------~~g~~~d~~~~g~~~~-~~~~~----~~~~~~ 532 (535) T protein:vir:80 469 TFKEMRAGLRRAGVASEDDAKAETEGKATVEFIAKTA-----------AAGKVGDAASGGTNKA-KLNNG----NGGGNQ 532 (535) T ss_pred CHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhccc-----------cCCCCCCCCCCCCCcC-cccCC----cccccc Confidence 999998766 3331 235566666665322111 1111111111111100 00011 111111 Q ss_pred ccc Q lcl|NC_018086. 500 QEK 502 (511) Q Consensus 500 ~~~ 502 (511) -++ T Consensus 533 ~~~ 535 (535) T protein:vir:80 533 AGN 535 (535) T ss_pred CCC Confidence 111 No 78 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=99.94 E-value=2.5e-26 Score=160.94 Aligned_cols=438 Identities=10% Similarity=-0.007 Sum_probs=260.4 Q ss_pred ccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc--CCcCcc---ccccc-----e-eccchH Q lcl|NC_018086. 13 IITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQS--RTFDDT---NKPNS-----K-IVHNFP 81 (511) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~--~~~~~~---~~~~~-----r-i~~n~~ 81 (511) ..+.+ -....+...+..|....++|++.++.|.|......+ ..+..+ ....+ | +-.|++ T Consensus 1 ~~~~~----------~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~ 70 (489) T protein:vir:78 1 MLTEN----------GQGSGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFT 70 (489) T ss_pred CccCC----------CccCCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCChH Confidence 11222 222234455667888899999999999996432111 111111 11111 1 237999 Q ss_pred HHHHHHHHhhhhccCceecCchhhHHHHHHH-HhccChhHHHHHHHHHHhhCCeEEEEeeeCCCC------------ceE Q lcl|NC_018086. 82 KLLVDTSTAYLAGEPITESGDEKTIKAMQPV-FKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNK------------KHR 148 (511) Q Consensus 82 k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~-~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g------------~~~ 148 (511) +.+++.+++++|.+++++...+.....+.++ ...++++.+.+.+.+.++.+|+++++|..+..+ +|. T Consensus 71 ~~tl~~l~G~vfrk~p~~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rPy 150 (489) T protein:vir:78 71 RRTLSGMVGSVMRKEPEINIPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLLNPT 150 (489) T ss_pred HHHHHHHhchhhcCCcceeccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCcE Confidence 9999999999999999885444332222333 344789999999999999999999999887654 588 Q ss_pred EEEEcccceEEEecCCC-CCceEEEEEEEEE--eec----CCcceEEEEEEEcCCcE-----EEEEEccCcccccccccc Q lcl|NC_018086. 149 FKAVSPMNCLIAYSADL-DEEPVAAIYYNTV--ISD----ITGHQIRTYEVYTEDLI-----YKFSTDDEREVYREIPEE 216 (511) Q Consensus 149 i~~~~p~~~~~v~d~~~-~~~~~~~v~~~~~--~~~----~~~~~~~~~~~~~~~~i-----~~~~~~~~~~~~~~~~~~ 216 (511) +..++|.+++----..+ ....+..|++... ..+ -..+.+..+.+++.+.. ..|+....+...... T Consensus 151 ~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~~~--- 227 (489) T protein:vir:78 151 IAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQEDV--- 227 (489) T ss_pred EEEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCccccee--- Confidence 99999999754321221 2223333333332 111 12245556667766522 122222222111000 Q ss_pred cccccccceeccCCccceEeecCCc----ccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchh--- Q lcl|NC_018086. 217 LEIKDYEVHPNLLQKFPVLEIIANE----ERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDS--- 289 (511) Q Consensus 217 ~~~~~~~~~~~~~g~iPvv~~~n~~----~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~--- 289 (511) .........|+++.||+|.+.... .+.+.+.++..|.-++.+..|++..++...++|++++.|.+....... T Consensus 228 -~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~~~~~~~~~ 306 (489) T protein:vir:78 228 -VEIYPDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGENLTPQAFKEA 306 (489) T ss_pred -eEEeccCCCCccCeeeEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCccCCccccccc Confidence 001123445789999999986432 245678899999999999999999999999999999999754322111 Q ss_pred ---hhhhhhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHh-CccccccccccCccHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 290 ---ISNMKNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLS-QTPDLVSKDFTAASGQALKAATQPLENKSAV 365 (511) Q Consensus 290 ---~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s-~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~ 365 (511) ...+.....+.++.+++++|++.+.+.. .+..++.++..|.... .+. ...++.|+++.+.....-.+.... T Consensus 307 ~~~~i~~g~~~~~~lp~~~~~~~ie~~~~~~-~r~~l~~le~qm~~lGa~l~----~~~~~~Ta~~~~~~~~~~~S~L~~ 381 (489) T protein:vir:78 307 NPNGIKFGSRRGHNLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQIGAQLI----TPTQQITAQSARIQRGADTSVMAT 381 (489) T ss_pred CccceeeCCcccccCCCCCCcceeccCcchH-HHHHHHHHHHHHHHHhhhhc----cCCcchhHHHHHHHHHHhhHHHHH Confidence 1122344577888999999998876544 3566777777777653 333 233578999988888888999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC--CCCCCHH Q lcl|NC_018086. 366 KESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQF--PWITDAR 441 (511) Q Consensus 366 ~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l--~~v~d~~ 441 (511) ....+..++.+++++++.+++..... .. ...+...|.. ..-..+.++++.++ .|.+|.+|.+..| +.+.|+. T Consensus 382 ~a~~~e~al~~~l~~~a~w~G~~~~~-~~-~i~~n~dF~~--~~~d~~~~~al~~~~~~G~is~~t~~~~L~~~gv~d~~ 457 (489) T protein:vir:78 382 IARNVSQAYTDALRWVAVMLGKPEDT-EV-EFRLNMDFFL--EPMTAQDRAAWMADINAGLLPATAYYAALRKAGVTDWT 457 (489) T ss_pred HHHHHHHHHHHHHHHHHHHcCCCCCC-ce-EEEeecccCc--ccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcc Confidence 99999999999999999997753211 00 1122333421 11124556666665 6899999998765 2343322 Q ss_pred HHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCC Q lcl|NC_018086. 442 QEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTD 493 (511) Q Consensus 442 ~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (511) .+.++.|-+. + +...+....+ +-|.+ +++..+ T Consensus 458 --~e~~~~ei~~------~------~~~~~~~~~g----~~~~~--~q~~~~ 489 (489) T protein:vir:78 458 --DADIKDAVAD------Q------PLPVATEVQG----EIPQS--AQQQEK 489 (489) T ss_pred --HHHHHHHHhh------c------CCCcccCCcc----cCCCC--cccccC Confidence 1222222111 0 0000000011 10000 000000 No 79 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=99.94 E-value=6.2e-26 Score=158.79 Aligned_cols=438 Identities=10% Similarity=-0.011 Sum_probs=260.4 Q ss_pred ccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc--CCcC---ccccccc-----e-eccchH Q lcl|NC_018086. 13 IITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQS--RTFD---DTNKPNS-----K-IVHNFP 81 (511) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~--~~~~---~~~~~~~-----r-i~~n~~ 81 (511) ..+.++. ...+...+..|....++|++.++.|.|......+ ..+. .+....+ | +-.|++ T Consensus 1 ~~~~~~~----------~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~ 70 (491) T protein:vir:95 1 MLTANGQ----------GSGVKTKHREWLHYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFT 70 (491) T ss_pred CcccCCc----------cCCCCccCHHHHHHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChH Confidence 1122222 2224445667888889999999999995421111 1111 1111111 1 337999 Q ss_pred HHHHHHHHhhhhccCceecCchhhHHHHHHH-HhccChhHHHHHHHHHHhhCCeEEEEeeeCCCC------------ceE Q lcl|NC_018086. 82 KLLVDTSTAYLAGEPITESGDEKTIKAMQPV-FKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNK------------KHR 148 (511) Q Consensus 82 k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~-~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g------------~~~ 148 (511) +.+++.+++++|.+++++...+.....+.++ ...++++.+.+.+.+.++.+|+++++|..+..+ +|. T Consensus 71 ~~tl~~l~G~vfrk~p~~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade~~~~~rPy 150 (491) T protein:vir:95 71 RRTLSGMVGSVMRKEPEINIPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQNAGLLNPT 150 (491) T ss_pred HHHHHHHhchhhcCCceeeccHHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHHHHhcCCcE Confidence 9999999999999999885443332222333 344789999999999999999999999876543 588 Q ss_pred EEEEcccceEEEecCC-CCCceEEEEEEEEEe--ec----CCcceEEEEEEEcC---CcE--EEEEEccCcccccccccc Q lcl|NC_018086. 149 FKAVSPMNCLIAYSAD-LDEEPVAAIYYNTVI--SD----ITGHQIRTYEVYTE---DLI--YKFSTDDEREVYREIPEE 216 (511) Q Consensus 149 i~~~~p~~~~~v~d~~-~~~~~~~~v~~~~~~--~~----~~~~~~~~~~~~~~---~~i--~~~~~~~~~~~~~~~~~~ 216 (511) +..++|.+++----.. .....+..|++.... .+ -..+.++.+.+++. +.+ ..|+....++.... T Consensus 151 ~~~~~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g~~~~~---- 226 (491) T protein:vir:95 151 IAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEGGAQEE---- 226 (491) T ss_pred EEEechhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEcCCCcceee---- Confidence 9999999975432111 122334334433321 11 12234444444433 321 12222222211110 Q ss_pred cccccccceeccCCccceEeecCC--c--ccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhh-- Q lcl|NC_018086. 217 LEIKDYEVHPNLLQKFPVLEIIAN--E--ERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSI-- 290 (511) Q Consensus 217 ~~~~~~~~~~~~~g~iPvv~~~n~--~--~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~-- 290 (511) ..+.......|+++.||+|.+... . .+.+-+.++..|.-++.+..|++..++...++|++++.|.+....+... T Consensus 227 ~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~~~~~~~~~ 306 (491) T protein:vir:95 227 VVEIYPDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGDNLTPQSFKEA 306 (491) T ss_pred eeeeeecCCCcccCeeEEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCcccCcchhhcc Confidence 111112334578999999998643 2 2456688999999999999999999999999999999997643221111 Q ss_pred ----hhhhhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHH-hCccccccccccCccHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 291 ----SNMKNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSL-SQTPDLVSKDFTAASGQALKAATQPLENKSAV 365 (511) Q Consensus 291 ----~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~-s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~ 365 (511) ..+.....+.+|.++++++++.+.+.. .+..++.++..|... +.+. ...++.||++.+.....-.+.... T Consensus 307 ~~~~i~~g~~~~~~lP~~~~~~~ie~~~~~~-~~~~l~~~e~qm~~~Ga~l~----~~~~~~Ta~~~~~~~~~~~S~L~~ 381 (491) T protein:vir:95 307 NPNGIKFGSRCGHNLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQIGAQLI----TPSQQITAESARIQRGADTSVMAT 381 (491) T ss_pred CcceeEecCcCCcCCCCCCccceeecCcchH-HHHHHHHHHHHHHHHHHHhc----cCCcchhHHHHHHHHHHhhHHHHH Confidence 112334467788899999998876554 355677776666655 3332 123578999999988888999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCC--CCCCH- Q lcl|NC_018086. 366 KESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFP--WITDA- 440 (511) Q Consensus 366 ~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~--~v~d~- 440 (511) ....+..++.+++++++.+++..... .. ...+...|.. ..-..+.++++.++ +|.+|.+|.+..|. .+.|+ T Consensus 382 ~a~~~e~al~~~l~~~a~w~G~~~~~-~v-~i~~n~dF~~--~~~~~~~~~all~~~~~G~is~~t~~~~L~~~~vl~~~ 457 (491) T protein:vir:95 382 IARNVSQAYTDALRWVAMMLGKPEDS-EV-EFQLNMDFFL--QPMTAQDRAAWMADINAGLLPATAYYAALRKAGVTDWT 457 (491) T ss_pred HHHHHHHHHHHHHHHHHHHcCCCCCC-ce-EEEeeccccc--ccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcc Confidence 99999999999999999997643211 00 1122333321 11124556666665 68999999987652 34333 Q ss_pred -HHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCC Q lcl|NC_018086. 441 -RQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDK 481 (511) Q Consensus 441 -~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (511) +++.++|++|. .+.+......|..++.++...+ T Consensus 458 ~e~~~~~ie~~~--------~~~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 458 DEDILNAIEDAP--------LPSGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred HHHHHHHHHhcC--------CCCCccccccccchhhhhhccC Confidence 33344443221 1111112222222222221111 No 80 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=99.92 E-value=6.1e-25 Score=153.35 Aligned_cols=422 Identities=9% Similarity=0.006 Sum_probs=247.5 Q ss_pred hhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc---cccCCc---Ccc----------------ccccc- Q lcl|NC_018086. 18 IRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIA---IQSRTF---DDT----------------NKPNS- 74 (511) Q Consensus 18 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~---~~~~~~---~~~----------------~~~~~- 74 (511) +-+++|++..-=..-+...+..|....++|++..+-+.|.-.- .+.++. ... ...++ T Consensus 1 ~~~~~~~~~~~~~m~V~~~hp~y~a~~~~W~~~~d~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~ 80 (488) T protein:vir:96 1 MLKCLYIKHRGFFMLTPIYHPDYLVNAPQWLRNLDCVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLT 80 (488) T ss_pred CceeEEEeecceeecccccCHHHHHHhhhhhHhhhhhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhh Confidence 3344454442000012223445555556666555443331100 001100 000 00011 Q ss_pred --ee-ccchHHHHHHHHHhhhhccCceecCch-hhHHHHHH-H-HhccChhHHHHHHHHHHhhCCeEEEEeeeCCCC--- Q lcl|NC_018086. 75 --KI-VHNFPKLLVDTSTAYLAGEPITESGDE-KTIKAMQP-V-FKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNK--- 145 (511) Q Consensus 75 --ri-~~n~~k~ivd~~~~~l~g~~~~~~~d~-~~~~~l~~-~-~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g--- 145 (511) |. -.|+++.+++.+++++|.+++++..++ +..+.+.+ + .+.++++...+.+.+.++.+|+++++|..++.+ T Consensus 81 ~~rA~~~n~~~~tl~~l~G~vfrk~p~~~~~~~~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~T~ 160 (488) T protein:vir:96 81 WRLANYVNIVNPTMNAITGAVMRREPEFDTMDNPVLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPESATM 160 (488) T ss_pred hhccccCchhHHHHHHhcchhhccCceeccCCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCcCCH Confidence 22 259999999999999999999986543 33333332 2 345789999999999999999999999887543 Q ss_pred --------ceEEEEEcccceEEEecCCC-CCceEEEEEEEEEeecCCc-----ceEEEEEEEcCCcEEEEEEccCccccc Q lcl|NC_018086. 146 --------KHRFKAVSPMNCLIAYSADL-DEEPVAAIYYNTVISDITG-----HQIRTYEVYTEDLIYKFSTDDEREVYR 211 (511) Q Consensus 146 --------~~~i~~~~p~~~~~v~d~~~-~~~~~~~v~~~~~~~~~~~-----~~~~~~~~~~~~~i~~~~~~~~~~~~~ 211 (511) +|.+..++|.+++----... ....+..|++.....+.++ +.+.++-.++++.+..++...+.+. T Consensus 161 ade~~~~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~~~l~~g~~~v~~~~~~~~~-- 238 (488) T protein:vir:96 161 ADWNKGKKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLINHRLVDGLCEFQEVTDDEYS-- 238 (488) T ss_pred HHHHHhcCCcEEEEechhhhcCcceeccCCceeeEEEEEEEEEEeccCCCcccceEEEEEEEECcEEEEEEEecCCcc-- Confidence 48899999999765322222 2223444444332222221 2333334456665444443333221 Q ss_pred ccccccccccccceeccCCccceEeecCCc----ccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccc Q lcl|NC_018086. 212 EIPEELEIKDYEVHPNLLQKFPVLEIIANE----ERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADS 287 (511) Q Consensus 212 ~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~----~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~ 287 (511) ..+.......|+++.||||++.... .+.+.+.++..|.-++.+..|++..++...+.|++++.+.+.+.. T Consensus 239 -----~e~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~~~~~~~- 312 (488) T protein:vir:96 239 -----DEWTPVLINSKQSDTIPFFLASSQSNEWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAKWMVDMGDMNKT- 312 (488) T ss_pred -----cceEeecCCCcccCeeEEEEEecCCCCCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCceeeeccCCCCcc- Confidence 1222333456789999999986432 356778899999999999999999999999999998754433221 Q ss_pred hhhhhhhh------CceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHH Q lcl|NC_018086. 288 DSISNMKN------DRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLEN 361 (511) Q Consensus 288 ~~~~~~~~------~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~ 361 (511) ....... .+.....+.|+++|+..+.+.- .+..++.+.+.|.....-. . ...++-||++.+.....-.+ T Consensus 313 -~~~~~~~~g~~~~~~~~~~~~~g~~~~~e~~~~~l-~~~~l~~l~~qm~~~Ga~l--~-~~~~~~Ta~~~~~~~~~~~S 387 (488) T protein:vir:96 313 -MASEMNPLGFTLAGRMPYYVKNGDVKVIQAQFSPE-TENKVEKLFEQAVKVGASL--F-TQQSNETATGAAIRSGSSTA 387 (488) T ss_pred -cccccccceeeecccccccccCCceeecCCchhHH-HHHHHHHHHHHHHHHhHhh--c-cCCCcchHHHHHHHHHHhhH Confidence 1111111 1122222456788876654433 3666888888876655321 1 12346789999888888899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCC-CCCcC-HHHHHHHHHHH--hccCChHHHHHhC--C Q lcl|NC_018086. 362 KSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVR-NLPQS-YAELADMAVKL--RDMLPDETIINQF--P 435 (511) Q Consensus 362 k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~-~~p~d-~~e~a~~~~~~--~g~~s~et~~~~l--~ 435 (511) ........+..++.+++++++.+++......... +++|.-++ ..... ..+.++++.++ +|.+|.+|.+..| + T Consensus 388 ~L~~~a~~le~al~~~l~~~A~w~g~~~~~~~~~--~~~~~in~dF~~~~ld~~~~~al~~~~~~G~Is~~t~~~~L~~~ 465 (488) T protein:vir:96 388 SMATLGNNVEDTVRNMLRFIMRYFEGTNLYVNPD--ELVFKLNRDYFDVEVNPQMLQVAYAAMMEGNLPQVSWFELLKRA 465 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCCCCCcCcc--ceEEEeccCCCCccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhC Confidence 9999999999999999999999987654332211 22333222 12222 35567777776 6899999997765 3 Q ss_pred CCCC----HHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_018086. 436 WITD----ARQEVEKADAQRQKRADIALQNFKQ 464 (511) Q Consensus 436 ~v~d----~~~E~~ri~~E~~~~~~~~~~~~~~ 464 (511) .+-+ .++|.++|+++ ..+. T Consensus 466 gvl~~d~~~e~~~~~ie~~----------g~~~ 488 (488) T protein:vir:96 466 RVVRGDMSKEEFDEHIAEL----------GFGM 488 (488) T ss_pred CcCCccCCHHHHHHHHhhc----------CCCC Confidence 3422 23444444321 1111 No 81 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.80 E-value=1.3e-18 Score=118.64 Aligned_cols=499 Identities=11% Similarity=0.040 Sum_probs=220.7 Q ss_pred cchhhc-------------ccccCch-hhHhhhhccCCCHHH----HHHHHHHHHH-------HHHHHHHHHHHhcCCCc Q lcl|NC_018086. 5 NGQINA-------------GDIITTN-IRRKHFIRRNFDLRE----LITLAEMHSR-------SSSAYGVLYDYYKGNHI 59 (511) Q Consensus 5 ~~~~~~-------------~~~~~~~-~~~~~~~~~~~~~~~----l~~~~~~~~~-------~~~~~~~~~~yY~G~~~ 59 (511) .|.+++ +.+.+.. ..-+.......+... +.+++..+.. -+....+-.+||.|++- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw 80 (776) T protein:vir:93 1 MFDLNDKDSTQLVPARTDEGELSPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQW 80 (776) T ss_pred CCCccccccccccccccccccCCCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCC Confidence 233222 1221111 111111222232222 2223322222 22233456799999864 Q ss_pred cccc-CCcCccccccceeccchHHHHHHHHHhhhhccCce--ec----CchhhHH----HHHHHHhccChhHHHHHHHHH Q lcl|NC_018086. 60 AIQS-RTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPIT--ES----GDEKTIK----AMQPVFKENYVTDVNSEEVKL 128 (511) Q Consensus 60 ~~~~-~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~--~~----~d~~~~~----~l~~~~~~n~~~~~~~~~~~~ 128 (511) .... ......+ .-.+.+|.++.+|+..+++...+.+. +. +|.+..+ .+..++..|+++...+.+..+ T Consensus 81 ~~~~~~~l~~~g--~p~~~~N~i~~~i~~v~g~~~~nr~~~~~~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d 158 (776) T protein:vir:93 81 SQDEIDELKERG--QAPTVYNVISQSVNWIIGSEKRGRSDFKVLPRRKDGGKAAERKTALLKYLSDVNHTPFERSMAFEE 158 (776) T ss_pred CHHHHHHHHhcC--CceEEecchHHHHHHHHHHHHhCCcceEEecCChhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHH Confidence 2211 1111111 12478999999999999988766443 32 2222222 355667889999999999999 Q ss_pred HhhCCeEEEEeeeCC--CC-ceEEEEEcccceEEEecCCCCC----ceEEEEE-EEEE---------------------- Q lcl|NC_018086. 129 SGIFGHCFEIHWIDR--NK-KHRFKAVSPMNCLIAYSADLDE----EPVAAIY-YNTV---------------------- 178 (511) Q Consensus 129 a~~~G~~~~~v~~~~--~g-~~~i~~~~p~~~~~v~d~~~~~----~~~~~v~-~~~~---------------------- 178 (511) ++++|.||+-|+++. ++ .+++.+++|.+++ ||+.... ...+.++ .|.. T Consensus 159 ~~~~G~G~~~v~~d~~~~~~~~~~~~~~p~~i~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 236 (776) T protein:vir:93 159 TTKAGIGWLESQVQDENDGEPIYAGAESWRNIL--WDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDN 236 (776) T ss_pred hhhcCcceEEEEeeccCCCCceEeeccChhhee--eccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhc Confidence 999999999888765 23 3555677888764 3432211 0111111 0000 Q ss_pred ------------------------------eecCCcceEEEEEEEcCCcEEEEEEcc----Cccccccc----------- Q lcl|NC_018086. 179 ------------------------------ISDITGHQIRTYEVYTEDLIYKFSTDD----EREVYREI----------- 213 (511) Q Consensus 179 ------------------------------~~~~~~~~~~~~~~~~~~~i~~~~~~~----~~~~~~~~----------- 213 (511) ..+...+.+..+++|....+....... ...+.... T Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~ 316 (776) T protein:vir:93 237 FETWGTDDIDGDDAMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVES 316 (776) T ss_pred ccccchhcccccccccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhc Confidence 000111234445665443222111100 00000000 Q ss_pred -----------------ccccccccccceeccCCccceEeecC-----CcccCchhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 214 -----------------PEELEIKDYEVHPNLLQKFPVLEIIA-----NEERLGDFEAQLSLIDAYNLAVSDSVNDIAYW 271 (511) Q Consensus 214 -----------------~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~ 271 (511) -...........|.+.+.+|+|+++. ...|.|.+..+++.++.+|..+|.+.+.+- T Consensus 317 g~~~~~~~~~~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l~-- 394 (776) T protein:vir:93 317 GRAVLAVSPMMRMHCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYILS-- 394 (776) T ss_pred CceeehheeeeeeEEEEEecchhhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHhhc-- Confidence 00001111222344557788887764 235789999999999999999999988763 Q ss_pred cCceeEeecCCCCccchhhhh-hhhCceeeecCCCc--eeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccC-c Q lcl|NC_018086. 272 NDAYLWLQGFDLSADSDSISN-MKNDRVIVTDEDGM--VKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA-A 347 (511) Q Consensus 272 ~~p~l~~~G~~~~~~~~~~~~-~~~~~~i~~~~~~~--~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-~ 347 (511) +.++.+-.|.-.+. ++.... .+.+.++.+..++. +++.....-...+...+..+...|..+|++.+...|..+| . T Consensus 395 ~~~~~~~~gav~~~-d~~~~~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~ 473 (776) T protein:vir:93 395 TNKVLMEEGAVDDI-DEFRREAARPDAVMTVKNGKLGAVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAV 473 (776) T ss_pred CCceeeccccccch-HHHHHhcccCCceeeeCCccccccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchh Confidence 45666656643222 222222 23456677665542 3333222223556677888899999999988877776654 7 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc----------cccc----------------cceeE Q lcl|NC_018086. 348 SGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAK----------DLKP----------------YEVTP 401 (511) Q Consensus 348 Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~----------~~~~----------------~~i~i 401 (511) ||+|+...............+.|..++++++++++.+........ ...+ .+|.| T Consensus 474 Sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v 553 (776) T protein:vir:93 474 SGVAIQARQEQGSVATNKLFDNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFII 553 (776) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEE Confidence 999999888777777777777777777777777666553321100 0000 12222 Q ss_pred EeCCCCCcCHHHHHHHHHHHhccCChHH-------HHHhC--CCCCCHHHHH-------------------H-------- Q lcl|NC_018086. 402 VFVRNLPQSYAELADMAVKLRDMLPDET-------IINQF--PWITDARQEV-------------------E-------- 445 (511) Q Consensus 402 ~f~~~~p~d~~e~a~~~~~~~g~~s~et-------~~~~l--~~v~d~~~E~-------------------~-------- 445 (511) .=.+..+.-..+..+.+..+.+.+..+. +++.. +..++..+.+ . T Consensus 554 ~~~~~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~~ 633 (776) T protein:vir:93 554 DEAEWRATMRQAAVAELMEVIGKMPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQQ 633 (776) T ss_pred eecccchhHHHHHHHHHHHHHhhcChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhHHHHHHHhhHH Confidence 2222222212223333333322111110 11111 1100000000 0 Q ss_pred -HHHHH--------HHH-----HHHHHHh--hccc---ccc-----------CC-------CCCCccccccCCCCCCccc Q lcl|NC_018086. 446 -KADAQ--------RQK-----RADIALQ--NFKQ---TSA-----------VQ-------GASTAAANKLDKNPANTST 488 (511) Q Consensus 446 -ri~~E--------~~~-----~~~~~~~--~~~~---~~~-----------~~-------~~~~~~~~~~~~~~~~~~~ 488 (511) .++.+ +++ .++.... .... ... .. .......+......+...+ T Consensus 634 ~q~q~~~~~a~~~~~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~a~q~a~qa~~~~~~~~~~a~~a~~~~~~a~~~~p 713 (776) T protein:vir:93 634 QQYNDALAIATLEEQQAKARKAAAEAQVAEAKAKHISRMAIREGVGAVKDATDAATAIAFMPELAGLSDGILRESGWDDP 713 (776) T ss_pred HHHHHHHhhhhhhHhhHHHHHHHHHHHHHhhhhhhhhhcchhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhcccccccc Confidence 00000 000 0000000 0000 000 00 0000000000000000000 Q ss_pred cccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 489 ITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) ..+..+..++. ....+.+|.|- T Consensus 714 ~~p~~~~~~~~-~~~~~~~p~~p 735 (776) T protein:vir:93 714 NTPQPASAASG-MPPAPAQPAQP 735 (776) T ss_pred ccccccccccC-CCCCCCCCCCC Confidence 00000000000 00011111111 No 82 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.73 E-value=6.2e-16 Score=103.95 Aligned_cols=487 Identities=13% Similarity=0.057 Sum_probs=236.4 Q ss_pred CCCccchhhcccccCchhhHh-hhhccCCCHHH-HHHHHHHHH-------HHHHHHHHHHHHhcCCCcccccC-CcCccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRK-HFIRRNFDLRE-LITLAEMHS-------RSSSAYGVLYDYYKGNHIAIQSR-TFDDTN 70 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-l~~~~~~~~-------~~~~~~~~~~~yY~G~~~~~~~~-~~~~~~ 70 (511) ||-+=. +..+..+.-.+. .......+.+. +.++...+. +.+.....-.+||.|.+-...-. .....+ T Consensus 1 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g 77 (711) T protein:vir:10 1 MAKKQK---KSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQ 77 (711) T ss_pred CCcccc---cccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHHHHHHhcC Confidence 443221 223333322222 22333333333 333444322 23333445689999986421100 001111 Q ss_pred cccceeccchHHHHHHHHHhhhhccCcee--c--------------------------CchhhHHH----HHHHHhccCh Q lcl|NC_018086. 71 KPNSKIVHNFPKLLVDTSTAYLAGEPITE--S--------------------------GDEKTIKA----MQPVFKENYV 118 (511) Q Consensus 71 ~~~~ri~~n~~k~ivd~~~~~l~g~~~~~--~--------------------------~d~~~~~~----l~~~~~~n~~ 118 (511) .-.+.+|.++.+|+..+++.-.+.+.+ . +|.+..+. +..++..|+. T Consensus 78 --~p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~ 155 (711) T protein:vir:10 78 --RPCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDA 155 (711) T ss_pred --CCcEEEcchHHHHHHHhhhHhhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcCh Confidence 124778999999999999987665443 1 12222232 4456778999 Q ss_pred hHHHHHHHHHHhhCCeEEEEeeeCC------CCceEEEEE-cccceEEEecCCCCC----ceE-EEEEEEEEee------ Q lcl|NC_018086. 119 TDVNSEEVKLSGIFGHCFEIHWIDR------NKKHRFKAV-SPMNCLIAYSADLDE----EPV-AAIYYNTVIS------ 180 (511) Q Consensus 119 ~~~~~~~~~~a~~~G~~~~~v~~~~------~g~~~i~~~-~p~~~~~v~d~~~~~----~~~-~~v~~~~~~~------ 180 (511) +...+.+..+++++|.||+-++.+. +|++++..+ +|.++ +||+.... ... ++.+.|...+ T Consensus 156 ~~~~s~af~d~~~~G~G~~ev~~d~~~~d~~~~e~~i~~v~~p~~v--~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~y 233 (711) T protein:vir:10 156 ETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSV--TIDPDAKKRDRSDMNWCLIDDTMSKEKFKALY 233 (711) T ss_pred hHHHHHHHHHhhhcCcceEEEEecccCCCCCCCCeEEeeecChhhe--eeCccccccChhhhcceeeeecCCHHHHHHhC Confidence 9999999999999999998776532 478888777 68885 55543211 111 2222221110 Q ss_pred ------------------cCCcceEEEEEEEcCCcEEEEEE--ccCcccccccc-c------------------------ Q lcl|NC_018086. 181 ------------------DITGHQIRTYEVYTEDLIYKFST--DDEREVYREIP-E------------------------ 215 (511) Q Consensus 181 ------------------~~~~~~~~~~~~~~~~~i~~~~~--~~~~~~~~~~~-~------------------------ 215 (511) +.....+..+++|.......... ..+........ . T Consensus 234 p~~a~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~ 313 (711) T protein:vir:10 234 PDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTY 313 (711) T ss_pred CchhhhhhhcccccccCcccCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEE Confidence 01112344455554433222111 11110000000 0 Q ss_pred ----ccccccccceeccCCccceEeecC-------CcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEe-ecCCC Q lcl|NC_018086. 216 ----ELEIKDYEVHPNLLQKFPVLEIIA-------NEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWL-QGFDL 283 (511) Q Consensus 216 ----~~~~~~~~~~~~~~g~iPvv~~~n-------~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~-~G~~~ 283 (511) .......+..|.+.+.+|+|+|.. ...+.|.+..+++.++.+|...|.+...+...+.+.+++ .|.-. T Consensus 314 ~~~~~G~~~L~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~gai~ 393 (711) T protein:vir:10 314 WRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVE 393 (711) T ss_pred EEEEecceeecCCCCCCCCcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCcccC Confidence 000001122344456678776642 123457889999999999999999999999888876655 44322 Q ss_pred Cccchhhh-hhhhCceeeecCCC----ceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc-CccHHHHHHHHH Q lcl|NC_018086. 284 SADSDSIS-NMKNDRVIVTDEDG----MVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-AASGQALKAATQ 357 (511) Q Consensus 284 ~~~~~~~~-~~~~~~~i~~~~~~----~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~~Sg~Ai~~~~~ 357 (511) +.++.... ..+.+.++.+..+. .+++...+.-...+...+......|-..|++.+...|..+ +.||+||..... T Consensus 394 ~~~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~q~ 473 (711) T protein:vir:10 394 GREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQR 473 (711) T ss_pred ChHHHHHhccccCCCeeEecccccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccchHHHHHHHHHH Confidence 22222222 23345677776543 3444443444466777788889999999998877776654 479999999887 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----------CCcccc-------------------------ccceeEE Q lcl|NC_018086. 358 PLENKSAVKESKFRKVLAKRYELVCSYLEFMN----------KAKDLK-------------------------PYEVTPV 402 (511) Q Consensus 358 ~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~----------~~~~~~-------------------------~~~i~i~ 402 (511) .-..........|..+.+++.++++.+..... .....+ ..+|.|. T Consensus 474 qg~~~l~~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~ 553 (711) T protein:vir:10 474 QGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVT 553 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccceeeeEEEEe Confidence 77766666666777777777666666543211 100000 0123333 Q ss_pred eCCCCCcCHHHHHHHHHHHhccCChH------HHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccc Q lcl|NC_018086. 403 FVRNLPQSYAELADMAVKLRDMLPDE------TIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAA 476 (511) Q Consensus 403 f~~~~p~d~~e~a~~~~~~~g~~s~e------t~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 476 (511) =.+..+.-..+.+..+..+.+.+|.- .+++.+++ ++.++-.+++++.... .+...... T Consensus 554 ~~p~~~s~r~~~~~~l~ql~~~~p~~~~~~~~~il~~~d~-p~~~el~e~lr~~~~~---------------~~~~~~~~ 617 (711) T protein:vir:10 554 TGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPP---------------NVLSKDER 617 (711) T ss_pred eccCchhHHHHHHHHHHHHHhhcchhhhHHHHHHHHhcCC-CCHHHHHHHHHhhcCc---------------ccCcchhh Confidence 33344433344555555555554431 12333332 3333323333221000 00000000 Q ss_pred cccCCCCCCccccccCC-------CCccccccccCCCCCCCC Q lcl|NC_018086. 477 NKLDKNPANTSTITTTD-------PVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 477 ~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~ 511 (511) .+..... .....+..+ -+.-..+-.....++.++ T Consensus 618 ~~~qq~~-~e~qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae 658 (711) T protein:vir:10 618 EAIEEDM-PEQTEPTPEQQVEMAKSQADMAQAEADTAQAQAD 658 (711) T ss_pred hHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000000 000000000 000000000111111111 No 83 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.72 E-value=1.6e-17 Score=112.60 Aligned_cols=420 Identities=10% Similarity=0.043 Sum_probs=202.5 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CcccccCCcCccccccc----e Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGN-HIAIQSRTFDDTNKPNS----K 75 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~-~~~~~~~~~~~~~~~~~----r 75 (511) |+ .|++...-.+... .. .+......+=.|. .+.........+...++ . T Consensus 1 ~~-~~~~a~~~~~~~~------------------------a~--~~~~~~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~ 53 (461) T protein:vir:80 1 MY-SIDKAKQAKIDSK------------------------IV--NRNDFMVGHGKANSRDKLTRQTPGNGQKLDLKACEN 53 (461) T ss_pred Cc-cchhhhhhhhhhh------------------------hh--hhhHHHhhcCCcchhhhhhccccCcccccCHHHHHH Confidence 11 1111111111100 00 0000111111111 11000000000000011 0 Q ss_pred --eccchHHHHHHHHHhhhhccCceecCch-hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCC---ceEE Q lcl|NC_018086. 76 --IVHNFPKLLVDTSTAYLAGEPITESGDE-KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNK---KHRF 149 (511) Q Consensus 76 --i~~n~~k~ivd~~~~~l~g~~~~~~~d~-~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g---~~~i 149 (511) -.+.+++.+|+..+..++.+++.+.+++ +..+.+...|++-++...+.++.+++..||.|++++-..... .... T Consensus 54 lY~~~~l~r~iVd~~a~d~~r~g~~i~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~ 133 (461) T protein:vir:80 54 LYASNSIAMNIVDIISEDMVRAGWSLKTDNKEMKKNIESKWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLS 133 (461) T ss_pred HHHhCCccchhhccchHHhhcCCeeeecCCHHHHHHHHHHHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCccccCcc Confidence 1357788999999999999999988755 455678888888889999999999999999999887653211 1111 Q ss_pred EEEcccce--EE---EecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccc Q lcl|NC_018086. 150 KAVSPMNC--LI---AYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEV 224 (511) Q Consensus 150 ~~~~p~~~--~~---v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (511) ..+.|... +. +|+.. .+.. . ....+..+. .++.|..+. .......... ... ........ T Consensus 134 ~pl~~~~~~~~~~l~~~~~~----~i~~-~--~~~~dp~sp-----~fg~P~~y~-i~~~~~~~~~-~~~--~~~~~~~~ 197 (461) T protein:vir:80 134 TAIDPKTIKSIPYINTFNTQ----KVTQ-L--YLNQDMFSE-----HFGEVEFFE-VNRVSQLGEE-ILS--GTTASTSE 197 (461) T ss_pred CCcccccccceeEEEecccc----ccch-h--hhcccCcCc-----ccccceEEE-Eecccccccc-ccc--cccCccce Confidence 11122110 00 01100 0000 0 000000000 111111110 0000000000 000 00000111 Q ss_pred eeccCCccceEeecCC-----cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCC--ccc-hhhhhh--- Q lcl|NC_018086. 225 HPNLLQKFPVLEIIAN-----EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLS--ADS-DSISNM--- 293 (511) Q Consensus 225 ~~~~~g~iPvv~~~n~-----~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~--~~~-~~~~~~--- 293 (511) .-|+ -++++|.+. -.|+|.++.+.+.+.++++++-.....+..+..+.+.+.|...- +.. .....+ T Consensus 198 ~iH~---SRii~~~~~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~~~~ 274 (461) T protein:vir:80 198 QIHR---SRIIHEQGLRFEGETKGRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDDKANLTAMLDFM 274 (461) T ss_pred EEcc---ccEEEecCCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhhchHHHHHHHHHHHh Confidence 2222 244555443 35999999999999999999988888888888887776664321 111 111111 Q ss_pred h-hCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccc-cc--ccCccHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_018086. 294 K-NDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVS-KD--FTAASGQALKAATQPLENKSAVKE-S 368 (511) Q Consensus 294 ~-~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~--~~~~Sg~Ai~~~~~~l~~k~~~~~-~ 368 (511) . ..+++.+..+.+++. .+.+.+.+...++.+...|...+++|-.-. +. .+++||..= .......++.++ . T Consensus 275 ~~~~g~~~~d~~e~~e~--~~~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge~D---~~~yyd~i~~~qe~ 349 (461) T protein:vir:80 275 FRTEALAIIKGDEQLTK--ESTNVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQYD---VMNYYARVSSIQEN 349 (461) T ss_pred cCCceEEEEcCCcceEE--EecCcCCHHHHHHHHHHHHhhhhcCCeeeeecccCCccccchHH---HHHHHHHHHHHHHH Confidence 1 234555666655544 455667888999999999999999997543 32 234566642 223445555555 5 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHh---------ccCChHHHHHhCCCCCC Q lcl|NC_018086. 369 KFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR---------DMLPDETIINQFPWITD 439 (511) Q Consensus 369 ~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~---------g~~s~et~~~~l~~v~d 439 (511) .++..+++++++|+..........+.+..+++|.|++-.+.++++.|++..+.+ |++|.++++..+- T Consensus 350 ~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~---- 425 (461) T protein:vir:80 350 RLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRF---- 425 (461) T ss_pred HHHHHHHHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHH---- Confidence 678999999998876444344444555668999999999999999988765543 4444444432210 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCc Q lcl|NC_018086. 440 ARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVA 496 (511) Q Consensus 440 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (511) .. ..+++ ..... ..+++.++.+.. .....+.+++.| T Consensus 426 -------------~~--~~~~~---~~~~~-~~~~~~~~~~~~--~~~~~~~e~~~g 461 (461) T protein:vir:80 426 -------------GR--FGLEN---SSKFS-GDSAEIDKLAKL--VYDAYAKKNADG 461 (461) T ss_pred -------------Hh--cCCCC---CccCC-CCCchhhhhhhh--ccccccccCCCC Confidence 00 00000 00000 010110000000 000000001111 No 84 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.72 E-value=2.2e-15 Score=100.98 Aligned_cols=499 Identities=11% Similarity=0.025 Sum_probs=217.7 Q ss_pred CccchhhcccccCchhhHhhhhccCCCHHHHHHHHHH---HHHHHHHHHHHHHHhcCCCccccc-CCcCccccccceecc Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEM---HSRSSSAYGVLYDYYKGNHIAIQS-RTFDDTNKPNSKIVH 78 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~---~~~~~~~~~~~~~yY~G~~~~~~~-~~~~~~~~~~~ri~~ 78 (511) |-|.....-....++... ...++.+.+..+... +.+-+....+..+||.|.+-...- ......+. -.+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~q~~~r~~a~~d~~fy~G~QW~~~~~~~l~~~g~--p~~~~ 74 (772) T protein:vir:10 1 MQITENDRQYLNGLPPAG----DTPLTVDEYADINYEIEDQPAWRAVADKEMDYADGNQLDTELLRRQQALGI--PPAVE 74 (772) T ss_pred CCcchhhHHhhccCCccc----ccccCHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCC--CcEEE Confidence 445432221111111000 123444554443333 333334445678899998642111 01111122 24789 Q ss_pred chHHHHHHHHHhhhhccCcee--cC-----chhhHH----HHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCC--- Q lcl|NC_018086. 79 NFPKLLVDTSTAYLAGEPITE--SG-----DEKTIK----AMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRN--- 144 (511) Q Consensus 79 n~~k~ivd~~~~~l~g~~~~~--~~-----d~~~~~----~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~--- 144 (511) |.++.+|+..+++.-.+.+.+ .. |.+..+ .+..++..|+++...+.+..+++++|.||+-++.+.+ T Consensus 75 N~i~~~v~~v~g~~~~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d~~~ 154 (772) T protein:vir:10 75 DLIGPALLSLQGYEAVTRTDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRESDPFK 154 (772) T ss_pred cchHHHHHHHHHHHHhcCcceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccccCCCC Confidence 999999999999987765443 22 222222 3456677899999999999999999999998887653 Q ss_pred CceEEEEEcccceEEEecCCCCCceE---EEE-EEEEE---------------------------------e-------- Q lcl|NC_018086. 145 KKHRFKAVSPMNCLIAYSADLDEEPV---AAI-YYNTV---------------------------------I-------- 179 (511) Q Consensus 145 g~~~i~~~~p~~~~~v~d~~~~~~~~---~~v-~~~~~---------------------------------~-------- 179 (511) +.++|..++|.++ +||+..+.... +.+ ..|.. . T Consensus 155 ~~i~i~~v~p~~v--~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (772) T protein:vir:10 155 FPYRCRPIRRDEI--HWDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGL 232 (772) T ss_pred CCeEEEeeCcccc--eecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCccccccccccccccc Confidence 4678999999985 45553321100 011 00000 0 Q ss_pred -----------------ecCCcceEEEEEEEcCCcEEEEEE--ccCccccccc--------------------------- Q lcl|NC_018086. 180 -----------------SDITGHQIRTYEVYTEDLIYKFST--DDEREVYREI--------------------------- 213 (511) Q Consensus 180 -----------------~~~~~~~~~~~~~~~~~~i~~~~~--~~~~~~~~~~--------------------------- 213 (511) .+...+.+..+++|......+... ..+....... T Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~~~ 312 (772) T protein:vir:10 233 HNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVSRVRRS 312 (772) T ss_pred ccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeeeEEEEE Confidence 000113344455554432222111 1111110000 Q ss_pred -ccccccccccceeccCCccceEeecCC---c--ccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccc Q lcl|NC_018086. 214 -PEELEIKDYEVHPNLLQKFPVLEIIAN---E--ERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADS 287 (511) Q Consensus 214 -~~~~~~~~~~~~~~~~g~iPvv~~~n~---~--~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~ 287 (511) -...........|.+.+.+|+|+|+-. . ...|.+..+++.++.+|...|.+...+...+ +..-.|.-...+. T Consensus 313 ~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~G~vr~~kd~Qr~~N~~~S~~~~~l~~~~--~~~~~gav~~~d~ 390 (772) T protein:vir:10 313 YWLGPHCLHDGPTPYTHRHFPYVPFFGFREDATGIPYGYVRGMKYAQDSLNSGVSKLRWGMSVAR--VERTKGAVAMTDA 390 (772) T ss_pred EEecceeeccCCCCCCCCccceEEEeeeEeccCCcccchhhhhhhHHHHHHHHHHHHHHHHhccc--ccccCCCccchhH Confidence 000011111223334455777765532 1 2347888999999999999999988765443 3322333222222 Q ss_pred hhhhhhh-hCceeeecCCC------ceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccC-ccHHHHHHHHHHH Q lcl|NC_018086. 288 DSISNMK-NDRVIVTDEDG------MVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA-ASGQALKAATQPL 359 (511) Q Consensus 288 ~~~~~~~-~~~~i~~~~~~------~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-~Sg~Ai~~~~~~l 359 (511) .+...+. .+.++.+..+. .++......-...+...+......|-.++++-+...|..+| .||+||......- T Consensus 391 ~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~na~SGvAi~~rq~qg 470 (772) T protein:vir:10 391 QFRRQIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKGTATSGIQEQQQIEQS 470 (772) T ss_pred HHHHhccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchhhHHHHHHHHHHH Confidence 3333333 24566665432 12222222223556666777888899999887766666554 6999998876665 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---------Cccc------------------------cc----cceeEE Q lcl|NC_018086. 360 ENKSAVKESKFRKVLAKRYELVCSYLEFMNK---------AKDL------------------------KP----YEVTPV 402 (511) Q Consensus 360 ~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~---------~~~~------------------------~~----~~i~i~ 402 (511) ..........+..+.+++.++++.+....-. .... +. .+|.|. T Consensus 471 ~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~ 550 (772) T protein:vir:10 471 NQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKVALE 550 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceeccceeeeEEEEee Confidence 5555566666666666666655554422110 0000 00 111111 Q ss_pred eCCCCCcCHHHHHHHHHHHhccCChHH-------------------H---HHhCCCCCCHHHHHHHH------------- Q lcl|NC_018086. 403 FVRNLPQSYAELADMAVKLRDMLPDET-------------------I---INQFPWITDARQEVEKA------------- 447 (511) Q Consensus 403 f~~~~p~d~~e~a~~~~~~~g~~s~et-------------------~---~~~l~~v~d~~~E~~ri------------- 447 (511) =.+..+.=..+..+.+.++.+.++.+. + ++.+-.-.+++.+...+ T Consensus 551 ~~p~~~t~r~~~~~~m~ql~~~~~P~~~~~~~~~~le~~D~p~~~ei~~~ir~~~~~~~peq~~~~~~q~~qq~~~~~~~ 630 (772) T protein:vir:10 551 DVPSTNSYRGQQLNAMSEAVKSMPPQYQAAVLPFLVSLMDVPFKRDVVEAIRAVDQQQTPEQIQQQIDQAVQDALAKAGN 630 (772) T ss_pred ccccchHHHHHHHHHHHHHHhccChhHHHHHHHHHHhhcCCCChHHHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHH Confidence 111111112233333333332222211 1 11111112332111110 Q ss_pred -------HHH--H-HHHH-----H----------HHH------hhcccccc-------CCCCC---CccccccCCCCCCc Q lcl|NC_018086. 448 -------DAQ--R-QKRA-----D----------IAL------QNFKQTSA-------VQGAS---TAAANKLDKNPANT 486 (511) Q Consensus 448 -------~~E--~-~~~~-----~----------~~~------~~~~~~~~-------~~~~~---~~~~~~~~~~~~~~ 486 (511) ..+ + ++.+ + .++ ..+.+..+ ..|.. ..........++.. T Consensus 631 el~~~q~~a~~~~~~A~a~~~~aqa~~~~~~a~~~a~~aa~~~~q~~q~a~~ad~~l~~~g~~~~~~~~~~~~~p~~~~~ 710 (772) T protein:vir:10 631 DIKLRELEIKERKADSEISGLNAKAVQIGVQAAFSAMQAGAQIAQMPMIAPIADAVMQSAGYQRPNPAGDDPNYPIADQT 710 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhhhhhHHHHHHHHhcccccccccccCCCCCCCCCc Confidence 000 0 0000 0 000 00000000 00000 00000000000011 Q ss_pred cccccCCCCc----cccccc--cCCCCCCCC Q lcl|NC_018086. 487 STITTTDPVA----AKEQEK--AIQKKPKTD 511 (511) Q Consensus 487 ~~~~~~~~~~----~~~~~~--~~~~~~~~~ 511 (511) ++...+.++. +.+.+. +.+.+.+|. T Consensus 711 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 741 (772) T protein:vir:10 711 AAMNIRSPYIQGQGPAAEAEAESVSVRRNTS 741 (772) T ss_pred cCCCCCccCCCCCCCCCccccCCCCCccCCC Confidence 1111111111 111111 111111111 No 85 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.71 E-value=2e-15 Score=101.13 Aligned_cols=474 Identities=11% Similarity=0.048 Sum_probs=221.8 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHH-------HHHHHHHHHHHHHHhcCCCccccc-CCcCccccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEM-------HSRSSSAYGVLYDYYKGNHIAIQS-RTFDDTNKP 72 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-------~~~~~~~~~~~~~yY~G~~~~~~~-~~~~~~~~~ 72 (511) |.=.+.-+ .+ -.+...+.+...++... +.+-+....+..+||.|.+-...- ......+. T Consensus 1 ~~~~~~~~--~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~- 67 (714) T protein:vir:99 1 MKNETNTM--AT----------KNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQ- 67 (714) T ss_pred CCcccccc--cC----------CCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCC- Confidence 11111100 00 01111222222222222 222234445778999998742111 00111122 Q ss_pred cceeccchHHHHHHHHHhhhhccCcee--cC---chh---hHH----HHHHHHhccChhHHHHHHHHHHhhCCeEEEEee Q lcl|NC_018086. 73 NSKIVHNFPKLLVDTSTAYLAGEPITE--SG---DEK---TIK----AMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHW 140 (511) Q Consensus 73 ~~ri~~n~~k~ivd~~~~~l~g~~~~~--~~---d~~---~~~----~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~ 140 (511) -.+.+|.++.+|+..+++.-.+.+.+ .. ++. ..+ .+..++..|+++...+.+..+++++|.||+-++ T Consensus 68 -p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~ 146 (714) T protein:vir:99 68 -PMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVR 146 (714) T ss_pred -CcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEec Confidence 24779999999999999987765443 21 221 222 355667789999999999999999999998888 Q ss_pred eCC---CCceEEEEEcccceEEEecCCCCC----c-eEEEEEEEEEeec------------------------------- Q lcl|NC_018086. 141 IDR---NKKHRFKAVSPMNCLIAYSADLDE----E-PVAAIYYNTVISD------------------------------- 181 (511) Q Consensus 141 ~~~---~g~~~i~~~~p~~~~~v~d~~~~~----~-~~~~v~~~~~~~~------------------------------- 181 (511) .+. ++.+++..++|.+++ ||+.... . ..++++.|...+. T Consensus 147 ~~~d~~~~~i~i~~v~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~ 224 (714) T protein:vir:99 147 RNSDPFGPEFKVSTVSRNEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQ 224 (714) T ss_pred cccCCCCCCeEEEecchhhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccc Confidence 764 456899999999964 4442211 1 1112222211000 Q ss_pred -----------------------CCcceEEEEEEEcCCcEEEEE--EccCccccccccc--------------------- Q lcl|NC_018086. 182 -----------------------ITGHQIRTYEVYTEDLIYKFS--TDDEREVYREIPE--------------------- 215 (511) Q Consensus 182 -----------------------~~~~~~~~~~~~~~~~i~~~~--~~~~~~~~~~~~~--------------------- 215 (511) .+...+..+++|......... ..++......... T Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~r 304 (714) T protein:vir:99 225 PSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSR 304 (714) T ss_pred ccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccce Confidence 001123334555432221111 1111111000000 Q ss_pred -------ccccccccceeccCCccceEeecCC---cc--cCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCC Q lcl|NC_018086. 216 -------ELEIKDYEVHPNLLQKFPVLEIIAN---EE--RLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDL 283 (511) Q Consensus 216 -------~~~~~~~~~~~~~~g~iPvv~~~n~---~~--g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~ 283 (511) ...+......|.+.+.+|+|+|... .. ..|.+..+++.++.+|...|.+...+ .++..++..|.-. T Consensus 305 v~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~ 382 (714) T protein:vir:99 305 IREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQ 382 (714) T ss_pred EEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccc Confidence 0000001112223344666555432 22 23678889999999999999988765 3555555556544 Q ss_pred Cccchhhhhhh-hCceeeecCCCc--------eeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccC-ccHHHHH Q lcl|NC_018086. 284 SADSDSISNMK-NDRVIVTDEDGM--------VKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA-ASGQALK 353 (511) Q Consensus 284 ~~~~~~~~~~~-~~~~i~~~~~~~--------~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-~Sg~Ai~ 353 (511) ..+........ .++++.+..+.. +++.....-...+...+......|-.+|++.+...|..+| .||+||. T Consensus 383 ~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~ 462 (714) T protein:vir:99 383 LSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAIS 462 (714) T ss_pred ccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHH Confidence 33333333222 234555543211 2222212223555666777788999999987776666554 7999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----------cCCCccc-----------------------ccccee Q lcl|NC_018086. 354 AATQPLENKSAVKESKFRKVLAKRYELVCSYLEF----------MNKAKDL-----------------------KPYEVT 400 (511) Q Consensus 354 ~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~----------~~~~~~~-----------------------~~~~i~ 400 (511) .....-..........+..+.+++.++++.+... .+..... -..+|. T Consensus 463 ~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~ 542 (714) T protein:vir:99 463 NLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIA 542 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEE Confidence 8877655555555566666666666655554321 1110000 011233 Q ss_pred EEeCCCCCcCHHHHHHHHHHHhccCCh-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCC Q lcl|NC_018086. 401 PVFVRNLPQSYAELADMAVKLRDMLPD-------ETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGAST 473 (511) Q Consensus 401 i~f~~~~p~d~~e~a~~~~~~~g~~s~-------et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (511) |.=.+..|....+.++.+..+.+.++. ..+++.+.+ ++.++-.++|++. .+.+... .+ T Consensus 543 i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~-----------~~~~~~~---~~ 607 (714) T protein:vir:99 543 LAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAA-----------LGTPKSP---DE 607 (714) T ss_pred EeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHH-----------cCCCCCc---cc Confidence 333444444445566666666554433 334445544 5555555666442 0010000 00 Q ss_pred ccccccCCCCCCccccccCCCCccccccccCCC-CCCCC Q lcl|NC_018086. 474 AAANKLDKNPANTSTITTTDPVAAKEQEKAIQK-KPKTD 511 (511) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 511 (511) .. +..+.......+. +......+-...+- -.++. T Consensus 608 ~~---~e~q~~~~~~q~~-~~~q~~lq~~~~~a~~~k~e 642 (714) T protein:vir:99 608 MT---PEEQEVAAQQQAL-QQQQAELQMREMAGRVAKLE 642 (714) T ss_pred cc---hhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH Confidence 00 0000000000000 00000000000000 00111 No 86 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.71 E-value=2e-15 Score=101.13 Aligned_cols=474 Identities=11% Similarity=0.048 Sum_probs=221.8 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHH-------HHHHHHHHHHHHHHhcCCCccccc-CCcCccccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEM-------HSRSSSAYGVLYDYYKGNHIAIQS-RTFDDTNKP 72 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-------~~~~~~~~~~~~~yY~G~~~~~~~-~~~~~~~~~ 72 (511) |.=.+.-+ .+ -.+...+.+...++... +.+-+....+..+||.|.+-...- ......+. T Consensus 1 ~~~~~~~~--~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~- 67 (714) T protein:vir:32 1 MKNETNTM--AT----------KNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQ- 67 (714) T ss_pred CCcccccc--cC----------CCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCC- Confidence 11111100 00 01111222222222222 222234445778999998742111 00111122 Q ss_pred cceeccchHHHHHHHHHhhhhccCcee--cC---chh---hHH----HHHHHHhccChhHHHHHHHHHHhhCCeEEEEee Q lcl|NC_018086. 73 NSKIVHNFPKLLVDTSTAYLAGEPITE--SG---DEK---TIK----AMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHW 140 (511) Q Consensus 73 ~~ri~~n~~k~ivd~~~~~l~g~~~~~--~~---d~~---~~~----~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~ 140 (511) -.+.+|.++.+|+..+++.-.+.+.+ .. ++. ..+ .+..++..|+++...+.+..+++++|.||+-++ T Consensus 68 -p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~ 146 (714) T protein:vir:32 68 -PMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVR 146 (714) T ss_pred -CcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEec Confidence 24779999999999999987765443 21 221 222 355667789999999999999999999998888 Q ss_pred eCC---CCceEEEEEcccceEEEecCCCCC----c-eEEEEEEEEEeec------------------------------- Q lcl|NC_018086. 141 IDR---NKKHRFKAVSPMNCLIAYSADLDE----E-PVAAIYYNTVISD------------------------------- 181 (511) Q Consensus 141 ~~~---~g~~~i~~~~p~~~~~v~d~~~~~----~-~~~~v~~~~~~~~------------------------------- 181 (511) .+. ++.+++..++|.+++ ||+.... . ..++++.|...+. T Consensus 147 ~~~d~~~~~i~i~~v~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~ 224 (714) T protein:vir:32 147 RNSDPFGPEFKVSTVSRNEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQ 224 (714) T ss_pred cccCCCCCCeEEEecchhhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccc Confidence 764 456899999999964 4442211 1 1112222211000 Q ss_pred -----------------------CCcceEEEEEEEcCCcEEEEE--EccCccccccccc--------------------- Q lcl|NC_018086. 182 -----------------------ITGHQIRTYEVYTEDLIYKFS--TDDEREVYREIPE--------------------- 215 (511) Q Consensus 182 -----------------------~~~~~~~~~~~~~~~~i~~~~--~~~~~~~~~~~~~--------------------- 215 (511) .+...+..+++|......... ..++......... T Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~r 304 (714) T protein:vir:32 225 PSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSR 304 (714) T ss_pred ccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccce Confidence 001123334555432221111 1111111000000 Q ss_pred -------ccccccccceeccCCccceEeecCC---cc--cCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCC Q lcl|NC_018086. 216 -------ELEIKDYEVHPNLLQKFPVLEIIAN---EE--RLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDL 283 (511) Q Consensus 216 -------~~~~~~~~~~~~~~g~iPvv~~~n~---~~--g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~ 283 (511) ...+......|.+.+.+|+|+|... .. ..|.+..+++.++.+|...|.+...+ .++..++..|.-. T Consensus 305 v~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~ 382 (714) T protein:vir:32 305 IREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQ 382 (714) T ss_pred EEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccc Confidence 0000001112223344666555432 22 23678889999999999999988765 3555555556544 Q ss_pred Cccchhhhhhh-hCceeeecCCCc--------eeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccC-ccHHHHH Q lcl|NC_018086. 284 SADSDSISNMK-NDRVIVTDEDGM--------VKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA-ASGQALK 353 (511) Q Consensus 284 ~~~~~~~~~~~-~~~~i~~~~~~~--------~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-~Sg~Ai~ 353 (511) ..+........ .++++.+..+.. +++.....-...+...+......|-.+|++.+...|..+| .||+||. T Consensus 383 ~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~ 462 (714) T protein:vir:32 383 LSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAIS 462 (714) T ss_pred ccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHH Confidence 33333333222 234555543211 2222212223555666777788999999987776666554 7999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----------cCCCccc-----------------------ccccee Q lcl|NC_018086. 354 AATQPLENKSAVKESKFRKVLAKRYELVCSYLEF----------MNKAKDL-----------------------KPYEVT 400 (511) Q Consensus 354 ~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~----------~~~~~~~-----------------------~~~~i~ 400 (511) .....-..........+..+.+++.++++.+... .+..... -..+|. T Consensus 463 ~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~ 542 (714) T protein:vir:32 463 NLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIA 542 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEE Confidence 8877655555555566666666666655554321 1110000 011233 Q ss_pred EEeCCCCCcCHHHHHHHHHHHhccCCh-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCC Q lcl|NC_018086. 401 PVFVRNLPQSYAELADMAVKLRDMLPD-------ETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGAST 473 (511) Q Consensus 401 i~f~~~~p~d~~e~a~~~~~~~g~~s~-------et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (511) |.=.+..|....+.++.+..+.+.++. ..+++.+.+ ++.++-.++|++. .+.+... .+ T Consensus 543 i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~-----------~~~~~~~---~~ 607 (714) T protein:vir:32 543 LAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAA-----------LGTPKSP---DE 607 (714) T ss_pred EeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHH-----------cCCCCCc---cc Confidence 333444444445566666666554433 334445544 5555555666442 0010000 00 Q ss_pred ccccccCCCCCCccccccCCCCccccccccCCC-CCCCC Q lcl|NC_018086. 474 AAANKLDKNPANTSTITTTDPVAAKEQEKAIQK-KPKTD 511 (511) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 511 (511) .. +..+.......+. +......+-...+- -.++. T Consensus 608 ~~---~e~q~~~~~~q~~-~~~q~~lq~~~~~a~~~k~e 642 (714) T protein:vir:32 608 MT---PEEQEVAAQQQAL-QQQQAELQMREMAGRVAKLE 642 (714) T ss_pred cc---hhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH Confidence 00 0000000000000 00000000000000 00111 No 87 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.71 E-value=2e-15 Score=101.13 Aligned_cols=474 Identities=11% Similarity=0.048 Sum_probs=221.8 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHH-------HHHHHHHHHHHHHHhcCCCccccc-CCcCccccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEM-------HSRSSSAYGVLYDYYKGNHIAIQS-RTFDDTNKP 72 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-------~~~~~~~~~~~~~yY~G~~~~~~~-~~~~~~~~~ 72 (511) |.=.+.-+ .+ -.+...+.+...++... +.+-+....+..+||.|.+-...- ......+. T Consensus 1 ~~~~~~~~--~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~- 67 (714) T protein:vir:81 1 MKNETNTM--AT----------KNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQ- 67 (714) T ss_pred CCcccccc--cC----------CCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCC- Confidence 11111100 00 01111222222222222 222234445778999998742111 00111122 Q ss_pred cceeccchHHHHHHHHHhhhhccCcee--cC---chh---hHH----HHHHHHhccChhHHHHHHHHHHhhCCeEEEEee Q lcl|NC_018086. 73 NSKIVHNFPKLLVDTSTAYLAGEPITE--SG---DEK---TIK----AMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHW 140 (511) Q Consensus 73 ~~ri~~n~~k~ivd~~~~~l~g~~~~~--~~---d~~---~~~----~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~ 140 (511) -.+.+|.++.+|+..+++.-.+.+.+ .. ++. ..+ .+..++..|+++...+.+..+++++|.||+-++ T Consensus 68 -p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~ 146 (714) T protein:vir:81 68 -PMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVR 146 (714) T ss_pred -CcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEec Confidence 24779999999999999987765443 21 221 222 355667789999999999999999999998888 Q ss_pred eCC---CCceEEEEEcccceEEEecCCCCC----c-eEEEEEEEEEeec------------------------------- Q lcl|NC_018086. 141 IDR---NKKHRFKAVSPMNCLIAYSADLDE----E-PVAAIYYNTVISD------------------------------- 181 (511) Q Consensus 141 ~~~---~g~~~i~~~~p~~~~~v~d~~~~~----~-~~~~v~~~~~~~~------------------------------- 181 (511) .+. ++.+++..++|.+++ ||+.... . ..++++.|...+. T Consensus 147 ~~~d~~~~~i~i~~v~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~ 224 (714) T protein:vir:81 147 RNSDPFGPEFKVSTVSRNEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQ 224 (714) T ss_pred cccCCCCCCeEEEecchhhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccc Confidence 764 456899999999964 4442211 1 1112222211000 Q ss_pred -----------------------CCcceEEEEEEEcCCcEEEEE--EccCccccccccc--------------------- Q lcl|NC_018086. 182 -----------------------ITGHQIRTYEVYTEDLIYKFS--TDDEREVYREIPE--------------------- 215 (511) Q Consensus 182 -----------------------~~~~~~~~~~~~~~~~i~~~~--~~~~~~~~~~~~~--------------------- 215 (511) .+...+..+++|......... ..++......... T Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~r 304 (714) T protein:vir:81 225 PSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSR 304 (714) T ss_pred ccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccce Confidence 001123334555432221111 1111111000000 Q ss_pred -------ccccccccceeccCCccceEeecCC---cc--cCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCC Q lcl|NC_018086. 216 -------ELEIKDYEVHPNLLQKFPVLEIIAN---EE--RLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDL 283 (511) Q Consensus 216 -------~~~~~~~~~~~~~~g~iPvv~~~n~---~~--g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~ 283 (511) ...+......|.+.+.+|+|+|... .. ..|.+..+++.++.+|...|.+...+ .++..++..|.-. T Consensus 305 v~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~ 382 (714) T protein:vir:81 305 IREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQ 382 (714) T ss_pred EEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccc Confidence 0000001112223344666555432 22 23678889999999999999988765 3555555556544 Q ss_pred Cccchhhhhhh-hCceeeecCCCc--------eeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccC-ccHHHHH Q lcl|NC_018086. 284 SADSDSISNMK-NDRVIVTDEDGM--------VKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA-ASGQALK 353 (511) Q Consensus 284 ~~~~~~~~~~~-~~~~i~~~~~~~--------~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-~Sg~Ai~ 353 (511) ..+........ .++++.+..+.. +++.....-...+...+......|-.+|++.+...|..+| .||+||. T Consensus 383 ~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~ 462 (714) T protein:vir:81 383 LSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAIS 462 (714) T ss_pred ccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHH Confidence 33333333222 234555543211 2222212223555666777788999999987776666554 7999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----------cCCCccc-----------------------ccccee Q lcl|NC_018086. 354 AATQPLENKSAVKESKFRKVLAKRYELVCSYLEF----------MNKAKDL-----------------------KPYEVT 400 (511) Q Consensus 354 ~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~----------~~~~~~~-----------------------~~~~i~ 400 (511) .....-..........+..+.+++.++++.+... .+..... -..+|. T Consensus 463 ~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~ 542 (714) T protein:vir:81 463 NLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIA 542 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEE Confidence 8877655555555566666666666655554321 1110000 011233 Q ss_pred EEeCCCCCcCHHHHHHHHHHHhccCCh-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCC Q lcl|NC_018086. 401 PVFVRNLPQSYAELADMAVKLRDMLPD-------ETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGAST 473 (511) Q Consensus 401 i~f~~~~p~d~~e~a~~~~~~~g~~s~-------et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (511) |.=.+..|....+.++.+..+.+.++. ..+++.+.+ ++.++-.++|++. .+.+... .+ T Consensus 543 i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~-----------~~~~~~~---~~ 607 (714) T protein:vir:81 543 LAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAA-----------LGTPKSP---DE 607 (714) T ss_pred EeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHH-----------cCCCCCc---cc Confidence 333444444445566666666554433 334445544 5555555666442 0010000 00 Q ss_pred ccccccCCCCCCccccccCCCCccccccccCCC-CCCCC Q lcl|NC_018086. 474 AAANKLDKNPANTSTITTTDPVAAKEQEKAIQK-KPKTD 511 (511) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 511 (511) .. +..+.......+. +......+-...+- -.++. T Consensus 608 ~~---~e~q~~~~~~q~~-~~~q~~lq~~~~~a~~~k~e 642 (714) T protein:vir:81 608 MT---PEEQEVAAQQQAL-QQQQAELQMREMAGRVAKLE 642 (714) T ss_pred cc---hhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH Confidence 00 0000000000000 00000000000000 00111 No 88 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.71 E-value=2e-15 Score=101.13 Aligned_cols=474 Identities=11% Similarity=0.048 Sum_probs=221.8 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHH-------HHHHHHHHHHHHHHhcCCCccccc-CCcCccccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEM-------HSRSSSAYGVLYDYYKGNHIAIQS-RTFDDTNKP 72 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-------~~~~~~~~~~~~~yY~G~~~~~~~-~~~~~~~~~ 72 (511) |.=.+.-+ .+ -.+...+.+...++... +.+-+....+..+||.|.+-...- ......+. T Consensus 1 ~~~~~~~~--~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~- 67 (714) T protein:vir:27 1 MKNETNTM--AT----------KNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQ- 67 (714) T ss_pred CCcccccc--cC----------CCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCC- Confidence 11111100 00 01111222222222222 222234445778999998742111 00111122 Q ss_pred cceeccchHHHHHHHHHhhhhccCcee--cC---chh---hHH----HHHHHHhccChhHHHHHHHHHHhhCCeEEEEee Q lcl|NC_018086. 73 NSKIVHNFPKLLVDTSTAYLAGEPITE--SG---DEK---TIK----AMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHW 140 (511) Q Consensus 73 ~~ri~~n~~k~ivd~~~~~l~g~~~~~--~~---d~~---~~~----~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~ 140 (511) -.+.+|.++.+|+..+++.-.+.+.+ .. ++. ..+ .+..++..|+++...+.+..+++++|.||+-++ T Consensus 68 -p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~ 146 (714) T protein:vir:27 68 -PMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVR 146 (714) T ss_pred -CcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEec Confidence 24779999999999999987765443 21 221 222 355667789999999999999999999998888 Q ss_pred eCC---CCceEEEEEcccceEEEecCCCCC----c-eEEEEEEEEEeec------------------------------- Q lcl|NC_018086. 141 IDR---NKKHRFKAVSPMNCLIAYSADLDE----E-PVAAIYYNTVISD------------------------------- 181 (511) Q Consensus 141 ~~~---~g~~~i~~~~p~~~~~v~d~~~~~----~-~~~~v~~~~~~~~------------------------------- 181 (511) .+. ++.+++..++|.+++ ||+.... . ..++++.|...+. T Consensus 147 ~~~d~~~~~i~i~~v~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~ 224 (714) T protein:vir:27 147 RNSDPFGPEFKVSTVSRNEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQ 224 (714) T ss_pred cccCCCCCCeEEEecchhhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccc Confidence 764 456899999999964 4442211 1 1112222211000 Q ss_pred -----------------------CCcceEEEEEEEcCCcEEEEE--EccCccccccccc--------------------- Q lcl|NC_018086. 182 -----------------------ITGHQIRTYEVYTEDLIYKFS--TDDEREVYREIPE--------------------- 215 (511) Q Consensus 182 -----------------------~~~~~~~~~~~~~~~~i~~~~--~~~~~~~~~~~~~--------------------- 215 (511) .+...+..+++|......... ..++......... T Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~r 304 (714) T protein:vir:27 225 PSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSR 304 (714) T ss_pred ccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccce Confidence 001123334555432221111 1111111000000 Q ss_pred -------ccccccccceeccCCccceEeecCC---cc--cCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCC Q lcl|NC_018086. 216 -------ELEIKDYEVHPNLLQKFPVLEIIAN---EE--RLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDL 283 (511) Q Consensus 216 -------~~~~~~~~~~~~~~g~iPvv~~~n~---~~--g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~ 283 (511) ...+......|.+.+.+|+|+|... .. ..|.+..+++.++.+|...|.+...+ .++..++..|.-. T Consensus 305 v~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~ 382 (714) T protein:vir:27 305 IREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQ 382 (714) T ss_pred EEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccc Confidence 0000001112223344666555432 22 23678889999999999999988765 3555555556544 Q ss_pred Cccchhhhhhh-hCceeeecCCCc--------eeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccC-ccHHHHH Q lcl|NC_018086. 284 SADSDSISNMK-NDRVIVTDEDGM--------VKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA-ASGQALK 353 (511) Q Consensus 284 ~~~~~~~~~~~-~~~~i~~~~~~~--------~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-~Sg~Ai~ 353 (511) ..+........ .++++.+..+.. +++.....-...+...+......|-.+|++.+...|..+| .||+||. T Consensus 383 ~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~ 462 (714) T protein:vir:27 383 LSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAIS 462 (714) T ss_pred ccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHH Confidence 33333333222 234555543211 2222212223555666777788999999987776666554 7999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----------cCCCccc-----------------------ccccee Q lcl|NC_018086. 354 AATQPLENKSAVKESKFRKVLAKRYELVCSYLEF----------MNKAKDL-----------------------KPYEVT 400 (511) Q Consensus 354 ~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~----------~~~~~~~-----------------------~~~~i~ 400 (511) .....-..........+..+.+++.++++.+... .+..... -..+|. T Consensus 463 ~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~ 542 (714) T protein:vir:27 463 NLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIA 542 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEE Confidence 8877655555555566666666666655554321 1110000 011233 Q ss_pred EEeCCCCCcCHHHHHHHHHHHhccCCh-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCC Q lcl|NC_018086. 401 PVFVRNLPQSYAELADMAVKLRDMLPD-------ETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGAST 473 (511) Q Consensus 401 i~f~~~~p~d~~e~a~~~~~~~g~~s~-------et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (511) |.=.+..|....+.++.+..+.+.++. ..+++.+.+ ++.++-.++|++. .+.+... .+ T Consensus 543 i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~-----------~~~~~~~---~~ 607 (714) T protein:vir:27 543 LAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAA-----------LGTPKSP---DE 607 (714) T ss_pred EeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHH-----------cCCCCCc---cc Confidence 333444444445566666666554433 334445544 5555555666442 0010000 00 Q ss_pred ccccccCCCCCCccccccCCCCccccccccCCC-CCCCC Q lcl|NC_018086. 474 AAANKLDKNPANTSTITTTDPVAAKEQEKAIQK-KPKTD 511 (511) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 511 (511) .. +..+.......+. +......+-...+- -.++. T Consensus 608 ~~---~e~q~~~~~~q~~-~~~q~~lq~~~~~a~~~k~e 642 (714) T protein:vir:27 608 MT---PEEQEVAAQQQAL-QQQQAELQMREMAGRVAKLE 642 (714) T ss_pred cc---hhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH Confidence 00 0000000000000 00000000000000 00111 No 89 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.71 E-value=2e-15 Score=101.13 Aligned_cols=474 Identities=11% Similarity=0.048 Sum_probs=221.8 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHH-------HHHHHHHHHHHHHHhcCCCccccc-CCcCccccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEM-------HSRSSSAYGVLYDYYKGNHIAIQS-RTFDDTNKP 72 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-------~~~~~~~~~~~~~yY~G~~~~~~~-~~~~~~~~~ 72 (511) |.=.+.-+ .+ -.+...+.+...++... +.+-+....+..+||.|.+-...- ......+. T Consensus 1 ~~~~~~~~--~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~- 67 (714) T protein:vir:10 1 MKNETNTM--AT----------KNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQ- 67 (714) T ss_pred CCcccccc--cC----------CCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCC- Confidence 11111100 00 01111222222222222 222234445778999998742111 00111122 Q ss_pred cceeccchHHHHHHHHHhhhhccCcee--cC---chh---hHH----HHHHHHhccChhHHHHHHHHHHhhCCeEEEEee Q lcl|NC_018086. 73 NSKIVHNFPKLLVDTSTAYLAGEPITE--SG---DEK---TIK----AMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHW 140 (511) Q Consensus 73 ~~ri~~n~~k~ivd~~~~~l~g~~~~~--~~---d~~---~~~----~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~ 140 (511) -.+.+|.++.+|+..+++.-.+.+.+ .. ++. ..+ .+..++..|+++...+.+..+++++|.||+-++ T Consensus 68 -p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~ 146 (714) T protein:vir:10 68 -PMTIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVR 146 (714) T ss_pred -CcEEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEec Confidence 24779999999999999987765443 21 221 222 355667789999999999999999999998888 Q ss_pred eCC---CCceEEEEEcccceEEEecCCCCC----c-eEEEEEEEEEeec------------------------------- Q lcl|NC_018086. 141 IDR---NKKHRFKAVSPMNCLIAYSADLDE----E-PVAAIYYNTVISD------------------------------- 181 (511) Q Consensus 141 ~~~---~g~~~i~~~~p~~~~~v~d~~~~~----~-~~~~v~~~~~~~~------------------------------- 181 (511) .+. ++.+++..++|.+++ ||+.... . ..++++.|...+. T Consensus 147 ~~~d~~~~~i~i~~v~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~ 224 (714) T protein:vir:10 147 RNSDPFGPEFKVSTVSRNEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQ 224 (714) T ss_pred cccCCCCCCeEEEecchhhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhcccccccccccc Confidence 764 456899999999964 4442211 1 1112222211000 Q ss_pred -----------------------CCcceEEEEEEEcCCcEEEEE--EccCccccccccc--------------------- Q lcl|NC_018086. 182 -----------------------ITGHQIRTYEVYTEDLIYKFS--TDDEREVYREIPE--------------------- 215 (511) Q Consensus 182 -----------------------~~~~~~~~~~~~~~~~i~~~~--~~~~~~~~~~~~~--------------------- 215 (511) .+...+..+++|......... ..++......... T Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~r 304 (714) T protein:vir:10 225 PSPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSR 304 (714) T ss_pred ccccccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccce Confidence 001123334555432221111 1111111000000 Q ss_pred -------ccccccccceeccCCccceEeecCC---cc--cCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCC Q lcl|NC_018086. 216 -------ELEIKDYEVHPNLLQKFPVLEIIAN---EE--RLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDL 283 (511) Q Consensus 216 -------~~~~~~~~~~~~~~g~iPvv~~~n~---~~--g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~ 283 (511) ...+......|.+.+.+|+|+|... .. ..|.+..+++.++.+|...|.+...+ .++..++..|.-. T Consensus 305 v~~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~ 382 (714) T protein:vir:10 305 IREAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQ 382 (714) T ss_pred EEEEEEecCcccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccc Confidence 0000001112223344666555432 22 23678889999999999999988765 3555555556544 Q ss_pred Cccchhhhhhh-hCceeeecCCCc--------eeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccC-ccHHHHH Q lcl|NC_018086. 284 SADSDSISNMK-NDRVIVTDEDGM--------VKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA-ASGQALK 353 (511) Q Consensus 284 ~~~~~~~~~~~-~~~~i~~~~~~~--------~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-~Sg~Ai~ 353 (511) ..+........ .++++.+..+.. +++.....-...+...+......|-.+|++.+...|..+| .||+||. T Consensus 383 ~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~ 462 (714) T protein:vir:10 383 LSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAIS 462 (714) T ss_pred ccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHH Confidence 33333333222 234555543211 2222212223555666777788999999987776666554 7999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----------cCCCccc-----------------------ccccee Q lcl|NC_018086. 354 AATQPLENKSAVKESKFRKVLAKRYELVCSYLEF----------MNKAKDL-----------------------KPYEVT 400 (511) Q Consensus 354 ~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~----------~~~~~~~-----------------------~~~~i~ 400 (511) .....-..........+..+.+++.++++.+... .+..... -..+|. T Consensus 463 ~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~ 542 (714) T protein:vir:10 463 NLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIA 542 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEE Confidence 8877655555555566666666666655554321 1110000 011233 Q ss_pred EEeCCCCCcCHHHHHHHHHHHhccCCh-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCC Q lcl|NC_018086. 401 PVFVRNLPQSYAELADMAVKLRDMLPD-------ETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGAST 473 (511) Q Consensus 401 i~f~~~~p~d~~e~a~~~~~~~g~~s~-------et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (511) |.=.+..|....+.++.+..+.+.++. ..+++.+.+ ++.++-.++|++. .+.+... .+ T Consensus 543 i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~-----------~~~~~~~---~~ 607 (714) T protein:vir:10 543 LAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAA-----------LGTPKSP---DE 607 (714) T ss_pred EeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHH-----------cCCCCCc---cc Confidence 333444444445566666666554433 334445544 5555555666442 0010000 00 Q ss_pred ccccccCCCCCCccccccCCCCccccccccCCC-CCCCC Q lcl|NC_018086. 474 AAANKLDKNPANTSTITTTDPVAAKEQEKAIQK-KPKTD 511 (511) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 511 (511) .. +..+.......+. +......+-...+- -.++. T Consensus 608 ~~---~e~q~~~~~~q~~-~~~q~~lq~~~~~a~~~k~e 642 (714) T protein:vir:10 608 MT---PEEQEVAAQQQAL-QQQQAELQMREMAGRVAKLE 642 (714) T ss_pred cc---hhhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH Confidence 00 0000000000000 00000000000000 00111 No 90 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.69 E-value=8.4e-16 Score=103.22 Aligned_cols=477 Identities=13% Similarity=0.056 Sum_probs=224.7 Q ss_pred CCCccchh-hcccccCchhhHhhhhccCCCHHHHHHHHHH---HHHHHHHHHHHHHHhcCCCccccc-CCcCccccccce Q lcl|NC_018086. 1 MAIPNGQI-NAGDIITTNIRRKHFIRRNFDLRELITLAEM---HSRSSSAYGVLYDYYKGNHIAIQS-RTFDDTNKPNSK 75 (511) Q Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~---~~~~~~~~~~~~~yY~G~~~~~~~-~~~~~~~~~~~r 75 (511) |+..|+.- +..+-. ....++.+.+..+... +..-+....+-.+||.|.+-...- ......+. -. T Consensus 1 ~~~~~~~~~~~~~~~---------~~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~--p~ 69 (714) T protein:vir:10 1 MKNEINTTAMKNDHG---------STPRFSQRQLLSLCSDIDSQPLWRDAANKACAYYDGDQLAPEVIQVLKDRGQ--PM 69 (714) T ss_pred CCcCcCcccCCCcch---------hhhhhhHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCC--Cc Confidence 54444321 111100 0011233333333322 222234445778999998742110 11111122 24 Q ss_pred eccchHHHHHHHHHhhhhccCcee--c---Cch---hhHH----HHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC Q lcl|NC_018086. 76 IVHNFPKLLVDTSTAYLAGEPITE--S---GDE---KTIK----AMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR 143 (511) Q Consensus 76 i~~n~~k~ivd~~~~~l~g~~~~~--~---~d~---~~~~----~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~ 143 (511) +.+|.++.+|+..+++.-.+.+.+ . .++ +..+ .+..++..|+.+...+.+..++.++|.||+-++.+. T Consensus 70 ~~~N~i~~~v~~v~g~~~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d~ 149 (714) T protein:vir:10 70 TIHNLIAPTVDGVLGMEAKTRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS 149 (714) T ss_pred EEeccHHHHHHHHHHHHHhCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeeecc Confidence 779999999999999987765543 2 122 1222 355667789999999999999999999999888765 Q ss_pred ---CCceEEEEEcccceEEEecCCCCC----ceE-EEEEEEEE------------------------------------- Q lcl|NC_018086. 144 ---NKKHRFKAVSPMNCLIAYSADLDE----EPV-AAIYYNTV------------------------------------- 178 (511) Q Consensus 144 ---~g~~~i~~~~p~~~~~v~d~~~~~----~~~-~~v~~~~~------------------------------------- 178 (511) ++++++..++|.+++. |+.... ... ++++.|.. T Consensus 150 d~~~~~i~i~~v~p~~v~~--Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~ 227 (714) T protein:vir:10 150 EPFGPEFKVSTVSRNEVFW--DWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSP 227 (714) T ss_pred CCCCCCeEEEecChhheee--ccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhccc Confidence 3679999999998644 432211 000 01111100 Q ss_pred -------------e----ecCCcceEEEEEEEcCCcEEEEEEc--cCccccccccc------------------------ Q lcl|NC_018086. 179 -------------I----SDITGHQIRTYEVYTEDLIYKFSTD--DEREVYREIPE------------------------ 215 (511) Q Consensus 179 -------------~----~~~~~~~~~~~~~~~~~~i~~~~~~--~~~~~~~~~~~------------------------ 215 (511) . ...+.+.+..+++|........... .+..+...... T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~ 307 (714) T protein:vir:10 228 LMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIRE 307 (714) T ss_pred ccccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccceeeEEE Confidence 0 0001123445566654333222211 11111000000 Q ss_pred ----ccccccccceeccCCccceEeecCC---c--ccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCcc Q lcl|NC_018086. 216 ----ELEIKDYEVHPNLLQKFPVLEIIAN---E--ERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSAD 286 (511) Q Consensus 216 ----~~~~~~~~~~~~~~g~iPvv~~~n~---~--~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~ 286 (511) ..........|.+.+.+|+|+|+.. . ...|.+..+++.++.+|...|.+...+ .+...++..|.....+ T Consensus 308 ~~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~gav~~~d 385 (714) T protein:vir:10 308 AWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSD 385 (714) T ss_pred EEEecchhhhcCCCCCCCCceeeEEecceeeeccCccceehhhhhhHHHHHHHHHHHHHHHH--hCCceeeccccccccH Confidence 0000111222334445666655432 1 234778889999999999999988766 3444555555543333 Q ss_pred chhhhhhh-hCceeeecCCC--------ceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccC-ccHHHHHHHH Q lcl|NC_018086. 287 SDSISNMK-NDRVIVTDEDG--------MVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA-ASGQALKAAT 356 (511) Q Consensus 287 ~~~~~~~~-~~~~i~~~~~~--------~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-~Sg~Ai~~~~ 356 (511) ........ .++++.+..+. .++......-...+...+......|-.+|++.+...|..+| .||+||.... T Consensus 386 ~~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAI~~r~ 465 (714) T protein:vir:10 386 NDLMEQLERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLV 465 (714) T ss_pred HHHHHhccCCCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcchhHHHHHHHHH Confidence 33433332 24556554321 12222112223456667888889999999988777766554 6999998877 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----------CCcccc-----------------------ccceeEEe Q lcl|NC_018086. 357 QPLENKSAVKESKFRKVLAKRYELVCSYLEFMN----------KAKDLK-----------------------PYEVTPVF 403 (511) Q Consensus 357 ~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~----------~~~~~~-----------------------~~~i~i~f 403 (511) ..-..........+..+.+++.++++.+....- ...... ..+|.|.= T Consensus 466 ~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i~~ 545 (714) T protein:vir:10 466 EQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAP 545 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccceeeeEEEEEee Confidence 766666666666666676666666655543211 000000 01122222 Q ss_pred CCCCCcCHHHHHHHHHHHhccCCh-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccc Q lcl|NC_018086. 404 VRNLPQSYAELADMAVKLRDMLPD-------ETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAA 476 (511) Q Consensus 404 ~~~~p~d~~e~a~~~~~~~g~~s~-------et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 476 (511) .+..+.-..+.++.+.++.+.++. ..+++.+.+ ++.++-+++|++.. +.+.... . T Consensus 546 ~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~-p~~~ei~~~ir~~~-----------~~~~~~~------~ 607 (714) T protein:vir:10 546 VQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAAL-----------GTPKSPD------E 607 (714) T ss_pred ccCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-cCHHHHHHHHHHHc-----------CCCCCcc------c Confidence 222233234445555555443322 233444433 44455555554320 0000000 0 Q ss_pred cccCCCCCCccccccCCCCccccccccCC-CCCCCC Q lcl|NC_018086. 477 NKLDKNPANTSTITTTDPVAAKEQEKAIQ-KKPKTD 511 (511) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 511 (511) .++..+.........+ .......-...+ +-.++. T Consensus 608 ~~~e~q~~q~~~~~~~-~~q~~l~~~e~~a~~~k~e 642 (714) T protein:vir:10 608 MTPEEQEVAAQQQALQ-QQQAELQMREMAGRVAKLE 642 (714) T ss_pred cCcchhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH Confidence 0000000000000000 000000000000 000111 No 91 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.69 E-value=2.5e-14 Score=95.16 Aligned_cols=426 Identities=8% Similarity=0.022 Sum_probs=232.0 Q ss_pred CCH-HHHHHHHH-HHHHHHHHHHHHHHHhcCCCcccccCCcCccccccc-----------e-----eccchHHHHHHHHH Q lcl|NC_018086. 28 FDL-RELITLAE-MHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNS-----------K-----IVHNFPKLLVDTST 89 (511) Q Consensus 28 ~~~-~~l~~~~~-~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~-----------r-----i~~n~~k~ivd~~~ 89 (511) |+. +.++.++. +...++.......+-|+|-..-..........-++. | .-.+|++-+|+..+ T Consensus 1 mn~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 80 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLE 80 (502) T ss_pred CchHhhHHhhcChHHHHHHHhhHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 221 22233222 222222222333455676432111110000000110 0 12689999999999 Q ss_pred hhhhcc-Cceec---------CchhhHHHHHHHHh---c-------cChhHHHHHHHHHHhhCCeEEEEeeeCCCCc--- Q lcl|NC_018086. 90 AYLAGE-PITES---------GDEKTIKAMQPVFK---E-------NYVTDVNSEEVKLSGIFGHCFEIHWIDRNKK--- 146 (511) Q Consensus 90 ~~l~g~-~~~~~---------~d~~~~~~l~~~~~---~-------n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~--- 146 (511) +.++|. ++++. .+++..+.+...|. + .+|......+.+.....|.+|+....++.+. T Consensus 81 ~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~~ 160 (502) T protein:vir:79 81 ERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTP 160 (502) T ss_pred HhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccCC Confidence 999997 55432 12333444444443 2 3688888889999999999998876654332 Q ss_pred -----eEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEc--CCcEEEEEEccCccccccccccccc Q lcl|NC_018086. 147 -----HRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYT--EDLIYKFSTDDEREVYREIPEELEI 219 (511) Q Consensus 147 -----~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~--~~~i~~~~~~~~~~~~~~~~~~~~~ 219 (511) +++..++|..+-.-+++ ......+|.+ +..|..+-| .++. |.. T Consensus 161 g~~~~l~lq~iepd~l~~~~~~--~~~i~~GVe~-----d~~Gr~~aY-~i~~~hPgd---------------------- 210 (502) T protein:vir:79 161 SAGVHFWLEALEPDFIPMTSDE--SNRLNQGVFV-----DDWGRPEKY-LVYKSRPVS---------------------- 210 (502) T ss_pred CcccceEEEEecchhcCCCCCC--CCeeEeeeEE-----CCCCceEEE-EEeecCCCC---------------------- Confidence 58899999887433322 2334445432 333444322 1111 110 Q ss_pred ccccceeccCCccc---eEeecCC-----cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCc------ Q lcl|NC_018086. 220 KDYEVHPNLLQKFP---VLEIIAN-----EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSA------ 285 (511) Q Consensus 220 ~~~~~~~~~~g~iP---vv~~~n~-----~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~------ 285 (511) .....+..|| |+++... ..|.|.|..++..+..++....-........+.--.+++....+. T Consensus 211 ----~~~~~~~rvpA~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~ 286 (502) T protein:vir:79 211 ----GRQMETKEVDAERMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDGN 286 (502) T ss_pred ----CcccceeEechhheEEeecccCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccccccC Confidence 0011123455 5555432 368999999999988888777666555555555555555322111 Q ss_pred ---cchhhhhhhhCceee-ecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccc-cccccccCccHHHHHHHHHHHH Q lcl|NC_018086. 286 ---DSDSISNMKNDRVIV-TDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPD-LVSKDFTAASGQALKAATQPLE 360 (511) Q Consensus 286 ---~~~~~~~~~~~~~i~-~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~-~~~~~~~~~Sg~Ai~~~~~~l~ 360 (511) .......+..+.++. ++.+.++++.+.+.+...+..+...+...|....++|- ...+.++ .|-.++++.+.... T Consensus 287 ~~~~~~~~~~l~pG~i~~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s-~nySs~R~~~~e~~ 365 (502) T protein:vir:79 287 GSKENERELTIQPGIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYN-GTYSAQRQELVEST 365 (502) T ss_pred CCCCccccccccCCccccccCCCceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc-chHHHHHHHHHHHH Confidence 111112233344554 78888999998888888999999999999999988884 3334454 37888888888888 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCCcc--c--cccceeEEeCCCC--CcCHHHHHHHHHHH--hccCChHHHH Q lcl|NC_018086. 361 NKSAVKESKFRKVLAK-RYELVCSYLEFMNKAKD--L--KPYEVTPVFVRNL--PQSYAELADMAVKL--RDMLPDETII 431 (511) Q Consensus 361 ~k~~~~~~~~~~~l~~-~~~li~~~~~~~~~~~~--~--~~~~i~i~f~~~~--p~d~~e~a~~~~~~--~g~~s~et~~ 431 (511) ..+...+..|...+.+ +++..+...-..+...- + ...-..+.|..+- ..|....+++...+ .|+.|.+.++ T Consensus 366 r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~ 445 (502) T protein:vir:79 366 DGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWV 445 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHH Confidence 8888888777765544 44433333222222110 0 1112466774333 35777666666554 5899999999 Q ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCC Q lcl|NC_018086. 432 NQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQK 506 (511) Q Consensus 432 ~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (511) ...|. |+++.++.+.+|++...+.-+.. .. .+.....+. ..+++.+..++++.+. +| T Consensus 446 a~~G~--D~~~v~~q~a~e~~~~~~~Gl~~-~~----~~~~~~~~~------~~~~~~~e~~~~~~~~-----e~ 502 (502) T protein:vir:79 446 RAGGR--NPDDVKRRRKAEIDENRKLDLVF-DT----DPASDKGGS------SAATKRQEPQHTDDQS-----EE 502 (502) T ss_pred HHcCC--CHHHHHHHHHHHHHHHHHcCCCC-CC----CCCCCCCCC------CCCCCCCCCCCCCCCC-----CC Confidence 99874 89999988888877765542211 00 000000000 0000000011111111 11 No 92 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=99.62 E-value=2.5e-14 Score=95.13 Aligned_cols=463 Identities=11% Similarity=0.077 Sum_probs=203.5 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHH--------HHHH-HHHHHHHHHhcCCCcccccCCcCcccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMH--------SRSS-SAYGVLYDYYKGNHIAIQSRTFDDTNK 71 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~--------~~~~-~~~~~~~~yY~G~~~~~~~~~~~~~~~ 71 (511) || +..+. ..++.+++.+++..+ ...+ ....+..+||.|+.... ..++ T Consensus 1 ~~----k~~~~--------------~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~-----~~~~- 56 (705) T protein:vir:88 1 MA----KRRKI--------------KPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGN-----ERPG- 56 (705) T ss_pred CC----ccccc--------------ccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCc-----ccCC- Confidence 11 11111 123444444433322 2233 23456678999985321 1112 Q ss_pred ccceeccchHHHHHHHHHhhhh----ccC--cee----cCchhhHHHHHH-----HHhccChhHHHHHHHHHHhhCCeEE Q lcl|NC_018086. 72 PNSKIVHNFPKLLVDTSTAYLA----GEP--ITE----SGDEKTIKAMQP-----VFKENYVTDVNSEEVKLSGIFGHCF 136 (511) Q Consensus 72 ~~~ri~~n~~k~ivd~~~~~l~----g~~--~~~----~~d~~~~~~l~~-----~~~~n~~~~~~~~~~~~a~~~G~~~ 136 (511) ..+++.+.....|+.....|. +.+ +.+ .+|.+..+.+.. +...|+....+..++++++++|.|+ T Consensus 57 -~s~~~~~~v~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi 135 (705) T protein:vir:88 57 -KSGIVSRDVQETVDWIMPSLMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGV 135 (705) T ss_pred -CCccccHHHHHHHHHHHHHHHHhhcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeE Confidence 346777777777777777653 322 222 234433333322 2455666677888999999999999 Q ss_pred EEeeeCCC------------------------------------------------CceEEEEEcccceEEEecCC-CCC Q lcl|NC_018086. 137 EIHWIDRN------------------------------------------------KKHRFKAVSPMNCLIAYSAD-LDE 167 (511) Q Consensus 137 ~~v~~~~~------------------------------------------------g~~~i~~~~p~~~~~v~d~~-~~~ 167 (511) +.||++.. |++++..|+|.++++-.+-. ... T Consensus 136 ~kv~we~~~~~~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~~~d 215 (705) T protein:vir:88 136 VKVYVEEVLKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDD 215 (705) T ss_pred EEeccccccchhhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCCccc Confidence 98877331 66888889999876432211 111 Q ss_pred ceEEEEEEEEEeecC-----Cc-------------------------------------------ceEEEEEEEcCCcEE Q lcl|NC_018086. 168 EPVAAIYYNTVISDI-----TG-------------------------------------------HQIRTYEVYTEDLIY 199 (511) Q Consensus 168 ~~~~~v~~~~~~~~~-----~~-------------------------------------------~~~~~~~~~~~~~i~ 199 (511) ....+.+++....+. +. ..++.+++|. T Consensus 216 ~~~~~~~~~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~----- 290 (705) T protein:vir:88 216 ARFLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYT----- 290 (705) T ss_pred CcEEEEEEeccHHHHHhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeee----- Confidence 111222221110000 00 0011111110 Q ss_pred EEEEccCcc-cccccccccccccccceeccCCccceEe-----ecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_018086. 200 KFSTDDERE-VYREIPEELEIKDYEVHPNLLQKFPVLE-----IIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWND 273 (511) Q Consensus 200 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~iPvv~-----~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~ 273 (511) ++...+.+. ...... ..+-.. ....++|.+|++. .+.+.+|.|.+..+.++++.+|..++.+.+++....+ T Consensus 291 ~~d~~~d~~~~~~~~~-~~g~~i--l~~~~~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~ 367 (705) T protein:vir:88 291 LLDVDGDGISELRRIL-YVGDYI--ISNEPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQ 367 (705) T ss_pred EecccCCcceeeEEEE-EeCccc--cccccCCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccC Confidence 000000000 000000 000000 0112456667664 3445679999999999999999999999999999999 Q ss_pred ceeEeecCCCCccchhhhhhhhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc-----cCcc Q lcl|NC_018086. 274 AYLWLQGFDLSADSDSISNMKNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-----TAAS 348 (511) Q Consensus 274 p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-----~~~S 348 (511) |...+........ + .-....++++.+...+.+.++..+.-.......++.+...|...||+++.+-|.. ++.| T Consensus 368 ~~~~~~~g~v~~~-d-~~~~~pg~vv~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~T 445 (705) T protein:vir:88 368 GRSVVLDGQVNLE-D-LLTNEAAGIVRVKSMNSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQA 445 (705) T ss_pred CceeccccccCcc-c-ccccCCCeeEEecCCCccccccCCcCcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhh Confidence 8776532222111 1 1122344566666556667765555556667778999999999999998876632 2467 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCcc----------cc------ccceeEEeCCCCCcCH Q lcl|NC_018086. 349 GQALKAATQPLENKSAVKESKFRK-VLAKRYELVCSYLEFMNKAKD----------LK------PYEVTPVFVRNLPQSY 411 (511) Q Consensus 349 g~Ai~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~~~~~~~~~~~----------~~------~~~i~i~f~~~~p~d~ 411 (511) +.++.........+.....+.|.. ++++++++++.++........ ++ ..++.+.-.. ...+. T Consensus 446 a~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~~-~~~~~ 524 (705) T protein:vir:88 446 AMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGI-GNMNK 524 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhccCCceEEeecc-ccchH Confidence 778877777777777777777753 566666666554433221110 00 1112222111 11122 Q ss_pred HHHHHHHHHHhccCChHHHH-HhCCCCCCHHHHHHHHHHHHHHHHHH-HHhhc-cccccCCCCCCccccccCCCCCCccc Q lcl|NC_018086. 412 AELADMAVKLRDMLPDETII-NQFPWITDARQEVEKADAQRQKRADI-ALQNF-KQTSAVQGASTAAANKLDKNPANTST 488 (511) Q Consensus 412 ~e~a~~~~~~~g~~s~et~~-~~l~~v~d~~~E~~ri~~E~~~~~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (511) .+....+..+..+.+.-... ...+.+. + ..+..+.++-...+.. ....+ .+.... ..........+..... T Consensus 525 eq~~a~l~~ll~~~q~l~~~~~~~~~~~-~-~~~~~~~~el~e~~~~k~~~~~~~~~~~~---e~~~~~~~~~q~e~~~- 598 (705) T protein:vir:88 525 DQQMLHLMRIWEMAQAVVGGGGLGVLVS-E-QNLYNILKEVTENAGYKDPDRFWTNPNSP---EALQAKAIREQKEAQP- 598 (705) T ss_pred HHHHHHHHHHHHHHHHhhcccchhhhcC-h-HHHHHHHHHHHHhhhhhhHHHHhhhhhhH---HHHHHHHhhhhhhhhH- Confidence 22222222111100000000 0001111 0 0011111110000000 00000 000000 0000000000000000 Q ss_pred cccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 489 ITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) .....+..+...+...+ T Consensus 599 ------~~~~~~~q~e~~k~q~e 615 (705) T protein:vir:88 599 ------KPEDIKAQADAQRAQSD 615 (705) T ss_pred ------HHHHHHHHHHHHHHHHH Confidence 00000000000000111 No 93 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=99.60 E-value=3.8e-14 Score=94.18 Aligned_cols=479 Identities=10% Similarity=0.074 Sum_probs=206.0 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHH----HHHH----------HHHHHhcCCCcccccCCc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSS----SAYG----------VLYDYYKGNHIAIQSRTF 66 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~----~~~~----------~~~~yY~G~~~~~~~~~~ 66 (511) |-|++.=. ..++.-+.++.-=...|.++..++...+ +++. ++.+||.|... .... T Consensus 1 ~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~---~~~~ 69 (651) T protein:vir:80 1 MKLATTTT--------DKNRQTYDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVL---RSVG 69 (651) T ss_pred Cccccccc--------chhhhhhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccc---cccC Confidence 33333211 1111111111100112333333332221 2232 33456655431 1111 Q ss_pred CccccccceeccchHHHHHHHHHhhhhcc----C--cee--cCchhh----HHHHHHHH----hccChhHHHHHHHHHHh Q lcl|NC_018086. 67 DDTNKPNSKIVHNFPKLLVDTSTAYLAGE----P--ITE--SGDEKT----IKAMQPVF----KENYVTDVNSEEVKLSG 130 (511) Q Consensus 67 ~~~~~~~~ri~~n~~k~ivd~~~~~l~g~----~--~~~--~~d~~~----~~~l~~~~----~~n~~~~~~~~~~~~a~ 130 (511) .+...-.++++.+..+..|+.....|+.. + +.+ ..+.+. .+.+..++ ..++|......+..+++ T Consensus 70 ~~~~~~rs~~~~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l 149 (651) T protein:vir:80 70 DVNADWRHKITTGKAFEAIETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLL 149 (651) T ss_pred CCCCCCCccccChhHHHHHHHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhc Confidence 11111234788999999999877776542 1 111 112221 22244443 46789999989999999 Q ss_pred hCCeEEEEeeeCC-------------------------------CCceEEEEEcccceEEEecCCCCC--ceEEEEEEEE Q lcl|NC_018086. 131 IFGHCFEIHWIDR-------------------------------NKKHRFKAVSPMNCLIAYSADLDE--EPVAAIYYNT 177 (511) Q Consensus 131 ~~G~~~~~v~~~~-------------------------------~g~~~i~~~~p~~~~~v~d~~~~~--~~~~~v~~~~ 177 (511) ++|.|++.|+++. .|.|++..++|.++++ |+.... ...+.++.+. T Consensus 150 ~~G~~i~kv~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~--dp~a~~~~d~~~v~~~~~ 227 (651) T protein:vir:80 150 ITGNSVLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFY--DPNVTDPNRGAFIRKLTK 227 (651) T ss_pred ccCceEEEEeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeee--cCCCcCccccceeeeeee Confidence 9999999887642 1567899999999865 443221 1112222221 Q ss_pred EeecC------------------C---------------------------cceEEEEEEEcCCcEEEEEEccCcc-ccc Q lcl|NC_018086. 178 VISDI------------------T---------------------------GHQIRTYEVYTEDLIYKFSTDDERE-VYR 211 (511) Q Consensus 178 ~~~~~------------------~---------------------------~~~~~~~~~~~~~~i~~~~~~~~~~-~~~ 211 (511) ...+. . ...+..+++|. ++..++.+. .+. T Consensus 228 t~~~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~-----~~d~e~~~~~~~~ 302 (651) T protein:vir:80 228 TKADILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWG-----DIHLENKTYHDVV 302 (651) T ss_pred eHHHHHHHHhcccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEE-----EeeccCCceEEEE Confidence 11000 0 00111122221 111111110 000 Q ss_pred ccccccccccccceecc-CCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCc Q lcl|NC_018086. 212 EIPEELEIKDYEVHPNL-LQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSA 285 (511) Q Consensus 212 ~~~~~~~~~~~~~~~~~-~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~ 285 (511) ...... .......++ +..+|++.++ ...+|+|..+.+.+.+..+|.+...+.+.+...+.|++.+....... T Consensus 303 v~~~g~--~il~~~~~~~~~~~Pf~~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~ 380 (651) T protein:vir:80 303 VTIMGN--EVLRFEQNPYWCGRPFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQ 380 (651) T ss_pred EEEcCc--EEecccccCCCCCCCeeeecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCcccc Confidence 000000 000111122 1234665443 34589999999999999999999999999999999998765322221 Q ss_pred cchhhhhhhhCceeeecCCCceeeeecC-CCHHHHHHHHHHHHHHHHHHhCcccccccc----ccCccHHHHHHHHHHHH Q lcl|NC_018086. 286 DSDSISNMKNDRVIVTDEDGMVKFITKD-VNDKHIENIKNRAKLDIFSLSQTPDLVSKD----FTAASGQALKAATQPLE 360 (511) Q Consensus 286 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~----~~~~Sg~Ai~~~~~~l~ 360 (511) .+.. ....++++.+...+++.++... .+.......++.+...+...++++.+..+. .++.|+.++......+. T Consensus 381 -~~~l-~~~pg~vi~~~~~~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~ 458 (651) T protein:vir:80 381 -PEDV-YTEPGKVFLVSDHGDLQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGG 458 (651) T ss_pred -HHHh-hcCCCceEEecCCCCceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHH Confidence 1111 1234567777777778777653 344566677899999999999998766543 23457777777776666 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCc----------------cccccceeEEeCCCCCcCH---HHHHHHHHH Q lcl|NC_018086. 361 NKSAVKESKFRK-VLAKRYELVCSYLEFMNKAK----------------DLKPYEVTPVFVRNLPQSY---AELADMAVK 420 (511) Q Consensus 361 ~k~~~~~~~~~~-~l~~~~~li~~~~~~~~~~~----------------~~~~~~i~i~f~~~~p~d~---~e~a~~~~~ 420 (511) ......-+.|.. ++..+++.++.++....... .+...++++.+.- .+... .+....+.+ T Consensus 459 ~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~i-v~~g~~~~~~r~~~~~~ 537 (651) T protein:vir:80 459 NRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRL-VPIGSDHVIERKQYIED 537 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeee-eeccHHHHHHHHHHHHH Confidence 666666666655 45555544444433221100 0111123333321 12222 222222222 Q ss_pred Hh------ccCC---h-----HH---HHHhCCCCCCHHHHHHH-----HHHHHHHHHHHHHhhccccccCCCCCCccccc Q lcl|NC_018086. 421 LR------DMLP---D-----ET---IINQFPWITDARQEVEK-----ADAQRQKRADIALQNFKQTSAVQGASTAAANK 478 (511) Q Consensus 421 ~~------g~~s---~-----et---~~~~l~~v~d~~~E~~r-----i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (511) +. +..+ . .. ++..+| +.+++.=+.. ....+++.+..+.....+..... ... . T Consensus 538 l~~~~q~~~~~p~~~~~~~~~~~~~~l~~~~g-~~~~~~~l~~~~q~~~~~~~~~~~~q~~~~~~~a~~~~----~~~-~ 611 (651) T protein:vir:80 538 RLTFIQAVAQVPEMGQLVDYKRILVDLLQHWG-FEEPEAYLKQQDQQAPANPQEALLSQAKDVGGQAMSNM----LQN-Q 611 (651) T ss_pred HHHHHHhhccCCccchhhhHHHHHHHHHHHcC-CCCcHHhcCCCccchhhhhhHHHHhhHHHHHHHHHHHH----HHH-H Confidence 21 1111 0 01 111222 2222210000 00000000000000000000000 000 0 Q ss_pred cCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 479 LDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) ....++.....+.....+.-...+.+..|. T Consensus 612 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 641 (651) T protein:vir:80 612 ---LQADGGTQMMSEMYGTPNADQMQQELMATT 641 (651) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000000000000000000011122222 No 94 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.56 E-value=7.8e-13 Score=86.97 Aligned_cols=443 Identities=10% Similarity=0.024 Sum_probs=229.9 Q ss_pred CCHHHHHHHHHHHHHHHH--HHHHHHHHhcCCC--ccc---ccCCcCccc---cccc-----e-----eccchHHHHHHH Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSS--AYGVLYDYYKGNH--IAI---QSRTFDDTN---KPNS-----K-----IVHNFPKLLVDT 87 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~--~~~~~~~yY~G~~--~~~---~~~~~~~~~---~~~~-----r-----i~~n~~k~ivd~ 87 (511) |. +...+.+.......+ +...-...|+|-. .-. +........ ..+. | .-.+|++-+|+. T Consensus 1 m~-~~~~r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~ 79 (553) T protein:vir:63 1 MT-KVTVRKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGY 79 (553) T ss_pred Cc-chhhhhhcccccccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH Confidence 11 111222221111111 1111123455421 100 011110000 0111 1 126899999999 Q ss_pred HHhhhhccCceecC----------c----hhhHHHHH---HHHhcc-----------ChhHHHHHHHHHHhhCCeEEEEe Q lcl|NC_018086. 88 STAYLAGEPITESG----------D----EKTIKAMQ---PVFKEN-----------YVTDVNSEEVKLSGIFGHCFEIH 139 (511) Q Consensus 88 ~~~~l~g~~~~~~~----------d----~~~~~~l~---~~~~~n-----------~~~~~~~~~~~~a~~~G~~~~~v 139 (511) .+..++|.+++... + ++..+.+. +.|-++ +|......+++.....|.+|+.. T Consensus 80 ~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~ 159 (553) T protein:vir:63 80 QRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATA 159 (553) T ss_pred HHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEEe Confidence 99999999988632 1 12223333 334321 47777888999999999999876 Q ss_pred eeCCC-C---ceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEE-EEEEcCCcEEEEEEccCcccccccc Q lcl|NC_018086. 140 WIDRN-K---KHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRT-YEVYTEDLIYKFSTDDEREVYREIP 214 (511) Q Consensus 140 ~~~~~-g---~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~-~~~~~~~~i~~~~~~~~~~~~~~~~ 214 (511) ...+. | .+++..++|..+..-++.........+|.+ +..|..+-| +--..|+..+....... T Consensus 160 ~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~-----d~~Gr~vaY~i~~~hPgd~~~~~~~~~-------- 226 (553) T protein:vir:63 160 EWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQY-----DKRGRPQGYWIQVAHPGDLYQMAPDMY-------- 226 (553) T ss_pred eeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEE-----CCCCceEEEEeeccCCCcccccccccc-------- Confidence 54432 2 368899999887665554444445555532 334544422 21122332221110000 Q ss_pred cccccccccceeccCCccc---eEeecC-----CcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCc- Q lcl|NC_018086. 215 EELEIKDYEVHPNLLQKFP---VLEIIA-----NEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSA- 285 (511) Q Consensus 215 ~~~~~~~~~~~~~~~g~iP---vv~~~n-----~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~- 285 (511) .|.-+. .+..|| |+++.. -..|.|.|..++..+..++.............+.--.+++....+. T Consensus 227 ---~~~r~~----~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~~~~ 299 (553) T protein:vir:63 227 ---KWKFVQ----QSKPWGRRQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELPPEF 299 (553) T ss_pred ---ceeeec----cccccChhHheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCChhh Confidence 000000 011122 333322 2468999999988888888776655555554444444444211100 Q ss_pred ----------cc-------------------hhhhhhhhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCc Q lcl|NC_018086. 286 ----------DS-------------------DSISNMKNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQT 336 (511) Q Consensus 286 ----------~~-------------------~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~ 336 (511) .. .....+..+.+..++.+.++++.+.+.+...+..+...+...|....++ T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi 379 (553) T protein:vir:63 300 IHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGM 379 (553) T ss_pred hhhhcccccccccccccccccccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCC Confidence 00 0011234456677888889999988888889999999999999998888 Q ss_pred cc-cccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCCc--cc-----------cccceeE Q lcl|NC_018086. 337 PD-LVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAK-RYELVCSYLEFMNKAK--DL-----------KPYEVTP 401 (511) Q Consensus 337 p~-~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~-~~~li~~~~~~~~~~~--~~-----------~~~~i~i 401 (511) |- ...+.++++|-.+.++.+..........+..|...+.+ +++..+...-..+... .. ...-+.+ T Consensus 380 ~Ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~ 459 (553) T protein:vir:63 380 SYEEFTRDFSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKC 459 (553) T ss_pred CHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhce Confidence 84 34466777888888888888888888888777665554 4444333222222111 00 0011346 Q ss_pred EeCCCCC--cCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCcccc Q lcl|NC_018086. 402 VFVRNLP--QSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAAN 477 (511) Q Consensus 402 ~f~~~~p--~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (511) .|..+-. .|....+++.... .|+.|.+.++...| .|+++.++.+.+|.+...+.-+.. ...+....... T Consensus 460 ~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl~~-----~~~~~~~~~~~ 532 (553) T protein:vir:63 460 EWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIARLG--GDFRKSFAQRAREDALLKKYGLTF-----NLSAKRSLGDG 532 (553) T ss_pred eeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhC--CCHHHHHHHHHHHHHHHHHcCCCC-----CCCCccccCCC Confidence 6744443 4666666666554 58999999999997 489998999888876655532210 00000000000 Q ss_pred ccCCCCCCccccccCCCCcccccc Q lcl|NC_018086. 478 KLDKNPANTSTITTTDPVAAKEQE 501 (511) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~ 501 (511) .....++.+++ ..+...+..+ T Consensus 533 ~~~~~~~~~~~---~~~~~~~~~e 553 (553) T protein:vir:63 533 RDAATGIAEDP---AAAQTSQQGE 553 (553) T ss_pred cccCCCCCCCC---CCCCcccccC Confidence 00000000000 0001111111 No 95 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.55 E-value=1.6e-12 Score=85.28 Aligned_cols=434 Identities=7% Similarity=-0.040 Sum_probs=238.2 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHH-HHHHHHHHHHHHHhcCCCcccccC----Cc--Ccccc-- Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMH-SRSSSAYGVLYDYYKGNHIAIQSR----TF--DDTNK-- 71 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~~~~yY~G~~~~~~~~----~~--~~~~~-- 71 (511) |+= .......+.+.+.-- ..+........+.|+|-..-.... .+ ..... T Consensus 1 ~~r----------------------~~~~~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i 58 (505) T protein:vir:96 1 MKR----------------------AEKKPSLAQRMVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEI 58 (505) T ss_pred CCC----------------------CccccchhhcccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHH Confidence 221 112222222222211 112222233345677643211100 00 00000 Q ss_pred -ccc-----e-----eccchHHHHHHHHHhhhhc-cCceecC---------chhhHHHHHHHHh---c---------cCh Q lcl|NC_018086. 72 -PNS-----K-----IVHNFPKLLVDTSTAYLAG-EPITESG---------DEKTIKAMQPVFK---E---------NYV 118 (511) Q Consensus 72 -~~~-----r-----i~~n~~k~ivd~~~~~l~g-~~~~~~~---------d~~~~~~l~~~~~---~---------n~~ 118 (511) .+. | .-.+|++-+|+..+..++| .+++... +++..+.+...|. . .+| T Consensus 59 ~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f 138 (505) T protein:vir:96 59 YADLASLVQRAREQSINNPYAKRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHF 138 (505) T ss_pred HHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCH Confidence 000 1 1368999999999999999 6876632 4444454444433 2 136 Q ss_pred hHHHHHHHHHHhhCCeEEEEeeeCCCC--ceEEEEEcccceEEEecC--CCCCceEEEEEEEEEeecCCcceEEE-EEEE Q lcl|NC_018086. 119 TDVNSEEVKLSGIFGHCFEIHWIDRNK--KHRFKAVSPMNCLIAYSA--DLDEEPVAAIYYNTVISDITGHQIRT-YEVY 193 (511) Q Consensus 119 ~~~~~~~~~~a~~~G~~~~~v~~~~~g--~~~i~~~~p~~~~~v~d~--~~~~~~~~~v~~~~~~~~~~~~~~~~-~~~~ 193 (511) ......+.+.....|.+|+.......+ .+++..++|..+-.-++. ........+|. .+..|..+-| +--- T Consensus 139 ~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe-----~d~~Gr~~aY~i~~~ 213 (505) T protein:vir:96 139 VTLLHLWMETLARDGEVLVREHRGYPNKWGYALQILECDRLDLNYNADLQNGNRIRMSIE-----LDAWERPVAYHLLVN 213 (505) T ss_pred HHHHHHHHHHHhhCCceEEEEeecCCCCcceEEEEechhhcCCCCCcccCCcCeEEeceE-----ECCCCceEEEEEeec Confidence 677888999999999999877654433 268899999987443322 11222344443 2333444322 1111 Q ss_pred cCCcEEEEEEccCcccccccccccccccccceeccCCccc---eEeecC-----CcccCchhHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 194 TEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFP---VLEIIA-----NEERLGDFEAQLSLIDAYNLAVSDSV 265 (511) Q Consensus 194 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~n-----~~~g~s~~~~v~~l~d~~~~~~s~~~ 265 (511) .|+..+.. . ......+.+|| |+++.. -..|.|.|..++..+..++....... T Consensus 214 hPgd~~~~-~-------------------~~~~~~~~rvpa~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael 273 (505) T protein:vir:96 214 HPGDNSYC-Y-------------------HYAGQTYERVPADEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEM 273 (505) T ss_pred CCCccccc-c-------------------ccccccccccCHhHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHH Confidence 12211100 0 00011233455 344332 23689999999998888887776666 Q ss_pred HHHHHhcCceeEeecCCCCc-------cchhhhhhhhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccc Q lcl|NC_018086. 266 NDIAYWNDAYLWLQGFDLSA-------DSDSISNMKNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPD 338 (511) Q Consensus 266 ~~~~~~~~p~l~~~G~~~~~-------~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 338 (511) ......+.--.+++...... .+.....+..+.+..++.+.++++++.+.+...+..+...+...|....++|- T Consensus 274 ~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~y 353 (505) T protein:vir:96 274 IAAELGAKKVGFYEQDPEAYDQPPEDDQGEIVEEVEAGTYQLLPYGIRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAY 353 (505) T ss_pred HHHHHhhhheeeeecCCccCCCccccccCccccccCCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCH Confidence 66665555545566432211 11122234455677788889999998888889999999999999999888884 Q ss_pred c-ccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCCc--cccc-cceeEEeCCCCC--cCH Q lcl|NC_018086. 339 L-VSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAK-RYELVCSYLEFMNKAK--DLKP-YEVTPVFVRNLP--QSY 411 (511) Q Consensus 339 ~-~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~-~~~li~~~~~~~~~~~--~~~~-~~i~i~f~~~~p--~d~ 411 (511) - ..+.++++|-.+.++.+......++..+..|...+.+ +++..+...-..+... ..+. .-..+.|..+-- .|. T Consensus 354 e~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP 433 (505) T protein:vir:96 354 NRLAHDLEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDP 433 (505) T ss_pred HHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccCh Confidence 3 3356777888899999888888888888888764444 4554444332222211 1111 113566743332 477 Q ss_pred HHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCcccc Q lcl|NC_018086. 412 AELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTI 489 (511) Q Consensus 412 ~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (511) ...+++.... .|+.|.+.++...| .|+++.++.+..|++...+.-+... .+.........++.++.+ T Consensus 434 ~Ke~~a~~~~i~~G~~t~~~~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl~~~---~~~~~~~~~~~~~~~~~~------ 502 (505) T protein:vir:96 434 AKDSKAHSESIKNRTRSRSSIIRAAG--DDPEDVFDEIAWEEQLMRDKGVNPT---PPEQESKDATTDEEDDSA------ 502 (505) T ss_pred HHHHHHHHHHHHcCCCCHHHHHHHcC--CCHHHHHHHHHHHHHHHHHcCCCCC---CCCCCCCCCCCCCCCCCC------ Confidence 7667666554 58999999999987 4899999999888777655322100 000000000000000000 Q ss_pred ccCCCCccccccccCCC Q lcl|NC_018086. 490 TTTDPVAAKEQEKAIQK 506 (511) Q Consensus 490 ~~~~~~~~~~~~~~~~~ 506 (511) ..+ T Consensus 503 --------------~d~ 505 (505) T protein:vir:96 503 --------------SDD 505 (505) T ss_pred --------------CCC Confidence 000 No 96 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.55 E-value=5.2e-13 Score=87.95 Aligned_cols=399 Identities=10% Similarity=0.059 Sum_probs=200.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcccccCC--cCc-cccccce-----eccchHHHHHHHHHhhhhccCceecCch-- Q lcl|NC_018086. 34 ITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRT--FDD-TNKPNSK-----IVHNFPKLLVDTSTAYLAGEPITESGDE-- 103 (511) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~--~~~-~~~~~~r-----i~~n~~k~ivd~~~~~l~g~~~~~~~d~-- 103 (511) ++++ +-+...-.|-........ ... ....... -.+.+++.+|+..+.-++.+++.+.+++ T Consensus 1 ~~~~----------D~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~d~~ 70 (437) T protein:vir:52 1 MKFF----------DGIKSLALKLGSKQEQTYYSPSLSLTDDLVQLEALWRDNWIANKVCIKRPEDMVRNWREIYSNDLN 70 (437) T ss_pred Cchh----------hhhHhHHhcCCCccccceeecCccccccHHHHHHHHHhCchhhHHhhcchHHhhcCCceEecCCCC Confidence 1111 111111111110000000 000 0000000 1357889999999999999999997643 Q ss_pred -hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCC---------CceE-EEEEcccceEEE-ecCCCCCceEE Q lcl|NC_018086. 104 -KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRN---------KKHR-FKAVSPMNCLIA-YSADLDEEPVA 171 (511) Q Consensus 104 -~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~---------g~~~-i~~~~p~~~~~v-~d~~~~~~~~~ 171 (511) +..+.++..|.+=++...+.++.+++-.||.|++++..+.. |.++ +.+++|..+.+. +.+.+...+-+ T Consensus 71 ~~~~~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s~~f 150 (437) T protein:vir:52 71 SKQLDLFTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLSPNF 150 (437) T ss_pred HHHHHHHHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhcccccccccccccccc Confidence 33456888888888899999999999999999998876542 2222 556666665432 11111111111 Q ss_pred -EEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHH Q lcl|NC_018086. 172 -AIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQ 250 (511) Q Consensus 172 -~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v 250 (511) -..+|.+.. +.. ...+.+.++++|... .+| ...++-.|.|.++.+ T Consensus 151 g~p~~y~v~~---~~~---~~~iH~SRii~~~~~--------------------------~~~--~~~~~~~G~s~le~~ 196 (437) T protein:vir:52 151 GRYSEYSILG---GSQ---SITVHHSRLIILNAN--------------------------DAP--LSDNDIWGVSDLEKI 196 (437) T ss_pred CcceEEEEec---CCc---ceeEccceeEEecCc--------------------------cCC--CccccccCCchHHHH Confidence 111222210 000 011233334443211 012 112344689999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeEeecCC--CCc-cchhhh-------hhh-hCceeeecCCCceeeeecCCCHHHH Q lcl|NC_018086. 251 LSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFD--LSA-DSDSIS-------NMK-NDRVIVTDEDGMVKFITKDVNDKHI 319 (511) Q Consensus 251 ~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~--~~~-~~~~~~-------~~~-~~~~i~~~~~~~~~~~~~~~~~~~~ 319 (511) .+-+.+++++.-.....+..+..+.+.+.|.. ... .++... .++ ..+++.++.+.+.+. .+.+...+ T Consensus 197 ~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~e~--~~~~~sgl 274 (437) T protein:vir:52 197 IDVLKRFDSASVNVGDLIFESKIDIFKIAGLSDKIAAGMENEVASVISAVQEIKSATNSLLLDAENEYDR--KELTFTGL 274 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHcCCCceecchHHHHhcCCcHHHHHHHHHHHHHhcCCCceEEEcCCcceEE--EecCcCCH Confidence 99999999988888887777777777776631 111 111111 112 245677766655544 44566678 Q ss_pred HHHHHHHHHHHHHHhCcccccc-ccc-cC-ccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcCCCcccc Q lcl|NC_018086. 320 ENIKNRAKLDIFSLSQTPDLVS-KDF-TA-ASGQALKAATQPLENKSAVKE-SKFRKVLAKRYELVCSYLEFMNKAKDLK 395 (511) Q Consensus 320 ~~~~~~l~~~i~~~s~~p~~~~-~~~-~~-~Sg~Ai~~~~~~l~~k~~~~~-~~~~~~l~~~~~li~~~~~~~~~~~~~~ 395 (511) ...++.....|...+++|-.-. +.. ++ +||..=... ....++.++ ..+...+++++++|+.-. .+.. . T Consensus 275 ~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D~~~---yyd~i~~~Qe~~l~p~le~l~~~i~~~~--~g~~---~ 346 (437) T protein:vir:52 275 KDLLTEFRNAVAGAADMPVTILFGQSVSGLASGDEDIQN---YHEAIRRLQETRLRPIFEIIDPLICNEL--FGGL---P 346 (437) T ss_pred HHHHHHHHHHHHHHhcCchhhhcCcCcccccccHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHh--cCCC---C Confidence 8888999999999999996443 322 11 355543333 334444444 567888888888765421 1211 1 Q ss_pred ccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCcc Q lcl|NC_018086. 396 PYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAA 475 (511) Q Consensus 396 ~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 475 (511) .++++.|++-...++++.|+...+.+...+. ++ ..+.+ ++++..+++++ ...++............ T Consensus 347 -~~~~~~f~pL~~~s~kekae~~~~~a~a~~~--~~-~~g~i-~~~e~r~~L~~---------~g~~~~i~~~~~~~~~~ 412 (437) T protein:vir:52 347 -ADWWFEFVPLTTVKQEQQINMLNTFATAANT--LI-QNGVL-NEYQIANELRE---------SGLFANISAEHIEELKN 412 (437) T ss_pred -CcceEEeCCcCCcCHHHHHHHHHHHHHHHHH--HH-hcCCC-CHHHHHHHHHh---------cCCCCCCCccccccccC Confidence 2588999999999999988876554322111 11 12222 23222222211 01111111100000111 Q ss_pred ccccCCCCCCccccccCCCCccccc Q lcl|NC_018086. 476 ANKLDKNPANTSTITTTDPVAAKEQ 500 (511) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (511) .++..++........+++++.++++ T Consensus 413 ~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 413 ADEFAGNFEEPEKMEGAQVQNSEDQ 437 (437) T ss_pred CCCCCCccCCCCCCCCCCCCCCCCC Confidence 1111111111111122223333333 No 97 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.52 E-value=1.9e-13 Score=90.32 Aligned_cols=440 Identities=11% Similarity=0.052 Sum_probs=209.3 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccch Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNF 80 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~ 80 (511) -+++|.+-..+-.-..+.-.--+-..... .....+....... +-..+..||....-+-+. ...-+ -.+.+ T Consensus 46 ~~~~~~~~~~~~~~~~~~~~~~~a~d~~~-~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~------l~a~Y-~~~~l 115 (537) T protein:vir:10 46 TMMAIRDHAIAMMPKVDGSHPDMAMDGLD-VEGGTFSAYANPN--LSEGLVLWYAQQAFIGHQ------MCALI-ATHWL 115 (537) T ss_pred ccCCCCCccCcccccccccccchhccccc-cchhhhhhhcccc--ccchhhhhccccCCccHH------HHHHH-HhCch Confidence 23344332221111111000001111110 0011111100000 001122222222111000 00001 12578 Q ss_pred HHHHHHHHHhhhhccCceecCch------hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC-CCc------- Q lcl|NC_018086. 81 PKLLVDTSTAYLAGEPITESGDE------KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR-NKK------- 146 (511) Q Consensus 81 ~k~ivd~~~~~l~g~~~~~~~d~------~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~-~g~------- 146 (511) ++.+|+..+.-++-+++.+.+++ +..+.+...|.+-++...+.++.+.+..||.+++++..+. +++ T Consensus 116 ~r~iVd~~A~d~~r~~~~i~~~~~~~~~~~~~~~l~~~~~~l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~ 195 (537) T protein:vir:10 116 VNKACSQMPRDAMRKGYKIISDDGNELDPKDAKFIDRYDRAFNIKKHAIQFVRKGRIFGIRIALFKVDSPDPYYYEKPFN 195 (537) T ss_pred hhhhhhhhhHHhhcCCceeecCCcccccHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEeecCcCCcccccccc Confidence 89999999999999999886543 3446677788888899999999999999999988876532 221 Q ss_pred ---------eEEEEEcccceEEEecC---CCCCceEEE-EEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccc Q lcl|NC_018086. 147 ---------HRFKAVSPMNCLIAYSA---DLDEEPVAA-IYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREI 213 (511) Q Consensus 147 ---------~~i~~~~p~~~~~v~d~---~~~~~~~~~-v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 213 (511) ..+.+++|..+.+...+ .+...+-++ ..+|.+ .++ .+.+.++++|.... T Consensus 196 ~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v----~g~------~iH~SRli~f~g~~-------- 257 (537) T protein:vir:10 196 IDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLI----NGK------KYHRSHLAIYINDE-------- 257 (537) T ss_pred cccccccceeEEEEechhhcccccchhhhccCCccccCCceeeee----cCe------EecceeEEEecCCC-------- Confidence 12445566555442111 110100110 011111 111 12334444432100 Q ss_pred ccccccccccceeccCCccceEeec-CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhh- Q lcl|NC_018086. 214 PEELEIKDYEVHPNLLQKFPVLEII-ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSIS- 291 (511) Q Consensus 214 ~~~~~~~~~~~~~~~~g~iPvv~~~-n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~- 291 (511) +|-..-+ ++-.|+|.++.+.+-+..++++.-.....+..+..+++.+.|...-.+.+... T Consensus 258 ------------------~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~~~~ 319 (537) T protein:vir:10 258 ------------------VVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQFDE 319 (537) T ss_pred ------------------CchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHHHHH Confidence 1111101 12258999999999999999999888888888888888777643211111111 Q ss_pred h---h----hhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccc-cccc--c-CccHHHHHHHHHHHH Q lcl|NC_018086. 292 N---M----KNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLV-SKDF--T-AASGQALKAATQPLE 360 (511) Q Consensus 292 ~---~----~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~-~~~~--~-~~Sg~Ai~~~~~~l~ 360 (511) . + ...+++.+..+.+ ++.....+...+...++.....|...+++|-.- ++.. | ++||+.=...|... T Consensus 320 r~~~~~~~r~n~g~~~id~e~e-~~e~~~~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yyd~- 397 (537) T protein:vir:10 320 TMSWWTATRDNYQVRVVDKDNE-DVVQIDTTLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYHEE- 397 (537) T ss_pred HHHHHHhhcCCcceeEecCCCc-eeEEEeccCCCHHHHHHHHHHHHHhhhCCCceeeccCCccccccchhHHHHHHHHH- Confidence 1 1 1234666665422 344445666778888999999999999999653 4432 1 46677544444443 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH---------hccCChHHHH Q lcl|NC_018086. 361 NKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL---------RDMLPDETII 431 (511) Q Consensus 361 ~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~---------~g~~s~et~~ 431 (511) ++.+|..+...+++++++|+... .+ . ..++++.|++-...++++.|+...+. .|+++...++ T Consensus 398 --I~~~Qe~l~p~l~~l~~ll~~~~--~~--~---~~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr 468 (537) T protein:vir:10 398 --CESTQDDMRPLIDRHHQLVCRSH--LR--K---RIRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVN 468 (537) T ss_pred --HHHHHHHHHHHHHHHHHHHHHhc--CC--C---CcceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHH Confidence 34444457889999888876432 11 1 23688999999999999988754332 3678887777 Q ss_pred HhCCCCCCH-HHHH-HHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCCC Q lcl|NC_018086. 432 NQFPWITDA-RQEV-EKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPK 509 (511) Q Consensus 432 ~~l~~v~d~-~~E~-~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (511) ..|....+. ...+ ..+..|..+.. .........+..+...+..+..++.....++.++..+ -++ T Consensus 469 ~~L~~~~~~g~~~l~~~~~~ed~e~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~a~ 534 (537) T protein:vir:10 469 EYLRMDPTLGFTSITPAMRPTDAEDI-----DVDDEGKPVRIIEDQPAPSEMFGATSSGESANDPRDS---------GAA 534 (537) T ss_pred HHHhccCccccccccCCCChhhhhcc-----cCCccCCcCCCCCCCCCccccCCCCccccccCCCccC---------ccc Confidence 766432110 0000 00000000000 0000000001111111111111111111111111111 122 Q ss_pred CC Q lcl|NC_018086. 510 TD 511 (511) Q Consensus 510 ~~ 511 (511) |+ T Consensus 535 ~~ 536 (537) T protein:vir:10 535 FE 536 (537) T ss_pred cC Confidence 22 No 98 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=99.51 E-value=2.2e-12 Score=84.54 Aligned_cols=444 Identities=12% Similarity=0.049 Sum_probs=220.6 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHH----HHHHHHHHHHHHhcCCCcccccCCcCcccccccee Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHS----RSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKI 76 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~----~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri 76 (511) |+-.| .+++-+.+..-....+.++..... ....++..+++||.+...- .......+. .+++ T Consensus 1 ~~~~~------------~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~-~~~~~~~~~--r~~~ 65 (584) T protein:vir:95 1 MSVKV------------AELNSLLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTT-TTSNQGLPW--KNST 65 (584) T ss_pred CCcch------------hhhhhhccccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhh-hhhhccccc--cccc Confidence 22111 111222222222233444443332 2223557888888884321 111111222 2467 Q ss_pred ccchHHHHHHHHHhhhhcc----C-c-e----ecCchhh--HHHHH----HHHhccChhHHHHHHHHHHhhCCeEEEEee Q lcl|NC_018086. 77 VHNFPKLLVDTSTAYLAGE----P-I-T----ESGDEKT--IKAMQ----PVFKENYVTDVNSEEVKLSGIFGHCFEIHW 140 (511) Q Consensus 77 ~~n~~k~ivd~~~~~l~g~----~-~-~----~~~d~~~--~~~l~----~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~ 140 (511) -+|.+.-+++..+.+++.- . + . ..+|.+. .+.+. +-+...++.....++..++.++|.|++.+. T Consensus 66 ~~~k~~~~~~~i~~~l~~~~Fp~~~w~~~v~~~~~~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~ 145 (584) T protein:vir:95 66 TLPKLCQIRDNLHSNYFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVS 145 (584) T ss_pred chhHHHHHHHHHHHHHHHhhcCccceeeeecCCCchhhHHHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEe Confidence 7888888888777766542 1 1 1 1222222 23333 334667899999999999999999999887 Q ss_pred eCCC-------------CceEEEEEcccceEEEecCCCCCceE-E-EEEEEEEee------------------------- Q lcl|NC_018086. 141 IDRN-------------KKHRFKAVSPMNCLIAYSADLDEEPV-A-AIYYNTVIS------------------------- 180 (511) Q Consensus 141 ~~~~-------------g~~~i~~~~p~~~~~v~d~~~~~~~~-~-~v~~~~~~~------------------------- 180 (511) +... .++++..+||.++| ||+.-....- . .+|.+.... T Consensus 146 ~~~~~~e~~e~~~v~~~~~prieriSP~d~~--~Dpsa~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~ 223 (584) T protein:vir:95 146 FEAKYKEMTDGTLVPDYIGPRLVRISPLDIV--FNPLATSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEI 223 (584) T ss_pred EeecceeeeccccccccccceEEeeChhhee--ecCCCCCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHh Confidence 6432 26899999999887 5654421111 1 122111000 Q ss_pred ----------cCCcc-------eEEEEEEEcCCcEEEEEEcc--------Ccccccccc--cccccccccceeccCCccc Q lcl|NC_018086. 181 ----------DITGH-------QIRTYEVYTEDLIYKFSTDD--------EREVYREIP--EELEIKDYEVHPNLLQKFP 233 (511) Q Consensus 181 ----------~~~~~-------~~~~~~~~~~~~i~~~~~~~--------~~~~~~~~~--~~~~~~~~~~~~~~~g~iP 233 (511) +.++- ...-.+.|.++.+.-+...+ +.+...... ......-.+..+.+.+.+| T Consensus 224 ~~~~~~~~~~~~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iIR~~~np~~~~~~P 303 (584) T protein:vir:95 224 CRHLGGYSVEDFDKAAGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPTWFGSAP 303 (584) T ss_pred ccCCCCCcccccccccccccccccccccccCCceeEEEeecccccccccCCCcccceEEEEeccEEEEeeecCCCCCCCC Confidence 00000 00011112222222111100 000000000 0000111123456678999 Q ss_pred eEeecC-----CcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCCCcee Q lcl|NC_018086. 234 VLEIIA-----NEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDEDGMVK 308 (511) Q Consensus 234 vv~~~n-----~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~~~ 308 (511) ++...- .-+|.|...-+.++++.+|.+.-.+.+++..+..|.+...+...+. ..+.+..+.....+++. T Consensus 304 F~~~~~~p~~~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k~~~~~~~~------~~~pg~~~~~~~~~~~q 377 (584) T protein:vir:95 304 IYHVGWRFRPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQPPLKIIGEVEEF------VWGPGAEIHLDQGGDVQ 377 (584) T ss_pred EEEEcceeeeccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCcceeeccccchh------cccCCceeecCCCCCcc Confidence 886543 3479999999999999999999999999999999976655532221 12245677777778888 Q ss_pred eeecCC-CHHHHHHHHHHHHHHHHHHhCccccccccc--cCccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Q lcl|NC_018086. 309 FITKDV-NDKHIENIKNRAKLDIFSLSQTPDLVSKDF--TAASGQALKAATQPLENKSAVKESKFRKVL-AKRYELVCSY 384 (511) Q Consensus 309 ~~~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l-~~~~~li~~~ 384 (511) ++.++. +..+..+.+..+...+...|++|..+-|.. ++.++..+......+..-.+.+.+.|..++ ++++.++..+ T Consensus 378 ~~~p~a~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~ 457 (584) T protein:vir:95 378 EIAKNVNYIINADNQIQMLEDRMELYAGAPREAMGIRTPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLET 457 (584) T ss_pred eecCchhhhhHHHHHHHHHHHHHHhhhCCChhhcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 887664 334555668888899999999998887643 345666777777777888888888888877 7777777765 Q ss_pred HHhcC-CCccc---------------cccceeEEe--CCCCCcCHHHHHHHHHHHhc---------c---CChHHH---H Q lcl|NC_018086. 385 LEFMN-KAKDL---------------KPYEVTPVF--VRNLPQSYAELADMAVKLRD---------M---LPDETI---I 431 (511) Q Consensus 385 ~~~~~-~~~~~---------------~~~~i~i~f--~~~~p~d~~e~a~~~~~~~g---------~---~s~et~---~ 431 (511) -...- ..... ...++.-.| ......-..+.++..+.+.. + ++.... + T Consensus 458 ~~~nmd~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~~~~~~i~p~~~~~~l~~~l 537 (584) T protein:vir:95 458 ATRNMDGSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNSQIGQMILPHTSGKALATFV 537 (584) T ss_pred HHhhccccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHhhhhhhccccchHHHHHHHH Confidence 32211 11000 001111111 11111111222232222211 1 222221 1 Q ss_pred H---hCCC----CCCH----HHHHHHHHHHHHHHHHHHHhhccccccCCCCC Q lcl|NC_018086. 432 N---QFPW----ITDA----RQEVEKADAQRQKRADIALQNFKQTSAVQGAS 472 (511) Q Consensus 432 ~---~l~~----v~d~----~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 472 (511) . .+|. ..++ +.|.+....+.++.. .+. .+....|+. T Consensus 538 adl~~~p~~~~~~~~~~~~~Q~~~q~~~~~~q~~~-~~~----~~~~~~~~~ 584 (584) T protein:vir:95 538 DDVTGLQGYEIFRPNVAVAEQAETQSLVAQAQEDL-QLQ----AQMPAEGAI 584 (584) T ss_pred HHHhCCCcccccCCCcccchhHHHHhhhHHHHHHH-HHH----HhhhhccCC Confidence 1 1221 0111 111111111111100 111 111122222 No 99 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.49 E-value=5.7e-12 Score=82.23 Aligned_cols=427 Identities=11% Similarity=0.031 Sum_probs=227.3 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc----c-ccCCcCccc---cc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIA----I-QSRTFDDTN---KP 72 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~----~-~~~~~~~~~---~~ 72 (511) |.++= ...+.. ......+ .....||.|...- . +........ .. T Consensus 1 ~~~p~------------------------~~~~~~----~~~~~~~-~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~ 51 (533) T protein:vir:34 1 MKTPT------------------------IPTLLG----PDGMTSL-REYAGYHGGGSGFGGQLRSWNPPSESVDAALLP 51 (533) T ss_pred CCCch------------------------hhhhhc----ccccchH-HHHHhhhhccCCCCCcccccccCCCCHHHHHHH Confidence 11110 111111 1111111 1334566653211 0 011100000 00 Q ss_pred cc-----e-----eccchHHHHHHHHHhhhhccCceecC-------------chhhHHHHHHHHh---c----------- Q lcl|NC_018086. 73 NS-----K-----IVHNFPKLLVDTSTAYLAGEPITESG-------------DEKTIKAMQPVFK---E----------- 115 (511) Q Consensus 73 ~~-----r-----i~~n~~k~ivd~~~~~l~g~~~~~~~-------------d~~~~~~l~~~~~---~----------- 115 (511) .. | .-.+|++-+|+..+++++|.+++... +++..+.+...|. + T Consensus 52 ~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~ 131 (533) T protein:vir:34 52 NFTRGNARADDLVRNNGYAANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERK 131 (533) T ss_pred HHHHHHHHHHHHHhcChHHHHHHHHHHHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccc Confidence 00 1 12689999999999999999988643 1222334433332 2 Q ss_pred cChhHHHHHHHHHHhhCCeEEEEeeeCCCC----ceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEE Q lcl|NC_018086. 116 NYVTDVNSEEVKLSGIFGHCFEIHWIDRNK----KHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYE 191 (511) Q Consensus 116 n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g----~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~ 191 (511) .+|......+++...+.|.+|+.....+.+ .+++..++|..+..-++.........+|.+ +..|..+-|. T Consensus 132 ~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~-----d~~Gr~~aY~- 205 (533) T protein:vir:34 132 RTFTMMIREGVAMHAFNGELFVQATWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQI-----NDSGAALGYY- 205 (533) T ss_pred cCHHHHHHHHHHHHHhCCceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCceEeeeEE-----CCCCCeEEEE- Confidence 146777888999999999999887655432 368899999987554443333445555532 3334443221 Q ss_pred EEc---CCcEEEEEEccCcccccccccccccccccceeccCCccc---eEeecC-----CcccCchhHHHHHHHHHHHHH Q lcl|NC_018086. 192 VYT---EDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFP---VLEIIA-----NEERLGDFEAQLSLIDAYNLA 260 (511) Q Consensus 192 ~~~---~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~n-----~~~g~s~~~~v~~l~d~~~~~ 260 (511) ++. ++... + .+. ..+ ....+| |+++.. -..|.|.|..++..+..++.. T Consensus 206 i~~~~~~~~~~-~----------------~~~---~~~-~~~~v~a~~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y 264 (533) T protein:vir:34 206 VSEDGYPGWMP-Q----------------KWT---WIP-RELPGGRASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTL 264 (533) T ss_pred EeecCCCCccc-c----------------ccc---eee-eeeccChhHeeeeccccCCCcccCCchHHHHHHHHHHHHHH Confidence 221 11100 0 000 000 011122 444433 246899999988888888776 Q ss_pred HHHHHHHHHHhcCceeEeecCCCCc-------------cch---------------hhhhhhhCceeeecCCCceeeeec Q lcl|NC_018086. 261 VSDSVNDIAYWNDAYLWLQGFDLSA-------------DSD---------------SISNMKNDRVIVTDEDGMVKFITK 312 (511) Q Consensus 261 ~s~~~~~~~~~~~p~l~~~G~~~~~-------------~~~---------------~~~~~~~~~~i~~~~~~~~~~~~~ 312 (511) ...........+.-..+++....+. ..+ ....+..+.+..++.+.++++.+. T Consensus 265 ~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~ 344 (533) T protein:vir:34 265 QNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTA 344 (533) T ss_pred HHHHHHHHHHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhccCcceeeccCceeeecCCCCeeeecCC Confidence 6555554444444344444221110 000 001244556777888889999988 Q ss_pred CCCHHHHHHHHHHHHHHHHHHhCcccc-ccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCC Q lcl|NC_018086. 313 DVNDKHIENIKNRAKLDIFSLSQTPDL-VSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAK-RYELVCSYLEFMNK 390 (511) Q Consensus 313 ~~~~~~~~~~~~~l~~~i~~~s~~p~~-~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~-~~~li~~~~~~~~~ 390 (511) +.+...+..+...+...|....++|-. ..+.++++|-.++++.+......+...+..|...+.+ +++..+...-..+. T Consensus 345 ~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~ 424 (533) T protein:vir:34 345 QDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRV 424 (533) T ss_pred CCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCc Confidence 888889999999999999998888843 3456778888888888888888888888777664433 33332222111221 Q ss_pred Cc-------ccc---ccceeEEeCCC--CCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 391 AK-------DLK---PYEVTPVFVRN--LPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRAD 456 (511) Q Consensus 391 ~~-------~~~---~~~i~i~f~~~--~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~ 456 (511) .. ++. ..-..+.|..+ ...|....+++.... .|+.|.+.++...| .|+++.++.+..|++...+ T Consensus 425 i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G--~D~~ev~~q~a~e~~~~~~ 502 (533) T protein:vir:34 425 VTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRG--DDYQEIFAQQVRETMERRA 502 (533) T ss_pred ccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcC--CCHHHHHHHHHHHHHHHHh Confidence 11 110 01134667333 335777777666555 58999999999987 4899999999888777655 Q ss_pred HHHhhccccccCCCCCCccc--cccCCCCCCccccc Q lcl|NC_018086. 457 IALQNFKQTSAVQGASTAAA--NKLDKNPANTSTIT 490 (511) Q Consensus 457 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 490 (511) .-+.. ...+...... ......++..+..+ T Consensus 503 ~gl~~-----~~~~~~~~~s~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:34 503 AGLKP-----PAWAAAAFESGLRQSTEEEKSDSRAA 533 (533) T ss_pred cCCCC-----CCCCCcCccCCCCCCCCCCcccCCCC Confidence 42211 0010000000 00000000000101 No 100 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.47 E-value=1e-12 Score=86.29 Aligned_cols=435 Identities=8% Similarity=0.030 Sum_probs=205.7 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccch Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNF 80 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~ 80 (511) ++++=..+.+..-..+....... ..+... ..+-.. +-.+. ...||....-+.+ .-..-++ .+.+ T Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~--~~~a~~--~g~~~~---~~~~~--~~~~~~~~~~~~~------~l~a~Y~-~~~l 96 (532) T protein:vir:94 33 LGLATAHEIDPTAYSPYERNAAQ--NAMAMD--YGLQTG---RNGRN--ALSFVEATSWPGF------PTLALLA-QLPE 96 (532) T ss_pred hhhhhhhhhcccccccccccccc--cccccc--cccCcc---ccccc--ccccccccccchH------HHHHHHH-cCch Confidence 33222211111111110000000 000000 000000 00000 0011111100000 0000011 2466 Q ss_pred HHHHHHHHHhhhhccCceecCc------hhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCc-------- Q lcl|NC_018086. 81 PKLLVDTSTAYLAGEPITESGD------EKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKK-------- 146 (511) Q Consensus 81 ~k~ivd~~~~~l~g~~~~~~~d------~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~-------- 146 (511) ++.+|+..+.-++-+++.+.++ ++....+...|.+=++...+.++.+.+..||.|++++-...+|. T Consensus 97 ~r~~Vd~~aed~~r~~~~i~~~~~~~~~~~~~~~i~~~~~~l~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~ 176 (532) T protein:vir:94 97 YRTMHETPADECVRAWGKITCSSKDELAADKATRITQKLEQYNVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPL 176 (532) T ss_pred hhhhhccchHHHhhCCceEeeCCccccchHHHHHHHHHHHhhhHHHHHHHHHHhhhcccceEEEEEeccCCccccccccc Confidence 7889999999888899988653 23345566777776788899999999999999988776543221 Q ss_pred ------------eEEEEEcccceEEE-ecCCCCCceEEE-EEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccc Q lcl|NC_018086. 147 ------------HRFKAVSPMNCLIA-YSADLDEEPVAA-IYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYRE 212 (511) Q Consensus 147 ------------~~i~~~~p~~~~~v-~d~~~~~~~~~~-v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 212 (511) ..+.+++|..+.+- |+..+...+-++ ..+|.+. .+ .-+.+.++++|... T Consensus 177 ~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp~fg~P~~y~v~---~g------~~iH~SRli~f~g~-------- 239 (532) T protein:vir:94 177 LLSPSFVQRGCLIGFATIEPMWLSPNAYNATDPTLPSFYKPDSWIAT---SG------KKIHSSRIHTVVGR-------- 239 (532) T ss_pred cccccccccceeeEEEeechheecccccccccccccccCCceeEEEc---cC------eeeccceEEEecCC-------- Confidence 12445555555442 111111111110 0111110 01 11234444444210 Q ss_pred cccccccccccceeccCCccceEeec-CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCC-C-cc-ch Q lcl|NC_018086. 213 IPEELEIKDYEVHPNLLQKFPVLEII-ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDL-S-AD-SD 288 (511) Q Consensus 213 ~~~~~~~~~~~~~~~~~g~iPvv~~~-n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~-~-~~-~~ 288 (511) .+|-...+ ++-.|+|.++.+.+-+..++.+.-.....+..+....+.+..... . .. .. T Consensus 240 ------------------~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~a~~ls~~~~~~ 301 (532) T protein:vir:94 240 ------------------PVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSMTNLATDMAQLLAPGGAQS 301 (532) T ss_pred ------------------CchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcchhHHH Confidence 01211111 122489999999999999999888887777777766665421111 1 11 11 Q ss_pred hhhhh------hh-CceeeecC-CCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccc-cccc--c-CccHHHHHHHH Q lcl|NC_018086. 289 SISNM------KN-DRVIVTDE-DGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLV-SKDF--T-AASGQALKAAT 356 (511) Q Consensus 289 ~~~~~------~~-~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~-~~~~--~-~~Sg~Ai~~~~ 356 (511) ....+ +. .+++.++. +.+++.+ ..+.+.+...++.....|...+++|-.- ++.. | |+||+.=...| T Consensus 302 ~~~r~~~~~~~~~n~g~~~id~~~e~~e~~--~~~lsgl~~~l~~~~~~iAaa~~IP~t~LfG~sp~GlnstGe~D~~~y 379 (532) T protein:vir:94 302 LDARLQLFNLYRDNRNIGALDKGTEEIQQT--NTPLSGLDSLQAQSQEQMAAVSHIPLVKLLGITPNGLNASSDGEIRVW 379 (532) T ss_pred HHHHHHHHHhhcCCccceEEcCCCceeEEE--ecccCCHHHHHHHHHHHHHhHhCCCeeeeecCCcccccccchHHHHHH Confidence 11111 11 24556654 3444444 4566678888999999999999999653 3432 1 45666433333 Q ss_pred HHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHH-------H--hccCC Q lcl|NC_018086. 357 QPLENKSAVKE-SKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVK-------L--RDMLP 426 (511) Q Consensus 357 ~~l~~k~~~~~-~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~-------~--~g~~s 426 (511) ...++.++ ..+...+++++++|+... .+.. ..++++.|++-...+.+|.|+...+ + .|+++ T Consensus 380 ---yd~I~s~Qe~~l~p~le~l~~~l~~s~--~g~~----~~d~~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~~Gvi~ 450 (532) T protein:vir:94 380 ---YDFIAGYQATNLTPLMEWIIDLIQLSE--YGQI----DPGLAWEWSPLMELDDKELAEVRQLNASTDSTLMELGVID 450 (532) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHh--cCCC----CCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCC Confidence 34444444 556788888888775422 1211 1258899999888899888765432 2 37899 Q ss_pred hHHHHHhCCCCCC-------H-HHHHHHHHHHHHHHHHHHHhhcccc--ccCCCCCCccccccCCCCC-CccccccCCCC Q lcl|NC_018086. 427 DETIINQFPWITD-------A-RQEVEKADAQRQKRADIALQNFKQT--SAVQGASTAAANKLDKNPA-NTSTITTTDPV 495 (511) Q Consensus 427 ~et~~~~l~~v~d-------~-~~E~~ri~~E~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 495 (511) .+.++.+|..-.+ + ..+++....+....... ..+..+. ....+..+...++.+.++. ..+++++++|+ T Consensus 451 ~~Evr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 529 (532) T protein:vir:94 451 AKMVQQRLAADPTSGYAGALGERDELDDVEEIAKQLMAA-ALNPPATAPQTPNPQPDSEDDQTDNQPDAQADPAQNDQPV 529 (532) T ss_pred HHHHHHHHhcCCccccccccccccccccccchhhhhccc-ccCCCCCCCCCCCCCCCCCCCCCCCccCCCccccccCCCc Confidence 9888887753211 1 11111111111110000 0000010 1111222223333333333 23366666666 Q ss_pred ccc Q lcl|NC_018086. 496 AAK 498 (511) Q Consensus 496 ~~~ 498 (511) +.- T Consensus 530 ~~~ 532 (532) T protein:vir:94 530 GNR 532 (532) T ss_pred CCC Confidence 655 No 101 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=99.46 E-value=6.6e-14 Score=92.84 Aligned_cols=475 Identities=7% Similarity=-0.034 Sum_probs=204.8 Q ss_pred hhccCCCHHHHH----HHHHHHHHHHHHHHHHHHHhcCCCcccccC-CcCccccccceeccchHHHHHHHHHhhhhccCc Q lcl|NC_018086. 23 FIRRNFDLRELI----TLAEMHSRSSSAYGVLYDYYKGNHIAIQSR-TFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPI 97 (511) Q Consensus 23 ~~~~~~~~~~l~----~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~-~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~ 97 (511) +-+.......+. ..+....+-+.....-.+||.|.+-..... ..+..+ |..+|.++.+|+..+++---+.+ T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW~~~~~~~l~~q~----rp~~N~i~~~v~~v~g~e~~nr~ 76 (725) T protein:vir:10 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTLQY----RGQFDVVRPVVRKLVSEMRQNPI 76 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcC----CCcccchHHHHHHHHhhHHhCCc Confidence 111111112222 222222333344556689999987432111 111112 33579999999999998665543 Q ss_pred e--ec----CchhhHHH----HHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeC---C---CCceEEEEE----cccce Q lcl|NC_018086. 98 T--ES----GDEKTIKA----MQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWID---R---NKKHRFKAV----SPMNC 157 (511) Q Consensus 98 ~--~~----~d~~~~~~----l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~---~---~g~~~i~~~----~p~~~ 157 (511) . +. +|.+..+. +..+...++.+...+.+..+++++|.||+-|..+ + ++.+.|..+ +|.++ T Consensus 77 d~~v~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~~~v 156 (725) T protein:vir:10 77 DVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHV 156 (725) T ss_pred ceEEecCCcchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCCceeeeeeecccCHhHc Confidence 3 22 22232232 4455677899999999999999999999877433 2 234444432 33434 Q ss_pred EEEecCCCCCc----eE-EEEEEEEEe----------------------------ecCCcceEEEEEEEcCCcEEE--EE Q lcl|NC_018086. 158 LIAYSADLDEE----PV-AAIYYNTVI----------------------------SDITGHQIRTYEVYTEDLIYK--FS 202 (511) Q Consensus 158 ~~v~d~~~~~~----~~-~~v~~~~~~----------------------------~~~~~~~~~~~~~~~~~~i~~--~~ 202 (511) + ||+..... .. ++++.|... .+.....+..+++|....+.. +. T Consensus 157 ~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~ 234 (725) T protein:vir:10 157 I--WDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFI 234 (725) T ss_pred c--cCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEEEEEEEEEEEEeeEEEE Confidence 3 44422110 01 111111110 011122344444443322111 11 Q ss_pred Ec--cCccccccccc-------------------------------cccc-ccccceeccCCccceEeec---C----Cc Q lcl|NC_018086. 203 TD--DEREVYREIPE-------------------------------ELEI-KDYEVHPNLLQKFPVLEII---A----NE 241 (511) Q Consensus 203 ~~--~~~~~~~~~~~-------------------------------~~~~-~~~~~~~~~~g~iPvv~~~---n----~~ 241 (511) .. ..+......+. ..+. ......+.+-+.+|+|+|. - .+ T Consensus 235 ~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~~g~r~~~~g~~ 314 (725) T protein:vir:10 235 YQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKE 314 (725) T ss_pred eccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEEEEEeeeeccCCcc Confidence 10 01100000000 0000 0011122333445555543 2 12 Q ss_pred ccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeec----CCC-----ceeeeec Q lcl|NC_018086. 242 ERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTD----EDG-----MVKFITK 312 (511) Q Consensus 242 ~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~----~~~-----~~~~~~~ 312 (511) .+-|.+.++++.++.+|...|.+...+-....-.........+..............+... .+| .+.+... T Consensus 315 ~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~ 394 (725) T protein:vir:10 315 VYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYEN 394 (725) T ss_pred eeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCceeeecccccccCcccccccCcccCC Confidence 3348889999999999999999998876544433332222222222222222222222211 111 1222222 Q ss_pred CCCHHHHHHHHHHHHHHHHHHhCcccccccccc-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--- Q lcl|NC_018086. 313 DVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFM--- 388 (511) Q Consensus 313 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~--- 388 (511) +.-...+...+......|..++++.+...|..+ +.||+|+.................+..+.+++.++++.+.... T Consensus 395 ~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~~~ 474 (725) T protein:vir:10 395 PEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDV 474 (725) T ss_pred CCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC Confidence 222245556788888999999998776666655 4799999998777776666666777777777666655543221 Q ss_pred -------CCCc--cc----------------------cccceeEEeCCCCCcCHHHHHHHHHHHhccCCh------HHHH Q lcl|NC_018086. 389 -------NKAK--DL----------------------KPYEVTPVFVRNLPQSYAELADMAVKLRDMLPD------ETII 431 (511) Q Consensus 389 -------~~~~--~~----------------------~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~------et~~ 431 (511) +... .+ ...++.|.=.|..+.-..+.++.++.+.+.++. .+++ T Consensus 475 er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~ 554 (725) T protein:vir:10 475 PRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELLGKTPQGTPEYQLLLL 554 (725) T ss_pred CcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHHHhccccchhHHHHHH Confidence 1100 00 012344433333333234455555555443332 2233 Q ss_pred HhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCc--cccccCCCC-CCc--ccccc-CCCCcccc-ccc Q lcl|NC_018086. 432 NQFPW--ITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTA--AANKLDKNP-ANT--STITT-TDPVAAKE-QEK 502 (511) Q Consensus 432 ~~l~~--v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-~~~--~~~~~-~~~~~~~~-~~~ 502 (511) ..++. .+..++..++++++....... + ...+.... ......... ... ..... ......+. +.. T Consensus 555 ~~~~~~d~~~~~e~~erirkq~~~~~~~------~--~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~~~~~qae~~ka~ 626 (725) T protein:vir:10 555 QYFTLLDGKGVEMMRDYANKQLIQMGVK------K--PETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQ 626 (725) T ss_pred HHhhcCCchhHHHHHHHHHhhhhhhccC------C--ccccchhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Confidence 32322 233344456665432211000 0 00000000 000000000 000 00000 00000000 000 Q ss_pred cC-------CCCCCCC Q lcl|NC_018086. 503 AI-------QKKPKTD 511 (511) Q Consensus 503 ~~-------~~~~~~~ 511 (511) .. -.+..++ T Consensus 627 aE~~k~~~~a~~~~~~ 642 (725) T protein:vir:10 627 NQTLSLQIDAAKVEAQ 642 (725) T ss_pred HHHHHHHHHHHHHHHH Confidence 00 0000011 No 102 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.46 E-value=3.2e-13 Score=89.12 Aligned_cols=474 Identities=7% Similarity=-0.044 Sum_probs=203.5 Q ss_pred cCchhhHhhhhccCCCHHHHH----HHHHHHHHHHHHHHHHHHHhcCCCcccccC-CcCccccccceeccchHHHHHHHH Q lcl|NC_018086. 14 ITTNIRRKHFIRRNFDLRELI----TLAEMHSRSSSAYGVLYDYYKGNHIAIQSR-TFDDTNKPNSKIVHNFPKLLVDTS 88 (511) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~l~----~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~-~~~~~~~~~~ri~~n~~k~ivd~~ 88 (511) +.. .......+. ..+....+-+.....-.+||.|.+-..... ..+. ..|..+|.++.+|+.. T Consensus 1 m~d---------~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~----q~rp~~N~i~~~i~~v 67 (725) T protein:vir:77 1 MAD---------NENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTL----QYRGQFDVVRPVVRKL 67 (725) T ss_pred CCc---------hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCCCHHHHHHHHh----cCCCccccHHHHHHHH Confidence 111 111112222 222223333345556689999987432111 1111 1234579999999999 Q ss_pred HhhhhccCce--ec----CchhhHH----HHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeC---C---CCceEEEEE Q lcl|NC_018086. 89 TAYLAGEPIT--ES----GDEKTIK----AMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWID---R---NKKHRFKAV 152 (511) Q Consensus 89 ~~~l~g~~~~--~~----~d~~~~~----~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~---~---~g~~~i~~~ 152 (511) +++---+.+. +. +|.+..+ .+..+...++.+...+.+..+++++|.||+-|..| + ++.++|... T Consensus 68 ~g~~~~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~ 147 (725) T protein:vir:77 68 VSEMRQNPIDVLYRPKDGARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRRE 147 (725) T ss_pred HhhHHhCCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEe Confidence 9887655433 32 2222223 24555677899999999999999999999877543 2 234444433 Q ss_pred ----cccceEEEecCCCCCc-----eEEEEEEEEEe----------------------------ecCCcceEEEEEEEcC Q lcl|NC_018086. 153 ----SPMNCLIAYSADLDEE-----PVAAIYYNTVI----------------------------SDITGHQIRTYEVYTE 195 (511) Q Consensus 153 ----~p~~~~~v~d~~~~~~-----~~~~v~~~~~~----------------------------~~~~~~~~~~~~~~~~ 195 (511) +|.++ +||+..... ..+++..|... ++.+...+..+++|.. T Consensus 148 ~~~~~~~~v--~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~~r 225 (725) T protein:vir:77 148 PIHSACSHV--IWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEV 225 (725) T ss_pred ecccChhhc--eeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEEEE Confidence 34443 334322110 00111111100 0112234455565554 Q ss_pred CcEEEEE--Ecc--Cccc--ccc-----------ccc------------------ccccc-cccceeccCCccceEeec- Q lcl|NC_018086. 196 DLIYKFS--TDD--EREV--YRE-----------IPE------------------ELEIK-DYEVHPNLLQKFPVLEII- 238 (511) Q Consensus 196 ~~i~~~~--~~~--~~~~--~~~-----------~~~------------------~~~~~-~~~~~~~~~g~iPvv~~~- 238 (511) ..+.... ... .+.. +.. ... ..+.. ..+..+.+-+.+|+|+|. T Consensus 226 ~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g 305 (725) T protein:vir:77 226 VEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFG 305 (725) T ss_pred EEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEEee Confidence 3322111 110 0000 000 000 00000 012223333445555443 Q ss_pred --C----CcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCcee-----eecCC--- Q lcl|NC_018086. 239 --A----NEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVI-----VTDED--- 304 (511) Q Consensus 239 --n----~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i-----~~~~~--- 304 (511) - .+.+.|.+.++++.++.+|...|.+...+-....-..++.-...+..............+ ...++ T Consensus 306 ~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 385 (725) T protein:vir:77 306 EWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENSGDLP 385 (725) T ss_pred eeeccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHhccCCceecccccccCCCccc Confidence 2 123348888999999999999999987776544433222111111122222222111111 11111 Q ss_pred -Cceeeeec-CCCHHHHHHHHHHHHHHHHHHhCccccccccccC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 305 -GMVKFITK-DVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA-ASGQALKAATQPLENKSAVKESKFRKVLAKRYELV 381 (511) Q Consensus 305 -~~~~~~~~-~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li 381 (511) +.+..... +++ ..+...+......|-.+|++.+-..|..+| .||+|+..........+......+..+.+++.+++ T Consensus 386 ~~~i~~~~~~~lp-~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~l 464 (725) T protein:vir:77 386 TQPLAYYENPEVP-QANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEIY 464 (725) T ss_pred ccCccccCCCCch-HHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11222222 223 344557788888999999887766666554 79999999888777777777777777777777666 Q ss_pred HHHHHhc----------CCCccc------------------------cccceeEEeCCCCCcCHHHHHHHHHHHhccCCh Q lcl|NC_018086. 382 CSYLEFM----------NKAKDL------------------------KPYEVTPVFVRNLPQSYAELADMAVKLRDMLPD 427 (511) Q Consensus 382 ~~~~~~~----------~~~~~~------------------------~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~ 427 (511) +.+.... +..... ...++.|.=.+..+.=..+..+.++.+...++. T Consensus 465 L~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~ 544 (725) T protein:vir:77 465 QSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQ 544 (725) T ss_pred HHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHHHHhccc Confidence 6553221 110000 002333333333222233444445554433332 Q ss_pred ------HHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccC----CCCCCc-ccccc-CC Q lcl|NC_018086. 428 ------ETIINQFPW--ITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLD----KNPANT-STITT-TD 493 (511) Q Consensus 428 ------et~~~~l~~--v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~-~~~~~-~~ 493 (511) .++...+.. .+..++.+++++++....... ....+.......+.. .+.... ...+. .. T Consensus 545 ~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~--------q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~~ 616 (725) T protein:vir:77 545 GTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVK--------KPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLL 616 (725) T ss_pred cchhHHHHHHHhhccccchHHHHHHHHHHhhhhhhhcc--------CCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHHH Confidence 122222221 122234455555432221110 000000000000000 000000 00000 00 Q ss_pred CCccc-cccccCCCCCCCC Q lcl|NC_018086. 494 PVAAK-EQEKAIQKKPKTD 511 (511) Q Consensus 494 ~~~~~-~~~~~~~~~~~~~ 511 (511) ..... .+......+...+ T Consensus 617 ~~qa~~~kaq~e~~k~q~~ 635 (725) T protein:vir:77 617 QGQAELAKAQNQTLSLQID 635 (725) T ss_pred HHHHHHHHHHHHHHHHHHH Confidence 00000 0000000000001 No 103 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.45 E-value=1.3e-11 Score=80.19 Aligned_cols=427 Identities=10% Similarity=0.024 Sum_probs=227.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc--cc---cCCcCccc---cccc-----e-----eccchHHHHHHHHH Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSSAYGVLYDYYKGNHIA--IQ---SRTFDDTN---KPNS-----K-----IVHNFPKLLVDTST 89 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~--~~---~~~~~~~~---~~~~-----r-----i~~n~~k~ivd~~~ 89 (511) |....++.+- .+.. ......||.|...- .. ........ ..+. | .-.+|++-+|+..+ T Consensus 1 ~~~~~~~~~~----~~~~-~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 75 (530) T protein:vir:38 1 MKIPSLVGPD----GKTS-LREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQ 75 (530) T ss_pred CccceeecCc----cccc-hHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 2221122211 1111 02334566553210 00 00000000 0000 0 12589999999999 Q ss_pred hhhhccCceecC-------------chhhHHHHHHHHh---c-----------cChhHHHHHHHHHHhhCCeEEEEeeeC Q lcl|NC_018086. 90 AYLAGEPITESG-------------DEKTIKAMQPVFK---E-----------NYVTDVNSEEVKLSGIFGHCFEIHWID 142 (511) Q Consensus 90 ~~l~g~~~~~~~-------------d~~~~~~l~~~~~---~-----------n~~~~~~~~~~~~a~~~G~~~~~v~~~ 142 (511) ..++|.+++... +++..+.+.+.|. . .+|......+.+...+.|.+|+..... T Consensus 76 ~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~ 155 (530) T protein:vir:38 76 DHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWD 155 (530) T ss_pred HHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeec Confidence 999999987542 1222344444443 2 246778888999999999999877654 Q ss_pred CC-C---ceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEc---CCcEEEEEEccCccccccccc Q lcl|NC_018086. 143 RN-K---KHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYT---EDLIYKFSTDDEREVYREIPE 215 (511) Q Consensus 143 ~~-g---~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~---~~~i~~~~~~~~~~~~~~~~~ 215 (511) +. | .+++..++|..+-.-++.........+|.+ +..|..+-|. ++. ++... T Consensus 156 ~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~-----d~~Gr~~aY~-i~~~~~~~~~~---------------- 213 (530) T protein:vir:38 156 SDSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKI-----NDSGAALGYY-VSDDGYPGWMA---------------- 213 (530) T ss_pred cCCCCccceEEEEechhhcCCCCCCCCCCeeEeeeEE-----CCCCceEEEE-EeeccCCCccc---------------- Confidence 43 3 368899999887544433333445555532 3344443221 211 11100 Q ss_pred cccccccccee--ccCCccceEeecCC-----cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCC----- Q lcl|NC_018086. 216 ELEIKDYEVHP--NLLQKFPVLEIIAN-----EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDL----- 283 (511) Q Consensus 216 ~~~~~~~~~~~--~~~g~iPvv~~~n~-----~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~----- 283 (511) .. ....+ ..++.--|+++... ..|.|.+..++..+..++.............+.-..+++.... T Consensus 214 -~~---~~~~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~ 289 (530) T protein:vir:38 214 -QN---WTYIPRELPGGRPSFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAM 289 (530) T ss_pred -cc---cceeeeeeccChhHeEeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccc Confidence 00 01111 11222225555432 3689999998888888877665555555444444444442211 Q ss_pred --------Cccch---------------hhhhhhhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccc- Q lcl|NC_018086. 284 --------SADSD---------------SISNMKNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDL- 339 (511) Q Consensus 284 --------~~~~~---------------~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~- 339 (511) ++... ....+..+.+..++.+.++++.+.+.+...+..+...+...|....++|-- T Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~ 369 (530) T protein:vir:38 290 DFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQ 369 (530) T ss_pred cccccCCcccccccccccchhhhhcccccceeccCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHH Confidence 10000 001234455667888889999988888899999999999999998888843 Q ss_pred ccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCc-----cccc-----cceeEEeCCC-- Q lcl|NC_018086. 340 VSKDFTAASGQALKAATQPLENKSAVKESKFRKVL-AKRYELVCSYLEFMNKAK-----DLKP-----YEVTPVFVRN-- 406 (511) Q Consensus 340 ~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l-~~~~~li~~~~~~~~~~~-----~~~~-----~~i~i~f~~~-- 406 (511) ..+.++++|-.+.++.+......++..+..|...+ +.+++..+...-..+... .++. .-..+.|..+ T Consensus 370 lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~ 449 (530) T protein:vir:38 370 LSRNYSQMSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGR 449 (530) T ss_pred HhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCc Confidence 34567778888888888888888888887776643 333433222211112111 0010 0134566333 Q ss_pred CCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCC Q lcl|NC_018086. 407 LPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPA 484 (511) Q Consensus 407 ~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (511) ...|....+++.... .|+.|.+.++...| .|+++.++.+..|++...+.-+... .............+...+. T Consensus 450 ~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl~~~---~~~~~~~~~~~~~~~~~~~ 524 (530) T protein:vir:38 450 MAIDGLKEVQEAVMLIEAGLSTYEKECAKRG--DDYQEIFAQQVRESMERRAAGLNPP---AWAAAAFEAGVKKSNEEEQ 524 (530) T ss_pred cccChHHHHHHHHHHHHcCCCCHHHHHHHcC--CCHHHHHHHHHHHHHHHHHcCCCCC---CCcccccCCCCCCCCCCCC Confidence 335666666666554 58999999999887 4899999999888777655422110 0000000000000000000 Q ss_pred Cccccc Q lcl|NC_018086. 485 NTSTIT 490 (511) Q Consensus 485 ~~~~~~ 490 (511) +.++++ T Consensus 525 d~~~~a 530 (530) T protein:vir:38 525 DGARAA 530 (530) T ss_pred CCCCCC Confidence 000100 No 104 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.43 E-value=2.1e-13 Score=90.14 Aligned_cols=485 Identities=11% Similarity=-0.007 Sum_probs=204.9 Q ss_pred ccCCCHHHHHHHHHHH-------HHHHHHHHHHHHHh--cCCCcccccC-CcCccccc--cceeccchHHHHHHHHHhhh Q lcl|NC_018086. 25 RRNFDLRELITLAEMH-------SRSSSAYGVLYDYY--KGNHIAIQSR-TFDDTNKP--NSKIVHNFPKLLVDTSTAYL 92 (511) Q Consensus 25 ~~~~~~~~l~~~~~~~-------~~~~~~~~~~~~yY--~G~~~~~~~~-~~~~~~~~--~~ri~~n~~k~ivd~~~~~l 92 (511) ..+.+.+.+.++...+ ...+.....-.+|| .|++-...-. .-...... .-.+.+|.++.+|+..+++- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHHHHHHHHH Confidence 1111111122222222 22222222233455 5776321100 00000010 11477899999999999998 Q ss_pred hccCcee--c-C----chhhHH----HHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC----------CCceEEEE Q lcl|NC_018086. 93 AGEPITE--S-G----DEKTIK----AMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR----------NKKHRFKA 151 (511) Q Consensus 93 ~g~~~~~--~-~----d~~~~~----~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~----------~g~~~i~~ 151 (511) ..+.+.+ . . |.+..+ .+..++..|+.+...+.+..++.++|.||+.+..+. .+.+...+ T Consensus 81 ~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~~~~~i~i~~~ 160 (708) T protein:vir:10 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) T ss_pred HhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccccccCCCCCccccceEEe Confidence 7765443 2 1 222222 345567789999999999999999999998775431 12222233 Q ss_pred EcccceEEEecCCCCCc-----eEEEEEEEEEe-------------------------ecCCcceEEEEEEEcCCcEEEE Q lcl|NC_018086. 152 VSPMNCLIAYSADLDEE-----PVAAIYYNTVI-------------------------SDITGHQIRTYEVYTEDLIYKF 201 (511) Q Consensus 152 ~~p~~~~~v~d~~~~~~-----~~~~v~~~~~~-------------------------~~~~~~~~~~~~~~~~~~i~~~ 201 (511) .+|... ++||+..... ..++++.|... ++.+...+..+++|........ T Consensus 161 ~~p~~~-v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~v~ey~~r~~~~~~ 239 (708) T protein:vir:10 161 YDPSRS-VWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVD 239 (708) T ss_pred ecchhh-cccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceEEEEeeeEEEEEEE Confidence 445422 1334322110 01111111100 0001122333343332211111 Q ss_pred E----E-ccCcccc-cccc--------cc---------------------ccc-ccccceeccCCccceEeecCC----- Q lcl|NC_018086. 202 S----T-DDEREVY-REIP--------EE---------------------LEI-KDYEVHPNLLQKFPVLEIIAN----- 240 (511) Q Consensus 202 ~----~-~~~~~~~-~~~~--------~~---------------------~~~-~~~~~~~~~~g~iPvv~~~n~----- 240 (511) . . .++.... .... .. .+. ......+-+++.+|+|+|... T Consensus 240 ~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p~~~fP~vP~~g~r~~~d 319 (708) T protein:vir:10 240 VISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFID 319 (708) T ss_pred EEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCCCCceeeEEEeeeeeccC Confidence 1 0 0010000 0000 00 000 011234455667788776421 Q ss_pred --cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhh--hhhhCceeee----cCCCc------ Q lcl|NC_018086. 241 --EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSIS--NMKNDRVIVT----DEDGM------ 306 (511) Q Consensus 241 --~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~--~~~~~~~i~~----~~~~~------ 306 (511) +.+-|.+.++++.++.+|..+|.+...+-.......++............. +......+.. ...|. T Consensus 320 ~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~~ 399 (708) T protein:vir:10 320 DIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGAT 399 (708) T ss_pred CCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhhhccccccccccccccccC Confidence 222477888999999999999999988876655544432221111111100 0000111100 00111 Q ss_pred -eeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 307 -VKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYL 385 (511) Q Consensus 307 -~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~ 385 (511) ........-...+...+.....+|-.+|+..+-..|.-+|.||+||......-..........+..+.+++.++++.+. T Consensus 400 ~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~sn~SG~aI~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li 479 (708) T protein:vir:10 400 PAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMA 479 (708) T ss_pred CccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1122222233456666778888999999887777777678999999998877777777777777777777777666654 Q ss_pred Hh----------cCCCc------------cc---------c----ccceeEEeCCCCCcCHHHHHHHHHHHhccCCh--- Q lcl|NC_018086. 386 EF----------MNKAK------------DL---------K----PYEVTPVFVRNLPQSYAELADMAVKLRDMLPD--- 427 (511) Q Consensus 386 ~~----------~~~~~------------~~---------~----~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~--- 427 (511) .. .+... +. + ..+|.|.=.|..+.-..+..+.++++.+.++. T Consensus 480 ~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~qll~~~~p~~~ 559 (708) T protein:vir:10 480 REVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDP 559 (708) T ss_pred HHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHHHHhcCCCch Confidence 32 11100 00 0 11333333444444445566666666543331 Q ss_pred HH------HHHhCCCCCCHHHHHHHHHHHHHH------------HHHHHHhhccccccCCCCCCcccc----ccCCCCCC Q lcl|NC_018086. 428 ET------IINQFPWITDARQEVEKADAQRQK------------RADIALQNFKQTSAVQGASTAAAN----KLDKNPAN 485 (511) Q Consensus 428 et------~~~~l~~v~d~~~E~~ri~~E~~~------------~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~ 485 (511) .+ +++.+ ..+..++-.++|++.... .+........+.....-....... +...+... T Consensus 560 ~~~~~~~~~l~~~-D~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~~~~e~qa~~~~~qAe~~ka~ 638 (708) T protein:vir:10 560 MRPAIQGIILDNI-DGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKAT 638 (708) T ss_pred hhHHHHHHHHHhc-CCcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 12222 223334444555432100 000000000000000000000000 00000000 Q ss_pred cccccc------CCCCccccccccCCCCCCCC Q lcl|NC_018086. 486 TSTITT------TDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 486 ~~~~~~------~~~~~~~~~~~~~~~~~~~~ 511 (511) ....+. .+-....+..++.+...-.+ T Consensus 639 a~a~~~~~~a~q~~~~~~~a~~~a~q~~~~a~ 670 (708) T protein:vir:10 639 NETAQTQIKAFTAQQDAMESQANTVYKLAQAR 670 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000 00000000001111111111 No 105 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.43 E-value=1.8e-11 Score=79.50 Aligned_cols=432 Identities=9% Similarity=-0.001 Sum_probs=220.3 Q ss_pred cCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccC---CcCcc--ccccc-----e-----eccchHHHHHHHHHh Q lcl|NC_018086. 26 RNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSR---TFDDT--NKPNS-----K-----IVHNFPKLLVDTSTA 90 (511) Q Consensus 26 ~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~---~~~~~--~~~~~-----r-----i~~n~~k~ivd~~~~ 90 (511) =++...-+...-..... .....-|+|-..-.... ..... ..... | .-.+|++-+|+..++ T Consensus 1 m~~~~~~~~a~~~~~~~-----~~~~~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~ 75 (495) T protein:vir:10 1 MNMTPSGYQSLASGLLV-----PVGASAYEGASGGHRWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVA 75 (495) T ss_pred CCcccccccccchhhhh-----HHHhhhhhccccCcccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHH Confidence 00101111111111111 11223455532211111 11100 00000 1 126899999999999 Q ss_pred hhhccCceec---CchhhHHHHHHH---Hhc-------cChhHHHHHHHHHHhhCCeEEEEeeeCC--CC---ceEEEEE Q lcl|NC_018086. 91 YLAGEPITES---GDEKTIKAMQPV---FKE-------NYVTDVNSEEVKLSGIFGHCFEIHWIDR--NK---KHRFKAV 152 (511) Q Consensus 91 ~l~g~~~~~~---~d~~~~~~l~~~---~~~-------n~~~~~~~~~~~~a~~~G~~~~~v~~~~--~g---~~~i~~~ 152 (511) +++|.+++.. .+++..+.+... |.. .+|......+++.....|.+|+.....+ +| .+++..+ T Consensus 76 ~vVG~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lqli 155 (495) T protein:vir:10 76 AAVGNGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQLQII 155 (495) T ss_pred hhcCCCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceEEEEe Confidence 9999998864 344444444444 443 3577788899999999999998664432 23 3689999 Q ss_pred cccceEEEecC---CCCCceEEEEEEEEEeecCCcceEE-EEEEEcCCcEEEEEEccCcccccccccccccccccceecc Q lcl|NC_018086. 153 SPMNCLIAYSA---DLDEEPVAAIYYNTVISDITGHQIR-TYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNL 228 (511) Q Consensus 153 ~p~~~~~v~d~---~~~~~~~~~v~~~~~~~~~~~~~~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (511) +|..+..-++. ........+|.+ +..|..+- ++.--.++....... .. ......-+...|- T Consensus 156 epd~l~~~~~~~~~~~g~~i~~GIe~-----d~~Gr~vaY~i~~~hpgd~~~~~~-~~---------~~~rvpA~~vlH~ 220 (495) T protein:vir:10 156 EPDMLASDIPDETLPSGGYVKGGIRF-----SNGGKRKAYCFYRNHPAESSLIGD-PV---------DTVWIKAEHVLHV 220 (495) T ss_pred chhhcCCCCCCCCCCCCCEEEeceEE-----CCCCceEEEEEeecCCCccccccc-cc---------ceeeechhheEec Confidence 99997533322 122334555542 23344432 221112221110000 00 0000001122233 Q ss_pred CCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCcc-------------chhhhhhhh Q lcl|NC_018086. 229 LQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSAD-------------SDSISNMKN 295 (511) Q Consensus 229 ~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~-------------~~~~~~~~~ 295 (511) | |. .+.-..|.|.+..++.| ..++.............+.-..+++....+.. ......+.. T Consensus 221 f---~~--r~gQ~RGis~la~i~~l-~~l~~y~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~p 294 (495) T protein:vir:10 221 T---VL--TVRSDAGAPWFQLLLRL-NELDQYEDAELVRKKTAALFAAFIQEATADSTGGPTIGQPKRSKGGKRITGLNP 294 (495) T ss_pred c---cc--CCCcccCcchhHHHHHH-HHhhHHHHHHHHHHHHhhhheeeeecCCCccccccccCccccccCcccceecCC Confidence 2 10 12334688988877764 44444443333333443433444443221111 011122444 Q ss_pred CceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccc-ccccccCccHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_018086. 296 DRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDL-VSKDFTAASGQALKAATQPLENKSAVKES-KFRKV 373 (511) Q Consensus 296 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~-~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~-~~~~~ 373 (511) +.+..++.+.++++.+.+.+...+..+...+...|....++|-- ..+.++++|-.++++.+......++..+. .+... T Consensus 295 G~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~ 374 (495) T protein:vir:10 295 GTLQYLQPGQEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQ 374 (495) T ss_pred ceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 56777888889999988888889999999999999988888743 33566778888888888888878876554 45544 Q ss_pred H-HHHHHHHHHHHHhcCCCc--cc-cc--cceeEEeCCCC--CcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHH Q lcl|NC_018086. 374 L-AKRYELVCSYLEFMNKAK--DL-KP--YEVTPVFVRNL--PQSYAELADMAVKL--RDMLPDETIINQFPWITDARQE 443 (511) Q Consensus 374 l-~~~~~li~~~~~~~~~~~--~~-~~--~~i~i~f~~~~--p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E 443 (511) + +.+++..+...-..+... .+ +. .-..+.|..+- ..|....+++.... .|+.|.+.++...|. |+++. T Consensus 375 ~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~--D~~~v 452 (495) T protein:vir:10 375 FCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAERGY--DMEEL 452 (495) T ss_pred HHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCC--CHHHH Confidence 3 335554444332222211 11 11 11356674433 35777777766555 589999999999874 89988 Q ss_pred HHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 444 VEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 444 ~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) ++.+..|++...+.-+. + ...+ .. .++. ++.+ ...++...+| T Consensus 453 ~~q~a~e~~~~~~~Gl~-~----~~~p---~~------~~~~---~~~~---------~~~~~~~~~~ 494 (495) T protein:vir:10 453 FDMISDANQLIDEYDLR-L----DSDP---RY------VNGS---GAEQ---------KSVMEAALNN 494 (495) T ss_pred HHHHHHHHHHHHHcCCC-C----CCCC---Cc------CCCc---cCCC---------CCCCCCCCCC Confidence 88888887766554221 0 0000 00 0000 0000 0000000000 No 106 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=99.39 E-value=2.8e-13 Score=89.41 Aligned_cols=475 Identities=8% Similarity=-0.036 Sum_probs=198.0 Q ss_pred hhccCCCHHHHH----HHHHHHHHHHHHHHHHHHHhcCCCcccccC-CcCccccccceeccchHHHHHHHHHhhhhccCc Q lcl|NC_018086. 23 FIRRNFDLRELI----TLAEMHSRSSSAYGVLYDYYKGNHIAIQSR-TFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPI 97 (511) Q Consensus 23 ~~~~~~~~~~l~----~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~-~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~ 97 (511) +-+.......+. ..+....+-+.....-.+||.|.+-..... ..+. ..|..+|.++.+|+..+++---+.+ T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~----q~rp~~N~i~~~i~~v~g~e~~nr~ 76 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRISQWDDWLSQYTTL----QYRGQFDVVRPVVRKLVSEMRQNPI 76 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHh----cCCCcccchHHHHHHHHhhHHhCCc Confidence 111111112222 222223333344556689999987432111 1111 1233579999999999988655543 Q ss_pred e--ec----CchhhHH----HHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeC---C---CCceEEEEE---cccceE Q lcl|NC_018086. 98 T--ES----GDEKTIK----AMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWID---R---NKKHRFKAV---SPMNCL 158 (511) Q Consensus 98 ~--~~----~d~~~~~----~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~---~---~g~~~i~~~---~p~~~~ 158 (511) . +. +|.+..+ .+..+...++.+...+.+..+++++|.||+-|..+ + ++.++|... +|... T Consensus 77 d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~~~- 155 (725) T protein:vir:92 77 DVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH- 155 (725) T ss_pred ceEEecCCccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeeccCChhh- Confidence 3 22 2223323 24555677999999999999999999999877543 2 234444432 33331 Q ss_pred EEecCCCCCc-----eEEEEEEEEEe----------------------------ecCCcceEEEEEEEcCCcEEE--EEE Q lcl|NC_018086. 159 IAYSADLDEE-----PVAAIYYNTVI----------------------------SDITGHQIRTYEVYTEDLIYK--FST 203 (511) Q Consensus 159 ~v~d~~~~~~-----~~~~v~~~~~~----------------------------~~~~~~~~~~~~~~~~~~i~~--~~~ 203 (511) ++||+..... ..++++.|... .+.....+..+++|....+.. +.. T Consensus 156 V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~~~~~~~~~~ 235 (725) T protein:vir:92 156 VIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIY 235 (725) T ss_pred cccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEEEEEEEEEEEEeeeEEee Confidence 1233322110 00011111100 001123344445444322211 110 Q ss_pred c--cCccccccccc-------------------------------cccc-ccccceeccCCccceEeecC-------Ccc Q lcl|NC_018086. 204 D--DEREVYREIPE-------------------------------ELEI-KDYEVHPNLLQKFPVLEIIA-------NEE 242 (511) Q Consensus 204 ~--~~~~~~~~~~~-------------------------------~~~~-~~~~~~~~~~g~iPvv~~~n-------~~~ 242 (511) . ..+......+. ..+. ...+..+.+-+.+|+|+|.. .+. T Consensus 236 ~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~g~~~ 315 (725) T protein:vir:92 236 QDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEV 315 (725) T ss_pred cCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeEEEEeeeeccCCccc Confidence 0 00000000000 0000 00111223334455555432 123 Q ss_pred cCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeec----CCC-----ceeeeecC Q lcl|NC_018086. 243 RLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTD----EDG-----MVKFITKD 313 (511) Q Consensus 243 g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~----~~~-----~~~~~~~~ 313 (511) +.|.+.++++.++.+|..+|.+...+-....-..++.-...+..............+... .+| .+++...+ T Consensus 316 ~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~ 395 (725) T protein:vir:92 316 YEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENP 395 (725) T ss_pred ccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCccceeeccccccccccccccCCcccCCC Confidence 348889999999999999999987776544433322221222222222221111122111 111 12222222 Q ss_pred CCHHHHHHHHHHHHHHHHHHhCcccccccccc-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--- Q lcl|NC_018086. 314 VNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMN--- 389 (511) Q Consensus 314 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~--- 389 (511) .-...+...+......|..++++.+-..|..+ +.||+|+.................+..+.+++.++++.+..... T Consensus 396 ~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~~~~ 475 (725) T protein:vir:92 396 EVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVP 475 (725) T ss_pred CchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 22345556788888999999997765555544 47999999887766666666666666666666665555432111 Q ss_pred -------CCcc--c----------------------cccceeEEeCCCCCcCHHHHHHHHHHHhccCCh------HHHHH Q lcl|NC_018086. 390 -------KAKD--L----------------------KPYEVTPVFVRNLPQSYAELADMAVKLRDMLPD------ETIIN 432 (511) Q Consensus 390 -------~~~~--~----------------------~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~------et~~~ 432 (511) .... + ...++.|.=.|..+.-..+....++.+.+.++. -++.. T Consensus 476 r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~~~~~~~~~~~l~~ 555 (725) T protein:vir:92 476 RNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQ 555 (725) T ss_pred cEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHHHHHHhcccchhHHHHHHHH Confidence 0000 0 012333333333322233444445555433332 12222 Q ss_pred hCC--CCCCHHHHHHHHHHHHHHHHHHH-------HhhccccccCCCCCCccccccCCCCCCccccccCCCCcccc-ccc Q lcl|NC_018086. 433 QFP--WITDARQEVEKADAQRQKRADIA-------LQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKE-QEK 502 (511) Q Consensus 433 ~l~--~v~d~~~E~~ri~~E~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 502 (511) .+. ..+..++..++++++........ ......+......... +.....+... ...... +-. T Consensus 556 ~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~---e~~~~qa~~~------~~qae~~kaq 626 (725) T protein:vir:92 556 YFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDP---AMVQAQGVLL------QGQAELAKAQ 626 (725) T ss_pred HhhcccchHHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHH---HHHHHHHHHH------HHHHHHHHHH Confidence 222 12222344455543221110000 0000000000000000 0000000000 000000 000 Q ss_pred cCCCCC-------CCC Q lcl|NC_018086. 503 AIQKKP-------KTD 511 (511) Q Consensus 503 ~~~~~~-------~~~ 511 (511) ....|. .++ T Consensus 627 aE~~k~q~~a~~~~~~ 642 (725) T protein:vir:92 627 NQTLSLQIDAAKVEAQ 642 (725) T ss_pred HHHHHHHHHHHHHHHH Confidence 000000 000 No 107 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.39 E-value=1.1e-11 Score=80.57 Aligned_cols=450 Identities=11% Similarity=0.046 Sum_probs=202.8 Q ss_pred CCCccchhhcccccCchhhHhhhhc-------cCCC----HHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCcc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIR-------RNFD----LRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDT 69 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~----~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~ 69 (511) .+...........-......+++.. .... ..-+..+.--....... ..+..||....-+-+ . T Consensus 40 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~f~gy------q 112 (765) T protein:vir:96 40 LGKIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVP-TMLQDWYNSQGFIGY------Q 112 (765) T ss_pred HHHHhhcccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhccCccchh-hHHHhhhcccCCccH------H Confidence 0000000000111111111111110 0000 00011111000000111 112333332211110 0 Q ss_pred ccccceeccchHHHHHHHHHhhhhccCceecCch-----hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC- Q lcl|NC_018086. 70 NKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDE-----KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR- 143 (511) Q Consensus 70 ~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~-----~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~- 143 (511) ...-+ ..+.+++.+|+..+.-++.+++.+.+++ +..+.|...|++=++...+.++.+++-.||.+|+++-.+. T Consensus 113 l~alY-~~~~l~rkiVd~pAeDa~R~g~~I~~~~~e~~~~~~~~l~~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~~~ 191 (765) T protein:vir:96 113 ACAII-SQHWLVDKACSMSGEDAARNGWELKSDGRKLSDEQSALIARRDMEFRVKDNLVELNRFKNVFGVRIALFVVESD 191 (765) T ss_pred HHHHH-HhCchhhhhhhcchHHhhcCCceeecCccccCHHHHHHHHHHHHHhhHHHHHHHHHHHhhhceeeEEEEEeccc Confidence 00001 1257788999999999999999886643 2345677778877889999999999999999998775532 Q ss_pred CCc---------------e-EEEEEcccceEEEe---cCCCCCceEEE-EEEEEEeecCCcceEEEEEEEcCCcEEEEEE Q lcl|NC_018086. 144 NKK---------------H-RFKAVSPMNCLIAY---SADLDEEPVAA-IYYNTVISDITGHQIRTYEVYTEDLIYKFST 203 (511) Q Consensus 144 ~g~---------------~-~i~~~~p~~~~~v~---d~~~~~~~~~~-v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 203 (511) +++ + .+.+++|..+.+.- ...+...+-++ ..+|.+ .++ -+.+.++++|.. T Consensus 192 D~~~l~~PL~~~~I~kg~~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~P~~y~i----~g~------~IH~SRli~~~g 261 (765) T protein:vir:96 192 DPDYYEKPFNPDGIAPGSYKGISQIDPYWAMPQLTAESTADPSAEHFYEPDFWII----SGK------KYHRSHLVVVRG 261 (765) T ss_pred CcchhhccccccccccceeeEEEEechhhcccccchhccccccccccCcceeeee----cCc------eeccceEEEecC Confidence 211 1 13344444433311 00000000000 001110 000 011222222210 Q ss_pred ccCcccccccccccccccccceeccCCccceEeec-CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCC Q lcl|NC_018086. 204 DDEREVYREIPEELEIKDYEVHPNLLQKFPVLEII-ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFD 282 (511) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~ 282 (511) .. +|-+.-+ ++-.|+|.++.+.+-+.+++++.-.....+..+...++.+.+.. T Consensus 262 -------------------------~~-lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~ 315 (765) T protein:vir:96 262 -------------------------PQ-PPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTSTIHVDVEK 315 (765) T ss_pred -------------------------CC-chhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeeechHh Confidence 00 1111101 12258999999999999999999888888888888777766543 Q ss_pred CCcc-chhhhh------hh-hCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccc-ccccc--c-CccHH Q lcl|NC_018086. 283 LSAD-SDSISN------MK-NDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDL-VSKDF--T-AASGQ 350 (511) Q Consensus 283 ~~~~-~~~~~~------~~-~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~-~~~~~--~-~~Sg~ 350 (511) .-.. ...... .+ ..+++.+..+.+.+. .+.+...+...++.....|...+++|-. .++.. | |+||+ T Consensus 316 ~l~~~~~l~~r~~~~~~~r~n~g~~~id~ee~~e~--~s~~lsgl~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe 393 (765) T protein:vir:96 316 AIANEDAFNARLAFWIANRDNHGVKVIGIDETMEQ--FDTNLSDFDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGE 393 (765) T ss_pred hhccHHHHHHHHHHHHHhcCCceeEEecCCcceeE--EecccCCHHHHHHHHHHHHHhhhCCCeeeeccCCcccccCcch Confidence 2111 111111 11 234566666655544 4466778888999999999999999963 33432 2 56777 Q ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHH-------H- Q lcl|NC_018086. 351 ALKAATQPLENKSAVKE-SKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVK-------L- 421 (511) Q Consensus 351 Ai~~~~~~l~~k~~~~~-~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~-------~- 421 (511) .=...|.. .++.+| ..+...|++++.+|+... . .+ .++++.|++-...++++.|+...+ + T Consensus 394 ~D~~nYyD---~I~s~Qe~~l~p~le~L~~li~~s~----~---i~-~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~ 462 (765) T protein:vir:96 394 HETISYHE---ELESIQEHIFDPLLERHYLLLAKSE----S---ID-VQLEIVWNPVDSTTSQQQAELNNKKAATDEIYI 462 (765) T ss_pred HHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhc----C---CC-CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHH Confidence 53333333 333333 567889999988876531 1 11 258999999999999988875433 2 Q ss_pred -hccCChHHHHHhCC------CCCCHHHHHHH---HHHHHHHHHHHHHhhccccccCCCCCCc------cccccCCCCCC Q lcl|NC_018086. 422 -RDMLPDETIINQFP------WITDARQEVEK---ADAQRQKRADIALQNFKQTSAVQGASTA------AANKLDKNPAN 485 (511) Q Consensus 422 -~g~~s~et~~~~l~------~v~d~~~E~~r---i~~E~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~ 485 (511) .|+++...++..|. +....+.+.+. +..+.... ....+...+.....+. ...+++..+.. T Consensus 463 ~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe~~~~----~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~ 538 (765) T protein:vir:96 463 NSGVVSPDEVRERLRDDPRSGYNRLTDDQAETEPGMSPENLAE----LEKAGAQSAKAKGEAERAEAQAGAVEGAGDPVP 538 (765) T ss_pred hcCCCCHHHHHHHHhccccCCCCCCCccccccccCCCcccccc----ccCCCcccccccCccccccCCCCccCCCCcccc Confidence 37888888887763 21111111110 00000000 0000000000000000 00000000000 Q ss_pred ccc--cccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 486 TST--ITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 486 ~~~--~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) ..+ ..+..+.+.+..+.+..+|..+- T Consensus 539 ~~p~~~~p~~~~~~~~~g~~~~~p~~~~ 566 (765) T protein:vir:96 539 AAPRGTKPLAKAAEEGAGEAATPPSRPN 566 (765) T ss_pred cCCcccCCccccccccCccccCcccccc Confidence 000 00011111111122222211111 No 108 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.36 E-value=6.1e-11 Score=76.58 Aligned_cols=449 Identities=9% Similarity=0.007 Sum_probs=233.5 Q ss_pred CCH-HHHHHHHH-HHHHHHHHHHHHHHHhcCCCcccccCCcCccccccc-----------e-----eccchHHHHHHHHH Q lcl|NC_018086. 28 FDL-RELITLAE-MHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNS-----------K-----IVHNFPKLLVDTST 89 (511) Q Consensus 28 ~~~-~~l~~~~~-~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~-----------r-----i~~n~~k~ivd~~~ 89 (511) |++ +.++..+. ....++.......+-|+|-..-..........-++. | .-++|++-+|+..+ T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 80 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLE 80 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 221 11222221 111111222233355676432111000001110110 0 12589999999999 Q ss_pred hhhhcc-Cceec-----Cchh----hHHH---HHHHHhc-------cChhHHHHHHHHHHhhCCeEEEEeeeCCCC---- Q lcl|NC_018086. 90 AYLAGE-PITES-----GDEK----TIKA---MQPVFKE-------NYVTDVNSEEVKLSGIFGHCFEIHWIDRNK---- 145 (511) Q Consensus 90 ~~l~g~-~~~~~-----~d~~----~~~~---l~~~~~~-------n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g---- 145 (511) +.++|. ++.+. .|.+ ..+. +++-|.. .+|......+.+.....|.+|+.....+.+ T Consensus 81 ~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~~ 160 (548) T protein:vir:95 81 ERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYTF 160 (548) T ss_pred HhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccccC Confidence 999984 44332 2222 2233 3333432 347788888999999999999877654422 Q ss_pred ----ceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEE-EEEEcCCcEEEEEEccCcccccccccccccc Q lcl|NC_018086. 146 ----KHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRT-YEVYTEDLIYKFSTDDEREVYREIPEELEIK 220 (511) Q Consensus 146 ----~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 220 (511) .+++..++|..+-.-++.. ......+|. .+..|..+-| +.-..++...... .. T Consensus 161 g~~~~~~lqliepd~l~~~~~~~-~~~i~~GIE-----~D~~Grp~aY~i~~~hPgd~~~~~---~~------------- 218 (548) T protein:vir:95 161 ATSVPFALELLEPDYLPFSYNNL-SKGIVQGIE-----RDTWRRKRAYHLLKDHPGNLQTLG---GS------------- 218 (548) T ss_pred CcccceEEEEechhhcCCCCCCC-CCceeeeeE-----ECCCCceEEEEEeecCCCcccccc---cc------------- Confidence 2588999999874433322 233444543 2333444322 1111122211110 00 Q ss_pred cccceeccCCccc---eEeecC-----CcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCcc------ Q lcl|NC_018086. 221 DYEVHPNLLQKFP---VLEIIA-----NEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSAD------ 286 (511) Q Consensus 221 ~~~~~~~~~g~iP---vv~~~n-----~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~------ 286 (511) ..+-+|| |+++.. -..|.|.|..++..+..++....-........+.--.+++....+.. T Consensus 219 ------~~~~rvpA~~VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~~~~~ 292 (548) T protein:vir:95 219 ------LAVKRVEAERIIHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSYTVEPGK 292 (548) T ss_pred ------cceeeechhHheecccccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccCCCCc Confidence 0011222 333322 23689999998888888887776666655555554555554322111 Q ss_pred --chhhhhhhhCcee-eecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccc-ccccccCccHHHHHHHHHHHHHH Q lcl|NC_018086. 287 --SDSISNMKNDRVI-VTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDL-VSKDFTAASGQALKAATQPLENK 362 (511) Q Consensus 287 --~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~-~~~~~~~~Sg~Ai~~~~~~l~~k 362 (511) ......+..+.++ .+..+.++++.+.+.+...+..+...+...|....++|-- ..+.++ .|-.++++.+...... T Consensus 293 ~~~~~~~~~~pG~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s-~nYSS~R~~l~e~~r~ 371 (548) T protein:vir:95 293 DRKNRTIPIAPGMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYD-GTYSAQRQELVEGWLG 371 (548) T ss_pred ccccccccccCCccccccCCCceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc-hhHHHHHHHHHHHHHH Confidence 1111123334444 3678889999988878889999999999999998888843 234454 4778888888888888 Q ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHhcCCCc--c-cc-ccceeEEeCCCCC--cCHHHHHHHHHHH--hccCChHHHHHh Q lcl|NC_018086. 363 SAVKESKFRKVLAK-RYELVCSYLEFMNKAK--D-LK-PYEVTPVFVRNLP--QSYAELADMAVKL--RDMLPDETIINQ 433 (511) Q Consensus 363 ~~~~~~~~~~~l~~-~~~li~~~~~~~~~~~--~-~~-~~~i~i~f~~~~p--~d~~e~a~~~~~~--~g~~s~et~~~~ 433 (511) ....+..|...+.+ +++..+...-..+... . .+ ..-+.+.|..+-. .|....+++...+ +|+.|.+.++.. T Consensus 372 ~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~ 451 (548) T protein:vir:95 372 YDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVARA 451 (548) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHH Confidence 88777777765555 4554444332222211 0 01 1125678844332 5777777766555 589999999999 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHhh--cccc-c---cCCCCCCccc-----------cc--------cCCCCCCccc Q lcl|NC_018086. 434 FPWITDARQEVEKADAQRQKRADIALQN--FKQT-S---AVQGASTAAA-----------NK--------LDKNPANTST 488 (511) Q Consensus 434 l~~v~d~~~E~~ri~~E~~~~~~~~~~~--~~~~-~---~~~~~~~~~~-----------~~--------~~~~~~~~~~ 488 (511) .|. |+++.++.+..|.+...+.-+.- .+.. . +..+..+.+. ++ +++=+-+... T Consensus 452 ~G~--D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (548) T protein:vir:95 452 RGR--DPRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGAGLPVPGPD 529 (548) T ss_pred hCC--CHHHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCCCchhhhccccccccccchhHHhhccCCCCCcCCCCC Confidence 874 89998888888877655543211 0000 0 0011111000 00 0000001111 Q ss_pred cccCCCCccccccccCCCC Q lcl|NC_018086. 489 ITTTDPVAAKEQEKAIQKK 507 (511) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~ 507 (511) .....+++....+-+-+|+ T Consensus 530 ~~~~~~~~~~~~~~~~~~~ 548 (548) T protein:vir:95 530 FPNESNNGGADGQPSNPDP 548 (548) T ss_pred CCcccccCCCCCCCCCCCC Confidence 1122334444444444444 No 109 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.36 E-value=1.7e-11 Score=79.69 Aligned_cols=399 Identities=9% Similarity=0.044 Sum_probs=189.9 Q ss_pred cCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcC---ccccccce--eccchHHHHHHHH Q lcl|NC_018086. 14 ITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFD---DTNKPNSK--IVHNFPKLLVDTS 88 (511) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~---~~~~~~~r--i~~n~~k~ivd~~ 88 (511) +..++.++. ++...-+-+.....+.-......... .....-.. -.+.+++.+|+.. T Consensus 1 ~~~~m~~~~-------------------~~~~~~D~~~~~~~~~~g~~~~~~~~~~~~~~~~l~~~Y~~~~l~~~~Vd~~ 61 (435) T protein:vir:79 1 MGVFMSDKV-------------------KAITKEDGYNEIFGSKDGTFRPNAFYMQRAAFKALSQFYEEDGMARRIVDVI 61 (435) T ss_pred CCccccccc-------------------ccchhhcchhhhhcccccccccCcccCCcCCHHHHHHHHhcCchhhhhhccc Confidence 222222221 00001111111122211000000000 00000001 1357889999999 Q ss_pred HhhhhccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC----------CCce-EEEEEcccce Q lcl|NC_018086. 89 TAYLAGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR----------NKKH-RFKAVSPMNC 157 (511) Q Consensus 89 ~~~l~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~----------~g~~-~i~~~~p~~~ 157 (511) +.-++.+++.+.++++ .+.+...|.+=++...+.++.+.+..||.|++++-... +|.+ .+.+++|..+ T Consensus 62 aed~~r~g~~i~g~~~-~~~~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i 140 (435) T protein:vir:79 62 PEEMVTPGFKVDGVKN-EKSFKSRWDELRLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQI 140 (435) T ss_pred hHHhhcCCceecCCCh-HHHHHHHHHHhhHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhc Confidence 9999999999987653 35577777777888899999999999999988876532 1222 3445555544 Q ss_pred EEEecCCCCCceEE-EEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEe Q lcl|NC_018086. 158 LIAYSADLDEEPVA-AIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLE 236 (511) Q Consensus 158 ~~v~d~~~~~~~~~-~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 236 (511) .+-.-+.+...+-+ -..+|.+....+... . .+.+.++++|... .+|-.. T Consensus 141 ~~~~~~~dp~sp~fg~P~~y~v~~~~~~~~---~-~iH~SRli~~~g~--------------------------~~p~~~ 190 (435) T protein:vir:79 141 TIHERETNARSVRYGEPKLYKISPGGDIPE---F-FVHYSRICIIDGE--------------------------RVSNEK 190 (435) T ss_pred cchhhccCCcccccCcceEEEEecCCCCCc---e-EEcceeEEEecCC--------------------------cchhhh Confidence 33111111110100 011122111000000 0 1112222222100 011111 Q ss_pred -ecCCcccCchh-HHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCC---Ccc-c-hhhhh---h---h-hCceeeec Q lcl|NC_018086. 237 -IIANEERLGDF-EAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDL---SAD-S-DSISN---M---K-NDRVIVTD 302 (511) Q Consensus 237 -~~n~~~g~s~~-~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~---~~~-~-~~~~~---~---~-~~~~i~~~ 302 (511) ..++-.|.|.+ +.+.+-+..++++.......+..+..+.+.++|... ... . ..... . + .++.+.+. T Consensus 191 ~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~ 270 (435) T protein:vir:79 191 RRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDDEEGRYAARLRLAQVDDESGVGKAIGID 270 (435) T ss_pred ccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcCccchHHHHHHHHHHHHhcCCCCceeEe Confidence 12344578876 678888888998888888888777777777665311 011 1 11111 1 1 13444443 Q ss_pred -CCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccc-cccc-c--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 303 -EDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLV-SKDF-T--AASGQALKAATQPLENKSAVKESKFRKVLAKR 377 (511) Q Consensus 303 -~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~-~~~~-~--~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~ 377 (511) ++.+.+.+ +.+...+...++.....|...+++|-.- ++.. + |+||..-...|...+.... +..+...++++ T Consensus 271 ~~~e~~e~~--~~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~~yyd~i~~~Q--e~~l~p~l~~l 346 (435) T protein:vir:79 271 ATDEEYEVL--NSDVSGVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALETFYKLIDRKR--VEDYKPILEFL 346 (435) T ss_pred cCCcceEEE--ecccCCHHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHHHH--HHHHHHHHHHH Confidence 33444444 4566778889999999999999999643 3332 1 4677664444444433322 35678888888 Q ss_pred HHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 378 YELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADI 457 (511) Q Consensus 378 ~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~ 457 (511) +++++. . .++++.|+|-...++++.|+...+.+...+. + -..+.+ ++++..++++. . T Consensus 347 ~~li~~-------s-----~d~~~~f~pL~~~sekEkAei~~~~a~a~~~--~-~~~g~i-~~~e~r~~L~~----~--- 403 (435) T protein:vir:79 347 LPFMIS-------E-----TEWSIEFEPLSVPSDKDKAEIMAKNVESVVK--L-KAEQAI-NLKETRDTLRS----I--- 403 (435) T ss_pred HHHhhc-------C-----CCCeEEeCCCCCCCHHHHHHHHHHHHHHHHH--H-HhcCCC-CHHHHHHHHHH----h--- Confidence 887642 1 3678999999999999998876655422211 1 112222 23222222211 0 Q ss_pred HHhhccccccCCCCCCccccccCCCCCCccccccCCCCcccc Q lcl|NC_018086. 458 ALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKE 499 (511) Q Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (511) ....+-.....+.. ++..+.+....+ +++-+. T Consensus 404 -~~~~~~~~~~~~~~----~~~~d~~~~~~~-----e~g~~~ 435 (435) T protein:vir:79 404 -CPDLKIMDNDNIEL----PEPEDLDPEPGQ-----EGGLNK 435 (435) T ss_pred -ccccCCCCcccccC----CccccCCCCCCC-----CCCCCC Confidence 00111001110111 111111111111 111111 No 110 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=99.35 E-value=1.5e-12 Score=85.40 Aligned_cols=468 Identities=10% Similarity=0.018 Sum_probs=203.8 Q ss_pred ccCCCHHHHHHHHHHH-------HHHHHHHHHHHHHh--cCCCcccccC--CcC---ccccccceeccchHHHHHHHHHh Q lcl|NC_018086. 25 RRNFDLRELITLAEMH-------SRSSSAYGVLYDYY--KGNHIAIQSR--TFD---DTNKPNSKIVHNFPKLLVDTSTA 90 (511) Q Consensus 25 ~~~~~~~~l~~~~~~~-------~~~~~~~~~~~~yY--~G~~~~~~~~--~~~---~~~~~~~ri~~n~~k~ivd~~~~ 90 (511) ..+.+.+.+.++...+ .+.+.+...-.+|| .|.+-...-. ... ..+.| .+.+|..+.+|+..++ T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP--~~~~N~i~~~v~~v~g 78 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYP--KFEINKVATELNRIIS 78 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCC--ceEecchHHHHHHHhh Confidence 1111111222222222 23333334445666 5665322111 000 01112 5788999999999999 Q ss_pred hhhccCcee--c-----CchhhHH----HHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC---------CCceEEE Q lcl|NC_018086. 91 YLAGEPITE--S-----GDEKTIK----AMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR---------NKKHRFK 150 (511) Q Consensus 91 ~l~g~~~~~--~-----~d~~~~~----~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~---------~g~~~i~ 150 (511) +.--+.+.+ . +|.+..+ .+..++..|+.+...+.+..+++++|.||+-+..+- .+++.+. T Consensus 79 ~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~d~~~~~~~i~i~ 158 (706) T protein:vir:10 79 EYRNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFVNEYDPMDERQRIAVE 158 (706) T ss_pred HHHhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccccccCCCCCCccceee Confidence 977664432 2 1222222 245567789999999999999999999998885431 2244454 Q ss_pred E-EcccceEEEecCCCCC----ceE-EEEEEEEEee------------------------cCCcceEEEEEEEcCCcEE- Q lcl|NC_018086. 151 A-VSPMNCLIAYSADLDE----EPV-AAIYYNTVIS------------------------DITGHQIRTYEVYTEDLIY- 199 (511) Q Consensus 151 ~-~~p~~~~~v~d~~~~~----~~~-~~v~~~~~~~------------------------~~~~~~~~~~~~~~~~~i~- 199 (511) . .+|.+. ++||+.... ... ++.+.|...+ +.....+...+.|...... T Consensus 159 ~v~~p~~~-v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d~~~~~eyy~~~~~~~ 237 (706) T protein:vir:10 159 PIYDPARS-VWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPDVVYIAKYYEVRKESV 237 (706) T ss_pred eeccchhc-eecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCCcceecccccccceeE Confidence 3 367653 245543211 111 1222111000 0011222333333332211 Q ss_pred ---EEEEccCcccccccc----------ccccc----------------------ccccceeccCCccceEeecCC---- Q lcl|NC_018086. 200 ---KFSTDDEREVYREIP----------EELEI----------------------KDYEVHPNLLQKFPVLEIIAN---- 240 (511) Q Consensus 200 ---~~~~~~~~~~~~~~~----------~~~~~----------------------~~~~~~~~~~g~iPvv~~~n~---- 240 (511) .|.....+....... ...+. ...+..|.+.+.+|+|+|... T Consensus 238 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~~~P~vP~~g~r~~~ 317 (706) T protein:vir:10 238 DVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFI 317 (706) T ss_pred EEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCCCccceEEEeeccccc Confidence 111100000000000 00000 000112333366777776432 Q ss_pred ---cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhh-----hhCceeeecC----CCc-- Q lcl|NC_018086. 241 ---EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNM-----KNDRVIVTDE----DGM-- 306 (511) Q Consensus 241 ---~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~-----~~~~~i~~~~----~~~-- 306 (511) ....|.+.++++.++.+|..+|.+.+.+-.... ..-.|.. +...+..... .....+...+ +|. T Consensus 318 d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~--~~~~~~~-~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~g~i~ 394 (706) T protein:vir:10 318 DDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPG--QTPIVDM-EQIRGLEQHWEGRNRKRPAFLPLRTVTDKTGNVV 394 (706) T ss_pred cccCcccceeccchhhHHHHHHHHHHHHHHHHhcCC--cccccch-hHHHHHHHHhhhcccccccchhcccccCCCCccc Confidence 123477889999999999999999887643333 2222221 1111110000 0001111110 111 Q ss_pred -----eeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 307 -----VKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELV 381 (511) Q Consensus 307 -----~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li 381 (511) ..++..+.-...+...+......|..+|++.+-..|..+|.||+||.................|..+.+++.+++ T Consensus 395 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~l 474 (706) T protein:vir:10 395 APANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPSNVARETVNSLLNRSDMASFIYLDNMAKSLKRAGEIW 474 (706) T ss_pred ccccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 122222222244566677778889999998877777778899999999888777777778888888888887766 Q ss_pred HHHHHh----------cCCCccc---------------------c----ccceeEEeCCCCCcCHHHHHHHHHHHhcc-C Q lcl|NC_018086. 382 CSYLEF----------MNKAKDL---------------------K----PYEVTPVFVRNLPQSYAELADMAVKLRDM-L 425 (511) Q Consensus 382 ~~~~~~----------~~~~~~~---------------------~----~~~i~i~f~~~~p~d~~e~a~~~~~~~g~-~ 425 (511) +.+... .+..... + ..+|.|.=.+..+.-..+..+.++.+.+. . T Consensus 475 L~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~~~~~ 554 (706) T protein:vir:10 475 LSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLLQGML 554 (706) T ss_pred HHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHHHhcC Confidence 665432 1110000 0 01233333344444445555666665432 2 Q ss_pred Ch--HH------HHHhCCCCCCHHHHHHHHHHHHHH-------------HHH-H-HHhhccccccCCCCCCccccccCCC Q lcl|NC_018086. 426 PD--ET------IINQFPWITDARQEVEKADAQRQK-------------RAD-I-ALQNFKQTSAVQGASTAAANKLDKN 482 (511) Q Consensus 426 s~--et------~~~~l~~v~d~~~E~~ri~~E~~~-------------~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~ 482 (511) +. .+ +++.+. ++..++-.+++++.... .+. . .+..... .... ....+.. ... T Consensus 555 p~~~~~~~l~~~~~~~~d-~p~~~e~~e~irk~~~~q~~~~~~~~~eq~~~~q~qq~q~~q~-~~~~--~~~~aq~-~~~ 629 (706) T protein:vir:10 555 PQDPMRPALMGIIIDNME-GEGLDDFKAFNRRQLLTQGIVKPRNQQEQAIVQQAQQAQATQP-DPNM--LLAQAQM-VVA 629 (706) T ss_pred CcchhhHHHHHHHHhhcC-ccchHHHHHHHHHhhcccCCccccchhHHHHHHHHHHHHHHHH-HHHH--HHHHHHH-HHH Confidence 21 12 222222 12233334444332110 000 0 0000000 0000 0000000 000 Q ss_pred CCCccccccCCCCccccccc---cCCCCCCCC Q lcl|NC_018086. 483 PANTSTITTTDPVAAKEQEK---AIQKKPKTD 511 (511) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~ 511 (511) .+. .+ ..+.+.. ...-++.+| T Consensus 630 qA~------~~--k~~a~~~q~~~~a~~a~~q 653 (706) T protein:vir:10 630 QAE------AQ--KSQNETVQTQIKAFTAQQD 653 (706) T ss_pred HHH------HH--HHHHHHHHHHHHHHHHHHH Confidence 000 00 0000000 000111222 No 111 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.34 E-value=8e-11 Score=75.93 Aligned_cols=385 Identities=10% Similarity=0.067 Sum_probs=188.5 Q ss_pred HHHHHHHHHHhcCCCcccccCCcCccccccc---e--eccchHHHHHHHHHhhhhccCceecCchhhHHHHHHHHhccCh Q lcl|NC_018086. 44 SSAYGVLYDYYKGNHIAIQSRTFDDTNKPNS---K--IVHNFPKLLVDTSTAYLAGEPITESGDEKTIKAMQPVFKENYV 118 (511) Q Consensus 44 ~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~---r--i~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~~~~n~~ 118 (511) +.+.+-+...+.|-++-.... ..+...... . -.+.+++.+|+..+.-++.+++.+.++++. ..+..-|.+=++ T Consensus 1 ~~~~D~~~n~~~gg~~~~~~~-~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~~~~~~-~~~~~~~~~l~~ 78 (422) T protein:vir:10 1 MVKTDSYANIFLGGSDGSEIY-GSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE-PAFWSRWDDLEM 78 (422) T ss_pred CccchhhHHHHcCCCCCcccc-CcccccCHHHHHHHHHhChhhHHHHhhhhHHHhcCCccccCCCHH-HHHHHHHHHhhH Confidence 111122222233322211000 000000000 0 136788899999999999999999877643 446666777788 Q ss_pred hHHHHHHHHHHhhCCeEEEEeeeCC----------CCce-EEEEEcccceEEEecCCCCCceEEE-EEEEEEeecCCcce Q lcl|NC_018086. 119 TDVNSEEVKLSGIFGHCFEIHWIDR----------NKKH-RFKAVSPMNCLIAYSADLDEEPVAA-IYYNTVISDITGHQ 186 (511) Q Consensus 119 ~~~~~~~~~~a~~~G~~~~~v~~~~----------~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~-v~~~~~~~~~~~~~ 186 (511) ...+.++.+.+..||.|++++-... .|.+ .+.+++|..+.|..-+.+...+-++ -.+|.+.....+.. T Consensus 79 ~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~~dp~s~~fg~P~~y~v~~~~~~~~ 158 (422) T protein:vir:10 79 TQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTREENPRNARFGEPLTYRITTNESDMF 158 (422) T ss_pred HHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhcccCccccccCcceEEEEecCCCCcc Confidence 8999999999999999998876532 1222 2445555554432111111111111 11122111111100 Q ss_pred EEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccce-EeecCCcccCchhHH-HHHHHHHHHHHHHHH Q lcl|NC_018086. 187 IRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPV-LEIIANEERLGDFEA-QLSLIDAYNLAVSDS 264 (511) Q Consensus 187 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv-v~~~n~~~g~s~~~~-v~~l~d~~~~~~s~~ 264 (511) . .+.+.++++|... .+|- ....++-+|.|.+.. +.+-+..++++.-.. T Consensus 159 ---~-~iH~SRli~~~g~--------------------------~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~ 208 (422) T protein:vir:10 159 ---Y-DVHYSRIHIIDGE--------------------------RIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLA 208 (422) T ss_pred ---e-eeccceeEEeCCC--------------------------CchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHH Confidence 0 1122222222100 0121 122344568898886 678888888888888 Q ss_pred HHHHHHhcCceeEeecCCC---Cccc--hhhhhh------h-hCceeeec-CCCceeeeecCCCHHHHHHHHHHHHHHHH Q lcl|NC_018086. 265 VNDIAYWNDAYLWLQGFDL---SADS--DSISNM------K-NDRVIVTD-EDGMVKFITKDVNDKHIENIKNRAKLDIF 331 (511) Q Consensus 265 ~~~~~~~~~p~l~~~G~~~---~~~~--~~~~~~------~-~~~~i~~~-~~~~~~~~~~~~~~~~~~~~~~~l~~~i~ 331 (511) ...+..+....+.+.|... .... ...... + ..+.+.+. ++.+.+. .+.+.+.+...++.....|. T Consensus 209 ~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~--~~~~lsgl~~~~~~~~~~ia 286 (422) T protein:vir:10 209 TQLLKRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSV--LNSDIGGIDAFLDKKFDRIV 286 (422) T ss_pred HHHHHHhccccccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEecCCcceEE--EecccCChHHHHHHHHHHHH Confidence 8888777777777766311 1111 101100 1 12334443 3444444 45666678888999999999 Q ss_pred HHhCcccccc-ccc-c--CccHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCC Q lcl|NC_018086. 332 SLSQTPDLVS-KDF-T--AASGQALKAATQPLENKSAVK-ESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRN 406 (511) Q Consensus 332 ~~s~~p~~~~-~~~-~--~~Sg~Ai~~~~~~l~~k~~~~-~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~ 406 (511) ..+++|-.-. +.. + |+||..-...|... ++.+ +..+...+++++++|+. ..++++.|+|- T Consensus 287 aa~~IP~t~L~G~s~~Glnatgd~d~~~yyd~---i~~~Qe~~l~p~l~~l~~~i~~------------s~~~~~~f~pL 351 (422) T protein:vir:10 287 ALSGIHEIILKNKNVGGVSSSQNTALETFHKL---VDRKRNAELLPILEFLIPFIVN------------AEEWSVEFNPL 351 (422) T ss_pred hhhCCCeeeeccCCcccccccchHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhcc------------cCCcEEEeCCC Confidence 9999996433 322 1 35666544433333 3333 35678889888888652 12678999999 Q ss_pred CCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCc Q lcl|NC_018086. 407 LPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANT 486 (511) Q Consensus 407 ~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (511) ...++++.|+...+.+...+. ++. .+ +-++++..++++.. ....+-..+..+.+....+.. +.|... T Consensus 352 ~~~sekekaei~~~~a~a~~~--~~~-~g-~i~~~e~r~~L~~~--------~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 418 (422) T protein:vir:10 352 AQESSKDKAEILEKNVNSIAA--LIA-AG-AMDIDEARDTLRTI--------APEVKINDGSVETEVTISETS-NDPLEV 418 (422) T ss_pred CCCCHHHHHHHHHHHHHHHHH--HHh-cC-CCCHHHHHHHhhhh--------cccccCCCCCCccccchhhcC-CCCCCC Confidence 999999988876555322211 111 12 12222222222111 000000000000000000000 000000 Q ss_pred cccccCCCCcc Q lcl|NC_018086. 487 STITTTDPVAA 497 (511) Q Consensus 487 ~~~~~~~~~~~ 497 (511) |.+- T Consensus 419 -------~~~d 422 (422) T protein:vir:10 419 -------PTDD 422 (422) T ss_pred -------CCCC Confidence 0000 No 112 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=99.34 E-value=2.6e-12 Score=84.07 Aligned_cols=480 Identities=11% Similarity=0.026 Sum_probs=195.8 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHH----HHHHHHHHHHHH--HHHHHHhcCCCcccccC-CcCccccc- Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELIT----LAEMHSRSSSAY--GVLYDYYKGNHIAIQSR-TFDDTNKP- 72 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~----~~~~~~~~~~~~--~~~~~yY~G~~~~~~~~-~~~~~~~~- 72 (511) ||=...++ ...+.. ........+.+. +.-..||.|.+-...-. .-+..+.. T Consensus 1 ma~~~~~~---------------------~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~ 59 (708) T protein:vir:17 1 MAETLEKK---------------------HERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFE 59 (708) T ss_pred CchhHHHH---------------------HHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhc Confidence 11111111 011111 111112222222 22236899986422100 00000111 Q ss_pred -cceeccchHHHHHHHHHhhhhccCce--ecC-----chhhHH----HHHHHHhccChhHHHHHHHHHHhhCCeEEEEee Q lcl|NC_018086. 73 -NSKIVHNFPKLLVDTSTAYLAGEPIT--ESG-----DEKTIK----AMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHW 140 (511) Q Consensus 73 -~~ri~~n~~k~ivd~~~~~l~g~~~~--~~~-----d~~~~~----~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~ 140 (511) .-.+.+|.++.+|+..+++---+.+. +.. |.+..+ .+..+...|+.+...+.+..+++++|.||+-+. T Consensus 60 ~rP~~~~N~i~~~i~~v~g~e~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~ 139 (708) T protein:vir:17 60 KYPKFEINKVATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLT 139 (708) T ss_pred CCCceEEcchHHHHHHHHhhHhhCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeee Confidence 11477899999999999986655433 221 222222 345567789999999999999999999998774 Q ss_pred eC---C------CCceEEEEE-cc-cceEEEecCCCCCce----E-EEEEEEE-------------------------Ee Q lcl|NC_018086. 141 ID---R------NKKHRFKAV-SP-MNCLIAYSADLDEEP----V-AAIYYNT-------------------------VI 179 (511) Q Consensus 141 ~~---~------~g~~~i~~~-~p-~~~~~v~d~~~~~~~----~-~~v~~~~-------------------------~~ 179 (511) .+ + ..++.+..+ +| .+++ ||+.....- . ++++.|. .. T Consensus 140 ~d~~~e~d~~~~~~~i~i~~~~~~~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~ 217 (708) T protein:vir:17 140 SMLVNEYDPMDDRQRIAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEY 217 (708) T ss_pred ecccccCCCCCCccccceEeeccchhhee--cCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccc Confidence 32 1 123334332 33 4443 554331110 0 1111110 00 Q ss_pred ecCCcceEEEEEEEcCCcEE--EEEEc--cCccc-c-cccc----------------------c-------cccc-cccc Q lcl|NC_018086. 180 SDITGHQIRTYEVYTEDLIY--KFSTD--DEREV-Y-REIP----------------------E-------ELEI-KDYE 223 (511) Q Consensus 180 ~~~~~~~~~~~~~~~~~~i~--~~~~~--~~~~~-~-~~~~----------------------~-------~~~~-~~~~ 223 (511) ++.+...+..+++|...... .+... ..+.. . .... . ..+. .... T Consensus 218 ~~~~~d~vrv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~ 297 (708) T protein:vir:17 218 DWFDADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEK 297 (708) T ss_pred cccCCCeEEEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccC Confidence 11112333344444211110 01100 00000 0 0000 0 0000 1112 Q ss_pred ceeccCCccceEeecCC---ccc----CchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEee-----cCCCCc-----c Q lcl|NC_018086. 224 VHPNLLQKFPVLEIIAN---EER----LGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQ-----GFDLSA-----D 286 (511) Q Consensus 224 ~~~~~~g~iPvv~~~n~---~~g----~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~-----G~~~~~-----~ 286 (511) ..+-+++.+|+|+|... ..| -|.+.++++.++.+|..+|.+...+-.....+.++. |..... + T Consensus 298 ~~~~p~~~fP~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~ 377 (708) T protein:vir:17 298 PRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKK 377 (708) T ss_pred CCCCCCCccceEEEecccccccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccc Confidence 34445566777766431 122 366778999999999999999988766655443322 111100 0 Q ss_pred chhhhhhh--hCceeeecCCCce-eeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHH Q lcl|NC_018086. 287 SDSISNMK--NDRVIVTDEDGMV-KFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKS 363 (511) Q Consensus 287 ~~~~~~~~--~~~~i~~~~~~~~-~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~ 363 (511) ........ ....-.+..++.. .....+.-...+...+......|-.+||+.+...|..+|.||+||........... T Consensus 378 ~~~~~~~~~~~~~~g~v~~~a~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~sn~SG~Ai~~rq~qg~~~~ 457 (708) T protein:vir:17 378 RPAFLPLREVRDKYGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMAS 457 (708) T ss_pred hhhhhhhhccCCcccccccccCCcccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCccchHHHHHHHHHHHHHHHH Confidence 00000000 0111111111111 11112222245566678888899999998877777777899999998877766666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC----------CC-c----------------------ccc--ccceeEEeCCCCC Q lcl|NC_018086. 364 AVKESKFRKVLAKRYELVCSYLEFMN----------KA-K----------------------DLK--PYEVTPVFVRNLP 408 (511) Q Consensus 364 ~~~~~~~~~~l~~~~~li~~~~~~~~----------~~-~----------------------~~~--~~~i~i~f~~~~p 408 (511) ......+..+.+++.++++.+....- .. . ++. ..+|.|.=.+..+ T Consensus 458 ~~~~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~ 537 (708) T protein:vir:17 458 FIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYT 537 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCch Confidence 66666666666666666555443211 00 0 000 0122222222222 Q ss_pred cCHHHHHHHHHHHhccCChH---H------HHHhCCCCCCHHHHHHHHHHHHHHHH------------HHHHhhcccccc Q lcl|NC_018086. 409 QSYAELADMAVKLRDMLPDE---T------IINQFPWITDARQEVEKADAQRQKRA------------DIALQNFKQTSA 467 (511) Q Consensus 409 ~d~~e~a~~~~~~~g~~s~e---t------~~~~l~~v~d~~~E~~ri~~E~~~~~------------~~~~~~~~~~~~ 467 (511) .-..+..++++++.+.++.. + +++.+. .+..++-.++|+....... ........+... T Consensus 538 t~r~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~D-~p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~ 616 (708) T protein:vir:17 538 ARRDATVSVLTNVLSSMLPADPMRPAIQGIILDNID-GEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQP 616 (708) T ss_pred hHHHHHHHHHHHHHHhcCCccchhHHHHHHHHHhcC-CCChHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHHHHHH Confidence 22334445555554332211 1 222222 2333333444433211000 000000000000 Q ss_pred CCCCCCccccccCCCCCCccccccCCCCccccccccC-CCCCCCC Q lcl|NC_018086. 468 VQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAI-QKKPKTD 511 (511) Q Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 511 (511) ..-...........+ +. .+...+.....+. --+..++ T Consensus 617 ~~~~~eaqa~~~~~q-Ae------~~ka~aea~~~q~~a~q~~~~ 654 (708) T protein:vir:17 617 NPEMVLAQAQMVAAQ-AE------AQKATNETAQTQIKAFTAQQD 654 (708) T ss_pred HHHHHHHHHHHHHHH-HH------HHHHHHHHHHHHHHHHHHHHH Confidence 000000000000000 00 0000000000000 0001111 No 113 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=99.34 E-value=6.3e-11 Score=76.51 Aligned_cols=462 Identities=11% Similarity=0.030 Sum_probs=196.7 Q ss_pred ccCCCHHHHHHHHHHHH-------HHHHHHHHHHHHhc--CCCccccc----C-CcCccccccceeccchHHHHHHHHHh Q lcl|NC_018086. 25 RRNFDLRELITLAEMHS-------RSSSAYGVLYDYYK--GNHIAIQS----R-TFDDTNKPNSKIVHNFPKLLVDTSTA 90 (511) Q Consensus 25 ~~~~~~~~l~~~~~~~~-------~~~~~~~~~~~yY~--G~~~~~~~----~-~~~~~~~~~~ri~~n~~k~ivd~~~~ 90 (511) ..+.+.+.+.++...+. +-+.....-.+||. |.+-.... + .....+.| .+.+|.++.+|+..++ T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P--~~~~N~i~~~v~~v~g 78 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYP--KFEINKISTELNRIIS 78 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCC--eEEEccHHHHHHHHHh Confidence 11111122222222221 11222223456664 66532110 0 00111222 3678999999999999 Q ss_pred hhhccCce--ec-----CchhhHH----HHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC------C---CceEEE Q lcl|NC_018086. 91 YLAGEPIT--ES-----GDEKTIK----AMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR------N---KKHRFK 150 (511) Q Consensus 91 ~l~g~~~~--~~-----~d~~~~~----~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~------~---g~~~i~ 150 (511) +---+.+. +. +|.+..+ .+..+...|+.+...+.+..+++++|.||+-|..+- . +.+++. T Consensus 79 ~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~~~~~i~i~ 158 (720) T protein:vir:35 79 EYRHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMDERQRICLE 158 (720) T ss_pred HHHhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccCCCCcccceeeEe Confidence 98655443 22 1222222 245567789999999999999999999999886541 1 123333 Q ss_pred E-Eccc-ceEEEecCCCCCc-----eEEEEEEEE------------------------EeecCCcceEEEEEEEcCCcEE Q lcl|NC_018086. 151 A-VSPM-NCLIAYSADLDEE-----PVAAIYYNT------------------------VISDITGHQIRTYEVYTEDLIY 199 (511) Q Consensus 151 ~-~~p~-~~~~v~d~~~~~~-----~~~~v~~~~------------------------~~~~~~~~~~~~~~~~~~~~i~ 199 (511) . .+|. ++ .||+..... ..+++..|. ..++.+...+..+++|...... T Consensus 159 ~v~~~~~~v--~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~~v~i~E~~~~~~~~ 236 (720) T protein:vir:35 159 PIYDPARSV--WFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVDVVYIAKYYEVKKES 236 (720) T ss_pred cccCchhhe--eecccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccCCCceEEEEeeEEEEEE Confidence 2 2332 32 233322110 001111110 0011112233344444332211 Q ss_pred E----EEEccCccccccccc-------------------------------ccc-cccccceeccCCccceEeecCC--- Q lcl|NC_018086. 200 K----FSTDDEREVYREIPE-------------------------------ELE-IKDYEVHPNLLQKFPVLEIIAN--- 240 (511) Q Consensus 200 ~----~~~~~~~~~~~~~~~-------------------------------~~~-~~~~~~~~~~~g~iPvv~~~n~--- 240 (511) . +.....+........ ..+ .......+-+++.+|+|+|... T Consensus 237 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~ 316 (720) T protein:vir:35 237 VDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPVYGKRWF 316 (720) T ss_pred EEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEEEEeeeec Confidence 0 111000000000000 000 0011223445566777776531 Q ss_pred ----cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhh---Cc--eeee----cCCC-- Q lcl|NC_018086. 241 ----EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKN---DR--VIVT----DEDG-- 305 (511) Q Consensus 241 ----~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~---~~--~i~~----~~~~-- 305 (511) +...|.+.++++.++.+|...|.+...+- +.+...-.|... ........+.. .+ .+.+ ..+| T Consensus 317 ~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~--~~~~~~~~~a~~-~~~~~~~~~a~~~~~~~~~l~~~~~~~~~G~~ 393 (720) T protein:vir:35 317 IDDIERVEGHIAKAMDAQRLYNLQVSMLADSAT--QDTGSIPIVGKS-QIKTLEKYWANRNKNRPAFLPLNEIVDKQGNI 393 (720) T ss_pred cCCCcccceeeecchhHHHHHHHHHHHHHHHHH--cCCccccccCcc-hHHHHHHHhhccccccccccccccccccCccc Confidence 11247788899999999999999998885 344443334211 11111111111 10 0101 1111 Q ss_pred -----ceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 306 -----MVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYEL 380 (511) Q Consensus 306 -----~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~l 380 (511) .+.+.....-.......+..-..+|-.+|++.+-..|..+|.||+||......-..........+..+.+++.++ T Consensus 394 ~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~ 473 (720) T protein:vir:35 394 IAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPSNIAKETVNHLMHRSDMSSFIYLDNMAKSLKRAGEV 473 (720) T ss_pred ccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222332222234455667777788889998887777777889999999876666666655666666666666665 Q ss_pred HHHHHHh----------cCC-C-c---------------------ccc--ccceeEEeCCCCCcCHHHHHHHHHHHhccC Q lcl|NC_018086. 381 VCSYLEF----------MNK-A-K---------------------DLK--PYEVTPVFVRNLPQSYAELADMAVKLRDML 425 (511) Q Consensus 381 i~~~~~~----------~~~-~-~---------------------~~~--~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~ 425 (511) ++.+... .+. + . ++. ..+|.+.=.|..+.-..+..++++.+.+.+ T Consensus 474 lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m~qll~~~ 553 (720) T protein:vir:35 474 WLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVLTNLLAGM 553 (720) T ss_pred HHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHHHHHHHHhc Confidence 5554321 110 0 0 000 012333333333333444555555554433 Q ss_pred ChHH---------HHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCc Q lcl|NC_018086. 426 PDET---------IINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVA 496 (511) Q Consensus 426 s~et---------~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (511) +... +++.+. .+..++-.+++++....... ..+.. . ..+...+...+..+... T Consensus 554 ~p~~~~~~~~~~~ile~~d-~p~~~e~~erirk~~~~~~~-----------~~~~~---~---e~qq~~a~~qq~~qq~~ 615 (720) T protein:vir:35 554 LPQDPMRQVLQGIILDNME-GEGLDEFKEYNRKQLLTQGV-----------VKPRN---T---EEEQMVAQMIQQAQQPN 615 (720) T ss_pred CCCchhHHHHHHHHHHhcC-chhHHHHHHHHHhhcchhcc-----------cCccC---h---hHHHHHHHHHHHHHhHh Confidence 3221 122222 12223333444332111000 00000 0 00000000000000000 Q ss_pred ---cccc-----cccCCCCCCCC Q lcl|NC_018086. 497 ---AKEQ-----EKAIQKKPKTD 511 (511) Q Consensus 497 ---~~~~-----~~~~~~~~~~~ 511 (511) .+.+ -.+...+...+ T Consensus 616 ~e~~~aqa~l~qaqae~~kaqa~ 638 (720) T protein:vir:35 616 AELVAAQGVLMQGQAEVQKAKNE 638 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 00011111111 No 114 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.30 E-value=1.5e-10 Score=74.42 Aligned_cols=431 Identities=11% Similarity=0.049 Sum_probs=200.3 Q ss_pred CCCccchh----hcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCcccccCCcCccccccce Q lcl|NC_018086. 1 MAIPNGQI----NAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSS-SAYGVLYDYYKGNHIAIQSRTFDDTNKPNSK 75 (511) Q Consensus 1 ~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~r 75 (511) ++++|..+ +.|... .+-..+..+. ..-..+..+|....-+.+ .....+ T Consensus 91 ~~~~~~~~~~Dgl~n~~~--------------------~lG~~~~~s~y~~~~~~~~~~~~~~f~gy------ql~alY- 143 (862) T protein:vir:99 91 AIKAITGFAMDDGGGAPV--------------------PIGAEGKQSSYAVPEALQDWYLSQGFIGH------QACALI- 143 (862) T ss_pred hhhhhhhhhhhcchhhhh--------------------hccccccccccccchhccccccccCcccH------HHHHHH- Confidence 33333222 000000 0000000000 000011122211100000 000001 Q ss_pred eccchHHHHHHHHHhhhhccCceecCc-------hhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC-CCc- Q lcl|NC_018086. 76 IVHNFPKLLVDTSTAYLAGEPITESGD-------EKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR-NKK- 146 (511) Q Consensus 76 i~~n~~k~ivd~~~~~l~g~~~~~~~d-------~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~-~g~- 146 (511) ..+.+++.+|+..+.-++-+++.+.+. ++..+.+...|.+-++...+.++.+++-.||.+++++-.+. +++ T Consensus 144 ~~~~larkiVd~pAeDatR~g~~I~~~~d~~e~~~e~~~~ie~~~~rL~v~~~l~eair~~RLyGga~ililv~~~D~~~ 223 (862) T protein:vir:99 144 AQHWLVDKACSLAGEDAIRNGWHLKSLGEGEEIDEESLEKFKAIDVEFKVKENLIEFNRFKNVFGIRVAIFVVDSEDPDY 223 (862) T ss_pred HhCchhhhhhhhhhHHHhhCCceEeecCcccccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEEecCcCchh Confidence 125778899999999999999998652 24456788888888889999999999999998876654322 221 Q ss_pred --------------e-EEEEEcccceEEE---ecCCCCCceEEE-EEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCc Q lcl|NC_018086. 147 --------------H-RFKAVSPMNCLIA---YSADLDEEPVAA-IYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDER 207 (511) Q Consensus 147 --------------~-~i~~~~p~~~~~v---~d~~~~~~~~~~-v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~ 207 (511) + .+.+++|..+.+. +...+...+-++ ..+|.+ .++ -+.+.++++|... T Consensus 224 LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~sp~yGkP~~y~I----~g~------~IH~SRliif~g~--- 290 (862) T protein:vir:99 224 YEKPFNPDGITPGSYRGISQIDPYWMMPMLTAESTADPSSQFFYEPEFWII----SGQ------KYHRSHLIIARGP--- 290 (862) T ss_pred hhcCcCcccccccceeEEEEechhhhcccccccccccccccccCCceeeee----cCe------eeccceeEEecCC--- Confidence 1 2444555544331 000010000000 011110 010 1122233222100 Q ss_pred ccccccccccccccccceeccCCccceEee-cCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCcc Q lcl|NC_018086. 208 EVYREIPEELEIKDYEVHPNLLQKFPVLEI-IANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSAD 286 (511) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~-~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~ 286 (511) .+|-..- .++-.|.|.++.+.+.+..++++.......+..+...++.+.+...-.. T Consensus 291 -----------------------~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~v~ktd~l~~l~~ 347 (862) T protein:vir:99 291 -----------------------QPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTTAIHTDTAKAIAN 347 (862) T ss_pred -----------------------CchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeechhHhhhcc Confidence 0111100 1123589999999999999999988888888888888777766532111 Q ss_pred -chhhhh---h---hh-CceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccc-ccccc--c-CccHHHHHH Q lcl|NC_018086. 287 -SDSISN---M---KN-DRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDL-VSKDF--T-AASGQALKA 354 (511) Q Consensus 287 -~~~~~~---~---~~-~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~-~~~~~--~-~~Sg~Ai~~ 354 (511) ...... + +. .+++.+..+.+.+. .+.+.+.+...++.....|...+++|-. .++.. | ++||..=.. T Consensus 348 ed~l~~r~~~~~~~rdN~Gi~liD~eEe~e~--ls~slSGL~dll~~~~q~IAaas~IP~tiLfGqspaGlnATGE~D~~ 425 (862) T protein:vir:99 348 EDKFIQRLMFWVRYRDNHAVKVLGTDETMEQ--FDTSLADFDAVIMGQYQLVASIAKTPATKLLGTAPKGFNSTGEFETI 425 (862) T ss_pred HHHHHHHHHHHHhccCcceeEEecCCCceeE--EecccCChHHHHHHHHHHHHhhhCCCceeecccCcccccCchHHHHH Confidence 111111 1 12 34666665555444 4566678888999999999999999965 34432 2 467774333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHH-------HH--hccC Q lcl|NC_018086. 355 ATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAV-------KL--RDML 425 (511) Q Consensus 355 ~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~-------~~--~g~~ 425 (511) .|...+.-.. +..+...|++++.++..-+ +. ..++++.|++-...++++.|+... ++ .|++ T Consensus 426 nYyD~I~s~Q--E~~L~P~LerL~~li~~~l---g~-----~~d~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sGvi 495 (862) T protein:vir:99 426 SYHEELESIQ--EHVYMPFLQRHYLISRLSL---GI-----QHEIDVVMEPVASMTAQQQADLNKTKAEGGKVLIDGGVI 495 (862) T ss_pred HHHHHHHHHH--HHHHHHHHHHHHHHHHHhc---CC-----CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCC Confidence 3333333222 3567788888776654322 11 136889999999999998887643 33 4788 Q ss_pred ChHHHHHhC------CC--CCCHHHHHHH-HHHHHHHHHHHHHhhccccccCCCCCCcccc------ccCCCCCCcc-cc Q lcl|NC_018086. 426 PDETIINQF------PW--ITDARQEVEK-ADAQRQKRADIALQNFKQTSAVQGASTAAAN------KLDKNPANTS-TI 489 (511) Q Consensus 426 s~et~~~~l------~~--v~d~~~E~~r-i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~-~~ 489 (511) +.+.++..| ++ +++.+.|-.. ...+..+. ....+......+.++..+. +++....+.. .+ T Consensus 496 spdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~~----~e~~g~a~~~ap~de~~aga~~~~~e~d~~~~p~~~~~ 571 (862) T protein:vir:99 496 SPDEERNRIRDDKRSGYNRLTKEDAEETPGASPENLAA----YQKAGAAQETASAKETQAGAAVTTAEGDQPNVQMVPSM 571 (862) T ss_pred CHHHHHHHHHhcCCcCCCCCCcccccccCCCCcccccc----cccCCcccccccccccccccCCccccCCcccccccCCC Confidence 888777754 22 2221111000 11111110 0001111111111000000 0000000000 00 Q ss_pred ccCCCCccccccccCCCCC-CCC Q lcl|NC_018086. 490 TTTDPVAAKEQEKAIQKKP-KTD 511 (511) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~-~~~ 511 (511) .+-+..+ ...+.+..-+| +.| T Consensus 572 ~~g~~~~-~t~~~~a~~p~~~~~ 593 (862) T protein:vir:99 572 KPGQMVG-PEVGITAPMPEDDAP 593 (862) T ss_pred CCCCccc-cccccccCCCccccc Confidence 0111111 11122221111 222 No 115 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=99.25 E-value=3.2e-10 Score=72.67 Aligned_cols=460 Identities=9% Similarity=0.017 Sum_probs=201.4 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHH----HHHHHHHHHHHhcCCCcccccCCcCcccccccee Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSR----SSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKI 76 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri 76 (511) || . ...+.+..+.+.+..+.++. -..+++.+.+|..-.- .............++ T Consensus 1 ~~-------------~------~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~---~~~~~~~~~~~~~~~ 58 (543) T protein:vir:88 1 MA-------------E------TKREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSL---FPKDSDNSSTDYTTP 58 (543) T ss_pred Cc-------------c------cccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhcccc---CCCCCCccccccccc Confidence 11 1 11122333444444444433 3345555555555421 111111111122345 Q ss_pred ccchHHHHHHHHHhhhhcc--Cc----eecCch--------------hh-------HHHHHHHHhccChhHHHHHHHHHH Q lcl|NC_018086. 77 VHNFPKLLVDTSTAYLAGE--PI----TESGDE--------------KT-------IKAMQPVFKENYVTDVNSEEVKLS 129 (511) Q Consensus 77 ~~n~~k~ivd~~~~~l~g~--~~----~~~~d~--------------~~-------~~~l~~~~~~n~~~~~~~~~~~~a 129 (511) ..+-+...++.+++.|++- |. ++...+ +. .+.+...+..++|.....++.++. T Consensus 59 ~dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L 138 (543) T protein:vir:88 59 WQAVGARGLNNLSAKVMLALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQL 138 (543) T ss_pred ccchHHHHHHHHHHHHHHhhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHH Confidence 6677777888887777652 21 111111 01 122445566788999999999999 Q ss_pred hhCCeEEEEeeeCCCCceE---EEEEcccceEEEecCCCCCceEEEEEEEEEeec-------------CCcceEEEEEEE Q lcl|NC_018086. 130 GIFGHCFEIHWIDRNKKHR---FKAVSPMNCLIAYSADLDEEPVAAIYYNTVISD-------------ITGHQIRTYEVY 193 (511) Q Consensus 130 ~~~G~~~~~v~~~~~g~~~---i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~-------------~~~~~~~~~~~~ 193 (511) .++|.|.+++..+....++ ++.+ |..-+++--+.. +.+...+|.+..... ...+....++|| T Consensus 139 ~~~G~a~ly~~~~~~~~~~~~~~~~~-pl~~y~v~~d~~-G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~ 216 (543) T protein:vir:88 139 ALAGTALIYLPPPDASSNSYNPMKLY-TLHNHVVQRDAF-GNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVY 216 (543) T ss_pred HhhCceeeeeccCccccceecceEEe-EcceEEEeeCCC-CCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEE Confidence 9999998877665433222 3333 444455543433 456666665543211 111222344554 Q ss_pred cCCcEEEEEEccCcccccccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 194 TEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDI 268 (511) Q Consensus 194 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~ 268 (511) +.- +.+.....+.... +-............++..+|++.++ ++.+|+|-..+..+-+..+|.+.-...... T Consensus 217 ~~V---~pr~~~~~~~~~~-~~~~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~ 292 (543) T protein:vir:88 217 THI---YIDDESGDFLSYQ-EIEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFA 292 (543) T ss_pred EEE---EeecCCCcccccc-cccCeeeecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 431 1111111111111 1111111122233345667766543 357899999999999999999999999999 Q ss_pred HHhcCceeEeecCCCCccchhhhhh-h-hCceeeecCCCceeeee--cCCCHHHHHHHHHHHHHHHHHHhCccccccccc Q lcl|NC_018086. 269 AYWNDAYLWLQGFDLSADSDSISNM-K-NDRVIVTDEDGMVKFIT--KDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF 344 (511) Q Consensus 269 ~~~~~p~l~~~G~~~~~~~~~~~~~-~-~~~~i~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~ 344 (511) +....|.+.+.-..... ...+ . ..+.+.....+++..+. ...+.......++.++..|...-..-.+..-+. T Consensus 293 ~~~~~pp~~v~~~g~~~----~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~ 368 (543) T protein:vir:88 293 MISSKVVGLVNPNGITQ----VRRLVKAQTGDFVAGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLNSAVQRSG 368 (543) T ss_pred HHHhcCceeeccccccc----hhhcccCCCceeecCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCC Confidence 99999987653211110 1111 1 12233323334555443 334667777777777777765443222211222 Q ss_pred cCccHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCC-cCHHHHHHHH Q lcl|NC_018086. 345 TAASGQALKAATQPLENKSAV-----KESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLP-QSYAELADMA 418 (511) Q Consensus 345 ~~~Sg~Ai~~~~~~l~~k~~~-----~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p-~d~~e~a~~~ 418 (511) ...|++.+......+.....- ....+..-+.+++.+ +...+.-.......+++.+...+. -...+.++.+ T Consensus 369 ~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~i----l~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l 444 (543) T protein:vir:88 369 ERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRVLLNQ----LQATQQIPNLPQEAVEPTVTTGAEALGRGQDLDKL 444 (543) T ss_pred CcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH----HHhcCCCCCCchhceeeeEEecHHHHHHHHHHHHH Confidence 345777665543333322211 112222233333332 222332222222345555543222 1122222222 Q ss_pred HHH---hccCC---------hHHHHHh----CCC-CC---CHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccc Q lcl|NC_018086. 419 VKL---RDMLP---------DETIINQ----FPW-IT---DARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANK 478 (511) Q Consensus 419 ~~~---~g~~s---------~et~~~~----l~~-v~---d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (511) ... .+.++ ...++.. +|. .. ..++|++.++++++++............+..+......+. T Consensus 445 ~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~~~~~~~~~~~ 524 (543) T protein:vir:88 445 TQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQEMLKQGGLNAAAGIGSGVAAQATASPEA 524 (543) T ss_pred HHHHHHHHhccchhhhccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhccChHH Confidence 221 12222 2223332 232 11 2357777777665544433332222222221111110000 Q ss_pred cCCCCCCc-cccccCCCCc Q lcl|NC_018086. 479 LDKNPANT-STITTTDPVA 496 (511) Q Consensus 479 ~~~~~~~~-~~~~~~~~~~ 496 (511) .+....+. .+..+..|+. T Consensus 525 ~~~~~~~~~~~~~p~~~~~ 543 (543) T protein:vir:88 525 MESAMDTAGVQPGPIATQV 543 (543) T ss_pred HHHHhhhcCCCCCCCCCCC Confidence 00000000 0111111111 No 116 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=99.24 E-value=8.5e-11 Score=75.78 Aligned_cols=456 Identities=13% Similarity=0.050 Sum_probs=225.3 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHH----HHHHHHHHHHHHHhcCCCcccccCCcCcccccccee Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMH----SRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKI 76 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~----~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri 76 (511) |..+|. +++..+.-.++..-=..++.++-..+ ......++.+++|-.- ++.........+++ +++ T Consensus 1 m~~~~~--------~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~-~~tr~t~~~~~~w~--~s~ 69 (599) T protein:vir:31 1 MSTDIK--------TLQKMLEGRDDDRAFIDELVVLFTNMENARAQKDREDKELMDYIDA-TDTRKTSNSKLPFK--NST 69 (599) T ss_pred CccchH--------HHHHHhhccCchHHHHHHHHHHHHhhhhhhhhhhcccHHHHHHHhh-hcccccccCCCCcc--ccc Confidence 444443 23222222222111112222222222 2223456677777432 22222222222232 467 Q ss_pred ccchHHHHHHHHHhhhhccCc------ee---cCchhh---HHH----HHHHHhccChhHHHHHHHHHHhhCCeEEEEee Q lcl|NC_018086. 77 VHNFPKLLVDTSTAYLAGEPI------TE---SGDEKT---IKA----MQPVFKENYVTDVNSEEVKLSGIFGHCFEIHW 140 (511) Q Consensus 77 ~~n~~k~ivd~~~~~l~g~~~------~~---~~d~~~---~~~----l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~ 140 (511) .+|..-.+++.+..++++-=+ .+ ..++.+ .+. +++-+...+|......+..+...+|-||..+. T Consensus 70 t~~k~~~~~~~l~a~~~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~~ 149 (599) T protein:vir:31 70 TINKLAHLHLMITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHTR 149 (599) T ss_pred chHHHHHHHHHHHHHHHhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEeee Confidence 777777899998888776311 11 112111 112 34446677899999999999999999887765 Q ss_pred eCC------CC-------ceEEEEEcccceEEEecCCCC--CceEEEEEEEEEeecCCc-------------------ce Q lcl|NC_018086. 141 IDR------NK-------KHRFKAVSPMNCLIAYSADLD--EEPVAAIYYNTVISDITG-------------------HQ 186 (511) Q Consensus 141 ~~~------~g-------~~~i~~~~p~~~~~v~d~~~~--~~~~~~v~~~~~~~~~~~-------------------~~ 186 (511) .-. +| .|++..++|.++|+ |+.-. ......+|.+....+... +. T Consensus 150 ~er~~~~~~d~~v~~~~~~P~~ervsP~Di~~--Dp~A~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~ 227 (599) T protein:vir:31 150 HVKRMTVTAENQVIKNYSGTVTERLSPSDVFW--DVTADSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLREE 227 (599) T ss_pred EEEcceeecccccccccccceEEeecccceee--CCCCCCCCcceeeeehhhhHHHHHHHhccCCccccchHHHHHHHhh Confidence 321 22 48899999988765 44322 222223333321100000 00 Q ss_pred EEE--------------------------EEEEcCCcEE--EEE-----EccCc-cccccccc-c-cccccccceeccCC Q lcl|NC_018086. 187 IRT--------------------------YEVYTEDLIY--KFS-----TDDER-EVYREIPE-E-LEIKDYEVHPNLLQ 230 (511) Q Consensus 187 ~~~--------------------------~~~~~~~~i~--~~~-----~~~~~-~~~~~~~~-~-~~~~~~~~~~~~~g 230 (511) ... .+.|.+..+. .|. ...++ +......- . .-..-.+..|.+.| T Consensus 228 ~~~~~~~~~d~~~~~~g~D~~~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g 307 (599) T protein:vir:31 228 RRTIREALADGYNGRRKFDSLHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDG 307 (599) T ss_pred ccCCCccccchhhhhhhccccccccccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCC Confidence 000 0011111100 000 00000 00000000 0 00112233455667 Q ss_pred ccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCCC Q lcl|NC_018086. 231 KFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDEDG 305 (511) Q Consensus 231 ~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~ 305 (511) ..|++... ...+|.|.+..+..+++.+|.+.-.+.+.+.-+..|+++..|.-...+-.+ ..+.++.+.+.+ T Consensus 308 ~~Pyvv~~~~P~~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p~l~~~~dl~~eD~~~----~P~~v~~~~d~~ 383 (599) T protein:vir:31 308 SQNLHIAVYEFQKDTLCPIGPLHRLTGMQYKLDKRENFREDLHDRFLHPSLKKVGDVREKGMRG----GPNHVFEVEETG 383 (599) T ss_pred CCCeEEEEeeeeccccCCCCCchhcchHHHHHHHHHHHhhhhhhhhhcccccccccccccCccC----CCCcceeecCCC Confidence 77876443 345789999999999999999999999999999999888777522222221 136788889999 Q ss_pred ceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc--cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH Q lcl|NC_018086. 306 MVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF--TAASGQALKAATQPLENKSAVKESKFRKVLAK-RYELVC 382 (511) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~-~~~li~ 382 (511) ++.+..++.+..+....+..+...+-..|+.|..+.|.. +..++..+.............+.+.|..++-+ +++-+. T Consensus 384 ~vq~~~p~s~~~~a~~~is~~e~~mee~sGvp~~~~G~~~ag~~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l~ 463 (599) T protein:vir:31 384 DVQYMTPPAEVLQPDNQLSITLQLMEDLSGAPKESIGQRTAGEKTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYL 463 (599) T ss_pred ccccccCchhhhhHHHHHHHHHHHHHHhhccchhhcCCcccchhhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHH Confidence 999988888888888889999999999999999888754 34678888888888888888888888876544 555444 Q ss_pred HHHHhcCCCc--------c--------c--cccceeEEeCCCCCcCHHHHHHHHHHHhcc------------CChHHHHH Q lcl|NC_018086. 383 SYLEFMNKAK--------D--------L--KPYEVTPVFVRNLPQSYAELADMAVKLRDM------------LPDETIIN 432 (511) Q Consensus 383 ~~~~~~~~~~--------~--------~--~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~------------~s~et~~~ 432 (511) ++....-... + + +...-.+.+.+.-..-..+..+.++++.++ ++++.... T Consensus 464 e~~~~f~D~~~tiri~~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~l~~il~~~~~q~~~P~~~~k~l~~ 543 (599) T protein:vir:31 464 EQGRNHLDASDTIKTFNSELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAILGGPLGAALAPHMSRTKLFN 543 (599) T ss_pred HHHHhhcccccceeeecccccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHHHHHHHhcccCCCccchhhHHHHHHH Confidence 4332211110 0 0 000111222222222234455555554322 23322211 Q ss_pred hCCC--------CCCHH-------HHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccc Q lcl|NC_018086. 433 QFPW--------ITDAR-------QEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTIT 490 (511) Q Consensus 433 ~l~~--------v~d~~-------~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (511) .+-+ +.-+. .+...++++-++..+.++ ..++-+. +++++++ T Consensus 544 ~l~~~~~l~~~~~~~~~va~~eqq~~~~m~Q~~lq~~~~~~~---------------~~~~~~~--~~~~~~~ 599 (599) T protein:vir:31 544 AVEYLGDLDAYGIFTFGIGVQEDQQLARMAQKSTQQTEETAL---------------TQEEVGG--PTTDTGQ 599 (599) T ss_pred HHHHHHhccccccCCCchhHHHHHHHHHHHHHHHHHhHhhhh---------------hhhhcCC--CCcccCC Confidence 1111 11000 011111111111111110 0000000 0000111 No 117 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=99.23 E-value=4e-10 Score=72.12 Aligned_cols=449 Identities=10% Similarity=0.015 Sum_probs=200.1 Q ss_pred hhhccCCCHHHHHHHHHHHHHH----HHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhcc-- Q lcl|NC_018086. 22 HFIRRNFDLRELITLAEMHSRS----SSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGE-- 95 (511) Q Consensus 22 ~~~~~~~~~~~l~~~~~~~~~~----~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~-- 95 (511) .-..+.+..+.+.+..+.++.+ ..+++.+.+|..-.- .............++..+-+...++.+++.|++- T Consensus 1 ~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~---~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~lt 77 (522) T protein:vir:94 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSL---FPKESDNSSTEYTTPWQAVGARCLNNLAAKLMLALF 77 (522) T ss_pred CcccchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc---cCCCCCcccccccccccccHHHHHHHHHHHHHhhcC Confidence 1112334444455555555443 344555555544321 1111111122233466677777888888777652 Q ss_pred ---Cc-eecC----------chh----h-------HHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EE Q lcl|NC_018086. 96 ---PI-TESG----------DEK----T-------IKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RF 149 (511) Q Consensus 96 ---~~-~~~~----------d~~----~-------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i 149 (511) || ++.. +.+ . .+.+...+..++|.....++.++..++|.|++++..+..+.+ .+ T Consensus 78 P~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~ 157 (522) T protein:vir:94 78 PQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPEQGTYSPM 157 (522) T ss_pred CCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccCCCceeeE Confidence 22 1211 111 1 122445567789999999999999999999988876665544 45 Q ss_pred EEEcccceEEEecCCCCCceEEEEEEEEEeec------------CCcceEEEEEEEcCCcEEEEEEccCccccccccccc Q lcl|NC_018086. 150 KAVSPMNCLIAYSADLDEEPVAAIYYNTVISD------------ITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEEL 217 (511) Q Consensus 150 ~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~------------~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 217 (511) +.++-.+ +++--+. .+.+...+|.++.... ...+....+++|+.- .....++..... ... T Consensus 158 ~~~pl~~-y~v~~d~-~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v-----~~~~~~~~~~~~-~~g 229 (522) T protein:vir:94 158 RMYRLVS-YVVQRDA-FGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHI-----YRQDDEYLRYEE-VEG 229 (522) T ss_pred EEEEcce-EEEeeCC-CcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEE-----EeeCCceeEEee-ccC Confidence 5655444 4443333 3446666665543211 011122344444320 011111111111 011 Q ss_pred ccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhh Q lcl|NC_018086. 218 EIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISN 292 (511) Q Consensus 218 ~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~ 292 (511) ..........+|..+|++.++ ++.+|+|-..+..+-+..+|.+.-......+....|.+.+.-.... ....... T Consensus 230 ~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~-~~~~~~~ 308 (522) T protein:vir:94 230 IEVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGIT-QPRRLNK 308 (522) T ss_pred ceecccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccc-cchheec Confidence 111111222356677866544 3568999999999999999999999999999999998765411110 0111111 Q ss_pred hhhCceeeecCCCceeeee--cCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_018086. 293 MKNDRVIVTDEDGMVKFIT--KDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAV-KESK 369 (511) Q Consensus 293 ~~~~~~i~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~-~~~~ 369 (511) . ..+.+.....++++.+. ...+.......++.++..|...-..-.+..-+....|++.+......+.....- ..+. T Consensus 309 ~-~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl 387 (522) T protein:vir:94 309 A-ATGEFVAGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNSAVQRNAERVTAEEIRYVAGELEATLGGVYSVQ 387 (522) T ss_pred c-CCceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCCccccHHHHHHHHHHHHHHHhHHHHHH Confidence 1 11233323333445443 334566667777777777766543322222222346777765543333322211 1111 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcC-HHHHHHHHHH----HhccC--------ChHHH----HH Q lcl|NC_018086. 370 FRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQS-YAELADMAVK----LRDML--------PDETI----IN 432 (511) Q Consensus 370 ~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d-~~e~a~~~~~----~~g~~--------s~et~----~~ 432 (511) -.+.|.-+++.++.++...+.-.......+++.+..++..- ..+.++.+.. ++++- ....+ .. T Consensus 388 ~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~v~~~s~La~~qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~ 467 (522) T protein:vir:94 388 SQELQLPIVRVLMNQLQSAGMIPDLPKEAVEPTVSTGLEALGRGQDLEKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLN 467 (522) T ss_pred HHHHHHHHHHHHHHHHHhcCCCCCCCcccEEeeEecHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHH Confidence 11122222222222332233322333334666665444431 1111222211 12211 11122 22 Q ss_pred hCCC-CC---CHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCcc Q lcl|NC_018086. 433 QFPW-IT---DARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAA 497 (511) Q Consensus 433 ~l~~-v~---d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (511) .+|. .. -.++|++.+.+++++............. ..++. .+++++++-..+ T Consensus 468 ~~Gv~~~~ivr~~ee~~~~~~q~~~~~~~~~~~~~~~~-~~~a~-------------~~~~~~~~~~~~ 522 (522) T protein:vir:94 468 ALGIDTAGLLLTQDEKIQRMAEQSSQQAVVQGASAAGA-NMGAA-------------VGQGAGEDMAQA 522 (522) T ss_pred HcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHH-Hhhhh-------------hhcccchhhhcC Confidence 3332 11 1245666665554333222111111110 00000 000000000000 No 118 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=99.22 E-value=4.7e-10 Score=71.75 Aligned_cols=452 Identities=10% Similarity=-0.001 Sum_probs=203.9 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHH----HHHHHHHHHHHHhcCCCcccccCCcCcccccccee Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHS----RSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKI 76 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~----~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri 76 (511) || ... .+.+..+.+.+..+.++ .-..+++.+.+|..-.- .............++ T Consensus 1 m~-------------~~~------~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~---~~~~~~~~~~~~~~~ 58 (535) T protein:vir:15 1 MA-------------DSK------RTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSL---FPKESDNESTDYTTP 58 (535) T ss_pred CC-------------ccc------hhccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc---cCCCCCccccccccc Confidence 22 111 11122233333333333 33455666666655421 111111111222345 Q ss_pred ccchHHHHHHHHHhhhhcc--C---c-eecCch---------------------hhHHHHHHHHhccChhHHHHHHHHHH Q lcl|NC_018086. 77 VHNFPKLLVDTSTAYLAGE--P---I-TESGDE---------------------KTIKAMQPVFKENYVTDVNSEEVKLS 129 (511) Q Consensus 77 ~~n~~k~ivd~~~~~l~g~--~---~-~~~~d~---------------------~~~~~l~~~~~~n~~~~~~~~~~~~a 129 (511) ..+-+...++.+++.|++- | | ++...+ .....+...+..++|.....++.++. T Consensus 59 ~dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L 138 (535) T protein:vir:15 59 WQAVGARGLNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQL 138 (535) T ss_pred ccccHHHHHHHHHHHHHHhhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHH Confidence 5666777888877777652 2 1 121111 11123445577889999999999999 Q ss_pred hhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeec--------------CCcceEEEEEEEcC Q lcl|NC_018086. 130 GIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISD--------------ITGHQIRTYEVYTE 195 (511) Q Consensus 130 ~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~--------------~~~~~~~~~~~~~~ 195 (511) .++|.|.+++..+..+.++++.++-.+.++.-| . .+.+...+|.++.... ...+....+++|+. T Consensus 139 ~~~G~a~l~~~~~~~~~~~f~~~pl~~~~v~~d-~-~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~ 216 (535) T protein:vir:15 139 IVAGNALLYLPEPEGSYNPMKLYRLSSYVVQRD-A-YGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTH 216 (535) T ss_pred HhhCceeEEeecCCCCceeeEEEEcCeeEEeeC-C-CCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEE Confidence 999999888877766667788776555444433 3 3445666665543310 01112223333332 Q ss_pred CcEEEEEEccCcccccccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 196 DLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAY 270 (511) Q Consensus 196 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~ 270 (511) - +.....+.+.... .....-........+|..+|++.++ ++.+|+|-..+..+-+..+|.+.-......+. T Consensus 217 v---~~~~~~~~~~~~~-e~~g~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~ 292 (535) T protein:vir:15 217 V---YLDEESGDYLKYE-EVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMI 292 (535) T ss_pred E---EEecCCCcEEEEE-EeeCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 1111111111111 0011111111233456667776544 35789999999999999999999999999999 Q ss_pred hcCceeEeecCCCCccchhhhhh-h-hCceeeecCCCceeeee--cCCCHHHHHHHHHHHHHHHHHHhCccccccccccC Q lcl|NC_018086. 271 WNDAYLWLQGFDLSADSDSISNM-K-NDRVIVTDEDGMVKFIT--KDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA 346 (511) Q Consensus 271 ~~~p~l~~~G~~~~~~~~~~~~~-~-~~~~i~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~ 346 (511) ...|.+.+.-.... . ...+ . ..+.+.....++++.+. ...+.......++.++..|...-..-.+..-+... T Consensus 293 ~~~p~~lv~~~g~~---~-~~~l~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~r 368 (535) T protein:vir:15 293 SAKVIGLVNPAGIT---Q-PRRLTKAQTGDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAVQRTGER 368 (535) T ss_pred HhcCceeecccccc---c-chhcccCCceeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCcc Confidence 99998765311110 0 1111 1 12233323334455543 33456667777777777775543222221122234 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHhcCCCccccccceeEEeCCCCCcCH-HHHHHH Q lcl|NC_018086. 347 ASGQALKAATQPLENKSAVKESKFRKVLAK--------RYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSY-AELADM 417 (511) Q Consensus 347 ~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~--------~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~-~e~a~~ 417 (511) .|++.+..... ++...++..+.+ +++.++.++...+.-.......+.+.|.-++..-- .+.++. T Consensus 369 ~TAtEV~~r~~-------E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~ 441 (535) T protein:vir:15 369 VTAEEIRYVAS-------ELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDK 441 (535) T ss_pred ccHHHHHHHHH-------HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHH Confidence 57776655433 333333333333 22222333333333333344457777765555321 111222 Q ss_pred H----HHHhccCC--------hHHHHHhC---CCCC-----CHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCcccc Q lcl|NC_018086. 418 A----VKLRDMLP--------DETIINQF---PWIT-----DARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAAN 477 (511) Q Consensus 418 ~----~~~~g~~s--------~et~~~~l---~~v~-----d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (511) + ..++++-| ...++..+ -+++ ..++|++.+.+++++..... ....+..+..++..... T Consensus 442 l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~eev~~~~~q~~~~~~~~-~~a~~~g~~~~~~~~~~- 519 (535) T protein:vir:15 442 LERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGIE-NAAATGGAGVGALATSS- 519 (535) T ss_pred HHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHH-HHHHHHHhhccchhccC- Confidence 2 22222211 12222222 1222 23556666554433322221 11111111111111000 Q ss_pred ccCCCCCCccccccCCCCccccc Q lcl|NC_018086. 478 KLDKNPANTSTITTTDPVAAKEQ 500 (511) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~ 500 (511) |... .... +.-|-.++ T Consensus 520 -p~~~-----~~~~-~~~g~~~~ 535 (535) T protein:vir:15 520 -PEAM-----QGAA-AQAGLDAT 535 (535) T ss_pred -hHHH-----HHHH-hccCCCCC Confidence 1100 0001 11111111 No 119 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=99.22 E-value=5e-10 Score=71.55 Aligned_cols=456 Identities=10% Similarity=0.011 Sum_probs=198.8 Q ss_pred cCchhhHhhhhccCCCHHHHHHHHHHHHHHH----HHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHH Q lcl|NC_018086. 14 ITTNIRRKHFIRRNFDLRELITLAEMHSRSS----SAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTST 89 (511) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~----~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~ 89 (511) +.. .++++..+.+.+..+.++.++ .+++.+.+|..-.- .............++..+-+...++.++ T Consensus 1 m~~-------~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~---~~~~~~~~~~~~~~~~dst~~~a~~~La 70 (536) T protein:vir:10 1 MAE-------KRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSL---FPKDSDNASTDYQTPWQAVGARGLNNLA 70 (536) T ss_pred Ccc-------hhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc---cCCCCCcccccccccccccHHHHHHHHH Confidence 110 122444555666666655443 44455555544321 1111112222234566777777888888 Q ss_pred hhhhcc--C---c-eecCch--------------h-------hHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeC Q lcl|NC_018086. 90 AYLAGE--P---I-TESGDE--------------K-------TIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWID 142 (511) Q Consensus 90 ~~l~g~--~---~-~~~~d~--------------~-------~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~ 142 (511) +.|++- | | ++...+ + ....+...+..++|.....++.++..++|.+.+++..+ T Consensus 71 a~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~ 150 (536) T protein:vir:10 71 SKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEP 150 (536) T ss_pred HHHHhhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeC Confidence 777652 2 2 121111 0 11234556778899999999999999999999887655 Q ss_pred CCCceE-EEEEcccceEEEecCCCCCceEEEEEEEEEeec--------------CCcceEEEEEEEcCCcEEEEEEccCc Q lcl|NC_018086. 143 RNKKHR-FKAVSPMNCLIAYSADLDEEPVAAIYYNTVISD--------------ITGHQIRTYEVYTEDLIYKFSTDDER 207 (511) Q Consensus 143 ~~g~~~-i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~--------------~~~~~~~~~~~~~~~~i~~~~~~~~~ 207 (511) ..+.++ ++.++-.+++ +-.+.. +++...+|.+..... ...+....+++|+.- +....... T Consensus 151 ~~~~~~~~~~~pl~~~~-v~~d~~-G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V---~~~~~~~~ 225 (536) T protein:vir:10 151 EGSNYNPMKLYRLSSYV-VQRDAF-GNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHI---YLDEASGE 225 (536) T ss_pred CCCceeeEEEEEcCeEE-EeeCCC-CCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEE---EEecCCCc Confidence 544343 5555544443 433333 456666665543310 011112233333221 01111111 Q ss_pred ccccccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEee-cC Q lcl|NC_018086. 208 EVYREIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQ-GF 281 (511) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~-G~ 281 (511) +..... ....-...+....+|..+|++.++ ++.+|+|-..+..+-+..+|.+.-...........|.+.+. +. T Consensus 226 ~~~~~e-~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g 304 (536) T protein:vir:10 226 YLRYEE-VEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAG 304 (536) T ss_pred EEEEEe-ecCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCccc Confidence 111111 111111122334456677877554 35689999999999999999887777777777666554432 21 Q ss_pred CCCccchhhhhhhhCceeeecCCCceee--eecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHH Q lcl|NC_018086. 282 DLSADSDSISNMKNDRVIVTDEDGMVKF--ITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPL 359 (511) Q Consensus 282 ~~~~~~~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l 359 (511) .. ........ ..+.+.-...+++.. +....+.......++.++..|...-....+..-+....|++.+......+ T Consensus 305 ~~--~~~~~~~~-~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~r~TAtEV~~r~~E~ 381 (536) T protein:vir:10 305 IT--QPRRLTKA-QTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASEL 381 (536) T ss_pred cc--chhhhccC-CCcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCCCccHHHHHHHHHHH Confidence 11 01111111 112222112233333 33445566667777777777655443222222222346777766554433 Q ss_pred HHHHH----H-HHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCc-CHHHHHHHH----HHHhccCC--- Q lcl|NC_018086. 360 ENKSA----V-KESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQ-SYAELADMA----VKLRDMLP--- 426 (511) Q Consensus 360 ~~k~~----~-~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~-d~~e~a~~~----~~~~g~~s--- 426 (511) ..... + ....+..-+.+++.+ +...+.-.......+.+.+.-++.. ...+.++.+ ..++++-| T Consensus 382 ~~~LG~v~~rl~~Ell~Pli~r~~~i----l~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~P~~l 457 (536) T protein:vir:10 382 EDTLGGVYSILSQELQLPLVRVLLKQ----LQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRD 457 (536) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHH----HHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhh Confidence 33221 1 112222233333333 3222322222222345555433321 111122222 12222211 Q ss_pred -----hHHHH----HhCCCCC----CHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCC Q lcl|NC_018086. 427 -----DETII----NQFPWIT----DARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTD 493 (511) Q Consensus 427 -----~et~~----~~l~~v~----d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (511) ...++ ..+|..+ -.++|++.+.+++++.+...........+.+ +....+ .+..+-.....+.+ T Consensus 458 d~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~-~~~~~~---~~~~~~~~~~~g~~ 533 (536) T protein:vir:10 458 DPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMA-AQATAS---PEAMAAAADSVGLQ 533 (536) T ss_pred cccCCHHHHHHHHHHHcCCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcC---chhHHhhhhccccC Confidence 12222 2334311 2367777777655443322111110000010 000000 01111111233334 Q ss_pred CCc Q lcl|NC_018086. 494 PVA 496 (511) Q Consensus 494 ~~~ 496 (511) |+. T Consensus 534 ~~~ 536 (536) T protein:vir:10 534 PGI 536 (536) T ss_pred CCC Confidence 443 No 120 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=99.22 E-value=5.1e-10 Score=71.53 Aligned_cols=456 Identities=11% Similarity=0.019 Sum_probs=199.6 Q ss_pred cCchhhHhhhhccCCCHHHHHHHHHHHHHHH----HHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHH Q lcl|NC_018086. 14 ITTNIRRKHFIRRNFDLRELITLAEMHSRSS----SAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTST 89 (511) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~----~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~ 89 (511) +.. .++++..+.+.+..+.++.++ .+++.+.+|..-. ..............++..+-+...++.++ T Consensus 1 m~~-------~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~---~~~~~~~~~~~~~~~~~dst~~~a~~~La 70 (536) T protein:vir:21 1 MAE-------KRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPS---LFPKDSDNASTDYQTPWQAVGARGLNNLA 70 (536) T ss_pred Ccc-------hhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccc---ccCCCCCcccccccccccccHHHHHHHHH Confidence 110 122444555666666655443 4445555554432 11111112222334566777777888888 Q ss_pred hhhhcc--C---c-eecCch--------------h-------hHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeC Q lcl|NC_018086. 90 AYLAGE--P---I-TESGDE--------------K-------TIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWID 142 (511) Q Consensus 90 ~~l~g~--~---~-~~~~d~--------------~-------~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~ 142 (511) +.|++- | | ++...+ + ....+...+..++|.....++.++..++|.+.+++..+ T Consensus 71 a~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~ 150 (536) T protein:vir:21 71 SKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEP 150 (536) T ss_pred HHHHHhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeC Confidence 777652 2 2 121111 0 11235556778899999999999999999999887655 Q ss_pred CCCceE-EEEEcccceEEEecCCCCCceEEEEEEEEEeec--------------CCcceEEEEEEEcCCcEEEEEEccCc Q lcl|NC_018086. 143 RNKKHR-FKAVSPMNCLIAYSADLDEEPVAAIYYNTVISD--------------ITGHQIRTYEVYTEDLIYKFSTDDER 207 (511) Q Consensus 143 ~~g~~~-i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~--------------~~~~~~~~~~~~~~~~i~~~~~~~~~ 207 (511) ..+.++ ++.++-.++ ++-.+.. +++...+|.+..... ...+....+++|+.- +...+... T Consensus 151 ~~~~~~~f~~~pl~~~-~v~~d~~-G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v---~~~~~~~~ 225 (536) T protein:vir:21 151 EGSNYNPMKLYRLSSY-VVQRDAF-GNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHI---YLDEDSGE 225 (536) T ss_pred CCCceeeEEEEEcCeE-EEeeCCC-CCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEE---EEecCCCc Confidence 544343 555554444 3433333 456666665543211 001112233333221 11111112 Q ss_pred ccccccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEee-cC Q lcl|NC_018086. 208 EVYREIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQ-GF 281 (511) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~-G~ 281 (511) +..... ........+....+|..+|++.++ ++.+|+|-..+..+-+..+|.+.-...........|.+.+. +. T Consensus 226 ~~~~~e-~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g 304 (536) T protein:vir:21 226 YLRYEE-VEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAG 304 (536) T ss_pred EEEEec-cCCeeeccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCccc Confidence 211111 111111123334457778877654 35689999999999999999887777777777666554432 21 Q ss_pred CCCccchhhhhhhhCceeeecCCCceee--eecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHH Q lcl|NC_018086. 282 DLSADSDSISNMKNDRVIVTDEDGMVKF--ITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPL 359 (511) Q Consensus 282 ~~~~~~~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l 359 (511) .. ........ ..+.+.-...+++.. +....+.......++.++..|...-....+..-+....|++.+......+ T Consensus 305 ~~--~~~~~~~~-~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~r~TAtEV~~r~~E~ 381 (536) T protein:vir:21 305 IT--QPRRLTKA-QTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEEIRYVASEL 381 (536) T ss_pred cc--chhhhccC-CCcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCCCccHHHHHHHHHHH Confidence 11 01111111 112222111223333 33445566667777777777655443222222222346777766554433 Q ss_pred HHHHH----H-HHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCc-CHHHHHHHH----HHHhccCC--- Q lcl|NC_018086. 360 ENKSA----V-KESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQ-SYAELADMA----VKLRDMLP--- 426 (511) Q Consensus 360 ~~k~~----~-~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~-d~~e~a~~~----~~~~g~~s--- 426 (511) ..... + ....+..-+.+++.+ +...+.-.......+.+.+.-++.. ...+.++.+ ..++++-| T Consensus 382 ~~~LG~v~~rl~~Ell~Pli~r~~~i----l~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~Pe~l 457 (536) T protein:vir:21 382 EDTLGGVYSILSQELQLPLVRVLLKQ----LQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRD 457 (536) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHH----HHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhh Confidence 33221 1 112222233333333 3222322222222345555433321 111122222 12222211 Q ss_pred -----hHHHH----HhCCCCC----CHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCC Q lcl|NC_018086. 427 -----DETII----NQFPWIT----DARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTD 493 (511) Q Consensus 427 -----~et~~----~~l~~v~----d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (511) ...++ ..+|..+ -.++|++.+.+++++.+...........+.+ +....+ .+..+-.....+.+ T Consensus 458 d~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~-~~~~~~---~~~~~~~~~~~g~~ 533 (536) T protein:vir:21 458 DPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMA-AQATAS---PEAMAAAADSVGLQ 533 (536) T ss_pred cccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcC---hhhHHhhhhccccC Confidence 12222 2234211 2357777777655443332111110000010 000000 00111111233334 Q ss_pred CCc Q lcl|NC_018086. 494 PVA 496 (511) Q Consensus 494 ~~~ 496 (511) |+. T Consensus 534 ~~~ 536 (536) T protein:vir:21 534 PGI 536 (536) T ss_pred CCC Confidence 443 No 121 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=99.19 E-value=6.9e-10 Score=70.81 Aligned_cols=443 Identities=9% Similarity=0.013 Sum_probs=216.0 Q ss_pred CCHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCccccc---CC---cCccccccceeccchHHHHHHHHHhhhhcc--C-- Q lcl|NC_018086. 28 FDLRELITLAEMHSRSS-SAYGVLYDYYKGNHIAIQS---RT---FDDTNKPNSKIVHNFPKLLVDTSTAYLAGE--P-- 96 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~-~~~~~~~~yY~G~~~~~~~---~~---~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~--~-- 96 (511) ++.+.|.+..+..+.++ +...++++||+---+-... .. .....+.+.++..+-+...++.+++.|++- | T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~ltPp~ 80 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLTSPA 80 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 77777777666665444 2233444444332111100 00 011123455777888888888888887753 2 Q ss_pred --c-eecC-ch------h-------hHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC--CCceEEEEEcccce Q lcl|NC_018086. 97 --I-TESG-DE------K-------TIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR--NKKHRFKAVSPMNC 157 (511) Q Consensus 97 --~-~~~~-d~------~-------~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~--~g~~~i~~~~p~~~ 157 (511) | ++.. |. + ....+...+..++|.....++.++..++|.|.+++..++ .+.++++.++..++ T Consensus 81 ~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~~pl~~~ 160 (547) T protein:vir:10 81 TKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQSSPIQDS 160 (547) T ss_pred CcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEEEeecceE Confidence 2 2221 11 1 122345567778899999999999999999988887654 35678888888777 Q ss_pred EEEecCCCCCceEEEEEEEEEeec---------------------CCcceEEEEEEEcCCcEEE-----------EEEcc Q lcl|NC_018086. 158 LIAYSADLDEEPVAAIYYNTVISD---------------------ITGHQIRTYEVYTEDLIYK-----------FSTDD 205 (511) Q Consensus 158 ~~v~d~~~~~~~~~~v~~~~~~~~---------------------~~~~~~~~~~~~~~~~i~~-----------~~~~~ 205 (511) ++.-|. .+.+...+|.++.... ..+.....+++++.-+... +.... T Consensus 161 ~v~~d~--~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~ 238 (547) T protein:vir:10 161 YFEEDS--RGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAGTVLAPTE 238 (547) T ss_pred EEeeCC--CcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCccccceeeccc Confidence 665544 3445556655433210 0111111222221100000 00000 Q ss_pred Ccc--cccccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEe Q lcl|NC_018086. 206 ERE--VYREIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWL 278 (511) Q Consensus 206 ~~~--~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~ 278 (511) ..+ .+.+.. .. ..-....+|..+|++.++ ++.+|+|-.+...+-+..+|.+.-......+....|.+.+ T Consensus 239 ~p~~s~~~e~~-~~---~~~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v 314 (547) T protein:vir:10 239 RPFGKKWILKE-GA---VQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAIMV 314 (547) T ss_pred cceeEEEEEec-Cc---eeeeecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceec Confidence 000 000000 00 001122345566766544 3568999999999999999999999999999999998865 Q ss_pred ecCCCCccchhhhhhhhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHH Q lcl|NC_018086. 279 QGFDLSADSDSISNMKNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQP 358 (511) Q Consensus 279 ~G~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~ 358 (511) .-.... . ..++..++++...+..+++.+....+.......++.++..|...-....+...+....|++.+...... T Consensus 315 ~~~g~~---~-~~~~~pgg~~~~~~~~~v~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~TAtEV~~r~~E 390 (547) T protein:vir:10 315 TERGLI---S-DIDLGASGLTVVRDMESMKPFESRARFDVSSIQLTDLRSAVRRIYYVDQLQMKDSPAMTATEVQVRYEL 390 (547) T ss_pred cccccc---c-cceecCCeeeecCCcccceeeecccchHHHHHHHHHHHHHHHHHhhhhhhhcCCCccccHHHHHHHHHH Confidence 421111 1 122334555555555567777767777777777888877776644332222222234677776654443 Q ss_pred HHHHHH----HHH-HHHHHHHHHHHHHHHHHHHhcCCCcc----c---cccceeEEeCCCCCcCHH-HH-------HHHH Q lcl|NC_018086. 359 LENKSA----VKE-SKFRKVLAKRYELVCSYLEFMNKAKD----L---KPYEVTPVFVRNLPQSYA-EL-------ADMA 418 (511) Q Consensus 359 l~~k~~----~~~-~~~~~~l~~~~~li~~~~~~~~~~~~----~---~~~~i~i~f~~~~p~d~~-e~-------a~~~ 418 (511) +..... +.+ ..+..-+.+++.++. ..+.-.. . ....+.|.+..++-+... +. ++.+ T Consensus 391 ~~~~LG~v~~rl~~E~l~Pli~r~~~il~----r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v 466 (547) T protein:vir:10 391 MQRLLGPTLGRLENDFLSPMIQRTFNIRF----RAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGST 466 (547) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHH----hcCCCCCCchhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHHH Confidence 333321 111 233333344443332 2222111 1 234567777666554321 11 1222 Q ss_pred HHHhccCC-------hHHHHH----hCCCCC----CHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCC Q lcl|NC_018086. 419 VKLRDMLP-------DETIIN----QFPWIT----DARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNP 483 (511) Q Consensus 419 ~~~~g~~s-------~et~~~----~l~~v~----d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 483 (511) ..++++-| ...++. .+| ++ -.++|++.+++++++........... . ..+ +.-.+. T Consensus 467 ~~laq~~P~vld~id~d~~~~~~a~~~G-vp~~~irs~eev~~~r~qr~~~~q~~~qaa~~--~------~~g-~~m~~~ 536 (547) T protein:vir:10 467 AQLAEINPEVLDIPDWDEMVRMLGSLLG-APQTLMRPKAKVTSIRKNRSQTQQKAEQAAIA--E------AEG-NAMEAQ 536 (547) T ss_pred HHhhccChhhhhcCCHHHHHHHHHHHhC-CChhccCCHHHHHHHHHHHHHHHHHHHHHHHH--H------HHH-HHHHhh Confidence 22233222 222222 223 32 13577777776655433322111000 0 000 000000 Q ss_pred CCccccccCCCCcccccccc Q lcl|NC_018086. 484 ANTSTITTTDPVAAKEQEKA 503 (511) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~ 503 (511) |. +++.-++|+ T Consensus 537 ~~---------~~a~~~~~~ 547 (547) T protein:vir:10 537 GK---------GQAALKENQ 547 (547) T ss_pred cC---------cccchhccC Confidence 00 000111111 No 122 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=99.18 E-value=7.6e-10 Score=70.57 Aligned_cols=456 Identities=11% Similarity=0.019 Sum_probs=203.9 Q ss_pred hccCCCHHHHHHHHHHHHHH----HHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhcc--C- Q lcl|NC_018086. 24 IRRNFDLRELITLAEMHSRS----SSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGE--P- 96 (511) Q Consensus 24 ~~~~~~~~~l~~~~~~~~~~----~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~--~- 96 (511) -++..+.+.|.+..+..+.+ ..+++.+.+|..-...-..........+.+.++..+-+...++.+++.|++- | T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:98 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 11122333344444443332 3444455555422110001111112223345677788888888888887653 2 Q ss_pred ---c-eecC-ch------h-------hHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceE Q lcl|NC_018086. 97 ---I-TESG-DE------K-------TIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCL 158 (511) Q Consensus 97 ---~-~~~~-d~------~-------~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~ 158 (511) | ++.. |. + ....+...+..++|.....++.++..++|.|.+++..+..+.+++..++..+.+ T Consensus 81 ~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~~~ 160 (555) T protein:vir:98 81 ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGEYA 160 (555) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecceeE Confidence 2 2221 11 1 122345667788999999999999999999999888777777888888877776 Q ss_pred EEecCCCCCceEEEEEEEEEeec------------------CCcceE-EEEEEEcC----CcEEEEEEccCc--c--ccc Q lcl|NC_018086. 159 IAYSADLDEEPVAAIYYNTVISD------------------ITGHQI-RTYEVYTE----DLIYKFSTDDER--E--VYR 211 (511) Q Consensus 159 ~v~d~~~~~~~~~~v~~~~~~~~------------------~~~~~~-~~~~~~~~----~~i~~~~~~~~~--~--~~~ 211 (511) +.-|. .+.+...+|.+..... ...+.. .++++++. ............ + .++ T Consensus 161 v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~ 238 (555) T protein:vir:98 161 IAADN--QGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYF 238 (555) T ss_pred EeeCC--CCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEEEE Confidence 65443 3445666665432210 000111 12332221 000000000000 0 000 Q ss_pred ccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCcc Q lcl|NC_018086. 212 EIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSAD 286 (511) Q Consensus 212 ~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~ 286 (511) ........ -...-+|..+|++.++ .+.+|+|-.+...+-+..+|.+.-.....++....|.+.+...... T Consensus 239 ~~~~d~~~---vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~-- 313 (555) T protein:vir:98 239 EPGADETR---TLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKN-- 313 (555) T ss_pred EeccCCcc---ccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccc-- Confidence 00000000 0122234556666543 3568999999999999999998888888889888887765432111 Q ss_pred chhhhhhhhCceeeec--CCCce--eeeecCCCHHHHHHHHHHHHHHHHHHhCcc---ccccccccCccHHHHHHHHHHH Q lcl|NC_018086. 287 SDSISNMKNDRVIVTD--EDGMV--KFITKDVNDKHIENIKNRAKLDIFSLSQTP---DLVSKDFTAASGQALKAATQPL 359 (511) Q Consensus 287 ~~~~~~~~~~~~i~~~--~~~~~--~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p---~~~~~~~~~~Sg~Ai~~~~~~l 359 (511) . ...+..+++..+. ..++. -.+....+.......++.++..|...-... .+...+....|++.+......+ T Consensus 314 -~-~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~ 391 (555) T protein:vir:98 314 -Q-DISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEK 391 (555) T ss_pred -c-cceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHH Confidence 0 1122222221111 11221 122334466777777888888776554332 1222333346888776543333 Q ss_pred HHHHHH-----HHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHH-HH-------HHHHHHHhccCC Q lcl|NC_018086. 360 ENKSAV-----KESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYA-EL-------ADMAVKLRDMLP 426 (511) Q Consensus 360 ~~k~~~-----~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~-e~-------a~~~~~~~g~~s 426 (511) .....- ....+..-+.+.+.++.+....-.........+++|.+..++-+... +. ++.+..+.++-| T Consensus 392 ~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P 471 (555) T protein:vir:98 392 LLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKP 471 (555) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCCh Confidence 322211 11223333333333332210111112234445577777666554211 11 122222233222 Q ss_pred -------hHHHH----HhCCCCC----CHHHHHHHHHHHHHHHHHHHHhh-cccc----ccCCC-CCCccccccCC-CCC Q lcl|NC_018086. 427 -------DETII----NQFPWIT----DARQEVEKADAQRQKRADIALQN-FKQT----SAVQG-ASTAAANKLDK-NPA 484 (511) Q Consensus 427 -------~et~~----~~l~~v~----d~~~E~~ri~~E~~~~~~~~~~~-~~~~----~~~~~-~~~~~~~~~~~-~~~ 484 (511) ...++ ..+| ++ -.++|+++++++++......... ...+ ....+ ......+...+ -.+ T Consensus 472 ~vld~id~d~~~~~~a~~~G-vp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~ 550 (555) T protein:vir:98 472 EVLDKFDADRWADTYADMLG-IDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNALTDVTRA 550 (555) T ss_pred hhhhcCCHHHHHHHHHHHhC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhHHHHHhh Confidence 12222 2233 22 23567777766544332221111 1111 01000 00000000000 011 Q ss_pred Ccccc Q lcl|NC_018086. 485 NTSTI 489 (511) Q Consensus 485 ~~~~~ 489 (511) ..+=. T Consensus 551 ~~~~~ 555 (555) T protein:vir:98 551 FSGYT 555 (555) T ss_pred hccCC Confidence 11110 No 123 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=99.18 E-value=7.6e-10 Score=70.57 Aligned_cols=456 Identities=11% Similarity=0.019 Sum_probs=203.9 Q ss_pred hccCCCHHHHHHHHHHHHHH----HHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhcc--C- Q lcl|NC_018086. 24 IRRNFDLRELITLAEMHSRS----SSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGE--P- 96 (511) Q Consensus 24 ~~~~~~~~~l~~~~~~~~~~----~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~--~- 96 (511) -++..+.+.|.+..+..+.+ ..+++.+.+|..-...-..........+.+.++..+-+...++.+++.|++- | T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 11122333344444443332 3444455555422110001111112223345677788888888888887653 2 Q ss_pred ---c-eecC-ch------h-------hHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceE Q lcl|NC_018086. 97 ---I-TESG-DE------K-------TIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCL 158 (511) Q Consensus 97 ---~-~~~~-d~------~-------~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~ 158 (511) | ++.. |. + ....+...+..++|.....++.++..++|.|.+++..+..+.+++..++..+.+ T Consensus 81 ~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~~~ 160 (555) T protein:vir:10 81 ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGEYA 160 (555) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecceeE Confidence 2 2221 11 1 122345667788999999999999999999999888777777888888877776 Q ss_pred EEecCCCCCceEEEEEEEEEeec------------------CCcceE-EEEEEEcC----CcEEEEEEccCc--c--ccc Q lcl|NC_018086. 159 IAYSADLDEEPVAAIYYNTVISD------------------ITGHQI-RTYEVYTE----DLIYKFSTDDER--E--VYR 211 (511) Q Consensus 159 ~v~d~~~~~~~~~~v~~~~~~~~------------------~~~~~~-~~~~~~~~----~~i~~~~~~~~~--~--~~~ 211 (511) +.-|. .+.+...+|.+..... ...+.. .++++++. ............ + .++ T Consensus 161 v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~ 238 (555) T protein:vir:10 161 IAADN--QGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYF 238 (555) T ss_pred EeeCC--CCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEEEE Confidence 65443 3445666665432210 000111 12332221 000000000000 0 000 Q ss_pred ccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCcc Q lcl|NC_018086. 212 EIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSAD 286 (511) Q Consensus 212 ~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~ 286 (511) ........ -...-+|..+|++.++ .+.+|+|-.+...+-+..+|.+.-.....++....|.+.+...... T Consensus 239 ~~~~d~~~---vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~-- 313 (555) T protein:vir:10 239 EPGADETR---TLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKN-- 313 (555) T ss_pred EeccCCcc---ccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccc-- Confidence 00000000 0122234556666543 3568999999999999999998888888889888887765432111 Q ss_pred chhhhhhhhCceeeec--CCCce--eeeecCCCHHHHHHHHHHHHHHHHHHhCcc---ccccccccCccHHHHHHHHHHH Q lcl|NC_018086. 287 SDSISNMKNDRVIVTD--EDGMV--KFITKDVNDKHIENIKNRAKLDIFSLSQTP---DLVSKDFTAASGQALKAATQPL 359 (511) Q Consensus 287 ~~~~~~~~~~~~i~~~--~~~~~--~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p---~~~~~~~~~~Sg~Ai~~~~~~l 359 (511) . ...+..+++..+. ..++. -.+....+.......++.++..|...-... .+...+....|++.+......+ T Consensus 314 -~-~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~ 391 (555) T protein:vir:10 314 -Q-DISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEK 391 (555) T ss_pred -c-cceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHH Confidence 0 1122222221111 11221 122334466777777888888776554332 1222333346888776543333 Q ss_pred HHHHHH-----HHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHH-HH-------HHHHHHHhccCC Q lcl|NC_018086. 360 ENKSAV-----KESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYA-EL-------ADMAVKLRDMLP 426 (511) Q Consensus 360 ~~k~~~-----~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~-e~-------a~~~~~~~g~~s 426 (511) .....- ....+..-+.+.+.++.+....-.........+++|.+..++-+... +. ++.+..+.++-| T Consensus 392 ~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P 471 (555) T protein:vir:10 392 LLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKP 471 (555) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCCh Confidence 322211 11223333333333332210111112234445577777666554211 11 122222233222 Q ss_pred -------hHHHH----HhCCCCC----CHHHHHHHHHHHHHHHHHHHHhh-cccc----ccCCC-CCCccccccCC-CCC Q lcl|NC_018086. 427 -------DETII----NQFPWIT----DARQEVEKADAQRQKRADIALQN-FKQT----SAVQG-ASTAAANKLDK-NPA 484 (511) Q Consensus 427 -------~et~~----~~l~~v~----d~~~E~~ri~~E~~~~~~~~~~~-~~~~----~~~~~-~~~~~~~~~~~-~~~ 484 (511) ...++ ..+| ++ -.++|+++++++++......... ...+ ....+ ......+...+ -.+ T Consensus 472 ~vld~id~d~~~~~~a~~~G-vp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~ 550 (555) T protein:vir:10 472 EVLDKFDADRWADTYADMLG-IDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNALTDVTRA 550 (555) T ss_pred hhhhcCCHHHHHHHHHHHhC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhHHHHHhh Confidence 12222 2233 22 23567777766544332221111 1111 01000 00000000000 011 Q ss_pred Ccccc Q lcl|NC_018086. 485 NTSTI 489 (511) Q Consensus 485 ~~~~~ 489 (511) ..+=. T Consensus 551 ~~~~~ 555 (555) T protein:vir:10 551 FSGYT 555 (555) T ss_pred hccCC Confidence 11110 No 124 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=99.18 E-value=7.6e-10 Score=70.57 Aligned_cols=456 Identities=11% Similarity=0.019 Sum_probs=203.9 Q ss_pred hccCCCHHHHHHHHHHHHHH----HHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhcc--C- Q lcl|NC_018086. 24 IRRNFDLRELITLAEMHSRS----SSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGE--P- 96 (511) Q Consensus 24 ~~~~~~~~~l~~~~~~~~~~----~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~--~- 96 (511) -++..+.+.|.+..+..+.+ ..+++.+.+|..-...-..........+.+.++..+-+...++.+++.|++- | T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp 80 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMTSP 80 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhcCC Confidence 11122333344444443332 3444455555422110001111112223345677788888888888887653 2 Q ss_pred ---c-eecC-ch------h-------hHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceE Q lcl|NC_018086. 97 ---I-TESG-DE------K-------TIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCL 158 (511) Q Consensus 97 ---~-~~~~-d~------~-------~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~ 158 (511) | ++.. |. + ....+...+..++|.....++.++..++|.|.+++..+..+.+++..++..+.+ T Consensus 81 ~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~~~ 160 (555) T protein:vir:10 81 ARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGEYA 160 (555) T ss_pred CCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecceeE Confidence 2 2221 11 1 122345667788999999999999999999999888777777888888877776 Q ss_pred EEecCCCCCceEEEEEEEEEeec------------------CCcceE-EEEEEEcC----CcEEEEEEccCc--c--ccc Q lcl|NC_018086. 159 IAYSADLDEEPVAAIYYNTVISD------------------ITGHQI-RTYEVYTE----DLIYKFSTDDER--E--VYR 211 (511) Q Consensus 159 ~v~d~~~~~~~~~~v~~~~~~~~------------------~~~~~~-~~~~~~~~----~~i~~~~~~~~~--~--~~~ 211 (511) +.-|. .+.+...+|.+..... ...+.. .++++++. ............ + .++ T Consensus 161 v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~~~ 238 (555) T protein:vir:10 161 IAADN--QGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSVYF 238 (555) T ss_pred EeeCC--CCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEEEE Confidence 65443 3445666665432210 000111 12332221 000000000000 0 000 Q ss_pred ccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCcc Q lcl|NC_018086. 212 EIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSAD 286 (511) Q Consensus 212 ~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~ 286 (511) ........ -...-+|..+|++.++ .+.+|+|-.+...+-+..+|.+.-.....++....|.+.+...... T Consensus 239 ~~~~d~~~---vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~-- 313 (555) T protein:vir:10 239 EPGADETR---TLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKN-- 313 (555) T ss_pred EeccCCcc---ccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccc-- Confidence 00000000 0122234556666543 3568999999999999999998888888889888887765432111 Q ss_pred chhhhhhhhCceeeec--CCCce--eeeecCCCHHHHHHHHHHHHHHHHHHhCcc---ccccccccCccHHHHHHHHHHH Q lcl|NC_018086. 287 SDSISNMKNDRVIVTD--EDGMV--KFITKDVNDKHIENIKNRAKLDIFSLSQTP---DLVSKDFTAASGQALKAATQPL 359 (511) Q Consensus 287 ~~~~~~~~~~~~i~~~--~~~~~--~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p---~~~~~~~~~~Sg~Ai~~~~~~l 359 (511) . ...+..+++..+. ..++. -.+....+.......++.++..|...-... .+...+....|++.+......+ T Consensus 314 -~-~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~E~ 391 (555) T protein:vir:10 314 -Q-DISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHEEK 391 (555) T ss_pred -c-cceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHHHH Confidence 0 1122222221111 11221 122334466777777888888776554332 1222333346888776543333 Q ss_pred HHHHHH-----HHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHH-HH-------HHHHHHHhccCC Q lcl|NC_018086. 360 ENKSAV-----KESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYA-EL-------ADMAVKLRDMLP 426 (511) Q Consensus 360 ~~k~~~-----~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~-e~-------a~~~~~~~g~~s 426 (511) .....- ....+..-+.+.+.++.+....-.........+++|.+..++-+... +. ++.+..+.++-| T Consensus 392 ~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P 471 (555) T protein:vir:10 392 LLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKP 471 (555) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCCh Confidence 322211 11223333333333332210111112234445577777666554211 11 122222233222 Q ss_pred -------hHHHH----HhCCCCC----CHHHHHHHHHHHHHHHHHHHHhh-cccc----ccCCC-CCCccccccCC-CCC Q lcl|NC_018086. 427 -------DETII----NQFPWIT----DARQEVEKADAQRQKRADIALQN-FKQT----SAVQG-ASTAAANKLDK-NPA 484 (511) Q Consensus 427 -------~et~~----~~l~~v~----d~~~E~~ri~~E~~~~~~~~~~~-~~~~----~~~~~-~~~~~~~~~~~-~~~ 484 (511) ...++ ..+| ++ -.++|+++++++++......... ...+ ....+ ......+...+ -.+ T Consensus 472 ~vld~id~d~~~~~~a~~~G-vp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~ 550 (555) T protein:vir:10 472 EVLDKFDADRWADTYADMLG-IDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNALTDVTRA 550 (555) T ss_pred hhhhcCCHHHHHHHHHHHhC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchhHHHHHhh Confidence 12222 2233 22 23567777766544332221111 1111 01000 00000000000 011 Q ss_pred Ccccc Q lcl|NC_018086. 485 NTSTI 489 (511) Q Consensus 485 ~~~~~ 489 (511) ..+=. T Consensus 551 ~~~~~ 555 (555) T protein:vir:10 551 FSGYT 555 (555) T ss_pred hccCC Confidence 11110 No 125 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.17 E-value=9e-10 Score=70.18 Aligned_cols=393 Identities=11% Similarity=0.073 Sum_probs=186.6 Q ss_pred CccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCC-cCccccccce--eccc Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRT-FDDTNKPNSK--IVHN 79 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~-~~~~~~~~~r--i~~n 79 (511) |++ ++. +-+..+--|.++-..... .......-.. -.+. T Consensus 1 ~~~----------------------~~~-----------------d~~~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~ 41 (427) T protein:vir:10 1 MKI----------------------VKH-----------------DGYNDIFNGGADGSPKPFFMSDASYHVGSFYNDNA 41 (427) T ss_pred CCc----------------------ccc-----------------chHHHHhhcCCCCcccCccccCchHHHHHHHHcCc Confidence 000 000 111111122111000000 0000000001 1257 Q ss_pred hHHHHHHHHHhhhhccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCC----------Cc-eE Q lcl|NC_018086. 80 FPKLLVDTSTAYLAGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRN----------KK-HR 148 (511) Q Consensus 80 ~~k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~----------g~-~~ 148 (511) +++.+|+..+.-++.+++.+.++++. +.+...|.+=++...+.++.+.+..||.|++++-.+.. |. .. T Consensus 42 l~~~~Vd~~aed~~r~g~~i~g~~~~-~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~ 120 (427) T protein:vir:10 42 TAKRIVDVIPEEMVTAGFKMSGVKDE-KEFKSLWDSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEG 120 (427) T ss_pred hhhhhhccchHHhhcCCccccCccHH-HHHHHHHHHhhHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeE Confidence 78899999999999999999886543 56777787778889999999999999999988765432 11 22 Q ss_pred EEEEcccceEEEecCCCCCceEEE-EEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceec Q lcl|NC_018086. 149 FKAVSPMNCLIAYSADLDEEPVAA-IYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPN 227 (511) Q Consensus 149 i~~~~p~~~~~v~d~~~~~~~~~~-v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (511) +.+++|..+.+-..+.+...+-++ ..+|.+.... +. .. -.+.+.++++|... T Consensus 121 l~v~d~~~~~~~~~~~dp~s~~fg~P~~y~v~~~~-~~--~~-~~iH~SRli~~~g~----------------------- 173 (427) T protein:vir:10 121 VRVYDRFAITVEKRVTNARSPRYGEPEIYKVSPGD-NM--QP-YLIHHSRVFIADGE----------------------- 173 (427) T ss_pred EEEechhcccccccccCccccccCcceEEEEecCC-CC--cc-eEEccccEEEecCC----------------------- Confidence 444455444332111111100000 0111111100 00 00 01122222222100 Q ss_pred cCCccceEe-ecCCcccCchhH-HHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCC---Cccc--hhhhh------hh Q lcl|NC_018086. 228 LLQKFPVLE-IIANEERLGDFE-AQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDL---SADS--DSISN------MK 294 (511) Q Consensus 228 ~~g~iPvv~-~~n~~~g~s~~~-~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~---~~~~--~~~~~------~~ 294 (511) .+|-.. ..++-.|.|.+. .+.+-+..++++.-.....+..+....+.+.|... .... ..... .+ T Consensus 174 ---~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~ 250 (427) T protein:vir:10 174 ---RVAQQARKQNQGWGASVLNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNS 250 (427) T ss_pred ---CchhhhcccCCcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHHhcCccchHHHHHHHHHHHHhc Confidence 011111 123446888876 47787888888888888777777777777766421 1111 11111 11 Q ss_pred -hCceeeecC-CCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccc-cccc-c--CccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 295 -NDRVIVTDE-DGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLV-SKDF-T--AASGQALKAATQPLENKSAVKES 368 (511) Q Consensus 295 -~~~~i~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~-~~~~-~--~~Sg~Ai~~~~~~l~~k~~~~~~ 368 (511) ..+.+.+.+ +.+.+ ..+.+...+...++.....|...+++|-.- ++.. + |+||..=...|...+.-. .+. T Consensus 251 ~~~~~~~l~~~~e~~e--~~~~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~nyyd~i~~~--Qe~ 326 (427) T protein:vir:10 251 GVGRAIGIDAETEEYD--VLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRK--REE 326 (427) T ss_pred CcccceeeecCCCcee--EEecccCChHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHHH--HHH Confidence 133444443 34444 345666778888999999999999999643 3322 1 466675433333333222 235 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHH Q lcl|NC_018086. 369 KFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKAD 448 (511) Q Consensus 369 ~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~ 448 (511) .+...+++++++|+. . .++++.|+|-...+++|.|+...+.+...+. ++. .+ +-++++..++++ T Consensus 327 ~l~p~l~~l~~~i~~-------s-----~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~--~~~-~g-vi~~~e~r~~L~ 390 (427) T protein:vir:10 327 DYRPLLEFLLPFIVD-------E-----EEWSIEFEPLSVPSKKEESEITKNNVESVTK--AIT-EQ-IIDLEEARDTLR 390 (427) T ss_pred HHHHHHHHHHHHhhc-------C-----CCcEEEeCCCCCCCHHHHHHHHHHHHHHHHH--HHh-cC-CCCHHHHHHHHH Confidence 678888888887652 1 2678999999999999998876554322111 111 11 222333222222 Q ss_pred HHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccc Q lcl|NC_018086. 449 AQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEK 502 (511) Q Consensus 449 ~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (511) .. ... .+..+..+.+.+++....+.+ + .+...+.+++ T Consensus 391 ~~--------~~~----~~~~~~~~~~~e~~~~~~e~~-p----~~~e~~~d~~ 427 (427) T protein:vir:10 391 SI--------APE----FKLKDGNNINIREPEETTEPE-P----GLGEKLEDEN 427 (427) T ss_pred hh--------hcc----ccCCCCccccccccchhcCCC-C----CCCCCCCCCC Confidence 11 000 000111111111111111000 0 0001111111 No 126 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=99.16 E-value=9.9e-10 Score=69.94 Aligned_cols=463 Identities=10% Similarity=0.036 Sum_probs=204.6 Q ss_pred ccCCCHHHHHHHHHHHH----HHHHHHHHHHHHhc---CCCcccccCCcCccccccceeccchHHHHHHHHHhhhhcc-- Q lcl|NC_018086. 25 RRNFDLRELITLAEMHS----RSSSAYGVLYDYYK---GNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGE-- 95 (511) Q Consensus 25 ~~~~~~~~l~~~~~~~~----~~~~~~~~~~~yY~---G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~-- 95 (511) ...-+.+.|.+..+..+ .-..+++.+.+|.. +... ........+.+.++..+-+...++.+++.|++- T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~---~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~lt 77 (559) T protein:vir:95 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFL---TSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGIT 77 (559) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcC---CCCCCcccccccccccchHHHHHHHHHHHHHHhhc Confidence 11112233344333333 23345555555532 2110 001111122344667778888888888877653 Q ss_pred C----c-eecC-ch------hh-------HHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccc Q lcl|NC_018086. 96 P----I-TESG-DE------KT-------IKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMN 156 (511) Q Consensus 96 ~----~-~~~~-d~------~~-------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~ 156 (511) | | ++.. |+ +. ...+.+.+..++|.....++.++..++|.|.+++..+..+.++++.++..+ T Consensus 78 pp~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~~~~r~~~~~l~~ 157 (559) T protein:vir:95 78 SPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDDDEDIIRTMPFPIGS 157 (559) T ss_pred CCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecCCCceeEEEEeecCe Confidence 2 2 2221 11 11 122455677788999999999999999999988877666667888888887 Q ss_pred eEEEecCCCCCceEEEEEEEEEeec------------------CCcce-EEEEEEEcC----CcEEEEEEccCcc----c Q lcl|NC_018086. 157 CLIAYSADLDEEPVAAIYYNTVISD------------------ITGHQ-IRTYEVYTE----DLIYKFSTDDERE----V 209 (511) Q Consensus 157 ~~~v~d~~~~~~~~~~v~~~~~~~~------------------~~~~~-~~~~~~~~~----~~i~~~~~~~~~~----~ 209 (511) .++.-|. .+.+...+|.++.... .+.+. -.++++++. ............. + T Consensus 158 ~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~pf~s~ 235 (559) T protein:vir:95 158 YYLANSP--RGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFKSV 235 (559) T ss_pred EEEeeCC--CCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEEEeccccccccccccccceEEEE Confidence 7665554 3445666665432210 00011 112332211 0000000000000 0 Q ss_pred ccccccccccccccceeccCCccceEeec-----CCcccCch-hHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCC Q lcl|NC_018086. 210 YREIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGD-FEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDL 283 (511) Q Consensus 210 ~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~-~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~ 283 (511) ++........ -....+|..+|++.++ +..+|+|. .....+-+..+|.+.-......+....|.+.+.+... T Consensus 236 ~~e~~~~~~~---~l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~ 312 (559) T protein:vir:95 236 YYEVGGDNDK---LLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLK 312 (559) T ss_pred EEEecCCCce---eeecCCcccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCceecccccc Confidence 0000000000 0122344556655443 35689994 8889999999999999999999999999887654221 Q ss_pred CccchhhhhhhhCceeeecCCC---ceeee-ecCCCHHHHHHHHHHHHHHHHHHhCccc---cccccccCccHHHHHHHH Q lcl|NC_018086. 284 SADSDSISNMKNDRVIVTDEDG---MVKFI-TKDVNDKHIENIKNRAKLDIFSLSQTPD---LVSKDFTAASGQALKAAT 356 (511) Q Consensus 284 ~~~~~~~~~~~~~~~i~~~~~~---~~~~~-~~~~~~~~~~~~~~~l~~~i~~~s~~p~---~~~~~~~~~Sg~Ai~~~~ 356 (511) . ...++..+++...+..+ .++.+ +.+.+...+...++.++..|...-..-. +...+....|++.+.... T Consensus 313 ~----~~~~l~pgg~~~~~~~~~~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV~~r~ 388 (559) T protein:vir:95 313 N----QRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMK 388 (559) T ss_pred c----cceeeeccceeeeCCCCCcccceeecccccchHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHHHHHHH Confidence 1 11223333333333222 23332 1234455556666777776655444321 112222346888776654 Q ss_pred HHHHHHH----HHH-HHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCH-HHH-------HHHHHHHhc Q lcl|NC_018086. 357 QPLENKS----AVK-ESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSY-AEL-------ADMAVKLRD 423 (511) Q Consensus 357 ~~l~~k~----~~~-~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~-~e~-------a~~~~~~~g 423 (511) ..+.... .+. ...+..-+.+++.++.+....-.........+++|.+..++..-. .+. ++.+..+++ T Consensus 389 ~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~~~~laq 468 (559) T protein:vir:95 389 EEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQ 468 (559) T ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 4433332 111 223333444444443331111111223344567777765554311 111 122222233 Q ss_pred cCC-------hHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHHhh-ccccccCCCCCCccccccCCCCCCccc Q lcl|NC_018086. 424 MLP-------DETIINQF---PWIT----DARQEVEKADAQRQKRADIALQN-FKQTSAVQGASTAAANKLDKNPANTST 488 (511) Q Consensus 424 ~~s-------~et~~~~l---~~v~----d~~~E~~ri~~E~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (511) +-| ...++..+ -+++ -.++|++.++.++++.+..+... ...+.......-++..... ...-++. T Consensus 469 ~~Pevld~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~rqqr~~~qq~~q~~~~~~~aa~~~~~~~~~~~~~-~~~l~~~ 547 (559) T protein:vir:95 469 VKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQMMAMGMAAAQGVKTLSEAKTSD-PSVLSAM 547 (559) T ss_pred cChhhhhcCCHHHHHHHHHHHhCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCC-hhHHHHH Confidence 222 22233222 1122 13567777766554433321111 1010111000100110000 0000000 Q ss_pred cccCCCCccccc Q lcl|NC_018086. 489 ITTTDPVAAKEQ 500 (511) Q Consensus 489 ~~~~~~~~~~~~ 500 (511) .......++..+ T Consensus 548 ~~~~~~~~~~~~ 559 (559) T protein:vir:95 548 ANAVSGQGGQSQ 559 (559) T ss_pred HHhhcCccccCC Confidence 000011111111 No 127 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=99.15 E-value=8.4e-10 Score=70.34 Aligned_cols=466 Identities=12% Similarity=0.030 Sum_probs=183.7 Q ss_pred CCCccchhhcccccCchhhHhhhhc--------cCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCcc-cc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIR--------RNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDT-NK 71 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~-~~ 71 (511) -+|+|+.-+-=+-.+.+-.+.++.. .+...+.|-+.......-..+-......+.+.+. .+....+. .. T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~--~r~~~~~~~~l 80 (551) T protein:vir:80 3 NKLGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFK--TKPSIRNNQDL 80 (551) T ss_pred hhhhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccccceecCcccc--cCccccChhHH Confidence 5667775544222222222333222 1222222222111100000000000001111100 00011110 00 Q ss_pred ccc--ee-ccchHHHHHHHHHhhhhc-----------cCceec----------CchhhHHHHHHHHhcc---------Ch Q lcl|NC_018086. 72 PNS--KI-VHNFPKLLVDTSTAYLAG-----------EPITES----------GDEKTIKAMQPVFKEN---------YV 118 (511) Q Consensus 72 ~~~--ri-~~n~~k~ivd~~~~~l~g-----------~~~~~~----------~d~~~~~~l~~~~~~n---------~~ 118 (511) +.. .. ..+..+.+|+..++.+.. .|+.+. .+....+.+.+++.+- .+ T Consensus 81 ~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~ 160 (551) T protein:vir:80 81 HGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSF 160 (551) T ss_pred HHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccchH Confidence 000 01 124445555555544321 222221 1122233455555432 23 Q ss_pred hHHHHHHHHHHhhCCeEEEEeeeCCCCceE-EEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCc Q lcl|NC_018086. 119 TDVNSEEVKLSGIFGHCFEIHWIDRNKKHR-FKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDL 197 (511) Q Consensus 119 ~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~-i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 197 (511) ..+...+..+.+.+|.+|+.+..+.+|++. +..++|..+.++.++.... ....++|+... .+... ..|.++. T Consensus 161 ~~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~~-~~~~~~y~~~~---~g~~~---~~~~~~e 233 (551) T protein:vir:80 161 SSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKI-PDNGNRFVQVI---DQKIV---ATFNARE 233 (551) T ss_pred HHHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCcccc-ccCceEEEEEe---CCcEE---EEEcccc Confidence 356667788889999999988888888864 7789999887776554311 11112222211 11111 1234444 Q ss_pred EEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeE Q lcl|NC_018086. 198 IYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLW 277 (511) Q Consensus 198 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~ 277 (511) ++|++.. |........+|.|.++.+...+.....+..-....+...+.|-.+ T Consensus 234 iiH~~~n----------------------------~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~gi 285 (551) T protein:vir:80 234 MAFAVRN----------------------------PRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGI 285 (551) T ss_pred eEEeccc----------------------------CCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceE Confidence 4444311 000000112477777777777766666655555556666666644 Q ss_pred e--ecCC-CCcc--chhhhhhh--------hCceeee-cCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccccc Q lcl|NC_018086. 278 L--QGFD-LSAD--SDSISNMK--------NDRVIVT-DEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKD 343 (511) Q Consensus 278 ~--~G~~-~~~~--~~~~~~~~--------~~~~i~~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~ 343 (511) + .|.. .++. +.....+. .+++..+ .++.+.+.+..+.....+....+...+.|+..-++|....+. T Consensus 286 L~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~ 365 (551) T protein:vir:80 286 LQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINI 365 (551) T ss_pred EEEcCCCCCCHHHHHHHHHHHHHHhcCccccCccccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCc Confidence 4 4432 2221 11111111 1222233 333344444434444556677788888898888888655442 Q ss_pred ccCc-----cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHH Q lcl|NC_018086. 344 FTAA-----SGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMA 418 (511) Q Consensus 344 ~~~~-----Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~ 418 (511) .+.. .+..+-. +. ........+..+|.-+++.|...+...-. ..+. ..+.+.|......+.++.+... T Consensus 366 ~~~~~~~~~~~~s~t~--sn---~e~~~~~f~~~tL~P~~~~ie~~ln~~L~-~~~~-~~~~f~f~~~~~~~~~~~~~~~ 438 (551) T protein:vir:80 366 PNNGGATGSKGGSLNE--GN---SAEKNQASKNKGLQPLLGFIEDFINKHIV-AEFG-DKYTFQFVGGDIKSELESVKIL 438 (551) T ss_pred ccccccccccccccch--hh---HHHHHHHHHHHHHHHHHHHHHHHHHhhhc-cccC-CceEEEeeccChhhHHHHHHHH Confidence 2111 0111100 00 00111133334444444444333332111 1112 3467888877777777666544 Q ss_pred HHH-hccCChHHHHHhCCCCCC-H--HHHH-----HH----HHHHH--HHHHHHHHhhccccccCCCCC--CccccccCC Q lcl|NC_018086. 419 VKL-RDMLPDETIINQFPWITD-A--RQEV-----EK----ADAQR--QKRADIALQNFKQTSAVQGAS--TAAANKLDK 481 (511) Q Consensus 419 ~~~-~g~~s~et~~~~l~~v~d-~--~~E~-----~r----i~~E~--~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~ 481 (511) ... .|+++.-.++.+++.-+. + +.-+ .. ...++ .+..+...+...+..+...+. +...+..+. T Consensus 439 ~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 518 (551) T protein:vir:80 439 AEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDT 518 (551) T ss_pred HHHhcCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCcccc Confidence 322 488998888888865321 1 1000 00 00000 000000000000100000000 000001111 Q ss_pred CCCCccccccCCCCccccccc----cCCCCCCC Q lcl|NC_018086. 482 NPANTSTITTTDPVAAKEQEK----AIQKKPKT 510 (511) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~ 510 (511) +++....++..++.++++... -.++-.+| T Consensus 519 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 551 (551) T protein:vir:80 519 TGDIGKDGQRKDKDNANAGKQGMKGDKPNDWQT 551 (551) T ss_pred CCCccccccccCccccchhhhhcCCCCccccCC Confidence 111222222222333222211 11222223 No 128 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=99.11 E-value=1.9e-09 Score=68.45 Aligned_cols=460 Identities=11% Similarity=0.029 Sum_probs=204.9 Q ss_pred ccCCCHHHHHHHHHHHHH----HHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhcc--C-- Q lcl|NC_018086. 25 RRNFDLRELITLAEMHSR----SSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGE--P-- 96 (511) Q Consensus 25 ~~~~~~~~l~~~~~~~~~----~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~--~-- 96 (511) ....+.+.|.+..+..+. -..+++.+.+|..-.--...........+.+.++..+-+...++.+++.|++- | T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp~ 80 (556) T protein:vir:73 1 MAETEKERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMSGITSPA 80 (556) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcCccccchHHHHHHHHHHHHHHhhcCCC Confidence 111122333333333332 23455556666422100000011111122334677778888888888877653 2 Q ss_pred --c-eecC-ch-------------hhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEE Q lcl|NC_018086. 97 --I-TESG-DE-------------KTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLI 159 (511) Q Consensus 97 --~-~~~~-d~-------------~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~ 159 (511) | ++.. |. .....+.+.+..++|.....++.++..++|.|.+++..++.+.+++..++..+.++ T Consensus 81 ~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~r~~~~~l~~~~~ 160 (556) T protein:vir:73 81 RPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDDQDVIRTMPFPIGSYYL 160 (556) T ss_pred CcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecCCceEEEEEeecceeEE Confidence 2 2221 11 11223556677889999999999999999999998887777778888888887766 Q ss_pred EecCCCCCceEEEEEEEEEee---------c----------CCcceEEEEEEEc----CCcEEEEEEccCcc----cccc Q lcl|NC_018086. 160 AYSADLDEEPVAAIYYNTVIS---------D----------ITGHQIRTYEVYT----EDLIYKFSTDDERE----VYRE 212 (511) Q Consensus 160 v~d~~~~~~~~~~v~~~~~~~---------~----------~~~~~~~~~~~~~----~~~i~~~~~~~~~~----~~~~ 212 (511) .-|. .+.+...+|.++..- + ..+..-.++++++ .............. +++. T Consensus 161 ~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~ 238 (556) T protein:vir:73 161 ANSP--RGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHCITPNVNRDSGKMDSKNKPYRSVYFE 238 (556) T ss_pred eeCC--CCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEEEeccccccccccCcccceEEEEEEE Confidence 5443 344566666554321 0 0011011223221 10000000000000 0000 Q ss_pred cccccccccccceeccCCccceEeec-----CCcccCch-hHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCcc Q lcl|NC_018086. 213 IPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGD-FEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSAD 286 (511) Q Consensus 213 ~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~-~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~ 286 (511) ........ ...-+|..+|++.++ ++.+|+|. .....+-+..+|.+.-......+....|.+.+...... T Consensus 239 ~~~~~~~v---l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~-- 313 (556) T protein:vir:73 239 SGGDSDKL---LRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPMVAPTSLKN-- 313 (556) T ss_pred ecCCCcee---cccCCcccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCceeccccccc-- Confidence 00000000 112345556665443 45689994 99999999999999999999999999998876543111 Q ss_pred chhhhhhhhCceeee--cCC-Cceeeee-cCCCHHHHHHHHHHHHHHHHHHhCccc---cccccccCccHHHHHHHHHHH Q lcl|NC_018086. 287 SDSISNMKNDRVIVT--DED-GMVKFIT-KDVNDKHIENIKNRAKLDIFSLSQTPD---LVSKDFTAASGQALKAATQPL 359 (511) Q Consensus 287 ~~~~~~~~~~~~i~~--~~~-~~~~~~~-~~~~~~~~~~~~~~l~~~i~~~s~~p~---~~~~~~~~~Sg~Ai~~~~~~l 359 (511) ...++..+++... ..+ .+++.+. ...+...+...++.++..|...-.... +...+....|++.+......+ T Consensus 314 --~~~~~~pgg~~~~~~~~~~~~i~p~~~~~~d~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r~TAtEv~~r~~E~ 391 (556) T protein:vir:73 314 --QRVSLLPGDVTYLDVISGQDGFKPAYLVNPNTADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEK 391 (556) T ss_pred --cceeeccCccccccCCCCccceeeeccccccHHHHHHHHHHHHHHHHHHhhcchhhhhccCCCCCccHHHHHHHHHHH Confidence 1122333333322 222 2345432 234556666667777777655443321 122222346888776654433 Q ss_pred HHHH----HHH-HHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHH-HH-------HHHHHHHhccCC Q lcl|NC_018086. 360 ENKS----AVK-ESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYA-EL-------ADMAVKLRDMLP 426 (511) Q Consensus 360 ~~k~----~~~-~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~-e~-------a~~~~~~~g~~s 426 (511) .... .+. ...+..-+.+++.++.+.-..-.........+++|.+..++-.... .. ++.+..++++-| T Consensus 392 ~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aqk~~~~~~i~~~~~~~~~laq~~P 471 (556) T protein:vir:73 392 LLMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRIEYISVMAQAQKSIGLTSLSQTVGFIGQLAQFKP 471 (556) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccCh Confidence 3332 111 1223333444444333211111112233445677887665543211 11 122222223222 Q ss_pred -------hHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHHh--hccccccCCCCCCcc--ccccCCCCCCccc Q lcl|NC_018086. 427 -------DETIINQF---PWIT----DARQEVEKADAQRQKRADIALQ--NFKQTSAVQGASTAA--ANKLDKNPANTST 488 (511) Q Consensus 427 -------~et~~~~l---~~v~----d~~~E~~ri~~E~~~~~~~~~~--~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 488 (511) ...++..+ -+++ -.++|++.+++++.+....... ...+..+..+..... .+...-+..-.+. T Consensus 472 e~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~r~~~qq~~~~~~~~~~a~~~~~~~~~~~~~~~~~l~~~~~~~ 551 (556) T protein:vir:73 472 EALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQAQAAQAMAMGQAAAQGAKTLSETQTSDPSALTAIANAA 551 (556) T ss_pred hhHhcCCHHHHHHHHHHHcCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCHHHHHHHHHhh Confidence 22233322 1122 2356666665554433322111 111000000000000 0000000011111 Q ss_pred cccCC Q lcl|NC_018086. 489 ITTTD 493 (511) Q Consensus 489 ~~~~~ 493 (511) ++.++ T Consensus 552 g~~~~ 556 (556) T protein:vir:73 552 GAPQQ 556 (556) T ss_pred cCCCC Confidence 11111 No 129 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=99.10 E-value=2.1e-09 Score=68.19 Aligned_cols=465 Identities=12% Similarity=0.046 Sum_probs=183.7 Q ss_pred CccchhhcccccCchhhHhhhhccC---CCHHHH-HHHHHHHHHHH-HHHHHHHHHhcCCCcccc-cCCcCcc-ccccc- Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRN---FDLREL-ITLAEMHSRSS-SAYGVLYDYYKGNHIAIQ-SRTFDDT-NKPNS- 74 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~l-~~~~~~~~~~~-~~~~~~~~yY~G~~~~~~-~~~~~~~-~~~~~- 74 (511) |.|+.-+.-.+.+.+..+.+..... ++..-+ ...+++..... .-|..-.-+......... ++...+. ..+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~ 80 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVL 80 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHHHH Confidence 8888766665555555555443311 111111 01111111000 001000011111111100 0111110 00000 Q ss_pred -e-eccchHHHHHHHHHhhhhc-----------cCc--eec--------CchhhHHHHHHHHhcc---------ChhHHH Q lcl|NC_018086. 75 -K-IVHNFPKLLVDTSTAYLAG-----------EPI--TES--------GDEKTIKAMQPVFKEN---------YVTDVN 122 (511) Q Consensus 75 -r-i~~n~~k~ivd~~~~~l~g-----------~~~--~~~--------~d~~~~~~l~~~~~~n---------~~~~~~ 122 (511) . ...++.+.+|+..++.+.+ -++ ++. .+......+.+++.+- .+..+. T Consensus 81 ~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~f~ 160 (547) T protein:vir:63 81 KKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFV 160 (547) T ss_pred HHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHHHHH Confidence 1 1124455555555443321 111 111 1222233455555431 234566 Q ss_pred HHHHHHHhhCCeEEEEeeeCCCCceE-EEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEE Q lcl|NC_018086. 123 SEEVKLSGIFGHCFEIHWIDRNKKHR-FKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKF 201 (511) Q Consensus 123 ~~~~~~a~~~G~~~~~v~~~~~g~~~-i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 201 (511) ..+..+.+.+|.+|+.+..+.+|++. +..++|..+.++.+.... .....++|+... ++... ..+.++.++|+ T Consensus 161 ~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~~---~~~~~---~~~~~~eiih~ 233 (547) T protein:vir:63 161 KKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGK-IPDNGNRFVQVI---DQKIV---ATFNAREMAFA 233 (547) T ss_pred HHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCccc-cccCceEEEEEc---CCcEE---EEeccccEEEe Confidence 77888899999999988888888764 677899888777654321 111112222211 11111 12334444444 Q ss_pred EEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeE--ee Q lcl|NC_018086. 202 STDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLW--LQ 279 (511) Q Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~--~~ 279 (511) +.. |........+|.|.++.+...+.....+..-....+...+.|-.+ +. T Consensus 234 r~n----------------------------~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~ 285 (547) T protein:vir:63 234 VRN----------------------------PRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIK 285 (547) T ss_pred ccc----------------------------CCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEec Confidence 210 000000122577878777776666665555555555655666544 34 Q ss_pred cCC-CCcc--chhhhhhh--------hCceeeecCCCceeeeecC--CCHHHHHHHHHHHHHHHHHHhCccccccccccC Q lcl|NC_018086. 280 GFD-LSAD--SDSISNMK--------NDRVIVTDEDGMVKFITKD--VNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA 346 (511) Q Consensus 280 G~~-~~~~--~~~~~~~~--------~~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~ 346 (511) |.. .++. +.....+. .+++..+. +++++|.... .....+....+...+.|+..-++|....+..+. T Consensus 286 ~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~vl~-~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~ 364 (547) T protein:vir:63 286 AAQQQSQHALEIFKREWKNSLSGINGSWQIPVVS-AEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNN 364 (547) T ss_pred CCCCCCHHHHHHHHHHHHHHhcCccccccccccc-CCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccc Confidence 432 2221 11111111 12232332 2344554433 444556666777888888888888655442211 Q ss_pred c-----cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHH- Q lcl|NC_018086. 347 A-----SGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVK- 420 (511) Q Consensus 347 ~-----Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~- 420 (511) . ++..+-. +. ..+.....+..+|.-+++.|...++..-.. .+. ..+.+.|......+..+.+..... T Consensus 365 ~~~~~~~~~s~t~--sn---~e~~~~~~~~~tL~P~~~~ie~~ln~~L~~-~~~-~~~~~~f~~~~~~~~~~~~~~~~~~ 437 (547) T protein:vir:63 365 GGATGSKGGSLNE--GN---SAEKNQASKNKGLQPLLGFIEDFINKHIVA-EFG-DKYTFQFVGGDIKSELESVKILAEK 437 (547) T ss_pred cccccccccccch--hh---HHHHHHHHHHHHHHHHHHHHHHHHHhhccc-ccC-CceEEEeeccccccHHHHHHHHHHH Confidence 0 1111110 00 001111233444444444444433321111 111 246778887777777776654332 Q ss_pred HhccCChHHHHHhCCCCCC---HHHHH-----HHH----HHHHH--HHHHHHHhhcccccc-CCCCCCc-cccccCCCCC Q lcl|NC_018086. 421 LRDMLPDETIINQFPWITD---ARQEV-----EKA----DAQRQ--KRADIALQNFKQTSA-VQGASTA-AANKLDKNPA 484 (511) Q Consensus 421 ~~g~~s~et~~~~l~~v~d---~~~E~-----~ri----~~E~~--~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~~~~~ 484 (511) ..|+++.-.++.+++.-.. -+.-+ ..+ ..++. +..+...+......+ ..+..+. .....+.++. T Consensus 438 ~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 517 (547) T protein:vir:63 438 AKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGD 517 (547) T ss_pred hCCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCCccccchhhccccccccCCCCCCCCCCCCCCcccCCC Confidence 2488998888888765321 11000 000 00000 000000000000000 0000000 0000011111 Q ss_pred CccccccCCCCccccccc----cCCCCCCC Q lcl|NC_018086. 485 NTSTITTTDPVAAKEQEK----AIQKKPKT 510 (511) Q Consensus 485 ~~~~~~~~~~~~~~~~~~----~~~~~~~~ 510 (511) ....++..++..+++... -.++-.+| T Consensus 518 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 547 (547) T protein:vir:63 518 IGKDGQRKDKDNANAGKQGMKGDKPNDWQT 547 (547) T ss_pred cCccccccCccccchhhhhcCCCCccccCC Confidence 111222222222222211 11111222 No 130 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=99.09 E-value=2.5e-09 Score=67.75 Aligned_cols=452 Identities=10% Similarity=0.002 Sum_probs=202.2 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHH----HHHHHHHHHHHHhcCCCcccccCCcCcccccccee Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHS----RSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKI 76 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~----~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri 76 (511) || .. ..+.+..+.+.+..+.++ .-..+++.+.+|..-.- .............++ T Consensus 1 m~-------------~~------~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~---~~~~~~~~~~~~~~~ 58 (535) T protein:vir:33 1 MA-------------DS------KRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSL---FPKESDNESTDYTTP 58 (535) T ss_pred CC-------------hh------hhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc---cCCCCCccccccccc Confidence 11 10 112223333433333333 33455556666654421 111111111222345 Q ss_pred ccchHHHHHHHHHhhhhcc--C---c-eecCch---------------------hhHHHHHHHHhccChhHHHHHHHHHH Q lcl|NC_018086. 77 VHNFPKLLVDTSTAYLAGE--P---I-TESGDE---------------------KTIKAMQPVFKENYVTDVNSEEVKLS 129 (511) Q Consensus 77 ~~n~~k~ivd~~~~~l~g~--~---~-~~~~d~---------------------~~~~~l~~~~~~n~~~~~~~~~~~~a 129 (511) ..+-+...++.+++.|++- | | ++...+ .....+...+..++|.....++.++. T Consensus 59 ~dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L 138 (535) T protein:vir:33 59 WQAVGARGLNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQL 138 (535) T ss_pred ccccHHHHHHHHHHHHHHhhcCCCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHH Confidence 5666777777777776652 2 2 121111 11123445577889999999999999 Q ss_pred hhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeec--------------CCcceEEEEEEEcC Q lcl|NC_018086. 130 GIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISD--------------ITGHQIRTYEVYTE 195 (511) Q Consensus 130 ~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~--------------~~~~~~~~~~~~~~ 195 (511) .++|.|.+++..+..+.++++.++-.+. ++-.+.. +.+...+|.++.... ...+....+++|+. T Consensus 139 ~~~G~a~l~~~~~~~~~~~f~~~pl~~~-~v~~d~~-G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~ 216 (535) T protein:vir:33 139 IVAGNALLYLPEPEGSYNPMKLYRLSSY-VVQRDAY-GNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTH 216 (535) T ss_pred HhhCceeEEeecCCCCceeeEEEEcCee-EEeeCCC-CCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEE Confidence 9999999988777666677887765544 4433433 445666665543310 00011111222221 Q ss_pred CcEEEEEEccCcccccccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 196 DLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAY 270 (511) Q Consensus 196 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~ 270 (511) - +.....+.+.... ..............+|..+|++.++ ++.+|+|-..+..+-+..+|.+.-......+. T Consensus 217 v---~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~ 292 (535) T protein:vir:33 217 V---YLDEESGDYLKYE-EVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMI 292 (535) T ss_pred E---EeeCCCCcEEEEE-EEeCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 0111111111111 0001101111222346667776544 35789999999999999999999999999999 Q ss_pred hcCceeEeecCCCCccchhhhhh-h-hCceeeecCCCceeeee--cCCCHHHHHHHHHHHHHHHHHHhCccccccccccC Q lcl|NC_018086. 271 WNDAYLWLQGFDLSADSDSISNM-K-NDRVIVTDEDGMVKFIT--KDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA 346 (511) Q Consensus 271 ~~~p~l~~~G~~~~~~~~~~~~~-~-~~~~i~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~ 346 (511) ...|.+.+.-.... . ...+ . ..+.+.....++++.+. ...+.......++.++..|...-..-.+..-+... T Consensus 293 ~~~p~~lv~~~g~~---~-~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~r 368 (535) T protein:vir:33 293 SAKVIGLVNPAGIT---Q-PRRLTKAQTGDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAVQRTGER 368 (535) T ss_pred HhcCceeecccccc---c-hhhcccCCceeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCcc Confidence 99998765321111 1 1111 1 12233323334455543 33456667777777777775543222221122234 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHhcCCCccccccceeEEeCCCCCcCH-HHHHHH Q lcl|NC_018086. 347 ASGQALKAATQPLENKSAVKESKFRKVLAK--------RYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSY-AELADM 417 (511) Q Consensus 347 ~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~--------~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~-~e~a~~ 417 (511) .|++.+..... ++...++..+.+ +++.++.++...+.-.......+.+.|.-++..-- .+.++. T Consensus 369 ~TAtEV~~r~~-------E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~ 441 (535) T protein:vir:33 369 VTAEEIRYVAS-------ELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDK 441 (535) T ss_pred ccHHHHHHHHH-------HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHH Confidence 57776655433 333333333333 22222333333333333444457777765555321 111222 Q ss_pred H----HHHhccCC--------hHHHHHhC---CCCC-----CHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCcccc Q lcl|NC_018086. 418 A----VKLRDMLP--------DETIINQF---PWIT-----DARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAAN 477 (511) Q Consensus 418 ~----~~~~g~~s--------~et~~~~l---~~v~-----d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (511) + ..++++-| ...++..+ -+++ ..++|++.+.+++.+..... .......+..++. ..+ T Consensus 442 l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~q~~~~~~~~-~~~~~~g~~~~~~--~~~ 518 (535) T protein:vir:33 442 LERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGVE-NAAAAGGAGVGAL--ATS 518 (535) T ss_pred HHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHhHhcCCHHHHHHHHHHHHHHHHHH-HHHHhhhhhhcch--hhc Confidence 2 22222211 12222222 1222 23556666555443322221 1111111111111 000 Q ss_pred ccCCCCCCccccccCCCCccccc Q lcl|NC_018086. 478 KLDKNPANTSTITTTDPVAAKEQ 500 (511) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~ 500 (511) + ..+.++-.+.-|-+.+ T Consensus 519 ----~--~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:33 519 ----S--PEAMQGAAAKAGLNAT 535 (535) T ss_pred ----C--ChhHHHHHHhccCCCC Confidence 0 1111111111111111 No 131 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=99.08 E-value=2.5e-09 Score=67.70 Aligned_cols=446 Identities=11% Similarity=0.009 Sum_probs=196.3 Q ss_pred ccCCCH---HHHHHHHHHHHHH-HHHHHHHHHHhcCCCccc------ccCCcCccccccceeccchHHHHHHHHHhhhhc Q lcl|NC_018086. 25 RRNFDL---RELITLAEMHSRS-SSAYGVLYDYYKGNHIAI------QSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAG 94 (511) Q Consensus 25 ~~~~~~---~~l~~~~~~~~~~-~~~~~~~~~yY~G~~~~~------~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g 94 (511) ..+.+. +.|.+..+..+.+ .+...+++++|+---+-. .........+.+.++..+-+...++.+++.|++ T Consensus 1 m~~d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAs~l~~ 80 (549) T protein:vir:10 1 MTNDDAKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFDSTAPLALRNFVAAMDS 80 (549) T ss_pred CCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccccccccchHHHHHHHHHHHHHh Confidence 222111 1222222222222 222233334443221110 011111122234466777788888888887765 Q ss_pred c--C----c-eecC-chh------hHHH-------HHHHH--hccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEE Q lcl|NC_018086. 95 E--P----I-TESG-DEK------TIKA-------MQPVF--KENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKA 151 (511) Q Consensus 95 ~--~----~-~~~~-d~~------~~~~-------l~~~~--~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~ 151 (511) - | | ++.. ++. .... +...+ ..++|.....++.++..++|.|.+++..+..+.++++. T Consensus 81 ~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~~~~~~f~~ 160 (549) T protein:vir:10 81 MITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVGKGIVYRN 160 (549) T ss_pred hccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeecCCCeeEEEE Confidence 3 2 2 2222 211 1111 22222 25778888899999999999999998877666677888 Q ss_pred EcccceEEEecCCCCCceEEEEEEEEEeec------------------CCcceEEEEEEEcC---C-cEEEEEEccCc-- Q lcl|NC_018086. 152 VSPMNCLIAYSADLDEEPVAAIYYNTVISD------------------ITGHQIRTYEVYTE---D-LIYKFSTDDER-- 207 (511) Q Consensus 152 ~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~------------------~~~~~~~~~~~~~~---~-~i~~~~~~~~~-- 207 (511) ++-.+.++.-|. .+.+...+|.+...-. ...+....+++|+. . ....-...... T Consensus 161 ~pl~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~~~~p 238 (549) T protein:vir:10 161 VPMQRLWFAENN--SGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADRDPRKLDGRNMQ 238 (549) T ss_pred EEcCeEEEeeCC--CCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEEEeecCCCCCccccccccCc Confidence 777766555543 3445666655432210 01112233444321 0 00000000000 Q ss_pred --ccccccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeec Q lcl|NC_018086. 208 --EVYREIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQG 280 (511) Q Consensus 208 --~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G 280 (511) .++... .. . .-....+|..+|++.++ ++.+|+|-.++..+-+..+|.+.-......+....|.+.+.- T Consensus 239 f~sv~~e~-~~-~---~il~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~ 313 (549) T protein:vir:10 239 FASYWLDE-GR-D---RIVQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPPLLANE 313 (549) T ss_pred eEEEEEEe-cC-C---EeeccCCcccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecc Confidence 000000 00 0 01112345556665443 357899999999999999999999999999999999887642 Q ss_pred CCCCccchhhhhhhhCce--eee--cCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccc-cccccCccHHHHHHH Q lcl|NC_018086. 281 FDLSADSDSISNMKNDRV--IVT--DEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLV-SKDFTAASGQALKAA 355 (511) Q Consensus 281 ~~~~~~~~~~~~~~~~~~--i~~--~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~-~~~~~~~Sg~Ai~~~ 355 (511) ..... ..++..++. +.. .++..+..+....+.......++.++..|...-...-+. ..+....|++.+... T Consensus 314 ~g~~~----~~~l~pgg~~~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV~~r 389 (549) T protein:vir:10 314 DGVLD----GFDLRSGALNWGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQR 389 (549) T ss_pred ccccc----cceeccCCccccccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHHHHH Confidence 21111 011221221 111 223335555455566666777777777666544332111 112234677776655 Q ss_pred HHHHHHHH----HHH-HHHHHHHHHHHHHHHHHHHHhcCCCc----cc--cccceeEEeCCCCCcCH-HHHH-------H Q lcl|NC_018086. 356 TQPLENKS----AVK-ESKFRKVLAKRYELVCSYLEFMNKAK----DL--KPYEVTPVFVRNLPQSY-AELA-------D 416 (511) Q Consensus 356 ~~~l~~k~----~~~-~~~~~~~l~~~~~li~~~~~~~~~~~----~~--~~~~i~i~f~~~~p~d~-~e~a-------~ 416 (511) ...+.... .+. ...+..-+.+.+.++.+ .+.-. .. ...++.|.|..++-+.. .+.+ + T Consensus 390 ~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r----~g~lP~~p~~l~~~~~~~~i~yis~La~aq~~~~~~~i~~~~~ 465 (549) T protein:vir:10 390 AQEKGVLLAPTLGRTQSELLGPMIAREVDILAE----AGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQ 465 (549) T ss_pred HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh----cCCCCCCChhhhcCCceeEEEeecHHHHHHHHHHHHHHHHHHH Confidence 44333332 221 22333444444444332 22211 11 23456677655444321 1111 2 Q ss_pred HHHHHhccCC-------hHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHH-HhhccccccCCCCCCccccccCC Q lcl|NC_018086. 417 MAVKLRDMLP-------DETIINQF---PWIT----DARQEVEKADAQRQKRADIA-LQNFKQTSAVQGASTAAANKLDK 481 (511) Q Consensus 417 ~~~~~~g~~s-------~et~~~~l---~~v~----d~~~E~~ri~~E~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 481 (511) .+..++++-| ...++..+ -+++ -.++|++.++++++++.... +.......+. .+...++ T Consensus 466 ~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~~qqq~~~~~~~a~~a~~------~a~~~~~ 539 (549) T protein:vir:10 466 QLGIVSQFDPAAAKVPNGARIARLLADYGGVPVEAMSTDEELQAQQAAEAQAAQMQQMLAAAPVAAG------AIKDLSD 539 (549) T ss_pred HHHHHhccChhHHhcCCHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHhhhh Confidence 2222222212 12222222 1122 13566766665443322221 1111110000 0000000 Q ss_pred CCCCccccccCCCC Q lcl|NC_018086. 482 NPANTSTITTTDPV 495 (511) Q Consensus 482 ~~~~~~~~~~~~~~ 495 (511) . .+..++..+ T Consensus 540 ~----~ta~~~~~~ 549 (549) T protein:vir:10 540 A----QTAAQTARV 549 (549) T ss_pred h----cCCCcccCC Confidence 0 000111111 No 132 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=98.90 E-value=1.8e-08 Score=63.10 Aligned_cols=450 Identities=8% Similarity=0.014 Sum_probs=187.5 Q ss_pred ccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhcc------Cc- Q lcl|NC_018086. 25 RRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGE------PI- 97 (511) Q Consensus 25 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~------~~- 97 (511) ....-...+..+-++...-..+++.+.+|..-.- .............++..+-+...++.+++.|++- || T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~---~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~WF 77 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLPYL---LTEDGHASGGRLQQPYQSLGSKGVNALSSKLMLSLFPIQTSFF 77 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc---CCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCccc Confidence 0000011222233332333455556666654311 1111111112223555677778888888777652 22 Q ss_pred eecC-----------chh----h-------HHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEccc Q lcl|NC_018086. 98 TESG-----------DEK----T-------IKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPM 155 (511) Q Consensus 98 ~~~~-----------d~~----~-------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~ 155 (511) ++.. +++ . ...+.+.+..++|.....++.++..++|.|.+++ +++ . ++.++-. T Consensus 78 ~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~--~~~-~--~~~~pl~ 152 (542) T protein:vir:78 78 KLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFA--GKK-T--LKVYPLD 152 (542) T ss_pred cccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEe--cCC-C--ceEEecc Confidence 2221 111 1 1234556778899999999999999999997765 343 2 3444333 Q ss_pred ceEEEecCCCCCceEEEEEEEEEeec---------------------CCcceEEEEE-EEcCCc--EEEEEEccCccccc Q lcl|NC_018086. 156 NCLIAYSADLDEEPVAAIYYNTVISD---------------------ITGHQIRTYE-VYTEDL--IYKFSTDDEREVYR 211 (511) Q Consensus 156 ~~~~v~d~~~~~~~~~~v~~~~~~~~---------------------~~~~~~~~~~-~~~~~~--i~~~~~~~~~~~~~ 211 (511) + +++--+.. +.+...+|.+..... ..+..+..+. ++.... +++..........+ T Consensus 153 ~-y~v~~d~~-G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~~~~~~~s~ 230 (542) T protein:vir:78 153 R-YVIERDGD-GNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCKLVDGQHRW 230 (542) T ss_pred e-eEEeeCCC-CCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccccccCCCeEEE Confidence 3 44433333 445556665543311 0001111111 111111 11111000000000 Q ss_pred ccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCcc Q lcl|NC_018086. 212 EIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSAD 286 (511) Q Consensus 212 ~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~ 286 (511) ................+|..+|++.++ .+.+|+|-..+..+-+..+|.+.-......+....|.+.+.-... .. T Consensus 231 ~~e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~-~~ 309 (542) T protein:vir:78 231 HQECDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSPSAT-TK 309 (542) T ss_pred EEEeccccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccc-cc Confidence 000000000011223466667776544 356899999999999999999999999999999999876532111 11 Q ss_pred chhhhhhhhCceeeecCCCceeeee--cCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHH Q lcl|NC_018086. 287 SDSISNMKNDRVIVTDEDGMVKFIT--KDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSA 364 (511) Q Consensus 287 ~~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~ 364 (511) ....... ..+.+.....++++.+. ...+.......++.++..|...-..-. .-+....|++.+... .+ T Consensus 310 ~~~~~~~-~~g~iv~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~~~--~~d~~rvTAtEV~~r-------~~ 379 (542) T protein:vir:78 310 PQSLARA-GTGAIIQGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFLILN--VRQSERTTATEVREV-------QM 379 (542) T ss_pred hhhcccC-CCceeecCCccceeeeecccccchhHHHHHHHHHHHHHHHHhcccc--cCCcccccHHHHHHH-------HH Confidence 1111111 12233323334455443 334566677777777777765433211 111223566665554 33 Q ss_pred HHHHHHHHHHHHH--------HHHHHHHHHhcCCCccccccceeEEeCCCCCcC-HHHHHHHH----HHHhccCChHHH- Q lcl|NC_018086. 365 VKESKFRKVLAKR--------YELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQS-YAELADMA----VKLRDMLPDETI- 430 (511) Q Consensus 365 ~~~~~~~~~l~~~--------~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d-~~e~a~~~----~~~~g~~s~et~- 430 (511) ++...++..+.++ ++-++.++...+.-......-+++.+.-++..- ..+.++.+ +.++.++..+.+ T Consensus 380 E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~lv~~~~~s~La~~~r~~~~~~l~~~~~~i~~~~~p~~l~ 459 (542) T protein:vir:78 380 ELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLPKGLVMPTVVAGLGGVGRGEDRAALIEFMQTVGQAMGPEALQ 459 (542) T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeeechHHHHHHHHHHHHHHHHHHHHHHhcCChhHH Confidence 4444444433332 111222232233222222223566665544321 11112222 111122222222 Q ss_pred --------HHhC---CCCC-----CHHHHHHHHHHHHHHHHHH-HH-hhccccccCCCCCCccccccCCCCCCccccccC Q lcl|NC_018086. 431 --------INQF---PWIT-----DARQEVEKADAQRQKRADI-AL-QNFKQTSAVQGASTAAANKLDKNPANTSTITTT 492 (511) Q Consensus 431 --------~~~l---~~v~-----d~~~E~~ri~~E~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (511) +..+ -+++ ...+|++..+++.+++... ++ ...++..+ ...++....+....+..++. T Consensus 460 ~~id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al~~~a~~~a~-----~~~~~~~~~~~~a~~~~~~~ 534 (542) T protein:vir:78 460 QFIDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQQQMTASLMGQAGQLAK-----SPIGEKMMQQINAPGQEAPA 534 (542) T ss_pred hcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHhhhhccc-----cccccchhhhcCCCCcCCCC Confidence 2222 1232 2245555554443332221 11 11111110 01111100010001111111 Q ss_pred CCCccccc Q lcl|NC_018086. 493 DPVAAKEQ 500 (511) Q Consensus 493 ~~~~~~~~ 500 (511) .|+-++.- T Consensus 535 ~~~~~~~~ 542 (542) T protein:vir:78 535 GPQTGEDL 542 (542) T ss_pred CCcccccC Confidence 11111111 No 133 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=98.87 E-value=2.3e-08 Score=62.43 Aligned_cols=399 Identities=9% Similarity=0.002 Sum_probs=164.6 Q ss_pred cccceeccchHHHHHHHHHhhhhccCceec-----C-c---hhhHHHHHHHHhc---c-----------ChhHHHHHHHH Q lcl|NC_018086. 71 KPNSKIVHNFPKLLVDTSTAYLAGEPITES-----G-D---EKTIKAMQPVFKE---N-----------YVTDVNSEEVK 127 (511) Q Consensus 71 ~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~-----~-d---~~~~~~l~~~~~~---n-----------~~~~~~~~~~~ 127 (511) ....--..++....|+..++.+.+-|+.+- . . ....+.+.++|.. | .+......+.. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 111111257788899999999988887652 0 1 1122333343332 2 23355667888 Q ss_pred HHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccC Q lcl|NC_018086. 128 LSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDE 206 (511) Q Consensus 128 ~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ 206 (511) +...+|.||+.+..+..|++ .+..++|..+.+.-|... +.... .+...+ +.++......... T Consensus 81 ~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~---------~~~~~---~~~~~~-~~~~~~~~~~~~~---- 143 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERG---------FVQLL---EEKEKY-FGVAGDRYQTNGN---- 143 (467) T ss_pred HHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecce---------eEeec---CCceee-EEeccccceeecc---- Confidence 89999999998888888875 477788887766543321 00000 011111 1111111000000 Q ss_pred cccccccccccccccccceeccCCccceEeecCC-----cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEe--e Q lcl|NC_018086. 207 REVYREIPEELEIKDYEVHPNLLQKFPVLEIIAN-----EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWL--Q 279 (511) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~-----~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~--~ 279 (511) +.................. .+..=-|++++.. ..|.|.+......++....+..-....+...+.|-.++ + T Consensus 144 ~~~~~~~~~~~~~~~~~~~--~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 221 (467) T protein:vir:31 144 GDLDPVFVDADDGSTGTSV--SNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVK 221 (467) T ss_pred cceeeeeeeecccccccee--EeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec Confidence 0000000000000000000 1111124555432 25777777665555544443333333444455555444 4 Q ss_pred cCCCCccchhhhhhh-----------------------hCceeeecCCCceeeeec--------CCCHHHHHHHHHHHHH Q lcl|NC_018086. 280 GFDLSADSDSISNMK-----------------------NDRVIVTDEDGMVKFITK--------DVNDKHIENIKNRAKL 328 (511) Q Consensus 280 G~~~~~~~~~~~~~~-----------------------~~~~i~~~~~~~~~~~~~--------~~~~~~~~~~~~~l~~ 328 (511) |...+. +....++ ....+.+..+.+.+.+.. ......+....+...+ T Consensus 222 ~~~l~~--e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~ 299 (467) T protein:vir:31 222 GAELTE--KGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEH 299 (467) T ss_pred CcCCCH--HHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHH Confidence 533322 2221111 012233343433322211 1123455666777788 Q ss_pred HHHHHhCcccccccccc-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CccccccceeEEeCC Q lcl|NC_018086. 329 DIFSLSQTPDLVSKDFT-AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNK--AKDLKPYEVTPVFVR 405 (511) Q Consensus 329 ~i~~~s~~p~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~--~~~~~~~~i~i~f~~ 405 (511) .|+..-++|..-.+... +..+..++.... ..+..+|.-+++.|...++..-. ........+++.+.. T Consensus 300 ~Ia~~fgVpp~~lG~~~~~~~~s~~e~~~~----------~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~ 369 (467) T protein:vir:31 300 DILKVHDVPPVIAGVVESGAFSTDAEEQRK----------EFAEETIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAK 369 (467) T ss_pred HHHHHhCCCHHHcccCCCCCcccCHHHHHH----------HHHHHHHHHHHHHHHHHHHHhhcchhhccCCceEEEecch Confidence 88888888865443221 111111111111 11122233333333333221111 111122346666777 Q ss_pred CCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCC Q lcl|NC_018086. 406 NLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNP 483 (511) Q Consensus 406 ~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 483 (511) ....|..+.++++.++ .|+++...+++++++-+-++.++.- .......... +..+. ....+++...+ T Consensus 370 l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi~d~~~~~------~~~~~~~~~~----~~~~~-~~~~~~~~~~~ 438 (467) T protein:vir:31 370 PDTKLQDVEIASQRVQAMQGLLTVNELRDEFGFEPFPEEHVYG------GETLVAEVTG----GSGPG-GGIGDQIEQLV 438 (467) T ss_pred hhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcccccC------Cccccccccc----ccCCC-CcccCcCCCCC Confidence 7888999999988877 5799999999998763322211100 0000000000 00000 00000000000 Q ss_pred CCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 484 ANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) .+....-...-...-+++...+.-...| T Consensus 439 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 466 (467) T protein:vir:31 439 EDRADEIIDSYQADLETEQLIEIGANAD 466 (467) T ss_pred CCcccchHhhhhhccccchhhhhccccC Confidence 0000000000011112222222222222 No 134 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=98.84 E-value=3.1e-08 Score=61.79 Aligned_cols=454 Identities=9% Similarity=0.014 Sum_probs=193.3 Q ss_pred CccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHH----HHHHHHHHHHHhcCCCcccccCCcCccccccceecc Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSR----SSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVH 78 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~ 78 (511) |++... +. .+..+.+.+..+.++. -..+++.+.+|..-.- .............++.. T Consensus 1 ~~~~~~-----------~~-----~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~---~~~~~~~~~~~~~~~~d 61 (535) T protein:vir:94 1 MASSQK-----------RE-----GFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSL---FPKDSDNASTDYTTPWQ 61 (535) T ss_pred CCchhh-----------hh-----hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc---CCCCCCccccccCCccc Confidence 222222 11 1222223333333333 2455556666654321 11111111222345666 Q ss_pred chHHHHHHHHHhhhhcc-----Cc-eecCch--------------hhH-------HHHHHHHhccChhHHHHHHHHHHhh Q lcl|NC_018086. 79 NFPKLLVDTSTAYLAGE-----PI-TESGDE--------------KTI-------KAMQPVFKENYVTDVNSEEVKLSGI 131 (511) Q Consensus 79 n~~k~ivd~~~~~l~g~-----~~-~~~~d~--------------~~~-------~~l~~~~~~n~~~~~~~~~~~~a~~ 131 (511) +-+...++.+++.|++- || ++...+ +.. ..+...+..++|.....++.++..+ T Consensus 62 st~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~ 141 (535) T protein:vir:94 62 AVGARGLNNLASKLMLALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVV 141 (535) T ss_pred ccHHHHHHHHHHHHHhhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHh Confidence 77777788877776652 22 121111 111 1234446678999999999999999 Q ss_pred CCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeec-------------CCcceEEEEEEEcCCcE Q lcl|NC_018086. 132 FGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISD-------------ITGHQIRTYEVYTEDLI 198 (511) Q Consensus 132 ~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~-------------~~~~~~~~~~~~~~~~i 198 (511) +|.|.+++..+++...+++.++-.+ +++-.+.. +++...+|.++.... ...+....+++|+.- T Consensus 142 ~G~a~l~~~~~~~~~~~f~~~pl~~-y~v~~d~~-G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~v-- 217 (535) T protein:vir:94 142 AGNALLYIPEPEGTYNPMKLYRLSS-YVVQRDAF-GTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHI-- 217 (535) T ss_pred hCcEeEeeccCcCcccceEEEEcCe-EEEeeCCC-CCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEEE-- Confidence 9999888876654445566665444 44443433 446666665543311 011122334444321 Q ss_pred EEEEEccCcccccccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_018086. 199 YKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWND 273 (511) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~ 273 (511) +.......+...... ............+|..+|++.++ ++.+|+|-..+..+-+..+|.+.-........... T Consensus 218 -~~~~~~~~~~~~~e~-~g~~~~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~ 295 (535) T protein:vir:94 218 -YLDEESGEYLKYEEI-DGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAK 295 (535) T ss_pred -EeeCCCCcEEEEEEe-cCeeeccccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 111111111111100 01001111233467777877554 35689999999999999999887777777777777 Q ss_pred ceeEeecCCCCccchhhhhhhhCceeeecCCCceeeee--cCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHH Q lcl|NC_018086. 274 AYLWLQGFDLSADSDSISNMKNDRVIVTDEDGMVKFIT--KDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQA 351 (511) Q Consensus 274 p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~A 351 (511) |.+.+.-... ......... ..+.+.....+++..+. ...+.......++.++..|...-....+..-+....|++. T Consensus 296 ~~~lv~p~g~-~~~~~~~~~-~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~d~~rvTAtE 373 (535) T protein:vir:94 296 VIGLVNPAGI-TQVRRLTKA-QTGDFVSGRPEDISFLQLEKAADFSVARAVSEQIEGRLSYAFMLNSAVQRTGERVTAEE 373 (535) T ss_pred CCcccccccc-cchhhcccC-CCceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhHhhhccCCCCCccHHH Confidence 7655431100 111111111 12233322233444433 3345566667777777766654432222222223457777 Q ss_pred HHHHHHHHHHHHH----H-HHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHH------- Q lcl|NC_018086. 352 LKAATQPLENKSA----V-KESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAV------- 419 (511) Q Consensus 352 i~~~~~~l~~k~~----~-~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~------- 419 (511) ++.....+..... + ....+..-+.+++.+ +...+.-......-+.+.+.-++. .+...+.+. T Consensus 374 V~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~i----l~r~g~lP~~p~~~v~~~~vs~la--~l~r~~~~~~l~~~~~ 447 (535) T protein:vir:94 374 IRYVASELEDTLGGVYSILSQELQLPMVRVLLKQ----LQATNQIPELPKEAVEPTISTGME--ALGRGQDLDKLERCIA 447 (535) T ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHH----HHhCCCCCCCChhhccceEeehHH--HHHHHHHHHHHHHHHH Confidence 6554333332221 1 112222233333333 222232222222224444433322 222222222 Q ss_pred HHhccCC--------hHHHHHhC---CCCC-----CHHHHHHHHHHHHHHHHHHH--HhhccccccCCCCCCccccccCC Q lcl|NC_018086. 420 KLRDMLP--------DETIINQF---PWIT-----DARQEVEKADAQRQKRADIA--LQNFKQTSAVQGASTAAANKLDK 481 (511) Q Consensus 420 ~~~g~~s--------~et~~~~l---~~v~-----d~~~E~~ri~~E~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 481 (511) .++++-| ...++..+ -+++ ..++|++.+.+++++....+ +...++..+..+..... T Consensus 448 ~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs~eev~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~------ 521 (535) T protein:vir:94 448 AWSALAPMQGDPDINIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQGTAMQNAAASAGAGAGTMATASPE------ 521 (535) T ss_pred HHHhhChHHhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccChH------ Confidence 2222222 12222222 1222 23566666655544333221 11111111111100000 Q ss_pred CCCCccccccCCCC Q lcl|NC_018086. 482 NPANTSTITTTDPV 495 (511) Q Consensus 482 ~~~~~~~~~~~~~~ 495 (511) .......+.+..|. T Consensus 522 ~~~~~~~~~g~~~~ 535 (535) T protein:vir:94 522 NMKAAAAQAGMAPN 535 (535) T ss_pred HHHHHHHHhccCCC Confidence 00001111122222 No 135 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=98.83 E-value=3.4e-08 Score=61.52 Aligned_cols=423 Identities=7% Similarity=-0.005 Sum_probs=175.1 Q ss_pred CCCccc---hhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccc------c Q lcl|NC_018086. 1 MAIPNG---QINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTN------K 71 (511) Q Consensus 1 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~------~ 71 (511) -.+-|| ++++-+.+-.+++++. +.+....--+.|...-.-.....+.. . T Consensus 60 ~~~~~~~~~~~~kk~~i~~pfkkk~----------------------~~~~~d~f~~s~es~s~vtsls~pdaf~~vnVs 117 (945) T protein:vir:10 60 YSIIIFRKNQVLKKEKIIVPYNHQE----------------------PPFKFNLFEYSPESLMYLPSISDPDAFFLINLF 117 (945) T ss_pred eeeeeehhhhHHHhhcccccccccc----------------------cchhhhhhhccCccceecccccCccceeeehhh Confidence 223333 3322222222222221 11110000122222110000000000 0 Q ss_pred ccceeccchHHHHHHHHHhhhhccCceec---Cch---------hhHHHHHHHHhc-cCh-------hHHHHHHHHHHhh Q lcl|NC_018086. 72 PNSKIVHNFPKLLVDTSTAYLAGEPITES---GDE---------KTIKAMQPVFKE-NYV-------TDVNSEEVKLSGI 131 (511) Q Consensus 72 ~~~ri~~n~~k~ivd~~~~~l~g~~~~~~---~d~---------~~~~~l~~~~~~-n~~-------~~~~~~~~~~a~~ 131 (511) .+..+...-....|+..++-+-+-|+.+- .+. .....+..++.+ |.. ......+..+.+. T Consensus 118 ~~~AlknsaV~scI~~IA~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL 197 (945) T protein:vir:10 118 RKYRFNNDSKLIKVSEIPKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILT 197 (945) T ss_pred hhhhhccHHHHHHHHHHHhhhccCceEEEEecccCcccccccccccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhh Confidence 01112223344466666666666777541 111 112234555543 321 1245567789999 Q ss_pred CCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccc Q lcl|NC_018086. 132 FGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVY 210 (511) Q Consensus 132 ~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 210 (511) +|.+|+.+..+.+|++ .+..++|..+.+..+++... .. +|... .++... ..+.+..++++... T Consensus 198 ~GNAYieIiRd~~G~ii~L~pLdPs~Vti~~ddDG~~--~y---~Yv~~--idG~~~---~~v~a~DvIlhirn------ 261 (945) T protein:vir:10 198 IDRGAIVKIRDEQGNLVAITPVDGTTIKPILSEDTGI--VV---GYVQE--VDGAIV---AHFDKRDVVLFRQN------ 261 (945) T ss_pred cCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCCcE--EE---EEEEe--cCCceE---EEecCCceEEEecc------ Confidence 9999999988888986 57889999887766554321 11 11111 112111 11223222221110 Q ss_pred cccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHh----cCce--eEeecCCCC Q lcl|NC_018086. 211 REIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYW----NDAY--LWLQGFDLS 284 (511) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~----~~p~--l~~~G~~~~ 284 (511) ++.-| .....|.|.++.+. +++...+.-.....++| +.|- +.+.|.... T Consensus 262 ---------------~s~DG-------~~~GyGlSPIeaa~---~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~ 316 (945) T protein:vir:10 262 ---------------LTPDV-------YMYGYSLPPIEILY---KVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYK 316 (945) T ss_pred ---------------CCCCc-------ccccCCchHHHHHH---HHHHHHHHHHHHHHHHHHhCCCccceEEEecCcccc Confidence 00000 00113555555444 44443333333333333 3453 333332111 Q ss_pred -------ccchhhhhh----h-------hCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccC Q lcl|NC_018086. 285 -------ADSDSISNM----K-------NDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA 346 (511) Q Consensus 285 -------~~~~~~~~~----~-------~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~ 346 (511) -.++....+ . .++.+.++++.+.+.++.......+.+..+...+.|+..-++|+...+.... T Consensus 317 d~k~~~~LseEq~erlKe~wee~~sG~NnG~piVLdeGmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~lLG~~e~ 396 (945) T protein:vir:10 317 EGDIYPQLSREQLESIQRQLQAIMMGDYTQVPILSGGKFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSPQDVGILEG 396 (945) T ss_pred ccccccccCHHHHHHHHHHHHHHhCCcccccceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccCCC Confidence 111111111 1 1223445555555555544455566677788888899999998765544333 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hcc Q lcl|NC_018086. 347 ASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDM 424 (511) Q Consensus 347 ~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~ 424 (511) .++..++.... ..+..+|.-++..|...++..- ........+.+.|+.....+..+.++++.++ .|+ T Consensus 397 st~SNiEqq~~----------~Fv~~tL~Pil~~IEqeLNrkL-l~~~eg~~i~fdFd~ldl~D~ksraEal~kli~sGi 465 (945) T protein:vir:10 397 SNKATAEVMAS----------LTKAKGLEPLMATISKGFDEVV-SEFRNEKDIKLWFKEDDLEKERDWWNIIQGQLNTGF 465 (945) T ss_pred CCcchHHHHHH----------HHHHHHHHHHHHHHHHHHHHhc-cccccCceeEEEecchhccCHHHHHHHHHHHHhCCC Confidence 33333322211 1222333333333333222110 1112234577888777777888889888877 478 Q ss_pred CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhc-cccccCCCCCCccccccCCCCCCccccccCCCCcccccccc Q lcl|NC_018086. 425 LPDETIINQFPWITDARQEVEKADAQRQKRADIALQNF-KQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKA 503 (511) Q Consensus 425 ~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (511) ++.-.++++++.-+-+.. +..-. ...+. +......+..+... ++.+.+...++.+.++....++. T Consensus 466 LTiNEvRe~lGLpPIeGG--D~lli--------~~nn~~P~d~~~ka~~ga~p----~q~aq~~~dqp~~kGGe~dEns~ 531 (945) T protein:vir:10 466 RSINEARMEKGLEPVPWG--DVPFS--------GLRNWKPEDEQAKAQQGAMP----PQLAQAMADQPSQQGGGVDENSS 531 (945) T ss_pred cCHHHHHHHhCCCCCCCc--ceeee--------ccccccccccccccccCCCC----cccccCCCCCCCCCCCCCCCCCC Confidence 999889888865321110 00000 00000 00000000000000 01111111112222222233334 Q ss_pred CCCCCCCC Q lcl|NC_018086. 504 IQKKPKTD 511 (511) Q Consensus 504 ~~~~~~~~ 511 (511) .+..++++ T Consensus 532 ~psE~kda 539 (945) T protein:vir:10 532 VPSEQKNA 539 (945) T ss_pred CCCcccch Confidence 44444544 No 136 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=98.82 E-value=3.6e-08 Score=61.37 Aligned_cols=452 Identities=9% Similarity=0.020 Sum_probs=185.4 Q ss_pred CCHHHHHHHHHHHHH----HHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhcc------Cc Q lcl|NC_018086. 28 FDLRELITLAEMHSR----SSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGE------PI 97 (511) Q Consensus 28 ~~~~~l~~~~~~~~~----~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~------~~ 97 (511) |. +.+.+..+.++. -..+++.+.+|..-.- .............++..+-+...++.+++.|++- || T Consensus 1 m~-~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~---~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~W 76 (555) T protein:vir:17 1 MK-HSAQAKYMMLRADREDYLDSGRQSARLTLPYI---LTDEGHVQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNTSF 76 (555) T ss_pred Ch-hHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc---cCCCCCcccccccccccccHHHHHHHHHHHHHHhhcCCCCcc Confidence 21 112233333332 2345555555554321 1111111222334566777788888888877652 22 Q ss_pred -eecCch----------hh-----------HHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEccc Q lcl|NC_018086. 98 -TESGDE----------KT-----------IKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPM 155 (511) Q Consensus 98 -~~~~d~----------~~-----------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~ 155 (511) ++...+ +. ...+...+..++|.....++.++..++|.+.+++ ++++ + ++++ . T Consensus 77 F~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~--~~~~-~--~~~p-l 150 (555) T protein:vir:17 77 FKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLYQ--GKKN-L--KLYP-L 150 (555) T ss_pred cccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEe--cCCc-e--eEEE-c Confidence 222111 11 1124445667899999999999999999987655 4443 3 3333 3 Q ss_pred ceEEEecCCCCCceEEEEEEEEEeecC----Cc-------------------------------ceEEEEEEEcCCcEEE Q lcl|NC_018086. 156 NCLIAYSADLDEEPVAAIYYNTVISDI----TG-------------------------------HQIRTYEVYTEDLIYK 200 (511) Q Consensus 156 ~~~~v~d~~~~~~~~~~v~~~~~~~~~----~~-------------------------------~~~~~~~~~~~~~i~~ 200 (511) .-+++--+.. +.+...+|.++..... -+ .....+++|+.- T Consensus 151 ~~y~v~~d~~-G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~~---- 225 (555) T protein:vir:17 151 DRFVVSRDGE-GNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTYV---- 225 (555) T ss_pred CeEEEeeCCC-cCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcceeEeecc---- Confidence 3344443433 4455566555422110 00 000112222210 Q ss_pred EEEccCcccccccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_018086. 201 FSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAY 275 (511) Q Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~ 275 (511) ....+.+.... ..............+|..+|++.++ ++.+|+|-..+..+-+..+|.+.-......+....|. T Consensus 226 -~~~~~~~~~~~-e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp 303 (555) T protein:vir:17 226 -CRKDGQVKWHQ-ECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVV 303 (555) T ss_pred -cccCCeeEEEE-ecCceeccccccccCcccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc Confidence 00000010000 0000000011224456667776544 3578999999999999999999988999999999998 Q ss_pred eEeecCCCCccchhhhhhhhCceeeecCCCceeeeecC--CCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHH Q lcl|NC_018086. 276 LWLQGFDLSADSDSISNMKNDRVIVTDEDGMVKFITKD--VNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALK 353 (511) Q Consensus 276 l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~ 353 (511) +.+.-..... .... .....+.+.....++++.+... .+.......++.++..|...-..- +..+....|++.+. T Consensus 304 ~lv~~~g~~~-~~~l-~~~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~~--~~~d~~r~TAtEV~ 379 (555) T protein:vir:17 304 FMVSPSATTK-PQNL-ALAANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLML--QVRQSERTTATEVQ 379 (555) T ss_pred eeeccccccC-ccee-ecCCCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhhc--CCCCcccchHHHHH Confidence 7653211111 0000 0111233332223345554422 345566666777776665443221 12222345776665 Q ss_pred HHHHHHHHHHHH----H-HHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCc-CHHHHHH----HHHHHhc Q lcl|NC_018086. 354 AATQPLENKSAV----K-ESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQ-SYAELAD----MAVKLRD 423 (511) Q Consensus 354 ~~~~~l~~k~~~----~-~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~-d~~e~a~----~~~~~~g 423 (511) .....+.....- . ...+..-+.+++.+ +...+.-......-+.+.+.-.+.. ...+.++ .+..+++ T Consensus 380 ~r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~i----l~r~g~lP~~p~~~v~~~i~~~l~~l~r~~~~~~l~~~~~~laq 455 (555) T protein:vir:17 380 ATVQELNEQIGGIYSNLTTELLQPYLARKLHL----LQKQRKLPQLPKDLVQPTVVAGLWGVGRGQDKQQLMEFITTLAQ 455 (555) T ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH----HHhCCCCCCCCHhhhccceeehHHHHHHHHHHHHHHHHHHHHHh Confidence 543333322211 1 12222233333332 2222221111111122222211111 0111111 1222222 Q ss_pred c---------CChHHH----HHhCCC----CCCHHHHHHHHHHHHHHHHHHH--HhhccccccCCCCCCccccccCCCCC Q lcl|NC_018086. 424 M---------LPDETI----INQFPW----ITDARQEVEKADAQRQKRADIA--LQNFKQTSAVQGASTAAANKLDKNPA 484 (511) Q Consensus 424 ~---------~s~et~----~~~l~~----v~d~~~E~~ri~~E~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (511) + +....+ ...+|. +-..++|+++++.++++++... +....+..+...++.....-..++. T Consensus 456 ~~~~p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~~~qa~~~~~~~~~~~~~~~~~~~~~- 534 (555) T protein:vir:17 456 TMGPEIAMKYINPTEFIKRLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASLINQAGQLAKTPMAEQAMQLIQQQQE- 534 (555) T ss_pred hcCchhHhhcCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHhccccchh- Confidence 2 111222 333342 1134567777765544333221 1111111111111100000000000 Q ss_pred CccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 485 NTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) +-..++.+.++..+++-.+.- T Consensus 535 ------~a~~~~~a~~~~~~~~~~~~~ 555 (555) T protein:vir:17 535 ------GAQDAGAAESETSSAEAQAGA 555 (555) T ss_pred ------hhhHHHHHHhhcCCcccccCC Confidence 011112222222222221111 No 137 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=98.80 E-value=4.2e-08 Score=61.00 Aligned_cols=425 Identities=10% Similarity=0.064 Sum_probs=180.5 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCc-cccccceeccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDD-TNKPNSKIVHN 79 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~-~~~~~~ri~~n 79 (511) |-++. |++.+.+...... ..+.+.|-|.... ....... ......-...+ T Consensus 1 ~~~~~-----~~~~~~p~~~~~~------------------------~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~ 50 (518) T protein:vir:78 1 MLLAN-----GQTLSAPAMAELS------------------------PQMQDSYYYAPAV-GMQLERQFSLYGGIYKNQP 50 (518) T ss_pred CcccC-----ceeeccchhhhhh------------------------hhhhhccccccee-ceecccccchhhHHhhhhH Confidence 32222 2222222211110 0112222221110 0000000 00000001123 Q ss_pred hHHHHHHHHHhhhhccCcee---cCch---hhHHHHHHHHhc-cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-E Q lcl|NC_018086. 80 FPKLLVDTSTAYLAGEPITE---SGDE---KTIKAMQPVFKE-NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-R 148 (511) Q Consensus 80 ~~k~ivd~~~~~l~g~~~~~---~~d~---~~~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~ 148 (511) .....|+..++-+-+-|+.+ ..+. .....+..++.+ |. .......+..+.+.+|.+|+++-.+..|++ . T Consensus 51 ~V~acV~~IA~~iA~lp~~l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~ 130 (518) T protein:vir:78 51 WVRTVIAKRAQALARLPVKCMFTSGDTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEK 130 (518) T ss_pred HHHHHHHHHHHhhccCceEEEEEcCCccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEE Confidence 44556666666665556654 1111 111223444433 32 234566778888899999999998888886 4 Q ss_pred EEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceecc Q lcl|NC_018086. 149 FKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNL 228 (511) Q Consensus 149 i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (511) +..++|..+.+.++..... +.|+.... .+... ....+.++.+++++.-. . T Consensus 131 L~~l~p~~Vtv~~~~~~~~-----~~y~~~~~--~~~~~-~~~~~~~~eIiHir~~~----------------------~ 180 (518) T protein:vir:78 131 LMPMHPSRVAIKRNSRTGR-----YEYYFQAG--AGVGT-QLVSFADDEVVPIRFFN----------------------P 180 (518) T ss_pred EEEECCCceEEEEcCCCCE-----EEEEEEec--CCccc-eeEEecCCcEEEecCCC----------------------C Confidence 7788898888777653321 11111111 11110 11123444444432100 0 Q ss_pred CCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhh------------hhC Q lcl|NC_018086. 229 LQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNM------------KND 296 (511) Q Consensus 229 ~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~------------~~~ 296 (511) .+...|.|.+......+.....+.....+.+...+.|-.+++... .-.++....+ ..+ T Consensus 181 ---------dg~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~-~ls~e~~~~~k~~~~~~~~G~~nag 250 (518) T protein:vir:78 181 ---------DGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK-RLSPEAQQRLREQFDRAHAGSSNTG 250 (518) T ss_pred ---------CcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCC-CCCHHHHHHHHHHHHHHhcCcccCC Confidence 001246677766666555555555555555666677766665422 1112222111 123 Q ss_pred ceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 297 RVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAK 376 (511) Q Consensus 297 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~ 376 (511) +++.++++.+++.++.+.....+.+..+...+.|+..-++|..-.+..++.+...++... ..++..+|.- T Consensus 251 ~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~~----------~~f~~~tL~P 320 (518) T protein:vir:78 251 KTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM----------RAFYRDTMAI 320 (518) T ss_pred ceeEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHH----------HHHHHHHHHH Confidence 467777666665555444455566667777788888888886555443333332222211 1222233333 Q ss_pred HHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHH Q lcl|NC_018086. 377 RYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPW--ITDARQEVEKADAQRQ 452 (511) Q Consensus 377 ~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~--v~d~~~E~~ri~~E~~ 452 (511) ++..|...+...-.........+++..+.-+..|..+.++++.++ .|+++.-.++..++. ++++... ++-.... T Consensus 321 ~~~~ie~eln~~L~~~~~~~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~gD--~~~v~~n 398 (518) T protein:vir:78 321 PIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKAD--ELYANSA 398 (518) T ss_pred HHHHHHHHHHHhhcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCc--eeeeccc Confidence 333333322211111001112344555566778899999998887 478988888888765 3332211 1000000 Q ss_pred HHHHHHHhhccccccCCCCCCccccccCCCCCCcccccc-CCCCccccccc-cCCCCCCCC Q lcl|NC_018086. 453 KRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITT-TDPVAAKEQEK-AIQKKPKTD 511 (511) Q Consensus 453 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~~~~~ 511 (511) ... +.. .......|........++..++.++.++. ....+.++..+ -+.+.+++| T Consensus 399 -~~p--l~~-~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (518) T protein:vir:78 399 -LQP--LGA-TPDGAVEGEEAPAPKRPASTPVASLDQSPPASVPGLSPTNSDRSTDSGKTE 455 (518) T ss_pred -cee--ccc-ccccccCCCCCCCCCCCCcccccccccCccccCCCCCcccccccccccccc Confidence 000 000 00011111111111111221111111111 11111111111 223333443 No 138 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=98.73 E-value=7.9e-08 Score=59.51 Aligned_cols=442 Identities=11% Similarity=0.045 Sum_probs=190.6 Q ss_pred CCHHHHH-HHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhcc------Cc-ee Q lcl|NC_018086. 28 FDLRELI-TLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGE------PI-TE 99 (511) Q Consensus 28 ~~~~~l~-~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~------~~-~~ 99 (511) ++.+... .+-.+...-..+++.+.+|..-.--. .............++..+-+...++.+++.|++- || ++ T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~-~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 79 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELTLPYLID-DDISSRPNHKSLTVPWQSVGAKCCVTLAAKLMLAVLPPQTSFFKL 79 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHhhhcccC-CCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCccccc Confidence 4443322 23333333345556666665432100 0000111112234566777778888888777652 22 22 Q ss_pred cC-chh------------hH-------HHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEE Q lcl|NC_018086. 100 SG-DEK------------TI-------KAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLI 159 (511) Q Consensus 100 ~~-d~~------------~~-------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~ 159 (511) .. |.+ .. ..+...+..++|.....++.++..++|.|.+++ ++++ +++++-.+ ++ T Consensus 80 ~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~--~~~~---~~~~pl~~-y~ 153 (522) T protein:vir:10 80 QVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFM--GKDG---LKTFPLTR-YV 153 (522) T ss_pred cCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEE--cCCC---ceEEEcce-EE Confidence 21 110 11 123445777899999999999999999998665 4443 34444333 44 Q ss_pred EecCCCCCceEEEEEEEEEeec----------------CCcceEEEEEEEcCCcEEEEEEccCccccccccccccccccc Q lcl|NC_018086. 160 AYSADLDEEPVAAIYYNTVISD----------------ITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYE 223 (511) Q Consensus 160 v~d~~~~~~~~~~v~~~~~~~~----------------~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (511) +--+.. +.+...+|.++.... ..++....+++|+.- +.+.+.+.+....... ....... T Consensus 154 v~~d~~-G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v---~p~~~~~~~~~~~~~~-~~~~~~~ 228 (522) T protein:vir:10 154 INRDGD-GNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYV---KLDKSSGRWVWHQEAF-DKIIPDS 228 (522) T ss_pred EeeCCC-CCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEE---EeeccCCceEEEEccC-Ccccccc Confidence 433333 445556665543200 011122234444311 1111111111111111 1101111 Q ss_pred ceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCce Q lcl|NC_018086. 224 VHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRV 298 (511) Q Consensus 224 ~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~ 298 (511) ....++..+|++.++ ++.+|+|-..+..+-+..+|.+.-......+....|.+.+.-..... ..... ....+. T Consensus 229 ~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~-~~~l~-~~~~~~ 306 (522) T protein:vir:10 229 RSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTK-PATIA-KAGNGA 306 (522) T ss_pred ccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeccccccc-ccccc-CCCCcc Confidence 123466677776544 35689999999999999999998888889999999987763211111 00010 111233 Q ss_pred eeecCCCceeeeec--CCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHH----HH-HHHHHH Q lcl|NC_018086. 299 IVTDEDGMVKFITK--DVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKS----AV-KESKFR 371 (511) Q Consensus 299 i~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~----~~-~~~~~~ 371 (511) +.....+++..+.. ..+.......++.++..|...-... +..+....|++.+......+.... .+ ....+. T Consensus 307 ~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~~--~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~ 384 (522) T protein:vir:10 307 IVQGRPEDVAVIQVGKTADFSTAANMATAIEKRLLEAFLVM--NVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLI 384 (522) T ss_pred eecCCCccceeecccccccchHHHHHHHHHHHHHHHHHhhc--cCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHH Confidence 33333445554432 3455666777888887777653221 222223468887766544333322 11 123333 Q ss_pred HHHHHHHHHHHHHHHhcCCCccc--cc-cceeEEeCCCCCcCHHHHHHHH----HHHhccCChHHH---------HHhC- Q lcl|NC_018086. 372 KVLAKRYELVCSYLEFMNKAKDL--KP-YEVTPVFVRNLPQSYAELADMA----VKLRDMLPDETI---------INQF- 434 (511) Q Consensus 372 ~~l~~~~~li~~~~~~~~~~~~~--~~-~~i~i~f~~~~p~d~~e~a~~~----~~~~g~~s~et~---------~~~l- 434 (511) .-+.+++.++.+ .+.-... +. ....|.+..++-+ ++.++.+ +.++.++..+.+ +..+ T Consensus 385 Pli~r~~~il~r----~g~lP~~p~~~~~~~~v~~is~Lar--aq~~~~l~~~~~~i~~~~~p~~~~~~id~d~~~~~~a 458 (522) T protein:vir:10 385 PYLNRTLLVLQR----SNQIPKLPKDIVRPTIVAGVNALGR--GQDRESLTAFVGTIAQTLGPEALMQYLNPLEAIKRLA 458 (522) T ss_pred HHHHHHHHHHHh----cCCCCCCCccccccccccchhHHHH--HHHHHHHHHHHHHHHHhhCchhhhhcCCHHHHHHHHH Confidence 344444443322 2211111 11 1122333333332 2222222 222222222222 2221 Q ss_pred --CCCC-----CHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccc Q lcl|NC_018086. 435 --PWIT-----DARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAK 498 (511) Q Consensus 435 --~~v~-----d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (511) -+++ -.++|++.++.++++++..... ..+.+..++..... +..++...++.+. ++ .+ T Consensus 459 ~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~~~--~~~a~~~~~~~~~~--~~~~~~~~~~~~~--~~-~~ 522 (522) T protein:vir:10 459 AAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQSL--VDQAGQMTGSPLMD--PTKNPQLMDEEQP--PM-EE 522 (522) T ss_pred HHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHH--HHHHHHHhcccccC--ccccHHHHHHhCC--CC-CC Confidence 1222 1345555555444433322111 01111111111110 1111100000000 00 00 No 139 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=98.72 E-value=8.3e-08 Score=59.41 Aligned_cols=447 Identities=9% Similarity=-0.004 Sum_probs=189.3 Q ss_pred cCchhhHhhhhccCCCHHHHHHHHHHHHHHH----HHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHH Q lcl|NC_018086. 14 ITTNIRRKHFIRRNFDLRELITLAEMHSRSS----SAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTST 89 (511) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~----~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~ 89 (511) +.. ...+.+..+.+.+..+.++.++ .+++.+.+|..-.- ......+..+...++..+-+...++.++ T Consensus 1 m~~------~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~---~~~~~~~~~~~~~~~~dst~~~a~~~LA 71 (532) T protein:vir:99 1 MAE------VEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV---FPSATADGSTSYTTPWQSIGARGLNNLA 71 (532) T ss_pred Ccc------hhhccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcc---cCCCCCcchhhccccccchHHHHHHHHH Confidence 111 1112233444555555554433 34444555544321 1111222223345666777888888888 Q ss_pred hhhhcc------Cc-eecCch--------------hhH-------HHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeee Q lcl|NC_018086. 90 AYLAGE------PI-TESGDE--------------KTI-------KAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWI 141 (511) Q Consensus 90 ~~l~g~------~~-~~~~d~--------------~~~-------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~ 141 (511) +.|++- || ++...+ +.. ..+...+..++|.....++.++..++|.|.+++.. T Consensus 72 a~L~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~ 151 (532) T protein:vir:99 72 SKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPS 151 (532) T ss_pred HHHHHhhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEecc Confidence 777652 22 222111 111 22345567789999999999999999999998865 Q ss_pred CCC---CceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecC--------------CcceEEEEEEEcCCcEEEEEEc Q lcl|NC_018086. 142 DRN---KKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDI--------------TGHQIRTYEVYTEDLIYKFSTD 204 (511) Q Consensus 142 ~~~---g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~--------------~~~~~~~~~~~~~~~i~~~~~~ 204 (511) ++. ....++.++-.+ +++--+.. +.+...+|.+++.... .++....+++|+.-+ .... T Consensus 152 ~~~~~~~~~~f~~~pl~~-y~v~~d~~-G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~v~---~~~~ 226 (532) T protein:vir:99 152 TEQVEGQSNAPKLYKLHN-FVVERDAY-DNVLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHVY---RDPE 226 (532) T ss_pred cccccCcccceEEEEcCe-EEEeeCCC-CCeeeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEEEE---ecCC Confidence 432 334566655444 34433333 4455666544432110 112233445544211 0000 Q ss_pred cCcccccccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEee Q lcl|NC_018086. 205 DEREVYREIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQ 279 (511) Q Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~ 279 (511) ...+. +................++..+|++.++ ++.+|+|-..+..+-+..+|.+.-...........|.+.+. T Consensus 227 ~~~~~-~~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~ 305 (532) T protein:vir:99 227 AMVFR-SYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVN 305 (532) T ss_pred CCeeE-EEEeecCceecccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCceec Confidence 00000 0000000001111122235556766544 35689999999999999999888888888888888876543 Q ss_pred cCCCCccchhhhhhhhCceeeecCCCceeeee--cCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHH Q lcl|NC_018086. 280 GFDLSADSDSISNMKNDRVIVTDEDGMVKFIT--KDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQ 357 (511) Q Consensus 280 G~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~ 357 (511) -.. ......... ...+.+.-...++++.+. ...+.......++.++..|...-....+..-+....|++.+..... T Consensus 306 p~g-~~~~~~~~~-~~~g~~v~g~~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~d~~r~TAtEV~~r~~ 383 (532) T protein:vir:99 306 PNG-VTQIRRVAK-ANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRVTAEEIRYVAG 383 (532) T ss_pred ccc-ccchhhhcc-CCCcceecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCcccHHHHHHHHH Confidence 111 111111111 112223222223344443 2345666677777777776554322222122223357776655433 Q ss_pred HHHHHHH----H-HHHHHHHHHHHHHHHHHHHHHhcCCCccc--ccccee-EEeCCCCCcCHHHHHH----HHHHHhccC Q lcl|NC_018086. 358 PLENKSA----V-KESKFRKVLAKRYELVCSYLEFMNKAKDL--KPYEVT-PVFVRNLPQSYAELAD----MAVKLRDML 425 (511) Q Consensus 358 ~l~~k~~----~-~~~~~~~~l~~~~~li~~~~~~~~~~~~~--~~~~i~-i~f~~~~p~d~~e~a~----~~~~~~g~~ 425 (511) .+..... + ....+..-+.+++.++. ..+.-... +..... +.+-.++- .++.++ .+..++.+. T Consensus 384 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~----r~g~lP~~p~~~~~~~iv~~is~La--raq~~~~l~~~~~~laq~~ 457 (532) T protein:vir:99 384 ELEDTLGGVYSLLSQELQLPLVKILLKELQ----ATSKIPNLPKEAVEPAIATGLEALG--RGHDLNKLNVFIDYMIKLA 457 (532) T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHH----hcCCCCCCChhhcccceeecchHHH--HHHHHHHHHHHHHHHHhhc Confidence 3332221 1 11222333333333322 22221111 111111 22222221 222222 222222222 Q ss_pred C-------hHHHH----HhCCC----CCCHHHHHHHHHHHHHHHHH--HHHhhccccccCCCCCCccccccCCCCCCccc Q lcl|NC_018086. 426 P-------DETII----NQFPW----ITDARQEVEKADAQRQKRAD--IALQNFKQTSAVQGASTAAANKLDKNPANTST 488 (511) Q Consensus 426 s-------~et~~----~~l~~----v~d~~~E~~ri~~E~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (511) + ...++ ..+|. +-..++|++.++++++.+.. .+....+...+.. .... T Consensus 458 p~~~d~id~d~~~~~~a~~~GV~~~~i~r~~ee~~~~~~q~~~~~~~~~a~~~~~~~~~~~----~~~~----------- 522 (532) T protein:vir:99 458 GLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQA----AAAM----------- 522 (532) T ss_pred chhhhhCCHHHHHHHHHHHhCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cchh----------- Confidence 2 12222 22332 11235566555544332221 1111111111111 0000 Q ss_pred cccCCCCcccccc Q lcl|NC_018086. 489 ITTTDPVAAKEQE 501 (511) Q Consensus 489 ~~~~~~~~~~~~~ 501 (511) .+++.+..++ T Consensus 523 ---~~~~~~~~~~ 532 (532) T protein:vir:99 523 ---MQQQAGMPTQ 532 (532) T ss_pred ---HHhhcCCCCC Confidence 0000000000 No 140 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=98.72 E-value=8.5e-08 Score=59.36 Aligned_cols=479 Identities=12% Similarity=0.099 Sum_probs=194.2 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCH----HHHHHHHHHHHHHH----HHHHHHHHHhcCCCcccc-------cCC Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDL----RELITLAEMHSRSS----SAYGVLYDYYKGNHIAIQ-------SRT 65 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~l~~~~~~~~~~~----~~~~~~~~yY~G~~~~~~-------~~~ 65 (511) |.++.. ..+++--+. -+..+.. ..|.++++..+..+ ++|+.+++||........ +.. T Consensus 1 ~~~~~~----~~~~~~~~~----~~~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~ 72 (641) T protein:vir:94 1 MTIEMP----TPIIEDKES----AKRKLSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTT 72 (641) T ss_pred CccCCC----cccccCCcc----hhhcCCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhccccccccc Confidence 222211 111110000 0011222 22444444433322 456677777765332211 011 Q ss_pred cCccccccceeccchHHHHHHHHHhhhhcc--C----ceec----CchhhHHH----HHHHHhccChhHHHHHHHHHHhh Q lcl|NC_018086. 66 FDDTNKPNSKIVHNFPKLLVDTSTAYLAGE--P----ITES----GDEKTIKA----MQPVFKENYVTDVNSEEVKLSGI 131 (511) Q Consensus 66 ~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~--~----~~~~----~d~~~~~~----l~~~~~~n~~~~~~~~~~~~a~~ 131 (511) ........+|+..+.+...++.+++.+++- | +.+. +|++..+. +...+..+++........++++. T Consensus 73 ~~~~~~~r~ki~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~~ 152 (641) T protein:vir:94 73 GADDADWRHRINTGHTFEVVETLVAYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEAASIRDIFETYVRNLVL 152 (641) T ss_pred ccchhcccccccchhHHHHHHHHhhHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHhh Confidence 111111134788888888888887776642 1 1221 23333322 33445667888888899999999 Q ss_pred CCeEEEEeeeCC----------------------------CCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecC- Q lcl|NC_018086. 132 FGHCFEIHWIDR----------------------------NKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDI- 182 (511) Q Consensus 132 ~G~~~~~v~~~~----------------------------~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~- 182 (511) +|.+++.++++. ...+++..++|.+++ +|+..+.....++++.....+. T Consensus 153 ~g~~iv~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~--~dps~~~~~~~f~~~r~t~~t~~ 230 (641) T protein:vir:94 153 YGVSTYRLGWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVW--LDTSGGKNTGTFVRLRHTREELH 230 (641) T ss_pred cCceEEEeehhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhhee--ecCCCCcccccceehhhhHHHHH Confidence 999998886431 122455556666653 3443322111122221100000 Q ss_pred -------Cc-----------------ceEEEEE-EEcCC-cEEEEEE--ccCccccc--ccccccccccccceeccCCcc Q lcl|NC_018086. 183 -------TG-----------------HQIRTYE-VYTED-LIYKFST--DDEREVYR--EIPEELEIKDYEVHPNLLQKF 232 (511) Q Consensus 183 -------~~-----------------~~~~~~~-~~~~~-~i~~~~~--~~~~~~~~--~~~~~~~~~~~~~~~~~~g~i 232 (511) .+ .....+. +.+.. .+++|.. ...+...+ ..............-+.|..+ T Consensus 231 ~l~~eg~~~~d~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~ 310 (641) T protein:vir:94 231 ELVTSGYYDLDLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGS 310 (641) T ss_pred HHHhcCCCChhhcchhhcccccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeCCEEeecccccccCcC Confidence 00 0000000 00000 0111100 00000000 000000000111111234566 Q ss_pred ceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCCCce Q lcl|NC_018086. 233 PVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDEDGMV 307 (511) Q Consensus 233 Pvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~~ 307 (511) |++.++ ...+|+|....+.+.+..+|.+.-...+.+....+|.+.+....... +.. -....++++.....+++ T Consensus 311 Pf~~~r~~~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~-~~~-l~~~PG~ii~~~~~~~v 388 (641) T protein:vir:94 311 PFVTTTLLPDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILK-RED-VKAKPGAVFKVAQHGSL 388 (641) T ss_pred CeEEecceecCCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeeccccccc-cce-eeccCCcceeeCCCCcc Confidence 877554 34689999999999999999999999999999999988654321111 111 11223455666666667 Q ss_pred eeeecC-CCHHHHHHHHHHHHHHHHHHhCcccccccc---cc-CccHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Q lcl|NC_018086. 308 KFITKD-VNDKHIENIKNRAKLDIFSLSQTPDLVSKD---FT-AASGQALKAATQPLENKSAVKESKFRK-VLAKRYELV 381 (511) Q Consensus 308 ~~~~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~-~l~~~~~li 381 (511) +++... .+.......++.+...|....++..+..+. .+ +.|+..+..+......+.....+.|.. ++..+++-+ T Consensus 389 ~pl~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~ 468 (641) T protein:vir:94 389 QPIDMGRQDFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKV 468 (641) T ss_pred eeecCCccccchhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 776432 233333445566665655555443332211 12 347777777777777777666666663 565566555 Q ss_pred HHHHHhcC----------------CCccccccceeEEeCCCCCcCHH---HHHHHHHHHh------ccCCh--------- Q lcl|NC_018086. 382 CSYLEFMN----------------KAKDLKPYEVTPVFVRNLPQSYA---ELADMAVKLR------DMLPD--------- 427 (511) Q Consensus 382 ~~~~~~~~----------------~~~~~~~~~i~i~f~~~~p~d~~---e~a~~~~~~~------g~~s~--------- 427 (511) +.++.... .-......++...|.- +|...+ +.++.+..+. |..|. T Consensus 469 ~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~~~~i-v~l~~~q~~~~~~~i~~l~~~~~~~a~~P~v~d~~d~~~ 547 (641) T protein:vir:94 469 FSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLHYPYKF-LALGANYVVERERMVTDLLQLLDISGRVPQIGQSLDYAL 547 (641) T ss_pred HHHHHHhccchhhhhhhchhhhcccCCCCCccceeeeeeE-eecchhHHHHHHHHHHHHHHHHHHhhcChhhhhcCCHHH Confidence 55443211 1111112223333321 233322 2233332222 11221 Q ss_pred --HHHHHhCCCCCCHH-------HH--HH--HHHHHHHHHHHHHHh-----------h-----ccccccCCCCCCccccc Q lcl|NC_018086. 428 --ETIINQFPWITDAR-------QE--VE--KADAQRQKRADIALQ-----------N-----FKQTSAVQGASTAAANK 478 (511) Q Consensus 428 --et~~~~l~~v~d~~-------~E--~~--ri~~E~~~~~~~~~~-----------~-----~~~~~~~~~~~~~~~~~ 478 (511) +.+++.++. .+|. .+ -. +.++.+++....+.. . +....+..|-.+ + + T Consensus 548 ~~~~~~~~~g~-~~p~~~ir~~~~~~~~~~~~~~~~q~~~~~~a~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~--~-~ 623 (641) T protein:vir:94 548 ILEDLLRQMRF-TDPMRYIKKAEAPPAAPPIAPAEPGALPPEMMNSVGGGLNDQAIAGMTPEDVSDLASRIGIDT--S-D 623 (641) T ss_pred HHHHHHHHhCC-CCchhhccCccCchhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHHHHHHhhcCCc--h-h Confidence 222333331 1111 00 00 111001110000000 0 000000000000 0 0 Q ss_pred cCCCCCCccccccCCCCcc Q lcl|NC_018086. 479 LDKNPANTSTITTTDPVAA 497 (511) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~ 497 (511) ...+.--+++.+- ..+.- T Consensus 624 ~~~~~~~~~~~~~-~~~~~ 641 (641) T protein:vir:94 624 VAPEAMAAATQQI-TSGAL 641 (641) T ss_pred hhHHHHhcccccc-cccCC Confidence 0000000000000 00000 No 141 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=98.70 E-value=1e-07 Score=58.96 Aligned_cols=424 Identities=10% Similarity=0.049 Sum_probs=176.7 Q ss_pred hhh-hccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCc-cccccceeccchHHHHHHHHHhhhhccCce Q lcl|NC_018086. 21 KHF-IRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDD-TNKPNSKIVHNFPKLLVDTSTAYLAGEPIT 98 (511) Q Consensus 21 ~~~-~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~-~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~ 98 (511) -++ ..+.++.-.+ .+..+ .....|.+..... ...... ......-...+.....|+..++-+-.-|+. T Consensus 1 ~~~~~~~~~~~p~~-------~e~~~---~~~~~~~~~~~~~-~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~ 69 (518) T protein:vir:10 1 MLLANGQTLSAPAM-------AELSP---QMQDSYYYAPAVG-MQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVK 69 (518) T ss_pred CcccCceeecCchh-------hhhhh---hhhcccccccccc-eecccccchhhHHHhhhHHHHHHHHHHHHhhccCceE Confidence 000 0000000000 00001 1112222211000 000000 000000011234455666666655555655 Q ss_pred e---cCch---hhHHHHHHHHhc-cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCC Q lcl|NC_018086. 99 E---SGDE---KTIKAMQPVFKE-NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDE 167 (511) Q Consensus 99 ~---~~d~---~~~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~ 167 (511) + ..+. .....+..++.+ |. .......+..+.+.+|.+|+++..+.+|++ .+..++|..+.+..+..... T Consensus 70 l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~~~~~~ 149 (518) T protein:vir:10 70 CMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGR 149 (518) T ss_pred EEEEcCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEcCCCCE Confidence 3 1111 112233444433 32 234566777888999999999998888886 47888999888777654321 Q ss_pred ceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchh Q lcl|NC_018086. 168 EPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDF 247 (511) Q Consensus 168 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~ 247 (511) +.|+.... .+... ....+.++.++|++.-. . .+...|.|.+ T Consensus 150 -----~~y~~~~~--~~~~~-~~~~~~~~eViHir~~s----------------------~---------dg~~~G~spi 190 (518) T protein:vir:10 150 -----YEYYFQAG--AGVGT-QLVSFADDEVVPIRFFN----------------------P---------DGLERGLSLM 190 (518) T ss_pred -----EEEEEEec--CCccc-eEEEecCCcEEEecCCC----------------------C---------CcccccccHH Confidence 11111111 11110 11123444444442100 0 0012477777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhh----hh--------hCceeeecCCCceeeeecCCC Q lcl|NC_018086. 248 EAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISN----MK--------NDRVIVTDEDGMVKFITKDVN 315 (511) Q Consensus 248 ~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~----~~--------~~~~i~~~~~~~~~~~~~~~~ 315 (511) ......+.....+.....+.++..+.|-.+++-... -.++.... +. .++++.++++.+.+.++.... T Consensus 191 ~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~-ls~e~~~~~k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~ 269 (518) T protein:vir:10 191 ESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKR-LSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAV 269 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCC-CCHHHHHHHHHHHHHHhcCccccCcceEcCCCceEEEccCChh Confidence 666665555555555555555666667666553221 11121111 11 134666776666655554444 Q ss_pred HHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc Q lcl|NC_018086. 316 DKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLK 395 (511) Q Consensus 316 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~ 395 (511) ...+.+..+...+.|+..-++|..-.+.....+...++... ..++..+|.-+++.|...++..-...... T Consensus 270 D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~----------~~f~~~tL~P~l~~ie~~ln~~L~~~~~~ 339 (518) T protein:vir:10 270 EMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM----------RAFYRDTMAIPIARIQSAMDKYVGQYWVR 339 (518) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcccccC Confidence 45566667777888888888886555443333332222211 12223333333333333222111111111 Q ss_pred ccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCC Q lcl|NC_018086. 396 PYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPW--ITDARQEVEKADAQRQKRADIALQNFKQTSAVQGA 471 (511) Q Consensus 396 ~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~--v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 471 (511) ...+++....-+..|..+.++++.++ .|+++.-.++..++. ++++... ++-.... ... +... ......++ T Consensus 340 ~~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD--~~~~~~n-~~p--l~~~-~~~~~~g~ 413 (518) T protein:vir:10 340 KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKAD--ELYANSA-LQP--LGAT-PDGAVEGE 413 (518) T ss_pred CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCC--eeeeccc-cee--cccc-cccccCCC Confidence 22345555566678889999988887 478998888888764 2322111 1000000 000 0000 00111111 Q ss_pred CCccccccCCCCCCcccccc-CCCCccccccc-cCCCCCCCC Q lcl|NC_018086. 472 STAAANKLDKNPANTSTITT-TDPVAAKEQEK-AIQKKPKTD 511 (511) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~~~~~ 511 (511) .......++..+..+..++. +...+.++..+ -+.+..++| T Consensus 414 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (518) T protein:vir:10 414 EAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTE 455 (518) T ss_pred CCCCCCCCCccccccccccccccCCCCCcccccccccccccc Confidence 11111111221111111111 11111111111 122333333 No 142 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=98.69 E-value=1.1e-07 Score=58.81 Aligned_cols=469 Identities=9% Similarity=0.017 Sum_probs=181.0 Q ss_pred CccchhhcccccC--chhhHhhhhccC--CCHHHHHHHHHH----HHHHHHHHHHHHHHhcCCCcccccCCcCccccccc Q lcl|NC_018086. 3 IPNGQINAGDIIT--TNIRRKHFIRRN--FDLRELITLAEM----HSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNS 74 (511) Q Consensus 3 ~~~~~~~~~~~~~--~~~~~~~~~~~~--~~~~~l~~~~~~----~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ 74 (511) |.=++- |.+-. +....+ +.++. .....|.+.+.. |...+.+...+.+||.|..+. ..+..++ .. T Consensus 1 ~~~~~~--~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~g--rs 72 (763) T protein:vir:95 1 MEQNTD--SMVPLPDPSQATK-LTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKA---KPPKVKG--RS 72 (763) T ss_pred CCcCcc--CcCCCccccchhc-CCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccC---cccccCC--Cc Confidence 111110 11100 001111 12211 112333333322 333344444555554444321 1122222 23 Q ss_pred eeccchHHHHHHHHHhhh----hccCc--ee----cCchhhHH----HHHH-HHhccChhHHHHHHHHHHhhCCeEEEEe Q lcl|NC_018086. 75 KIVHNFPKLLVDTSTAYL----AGEPI--TE----SGDEKTIK----AMQP-VFKENYVTDVNSEEVKLSGIFGHCFEIH 139 (511) Q Consensus 75 ri~~n~~k~ivd~~~~~l----~g~~~--~~----~~d~~~~~----~l~~-~~~~n~~~~~~~~~~~~a~~~G~~~~~v 139 (511) +++.+-.+..|+.....| ++.+- .+ .+|.+..+ .+.- ++..|+-......++++++.+|.|++.| T Consensus 73 ~vv~~~v~~~ve~~~~~l~~~f~~~~~~~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~ 152 (763) T protein:vir:95 73 QVQPKLVRRQAEWRYSALTEPFLGSNKLFKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRV 152 (763) T ss_pred cccCHHHHHHHHHHHHHHHHhhcCCCcEEEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEE Confidence 566666666666554433 33322 22 12333222 2332 4555666666778999999999999888 Q ss_pred eeCC---------------------------------------------------------------------------- Q lcl|NC_018086. 140 WIDR---------------------------------------------------------------------------- 143 (511) Q Consensus 140 ~~~~---------------------------------------------------------------------------- 143 (511) |++. T Consensus 153 ~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (763) T protein:vir:95 153 GWNREIRKEKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEV 232 (763) T ss_pred eeeeeeeeeeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEE Confidence 7630 Q ss_pred --CCceEEEEEcccceEEEecCCCCCc--eEEEEEEEEEeecC---------------C--------------------- Q lcl|NC_018086. 144 --NKKHRFKAVSPMNCLIAYSADLDEE--PVAAIYYNTVISDI---------------T--------------------- 183 (511) Q Consensus 144 --~g~~~i~~~~p~~~~~v~d~~~~~~--~~~~v~~~~~~~~~---------------~--------------------- 183 (511) .+.|+|..++|.++++-.+-..+.. ..++.+++....+. . T Consensus 233 ~~k~~p~ie~V~p~d~~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (763) T protein:vir:95 233 PLANHPTVEMLNPENIIIDPSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQIS 312 (763) T ss_pred EecCceEEEeecHHHheecCCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCC Confidence 1245677789998875332111111 11122222111100 0 Q ss_pred ---cceEEEEEEEcCCcEEEEEEccCcc-cc-cccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHH Q lcl|NC_018086. 184 ---GHQIRTYEVYTEDLIYKFSTDDERE-VY-REIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSL 253 (511) Q Consensus 184 ---~~~~~~~~~~~~~~i~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l 253 (511) .+.+...++|.. +...+.+. .. ............+..|.+.+.+|++.++ ...+|.|.+..++++ T Consensus 313 d~~~~~V~v~E~y~~-----~d~~gdg~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~~~~d~ 387 (763) T protein:vir:95 313 DPMRKRVVAYEYWGF-----WDIEGNGVLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAELLGDN 387 (763) T ss_pred CcccceEEEEEeeee-----eccCCcceeEEEEEEEEcCeeeecccccccCCCcCEEEecceeecCcccCCchHHHhhHH Confidence 011111111110 00000000 00 0000011111122233444666766443 355799999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCceeEee-cCCCCccchhhhhhhhCceeeecCCCce----eeeecCCCHHHHHHHHHHHHH Q lcl|NC_018086. 254 IDAYNLAVSDSVNDIAYWNDAYLWLQ-GFDLSADSDSISNMKNDRVIVTDEDGMV----KFITKDVNDKHIENIKNRAKL 328 (511) Q Consensus 254 ~d~~~~~~s~~~~~~~~~~~p~l~~~-G~~~~~~~~~~~~~~~~~~i~~~~~~~~----~~~~~~~~~~~~~~~~~~l~~ 328 (511) ++.+|..++.+.+++...+.|.+.+. |. .+.. .......+.++.+..+.++ .+...+.........+..+.. T Consensus 388 Qr~~N~~~~~~~d~l~~~~~~~~~v~~ga-v~~~--d~~~~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~~~~ 464 (763) T protein:vir:95 388 QAVLGAVMRGMIDLLGRSANGQRGMPKGM-LDAL--NSRRYREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATLQNQ 464 (763) T ss_pred HHHHHHHHHHHHHHHHhhcCCcEEeeccc-ccch--hhhcccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHHHHH Confidence 99999999999999999888866443 33 2211 1112233455555544332 222222223455555666667 Q ss_pred HHHHHhCcccccccccc---CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--------cc--- Q lcl|NC_018086. 329 DIFSLSQTPDLVSKDFT---AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAK--------DL--- 394 (511) Q Consensus 329 ~i~~~s~~p~~~~~~~~---~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~--------~~--- 394 (511) .+-..|+++..+-+..+ +.++.++...............+.|..+++.+++.++.++....... .+ T Consensus 465 ~~e~~TGv~~~~~G~~~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v 544 (763) T protein:vir:95 465 EAESLTGVKAFAGGVTGESYGDVAAGIRGVLDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTI 544 (763) T ss_pred HHHHhhCcchhhcCcCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCCccccc Confidence 77778888876654332 22333343333333444445556666676666666665544321110 00 Q ss_pred ------cccceeEEeCCCCCcCH-HHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH--HHHHhhcccc Q lcl|NC_018086. 395 ------KPYEVTPVFVRNLPQSY-AELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRA--DIALQNFKQT 465 (511) Q Consensus 395 ------~~~~i~i~f~~~~p~d~-~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~--~~~~~~~~~~ 465 (511) ...+|.|.-. +.+. .+.+..+..+...+. +..+ +. -...+..+..... .......... T Consensus 545 ~~~~~~~~~DV~V~~~---~as~~~q~~~~l~~ll~~l~--------~~~~-~~-~~~~il~~~~d~~~~~~~~~~lr~~ 611 (763) T protein:vir:95 545 KREDLKGNFDLEVDIS---TAEVDNQKSQDLGFMLQTIG--------PNVD-QQ-ITLNILAEIADLKRMPKLAHDLRTW 611 (763) T ss_pred cHHHhcCCcceEEecc---cchHHHHHHHHHHHHHHHhc--------cccC-hH-HHHHHHHHHHhhhchhhhHHHHHhc Confidence 0112222211 1111 122222222211110 0000 00 0000000000000 0000000000 Q ss_pred ccCCCCCCccccccCCCCCCccccccCCCCcccccc--ccCC-CCCCCC Q lcl|NC_018086. 466 SAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQE--KAIQ-KKPKTD 511 (511) Q Consensus 466 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-~~~~~~ 511 (511) .+..+. ...... ....++. ..+... ...+ ...+.. T Consensus 612 ---q~~~d~----~~q~qa---qle~~~~-q~e~~~~~akaq~~qaqa~ 649 (763) T protein:vir:95 612 ---QPQPDP----VQEQLK---QLAVEKA-QLENEELRSKIRLNDAQAQ 649 (763) T ss_pred ---CCCccc----hhhhHH---HHHHHHH-HHHHHHHHHHHHHHHHHHH Confidence 000000 000000 0000000 000000 0000 000000 No 143 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=98.58 E-value=2.4e-07 Score=56.85 Aligned_cols=439 Identities=8% Similarity=-0.025 Sum_probs=169.1 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccch Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNF 80 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~ 80 (511) -.|.|+-....--=.--+..... -+++. .++ .|.. .......| |.++....+. ........--..++ T Consensus 32 ~~~~~~~~p~~~~~~~~~~~~~~----~d~~~--~~~----~r~g-~~~~~~~~-g~~~~~epp~-d~~~l~~l~~~np~ 98 (648) T protein:vir:79 32 ESMQLGEAPGAMPKGGGGGGSAK----RDPKM--SLV----KRIG-LAIMDGGG-GGRDFEEPEF-DFNEITSAYNTEGY 98 (648) T ss_pred cccccCCCccccCCCCccccccc----ccchh--HHH----HHhH-HHHHhhcC-CccccccCCc-CHHHHHHHHhcChH Confidence 22233322111000000000000 01110 001 1110 11111222 2222211111 00000011112566 Q ss_pred HHHHHHHHHhhhhccCceecCchh-hHHH--HHH-HHhcc---ChhHHHHHHHHHHhhCCeEEEEeeeCCCCceE----- Q lcl|NC_018086. 81 PKLLVDTSTAYLAGEPITESGDEK-TIKA--MQP-VFKEN---YVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHR----- 148 (511) Q Consensus 81 ~k~ivd~~~~~l~g~~~~~~~d~~-~~~~--l~~-~~~~n---~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~----- 148 (511) .+..|+..+.-+.+-|+.+..+++ .... ... ...-| ........+..+.+.+|.||+.+-.+.+|.+- T Consensus 99 V~~aI~iia~~ia~l~~~i~~~~~~~~~~~~~~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~ 178 (648) T protein:vir:79 99 VRQAVDKYIEMMFKADWDFVSKNPNAVEYIRMRFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNV 178 (648) T ss_pred HHHHHHHHHHHHhhCcceEEecCCccchhhHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchhhhh Confidence 777888888888888876643221 1111 111 12222 34456777888899999999998888777421 Q ss_pred -----------EEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccccccc Q lcl|NC_018086. 149 -----------FKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEEL 217 (511) Q Consensus 149 -----------i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 217 (511) +..++|..+.+..++.. .+..|......++.. ..|.++. T Consensus 179 ~~~~~~~~v~~l~pl~p~~v~v~~d~~g------~~~~Y~y~~~g~~~~----~~~~~~d-------------------- 228 (648) T protein:vir:79 179 MGVGDSMPVAGYFPLNLASMKVKRDKFG------MIKGWQQEQEGQDKP----QKFKPED-------------------- 228 (648) T ss_pred hhhccccceeeeEeecCceeEEEEcCCC------ceeeeEEEecCCcee----EEecCcc-------------------- Confidence 11123333222221110 000000000000000 0011222 Q ss_pred ccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhh- Q lcl|NC_018086. 218 EIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSIS- 291 (511) Q Consensus 218 ~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~- 291 (511) |++|+ +..+|.|.+......+.............+...+.|-.+++-.......+... T Consensus 229 ----------------IIHik~~~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~~~~~e~~k~ 292 (648) T protein:vir:79 229 ----------------IVHIYYKREKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLEQEGFGAEEG 292 (648) T ss_pred ----------------EEEEccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCccchHHHHH Confidence 34443 22358888887777777666666566666777777877665211111111111 Q ss_pred ---hhh-hCc-eeeecCCCceeeeecC--CC--HHHHHHHHHHHHHHHHHHhCccccccccccC---ccHHHHHHHHHHH Q lcl|NC_018086. 292 ---NMK-NDR-VIVTDEDGMVKFITKD--VN--DKHIENIKNRAKLDIFSLSQTPDLVSKDFTA---ASGQALKAATQPL 359 (511) Q Consensus 292 ---~~~-~~~-~i~~~~~~~~~~~~~~--~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~---~Sg~Ai~~~~~~l 359 (511) .+. ..+ +.......+.+.+..+ .. ...+....+...+.|+..-++|....+..+. .++.+....+.. T Consensus 293 ~~e~~~~~~~~~~i~gg~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~ss~stae~~~~~~~~- 371 (648) T protein:vir:79 293 EVDLVRGEVENMDVEGGMVTTERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVSELMMGRGGTASRSTGDNLSSDFKD- 371 (648) T ss_pred HHHHHHHhcccccccccccccceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCCHhHcccCCCccchHHHHHHHHHHH- Confidence 111 111 2112222222222221 12 2245666677788899999999765553322 223333322221 Q ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCC Q lcl|NC_018086. 360 ENKSAVKESKFRKVLAKRY-ELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPW 436 (511) Q Consensus 360 ~~k~~~~~~~~~~~l~~~~-~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~ 436 (511) .+.-.+..+...+...+ +.++ +..........+ ..+++.|.+....|....++.+.++ +|++|...++.+++. T Consensus 372 --~i~~l~~~i~~~le~~~~~~ll-~e~~l~~~l~~d-~~ieF~~~~Llr~D~~~~a~~~~~l~~~GilT~NEaR~~lGl 447 (648) T protein:vir:79 372 --RIKALQKVMATFINEFMVKEIL-MEGGFDPVLNPD-DKVEFRFNEIDMDSKIKLENQAVFLYEHNAISEDEMRELIGR 447 (648) T ss_pred --HHHHHHHHHHHHHHHHHHHHHh-hhhhcccccccc-ceEEEeecccchhhHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 12222222222222211 1111 000011000011 2366777777777888888887776 589999999998865 Q ss_pred CCCHHHH-HHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 437 ITDARQE-VEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 437 v~d~~~E-~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) -+-++.+ -..+..+.-. ..... ......+. ..+ +.....+..+.............+++.+-+|++. T Consensus 448 pPi~~g~~~~~l~~~~~~----~~~~~-~~~~~~~~--~~~-~~~~~a~~eg~~~e~~~~~~~~~~~g~~~~~~~~ 515 (648) T protein:vir:79 448 DPVDDGEGRAKMHLQMVT----IAQAT-ALAALAPT--PAG-GSSASASGDKKKKATDNKTKPTNQHGTKTSPKKQ 515 (648) T ss_pred CCCCCCCCcccccccccc----chhcc-ccccCCCC--CCC-CCCCCccccccccccCCCCCCCCCCCcCCCCccc Confidence 3221110 0011110000 00000 00000000 000 0010111111111111111112333334444545 No 144 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=98.53 E-value=3.5e-07 Score=55.96 Aligned_cols=400 Identities=9% Similarity=0.001 Sum_probs=172.4 Q ss_pred Cccch-hhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchH Q lcl|NC_018086. 3 IPNGQ-INAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFP 81 (511) Q Consensus 3 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~ 81 (511) |++.+ .++-..-.. ....... .....+..+.-+... ........ - +...-. T Consensus 1 M~~~~~~f~~~~r~~------~~~~~~~---------------~~~~~~~~~~g~~~~----~~~v~~~~-a--l~~~~v 52 (429) T protein:vir:10 1 MDSVKKFFNFEKRQT------SQVIELN---------------KDDEKLLEWLGISPS----TISVKGKN-A--LKVATV 52 (429) T ss_pred CchhhhhhcccccCc------ccccccC---------------CChHHHHHHhcCCCC----cceechhh-h--hccHHH Confidence 33331 211000000 0000000 000111122211110 00000000 0 112333 Q ss_pred HHHHHHHHhhhhccCceec--Cch---h-hHHHHHHHHh--cc---ChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EE Q lcl|NC_018086. 82 KLLVDTSTAYLAGEPITES--GDE---K-TIKAMQPVFK--EN---YVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RF 149 (511) Q Consensus 82 k~ivd~~~~~l~g~~~~~~--~d~---~-~~~~l~~~~~--~n---~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i 149 (511) ...|+..++-+-+-|+.+- .++ . ....+..+|. -| ........+..+.+.+|.+|+++..+..|++ .+ T Consensus 53 ~~~i~~ia~~ia~l~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L 132 (429) T protein:vir:10 53 FACIKILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQAL 132 (429) T ss_pred HHHHHHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEE Confidence 4455555555555666641 111 1 1122444443 22 2345667788889999999999988888886 67 Q ss_pred EEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccC Q lcl|NC_018086. 150 KAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLL 229 (511) Q Consensus 150 ~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (511) ..++|..+.+..++.... ....+.|+... ..|.. ..+.++.++|+.... T Consensus 133 ~~i~~~~v~v~~~~~~~~--~~~~~~~~~~~-~~g~~----~~~~~~evih~~~~~------------------------ 181 (429) T protein:vir:10 133 WPIDASKVTVYIDDVGLL--NSKTKMWYVVN-TGGQQ----RVLKPEEILHFKNGI------------------------ 181 (429) T ss_pred EEEcCceeEEEEcCcccc--cccceEEEEEc-cCCeE----EEEccccEEEecCCC------------------------ Confidence 788888887766543211 11111111111 11111 123344443332100 Q ss_pred CccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhh----hh--------hCc Q lcl|NC_018086. 230 QKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISN----MK--------NDR 297 (511) Q Consensus 230 g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~----~~--------~~~ 297 (511) ..+...|.|.+..+...++.......-....++..+.|-.+++... .-.++.... +. .++ T Consensus 182 -------~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~-~l~~e~~~~~~~~~~~~~~g~~n~~~ 253 (429) T protein:vir:10 182 -------TLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG-DLNEDAKKVFRENFESMSSGLQNSHR 253 (429) T ss_pred -------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC-CCCHHHHHHHHHHHHHHhccccccCc Confidence 0012247777777777666666555555555666666766655322 111111111 11 235 Q ss_pred eeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 298 VIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKR 377 (511) Q Consensus 298 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~ 377 (511) ++.++++.+++.+........+....+...+.|+..-++|....+.....+...++.. ....+..+|.-+ T Consensus 254 ~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~----------~~~f~~~~l~P~ 323 (429) T protein:vir:10 254 IALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQ----------QQQFYTDTLQAT 323 (429) T ss_pred eeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH----------HHHHHHHHHHHH Confidence 6667666666555544444555666778888899999998765543332222222211 112233344444 Q ss_pred HHHHHHHHHhcCCC-ccc-cccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHH Q lcl|NC_018086. 378 YELVCSYLEFMNKA-KDL-KPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQK 453 (511) Q Consensus 378 ~~li~~~~~~~~~~-~~~-~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~ 453 (511) ++.|...+...--. ... ....+++.+..-+..|..+.++++.++ .|+++...++.+++.-+.+. .++.-.-... T Consensus 324 ~~~ie~~ln~kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~g--gD~~~~~~n~ 401 (429) T protein:vir:10 324 LTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAG--GDRLLVNGNM 401 (429) T ss_pred HHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCeeeecccc Confidence 44443333321100 011 112344555566677899999999888 47899888888876532111 1111000000 Q ss_pred HHHHHHhhccccccCCCCCCccccccCCCCCCccc Q lcl|NC_018086. 454 RADIALQNFKQTSAVQGASTAAANKLDKNPANTST 488 (511) Q Consensus 454 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (511) . .++..++ ....++.... + ..+.++.+. T Consensus 402 -~--~~d~~~~-~~~k~g~~~~--~-~~~~~~e~~ 429 (429) T protein:vir:10 402 -L--PIDMAGQ-AYLKGGDTNG--E-VSKEGNEGN 429 (429) T ss_pred -c--chhhccc-cccCCCCCCC--C-CCCCCCCCC Confidence 0 0000000 0000000000 0 000000000 No 145 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=98.50 E-value=4.2e-07 Score=55.54 Aligned_cols=417 Identities=10% Similarity=-0.012 Sum_probs=175.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCC------cccccCCcCccccc-cce--eccchHHHHHHHHHhhhhccCceec---C Q lcl|NC_018086. 34 ITLAEMHSRSSSAYGVLYDYYKGNH------IAIQSRTFDDTNKP-NSK--IVHNFPKLLVDTSTAYLAGEPITES---G 101 (511) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~yY~G~~------~~~~~~~~~~~~~~-~~r--i~~n~~k~ivd~~~~~l~g~~~~~~---~ 101 (511) +.+...+..+..... ..-.+++. ...........+.. ... +...=....|+..++-+-+-|+.+- + T Consensus 1 Mg~~~~l~~r~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~ 78 (457) T protein:vir:13 1 MGFWSALFGRGHSPA--LDGIEARAWEPYDPSIYNLGAVAASGETVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRG 78 (457) T ss_pred Cchhhhhhccccccc--ccccccccccccchHHHhhcccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEecC Confidence 222222111100000 00000000 00000000000000 000 1111122345555555555676642 1 Q ss_pred c--h-hhHHHHHHHHhc--cC--hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEE Q lcl|NC_018086. 102 D--E-KTIKAMQPVFKE--NY--VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAI 173 (511) Q Consensus 102 d--~-~~~~~l~~~~~~--n~--~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v 173 (511) + . .....+..++.. |. .......+..+.+.+|.||+.+..+ .|++ .+..++|..+.++.+...... .... T Consensus 79 ~~~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~-~~~~ 156 (457) T protein:vir:13 79 GSRKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMVDGLR-RKVF 156 (457) T ss_pred CcccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecCCCcc-ceeE Confidence 1 1 111234444432 22 2356677788889999999888655 4554 567788888876654332211 1111 Q ss_pred EEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHH Q lcl|NC_018086. 174 YYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSL 253 (511) Q Consensus 174 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l 253 (511) +.|... ..+.. .....|.++.++++..-. ..+.-.|.|.+..+... T Consensus 157 ~~y~~~--~~~~~-~~~~~~~~~diih~~~~~-------------------------------~~~~~~G~s~i~~~~~~ 202 (457) T protein:vir:13 157 EAYDID--ADGNE-VLLGWFTPRDVLHIPGMM-------------------------------LPGDFVGCSPISYARES 202 (457) T ss_pred EEEEEe--cCCce-eeEEeeCccceEEecCCC-------------------------------CCCccccccHHHHHHHH Confidence 222211 11211 122335555555543100 00112577888777776 Q ss_pred HHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhh------------hCceeeecCCCceeeeecCCCHHHHHH Q lcl|NC_018086. 254 IDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMK------------NDRVIVTDEDGMVKFITKDVNDKHIEN 321 (511) Q Consensus 254 ~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~------------~~~~i~~~~~~~~~~~~~~~~~~~~~~ 321 (511) +.....+..-....+...+.|-.+++-.. .-.++....++ .++++.++++.+.+.+..+.....+.+ T Consensus 203 i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~ls~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e 281 (457) T protein:vir:13 203 IGLALAAQKYGSKFFANGAMPGAVVEVPG-TMSEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQ 281 (457) T ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEcCC-CCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHH Confidence 66666655555556666677766665322 11222222211 135677777767666655555555666 Q ss_pred HHHHHHHHHHHHhCcccccccccc--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ccccccc Q lcl|NC_018086. 322 IKNRAKLDIFSLSQTPDLVSKDFT--AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKA-KDLKPYE 398 (511) Q Consensus 322 ~~~~l~~~i~~~s~~p~~~~~~~~--~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~-~~~~~~~ 398 (511) ..+.....|+..-++|....+... +.++..++.+... .+..+|.-+++.|...+...-.. ....... T Consensus 282 ~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~----------f~~~tl~P~~~~ie~~ln~~L~~~~~~~~~~ 351 (457) T protein:vir:13 282 TRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIA----------FTMFSLRPWLERIEAGFNRLLFAETADRFRF 351 (457) T ss_pred HHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHH----------HHHHHHHHHHHHHHHHHHHhhcCccccCcee Confidence 677888889998899875554332 2222322222111 22223333333333322211111 1112233 Q ss_pred eeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC--CHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCc Q lcl|NC_018086. 399 VTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWIT--DARQEVEKADAQRQKRADIALQNFKQTSAVQGASTA 474 (511) Q Consensus 399 i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~--d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 474 (511) +++.++.-+..|..+.++++.++ .|+++.-.++.+++.-+ +...+ ..- +........+.....++ T Consensus 352 i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~~g~~d--~~~------~~~n~~~~~~~~~~~~~--- 420 (457) T protein:vir:13 352 VKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGE--KYR------VPLNLGEVGEEPEPEPA--- 420 (457) T ss_pred EEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCccc--cee------ecccccccccccccccc--- Confidence 55666676777889999998887 58899888888876532 22111 110 00000000000000000 Q ss_pred cccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 475 AANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) ...++.++....+....++.+....+++-+.-.+-| T Consensus 421 -~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~~~~~~ 456 (457) T protein:vir:13 421 -PAPPAIEPPAEEPDEEPEPEGKPDDEGATEEDDEDD 456 (457) T ss_pred -CCCCCCCCCccccCCCCCCCCCCccccCCCCccccc Confidence 000011111111111111222222222111111112 No 146 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=98.46 E-value=5.8e-07 Score=54.78 Aligned_cols=431 Identities=9% Similarity=0.033 Sum_probs=160.3 Q ss_pred ccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHH-HHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhh Q lcl|NC_018086. 13 IITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVL-YDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAY 91 (511) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~-~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~ 91 (511) ..+ ..+.++.+.+.......... .+.+ ..++.+ . . .+...........-..++...+|+..++. T Consensus 1 ~~~----------~~~~i~s~~~~~~i~~~~~~-s~~~~~~~~~~-~--~-~pp~~~~~la~l~~~n~~v~scI~~ia~~ 65 (542) T protein:vir:41 1 MFN----------YHLSIRSLEKYKAIKREEVE-SQALGETRFEE-Y--V-EPKVNPLVLLSLLQVNPYHASACSIKAND 65 (542) T ss_pred Ccc----------ccccccccccchhhhhcccc-ccccccccCCc-c--c-cCCCCHHHHHHHHhhcHHHHHHHHHHHHH Confidence 111 11121111110000000000 0000 011110 0 0 00001111111111245667888888888 Q ss_pred hhccCceecCchhhHHHHHHHHhcc--ChhHHHHHHHHHHhhCCeEEEEeeeCCCCceE-EEEEcccceEEEecCCCCCc Q lcl|NC_018086. 92 LAGEPITESGDEKTIKAMQPVFKEN--YVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHR-FKAVSPMNCLIAYSADLDEE 168 (511) Q Consensus 92 l~g~~~~~~~d~~~~~~l~~~~~~n--~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~-i~~~~p~~~~~v~d~~~~~~ 168 (511) +.+-|+.+..++. ..+..++-.. ........+..+.+.+|.||+.+..+..|++. +..++|..+.+..|... T Consensus 66 IA~l~~~~~~~~~--~~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~~d~~~--- 140 (542) T protein:vir:41 66 IIRTGYILEGDDE--GVVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVHKDGSR--- 140 (542) T ss_pred HhhCceeeecccc--hhhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEEEcCCe--- Confidence 8888988754332 2233433222 23456677888999999999999888888764 67788887766543221 Q ss_pred eEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCC-----ccc Q lcl|NC_018086. 169 PVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIAN-----EER 243 (511) Q Consensus 169 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~-----~~g 243 (511) .+.++ .+....+...|.....+.. ..+. ....+..=-|+++++. ..| T Consensus 141 ---~~~~~------~~~~~~~~~~y~~~~~~~~--~~g~-----------------~~~~~~~~eIiHir~~~~~~~~~G 192 (542) T protein:vir:41 141 ---YRQTW------DGVNITHFKDYRYEGEINP--ETGE-----------------DQDSVGANELVFIHIPSPVCSYYG 192 (542) T ss_pred ---eEeee------cCCcceeEEeecccccccc--cccc-----------------cccccCcccEEEecCCCCCCCccc Confidence 01110 0111111111111110000 0000 0000111124555532 257 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHhcCcee--EeecCCCCcc-------chhh----hhh----h-----hCceeee Q lcl|NC_018086. 244 LGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYL--WLQGFDLSAD-------SDSI----SNM----K-----NDRVIVT 301 (511) Q Consensus 244 ~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l--~~~G~~~~~~-------~~~~----~~~----~-----~~~~i~~ 301 (511) .|.+......+.....+..-..+.+...+.|-. .+.|...++. .+.. ..+ . .++++.+ T Consensus 193 lspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL 272 (542) T protein:vir:41 193 VPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVF 272 (542) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEe Confidence 777776555554443333333333444445544 4445322111 1111 111 1 1234444 Q ss_pred c----CCCceeeeecCC--CHHHHHHHHHHHHHHHHHHhCccccccccccCc--cHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 302 D----EDGMVKFITKDV--NDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAA--SGQALKAATQPLENKSAVKESKFRKV 373 (511) Q Consensus 302 ~----~~~~~~~~~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~--Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 373 (511) . .+++++|..... ....+....+...+.|+..-++|....+..... ++..++.. ....+..+ T Consensus 273 ~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~----------~~~f~~~t 342 (542) T protein:vir:41 273 SIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVT----------RRTYYESV 342 (542) T ss_pred eccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHH----------HHHHHHHH Confidence 2 134455544333 345566667777888888888886554432111 11111111 11222333 Q ss_pred HHHHHHHHHHHHHhcCCCccccccceeEEeC--CCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHH Q lcl|NC_018086. 374 LAKRYELVCSYLEFMNKAKDLKPYEVTPVFV--RNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQR 451 (511) Q Consensus 374 l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~--~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~ 451 (511) |.-+++.|...++..-. .... ..+.+.|+ ..+..|..+.++.+ ...|+++...+++.|+.++--++.. +..-. T Consensus 343 L~P~~~~ie~~ln~~L~-~~~~-~~~~~~f~~~~ll~~d~~~~~~~~-v~~GilT~NE~Re~L~g~~pgdd~~--l~p~~ 417 (542) T protein:vir:41 343 VRPQQNIISSILTDFFQ-VKFN-PKTRFKFNDETLLESDSVRNCALL-VQSGVLTPAEARERLFGLDGGPDIF--MVPSK 417 (542) T ss_pred HHHHHHHHHHHHHhhcc-cccC-CceEEEecchhhcchHHHHHHHHH-HhCCCCCHHHHHHhhCCCCCCCccc--ccccc Confidence 33333333333321111 1111 23455564 33333433333322 2258888877887664443111110 00000 Q ss_pred HHHHHHHHhhcccccc-CCCCCCccccccCCCCCCccccccCCCCccccccccCCCC-CCCC Q lcl|NC_018086. 452 QKRADIALQNFKQTSA-VQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKK-PKTD 511 (511) Q Consensus 452 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 511 (511) ........+.. .+.....+........++.-....++.+...++.+..+++ +-+- T Consensus 418 -----~~~~~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 474 (542) T protein:vir:41 418 -----GAAKSVKRQERNYEKNQIREIRKIYAKYRPRFNEIISSKLSAEEKKKKIDESLAEFR 474 (542) T ss_pred -----ccccccccCCcCCCCCchhhhhhcccccCccccccccccccchhhcccccchhhhhH Confidence 00000000000 0000000000000111111111112222222222111111 0000 No 147 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=98.44 E-value=6.4e-07 Score=54.56 Aligned_cols=389 Identities=8% Similarity=-0.045 Sum_probs=175.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcccc--------cC-CcCccccccceeccchHHHHHHHHHhhhhccCceecCc-h Q lcl|NC_018086. 34 ITLAEMHSRSSSAYGVLYDYYKGNHIAIQ--------SR-TFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGD-E 103 (511) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~yY~G~~~~~~--------~~-~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d-~ 103 (511) +.+...+..+...-......+.....+.. .. ..........-+..+-....|+..++-+-.-|+.+-.+ + T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA~lp~~~~~~~~ 80 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKFGIKLNFSVRGKRALKENTVYVCTKIRAESIGKLSLKIYKDKE 80 (422) T ss_pred CchhhhhhhccCCccchhhhhhhccccccCcchhhhhccccCCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEecCc Confidence 23333222111100000011110000000 00 00000000000222333445666666666667765211 1 Q ss_pred h-hHHHHHHHHhc--cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEE Q lcl|NC_018086. 104 K-TIKAMQPVFKE--NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYN 176 (511) Q Consensus 104 ~-~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~ 176 (511) + ....+.+++.. |. .......+..+.+.+|.||+.+..+..|++ .+..++|..+.++++++.......-++ T Consensus 81 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~~~~~~~~~~~-- 158 (422) T protein:vir:13 81 EYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDDNFLSSLSKVW-- 158 (422) T ss_pred ccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCcceeccceEE-- Confidence 1 11124444432 33 335677888889999999999988888875 577889999988876543211111111 Q ss_pred EEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHH Q lcl|NC_018086. 177 TVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDA 256 (511) Q Consensus 177 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~ 256 (511) .......|.. ..+.++.++++.... + .+.-.|.|.+..+...++. T Consensus 159 y~~~~~~g~~----~~~~~~eiih~~~~~----------------------~---------~~~~~G~s~~~~~~~~i~~ 203 (422) T protein:vir:13 159 YVVTDKNGKE----HKLLPDEMLHFIGDI----------------------T---------LDGLIGIKPLDYLRCTIEN 203 (422) T ss_pred EEEEeCCCeE----EEEcccceEEEcCCC----------------------C---------CCCcccccHHHHHHHHHHH Confidence 1111112211 123344444432100 0 0112577888777777776 Q ss_pred HHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhh----hh--------hCceeeecCCCceeeeecCCCHHHHHHHHH Q lcl|NC_018086. 257 YNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISN----MK--------NDRVIVTDEDGMVKFITKDVNDKHIENIKN 324 (511) Q Consensus 257 ~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~----~~--------~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (511) ......-....++..+.|-.+++-... -.++.... +. .++++.++++.+++.+........+.+..+ T Consensus 204 ~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~ 282 (422) T protein:vir:13 204 GRATQEFINKFFKNGLSIKGIVQYVGD-LDEKAKKIFKKEFESMSNGLENAHSISLLPFGYQFQPISLSMADAQFLENSK 282 (422) T ss_pred HHHHHHHHHHHHhccCCccEEEEeCCC-CCHHHHHHHHHHHHHHhcCccccCCceecCCCceeeeccCChhHHHHHHHHH Confidence 666655555566666667766653221 11111111 11 234666776666666655555556667777 Q ss_pred HHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-cc-ccccceeEE Q lcl|NC_018086. 325 RAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKA-KD-LKPYEVTPV 402 (511) Q Consensus 325 ~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~-~~-~~~~~i~i~ 402 (511) .....|+..-++|....+...+.+...++... ...+..+|.-+++.|...+...--. .. .....+++. T Consensus 283 ~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~~----------~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd 352 (422) T protein:vir:13 283 LTKRELAATFGMKSYHLNDLERATFNNLTEQQ----------KDFYVTTLQSSLTVYEQEIQDKLFSQYETLQDVKAEFN 352 (422) T ss_pred HHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH----------HHHHHHHHHHHHHHHHHHHHHhhCChhhhcCCceEEee Confidence 88889999999987666544333332222211 1223333444433333333211100 01 111224444 Q ss_pred eCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccC Q lcl|NC_018086. 403 FVRNLPQSYAELADMAVKLR--DMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLD 480 (511) Q Consensus 403 f~~~~p~d~~e~a~~~~~~~--g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (511) +..-+..|..+.++++.++. |+++.-.++.+++.-+-+. .+++- ...+ ..+. +..+ +.. T Consensus 353 ~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~g--gD~~~-----------~~~n----~~~l-~~~~-~~~ 413 (422) T protein:vir:13 353 VDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPPVEG--GDRLL-----------VNGN----MIPI-EMAG-EQY 413 (422) T ss_pred chhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCeee-----------eccC----ccch-hhcc-ccc Confidence 45556668888999988874 7898888888876532110 00000 0000 0000 0000 000 Q ss_pred CCCCCccccccCCCCccccccc Q lcl|NC_018086. 481 KNPANTSTITTTDPVAAKEQEK 502 (511) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~ 502 (511) .++|.. .++ T Consensus 414 ~~~g~~-------------~g~ 422 (422) T protein:vir:13 414 KKGGEK-------------GGK 422 (422) T ss_pred ccCCCc-------------CCC Confidence 000000 000 No 148 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=98.44 E-value=6.6e-07 Score=54.45 Aligned_cols=383 Identities=11% Similarity=0.015 Sum_probs=167.6 Q ss_pred Cccch-hhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHH-HhcCCCcccccCCcCccccccceeccch Q lcl|NC_018086. 3 IPNGQ-INAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYD-YYKGNHIAIQSRTFDDTNKPNSKIVHNF 80 (511) Q Consensus 3 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~-yY~G~~~~~~~~~~~~~~~~~~ri~~n~ 80 (511) |+|+. .++...... ..... .... ..... ++.|.. .. ...=+.... T Consensus 1 Mgl~~~~f~~~~~~~-----~~~~~-------~~~~----------~~~~~~~~~g~~--------v~---~~~al~~~~ 47 (409) T protein:vir:84 1 MSLFTRIFSGPSEER-----TLTKI-------SGIP----------SPAEDWAMHGDR--------PG---ANSAMTLGA 47 (409) T ss_pred CchhhhhhcCCCccc-----ccccc-------cccc----------cccchhhccCcc--------cc---hhhhhccHH Confidence 44443 222211000 00000 0000 00000 011110 00 000012234 Q ss_pred HHHHHHHHHhhhhccCceecC--ch-h-hHHHHHHHHhc--c---ChhHHHHHHHHHHhhCCeEEEEe-eeCCCCce-EE Q lcl|NC_018086. 81 PKLLVDTSTAYLAGEPITESG--DE-K-TIKAMQPVFKE--N---YVTDVNSEEVKLSGIFGHCFEIH-WIDRNKKH-RF 149 (511) Q Consensus 81 ~k~ivd~~~~~l~g~~~~~~~--d~-~-~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~~~~~v-~~~~~g~~-~i 149 (511) ....|+..++-+-+-|+.+-. +. . ....+.+++.. | ........+..+.+.+|.+|+++ ..+..|++ .+ T Consensus 48 v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L 127 (409) T protein:vir:84 48 FYACVTLLADTVASLSIDAYRKKDNVRIPVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAI 127 (409) T ss_pred HHHHHHHHHHhhhhCceEEEEecCCcccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEE Confidence 455677776666666775421 11 1 11223344421 2 23456677888999999999765 45677775 57 Q ss_pred EEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccC Q lcl|NC_018086. 150 KAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLL 229 (511) Q Consensus 150 ~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (511) ..++|..+.+....+..... + +.. ...+++ .+.++.++++..-. T Consensus 128 ~~l~p~~v~v~~~~~~~~~~---~--~~~-~~~~g~------~~~~~dvih~~~~~------------------------ 171 (409) T protein:vir:84 128 MPIHPDCIHVTDAKDEDGDW---I--EPV-YRIDGK------VVPNHRIMHIKRYP------------------------ 171 (409) T ss_pred EEEcCceeEEEEcCCCcceE---E--EEE-ecCCce------EEchhhEEEecCCC------------------------ Confidence 78899887665433221100 1 111 111121 13344444432100 Q ss_pred CccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhh---------hCceee Q lcl|NC_018086. 230 QKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMK---------NDRVIV 300 (511) Q Consensus 230 g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~---------~~~~i~ 300 (511) ..+...|.|.++.+...++.......-..+.+...+.|-.+++.-. ...++....++ .++++. T Consensus 172 -------~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~n~g~~~v 243 (409) T protein:vir:84 172 -------VAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDA-DLTPDQVKQTQKQWIQSHHNRRLPAV 243 (409) T ss_pred -------CCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCC-CCCHHHHHHHHHHHHHHhccCCCeee Confidence 0011257787777666666665555555555566666766665321 12222222221 234566 Q ss_pred ecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 301 TDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT--AASGQALKAATQPLENKSAVKESKFRKVLAKRY 378 (511) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~--~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~ 378 (511) ++++.+.+.+........+.+..+...+.|+..-++|..-.+... +.++..++...... +..+|.-++ T Consensus 244 l~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f----------~~~~l~P~~ 313 (409) T protein:vir:84 244 MSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINF----------VRHTLLPWL 313 (409) T ss_pred cCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHH----------HHHHHHHHH Confidence 766555555544444455666677888899999999875544332 22233332221111 122222222 Q ss_pred HHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 379 ELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRAD 456 (511) Q Consensus 379 ~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~ 456 (511) +.|...+...- .....+++.+..-+..|.++.++++.++ .|+++.-.+++.++.-+-+. .+.. T Consensus 314 ~~ie~~l~~~L----~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~g--gD~~--------- 378 (409) T protein:vir:84 314 RCIEQALDTFL----PRGQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIPE--GDIH--------- 378 (409) T ss_pred HHHHHHHHHhc----cCCCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--ccee--------- Confidence 22222222110 0122356666776777899999988887 47898888888876532111 0000 Q ss_pred HHHhhccccccCCCCCCccccccCCCCCCccccccCC Q lcl|NC_018086. 457 IALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTD 493 (511) Q Consensus 457 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (511) ....+. .........++.+.+...+...+.. T Consensus 379 --~~~~n~----~~~~~~~~~~~~~~~~~~~~~~gn~ 409 (409) T protein:vir:84 379 --LQPMNF----VPLGYVPPEEPAQEPQPNSATEGNK 409 (409) T ss_pred --eecccc----cccccCCccccCcCCCCCCccCCCC Confidence 000000 0000000011111000000000000 No 149 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.40 E-value=8.1e-07 Score=53.98 Aligned_cols=379 Identities=10% Similarity=0.030 Sum_probs=163.9 Q ss_pred CccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHH Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPK 82 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k 82 (511) |+++..+....-.. ... -..+..+..+... ....... .-+...-.. T Consensus 1 M~~f~~~~~~~~~~----------~~~-----------------~~~~~~~~~~~~~----~~~v~~~---~al~~~~V~ 46 (397) T protein:vir:38 1 MPLLKLNKSHSQGF----------SLN-----------------DPDWVNFLTGGEA----QKYVSAD---TALKNSDIF 46 (397) T ss_pred CcchhhhhcccCcc----------cCC-----------------chhhhhhhcCCcC----CceechH---HhhccHHHH Confidence 33333221100000 000 0011111111110 0000000 001122223 Q ss_pred HHHHHHHhhhhccCceecCchhhHHHHHHHHhc-c---ChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccce Q lcl|NC_018086. 83 LLVDTSTAYLAGEPITESGDEKTIKAMQPVFKE-N---YVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNC 157 (511) Q Consensus 83 ~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~~~~-n---~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~ 157 (511) ..|+..++-+-+-|++.. +.. +..++.+ | ........+..+.+.+|.||+.+-.+..|++ .+..++|..+ T Consensus 47 ~~v~~ia~~ia~~p~~~~--~~~---~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v 121 (397) T protein:vir:38 47 SLIMQLSGDLAMVRYTSE--SDR---SQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQV 121 (397) T ss_pred HHHHHHHHHHhhCccccc--ccH---HHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCcee Confidence 344555444444566532 211 2222222 2 3345667888899999999998888888875 6778899988 Q ss_pred EEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEee Q lcl|NC_018086. 158 LIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEI 237 (511) Q Consensus 158 ~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~ 237 (511) .+..+.+.. .+. |.+.....+.. ....+.++.++|++.-. . T Consensus 122 ~i~~~~~~~-~~~-----y~~~~~~~~~~--~~~~~~~~eiih~~~~~-------------------------------~ 162 (397) T protein:vir:38 122 QPMLLQDGS-GLI-----YNINFDEPAIG--YMENVPAADVIHIRLLS-------------------------------K 162 (397) T ss_pred EEEEcCCCc-eEE-----EEEEecccccc--ceeEecCccEEEecCCC-------------------------------C Confidence 776654332 111 11111111111 01123444444442110 0 Q ss_pred cCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhh-----------hhCceeeecCCCc Q lcl|NC_018086. 238 IANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNM-----------KNDRVIVTDEDGM 306 (511) Q Consensus 238 ~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~-----------~~~~~i~~~~~~~ 306 (511) .+...|.|.+......+........-..+.+...+.|-.+++-..... .+....+ ..++++.++++.+ T Consensus 163 ~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~-~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~ 241 (397) T protein:vir:38 163 NGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGGL-LDAETRIARSKEISKQIHNSDGPVVIDALED 241 (397) T ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCC-HHHHHHHHHHHHHHhcccccCCceecCCCce Confidence 011257888887777777666666666666666677776665322211 1111111 1234566666666 Q ss_pred eeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 307 VKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLE 386 (511) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~ 386 (511) ++.+........+....+.....|+..-++|..-.+.....+ .++.. ....+..+|.-++..|...++ T Consensus 242 ~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~-~~~e~-----------~~~~~~~~l~P~~~~ie~~ln 309 (397) T protein:vir:38 242 YKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQ-SSITQ-----------ISGQYAKSLNRYVQAIVGELN 309 (397) T ss_pred EEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc-cHHHH-----------HHHHHHHHHHHHHHHHHHHHH Confidence 555555555566677788888999998898865554332211 11111 112333444444444443333 Q ss_pred hcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_018086. 387 FMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQ 464 (511) Q Consensus 387 ~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~ 464 (511) ..-.. + .++.+.| .+-.|..+.++++.++ .|+++...++..++.-.-+..++-.. + . . T Consensus 310 ~~l~~-~---~~~~~~~--~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~--~-----------~-~ 369 (397) T protein:vir:38 310 DKLHA-N---ISANIRF--AIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYLAKDLPDP--E-----------K-E 369 (397) T ss_pred HhccC-h---hcccccc--cccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCccccc--c-----------c-c Confidence 21111 1 1122222 3445777888888777 47888888888775421000000000 0 0 0 Q ss_pred cccCCCCCCccccccCCCCCCccccccCCCCccccccccCCC Q lcl|NC_018086. 465 TSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQK 506 (511) Q Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (511) ..........++++..+.... +.+.++ | T Consensus 370 ---~~~~~~~~~~~~g~~~~~~~~-----e~~~~~------~ 397 (397) T protein:vir:38 370 ---PQQAIQLIQQEGGENDGNNSD-----ERGSDP------E 397 (397) T ss_pred ---ccccccccccccCCCCCCCCC-----CCCCCC------C Confidence 000000000000000000000 000110 0 No 150 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=98.38 E-value=9.5e-07 Score=53.60 Aligned_cols=414 Identities=9% Similarity=-0.023 Sum_probs=171.0 Q ss_pred HHHHHHHHHHHHH--HH-HHHHHhcCCC-cccccCCcCccccc-cce--eccchHHHHHHHHHhhhhccCceecC-ch-- Q lcl|NC_018086. 34 ITLAEMHSRSSSA--YG-VLYDYYKGNH-IAIQSRTFDDTNKP-NSK--IVHNFPKLLVDTSTAYLAGEPITESG-DE-- 103 (511) Q Consensus 34 ~~~~~~~~~~~~~--~~-~~~~yY~G~~-~~~~~~~~~~~~~~-~~r--i~~n~~k~ivd~~~~~l~g~~~~~~~-d~-- 103 (511) +.+...+..+... .. .-...|.... ...........+.. ... +.+.-....|+..++-+-+-|+.+-. ++ T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGT 80 (457) T ss_pred CchhhhhhccccccccccccccccccchhhhhhccccccCCceechHHhhccHHHHHHHHHHHHhHhhCceEEEEecCCc Confidence 1111111100000 00 0000000000 00000000000000 000 01111223445555555555665421 11 Q ss_pred --h-hHHHHHHHHhc-cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEE Q lcl|NC_018086. 104 --K-TIKAMQPVFKE-NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYY 175 (511) Q Consensus 104 --~-~~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~ 175 (511) . ....+..++.. |. .......+..+.+.+|.||+.+-.+ .|++ .+..++|..+.+..+..... .....+. T Consensus 81 ~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~-~~~~~~~ 158 (457) T protein:vir:62 81 RKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA-GPNIAGLDVLDPTKIHVHMVMVDGL-RRKVFEA 158 (457) T ss_pred cccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEeccCCc-cceeEEE Confidence 1 11123333322 22 3456677888899999999888554 4554 56778888876654332211 1111112 Q ss_pred EEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHH Q lcl|NC_018086. 176 NTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLID 255 (511) Q Consensus 176 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d 255 (511) |... ..+.. .....|.++.+++++.-. .. ..-.|.|.++.+...+. T Consensus 159 y~~~--~~g~~-~~~~~~~~~eiih~r~~~----------------------~~---------~~~~G~sp~~~~~~~i~ 204 (457) T protein:vir:62 159 YDID--ADGNE-VLLGWFTPRDVLHIPGMM----------------------LP---------GDFVGCSPISYARESIG 204 (457) T ss_pred EEEc--cCCce-eEEEeeCccceEEecCCC----------------------CC---------CceecccHHHHHHHHHH Confidence 2211 11221 122334556565553110 00 01247777777776666 Q ss_pred HHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhh------------hCceeeecCCCceeeeecCCCHHHHHHHH Q lcl|NC_018086. 256 AYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMK------------NDRVIVTDEDGMVKFITKDVNDKHIENIK 323 (511) Q Consensus 256 ~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~------------~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 323 (511) ....+..-....+...+.|-.+++-.. .-.++....++ .++++.++++.+.+.+..+.....+.+.. T Consensus 205 ~~~~~~~~~~~~f~ng~~p~gil~~~~-~ls~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~ 283 (457) T protein:vir:62 205 LALAAQKYGAHFFRNGAMPGAVVEVPG-TMSEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTR 283 (457) T ss_pred HHHHHHHHHHHHHhccCCcceEEEcCC-CCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHH Confidence 666665555566666667766655322 11222222211 13467777666666665555555666777 Q ss_pred HHHHHHHHHHhCccccccccccC--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-cccccccee Q lcl|NC_018086. 324 NRAKLDIFSLSQTPDLVSKDFTA--ASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKA-KDLKPYEVT 400 (511) Q Consensus 324 ~~l~~~i~~~s~~p~~~~~~~~~--~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~-~~~~~~~i~ 400 (511) +.....|+..-++|....+.... .++..++..... .+..+|.-+++.|...+...-.. .......++ T Consensus 284 ~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~----------f~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~ 353 (457) T protein:vir:62 284 QFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIA----------FTMFSLRPWLERIEAGFNRLLFAETADRFRFVK 353 (457) T ss_pred HHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHH----------HHHHHHHHHHHHHHHHHHhhhcCccccCceEEE Confidence 78888999999998755543322 223333222111 12222333333222222211111 111222355 Q ss_pred EEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHHHHhhccccccCCCC-CCcc Q lcl|NC_018086. 401 PVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWI--TDARQEVEKADAQRQKRADIALQNFKQTSAVQGA-STAA 475 (511) Q Consensus 401 i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~-~~~~ 475 (511) +.+...+..|..+.++++.++ .|+++...++.+++.- ++.....-.+ ..............+. ...+ T Consensus 354 fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~D~~~~--------~~n~~~~~~~~~~~~~~~~~~ 425 (457) T protein:vir:62 354 FNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRV--------PLNLGEIGEEPEPEPAPAPPA 425 (457) T ss_pred eechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeee--------ccccccccccccccccCCCcc Confidence 555666667889999998887 4789998899887653 2221000000 0000000000000000 0000 Q ss_pred ccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 476 ANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) .+++.+++.++ .++. ...+.+|...|. T Consensus 426 ~~~~~~~~~~~-----~~~~----~~~~~~d~~~~~ 452 (457) T protein:vir:62 426 IDPPAEEPADD-----EEPD----NAEGDPDEGETE 452 (457) T ss_pred CCCCccCCCCC-----CCCC----CCCCCCcccccc Confidence 11111111111 1111 111222222222 No 151 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=98.36 E-value=1e-06 Score=53.39 Aligned_cols=382 Identities=10% Similarity=0.007 Sum_probs=171.2 Q ss_pred cCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHH-HHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhh Q lcl|NC_018086. 14 ITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAY-GVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYL 92 (511) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~-~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l 92 (511) +....++..+... ........ ..+..+.-|.. ..+..-+...-....|+..++-+ T Consensus 1 MG~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~g~~~-----------~~~~~al~~~~V~~~v~~Ia~~i 56 (411) T protein:vir:81 1 MGWWSRLTRFFRP-------------RNETVDMTNPLLLQWLGVDP-----------DTPRNQLSEATYFACLKILSESL 56 (411) T ss_pred CchHHHHHhhccC-------------cccccccchHHHHHHhcCcc-----------cChhhhhccHHHHHHHHHHHHhH Confidence 1111111100000 00000000 00011111110 00111122222344566666666 Q ss_pred hccCceec---Cch--h-hHHHHHHHHhc--c---ChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEE Q lcl|NC_018086. 93 AGEPITES---GDE--K-TIKAMQPVFKE--N---YVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIA 160 (511) Q Consensus 93 ~g~~~~~~---~d~--~-~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v 160 (511) -.-|+.+- .+. + ....+..++.. | ........+..+.+.+|.||+++..+. |++ .+..++|..+.++ T Consensus 57 A~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~~l~~l~~~~v~~~ 135 (411) T protein:vir:81 57 GKLPLKMYQKTERGIVKSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYSG-PQLQALWILPSQYVTIV 135 (411) T ss_pred hhCceeEEEecCCceeeecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CceEEEEEECCceEEEE Confidence 56676541 111 1 11234444432 3 234566777888899999999887774 554 4777899998888 Q ss_pred ecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCC Q lcl|NC_018086. 161 YSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIAN 240 (511) Q Consensus 161 ~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~ 240 (511) .++.........+ +|......++..+ .+.++.++|++... | .+. T Consensus 136 ~~~~~~~~~~~~~-~~~~~~~~~g~~~----~~~~~eiih~k~~~---------------------------~----~~~ 179 (411) T protein:vir:81 136 VDDRGLLGEKNAI-WYRYNDPYDGKMY----VFRNDEILHFKTSV---------------------------T----FDG 179 (411) T ss_pred EcCcccccccceE-EEEEEecCCceEE----EEccccEEEEcCCC---------------------------C----CCC Confidence 7654321111111 1111111112111 13444444442100 0 011 Q ss_pred cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhh----hhh--------hCceeeecCCCcee Q lcl|NC_018086. 241 EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSIS----NMK--------NDRVIVTDEDGMVK 308 (511) Q Consensus 241 ~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~----~~~--------~~~~i~~~~~~~~~ 308 (511) -.|.|.+..+...++.......-..+.+...+.|-.+++.... -.++... .+. .++++.++++.+.+ T Consensus 180 ~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~ 258 (411) T protein:vir:81 180 ITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGD-LNQEARDRLVKGFEQFANGSKNAGKIIPVPLGMKLV 258 (411) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC-CCHHHHHHHHHHHHHHhcCccccCCceecCCCceEE Confidence 2477777776666666665555555556666668777655321 1122111 111 13456666666665 Q ss_pred eeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_018086. 309 FITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFM 388 (511) Q Consensus 309 ~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~ 388 (511) .+........+.+..+...+.|+..-++|....+.....+-..++.. ....+..+|.-++..|...+... T Consensus 259 ~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~----------~~~f~~~~l~P~~~~ie~~l~~~ 328 (411) T protein:vir:81 259 PLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQ----------NLAFYVDTLLYVLKQYEEEITYK 328 (411) T ss_pred EccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHH----------HHHHHHHHHHHHHHHHHHHHHhh Confidence 55444444555666788889999999999765544332222111111 12333444555544444444321 Q ss_pred CCC-cc-ccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_018086. 389 NKA-KD-LKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQ 464 (511) Q Consensus 389 ~~~-~~-~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~ 464 (511) --. .. .....+++.+..-+..|..+.++++.++ .|+++.-.++..++.-+.+.. +..-.... ... ++... T Consensus 329 ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~gg--D~~~~~~n-~~p--l~~~~- 402 (411) T protein:vir:81 329 ILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPADDYG--NNLMANGN-YIP--LSMLG- 402 (411) T ss_pred cCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC--CeeeeccC-ccc--hhhhh- Confidence 111 11 1122355555666677889999998887 478888888888775332110 00000000 000 00000 Q ss_pred cccCCCCCCccccccCCCCCCc Q lcl|NC_018086. 465 TSAVQGASTAAANKLDKNPANT 486 (511) Q Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~ 486 (511) .... ++|+. T Consensus 403 ---------~~~~----kgGd~ 411 (411) T protein:vir:81 403 ---------ANYG----KGGDS 411 (411) T ss_pred ---------hhhc----cCCCC Confidence 0000 00000 No 152 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=98.36 E-value=1e-06 Score=53.37 Aligned_cols=397 Identities=9% Similarity=0.013 Sum_probs=173.0 Q ss_pred cCchhhHhhhhccCCCHHHHHHHHHHHHHHH------HH-HHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHH Q lcl|NC_018086. 14 ITTNIRRKHFIRRNFDLRELITLAEMHSRSS------SA-YGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVD 86 (511) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~------~~-~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd 86 (511) +....++. ++..-+++.. .. ...+..+.-+... ...... ..-+.+.-....|+ T Consensus 1 M~~~~r~~-------------~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~----~~~v~~---~~al~~~~v~~~i~ 60 (432) T protein:vir:10 1 MKIVDSVK-------------KFFNFEKRQTSQVIELNKDDEKLLEWLGISPS----TISVKG---KNALKVATVFACIK 60 (432) T ss_pred CChHHHHH-------------HhcCccccCcccccccCCchHHHHHHhCCCcC----ccccch---hhhhccHHHHHHHH Confidence 11111111 0000000000 00 0011111100000 000000 00011222334555 Q ss_pred HHHhhhhccCceec--Cch---h-hHHHHHHHHhc--c---ChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcc Q lcl|NC_018086. 87 TSTAYLAGEPITES--GDE---K-TIKAMQPVFKE--N---YVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSP 154 (511) Q Consensus 87 ~~~~~l~g~~~~~~--~d~---~-~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p 154 (511) ..++-+-+-|+.+- .++ . ....+..++.. | ........+..+.+.+|.+|+++..+..|++ .+..++| T Consensus 61 ~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~ 140 (432) T protein:vir:10 61 ILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDA 140 (432) T ss_pred HHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcC Confidence 55555555676641 111 1 12234444432 2 2345677788889999999999988888886 5678899 Q ss_pred cceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccce Q lcl|NC_018086. 155 MNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPV 234 (511) Q Consensus 155 ~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 234 (511) ..+.++.++.... ....+.|+... ..+.. ..+.+..++|++.. .| T Consensus 141 ~~v~v~~d~~~~~--~~~~~~~y~~~-~~g~~----~~~~~~eiih~r~~---------------------------~~- 185 (432) T protein:vir:10 141 SKVTVYIDDVGLL--NSKTKMWYVVN-TGGQQ----RVLKPEEILHFKNG---------------------------IT- 185 (432) T ss_pred ceeEEEEcCcccc--cccceEEEEEe-cCCeE----EEEccccEEEecCC---------------------------CC- Confidence 8887766543211 11111111111 11111 12334444443210 00 Q ss_pred EeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhh----hh--------hCceeeec Q lcl|NC_018086. 235 LEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISN----MK--------NDRVIVTD 302 (511) Q Consensus 235 v~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~----~~--------~~~~i~~~ 302 (511) .+.-.|.|.+..+...++.......-....+...+.|-.+++-.. .-.++.... +. .++++.++ T Consensus 186 ---~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~ 261 (432) T protein:vir:10 186 ---LDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG-DLNEDAKKVFRENFESMSSGLQNSHRIALMP 261 (432) T ss_pred ---CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC-CCCHHHHHHHHHHHHHHhcccccCCcceecC Confidence 011247787777777777666655555556666666776665322 111111111 11 23566777 Q ss_pred CCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 303 EDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVC 382 (511) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~ 382 (511) ++.+.+.+..+.....+....+...+.|+..-++|....+.....+...++... ...+..+|.-+++.|. T Consensus 262 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~----------~~~~~~~l~P~~~~ie 331 (432) T protein:vir:10 262 VGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQ----------QQFYTDTLQATLTMYE 331 (432) T ss_pred CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH----------HHHHHHHHHHHHHHHH Confidence 666666555444445556667788889999999987666543333322222211 1223334444444443 Q ss_pred HHHHhcC--CCccccccceeEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 383 SYLEFMN--KAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR--DMLPDETIINQFPWITDARQEVEKADAQRQKRADIA 458 (511) Q Consensus 383 ~~~~~~~--~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~--g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~ 458 (511) ..+...- .........+++.+..-+..|..+.++++.++. |+++...++..+++-+.+. .++...-... . . T Consensus 332 ~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~g--gD~~~~~~n~-~--~ 406 (432) T protein:vir:10 332 QEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAG--GDRLLVNGNM-L--P 406 (432) T ss_pred HHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--CCeEeecccc-c--c Confidence 3333211 110111223555556667789999999988874 7899888888886533111 0000000000 0 0 Q ss_pred HhhccccccCCCCCCccccccCCCCCCccc Q lcl|NC_018086. 459 LQNFKQTSAVQGASTAAANKLDKNPANTST 488 (511) Q Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (511) ++...+ ....++.+ .++. .+.++.+. T Consensus 407 ~~~~~~-~~~k~~~~--~~~~-~~~~~~~~ 432 (432) T protein:vir:10 407 IDMAGQ-AYLKGGDT--NGEV-SKEGNEGN 432 (432) T ss_pred hhhccc-cccCCCCC--CCCC-CCCCCCCC Confidence 000000 00000000 0000 00000000 No 153 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=98.36 E-value=1e-06 Score=53.37 Aligned_cols=397 Identities=9% Similarity=0.013 Sum_probs=173.0 Q ss_pred cCchhhHhhhhccCCCHHHHHHHHHHHHHHH------HH-HHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHH Q lcl|NC_018086. 14 ITTNIRRKHFIRRNFDLRELITLAEMHSRSS------SA-YGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVD 86 (511) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~------~~-~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd 86 (511) +....++. ++..-+++.. .. ...+..+.-+... ...... ..-+.+.-....|+ T Consensus 1 M~~~~r~~-------------~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~----~~~v~~---~~al~~~~v~~~i~ 60 (432) T protein:vir:10 1 MKIVDSVK-------------KFFNFEKRQTSQVIELNKDDEKLLEWLGISPS----TISVKG---KNALKVATVFACIK 60 (432) T ss_pred CChHHHHH-------------HhcCccccCcccccccCCchHHHHHHhCCCcC----ccccch---hhhhccHHHHHHHH Confidence 11111111 0000000000 00 0011111100000 000000 00011222334555 Q ss_pred HHHhhhhccCceec--Cch---h-hHHHHHHHHhc--c---ChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcc Q lcl|NC_018086. 87 TSTAYLAGEPITES--GDE---K-TIKAMQPVFKE--N---YVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSP 154 (511) Q Consensus 87 ~~~~~l~g~~~~~~--~d~---~-~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p 154 (511) ..++-+-+-|+.+- .++ . ....+..++.. | ........+..+.+.+|.+|+++..+..|++ .+..++| T Consensus 61 ~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~ 140 (432) T protein:vir:10 61 ILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDA 140 (432) T ss_pred HHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcC Confidence 55555555676641 111 1 12234444432 2 2345677788889999999999988888886 5678899 Q ss_pred cceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccce Q lcl|NC_018086. 155 MNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPV 234 (511) Q Consensus 155 ~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 234 (511) ..+.++.++.... ....+.|+... ..+.. ..+.+..++|++.. .| T Consensus 141 ~~v~v~~d~~~~~--~~~~~~~y~~~-~~g~~----~~~~~~eiih~r~~---------------------------~~- 185 (432) T protein:vir:10 141 SKVTVYIDDVGLL--NSKTKMWYVVN-TGGQQ----RVLKPEEILHFKNG---------------------------IT- 185 (432) T ss_pred ceeEEEEcCcccc--cccceEEEEEe-cCCeE----EEEccccEEEecCC---------------------------CC- Confidence 8887766543211 11111111111 11111 12334444443210 00 Q ss_pred EeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhh----hh--------hCceeeec Q lcl|NC_018086. 235 LEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISN----MK--------NDRVIVTD 302 (511) Q Consensus 235 v~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~----~~--------~~~~i~~~ 302 (511) .+.-.|.|.+..+...++.......-....+...+.|-.+++-.. .-.++.... +. .++++.++ T Consensus 186 ---~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~ 261 (432) T protein:vir:10 186 ---LDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG-DLNEDAKKVFRENFESMSSGLQNSHRIALMP 261 (432) T ss_pred ---CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC-CCCHHHHHHHHHHHHHHhcccccCCcceecC Confidence 011247787777777777666655555556666666776665322 111111111 11 23566777 Q ss_pred CCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 303 EDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVC 382 (511) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~ 382 (511) ++.+.+.+..+.....+....+...+.|+..-++|....+.....+...++... ...+..+|.-+++.|. T Consensus 262 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~----------~~~~~~~l~P~~~~ie 331 (432) T protein:vir:10 262 VGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQ----------QQFYTDTLQATLTMYE 331 (432) T ss_pred CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH----------HHHHHHHHHHHHHHHH Confidence 666666555444445556667788889999999987666543333322222211 1223334444444443 Q ss_pred HHHHhcC--CCccccccceeEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 383 SYLEFMN--KAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR--DMLPDETIINQFPWITDARQEVEKADAQRQKRADIA 458 (511) Q Consensus 383 ~~~~~~~--~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~--g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~ 458 (511) ..+...- .........+++.+..-+..|..+.++++.++. |+++...++..+++-+.+. .++...-... . . T Consensus 332 ~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~g--gD~~~~~~n~-~--~ 406 (432) T protein:vir:10 332 QEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAG--GDRLLVNGNM-L--P 406 (432) T ss_pred HHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--CCeEeecccc-c--c Confidence 3333211 110111223555556667789999999988874 7899888888886533111 0000000000 0 0 Q ss_pred HhhccccccCCCCCCccccccCCCCCCccc Q lcl|NC_018086. 459 LQNFKQTSAVQGASTAAANKLDKNPANTST 488 (511) Q Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (511) ++...+ ....++.+ .++. .+.++.+. T Consensus 407 ~~~~~~-~~~k~~~~--~~~~-~~~~~~~~ 432 (432) T protein:vir:10 407 IDMAGQ-AYLKGGDT--NGEV-SKEGNEGN 432 (432) T ss_pred hhhccc-cccCCCCC--CCCC-CCCCCCCC Confidence 000000 00000000 0000 00000000 No 154 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=98.36 E-value=1e-06 Score=53.37 Aligned_cols=397 Identities=9% Similarity=0.013 Sum_probs=173.0 Q ss_pred cCchhhHhhhhccCCCHHHHHHHHHHHHHHH------HH-HHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHH Q lcl|NC_018086. 14 ITTNIRRKHFIRRNFDLRELITLAEMHSRSS------SA-YGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVD 86 (511) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~------~~-~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd 86 (511) +....++. ++..-+++.. .. ...+..+.-+... ...... ..-+.+.-....|+ T Consensus 1 M~~~~r~~-------------~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~----~~~v~~---~~al~~~~v~~~i~ 60 (432) T protein:vir:10 1 MKIVDSVK-------------KFFNFEKRQTSQVIELNKDDEKLLEWLGISPS----TISVKG---KNALKVATVFACIK 60 (432) T ss_pred CChHHHHH-------------HhcCccccCcccccccCCchHHHHHHhCCCcC----ccccch---hhhhccHHHHHHHH Confidence 11111111 0000000000 00 0011111100000 000000 00011222334555 Q ss_pred HHHhhhhccCceec--Cch---h-hHHHHHHHHhc--c---ChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcc Q lcl|NC_018086. 87 TSTAYLAGEPITES--GDE---K-TIKAMQPVFKE--N---YVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSP 154 (511) Q Consensus 87 ~~~~~l~g~~~~~~--~d~---~-~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p 154 (511) ..++-+-+-|+.+- .++ . ....+..++.. | ........+..+.+.+|.+|+++..+..|++ .+..++| T Consensus 61 ~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~ 140 (432) T protein:vir:10 61 ILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDA 140 (432) T ss_pred HHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcC Confidence 55555555676641 111 1 12234444432 2 2345677788889999999999988888886 5678899 Q ss_pred cceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccce Q lcl|NC_018086. 155 MNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPV 234 (511) Q Consensus 155 ~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 234 (511) ..+.++.++.... ....+.|+... ..+.. ..+.+..++|++.. .| T Consensus 141 ~~v~v~~d~~~~~--~~~~~~~y~~~-~~g~~----~~~~~~eiih~r~~---------------------------~~- 185 (432) T protein:vir:10 141 SKVTVYIDDVGLL--NSKTKMWYVVN-TGGQQ----RVLKPEEILHFKNG---------------------------IT- 185 (432) T ss_pred ceeEEEEcCcccc--cccceEEEEEe-cCCeE----EEEccccEEEecCC---------------------------CC- Confidence 8887766543211 11111111111 11111 12334444443210 00 Q ss_pred EeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhh----hh--------hCceeeec Q lcl|NC_018086. 235 LEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISN----MK--------NDRVIVTD 302 (511) Q Consensus 235 v~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~----~~--------~~~~i~~~ 302 (511) .+.-.|.|.+..+...++.......-....+...+.|-.+++-.. .-.++.... +. .++++.++ T Consensus 186 ---~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~ 261 (432) T protein:vir:10 186 ---LDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG-DLNEDAKKVFRENFESMSSGLQNSHRIALMP 261 (432) T ss_pred ---CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC-CCCHHHHHHHHHHHHHHhcccccCCcceecC Confidence 011247787777777777666655555556666666776665322 111111111 11 23566777 Q ss_pred CCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 303 EDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVC 382 (511) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~ 382 (511) ++.+.+.+..+.....+....+...+.|+..-++|....+.....+...++... ...+..+|.-+++.|. T Consensus 262 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~----------~~~~~~~l~P~~~~ie 331 (432) T protein:vir:10 262 VGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQ----------QQFYTDTLQATLTMYE 331 (432) T ss_pred CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH----------HHHHHHHHHHHHHHHH Confidence 666666555444445556667788889999999987666543333322222211 1223334444444443 Q ss_pred HHHHhcC--CCccccccceeEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 383 SYLEFMN--KAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR--DMLPDETIINQFPWITDARQEVEKADAQRQKRADIA 458 (511) Q Consensus 383 ~~~~~~~--~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~--g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~ 458 (511) ..+...- .........+++.+..-+..|..+.++++.++. |+++...++..+++-+.+. .++...-... . . T Consensus 332 ~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~g--gD~~~~~~n~-~--~ 406 (432) T protein:vir:10 332 QEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAG--GDRLLVNGNM-L--P 406 (432) T ss_pred HHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--CCeEeecccc-c--c Confidence 3333211 110111223555556667789999999988874 7899888888886533111 0000000000 0 0 Q ss_pred HhhccccccCCCCCCccccccCCCCCCccc Q lcl|NC_018086. 459 LQNFKQTSAVQGASTAAANKLDKNPANTST 488 (511) Q Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (511) ++...+ ....++.+ .++. .+.++.+. T Consensus 407 ~~~~~~-~~~k~~~~--~~~~-~~~~~~~~ 432 (432) T protein:vir:10 407 IDMAGQ-AYLKGGDT--NGEV-SKEGNEGN 432 (432) T ss_pred hhhccc-cccCCCCC--CCCC-CCCCCCCC Confidence 000000 00000000 0000 00000000 No 155 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=98.35 E-value=1.1e-06 Score=53.24 Aligned_cols=410 Identities=8% Similarity=-0.036 Sum_probs=172.9 Q ss_pred HHHHHHHHHHHH--------HHHHHH----HHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCcee- Q lcl|NC_018086. 33 LITLAEMHSRSS--------SAYGVL----YDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITE- 99 (511) Q Consensus 33 l~~~~~~~~~~~--------~~~~~~----~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~- 99 (511) |-.+..+-+... ..+..+ .+-|-|-. .......+..+ +...=....|+..++-+-.-|+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~---~~g~~v~~~~a---l~~~~V~~~v~~Ia~~iA~lp~~~~ 74 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAW---QQGVKADPEAV---LSFHAVFACISLISQDIAKMRLRLM 74 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchh---hcCcccChHHh---hccHHHHHHHHHHHHhhccCceEEE Confidence 111110000000 000000 00011100 00000000000 111112234555555555557664 Q ss_pred --cCch---hh-HHHHHHHHhc-cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCc Q lcl|NC_018086. 100 --SGDE---KT-IKAMQPVFKE-NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEE 168 (511) Q Consensus 100 --~~d~---~~-~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~ 168 (511) ..+. .. ...+..++.+ |. .......+..+.+.+|.||+++-.+..|++ .+..++|..+-++++++. . T Consensus 75 ~~~~~g~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g--~ 152 (454) T protein:vir:93 75 QTDAQGIRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWNRVEPLVADDG--E 152 (454) T ss_pred EeccCCccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCcceEEEEcCCC--c Confidence 1111 11 1123334433 33 235667778889999999999988888886 577889999887765432 1 Q ss_pred eEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhH Q lcl|NC_018086. 169 PVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFE 248 (511) Q Consensus 169 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~ 248 (511) +. |........... ....+..+.++|+.... ..+...|.|.+. T Consensus 153 ~~-----y~~~~~~~~~~~-~~~~~~~~eViH~k~~~-------------------------------~~~~~~G~sp~~ 195 (454) T protein:vir:93 153 VF-----YRITPDRNCGIT-EAVTVPAREVIHDRFNC-------------------------------FFHPLIGLPPVY 195 (454) T ss_pred EE-----EEEEeccccccc-eeEEecCcceEEeccCC-------------------------------CCCCceeccHHH Confidence 11 111111111001 11124444454442100 011224777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhh-----------hhCceeeecCCCceeeeecCCCHH Q lcl|NC_018086. 249 AQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNM-----------KNDRVIVTDEDGMVKFITKDVNDK 317 (511) Q Consensus 249 ~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~-----------~~~~~i~~~~~~~~~~~~~~~~~~ 317 (511) .....+.....+.......+...+.|-.+++-.. .-.++....+ ..++++.++++.+.+.+....... T Consensus 196 ~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~ 274 (454) T protein:vir:93 196 AAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPG-SITEENAKKLKSNWDSGYTGENAGKTAILSNGAKYNPTTFSPVDS 274 (454) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCccEEEecCC-CCCHHHHHHHHHHHHHHhcccccCCceeccCCceEEEcccChhHH Confidence 6666666555555444555555556655554221 1122222222 123466777666666665555555 Q ss_pred HHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc Q lcl|NC_018086. 318 HIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPY 397 (511) Q Consensus 318 ~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~ 397 (511) .+.+..+.....|+..-++|....+.....+...++... ...+..+|.-+++.|...+...-.. . ... T Consensus 275 q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~----------~~f~~~~l~P~~~~ie~~ln~~L~~-~-~~~ 342 (454) T protein:vir:93 275 QTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALE----------QQYYSQCLQTLIESIELLLDEALET-G-ENE 342 (454) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcC-C-CCc Confidence 666667788889999999987655443333322222111 1222223333333222222211100 1 112 Q ss_pred ceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCcc Q lcl|NC_018086. 398 EVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAA 475 (511) Q Consensus 398 ~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 475 (511) .+++.+...+..|..+.++.+.++ .|+++.-.++.+++.-+-+.. ++.-... ....++...+.... .. T Consensus 343 ~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg--D~~~~~~---~~~~~~~~~~~~~~-----~~ 412 (454) T protein:vir:93 343 STEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGG--DALYLQQ---QNYSLEALSRRDAR-----ED 412 (454) T ss_pred EEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC--Ceeeecc---CccchHhhhccCcc-----cC Confidence 355666677778889999988887 478998888888765321100 0000000 00000000010000 00 Q ss_pred ccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 476 ANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) .....+.++......+ ...+.........|+.++. T Consensus 413 ~~~~~~~~~~~~~~~~-~~d~~~~~~e~~~d~~~~~ 447 (454) T protein:vir:93 413 PFASSGKTASVPQAVA-ASDGNKAITETEHDAVKAM 447 (454) T ss_pred CCCCCccCCCCCCCCC-CCCCCCCccCCccchhhhh Confidence 0001111111110000 0111111122222333333 No 156 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=98.34 E-value=1.2e-06 Score=53.09 Aligned_cols=465 Identities=12% Similarity=0.026 Sum_probs=175.3 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc---------------cCC Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQ---------------SRT 65 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~---------------~~~ 65 (511) |.--.++.|-....+..+.++-+.-. +..+...+-+-.. +...-+.+.+.-+++..... ++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (574) T protein:vir:80 1 MPKWLDKALGIEKSSIEETRNMENYK-MHLREIDTNVVNN--EPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPS 77 (574) T ss_pred CcchhhhhhccchhhHHHHHhhhhhc-cccchhhhhhhhc--cCCCHHHHHHhHhhhcccccchhhhhccccccccCcCc Confidence 55555555555544444333322211 1111111111000 00000111111111111100 000 Q ss_pred cCccc-ccc-ce-e-ccchHHHHHHHHHhhhh-----------ccCceec---Cc-------hhhHHHHHHHHhcc---- Q lcl|NC_018086. 66 FDDTN-KPN-SK-I-VHNFPKLLVDTSTAYLA-----------GEPITES---GD-------EKTIKAMQPVFKEN---- 116 (511) Q Consensus 66 ~~~~~-~~~-~r-i-~~n~~k~ivd~~~~~l~-----------g~~~~~~---~d-------~~~~~~l~~~~~~n---- 116 (511) ..++. .+. .+ . .......+++..++-++ +-|+.+- .+ ......+.+++.+. T Consensus 78 ~~~~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~ 157 (574) T protein:vir:80 78 IRNSQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFR 157 (574) T ss_pred cCCcccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCC Confidence 01100 000 00 0 12233344444333221 2344431 11 11223455555431 Q ss_pred -----ChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEE Q lcl|NC_018086. 117 -----YVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTY 190 (511) Q Consensus 117 -----~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~ 190 (511) .+..+...+..+.+.+|.+|+.+-.+.+|+| .+..++|..+.+..+..... .....+|+... ++... T Consensus 158 nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~G~~~~L~pl~p~~V~v~~d~~~~~-~~~~~~y~~~~---~g~~~--- 230 (574) T protein:vir:80 158 DPNRDNFTTFCKKLVRATYMYDQVNFEKVFDKDGNFIKFDTVDPTTIFLATNGEGKL-IKNGERFVQVI---DNRIV--- 230 (574) T ss_pred CCccccHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccc-ccCceEEEEEe---CCceE--- Confidence 2345667788889999999998888888876 46778999888776543211 01112222221 11111 Q ss_pred EEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 191 EVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAY 270 (511) Q Consensus 191 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~ 270 (511) ..+.++.+++++... ........+|.|.+..+...++....+..-..+.+.. T Consensus 231 ~~~~~~eiih~~~~~----------------------------~~~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~n 282 (574) T protein:vir:80 231 AKFNERELAFAVRNP----------------------------RADIEVGQYGYPELEIALKQFIAHENTEVFNDRFFSH 282 (574) T ss_pred EEEccccEEEEeccC----------------------------CCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 123444444443110 0000012357788877777776666665555556666 Q ss_pred hcCceeEe--ecCC-CCcc--chhhhhhh--------hCce-eeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCc Q lcl|NC_018086. 271 WNDAYLWL--QGFD-LSAD--SDSISNMK--------NDRV-IVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQT 336 (511) Q Consensus 271 ~~~p~l~~--~G~~-~~~~--~~~~~~~~--------~~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~ 336 (511) .+.|-.++ .+.. .+++ ......+. .+++ +.+.++.++..+........+....+...+.|+..-++ T Consensus 283 g~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s~~D~qfle~~~~~~~~Ia~afgV 362 (574) T protein:vir:80 283 GGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPSANDMQFEKWLNYLINVISALYGI 362 (574) T ss_pred cCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCC Confidence 66676554 3322 2221 11111111 1222 33344445544544445556667778888889999899 Q ss_pred cccccccccC--c--c-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCH Q lcl|NC_018086. 337 PDLVSKDFTA--A--S-GQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSY 411 (511) Q Consensus 337 p~~~~~~~~~--~--S-g~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~ 411 (511) |....+.... . | +..+.. +... ......+..+|.-+++.|...+...-. ..+. ..+.+.|...-..+. T Consensus 363 Pp~~lG~~~~~t~~gs~~~~~n~--sn~E---~~~~~f~~~tL~P~~~~ie~~ln~~Ll-~~~~-~~~~~~f~~~d~~~~ 435 (574) T protein:vir:80 363 DPAEINFPNNGGATGSKGGSLNE--GNSK---EKMQASQNKGLQPLLRFIEDTVNTYIV-AEFG-EKYQFQFRGGDLSAQ 435 (574) T ss_pred CHHHhcccccccccccccccccc--hhHH---HHHHHHHHHHHHHHHHHHHHHHHhhhh-hhcC-CceEEEecccchhhH Confidence 8755432211 1 1 001000 0000 011122222333333333332221100 1111 235677876665555 Q ss_pred HHHHHHHHHH-hccCChHHHHHhCCCC--CCHHHH-----HHHHHH-------H---HHHHHHHHHhhccccccCCCCCC Q lcl|NC_018086. 412 AELADMAVKL-RDMLPDETIINQFPWI--TDARQE-----VEKADA-------Q---RQKRADIALQNFKQTSAVQGAST 473 (511) Q Consensus 412 ~e~a~~~~~~-~g~~s~et~~~~l~~v--~d~~~E-----~~ri~~-------E---~~~~~~~~~~~~~~~~~~~~~~~ 473 (511) .+...+.... .|+++.-.++.+++.- ++-+.- +..+.. + +++.........+......+..+ T Consensus 436 ~~~~~~~~~~~~G~lT~NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (574) T protein:vir:80 436 LDKLKIIEQEGKVFRTVNEIRHDKGLEPIKGGDVILNGVHIQAIGQALQEEQLEYQRSQDRLNRLLELSGGDVEQPEPEE 515 (574) T ss_pred HHHHHHHHHHhCCccCHHHHHHHhCCCCCCCCCEeeeccceeecccccccccCCccchhccccccccccCCCCCCCCCCC Confidence 5555443222 5899998888887542 211100 000000 0 00000000000000000000000 Q ss_pred ccccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 474 AAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) ....+.++........+....-.++...++ .++-+|| T Consensus 516 p~~~~~d~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 552 (574) T protein:vir:80 516 PKDSQNDTDVSFQDEQQGLNGKSKKVNGKV-DDNVGKD 552 (574) T ss_pred CCCccccccchhhhhhhhhccchhhhcCCc-ccccccc Confidence 000011111111111111100011111111 1112222 No 157 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=98.34 E-value=1.2e-06 Score=53.08 Aligned_cols=388 Identities=11% Similarity=0.002 Sum_probs=180.7 Q ss_pred CccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHH----HHHHHHHHHHHhcCCCcccccCCcCccccccceecc Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSR----SSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVH 78 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~ 78 (511) |=+.++ ..+... ...-...+..+|-|..... ...... ..-+.. T Consensus 1 m~~~~~----------------------------f~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~v~~---~~al~~ 47 (416) T protein:vir:12 1 MLLERM----------------------------FEKRSGSSDHEDGFNNILLNMFGGRKTAS--GERVSE---SNSLVQ 47 (416) T ss_pred Cccchh----------------------------cccccCccccCccchhHHHHhhcCccccc--Cceech---hhhhcc Confidence 111111 111000 0011123345554432111 110000 011223 Q ss_pred chHHHHHHHHHhhhhccCcee-c-Cch---hh-HHHHHHHHh-c-c---ChhHHHHHHHHHHhhCCeEEEEeeeCCCCce Q lcl|NC_018086. 79 NFPKLLVDTSTAYLAGEPITE-S-GDE---KT-IKAMQPVFK-E-N---YVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH 147 (511) Q Consensus 79 n~~k~ivd~~~~~l~g~~~~~-~-~d~---~~-~~~l~~~~~-~-n---~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~ 147 (511) ......|+..++-+-.-|+.+ . .++ .. ...+..++. + | ........+..+.+.+|.||+++..+..|++ T Consensus 48 ~~v~~~i~~Ia~~ia~l~~~~~~~~~~~~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~ 127 (416) T protein:vir:12 48 PDIFACVNVLSDDIAKLPIHTYKRTDGGIERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYP 127 (416) T ss_pred HHHHHHHHHHHHhhhhCceEEEEecCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcE Confidence 444556676666666667653 1 111 11 111233332 2 2 2345667788889999999999988888876 Q ss_pred -EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccccccccccccccee Q lcl|NC_018086. 148 -RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHP 226 (511) Q Consensus 148 -~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (511) .+..++|..+.++.++.... .+|... .+|..+ .+.+..+++++.- T Consensus 128 ~~L~~l~~~~v~v~~~~~~~~------~~~~~~--~~g~~~----~~~~~eiih~~~~---------------------- 173 (416) T protein:vir:12 128 EALFPLRPDYTNAYVHPTTGM------LWYQTV--LNGKAI----ELYDYEVLHFKGL---------------------- 173 (416) T ss_pred EEEEEECCcceEEEEeCCCcE------EEEEEe--cCCeEE----EecCccEEEecCc---------------------- Confidence 47788999887766554321 112211 122211 2334444444210 Q ss_pred ccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhh--------hhCce Q lcl|NC_018086. 227 NLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNM--------KNDRV 298 (511) Q Consensus 227 ~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~--------~~~~~ 298 (511) + .+...|.|.+..+...++.......-..+.++..+.|-.++.-.. ..+++....+ ..+++ T Consensus 174 ------~----~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~e~~~~~~~~~~~~~~~~~~ 242 (416) T protein:vir:12 174 ------S----TDGIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPA-FLDEKPKENVRKEWKRVNKVENI 242 (416) T ss_pred ------C----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCC-CCCHHHHHHHHHHHHHHhcCCCe Confidence 0 011247777877777777766666666666677777766665321 1222222221 23456 Q ss_pred eeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 299 IVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRY 378 (511) Q Consensus 299 i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~ 378 (511) +.++++.+++.+........+.+..+...+.|+..-++|....+.....+...++... ...+..+|.-++ T Consensus 243 ~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~----------~~f~~~~l~P~~ 312 (416) T protein:vir:12 243 AIIDYGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQS----------IEYVRNTLQPWI 312 (416) T ss_pred eecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHH----------HHHHHHHHHHHH Confidence 7777766666665544555566667888888998888887665544333222221111 123334444444 Q ss_pred HHHHHHHHhcCC--CccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_018086. 379 ELVCSYLEFMNK--AKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKR 454 (511) Q Consensus 379 ~li~~~~~~~~~--~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~ 454 (511) +.|...+...-. ........+++.+..-+..|..+.++++.++ .|+++.-.++..++.-+-+.. ++.-.-.. . T Consensus 313 ~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~gg--d~~~~~~n-~ 389 (416) T protein:vir:12 313 VNFEQELNVKLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIENG--DKYISSLN-Y 389 (416) T ss_pred HHHHHHHHHhhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--ceeeeccc-c Confidence 444444332111 1111122355556666778999999998887 478988888888765321100 00000000 0 Q ss_pred HHHHHhhccccccCCCCCCccccccCCCCCCcccccc Q lcl|NC_018086. 455 ADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITT 491 (511) Q Consensus 455 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (511) . ..+....... ...++...|.+...++ T Consensus 390 ~--~~~~~~~~~~--------~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 390 V--FLDFLEEYQR--------LKAGGAMKGGDNKNEG 416 (416) T ss_pred c--cccccchhhc--------cccccccCCCCCcCCC Confidence 0 0000000000 0000000000000000 No 158 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=98.34 E-value=1.2e-06 Score=53.04 Aligned_cols=326 Identities=10% Similarity=0.020 Sum_probs=147.8 Q ss_pred hhccCceecC-chhhHHHHHHHHh--cc---ChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCC Q lcl|NC_018086. 92 LAGEPITESG-DEKTIKAMQPVFK--EN---YVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSAD 164 (511) Q Consensus 92 l~g~~~~~~~-d~~~~~~l~~~~~--~n---~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~ 164 (511) +-.-|+.+.. ++.....+.++|. -| .-......+....+.+|.||+++..+..|++ .+..++|..+.++.++. T Consensus 1 ia~lp~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~l~~~~v~~~~~~~ 80 (348) T protein:vir:93 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (348) T ss_pred CcccceEeEecCcCcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCCceEEEEeCC Confidence 2233444321 1122223444443 13 2334556777888999999999988888886 46778888877666543 Q ss_pred CCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccC Q lcl|NC_018086. 165 LDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERL 244 (511) Q Consensus 165 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~ 244 (511) .. .+ + |... ...+.. ..+.++.+++++.- ++ .+.-.|. T Consensus 81 ~~--~~-~---y~~~-~~~g~~----~~~~~~eiih~r~~----------------------~~---------~~~~~G~ 118 (348) T protein:vir:93 81 SR--EL-Y---YSIH-AATGNK----LIVHNMDMLHFKHI----------------------VA---------SNMVQGI 118 (348) T ss_pred Cc--EE-E---EEEE-cCCCeE----EEEccccEEEecCC----------------------CC---------CCceeec Confidence 21 11 1 1111 111211 12344444444210 00 0112466 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEe--ecCCCCccchhhhhh---------hhCceeeecCCCceeeeecC Q lcl|NC_018086. 245 GDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWL--QGFDLSADSDSISNM---------KNDRVIVTDEDGMVKFITKD 313 (511) Q Consensus 245 s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~--~G~~~~~~~~~~~~~---------~~~~~i~~~~~~~~~~~~~~ 313 (511) |.++.+...++..+.+.. ..+..+..+-.++ .+...+ ++....+ ..++++.++++.+++.+..+ T Consensus 119 s~~~~~~~~i~~~~~~~~---~~~~~~~~~~~~i~~~~~~l~--~e~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~ 193 (348) T protein:vir:93 119 SPIDVLKNTTDFDNAVRT---FNLTEMQKPDSFMLKYGSNVS--TEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKK 193 (348) T ss_pred cHHHHHHHHHHHHHHHHH---HHHHhcCCCceeEEecCCCCC--HHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCC Confidence 766665555554333221 1233344432222 232222 2222221 12346666666566555544 Q ss_pred CCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-Cc Q lcl|NC_018086. 314 VNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNK-AK 392 (511) Q Consensus 314 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~-~~ 392 (511) .....+.+..+.....|+..-++|..-.+..+..+...++.... .++..+|.-+++.|...+...-- .. T Consensus 194 ~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~~----------~~~~~~l~P~~~~ie~~l~~~l~~~~ 263 (348) T protein:vir:93 194 YVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNR----------FYLQHTLLPIVKQYEEEFNRKLLTKT 263 (348) T ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH----------HHHHHHHHHHHHHHHHHHHHhhCCcc Confidence 45556667777888899999999876555433333333322211 12223333333333333322110 00 Q ss_pred cc-cccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCC Q lcl|NC_018086. 393 DL-KPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQ 469 (511) Q Consensus 393 ~~-~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 469 (511) .. ....+++.+..-+..|..+.++++.++ +|+++.-.+++.++.-+-+.. +.. +-..+- . T Consensus 264 ~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~g~~p~~gg--D~~-----------~~~~n~----~ 326 (348) T protein:vir:93 264 DREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGG--DKP-----------LISGDL----Y 326 (348) T ss_pred cccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCc--CeE-----------eecccc----c Confidence 11 122345555666667888999998888 478999888888865321100 000 000000 0 Q ss_pred CCCCccccccCCCCCCcccccc Q lcl|NC_018086. 470 GASTAAANKLDKNPANTSTITT 491 (511) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~ 491 (511) +.+.....+...++|..+...+ T Consensus 327 ~~~~~~~~~~~~~gg~~n~~~~ 348 (348) T protein:vir:93 327 PIDTPLELRKSLKGGDKNVNES 348 (348) T ss_pred ccccchhhcccccCCCCCcCCC Confidence 0000000000011111111111 No 159 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=98.30 E-value=1.5e-06 Score=52.44 Aligned_cols=454 Identities=12% Similarity=0.032 Sum_probs=179.5 Q ss_pred Cccchhhcccc------cCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc-----ccCCcCcccc Q lcl|NC_018086. 3 IPNGQINAGDI------ITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAI-----QSRTFDDTNK 71 (511) Q Consensus 3 ~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~-----~~~~~~~~~~ 71 (511) |+|-+-|.|.+ .+.-..+.+++ .+.|-+.+.--+. ..++-+.|-.-.. ++.......- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 69 (535) T protein:vir:10 1 MAILKDLRNAFSLSNKKSTSYIELGDYD-----KDIVNKAIRPGRA------SARDTVDGIDIADGNVAGQYSVASISDV 69 (535) T ss_pred ChhhHHHHHHHHhhhhhhhhhHHHhhhh-----HHHHHhhhhhhhh------hhhccccccccccCCcccccccCccccc Confidence 66665554443 33333333332 2222232321111 1133333321100 1111111110 Q ss_pred cc----cee--ccchHHHHHHHHHhhh-------------hccCceec-----Cch---hhHHHHHHHHhc--cCh---- Q lcl|NC_018086. 72 PN----SKI--VHNFPKLLVDTSTAYL-------------AGEPITES-----GDE---KTIKAMQPVFKE--NYV---- 118 (511) Q Consensus 72 ~~----~ri--~~n~~k~ivd~~~~~l-------------~g~~~~~~-----~d~---~~~~~l~~~~~~--n~~---- 118 (511) .. .+. .....+.+|++.+..+ .+-|+.+- ... .....+..++.. |.+ T Consensus 70 ~~~~~l~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~ 149 (535) T protein:vir:10 70 LSTKKLLKAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWR 149 (535) T ss_pred cCHHHHHHHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChh Confidence 00 011 1223344444433222 12233321 111 112234455542 321 Q ss_pred ---hHHHHHHHHHHhhCC-eEEEEeeeCCCCceE-EEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEE Q lcl|NC_018086. 119 ---TDVNSEEVKLSGIFG-HCFEIHWIDRNKKHR-FKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVY 193 (511) Q Consensus 119 ---~~~~~~~~~~a~~~G-~~~~~v~~~~~g~~~-i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 193 (511) ..+...+..+.+.+| .+|+.+..+..|++. +..++|..+.+.++......... ++.... +... ..+ T Consensus 150 ~~~~~~~~~lv~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p~~V~v~~d~~~~~~~~~---~~~~~~---~~~~---~~~ 220 (535) T protein:vir:10 150 DTFPRLLTKIINDMYVQDQINIERIFKNDSNELDHFNAVDASKVVISYSPRSKDQPRK---FEQFVS---ETKS---VKF 220 (535) T ss_pred HHHHHHHHHHHHHHHhhCCceEEEEEECCCCcEEEEEEeCCceeEEEEcCccccCceE---EEEEec---Ccee---EEE Confidence 234556666777776 579888888888874 78899999888776543221111 111111 1111 123 Q ss_pred cCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_018086. 194 TEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWND 273 (511) Q Consensus 194 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~ 273 (511) .++.+++++... .-.......|.|.++.+...+.....+..-..+.+...+. T Consensus 221 ~~~eiih~~~~~----------------------------~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~ 272 (535) T protein:vir:10 221 SERNLTFINYWN----------------------------LSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGT 272 (535) T ss_pred CcccEEEEeccC----------------------------CCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 445555443110 0000011247777777777666666665555556666666 Q ss_pred ceeEee--cC-CCCccchhhhhhhh------------CceeeecCCCceeeeecC--CCHHHHHHHHHHHHHHHHHHhCc Q lcl|NC_018086. 274 AYLWLQ--GF-DLSADSDSISNMKN------------DRVIVTDEDGMVKFITKD--VNDKHIENIKNRAKLDIFSLSQT 336 (511) Q Consensus 274 p~l~~~--G~-~~~~~~~~~~~~~~------------~~~i~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~ 336 (511) |-.++. +. +....++....++. +++..+ .+.+++|.... .....+....+...+.|+..-++ T Consensus 273 p~giL~~~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl-~~~g~~~~~l~~~~~D~qfle~~~~~~~eIa~afgV 351 (535) T protein:vir:10 273 TRGILVIDQDGDAQANQMMLAGIRRQWTSQGSGLGGAWKIPIL-AAKDAKFVNMTQNSRDMEFDKFLNFMIYDTAAIFQM 351 (535) T ss_pred ccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCcccccccccc-cCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCC Confidence 655443 32 11112222222211 122222 23345554444 34455666677778888888888 Q ss_pred ccccccccc-----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCH Q lcl|NC_018086. 337 PDLVSKDFT-----AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSY 411 (511) Q Consensus 337 p~~~~~~~~-----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~ 411 (511) |..-.+... |.++......-..+. .....++..+|.-+++.|...+...-. ...+ ..+.+.|......|. T Consensus 352 Pp~~lG~~~~at~sn~~~~~~~~~~s~~E---~~~~~~~~~~L~P~l~~ie~~ln~~Ll-~~~~-~~~~f~f~~l~~~d~ 426 (535) T protein:vir:10 352 QPEEINFPNNGGSTGKSGTKSVNEGSTAK---AKLESSKDKGLTPLLSFIEQVINDKIM-RYVD-TDYRFSFTLGDAQDK 426 (535) T ss_pred CHHHhccccCcccccchhhhhhhhhhhHH---HHHHHHHHHHHHHHHHHHHHHHhhhcc-cccC-CeEEEEeccccccCH Confidence 865544322 221111111101111 111123333444444444444332111 1122 247788888788887 Q ss_pred HHHHHHHHHH-hccCChHHHHHhCCCCCCHHHHH--HHHHHHHHHHH-HHHHhhccccccCCCCC-CccccccCCCCCCc Q lcl|NC_018086. 412 AELADMAVKL-RDMLPDETIINQFPWITDARQEV--EKADAQRQKRA-DIALQNFKQTSAVQGAS-TAAANKLDKNPANT 486 (511) Q Consensus 412 ~e~a~~~~~~-~g~~s~et~~~~l~~v~d~~~E~--~ri~~E~~~~~-~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 486 (511) .+.+++.... .|+++.-.++.+++.-+-+.... -.+..+.-... .......+...+..+.. +....+...+...+ T Consensus 427 ~~r~~~~~~~~~g~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~ 506 (535) T protein:vir:10 427 LQEEQVWKLKLANGYFINEYRKDHGLKTVDGLDVPGFIGSAENFINATGFGQPNVPDSSDDSGSTLGERERQERIQHSKD 506 (535) T ss_pred HHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCccccccccchhhcccccccccccCCCCCCCccccCCccccCcccccccc Confidence 7766655432 46788888888876532111110 00100000000 00000000000000000 00000000000000 Q ss_pred cccccCCCC-----ccccccccCCCCCCC Q lcl|NC_018086. 487 STITTTDPV-----AAKEQEKAIQKKPKT 510 (511) Q Consensus 487 ~~~~~~~~~-----~~~~~~~~~~~~~~~ 510 (511) ......+|. .+++++...++.+-| T Consensus 507 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:10 507 YEKGKDDPKSPLPKPSESDDVSNNEDADT 535 (535) T ss_pred cccCCCCCCCCCCcCCCCCccccccccCC Confidence 011111222 122222223333444 No 160 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=98.29 E-value=1.7e-06 Score=52.26 Aligned_cols=369 Identities=9% Similarity=0.040 Sum_probs=161.3 Q ss_pred CccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHH Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPK 82 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k 82 (511) |+|+.-++....... .+...... .+.-.+.+.- ........ ..-+..+-.. T Consensus 1 M~~f~~~~~~~~~~~----------~~~~~~~~-------------~~~~~~~~~~---~~~~~v~~---~~al~~~~v~ 51 (386) T protein:vir:49 1 MPIFNITNLATESPP----------INQESFFD-------------IADSDFLASL---NSSEWVSA---ENALKNSDLF 51 (386) T ss_pred CchhhhhccCCCCcc----------cchhhhhh-------------hhhccccccc---cCCceech---hhhhccHHHH Confidence 555533322211110 00000000 0000000000 00000000 0011122233 Q ss_pred HHHHHHHhhhhccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEe Q lcl|NC_018086. 83 LLVDTSTAYLAGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAY 161 (511) Q Consensus 83 ~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~ 161 (511) ..|+..++-+-+-|+.+.... ....+.+-............+..+.+.+|.||+.+-.+..|++ .+..++|..+.++. T Consensus 52 ~~i~~ia~~ia~~p~~~~~~~-~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~ 130 (386) T protein:vir:49 52 SIISQLSNDLATAKITTSRKQ-LQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNR 130 (386) T ss_pred HHHHHHHHHhhhCceeeccch-hhhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEE Confidence 455556665555677653221 1111111111123345667788889999999999888888876 56778888887665 Q ss_pred cCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCc Q lcl|NC_018086. 162 SADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANE 241 (511) Q Consensus 162 d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~ 241 (511) ++... .+ . +.+.. .+..+.. ...+.++.++|+..-. +...- T Consensus 131 ~~~~~-~~--~-y~~~~-~~~~~~~---~~~~~~~evih~~~~~-------------------------------~~~~~ 171 (386) T protein:vir:49 131 LDNQN-GL--Y-YNITF-DDPHIAP---KQHVPQNDILHFRLLS-------------------------------VDGGL 171 (386) T ss_pred cCCCc-eE--E-EEEEE-cCccccc---eeEEccccEEEecCCC-------------------------------CCCcc Confidence 54321 11 1 11111 1100000 0123344444442100 00112 Q ss_pred ccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhh---------hhCceeeecCCCceeeeec Q lcl|NC_018086. 242 ERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNM---------KNDRVIVTDEDGMVKFITK 312 (511) Q Consensus 242 ~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~---------~~~~~i~~~~~~~~~~~~~ 312 (511) .|.|.+..+...++.......-..+.+...+.|-.+++-..... ++....+ ..++++.++++.+++.+.. T Consensus 172 ~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~-~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~ 250 (386) T protein:vir:49 172 TSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGL-LDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEI 250 (386) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCC-hHHHHHHHHHHHHhccCCCCceecCCCceEEEccC Confidence 47788877777777666555555556666677776654322111 1111111 1235677776666666655 Q ss_pred CCCHHHHHHHHHHHHHHHHHHhCcccccccccc--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_018086. 313 DVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT--AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNK 390 (511) Q Consensus 313 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~--~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~ 390 (511) +.....+.+..+...+.|+..-++|..-.+..+ ..++..++.. +...+..+++.+...+...-. T Consensus 251 ~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~--------------~~~~i~~~l~~i~~~~~~~l~ 316 (386) T protein:vir:49 251 KSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQSSLEMIYNI--------------YFKSVSRYLRPFVSEMSKKLS 316 (386) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccchHHHHHHH--------------HHHHHHHHHHHHHHHHHHHhc Confidence 555566677788889999999999876554322 2233333222 222333333332222221100 Q ss_pred CccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCC---CCCCHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_018086. 391 AKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFP---WITDARQEVEKADAQRQKRADIALQNFKQT 465 (511) Q Consensus 391 ~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~---~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~ 465 (511) ..+.+.....+-.|..+.+..+.++ +|+++.-.+++++. +..+. +.+. ..... T Consensus 317 ------~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~~---~~~~------------~~~~~- 374 (386) T protein:vir:49 317 ------CEVDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILPKE---LPDG------------KNPNR- 374 (386) T ss_pred ------chhcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCCc---Ccch------------hccCC- Confidence 1123333344445666677777666 47787766766542 21110 0000 00000 Q ss_pred ccCCCCCCcccc Q lcl|NC_018086. 466 SAVQGASTAAAN 477 (511) Q Consensus 466 ~~~~~~~~~~~~ 477 (511) ....|+++++.+ T Consensus 375 ~~~~gGd~~~~~ 386 (386) T protein:vir:49 375 TSLKGGEINEQD 386 (386) T ss_pred CCCCCCCCCCCC Confidence 000111111100 No 161 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=98.24 E-value=2.2e-06 Score=51.63 Aligned_cols=452 Identities=10% Similarity=0.088 Sum_probs=200.9 Q ss_pred CCC--ccchhhcccccCchhhHhh---hh--ccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCcccccc Q lcl|NC_018086. 1 MAI--PNGQINAGDIITTNIRRKH---FI--RRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPN 73 (511) Q Consensus 1 ~~~--~~~~~~~~~~~~~~~~~~~---~~--~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~ 73 (511) |+- +.+-+.+.-+.++.....+ +. -...++.+=...--.+.-...-++-+ -||.|..=+-+...+.-. T Consensus 46 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~~~~~F~Gy~~la~la---- 120 (695) T protein:vir:78 46 MGRRGALNALDAAPVAEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDAL-SFVTSSGFPGFPTLVLLA---- 120 (695) T ss_pred hcccccccccccccccCCCcccccceeceeccccCCccccchhhhhhcccccccccc-hhhhccCcchHHHHHHHh---- Confidence 321 1222222222222221111 00 01111111000000000000001111 122221111111100000 Q ss_pred ceeccchHHHHHHHHHhhhhccCceec---------------------CchhhHHHHHHHHhccChhHHHHHHHHHHhhC Q lcl|NC_018086. 74 SKIVHNFPKLLVDTSTAYLAGEPITES---------------------GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIF 132 (511) Q Consensus 74 ~ri~~n~~k~ivd~~~~~l~g~~~~~~---------------------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~ 132 (511) -++-.+.++...+..+.-+-+..+ .+-+..+.|..-+++=++...+.++.+++-.| T Consensus 121 ---Q~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l~eaik~aRlf 197 (695) T protein:vir:78 121 ---QLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAF 197 (695) T ss_pred ---hccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 123344445555554432211111 11144566777788888889999999999999 Q ss_pred CeEEEEeeeCCCCc----eE--------------EEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEc Q lcl|NC_018086. 133 GHCFEIHWIDRNKK----HR--------------FKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYT 194 (511) Q Consensus 133 G~~~~~v~~~~~g~----~~--------------i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~ 194 (511) |.+..++-.+.++. |. +.+++|..+.|-.-+.. .+.. -.+|- T Consensus 198 GGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~--dP~s------------------pdfgk 257 (695) T protein:vir:78 198 GRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSI--NPVA------------------DDFYK 257 (695) T ss_pred cceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhhc--cchh------------------hccCC Confidence 99986665544331 11 44555555544210000 0000 01111 Q ss_pred CCcEEEEEEccCcccccccccccccccccceeccCCccceEe-ec--CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 195 EDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLE-II--ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYW 271 (511) Q Consensus 195 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~-~~--n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~ 271 (511) |.+++-. +. +......+.|...|+-. ++ .+-.|.|....+.+-+++.+++.-.....+..+ T Consensus 258 P~~y~V~----G~------------kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~ 321 (695) T protein:vir:78 258 PSTWWMI----GT------------EVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQF 321 (695) T ss_pred CceEEEe----ce------------EEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhh Confidence 1111100 00 00000000010011110 11 122478888888888888888877776666544 Q ss_pred cCceeEee---cCCCCccchhh------hhhhh-CceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccc Q lcl|NC_018086. 272 NDAYLWLQ---GFDLSADSDSI------SNMKN-DRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVS 341 (511) Q Consensus 272 ~~p~l~~~---G~~~~~~~~~~------~~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~ 341 (511) +...+..- ........... ...+. .+++.++++ +=+|.+.+.+...+...+......|...+++|-.-. T Consensus 322 ~v~~lk~dla~~L~~g~~~~l~~R~eli~~~Rsn~G~~llDk~-~Eefeq~stslSGLddVi~qf~q~VAgaa~IPltkL 400 (695) T protein:vir:78 322 SVSGILMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDKA-TEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPLIKL 400 (695) T ss_pred hhHHHHHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEecC-CcceEEEecccCCHHHHHHHHHHHHHhhhcCchhhh Confidence 44432110 00000111111 11222 345556532 235667788899999999999999999999996554 Q ss_pred cccc----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHH Q lcl|NC_018086. 342 KDFT----AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADM 417 (511) Q Consensus 342 ~~~~----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~ 417 (511) ...+ |+||++=..-|...+.-.+ ++.+...+++++.+|.. +..+.. +. ++.+.|+|-...+++|.|+. T Consensus 401 fGqSPkGlNATGE~D~rnYYD~I~s~Q--e~~L~p~L~rl~~ii~r--S~~G~i---dp-di~~~fnPL~qmtd~EkAeI 472 (695) T protein:vir:78 401 LGITPTGLNASSEGEIRVWYDYVRAYQ--RNALQQLMNDVIVMIQL--SLFGAV---DP-SIKWQWNALRELDDLEVAES 472 (695) T ss_pred hccCCccccccchhhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHH--HhcCCC---CC-cceEEeCCCCCcCHHHHHHH Confidence 3332 6889875555555544333 67889999998887743 333322 22 57889999999999998876 Q ss_pred HHHH---------hccCChHHHHHhCCCCCCHHHHH-HHHHHHHHHHHHHHHhhccccccCCCCCCcc-ccccCCCCCCc Q lcl|NC_018086. 418 AVKL---------RDMLPDETIINQFPWITDARQEV-EKADAQRQKRADIALQNFKQTSAVQGASTAA-ANKLDKNPANT 486 (511) Q Consensus 418 ~~~~---------~g~~s~et~~~~l~~v~d~~~E~-~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 486 (511) -.+- .|+++...+..+|.- |++.-- ..+..+.. .....-.+-.+..+..+.. ..+..+.++.+ T Consensus 473 ~~k~A~~d~~~~~~gvI~~~evr~rL~~--d~~s~Y~~~~D~~d~----p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 546 (695) T protein:vir:78 473 RYKQAQSDVLYVQEQVIRPDQVAARLNT--EPDGPYAGKLDANDD----PGVPADDDIDGVLTYVQRLAEGGDTGAPGGA 546 (695) T ss_pred HhhhhHHHHHHHHhcCCCHHHHHHHHhc--CCCcccccccccccC----CCcCccchhhhhHhhhcCcccccccCCCCCC Confidence 4332 366666656555421 100000 00000000 0000000000000000001 11112223345 Q ss_pred cccccCCCCccccccccCC-CCCCCC Q lcl|NC_018086. 487 STITTTDPVAAKEQEKAIQ-KKPKTD 511 (511) Q Consensus 487 ~~~~~~~~~~~~~~~~~~~-~~~~~~ 511 (511) .++++..|+..+.-.+-.+ +.++++ T Consensus 547 ~~g~~~~~~~~~~~~~~~~~~ag~~~ 572 (695) T protein:vir:78 547 RAGATAPPTVANVNANVKPREAGAQD 572 (695) T ss_pred CCCCCCCCceeeeeccccccccCCCC Confidence 5666666777777777655 555666 No 162 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=98.22 E-value=2.4e-06 Score=51.43 Aligned_cols=426 Identities=10% Similarity=0.014 Sum_probs=186.3 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHH----HHHHHHHHHHhcCCCcccccCCcCcccccccee Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRS----SSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKI 76 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~----~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri 76 (511) |..++++.. .+..+.|.+..+.++.+ ..+++.+.+|..-. .... ........++ T Consensus 1 ~~~~~~~~~-----------------~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~---~~~~--~~~~~~~~~~ 58 (516) T protein:vir:96 1 MKQSIDLEY-----------------GGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPY---LMND--KGDNETSQNG 58 (516) T ss_pred Ccchhhhhh-----------------hhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhccc---ccCC--CCCccccCCc Confidence 655555552 22233344444444433 34555555555542 1111 1112223356 Q ss_pred ccchHHHHHHHHHhhhhcc------Cc-eecCchh--------------hH-------HHHHHHHhccChhHHHHHHHHH Q lcl|NC_018086. 77 VHNFPKLLVDTSTAYLAGE------PI-TESGDEK--------------TI-------KAMQPVFKENYVTDVNSEEVKL 128 (511) Q Consensus 77 ~~n~~k~ivd~~~~~l~g~------~~-~~~~d~~--------------~~-------~~l~~~~~~n~~~~~~~~~~~~ 128 (511) ..+-+...++.+++-|++- || ++..+++ .. ..+...+..++|.....++.++ T Consensus 59 ~dstg~~a~~~LAa~l~~~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~ 138 (516) T protein:vir:96 59 WQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKH 138 (516) T ss_pred ccchHHHHHHHHHHHHHhhhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHH Confidence 6677777788877777652 22 2221111 11 1244567778999999999999 Q ss_pred HhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEee----------------cCCcceEEEEEE Q lcl|NC_018086. 129 SGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVIS----------------DITGHQIRTYEV 192 (511) Q Consensus 129 a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~----------------~~~~~~~~~~~~ 192 (511) ..++|.+.+++ ++++.++ .++ ..-+++--+... .+...++...... ....+....+++ T Consensus 139 L~~~G~a~l~~--d~~~~~~--~~p-l~~y~v~~d~~G-~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v 212 (516) T protein:vir:96 139 LIVAGSCMLYK--PSKGAIS--AIP-MHHYVVNRDTNG-DLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKL 212 (516) T ss_pred HHhHCeEeEEe--cCCCCEE--EEE-cCeEEEeeCCCC-CeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEE Confidence 99999987554 5655543 443 333444444333 2333333221000 000111122333 Q ss_pred Ec-----CCcEEEEEEccCcccccccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHH Q lcl|NC_018086. 193 YT-----EDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVS 262 (511) Q Consensus 193 ~~-----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s 262 (511) |+ ++..+.+....++. ........+|..+|++.++ ++.+|+|-..+..+-+..+|.+.- T Consensus 213 ~~~v~~~~~~~~~~~~~~d~~-----------~~~~es~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~ 281 (516) T protein:vir:96 213 YTHAKYLGDGFWELKQSADDI-----------PVGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSE 281 (516) T ss_pred EEeeeeeCCceeEEEEEeCce-----------eeccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHH Confidence 33 22211111111111 0111122234456766543 357899999999999999998888 Q ss_pred HHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecCCCceeeeec--CCCHHHHHHHHHHHHHHHHHHhCccccc Q lcl|NC_018086. 263 DSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDEDGMVKFITK--DVNDKHIENIKNRAKLDIFSLSQTPDLV 340 (511) Q Consensus 263 ~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~~ 340 (511) .+.........|.+.+.-... ....... ....+.+.....++++.+.. ..+.......++.++..|...-....+. T Consensus 282 ~~l~~~~~a~~~~~lv~p~g~-~~~~~l~-~~~~g~i~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~ 359 (516) T protein:vir:96 282 AVARGAALMADIKYLIRPGAQ-TDVDHFV-NSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMT 359 (516) T ss_pred HHHHHHHHhcCCccccCcccc-cchhhhc-cCCCceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhc Confidence 888888888888765431111 0001011 11123343333344555443 2355666777777777766543322121 Q ss_pred cccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-H----HHHHHHhcCCCccccccceeEEeCCCCCcCHHHHH Q lcl|NC_018086. 341 SKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYE-L----VCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELA 415 (511) Q Consensus 341 ~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~-l----i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a 415 (511) .-.....|++.+.. +..++...++..+.++-. + |...+..... ......+.+.+... .+.+..+ T Consensus 360 ~r~~~rvTAtEV~~-------r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~p--~lp~~~v~~~~vs~--l~~l~r~ 428 (516) T protein:vir:96 360 RRDAERVTAVEIQR-------DALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGE--SFTSDLVDPVIITG--IEALGRM 428 (516) T ss_pred cCCCccccHHHHHH-------HHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcCC--CCccccccceeech--HHHHHHH Confidence 11223357776554 455555666665555321 1 1111222221 11122233333221 1222221 Q ss_pred H---HH-------HHHhccCC-------hHHHHH----hCCCCC----CHHHHHHHHHHHHHHHHHHHHhhccccccCCC Q lcl|NC_018086. 416 D---MA-------VKLRDMLP-------DETIIN----QFPWIT----DARQEVEKADAQRQKRADIALQNFKQTSAVQG 470 (511) Q Consensus 416 ~---~~-------~~~~g~~s-------~et~~~----~l~~v~----d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 470 (511) + .+ ..+.++-| ...++. .+| ++ -.++|++.+++++++..........-..+. T Consensus 429 ~~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~G-vp~~~irs~eev~~~~~~~~~~q~~~~~a~~~~~~~-- 505 (516) T protein:vir:96 429 AELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQIS-AELPFLKSAEEMAQEQEAQMQAQQAQMLEEGVAKAV-- 505 (516) T ss_pred HHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHhC-CCccccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhh-- Confidence 1 11 11122112 122222 222 21 234566655554443332211110000111 Q ss_pred CCCccccccCCC Q lcl|NC_018086. 471 ASTAAANKLDKN 482 (511) Q Consensus 471 ~~~~~~~~~~~~ 482 (511) ....+++..+. T Consensus 506 -~~~~~~~~~~~ 516 (516) T protein:vir:96 506 -PGVIQQELKEA 516 (516) T ss_pred -hHHhhcccccC Confidence 11111111111 No 163 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=98.10 E-value=4.5e-06 Score=49.87 Aligned_cols=410 Identities=10% Similarity=0.003 Sum_probs=189.4 Q ss_pred CCCccchhhcccccCchhhH--------------hhhhccCCCHHHHHHHHHHH-HHHHHHHHHHHHHhcCCCcccccCC Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRR--------------KHFIRRNFDLRELITLAEMH-SRSSSAYGVLYDYYKGNHIAIQSRT 65 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~~~~yY~G~~~~~~~~~ 65 (511) |+.=+++. |.-+....-. .......+++..+...+..- ...+.++..+++.++- T Consensus 1 ~~~~~d~~--g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e--------- 69 (528) T protein:vir:10 1 MAAIVDIY--GNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEE--------- 69 (528) T ss_pred CCeeECCC--CCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHh--------- Confidence 66555543 3322211100 01111233333333322211 1112222222221110 Q ss_pred cCccccccceeccchHHHHHHHHHhhhhccCceecC---ch----hhHHHHHHHHhc-cChhHHHHHHHHHHhhCCeEE- Q lcl|NC_018086. 66 FDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESG---DE----KTIKAMQPVFKE-NYVTDVNSEEVKLSGIFGHCF- 136 (511) Q Consensus 66 ~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~---d~----~~~~~l~~~~~~-n~~~~~~~~~~~~a~~~G~~~- 136 (511) ...+..-.+.+....+.+.++++.. ++ +..+.+.+.+.+ .+|...+.. ..++.-||.++ T Consensus 70 -----------~D~~i~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~i~~-~lda~~~G~s~~ 137 (528) T protein:vir:10 70 -----------RDAHLFAEMSKRKRAVLGLDWTIEPPRNASAAEKADAEYLHELLLDLEGIEDLMLD-CMDGVGHGYSAI 137 (528) T ss_pred -----------hChHHHHHHHHHHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhCCccHHHHHHH-HHhhhhhcceeE Confidence 1345666777777778888887742 21 233456666655 347766554 56788899765 Q ss_pred EEeeeCCCCceEEEE---EcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccc Q lcl|NC_018086. 137 EIHWIDRNKKHRFKA---VSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREI 213 (511) Q Consensus 137 ~~v~~~~~g~~~i~~---~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 213 (511) +++|...+|...+.. .+|.. |. |++.... .. +. ......| ..+ T Consensus 138 Ei~w~~~~g~~~~~~~~~r~~~~-f~-~~~~~~~--~l--~~--~~~~~~g-----~~l--------------------- 183 (528) T protein:vir:10 138 ELDWSLQGREWLPQAFDHRPQSW-FQ-LNPDDQD--EL--RL--RDNSIAG-----EVL--------------------- 183 (528) T ss_pred EEEEeecCCceeEEEeeeecccc-ee-eccCCCc--EE--ec--cCCCCCc-----eee--------------------- Confidence 566654455443332 23221 11 1111110 00 00 0000000 001 Q ss_pred ccccccccccceeccCCccceEee--cCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccc---- Q lcl|NC_018086. 214 PEELEIKDYEVHPNLLQKFPVLEI--IANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADS---- 287 (511) Q Consensus 214 ~~~~~~~~~~~~~~~~g~iPvv~~--~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~---- 287 (511) .+++.+=.++- ..++.|.|.+..+....--=+..+.+++..++.|+.|+++.+=.....++ T Consensus 184 -------------~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~ 250 (528) T protein:vir:10 184 -------------QPFGWIMHKPRSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYPPGTPDEEKVT 250 (528) T ss_pred -------------cCCCeEEEeecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCHHHHHH Confidence 11222212211 12456888888776666666778888999999999999987732221211 Q ss_pred --hhhhhhhhCceeeecCCCceeeeecC-CCHHHHHHHHHHHHHHHHHHhCcccccccc-ccC-ccHHHHHHHHHHHHHH Q lcl|NC_018086. 288 --DSISNMKNDRVIVTDEDGMVKFITKD-VNDKHIENIKNRAKLDIFSLSQTPDLVSKD-FTA-ASGQALKAATQPLENK 362 (511) Q Consensus 288 --~~~~~~~~~~~i~~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~-~Sg~Ai~~~~~~l~~k 362 (511) ....++....+..+|.+.++++++.. .+...++.+++.+.+.|...--.-.++... .+. .|...-+....-.... T Consensus 251 L~~al~~i~~~~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~Alg~vh~~v~~di 330 (528) T protein:vir:10 251 LLRAVTGLGHAAAGIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAILGGTLTSQTSESGGGAYALGQVHNEVRHDL 330 (528) T ss_pred HHHHHHHHhhCcEEEecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHHhhhhhhccccccccchhhhHHHHHHHHHHH Confidence 11234455667888999999999854 455668888888888877765433333322 111 2222112222223333 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHh--cc-CChHHHHHhCCCCC Q lcl|NC_018086. 363 SAVKESKFRKVLA-KRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR--DM-LPDETIINQFPWIT 438 (511) Q Consensus 363 ~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~--g~-~s~et~~~~l~~v~ 438 (511) ++.-.+.+...+. ++++.++.+ ..........-..+.|....+.|..+.++++.+++ |+ +|.+.+.+.++. + T Consensus 331 ~~aDa~~i~~tln~~li~~l~~~---N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gi-p 406 (528) T protein:vir:10 331 LAADARQLAATLSRDLLWPLLVL---NRSGNLDARRAPRLVFDLKDRADLAAMATSLPPLVKLGVQVPVNWVQEQLGI-P 406 (528) T ss_pred HHHHHHHHHHHHHHHHHHHHHHh---CCCCCCCccccceEEecCCCcccHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-C Confidence 4444455555553 344444432 22211111223567898899999999999998885 55 888888888874 2 Q ss_pred CHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 439 DARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 439 d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) .++.. +.+ +. ... ..........+ .......++......++....| T Consensus 407 ~p~~~-e~~-----------~~---~~~------~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~d 452 (528) T protein:vir:10 407 LPANG-EAV-----------LG---DQA------GAGIAQLSRRP------GPRIAALAQVIGPRYRDQEALD 452 (528) T ss_pred CCCCC-ccc-----------cc---CCC------cccccccCccc------ccccccccccccccccccchHH Confidence 22110 000 00 000 00000000000 0000011111111112222222 No 164 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=98.09 E-value=4.9e-06 Score=49.71 Aligned_cols=424 Identities=9% Similarity=-0.001 Sum_probs=156.0 Q ss_pred ccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhh Q lcl|NC_018086. 13 IITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYL 92 (511) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l 92 (511) ..+ ..+.+..+- ++...... ......+.+...-...+.......+..--...+....|+..+..+ T Consensus 1 ~~~----------~~~~~~~~~----~~~~~~~~-~~~~~~~~~~~~~~~~pp~~~~~La~~~~~n~~v~scI~~ia~~i 65 (540) T protein:vir:41 1 MFN----------YHLSIKSLE----KYRAIKGD-TDSQALKEDRFEEYVEPKVHPLVLLSLLQVNPYHASACSIKANDI 65 (540) T ss_pred CCC----------cccChhhcc----chhhhhcc-ccccccccCCCCccccCCCCHHHHHHHHHhcHHHHHHHHHHHHHH Confidence 111 111111111 11100000 000111111110000000000111111112456677888888888 Q ss_pred hccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEE Q lcl|NC_018086. 93 AGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVA 171 (511) Q Consensus 93 ~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~ 171 (511) .+.|+.+..++.....+.. -...........+..+.+.+|.||+.+..+..|++ .+..++|..+-+..+... T Consensus 66 a~~~~~i~~~~~~~~~~lp-N~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~~~~~------ 138 (540) T protein:vir:41 66 LRTGYLIDGDDGGVEELLR-ACRPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHRDGSR------ 138 (540) T ss_pred hcCCceEecCccchhhhcc-CCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeEcCce------ Confidence 8888887554433222210 01112345667788889999999999888888875 477788887765543221 Q ss_pred EEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCC-----cccCch Q lcl|NC_018086. 172 AIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIAN-----EERLGD 246 (511) Q Consensus 172 ~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~-----~~g~s~ 246 (511) ++.. .++....++..|.....+.. ..+ .....+..=-|+++++. ..|.|. T Consensus 139 ---~~~~---~d~~~~~~~~~~~~~~~~~~--~~g-----------------~~~~~~~~~eViHir~~~~~~~~~G~Sp 193 (540) T protein:vir:41 139 ---YMQT---WDGIHVTYFKDYRYEGEVNP--DNG-----------------EDQDGVGANEIIFIHLPSPICSYYGVPR 193 (540) T ss_pred ---eEee---ecCceeeeeecccccceeec--ccc-----------------ccceeecccceEEecCCCCCCCcccccH Confidence 1111 11222222222211111000 000 00001111125555432 257777 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCceeEee--cCCCCccc-----------hhhhhh---------hhCceeeecC- Q lcl|NC_018086. 247 FEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQ--GFDLSADS-----------DSISNM---------KNDRVIVTDE- 303 (511) Q Consensus 247 ~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~--G~~~~~~~-----------~~~~~~---------~~~~~i~~~~- 303 (511) +......+.....+..-..+.++..+.|-.++. |.-.+... .....+ ..++.+.+.. T Consensus 194 i~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~ 273 (540) T protein:vir:41 194 YLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIP 273 (540) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecC Confidence 776555555444444444444555556655543 42111100 011111 0123344331 Q ss_pred ---CCceeeeecC--CCHHHHHHHHHHHHHHHHHHhCccccccccc----cC-ccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 304 ---DGMVKFITKD--VNDKHIENIKNRAKLDIFSLSQTPDLVSKDF----TA-ASGQALKAATQPLENKSAVKESKFRKV 373 (511) Q Consensus 304 ---~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~----~~-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 373 (511) +++++|.... .....+.+..+...+.|+..-++|..-.+.. .+ ++.+.....+. ..+ T Consensus 274 ~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~-------------~~t 340 (540) T protein:vir:41 274 GGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTYY-------------ESV 340 (540) T ss_pred CCcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHHH-------------HHH Confidence 3445554433 3445566777788888999889886544321 11 12222211111 111 Q ss_pred HHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHH Q lcl|NC_018086. 374 LAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQR 451 (511) Q Consensus 374 l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~ 451 (511) |.-+++.|...++..-.. ... ..+.+.|+..-.... +.+..+.++ .|+++.-.+++.|+.++--.++. +. T Consensus 341 L~P~~~~ie~~ln~~L~~-~~~-~~~~i~f~~~~ll~~-D~~~~~~~lv~~G~lT~NE~Re~L~g~e~gdd~~--l~--- 412 (540) T protein:vir:41 341 VRPQQEIVSSVLTDFIQL-KLD-PGARFVFNEEILMES-EFVHNYALLVQCGVLTPSEVREKLFGLDGGPDMF--MV--- 412 (540) T ss_pred HHHHHHHHHHHHHHhhhh-ccC-CceEEEecchhhcch-HHHHHHHHHHhCCCCCHHHHHHHhCcCcCCCccc--cc--- Confidence 222222222111110000 111 134566654333221 233333333 57888877877553332111110 00 Q ss_pred HHHHHHHHhhccccccCCCCCCccccccCC-----CCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 452 QKRADIALQNFKQTSAVQGASTAAANKLDK-----NPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 452 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) . .++.......+....+.+++.+ ....+......+...+..+.++..+..--| T Consensus 413 ----p---~n~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (540) T protein:vir:41 413 ----P---SSIGKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLEDKKKKIDEVLSD 470 (540) T ss_pred ----c---cccccccccccccccCCCCccccccccchhcccccCccccccccccccccccccccc Confidence 0 0000000000000000000000 000000000000011111111111111111 No 165 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=98.07 E-value=5.3e-06 Score=49.50 Aligned_cols=446 Identities=10% Similarity=0.095 Sum_probs=197.6 Q ss_pred CCCccc----------hhhcccccCchhhHhh---h--hccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCC Q lcl|NC_018086. 1 MAIPNG----------QINAGDIITTNIRRKH---F--IRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRT 65 (511) Q Consensus 1 ~~~~~~----------~~~~~~~~~~~~~~~~---~--~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~ 65 (511) -|++.. -+...-+.++.....+ + +-...++.+....--.+.-...-++-+ -||.|..=+-+... T Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~~~~~F~Gy~~l 116 (695) T protein:vir:36 38 AAQPVPADFARRGALNALDAAPVVEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDAL-SFVTSSGFPGFPTL 116 (695) T ss_pred cccccchhhhhcccccccccccccCCCcccccceeceecccccCccccchhhhhhcccccccccc-hhhhccCcchHHHH Confidence 222222 1112222222211110 0 001111111100000000000011111 12322211111111 Q ss_pred cCccccccceeccchHHHHHHHHHhhhhccCceec---------------------CchhhHHHHHHHHhccChhHHHHH Q lcl|NC_018086. 66 FDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITES---------------------GDEKTIKAMQPVFKENYVTDVNSE 124 (511) Q Consensus 66 ~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~---------------------~d~~~~~~l~~~~~~n~~~~~~~~ 124 (511) +.-. -++-.+.++...+..+.-+-+..+ .+.+..+.|...+++=++...+.+ T Consensus 117 a~la-------Q~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V~~~l~e 189 (695) T protein:vir:36 117 VLLA-------QLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRT 189 (695) T ss_pred HHHh-------hccchhhHHHHHHHHhhcccceecccchhhhhhccccccccccccCchHHHHHHHHHHHHHHHHHHHHH Confidence 0000 123334445555444432211111 112455678888888888899999 Q ss_pred HHHHHhhCCeEEEEeeeCCCCc----eE--------------EEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcce Q lcl|NC_018086. 125 EVKLSGIFGHCFEIHWIDRNKK----HR--------------FKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQ 186 (511) Q Consensus 125 ~~~~a~~~G~~~~~v~~~~~g~----~~--------------i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~ 186 (511) +.+++-.||.+..++-.+.++. |. +.+++|..+.|-.-+.. .+.. T Consensus 190 aik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~--dP~s--------------- 252 (695) T protein:vir:36 190 TVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSI--NPVA--------------- 252 (695) T ss_pred HHHhhccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhhc--cchh--------------- Confidence 9999999999986665544331 11 44555555544210000 0000 Q ss_pred EEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEe-ec--CCcccCchhHHHHHHHHHHHHHHHH Q lcl|NC_018086. 187 IRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLE-II--ANEERLGDFEAQLSLIDAYNLAVSD 263 (511) Q Consensus 187 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~-~~--n~~~g~s~~~~v~~l~d~~~~~~s~ 263 (511) -.+|-|.+++-. +. +......+.|...|+-. ++ .+-.|.|....+.+-+++.+++.-. T Consensus 253 ---pdfgkP~~y~V~----G~------------kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~ 313 (695) T protein:vir:36 253 ---DDFYKPSTWWMI----GT------------EVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQS 313 (695) T ss_pred ---hccCCCceEEEe----ce------------EEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhH Confidence 011111111100 00 00000000010011110 11 1224788888888888888887776 Q ss_pred HHHHHHHhcCceeEee---cCCCCccchhh------hhhhh-CceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 264 SVNDIAYWNDAYLWLQ---GFDLSADSDSI------SNMKN-DRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSL 333 (511) Q Consensus 264 ~~~~~~~~~~p~l~~~---G~~~~~~~~~~------~~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~ 333 (511) ....+..+....+.+- ........... ...+. .+++.++++ +=+|.+.+.+...+...+......|... T Consensus 314 v~~Li~~~~v~~lk~dla~aL~~g~~~~l~~R~eli~~~Rsn~G~~llDk~-~Eefeq~stslSGLddVi~qf~q~VAga 392 (695) T protein:vir:36 314 VSDIVKQFSVSGILMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDKA-TEEFFQFNTPLSGLDALQAQAQEQMSAV 392 (695) T ss_pred HHHHHHhhhHHHHHHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEecC-CcceEEEecccCCHHHHHHHHHHHHHhh Confidence 6666654443332110 00000111111 11222 345556532 2356677888999999999999999999 Q ss_pred hCcccccccccc----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCc Q lcl|NC_018086. 334 SQTPDLVSKDFT----AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQ 409 (511) Q Consensus 334 s~~p~~~~~~~~----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~ 409 (511) +++|-.-....+ |+||++=..-|...+.-.+ ++.+...+++++.+|.. +..+.. +. ++.+.|+|--.. T Consensus 393 a~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Q--e~~L~p~L~rl~~ii~r--S~~G~i---dp-di~~~fnPL~qm 464 (695) T protein:vir:36 393 SHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQ--RNALQQLMNDVIVMIQL--SLFGAV---DP-SIKWQWNALREL 464 (695) T ss_pred hcCchhhhhccCcccccccchhhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHH--HhcCCC---CC-cceEEeCCCCCc Confidence 999965543332 6889875555555544333 67889999998887743 333322 22 578899999999 Q ss_pred CHHHHHHHHHHH---------hccCChHHHHHhCC------CC--CCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCC Q lcl|NC_018086. 410 SYAELADMAVKL---------RDMLPDETIINQFP------WI--TDARQEVEKADAQRQKRADIALQNFKQTSAVQGAS 472 (511) Q Consensus 410 d~~e~a~~~~~~---------~g~~s~et~~~~l~------~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 472 (511) +++|.|+.-.+- .|+++...+..+|. |. .|.+.+=-...+....-+ .+..... T Consensus 465 td~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~----------~~~~~~~ 534 (695) T protein:vir:36 465 DDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGV----------LTYVQRL 534 (695) T ss_pred CHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhh----------HhhhcCc Confidence 999988764332 36666665655542 21 010000000000000000 0000011 Q ss_pred CccccccCCCCCCccccccCCCCccccccccCC-CCCCCC Q lcl|NC_018086. 473 TAAANKLDKNPANTSTITTTDPVAAKEQEKAIQ-KKPKTD 511 (511) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 511 (511) ++.. ..+.++.+..+++..|+.++..-+.-+ +.++++ T Consensus 535 ~~~~--~~~~~~~~~~g~~~~~~v~~~~~~~~~~~ag~~~ 572 (695) T protein:vir:36 535 AEGG--DTGAPGGARAGATAPPTVANVNANVNPREAGAQD 572 (695) T ss_pred cccc--ccCCCCcccccccCCCcccccccccCccccCCCC Confidence 1111 111223344455555666665555543 455566 No 166 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=98.07 E-value=5.4e-06 Score=49.45 Aligned_cols=391 Identities=12% Similarity=0.001 Sum_probs=170.8 Q ss_pred CccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHH Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPK 82 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k 82 (511) |+|+.-.... . .......+..++ ....|........ ... .. -.+.. -.- T Consensus 1 Mg~f~~~~~r---------~---~~~~~~~~~~~~--------------~~~~~~~~~~~~~-~~~-~~-al~~~--~v~ 49 (416) T protein:vir:81 1 MGIFYKNEKR---------D---LQYNEDDLQMMV--------------QTLPGFQGTKLRQ-YKD-IE-AIRHS--DIF 49 (416) T ss_pred CCcccccccc---------c---ccCCCcchhHHH--------------HHhccccccCccc-cch-hh-hhcch--HHH Confidence 5555321000 0 000001111111 1111111000000 000 00 01111 112 Q ss_pred HHHHHHHhhhhccCceecCchhh--HHHHHHHHhc--cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcc Q lcl|NC_018086. 83 LLVDTSTAYLAGEPITESGDEKT--IKAMQPVFKE--NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSP 154 (511) Q Consensus 83 ~ivd~~~~~l~g~~~~~~~d~~~--~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p 154 (511) ..|+..++-+-+-|+++..+.+. ...+..+|.. |. .......+....+.+|.||+++..+..|++ .+..++| T Consensus 50 ~cv~~Ia~~iA~~p~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~ 129 (416) T protein:vir:81 50 TAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKT 129 (416) T ss_pred HHHHHHHHhhccCceEEecCccccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcC Confidence 24555555555567765433221 1223444432 32 234566778888999999999999888886 4778899 Q ss_pred cceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccce Q lcl|NC_018086. 155 MNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPV 234 (511) Q Consensus 155 ~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 234 (511) ..+.++.+.... + +++....+..+... ...+.+..+++++.- ++ T Consensus 130 ~~v~v~~~~~g~--~----~~~~~~~~~~~~~~--~~~~~~~evihir~~-----------------------~~----- 173 (416) T protein:vir:81 130 SEIELKSDARGR--L----YYFHQRIDSNGNNI--ERNVKFEDMLDIKFY-----------------------SL----- 173 (416) T ss_pred ceeEEEECCCcc--E----EEEEEEecCCCcee--EEEEccccEEEeccC-----------------------CC----- Confidence 988877654321 1 11111111111111 122444444444210 00 Q ss_pred EeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhh----hh--------hhCceeeec Q lcl|NC_018086. 235 LEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSIS----NM--------KNDRVIVTD 302 (511) Q Consensus 235 v~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~----~~--------~~~~~i~~~ 302 (511) +.-.|.|.+..+...++.......-..+.+...+.|-.+++--..-.+++... .+ ..++++.++ T Consensus 174 ----d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~ 249 (416) T protein:vir:81 174 ----DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD 249 (416) T ss_pred ----CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecC Confidence 11247777777776666655555555555566666766654221111112111 11 113466676 Q ss_pred CCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 303 EDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-AASGQALKAATQPLENKSAVKESKFRKVLAKRYELV 381 (511) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li 381 (511) ++.+.+.++.+.....+....+...+.|+..-++|..-.+... +.|.+... ..|..+|.-++..| T Consensus 250 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~--------------~~~~~~l~P~~~~i 315 (416) T protein:vir:81 250 ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDAN--------------LDYLSTLKPYITCV 315 (416) T ss_pred CCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHHH--------------HHHHHHHHHHHHHH Confidence 6655555554445555666677788889999999865443221 11211111 11223344444444 Q ss_pred HHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 382 CSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWI--TDARQEVEKADAQRQKRADI 457 (511) Q Consensus 382 ~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v--~d~~~E~~ri~~E~~~~~~~ 457 (511) ...+...-. .......+++.+..-+-.|..+.++++.++ .|+++.-.++..++.- ++.+...-.+...- .. T Consensus 316 e~~ln~~l~-~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~---~~- 390 (416) T protein:vir:81 316 CAELNFKFN-DEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNH---VN- 390 (416) T ss_pred HHHHhhhcc-ccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccc---cc- Confidence 333332211 112233455555665667888999988887 4789888888887542 22222111110000 00 Q ss_pred HHhhccccccCCCCCCccccccCCCCCCccc Q lcl|NC_018086. 458 ALQNFKQTSAVQGASTAAANKLDKNPANTST 488 (511) Q Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (511) .+......... .... .....+|+... T Consensus 391 -~~~~~~~~~~~--~~~~--~~~~kgGe~n~ 416 (416) T protein:vir:81 391 -IELVDEYQMNK--SRAT--DKKLKGGEENE 416 (416) T ss_pred -cccccccCccc--cccc--ccccCCCCCCC Confidence 00000000000 0000 00000000000 No 167 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=98.07 E-value=5.4e-06 Score=49.45 Aligned_cols=391 Identities=12% Similarity=0.001 Sum_probs=170.8 Q ss_pred CccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHH Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPK 82 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k 82 (511) |+|+.-.... . .......+..++ ....|........ ... .. -.+.. -.- T Consensus 1 Mg~f~~~~~r---------~---~~~~~~~~~~~~--------------~~~~~~~~~~~~~-~~~-~~-al~~~--~v~ 49 (416) T protein:vir:45 1 MGIFYKNEKR---------D---LQYNEDDLQMMV--------------QTLPGFQGTKLRQ-YKD-IE-AIRHS--DIF 49 (416) T ss_pred CCcccccccc---------c---ccCCCcchhHHH--------------HHhccccccCccc-cch-hh-hhcch--HHH Confidence 5555321000 0 000001111111 1111111000000 000 00 01111 112 Q ss_pred HHHHHHHhhhhccCceecCchhh--HHHHHHHHhc--cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcc Q lcl|NC_018086. 83 LLVDTSTAYLAGEPITESGDEKT--IKAMQPVFKE--NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSP 154 (511) Q Consensus 83 ~ivd~~~~~l~g~~~~~~~d~~~--~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p 154 (511) ..|+..++-+-+-|+++..+.+. ...+..+|.. |. .......+....+.+|.||+++..+..|++ .+..++| T Consensus 50 ~cv~~Ia~~iA~~p~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~ 129 (416) T protein:vir:45 50 TAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKT 129 (416) T ss_pred HHHHHHHHhhccCceEEecCccccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcC Confidence 24555555555567765433221 1223444432 32 234566778888999999999999888886 4778899 Q ss_pred cceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccce Q lcl|NC_018086. 155 MNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPV 234 (511) Q Consensus 155 ~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 234 (511) ..+.++.+.... + +++....+..+... ...+.+..+++++.- ++ T Consensus 130 ~~v~v~~~~~g~--~----~~~~~~~~~~~~~~--~~~~~~~evihir~~-----------------------~~----- 173 (416) T protein:vir:45 130 SEIELKSDARGR--L----YYFHQRIDSNGNNI--ERNVKFEDMLDIKFY-----------------------SL----- 173 (416) T ss_pred ceeEEEECCCcc--E----EEEEEEecCCCcee--EEEEccccEEEeccC-----------------------CC----- Confidence 988877654321 1 11111111111111 122444444444210 00 Q ss_pred EeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhh----hh--------hhCceeeec Q lcl|NC_018086. 235 LEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSIS----NM--------KNDRVIVTD 302 (511) Q Consensus 235 v~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~----~~--------~~~~~i~~~ 302 (511) +.-.|.|.+..+...++.......-..+.+...+.|-.+++--..-.+++... .+ ..++++.++ T Consensus 174 ----d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~ 249 (416) T protein:vir:45 174 ----DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD 249 (416) T ss_pred ----CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecC Confidence 11247777777776666655555555555566666766654221111112111 11 113466676 Q ss_pred CCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 303 EDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-AASGQALKAATQPLENKSAVKESKFRKVLAKRYELV 381 (511) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li 381 (511) ++.+.+.++.+.....+....+...+.|+..-++|..-.+... +.|.+... ..|..+|.-++..| T Consensus 250 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~--------------~~~~~~l~P~~~~i 315 (416) T protein:vir:45 250 ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDAN--------------LDYLSTLKPYITCV 315 (416) T ss_pred CCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHHH--------------HHHHHHHHHHHHHH Confidence 6655555554445555666677788889999999865443221 11211111 11223344444444 Q ss_pred HHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 382 CSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWI--TDARQEVEKADAQRQKRADI 457 (511) Q Consensus 382 ~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v--~d~~~E~~ri~~E~~~~~~~ 457 (511) ...+...-. .......+++.+..-+-.|..+.++++.++ .|+++.-.++..++.- ++.+...-.+...- .. T Consensus 316 e~~ln~~l~-~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~---~~- 390 (416) T protein:vir:45 316 CAELNFKFN-DEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNH---VN- 390 (416) T ss_pred HHHHhhhcc-ccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccc---cc- Confidence 333332211 112233455555665667888999988887 4789888888887542 22222111110000 00 Q ss_pred HHhhccccccCCCCCCccccccCCCCCCccc Q lcl|NC_018086. 458 ALQNFKQTSAVQGASTAAANKLDKNPANTST 488 (511) Q Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (511) .+......... .... .....+|+... T Consensus 391 -~~~~~~~~~~~--~~~~--~~~~kgGe~n~ 416 (416) T protein:vir:45 391 -IELVDEYQMNK--SRAT--DKKLKGGEENE 416 (416) T ss_pred -cccccccCccc--cccc--ccccCCCCCCC Confidence 00000000000 0000 00000000000 No 168 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=98.05 E-value=6.1e-06 Score=49.19 Aligned_cols=387 Identities=9% Similarity=0.016 Sum_probs=175.4 Q ss_pred CccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc-ccCCcCccccccceeccchH Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAI-QSRTFDDTNKPNSKIVHNFP 81 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~-~~~~~~~~~~~~~ri~~n~~ 81 (511) |.|++.+++..... ...+ .....|-.... ...... -+..-+...-. T Consensus 1 m~~~~~~~~~~~~~---------------------------~~~~---~~~~~~~~~~~~~~g~~v---~~~~al~~~~v 47 (419) T protein:vir:57 1 MFIPQFWKGRPSEN---------------------------RVNW---QVVPGGMRSSSSQAGVII---TPETALALSAV 47 (419) T ss_pred CcchhhhccCCccc---------------------------cccc---cccccccccccccCCcee---chHHhhccHHH Confidence 33333333321110 0000 00000000000 000000 00011122334 Q ss_pred HHHHHHHHhhhhccCcee---cCch--h--hHHHHHHHHh--cc---ChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-E Q lcl|NC_018086. 82 KLLVDTSTAYLAGEPITE---SGDE--K--TIKAMQPVFK--EN---YVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-R 148 (511) Q Consensus 82 k~ivd~~~~~l~g~~~~~---~~d~--~--~~~~l~~~~~--~n---~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~ 148 (511) ...|+..++-+-+-|+.+ ..+. + ....+.+++. -| ........+..+.+.+|.||+++..+..|++ . T Consensus 48 ~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~ 127 (419) T protein:vir:57 48 RACVTLLAESVAQLPCVLYRRTENGGREIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITE 127 (419) T ss_pred HHHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEE Confidence 556666666666667664 1111 1 1223555553 22 2345566788889999999999988888875 5 Q ss_pred EEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceecc Q lcl|NC_018086. 149 FKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNL 228 (511) Q Consensus 149 i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (511) +..++|..+.+..+... ..+|... ..+ .++..+.+++++.- T Consensus 128 L~pl~~~~v~v~~~~~g-------~~~y~~~--~~~------~~~~~~~vih~r~~------------------------ 168 (419) T protein:vir:57 128 LIPINPHKVIVLKGPDG-------MPYYDIP--SIG------EILPMRMVHHIKSF------------------------ 168 (419) T ss_pred EEEEcCcceEEEECCCc-------eEEEEEc--CCc------eEEchhhEEEecCc------------------------ Confidence 67788888776554321 1122211 111 12333333333200 Q ss_pred CCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecC---CCCccchhhhhhh----------- Q lcl|NC_018086. 229 LQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGF---DLSADSDSISNMK----------- 294 (511) Q Consensus 229 ~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~---~~~~~~~~~~~~~----------- 294 (511) | .+...|.|.+..+...++.......-....+...+.|-.++.-- +....++....++ T Consensus 169 ----~----~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~ 240 (419) T protein:vir:57 169 ----S----LDGYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTERYGGVR 240 (419) T ss_pred ----C----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHHHhcccc Confidence 0 01124778777777777665555544455556666676555421 1111222222211 Q ss_pred -hCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 295 -NDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKV 373 (511) Q Consensus 295 -~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 373 (511) .++++.++++.+++.+........+.+..+...+.|+..-++|....+.....+...++.. ....+..+ T Consensus 241 nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~----------~~~f~~~~ 310 (419) T protein:vir:57 241 NAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEHQ----------GLQYVIYT 310 (419) T ss_pred ccccceecCCCceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHHH----------HHHHHHHH Confidence 1356667766666555555555666777788888999999998655543322222222111 11233444 Q ss_pred HHHHHHHHHHHHHhcCC-CccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHH Q lcl|NC_018086. 374 LAKRYELVCSYLEFMNK-AKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQ 450 (511) Q Consensus 374 l~~~~~li~~~~~~~~~-~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E 450 (511) |.-+++.|...+...-- ........+++.+...+..|..+.++++.++ .|+++.-.+++.++.-+-+.. +.+ T Consensus 311 l~P~~~~ie~~l~~~ll~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gg--D~~--- 385 (419) T protein:vir:57 311 MLAILKRHESAMMRDLLLPSERRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLTPIPGG--DKY--- 385 (419) T ss_pred HHHHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--Cee--- Confidence 44444444333332111 1111222345555566677889999988887 478988888888765321110 000 Q ss_pred HHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccc Q lcl|NC_018086. 451 RQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEK 502 (511) Q Consensus 451 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (511) +.+.+. ...+. -...+.+.+.++++......+-| T Consensus 386 --------~~~~n~---------~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 419 (419) T protein:vir:57 386 --------LTPLNM---------VDSKA-LTGIGKATPQQLKDIEAILCTRN 419 (419) T ss_pred --------eecccc---------ccccc-cccccCCCcccCcchhhhhhccC Confidence 000000 00000 00001111222222222222222 No 169 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=98.05 E-value=6.1e-06 Score=49.19 Aligned_cols=449 Identities=11% Similarity=0.079 Sum_probs=196.0 Q ss_pred CCCccchhhcccccC---chhhHhhhhccCCCHH-HHHHHHHHHHHHHHHH----HHHHHHhcCCCcccccCCc-----C Q lcl|NC_018086. 1 MAIPNGQINAGDIIT---TNIRRKHFIRRNFDLR-ELITLAEMHSRSSSAY----GVLYDYYKGNHIAIQSRTF-----D 67 (511) Q Consensus 1 ~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~-~l~~~~~~~~~~~~~~----~~~~~yY~G~~~~~~~~~~-----~ 67 (511) |+.+-.|-.+.++.. .+....-..-+ ..+. .|.+++. -+..-| ..-..|-.+.+....-... . T Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 108 (694) T protein:vir:10 33 IATAAAQPVPADFARRGALNALDAAPVAE-PSPSLRLARQFE---VDVSNYTPRERRAASYALDFNGTSMDALSFVTSSG 108 (694) T ss_pred hhhcCCCcccCCccccccchhhcccccCC-CCcchhhhhhcc---ccccCCCccccchhhhhhccCcccccchhhhhccC Confidence 222222211111110 00000000000 0010 1111111 111000 0111122221110000000 0 Q ss_pred ccccccc-ee-ccchHHHHHHHHHhhhhccCceec---------------------CchhhHHHHHHHHhccChhHHHHH Q lcl|NC_018086. 68 DTNKPNS-KI-VHNFPKLLVDTSTAYLAGEPITES---------------------GDEKTIKAMQPVFKENYVTDVNSE 124 (511) Q Consensus 68 ~~~~~~~-ri-~~n~~k~ivd~~~~~l~g~~~~~~---------------------~d~~~~~~l~~~~~~n~~~~~~~~ 124 (511) -.+.+.. -+ -++-.+.++...+..+.-+-+..+ .+-+..+.|..-+++=++...+.+ T Consensus 109 F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~e 188 (694) T protein:vir:10 109 FPGFPTLVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRT 188 (694) T ss_pred cchHHHHHHHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 0000000 00 123344555555555533321111 111445667777888888899999 Q ss_pred HHHHHhhCCeEEEEeeeCCCCc----eE--------------EEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcce Q lcl|NC_018086. 125 EVKLSGIFGHCFEIHWIDRNKK----HR--------------FKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQ 186 (511) Q Consensus 125 ~~~~a~~~G~~~~~v~~~~~g~----~~--------------i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~ 186 (511) +.+++-.||.+..++-.+.++. |. +.+++|..+.|-.-+.. .+.. T Consensus 189 aik~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~--dP~s--------------- 251 (694) T protein:vir:10 189 TVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSI--NPVA--------------- 251 (694) T ss_pred HHHhhccccceEEEEEeecCccccccccccccccccCcceeeeEeecccccccchhhhc--cchh--------------- Confidence 9999999999986665543331 11 44555555544210000 0000 Q ss_pred EEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEe-ec--CCcccCchhHHHHHHHHHHHHHHHH Q lcl|NC_018086. 187 IRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLE-II--ANEERLGDFEAQLSLIDAYNLAVSD 263 (511) Q Consensus 187 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~-~~--n~~~g~s~~~~v~~l~d~~~~~~s~ 263 (511) -.+|-|.+++-. +. +......+.|...|+-. ++ .+-.|.|....+.+-+++.+++.-. T Consensus 252 ---pdfgkP~~y~V~----G~------------~IH~SRL~~f~g~plPd~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~ 312 (694) T protein:vir:10 252 ---DDFYKPSTWWMI----GT------------EVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQS 312 (694) T ss_pred ---hccCCCceEEEe----ce------------EEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhH Confidence 011111111100 00 00000000010011110 11 1224788888888888888887777 Q ss_pred HHHHHHHhcCceeEee---cCCCCccchhh------hhhhh-CceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 264 SVNDIAYWNDAYLWLQ---GFDLSADSDSI------SNMKN-DRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSL 333 (511) Q Consensus 264 ~~~~~~~~~~p~l~~~---G~~~~~~~~~~------~~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~ 333 (511) ....+..++...+..- ........... ...+. .+++.++++ +=+|.+.+.+...+...+......|... T Consensus 313 v~~Li~~~~v~~lk~dla~~L~~g~~~~l~~R~eli~~~Rsn~G~~llDk~-~Eefeq~stslSGLddVi~qf~q~VAga 391 (694) T protein:vir:10 313 VSDIVKQFSVSGILMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDKA-TEEFFQFNTPLSGLDALQAQAQEQMSAV 391 (694) T ss_pred HHHHHHhhhhHHHHHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEecC-CcceEEEecccCCHHHHHHHHHHHHHhh Confidence 7666654444432110 00000111111 11222 345556532 2356677888999999999999999999 Q ss_pred hCcccccccccc----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCc Q lcl|NC_018086. 334 SQTPDLVSKDFT----AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQ 409 (511) Q Consensus 334 s~~p~~~~~~~~----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~ 409 (511) +++|-.-....+ |+||++=..-|...+.-.. ++.+...+++++.+|.. +..+.. +. ++.+.|+|--.. T Consensus 392 a~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Q--e~~L~p~L~rl~~ii~r--S~~G~i---dp-~i~~~fnPL~qm 463 (694) T protein:vir:10 392 SHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQ--RNALQQLMNDVIVMIQL--SLFGAV---DP-SIKWQWNALREL 463 (694) T ss_pred hcCchhhhhccCcccccccchhhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHH--HhcCCC---CC-cceEEeCCCCCc Confidence 999965543332 6889875555555544333 67889999998887743 333322 22 578899999999 Q ss_pred CHHHHHHHHHHH---------hccCChHHHHHhCC------CC--CCHHHHHHHHHHHH-HHHHHHHHhhccccccCCCC Q lcl|NC_018086. 410 SYAELADMAVKL---------RDMLPDETIINQFP------WI--TDARQEVEKADAQR-QKRADIALQNFKQTSAVQGA 471 (511) Q Consensus 410 d~~e~a~~~~~~---------~g~~s~et~~~~l~------~v--~d~~~E~~ri~~E~-~~~~~~~~~~~~~~~~~~~~ 471 (511) +++|.|+.-.+- .|+++...+..+|. |. .|.+.+=-...+.. ..... .... T Consensus 464 td~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~-----------~~~~ 532 (694) T protein:vir:10 464 DDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLT-----------YVQR 532 (694) T ss_pred CHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHh-----------hhcC Confidence 999988764332 36666665655542 21 01000000000000 00000 0001 Q ss_pred CCccccccCCCCCCccccccCCCCccccccccCC-CCCCCC Q lcl|NC_018086. 472 STAAANKLDKNPANTSTITTTDPVAAKEQEKAIQ-KKPKTD 511 (511) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 511 (511) .++.. ..+.++.+..+++..|+.++..-+.-+ +.++++ T Consensus 533 ~~~~~--~~~~~~~~~~g~~~~~~v~~~~~~~~~~~ag~~~ 571 (694) T protein:vir:10 533 LAEGG--DTGAPGGARAGATAPPTVANVNANVNPREAGAQD 571 (694) T ss_pred ccccc--ccCCCCcccccccCCCcccccccccCccccCCCC Confidence 11111 111223344455555666655555533 455555 No 170 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=98.03 E-value=6.5e-06 Score=49.01 Aligned_cols=382 Identities=10% Similarity=-0.007 Sum_probs=171.2 Q ss_pred HHHHHHHHHHH-----HHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCceec---Cc--- Q lcl|NC_018086. 34 ITLAEMHSRSS-----SAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITES---GD--- 102 (511) Q Consensus 34 ~~~~~~~~~~~-----~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~---~d--- 102 (511) +.+++.+..+. .....+...+.+...-. ........ .=+...-....|+..++-+-+-|+.+- .+ T Consensus 1 Mg~f~~lf~r~~~~~~~~~~~~~~~~~~~~~~~-~g~~v~~~---~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~~ 76 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPAELADAIGLSYDTY-TGKQISSQ---RAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSLKQ 76 (414) T ss_pred CchhhhhhccCccCcccchhhHhHhhccCcccc-CCceechh---hhhccHHHHHHHHHHHHHhccCceEEEEecCCcee Confidence 22332221111 01111222222111000 00000000 001123344555666666656676541 11 Q ss_pred hhhHHHHHHHHh--cc---ChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEE Q lcl|NC_018086. 103 EKTIKAMQPVFK--EN---YVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYN 176 (511) Q Consensus 103 ~~~~~~l~~~~~--~n---~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~ 176 (511) ......+..++. -| ........+....+.+|.||+++..+ .|++ .+..++|..+.+.++.... +. | T Consensus 77 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~--~~-----y 148 (414) T protein:vir:44 77 RATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPKLNSSWE--PV-----Y 148 (414) T ss_pred ecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEEECCCCc--EE-----E Confidence 111122334432 12 34456677888899999999888765 4665 5778899988877764321 11 1 Q ss_pred EEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHH Q lcl|NC_018086. 177 TVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDA 256 (511) Q Consensus 177 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~ 256 (511) .... .++.. ..+.++.+++++.- ++ +...|.|.+..+...++. T Consensus 149 ~~~~-~~g~~----~~~~~~evih~~~~-----------------------~~---------d~~~G~s~i~~~~~~i~~ 191 (414) T protein:vir:44 149 QVTF-PDGST----DVLSQEDIWHVRTL-----------------------TL---------DGLVGLNPIAYAREAISL 191 (414) T ss_pred EEEe-cCceE----EEEccccEEEecCC-----------------------CC---------CCcccccHHHHHHHHHHH Confidence 1111 11211 12444444444210 00 112477877777766666 Q ss_pred HHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhh------------hCceeeecCCCceeeeecCCCHHHHHHHHH Q lcl|NC_018086. 257 YNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMK------------NDRVIVTDEDGMVKFITKDVNDKHIENIKN 324 (511) Q Consensus 257 ~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~------------~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (511) .........+.+...+.|-.++.-... -.++....++ .++++.++++.+.+.+..+.....+.+..+ T Consensus 192 ~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~ 270 (414) T protein:vir:44 192 AAATEEHGARLFSNGAVTSGVLRTEQT-LSDQAYERLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRK 270 (414) T ss_pred HHHHHHHHHHHHhccCCCceEEEeCCC-CCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChHHHHHHHHHH Confidence 666555555556666667666554221 1122222111 134666766656555544444455666677 Q ss_pred HHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CccccccceeEEe Q lcl|NC_018086. 325 RAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNK-AKDLKPYEVTPVF 403 (511) Q Consensus 325 ~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~-~~~~~~~~i~i~f 403 (511) .....|+..-++|..-.+..+..+...++.+. ..++..+|.-+++.|...+...-. ........+++.+ T Consensus 271 ~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~----------~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~ 340 (414) T protein:vir:44 271 FQLEEICRLFRVPLHMVQNTDRATFNNIEELG----------LGFINYSLVPYLTRIEQRINTGLVRKSKQGVFYAKFNA 340 (414) T ss_pred HHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH----------HHHHHHHHHHHHHHHHHHHHhhcCCccccCceEEEEec Confidence 77888888888887555443333322222211 123334444444444443332111 1111222344555 Q ss_pred CCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCC Q lcl|NC_018086. 404 VRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDK 481 (511) Q Consensus 404 ~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (511) ...+..|..+.++++.++ +|+++.-.++..++.-+-+.. +......+.. ..+.... T Consensus 341 ~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~gg-------------D~~~~~~n~~--~~~~~~~------- 398 (414) T protein:vir:44 341 GALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGG-------------DVYLTPMNMT--TKPSDGS------- 398 (414) T ss_pred hhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc-------------ceeccccccc--ccCCccc------- Confidence 566667889999998887 478888888887765221000 0000000000 0000000 Q ss_pred CCCCccccccCCCCccccccccCCCCCCC Q lcl|NC_018086. 482 NPANTSTITTTDPVAAKEQEKAIQKKPKT 510 (511) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (511) . +.++++++.+|++.+ T Consensus 399 -------~------~~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 399 -------K------AGKQKDNANADETTS 414 (414) T ss_pred -------c------CCCCCCCCCCCCCCC Confidence 0 000111111111111 No 171 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=98.02 E-value=7e-06 Score=48.85 Aligned_cols=430 Identities=10% Similarity=0.020 Sum_probs=189.2 Q ss_pred Hhhhhc-cCCCHHHHHHHHHHHHHHHH----HHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhc Q lcl|NC_018086. 20 RKHFIR-RNFDLRELITLAEMHSRSSS----AYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAG 94 (511) Q Consensus 20 ~~~~~~-~~~~~~~l~~~~~~~~~~~~----~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g 94 (511) .++-+. ...+.+.|.+..+.++.++. +++.+.+|..-. .... ........++..+-+...++.+++.|++ T Consensus 1 ~~~~~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~---~~~~--~~~~~~~~~~~dstg~~a~~~LAa~l~~ 75 (515) T protein:vir:70 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY---LMNN--KGDNETSQNGWQGVGAQATNHLANKLAQ 75 (515) T ss_pred CcchhhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccc---ccCC--CCCcccccccccchHHHHHHHHHHHHHH Confidence 111111 34566666766666654432 444445554441 1111 1112222345566677777777777664 Q ss_pred c------Cc-eecCch----------hh-----------HHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCc Q lcl|NC_018086. 95 E------PI-TESGDE----------KT-----------IKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKK 146 (511) Q Consensus 95 ~------~~-~~~~d~----------~~-----------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~ 146 (511) - || ++...+ .. ...+...+..++|.....++.++..++|.+.+++ ++++. T Consensus 76 ~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~--d~~~~ 153 (515) T protein:vir:70 76 VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK--PSKGA 153 (515) T ss_pred hhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEE--eCCCC Confidence 2 22 222111 00 1124445777899999999999999999987665 55555 Q ss_pred eEEEEEcccceEEEecCCCCCceEEEEEEEEEeec----------------CCcceEEEEEEEcCCcEEEEEEccCcccc Q lcl|NC_018086. 147 HRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISD----------------ITGHQIRTYEVYTEDLIYKFSTDDEREVY 210 (511) Q Consensus 147 ~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~----------------~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 210 (511) ++ .++-.+ +++--+.. +.+...+|.+..... ...+....+++|+.- .. ...+.+.. T Consensus 154 ~~--~~pl~~-y~v~~d~~-G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v---~~-~~~~~~~~ 225 (515) T protein:vir:70 154 MS--AVPMHH-YVVNRDTN-GDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHA---QY-AGEGFWKI 225 (515) T ss_pred eE--EEEcCe-EEEeeCCC-cCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEE---Ee-cCCCceEE Confidence 44 343333 44444443 345555554432210 000111123333210 00 00111111 Q ss_pred cccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCc Q lcl|NC_018086. 211 REIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSA 285 (511) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~ 285 (511) ........ .......+|..+|++.++ ++.+|+|-..+..+-+..+|.+.-...........|.+.+.-... . T Consensus 226 ~~e~d~~~--~~~es~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~-~ 302 (515) T protein:vir:70 226 NQSADDIP--VGKESRIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQ-T 302 (515) T ss_pred EEecCcee--eccccccccccCCceeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccc-c Confidence 11111111 111222234556766543 357899999999999999999988888888888888876542111 0 Q ss_pred cchhhhhhhhCceeeecCCCceeeeec--CCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHH Q lcl|NC_018086. 286 DSDSISNMKNDRVIVTDEDGMVKFITK--DVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKS 363 (511) Q Consensus 286 ~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~ 363 (511) ..... .....+.+.....++++.+.. ..+.......++.++..|...-....+........|++.+. .+. T Consensus 303 ~~~~l-~~~~~g~iv~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvTAtEV~-------~r~ 374 (515) T protein:vir:70 303 DVDHF-VNSGTGEVITGVAEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEIQ-------RDA 374 (515) T ss_pred chhhc-cccCCceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHH-------HHH Confidence 00000 011123343333345555542 33556667777777777655443221111122235776655 445 Q ss_pred HHHHHHHHHHHHHHHHH-----HHHHHHhcCCCccccccceeEEeCCCCCcCHHHHH---HHHHHH---hc----c---- Q lcl|NC_018086. 364 AVKESKFRKVLAKRYEL-----VCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELA---DMAVKL---RD----M---- 424 (511) Q Consensus 364 ~~~~~~~~~~l~~~~~l-----i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a---~~~~~~---~g----~---- 424 (511) .++...++..+.++-.- +...+..... ......+.+.+.. +.+....+ +.+... .+ + T Consensus 375 ~E~~~~LGpv~srL~~Ell~Pli~r~~~~~~p--~~P~~~v~~~~vs--~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~ 450 (515) T protein:vir:70 375 LEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGD--SFTSELVDPVIVT--GIEALGRMAELDKLANFAQYMSLPQTWPEPA 450 (515) T ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHhhCC--CCChhhcccceeh--hHHHHHHHHHHHHHHHHHHHHHHHhccChhH Confidence 56666666666654221 1111221111 1111123333321 22222221 111111 11 1 Q ss_pred ---CCh----HHHHHhCCC---CCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCcccccc Q lcl|NC_018086. 425 ---LPD----ETIINQFPW---ITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKL 479 (511) Q Consensus 425 ---~s~----et~~~~l~~---v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (511) +.. +.+...++. +-..++|++.++++++.....++-..+......+...+...+. T Consensus 451 ~~~id~d~~~~~~a~~~g~p~~~~rs~eev~~~r~q~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) T protein:vir:70 451 QRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) T ss_pred HhhCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHhhhhhcccchhhhhccC Confidence 111 122222221 1124577777766544433322111100000111111111111 No 172 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=97.98 E-value=8.3e-06 Score=48.44 Aligned_cols=428 Identities=10% Similarity=0.042 Sum_probs=178.0 Q ss_pred hhcc-CCCHHHHHHHHHHHHHHH----HHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhcc-- Q lcl|NC_018086. 23 FIRR-NFDLRELITLAEMHSRSS----SAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGE-- 95 (511) Q Consensus 23 ~~~~-~~~~~~l~~~~~~~~~~~----~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~-- 95 (511) ++.. ..+...+.+..+.++.++ .+++.+.+|..-. .......+....++..+-+...++.+++.|++- T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~-----~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~lt 75 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPY-----LMADVNDDLSSQNAWQDDGASATNFLSNKLSQVLF 75 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccc-----cccCCCCCccccccccchHHHHHHHHHHHHHHhhc Confidence 1211 223344555555444333 3444445554431 111122233344666777778888888777652 Q ss_pred ----Cc-eecCch--------------hhH-------HHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEE Q lcl|NC_018086. 96 ----PI-TESGDE--------------KTI-------KAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRF 149 (511) Q Consensus 96 ----~~-~~~~d~--------------~~~-------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i 149 (511) || ++...+ +.. ..+...+..++|.....++.++..++|.+.+++ ++ +...+ T Consensus 76 pp~~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~--~~-~~~~~ 152 (517) T protein:vir:10 76 PAQRSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYH--PD-KTSPI 152 (517) T ss_pred CCCCccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEE--eC-CCCcE Confidence 22 122111 111 123455677899999999999999999987654 33 22344 Q ss_pred EEEcccceEEEecCCCCCceEEEEEEEEEeecC----------------CcceEEEEEEEcCCcEEEEEEccCccccccc Q lcl|NC_018086. 150 KAVSPMNCLIAYSADLDEEPVAAIYYNTVISDI----------------TGHQIRTYEVYTEDLIYKFSTDDEREVYREI 213 (511) Q Consensus 150 ~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~----------------~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 213 (511) +.++-. -+++--+.. +.+...++.+...... ..+....+++|+.- +. ...+.+..... T Consensus 153 ~~~pl~-~y~v~~d~~-G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v---~~-~~~~~~~~~~~ 226 (517) T protein:vir:10 153 QAVPLH-HYCVRRDNN-GTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHA---KR-TKDGKYLIRQS 226 (517) T ss_pred EEEEcC-eEEEeeCCC-cCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEE---EE-eCCCceEEEEE Confidence 554433 344444443 3344445444322100 00111223333310 11 11111111111 Q ss_pred ccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccch Q lcl|NC_018086. 214 PEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSD 288 (511) Q Consensus 214 ~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~ 288 (511) .... ........++..+|++.++ ++.+|+|-..+..+-+..+|.+.-...........|.+.+.-... .... T Consensus 227 ~d~~--~~~~~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~-~~~~ 303 (517) T protein:vir:10 227 ADDV--PVGKESTVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSY-TDIN 303 (517) T ss_pred eCce--eeccccccccccCCeeeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccc-cchh Confidence 1000 0111122234567766543 356899999999999999998877777777777777665431110 0001 Q ss_pred hhhhhhhCceeeecCCCceeeee--cCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 289 SISNMKNDRVIVTDEDGMVKFIT--KDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVK 366 (511) Q Consensus 289 ~~~~~~~~~~i~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~ 366 (511) .... ...+.+.-...+++..+. ...+.......++.++..|...-....+..-.....|++.+.. +..++ T Consensus 304 ~l~~-~~~g~~~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~~~~~rvTAtEV~~-------r~~E~ 375 (517) T protein:vir:10 304 QFVE-GGSGAVLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEAMTRRDAERVTAYEIQR-------DAMLV 375 (517) T ss_pred hccC-CCccccccCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHHHH-------HHHHH Confidence 0100 111222222223444443 2334566666677777666655443222222222356666554 44555 Q ss_pred HHHHHHHHHHH--------HHHHHHHHHhcCCCccccccceeEEeCCCCCc-CHHHHHHHHHHH---hc-c--------- Q lcl|NC_018086. 367 ESKFRKVLAKR--------YELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQ-SYAELADMAVKL---RD-M--------- 424 (511) Q Consensus 367 ~~~~~~~l~~~--------~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~-d~~e~a~~~~~~---~g-~--------- 424 (511) ...++..+.++ ++.++..+...... ..+.+.+.-.+.. ...+.++.+... .| + T Consensus 376 ~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l~~-----~~v~~~~~s~la~l~r~~~~~~i~~~~~~i~~~a~~~~~~~~ 450 (517) T protein:vir:10 376 EQSLGGVYSLFATTFQGPLARWFMNGISSILTS-----KNVSPTILTGIEALGRMAELDKLGTFNGYVSMTAQWPEPLQQ 450 (517) T ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHhhhhcCC-----CCccceeeccHHHHHHHHHHHHHHHHHHHHHHhhcCChHHHh Confidence 55556554442 11111111111111 1233333222211 111112222221 11 1 Q ss_pred -CChHHHHH----hCCCCC----CHHHHHHHHHHHHHHHHHHH--HhhccccccCCCCCCccccccCCCCCCccccccCC Q lcl|NC_018086. 425 -LPDETIIN----QFPWIT----DARQEVEKADAQRQKRADIA--LQNFKQTSAVQGASTAAANKLDKNPANTSTITTTD 493 (511) Q Consensus 425 -~s~et~~~----~l~~v~----d~~~E~~ri~~E~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (511) +....++. .+| ++ -.++|+++.++++.+..... ....+. . ..+.-.+++.+.++ T Consensus 451 ~id~d~~~~~~a~~~G-vp~~~irs~~ev~~~~~~~~~~~~~~~~~~~ag~---~-----~~~~~~~~~~~~~~------ 515 (517) T protein:vir:10 451 AIKWPDFTDWVQGQIS-ANFPFFKTQDELNAEAQAQQEQEATKYAAEQAGK---A-----IPDMVKNGQINPQG------ 515 (517) T ss_pred cCCHHHHHHHHHHHhC-CChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHH---H-----HHHHHhCCCCCCCC------ Confidence 11122222 222 22 12455555544433222211 111000 0 00000111111111 Q ss_pred CCccc Q lcl|NC_018086. 494 PVAAK 498 (511) Q Consensus 494 ~~~~~ 498 (511) +. T Consensus 516 ---~~ 517 (517) T protein:vir:10 516 ---GQ 517 (517) T ss_pred ---CC Confidence 11 No 173 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=97.97 E-value=8.9e-06 Score=48.28 Aligned_cols=395 Identities=10% Similarity=0.033 Sum_probs=173.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc--------ccCCcCcccc---ccceeccchHHHHHHHHHhhhhccC Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAI--------QSRTFDDTNK---PNSKIVHNFPKLLVDTSTAYLAGEP 96 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~--------~~~~~~~~~~---~~~ri~~n~~k~ivd~~~~~l~g~~ 96 (511) |. +.. .+.+.+...-..=|.|...-. ........+. +..=+...-....|+..++-+-.-| T Consensus 1 ~~-----~~~---~~~~~~~~~~~~~~~g~~~s~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~lp 72 (437) T protein:vir:10 1 MK-----QGK---QRALGRIKSSFLKWLGVPISLTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIATLP 72 (437) T ss_pred CC-----cch---hhhhhhhHHhhhhhcCCcccCCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhhCc Confidence 10 000 111111111111111211000 0000000000 0000112223345565655555556 Q ss_pred cee---cCch----hhHHHHHHHHhc--c---ChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecC Q lcl|NC_018086. 97 ITE---SGDE----KTIKAMQPVFKE--N---YVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSA 163 (511) Q Consensus 97 ~~~---~~d~----~~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~ 163 (511) +.+ ..+. .....+..+|.. | ........+....+.+|.||+++..+. |++ .+..++|..+.+..+. T Consensus 73 ~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-g~~~~L~~l~p~~v~i~~~~ 151 (437) T protein:vir:10 73 LNLYQTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSA-GVLIGLELMLPQRTTVKRLT 151 (437) T ss_pred eeEEEEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcEEEEEEEcCcceEEEECC Confidence 653 1111 111224444432 2 334566777888899999999988874 765 4677888888776543 Q ss_pred CCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCccc Q lcl|NC_018086. 164 DLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEER 243 (511) Q Consensus 164 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g 243 (511) .. . +. |.... .+|.. ..+.++.++|++.- + .+...| T Consensus 152 ~g--~----~~-y~~~~-~~g~~----~~~~~~dIih~r~~----------------------------~----~d~~~G 187 (437) T protein:vir:10 152 SG--A----LQ-YTYRN-VDGTV----STLAEDDVFHVRGF----------------------------S----LDGLMG 187 (437) T ss_pred CC--e----EE-EEEEe-cCceE----EEEccccEEEecCc----------------------------C----CCCccc Confidence 21 1 11 11111 12211 12344444444210 0 011247 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhh----hh--------hCceeeecCCCceeeee Q lcl|NC_018086. 244 LGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISN----MK--------NDRVIVTDEDGMVKFIT 311 (511) Q Consensus 244 ~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~----~~--------~~~~i~~~~~~~~~~~~ 311 (511) .|.+..+...+........-....+...+.|-.++.... .-.++.... +. .++++.++++.+.+.+. T Consensus 188 ~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~ 266 (437) T protein:vir:10 188 LTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQ-ILQKEKRAEIRTDLAEQFGGAMQAGKTMVLEAGMKYQAIT 266 (437) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC-CCCHHHHHHHHHHHHHHhcCccccCcceeccCCceEEecc Confidence 777776666666555555555555666666766665422 111222111 11 13466676666655555 Q ss_pred cCCCHHHHHHHHHHHHHHHHHHhCccccccccccC--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_018086. 312 KDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA--ASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMN 389 (511) Q Consensus 312 ~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~--~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~ 389 (511) .......+....+...+.|+..-++|..-.+.... ..+..++... ...+..+|.-.+..|...+...- T Consensus 267 ~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~----------~~f~~~tl~P~~~~ie~~l~~kl 336 (437) T protein:vir:10 267 MNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQT----------LGFLTFTLRPWLTRIEQAARRSL 336 (437) T ss_pred CChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHH----------HHHHHHHHHHHHHHHHHHHHhhc Confidence 44445556666777788899998988755543322 2222222221 12233344444433333333211 Q ss_pred -CCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCH-HHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_018086. 390 -KAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDA-RQEVEKADAQRQKRADIALQNFKQT 465 (511) Q Consensus 390 -~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~-~~E~~ri~~E~~~~~~~~~~~~~~~ 465 (511) .........+++.+...+..|..+.++++.++ .|+++.-.++..++.-+-+ ..+. +. .. T Consensus 337 l~~~e~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg~~~--~~------~~--------- 399 (437) T protein:vir:10 337 LRPGERDQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKENLPPMGGNAAV--LT------VQ--------- 399 (437) T ss_pred cCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcce--Ee------ec--------- Confidence 11111222355555666777889999998877 4789988888887652200 0000 00 00 Q ss_pred ccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCC Q lcl|NC_018086. 466 SAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKK 507 (511) Q Consensus 466 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (511) ....+ .+..+++ .+...+..+..+..+.+...++-+|+ T Consensus 400 ~~~~~-~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~e~ 437 (437) T protein:vir:10 400 SALLP-IDKLGEH---TTATAAQDALKAWLYQEEKTRATQER 437 (437) T ss_pred Ccccc-hhhccCc---CCCcchhccccccCCCCCCCCccccC Confidence 00000 0000000 01111111112222333333333344 No 174 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=97.95 E-value=9.4e-06 Score=48.15 Aligned_cols=459 Identities=12% Similarity=0.031 Sum_probs=169.0 Q ss_pred CC-------Cccchhhccccc------CchhhHhhhhccCCCHHHHHHHHHHH-HHHHHHHHHH----HHHhcCCCcccc Q lcl|NC_018086. 1 MA-------IPNGQINAGDII------TTNIRRKHFIRRNFDLRELITLAEMH-SRSSSAYGVL----YDYYKGNHIAIQ 62 (511) Q Consensus 1 ~~-------~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~~----~~yY~G~~~~~~ 62 (511) || ++=++..+--+- ....+++.++.++....+|.+..+.- ......+... ..||.- .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~-~~--- 76 (563) T protein:vir:95 1 MADLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDK-RS--- 76 (563) T ss_pred ChhhhhhhhcccccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhccccccccc-cc--- Confidence 22 111222111111 11223333333333333333322110 0111111111 112211 10 Q ss_pred cCCcCccc-cc-cc-eec-cchHHHHHHHHHhhhhc-------------cCceec-----Cch---hhHHHHHHHHhc-- Q lcl|NC_018086. 63 SRTFDDTN-KP-NS-KIV-HNFPKLLVDTSTAYLAG-------------EPITES-----GDE---KTIKAMQPVFKE-- 115 (511) Q Consensus 63 ~~~~~~~~-~~-~~-ri~-~n~~k~ivd~~~~~l~g-------------~~~~~~-----~d~---~~~~~l~~~~~~-- 115 (511) ...++. .+ .. .+. ....+.+|++.++.+.. -++.+. ..+ .....+..++.. T Consensus 77 --~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~ 154 (563) T protein:vir:95 77 --YMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTG 154 (563) T ss_pred --CCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcC Confidence 011110 00 00 111 23444445544433221 122211 011 111223333321 Q ss_pred -c------ChhHHHHHHHHHHhhCCeEEEEee--eCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcc Q lcl|NC_018086. 116 -N------YVTDVNSEEVKLSGIFGHCFEIHW--IDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGH 185 (511) Q Consensus 116 -n------~~~~~~~~~~~~a~~~G~~~~~v~--~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 185 (511) + .+..+...+..+.+.+|.||+.+. .+..|++ .+..++|..+.+..+..... .....+++... .+. T Consensus 155 ~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~-~~~~~~y~~~~---~g~ 230 (563) T protein:vir:95 155 KDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKI-IKGGKRFVQVV---DKR 230 (563) T ss_pred CCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCce-eccceeEEEEe---CCc Confidence 1 244567778889999999988654 4556765 57788999988876654311 11111122111 111 Q ss_pred eEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 186 QIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSV 265 (511) Q Consensus 186 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~ 265 (511) .. ..+.+..++++... |-........|.|.++.+...+.....+..-.. T Consensus 231 ~~---~~~~~~evI~~~~~----------------------------~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~ 279 (563) T protein:vir:95 231 VV---ASFTSRELAMGIRN----------------------------PRTELSSSGYGLSEVEIAMKEFIAYNNTESFND 279 (563) T ss_pred ee---EEecCcceEEEecc----------------------------CCCCcccCcccchHHHHHHHHHHHHHHHHHHHH Confidence 11 12233332222110 000000122577777776666665555555555 Q ss_pred HHHHHhcCceeEee--cCC-CCcc--chhhhhhh--------hCce-eeecCCCceeeeecCCCHHHHHHHHHHHHHHHH Q lcl|NC_018086. 266 NDIAYWNDAYLWLQ--GFD-LSAD--SDSISNMK--------NDRV-IVTDEDGMVKFITKDVNDKHIENIKNRAKLDIF 331 (511) Q Consensus 266 ~~~~~~~~p~l~~~--G~~-~~~~--~~~~~~~~--------~~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~ 331 (511) +.+...+.|-.++. |.. .++. +.....+. .+++ +.++++.+.+.++.+.....+....+...+.|+ T Consensus 280 ~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia 359 (563) T protein:vir:95 280 RFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIIS 359 (563) T ss_pred HHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHH Confidence 55666666765543 432 2221 11111111 1222 456655555555555555667777888889999 Q ss_pred HHhCccccccccc--c----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCC Q lcl|NC_018086. 332 SLSQTPDLVSKDF--T----AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVR 405 (511) Q Consensus 332 ~~s~~p~~~~~~~--~----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~ 405 (511) ..-++|..-.+.. + ...|..+... . -.......+..+|.-+++.|...+...-.. .+. ..+.+.|.+ T Consensus 360 ~afgVPp~~lG~~~~~~~~~~~~~ss~~~s--n---~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~-~~~-~~~~~~f~r 432 (563) T protein:vir:95 360 ALYGIDPAEIGFPNRGGATGSKGGSTLNEA--D---PGKKQQQSQNKGLQPLLRFIEDLVNRHIIS-EYG-DKYTFQFVG 432 (563) T ss_pred HHhCCCHHHccccccccccccccccchhhc--c---HHHHHHHHHHHHHHHHHHHHHHHHHhhhch-hcc-cccEEEecc Confidence 9999987544321 1 1111111110 0 001111223333333333333333211101 111 245667766 Q ss_pred CCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC--CHHHHH-----HHH-------HHHHHHHHHHHHhhccccccCC Q lcl|NC_018086. 406 NLPQSYAELADMAVKL--RDMLPDETIINQFPWIT--DARQEV-----EKA-------DAQRQKRADIALQNFKQTSAVQ 469 (511) Q Consensus 406 ~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~--d~~~E~-----~ri-------~~E~~~~~~~~~~~~~~~~~~~ 469 (511) .-+.+..+..+. .++ .|+++.-.++.+++.-+ +-+.-+ ..+ ..+.+. .+.......... .. T Consensus 433 ~D~~~~~e~~~~-~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~~ 509 (563) T protein:vir:95 433 GDTKSATDKLNI-LKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGK-QKERLQMMMSLL-EG 509 (563) T ss_pred CCHHHHHHHHHH-HHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccc-cchhhhhccccc-CC Confidence 655544444432 233 48898888888876532 111000 000 000000 000000000000 00 Q ss_pred CCCCccccccCCC-------CCCccc----cccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 470 GASTAAANKLDKN-------PANTST----ITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 470 ~~~~~~~~~~~~~-------~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) ..+....+++.. .+.++. .+..+...+..++...-+|+.-= T Consensus 510 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 561 (563) T protein:vir:95 510 -DNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNKGQGRKGEKSSDF 561 (563) T ss_pred -CCCCCCCCCCCCCCCCccccccccccccccccccccCccccccccCcCcccc Confidence 000000000000 000010 11111122222222222222111 No 175 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=97.95 E-value=9.4e-06 Score=48.15 Aligned_cols=459 Identities=12% Similarity=0.031 Sum_probs=169.0 Q ss_pred CC-------Cccchhhccccc------CchhhHhhhhccCCCHHHHHHHHHHH-HHHHHHHHHH----HHHhcCCCcccc Q lcl|NC_018086. 1 MA-------IPNGQINAGDII------TTNIRRKHFIRRNFDLRELITLAEMH-SRSSSAYGVL----YDYYKGNHIAIQ 62 (511) Q Consensus 1 ~~-------~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~~----~~yY~G~~~~~~ 62 (511) || ++=++..+--+- ....+++.++.++....+|.+..+.- ......+... ..||.- .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~-~~--- 76 (563) T protein:vir:99 1 MADLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDK-RS--- 76 (563) T ss_pred ChhhhhhhhcccccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhccccccccc-cc--- Confidence 22 111222111111 11223333333333333333322110 0111111111 112211 10 Q ss_pred cCCcCccc-cc-cc-eec-cchHHHHHHHHHhhhhc-------------cCceec-----Cch---hhHHHHHHHHhc-- Q lcl|NC_018086. 63 SRTFDDTN-KP-NS-KIV-HNFPKLLVDTSTAYLAG-------------EPITES-----GDE---KTIKAMQPVFKE-- 115 (511) Q Consensus 63 ~~~~~~~~-~~-~~-ri~-~n~~k~ivd~~~~~l~g-------------~~~~~~-----~d~---~~~~~l~~~~~~-- 115 (511) ...++. .+ .. .+. ....+.+|++.++.+.. -++.+. ..+ .....+..++.. T Consensus 77 --~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~ 154 (563) T protein:vir:99 77 --YMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTG 154 (563) T ss_pred --CCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcC Confidence 011110 00 00 111 23444445544433221 122211 011 111223333321 Q ss_pred -c------ChhHHHHHHHHHHhhCCeEEEEee--eCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcc Q lcl|NC_018086. 116 -N------YVTDVNSEEVKLSGIFGHCFEIHW--IDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGH 185 (511) Q Consensus 116 -n------~~~~~~~~~~~~a~~~G~~~~~v~--~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 185 (511) + .+..+...+..+.+.+|.||+.+. .+..|++ .+..++|..+.+..+..... .....+++... .+. T Consensus 155 ~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~-~~~~~~y~~~~---~g~ 230 (563) T protein:vir:99 155 KDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKI-IKGGKRFVQVV---DKR 230 (563) T ss_pred CCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCce-eccceeEEEEe---CCc Confidence 1 244567778889999999988654 4556765 57788999988876654311 11111122111 111 Q ss_pred eEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 186 QIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSV 265 (511) Q Consensus 186 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~ 265 (511) .. ..+.+..++++... |-........|.|.++.+...+.....+..-.. T Consensus 231 ~~---~~~~~~evI~~~~~----------------------------~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~ 279 (563) T protein:vir:99 231 VV---ASFTSRELAMGIRN----------------------------PRTELSSSGYGLSEVEIAMKEFIAYNNTESFND 279 (563) T ss_pred ee---EEecCcceEEEecc----------------------------CCCCcccCcccchHHHHHHHHHHHHHHHHHHHH Confidence 11 12233332222110 000000122577777776666665555555555 Q ss_pred HHHHHhcCceeEee--cCC-CCcc--chhhhhhh--------hCce-eeecCCCceeeeecCCCHHHHHHHHHHHHHHHH Q lcl|NC_018086. 266 NDIAYWNDAYLWLQ--GFD-LSAD--SDSISNMK--------NDRV-IVTDEDGMVKFITKDVNDKHIENIKNRAKLDIF 331 (511) Q Consensus 266 ~~~~~~~~p~l~~~--G~~-~~~~--~~~~~~~~--------~~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~ 331 (511) +.+...+.|-.++. |.. .++. +.....+. .+++ +.++++.+.+.++.+.....+....+...+.|+ T Consensus 280 ~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia 359 (563) T protein:vir:99 280 RFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIIS 359 (563) T ss_pred HHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHH Confidence 55666666765543 432 2221 11111111 1222 456655555555555555667777888889999 Q ss_pred HHhCccccccccc--c----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCC Q lcl|NC_018086. 332 SLSQTPDLVSKDF--T----AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVR 405 (511) Q Consensus 332 ~~s~~p~~~~~~~--~----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~ 405 (511) ..-++|..-.+.. + ...|..+... . -.......+..+|.-+++.|...+...-.. .+. ..+.+.|.+ T Consensus 360 ~afgVPp~~lG~~~~~~~~~~~~~ss~~~s--n---~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~-~~~-~~~~~~f~r 432 (563) T protein:vir:99 360 ALYGIDPAEIGFPNRGGATGSKGGSTLNEA--D---PGKKQQQSQNKGLQPLLRFIEDLVNRHIIS-EYG-DKYTFQFVG 432 (563) T ss_pred HHhCCCHHHccccccccccccccccchhhc--c---HHHHHHHHHHHHHHHHHHHHHHHHHhhhch-hcc-cccEEEecc Confidence 9999987544321 1 1111111110 0 001111223333333333333333211101 111 245667766 Q ss_pred CCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC--CHHHHH-----HHH-------HHHHHHHHHHHHhhccccccCC Q lcl|NC_018086. 406 NLPQSYAELADMAVKL--RDMLPDETIINQFPWIT--DARQEV-----EKA-------DAQRQKRADIALQNFKQTSAVQ 469 (511) Q Consensus 406 ~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~--d~~~E~-----~ri-------~~E~~~~~~~~~~~~~~~~~~~ 469 (511) .-+.+..+..+. .++ .|+++.-.++.+++.-+ +-+.-+ ..+ ..+.+. .+.......... .. T Consensus 433 ~D~~~~~e~~~~-~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~~ 509 (563) T protein:vir:99 433 GDTKSATDKLNI-LKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGK-QKERLQMMMSLL-EG 509 (563) T ss_pred CCHHHHHHHHHH-HHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccc-cchhhhhccccc-CC Confidence 655544444432 233 48898888888876532 111000 000 000000 000000000000 00 Q ss_pred CCCCccccccCCC-------CCCccc----cccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 470 GASTAAANKLDKN-------PANTST----ITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 470 ~~~~~~~~~~~~~-------~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) ..+....+++.. .+.++. .+..+...+..++...-+|+.-= T Consensus 510 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 561 (563) T protein:vir:99 510 -DNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNKGQGRKGEKSSDF 561 (563) T ss_pred -CCCCCCCCCCCCCCCCccccccccccccccccccccCccccccccCcCcccc Confidence 000000000000 000010 11111122222222222222111 No 176 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=97.88 E-value=1.3e-05 Score=47.41 Aligned_cols=408 Identities=11% Similarity=0.011 Sum_probs=188.5 Q ss_pred CCCccchhhcccccCchhh--------------HhhhhccCCCHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCcccccCC Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIR--------------RKHFIRRNFDLRELITLAEMHS-RSSSAYGVLYDYYKGNHIAIQSRT 65 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~l~~~~~~~~-~~~~~~~~~~~yY~G~~~~~~~~~ 65 (511) |+.=+++- |.-+....- .....-..+++..+...+..-. ..+.++..+++..+- T Consensus 1 ~~~~~d~~--g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m~e--------- 69 (526) T protein:vir:99 1 MAQIVDVY--GNPIRTQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEE--------- 69 (526) T ss_pred CCeeECCC--CCccccccccchhhhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHh--------- Confidence 66555543 332211110 0011112344433333222111 112222222211110 Q ss_pred cCccccccceeccchHHHHHHHHHhhhhccCceec---Cc----hhhHHHHHHHHhcc-ChhHHHHHHHHHHhhCCeEE- Q lcl|NC_018086. 66 FDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITES---GD----EKTIKAMQPVFKEN-YVTDVNSEEVKLSGIFGHCF- 136 (511) Q Consensus 66 ~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~---~d----~~~~~~l~~~~~~n-~~~~~~~~~~~~a~~~G~~~- 136 (511) ......-.+.+...-+.+.++++. .+ .+..+.+.+++.+- +|...+..+ .+|.-||.++ T Consensus 70 -----------~D~~i~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~-lda~~~G~s~~ 137 (526) T protein:vir:99 70 -----------RDAHLFAEMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDA-LDGIGHGYSCI 137 (526) T ss_pred -----------hChHHHHHHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHH-HHhhhhcceeE Confidence 134455566666677777787763 12 23345567777653 577776664 4788899765 Q ss_pred EEeeeCCCCceEEE---EEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccc Q lcl|NC_018086. 137 EIHWIDRNKKHRFK---AVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREI 213 (511) Q Consensus 137 ~~v~~~~~g~~~i~---~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 213 (511) +++|...+|...+. ..+|... .|++..+.. +++. .....| ..+ T Consensus 138 Eivw~~~~g~~~~~~l~~r~~~~f--~~~~~~~~~----l~~~--~~~~~g-----~~l--------------------- 183 (526) T protein:vir:99 138 ELEWALQGREWMPLAFHHRPQSWF--QLNPEDQNE----LRLR--DNSPAG-----EAL--------------------- 183 (526) T ss_pred EEEEeecCCceeEEEeeeecccce--eeccCCCcE----EEec--CCCCCc-----eee--------------------- Confidence 56665545544333 3333221 122211110 0000 000000 001 Q ss_pred ccccccccccceeccCCccceEeec--CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccc---- Q lcl|NC_018086. 214 PEELEIKDYEVHPNLLQKFPVLEII--ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADS---- 287 (511) Q Consensus 214 ~~~~~~~~~~~~~~~~g~iPvv~~~--n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~---- 287 (511) .+++.|-.++-. .++.|.|.+..+.-..--=+..+.+++..++.|+.|+++.+=.....++ T Consensus 184 -------------~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~ 250 (526) T protein:vir:99 184 -------------QPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTADEEKAT 250 (526) T ss_pred -------------cCCCeEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCCHHHHHH Confidence 122222222211 2556788887765554444557888899999999999988732221211 Q ss_pred --hhhhhhhhCceeeecCCCceeeeecC-CCHHHHHHHHHHHHHHHHHHhCcccccccc-ccCccHHHH-HHHHHHHHHH Q lcl|NC_018086. 288 --DSISNMKNDRVIVTDEDGMVKFITKD-VNDKHIENIKNRAKLDIFSLSQTPDLVSKD-FTAASGQAL-KAATQPLENK 362 (511) Q Consensus 288 --~~~~~~~~~~~i~~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~~Sg~Ai-~~~~~~l~~k 362 (511) ....++....+..+|.+.++++++.. .....++.+++.+.+.|...--.-.++.+. .|+.+.-|+ +....-.... T Consensus 251 L~~av~~i~~d~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di 330 (526) T protein:vir:99 251 LLRAVTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNEVRHDL 330 (526) T ss_pred HHHHHHHHhhCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhHHHHHHHHHHHH Confidence 12234455678889999999999854 455678888898888887764332233221 122211122 2222223333 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHh--cc-CChHHHHHhCCCCC Q lcl|NC_018086. 363 SAVKESKFRKVLA-KRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR--DM-LPDETIINQFPWIT 438 (511) Q Consensus 363 ~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~--g~-~s~et~~~~l~~v~ 438 (511) ++.-.+.+...+. ++++.++.+ .......-..-..+.|....+.|.++.++.+.+++ |+ +|.+.+.+.++. + T Consensus 331 ~~aDa~~i~~tln~~Li~~l~~~---N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~Gi-p 406 (526) T protein:vir:99 331 LASDARQLAATLSRDLLWPLLVL---NRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKLGI-P 406 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHh---CCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCccCHHHHHHHhCC-C Confidence 4444455555663 355554432 22111111123567888889999999999998885 55 888888888875 2 Q ss_pred CHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 439 DARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 439 d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) .+... +.+ +...... . ......+... ...+.......++...-| T Consensus 407 ~~~~~-e~~-----------l~~~~~~-----~-~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~d 450 (526) T protein:vir:99 407 QPAKN-EPV-----------LRSAAQP-----A-ILSRQHGQRV-----------AALATIVGPRYGDQQALD 450 (526) T ss_pred CCCCc-ccc-----------cCCCCCC-----c-cccccccccc-----------ccccccccccCcchhhHH Confidence 22110 000 0000000 0 0000000000 000000000111111111 No 177 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=97.86 E-value=1.5e-05 Score=47.11 Aligned_cols=382 Identities=9% Similarity=0.011 Sum_probs=158.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHh----cCCCcccccCCcCccccc-cceeccchHHHHHHHHHhhhhccCceecCc Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSSAYGVLYDYY----KGNHIAIQSRTFDDTNKP-NSKIVHNFPKLLVDTSTAYLAGEPITESGD 102 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~~~~~~~~yY----~G~~~~~~~~~~~~~~~~-~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d 102 (511) |.++.|...++. . .+..+. .+-.+...........-. +.=+...-....|+..++-+-.-|+.+-.. T Consensus 1 ~~~~~~~~~~k~---~-----~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~ 72 (409) T protein:vir:94 1 MAKENIVTRIKK---K-----LIDNWIDQSASKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYED 72 (409) T ss_pred Ccccccchhhhh---H-----HhhhhhcCCcccccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeec Confidence 222222221111 0 011111 111110000000000000 000112333444555555555556654211 Q ss_pred -hhhHHHHHHHHh--ccC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEE Q lcl|NC_018086. 103 -EKTIKAMQPVFK--ENY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYY 175 (511) Q Consensus 103 -~~~~~~l~~~~~--~n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~ 175 (511) +.....+..++. -|. -......+....+.+|.||+++..+..|++ .+..++|..+.++.++.... + + T Consensus 73 ~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~--~----~ 146 (409) T protein:vir:94 73 YKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRE--L----Y 146 (409) T ss_pred ccccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcE--E----E Confidence 112222344443 232 334556778888999999999988888875 57778898887776554321 1 1 Q ss_pred EEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHH Q lcl|NC_018086. 176 NTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLID 255 (511) Q Consensus 176 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d 255 (511) |... ...+..+ .+.++.++|++.- ++ .+.-.|.|.+..+...++ T Consensus 147 y~~~-~~~g~~~----~~~~~dvih~r~~----------------------~~---------~~~~~G~s~l~~~~~~i~ 190 (409) T protein:vir:94 147 YSIH-AATGNKL----IVHNMDMLHFKHI----------------------VA---------SNMVQGISPIDVLKNTTD 190 (409) T ss_pred EEEE-cCCceEE----EEccccEEEecCC----------------------CC---------CCccccccHHHHHHHHHH Confidence 1111 1122211 2334444444210 00 011247777766655555 Q ss_pred HHHHHHHHHHHHHHHhcCce-eEe-ecCCCCccchhhhhh---------hhCceeeecCCCceeeeecCCCHHHHHHHHH Q lcl|NC_018086. 256 AYNLAVSDSVNDIAYWNDAY-LWL-QGFDLSADSDSISNM---------KNDRVIVTDEDGMVKFITKDVNDKHIENIKN 324 (511) Q Consensus 256 ~~~~~~s~~~~~~~~~~~p~-l~~-~G~~~~~~~~~~~~~---------~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (511) ........ .+..+..+- .++ .+...+ ++....+ ..++++.++++.+++.+........+.+..+ T Consensus 191 ~~~~~~~~---~~~~~~~~~~~i~~~~~~l~--~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~ 265 (409) T protein:vir:94 191 FDNAVRTF---NLTEMQKPDSFMLKYGSNVG--KEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASEN 265 (409) T ss_pred HHHHHHHH---HHHhcCCCCeeEEecCCCCC--HHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHH Confidence 44433221 233333332 222 232222 2222211 1234666766656555544444455566677 Q ss_pred HHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCccc-cccceeEE Q lcl|NC_018086. 325 RAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMN-KAKDL-KPYEVTPV 402 (511) Q Consensus 325 ~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~-~~~~~-~~~~i~i~ 402 (511) ...+.|+..-++|+.-.+..+..+...++.... .++..+|.-+++.|...+...- ..... ....+++. T Consensus 266 ~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~----------~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~i~fd 335 (409) T protein:vir:94 266 LTRERVANVFQLPSVFLNARSNTNFAKNEELNR----------FYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFN 335 (409) T ss_pred HHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH----------HHHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEee Confidence 777888888888876555433333333322211 2222233333333333322211 00011 11224444 Q ss_pred eCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccC Q lcl|NC_018086. 403 FVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLD 480 (511) Q Consensus 403 f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (511) ...-+..|..+.++++.++ .|+++.-.++..++.-+-+.. +... -..+. .+.+.....+.. T Consensus 336 ~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gg--D~~~-----------~~~n~----~~~~~~~~~~~~ 398 (409) T protein:vir:94 336 VKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGG--DKPL-----------ISGDL----YPIDTPLELRKS 398 (409) T ss_pred chhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--CeEe-----------ecccc----cccccchhhccc Confidence 4455567888899998887 578888778887765321100 0000 00000 000000000000 Q ss_pred CCCCCcccccc Q lcl|NC_018086. 481 KNPANTSTITT 491 (511) Q Consensus 481 ~~~~~~~~~~~ 491 (511) .++|..+...+ T Consensus 399 ~kGG~~n~~e~ 409 (409) T protein:vir:94 399 LKGGDKNVNES 409 (409) T ss_pred ccCCCCCcCCC Confidence 01111110000 No 178 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=97.84 E-value=1.6e-05 Score=46.92 Aligned_cols=385 Identities=9% Similarity=0.001 Sum_probs=158.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHH-HHhcCCCcccccCCcCccccc-cceeccchHHHHHHHHHhhhhccCceecCc-hh Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSSAYGVLY-DYYKGNHIAIQSRTFDDTNKP-NSKIVHNFPKLLVDTSTAYLAGEPITESGD-EK 104 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~~~~~~~-~yY~G~~~~~~~~~~~~~~~~-~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d-~~ 104 (511) |..+-|.+.++ ..+ +..+. .-..+-++...........-. +.-+...-....|+..++-+-.-|+.+-.. +. T Consensus 1 ~~~~~~~~~~k---~~~--~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~V~~ci~~ia~~ia~lp~~~~~~~~~ 75 (409) T protein:vir:96 1 MAKENIVTRIK---KKL--IDNWIDQSASKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKV 75 (409) T ss_pred Cccccchhhhh---hHH--hhhhhccccccccccccccCccccccchhhHhhhHHHHHHHHHHHHhhhhCceEEeecccc Confidence 22111111111 000 00110 000110110000000000000 000112223344455555444456654211 12 Q ss_pred hHHHHHHHHh--ccC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEEEE Q lcl|NC_018086. 105 TIKAMQPVFK--ENY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTV 178 (511) Q Consensus 105 ~~~~l~~~~~--~n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~ 178 (511) ....+.+++. -|. -......+..+.+.+|.||+++..+..|++ .+..++|..+.++.++.... + +|.. T Consensus 76 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~-----~-~y~~ 149 (409) T protein:vir:96 76 VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRE-----L-YYSI 149 (409) T ss_pred cchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcE-----E-EEEE Confidence 2223444443 232 234556778889999999999988888875 56678888887776543321 1 1111 Q ss_pred eecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHH Q lcl|NC_018086. 179 ISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYN 258 (511) Q Consensus 179 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~ 258 (511) . ...+.. ..+.++.++|++.- + | .+.-.|.|.+..+...++..+ T Consensus 150 ~-~~~g~~----~~~~~~evih~r~~----------------------~-----~----~~~~~G~s~l~~~~~~i~~~~ 193 (409) T protein:vir:96 150 H-AATGNK----LIVHNMDMLHFKHI----------------------V-----A----SNMVQGISPIDVLKNTTDFDN 193 (409) T ss_pred E-cCCceE----EEEccccEEEeCCC----------------------C-----C----CCccccccHHHHHHHHHHHHH Confidence 1 111211 12334444443210 0 0 011247777766655555433 Q ss_pred HHHHHHHHHHHHhcCce--eEeecCCCCccchhhhhh--------h-hCceeeecCCCceeeeecCCCHHHHHHHHHHHH Q lcl|NC_018086. 259 LAVSDSVNDIAYWNDAY--LWLQGFDLSADSDSISNM--------K-NDRVIVTDEDGMVKFITKDVNDKHIENIKNRAK 327 (511) Q Consensus 259 ~~~s~~~~~~~~~~~p~--l~~~G~~~~~~~~~~~~~--------~-~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 327 (511) .... . .+..++.+- ++..+...++ +....+ . .++++.++++.+++.+..+.....+.+..+... T Consensus 194 ~~~~-~--~~~~~~~~~~~i~~~~~~l~~--e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~ 268 (409) T protein:vir:96 194 AVRT-F--NLTEMQKPDSFMLKYGSNVST--EKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTR 268 (409) T ss_pred HHHH-H--HHHhcCCCceeEEecCCCCCH--HHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHH Confidence 3221 1 222333332 2223333222 222211 1 234666766666665554444555666677778 Q ss_pred HHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-Cccc-cccceeEEeCC Q lcl|NC_018086. 328 LDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNK-AKDL-KPYEVTPVFVR 405 (511) Q Consensus 328 ~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~-~~~~-~~~~i~i~f~~ 405 (511) +.|+..-++|+.-.+....++...++... ..++...|.-+++.|...+...-- .... ....+++.... T Consensus 269 ~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~----------~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~ 338 (409) T protein:vir:96 269 ERVANVFQLPSIFLNARSNTNFAKNEELN----------RFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKS 338 (409) T ss_pred HHHHHHhCCCHHHhCCCCCCCcccHHHHH----------HHHHHHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechh Confidence 88999999987655543333322222111 122233344443333333322110 0011 11224444455 Q ss_pred CCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCC Q lcl|NC_018086. 406 NLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNP 483 (511) Q Consensus 406 ~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 483 (511) -+-.|..+.++++.++ .|+++.-.+++.++.-+-+.. +.+ .-..+. .+.+.....+...++ T Consensus 339 ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~gg--D~~-----------~~~~n~----~~~~~~~~~~~~~~g 401 (409) T protein:vir:96 339 YLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGG--DKP-----------LISGDL----YPIDTPLELRKSLKG 401 (409) T ss_pred hhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCc--cee-----------eecccc----cccccchhhcccccC Confidence 5566888999998887 478888888888764321100 000 000000 000000000000111 Q ss_pred CCcccccc Q lcl|NC_018086. 484 ANTSTITT 491 (511) Q Consensus 484 ~~~~~~~~ 491 (511) |+.+...+ T Consensus 402 G~~n~~e~ 409 (409) T protein:vir:96 402 GDKNVNES 409 (409) T ss_pred CCCCcCCC Confidence 11111111 No 179 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=97.81 E-value=1.7e-05 Score=46.68 Aligned_cols=393 Identities=11% Similarity=0.038 Sum_probs=182.9 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhc-CC----C-cccccCCcCccccccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYK-GN----H-IAIQSRTFDDTNKPNS 74 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~-G~----~-~~~~~~~~~~~~~~~~ 74 (511) |.-.| +..+++........ +.+...+.... +.-++-- |- . .++...-......... T Consensus 1 m~~~i--------~~~~g~p~~~~~~~---~~~~~~ia~~~-------~~~~~~~~~~~~~~~~~iLr~~~~~~~~y~~m 62 (491) T protein:vir:10 1 MSKGL--------WVSPTEFVTFGEPD---KSLSSQIATRA-------RSIDFFALGMYLPNPDPVLKALGKDIRVYREL 62 (491) T ss_pred CCCce--------eCCCCCccCcccCC---hHHHHHHHhhh-------cccccccccCCccchHHHHHhcCCCHHHHHHH Confidence 44433 33333333222111 11122221000 0000000 00 0 0000000000000000 Q ss_pred eeccchHHHHHHHHHhhhhccCceec---CchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEE-EEeeeCCCCceEE- Q lcl|NC_018086. 75 KIVHNFPKLLVDTSTAYLAGEPITES---GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCF-EIHWIDRNKKHRF- 149 (511) Q Consensus 75 ri~~n~~k~ivd~~~~~l~g~~~~~~---~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~-~~v~~~~~g~~~i- 149 (511) ....+..-.+++...-+.+.++.+. .+++..+.+.+.+.+-.|+..+..+. ++.-||.++ +++|...+|...+ T Consensus 63 -~~D~~i~s~l~~Rk~av~~~~w~i~~~~~~~~~~e~v~e~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~ 140 (491) T protein:vir:10 63 -RADAHVGGCVRRRKAAVKALEWGLDRGKAKSRVAKSIADVFADLDLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPI 140 (491) T ss_pred -hhChHHHHHHHHHHHHHhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEE Confidence 1356677788888888888898884 34456678888888888888887764 788899765 5667554555443 Q ss_pred --EEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceec Q lcl|NC_018086. 150 --KAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPN 227 (511) Q Consensus 150 --~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (511) ..++|... .|++... + ++....+.. ...... T Consensus 141 ~l~~r~~~~f--~~d~~~~--l------------------------------~~~~~~~~~-------------~g~~l~ 173 (491) T protein:vir:10 141 DVVGKPADWF--VYDPENQ--L------------------------------RFRSKDHWM-------------QGEELP 173 (491) T ss_pred Eeeeecccce--eeccCCc--e------------------------------EEecCCCCC-------------Ccceec Confidence 33333322 2222111 0 000000000 000011 Q ss_pred cCCccceEee--cCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccc------hhhhhhhhCcee Q lcl|NC_018086. 228 LLQKFPVLEI--IANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADS------DSISNMKNDRVI 299 (511) Q Consensus 228 ~~g~iPvv~~--~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~------~~~~~~~~~~~i 299 (511) +++.|-.++- ..++.|.|.+..+....---+..+.+++..++.|+.|+++.+-.....++ ....++....++ T Consensus 174 ~~k~i~~~~~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~l~~al~~~~~~a~~ 253 (491) T protein:vir:10 174 ARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASDGEKNLLLDCLEDMVQDAVA 253 (491) T ss_pred CCCEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhcCcEE Confidence 1222221211 12567888888877777777788899999999999999987742221211 122345556788 Q ss_pred eecCCCceeeeecCC---CHHHHHHHHHHHHHHHHHHhCcccccccccc-CccHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 300 VTDEDGMVKFITKDV---NDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-AASGQALKAATQPLENKSAVKESKFRKVLA 375 (511) Q Consensus 300 ~~~~~~~~~~~~~~~---~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~ 375 (511) .+|.+.++++++... +...++.+++.+.+.|...--.-.++.+..| .+.|.. . ..-....++.-.+.+...+. T Consensus 254 viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~~gs~a~~~v-h--~~v~~di~~~D~~~i~~tln 330 (491) T protein:vir:10 254 VVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLGQNQTTEATSTRASAQA-G--LEVTDDIRDGDKAVVSEAMN 330 (491) T ss_pred EecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhhhhcccCcccchhHHHH-H--HHHHHHHHHHHHHHHHHHHH Confidence 899999999987643 3446778888887776665432222222222 111211 1 11222333333455666666 Q ss_pred HHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHh--cc-CChHHHHHhCCCCCCHHHHHHHHHHHHH Q lcl|NC_018086. 376 KRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR--DM-LPDETIINQFPWITDARQEVEKADAQRQ 452 (511) Q Consensus 376 ~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~--g~-~s~et~~~~l~~v~d~~~E~~ri~~E~~ 452 (511) ++++-++. ...... . ...+.|.... .+....++.+.+++ |+ ++.+.+.+.++. +.++.+.. T Consensus 331 ~li~~l~~---~N~~~~--~--~p~f~~~~~~-e~~~~~a~~~~~L~~~G~~i~~~~i~e~~Gi-p~~~~~~~------- 394 (491) T protein:vir:10 331 MLIRWICD---LNFDGA--D--RPVFDMWEQE-QVDEIQAGRDQKLTQAGARFTPAYFKRAYNL-QDGDLDER------- 394 (491) T ss_pred HHHHHHHH---hcCCCC--C--cceEEecCcC-chhHHHHHHHHHHHhCCCcCCHHHHHHHhCC-CCCCcCcc------- Confidence 65554443 222221 1 2345665432 33466788888875 55 788888787774 32211100 Q ss_pred HHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 453 KRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 453 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) . .+..... +..... ..+...+..+.-| T Consensus 395 -----~---~~~~~~~--------------~~~~~~----------~~~~~~~~~~~~d 421 (491) T protein:vir:10 395 -----P---LPVSAVD--------------TVGAAS----------FAEFEAPDQDALD 421 (491) T ss_pred -----c---cccCCCC--------------Cccccc----------ccccCCCCCCchH Confidence 0 0000000 000000 0000000011111 No 180 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=97.80 E-value=1.9e-05 Score=46.52 Aligned_cols=426 Identities=9% Similarity=0.000 Sum_probs=184.4 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHH----HHHHHHHHHhcCCCcccccCCcCcccccccee Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSS----SAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKI 76 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~----~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri 76 (511) |..+-++ ...+....|.+..+.++.++ .+++.+.+|..-. .... ........++ T Consensus 1 ~~~~~~~-----------------~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~---~~~~--~~~~~~~~~~ 58 (516) T protein:vir:10 1 MKQSTDL-----------------EYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPY---LMND--KGDNETSQNG 58 (516) T ss_pred CCchhhH-----------------hhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhccc---ccCC--CCCccccccc Confidence 2222111 12233444555555554333 3444444554441 1111 1112222356 Q ss_pred ccchHHHHHHHHHhhhhcc------Cc-eecCchh--------------hH-------HHHHHHHhccChhHHHHHHHHH Q lcl|NC_018086. 77 VHNFPKLLVDTSTAYLAGE------PI-TESGDEK--------------TI-------KAMQPVFKENYVTDVNSEEVKL 128 (511) Q Consensus 77 ~~n~~k~ivd~~~~~l~g~------~~-~~~~d~~--------------~~-------~~l~~~~~~n~~~~~~~~~~~~ 128 (511) ..+-+...++.+++-|++- || ++...+. .. ..+...+..++|.....++.++ T Consensus 59 ~dstg~~a~~~LAa~l~~~ltpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~ 138 (516) T protein:vir:10 59 WQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKH 138 (516) T ss_pred ccchHHHHHHHHHHHHHhhhcCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHH Confidence 6677777788777777652 22 2221111 11 1244567788999999999999 Q ss_pred HhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeec----------------CCcceEEEEEE Q lcl|NC_018086. 129 SGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISD----------------ITGHQIRTYEV 192 (511) Q Consensus 129 a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~----------------~~~~~~~~~~~ 192 (511) ..++|.|.++ .++++.++ .++ ..-+++--+... .+...++....... ...+....+++ T Consensus 139 L~~~G~a~l~--~d~~~~~~--~~p-l~~y~v~~d~~G-~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i 212 (516) T protein:vir:10 139 LIVAGSCMLY--KPSKGAIS--AIP-MHHYVVNRDTNG-DLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKL 212 (516) T ss_pred HHhHCeEeEE--ecCCCCeE--EEE-cCeEEEeeCCCC-CeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEE Confidence 9999998654 46665544 443 333445444443 34445544331100 00111223333 Q ss_pred Ec-----CCcEEEEEEccCcccccccccccccccccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHH Q lcl|NC_018086. 193 YT-----EDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVS 262 (511) Q Consensus 193 ~~-----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s 262 (511) |+ ++..+.+....++. .. ......+|..+|++.++ ++.+|+|-..+..+-+..+|.+.- T Consensus 213 ~t~v~~~~~~~~~~~~~~d~~---------~~--~~~s~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~ 281 (516) T protein:vir:10 213 YTHAKYLGEGFWELKQSADDI---------PV--GKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSE 281 (516) T ss_pred EEEEEecCCCceEEEEeeCce---------ee--ccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHH Confidence 32 22211111111111 11 11122234556766543 356899999999999999998888 Q ss_pred HHHHHHHHhcCceeEeecCCCCccchhhh-hhhhCceeeecCCCceeeeec--CCCHHHHHHHHHHHHHHHHHHhCcccc Q lcl|NC_018086. 263 DSVNDIAYWNDAYLWLQGFDLSADSDSIS-NMKNDRVIVTDEDGMVKFITK--DVNDKHIENIKNRAKLDIFSLSQTPDL 339 (511) Q Consensus 263 ~~~~~~~~~~~p~l~~~G~~~~~~~~~~~-~~~~~~~i~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~ 339 (511) ...........|.+.+.-... ..... .....+.+......+++.+.. ..+.......++.++..|...-....+ T Consensus 282 ~~l~~~~~a~~~~~lv~p~g~---~~~~~l~~~~~g~~~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l 358 (516) T protein:vir:10 282 AVARGAALMADIKYLIRPGAQ---TDVDHFVNSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETM 358 (516) T ss_pred HHHHHHHHhcCCCcccCcccc---cchhhhccCCCceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhhh Confidence 888888888887765531111 01111 111123343223334555443 235566667777777776554332211 Q ss_pred ccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-H----HHHHHHhcCCCccccccceeEEeCCCCCcCHHHH Q lcl|NC_018086. 340 VSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYE-L----VCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAEL 414 (511) Q Consensus 340 ~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~-l----i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~ 414 (511) ..-.....|++.+. .+..++...++..+.++-. + |...+........-.. +.+.. ..+.+.... T Consensus 359 ~~rd~~rvTAtEV~-------~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~~p~~P~~l--v~~~~--v~~i~~L~r 427 (516) T protein:vir:10 359 TRRDAERVTAVEIQ-------RDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGDSFTSDL--VDPVI--ITGIEALGR 427 (516) T ss_pred hccCCccccHHHHH-------HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhCCCCChhh--cCcce--ehhHHHHHH Confidence 11112235776655 4555666666666655422 1 1111111111111011 11111 111222222 Q ss_pred HHH---H-------HHHhccCC-----------hHHHHHhCCC---CCCHHHHHHHHHHHHHHHHHHHHhhccccccCCC Q lcl|NC_018086. 415 ADM---A-------VKLRDMLP-----------DETIINQFPW---ITDARQEVEKADAQRQKRADIALQNFKQTSAVQG 470 (511) Q Consensus 415 a~~---~-------~~~~g~~s-----------~et~~~~l~~---v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 470 (511) ++. + ..+.++-+ .+.+...++- +-...+|++.+++++.+........-+......| T Consensus 428 aq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~~~~~a~~~gvp~~~irs~eev~~~r~~~~~~q~~~~~~~~~~~~~~~ 507 (516) T protein:vir:10 428 MAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMEQEQEAQMQAQQAQMLEEGVAKAVPG 507 (516) T ss_pred HHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Confidence 211 1 11112211 1222333321 1123566766666554333322111000011111 Q ss_pred CCCcccccc Q lcl|NC_018086. 471 ASTAAANKL 479 (511) Q Consensus 471 ~~~~~~~~~ 479 (511) .....-.+. T Consensus 508 ~~~~~~~~~ 516 (516) T protein:vir:10 508 VIQQELKEA 516 (516) T ss_pred hhhhhhhcC Confidence 111111111 No 181 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=97.78 E-value=2e-05 Score=46.36 Aligned_cols=374 Identities=9% Similarity=0.039 Sum_probs=165.9 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccch Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNF 80 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~ 80 (511) |.|+|+..+......+.. .......... ....+...+.|.. ....... .=+..+- T Consensus 1 m~m~~f~~~~~~~~~~~~--~~~~~~~~~~---------------~~~~~~~~~~~~~-----~~~v~~~---~al~~~~ 55 (392) T protein:vir:39 1 MILPILNFINQTNDPPEV--GSVQSYFPDG---------------NDAQIMESLLGDN-----NEWVSAR---AALRNSD 55 (392) T ss_pred Ccchhhhhhhcccccccc--cccccccccC---------------chhhhhhhhcCCC-----CceechH---HhhccHH Confidence 888888654432211100 0000000000 0000001111110 0000000 0011233 Q ss_pred HHHHHHHHHhhhhccCceecCchhhHHHHHH-HHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceE Q lcl|NC_018086. 81 PKLLVDTSTAYLAGEPITESGDEKTIKAMQP-VFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCL 158 (511) Q Consensus 81 ~k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~-~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~ 158 (511) ....|+..++-+-.-|+++..... ..|.+ -............+..+.+.+|.||+++..+..|++ .+..++|..+. T Consensus 56 v~~~i~~ia~~ia~lp~~~~~~~~--~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~ 133 (392) T protein:vir:39 56 LFSIILQLSSDLAIVKINAEKKKN--QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVN 133 (392) T ss_pred HHHHHHHHHHhhccCceeeccchh--hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeE Confidence 344566566655556776542221 11111 111112345566778899999999999988888886 67778888887 Q ss_pred EEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeec Q lcl|NC_018086. 159 IAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEII 238 (511) Q Consensus 159 ~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~ 238 (511) +..+.... .+ +|....... .......+.++.++|+..-. .. T Consensus 134 ~~~~~~~~-~~-----~y~~~~~~~--~~~~~~~~~~~eiih~~~~~-------------------------------~~ 174 (392) T protein:vir:39 134 TYYFEYEN-GM-----YYNITFDDP--KIEPILQAPQSDLIHMKLLS-------------------------------ID 174 (392) T ss_pred EEEcCCCc-eE-----EEEEEecCc--ccceeEEEccccEEEecCCC-------------------------------CC Confidence 76654321 11 111111111 11111223444444442100 00 Q ss_pred CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEee--cCCCCccchhhhhh--------hhCceeeecCCCcee Q lcl|NC_018086. 239 ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQ--GFDLSADSDSISNM--------KNDRVIVTDEDGMVK 308 (511) Q Consensus 239 n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~--G~~~~~~~~~~~~~--------~~~~~i~~~~~~~~~ 308 (511) ....|.|.+..+...++....+..-....+...+.|-.+++ +....+ ++....+ ..++++.++++.+++ T Consensus 175 ~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~-~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~ 253 (392) T protein:vir:39 175 GGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLS-DKDKASRSRSFMKRSRSGGPVVLDDLEEFT 253 (392) T ss_pred CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCch-HHHHHHHHHHHhccccCCCeeecCCCceEE Confidence 11247787777766666555555555555566666654443 322111 1111111 123567777666666 Q ss_pred eeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 309 FITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA-ASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEF 387 (511) Q Consensus 309 ~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 387 (511) .+........+.+..+...+.|+..-++|..-.+..+. .|.. + .....+..+|.-+++.|...+.. T Consensus 254 ~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~--~-----------~~~~f~~~~l~P~~~~ie~~l~~ 320 (392) T protein:vir:39 254 ALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSI--Q-----------QISGMYASALNRYLRPAISELEY 320 (392) T ss_pred EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHH--H-----------HHHHHHHHHHHHHHHHHHHHHHH Confidence 55545555566777888888999988988655543322 2211 1 11223344444444444433332 Q ss_pred cCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018086. 388 MNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQF---PWITDARQEVEKADAQRQKRADIALQNF 462 (511) Q Consensus 388 ~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l---~~v~d~~~E~~ri~~E~~~~~~~~~~~~ 462 (511) .-. . ++++......-.|..+.+..+.++ +|+++...++..+ |+.+| |+.+ ..+. T Consensus 321 ~L~-~-----~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~---e~r~------------~e~l 379 (392) T protein:vir:39 321 KLS-D-----HISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK---DLPA------------PENT 379 (392) T ss_pred hcc-c-----cccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcc---ccch------------hcCC Confidence 110 0 122222223334566777777776 4788887666544 44321 1111 0011 Q ss_pred cccccCCCCCCccccccCCCCCCccccccCCC Q lcl|NC_018086. 463 KQTSAVQGASTAAANKLDKNPANTSTITTTDP 494 (511) Q Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (511) +. ..|+ ++ .+|. | T Consensus 380 ~~---~~~G-----d~--~~p~---------p 392 (392) T protein:vir:39 380 NK---KTTG-----QS--NEPV---------P 392 (392) T ss_pred CC---CCCC-----CC--CCCC---------C Confidence 00 0000 00 0000 0 No 182 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=97.78 E-value=2e-05 Score=46.36 Aligned_cols=374 Identities=9% Similarity=0.039 Sum_probs=165.9 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccch Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNF 80 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~ 80 (511) |.|+|+..+......+.. .......... ....+...+.|.. ....... .=+..+- T Consensus 1 m~m~~f~~~~~~~~~~~~--~~~~~~~~~~---------------~~~~~~~~~~~~~-----~~~v~~~---~al~~~~ 55 (392) T protein:vir:10 1 MILPILNFINQTNDPPEV--GSVQSYFPDG---------------NDAQIMESLLGDN-----NEWVSAR---AALRNSD 55 (392) T ss_pred Ccchhhhhhhcccccccc--cccccccccC---------------chhhhhhhhcCCC-----CceechH---HhhccHH Confidence 888888654432211100 0000000000 0000001111110 0000000 0011233 Q ss_pred HHHHHHHHHhhhhccCceecCchhhHHHHHH-HHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceE Q lcl|NC_018086. 81 PKLLVDTSTAYLAGEPITESGDEKTIKAMQP-VFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCL 158 (511) Q Consensus 81 ~k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~-~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~ 158 (511) ....|+..++-+-.-|+++..... ..|.+ -............+..+.+.+|.||+++..+..|++ .+..++|..+. T Consensus 56 v~~~i~~ia~~ia~lp~~~~~~~~--~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~ 133 (392) T protein:vir:10 56 LFSIILQLSSDLAIVKINAEKKKN--QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVN 133 (392) T ss_pred HHHHHHHHHHhhccCceeeccchh--hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeE Confidence 344566566655556776542221 11111 111112345566778899999999999988888886 67778888887 Q ss_pred EEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeec Q lcl|NC_018086. 159 IAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEII 238 (511) Q Consensus 159 ~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~ 238 (511) +..+.... .+ +|....... .......+.++.++|+..-. .. T Consensus 134 ~~~~~~~~-~~-----~y~~~~~~~--~~~~~~~~~~~eiih~~~~~-------------------------------~~ 174 (392) T protein:vir:10 134 TYYFEYEN-GM-----YYNITFDDP--KIEPILQAPQSDLIHMKLLS-------------------------------ID 174 (392) T ss_pred EEEcCCCc-eE-----EEEEEecCc--ccceeEEEccccEEEecCCC-------------------------------CC Confidence 76654321 11 111111111 11111223444444442100 00 Q ss_pred CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEee--cCCCCccchhhhhh--------hhCceeeecCCCcee Q lcl|NC_018086. 239 ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQ--GFDLSADSDSISNM--------KNDRVIVTDEDGMVK 308 (511) Q Consensus 239 n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~--G~~~~~~~~~~~~~--------~~~~~i~~~~~~~~~ 308 (511) ....|.|.+..+...++....+..-....+...+.|-.+++ +....+ ++....+ ..++++.++++.+++ T Consensus 175 ~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~-~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~ 253 (392) T protein:vir:10 175 GGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLS-DKDKASRSRSFMKRSRSGGPVVLDDLEEFT 253 (392) T ss_pred CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCch-HHHHHHHHHHHhccccCCCeeecCCCceEE Confidence 11247787777766666555555555555566666654443 322111 1111111 123567777666666 Q ss_pred eeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 309 FITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA-ASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEF 387 (511) Q Consensus 309 ~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 387 (511) .+........+.+..+...+.|+..-++|..-.+..+. .|.. + .....+..+|.-+++.|...+.. T Consensus 254 ~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~--~-----------~~~~f~~~~l~P~~~~ie~~l~~ 320 (392) T protein:vir:10 254 ALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSI--Q-----------QISGMYASALNRYLRPAISELEY 320 (392) T ss_pred EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHH--H-----------HHHHHHHHHHHHHHHHHHHHHHH Confidence 55545555566777888888999988988655543322 2211 1 11223344444444444433332 Q ss_pred cCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018086. 388 MNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQF---PWITDARQEVEKADAQRQKRADIALQNF 462 (511) Q Consensus 388 ~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l---~~v~d~~~E~~ri~~E~~~~~~~~~~~~ 462 (511) .-. . ++++......-.|..+.+..+.++ +|+++...++..+ |+.+| |+.+ ..+. T Consensus 321 ~L~-~-----~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~---e~r~------------~e~l 379 (392) T protein:vir:10 321 KLS-D-----HISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK---DLPA------------PENT 379 (392) T ss_pred hcc-c-----cccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcc---ccch------------hcCC Confidence 110 0 122222223334566777777776 4788887666544 44321 1111 0011 Q ss_pred cccccCCCCCCccccccCCCCCCccccccCCC Q lcl|NC_018086. 463 KQTSAVQGASTAAANKLDKNPANTSTITTTDP 494 (511) Q Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (511) +. ..|+ ++ .+|. | T Consensus 380 ~~---~~~G-----d~--~~p~---------p 392 (392) T protein:vir:10 380 NK---KTTG-----QS--NEPV---------P 392 (392) T ss_pred CC---CCCC-----CC--CCCC---------C Confidence 00 0000 00 0000 0 No 183 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=97.78 E-value=2e-05 Score=46.33 Aligned_cols=382 Identities=11% Similarity=-0.021 Sum_probs=170.8 Q ss_pred hhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCcee-- Q lcl|NC_018086. 22 HFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITE-- 99 (511) Q Consensus 22 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~-- 99 (511) .+ +.+...++...+...+...-.+-|... ....... ..=+...-....|+..++-+-+-|+.+ T Consensus 1 m~---------f~~~~~~~~~~~~~~~~~~~~~~g~~~---~~~~v~~---~~al~~~~v~~~i~~ia~~ia~lp~~~~~ 65 (409) T protein:vir:10 1 ML---------FRKGFKNQSQEISIDDKKILEWLGINP---SETYVNG---KSCLKQATVFGCIRILSDNISKLPIKIYQ 65 (409) T ss_pred Cc---------ccccccCcCCCCCCChHHHHHHhcCCc---Ccceech---hhhhccHHHHHHHHHHHHhhhhCceEEEE Confidence 11 111111111100000000000111100 0000000 000112333445566666555567654 Q ss_pred cCch---hhHHHHHHHHh--ccC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceE Q lcl|NC_018086. 100 SGDE---KTIKAMQPVFK--ENY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPV 170 (511) Q Consensus 100 ~~d~---~~~~~l~~~~~--~n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~ 170 (511) ..+. .....+..++. -|. .......+..+.+.+|.||+++..+..|++ .+..++|..+-++.++....... T Consensus 66 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~ 145 (409) T protein:vir:10 66 KKDGIKRVPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLLNSE 145 (409) T ss_pred ecCCeeeccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCcccccc Confidence 2111 11122344443 232 335567788889999999999988888876 57778888887776553211111 Q ss_pred EEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHH Q lcl|NC_018086. 171 AAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQ 250 (511) Q Consensus 171 ~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v 250 (511) .-+.|.. ....+.. ..+.++.+++++.-. .+...|.|.++.+ T Consensus 146 ~~~~y~~--~~~~g~~----~~~~~~evih~r~~~--------------------------------~d~~~G~s~i~~~ 187 (409) T protein:vir:10 146 NNVWYLY--TDDLGQR----HKFMSDEILHFKGLT--------------------------------ADGLAGLSVIELL 187 (409) T ss_pred ceEEEEE--EeCCcee----EEeccccEEEecCcC--------------------------------CCCcccccHHHHH Confidence 1111111 1111111 123344444432100 0112477777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhh------------hhCceeeecCCCceeeeecCCCHHH Q lcl|NC_018086. 251 LSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNM------------KNDRVIVTDEDGMVKFITKDVNDKH 318 (511) Q Consensus 251 ~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~------------~~~~~i~~~~~~~~~~~~~~~~~~~ 318 (511) ...++............++..+.|-.+++-.. .-.++....+ ..++++.++++.+.+.+..+..... T Consensus 188 ~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q 266 (409) T protein:vir:10 188 NHLIENGKSSETYLNNFFKNGLQVKGLVQYAG-DLNPEAEEVFKENFERMSSGLKNAHRIAMLPIGYKFEPISQKLVDAQ 266 (409) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCcEEEEcCC-CCCHHHHHHHHHHHHHHhccccccCCceecCCCceEEEccCChhhHH Confidence 76666655555555555666666766655322 1112111111 1234666766666665555555556 Q ss_pred HHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-Ccc-ccc Q lcl|NC_018086. 319 IENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNK-AKD-LKP 396 (511) Q Consensus 319 ~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~-~~~-~~~ 396 (511) +.+..+...+.|+..-++|....+..+..++..++... ...+..+|.-+++.|...+...-. ... ... T Consensus 267 ~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~----------~~f~~~~l~P~~~~ie~~ln~kL~~~~~~~~~ 336 (409) T protein:vir:10 267 FLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQN----------REFYIDTLQSILNMYELEINYKLFLISEIKNG 336 (409) T ss_pred HHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcCchhccCC Confidence 66777888889999999987666543333333322221 123333444444444333332111 111 112 Q ss_pred cceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCc Q lcl|NC_018086. 397 YEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTA 474 (511) Q Consensus 397 ~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 474 (511) ..+++.+...+-.|..+.++++.++ .|+++.-.++..++.-+-+. .+.+ .-+.+ ..+ .+. T Consensus 337 ~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~g--gD~~-----------~~~~n----~~~-~~~ 398 (409) T protein:vir:10 337 FYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEG--GDVL-----------LINGN----MIP-VKM 398 (409) T ss_pred cEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCee-----------eeccC----ccc-hhh Confidence 2355555566667889999998888 47888877887776422100 0000 00000 000 000 Q ss_pred cccccCCCCCCc Q lcl|NC_018086. 475 AANKLDKNPANT 486 (511) Q Consensus 475 ~~~~~~~~~~~~ 486 (511) .++ ...++|.. T Consensus 399 ~~~-~~~kgGe~ 409 (409) T protein:vir:10 399 AGE-QYSKGGEK 409 (409) T ss_pred ccc-cccccCCC Confidence 000 00000000 No 184 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=97.76 E-value=2.1e-05 Score=46.21 Aligned_cols=261 Identities=9% Similarity=0.044 Sum_probs=124.7 Q ss_pred hhccCceecC-chhhHHHHHHHHhc--c---ChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCC Q lcl|NC_018086. 92 LAGEPITESG-DEKTIKAMQPVFKE--N---YVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSAD 164 (511) Q Consensus 92 l~g~~~~~~~-d~~~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~ 164 (511) +-.-|+.+-. ++.....+..++.. | ........+..+.+.+|.||+.+..+.+|++ .+..++|..+.+..++. T Consensus 1 ia~l~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~v~v~~~~~ 80 (278) T protein:vir:78 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (278) T ss_pred CccceeEEEecCcccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCceeEEEEcCC Confidence 3233443311 11112223333321 2 2345677888899999999999888888875 57778899887766543 Q ss_pred CCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccC Q lcl|NC_018086. 165 LDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERL 244 (511) Q Consensus 165 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~ 244 (511) .. .+ +|.. ...+|.. ..+.++.+++++.- ++. +...|. T Consensus 81 ~~--~~----~y~~-~~~~g~~----~~~~~~evih~~~~----------------------~~~---------~~~~G~ 118 (278) T protein:vir:78 81 SR--EL----YYSI-HAATGNK----LIVHNMDMLHFKHI----------------------VAS---------NMVQGI 118 (278) T ss_pred Cc--eE----EEEE-EcCCceE----EEEccccEEEECCC----------------------CCC---------CCeeec Confidence 32 11 1111 1112211 12334444443210 000 112477 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHhcC-ceeEeecCCCCccchhhhhh---------hhCceeeecCCCceeeeecCC Q lcl|NC_018086. 245 GDFEAQLSLIDAYNLAVSDSVNDIAYWND-AYLWLQGFDLSADSDSISNM---------KNDRVIVTDEDGMVKFITKDV 314 (511) Q Consensus 245 s~~~~v~~l~d~~~~~~s~~~~~~~~~~~-p~l~~~G~~~~~~~~~~~~~---------~~~~~i~~~~~~~~~~~~~~~ 314 (511) |.+..+...++........ .+..++. |-.++. .+....++....+ ..++++.++++.+++.+.... T Consensus 119 s~~~~~~~~i~~~~~~~~~---~~~~~~~~~~~i~~-~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~ 194 (278) T protein:vir:78 119 SPIDVLKNTTDFDNAVRTF---NLTEMQKPDSFMLK-YGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKY 194 (278) T ss_pred cHHHHHHHHHHHHHHHHHH---HHHHhcCCCcEEEE-eCCCCCHHHHHHHHHHHHHHhccCCCceecCCCceEEEccCCh Confidence 7777766666654443222 2333333 333322 2222222222221 123567777666666665555 Q ss_pred CHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc Q lcl|NC_018086. 315 NDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDL 394 (511) Q Consensus 315 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~ 394 (511) ....+.+..+...+.|+..-++|..-.+...+.+...++.. ....+..+|.-+++.|...+...--.... T Consensus 195 ~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~----------~~~~~~~~l~P~~~~i~~~ln~~L~~~~e 264 (278) T protein:vir:78 195 VSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEEL----------NRFYLQHTLLPIVKQYEEEFNRKLLTKTD 264 (278) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcCChhH Confidence 55666777788888999988998655544332222222111 11333334444554444444432111110 Q ss_pred cccceeEEeCCCCC Q lcl|NC_018086. 395 KPYEVTPVFVRNLP 408 (511) Q Consensus 395 ~~~~i~i~f~~~~p 408 (511) -.....+.|+.+.- T Consensus 265 ~~~g~~~~f~~~~l 278 (278) T protein:vir:78 265 REKIGILNLTLNLI 278 (278) T ss_pred hcCCceEEEecccC Confidence 01123456643333 No 185 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=97.73 E-value=2.4e-05 Score=45.93 Aligned_cols=381 Identities=8% Similarity=0.005 Sum_probs=180.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCC----cccccCCcCccccccce-eccchHHHHHHHHHhhhhccCceecC- Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSSAYGVLYDYYKGNH----IAIQSRTFDDTNKPNSK-IVHNFPKLLVDTSTAYLAGEPITESG- 101 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~----~~~~~~~~~~~~~~~~r-i~~n~~k~ivd~~~~~l~g~~~~~~~- 101 (511) .....|...+. ....-.+....|..|-. .++......+ ...... .......-.+.+....+.+.++.+.. T Consensus 1 v~~~~l~~e~a---t~~~~~d~~~~~~~~l~~~~~~il~~a~~g~-~~~y~~l~~D~~i~s~l~~rk~av~~~~w~i~p~ 76 (488) T protein:vir:99 1 MEKPALGREIA---TSGDGRDITRPFISGLQVPNDSILQRRGGND-LRVYEEILSDAQVKTVWGQRQLAVVSREWKVEAG 76 (488) T ss_pred CCccchhHHHH---HHHhhhhhhccccCCCCCCChHHHHhhccCC-HHHHHHHhhChHHHHHHHHHHHHHhcCCceEEcC Confidence 11111111111 00000111122222211 1100000000 000000 12566777888888888888888842 Q ss_pred -c----hhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEE-EEeeeCCCCceEEEE---EcccceEEEecCCCCCceEEE Q lcl|NC_018086. 102 -D----EKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCF-EIHWIDRNKKHRFKA---VSPMNCLIAYSADLDEEPVAA 172 (511) Q Consensus 102 -d----~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~-~~v~~~~~g~~~i~~---~~p~~~~~v~d~~~~~~~~~~ 172 (511) + .+..+.+.+.+..-+|...+..+. ++.-||.++ +++|...+|...+.. .+|... .|++... T Consensus 77 ~~~~~~~~~ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f--~~d~~~~------ 147 (488) T protein:vir:99 77 GDRPIDQAAAEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRF--RYDQDGG------ 147 (488) T ss_pred CCChHHHHHHHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeecccce--eecCCCc------ Confidence 2 233456777787778888877755 788899765 566654455544333 333221 1222111 Q ss_pred EEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEee--cCCcccCchhHHH Q lcl|NC_018086. 173 IYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEI--IANEERLGDFEAQ 250 (511) Q Consensus 173 v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~--~n~~~g~s~~~~v 250 (511) . .+.+.+ . .......+.+++.+-.++. ..++.|.|.+..+ T Consensus 148 -------------l----~~~~~~---------~------------~~~g~~lp~~~~~i~~~~~~~~g~p~g~gLl~~~ 189 (488) T protein:vir:99 148 -------------L----RLLTPN---------N------------MFEGEPCPAPYFWHFSTGADNDDEPYGLGLAHWL 189 (488) T ss_pred -------------e----EEeccC---------C------------CCCccccccCceEEEEeecCCCCCcccchHHHHH Confidence 0 000100 0 0001111222332222221 1256788888877 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccch-------hhhhhhhCceeeecCCCceeeeecC-CCHHHHHHH Q lcl|NC_018086. 251 LSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSD-------SISNMKNDRVIVTDEDGMVKFITKD-VNDKHIENI 322 (511) Q Consensus 251 ~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~-------~~~~~~~~~~i~~~~~~~~~~~~~~-~~~~~~~~~ 322 (511) ....--=+..+.+++..++.|+.|+++.+-.....+++ ...++....+..+|.+.++++++.. .+...++.+ T Consensus 190 ~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~~ek~~l~~av~~~~~~~~~viP~~~~ie~~ea~~~~~~~~~~l 269 (488) T protein:vir:99 190 YWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATPEDKAKLLAALHAIQTDSAIIMPAGMQAELLEAGRSGTADYKTL 269 (488) T ss_pred HHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCHHHHHHHHHHHHHHhcCcEEEecCCceeEEeecCCCChHHHHHH Confidence 66655556678888899999999999877322111111 2234456678888999999999854 344567888 Q ss_pred HHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCccccccceeE Q lcl|NC_018086. 323 KNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLA-KRYELVCSYLEFMNKAKDLKPYEVTP 401 (511) Q Consensus 323 ~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~~i~i 401 (511) ++.+.+.|...--.--++.+..++ |-..=+....-....++.-.+.+...+. ++++.++.+ +... .. -..+ T Consensus 270 i~~~d~~Isk~iLGqtlts~~~~G-s~a~~~vh~~v~~d~~~aDa~~i~~tln~~li~~l~~~----N~~~-~~--~p~~ 341 (488) T protein:vir:99 270 HDTMDATIAKVGLGQVASTQGTPG-RLGNDDLQADVRLDLVKADADLICESFNLGPARWLTEW----NFPG-AQ--PPRV 341 (488) T ss_pred HHHHHHHHHHHHhhhhhccccccc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CcCC-cC--Ccee Confidence 888888776654222222222111 1111111122233334444455555553 355444432 2221 11 2356 Q ss_pred EeCCCCCcCHHHHHHHHHHHh---cc-CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCcccc Q lcl|NC_018086. 402 VFVRNLPQSYAELADMAVKLR---DM-LPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAAN 477 (511) Q Consensus 402 ~f~~~~p~d~~e~a~~~~~~~---g~-~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (511) .|....+.|.++.++++.+++ |+ ++.+.+.+.++. +.++.+ +. .... T Consensus 342 ~~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~Gi-p~~~~~--------~~--------------------~~~~ 392 (488) T protein:vir:99 342 YRVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQETYGV-EVESTQ--------AE--------------------ATAP 392 (488) T ss_pred EecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcCC-CCcccc--------cc--------------------cccC Confidence 787888889889998888874 55 777777777764 211100 00 0000 Q ss_pred ccCCCCCCccccccCCCCccccccccCCCCC--CCC Q lcl|NC_018086. 478 KLDKNPANTSTITTTDPVAAKEQEKAIQKKP--KTD 511 (511) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 511 (511) .+... .+...+.++.+ .++ T Consensus 393 ~~~~~---------------~~~~~~~~~~~~~~~~ 413 (488) T protein:vir:99 393 TPSTE---------------FAEGDQPSDPAAAMAP 413 (488) T ss_pred CCccc---------------CCCCCCCCCchHHHHH Confidence 00000 00000000100 001 No 186 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=97.73 E-value=2.4e-05 Score=45.90 Aligned_cols=383 Identities=9% Similarity=0.007 Sum_probs=161.3 Q ss_pred HHHHHH--HHHHHHHHHHHHHHhcCCC-ccc-ccCCcCccccc---cceeccchHHHHHHHHHhhhhccCceecC-chhh Q lcl|NC_018086. 34 ITLAEM--HSRSSSAYGVLYDYYKGNH-IAI-QSRTFDDTNKP---NSKIVHNFPKLLVDTSTAYLAGEPITESG-DEKT 105 (511) Q Consensus 34 ~~~~~~--~~~~~~~~~~~~~yY~G~~-~~~-~~~~~~~~~~~---~~ri~~n~~k~ivd~~~~~l~g~~~~~~~-d~~~ 105 (511) ++++.+ ...+.+. ....++..... ... ........... ..-+..+.....|+..++-+-.-|+.+-. .+.. T Consensus 1 m~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~ 79 (412) T protein:vir:26 1 MNVIAKENIVTRIKK-KLIDNWIDQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKVV 79 (412) T ss_pred Cccchhhhhhhhhhh-hHhhhhhcccccccccccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCceeEeeccccc Confidence 111111 1111110 01111110000 000 00000000000 00112233344555555555556776421 2222 Q ss_pred HHHHHHHHh--ccC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEEEEe Q lcl|NC_018086. 106 IKAMQPVFK--ENY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVI 179 (511) Q Consensus 106 ~~~l~~~~~--~n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~ 179 (511) ...+..+|. -|. -......+..+.+.+|.||+++..+..|++ .+..++|..+.+..++.... + +|... T Consensus 80 ~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~--~----~y~~~ 153 (412) T protein:vir:26 80 NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRE--L----YYSIH 153 (412) T ss_pred cchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcE--E----EEEEE Confidence 233444443 233 234556788899999999999988888875 57778899888777654321 1 11111 Q ss_pred ecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHH Q lcl|NC_018086. 180 SDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNL 259 (511) Q Consensus 180 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~ 259 (511) ...+.. ..+.++.+.|++.- ++ .+.-.|.|.++-+...++..+. T Consensus 154 -~~~g~~----~~~~~~evih~~~~----------------------~~---------~~~~~G~s~i~~~~~~i~~~~a 197 (412) T protein:vir:26 154 -AATGNK----LIVHNMDMLHFKHI----------------------VA---------SNMVQGISPIDVLKNTTDFDNA 197 (412) T ss_pred -cCCceE----EEEccccEEEeCCC----------------------CC---------CCCcccccHHHHHHHHHHHHHH Confidence 111211 12445555554210 00 0112467777655555554333 Q ss_pred HHHHHHHHHHHhcCc-eeEe-ecCCCCccchhhhhh--------h-hCceeeecCCCceeeeecCCCHHHHHHHHHHHHH Q lcl|NC_018086. 260 AVSDSVNDIAYWNDA-YLWL-QGFDLSADSDSISNM--------K-NDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKL 328 (511) Q Consensus 260 ~~s~~~~~~~~~~~p-~l~~-~G~~~~~~~~~~~~~--------~-~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 328 (511) ... . .+..+..+ -.++ .+...++ +....+ . .++++.++++.+++.+........+.+..+.... T Consensus 198 ~~~-~--~~~~~~~~~~~i~~~~~~l~~--e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~ 272 (412) T protein:vir:26 198 VRT-F--NLTEMQKPDSFMLKYGSNVGK--EKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRE 272 (412) T ss_pred HHH-H--HHHhcCCCCceEEecCCCCCH--HHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHH Confidence 221 1 22333333 2232 2222222 222111 1 2346666665565555444444556666777788 Q ss_pred HHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCccc-cccceeEEeCCC Q lcl|NC_018086. 329 DIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMN-KAKDL-KPYEVTPVFVRN 406 (511) Q Consensus 329 ~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~-~~~~~-~~~~i~i~f~~~ 406 (511) .|+..-++|..-.+..++.+...++... ...+..+|.-++..|...+...- ...+. ....+++.+..- T Consensus 273 ~Ia~afgVPp~~lg~~~~~~~sn~e~~~----------~~f~~~~l~P~~~~ie~~ln~kLl~~~~~~~~~~~~fd~~~l 342 (412) T protein:vir:26 273 RVANVFQLPSVFLNARSNTNFAKNEELN----------RFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSY 342 (412) T ss_pred HHHHHhCCCHHHhCCCCCCCcccHHHHH----------HHHHHHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhh Confidence 8999889886555443332222222111 12223334444444333332211 01111 112344445555 Q ss_pred CCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCC Q lcl|NC_018086. 407 LPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPA 484 (511) Q Consensus 407 ~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (511) +..|..+.++++.++ +|+++.-.++..++.-+-+. .++. .-..+. .+.+..........+| T Consensus 343 ~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g--gD~~-----------~~~~n~----~~~~~~~~~~~~~~gG 405 (412) T protein:vir:26 343 LRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEG--GDKP-----------LISGDL----YPIDTPLELRKSLKGG 405 (412) T ss_pred hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCee-----------eecccc----cccccchhhcccccCC Confidence 667889999998887 47888888888886532110 0000 000000 0000000000000111 Q ss_pred Ccccccc Q lcl|NC_018086. 485 NTSTITT 491 (511) Q Consensus 485 ~~~~~~~ 491 (511) ..+...+ T Consensus 406 ~~n~~e~ 412 (412) T protein:vir:26 406 DKNVNES 412 (412) T ss_pred CCCcCCC Confidence 1110000 No 187 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=97.70 E-value=2.7e-05 Score=45.63 Aligned_cols=428 Identities=11% Similarity=-0.008 Sum_probs=174.8 Q ss_pred CHHHHHHHHHHHHHHH---HHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhcc------Cc-e Q lcl|NC_018086. 29 DLRELITLAEMHSRSS---SAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGE------PI-T 98 (511) Q Consensus 29 ~~~~l~~~~~~~~~~~---~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~------~~-~ 98 (511) =...+.++.++++ |. .+++.+.+|..-. ..............++..+-+...++.+++.|++- || + T Consensus 1 mk~~~~~~~~~lk-R~~~e~~w~e~a~~tlP~---~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 76 (510) T protein:vir:63 1 MKTTAAMLWEKLR-DGSVEQRAIEFAKTTLPY---LMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) T ss_pred ChhHHHHHHHHHh-ccchHHHHHHHHHhhccc---cCCCCCCccccccCCCccchHHHHHHHHHHHHHhhhcCCCCcccc Confidence 0111223333332 22 3334444444431 11111111122223455666777777777776652 22 1 Q ss_pred ecCch--------------hhH-------HHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccce Q lcl|NC_018086. 99 ESGDE--------------KTI-------KAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNC 157 (511) Q Consensus 99 ~~~d~--------------~~~-------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~ 157 (511) +...+ +.. ..+...+..++|.....++.++...+|.+.+++ ++++. +++.++-.+ T Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~--~~~~~-~~~~~pl~~- 152 (510) T protein:vir:63 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--DSDAA-TVVAWSLRS- 152 (510) T ss_pred cCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEE--cCCCc-EEEEEEcce- Confidence 22111 111 125556778899999999999999999986665 45443 455554433 Q ss_pred EEEecCCCCCceEEEEEEEEEeec--------------CCcceEEEEEEEcCCcEEEEEEccCc---ccccccccccccc Q lcl|NC_018086. 158 LIAYSADLDEEPVAAIYYNTVISD--------------ITGHQIRTYEVYTEDLIYKFSTDDER---EVYREIPEELEIK 220 (511) Q Consensus 158 ~~v~d~~~~~~~~~~v~~~~~~~~--------------~~~~~~~~~~~~~~~~i~~~~~~~~~---~~~~~~~~~~~~~ 220 (511) +++--|.. +.+...++.+..... ..++....+++|+.- +...... +.+..... +.. T Consensus 153 y~v~~d~~-G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V----~~~~~~~~~~~sv~~e~d--g~~ 225 (510) T protein:vir:63 153 YAVRRDAT-GRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHV----QRKKGTAMEYAELYHEID--GVR 225 (510) T ss_pred eEEeeCCC-cCeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEE----EeecCCCceEEEEEEEec--Cce Confidence 44444443 334555554432210 011122233333311 0011110 00000000 001 Q ss_pred cccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhh Q lcl|NC_018086. 221 DYEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKN 295 (511) Q Consensus 221 ~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~ 295 (511) .......++..+|++.++ ++.+|+|-..+..+-+..+|.+.-...........|.+.+.- ++-........ .. T Consensus 226 ~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p-~g~~~~~~~~~-~~ 303 (510) T protein:vir:63 226 VGKEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE-AKGAVVDDYQD-AE 303 (510) T ss_pred eccccccccccCceeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCc-ccccchhhhcc-CC Confidence 111223345567766554 356899999999999999998877777777777776644321 11111111111 11 Q ss_pred CceeeecCCCceeeee--cCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHH----H-HHH Q lcl|NC_018086. 296 DRVIVTDEDGMVKFIT--KDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSA----V-KES 368 (511) Q Consensus 296 ~~~i~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~----~-~~~ 368 (511) .+.+.-....+++.+. ...+.......++.++..|...-.. ++..-.....|++.+...-..+..... + ... T Consensus 304 ~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~-~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E 382 (510) T protein:vir:63 304 MGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVRITAEEAENTLGGTYSLLAEN 382 (510) T ss_pred CceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHHh-hcccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHH Confidence 1233222223455543 2345566667777777766664322 122112223578776654333332221 1 112 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHH-HHHHH----HHHHhc---c---CChHHH----HHh Q lcl|NC_018086. 369 KFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYA-ELADM----AVKLRD---M---LPDETI----INQ 433 (511) Q Consensus 369 ~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~-e~a~~----~~~~~g---~---~s~et~----~~~ 433 (511) .+..-+.+++.++... +......+ ......|.+-.++-+... +.+.. +..+.+ + +....+ ... T Consensus 383 ~l~Pli~r~~~il~r~-gl~p~p~~-~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~aq~~~~id~d~~~~~~a~~ 460 (510) T protein:vir:63 383 LQSPLAYVCLSEVDDA-LLQGLITK-QHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAA 460 (510) T ss_pred HHHHHHHHHHHHHHhc-cCCCCCch-hcccceecchhHHHHHHHHHHHHHHHHHHHHhcCchhhhccCCHHHHHHHHHHH Confidence 2333333333332211 11111111 111112233222222111 11111 111111 1 111222 233 Q ss_pred CCCCC----CHHHHHHHHHHHHHHHHHH---HHhhccccccCCCCCCccc Q lcl|NC_018086. 434 FPWIT----DARQEVEKADAQRQKRADI---ALQNFKQTSAVQGASTAAA 476 (511) Q Consensus 434 l~~v~----d~~~E~~ri~~E~~~~~~~---~~~~~~~~~~~~~~~~~~~ 476 (511) +|-.+ -.++|++.+.+++.++... +.....+..+..+.....- T Consensus 461 ~Gv~p~~ivrs~eev~a~~~~~~qq~~~~~~~~~~~~~~a~~~~~~~~g~ 510 (510) T protein:vir:63 461 FSVDTSQFYKSADELQAEAEQQRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred hCCChhHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC Confidence 33211 1355666655442221111 1111111112211111111 No 188 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=97.70 E-value=2.7e-05 Score=45.59 Aligned_cols=427 Identities=10% Similarity=-0.010 Sum_probs=173.5 Q ss_pred CHHHHHHHHHHHHH--HHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhcc------Cc-ee Q lcl|NC_018086. 29 DLRELITLAEMHSR--SSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGE------PI-TE 99 (511) Q Consensus 29 ~~~~l~~~~~~~~~--~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~------~~-~~ 99 (511) =.+.+.++.+++++ -..+++.+.+|..-. ..............++..+-+...++.+++.|++- || ++ T Consensus 1 mk~~~~~~~~~lkr~~~e~~w~e~a~~tlP~---~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) T protein:vir:78 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY---LMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) T ss_pred ChhHHHHHHHHHhccchHHHHHHHHHhhccc---cccCCCCcccccccCcccchHHHHHHHHHHHHHHhhcCCCCccccc Confidence 01111222222221 113334444444431 11111111111122345566667777777776642 22 12 Q ss_pred cCch----------h----hH-------HHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceE Q lcl|NC_018086. 100 SGDE----------K----TI-------KAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCL 158 (511) Q Consensus 100 ~~d~----------~----~~-------~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~ 158 (511) ...+ . .. ..+...+..++|.....++.++...+|.+.+++. +++. .++.++- .-+ T Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~--~~~~-~~~~~pl-~~y 153 (510) T protein:vir:78 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SDEA-TVVAWSL-RSY 153 (510) T ss_pred CCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEe--CCCC-eEEEEEc-cee Confidence 2111 0 11 1244457778999999999999999999876654 3333 3455543 334 Q ss_pred EEecCCCCCceEEEEEEEEEeec--------------CCcceEEEEEEEcCCcEEEEEEccCc---cccccccccccccc Q lcl|NC_018086. 159 IAYSADLDEEPVAAIYYNTVISD--------------ITGHQIRTYEVYTEDLIYKFSTDDER---EVYREIPEELEIKD 221 (511) Q Consensus 159 ~v~d~~~~~~~~~~v~~~~~~~~--------------~~~~~~~~~~~~~~~~i~~~~~~~~~---~~~~~~~~~~~~~~ 221 (511) ++--|.. +.+...+|.+..... ...+....+++|+. ++ ...... +.+....... .. T Consensus 154 ~v~~d~~-G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~--V~--~~~~~~~~~~sv~~e~dg~--~i 226 (510) T protein:vir:78 154 AVRRDAT-GRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH--VQ--RRKGTAMDYAEMYHEIDGV--RV 226 (510) T ss_pred EEeeCCC-cCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEE--EE--eecCCCCcEEEEEEEecCe--ee Confidence 4444443 345555555443200 01112223343331 00 001100 0001000000 11 Q ss_pred ccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhC Q lcl|NC_018086. 222 YEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKND 296 (511) Q Consensus 222 ~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~ 296 (511) ......++..+|++.++ ++.+|+|-..+..+-+..+|.+.-...........|.+.+.- ++-....... .... T Consensus 227 ~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p-~g~~~~~~l~-~~~~ 304 (510) T protein:vir:78 227 GETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE-AKGAVVDDYQ-DAEM 304 (510) T ss_pred ccccccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCC-ccccchhhhc-cCCC Confidence 11222334556766543 356899999999999999998877777766666666544321 1111111111 1112 Q ss_pred ceeeecCCCceeeee--cCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHH----H-HHHH Q lcl|NC_018086. 297 RVIVTDEDGMVKFIT--KDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSA----V-KESK 369 (511) Q Consensus 297 ~~i~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~----~-~~~~ 369 (511) +.+.-....+++.+. ...+.......++.++..|...-.. ++..-.....|++.+...-..+..... + .... T Consensus 305 g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~-~l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~ 383 (510) T protein:vir:78 305 GDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVRITAEEAENTLGGTYSLLAENL 383 (510) T ss_pred ceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHHhh-ccccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHH Confidence 233322223455543 2345566666777777766654322 222112224578776654333332221 1 1222 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHH----HHH---hc---c---CChHHH----HH Q lcl|NC_018086. 370 FRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMA----VKL---RD---M---LPDETI----IN 432 (511) Q Consensus 370 ~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~----~~~---~g---~---~s~et~----~~ 432 (511) +..-+.+++.++... +......+ ......|.+-.++-+. +.++.+ +.+ .+ + +....+ .. T Consensus 384 l~Pli~r~~~il~r~-gl~p~p~~-~~~~~~v~~is~Lara--q~~~~l~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~ 459 (510) T protein:vir:78 384 QSPLAYVCLSEVDDA-LLQGLITK-QHKPAIETGLPALSRS--AAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWA 459 (510) T ss_pred HHHHHHHHHHHHHhc-cCCCCCcc-cccceeeecccHHHHH--HHHHHHHHHHHHHHHhcChhhhhhcCCHHHHHHHHHH Confidence 333333333332211 11111111 1222233443333321 111111 111 11 1 112222 23 Q ss_pred hCCCCC----CHHHHHHHHHHHHHHHHHHH---HhhccccccCCCCCCccccccCCC Q lcl|NC_018086. 433 QFPWIT----DARQEVEKADAQRQKRADIA---LQNFKQTSAVQGASTAAANKLDKN 482 (511) Q Consensus 433 ~l~~v~----d~~~E~~ri~~E~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~ 482 (511) .+|-.+ -.++|++.+++++++.+... ......+.+..++. +.+- T Consensus 460 ~~Gv~p~~ivrs~eev~a~~~~~~~q~~~~~~~~~a~~~~~~~~~~~------~~g~ 510 (510) T protein:vir:78 460 AFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNA------LAGV 510 (510) T ss_pred HhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc------CCCC Confidence 334211 23567776666543222211 11111111111100 0000 No 189 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=97.69 E-value=2.8e-05 Score=45.57 Aligned_cols=447 Identities=11% Similarity=0.113 Sum_probs=191.5 Q ss_pred CCC--ccchhhcccccCchhhH--hhhhc---cCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCcccccc Q lcl|NC_018086. 1 MAI--PNGQINAGDIITTNIRR--KHFIR---RNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPN 73 (511) Q Consensus 1 ~~~--~~~~~~~~~~~~~~~~~--~~~~~---~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~ 73 (511) |+- +.+-+...-+.++.... ....+ ...++.+=..+--.+.-...-++-+ -||.|..=+-+...+.-. T Consensus 46 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~~~~~F~Gy~~la~la---- 120 (698) T protein:vir:10 46 MGRRGALNALDAAPVAEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDAL-SFVTSSGFPGFPTLVLLA---- 120 (698) T ss_pred hcccccccccccccccCCCccccccccceeccccCCccccchhhhhhcccccccccc-hhhhccCcchHHHHHHHh---- Confidence 321 12222222222222211 10000 1111111000000000000001111 122221111111100000 Q ss_pred ceeccchHHHHHHHHHhhhhccCceec---------------------CchhhHHHHHHHHhccChhHHHHHHHHHHhhC Q lcl|NC_018086. 74 SKIVHNFPKLLVDTSTAYLAGEPITES---------------------GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIF 132 (511) Q Consensus 74 ~ri~~n~~k~ivd~~~~~l~g~~~~~~---------------------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~ 132 (511) -++-.+.++.+.+..+.-+-+..+ .|-+..+.|..-+++=++...+.++.+++-.| T Consensus 121 ---Q~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eai~~aRlf 197 (698) T protein:vir:10 121 ---QLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAF 197 (698) T ss_pred ---hccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 123344445555554432211111 11144566777788888889999999999999 Q ss_pred CeEEEEeeeCCCCc----eE--------------EEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEc Q lcl|NC_018086. 133 GHCFEIHWIDRNKK----HR--------------FKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYT 194 (511) Q Consensus 133 G~~~~~v~~~~~g~----~~--------------i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~ 194 (511) |.+..++-.+.++. |. +.+++|..+.|-.-+.. .++. -.+|- T Consensus 198 GGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~~~n~~--dP~s------------------pdfgk 257 (698) T protein:vir:10 198 GRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSI--NPVA------------------DDFYK 257 (698) T ss_pred cceEEEEEeecCccccccccccccccccCccceeeeeecccccccchhhhc--cchh------------------hccCC Confidence 99876665433321 11 44455555444110000 0000 01111 Q ss_pred CCcEEEEEEccCcccccccccccccccccceeccCC--ccceEeec--CCcccCchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 195 EDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQ--KFPVLEII--ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAY 270 (511) Q Consensus 195 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g--~iPvv~~~--n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~ 270 (511) |..++-. +.. ......+.|- .+|-. ++ .+-.|.|....+.+-+++++++.-.....+.. T Consensus 258 P~~y~V~----G~~------------IH~SRL~~~vg~pvpd~-LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~ 320 (698) T protein:vir:10 258 PSTWWMI----GSE------------VHATRLHTIVSRPVGDM-LKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQ 320 (698) T ss_pred CceEEEe----cce------------ecceeEEEecCCCchhh-hcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHHH Confidence 2111100 000 0000000000 01111 11 12247888888888888888877666665544 Q ss_pred hcCceeEeecC----CCCccchhh------hhhhh-CceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccc Q lcl|NC_018086. 271 WNDAYLWLQGF----DLSADSDSI------SNMKN-DRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDL 339 (511) Q Consensus 271 ~~~p~l~~~G~----~~~~~~~~~------~~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~ 339 (511) +....+. ++. ......... ...+. .+++.++++ +=+|.+.+.+...+...+......|...+++|-. T Consensus 321 ~~~~~l~-~dla~aL~~g~~~~l~~R~eli~~~Rsn~G~~llDk~-~Eefeq~st~lSGLddVi~qf~q~VAgaa~IPlt 398 (698) T protein:vir:10 321 FSVSGIL-MDLAQALTPGANVDLSMRAELINRYRDNRNILFLDKA-TEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPLI 398 (698) T ss_pred hhHHHHH-HHHHHhcCChhhHHHHHHHHHHHHhcCccceEEEecC-CcceEEEecCcCCHHHHHHHHHHHHHhhhcCchh Confidence 4333321 111 000111111 11222 345556532 2356677788899999999999999999999965 Q ss_pred cccccc----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHH Q lcl|NC_018086. 340 VSKDFT----AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELA 415 (511) Q Consensus 340 ~~~~~~----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a 415 (511) -....+ |+||++=..-|...+.-. .++.+...+++++.+|.. +..+.. +. ++.+.|+|-...++.|.| T Consensus 399 kLfGqSPkGlNATGE~D~rnYYD~I~s~--Qe~~L~p~L~rl~~ii~r--S~~G~i---dp-~i~~~fnPL~qmtd~EkA 470 (698) T protein:vir:10 399 KLLGITPTGLNASSEGEIRVWYDYVRAY--QRNALQQLMNDVIVMIQL--SLFGAV---DP-SIKWQWNALRELDDLEVA 470 (698) T ss_pred hhhccCCcccCccchhhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH--HhcCCC---CC-cceEEeCCCCCcCHHHHH Confidence 543332 688987555555544433 367889999998887643 333322 22 588899999999999988 Q ss_pred HHHHHH---------hccCChHHHHHhCC------CC--CCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccc Q lcl|NC_018086. 416 DMAVKL---------RDMLPDETIINQFP------WI--TDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANK 478 (511) Q Consensus 416 ~~~~~~---------~g~~s~et~~~~l~------~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (511) +.-.+- .|+++...+..+|- |. .|.+.+--. -+|.. .+..+. ..+.....|..+.. T Consensus 471 eI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~d~~d~p~~-~~~~~--~~~~~~-~~~~~~~~~~~~~~--- 543 (698) T protein:vir:10 471 EARYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGA-PADDD--IDGVLT-YVQRMAEGGDTGAP--- 543 (698) T ss_pred HHHhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCccccccCCcccCCC-CCCCc--chHHHh-hhcCCcCCCCcccc--- Confidence 764332 36666665555551 21 111100000 00000 000000 00000000000000 Q ss_pred cCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 479 LDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) ..++.+-++++-.|..++..-++.+--+.+. T Consensus 544 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 574 (698) T protein:vir:10 544 --TAPGGARAGATAPPAAANVNANANPREAGAQ 574 (698) T ss_pred --cccccccCCCCCCcccccccCCCCccccCcc Confidence 1122222333333444444444444333333 No 190 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=97.66 E-value=3.2e-05 Score=45.25 Aligned_cols=403 Identities=9% Similarity=-0.032 Sum_probs=147.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhc----C---CCcccccCCcCccccccc----ee-ccchHHHHHHHHHhhhhcc Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSSAYGVLYDYYK----G---NHIAIQSRTFDDTNKPNS----KI-VHNFPKLLVDTSTAYLAGE 95 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~~~~~~~~yY~----G---~~~~~~~~~~~~~~~~~~----ri-~~n~~k~ivd~~~~~l~g~ 95 (511) |+.+ +.+...|..+..+..+.++-+. | +++..+...--+...... .. ....++.||+..++-..-+ T Consensus 1 ~~~~--~~~~~~~~~~~~~~~~~rd~l~~~~~glg~~r~~~~~~~g~~~~~~~~~l~~~Yr~~~ia~~iVd~~~d~~~~~ 78 (449) T protein:vir:10 1 MTDK--LTLAVNHALNDARMARARMGLMVPTMGLDNKRHSAWCEYGFPELVTYENLYSLYRRGGIAHGAVEKLVGKCWQT 78 (449) T ss_pred Cchh--hHHHHhhhcchhHHHHHHHHHHHHHhcCCcccchhhhhcCCcccCCHHHHHHHHhcCchhHHHHHhhhhhhhhc Confidence 2222 1222222222222222222111 1 111111100000000000 01 1345678899888866433 Q ss_pred Cc-eecCchhh-------HHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCC Q lcl|NC_018086. 96 PI-TESGDEKT-------IKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDE 167 (511) Q Consensus 96 ~~-~~~~d~~~-------~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~ 167 (511) .+ .+.+++.. .+...+-+..+++...+.++.+++..+|.|++++..+ +|+..-..+.+.. T Consensus 79 ~~~i~~g~~~~~~~~~~~~e~~~~~l~~~~~~~~l~ea~~~~rl~Gga~i~i~v~-d~~~l~~Pl~~~~----------- 146 (449) T protein:vir:10 79 NPEIIEGDDADDSEDETSWEKKSKQVFTNRLWRSFAEADRRRLVGRYAGILLHIR-DEKDWNLPATKGR----------- 146 (449) T ss_pred CcccccCccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcEEEEEEec-CCCCCCcccccCc----------- Confidence 32 22322211 1112222333466677888899999999988877653 2332111111110 Q ss_pred ceEEEEEEEEE---eecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccC Q lcl|NC_018086. 168 EPVAAIYYNTV---ISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERL 244 (511) Q Consensus 168 ~~~~~v~~~~~---~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~ 244 (511) -+..|+.+.. ....-......-.++.|..+ ++.....+ ........|.--.+.++..+ ..|. T Consensus 147 -~i~~i~v~~~~~i~~~~~~~dp~sp~yg~P~~y-~v~~~~~g-----------~~~~~~~iH~SRl~~~~~~~--~~g~ 211 (449) T protein:vir:10 147 -GLQKVSVSWAGSLKVAEWDTGINSKTYGQPKLW-KYTERLPN-----------GSSRRVDIHPDRVFILGDYS--EDAI 211 (449) T ss_pred -ceeeEEeeccccCChhhhhcCCCCCCCCCceEE-EEeeeccC-----------CCccceeeccceeEeecCCC--CCCh Confidence 0111111100 00000000000011111111 01000000 00001122332222222111 1244 Q ss_pred chhHHHHHHHHHHHHHHHH-----HHHHHHHhcCc---eeEe------ecCCCCccch-hhhh---hh-hCceeeecCCC Q lcl|NC_018086. 245 GDFEAQLSLIDAYNLAVSD-----SVNDIAYWNDA---YLWL------QGFDLSADSD-SISN---MK-NDRVIVTDEDG 305 (511) Q Consensus 245 s~~~~v~~l~d~~~~~~s~-----~~~~~~~~~~p---~l~~------~G~~~~~~~~-~~~~---~~-~~~~i~~~~~~ 305 (511) |.++.+..-+-.++.+.-. +.+..+..... ...+ .+.+.+...+ +... +. ....+.+..+. T Consensus 212 ~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~i~~~~ 291 (449) T protein:vir:10 212 GFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLYGVSIDELQDKFNEVAGEINRGNDVLMTTQGA 291 (449) T ss_pred hHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHhhCCchHHHHHHHHHHHHHhccchheeecCCc Confidence 5555433222222221100 11111111000 0011 1211111111 1111 11 11233444444 Q ss_pred ceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccc-ccc-c--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 306 MVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVS-KDF-T--AASGQALKAATQPLENKSAVKESKFRKVLAKRYELV 381 (511) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~-~--~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li 381 (511) +. .+.+.+.......++.....+...+++|-.-. |.. + |++++ ++. -...+..++..+...|++++.+| T Consensus 292 d~--~~~~~~~sgl~d~l~~~~q~iaaa~~IP~t~L~Gqsp~glnst~D-~~n----yyd~i~~~Q~~l~p~le~l~~~l 364 (449) T protein:vir:10 292 TV--TPLVTSVADPTATYNVNLQTAAAGVDIPTRILIGNQQAERSSTED-QKY----FNARCQSRRVDLSFEIEDFCDKL 364 (449) T ss_pred ce--EEEecccCChhHHHHHHHHHHHHHhCCCeeeeeccCccccccchh-HHH----HHHHHHHHHHhhhHHHHHHHHHH Confidence 54 44556667777788888888899999985432 222 1 23433 332 33444445556899999999876 Q ss_pred HHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018086. 382 CSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQN 461 (511) Q Consensus 382 ~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~ 461 (511) +.. ..+.. ..+++|.|+|-...+++|.|+...+.+...+........+-+ ++ .|+.. . +.. T Consensus 365 ~~s--~~g~~----~~d~~i~f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~~~~~-~~-~EiR~-------~----~~~ 425 (449) T protein:vir:10 365 IEL--KIIDA----VAKKAVIWDDLNEQTGTEKLTNAKTMGEINQTMLGSGDNPAF-SR-EEIRT-------A----AGY 425 (449) T ss_pred HHh--hcCCC----CCceeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHccccCCc-CH-HHHHH-------H----hcc Confidence 543 11211 136899999999999999998776664322211100011111 11 11111 0 000 Q ss_pred ccccccCCCCCCccccccCCCCCCcc Q lcl|NC_018086. 462 FKQTSAVQGASTAAANKLDKNPANTS 487 (511) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 487 (511) .+. ...+..+...++.+....+++ T Consensus 426 ~~~--~~~~~~~e~~de~~~~~d~~a 449 (449) T protein:vir:10 426 DND--DEEPLGEEDGDEEDKATDSAA 449 (449) T ss_pred cCC--CCCCCCCCCCccccccCCcCC Confidence 000 011111111111111111111 No 191 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=97.60 E-value=3.9e-05 Score=44.77 Aligned_cols=406 Identities=9% Similarity=0.008 Sum_probs=185.9 Q ss_pred CCCccchhhcccccCchhhHh--------------hhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRK--------------HFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF 66 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~ 66 (511) |+.=+++- |.-+......+ ......+++..+...+..-. ...+..+.+.|+- .. T Consensus 1 ~~~~~d~~--g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~--~gd~~~~~~L~ed--------m~ 68 (526) T protein:vir:79 1 MAQIVDVY--GNPIRPQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAE--QGNLQAQAELFMD--------ME 68 (526) T ss_pred CCeeeCCC--CCccCccccchhhhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhh--CCCHHHHHHHHHH--------HH Confidence 76666654 44332111000 01111233332222211110 0001111111110 00 Q ss_pred CccccccceeccchHHHHHHHHHhhhhccCceec---Cc----hhhHHHHHHHHhcc-ChhHHHHHHHHHHhhCCeEE-E Q lcl|NC_018086. 67 DDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITES---GD----EKTIKAMQPVFKEN-YVTDVNSEEVKLSGIFGHCF-E 137 (511) Q Consensus 67 ~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~---~d----~~~~~~l~~~~~~n-~~~~~~~~~~~~a~~~G~~~-~ 137 (511) -......-.+.+....+.+.++++. .+ .+..+.+.+++.+- +|...+.. ..+|.-||.++ + T Consensus 69 ---------e~D~~i~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~-~ldA~~~G~s~~E 138 (526) T protein:vir:79 69 ---------ERDAHLFAEMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLD-ALDGIGHGYSCIE 138 (526) T ss_pred ---------hhChHHHHHHHHHHHHHhCCCceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHH-HHhhhhhcceeEE Confidence 0135566677777777788888773 12 23344567777653 57776665 55688899765 5 Q ss_pred EeeeCCCCceEEE---EEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccc Q lcl|NC_018086. 138 IHWIDRNKKHRFK---AVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIP 214 (511) Q Consensus 138 ~v~~~~~g~~~i~---~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 214 (511) ++|...+|...+. ..+|.. | .|++..+.. +++. .....| ..+ T Consensus 139 i~w~~~~g~~~~~~l~~r~~~~-F-~~~~~~~~~----l~~~--~~~~~g-----~~l---------------------- 183 (526) T protein:vir:79 139 LEWALQGREWMPLAFHHRPQSW-F-QLNPEDQNE----LRLR--DNSPAG-----EAL---------------------- 183 (526) T ss_pred EEEeecCCceeEEEeeeecccc-e-EeccCCCcE----EEec--CCCCCc-----eee---------------------- Confidence 6665545544333 233321 1 122222110 0000 000000 001 Q ss_pred cccccccccceeccCCccceEeec--CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccch---- Q lcl|NC_018086. 215 EELEIKDYEVHPNLLQKFPVLEII--ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSD---- 288 (511) Q Consensus 215 ~~~~~~~~~~~~~~~g~iPvv~~~--n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~---- 288 (511) .+++.|-.++-. .++.|.|.+..+.-..--=+..+.+++..++.|+.|+++.+=.....+++ T Consensus 184 ------------~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~~~a~~~ek~~L 251 (526) T protein:vir:79 184 ------------QPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTADEEKATL 251 (526) T ss_pred ------------cCCceEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCCHHHHHHH Confidence 122222222211 24567788877554444445578888999999999999887322222211 Q ss_pred --hhhhhhhCceeeecCCCceeeeecC-CCHHHHHHHHHHHHHHHHHHhCcccccccc-ccCccHHHH-HHHHHHHHHHH Q lcl|NC_018086. 289 --SISNMKNDRVIVTDEDGMVKFITKD-VNDKHIENIKNRAKLDIFSLSQTPDLVSKD-FTAASGQAL-KAATQPLENKS 363 (511) Q Consensus 289 --~~~~~~~~~~i~~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~~Sg~Ai-~~~~~~l~~k~ 363 (511) ...++....+..+|.+.++++++.. .....++.+++.+.+.|...--.-.++.+. .|+.+.-|+ +....-....+ T Consensus 252 ~~av~~i~~da~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~ 331 (526) T protein:vir:79 252 LRAVTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNEVRHDIL 331 (526) T ss_pred HHHHHHHhcCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhhHHHHHHHHHHHH Confidence 2234556678889999999999854 455678888898888887764332233221 111111122 11122222333 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHh--cc-CChHHHHHhCCCCCC Q lcl|NC_018086. 364 AVKESKFRKVLA-KRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR--DM-LPDETIINQFPWITD 439 (511) Q Consensus 364 ~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~--g~-~s~et~~~~l~~v~d 439 (511) +.-.+.+...+. ++++.++.+ .......-..-..+.|....+.|.++.++.+.+++ |+ +|.+.+.+.++. +. T Consensus 332 ~aDa~~i~~tln~~Li~~l~~~---N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gi-p~ 407 (526) T protein:vir:79 332 ASDARQLAATLSRDLLWPLLVL---NRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKLGI-PQ 407 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHh---CCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCcCCHHHHHHHhCC-CC Confidence 344455555553 355444432 22111111123567888889999999999999886 55 899888888875 32 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 440 ARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 440 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) +... +.+. ....+... . ...++.... ..+ ....+..+..| T Consensus 408 ~~~~-e~~l-----------~~~~~~~~---~--------~~~~~~~~~------~~~---~~~~~~~~~~~ 447 (526) T protein:vir:79 408 PAKN-EPVL-----------RPAAQPAI---L--------SRQHGQRVA------ALA---TIVGPRYGDQQ 447 (526) T ss_pred CCCc-hhhc-----------cccCCccc---c--------ccccccccc------ccc---ccccccCchhh Confidence 2210 0000 00000000 0 000000000 000 00000011111 No 192 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=97.58 E-value=4.1e-05 Score=44.62 Aligned_cols=374 Identities=9% Similarity=0.035 Sum_probs=169.0 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccch Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNF 80 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~ 80 (511) |-|+|+..++......... ........ ... ......+.|... ...... .-+...- T Consensus 1 m~m~~~~~~~~~~~~~~~~--~~~~~~~~--------------~~~-~~~~~~~~~~~g-----~~v~~~---~al~~~~ 55 (392) T protein:vir:74 1 MILPILNFINQTNDPPEAG--SVQSYFPD--------------GND-AQIMESLLGDNN-----EWVSAR---AALRNSD 55 (392) T ss_pred CcchhhhhhhcccCccccc--cccccccc--------------Cch-hhhhhhccCCCC-----cccchh---hhhcchH Confidence 8888875544322111000 00000000 000 001111111110 000000 0011233 Q ss_pred HHHHHHHHHhhhhccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEE Q lcl|NC_018086. 81 PKLLVDTSTAYLAGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLI 159 (511) Q Consensus 81 ~k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~ 159 (511) ....|+..++-+-+-|+.+-.... ...+.+-.....-......+..+.+.+|.||+.+-.+.+|++ .+..++|..+.+ T Consensus 56 v~~~v~~ia~~ia~lp~~~~~~~~-~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v 134 (392) T protein:vir:74 56 LFSIILQLSSDLAIVKINAEKKKN-QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNT 134 (392) T ss_pred HHHHHHHHHHhhccCceeeccchh-hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEE Confidence 444566666666556776532211 111111111122345566778889999999999888888876 577888888877 Q ss_pred EecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecC Q lcl|NC_018086. 160 AYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIA 239 (511) Q Consensus 160 v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n 239 (511) ..+.... .+ +|...... + .......+.++.++++..- .... T Consensus 135 ~~~~~~~-~~-----~y~~~~~~-~-~~~~~~~~~~~evih~~~~-------------------------------~~~~ 175 (392) T protein:vir:74 135 YYFEYEN-GM-----YYNITFDD-P-KIEPILQAPQSDLIHMKLL-------------------------------SIDG 175 (392) T ss_pred EEcCCCc-eE-----EEEEEecC-C-ccceeEEEcCccEEEecCC-------------------------------CCCC Confidence 7654321 11 11111111 1 1111122444444444210 0011 Q ss_pred CcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEee--cCCCCccchhhhhh--------hhCceeeecCCCceee Q lcl|NC_018086. 240 NEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQ--GFDLSADSDSISNM--------KNDRVIVTDEDGMVKF 309 (511) Q Consensus 240 ~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~--G~~~~~~~~~~~~~--------~~~~~i~~~~~~~~~~ 309 (511) .-.|.|.+..+...++....+..-....++..+.|-.+++ +....+ ++....+ ..++++.++++.+++. T Consensus 176 ~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~-~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~ 254 (392) T protein:vir:74 176 GKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLS-DKDKASRSRSFMKRSRSGGPVVLDDLEEFTA 254 (392) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCch-HHHHHHHHHHHhccccCCCeeecCCCceEEE Confidence 1247888877777776666665555556666676765554 321111 1111111 1235677776666666 Q ss_pred eecCCCHHHHHHHHHHHHHHHHHHhCccccccccccC-ccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 310 ITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA-ASG-QALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEF 387 (511) Q Consensus 310 ~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-~Sg-~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 387 (511) +........+.+..+...+.|+..-++|..-.+..+. .|. .+.+ ..+..+|.-.++.|...+.. T Consensus 255 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~e~~~--------------~~~~~~l~p~~~~ie~~l~~ 320 (392) T protein:vir:74 255 LEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQIS--------------GMYASALNRYLRPAISELEY 320 (392) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHH--------------HHHHHHHHHHHHHHHHHHHH Confidence 6555555666777788888999988988655544322 121 1211 12333444444433333322 Q ss_pred cCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018086. 388 MNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQF---PWITDARQEVEKADAQRQKRADIALQNF 462 (511) Q Consensus 388 ~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l---~~v~d~~~E~~ri~~E~~~~~~~~~~~~ 462 (511) .-. . ++++.+...+-.|..+.++.+.++ +|+++...++..+ |+.. .|+.+ ..+. T Consensus 321 ~l~-~-----~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~p---ne~r~------------~enl 379 (392) T protein:vir:74 321 KLS-D-----HISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIP---KDLPA------------PENT 379 (392) T ss_pred hcc-c-----hhcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCc---cccch------------hcCC Confidence 110 0 122233333345667777777776 4788887776554 3322 11111 0011 Q ss_pred cccccCCCCCCccccccCCCCCC Q lcl|NC_018086. 463 KQTSAVQGASTAAANKLDKNPAN 485 (511) Q Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~ 485 (511) + ...|++ + .+|.+ T Consensus 380 ~---~~~~Gd-----~--~~p~p 392 (392) T protein:vir:74 380 N---KKTTGQ-----S--NEPVP 392 (392) T ss_pred C---CCCCCC-----C--CCCCC Confidence 0 011110 0 00000 No 193 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=97.56 E-value=4.5e-05 Score=44.43 Aligned_cols=367 Identities=10% Similarity=0.048 Sum_probs=154.3 Q ss_pred HHHHHHHHHHHHHH-HHHHHHhcCCCcccccCCcCccccc-cce--eccchHHHHHHHHHhhhhccCceecCchhhHHHH Q lcl|NC_018086. 34 ITLAEMHSRSSSAY-GVLYDYYKGNHIAIQSRTFDDTNKP-NSK--IVHNFPKLLVDTSTAYLAGEPITESGDEKTIKAM 109 (511) Q Consensus 34 ~~~~~~~~~~~~~~-~~~~~yY~G~~~~~~~~~~~~~~~~-~~r--i~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~~~l 109 (511) +.+........... .....+..--.. ..... ...+.. ..+ +..+-....|+..++-+-+-|+.+.. ......+ T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~~-~~~~~l~ 77 (386) T protein:vir:48 1 MPIFNITNLATESPPISQGGFFDITDP-DFLST-LNGSEWVSAESALRNSDLFSIINQLSNDLATVKLTASR-KQLQGII 77 (386) T ss_pred Ccccccccccccccccccccccccccc-hhccc-ccCCceechhhhhcchHHHHHHHHHHHhhccCceeecc-chhHHHh Confidence 12221111100000 000000000000 00000 000000 000 11122223444455555455665432 1211111 Q ss_pred HHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEE Q lcl|NC_018086. 110 QPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIR 188 (511) Q Consensus 110 ~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~ 188 (511) .+-............+..+.+.+|.||+.+-.+..|++ .+..++|..+.+..+.... . .+|.+..+.. ... T Consensus 78 ~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~-~-----~~y~~~~~~~--~~~ 149 (386) T protein:vir:48 78 DNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKD-G-----IYYNITFDDP--RIP 149 (386) T ss_pred hcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEcCCCc-e-----EEEEEEecCc--ccc Confidence 21122223345667788899999999999888888875 5677888888766554321 1 1111111111 101 Q ss_pred EEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 189 TYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDI 268 (511) Q Consensus 189 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~ 268 (511) ....+.++.++|++.-. ....-.|.|.+......+.....+..-....+ T Consensus 150 ~~~~~~~~evih~~~~~-------------------------------~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~ 198 (386) T protein:vir:48 150 PKQHVPQGDVLHFKLLS-------------------------------VDGGLTSVSPLMALSRELNIQKASDKLTLNSL 198 (386) T ss_pred ceeEecCccEEEecCCC-------------------------------CCCceeeccHHHHHHHHHHHHHHHHHHHHHHH Confidence 11123344444432100 00012477777776666666655555555556 Q ss_pred HHhcCceeEeecCCCCccchhhhhh---------hhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccc Q lcl|NC_018086. 269 AYWNDAYLWLQGFDLSADSDSISNM---------KNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDL 339 (511) Q Consensus 269 ~~~~~p~l~~~G~~~~~~~~~~~~~---------~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~ 339 (511) ...+.|-.+++-..... .+....+ ..++++.++++.+++.+........+.+..+...+.|+..-++|.. T Consensus 199 ~ng~~~~~ii~~~~~~~-~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~ 277 (386) T protein:vir:48 199 KNALNANGILKIKGGGL-LDFKTKLSRSRQAMKQMQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPEN 277 (386) T ss_pred hccCCcceEEEeCCCCC-HHHHHHHHHHHHHhhcCCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHH Confidence 66666766665432221 2222211 1234666666656555544444455667778888999999999876 Q ss_pred ccccccC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHH Q lcl|NC_018086. 340 VSKDFTA-ASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMA 418 (511) Q Consensus 340 ~~~~~~~-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~ 418 (511) -.+..+. .+.+.. ....+..+|.-+++.|...++..-. . ++.+.+...+..+....+..+ T Consensus 278 ~lg~~~~~~~~e~~-------------~~~~~~~~l~P~~~~ie~~l~~~l~-~-----~~~~~~~~~~~~d~~~~~~~~ 338 (386) T protein:vir:48 278 VVGGQGDQQSSLEM-------------SLDLYNKAVSRYLRPFLSELSQKLS-C-----DVDADILPAVDPTGSNSVSRI 338 (386) T ss_pred HhCCCCCcccHHHH-------------HHHHHHHHHHHHHHHHHHHHHHhhc-c-----hhhcchhhhhccChHHHHHHH Confidence 5543221 111111 0112233333333333332221100 0 112222223334555666666 Q ss_pred HHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCcccc Q lcl|NC_018086. 419 VKL--RDMLPDETIINQFPW--ITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAAN 477 (511) Q Consensus 419 ~~~--~g~~s~et~~~~l~~--v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (511) .++ +|+++.-.+++.++. +.. .++.+. ...+. ....|++++..+ T Consensus 339 ~~l~~~g~~t~nE~r~~lg~~~~~~--~~~~~~------------~~~~~-~~~~gGd~~~~~ 386 (386) T protein:vir:48 339 NSMVKSGTLAQNQGLYILQQAEILP--KELPEG------------ENPNK-TTLKGGEINGED 386 (386) T ss_pred HHHHhCCCcCHHHHHHHhhcCCCCC--ccchhh------------cCCCC-CccCCCCCCCCC Confidence 665 578888778776642 221 111110 00111 111111111111 No 194 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=97.54 E-value=4.8e-05 Score=44.29 Aligned_cols=463 Identities=10% Similarity=0.057 Sum_probs=169.7 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccch Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNF 80 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~ 80 (511) |.=.=.+..+.-+.......+... .-....+.++. .++|.++. ++. +...+.......-..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~----~~~~~~~~-~~~-p~~~~~~L~~~~e~~~~ 63 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADL-----------AKSPNSTQIPD----HRIQSHNV-GVN-PPYNPDRLAAFLELNET 63 (651) T ss_pred CCCccceeeeeEEEeecccccccc-----------cccccccccch----hhhcccCC-CCC-CCCCHHHHHHHHhcChH Confidence 110000000000000000000000 00001111111 13444433 221 21122222222334789 Q ss_pred HHHHHHHHHhhhhccCceec------Cchh---hHHHHHHHHhc---------------cChhHHHHHHHHHHhhCCeEE Q lcl|NC_018086. 81 PKLLVDTSTAYLAGEPITES------GDEK---TIKAMQPVFKE---------------NYVTDVNSEEVKLSGIFGHCF 136 (511) Q Consensus 81 ~k~ivd~~~~~l~g~~~~~~------~d~~---~~~~l~~~~~~---------------n~~~~~~~~~~~~a~~~G~~~ 136 (511) .+..|+..+..+.|-|+.+. .++. -.+.++.+|.. ..+......+..+...+|.+| T Consensus 64 ~~~~i~~~~~~iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ 143 (651) T protein:vir:99 64 LATGIRKKSRYEVGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLA 143 (651) T ss_pred HHHHHHHHhhhhhccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHh Confidence 99999999999999887642 2221 23344555433 123455666777888889888 Q ss_pred EEeeeCCCCce-EEEEEcccceEEEecCCCC-CceEEEE-------------------------EEEEEeecCCcceEEE Q lcl|NC_018086. 137 EIHWIDRNKKH-RFKAVSPMNCLIAYSADLD-EEPVAAI-------------------------YYNTVISDITGHQIRT 189 (511) Q Consensus 137 ~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~-~~~~~~v-------------------------~~~~~~~~~~~~~~~~ 189 (511) +-+..+..|++ .+..++|..+ .+..+... ......+ .++....+..+.. .. T Consensus 144 ieiIrn~~g~pv~L~~lp~~~~-Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~-~~ 221 (651) T protein:vir:99 144 LEMLTDIEGRPVGLAYVPARTV-RVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQ-EV 221 (651) T ss_pred hhhhhcCccchhhhhhcChhhe-eeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccce-ee Confidence 87776665553 2222333221 11110000 0000000 0000000000000 00 Q ss_pred EEEEcCCcEEEEEEccCcccccc---cccccccccccceeccCCccc---eEeecCC-----cccCchhHHHHHHHHHHH Q lcl|NC_018086. 190 YEVYTEDLIYKFSTDDEREVYRE---IPEELEIKDYEVHPNLLQKFP---VLEIIAN-----EERLGDFEAQLSLIDAYN 258 (511) Q Consensus 190 ~~~~~~~~i~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~g~iP---vv~~~n~-----~~g~s~~~~v~~l~d~~~ 258 (511) +.......+.............. ......+.. ....+...+| |++|+.. ..|.|.+..+...+.... T Consensus 222 ~~~~~~~~v~~~~~~d~~~~~~~~~~~~~~g~~~~--~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~ 299 (651) T protein:vir:99 222 VIDESGDEPTIRYREDEESEREPIFVDRETGDVTT--GDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADE 299 (651) T ss_pred eeccCCcceeEEeccCcceeeeeecccceeeeEEE--cCCCceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHHH Confidence 00000000000000000000000 000000000 0001111123 5666532 257777777666665555 Q ss_pred HHHHHHHHHHHHhcCceeEee--cCCCCccchhhhhhh---------hCceeeecC---------CCceeeeecCC---C Q lcl|NC_018086. 259 LAVSDSVNDIAYWNDAYLWLQ--GFDLSADSDSISNMK---------NDRVIVTDE---------DGMVKFITKDV---N 315 (511) Q Consensus 259 ~~~s~~~~~~~~~~~p~l~~~--G~~~~~~~~~~~~~~---------~~~~i~~~~---------~~~~~~~~~~~---~ 315 (511) .+..-..+.+...+.|-.++. |...+. +....++ .++.+.++. +.+++|..... . T Consensus 300 ~a~~~~~~~f~NG~~p~gil~~~~~~ls~--e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~ 377 (651) T protein:vir:99 300 AAKDYNRDFFDNDTIPRMVIKVTGGELSE--ESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISE 377 (651) T ss_pred HHHHHHHHHHhccCCCceEEEecCCCCCH--HHHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcCchh Confidence 554444555555566666654 432222 2222221 234444443 23555554432 2 Q ss_pred HHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CCcc Q lcl|NC_018086. 316 DKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMN--KAKD 393 (511) Q Consensus 316 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~--~~~~ 393 (511) ...+....+.....|+..-++|....+.....+...++... ...+..+|.-+++.|...++..- .... T Consensus 378 D~qfle~r~~~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~----------~~f~~~tL~P~~~~ie~eln~kLl~~~e~ 447 (651) T protein:vir:99 378 EMDFRQFREKNEHEIAKVLEVPPVKIGVTDSANRSNSDQQD----------KDFALEVIQPEQHTFAEWLYQIIHQQALG 447 (651) T ss_pred hHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCcccHHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcCcccc Confidence 34566677778888999999986554433222222221111 12223334444444333333211 1111 Q ss_pred ccccceeEEeC--CCCCcCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_018086. 394 LKPYEVTPVFV--RNLPQSYAELADMAVKL--RDMLPDETIINQFPW--ITDARQEVEKADAQRQKRADIALQNFKQTSA 467 (511) Q Consensus 394 ~~~~~i~i~f~--~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~--v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 467 (511) .....+.+.|+ .-+..|..+.++.+.++ .|+++...++++++. ++++... ..+...+ .. T Consensus 448 ~~~~~i~~ef~~~~llr~D~~~~~e~~~~~i~~G~~T~NE~R~~lglppi~~~~gd-------------~~l~~~~--~~ 512 (651) T protein:vir:99 448 VTDWTIEYELRGADQPKQEAQLAEQRVRAMRLAGVGLVDEAREELGLDPLGEPYGE-------------MTLSEFE--AE 512 (651) T ss_pred ccCceEEEEeccchhhhccHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccc-------------ccccccc--cc Confidence 11223445553 34456778888888776 489998889888764 2221100 0000010 00 Q ss_pred CCCCCCccccccCCCCCCccccccCCCCccccccccCCCCC-CCC Q lcl|NC_018086. 468 VQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKP-KTD 511 (511) Q Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 511 (511) ..+.....++..+....+..+..+.++..+...+-..++.- ... T Consensus 513 ~~g~~~~gge~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~~~~ 557 (651) T protein:vir:99 513 VAGDVAGGGETEAVHEPPEENKIGEREWDTVKSELTTKDPIEQMQ 557 (651) T ss_pred cccccccCCCCcccccCccccccccchhhhhhhhhcccchhhhhh Confidence 00100000000000000000111111111110000000000 000 No 195 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=97.54 E-value=4.8e-05 Score=44.25 Aligned_cols=361 Identities=10% Similarity=0.070 Sum_probs=157.8 Q ss_pred HHHHHHHHHHHHHH-----HHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCceecCchhhHHH Q lcl|NC_018086. 34 ITLAEMHSRSSSAY-----GVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDEKTIKA 108 (511) Q Consensus 34 ~~~~~~~~~~~~~~-----~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~~~ 108 (511) +.++.+...+.+.- ..+...+.+-. ....... +..-+..+-....|+..++-+-+-|+.+-.... .. T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~v~---~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~--~~ 72 (382) T protein:vir:48 1 MPIFNLATESPPDNQGGFFDVVDSDFLASL---KGNEWVS---AETALRNSDLFSIINQLSNDLATVKLITSRKKL--QG 72 (382) T ss_pred CccccccccCCcccccccccchhhhccccc---cCCcccc---hHhhhccHHHHHHHHHHHHhhccCceeeecchh--hh Confidence 22222211111100 00000000000 0000000 000011223334556566655556776543221 11 Q ss_pred HH-HHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcce Q lcl|NC_018086. 109 MQ-PVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQ 186 (511) Q Consensus 109 l~-~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~ 186 (511) |. +-............+..+.+.+|.||+++-.+..|++ .+..++|..+.++.++.... + +|.+..+..... T Consensus 73 L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~~~-~-----~y~~~~~~~~~~ 146 (382) T protein:vir:48 73 IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDG-I-----YYNITFDDPRIP 146 (382) T ss_pred hhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCCe-E-----EEEEEecCcccc Confidence 21 1111123345667788889999999999988888875 67778899887766543221 1 111111111000 Q ss_pred EEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 187 IRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVN 266 (511) Q Consensus 187 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~ 266 (511) ....+.++.+++++.-. ......|.|.+..+...++.......-..+ T Consensus 147 --~~~~~~~~evih~~~~~-------------------------------~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~ 193 (382) T protein:vir:48 147 --PKQHVPQNDVLHFRLLS-------------------------------VDGGMTSVSPLMALSRELDIQKASGNLTIN 193 (382) T ss_pred --ceeEEcCccEEEecCCC-------------------------------CCCccccccHHHHHHHHHHHHHHHHHHHHH Confidence 01123334444432100 001235778777777777766666666666 Q ss_pred HHHHhcCceeEeecCCCCccchhhhhh---------hhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcc Q lcl|NC_018086. 267 DIAYWNDAYLWLQGFDLSADSDSISNM---------KNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTP 337 (511) Q Consensus 267 ~~~~~~~p~l~~~G~~~~~~~~~~~~~---------~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p 337 (511) .++..+.|-.+++-.... ..+....+ ..++++.++++.+++.+........+.+..+...+.|+..-++| T Consensus 194 ~~~ng~~p~~il~~~~~~-~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp 272 (382) T protein:vir:48 194 SLKNALNANGILKIKGGG-LLDFKTKLSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIP 272 (382) T ss_pred HHhccCCCceEEEeCCCC-ChHHHHHHHHHHHhhccCCCCeeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCC Confidence 677777776665432211 11111111 12456777766666656555555566677888889999998998 Q ss_pred ccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHH Q lcl|NC_018086. 338 DLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADM 417 (511) Q Consensus 338 ~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~ 417 (511) ....+..+..+. .. +.....+..+|.-+++.|...+...-. ..+.. +....+. .+....... T Consensus 273 ~~~lg~~~~~~~--~~----------~~~~~~~~~~l~p~~~~i~~~l~~~l~-~~~~~-~~~~~~~----~~~~~~~~~ 334 (382) T protein:vir:48 273 DNVVGGQGDQQS--SL----------EMSSDLYSKAVSRYLRPFLSELSQKLS-CDVDA-DIFPAVD----PTGSNYISR 334 (382) T ss_pred HHHhCCCCCccc--HH----------HHHHHHHHHHHHHHHHHHHHHHHHHhc-Chhhh-hhhhhhc----cchhHHHHH Confidence 765554332211 10 111123333333333333333222110 01110 1111111 122334444 Q ss_pred HHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCcccc Q lcl|NC_018086. 418 AVKL--RDMLPDETIINQF---PWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAAN 477 (511) Q Consensus 418 ~~~~--~g~~s~et~~~~l---~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (511) +.++ +|+++.-.+++.+ ++.++...+.+ . ......|+++++.+ T Consensus 335 ~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~~---------------~--~~~~~~GGd~~~~~ 382 (382) T protein:vir:48 335 INSLVKTGTLAQNQGLYILQQAEILPKELPNGE---------------N--PNSTLKGGEEDGQD 382 (382) T ss_pred HHHHhhcCccCHHHHHHHHhhCCCCCcchhhhh---------------c--CCCCCCCCCCCCCC Confidence 4444 4778877776654 44332111100 0 00111222222211 No 196 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=97.50 E-value=5.6e-05 Score=43.91 Aligned_cols=362 Identities=9% Similarity=0.035 Sum_probs=149.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCC--Cccc---ccCCcCccccccceeccchHHHHHHHHHhhhhccCceecCchhhHHH Q lcl|NC_018086. 34 ITLAEMHSRSSSAYGVLYDYYKGN--HIAI---QSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDEKTIKA 108 (511) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~yY~G~--~~~~---~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~~~ 108 (511) +.+.++.......-....+...+- .... ......... .-+...-....|+..++-+-+-|+++..... ... T Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~---~al~~~~V~~~i~~Ia~~ia~l~~~~~~~~~-~~l 76 (384) T protein:vir:49 1 MPIFNITNLATESPPSNQDSFFDITDPEFLDALNGSEWVSAE---TALKNSDLFSIISQLSNDLATAKITTSRKQL-QGI 76 (384) T ss_pred CccccccccCcccccccchhhccccchhhcccccCCceechh---hhhccHHHHHHHHHHHHHHhhCceeeecchh-hhh Confidence 222221110000000000000000 0000 000000000 0011222334566666666666776542211 111 Q ss_pred HHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceE Q lcl|NC_018086. 109 MQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQI 187 (511) Q Consensus 109 l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~ 187 (511) +.+-............+..+.+.+|.||+.+-.+..|++ .+..++|..+.++.++.... + +|.+........ T Consensus 77 ~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~-----~-~y~~~~~~~~~~- 149 (384) T protein:vir:49 77 VDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQNG-----L-YYNITFDDPRIP- 149 (384) T ss_pred hhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCCce-----E-EEEEEecCcccc- Confidence 111111223445667788899999999999988888875 57778888887765443211 1 111111111000 Q ss_pred EEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 188 RTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVND 267 (511) Q Consensus 188 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~ 267 (511) ....+.++.++|++.-. ....-.|.|.+..+...++..........+. T Consensus 150 -~~~~~~~~eVih~~~~~-------------------------------~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~ 197 (384) T protein:vir:49 150 -PKQHVPQGDILHFRLLS-------------------------------VDGGLTSVSPLMALGRELNIQKASDKLTLNA 197 (384) T ss_pred -ceeEecCccEEEecCCC-------------------------------CCCceeeccHHHHHHHHHHHHHHHHHHHHHH Confidence 00123344444442100 0011247777777777776666555555566 Q ss_pred HHHhcCceeEeecCCCCccchhhhhh--------hhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccc Q lcl|NC_018086. 268 IAYWNDAYLWLQGFDLSADSDSISNM--------KNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDL 339 (511) Q Consensus 268 ~~~~~~p~l~~~G~~~~~~~~~~~~~--------~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~ 339 (511) +...+.|-.+++--.....++..... ..++++.++++.+++.+........+.+..+.+.+.|+..-++|.. T Consensus 198 ~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~ 277 (384) T protein:vir:49 198 LKNALNANGILKIKGGGLLDFKTKQSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPES 277 (384) T ss_pred HhccCCCceEEEeCCCCChHHHHHHHHHHHhcccCCccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHH Confidence 66667776665432222212111111 1245667766666555544445556667778888999999999875 Q ss_pred cccccc--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHH Q lcl|NC_018086. 340 VSKDFT--AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADM 417 (511) Q Consensus 340 ~~~~~~--~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~ 417 (511) -.+..+ ..++..++..+...+.. ...-+...|.+.+ .. . +.....+....+....... T Consensus 278 ~lg~~~~~~~~~~~~~~~~~~~i~~---~l~pi~~~i~~~l-------~~-----~-----l~~~~~~~~~~~~~~~~~~ 337 (384) T protein:vir:49 278 VVGGEGDKQSSLEMIYNIYFKAVSR---FLRPFVSELSKKL-------SC-----E-----VDADILPAVDPTGSNYIGL 337 (384) T ss_pred HhCCCCCccccHHHHHHHHHHHHHH---HHHHHHHHHHHHh-------ch-----h-----hhhhhhhhhhccchHHHHH Confidence 554322 23444443332222111 1111111111111 00 0 0000011111111112222 Q ss_pred HHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCC-Cccc Q lcl|NC_018086. 418 AVKL--RDMLPDETIINQF---PWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGAS-TAAA 476 (511) Q Consensus 418 ~~~~--~g~~s~et~~~~l---~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~-~~~~ 476 (511) +..+ +++.++..++..+ |+.+ .|+.++. +. .+..|++ ++.- T Consensus 338 ~~~l~~~~~~t~~e~~~~l~~~g~~~---ne~r~~~------------~~---~p~~gGd~~~~~ 384 (384) T protein:vir:49 338 INSMVKTGTLAQNQGLYVLQQAEILP---KDLPEGE------------TD---STLKGGETNEQY 384 (384) T ss_pred HHHHhhcCcccHHHHHHHHhhCCCCC---hhHHHHc------------CC---CCCCCCCCCCCC Confidence 2222 3566666665554 4332 2222210 11 1111111 1111 No 197 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=97.44 E-value=6.6e-05 Score=43.49 Aligned_cols=385 Identities=9% Similarity=-0.034 Sum_probs=171.0 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCceec-Cchh---- Q lcl|NC_018086. 31 RELITLAEMHSR-SSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITES-GDEK---- 104 (511) Q Consensus 31 ~~l~~~~~~~~~-~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~-~d~~---- 104 (511) ..+.++..+... .......+...+-+..... ....... ..=+........|+..++-+-+-|+.+- .+++ T Consensus 1 ~~f~~~f~r~~~~~~~~~~~~~~~~~~~~~~~-~g~~v~~---~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~ 76 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTTPAELAEAIGLSYDTY-TGKRISS---QRAMRLTAVYSCVRVLAESVGMLPCSLYKISGTLKTR 76 (413) T ss_pred CccchhhccCccCCccchHHHHHhhhcCcccc-cCceech---hhhhccHHHHHHHHHHHHhhhhCceEEEEecCCccee Confidence 112222222111 1111122223222211100 0000000 0001123344456666666656666542 1111 Q ss_pred -hHHHHHHHHhc-----cChhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEEE Q lcl|NC_018086. 105 -TIKAMQPVFKE-----NYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNT 177 (511) Q Consensus 105 -~~~~l~~~~~~-----n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~ 177 (511) ....+..++.. .........+....+.+|.||+++..+ .|++ .+..++|..+.+..+... .+. |. T Consensus 77 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~--~~~-----y~ 148 (413) T protein:vir:48 77 VVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-LGEVVELLPIDPGCVEPKLNSQW--QPV-----YQ 148 (413) T ss_pred ecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-CCcEEEEEEEcCceEEEEEcCCc--eEE-----EE Confidence 11224444431 233456677888999999999888765 4664 467788888877665432 111 11 Q ss_pred EeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHH Q lcl|NC_018086. 178 VISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAY 257 (511) Q Consensus 178 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~ 257 (511) ... .+|.. ..+.++.+++++.- . .+...|.|.+..+...++.. T Consensus 149 ~~~-~~g~~----~~~~~~evih~~~~-----------------------~---------~d~~~G~s~i~~~~~~i~~~ 191 (413) T protein:vir:48 149 VTF-PDGSV----DVLTQDEIWHVRTL-----------------------T---------LDGLVGLNPIAYAREAISLA 191 (413) T ss_pred EEe-cCceE----EEEccccEEEecCc-----------------------C---------CCCcccccHHHHHHHHHHHH Confidence 111 11211 23445555554210 0 01124777777777777766 Q ss_pred HHHHHHHHHHHHHhcCceeEeecCCCCccchhhh----hhh--------hCceeeecCCCceeeeecCCCHHHHHHHHHH Q lcl|NC_018086. 258 NLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSIS----NMK--------NDRVIVTDEDGMVKFITKDVNDKHIENIKNR 325 (511) Q Consensus 258 ~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~----~~~--------~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (511) .....-....+...+.|-.+++.... ..++... .+. .++++.++++.+++.+........+.+..+. T Consensus 192 ~~~~~~~~~~~~ng~~p~gil~~~~~-~~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~ 270 (413) T protein:vir:48 192 AATEEHGARLFGNGAVTSGVLRTEQK-LTPDAYERLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKF 270 (413) T ss_pred HHHHHHHHHHHhccCCcceEEEeCCC-CCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEeccCChhHHHHHHHHHH Confidence 65555555556666667666554321 1112111 111 1345666666666555544445556677778 Q ss_pred HHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CccccccceeEEeC Q lcl|NC_018086. 326 AKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNK-AKDLKPYEVTPVFV 404 (511) Q Consensus 326 l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~-~~~~~~~~i~i~f~ 404 (511) ....|+..-++|..-.+..+..+...++... ...+..+|.-+++.|...+...-. ........+++.+. T Consensus 271 ~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~----------~~f~~~~i~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~ 340 (413) T protein:vir:48 271 QLEEICRLFRVPLHMVQNTDRATFNNIEELG----------LGFINYSLVPYLTRIEQRINTGLVRESKQGKFYAKFNAG 340 (413) T ss_pred HHHHHHHHhCCCHHHhCCCcCCCcccHHHHH----------HHHHHHHHHHHHHHHHHHHHhhccCccccCCeEEEEech Confidence 8888999889887555443322222222111 122233333333333333322111 11111223455555 Q ss_pred CCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCC Q lcl|NC_018086. 405 RNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKN 482 (511) Q Consensus 405 ~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 482 (511) ..+-.|..+.++++.++ .|+++.-.++.+++.-+-+.. +. .....+..... ..+++ T Consensus 341 ~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~gg--D~-----------~~~~~n~~~~~---------~~~~~ 398 (413) T protein:vir:48 341 ALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGG--DV-----------YLTPMNMTTSP---------SAGDD 398 (413) T ss_pred hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--ce-----------eeccccccccc---------ccccc Confidence 66667888999998887 478888778887765321100 00 00000000000 00000 Q ss_pred CCCccccccCCCCccccccccCCCCCCC Q lcl|NC_018086. 483 PANTSTITTTDPVAAKEQEKAIQKKPKT 510 (511) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (511) . ..+++++-.++..| T Consensus 399 ~-------------~~~~~~~~~~~~~~ 413 (413) T protein:vir:48 399 N-------------GKKKESGDADKTAS 413 (413) T ss_pred C-------------CCCCCCCCccccCC Confidence 0 00011111111111 No 198 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=97.44 E-value=6.7e-05 Score=43.46 Aligned_cols=359 Identities=9% Similarity=-0.010 Sum_probs=154.3 Q ss_pred HHHHHHHHHHH--HH--H----HHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCceecCchhh Q lcl|NC_018086. 34 ITLAEMHSRSS--SA--Y----GVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDEKT 105 (511) Q Consensus 34 ~~~~~~~~~~~--~~--~----~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~ 105 (511) +.++......+ ++ . .....+..|... ..... ...-+...-....|+..++-+-.-|+++..+. . T Consensus 1 Mg~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~----~~~v~---~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~-~ 72 (383) T protein:vir:10 1 MGLLTPKNFSKRNAKNMVYPSNPAFFTTTVGGMQ----LSYVS---ALSALQNTNVYSVINRIASDVSSAHFKTENTA-T 72 (383) T ss_pred CCcccccccccccccccccccchhhhhhhccCcc----ccccc---hhHhhcchHHHHHHHHHHHhhccCceeecccc-h Confidence 22221110000 00 0 000000000000 00000 00001122233455555555555677653221 1 Q ss_pred HHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcc Q lcl|NC_018086. 106 IKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGH 185 (511) Q Consensus 106 ~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 185 (511) ...+.+-............+..+.+.+|.||+++..+. ..+..++|..+.+..+.. . +. |......++. T Consensus 73 ~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~~---~~~~p~~~~~v~~~~~~~--~-----~~-~~~~~~~~~~ 141 (383) T protein:vir:10 73 LNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---LEHIPNSDVQINYLPGNM--G-----IV-YTVLESNDRP 141 (383) T ss_pred hhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---eeEeecCcceEEEEEcCC--c-----eE-EEEEEcCCce Confidence 11122111112344566778888889999998875432 233334444333322211 0 00 1111111111 Q ss_pred eEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 186 QIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSV 265 (511) Q Consensus 186 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~ 265 (511) . ..+.++.++|++.-. .+. .+...|.|.+......++....+..-.. T Consensus 142 ~----~~~~~~evih~r~~~--------------------------~~~---~~~~~G~s~l~~~~~~i~~~~~~~~~~~ 188 (383) T protein:vir:10 142 K----MVLRQDQMLHFRLMP--------------------------DPQ---YRYLIGRSPLESLQNALNLDDKASKSNM 188 (383) T ss_pred E----EEEcccceEEeccCC--------------------------CCc---ccccccccHHHHHHHHHHHHHHHHHHHH Confidence 1 112333333332100 000 0112478888887777777777766666 Q ss_pred HHHHHhcCceeEeecCCCCccchhhhh----hh-------hCceeeecCCCceeeeecCCCHHH-HHHHHHHHHHHHHHH Q lcl|NC_018086. 266 NDIAYWNDAYLWLQGFDLSADSDSISN----MK-------NDRVIVTDEDGMVKFITKDVNDKH-IENIKNRAKLDIFSL 333 (511) Q Consensus 266 ~~~~~~~~p~l~~~G~~~~~~~~~~~~----~~-------~~~~i~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~~i~~~ 333 (511) ..+...+.|-.++.-.....+++.... +. .++++.++++.+++.+..+..... +.+..+...+.|+.. T Consensus 189 ~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~a 268 (383) T protein:vir:10 189 SAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKA 268 (383) T ss_pred HHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHH Confidence 666776777555543221111222211 11 234666766666655544443333 345667778888988 Q ss_pred hCcccccccc--ccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCH Q lcl|NC_018086. 334 SQTPDLVSKD--FTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSY 411 (511) Q Consensus 334 s~~p~~~~~~--~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~ 411 (511) -++|..-.+. .++.++..++. ....|..+|.-+++.|...+...- -...+++.+...+..|. T Consensus 269 fgVPp~~lg~~~~~~~~~sn~eq-----------~~~~~~~~l~P~~~~ie~~l~~~l-----~~~~~~f~~~~l~~~d~ 332 (383) T protein:vir:10 269 FGVPSDILGGGTSTESQHSNIDQ-----------IKATYLANLNSYVNPIVDELRLKM-----NAPDLELDIKDMLDVDD 332 (383) T ss_pred hCCCHHHcCCccCCCCccccHHH-----------HHHHHHHHHHHHHHHHHHHHHHhh-----CCceEEeechhhhccCH Confidence 8988654432 12222222221 111222334444444433332211 11246677788888899 Q ss_pred HHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCcccc Q lcl|NC_018086. 412 AELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTI 489 (511) Q Consensus 412 ~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (511) .+.++++.++ .|+++...++..++.-.-+..++.+. ........ T Consensus 333 ~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~--------------~~~~~~~~-------------------- 378 (383) T protein:vir:10 333 SILINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEF--------------KPLTNETK-------------------- 378 (383) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCccccc--------------CCCcccCC-------------------- Confidence 9999998887 47898888888775422100000000 00000000 Q ss_pred ccCCCCcccc Q lcl|NC_018086. 490 TTTDPVAAKE 499 (511) Q Consensus 490 ~~~~~~~~~~ 499 (511) +|.++ T Consensus 379 -----gGd~e 383 (383) T protein:vir:10 379 -----GGDDK 383 (383) T ss_pred -----CCCCC Confidence 01110 No 199 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=97.42 E-value=7.1e-05 Score=43.34 Aligned_cols=386 Identities=9% Similarity=-0.022 Sum_probs=159.2 Q ss_pred hhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccc-cccceeccchHHHHHHHHHhhhhccCceecC Q lcl|NC_018086. 23 FIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTN-KPNSKIVHNFPKLLVDTSTAYLAGEPITESG 101 (511) Q Consensus 23 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~-~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~ 101 (511) +.++++-.+.--.+..++.+ .--.+-............. -.+.=+........|+..++-+-.-|+.+-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~ 71 (409) T protein:vir:93 1 MAKENIVTRIKKKLIDNWID---------QSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYE 71 (409) T ss_pred CCccchhhhhhhhhhhhhhc---------cccccccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEee Confidence 11111110000001111100 0000000000000000000 0000012233344555555555555766421 Q ss_pred -chhhHHHHHHHHhc--cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEE Q lcl|NC_018086. 102 -DEKTIKAMQPVFKE--NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIY 174 (511) Q Consensus 102 -d~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~ 174 (511) .+.....+..++.. |. -......+..+.+.+|.||+++..+..|++ .+..++|..+.+..++... .+ T Consensus 72 ~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~~~~~~~~~-~~----- 145 (409) T protein:vir:93 72 DYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSR-EL----- 145 (409) T ss_pred ccccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCc-EE----- Confidence 12222234444432 32 334557788888999999999988888875 5777888888776654332 11 Q ss_pred EEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHH Q lcl|NC_018086. 175 YNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLI 254 (511) Q Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~ 254 (511) +|... ...+.. ..+.++.++|++.- ++. +.-.|.|.++.+...+ T Consensus 146 ~y~~~-~~~g~~----~~~~~~eVih~r~~----------------------~~~---------~~~~G~s~i~~~~~~i 189 (409) T protein:vir:93 146 YYSIH-AATGNK----LIVHNMDMLHFKHI----------------------VAS---------NMVQGISPIDVLKNTT 189 (409) T ss_pred EEEEE-cCCceE----EEEccccEEEeCCC----------------------CCC---------CccccccHHHHHHHHH Confidence 11111 111211 12344444444210 000 1124777776655555 Q ss_pred HHHHHHHHHHHHHHHHhcCc-eeE-eecCCCCccchhhhhh--------h-hCceeeecCCCceeeeecCCCHHHHHHHH Q lcl|NC_018086. 255 DAYNLAVSDSVNDIAYWNDA-YLW-LQGFDLSADSDSISNM--------K-NDRVIVTDEDGMVKFITKDVNDKHIENIK 323 (511) Q Consensus 255 d~~~~~~s~~~~~~~~~~~p-~l~-~~G~~~~~~~~~~~~~--------~-~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 323 (511) +..+.+... .+..+..+ -.+ ..+...++ +....+ . .++++.++++.+++.+........+.+.. T Consensus 190 ~~~~~~~~~---~~~~~~~~~~~i~~~~~~l~~--e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~r 264 (409) T protein:vir:93 190 DFDNAVRTF---NLTEMQKPDSFMLKYGSNVGK--EKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASE 264 (409) T ss_pred HHHHHHHHH---HHHhcCCCCceEEecCCCCCH--HHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHH Confidence 544332111 23333333 222 23333222 222211 1 23466666665655554444445566667 Q ss_pred HHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCccc-cccceeE Q lcl|NC_018086. 324 NRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMN-KAKDL-KPYEVTP 401 (511) Q Consensus 324 ~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~-~~~~~-~~~~i~i 401 (511) +.....|+..-++|+.-.+..++++...++.... ..+..+|.-+++.|...+...- ..... ....+++ T Consensus 265 ~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~----------~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~~~~~~f 334 (409) T protein:vir:93 265 NLTRERVANVFQLPSVFLNARSNTNFAKNEELNR----------FYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKF 334 (409) T ss_pred HHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH----------HHHHHHHHHHHHHHHHHHHhhcCCcccccCcceEEe Confidence 7778889999899876555443333333222211 2223334444444333332210 11111 1122444 Q ss_pred EeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCcccccc Q lcl|NC_018086. 402 VFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKL 479 (511) Q Consensus 402 ~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (511) .+..-+-.|..+.++++.++ +|+++.-.++..++.-+-+. .++.- -..+. .+.+....... T Consensus 335 d~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~g--gD~~~-----------~~~n~----~~~~~~~~~~~ 397 (409) T protein:vir:93 335 NVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEG--GDKPL-----------ISGDL----YPIDTPLELRK 397 (409) T ss_pred echhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCeee-----------ecccc----cccccchhhcc Confidence 44455556888899988887 47888888888886532110 00000 00000 00000000000 Q ss_pred CCCCCCcccccc Q lcl|NC_018086. 480 DKNPANTSTITT 491 (511) Q Consensus 480 ~~~~~~~~~~~~ 491 (511) ..++|..+...+ T Consensus 398 ~~~gG~~n~~e~ 409 (409) T protein:vir:93 398 SLKGGDKNVNES 409 (409) T ss_pred cccCCCCCcCCC Confidence 001111111000 No 200 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=97.37 E-value=8.3e-05 Score=42.96 Aligned_cols=393 Identities=10% Similarity=0.019 Sum_probs=179.8 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHH-hcC----CCcccccCCcCccccccce Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDY-YKG----NHIAIQSRTFDDTNKPNSK 75 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~y-Y~G----~~~~~~~~~~~~~~~~~~r 75 (511) |.-.| +...++...... .. ..+...+.... .. +..+ +.| .-.++............. T Consensus 1 ~~~~i--------~~~~g~~~~~~~--~~-~~~~~~ia~~~---~~---~~~~~~~~~~p~~~~il~~~~~~~~~y~~m- 62 (491) T protein:vir:79 1 MSKGL--------WVSPTEFVKFGE--PD-KSLSSQIATRA---RS---IDFFALGMYLPNPDPVLKALGKDIRVYREL- 62 (491) T ss_pred CCCee--------eCCCCCcccccc--cc-hhHHHHHhhhc---cc---cccccccccCcchhHHHhhccCCHHHHHHH- Confidence 44433 223333222211 00 11111111000 00 0010 111 000100000000000000 Q ss_pred eccchHHHHHHHHHhhhhccCceec---CchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEE-EEeeeCCCCceE--- Q lcl|NC_018086. 76 IVHNFPKLLVDTSTAYLAGEPITES---GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCF-EIHWIDRNKKHR--- 148 (511) Q Consensus 76 i~~n~~k~ivd~~~~~l~g~~~~~~---~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~-~~v~~~~~g~~~--- 148 (511) .......-.+.+...-+.+.++.+. .+++..+.+.+.+..-+|...+..+ .++.-||.++ +++|...+|... T Consensus 63 ~~D~~i~s~l~~Rk~av~~~~w~i~~~~~~~~~a~~i~e~l~~~~~~~~i~~~-lda~~~G~s~~Ei~w~~~~g~~~~~~ 141 (491) T protein:vir:79 63 RADAHVGGCVRRRKAAVKALEWGLDRGKAKSRVAKSIADVFADLDLSRIATEM-LDAVLYGYQPMEITWGKVGNYIVPID 141 (491) T ss_pred hhChHHHHHHHHHHHHHhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHH-HHhhhhcceeEEEEEeecCCeeeEEe Confidence 1356677777877788888898874 3344567888888887888877765 5788899765 566655555543 Q ss_pred EEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceecc Q lcl|NC_018086. 149 FKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNL 228 (511) Q Consensus 149 i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (511) +..++|..+ .|++... + + +...... .. .....+ T Consensus 142 l~~r~~~~f--~~d~~~~--l----~--------------------------l~~~~~~------------~~-g~~lp~ 174 (491) T protein:vir:79 142 VVGKPADWF--VYDPENQ--L----R--------------------------FRSKEHW------------VQ-GEELPA 174 (491) T ss_pred eeeecccce--eeccCCc--e----E--------------------------EeecCCC------------CC-ceeecC Confidence 333444322 1222111 0 0 0000000 00 000112 Q ss_pred CCccceEeec--CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccc------hhhhhhhhCceee Q lcl|NC_018086. 229 LQKFPVLEII--ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADS------DSISNMKNDRVIV 300 (511) Q Consensus 229 ~g~iPvv~~~--n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~------~~~~~~~~~~~i~ 300 (511) ++.|-..+-. .++.|.|.+..+....--=+..+.+++..++.|+.|+++.+=.....++ ....++....++. T Consensus 175 ~k~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~a~~~ek~~l~~al~~~~~~a~~v 254 (491) T protein:vir:79 175 RKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASDAETNLLLDRLEDMVQDAVAV 254 (491) T ss_pred CCeEEEEecCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhcCeEEE Confidence 2223222211 3567888888876666666677888999999999999987732221111 1223455567888 Q ss_pred ecCCCceeeeecCC---CHHHHHHHHHHHHHHHHHHhCcccccccccc-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 301 TDEDGMVKFITKDV---NDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-AASGQALKAATQPLENKSAVKESKFRKVLAK 376 (511) Q Consensus 301 ~~~~~~~~~~~~~~---~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~ 376 (511) +|.+.++++++... +...++.+++.+.+.|...--.-.++.+..| .+.|. ....-....++.-.+.+...+.+ T Consensus 255 iP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~~gs~a~~~---vh~~v~~~i~~~D~~~i~~tln~ 331 (491) T protein:vir:79 255 IPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALLGQNQTTEATSTRASAQ---AGLEVTDDIRDGDKAIVVEAMNM 331 (491) T ss_pred ecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHhhhhhccCcccchhhHH---HHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999987542 3455788888887776665422112222222 22222 12222333344445566666666 Q ss_pred HHHHHHHHHHhcCCCccccccceeEEeCCCCCcCH-HHHHHHHHHHh--cc-CChHHHHHhCCCCCCHHHHHHHHHHHHH Q lcl|NC_018086. 377 RYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSY-AELADMAVKLR--DM-LPDETIINQFPWITDARQEVEKADAQRQ 452 (511) Q Consensus 377 ~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~-~e~a~~~~~~~--g~-~s~et~~~~l~~v~d~~~E~~ri~~E~~ 452 (511) +++-++.+ .... ...+.+.|.. +.+. ...++.+.+++ |+ ++.+.+.+.++. +.++.+.+ . T Consensus 332 li~~l~~~---N~~~----~~~p~f~~~e--~ee~~~~~a~~~~~L~~~G~~i~~~~~~e~~Gi-p~~~~~e~-~----- 395 (491) T protein:vir:79 332 LIRWICDL---NFDG----AARPVFDMWE--QEQVDEIQAGRDEKLTRAGARFTPAYFKRAYNL-QDGDLDER-P----- 395 (491) T ss_pred HHHHHHHh---cCCC----CCcceEeecC--cCchhHHHHHHHHHHHhCCCccCHHHHHHHhCC-CCCCCCcc-c----- Confidence 55554433 2221 1123344543 3333 45677777775 55 788888887774 32211100 0 Q ss_pred HHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 453 KRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 453 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) + +......+. .+ ...+...++....| T Consensus 396 ------~---~~~~~~~~~--------------~~----------~~~~~~~~~~~~~d 421 (491) T protein:vir:79 396 ------L---PVSAVDAVG--------------AA----------SFAEFEAPDQDALD 421 (491) T ss_pred ------c---CcCcccccc--------------cc----------cccccCCCCCcchH Confidence 0 000000000 00 00000011111112 No 201 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=97.35 E-value=8.7e-05 Score=42.85 Aligned_cols=400 Identities=13% Similarity=0.021 Sum_probs=165.9 Q ss_pred ccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-ccccCC------------cCccc--cccce Q lcl|NC_018086. 11 GDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHI-AIQSRT------------FDDTN--KPNSK 75 (511) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~-~~~~~~------------~~~~~--~~~~r 75 (511) =.-.++.-.+-++.-.. ..+.++...--+...+.. +..... ..... ..... T Consensus 1 ~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 66 (441) T protein:vir:79 1 MHWYNTDCYFVDFKSRK--------------QSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIE 66 (441) T ss_pred CccccCccccccccccc--------------cchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhh Confidence 00001110000000000 001111111111111000 000000 00000 00000 Q ss_pred -eccchHHHHHHHHHhhhhccCceecCchh--hHHHHHHHHh--ccCh---hHHHHHHHHHHhhCCeEEEEeeeCCCCce Q lcl|NC_018086. 76 -IVHNFPKLLVDTSTAYLAGEPITESGDEK--TIKAMQPVFK--ENYV---TDVNSEEVKLSGIFGHCFEIHWIDRNKKH 147 (511) Q Consensus 76 -i~~n~~k~ivd~~~~~l~g~~~~~~~d~~--~~~~l~~~~~--~n~~---~~~~~~~~~~a~~~G~~~~~v~~~~~g~~ 147 (511) +...-.-..|+..++-+-.-|+.+..+.. ....+..++. -|.. ......+....+.+|.||+.+..+..|++ T Consensus 67 al~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~ 146 (441) T protein:vir:79 67 AIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEP 146 (441) T ss_pred hhccHHHHHHHHHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcE Confidence 00111112345555555455766533221 1122334432 2332 34556778888999999999988888886 Q ss_pred -EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccccccccccccccee Q lcl|NC_018086. 148 -RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHP 226 (511) Q Consensus 148 -~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (511) .+..++|..+.++.++.. .+ +++....+..+... ...+.+..+++++. T Consensus 147 ~~L~~i~~~~v~v~~d~~g--~~----~~~~~~~~~~~~~~--~~~~~~~dvih~k~----------------------- 195 (441) T protein:vir:79 147 MNLTFRKTSEIELKSDARG--RL----YYFHQRIDSNGNNI--ERNVKFEDMLDIKF----------------------- 195 (441) T ss_pred EEEEEEcCceeEEEECCCc--cE----EEEEEEeccCCcee--EEEEccccEEEecc----------------------- Confidence 578899999988776532 11 11111111111111 12344444444421 Q ss_pred ccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhh----hh----h---- Q lcl|NC_018086. 227 NLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSIS----NM----K---- 294 (511) Q Consensus 227 ~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~----~~----~---- 294 (511) +++ +.-.|.|.+..+...++.......-..+.++..+.|-.++.--..-.+++... .+ . T Consensus 196 ~~~---------dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~n 266 (441) T protein:vir:79 196 YSL---------DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQ 266 (441) T ss_pred CCC---------CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccc Confidence 000 01147777776666666555554555555566666766654211111112111 11 1 Q ss_pred hCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc-CccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 295 NDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-AASGQALKAATQPLENKSAVKESKFRKV 373 (511) Q Consensus 295 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 373 (511) .++++.++++.+.+.++.+.....+.+..+...+.|+..-++|....+... +.|.+... ..|... T Consensus 267 ag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~--------------~~~~~t 332 (441) T protein:vir:79 267 AGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDAN--------------LDYLST 332 (441) T ss_pred cCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHH--------------HHHHHH Confidence 134667776666665555555556667777888889998898865554321 12211111 112223 Q ss_pred HHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHH Q lcl|NC_018086. 374 LAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWI--TDARQEVEKADA 449 (511) Q Consensus 374 l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v--~d~~~E~~ri~~ 449 (511) |.-+++.|...+...-. .......+++.+..-+-.|.++.++.+.++ .|+++...++..++.- ++.+..+-.+. T Consensus 333 l~P~~~~ie~eln~kl~-~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~~- 410 (441) T protein:vir:79 333 LKPYITCVCAELNFKFN-DEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVD- 410 (441) T ss_pred HHHHHHHHHHHHhhhcc-ccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeec- Confidence 33333333333322111 111223455555555667888899988887 5789988888877642 22221110000 Q ss_pred HHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCcccccc Q lcl|NC_018086. 450 QRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQE 501 (511) Q Consensus 450 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (511) .+. .+....++...++ ..+...+ -.|++..+ T Consensus 411 ------------~n~-~~~~~~~~~~~~~---~~~~~~~-----~kgGe~~e 441 (441) T protein:vir:79 411 ------------LNH-VNIELVDEYQMNK---SRATDKK-----LKGGEENE 441 (441) T ss_pred ------------ccc-ccccccccccccc---ccccccc-----cCCCCCCC Confidence 000 0000000000000 0000000 00000000 No 202 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=97.35 E-value=8.7e-05 Score=42.85 Aligned_cols=400 Identities=13% Similarity=0.021 Sum_probs=165.9 Q ss_pred ccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-ccccCC------------cCccc--cccce Q lcl|NC_018086. 11 GDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHI-AIQSRT------------FDDTN--KPNSK 75 (511) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~-~~~~~~------------~~~~~--~~~~r 75 (511) =.-.++.-.+-++.-.. ..+.++...--+...+.. +..... ..... ..... T Consensus 1 ~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 66 (441) T protein:vir:94 1 MHWYNTDCYFVDFKSRK--------------QSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIE 66 (441) T ss_pred CccccCccccccccccc--------------cchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhh Confidence 00001110000000000 001111111111111000 000000 00000 00000 Q ss_pred -eccchHHHHHHHHHhhhhccCceecCchh--hHHHHHHHHh--ccCh---hHHHHHHHHHHhhCCeEEEEeeeCCCCce Q lcl|NC_018086. 76 -IVHNFPKLLVDTSTAYLAGEPITESGDEK--TIKAMQPVFK--ENYV---TDVNSEEVKLSGIFGHCFEIHWIDRNKKH 147 (511) Q Consensus 76 -i~~n~~k~ivd~~~~~l~g~~~~~~~d~~--~~~~l~~~~~--~n~~---~~~~~~~~~~a~~~G~~~~~v~~~~~g~~ 147 (511) +...-.-..|+..++-+-.-|+.+..+.. ....+..++. -|.. ......+....+.+|.||+.+..+..|++ T Consensus 67 al~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~ 146 (441) T protein:vir:94 67 AIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEP 146 (441) T ss_pred hhccHHHHHHHHHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcE Confidence 00111112345555555455766533221 1122334432 2332 34556778888999999999988888886 Q ss_pred -EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccccccccccccccee Q lcl|NC_018086. 148 -RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHP 226 (511) Q Consensus 148 -~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (511) .+..++|..+.++.++.. .+ +++....+..+... ...+.+..+++++. T Consensus 147 ~~L~~i~~~~v~v~~d~~g--~~----~~~~~~~~~~~~~~--~~~~~~~dvih~k~----------------------- 195 (441) T protein:vir:94 147 MNLTFRKTSEIELKSDARG--RL----YYFHQRIDSNGNNI--ERNVKFEDMLDIKF----------------------- 195 (441) T ss_pred EEEEEEcCceeEEEECCCc--cE----EEEEEEeccCCcee--EEEEccccEEEecc----------------------- Confidence 578899999988776532 11 11111111111111 12344444444421 Q ss_pred ccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhh----hh----h---- Q lcl|NC_018086. 227 NLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSIS----NM----K---- 294 (511) Q Consensus 227 ~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~----~~----~---- 294 (511) +++ +.-.|.|.+..+...++.......-..+.++..+.|-.++.--..-.+++... .+ . T Consensus 196 ~~~---------dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~n 266 (441) T protein:vir:94 196 YSL---------DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQ 266 (441) T ss_pred CCC---------CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccc Confidence 000 01147777776666666555554555555566666766654211111112111 11 1 Q ss_pred hCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc-CccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 295 NDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-AASGQALKAATQPLENKSAVKESKFRKV 373 (511) Q Consensus 295 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 373 (511) .++++.++++.+.+.++.+.....+.+..+...+.|+..-++|....+... +.|.+... ..|... T Consensus 267 ag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~--------------~~~~~t 332 (441) T protein:vir:94 267 AGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDAN--------------LDYLST 332 (441) T ss_pred cCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHH--------------HHHHHH Confidence 134667776666665555555556667777888889998898865554321 12211111 112223 Q ss_pred HHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHH Q lcl|NC_018086. 374 LAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWI--TDARQEVEKADA 449 (511) Q Consensus 374 l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v--~d~~~E~~ri~~ 449 (511) |.-+++.|...+...-. .......+++.+..-+-.|.++.++.+.++ .|+++...++..++.- ++.+..+-.+. T Consensus 333 l~P~~~~ie~eln~kl~-~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~~- 410 (441) T protein:vir:94 333 LKPYITCVCAELNFKFN-DEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVD- 410 (441) T ss_pred HHHHHHHHHHHHhhhcc-ccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeec- Confidence 33333333333322111 111223455555555667888899988887 5789988888877642 22221110000 Q ss_pred HHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCcccccc Q lcl|NC_018086. 450 QRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQE 501 (511) Q Consensus 450 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (511) .+. .+....++...++ ..+...+ -.|++..+ T Consensus 411 ------------~n~-~~~~~~~~~~~~~---~~~~~~~-----~kgGe~~e 441 (441) T protein:vir:94 411 ------------LNH-VNIELVDEYQMNK---SRATDKK-----LKGGEENE 441 (441) T ss_pred ------------ccc-ccccccccccccc---ccccccc-----cCCCCCCC Confidence 000 0000000000000 0000000 00000000 No 203 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=97.26 E-value=0.00011 Score=42.20 Aligned_cols=441 Identities=12% Similarity=0.006 Sum_probs=160.5 Q ss_pred CC--Ccc-c--hhhcccccCchhhHhhhhc-----cCC-CHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCcc Q lcl|NC_018086. 1 MA--IPN-G--QINAGDIITTNIRRKHFIR-----RNF-DLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDT 69 (511) Q Consensus 1 ~~--~~~-~--~~~~~~~~~~~~~~~~~~~-----~~~-~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~ 69 (511) || .++ . ....+.+.. +.....+.. ..+ +.-.+..+++.....+..+.....+..+.-....+. +... T Consensus 54 ~a~~~p~~~~~~~~~~~~~~-p~~~~~~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~l-k~~~ 131 (576) T protein:vir:96 54 QAYAEPFLEVMDTNPEFRTK-RSYMKNSDNLHDVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRM-RDLD 131 (576) T ss_pred chhhcceeeeeecCCCcccc-CcchhhhhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEE-ecCc Confidence 22 000 0 111111111 111110000 000 001122333333333333322222222111000000 0000 Q ss_pred ccccceeccchHHHHHHHHHhhhhccCceecCchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC--CCce Q lcl|NC_018086. 70 NKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR--NKKH 147 (511) Q Consensus 70 ~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~--~g~~ 147 (511) ..+ ......-+.....++.--.+.... + ...+..+...+..+.+.+|.+|+.+..+. .|++ T Consensus 132 ~~~-----~~~~~~~~~~l~~~l~~~~~~~~p-----------~-~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~g~~ 194 (576) T protein:vir:96 132 AEP-----GKKEKEEIKRIENFILNTGRDKDI-----------D-RDSFQSFCRKIVRDTYTYDQVNFEKVFNKKNATTM 194 (576) T ss_pred Ccc-----chhhhHhhhhHHhhHhhccCCCCC-----------c-cccHHHHHHHHHHHHHhcCCeEEEEEEecCCCCce Confidence 000 000000011111111100000000 0 01233455677888999999998876544 3444 Q ss_pred -EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccccccccccccccee Q lcl|NC_018086. 148 -RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHP 226 (511) Q Consensus 148 -~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (511) .+..++|..+.++.+.+.. .......|... .++.. ...+.++.++++... T Consensus 195 ~~L~pl~p~~V~v~~~~dg~--~~~~~~~~~~~--~~~~~---~~~~~~~dii~~~~~---------------------- 245 (576) T protein:vir:96 195 DKFIAVDPSTIFYATDKNGK--IIKGGKRFVQV--INKKV---VASFTSREMAMGIRN---------------------- 245 (576) T ss_pred EEEEEeCCceeEEEECCCCc--eeeeeeEEEEe--cCCce---EEEecccceEEEeec---------------------- Confidence 5777899998887765431 11111111111 11111 112233333332210 Q ss_pred ccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEee--cC-CCCcc--chhhhhhh------- Q lcl|NC_018086. 227 NLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQ--GF-DLSAD--SDSISNMK------- 294 (511) Q Consensus 227 ~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~--G~-~~~~~--~~~~~~~~------- 294 (511) |..-......|.|.++.+...+.....+..-..+.+...+.|-.++. |. ..+++ +.....+. T Consensus 246 ------~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~ 319 (576) T protein:vir:96 246 ------PRTELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGIN 319 (576) T ss_pred ------CCCCcccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccc Confidence 00000012257777777766666666555555555666666665543 42 22221 11111111 Q ss_pred -hCc-eeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc--CccH----HHHHHHHHHHHHHHHHH Q lcl|NC_018086. 295 -NDR-VIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT--AASG----QALKAATQPLENKSAVK 366 (511) Q Consensus 295 -~~~-~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~--~~Sg----~Ai~~~~~~l~~k~~~~ 366 (511) .++ .+.++++.+.+.++.......+.+..+...+.|+..-++|....+... ..+| .++.+. .. -+.. T Consensus 320 nag~~p~vl~~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~s--n~---e~~~ 394 (576) T protein:vir:96 320 GSWQVPVVMADDIKFVNMTPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEA--DP---GKKQ 394 (576) T ss_pred ccccceeecCCCceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHccccccccccccccccccccc--cH---HHHH Confidence 123 355666655555555555667777888888999999999865544221 1111 111100 00 0111 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH-hccCChHHHHHhCCCCCCHHHHHH Q lcl|NC_018086. 367 ESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL-RDMLPDETIINQFPWITDARQEVE 445 (511) Q Consensus 367 ~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~-~g~~s~et~~~~l~~v~d~~~E~~ 445 (511) ...+..+|.-+++.|...+...-.. .+ ...+.+.|.+.-+.+.++..+..... .|+++.-.++..++.-+-+... T Consensus 395 ~~f~~~tL~P~~~~ie~~ln~~Ll~-~~-~~~~~~~f~r~d~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~piegGD-- 470 (576) T protein:vir:96 395 QQSQNKGLQPLLRFIEDLINTHIIS-EY-SDKYVFQFVGGDTKSELDKIKILQEEVKTYKTVNEARKEKGLKPIEGGD-- 470 (576) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhch-hc-cCceEEEeccCCHHHHHHHHHHHHHHhcCccCHHHHHHHhCCCCCCCcc-- Confidence 1233333443333333333211101 11 12456778766555555554443322 4889988888887653211000 Q ss_pred HHHHHHHHHHHHHHhhccccccCCCCC----CccccccC--CCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 446 KADAQRQKRADIALQNFKQTSAVQGAS----TAAANKLD--KNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 446 ri~~E~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) ..-. ..............+.. .+..+... .+++...+.+ .....++..+..++|+.++| T Consensus 471 ~~~~------~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~-~~s~~~~~~g~~~~~~~~~~ 535 (576) T protein:vir:96 471 VLLD------GSFIQSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDDEEPQ-QESTEDKVDGRESNDPTKID 535 (576) T ss_pred eecc------ccccccccccccCCCCCCccccccccccccccCCCCCCCCC-CCCCCCcccccccccCCCCC Confidence 0000 00000000000000000 00000000 0000000000 01111222223333333333 No 204 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=97.20 E-value=0.00013 Score=41.86 Aligned_cols=383 Identities=11% Similarity=0.009 Sum_probs=162.3 Q ss_pred HHHHHHHHHHHH------HHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCceec---Cch-- Q lcl|NC_018086. 35 TLAEMHSRSSSA------YGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITES---GDE-- 103 (511) Q Consensus 35 ~~~~~~~~~~~~------~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~---~d~-- 103 (511) .+..+...+... ..-+....-+.... ........ .=+...-....|+..++-+-.-|+.+- .+. T Consensus 1 ~~~~r~~~~~~~~~~~~~~~~~~~~~g~~~s~--~~~~vt~~---~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~ 75 (419) T protein:vir:14 1 MFFSRQLLSNLGQTQMSAGGWVSALLGSSRSD--SGQVVTPA---SALALTVLQNCVTLLAESIAQLPIELYERSGEDRK 75 (419) T ss_pred CcccccccccccccccCcchhhHHhhcCCCcc--CCcccchH---HhhccHHHHHHHHHHHHhhccCceEEEEecCCccc Confidence 011110000000 00000111011000 00000000 001122344456666666555676541 111 Q ss_pred h-hHHHHHHHHh--cc---ChhHHHHHHHHHHhhCCeEEEEeeeCCCCceE-EEEEcccceEEEecCCCCCceEEEEEEE Q lcl|NC_018086. 104 K-TIKAMQPVFK--EN---YVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHR-FKAVSPMNCLIAYSADLDEEPVAAIYYN 176 (511) Q Consensus 104 ~-~~~~l~~~~~--~n---~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~-i~~~~p~~~~~v~d~~~~~~~~~~v~~~ 176 (511) . ....+..+|. -| .-......+....+.+|.+|+++..+.+|++. +..++|..+.+..+... .+ +| T Consensus 76 ~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~~~~~--~~-----~y 148 (419) T protein:vir:14 76 PATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMRGSDL--KP-----VY 148 (419) T ss_pred cccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCc--eE-----EE Confidence 1 1123444443 23 23345566788889999999999888888864 77788888766554321 01 11 Q ss_pred EEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHH Q lcl|NC_018086. 177 TVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDA 256 (511) Q Consensus 177 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~ 256 (511) .+.. ... +..+.++++. ..+ .+.-.|.|.+..+...++. T Consensus 149 ~~~~---~~~------~~~~~i~h~~----------------------------~~~----~dg~~G~s~i~~~~~~i~~ 187 (419) T protein:vir:14 149 RVRG---SDP------MPQRLVHHVR----------------------------WMS----INGYTGLSPVLLHANAIGH 187 (419) T ss_pred EEcc---Ccc------cchhheeEec----------------------------CcC----CCCcccccHHHHHHHHHHH Confidence 1100 000 0111111110 000 0122577777776666666 Q ss_pred HHHHHHHHHHHHHHhcCceeEeecCC---CCccchhhhhhh------------hCceeeecCCCceeeeecCCCHHHHHH Q lcl|NC_018086. 257 YNLAVSDSVNDIAYWNDAYLWLQGFD---LSADSDSISNMK------------NDRVIVTDEDGMVKFITKDVNDKHIEN 321 (511) Q Consensus 257 ~~~~~s~~~~~~~~~~~p~l~~~G~~---~~~~~~~~~~~~------------~~~~i~~~~~~~~~~~~~~~~~~~~~~ 321 (511) ......-..+.+...+.|-.+++-.. ...+++....++ .++++.++++.++..+........+.+ T Consensus 188 ~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e 267 (419) T protein:vir:14 188 AQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGMTFRPLSMTNVDAALID 267 (419) T ss_pred HHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhhHHHHH Confidence 55555455555566666766655321 111222222221 134666766656555544444445556 Q ss_pred HHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-Ccccccccee Q lcl|NC_018086. 322 IKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNK-AKDLKPYEVT 400 (511) Q Consensus 322 ~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~-~~~~~~~~i~ 400 (511) ..+...+.|+..-++|..-.+.....+...++... ...+..+|.-.++.|...+...-- ........++ T Consensus 268 ~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~----------~~f~~~~L~P~~~~ie~~l~~kll~~~~~~~~~i~ 337 (419) T protein:vir:14 268 ALRLSALDIARIYKIPAHMVNELERATFSNIEHQS----------LQFVIYTLLPWVKRHEQAKTRDLLLPSERKQYFIE 337 (419) T ss_pred HHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHH----------HHHHHHHHHHHHHHHHHHHhhhccCccccCCeEEE Confidence 66777888999989986555433222222222211 123333444444433333332111 1111222344 Q ss_pred EEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccc Q lcl|NC_018086. 401 PVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANK 478 (511) Q Consensus 401 i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (511) +.+..-+..|..+.++++.++ .|+++.-.++++++.-+-+.. +. ...+.+.. + .+...+. T Consensus 338 fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gG--D~-----------~~~~~n~~----~-~~~~~~~ 399 (419) T protein:vir:14 338 YNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGG--DI-----------YLSPMNMV----D-ASKPQQL 399 (419) T ss_pred EechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--Ce-----------eeeccccc----c-ccccccc Confidence 445565667889999998887 478888888877764221100 00 00000000 0 0000000 Q ss_pred cCCCCCCccccccCCCCccccc Q lcl|NC_018086. 479 LDKNPANTSTITTTDPVAAKEQ 500 (511) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~ 500 (511) +...+.++.....+....- + T Consensus 400 ~~~~~~~~~~~~~e~~~~l--~ 419 (419) T protein:vir:14 400 PVGKSEPTKAAIDEIGRIL--S 419 (419) T ss_pred cCCCCCCccccccchhccc--C Confidence 0110000000000000000 0 No 205 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=97.20 E-value=0.00013 Score=41.83 Aligned_cols=382 Identities=8% Similarity=0.022 Sum_probs=172.0 Q ss_pred hhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccC-C----cCcc-c---c-ccce--eccchHHHHHHHHHh Q lcl|NC_018086. 23 FIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSR-T----FDDT-N---K-PNSK--IVHNFPKLLVDTSTA 90 (511) Q Consensus 23 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~-~----~~~~-~---~-~~~r--i~~n~~k~ivd~~~~ 90 (511) +.+-..+ .....+..-+.+++.++.|........ . .... . . .+.+ +...-....|+..++ T Consensus 1 ~~~~~~~--------~~~~~~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~ 72 (424) T protein:vir:18 1 MEEPKYT--------IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLIST 72 (424) T ss_pred CCCCccc--------cccCCCCchHHHHHhhccccccccccchhhccccccccccccccccHHHhhccHHHHHHHHHHHH Confidence 1111111 111223334445555665542211000 0 0000 0 0 0001 111223345555555 Q ss_pred hhhccCcee---cCch---h--hHHHHHHHHhc--cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccc Q lcl|NC_018086. 91 YLAGEPITE---SGDE---K--TIKAMQPVFKE--NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMN 156 (511) Q Consensus 91 ~l~g~~~~~---~~d~---~--~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~ 156 (511) -+-+-|+.+ ..+. + ....+.+++.. |. -......+..+.+.+|.||+++..+..|++ .+..++|.. T Consensus 73 ~iA~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~ 152 (424) T protein:vir:18 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSAN 152 (424) T ss_pred hhccCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcc Confidence 555567654 1111 1 12234455432 32 234566778889999999999988888875 466788888 Q ss_pred eEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEe Q lcl|NC_018086. 157 CLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLE 236 (511) Q Consensus 157 ~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 236 (511) +.+..+... .. |.... +|.. ..+.++.+++++.-. T Consensus 153 v~v~~~~~~---~~-----y~~~~--~g~~----~~~~~~eVihir~~~------------------------------- 187 (424) T protein:vir:18 153 MDVKLVGKK---VV-----YRYQR--DSEY----ADFSQKEIFHLKGFG------------------------------- 187 (424) T ss_pred eEEEEcCCe---EE-----EEEEe--CCeE----EEeccccEEEecCcC------------------------------- Confidence 766543211 11 11111 1211 123444444432100 Q ss_pred ecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhh-----------hhCceeeecCCC Q lcl|NC_018086. 237 IIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNM-----------KNDRVIVTDEDG 305 (511) Q Consensus 237 ~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~-----------~~~~~i~~~~~~ 305 (511) .+...|.|.+..+...+...........+.+...+.|-.+++-.+....++....+ ..++++.++++. T Consensus 188 -~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~nag~~~vl~~g~ 266 (424) T protein:vir:18 188 -FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGF 266 (424) T ss_pred -CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCHHHHHHHHHHHHHHhCCcccCCceeccCCc Confidence 01124666666555555544444444444555556665555532211112222111 123466676666 Q ss_pred ceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 306 MVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT--AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCS 383 (511) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~--~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 383 (511) +.+.+........+.+..+...+.|+..-++|..-.+... +.++.+++..... .+..+|.-+++.|.. T Consensus 267 ~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~----------f~~~tl~P~~~~ie~ 336 (424) T protein:vir:18 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG----------FLQYTLQPYISRWEN 336 (424) T ss_pred eEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHH----------HHHHHHHHHHHHHHH Confidence 6655554545556667778888899999999875554332 2223333332222 223334334333333 Q ss_pred HHHhcC-CCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 384 YLEFMN-KAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQ 460 (511) Q Consensus 384 ~~~~~~-~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~ 460 (511) .+...- .........+++.+..-+..|.++.++.+.++ .|+++.-.++++++.-+-+..+ .. .- T Consensus 337 ~ln~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD--~~-----------~~ 403 (424) T protein:vir:18 337 SIQRWLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPLPGGD--VA-----------MR 403 (424) T ss_pred HHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC--ee-----------ee Confidence 333211 11111223355666777778899999998887 4788888888887643211000 00 00 Q ss_pred hccccccCCCCCCccccccCCCCC Q lcl|NC_018086. 461 NFKQTSAVQGASTAAANKLDKNPA 484 (511) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~ 484 (511) ..+. ... ...+. ..++.++++ T Consensus 404 ~~n~-~~l-~~~~~-~~~~~~n~a 424 (424) T protein:vir:18 404 QAQY-VPI-TDLGT-NKEPRNNGA 424 (424) T ss_pred ccCc-cch-hhhhc-cCCccccCC Confidence 0000 000 00000 001111111 No 206 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=97.14 E-value=0.00016 Score=41.44 Aligned_cols=400 Identities=13% Similarity=0.000 Sum_probs=164.4 Q ss_pred ccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-ccccC------------CcCccccccc-e- Q lcl|NC_018086. 11 GDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHI-AIQSR------------TFDDTNKPNS-K- 75 (511) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~-~~~~~------------~~~~~~~~~~-r- 75 (511) =.-.++.-.+.++....-... .+....-+...+.. ..... .......... . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 66 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQSRK--------------ELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIE 66 (441) T ss_pred CceecCccceeccccccchhh--------------hhhccccccccccccccCCCcchHHHHHHhhcccccCccccchhh Confidence 000111111111110001000 00000000000000 00000 0000000000 0 Q ss_pred -eccchHHHHHHHHHhhhhccCceecCchh--hHHHHHHHHh--ccC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce Q lcl|NC_018086. 76 -IVHNFPKLLVDTSTAYLAGEPITESGDEK--TIKAMQPVFK--ENY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH 147 (511) Q Consensus 76 -i~~n~~k~ivd~~~~~l~g~~~~~~~d~~--~~~~l~~~~~--~n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~ 147 (511) +...=.-..|+..++-+-.-|+.+..+.. ....+..++. -|. -......+..+.+.+|.||+.+..+..|++ T Consensus 67 al~~~~V~acv~~Ia~~iA~lpl~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~ 146 (441) T protein:vir:98 67 AIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEP 146 (441) T ss_pred hhccHHHHHHHHHHHHhhccCceEEecCCcccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcE Confidence 00111112345555555455666533221 1222344432 233 234566778888999999999988888876 Q ss_pred -EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccccccccccccccee Q lcl|NC_018086. 148 -RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHP 226 (511) Q Consensus 148 -~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (511) .+..++|..+.+..++.. .+.. +....+..+.... ..+.+..+++++.- T Consensus 147 ~~L~~i~~~~v~v~~~~~g--~~~~----~~~~~~~~~~~~~--~~~~~~dviHir~~---------------------- 196 (441) T protein:vir:98 147 MNLTFRKTSEIELKLDARG--RLYY----FHQRIDSNGNNIE--RNVKFEDMLDIKFY---------------------- 196 (441) T ss_pred EEEEEEcCceeEEEECCCC--cEEE----EEEEeccCcceee--EEEccccEEEeccC---------------------- Confidence 477889998888776432 1211 1111111111111 22444444444210 Q ss_pred ccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhh----hhhh-------- Q lcl|NC_018086. 227 NLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSI----SNMK-------- 294 (511) Q Consensus 227 ~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~----~~~~-------- 294 (511) ++ +.-.|.|.+..+...++.......-....++..+.|-.+++=-..-.+++.. ..+. T Consensus 197 -~~---------dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~G~~n 266 (441) T protein:vir:98 197 -SL---------DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQ 266 (441) T ss_pred -CC---------CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccc Confidence 00 0114667777666666655555555555556666666665421111111211 1111 Q ss_pred hCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc-CccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 295 NDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-AASGQALKAATQPLENKSAVKESKFRKV 373 (511) Q Consensus 295 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 373 (511) .++++.++++.+.+.++.+.....+....+...+.|+..-++|....+... +.|.+.... .|... T Consensus 267 ag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~--------------~y~~t 332 (441) T protein:vir:98 267 AGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANL--------------DYLST 332 (441) T ss_pred cCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHH--------------HHHHH Confidence 134667776666666655555556666677778889998898865554322 122121111 11123 Q ss_pred HHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHH Q lcl|NC_018086. 374 LAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWI--TDARQEVEKADA 449 (511) Q Consensus 374 l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v--~d~~~E~~ri~~ 449 (511) |.-++..|...+...-.. ......+++....-+-.|.++.++++.++ .|+++...++.+++.- ++.+..+-.+.. T Consensus 333 l~P~~~~ie~~ln~~L~~-~~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGd~~~~~~~~ 411 (441) T protein:vir:98 333 LKPYITCVCAELNFKFND-EYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDL 411 (441) T ss_pred HHHHHHHHHHHHHhhccc-cccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecc Confidence 333333222222221111 11222344444555667888899988887 4789988888877542 222211100000 Q ss_pred HHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCcccccc Q lcl|NC_018086. 450 QRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQE 501 (511) Q Consensus 450 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (511) ++ .......+- +.....+.... -.|++..+ T Consensus 412 -----------n~---~~~~~~~~~---q~~~~~~~~~~-----~kgGe~ne 441 (441) T protein:vir:98 412 -----------NH---VNIELVDEY---QMNKSRATDKK-----LKGGEENE 441 (441) T ss_pred -----------cc---ccccccccc---ccccccccccc-----cCCCCCCC Confidence 00 000000000 00000000000 00000000 No 207 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=97.13 E-value=0.00016 Score=41.42 Aligned_cols=405 Identities=9% Similarity=0.016 Sum_probs=173.1 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccch Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNF 80 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~ 80 (511) |+=.++++++.-.......+.......+.. .. ......+-|.... .........+ +...- T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~-~~~~~~~~g~~~~--~g~~v~~~~a---l~~~~ 60 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPRSSLFGWGGKTIRL--------------TD-GAFWSQFLGRESS--SGKKVTVDKA---MKLSA 60 (434) T ss_pred Cccchhhhhhhcccccchhhhccccccccc--------------Cc-hHHHHHHhcCCcc--CCceechhhh---hccHH Confidence 777777776654332211111111111100 00 0111112232110 0111100000 11112 Q ss_pred HHHHHHHHHhhhhccCcee---cCch---h-hHHHHHHHHhc--cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce- Q lcl|NC_018086. 81 PKLLVDTSTAYLAGEPITE---SGDE---K-TIKAMQPVFKE--NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH- 147 (511) Q Consensus 81 ~k~ivd~~~~~l~g~~~~~---~~d~---~-~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~- 147 (511) ....|+..++-+-.-|+.+ ..+. . ....+..++.. |. -......+..+.+.+|.+|+++..+ .|++ T Consensus 61 V~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-~G~~~ 139 (434) T protein:vir:43 61 VWACVRLISTSVAGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-AGRPA 139 (434) T ss_pred HHHHHHHHHHhhhhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEE Confidence 2334555555555556654 1111 1 12234555432 43 2356677788899999999888665 5765 Q ss_pred EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceec Q lcl|NC_018086. 148 RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPN 227 (511) Q Consensus 148 ~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (511) .+..++|..+.+.++... . ++|+. .. .++.. ..+.++.+.++..- T Consensus 140 ~L~~l~p~~v~~~~~~~g--~----~~y~~-~~-~~g~~----~~~~~~eVih~~~~----------------------- 184 (434) T protein:vir:43 140 ALDFLLPSRVDLECDENG--R----LKYFY-TT-KKGAR----REIERTNMLHIPAF----------------------- 184 (434) T ss_pred EEEEEcCcceEEEEcCCC--e----EEEEE-Ee-cCceE----EEEccccEEEecCc----------------------- Confidence 467788988877665431 1 11111 11 11211 12344444443210 Q ss_pred cCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhh----h-------hC Q lcl|NC_018086. 228 LLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNM----K-------ND 296 (511) Q Consensus 228 ~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~----~-------~~ 296 (511) | .+...|.|.+..+...+........-....+...+.|-.+++-.. ...++....+ . .+ T Consensus 185 -----~----~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~l~~e~~~~~r~~~~~~~g~~nag 254 (434) T protein:vir:43 185 -----T----LDGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDR-ILQPAQREEFREYVKSVSGAMNSG 254 (434) T ss_pred -----C----CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCC-CCCHHHHHHHHHHHHHhcCccccC Confidence 0 011246666665555554444333333344444555655554322 1112221111 1 13 Q ss_pred ceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccC--ccHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 297 RVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTA--ASGQALKAATQPLENKSAVKESKFRKVL 374 (511) Q Consensus 297 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~--~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l 374 (511) +++.++++.+.+.++.......+.+..+...+.|+..-++|..-.+.... .++..++.... ..+..+| T Consensus 255 ~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~----------~f~~~~L 324 (434) T protein:vir:43 255 RSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQML----------AFLTFSI 324 (434) T ss_pred CccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHH----------HHHHHHH Confidence 45666655555555444445566677788888999999998655443321 12333322211 2233344 Q ss_pred HHHHHHHHHHHHhcCCC-ccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHH Q lcl|NC_018086. 375 AKRYELVCSYLEFMNKA-KDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQR 451 (511) Q Consensus 375 ~~~~~li~~~~~~~~~~-~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~ 451 (511) .-++..|...++..-.. .......+++.+..-+..|..+.++.+.++ .|+++.-.++..++.-+-+. .+++.. T Consensus 325 ~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~~g--gD~~~~-- 400 (434) T protein:vir:43 325 SSITNQIQQCVNKRLLTAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPELPG--GDILTV-- 400 (434) T ss_pred HHHHHHHHHHHHhhcCChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--CCeEee-- Confidence 44444443333321111 111123355555666677889999998887 47898888888876532111 000000 Q ss_pred HHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCCC Q lcl|NC_018086. 452 QKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPK 509 (511) Q Consensus 452 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (511) +.+. .+. +..+++....+ ...++..+++.|++-. T Consensus 401 ---------~~n~----~~~-----~~~~~~~~~~~------~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 401 ---------QSNL----VPI-----DQLGQSNKSQA------VRAALMNWFSQPEPQE 434 (434) T ss_pred ---------ccCc----cch-----hhhhccCCCcc------hhhhhhccCCCCCCCC Confidence 0000 000 00000000000 0000000011111111 No 208 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=97.11 E-value=0.00017 Score=41.30 Aligned_cols=389 Identities=9% Similarity=0.011 Sum_probs=170.4 Q ss_pred CccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHH Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPK 82 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k 82 (511) |-++..+....-.. ... .-+..+.....+.... ......+. .=+...-.. T Consensus 1 m~~~~~~~~~~~~~------------s~~-------------~~w~~~~~~~~~~~~~--~g~~vt~~---~al~~~~v~ 50 (421) T protein:vir:10 1 MFIPQMFEGKKRSV------------SGG-------------GFWEAMLGGVRSSHSK--AGVMITPE---TALALSAVR 50 (421) T ss_pred CCCcchhccccccc------------Ccc-------------hhhHHHhhhhccCccc--CCceechH---HhhccHHHH Confidence 33333332222111 000 0011111111111100 00000000 001122334 Q ss_pred HHHHHHHhhhhccCceec---Cchh---h-HHHHHHHHh--ccC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EE Q lcl|NC_018086. 83 LLVDTSTAYLAGEPITES---GDEK---T-IKAMQPVFK--ENY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RF 149 (511) Q Consensus 83 ~ivd~~~~~l~g~~~~~~---~d~~---~-~~~l~~~~~--~n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i 149 (511) ..|+..++-+-.-|+.+- .+.. . ...+..+|. -|. .......+..+.+.+|.||+++..+.+|++ .+ T Consensus 51 ~~i~~Ia~~iA~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L 130 (421) T protein:vir:10 51 ACVTLLAESVAQLPVELYRRDKNGGRQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKEL 130 (421) T ss_pred HHHHHHHHhhccCceEEEEEcCCCceeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEE Confidence 456666666655676541 1111 1 112444443 232 334556778889999999999988888876 46 Q ss_pred EEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccC Q lcl|NC_018086. 150 KAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLL 229 (511) Q Consensus 150 ~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (511) ..++|..+.++.++.- ..+|.+. ..+. .+..+.++++..- T Consensus 131 ~~l~~~~v~v~~~~~g-------~~~y~~~--~~g~------~~~~~eiih~~~~------------------------- 170 (421) T protein:vir:10 131 IPINPKKVIVLKGPDG-------MPYYEIP--EIGE------TLPMRMMHHVKVF------------------------- 170 (421) T ss_pred EEecCceEEEEECCCc-------eEEEEEc--CCCc------EEchhhEEEecCc------------------------- Confidence 6778887766544321 1122211 1111 1222223222100 Q ss_pred CccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecC---CCCccchhhhhh------------h Q lcl|NC_018086. 230 QKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGF---DLSADSDSISNM------------K 294 (511) Q Consensus 230 g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~---~~~~~~~~~~~~------------~ 294 (511) + .+.-.|.|.++.+...++..........+.+...+.|-.+++-. .....++....+ . T Consensus 171 ---~----~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n 243 (421) T protein:vir:10 171 ---S----LDGYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKWTDRYSGINN 243 (421) T ss_pred ---C----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHHHHHhcCccc Confidence 0 01124777777666666655555444455556666676665521 111122222211 1 Q ss_pred hCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 295 NDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVL 374 (511) Q Consensus 295 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l 374 (511) .++++.++++.+.+.+........+.+..+...+.|+..-++|....+..+..+...++.. ....+..+| T Consensus 244 ~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~----------~~~f~~~tl 313 (421) T protein:vir:10 244 MFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNIEHQ----------GLQFVMYTL 313 (421) T ss_pred cCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCccccHHHH----------HHHHHHHHH Confidence 2346677766666555544455566667777888899888988655543332222222111 112233344 Q ss_pred HHHHHHHHHHHHhcCCC-ccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHH Q lcl|NC_018086. 375 AKRYELVCSYLEFMNKA-KDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQR 451 (511) Q Consensus 375 ~~~~~li~~~~~~~~~~-~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~ 451 (511) .-+++.|...++..-.. .......+++.+...+..|..+.++.+.++ .|+++.-.++..++.-+-+.. ++. T Consensus 314 ~P~~~~ie~~ln~kL~~~~~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gg--D~~---- 387 (421) T protein:vir:10 314 LAWLKRHEGALQRDLLLPSERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLPPIAGG--DKY---- 387 (421) T ss_pred HHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--cee---- Confidence 44444443333321111 111122344455555667889999988887 578998888888765221100 000 Q ss_pred HHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 452 QKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 452 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) +.+.+.... ....+.+. ...+.+.+..|+-.|| T Consensus 388 -------~~~~n~~~~-~~~~~~~~-------------------~~~~~~~~e~d~~~~~ 420 (421) T protein:vir:10 388 -------LTPLNMVDS-AQIIPGDK-------------------KPTAQQMAEIDTILSR 420 (421) T ss_pred -------eeccccccc-cccccCCC-------------------CcccccCccccccccc Confidence 000000000 00000000 0111112222222222 No 209 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=97.07 E-value=0.00018 Score=41.08 Aligned_cols=381 Identities=8% Similarity=0.024 Sum_probs=174.5 Q ss_pred hhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc-C-CcCc-------ccc-ccce--eccchHHHHHHHHHh Q lcl|NC_018086. 23 FIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQS-R-TFDD-------TNK-PNSK--IVHNFPKLLVDTSTA 90 (511) Q Consensus 23 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~-~-~~~~-------~~~-~~~r--i~~n~~k~ivd~~~~ 90 (511) +.+-..+. ....+..-+.+++.++.|....... . ...+ .+. ...+ +...-....|+..++ T Consensus 1 ~~~~~~~~--------~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~ 72 (424) T protein:vir:18 1 MEEPKYTI--------DLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLIST 72 (424) T ss_pred CCCCcceE--------eecCCCchHHHHHhhhcccccccccccccccccccccccccccccHHHhhccHHHHHHHHHHHH Confidence 11111110 1123334455555555553221100 0 0000 000 0000 111223345555555 Q ss_pred hhhccCcee---cCch---h--hHHHHHHHHhc--cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccc Q lcl|NC_018086. 91 YLAGEPITE---SGDE---K--TIKAMQPVFKE--NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMN 156 (511) Q Consensus 91 ~l~g~~~~~---~~d~---~--~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~ 156 (511) -+-+-|+.+ ..+. . ....+.+++.. |. .......+..+.+.+|.+|+++-.+.+|++ .+..++|.. T Consensus 73 ~iA~lp~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~pl~~~~ 152 (424) T protein:vir:18 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSAN 152 (424) T ss_pred hhccCceEEEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcc Confidence 555567654 1111 1 12234555432 32 334566778889999999999988888875 567788888 Q ss_pred eEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEe Q lcl|NC_018086. 157 CLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLE 236 (511) Q Consensus 157 ~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 236 (511) +.+..+.. .. . |.... ++.. ..+.++.+++++.-. T Consensus 153 V~v~~~~~---~~----~-y~~~~--~g~~----~~~~~~eIih~r~~~------------------------------- 187 (424) T protein:vir:18 153 MDVKLVGK---KV----V-YRYQR--DSEY----ADFSQKEIFHLKGFG------------------------------- 187 (424) T ss_pred eEEEEcCC---eE----E-EEEEe--CCeE----EEeccccEEEecCcC------------------------------- Confidence 77654321 11 1 11111 1211 124444454442100 Q ss_pred ecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhh-----------hhCceeeecCCC Q lcl|NC_018086. 237 IIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNM-----------KNDRVIVTDEDG 305 (511) Q Consensus 237 ~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~-----------~~~~~i~~~~~~ 305 (511) .+...|.|.+..+...++..........+.+...+.|-.++.-.+...+++....+ ..++++.++++. T Consensus 188 -~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~~g~nag~~~vl~~g~ 266 (424) T protein:vir:18 188 -FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGF 266 (424) T ss_pred -CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHHHHHHHHHHhCCcccCCceeccCCc Confidence 01124677777666666554444444455556666676666532221112222111 123466777666 Q ss_pred ceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 306 MVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT--AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCS 383 (511) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~--~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 383 (511) +++.+........+.+..+...+.|+..-++|..-.+... +..+..++.... ..+..+|.-.++.|.. T Consensus 267 ~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~----------~f~~~tl~P~~~~ie~ 336 (424) T protein:vir:18 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL----------GFLQYTLQPYISRWEN 336 (424) T ss_pred eEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHH----------HHHHHHHHHHHHHHHH Confidence 6666655555556666777888899999999875554332 222333332222 2223344444443333 Q ss_pred HHHhcC-CCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 384 YLEFMN-KAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQ 460 (511) Q Consensus 384 ~~~~~~-~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~ 460 (511) .+...- .........+++.+...+..|..+.++.+.++ .|+++.-.++..++.-+-+.. +.. .- T Consensus 337 ~l~~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gG--D~~-----------~~ 403 (424) T protein:vir:18 337 SIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG--DVA-----------MR 403 (424) T ss_pred HHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--Cee-----------ee Confidence 333211 11111223355566666778899999999888 478888778777754221000 000 00 Q ss_pred hccccccCCCCCCc-cccccCCCCC Q lcl|NC_018086. 461 NFKQTSAVQGASTA-AANKLDKNPA 484 (511) Q Consensus 461 ~~~~~~~~~~~~~~-~~~~~~~~~~ 484 (511) ..+ ..+.... ...++.++++ T Consensus 404 ~~n----~~~l~~~~~~~~p~~~ga 424 (424) T protein:vir:18 404 QSQ----YVPITDLGTNKEPRNNGA 424 (424) T ss_pred ccC----ccchHhhhccCCCccCCC Confidence 000 0000000 0001111111 No 210 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=96.96 E-value=0.00023 Score=40.49 Aligned_cols=410 Identities=10% Similarity=-0.008 Sum_probs=189.6 Q ss_pred CCCccchhhcccccCchhh--------------HhhhhccCCCHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCcccccCC Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIR--------------RKHFIRRNFDLRELITLAEMHS-RSSSAYGVLYDYYKGNHIAIQSRT 65 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~l~~~~~~~~-~~~~~~~~~~~yY~G~~~~~~~~~ 65 (511) |+.=+++- |.-+..... .....-..+++..+...+..-. ..+.++..| ||+- T Consensus 1 m~~~~d~~--g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~~~L--~~dm--------- 67 (512) T protein:vir:19 1 MGRILDIS--GQPFDFDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQADL--AFDM--------- 67 (512) T ss_pred CcceeCCC--CCccccccccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHHHHHHH--HHHH--------- Confidence 55544432 222211111 1112223455555444332211 122222222 1210 Q ss_pred cCccccccceeccchHHHHHHHHHhhhhccCceecC--c-h----hhHHHHHHHHhcc-ChhHHHHHHHHHHhhCCeEE- Q lcl|NC_018086. 66 FDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESG--D-E----KTIKAMQPVFKEN-YVTDVNSEEVKLSGIFGHCF- 136 (511) Q Consensus 66 ~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~--d-~----~~~~~l~~~~~~n-~~~~~~~~~~~~a~~~G~~~- 136 (511) -....+..-.+.+....+.+.++++.. + + +..+.+.+.+..- +|...+.. ..+|.-||.++ T Consensus 68 ---------~~~D~hi~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~~~~-lldA~~~G~s~~ 137 (512) T protein:vir:19 68 ---------EEKDTHLFSELSKRRLAIQALEWRIAPARDASAQEKKDADMLNEYLHDAAWFEDALFD-AGDAILKGYSMQ 137 (512) T ss_pred ---------HhhChHHHHHHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcCCCHHHHHHH-HHhhhhhcceee Confidence 011345566677777777888887731 1 2 3345566666543 57877766 45788899765 Q ss_pred EEeeeCCCCceE---EEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccc Q lcl|NC_018086. 137 EIHWIDRNKKHR---FKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREI 213 (511) Q Consensus 137 ~~v~~~~~g~~~---i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 213 (511) +++|.-.+|... +...+|... .|++..... +++.. ....| .. T Consensus 138 Ei~w~~~~g~~~~~~~~~r~~~~f--~~~~~~~~~----lr~~~--~~~~G-----~~---------------------- 182 (512) T protein:vir:19 138 EIEWGWLGKMRVPVALHHRDPALF--CANPDNLNE----LRLRD--ASYHG-----LE---------------------- 182 (512) T ss_pred eeEeeeeCCceeeeeeeeeccccc--eeccCCCcE----EEecC--CCCCc-----ee---------------------- Confidence 556643344333 334444322 222221111 00000 00000 00 Q ss_pred ccccccccccceeccCCccceEee--cCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccc---- Q lcl|NC_018086. 214 PEELEIKDYEVHPNLLQKFPVLEI--IANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADS---- 287 (511) Q Consensus 214 ~~~~~~~~~~~~~~~~g~iPvv~~--~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~---- 287 (511) ..+++.|-.++- ..++.|.|.+..+.-..--=+..+.+++..++.|+.|+++.+=.....++ T Consensus 183 ------------l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~ 250 (512) T protein:vir:19 183 ------------LQPFGWFMHRAKSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYPTGSTNREKAT 250 (512) T ss_pred ------------ecCCceEEEeccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecCCCCCHHHHHH Confidence 012222222221 23567888888876666666778889999999999999987632221111 Q ss_pred --hhhhhhhhCceeeecCCCceeeeecC-CCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHH Q lcl|NC_018086. 288 --DSISNMKNDRVIVTDEDGMVKFITKD-VNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSA 364 (511) Q Consensus 288 --~~~~~~~~~~~i~~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~ 364 (511) ....++....+..+|.+.++++++.. .+...++.+++.+.+.|...--.-.++.+..++.|...=+....-....++ T Consensus 251 L~~al~~~~~~a~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~iLGqtlTs~~g~~Gs~a~~~vh~ev~~di~~ 330 (512) T protein:vir:19 251 LMQAVMDIGRRAGGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAILGGTLTTEAGDKGARSLGEVHDEVRREIRN 330 (512) T ss_pred HHHHHHHHhhCcEEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHHHHHHHHHH Confidence 12234456678888999999998854 344568888888888887654322233333211111111112222333344 Q ss_pred HHHHHHHHHHH-HHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHh-cc-CChHHHHHhCCCCCCHH Q lcl|NC_018086. 365 VKESKFRKVLA-KRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR-DM-LPDETIINQFPWITDAR 441 (511) Q Consensus 365 ~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~-g~-~s~et~~~~l~~v~d~~ 441 (511) .-.+.+...+. ++++-++. ...........-..+.|...-+.|....++.+.+++ |+ +|.+.+.+.++. +.++ T Consensus 331 aDa~~i~~tln~~li~~l~~---~N~~~~~~~~~~p~~~f~~~e~eDl~~~a~~~~~l~~G~~i~~~~i~e~~Gi-p~~~ 406 (512) T protein:vir:19 331 ADVGQLARSINRDLIYPLLA---LNSDSTIDINRLPGIVFDTSEAGDITALSDAIPKLAAGMRIPVSWIQEKLHI-PQPV 406 (512) T ss_pred HHHHHHHHHHHHHHHHHHHH---hCCCCCCCccccceEEecCCChhhHHHHHHHHHHHhcCCCCCHHHHHHHhCC-CCCC Confidence 44455555553 35544432 222211111223567888888999999998887764 65 788888888874 3222 Q ss_pred HHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 442 QEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 442 ~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) .+-.-+.. .+....... ....+..+.. .+.......-+ ....| T Consensus 407 ~~e~~~~~-------------------~~~~~~~~~--~~~~~~~~~~---~~~~~~~d~~~---~~~~~ 449 (512) T protein:vir:19 407 GDEAVFTI-------------------QPVVPDNGS--QKEAALSAED---IPQEDDIDRMG---VSPED 449 (512) T ss_pred CccccccC-------------------CCccccccc--cccccccccC---CCchhhHhHHh---hhHHH Confidence 11000000 000000000 0000000000 00000000000 00001 No 211 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=96.89 E-value=0.00027 Score=40.11 Aligned_cols=378 Identities=8% Similarity=-0.050 Sum_probs=164.2 Q ss_pred CccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHH Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPK 82 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k 82 (511) |+|+.-+....-+ .. ... ......++.+.... ....... ..=....... T Consensus 1 Mg~f~~~~~~~~~-----------~~-~~~--------------~~~~~~~~~~~~~~---~~~~~~~--~~~~~~~~v~ 49 (406) T protein:vir:95 1 MGLFDRWRRTKRK-----------SK-IRA--------------DTGYVGLFMSGEDV---SFLVPGY--VRLSDNPEVR 49 (406) T ss_pred Ccchhhhcccccc-----------cc-ccc--------------cchhhhhhccCccc---CccccCH--HHHhhcHHHH Confidence 5555322110000 00 000 00011112111110 0000000 0012245667 Q ss_pred HHHHHHHhhhhccCceec--Cch---hhHHHHHHHH-h-cc---ChhHHHHHHHHHHhhCCeEE--EEeeeCCCCce-EE Q lcl|NC_018086. 83 LLVDTSTAYLAGEPITES--GDE---KTIKAMQPVF-K-EN---YVTDVNSEEVKLSGIFGHCF--EIHWIDRNKKH-RF 149 (511) Q Consensus 83 ~ivd~~~~~l~g~~~~~~--~d~---~~~~~l~~~~-~-~n---~~~~~~~~~~~~a~~~G~~~--~~v~~~~~g~~-~i 149 (511) ..|+..++-+..-|+.+- .++ .....+...+ . -| ........+..+.+.+|.|+ +.+-.+..|++ .+ T Consensus 50 ~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l 129 (406) T protein:vir:95 50 MAVHKIADLISSMTIYLMQNTEDGDIRIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDEL 129 (406) T ss_pred HHHHHHHHhhccCceEEEEecCCcceeecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEE Confidence 778888877777777651 111 1111222222 2 12 33456667777888887664 44445666765 46 Q ss_pred EEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccC Q lcl|NC_018086. 150 KAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLL 229 (511) Q Consensus 150 ~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (511) ..++|..+.++.+... .++.. ++ ..+.+..++|++.... T Consensus 130 ~~i~~~~v~~~~~~~~-------~~~~~-----~~------~~~~~~evih~~~~~~----------------------- 168 (406) T protein:vir:95 130 VPLTPSKVNFLDTPDG-------YQVLY-----GG------QTFNYDEVLHFIYNPD----------------------- 168 (406) T ss_pred EEEcCceeEEEEcCCe-------EEEEe-----cc------EEEchhHEEEeeccCC----------------------- Confidence 6788887766554321 11100 11 1233333333321000 Q ss_pred CccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCC-C--ccchhhhhh----h----hCce Q lcl|NC_018086. 230 QKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDL-S--ADSDSISNM----K----NDRV 298 (511) Q Consensus 230 g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~-~--~~~~~~~~~----~----~~~~ 298 (511) |. +.-.|.|.+..+...++....+.......+...+.|-.++.-... + ..+.....+ . .++. T Consensus 169 ---~~----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~ 241 (406) T protein:vir:95 169 ---PE----RPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLQATEAGQP 241 (406) T ss_pred ---CC----CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhccccccCCc Confidence 00 001477777777777766666655555566666677666543221 1 111111111 1 1234 Q ss_pred eeecCCC-ceeeee-cCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 299 IVTDEDG-MVKFIT-KDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAK 376 (511) Q Consensus 299 i~~~~~~-~~~~~~-~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~ 376 (511) +.+..++ ....++ .+.....+....+...+.|+..-++|..-.+.. ++. +.. ...++..+|.- T Consensus 242 ~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~---~~~--~~~----------~~~~~~~~l~P 306 (406) T protein:vir:95 242 WIIPAELLEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLLGIG---EFN--RDE----------YNNFINSTILP 306 (406) T ss_pred eeecCCCccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCC---Cch--HHH----------HHHHHHHHHHH Confidence 4444433 232222 233344555667777888888888886444322 221 111 11244555655 Q ss_pred HHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_018086. 377 RYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKR 454 (511) Q Consensus 377 ~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~ 454 (511) +++.|...+...--.. ....+.+.++.-+..|..+.++.+.++ .|+++...++.+++.-..+. .++.... T Consensus 307 ~~~~ie~~l~~~l~~~--~~~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~~~--gd~~~~~---- 378 (406) T protein:vir:95 307 IAKGIEQELTRKLLIS--PDLYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPKEG--LSELVIL---- 378 (406) T ss_pred HHHHHHHHHHHhcCCC--CCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cceeeec---- Confidence 5555555444321111 112455666666677888899988877 47898888888887643211 1111000 Q ss_pred HHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 455 ADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 455 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) .+... ..........++++..+.. ++|+ T Consensus 379 -------~n~~~-~~~~~~~~~~k~g~~~~~~---------------------~~~~ 406 (406) T protein:vir:95 379 -------ENYIP-LDKIGDQSKLKGGDNSGAD---------------------GQTD 406 (406) T ss_pred -------cCccc-hhhcccccccCCCCCCCCC---------------------CCCC Confidence 00000 0000000000000011000 0011 No 212 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=96.83 E-value=0.00031 Score=39.83 Aligned_cols=387 Identities=10% Similarity=0.028 Sum_probs=163.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc---------cC-------CcCccccc---cceeccchHHHHHHHH Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQ---------SR-------TFDDTNKP---NSKIVHNFPKLLVDTS 88 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~---------~~-------~~~~~~~~---~~ri~~n~~k~ivd~~ 88 (511) +-++..+.++ .+++..+....+.-. .. .....+.. +.=+...-....|+.. T Consensus 1 ~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~I 70 (432) T protein:vir:10 1 MPDEKKLGLL----------GQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLV 70 (432) T ss_pred CCCCcccchh----------hhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHHHHHH Confidence 1111111111 122222221110000 00 00000000 0001123333455555 Q ss_pred HhhhhccCcee---cCch--h-hHHHHHHHHh--ccC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccc Q lcl|NC_018086. 89 TAYLAGEPITE---SGDE--K-TIKAMQPVFK--ENY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMN 156 (511) Q Consensus 89 ~~~l~g~~~~~---~~d~--~-~~~~l~~~~~--~n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~ 156 (511) ++-+-+-|+.+ ..+. + ....+..+|. -|. .......+..+.+.+|.||+.+..+ +|++ .+..++|.. T Consensus 71 a~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~~~~ 149 (432) T protein:vir:10 71 SQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDR 149 (432) T ss_pred HHhhhhCceeEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCc Confidence 55555567653 1111 1 1123444442 232 3345667788899999999888765 4664 467788988 Q ss_pred eEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEe Q lcl|NC_018086. 157 CLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLE 236 (511) Q Consensus 157 ~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 236 (511) +.++.+... ... |.... .+|.. ..+.++.+++++.- ++ T Consensus 150 v~v~~~~~g--~~~-----y~~~~-~~g~~----~~~~~~~iih~~~~-----------------------~~------- 187 (432) T protein:vir:10 150 LTITTDTKG--NTA-----YRYRR-TDGQM----IDIPKQQIWKIMGY-----------------------SL------- 187 (432) T ss_pred eEEEEcCCC--cEE-----EEEEe-cCceE----EEEcCccEEEecCC-----------------------CC------- Confidence 887765432 111 11111 12221 12334444433210 00 Q ss_pred ecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhh--------hhCceeeecCCCcee Q lcl|NC_018086. 237 IIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNM--------KNDRVIVTDEDGMVK 308 (511) Q Consensus 237 ~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~--------~~~~~i~~~~~~~~~ 308 (511) +.-.|.|.+..+...++.......-..+.+...+.|-.+++... ...++....+ ..++++.++++.+.+ T Consensus 188 --dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~l~~e~~~~~~~~~~~~~nag~~~vl~~g~~~~ 264 (432) T protein:vir:10 188 --DGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR-FLTDDQYDSFAKKVSGSVEAGRAPLLEGGMDVK 264 (432) T ss_pred --CCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC-CCCHHHHHHHHHHHhhhhhCCCceecCCCceEE Confidence 11236666665555555444433333444455556766665322 1122222222 234577777766666 Q ss_pred eeecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc---CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 309 FITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT---AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYL 385 (511) Q Consensus 309 ~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~---~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~ 385 (511) .++.......+.+..+.....|+..-++|....+... ...+..++.... ..+..+|.-.++.|...+ T Consensus 265 ~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~----------~f~~~tl~P~~~~ie~~l 334 (432) T protein:vir:10 265 SLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQL----------GFLSMTLSPWLRRIEQSI 334 (432) T ss_pred EccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHHH----------HHHHHHHHHHHHHHHHHH Confidence 6655555556666678888899999999876554332 122333332211 122223333333333322 Q ss_pred HhcC-CCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018086. 386 EFMN-KAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNF 462 (511) Q Consensus 386 ~~~~-~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~ 462 (511) ...- .........+++.+..-+..|..+.++.+.++ .|+++.-.++.+++.-.= +..-..+. T Consensus 335 n~kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi-~g~~~~~~-------------- 399 (432) T protein:vir:10 335 ALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKL-GGNAAVLT-------------- 399 (432) T ss_pred HhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC-CCCcceEe-------------- Confidence 2111 11111122344444555667888999988887 478998888888765220 00000000 Q ss_pred cccccCCCCCCccccccCCCCCCccccccCCCCcccccc Q lcl|NC_018086. 463 KQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQE 501 (511) Q Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (511) ......+. +..+.++++++.+...+++..+.++ T Consensus 400 -~~~~~~pl-----~~~~~~~~~~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 400 -VQSAMVPL-----DSIGLQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred -ecCcccch-----hhhcccCCCCCCCCCCCcccccccC Confidence 00000000 0000000000000000111111111 No 213 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=96.80 E-value=0.00033 Score=39.68 Aligned_cols=440 Identities=11% Similarity=0.061 Sum_probs=173.0 Q ss_pred CCCccc-----hhhcccccCchhhHhhhhccCCCHHH-------------HHHHHHHHHHHHHHHHHHHHHhcCCCcccc Q lcl|NC_018086. 1 MAIPNG-----QINAGDIITTNIRRKHFIRRNFDLRE-------------LITLAEMHSRSSSAYGVLYDYYKGNHIAIQ 62 (511) Q Consensus 1 ~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~-------------l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~ 62 (511) ||+.++ +.-..+....+ . ...+.+... +..-+.....-+.+|..+-.+++-+. T Consensus 1 ~~~~lfg~~i~~~~~~~~~~s~---~--~~~~~dg~~~~~~~~~~g~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~---- 71 (537) T protein:vir:10 1 MAQQLFGFSLQRAKKVPKGPSF---V--QKDSLDGSQPIVGGGYFGYSVDFDGTIRNDHELITRYREMVLNPECDS---- 71 (537) T ss_pred CccccccceeecccccccCCcc---c--CCCcccccceeecccccccccccccccchHHHHHHHHHHHhhccchhh---- Confidence 666554 32212111110 0 001111000 00011111222233333333333222 Q ss_pred cCCcCccccccceeccchHHHHHHHHHhh-hhccCceec---------CchhhHHHHHHHHhccChhHHHHHHHHHHhhC Q lcl|NC_018086. 63 SRTFDDTNKPNSKIVHNFPKLLVDTSTAY-LAGEPITES---------GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIF 132 (511) Q Consensus 63 ~~~~~~~~~~~~ri~~n~~k~ivd~~~~~-l~g~~~~~~---------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~ 132 (511) -...||+..+-+ ....|+.+. ..+...+.+..++.--+|+....+..|.+.+. T Consensus 72 -----------------Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVD 134 (537) T protein:vir:10 72 -----------------AVDDVVNETICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVD 134 (537) T ss_pred -----------------HHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheee Confidence 122233322211 122333332 12334556677777788999999999999999 Q ss_pred CeEEEEeeeCC----CCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcc Q lcl|NC_018086. 133 GHCFEIHWIDR----NKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDERE 208 (511) Q Consensus 133 G~~~~~v~~~~----~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~ 208 (511) |+.|.+...|. +|-..+..+||+.+-.|..-.... ...++.......... ......+|.+...+. . .. T Consensus 135 gRi~fhKiid~k~pk~GI~ELr~lDPr~i~~vR~i~~~~--~~~~~~~~~~~~v~~-~~~eyf~ynp~g~~~-~-~~--- 206 (537) T protein:vir:10 135 GRLFFHKVIDPKKPRQGLVELRYVDPRKIRKVTEYEAKR--PEALRTQDLNQQLTQ-QSASYFLYNPKGLKN-S-TN--- 206 (537) T ss_pred eEEEEEEEEeCCCccccceeeeeeCCccceeeEeecccC--CccceEEecceeeee-cccceeeeccccccc-c-CC--- Confidence 99998887754 466778899999886553211100 011111000000000 001122344432210 0 00 Q ss_pred cccccccccccccccceeccCCccc--eEeec-------CCcccCchhHHHHHHHHHHH--HHHHHHHHHHHHhcCceeE Q lcl|NC_018086. 209 VYREIPEELEIKDYEVHPNLLQKFP--VLEII-------ANEERLGDFEAQLSLIDAYN--LAVSDSVNDIAYWNDAYLW 277 (511) Q Consensus 209 ~~~~~~~~~~~~~~~~~~~~~g~iP--vv~~~-------n~~~g~s~~~~v~~l~d~~~--~~~s~~~~~~~~~~~p~l~ 277 (511) .+ =+|| .|.|. |.....|-+.. -+..+| +++-|....-+..+.|-+= T Consensus 207 --------~~-----------vkI~~dAI~y~hSGl~d~n~~~i~syLhk---AiKp~NQLkm~EDAlVIYRitRAPeRR 264 (537) T protein:vir:10 207 --------QG-----------MKIAPDSIAYCHSGIQDLNKNMVLSHLHK---AIKAVNQLRMIEDSLVIYRLSRAPERR 264 (537) T ss_pred --------Cc-----------eeccHhheeeecccceeCCCCeeeeeehh---hhHHHHhhHHHHhhHHHHhhhccccce Confidence 00 0122 11111 22233444444 334444 4556666666777777553 Q ss_pred eecCCCCccc-----hhhhhhh-h--Cce---------------------eeec--CCCc-eeeeecC--CCHHHHHHHH Q lcl|NC_018086. 278 LQGFDLSADS-----DSISNMK-N--DRV---------------------IVTD--EDGM-VKFITKD--VNDKHIENIK 323 (511) Q Consensus 278 ~~G~~~~~~~-----~~~~~~~-~--~~~---------------------i~~~--~~~~-~~~~~~~--~~~~~~~~~~ 323 (511) +.-.+....+ .-...+. . +++ +++| +++. .+.-|.+ .+...+. -+ T Consensus 265 vFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnlgem~-DV 343 (537) T protein:vir:10 265 IFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELE-DV 343 (537) T ss_pred EEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhhhhhcccccCCCcccceeeccccCCcChHH-HH Confidence 3322221111 1111110 0 011 1121 2222 2222333 2233222 24 Q ss_pred HHHHHHHHHHhCccccc--cc-cccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc--cc Q lcl|NC_018086. 324 NRAKLDIFSLSQTPDLV--SK-DFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKP--YE 398 (511) Q Consensus 324 ~~l~~~i~~~s~~p~~~--~~-~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~--~~ 398 (511) .-+.+-+|+--++|-.- .+ .|.-.-+..|-.-+......+.+.+..|..-+.++++.=+ +|...-...+++. .. T Consensus 344 ~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qL-ilKgiit~eeW~~i~~~ 422 (537) T protein:vir:10 344 KYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFVDLLKTQL-ILKGICSIEEWEEMKEH 422 (537) T ss_pred HHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhccCCCHHHHHHHhhc Confidence 45555556666777422 22 2211122344444445567778888888888888876422 2221112222322 34 Q ss_pred eeEEeCCCCCcCHHHH-------HHHHHHHh---c-cCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH--HHhhcccc Q lcl|NC_018086. 399 VTPVFVRNLPQSYAEL-------ADMAVKLR---D-MLPDETIINQFPWITDARQEVEKADAQRQKRADI--ALQNFKQT 465 (511) Q Consensus 399 i~i~f~~~~p~d~~e~-------a~~~~~~~---g-~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~--~~~~~~~~ 465 (511) +.+.|...-.-.+... ++++..+. | .+|.+++++.+-.-+| +|+..++++-+.+.+. ..++.... T Consensus 423 I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tD--eeI~~~~k~I~~E~k~~~~~~p~~~~ 500 (537) T protein:vir:10 423 IQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTE--SEIKEIDKEIKQEIADGVIMDPQAMQ 500 (537) T ss_pred ceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCH--HHHHHHHHHHHHHhhCCCCCCccccc Confidence 6777744333333322 33444442 3 4699999987543343 3333333322222221 11111111 Q ss_pred ccCCC-CCCccccccCCCCCCc-cccccCCCCccccccc Q lcl|NC_018086. 466 SAVQG-ASTAAANKLDKNPANT-STITTTDPVAAKEQEK 502 (511) Q Consensus 466 ~~~~~-~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 502 (511) .+..+ ++.... +.+..+++ .+..+.+|.+++.++= T Consensus 501 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:10 501 AMEMGIGDEEPV--PEGGEEPQTDPNSAVSPADQKRGEL 537 (537) T ss_pred ccccCCCCcccC--CCCCCCcccCCccCCCCCCccCCCC Confidence 11111 000000 11111111 1112233333332222 No 214 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=96.76 E-value=0.00035 Score=39.50 Aligned_cols=379 Identities=10% Similarity=-0.025 Sum_probs=161.9 Q ss_pred hhhhccCCCH--HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCc--------Cccccccce------eccchHHHH Q lcl|NC_018086. 21 KHFIRRNFDL--RELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTF--------DDTNKPNSK------IVHNFPKLL 84 (511) Q Consensus 21 ~~~~~~~~~~--~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~--------~~~~~~~~r------i~~n~~k~i 84 (511) -++|-+..-. +-... .+..++..+... ....+ ......... +...=.... T Consensus 1 ~~~~~~~~~~~~~~~~~-------------~~~~lf~~~~~~-~~~~~~~~~~~~~~~~~~~~~~vs~~~al~~~~v~~c 66 (424) T protein:vir:45 1 MLYCWWAHWLWPEGGRV-------------LLDALFRSKSLE-NPSTPITGDAVDTDGLFRADVYVSPETAMKLAAVYSC 66 (424) T ss_pred CeeEeeeceecCcchhH-------------HHHhhccccCCC-CCccccchhhhhhhccccCCceechHHhhccHHHHHH Confidence 2222221110 00111 111122111100 00000 000000000 111223335 Q ss_pred HHHHHhhhhccCceec--Cch---hh-HHHHHHHHh--ccC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEE Q lcl|NC_018086. 85 VDTSTAYLAGEPITES--GDE---KT-IKAMQPVFK--ENY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAV 152 (511) Q Consensus 85 vd~~~~~l~g~~~~~~--~d~---~~-~~~l~~~~~--~n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~ 152 (511) |+..++-+-+-|+.+- .+. .. ...+..++. -|. .......+..+.+.+|.+|+.+-.+..|++ .+..+ T Consensus 67 v~~Ia~~iA~lp~~v~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l 146 (424) T protein:vir:45 67 IYVLSSSLAQMPLHVMRRHKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCC 146 (424) T ss_pred HHHHHHHHhhCceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEe Confidence 5666666656677541 111 11 113444442 233 234556678889999999999988888886 47778 Q ss_pred cccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCcc Q lcl|NC_018086. 153 SPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKF 232 (511) Q Consensus 153 ~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~i 232 (511) +|..+.+..+.. +. .|.... ..+. ..+.++.+++++.-. T Consensus 147 ~~~~v~i~~~~~---~~-----~y~~~~-~~~~-----~~~~~~eVih~r~~~--------------------------- 185 (424) T protein:vir:45 147 MPWETTLMNTGG---RY-----TYGLYN-EYGA-----FAISPDDMIHIRALG--------------------------- 185 (424) T ss_pred cCceEEEEEcCC---eE-----EEEEEe-cCce-----EEECcccEEEecCcC--------------------------- Confidence 887775543221 11 111111 1110 123344444432100 Q ss_pred ceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhh----h----h-----hCcee Q lcl|NC_018086. 233 PVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISN----M----K-----NDRVI 299 (511) Q Consensus 233 Pvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~----~----~-----~~~~i 299 (511) .+...|.|.+..+...++.......-..+.+...+.|-.+++-... .+++.... + . .++++ T Consensus 186 -----~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~e~~~~~~~~~~~~~~g~~~n~g~~~ 259 (424) T protein:vir:45 186 -----NNQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSG-LNKESWGWLKDQWQKASQALRRQENKTM 259 (424) T ss_pred -----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC-CCHHHHHHHHHHHHHHhccccccCCcee Confidence 0112467777766655555444444444455666667666653221 11111111 1 1 13466 Q ss_pred eecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 300 VTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYE 379 (511) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~ 379 (511) .++++.+++.+........+.+..+...+.|+..-++|..-.+....++...++. .....+...|.-.++ T Consensus 260 vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq----------~~~~f~~~tL~P~~~ 329 (424) T protein:vir:45 260 LLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISA----------QAIQFVRYTMMPWVT 329 (424) T ss_pred EcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH----------HHHHHHHHHHHHHHH Confidence 6766666555544444445566677778889888898876554433222222111 111233334444444 Q ss_pred HHHHHHHhcCCC-ccc-cccceeEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_018086. 380 LVCSYLEFMNKA-KDL-KPYEVTPVFVRNLPQSYAELADMAVKLR--DMLPDETIINQFPWITDARQEVEKADAQRQKRA 455 (511) Q Consensus 380 li~~~~~~~~~~-~~~-~~~~i~i~f~~~~p~d~~e~a~~~~~~~--g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~ 455 (511) .|...++..--. ... ....+++.+..-+..|..+.++.+.++. |+++...++..++.-+-+. .+. T Consensus 330 ~ie~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi~g--gD~--------- 398 (424) T protein:vir:45 330 NWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPVEG--LDE--------- 398 (424) T ss_pred HHHHHHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cce--------- Confidence 444333321111 011 1123444555556678889999988874 7888877887776422000 000 Q ss_pred HHHHhhccccccCCCCCCccccccCCCCCCccccccCC Q lcl|NC_018086. 456 DIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTD 493 (511) Q Consensus 456 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (511) .....+.... . .+.++...+.+++.+ T Consensus 399 --~~~~~n~~~~---~-------~~~~~~~~~~~~~~~ 424 (424) T protein:vir:45 399 --MLVSVNAANP---A-------GDFKPPKNDEGKTNE 424 (424) T ss_pred --eeeccccccc---c-------cccCCCCCCCCCCCC Confidence 0000000000 0 000000000000000 No 215 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=96.62 E-value=0.00046 Score=38.87 Aligned_cols=387 Identities=10% Similarity=0.003 Sum_probs=163.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc----CCcC---------ccccccce------eccchHHHHHHHH Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQS----RTFD---------DTNKPNSK------IVHNFPKLLVDTS 88 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~----~~~~---------~~~~~~~r------i~~n~~k~ivd~~ 88 (511) +.....+.+..+ ....+......... .... ........ +...-....|+.. T Consensus 1 ~~~~~~mg~f~r----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~I 70 (432) T protein:vir:81 1 MPDEKKLGLFGQ----------LKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLV 70 (432) T ss_pred CCchhhcchhhh----------hhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHH Confidence 222222222222 22222211110000 0000 00000000 1112223345555 Q ss_pred HhhhhccCcee---cCch--h-hHHHHHHHHh--ccC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccc Q lcl|NC_018086. 89 TAYLAGEPITE---SGDE--K-TIKAMQPVFK--ENY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMN 156 (511) Q Consensus 89 ~~~l~g~~~~~---~~d~--~-~~~~l~~~~~--~n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~ 156 (511) ++-+-.-|+.+ ..+. + ....+..+|. -|. -......+..+.+.+|.||+.+..+ +|++ .+..++|.. T Consensus 71 a~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-~g~~~~L~~l~~~~ 149 (432) T protein:vir:81 71 SQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDR 149 (432) T ss_pred HHhhhhCceeeEEecCCcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCc Confidence 55555556653 1111 1 1122444443 233 2345667788899999999887765 4664 566788988 Q ss_pred eEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEe Q lcl|NC_018086. 157 CLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLE 236 (511) Q Consensus 157 ~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 236 (511) +.+.+++.. .+. |.... .+|..+ .+.++.+++++.- ++ T Consensus 150 v~v~~~~~g--~~~-----y~~~~-~~g~~~----~~~~~~iih~r~~-----------------------~~------- 187 (432) T protein:vir:81 150 LTITTDPKG--NTA-----YRYRR-TDGQMI----DIPKQQIWKIMGY-----------------------SL------- 187 (432) T ss_pred eEEEECCCC--cEE-----EEEEe-cCceEE----EEccccEEEecCC-----------------------CC------- Confidence 877776432 111 11111 122211 1233344433210 00 Q ss_pred ecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhh--------hhCceeeecCCCcee Q lcl|NC_018086. 237 IIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNM--------KNDRVIVTDEDGMVK 308 (511) Q Consensus 237 ~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~--------~~~~~i~~~~~~~~~ 308 (511) +.-.|.|.+......++.......-....+...+.|-.++.-. ....++....+ ..++++.++++.+.+ T Consensus 188 --dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~~~~~~~~~~~nag~~~vl~~g~~~~ 264 (432) T protein:vir:81 188 --DGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID-RFLTDDQYDSFAKKVSGSVEAGRAPLLEGGMDVK 264 (432) T ss_pred --CCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC-CCCCHHHHHHHHHHHhhhhcCCCceecCCCceEE Confidence 1113666666655555544444434444444445564444421 11122222222 224577787776666 Q ss_pred eeecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc---CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 309 FITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT---AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYL 385 (511) Q Consensus 309 ~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~---~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~ 385 (511) .+........+.+..+...+.|+..-++|....+... ..++..++.... ..+..+|.-.+..|...+ T Consensus 265 ~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~----------~f~~~tl~P~~~~ie~~l 334 (432) T protein:vir:81 265 SLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQL----------GFLTMTLSPWLRRIEQSI 334 (432) T ss_pred EccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHH----------HHHHHHHHHHHHHHHHHH Confidence 6655555556667778888899999999875554332 122333322211 122223333333333333 Q ss_pred HhcCC-CccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018086. 386 EFMNK-AKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNF 462 (511) Q Consensus 386 ~~~~~-~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~ 462 (511) ...-. ........+++.+..-+..|..+.++++.++ .|+++.-.++.+++.-.-+.. -..+. T Consensus 335 ~~kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~g~-~~~~~-------------- 399 (432) T protein:vir:81 335 ALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGN-AAVLT-------------- 399 (432) T ss_pred HhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC-cceEe-------------- Confidence 22111 1111222344444555677889999998887 478998888888765220000 00000 Q ss_pred cccccCCCCCCccccccCCCCCCccccccCCCCcccccc Q lcl|NC_018086. 463 KQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQE 501 (511) Q Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (511) ......+ .+.. +.++..++.+...+++..+.++ T Consensus 400 -~~~~~~p-l~~~----~~~~~~~~~~~~~n~~~~~~~~ 432 (432) T protein:vir:81 400 -VQSAMVP-LDSI----GLQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred -ecCcccc-hhhh----ccCCCCCCCCCCCCcccccccC Confidence 0000000 0000 0000000010011111111111 No 216 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=96.52 E-value=0.00055 Score=38.46 Aligned_cols=381 Identities=11% Similarity=0.037 Sum_probs=145.2 Q ss_pred HHHHHHHHHHHHHH-HHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCceec-Cc--hhhHHHH Q lcl|NC_018086. 34 ITLAEMHSRSSSAY-GVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITES-GD--EKTIKAM 109 (511) Q Consensus 34 ~~~~~~~~~~~~~~-~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~-~d--~~~~~~l 109 (511) +.+.+.......-+ .-+..+.-|.... ...... + .+. .-.-..|+..++-+-.-|+..- .+ ......+ T Consensus 1 m~~f~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~-A-l~~--~~V~~~i~~Ia~~iA~lp~~~~~~~g~~~~~~~~ 72 (406) T protein:vir:97 1 MSFFQPLGTSKVSYDDYISSVLAGDVSQ----KYLGVS-A-LKN--SDILTATSIIAGDIARFPLVKKDVNGDIIHDEDI 72 (406) T ss_pred CccccccCCCCCCcchHHHHHhcCCCCc----ccccch-h-hcc--HHHHHHHHHHHHhhhhCeeEEEecCccccccchH Confidence 11221111000000 1112222221110 000000 0 011 1111234444444434455432 11 1112234 Q ss_pred HHHHh--ccC---hhHHHHHHHHHHhhCCeEEEEeeeCC-CCce-EEEEEcccceEEEecCCCCCceEEEEEEEEEeecC Q lcl|NC_018086. 110 QPVFK--ENY---VTDVNSEEVKLSGIFGHCFEIHWIDR-NKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDI 182 (511) Q Consensus 110 ~~~~~--~n~---~~~~~~~~~~~a~~~G~~~~~v~~~~-~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~ 182 (511) ..+|. -|. .......+....+.+|.||+++..+. .|++ .+..++|..+.+..++.. . +. |.+.... T Consensus 73 ~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~~--~----~~-y~~~~~~ 145 (406) T protein:vir:97 73 NYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDNH--E----IV-YTFTDML 145 (406) T ss_pred HHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCCc--e----EE-EEEEecC Confidence 55553 233 33566678888899999999988764 4554 577788888876655422 1 11 1111111 Q ss_pred CcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHH Q lcl|NC_018086. 183 TGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVS 262 (511) Q Consensus 183 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s 262 (511) ++..+ .+.++.++|++.- ++ +.-.|.|.+..+...++....+.. T Consensus 146 ~~~~~----~~~~~evih~r~~-----------------------~~---------dg~~G~spi~~~~~~i~~~~a~~~ 189 (406) T protein:vir:97 146 TAKQV----KCFAHDVIHWKFF-----------------------SH---------DTILGRSPLLSLGDEIDLQTGGIN 189 (406) T ss_pred CceEE----EEccccEEEecCC-----------------------CC---------CCcccccHHHHHHHHHHHHHHHHH Confidence 11111 2333444443200 00 011366777665555554443333 Q ss_pred HHHHHHHHhcCceeE-eecCCCCccchhhhhh-----------hhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHH Q lcl|NC_018086. 263 DSVNDIAYWNDAYLW-LQGFDLSADSDSISNM-----------KNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDI 330 (511) Q Consensus 263 ~~~~~~~~~~~p~l~-~~G~~~~~~~~~~~~~-----------~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i 330 (511) -....++..+.|-.+ ..+...++ +....+ ..++++.++++.+...++.......+.+..+...+.| T Consensus 190 ~~~~~f~ng~~~~~i~~~~~~l~~--e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~I 267 (406) T protein:vir:97 190 TLIKFFKDGFSSGILTMKGAQLSG--DARQRARQEFEKMREGSVGGSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQI 267 (406) T ss_pred HHHHHHhccCCCceEEecCCCCCH--HHHHHHHHHHHHHhcccccCceeecCCCceEEEccCCHHHHHHHHHHHhhHHHH Confidence 333344444444333 33332222 222111 1134566666655555544444444445566667788 Q ss_pred HHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcC Q lcl|NC_018086. 331 FSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQS 410 (511) Q Consensus 331 ~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d 410 (511) +..-++|....+..+..| .+... ....+..+|.-+++.|...+...-... .....+.+.|.- ..+ T Consensus 268 a~afgVPp~~lg~~~~~~--~~e~~----------~~~f~~~~l~P~~~~ie~~l~~kll~~-~~~~~~~i~fd~--~~~ 332 (406) T protein:vir:97 268 AKALRVPSYKLGVNSPNQ--SVAQL----------MEDYVTNDLPFYFDAITSELGLKTLND-KDRRLYHIEFDT--RSV 332 (406) T ss_pred HHHhCCCHHHcCCCCCcc--hHHHH----------HHHHHHHHHHHHHHHHHHHHhhhhcCh-hhccceeEEEec--Ccc Confidence 888888866554322112 11111 112233334444443333332211111 111123345531 123 Q ss_pred HHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccc Q lcl|NC_018086. 411 YAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTST 488 (511) Q Consensus 411 ~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (511) ....++++.++ .|+++...++..++.-+.+....++..- ..+. .+. +..+++.+..+.... T Consensus 333 ~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD~~~~-----------~~n~----~~~--~~~~~~~~~~~~~~~ 395 (406) T protein:vir:97 333 TGRNVDEIVKLVNNQILTPNQGLVELGKQKSTDPNMDRYQS-----------SLNY----VFL--DKKEEYQDKVGIKGK 395 (406) T ss_pred chhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeEee-----------ccCc----cch--hcccccccccccccC Confidence 34445555555 4788888888887653211100000000 0000 000 000000000000000 Q ss_pred cccCCCCccccccc Q lcl|NC_018086. 489 ITTTDPVAAKEQEK 502 (511) Q Consensus 489 ~~~~~~~~~~~~~~ 502 (511) +. ++..+..+| T Consensus 396 gg---~~~~~~~~~ 406 (406) T protein:vir:97 396 GG---EVNAEEDKS 406 (406) T ss_pred CC---CCCCCCCCC Confidence 00 000011111 No 217 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=96.42 E-value=0.00064 Score=38.10 Aligned_cols=362 Identities=9% Similarity=-0.005 Sum_probs=149.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCc-cccccce--eccchHHHHHHHHHhhhhccCceecCchhhHHHHH Q lcl|NC_018086. 34 ITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDD-TNKPNSK--IVHNFPKLLVDTSTAYLAGEPITESGDEKTIKAMQ 110 (511) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~-~~~~~~r--i~~n~~k~ivd~~~~~l~g~~~~~~~d~~~~~~l~ 110 (511) +.+++....... ..-..++.............. ....+.+ +...-....|+..++-+-.-|+++.. ... . T Consensus 1 Mg~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~v~~-~~~----~ 73 (385) T protein:vir:10 1 MGLLTPRNFNKR--KAKNMVYPSNPAFFTTTVGGMQLSYVSALSALQNTNVYSVINRIASDVASAHFKTEN-TAT----L 73 (385) T ss_pred Cccccchhcccc--cccccccccchhhhhhhccccCccccCHHHhhccHHHHHHHHHHHHHHhhCceeeec-cch----h Confidence 222211100000 000001111000000000000 0000001 11223344566666666566777532 111 2 Q ss_pred HHHhc-cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcce Q lcl|NC_018086. 111 PVFKE-NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQ 186 (511) Q Consensus 111 ~~~~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~ 186 (511) .++.+ |. .......+..+.+.+|.||+++..+. ..+..++|..+.+..+.. . +.+ .......+.. T Consensus 74 ~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~---~~~~p~~~~~v~~~~~~~--~-----~~~-~~~~~~~~~~ 142 (385) T protein:vir:10 74 NRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---LEHIPNSDVQINYLPGNM--G-----IVY-TVLESNDRPQ 142 (385) T ss_pred hhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---eeEeecCCceEEEEEcCC--c-----eEE-EEEEcCCceE Confidence 22222 22 33455667778889999999886542 233333444333322211 0 111 1111111111 Q ss_pred EEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 187 IRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVN 266 (511) Q Consensus 187 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~ 266 (511) ..+.++.+++++.-. .|. .+...|.|.+..+...++.......-..+ T Consensus 143 ----~~~~~~eiihik~~~--------------------------~~~---~~~~~G~s~i~~~~~~i~~~~~~~~~~~~ 189 (385) T protein:vir:10 143 ----MVLRQDQMLHFRLMP--------------------------DPQ---YRYLIGRSPLESLQNALNLDDKASKSNMS 189 (385) T ss_pred ----EEEccccEEEeccCC--------------------------CCc---ccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 123344444432100 000 01124778777777777666665555555 Q ss_pred HHHHhcCceeEeecCCCCccchhhhhh-----------hhCceeeecCCCceeeeecCCCHHH-HHHHHHHHHHHHHHHh Q lcl|NC_018086. 267 DIAYWNDAYLWLQGFDLSADSDSISNM-----------KNDRVIVTDEDGMVKFITKDVNDKH-IENIKNRAKLDIFSLS 334 (511) Q Consensus 267 ~~~~~~~p~l~~~G~~~~~~~~~~~~~-----------~~~~~i~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~~i~~~s 334 (511) .+...+.|-.+++-.....+++....+ ..++++.++++.+++.+........ +.+..+...+.|+..- T Consensus 190 ~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~f 269 (385) T protein:vir:10 190 AMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAF 269 (385) T ss_pred HHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHh Confidence 666666676665532211112211111 1234566766656555544333333 3456677788888888 Q ss_pred Ccccccccc--ccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHH Q lcl|NC_018086. 335 QTPDLVSKD--FTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYA 412 (511) Q Consensus 335 ~~p~~~~~~--~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~ 412 (511) ++|..-.+. .++.++..++.... .|..+|.-.++.|...+...-. ...+++.+..-+..|.. T Consensus 270 gVp~~~lg~~~~~~~~~sn~eq~~~-----------~~~~~l~P~~~~ie~~l~~~l~-----~~~~~f~~~~ll~~d~~ 333 (385) T protein:vir:10 270 GVPSDILGGGTSTESQHSNIDQIKA-----------TYLANLNSYVNPIVDELRLKMN-----APDLELDIKDMLDVDDS 333 (385) T ss_pred CCCHHHcCCccCCCcccccHHHHHH-----------HHHHHHHHHHHHHHHHHHHhhC-----CceEEeechhhhccCHH Confidence 888655432 23333333322111 1111233333333222221110 11356666777778999 Q ss_pred HHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccc Q lcl|NC_018086. 413 ELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAA 476 (511) Q Consensus 413 e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 476 (511) +.++++.++ .|+++.-.++..++...=+...+.... ...+ ...++++.+. T Consensus 334 ~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~~~~~~~-----------~~~~---~~~~g~~~dn 385 (385) T protein:vir:10 334 ALINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFK-----------PLTT---QVKGGDEGDN 385 (385) T ss_pred HHHHHHHHHHhCCCcCHHHHHHHhCCCccCCCCCcccc-----------Cccc---ccCCCCCCCC Confidence 999999887 478888777776643210000000000 0000 0000000000 No 218 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=96.39 E-value=0.00067 Score=37.98 Aligned_cols=364 Identities=11% Similarity=0.021 Sum_probs=163.0 Q ss_pred CccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc-------------CCcCcc Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQS-------------RTFDDT 69 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~-------------~~~~~~ 69 (511) |+- +...++ .+.-.++..+...... ....+. T Consensus 1 ~~~-------------------------------~~~~~~-----~~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 44 (413) T protein:vir:96 1 MPG-------------------------------VSEIRK-----DKNLKFFNNKRSPTEESKAKDEIPKAPQVVMTLPN 44 (413) T ss_pred CCc-------------------------------cchhhh-----hhcCCccccCCCcchhhhhhccccccccccccchh Confidence 111 100000 0000111111000000 000000 Q ss_pred c----c--ccce-eccchHHHHHHHHHhhhhccCceec---Cc--hhhHHHHHHHHh--ccC---hhHHHHHHHHHHhhC Q lcl|NC_018086. 70 N----K--PNSK-IVHNFPKLLVDTSTAYLAGEPITES---GD--EKTIKAMQPVFK--ENY---VTDVNSEEVKLSGIF 132 (511) Q Consensus 70 ~----~--~~~r-i~~n~~k~ivd~~~~~l~g~~~~~~---~d--~~~~~~l~~~~~--~n~---~~~~~~~~~~~a~~~ 132 (511) . . ...+ .........|+..++-+-.-|+.+- .+ ......+..++. -|. .......+..+.+.+ T Consensus 45 ~~~~~~~~~~~~~~~~~~v~~cI~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~ 124 (413) T protein:vir:96 45 FFKELISDGYTKLSDSPEVRMAVDCIADLVSNMTIQLMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLE 124 (413) T ss_pred hHhhhccchhHHHhhchHHHHHHHHHHHhhccCceEEEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhc Confidence 0 0 0001 1134555667777777766677651 11 112223444432 232 345667788889999 Q ss_pred CeEEEEeeeCCCCc-e-EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccc Q lcl|NC_018086. 133 GHCFEIHWIDRNKK-H-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVY 210 (511) Q Consensus 133 G~~~~~v~~~~~g~-~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 210 (511) |.||+++..+..|. + .+..++|..+.+.+++.. ++ |.... .+. .+.++.++|++... T Consensus 125 Gn~~~~i~r~~~g~~~~~L~~l~~~~v~~~~~~~~-------~~-y~~~~--~~~------~~~~~evih~k~~~----- 183 (413) T protein:vir:96 125 GNGNAVVKPQVSGDKIIGLTPISPYKVTFNVSDDD-------LD-YSITF--DNK------EYDPSTLLHFVLNP----- 183 (413) T ss_pred CCeEEEEEEcCCCCceEEEEEecCceeEEEEcCCe-------EE-EEEee--cCc------EEchhhEEEEeccC----- Confidence 99999998887774 3 577888888877665321 11 11110 111 12233333332100 Q ss_pred cccccccccccccceeccCCccceEeecCC-cccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchh Q lcl|NC_018086. 211 REIPEELEIKDYEVHPNLLQKFPVLEIIAN-EERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDS 289 (511) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~-~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~ 289 (511) ..++ -.|.|.+..+...+.............+...+.|-.+++-.. ...++. T Consensus 184 --------------------------~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~-~l~~e~ 236 (413) T protein:vir:96 184 --------------------------SIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDS-DSDELS 236 (413) T ss_pred --------------------------CCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC-CCCHHH Confidence 0001 136777776666666655555555556666677776665322 112221 Q ss_pred hhh----hh--------hCceeeecCCCc-eeeee-cCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHH Q lcl|NC_018086. 290 ISN----MK--------NDRVIVTDEDGM-VKFIT-KDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAA 355 (511) Q Consensus 290 ~~~----~~--------~~~~i~~~~~~~-~~~~~-~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~ 355 (511) ... +. .++++.+++++. ..-+. .+.....+.+..+...+.|+..-++|..-.+.. ++. +.. T Consensus 237 ~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~---~~~--~~~ 311 (413) T protein:vir:96 237 DEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLGVG---TYN--KDE 311 (413) T ss_pred HHHHHHHHHHHhcCccccCceeeecCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCC---cch--HHH Confidence 111 11 123455544432 22111 123334455566677788888888886544322 111 111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHh Q lcl|NC_018086. 356 TQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQ 433 (511) Q Consensus 356 ~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~ 433 (511) . ...+..+|.-+++.|...++..--. ....+++.++..+..|..+.++++.++ .|+++.-.++++ T Consensus 312 ~----------~~~~~~~l~P~~~~ie~~ln~~ll~---~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~ 378 (413) T protein:vir:96 312 F----------NNFINTKIMSIAQVIQQTYNKLIVE---EDMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNW 378 (413) T ss_pred H----------HHHHHHHHHHHHHHHHHHHHHhhCC---CCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 1 1234445555555555544422111 123455666677778889999998887 578988888888 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCc Q lcl|NC_018086. 434 FPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANT 486 (511) Q Consensus 434 l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (511) ++.-+.+. .+.+ ....+ ..+ .+...++...+++++ T Consensus 379 ~g~~p~~~--gd~~-----------~~~~n----~~~-~~~~~~~~~~~~~dt 413 (413) T protein:vir:96 379 VGMPPDAE--MDDL-----------LVLEN----YLQ-QKDLVNQKKLIQDET 413 (413) T ss_pred hCCCCCCC--ccee-----------eeccc----ccc-hhhcccccCCCCCCC Confidence 87643211 0000 00000 000 000011111111111 No 219 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=96.28 E-value=0.00079 Score=37.60 Aligned_cols=385 Identities=11% Similarity=0.016 Sum_probs=146.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccc----cccceeccchHHHHHHHHHhhhhccCceec---Cchhh- Q lcl|NC_018086. 34 ITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTN----KPNSKIVHNFPKLLVDTSTAYLAGEPITES---GDEKT- 105 (511) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~----~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~---~d~~~- 105 (511) +++..... ...+ -+.+.+-......+...+ ..-.++. =.-..|+..++-+-.-|+.+- .+... T Consensus 1 m~~~~~~~---~~~~----~~~~~~~~~~~~~~~~~g~~~~~~Al~~~--~V~~cv~~ia~~iA~lp~~~~~~~~~~~~~ 71 (417) T protein:vir:38 1 MKLFRGLA---TEVD----PHWADHLLDSGVIPSFRGGYLGISALRNS--DVLTAVSIVSGDVSRFPLVITDSSTDEVID 71 (417) T ss_pred Cccccccc---cCCC----ccchhhhcccccccccCCceechhhcccH--HHHHHHHHHHHhhccCeeEEEEcCCcceec Confidence 11111000 0000 000000000000000000 0001111 112345666665555576541 11111 Q ss_pred HHHHHHHHhc--cC---hhHHHHHHHHHHhhCCeEEEEeeeCCC-CceE-EEEEcccceEEEecCCCCCceEEEEEEEEE Q lcl|NC_018086. 106 IKAMQPVFKE--NY---VTDVNSEEVKLSGIFGHCFEIHWIDRN-KKHR-FKAVSPMNCLIAYSADLDEEPVAAIYYNTV 178 (511) Q Consensus 106 ~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~-g~~~-i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~ 178 (511) ...+..++.. |. .......+..+.+.+|.||+.+..+.. |.+. +..++|..+.+..++.. .. .| .+ T Consensus 72 ~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~~--~~----~y-~~ 144 (417) T protein:vir:38 72 LANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDPD--NI----IY-RF 144 (417) T ss_pred cchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCCC--eE----EE-EE Confidence 1123334322 32 334566778889999999999888764 4343 55678888766543321 11 11 11 Q ss_pred eecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHH Q lcl|NC_018086. 179 ISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYN 258 (511) Q Consensus 179 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~ 258 (511) ... ++.. ..++..+.++|++.- + .+.-.|.|.+..+...+.... T Consensus 145 ~~~-~~~~---~~~~~~~dviH~r~~----------------------------~----~d~~~G~s~l~~~~~~i~~~~ 188 (417) T protein:vir:38 145 TPY-NSSM---QKVCGFEDVIHWKFF----------------------------S----YDTIMGRSPLLSLGDEIGLQE 188 (417) T ss_pred EEc-CCcE---EEEecCcceEEecCC----------------------------C----CCCccccCHHHHHHHHHHHHH Confidence 111 1111 112333334333210 0 011137777766666555544 Q ss_pred HHHHHHHHHHHHhcCceeEeecCCCCccchhhhhh-----------hhCceeeecCCCceeeeecCCCHHHHHHHHHHHH Q lcl|NC_018086. 259 LAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNM-----------KNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAK 327 (511) Q Consensus 259 ~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~-----------~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 327 (511) ....-....++..+.|-.++.-.. ...++....+ ..++.+.++++.+.+.++.......+.+..+... T Consensus 189 ~~~~~~~~~f~ng~~p~~il~~~~-~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~ 267 (417) T protein:vir:38 189 SGVSTLQKFFKSGLKGSIIKAKES-RLSAEARQKIREDFERAQAGADAGSPIIVDATMDYQPLEVDTNVLNLINSNNYST 267 (417) T ss_pred HHHHHHHHHHhccCCCcEEEEeCC-CCCHHHHHHHHHHHHHHhcccccCCceeccCCceEEEccCCHHHHHHHHHHHhhH Confidence 444444445555566655544221 1111212111 1234666665555554444444444555666667 Q ss_pred HHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCC Q lcl|NC_018086. 328 LDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNL 407 (511) Q Consensus 328 ~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~ 407 (511) +.|+..-++|....+.. .++..++. ....++...|.-+++.|...+...--. ......+.+.|.... T Consensus 268 ~~Ia~~fgVPp~~lg~~--~~~s~~e~----------~~~~~~~~tl~P~~~~ie~~l~~~Ll~-~~~~~~~~~~fd~~~ 334 (417) T protein:vir:38 268 AQIAKALRVPAYRLAQN--SPNQSVKQ----------LADDYIRNDLPFYFEPITSEFELKLLD-DAQRHQYCIGFDTKS 334 (417) T ss_pred HHHHHHhCCCHHHhCCC--CcchhHHH----------HHHHHHHHHHHHHHHHHHHHHHhhhcC-hhhcccceEEechhh Confidence 88888888886555432 22222111 111233445555555444443321111 111223456674221 Q ss_pred CcCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCC Q lcl|NC_018086. 408 PQSYAELADMAVKL--RDMLPDETIINQFPW--ITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNP 483 (511) Q Consensus 408 p~d~~e~a~~~~~~--~g~~s~et~~~~l~~--v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 483 (511) .+....++ +.++ .|+++.-.++++++. +++.+. ++...... .-....... .......+..+ T Consensus 335 -l~~~~~~~-~~~~~~~G~~T~NE~R~~~gl~pi~~g~~--d~~~~~~n------~~~~d~~~~-----~~~~~~~~~kg 399 (417) T protein:vir:38 335 -VNGLPIAD-VNTAVNGGLWTGNEGRAELGKKPLKDPNM--DRIQSTLN------TVFLDQKEA-----YQAEHAAELKG 399 (417) T ss_pred -hhHHHHHH-HHHHHhCCCcCHHHHHHHhCCCCCCCCCC--Ceeeeccc------ccccccccc-----cccccccccCC Confidence 12222222 3333 488888888888765 222211 11100000 000000000 00000000000 Q ss_pred CCccccc--cCCCCcccc Q lcl|NC_018086. 484 ANTSTIT--TTDPVAAKE 499 (511) Q Consensus 484 ~~~~~~~--~~~~~~~~~ 499 (511) |....+. ..+..+.++ T Consensus 400 g~~~~~~~~~~~~~~~~~ 417 (417) T protein:vir:38 400 GDTNAKGNQNGSGTNANS 417 (417) T ss_pred CCCCCCCCCcCCCCcCCC Confidence 0000000 000001111 No 220 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=96.17 E-value=0.00046 Score=38.86 Aligned_cols=186 Identities=12% Similarity=0.069 Sum_probs=83.7 Q ss_pred eeEeecCCC--Cc-cchhhhhh------hh-CceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccc-cc Q lcl|NC_018086. 275 YLWLQGFDL--SA-DSDSISNM------KN-DRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVS-KD 343 (511) Q Consensus 275 ~l~~~G~~~--~~-~~~~~~~~------~~-~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~ 343 (511) ++.++|... .. .+.....+ +. ..++.+.++. -+|-+.+.+.+.+...+......|...+++|-.-. |. T Consensus 1 V~k~~~l~~~~~~~~~~~~~r~~~~~~~~~~~~~~~ld~~~-e~~e~~~~~lsGl~d~l~~~~~~iaa~s~iP~t~LfG~ 79 (201) T protein:vir:10 1 MWKAKGLADLCDDSDGAARLRLAQVDNNSGVGQAIGIDADS-EEYNVLNSDIGGIDTFLSQKFDRIVALSGIHEIILKGK 79 (201) T ss_pred CccchHHHHHhcCChHHHHHHHHHHHHhhhhhhhheeecCC-cceeeeecCcCChHHHHHHHHHHHHhHhcCchhhhcCC Confidence 222222100 00 00000111 11 1223333221 23556677888999999999999999999996443 32 Q ss_pred c-c--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHH Q lcl|NC_018086. 344 F-T--AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVK 420 (511) Q Consensus 344 ~-~--~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~ 420 (511) . + |+||..=..-|...+.-.. ++.+.+.+++++++++. ..++++.|+|-...+.++.|+...+ T Consensus 80 sp~Glnatge~d~~nyyd~i~~~Q--e~~l~p~le~l~~~~~~------------~~~~~~~f~pL~~~s~kekAei~~~ 145 (201) T protein:vir:10 80 NVGGVSASQNTALETFYGYVDRKR--KAELLPLLEFLLPFIVT------------EQEWSVEFNPLSQVSDKDKSEILEK 145 (201) T ss_pred CCccccccchhHHHHHHHHHHHHH--HHHHHHHHHHHHHhhcC------------CCCceEeeCCCCCCCHHHHHHHHHH Confidence 2 1 4577754443333332222 36678888888776431 1368899999999999999887665 Q ss_pred HhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCC-CCccccccCCCCCCc Q lcl|NC_018086. 421 LRDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGA-STAAANKLDKNPANT 486 (511) Q Consensus 421 ~~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 486 (511) .+...+. +-..+ +-++++..++++.+... .+......... .....+.+.+.+.+. T Consensus 146 ~a~a~~~---~~~~g-~i~~~e~r~~L~~~~~~-------~~~~~~~~~~~~~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 146 NVNSVAA---LIAAG-IIDADEARDTLRAISTE-------VKIGEGSIQTEVVINESEDPLDVSANN 201 (201) T ss_pred HHHHHHH---HHHcC-CCCHHHHHHHHHhcCCc-------CCCCCCCCCccccccccCCCCCCCCCC Confidence 5321111 11111 11222222222221000 00000000000 000000011111111 No 221 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=96.06 E-value=0.0011 Score=36.89 Aligned_cols=372 Identities=8% Similarity=0.020 Sum_probs=152.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCcc-ccc----cce--eccchHHHHHHHHHhhhhccCcee---cCch Q lcl|NC_018086. 34 ITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDT-NKP----NSK--IVHNFPKLLVDTSTAYLAGEPITE---SGDE 103 (511) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~-~~~----~~r--i~~n~~k~ivd~~~~~l~g~~~~~---~~d~ 103 (511) +.+++....+... +-............... ... ... ..+++....|+..++-+-.-|+.+ ..|. T Consensus 1 Mg~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg 74 (423) T protein:vir:81 1 MGFLQKLGLAPSV------VATPEPIELVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVEDG 74 (423) T ss_pred CchhHhhcccccc------ccCccccccccccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecCC Confidence 2233222111110 00000000000000000 000 000 113445566777777776667754 2221 Q ss_pred --h--hHHHHHHHHhc-c---ChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEe----cCCCCCceEE Q lcl|NC_018086. 104 --K--TIKAMQPVFKE-N---YVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAY----SADLDEEPVA 171 (511) Q Consensus 104 --~--~~~~l~~~~~~-n---~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~----d~~~~~~~~~ 171 (511) + ....+..++.+ | ........+..+.+.+|.||+++..+..+...+..+.|..+..+. .+.. .. T Consensus 75 ~~~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~~v~~~~~~~~~-~~--- 150 (423) T protein:vir:81 75 GRERVREGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVSWVQRRAYKDGW-GS--- 150 (423) T ss_pred ceeeeccchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecccceeeeeeccCCC-cc--- Confidence 1 11224445433 3 234555667788899999999887765443333334333322211 1110 00 Q ss_pred EEEEEEE-eecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecC----C-cccCc Q lcl|NC_018086. 172 AIYYNTV-ISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIA----N-EERLG 245 (511) Q Consensus 172 ~v~~~~~-~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n----~-~~g~s 245 (511) +.|... ....+|.. + .+.++.+ +++++ . ..|.| T Consensus 151 -~~Y~~~~~~~~~g~~---~-~~~~~ev------------------------------------ih~r~~~~~~~~~G~s 189 (423) T protein:vir:81 151 -LDYIIIESGDNDGRS---V-KVPGERV------------------------------------IHRHGYNPKTMKRGKS 189 (423) T ss_pred -eEEEEEEecCCCceE---E-EEcccce------------------------------------EEecCCCCCCcccccc Confidence 111000 00001111 0 1122222 22221 1 24777 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCC----CCccchhh----hhhh---------hCceeeecCCCcee Q lcl|NC_018086. 246 DFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFD----LSADSDSI----SNMK---------NDRVIVTDEDGMVK 308 (511) Q Consensus 246 ~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~----~~~~~~~~----~~~~---------~~~~i~~~~~~~~~ 308 (511) .+..+...++.......-....+...+.|-.+++-.. ....++.. ..+. .++++.++++.+.+ T Consensus 190 pi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~ 269 (423) T protein:vir:81 190 PVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGGTLLLEDGMKAE 269 (423) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCcceecCCCceEE Confidence 7776666666555544444455556666766664211 11112211 1111 13466777666666 Q ss_pred eeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_018086. 309 FITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFM 388 (511) Q Consensus 309 ~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~ 388 (511) .++.......+.+..+.....|+..-++|..-.+..+..+...++.+.. ..+..+|.-.++.|...+... T Consensus 270 ~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~~~~----------~f~~~~L~P~~~~ie~~l~~~ 339 (423) T protein:vir:81 270 NFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSNVREFRK----------ALYGDNLGSWIRIIQDVMNLF 339 (423) T ss_pred eccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccHHHHHH----------HHHHHHHHHHHHHHHHHHhhh Confidence 5554444445555667777889888898876555443333222222211 222223333333333333221 Q ss_pred CCC-cccc--ccceeEEeCCCCCcCHHHHHHHHHHH---hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018086. 389 NKA-KDLK--PYEVTPVFVRNLPQSYAELADMAVKL---RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNF 462 (511) Q Consensus 389 ~~~-~~~~--~~~i~i~f~~~~p~d~~e~a~~~~~~---~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~ 462 (511) -.. ...+ ...+++.+..-+..|..+.++++.++ .|+++.-.+++.++.-+.+. .+.+ .... T Consensus 340 L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl~p~~g--GD~~-----------~~p~ 406 (423) T protein:vir:81 340 LLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAMDNLPSIDG--GDDL-----------ARPL 406 (423) T ss_pred hcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHHhCCCCCCC--ccee-----------eccc Confidence 111 0111 12233444455667888888887764 37788777777665422110 0000 0000 Q ss_pred cccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCCCC Q lcl|NC_018086. 463 KQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKT 510 (511) Q Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (511) +.. ..+. .+ ...|+..| T Consensus 407 n~~---------~~~~-~~---------------------~~~~~~~t 423 (423) T protein:vir:81 407 NTE---------FGDS-ED---------------------APGEEVET 423 (423) T ss_pred ccc---------cCcc-CC---------------------CCCCCCCC Confidence 000 0000 00 00011111 No 222 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=96.01 E-value=0.0011 Score=36.74 Aligned_cols=338 Identities=11% Similarity=0.014 Sum_probs=142.4 Q ss_pred CccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHH Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPK 82 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k 82 (511) |+|+-- ++++.. ... ..+.....+.... ..........+ .+. .=.- T Consensus 1 M~~~~~--------f~~r~~-----~~~-----------------~~~~~~~~~~~~~-~~~~~v~~~~a-l~~--~av~ 46 (359) T protein:vir:10 1 MSILNP--------FERRSS-----ITP-----------------NNYYPFMVQNGSI-VPNSLVDATEA-LKN--SDLY 46 (359) T ss_pred Ccccch--------hhcccc-----CCC-----------------Ccchhhhhccccc-cCCcccCHHHh-hcc--hHHH Confidence 333210 000000 000 0000010000000 00000000000 011 1112 Q ss_pred HHHHHHHhhhhccCceecCchhhHHHHHHHHhc-cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccce Q lcl|NC_018086. 83 LLVDTSTAYLAGEPITESGDEKTIKAMQPVFKE-NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNC 157 (511) Q Consensus 83 ~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~ 157 (511) ..|+..++-+-+.|+. ++.. +..++.+ |. -......+....+.+|.||+++..+..|++ .+..++|..+ T Consensus 47 ~cv~~ia~~ia~~p~~---~~~~---~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v 120 (359) T protein:vir:10 47 AVTSLISSDIAGTRFI---GNQV---FTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAI 120 (359) T ss_pred HHHHHHHHhhhcCccc---cchH---HHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceE Confidence 2445555544455653 1222 2222222 32 223455677788889999999988888875 4667788777 Q ss_pred EEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEee Q lcl|NC_018086. 158 LIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEI 237 (511) Q Consensus 158 ~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~ 237 (511) .+..+++. ++| .+....++. ...+.++.+.|++.-.. ..+++ T Consensus 121 ~i~~~~~~-------~~y-~~~~~~~~~----~~~~~~~evih~~~~~~------------------~~~~~-------- 162 (359) T protein:vir:10 121 TIDLTDDT-------LTY-EVNQFDDYP----SAKYNASEMIHVKIMAY------------------GVDTL-------- 162 (359) T ss_pred EEEEcCCe-------EEE-EEEecCCce----EEEEcccceEEeccCCC------------------CCCcc-------- Confidence 66554321 111 111111111 12244555555431100 00000 Q ss_pred cCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhh-----------hCceeeecCCCc Q lcl|NC_018086. 238 IANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMK-----------NDRVIVTDEDGM 306 (511) Q Consensus 238 ~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~-----------~~~~i~~~~~~~ 306 (511) +.-.|.|.++.+...+........-..+.++..+.|-.+++-.....+++....++ .++++.++++.+ T Consensus 163 -dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~n~g~~~vl~~g~~ 241 (359) T protein:vir:10 163 -HNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRKEFEKANGGNNSGRVMVLDQSAD 241 (359) T ss_pred -CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCceecCCCcc Confidence 11247777777776666666555555556666666766665322111222222111 134666766656 Q ss_pred eeeeecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc--CccHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 307 VKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT--AASGQALKAATQPLEN-KSAVKESKFRKVLAKRYELVCS 383 (511) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~--~~Sg~Ai~~~~~~l~~-k~~~~~~~~~~~l~~~~~li~~ 383 (511) .+.+........+.+..+...+.|+..-++|....+..+ +.+...++..+..... .+.- +...|.+.+ T Consensus 242 ~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~l~p----~~~~l~~~l----- 312 (359) T protein:vir:10 242 FSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNALNRFIEP----LISELRIKC----- 312 (359) T ss_pred eeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHHHHHHHHHH----HHHHHHHHh----- Confidence 555544433444556677778889888899876554332 2344444333221111 1111 111111100 Q ss_pred HHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCH Q lcl|NC_018086. 384 YLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDA 440 (511) Q Consensus 384 ~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~ 440 (511) .....++.. .-+.| |.......+.++ .|+++.-.+++.++.-.=. T Consensus 313 -----~~~~~~~~~-~~~~~------d~~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 313 -----DSSIGVDMS-PITDY------SNSVFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred -----hhhhcccch-hhhhc------CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 000011111 01122 223333334444 4788888888876432111 No 223 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=95.96 E-value=0.0012 Score=36.60 Aligned_cols=373 Identities=12% Similarity=0.014 Sum_probs=162.0 Q ss_pred HHHHHHHHH-HHH----HHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCceec---Cch--h Q lcl|NC_018086. 35 TLAEMHSRS-SSA----YGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITES---GDE--K 104 (511) Q Consensus 35 ~~~~~~~~~-~~~----~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~---~d~--~ 104 (511) -+..+...+ ... -.-+.....|-... .......+ ..-+.+.-....|+..++-+-+-|+.+- .+. . T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~g~~~s-~~~~~v~~---~~al~~~~v~~cv~~ia~~ia~lp~~~~~~~~~~~~~ 76 (419) T protein:vir:80 1 MFFSRQLLSNLGQTQPGSGGWVSALLGSARS-EAGQVVTP---ASALSLTVLQNCVTLLAESIAQLPVELYERSGDDRKP 76 (419) T ss_pred CCcccccccccCcCCCCcchhhHHhhccccc-ccCcccCh---HHhhccHHHHHHHHHHHHhhccCceEEEEecCCCccc Confidence 001110000 000 00000000110000 00000000 0011223334466666666666677641 111 1 Q ss_pred -hHHHHHHHHh--ccC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCceE-EEEEcccceEEEecCCCCCceEEEEEEEE Q lcl|NC_018086. 105 -TIKAMQPVFK--ENY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKHR-FKAVSPMNCLIAYSADLDEEPVAAIYYNT 177 (511) Q Consensus 105 -~~~~l~~~~~--~n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~-i~~~~p~~~~~v~d~~~~~~~~~~v~~~~ 177 (511) ....+..+|. -|. -......+..+.+.+|.||+++..+..|++. +..++|..+.+..+.... .+|. T Consensus 77 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~~~~~~~-------~~y~ 149 (419) T protein:vir:80 77 ATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVMKGPDLK-------PMYR 149 (419) T ss_pred ccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCce-------EEEE Confidence 1122444443 232 3345567778889999999999888888864 777888887665443210 0111 Q ss_pred EeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecC----CcccCchhHHHHHH Q lcl|NC_018086. 178 VISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIA----NEERLGDFEAQLSL 253 (511) Q Consensus 178 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n----~~~g~s~~~~v~~l 253 (511) .. +.. .+..+. |++++. .-.|.|.+..+... T Consensus 150 ~~----~~~-----~~~~~~------------------------------------i~h~~~~~~d~~~G~s~i~~~~~~ 184 (419) T protein:vir:80 150 VA----GAD-----PLPQRL------------------------------------VHHVRWMSINGYTGLSPVLLHANA 184 (419) T ss_pred Ec----Ccc-----ccchhh------------------------------------eEEecCCCCCCcccccHHHHHHHH Confidence 10 000 011111 122221 12477777766666 Q ss_pred HHHHHHHHHHHHHHHHHhcCceeEee--cC-CCCccchhhhhhh------------hCceeeecCCCceeeeecCCCHHH Q lcl|NC_018086. 254 IDAYNLAVSDSVNDIAYWNDAYLWLQ--GF-DLSADSDSISNMK------------NDRVIVTDEDGMVKFITKDVNDKH 318 (511) Q Consensus 254 ~d~~~~~~s~~~~~~~~~~~p~l~~~--G~-~~~~~~~~~~~~~------------~~~~i~~~~~~~~~~~~~~~~~~~ 318 (511) ++.......-..+.+...+.|-.+++ +. ....+++....++ .++++.++++.+++.+........ T Consensus 185 i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q 264 (419) T protein:vir:80 185 IGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGMKFKPLSMTNVDAA 264 (419) T ss_pred HHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEeccCChhhHH Confidence 65555544444445555566766654 21 1111222222221 234677776666655554444555 Q ss_pred HHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-cccccc Q lcl|NC_018086. 319 IENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKA-KDLKPY 397 (511) Q Consensus 319 ~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~-~~~~~~ 397 (511) +.+..+...+.|+..-++|..-.+.....+...++... ...+..+|.-+++.|...++..--. ...... T Consensus 265 ~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~----------~~f~~~~l~P~~~~ie~~l~~kll~~~~~~~~ 334 (419) T protein:vir:80 265 LIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQS----------LQFVIYTLLPWVKRHEQAKTRDLLLPSERKQY 334 (419) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHH----------HHHHHHHHHHHHHHHHHHHhhhccCccccCCe Confidence 66777788889999999987555443322222221111 1222333433433333333321111 111122 Q ss_pred ceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCcc Q lcl|NC_018086. 398 EVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAA 475 (511) Q Consensus 398 ~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 475 (511) .+++.+...+..|..+.++.+.++ .|+++.-.+++.++.-+-+.. +.. ..+.+..... .. T Consensus 335 ~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gG--D~~-----------~~~~n~~~~~-----~~ 396 (419) T protein:vir:80 335 FIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGG--DIY-----------LSPMNMVDAS-----KP 396 (419) T ss_pred EEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--cee-----------eecccccccc-----cc Confidence 344445566667889999988887 578888778777754211000 000 0000000000 00 Q ss_pred ccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 476 ANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) .+ ...+...+.++.+| T Consensus 397 ~~--------------------~~~~~~~~~~~~~~ 412 (419) T protein:vir:80 397 QP--------------------IPMGKTEPTKAALD 412 (419) T ss_pred cc--------------------ccCCCCCchhhhHH Confidence 00 00000011122222 No 224 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=95.86 E-value=0.0013 Score=36.33 Aligned_cols=387 Identities=10% Similarity=0.034 Sum_probs=161.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc--------c-cc----C---CcCccccccc---eeccchHHHHHHHH Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSSAYGVLYDYYKGNHIA--------I-QS----R---TFDDTNKPNS---KIVHNFPKLLVDTS 88 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~--------~-~~----~---~~~~~~~~~~---ri~~n~~k~ivd~~ 88 (511) +-++..+.++.+ ++..+....+. . .. . .....+.... =+...-....|+.. T Consensus 1 ~~~~~~~g~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~a~~~~aV~~~v~~I 70 (432) T protein:vir:97 1 MPDEKKLGLLGQ----------LKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLV 70 (432) T ss_pred CCCcccCchhhh----------hHhhcCCccccccccccccccCchhhhhhcccccccCcccchHhhhcchHHHHHHHHH Confidence 111111122211 11111111000 0 00 0 0000000000 00112222344444 Q ss_pred HhhhhccCcee---cCch--h-hHHHHHHHHh--ccC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccc Q lcl|NC_018086. 89 TAYLAGEPITE---SGDE--K-TIKAMQPVFK--ENY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMN 156 (511) Q Consensus 89 ~~~l~g~~~~~---~~d~--~-~~~~l~~~~~--~n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~ 156 (511) ++-+-.-|+.+ ..|. + ....+..+|. -|. -......+..+.+.+|.||+++..+ +|++ .+..++|.. T Consensus 71 a~~ia~lp~~~y~~~~~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~p~~ 149 (432) T protein:vir:97 71 SQAVAAMPLMMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDR 149 (432) T ss_pred HHhhccCceEEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcc Confidence 44444456653 1111 1 1123444442 232 3345667788899999999888776 4664 466788988 Q ss_pred eEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEe Q lcl|NC_018086. 157 CLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLE 236 (511) Q Consensus 157 ~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 236 (511) +.++.+.... + +|.... .+|.. ..+.++.+++++. .++ T Consensus 150 v~v~~~~~g~--~-----~y~~~~-~~g~~----~~~~~~~iih~r~----------------------------~~~-- 187 (432) T protein:vir:97 150 LTITTDTKGN--T-----AYRYRR-TDGQM----IDIPRQQIWKIMG----------------------------YSL-- 187 (432) T ss_pred eEEEEcCCCc--E-----EEEEEe-cCceE----EEEccccEEEecC----------------------------cCC-- Confidence 8877654321 1 111111 12221 1233444444321 000 Q ss_pred ecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhh--------hhCceeeecCCCcee Q lcl|NC_018086. 237 IIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNM--------KNDRVIVTDEDGMVK 308 (511) Q Consensus 237 ~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~--------~~~~~i~~~~~~~~~ 308 (511) +.-.|.|-+......++..........+.+...+.|-.+++-.. ...++....+ ..++++.++++.+.+ T Consensus 188 --dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~-~l~~e~~~~~~~~~~~~~nag~~~vl~~g~~~~ 264 (432) T protein:vir:97 188 --DGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR-FLTDDQYDSFSKKVSGSVEAGRAPLLEGGMDVK 264 (432) T ss_pred --CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCC-CCCHHHHHHHHHHHhhhhcCCCceecCCCceEE Confidence 01236676666555555444444334444455556655554321 1222222222 224577777666666 Q ss_pred eeecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc--C-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 309 FITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT--A-ASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYL 385 (511) Q Consensus 309 ~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~--~-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~ 385 (511) .+........+.+..+.....|+..-++|....+... . ..+..++.... ..+..+|.-.++.|...+ T Consensus 265 ~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~~----------~f~~~tl~P~~~~ie~~l 334 (432) T protein:vir:97 265 SLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQL----------GFLTMTLSPWLRRIEQSI 334 (432) T ss_pred EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHHH----------HHHHHHHHHHHHHHHHHH Confidence 6655555556666778888899999999875554321 1 12233322211 122233333333333333 Q ss_pred HhcC-CCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_018086. 386 EFMN-KAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNF 462 (511) Q Consensus 386 ~~~~-~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~ 462 (511) ...- .........+++.+..-+..|..+.++++.++ .|+++.-.++.+++.-.- +..-..+. T Consensus 335 n~kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~-~g~~~~~~-------------- 399 (432) T protein:vir:97 335 ALNLLTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKL-GGNAAVLT-------------- 399 (432) T ss_pred hhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC-CCCcceEe-------------- Confidence 2111 11111122344555556667889999998887 478998888887764210 00000000 Q ss_pred cccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCC Q lcl|NC_018086. 463 KQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQK 506 (511) Q Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (511) ......+. +..+.++++++.+. ..+++++..++ T Consensus 400 -~~~~~~pl-----~~~~~~~~~~~~~~-----~~~~~~~~~~~ 432 (432) T protein:vir:97 400 -VQSAMVPL-----DSIGLQASPEPASG-----LGNQQQDKVSK 432 (432) T ss_pred -ecccccch-----hhhcccCCCCCCCC-----CCCcccccccC Confidence 00000000 00000000000000 01111111111 No 225 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=95.82 E-value=0.0014 Score=36.22 Aligned_cols=430 Identities=9% Similarity=-0.023 Sum_probs=166.1 Q ss_pred CCHHHHHHHHHHHH--HHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhcc------Cc-e Q lcl|NC_018086. 28 FDLRELITLAEMHS--RSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGE------PI-T 98 (511) Q Consensus 28 ~~~~~l~~~~~~~~--~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~------~~-~ 98 (511) |.. ....+..+.. .-..+++.+.+|..-.-- ..............+...+-+...++.+++-|++- || + T Consensus 1 m~~-~~~~l~~k~~R~~~e~~w~e~a~~~lP~~~-~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 78 (514) T protein:vir:80 1 MRQ-QASAMWAEYRDSTAIRKAEDFAKFTIASLM-VDPLDKTHQAEVVEYDFQSAGAFLVNNLTAKLALTLFPPGRPSFQ 78 (514) T ss_pred Ccc-chHHHHHHhhcchHHHHHHHHHHHhccccc-CCCCCCcccccccccccchhHHHHHHHHHHHHHhhhcCCCCcccc Confidence 110 0112222211 123445555555433100 00000000011112233455556666666666542 22 2 Q ss_pred ecCch--------------hhHH-------HHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccce Q lcl|NC_018086. 99 ESGDE--------------KTIK-------AMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNC 157 (511) Q Consensus 99 ~~~d~--------------~~~~-------~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~ 157 (511) +..++ +... .+...+..++|.....++.++..++|.+.+++..+. + .++.++- .- T Consensus 79 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~-~--~~~~~pl-~~ 154 (514) T protein:vir:80 79 IELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYREPGT-G--KMLVWTM-QS 154 (514) T ss_pred cccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEecCC-C--cEEEEEc-Ce Confidence 22111 1111 244556778999999999999999999987774332 2 3445543 33 Q ss_pred EEEecCCCCCceEEEEEEEEEeecC--------------CcceEEEEEEEcCCcEEEEEEccCc--cccccccccccccc Q lcl|NC_018086. 158 LIAYSADLDEEPVAAIYYNTVISDI--------------TGHQIRTYEVYTEDLIYKFSTDDER--EVYREIPEELEIKD 221 (511) Q Consensus 158 ~~v~d~~~~~~~~~~v~~~~~~~~~--------------~~~~~~~~~~~~~~~i~~~~~~~~~--~~~~~~~~~~~~~~ 221 (511) +++--+.. +.+...+|.+...... ..+....+++|+.- ++.....+ +.++....... . T Consensus 155 y~v~~d~~-G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v---~~~~~~~~~~~sv~~e~~g~~--i 228 (514) T protein:vir:80 155 YTVRRTSH-GDPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVI---EWQPTPNGKRCAVWHELEGKR--V 228 (514) T ss_pred EEEeeCCC-cCeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEE---EeecCCCCeEEEEEEecccee--e Confidence 44444443 3344455444322110 11122233443321 11111110 11111000000 0 Q ss_pred ccceeccCCccceEeec-----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhC Q lcl|NC_018086. 222 YEVHPNLLQKFPVLEII-----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKND 296 (511) Q Consensus 222 ~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~ 296 (511) ......++..+|++.++ ++.+|+|-..+..+-+..+|.+.-...........|.+.+.-... ...... ..... T Consensus 229 ~~es~y~~~e~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~-~~~~~l-~~~~~ 306 (514) T protein:vir:80 229 GPESSYPAHLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNLVDEAKG-GAVDDY-RDAET 306 (514) T ss_pred cccCccccccCCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCcccc-cchhhh-cccCC Confidence 11111122346665443 356899999999999999998887777777777777665431110 001101 11112 Q ss_pred ceeeecCCCceeeeec--CCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHH----HHH-HHH Q lcl|NC_018086. 297 RVIVTDEDGMVKFITK--DVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKS----AVK-ESK 369 (511) Q Consensus 297 ~~i~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~----~~~-~~~ 369 (511) +.+......+++.+.. ..+.......++.++..|...-.... ........|++.+......+.... .+. ... T Consensus 307 g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFml~~-~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~El 385 (514) T protein:vir:80 307 GDFVPGQVGSVASYERGDYNKIAQASASVESIVMRLNRAFMYTG-QVRDAERVTVEEIRTVAEEAENLLGGVYSLLAETL 385 (514) T ss_pred ceeecCCCccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhc-cCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHH Confidence 3333223344555443 34566667777777777754322111 111223357777665433332221 111 122 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCC-cCHHHHHH-------HHHHHhccCC-------hHHHHHhC Q lcl|NC_018086. 370 FRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLP-QSYAELAD-------MAVKLRDMLP-------DETIINQF 434 (511) Q Consensus 370 ~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p-~d~~e~a~-------~~~~~~g~~s-------~et~~~~l 434 (511) +..-+.+.+.++... ..+.-......-+.+.+.-.+. ....+.++ .+..++++.+ ...++..+ T Consensus 386 l~Pli~r~~~il~r~--~~g~lP~~p~~l~~~~~vs~la~l~r~~~~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~~~~ 463 (514) T protein:vir:80 386 QAPLAYLTMYEASRG--NGGMLLGIAQGVYRPSIITGIPALTRNIETANILRATQEASAIVPALVQLSKRFDPEKLVERI 463 (514) T ss_pred HHHHHHHHHHHHhhh--ccCCCCCCCchhhcceeeecHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHH Confidence 222233333322110 0111111111123344422221 11111111 2222222222 22233332 Q ss_pred ---CCCC-C-H--HHHHHHHHHHHHHH-HHHHH--hhcccccc-CCCCCCc Q lcl|NC_018086. 435 ---PWIT-D-A--RQEVEKADAQRQKR-ADIAL--QNFKQTSA-VQGASTA 474 (511) Q Consensus 435 ---~~v~-d-~--~~E~~ri~~E~~~~-~~~~~--~~~~~~~~-~~~~~~~ 474 (511) -+++ . . .+|...+.+|+++. ...++ ...+...+ ..|-... T Consensus 464 a~~~Gvp~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (514) T protein:vir:80 464 FANNSVDLSTLSKDPDVVAAEAEQEAALAQQQLDVASGALAAETSAGVLTS 514 (514) T ss_pred HHHhCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccCC Confidence 1122 1 0 12222222222211 11111 11111000 0111111 No 226 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=95.01 E-value=0.003 Score=34.45 Aligned_cols=399 Identities=9% Similarity=-0.067 Sum_probs=165.0 Q ss_pred CCCccchhhcc-----ccc--------CchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcC Q lcl|NC_018086. 1 MAIPNGQINAG-----DII--------TTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFD 67 (511) Q Consensus 1 ~~~~~~~~~~~-----~~~--------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~ 67 (511) ||-+=++--.. ... .....+....-..+.+..+...+.. . ..+ +.| .. T Consensus 1 m~kk~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~-~---~~~----~ly--------~~--- 61 (448) T protein:vir:77 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQG-K---DGL----LVY--------HK--- 61 (448) T ss_pred CCCCCCCCcccCCcccccchhhhhhhccchhhhcccccccccccchhHhhcc-c---cch----HHH--------HH--- Confidence 54433321000 000 0000000000011111111111100 0 000 000 00 Q ss_pred ccccccceeccchHHHHHHHHHhhhhccCceec--Cchh----hHHHHHHHHhc-------cChhHHHHHHHHHHhhCCe Q lcl|NC_018086. 68 DTNKPNSKIVHNFPKLLVDTSTAYLAGEPITES--GDEK----TIKAMQPVFKE-------NYVTDVNSEEVKLSGIFGH 134 (511) Q Consensus 68 ~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~--~d~~----~~~~l~~~~~~-------n~~~~~~~~~~~~a~~~G~ 134 (511) . ....+..-.+.+....+.+.++++. +++. ..+.+.+.+.. ..|...+..+ .+|..||. T Consensus 62 ------m-~~D~hi~s~l~~Rk~av~~~~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~-lda~~~G~ 133 (448) T protein:vir:77 62 ------M-LSDGTVKNALNYIFGRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGM 133 (448) T ss_pred ------H-hhChHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHH-HHhhhhcc Confidence 0 1145556666776677778888773 2222 22344444433 2566666654 68999997 Q ss_pred EE-EEeee-CCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccc Q lcl|NC_018086. 135 CF-EIHWI-DRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYRE 212 (511) Q Consensus 135 ~~-~~v~~-~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 212 (511) ++ +++|. ..+|...+..+.+... .+ +++|.... ++... +.+.. +... T Consensus 134 s~~Eivw~~~~dg~~~~~~l~~r~~------~~-------~~~f~~~~--~~~l~----~~~~~---------~~~~--- 182 (448) T protein:vir:77 134 AAGEIVLTLGADGKLILDKIVPIHP------FN-------IDEVLYDE--EGGPK----ALKLS---------GEVK--- 182 (448) T ss_pred eeEEEEEeecCCCceeeccccccCC------Cc-------cceeeeec--CCceE----EEecC---------Cccc--- Confidence 65 56664 4566654433322210 00 01111111 11110 00100 0000 Q ss_pred cccccccccccceeccCCccceEeec--CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCC-ccchh Q lcl|NC_018086. 213 IPEELEIKDYEVHPNLLQKFPVLEII--ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLS-ADSDS 289 (511) Q Consensus 213 ~~~~~~~~~~~~~~~~~g~iPvv~~~--n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~-~~~~~ 289 (511) ......+..+.+++.+=+.+.. .++.|.|.+..+.-..--=+..+.+++..++.|+.|+++.+-..+. .+++. T Consensus 183 ----~~~~~~~~~~lP~~~~i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~ 258 (448) T protein:vir:77 183 ----GGSQFVNGLEIPIWKTVVFLHNDDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQ 258 (448) T ss_pred ----ccccCCCccccccceEEEEecCCcCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCCCCCCCHHH Confidence 0000011122234443222211 2456778887755544444667788899999999999998843221 11121 Q ss_pred -------hhhhh--hCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHH Q lcl|NC_018086. 290 -------ISNMK--NDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLE 360 (511) Q Consensus 290 -------~~~~~--~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~ 360 (511) ..++. ...+..++++.++++++.......+...++...+.|...--..-++.+..++.++.+......-.. T Consensus 259 ~~~l~~av~~i~~g~~a~~iiP~g~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~g~~~~~~~~~~~v~~ 338 (448) T protein:vir:77 259 WEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARALGIDFNTVQLNMGVQAVNIGEFVSLTQ 338 (448) T ss_pred HHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCccCHHHHHHHHHHHHHHHHhccccccccccchhhhhhhhHHHHHH Confidence 12222 234567899999999987766666777888888887766544434444433333333332111111 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCC Q lcl|NC_018086. 361 NKSAVKESKFRKVLA-KRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWITD 439 (511) Q Consensus 361 ~k~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v~d 439 (511) ..+..-.+.+...+. ++++-++. . +.... ..-..+.|....+.|.++.++.+.++++. ..+.++ +. T Consensus 339 ~~~~aDa~~i~~tln~~Li~~l~~---l-Nfg~~--~~~P~~~f~~~e~eDl~~~a~~~~~l~~~-----~~~~~~-ip- 405 (448) T protein:vir:77 339 QTIISLQREFASAVNLYLIPKLVL---P-NWPGA--TRFPRLTFEMEERNDFSAAANLMGMLINA-----VKDSED-IP- 405 (448) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH---h-cCCCC--CCCCEEEecCCChhhHHHHHHHhHHHHHH-----HHHHhc-CC- Confidence 112222333444443 23333332 2 22111 11246788888888888888888777532 111111 00 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 440 ARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 440 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) .. ..+..+.....+. ..+...++...+..+++.|- T Consensus 406 ------------------------~~--~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~ 440 (448) T protein:vir:77 406 ------------------------TE--LKALIDALPSKMR-----------RALGVVDEVREAVRQPADSR 440 (448) T ss_pred ------------------------cc--CCcCCCCCchhcc-----------cccCCCCCCCchhhcchhhH Confidence 00 0000000000000 00111111111122222222 No 227 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=94.94 E-value=0.0031 Score=34.32 Aligned_cols=463 Identities=11% Similarity=0.084 Sum_probs=177.3 Q ss_pred CCCccchh--hccccc-CchhhHhh----hhccCCCHH--HHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCcccc Q lcl|NC_018086. 1 MAIPNGQI--NAGDII-TTNIRRKH----FIRRNFDLR--ELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNK 71 (511) Q Consensus 1 ~~~~~~~~--~~~~~~-~~~~~~~~----~~~~~~~~~--~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~ 71 (511) ||-.|-|. ++.... .......+ +........ .+...+.....-+.+|+.+-.+++-+.. T Consensus 5 fgf~~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~A------------ 72 (558) T protein:vir:10 5 FGFSIEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYREMALHPEADGA------------ 72 (558) T ss_pred hcchhhhhhhhccCCccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHHHHhhccchhhH------------ Confidence 55555322 111111 00000000 000000000 0001111122223334444333333221 Q ss_pred ccceeccchHHHHHHHHHhh-hhccCceec---------CchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeee Q lcl|NC_018086. 72 PNSKIVHNFPKLLVDTSTAY-LAGEPITES---------GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWI 141 (511) Q Consensus 72 ~~~ri~~n~~k~ivd~~~~~-l~g~~~~~~---------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~ 141 (511) ...||+..+-+ ....|+.+. ..+...+.+..++.--+|+....+..|.+.+.|+.|.+... T Consensus 73 ---------v~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRiyfHKii 143 (558) T protein:vir:10 73 ---------IEDVVNEAIVSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVDGRVFYLKVI 143 (558) T ss_pred ---------HHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEE Confidence 22223222211 122333332 22344556677777788999999999999999999988877 Q ss_pred CC----CCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCc----ceEEEEEEEcCCcEEEEEEccCccccccc Q lcl|NC_018086. 142 DR----NKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITG----HQIRTYEVYTEDLIYKFSTDDEREVYREI 213 (511) Q Consensus 142 ~~----~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~----~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 213 (511) |. +|-..+..+||+.+-.|..-.... .-.-....+ .+..+ ..+...-+|.+...+.....+ T Consensus 144 d~k~pk~GI~ELr~lDPr~i~~Vr~i~~~~--~~~~~~~~~-~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~-------- 212 (558) T protein:vir:10 144 DTKNPQEGIQDLRYIDPLKIKFIRQEKRKP--GNQDPAIRV-RSEQDVVPNPEFEEFYIYTPKVQHPTGMVG-------- 212 (558) T ss_pred eCCCccccceeeeeeCcccceeeeeecccc--ccccceeee-ecccceeeccceeEeeeecCCcccccccce-------- Confidence 54 466778899999986665421111 000000011 11111 111222334443222211100 Q ss_pred ccccccccccceeccCCccc--eEeecCC----cccCchhHHHHHHHHHHH--HHHHHHHHHHHHhcCceeEeecCCCCc Q lcl|NC_018086. 214 PEELEIKDYEVHPNLLQKFP--VLEIIAN----EERLGDFEAQLSLIDAYN--LAVSDSVNDIAYWNDAYLWLQGFDLSA 285 (511) Q Consensus 214 ~~~~~~~~~~~~~~~~g~iP--vv~~~n~----~~g~s~~~~v~~l~d~~~--~~~s~~~~~~~~~~~p~l~~~G~~~~~ 285 (511) ... ..+++ +|| .|.|... ..+.-.+.-+..-+..+| +++-|....-+..+.|-+=+.-.+... T Consensus 213 --~~~------~~~~v-kI~~dAI~y~hSGL~d~~~~~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDVGn 283 (558) T protein:vir:10 213 --QMG------GKNSI-KIAKDSITMCTSGLVDRNKNRVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGN 283 (558) T ss_pred --eec------CCCce-eechhheeeecccceecCCCeeeecchHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCC Confidence 000 00111 222 1122111 001111223444455555 455666677777777755333222211 Q ss_pred cc-----hhhhhhh-h--Cce---------------------eeec--CCCc-eeeeecC--CCHHHHHHHHHHHHHHHH Q lcl|NC_018086. 286 DS-----DSISNMK-N--DRV---------------------IVTD--EDGM-VKFITKD--VNDKHIENIKNRAKLDIF 331 (511) Q Consensus 286 ~~-----~~~~~~~-~--~~~---------------------i~~~--~~~~-~~~~~~~--~~~~~~~~~~~~l~~~i~ 331 (511) .+ .-...+. . +++ +++| +++. .+.-|.+ .+...+. -+.-+.+-+| T Consensus 284 LPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnLgem~-DV~YF~kKLy 362 (558) T protein:vir:10 284 LPKVKAEQYLKEVMSRYRNKLVYDANTGEVRDDRKFMSMMEDFWLPRREGGRGTEITTLPGGQNLGELS-DVDYFQKKLY 362 (558) T ss_pred CCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcchHH-HHHHHHHHHH Confidence 11 1111110 0 011 1121 1222 2222223 2333222 2445555566 Q ss_pred HHhCccccc--c-ccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc--cceeEEeCCC Q lcl|NC_018086. 332 SLSQTPDLV--S-KDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKP--YEVTPVFVRN 406 (511) Q Consensus 332 ~~s~~p~~~--~-~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~--~~i~i~f~~~ 406 (511) +--++|-.- . +.|...-+..|-.-+.....-+.+.+..|..-+.++++.=+ +|...-...+++. ..+.+.|... T Consensus 363 ~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qL-ilKgiit~eeW~~i~~~I~~~f~~D 441 (558) T protein:vir:10 363 RALGVPESRIAAEGGFNLGRSSEILRDELKFAKFVGRLRKRFAAMFNDMLKTQL-VLKNIVTPEDWKTMEDHIQYDFLYD 441 (558) T ss_pred HHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhccCCCHHHHHHHhhcceEEeeec Confidence 666777422 2 22221122344444445567778888888888888876422 2221112222322 3467777443 Q ss_pred CCcCHHHH-------HHHHHHHh---c-cCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHH--HhhccccccCCCCCC Q lcl|NC_018086. 407 LPQSYAEL-------ADMAVKLR---D-MLPDETIINQFPWITDARQEVEKADAQRQKRADIA--LQNFKQTSAVQGAST 473 (511) Q Consensus 407 ~p~d~~e~-------a~~~~~~~---g-~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~--~~~~~~~~~~~~~~~ 473 (511) -.-.+... ++++..+. | .+|.+++++.+-.-+| +|+..++++-+.+.+.. .++........+..+ T Consensus 442 n~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tD--eeI~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~ 519 (558) T protein:vir:10 442 NQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQTD--MEIEEIDTQIEDEIQKGIIPDPSQIDPITGEPLP 519 (558) T ss_pred chHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCH--HHHHHHHHHHHHHHhCCCCCCccccChhhccccC Confidence 33333222 33444443 3 4699999988543343 33333333222222211 111000000111111 Q ss_pred cc----ccccCCCCCCccccccCCCCccccccccCCCCCCC Q lcl|NC_018086. 474 AA----ANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKT 510 (511) Q Consensus 474 ~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (511) .+ ....+..+..+....+.+..+++.+...+ |++- T Consensus 520 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~ 558 (558) T protein:vir:10 520 QEGDPAMEGMGEQPVDPDLEAQAQAVDAQYSKDTK--KAEL 558 (558) T ss_pred ccCCchhccCCCCCcccccccchhhhhhhhhhhhh--hhcC Confidence 11 11111222222222222222222111111 1111 No 228 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=94.86 E-value=0.0033 Score=34.17 Aligned_cols=396 Identities=9% Similarity=-0.012 Sum_probs=151.0 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCcccccCCcCccccc---ccee Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSS-SAYGVLYDYYKGNHIAIQSRTFDDTNKP---NSKI 76 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~yY~G~~~~~~~~~~~~~~~~---~~ri 76 (511) ||--|.+++ .+....- ...+.+-++ .|..- .+. ...+.. ..-+ T Consensus 1 ~~~~~~~~~----------------------------~~~~~~~~~~~~~~~~~-~g~~~---~~~-~~~~~~~~~~~a~ 47 (460) T protein:vir:10 1 MANRIIRAL----------------------------RELTGLDNKFNDAFIKY-IGQTF---TKY-DNNGKTYLEQGYN 47 (460) T ss_pred CchhHHHHH----------------------------hhhhccCCCchHHHHHh-hcccc---CCC-ccchhhhhHHHHh Confidence 333333332 1111000 001111111 12110 000 000000 0011 Q ss_pred ccchHHHHHHHHHhhhhccCceec---Cchh-------------------------------hHHHHHHHHhc-c---Ch Q lcl|NC_018086. 77 VHNFPKLLVDTSTAYLAGEPITES---GDEK-------------------------------TIKAMQPVFKE-N---YV 118 (511) Q Consensus 77 ~~n~~k~ivd~~~~~l~g~~~~~~---~d~~-------------------------------~~~~l~~~~~~-n---~~ 118 (511) ..+..-..|+..++-+-+-|+.+- .+.. ....+..++.+ | .. T Consensus 48 ~~~~v~~~v~~ia~~iA~lp~~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~ 127 (460) T protein:vir:10 48 INPDVYSCISQMAAKTVAVPYTIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTW 127 (460) T ss_pred cchHHHHHHHHHHHhhhhCceEEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCH Confidence 223444556666666666666541 1100 00011122222 2 23 Q ss_pred hHHHHHHHHHHhhCCeEEEEeeeCCC----CceE-EEEEcccceEEEecCCCCCce-EEEEEEEEEeecCCcceEEEEEE Q lcl|NC_018086. 119 TDVNSEEVKLSGIFGHCFEIHWIDRN----KKHR-FKAVSPMNCLIAYSADLDEEP-VAAIYYNTVISDITGHQIRTYEV 192 (511) Q Consensus 119 ~~~~~~~~~~a~~~G~~~~~v~~~~~----g~~~-i~~~~p~~~~~v~d~~~~~~~-~~~v~~~~~~~~~~~~~~~~~~~ 192 (511) ......+..+.+.+|.||+++..+.. |.+. +..++|..+.+..+++..... ...++.|... .++.. .. T Consensus 128 ~~f~~~~~~~lll~Gnay~~i~r~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~--~~g~~----~~ 201 (460) T protein:vir:10 128 ADIYSLYKTYMRLNGNCYFYLMSPDDGINAGVPSQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLI--QGDQF----IE 201 (460) T ss_pred HHHHHHHHHHHhhcCCeEEEEEecCCCccCceeEEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEe--cCcee----EE Confidence 34556677789999999998877543 5543 677888888776554432111 0111111111 11111 12 Q ss_pred EcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_018086. 193 YTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWN 272 (511) Q Consensus 193 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~ 272 (511) +.++.+++++.-... .-.....-.|.|.+..+...+........-....+...+ T Consensus 202 ~~~~evih~r~~~~~--------------------------~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~ 255 (460) T protein:vir:10 202 FNEDEVIHTKYANPN--------------------------FDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGG 255 (460) T ss_pred ecccceEEEecCCCC--------------------------cccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 344444444321100 000000124677777666666665555544444555555 Q ss_pred CceeEeecCCCCccchhhhhh------------hhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccc Q lcl|NC_018086. 273 DAYLWLQGFDLSADSDSISNM------------KNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLV 340 (511) Q Consensus 273 ~p~l~~~G~~~~~~~~~~~~~------------~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~ 340 (511) .|-.++... ....++....+ ..++++.++++.+.+.+........+.+..+...+.|+..-++|..- T Consensus 256 ~~~~i~~~~-~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~ 334 (460) T protein:vir:10 256 VFGFIHGGS-TGLTQPQADSLKQRLTEMDKSPDRLSQIAGASGEIAFTKISLNTDELKPFDYLKYDQKAICNALGWSDKL 334 (460) T ss_pred CcceeeecC-CCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHH Confidence 565544322 11122222211 12346667666565555544445556667778888898888888655 Q ss_pred ccccc--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHH Q lcl|NC_018086. 341 SKDFT--AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMA 418 (511) Q Consensus 341 ~~~~~--~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~ 418 (511) .+... ..+...++... ...+..+|.-++..|...+...-...........|.|+-.......+...+. T Consensus 335 lg~~~~~t~~~sn~e~~~----------~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l~~d~~~~ 404 (460) T protein:vir:10 335 LNNNEGGGLNTGNLEEER----------KRVVTDNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELPEMQTDMVAM 404 (460) T ss_pred hCCCCCCCCccccHHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhhhHHHHHHHH Confidence 44321 11222222211 1222233333333333333221111000111233444221111111222222 Q ss_pred HHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccc Q lcl|NC_018086. 419 VKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTST 488 (511) Q Consensus 419 ~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (511) ..+ .|+++...+++.++.-+-+++-.++. ....+. .+ .++..+...+++.++++ T Consensus 405 ~~~~~~g~~T~NE~R~~~g~~pi~~~~gD~~-----------~~~~n~----~~-~~~~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 405 ASWLNTIPVTPNEIRIAMKYETLNQDGMDIV-----------FMPSNK----VR-IDDVSNNLIDSAFNQNQ 460 (460) T ss_pred HHHHhCCCCCHHHHHHHhCCCCCCCCCCCee-----------eecccc----cc-hhhcccccCCCcccCCC Confidence 222 47788777777765421000000000 000000 00 00000000000000000 No 229 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=94.79 E-value=0.0035 Score=34.05 Aligned_cols=376 Identities=10% Similarity=-0.006 Sum_probs=160.0 Q ss_pred HHHHHHHHHHHH-------HHHHH-----------HHHhcCCCcccc---cCCcCccccc-cce--eccchHHHHHHHHH Q lcl|NC_018086. 34 ITLAEMHSRSSS-------AYGVL-----------YDYYKGNHIAIQ---SRTFDDTNKP-NSK--IVHNFPKLLVDTST 89 (511) Q Consensus 34 ~~~~~~~~~~~~-------~~~~~-----------~~yY~G~~~~~~---~~~~~~~~~~-~~r--i~~n~~k~ivd~~~ 89 (511) +.+++.++.+.. +.+-- -+.+.|-.+... .......... ... +...-....|+..+ T Consensus 1 Mgl~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~ci~~Ia 80 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRETRALRNMAVLRCVTLIS 80 (431) T ss_pred CcchhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcceechhhhhccHHHHHHHHHHH Confidence 223322221100 00000 001111110000 0000000000 000 11122334455555 Q ss_pred hhhhccCceec-Cch----hhHHHHHHHHhc--cCh---hHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEE Q lcl|NC_018086. 90 AYLAGEPITES-GDE----KTIKAMQPVFKE--NYV---TDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLI 159 (511) Q Consensus 90 ~~l~g~~~~~~-~d~----~~~~~l~~~~~~--n~~---~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~ 159 (511) +-+-.-|+.+- .++ .....+..++.. |.. ......+..+.+.+|.||+++..+....+.+..++|..+.+ T Consensus 81 ~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~g~~~~L~pl~~~~v~~ 160 (431) T protein:vir:10 81 GTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSGNRPIRLIPMDRGSAKG 160 (431) T ss_pred HhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCceEEEEEEcCceeEE Confidence 55555576541 111 111234444432 332 24456778889999999999888753334567788888776 Q ss_pred EecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecC Q lcl|NC_018086. 160 AYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIA 239 (511) Q Consensus 160 v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n 239 (511) ..++.. .. . |... ...|..+ .+....+.|++.-. .+ T Consensus 161 ~~~~~~--~~----~-y~~~-~~~g~~~----~~~~~dViHir~~~--------------------------------~d 196 (431) T protein:vir:10 161 RLTSTW--QI----V-YDYT-TPTGDKI----ELPAREVFHLRDLS--------------------------------ID 196 (431) T ss_pred EEcCCC--eE----E-EEEE-eCCceEE----EEchhhEEEecCcC--------------------------------CC Confidence 654332 11 1 1111 1122211 12333333332100 01 Q ss_pred CcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhh------------hhCceeeecCCCce Q lcl|NC_018086. 240 NEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNM------------KNDRVIVTDEDGMV 307 (511) Q Consensus 240 ~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~------------~~~~~i~~~~~~~~ 307 (511) ...|.|.++-....+........-..+.+...+.|-.+++-.. .-.++....+ ..++++.++++.+. T Consensus 197 g~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~ls~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~ 275 (431) T protein:vir:10 197 GVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVPK-ELSDNAYGRMKASVQENHTGSENAGSWMLLEEGATA 275 (431) T ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCC-CCCHHHHHHHHHHHHHHhcCccccCCceecCCCceE Confidence 1246677766655555444444444445555566766554322 1122222211 11356677766666 Q ss_pred eeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 308 KFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEF 387 (511) Q Consensus 308 ~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 387 (511) +.+........+.+..+.....|+..-++|..-.+.....++..++..... .+..+|.-.++.|...++. T Consensus 276 ~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~----------f~~~tL~P~~~~ie~~ln~ 345 (431) T protein:vir:10 276 KQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTSWGSGIEQLAIF----------FIQYGLSHWFVSWEQAAAR 345 (431) T ss_pred EEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCCccccHHHHHHH----------HHHHHHHHHHHHHHHHHHh Confidence 666555555566666777788899999998765554333333333322222 2222333333333332221 Q ss_pred cCC-CccccccceeEEeCCCCCcCHHHHHHHHHHHh--c----cCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 388 MNK-AKDLKPYEVTPVFVRNLPQSYAELADMAVKLR--D----MLPDETIINQFPW--ITDARQEVEKADAQRQKRADIA 458 (511) Q Consensus 388 ~~~-~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~--g----~~s~et~~~~l~~--v~d~~~E~~ri~~E~~~~~~~~ 458 (511) .-- ........+++.+..-+..|..+.++.+.++. | +++.-.++.+++. ++++... +. T Consensus 346 ~Ll~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~~~gD--~~----------- 412 (431) T protein:vir:10 346 AFLPEKMLGQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQNEVREMLDLPRADDPVAD--QL----------- 412 (431) T ss_pred hccChhhcCCceEEEechhhhccCHHHHHHHHHHHHhcccccCccCHHHHHHHhCCCCCCCcccc--ce----------- Confidence 110 01112233555556666778889998888764 2 4677667776644 2222211 00 Q ss_pred HhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCCCC Q lcl|NC_018086. 459 LQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKT 510 (511) Q Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (511) ....+... ...+++. ++.| T Consensus 413 ~~p~n~~~--------------~~~~~~~-------------------p~~~ 431 (431) T protein:vir:10 413 RNPMTQKQ--------------KGSGDEP-------------------PATT 431 (431) T ss_pred eccccccc--------------CCCCCCC-------------------CCCC Confidence 00000000 0000000 1111 No 230 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=94.59 E-value=0.004 Score=33.73 Aligned_cols=366 Identities=9% Similarity=0.000 Sum_probs=155.6 Q ss_pred CccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHH Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPK 82 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k 82 (511) |+|..-+.+....... .. .-+.....+.... ....... ..=+...-.. T Consensus 1 MGl~~~~~~~~~~~~~-----------~~----------------~~~~~~~~~~~~~--~~~~vt~---~~al~~~~v~ 48 (394) T protein:vir:62 1 MGLRDRFSNYLFKKAE-----------KR----------------GYLDNVLGKSIRY--SGVYVTD---SNILQSSDVY 48 (394) T ss_pred CchhhhhhhhccCCCC-----------ch----------------hhhhhhhhccccc--CccccCh---hhhhccHHHH Confidence 4444333322111100 00 0011111111100 0000000 0012234455 Q ss_pred HHHHHHHhhhhccCceecC-c--hhhHHHHHHHHhc-cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEccc Q lcl|NC_018086. 83 LLVDTSTAYLAGEPITESG-D--EKTIKAMQPVFKE-NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPM 155 (511) Q Consensus 83 ~ivd~~~~~l~g~~~~~~~-d--~~~~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~ 155 (511) ..|+..++-+-.-|+.+-. + ......+..++.+ |. .......+..+.+.+|.+|+++..+..+ . +. T Consensus 49 ~~i~~Ia~~iA~lp~~v~~~~g~~~~~~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~-----~--~~ 121 (394) T protein:vir:62 49 ELLQDISNQMVLADIVVEDEFGNEIKDDIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIH-----L--AS 121 (394) T ss_pred HHHHHHHHhhcccceEEEcCCCcccchhhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecceee-----c--cc Confidence 6667666666666776521 1 1112223344433 32 3355667888899999999987432211 1 12 Q ss_pred ceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceE Q lcl|NC_018086. 156 NCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVL 235 (511) Q Consensus 156 ~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 235 (511) .+.+..++.. +.+|.. .+ ..+.++.+ + T Consensus 122 ~~~~~~~~~~-------~~~~~~----~~------~~~~~~ei------------------------------------i 148 (394) T protein:vir:62 122 NVFTELDDNL-------VEHFNI----GG------HEIPPCMI------------------------------------R 148 (394) T ss_pred cceEEECCce-------EEEEee----CC------EEechhhe------------------------------------E Confidence 2333332210 111110 01 01222222 2 Q ss_pred eecC----CcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecC-CCCccchh----hhhh--------hhCce Q lcl|NC_018086. 236 EIIA----NEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGF-DLSADSDS----ISNM--------KNDRV 298 (511) Q Consensus 236 ~~~n----~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~-~~~~~~~~----~~~~--------~~~~~ 298 (511) +++. .-.|.|.+..+...++.......-....+...+.|-.+++-. ....+++. ...+ ..+++ T Consensus 149 h~r~~~~d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~ 228 (394) T protein:vir:62 149 HVKNIGADHLRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILDQLESIDEARSV 228 (394) T ss_pred EecCcCCCCccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHHHhccccccCce Confidence 3221 124667777666666655555555555556666775555421 11111111 1111 11345 Q ss_pred eeecCCCceeeeecCC--CHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 299 IVTDEDGMVKFITKDV--NDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAK 376 (511) Q Consensus 299 i~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~ 376 (511) +.++.+.+.++..... ....+.+..+...+.|+..-++|..-.+....++.+ +.....+..+|.- T Consensus 229 ~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~sn~e-------------~~~~~~~~~~l~P 295 (394) T protein:vir:62 229 KMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELIKEDIE-------------KAMMYIHNKAVRP 295 (394) T ss_pred eEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcCHH-------------HHHHHHHHHHHHH Confidence 5667777777655443 334455566777888888888886555433222211 1112333444444 Q ss_pred HHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_018086. 377 RYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR--DMLPDETIINQFPWITDARQEVEKADAQRQKR 454 (511) Q Consensus 377 ~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~--g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~ 454 (511) +++.|...+...--... ....+.+.|+.....+....++++.++. |+++...++.+++.-+-+.....++-.. T Consensus 296 ~~~~ie~~l~~kll~~~-~~~~~~~~fd~~~~~~~~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~gd~~~~~---- 370 (394) T protein:vir:62 296 IMKNFEDHLSLLFYAQN-SGKRIKFKINILDFVTYSNKTNIGYNLVRTAITSPDNVADMLGFPKQNTKESQAIYIS---- 370 (394) T ss_pred HHHHHHHHHhhhhcCcc-ccCceEEEechhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeecc---- Confidence 44444444432111111 1234678887777777777888887774 7888888888876532111111111000 Q ss_pred HHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccc Q lcl|NC_018086. 455 ADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEK 502 (511) Q Consensus 455 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (511) .++ .+.....+.+ .+..+++..++ T Consensus 371 -----~n~------~~~~~~~~~~-------------~~~kgge~~en 394 (394) T protein:vir:62 371 -----NDV------TEIGKKEATD-------------GSLGGGEENEN 394 (394) T ss_pred -----ccc------cccccccccc-------------ccCCCCCCCCC Confidence 000 0000000000 00111111111 No 231 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=94.44 E-value=0.0044 Score=33.49 Aligned_cols=468 Identities=9% Similarity=0.033 Sum_probs=173.8 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHH-------------HHHHHHHHHHHHHHHHHHhcCCCcccccCCcC Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELIT-------------LAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFD 67 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~-------------~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~ 67 (511) .|-.|.+.- +..........+ ++ . ...+.. .......-+.+|+.+-.+++-+. T Consensus 5 fgf~i~~~~-~~~~~S~vpp~~-~~-~--~~~i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~~pEVd~--------- 70 (564) T protein:vir:10 5 FGFLINEKE-GQKGQSPVPPND-EA-S--VSTVAGGYFGTYVDTSGGQNSRNEYELIRRYRDMSLHPEVDS--------- 70 (564) T ss_pred hcceeeeec-cCCCCCcccCCc-CC-C--hhhhhccccceeeecccccchhhHHHHHHHHHHHhhccchhh--------- Confidence 333443321 111111000000 00 0 000100 00011112223333333333222 Q ss_pred ccccccceeccchHHHHHHHHHhh-hhccCceec---------CchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEE Q lcl|NC_018086. 68 DTNKPNSKIVHNFPKLLVDTSTAY-LAGEPITES---------GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFE 137 (511) Q Consensus 68 ~~~~~~~ri~~n~~k~ivd~~~~~-l~g~~~~~~---------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~ 137 (511) -...||+..+-+ -..+|+.+. ..++..+.+..++.--+|+....+..|.+.+.|+.|. T Consensus 71 ------------Av~eIVneaIv~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~f 138 (564) T protein:vir:10 71 ------------AIDEIVNEFVVNDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHY 138 (564) T ss_pred ------------HHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEE Confidence 122233322111 112233332 1233455677777778899999999999999999998 Q ss_pred EeeeC----CCCceEEEEEcccceEEEecCCCCCc--eEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccc Q lcl|NC_018086. 138 IHWID----RNKKHRFKAVSPMNCLIAYSADLDEE--PVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYR 211 (511) Q Consensus 138 ~v~~~----~~g~~~i~~~~p~~~~~v~d~~~~~~--~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 211 (511) +.-.| ++|-..+..+||+.+-.|+..-++.. -...++-+..... .+ .....-+|.+.. | .+..... T Consensus 139 Hkiid~~~pk~GI~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~-y~-~~~Eyy~Ynp~~---~---~g~~~~~ 210 (564) T protein:vir:10 139 HKVIDLDNPKKGILELRYIDSLKIRKVRQKLKDVDPNRKEIEKGTALQYD-YG-DFIEYYIYNPKG---F---AGNIPMV 210 (564) T ss_pred EEEeeCCChhhhhhhhhhhcccceeeeeeeccccccccceeeeeeeeecc-cc-ccccceeecccc---c---cCccccc Confidence 76554 24656788899998877764322110 0111111100000 00 000111222210 0 0000000 Q ss_pred ccccccccccccceeccCCccceEee----cCCcccCchhHHHHHHHHHHH--HHHHHHHHHHHHhcCceeEeecCCCCc Q lcl|NC_018086. 212 EIPEELEIKDYEVHPNLLQKFPVLEI----IANEERLGDFEAQLSLIDAYN--LAVSDSVNDIAYWNDAYLWLQGFDLSA 285 (511) Q Consensus 212 ~~~~~~~~~~~~~~~~~~g~iPvv~~----~n~~~g~s~~~~v~~l~d~~~--~~~s~~~~~~~~~~~p~l~~~G~~~~~ 285 (511) . ....+.......-+-..|+.++. .+... .+.-+..-+..+| +++-|....-+..+.|-+=+.-.+... T Consensus 211 -~-~~~~~~~~~~ikI~~daI~y~hSGL~d~~~~~---i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDVGn 285 (564) T protein:vir:10 211 -T-GSMDWSNQEGIKIASDAIAQSTSGLMDLNKKM---TLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDVGN 285 (564) T ss_pred -c-cccccccccceeechhhcceecccceeCCCCc---eeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCC Confidence 0 00000000111111111222221 01111 2233444455555 455666777777777755333222211 Q ss_pred cc-----hhhhhhh-h--Cce---------------------eeec--CCCc-eeeeecC--CCHHHHHHHHHHHHHHHH Q lcl|NC_018086. 286 DS-----DSISNMK-N--DRV---------------------IVTD--EDGM-VKFITKD--VNDKHIENIKNRAKLDIF 331 (511) Q Consensus 286 ~~-----~~~~~~~-~--~~~---------------------i~~~--~~~~-~~~~~~~--~~~~~~~~~~~~l~~~i~ 331 (511) .+ .-...+. . +++ +++| +++. .+.-|.+ .+...+.. +.-+.+-+| T Consensus 286 LPk~KAeqYlr~iM~k~KNklVYDa~TGevrddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~D-V~YF~kKLY 364 (564) T protein:vir:10 286 LPKVKAEQYLRDVMSRYRNKLVYDGQTGEIRDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELKD-VEYFKKKLY 364 (564) T ss_pred CCchhHHHHHHHHHHhcCceEEEeccCceecccchhhhhHhhhcccccCCCcccceeeccccCCcchHHH-HHHHHHHHH Confidence 11 1111110 0 011 1121 1222 2222222 23333222 445555666 Q ss_pred HHhCcccc--cccc--ccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc--cceeEEeCC Q lcl|NC_018086. 332 SLSQTPDL--VSKD--FTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKP--YEVTPVFVR 405 (511) Q Consensus 332 ~~s~~p~~--~~~~--~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~--~~i~i~f~~ 405 (511) +--.+|-. ..++ |.---+..|-.-+.....-+.+.+..|..-+.++++.=+ +|...-...+++. ..|.+.|.. T Consensus 365 ~aLnVP~SRl~~e~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qL-iLKgiit~eeW~~i~~~I~~~f~~ 443 (564) T protein:vir:10 365 NSLNLPPSRLTDDNKAFNLGKSTEILRDELKFTKFIGRLRKRFAQLFHDILKTQL-ILKGIITPEDWDDMEEHIQYDFLF 443 (564) T ss_pred HHhCCCcccccCCCceeecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhccCCCHHHHHHHhhcceEEeee Confidence 66677742 2221 211112234444444567777888888888888876422 2221112222322 346777744 Q ss_pred CCCcCHHHH-------HHHHHHH---hc-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHH--HHHhhccccccCCC Q lcl|NC_018086. 406 NLPQSYAEL-------ADMAVKL---RD-MLPDETIINQFPWITD--ARQEVEKADAQRQKRAD--IALQNFKQTSAVQG 470 (511) Q Consensus 406 ~~p~d~~e~-------a~~~~~~---~g-~~s~et~~~~l~~v~d--~~~E~~ri~~E~~~~~~--~~~~~~~~~~~~~~ 470 (511) .-.-.+... ++++..+ +| .+|.+++++.+-.-+| +.++-+.|++|..+..- ....++.+.....+ T Consensus 444 Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~~~ 523 (564) T protein:vir:10 444 DNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEKQN 523 (564) T ss_pred cchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCccCCC Confidence 333333322 3344444 23 4799999988543343 33334444444332110 00011111111111 Q ss_pred CC--CccccccCCCCCCccccccCCCCccccccccCCCCCC Q lcl|NC_018086. 471 AS--TAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPK 509 (511) Q Consensus 471 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (511) +. +...+-.++.+.........+-..+.+++++..++-| T Consensus 524 ~~~~p~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~ 564 (564) T protein:vir:10 524 QAFAPELQAAQDDLAAEREIKKLNSAPKPPPSQQSKSQSNK 564 (564) T ss_pred CcCCcchhhhccccccccChhhhccCCCCCCCCCCcCcCCC Confidence 10 0000000111100000000111112223333333333 No 232 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=94.30 E-value=0.0048 Score=33.30 Aligned_cols=393 Identities=10% Similarity=0.019 Sum_probs=164.6 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccc----ee Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNS----KI 76 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~----ri 76 (511) |-|..- ...++....... ..........-|. . .+...++. ........ ++ T Consensus 1 ~~~~~~-------------------~~p~~~~~~~~~----~~~~~~~~~~g~~-~-~D~~lr~~-gg~~~~~~~l~~~m 54 (446) T protein:vir:98 1 MNMEVR-------------------NAPTPAIRRRTI----YAMEHLGLATSYL-S-EDGGYKRA-GKPTYQQLSAWDEA 54 (446) T ss_pred Cccccc-------------------CCCchhhhhhhh----hccccchhhcccC-C-cchHhhhc-CCChHHHHHHHHHH Confidence 211110 011111111111 1111112222222 1 11111111 10000000 11 Q ss_pred --ccchHHHHHHHHHhhhhccCceec-CchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEE-EEeeeCCCCc-eEEEE Q lcl|NC_018086. 77 --VHNFPKLLVDTSTAYLAGEPITES-GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCF-EIHWIDRNKK-HRFKA 151 (511) Q Consensus 77 --~~n~~k~ivd~~~~~l~g~~~~~~-~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~-~~v~~~~~g~-~~i~~ 151 (511) ...+..-.+.+...-+.+-++++. ++++..+.+.+.+.+-.+...... ..++..||.++ +++|.-..|. .-.++ T Consensus 55 ~e~D~~v~s~l~~Rk~av~~~~w~V~p~~~~~a~~v~~~l~~~~~~~~~~~-~ldai~~G~s~~Eivw~~~~g~~~p~~~ 133 (446) T protein:vir:98 55 AQTEPIIAQGLDSIALSVLNKVGPYQHGDKRIKKFIDDQLRNRAKTWISHC-VKSIMTYGFSLSEQIYAHGARDNMPATV 133 (446) T ss_pred HhcchHHHHHHHHHHHHhhcCCceecCccHHHHHHHHHHHhhcCchhHHHH-HHHHHhhCceeeeEEEeecccccccchh Confidence 245666666666667777787774 556677788888887777655544 67888899765 5666533332 11111 Q ss_pred E------cccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccce Q lcl|NC_018086. 152 V------SPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVH 225 (511) Q Consensus 152 ~------~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (511) + .|...--.++.... ...+.. .....+ ..-.|.+-..+++ . .+........+.. T Consensus 134 ~d~~~~~~~~~~r~~~~~~~~--~~~~~~-------~~~~~~-~~~~~~~~~~~~~--~--------~~~~~~~~~g~~~ 193 (446) T protein:vir:98 134 LDDIVNYHPLQVMLIANDNGR--IVDGDT-------VTASQY-KSGYWVPLPPYRI--G--------DPPKKVDVVGSHV 193 (446) T ss_pred hccccccccccceeeeccCCc--cccccc-------cchhhc-ccccccCcccchh--h--------hhhhhcccCcccc Confidence 1 11111001111100 000000 000000 0000000000000 0 0000000000111 Q ss_pred eccCCccceEeec---CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeec---CCCCccc--h--------- Q lcl|NC_018086. 226 PNLLQKFPVLEII---ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQG---FDLSADS--D--------- 288 (511) Q Consensus 226 ~~~~g~iPvv~~~---n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G---~~~~~~~--~--------- 288 (511) +-|..+.=++.+. .++.|.|.+..+.-.---=+..+-+++..++.|..|+++.+- ....+.. + T Consensus 194 ~iP~~kfi~~~~~~~~~~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~ 273 (446) T protein:vir:98 194 RLPSHKRLFINYNTKGNNPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIA 273 (446) T ss_pred cccccceEEEEecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHH Confidence 1122222122221 246788877765444444466777888899999999998773 2211110 0 Q ss_pred --hhhhh---hhCceeee-----cCCCceeeeecCCCH-HHHHHHHHHHHHHHHHHhCccccccccc--cCccHHHHHHH Q lcl|NC_018086. 289 --SISNM---KNDRVIVT-----DEDGMVKFITKDVND-KHIENIKNRAKLDIFSLSQTPDLVSKDF--TAASGQALKAA 355 (511) Q Consensus 289 --~~~~~---~~~~~i~~-----~~~~~~~~~~~~~~~-~~~~~~~~~l~~~i~~~s~~p~~~~~~~--~~~Sg~Ai~~~ 355 (511) -...+ ....+..+ |++..+++++..... ..++.+++.+.+.|...--...++.+.. ++.|...-+.. T Consensus 274 ~~L~~av~~~~~da~~ii~~~~~P~g~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh 353 (446) T protein:vir:98 274 EQAEDALRRLSTDSGLVLTQLSKEQPVQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQ 353 (446) T ss_pred HHHHHHHHhccccceeeeecccCCCCceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchhhhHHHH Confidence 11111 12223333 777788888765543 4688888988888877654443333221 12222111111 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCcc--ccccceeEEeCCCCCcCHHHHHHHHHHHh--cc-CC--h Q lcl|NC_018086. 356 TQPLENKSAVKESKFRKVLA-KRYELVCSYLEFMNKAKD--LKPYEVTPVFVRNLPQSYAELADMAVKLR--DM-LP--D 427 (511) Q Consensus 356 ~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~--~~~~~i~i~f~~~~p~d~~e~a~~~~~~~--g~-~s--~ 427 (511) ..-....++.-.+.+...+. ++++-++. .+..... .....-.+.|....+.|....++++.+++ |+ ++ . T Consensus 354 ~~V~~d~~~aDa~~i~~tln~~Li~~l~~---lNf~~~~~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~~~ 430 (446) T protein:vir:98 354 LELFDGKINSIFDTVIHAFTEQVIGNLIR---LNFDPALYPLASNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDGDK 430 (446) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hCCCccccccccccccceeccCChhhHHHHHHHHHHHHhCCccccccH Confidence 11112222333344444443 34444433 2222111 11111234566667788888899988885 54 44 4 Q ss_pred HHHHHhCCCCCCHHHHH Q lcl|NC_018086. 428 ETIINQFPWITDARQEV 444 (511) Q Consensus 428 et~~~~l~~v~d~~~E~ 444 (511) +.+.+.++. ++.+..- T Consensus 431 ~~ire~~gi-P~~~~~~ 446 (446) T protein:vir:98 431 DHIRSITGL-PDAISST 446 (446) T ss_pred HHHHHHhCc-CCCCCCC Confidence 456666653 2211111 No 233 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=94.21 E-value=0.0051 Score=33.17 Aligned_cols=302 Identities=11% Similarity=-0.044 Sum_probs=118.8 Q ss_pred EEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccc Q lcl|NC_018086. 135 CFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIP 214 (511) Q Consensus 135 ~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 214 (511) .++++|.-.+|...+..+.+.. + .. +.+|. +...+....+.... T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~------~------~~-~~~f~--------------~~~~~~l~~~~~~~--------- 44 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRP------P------RT-ISRFD--------------VAPDGGLVAIEQWG--------- 44 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecC------c------cc-eeeee--------------eccCCceeEEEecC--------- Confidence 4444443333322222111110 0 00 00111 11111111111000 Q ss_pred ccccccccccee-ccCCccceEee--cCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCcc----c Q lcl|NC_018086. 215 EELEIKDYEVHP-NLLQKFPVLEI--IANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSAD----S 287 (511) Q Consensus 215 ~~~~~~~~~~~~-~~~g~iPvv~~--~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~----~ 287 (511) ..+ .+..+ .+.+.|-.++- ..++.|.|.+..+.-..--=+..+.+++..++.|..|+.+.+|-..... . T Consensus 45 -~~g---~~~~~lp~~kfi~~~~~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~ 120 (355) T protein:vir:78 45 -VFG---KATVRIPVDRLVVFVNEREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDT 120 (355) T ss_pred -CCC---CCcceeccCCEEEEEeCCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchh Confidence 000 00011 11122212211 1356788888876555544566778888899999888888777432111 0 Q ss_pred -----------hhh----hhhhh--CceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHH Q lcl|NC_018086. 288 -----------DSI----SNMKN--DRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQ 350 (511) Q Consensus 288 -----------~~~----~~~~~--~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~ 350 (511) +.. .++.. ..+..++.+.++++++.......+...++...+.|...--..-++.+..++.+.- T Consensus 121 ~~~~~~~~~~~~~l~~~~~~i~~g~~a~~iip~g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~ 200 (355) T protein:vir:78 121 ARAEQWLNDQKEEGLQLAKEFRAGEAAGGYIPHGANFTLTGVQGKLPEMDGPIRYHDEQIARAVLAHFLTLGGDKSTGSY 200 (355) T ss_pred hhHHHHHHHHHHHHHHHHHHhhCCcceeEeecCCceEEEeecCCCcccHHHHHHHHHHHHHHHHhhhhhccccCCccchh Confidence 001 11111 2456788888999998777666677788888888776654443443322111111 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHh--cc-C Q lcl|NC_018086. 351 AL-KAATQPLENKSAVKESKFRKVLA-KRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR--DM-L 425 (511) Q Consensus 351 Ai-~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~--g~-~ 425 (511) |+ .....-....++.-.+.+...+. ++++.++.+ +.... ..-..+.|.. .+.+....++.+.++. |+ + T Consensus 201 Alg~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~l----N~~~~--~~~P~~~~~~-~~~~~~~~a~~~~~l~~~G~~~ 273 (355) T protein:vir:78 201 ALGDTFASFFTGSLNAVMKHIADVTQQHVVEDLVDQ----NWGPE--EPAPRLVPAQ-LGKEQPVTAEAIRALVECGAFT 273 (355) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCCCC--CCCCEEEecC-cChhHHHHHHHHHHHHhCCCcc Confidence 21 11122223333333445555553 355544432 22111 1123566754 4456667788887774 44 4 Q ss_pred ChH----HHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCcccccc Q lcl|NC_018086. 426 PDE----TIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQE 501 (511) Q Consensus 426 s~e----t~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (511) +.+ .+.+.++. +.+.+.- ..........+...+...+++. ..+.+... T Consensus 274 ~~~~~~~~~~e~~gi-p~p~~~~------------------------~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~a 325 (355) T protein:vir:78 274 ADPELEKDLRARYGL-PAPAERD------------------------DGADAAAAKAAGRRRAKRLPGQ---RQGAALPS 325 (355) T ss_pred ccHHHHHHHHHHhCC-CCCCCCC------------------------cccCCccccccccccccccCCc---cccccccc Confidence 432 33444442 2111000 0000000000000000000000 00000000 Q ss_pred c-cCCCCCC------------------CC Q lcl|NC_018086. 502 K-AIQKKPK------------------TD 511 (511) Q Consensus 502 ~-~~~~~~~------------------~~ 511 (511) + ...+.+. +| T Consensus 326 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 354 (355) T protein:vir:78 326 RSPRADPPRRRGPLRRRPRHPAHRRCAPD 354 (355) T ss_pred cCCCCCChhhhHHHHHHhhccccCCCCCC Confidence 0 0011111 11 No 234 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=93.62 E-value=0.0069 Score=32.43 Aligned_cols=370 Identities=10% Similarity=-0.011 Sum_probs=144.8 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccch Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNF 80 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~ 80 (511) || |. ..+ .+.+.--++..+.++ .+-......+......-....- T Consensus 1 mg--~~----------------------------~~~---~~~~~~~~~~~~~~~---~~~~~~~~~~~~t~~~~~~~~~ 44 (403) T protein:vir:10 1 MG--FK----------------------------SWI---TEKLNPGQRIIRDME---PVSHRTNRKPFTTGQAYSKIEI 44 (403) T ss_pred Cc--ch----------------------------hhh---hhccchhhhhhhccc---ccccccCCcccccHHHHHHHHH Confidence 11 11 100 100000011111111 1000000000000000011111 Q ss_pred HHHHHHHHHhhhhccCceecC------ch--hhHHHHHHHHhc--cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce Q lcl|NC_018086. 81 PKLLVDTSTAYLAGEPITESG------DE--KTIKAMQPVFKE--NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH 147 (511) Q Consensus 81 ~k~ivd~~~~~l~g~~~~~~~------d~--~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~ 147 (511) ....|+..++-+..-|+++.. +. .....+..++.. |. .......+..+.+.+|.||++.. . . T Consensus 45 v~~cv~~Ia~~ia~~p~~v~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~--~--~- 119 (403) T protein:vir:10 45 LNRTANMVIDSAAECSYTVGDKYNIVTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWD--G--T- 119 (403) T ss_pred HHHHHHHHHHHHhhCceeEeecccccccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEe--C--c- Confidence 223344444444445554321 11 011224444432 32 23555667778889999997652 1 1 Q ss_pred EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceec Q lcl|NC_018086. 148 RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPN 227 (511) Q Consensus 148 ~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (511) .+..++|..+.+..+. .. . ++++. . +..+ .|..+.+.++. T Consensus 120 ~l~~l~~~~~~v~~~~-~~--~---~~~~~-~----~~~~----~~~~~eiih~~------------------------- 159 (403) T protein:vir:10 120 SLYHVPAALMQVEADA-NK--F---IKKFI-F----NNQI----NYRVDEIIFIK------------------------- 159 (403) T ss_pred eeEeecCcceEEEEcC-Cc--e---EEEEE-e----cCce----eecccceEEec------------------------- Confidence 2344555544332221 10 0 11110 0 0000 01112222221 Q ss_pred cCCccceEeec-CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhh------------ Q lcl|NC_018086. 228 LLQKFPVLEII-ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMK------------ 294 (511) Q Consensus 228 ~~g~iPvv~~~-n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~------------ 294 (511) ...+++.. +...|.|.+..+...++....+..-..+.+...+.|-.+++... .-.++....++ T Consensus 160 ---~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~l~~e~~~~~~~~~~~~~~g~~n 235 (403) T protein:vir:10 160 ---DNSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDE-ILNKKLRERKQEELQLDYNPSTG 235 (403) T ss_pred ---ccccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCHHHHHHHHHHHHHHhCCccc Confidence 11111111 23457777777777777666666555556666666766666422 12222222211 Q ss_pred hCceeeecCCCceeeeecCCC--HHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 295 NDRVIVTDEDGMVKFITKDVN--DKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRK 372 (511) Q Consensus 295 ~~~~i~~~~~~~~~~~~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~ 372 (511) .++++.++++-+.+.+....+ ...+.+..+...+.|+..-++|....+....++.+. .....+.. T Consensus 236 ~g~~~vl~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~sn~e~-------------~~~~f~~~ 302 (403) T protein:vir:10 236 QSSVLILDGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGGNNANIRP-------------NIELFYYM 302 (403) T ss_pred CcceeecCCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcCHHH-------------HHHHHHHH Confidence 133566766556655543333 334455667778888888888865543222222111 11222333 Q ss_pred HHHHHHHHHHHHHHhcCCCccccccceeEEeCCC--CCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHH Q lcl|NC_018086. 373 VLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRN--LPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKAD 448 (511) Q Consensus 373 ~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~--~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~ 448 (511) +|.-.++.|...+...- ...+.+.+..- +-.|..+.++++.++ .|+++...++..++.-+-+++...+. T Consensus 303 tl~P~~~~ie~~l~~~L------~~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~~~~~d~~- 375 (403) T protein:vir:10 303 TIIPMLNKLTSSLTFFF------GYKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLDDEQMNKI- 375 (403) T ss_pred HHHHHHHHHHHHHHHhc------CceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCccccccc- Confidence 34444443333333211 11233334322 445777888888777 48899988988887543111111110 Q ss_pred HHHHHHHHHHHhhccccccCCCCCCccc-cccCCCCCC Q lcl|NC_018086. 449 AQRQKRADIALQNFKQTSAVQGASTAAA-NKLDKNPAN 485 (511) Q Consensus 449 ~E~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 485 (511) .-..+......+....++ ++.+...|+ T Consensus 376 ----------~~p~n~~~~~~~~~~~e~~~~~~~~~g~ 403 (403) T protein:vir:10 376 ----------RIPANVAGSATGVSGQEGGRPKGSTEGD 403 (403) T ss_pred ----------ccccccccccccCCCCcCCCCCCCcCCC Confidence 000000000000000000 000000111 No 235 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=93.26 E-value=0.0082 Score=32.02 Aligned_cols=412 Identities=8% Similarity=-0.028 Sum_probs=164.5 Q ss_pred CCHHHHHHHHH--HHHHHHHHHH-----HHHHHhcC-CC-cccccCCcCccccccc-ee--ccchHHHHHHHHHhhhhcc Q lcl|NC_018086. 28 FDLRELITLAE--MHSRSSSAYG-----VLYDYYKG-NH-IAIQSRTFDDTNKPNS-KI--VHNFPKLLVDTSTAYLAGE 95 (511) Q Consensus 28 ~~~~~l~~~~~--~~~~~~~~~~-----~~~~yY~G-~~-~~~~~~~~~~~~~~~~-ri--~~n~~k~ivd~~~~~l~g~ 95 (511) |+ +.+. ..+-++.+.- -....|+- .+ +.+.. .....-+ .+ ...+..-.+++....+.+. T Consensus 1 ~~-----~~~~~~~p~~~~g~~~~~~~~~~~~~~~~~e~~~~lr~----~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~ 71 (469) T protein:vir:10 1 MT-----ERVKTAAPVSEAGYVFGSGVVDGWTVWDPFEQTPELQW----PQSVAVYSRMDNEDSRVTSLLEAISLPIRST 71 (469) T ss_pred CC-----CcccCCCCccchhhhhhcccccchhhcccccccccccc----ccchHHHHHHHhhChHHHHHHHHHHHHHhcC Confidence 11 1111 1111111100 00011110 00 00000 0000000 12 2566666777777778888 Q ss_pred Cceec---CchhhHHHHHHHHh-----------------ccChhHHHHHHHHHHhhCCeEE-EEeeeCC----CCceEEE Q lcl|NC_018086. 96 PITES---GDEKTIKAMQPVFK-----------------ENYVTDVNSEEVKLSGIFGHCF-EIHWIDR----NKKHRFK 150 (511) Q Consensus 96 ~~~~~---~d~~~~~~l~~~~~-----------------~n~~~~~~~~~~~~a~~~G~~~-~~v~~~~----~g~~~i~ 150 (511) ++++. .+++..+.+.+.+. ...+...+.++..++..||.++ +++|... +|...+. T Consensus 72 ~w~v~p~~~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~~~~~ 151 (469) T protein:vir:10 72 PWRIRANGASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGRFWLR 151 (469) T ss_pred CceEecCCCCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCceeee Confidence 88874 23333333333221 1134556666677788899765 5676422 3555444 Q ss_pred EEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCC Q lcl|NC_018086. 151 AVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQ 230 (511) Q Consensus 151 ~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 230 (511) .+.+..-- . +.+|.... ++..+ .++-..+..-.... ...... ......+.. T Consensus 152 ~l~~rp~~----------~---i~~~~~~~--~~~l~-~~~~~~~~~~~~~~-----------~~~~~~--~~~~lp~~k 202 (469) T protein:vir:10 152 KLAPRPQW----------T---ISKFNVAP--DGGLE-SIEQIAPPARTRGS-----------LYVANI--APPEIPVNR 202 (469) T ss_pred eeeecCcc----------c---ceeeeecc--CCcee-eeeecCcccccccc-----------cccCCC--CccccccCc Confidence 43322100 0 00011111 01000 00000000000000 000000 000001111 Q ss_pred ccceEeec--CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccch------hhhhhh--hCceee Q lcl|NC_018086. 231 KFPVLEII--ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSD------SISNMK--NDRVIV 300 (511) Q Consensus 231 ~iPvv~~~--n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~------~~~~~~--~~~~i~ 300 (511) .|-.++-. .++.|.|.+..+....--=+..+.+++..++.|+.|+++.+--....+++ ...++. ...++. T Consensus 203 ~i~~~~~~~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~~a~~~ek~~l~~a~~~~~~g~~a~~i 282 (469) T protein:vir:10 203 LVVYTRNKRPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASSATDEDEVRKMAALARSVRGGINAGVG 282 (469) T ss_pred EEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCCCCCHHHHHHHHHHHHHHhcCCceEEE Confidence 22222211 35678888887666555555678889999999999999877533222211 112222 233566 Q ss_pred ecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_018086. 301 TDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLA-KRYE 379 (511) Q Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~ 379 (511) ++++.++++++...+...+...++.+.+.|...--..-++.+..|++.|.+ .....-....++.-.+.+...+. ++++ T Consensus 283 ip~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~iLG~tlTs~~~gGS~a~~-~vh~ev~~d~~~sDa~~i~~tln~~li~ 361 (469) T protein:vir:10 283 LAQGQILELLGVSGNLPDIRRAIEGHDRSIALSGLAHFLNLDGKGGSYALA-SVLEDPFTQAVHAYATSICRIANQHIIE 361 (469) T ss_pred ccCCceEEEeecCCCchHHHHHHHHHHHHHHHHHhcccccccCccchhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 888999999988877788889999988888776644444433222221111 11111222233333344555553 3444 Q ss_pred HHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHh--cc-----CChHHHHHhCCCCCCHHHHHHHHHHHHH Q lcl|NC_018086. 380 LVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR--DM-----LPDETIINQFPWITDARQEVEKADAQRQ 452 (511) Q Consensus 380 li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~--g~-----~s~et~~~~l~~v~d~~~E~~ri~~E~~ 452 (511) -++.+ +... +..-..+.|.... .+....++.+++++ |+ ++.+.+.+.++. +.+..+..-+..++. T Consensus 362 ~l~~l----N~g~--~~~~P~~~~~~~e-~~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gi-p~~~~~~~~~~~~~~ 433 (469) T protein:vir:10 362 DLVDI----NFGV--DTPAPVLTFDPIG-SRQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNL-PSELNDTPSAEPEEP 433 (469) T ss_pred HHHHh----cCCC--CCCccEEEecCCC-CcHHHHHHHHHHHHhcCCccCccccHHHHHHHhCC-CCCCCCcccccchhc Confidence 43332 2111 1112456675433 45566777777764 44 334455566553 211111000000000 Q ss_pred HHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCC-CCCCCC Q lcl|NC_018086. 453 KRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQ-KKPKTD 511 (511) Q Consensus 453 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 511 (511) . ..+...++..+. .....+.....+... .-.-+| T Consensus 434 ~-----------------------~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~l~d 468 (469) T protein:vir:10 434 A-----------------------AVPNQSAAPART--RSSGNADARARAPKADQGVLFD 468 (469) T ss_pred c-----------------------cCCCCCcccccc--CCCCCcccccccCCChHHhhcc Confidence 0 000000000000 000000000000000 000111 No 236 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=92.81 E-value=0.0099 Score=31.57 Aligned_cols=367 Identities=8% Similarity=-0.047 Sum_probs=131.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCceecCc-hhhHHHHHHH Q lcl|NC_018086. 34 ITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGD-EKTIKAMQPV 112 (511) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d-~~~~~~l~~~ 112 (511) +++.+....+......+..+..|.. . ....-+........|+..++-+-.-|+.+-.. ......+... T Consensus 1 Mg~f~~lf~~~~~~~~~~~~~~~~~------v-----~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~l 69 (395) T protein:vir:10 1 MSILEKIFKTRKDITYMLDLDMIED------L-----SQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYK 69 (395) T ss_pred CchhhhhhccCccccccccchhccc------c-----chhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHH Confidence 2222211111110000011111100 0 00001123344455666665555566654222 2222233343 Q ss_pred Hhc--cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccce--EEEecCCCCCceEEEEEEEEEeecCCcc Q lcl|NC_018086. 113 FKE--NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNC--LIAYSADLDEEPVAAIYYNTVISDITGH 185 (511) Q Consensus 113 ~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~--~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 185 (511) +.. |. .......+..+.+..|.+|+++..+ +.+ ..+++... ..++++. ...+... +. T Consensus 70 l~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~--~~~--~~~~~~~~~~~~~~~~~-----~~~~~~~-------~~ 133 (395) T protein:vir:10 70 LNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS--KEL--LIADSFYREEYALYDDI-----FKDVTVK-------DY 133 (395) T ss_pred HHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecC--CCe--EecCCccceeEeecCcc-----eeEEEEc-------Cc Confidence 332 32 2334455666777778777655433 222 11222221 1222111 0111000 00 Q ss_pred eEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 186 QIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSV 265 (511) Q Consensus 186 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~ 265 (511) .+ ...+.++.+++++.-. -.....|.|.++.....++.... T Consensus 134 ~~--~~~~~~~evih~~~~~-------------------------------~~~~~~G~spi~~~~~~~~~~~~------ 174 (395) T protein:vir:10 134 TY--QRTFTMQEVIYLKYNN-------------------------------NKVTHFVESLFEDYGKIFGRMIG------ 174 (395) T ss_pred ee--eeeeccccEEEEccCC-------------------------------CCcccccchHHHHHHHHHHHHHH------ Confidence 00 1123333333332100 00122466666655555544332 Q ss_pred HHHHHhcCceeEeecCCCCccchhhhhh----h-------hCc--eeeecCCCceeeeecCCC-----HHHHHHHHHHHH Q lcl|NC_018086. 266 NDIAYWNDAYLWLQGFDLSADSDSISNM----K-------NDR--VIVTDEDGMVKFITKDVN-----DKHIENIKNRAK 327 (511) Q Consensus 266 ~~~~~~~~p~l~~~G~~~~~~~~~~~~~----~-------~~~--~i~~~~~~~~~~~~~~~~-----~~~~~~~~~~l~ 327 (511) .+...+.+--++.-......++....+ . .++ ++.++++.+.+.++.... ...+.+..+... T Consensus 175 -~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~ 253 (395) T protein:vir:10 175 -AQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAI 253 (395) T ss_pred -HHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHH Confidence 223333333333221111112222111 1 112 333454445444433221 224566667777 Q ss_pred HHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCC Q lcl|NC_018086. 328 LDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNL 407 (511) Q Consensus 328 ~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~ 407 (511) +.|+..-++|..-.+. ..++.+. ....++..+|.-++..|...+...-.....-...+.+.+...+ T Consensus 254 ~~Ia~~f~VPp~~l~~-~~sn~e~-------------~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~ 319 (395) T protein:vir:10 254 KNVALMIGIPPGLIYG-ETADLEK-------------NTLVFEKFCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVN 319 (395) T ss_pred HHHHHHhCCCHHHhcC-cccCHHH-------------HHHHHHHHHHHHHHHHHHHHHHHhhcChhhhcccceecchhhh Confidence 8888888888654431 1111111 1112223334444333333333211110000112345566666 Q ss_pred CcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCC Q lcl|NC_018086. 408 PQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPAN 485 (511) Q Consensus 408 p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (511) ..|..+.++++.++ .|+++.-.++..++.-+-+....++.. -..+......+.. .+.++. T Consensus 320 ~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~-----------~~~n~~~~~~~~~-------~~~~~~ 381 (395) T protein:vir:10 320 KKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYL-----------ITKNYEKANSGEN-------DEKEKD 381 (395) T ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceee-----------ecccccccccccc-------ccCccc Confidence 77888999998877 478888888888765321111000000 0000000000000 000000 Q ss_pred ccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 486 TSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) .. .+.++...++ .| T Consensus 382 ~~-----~~kgg~~~~~-------g~ 395 (395) T protein:vir:10 382 EN-----TLKGGDEDES-------GD 395 (395) T ss_pred cc-----ccCCCCCCCC-------CC Confidence 00 0000111111 11 No 237 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=92.81 E-value=0.0099 Score=31.57 Aligned_cols=367 Identities=8% Similarity=-0.047 Sum_probs=131.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCceecCc-hhhHHHHHHH Q lcl|NC_018086. 34 ITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGD-EKTIKAMQPV 112 (511) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d-~~~~~~l~~~ 112 (511) +++.+....+......+..+..|.. . ....-+........|+..++-+-.-|+.+-.. ......+... T Consensus 1 Mg~f~~lf~~~~~~~~~~~~~~~~~------v-----~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~l 69 (395) T protein:vir:10 1 MSILEKIFKTRKDITYMLDLDMIED------L-----SQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYK 69 (395) T ss_pred CchhhhhhccCccccccccchhccc------c-----chhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHH Confidence 2222211111110000011111100 0 00001123344455666665555566654222 2222233343 Q ss_pred Hhc--cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccce--EEEecCCCCCceEEEEEEEEEeecCCcc Q lcl|NC_018086. 113 FKE--NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNC--LIAYSADLDEEPVAAIYYNTVISDITGH 185 (511) Q Consensus 113 ~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~--~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 185 (511) +.. |. .......+..+.+..|.+|+++..+ +.+ ..+++... ..++++. ...+... +. T Consensus 70 l~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~--~~~--~~~~~~~~~~~~~~~~~-----~~~~~~~-------~~ 133 (395) T protein:vir:10 70 LNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS--KEL--LIADSFYREEYALYDDI-----FKDVTVK-------DY 133 (395) T ss_pred HHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecC--CCe--EecCCccceeEeecCcc-----eeEEEEc-------Cc Confidence 332 32 2334455666777778777655433 222 11222221 1222111 0111000 00 Q ss_pred eEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 186 QIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSV 265 (511) Q Consensus 186 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~ 265 (511) .+ ...+.++.+++++.-. -.....|.|.++.....++.... T Consensus 134 ~~--~~~~~~~evih~~~~~-------------------------------~~~~~~G~spi~~~~~~~~~~~~------ 174 (395) T protein:vir:10 134 TY--QRTFTMQEVIYLKYNN-------------------------------NKVTHFVESLFEDYGKIFGRMIG------ 174 (395) T ss_pred ee--eeeeccccEEEEccCC-------------------------------CCcccccchHHHHHHHHHHHHHH------ Confidence 00 1123333333332100 00122466666655555544332 Q ss_pred HHHHHhcCceeEeecCCCCccchhhhhh----h-------hCc--eeeecCCCceeeeecCCC-----HHHHHHHHHHHH Q lcl|NC_018086. 266 NDIAYWNDAYLWLQGFDLSADSDSISNM----K-------NDR--VIVTDEDGMVKFITKDVN-----DKHIENIKNRAK 327 (511) Q Consensus 266 ~~~~~~~~p~l~~~G~~~~~~~~~~~~~----~-------~~~--~i~~~~~~~~~~~~~~~~-----~~~~~~~~~~l~ 327 (511) .+...+.+--++.-......++....+ . .++ ++.++++.+.+.++.... ...+.+..+... T Consensus 175 -~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~ 253 (395) T protein:vir:10 175 -AQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAI 253 (395) T ss_pred -HHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHH Confidence 223333333333221111112222111 1 112 333454445444433221 224566667777 Q ss_pred HHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCC Q lcl|NC_018086. 328 LDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNL 407 (511) Q Consensus 328 ~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~ 407 (511) +.|+..-++|..-.+. ..++.+. ....++..+|.-++..|...+...-.....-...+.+.+...+ T Consensus 254 ~~Ia~~f~VPp~~l~~-~~sn~e~-------------~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~ 319 (395) T protein:vir:10 254 KNVALMIGIPPGLIYG-ETADLEK-------------NTLVFEKFCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVN 319 (395) T ss_pred HHHHHHhCCCHHHhcC-cccCHHH-------------HHHHHHHHHHHHHHHHHHHHHHHhhcChhhhcccceecchhhh Confidence 8888888888654431 1111111 1112223334444333333333211110000112345566666 Q ss_pred CcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCC Q lcl|NC_018086. 408 PQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPAN 485 (511) Q Consensus 408 p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (511) ..|..+.++++.++ .|+++.-.++..++.-+-+....++.. -..+......+.. .+.++. T Consensus 320 ~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~-----------~~~n~~~~~~~~~-------~~~~~~ 381 (395) T protein:vir:10 320 KKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYL-----------ITKNYEKANSGEN-------DEKEKD 381 (395) T ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceee-----------ecccccccccccc-------ccCccc Confidence 77888999998877 478888888888765321111000000 0000000000000 000000 Q ss_pred ccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 486 TSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) .. .+.++...++ .| T Consensus 382 ~~-----~~kgg~~~~~-------g~ 395 (395) T protein:vir:10 382 EN-----TLKGGDEDES-------GD 395 (395) T ss_pred cc-----ccCCCCCCCC-------CC Confidence 00 0000111111 11 No 238 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=92.81 E-value=0.0099 Score=31.57 Aligned_cols=367 Identities=8% Similarity=-0.047 Sum_probs=131.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCceecCc-hhhHHHHHHH Q lcl|NC_018086. 34 ITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGD-EKTIKAMQPV 112 (511) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d-~~~~~~l~~~ 112 (511) +++.+....+......+..+..|.. . ....-+........|+..++-+-.-|+.+-.. ......+... T Consensus 1 Mg~f~~lf~~~~~~~~~~~~~~~~~------v-----~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~l 69 (395) T protein:vir:95 1 MSILEKIFKTRKDITYMLDLDMIED------L-----SQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYK 69 (395) T ss_pred CchhhhhhccCccccccccchhccc------c-----chhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHH Confidence 2222211111110000011111100 0 00001123344455666665555566654222 2222233343 Q ss_pred Hhc--cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccce--EEEecCCCCCceEEEEEEEEEeecCCcc Q lcl|NC_018086. 113 FKE--NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNC--LIAYSADLDEEPVAAIYYNTVISDITGH 185 (511) Q Consensus 113 ~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~--~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 185 (511) +.. |. .......+..+.+..|.+|+++..+ +.+ ..+++... ..++++. ...+... +. T Consensus 70 l~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~--~~~--~~~~~~~~~~~~~~~~~-----~~~~~~~-------~~ 133 (395) T protein:vir:95 70 LNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS--KEL--LIADSFYREEYALYDDI-----FKDVTVK-------DY 133 (395) T ss_pred HHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecC--CCe--EecCCccceeEeecCcc-----eeEEEEc-------Cc Confidence 332 32 2334455666777778777655433 222 11222221 1222111 0111000 00 Q ss_pred eEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 186 QIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSV 265 (511) Q Consensus 186 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~ 265 (511) .+ ...+.++.+++++.-. -.....|.|.++.....++.... T Consensus 134 ~~--~~~~~~~evih~~~~~-------------------------------~~~~~~G~spi~~~~~~~~~~~~------ 174 (395) T protein:vir:95 134 TY--QRTFTMQEVIYLKYNN-------------------------------NKVTHFVESLFEDYGKIFGRMIG------ 174 (395) T ss_pred ee--eeeeccccEEEEccCC-------------------------------CCcccccchHHHHHHHHHHHHHH------ Confidence 00 1123333333332100 00122466666655555544332 Q ss_pred HHHHHhcCceeEeecCCCCccchhhhhh----h-------hCc--eeeecCCCceeeeecCCC-----HHHHHHHHHHHH Q lcl|NC_018086. 266 NDIAYWNDAYLWLQGFDLSADSDSISNM----K-------NDR--VIVTDEDGMVKFITKDVN-----DKHIENIKNRAK 327 (511) Q Consensus 266 ~~~~~~~~p~l~~~G~~~~~~~~~~~~~----~-------~~~--~i~~~~~~~~~~~~~~~~-----~~~~~~~~~~l~ 327 (511) .+...+.+--++.-......++....+ . .++ ++.++++.+.+.++.... ...+.+..+... T Consensus 175 -~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~ 253 (395) T protein:vir:95 175 -AQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAI 253 (395) T ss_pred -HHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHH Confidence 223333333333221111112222111 1 112 333454445444433221 224566667777 Q ss_pred HHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCC Q lcl|NC_018086. 328 LDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNL 407 (511) Q Consensus 328 ~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~ 407 (511) +.|+..-++|..-.+. ..++.+. ....++..+|.-++..|...+...-.....-...+.+.+...+ T Consensus 254 ~~Ia~~f~VPp~~l~~-~~sn~e~-------------~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~ 319 (395) T protein:vir:95 254 KNVALMIGIPPGLIYG-ETADLEK-------------NTLVFEKFCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVN 319 (395) T ss_pred HHHHHHhCCCHHHhcC-cccCHHH-------------HHHHHHHHHHHHHHHHHHHHHHHhhcChhhhcccceecchhhh Confidence 8888888888654431 1111111 1112223334444333333333211110000112345566666 Q ss_pred CcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCC Q lcl|NC_018086. 408 PQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPAN 485 (511) Q Consensus 408 p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (511) ..|..+.++++.++ .|+++.-.++..++.-+-+....++.. -..+......+.. .+.++. T Consensus 320 ~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~-----------~~~n~~~~~~~~~-------~~~~~~ 381 (395) T protein:vir:95 320 KKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYL-----------ITKNYEKANSGEN-------DEKEKD 381 (395) T ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceee-----------ecccccccccccc-------ccCccc Confidence 77888999998877 478888888888765321111000000 0000000000000 000000 Q ss_pred ccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 486 TSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) .. .+.++...++ .| T Consensus 382 ~~-----~~kgg~~~~~-------g~ 395 (395) T protein:vir:95 382 EN-----TLKGGDEDES-------GD 395 (395) T ss_pred cc-----ccCCCCCCCC-------CC Confidence 00 0000111111 11 No 239 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=92.77 E-value=0.01 Score=31.54 Aligned_cols=397 Identities=9% Similarity=-0.067 Sum_probs=162.7 Q ss_pred CCCccchhhccc-----ccCch--------hhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcC Q lcl|NC_018086. 1 MAIPNGQINAGD-----IITTN--------IRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFD 67 (511) Q Consensus 1 ~~~~~~~~~~~~-----~~~~~--------~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~ 67 (511) ||-...+--... ...+. ..+....-..+.+..+...+.. . ..+ +.|+ . T Consensus 1 m~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~-~---~~~----~ly~--------~--- 61 (448) T protein:vir:79 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQG-K---DGL----LVYH--------K--- 61 (448) T ss_pred CCCCCCCCccccCcccccccccchhhhhhhhhhcccccccccccchhHhhcc-c---cch----HHHH--------H--- Confidence 665554311000 00000 0000000001111111111100 0 000 0000 0 Q ss_pred ccccccceeccchHHHHHHHHHhhhhccCceec--Cchh----hHHHHHHHHhc-------cChhHHHHHHHHHHhhCCe Q lcl|NC_018086. 68 DTNKPNSKIVHNFPKLLVDTSTAYLAGEPITES--GDEK----TIKAMQPVFKE-------NYVTDVNSEEVKLSGIFGH 134 (511) Q Consensus 68 ~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~--~d~~----~~~~l~~~~~~-------n~~~~~~~~~~~~a~~~G~ 134 (511) .. ...+..-.+.+....+.+.++.+. +++. ..+.+.+.+.. -.|...+. -..+|.-||. T Consensus 62 ------m~-~D~hi~s~l~~Rk~av~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~-~~lda~~~G~ 133 (448) T protein:vir:79 62 ------ML-SDGTVKNALNYIFGRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFA-IYENAYIYGM 133 (448) T ss_pred ------Hh-hChHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHH-HHHHhhhhcc Confidence 01 145566667777777788888873 2222 22334443332 24555444 4667889997 Q ss_pred EE-EEeee-CCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccc Q lcl|NC_018086. 135 CF-EIHWI-DRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYRE 212 (511) Q Consensus 135 ~~-~~v~~-~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 212 (511) ++ +++|. ..+|...+..+.+... .. +++|.... ++... +.+... .. T Consensus 134 s~~Eivw~~~~~g~~~~~~l~~r~~------~~-------~~~f~~~~--d~~l~----~~~~~~---------~~---- 181 (448) T protein:vir:79 134 AAGEIVLTLGADGKLILDKIVPIHP------FN-------IDEVLYDE--EGGPK----ALKLSG---------EV---- 181 (448) T ss_pred eeEEEEeeecCCCceecccccccCC------cc-------ccceeeec--CCceE----EeecCC---------cc---- Confidence 76 56664 4566654333222110 00 00111111 01110 000000 00 Q ss_pred cccccccccccceeccCCccceEeecC----CcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCC-ccc Q lcl|NC_018086. 213 IPEELEIKDYEVHPNLLQKFPVLEIIA----NEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLS-ADS 287 (511) Q Consensus 213 ~~~~~~~~~~~~~~~~~g~iPvv~~~n----~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~-~~~ 287 (511) .......+..+-+++.+ |.+.. ++.|.|.+..+.-..--=+..+.+++..++.|+.|+++.+-.... .+. T Consensus 182 ---~~~~~~~~~~~lP~~~~--i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~ 256 (448) T protein:vir:79 182 ---KGGSQFVSGLEIPIWKT--VVFLHNDDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGT 256 (448) T ss_pred ---cccccCCCccccccceE--EEEecCccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCCCCcCH Confidence 00000011112233332 22222 456778888766655566677888999999999999988743221 111 Q ss_pred hh-------hhhhh--hCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHH Q lcl|NC_018086. 288 DS-------ISNMK--NDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQP 358 (511) Q Consensus 288 ~~-------~~~~~--~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~ 358 (511) +. ..++. ...+..++++.++++++.......+..+++...+.|...--.--++.+..++.+..+......- T Consensus 257 ~~~~~l~~av~~i~~g~~a~~iiP~~~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~g~~~~~~~~~~~v 336 (448) T protein:vir:79 257 KQWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARALGIDFNTVQLNMGVQAINIGEFVSL 336 (448) T ss_pred HHHHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCcccHHHHHHHHHHHHHHHHhhhhhccccccchhhhhhhhHHHH Confidence 11 12222 2345668999999999877665566667888777776655333333333333333333221111 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCC Q lcl|NC_018086. 359 LENKSAVKESKFRKVLAK-RYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLRDMLPDETIINQFPWI 437 (511) Q Consensus 359 l~~k~~~~~~~~~~~l~~-~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~~v 437 (511) ....++.-.+.+...+.+ +++-++. . +.+.. ..-..+.|....+.|.++.++.+.+++++.. T Consensus 337 ~~~~~~aDa~~i~~tln~~li~~l~~---l-Nfg~~--~~~P~~~f~~~e~~Dl~~~a~~~~~l~~~~~----------- 399 (448) T protein:vir:79 337 TQQTIISLQREFASAVNLYLIPKLVL---P-NWPSA--TRFPRLTFEMEERNDFSAAANLMGMLINAVK----------- 399 (448) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH---h-cCCCc--CCCcEEEecCCChHHHHHHHHHhhhhhccch----------- Confidence 112222233334444432 3433332 2 21111 1124678887788888888888877764311 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 438 TDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 438 ~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) + .+.+..++ ...++ ..+..+..+ +........+.+.++.+- T Consensus 400 -~----~~~~~~~~--------~~~p~---~~~~~~~~a-----------------~~~~~~~~~~~~~~~~~~ 440 (448) T protein:vir:79 400 -D----SEDIPTEL--------KALID---ALPSKMRRA-----------------LGVVDEVREAVRQPADSR 440 (448) T ss_pred -h----hHHHHHHh--------hcCCC---CCCCccccc-----------------cCCCCcccccccCCcccc Confidence 1 11111110 00000 000000000 000001111111112221 No 240 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=92.01 E-value=0.013 Score=30.88 Aligned_cols=418 Identities=10% Similarity=-0.011 Sum_probs=150.2 Q ss_pred Cccc-hhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHH------HHHHHHHhcCCCcccccCCcCcccccc-c Q lcl|NC_018086. 3 IPNG-QINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSA------YGVLYDYYKGNHIAIQSRTFDDTNKPN-S 74 (511) Q Consensus 3 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~------~~~~~~yY~G~~~~~~~~~~~~~~~~~-~ 74 (511) |++. ++++..--.+-.. ........+........ --++..+-.|.... .....+... . T Consensus 1 M~~~~~l~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~----~~~~~g~~v~~ 66 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMS----------IDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTE----LAPDTFVGLAT 66 (466) T ss_pred CchhHHHhhccCcccccc----------hhhhhhhhhhhhccccccccccccHHHHHhhcccccc----ccCccccccch Confidence 2222 2211111100000 00000000000000000 00011111111100 000001100 0 Q ss_pred --eeccchHHHHHHHHHhhhhccCceecC--ch---hh-HHHHHHHHhc-c---ChhHHHHHHHHHHhhCCeEEEEeeeC Q lcl|NC_018086. 75 --KIVHNFPKLLVDTSTAYLAGEPITESG--DE---KT-IKAMQPVFKE-N---YVTDVNSEEVKLSGIFGHCFEIHWID 142 (511) Q Consensus 75 --ri~~n~~k~ivd~~~~~l~g~~~~~~~--d~---~~-~~~l~~~~~~-n---~~~~~~~~~~~~a~~~G~~~~~v~~~ 142 (511) =+........|+..++-+-.-|+.+-. +. +. ...+..++.+ | ........+..+.+.+|.||+.+..+ T Consensus 67 ~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~ 146 (466) T protein:vir:81 67 QAYQANGPVFACMLVRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDG 146 (466) T ss_pred hhhhccHHHHHHHHHHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEec Confidence 122455566777777777667776521 11 11 1223344432 2 23345677788899999999999877 Q ss_pred CCCc---------eEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceE-EEEEEEcCCcEEEEEEccCcccccc Q lcl|NC_018086. 143 RNKK---------HRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQI-RTYEVYTEDLIYKFSTDDEREVYRE 212 (511) Q Consensus 143 ~~g~---------~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~-~~~~~~~~~~i~~~~~~~~~~~~~~ 212 (511) +.|. ..+..++|..+.+..+..... .... .|.. .+... .....+.++.++|++.- T Consensus 147 ~~g~l~~~~~g~~~~l~~l~~~~v~~~~~~~~~~--~~~y-~~~~----~~~~~~~~~~~~~~~dviHir~~-------- 211 (466) T protein:vir:81 147 EFVRMRPDWVDVVVEERMVRGGRGELGGGQLGWR--KVGY-LYTE----GGRQSGNESVGFLAEDVVHFAPI-------- 211 (466) T ss_pred CccccccccCcceeEEEEecCcceEEEEcCCCce--EEEE-EEEe----cCcccccceeeeccccEEEEcCC-------- Confidence 6554 235556666665555432211 1110 0110 00000 00011223333332100 Q ss_pred cccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhh Q lcl|NC_018086. 213 IPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISN 292 (511) Q Consensus 213 ~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~ 292 (511) .+++ +.-.|.|.+......++.......-....+...+.|-.+++-. ....++.... T Consensus 212 -------------~~~~---------d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~ 268 (466) T protein:vir:81 212 -------------PDPL---------ASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHN-PMADPAAVKK 268 (466) T ss_pred -------------CCcc---------cccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC-CCCCHHHHHH Confidence 0000 0114777777766666655555555555566666676665532 1112222222 Q ss_pred h----h--------hCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHHHHHhCccccccccc---cCccHHHHHHHHH Q lcl|NC_018086. 293 M----K--------NDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF---TAASGQALKAATQ 357 (511) Q Consensus 293 ~----~--------~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~---~~~Sg~Ai~~~~~ 357 (511) + . .++++.++++.+++.+........+....+...+.|+..-++|....+.. +.+++..++.+.. T Consensus 269 ~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~~~st~sn~eq~~~ 348 (466) T protein:vir:81 269 WADEVNSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGLAAATYSNYGQARR 348 (466) T ss_pred HHHHHHHHhcCccccccceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCCCccccccHHHHHH Confidence 1 1 13466777666666665555555666777888899999999987655421 2233333322221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEe--CCCCCcCHHHHHHHHHHHhccCChHHHHHhCC Q lcl|NC_018086. 358 PLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVF--VRNLPQSYAELADMAVKLRDMLPDETIINQFP 435 (511) Q Consensus 358 ~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f--~~~~p~d~~e~a~~~~~~~g~~s~et~~~~l~ 435 (511) ..+..+|.-+++.|...+...-... .....+.+.| ..-+-.|..+.+++....... ..+++. -+ T Consensus 349 ----------~f~~~tl~P~~~~ie~~l~~~L~~~-~~~~~~~~~f~~~~llr~d~~~r~~~~~~~~~~--~~~~~~-~g 414 (466) T protein:vir:81 349 ----------RLADGTAHPLWQNLSGCIGHVMPDM-GPDVRLWYDADDVPFLREDEKDAADIQKVRAET--INTLIT-AG 414 (466) T ss_pred ----------HHHHHHHHHHHHHHHHHHHhhcCCc-ccCcceEEEecchhhhccCHHHHHHHHHHHHHH--HHHHHH-cC Confidence 1222233333333322222111110 1111234455 333444655555542211100 001111 11 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCC-CCCccccccCCCCCCccccccCCCCccccccc Q lcl|NC_018086. 436 WITDARQEVEKADAQRQKRADIALQNFKQTSAVQG-ASTAAANKLDKNPANTSTITTTDPVAAKEQEK 502 (511) Q Consensus 436 ~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (511) ++ + .|+..+ . +.++.....+ +......-+..++......+++ ..|.+..++ T Consensus 415 -~t-~-nE~r~~-----------~-~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~Gg~~ngn 466 (466) T protein:vir:81 415 -YE-P-ESVVAA-----------V-NSGDLRLLKHTGLTSVQLLPPGVSASASSDTPT-SGGADDNGN 466 (466) T ss_pred -CC-h-hhcccc-----------c-cCCccccccCCCcchhhhcccccccccCCCCcc-cCCCCcCCC Confidence 11 1 111100 0 0000000000 0000000000000000000000 111111111 No 241 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=91.62 E-value=0.015 Score=30.58 Aligned_cols=447 Identities=9% Similarity=0.052 Sum_probs=169.6 Q ss_pred CCCccchhhcccccCchhhHhhhhc------cCCCH--HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIR------RNFDL--RELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKP 72 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~--~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~ 72 (511) .|..|...-.+.....+-.-...+. ..... -.+..-+.....-+.+|..+-.+++-+. T Consensus 5 fg~~i~~~~~~~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~-------------- 70 (533) T protein:vir:10 5 FGFSLERAKKAPKGPSFVQKDNLDGSQPVSGGGYYGYTVDFDGQVRNEYQLISRYREMVLQPECDS-------------- 70 (533) T ss_pred cccccccccccccCCCCCCCCcccccceeecccccceeeecccccchHHHHHHHHHHHhhccchhh-------------- Confidence 4445554322222211100000000 00000 0000011111222233333333333222 Q ss_pred cceeccchHHHHHHHHHhh-hhccCceec---------CchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeC Q lcl|NC_018086. 73 NSKIVHNFPKLLVDTSTAY-LAGEPITES---------GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWID 142 (511) Q Consensus 73 ~~ri~~n~~k~ivd~~~~~-l~g~~~~~~---------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~ 142 (511) -...||+..+-+ ....|+.+. ..+...+.+..++.--+|+....+..|.+.+.|+.|.+.-.| T Consensus 71 -------Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid 143 (533) T protein:vir:10 71 -------AVDDIVNETICGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVID 143 (533) T ss_pred -------HHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEec Confidence 122233322211 112233332 123345566777777789999999999999999999876554 Q ss_pred C----CCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccc Q lcl|NC_018086. 143 R----NKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELE 218 (511) Q Consensus 143 ~----~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 218 (511) . +|-..+..+||+.+-+|..-... ....++........-+ .+...-+|++..... . . ..+ T Consensus 144 ~~~pk~GI~ELr~lDPr~i~~vr~i~~~--~~~~~~~~~~~~~v~~-~~~eyf~Ynp~g~~~-~--~----------~~~ 207 (533) T protein:vir:10 144 PDNPQGGLIELRYIDPRKIRKINETEQK--RPEQLRGLPLNQQLSP-KSAEYFLYDPKGLKN-S--T----------TQG 207 (533) T ss_pred CCCccccceeeeeccccceeeeeeeecc--CCCccceeecchhhhc-cceeeeeeccccccc-c--C----------CCc Confidence 3 46677888999988665421100 0011110000000001 111123444432210 0 0 000 Q ss_pred cccccceeccCCccc--eEeecCC----cccCchhHHHHHHHHHHH--HHHHHHHHHHHHhcCceeEeecCCCCccc--- Q lcl|NC_018086. 219 IKDYEVHPNLLQKFP--VLEIIAN----EERLGDFEAQLSLIDAYN--LAVSDSVNDIAYWNDAYLWLQGFDLSADS--- 287 (511) Q Consensus 219 ~~~~~~~~~~~g~iP--vv~~~n~----~~g~s~~~~v~~l~d~~~--~~~s~~~~~~~~~~~p~l~~~G~~~~~~~--- 287 (511) + +|| .|.|... ..+.-.+.-+..-+..+| +++-|....-+..+.|-+=+.-.+....+ T Consensus 208 ----------v-kI~~dAI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~K 276 (533) T protein:vir:10 208 ----------L-KIAPDSICYVHSGIMDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNK 276 (533) T ss_pred ----------e-ecchhheeeeeccceeCCCCceeccchHhHHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchh Confidence 0 122 1111110 001112233444455555 35566666777777775533322221111 Q ss_pred --hhhhhhh-h--Cce---------------------eeec--CCCc-eeeeecC--CCHHHHHHHHHHHHHHHHHHhCc Q lcl|NC_018086. 288 --DSISNMK-N--DRV---------------------IVTD--EDGM-VKFITKD--VNDKHIENIKNRAKLDIFSLSQT 336 (511) Q Consensus 288 --~~~~~~~-~--~~~---------------------i~~~--~~~~-~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~ 336 (511) .-...+. . +++ +++| +++. .+.-|.+ .+...+. -+.-+.+-+|+--++ T Consensus 277 AeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~-DV~YF~kKLY~aLnV 355 (533) T protein:vir:10 277 AEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEITTLPGGQNLGELE-DVKYFQKKLYKSLNV 355 (533) T ss_pred HHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcChHH-HHHHHHHHHHHHhCC Confidence 1111110 0 011 1121 2222 2222333 2233222 244555555666677 Q ss_pred cccc--c-ccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc--cceeEEeCCCCCcCH Q lcl|NC_018086. 337 PDLV--S-KDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKP--YEVTPVFVRNLPQSY 411 (511) Q Consensus 337 p~~~--~-~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~--~~i~i~f~~~~p~d~ 411 (511) |-.- . +.|...-+..|-.-+.....-+.+.+..|..-+.++++.=+ +|...-...+++. ..+.+.|...-.-.+ T Consensus 356 P~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qL-iLKgiit~eeW~~i~~~I~~~f~~Dn~f~E 434 (533) T protein:vir:10 356 PGSRLETETTFNVGRAAEITRDEVKFQKFVARLRKRFSELFTDLLKTQL-VLKGVISIEEWDQMKEHIQYDYIADNYFAE 434 (533) T ss_pred CccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhccCCCHHHHHHHhhcceEeeeecchHHH Confidence 7422 2 22211122344444445567778888888888888876422 2221112222322 346777744333333 Q ss_pred HHH-------HHHHHHH---hc-cCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHH--HhhccccccCCCCCCccccc Q lcl|NC_018086. 412 AEL-------ADMAVKL---RD-MLPDETIINQFPWITDARQEVEKADAQRQKRADIA--LQNFKQTSAVQGASTAAANK 478 (511) Q Consensus 412 ~e~-------a~~~~~~---~g-~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 478 (511) ... ++++..+ +| .+|.+++++.+-.-+| +|+..++++-+.+.+.. .++.......+++. ..+ T Consensus 435 lKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tD--eei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~---~~~ 509 (533) T protein:vir:10 435 LKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQTD--VEMKEIDKQIESEMESGIIADPAAEMDPAMAAG---DPD 509 (533) T ss_pred HHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCH--HHHHHHHHHHHHHHhCCCCCCCcchhhHHhcCC---CCC Confidence 322 3344444 23 4799999988544343 33333333222222211 10000000000000 000 Q ss_pred cCCCCCCccccccCCCCccccccccCCCCCCC Q lcl|NC_018086. 479 LDKNPANTSTITTTDPVAAKEQEKAIQKKPKT 510 (511) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (511) .++.++...+ |.++.+..-.. +.- T Consensus 510 ~~~~~~~~~~-----~~~~~~~~~~~---~~~ 533 (533) T protein:vir:10 510 AGGAPAEEVA-----PEGPDPSDERK---AEF 533 (533) T ss_pred cCCcccccCC-----CCCCCcchhhc---cCC Confidence 0001111001 11111110000 000 No 242 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=91.45 E-value=0.016 Score=30.46 Aligned_cols=371 Identities=7% Similarity=-0.022 Sum_probs=144.5 Q ss_pred HHHHHHHHHHHHHH--HHHHHHhcCCCcccccCCcCccccccceec-cchHHHHHHHHHhhhhccCceec--Cch---hh Q lcl|NC_018086. 34 ITLAEMHSRSSSAY--GVLYDYYKGNHIAIQSRTFDDTNKPNSKIV-HNFPKLLVDTSTAYLAGEPITES--GDE---KT 105 (511) Q Consensus 34 ~~~~~~~~~~~~~~--~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~-~n~~k~ivd~~~~~l~g~~~~~~--~d~---~~ 105 (511) +.+.+-++++ .+- .....++.... ...........++. .+.....|+..++-+-.-|+.+- .++ .. T Consensus 1 Mg~~~~f~~k-~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~V~~~I~~ia~~iA~~p~~~~~~~~~g~~~~ 74 (403) T protein:vir:80 1 MGLFNFFRRK-TRSEPTNAISWFLTQE-----AYDTLAIPGYTRLSDNPEVRMAVHKIAELISSMTIHLMQNTDNGDIRI 74 (403) T ss_pred Cccccccccc-ccccccchhhhhcccc-----cccccccchhhhhhhhHHHHHHHHHHHHhhhhCceEEEEecCCceeec Confidence 1222111100 000 00000000000 00000000001111 22334566666666666677641 111 12 Q ss_pred HHHHHHHHh--ccCh---hHHHHHHHHHHhhC--CeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEEE Q lcl|NC_018086. 106 IKAMQPVFK--ENYV---TDVNSEEVKLSGIF--GHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNT 177 (511) Q Consensus 106 ~~~l~~~~~--~n~~---~~~~~~~~~~a~~~--G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~ 177 (511) ...+..++. -|.. ......+..+.+.. |.||+++..+..|++ .+..++|..+.++.+++. .+++. T Consensus 75 ~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L~~l~p~~v~~~~~~~g-------~~~~y 147 (403) T protein:vir:80 75 KNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDELIPLAPSKVSFVDTDTG-------YQIWY 147 (403) T ss_pred CChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEEEEEcCCeeEEEEcCCc-------eEEEE Confidence 223444443 2332 23344455566654 667887777777775 466788877765544321 01110 Q ss_pred EeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCc-ccCchhHHHHHHHHH Q lcl|NC_018086. 178 VISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANE-ERLGDFEAQLSLIDA 256 (511) Q Consensus 178 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-~g~s~~~~v~~l~d~ 256 (511) .+ ..|.++.++|+.... .| .+. .|.|.+..+...+.. T Consensus 148 -----~~------~~~~~~eiih~~~~~--------------------------~~-----~~~~~G~s~~~~~~~~i~~ 185 (403) T protein:vir:80 148 -----QG------KAYNYDEVLHFIVNP--------------------------DP-----EKPYMGRGYRVVLKDIVNN 185 (403) T ss_pred -----ee------cccchhhEEEEeccC--------------------------CC-----cCccccccHHHHHHHHHHH Confidence 00 112333333332100 00 111 366666655555555 Q ss_pred HHHHHHHHHHHHHHhcCceeEeecCCC-Cc--cchhhhhh--------hhCceeeecCCC-ceeeee-cCCCHHHHHHHH Q lcl|NC_018086. 257 YNLAVSDSVNDIAYWNDAYLWLQGFDL-SA--DSDSISNM--------KNDRVIVTDEDG-MVKFIT-KDVNDKHIENIK 323 (511) Q Consensus 257 ~~~~~s~~~~~~~~~~~p~l~~~G~~~-~~--~~~~~~~~--------~~~~~i~~~~~~-~~~~~~-~~~~~~~~~~~~ 323 (511) ......-....+...+.|-.++.-... .+ .+.....+ ..++.+.++.+. +..-++ .+.....+.... T Consensus 186 ~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~ 265 (403) T protein:vir:80 186 LKQATTTKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLEASEAGQPWIIPAELLDVEQVKPLSLKDLAIHETV 265 (403) T ss_pred HHHHHHHHHHHHhccCCcceEEEeCCCCChHHHHHHHHHHHHHHhhhhhcCCeeeecccccccceeccCCHHHHHHHHHH Confidence 444444344445555566665543221 11 11111111 122344444332 222122 222333445556 Q ss_pred HHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEe Q lcl|NC_018086. 324 NRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVF 403 (511) Q Consensus 324 ~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f 403 (511) +.....|+..-++|..-.+.....+.... ..+..+|.-+++.|...+...--. ..+ ..+++.. T Consensus 266 ~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~---------------~f~~~~l~P~~~~ie~~l~~kll~-~~~-~~~~f~~ 328 (403) T protein:vir:80 266 ELDKRTVAGIFGVPAFLLGVGKYDKDEYN---------------NFINSTILPIAKGIEQELTRKLLI-SPD-LYFKFNP 328 (403) T ss_pred HHhHHHHHHHhCCCHHHcCCCCccHHHHH---------------HHHHHHHHHHHHHHHHHHHHhccC-CCC-cEEEeec Confidence 66777788887887544432111221111 133344444444444433321111 111 1233444 Q ss_pred CCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCC Q lcl|NC_018086. 404 VRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDK 481 (511) Q Consensus 404 ~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (511) ..-+..|..+.++++.++ .|+++.-.++..++.-+.+... ++ .... ...+. +..++.... T Consensus 329 ~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~ggd--~~-----------~~~~----n~~pl-~~~~~~~~~ 390 (403) T protein:vir:80 329 RSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKEGLS--EL-----------VILE----NYIPL-DKIGDQNKL 390 (403) T ss_pred hhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--eE-----------eecc----cccch-hhccchhhc Confidence 556667888899988877 4889988888887653321100 00 0000 00000 000000000 Q ss_pred CCCCccccccCCC Q lcl|NC_018086. 482 NPANTSTITTTDP 494 (511) Q Consensus 482 ~~~~~~~~~~~~~ 494 (511) ++|..+...++.+ T Consensus 391 k~ge~~~~~~~~~ 403 (403) T protein:vir:80 391 KGGEKGGADGQTD 403 (403) T ss_pred cCCCCCCCCCCCC Confidence 1111110000000 No 243 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=91.11 E-value=0.017 Score=30.22 Aligned_cols=394 Identities=12% Similarity=0.065 Sum_probs=149.9 Q ss_pred HHHHhcCCCcccccCCcCccccc---cceeccchHHHHHHHHHhhhhccCceecC-ch--hhHHHHHHHHhc--cC---h Q lcl|NC_018086. 50 LYDYYKGNHIAIQSRTFDDTNKP---NSKIVHNFPKLLVDTSTAYLAGEPITESG-DE--KTIKAMQPVFKE--NY---V 118 (511) Q Consensus 50 ~~~yY~G~~~~~~~~~~~~~~~~---~~ri~~n~~k~ivd~~~~~l~g~~~~~~~-d~--~~~~~l~~~~~~--n~---~ 118 (511) +--.-.|........ ...... ..-+........|+..++-+-+-|+.+-. +. .....+.+++.. |. . T Consensus 1 ~~~~~~~~g~~~~~~--~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~~~~~~~l~~lL~~~PN~~~t~ 78 (723) T protein:vir:94 1 MTTFPSGAGGWNAWS--ADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDGELDELHPLSQLWNVMPNRAMPA 78 (723) T ss_pred CcccccCCCcccccc--ccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCCccchhhHHHHHHhhCCCCCCCH Confidence 000000000000000 000000 00011233334555555555555766422 21 112234455542 33 2 Q ss_pred hHHHHHHHHHHhhCCeEEEEeeeCC---CCce-EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEc Q lcl|NC_018086. 119 TDVNSEEVKLSGIFGHCFEIHWIDR---NKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYT 194 (511) Q Consensus 119 ~~~~~~~~~~a~~~G~~~~~v~~~~---~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~ 194 (511) ......+..+.+.+|.+|+.+..+. .|.| .+..++|..+.++..+............|.... .+|.. ..+. T Consensus 79 ~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~~~~y~~~~-~~G~~----~~~~ 153 (723) T protein:vir:94 79 QVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQAQIIGYVIER-TDGVR----VPVL 153 (723) T ss_pred HHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceeeeeeEEEEEe-cCcee----EEec Confidence 3455566777889999998886543 3443 355566655444332221110011111111110 11110 0112 Q ss_pred CCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_018086. 195 EDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDA 274 (511) Q Consensus 195 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p 274 (511) ++.+++++.- + | .+.-.|.|.+......+.............+...+.| T Consensus 154 ~~dIiHir~~----------------------~-----~----~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p 202 (723) T protein:vir:94 154 ADEMLWLRFS----------------------D-----P----YDPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARP 202 (723) T ss_pred ccceEEecCC----------------------C-----C----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Confidence 2222222100 0 0 0112477777665555554444444444444555667 Q ss_pred eeEeecCCCCccchhhhh----hh--------hCceeeecC--------CCceeeeecCCC--HHHHHHHHHHHHHHHHH Q lcl|NC_018086. 275 YLWLQGFDLSADSDSISN----MK--------NDRVIVTDE--------DGMVKFITKDVN--DKHIENIKNRAKLDIFS 332 (511) Q Consensus 275 ~l~~~G~~~~~~~~~~~~----~~--------~~~~i~~~~--------~~~~~~~~~~~~--~~~~~~~~~~l~~~i~~ 332 (511) -.++.--.. +++.... +. .++.+.++. +.+.+|.....+ ...+.+..+...+.|+. T Consensus 203 ~giL~~~~l--~~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~ 280 (723) T protein:vir:94 203 GGVVNLGDM--DEQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVML 280 (723) T ss_pred ceEEEcCCC--CHHHHHHHHHHHHHHhhchhhcCcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHH Confidence 666653222 2222211 11 133455543 124455444443 34455556667778888 Q ss_pred HhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCC--CCCcC Q lcl|NC_018086. 333 LSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVR--NLPQS 410 (511) Q Consensus 333 ~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~--~~p~d 410 (511) .-++|....+.. ++...+..+.. ..+...|.-.++.|...++..-.. .+ ..++.+.|+. -+..| T Consensus 281 afgVPp~~i~~~--st~sN~e~~~~----------~f~~~tL~P~~~~ie~~ln~~Ll~-~~-g~~~~~~f~~~~lLr~D 346 (723) T protein:vir:94 281 AFGIRKDALLGG--STYENQAEAKA----------AVWTETLIPQMEVMASITDLQLLP-DI-GWTVEWDFNSVPALQED 346 (723) T ss_pred HhCCChhHcCCC--CCcccHHHHHH----------HHHHHHHHHHHHHHHHHHhHhhcc-cc-cCceEEeecchhhhhcC Confidence 888886433221 22111111111 122333444444433333321111 11 2246677754 34578 Q ss_pred HHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCC-CCC Q lcl|NC_018086. 411 YAELADMAVKL--RDMLPDETIINQFPWI--TDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKN-PAN 485 (511) Q Consensus 411 ~~e~a~~~~~~--~g~~s~et~~~~l~~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 485 (511) ..+.++.+.++ .|+++.-.++..++.- ++-+..+ .+.+........+......+++... .+. T Consensus 347 ~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~~~-------------~~~p~~~~~a~~~~~~p~~~e~~~~~~~~ 413 (723) T protein:vir:94 347 LEAQAGRNQGYLVNDVLMVDEVRATIGLDPLPGGIGQM-------------TLTPYRAQFAPAPAPAPAVEEGAARMLAL 413 (723) T ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccc-------------eeccccccccCCCCCCccchhhhHhhhhh Confidence 88888888876 4789988888887652 2211100 0011101000010000000010000 000 Q ss_pred ccccccCCCCcccccccc--CCCC-CCCC Q lcl|NC_018086. 486 TSTITTTDPVAAKEQEKA--IQKK-PKTD 511 (511) Q Consensus 486 ~~~~~~~~~~~~~~~~~~--~~~~-~~~~ 511 (511) .......+| .++.+.++ +..+ ++.| T Consensus 414 ~~~~~~~~p-~~~~~~~~~~~~~~~~~~~ 441 (723) T protein:vir:94 414 LERVAADRP-LPELPVRATTVLHHDPGPD 441 (723) T ss_pred ccccccccC-cCCCCCCCCCCCCCCcccC Confidence 000011111 11111111 1111 1112 No 244 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=89.64 E-value=0.025 Score=29.35 Aligned_cols=434 Identities=8% Similarity=-0.045 Sum_probs=155.0 Q ss_pred cCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhh Q lcl|NC_018086. 14 ITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLA 93 (511) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~ 93 (511) +.. ++..-..+++..|-+....=.+ .-.-.||+-.-+++... ..-....... ...+..-.+.+....+. T Consensus 1 ~~~----~~~~~~gl~p~rl~~i~~~~~~-----~~~~~~~~~~~~~Lr~~-~~~~ly~~m~-~D~hi~s~l~~Rk~av~ 69 (488) T protein:vir:95 1 MAD----ITETQESLPPFRMGEVGSLGLK-----VKNGRIYEEPRQALRFP-ESIKTFQLMM-RDPAVAASVNIIKMFVR 69 (488) T ss_pred CCC----ccccCCCCCHHHHHHHHHHhhc-----cccchhhccchhhhccc-chHHHHHHHh-hChHHHHHHHHHHHHHh Confidence 111 1112234555443332211000 00112333111111100 0000000011 25667777888888888 Q ss_pred ccCceecC-----chh----hHHHHHHHHhcc--ChhHHHHHHHHHHhhCCeEE-EEeeeCCCCceEEEEEcccceEEEe Q lcl|NC_018086. 94 GEPITESG-----DEK----TIKAMQPVFKEN--YVTDVNSEEVKLSGIFGHCF-EIHWIDRNKKHRFKAVSPMNCLIAY 161 (511) Q Consensus 94 g~~~~~~~-----d~~----~~~~l~~~~~~n--~~~~~~~~~~~~a~~~G~~~-~~v~~~~~g~~~i~~~~p~~~~~v~ 161 (511) +.++++.. ++. ..+.+.+++..- .+...+..+ .+|.-||.++ +++|....+..........+..... T Consensus 70 ~~~w~v~p~~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~-lda~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~ 148 (488) T protein:vir:95 70 KVNWRFVPPKGKEQDPKMLERADFFNSLMDDMEHDWADFINSV-MSFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGW 148 (488) T ss_pred cCCceEecCCCCchhHHHHHHHHHHHHHHhccCccHHHHHHHH-HHhhcccceeeeeeeeccccccccccccccCCeeee Confidence 88887741 121 223455555543 355666654 5788999765 5666533221111110000000000 Q ss_pred cCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccc---eEeec Q lcl|NC_018086. 162 SADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFP---VLEII 238 (511) Q Consensus 162 d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~ 238 (511) .. ...++-.-+++|.... ++..+....-..+... .. ........... --.|| +|.++ T Consensus 149 ~~-i~~Rpq~~~~~f~~d~--d~~l~~~~~~~~~~~~----~~------------~~~~~~~~~~~-~~~lP~~kfi~~~ 208 (488) T protein:vir:95 149 AK-LPIRNQSTLDKWYFDE--DFRRVTGVRQNLRNVS----HI------------AGAINLGERPL-TRKLPRAKFMLFK 208 (488) T ss_pred ee-eeecCcccccceeecc--CCCceeeccccccccc----cc------------ccccccccccc-cccccccceEEEe Confidence 00 0000000011111111 1111110000000000 00 00000000000 01133 12222 Q ss_pred -----CCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecC---CCCccchhh----hhh---hh------Cc Q lcl|NC_018086. 239 -----ANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGF---DLSADSDSI----SNM---KN------DR 297 (511) Q Consensus 239 -----n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~---~~~~~~~~~----~~~---~~------~~ 297 (511) .++.|.|.+..+.-..--=+..+..++..++.|..|+.+.+|. ......+.. ..+ .. .. T Consensus 209 ~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~a 288 (488) T protein:vir:95 209 YDDEYGNPEGRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRA 288 (488) T ss_pred ecCCCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchh Confidence 3456777777644333333456677788888888888877762 111111111 111 10 12 Q ss_pred eeeecCCCceee---------eec-CCCHHHHHHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 298 VIVTDEDGMVKF---------ITK-DVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKE 367 (511) Q Consensus 298 ~i~~~~~~~~~~---------~~~-~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~ 367 (511) .+.++.+-++++ +.. ......+...++...+.|...--..-++.+..++.|...=+....-....++.-. T Consensus 289 g~iiP~g~~~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~iLGqtLT~~~~~~Gs~Al~~vh~ev~~~i~~aDa 368 (488) T protein:vir:95 289 GLIWPRYIDPDTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAFMSDVLAMGQSKYGSFSLADSKTSLLAMSVDILL 368 (488) T ss_pred heeeccccccccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHHhccccccccCcchhhhHHHHHHHHHHHHHHHHH Confidence 234454433332 111 1223346667777777776654333233222111111111111112222233333 Q ss_pred HHHHHHHH-HHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHHh--cc-CCh----HHHHHhCCCCCC Q lcl|NC_018086. 368 SKFRKVLA-KRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKLR--DM-LPD----ETIINQFPWITD 439 (511) Q Consensus 368 ~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~--g~-~s~----et~~~~l~~v~d 439 (511) +.+...+. ++++-++. ..... ...-..+.|....+.|.++.++++.+++ |+ ++. +.+.+.++. +. T Consensus 369 ~~i~~tln~~li~~l~~---~Nfg~---~~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gi-p~ 441 (488) T protein:vir:95 369 KQIKNVINRDLVAQTYA---LNMWD---DEEHVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGL-PP 441 (488) T ss_pred HHHHHHHHHHHHHHHHH---hcCCC---CCCccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCC-CC Confidence 44444443 34433332 22111 1112467888888889889999998884 54 553 344555543 21 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCccccccccCCCCC Q lcl|NC_018086. 440 ARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKEQEKAIQKKP 508 (511) Q Consensus 440 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (511) ++.. | .. . ....+.....+......++... ...-.++..+.+.+++- T Consensus 442 ~~~~------e---~~---~------~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~a~~~~~ 488 (488) T protein:vir:95 442 ADES------Q---PV---S------EKLSPNSQSRSGDGYKTAGEGT----AKTPSAKDPSTANKANK 488 (488) T ss_pred CCCC------c---cc---c------ccCCCCCCCCCCcccCCCcccC----CcccccccchhhhhccC Confidence 1100 0 00 0 0000000000000000000000 00111111111111111 No 245 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=89.48 E-value=0.026 Score=29.26 Aligned_cols=349 Identities=11% Similarity=0.036 Sum_probs=145.1 Q ss_pred HHHHHHHHH-----HHHHHHHHHHHhcCCCcccccCC-cCcccc-cc------------------------ce--eccch Q lcl|NC_018086. 34 ITLAEMHSR-----SSSAYGVLYDYYKGNHIAIQSRT-FDDTNK-PN------------------------SK--IVHNF 80 (511) Q Consensus 34 ~~~~~~~~~-----~~~~~~~~~~yY~G~~~~~~~~~-~~~~~~-~~------------------------~r--i~~n~ 80 (511) +.+-..+.. ..+.-.=..+|..|+.++..-+. ...... .. .+ +.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~~~~~~~~~ 80 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQDKLRTLIDV 80 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccccccCccccchhhHhhhHH Confidence 111110000 00000112345555444332111 110000 00 00 01122 Q ss_pred HHHHHHHHHhhhhccCceecCchhhHHHHHHHHh--ccChh--HHHHHHHHHHhhCCeEEEE-eeeCCCCce-EEEEEcc Q lcl|NC_018086. 81 PKLLVDTSTAYLAGEPITESGDEKTIKAMQPVFK--ENYVT--DVNSEEVKLSGIFGHCFEI-HWIDRNKKH-RFKAVSP 154 (511) Q Consensus 81 ~k~ivd~~~~~l~g~~~~~~~d~~~~~~l~~~~~--~n~~~--~~~~~~~~~a~~~G~~~~~-v~~~~~g~~-~i~~~~p 154 (511) ....|+..++-+-+-|+.+-.+....+.+..++. -|... ..+.+.....+..|.+|+. +..+.+|.+ .+..++| T Consensus 81 v~acV~~Ia~~iA~lpl~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~l~~~lllGnay~~~i~r~~~G~~~~L~pl~p 160 (409) T protein:vir:83 81 AWACIDLNASVLSSMPIYRMRNGRIIDSVAWMSNPDPEVYTSWQEFAKQLFWDFQLGEAFVLPMAHGSDGYPIRFRVVPP 160 (409) T ss_pred HHHHHHHHHHhhccCceEEeeCCccccchhhhcccCCCCCCCHHHHHHHHHHHHhhCCcEEEEEEECCCCcEEEEEEECC Confidence 3345555555555557654222111122222222 12221 1223333333455888875 556777875 4677888 Q ss_pred cceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccce Q lcl|NC_018086. 155 MNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPV 234 (511) Q Consensus 155 ~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 234 (511) ..+.+.++++.. .+|.+. ..+.++ .| T Consensus 161 ~~v~v~~~~~g~-------~~y~~~-----------~~~~~~------------------------------------ei 186 (409) T protein:vir:83 161 WLVNVELKKGAR-------REYRIG-----------GLNVTD------------------------------------EI 186 (409) T ss_pred cceEEEEcCCce-------EEEEEc-----------cccCcc------------------------------------ce Confidence 877655543210 111110 001111 23 Q ss_pred EeecC-----CcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhh----------hCcee Q lcl|NC_018086. 235 LEIIA-----NEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMK----------NDRVI 299 (511) Q Consensus 235 v~~~n-----~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~----------~~~~i 299 (511) ++++. .-.|.|.++.....++..+....-..+.+...+.|-.++.-. ....++....++ .++.+ T Consensus 187 iHir~~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~nga~p~gil~~~-~~ls~e~~~~~~~~~~~~~~~nag~~~ 265 (409) T protein:vir:83 187 LHIRYQGNTADAHGHGPLESAAPRQVVIGLLQKYVQNLAETGGVPLYWLGVE-RRLSETEAVDLMDRWIESRSKYAGHPA 265 (409) T ss_pred EEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEeecC-CCCCHHHHHHHHHHHHHhhCCccCccc Confidence 44431 125778787777777766655544455555666777666532 222222222211 12344 Q ss_pred eecCCCce-eeeecCCCHHHHHHHHHHHHHHHHHHhCcccccccccc---CccHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 300 VTDEDGMV-KFITKDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT---AASGQALKAATQPLENKSAVKESKFRKVLA 375 (511) Q Consensus 300 ~~~~~~~~-~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~---~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~ 375 (511) .+.++.+. +.+........+.+..+...+.|+..-++|....+..+ ..+-..++.+.... +..+|. T Consensus 266 il~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp~llg~~~~~~~~tysn~eq~~~~f----------~~~tL~ 335 (409) T protein:vir:83 266 LVTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVPPFLVGLPGATGSLTYSNIEQLFSFH----------DRSSLR 335 (409) T ss_pred eecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCHHHccCCCCccccccccHHHHHHHH----------HHHHHH Confidence 55555443 23333333334555566677888888888865554221 11111122211111 112222 Q ss_pred HHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHH Q lcl|NC_018086. 376 KRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQK 453 (511) Q Consensus 376 ~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~ 453 (511) -.++.|...++..--.. ...+++.+..-+-.|.++.++++.++ .|+++.-.+++.++.- T Consensus 336 P~~~~ie~~l~~~Ll~~---~~~~~f~~~~llr~d~~~r~~~~~~~~~~G~lT~NE~R~~~glp---------------- 396 (409) T protein:vir:83 336 PKATAVMAALDRWALPS---PQHLELNRDDYTRPSLVERATAYKIMIEAGVMEPNEARAMERLH---------------- 396 (409) T ss_pred HHHHHHHHHHHHhhCCC---CcEEEeehhhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC---------------- Confidence 22222222221110000 12355555666667888888888776 4777776665554321 Q ss_pred HHHHHHhhccccccCCCCCCcccccc Q lcl|NC_018086. 454 RADIALQNFKQTSAVQGASTAAANKL 479 (511) Q Consensus 454 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (511) +..|++.-++... T Consensus 397 -------------p~~ggd~l~~~gv 409 (409) T protein:vir:83 397 -------------SEAAAVRLSGGGV 409 (409) T ss_pred -------------CCCCCcccCCCCC Confidence 1111111111111 No 246 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=88.57 E-value=0.031 Score=28.82 Aligned_cols=439 Identities=13% Similarity=0.066 Sum_probs=158.2 Q ss_pred Cccc--------hhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc----c--ccCCcCc Q lcl|NC_018086. 3 IPNG--------QINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIA----I--QSRTFDD 68 (511) Q Consensus 3 ~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~----~--~~~~~~~ 68 (511) |+=. .++...+++..+-+.-.....- ...|.. .........-....||-|.-.- + ++.... T Consensus 1 ~~~~~~w~~~de~~~~~~~~~~~~~~~~p~~~dG-~s~i~~---~~~~~~~~~~~~~~~~gg~~~n~~eLI~~YR~ma~- 75 (533) T protein:vir:58 1 MPSLEKYKKLNEAVNFTNFLSPMYGMGAPHGAGG-SSMIPI---NMYHPFATAGYASRFYGGIEFNRFFLYDMYDRMDY- 75 (533) T ss_pred CCCcchhhhhhHHHHHHHhhchhhcccCccCCCC-CccccC---CCCcchhhhhhhhhhhccccccHHHHHHHHHHhhc- Confidence 1100 1111111111111100000000 000000 0000000000111223221000 0 000000 Q ss_pred cccccceeccchHHHHHHHHH-hhhhccCceecCch-hhHHHHH-HHHhccChhHHHHHHHHHHhhCCeEEEEeeeC--C Q lcl|NC_018086. 69 TNKPNSKIVHNFPKLLVDTST-AYLAGEPITESGDE-KTIKAMQ-PVFKENYVTDVNSEEVKLSGIFGHCFEIHWID--R 143 (511) Q Consensus 69 ~~~~~~ri~~n~~k~ivd~~~-~~l~g~~~~~~~d~-~~~~~l~-~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~--~ 143 (511) .+--+.+-...||+..+ ......|+.+.-++ +..+.+. .+..-.+|+....+..|.+++.|+.|.+.-.+ + T Consensus 76 ----~~pEVd~AideIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~~lldf~~~~~~~fR~WYVDGriy~Hkiik~~k 151 (533) T protein:vir:58 76 ----TDPLISTVLDIIADECTIPNENGNIVDVVTKDIELAKAILSYLDYVINIEKNAYPIIRNMIKYGDMFLHILEKGSD 151 (533) T ss_pred ----cCcchhhHHHhhhceeeEecCCCceeEeecccccccHHHHHHHHHHhcchhhhhHHHHhhhhcceeEEEeccCCcc Confidence 00001122223333322 22234455543222 2333332 33445679999999999999999999887542 3 Q ss_pred CCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccccccccccccc Q lcl|NC_018086. 144 NKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYE 223 (511) Q Consensus 144 ~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (511) .|-..+..+||..+-.+|+.-+. . .+-+|++....... .... T Consensus 152 ~GI~elr~lDPr~i~~vr~~~t~--------------------~-eyyvy~~~~~~~~s------------~~~~----- 193 (533) T protein:vir:58 152 GTIEKFQVVSPYIFSKRYNPETD--------------------T-WYYVITDVYRNVVS------------GYFN----- 193 (533) T ss_pred cchhhheecCCeeeEEEEeeccc--------------------e-EEEeeccccccccc------------Cccc----- Confidence 44457888999988776654322 0 11234443211100 0000 Q ss_pred ceeccCCccc---eEeec------CCcccCchhHHHHHHHHHHH--HHHHHHHHHHHHhcCceeEeecCCCCc-----cc Q lcl|NC_018086. 224 VHPNLLQKFP---VLEII------ANEERLGDFEAQLSLIDAYN--LAVSDSVNDIAYWNDAYLWLQGFDLSA-----DS 287 (511) Q Consensus 224 ~~~~~~g~iP---vv~~~------n~~~g~s~~~~v~~l~d~~~--~~~s~~~~~~~~~~~p~l~~~G~~~~~-----~~ 287 (511) -+|| |+++. +.+.+.|.+... +..+| +++-|....-+..+.|-+=+.=.+... .+ T Consensus 194 ------~kI~~daI~y~~SGl~d~~~~~iisyLhkA---iKp~NQLkmiEDAlVIYRisRAPeRRvFYIDVGNlpk~KAe 264 (533) T protein:vir:58 194 ------EDIPEEDVIHFSHKIDTNFFPYGRSYLESA---RAIWNQLRLMEDALMLYRVVRSVDRRVFYVDVGNVPPDKIN 264 (533) T ss_pred ------cccchhheeeeeeccccCCCCceehhhhHH---HHHHHHHHHHHHHHHHHhhcCChhheEEEEeecCCCccCHH Confidence 0122 22221 123344555543 33444 234555555566666543222111111 11 Q ss_pred hhhhhhhh---Cce------------------------eeec--CCCc-eeeeecCCCHHHHHHHHHHHHHHHHHHhCcc Q lcl|NC_018086. 288 DSISNMKN---DRV------------------------IVTD--EDGM-VKFITKDVNDKHIENIKNRAKLDIFSLSQTP 337 (511) Q Consensus 288 ~~~~~~~~---~~~------------------------i~~~--~~~~-~~~~~~~~~~~~~~~~~~~l~~~i~~~s~~p 337 (511) +-...+.. +++ +++| +++. .+.-|.++..-.-..-+.-+.+-+|.--.+| T Consensus 265 qYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLpGg~lgemeDV~YF~kkLy~ALnVP 344 (533) T protein:vir:58 265 EYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQGSKVDLAEDVEYMLNRLISALKVP 344 (533) T ss_pred HHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhcccccCCCccceeeecCCCCCCcHHHHHHHHHHHHHHhCCC Confidence 11111100 000 0111 1222 2333333322222334566677778777888 Q ss_pred ccc---cccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHH- Q lcl|NC_018086. 338 DLV---SKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAE- 413 (511) Q Consensus 338 ~~~---~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e- 413 (511) -.- .+.||.+| .|-.-+.....-+.+.+..|..-|.+.| ++ ++... ..+..+.|...-.-.+.. T Consensus 345 ~sRl~~e~~fgr~~--eItRDEiKF~KFI~rLR~rF~~ll~~qL--il-----k~iit---~eew~~~f~~Dn~f~ElKe 412 (533) T protein:vir:58 345 KAFIGYEGDVNAKN--TLATQDIKFNNTIKRIQGFFVEELERMV--RM-----NKEFA---DQDFRLVMNRSNSIVEGER 412 (533) T ss_pred eeecCCCCCCccch--hhhHHHHHHHHHHHHHHHHHHHHHhccc--cc-----ccCcc---hhheeeeeeccchHHHHHH Confidence 432 23343222 2222222234445555555666665532 22 22222 223456674333322222 Q ss_pred ------HHHHHHHHhccCChHHHHHhC-CCCCCHHHHHHHHHHHHHHHHHHHHhhcc------ccccCCCCCCccc---- Q lcl|NC_018086. 414 ------LADMAVKLRDMLPDETIINQF-PWITDARQEVEKADAQRQKRADIALQNFK------QTSAVQGASTAAA---- 476 (511) Q Consensus 414 ------~a~~~~~~~g~~s~et~~~~l-~~v~d~~~E~~ri~~E~~~~~~~~~~~~~------~~~~~~~~~~~~~---- 476 (511) .++++..+.+.+++.++++.+ ...+|...+.+.|++|...-.-. ..... +..+..+ ++.+. T Consensus 413 ~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~tdei~~q~e~ie~E~~~~~~~-~~~~~~e~~~~~~~~~~~-~p~~~~~~~ 490 (533) T protein:vir:58 413 FAVIEQRIGIAERLKGWVREDWIYSNILQIPYDLKPQEEVAEAAGGGGLFD-TGGFGEETTPADFLGERG-SPIESPRGR 490 (533) T ss_pred HHHHHHHHHHHHHhcchhhHHHHHHHHhcCChhhhHHHHHHHHhhcCCCCC-CCCcccccCCcccCcccc-CcccCCCCh Confidence 234555566788999988875 44444444444444332210000 00000 0000000 00000 Q ss_pred cccCCCCCCccccccCCCCccccccccCCCCCCCC Q lcl|NC_018086. 477 NKLDKNPANTSTITTTDPVAAKEQEKAIQKKPKTD 511 (511) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) .+.+.+.+.....++..+-++++.+...++-+++- T Consensus 491 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g~~~~ 525 (533) T protein:vir:58 491 TEFDFGTEGGEELGGELNLGGAFEEFEEETGGGEE 525 (533) T ss_pred hhHhcccCCcccccccccccccchhhhhhcCCccc Confidence 00000011111111222333333333333333333 No 247 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=79.00 E-value=0.11 Score=25.87 Aligned_cols=348 Identities=11% Similarity=0.018 Sum_probs=139.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCceec-CchhhHHHHHHH Q lcl|NC_018086. 34 ITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITES-GDEKTIKAMQPV 112 (511) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~-~d~~~~~~l~~~ 112 (511) +.+......+.... ...|.+.. . .. . ....-+.......+|+..++-+-.-|+.+- .+.+....+..+ T Consensus 1 Mg~f~~~f~~~~~~---~~~~~~~~--~-~~---~--~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~~~~l~~l 69 (385) T protein:vir:95 1 MGLFDSVFKRHSEL---SWMYDLEF--L-QD---K--SKKAYLKQIALNTVVEMVARTISQSEFRVMKNNTKEKGTLYYL 69 (385) T ss_pred CchhhhhhccCccc---ccccchhh--h-hc---c--chhhhhhhHHHHHHHHHHHHHHcccceeeeecCccccchHHHH Confidence 11211111111000 01111100 0 00 0 000001234445667777776666677652 222222334444 Q ss_pred Hhc--cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCceEE--EEEcccceEEEecCCCCCceEEEEEEEEEeecCCcc Q lcl|NC_018086. 113 FKE--NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRF--KAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGH 185 (511) Q Consensus 113 ~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i--~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 185 (511) +.. |. -......+..+.+.+|.||++... ++...+ ..+.|.. ..++.+ .++..... +. T Consensus 70 L~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~--~~~~~~~~~~~~~~~-~~~~~~----------~~~~~~~~--~~ 134 (385) T protein:vir:95 70 LNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKND--EGHFFVADDFEKEDE-LGLYSH----------RFTNVLVN--DF 134 (385) T ss_pred HhcccCcCCCHHHHHHHHHHHHhhcCceEEEEec--CCCeeeccccccccc-cccccc----------cceeeeec--cc Confidence 432 32 234566778888999999976543 332211 1111111 111100 00000000 00 Q ss_pred eEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCC-----cccCchhHHHHHHHHHHHHH Q lcl|NC_018086. 186 QIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIAN-----EERLGDFEAQLSLIDAYNLA 260 (511) Q Consensus 186 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~-----~~g~s~~~~v~~l~d~~~~~ 260 (511) . ....+.++.+ ++++.. ..|.|.+..+...++. T Consensus 135 ~--~~~~~~~~ei------------------------------------ih~~~~~~~~~~~G~s~~~~~~~~i~~---- 172 (385) T protein:vir:95 135 E--FKRVFTMDDV------------------------------------IYLKYNNQKLDAFSLGLFEDYGEIFGR---- 172 (385) T ss_pred c--eeeeeccccE------------------------------------EEecCCCCCcccccchHHHHHHHHHHH---- Confidence 0 0011222222 333221 2355555444433322 Q ss_pred HHHHHHHHHHhcCc--eeEeecCCCCccchhhhh----h---------hhCceeeecCCCceeeeecCC------CHHHH Q lcl|NC_018086. 261 VSDSVNDIAYWNDA--YLWLQGFDLSADSDSISN----M---------KNDRVIVTDEDGMVKFITKDV------NDKHI 319 (511) Q Consensus 261 ~s~~~~~~~~~~~p--~l~~~G~~~~~~~~~~~~----~---------~~~~~i~~~~~~~~~~~~~~~------~~~~~ 319 (511) ......+...| ++++.+.. ..+++.... + ....++.++++.+.+.++... ....+ T Consensus 173 ---~~~~~~~~~~~~g~l~~~~~~-~~~~e~~~~~~~~~~~~~~g~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~ 248 (385) T protein:vir:95 173 ---MIDLQMLNNQIRGILKVDATK-FYNKEKQKELQAYIDTLFDAFQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSEL 248 (385) T ss_pred ---HHHHHHhcCCCceEEEeCCcc-CCCHHHHHHHHHHHHHHhhhhhhcCCceEEcCCCceeEeecccccccCCHHHHHH Confidence 22223333333 22332321 111221111 1 112356677666666554321 23456 Q ss_pred HHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ccccccc Q lcl|NC_018086. 320 ENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKA-KDLKPYE 398 (511) Q Consensus 320 ~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~-~~~~~~~ 398 (511) .+..+.....|+..-++|..-.+ ++-|.. .+.....+..+|.-+++.|...+...-.. ....... T Consensus 249 ~e~~~~~~~~Ia~~fgVpp~~l~--~~~sn~------------e~~~~~~~~~~l~P~~~~ie~~l~~~L~~~~~~~~~~ 314 (385) T protein:vir:95 249 NELKKTVLTDVARMIGVPPSLVL--GEMADL------------EKTIESYLQFCINPLLRKIEAELNSKFFYQDEYLNDD 314 (385) T ss_pred HHHHHHHHHHHHHHhCCCHHHhc--CCCcCH------------HHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhcccce Confidence 66777788888888888864432 111110 11223444555555555555554421111 1111224 Q ss_pred eeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccc Q lcl|NC_018086. 399 VTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAA 476 (511) Q Consensus 399 i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 476 (511) +.+.+..-+..|..+.++++.++ .|+++.-.++..++.-+-++...++.-. ..+. .+.+ . T Consensus 315 ~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~gd~~~~-----------~~n~----~~~~---~ 376 (385) T protein:vir:95 315 MHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPELDKFII-----------TKNL----QSAD---A 376 (385) T ss_pred EEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeee-----------cccc----eecc---c Confidence 55666677778889999998887 4789888888888653211000000000 0000 0000 0 Q ss_pred cccCCCCCC Q lcl|NC_018086. 477 NKLDKNPAN 485 (511) Q Consensus 477 ~~~~~~~~~ 485 (511) -++++..+. T Consensus 377 ~kgge~~~e 385 (385) T protein:vir:95 377 FKGGESNEE 385 (385) T ss_pred ccCCCCCCC Confidence 000111100 No 248 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=76.88 E-value=0.13 Score=25.43 Aligned_cols=241 Identities=10% Similarity=-0.067 Sum_probs=103.4 Q ss_pred CccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHH Q lcl|NC_018086. 3 IPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPK 82 (511) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k 82 (511) |+||.-....-. . ..... +...+ ....+........ .. ...-+..+-.. T Consensus 1 MglF~~~~~r~~-------~--~~~~~---~~~~~--------------~~~~~~~~~~~~~--v~---~~~al~~~~v~ 49 (251) T protein:vir:46 1 MGIFYKNEKRDL-------Q--YNEDD---LQMMV--------------QTLPSFQGTKLRQ--YK---DIEAIRHSDIF 49 (251) T ss_pred CCcccccccccc-------C--CCccc---hhhhh--------------hhhccccCcCcce--ec---hhhhhccHHHH Confidence 555432110000 0 00000 00100 0011111000000 00 00011223344 Q ss_pred HHHHHHHhhhhccCceecCch--hhHHHHHHHHhc--cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcc Q lcl|NC_018086. 83 LLVDTSTAYLAGEPITESGDE--KTIKAMQPVFKE--NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSP 154 (511) Q Consensus 83 ~ivd~~~~~l~g~~~~~~~d~--~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p 154 (511) ..|+..++-+-.-|+.+-.+. .....+..++.. |. .......+..+.+.+|.||+.+..+.+|++ .+..++| T Consensus 50 ~~i~~ia~~iA~lp~~~~~~~~~~~~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~ 129 (251) T protein:vir:46 50 TAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKT 129 (251) T ss_pred HHHHHHHHhHhhCceEEeeCccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECC Confidence 566777766666677653221 122234444432 32 335667788889999999999999988875 5888999 Q ss_pred cceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccce Q lcl|NC_018086. 155 MNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPV 234 (511) Q Consensus 155 ~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 234 (511) ..+.+..+++. .+.. +++.......+ ....+.++.+.+++.- + T Consensus 130 ~~v~v~~~~~g--~~~~--~~~~~~~~~~g----~~~~~~~~diiH~r~~----------------------------~- 172 (251) T protein:vir:46 130 SEIELKSDARG--RLYY--FHQRIDSNGNN----IERNVKFEDMLDIKFY----------------------------S- 172 (251) T ss_pred ceEEEEECCCC--cEEE--EEEEeccCCcc----eeEEECCccEEEecCc----------------------------C- Confidence 99988776532 1211 11111111111 1123455555555310 0 Q ss_pred EeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhC--ceeeecCCCceeeeec Q lcl|NC_018086. 235 LEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKND--RVIVTDEDGMVKFITK 312 (511) Q Consensus 235 v~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~--~~i~~~~~~~~~~~~~ 312 (511) .+.-.|.|.++.+...+........-..+.+...+.|-.+++-...-.+++....++.. ......+++. -+.. T Consensus 173 ---~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~~e~~~~~~~~~~~~~~g~~n~g--~~~~ 247 (251) T protein:vir:46 173 ---LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFPKVLVELNKLG--KLSY 247 (251) T ss_pred ---CCCeeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccc--cccc Confidence 01125777777776666666655555556666666676665432211122222222110 0111111111 0100 Q ss_pred CCCH Q lcl|NC_018086. 313 DVND 316 (511) Q Consensus 313 ~~~~ 316 (511) ..+. T Consensus 248 gm~~ 251 (251) T protein:vir:46 248 SMNQ 251 (251) T ss_pred ccCC Confidence 0111 No 249 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=75.86 E-value=0.14 Score=25.23 Aligned_cols=462 Identities=10% Similarity=0.032 Sum_probs=180.6 Q ss_pred CCCccchhhcccccCchhhHhhhhccCCCHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCcccccCCcCccccccceeccc Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHS-RSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHN 79 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n 79 (511) |.- .+-.++..+++-... -.-..++...+|. +=..+-+...+-|.+...- +.... .| .| T Consensus 1 m~~----~~~~~~~~tpe~la~------~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~-----~~~~~---~r--~n 60 (663) T protein:vir:34 1 MNE----SQPTDFADTPQGWAQ------RWQEEMSAAREPLEKWHTQGKEIVKRYRDERDS-----AHDAE---TR--WN 60 (663) T ss_pred CCc----cccccchhcchhHHH------HHHHHHHHHHhccchHHHHHHHHHHHhhccccC-----CCccc---cc--cc Confidence 221 111334433222100 0001112222221 1112233445555553321 11111 12 24 Q ss_pred hHHHHHHHHHhhhhccCcee------cC-chhhH----HHHHHHH------hccChhHHHHHHHHHHhhCCeEEEEeee- Q lcl|NC_018086. 80 FPKLLVDTSTAYLAGEPITE------SG-DEKTI----KAMQPVF------KENYVTDVNSEEVKLSGIFGHCFEIHWI- 141 (511) Q Consensus 80 ~~k~ivd~~~~~l~g~~~~~------~~-d~~~~----~~l~~~~------~~n~~~~~~~~~~~~a~~~G~~~~~v~~- 141 (511) +.---|..+.--+.+.++.. .+ |.+.. +-+.+.+ ++++|+..+....++++.+|+|.+.|-. T Consensus 61 l~~sni~~i~P~iYar~P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye 140 (663) T protein:vir:34 61 LFSTNIQTQMASLYGQTPKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYE 140 (663) T ss_pred hhhhhHHHHhhhhhcCCCcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEee Confidence 33333444444444544332 22 22222 2233333 4466899999999999999987665533 Q ss_pred -------------CCC-C----------------ceEEEEEcccceEEEecCCCCCceEE--EEEEEEEe---------- Q lcl|NC_018086. 142 -------------DRN-K----------------KHRFKAVSPMNCLIAYSADLDEEPVA--AIYYNTVI---------- 179 (511) Q Consensus 142 -------------~~~-g----------------~~~i~~~~p~~~~~v~d~~~~~~~~~--~v~~~~~~---------- 179 (511) |+. + .++|..+.=.++ ++++...+.-.. +.|.|... T Consensus 141 ~~~~~~~~~~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~df--l~~pAr~W~ev~wva~r~~mtk~e~~~rf~~~ 218 (663) T protein:vir:34 141 VEWEEVAGVDAILDEATGAELAAAVPPTQRKAYECVETDYLHWQDV--LWSPARVWHEVRWLAFRNLLDMREFNARFDAD 218 (663) T ss_pred cccchhccccccCCCccccchhcccccchhhcccceeeeeechhhc--ccchhhccccccceeeeccCCHHHHHHhhcCC Confidence 110 0 133333222221 222222221111 11111100 Q ss_pred -------------ec---CCc------ceEEEEEEEcCCcEEEEEE-ccCcccccccccccccccccceeccCCccceEe Q lcl|NC_018086. 180 -------------SD---ITG------HQIRTYEVYTEDLIYKFST-DDEREVYREIPEELEIKDYEVHPNLLQKFPVLE 236 (511) Q Consensus 180 -------------~~---~~~------~~~~~~~~~~~~~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 236 (511) .. .+| ......|+|....-..|+. ++....+.. .+......+|-=||..- T Consensus 219 ~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~VwEIWdK~~~~V~w~~eg~~~~L~~-------~~p~lgl~~ffPcPrpl 291 (663) T protein:vir:34 219 GSRNLWASVPKVGKPKDGKDGQSCHPWDRAEVWEIWDKGGRKVDWYVEGYSAVLDT-------QPDPLGLESFFPCPKPL 291 (663) T ss_pred hhhhhhhhccCcCCccccCCCCCcchhcCcceeEEEecCCcEEEEEEcCcceeccc-------CCCCCCCCCCCCCcccc Confidence 00 000 1122335665544333222 222111111 11111112222255554 Q ss_pred ecC----CcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCCCccchhhhhhhhCceeeecC------CCc Q lcl|NC_018086. 237 IIA----NEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLWLQGFDLSADSDSISNMKNDRVIVTDE------DGM 306 (511) Q Consensus 237 ~~n----~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~~G~~~~~~~~~~~~~~~~~~i~~~~------~~~ 306 (511) +++ +-...++|.-...+++.+|...-.+. .+...-.+-.+..+..++.......+-..+.++-++. .++ T Consensus 292 ~~~~~~ds~ipvpd~~~y~~~~~E~n~~t~Rin-~l~d~ikv~gvy~~~~g~~i~~~l~~a~~n~lvpV~~~~~~~~~gg 370 (663) T protein:vir:34 292 LANWTTDKVVPRPDFVLAQDLYKEIDLVSTRIT-LLERAIRVVGVYDKSSGLTIGRLLSEAAQNDLIPVENWLTFADKGG 370 (663) T ss_pred cceecCCCeecCCcHHHHHHHHHHHHHHHHHHH-HHHhhhhhceeeccccchhHHHHHHHhhCCCceecchhhhhhhhcC Confidence 443 23467888888999999987654443 3333333323322111111112122222223333221 122 Q ss_pred ----eeeeecCC---CHHHHHHHHHHHHHHHHHHhCccccccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 307 ----VKFITKDV---NDKHIENIKNRAKLDIFSLSQTPDLVSKDF-TAASGQALKAATQPLENKSAVKESKFRKVLAKRY 378 (511) Q Consensus 307 ----~~~~~~~~---~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~ 378 (511) +.++-.+. ....+-.....++.+++++|++-+..=+.. .+-++.|-..+-+-+..++.+++.......+++. T Consensus 371 ~~k~I~~~pi~~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rga~~a~ETatAQ~IKsq~gS~RIqe~qdevqR~arDi~ 450 (663) T protein:vir:34 371 LRGVVDWFPLEPVVAALTSLRDYRRELVDALHQVTGMADIMRGASDPRETAMAQGVKAKFGSIRLQRLQDEVARFASDIQ 450 (663) T ss_pred ccchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhcccCcchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHH Confidence 22322221 123445667788999999999876654444 3567778778888899999999999999999999 Q ss_pred HHHHHHHHhcCC--------------------------CccccccceeEEeCCCCCcCHHHHHHHHHHH-hcc--CChHH Q lcl|NC_018086. 379 ELVCSYLEFMNK--------------------------AKDLKPYEVTPVFVRNLPQSYAELADMAVKL-RDM--LPDET 429 (511) Q Consensus 379 ~li~~~~~~~~~--------------------------~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~-~g~--~s~et 429 (511) ++...++..... ...+....+.|.=....-.|..+.-+..+++ .++ +++.. T Consensus 451 ql~AEIl~~~~~~etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~D~~~eK~~~~E~l~~i~~~~qq~ 530 (663) T protein:vir:34 451 RLKAEVIAEHYDVASILAQANAEFTFDKELAPKAAELIKSRFSMYRVEVKPEAVSLQDFAALRNEKMEVLSGIASFMQGV 530 (663) T ss_pred HHHHHHHHHhcCHHHHHHHhcCCCCcccchhHHHHHHhcCCCcceeeeeccCCCCcCChHHHHHHHHHHHHHHHHHHHHH Confidence 998887642110 0011222344444444445554433333322 111 11111 Q ss_pred --HHHhCCCCCCHHHHHHHHHHHH------HHHHHHHHhhccccccCCCCCCccccccCCCCCCccccccCCCCcccc-- Q lcl|NC_018086. 430 --IINQFPWITDARQEVEKADAQR------QKRADIALQNFKQTSAVQGASTAAANKLDKNPANTSTITTTDPVAAKE-- 499 (511) Q Consensus 430 --~~~~l~~v~d~~~E~~ri~~E~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 499 (511) ++.+.|..-..-.| +.++- ....+.+.+.+.+.. ++.+.+++. .+|....+ T Consensus 531 ~pl~~q~p~~~p~l~E---llk~~~~~f~~~~qie~ai~~~~~~~----------e~aa~~~~~------~~pa~~~~~~ 591 (663) T protein:vir:34 531 APLAQQVPGSAPFLLQ---MLKWSVSGLRGSSTIEGVLDKAIAAA----------EEAQKQAAQ------QSPAPQQPDP 591 (663) T ss_pred HHHHHhhhhhHHHHHH---HHHHHhhcCChhhhHHHHHHHHHhhh----------HHHhhccCC------CCcccchhhH Confidence 11222211000111 11110 001111111111100 000000000 00000000 Q ss_pred ccccCCCCCCCC Q lcl|NC_018086. 500 QEKAIQKKPKTD 511 (511) Q Consensus 500 ~~~~~~~~~~~~ 511 (511) +-..-+.|..++ T Consensus 592 k~~~~q~k~q~~ 603 (663) T protein:vir:34 592 KVVAQAMKGQQE 603 (663) T ss_pred HHHHHHHHHHHH Confidence 000011122222 No 250 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=70.22 E-value=0.21 Score=24.28 Aligned_cols=301 Identities=13% Similarity=0.077 Sum_probs=104.7 Q ss_pred HHHHHHHHHHHHHHhcCCCcccccCCcCcccccccee-ccch--HHH------HHHHHHhhhhcc----CceecCchhhH Q lcl|NC_018086. 40 HSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKI-VHNF--PKL------LVDTSTAYLAGE----PITESGDEKTI 106 (511) Q Consensus 40 ~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri-~~n~--~k~------ivd~~~~~l~g~----~~~~~~d~~~~ 106 (511) .+ ++-+...+.-..........+...+. ++.| +.. +++-.--+..|+ |+...+--+.. T Consensus 1 ~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~ 71 (351) T protein:vir:78 1 MS---------KRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSF 71 (351) T ss_pred CC---------CCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhccCceecCCCCHHHHHHHH Confidence 00 00000000000000000000000010 1111 111 111111111121 11111000000 Q ss_pred -------HHH-------HHHHhccCh--hHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCce Q lcl|NC_018086. 107 -------KAM-------QPVFKENYV--TDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEP 169 (511) Q Consensus 107 -------~~l-------~~~~~~n~~--~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~ 169 (511) ..| ...+.-|.. ...+.+++.+.+.+|.||+.+..+..|++ .+..++|..+.+..+.+ T Consensus 72 ~~~~~h~~~l~~k~n~l~~~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~----- 146 (351) T protein:vir:78 72 RASTHHSSALFFKANVLASTFRPHRWLSRHAFERWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFS----- 146 (351) T ss_pred hhhHhhhhhhhhhhhHHhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCCEEEEEEecCcceEEeeeCC----- Confidence 000 011112221 23356788899999999999888888864 45666666654432221 Q ss_pred EEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHH Q lcl|NC_018086. 170 VAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEA 249 (511) Q Consensus 170 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~ 249 (511) ++|.... .+.. ..|.++.++++..- + | .+.-+|.|.+.. T Consensus 147 ----~~~~~~~--~~~~----~~~~~~eVihir~~----------------------~-----~----~~~~yGl~~~~~ 185 (351) T protein:vir:78 147 ----GFVYVNG--WQER----HEFAPDSVFQLVRP----------------------D-----I----NQEVYGLPEYLS 185 (351) T ss_pred ----eEEEEec--CCeE----EEEccccEEEEcCC----------------------C-----C----CCCcccccHHHH Confidence 1111111 1111 11333333333100 0 0 012257777765 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeEe--ecCCCCccchhhhhh----hh-------CceeeecC---CCceeeee-- Q lcl|NC_018086. 250 QLSLIDAYNLAVSDSVNDIAYWNDAYLWL--QGFDLSADSDSISNM----KN-------DRVIVTDE---DGMVKFIT-- 311 (511) Q Consensus 250 v~~l~d~~~~~~s~~~~~~~~~~~p~l~~--~G~~~~~~~~~~~~~----~~-------~~~i~~~~---~~~~~~~~-- 311 (511) ....+..-+.+..-..+..+..+.|-.++ +|...++ +....+ +. .+++.+.. +.++++.- T Consensus 186 a~~si~l~~~a~~~~~~~f~NGa~pggIl~~~~~~ls~--e~~~~lr~~~~~~~G~~N~~~~~v~~~~g~~~g~k~~pls 263 (351) T protein:vir:78 186 SLHSAWLNESSTLFRRKYYENGSHAGFILYMTDAAQKQ--DDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVS 263 (351) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCH--HHHHHHHHHHHHhcCcccccceeeecCCCCccceeEEEcC Confidence 44444433322222233344445554444 4433322 222222 11 12333322 22344433 Q ss_pred cCCCHHHHHHHHHHHHHHHHHHhCcccccccccc-Cc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_018086. 312 KDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-AA-SGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMN 389 (511) Q Consensus 312 ~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~ 389 (511) .......+.+..+.....|+..-++|..-.+... +. +...++.. ....+...|.-+++.|..+....+ T Consensus 264 ~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~----------~~~f~~~~l~P~~~~iee~n~~l~ 333 (351) T protein:vir:78 264 EVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTA----------ARVFGRNEIRPLQARFAELNDWLG 333 (351) T ss_pred CChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHH----------HHHHHHHHHHHHHHHHHHHHhhcC Confidence 3333455677777788899999999865443221 11 11111100 001112222222222222111111 Q ss_pred CCccccccceeEEeCCCCCcCHHHHH Q lcl|NC_018086. 390 KAKDLKPYEVTPVFVRNLPQSYAELA 415 (511) Q Consensus 390 ~~~~~~~~~i~i~f~~~~p~d~~e~a 415 (511) . + .|.|++..-..-.+.+ T Consensus 334 ~-------~-~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 334 D-------E-VVRFDDYEIPPAPVAA 351 (351) T ss_pred c-------c-ceecChhhhccccccC Confidence 0 1 1455443222211222 No 251 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=69.27 E-value=0.23 Score=24.13 Aligned_cols=411 Identities=12% Similarity=0.092 Sum_probs=166.1 Q ss_pred CCCccchh--hccccc----------CchhhHhhhhc-cCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcC Q lcl|NC_018086. 1 MAIPNGQI--NAGDII----------TTNIRRKHFIR-RNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFD 67 (511) Q Consensus 1 ~~~~~~~~--~~~~~~----------~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~ 67 (511) .+.++-.- ..|... ...+-.+.+.- .+.. +.....-+.+|+.+-.+++-+..+ T Consensus 27 ~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~-------~~~~~eLI~~YR~ma~~pEvd~Av------- 92 (524) T protein:vir:10 27 DLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPG-------MKTTRELIDTYRNLMNNYEVDNAV------- 92 (524) T ss_pred CCccccCccCCCCceeeeecccccccccceeeeehhcccccc-------cchHHHHHHHHHHHhhccchhhHH------- Confidence 22222111 111000 00000111110 0111 111122223444444444443322 Q ss_pred ccccccceeccchHHHHHHHHHhh-hhccCceec---------CchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEE Q lcl|NC_018086. 68 DTNKPNSKIVHNFPKLLVDTSTAY-LAGEPITES---------GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFE 137 (511) Q Consensus 68 ~~~~~~~ri~~n~~k~ivd~~~~~-l~g~~~~~~---------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~ 137 (511) ..||+..+-+ -...|+.+. ..+...+.+..++.--+|+....+..|.+.+.|+.|. T Consensus 93 --------------~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~f 158 (524) T protein:vir:10 93 --------------SEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFNDVLNHLSFQRKGSDHFRRWYVDSRIFF 158 (524) T ss_pred --------------HHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEE Confidence 1122221111 112233321 1233455677777778899999999999999999998 Q ss_pred EeeeCC----CCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccc Q lcl|NC_018086. 138 IHWIDR----NKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREI 213 (511) Q Consensus 138 ~v~~~~----~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 213 (511) ....|. +|-..+..+||+.+-.|..-... ...++... +.+...-+|.++. ..+...+ ... T Consensus 159 hKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~--~~~~~~vi--------~~~~e~f~Y~~~~-~~y~~~g--~~~--- 222 (524) T protein:vir:10 159 HKIIDPKRPKEGIKELRRLDPRQVQYVREIITE--TEAGTKIV--------KGYKEYFIYDTAH-ESYACDG--RMY--- 222 (524) T ss_pred EEEeeCCCccccceeeeeeCCccceeeeeeccC--CCccchhh--------cchhhheeeccCc-cccccCc--ccc--- Confidence 887764 46677888999987444321110 00110000 0111222444432 1121111 000 Q ss_pred ccccccccccceeccCCccc--eEeecCC----cccCchhHHHHHHHHHHH--HHHHHHHHHHHHhcCceeEeecCCCCc Q lcl|NC_018086. 214 PEELEIKDYEVHPNLLQKFP--VLEIIAN----EERLGDFEAQLSLIDAYN--LAVSDSVNDIAYWNDAYLWLQGFDLSA 285 (511) Q Consensus 214 ~~~~~~~~~~~~~~~~g~iP--vv~~~n~----~~g~s~~~~v~~l~d~~~--~~~s~~~~~~~~~~~p~l~~~G~~~~~ 285 (511) +.. +++ +|| .|.|... ..|.-.+.-+..-+..+| +++-|....-+..+.|-+=+.-.+... T Consensus 223 ~~~----------~~i-kI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGn 291 (524) T protein:vir:10 223 EAG----------TKI-KIPKAAIVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGN 291 (524) T ss_pred CCC----------cce-ecchhheeeeeccceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCC Confidence 000 001 222 1222211 011112233444455555 355666667777777755333222211 Q ss_pred cc-----hhhhhhh-h--Cce---------------------eeec--CCCc-eeeeecC--CCHHHHHHHHHHHHHHHH Q lcl|NC_018086. 286 DS-----DSISNMK-N--DRV---------------------IVTD--EDGM-VKFITKD--VNDKHIENIKNRAKLDIF 331 (511) Q Consensus 286 ~~-----~~~~~~~-~--~~~---------------------i~~~--~~~~-~~~~~~~--~~~~~~~~~~~~l~~~i~ 331 (511) .+ +-...+. . +++ +++| +++. .+.-|.+ .+...++. +.-+.+-+| T Consensus 292 lPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy 370 (524) T protein:vir:10 292 MPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMED-VRWFRQALY 370 (524) T ss_pred CCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHH-HHHHHHHHH Confidence 11 1111110 0 001 1122 2222 2222333 23333222 445555566 Q ss_pred HHhCcccccc--cc---ccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc--cceeEEeC Q lcl|NC_018086. 332 SLSQTPDLVS--KD---FTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKP--YEVTPVFV 404 (511) Q Consensus 332 ~~s~~p~~~~--~~---~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~--~~i~i~f~ 404 (511) +--++|-.-. +. +...-+..|-.-+.....-+.+.+..|..-+.++++.=+ +|...-...+++. ..+.+.|. T Consensus 371 ~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qL-ilKgiit~eew~~i~~~I~~~f~ 449 (524) T protein:vir:10 371 MALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKFEEVFLDPLKTNL-LLKGIITEDEWNDEINNIKIEFH 449 (524) T ss_pred HHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhccCCCHHHHHHHhhcceEEee Confidence 6667774222 21 211233344444445567778888888888888876422 2221112222222 34677774 Q ss_pred CCCCcCHHHH-------HHHHHHHh---c-cCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCC Q lcl|NC_018086. 405 RNLPQSYAEL-------ADMAVKLR---D-MLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGAST 473 (511) Q Consensus 405 ~~~p~d~~e~-------a~~~~~~~---g-~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (511) ..-.-.+... ++++..+. | .+|.+++++.+-.-+| +|++.++++-+++.+... +.+ ....-.+- T Consensus 450 ~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tD--eei~~~~k~I~~E~k~~~--~~~-~~~~~~~f 524 (524) T protein:vir:10 450 RDSYFTELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTD--EEIEQEAKQIEEESKEAR--FQD-PDQEQEDF 524 (524) T ss_pred ecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCH--HHHHHHHHHHHHHhhcCC--CCC-CchhhhcC Confidence 4333333322 33444443 3 4699999988544343 333333332222222110 000 00000000 No 252 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=69.25 E-value=0.23 Score=24.13 Aligned_cols=411 Identities=12% Similarity=0.090 Sum_probs=166.0 Q ss_pred CCCccchh--hccccc----------CchhhHhhhhc-cCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcC Q lcl|NC_018086. 1 MAIPNGQI--NAGDII----------TTNIRRKHFIR-RNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFD 67 (511) Q Consensus 1 ~~~~~~~~--~~~~~~----------~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~ 67 (511) .+.++-.- ..|... ...+-.+.+.- .+.. +.....-+.+|+.+-.+++-+..+ T Consensus 27 ~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~-------~~~~~eLI~~YR~ma~~pEvd~Av------- 92 (524) T protein:vir:72 27 DLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPG-------MKTTRELIDTYRNLMNNYEVDNAV------- 92 (524) T ss_pred CCccccCccCCCCceeeeecccccccccceeeeehhcccccc-------cchHHHHHHHHHHHhhccchhhHH------- Confidence 22222111 111000 00000111110 0111 111122223444444444443322 Q ss_pred ccccccceeccchHHHHHHHHHhh-hhccCceec---------CchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEE Q lcl|NC_018086. 68 DTNKPNSKIVHNFPKLLVDTSTAY-LAGEPITES---------GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFE 137 (511) Q Consensus 68 ~~~~~~~ri~~n~~k~ivd~~~~~-l~g~~~~~~---------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~ 137 (511) ..||+..+-+ -...|+.+. ..+...+.+..++.--+|+....+..|.+.+.|+.|. T Consensus 93 --------------~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~f 158 (524) T protein:vir:72 93 --------------SEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFSDVLNHLSFQRKGSDHFRRWYVDSRIFF 158 (524) T ss_pred --------------HHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEE Confidence 1122221111 112233321 1233455677777778899999999999999999998 Q ss_pred EeeeCC----CCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccc Q lcl|NC_018086. 138 IHWIDR----NKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREI 213 (511) Q Consensus 138 ~v~~~~----~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 213 (511) ....|. +|-..+..+||+.+-.|..-... ...++... +.+...-+|.++. ..+...+ ... T Consensus 159 hKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~--~~~~~~vi--------~~~~e~f~Y~~~~-~~y~~~g--~~~--- 222 (524) T protein:vir:72 159 HKIIDPKRPKEGIKELRRLDPRQVQYVREIITE--TEAGTKIV--------KGYKEYFIYDTAH-ESYACDG--RMY--- 222 (524) T ss_pred EEEEeCCCccccceeeeeeCCccceeeeeeccC--CCccchhh--------cchhhheeeccCc-cccccCc--ccc--- Confidence 887754 46677888999987444321110 00110000 0111222444432 1121111 000 Q ss_pred ccccccccccceeccCCccc--eEeecCC----cccCchhHHHHHHHHHHH--HHHHHHHHHHHHhcCceeEeecCCCCc Q lcl|NC_018086. 214 PEELEIKDYEVHPNLLQKFP--VLEIIAN----EERLGDFEAQLSLIDAYN--LAVSDSVNDIAYWNDAYLWLQGFDLSA 285 (511) Q Consensus 214 ~~~~~~~~~~~~~~~~g~iP--vv~~~n~----~~g~s~~~~v~~l~d~~~--~~~s~~~~~~~~~~~p~l~~~G~~~~~ 285 (511) +.. +++ +|| .|.|... ..|.-.+.-+..-+..+| +++-|....-+..+.|-+=+.-.+... T Consensus 223 ~~~----------~~i-kI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGn 291 (524) T protein:vir:72 223 EAG----------TKI-KIPKAAVVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGN 291 (524) T ss_pred CCC----------cce-ecchhheeeeeccceeCCCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCC Confidence 000 001 222 1222211 011112233444455555 355666667777777755333222211 Q ss_pred cc-----hhhhhhh-h--Cce---------------------eeec--CCCc-eeeeecC--CCHHHHHHHHHHHHHHHH Q lcl|NC_018086. 286 DS-----DSISNMK-N--DRV---------------------IVTD--EDGM-VKFITKD--VNDKHIENIKNRAKLDIF 331 (511) Q Consensus 286 ~~-----~~~~~~~-~--~~~---------------------i~~~--~~~~-~~~~~~~--~~~~~~~~~~~~l~~~i~ 331 (511) .+ +-...+. . +++ +++| +++. .+.-|.+ .+...++. +.-+.+-+| T Consensus 292 lPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy 370 (524) T protein:vir:72 292 MPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMED-IRWFRQALY 370 (524) T ss_pred CCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHH-HHHHHHHHH Confidence 11 1111110 0 001 1122 2222 2222333 23333222 445555566 Q ss_pred HHhCcccccc--cc---ccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc--cceeEEeC Q lcl|NC_018086. 332 SLSQTPDLVS--KD---FTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKP--YEVTPVFV 404 (511) Q Consensus 332 ~~s~~p~~~~--~~---~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~--~~i~i~f~ 404 (511) +--++|-.-. +. +...-+..|-.-+.....-+.+.+..|..-+.++++.=+ +|...-...+++. ..+.+.|. T Consensus 371 ~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qL-ilKgiit~eew~~i~~~I~~~f~ 449 (524) T protein:vir:72 371 MALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKFEEVFLDPLKTNL-LLKGIITEDEWNDEINNIKIEFH 449 (524) T ss_pred HHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhccCCCHHHHHHHhhcceEEee Confidence 6667774222 21 211233344444445567778888888888888876422 2221112222222 34677774 Q ss_pred CCCCcCHHHH-------HHHHHHHh---c-cCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCC Q lcl|NC_018086. 405 RNLPQSYAEL-------ADMAVKLR---D-MLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGAST 473 (511) Q Consensus 405 ~~~p~d~~e~-------a~~~~~~~---g-~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (511) ..-.-.+... ++++..+. | .+|.+++++.+-.-+| +|++.++++-+++.+... +.+ +.+..++ T Consensus 450 ~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tD--eei~~~~k~I~~E~k~~~--~~~--~~~~~~~ 523 (524) T protein:vir:72 450 RDSYFAELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTD--EEIEQEAKQIEEESKEAR--FQD--PDQEQED 523 (524) T ss_pred ecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCH--HHHHHHHHHHHHHhhcCC--CCC--Cchhhhc Confidence 4333333322 33444443 3 4699999988544343 333333332222222110 000 0000000 Q ss_pred c Q lcl|NC_018086. 474 A 474 (511) Q Consensus 474 ~ 474 (511) - T Consensus 524 f 524 (524) T protein:vir:72 524 F 524 (524) T ss_pred C Confidence 0 No 253 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=64.66 E-value=0.3 Score=23.48 Aligned_cols=294 Identities=12% Similarity=0.044 Sum_probs=99.9 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcccc-cCCcCccccccce---eccchHH------HHHHHHHhhhhcc----CceecCc Q lcl|NC_018086. 37 AEMHSRSSSAYGVLYDYYKGNHIAIQ-SRTFDDTNKPNSK---IVHNFPK------LLVDTSTAYLAGE----PITESGD 102 (511) Q Consensus 37 ~~~~~~~~~~~~~~~~yY~G~~~~~~-~~~~~~~~~~~~r---i~~n~~k------~ivd~~~~~l~g~----~~~~~~d 102 (511) +.+++++.. -...... ........+...+ +..+=|. .+.+-.--+..|+ |+...+= T Consensus 1 m~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~~~~~~~~~~pp~~~~~l 70 (350) T protein:vir:11 1 MSKRRSHRR----------QQPVTVQSAQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLECWPNGRWYEPPLSMEGL 70 (350) T ss_pred CCccccCCC----------cCccccCCcchhhhccccccceEEEEeCCceeecCcchhhHHHHHhhcCccccCCCCHHHH Confidence 111110000 0000000 0000000000000 0001000 1111111111111 1111000 Q ss_pred hh----------hHH----HHHHHHhccC-h-hHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCC Q lcl|NC_018086. 103 EK----------TIK----AMQPVFKENY-V-TDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADL 165 (511) Q Consensus 103 ~~----------~~~----~l~~~~~~n~-~-~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~ 165 (511) -. ... .+...+.-|. + ...+.+++.+.+.+|.||+.+..+..|++ .+..++|..+-+.-+.. T Consensus 71 a~~~~~~~~h~~~l~~k~n~l~~~~~Pn~~~t~~~f~~~v~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~vr~~~~~~- 149 (350) T protein:vir:11 71 AKSVGSSVYLQSGLKFKRNMLAKTFIPHRLLSRATFEQFSLDWLTFGSAYLEQPRSRLGTRMPLQAPLAKYMRRGTDLE- 149 (350) T ss_pred HHHHhhhhhhccchhhhhhhhhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCCEEEEEEeCCceeEeeecCC- Confidence 00 000 0001112232 1 23356778889999999999988888875 45666666553322111 Q ss_pred CCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCc Q lcl|NC_018086. 166 DEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLG 245 (511) Q Consensus 166 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s 245 (511) ++|.+.. .+.. ..|.++.++++..- + | .+.-+|.| T Consensus 150 --------~~~~~~~--~~~~----~~~~~~eVihir~~----------------------~-----~----~~~~yGls 184 (350) T protein:vir:11 150 --------TFYQVRS--WKDE----HEFEKGSVIQLREA----------------------D-----I----NQEIYGVP 184 (350) T ss_pred --------eEEEEee--CCeE----EEECcccEEEeCCC----------------------C-----C----CCCccccc Confidence 1111111 1111 11233333333100 0 0 01124777 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCceeE--eecCCCCccchhhhhh----hh-------CceeeecCC---Cceee Q lcl|NC_018086. 246 DFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLW--LQGFDLSADSDSISNM----KN-------DRVIVTDED---GMVKF 309 (511) Q Consensus 246 ~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~--~~G~~~~~~~~~~~~~----~~-------~~~i~~~~~---~~~~~ 309 (511) .+...+..+..-+....-........+.|-.+ ++|...++ +....+ .. .+++.+..+ .++++ T Consensus 185 ~~~~a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~~~~ls~--e~~~~l~~~~~~~~G~~N~~~~~v~~~~g~~~g~~~ 262 (350) T protein:vir:11 185 EWFCALQSALLNESATLFRRKYYNNGSHAGFILYMTDAAQNE--EDIDALRTALKTAKGPGNFRNLFVYAPNGKKEGIQL 262 (350) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCH--HHHHHHHHHHHHhcCccccCceeeecCCCCccceEE Confidence 76654444433222222222333444455444 44533322 222222 11 123333222 23444 Q ss_pred eec--CCCHHHHHHHHHHHHHHHHHHhCccccccccc-------cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 310 ITK--DVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-------TAASGQALKAATQPLENKSAVKESKFRKVLAKRYEL 380 (511) Q Consensus 310 ~~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-------~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~l 380 (511) .-. ......+.+..+...+.|+..-++|....+.. ++....+..+. ...|.-+++. T Consensus 263 ~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~---------------~~~L~P~~~~ 327 (350) T protein:vir:11 263 IPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNAGGFGSISDAAAVWA---------------SLELAPMQTR 327 (350) T ss_pred EEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcCCHHHHHHHHH---------------HHHHHHHHHH Confidence 332 23344567777788888999999986544322 11111222221 1122222222 Q ss_pred HHHHHHhcCCCccccccceeEEeCCCCCcCH Q lcl|NC_018086. 381 VCSYLEFMNKAKDLKPYEVTPVFVRNLPQSY 411 (511) Q Consensus 381 i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~ 411 (511) +..+....+. + .+.|.+-..... T Consensus 328 ie~ln~~l~~-------~-~~~F~~~~~~~l 350 (350) T protein:vir:11 328 LQQVNEMIGE-------E-VVRFAQFDAPGL 350 (350) T ss_pred HHHHHhhcCc-------c-ccccCcccccCC Confidence 2211111110 0 122332222221 No 254 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=60.30 E-value=0.38 Score=22.92 Aligned_cols=411 Identities=12% Similarity=0.083 Sum_probs=164.4 Q ss_pred CCCccc-------hhhccccc-----------------------CchhhHh-hhhccCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 1 MAIPNG-------QINAGDII-----------------------TTNIRRK-HFIRRNFDLRELITLAEMHSRSSSAYGV 49 (511) Q Consensus 1 ~~~~~~-------~~~~~~~~-----------------------~~~~~~~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 49 (511) ++-.|. +.++.... +...-.+ .+...+. -+.....-+.+|+. T Consensus 9 f~f~~~~de~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~-------~~~~~~eLI~~YR~ 81 (523) T protein:vir:68 9 FAPWAKMDERDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEP-------GLKSTRELIDTYRN 81 (523) T ss_pred hhhhhhhhhhhhhhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhcccc-------ccchHHHHHHHHHH Confidence 121111 00000000 0000000 0111111 01111222233444 Q ss_pred HHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhh-hhccCceec---------CchhhHHHHHHHHhccChh Q lcl|NC_018086. 50 LYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAY-LAGEPITES---------GDEKTIKAMQPVFKENYVT 119 (511) Q Consensus 50 ~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~-l~g~~~~~~---------~d~~~~~~l~~~~~~n~~~ 119 (511) +-.+++-+.. ...||+..+-+ -...|+.+. ..+...+.+..++.--+|+ T Consensus 82 ma~~pEvd~A---------------------v~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~ 140 (523) T protein:vir:68 82 LMTNYEVDNA---------------------VSEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNHLSFQ 140 (523) T ss_pred HhhccchhhH---------------------HHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccc Confidence 4444443322 12222222211 122233332 1233456677777778899 Q ss_pred HHHHHHHHHHhhCCeEEEEeeeCC----CCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcC Q lcl|NC_018086. 120 DVNSEEVKLSGIFGHCFEIHWIDR----NKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTE 195 (511) Q Consensus 120 ~~~~~~~~~a~~~G~~~~~v~~~~----~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~ 195 (511) ....+..|.+.+.|+.|.....|. +|-..+..+||+.+-.+..-.+.. ..++..+ +.+...-+|.+ T Consensus 141 ~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~--~~g~~vi--------~~~~e~f~Y~~ 210 (523) T protein:vir:68 141 RKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREVITTT--EAGVKIV--------KGYKEYFIYDT 210 (523) T ss_pred hhhhHHHHhheeeeEEEEEEEeeCCCccccceeeeeeCCcceeEEEeecCCC--Ccchhhh--------hhhhhheeecc Confidence 999999999999999998887764 466778889999764432211100 0111000 01112224444 Q ss_pred CcEEEEEEccCcccccccccccccccccceeccCCccc--eEeecCC----cccCchhHHHHHHHHHHH--HHHHHHHHH Q lcl|NC_018086. 196 DLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFP--VLEIIAN----EERLGDFEAQLSLIDAYN--LAVSDSVND 267 (511) Q Consensus 196 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP--vv~~~n~----~~g~s~~~~v~~l~d~~~--~~~s~~~~~ 267 (511) .... +...+ .. .+.. +++ +|| .|.|... ..|.-.+.-+..-+..+| +++-|.... T Consensus 211 ~~~~-~~~~g--~~---~~~~----------~~i-kI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVI 273 (523) T protein:vir:68 211 SHES-YACDG--RI---YEAG----------TKI-KIPKAAIVYAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVI 273 (523) T ss_pred cccc-ccccc--cc---cCCC----------cce-ecchhheeeeeccceeCCCCceeccchhhhHHHHhhHHHHhhHHH Confidence 4311 11100 00 0000 001 122 1222211 011112233444455555 355666667 Q ss_pred HHHhcCceeEeecCCCCccc-----hhhhhhh-h--Cce---------------------eeec--CCCc-eeeeecC-- Q lcl|NC_018086. 268 IAYWNDAYLWLQGFDLSADS-----DSISNMK-N--DRV---------------------IVTD--EDGM-VKFITKD-- 313 (511) Q Consensus 268 ~~~~~~p~l~~~G~~~~~~~-----~~~~~~~-~--~~~---------------------i~~~--~~~~-~~~~~~~-- 313 (511) -+..+.|-+=+.-.+....+ +-...+. . +++ +++| +++. .+.-|.+ T Consensus 274 YRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGg 353 (523) T protein:vir:68 274 YRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRIAYDATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGA 353 (523) T ss_pred HhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcceeEEeccCCeeccchhhhhhHhhhcccccCCCcccceeecccc Confidence 77777775533322221111 1111110 0 000 1122 2222 2222333 Q ss_pred CCHHHHHHHHHHHHHHHHHHhCccccc--cc--cccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_018086. 314 VNDKHIENIKNRAKLDIFSLSQTPDLV--SK--DFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMN 389 (511) Q Consensus 314 ~~~~~~~~~~~~l~~~i~~~s~~p~~~--~~--~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~ 389 (511) .+...++. +.-+.+-+|+--++|-.- .+ .|.-.-+..|-.-+.....-+.+.+..|..-+.++++.=+ +|...- T Consensus 354 qnlgem~D-V~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEikF~KFI~rLR~rFs~lf~~~Lk~qL-ilKgii 431 (523) T protein:vir:68 354 DNTGNMED-VRWFRNALYMALRIPITRIPSDQGGIQFDAGTSITRDELSFGKFIRELQHKFEEIFLDPLKTNL-ILKGII 431 (523) T ss_pred CCcChHHH-HHHHHHHHHHHhCCcceeecCCCcceecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhccCC Confidence 23333322 445555566666777422 12 1211122344444445567778888888888888876422 222111 Q ss_pred CCccccc--cceeEEeCCCCCcCHHHH-------HHHHHHHh---c-cCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 390 KAKDLKP--YEVTPVFVRNLPQSYAEL-------ADMAVKLR---D-MLPDETIINQFPWITDARQEVEKADAQRQKRAD 456 (511) Q Consensus 390 ~~~~~~~--~~i~i~f~~~~p~d~~e~-------a~~~~~~~---g-~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~ 456 (511) ...+++. ..+.+.|...-.-.+... ++++..+. | .+|.+++++.+-.-+| +|++.++++-+++.+ T Consensus 432 t~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tD--eei~~~~kqI~~E~k 509 (523) T protein:vir:68 432 TEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPFIGKYISHRTAMKDILQMSD--EEIEQEAKQIEEESK 509 (523) T ss_pred CHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCH--HHHHHHHHHHHHHhh Confidence 2222222 346777744333333322 33444443 3 4699999988543343 333333332222222 Q ss_pred HHHhhccccccCCCCCC Q lcl|NC_018086. 457 IALQNFKQTSAVQGAST 473 (511) Q Consensus 457 ~~~~~~~~~~~~~~~~~ 473 (511) ... +.+ ....-.+- T Consensus 510 ~~~--~~~-p~~e~~~f 523 (523) T protein:vir:68 510 EAR--FQD-PDQEQEDF 523 (523) T ss_pred cCC--CCC-CchhhhcC Confidence 110 000 00000000 No 255 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=58.86 E-value=0.4 Score=22.74 Aligned_cols=335 Identities=9% Similarity=0.008 Sum_probs=128.4 Q ss_pred HHHHHHHHHHhcCCCcccccCCcCccccccceec--cchHHHHHHHHHhhhhccCceec---Cc----h----hhHHHHH Q lcl|NC_018086. 44 SSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIV--HNFPKLLVDTSTAYLAGEPITES---GD----E----KTIKAMQ 110 (511) Q Consensus 44 ~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~--~n~~k~ivd~~~~~l~g~~~~~~---~d----~----~~~~~l~ 110 (511) ..-+.+...+-.+.-.- ..... .......+. .......|+..++-+-.-|+.+- .. + .....+. T Consensus 1 Mg~f~~~~~~~~~~~~~-~~~~~--~~~~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~~~~~~~~~l~ 77 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLNN-DTQRV--TAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLD 77 (378) T ss_pred CccchhhhhhhcccccC-Cccee--eecccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEcccccccccccccccchHH Confidence 11111111111111000 00000 000001111 12233445555555555576431 10 1 1123455 Q ss_pred HHHhc--c---ChhHHHHHHHHHHhhCCeEEEEeeeC-CCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCc Q lcl|NC_018086. 111 PVFKE--N---YVTDVNSEEVKLSGIFGHCFEIHWID-RNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITG 184 (511) Q Consensus 111 ~~~~~--n---~~~~~~~~~~~~a~~~G~~~~~v~~~-~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~ 184 (511) .+|.. | ........+....+.+|.+|++...+ ..|++. .+-|... + T Consensus 78 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~--~l~~~~~--------------------------~ 129 (378) T protein:vir:16 78 EVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELL--DLLFADD--------------------------K 129 (378) T ss_pred HHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEE--EEEecCC--------------------------e Confidence 55542 3 23455667788889999999764333 223321 1111100 0 Q ss_pred ceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 185 HQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDS 264 (511) Q Consensus 185 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~ 264 (511) ..|..+.++|+ ++.-.+......+..+.++++..++. T Consensus 130 ------~~~~~~diih~------------------------------------r~~~~~~~~~s~l~~~~~~i~~~~~~- 166 (378) T protein:vir:16 130 ------KEYKPEELVRL------------------------------------TSPFYINEDTSILDNALASIQTKLEQ- 166 (378) T ss_pred ------eEecccceEEe------------------------------------cCccCccchhHHHHHHHHHHHHHHhc- Confidence 00122233333 21111222233344444444433221 Q ss_pred HHHHHHhcCceeEeec-CCCCcc--ch----hhhhh-------hhCceeeecCCCceeeeecCCCHHHHHHHHHHHHHHH Q lcl|NC_018086. 265 VNDIAYWNDAYLWLQG-FDLSAD--SD----SISNM-------KNDRVIVTDEDGMVKFITKDVNDKHIENIKNRAKLDI 330 (511) Q Consensus 265 ~~~~~~~~~p~l~~~G-~~~~~~--~~----~~~~~-------~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i 330 (511) +.+-.++.- ...+++ .+ +...+ ..++++.++++.+++.++.+.....+ ..++.+.+.| T Consensus 167 -------~~~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~I 238 (378) T protein:vir:16 167 -------GKLRGLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSEL 238 (378) T ss_pred -------CccceeeEeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHH Confidence 222222221 111111 11 11111 12356777766666655544444444 3456777888 Q ss_pred HHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--------cccccceeEE Q lcl|NC_018086. 331 FSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAK--------DLKPYEVTPV 402 (511) Q Consensus 331 ~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~--------~~~~~~i~i~ 402 (511) +..-++|..-.. +...+. .....+..+|.-+++.|...+...--.. .....++.+. T Consensus 239 a~~fgVPp~~l~---g~~~e~-------------~~~~f~~~tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~ 302 (378) T protein:vir:16 239 LTGYFMNENILL---GTASQE-------------QQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVD 302 (378) T ss_pred HHHhCCCHHHhc---CCchHH-------------HHHHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeec Confidence 888888753331 111110 1112344455555555444443211000 0112235566 Q ss_pred eCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccC Q lcl|NC_018086. 403 FVRNLPQSYAELADMAVKLR--DMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLD 480 (511) Q Consensus 403 f~~~~p~d~~e~a~~~~~~~--g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (511) +..-...|..+.++++.++. |+++.-.++..++.-+-+. .+++.. +.+. ..............+ T Consensus 303 ~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~g--gD~~~~-----------~~n~-~~~~~~~~~~~~~~~ 368 (378) T protein:vir:16 303 NQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEG--GDVYIA-----------NLNA-VAVKNLSDLQGSRKD 368 (378) T ss_pred cchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--CCeEee-----------cccc-ccccchhhhcCccCC Confidence 67777788899999888874 7898888888876532110 000000 0000 000000000000000 Q ss_pred CCCCCccccccCCC Q lcl|NC_018086. 481 KNPANTSTITTTDP 494 (511) Q Consensus 481 ~~~~~~~~~~~~~~ 494 (511) +.+++.... + T Consensus 369 ~~~~~e~~n----e 378 (378) T protein:vir:16 369 VTSTDETNN----Q 378 (378) T ss_pred CCCCCCCCC----C Confidence 000000000 0 No 256 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=54.46 E-value=0.5 Score=22.22 Aligned_cols=406 Identities=10% Similarity=0.096 Sum_probs=161.1 Q ss_pred CCCccchh----hccccc--CchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccc Q lcl|NC_018086. 1 MAIPNGQI----NAGDII--TTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNS 74 (511) Q Consensus 1 ~~~~~~~~----~~~~~~--~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ 74 (511) .+-+-+.- ..+.-. ...+-...+.+..-+.+--..|| .+|..+-.+++-+..+ T Consensus 29 ~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI-------~~YR~ma~~pEvd~Av-------------- 87 (516) T protein:vir:10 29 IATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDLI-------NTYRQLTNNPEVERAV-------------- 87 (516) T ss_pred ccCCCCccCceeeecCcccccccceeeeeecccCccccHHHHH-------HHHHHhhhccchhHHH-------------- Confidence 11111100 000000 00111111111111111112233 3444444444443322 Q ss_pred eeccchHHHHHHHHHhh-hhccCceec---------CchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeC-- Q lcl|NC_018086. 75 KIVHNFPKLLVDTSTAY-LAGEPITES---------GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWID-- 142 (511) Q Consensus 75 ri~~n~~k~ivd~~~~~-l~g~~~~~~---------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~-- 142 (511) ..||+..+-+ -...|+.+. ..+...+.+..++.--+|+....+..|.+.+.|+.|.....| T Consensus 88 -------~eIvneaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~ 160 (516) T protein:vir:10 88 -------ANIVNEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICRLLDASRKLDTLFRRWYIDSRIFFHKIMPNP 160 (516) T ss_pred -------HHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhhhhcceEEEEEEecCc Confidence 1122221111 112233322 123345566777777889999999999999999999875554 Q ss_pred CCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcc----eEEEEEEEcCCcEEEEEEccCcccccccccccc Q lcl|NC_018086. 143 RNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGH----QIRTYEVYTEDLIYKFSTDDEREVYREIPEELE 218 (511) Q Consensus 143 ~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~----~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 218 (511) ++|-..+..+||+.+..+.-- . +.+.++. .+....+|+.+.. .|... +.. . T Consensus 161 k~GI~elr~lDPr~i~~vR~i-------------~-~~~~~~~~v~~~~~e~~~Y~~~~~-~~~~~-g~~-~-------- 215 (516) T protein:vir:10 161 KEGIVELRRLDPRHVEYYREI-------------V-TSDVGGTSVVKGYREFFVYTTGNE-GYAYN-GRL-F-------- 215 (516) T ss_pred ccceeeeeeeCCcceeeEEee-------------e-cccCcchhhhhceeeeeeeecCcc-ceecc-ccc-c-------- Confidence 356677888999987654321 1 1111111 1111223333221 11111 100 0 Q ss_pred cccccceeccCCccc--eEeecCCc----ccCchhHHHHHHHHHHH--HHHHHHHHHHHHhcCceeEeecCCCCccc--- Q lcl|NC_018086. 219 IKDYEVHPNLLQKFP--VLEIIANE----ERLGDFEAQLSLIDAYN--LAVSDSVNDIAYWNDAYLWLQGFDLSADS--- 287 (511) Q Consensus 219 ~~~~~~~~~~~g~iP--vv~~~n~~----~g~s~~~~v~~l~d~~~--~~~s~~~~~~~~~~~p~l~~~G~~~~~~~--- 287 (511) .++.-=+|| .|.|.... .+...+.-+..-+..+| +++-|....-+..+.|-+=+.-.+....+ T Consensus 216 ------~~~~~ikI~~daI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~K 289 (516) T protein:vir:10 216 ------EPNTRIKIPRSAIVYAHSGLQDCSDRGIVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRK 289 (516) T ss_pred ------CCCCceecchhheeeeecCcccCCCCceeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchh Confidence 000000222 11111110 01111233444555555 35566666777777775533322221111 Q ss_pred --hhhhhhh-h--Cce---------------------eeec--CCCc-eeeeecCC--CHHHHHHHHHHHHHHHHHHhCc Q lcl|NC_018086. 288 --DSISNMK-N--DRV---------------------IVTD--EDGM-VKFITKDV--NDKHIENIKNRAKLDIFSLSQT 336 (511) Q Consensus 288 --~~~~~~~-~--~~~---------------------i~~~--~~~~-~~~~~~~~--~~~~~~~~~~~l~~~i~~~s~~ 336 (511) .-...+. . +++ +++| +++. .+.-|.+. +...+. -+.-+.+-+|+--++ T Consensus 290 AeqYl~~iM~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~-DV~YF~kkLy~aLnV 368 (516) T protein:vir:10 290 ATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVTSLPGAQTMGEMD-DVRWFNKKLYEALRI 368 (516) T ss_pred HHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHH-HHHHHHHHHHHHhCC Confidence 1111110 0 001 1122 2222 22223332 333222 245555666777777 Q ss_pred cccc--c-cccc--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc--cceeEEeCCCCCc Q lcl|NC_018086. 337 PDLV--S-KDFT--AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKP--YEVTPVFVRNLPQ 409 (511) Q Consensus 337 p~~~--~-~~~~--~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~--~~i~i~f~~~~p~ 409 (511) |-.- . +.++ ..-+..|---+.....-+.+.+..|...+.++|+.=+ +|...-...+++. ..+.+.|...-.- T Consensus 369 P~SRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lF~~~L~~qL-ilKgIit~eeW~~i~~~I~~~f~~Dn~f 447 (516) T protein:vir:10 369 PLSRMPRDDGGMVIGGQDMAITRDELDFRKFIVQLQHNFEEIFLDPLKTNL-IYKKIILESEWEEQINNIKVNFHQDSYY 447 (516) T ss_pred CcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hhcCCCCHHHHHHHhhcceEEeeecchH Confidence 7432 2 2221 1233444333444456677777777777777765411 1211111222222 2466777443333 Q ss_pred CHHHH-------HHHHHHH---h-ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCC Q lcl|NC_018086. 410 SYAEL-------ADMAVKL---R-DMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGAST 473 (511) Q Consensus 410 d~~e~-------a~~~~~~---~-g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (511) .+... ++++..+ + ..+|.+++++.+-..+| +|+..++++-+++.+.. -+++ +...++- T Consensus 448 ~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~yi~k~ILr~tD--eei~~~~k~I~~E~~~~--~~~~--p~~e~~f 516 (516) T protein:vir:10 448 TELKDIETLRQRVDALSQIEPYVGKYVSHDYVMKNILQMTD--EQIAQEEKQIEKEANVK--RFQN--PENEDDF 516 (516) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCH--hHHHHHHHHHHHhhhCC--CCCC--CCccccC Confidence 33222 3344444 2 36899999998644443 22322222222211110 0111 0000111 No 257 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=52.31 E-value=0.56 Score=21.97 Aligned_cols=291 Identities=12% Similarity=0.085 Sum_probs=97.7 Q ss_pred cCCCcccccCCcCccccccce-eccch--HH------HHHHHHHhhh-hcc----CceecCc------h----hhHH--- Q lcl|NC_018086. 55 KGNHIAIQSRTFDDTNKPNSK-IVHNF--PK------LLVDTSTAYL-AGE----PITESGD------E----KTIK--- 107 (511) Q Consensus 55 ~G~~~~~~~~~~~~~~~~~~r-i~~n~--~k------~ivd~~~~~l-~g~----~~~~~~d------~----~~~~--- 107 (511) ..+.... +.......+..+ .++.| |- .+.+ ...+. .|+ |+...+= . .... T Consensus 1 m~~~~~~--~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~-~~~~~~~~~~~~pp~~~~~la~l~~a~~~h~s~i~~k~ 77 (340) T protein:vir:98 1 MSKRKPR--KAVAMTASAPQKMEAFTFGEPVPVLDKRDILD-YVECISNGKWYEPPVSFSGLAKSLRSAVHHSSPIYVKR 77 (340) T ss_pred CCCCCCC--ccccccccCccceeEEEcCCceeecCcchhhh-hhhhhhcCceecCCCCHHHHHHHHHhccccchhhhhhh Confidence 1111100 000000000000 00000 00 0111 11110 111 1111000 0 0000 Q ss_pred -HHHHHHhccCh--hHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCC Q lcl|NC_018086. 108 -AMQPVFKENYV--TDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDIT 183 (511) Q Consensus 108 -~l~~~~~~n~~--~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~ 183 (511) .+...+.-|.. ...+..++.+.+.+|.||+.+-.+..|++ .+..++|..+-...+.. ++|.+.. . T Consensus 78 n~l~~~~~Pn~~lt~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~~~~---------~~~~~~~--~ 146 (340) T protein:vir:98 78 NVLASTYIPHPLLSRQDFSRFALDYLVFGNAFLEQRHSVTGQLIKLLTSPAKYTRRGVDDS---------VFWFVEN--F 146 (340) T ss_pred hHHhhccCCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceEEEcccCc---------EEEEEec--C Confidence 01111122221 13456677888999999999888888875 34455554443211110 1111111 1 Q ss_pred cceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHH Q lcl|NC_018086. 184 GHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSD 263 (511) Q Consensus 184 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~ 263 (511) +.. ..|.++.++++..- + | ...-.|.|.+.....-+..-+.+..- T Consensus 147 ~~~----~~~~~~eViHir~~----------------------~-----~----~~~~~Gls~~~~a~~si~l~~aa~~~ 191 (340) T protein:vir:98 147 TQP----HEFAPDTVFHLLEP----------------------D-----I----NQEIYGLPEYLSALNSAWLNESATLF 191 (340) T ss_pred CeE----EEEccccEEEEcCC----------------------C-----C----CCCcccccHHHHHHHHHHHHHHHHHH Confidence 111 11333333333200 0 0 00124666666543333322211111 Q ss_pred HHHHHHHhcCcee--EeecCCCCccc--hhhhhhhh-------CceeeecCC---Cceeee--ecCCCHHHHHHHHHHHH Q lcl|NC_018086. 264 SVNDIAYWNDAYL--WLQGFDLSADS--DSISNMKN-------DRVIVTDED---GMVKFI--TKDVNDKHIENIKNRAK 327 (511) Q Consensus 264 ~~~~~~~~~~p~l--~~~G~~~~~~~--~~~~~~~~-------~~~i~~~~~---~~~~~~--~~~~~~~~~~~~~~~l~ 327 (511) .....+..+.|-. +++|...+++. .....+.. .+++.+..+ .++++. ........+.+..+..+ T Consensus 192 ~~~~f~NGa~pg~il~~~~~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~ 271 (340) T protein:vir:98 192 RRKYYQNGAHAGYIMYVTDPAQSATDVESLRDAMRNSKGLGNFKNLFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASA 271 (340) T ss_pred HHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHhhH Confidence 1222333344544 44554333221 11111211 123433222 234443 33334456777788888 Q ss_pred HHHHHHhCcccccccccc-CccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCC Q lcl|NC_018086. 328 LDIFSLSQTPDLVSKDFT-AASG-QALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVR 405 (511) Q Consensus 328 ~~i~~~s~~p~~~~~~~~-~~Sg-~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~ 405 (511) ..|+..-++|....+... +.++ ..++.. ....+...|.-+++.|..+-...+. ++ +.|++ T Consensus 272 ~eIa~a~~VPp~llGi~~~~t~~~sn~e~~----------~~~f~~~~l~Pl~~~iee~n~~L~~-------e~-~rF~~ 333 (340) T protein:vir:98 272 ADLMDAHRVPFQLMGGKPENIGSLGDVEKV----------AKVFVRNELSPLQDRFREVNDWLGM-------EV-IRFKE 333 (340) T ss_pred HHHHHHhCCCHHHhcccCCCCCccccHHHH----------HHHHHHHHHHHHHHHHHHHHhcccc-------cc-cccCc Confidence 899999999865443321 1111 111100 0011112222222222211111110 11 34433 Q ss_pred CCCcCHH Q lcl|NC_018086. 406 NLPQSYA 412 (511) Q Consensus 406 ~~p~d~~ 412 (511) ....+.. T Consensus 334 ~~l~~~d 340 (340) T protein:vir:98 334 YTLDNPE 340 (340) T ss_pred cccccCC Confidence 2222211 No 258 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=50.37 E-value=0.61 Score=21.75 Aligned_cols=287 Identities=13% Similarity=0.054 Sum_probs=97.7 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHH------HHHHHHHhhh-hcc----CceecCchh- Q lcl|NC_018086. 37 AEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPK------LLVDTSTAYL-AGE----PITESGDEK- 104 (511) Q Consensus 37 ~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k------~ivd~~~~~l-~g~----~~~~~~d~~- 104 (511) +.+.+. ..+............+. .-+..+=|. .+.+- .... .|+ |+...+=-. T Consensus 1 m~~~~~-------------~~~~~~~~~~~~~~~~~-~~~~f~~p~~v~~~~~~~~~-~~~~~~~~~~~pp~~~~~la~~ 65 (344) T protein:vir:60 1 MSKKKG-------------KTLQPAAKKMTASAPKM-EAFTFGEPVPVLDRRDILDY-VECISNGRWYEPPISFTGLAKS 65 (344) T ss_pred CCcccC-------------CCCCchHHhhcCCcCcE-EEEEcCCceeecCCcchhHH-HHhhhcCccccCCCCHHHHHHH Confidence 000000 00000000000000000 000011110 11111 1111 111 111110000 Q ss_pred ---------hH----HHHHHHHhccCh--hHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCc Q lcl|NC_018086. 105 ---------TI----KAMQPVFKENYV--TDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEE 168 (511) Q Consensus 105 ---------~~----~~l~~~~~~n~~--~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~ 168 (511) .. ..|...+.-|.. ...+..++.+.+.+|.||+.+-.+..|++ .+..++|..+-...+.+ T Consensus 66 ~~a~~~h~~~i~~k~n~l~~~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~i~rn~~G~~~~L~~l~~~~vr~~~~~~---- 141 (344) T protein:vir:60 66 LRAAVHHSSPIYVKRNILASTFIPHPWLSQQDFSRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEED---- 141 (344) T ss_pred HHhhhhhccchhhhhhHHHhhccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCcceEEEeecCC---- Confidence 00 001111222331 12456678889999999999888888875 35555555443322111 Q ss_pred eEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecC-----Cccc Q lcl|NC_018086. 169 PVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIA-----NEER 243 (511) Q Consensus 169 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g 243 (511) ++|.+.. .+.. ..|.++.+ +++++ .-.| T Consensus 142 -----~~~~v~~--~~~~----~~~~~~eI------------------------------------iHir~~~~~~~~yG 174 (344) T protein:vir:60 142 -----VYWWVPS--FNEP----TAFAPGSV------------------------------------FHLLEPDINQELYG 174 (344) T ss_pred -----eEEEEcc--CCeE----EEEcCccE------------------------------------EEEcCCCCCCCccc Confidence 1111110 1110 01222333 33321 1247 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeE--eecCCCCccc--hhhhhhhh------Cceeee--cC--CCceee Q lcl|NC_018086. 244 LGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLW--LQGFDLSADS--DSISNMKN------DRVIVT--DE--DGMVKF 309 (511) Q Consensus 244 ~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~--~~G~~~~~~~--~~~~~~~~------~~~i~~--~~--~~~~~~ 309 (511) .|.+.....-++.-.....-........+.|-.+ ++|...+++. .....+.. ++.+.+ ++ ..++++ T Consensus 175 lsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~~~ls~e~~~~ik~~~~~~~g~~~~r~~~l~~p~g~~~g~~~ 254 (344) T protein:vir:60 175 LPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKI 254 (344) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHHhcCCCCCcceEEecCCCCccceeE Confidence 7766654333332222111122223334444444 4454333211 11112211 122333 22 123444 Q ss_pred eec--CCCHHHHHHHHHHHHHHHHHHhCcccccccccc-------CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 310 ITK--DVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-------AASGQALKAATQPLENKSAVKESKFRKVLAKRYEL 380 (511) Q Consensus 310 ~~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-------~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~l 380 (511) .-. ......+.+..+..++.|+..-++|....+... +....++.+.... |.-+++. T Consensus 255 ~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~~f~~~~---------------L~Pl~~~ 319 (344) T protein:vir:60 255 IPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNE---------------LIPLQDR 319 (344) T ss_pred EEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHHHHHHH---------------HHHHHHH Confidence 432 333455777788888999999999865444321 1111122221111 1111111 Q ss_pred HHHHHHhcCCCccccccceeEEeCCCCC-cCHH Q lcl|NC_018086. 381 VCSYLEFMNKAKDLKPYEVTPVFVRNLP-QSYA 412 (511) Q Consensus 381 i~~~~~~~~~~~~~~~~~i~i~f~~~~p-~d~~ 412 (511) +..+-...+. + .+.|.+... .+.+ T Consensus 320 ~e~ln~~lg~-------~-~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 320 IREINGWLGQ-------E-VIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHhcCC-------c-ccccCccccCCCCC Confidence 1111111111 1 133432222 2222 No 259 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=49.12 E-value=0.65 Score=21.62 Aligned_cols=298 Identities=10% Similarity=-0.009 Sum_probs=101.4 Q ss_pred cCCCcccccCCcCccccccceeccch-HHHHHH--HHHhh---hh-c-----cCc-eecCchhh----------HH---- Q lcl|NC_018086. 55 KGNHIAIQSRTFDDTNKPNSKIVHNF-PKLLVD--TSTAY---LA-G-----EPI-TESGDEKT----------IK---- 107 (511) Q Consensus 55 ~G~~~~~~~~~~~~~~~~~~ri~~n~-~k~ivd--~~~~~---l~-g-----~~~-~~~~d~~~----------~~---- 107 (511) .-++... ........+..-..++. |-.+.+ .+.+| .. | +|+ ...+=-+. .. T Consensus 1 ~~~~~~~--~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~epp~~~~~La~l~~~n~~h~~~i~~k~N 78 (348) T protein:vir:26 1 MTEQLIH--SHTTDGTESKSVYSFDPNPEPVDTNSWMTRYCELFYNDFDDYWEPPISLKGLAEIANANGYHGSLLKARAN 78 (348) T ss_pred CCccccc--hhhccccCCceEEEecCCCeeecCcchHHHHHHHHhcCCCccccCCCCHHHHHHHHhhhhhhhhhHhhhhh Confidence 1111000 00000000000011110 111110 01111 11 1 111 11000000 00 Q ss_pred HHHHHHhccCh--hHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCc Q lcl|NC_018086. 108 AMQPVFKENYV--TDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITG 184 (511) Q Consensus 108 ~l~~~~~~n~~--~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~ 184 (511) .+...+.-|.. ...+.+++.+.+.+|.||+.+-.+..|++ .+..++|..+-+. .+. ++|.... ++ T Consensus 79 ~l~~~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~v~~~-~d~---------~~~~~~~--~g 146 (348) T protein:vir:26 79 YVAGRFMNGGGLPMYKMNSACWDYFGLGMSAFVKIRSYLKNVIALEPLPMVHMRKR-KNG---------DFVQLLR--NN 146 (348) T ss_pred HHhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCcEEEEEEecCceeEee-ecC---------cEEEEEe--cC Confidence 00000111221 24456778899999999999888888875 4555666544221 110 0111111 11 Q ss_pred ceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 185 HQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDS 264 (511) Q Consensus 185 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~ 264 (511) .. ..|.++.+.++..- ++ .+.-.|.|.+...+.-+..-.....-. T Consensus 147 ~~----~~f~~~dIiHir~~----------------------~~---------~~~~~Gls~~~~a~~si~l~~~a~~~~ 191 (348) T protein:vir:26 147 EQ----KVFKAKDVIFIPQY----------------------DP---------QQQIYGLPDYLGSIQSSLLNRDATLFR 191 (348) T ss_pred eE----EEEcCccEEEEcCC----------------------CC---------CCCcccccHHHHHHHHHHHHHHHHHHH Confidence 11 11333443333100 00 011246676665443333222222222 Q ss_pred HHHHHHhcCceeEe--ecCCCCccchhhhhh----hh-------Cceeee-cC--CCceeee--ecCCCHHHHHHHHHHH Q lcl|NC_018086. 265 VNDIAYWNDAYLWL--QGFDLSADSDSISNM----KN-------DRVIVT-DE--DGMVKFI--TKDVNDKHIENIKNRA 326 (511) Q Consensus 265 ~~~~~~~~~p~l~~--~G~~~~~~~~~~~~~----~~-------~~~i~~-~~--~~~~~~~--~~~~~~~~~~~~~~~l 326 (511) ....+..+.|-.++ ++...+ ++....+ .. .+++.+ ++ +.++++. ........+.+..+.- T Consensus 192 ~~~f~NGa~pg~Il~~~~~~ls--~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~Gi~~~pis~~~~d~qf~e~k~~t 269 (348) T protein:vir:26 192 RRYYLNGAHMGFIFYATDPNLS--EADEKALKEKIASSKGIGNFRSMFVNIPNGKEKGIQLIPVGDIATKDEFERIKNIT 269 (348) T ss_pred HHHHhccCCCceEEEecCCCCC--HHHHHHHHHHHHHhcCcccccceeEEcCCCCccceeEEEccCChhHHHHHHHHHhh Confidence 22334445555554 443332 2222222 11 123333 22 1234443 2223344566677777 Q ss_pred HHHHHHHhCccccccccc-cC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeC Q lcl|NC_018086. 327 KLDIFSLSQTPDLVSKDF-TA-ASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFV 404 (511) Q Consensus 327 ~~~i~~~s~~p~~~~~~~-~~-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~ 404 (511) +.+|+..-++|....+.. .+ .+...+... ....+...|.-+++.+...++..-.. .....+.+.|+ T Consensus 270 ~~dIa~af~VPp~llGi~~~~~~~~sn~e~~----------~~~f~~~~l~P~~~~ie~~ln~~l~~--~~~~~~~fdl~ 337 (348) T protein:vir:26 270 AQDIFVGHRFPAGMGGMLPQQGANVPDPLKV----------SQVYDFYEVIPVCKRFMDAVNNDPEI--PDNLKLKFNLN 337 (348) T ss_pred HHHHHHHhCCCHHHccccCCCCCccccHHHH----------HHHHHHHHHHHHHHHHHHHHhhhhCC--CCccEEEEecC Confidence 888999999986544322 11 111111110 01111222222222222222211000 01122334444 Q ss_pred CCCCcCHHHHHHHH Q lcl|NC_018086. 405 RNLPQSYAELADMA 418 (511) Q Consensus 405 ~~~p~d~~e~a~~~ 418 (511) +..-++. +.++ T Consensus 338 ~~~e~~~---~~a~ 348 (348) T protein:vir:26 338 PGVESAN---GSAV 348 (348) T ss_pred cccccch---hhcC Confidence 3322222 2222 No 260 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=45.76 E-value=0.76 Score=21.24 Aligned_cols=310 Identities=13% Similarity=0.070 Sum_probs=104.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCcc---c--------cccce-ecc--chHHHH------HHH Q lcl|NC_018086. 28 FDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDT---N--------KPNSK-IVH--NFPKLL------VDT 87 (511) Q Consensus 28 ~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~---~--------~~~~r-i~~--n~~k~i------vd~ 87 (511) +-.+...+....+ +. -.-+-.|--+...++..... . ....+ .++ +-+..+ .+- T Consensus 1 ~~~~~~~~~~~~~---~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~fg~p~~v~~~~~~~~~ 73 (376) T protein:vir:10 1 MPARDRPRAARRR---RH----SFIFIHGVLRMSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDY 73 (376) T ss_pred CCCCccchhhhhh---cc----cchhhcccccchhccCCCcccchhhhhHhhhccCcceeEEEEcCCceeccCcchhhhh Confidence 1111100100000 00 00111111111111100000 0 00001 011 111111 111 Q ss_pred HHhhhhcc----CceecC------chh----hHHHHHHH----HhccCh--hHHHHHHHHHHhhCCeEEEEeeeCCCCce Q lcl|NC_018086. 88 STAYLAGE----PITESG------DEK----TIKAMQPV----FKENYV--TDVNSEEVKLSGIFGHCFEIHWIDRNKKH 147 (511) Q Consensus 88 ~~~~l~g~----~~~~~~------d~~----~~~~l~~~----~~~n~~--~~~~~~~~~~a~~~G~~~~~v~~~~~g~~ 147 (511) .--+..|+ |+...+ ... ....-..+ +.-|.. ...+.+++.+.+.+|.||+.+-.+..|++ T Consensus 74 ~~~~~~~~~~~pp~~~~~La~~~~~~~~h~s~l~~k~n~l~~~~~Pnp~lT~~~f~~~v~d~ll~Gnay~~~~rn~~G~~ 153 (376) T protein:vir:10 74 VECWSNGEWFEPPVSFAGLAKSFRASTHHSSALFFKANVLASTFRPHRWLSRHAFERWALDFLTFGNGYLERRRNMVGGT 153 (376) T ss_pred hhhhhcCceecCCCCHHHHHHHHhhhHHhhhhHHHHhHHHHhccCCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCCE Confidence 11111121 111100 000 00000001 111211 23456778889999999999888888865 Q ss_pred -EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccccccccccccccee Q lcl|NC_018086. 148 -RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHP 226 (511) Q Consensus 148 -~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (511) .+..++|..+-+..+.. ++|.... .+.. ..|.++.++++..- T Consensus 154 ~~L~pl~~~~vr~~~d~~---------~~~~~~~--~~~~----~~~~~~eViHir~~---------------------- 196 (376) T protein:vir:10 154 LRLEPALAKYVRRKADFN---------GFVYVNG--WQER----HEFEPDSVFQLVRP---------------------- 196 (376) T ss_pred EEEEEeCCcceEEEeeCC---------eEEEEEc--CCeE----EEEccccEEEecCC---------------------- Confidence 46777777665443322 1111111 1111 12333433333210 Q ss_pred ccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHHHHHhcCceeE--eecCCCCccchhhhhh----hh----- Q lcl|NC_018086. 227 NLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVNDIAYWNDAYLW--LQGFDLSADSDSISNM----KN----- 295 (511) Q Consensus 227 ~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~--~~G~~~~~~~~~~~~~----~~----- 295 (511) + | .+.-+|.|.+...+.-+..-+....-.....+..+.|-.+ ++|...++ +....+ .. T Consensus 197 ~-----~----~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~NGa~pggIl~~~d~~l~~--e~~~~lr~~~~~~~G~~ 265 (376) T protein:vir:10 197 D-----I----NQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTDAAQKQ--DDVDNMRDALKNAKGPG 265 (376) T ss_pred C-----C----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCH--HHHHHHHHHHHHhcCcc Confidence 0 0 0112466776654444432222211222233333444444 44543332 222222 11 Q ss_pred --CceeeecC---CCceeeeec--CCCHHHHHHHHHHHHHHHHHHhCccccccccc-------cCccHHHHHHHHHHHHH Q lcl|NC_018086. 296 --DRVIVTDE---DGMVKFITK--DVNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-------TAASGQALKAATQPLEN 361 (511) Q Consensus 296 --~~~i~~~~---~~~~~~~~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-------~~~Sg~Ai~~~~~~l~~ 361 (511) .+++.+.. +.++++.-. ......+.+..+...+.|+..-++|....+.. ++.......+... T Consensus 266 N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~eq~~~~f~~~---- 341 (376) T protein:vir:10 266 NFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFGRN---- 341 (376) T ss_pred ccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHH---- Confidence 12333322 234455433 33445567777788888999999986544322 1111111221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCCCCCcCHHHHH Q lcl|NC_018086. 362 KSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVRNLPQSYAELA 415 (511) Q Consensus 362 k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~e~a 415 (511) .|.-+++.+..+-...+ .++ +.|++..-..-.+.+ T Consensus 342 -----------~L~Pl~~~ieeln~~L~-------~~~-~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 342 -----------EIRPLQARFAELNDWLG-------EEV-VRFDDYEIPPAPVAA 376 (376) T ss_pred -----------HHHHHHHHHHHHHhhcc-------ccc-cccChhHhhcccccC Confidence 11111111111100000 011 445432221111111 No 261 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=44.03 E-value=0.82 Score=21.05 Aligned_cols=400 Identities=10% Similarity=0.093 Sum_probs=163.4 Q ss_pred CCCcc----chhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCcccccccee Q lcl|NC_018086. 1 MAIPN----GQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKI 76 (511) Q Consensus 1 ~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri 76 (511) =|..| +-.-.|.++.. +.....+..--..|+ .+|+.+-.+++-+..+ T Consensus 37 Ga~~i~~~~~~~~~~g~~~~------~~~~~~~~~~~~eLI-------~~YR~ma~~pEvd~Av---------------- 87 (516) T protein:vir:10 37 GATEIETREGEATYNAVMQQ------FFGIDNNISGTKDLI-------NTYRQLINNPEVERAV---------------- 87 (516) T ss_pred CceeeecCCCcccccceeee------eeccccccchHHHHH-------HHHHHHhhccchhhHH---------------- Confidence 01000 00011111111 111111111111222 3444444444443322 Q ss_pred ccchHHHHHHHHHhh-hhccCceec---------CchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeC--CC Q lcl|NC_018086. 77 VHNFPKLLVDTSTAY-LAGEPITES---------GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWID--RN 144 (511) Q Consensus 77 ~~n~~k~ivd~~~~~-l~g~~~~~~---------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~--~~ 144 (511) ..||+..+-+ -..+|+.+. ..+...+.+..++.--+|+....+..|.+.+.|+.|.....| ++ T Consensus 88 -----~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~ 162 (516) T protein:vir:10 88 -----ANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMPNPKK 162 (516) T ss_pred -----HHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCccc Confidence 1122221111 112233221 223445666777777889999999999999999999875554 35 Q ss_pred CceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcce----EEEEEEEcCCcEEEEEEccCcccccccccccccc Q lcl|NC_018086. 145 KKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQ----IRTYEVYTEDLIYKFSTDDEREVYREIPEELEIK 220 (511) Q Consensus 145 g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~----~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 220 (511) |-..+..+||+.+..+.-- .+++.+|.. +...-+|+++.. ++...+.. ...... T Consensus 163 GI~Elr~lDPr~i~~vR~i--------------~~~~~~~~~v~~~~~e~~~Y~~~~~-~~~~~g~~-----~~~~~~-- 220 (516) T protein:vir:10 163 GIAELRRLDPRFMEYYREI--------------VTSDIGGTTIVKGYREFFIYTTGNE-GYSYNGRI-----FEPNTR-- 220 (516) T ss_pred cceeeeeeCCcceeeEeee--------------cccccccchhhhhhhheeeeccCcc-ccccccce-----eCCCcc-- Confidence 6677888999987654321 111222211 112234444332 22111110 000000 Q ss_pred cccceeccCCccc--eEeecCC----cccCchhHHHHHHHHHHH--HHHHHHHHHHHHhcCceeEeecCCCCccc----- Q lcl|NC_018086. 221 DYEVHPNLLQKFP--VLEIIAN----EERLGDFEAQLSLIDAYN--LAVSDSVNDIAYWNDAYLWLQGFDLSADS----- 287 (511) Q Consensus 221 ~~~~~~~~~g~iP--vv~~~n~----~~g~s~~~~v~~l~d~~~--~~~s~~~~~~~~~~~p~l~~~G~~~~~~~----- 287 (511) + +|| .|.|... ..+--.+.-+..-+..+| +++-|....-+..+.|-+=+.-.+....+ T Consensus 221 --------i-kI~~dAI~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAe 291 (516) T protein:vir:10 221 --------I-KIPRSAVVYASSGLMDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKAT 291 (516) T ss_pred --------e-eechhheeeecccceeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHH Confidence 0 122 1222210 001111223444445555 35566666777777775533322221111 Q ss_pred hhhhhhh-h--Cce---------------------eeec--CCCc-eeeeecC--CCHHHHHHHHHHHHHHHHHHhCccc Q lcl|NC_018086. 288 DSISNMK-N--DRV---------------------IVTD--EDGM-VKFITKD--VNDKHIENIKNRAKLDIFSLSQTPD 338 (511) Q Consensus 288 ~~~~~~~-~--~~~---------------------i~~~--~~~~-~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~ 338 (511) .-...+. . +++ +++| +++. .+.-|.+ .+...+. -+.-+.+-+|+--++|- T Consensus 292 qYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~-DV~YF~kkLy~aLnVP~ 370 (516) T protein:vir:10 292 EYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMD-DVRWFNKKLYEALRIPL 370 (516) T ss_pred HHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHH-HHHHHHHHHHHHhCCCc Confidence 1111110 0 001 1122 2222 2222333 2333222 24555566677677774 Q ss_pred cc--c-cccc--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc--cceeEEeCCCCCcCH Q lcl|NC_018086. 339 LV--S-KDFT--AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKP--YEVTPVFVRNLPQSY 411 (511) Q Consensus 339 ~~--~-~~~~--~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~--~~i~i~f~~~~p~d~ 411 (511) .- . +.++ ..-+..|---+.....-+.+.+..|..-+.++++.=+ +|...-...+++. ..+.+.|...-.-.+ T Consensus 371 sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qL-ilKgiit~eew~~i~~~I~~~f~~Dn~f~E 449 (516) T protein:vir:10 371 SRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKTNL-IYKRIITEDEWDEQINNIKVNFHQDSYYTE 449 (516) T ss_pred ccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhccCCCHHHHHHHhhcceEEeeecchHHH Confidence 32 2 2221 1223334333444466777778888888888876422 2221112222222 346777744333333 Q ss_pred HHH-------HHHHHHH---h-ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCC Q lcl|NC_018086. 412 AEL-------ADMAVKL---R-DMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGAST 473 (511) Q Consensus 412 ~e~-------a~~~~~~---~-g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (511) ... ++++..+ + ..+|.+++++.+-..+| +|+..++++-+++.+... +. .+...++- T Consensus 450 lKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tD--eei~~e~k~I~~E~~~~~--~~--~p~~~~~f 516 (516) T protein:vir:10 450 LKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTE--EQIAQEEKQIEQEAGIKR--FQ--NPENEDDF 516 (516) T ss_pred HHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCH--hhHHHHHHHHHHhhhCCC--CC--CCCccccC Confidence 322 3344444 2 36899999998644443 233222222222211110 00 00000111 No 262 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=44.03 E-value=0.82 Score=21.05 Aligned_cols=400 Identities=10% Similarity=0.093 Sum_probs=163.4 Q ss_pred CCCcc----chhhcccccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCcccccccee Q lcl|NC_018086. 1 MAIPN----GQINAGDIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKI 76 (511) Q Consensus 1 ~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri 76 (511) =|..| +-.-.|.++.. +.....+..--..|+ .+|+.+-.+++-+..+ T Consensus 37 Ga~~i~~~~~~~~~~g~~~~------~~~~~~~~~~~~eLI-------~~YR~ma~~pEvd~Av---------------- 87 (516) T protein:vir:10 37 GATEIETREGEATYNAVMQQ------FFGIDNNISGTKDLI-------NTYRQLINNPEVERAV---------------- 87 (516) T ss_pred CceeeecCCCcccccceeee------eeccccccchHHHHH-------HHHHHHhhccchhhHH---------------- Confidence 01000 00011111111 111111111111222 3444444444443322 Q ss_pred ccchHHHHHHHHHhh-hhccCceec---------CchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeC--CC Q lcl|NC_018086. 77 VHNFPKLLVDTSTAY-LAGEPITES---------GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWID--RN 144 (511) Q Consensus 77 ~~n~~k~ivd~~~~~-l~g~~~~~~---------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~--~~ 144 (511) ..||+..+-+ -..+|+.+. ..+...+.+..++.--+|+....+..|.+.+.|+.|.....| ++ T Consensus 88 -----~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~ 162 (516) T protein:vir:10 88 -----ANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMPNPKK 162 (516) T ss_pred -----HHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCccc Confidence 1122221111 112233221 223445666777777889999999999999999999875554 35 Q ss_pred CceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcce----EEEEEEEcCCcEEEEEEccCcccccccccccccc Q lcl|NC_018086. 145 KKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQ----IRTYEVYTEDLIYKFSTDDEREVYREIPEELEIK 220 (511) Q Consensus 145 g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~----~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 220 (511) |-..+..+||+.+..+.-- .+++.+|.. +...-+|+++.. ++...+.. ...... T Consensus 163 GI~Elr~lDPr~i~~vR~i--------------~~~~~~~~~v~~~~~e~~~Y~~~~~-~~~~~g~~-----~~~~~~-- 220 (516) T protein:vir:10 163 GIAELRRLDPRFMEYYREI--------------VTSDIGGTTIVKGYREFFIYTTGNE-GYSYNGRI-----FEPNTR-- 220 (516) T ss_pred cceeeeeeCCcceeeEeee--------------cccccccchhhhhhhheeeeccCcc-ccccccce-----eCCCcc-- Confidence 6677888999987654321 111222211 112234444332 22111110 000000 Q ss_pred cccceeccCCccc--eEeecCC----cccCchhHHHHHHHHHHH--HHHHHHHHHHHHhcCceeEeecCCCCccc----- Q lcl|NC_018086. 221 DYEVHPNLLQKFP--VLEIIAN----EERLGDFEAQLSLIDAYN--LAVSDSVNDIAYWNDAYLWLQGFDLSADS----- 287 (511) Q Consensus 221 ~~~~~~~~~g~iP--vv~~~n~----~~g~s~~~~v~~l~d~~~--~~~s~~~~~~~~~~~p~l~~~G~~~~~~~----- 287 (511) + +|| .|.|... ..+--.+.-+..-+..+| +++-|....-+..+.|-+=+.-.+....+ T Consensus 221 --------i-kI~~dAI~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAe 291 (516) T protein:vir:10 221 --------I-KIPRSAVVYASSGLMDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKAT 291 (516) T ss_pred --------e-eechhheeeecccceeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHH Confidence 0 122 1222210 001111223444445555 35566666777777775533322221111 Q ss_pred hhhhhhh-h--Cce---------------------eeec--CCCc-eeeeecC--CCHHHHHHHHHHHHHHHHHHhCccc Q lcl|NC_018086. 288 DSISNMK-N--DRV---------------------IVTD--EDGM-VKFITKD--VNDKHIENIKNRAKLDIFSLSQTPD 338 (511) Q Consensus 288 ~~~~~~~-~--~~~---------------------i~~~--~~~~-~~~~~~~--~~~~~~~~~~~~l~~~i~~~s~~p~ 338 (511) .-...+. . +++ +++| +++. .+.-|.+ .+...+. -+.-+.+-+|+--++|- T Consensus 292 qYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~-DV~YF~kkLy~aLnVP~ 370 (516) T protein:vir:10 292 EYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMD-DVRWFNKKLYEALRIPL 370 (516) T ss_pred HHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHH-HHHHHHHHHHHHhCCCc Confidence 1111110 0 001 1122 2222 2222333 2333222 24555566677677774 Q ss_pred cc--c-cccc--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc--cceeEEeCCCCCcCH Q lcl|NC_018086. 339 LV--S-KDFT--AASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKP--YEVTPVFVRNLPQSY 411 (511) Q Consensus 339 ~~--~-~~~~--~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~--~~i~i~f~~~~p~d~ 411 (511) .- . +.++ ..-+..|---+.....-+.+.+..|..-+.++++.=+ +|...-...+++. ..+.+.|...-.-.+ T Consensus 371 sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qL-ilKgiit~eew~~i~~~I~~~f~~Dn~f~E 449 (516) T protein:vir:10 371 SRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKTNL-IYKRIITEDEWDEQINNIKVNFHQDSYYTE 449 (516) T ss_pred ccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhccCCCHHHHHHHhhcceEEeeecchHHH Confidence 32 2 2221 1223334333444466777778888888888876422 2221112222222 346777744333333 Q ss_pred HHH-------HHHHHHH---h-ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCC Q lcl|NC_018086. 412 AEL-------ADMAVKL---R-DMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGAST 473 (511) Q Consensus 412 ~e~-------a~~~~~~---~-g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (511) ... ++++..+ + ..+|.+++++.+-..+| +|+..++++-+++.+... +. .+...++- T Consensus 450 lKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tD--eei~~e~k~I~~E~~~~~--~~--~p~~~~~f 516 (516) T protein:vir:10 450 LKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTE--EQIAQEEKQIEQEAGIKR--FQ--NPENEDDF 516 (516) T ss_pred HHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCH--hhHHHHHHHHHHhhhCCC--CC--CCCccccC Confidence 322 3344444 2 36899999998644443 233222222222211110 00 00000111 No 263 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=43.91 E-value=0.83 Score=21.04 Aligned_cols=301 Identities=13% Similarity=0.071 Sum_probs=102.1 Q ss_pred HHHHHHHHHHHHHHhcCCCcccccCCcCcccccccee-ccch--HH------HHHHHHHhhhhcc----CceecCchh-- Q lcl|NC_018086. 40 HSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKI-VHNF--PK------LLVDTSTAYLAGE----PITESGDEK-- 104 (511) Q Consensus 40 ~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri-~~n~--~k------~ivd~~~~~l~g~----~~~~~~d~~-- 104 (511) .+ ++-+...+.-..........+...+. ++.| +. .+.+-.--+..|+ |+...+--+ T Consensus 1 ~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~ 71 (351) T protein:vir:79 1 MS---------KRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSF 71 (351) T ss_pred CC---------CCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHH Confidence 00 00000000000000000000000010 1111 11 1111111111122 111110000 Q ss_pred --------hHH----HHHHHHhccCh--hHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCce Q lcl|NC_018086. 105 --------TIK----AMQPVFKENYV--TDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEP 169 (511) Q Consensus 105 --------~~~----~l~~~~~~n~~--~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~ 169 (511) ... .+...+.-|.. ...+.+++.+.+.+|.||+.+-.+..|++ .+..++|..+-+..+.. T Consensus 72 ~~~~~h~~~l~~k~n~l~~~~~Pnp~~t~~~f~~~v~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~~~----- 146 (351) T protein:vir:79 72 RASTHHSSALFFKANVLASTFRPHRWLSRHAFERWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFS----- 146 (351) T ss_pred hhhHhhhhhhhhhhhHHhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCCEEEEEEeCCcceeeeecCC----- Confidence 000 00011111221 12356778899999999999988888874 46667776654332211 Q ss_pred EEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHH Q lcl|NC_018086. 170 VAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEA 249 (511) Q Consensus 170 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~ 249 (511) +++... ..+.. ..|.++.+++++.-. | .+.-+|.|.+.. T Consensus 147 ----~~~~~~--~~g~~----~~~~~~eIihir~~~---------------------------~----~~~~yGl~~~~~ 185 (351) T protein:vir:79 147 ----GFVYVN--GWQER----HEFEPDSVFQLVRPD---------------------------I----NQEVYGLPEYLS 185 (351) T ss_pred ----eEEEEe--cCceE----EEEcCccEEEeCCCC---------------------------C----CCCcccccHHHH Confidence 111111 11111 123334333332000 0 011246666655 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeE--eecCCCCccchhhhh----hhh-------CceeeecC---CCceeeee-- Q lcl|NC_018086. 250 QLSLIDAYNLAVSDSVNDIAYWNDAYLW--LQGFDLSADSDSISN----MKN-------DRVIVTDE---DGMVKFIT-- 311 (511) Q Consensus 250 v~~l~d~~~~~~s~~~~~~~~~~~p~l~--~~G~~~~~~~~~~~~----~~~-------~~~i~~~~---~~~~~~~~-- 311 (511) ....+..-.....-.....+..+.|-.+ ++|...++ +.... +.. .+++.+.. +.++++.- T Consensus 186 a~~si~l~~~a~~~~~~~f~NGa~pg~il~~~~~~ls~--e~~~~lk~~~~~~~G~~N~~~~~v~~~~g~~~gi~~~pl~ 263 (351) T protein:vir:79 186 SLHSAWLNESSTLFRRKYYENGSHAGFILYMTDAAQKQ--DDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVS 263 (351) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCH--HHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcC Confidence 4333332222211122233444445444 44543322 22221 211 12333322 22344433 Q ss_pred cCCCHHHHHHHHHHHHHHHHHHhCcccccccccc-Ccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_018086. 312 KDVNDKHIENIKNRAKLDIFSLSQTPDLVSKDFT-AAS-GQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMN 389 (511) Q Consensus 312 ~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~~S-g~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~ 389 (511) .......+.+..+...+.|+..-++|..-.+... +.+ +..++.. ....+...|.-+++.|..+-...+ T Consensus 264 ~~~~d~ef~e~k~~s~~eI~~a~~VPp~llGi~~~~t~~~~n~e~~----------~~~f~~~~l~Pl~~~ie~ln~~lg 333 (351) T protein:vir:79 264 EVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTA----------ARVFGRNEIRPLQARFAELNDWLG 333 (351) T ss_pred CChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHH----------HHHHHHHHHHHHHHHHHHHHhhcC Confidence 2334455777778888899999999865443321 111 1111110 001111222222222211111111 Q ss_pred CCccccccceeEEeCCCCCcCHHHHHHHHHHH Q lcl|NC_018086. 390 KAKDLKPYEVTPVFVRNLPQSYAELADMAVKL 421 (511) Q Consensus 390 ~~~~~~~~~i~i~f~~~~p~d~~e~a~~~~~~ 421 (511) .++ +.|++..- ..+..++ T Consensus 334 -------~~~-~~F~~~~l------lr~d~~a 351 (351) T protein:vir:79 334 -------DEV-VTFDDYEI------PPAPVAA 351 (351) T ss_pred -------cce-eeeChhhh------ccccccC Confidence 111 45654222 1111111 No 264 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=43.17 E-value=0.86 Score=20.96 Aligned_cols=411 Identities=11% Similarity=0.105 Sum_probs=163.2 Q ss_pred CCCccchhhcccc--------cCchhhHh-hhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCcccc Q lcl|NC_018086. 1 MAIPNGQINAGDI--------ITTNIRRK-HFIRRNFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNK 71 (511) Q Consensus 1 ~~~~~~~~~~~~~--------~~~~~~~~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~ 71 (511) -..+-+.-..|.. .+...--+ .+-..... +.....-+.+|+.+-.+++-+..+ T Consensus 31 ~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~-------~~~~~eLI~~YR~ma~~pEvd~Av----------- 92 (524) T protein:vir:10 31 ESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPE-------VKNTRELIDTYRNLMNNYEVDNAV----------- 92 (524) T ss_pred CccccCCCCCCceeeccCcccccchhhhhhhhhcccch-------hhhHHHHHHHHHHHhhccchhhHH----------- Confidence 0000000000100 00000000 01111111 111122233444444444443322 Q ss_pred ccceeccchHHHHHHHHHhh-hhccCceec---------CchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeee Q lcl|NC_018086. 72 PNSKIVHNFPKLLVDTSTAY-LAGEPITES---------GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWI 141 (511) Q Consensus 72 ~~~ri~~n~~k~ivd~~~~~-l~g~~~~~~---------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~ 141 (511) ..||+..+-+ -...|+.+. ..+...+.+..++.--+|+....+..|.+.+.|+.|.+.-. T Consensus 93 ----------~eIVneaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkii 162 (524) T protein:vir:10 93 ----------QEIVSDAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLNLLNFQRKGTDHFQRWYVDSRIFFHKII 162 (524) T ss_pred ----------HHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeceEEEEEEe Confidence 1122221111 112233321 22344566777777788999999999999999999988655 Q ss_pred C----CCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCccccccccccc Q lcl|NC_018086. 142 D----RNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEEL 217 (511) Q Consensus 142 ~----~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 217 (511) | ++|-..+..+||+.+-.|..-... ...++..+ +.+...-+|.++.- .|... +.. .+.. T Consensus 163 d~~~pk~GI~Elr~lDPr~i~~vr~i~~~--~~~~~~vi--------~~~~e~f~Y~~~~~-~~~~~--~~~---~~~~- 225 (524) T protein:vir:10 163 NPKKMKDGVQELRRLDPRQVQYIREIVTR--MEDGVKIV--------DGYREFFVYDTGHE-SYCAD--GRI---YSAG- 225 (524) T ss_pred eCCCccccceeeeeeCCccceeeeeeccc--Ccccchhh--------cchhhheeecCCCc-ccccC--cce---ecCC- Confidence 4 346677888999987444321110 11111000 01111123332211 00000 000 0000 Q ss_pred ccccccceeccCCccc---eEeecCC---cccCchhHHHHHHHHHHH--HHHHHHHHHHHHhcCceeEeecCCCCccc-- Q lcl|NC_018086. 218 EIKDYEVHPNLLQKFP---VLEIIAN---EERLGDFEAQLSLIDAYN--LAVSDSVNDIAYWNDAYLWLQGFDLSADS-- 287 (511) Q Consensus 218 ~~~~~~~~~~~~g~iP---vv~~~n~---~~g~s~~~~v~~l~d~~~--~~~s~~~~~~~~~~~p~l~~~G~~~~~~~-- 287 (511) +++ +|| ||+.-.. ..+.-.+.-+..-+..+| +++-|....-+..+.|-+=+.-.+....+ T Consensus 226 ---------~~i-kI~~dAIvy~~SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~ 295 (524) T protein:vir:10 226 ---------TKV-KIPRAAVVYAHSGLLDCCGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGNMPSR 295 (524) T ss_pred ---------cce-ecchhheeeeccCcccCCCCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCch Confidence 000 233 2222110 011112233444555555 45566677777777775533322221111 Q ss_pred ---hhhhhhh---hCce---------------------eeec--CCCc-eeeeecCC--CHHHHHHHHHHHHHHHHHHhC Q lcl|NC_018086. 288 ---DSISNMK---NDRV---------------------IVTD--EDGM-VKFITKDV--NDKHIENIKNRAKLDIFSLSQ 335 (511) Q Consensus 288 ---~~~~~~~---~~~~---------------------i~~~--~~~~-~~~~~~~~--~~~~~~~~~~~l~~~i~~~s~ 335 (511) .-...+. .+++ +++| +++. .+.-|.+. +...+. -+.-+.+-+|+--+ T Consensus 296 KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~-DV~YF~kkLy~aLn 374 (524) T protein:vir:10 296 KAAAQMQHIMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTMPGATGMSDMD-DVLYFRTALYRALR 374 (524) T ss_pred hHHHHHHHHHHhcCceeEEeccCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHH-HHHHHHHHHHHHhC Confidence 1111110 0010 1122 2222 22223332 233222 24555556666667 Q ss_pred cccccc--c---cccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc--cceeEEeCCCCC Q lcl|NC_018086. 336 TPDLVS--K---DFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKP--YEVTPVFVRNLP 408 (511) Q Consensus 336 ~p~~~~--~---~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~--~~i~i~f~~~~p 408 (511) +|-.-. + .|...-+..|-.-+.....-+.+.+..|..-+.++++.=+ +|...-...+++. ..+.+.|...-. T Consensus 375 VP~sRl~~e~~~~f~~gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qL-ilKgiit~eew~~i~~~I~~~f~~Dn~ 453 (524) T protein:vir:10 375 IPESRIPSESNSGVMFDAGTAITRDELKFAKWIRQLQNKFEEIFLDPLKTNL-ILKKIITEDEWEREINNIKVTFNRDSY 453 (524) T ss_pred CCchhccCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhccCCCHHHHHHHhhcceEEeeecch Confidence 774322 2 1211233344444445567778888888888888876422 2221112222222 346777744333 Q ss_pred cCHHHH-------HHHHHHHh---c-cCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCc Q lcl|NC_018086. 409 QSYAEL-------ADMAVKLR---D-MLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTA 474 (511) Q Consensus 409 ~d~~e~-------a~~~~~~~---g-~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 474 (511) -.+... ++++..+. | .+|.+++++.+-.-+| +|+..++++-+++.+... +++ .... .++- T Consensus 454 f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tD--eei~~~~k~I~~E~k~~~--~~~-~~~~-~~~f 524 (524) T protein:vir:10 454 FSEMKDAEIMERRINMLTMAEPFIGKYISHQTAMKDFLQMTD--EEINQEAKQIEEESKEAR--FQN-PDEE-EEDF 524 (524) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCH--HHHHHHHHHHHHHhhcCC--CCC-CChh-hhcC Confidence 333322 33444443 3 4699999988543343 333333332222222110 000 0000 0000 No 265 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=41.98 E-value=0.9 Score=20.82 Aligned_cols=334 Identities=9% Similarity=-0.017 Sum_probs=127.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceec--cchHHHHHHHHHhhhhccCceec---Cc----h- Q lcl|NC_018086. 34 ITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIV--HNFPKLLVDTSTAYLAGEPITES---GD----E- 103 (511) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~--~n~~k~ivd~~~~~l~g~~~~~~---~d----~- 103 (511) +.+.. +...+-.+... ...... .......+. .......|+..++-+-.-|+.+- .. + T Consensus 1 Mg~f~----------~~~~f~~~~~~-~~~~~~--~~~~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:93 1 MNLFG----------KVVSFSRGKLN-NDTQRV--TAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred Cccch----------hhhhhhccccC-CCccee--eecccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEccccccccc Confidence 11110 11110000000 000000 000000111 12233445555555555676531 11 1 Q ss_pred ---hhHHHHHHHHhc--cC---hhHHHHHHHHHHhhCCeEEEEeeeC-CCCceEEEEEcccceEEEecCCCCCceEEEEE Q lcl|NC_018086. 104 ---KTIKAMQPVFKE--NY---VTDVNSEEVKLSGIFGHCFEIHWID-RNKKHRFKAVSPMNCLIAYSADLDEEPVAAIY 174 (511) Q Consensus 104 ---~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~~-~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~ 174 (511) .....+..++.. |. .......+..+.+.+|.||++...+ ..|++... -|.. T Consensus 68 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~~l--~~~~------------------ 127 (378) T protein:vir:93 68 LISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDL--LFAD------------------ 127 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEE--EecC------------------ Confidence 112235555542 32 3355667788899999999764332 22332111 0100 Q ss_pred EEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHH Q lcl|NC_018086. 175 YNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLI 254 (511) Q Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~ 254 (511) .+ . .|.++.++|+ ++.-.+...+..+..+. T Consensus 128 --------~~-----~-~~~~~diih~------------------------------------r~~~~~~~~~s~l~~~~ 157 (378) T protein:vir:93 128 --------DK-----K-EYKTEELVRL------------------------------------TSPFYINEDTSILDNAL 157 (378) T ss_pred --------Ce-----e-EeccceeEEe------------------------------------cCccccchhhHHHHHHH Confidence 00 0 1122223332 21111111222333333 Q ss_pred HHHHHHHHHHHHHHHHhcCceeEe--ecCCCCcc--chhh----hhh-------hhCceeeecCCCceeeeecCCCHHHH Q lcl|NC_018086. 255 DAYNLAVSDSVNDIAYWNDAYLWL--QGFDLSAD--SDSI----SNM-------KNDRVIVTDEDGMVKFITKDVNDKHI 319 (511) Q Consensus 255 d~~~~~~s~~~~~~~~~~~p~l~~--~G~~~~~~--~~~~----~~~-------~~~~~i~~~~~~~~~~~~~~~~~~~~ 319 (511) .+++..++ .+.+-.++ .+. .++. .... ..+ ..++++.++++.+++.++.+.....+ T Consensus 158 ~~i~~~~~--------~~~~~g~l~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~~~~~ 228 (378) T protein:vir:93 158 ASIQTKLE--------QGKLRGLLKINAF-LDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK 228 (378) T ss_pred HHHHHHHh--------cCcccceeeeCCc-CCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhH Confidence 44433222 12222232 221 1111 1111 111 12346777666666555544444444 Q ss_pred HHHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc------- Q lcl|NC_018086. 320 ENIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAK------- 392 (511) Q Consensus 320 ~~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~------- 392 (511) ..++.+.+.|+..-++|..-.. +..| .. .....+..+|.-+++.|...+...--.. T Consensus 229 -~~~~~~~~~Ia~~fgVPp~~l~--g~~~-e~-------------~~~~f~~~tl~P~~~~ie~~l~~kLl~~~er~~~~ 291 (378) T protein:vir:93 229 -DEIDLIKSELLTGYFMNENILL--GTAT-QE-------------QQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVK 291 (378) T ss_pred -HHHHHHHHHHHHHhCCCHHHhc--CCcH-HH-------------HHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhh Confidence 3456777888888888753321 1111 10 1123444555555555554443211100 Q ss_pred -cccccceeEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCC Q lcl|NC_018086. 393 -DLKPYEVTPVFVRNLPQSYAELADMAVKLR--DMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQ 469 (511) Q Consensus 393 -~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~--g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 469 (511) .....++.+.+..-+-.|..+.++++.++. |+++.-.++.+++.-+-+. .+.+. -..+ .. T Consensus 292 ~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g--gD~~~-----------~~~n----~~ 354 (378) T protein:vir:93 292 GNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEG--GDVYI-----------ANLN----AV 354 (378) T ss_pred hcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--CCeee-----------eccc----cc Confidence 011223555666677788899999988874 7888888888876532111 00000 0000 00 Q ss_pred CCCCccccccCCCCCCccccccCCC Q lcl|NC_018086. 470 GASTAAANKLDKNPANTSTITTTDP 494 (511) Q Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (511) +.+.....+++.+ +......+.++ T Consensus 355 ~~~~~~~~~~~~~-~~~~~~e~~n~ 378 (378) T protein:vir:93 355 AVKNLSDLQGSRK-DVTSTDETNNQ 378 (378) T ss_pred cccchhhhcCccC-CCCCCCCCCCC Confidence 0000000000000 00000000000 No 266 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=41.37 E-value=0.93 Score=20.76 Aligned_cols=289 Identities=13% Similarity=0.096 Sum_probs=103.3 Q ss_pred cCCCcccccCCcCcccccccee-ccch--HHHHHH-----HHHhh-h-hc---cCce-e--------------cCchhhH Q lcl|NC_018086. 55 KGNHIAIQSRTFDDTNKPNSKI-VHNF--PKLLVD-----TSTAY-L-AG---EPIT-E--------------SGDEKTI 106 (511) Q Consensus 55 ~G~~~~~~~~~~~~~~~~~~ri-~~n~--~k~ivd-----~~~~~-l-~g---~~~~-~--------------~~d~~~~ 106 (511) ..+..-..... ....++..++ ++.| +-.+.+ ..... . .| .|+. . +.=.... T Consensus 1 m~~~~~~~~~~-~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~~~~~h~~~i~~k~ 79 (346) T protein:vir:10 1 MKKQLRKNLTQ-NDRLQPQAQTEIFSFGDPIPVLDRADILNYLECSAMYEKWYNPPMSFDGLAKSLRSSTHHESAIITKA 79 (346) T ss_pred CCcccCCCCCc-ccccccccCeEEEecCCcceecCchhHHHHHHHhhcCCceEecCCCHHHHHHHHHhhhhcchhhhhhh Confidence 11110000000 0000000000 0000 100100 01111 0 11 1110 0 0000001 Q ss_pred HHHHHHHhc-cCh--hHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEEEEeecC Q lcl|NC_018086. 107 KAMQPVFKE-NYV--TDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDI 182 (511) Q Consensus 107 ~~l~~~~~~-n~~--~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~ 182 (511) ..+...+.. |.. ...+.+++.+.+.+|.||+.+..+..|++ .+..++|..+.+..+++. . +|. .... T Consensus 80 n~l~~l~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~i~r~~~G~~~~L~pl~~~~v~~~~~~~~----~----~~~-~~~~ 150 (346) T protein:vir:10 80 NILLSTCEVDSRYLSRRDLSSFVKDYLVFGNAYFEVVRNRLGQVQRIESPLAKYVRKGLEAGQ----F----YYV-PQRF 150 (346) T ss_pred hhHHHHHhCCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCcEEEEEEecCCceEEEEcCCe----E----EEE-EEcc Confidence 112233332 221 23456678889999999999888888875 466677777654332211 0 011 1111 Q ss_pred CcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHH Q lcl|NC_018086. 183 TGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVS 262 (511) Q Consensus 183 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s 262 (511) ++..+ .+.++.+++++.-. ....-.|.|.+......+........ T Consensus 151 ~g~~~----~~~~~dIih~r~~~-------------------------------~~~~~~G~~~~~~a~~si~l~~~a~~ 195 (346) T protein:vir:10 151 DHQEH----EFAKGSIYHLLEPD-------------------------------INQDIYGLPQYLSALQSAWLNESATL 195 (346) T ss_pred CCeEE----EEecccEEEecCCC-------------------------------CCCCeeeccHHHHHHHHHHHHHHHHH Confidence 12111 12333333331000 00112477766654443333222222 Q ss_pred HHHHHHHHhcCceeEe--ecCCCCccchhhhhh----hh-------CceeeecCCC---ceeeee--cCCCHHHHHHHHH Q lcl|NC_018086. 263 DSVNDIAYWNDAYLWL--QGFDLSADSDSISNM----KN-------DRVIVTDEDG---MVKFIT--KDVNDKHIENIKN 324 (511) Q Consensus 263 ~~~~~~~~~~~p~l~~--~G~~~~~~~~~~~~~----~~-------~~~i~~~~~~---~~~~~~--~~~~~~~~~~~~~ 324 (511) -..+.....+.|-.++ ++...++ +....+ +. .+++.+..++ ++++.- .......+.+..+ T Consensus 196 ~~~~~~~NG~~~~~il~~~d~~l~~--e~~~~i~~~~~~~~g~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~qf~e~k~ 273 (346) T protein:vir:10 196 FRRKYFLNGAHAGFVFYMSDASQKQ--EDVENIRQQLKQSKGVGNFKNLFVHAPNGKKDGIQIIPIADVSAKDEFFNIKN 273 (346) T ss_pred HHHHHHhccCCCceEEEeCCCCCCH--HHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEecCCChhHHHHHHHHH Confidence 2222334444455443 4433322 222222 11 2234443332 233332 2223445666777 Q ss_pred HHHHHHHHHhCccccccccc-------cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc Q lcl|NC_018086. 325 RAKLDIFSLSQTPDLVSKDF-------TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPY 397 (511) Q Consensus 325 ~l~~~i~~~s~~p~~~~~~~-------~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~ 397 (511) ..+++|+..-++|+...+.. ++....++.+. ...|.-+++.|..+....+. T Consensus 274 ~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~~~~~f~---------------~~~l~P~~~~iee~n~~L~~------- 331 (346) T protein:vir:10 274 VSRDDVLAAHRVPPQLMGIIPNNTGGFGNVADAAEVFF---------------ITEIEPLQERLKEFNQWLGQ------- 331 (346) T ss_pred HhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHH---------------HHHHHHHHHHHHHHHhhccc------- Confidence 88889999999986544322 11111122221 11222222222211111110 Q ss_pred ceeEEeCCCCCcCHHH Q lcl|NC_018086. 398 EVTPVFVRNLPQSYAE 413 (511) Q Consensus 398 ~i~i~f~~~~p~d~~e 413 (511) + .|.|++...-.-+| T Consensus 332 e-~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 332 E-VIKFKPSKLLQRTQ 346 (346) T ss_pred c-eeeechhhhcccCC Confidence 1 14555433332222 No 267 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=40.37 E-value=0.97 Score=20.65 Aligned_cols=411 Identities=12% Similarity=0.110 Sum_probs=166.0 Q ss_pred CCCccch---hhcc-----------------------cccCchhhHhhhhccCCCHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 1 MAIPNGQ---INAG-----------------------DIITTNIRRKHFIRRNFDLRELITLAEMHSRSSSAYGVLYDYY 54 (511) Q Consensus 1 ~~~~~~~---~~~~-----------------------~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~yY 54 (511) .+-.|.+ .+++ ......+--..+...... +.....-+.+|+.+-.++ T Consensus 12 ~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~-------~~~~~eLI~~YR~ma~~p 84 (521) T protein:vir:81 12 ADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQK-------ISTTKQLVNTYRGLMNNH 84 (521) T ss_pred cCchhhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccc-------hhhHHHHHHHHHHHhhcc Confidence 1111111 0000 000000001111111111 111122233444444444 Q ss_pred cCCCcccccCCcCccccccceeccchHHHHHHHHHhh-hhccCceec---------CchhhHHHHHHHHhccChhHHHHH Q lcl|NC_018086. 55 KGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAY-LAGEPITES---------GDEKTIKAMQPVFKENYVTDVNSE 124 (511) Q Consensus 55 ~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~-l~g~~~~~~---------~d~~~~~~l~~~~~~n~~~~~~~~ 124 (511) +-+..+ ..||+..+-+ -..+|+.+. ..++..+++..++.--+|+....+ T Consensus 85 Evd~Av---------------------~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~ 143 (521) T protein:vir:81 85 EVENAV---------------------QNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQD 143 (521) T ss_pred chhhHH---------------------HHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhH Confidence 443322 1122221111 112233221 223445667777777889999999 Q ss_pred HHHHHhhCCeEEEEeeeCC---CCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEE Q lcl|NC_018086. 125 EVKLSGIFGHCFEIHWIDR---NKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKF 201 (511) Q Consensus 125 ~~~~a~~~G~~~~~v~~~~---~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 201 (511) ..|.+.+.|+.|.+.-.++ +|-..+..+||+.+..+.-..... ...+..+ +.+....+|+++...+. T Consensus 144 ~fR~WYVDgRi~fhkiid~~pk~GI~Elr~lDPr~i~~vr~i~k~~--~~~~~v~--------~~~~e~f~Y~~~~~~~~ 213 (521) T protein:vir:81 144 MFRRWYVDSRIFFHKIIGKNPKDGIVELRQLDPRNLEYVREIITED--TPEGKIY--------KATKEYFIYTVGNSSYC 213 (521) T ss_pred HHhhhhhcceEEEEEEEcCCccccceeeeeeCCcceeeeeeecccc--cCcccee--------cceeeeeeeecCCcccc Confidence 9999999999988765543 455678889999887664322110 0000000 11222334555432221 Q ss_pred EEccCcccccccccccccccccceeccCCccc--eEeecCC----cccCchhHHHHHHHHHHH--HHHHHHHHHHHHhcC Q lcl|NC_018086. 202 STDDEREVYREIPEELEIKDYEVHPNLLQKFP--VLEIIAN----EERLGDFEAQLSLIDAYN--LAVSDSVNDIAYWND 273 (511) Q Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP--vv~~~n~----~~g~s~~~~v~~l~d~~~--~~~s~~~~~~~~~~~ 273 (511) . .+. .. . ++.-=+|| .|.|... ..+.-.+.-+..-+..+| +++-|....-+..+. T Consensus 214 ~-~g~--~~---~-----------~~~~vkI~~dAI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRA 276 (521) T protein:vir:81 214 A-GGQ--VF---S-----------PNSRVKIPRSAITYAHSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRA 276 (521) T ss_pred c-cce--ee---c-----------CCcceeechhheeeeeccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhcc Confidence 1 110 00 0 00000122 1111110 001112233444455555 456667777777777 Q ss_pred ceeEeecCCCCccc-----hhhhhhh-h--Cce---------------------eeec--CCCc-eeeeecC--CCHHHH Q lcl|NC_018086. 274 AYLWLQGFDLSADS-----DSISNMK-N--DRV---------------------IVTD--EDGM-VKFITKD--VNDKHI 319 (511) Q Consensus 274 p~l~~~G~~~~~~~-----~~~~~~~-~--~~~---------------------i~~~--~~~~-~~~~~~~--~~~~~~ 319 (511) |-+=+.-.+....+ +-...+. . +++ +++| +++. .+.-|.+ .+...+ T Consensus 277 PeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem 356 (521) T protein:vir:81 277 PERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDI 356 (521) T ss_pred ccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhhhhcccccCCCcccceeecccCCCCChH Confidence 76543322221111 1111110 0 000 1122 2222 2222333 233333 Q ss_pred HHHHHHHHHHHHHHhCcccccc--cc---ccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc Q lcl|NC_018086. 320 ENIKNRAKLDIFSLSQTPDLVS--KD---FTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDL 394 (511) Q Consensus 320 ~~~~~~l~~~i~~~s~~p~~~~--~~---~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~ 394 (511) .. +.-+.+-+|+--++|-.-. +. +...-+..|-.-+.....-+.+.+..|..-+.++++.=+ +|...-...++ T Consensus 357 ~D-V~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qL-ilKgiit~eew 434 (521) T protein:vir:81 357 DD-IRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTRQSQFSEVLRDPLKYNL-ILKNVITEDDW 434 (521) T ss_pred HH-HHHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhhcCCCHHHH Confidence 22 4455555666667774322 21 211233344444444567778888888888888876422 12211111122 Q ss_pred c--ccceeEEeCCCCCcCHHHH-------HHHHHHHh---c-cCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018086. 395 K--PYEVTPVFVRNLPQSYAEL-------ADMAVKLR---D-MLPDETIINQFPWITDARQEVEKADAQRQKRADIALQN 461 (511) Q Consensus 395 ~--~~~i~i~f~~~~p~d~~e~-------a~~~~~~~---g-~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~ 461 (511) + ...+.+.|...-.-.+... ++++..+. | .+|.+++++.+-.-+| +|+..++++-+++.+.. - T Consensus 435 ~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tD--eei~~~~k~I~~E~~~~--~ 510 (521) T protein:vir:81 435 DREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYTD--DQMDTEKKQIEEEANDP--R 510 (521) T ss_pred HHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCH--HHHHHHHHHHHHHhhCC--C Confidence 1 2246677744333333222 33444443 3 4699999988543343 33333333222222211 0 Q ss_pred ccccccCCCCC Q lcl|NC_018086. 462 FKQTSAVQGAS 472 (511) Q Consensus 462 ~~~~~~~~~~~ 472 (511) +++.....++= T Consensus 511 ~~~p~~~~~~f 521 (521) T protein:vir:81 511 FKQTPDEIEDF 521 (521) T ss_pred CCCCcccccCC Confidence 11100000000 No 268 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=40.04 E-value=0.99 Score=20.61 Aligned_cols=281 Identities=13% Similarity=0.066 Sum_probs=97.3 Q ss_pred cccCCcCccccccceeccchHHHHHHHHHhhhhccCceecCch------------------hhHHHHHH----------- Q lcl|NC_018086. 61 IQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGDE------------------KTIKAMQP----------- 111 (511) Q Consensus 61 ~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d~------------------~~~~~l~~----------- 111 (511) ..+++...+ .+..+.... -......|.||+|..+-..- -....|.+ T Consensus 1 ~~~~~~~~~-~~~~~~~~~----~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~s~ 75 (344) T protein:vir:56 1 MSKKKGKTP-QPAAKTMTA----SAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHSSP 75 (344) T ss_pred CCCCCCCCC-chhhHHhhc----CCCceEEEEcCCceeecCcchhhhHHHhhhcCccccCCCCHHHHHHHHhhhhhhCcc Confidence 111111000 000000000 00001122222221110000 00011111 Q ss_pred ----------HHhccCh--hHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEEEE Q lcl|NC_018086. 112 ----------VFKENYV--TDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTV 178 (511) Q Consensus 112 ----------~~~~n~~--~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~ 178 (511) .+.-|.. ...+..++.+.+.+|.||+.+-.+..|++ .+..++|..+-+.-+.. ++|.+ T Consensus 76 i~~k~n~l~~~~~Pnp~~t~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~---------~~~~~ 146 (344) T protein:vir:56 76 IYVKRNILASTFIPHPWLSQQDFSRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEED---------VYWWV 146 (344) T ss_pred ceehhhhHHhhcCCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEEeecCC---------EEEEE Confidence 1122221 13456678888999999998888888875 34555555443221110 11111 Q ss_pred eecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHH Q lcl|NC_018086. 179 ISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYN 258 (511) Q Consensus 179 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~ 258 (511) .. .+.. ..|.++.++++..- .| .+.-.|.|.+.....-+..-. T Consensus 147 ~~--~g~~----~~~~~~dIiHir~~---------------------------~~----~~~~~Gls~~~~a~~si~l~~ 189 (344) T protein:vir:56 147 PS--FNEP----TAFAPGSVFHLLEP---------------------------DI----NQELYGLPEYLSALNSAWLNE 189 (344) T ss_pred ec--CCeE----EEEcCccEEEECCC---------------------------CC----CCCcccccHHHHHHHHHHHHH Confidence 10 1111 01233333332100 00 011247776664333333222 Q ss_pred HHHHHHHHHHHHhcCceeEe--ecCCCCccchhhhhh----hh------Cceeee--cC--CCceeeeec--CCCHHHHH Q lcl|NC_018086. 259 LAVSDSVNDIAYWNDAYLWL--QGFDLSADSDSISNM----KN------DRVIVT--DE--DGMVKFITK--DVNDKHIE 320 (511) Q Consensus 259 ~~~s~~~~~~~~~~~p~l~~--~G~~~~~~~~~~~~~----~~------~~~i~~--~~--~~~~~~~~~--~~~~~~~~ 320 (511) ....-.....+..+.|-.++ +|...++ +....+ .. ++.+.+ +. ..++++.-. ......+. T Consensus 190 ~a~~~~~~~f~NGa~pg~Il~~~d~~ls~--e~~~~lk~~~~~~~g~~~~r~l~l~~p~g~~~G~~~~pis~~~~d~qf~ 267 (344) T protein:vir:56 190 SATLFRRKYYENGAHAGYIMYVTDAVQDR--NDIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFF 267 (344) T ss_pred HHHHHHHHHHhccCCCceEEEecCCCCCH--HHHHHHHHHHHHhcCCCCccceEEecCCCCccceeEEEcCCChHHHHHH Confidence 11111122233334455444 4533332 222222 11 233333 22 224444433 33445567 Q ss_pred HHHHHHHHHHHHHhCcccccccccc-Ccc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc Q lcl|NC_018086. 321 NIKNRAKLDIFSLSQTPDLVSKDFT-AAS--GQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPY 397 (511) Q Consensus 321 ~~~~~l~~~i~~~s~~p~~~~~~~~-~~S--g~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~ 397 (511) +..+.....|+..-++|....+... +.+ |.+-+... ......|.-+++.+..+....+. T Consensus 268 e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~eq~~~-----------~f~~~tL~Pl~~~ie~~n~~l~~------- 329 (344) T protein:vir:56 268 NIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAK-----------VFVRNELIPLQDRIREINGWIGQ------- 329 (344) T ss_pred HHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHH-----------HHHHHHHHHHHHHHHHHHhhhcc------- Confidence 7778888899999999875544321 111 11111110 11111111111111111111110 Q ss_pred ceeEEeCCCCCcCHHH Q lcl|NC_018086. 398 EVTPVFVRNLPQSYAE 413 (511) Q Consensus 398 ~i~i~f~~~~p~d~~e 413 (511) + .+.|.+..-..+.+ T Consensus 330 ~-~~~F~~y~l~~~~~ 344 (344) T protein:vir:56 330 E-VIRFKNYSLDTDNG 344 (344) T ss_pred c-cccCCCccccccCC Confidence 0 13443333322222 No 269 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=38.00 E-value=1.1 Score=20.38 Aligned_cols=335 Identities=9% Similarity=-0.014 Sum_probs=127.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceec--cchHHHHHHHHHhhhhccCceec---C-c---h- Q lcl|NC_018086. 34 ITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIV--HNFPKLLVDTSTAYLAGEPITES---G-D---E- 103 (511) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~--~n~~k~ivd~~~~~l~g~~~~~~---~-d---~- 103 (511) +.+. .++..+..+...- ... .........+. .......|+..++-+-.-|+.+- . + + T Consensus 1 Mg~f----------~~~~~~~~~~~~~-~~~--~~~~~~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:94 1 MNLF----------GKVVSFSRGKLNN-DTQ--RVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CCcc----------ccchhcccccccC-Ccc--eeeeeccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCccccc Confidence 0011 1111111111000 000 00000111111 12344455666665555676531 1 1 0 Q ss_pred ---hhHHHHHHHHhc--cC---hhHHHHHHHHHHhhCCeEEEEe-eeCCCCceEEEEEcccceEEEecCCCCCceEEEEE Q lcl|NC_018086. 104 ---KTIKAMQPVFKE--NY---VTDVNSEEVKLSGIFGHCFEIH-WIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIY 174 (511) Q Consensus 104 ---~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~v-~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~ 174 (511) .....+.+++.. |. .......+....+.+|.||++. +.+..|++... -|... T Consensus 68 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~l--~p~~~----------------- 128 (378) T protein:vir:94 68 LISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELLDL--LFADD----------------- 128 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEEEE--EecCC----------------- Confidence 011235555542 32 3355667788899999999764 44444433211 11100 Q ss_pred EEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHH Q lcl|NC_018086. 175 YNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLI 254 (511) Q Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~ 254 (511) +. -|.++.++|+ ++.-.+...+..+..+. T Consensus 129 ---------~~------~~~~~diiH~------------------------------------~~~~~~~~g~s~l~~~~ 157 (378) T protein:vir:94 129 ---------KK------EYKPEELVRL------------------------------------TSPFYINEDTSILDNAL 157 (378) T ss_pred ---------ee------EeeeeeeEEe------------------------------------cCcCCccchhHHHHHHH Confidence 00 0112223332 21111111122333333 Q ss_pred HHHHHHHHHHHHHHHHhcCc--eeEeecCCCCc-cchhh----hhh-------hhCceeeecCCCceeeeecCCCHHHHH Q lcl|NC_018086. 255 DAYNLAVSDSVNDIAYWNDA--YLWLQGFDLSA-DSDSI----SNM-------KNDRVIVTDEDGMVKFITKDVNDKHIE 320 (511) Q Consensus 255 d~~~~~~s~~~~~~~~~~~p--~l~~~G~~~~~-~~~~~----~~~-------~~~~~i~~~~~~~~~~~~~~~~~~~~~ 320 (511) .++...++. +.+ ++.+.+.-.++ ..... ..+ ..++++.++++.+++.++.......+ T Consensus 158 ~~i~~~~~~--------~~~~gil~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~- 228 (378) T protein:vir:94 158 ASIQTKLEQ--------GKLRGLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK- 228 (378) T ss_pred HHHHHHHhc--------ccccceeeeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEEccCChhhhhH- Confidence 444333221 122 22222221111 11111 111 12346677666565555544433444 Q ss_pred HHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-------- Q lcl|NC_018086. 321 NIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAK-------- 392 (511) Q Consensus 321 ~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~-------- 392 (511) ...+.+.+.|+..-++|..-.. +..|.. .....+..+|.-.++.|...+...--.. T Consensus 229 ~~~~~~~~~Ia~~fgVP~~~l~--~~~se~--------------~~~~f~~~tL~P~~~~ie~~l~~~Ll~~~er~~g~~ 292 (378) T protein:vir:94 229 DEIDLIKSELLTGYFMNENILL--GTASQE--------------QQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKG 292 (378) T ss_pred HHHHHHHHHHHHHhCCCHHHhc--CChHHH--------------HHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhh Confidence 3456677888888888754331 111111 1123444455555544444333211100 Q ss_pred cccccceeEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCC Q lcl|NC_018086. 393 DLKPYEVTPVFVRNLPQSYAELADMAVKLR--DMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQG 470 (511) Q Consensus 393 ~~~~~~i~i~f~~~~p~d~~e~a~~~~~~~--g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 470 (511) .....++.+.+..-+-.|..+.++++.++. |+++.-.++.+++.-+-+.. +.+. -+.+ ..+ T Consensus 293 ~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gG--D~~~-----------~~~n----~~~ 355 (378) T protein:vir:94 293 NLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGG--DVYI-----------ANLN----AVA 355 (378) T ss_pred cccccceeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC--Ceee-----------eccc----ccc Confidence 011123555666667778899999988874 78888888887764321110 0000 0000 000 Q ss_pred CCCccccccCCCCCCccccccCCC Q lcl|NC_018086. 471 ASTAAANKLDKNPANTSTITTTDP 494 (511) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~ 494 (511) .......+++. .+......+.++ T Consensus 356 ~~~~~~~~~~~-~~~~~~~e~~n~ 378 (378) T protein:vir:94 356 VKNLSDLQGSR-KDVTSTDETNNQ 378 (378) T ss_pred cccchhhcCCc-CCCCCCCCCCCC Confidence 00000000000 000000000001 No 270 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=30.40 E-value=1.6 Score=19.50 Aligned_cols=335 Identities=9% Similarity=-0.002 Sum_probs=125.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceec--cchHHHHHHHHHhhhhccCcee---cCc----h- Q lcl|NC_018086. 34 ITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIV--HNFPKLLVDTSTAYLAGEPITE---SGD----E- 103 (511) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~--~n~~k~ivd~~~~~l~g~~~~~---~~d----~- 103 (511) +.++++.... .+ ...+.|... ... .....+. .......|+..++-+-.-|+.+ ... + T Consensus 1 M~if~~~~~~----~~-~~~~~~~~~-----~~~---~~~~~~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:94 1 MNLFGKVVSF----SR-GKLNNDTQR-----VTA---WQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CchhHHhHhh----hh-cccccCcce-----eee---eecchhhhhhHHHHHHHHHHHHhHhhCceeeeeeccccccccc Confidence 2333322210 00 011111110 000 0001111 1334556666666666667642 111 1 Q ss_pred ---hhHHHHHHHHhc--cC---hhHHHHHHHHHHhhCCeEEEE-eeeCCCCceEEEEEcccceEEEecCCCCCceEEEEE Q lcl|NC_018086. 104 ---KTIKAMQPVFKE--NY---VTDVNSEEVKLSGIFGHCFEI-HWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIY 174 (511) Q Consensus 104 ---~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~~~~~-v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~ 174 (511) .....+..+|.. |. -......+....+.+|.||++ ++.+..|++...+ T Consensus 68 ~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~----------------------- 124 (378) T protein:vir:94 68 LISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELLDLL----------------------- 124 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEEEEE----------------------- Confidence 112234455542 32 234555678888899999875 3333333321110 Q ss_pred EEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHH Q lcl|NC_018086. 175 YNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLI 254 (511) Q Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~ 254 (511) +. .++. .|.+..+.++.. |. . ...+.+. +..+. T Consensus 125 -~~----~~~~------~~~~~dvih~~~-----------------------------~~---~-~~~~~~~---~~~~~ 157 (378) T protein:vir:94 125 -FA----NDKK------EYKPEELVRLTS-----------------------------PF---Y-INEDTSI---LDNAL 157 (378) T ss_pred -Ee----cCcE------EechhceeeecC-----------------------------cC---C-cccchhH---HHHHH Confidence 00 0110 112222222210 00 0 0011122 22222 Q ss_pred HHHHHHHHHHHHHHHHhc-CceeEeecCCCCcc--ch----hhhhh-------hhCceeeecCCCceeeeecCCCHHHHH Q lcl|NC_018086. 255 DAYNLAVSDSVNDIAYWN-DAYLWLQGFDLSAD--SD----SISNM-------KNDRVIVTDEDGMVKFITKDVNDKHIE 320 (511) Q Consensus 255 d~~~~~~s~~~~~~~~~~-~p~l~~~G~~~~~~--~~----~~~~~-------~~~~~i~~~~~~~~~~~~~~~~~~~~~ 320 (511) ++++..+ ..++ ..++...+. .+.+ ++ +...+ ..++++.++++.+++.++.+.....+ T Consensus 158 ~~~~~~~-------~~~~~~g~l~~~~~-l~~~~~~~~~e~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~~- 228 (378) T protein:vir:94 158 ASIQTKL-------EQGKLRGLLKINAF-LDIDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK- 228 (378) T ss_pred HHHHHHH-------hhCCcccceeeCCc-CCHHHHHHHHHHHHHHHHHhhcccccccceeccCCceEEEccCChHHhhH- Confidence 2222221 1111 112222221 1111 11 11111 11346777766666655544433343 Q ss_pred HHHHHHHHHHHHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-------C-c Q lcl|NC_018086. 321 NIKNRAKLDIFSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNK-------A-K 392 (511) Q Consensus 321 ~~~~~l~~~i~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~-------~-~ 392 (511) ..++.+.+.|+..-++|..-.. +..+ +. .....+..+|.-++..|...+...-- . . T Consensus 229 ~~~~~~~~~Ia~~fgvPp~~l~--g~~~-e~-------------~~~~f~~~tl~P~~~~ie~~l~~~Ll~~~e~~~g~~ 292 (378) T protein:vir:94 229 DEIDLIKSELLTGYFMNENILL--GTAT-QE-------------QQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKG 292 (378) T ss_pred HHHHHHHHHHHHHhCCCHHHhc--CCch-HH-------------HHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhh Confidence 4467777888888888753331 1111 10 01123333444444443333332110 0 0 Q ss_pred cccccceeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCC Q lcl|NC_018086. 393 DLKPYEVTPVFVRNLPQSYAELADMAVKL--RDMLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQG 470 (511) Q Consensus 393 ~~~~~~i~i~f~~~~p~d~~e~a~~~~~~--~g~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 470 (511) .....++.+.++.-+-.|..+.++++.++ .|+++.-.++.+++.-+-+. .++... +.+. ..... T Consensus 293 ~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~~~g~~p~~g--gd~~~~-----------~~n~-~~~~~ 358 (378) T protein:vir:94 293 NLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEG--GDVYIA-----------NLNA-VAVKN 358 (378) T ss_pred hcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--CCeeee-----------cccc-cchhc Confidence 11123455666777778899999998887 47898888888876522100 000000 0000 00000 Q ss_pred CCCccccccCCCCCCccccccCCC Q lcl|NC_018086. 471 ASTAAANKLDKNPANTSTITTTDP 494 (511) Q Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~~ 494 (511) ..+...+..+..++.++.. + T Consensus 359 ~~~~~~~~~~~~~~~e~~n----~ 378 (378) T protein:vir:94 359 LSDLQGNRKDVTSTDETNN----Q 378 (378) T ss_pred chhcccccCCCCCCCCCCC----C Confidence 0000000000000000000 0 No 271 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=28.67 E-value=1.7 Score=19.28 Aligned_cols=290 Identities=11% Similarity=0.065 Sum_probs=106.7 Q ss_pred cccCCcCc-----cccccceeccchHHHHHHHHHhhh------hc---cCce-e--------------cCchhhHHHHHH Q lcl|NC_018086. 61 IQSRTFDD-----TNKPNSKIVHNFPKLLVDTSTAYL------AG---EPIT-E--------------SGDEKTIKAMQP 111 (511) Q Consensus 61 ~~~~~~~~-----~~~~~~ri~~n~~k~ivd~~~~~l------~g---~~~~-~--------------~~d~~~~~~l~~ 111 (511) .++..+.+ ...+..-.++.|...+--...+|+ .| +|+. . ++-......+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~y~~~~~~~~~~~~epp~~~~~la~l~~~~~~h~~~i~~k~n~l~~ 80 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLNEISASPALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGILHSRANMVSS 80 (345) T ss_pred CCCCccccchhhcccCcceeEEeecCCcccccchhhhhhhhcCCccccCCCCCHHHHHHHhhcccccccceeeechHHHh Confidence 11000000 000011122222222111122222 01 1110 0 000000001111 Q ss_pred HHhccCh--hHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEE Q lcl|NC_018086. 112 VFKENYV--TDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIR 188 (511) Q Consensus 112 ~~~~n~~--~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~ 188 (511) .+.-|.. ...+.+++.+.+.+|.||+.+-.+..|++ .+..++|..+.+..+... ...++++.. ...+.. T Consensus 81 ~~~Pn~~lt~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~d~~~----~~~~~~~~~--~~~g~~-- 152 (345) T protein:vir:37 81 LYEGGKALSRMDMRALCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVRKDGGY----SYLMKKSLY--DTAQEI-- 152 (345) T ss_pred hccCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEEeCCe----eEEEEEeEe--cCCceE-- Confidence 2222321 13355678889999999999888888875 466677766544332211 111111110 001111 Q ss_pred EEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHH-HHH Q lcl|NC_018086. 189 TYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDS-VND 267 (511) Q Consensus 189 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~-~~~ 267 (511) ..+.++.++++..- + | .+.-.|.|.+...+..+..-. ..+.+ ... T Consensus 153 --~~~~~~dVihir~~----------------------~-----~----~~~~~Gls~~~~a~~si~l~~-~a~~~~~~~ 198 (345) T protein:vir:37 153 --YRYDAKDIIFIKLY----------------------D-----P----MQQVYGSPDYVGGIQSALLNS-DATVFRRRY 198 (345) T ss_pred --EEEccccEEEecCC----------------------C-----C----CCCcccccHHHHHHHHHHHHH-HHHHHHHHH Confidence 01223333322100 0 0 011247777665444333222 22222 223 Q ss_pred HHHhcCceeEe--ecCCCCccchhhhhh----hh-------Cceeee-cC--CCceeeeecC--CCHHHHHHHHHHHHHH Q lcl|NC_018086. 268 IAYWNDAYLWL--QGFDLSADSDSISNM----KN-------DRVIVT-DE--DGMVKFITKD--VNDKHIENIKNRAKLD 329 (511) Q Consensus 268 ~~~~~~p~l~~--~G~~~~~~~~~~~~~----~~-------~~~i~~-~~--~~~~~~~~~~--~~~~~~~~~~~~l~~~ 329 (511) .+..+.|-.++ +|...+ ++....+ .. .+++.+ ++ +.++++.-.. .....+.+..+...+. T Consensus 199 f~NG~~p~~Il~~~d~~l~--~e~~~~lk~~~~~~~g~~n~~~~~i~~p~g~~~G~~~~pls~~~~d~qf~e~k~~~~~d 276 (345) T protein:vir:37 199 FSNGAHMGFILYSTDPDLT--EEMEEEIARKISESKGVGNFRSMFVNIANGHPDGLKVIPIGDTGTKDEFANIKNISAQD 276 (345) T ss_pred HhccCCcceEEEecCCCCC--HHHHHHHHHHHHHhcCcccccceEEEcCCCcccceEEEEccCChhHHHHHHHHHHhHHH Confidence 34444555554 443332 2222222 11 123333 22 2344544333 3345566777788889 Q ss_pred HHHHhCcccccccccc-Ccc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccceeEEeCC- Q lcl|NC_018086. 330 IFSLSQTPDLVSKDFT-AAS--GQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKPYEVTPVFVR- 405 (511) Q Consensus 330 i~~~s~~p~~~~~~~~-~~S--g~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~i~i~f~~- 405 (511) |+..-++|....+... +.+ +.+-+... ..+...|.-+++.|...++... .+. ....+.|++ T Consensus 277 Ia~a~~VPp~llGi~~~~~~~~~~~e~~~~-----------~f~~~~l~P~~~~ie~~ln~~~---~~~-~~~~i~F~~~ 341 (345) T protein:vir:37 277 VLTAHRFPAGLSGIIPTNTGGLGDPLKYRE-----------VYHYDEVMPLQEIIAETINQDP---EIK-NLLKIKFREQ 341 (345) T ss_pred HHHHhCCCHHHhCccCCCCCCcccHHHHHH-----------HHHHHHHHHHHHHHHHHhhhhc---cCC-CcceEEecch Confidence 9999999865443221 111 11111111 1222223333333333222110 111 123467753 Q ss_pred CCCc Q lcl|NC_018086. 406 NLPQ 409 (511) Q Consensus 406 ~~p~ 409 (511) .+.+ T Consensus 342 ~L~~ 345 (345) T protein:vir:37 342 NFAK 345 (345) T ss_pred hhcC Confidence 3333 No 272 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=26.40 E-value=1.9 Score=18.99 Aligned_cols=366 Identities=9% Similarity=-0.034 Sum_probs=129.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhhhhccCceecCc-hhhHHHHHHH Q lcl|NC_018086. 34 ITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAYLAGEPITESGD-EKTIKAMQPV 112 (511) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~l~g~~~~~~~d-~~~~~~l~~~ 112 (511) +.++.......... .+.. +..+........... ...-+...-....|+..++-+-.-|+.+-.+ +.....+..+ T Consensus 1 Mg~~~~~~~~~~~~---~~~~-~~~~~~~~~~~~~~~-~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~~~~~~~~~~l 75 (395) T protein:vir:40 1 MGFKSWVSGFFNEE---QRTL-NLTDTVWCSIPSEKL-KELSIKKWAIDSCANKIANTLSCAEVLTYEKGEEVRKKNWYM 75 (395) T ss_pred CchHHHHHhhhccc---cccc-ccccchhhccccccc-hhhhhhhHHHHHHHHHHHHHHhhCceeeccCCccccchHHHH Confidence 12221111111000 0000 000000000000000 0000112223345555555444456665322 2222334444 Q ss_pred Hhc--cC---hhHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceE Q lcl|NC_018086. 113 FKE--NY---VTDVNSEEVKLSGIFGHCFEIHWIDRNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQI 187 (511) Q Consensus 113 ~~~--n~---~~~~~~~~~~~a~~~G~~~~~v~~~~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~ 187 (511) +.. |. .......+....+.+|.||+++..+. +.+ |..+..... ...-..+..+.. ++... T Consensus 76 L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~---~~~----~~~~~~~~~------~~~~~~~~~v~~--~~~~~ 140 (395) T protein:vir:40 76 FNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEY---IYV----ADSFTKNDK------SLYENTYTEVTL--KDLTL 140 (395) T ss_pred HHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCc---eee----cCCcccccc------ccccceeeeeee--cCcee Confidence 432 32 23445667888899999997765432 111 111110000 000000100000 01000 Q ss_pred EEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecCCcccCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 188 RTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIANEERLGDFEAQLSLIDAYNLAVSDSVND 267 (511) Q Consensus 188 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l~d~~~~~~s~~~~~ 267 (511) -..+.++.++|++. ++..+.+.+.. +...+....+...+. T Consensus 141 --~~~~~~~evih~r~-----------------------------------~~~~~~~~~~~---l~~~~~~~~~~~~~~ 180 (395) T protein:vir:40 141 --KKEFKESEVLHLTL-----------------------------------NNESIKSIIDG---FYLLYGDLLTAAVNK 180 (395) T ss_pred --eeeeccccEEEeec-----------------------------------CCCCccccchh---HHHHHHHHHHHHHHH Confidence 01123333333321 11112222222 122222222222222 Q ss_pred HHHh--cCceeEeecCC-CCcc--chhhhh----h-----hhCceeeecCCCceeeeecCCCHHHHHH---HHHHHHHHH Q lcl|NC_018086. 268 IAYW--NDAYLWLQGFD-LSAD--SDSISN----M-----KNDRVIVTDEDGMVKFITKDVNDKHIEN---IKNRAKLDI 330 (511) Q Consensus 268 ~~~~--~~p~l~~~G~~-~~~~--~~~~~~----~-----~~~~~i~~~~~~~~~~~~~~~~~~~~~~---~~~~l~~~i 330 (511) ..+. ..+.+++.... .+++ +..... . ...+++.++++.+.+.+........+.+ +.+.+.+.| T Consensus 181 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~I 260 (395) T protein:vir:40 181 YKKLNSRKIIVKLKAMFGQTPEAEEKLRLMLSERMKKFLAEGDSALPVEDGMEIDELAGDSKIAESRDIKKMIDDVFEMV 260 (395) T ss_pred HHhcCCCCceEEEecccCCCHHHHHHHHHHHHHHHHHhhccCCceeecCCCceEEeccCChhhhhHHHHHHHHHHHHHHH Confidence 2222 33444443322 2111 110111 1 1223566666666555544433333332 233445677 Q ss_pred HHHhCccccccccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CCccccccceeEEeCCCCC Q lcl|NC_018086. 331 FSLSQTPDLVSKDFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMN--KAKDLKPYEVTPVFVRNLP 408 (511) Q Consensus 331 ~~~s~~p~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~--~~~~~~~~~i~i~f~~~~p 408 (511) +..-++|..-.+ +.-|+. + +.....+..+|.-+++.|...+...- .........+++.+..-+. T Consensus 261 a~~fgVPp~~l~--~~~sn~--e----------~~~~~f~~~~L~P~~~~ie~~l~~kLl~~~~~~~g~~i~fd~~~ll~ 326 (395) T protein:vir:40 261 ANSFNIPLGLAK--GDTVGL--S----------EQVNSFLMFSINPIAEMFTDEGNRKFYGRDSVLERTYMKLDTTRIKV 326 (395) T ss_pred HHHhCCCHHHhc--CCCcCH--H----------HHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhcCCceEEEechhhhc Confidence 777777754332 111111 0 11123444455555555544443211 1111122345666677777 Q ss_pred cCHHHHHHHHHHHh--ccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCccccccCCCCC Q lcl|NC_018086. 409 QSYAELADMAVKLR--DMLPDETIINQFPWI--TDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTAAANKLDKNPA 484 (511) Q Consensus 409 ~d~~e~a~~~~~~~--g~~s~et~~~~l~~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (511) .|..+.++++.++. |+++.-.++..++.- +++... +. .-..+. . ..+...... ++++..+ T Consensus 327 ~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~gD--~~-----------~~~~n~-~-~~~~~~~~~-kgge~~~ 390 (395) T protein:vir:40 327 QDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPETQ--ER-----------FVTKNY-A-PLGENEEDL-KGGDINE 390 (395) T ss_pred cCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCc--ee-----------eecccc-c-ccccccccc-CCCCCCC Confidence 88999999988874 788888888887652 222111 00 000000 0 000000000 0000000 Q ss_pred CccccccCC Q lcl|NC_018086. 485 NTSTITTTD 493 (511) Q Consensus 485 ~~~~~~~~~ 493 (511) + .+.+ T Consensus 391 ~----~~~~ 395 (395) T protein:vir:40 391 N----KGDS 395 (395) T ss_pred C----cCCC Confidence 0 0000 No 273 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=25.14 E-value=2.1 Score=18.83 Aligned_cols=397 Identities=12% Similarity=0.086 Sum_probs=164.2 Q ss_pred CCCccc-----hhhcccccCchhhHhhhhcc--CCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCCcCcccccc Q lcl|NC_018086. 1 MAIPNG-----QINAGDIITTNIRRKHFIRR--NFDLRELITLAEMHSRSSSAYGVLYDYYKGNHIAIQSRTFDDTNKPN 73 (511) Q Consensus 1 ~~~~~~-----~~~~~~~~~~~~~~~~~~~~--~~~~~~l~~~~~~~~~~~~~~~~~~~yY~G~~~~~~~~~~~~~~~~~ 73 (511) =|..|. ....|.+... +.+. .+..+ .||+ +|..+-.+++-+..+ T Consensus 31 Ga~~i~~~~~~~~~~g~~~~~------~~~~~~~~~~~---eLI~-------~YR~ma~~pEvd~Av------------- 81 (511) T protein:vir:56 31 GAKEIHTNLLAPQLGHAIIPS------DAQSEGTIPVK---ELIK-------SYRALAEYHEVDDAI------------- 81 (511) T ss_pred CceEEecccccceecceeccc------cccccCccchH---HHHH-------HHHHHhhccchhhHH------------- Confidence 111110 0011111111 1111 11111 2332 333444444433221 Q ss_pred ceeccchHHHHHHHHHhh-hhccCceec---------CchhhHHHHHHHHhccChhHHHHHHHHHHhhCCeEEEEeeeCC Q lcl|NC_018086. 74 SKIVHNFPKLLVDTSTAY-LAGEPITES---------GDEKTIKAMQPVFKENYVTDVNSEEVKLSGIFGHCFEIHWIDR 143 (511) Q Consensus 74 ~ri~~n~~k~ivd~~~~~-l~g~~~~~~---------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~~~~~v~~~~ 143 (511) ..||+..+-+ -...|+.+. ..+...+.+..++.--+|+....+..+.+.+.|+.|...-.|+ T Consensus 82 --------~eIvne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~ 153 (511) T protein:vir:56 82 --------QEIVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKILDK 153 (511) T ss_pred --------HHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecc Confidence 1122211111 112233221 2234456677777778899999999999999999887765544 Q ss_pred -CCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccc Q lcl|NC_018086. 144 -NKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDY 222 (511) Q Consensus 144 -~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 222 (511) +|-..+..+||+.+-.|..-.. +....+... +.+...-+|.+....... +...... T Consensus 154 k~GI~eLr~lDPr~i~~vr~i~~--~~~~~~~v~--------~~~~ey~~Y~~~~~~~~~-----~~~~~~~-------- 210 (511) T protein:vir:56 154 DNNIIELRPLNPMKMELVREIQK--ETIDGVEVV--------KGTLEYYVYKQSDYKMPS-----WMSATNR-------- 210 (511) T ss_pred ccceeehhhcCcccchhhhhhhc--ccccccccc--------cceeeeeEecCCCcccCc-----ccccccc-------- Confidence 5666788899998766543211 111111100 112223344443211110 0000000 Q ss_pred cceeccCCccc---eEee--------cCCcccCchhHHHHHHHHHHH--HHHHHHHHHHHHhcCceeEeecCCCCccc-- Q lcl|NC_018086. 223 EVHPNLLQKFP---VLEI--------IANEERLGDFEAQLSLIDAYN--LAVSDSVNDIAYWNDAYLWLQGFDLSADS-- 287 (511) Q Consensus 223 ~~~~~~~g~iP---vv~~--------~n~~~g~s~~~~v~~l~d~~~--~~~s~~~~~~~~~~~p~l~~~G~~~~~~~-- 287 (511) .+.-=+|| ||+. .|+....|-+.. -+..+| +++-|....-+..+.|-+=+.-.+....+ T Consensus 211 ---~~~~vkI~~daI~y~hSGL~d~~~~~g~i~syLhk---AiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~ 284 (511) T protein:vir:56 211 ---AQTSFRIPKDAIVFAHSGLMRGCADDPYIIGYLDR---AIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPTQ 284 (511) T ss_pred ---cccceeechhheeeecccceeccCCCCeeeccchh---hhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCch Confidence 00001122 1111 122233444444 344444 45566666777777775533322211111 Q ss_pred ---hhhhhhh-h--Cce---------------------eeec--CCCc-eeeeecCC--CHHHHHHHHHHHHHHHHHHhC Q lcl|NC_018086. 288 ---DSISNMK-N--DRV---------------------IVTD--EDGM-VKFITKDV--NDKHIENIKNRAKLDIFSLSQ 335 (511) Q Consensus 288 ---~~~~~~~-~--~~~---------------------i~~~--~~~~-~~~~~~~~--~~~~~~~~~~~l~~~i~~~s~ 335 (511) .-...+. . +++ +++| +++. .+.-|.+. +...+. -+.-+.+-+|+--+ T Consensus 285 KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~-DV~YF~kKLy~aLn 363 (511) T protein:vir:56 285 KAQQYVNGIMQNVKNRVVYDTQTGQVKNTTNAMSMLEDYYLPRREGSKGTEVSTLPGGQSLGDIE-DVLYFNRKLYKAMR 363 (511) T ss_pred hHHHHHHHHHHhcCceEEEeccCceeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHH-HHHHHHHHHHHHhC Confidence 1111110 0 000 1122 2222 22223332 233222 24555566677777 Q ss_pred cccccc--c----cccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc--cceeEEeCCCC Q lcl|NC_018086. 336 TPDLVS--K----DFTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEFMNKAKDLKP--YEVTPVFVRNL 407 (511) Q Consensus 336 ~p~~~~--~----~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~--~~i~i~f~~~~ 407 (511) +|-.-. + .|+-.-|..|-.-+.....-+.+.+..|..-+.++++.=+ +|...-...+++. ..+.+.|...- T Consensus 364 VP~SRl~~e~q~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qL-ilKgiit~eeW~~i~~~I~~~f~~Dn 442 (511) T protein:vir:56 364 IPTSRAASEDQTGGINFGQGAEITRDELKFTKFVKRLQTKFETVITDPLKHQL-IVNNIITEEEWDANHEKLYVVFNQDS 442 (511) T ss_pred CCcccccCCCCccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhccCCCHHHHHHHhhcceEEeeecc Confidence 774322 2 1221233445455555567788888888888888876422 2221112222222 34677774433 Q ss_pred CcCHHHH-------HHHHHHHh---c-cCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCc Q lcl|NC_018086. 408 PQSYAEL-------ADMAVKLR---D-MLPDETIINQFPWITDARQEVEKADAQRQKRADIALQNFKQTSAVQGASTA 474 (511) Q Consensus 408 p~d~~e~-------a~~~~~~~---g-~~s~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 474 (511) .-.+... ++++..+. | .+|.+++++.+-.-+| +|+..++++-+++.+... +.+ .. .+- T Consensus 443 ~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~ILr~tD--eei~~~~k~I~~E~k~~~--~~~--~e---~~f 511 (511) T protein:vir:56 443 YFEEAKELEILNSRMNAMRDIQDYAGKYYSHKYIQKNILRLSD--DQITAMQSEIDEEETNPR--FQQ--DD---QGF 511 (511) T ss_pred hHHHHHHHHHHHHHHHHHHHhcchhccccchHHHHHHHhccCH--HHHHHHHHHHHHhhcCCC--CCC--cc---cCC Confidence 3333322 33444443 3 3699999988543343 333333332222222111 000 00 000 No 274 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=24.32 E-value=2.2 Score=18.72 Aligned_cols=409 Identities=10% Similarity=0.079 Sum_probs=167.2 Q ss_pred CCCccchhhcccccCchhh-Hhhhhc--------cCCCHH------------------HHHH---HHHHHHHHHHHHHHH Q lcl|NC_018086. 1 MAIPNGQINAGDIITTNIR-RKHFIR--------RNFDLR------------------ELIT---LAEMHSRSSSAYGVL 50 (511) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~--------~~~~~~------------------~l~~---~~~~~~~~~~~~~~~ 50 (511) |+..+-+++.-.+-..... .+.+.. .+.+.. .+.. .+.....-+.+|+.+ T Consensus 1 m~~~~l~lf~f~~k~~e~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~m 80 (521) T protein:vir:10 1 MNPIFLKLLQPWMKDDEKRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRSL 80 (521) T ss_pred CCcchhHHhhhhhhhhhhHHhhhhccCccccccccCCCCceeeccCCCccccccchhhhhhccccccchHHHHHHHHHHH Confidence 6665555543332211111 111000 000000 0000 000011112223333 Q ss_pred HHHhcCCCcccccCCcCccccccceeccchHHHHHHHHHhh-hhccCceecC---------chhhHHHHHHHHhccChhH Q lcl|NC_018086. 51 YDYYKGNHIAIQSRTFDDTNKPNSKIVHNFPKLLVDTSTAY-LAGEPITESG---------DEKTIKAMQPVFKENYVTD 120 (511) Q Consensus 51 ~~yY~G~~~~~~~~~~~~~~~~~~ri~~n~~k~ivd~~~~~-l~g~~~~~~~---------d~~~~~~l~~~~~~n~~~~ 120 (511) -.+++-+ +-...||+..+-+ -...|+.+.- .+...+.+..++.--+|+. T Consensus 81 a~~pEvd---------------------~Av~eIvneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~ 139 (521) T protein:vir:10 81 SKYHEVD---------------------NAIDEIINDAIVQEDNRDTVYLDLDKTDWNESVKEMVREEFRTILKLLKFER 139 (521) T ss_pred hhccchh---------------------hHHHhhhcceEEecCCCceEEEEecCcccchHHHHHHHHHHHHHHHHhccch Confidence 2222222 2222233332222 1223444321 1234456677777778999 Q ss_pred HHHHHHHHHhhCCeEEEEeeeC----CCCceEEEEEcccceEEEecCCCCCceEEEEEEEEEeecCCcceEEEEEEEcCC Q lcl|NC_018086. 121 VNSEEVKLSGIFGHCFEIHWID----RNKKHRFKAVSPMNCLIAYSADLDEEPVAAIYYNTVISDITGHQIRTYEVYTED 196 (511) Q Consensus 121 ~~~~~~~~a~~~G~~~~~v~~~----~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 196 (511) ...+..|.+.+.|+.|.+.-.| ++|-..+..+||+.+-.+....... ..++... +.+...-+|.+. T Consensus 140 ~~~~~fR~WYVDgRi~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~k~~--~~~~~v~--------~~~~e~f~Y~~~ 209 (521) T protein:vir:10 140 EGKRHFRRWYVDSRIYFHKMIDPARPKDGIKELRLLDPRNVEYYRVNLKSN--ENGNDVY--------KGVKEFFTYGAT 209 (521) T ss_pred hhhHHHhhheeeeeEEEEEEeeCCCccccceeeeeeCCcceeeeeeecCCC--CCcchhh--------ccceeeeeeccC Confidence 9999999999999999886554 3466778889999875443211110 0111000 011122344432 Q ss_pred cEEEEEEccCcccccccccccccccccceeccCCccc--eEeec-------CCcccCchhHHHHHHHHHHH--HHHHHHH Q lcl|NC_018086. 197 LIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFP--VLEII-------ANEERLGDFEAQLSLIDAYN--LAVSDSV 265 (511) Q Consensus 197 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP--vv~~~-------n~~~g~s~~~~v~~l~d~~~--~~~s~~~ 265 (511) ....|-..+ .+...=+|| .|.|. |.....|-+.. -+..+| +++-|.. T Consensus 210 ~~~~~~~~g-------------------~~~~~vkI~~daI~y~hSGL~d~~~~~i~syLhk---AiKp~NQLkm~EDAl 267 (521) T protein:vir:10 210 EDNRYNISG-------------------NSNNLVQIPIDAIVYSHSGKVDIDGKTIVGYLHN---VIKPANQLKMLEDAM 267 (521) T ss_pred CCceecCCC-------------------CCCcceeechhheeeecccceeCCCCceeccchh---hhHhHHhhHHHHhhH Confidence 211111110 001111122 11121 22333444444 344444 4556666 Q ss_pred HHHHHhcCceeEeecCCCCccc-----hhhhhhh-h--Cce---------------------eeec--CCCc-eeeeecC Q lcl|NC_018086. 266 NDIAYWNDAYLWLQGFDLSADS-----DSISNMK-N--DRV---------------------IVTD--EDGM-VKFITKD 313 (511) Q Consensus 266 ~~~~~~~~p~l~~~G~~~~~~~-----~~~~~~~-~--~~~---------------------i~~~--~~~~-~~~~~~~ 313 (511) ..-+..+.|-+=+.-.+....+ +-...+. . +++ +++| +++. .+.-|.+ T Consensus 268 VIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEI~TLp 347 (521) T protein:vir:10 268 VIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTGKVKNSSNNLAMTEDYWLMRRDGKATTEVSTLP 347 (521) T ss_pred HHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhHhhhcccccCCCCccceeecc Confidence 6677777775533322221111 1111110 0 000 1122 2222 2222333 Q ss_pred --CCHHHHHHHHHHHHHHHHHHhCcccc--cccc----ccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 314 --VNDKHIENIKNRAKLDIFSLSQTPDL--VSKD----FTAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYL 385 (511) Q Consensus 314 --~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~----~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~ 385 (511) .+...+. -+.-+.+-+|+--++|-. ..++ +|. | ..|-.-+.....-+.+.+..|..-+.++++.=+ +| T Consensus 348 ggqnlgem~-DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr-~-~EItRDEikF~KFI~rLR~rFs~~f~~~L~~qL-il 423 (521) T protein:vir:10 348 GAQSMGEMD-DVRWFNRKLYESMKIPLSRLPQEGAGVTFGA-G-NDITRDELQFTKYIRGLQQQFEPIFLNPLRTNL-ML 423 (521) T ss_pred ccCCcChHH-HHHHHHHHHHHHhCCCccccCCCCCceeccc-c-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hh Confidence 2233222 244555556666677742 2221 222 2 234444445567778888888888888876422 22 Q ss_pred HhcCCCccccc--cceeEEeCCCCCcCHHH-------HHHHHHHHhc------cCChHHHHHhCCCCCC--HHHHHHHHH Q lcl|NC_018086. 386 EFMNKAKDLKP--YEVTPVFVRNLPQSYAE-------LADMAVKLRD------MLPDETIINQFPWITD--ARQEVEKAD 448 (511) Q Consensus 386 ~~~~~~~~~~~--~~i~i~f~~~~p~d~~e-------~a~~~~~~~g------~~s~et~~~~l~~v~d--~~~E~~ri~ 448 (511) ...-...+++. ..+.+.|...-.-.+.. .++++..+.+ .+|.+++++.+-..+| ..++-+.|+ T Consensus 424 Kgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~dyi~k~ILr~tDeeik~~~k~I~ 503 (521) T protein:vir:10 424 KGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYLSHEYVMKNILRMSDEDIKTEREKID 503 (521) T ss_pred ccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccccccccchHHHHHHHhcCCHhHHHHHHHHHH Confidence 21112222222 34677774433322322 2344455533 5899999988644443 223333333 Q ss_pred HHHHHHHHHHHhhccccccCCCCCC Q lcl|NC_018086. 449 AQRQKRADIALQNFKQTSAVQGAST 473 (511) Q Consensus 449 ~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (511) +|..+ . -+++..... .+- T Consensus 504 ~E~~~----~--~~~~p~~e~-~df 521 (521) T protein:vir:10 504 GELKD----S--VYKNPEDPM-EEF 521 (521) T ss_pred HhhhC----C--CCCCCcchh-hcC Confidence 33211 0 011100000 000 No 275 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=23.85 E-value=2.3 Score=18.65 Aligned_cols=194 Identities=13% Similarity=0.036 Sum_probs=74.2 Q ss_pred EEEEEEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecC-----CcccCchh Q lcl|NC_018086. 173 IYYNTVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIA-----NEERLGDF 247 (511) Q Consensus 173 v~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~ 247 (511) +|.- .+|..++.+.. .. + ..++ ......-.. |+++++ .-.|.|.+ T Consensus 1 ~r~~-----~dg~~~y~~~~---~~---~--~~~g---------------~~~~~~~~e--ilH~r~~~~~~~~~Glspi 50 (219) T protein:vir:98 1 MRVC-----KDGNYKYLMKK---SL---Y--DTKS---------------EIYEYNKND--VIFIKLYDPMQQVYGSPDY 50 (219) T ss_pred Ccee-----ecCeEEEEEec---ce---e--cCCc---------------eeEEecccc--EEEecCCCCCCCcceecHH Confidence 2211 11211111100 00 0 0000 000000001 233332 12477776 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCceeEe--ecCCCCccchhhhhh----hh------Cc-eeeecC---CCceeeee Q lcl|NC_018086. 248 EAQLSLIDAYNLAVSDSVNDIAYWNDAYLWL--QGFDLSADSDSISNM----KN------DR-VIVTDE---DGMVKFIT 311 (511) Q Consensus 248 ~~v~~l~d~~~~~~s~~~~~~~~~~~p~l~~--~G~~~~~~~~~~~~~----~~------~~-~i~~~~---~~~~~~~~ 311 (511) ......+.....+..-....++..+.|-.++ +|...++ +....+ .. .+ ++.+.. +.+++|.. T Consensus 51 ~~a~~~i~~~~aa~~~~~~~f~Ng~~p~gil~~~~~~l~~--e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~ 128 (219) T protein:vir:98 51 VGGITSALLNSDATIFRRRYYSNGAHMGFILYSTDPDMTE--EMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIP 128 (219) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCceEEEeCCCCCCH--HHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEE Confidence 6544444432222222223344556666555 3433332 222211 11 12 233322 22455544 Q ss_pred cCC--CHHHHHHHHHHHHHHHHHHhCccccccccc--cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018086. 312 KDV--NDKHIENIKNRAKLDIFSLSQTPDLVSKDF--TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLEF 387 (511) Q Consensus 312 ~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 387 (511) ... ....+.+..+.....|+..-++|....+.. +.+++..++... ...+...|.-.+..|...++. T Consensus 129 ~~~~~~d~qfle~rk~~~~eIa~~fgVPp~~lG~~~~~~~~~sn~eq~~----------~~f~~~tL~P~~~~ie~~ln~ 198 (219) T protein:vir:98 129 IGDTGQKDEFANIKNISAQDVLTSHRFPPGLSGIIPVNTAGLGDPLKIR----------EAYQADEVLPLQEIIAESINS 198 (219) T ss_pred ccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcccccCCCCCccCHHHHH----------HHHHHHHHHHHHHHHHHHhhh Confidence 433 334555566666788888888887654422 122222222111 122233333333333333321 Q ss_pred cCCCccccccceeEEeCCCCCcCHH Q lcl|NC_018086. 388 MNKAKDLKPYEVTPVFVRNLPQSYA 412 (511) Q Consensus 388 ~~~~~~~~~~~i~i~f~~~~p~d~~ 412 (511) . .. -...+.+.|....+.|.- T Consensus 199 ~---~~-~~~~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 199 D---YE-IKSALKVNFKQPEKRDKN 219 (219) T ss_pred h---hc-CCCccEEeecCcccccCC Confidence 1 00 112356788888877765 No 276 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=20.57 E-value=2.7 Score=18.18 Aligned_cols=276 Identities=13% Similarity=0.057 Sum_probs=95.1 Q ss_pred cCCCcccccCCcCcccc-------ccce---eccchHH------HHHHHHHhhh-hcc----CceecCchh--------- Q lcl|NC_018086. 55 KGNHIAIQSRTFDDTNK-------PNSK---IVHNFPK------LLVDTSTAYL-AGE----PITESGDEK--------- 104 (511) Q Consensus 55 ~G~~~~~~~~~~~~~~~-------~~~r---i~~n~~k------~ivd~~~~~l-~g~----~~~~~~d~~--------- 104 (511) .++ ++...+.. ..-+ +..+=+. .+.+-. ... .|+ |+...+=-+ T Consensus 1 ~~~------~~~~~~~~~~~~~~~~~~~~~~~~f~~p~~v~~~~~~~~~~-~~~~~~~~~~pp~~~~~la~~~~a~~~h~ 73 (344) T protein:vir:20 1 MSK------KKGKTPQPAAKTMTASGPKMEAFTFGEPVPVLDRRDILDYV-ECISNGRWYEPPVSFTGLAKSLRAAVHHS 73 (344) T ss_pred CCc------ccCCCCcchhhhhhccCCceEEEEcCCceEecCcchhhhhh-hhhhcCceecCCCCHHHHHHHHhhhhhhC Confidence 111 11000000 0000 0000000 011110 111 111 111100000 Q ss_pred -----hHHHHHHHHhccCh--hHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEecCCCCCceEEEEEEE Q lcl|NC_018086. 105 -----TIKAMQPVFKENYV--TDVNSEEVKLSGIFGHCFEIHWIDRNKKH-RFKAVSPMNCLIAYSADLDEEPVAAIYYN 176 (511) Q Consensus 105 -----~~~~l~~~~~~n~~--~~~~~~~~~~a~~~G~~~~~v~~~~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~ 176 (511) ....+...+.-|.. ...+..++.+.+.+|.||+.+-.+..|++ .+..++|..+-...+.. ++| T Consensus 74 ~~i~~k~n~l~~~~~Pn~~lt~~~f~~~~~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~vr~~~~~~---------~~~ 144 (344) T protein:vir:20 74 SPIYVKRNILASTFIPHPWLSQQDFSRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEED---------VYW 144 (344) T ss_pred ccceehhhhHHHhccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCCceeEeeecCC---------EEE Confidence 00001111222321 12456677888999999998888888865 34445554432211110 011 Q ss_pred EEeecCCcceEEEEEEEcCCcEEEEEEccCcccccccccccccccccceeccCCccceEeecC-----CcccCchhHHHH Q lcl|NC_018086. 177 TVISDITGHQIRTYEVYTEDLIYKFSTDDEREVYREIPEELEIKDYEVHPNLLQKFPVLEIIA-----NEERLGDFEAQL 251 (511) Q Consensus 177 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~ 251 (511) .+.. .+.. ..+.++. |+++++ .-.|.|.+.... T Consensus 145 ~~~~--~~~~----~~~~~~e------------------------------------IiHir~~~~~~~~yGls~~~~a~ 182 (344) T protein:vir:20 145 WVPS--FNEP----TAFAPGS------------------------------------VFHLLEPDINQELYGLPEYLSAL 182 (344) T ss_pred EEcc--CCeE----EEEcCcc------------------------------------EEEeCCCCCCCCcccccHHHHHH Confidence 1110 0110 0122222 233332 124666665433 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCceeEe--ecCCCCccchhhhh----hhh------Cceeee--cC--CCceeeeec--C Q lcl|NC_018086. 252 SLIDAYNLAVSDSVNDIAYWNDAYLWL--QGFDLSADSDSISN----MKN------DRVIVT--DE--DGMVKFITK--D 313 (511) Q Consensus 252 ~l~d~~~~~~s~~~~~~~~~~~p~l~~--~G~~~~~~~~~~~~----~~~------~~~i~~--~~--~~~~~~~~~--~ 313 (511) .-++.-..+..-..+.....+.|-.++ +|...++ +.... +.. ++.+.+ ++ ..++++... . T Consensus 183 ~si~l~~~a~~~~~~~f~NGa~p~~Il~~~d~~l~~--e~~~~ik~~~~~~~g~~n~r~l~l~~p~g~~~gi~~~pis~~ 260 (344) T protein:vir:20 183 NSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDR--NDIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEV 260 (344) T ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCH--HHHHHHHHHHHHhcCCCCccceEEecCCCCccceeEEEcCCC Confidence 333322111111222233345554544 4433332 22222 211 222332 22 123454433 2 Q ss_pred CCHHHHHHHHHHHHHHHHHHhCccccccccc-------cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018086. 314 VNDKHIENIKNRAKLDIFSLSQTPDLVSKDF-------TAASGQALKAATQPLENKSAVKESKFRKVLAKRYELVCSYLE 386 (511) Q Consensus 314 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-------~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~ 386 (511) .....+.+..+..++.|+..-++|..-.+.. ++....++.+... .|.-+++.+..+-. T Consensus 261 ~~d~qf~e~k~~s~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~~f~~~---------------~l~P~~~~~e~in~ 325 (344) T protein:vir:20 261 ATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRN---------------ELIPLQDRIREING 325 (344) T ss_pred hhHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHHHHHHH---------------HHHHHHHHHHHHHH Confidence 3344567777888889999999986544422 1111112222111 11111111111111 Q ss_pred hcCCCccccccceeEEeCCCCCcCHHH Q lcl|NC_018086. 387 FMNKAKDLKPYEVTPVFVRNLPQSYAE 413 (511) Q Consensus 387 ~~~~~~~~~~~~i~i~f~~~~p~d~~e 413 (511) ..+. . .+.|.++......| T Consensus 326 ~lg~------~--~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 326 WLGQ------E--VIRFKNYSLDTDND 344 (344) T ss_pred hcCC------c--ccccCccccccCCC Confidence 1111 1 13344333222222 Done!