Query lcl|NC_011308.1_cdsid_YP_002261419.1 [gene=P40_gp3] [protein=gp3] [protein_id=YP_002261419.1] [location=2322..3914] Match_columns 530 No_of_seqs 125 out of 387 Neff 8.7 Searched_HMMs 1612 Date Thu Nov 7 13:13:56 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_3 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_3_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:78083 Length: 537 100.0 8E-128 5E-131 717.3 54.8 524 1-530 1-536 (537) 2 protein:vir:105461 Length: 470 100.0 1E-108 7E-112 612.6 50.7 462 8-508 1-470 (470) 3 protein:vir:102330 Length: 451 100.0 4E-105 2E-108 593.0 49.4 436 11-499 1-451 (451) 4 protein:vir:95899 Length: 474 100.0 5E-105 3E-108 592.4 47.9 456 1-509 1-474 (474) 5 protein:vir:96266 Length: 474 100.0 5E-105 3E-108 592.4 47.9 456 1-509 1-474 (474) 6 protein:vir:102950 Length: 471 100.0 1E-104 9E-108 589.9 49.7 455 11-502 1-471 (471) 7 protein:vir:5961 Length: 503 # 100.0 6E-104 4E-107 586.5 51.8 486 1-524 14-503 (503) 8 protein:vir:79043 Length: 479 100.0 3E-103 2E-106 582.9 50.8 470 1-508 7-479 (479) 9 protein:vir:94805 Length: 492 100.0 7E-103 4E-106 580.8 49.9 457 1-516 33-492 (492) 10 protein:vir:1236 Length: 483 # 100.0 9E-103 6E-106 580.0 49.7 457 1-515 24-483 (483) 11 protein:vir:97336 Length: 492 100.0 1E-102 9E-106 579.0 49.8 457 1-515 24-492 (492) 12 protein:vir:96839 Length: 474 100.0 4E-102 3E-105 576.3 48.3 456 1-512 1-474 (474) 13 protein:vir:94498 Length: 474 100.0 7E-102 5E-105 575.0 49.0 456 1-516 13-474 (474) 14 protein:vir:97447 Length: 474 100.0 7E-102 5E-105 575.0 49.0 456 1-516 13-474 (474) 15 protein:vir:99781 Length: 511 100.0 2E-101 9E-105 573.3 50.5 464 1-516 27-511 (511) 16 protein:vir:97171 Length: 512 100.0 1E-100 6E-104 568.7 51.8 464 1-516 27-512 (512) 17 protein:vir:9306 Length: 511 # 100.0 9E-101 6E-104 569.1 51.4 464 1-516 27-511 (511) 18 protein:vir:106571 Length: 499 100.0 1E-100 7E-104 568.6 51.5 469 1-530 5-496 (499) 19 protein:vir:107112 Length: 478 100.0 4E-101 3E-104 570.8 49.0 459 1-510 10-478 (478) 20 protein:vir:95113 Length: 474 100.0 1E-100 7E-104 568.4 49.7 456 1-513 1-474 (474) 21 protein:vir:94101 Length: 474 100.0 1E-100 7E-104 568.4 48.5 457 1-513 1-474 (474) 22 protein:vir:105889 Length: 474 100.0 1E-100 7E-104 568.4 48.5 457 1-513 1-474 (474) 23 protein:vir:96240 Length: 511 100.0 7E-100 5E-103 564.0 51.8 464 1-516 27-511 (511) 24 protein:vir:103951 Length: 511 100.0 1.3E-99 8E-103 562.7 52.1 464 1-516 27-511 (511) 25 protein:vir:105292 Length: 478 100.0 6E-100 4E-103 564.6 48.5 457 1-510 17-478 (478) 26 protein:vir:78805 Length: 511 100.0 3.9E-99 2E-102 560.1 51.2 464 1-516 27-511 (511) 27 protein:vir:96366 Length: 511 100.0 3.9E-99 2E-102 560.1 51.2 464 1-516 27-511 (511) 28 protein:vir:93747 Length: 472 100.0 5E-99 3E-102 559.5 49.6 457 1-513 8-472 (472) 29 protein:vir:96179 Length: 468 100.0 8.5E-99 5E-102 558.2 48.5 449 1-513 1-468 (468) 30 protein:vir:2732 Length: 501 # 100.0 2.4E-98 1E-101 555.8 50.9 458 1-526 26-501 (501) 31 protein:vir:4898 Length: 502 # 100.0 2.6E-98 2E-101 555.6 50.6 457 1-530 30-502 (502) 32 protein:vir:96494 Length: 501 100.0 8.3E-98 5E-101 552.8 50.4 456 1-530 30-501 (501) 33 protein:vir:3609 Length: 452 # 100.0 2.8E-97 2E-100 549.9 50.5 438 1-511 7-452 (452) 34 protein:vir:94546 Length: 506 100.0 4.1E-97 3E-100 549.0 48.0 458 1-513 11-506 (506) 35 protein:vir:3964 Length: 453 # 100.0 9.2E-96 5.7E-99 541.6 50.9 438 1-514 7-453 (453) 36 protein:vir:9871 Length: 429 # 100.0 5E-95 3.1E-98 537.6 49.5 421 11-506 1-429 (429) 37 protein:vir:733 Length: 453 # 100.0 1E-94 6.3E-98 535.9 49.2 433 1-505 7-453 (453) 38 protein:vir:99522 Length: 470 100.0 4.3E-94 2.6E-97 532.5 51.5 445 1-512 15-470 (470) 39 protein:vir:95806 Length: 440 100.0 6.8E-95 4.2E-98 536.8 46.9 425 20-502 1-440 (440) 40 protein:vir:106639 Length: 481 100.0 1E-91 6.5E-95 519.4 50.8 449 1-521 12-481 (481) 41 protein:vir:9922 Length: 489 # 100.0 3.2E-90 2E-93 511.2 47.7 449 1-516 5-489 (489) 42 protein:vir:78537 Length: 480 100.0 1.9E-76 1.2E-79 435.7 46.1 453 10-526 1-480 (480) 43 protein:vir:2427 Length: 485 # 100.0 3E-76 1.9E-79 434.6 46.7 454 1-522 1-485 (485) 44 protein:vir:2341 Length: 488 # 100.0 2.7E-76 1.7E-79 434.9 44.7 452 5-519 1-488 (488) 45 protein:vir:80680 Length: 441 100.0 3.3E-76 2E-79 434.4 44.4 422 6-497 1-441 (441) 46 protein:vir:4223 Length: 486 # 100.0 1.6E-75 1E-78 430.6 45.2 455 1-524 1-486 (486) 47 protein:vir:78227 Length: 480 100.0 2.4E-75 1.5E-78 429.6 45.7 453 10-526 1-480 (480) 48 protein:vir:104082 Length: 485 100.0 5.1E-74 3.2E-77 422.4 46.8 454 1-521 1-485 (485) 49 protein:vir:7768 Length: 484 # 100.0 4.8E-74 3E-77 422.5 43.2 454 1-524 1-484 (484) 50 protein:vir:99072 Length: 479 100.0 1.1E-72 7.1E-76 415.0 43.1 454 1-522 1-479 (479) 51 protein:vir:2500 Length: 501 # 100.0 2.3E-71 1.5E-74 407.8 45.0 472 1-520 1-501 (501) 52 protein:vir:102602 Length: 456 100.0 1.5E-71 9.4E-75 408.8 42.4 442 8-503 1-456 (456) 53 protein:vir:105819 Length: 456 100.0 1.5E-71 9.4E-75 408.8 42.4 442 8-503 1-456 (456) 54 protein:vir:7987 Length: 456 # 100.0 6.8E-71 4.2E-74 405.3 43.7 439 8-503 1-456 (456) 55 protein:vir:38 Length: 496 # N 100.0 2.2E-62 1.4E-65 358.6 45.2 452 9-510 1-496 (496) 56 protein:vir:98444 Length: 434 100.0 2E-63 1.2E-66 364.4 38.4 404 46-521 1-434 (434) 57 protein:vir:99916 Length: 504 100.0 3.5E-62 2.2E-65 357.5 43.8 459 1-530 1-503 (504) 58 protein:vir:80959 Length: 499 100.0 9.6E-60 6E-63 344.1 45.4 456 9-502 1-499 (499) 59 protein:vir:9751 Length: 422 # 100.0 4.4E-56 2.8E-59 324.0 35.4 400 11-482 1-422 (422) 60 protein:vir:94742 Length: 409 100.0 9.8E-55 6.1E-58 316.7 35.4 388 11-468 1-409 (409) 61 protein:vir:9568 Length: 410 # 100.0 6.6E-54 4.1E-57 312.1 34.8 389 21-484 1-410 (410) 62 protein:vir:1634 Length: 409 # 100.0 4.8E-52 3E-55 301.9 35.4 388 11-468 1-409 (409) 63 protein:vir:79703 Length: 505 100.0 6.3E-50 3.9E-53 290.3 44.0 445 11-500 1-505 (505) 64 protein:vir:1587 Length: 508 # 100.0 8.1E-49 5E-52 284.2 45.4 454 11-507 1-508 (508) 65 protein:vir:8184 Length: 474 # 100.0 1.7E-49 1E-52 288.0 40.6 436 1-502 5-474 (474) 66 protein:vir:3028 Length: 500 # 100.0 5.3E-45 3.3E-48 263.3 45.9 449 11-519 1-500 (500) 67 protein:vir:9815 Length: 500 # 100.0 5.3E-45 3.3E-48 263.3 45.9 449 11-519 1-500 (500) 68 protein:vir:78907 Length: 518 100.0 6.9E-44 4.3E-47 257.2 41.2 477 11-518 1-518 (518) 69 protein:vir:4782 Length: 522 # 100.0 1.6E-43 1E-46 255.1 42.4 458 11-521 1-522 (522) 70 protein:vir:101494 Length: 527 100.0 1.8E-44 1.1E-47 260.4 36.1 482 1-528 10-527 (527) 71 protein:vir:102239 Length: 527 100.0 1.8E-44 1.1E-47 260.4 36.0 482 1-528 10-527 (527) 72 protein:vir:7430 Length: 563 # 100.0 5.9E-39 3.6E-42 230.2 37.2 508 1-530 1-563 (563) 73 protein:vir:98883 Length: 517 100.0 7.5E-37 4.7E-40 218.6 44.9 455 11-522 1-517 (517) 74 protein:vir:97265 Length: 513 99.9 4.6E-21 2.8E-24 132.1 37.9 466 1-529 1-513 (513) 75 protein:vir:94956 Length: 452 99.9 2.7E-21 1.6E-24 133.4 34.8 428 11-510 1-452 (452) 76 protein:vir:95149 Length: 501 99.8 2.2E-19 1.4E-22 122.9 36.1 454 1-510 1-501 (501) 77 protein:vir:93630 Length: 776 99.8 2.3E-19 1.4E-22 122.8 31.5 511 1-530 23-680 (776) 78 protein:vir:80453 Length: 535 99.8 1.1E-16 6.6E-20 108.1 39.2 458 1-524 24-535 (535) 79 protein:vir:78393 Length: 489 99.8 1.5E-16 9.6E-20 107.3 37.8 446 4-524 1-489 (489) 80 protein:vir:108295 Length: 711 99.7 2.7E-16 1.7E-19 105.9 35.4 490 1-530 10-637 (711) 81 protein:vir:104437 Length: 714 99.7 4.1E-16 2.5E-19 104.9 33.4 482 1-530 5-624 (714) 82 protein:vir:95014 Length: 491 99.7 2.5E-15 1.5E-18 100.6 36.0 448 4-515 1-491 (491) 83 protein:vir:9950 Length: 714 # 99.7 4.7E-15 2.9E-18 99.1 36.5 484 1-530 1-624 (714) 84 protein:vir:817 Length: 714 # 99.7 4.7E-15 2.9E-18 99.1 36.5 484 1-530 1-624 (714) 85 protein:vir:2764 Length: 714 # 99.7 4.7E-15 2.9E-18 99.1 36.5 484 1-530 1-624 (714) 86 protein:vir:3296 Length: 714 # 99.7 4.7E-15 2.9E-18 99.1 36.5 484 1-530 1-624 (714) 87 protein:vir:10117 Length: 714 99.7 4.7E-15 2.9E-18 99.1 36.5 484 1-530 1-624 (714) 88 protein:vir:96783 Length: 488 99.7 1.8E-15 1.1E-18 101.4 34.1 430 1-489 14-488 (488) 89 protein:vir:105619 Length: 772 99.7 3.3E-15 2.1E-18 100.0 33.3 501 1-530 3-650 (772) 90 protein:vir:8846 Length: 705 # 99.6 6.5E-14 4E-17 92.9 30.5 484 1-530 1-599 (705) 91 protein:vir:80040 Length: 461 99.5 4.3E-14 2.6E-17 93.9 28.0 427 1-521 1-461 (461) 92 protein:vir:80165 Length: 651 99.4 2.7E-11 1.7E-14 78.5 33.7 464 1-530 5-599 (651) 93 protein:vir:5249 Length: 437 # 99.4 1.6E-12 9.7E-16 85.3 25.5 404 11-530 1-437 (437) 94 protein:vir:95449 Length: 584 99.4 6.7E-12 4.2E-15 81.8 27.2 467 1-505 1-584 (584) 95 protein:vir:96068 Length: 765 99.3 4.8E-11 3E-14 77.2 31.1 427 1-530 71-565 (765) 96 protein:vir:77597 Length: 725 99.3 2.3E-11 1.5E-14 78.9 28.5 487 6-530 1-607 (725) 97 protein:vir:94049 Length: 532 99.3 1.4E-10 9E-14 74.5 31.0 436 1-530 36-531 (532) 98 protein:vir:100920 Length: 725 99.3 1.4E-11 8.4E-15 80.2 25.2 485 6-530 1-607 (725) 99 protein:vir:105520 Length: 706 99.3 1.2E-10 7.4E-14 75.0 29.9 487 1-530 1-635 (706) 100 protein:vir:9263 Length: 725 # 99.3 2.7E-11 1.7E-14 78.5 25.3 487 6-530 1-607 (725) 101 protein:vir:107742 Length: 537 99.3 4.7E-11 2.9E-14 77.2 26.4 429 1-530 25-531 (537) 102 protein:vir:99563 Length: 862 99.3 9.7E-11 6E-14 75.5 28.1 450 1-530 59-591 (862) 103 protein:vir:105429 Length: 708 99.2 3.1E-10 1.9E-13 72.7 28.8 508 9-530 1-651 (708) 104 protein:vir:79538 Length: 502 99.2 1E-09 6.3E-13 69.9 36.3 439 11-529 1-502 (502) 105 protein:vir:3520 Length: 720 # 99.1 1.4E-09 8.8E-13 69.1 31.3 480 9-530 1-613 (720) 106 protein:vir:104338 Length: 422 99.1 6.6E-10 4.1E-13 70.9 25.3 383 11-518 1-422 (422) 107 protein:vir:172 Length: 708 # 99.1 3.2E-09 2E-12 67.1 31.6 488 9-530 1-621 (708) 108 protein:vir:3139 Length: 599 # 99.0 3.8E-10 2.3E-13 72.3 21.7 478 1-513 1-599 (599) 109 protein:vir:79647 Length: 435 98.9 1.2E-08 7.2E-12 64.1 31.4 396 21-521 1-435 (435) 110 protein:vir:107662 Length: 427 98.9 1.4E-08 8.9E-12 63.6 30.6 389 21-526 1-427 (427) 111 protein:vir:94599 Length: 641 98.8 5.6E-08 3.5E-11 60.3 30.1 503 1-527 1-641 (641) 112 protein:vir:96738 Length: 505 98.8 6E-08 3.7E-11 60.2 36.3 424 1-519 11-505 (505) 113 protein:vir:6382 Length: 553 # 98.7 1E-07 6.5E-11 58.8 31.8 464 13-526 1-553 (553) 114 protein:vir:63755 Length: 547 98.6 1.5E-07 9.6E-11 57.9 30.8 442 1-530 15-523 (547) 115 protein:vir:3420 Length: 533 # 98.6 1.6E-07 1E-10 57.8 37.5 442 7-524 1-533 (533) 116 protein:vir:10321 Length: 495 98.5 3.5E-07 2.2E-10 56.0 36.1 440 1-521 1-495 (495) 117 protein:vir:99312 Length: 563 98.5 4.6E-07 2.8E-10 55.3 32.4 453 1-530 20-548 (563) 118 protein:vir:95599 Length: 563 98.5 4.6E-07 2.8E-10 55.3 32.4 453 1-530 20-548 (563) 119 protein:vir:102080 Length: 429 98.4 8.1E-07 5E-10 54.0 29.9 397 14-511 1-429 (429) 120 protein:vir:95821 Length: 763 98.4 8.1E-07 5E-10 54.0 34.7 468 1-530 14-633 (763) 121 protein:vir:102668 Length: 547 98.4 8.7E-07 5.4E-10 53.8 37.1 473 11-512 1-547 (547) 122 protein:vir:389 Length: 530 # 98.4 8.7E-07 5.4E-10 53.8 38.6 442 11-522 1-530 (530) 123 protein:vir:105002 Length: 432 98.4 9.3E-07 5.8E-10 53.6 30.4 391 33-511 1-432 (432) 124 protein:vir:107605 Length: 432 98.4 9.3E-07 5.8E-10 53.6 30.4 391 33-511 1-432 (432) 125 protein:vir:102855 Length: 432 98.4 9.3E-07 5.8E-10 53.6 30.4 391 33-511 1-432 (432) 126 protein:vir:80644 Length: 551 98.4 1E-06 6.2E-10 53.5 33.4 444 7-530 1-531 (551) 127 protein:vir:102727 Length: 945 98.3 1.3E-06 7.9E-10 52.9 31.8 425 1-530 61-536 (945) 128 protein:vir:95542 Length: 548 98.2 2.1E-06 1.3E-09 51.8 35.4 460 1-527 1-548 (548) 129 protein:vir:9359 Length: 348 # 98.0 6.5E-06 4.1E-09 49.0 28.3 330 87-511 1-348 (348) 130 protein:vir:96579 Length: 576 98.0 8.3E-06 5.1E-09 48.4 27.6 445 1-530 21-540 (576) 131 protein:vir:93610 Length: 454 97.9 1E-05 6.4E-09 47.9 35.0 405 36-530 1-442 (454) 132 protein:vir:1380 Length: 422 # 97.9 1.1E-05 6.5E-09 47.9 29.8 399 11-522 1-422 (422) 133 protein:vir:8418 Length: 409 # 97.9 1.2E-05 7.3E-09 47.6 32.2 389 11-519 1-409 (409) 134 protein:vir:4156 Length: 542 # 97.9 1.3E-05 8.3E-09 47.3 26.8 415 20-530 1-472 (542) 135 protein:vir:3153 Length: 467 # 97.7 2.3E-05 1.4E-08 46.0 33.4 398 61-530 1-461 (467) 136 protein:vir:3843 Length: 397 # 97.7 2.5E-05 1.6E-08 45.8 32.7 381 11-528 1-397 (397) 137 protein:vir:6240 Length: 457 # 97.6 4E-05 2.5E-08 44.7 33.5 419 11-530 1-449 (457) 138 protein:vir:81095 Length: 416 97.6 4.4E-05 2.7E-08 44.5 31.1 374 41-511 1-416 (416) 139 protein:vir:4598 Length: 416 # 97.6 4.4E-05 2.7E-08 44.5 31.1 374 41-511 1-416 (416) 140 protein:vir:102118 Length: 409 97.5 4.9E-05 3.1E-08 44.2 28.9 380 20-509 1-409 (409) 141 protein:vir:101648 Length: 518 97.5 5.4E-05 3.3E-08 44.0 32.4 410 16-530 1-451 (518) 142 protein:vir:1266 Length: 416 # 97.4 8.2E-05 5.1E-08 43.0 30.9 391 11-503 1-416 (416) 143 protein:vir:100691 Length: 535 97.4 8.6E-05 5.3E-08 42.9 37.2 440 14-529 1-535 (535) 144 protein:vir:79984 Length: 441 97.3 9.2E-05 5.7E-08 42.7 30.5 404 11-525 1-441 (441) 145 protein:vir:9408 Length: 441 # 97.3 9.2E-05 5.7E-08 42.7 30.5 404 11-525 1-441 (441) 146 protein:vir:81152 Length: 411 97.3 9.2E-05 5.7E-08 42.7 29.9 380 11-501 1-411 (411) 147 protein:vir:100150 Length: 437 97.3 0.00011 7E-08 42.2 30.3 409 11-526 1-437 (437) 148 protein:vir:1326 Length: 457 # 97.2 0.00014 8.6E-08 41.7 34.7 428 11-530 1-454 (457) 149 protein:vir:5737 Length: 419 # 97.2 0.00014 8.8E-08 41.7 31.3 390 11-527 1-419 (419) 150 protein:vir:94426 Length: 409 97.2 0.00015 9.4E-08 41.5 29.2 387 11-512 1-409 (409) 151 protein:vir:7853 Length: 518 # 97.1 0.00015 9.4E-08 41.5 31.7 415 16-530 1-462 (518) 152 protein:vir:78641 Length: 278 97.1 0.00016 9.7E-08 41.4 27.5 262 87-434 1-278 (278) 153 protein:vir:79772 Length: 648 97.1 0.00017 1.1E-07 41.3 37.0 427 1-530 49-515 (648) 154 protein:vir:93943 Length: 409 97.1 0.00018 1.1E-07 41.1 29.5 389 11-512 1-409 (409) 155 protein:vir:96980 Length: 409 97.1 0.00019 1.2E-07 41.0 28.6 388 11-512 1-409 (409) 156 protein:vir:4194 Length: 540 # 97.0 0.00023 1.4E-07 40.6 31.6 420 20-530 1-482 (540) 157 protein:vir:98396 Length: 441 97.0 0.00024 1.5E-07 40.4 30.2 384 37-525 1-441 (441) 158 protein:vir:4454 Length: 414 # 97.0 0.00024 1.5E-07 40.4 34.7 388 11-529 1-414 (414) 159 protein:vir:80796 Length: 574 96.9 0.00026 1.6E-07 40.2 34.7 455 1-530 1-534 (574) 160 protein:vir:7407 Length: 392 # 96.9 0.00028 1.7E-07 40.1 31.1 373 11-516 1-392 (392) 161 protein:vir:4337 Length: 434 # 96.8 0.00034 2.1E-07 39.6 31.3 407 13-516 1-434 (434) 162 protein:vir:105064 Length: 421 96.8 0.00035 2.2E-07 39.5 32.0 389 15-525 1-421 (421) 163 protein:vir:107404 Length: 555 96.7 0.00042 2.6E-07 39.1 36.4 476 11-523 1-555 (555) 164 protein:vir:98506 Length: 555 96.7 0.00042 2.6E-07 39.1 36.4 476 11-523 1-555 (555) 165 protein:vir:107822 Length: 555 96.7 0.00042 2.6E-07 39.1 36.4 476 11-523 1-555 (555) 166 protein:vir:103860 Length: 528 96.6 0.00047 2.9E-07 38.8 39.2 416 1-530 1-476 (528) 167 protein:vir:99853 Length: 488 96.6 0.00052 3.2E-07 38.6 33.6 383 1-530 1-410 (488) 168 protein:vir:100882 Length: 383 96.5 0.00056 3.5E-07 38.4 28.9 364 11-521 1-383 (383) 169 protein:vir:1023 Length: 392 # 96.4 0.00065 4E-07 38.1 31.0 355 35-516 1-392 (392) 170 protein:vir:3989 Length: 392 # 96.4 0.00065 4E-07 38.1 31.0 355 35-516 1-392 (392) 171 protein:vir:1785 Length: 555 # 96.4 0.00065 4E-07 38.1 28.9 472 13-522 1-555 (555) 172 protein:vir:3868 Length: 417 # 96.4 0.00066 4.1E-07 38.0 29.0 384 37-521 1-417 (417) 173 protein:vir:1431 Length: 419 # 96.4 0.00067 4.1E-07 38.0 29.0 387 15-529 1-419 (419) 174 protein:vir:107880 Length: 491 96.3 0.00076 4.7E-07 37.7 38.4 410 1-530 1-445 (491) 175 protein:vir:78942 Length: 510 96.3 0.0008 4.9E-07 37.6 33.1 442 13-504 1-510 (510) 176 protein:vir:79233 Length: 526 96.3 0.00082 5.1E-07 37.5 36.8 415 1-530 1-474 (526) 177 protein:vir:103765 Length: 549 96.2 0.00083 5.1E-07 37.5 36.7 476 1-522 1-549 (549) 178 protein:vir:4952 Length: 386 # 96.2 0.00086 5.3E-07 37.4 33.1 370 11-511 1-386 (386) 179 protein:vir:2683 Length: 412 # 96.2 0.00088 5.5E-07 37.3 28.7 388 11-512 1-412 (412) 180 protein:vir:95315 Length: 559 96.2 0.0009 5.6E-07 37.3 37.4 482 1-527 1-559 (559) 181 protein:vir:100039 Length: 522 96.2 0.00094 5.8E-07 37.2 31.5 457 11-528 1-522 (522) 182 protein:vir:81072 Length: 432 96.1 0.001 6.5E-07 36.9 35.1 391 26-530 1-432 (432) 183 protein:vir:6322 Length: 510 # 96.0 0.0011 6.8E-07 36.8 33.1 442 13-504 1-510 (510) 184 protein:vir:1884 Length: 424 # 96.0 0.0011 6.9E-07 36.8 30.2 375 33-521 1-424 (424) 185 protein:vir:103330 Length: 517 96.0 0.0011 7.1E-07 36.7 26.4 441 1-520 1-517 (517) 186 protein:vir:483 Length: 413 # 95.9 0.0013 8.3E-07 36.3 32.7 386 11-525 1-413 (413) 187 protein:vir:7321 Length: 556 # 95.9 0.0013 8.4E-07 36.3 36.9 474 1-529 1-556 (556) 188 protein:vir:1538 Length: 535 # 95.8 0.0014 8.8E-07 36.2 36.7 468 1-529 1-535 (535) 189 protein:vir:95378 Length: 406 95.8 0.0015 9E-07 36.1 28.3 381 11-515 1-406 (406) 190 protein:vir:99232 Length: 526 95.7 0.0016 9.8E-07 35.9 40.5 411 1-530 1-455 (526) 191 protein:vir:94709 Length: 522 95.6 0.0018 1.1E-06 35.6 39.9 458 1-518 1-522 (522) 192 protein:vir:4854 Length: 386 # 95.4 0.0021 1.3E-06 35.3 30.6 370 11-511 1-386 (386) 193 protein:vir:4828 Length: 382 # 95.3 0.0023 1.4E-06 35.0 31.1 369 11-511 1-382 (382) 194 protein:vir:80134 Length: 403 95.2 0.0026 1.6E-06 34.8 29.5 372 11-510 1-403 (403) 195 protein:vir:960 Length: 413 # 95.1 0.0027 1.7E-06 34.7 28.8 380 11-523 1-413 (413) 196 protein:vir:3361 Length: 535 # 95.1 0.0028 1.7E-06 34.6 37.8 470 1-524 1-535 (535) 197 protein:vir:10447 Length: 536 95.1 0.0028 1.8E-06 34.5 36.3 465 1-523 1-536 (536) 198 protein:vir:189 Length: 424 # 94.6 0.004 2.5E-06 33.7 30.3 395 7-521 1-424 (424) 199 protein:vir:4995 Length: 384 # 94.5 0.0042 2.6E-06 33.6 30.9 369 11-504 1-384 (384) 200 protein:vir:80333 Length: 419 94.5 0.0044 2.7E-06 33.5 27.2 378 23-530 1-413 (419) 201 protein:vir:100249 Length: 431 94.4 0.0044 2.8E-06 33.5 32.1 391 11-519 1-431 (431) 202 protein:vir:100187 Length: 385 94.3 0.0047 2.9E-06 33.3 30.7 368 11-520 1-385 (385) 203 protein:vir:79063 Length: 491 94.3 0.0049 3E-06 33.3 32.3 406 1-530 1-445 (491) 204 protein:vir:10362 Length: 432 94.2 0.005 3.1E-06 33.2 34.4 407 5-530 1-432 (432) 205 protein:vir:9702 Length: 406 # 94.2 0.0051 3.1E-06 33.2 28.1 383 11-520 1-406 (406) 206 protein:vir:8883 Length: 543 # 94.1 0.0055 3.4E-06 33.0 37.6 468 1-521 1-543 (543) 207 protein:vir:105782 Length: 449 93.0 0.009 5.6E-06 31.8 26.7 415 11-523 1-449 (449) 208 protein:vir:78696 Length: 542 92.8 0.0099 6.2E-06 31.6 37.3 457 11-527 1-542 (542) 209 protein:vir:2198 Length: 536 # 92.8 0.01 6.2E-06 31.5 39.0 465 1-523 1-536 (536) 210 protein:vir:103219 Length: 201 92.3 0.012 7.4E-06 31.1 15.9 178 302-519 1-201 (201) 211 protein:vir:3648 Length: 695 # 92.1 0.013 7.9E-06 31.0 24.1 423 1-530 69-576 (695) 212 protein:vir:94572 Length: 535 91.7 0.015 9.1E-06 30.6 32.7 471 1-526 1-535 (535) 213 protein:vir:77981 Length: 448 91.2 0.017 1E-05 30.3 32.7 390 1-530 1-439 (448) 214 protein:vir:78589 Length: 695 91.0 0.018 1.1E-05 30.2 24.9 423 1-530 69-576 (695) 215 protein:vir:101541 Length: 694 90.8 0.019 1.2E-05 30.0 25.0 443 1-530 51-575 (694) 216 protein:vir:6210 Length: 394 # 90.1 0.023 1.4E-05 29.6 26.9 372 11-520 1-394 (394) 217 protein:vir:99452 Length: 651 90.0 0.023 1.4E-05 29.6 23.3 448 21-530 1-555 (651) 218 protein:vir:106716 Length: 698 89.1 0.029 1.8E-05 29.1 23.3 421 1-530 69-578 (698) 219 protein:vir:4089 Length: 395 # 88.9 0.03 1.8E-05 29.0 22.3 367 11-512 1-395 (395) 220 protein:vir:104500 Length: 537 88.3 0.033 2.1E-05 28.7 30.1 435 1-530 1-533 (537) 221 protein:vir:96988 Length: 516 88.0 0.035 2.2E-05 28.6 30.7 443 1-515 1-516 (516) 222 protein:vir:1986 Length: 512 # 86.9 0.043 2.6E-05 28.1 37.3 414 1-530 1-459 (512) 223 protein:vir:1082 Length: 359 # 85.1 0.055 3.4E-05 27.5 31.1 335 11-465 1-359 (359) 224 protein:vir:7017 Length: 515 # 84.5 0.06 3.7E-05 27.3 32.4 440 6-515 1-515 (515) 225 protein:vir:97060 Length: 432 84.3 0.062 3.8E-05 27.2 35.4 407 5-530 1-432 (432) 226 protein:vir:108215 Length: 469 83.4 0.069 4.3E-05 26.9 36.4 413 1-530 1-462 (469) 227 protein:vir:104892 Length: 558 82.5 0.077 4.8E-05 26.7 29.6 441 1-530 39-552 (558) 228 protein:vir:8100 Length: 466 # 80.9 0.091 5.6E-05 26.3 30.2 407 11-526 1-466 (466) 229 protein:vir:105641 Length: 516 80.7 0.092 5.7E-05 26.3 32.3 443 1-515 1-516 (516) 230 protein:vir:79511 Length: 448 79.7 0.1 6.3E-05 26.0 34.4 392 1-530 14-440 (448) 231 protein:vir:100598 Length: 516 78.4 0.12 7.1E-05 25.7 28.0 400 1-500 54-516 (516) 232 protein:vir:1661 Length: 378 # 77.2 0.13 7.9E-05 25.5 21.6 338 11-510 1-378 (378) 233 protein:vir:101647 Length: 460 75.0 0.15 9.4E-05 25.1 29.4 401 13-521 1-460 (460) 234 protein:vir:93867 Length: 378 73.5 0.17 0.00011 24.8 22.1 338 11-519 1-378 (378) 235 protein:vir:4509 Length: 424 # 72.6 0.18 0.00011 24.7 31.0 387 20-525 1-424 (424) 236 protein:vir:101189 Length: 516 71.3 0.2 0.00012 24.4 27.2 400 1-500 54-516 (516) 237 protein:vir:101806 Length: 516 71.3 0.2 0.00012 24.4 27.2 400 1-500 54-516 (516) 238 protein:vir:78161 Length: 355 70.7 0.21 0.00013 24.3 30.3 304 133-530 1-343 (355) 239 protein:vir:99672 Length: 532 68.9 0.23 0.00014 24.1 36.1 464 1-520 1-532 (532) 240 protein:vir:9507 Length: 395 # 66.9 0.26 0.00016 23.8 25.0 367 11-526 1-395 (395) 241 protein:vir:100650 Length: 395 66.9 0.26 0.00016 23.8 25.0 367 11-526 1-395 (395) 242 protein:vir:101289 Length: 395 66.9 0.26 0.00016 23.8 25.0 367 11-526 1-395 (395) 243 protein:vir:103177 Length: 533 60.6 0.37 0.00023 23.0 28.4 426 1-530 38-528 (533) 244 protein:vir:94002 Length: 378 60.2 0.38 0.00023 22.9 21.4 338 11-519 1-378 (378) 245 protein:vir:106999 Length: 564 53.0 0.54 0.00034 22.0 27.3 449 1-530 18-557 (564) 246 protein:vir:104259 Length: 403 50.8 0.6 0.00037 21.8 28.0 373 11-526 1-403 (403) 247 protein:vir:5665 Length: 511 # 47.1 0.71 0.00044 21.4 25.8 412 1-497 21-511 (511) 248 protein:vir:95965 Length: 385 38.2 1.1 0.00067 20.4 23.2 354 11-526 1-385 (385) 249 protein:vir:81218 Length: 423 35.0 1.3 0.00078 20.0 32.8 383 11-523 1-423 (423) 250 protein:vir:5839 Length: 533 # 34.7 1.3 0.00079 20.0 26.1 430 1-530 20-533 (533) 251 protein:vir:94666 Length: 723 34.4 1.3 0.0008 20.0 33.8 395 46-530 1-447 (723) 252 protein:vir:94869 Length: 378 29.2 1.7 0.001 19.3 21.0 340 11-519 1-378 (378) 253 protein:vir:80211 Length: 514 28.3 1.8 0.0011 19.2 36.9 441 13-507 1-514 (514) 254 protein:vir:78310 Length: 376 27.2 1.9 0.0012 19.1 25.9 349 11-501 1-376 (376) 255 protein:vir:9641 Length: 395 # 25.4 2.1 0.0013 18.9 26.3 364 11-511 1-395 (395) 256 protein:vir:6896 Length: 523 # 24.3 2.2 0.0014 18.7 25.4 402 1-500 57-523 (523) No 1 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=100.00 E-value=8.2e-128 Score=717.31 Aligned_cols=524 Identities=56% Similarity=0.925 Sum_probs=455.3 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) ||..|+...++.++++++++|.+|.+++++++|+.+++||+|+|+|++|+.+.++.++...++..+||+||++||+++|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~Iv 80 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTELV 80 (537) T ss_pred CCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEc Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFD 160 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d 160 (530) ++.++||||+||+|++.+.+++++++.|+.++++++++++++++++++++|+||+++|+|++|++++++++|+++||||| T Consensus 81 d~~~~yl~G~Pv~~~~~d~~~~e~~~~l~~~~~~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~~~~~i~p~~~~pv~d 160 (537) T protein:vir:78 81 DQLAQYLLSNGVEVKVKDEDNTQLDEILQEYFDEDFQATIDTLVTNASKKGFEGIFARTTSEGKLKFQTVDGLTLIPVFD 160 (537) T ss_pred HHHhhhhcccCceeecCcchhHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeeEEEeeecCCCceEEEEEccceeEEEEc Confidence 99999999999999998888899999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecc Q lcl|NC_011308. 161 DYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDE 240 (530) Q Consensus 161 ~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (530) +++++.+++|+|....+.......+.+.++++||++.+++|+..+++..........+...+..++........... T Consensus 161 ~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--- 237 (537) T protein:vir:78 161 DYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADF--- 237 (537) T ss_pred CCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeeccccccccc--- Confidence 99999999999999888888888899999999999999999999999888888888888888887766544432222 Q ss_pred cccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHH Q lcl|NC_011308. 241 GVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNI 320 (530) Q Consensus 241 ~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~ 320 (530) .........+|+||+||||+|+||++|.|||+++++|||+||.++|+++|.+++|++|+|+++|+++++.++++.++ T Consensus 238 ---~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l 314 (537) T protein:vir:78 238 ---EDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNI 314 (537) T ss_pred ---cccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHHH Confidence 22233456789999999999999999999999999999999999999999999999999999999999889999999 Q ss_pred hhCcceecC-CCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHH Q lcl|NC_011308. 321 QSKKIIQTK-GEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRK 399 (530) Q Consensus 321 ~~~~~i~~~-~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~ 399 (530) +..++|.++ ++|+|+||+|+++.+++++++++|+++||.+|++|++++.++||+||+|||++|++|++||++++++|++ T Consensus 315 ~~~~~i~v~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~~~gn~SGvAlk~~~~~l~~ka~~ke~~f~~ 394 (537) T protein:vir:78 315 KAKKMIGVNGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNSTAVGDGNVTNVVIKSRYTLLAMKARKMETSLRK 394 (537) T ss_pred hhcCceeecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCCccccccCCcHHHHHHHHhhHHHHHHHHHHHHHH Confidence 999999987 4689999999999999999999999999999999999998889999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHH Q lcl|NC_011308. 400 TLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDTLD 479 (530) Q Consensus 400 ~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~ 479 (530) +|++|+++|+++++..+.+++++.+|+++|+|++|+|++|+|+++++++++|++|+||+++++|+|+|++.+....++.+ T Consensus 395 ~l~~~~~~i~~~~~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT~l~~~p~vdd~e~ek~~~ee~~ 474 (537) T protein:vir:78 395 VLRWCADMVVSDIALRGLGEYDSNDICFEIEPHVLANELDIATTRKTEAETEALKIGNIMTVAPRIGDDETLKLIAEELD 474 (537) T ss_pred HHHHHHHHHHHHHhhcCCcccccceeeEEeccCCCCCHHHHHHHHHHHHhcCcchHHHHHHhCCCCCCHHHHHHHHHHHH Confidence 99999999999999999889999999999999999999999999999999999999999999999999965543333434 Q ss_pred HHHHHHHHhhhccccc--cCCccccCCCCCCCCCCccCcCCC---------CcccccccCCC Q lcl|NC_011308. 480 LDYEDVVKALEDQEVE--ELEPTVTPIIDPLTIEPQPEPLNI---------DPVIEEEPVQE 530 (530) Q Consensus 480 ~e~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~ 530 (530) ...++....+.+++.+ +..++.++..++.+.+++++|... .|.|.|.-+-- T Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 536 (537) T protein:vir:78 475 LDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPVADPNVVPPTDPNAVPQ 536 (537) T ss_pred hhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCCCCCCCCCCCCCccCCC Confidence 4444444444433322 222333333333333333333211 11111111111 No 2 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=100.00 E-value=1e-108 Score=612.56 Aligned_cols=462 Identities=18% Similarity=0.229 Sum_probs=379.0 Q ss_pred CCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhccccccccccccc-ccccCCcceeecCchhhHHhhhhhh Q lcl|NC_011308. 8 TAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIV-EDDNASNIKISHGFFAELVDQKTQY 86 (530) Q Consensus 8 ~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~-~~~~~~n~ki~~n~~k~Ivd~~~~y 86 (530) =..+++.++|++++.+| +.++.+|..+++||.|+|+|++|........+... .+..++|+||++||+++||++.++| T Consensus 1 ~~~~~~~~~i~~~~~~~--~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~y 78 (470) T protein:vir:10 1 MELDALKKLIQNTSTSR--NDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGY 78 (470) T ss_pred CchHHHHHHHHHHHHHH--HHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhh Confidence 12233444444444443 34678899999999999999999876655444433 3456899999999999999999999 Q ss_pred hcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEcCC--CC Q lcl|NC_011308. 87 LLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFDDY--GT 164 (530) Q Consensus 87 l~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~--~~ 164 (530) |||+||+|++.+ ++..+.|++++++++.+.+.+++++++++|+||+++|+|++|+++++++||.++||+||++ ++ T Consensus 79 l~G~p~~~~~~d---~~~~~~l~~~~~~~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~~~~~~~p~~~~~v~d~~~~~~ 155 (470) T protein:vir:10 79 VASVFPDIDVGK---DADNKKIIDVLGDDRALTLNGLLVDSSNAGRAWLHYWIDEDGNFRYGIIQPDQITPIYATTLDNK 155 (470) T ss_pred eeccceeeecCc---hHHHHHHHHHHhhhHHHHHHHHHHHHhhcCeeEEEEEecCCCceEEEEEcccceEEEEcCCCCCc Confidence 999999998654 4567788899888999999999999999999999999999999999999999999999875 57 Q ss_pred ceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccc Q lcl|NC_011308. 165 LQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEE 244 (530) Q Consensus 165 ~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (530) +.+++|+|...... ....+.++++||++.+++|...+++......... .......... T Consensus 156 ~~a~ir~y~~~~~~----~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~------------------~~~~~~~~~~ 213 (470) T protein:vir:10 156 LLGILRSYKQLDPD----SGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNI------------------ITSYDLSAGY 213 (470) T ss_pred eEEEEEEEEeeecC----CceEEEEEEEEcCCcEEEEEeecCcceecccccc------------------cccccccccc Confidence 89999999765433 2456778999999999999877765433211110 0000111112 Q ss_pred cccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCc Q lcl|NC_011308. 245 HEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKK 324 (530) Q Consensus 245 ~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~ 324 (530) ........+|+||+||||+|+||++|+|||+++++||||||.++|+++|.+++|++|+|+++|+++++.+++..+++..+ T Consensus 214 ~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~ 293 (470) T protein:vir:10 214 ETGQSNTLKHNFGRVPFIEFSKNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMNDLRKYK 293 (470) T ss_pred ccccccccccCCCeeeEEEeecCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhhhhhhcC Confidence 23345567899999999999999999999999999999999999999999999999999999999988889999999999 Q ss_pred ceecCC-----CCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHH Q lcl|NC_011308. 325 IIQTKG-----EGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRK 399 (530) Q Consensus 325 ~i~~~~-----~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~ 399 (530) ++.++. +++|+||+|+++.+++++++++|.+.||.+|++|++++.++||+||+||+++|++|++||+++++.|++ T Consensus 294 ~i~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~ 373 (470) T protein:vir:10 294 SIKINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEH 373 (470) T ss_pred eEeccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCCCccccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 998864 467999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHH Q lcl|NC_011308. 400 TLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDTLD 479 (530) Q Consensus 400 ~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~ 479 (530) +|++++++|+++++. +++++.+|+++|++++|.|++|+|++++.+ +|++|+||+++++|+++|+++|++|+++|+ T Consensus 374 ~l~~~~~~i~~~l~~---~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~--~g~iS~et~l~~~p~v~D~~~E~eri~~E~ 448 (470) T protein:vir:10 374 AINELVRAIMRYLNF---SDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDDWQQELKDLAKDK 448 (470) T ss_pred HHHHHHHHHHHHhcc---cCcccceeeEEeccCCCCCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHH Confidence 999999999998754 456788999999999999999999986554 689999999999999999999999999887 Q ss_pred HHHHHHHHhhhccccccCCccccCCCCCC Q lcl|NC_011308. 480 LDYEDVVKALEDQEVEELEPTVTPIIDPL 508 (530) Q Consensus 480 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (530) .+.....+.+.+ .. +.++.|+. T Consensus 449 ~e~~~~~~~~~~-----~~--~~~~dde~ 470 (470) T protein:vir:10 449 EENDPYSNQADE-----LN--GKGVNDEQ 470 (470) T ss_pred HHHHHhhccccc-----cC--CCCCCCCC Confidence 665443322211 11 11111111 No 3 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=100.00 E-value=3.8e-105 Score=593.05 Aligned_cols=436 Identities=21% Similarity=0.280 Sum_probs=375.5 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) ++.+ .|.++|++|.. ++.++.++++||.|+|+|++|.....+.. ..+..++|+||++||+++||++.++||||+ T Consensus 1 l~~~-~i~~~i~~~~~--~~~r~~~~~~YY~g~~~i~~~~~~~~~~~---~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~ 74 (451) T protein:vir:10 1 MELE-KIRAIISADAA--RRQEILQAKSYYYNKNDILKKGVVVQNRD---ENPLRNADNRISHNFHEILVDEKASYMFTY 74 (451) T ss_pred CCHH-HHHHHHHHHHH--HHHHHHHHHHHhcccCccccccccccccc---cccccccccccccchHHHHHHhhhhheecc Confidence 5544 56888888763 57899999999999999999876554432 234578999999999999999999999999 Q ss_pred ceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCC--------CceEEEEecccceEEEEcCC Q lcl|NC_011308. 91 GIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSE--------DKLTFQTVDALQLLPVFDDY 162 (530) Q Consensus 91 pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~--------g~~~~~~~~p~~~~~v~d~~ 162 (530) ||+|++.+ ++...+.|+.+++|++++++.+++++++++|+||+++|+|++ |++++++++|.++||+||++ T Consensus 75 p~~~~~~~--~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vydd~ 152 (451) T protein:vir:10 75 PVLFDIDN--NKELNEKVTDVLGNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYRNG 152 (451) T ss_pred cceeecCC--cHHHHHHHHHHhccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEcCC Confidence 99998754 455678888888899999999999999999999999999986 88999999999999999874 Q ss_pred --CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecc Q lcl|NC_011308. 163 --GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDE 240 (530) Q Consensus 163 --~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (530) +++.+++|+|............+.+.++++||++.+++|.....+.... T Consensus 153 ~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~----------------------------- 203 (451) T protein:vir:10 153 IERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGS----------------------------- 203 (451) T ss_pred CCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCcccc----------------------------- Confidence 5789999999877766666667788999999999999997655543221 Q ss_pred cccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHH Q lcl|NC_011308. 241 GVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNI 320 (530) Q Consensus 241 ~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~ 320 (530) .......+|+||.||||+|+||++|.|+|+++++||||||.++|+++|.+++|++|+|+++|+++++.+++..++ T Consensus 204 -----~~~~~~~~~~~g~vPvv~~~nn~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~ 278 (451) T protein:vir:10 204 -----QIEHITVQHRFNSVPFVEFSNNIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKEL 278 (451) T ss_pred -----ccccccccCCCCeeeEEEeccCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHHH Confidence 112335679999999999999999999999999999999999999999999999999999999998888899999 Q ss_pred hhCcceecCC-----CCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHHHHH Q lcl|NC_011308. 321 QSKKIIQTKG-----EGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEI 395 (530) Q Consensus 321 ~~~~~i~~~~-----~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~ 395 (530) +..+++.+.. +|+|+||+|+++.+++++++++|.++||.+|++|+++++++||+||+||+++|++|++||+++++ T Consensus 279 ~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~k~~ 358 (451) T protein:vir:10 279 KRYKTIKTETDSEGDSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQQDTENFGNASGVALKFFYRKLELKSGLLET 358 (451) T ss_pred hhCCeEEecCcCCccCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccccHHHHHHHHHHHHHHHHHHHH Confidence 9999988753 57899999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHH Q lcl|NC_011308. 396 ALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAIC 475 (530) Q Consensus 396 ~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~ 475 (530) .|+++|++++++|+++++. .++.+++++|++++|+|++|.|++++.+ +|++|+||+++++|+++|++++++++ T Consensus 359 ~f~~~l~~~~~li~~~~~~-----~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~~p~v~d~~~e~~~~ 431 (451) T protein:vir:10 359 EFRTSFDKLIKAILYFLGV-----TDYKKIQQTYTRNMMSNDLEDADIATKS--VGIIPTKIILRHHPWVDDVEEAEKLY 431 (451) T ss_pred HHHHHHHHHHHHHHHHhCC-----CCccceeEEecCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHH Confidence 9999999999999998753 3567899999999999999999987665 58899999999999999999998888 Q ss_pred HHHHHHHHHHHHhhhccccccCCc Q lcl|NC_011308. 476 DTLDLDYEDVVKALEDQEVEELEP 499 (530) Q Consensus 476 e~e~~e~~~~~~~~~~~~~~~~~~ 499 (530) ++++++..+.+...... ..+ T Consensus 432 ~ee~~~~~~~~~~~~~~----~~~ 451 (451) T protein:vir:10 432 LEEKKIQASKVSDDYNN----FTE 451 (451) T ss_pred HHHHHHHHHHHHhhcCC----CCC Confidence 77666543333221111 111 No 4 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=100.00 E-value=5.1e-105 Score=592.37 Aligned_cols=456 Identities=21% Similarity=0.314 Sum_probs=383.1 Q ss_pred CCccc---------------ccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhccccccccccccccccc Q lcl|NC_011308. 1 MTNTL---------------LTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDN 65 (530) Q Consensus 1 ~~~~~---------------~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~ 65 (530) ||++. +...-.+..++|+++|++|. .++.++..+++||.|+|+|+.|+..... ....+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~~~~~l~~Yy~g~~~i~~~~~~~~~---~~~~~~~ 75 (474) T protein:vir:95 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHK--QKLKDINVGQKYYDKDNDINYQAYKQDL---HGNIDYT 75 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHH--HHHHHHHHHHHHhcccCccccccchhhh---ccccccc Confidence 33321 12223567788999999886 3678999999999999999998654222 2234567 Q ss_pred CCcceeecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCce Q lcl|NC_011308. 66 ASNIKISHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKL 145 (530) Q Consensus 66 ~~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~ 145 (530) ++|+||++||+++||++.++||||+||+|++. +++..+.|+.++++++.+++.+++++++++|+||+++|+|++|++ T Consensus 76 ~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~---~~~~~~~l~~~~~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~ 152 (474) T protein:vir:95 76 KPDWRITTNFHQNLVDQKVSYVAGKPVTYAHD---DDKVLDVIHQVLDTRWDNKLIDILTAASNKGIDWLQVYINEDGEL 152 (474) T ss_pred ccccccccchHHHHHHhhhhhhcccCceeccC---ChHHHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEeeeCCCCce Confidence 89999999999999999999999999999764 455678888888899999999999999999999999999999999 Q ss_pred EEEEecccceEEEEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhcccccccccc Q lcl|NC_011308. 146 TFQTVDALQLLPVFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPS 223 (530) Q Consensus 146 ~~~~~~p~~~~~v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~ 223 (530) +++++||.++||+||++ .++.+++|+|.. +...++++||++.+++|...+++...... T Consensus 153 ~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~----------~~~~~~~vy~~~~i~~~~~~~~~~~~~~~---------- 212 (474) T protein:vir:95 153 KLFRVPAEQAIPIWTDKEREQLNAFIRIFTF----------NGETKVEYWTAETVTYYVYENGGLIPDFY---------- 212 (474) T ss_pred EEEEEcccceEEEEcCCCCCceEEEEEEEee----------cCeeEEEEEeCCeEEEEEEcCCceeeccc---------- Confidence 99999999999999874 578899988753 23468999999999999877654322111 Q ss_pred ceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcccee Q lcl|NC_011308. 224 QHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIY 303 (530) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~l 303 (530) ...........+|+||.||||+|+||.+|.|+|+.+++||||||.++|+++|.+++|++|+| T Consensus 213 ------------------~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~l 274 (474) T protein:vir:95 213 ------------------YGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIY 274 (474) T ss_pred ------------------cccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchh Confidence 11222334456799999999999999999999999999999999999999999999999999 Q ss_pred eeecCCCCchhhHHHHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHH Q lcl|NC_011308. 304 VVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSR 382 (530) Q Consensus 304 vl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~ 382 (530) +++|+++++.+++..+++..+++.++++|+|+||+|+++.+++++++++|.+.||.+|++|++++.++ ||+||+||+++ T Consensus 275 v~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~ 354 (474) T protein:vir:95 275 ILRGYEGEDLSEFMEGLKYYKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFL 354 (474) T ss_pred hhcCCCcccccchhhhhhccceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHH Confidence 99999998888889999999999999999999999999999999999999999999999999999988 58999999999 Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhC Q lcl|NC_011308. 383 YTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIA 462 (530) Q Consensus 383 ~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~ 462 (530) |+++++||+++++.|+++|++++++|+++++ ..++..+|+++|++++|+|++|.|++++ .+|++|+||+++++ T Consensus 355 ~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g----~~~d~~~i~i~f~~~~p~~~~e~a~~~~---~~giiS~et~~~~l 427 (474) T protein:vir:95 355 YTNLNLKANKLKNKANVALQELMQFILDFNK----IKLDAKEIEITFNFNVMVNDLEQSQIGA---QSQYLSKETLVRHH 427 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC----CCcccceeeEEecCCCccCHHHHHHHHH---HcCCCChHHHHHhC Confidence 9999999999999999999999999998764 3467889999999999999999998743 36999999999999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCC Q lcl|NC_011308. 463 PRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLT 509 (530) Q Consensus 463 ~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (530) |+++|+++|++|+++|+++..+.++.......++..+++++..+..+ T Consensus 428 p~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 428 PWVDDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred CCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccccC Confidence 99999999999999998877666665555443333333332211111 No 5 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=100.00 E-value=5.1e-105 Score=592.37 Aligned_cols=456 Identities=21% Similarity=0.314 Sum_probs=383.1 Q ss_pred CCccc---------------ccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhccccccccccccccccc Q lcl|NC_011308. 1 MTNTL---------------LTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDN 65 (530) Q Consensus 1 ~~~~~---------------~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~ 65 (530) ||++. +...-.+..++|+++|++|. .++.++..+++||.|+|+|+.|+..... ....+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~~~~~l~~Yy~g~~~i~~~~~~~~~---~~~~~~~ 75 (474) T protein:vir:96 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHK--QKLKDINVGQKYYDKDNDINYQAYKQDL---HGNIDYT 75 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHH--HHHHHHHHHHHHhcccCccccccchhhh---ccccccc Confidence 33321 12223567788999999886 3678999999999999999998654222 2234567 Q ss_pred CCcceeecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCce Q lcl|NC_011308. 66 ASNIKISHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKL 145 (530) Q Consensus 66 ~~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~ 145 (530) ++|+||++||+++||++.++||||+||+|++. +++..+.|+.++++++.+++.+++++++++|+||+++|+|++|++ T Consensus 76 ~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~---~~~~~~~l~~~~~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~ 152 (474) T protein:vir:96 76 KPDWRITTNFHQNLVDQKVSYVAGKPVTYAHD---DDKVLDVIHQVLDTRWDNKLIDILTAASNKGIDWLQVYINEDGEL 152 (474) T ss_pred ccccccccchHHHHHHhhhhhhcccCceeccC---ChHHHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEeeeCCCCce Confidence 89999999999999999999999999999764 455678888888899999999999999999999999999999999 Q ss_pred EEEEecccceEEEEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhcccccccccc Q lcl|NC_011308. 146 TFQTVDALQLLPVFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPS 223 (530) Q Consensus 146 ~~~~~~p~~~~~v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~ 223 (530) +++++||.++||+||++ .++.+++|+|.. +...++++||++.+++|...+++...... T Consensus 153 ~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~----------~~~~~~~vy~~~~i~~~~~~~~~~~~~~~---------- 212 (474) T protein:vir:96 153 KLFRVPAEQAIPIWTDKEREQLNAFIRIFTF----------NGETKVEYWTAETVTYYVYENGGLIPDFY---------- 212 (474) T ss_pred EEEEEcccceEEEEcCCCCCceEEEEEEEee----------cCeeEEEEEeCCeEEEEEEcCCceeeccc---------- Confidence 99999999999999874 578899988753 23468999999999999877654322111 Q ss_pred ceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcccee Q lcl|NC_011308. 224 QHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIY 303 (530) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~l 303 (530) ...........+|+||.||||+|+||.+|.|+|+.+++||||||.++|+++|.+++|++|+| T Consensus 213 ------------------~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~l 274 (474) T protein:vir:96 213 ------------------YGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIY 274 (474) T ss_pred ------------------cccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchh Confidence 11222334456799999999999999999999999999999999999999999999999999 Q ss_pred eeecCCCCchhhHHHHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHH Q lcl|NC_011308. 304 VVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSR 382 (530) Q Consensus 304 vl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~ 382 (530) +++|+++++.+++..+++..+++.++++|+|+||+|+++.+++++++++|.+.||.+|++|++++.++ ||+||+||+++ T Consensus 275 v~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~ 354 (474) T protein:vir:96 275 ILRGYEGEDLSEFMEGLKYYKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFL 354 (474) T ss_pred hhcCCCcccccchhhhhhccceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHH Confidence 99999998888889999999999999999999999999999999999999999999999999999988 58999999999 Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhC Q lcl|NC_011308. 383 YTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIA 462 (530) Q Consensus 383 ~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~ 462 (530) |+++++||+++++.|+++|++++++|+++++ ..++..+|+++|++++|+|++|.|++++ .+|++|+||+++++ T Consensus 355 ~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g----~~~d~~~i~i~f~~~~p~~~~e~a~~~~---~~giiS~et~~~~l 427 (474) T protein:vir:96 355 YTNLNLKANKLKNKANVALQELMQFILDFNK----IKLDAKEIEITFNFNVMVNDLEQSQIGA---QSQYLSKETLVRHH 427 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC----CCcccceeeEEecCCCccCHHHHHHHHH---HcCCCChHHHHHhC Confidence 9999999999999999999999999998764 3467889999999999999999998743 36999999999999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCC Q lcl|NC_011308. 463 PRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLT 509 (530) Q Consensus 463 ~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (530) |+++|+++|++|+++|+++..+.++.......++..+++++..+..+ T Consensus 428 p~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 428 PWVDDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred CCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccccC Confidence 99999999999999998877666665555443333333332211111 No 6 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=100.00 E-value=1.4e-104 Score=589.88 Aligned_cols=455 Identities=20% Similarity=0.288 Sum_probs=373.8 Q ss_pred ccHH---HHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhccccccccc-----ccccccccCCcceeecCchhhHHhh Q lcl|NC_011308. 11 DRLG---TILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDH-----GDIVEDDNASNIKISHGFFAELVDQ 82 (530) Q Consensus 11 ~~~~---~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~-----~~~~~~~~~~n~ki~~n~~k~Ivd~ 82 (530) |+++ ++|.++|.+| ++++.++..+++||+|+|+|++++....... +.......++|+||++||+++||++ T Consensus 1 ~~~e~~~~~i~~~~~~~--~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~ 78 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKH--GKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQ 78 (471) T ss_pred CCHHHHHHHHHHHHHHH--HHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHh Confidence 4444 4444444444 3467889999999999999998765432221 1112234568999999999999999 Q ss_pred hhhhhcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecC-CCceEEEEecccceEEEEcC Q lcl|NC_011308. 83 KTQYLLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTS-EDKLTFQTVDALQLLPVFDD 161 (530) Q Consensus 83 ~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~-~g~~~~~~~~p~~~~~v~d~ 161 (530) .++||||+||+|++. +++..+.|+.++++++++++.++++.++++|+||+++|+++ +|+++++++||.++||+||+ T Consensus 79 ~~~yl~G~p~~~~~~---~~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~~~~~~~~p~~~~~i~d~ 155 (471) T protein:vir:10 79 KKAYALTYPPTFDVD---DKKVNDMIVDVLGDDYERISKQLCVNAGNAGIAWLHVWKDASDNSFRYACVDSKEVIPIYSK 155 (471) T ss_pred hhhhhcccCceeccC---ChHHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEEEeeCCCCeeEEEEEcccceEEEEcC Confidence 999999999999764 45677888888899999999999999999999999999995 69999999999999999997 Q ss_pred CC--CceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceec Q lcl|NC_011308. 162 YG--TLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILD 239 (530) Q Consensus 162 ~~--~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (530) .. ++.+++|+|..... ...+.+.++++||++++++|...+++.............. T Consensus 156 ~~~~~~~~~ir~~~~~~~----~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~------------------ 213 (471) T protein:vir:10 156 SLDKKSIGVLRVYSSIDE----TDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLI------------------ 213 (471) T ss_pred CCCCceEEEEEEEEeecc----CCCceeEEEEEEeCCcEEEEEecCCccccccccccccccc------------------ Confidence 54 58899999876432 2356688999999999999998877654433221110000 Q ss_pred ccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHH Q lcl|NC_011308. 240 EGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKN 319 (530) Q Consensus 240 ~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~ 319 (530) .............+|+||+||||+|+||.+|.|+|+++++||||||.++|+++|.+++|++|+|+++|+++++.+++..+ T Consensus 214 ~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~ 293 (471) T protein:vir:10 214 DTMNGDRSSDNSFKHDFGLVPFIPFKNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLED 293 (471) T ss_pred ccccccccccccccCCCCceeEEEeccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhHHH Confidence 00111223445578999999999999999999999999999999999999999999999999999999998888899999 Q ss_pred HhhCcceecCC-----CCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHHHH Q lcl|NC_011308. 320 IQSKKIIQTKG-----EGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKTE 394 (530) Q Consensus 320 ~~~~~~i~~~~-----~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke 394 (530) ++.++++++++ +++|+|++|+++.+++++++++|.+.||.+|++|++++.++||+||+||+++|++|++||++++ T Consensus 294 ~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~ 373 (471) T protein:vir:10 294 LKRYKMIKMDNDGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDKLGNSSGVALKFLYSLLELKAGNME 373 (471) T ss_pred hhcCCeEEecCCCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCcccccCccHHHHHHHHHHHHHHHHHHH Confidence 99999998854 3589999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHH Q lcl|NC_011308. 395 IALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAI 474 (530) Q Consensus 395 ~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~ 474 (530) +.|+++|++++++|+++++.. +..+++++|++++|.|++++|++++++ +|+||+||+++++|+++|+++|++| T Consensus 374 ~~~~~~l~~~~~li~~~~~~~-----d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~~p~v~D~~~E~er 446 (471) T protein:vir:10 374 TQFRSGYATLVKMILKHLGLS-----DKLKIKQTWTRNSINNDTEMAQVVSTL--ATITSRENVAKSNPIVEDWQDELRL 446 (471) T ss_pred HHHHHHHHHHHHHHHHHhccC-----CCceeEEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHH Confidence 999999999999999987543 457899999999999999999986665 6899999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhhccccccCCcccc Q lcl|NC_011308. 475 CDTLDLDYEDVVKALEDQEVEELEPTVT 502 (530) Q Consensus 475 ~e~e~~e~~~~~~~~~~~~~~~~~~~~~ 502 (530) +++|+.+..+.+.......+ +++.+ T Consensus 447 i~~E~~~~~~~~~~~~~~~~---~~e~~ 471 (471) T protein:vir:10 447 QKAEQEGRSEKLYDMEEVEH---ESEVE 471 (471) T ss_pred HHHHHHHHHhcccccCCCCC---ccccC Confidence 99887766444443322211 11111 No 7 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=100.00 E-value=6e-104 Score=586.46 Aligned_cols=486 Identities=22% Similarity=0.313 Sum_probs=407.7 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) +.+.+.......+.+....+|++|+++++++++..+.+||.|+|+|++++.+..+..+....+..++|+||++||+++|| T Consensus 14 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv 93 (503) T protein:vir:59 14 ELNEIIVESAKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNNRTSHAWHKLFV 93 (503) T ss_pred hHHHhhhhhhhhccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhcccccccccccccccceeecchHHHHH Confidence 44445555555555666677888888788899999999999999999999888888888888889999999999999999 Q ss_pred hhhhhhhcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEc Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFD 160 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d 160 (530) ++.++||||+||+|+++ +++..+.|+.++++++.+++.+++++++++|++|+++|+|++|++++++++|.++||+|| T Consensus 94 d~~~~yl~g~~~~~~~~---d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~~i~d 170 (503) T protein:vir:59 94 DQKTQYLVGEPVTFTSD---NKTLLEYVNELADDDFDDILNETVKNMSNKGIEYWHPFVDEEGEFDYVIFPAEEMIVVYK 170 (503) T ss_pred HHHHhhhhcCCeeeccC---cHHHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEEeecCCCceEEEEEccceeEEEEe Confidence 99999999999999764 456777888888899999999999999999999999999999999999999999999998 Q ss_pred CC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccccee Q lcl|NC_011308. 161 DY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAIL 238 (530) Q Consensus 161 ~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (530) +. .++.+++|+|.... ..++.+.++++||++.+++|...+++......... T Consensus 171 ~~~~~~~~~~ir~~~~~~-----~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~---------------------- 223 (503) T protein:vir:59 171 DNTRRDILFALRYYSYKG-----IMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGE---------------------- 223 (503) T ss_pred CCCCCceEEEEEEEEEec-----CCCceEEEEEEEeCCcEEEEEEcCCcccccccccc---------------------- Confidence 75 56888899886532 23456789999999999999988776543221111 Q ss_pred cccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHH Q lcl|NC_011308. 239 DEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKK 318 (530) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~ 318 (530) ............+|+|++|||++|+||.+|.|+|+.+++|||+||.++|++++.+++|++|+|+++|.++++.+++.. T Consensus 224 --~~~~~~~~~~~~~~~~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~ 301 (503) T protein:vir:59 224 --NNPRPHMTKGGQAIGWGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTA 301 (503) T ss_pred --cccccceeecceeccCCccceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhh Confidence 001111223456899999999999999999999999999999999999999999999999999999999988888999 Q ss_pred HHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhhHHHHHHHHHHHH Q lcl|NC_011308. 319 NIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTLLAMKAQKTEIAL 397 (530) Q Consensus 319 ~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~l~~ka~~ke~~f 397 (530) +++..+++.++++|+|+|++|+++.++++.++++|++.||.+|++|++++..+ ||+||+||+++++++.+||+++++.| T Consensus 302 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~ 381 (503) T protein:vir:59 302 NLRYHSVIKVSGDGGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETIGGGATGPALENLYALLDLKANMAERKI 381 (503) T ss_pred hhhcccceeccCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999998877 78999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCccc-cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHH Q lcl|NC_011308. 398 RKTLRWTADLVVEDIRRRGLGDYS-STDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICD 476 (530) Q Consensus 398 ~~~l~~~~~~i~~~l~~~~~~~~d-~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e 476 (530) +++|++++++|+++++..+...++ ..+++++|++++|.|+++.|++++.++++|++|+||+++++|+++|+++|+++++ T Consensus 382 ~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~l~~v~d~~~E~~ri~ 461 (503) T protein:vir:59 382 RAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAVARNPFVQDPEEELARIE 461 (503) T ss_pred HHHHHHHHHHHHHHHHhccCcccccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHhCCCCCCHHHHHHHHH Confidence 999999999999999988776654 4679999999999999999999999999999999999999999999999999998 Q ss_pred HHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccc Q lcl|NC_011308. 477 TLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIE 524 (530) Q Consensus 477 ~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (530) +|+.+..+....+..+.... ..+. .+.+++.+.+-|.++.++ T Consensus 462 ~E~~~~~~~~~~~~~~~~~~---~~~~---~~~~~~~~~~~~~~g~~~ 503 (503) T protein:vir:59 462 EEMNQYAEMQGNLLDDEGGD---DDLE---EDDPNAGAAESGGAGQVS 503 (503) T ss_pred HHHHHHHhhhccccCccCCC---CCCC---cCCCCCCcccCCCCCCcC Confidence 87766554443333322111 1111 112222222234444444 No 8 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=100.00 E-value=2.7e-103 Score=582.87 Aligned_cols=470 Identities=25% Similarity=0.355 Sum_probs=395.6 Q ss_pred CCccc-ccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhH Q lcl|NC_011308. 1 MTNTL-LTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAEL 79 (530) Q Consensus 1 ~~~~~-~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~I 79 (530) |-..+ ....++.....+.++|++|++++++++++.+++||+|+|+|++|+...... +...++..++|+||++||+++| T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~-~~~~~~~~~~~~ki~~~~~~~I 85 (479) T protein:vir:79 7 SETDLIKVQLKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLD-GAKVDDFTKVNNKAINNYHKLL 85 (479) T ss_pred cccceEeeccccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccc-cccccccccCcceeecchHHHH Confidence 21111 111233444567788999988889999999999999999999987665444 3445667889999999999999 Q ss_pred HhhhhhhhcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEE Q lcl|NC_011308. 80 VDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVF 159 (530) Q Consensus 80 vd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~ 159 (530) |++.++||+|+||+|++.+ +...+.|+.+++|++++.+.++++.++++|++|+++|.|++|+++++++||.++||+| T Consensus 86 vd~~~~~l~g~p~~~~~~~---~~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~ 162 (479) T protein:vir:79 86 VDQKVGYSVGNPIVFNADD---DNLTKLLNDLLGEEFDDTITELYLNASNKGVEWLHPYINRKGEFKYVIIPAEEAIPIW 162 (479) T ss_pred HHHHHhhhhcCCceeccCC---HHHHHHHHHHHhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEccceeEEEE Confidence 9999999999999997654 4566778888889999999999999999999999999999999999999999999999 Q ss_pred cCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccce Q lcl|NC_011308. 160 DDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAI 237 (530) Q Consensus 160 d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (530) |+. .++.+++|+|..... .++.+.++++|+++.+++|+..+++.......... . T Consensus 163 d~~~~~~~~~~ir~y~~~~~-----~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~-------------------~ 218 (479) T protein:vir:79 163 DSKRQRELVAFIRFYYIEDI-----DGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEY-------------------G 218 (479) T ss_pred eCCCCCceEEEEEEEEEeec-----CCceEEEEEEEeCCcEEEEEecCCccccccccccc-------------------c Confidence 875 468899999876542 34567899999999999998877654332221111 1 Q ss_pred ecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHH Q lcl|NC_011308. 238 LDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIK 317 (530) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~ 317 (530) ...+...........+|+||.||||+|+||++|.|+|+.+++|||+||.++|+++|.+++|++|+++++|++++..+++. T Consensus 219 ~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~ 298 (479) T protein:vir:79 219 KMTDIQEGHFRINNKEQGWGKVPFIPFKNNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFI 298 (479) T ss_pred cccccccccccccccccCCCcccEEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccch Confidence 11122233445566789999999999999999999999999999999999999999999999999999999888888888 Q ss_pred HHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHHHHHHH Q lcl|NC_011308. 318 KNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEIAL 397 (530) Q Consensus 318 ~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f 397 (530) .+++..+++.++++|+|+|++++++.+++++++++|++.||.+|++|++++.++||+||+||++++++|++||.++++.| T Consensus 299 ~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Ai~~~~~~l~~k~~~~~~~~ 378 (479) T protein:vir:79 299 DNIRYYKSIKVDGGGGVDKLEINIPVEAKKELLDRLEKNIIIFGQGVNPESQNTGDKSGVALKFLYSLLDLKCSKTEKKF 378 (479) T ss_pred hhhhhccceecCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCccccccccccchhHHHHHHHHHHHHHHHHHHHHHH Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_011308. 398 RKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDT 477 (530) Q Consensus 398 ~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~ 477 (530) +++|++++++|+++++..+..+++..+++|+|++++|.|+++.|++++.+ +|++|+||+++++|+++|+++|++|+++ T Consensus 379 ~~~l~~~~~li~~~~~~~~~~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl--~g~iS~et~l~~l~~v~d~~~E~~ri~~ 456 (479) T protein:vir:79 379 KKAIRELLWFVCEYLKISGNKSYDYKTVQITFNHSMIINEAEKIDMAAKS--TGIVSDETIVSNHPWVEDVNDELERLKK 456 (479) T ss_pred HHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHH Confidence 99999999999999999998999999999999999999999999986665 5899999999999999999999999998 Q ss_pred HHHHHHHHHHhhhccccccCCccccCCCCCC Q lcl|NC_011308. 478 LDLDYEDVVKALEDQEVEELEPTVTPIIDPL 508 (530) Q Consensus 478 e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (530) |+.+..+........ .+++.+.. T Consensus 457 E~~~~~~~~~~~~~~--------~~~~~~e~ 479 (479) T protein:vir:79 457 QEDTQKEYDDLIPNN--------QDGVIDET 479 (479) T ss_pred HHHHHHHHHhccCcc--------cCCCcCcC Confidence 877654433332211 11111110 No 9 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=100.00 E-value=6.6e-103 Score=580.77 Aligned_cols=457 Identities=22% Similarity=0.302 Sum_probs=383.1 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) -........++...++|+++|.+|. .++.++.++.+||.|+|+|++++.+.... ...+..++|+||++||+++|| T Consensus 33 ~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~r~~~l~~YY~g~~~I~~~~~~~~~~---~~~~~~~~~~ri~~n~~k~Iv 107 (492) T protein:vir:94 33 DAIVRTNNKPETLEEMIVRYIKQHL--EKLPEISIGQEYYEQRPDIVKEPKPVDAT---GAVDPLKPDDRMITNFHANLV 107 (492) T ss_pred hcccccCCchhhHHHHHHHHHHHHH--HHHHHHHHHHHHhcccccccccccccccc---ccccccccccccccchHHHHH Confidence 2223345677888999999999886 35688999999999999999987654332 234567899999999999999 Q ss_pred hhhhhhhcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEc Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFD 160 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d 160 (530) ++.++||+|+||+|++. ++...+.|+.++++++++.+.+++++++++|+||+++|.|++|++++++++|.++||+|| T Consensus 108 d~~~~yl~G~p~~~~~~---d~~~~~~l~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~~~~~~p~~~~~v~d 184 (492) T protein:vir:94 108 DQKVSYIVGKPIAFKHT---DDEVVKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWT 184 (492) T ss_pred HHHHhhhcccCceeccC---chHHHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEEcccceEEEEc Confidence 99999999999999764 456788899999899999999999999999999999999999999999999999999998 Q ss_pred CC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccccee Q lcl|NC_011308. 161 DY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAIL 238 (530) Q Consensus 161 ~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (530) ++ .++.+++|+|... ...++++||+..+++|....++..... T Consensus 185 ~~~~~~~~a~ir~~~~~----------~~~~~~~y~~~~v~~~~~~~~~~~~~~-------------------------- 228 (492) T protein:vir:94 185 DKEHEELEAFIRMYKLE----------NETKVEYWDKVTVNYYVYENGSLIPDY-------------------------- 228 (492) T ss_pred CCCCCceEEEEEEEeec----------cceeEEEEecCeEEEEEEecCeeeecc-------------------------- Confidence 64 5688999988642 234689999999999987665432211 Q ss_pred cccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHH Q lcl|NC_011308. 239 DEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKK 318 (530) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~ 318 (530) ......+.....+|+||.||||+|+||++|+|+|+++++||||||.++|++++.+++|++|+++++|+++++.+++.. T Consensus 229 --~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~ 306 (492) T protein:vir:94 229 --SNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKR 306 (492) T ss_pred --ccccccccccccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhHH Confidence 111223344567899999999999999999999999999999999999999999999999999999999988888889 Q ss_pred HHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhhHHHHHHHHHHHH Q lcl|NC_011308. 319 NIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTLLAMKAQKTEIAL 397 (530) Q Consensus 319 ~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~l~~ka~~ke~~f 397 (530) +++..+++.++++|+|+|++|+++.++++.++++|++.||.+|++|++++.++ ||+||+||++++++|++||+++++.| T Consensus 307 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f 386 (492) T protein:vir:94 307 LLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKA 386 (492) T ss_pred HHhhccceecCCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999988 68999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_011308. 398 RKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDT 477 (530) Q Consensus 398 ~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~ 477 (530) +++|++++++|+++++... +..+++++|++++|+|+++.|++++.+ .|++|+||+++++|+++|+++|++++++ T Consensus 387 ~~~l~~~~~li~~~~~~~~----~~~~i~v~f~~~~p~~~~e~~~~~~kl--~giiS~et~~~~l~~v~d~~~E~eri~~ 460 (492) T protein:vir:94 387 KVAIQELLWFVFEHFDIKG----EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQ 460 (492) T ss_pred HHHHHHHHHHHHHHhcCCc----ccceeeEEecCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHH Confidence 9999999999999876543 567899999999999999999986655 5899999999999999999999999999 Q ss_pred HHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCc Q lcl|NC_011308. 478 LDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEP 516 (530) Q Consensus 478 e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (530) |+++..+.++.+.....+ .++..+ .+++.+. + T Consensus 461 E~~~~~~~~~~~~~~~~~-----~~~~~~-~~~~~e~-e 492 (492) T protein:vir:94 461 EQMEYNKQLPNLDDGGAD-----SAQQQE-RSNNKES-E 492 (492) T ss_pred HHHHHHhhccccccccCC-----CCcccc-CCccccC-C Confidence 887666555443322211 111110 0011111 0 No 10 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=100.00 E-value=9e-103 Score=580.03 Aligned_cols=457 Identities=22% Similarity=0.304 Sum_probs=380.9 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) -..+.....++.+.++|.++|++|. .++.++.++.+||+|+|+|++|+.+.... ...+..++|+||++||+++|| T Consensus 24 ~~~~~~~~~~e~~~~~i~~~i~~~~--~~~~r~~~l~~YY~g~~~i~~~~~~~~~~---~~~~~~~~~~ki~~n~~k~Iv 98 (483) T protein:vir:12 24 DAIVRTNNKPETLEEMIVRYIKQHL--EKLPEISIGQEYYEQRPDIVKEPKPVDAT---GAVDPLKPDDRMITNFHANLV 98 (483) T ss_pred hcccccCCchhhHHHHHHHHHHHHH--HHHHHHHHHHHHhcccccccccccccccc---ccccccccccccccchHHHHH Confidence 1222334455778889999998886 45688999999999999999987554322 234567899999999999999 Q ss_pred hhhhhhhcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEc Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFD 160 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d 160 (530) ++.++||+|+||+|++. +++..+.|++++++++++.+.+++++++++|+||+++|.|++|+++++++||.++||+|| T Consensus 99 d~~~~~l~G~p~~~~~~---d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~~i~~~~p~~~~~v~d 175 (483) T protein:vir:12 99 DQKVSYIVGKPIAFKHT---DDEVVKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWT 175 (483) T ss_pred HHHhhhhcccCceeccC---ChHHHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEEEEcCCCceEEEEEcccceEEEEc Confidence 99999999999999764 456778899999899999999999999999999999999999999999999999999998 Q ss_pred CC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccccee Q lcl|NC_011308. 161 DY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAIL 238 (530) Q Consensus 161 ~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (530) ++ +++.+++|+|... ...++++|++..+++|...++...... T Consensus 176 ~~~~~~~~~~ir~~~~~----------~~~~~~~y~~~~v~~~~~~~~~~~~~~-------------------------- 219 (483) T protein:vir:12 176 DKEHEELEAFIRMYKLE----------NETKVEYWDKVTVNYYVYENGSLIPDY-------------------------- 219 (483) T ss_pred CCCCCceEEEEEEEEee----------cceEEEEEecCeEEEEEEeCCeeeecc-------------------------- Confidence 64 5789999988642 234689999999999987655322211 Q ss_pred cccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHH Q lcl|NC_011308. 239 DEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKK 318 (530) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~ 318 (530) ......+.....+|+||.||||+|+||.+|+|+|+++++||||||.++|++++.+++|++|+++++|+++++.+++.. T Consensus 220 --~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~ 297 (483) T protein:vir:12 220 --SNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKR 297 (483) T ss_pred --cccccccccccccCCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHHH Confidence 111223344567899999999999999999999999999999999999999999999999999999999988888888 Q ss_pred HHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhhHHHHHHHHHHHH Q lcl|NC_011308. 319 NIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTLLAMKAQKTEIAL 397 (530) Q Consensus 319 ~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~l~~ka~~ke~~f 397 (530) .++..+++.++++|+|+|++|+++.+++++++++|.+.||.+|++|+++++++ ||+||+||++++++|++||.++++.| T Consensus 298 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~f 377 (483) T protein:vir:12 298 LLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKA 377 (483) T ss_pred hhhhccccccCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999988 68999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_011308. 398 RKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDT 477 (530) Q Consensus 398 ~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~ 477 (530) +++|++++++|+++++... +..+++++|++++|+|+++.|++++.+ +|++|+||+++++|+++|+++|++|+++ T Consensus 378 ~~~l~~~~~li~~~~~~~~----~~~~i~v~f~~~~p~~~~~~a~~~~kl--~GiiS~et~~~~~~~v~d~~~E~~ri~~ 451 (483) T protein:vir:12 378 KVAIQELLWFVFEHFDIKG----EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQ 451 (483) T ss_pred HHHHHHHHHHHHHHhcCCC----ccceeeEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHH Confidence 9999999999999876543 567899999999999999999986655 6899999999999999999999999999 Q ss_pred HHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccC Q lcl|NC_011308. 478 LDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPE 515 (530) Q Consensus 478 e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (530) |+.+..+.++.......++..++.+ +++.+++ T Consensus 452 E~~~~~~~~~~~~~~~~d~~~~~~~------~~~~e~e 483 (483) T protein:vir:12 452 EQMEYNKQLPNLDDGGADGAQQQER------SNNKESE 483 (483) T ss_pred HHHHHHhhcccccccccCCcccCCC------CCcccCC Confidence 8876655544332222111111110 0111111 No 11 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=100.00 E-value=1.4e-102 Score=578.97 Aligned_cols=457 Identities=22% Similarity=0.305 Sum_probs=380.7 Q ss_pred CCc---------ccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCccee Q lcl|NC_011308. 1 MTN---------TLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKI 71 (530) Q Consensus 1 ~~~---------~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki 71 (530) +|. .......+...++|+++|.+|. .++.++.++.+||.|+|+|++++.+.... ...+..++|+|| T Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~r~~~l~~YY~g~~~i~~~~~~~~~~---~~~~~~~~~~ri 98 (492) T protein:vir:97 24 SQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHL--EKLPEISIGQEYYEQRPDIVKEPKPVDAT---GAVDPLKPDDRM 98 (492) T ss_pred cchhhhhHhhhcccCCCchhhHHHHHHHHHHHHH--HHHHHHHHHHHHhcccCcccccccccccc---cccccccccccc Confidence 221 1223455677888888888875 46789999999999999999887554322 223567899999 Q ss_pred ecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEec Q lcl|NC_011308. 72 SHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVD 151 (530) Q Consensus 72 ~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~ 151 (530) ++||+++||++.++||+|+||+|++. ++...+.|+.+++|++++.+.+++++++++|+||+++|.+++|+++++++| T Consensus 99 ~~n~~k~Ivd~~~~yl~g~p~~~~~~---d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~~~~~~~ 175 (492) T protein:vir:97 99 ITNFHANLVDQKVSYIVGKPIAFKHT---DDEVVKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVP 175 (492) T ss_pred ccchHHHHHHHHhhhhcccCceeccC---chHHHHHHHHHHhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEc Confidence 99999999999999999999999764 456788899998899999999999999999999999999999999999999 Q ss_pred ccceEEEEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeee Q lcl|NC_011308. 152 ALQLLPVFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAV 229 (530) Q Consensus 152 p~~~~~v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (530) |.++||+||++ +++.+++|+|... ...++++|+++.+++|...++...... T Consensus 176 p~~~~~i~d~~~~~~~~~~vr~~~~~----------~~~~~~~y~~~~v~~~~~~~~~~~~~~----------------- 228 (492) T protein:vir:97 176 AEQGIPIWTDKEHEELEAFIRMYKLE----------NETKVEYWDKVTVNYYVYENGSLIPDY----------------- 228 (492) T ss_pred ccceEEEEcCCCCCceEEEEEEEeec----------cceeEEEEecCeEEEEEEecCeeeecc----------------- Confidence 99999999864 5789999988642 234789999999999987665432111 Q ss_pred ecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC Q lcl|NC_011308. 230 ADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT 309 (530) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~ 309 (530) ......+.....+|+||.||||+|+||++|+|+|+.+++||||||.++|++++.+++|++|+++++|++ T Consensus 229 -----------~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~ 297 (492) T protein:vir:97 229 -----------SNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYD 297 (492) T ss_pred -----------cccccccccccccCCCCCcceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC Confidence 111223344567899999999999999999999999999999999999999999999999999999999 Q ss_pred CCchhhHHHHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhhHHH Q lcl|NC_011308. 310 NSPVDEIKKNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTLLAM 388 (530) Q Consensus 310 ~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~l~~ 388 (530) .++.+++..+++..+++.++++++|+|++|+++.+++++++++|++.||.+|++|++++.++ ||+||+||++++++|++ T Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ 377 (492) T protein:vir:97 298 DQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNL 377 (492) T ss_pred cccchhHHHHHhhccceecCCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHH Confidence 98888899999999999999999999999999999999999999999999999999999988 68999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCH Q lcl|NC_011308. 389 KAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDE 468 (530) Q Consensus 389 ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~ 468 (530) ||.++++.|+++|++++++|+++++... +..+++++|+|++|+|+++.|++++.+ +|++|+||+++++|+++|+ T Consensus 378 ka~~~~~~f~~~l~~~~~li~~~~~~~~----~~~~i~v~f~~~~p~~~~e~a~~~~kl--~G~iS~et~l~~l~~v~d~ 451 (492) T protein:vir:97 378 KADKLARKAKVAIQELLWFVFEHFDIKG----EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDL 451 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCc----ccceeeEEecCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCH Confidence 9999999999999999999999876543 567899999999999999999986655 6899999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccC Q lcl|NC_011308. 469 ETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPE 515 (530) Q Consensus 469 ~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (530) ++|++++++|+.+..+..+.......+...+..++ ++.+.+ T Consensus 452 ~~Eleri~~E~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~e 492 (492) T protein:vir:97 452 QAELERIEQEQTEYNKQLPNLDDGGADSAQQQERS------NNKESE 492 (492) T ss_pred HHHHHHHHHHHHHHHHhhhccccCCCCCCcccccc------cccccC Confidence 99999999988766555444333222211111110 111110 No 12 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=100.00 E-value=4.4e-102 Score=576.27 Aligned_cols=456 Identities=21% Similarity=0.297 Sum_probs=378.9 Q ss_pred CCc--------------ccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccC Q lcl|NC_011308. 1 MTN--------------TLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNA 66 (530) Q Consensus 1 ~~~--------------~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~ 66 (530) |-- ..+.....+..++|+++|++|. .+..++..+++||+|+|+|++++.+..... ..+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~---~~~~~~ 75 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHK--PKIDDITVGERYYNHDPDVLRLAPKLDNKG---EIDPLK 75 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHH--HHHHHHHHHHHHhccCCcchhccchhcccc---cccccc Confidence 111 1123344567788999999886 467899999999999999999876544322 345678 Q ss_pred CcceeecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceE Q lcl|NC_011308. 67 SNIKISHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLT 146 (530) Q Consensus 67 ~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~ 146 (530) +|+||++||+++||++.++||||+||+|++. +++..+.|++++++++.+++.++++.++++|+||+++|+|++|+++ T Consensus 76 ~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~---d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~ 152 (474) T protein:vir:96 76 PDWRMFTNYHQNLVDQKVAYAVANPVTFSSD---DDKSLKTIQEVLNHKWDDKLVDILTAASNKGIEWLQPYIDENGEFK 152 (474) T ss_pred cchhcccchHHHHHHhhhhhhcccCceeecC---chHHHHHHHHHHhcCHHHHHHHHHHHHHhcCeeEEEEEecCCCceE Confidence 9999999999999999999999999999764 4567788999999999999999999999999999999999999999 Q ss_pred EEEecccceEEEEcC--CCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccc Q lcl|NC_011308. 147 FQTVDALQLLPVFDD--YGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQ 224 (530) Q Consensus 147 ~~~~~p~~~~~v~d~--~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~ 224 (530) +++++|.++||+||+ .+++.+++|+|... ...++++||++.+++|...++......... T Consensus 153 i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~----------~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~--------- 213 (474) T protein:vir:96 153 TFRVPAEQAIPIWTNKERDTLKAFIRYYRLD----------GAERVEYWTDSDVTYYEYQDGILIPDYYHG--------- 213 (474) T ss_pred EEEEcccceEEEEcCCCCCceEEEEEEEeec----------CceEEEEEeCCeEEEEEecCCceeeccccc--------- Confidence 999999999999987 45788999988531 235789999999999987765433221110 Q ss_pred eeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceee Q lcl|NC_011308. 225 HVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYV 304 (530) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lv 304 (530) .............+|+||+||||+|+||.+|.|||+.+++||||||.++|+++|.+++|++|+|+ T Consensus 214 ---------------~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv 278 (474) T protein:vir:96 214 ---------------EEHIQSHYYVGNKRVSWGRVPFIPFKNNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYI 278 (474) T ss_pred ---------------cccccccccccccccCCCceeEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceee Confidence 01112223345678999999999999999999999999999999999999999999999999999 Q ss_pred eecCCCCchhhHHHHHhhCcceecCC-CCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHH Q lcl|NC_011308. 305 VRGGTNSPVDEIKKNIQSKKIIQTKG-EGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSR 382 (530) Q Consensus 305 l~g~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~ 382 (530) ++|+++++.+++..+++..+++.+++ +++|+||+|+++.+++++++++|++.||.+|++|++++.++ ||+||+||+++ T Consensus 279 ~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~ 358 (474) T protein:vir:96 279 LKGYEGQDLDEFMRNLKYYKAINVDGDGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFM 358 (474) T ss_pred eecCCcccccchhhhhhcCceEEecCCCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHH Confidence 99999988888889999999999875 67899999999999999999999999999999999999988 58999999999 Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhC Q lcl|NC_011308. 383 YTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIA 462 (530) Q Consensus 383 ~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~ 462 (530) ++++++||++++++|+++|++++++|+++++ ..+++.+++++|++++|.|+++.|+++ +.+|++|+||+++++ T Consensus 359 ~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~----~~~~~~~i~i~f~~~~p~~~~e~~~~~---~~ag~iS~et~~~~~ 431 (474) T protein:vir:96 359 YSNLDLKANKLKNKTLTALQELLQYIIDFYK----LNIKVQDVEITFNFNVMVNELEQSQIG---VQSQYLSKETVVTNH 431 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC----CCcccceeeEEeccCCCcCHHHHHHHH---HhcCCCchHHHHHhC Confidence 9999999999999999999999999988864 456788999999999999999998864 457999999999999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCC Q lcl|NC_011308. 463 PRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEP 512 (530) Q Consensus 463 ~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (530) |+++|+++|++|+++|+.+..+.+.....+.. +.. .+..+++. T Consensus 432 ~~v~d~~~E~~ri~~E~~e~~~~~~~~~~~~~-~~~------~d~~~e~~ 474 (474) T protein:vir:96 432 PWVDDPVAELERIEQDNIDFNKQLPPLEGDAN-GRA------QDNESETN 474 (474) T ss_pred CCCCCHHHHHHHHHHHHHHHHhcccccccccc-ccc------CCCcccCC Confidence 99999999999999998776665544433221 111 11111111 No 13 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=100.00 E-value=7.3e-102 Score=575.05 Aligned_cols=456 Identities=20% Similarity=0.311 Sum_probs=380.0 Q ss_pred CC---cccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchh Q lcl|NC_011308. 1 MT---NTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFA 77 (530) Q Consensus 1 ~~---~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k 77 (530) |+ ..-+........++|+++|.+|. .++.++.++++||+|+|+|++|..+... ....+..++|+||++||++ T Consensus 13 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~~~~~~~~YY~g~~~i~~~~~~~~~---~~~~~~~~~~~ki~~n~~k 87 (474) T protein:vir:94 13 YGEEVVEQLKPQFETQEEMIVRLIDDHR--KQLDKITVGQRYYDKDNDIVKQMKKVDV---HGNIDYDKPDWRITTNFHQ 87 (474) T ss_pred hhhHHHHhhhhcccCHHHHHHHHHHHHH--HHHHHHHHHHHHhccccchhcccchhcc---ccccccccCcceeecchHH Confidence 22 12223344456788999998885 3578999999999999999987654332 2234567899999999999 Q ss_pred hHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEE Q lcl|NC_011308. 78 ELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLP 157 (530) Q Consensus 78 ~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~ 157 (530) +||++.++||||+||+|++. ++...+.|+.++++++.+.+.++++.++++|.||+++|.|++|++++++++|.++|| T Consensus 88 ~Ivd~~~~~l~g~p~~~~~~---d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~ 164 (474) T protein:vir:94 88 NLVDQKVSYVASKPVTYSCE---DENVLKVIHDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQAIP 164 (474) T ss_pred HHHHHHHhhhhcCCceeccC---cHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcccceEE Confidence 99999999999999999764 456788889999999999999999999999999999999999999999999999999 Q ss_pred EEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccc Q lcl|NC_011308. 158 VFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDE 235 (530) Q Consensus 158 v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (530) +||++ +++.+++|+|... ...++++||++.+++|+..+++........ T Consensus 165 v~d~~~~~~~~~~ir~~~~~----------~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~-------------------- 214 (474) T protein:vir:94 165 IWVDKEREELKSFIRYYKFN----------NEEKVEFWTDTTVTYYVLENGGLIPDYYYG-------------------- 214 (474) T ss_pred EEcCCCCCceEEEEEEEEec----------CeEEEEEEeCCeEEEEEEcCCccccccccC-------------------- Confidence 99874 5788999988631 235789999999999988776543321111 Q ss_pred ceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhh Q lcl|NC_011308. 236 AILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDE 315 (530) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~ 315 (530) .........+|+||+||||+|+||..|+|+|+++++|||+||.++|+++|.+++|++|+++++|+++++.++ T Consensus 215 --------~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~ 286 (474) T protein:vir:94 215 --------ANHVQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEE 286 (474) T ss_pred --------cCcccccccccCCCccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchh Confidence 112234456899999999999999999999999999999999999999999999999999999999988888 Q ss_pred HHHHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhhHHHHHHHHH Q lcl|NC_011308. 316 IKKNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTLLAMKAQKTE 394 (530) Q Consensus 316 ~~~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~l~~ka~~ke 394 (530) +..+++..+++.++++++|+|++|+++.+++++++++|++.||.+|++|++++.++ ||+||+||+++++++++||.+++ T Consensus 287 ~~~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~ 366 (474) T protein:vir:94 287 FMRGLKYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLK 366 (474) T ss_pred hhhhhhccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHH Confidence 89999999999999999999999999999999999999999999999999999888 78999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHH Q lcl|NC_011308. 395 IALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAI 474 (530) Q Consensus 395 ~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~ 474 (530) +.|+++|++++++|+++++. ..+..+|+++|++++|.|+++.|+++.. .|++|+||+++++|+|+|+++|+++ T Consensus 367 ~~~~~~l~~~~~li~~~~~~----~~d~~~i~v~f~~~~p~~~~e~a~~~~~---~g~iS~et~l~~l~~v~D~~~E~er 439 (474) T protein:vir:94 367 NKATVAIQELISFIIDFNNL----KTDVKDIEISFNFNRMMNDAEQSQIIAQ---SQYLSRETLVKSSPLVDDYKAELER 439 (474) T ss_pred HHHHHHHHHHHHHHHHHhCC----CcccceeeEEeccCcccCHHHHHHHHHH---cCCCCHHHHHHhCCCCCCHHHHHHH Confidence 99999999999999988654 3567789999999999999999887543 5889999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCc Q lcl|NC_011308. 475 CDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEP 516 (530) Q Consensus 475 ~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (530) +++|+++..+..+.+.....+...++.++.. +..+ T Consensus 440 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~e 474 (474) T protein:vir:94 440 IEQEQMEYNKQLPNLDDGGADGAQQQEGSNN-------KESE 474 (474) T ss_pred HHHHHHHHHhhccccCCCCCCCcccCCCCcc-------cccC Confidence 9998877655554443332222221111111 1111 No 14 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=100.00 E-value=7.3e-102 Score=575.05 Aligned_cols=456 Identities=20% Similarity=0.311 Sum_probs=380.0 Q ss_pred CC---cccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchh Q lcl|NC_011308. 1 MT---NTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFA 77 (530) Q Consensus 1 ~~---~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k 77 (530) |+ ..-+........++|+++|.+|. .++.++.++++||+|+|+|++|..+... ....+..++|+||++||++ T Consensus 13 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~~~~~~~~YY~g~~~i~~~~~~~~~---~~~~~~~~~~~ki~~n~~k 87 (474) T protein:vir:97 13 YGEEVVEQLKPQFETQEEMIVRLIDDHR--KQLDKITVGQRYYDKDNDIVKQMKKVDV---HGNIDYDKPDWRITTNFHQ 87 (474) T ss_pred hhhHHHHhhhhcccCHHHHHHHHHHHHH--HHHHHHHHHHHHhccccchhcccchhcc---ccccccccCcceeecchHH Confidence 22 12223344456788999998885 3578999999999999999987654332 2234567899999999999 Q ss_pred hHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEE Q lcl|NC_011308. 78 ELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLP 157 (530) Q Consensus 78 ~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~ 157 (530) +||++.++||||+||+|++. ++...+.|+.++++++.+.+.++++.++++|.||+++|.|++|++++++++|.++|| T Consensus 88 ~Ivd~~~~~l~g~p~~~~~~---d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~ 164 (474) T protein:vir:97 88 NLVDQKVSYVASKPVTYSCE---DENVLKVIHDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQAIP 164 (474) T ss_pred HHHHHHHhhhhcCCceeccC---cHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcccceEE Confidence 99999999999999999764 456788889999999999999999999999999999999999999999999999999 Q ss_pred EEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccc Q lcl|NC_011308. 158 VFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDE 235 (530) Q Consensus 158 v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (530) +||++ +++.+++|+|... ...++++||++.+++|+..+++........ T Consensus 165 v~d~~~~~~~~~~ir~~~~~----------~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~-------------------- 214 (474) T protein:vir:97 165 IWVDKEREELKSFIRYYKFN----------NEEKVEFWTDTTVTYYVLENGGLIPDYYYG-------------------- 214 (474) T ss_pred EEcCCCCCceEEEEEEEEec----------CeEEEEEEeCCeEEEEEEcCCccccccccC-------------------- Confidence 99874 5788999988631 235789999999999988776543321111 Q ss_pred ceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhh Q lcl|NC_011308. 236 AILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDE 315 (530) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~ 315 (530) .........+|+||+||||+|+||..|+|+|+++++|||+||.++|+++|.+++|++|+++++|+++++.++ T Consensus 215 --------~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~ 286 (474) T protein:vir:97 215 --------ANHVQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEE 286 (474) T ss_pred --------cCcccccccccCCCccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchh Confidence 112234456899999999999999999999999999999999999999999999999999999999988888 Q ss_pred HHHHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhhHHHHHHHHH Q lcl|NC_011308. 316 IKKNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTLLAMKAQKTE 394 (530) Q Consensus 316 ~~~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~l~~ka~~ke 394 (530) +..+++..+++.++++++|+|++|+++.+++++++++|++.||.+|++|++++.++ ||+||+||+++++++++||.+++ T Consensus 287 ~~~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~ 366 (474) T protein:vir:97 287 FMRGLKYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLK 366 (474) T ss_pred hhhhhhccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHH Confidence 89999999999999999999999999999999999999999999999999999888 78999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHH Q lcl|NC_011308. 395 IALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAI 474 (530) Q Consensus 395 ~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~ 474 (530) +.|+++|++++++|+++++. ..+..+|+++|++++|.|+++.|+++.. .|++|+||+++++|+|+|+++|+++ T Consensus 367 ~~~~~~l~~~~~li~~~~~~----~~d~~~i~v~f~~~~p~~~~e~a~~~~~---~g~iS~et~l~~l~~v~D~~~E~er 439 (474) T protein:vir:97 367 NKATVAIQELISFIIDFNNL----KTDVKDIEISFNFNRMMNDAEQSQIIAQ---SQYLSRETLVKSSPLVDDYKAELER 439 (474) T ss_pred HHHHHHHHHHHHHHHHHhCC----CcccceeeEEeccCcccCHHHHHHHHHH---cCCCCHHHHHHhCCCCCCHHHHHHH Confidence 99999999999999988654 3567789999999999999999887543 5889999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCc Q lcl|NC_011308. 475 CDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEP 516 (530) Q Consensus 475 ~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (530) +++|+++..+..+.+.....+...++.++.. +..+ T Consensus 440 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~e 474 (474) T protein:vir:97 440 IEQEQMEYNKQLPNLDDGGADGAQQQEGSNN-------KESE 474 (474) T ss_pred HHHHHHHHHhhccccCCCCCCCcccCCCCcc-------cccC Confidence 9998877655554443332222221111111 1111 No 15 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=100.00 E-value=1.5e-101 Score=573.32 Aligned_cols=464 Identities=14% Similarity=0.132 Sum_probs=371.0 Q ss_pred CCcccccCCc--ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 1 MTNTLLTTAP--DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 1 ~~~~~~~~~~--~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) +.......+- ....+.|+++|++|.. +++++|+++.+||.|+|+|+.+.... .+..++++||++||+++ T Consensus 27 ~~~~~~~~e~~~~~~~~~i~~~i~~~~~-~~~~r~~~l~~Yy~g~~~i~~~~~~~--------~~~~~~~~ki~~n~~k~ 97 (511) T protein:vir:99 27 VVYTYDGTESDLLQNVNEVSKYIEHHMD-YQRPRLKVLSDYYEGKTKNLVELTRR--------KEEYMADNRVAHDYASY 97 (511) T ss_pred CccccchhhhhhhccHHHHHHHHHHHHH-hhHHHHHHHHHHhcccCccccccCcc--------cccccCcceeecchHHH Confidence 3333222221 1234568888988875 45678999999999999998776432 34567899999999999 Q ss_pred HHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEE Q lcl|NC_011308. 79 LVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLP 157 (530) Q Consensus 79 Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~ 157 (530) ||++.++||+|+||+|++. +++..+.|+++++ |+++.++.+++++++++|+||+++|+|++|+++++++||.++|| T Consensus 98 Iv~~~~~yl~g~p~~~~~~---d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~~~ 174 (511) T protein:vir:99 98 ISDFINGYFLGNPIQYQDD---DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFV 174 (511) T ss_pred HHHHHHhhhcccCceeecC---chHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEE Confidence 9999999999999999764 4556788888885 67999999999999999999999999999999999999999999 Q ss_pred EEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccc Q lcl|NC_011308. 158 VFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDE 235 (530) Q Consensus 158 v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (530) |||++ .++.+++|+|.....+. .....+.++++||++.+++|+..+++... T Consensus 175 vyd~~~~~~~~~~vr~~~~~~~~~--~~~~~~~~~~vyt~~~i~~~~~~~~~~~~------------------------- 227 (511) T protein:vir:99 175 IYDNTIERNSIAGVRYLRTKPIDK--TDEDEVFTVDLFTSHGVYRYLTSRTNGLK------------------------- 227 (511) T ss_pred EEcCCCCCceEEEEEEEEeeeccc--CccceEEEEEEEeCCcEEEEEecCCcccc------------------------- Confidence 99986 46889999998765443 34566789999999999999876654221 Q ss_pred ceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhh Q lcl|NC_011308. 236 AILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDE 315 (530) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~ 315 (530) .........+|+||.||||+|+||++|.|+|+++++||||||.++|+++|.+++|++|+|+++|....+.++ T Consensus 228 --------~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~ 299 (511) T protein:vir:99 228 --------LTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE 299 (511) T ss_pred --------ccccccccccCCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchh Confidence 111234456899999999999999999999999999999999999999999999999999999977665554 Q ss_pred HHHHHhhCccee-------------cCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHH Q lcl|NC_011308. 316 IKKNIQSKKIIQ-------------TKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKS 381 (530) Q Consensus 316 ~~~~~~~~~~i~-------------~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~ 381 (530) ... .+..+++. .+++++++||+|+.+.+++++++++|.+.||.+|++|++++.++ ||+||+||++ T Consensus 300 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~Alk~ 378 (511) T protein:vir:99 300 VRK-QKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKY 378 (511) T ss_pred hcc-cccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHH Confidence 332 23333332 34578899999999999999999999999999999999999988 7999999999 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHH Q lcl|NC_011308. 382 RYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGL--GDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLL 459 (530) Q Consensus 382 ~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~--~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l 459 (530) ++++|.+||+++++.|+++|++++++|++++...+. ...++.+++++|++++|.|.++.|++++.+ .|++|+||++ T Consensus 379 ~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl--~GiiS~et~l 456 (511) T protein:vir:99 379 KLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLM 456 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCCHHHHH Confidence 999999999999999999999999999999988764 345677899999999999999999987665 5899999999 Q ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCc Q lcl|NC_011308. 460 AIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEP 516 (530) Q Consensus 460 ~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (530) +++|+++|+++|++||++|+++..+.......++....+++.++.. + +.+.++++ T Consensus 457 ~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~d~~e 511 (511) T protein:vir:99 457 SLFSFFQDPELEVKKIEEDEKESIKKAQKNMYQDPRNINDDEQDDS-T-KDSIDKKE 511 (511) T ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCCCCCCC-C-cCcccccC Confidence 9999999999999999998776544333322222222222111110 1 11111111 No 16 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=100.00 E-value=1e-100 Score=568.72 Aligned_cols=464 Identities=14% Similarity=0.136 Sum_probs=369.3 Q ss_pred CCcccccCCc--ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 1 MTNTLLTTAP--DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 1 ~~~~~~~~~~--~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) ++......+- ....+.|.++|++|.. ++.++++++.+||.|+|+|+.+.... .+..++++||++||+++ T Consensus 27 ~~~~~~~~e~~~~~~~~~i~~~i~~~~~-~~~~r~~~l~~YY~g~~~i~~~~~~~--------~~~~~~~~ki~~n~~k~ 97 (512) T protein:vir:97 27 VVYTYDGTESDLLQNINEVSKYIEHHMD-YQRPRLKVLSDYYEGKTKNLVELTRR--------KEEYMADNRVAHDYASY 97 (512) T ss_pred cccccCchhhhhhhhHHHHHHHHHHHHH-hhHHHHHHHHHHhcccCccccccCcc--------cccccCcceeecchHHH Confidence 3322211111 1123667888888765 34678999999999999998776542 34567899999999999 Q ss_pred HHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEE Q lcl|NC_011308. 79 LVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLP 157 (530) Q Consensus 79 Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~ 157 (530) ||++.++||+|+||+|++. +++..+.|+++++ |+++.++.+++++++++|+||+++|++++|++++++++|.++|| T Consensus 98 Ivd~~~~yl~g~p~~~~~~---d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~~i~~~~p~~~~~ 174 (512) T protein:vir:97 98 ISDFINGYFLGNPIQCQDD---DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFV 174 (512) T ss_pred HHHHHhhhhcccCceeccC---ChHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEE Confidence 9999999999999999764 4456778888885 67999999999999999999999999999999999999999999 Q ss_pred EEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccc Q lcl|NC_011308. 158 VFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDE 235 (530) Q Consensus 158 v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (530) +||++ .++.+++|+|.....+. .....+.++++||++.+++|...+++.... T Consensus 175 iyd~~~~~~~~~~vr~~~~~~~~~--~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~------------------------ 228 (512) T protein:vir:97 175 IYDNTIERNSIAGVRYLRTKPIDK--TDEDEVFTVDLFTSHGVYRYLTSRTNGLKL------------------------ 228 (512) T ss_pred EEcCCCCCceEEEEEEEEeeeccc--cccceEEEEEEEeCCcEEEEEecCCCcccc------------------------ Confidence 99986 46889999998765443 234567889999999999998765442211 Q ss_pred ceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhh Q lcl|NC_011308. 236 AILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDE 315 (530) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~ 315 (530) ........+|+||.||||+|+||++|.|+|+++++||||||.++|+++|.+++|++|+|+++|....+..+ T Consensus 229 ---------~~~~~~~~~~~~g~vPvv~~~nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~ 299 (512) T protein:vir:97 229 ---------TPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE 299 (512) T ss_pred ---------cccccccccccCcccceEeecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchh Confidence 11224457899999999999999999999999999999999999999999999999999999987665554 Q ss_pred HHHHHhhCccee--------------cCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHH Q lcl|NC_011308. 316 IKKNIQSKKIIQ--------------TKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIK 380 (530) Q Consensus 316 ~~~~~~~~~~i~--------------~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik 380 (530) +.. .+..+++. .+++++++||+|+.+.+++++++++|.+.||.+|++|++++.++ ||+||+||+ T Consensus 300 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~Al~ 378 (512) T protein:vir:97 300 VRK-QKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMK 378 (512) T ss_pred hhh-hhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHH Confidence 432 23333332 25578899999999999999999999999999999999999988 799999999 Q ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHH Q lcl|NC_011308. 381 SRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGL--GDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNL 458 (530) Q Consensus 381 ~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~--~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~ 458 (530) +++++|.+||+++++.|+++|++++++|++++...+. ..++..+++++|+|++|.|.++.|++++.+ +|++|+||+ T Consensus 379 ~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl--~giiS~et~ 456 (512) T protein:vir:97 379 YKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTL 456 (512) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCchHHH Confidence 9999999999999999999999999999999987654 356677899999999999999999986655 589999999 Q ss_pred HHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCc Q lcl|NC_011308. 459 LAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEP 516 (530) Q Consensus 459 l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (530) ++++|+++|+++|++||++|+++..+.......+.....++ +++..+ ..+...+++ T Consensus 457 ~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~ 512 (512) T protein:vir:97 457 MSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDIND-DEQDDD-TKDTVDKKE 512 (512) T ss_pred HHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCC-CCCCCC-ccccccccC Confidence 99999999999999999988776544333222222111111 111111 111111111 No 17 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=100.00 E-value=9e-101 Score=569.08 Aligned_cols=464 Identities=14% Similarity=0.138 Sum_probs=369.9 Q ss_pred CCcccccCCcc--cHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 1 MTNTLLTTAPD--RLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 1 ~~~~~~~~~~~--~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) .+......+-+ ...+.|.++|++|.. .+.++|+++++||.|+|+|+.+.... .+..++++||++||+++ T Consensus 27 ~~~~~~~~e~~~~~~~~~i~~~i~~~~~-~~~~r~~~l~~Yy~g~~~il~~~~~~--------~~~~~~~~ki~~n~~k~ 97 (511) T protein:vir:93 27 VVYTYDGTESDLLQNVNEVSKYIEHHMD-YQRPRLKVLSDYYEGKTKNLVELTRR--------KEEYMADNRVAHDYASY 97 (511) T ss_pred CcccccchhhhhhccHHHHHHHHHHHHH-hhHHHHHHHHHHhcccCccccccCcC--------cccccCcceeecchHHH Confidence 33332222111 124568888988875 45678999999999999998776543 34467899999999999 Q ss_pred HHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEE Q lcl|NC_011308. 79 LVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLP 157 (530) Q Consensus 79 Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~ 157 (530) ||++.++||+|+||+|++. +++..+.|+++++ |+++.++.+++++++++|+||+++|+|++|+++++++||+++|| T Consensus 98 Iv~~~~~yl~g~p~~~~~~---d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~~i~~~~p~~~~~ 174 (511) T protein:vir:93 98 ISDFINGYFLGNPIQYQDD---DKDVLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFV 174 (511) T ss_pred HHHHHhhhhcccCeeeccC---ChHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEE Confidence 9999999999999999764 4556778888875 67999999999999999999999999999999999999999999 Q ss_pred EEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccc Q lcl|NC_011308. 158 VFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDE 235 (530) Q Consensus 158 v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (530) |||+. .++.+++|+|.....+. ...+.+.++++||++.+++|...+++... T Consensus 175 vydd~~~~~~~~~vr~~~~~~~~~--~~~~~~~~~~iyt~~~i~~~~~~~~~~~~------------------------- 227 (511) T protein:vir:93 175 IYDNTIERNSIAGVRYLRTKPIDK--TDEDEVFTVDLFTSHGVYRYLTSRTNGLK------------------------- 227 (511) T ss_pred EEcCCCCCceEEEEEEEEeeeccc--cccceEEEEEEEeCCcEEEEEecCCCccc------------------------- Confidence 99985 46889999997655443 23456789999999999999876543221 Q ss_pred ceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhh Q lcl|NC_011308. 236 AILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDE 315 (530) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~ 315 (530) .........+|+||.||||+|+||++|.|+|+++++||||||.++|+++|.+++|++|+|+++|..+.+.++ T Consensus 228 --------~~~~~~~~~~~~~g~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~ 299 (511) T protein:vir:93 228 --------LTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE 299 (511) T ss_pred --------cccccccccccCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchh Confidence 111223456899999999999999999999999999999999999999999999999999999987766555 Q ss_pred HHHHHhhCccee-------------cCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHH Q lcl|NC_011308. 316 IKKNIQSKKIIQ-------------TKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKS 381 (530) Q Consensus 316 ~~~~~~~~~~i~-------------~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~ 381 (530) ... .+..+++. .+++++++||+|+.+.+++++++++|.+.||.+|++|++++.++ ||+||+||++ T Consensus 300 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~ 378 (511) T protein:vir:93 300 VRK-QKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKY 378 (511) T ss_pred hcc-cccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHH Confidence 433 22333332 35578999999999999999999999999999999999999988 7999999999 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHH Q lcl|NC_011308. 382 RYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGL--GDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLL 459 (530) Q Consensus 382 ~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~--~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l 459 (530) +++++.+||+++++.|+++|++++++|+++++..+. ..++..+++++|+|++|.|.++.|+++..+ .|++|+||++ T Consensus 379 ~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~ 456 (511) T protein:vir:93 379 KLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLM 456 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHH Confidence 999999999999999999999999999999988764 345677899999999999999999986655 6899999999 Q ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCc Q lcl|NC_011308. 460 AIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEP 516 (530) Q Consensus 460 ~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (530) +++|+++|+++|++||++|+++..+.......+..++. +++++.. ...+.+.+++ T Consensus 457 ~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~~~~~ 511 (511) T protein:vir:93 457 SLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDI-NDDEQDD-DTKDTVDKKE 511 (511) T ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCCCCC-CCCCCCC-cccccccccC Confidence 99999999999999999987655433222222222111 1111111 1111111111 No 18 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=100.00 E-value=1.1e-100 Score=568.62 Aligned_cols=469 Identities=13% Similarity=0.080 Sum_probs=378.9 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) |-..|+++..+...++|.++|++|.. +..++..+.+||.|+|+|++|+.. ...++++||++||+++|| T Consensus 5 ~~~~~~~~~~~~~~~~i~~~i~~~~~--~~~~~~~l~~Yy~g~~~i~~~~~~----------~~~~~~~ki~~n~~~~Iv 72 (499) T protein:vir:10 5 IDKDLLDDVNEPNIEAINYAIRELQN--RKKRLDKLSDYYNGKQEIEKHEFD----------NATVEAANVMVNHAKYIT 72 (499) T ss_pred hhhhHHhhhhcCCHHHHHHHHHHHHH--HHHHHHHHHHHhccccchhcCCcC----------cCCCCcceeecchHHHHH Confidence 66667766555557789999998853 568899999999999999887542 345789999999999999 Q ss_pred hhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCC---------------- Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSED---------------- 143 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g---------------- 143 (530) ++.++||||+||+|++.+ ++..+.|+++++ ++++..+.++++.++++|+||+++|.+++| T Consensus 73 ~~~~~~l~g~p~~~~~~~---~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~ 149 (499) T protein:vir:10 73 DMNVGFMTGNPVKYVAEK---GKNIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPN 149 (499) T ss_pred HHHhhhhcccCceeecCC---hhHHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccccc Confidence 999999999999998654 445666777775 679999999999999999999999999987 Q ss_pred -ceEEEEecccceEEEEcCCC--CceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccc Q lcl|NC_011308. 144 -KLTFQTVDALQLLPVFDDYG--TLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNP 220 (530) Q Consensus 144 -~~~~~~~~p~~~~~v~d~~~--~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~ 220 (530) .++++.++|+++||||++.. .+.+++|+|...... ....+.++++||++.+++|.....+... T Consensus 150 ~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~----~~~~~~~~~iyt~~~i~~~~~~~~~~~~---------- 215 (499) T protein:vir:10 150 TELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLE----GNTNGYSITVYMPQRIVEYRTKTTMEVS---------- 215 (499) T ss_pred cceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecC----CCceEEEEEEEeCCeEEEEEecCCcccc---------- Confidence 46789999999999998865 467888888654322 2456789999999999999876654211 Q ss_pred cccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011308. 221 NPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAE 300 (530) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~ 300 (530) .........+|+||.||||+|+||++|.|+|+++++|||+||.++|++++.+++|++ T Consensus 216 -----------------------~~~~~~~~~~~~~g~vPvv~~~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~ 272 (499) T protein:vir:10 216 -----------------------ANDPIVYDGENLFGAVPIIEFRNNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVD 272 (499) T ss_pred -----------------------CcceecccccCCCCccceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcC Confidence 112234456899999999999999999999999999999999999999999999999 Q ss_pred ceeeeecCCCCchhhHHHHHhhCcceec--CCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHH Q lcl|NC_011308. 301 AIYVVRGGTNSPVDEIKKNIQSKKIIQT--KGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNV 377 (530) Q Consensus 301 ~~lvl~g~~~~~~~~~~~~~~~~~~i~~--~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGv 377 (530) |+++++|+..++..+....++.++++.+ +++++++||+|+.+.+++++++++|.+.||.+|++|+++++++ ||+||+ T Consensus 273 ~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~ 352 (499) T protein:vir:10 273 ALLVTFGFGLGDDKDDIQRLKRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKFMGNVSGE 352 (499) T ss_pred ceeeeecCccccccchhhhhhhcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhhcccchHH Confidence 9999999988776666666666666655 4678899999999999999999999999999999999999887 799999 Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHH Q lcl|NC_011308. 378 VIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINN 457 (530) Q Consensus 378 Aik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et 457 (530) ||+++++++++||.++++.|+++|++++++|+++++..+ ..+++.+++++|++++|.|+++.|++++.+ +|++|+|| T Consensus 353 Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~-~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et 429 (499) T protein:vir:10 353 AMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKG-ANDDASGCKISLVANIPSNLSDVVNNVKNA--DGIIPRKY 429 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-CccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHH Confidence 999999999999999999999999999999999998776 567889999999999999999999997765 68999999 Q ss_pred HHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 458 LLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 458 ~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) +++++|+++|+++|++|+++|+++..........+..++..+.++. ++.++++.+....+..-|=|- T Consensus 430 ~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~ 496 (499) T protein:vir:10 430 TYSWLPDVDNPQDVIDEMNQQDAETIKKNQEALRGQDPDRLELEDK------QDDSSENDKEAGSNHNQSHRT 496 (499) T ss_pred HHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCC------CcccCCCCCCCccccccCCCC Confidence 9999999999999999999888775544444333332222111111 111111111111111111111 No 19 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=100.00 E-value=4.3e-101 Score=570.80 Aligned_cols=459 Identities=20% Similarity=0.261 Sum_probs=380.9 Q ss_pred CCcc-----cccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCc Q lcl|NC_011308. 1 MTNT-----LLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGF 75 (530) Q Consensus 1 ~~~~-----~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~ 75 (530) .+.. .+.+...+..++|.++|++|. .++.++..+.+||.|+|+|++|+..... .......++|+||++|| T Consensus 10 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~r~~~~~~Yy~g~~~i~~~~~~~~~---~~~~~~~~~~~ki~~n~ 84 (478) T protein:vir:10 10 KPYHEQVVEQIKPKYETQEEMILRLVREHK--ENIDNITMGERYYNHHPDILDAPFKRDV---NGDYDETKPDWRMYTNY 84 (478) T ss_pred chhhhHHHHHhhhccCChHHHHHHHHHHHH--HHHHHHHHHHHHhcccccccccchhhhc---ccccccccccceeccch Confidence 1111 233455677889999999885 4678999999999999999988655322 33445678999999999 Q ss_pred hhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccce Q lcl|NC_011308. 76 FAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQL 155 (530) Q Consensus 76 ~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~ 155 (530) +++||++.++||||+||+|+++ +++..+.|+.++++++++.+.++++.++++|.+|+++|+|++|++++++++|.++ T Consensus 85 ~k~ivd~~~~yl~g~p~~~~~~---~~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~~~~~~p~~~ 161 (478) T protein:vir:10 85 HQNLVDQKVAYAVANPVTFGVD---NDKALKQIQHTLNHKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFRVPAEQA 161 (478) T ss_pred HHHHHHHHhhhhcccCceeecC---ChHHHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEEcccce Confidence 9999999999999999999764 4557788899999999999999999999999999999999999999999999999 Q ss_pred EEEEcC--CCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccc Q lcl|NC_011308. 156 LPVFDD--YGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGV 233 (530) Q Consensus 156 ~~v~d~--~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (530) ||+||+ .+++.+++|+|... ...++++||++.+++|+..++......... T Consensus 162 ~~v~d~~~~~~~~~~ir~~~~~----------~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~------------------ 213 (478) T protein:vir:10 162 VPIWTNKERDELQAFIRVYELD----------GAERVEYWTKDDVTFYELKEGQLIPDFYRS------------------ 213 (478) T ss_pred EEEEcCCCCCceEEEEEEEeee----------CceEEEEEeCCcEEEEEecCCeeecccccc------------------ Confidence 999986 46788999988542 235789999999999988765432211110 Q ss_pred ccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCch Q lcl|NC_011308. 234 DEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPV 313 (530) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~ 313 (530) .............+|+||+||||+|+||+.|.|+|+++++|||+||.++|+++|.+++|++|+++++|+++++. T Consensus 214 ------~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~ 287 (478) T protein:vir:10 214 ------EDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDM 287 (478) T ss_pred ------ccccccceecccccccCCcceEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccc Confidence 00111122344568999999999999999999999999999999999999999999999999999999999888 Q ss_pred hhHHHHHhhCcceecC--CCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhhHHHHH Q lcl|NC_011308. 314 DEIKKNIQSKKIIQTK--GEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTLLAMKA 390 (530) Q Consensus 314 ~~~~~~~~~~~~i~~~--~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~l~~ka 390 (530) +++..+++..+++.+. ++|+|+|++|+++.+++++++++|++.||.+|++|++++.++ ||+||+||+++|++|++|| T Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~ 367 (478) T protein:vir:10 288 KDFMHNLKYYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKA 367 (478) T ss_pred cchhhhhhhCceeEecCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCccccccchHHHHHHHHHHHHHHHH Confidence 8889999999998874 568999999999999999999999999999999999999988 6899999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHH Q lcl|NC_011308. 391 QKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEET 470 (530) Q Consensus 391 ~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~ 470 (530) .++++.|+++|++++++|+++++ .+++..+|+++|++++|+|+++.|++++. .+|++|+||+++++|+|+|+++ T Consensus 368 ~~~~~~~~~~l~~~~~li~~~~~----~~~d~~~i~i~f~~~~p~~~~e~~~~~~~--~~g~iS~et~i~~~~~v~d~~~ 441 (478) T protein:vir:10 368 NKLKNKTLTALQELLQYIIDFYR----LDVRVQDIEITFNFNVMVNELENSQIAMN--STGLLSKETILGNHSWVQDPVA 441 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHhC----CCcccccceEEeCCCCCCCHHHHHHHHHH--HhCCCChHHHHHhCCCCCCHHH Confidence 99999999999999999988764 45788899999999999999999998654 4789999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCC Q lcl|NC_011308. 471 LKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTI 510 (530) Q Consensus 471 e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (530) |++++++|+.+.........++..++.++.++ +.+++ T Consensus 442 E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~---d~~~e 478 (478) T protein:vir:10 442 EMERIEQENIELNQQLPDIEEGLNDEQQRQSE---DNQSE 478 (478) T ss_pred HHHHHHHHHHHHHHhccccCCCCcccccccCc---CCCCC Confidence 99999999887665554444433221111111 11111 No 20 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=100.00 E-value=1.2e-100 Score=568.37 Aligned_cols=456 Identities=21% Similarity=0.321 Sum_probs=380.9 Q ss_pred CCccc---------------ccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhccccccccccccccccc Q lcl|NC_011308. 1 MTNTL---------------LTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDN 65 (530) Q Consensus 1 ~~~~~---------------~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~ 65 (530) ||+++ +.+......++|+++|++|. .++.++.++++||.|+|+|++|..+... ....+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~~~~~~~~Yy~g~~~i~~r~~~~~~---~~~~~~~ 75 (474) T protein:vir:95 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIIRLIDDHR--KQLDKITVGQRYYDKDNDIVKQMKKVDV---YGNIDYD 75 (474) T ss_pred CcceeecCCCCchhhHHHHhhhhccCChHHHHHHHHHHHH--HHHHHHHHHHHHhcccCchhcccccccc---ccccccc Confidence 55544 33344556678888888875 4678899999999999999988655332 2233457 Q ss_pred CCcceeecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCce Q lcl|NC_011308. 66 ASNIKISHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKL 145 (530) Q Consensus 66 ~~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~ 145 (530) ++|+||++||+++||++.++||||+||+|++. ++...+.|+.+++++++..+.+++++++++|.||+++|+|++|++ T Consensus 76 ~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~---d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~ 152 (474) T protein:vir:95 76 KPDWRITTNFHQNLVDQKVSYVASKPVTYSCE---DESVLKIIHDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEM 152 (474) T ss_pred cccceeccchHHHHHHHHHhhhccCCceeccC---chHHHHHHHHHHhccHHHHHHHHHHHHhhcCcEEEEEEecCCCce Confidence 89999999999999999999999999999764 456778899999999999999999999999999999999999999 Q ss_pred EEEEecccceEEEEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhcccccccccc Q lcl|NC_011308. 146 TFQTVDALQLLPVFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPS 223 (530) Q Consensus 146 ~~~~~~p~~~~~v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~ 223 (530) ++++++|.++||||++. +++.+++++|... ...++++||++.+++|+..+++........ T Consensus 153 ~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~----------~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~-------- 214 (474) T protein:vir:95 153 KLFRVPAEQAIPIWVDKEREELKSFIRYYKFN----------NEEKVEFWTDTTVTYYVLENGGLIPDYYYG-------- 214 (474) T ss_pred EEEEEcccceEEEEcCCCCCceEEEEEEEEEc----------CeeEEEEEeCCeEEEEEEcCCccccccccC-------- Confidence 99999999999999874 5788888888532 235789999999999998776543322211 Q ss_pred ceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcccee Q lcl|NC_011308. 224 QHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIY 303 (530) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~l 303 (530) .........+|+||.||||+|+||..|+|+|+++++|||+||.++|++++.+++|++|+| T Consensus 215 --------------------~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~l 274 (474) T protein:vir:95 215 --------------------ANHIQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIY 274 (474) T ss_pred --------------------cccccccccccCCCccceEeecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 111223456799999999999999999999999999999999999999999999999999 Q ss_pred eeecCCCCchhhHHHHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHH Q lcl|NC_011308. 304 VVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSR 382 (530) Q Consensus 304 vl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~ 382 (530) +++|+++++.+++..+++..+++.++++|+|+|++|+++.+++++++++|.+.||.+|++|++++.++ ||+||+||+++ T Consensus 275 v~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~~ 354 (474) T protein:vir:95 275 ILKGYEGQDLEEFMRGLKYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIALKFL 354 (474) T ss_pred eeecCCcccchhhhhhhhccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHH Confidence 99999998888888999999999999999999999999999999999999999999999999998888 68999999999 Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhC Q lcl|NC_011308. 383 YTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIA 462 (530) Q Consensus 383 ~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~ 462 (530) ++++++||+++++.|+++|++++++|+++++. .++..+|+++|++++|+|+++.|+++. ..|++|+||+++++ T Consensus 355 ~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~----~~d~~~i~v~f~~~~p~d~~e~a~~~~---~~g~iS~et~i~~l 427 (474) T protein:vir:95 355 YGNLDLKANKLKNKATVAIQELIGFIIDFNNL----KMDVKDIEISFNFNRMMNDAEQSQIIA---QSQYLSRETLVKSS 427 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CcccceeeEEeccCCCcCHHHHHHHHH---hcCCCchHHHHHhC Confidence 99999999999999999999999999888643 467889999999999999999998654 35899999999999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCc Q lcl|NC_011308. 463 PRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQ 513 (530) Q Consensus 463 ~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (530) |+++|+++|++|+++|+.+....+........+..++..++ +.++++ T Consensus 428 ~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~----~~~~~~ 474 (474) T protein:vir:95 428 PLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQERS----NDKESE 474 (474) T ss_pred CCCCCHHHHHHHHHHHHHHHHhcccccccccCCCCcCCCCC----ccCCCC Confidence 99999999999999888776555544443322221111111 111111 No 21 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=100.00 E-value=1.2e-100 Score=568.38 Aligned_cols=457 Identities=19% Similarity=0.204 Sum_probs=378.0 Q ss_pred CCccccc---CCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccch---hhhccccc----ccccccccccccCCcce Q lcl|NC_011308. 1 MTNTLLT---TAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDND---IENTRIMW----MNDHGDIVEDDNASNIK 70 (530) Q Consensus 1 ~~~~~~~---~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~---I~~r~~~~----~~~~~~~~~~~~~~n~k 70 (530) ||..-.. +..+...+.|+++|++|. ..+.++..+.+||+|.++ ++.|+... +...+...+...+||+| T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~--~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k 78 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIESHK--DDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNK 78 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhh--hhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccc Confidence 5433222 222334467888888874 356788999999998764 44443221 22233345567789999 Q ss_pred eecCchhhHHhhhhhhhcccceeeecCCc--chHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEE Q lcl|NC_011308. 71 ISHGFFAELVDQKTQYLLANGIDVKPTDH--DDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTF 147 (530) Q Consensus 71 i~~n~~k~Ivd~~~~yl~G~pv~~~~~~~--~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~ 147 (530) |++||+++||++.++||||+||+|++.++ .++.+.+.|+++++ ++++.++.+++++++++|+||+++|.+++|++++ T Consensus 79 i~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~~ 158 (474) T protein:vir:94 79 LNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDIRI 158 (474) T ss_pred cccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCeeEE Confidence 99999999999999999999999988543 46778899999985 5799999999999999999999999999999999 Q ss_pred EEecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceee Q lcl|NC_011308. 148 QTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVL 227 (530) Q Consensus 148 ~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (530) ++++|.++|||||+++++.+++|+|.... ......++++++||+..++.|...+.+. T Consensus 159 ~~i~p~~~~~v~d~~~~~~~~i~~~~~~~----~~~~~~~~~~~~y~~~~~~~~~~~~~~~------------------- 215 (474) T protein:vir:94 159 KNIDPYNVIFVGDNILEPTYSLRYFYEKD----DDNGTDYVYAEFYDNAYYYVFRGEGIDA------------------- 215 (474) T ss_pred EEEcccceEEEEcCCCceEEEEEEEEEee----CCCceEEEEEEEEcCceEEEEeecCCCc------------------- Confidence 99999999999999999999999987643 2245667889999999999887543221 Q ss_pred eeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeec Q lcl|NC_011308. 228 AVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRG 307 (530) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g 307 (530) .......+|+||.||||+|+||.+|.|+|+++++|||+||.++|+++|.+++|++|+|+++| T Consensus 216 ------------------~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g 277 (474) T protein:vir:94 216 ------------------LQEVGRYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRG 277 (474) T ss_pred ------------------ccccccccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcc Confidence 11233457999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCchhhHHHHHhhCcceec-CCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhh Q lcl|NC_011308. 308 GTNSPVDEIKKNIQSKKIIQT-KGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTL 385 (530) Q Consensus 308 ~~~~~~~~~~~~~~~~~~i~~-~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~ 385 (530) +++++ +...+++..+++.+ +++++|+||+|+++.+++++++++|++.||.+|++|++++.++ ||+||+||++++++ T Consensus 278 ~~~~~--~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~ 355 (474) T protein:vir:94 278 MGMSE--EMIQETQKSGAFELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMA 355 (474) T ss_pred CCCCc--hhhhhhhhcceeEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHH Confidence 87764 44566777778766 6789999999999999999999999999999999999999887 79999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCC Q lcl|NC_011308. 386 LAMKAQKTEIALRKTLRWTADLVVEDIRRRGL--GDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAP 463 (530) Q Consensus 386 l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~--~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~ 463 (530) |++||+++++.|+++|++++++|+++++.++. .++++.+++++|++++|.|+++.|++++++ .|++|+||+++++| T Consensus 356 l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~ 433 (474) T protein:vir:94 356 LENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL--KGQVSERTRLGQSQ 433 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCC Confidence 99999999999999999999999999998754 466778999999999999999999987665 58999999999999 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCc Q lcl|NC_011308. 464 RIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQ 513 (530) Q Consensus 464 ~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (530) +|+|+++|++++++|+++..+..+...++..++.+++++. + T Consensus 434 ~v~d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~~~~s---------~ 474 (474) T protein:vir:94 434 LVDDVDYELDEMEKESLEFNDKLPDIDEGDANDKSQNNQS---------E 474 (474) T ss_pred CCCCHHHHHHHHHHHHHHHHhhcccccCCCcCCCCccccC---------C Confidence 9999999999999988777665555444432222211111 0 No 22 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=100.00 E-value=1.2e-100 Score=568.38 Aligned_cols=457 Identities=19% Similarity=0.204 Sum_probs=378.0 Q ss_pred CCccccc---CCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccch---hhhccccc----ccccccccccccCCcce Q lcl|NC_011308. 1 MTNTLLT---TAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDND---IENTRIMW----MNDHGDIVEDDNASNIK 70 (530) Q Consensus 1 ~~~~~~~---~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~---I~~r~~~~----~~~~~~~~~~~~~~n~k 70 (530) ||..-.. +..+...+.|+++|++|. ..+.++..+.+||+|.++ ++.|+... +...+...+...+||+| T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~--~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k 78 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIESHK--DDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNK 78 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhh--hhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccc Confidence 5433222 222334467888888874 356788999999998764 44443221 22233345567789999 Q ss_pred eecCchhhHHhhhhhhhcccceeeecCCc--chHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEE Q lcl|NC_011308. 71 ISHGFFAELVDQKTQYLLANGIDVKPTDH--DDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTF 147 (530) Q Consensus 71 i~~n~~k~Ivd~~~~yl~G~pv~~~~~~~--~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~ 147 (530) |++||+++||++.++||||+||+|++.++ .++.+.+.|+++++ ++++.++.+++++++++|+||+++|.+++|++++ T Consensus 79 i~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~~ 158 (474) T protein:vir:10 79 LNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDIRI 158 (474) T ss_pred cccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCeeEE Confidence 99999999999999999999999988543 46778899999985 5799999999999999999999999999999999 Q ss_pred EEecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceee Q lcl|NC_011308. 148 QTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVL 227 (530) Q Consensus 148 ~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (530) ++++|.++|||||+++++.+++|+|.... ......++++++||+..++.|...+.+. T Consensus 159 ~~i~p~~~~~v~d~~~~~~~~i~~~~~~~----~~~~~~~~~~~~y~~~~~~~~~~~~~~~------------------- 215 (474) T protein:vir:10 159 KNIDPYNVIFVGDNILEPTYSLRYFYEKD----DDNGTDYVYAEFYDNAYYYVFRGEGIDA------------------- 215 (474) T ss_pred EEEcccceEEEEcCCCceEEEEEEEEEee----CCCceEEEEEEEEcCceEEEEeecCCCc------------------- Confidence 99999999999999999999999987643 2245667889999999999887543221 Q ss_pred eeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeec Q lcl|NC_011308. 228 AVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRG 307 (530) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g 307 (530) .......+|+||.||||+|+||.+|.|+|+++++|||+||.++|+++|.+++|++|+|+++| T Consensus 216 ------------------~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g 277 (474) T protein:vir:10 216 ------------------LQEVGRYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRG 277 (474) T ss_pred ------------------ccccccccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcc Confidence 11233457999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCchhhHHHHHhhCcceec-CCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhh Q lcl|NC_011308. 308 GTNSPVDEIKKNIQSKKIIQT-KGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTL 385 (530) Q Consensus 308 ~~~~~~~~~~~~~~~~~~i~~-~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~ 385 (530) +++++ +...+++..+++.+ +++++|+||+|+++.+++++++++|++.||.+|++|++++.++ ||+||+||++++++ T Consensus 278 ~~~~~--~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~ 355 (474) T protein:vir:10 278 MGMSE--EMIQETQKSGAFELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMA 355 (474) T ss_pred CCCCc--hhhhhhhhcceeEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHH Confidence 87764 44566777778766 6789999999999999999999999999999999999999887 79999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCC Q lcl|NC_011308. 386 LAMKAQKTEIALRKTLRWTADLVVEDIRRRGL--GDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAP 463 (530) Q Consensus 386 l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~--~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~ 463 (530) |++||+++++.|+++|++++++|+++++.++. .++++.+++++|++++|.|+++.|++++++ .|++|+||+++++| T Consensus 356 l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~ 433 (474) T protein:vir:10 356 LENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL--KGQVSERTRLGQSQ 433 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCC Confidence 99999999999999999999999999998754 466778999999999999999999987665 58999999999999 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCc Q lcl|NC_011308. 464 RIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQ 513 (530) Q Consensus 464 ~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (530) +|+|+++|++++++|+++..+..+...++..++.+++++. + T Consensus 434 ~v~d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~~~~s---------~ 474 (474) T protein:vir:10 434 LVDDVDYELDEMEKESLEFNDKLPDIDEGDANDKSQNNQS---------E 474 (474) T ss_pred CCCCHHHHHHHHHHHHHHHHhhcccccCCCcCCCCccccC---------C Confidence 9999999999999988777665555444432222211111 0 No 23 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=100.00 E-value=7.4e-100 Score=564.04 Aligned_cols=464 Identities=14% Similarity=0.129 Sum_probs=369.3 Q ss_pred CCcccccCCcc--cHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 1 MTNTLLTTAPD--RLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 1 ~~~~~~~~~~~--~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) ++......+-+ ...+.|+++|++|.. ++.++++++++||.|+|+|+.+.... .+..++++||++||+++ T Consensus 27 ~~~~~~~~e~~~~~~~~~i~~~i~~~~~-~~~~r~~~l~~Yy~g~~~i~~~~~~~--------~~~~~~~~ki~~n~~k~ 97 (511) T protein:vir:96 27 VVYTYDGTESDLLQNVNEVSKYIEHHMD-YQRPRLKVLSDYYEGKTKNLVELTRR--------KEEYMADNRVAHDYASY 97 (511) T ss_pred CccccchhhhhhhccHHHHHHHHHHHHH-hhHHHHHHHHHHhcccCccccccCcC--------cccccCcceeecchHHH Confidence 33333222221 134568888988875 44678999999999999998776442 34567899999999999 Q ss_pred HHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEE Q lcl|NC_011308. 79 LVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLP 157 (530) Q Consensus 79 Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~ 157 (530) ||++.++||+|+||+|++. +++..+.|+++++ |+++.++.+++++++++|+||+++|+|++|++++++++|.++|| T Consensus 98 Iv~~~~~yl~g~p~~~~~~---~~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~~~ 174 (511) T protein:vir:96 98 ISDFINGYFLGNPIQYQDD---DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFV 174 (511) T ss_pred HHHHHHhhhccCCceeecC---chHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEE Confidence 9999999999999999764 4456778888885 67999999999999999999999999999999999999999999 Q ss_pred EEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccc Q lcl|NC_011308. 158 VFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDE 235 (530) Q Consensus 158 v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (530) |||++ .++.+++|+|.....+. .....+.++++||++.+++|...+++... T Consensus 175 vydd~~~~~~~~~vr~~~~~~~d~--~~~~~~~~~~iyt~~~i~~~~~~~~~~~~------------------------- 227 (511) T protein:vir:96 175 IYDNTIERNSIAGVRYLRTKPIDK--TDEDEVFTVDLFTSHGVYRYLTSRTNGLK------------------------- 227 (511) T ss_pred EEcCCCCCceEEEEEEEEeeeccc--cccceEEEEEEEeCCcEEEEEecCCCccc------------------------- Confidence 99986 46889999987655443 23456788999999999999876554221 Q ss_pred ceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhh Q lcl|NC_011308. 236 AILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDE 315 (530) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~ 315 (530) .........+|+||.||||+|+||++|.|+|+++++||||||.++|+++|.+++|++|+|+++|....+.++ T Consensus 228 --------~~~~~~~~~~~~~~~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~ 299 (511) T protein:vir:96 228 --------LTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE 299 (511) T ss_pred --------ccccccccccccCCceeeEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchh Confidence 111223456899999999999999999999999999999999999999999999999999999977665555 Q ss_pred HHHHHhhCccee-------------cCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHH Q lcl|NC_011308. 316 IKKNIQSKKIIQ-------------TKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKS 381 (530) Q Consensus 316 ~~~~~~~~~~i~-------------~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~ 381 (530) +... ...+++. .+++++++||+|+.+.+++++++++|.+.||.+|++|++++.++ ||+||+||++ T Consensus 300 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~ 378 (511) T protein:vir:96 300 VRKQ-KEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKY 378 (511) T ss_pred hccc-ccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHH Confidence 4332 2233332 34578899999999999999999999999999999999999988 7999999999 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHH Q lcl|NC_011308. 382 RYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGL--GDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLL 459 (530) Q Consensus 382 ~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~--~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l 459 (530) +++++.+||+++++.|+++|++++++|+++++..+. ..++..+++++|+|++|.|.++.|++++.+ +|++|+||++ T Consensus 379 ~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~G~iS~et~l 456 (511) T protein:vir:96 379 KLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLM 456 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHH Confidence 999999999999999999999999999999988754 356678899999999999999999986654 6899999999 Q ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCc Q lcl|NC_011308. 460 AIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEP 516 (530) Q Consensus 460 ~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (530) +++|+++|+++|++||++|+++..+.......+.. +..+++++.. ..++.+.+++ T Consensus 457 ~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~-~~~~~~~~~~-~~~~~~~~~~ 511 (511) T protein:vir:96 457 SLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDP-RDINDDEQDD-DTKDTVDKKE 511 (511) T ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCC-CCCCCCCCCC-cccccccccC Confidence 99999999999999999887654333222221111 1111111111 1111122211 No 24 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=100.00 E-value=1.3e-99 Score=562.72 Aligned_cols=464 Identities=14% Similarity=0.125 Sum_probs=370.7 Q ss_pred CCcccccCCcc--cHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 1 MTNTLLTTAPD--RLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 1 ~~~~~~~~~~~--~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) ++......+-+ ...+.|+++|++|.. .+.++++++.+||.|+|+|+.+.... .+..++++||++||+++ T Consensus 27 ~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~~~~r~~~l~~Yy~g~~~i~~~~~~~--------~~~~~~~~ki~~n~~k~ 97 (511) T protein:vir:10 27 VVYTYDGTESDLLQNVNEVSKCIEHHMD-YQRPRLKVLSDYYEGKTKNLVELTRR--------KEEYMADNRVAHDYASY 97 (511) T ss_pred CCccCchhhhhcccCHHHHHHHHHHHHH-hhHHHHHHHHHHhcccCccccccCcc--------cccccCcceeecchHHH Confidence 44443322221 234568888988765 34678999999999999998876542 34567899999999999 Q ss_pred HHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEE Q lcl|NC_011308. 79 LVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLP 157 (530) Q Consensus 79 Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~ 157 (530) ||++.++||+|+||+|++. +++..+.|+++++ |+++.++.+++++++++|+||+++|+|++|+++++++||.++|| T Consensus 98 Iv~~~~~yl~g~p~~~~~~---d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~~i~~~~p~~~~~ 174 (511) T protein:vir:10 98 ISDFINGYFLGNPIQYQDD---DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQDDETRLYKSDAMSTFV 174 (511) T ss_pred HHHHHhhhhcccCceeecC---chHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEE Confidence 9999999999999999764 4456788888885 67999999999999999999999999999999999999999999 Q ss_pred EEcCCC--CceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccc Q lcl|NC_011308. 158 VFDDYG--TLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDE 235 (530) Q Consensus 158 v~d~~~--~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (530) |||++. ++.+++|+|.....+.. ..+.+.++++||++.+++|...+++... T Consensus 175 vydd~~~~~~~~~vr~~~~~~~d~~--~~~~~~~~~iyt~~~i~~~~~~~~~~~~------------------------- 227 (511) T protein:vir:10 175 IYDNTIERNSIAGVRYLRTKPIDKT--DEDEVFTVDLFTSHGVYRYLTSRTNGLK------------------------- 227 (511) T ss_pred EEcCCCCCceEEEEEEEEeeecccC--ccceEEEEEEEeCCcEEEEEecCCCccc------------------------- Confidence 999864 58899999987654432 3456788999999999999876554221 Q ss_pred ceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhh Q lcl|NC_011308. 236 AILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDE 315 (530) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~ 315 (530) .........+|+||.||||+|+||++|.|+|+++++|||+||.++|+++|.+++|++|+++++|....+.++ T Consensus 228 --------~~~~~~~~~~~~~~~vPvv~f~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~ 299 (511) T protein:vir:10 228 --------LTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE 299 (511) T ss_pred --------ccccccccccccCcceeEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchh Confidence 111223456899999999999999999999999999999999999999999999999999999977665554 Q ss_pred HHHHHhhCccee-------------cCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHH Q lcl|NC_011308. 316 IKKNIQSKKIIQ-------------TKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKS 381 (530) Q Consensus 316 ~~~~~~~~~~i~-------------~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~ 381 (530) ... .+..+++. .+++++++||+|+.+.+++++++++|.+.||.+|++|++++.++ ||+||+||++ T Consensus 300 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~ 378 (511) T protein:vir:10 300 VRK-QKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKY 378 (511) T ss_pred hcc-chhccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHH Confidence 432 23333333 24468899999999999999999999999999999999999988 7999999999 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHH Q lcl|NC_011308. 382 RYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGL--GDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLL 459 (530) Q Consensus 382 ~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~--~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l 459 (530) +++++++||.++++.|+++|++++++|++++...+. ..++..+++++|+|++|.|.++.|++++.+ .|++|+||++ T Consensus 379 ~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~G~iS~et~~ 456 (511) T protein:vir:10 379 KLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLM 456 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHH Confidence 999999999999999999999999999999987654 456778999999999999999999987666 5889999999 Q ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCc Q lcl|NC_011308. 460 AIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEP 516 (530) Q Consensus 460 ~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (530) +++|+++|+++|++||++|+++..+.......+... ..+++++.. ...+.+.+++ T Consensus 457 ~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~-~~~~~~~~~-~~~~~~~~~~ 511 (511) T protein:vir:10 457 SLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPR-DINDDEQDD-DTKDTVDKKE 511 (511) T ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCCC-CCCCCCCCC-cccCcccccC Confidence 999999999999999998876543332222221111 111111111 1112222222 No 25 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=100.00 E-value=5.9e-100 Score=564.58 Aligned_cols=457 Identities=19% Similarity=0.256 Sum_probs=378.5 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) +..+ ...-.+..++|.++|.+|. .++.++..+.+||+|+|+|+.++.+.. +....+..++|+||++||+++|| T Consensus 17 ~~~~--~~~~~~~~~~i~~~i~~~~--~~~~~~~~~~~yY~g~~~i~~~~~~~~---~~~~~~~~~~~~ki~~n~~~~iv 89 (478) T protein:vir:10 17 VEQI--KPKYETQEEMILRLVREHK--ENIDNITMGERYYNHHPDILDAPPKRD---VNGDYDETKPDWRMYTNYHQNLV 89 (478) T ss_pred HHHH--hhccCCcHHHHHHHHHHHH--HHHHHHHHHHHHhcCCCchhccccccc---cccccccccccceeccchHHHHH Confidence 1111 1222346788999998886 456889999999999999998765432 23344567899999999999999 Q ss_pred hhhhhhhcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEc Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFD 160 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d 160 (530) ++.++||||+||+|+++ +++..+.|+.++++++.+++.+++++++++|+||+++|.|++|++++++++|.++||+|| T Consensus 90 d~~~~~l~g~~~~~~~~---~d~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~~~~~~~p~~~~~i~d 166 (478) T protein:vir:10 90 DQKVAYAVANPVTFGVD---NDKALKQIQHTLNHKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFRVPAEQAVPIWT 166 (478) T ss_pred HHHHhhhccCCeeeecC---ChHHHHHHHHHHhcCHHHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEEc Confidence 99999999999999764 445677888999999999999999999999999999999999999999999999999998 Q ss_pred CC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccccee Q lcl|NC_011308. 161 DY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAIL 238 (530) Q Consensus 161 ~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (530) +. +++.+++|+|... ...++++||++.+++|+..++......... T Consensus 167 ~~~~~~~~~~v~~~~~~----------~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~----------------------- 213 (478) T protein:vir:10 167 NKERDELQAFIRVYELD----------GAERVEYWTKDDVTYYELKEGQLIPDFYRS----------------------- 213 (478) T ss_pred CCCCCceEEEEEEEEec----------CceEEEEEeCCeEEEEEEcCCeeecccccc----------------------- Confidence 64 5788999988532 245789999999999987665432211110 Q ss_pred cccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHH Q lcl|NC_011308. 239 DEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKK 318 (530) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~ 318 (530) .............+|+||.||||+|+||.+|.|+|+++++|||+||.++|++++.+++|++|+++++|+++++.+++.. T Consensus 214 -~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~~~ 292 (478) T protein:vir:10 214 -DDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMH 292 (478) T ss_pred -ccccccceecccccccCCccceEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhhh Confidence 0111122234456899999999999999999999999999999999999999999999999999999999988888888 Q ss_pred HHhhCcceecC--CCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhhHHHHHHHHHH Q lcl|NC_011308. 319 NIQSKKIIQTK--GEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTLLAMKAQKTEI 395 (530) Q Consensus 319 ~~~~~~~i~~~--~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~l~~ka~~ke~ 395 (530) +++..+++.+. ++|+|+|++|+++.++++.++++|++.||.+|++|++++.++ ||+||+||++++++|++||+++++ T Consensus 293 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~ 372 (478) T protein:vir:10 293 NLKYYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKN 372 (478) T ss_pred hhhhcceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 99999998875 568999999999999999999999999999999999999998 689999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHH Q lcl|NC_011308. 396 ALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAIC 475 (530) Q Consensus 396 ~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~ 475 (530) .|+++|++++++|+++++ .+++..+|+++|++++|+|+++.|++++.+ +|++|+||+++++|+++|+++|++++ T Consensus 373 ~~~~~l~~~~~li~~~~g----~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri 446 (478) T protein:vir:10 373 KTLTALQELLQYIIDFYR----LDVKVQDIEITFNFNVMVNELENSQIAMNS--TGLLSKETILSNHAWVEDPVAEMERI 446 (478) T ss_pred HHHHHHHHHHHHHHHHhC----CCcccccceEEecCCCCCCHHHHHHHHHHH--hCCCChHHHHHhCCCCCCHHHHHHHH Confidence 999999999999988763 457888999999999999999999986654 78999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhhccccccCCccccCCCCCCCC Q lcl|NC_011308. 476 DTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTI 510 (530) Q Consensus 476 e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (530) ++|+.+..+.+..+..+...+.+.+.+. .+++ T Consensus 447 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 478 (478) T protein:vir:10 447 EQENIELNQQLPDIEEGLNGEQQRQSEN---NQPE 478 (478) T ss_pred HHHHHHHHhhccccccccCCCCCCCCCC---CCCC Confidence 9988776665555544332211111110 1111 No 26 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=100.00 E-value=3.9e-99 Score=560.10 Aligned_cols=464 Identities=14% Similarity=0.124 Sum_probs=369.7 Q ss_pred CCcccccCCc--ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 1 MTNTLLTTAP--DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 1 ~~~~~~~~~~--~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) ++......+. +...+.|.++|++|... +.++++++.+||.|+|+|+.+.... .+..++++||++||+++ T Consensus 27 ~~~~~~~~e~~~~~~~~~i~~~i~~~~~~-~~~r~~~l~~Yy~g~~~il~~~~~~--------~~~~~~~~ki~~n~~k~ 97 (511) T protein:vir:78 27 VVYTYDGTESDLLQNVNEVSKYIEHHMDY-QRPRLKVLSDYYEGKTKNLVELTRR--------KEEYMADNRVAHDYASY 97 (511) T ss_pred CcccccchhhhhhcCHHHHHHHHHHHHHh-hhHHHHHHHHHhhccCccccccCcc--------cccccCcceeecchHHH Confidence 3333322222 22345688889888754 4578999999999999998776532 34567899999999999 Q ss_pred HHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEE Q lcl|NC_011308. 79 LVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLP 157 (530) Q Consensus 79 Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~ 157 (530) ||++.++||+|+||+|++. +++..+.|+.+++ |+++.++.++++.++++|+||+++|+|++|++++++++|.++|| T Consensus 98 Iv~~~~~yl~g~p~~~~~~---d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~ 174 (511) T protein:vir:78 98 ISDFINGYFLGNPIQYQDD---DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFI 174 (511) T ss_pred HHHHHhhhhcccCceeecC---chHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEE Confidence 9999999999999999764 4556778888885 67999999999999999999999999999999999999999999 Q ss_pred EEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccc Q lcl|NC_011308. 158 VFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDE 235 (530) Q Consensus 158 v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (530) |||++ .++.+++|+|.....+. ...+.+.++++||++.+++|...+++... T Consensus 175 v~dd~~~~~~~~~vr~~~~~~~~~--~~~~~~~~~~vyt~~~i~~~~~~~~~~~~------------------------- 227 (511) T protein:vir:78 175 IYDNTVERNSIAGVRYLRTKPIDK--TDEDEVFTVDLFTSHGVYRYLTNRTNGLK------------------------- 227 (511) T ss_pred EEcCCCCCceEEEEEEEEeeeccc--cccceEEEEEEEeCCcEEEEEecCCCccc------------------------- Confidence 99985 46889999998765443 23466789999999999999876654321 Q ss_pred ceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhh Q lcl|NC_011308. 236 AILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDE 315 (530) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~ 315 (530) .........+|+||.||||+|+||++|.|+|+++++||||||.++|+++|.+++|++|+++++|....+.++ T Consensus 228 --------~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~ 299 (511) T protein:vir:78 228 --------LTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE 299 (511) T ss_pred --------ccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchh Confidence 111233457899999999999999999999999999999999999999999999999999999977655544 Q ss_pred HHHHHhhCccee-------------cCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHH Q lcl|NC_011308. 316 IKKNIQSKKIIQ-------------TKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKS 381 (530) Q Consensus 316 ~~~~~~~~~~i~-------------~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~ 381 (530) +.. ....+++. ..++++++||+|+.+.+++++++++|.+.||.+|++|++++.++ ||+||+||++ T Consensus 300 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~ 378 (511) T protein:vir:78 300 VRK-QKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKY 378 (511) T ss_pred hcc-cccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHH Confidence 332 23333332 24468899999999999999999999999999999999999998 7999999999 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHH Q lcl|NC_011308. 382 RYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGL--GDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLL 459 (530) Q Consensus 382 ~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~--~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l 459 (530) ++++|.+||.++++.|+++|++++++|++++...+. ..++..+++++|+|++|.|+++.|++++.+ .|++|+||++ T Consensus 379 ~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl--~G~iS~et~l 456 (511) T protein:vir:78 379 KLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLM 456 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHH Confidence 999999999999999999999999999999987654 456778899999999999999999987666 5899999999 Q ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCc Q lcl|NC_011308. 460 AIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEP 516 (530) Q Consensus 460 ~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (530) +++|+++|+++|++||++|+++..+.......+.. +..+++++.+ ..++...+++ T Consensus 457 ~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~-~~~~~~~~~~-~~~~~~~e~~ 511 (511) T protein:vir:78 457 SLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDP-RDINDDEQDD-DTKDTVDKKE 511 (511) T ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCC-CCCCCCCCCC-CccCcccccC Confidence 99999999999999999887654333222222111 1111111111 1111111111 No 27 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=100.00 E-value=3.9e-99 Score=560.10 Aligned_cols=464 Identities=14% Similarity=0.124 Sum_probs=369.7 Q ss_pred CCcccccCCc--ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 1 MTNTLLTTAP--DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 1 ~~~~~~~~~~--~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) ++......+. +...+.|.++|++|... +.++++++.+||.|+|+|+.+.... .+..++++||++||+++ T Consensus 27 ~~~~~~~~e~~~~~~~~~i~~~i~~~~~~-~~~r~~~l~~Yy~g~~~il~~~~~~--------~~~~~~~~ki~~n~~k~ 97 (511) T protein:vir:96 27 VVYTYDGTESDLLQNVNEVSKYIEHHMDY-QRPRLKVLSDYYEGKTKNLVELTRR--------KEEYMADNRVAHDYASY 97 (511) T ss_pred CcccccchhhhhhcCHHHHHHHHHHHHHh-hhHHHHHHHHHhhccCccccccCcc--------cccccCcceeecchHHH Confidence 3333322222 22345688889888754 4578999999999999998776532 34567899999999999 Q ss_pred HHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEE Q lcl|NC_011308. 79 LVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLP 157 (530) Q Consensus 79 Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~ 157 (530) ||++.++||+|+||+|++. +++..+.|+.+++ |+++.++.++++.++++|+||+++|+|++|++++++++|.++|| T Consensus 98 Iv~~~~~yl~g~p~~~~~~---d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~ 174 (511) T protein:vir:96 98 ISDFINGYFLGNPIQYQDD---DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFI 174 (511) T ss_pred HHHHHhhhhcccCceeecC---chHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEE Confidence 9999999999999999764 4556778888885 67999999999999999999999999999999999999999999 Q ss_pred EEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccc Q lcl|NC_011308. 158 VFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDE 235 (530) Q Consensus 158 v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (530) |||++ .++.+++|+|.....+. ...+.+.++++||++.+++|...+++... T Consensus 175 v~dd~~~~~~~~~vr~~~~~~~~~--~~~~~~~~~~vyt~~~i~~~~~~~~~~~~------------------------- 227 (511) T protein:vir:96 175 IYDNTVERNSIAGVRYLRTKPIDK--TDEDEVFTVDLFTSHGVYRYLTNRTNGLK------------------------- 227 (511) T ss_pred EEcCCCCCceEEEEEEEEeeeccc--cccceEEEEEEEeCCcEEEEEecCCCccc------------------------- Confidence 99985 46889999998765443 23466789999999999999876654321 Q ss_pred ceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhh Q lcl|NC_011308. 236 AILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDE 315 (530) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~ 315 (530) .........+|+||.||||+|+||++|.|+|+++++||||||.++|+++|.+++|++|+++++|....+.++ T Consensus 228 --------~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~ 299 (511) T protein:vir:96 228 --------LTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE 299 (511) T ss_pred --------ccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchh Confidence 111233457899999999999999999999999999999999999999999999999999999977655544 Q ss_pred HHHHHhhCccee-------------cCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHH Q lcl|NC_011308. 316 IKKNIQSKKIIQ-------------TKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKS 381 (530) Q Consensus 316 ~~~~~~~~~~i~-------------~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~ 381 (530) +.. ....+++. ..++++++||+|+.+.+++++++++|.+.||.+|++|++++.++ ||+||+||++ T Consensus 300 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~ 378 (511) T protein:vir:96 300 VRK-QKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKY 378 (511) T ss_pred hcc-cccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHH Confidence 332 23333332 24468899999999999999999999999999999999999998 7999999999 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHH Q lcl|NC_011308. 382 RYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGL--GDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLL 459 (530) Q Consensus 382 ~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~--~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l 459 (530) ++++|.+||.++++.|+++|++++++|++++...+. ..++..+++++|+|++|.|+++.|++++.+ .|++|+||++ T Consensus 379 ~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl--~G~iS~et~l 456 (511) T protein:vir:96 379 KLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLM 456 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHH Confidence 999999999999999999999999999999987654 456778899999999999999999987666 5899999999 Q ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCc Q lcl|NC_011308. 460 AIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEP 516 (530) Q Consensus 460 ~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (530) +++|+++|+++|++||++|+++..+.......+.. +..+++++.+ ..++...+++ T Consensus 457 ~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~-~~~~~~~~~~-~~~~~~~e~~ 511 (511) T protein:vir:96 457 SLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDP-RDINDDEQDD-DTKDTVDKKE 511 (511) T ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCC-CCCCCCCCCC-CccCcccccC Confidence 99999999999999999887654333222222111 1111111111 1111111111 No 28 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=100.00 E-value=5e-99 Score=559.50 Aligned_cols=457 Identities=22% Similarity=0.298 Sum_probs=377.1 Q ss_pred CCcccc-----cCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCc Q lcl|NC_011308. 1 MTNTLL-----TTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGF 75 (530) Q Consensus 1 ~~~~~~-----~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~ 75 (530) .+.++. ....+.+.++|.++|.+|. .++.++.++.+||+|+|+|+.++...... ...+..++|+||++|| T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~~~~~~~~YY~g~~~i~~~~~~~~~~---~~~~~~~~~~ri~~n~ 82 (472) T protein:vir:93 8 QTEIFDAIVRTNNKPETLEEMIVRYIKQHL--EKLPEISIGQEYYEQRPDIVKEPKPVDAT---GAVDPLKPDDRMITNF 82 (472) T ss_pred chhhhhceeeecCchhhHHHHHHHHHHHHH--HHHHHHHHHHHHhccccccccccchhhcc---ccccccccccccccch Confidence 222221 2233566788888888875 46689999999999999999887654322 2234567999999999 Q ss_pred hhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccce Q lcl|NC_011308. 76 FAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQL 155 (530) Q Consensus 76 ~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~ 155 (530) +++||++.++||+|+||+|++. ++...+.|+.+++|++++.+.+++++++++|+||+++|.|++|+++++++||.++ T Consensus 83 ~~~ivd~~~~~l~g~~~~~~~~---d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~~i~~~~p~~~ 159 (472) T protein:vir:93 83 HANLVDQKVSYIVGKPIAFKHT---DDEVVKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQG 159 (472) T ss_pred HHHHHHHHhhhhcccCeeeccC---ChHHHHHHHHHHhccHHHHHHHHHHHHhhcCeEEEEEEECCCCceEEEEEcccce Confidence 9999999999999999999764 4567788999998999999999999999999999999999999999999999999 Q ss_pred EEEEcC--CCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccc Q lcl|NC_011308. 156 LPVFDD--YGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGV 233 (530) Q Consensus 156 ~~v~d~--~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (530) ||+||+ ..++.+++|+|.... ..++++|++..+++|....+...... T Consensus 160 ~~i~d~~~~~~~~~~ir~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~--------------------- 208 (472) T protein:vir:93 160 IPIWTDKEHEELEAFIRMYKLEN----------ETKVEYWDKVTVNYYVYENGSLIPDY--------------------- 208 (472) T ss_pred EEEEcCCCCCceEEEEEEEEeec----------ceeEEEEecCeEEEEEEecCeeeecc--------------------- Confidence 999986 457899999886421 24689999999999987665432211 Q ss_pred ccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCch Q lcl|NC_011308. 234 DEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPV 313 (530) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~ 313 (530) ......+.....+|+||.||||+|+||++|+|+|+++++|||+||.++|++++.+++|++|+++++|++.++. T Consensus 209 -------~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~ 281 (472) T protein:vir:93 209 -------SNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQEL 281 (472) T ss_pred -------cccccccccccccCCCCCcceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccc Confidence 1112233445678999999999999999999999999999999999999999999999999999999998888 Q ss_pred hhHHHHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhhHHHHHHH Q lcl|NC_011308. 314 DEIKKNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTLLAMKAQK 392 (530) Q Consensus 314 ~~~~~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~l~~ka~~ 392 (530) +++...++..+++.++++|+|+|++|+++.+++++++++|++.||.+|++|+++++++ ||+||+||++++++|.+||++ T Consensus 282 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~ 361 (472) T protein:vir:93 282 PEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADK 361 (472) T ss_pred hhhHHHHhhccccccCCCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHHHHHHH Confidence 8888889999999999999999999999999999999999999999999999999888 689999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHH Q lcl|NC_011308. 393 TEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLK 472 (530) Q Consensus 393 ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~ 472 (530) +++.|+++|++++++|+++++.. .+..+++++|++++|+|+++.|++++.+ .|++|+||+++++|+++|+++|+ T Consensus 362 ~~~~~~~~l~~~~~li~~~~~~~----~~~~~i~v~f~~~~p~~~~~~~~~~~k~--~giis~et~l~~l~~~~d~~~E~ 435 (472) T protein:vir:93 362 LARKAKVAIQELLWFVFEHFDIK----GEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAEL 435 (472) T ss_pred HHHHHHHHHHHHHHHHHHHhCCC----cccceeeEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHH Confidence 99999999999999999887643 3567899999999999999999886654 68999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCc Q lcl|NC_011308. 473 AICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQ 513 (530) Q Consensus 473 ~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (530) +|+++|+.+..+.+........++.. +..+++..+++ T Consensus 436 ~ri~~E~~~~~~~~~~~~~~~~d~~~----~~~~~~~~~~e 472 (472) T protein:vir:93 436 ERIEQEQMEYNKQLPNLDDGGADGAQ----QQERSNNKESE 472 (472) T ss_pred HHHHHHHHHHHHhccCcCcccCCCCC----CCCCCCcccCC Confidence 99999877665554443222111111 11111111111 No 29 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=100.00 E-value=8.5e-99 Score=558.24 Aligned_cols=449 Identities=21% Similarity=0.290 Sum_probs=372.3 Q ss_pred CCccccc--------------CCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccC Q lcl|NC_011308. 1 MTNTLLT--------------TAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNA 66 (530) Q Consensus 1 ~~~~~~~--------------~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~ 66 (530) |-++... +......++|.++|++|. .++.++.++++||.|+|+|+.++..... ....+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~--~~~~~~~~~~~yY~g~~~i~~~~~~~~~---~~~~~~~~ 75 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITKHK--ENVEDITVGERYYNHQPDVLFNAPKRNV---KGEIDPFK 75 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHHHH--HHHHHHHHHHHHhcCCCccccccccccc---cccccccc Confidence 3333222 233556677788888775 3568899999999999999888654322 22345677 Q ss_pred CcceeecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceE Q lcl|NC_011308. 67 SNIKISHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLT 146 (530) Q Consensus 67 ~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~ 146 (530) +++||++||++.||++.++||||+||+|+++ ++...+.|+.++++++.+.+.+++++++++|.+|+++|+|++|+++ T Consensus 76 ~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~---d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~ 152 (468) T protein:vir:96 76 PDWRMYTNYHQNLVDQKVAYAVANPVTYGTE---DEKSLKTIQEVLNHKWDDKLVDILTAASNKGVEWIQPYVDEQGEFK 152 (468) T ss_pred cccccccchHHHHHHHHHhhhccCCceeccC---ChHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceE Confidence 9999999999999999999999999999764 4556788999999999999999999999999999999999999999 Q ss_pred EEEecccceEEEEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccc Q lcl|NC_011308. 147 FQTVDALQLLPVFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQ 224 (530) Q Consensus 147 ~~~~~p~~~~~v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~ 224 (530) +++++|.++||+|++. +++.+++|+|... ...++++||++.+++|+..++......... T Consensus 153 i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------- 213 (468) T protein:vir:96 153 TFRVPAEQAIPIWTNKERDELKAFIRLYELD----------GGERVEYWTANDVTFYELKDGQLIPDYYQG--------- 213 (468) T ss_pred EEEEcccceEEEEcCCCCCceEEEEEEEEec----------CceEEEEEeCCeEEEEEEcCCceeeccccc--------- Confidence 9999999999999864 5788898888532 235789999999999987765432211110 Q ss_pred eeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceee Q lcl|NC_011308. 225 HVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYV 304 (530) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lv 304 (530) .............+|+||+||||+|+||.+|.|+|+++++||||||.++|+++|.+++|++|+|+ T Consensus 214 ---------------~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv 278 (468) T protein:vir:96 214 ---------------EEHVQAHYYVGNKSMSWNRVPFIPFKNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYV 278 (468) T ss_pred ---------------ccccccceeeccccccCCcccEEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceee Confidence 01111223345578999999999999999999999999999999999999999999999999999 Q ss_pred eecCCCCchhhHHHHHhhCcceecCC--CCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHH Q lcl|NC_011308. 305 VRGGTNSPVDEIKKNIQSKKIIQTKG--EGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKS 381 (530) Q Consensus 305 l~g~~~~~~~~~~~~~~~~~~i~~~~--~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~ 381 (530) ++|+++++.+.+..+++.++++.+++ +|+++|++|+++.+++++++++|++.||.+|++|++++.++ ||+||+||++ T Consensus 279 ~~g~~~~~~~~~~~~~~~~~~i~~~~d~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~ 358 (468) T protein:vir:96 279 LKGYEGEDLEEFMYNLKYYKAINVDGDGSGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKF 358 (468) T ss_pred eecCCccccchhhhhhhcCceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCcccccccccccchHHHHHHH Confidence 99999988888888999999998864 57899999999999999999999999999999999999888 5899999999 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHh Q lcl|NC_011308. 382 RYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAI 461 (530) Q Consensus 382 ~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~ 461 (530) +++++++||+++++.|+++|++++++|+++++ .+++..++.++|++++|.|+++.|++++ ..|++|+||++++ T Consensus 359 ~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g----~~~d~~~i~i~f~~~~p~d~~e~a~~~~---~~g~iS~et~i~~ 431 (468) T protein:vir:96 359 MYSNLDLKANKLKNKTLTALQELLQYIIDFYK----LSIKVQDVEITFNFNVMVNELEQSQIGV---NSQYLSKETVVTN 431 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC----CCcccceeeEEecCCCCcCHHHHHHHHH---hcCCCchHHHHHh Confidence 99999999999999999999999999988763 4567889999999999999999988643 4699999999999 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCc Q lcl|NC_011308. 462 APRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQ 513 (530) Q Consensus 462 ~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (530) +|+++|+++|++|+++|+++..+....+..... .+| + T Consensus 432 l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~------~~~---------~ 468 (468) T protein:vir:96 432 HPWVDDPVAEMERIDQEELALPSIEEGLNGKEN------NEP---------T 468 (468) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHhhccCCCCC------CCC---------C Confidence 999999999999999987765544333222111 111 0 No 30 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=100.00 E-value=2.4e-98 Score=555.82 Aligned_cols=458 Identities=16% Similarity=0.126 Sum_probs=369.0 Q ss_pred CCc--ccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccc-hhhhcccccccccccccccccCCcceeecCchh Q lcl|NC_011308. 1 MTN--TLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDN-DIENTRIMWMNDHGDIVEDDNASNIKISHGFFA 77 (530) Q Consensus 1 ~~~--~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~-~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k 77 (530) +.+ ....+......++|+++|++|.. ++..+++++.+||.|+| .|+++.. ..+..++++|+++||++ T Consensus 26 ~~~~~~~~~~~~~~~~~~l~~~i~~~~~-~~~~r~~~l~~yY~g~~~~i~~~~~---------~~~~~~~~~ki~~n~~k 95 (501) T protein:vir:27 26 IRYRADNLEELMVNNWELLKNFINHHKL-RQAPRIQELLDYARGENHDVLQFGR---------RKDREMADKRAVHNYGR 95 (501) T ss_pred HhhccccccccccccHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCccccccCc---------cCccccccceeccchHH Confidence 111 12223333455678999998864 45678999999999985 5655432 23456789999999999 Q ss_pred hHHhhhhhhhcccceeeecCCcc-hHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccce Q lcl|NC_011308. 78 ELVDQKTQYLLANGIDVKPTDHD-DQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQL 155 (530) Q Consensus 78 ~Ivd~~~~yl~G~pv~~~~~~~~-de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~ 155 (530) +||++.++||+|+||+|++.+.+ ++.+.+.|++++. |+++..+.+++++++++|+||+++|++++|+++++++||.++ T Consensus 96 ~Ivd~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~~i~~~~p~~~ 175 (501) T protein:vir:27 96 MISKFKTGYLAGNPIRVEYDDNDNNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEYDETRIKRLNPLET 175 (501) T ss_pred HHHHHHhhhhcccCeeEecCCccchHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCCCceEEEEEcccee Confidence 99999999999999999987654 4556778888875 789999999999999999999999999999999999999999 Q ss_pred EEEEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccc Q lcl|NC_011308. 156 LPVFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGV 233 (530) Q Consensus 156 ~~v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (530) |||||++ .++.+++|+|..... ...+.++++||++.+++|...++. T Consensus 176 ~~v~d~~~~~~~~~~ir~~~~~~~------~~~~~~~~vyt~~~v~~~~~~~~~-------------------------- 223 (501) T protein:vir:27 176 FVIYDNSLEDNSIAAVRYYNRGTL------QNAKDVVEIYTNEHIYTLDASDDF-------------------------- 223 (501) T ss_pred EEEecCCCCCceEEEEEEEEeeec------CCcEEEEEEEeCCeEEEEEeCCce-------------------------- Confidence 9999985 468889998875432 234678999999999988654321 Q ss_pred ccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCch Q lcl|NC_011308. 234 DEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPV 313 (530) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~ 313 (530) +.....+|+||.||||+|+||.+|.|+|+++++|||+||.++|++++.+++|++|+++++|...++. T Consensus 224 -------------~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~ 290 (501) T protein:vir:27 224 -------------NEISVTTHAFGTVPITEFLNNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPK 290 (501) T ss_pred -------------eeccccccCCCcccEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCc Confidence 1233567999999999999999999999999999999999999999999999999999999988877 Q ss_pred hhHHHHHhhCcceecCC---------CCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHH Q lcl|NC_011308. 314 DEIKKNIQSKKIIQTKG---------EGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRY 383 (530) Q Consensus 314 ~~~~~~~~~~~~i~~~~---------~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~ 383 (530) ++....++..+++.+.. +++++|++|+++.+++++++++|.+.||.+|++|++++.++ ||+||+||++++ T Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~ 370 (501) T protein:vir:27 291 GMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKL 370 (501) T ss_pred ccchhhhhhcCceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHHHHHHHH Confidence 77778888888887643 45799999999999999999999999999999999999988 799999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhC Q lcl|NC_011308. 384 TLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGL-GDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIA 462 (530) Q Consensus 384 ~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~-~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~ 462 (530) ++|.+||.++++.|+++|++++++|+++++..+. .+++..+|+++|+|++|.|+++.|++++.+ .|++|+||+++++ T Consensus 371 ~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl--~g~iS~et~l~~l 448 (501) T protein:vir:27 371 FGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGL--GGQVSQETALSLS 448 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhC Confidence 9999999999999999999999999999988754 578888999999999999999999986654 6899999999999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccc Q lcl|NC_011308. 463 PRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEE 526 (530) Q Consensus 463 ~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (530) |+++||++|++|+++|+.+.+.... +. +..+......+. .++..+.+....+| T Consensus 449 ~~v~D~~~E~eri~~E~~e~~~~~~--~~----~~~~~~~~~~d~-----~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 449 GLVESPNEELDKINKEVSEIDFKGY--SN----DFNEHVGKYTDE-----VKETHTDDFERAYE 501 (501) T ss_pred CCCCCHHHHHHHHHHHHHhhhHhhh--cC----ccccccccccCC-----CCCCccccccccCC Confidence 9999999999999887654321111 11 111111100000 00011111111111 No 31 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=100.00 E-value=2.6e-98 Score=555.62 Aligned_cols=457 Identities=15% Similarity=0.133 Sum_probs=370.0 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhccc-chhhhcccccccccccccccccCCcceeecCchhhH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQD-NDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAEL 79 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~-~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~I 79 (530) +...+ .+......++|+++|++|.. .+.++++++.+||.|+ |+|+++... .+..++++|+++||+++| T Consensus 30 ~~~~~-~~~~~~~~~~i~~~i~~h~~-~~~~rl~~l~~yY~g~~~~i~~~~~~---------~~~~~~~~ki~~n~~k~I 98 (502) T protein:vir:48 30 RADNL-EELMVNNWELLKNFINHHKL-RQAPRIQELLDYARGENHDVLKSGRR---------KDNEMADKRAVHNYGRMI 98 (502) T ss_pred cccch-hhhccccHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCccccccccc---------cccccccceeecchHHHH Confidence 22222 23233345678999998764 4467899999999998 577765432 345678999999999999 Q ss_pred HhhhhhhhcccceeeecCCcc-hHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEE Q lcl|NC_011308. 80 VDQKTQYLLANGIDVKPTDHD-DQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLP 157 (530) Q Consensus 80 vd~~~~yl~G~pv~~~~~~~~-de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~ 157 (530) |++.++||+|+||+|++.+.. ++.+.+.|++++. |+++..+.+++++++++|+||+++|.+++|+++++++||.++|| T Consensus 99 vd~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~ 178 (502) T protein:vir:48 99 SKFKTGYLAGNPIRVEYDDNEDNSQNDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETFV 178 (502) T ss_pred HHHHhhhhcccCeeEecCCccchhHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCceEEEEEcccceEE Confidence 999999999999999987654 4667888998875 68999999999999999999999999999999999999999999 Q ss_pred EEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccc Q lcl|NC_011308. 158 VFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDE 235 (530) Q Consensus 158 v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (530) |||++ .++.+++|+|..... ...+.++++||++.+++|...++. T Consensus 179 vydd~~~~~~~~~ir~~~~~~~------~~~~~~~~iyt~~~i~~~~~~~~~---------------------------- 224 (502) T protein:vir:48 179 IYDNSLEDNSIAAVRYYNRGTL------QNAKDVVEIYTNQHIYTLDASDSF---------------------------- 224 (502) T ss_pred EEcCCCCCceEEEEEEEEEeec------CCcEEEEEEEeCCeEEEEEeCCce---------------------------- Confidence 99975 468899999876432 234678999999999988643321 Q ss_pred ceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhh Q lcl|NC_011308. 236 AILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDE 315 (530) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~ 315 (530) +.....+|+||.||||+|+||.+|.|+|+++++|||+||.++|+++|.+++|++|+++++|......++ T Consensus 225 -----------~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~ 293 (502) T protein:vir:48 225 -----------NEISVTPHAFGTVPITEFLNNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGM 293 (502) T ss_pred -----------eeccceecCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccc Confidence 123356799999999999999999999999999999999999999999999999999999988776666 Q ss_pred HHHHHhhCcceecC---------CCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhh Q lcl|NC_011308. 316 IKKNIQSKKIIQTK---------GEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTL 385 (530) Q Consensus 316 ~~~~~~~~~~i~~~---------~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~ 385 (530) ....++..+++.+. ++++++||+|+++.+++++++++|.+.||.+|++|++++.++ ||+||+||++++++ T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~ 373 (502) T protein:vir:48 294 QASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLFG 373 (502) T ss_pred chhhhhhcceeeccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHHH Confidence 67777888887653 357899999999999999999999999999999999999887 79999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC Q lcl|NC_011308. 386 LAMKAQKTEIALRKTLRWTADLVVEDIRRRGL-GDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR 464 (530) Q Consensus 386 l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~-~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~ 464 (530) |.+||.++++.|+++|++++++|+++++..+. .+++..+|+++|++++|.|.++.|++++++ +|++|+||+++++|+ T Consensus 374 l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~l~~l~~ 451 (502) T protein:vir:48 374 LDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESRLKITFTPNLPKSLYEQVSILNDL--GGQVSQETALSLSGL 451 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCC Confidence 99999999999999999999999999988753 678888999999999999999999987665 589999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 465 IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 465 vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) |+|+++|++|+++|+.+.+.... ..+. .+..+.+ .+..++ ++..+.++|-| T Consensus 452 v~D~~~E~~ri~~E~~~~~~~~~--~~~~-~~~~~~~---~d~~~e---------~~~~~~~~~~~ 502 (502) T protein:vir:48 452 VENPTEELDKINEESSKIDFKGY--PSYF-YDNVGKY---TDEVKE---------THTDDFERVYE 502 (502) T ss_pred CCCHHHHHHHHHHHHHhhhhhcc--cccc-ccccccc---CCCccC---------CCCcCcCCCCC Confidence 99999999999888665322111 1111 1111111 011111 11122222222 No 32 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=100.00 E-value=8.3e-98 Score=552.82 Aligned_cols=456 Identities=16% Similarity=0.141 Sum_probs=369.4 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhccc-chhhhcccccccccccccccccCCcceeecCchhhH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQD-NDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAEL 79 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~-~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~I 79 (530) +++ ..+..+...++|+++|.+|.. ++.++|+++.+||.|+ |+|+.+... ....++++|+++||+++| T Consensus 30 ~~~--~~~~~~~~~~~i~~~i~~~~~-~~~~r~~~~~~yY~g~~~~i~~~~~~---------~~~~~~~~ri~~n~~k~I 97 (501) T protein:vir:96 30 ADN--LEELMVNNWELLKNFINHHKL-RQAPRIQELLDYARGENHDVLKSGRR---------KDNEMADKRAVHNYGRMI 97 (501) T ss_pred ccc--cccccCChHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCCcccCcccc---------CccccccceeecchHHHH Confidence 222 223344556789999998874 4467899999999998 577765432 234678999999999999 Q ss_pred HhhhhhhhcccceeeecCCcc-hHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEE Q lcl|NC_011308. 80 VDQKTQYLLANGIDVKPTDHD-DQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLP 157 (530) Q Consensus 80 vd~~~~yl~G~pv~~~~~~~~-de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~ 157 (530) |++.++||+|+||+|++.+.+ ++.+...|+++++ |+++..+.+++++++++|+||+++|++++|+++++++||.++|| T Consensus 98 vd~~~~yl~g~p~~~~~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~ 177 (501) T protein:vir:96 98 SKFKTGYLAGNPIRVEYDDNDDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETFV 177 (501) T ss_pred HHHHhhhhcccCeeEeeCCccchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEccceeEE Confidence 999999999999999887644 4667888999885 68999999999999999999999999999999999999999999 Q ss_pred EEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccc Q lcl|NC_011308. 158 VFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDE 235 (530) Q Consensus 158 v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (530) |||++ +++.+++|+|..... ...+.++++||++.+++|...++. T Consensus 178 v~d~~~~~~~~~~v~~~~~~~~------~~~~~~~~vyt~~~i~~~~~~~~~---------------------------- 223 (501) T protein:vir:96 178 IYDNSLEDNSIAAVRYYNRGTL------QSAKDVVEIYTDEHIYTLDASDDF---------------------------- 223 (501) T ss_pred EEcCCCCCceEEEEEEEEeecC------CCcEEEEEEEcCCcEEEEeeCCCc---------------------------- Confidence 99985 568899998865332 234678999999999998643321 Q ss_pred ceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhh Q lcl|NC_011308. 236 AILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDE 315 (530) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~ 315 (530) +.....+|+||.||||+|+||.+|.|+|+++++|||+||.++|++++.+++|++|+++++|...++.++ T Consensus 224 -----------~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~ 292 (501) T protein:vir:96 224 -----------NEISVTTHAFGTVPITEYLNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGM 292 (501) T ss_pred -----------eeccccccCCCccceEEecCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCccc Confidence 123346799999999999999999999999999999999999999999999999999999998877777 Q ss_pred HHHHHhhCcceecC---------CCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhh Q lcl|NC_011308. 316 IKKNIQSKKIIQTK---------GEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTL 385 (530) Q Consensus 316 ~~~~~~~~~~i~~~---------~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~ 385 (530) ...+++..+++.++ .+++++||+|+++.++++.++++|++.||.+|++|++++.++ ||+||+||++++++ T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~ 372 (501) T protein:vir:96 293 QASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKLFG 372 (501) T ss_pred chhhhhhcCeeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHH Confidence 77888888887764 245799999999999999999999999999999999999888 78999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC Q lcl|NC_011308. 386 LAMKAQKTEIALRKTLRWTADLVVEDIRRRGL-GDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR 464 (530) Q Consensus 386 l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~-~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~ 464 (530) |.+||.++++.|+++|++++++|+++++..+. .+++..+|+++|++++|.|+++.|++++.+ .|+||+||+++++|+ T Consensus 373 l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl--~g~iS~et~~~~l~~ 450 (501) T protein:vir:96 373 LDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGL--GGQVSQETALSLSGL 450 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCchHHHHHhCCC Confidence 99999999999999999999999999988753 567888999999999999999999986665 589999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 465 IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 465 vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) ++|+++|++|+++|+.+... ....++ ..+.....++..+++ + ..+.|=|-| T Consensus 451 v~D~~~E~~ri~~E~~~~~~---~~~~~~---~~~~~~~~~~~~~e~--~-------~d~~e~~~~ 501 (501) T protein:vir:96 451 VESPNEELDKINKEMSEIDF---KGYSND---FNEHVGKYTDEVKET--H-------TDDFEREYE 501 (501) T ss_pred CCCHHHHHHHHHHHHHHhhc---cccccc---hhhcccccCCcCCCC--C-------CCccccccC Confidence 99999999999887665321 111111 111111000000000 0 000011111 No 33 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=100.00 E-value=2.8e-97 Score=549.90 Aligned_cols=438 Identities=14% Similarity=0.102 Sum_probs=366.9 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) =+++|..+.+++. +.|.++|++|.. +..++..+.+||+|+|+|++|+.+ ...++++||++||+++|| T Consensus 7 ~~~~~~~~~~~~~-~~i~~~i~~~~~--~~~r~~~~~~Yy~g~~~i~~~~~~----------~~~~~~~ki~~n~~~~iv 73 (452) T protein:vir:36 7 KLMTFSKDEPITV-EVVTKFMEKHKL--EVARYEYLKNMYLGIMAIDDEPAK----------DSWKPDNRLAVNFTKYIV 73 (452) T ss_pred eeEEcCCccCCCH-HHHHHHHHHHHH--HHHHHHHHHHHhccccccccCccc----------cccCccceeecchHHHHH Confidence 4566777777764 567889998753 568899999999999999987642 345789999999999999 Q ss_pred hhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEE Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVF 159 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~ 159 (530) ++.++||||+||+|++.+ ++..+.|+++++ |+++..+.+++++++++|.||+++|+|++|+++++++||.++||+| T Consensus 74 d~~~~~l~g~~~~~~~~d---~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~ 150 (452) T protein:vir:36 74 DTFTGYFNGIPVKKSHSD---KEILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDEDTQTNVVYNSPENMFMVY 150 (452) T ss_pred HHHhhhhcccCceeecCC---hhHHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEE Confidence 999999999999998654 456677888885 6899999999999999999999999999999999999999999999 Q ss_pred cCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccce Q lcl|NC_011308. 160 DDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAI 237 (530) Q Consensus 160 d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (530) |+. ..+.+++|+|.. .....++++||++.+++|...+++. T Consensus 151 d~~~~~~~~~~i~~~~~---------~~~~~~~~vyt~~~i~~~~~~~~~~----------------------------- 192 (452) T protein:vir:36 151 DDTVKQEPLFAVRYGVD---------EDKKLQGEVYTLLETIKISGENDEI----------------------------- 192 (452) T ss_pred cCCCCCceEEEEEEEEe---------cCceEEEEEEecCeEEEEEEcCCce----------------------------- Confidence 985 357788887742 1235789999999999987654321 Q ss_pred ecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHH Q lcl|NC_011308. 238 LDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIK 317 (530) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~ 317 (530) ......+|+||.||||+|+||++|.|+|+++++|||+||.++|++++.+++|++|+++++|+..+. +.. T Consensus 193 ---------~~~~~~~~~~g~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~~~~--~~~ 261 (452) T protein:vir:36 193 ---------SFGEGTYNPYPDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEE--EDL 261 (452) T ss_pred ---------EEecceeccCCcccEEEecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcCc--hhh Confidence 112345799999999999999999999999999999999999999999999999999999987654 445 Q ss_pred HHHhhCcceecCCC-----CceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHH Q lcl|NC_011308. 318 KNIQSKKIIQTKGE-----GGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQK 392 (530) Q Consensus 318 ~~~~~~~~i~~~~~-----~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ 392 (530) .+++.++++.+..+ ++|+|++|+.+.+++++++++|.+.||.+|++|++++.++||+||+||++++++|++||++ T Consensus 262 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~ 341 (452) T protein:vir:36 262 KNIRSNRVINYYADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALS 341 (452) T ss_pred hhhhhcceEEecCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccccCcccccCCcHHHHHHHHHHHHHHHHH Confidence 66777888887654 3799999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHH Q lcl|NC_011308. 393 TEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLK 472 (530) Q Consensus 393 ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~ 472 (530) +++.|+.+|++++++|++++...+ ..++..+|+++|++++|.|+++.|++++++ +|++|+||+++++|+++|+++|+ T Consensus 342 ~~~~~~~~l~~~~~li~~~~~~~~-~~~~~~~i~i~f~~~~p~d~~~~a~~~~k~--~g~iS~et~~~~~~~~~d~~~E~ 418 (452) T protein:vir:36 342 FQRKFQSSLNSRYKLFCELSTNVS-NKDSWKDIEYTFTRNEPKDIKEQAETANIL--MGITSQETALSVISVIPDVQAEM 418 (452) T ss_pred HHHHHHHHHHHHHHHHHHHHhccC-CccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHH Confidence 999999999999999999988775 456888999999999999999999986654 68899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCC Q lcl|NC_011308. 473 AICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIE 511 (530) Q Consensus 473 ~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (530) +|+++|+++..+..+..... .++.+++.+. +. ++ T Consensus 419 ~ri~~E~~~~~~~~~~~~~~-~~~~~~~~~~--~~--~e 452 (452) T protein:vir:36 419 EKIKKEEASTAIFDKDKQPS-EKGTDTVVSE--TN--EE 452 (452) T ss_pred HHHHHHHHHHHHHHhhccCC-CCcccccCcc--cc--CC Confidence 99998876654433322111 1111111111 11 11 No 34 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=100.00 E-value=4.1e-97 Score=549.00 Aligned_cols=458 Identities=12% Similarity=0.097 Sum_probs=364.6 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) -+.++..+...-..+.|.++|++|... +.+++..+.+||.|+|.++.++... ..+..++++||++||+++|| T Consensus 11 ~~~~~~~~~~~l~~~~i~~li~~~~~~-~~~r~~~l~~YY~g~~~~i~~~~~~-------~~~~~~~~~ki~~n~~~~Iv 82 (506) T protein:vir:94 11 ANLIYQESLENLTPNKIMKFITHHFNY-QRPRLEMLDDYYQGYNLKILDKQSR-------RHEDGKADHRATHSFAKYIA 82 (506) T ss_pred ceeecccchhcCCHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccc-------cccccCCcceeecchHHHHH Confidence 222333332222234567788888764 4678999999999999665433221 23456789999999999999 Q ss_pred hhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEE Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVF 159 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~ 159 (530) ++.++||||+||+|++.+ +...+.|+++++ |+++..+.++++.++++|+||+++|+|++|+++++++||.++|||| T Consensus 83 ~~~~~~l~G~p~~~~~~d---~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~~i~~~~p~~~~~v~ 159 (506) T protein:vir:94 83 DFQTSYSVGNPINVKLPD---DGSNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGEDNEEHLAKLDPLDTFVIY 159 (506) T ss_pred HHhhhhhcccCceeecCc---chHHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEe Confidence 999999999999998764 345677888874 6899999999999999999999999999999999999999999999 Q ss_pred cCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccce Q lcl|NC_011308. 160 DDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAI 237 (530) Q Consensus 160 d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (530) |+. .++.+++|+|.....+. ........++++||+..++.|.....+. T Consensus 160 dd~~~~~~~~~v~~~~~~~~~~-~~~~~~~~~~~~yt~~~~~~~~~~~~~~----------------------------- 209 (506) T protein:vir:94 160 STDVDPKPIMAVRYHQIELVDD-NQVSTINYVPETWTADTYTLYNPTPIMG----------------------------- 209 (506) T ss_pred cCCCCCceEEEEEEEeeeeccC-CceeEEEEEEEEEeCceEEEeccccCcc----------------------------- Confidence 874 46889999887654433 2234456788999999988885433221 Q ss_pred ecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC------- Q lcl|NC_011308. 238 LDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN------- 310 (530) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~------- 310 (530) ......+|+||.||||+|+||++|.|+|+++++||||||.++|+++|.+++|++|+|+++|... T Consensus 210 ---------~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~ 280 (506) T protein:vir:94 210 ---------KMQVDTTKPITTFPVVEFKNSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSD 280 (506) T ss_pred ---------ceeccccccCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchh Confidence 1122456999999999999999999999999999999999999999999999999999999642 Q ss_pred -----------------CchhhHHHHHhhCcceecCCC---------CceeEEEecCCHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_011308. 311 -----------------SPVDEIKKNIQSKKIIQTKGE---------GGLDIQTVDIPYEARKAKMDIDELNIYRSGMGF 364 (530) Q Consensus 311 -----------------~~~~~~~~~~~~~~~i~~~~~---------~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p 364 (530) ....++...++.++++.++++ ++++||+|+++.+++++++++|.+.||.+|++| T Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p 360 (506) T protein:vir:94 281 MMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTP 360 (506) T ss_pred ccccccccccccccccccchhHHHhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCcc Confidence 223356667788888887653 479999999999999999999999999999999 Q ss_pred CCCcccc-cCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCccccceeeEEeCCCCCCCHHHHHH Q lcl|NC_011308. 365 NSSAVGD-GNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRG-LGDYSSTDIKFDIEPYILANELDLAM 442 (530) Q Consensus 365 ~~~~~~~-gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~-~~~~d~~~i~i~f~~~~P~n~~e~a~ 442 (530) +++++++ ||+||+||++++++|++||++++++|+++|++++++|+++++..+ ..+++..+++|+|++++|.|+++.|+ T Consensus 361 ~~~~~~~~~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~ 440 (506) T protein:vir:94 361 DLTDENFASNSSGVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIK 440 (506) T ss_pred ccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHH Confidence 9999887 789999999999999999999999999999999999999998764 45788889999999999999999999 Q ss_pred HHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCc Q lcl|NC_011308. 443 IDKTEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQ 513 (530) Q Consensus 443 ~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (530) +++++ +|+||+||+++++|+++|+++|++|+++|+.+.+..+........+ +..+...+...++-+ T Consensus 441 ~~~kl--~g~iS~et~~~~lp~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~e~~ 506 (506) T protein:vir:94 441 ALVQA--GATLPQKYLYQQLPGVTNPQDIVDMMKEQSANGDYSFDQNGVISND---GQTNTTATQTDEEVR 506 (506) T ss_pred HHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHhhcchhhcCCCcc---cCccccccccccCCC Confidence 86665 6899999999999999999999999999877654433222111111 111111111111111 No 35 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=100.00 E-value=9.2e-96 Score=541.61 Aligned_cols=438 Identities=14% Similarity=0.077 Sum_probs=365.5 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) =+.++..+.+++. ++|.++|++|. .+..++.++++||+|+|+|++++.. ...++++|+++||+++|| T Consensus 7 ~~~~~p~d~~~~~-~~l~~~i~~~~--~~~~r~~~~~~yy~g~~~i~~~~~~----------~~~~~~~ki~~n~~~~iv 73 (453) T protein:vir:39 7 KLMTFPKDEPITN-EVVTKFMEKHR--LEVARYEYLKNMYRGIMAIDAEPTK----------DLWKPDNRLTVNFTKYIV 73 (453) T ss_pred cceEcCCCCCCCH-HHHHHHHHHHH--HHHHHHHHHHHHhhccCchhcCCCc----------cccCccceeecchHHHHH Confidence 4667777777654 46899998875 3567899999999999999987642 355789999999999999 Q ss_pred hhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEE Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVF 159 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~ 159 (530) ++.++||||+||+|++.+ +...+.|++++. |+++..+.+++++++++|.||+++|.+++|++++++++|.+++|+| T Consensus 74 d~~~~~l~g~~~~~~~~d---~~~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~ 150 (453) T protein:vir:39 74 DTFTGYFNGIPVKKSHSD---KETLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEETQTNVIYNTPENMFMVY 150 (453) T ss_pred HHHhhhhcccCceeccCC---hHHHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEe Confidence 999999999999997653 456677888875 6899999999999999999999999999999999999999999999 Q ss_pred cCCC--CceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccce Q lcl|NC_011308. 160 DDYG--TLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAI 237 (530) Q Consensus 160 d~~~--~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (530) |+.. .+.+++|+|.. .....++++||++.+++|...++.. T Consensus 151 d~~~~~~~~~~ir~~~~---------~~~~~~~~~yt~~~i~~~~~~~~~~----------------------------- 192 (453) T protein:vir:39 151 DDTIKQEPLFAVRYGYD---------DDYKLYGEVYTKETTYALNGTMGFY----------------------------- 192 (453) T ss_pred cCCCCCeEEEEEEEEEe---------CCeEEEEEEEeCCeEEEEEecCCce----------------------------- Confidence 8754 46777777642 2346789999999999987544321 Q ss_pred ecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHH Q lcl|NC_011308. 238 LDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIK 317 (530) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~ 317 (530) ......+|+||.||||+|+|+++|+|+|+.+++|||+||.++|++++.+++|++|+++++|.+.++ +.. T Consensus 193 ---------~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~~~~--~~~ 261 (453) T protein:vir:39 193 ---------NMTEQAPNPFDDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEE--EDL 261 (453) T ss_pred ---------eeecccccCCCceeEEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCCCc--hhh Confidence 112345799999999999999999999999999999999999999999999999999999987653 334 Q ss_pred HHHhhCcceecC------CCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHH Q lcl|NC_011308. 318 KNIQSKKIIQTK------GEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQ 391 (530) Q Consensus 318 ~~~~~~~~i~~~------~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~ 391 (530) .+++..+++.+. .+++|+|++++++.+++++++++|.+.||.+|++|++++.++||+||+||++++++|++||+ T Consensus 262 ~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~ 341 (453) T protein:vir:39 262 KNIRSNRVINYYGESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLAL 341 (453) T ss_pred hhhhhcceeeecCCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccCChHHHHHHHHHHHHHHHH Confidence 566777777653 46889999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHH Q lcl|NC_011308. 392 KTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETL 471 (530) Q Consensus 392 ~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e 471 (530) ++++.|+.+|++++++|+++++..+ ..++..+|+++|++++|.|+++.|++++++ +|++|+||+++++|+++|+++| T Consensus 342 ~~~~~~~~~l~~~~~li~~~~~~~~-~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl--~g~is~et~l~~l~~v~D~~~E 418 (453) T protein:vir:39 342 SFQRKFQSSLNSRYKLYCELSTNVS-NKEAWKDIEYTFTRNEPKDIKEQAETANIL--MGITSQETALSVISVIPDVQAE 418 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccC-CccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHH Confidence 9999999999999999999988766 457888999999999999999999986655 6889999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCcc Q lcl|NC_011308. 472 KAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQP 514 (530) Q Consensus 472 ~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (530) ++|+++|+++.....+...... .+..+.. +++.++ T Consensus 419 ~~ri~~E~~~~~~~~~~~~~~~----~~~~~~~----~~~~~e 453 (453) T protein:vir:39 419 MEKIKKEEASTAIFDKDKQPSE----KGTDTVV----PETNEE 453 (453) T ss_pred HHHHHHHHHHHHHHHHhccCCC----CCCCCCC----CCcCCC Confidence 9999988776554333222211 1111110 111111 No 36 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=100.00 E-value=5e-95 Score=537.60 Aligned_cols=421 Identities=14% Similarity=0.130 Sum_probs=358.1 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |+ .++|.++|++|. .+..++..+++||+|+|+|+++..+ ...++++||++||+++||++.++||||+ T Consensus 1 l~-~~~l~~~i~~~~--~~~~r~~~l~~yy~g~~~il~~~~~----------~~~~~~~ki~~n~~~~ivd~~~~~l~g~ 67 (429) T protein:vir:98 1 MT-KDLLSELIQKHR--SFNLSYSAYKQLYEGDHAILQQKQK----------EQYKPDNRLVVNFAKYIVDTFNGYFIGV 67 (429) T ss_pred CC-HHHHHHHHHHHH--HHHHHHHHHHHHhcccccccccccc----------ccCCCcceeecchHHHHHHHHhhhhccc Confidence 44 456788898875 3568999999999999999987643 3457899999999999999999999999 Q ss_pred ceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEcCCC--Ccee Q lcl|NC_011308. 91 GIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFDDYG--TLQR 167 (530) Q Consensus 91 pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~~--~~~~ 167 (530) ||+|++++ +...+.|+++++ |+++..+.+++++++++|+||+++|.+++|++++++++|.+++|+||+.. ++.+ T Consensus 68 ~~~~~~~~---~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~~~~~~p~~~~~v~dd~~~~~~~~ 144 (429) T protein:vir:98 68 PVQTSHEN---KQVSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDENAEAGITYLTPLEAFIVYDDSIRQKPLF 144 (429) T ss_pred CceeecCC---hHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEEEEEEcccceEEEEeCCCCCceEE Confidence 99998654 456677888875 67999999999999999999999999999999999999999999999754 4778 Q ss_pred EEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccccccc Q lcl|NC_011308. 168 IIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEG 247 (530) Q Consensus 168 ~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (530) ++|+|.. ...+.+.++|+.+.++.|....++. . T Consensus 145 ~i~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~--------------------------------------~ 177 (429) T protein:vir:98 145 AVRYFYN---------KGGVLEGSYSDASNITYFKDGEKGI--------------------------------------E 177 (429) T ss_pred EEEEEEe---------cCceEEEEEEeCceEEEEEecCCce--------------------------------------E Confidence 8887743 1245778899999998886544321 1 Q ss_pred ccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCccee Q lcl|NC_011308. 248 RQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQ 327 (530) Q Consensus 248 ~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~ 327 (530) .....+|+||.||||+|+||.+|+|+|+++++|||+||.++|++++.+++|++|+++++|.++++ ++..+++..+++. T Consensus 178 ~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~~~~--~~~~~~~~~~~~~ 255 (429) T protein:vir:98 178 IGESEPHPFDGVPMIEYVENEERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILGAELDD--ETLKSLRDTRIIN 255 (429) T ss_pred ecccccccCCccceEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCCCc--chhhhHhhCceee Confidence 12345799999999999999999999999999999999999999999999999999999987653 5567788889998 Q ss_pred cCCC----CceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 328 TKGE----GGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRW 403 (530) Q Consensus 328 ~~~~----~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~ 403 (530) ++++ ++|+|++|+.+.+++++++++|.+.||.+|++|++++.++||+||+||++++++|+.||.++++.|+++|++ T Consensus 256 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 335 (429) T protein:vir:98 256 LKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVANISDESFGTASGIALRYRLQAMDNLAKTKERKFMSGMNR 335 (429) T ss_pred ccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8643 579999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_011308. 404 TADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYE 483 (530) Q Consensus 404 ~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~ 483 (530) ++++|+++++..+. .+++.+|+++|++++|.|+++.|++++++ +|++|+||+++++|+++|+++|++|+++|+++.. T Consensus 336 ~~~li~~~~~~~~~-~~d~~~i~v~f~~~~p~~~~~~a~~~~kl--~g~is~et~~~~l~~v~d~~~E~~ri~~E~~~~~ 412 (429) T protein:vir:98 336 RYKLIASYPTSKIG-PKDWIGIKYKFTRNLPANLLEESQIAGNL--AGIVSEETQVGVLSIVENPQKEIERKNSDKSTLI 412 (429) T ss_pred HHHHHHHHhccCCC-ccccccceEEeCCCCCcCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHH Confidence 99999999887654 57888999999999999999999986655 6899999999999999999999999999887653 Q ss_pred HH-HHhhhccccccCCccccCCCC Q lcl|NC_011308. 484 DV-VKALEDQEVEELEPTVTPIID 506 (530) Q Consensus 484 ~~-~~~~~~~~~~~~~~~~~~~~~ 506 (530) +. ...+..+. .+.+.| T Consensus 413 ~~~~~~~~~~~-------~~~~~~ 429 (429) T protein:vir:98 413 SRQAGGLNGQN-------TTTILE 429 (429) T ss_pred HHHHhhhcCCC-------CCCCCC Confidence 32 22222221 111111 No 37 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=100.00 E-value=1e-94 Score=535.90 Aligned_cols=433 Identities=13% Similarity=0.080 Sum_probs=354.4 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) --.++..+..+ .++.|.++|++|. .+..++.++.+||+|+|+|+++... ...++++|+++||+++|| T Consensus 7 ~~~~~~~~~~~-~~~~i~~~i~~~~--~~~~r~~~~~~yy~g~~~i~~~~~~----------~~~~~~~ki~~n~~~~iv 73 (453) T protein:vir:73 7 KLMTYSRDEEI-TDKVVNDFMKKHQ--EEVERYEYLGNMYKGIMEISSQKAK----------DSWKPDNRLTNNFAKYIV 73 (453) T ss_pred eeeeccccccC-CHHHHHHHHHHHH--HHHHHHHHHHHHhccccchhcCCCC----------CccCccceeecchHHHHH Confidence 01111122333 3556788888874 4568999999999999999886542 345789999999999999 Q ss_pred hhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEE Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVF 159 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~ 159 (530) ++.++||+|+||+|++. ++...+.|+++++ |++...+.+++++++++|.||+++|++++|++++++++|.++||+| T Consensus 74 d~~~~~l~g~~~~~~~~---d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~ 150 (453) T protein:vir:73 74 DTFVGYFNGIPIKKTHD---DKSVLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNESTESEVIYCSPLNVFMVY 150 (453) T ss_pred HHhhhhhcccCceeecC---ChHHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEEEE Confidence 99999999999999765 4456778888875 6899999999999999999999999999999999999999999999 Q ss_pred cCCCCc--eeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccce Q lcl|NC_011308. 160 DDYGTL--QRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAI 237 (530) Q Consensus 160 d~~~~~--~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (530) |+.... .+++++|.. .....++++||++.+++|...++.. T Consensus 151 dd~~~~~~~~~i~~~~~---------~~~~~~~~vyt~~~i~~~~~~~~~~----------------------------- 192 (453) T protein:vir:73 151 DDSIKQKPLFAVYYGFD---------EEGNLSGTVYTLLETISITGKAGEV----------------------------- 192 (453) T ss_pred eCCCCceeEEEEEEEEe---------cCceEEEEEEeCCeEEEEEecCCce----------------------------- Confidence 886543 444444421 1234689999999999987544321 Q ss_pred ecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHH Q lcl|NC_011308. 238 LDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIK 317 (530) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~ 317 (530) ......+|+||.||||+|+||++|.|+|+++++|||+||.++|++++.+++|++|+++++|+..++ +.. T Consensus 193 ---------~~~~~~~~~~g~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~--~~~ 261 (453) T protein:vir:73 193 ---------KFGESTYNVYSDLPIVEYNFNEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGAEVDE--EDA 261 (453) T ss_pred ---------EEccceeccCCceeEEEecCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCc--hhh Confidence 112345799999999999999999999999999999999999999999999999999999987653 334 Q ss_pred HHHhhCcceec-----------CCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhH Q lcl|NC_011308. 318 KNIQSKKIIQT-----------KGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLL 386 (530) Q Consensus 318 ~~~~~~~~i~~-----------~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l 386 (530) .+++..+++.+ +.+++|+|++|+.+.+++++++++|.+.||.+|++|++++.++||+||+||++++++| T Consensus 262 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l 341 (453) T protein:vir:73 262 KNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAANISDENFGNSSGVALAYKLQAM 341 (453) T ss_pred hcccccccccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccCccHHHHHHHHHHH Confidence 55555555543 2357799999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCC Q lcl|NC_011308. 387 AMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIG 466 (530) Q Consensus 387 ~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vd 466 (530) .+||.++++.|+++|++++++|++++...+ ..+++.+++++|++++|.|+++.|++++++ .|++|+||+++++|+++ T Consensus 342 ~~ka~~~~~~~~~~l~~~~~li~~~~~~~~-~~~~~~~i~v~f~~~~p~~~~~~a~~~~k~--~giis~et~~~~~~~~~ 418 (453) T protein:vir:73 342 SNLALSFQRKFQSALNRRYSLWSSLSTNAS-NKDAWKDIEYTFTRNEPKDIKEQAETANIL--KGITSEETALSVISVIP 418 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccC-CccccccceEEeCCCCCCCHHHHHHHHHHH--hccCcHHHHHHhCCCCC Confidence 999999999999999999999999887665 456788999999999999999999987665 48999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCC Q lcl|NC_011308. 467 DEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPII 505 (530) Q Consensus 467 d~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~ 505 (530) |+++|++|+++|+++..+........+.+.. .+++ T Consensus 419 d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~----~~~~ 453 (453) T protein:vir:73 419 DVQAEMEKIKKKKLLQLSLTRTSNLVRMKQM----RGNL 453 (453) T ss_pred CHHHHHHHHHHHHHHHHHHHHhccCCcchhh----hcCC Confidence 9999999999988776544443322221111 1111 No 38 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=100.00 E-value=4.3e-94 Score=532.47 Aligned_cols=445 Identities=11% Similarity=0.051 Sum_probs=365.3 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) ..+.+..+...+.+ .|.++|++|.. ++..+|+++++||.|+|+|+++.. ...++|+||++||+++|| T Consensus 15 ~~~~~~~~~~~~~~-~i~~~i~~~~~-~~~~~~~~l~~Yy~g~~~i~~~~~-----------~~~~~~~ki~~n~~~~Iv 81 (470) T protein:vir:99 15 SSFIFPKGEKLTSN-ELLGFIAYNET-VLKPRYRENMKLYLGKHKILTAPE-----------KETGADNRIVVNSAKYVV 81 (470) T ss_pred ceEEeCCCCCcCHH-HHHHHHHHHHH-hhHHHHHHHHHHhccccccccCcc-----------cccCCcceeecchHHHHH Confidence 44445555555554 56788888764 456789999999999999988753 235789999999999999 Q ss_pred hhhhhhhcccceeeecCCcchHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEE Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHDDQKLCYLIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVF 159 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~ 159 (530) ++.++||+|+||+|++.+ +.+..+.|.+++ ++++...+.+++++++++|.||+++|.+++|++++++++|.++||+| T Consensus 82 d~~~~~l~g~p~~~~~~~--d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~~i~ 159 (470) T protein:vir:99 82 DVYNGYFCGIEPKLALLN--DSSKIDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDARPHLMYSSPNHAFIIY 159 (470) T ss_pred HHHhhhhccCCeeEeeCC--chhHHHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeEEEEEEccceeEEEE Confidence 999999999999998754 334456677776 47899999999999999999999999999999999999999999999 Q ss_pred cCCC--CceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccce Q lcl|NC_011308. 160 DDYG--TLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAI 237 (530) Q Consensus 160 d~~~--~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (530) |+.. .+.+++|+|... .......++.+||++.+++|...+.+.. T Consensus 160 d~~~~~~~~~~vr~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------------------- 205 (470) T protein:vir:99 160 DDTVQRQPLAFVHYQIDN------SNNWTDAYGVIQYADKFYKFKGYDIEED---------------------------- 205 (470) T ss_pred cCCCCcceEEEEEEEEEe------cCCeeEEEEEEEecCeEEEEEecccccc---------------------------- Confidence 9865 467888887542 1234567889999999988865443221 Q ss_pred ecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCC--chhh Q lcl|NC_011308. 238 LDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNS--PVDE 315 (530) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~--~~~~ 315 (530) .......+|+||.||||+|+||.+|.|+|+.+++|||+||.++|++++.+++|++|+++++|+..+ +.++ T Consensus 206 --------~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~ 277 (470) T protein:vir:99 206 --------TNAAGYAINPYGLVPAVEFFENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEGN 277 (470) T ss_pred --------cccccccccCCCccceEeecCCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccccc Confidence 112234579999999999999999999999999999999999999999999999999999998653 3456 Q ss_pred HHHHHhhCcceecC-----CCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhhHHHH Q lcl|NC_011308. 316 IKKNIQSKKIIQTK-----GEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTLLAMK 389 (530) Q Consensus 316 ~~~~~~~~~~i~~~-----~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~l~~k 389 (530) ....++..+++.++ .+++++||+|+++.+++++++++|++.||.+|++|++++.++ ||+||+||++++++|.+| T Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k 357 (470) T protein:vir:99 278 PKFDFKNNRVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNK 357 (470) T ss_pred hhhhhhhcceeeecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHH Confidence 67778888887764 467899999999999999999999999999999999999887 789999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHH Q lcl|NC_011308. 390 AQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEE 469 (530) Q Consensus 390 a~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~ 469 (530) |.++++.|+++|++++++|+++++..+..++++.+++++|++++|.|+++.|++++.+ .|++|+||+++++|++ |++ T Consensus 358 ~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl--~giis~et~l~~l~~v-d~~ 434 (470) T protein:vir:99 358 ADSKERKFDKSLMQLYRIVLATLFNNKQDQELWSELDFKFTRNLPEDMASAIDNAKNA--EGIVSKKTQLGMIPDI-EPD 434 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccceEEeCCCCCcCHHHHHHHHHHH--hccCCHHHHHHhCCCC-CHH Confidence 9999999999999999999999998888888899999999999999999999987655 5889999999999999 689 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCC Q lcl|NC_011308. 470 TLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEP 512 (530) Q Consensus 470 ~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (530) +|++|+++|+++..+......... +.. ..++.. ++. T Consensus 435 ~E~eri~~E~~~~~~~~~~~~~~~-d~~--~~d~~~----ee~ 470 (470) T protein:vir:99 435 AEMKQIAKEKADAIKQTQQLSMPI-DIL--KRDNNA----EEE 470 (470) T ss_pred HHHHHHHHHHHHHHHHHHhhcCCC-CcC--CCCCCc----cCC Confidence 999999888766544333222211 101 011100 000 No 39 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=100.00 E-value=6.8e-95 Score=536.84 Aligned_cols=425 Identities=12% Similarity=0.087 Sum_probs=353.5 Q ss_pred HHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhcccceeeecCCc Q lcl|NC_011308. 20 KIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLANGIDVKPTDH 99 (530) Q Consensus 20 ~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~~ 99 (530) +|..|+. +++++|..+.+||+|+|+|+.++... .+..++++||++||+++||++.++||||+||+|++.+. T Consensus 1 ~~~~~~~-~~~~r~~~l~~yy~g~~~~~~~~~~~--------~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~ 71 (440) T protein:vir:95 1 MLAAFLG-SQKQRLAILASYAQGDNFSILSGHRR--------LDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGVMEG 71 (440) T ss_pred ChhhHHH-HHHHHHHHHHHHhccCCccccccccc--------ccccCCcceeecchHHHHHHhhhhheeccCceEeeCCC Confidence 6767665 45678999999999999987654332 34567899999999999999999999999999999888 Q ss_pred chHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEcCCC--CceeEEEEEEEEe Q lcl|NC_011308. 100 DDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFDDYG--TLQRIIRFYTEQR 176 (530) Q Consensus 100 ~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~~--~~~~~~~~y~~~~ 176 (530) ++++.++.|++++. |+++.++.+++++++++|+||+++|.+++|+++++.++|.++||+||+.. ++.+++++|... T Consensus 72 ~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~~- 150 (440) T protein:vir:95 72 GSADQLSTIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVDRVVLISPLEMFVIRDLTVEQNIIAAVHLPIYA- 150 (440) T ss_pred ccHHHHHHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEec- Confidence 88888999999874 68999999999999999999999999999999999999999999999864 578888877532 Q ss_pred ecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccC Q lcl|NC_011308. 177 YSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYK 256 (530) Q Consensus 177 ~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 256 (530) ...++++||++.+++|.....+... .......+|+| T Consensus 151 ---------~~~~~~vyt~~~~~~~~~~~~~~~~-----------------------------------~~~~~~~~~~~ 186 (440) T protein:vir:95 151 ---------DKVNMTVYTKDKVITYKPYSNNSVR-----------------------------------LVVDDVKKHSY 186 (440) T ss_pred ---------CceEEEEEeCCeEEEEEEecCCccc-----------------------------------eeecceeeccC Confidence 2357899999999999865543211 12233567999 Q ss_pred CccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCC--chhhHHHHHhhCcceec------ Q lcl|NC_011308. 257 SRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNS--PVDEIKKNIQSKKIIQT------ 328 (530) Q Consensus 257 ~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~--~~~~~~~~~~~~~~i~~------ 328 (530) |.||||+|+||++|.|+|+.+++|||+||.++|++++.+++|++|+++++|.... ..++....++..+++.+ T Consensus 187 g~vPvv~~~n~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~~~~~~ 266 (440) T protein:vir:95 187 NDVPVVEWWNNRFRMGDYESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFLKTGIST 266 (440) T ss_pred ceeeEEEeeCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccchhhhhhccceeccccccc Confidence 9999999999999999999999999999999999999999999999999996432 23444455665555543 Q ss_pred ---CCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 329 ---KGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTLLAMKAQKTEIALRKTLRWT 404 (530) Q Consensus 329 ---~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~ 404 (530) +++++|+||+|+++.+++++++++|.++||.+|++|++++.++ ||+||+||++++++|.+||.++++.|+++|+++ T Consensus 267 ~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~ 346 (440) T protein:vir:95 267 TGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRR 346 (440) T ss_pred ccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4568899999999999999999999999999999999999887 789999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 405 ADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYED 484 (530) Q Consensus 405 ~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~ 484 (530) +++|+++++.....+++..+++++|++++|.|+++.|++++++ +|++|+||+++++|+++++ +|++++++|+.+... T Consensus 347 ~~li~~~~~~~~~~~~~~~~v~i~f~~~~p~~~~~~ad~~~kl--~g~iS~et~~~~l~~~d~~-~E~~ri~~E~~~~~~ 423 (440) T protein:vir:95 347 YELISNIHKAINGPVIEANKLTFTFHPNIPQDVWTEIKAYIEA--GGEISQETLMENASFTDYK-TEHSRILKQGGSSDL 423 (440) T ss_pred HHHHHHHHhhcCCcccccccceEEeCCCCCCCHHHHHHHHHHH--hccCcHHHHHHhCCCCCcH-HHHHHHHHHHHHhhh Confidence 9999999999888899999999999999999999999987665 6889999999999999764 577777776554332 Q ss_pred HHHhhhccccccCCcccc Q lcl|NC_011308. 485 VVKALEDQEVEELEPTVT 502 (530) Q Consensus 485 ~~~~~~~~~~~~~~~~~~ 502 (530) ..... .+..++.+++.+ T Consensus 424 ~~~~~-~~~~~~~~~~~e 440 (440) T protein:vir:95 424 EIGQI-VGDADVGQADTE 440 (440) T ss_pred hHHhh-ccCCCCCCcCCC Confidence 22111 111111111111 No 40 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=100.00 E-value=1e-91 Score=519.37 Aligned_cols=449 Identities=15% Similarity=0.136 Sum_probs=359.9 Q ss_pred CCccccc-------CCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchh-hhcccccccccccccccccCCcceee Q lcl|NC_011308. 1 MTNTLLT-------TAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDI-ENTRIMWMNDHGDIVEDDNASNIKIS 72 (530) Q Consensus 1 ~~~~~~~-------~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I-~~r~~~~~~~~~~~~~~~~~~n~ki~ 72 (530) +.+.+.. ..-+...+.|+++|++|.. ++++++..+.+||.|+|.+ +.+.... .+...++++|++ T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~-~~~~~~~~~~~yY~g~~~~i~~~~~~~-------~~~~~~~~~ki~ 83 (481) T protein:vir:10 12 KFSPLANDDFVVSDLAELLKEENLRNFISRHQT-EQVPRLEMLESYYLNRNTDILAGERRL-------QKYGDKADHRAV 83 (481) T ss_pred hcccccCceeeeecchhhcCHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCcccccCcccc-------ccccccccceee Confidence 1111111 1122334567888888764 4678899999999999854 4332221 234567899999 Q ss_pred cCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEec Q lcl|NC_011308. 73 HGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVD 151 (530) Q Consensus 73 ~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~ 151 (530) +||++.||++.++||+|+||+|++.+ +...+.|+++++ |+++..+.+++++++++|.||+++|.+++|+++++++| T Consensus 84 ~n~~~~ivd~~~~~l~g~~~~~~~~d---~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~~i~~~~ 160 (481) T protein:vir:10 84 HNYAKYVSRFIVGYLTGNPITITHQD---NQTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFEDRDTFKVLD 160 (481) T ss_pred cchHHHHHHHHHhhhccCCceEecCC---hhHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCeEEEEEEc Confidence 99999999999999999999998754 445667778875 67999999999999999999999999999999999999 Q ss_pred ccceEEEEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeee Q lcl|NC_011308. 152 ALQLLPVFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAV 229 (530) Q Consensus 152 p~~~~~v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (530) |.++||+||+. .++.+++|+|.... .....+.++++||++.+++|...+++.. T Consensus 161 p~~~~~v~d~~~~~~~~~~i~~~~~~~-----~~~~~~~~~~~y~~~~i~~~~~~~~~~~-------------------- 215 (481) T protein:vir:10 161 PKSTFVVYDQTLDKKVVAGVRYFEKQD-----KDKVPVQHVEVYTTDKIYYIEIKGGTYH-------------------- 215 (481) T ss_pred ccceEEEEcCCCCCceEEEEEEEEEee-----CCCceEEEEEEEecCeEEEEEecCCcee-------------------- Confidence 99999999975 46888888886432 2345678999999999999976554311 Q ss_pred ecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC Q lcl|NC_011308. 230 ADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT 309 (530) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~ 309 (530) .....+|+||.||||+|+||..|.|+|+++++|||+||.++|++++.+++|++|+++++|.. T Consensus 216 ------------------~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~ 277 (481) T protein:vir:10 216 ------------------RVEEVEHYYNDVPIIEYLNDQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNV 277 (481) T ss_pred ------------------ecccccccCCceeEEEeecCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCc Confidence 12345799999999999999999999999999999999999999999999999999999975 Q ss_pred CCchhhHHHHHhhCcceec---------CCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHH Q lcl|NC_011308. 310 NSPVDEIKKNIQSKKIIQT---------KGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVI 379 (530) Q Consensus 310 ~~~~~~~~~~~~~~~~i~~---------~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAi 379 (530) ..+.+ .....+..+++.+ +++++++|++|+++.++++.++++|++.||.+|++|++++.++ ||+||+|| T Consensus 278 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al 356 (481) T protein:vir:10 278 DLDSE-DAKAFRDANMIHLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESM 356 (481) T ss_pred CCCcc-chhhhhhccceeccccccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHH Confidence 54332 2334444555443 3457899999999999999999999999999999999988887 78999999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHH Q lcl|NC_011308. 380 KSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLL 459 (530) Q Consensus 380 k~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l 459 (530) ++++++|.+||.++++.|+.+|++++++++++++..+..+++..+++++|++++|.|+++.|++++.+ +|++|+||++ T Consensus 357 ~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl--~g~is~et~~ 434 (481) T protein:vir:10 357 KYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQHNYAELTITFTPNLPKSMMESINAFNAL--SGGVSESTRL 434 (481) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHH Confidence 99999999999999999999999999999999999988889999999999999999999999987655 5889999999 Q ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCc Q lcl|NC_011308. 460 AIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDP 521 (530) Q Consensus 460 ~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (530) +++|+++|+++|++|+++|+.+..........++ ..+..++++ +.++ T Consensus 435 ~~l~~i~d~~~E~~ri~~E~~~~~~~~~~~~~~~---~~~~~~~~d------------d~~g 481 (481) T protein:vir:10 435 SLLDFIDNPKEELEKMQEEEAQREKQADKRGYGE---AFENHLNVD------------DSNG 481 (481) T ss_pred HhCCCCCCHHHHHHHHHHHHHHHHhhhhhccCCc---cCCCCCCCC------------CCCC Confidence 9999999999999999888766544333222211 111111111 1111 No 41 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=100.00 E-value=3.2e-90 Score=511.20 Aligned_cols=449 Identities=14% Similarity=0.116 Sum_probs=350.6 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) =+.++..+...+ .+.+.++|++|+. ++..+++.+++||.|+|+|++++.+ .+..++++||++||+++|| T Consensus 5 ~~~~~~~~~~~~-~~~~~~~i~~~~~-~~~~r~~~~~~yy~g~~~i~~~~~~---------~~~~~~~~ki~~n~~~~iv 73 (489) T protein:vir:99 5 DFEAIDYESKLW-IDQLKNYISRFKA-EQLERLKELKRYYLGDNNIKYRPAK---------TDKYAADNRIASDFAKYIT 73 (489) T ss_pred ceeeeCCCCCCC-HHHHHHHHHHHHH-HHHHHHHHHHHHhcccCcccccccc---------ccccCCcceeecchHHHHH Confidence 111222222233 3456778988864 4567899999999999999987643 2345788999999999999 Q ss_pred hhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEe----cCCCceEEEEecccce Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFART----TSEDKLTFQTVDALQL 155 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~----d~~g~~~~~~~~p~~~ 155 (530) ++.++||||+||+|++.+ +...+.|+++++ |+++..+.++++.++++|+||+++|. |++|++++.+++|.++ T Consensus 74 ~~~~~~l~g~~~~~~~~d---~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~~~i~~~~p~~~ 150 (489) T protein:vir:99 74 VFEQGYMLGVPVEYKNEN---KDLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTEVKLYQLPAEQT 150 (489) T ss_pred HHHhhhhccCCceeecCC---hhHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcceEEEEEcccce Confidence 999999999999997654 456677888775 68889999999999999999999996 5788999999999999 Q ss_pred EEEEcCCC--CceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccc Q lcl|NC_011308. 156 LPVFDDYG--TLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGV 233 (530) Q Consensus 156 ~~v~d~~~--~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (530) ||+||+.. ++.+++|+|.... .....+.++++||++.+++|+....+... T Consensus 151 ~~v~dd~~~~~~~~~i~~~~~~~-----~~~~~~~~~~~y~~~~i~~~~~~~~~~~~----------------------- 202 (489) T protein:vir:99 151 FVIYDDTYQRNSLMAVHFYDIDY-----GSGKRKQIIKAYTSDTIYTYEDYNLETKG----------------------- 202 (489) T ss_pred EEEEcCCCCCceEEEEEEEEEec-----CCCceEEEEEEEeCCcEEEEEecCCCccc----------------------- Confidence 99998754 5888899886532 22355788999999999998765432111 Q ss_pred ccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCch Q lcl|NC_011308. 234 DEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPV 313 (530) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~ 313 (530) .......+|+||.||||+|+||.+|+|+|+.+++|||+||.++|++++.+++|++|+|+++|...... T Consensus 203 ------------~~~~~~~~~~~g~vPvv~~~n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~ 270 (489) T protein:vir:99 203 ------------MRLKDYEGHFFKGVPVNEYANNEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGA 270 (489) T ss_pred ------------ceecccccccCCceeEEEeecCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccc Confidence 11233567999999999999999999999999999999999999999999999999999999754322 Q ss_pred h--hHHH--------------HHhhCcceecCC-------CCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccc Q lcl|NC_011308. 314 D--EIKK--------------NIQSKKIIQTKG-------EGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVG 370 (530) Q Consensus 314 ~--~~~~--------------~~~~~~~i~~~~-------~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~ 370 (530) + ++.. ..+..+++.+++ +.+|+||+|+++.+++++++++|.+.||.+|++|++++.+ T Consensus 271 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~ 350 (489) T protein:vir:99 271 DENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMK 350 (489) T ss_pred cchhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCccccccc Confidence 1 1111 122334444433 3478999999999999999999999999999999999888 Q ss_pred c-cCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---ccccceeeEEeCCCCCCCHHHHHHHHHH Q lcl|NC_011308. 371 D-GNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLG---DYSSTDIKFDIEPYILANELDLAMIDKT 446 (530) Q Consensus 371 ~-gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~---~~d~~~i~i~f~~~~P~n~~e~a~~~~~ 446 (530) + ||+||+||+++++++.+||.++++.|+++|++++++|+++++..+.. .....+++++|++++|.|.++.|+++.. T Consensus 351 ~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~k 430 (489) T protein:vir:99 351 FSGVQSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQN 430 (489) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHH Confidence 7 79999999999999999999999999999999999999999876543 3345679999999999999999998665 Q ss_pred HHhcCCCcHHHHHHhCCCCC--CHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCc Q lcl|NC_011308. 447 EAETNQIQINNLLAIAPRIG--DEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEP 516 (530) Q Consensus 447 ~~~~g~iS~et~l~~~~~vd--d~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (530) + +|++|+||+++++|+++ |+++|++|+++|+.+..... +... .++.. ...++....| T Consensus 431 l--~giis~et~~~~l~~v~~~d~~~E~~ri~~E~~~~~~~~---~~~~----~~~~~----~~~~~~~~~p 489 (489) T protein:vir:99 431 L--YGIVSDQTIFEILNTVTGVDAEAELKRLKEEADKKQSLP---EPRL----VGDAS----GQEEPTAEKP 489 (489) T ss_pred H--hccCCHHHHHHhcCCCCchhHHHHHHHHHHHHHHHhccc---cccc----cCCCC----CCcCCCCCCC Confidence 5 58999999999999997 67888888877665432211 1111 11111 1111122222 No 42 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=100.00 E-value=1.9e-76 Score=435.75 Aligned_cols=453 Identities=11% Similarity=0.037 Sum_probs=338.2 Q ss_pred cccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhcc Q lcl|NC_011308. 10 PDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLA 89 (530) Q Consensus 10 ~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G 89 (530) --|..++|++++.+|. .+..++.++.+||+|+|+|....... .....++|+++||+++||++.++||++ T Consensus 1 ~~t~~d~i~~L~~~~~--~~~~r~~~~~~Yy~G~~~i~~~~~~~---------~~~~~~~~~~~n~~~~ivd~~~~~l~~ 69 (480) T protein:vir:78 1 MTTYHEHVERLQGLLA--RDLPNLLEAEAYRNGTRRLKTIGIGA---------PPELAYLDVQPGWVATYLRTLSDRLDI 69 (480) T ss_pred CCCHHHHHHHHHHHHH--HHHHHHHHHHHHHhccccchhccccc---------chhhhhhhhhcchHHHHHHHHHhhhcc Confidence 2234467888887764 35688999999999999874432211 112236789999999999999999999 Q ss_pred cceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEe------cCCCceEEEEecccceEEEEcCC Q lcl|NC_011308. 90 NGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFART------TSEDKLTFQTVDALQLLPVFDDY 162 (530) Q Consensus 90 ~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~------d~~g~~~~~~~~p~~~~~v~d~~ 162 (530) ++++.. +++...+.|+++++ |+++.++.+++++++++|+||+++|. |++|.+++.++||.++||+||++ T Consensus 70 ~g~~~~----~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~~p~~~~~i~D~~ 145 (480) T protein:vir:78 70 EGFRIS----EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPR 145 (480) T ss_pred CceecC----CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEEcccceEEEEcCC Confidence 887643 23344566777774 78999999999999999999999985 56889999999999999999976 Q ss_pred C--CceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecc Q lcl|NC_011308. 163 G--TLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDE 240 (530) Q Consensus 163 ~--~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (530) . .+.+++++|.... ....+.++++||++.+++|+..++...... T Consensus 146 ~~~~~~~~i~~~~~~d------~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~---------------------------- 191 (480) T protein:vir:78 146 NTRRVTRAVRLYTTRD------DVAVPDRATLYLPDETVPLRRNGGLNDQWV---------------------------- 191 (480) T ss_pred CccceEEEEEEEEeec------CCcceEEEEEEeCCeEEEEEecCCCccccc---------------------------- Confidence 4 5788888875432 233468899999999999987654322110 Q ss_pred cccccccccccccccCCccceEEeeCCc-----CCCCcHHH-HHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchh Q lcl|NC_011308. 241 GVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDIKK-VKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVD 314 (530) Q Consensus 241 ~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~e~-v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~ 314 (530) ......+|+||.||||+|+||. .|.|+++. |++|||+||.++|++++.+++|++|++++.|...+... T Consensus 192 ------~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~ 265 (480) T protein:vir:78 192 ------VDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELT 265 (480) T ss_pred ------ccccccccCCCCcceEEeecccccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhCCCccccc Confidence 1123457999999999999985 48899975 99999999999999999999999999999997653211 Q ss_pred ----hHHHHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccC-----CcHHHHHHHHhh Q lcl|NC_011308. 315 ----EIKKNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGN-----ATNVVIKSRYTL 385 (530) Q Consensus 315 ----~~~~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn-----~SGvAik~~~~~ 385 (530) ........++++ ..+++++++.+++.. ..+.++++|+..|+.++.+|++.+..||+ +||+||++++++ T Consensus 266 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~--~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~ 342 (480) T protein:vir:78 266 NDGENTTLDIYYGRIL-TLASEAAKISEFKAA--ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSR 342 (480) T ss_pred cccccchhhhhhhhhc-cCCCCCceEEecCcc--CHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHH Confidence 111111223333 345678998876653 46789999999999999999998888753 699999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcC--CCcHHHHHHhCC Q lcl|NC_011308. 386 LAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETN--QIQINNLLAIAP 463 (530) Q Consensus 386 l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g--~iS~et~l~~~~ 463 (530) |+.||.++++.|+++|++++++++++++.. ...+...++++|+++.|.|+.+.|+.+++++++| ++|++|+++++| T Consensus 343 l~~k~~~~~~~f~~~l~~~~rl~~~~~~~~--~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg 420 (480) T protein:vir:78 343 IVKMAERKGRIFGGAWERAMRIAMQIMGRE--VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLG 420 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCC--ccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCC Confidence 999999999999999999999998876532 3346678999999999999999999888777655 789999999999 Q ss_pred CCCCHHHHHHHHHHHHHHHH-HHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccc Q lcl|NC_011308. 464 RIGDEETLKAICDTLDLDYE-DVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEE 526 (530) Q Consensus 464 ~vdd~~~e~~~~e~e~~e~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (530) |++|+.++++++++++.+.. ..+....++..+. .+ ++.. .+..+..+.+.+..++|..+ T Consensus 421 ~~~d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~~-~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 421 YTATQREQMRDWDKQETEDMIDTLYSTTKAQADA-TP--KPTV-TETKTETQTSPSGFNRTKTR 480 (480) T ss_pred CCHhHHHHHHHHHHHHHHHHHHHhhccccCCCcc-cc--CCCC-CCCCCccCCCcccCCCcCCC Confidence 99998888765555443321 2222212222111 11 1111 22334455556777777777 No 43 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=100.00 E-value=3e-76 Score=434.62 Aligned_cols=454 Identities=11% Similarity=0.025 Sum_probs=335.1 Q ss_pred CCcccccCCc-ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhH Q lcl|NC_011308. 1 MTNTLLTTAP-DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAEL 79 (530) Q Consensus 1 ~~~~~~~~~~-~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~I 79 (530) ||..+..... +....++..+|.+|. .+..++.++.+||+|+|+|....... .....++|+++||+++| T Consensus 1 ~~~~i~~~~~~~~~~~~~~~L~~~~~--~~~~r~~~~~~YY~G~~~i~~~~~~~---------~~~~~~~~~~~n~~~~i 69 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDEMVSAFE--DQNQNLRSNTSYYEAERRPEAIGVTV---------PVQMQSLLAHVGYPRLY 69 (485) T ss_pred CCCCCCCCCcccchHHHHHHHHHHHH--HHHHHHHHHHHHHhccCchhhcCccc---------chhhhhhhhccchHHHH Confidence 6666655433 334556677887774 35688999999999999885432211 12235778999999999 Q ss_pred HhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCC--------CceEEEEe Q lcl|NC_011308. 80 VDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSE--------DKLTFQTV 150 (530) Q Consensus 80 vd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~--------g~~~~~~~ 150 (530) |++.++||++++++... ++...+.+++++. |+++..+.+++++++++|+||+++|.+++ |.++++.+ T Consensus 70 vd~~~~~l~~~g~~~~~----~~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~ 145 (485) T protein:vir:24 70 VDSIAERQAVEGFRLGD----ADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVE 145 (485) T ss_pred HHHHhhhhccCceecCC----CchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEe Confidence 99999999999876532 3334455667764 78899999999999999999999999875 45689999 Q ss_pred cccceEEEEcCC-CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeee Q lcl|NC_011308. 151 DALQLLPVFDDY-GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAV 229 (530) Q Consensus 151 ~p~~~~~v~d~~-~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (530) ||.++||+||++ .++.+++++|+.. ....+.++++||++.+++|...++... T Consensus 146 ~p~~~~~i~D~~~~~~~~~~~~~~~~-------~~~~~~~~~~y~~~~~~~~~~~~~~~~-------------------- 198 (485) T protein:vir:24 146 PPTRMYAEIDPRIGRPAKAIRVAYDA-------EGNEIQAATLYTPNETFGWFRAEGEWV-------------------- 198 (485) T ss_pred ccceeEEEeeCCcCceeEEEEEEEee-------cCCeEEEEEEEcCCcEEEEEecCCceE-------------------- Confidence 999999999975 4566666665421 234578899999999998876543211 Q ss_pred ecccccceecccccccccccccccccCCccceEEeeCCc-----CCCCcHH-HHHHHHHHHHHHHHHHHHHHHHhcccee Q lcl|NC_011308. 230 ADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDIK-KVKSIIDDYDLMNCFLSNNLQDMAEAIY 303 (530) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~e-~v~~liDa~~~~~S~~~n~~~~~~~~~l 303 (530) .....+|+||.||||+|+||. .|.|+|+ .|++|||+||.++|++++.+++|++|++ T Consensus 199 ------------------~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~ 260 (485) T protein:vir:24 199 ------------------EWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQR 260 (485) T ss_pred ------------------eecccccCCCcccEEEeccCcccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhh Confidence 112346999999999999985 5788887 5999999999999999999999999999 Q ss_pred eeecCCCCchhh----H--HHHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccccc-C--- Q lcl|NC_011308. 304 VVRGGTNSPVDE----I--KKNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDG-N--- 373 (530) Q Consensus 304 vl~g~~~~~~~~----~--~~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~g-n--- 373 (530) ++.|.+.+.... . ..... .+.+....++++++.++ +.+..+.++++|+..||.+|.+|++++..|| + T Consensus 261 ~i~G~~~~~~~~~~~~~~~~~~~~-~~~i~~~~~~~~~~~q~--~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n 337 (485) T protein:vir:24 261 LIFGIKPEEIGVDPETGQTLFDAY-LARILAFEDAEGKIQQF--SAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADN 337 (485) T ss_pred hhccCCccccccccccccchhhhc-ccceeccCCCCceEEee--cccchHHHHHHHHHHHHHHhcccCCCHHHhccccCc Confidence 999976543210 0 11112 22333445677887654 4567889999999999999999999988885 3 Q ss_pred -CcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcC- Q lcl|NC_011308. 374 -ATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETN- 451 (530) Q Consensus 374 -~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g- 451 (530) +||+||++++.+|++||.++++.|+++|++++++++.+.+..+ ...+...|+++|+++.|.|+.+.|+.+.+++++| T Consensus 338 ~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~-~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~ 416 (485) T protein:vir:24 338 PASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKGGD-VPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQ 416 (485) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-CccccceeeEEecCCCCCCHHHHHHHHHHHHhccc Confidence 6999999999999999999999999999999999988765433 4567789999999999999999999888877655 Q ss_pred -CCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHH-HHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcc Q lcl|NC_011308. 452 -QIQINNLLAIAPRIGDEETLKAICDTLDLDYE-DVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPV 522 (530) Q Consensus 452 -~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (530) ++|+||+++++||++|+.++++++++++.... ..+..+..+.... .+.+.. ...+.+++.+.+.+.. T Consensus 417 ~~~s~et~~~~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~---~~~~~~-~e~~~~~~~~~~~~~a 485 (485) T protein:vir:24 417 GVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLLGTMVDADPTV---PGSPNP-TPAPKPQPAIEGGDSA 485 (485) T ss_pred ccCCHHHHHhhCCCCHhHHHHHHHHHHHHhhhhhhHHHhhcccCCCC---CCCCCC-CCCCCCccCCCCCCCC Confidence 89999999999999988878777666554322 2222222221111 111111 1223333444444444 No 44 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=100.00 E-value=2.7e-76 Score=434.91 Aligned_cols=452 Identities=11% Similarity=0.033 Sum_probs=343.9 Q ss_pred cccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhh Q lcl|NC_011308. 5 LLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKT 84 (530) Q Consensus 5 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~ 84 (530) +..+..++..+++++++.+|.. +..++.++.+||+|+|+|.+...... ....++|+++||+++||++.+ T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~--~~~r~~~~~~Yy~g~~~i~~~~~~~~---------~~~~~~~~~~n~~~~ivd~~a 69 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFEN--KQNELKSSKAYYDAERRPDAIGLAVP---------LDMRKYLAHVGYPRTYVDAIA 69 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHH--HHHHHHHHHHHHhcccchhhcCcccc---------hhhhhhhhhcchHHHHHHHHH Confidence 5555556677899999988764 35789999999999999865433221 123467999999999999988 Q ss_pred ------hhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEe--------cCCCceEEEE Q lcl|NC_011308. 85 ------QYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFART--------TSEDKLTFQT 149 (530) Q Consensus 85 ------~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~--------d~~g~~~~~~ 149 (530) ||.+|.|+.+.....++++..+.|+++++ |+++....+++++++++|+||+++|. ++++.+++.+ T Consensus 70 ~~l~~~Gf~~~~~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~~ 149 (488) T protein:vir:23 70 ERQELEGFRIPSANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPLIRV 149 (488) T ss_pred HhhhccceeccCCcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcceEEE Confidence 57777777777767777778888888885 78999999999999999999999886 4566788999 Q ss_pred ecccceEEEEcCC-CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeee Q lcl|NC_011308. 150 VDALQLLPVFDDY-GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLA 228 (530) Q Consensus 150 ~~p~~~~~v~d~~-~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (530) ++|.++||+||+. +.+.+++++|+. .....+.++++|+++.+++|...+++.. T Consensus 150 ~~p~~~~~~~d~~~~~~~~~~~~~~~-------~~~~~~~~~~~y~~~~~~~~~~~~~~~~------------------- 203 (488) T protein:vir:23 150 EPPTALYAEVDPRTRKVLYAIRAIYG-------ADGNEIVSATLYLPDTTMTWLRAEGEWE------------------- 203 (488) T ss_pred eccceeEEEEecCCCceEEEEEEEEe-------cCCCcEEEEEEEecCcEEEEEecCCceE------------------- Confidence 9999999999875 456666666542 1234578899999999999876544321 Q ss_pred eecccccceecccccccccccccccccCCccceEEeeCCc-----CCCCcHH-HHHHHHHHHHHHHHHHHHHHHHhccce Q lcl|NC_011308. 229 VADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDIK-KVKSIIDDYDLMNCFLSNNLQDMAEAI 302 (530) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~e-~v~~liDa~~~~~S~~~n~~~~~~~~~ 302 (530) .....+|+||+||||+|+||. +|.|+++ .|++|||+||.++|++++.+++|++|+ T Consensus 204 -------------------~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~ 264 (488) T protein:vir:23 204 -------------------APTSTPHGLEMVPVIPISNRTRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQ 264 (488) T ss_pred -------------------eccccccCCCCcceEEeccccccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHH Confidence 112457999999999999986 5789996 589999999999999999999999999 Q ss_pred eeeecCCCCchhh------HHHHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccC--- Q lcl|NC_011308. 303 YVVRGGTNSPVDE------IKKNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGN--- 373 (530) Q Consensus 303 lvl~g~~~~~~~~------~~~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn--- 373 (530) ++++|...++... -......+++..+++|+++++.+++ ....+.++++|+..|+.++.+|++++..||. T Consensus 265 ~~i~G~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~q~~--~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~ 342 (488) T protein:vir:23 265 RLIFGAKPEELGINAETGQRMFDAYMARILAFEGGEGAHAEQFS--AAELRNFVDALDALDRKAASYSGLPPQYLSSSSD 342 (488) T ss_pred HHHhCCCcccccccccccchhhhhhhhhhccCCCCCCceeEecC--CCChHHHHHHHHHHHHHHhcccCCCHHHhccccC Confidence 9999976543211 1112233456666777888887655 4568899999999999999999988877742 Q ss_pred --CcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcC Q lcl|NC_011308. 374 --ATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETN 451 (530) Q Consensus 374 --~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g 451 (530) +||+||++++++|++||.++++.|+++|++++++++++++... ...+...+.++|+++.|.|..+.|+++.+++++| T Consensus 343 n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~-~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g 421 (488) T protein:vir:23 343 NPASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMVKGGD-IPTEYYRMETVWRDPSTPTYAAKADAAAKLFANG 421 (488) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-cchhhccceEEecCCCCCCHHHHHHHHHHHHhcc Confidence 5999999999999999999999999999999999988766433 2346678999999999999999999988888765 Q ss_pred --CCcHHHHHHhCCCCCCHHHHHHHHHHHHH-HHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCC Q lcl|NC_011308. 452 --QIQINNLLAIAPRIGDEETLKAICDTLDL-DYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNI 519 (530) Q Consensus 452 --~iS~et~l~~~~~vdd~~~e~~~~e~e~~-e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (530) ++|+||+++++|+++|+.++++++++++. +....+.++.....+ ++++.+.+..+.+.++|... T Consensus 422 ~~~~s~et~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~e~~~a 488 (488) T protein:vir:23 422 AGLIPRERGWVDMGYTIVEREQMRQWLEQDQKQGLGLIGSLYGASTP----EGKPGEAPVGEPPAPEPDAA 488 (488) T ss_pred cccCCHHHHHHhCCCCchHHHHHHHHHHHHHHHHHHHHHHHhccCCC----cccCCCCCCCCCCCCCCCCC Confidence 79999999999999999888777654432 222333333222211 11111222233444444333 No 45 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=100.00 E-value=3.3e-76 Score=434.42 Aligned_cols=422 Identities=10% Similarity=0.003 Sum_probs=326.7 Q ss_pred ccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhh Q lcl|NC_011308. 6 LTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQ 85 (530) Q Consensus 6 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~ 85 (530) +++ +-.++|+.++.++. .+..+++.+.+||+|+|+|....... ....+++|+++||+++||+..++ T Consensus 1 ~~~---~~~~~i~~l~~~~~--~~~~r~~~l~~Yy~G~~~i~~~~~~~---------~~~~~~~k~~~n~~~~ivd~~~~ 66 (441) T protein:vir:80 1 MNS---DELALIEGMYDRIQ--RLSSWHCCIEGYYEGSNRVRDLGVAI---------PPELQRVQTVVSWPGIAVDALEE 66 (441) T ss_pred CCc---cHHHHHHHHHHHHH--HHHHHHHHHHHHHhcCCcchhcCccc---------chhhhhhhhhcchHHHHHHHHHh Confidence 222 23356788887764 34578999999999999874432221 12235789999999999999999 Q ss_pred hhcccceeeecCCcchHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEcCCC- Q lcl|NC_011308. 86 YLLANGIDVKPTDHDDQKLCYLIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFDDYG- 163 (530) Q Consensus 86 yl~G~pv~~~~~~~~de~~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~~- 163 (530) |+.++++ ...+ ++ .|++++ .|++...+.+++++++++|+||+++|.|++|+++++++||.++||+||+.. T Consensus 67 ~l~~~g~--~~~d--~~----~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~ 138 (441) T protein:vir:80 67 RLDWLGW--TNGD--GY----GLDGVYAANRLATASCDVHLDALIFGLSFVAIIPHGDGTVSVRPQSPKNCTGKFSADGS 138 (441) T ss_pred hhccccc--cCCC--hH----HHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCceEEEEEccceEEEEEeCCCC Confidence 9976654 3332 22 245555 378999999999999999999999999999999999999999999999754 Q ss_pred CceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccc Q lcl|NC_011308. 164 TLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVE 243 (530) Q Consensus 164 ~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (530) ...+++++|+.. .....++++|+++.+++|...+.+.. T Consensus 139 ~~~~~~~~~~~~--------~~~~~~~~vy~~~~~~~~~~~~~~~~---------------------------------- 176 (441) T protein:vir:80 139 RLDAGLVVQQTC--------DPEVVEAELLLPDVIVQVERRGSREW---------------------------------- 176 (441) T ss_pred ceeEEEEEEEEe--------cCceEEEEEEecCeEEEEEEcCCcce---------------------------------- Confidence 456666666432 23356889999999998866554321 Q ss_pred ccccccccccccCCccceEEeeCCc-----CCCCcHH-HHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHH Q lcl|NC_011308. 244 EHEGRQVLGRSYKSRFPFDILYNNK-----LGISDIK-KVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIK 317 (530) Q Consensus 244 ~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~e-~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~ 317 (530) ......+|+||+||||+|.|+. +|.|+|. .|++|||+||.++|++++.+++|++|+++++|...++..... T Consensus 177 ---~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~G~~~~~~~~~~ 253 (441) T protein:vir:80 177 ---VEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVTGVSADEFSQPG 253 (441) T ss_pred ---eeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeeecCCccccccch Confidence 1223467999999999999986 4788885 599999999999999999999999999999998776655444 Q ss_pred HHHhhCcceecCCCCceeEE-EecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccC-----CcHHHHHHHHhhHHHHHH Q lcl|NC_011308. 318 KNIQSKKIIQTKGEGGLDIQ-TVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGN-----ATNVVIKSRYTLLAMKAQ 391 (530) Q Consensus 318 ~~~~~~~~i~~~~~~~~~~l-t~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn-----~SGvAik~~~~~l~~ka~ 391 (530) .....++++.++.+++.+.+ .++.+.+..+.++++|+.+|+.++.+|++++..||. +||+||++++.+|+.||. T Consensus 254 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~ 333 (441) T protein:vir:80 254 WVLSMASVWAVDKDDDGDTPNVGSFPVNSPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAE 333 (441) T ss_pred hhhcccccccCCCCCCCCcceeEecCccchHHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHH Confidence 55666778877765443222 244556678899999999999999999988777642 499999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCC--CcHHHHHHhCCCCCCHH Q lcl|NC_011308. 392 KTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQ--IQINNLLAIAPRIGDEE 469 (530) Q Consensus 392 ~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~--iS~et~l~~~~~vdd~~ 469 (530) ++++.|+++|++++++++++++.......+...++++|++++|.|.++.|+++.+++++|+ +|++|+++++|++++ T Consensus 334 ~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~-- 411 (441) T protein:vir:80 334 RRQTSFGQGWLSVGFLAAKALDSRVDEADFFGDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLEMLGLDDV-- 411 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcccccceeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhCCCCHH-- Confidence 9999999999999999999888766555567889999999999999999999999999886 588999999998765 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccc---cccC Q lcl|NC_011308. 470 TLKAICDTLDLDYEDVVKALEDQE---VEEL 497 (530) Q Consensus 470 ~e~~~~e~e~~e~~~~~~~~~~~~---~~~~ 497 (530) +++++++++++.++.+..+.... .+.+ T Consensus 412 -e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~ 441 (441) T protein:vir:80 412 -QVEAVMRHRAESSDPLAVLAGAISRQTNEV 441 (441) T ss_pred -HHHHHHHHHHHHHHHHHHHhhhhhcccccC Confidence 45556666655555544443321 1111 No 46 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=100.00 E-value=1.6e-75 Score=430.59 Aligned_cols=455 Identities=10% Similarity=0.022 Sum_probs=337.3 Q ss_pred CCcccc-cCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhH Q lcl|NC_011308. 1 MTNTLL-TTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAEL 79 (530) Q Consensus 1 ~~~~~~-~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~I 79 (530) ||..+. .+++++..+++.+++.+|.. +..++.++.+||+|+|+|.++..... ....+.++++||+++| T Consensus 1 ~~~~~~~~~e~~~~~~~~~~l~~~~~~--~~~r~~~l~~YY~G~~~i~~~~~~~~---------~~~~~~~~v~n~~~~i 69 (486) T protein:vir:42 1 MTAPLPGMEEIEDPAVVREEMISAFED--ASKDLASNTSYYDAERRPEAIGVTVP---------REMQQLLAHVGYPRLY 69 (486) T ss_pred CCCCCCCCCCcccHHHHHHHHHHHHHH--HHHHHHHHHHHhcccCcchhcccccc---------hhHhhhhhccchHHHH Confidence 776654 45667778899999988754 45889999999999998865432211 1123557889999999 Q ss_pred HhhhhhhhcccceeeecCCcchHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecC--------CCceEEEEe Q lcl|NC_011308. 80 VDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTS--------EDKLTFQTV 150 (530) Q Consensus 80 vd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~--------~g~~~~~~~ 150 (530) |+..++||.+.+++.. + ++...+.+++++ .|+++..+.+++++++++|+||+++|.++ ++.+++.++ T Consensus 70 Vd~~~~~l~~~g~~~~--~--~~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~ 145 (486) T protein:vir:42 70 VDSVAERQAVEGFRLG--D--ADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVE 145 (486) T ss_pred HHHHHhhhcccceecC--C--CchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEe Confidence 9999999987776543 2 222334456666 47899999999999999999999999876 455789999 Q ss_pred cccceEEEEcCC-CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeee Q lcl|NC_011308. 151 DALQLLPVFDDY-GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAV 229 (530) Q Consensus 151 ~p~~~~~v~d~~-~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (530) ||.++|++||+. .++.+++++|+. .....+.++++||++.+++|...+++.. T Consensus 146 ~p~~~~~i~d~~~~~~~~~~~~~~~-------~~~~~~~~~~~y~~~~~~~~~~~~~~~~-------------------- 198 (486) T protein:vir:42 146 PPTRMHAEIDPRINRVSKAIRVAYD-------KEGNEIQAATLYTPMETIGWFRADGEWA-------------------- 198 (486) T ss_pred cccceEEEEeCCCCCeEEEEEEEEe-------cCCCeEEEEEEEcCCcEEEEEecCCcEE-------------------- Confidence 999999999864 467788877642 1345678899999999999976554321 Q ss_pred ecccccceecccccccccccccccccCCccceEEeeCCc-----CCCCcHH-HHHHHHHHHHHHHHHHHHHHHHhcccee Q lcl|NC_011308. 230 ADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDIK-KVKSIIDDYDLMNCFLSNNLQDMAEAIY 303 (530) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~e-~v~~liDa~~~~~S~~~n~~~~~~~~~l 303 (530) .....+|+||.||||+|+||. .|.|+|+ .|++|||+||.++|++++..++|++|++ T Consensus 199 ------------------~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~ 260 (486) T protein:vir:42 199 ------------------EWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQR 260 (486) T ss_pred ------------------eecceecCCCCceEEEeccccccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHH Confidence 112346999999999999985 4789998 5899999999999999999999999999 Q ss_pred eeecCCCCchhh------HHHHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccccc-C--- Q lcl|NC_011308. 304 VVRGGTNSPVDE------IKKNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDG-N--- 373 (530) Q Consensus 304 vl~g~~~~~~~~------~~~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~g-n--- 373 (530) +++|.+.+.... .......++ +...++++++|.+++ ....+.++++|+..|+.+|.+|++++..|| + T Consensus 261 ~i~G~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~q~~--~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n 337 (486) T protein:vir:42 261 LIFGIKPEEIGVDSETGQTLFDAYLAR-ILAFEDAEGKIQQFS--AAELANFTNALDQIAKQVAAYTGLPPQYLSTAADN 337 (486) T ss_pred HhhcCCccccccccccccchhhhhhch-hcccCCCCceEEeec--ccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCc Confidence 999976533211 111112222 333456778887654 556889999999999999999999988885 3 Q ss_pred -CcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhc-- Q lcl|NC_011308. 374 -ATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAET-- 450 (530) Q Consensus 374 -~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~-- 450 (530) +||+||++++.+|.+||.++++.|+++|++++++++++++... ...+...|+++|+++.|.|..+.|+.+.+++++ T Consensus 338 ~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~-~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~ 416 (486) T protein:vir:42 338 PASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKGGD-VPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQ 416 (486) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ccccceeeeEEecCCCCCCHHHHHHHHHHHHhccc Confidence 5999999999999999999999999999999999988765433 244667899999999999999999998888765 Q ss_pred CCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHH-HHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccc Q lcl|NC_011308. 451 NQIQINNLLAIAPRIGDEETLKAICDTLDLDYED-VVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIE 524 (530) Q Consensus 451 g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (530) |++|+||+++++|+++|+.+|++++++++..... .+..+..+..... +++ .+.+++..+++++-+|-.. T Consensus 417 g~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~---~~~--~~~~~~~~~~~~~~~~~~~ 486 (486) T protein:vir:42 417 GVIPRERARIDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVDADPTVP---GSP--SPTAPPKPQPAIESSGGDA 486 (486) T ss_pred CCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCC---CCC--CCCCCCCCCcccCCCCCCC Confidence 7899999999999999999998888766544322 2222222211111 111 1111222222222222222 No 47 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=100.00 E-value=2.4e-75 Score=429.63 Aligned_cols=453 Identities=11% Similarity=0.028 Sum_probs=336.2 Q ss_pred cccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhcc Q lcl|NC_011308. 10 PDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLA 89 (530) Q Consensus 10 ~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G 89 (530) --|..++|+.++++|. .+..++.++.+||+|+|+|.+..... .....++|+++||+++||++.++||.+ T Consensus 1 ~~t~~~~i~~L~~~~~--~~~~r~~~l~~Yy~G~~~i~~~~~~~---------~~~~~~~~~~~n~~~~ivd~~~~~l~~ 69 (480) T protein:vir:78 1 MTTYHEHVERLQGLLA--RDLPNLLEAEAYRNGTRRLKTIGIGA---------PPELAYLDVQPGWVATYLRTLSDRLDI 69 (480) T ss_pred CCCHHHHHHHHHHHHH--HHHHHHHHHHHHHhcccccccccccc---------chhHhhhhhhcchHHHHHHHHHhhhcc Confidence 2345567888888874 35688999999999999874432211 122346789999999999999999998 Q ss_pred cceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEe------cCCCceEEEEecccceEEEEcCC Q lcl|NC_011308. 90 NGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFART------TSEDKLTFQTVDALQLLPVFDDY 162 (530) Q Consensus 90 ~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~------d~~g~~~~~~~~p~~~~~v~d~~ 162 (530) ++++.. ++++..+.|+++++ |+++.++.+++++++++|+||+++|. |++|.+++.++||.++||+||++ T Consensus 70 ~g~~~~----~d~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~~~~~D~~ 145 (480) T protein:vir:78 70 EGFRIS----EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPR 145 (480) T ss_pred CceecC----CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEEcccceEEEEcCC Confidence 887643 23345566777775 78999999999999999999999996 56889999999999999999975 Q ss_pred --CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecc Q lcl|NC_011308. 163 --GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDE 240 (530) Q Consensus 163 --~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (530) ..+.+++++|.... ....+.++++||++.+++|+..++...... T Consensus 146 ~~~~~~~~i~~~~~~~------~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~---------------------------- 191 (480) T protein:vir:78 146 NTRRVTRAVRLYTTRD------DVAVPDRATLYLPDETVPLRRNGGLNDQWV---------------------------- 191 (480) T ss_pred CccceEEEEEEEEeec------CCCceEEEEEEeCCeEEEEEecCCCccccc---------------------------- Confidence 46888888885322 233568899999999999987654322111 Q ss_pred cccccccccccccccCCccceEEeeCCc-----CCCCcHHH-HHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchh Q lcl|NC_011308. 241 GVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDIKK-VKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVD 314 (530) Q Consensus 241 ~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~e~-v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~ 314 (530) ......+|+||.||||+|+|+. .|.|+|+. |++|||+||.++|++++.+++|++|++++.|...++.. T Consensus 192 ------~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~ 265 (480) T protein:vir:78 192 ------VDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELT 265 (480) T ss_pred ------cccccccCCCCCcceEEeecccccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhhcCCccccc Confidence 1112357999999999999974 57899985 99999999999999999999999999999997654321 Q ss_pred h----HHHHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccC-----CcHHHHHHHHhh Q lcl|NC_011308. 315 E----IKKNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGN-----ATNVVIKSRYTL 385 (530) Q Consensus 315 ~----~~~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn-----~SGvAik~~~~~ 385 (530) . .......+++ ...+++++++.+++. ...+.++++|+..|+.++.+|++++..||+ +||+||++++.+ T Consensus 266 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~--~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~ 342 (480) T protein:vir:78 266 NDGENTTLDIYYGRI-LTLASEAAKISEFKA--AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSR 342 (480) T ss_pred cccccchhhhhhhhh-ccCCCCCceEEecCc--cCHHHHHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHH Confidence 1 0111122223 334567899888664 457888999999999999999988887753 599999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcC--CCcHHHHHHhCC Q lcl|NC_011308. 386 LAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETN--QIQINNLLAIAP 463 (530) Q Consensus 386 l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g--~iS~et~l~~~~ 463 (530) |..||.++++.|+++|++++++|+++++.. ...+...++++|+++.|.|..+.|+.+++++++| ++|++|+++++| T Consensus 343 l~~ka~~~~~~f~~~l~~~~~l~~~~~g~~--~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg 420 (480) T protein:vir:78 343 IVKMAERKGRIFGGAWERAMRIAMQIMGRE--VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLG 420 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCC--ccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCC Confidence 999999999999999999999998876532 2345677899999999999999999888777655 799999999999 Q ss_pred CCCCHHHHHHHHHHHHHHHH-HHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccc Q lcl|NC_011308. 464 RIGDEETLKAICDTLDLDYE-DVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEE 526 (530) Q Consensus 464 ~vdd~~~e~~~~e~e~~e~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (530) |++|+.++++++++++.+.. +.+....+++.+ ...++.. .+..++.+++.+..+++..+ T Consensus 421 ~~~d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~---~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 421 YTATQREQMRDWDKQETEDMIDTLYSTTKAQAD---ATPKPTV-TETKTETQTSPSGFNRTKTR 480 (480) T ss_pred CCHhHHHHHHHHHHHHHHHHHHHhhccccccCC---CCCCCCC-CCCCCccccccCCCCcccCC Confidence 99987777665544433211 112222222211 1111111 12233455555666676666 No 48 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=100.00 E-value=5.1e-74 Score=422.38 Aligned_cols=454 Identities=10% Similarity=0.017 Sum_probs=331.9 Q ss_pred CCccccc-CCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhH Q lcl|NC_011308. 1 MTNTLLT-TAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAEL 79 (530) Q Consensus 1 ~~~~~~~-~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~I 79 (530) ||--+.. ..+++...+++.++.++. .+..++.++.+||.|+|+|........ ....++++++||+++| T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~--~~~~r~~~~~~Yy~G~~~i~~~~~~~~---------~~~~~~~~~~n~~~~i 69 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFE--DSTQNLKTNTSYYEAERRPEAIGVTVP---------IQMQSLLAHVGYPRLY 69 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHH--HHHHHHHHHHHHHhcCCcchhcCCCCC---------hhhhhhhhhcCcHHHH Confidence 5543333 335666678888887764 355789999999999998854332221 1223567889999999 Q ss_pred HhhhhhhhcccceeeecCCcchHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecCC--------CceEEEEe Q lcl|NC_011308. 80 VDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTSE--------DKLTFQTV 150 (530) Q Consensus 80 vd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~--------g~~~~~~~ 150 (530) |+..++||++++++. . +++...+.+++++ .|+++....+++++++++|+||+++|.++. +.++++++ T Consensus 70 vd~~~~~l~~~g~~~--~--~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~ 145 (485) T protein:vir:10 70 VDSIAERQAVEGFRF--G--DADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVE 145 (485) T ss_pred HHHHHhhhcccceec--C--CCchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEE Confidence 999999998887654 2 2333445566666 478999999999999999999999999864 56789999 Q ss_pred cccceEEEEcCCC-CceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeee Q lcl|NC_011308. 151 DALQLLPVFDDYG-TLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAV 229 (530) Q Consensus 151 ~p~~~~~v~d~~~-~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (530) +|.++|++||+.. ++.+++++|.. .....+.++++||++.+++|...+++.. T Consensus 146 ~p~~~~~~~D~~~~~~~~~~~~~~~-------~~~~~~~~~~~y~~~~~~~~~~~~~~~~-------------------- 198 (485) T protein:vir:10 146 PPTRMYAEIDPRIGRVSKAIRVAYD-------AEGNEIQAATLYTPNDIFGWYRVENEWQ-------------------- 198 (485) T ss_pred ccceeEEEEcCCCCceeEEEEEEEe-------eCCCeEEEEEEEeCCeEEEEEEcCCceE-------------------- Confidence 9999999998754 45555665532 1245578899999999999976554321 Q ss_pred ecccccceecccccccccccccccccCCccceEEeeCCc-----CCCCcHH-HHHHHHHHHHHHHHHHHHHHHHhcccee Q lcl|NC_011308. 230 ADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDIK-KVKSIIDDYDLMNCFLSNNLQDMAEAIY 303 (530) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~e-~v~~liDa~~~~~S~~~n~~~~~~~~~l 303 (530) .....+|+||.||||+|+||. .|.|+|+ .|++|||+||.++|++++.+++|++|++ T Consensus 199 ------------------~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~ 260 (485) T protein:vir:10 199 ------------------EWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQR 260 (485) T ss_pred ------------------EeccccCCCCcccEEEeccccccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHH Confidence 112357999999999999985 4788997 5999999999999999999999999999 Q ss_pred eeecCCCCchh------hHHHHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccccc-C--- Q lcl|NC_011308. 304 VVRGGTNSPVD------EIKKNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDG-N--- 373 (530) Q Consensus 304 vl~g~~~~~~~------~~~~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~g-n--- 373 (530) ++.|.+.++.. ........++ +...+++++++.+++ ....+.++++|+..||.++.+|++.+..|| + T Consensus 261 ~i~G~~~~~~~~~~~~~~~~~~~~~~~-i~~~~~~d~k~~q~~--~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n 337 (485) T protein:vir:10 261 LIFGIKPEEIGVDPETGQTLFDAYLAR-ILAFEDAEGKIQQFS--AAELANFTNALDQIAKQVAAYTGLPPQYLSTAADN 337 (485) T ss_pred HHhcCCcccccccccccchhhhhcccc-eeccCCCCceEEeec--ccchHHHHHHHHHHHHHHhcccCCCHHHhccccCc Confidence 99997654321 1111122223 334456788887654 455789999999999999999998888774 2 Q ss_pred -CcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcC- Q lcl|NC_011308. 374 -ATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETN- 451 (530) Q Consensus 374 -~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g- 451 (530) +||+||++++.+|..||.++++.|+.+|++++++++++.+.. ....+...++++|++++|.|.++.|+++.+++++| T Consensus 338 ~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~~-~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~ 416 (485) T protein:vir:10 338 PASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMMKGG-DVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGT 416 (485) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccc Confidence 699999999999999999999999999999999998865432 23456778999999999999999999998888766 Q ss_pred -CCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHH-HHHhhhccccccCCccccCCCCCCCCCCccCcCCCCc Q lcl|NC_011308. 452 -QIQINNLLAIAPRIGDEETLKAICDTLDLDYED-VVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDP 521 (530) Q Consensus 452 -~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (530) ++|+||+++++|+++++.++++++++++..... .+..+.. ......+++..++.++++.....+... T Consensus 417 ~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 417 GVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLIGTMVD---PNPTVPGSPSPAPAPKPAALESGGDAA 485 (485) T ss_pred cCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHHHHHhhc---cCCCCCCCCCccccccCcCCCCCCCCC Confidence 899999999999999887777776655443222 2222211 111112222222233333332222222 No 49 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=100.00 E-value=4.8e-74 Score=422.53 Aligned_cols=454 Identities=9% Similarity=0.014 Sum_probs=336.2 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) |+.-|......+.+++++.++..+.. +..++..+.+||.|+|+|........ ....+.++++||+++|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~--~~~rl~~l~~Yy~G~~~i~~~~~~~~---------~~~~~~~~~~n~~~~iv 69 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTE--RTQDLGDNTAYYESERRPDAVGVTVP---------QQMQKLLAHVGYPRLYI 69 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHH--HHHHHHHHHHHHhccccchhcccccc---------hhHHhhhhhcCcHHHHH Confidence 99999999999999988888877653 34678899999999998744322111 11224567899999999 Q ss_pred hhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCce--------EEEEec Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKL--------TFQTVD 151 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~--------~~~~~~ 151 (530) +..+++|++++++.. +++...+.++++++ |+++..+.+++++++++|+||+++|.+++|.. ++..+| T Consensus 70 d~~~~~l~~~g~~~~----~~~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~ 145 (484) T protein:vir:77 70 DAIAARQELEGFRLG----GADKADEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEP 145 (484) T ss_pred HHHHhhhccCceecC----CcchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEec Confidence 999999999987753 23334455667764 78999999999999999999999999998753 588899 Q ss_pred ccceEEEEcCC-CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeee Q lcl|NC_011308. 152 ALQLLPVFDDY-GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVA 230 (530) Q Consensus 152 p~~~~~v~d~~-~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (530) |.++|++||+. .++.+++++|... ....+.++++|+++.+++|...+++.. T Consensus 146 p~~~~~~~D~~~~~~~~a~~~~~~~-------~~~~~~~~~~y~~~~~~~~~~~~~~~~--------------------- 197 (484) T protein:vir:77 146 PTNLYAQIDPRTRQVMRAIRAIEDE-------EGNEVIGATLYLPNNTVIWNREDGQWV--------------------- 197 (484) T ss_pred cceeEEEecCCCCceEEEEEEEEee-------cCCcEEEEEEEecCeEEEEEecCCceE--------------------- Confidence 99999999975 5567777776532 234567889999999988876543211 Q ss_pred cccccceecccccccccccccccccCCccceEEeeCCc-----CCCCcHH-HHHHHHHHHHHHHHHHHHHHHHhccceee Q lcl|NC_011308. 231 DGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDIK-KVKSIIDDYDLMNCFLSNNLQDMAEAIYV 304 (530) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~e-~v~~liDa~~~~~S~~~n~~~~~~~~~lv 304 (530) .....+|+||.||||+|.||. +|.|+|+ .|++|||+||.++|++++..++|++|+++ T Consensus 198 -----------------~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~ 260 (484) T protein:vir:77 198 -----------------QVANVAHNLEMVPVIPIPNRTRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRL 260 (484) T ss_pred -----------------eeccccCCCCCcceEEeccccccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHH Confidence 122357999999999999975 5789997 59999999999999999999999999999 Q ss_pred eecCCCCchhh----HHHHH--hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccC----- Q lcl|NC_011308. 305 VRGGTNSPVDE----IKKNI--QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGN----- 373 (530) Q Consensus 305 l~g~~~~~~~~----~~~~~--~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn----- 373 (530) +.|...++... ....+ ..++++ ..+++++++.+++ ....+.++++|+.+|+.+|.+|++++..||. T Consensus 261 i~G~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~q~~--~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~ 337 (484) T protein:vir:77 261 LFGVKGEELGVDPETGQTLFDAYLARIL-AFEDHESKAQQFS--AAELRNFVDALDALDRKAAAYTGLPPYYLSFSSENP 337 (484) T ss_pred HhCCCcchhcccccccchhhhhhhhhhc-ccCCCCceeEeec--CCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcc Confidence 99976543211 01111 122222 3345678876654 5557889999999999999999998888742 Q ss_pred CcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcC-- Q lcl|NC_011308. 374 ATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETN-- 451 (530) Q Consensus 374 ~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g-- 451 (530) +||+||++++.+|.+||.++++.|+++|++++++++++.+... ...+...++++|+++.|.|..+.|+.+.+++++| T Consensus 338 ~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~~~-~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~g 416 (484) T protein:vir:77 338 ASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVMNGGD-IPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQG 416 (484) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC-cccccccceEEecCCCCCCHHHHHHHHHHHHhccCC Confidence 6999999999999999999999999999999999988755332 3456678999999999999999999988887765 Q ss_pred CCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHH-HHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccc Q lcl|NC_011308. 452 QIQINNLLAIAPRIGDEETLKAICDTLDLDYE-DVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIE 524 (530) Q Consensus 452 ~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (530) ++|++|+++++|+++|+.++++++++++.... ..+....... +...+.. .+.+++++.|...++... T Consensus 417 i~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~-----~~~~~~~-~~~~~~~~~~~~~~~~~~ 484 (484) T protein:vir:77 417 VIPKERARIDMGYSITEREEMRKWDEEEQAQGLGLMGTMFGTD-----PSGGGNP-DNPETPEPQPNPAEEAAA 484 (484) T ss_pred CCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccc-----ccCCCCC-CCCCcccccCCCccccCC Confidence 89999999999999999888877766554322 2222221111 1111110 111122222222222222 No 50 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=100.00 E-value=1.1e-72 Score=415.00 Aligned_cols=454 Identities=9% Similarity=0.016 Sum_probs=318.6 Q ss_pred CCcccccCCcccHHHHHH----HHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCch Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILS----TKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFF 76 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~----~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~ 76 (530) |-.+- ++.++.+++-+ +++.+|. .+..++.++.+||+|+|+|..+...... +...+..+++++||+ T Consensus 1 ~~~~p--~~~l~~~~~~~~~~~~l~~~~~--~~~~r~~~~~~YY~g~~~i~~~~~~~~~------~~~~~~~~~~~~n~~ 70 (479) T protein:vir:99 1 MIDLP--DEDLSSEGLAKYLETKVFPKMN--TECERLDDFEAWTKNGQEVPDLATRHKN------KEREVLQQLSRKPWM 70 (479) T ss_pred CccCC--cccCChhHHHHHHHHHHHHHHH--HHhHHHHHHHHHHhcCCcccccccccCC------hhHHHHHHHhhcCcH Confidence 33322 35666666433 4455554 3568899999999999998775433221 111233446688999 Q ss_pred hhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEe-----cCCCceEEEEe Q lcl|NC_011308. 77 AELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFART-----TSEDKLTFQTV 150 (530) Q Consensus 77 k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~-----d~~g~~~~~~~ 150 (530) ++||+..++|++.++++ ..+.+ ..+.+.++++ |+++....+++++++++|+||+++|. |++|.++++++ T Consensus 71 ~~iVd~~~~~l~~~gf~--~~d~~---~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~~i~~~ 145 (479) T protein:vir:99 71 GLMVNSFAQQLIVDGYR--KTGTN---ENAKGWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVARIKCI 145 (479) T ss_pred HHHHHHHHhhccccccc--CCCch---hhHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCceEEEEe Confidence 99999999999877654 33322 2334556654 78999999999999999999999994 67889999999 Q ss_pred cccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeee Q lcl|NC_011308. 151 DALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVA 230 (530) Q Consensus 151 ~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (530) ||++++++||+......++..+ .. .....+.+||.+.++.|....+.. T Consensus 146 ~p~~~~~iydd~~~~~~~~~~~--~~--------~~~~~~~~~~~~~~~~~~~~~~~~---------------------- 193 (479) T protein:vir:99 146 DPRDAFAIWEDPYWDEWPKYLL--ER--------QPNGQYWWWTEEDYSIFEFKQGKF---------------------- 193 (479) T ss_pred chhheEEEecCCcccceeeEEE--ee--------cCceeEEEEecceEEEEEecCCce---------------------- Confidence 9999999998765443333222 11 112345678888777776543321 Q ss_pred cccccceecccccccccccccccccCCccceEEeeCC----cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeee Q lcl|NC_011308. 231 DGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNN----KLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVR 306 (530) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn----~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~ 306 (530) ......+|+||.||||+|.|| .+|.|+|+++++|||+||.++|++++.+++|++|++++. T Consensus 194 ----------------~~~~~~~h~~g~vPvv~f~n~~~~~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~ 257 (479) T protein:vir:99 194 ----------------IYRETVSHDYGHIPFVRYVNVMDLRGVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWAT 257 (479) T ss_pred ----------------eeccccccCCCCcceEEeecCCCcCcCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhc Confidence 112346799999999999998 679999999999999999999999999999999999999 Q ss_pred cCCCCchhhHH---HHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc---cCCcHHHHH Q lcl|NC_011308. 307 GGTNSPVDEIK---KNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD---GNATNVVIK 380 (530) Q Consensus 307 g~~~~~~~~~~---~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~---gn~SGvAik 380 (530) |....+..... ..+...+++.+ .++++++.+++ ....++++++|+..|+.++.++++.+..| ||+||+||+ T Consensus 258 G~~~~~~~~~~~~~~~~~~~~i~~~-~~~~~~~~q~~--~~~~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n~Sg~Al~ 334 (479) T protein:vir:99 258 GLMLPEGANADQEKMRFAQESMLIS-QNEKASFGAIP--AAPLDGLLNAYKESLLEFLALAQLPPHIAGQIVNVAADALA 334 (479) T ss_pred CCCcccccccchhccccccccceee-cCCCceEEEec--ccchHHHHHHHHHHHHHHhccCCCCHHHcccccchHHHHHH Confidence 97654332221 23445566654 46678876554 55678899999999999988877766554 679999999 Q ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHH Q lcl|NC_011308. 381 SRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLA 460 (530) Q Consensus 381 ~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~ 460 (530) +++.+|.+||.++++.|+.+|++++++++.+.+... ..+...++++|.+..|.|..+.|+.+.+++++|++|+||+++ T Consensus 335 ~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~~--~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~is~et~l~ 412 (479) T protein:vir:99 335 AGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGRTE--EATDLDFTITWQDVTIQSLAQFADAWAKMVESLKIPAEGVWD 412 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCc--cccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCCHHHHHH Confidence 999999999999999999999999999987754332 344467899999999999999999999999999999999999 Q ss_pred hCCCCCCHHHH-HHHHHHHHHHHHHHHHhhhccccccC-CccccCCC---CCCCCCCccCcCCCCcc Q lcl|NC_011308. 461 IAPRIGDEETL-KAICDTLDLDYEDVVKALEDQEVEEL-EPTVTPII---DPLTIEPQPEPLNIDPV 522 (530) Q Consensus 461 ~~~~vdd~~~e-~~~~e~e~~e~~~~~~~~~~~~~~~~-~~~~~~~~---~~~~~~~~~~~~~~~~~ 522 (530) ++|++++++.+ +++.++++.+.......+..+..+.. .+..++.+ +..+.+..+..+|-.+. T Consensus 413 ~l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (479) T protein:vir:99 413 MIPNLDQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQANNKTGEPASLNKSGA 479 (479) T ss_pred hcCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCCCCCCCcchhccCCCCC Confidence 99999987633 33333333333333344332221111 01111111 11111122222333332 No 51 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=100.00 E-value=2.3e-71 Score=407.80 Aligned_cols=472 Identities=11% Similarity=-0.004 Sum_probs=319.4 Q ss_pred CCc------------ccccCCcccH---HHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhccccccccccccccccc Q lcl|NC_011308. 1 MTN------------TLLTTAPDRL---GTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDN 65 (530) Q Consensus 1 ~~~------------~~~~~~~~~~---~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~ 65 (530) ||- +-+-+..|+. -+++.+++.+|.. +..++..+.+||+|+|++....... .+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~l~~~l~~~~~~--~~~rl~~l~~YY~G~~~~~~~~~~~--------~~~~ 70 (501) T protein:vir:25 1 MTVPVDVIADAPAADVEFPEDSMSREQLGALVADMWRLHIS--ERQWLDRIYEYTKGLRGRPEVPEGA--------SDEV 70 (501) T ss_pred CcccchhhhccCcccccCCcccCChHHHHHHHHHHHHHHHH--HHHHHHHHHHHHhcCCCchhccccC--------Chhh Confidence 332 1122223333 3445566666553 5688999999999999864332211 1223 Q ss_pred CC-cceeecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecCCC Q lcl|NC_011308. 66 AS-NIKISHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTSED 143 (530) Q Consensus 66 ~~-n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g 143 (530) +. +.++++||+++||+..++|++.++++ ..+.+. .+.+++++ .|+++..+.+++++++++|+||+++|.+++| T Consensus 71 ~~~~~~~v~n~~~~ivd~~a~~l~~~gf~--~~d~~~---~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~ 145 (501) T protein:vir:25 71 KELAKLSVKNVLSLVRDSFAQNLSVVGYR--NALAKE---NDPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEG 145 (501) T ss_pred hhhHhhhhcChHHHHHHHHHhhhccccee--cCCccc---hHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCC Confidence 33 34567899999999999999876644 433322 23455666 4789999999999999999999999999988 Q ss_pred ceEEEEecccceEEEEcC-CCC--ceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccc Q lcl|NC_011308. 144 KLTFQTVDALQLLPVFDD-YGT--LQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNP 220 (530) Q Consensus 144 ~~~~~~~~p~~~~~v~d~-~~~--~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~ 220 (530) .+++++||.++|+||++ ... ..+++++|.... ......++++|+++.+++|............. T Consensus 146 -~~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~------~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~------ 212 (501) T protein:vir:25 146 -PVFRTRSPRQILAVYADPSVDAWPQYALETWVAQK------DAKPHRRGVLYDDTYMYELDLGEVVLGDAGGG------ 212 (501) T ss_pred -CeEEEeccccEEEEEecCCCCcceeEEEEEEeecc------ccCcceeEEEecCeeEEEEecCceeeeecccc------ Confidence 47889999999999965 333 566677665322 23345678999999888775433211000000 Q ss_pred cccceeeeeecccccceecccccccccccccccccCCccceEEeeC----CcCCCCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 221 NPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN----NKLGISDIKKVKSIIDDYDLMNCFLSNNLQ 296 (530) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n----n~~~~sd~e~v~~liDa~~~~~S~~~n~~~ 296 (530) .....................+|+||.||||+|.| +..|.|+|+++++|+|+||..+|++++..+ T Consensus 213 -----------~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e 281 (501) T protein:vir:25 213 -----------QATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGRDADDMIVGEVAPLILLQQAINSVNFDRLIVSR 281 (501) T ss_pred -----------ccccccccccccccccccccccCCccceeeEeccCccccCccccchhhhhHHHHHHHHHHHHHHHHHHH Confidence 00001111111222234456789999999999999 456899999999999999999999999999 Q ss_pred HhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEecC-CHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCC Q lcl|NC_011308. 297 DMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVDI-PYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNA 374 (530) Q Consensus 297 ~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~~-~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~ 374 (530) +|++|++++.|++.++.+.+ .++.++++.+ .++++++.+++. +.+.....++++...|+..|.+|+.+..++ +|+ T Consensus 282 ~~a~p~~~i~G~~~~~~~~~--~~~~~~i~~~-~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~ 358 (501) T protein:vir:25 282 FGANPQRVISGWTGSKAEVL--KASALRVWTF-EDPEVKAQAFPPASVEPYNLILEEMLQHVAMVAQISPAQVTGKMINV 358 (501) T ss_pred hhccHHHHHhCCCCCccchh--hhcccceecc-CCCCceEEEecccChHHHHHHHHHHHHHHHhhcCCChhhhccccCCh Confidence 99999999999987665533 3455666655 456788877765 345666667777777777788887766665 689 Q ss_pred cHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCc Q lcl|NC_011308. 375 TNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQ 454 (530) Q Consensus 375 SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS 454 (530) ||+||++++.+|.+||.++++.|+++|++++++++.+.+.. ...+..+++++|+++.|.|..+.|+++.+++++| +| T Consensus 359 Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~--~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~g-is 435 (501) T protein:vir:25 359 SAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDP--DTAADSGAEVLWRDTEARSFGAVVDGITKLASAG-IP 435 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--ccccceeeeEEecCCCCCCHHHHHHHHHHHHhcC-CC Confidence 99999999999999999999999999999999988775532 2345568999999999999999999999988887 69 Q ss_pred HHHHHHhCCCCCCHHHHHHHHHHHHHHH--HHHHHhhhcccc-ccCCccccCCCCCCCCCCccCcCCCC Q lcl|NC_011308. 455 INNLLAIAPRIGDEETLKAICDTLDLDY--EDVVKALEDQEV-EELEPTVTPIIDPLTIEPQPEPLNID 520 (530) Q Consensus 455 ~et~l~~~~~vdd~~~e~~~~e~e~~e~--~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (530) +||++.++|++++++. +++++++++. ...+..+..++. +...+.+++..++++.... .|.++. T Consensus 436 ~et~~~~~~g~~~~~i--e~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~ 501 (501) T protein:vir:25 436 IEHLLSMVPGMTQQTI--QAIKDSLRGGEVKSLVDKLLSNEPAPVPPPPPQAAAQALNEGGV-NGNGGA 501 (501) T ss_pred HHHHHHHcCCCCHHHH--HHHHHHHHHHhHHHHHHHhhccCcCCCCCCCCCCCccccccccC-CCCCCC Confidence 9999999999987653 3333333222 122222222211 1111111111111100001 111111 No 52 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=100.00 E-value=1.5e-71 Score=408.85 Aligned_cols=442 Identities=9% Similarity=-0.012 Sum_probs=301.2 Q ss_pred CCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhh Q lcl|NC_011308. 8 TAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYL 87 (530) Q Consensus 8 ~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl 87 (530) =.+.|..+++++++.+|.. +..++.++.+||+|+|+|......... .....++|+++||+++||++.++|| T Consensus 1 ~~~~t~~~~~~~l~~~~~~--~~~r~~~l~~Yy~g~~~i~~~~~~~~~-------~~~~~~~k~~~n~~~~ivd~~~~~l 71 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDD--GMSRVRLLARYSNGDAPLPELTRNTSA-------AWRSFQREARTNWGLMVRDSVADRI 71 (456) T ss_pred CCCCCHHHHHHHHHHHHHH--HHHHHHHHHHHHhcCCCchhcCcccCh-------hhhhhhhhhhcchHHHHHHHHHhhh Confidence 3356677889999888753 568899999999999988544332221 1223478999999999999999999 Q ss_pred cccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEcCCC--C Q lcl|NC_011308. 88 LANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFDDYG--T 164 (530) Q Consensus 88 ~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~~--~ 164 (530) +|+|+++...++. +....++++++ |+++....+++++++++|+||+++|.+++|+++++++||.++|++||+.. . T Consensus 72 ~~~~~~~~~~~d~--~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~ 149 (456) T protein:vir:10 72 IPNGITVGGSADS--DLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWR 149 (456) T ss_pred ccCCeecCCCCCc--chHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCCcc Confidence 9999998654422 23344566664 77899999999999999999999999999999999999999999999875 4 Q ss_pred ceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccc Q lcl|NC_011308. 165 LQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEE 244 (530) Q Consensus 165 ~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (530) +.+++++|... .....+..+|+.+++..|............ . ....... T Consensus 150 ~~~~i~~~~~~--------d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------------~-~~~~~~~ 198 (456) T protein:vir:10 150 IRAAMRWWRDL--------DAESDFAIVWSGDGWQKFARPCFVQSSSRR----------------------R-LVTRISD 198 (456) T ss_pred eEEEEEEEEec--------CCceeEEEEEeccceeEEEEEEEEeecccc----------------------e-eeeecCC Confidence 66777777532 122345555655554443221100000000 0 0000011 Q ss_pred cccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCc-----hhhHH-- Q lcl|NC_011308. 245 HEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSP-----VDEIK-- 317 (530) Q Consensus 245 ~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~-----~~~~~-- 317 (530) ........+|+||.+||++| +|.+|.|+|+++++|||+||.++|+.+++.+++++|++++.|...+. .+... T Consensus 199 ~~~~~~~~~~~~~~~pvv~~-~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~ 277 (456) T protein:vir:10 199 SWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDY 277 (456) T ss_pred ceeeccccCCCCCceeEEEe-cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccch Confidence 11223345789999999876 67899999999999999999999999999999999999999964322 11100 Q ss_pred -H--HHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhhHHHHHHHH Q lcl|NC_011308. 318 -K--NIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTLLAMKAQKT 393 (530) Q Consensus 318 -~--~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~l~~ka~~k 393 (530) . ....+++..+++++++..++ ..+.+.....++++...|+..+++|+....++ +|+||+||++++.+|++||.++ T Consensus 278 ~~~~~~~~~~~~~~~~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~ 356 (456) T protein:vir:10 278 ASIFEAAPGALWELPPGVDIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDR 356 (456) T ss_pred hhhhhhhccccccCCCCcceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHH Confidence 0 11223344455555554432 22334444445555555555555664444443 5899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHH Q lcl|NC_011308. 394 EIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKA 473 (530) Q Consensus 394 e~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~ 473 (530) ++.|+++|++++++++.+. +. .+...++++|+++.|.|.++.|+++++++++|++|.+|+++.+++.++ +.+.. T Consensus 357 ~~~f~~~l~~~~rl~~~~~---g~--~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~-~i~~~ 430 (456) T protein:vir:10 357 LSIAKIGLEAILVKALQIE---GE--SVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNAD-QIKQD 430 (456) T ss_pred HHHHHHHHHHHHHHHHHhc---CC--CcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHH-HHHHH Confidence 9999999999999986643 32 234578999999999999999999999999999999999999988654 22222 Q ss_pred HHHHHHHHHHHHHHhhhccccccCCccccC Q lcl|NC_011308. 474 ICDTLDLDYEDVVKALEDQEVEELEPTVTP 503 (530) Q Consensus 474 ~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~ 503 (530) +++++++|.........+.. +++++- T Consensus 431 e~er~~~e~~~~~~~~~~~~----~~~~~~ 456 (456) T protein:vir:10 431 DLDRAREQITLFAGNPVQRP----QEDGSR 456 (456) T ss_pred HHHHHHHHHHHHhhhhhhcC----CCCCCC Confidence 22332222221111111110 111111 No 53 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=100.00 E-value=1.5e-71 Score=408.85 Aligned_cols=442 Identities=9% Similarity=-0.012 Sum_probs=301.2 Q ss_pred CCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhh Q lcl|NC_011308. 8 TAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYL 87 (530) Q Consensus 8 ~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl 87 (530) =.+.|..+++++++.+|.. +..++.++.+||+|+|+|......... .....++|+++||+++||++.++|| T Consensus 1 ~~~~t~~~~~~~l~~~~~~--~~~r~~~l~~Yy~g~~~i~~~~~~~~~-------~~~~~~~k~~~n~~~~ivd~~~~~l 71 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDD--GMSRVRLLARYSNGDAPLPELTRNTSA-------AWRSFQREARTNWGLMVRDSVADRI 71 (456) T ss_pred CCCCCHHHHHHHHHHHHHH--HHHHHHHHHHHHhcCCCchhcCcccCh-------hhhhhhhhhhcchHHHHHHHHHhhh Confidence 3356677889999888753 568899999999999988544332221 1223478999999999999999999 Q ss_pred cccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEcCCC--C Q lcl|NC_011308. 88 LANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFDDYG--T 164 (530) Q Consensus 88 ~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~~--~ 164 (530) +|+|+++...++. +....++++++ |+++....+++++++++|+||+++|.+++|+++++++||.++|++||+.. . T Consensus 72 ~~~~~~~~~~~d~--~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~ 149 (456) T protein:vir:10 72 IPNGITVGGSADS--DLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWR 149 (456) T ss_pred ccCCeecCCCCCc--chHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCCcc Confidence 9999998654422 23344566664 77899999999999999999999999999999999999999999999875 4 Q ss_pred ceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccc Q lcl|NC_011308. 165 LQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEE 244 (530) Q Consensus 165 ~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (530) +.+++++|... .....+..+|+.+++..|............ . ....... T Consensus 150 ~~~~i~~~~~~--------d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------------~-~~~~~~~ 198 (456) T protein:vir:10 150 IRAAMRWWRDL--------DAESDFAIVWSGDGWQKFARPCFVQSSSRR----------------------R-LVTRISD 198 (456) T ss_pred eEEEEEEEEec--------CCceeEEEEEeccceeEEEEEEEEeecccc----------------------e-eeeecCC Confidence 66777777532 122345555655554443221100000000 0 0000011 Q ss_pred cccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCc-----hhhHH-- Q lcl|NC_011308. 245 HEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSP-----VDEIK-- 317 (530) Q Consensus 245 ~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~-----~~~~~-- 317 (530) ........+|+||.+||++| +|.+|.|+|+++++|||+||.++|+.+++.+++++|++++.|...+. .+... T Consensus 199 ~~~~~~~~~~~~~~~pvv~~-~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~ 277 (456) T protein:vir:10 199 SWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDY 277 (456) T ss_pred ceeeccccCCCCCceeEEEe-cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccch Confidence 11223345789999999876 67899999999999999999999999999999999999999964322 11100 Q ss_pred -H--HHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhhHHHHHHHH Q lcl|NC_011308. 318 -K--NIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTLLAMKAQKT 393 (530) Q Consensus 318 -~--~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~l~~ka~~k 393 (530) . ....+++..+++++++..++ ..+.+.....++++...|+..+++|+....++ +|+||+||++++.+|++||.++ T Consensus 278 ~~~~~~~~~~~~~~~~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~ 356 (456) T protein:vir:10 278 ASIFEAAPGALWELPPGVDIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDR 356 (456) T ss_pred hhhhhhhccccccCCCCcceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHH Confidence 0 11223344455555554432 22334444445555555555555664444443 5899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHH Q lcl|NC_011308. 394 EIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKA 473 (530) Q Consensus 394 e~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~ 473 (530) ++.|+++|++++++++.+. +. .+...++++|+++.|.|.++.|+++++++++|++|.+|+++.+++.++ +.+.. T Consensus 357 ~~~f~~~l~~~~rl~~~~~---g~--~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~-~i~~~ 430 (456) T protein:vir:10 357 LSIAKIGLEAILVKALQIE---GE--SVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNAD-QIKQD 430 (456) T ss_pred HHHHHHHHHHHHHHHHHhc---CC--CcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHH-HHHHH Confidence 9999999999999986643 32 234578999999999999999999999999999999999999988654 22222 Q ss_pred HHHHHHHHHHHHHHhhhccccccCCccccC Q lcl|NC_011308. 474 ICDTLDLDYEDVVKALEDQEVEELEPTVTP 503 (530) Q Consensus 474 ~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~ 503 (530) +++++++|.........+.. +++++- T Consensus 431 e~er~~~e~~~~~~~~~~~~----~~~~~~ 456 (456) T protein:vir:10 431 DLDRAREQITLFAGNPVQRP----QEDGSR 456 (456) T ss_pred HHHHHHHHHHHHhhhhhhcC----CCCCCC Confidence 22332222221111111110 111111 No 54 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=100.00 E-value=6.8e-71 Score=405.28 Aligned_cols=439 Identities=9% Similarity=-0.018 Sum_probs=306.6 Q ss_pred CCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhh Q lcl|NC_011308. 8 TAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYL 87 (530) Q Consensus 8 ~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl 87 (530) =.++|..++++.+|.+|. .+..++.++.+||+|+|+|......... +....++|+++||+++||++.++|+ T Consensus 1 ~~~~t~~~~~~~l~~~~~--~~~~r~~~l~~Yy~g~~~i~~~~~~~~~-------~~~~~~~~~~~n~~~~ivd~~~~~l 71 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRID--DGMSRVRLLARYSNGDAPLPELTRNTSA-------AWRSFQREARTNWGLMVRDSVADRI 71 (456) T ss_pred CCCCCHHHHHHHHHHHHH--HHHHHHHHHHHHHhccCChhhcCcccCh-------hhchhhhhhhcchHHHHHHHHHhhh Confidence 235667788899888765 3567899999999999998653322211 1122355788999999999999999 Q ss_pred cccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEcCCC--C Q lcl|NC_011308. 88 LANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFDDYG--T 164 (530) Q Consensus 88 ~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~~--~ 164 (530) +|+|+++...++ .+..+.++++++ |+++....+++++++++|+||+++|.+++|.+++++++|.++||+||+.. . T Consensus 72 ~~~g~~~~~~~d--~~~~~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~~i~~~~p~~~~~i~d~~~~~~ 149 (456) T protein:vir:79 72 IPNGITVGGSAD--SDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWR 149 (456) T ss_pred ccCCeecCCCCC--ccHHHHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEeccceeEEEEcCCCCCc Confidence 999999875443 233445666764 67899999999999999999999999999999999999999999999754 4 Q ss_pred ceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccc Q lcl|NC_011308. 165 LQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEE 244 (530) Q Consensus 165 ~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (530) +.+++++|... .....+..+|+.+.+++|............. ....... T Consensus 150 ~~~~~~~~~~~--------d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~ 198 (456) T protein:vir:79 150 IRSAMRWWRDL--------DAESDFAIVWSGDGWQKFARPCFVQSSSRRR-----------------------LVTRISD 198 (456) T ss_pred eEEEEEEEEec--------CCceeEEEEEcCCceEEEEEEEEeeccccce-----------------------eeeccCC Confidence 66778877531 2335567788888887765433211110000 0000111 Q ss_pred cccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCc-----hhhHH-- Q lcl|NC_011308. 245 HEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSP-----VDEIK-- 317 (530) Q Consensus 245 ~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~-----~~~~~-- 317 (530) ........+|+||.|||++| +|.+|.|+|+++++|||+||.++|+.+++.+++++|++++.|...+. .+... T Consensus 199 ~~~~~~~~~~~~~~~pvv~~-~N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~ 277 (456) T protein:vir:79 199 SWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDY 277 (456) T ss_pred ceeecccccCCCCceeEEEe-cCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccch Confidence 12223456899999999998 67899999999999999999999999999999999999999975432 11110 Q ss_pred -HH--HhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc----cCCcHHHHHHHHhhHHHHH Q lcl|NC_011308. 318 -KN--IQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD----GNATNVVIKSRYTLLAMKA 390 (530) Q Consensus 318 -~~--~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~----gn~SGvAik~~~~~l~~ka 390 (530) .. ...++++.++++.++. +++....+.+++.|+..|+.++..+++++..+ +|+||+||++++.+|++|| T Consensus 278 ~~~~~~~~~~~~~~~~~~~~~----q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~ 353 (456) T protein:vir:79 278 ASIFEAAPGALWELPPGVDIW----ESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKC 353 (456) T ss_pred hhhhhhhccccccCCCCccee----eecccChHHHHHHHHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHH Confidence 11 1122333444444442 33344445556666666666655555444433 5899999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHH Q lcl|NC_011308. 391 QKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEET 470 (530) Q Consensus 391 ~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~ 470 (530) +++++.|+++|++++++++++. +. .+...++++|+++.|.|.++.|++++++++.|++|++++++.+++.++ +. T Consensus 354 ~~~~~~f~~~l~~~~~l~~~~~---g~--~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~~-~i 427 (456) T protein:vir:79 354 EDRLSIAKIGLEAILVKALQIE---GE--SVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNAD-QI 427 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHhc---CC--CccccceEEeCCCCCcCHHHHHHHHHHHHhcCCChHHHHHhcCCCCHH-HH Confidence 9999999999999999987653 32 244578999999999999999999999999999999999998887553 22 Q ss_pred HHHHHHHHHHHHHHHHHhhhccccccCCccccC Q lcl|NC_011308. 471 LKAICDTLDLDYEDVVKALEDQEVEELEPTVTP 503 (530) Q Consensus 471 e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~ 503 (530) +.++++++.+|.......+-. ..++++.- T Consensus 428 ~~~e~~r~~~e~~~~~~~~~~----~~~~~~~~ 456 (456) T protein:vir:79 428 KQDDLDRAREQITLFAGNPVQ----RPQEDGSR 456 (456) T ss_pred HHHHHHHHHHHHHHHhhhHhh----cCCCCCCC Confidence 222223332222222111111 11111111 No 55 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=100.00 E-value=2.2e-62 Score=358.60 Aligned_cols=452 Identities=10% Similarity=0.001 Sum_probs=305.1 Q ss_pred CcccHHHHHHHHHHH---------HH-------HhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceee Q lcl|NC_011308. 9 APDRLGTILSTKIDE---------YI-------RSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKIS 72 (530) Q Consensus 9 ~~~~~~~~i~~~i~~---------~~-------~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~ 72 (530) =.+.+...++.++.+ .. +.....++...++||.|+|+++.+.....+. ..+.++|++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~-------~~~~~~~~~ 73 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNG-------NPVNRRQLS 73 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCC-------Cccccceee Confidence 111122222222222 11 2234467889999999999998876544332 223456899 Q ss_pred cCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEec Q lcl|NC_011308. 73 HGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVD 151 (530) Q Consensus 73 ~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~ 151 (530) +||+|.||++.++||+|+||+|+++ ++..++.|+++++ ++|..++.+++..++++|.+|.++|+|++|.+++.+++ T Consensus 74 ~n~~k~i~~~~a~~l~~~p~~i~~~---d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~ 150 (496) T protein:vir:38 74 MNLPKVTAKYMSKLLFNEKVKINID---DKAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFAT 150 (496) T ss_pred cchHHHHHHHHhhhhhCCcceEeeC---ChHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcEEEEEEc Confidence 9999999999999999999999764 4667888999986 57999999999999999999999999999999999999 Q ss_pred ccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEE---EeecCCcccchhhccccccccccceeee Q lcl|NC_011308. 152 ALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWY---YVQKDEGRSDEYVLDTTVNPNPSQHVLA 228 (530) Q Consensus 152 p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~---y~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (530) |.++||++++++.+.+++. +.... ..++..++++.|+.....+ |..........+...+ .+.. T Consensus 151 ~~~~~P~~~~~~~~~~~~f-~~~~~-----~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v--------~~~~ 216 (496) T protein:vir:38 151 ADCMYPLSNDSENVDECVI-ANSFH-----KNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKV--------SLTL 216 (496) T ss_pred ccceEEEEecCCcEEEEEE-EEEEE-----eCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCccc--------cccc Confidence 9999999988877654332 22211 1344566677766332221 1111111000000000 0000 Q ss_pred eecccccceecccccccccccccccccCCccceEEeeCC---------cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_011308. 229 VADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNN---------KLGISDIKKVKSIIDDYDLMNCFLSNNLQDMA 299 (530) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn---------~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~ 299 (530) +. . ........+++.+.||++|+++ ..|.|+|+++++|||+||.++|++++.++... T Consensus 217 ~~-------------~-~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~~ 282 (496) T protein:vir:38 217 LF-------------D-DIEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGK 282 (496) T ss_pred cc-------------c-ccccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhcc Confidence 00 0 0011122356788899888764 24899999999999999999999999999877 Q ss_pred cceee-------eecCCCCchhhHHHHHhhCcceecCCCC---ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CC Q lcl|NC_011308. 300 EAIYV-------VRGGTNSPVDEIKKNIQSKKIIQTKGEG---GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SS 367 (530) Q Consensus 300 ~~~lv-------l~g~~~~~~~~~~~~~~~~~~i~~~~~~---~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~ 367 (530) ..+++ ..+..++....+..+.+...++....++ .++.++.++..++....++.+.+.|...+..+. ++ T Consensus 283 ~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~ 362 (496) T protein:vir:38 283 KKVLVPSSFVKTAVNLDGSTTQYFDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFT 362 (496) T ss_pred cceecchHHhhccCCCCCccccCCCCccceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcC Confidence 77776 2233333333344444444444443332 356666677888888888888877777766553 34 Q ss_pred cccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---cCCCccccceeeEEeCCCCCCCHHHHHHHH Q lcl|NC_011308. 368 AVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRR---RGLGDYSSTDIKFDIEPYILANELDLAMID 444 (530) Q Consensus 368 ~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~---~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~ 444 (530) ....|++|+.|++++++.+..+|..+++.|+++|++++++|+.+... ..+..++...+++.|++++|.|..+.++++ T Consensus 363 ~~~~g~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~ 442 (496) T protein:vir:38 363 FDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRY 442 (496) T ss_pred CCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHH Confidence 44556778999999999999999999999999999999998876543 334456667899999999999999999999 Q ss_pred HHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCC Q lcl|NC_011308. 445 KTEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTI 510 (530) Q Consensus 445 ~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (530) +.++++|++|++|+|+.+|++++++++ +++++.++|....++ ..+..+..++. + T Consensus 443 ~~~~~~GiiS~et~l~~~~~~~d~ea~-~el~ri~~E~~~~~~---~~d~~~~~~~~--------e 496 (496) T protein:vir:38 443 TNAKNQGMIPLKIALQRAWNITEAEAD-EWAEMLAKEKQAEMP---NNDMNGIFGEE--------E 496 (496) T ss_pred HHHHhcCCCCHHHHHHhcCCCChHHHH-HHHHHHHHhhhccCc---cccccCCCCCC--------C Confidence 999999999999999999999886643 333444333322221 10000001111 1 No 56 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=100.00 E-value=2e-63 Score=364.38 Aligned_cols=404 Identities=8% Similarity=-0.044 Sum_probs=279.4 Q ss_pred hhhcccccccccccccccccCCcce-eecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHH Q lcl|NC_011308. 46 IENTRIMWMNDHGDIVEDDNASNIK-ISHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQEL 123 (530) Q Consensus 46 I~~r~~~~~~~~~~~~~~~~~~n~k-i~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~ 123 (530) ++.+. ..+..++.+| ++.||+++||+..++++.+++++ +.+. +..+.++++++ |+++....++ T Consensus 1 ~l~~~----------~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~--~~d~---~~~~~~~~i~~~N~~d~~~~~~ 65 (434) T protein:vir:98 1 MLPKN----------AEQAFLDFQRKARTNFCGLIANASVHRLLALGVT--GPDG---EPDTRASRWWQANRLDSRQKLV 65 (434) T ss_pred CCCCC----------ccHHHHHhhhhhhccchHHHHHHHHhhhccCcee--cCCC---chHHHHHHHHHhcChhHHHHHH Confidence 22111 1223334444 57899999999999999988765 3332 23445666664 7899999999 Q ss_pred HHHHhhcCeEEEEEEecCCC-------ceEEEEecccceEEEEcCCC-CceeEEEEEEEEeecccccccceEEEEEEEcC Q lcl|NC_011308. 124 VEGSTIKGYEGIFARTTSED-------KLTFQTVDALQLLPVFDDYG-TLQRIIRFYTEQRYSDADNKFNSIGHADVWTD 195 (530) Q Consensus 124 ~~~~~~~G~a~~~~y~d~~g-------~~~~~~~~p~~~~~v~d~~~-~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~ 195 (530) +++++++|+||+++|.++++ .++++++||.++|++||+.. .+.+++++|.... ... .+..+|+. T Consensus 66 ~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~------~~~--~~~~~~~~ 137 (434) T protein:vir:98 66 WRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDI------DGF--GYARVFFD 137 (434) T ss_pred HHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEecc------CCc--eEEEEEEe Confidence 99999999999999988655 35588999999999999754 5677777664311 112 23344444 Q ss_pred CceEEEeecCCcccc-hhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCC----cCC Q lcl|NC_011308. 196 TEVWYYVQKDEGRSD-EYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNN----KLG 270 (530) Q Consensus 196 ~~~~~y~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn----~~~ 270 (530) +..+.|......... ..... ............+|+||.||||+|+|| ++| T Consensus 138 ~~~~~~~~~~~~~~~~~~~~~-------------------------~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~g 192 (434) T protein:vir:98 138 DTSFPYRTRERTGARLPWGPD-------------------------SWVYTGTADSGDVHDLGGMQLVEFARMPDLGEDP 192 (434) T ss_pred CcEEEEEEeeccccccccccc-------------------------cceecccccccccCCCCccceEEeccCCCcCcCC Confidence 444444322211100 00000 000011123356799999999999999 789 Q ss_pred CCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHH--------HHHhhCcceecCCCCceeEEEecCC Q lcl|NC_011308. 271 ISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIK--------KNIQSKKIIQTKGEGGLDIQTVDIP 342 (530) Q Consensus 271 ~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~--------~~~~~~~~i~~~~~~~~~~lt~~~~ 342 (530) .|+|+.+++|||+||.++|+.++..++|++|+++++|...++..+.. ........+.+.+++++++.+. + T Consensus 193 ~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~--~ 270 (434) T protein:vir:98 193 EPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKFAKRTDPATGMTVVDQPFVPSPSAVWASEGENTQFGQL--D 270 (434) T ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccccchhhhhhhccccccccCCCCCceEEEe--c Confidence 99999999999999999999999999999999999997654321110 0111222334445677887654 4 Q ss_pred HHHHHHHHHHHHHHHHHHhcccCCCccccc----CCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_011308. 343 YEARKAKMDIDELNIYRSGMGFNSSAVGDG----NATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLG 418 (530) Q Consensus 343 ~~~~e~~ld~L~~~I~~~s~~p~~~~~~~g----n~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~ 418 (530) ....+.++++|+..|+.++.++++.+..|| |+||+||++++.+|..||.++++.|+++|++++++++.+. + . T Consensus 271 ~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~---g-~ 346 (434) T protein:vir:98 271 ATDLSGFLKEHASDVRDMLTISQTPTYLYATDLVNISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQA---G-V 346 (434) T ss_pred CcchHHHHHHHHHHHHHHhcccCCCHHHhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---C-C Confidence 556788999999999999999988777764 6899999999999999999999999999999999987652 3 3 Q ss_pred ccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhcccc---c Q lcl|NC_011308. 419 DYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEV---E 495 (530) Q Consensus 419 ~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~---~ 495 (530) ..+..+++++|+++.|.|..+.|++++++++.| +|.+++++++|+.+ ++++|++++..+.......+..+.. . T Consensus 347 ~~~~~~~~v~w~~~~~~s~~~~ada~~kl~~~g-~~~e~~~~~lg~~~---~e~~r~~~e~~~~~~~~~~~~~~~~~~~~ 422 (434) T protein:vir:98 347 PEDYTEAEVRWANPAHVTMAVKADAATKLKSIG-YPLDVIAEELDESP---ARVRRIVAGAASQALLAASLLPAPGAPSA 422 (434) T ss_pred ChhheeeeEEecCCCCCCHHHHHHHHHHHHhcC-CcHHHHHHhCCCCH---HHHHHHHHHHHHHHHHHHhhhccCCCCCC Confidence 456678999999999999999999999998877 69999999999854 4666666655443332222222111 1 Q ss_pred cCCccccCCCCCCCCCCccCcCCCCc Q lcl|NC_011308. 496 ELEPTVTPIIDPLTIEPQPEPLNIDP 521 (530) Q Consensus 496 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (530) +..++.++ .+|+ T Consensus 423 g~~~~~~~--------------~~dg 434 (434) T protein:vir:98 423 GNVPDSGG--------------AVDG 434 (434) T ss_pred CCCCcccC--------------CCCC Confidence 11111111 1122 No 57 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=100.00 E-value=3.5e-62 Score=357.53 Aligned_cols=459 Identities=10% Similarity=0.036 Sum_probs=321.8 Q ss_pred CCcccccCC---------cccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCccee Q lcl|NC_011308. 1 MTNTLLTTA---------PDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKI 71 (530) Q Consensus 1 ~~~~~~~~~---------~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki 71 (530) ||.+--... +.+..++|..++.++.. +..++.++.+||.|+|.+....... .... .++++ T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~~~--~~~r~~~l~~YY~G~~~i~~~~~~~--------p~~~-~~~~~ 69 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDVVDKVNGLYQQLVD--RTPRNLLRASFYDGKYAIRQIGNLI--------PPEY-LRTAT 69 (504) T ss_pred CCccCCcccccccccCCCCHHHHHHHHHHHHHHHH--HhHHHHHHHHHHhccccchhccccc--------cHHH-HHHhh Confidence 554433322 23334567888887753 4578999999999999874432211 1112 25578 Q ss_pred ecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCce--EEE Q lcl|NC_011308. 72 SHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKL--TFQ 148 (530) Q Consensus 72 ~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~--~~~ 148 (530) ++||+++||+..++++..++++... ++.....|++++. |+++....+++++++++|+||.++|.+++|+. .++ T Consensus 70 v~n~~~~iVd~~a~rl~~~Gf~~~d----~~~~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~ 145 (504) T protein:vir:99 70 VLGWSAKAVDTLARRCNLESFVWPD----GDYGSIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIH 145 (504) T ss_pred ccCcHHHHHHHHHhhhccceeeCCC----CChhhHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEE Confidence 9999999999999999988875432 2222344566664 78899999999999999999999999998874 578 Q ss_pred EecccceEEEEcCCC-CceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceee Q lcl|NC_011308. 149 TVDALQLLPVFDDYG-TLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVL 227 (530) Q Consensus 149 ~~~p~~~~~v~d~~~-~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (530) .+||.++|++||+.. .+.+++++|... .......+++|+++.+++|...+.+.+. T Consensus 146 ~~sP~~~~~iyD~~~~~~~~a~~~~~~d-------~~g~~~~~~~y~~~~~~~~~~~~~~~~~----------------- 201 (504) T protein:vir:99 146 VKSAMQATGEWNSRRNAMDSLLSITSRD-------AEGHPTGIALYEDGVTVTADMDDDGDWH----------------- 201 (504) T ss_pred EeccceeEEEEeCCCCceeEEEEEEEec-------CCCeEEEEEEEcCCcEEEEEEcCCceee----------------- Confidence 999999999999754 456666655321 1234567899999999988765443211 Q ss_pred eeecccccceecccccccccccccccccCCccceEEeeCCc-----CCCCcH-HHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_011308. 228 AVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDI-KKVKSIIDDYDLMNCFLSNNLQDMAEA 301 (530) Q Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~-e~v~~liDa~~~~~S~~~n~~~~~~~~ 301 (530) ....+|+|| ||||+|.|+. +|.|++ +.|++|+|+|+..+|+.++..++|++| T Consensus 202 ---------------------~~~~~~~~g-vPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p 259 (504) T protein:vir:99 202 ---------------------ADVRTHKLG-VPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFP 259 (504) T ss_pred ---------------------eccccCCCC-cceEEecccccCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcch Confidence 123578998 9999999873 588888 489999999999999999999999999 Q ss_pred eeeeecCCCCch----hhHH--HHHhhCcceecCCCC--------ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCC Q lcl|NC_011308. 302 IYVVRGGTNSPV----DEIK--KNIQSKKIIQTKGEG--------GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSS 367 (530) Q Consensus 302 ~lvl~g~~~~~~----~~~~--~~~~~~~~i~~~~~~--------~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~ 367 (530) ++++.|...++. +... -....++++.+++++ ++++- +.+....+.++++|+..|+.++...++. T Consensus 260 ~r~i~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~--q~~~~~l~~~~~~l~~~i~~~a~~t~~P 337 (504) T protein:vir:99 260 QLILLGADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDAARARADVK--QFPASSPQPHIEMLEQIAMMFSGETSIP 337 (504) T ss_pred hhhhccCCccccccccccccchhhhhhhhhhcCCCccccccccCccceee--ecCCCChHHHHHHHHHHHHHHHhhhCCC Confidence 999999865431 1111 123345666666543 34443 4455556788888888888886555554 Q ss_pred cccc------cCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHH Q lcl|NC_011308. 368 AVGD------GNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLA 441 (530) Q Consensus 368 ~~~~------gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a 441 (530) +..| +++||+||++++.+|..||.++++.|+++|++++++++.+.........+...++++|.++.|.+..+.| T Consensus 338 ~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~~~v~w~d~~~~s~a~~a 417 (504) T protein:vir:99 338 VESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWKTIDSKFRSPLYLSKAAQA 417 (504) T ss_pred HHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceeEecCCCccCHHHHH Confidence 4433 3579999999999999999999999999999999999887665444455567899999999999999999 Q ss_pred HHHHHHHhcCC--C-cHHHHHHhCCCCCCHHHHHHHHHHHHHH--HHHHHHhhhccccccCCccccCCCCCCCCCCccCc Q lcl|NC_011308. 442 MIDKTEAETNQ--I-QINNLLAIAPRIGDEETLKAICDTLDLD--YEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEP 516 (530) Q Consensus 442 ~~~~~~~~~g~--i-S~et~l~~~~~vdd~~~e~~~~e~e~~e--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (530) +.+.+++++|. + ..+++++++++.. .++++++++..+ ....+..+..+...... . .+.+.++..+++ T Consensus 418 Da~~Kl~~ag~~l~~~~~~l~~~lg~~~---~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~-~----~~~~~~~~~e~a 489 (504) T protein:vir:99 418 DAGAKMLGAGPEWLKETEVGLELLGLTP---QQAKRALAERRRASSVSIIEALNRRQQEAAT-A----GEDQDQGAGEPP 489 (504) T ss_pred HHHHHHHhhccccccchHHHHhhcCCCH---HHHHHHHHHHHHHhhHHHHHHHhcccCCCCC-C----CCCCCcCCCCCC Confidence 99999888884 2 3588899998742 234444433322 12223333332221111 1 112223334445 Q ss_pred CCCCcccccccCCC Q lcl|NC_011308. 517 LNIDPVIEEEPVQE 530 (530) Q Consensus 517 ~~~~~~~~~~~~~~ 530 (530) .+..+...++|-++ T Consensus 490 ~~~~~~~~~~p~~~ 503 (504) T protein:vir:99 490 ANEPPAALGRPTLV 503 (504) T ss_pred CCCCCccCCCcccC Confidence 55666777777777 No 58 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=100.00 E-value=9.6e-60 Score=344.13 Aligned_cols=456 Identities=11% Similarity=0.017 Sum_probs=311.5 Q ss_pred CcccHHHHHHHHHHHH----------------HHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceee Q lcl|NC_011308. 9 APDRLGTILSTKIDEY----------------IRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKIS 72 (530) Q Consensus 9 ~~~~~~~~i~~~i~~~----------------~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~ 72 (530) =.+.+.++|+.++++. .+.+...++...++||.|+|+++.+...... ...++++|++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~-------~~~~~~~~~s 73 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHN-------GNPVNRRQLS 73 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccC-------CCccccceee Confidence 1122233344433321 0223346788889999999988776543322 2234567999 Q ss_pred cCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEec Q lcl|NC_011308. 73 HGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVD 151 (530) Q Consensus 73 ~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~ 151 (530) +|+++.||++.++||||+|++++++ ++...+.|+++++ ++|..++.+++..++.+|.+|.++|+|++|++++..++ T Consensus 74 ~n~~~~iv~~~a~~l~~ep~~i~~~---d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~ 150 (499) T protein:vir:80 74 MNLPKVTAKYMSKLLFNEKVKINID---DETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFAT 150 (499) T ss_pred cchHHHHHHHHHHhhhCCcceEeeC---CHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcEEEEEEc Confidence 9999999999999999999999774 4677888999986 56999999999999999999999999999999999999 Q ss_pred ccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEc--CCceEEEeecCCcccchhhccccccccccceeeee Q lcl|NC_011308. 152 ALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWT--DTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAV 229 (530) Q Consensus 152 p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt--~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (530) |.++||++.+++.+..++ |+..... ..+..+++|.|+ ......|........ ..... T Consensus 151 a~~~~Pi~~d~~~~~~~~-f~~~~~~-----~~~~y~~lE~h~~~~~~~~~y~I~n~~~~-------~~~~~-------- 209 (499) T protein:vir:80 151 ADCMYPLSNDSENVDECL-IANSFHK-----NNKYYKLLEWNEWKGEKEEVYTVTTELYQ-------SDDPN-------- 209 (499) T ss_pred CCceEEEEecCCCeEEEE-EEEEEee-----cCeEEEEEEEEEecccceeeEEEEEEEEe-------ccCcc-------- Confidence 999999987776654433 3322221 223445555433 222222221100000 00000 Q ss_pred ecccccceecccccccccccccccccCCccceEEeeCCc---------CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011308. 230 ADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK---------LGISDIKKVKSIIDDYDLMNCFLSNNLQDMAE 300 (530) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~---------~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~ 300 (530) ..+........ ...........++++.|+++|+++. .|.|+|+++++|||+||.++|++++.++.... T Consensus 210 ~lG~~v~l~~~---~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~ 286 (499) T protein:vir:80 210 ELGGKVSLKLL---FNDIEPVVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKK 286 (499) T ss_pred ccCcccchhhh---ccCcCCceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhccc Confidence 00000000000 0001111223468899999998752 48999999999999999999999999999888 Q ss_pred ceee-------eecCCCCchhhHHHHHhhCcceecCCC-C--ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCc Q lcl|NC_011308. 301 AIYV-------VRGGTNSPVDEIKKNIQSKKIIQTKGE-G--GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSA 368 (530) Q Consensus 301 ~~lv-------l~g~~~~~~~~~~~~~~~~~~i~~~~~-~--~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~ 368 (530) +++| ..+.+++....+..+.+..+++....+ + .++.++.++..++....++.+.+.|...+..+. +++ T Consensus 287 ~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~ 366 (499) T protein:vir:80 287 KVLVPSSFVKTAVNLDGSTTQYFDSTDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTF 366 (499) T ss_pred ceecchhhhhccCCCCCCcccCCCcccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCC Confidence 8887 444455544555556666666654332 2 467777888999998888888888877776553 345 Q ss_pred ccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---CCCccccceeeEEeCCCCCCCHHHHHHHHH Q lcl|NC_011308. 369 VGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRR---GLGDYSSTDIKFDIEPYILANELDLAMIDK 445 (530) Q Consensus 369 ~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~---~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~ 445 (530) ...|+.||.+++++++.+.++|..+++.|+.+|++++++|+.+.... .+..++...+++.|++.+|.|..+.+++.+ T Consensus 367 ~~~g~~TAtei~s~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~ 446 (499) T protein:vir:80 367 DENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYT 446 (499) T ss_pred CcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHH Confidence 55577789999999999999999999999999999999998876543 334556678999999999999999999999 Q ss_pred HHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCcccc Q lcl|NC_011308. 446 TEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVT 502 (530) Q Consensus 446 ~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~ 502 (530) .++.+|++|++|+++.+++++|++++ +++++.++|....+ ..++..+..++.+ T Consensus 447 ~~~~~Gi~S~et~l~~~~~~~d~ea~-~el~~i~~E~~~~~---~~~d~~g~~ge~e 499 (499) T protein:vir:80 447 TAKNQGMIPLKIALQRAWNITEAEAD-EWAEMLAKEKQAEI---PNNDMTGIFGEEE 499 (499) T ss_pred HHHHcCCCCHHHHHhhcCCCChHHHH-HHHHHHHHHhhcCC---CCCCccccCCCCC Confidence 99999999999999999999886643 33333333332221 1111111111111 No 59 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=100.00 E-value=4.4e-56 Score=324.05 Aligned_cols=400 Identities=11% Similarity=0.016 Sum_probs=293.8 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |+. ..|..++.++.+ +..++..+.+||+|+|++..-... ..+..+..+|++.||++++|+..++.+.-. T Consensus 1 m~~-~~i~~L~~~~~~--~~~r~~~~~~yy~g~~~~~~~~~~--------~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~~ 69 (422) T protein:vir:97 1 MNY-MGMGYLRRKLAL--FKTGVDKRYRYYAMDDRDDTRSIV--------MPNNVREMYRSVLEWTAKGVDSLADRIIFR 69 (422) T ss_pred CCh-HHHHHHHHHHHH--HHHHHHHHHHHHhcCCChhhcCcc--------ccHHHHHHHHhhcchhHHHHHHHHhccccc Confidence 432 234555555443 456788999999999987443221 223344566888899999999999855444 Q ss_pred ceeeecCCcchHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecC-CCceEEEEecccceEEEEcCCCC-cee Q lcl|NC_011308. 91 GIDVKPTDHDDQKLCYLIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTS-EDKLTFQTVDALQLLPVFDDYGT-LQR 167 (530) Q Consensus 91 pv~~~~~~~~de~~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~-~g~~~~~~~~p~~~~~v~d~~~~-~~~ 167 (530) .+ +.. |.. +++++ .|+++....++++++.++|+||.++|.++ +|.++++++||.+++.+||+... +.+ T Consensus 70 Gf--~~~---d~~----l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p~i~~~sp~~~~~i~D~~~~~~~~ 140 (422) T protein:vir:97 70 EF--TND---DFN----AWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAEDGLPKMQVIEASKATGILDPTTFLLTE 140 (422) T ss_pred ee--eCC---chh----HHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCCCeeEEEEechhhEEEEEeCCCCccee Confidence 43 332 333 34444 48899999999999999999999999986 68889999999999999987544 445 Q ss_pred EEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccccccc Q lcl|NC_011308. 168 IIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEG 247 (530) Q Consensus 168 ~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (530) ++++|.. +. ......+.+|++..++.++..+. T Consensus 141 a~~~~~~---~~----~~~~~~~~~~~~~~~~~~~~~~~----------------------------------------- 172 (422) T protein:vir:97 141 GYAILES---DS----NGNPTLEAYFTDKDIWYYPKKGK----------------------------------------- 172 (422) T ss_pred eEEEEEe---cC----CCcEEEEEEEcCceEEEEcCCCc----------------------------------------- Confidence 5554432 11 11233445566665554432111 Q ss_pred ccccccccCCccceEEeeCCc-----CCCCcH-HHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHh Q lcl|NC_011308. 248 RQVLGRSYKSRFPFDILYNNK-----LGISDI-KKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQ 321 (530) Q Consensus 248 ~~~~~~~~~~~iPiv~~~nn~-----~~~sd~-e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~ 321 (530) ....+|++|.||+|+|.|+. +|.|++ +.|++|+|+|+..+|+..+..++|++|++++.|.+.+......-... T Consensus 173 -~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~ 251 (422) T protein:vir:97 173 -PYNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDPDAKPMEKWRAT 251 (422) T ss_pred -cccccCCCCCcceEEecccCCCccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccCcccccCchhhhh Confidence 01236999999999999874 588888 78999999999999999999999999999999986533221111233 Q ss_pred hCcceecCCCC---ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccccc----C-CcHHHHHHHHhhHHHHHHHH Q lcl|NC_011308. 322 SKKIIQTKGEG---GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDG----N-ATNVVIKSRYTLLAMKAQKT 393 (530) Q Consensus 322 ~~~~i~~~~~~---~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~g----n-~SGvAik~~~~~l~~ka~~k 393 (530) .++++.++.+. ++++ ++.+....+.++++|+..|+.+|...++.+..|| | +||+||++.+.+|..||.++ T Consensus 252 ~~~i~~~~~de~~~~~~v--~q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k 329 (422) T protein:vir:97 252 VSTLLEISKDEDGDKPTV--GQFTTASMAPFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKA 329 (422) T ss_pred hhhhhccCCCCCCCccee--eecCCCChhHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHH Confidence 44677776543 3554 4556666778899999999999887777766664 3 68999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCH---HHHHHHHHHHHhc--CCCcHHHHHHhCCCCCCH Q lcl|NC_011308. 394 EIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANE---LDLAMIDKTEAET--NQIQINNLLAIAPRIGDE 468 (530) Q Consensus 394 e~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~---~e~a~~~~~~~~~--g~iS~et~l~~~~~vdd~ 468 (530) ++.|+.+|++++++++++.........+..++.+.|.++.|.+. ++.|+.+.++.++ |++|.++++++++| +++ T Consensus 330 ~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~-~~~ 408 (422) T protein:vir:97 330 QRSFSSGFLNVAYIAVCLRDEFPYLRNQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTGV-KGA 408 (422) T ss_pred HHHHHHHHHHHHHHHHHHhcCCcccchhhccceEEEccCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcCC-Cch Confidence 99999999999999988766544444556789999999999994 4455556667777 68999999999998 777 Q ss_pred HHHHHHHHHHHHHH Q lcl|NC_011308. 469 ETLKAICDTLDLDY 482 (530) Q Consensus 469 ~~e~~~~e~e~~e~ 482 (530) +.+..++++.+.+- T Consensus 409 ~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 409 DKPIPAITEVTTDG 422 (422) T ss_pred hHHHHHHHhhhccC Confidence 88877777765554 No 60 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=100.00 E-value=9.8e-55 Score=316.67 Aligned_cols=388 Identities=12% Similarity=0.009 Sum_probs=293.6 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |+ .++|.+++.++.. +..++..+.+||+|+|.+..-... .....++++|++.||++++|+..++.+.-+ T Consensus 1 ~~-~~~i~~L~~~~~~--~~~r~~~~~~yY~g~~~~~~~~~~--------~p~~~~~~~~~v~nw~~~iVds~a~rl~~~ 69 (409) T protein:vir:94 1 MT-EKGIGYLRFKLSV--HKRRAEMRYDQYAMKYVDRFKGIT--------IPQALSQQYRSILGWCAKGVDSLADRLVFR 69 (409) T ss_pred CC-HHHHHHHHHHHHH--HhHHHHHHHHHhcccCchhhcChh--------hhHHHHHHHhhhcchhHHHHHHhHhhcccC Confidence 43 2355666666543 457788999999999987432221 122334567899999999999999966544 Q ss_pred ceeeecCCcchHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEcCC-CCceeE Q lcl|NC_011308. 91 GIDVKPTDHDDQKLCYLIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFDDY-GTLQRI 168 (530) Q Consensus 91 pv~~~~~~~~de~~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~-~~~~~~ 168 (530) . |.. .|.. +++++ .|+++....++++.+.++|+||.++|.+++|+++++.+||.+++.+||+. .++.++ T Consensus 70 G--f~~---~d~~----l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~~i~~~sp~~~~~i~D~~~~~~~~a 140 (409) T protein:vir:94 70 E--FEN---DDFT----VNEIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAVRLQVIEAVNATGIIDPITGLLTEG 140 (409) T ss_pred c--ccC---CchH----HHHHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCceEEEEeccceEEEEEecCCCceeee Confidence 4 332 3333 44555 47788899999999999999999999999999999999999999999874 456666 Q ss_pred EEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccc Q lcl|NC_011308. 169 IRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGR 248 (530) Q Consensus 169 ~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 248 (530) ++++... .........+|+++.++.|....+.. T Consensus 141 ~~~~~~d-------~~~~~~~~~~~~~~~~~~~~~~~~~~---------------------------------------- 173 (409) T protein:vir:94 141 YAVLERD-------ENNNVVLEAHFLPDRTDYYYRDSRNN---------------------------------------- 173 (409) T ss_pred EEEEEec-------CCCceEEEEEEecCcEEEEEecCcee---------------------------------------- Confidence 6655321 12234567789999988876543221 Q ss_pred cccccccCCccceEEeeCCc-----CCCCcH-HHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhh Q lcl|NC_011308. 249 QVLGRSYKSRFPFDILYNNK-----LGISDI-KKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQS 322 (530) Q Consensus 249 ~~~~~~~~~~iPiv~~~nn~-----~~~sd~-e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~ 322 (530) ...+|++|.||+|+|.|+. +|.|++ +.|++|+|++++.+|+..+..++|++|++++.|.+.+......-.... T Consensus 174 -~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~ 252 (409) T protein:vir:94 174 -ISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMETWKATV 252 (409) T ss_pred -EeeeCCCCCcceEEeccccccccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCcccchhhhhH Confidence 1236999999999999874 578888 789999999999999999999999999999999865332211112223 Q ss_pred CcceecCCC---CceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccccc----C-CcHHHHHHHHhhHHHHHHHHH Q lcl|NC_011308. 323 KKIIQTKGE---GGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDG----N-ATNVVIKSRYTLLAMKAQKTE 394 (530) Q Consensus 323 ~~~i~~~~~---~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~g----n-~SGvAik~~~~~l~~ka~~ke 394 (530) ++++.++++ .++++-+ .+....+.++++|+..|+.+|++.++.+..|| | +||.||+....+|..||.+++ T Consensus 253 ~~i~~~~~d~dg~~~~v~q--~~~~~l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~ 330 (409) T protein:vir:94 253 SSMLQFTKDEDGDKPTLGQ--FTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQ 330 (409) T ss_pred HHhhcCCCCCCCCCceEEe--cCCCChhHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHH Confidence 567776543 3456544 44555678899999999999988887777775 3 699999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCH---HHHHHHHHHHHhcC--CCcHHHHHHhCCCCCCH Q lcl|NC_011308. 395 IALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANE---LDLAMIDKTEAETN--QIQINNLLAIAPRIGDE 468 (530) Q Consensus 395 ~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~---~e~a~~~~~~~~~g--~iS~et~l~~~~~vdd~ 468 (530) +.|+++|++++++++++.........+...+++.|.+..|.+. ++.|+.+.+++++| +.+.++++.+++|.++- T Consensus 331 ~~fg~~~~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 331 RSLGAGLLNVAYLAACLRDDAPYLREQFRKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred HHHHHHHHHHHHHHHHHhCCCCccccccccceEEeccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 9999999999999888766554445566789999998888885 45566678899998 56889999999997654 No 61 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=100.00 E-value=6.6e-54 Score=312.14 Aligned_cols=389 Identities=11% Similarity=0.014 Sum_probs=289.1 Q ss_pred HHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhcccceeeecCCcc Q lcl|NC_011308. 21 IDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLANGIDVKPTDHD 100 (530) Q Consensus 21 i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~ 100 (530) ++.| ..++..+.+||.|+|.+-.-.. ......++++|++.||++++|+..++.+.-+.++ .. T Consensus 1 l~~~-----~~r~~~~~~yY~g~~~~~~~~~--------~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~--~~--- 62 (410) T protein:vir:95 1 MNLY-----QSRVNLRYKHYAMQHYEAPTGI--------TIPAHIRAKYQAVLGWAAKGVDSLADRLIFRAFA--ND--- 62 (410) T ss_pred CCcc-----hhhHHHHHHHhcCCCCccccch--------hccHHHHhHHHhhcchhHHHHHHhHhhhcccccc--CC--- Confidence 3332 4668889999999997733221 1223345677899999999999999988766543 22 Q ss_pred hHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEcCC-CCceeEEEEEEEEeec Q lcl|NC_011308. 101 DQKLCYLIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFDDY-GTLQRIIRFYTEQRYS 178 (530) Q Consensus 101 de~~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~-~~~~~~~~~y~~~~~~ 178 (530) |.. +++++ .|+++....++++++.++|+||.++|.+++|.++++++||.+++.+||+. ..+.++++++... T Consensus 63 d~~----l~~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~~i~~~sP~~~~~i~Dp~~~~~~~al~~~~~~--- 135 (410) T protein:vir:95 63 DFN----VTEIFDRNNPDIFFDSAILSALIGSCSFVYISKGEDDEVRLQVIESSNATGVIDPITGLLVEGYAVLARD--- 135 (410) T ss_pred Cch----HHHHHhhcChHHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEEeCCCCceEEEEEEEEec--- Confidence 222 44455 47889999999999999999999999999999999999999999999874 4566666655321 Q ss_pred ccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCc Q lcl|NC_011308. 179 DADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSR 258 (530) Q Consensus 179 ~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 258 (530) .......+.+|+++.++.+...++.. ..+|++|. T Consensus 136 ----~~~~~~~~~~~~~~~~~~~~~~~~~~------------------------------------------~~~~~~g~ 169 (410) T protein:vir:95 136 ----DYNRPTLEAYFEPNATHFIPKDGEPY------------------------------------------SVTNETGI 169 (410) T ss_pred ----CCCeEEEEEEEeCCcEEEEeeCCccc------------------------------------------cccCCCCC Confidence 22346778899999988876433210 13599999 Q ss_pred cceEEeeCCc-----CCCCcH-HHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCC Q lcl|NC_011308. 259 FPFDILYNNK-----LGISDI-KKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEG 332 (530) Q Consensus 259 iPiv~~~nn~-----~~~sd~-e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~ 332 (530) ||+|+|.|+. +|.|++ +.|++|+|++++.++++.+..++|++|++++.|.+.+......-....++++.++++. T Consensus 170 vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~~~ 249 (410) T protein:vir:95 170 PLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQKYILGLDPDAEPMEKWKATVSSLLTISSSD 249 (410) T ss_pred cceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeeccCCCCCcCchhhhhhhhheeccCCC Confidence 9999999864 578887 7899999999999999999999999999999998654322212223345677776543 Q ss_pred ---ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccccc----C-CcHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 333 ---GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDG----N-ATNVVIKSRYTLLAMKAQKTEIALRKTLRWT 404 (530) Q Consensus 333 ---~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~g----n-~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~ 404 (530) .+++.+ .+......++++|+..++.++...++....|| | +||.||++...+|..||.++++.|+.+|+++ T Consensus 250 ~~~~~~v~q--~~~~~l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~ 327 (410) T protein:vir:95 250 KGVKPSVGQ--FTTASMSPFTEQLRTAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNV 327 (410) T ss_pred CCCcceEEe--cCCCChHHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 356544 44455667888888888888877777666664 3 6999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCCccccceeeEEeCCCCCC---CHHHHHHHHHHHHhc--CCCcHHHHHHhCCCCCCHHHHHHHHHHHH Q lcl|NC_011308. 405 ADLVVEDIRRRGLGDYSSTDIKFDIEPYILA---NELDLAMIDKTEAET--NQIQINNLLAIAPRIGDEETLKAICDTLD 479 (530) Q Consensus 405 ~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~---n~~e~a~~~~~~~~~--g~iS~et~l~~~~~vdd~~~e~~~~e~e~ 479 (530) +++++++.........+...+.+.|.++.+. +....|+...++.++ |+++.+++++.++|.++. +.+++.+++ T Consensus 328 ~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~~--~~~~~~~e~ 405 (410) T protein:vir:95 328 AYVAACLRDEFRYTRSQFVRTAVKWEPLFEADANTMTMIGDGVVKLNQALPGYINAETIRDLTGIAGDM--SAKPVVSEG 405 (410) T ss_pred HHHHHHHhcCCCCcccccceeeEEeeecCCcchhhHHHHHHHHHHHHHhccCCccHHHHHHhcCCChHH--HHHHHHHHH Confidence 9998887665444445556789999866543 467788887788877 789999999999997653 222332222 Q ss_pred HHHHH Q lcl|NC_011308. 480 LDYED 484 (530) Q Consensus 480 ~e~~~ 484 (530) ...-+ T Consensus 406 ~~~g~ 410 (410) T protein:vir:95 406 GSNGE 410 (410) T ss_pred HhCCC Confidence 11111 No 62 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=100.00 E-value=4.8e-52 Score=301.93 Aligned_cols=388 Identities=12% Similarity=0.006 Sum_probs=290.3 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |+. +.|.+++.+... +..++..+.+||+|+|.+-.-... .....+.++|+..||++++|+..++.+.=+ T Consensus 1 ~~~-~~i~~L~~~~~~--~~~r~~~~~~yY~g~~~~~~~~~~--------~p~~~~~~~~~v~nw~~~iVds~a~rl~~~ 69 (409) T protein:vir:16 1 MTE-KGIGYLRFKLSV--HKRRAEMRYEQYAMKHVDRFKGIT--------IPQALSQQYRSILGWCAKGVDSLADRLVFR 69 (409) T ss_pred CCH-HHHHHHHHHHHH--HhHHHHHHHHHHhccCchhhcchh--------hhHHHHHHHhhhcChhHHHHHHhHhhcccc Confidence 543 355666655543 457889999999999976332111 122344567889999999999999977544 Q ss_pred ceeeecCCcchHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEcCC-CCceeE Q lcl|NC_011308. 91 GIDVKPTDHDDQKLCYLIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFDDY-GTLQRI 168 (530) Q Consensus 91 pv~~~~~~~~de~~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~-~~~~~~ 168 (530) . |.. .|+. +++++ .|+++....++++.+.++|+||.++|.+++|.++++.+||.+++++||+. .++.++ T Consensus 70 G--f~~---~d~~----l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~~i~~~sP~~~~~i~D~~~~~~~~a 140 (409) T protein:vir:16 70 E--FEN---DDFT----VNEIFEENNPDIFFDSTVLSALIASCSFTYISKGENDAVRLQVIEATNATGIIDPITGLLTEG 140 (409) T ss_pred c--ccC---cchH----HHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEeecccccceee Confidence 4 332 3333 44555 37889999999999999999999999999999999999999999999874 456677 Q ss_pred EEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccc Q lcl|NC_011308. 169 IRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGR 248 (530) Q Consensus 169 ~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 248 (530) ++++... .........+|+++.++.|...++.. T Consensus 141 ~~~~~~d-------~~~~~~~~~~~~~~~~~~~~~~~~~~---------------------------------------- 173 (409) T protein:vir:16 141 YAVLERD-------ENNNVVLEAHFLPDRTDYYYRDSRNN---------------------------------------- 173 (409) T ss_pred eEEEEec-------CCCceEEEEEEecCcEEEEEecCccc---------------------------------------- Confidence 7766421 11223556788888887765433211 Q ss_pred cccccccCCccceEEeeCCc-----CCCCcH-HHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhh Q lcl|NC_011308. 249 QVLGRSYKSRFPFDILYNNK-----LGISDI-KKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQS 322 (530) Q Consensus 249 ~~~~~~~~~~iPiv~~~nn~-----~~~sd~-e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~ 322 (530) ...+|++|.||+|+|.|+. +|.|++ +.|++|+|++++.+|+..+..++|++|++++.|.+.+......-.... T Consensus 174 -~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~ 252 (409) T protein:vir:16 174 -ISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMETWKATV 252 (409) T ss_pred -cceecCCCCcceEEecccccccccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCCccchhhhhh Confidence 1246999999999999874 688988 679999999999999999999999999999999865322211112233 Q ss_pred CcceecCCC---CceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccccc----C-CcHHHHHHHHhhHHHHHHHHH Q lcl|NC_011308. 323 KKIIQTKGE---GGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDG----N-ATNVVIKSRYTLLAMKAQKTE 394 (530) Q Consensus 323 ~~~i~~~~~---~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~g----n-~SGvAik~~~~~l~~ka~~ke 394 (530) ++++.++++ .++++ ++.+....+.++++|+..++.+|+..++....|| | +||.||+....+|..||.+++ T Consensus 253 ~~i~~~~~d~~g~~~~v--~q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~ 330 (409) T protein:vir:16 253 SSMLQFTKDEDGDKPTL--GQFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQ 330 (409) T ss_pred hHhhccCCCCCCCCceE--EecCCCChhHHHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHH Confidence 567777543 33554 3456666778899999999999988877777764 3 689999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCC---HHHHHHHHHHHHhcCC--CcHHHHHHhCCCCCCH Q lcl|NC_011308. 395 IALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILAN---ELDLAMIDKTEAETNQ--IQINNLLAIAPRIGDE 468 (530) Q Consensus 395 ~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n---~~e~a~~~~~~~~~g~--iS~et~l~~~~~vdd~ 468 (530) +.|+.+|++++++++.+....+........+.+.|.+..|.+ ..+.|+.+.+++++|. ...++++..++|.++- T Consensus 331 ~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 331 RSLGAGLLNVAYLAACLRDDVPYLREQFSKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred HHHHHHHHHHHHHHHHHhcCCCccchhhccceEEecCCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 999999999999998876554333333467899999887766 5677788888888873 3568899999987654 No 63 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=100.00 E-value=6.3e-50 Score=290.30 Aligned_cols=445 Identities=13% Similarity=0.110 Sum_probs=297.0 Q ss_pred ccHHHHHHHHHHHHHH--------------------hhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcce Q lcl|NC_011308. 11 DRLGTILSTKIDEYIR--------------------SQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIK 70 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~--------------------~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~k 70 (530) |.+.+.|+.+|.+... .+.+.++...++||.|+|+.+++.... ...+..++ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~---------~~~~~~~~ 71 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSY---------GDTQKHEL 71 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccC---------CCccccce Confidence 7777777776665321 223456677889999999877643311 12234468 Q ss_pred eecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEE Q lcl|NC_011308. 71 ISHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQT 149 (530) Q Consensus 71 i~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~ 149 (530) +++|+++.||++.++|+||+|++++++ ++..++.|+++++ ++|..++.+++..++..|.++..+|+| .|++++.. T Consensus 72 ~slnl~~~i~~~~A~ll~~e~~~i~~~---d~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D-~~~~~i~~ 147 (505) T protein:vir:79 72 QSVNVTKLASAKLASLIFNEQCQVTVS---DETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVD-SGKIKLAW 147 (505) T ss_pred eecchHHHHHHHHHhhhcCCCceeecC---ChHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEe-CCceEEEE Confidence 889999999999999999999999864 4567888999986 569999999999999999999999998 57899999 Q ss_pred ecccceEEEEcCCCCcee--EEEEEEEEeecccccccceEEEEEEEcCCceEE---EeecCCcccchhhccccccccccc Q lcl|NC_011308. 150 VDALQLLPVFDDYGTLQR--IIRFYTEQRYSDADNKFNSIGHADVWTDTEVWY---YVQKDEGRSDEYVLDTTVNPNPSQ 224 (530) Q Consensus 150 ~~p~~~~~v~d~~~~~~~--~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~---y~~~~~~~~~~~~~~~~~~~~~~~ 224 (530) ++|.++||++.+++.... ++..|.. ........++.+|.|+.....+ +..........+...+. T Consensus 148 v~ad~~~P~~~d~~~~~~~a~~~~~~~----~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~------- 216 (505) T protein:vir:79 148 ATADQVYPLQADTNQVNELAIASRTTE----VENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVP------- 216 (505) T ss_pred EcCCeeEEEEEcCCCeEEEEEEEEEEE----ecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccc------- Confidence 999999999655555332 2222221 1122333456677776332221 11111111100000000 Q ss_pred eeeeeecccccceecccccccccccccccccCCccceEEee----CCc-----CCCCcHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 225 HVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILY----NNK-----LGISDIKKVKSIIDDYDLMNCFLSNNL 295 (530) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~----nn~-----~~~sd~e~v~~liDa~~~~~S~~~n~~ 295 (530) +..+... .+.. ......++.+.++++|+ ||. .|.|+|++++++||++|.++|++++.+ T Consensus 217 -l~~~~~~-------~~l~-----~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~ 283 (505) T protein:vir:79 217 -LNSLEQY-------EGLE-----PQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEV 283 (505) T ss_pred -hhhcccc-------cccC-----cceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHH Confidence 0000000 0000 01112344555555554 443 489999999999999999999999999 Q ss_pred HHhccceee----eecC--C-CCchh-h---HHHHHhhCcceecC-CCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011308. 296 QDMAEAIYV----VRGG--T-NSPVD-E---IKKNIQSKKIIQTK-GEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMG 363 (530) Q Consensus 296 ~~~~~~~lv----l~g~--~-~~~~~-~---~~~~~~~~~~i~~~-~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~ 363 (530) +....+++| ++.. + +.... . +..+.+..+.+..+ +++.++.++.++..++....++.+.+.|...+.. T Consensus 284 ~~g~~~i~v~~~~l~~~~~~~~~~~~~~~~~fd~~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~ 363 (505) T protein:vir:79 284 KKGQRRLIVPAEWLKTGSSYGGQASETHPPMFDPDETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGL 363 (505) T ss_pred HhcccceeechHHhcccCCCCcccccccccCCCccceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCC Confidence 988888777 3211 1 11100 0 11111112222222 2345788888889999999999999988887765 Q ss_pred cC--CCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---------CccccceeeEEeCCC Q lcl|NC_011308. 364 FN--SSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGL---------GDYSSTDIKFDIEPY 432 (530) Q Consensus 364 p~--~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~---------~~~d~~~i~i~f~~~ 432 (530) +. +++.+.|..||.++++.++.+..+++++++.|+.+|+++++.|+.+....+. ...+..++++.|.+. T Consensus 364 s~~~~~~~~~~~~TAtei~s~~~~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~ 443 (505) T protein:vir:79 364 SQGTFTTSPSGIQTATEVVTNNSQTYQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDG 443 (505) T ss_pred ChhhcCCCccccchHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCC Confidence 42 4555557779999999999999999999999999999999999887655321 234445799999999 Q ss_pred CCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHH--HHHHHHHHHHHHHHHHHHhhhccccccCCcc Q lcl|NC_011308. 433 ILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEE--TLKAICDTLDLDYEDVVKALEDQEVEELEPT 500 (530) Q Consensus 433 ~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~--~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~ 500 (530) +|.|..+.++..+.++.+|++|++++++.+|.+++.+ +|++|+++|+.. .++.+.... ++ T Consensus 444 i~~d~~~~~~~~~~~v~~Gi~s~e~~l~~~~~~~eeea~~el~ri~~E~~~---~~p~~~~~g-----g~ 505 (505) T protein:vir:79 444 VFVDQESKRAADLQAVQAQVMPKKQFLMRNYGLDEEEADEWLAQIDAENST---AEPEFNQFG-----GD 505 (505) T ss_pred CCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhccc---cCCCchhcc-----CC Confidence 9999988888888999999999999999999998844 455555554332 222221111 01 No 64 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=100.00 E-value=8.1e-49 Score=284.24 Aligned_cols=454 Identities=11% Similarity=0.004 Sum_probs=296.0 Q ss_pred ccHHHHHHHHHHHHH--------------------HhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcce Q lcl|NC_011308. 11 DRLGTILSTKIDEYI--------------------RSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIK 70 (530) Q Consensus 11 ~~~~~~i~~~i~~~~--------------------~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~k 70 (530) |.+.+-|+.++.+-. ......++...++||+|+|+.+..... ....+...| T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~---------~~~~~~~~~ 71 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQAS---------DGIKKKRLK 71 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccC---------CCCccccce Confidence 655555555543311 223456788999999999976543211 111223347 Q ss_pred eecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEE Q lcl|NC_011308. 71 ISHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQT 149 (530) Q Consensus 71 i~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~ 149 (530) ++.|+++.||++.++|+||+|+++++.+ ++...+.|+++++ ++|...+.+++..++.+|.++..+|+|. |.+++.. T Consensus 72 ~sln~~~~i~~~~A~lv~~e~~~i~v~~--~~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~i~~ 148 (508) T protein:vir:15 72 NTINMAKTAARRIASVVFNEKAEIHVKD--NNEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDG-NHIKIAW 148 (508) T ss_pred eecchHHHHHHHHHhhhhCCCceEEeCC--chHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeC-CeeEEEE Confidence 8999999999999999999999998754 3455677899885 6799999999999999999999999985 6799999 Q ss_pred ecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEc--C--CceEEEeecCCcccchhhccccccccccce Q lcl|NC_011308. 150 VDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWT--D--TEVWYYVQKDEGRSDEYVLDTTVNPNPSQH 225 (530) Q Consensus 150 ~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt--~--~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~ 225 (530) ++|.++||+.-+++....+.-++..... .....+.++++|.|+ . .+...+..........+... .. T Consensus 149 v~ad~~~P~~~d~~~~~~~af~~~~~~~--~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~--------v~ 218 (508) T protein:vir:15 149 VRADQFYPLQSNTNDISEAAIASRTQRT--ESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQ--------VP 218 (508) T ss_pred EcCCeeEEEEEcCCCeEEEEEEEEEEee--cCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcc--------cc Confidence 9999999975445544322222222111 112334456666665 1 12222222111110000000 00 Q ss_pred eeeeecccccceecccccccccccccccccCCccceEEeeCC---------cCCCCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 226 VLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNN---------KLGISDIKKVKSIIDDYDLMNCFLSNNLQ 296 (530) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn---------~~~~sd~e~v~~liDa~~~~~S~~~n~~~ 296 (530) +..+... . .........++.+.|+++|+++ ..|.|+|++++++||++|.+.|++++.++ T Consensus 219 l~~~~e~-----------~-~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~ 286 (508) T protein:vir:15 219 LSTLPVY-----------K-ELAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIR 286 (508) T ss_pred hhhcccc-----------c-CCCcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHH Confidence 0000000 0 0001122346777888888763 24899999999999999999999999998 Q ss_pred Hhccceeeee---cCCCCchhhHHHHHhhCcceecCC--CCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcc Q lcl|NC_011308. 297 DMAEAIYVVR---GGTNSPVDEIKKNIQSKKIIQTKG--EGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAV 369 (530) Q Consensus 297 ~~~~~~lvl~---g~~~~~~~~~~~~~~~~~~i~~~~--~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~ 369 (530) ....+++|-. ..+.+....+..+.+..+.+..+. +..++.++.++..+...+.++.+.+.|...+..+. +++. T Consensus 287 ~~~~~i~v~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~ 366 (508) T protein:vir:15 287 LGQKHIAVQPGMLRFDDEHKPTFDTEQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYS 366 (508) T ss_pred hcccceeechHHhcCCCCCccccCCCCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccc Confidence 7667777732 223322222222222233333333 34588889999999999999999988888776553 3444 Q ss_pred cccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-----C------ccccceeeEEeCCCCCCCHH Q lcl|NC_011308. 370 GDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGL-----G------DYSSTDIKFDIEPYILANEL 438 (530) Q Consensus 370 ~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~-----~------~~d~~~i~i~f~~~~P~n~~ 438 (530) +.|..||.++++..+.+..+++.+++.|+.+|++++++|+.++...+. . ..+..+++|.|++.+|.|.. T Consensus 367 ~~~~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~ 446 (508) T protein:vir:15 367 NDGVKTATEVVSNNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKD 446 (508) T ss_pred cCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHH Confidence 556679999999999999999999999999999999999887664321 1 12234689999999999998 Q ss_pred HHHHHHHHHHhcCCCcHHHHHHhCCCCCCHH--HHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCC Q lcl|NC_011308. 439 DLAMIDKTEAETNQIQINNLLAIAPRIGDEE--TLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDP 507 (530) Q Consensus 439 e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~--~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (530) +.++..+.++.+|++|++++++.+|.+++.+ +|++|+++|+.+. .... + ...+. .+.+++ T Consensus 447 ~~~~~~~~~v~aGi~s~e~~i~~~~g~~deea~~el~ri~~E~~~~---~~~~--~---~~~~~-~g~~ge 508 (508) T protein:vir:15 447 KQLEEDAKVLAIGALSKQTFLQRNYGMTDEQAAEELAKIQSEAPTD---TFEG--G---RSAIL-NGGDGE 508 (508) T ss_pred HHHHHHHHHHhcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhcccc---Cccc--c---ccccC-CCCCCC Confidence 8888888999999999999999999998754 4555555443221 0000 0 00001 111110 No 65 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=100.00 E-value=1.7e-49 Score=288.00 Aligned_cols=436 Identities=12% Similarity=0.041 Sum_probs=297.4 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) =|.-+. +-+++...++..++.++.. +..++..+.+||.|+|.+-.-.... ....+ +.|++.||++++| T Consensus 5 ~~~~~~-gl~~~~~~~~~~L~~~~~~--~~~~~~~~~~Yy~G~~~~~~~~~~~--------p~~~r-~~~~v~nw~~~~V 72 (474) T protein:vir:81 5 QTVRIP-SLSNDENALINGLLAQIEN--LRWKNLLRTSYYENKRTIQYVGTLI--------PPQYF-NLGLVLGWTGKAV 72 (474) T ss_pred CcCcCC-CCChhHHHHHHHHHHHHHH--HhhHHHHHHHHhccCCChhhccccc--------cHHHH-HHHhhcChHHHHH Confidence 122222 3334455677888877653 4577889999999999864432211 11222 4578899999999 Q ss_pred hhhhhhhcccceeeecCCcchHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCc--eEEEEecccceEE Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHDDQKLCYLIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTSEDK--LTFQTVDALQLLP 157 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~--~~~~~~~p~~~~~ 157 (530) +..+..+.-++++....+..+. .+++++ .|+++....++++++.++|+||.+++.+++|+ .+++++||.++++ T Consensus 73 d~~a~rl~~~Gf~~~d~~~~~~----~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~~ 148 (474) T protein:vir:81 73 DALARRCNLEGFVWPDGDLDSL----GGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEATG 148 (474) T ss_pred HHHHhhhcccceECCCCCccch----HHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEEE Confidence 9999988888766432222222 244555 47888899999999999999999999977765 6788999999999 Q ss_pred EEcCCCC-ceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccc Q lcl|NC_011308. 158 VFDDYGT-LQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEA 236 (530) Q Consensus 158 v~d~~~~-~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (530) +||+... ..+++.++... .......+.+|+++.++.|...+++.. T Consensus 149 ~~D~~~~~~~~al~~~~~~-------~~g~~~~~~ly~~~~~~~~~~~~~~~~--------------------------- 194 (474) T protein:vir:81 149 EWNRRRRGLNNLLSIIDKD-------KEGKVLSLALYLDNETVTAQRDKATLK--------------------------- 194 (474) T ss_pred EEeCCCCcceeeeEEEEEc-------CCCcEEEEEEEeCCcEEEEEEcCccce--------------------------- Confidence 9988543 44555444321 122345678999999988875443211 Q ss_pred eecccccccccccccccccCCccceEEeeCCc-----CCCCcH-HHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC Q lcl|NC_011308. 237 ILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDI-KKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN 310 (530) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~-e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~ 310 (530) +.....+|++| ||+|+|.|+. +|.|++ +.+++|+|++++.++++....++|++|..++.|... T Consensus 195 ----------w~~~~~~~~~g-vPvV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~ 263 (474) T protein:vir:81 195 ----------WQVDRDEHVYG-VPAQVLPYKPAPKRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADE 263 (474) T ss_pred ----------eeeccCCCCCC-cceEEecccccccCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCCh Confidence 11234579998 8999999974 688887 799999999999999999999999999999999865 Q ss_pred Cch----hhHHHH--HhhCcceecCCCCceeEE------EecCCHHHHHHHHHHHHHHHHHHhcccCCCcc-----cccC Q lcl|NC_011308. 311 SPV----DEIKKN--IQSKKIIQTKGEGGLDIQ------TVDIPYEARKAKMDIDELNIYRSGMGFNSSAV-----GDGN 373 (530) Q Consensus 311 ~~~----~~~~~~--~~~~~~i~~~~~~~~~~l------t~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~-----~~gn 373 (530) ++. +..... ..-++++.++++.+++.. .++.+....+.+++.|+..++.+|..-.+... .+.| T Consensus 264 ~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~n 343 (474) T protein:vir:81 264 SALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFPAASPDAHWSDINGLAKLFAREASLPDTAVAISGLSN 343 (474) T ss_pred hhcccccccccchhhhhHHHHhcCCCcccccccccccccccccCCCChhHHHHHHHHHHHHHHhhhCCCHHHhccccccc Confidence 331 111111 122345566665443322 35566677788888888888888754443332 2345 Q ss_pred -CcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc--ccceeeEEeCCCCCCCHHHHHHHHHHHHhc Q lcl|NC_011308. 374 -ATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDY--SSTDIKFDIEPYILANELDLAMIDKTEAET 450 (530) Q Consensus 374 -~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~--d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~ 450 (530) +||.||+....+|..||.++++.|+.+|++++++++++.+......+ ....+++.|.+....+..+.|+.+.+++++ T Consensus 344 p~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a 423 (474) T protein:vir:81 344 PTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAA 423 (474) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhccceeEecCCCccCHHHHHHHHHHHHhc Confidence 68999999999999999999999999999999999876543322222 235788999988888999999999998887 Q ss_pred C--CCcHHHHHHhCCCCCCHHHHHHHHHHHHH--HHHHHHHhhhccccccCCcccc Q lcl|NC_011308. 451 N--QIQINNLLAIAPRIGDEETLKAICDTLDL--DYEDVVKALEDQEVEELEPTVT 502 (530) Q Consensus 451 g--~iS~et~l~~~~~vdd~~~e~~~~e~e~~--e~~~~~~~~~~~~~~~~~~~~~ 502 (530) | +.+.+++++.+++.. .++++++.+.. +....+..+.....+.. .++ T Consensus 424 ~~~~~~~~~~~~~lg~t~---~~i~~~~~~~~~~~~~~~~~~l~~~~~~~~--~aq 474 (474) T protein:vir:81 424 VPWLAETEVGLELIGLTP---QQARRAMADKRRVQGRGTLQALIDRSNNGA--TAQ 474 (474) T ss_pred ccCCCcHHHHHhhcCCCH---HHHHHHHHHHHHHhHHHHHHHHHhcCCCCC--CCC Confidence 6 466788888877642 23344333322 22223333222211111 111 No 66 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=100.00 E-value=5.3e-45 Score=263.33 Aligned_cols=449 Identities=11% Similarity=0.014 Sum_probs=282.5 Q ss_pred ccHHHHHHHHHHHHH-------------------HhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCccee Q lcl|NC_011308. 11 DRLGTILSTKIDEYI-------------------RSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKI 71 (530) Q Consensus 11 ~~~~~~i~~~i~~~~-------------------~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki 71 (530) |.+.+-|+.+|.+-. +.+...++...++||+|+|.-+... ... ...+.++|+ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~----~~~-----~~~~~~~~~ 71 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYL----NTD-----GETKKRDLN 71 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccc----cCC-----CCcccCcee Confidence 777777777765411 2234568889999999997543221 111 123345688 Q ss_pred ecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEe Q lcl|NC_011308. 72 SHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTV 150 (530) Q Consensus 72 ~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~ 150 (530) ++|+++.||++.++|+||+|++++++ ++...+.|+++++ |+|...+.+++..++..|.+|..+|+|. +.+++..+ T Consensus 72 slnl~~~i~~~~A~lv~~e~~~i~~~---d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~I~~v 147 (500) T protein:vir:30 72 HLPIARTAAKKIASLVFNEQAEIKVD---DDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKVRVAFV 147 (500) T ss_pred ecchHHHHHHHHhhhhcCCcceEecC---ChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEE Confidence 99999999999999999999999774 4667888999986 5799999999999999999999999984 67999999 Q ss_pred cccceEEEEcCC-CCceeEEEEEEEEeecccccccceEEEEEEEc--CCc--eEEEeecCCcccchhhccccccccccce Q lcl|NC_011308. 151 DALQLLPVFDDY-GTLQRIIRFYTEQRYSDADNKFNSIGHADVWT--DTE--VWYYVQKDEGRSDEYVLDTTVNPNPSQH 225 (530) Q Consensus 151 ~p~~~~~v~d~~-~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt--~~~--~~~y~~~~~~~~~~~~~~~~~~~~~~~~ 225 (530) +|.++||+..++ +...+++.++...... .....++++|.|+ ... ...+..........+... .. T Consensus 148 ~ad~~~P~~~d~~~~~~~a~~~~~~~~~~---~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~--------v~ 216 (500) T protein:vir:30 148 QAPVFLPLQSNTQDVSSAAVVIKSVKTIN---GKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSR--------VP 216 (500) T ss_pred cCCeeEEEEEcCCCeEEEEEEEEEeeeec---CCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcc--------cc Confidence 999999985554 4444444333222211 1234455667665 222 111111111100000000 00 Q ss_pred eeeeecccccceecccccccccccccccccCCccceEEeeC----Cc-----CCCCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 226 VLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN----NK-----LGISDIKKVKSIIDDYDLMNCFLSNNLQ 296 (530) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n----n~-----~~~sd~e~v~~liDa~~~~~S~~~n~~~ 296 (530) ...+. .. ........++.+.|++.|++ |. .|.|+|++++++||++|.+.|++++.++ T Consensus 217 l~~~~---------~~-----l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~ 282 (500) T protein:vir:30 217 LSEVY---------KD-----LKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVK 282 (500) T ss_pred ccccc---------CC-----cCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHH Confidence 00000 00 00111223455566666654 22 4899999999999999999999999999 Q ss_pred Hhccceee----ee----cCCCCchhhHHHHHhhC--cceecCC--CCceeEEEecCCHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_011308. 297 DMAEAIYV----VR----GGTNSPVDEIKKNIQSK--KIIQTKG--EGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGF 364 (530) Q Consensus 297 ~~~~~~lv----l~----g~~~~~~~~~~~~~~~~--~~i~~~~--~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p 364 (530) ....++++ ++ |.+++......-++... ..+.... +..++.++.++..+....-++.+.+.|-..+... T Consensus 283 ~g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls 362 (500) T protein:vir:30 283 MGQRRVAVPESLTALTVRTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVS 362 (500) T ss_pred hCcceeeechHHhcccCCCCCccccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCC Confidence 87777777 22 11222212222122211 1222222 2347777777877777766666666554333222 Q ss_pred -C-CCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---CCCccccceeeEEeCCCCCCCHHH Q lcl|NC_011308. 365 -N-SSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRR---GLGDYSSTDIKFDIEPYILANELD 439 (530) Q Consensus 365 -~-~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~---~~~~~d~~~i~i~f~~~~P~n~~e 439 (530) . +++...|..|+.+++++++.+..+++++++.|+.+|++++++|+.+.... +.......++++.|++.++.|..+ T Consensus 363 ~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~ 442 (500) T protein:vir:30 363 AGLFSFDGKSMKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDA 442 (500) T ss_pred ccccccCcCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHH Confidence 1 33444466789999999999999999999999999999999998775532 222112246899999999999988 Q ss_pred HHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCC Q lcl|NC_011308. 440 LAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNI 519 (530) Q Consensus 440 ~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (530) .++..+.++.+|++|++++++.+..+++.+ ..+++++.++|. ++..... .+..-+.|+ T Consensus 443 ~~~~~~~~v~aGi~s~~~~i~~~~g~~eee-a~~~l~~i~~E~---~~~~~~~------------------~~~~~~~g~ 500 (500) T protein:vir:30 443 ELDYWIKVVNAGFGTREMAIQKVLNVTEEK-AQEIAAEINTGI---VDEINQQ------------------RTDTHLYGE 500 (500) T ss_pred HHHHHHHHHHcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhc---cccCCCC------------------CccccccCC Confidence 888899999999999999998886665543 222333333321 1111110 011111111 No 67 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=100.00 E-value=5.3e-45 Score=263.33 Aligned_cols=449 Identities=11% Similarity=0.014 Sum_probs=282.5 Q ss_pred ccHHHHHHHHHHHHH-------------------HhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCccee Q lcl|NC_011308. 11 DRLGTILSTKIDEYI-------------------RSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKI 71 (530) Q Consensus 11 ~~~~~~i~~~i~~~~-------------------~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki 71 (530) |.+.+-|+.+|.+-. +.+...++...++||+|+|.-+... ... ...+.++|+ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~----~~~-----~~~~~~~~~ 71 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYL----NTD-----GETKKRDLN 71 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccc----cCC-----CCcccCcee Confidence 777777777765411 2234568889999999997543221 111 123345688 Q ss_pred ecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEe Q lcl|NC_011308. 72 SHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTV 150 (530) Q Consensus 72 ~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~ 150 (530) ++|+++.||++.++|+||+|++++++ ++...+.|+++++ |+|...+.+++..++..|.+|..+|+|. +.+++..+ T Consensus 72 slnl~~~i~~~~A~lv~~e~~~i~~~---d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~I~~v 147 (500) T protein:vir:98 72 HLPIARTAAKKIASLVFNEQAEIKVD---DDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKVRVAFV 147 (500) T ss_pred ecchHHHHHHHHhhhhcCCcceEecC---ChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEE Confidence 99999999999999999999999774 4667888999986 5799999999999999999999999984 67999999 Q ss_pred cccceEEEEcCC-CCceeEEEEEEEEeecccccccceEEEEEEEc--CCc--eEEEeecCCcccchhhccccccccccce Q lcl|NC_011308. 151 DALQLLPVFDDY-GTLQRIIRFYTEQRYSDADNKFNSIGHADVWT--DTE--VWYYVQKDEGRSDEYVLDTTVNPNPSQH 225 (530) Q Consensus 151 ~p~~~~~v~d~~-~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt--~~~--~~~y~~~~~~~~~~~~~~~~~~~~~~~~ 225 (530) +|.++||+..++ +...+++.++...... .....++++|.|+ ... ...+..........+... .. T Consensus 148 ~ad~~~P~~~d~~~~~~~a~~~~~~~~~~---~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~--------v~ 216 (500) T protein:vir:98 148 QAPVFLPLQSNTQDVSSAAVVIKSVKTIN---GKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSR--------VP 216 (500) T ss_pred cCCeeEEEEEcCCCeEEEEEEEEEeeeec---CCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcc--------cc Confidence 999999985554 4444444333222211 1234455667665 222 111111111100000000 00 Q ss_pred eeeeecccccceecccccccccccccccccCCccceEEeeC----Cc-----CCCCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 226 VLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN----NK-----LGISDIKKVKSIIDDYDLMNCFLSNNLQ 296 (530) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n----n~-----~~~sd~e~v~~liDa~~~~~S~~~n~~~ 296 (530) ...+. .. ........++.+.|++.|++ |. .|.|+|++++++||++|.+.|++++.++ T Consensus 217 l~~~~---------~~-----l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~ 282 (500) T protein:vir:98 217 LSEVY---------KD-----LKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVK 282 (500) T ss_pred ccccc---------CC-----cCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHH Confidence 00000 00 00111223455566666654 22 4899999999999999999999999999 Q ss_pred Hhccceee----ee----cCCCCchhhHHHHHhhC--cceecCC--CCceeEEEecCCHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_011308. 297 DMAEAIYV----VR----GGTNSPVDEIKKNIQSK--KIIQTKG--EGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGF 364 (530) Q Consensus 297 ~~~~~~lv----l~----g~~~~~~~~~~~~~~~~--~~i~~~~--~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p 364 (530) ....++++ ++ |.+++......-++... ..+.... +..++.++.++..+....-++.+.+.|-..+... T Consensus 283 ~g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls 362 (500) T protein:vir:98 283 MGQRRVAVPESLTALTVRTTDGDVVPRPRFESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVS 362 (500) T ss_pred hCcceeeechHHhcccCCCCCccccCCcccCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCC Confidence 87777777 22 11222212222122211 1222222 2347777777877777766666666554333222 Q ss_pred -C-CCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---CCCccccceeeEEeCCCCCCCHHH Q lcl|NC_011308. 365 -N-SSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRR---GLGDYSSTDIKFDIEPYILANELD 439 (530) Q Consensus 365 -~-~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~---~~~~~d~~~i~i~f~~~~P~n~~e 439 (530) . +++...|..|+.+++++++.+..+++++++.|+.+|++++++|+.+.... +.......++++.|++.++.|..+ T Consensus 363 ~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~ 442 (500) T protein:vir:98 363 AGLFSFDGKSMKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDA 442 (500) T ss_pred ccccccCcCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHH Confidence 1 33444466789999999999999999999999999999999998775532 222112246899999999999988 Q ss_pred HHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCC Q lcl|NC_011308. 440 LAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNI 519 (530) Q Consensus 440 ~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (530) .++..+.++.+|++|++++++.+..+++.+ ..+++++.++|. ++..... .+..-+.|+ T Consensus 443 ~~~~~~~~v~aGi~s~~~~i~~~~g~~eee-a~~~l~~i~~E~---~~~~~~~------------------~~~~~~~g~ 500 (500) T protein:vir:98 443 ELDYWIKVVNAGFGTREMAIQKVLNVTEEK-AQEIAAEINTGI---VDEINQQ------------------RTDTHLYGE 500 (500) T ss_pred HHHHHHHHHHcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhc---cccCCCC------------------CccccccCC Confidence 888899999999999999998886665543 222333333321 1111110 011111111 No 68 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=100.00 E-value=6.9e-44 Score=257.21 Aligned_cols=477 Identities=10% Similarity=-0.003 Sum_probs=290.6 Q ss_pred ccHHHHHHHHHHHHHHhh----hHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhh Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQ----NVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQY 86 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~----~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~y 86 (530) |-+..-++++|+.+.+-. ...+.....++|.+...-..+. .+....|.........+.|++.|+++.|+++.++| T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~~~A~l 79 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKD-SYLTSLWAQGYVPTVHDKLMNSGTGNEIVVVAAEY 79 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhh-hhhhhhcccCCCCccccccccCChHHHHHHHHHHh Confidence 778888888888776422 2344455555566554332221 11122222222234456789999999999999999 Q ss_pred hcccceeeecCC---cchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEcCC Q lcl|NC_011308. 87 LLANGIDVKPTD---HDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFDDY 162 (530) Q Consensus 87 l~G~pv~~~~~~---~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~ 162 (530) +||+|++++... .+++.+++.|+++++ ++|...+.+++..++..|.++..+|++ +|++++..++|..++|++++ T Consensus 80 l~~e~~~i~v~~~~~~d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d-~~~~~i~~v~ad~~~P~~~~- 157 (518) T protein:vir:78 80 ISGKPLSIDVTGVNGSKDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINIL-NGRPSISVHSSSQFWIDFKN- 157 (518) T ss_pred hcCCCceEEecCccccCcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEE-CCeeEEEEEcCCeeEEEeec- Confidence 999999987532 246778899999985 678999999999999999999999987 48899999999999999965 Q ss_pred CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccc Q lcl|NC_011308. 163 GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGV 242 (530) Q Consensus 163 ~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (530) +++..++. +...... ......++++.+..+.+.+.....+.......+... ..........+...... ..... T Consensus 158 g~~~~~~f-~~~~~~~---~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~-~~~~~v~~~~~~~~~~l--~~~~~ 230 (518) T protein:vir:78 158 NEPFRFNF-FEEIPTS---NKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKI-DGDKTTPISAERLPEQI--TSYLH 230 (518) T ss_pred CcEEEEEE-EEEeecC---CcceeEEEEEeeccccccceeecccceeEEEEEeee-cCccccccccccccccc--ccccc Confidence 45544333 3222211 122233445554433322221111100000000000 00000000000000000 00000 Q ss_pred cccccccccccccCCccceEEeeCC-----c-----CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecC---- Q lcl|NC_011308. 243 EEHEGRQVLGRSYKSRFPFDILYNN-----K-----LGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGG---- 308 (530) Q Consensus 243 ~~~~~~~~~~~~~~~~iPiv~~~nn-----~-----~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~---- 308 (530) ..+.......+++ .+.|++.|.+| . .|.|+|+..+++||++|.++|++++.++....++.|...+ T Consensus 231 ~~~~~e~~~~~tg-~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~ 309 (518) T protein:vir:78 231 TNDIQLNHSVSIG-LKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEKTKTKIAASERMFRKK 309 (518) T ss_pred cccCccceeeccC-CccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHhCCceeeechhHhccC Confidence 0111111112222 35566665432 2 3899999999999999999999999999877777773321 Q ss_pred -C---CCchhhHHHHHhhCcceecCC--CCc----eeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCC-cccccCCcHH Q lcl|NC_011308. 309 -T---NSPVDEIKKNIQSKKIIQTKG--EGG----LDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSS-AVGDGNATNV 377 (530) Q Consensus 309 -~---~~~~~~~~~~~~~~~~i~~~~--~~~----~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~-~~~~gn~SGv 377 (530) . ....-.+..+.+....+.... +++ ++.++.++..++....++.+.+.|...+..+.-+ +...|..||. T Consensus 310 ~~~~~~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~~TAT 389 (518) T protein:vir:78 310 VNKSTDKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNREVKAT 389 (518) T ss_pred CCCCCCccccccCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccccccHH Confidence 1 111111222223333343322 232 6778888999998888888888887777554221 1123568999 Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-----ccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCC Q lcl|NC_011308. 378 VIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLG-----DYSSTDIKFDIEPYILANELDLAMIDKTEAETNQ 452 (530) Q Consensus 378 Aik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~-----~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~ 452 (530) +++...+.+..++.++++.++.+|++++..|+.++...... ..+...++|.|.+.+|.|..+.+++++.++.+|+ T Consensus 390 ei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~aGi 469 (518) T protein:vir:78 390 EIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNSALA 469 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhcCC Confidence 99999999999999999999999999999999887764321 1233568999999999999999999999999999 Q ss_pred CcHHHHHHh-CCCCCCHH--HHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCC Q lcl|NC_011308. 453 IQINNLLAI-APRIGDEE--TLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLN 518 (530) Q Consensus 453 iS~et~l~~-~~~vdd~~--~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (530) +|+++++++ +|.++|.+ +|++|+++|+..... ++.++..+ .+++.| T Consensus 470 mS~e~~i~~~~~~~~deea~~e~~ri~~E~~~~~~-------~~p~~~~g-------------~~~~~g 518 (518) T protein:vir:78 470 MSVEEKVKLIHPKWEDEEIQAEVKRIYLENAIGEV-------PDPEAIGG-------------METKGG 518 (518) T ss_pred CCHHHHHHHhCCCCCHHHHHHHHHHHHHHhcccCC-------CCCccccC-------------CCCCCC Confidence 999999976 56666643 445555544332111 11110000 000111 No 69 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=100.00 E-value=1.6e-43 Score=255.15 Aligned_cols=458 Identities=11% Similarity=-0.018 Sum_probs=287.6 Q ss_pred ccHHHHHHHHHHHHH-------------------HhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCccee Q lcl|NC_011308. 11 DRLGTILSTKIDEYI-------------------RSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKI 71 (530) Q Consensus 11 ~~~~~~i~~~i~~~~-------------------~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki 71 (530) |.+.+-++.+|.+-. +.+...++...++||+|++..+.... . ......++|+ T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~----~-----~~~~~~~~~~ 71 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKN----T-----DGDIKSRPMN 71 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccc----c-----Ccchhcccce Confidence 555555554444321 33445778888999999875443211 1 1122334688 Q ss_pred ecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEe Q lcl|NC_011308. 72 SHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTV 150 (530) Q Consensus 72 ~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~ 150 (530) ..|+++.||++.++++||+|++++++ ++...+.|+++++ ++|...+.+++..++..|.++..+|+| .|++++..+ T Consensus 72 slnl~~~i~~~~A~lv~~e~~~i~v~---d~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d-~~~~~i~~v 147 (522) T protein:vir:47 72 HLPIARTASKKIASLVYNEQATITTK---NEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYID-GDKVRVAFI 147 (522) T ss_pred ecchHHHHHHHHhhhhcCCcceeecC---ChHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEc-CCceEEEEE Confidence 99999999999999999999999864 4677888999985 678899999999999999988899998 578999999 Q ss_pred cccceEEEE-cCCCCceeEEEEEEEEeecccccccceEEEEEEEc---CC------------ceEEEeecCCcccchhhc Q lcl|NC_011308. 151 DALQLLPVF-DDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWT---DT------------EVWYYVQKDEGRSDEYVL 214 (530) Q Consensus 151 ~p~~~~~v~-d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt---~~------------~~~~y~~~~~~~~~~~~~ 214 (530) +|..++|+. +..+...+++..... .... ......+.++.++ .. +...+......... T Consensus 148 ~ad~~~P~~~~~~~~~e~a~~~~~~-~~~~--~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~---- 220 (522) T protein:vir:47 148 QAPVFFPLESNTQDVSSAAILTKTI-KSEG--RKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVND---- 220 (522) T ss_pred cCCceEEEEEcCCceEEEEEEEEEE-eecc--cceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCc---- Confidence 999999984 444445554432222 1111 1111122233321 00 11111111000000 Q ss_pred cccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCC---------cCCCCcHHHHHHHHHHHH Q lcl|NC_011308. 215 DTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNN---------KLGISDIKKVKSIIDDYD 285 (530) Q Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn---------~~~~sd~e~v~~liDa~~ 285 (530) ..+............ .........++.+.++++|+++ ..|.|+|+..+++||++| T Consensus 221 ---------------~lG~~v~l~~~~e~~-~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD 284 (522) T protein:vir:47 221 ---------------VLGQRVNLSELDKYK-NLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFIN 284 (522) T ss_pred ---------------ccCcccccccccccc-CCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHH Confidence 000000000000000 0011122345666777777664 258999999999999999 Q ss_pred HHHHHHHHHHHHhccceee----eec----CCCCch--hhHHHHHhhCcceec--CCCCceeEEEecCCHHHHHHHHHHH Q lcl|NC_011308. 286 LMNCFLSNNLQDMAEAIYV----VRG----GTNSPV--DEIKKNIQSKKIIQT--KGEGGLDIQTVDIPYEARKAKMDID 353 (530) Q Consensus 286 ~~~S~~~n~~~~~~~~~lv----l~g----~~~~~~--~~~~~~~~~~~~i~~--~~~~~~~~lt~~~~~~~~e~~ld~L 353 (530) .+.|.+++.++....+++| ++- .++... ..+..+-+-.+.+.. +++++++.++.++.++...+-++.+ T Consensus 285 ~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~~fd~~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~ 364 (522) T protein:vir:47 285 RSYDEFMWEVRMGQRRVIVPEHLTQRQYQRPDGTIDFRPRFDVEQNVYMQIGGSSMDAGGITDLTSPIRANDYILAISEG 364 (522) T ss_pred HHHHHHHHHHHhccceeecchHHhccCCCCCCcccccccccCcccceEeecCCCCCCCCcceeeccccChHHHHHHHHHH Confidence 9999999999998888887 321 111110 011111111223332 2345688888888888777777766 Q ss_pred HHHHHHHhc-ccC-CCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---cCCCccccceeeEE Q lcl|NC_011308. 354 ELNIYRSGM-GFN-SSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRR---RGLGDYSSTDIKFD 428 (530) Q Consensus 354 ~~~I~~~s~-~p~-~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~---~~~~~~d~~~i~i~ 428 (530) .+.|-..+. .|. +++.+.|..|+.+++...+.+..+++++++.|+.+|++++..|+.+++. .+.......++++. T Consensus 365 l~~i~~~~gls~~tf~~~~~~~kTAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~ 444 (522) T protein:vir:47 365 LKLFEMQIGVSSGMFTFDGQGMKTATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVN 444 (522) T ss_pred HHHHHHHhCCCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEE Confidence 665544332 222 3444445678999999999999999999999999999999999877653 23333345579999 Q ss_pred eCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHH--HHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCC Q lcl|NC_011308. 429 IEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEE--TLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIID 506 (530) Q Consensus 429 f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~--~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (530) |.+.++.|..+.++..+.++.+|++|++++++.++.+++.+ +|++|+++|+.+ . ...+.+.. T Consensus 445 f~D~i~~D~~~~~~~~~~~v~aG~~s~e~~i~~~~g~~eeea~~el~ri~~E~~~---~---------~~~~~~~~---- 508 (522) T protein:vir:47 445 LDDGVFTDRHAELDYWAKMVAAGFSTKKRAIGKTLNISGVEAEKELNAINSELLP---M---------NDAELAIY---- 508 (522) T ss_pred cCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhhcc---C---------CCCCCCCC---- Confidence 99999999888888888999999999999999998777654 344444443221 0 00011111 Q ss_pred CCCCCCccCcCCCCc Q lcl|NC_011308. 507 PLTIEPQPEPLNIDP 521 (530) Q Consensus 507 ~~~~~~~~~~~~~~~ 521 (530) ..+++++.++++.+ T Consensus 509 -~~~~~~~~~~d~~~ 522 (522) T protein:vir:47 509 -GMHDQNEEKADDKG 522 (522) T ss_pred -CCCCcccccCCCCC Confidence 12344445555555 No 70 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=100.00 E-value=1.8e-44 Score=260.41 Aligned_cols=482 Identities=12% Similarity=0.067 Sum_probs=301.7 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) -+.-|..++.- +...+..|. +.|+..|+.+.+||.+.|.-+.-...- .+-+--.++.++..++|| T Consensus 10 ~~~~~~~g~~~-----~p~~v~~~d-~~Rl~aY~l~~~~y~n~~~~~~~~lrg---------~~~~~~r~~~~ps~~~~~ 74 (527) T protein:vir:10 10 STQQLRAGEAN-----FPNAVTDFD-KARLASYRLYEDMYLTNTSDYQVILRG---------GDEGDQRPIYVPNGEKLI 74 (527) T ss_pred CCcCcCCcccc-----CcccCCHHH-HHHHHHHHHHHHHhcCchhheeeecCC---------ccccccceeeehhhHHhh Confidence 23333333221 111244443 467899999999999987433211100 000011246667776666 Q ss_pred hhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecC-C---CceEEEEecccce Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTS-E---DKLTFQTVDALQL 155 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~-~---g~~~~~~~~p~~~ 155 (530) .....|+ +.+..+.. +..++.+.+.|+.+.+ ++...++.+..+++.+.|++..++-+|. + +++++..+||... T Consensus 75 ~~~~~~~-~~g~~~~~-~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~~ 152 (527) T protein:vir:10 75 EAKMRFL-GQGLKWEF-SKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPSTY 152 (527) T ss_pred CCcceee-ccCccccc-cchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCccee Confidence 6666655 44544422 4457788899988875 6778899999999999999766555553 2 4799999999999 Q ss_pred EEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeee-----ee Q lcl|NC_011308. 156 LPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLA-----VA 230 (530) Q Consensus 156 ~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~ 230 (530) ||+.|+.+- ..+.++|.+-.|............+-+ .-+.|...+.+.... .+.+++-.. .. T Consensus 153 f~~ed~d~~-~~v~~v~~~~~~~~P~d~~~~~~~ar~----~~~~~~l~~~g~~~~--------~G~~~yt~~~w~lg~w 219 (527) T protein:vir:10 153 FPYEDPRYP-GQVLGVYLVDEYPHPDSEKKNEKCARV----QKYMKTLDDDGKPVP--------GGAIKYTEELYEPGKW 219 (527) T ss_pred eeeecCCCC-CceeeEEEeeeccCCccccccceehhh----hhhhhhcCccccccc--------Ccceeeeeceeecccc Confidence 999876432 234445544223222222211110100 001111111111000 000000000 00 Q ss_pred ccc---ccceecccccccccccccccccCCccceEEeeCC-----cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccce Q lcl|NC_011308. 231 DGV---DEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNN-----KLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAI 302 (530) Q Consensus 231 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn-----~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~ 302 (530) +.. ...-..........+....+++++.||||+|+|- ..|.|+++++++++|++|.++|+.+..+.....|+ T Consensus 220 ~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi 299 (527) T protein:vir:10 220 DDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGF 299 (527) T ss_pred ccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCce Confidence 000 0000000111223344567899999999999652 36999999999999999999999999999999999 Q ss_pred eeeecCCCCch-hhHH-HHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcc--cc-cCCcHH Q lcl|NC_011308. 303 YVVRGGTNSPV-DEIK-KNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAV--GD-GNATNV 377 (530) Q Consensus 303 lvl~g~~~~~~-~~~~-~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~--~~-gn~SGv 377 (530) .++.|...-+. ++.. -.+..+.++.+++++++..+.--...+..+.+++.|.+.||..|++|..+.. .. +++||. T Consensus 300 ~~~tg~~~vd~~G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ 379 (527) T protein:vir:10 300 YATDSAPPRDSRGNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAESGI 379 (527) T ss_pred eeecccccccccCCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHH Confidence 99999754322 2211 1345667888999999988876668889999999999999999999987654 32 457999 Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHh---cCCCc-cccceeeEEeCCCCCCCHHHHHHHHHHHHhcCC Q lcl|NC_011308. 378 VIKSRYTLLAMKAQKTEIALRKTLRWTAD-LVVEDIRR---RGLGD-YSSTDIKFDIEPYILANELDLAMIDKTEAETNQ 452 (530) Q Consensus 378 Aik~~~~~l~~ka~~ke~~f~~~l~~~~~-~i~~~l~~---~~~~~-~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~ 452 (530) ||++.+++|..++.+++..|+-.++|... .+...|.. .+..+ .+...+.++|.+.+|+|..+.++.+.+++++|+ T Consensus 380 ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGi 459 (527) T protein:vir:10 380 ALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKRFNQLLQLWEAGL 459 (527) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcCc Confidence 99999999999999999988888876543 33332222 22223 234578999999999999999999999999999 Q ss_pred CcHHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHHHHhhh----ccccc-cCCccccCCCCCCCCCCccCcCCCCcccc Q lcl|NC_011308. 453 IQINNLLAIA---PRIGDEETLKAICDTLDLDYEDVVKALE----DQEVE-ELEPTVTPIIDPLTIEPQPEPLNIDPVIE 524 (530) Q Consensus 453 iS~et~l~~~---~~vdd~~~e~~~~e~e~~e~~~~~~~~~----~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (530) +|.+||+++| ++++|+++|++++.++.+.......... -+..+ ..-++.++++- - + T Consensus 460 ~S~~tAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~~~d~~---~-------------~ 523 (527) T protein:vir:10 460 IPAKKLTEELSKIMGFELTEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQA---L-------------N 523 (527) T ss_pred hhHHHHHHHHHhccCCCChHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccc---c-------------C Confidence 9999998776 7899999998888876655432222111 11111 11111111111 1 1 Q ss_pred cccC Q lcl|NC_011308. 525 EEPV 528 (530) Q Consensus 525 ~~~~ 528 (530) +.|. T Consensus 524 ~~~~ 527 (527) T protein:vir:10 524 GQPL 527 (527) T ss_pred CCCC Confidence 1111 No 71 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=100.00 E-value=1.8e-44 Score=260.36 Aligned_cols=482 Identities=12% Similarity=0.069 Sum_probs=301.8 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) -+.-|..++.- +...+..|. +.|+..|+.+.+||.+.|.-+.-...- .+-+--.++.++..++|| T Consensus 10 ~~~~~~~g~~~-----~p~~v~~~d-~~Rl~aY~l~~~~y~n~~~~~~~~lrg---------~~~~~~r~~~~ps~~~~~ 74 (527) T protein:vir:10 10 STQQLRAGEAN-----FPNAVTDFD-KARLASYRLYEDMYLTNTSDYQVILRG---------GDEGDQRPIYVPNGEKLI 74 (527) T ss_pred CCcCcCCcccc-----CcccCCHHH-HHHHHHHHHHHHHhcCchhheeeecCC---------ccccccceeeehhhHHhh Confidence 23333333221 111244443 467899999999999987433211100 000011246667776666 Q ss_pred hhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecC-C---CceEEEEecccce Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTS-E---DKLTFQTVDALQL 155 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~-~---g~~~~~~~~p~~~ 155 (530) .....|+ +.+..+.. +..++.+.+.|+.+.+ ++...++.+..+++.+.|++..++-+|. + +++++..+||... T Consensus 75 ~~~~~~~-~~g~~~~~-~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~~ 152 (527) T protein:vir:10 75 EAKMRFL-GQGLKWEF-SKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPSTY 152 (527) T ss_pred CCcceee-ccCccccc-cchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCccee Confidence 6666555 44444422 4457788899988875 6778899999999999999766555553 2 4799999999999 Q ss_pred EEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeee-----ee Q lcl|NC_011308. 156 LPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLA-----VA 230 (530) Q Consensus 156 ~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~ 230 (530) ||+.|+.+- ..+.++|.+-.|............+-+ .-+.|...+.+.... .+.+++-.. .. T Consensus 153 f~~ed~d~~-~~v~~v~~~~~~~~P~d~~~~~~~ar~----~~~~~~l~~~g~~~~--------~G~~~yt~~~w~lg~w 219 (527) T protein:vir:10 153 FPYEDPRYP-GQVLGVYLVDEYPHPDSEKKNEKCARV----QKYMKTLDDDGKPVP--------GGAIKYTEELYEPGKW 219 (527) T ss_pred eeeecCCCC-CceeeEEEeeeccCCccccccceehhh----hhhhhhcCccccccc--------Ccceeeeeceeecccc Confidence 999876432 234445544223222222211110100 001111111111000 000000000 00 Q ss_pred ccc---ccceecccccccccccccccccCCccceEEeeCC-----cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccce Q lcl|NC_011308. 231 DGV---DEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNN-----KLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAI 302 (530) Q Consensus 231 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn-----~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~ 302 (530) +.. ...-..........+....+++++.||||+|+|- ..|.|+++++++++|++|.++|+.+..+.....|+ T Consensus 220 ~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi 299 (527) T protein:vir:10 220 DDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGF 299 (527) T ss_pred ccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCce Confidence 000 0000000111223344567899999999999652 36999999999999999999999999999999999 Q ss_pred eeeecCCCCch-hhHH-HHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcc--cc-cCCcHH Q lcl|NC_011308. 303 YVVRGGTNSPV-DEIK-KNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAV--GD-GNATNV 377 (530) Q Consensus 303 lvl~g~~~~~~-~~~~-~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~--~~-gn~SGv 377 (530) .++.|...-+. ++.. -.+..+.++.+++++++..+.--...+..+.+++.|.+.||..|++|..+.. .. +++||. T Consensus 300 ~~~tg~~~vd~~G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ 379 (527) T protein:vir:10 300 YATDSAPPRDSRGNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAESGI 379 (527) T ss_pred eeecccccccccCCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHH Confidence 99999754332 2211 1345667888999999988876668889999999999999999999987654 32 457999 Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHh---cCCCc-cccceeeEEeCCCCCCCHHHHHHHHHHHHhcCC Q lcl|NC_011308. 378 VIKSRYTLLAMKAQKTEIALRKTLRWTAD-LVVEDIRR---RGLGD-YSSTDIKFDIEPYILANELDLAMIDKTEAETNQ 452 (530) Q Consensus 378 Aik~~~~~l~~ka~~ke~~f~~~l~~~~~-~i~~~l~~---~~~~~-~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~ 452 (530) ||++.+++|..++.+++..|+-.++|... .+...|.. .+..+ .+...+.++|.+.+|+|..+.++.+.+++++|+ T Consensus 380 ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGi 459 (527) T protein:vir:10 380 ALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKRFAQLLELWEAGL 459 (527) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcCc Confidence 99999999999999999988888876543 33332222 22223 234578999999999999999999999999999 Q ss_pred CcHHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHHHHhhh----ccccc-cCCccccCCCCCCCCCCccCcCCCCcccc Q lcl|NC_011308. 453 IQINNLLAIA---PRIGDEETLKAICDTLDLDYEDVVKALE----DQEVE-ELEPTVTPIIDPLTIEPQPEPLNIDPVIE 524 (530) Q Consensus 453 iS~et~l~~~---~~vdd~~~e~~~~e~e~~e~~~~~~~~~----~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (530) +|.+||+++| ++++|+++|++++.++.+.......... -+..+ ..-++.++++- - + T Consensus 460 iS~etAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~~d~~---~-------------~ 523 (527) T protein:vir:10 460 IPAKKLTEELSKIMGFELTEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQA---L-------------N 523 (527) T ss_pred hhHHHHHHHHHhccCCCchHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccc---c-------------C Confidence 9999998776 7899999998888877665433222111 11111 11111111111 1 1 Q ss_pred cccC Q lcl|NC_011308. 525 EEPV 528 (530) Q Consensus 525 ~~~~ 528 (530) +.|. T Consensus 524 ~~~~ 527 (527) T protein:vir:10 524 GQPL 527 (527) T ss_pred CCCC Confidence 1111 No 72 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=100.00 E-value=5.9e-39 Score=230.19 Aligned_cols=508 Identities=12% Similarity=0.072 Sum_probs=296.0 Q ss_pred CCcccccCCcccH--HHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 1 MTNTLLTTAPDRL--GTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 1 ~~~~~~~~~~~~~--~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) |.+.-.+-.|-.- -+-...++..+. +.|+.+|+.+.+||.|+|--+.-... + ++.+ -+..++.+. T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~~~D-~~RlaaY~ly~d~y~n~~~el~~il~--G-------~dr~---~~~~ps~r~ 67 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVDEND-KNRVRAYDLYENIYLNSAETLKLVLR--G-------DDSV---PILMPSGRK 67 (563) T ss_pred CCccccccCCCcccccccccccCCHHH-HHHHHHHHHHHHhhcCchhhhhhhcC--C-------Ccee---eeccchHHH Confidence 6655554444211 011112243333 45889999999999999843221000 0 0111 244457889 Q ss_pred HHhhhhhhhcccceeeecCCcc-hH----HHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecC----CCceEEE Q lcl|NC_011308. 79 LVDQKTQYLLANGIDVKPTDHD-DQ----KLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTS----EDKLTFQ 148 (530) Q Consensus 79 Ivd~~~~yl~G~pv~~~~~~~~-de----~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~----~g~~~~~ 148 (530) +|++.+ +++|.|++|+..... ++ .++..|+.+.+ ++...+...+.+++.+.|++..++-+|. .+++++. T Consensus 68 ~V~~~~-~~Lg~~~~~~Ve~~~~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~R~rv~ 146 (563) T protein:vir:74 68 IVEAVH-RFLGVGFDYLVEPDMGDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGERISVD 146 (563) T ss_pred HHHHHH-HhcCCCcEEecCccccCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccccCCCceEe Confidence 999955 555999999764433 33 34444555553 5667788899999999999766555553 2489999 Q ss_pred EecccceEEEEcCCCCcee-EEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCccc-chhhcccc-ccccccce Q lcl|NC_011308. 149 TVDALQLLPVFDDYGTLQR-IIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRS-DEYVLDTT-VNPNPSQH 225 (530) Q Consensus 149 ~~~p~~~~~v~d~~~~~~~-~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~-~~~~~~~~-~~~~~~~~ 225 (530) .+||...||+-|....... ++++.. .|.......+.+..+.-| .|...+.+.. ..+..+.+ ...+..-. T Consensus 147 ~vDP~~~fp~~dpd~v~g~~~v~v~~--~~~~pdd~~~~~~r~~~~------~~~lndeg~~~~~~~~dae~w~lg~wd~ 218 (563) T protein:vir:74 147 EVDPRQIFLIEDGSTVVGFHMVDIVQ--DFRSPDDPSKKLARRRTF------RRVRNDEGMFTGRISSELTHWTLGNWDD 218 (563) T ss_pred ecCCceeeeccCCCCcccceeeeccc--CCCCCcchhccceeeeee------eeeeCCCCCccceeeeccchhccccccc Confidence 9999999996655433111 122111 111111111112222111 1111111100 00000000 00000000 Q ss_pred eeeeecccccceecccccccccccccccccCCccceEEeeCC-----cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011308. 226 VLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNN-----KLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAE 300 (530) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn-----~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~ 300 (530) ......................++...+++++.||++.|+|- ..|.|++++++++++++|.++|+.+..+....+ T Consensus 219 r~~~~~~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~ 298 (563) T protein:vir:74 219 RGAISDEQARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGL 298 (563) T ss_pred cCccchhhhcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCC Confidence 000000000000000011112234455889999999998652 369999999999999999999999999999999 Q ss_pred ceeeeecCCCCc--hhhHH-HHHhhCcceecCCCCc---eeEEEecCCHHHHHHHHHHHHH-HHHHHhcccCCCcc--cc Q lcl|NC_011308. 301 AIYVVRGGTNSP--VDEIK-KNIQSKKIIQTKGEGG---LDIQTVDIPYEARKAKMDIDEL-NIYRSGMGFNSSAV--GD 371 (530) Q Consensus 301 ~~lvl~g~~~~~--~~~~~-~~~~~~~~i~~~~~~~---~~~lt~~~~~~~~e~~ld~L~~-~I~~~s~~p~~~~~--~~ 371 (530) |+.++.|...-+ -++.. -++..+.++.+++++. ...|.=-.+.+.++.|++.|.. .||..|++|..... .. T Consensus 299 pi~vl~~~~p~d~~~g~~~~w~vgpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD~ 378 (563) T protein:vir:74 299 GMYVTNASAPVDPNTGELTDWNIGPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVDV 378 (563) T ss_pred CeEEeccccccccccccccccccCCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeeccccc Confidence 999998754322 11111 1355677788886644 4444333455788888877666 89999999987544 33 Q ss_pred c-CCcHHHHHHHHhhHHHHHHHHHHHHHHHHHH----HHHHHHHHHHh----------cCCCccccc-eeeEEeCCCCCC Q lcl|NC_011308. 372 G-NATNVVIKSRYTLLAMKAQKTEIALRKTLRW----TADLVVEDIRR----------RGLGDYSST-DIKFDIEPYILA 435 (530) Q Consensus 372 g-n~SGvAik~~~~~l~~ka~~ke~~f~~~l~~----~~~~i~~~l~~----------~~~~~~d~~-~i~i~f~~~~P~ 435 (530) | ..||.||++.+.+|.++|.+|+..+..++++ .+.+++.++.. .++.++.+. .+.|+|.+.+|. T Consensus 379 ~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~p~~P~ 458 (563) T protein:vir:74 379 TSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFADPMPV 458 (563) T ss_pred ccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeCCCCCc Confidence 3 5799999999999999999999977777766 44444433322 222333333 478999999999 Q ss_pred CHHHHHHHHHHHHhcCCCcHHHHHHhC---CCCC-CHHHHHHHHHHHHHHHHHHHHhhhcc------ccccCCccccCCC Q lcl|NC_011308. 436 NELDLAMIDKTEAETNQIQINNLLAIA---PRIG-DEETLKAICDTLDLDYEDVVKALEDQ------EVEELEPTVTPII 505 (530) Q Consensus 436 n~~e~a~~~~~~~~~g~iS~et~l~~~---~~vd-d~~~e~~~~e~e~~e~~~~~~~~~~~------~~~~~~~~~~~~~ 505 (530) |.....+...+++++|++|+|||+++| +|.. |++.+.++++..+...+-.+++.... ..++.-++.+.++ T Consensus 459 d~~~vv~~~~tl~~aGiiSretAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~~a~~~~g~~~~~~dd 538 (563) T protein:vir:74 459 NKTQVTQDTLLLQQAHLILRKMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADASLGLSAMDNGGAGEQQFDD 538 (563) T ss_pred cHHHHHHHHHHHHHcCchhHHHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccceecccCCCCcccccc Confidence 999999999999999999999997777 6644 77777777666555443333333221 1112222222222 Q ss_pred CCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 506 DPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 506 ~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) ..++-+.=-+|+.+-|-.++-|.|. T Consensus 539 ~g~p~~~~~~~~~~~~~~~~~~~~~ 563 (563) T protein:vir:74 539 QGNPIDQFGNPVEIPPDVTQVPLSP 563 (563) T ss_pred cCCchhHcCCcccCCccccccCCCC Confidence 2222233334555555555556666 No 73 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=100.00 E-value=7.5e-37 Score=218.63 Aligned_cols=455 Identities=12% Similarity=0.018 Sum_probs=283.8 Q ss_pred ccHHHHHHHHHHHHH-------------------HhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCccee Q lcl|NC_011308. 11 DRLGTILSTKIDEYI-------------------RSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKI 71 (530) Q Consensus 11 ~~~~~~i~~~i~~~~-------------------~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki 71 (530) |.+.+-|+.++.+-. ......++...++||+|+++-++.. ... ...+..+++ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~----~~~-----~~~~~~~~~ 71 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYI----NSQ-----GKIQERDYM 71 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccc----ccc-----cccccccee Confidence 777777776664421 1122457777889999999644321 111 112233578 Q ss_pred ecCchhhHHhhhhhhhcccceeeecCCcch--------HHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCC Q lcl|NC_011308. 72 SHGFFAELVDQKTQYLLANGIDVKPTDHDD--------QKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSE 142 (530) Q Consensus 72 ~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~d--------e~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~ 142 (530) ..|+++.|+...++++|++|++++.++.+. +..++.|+++++ |+|...+.+.+..++..|.++..+|+| . T Consensus 72 sl~~~~~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d-~ 150 (517) T protein:vir:98 72 TLNLRKLSADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVD-N 150 (517) T ss_pred ecCcHHHHHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEe-C Confidence 899999999999999999999998765432 446788999986 568999999999999999999999998 4 Q ss_pred CceEEEEecccceEEE-EcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceE--------EEeecCCcccchhh Q lcl|NC_011308. 143 DKLTFQTVDALQLLPV-FDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVW--------YYVQKDEGRSDEYV 213 (530) Q Consensus 143 g~~~~~~~~p~~~~~v-~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~--------~y~~~~~~~~~~~~ 213 (530) |.+++..++|..+||+ |+..+...+++.+...... ......++.+|.|+..... .+.....+....+. T Consensus 151 ~~~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~---~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG 227 (517) T protein:vir:98 151 GEIEFSWALANAFYPLRSNSNGISEGVMKSVTTKVI---GNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIG 227 (517) T ss_pred CeeEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEee---cCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCcccc Confidence 6789999999999995 4444444455443332221 1123345556666544321 11111100000000 Q ss_pred ccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC----C-----cCCCCcHHHHHHHHHHH Q lcl|NC_011308. 214 LDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN----N-----KLGISDIKKVKSIIDDY 284 (530) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n----n-----~~~~sd~e~v~~liDa~ 284 (530) ... .+..+... . .......+..+.+++.|++ | ..|.|+|++.++++|++ T Consensus 228 ~~v--------~L~~~~e~---------l-----~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~l 285 (517) T protein:vir:98 228 KRI--------PLEELYEG---------M-----QEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKI 285 (517) T ss_pred ccc--------cccccccC---------C-----CcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHH Confidence 000 00000000 0 0001112233333444443 2 25999999999999999 Q ss_pred HHHHHHHHHHHHHhccceeeeecCC---CCchhh-----HHHHHhhCcceecCC-CCceeEEEecCCHHHHHHHHHHHHH Q lcl|NC_011308. 285 DLMNCFLSNNLQDMAEAIYVVRGGT---NSPVDE-----IKKNIQSKKIIQTKG-EGGLDIQTVDIPYEARKAKMDIDEL 355 (530) Q Consensus 285 ~~~~S~~~n~~~~~~~~~lvl~g~~---~~~~~~-----~~~~~~~~~~i~~~~-~~~~~~lt~~~~~~~~e~~ld~L~~ 355 (530) |...|.+++.++....++.|-..+- .+..+. +..+.+..+.+..+. ++.++.++.++-.+...+.++.+.+ T Consensus 286 D~~~s~~~~e~~~g~~~i~vp~~~l~~~~~~~g~~~~~~~d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~ 365 (517) T protein:vir:98 286 NDTYDQFWWEIKMGQRTVFVSDVMLRTVPDESGMPPPQVFDPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALR 365 (517) T ss_pred HHHHHHHHHHHHhCCcceecChhhhccccCCCCcccCCCCCcccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHH Confidence 9999999999999878777632221 111000 001111122222222 2345666677778888889999999 Q ss_pred HHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---cCCCccccceeeEEeC Q lcl|NC_011308. 356 NIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRR---RGLGDYSSTDIKFDIE 430 (530) Q Consensus 356 ~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~---~~~~~~d~~~i~i~f~ 430 (530) .|...+..+. +++.+-|..++.+++...+.+..+++++++.|+.+|++++++|+.+... .+.......++++.|. T Consensus 366 ~i~~~~Gls~~t~~~~~~~~kTATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~ 445 (517) T protein:vir:98 366 TLEMELKLSVGTFSFDGRSMKTATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFD 445 (517) T ss_pred HHHHHhCCCcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcC Confidence 8888877663 4444445568999999999999999999999999999999998876554 3333223456899999 Q ss_pred CCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHH--HHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCC Q lcl|NC_011308. 431 PYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEE--TLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPL 508 (530) Q Consensus 431 ~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~--~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (530) +.++.|..+.++..+.++.+|++|.++++..+..+++.+ ++++++++|+.+. . . ....++..+++. T Consensus 446 D~i~~D~~~~~~~~~~~v~aG~ms~~~~i~~~~g~~eeeA~~e~~~i~~E~~~~---~-~-----~~~~~~~~~~~~--- 513 (517) T protein:vir:98 446 DGVFQDRSALLRFYGQAKTFGFIPTVEAIQRIFKVPKKTAEQWLEEIRKDQIEL---D-P-----VTISQRAQKRMF--- 513 (517) T ss_pred CCCCCCHHHHHHHHHHHHhcCCCCHHHHHHHhCCCChHHHHHHHHHHHHhcccc---C-C-----CCccccccCCCC--- Confidence 999999999999999999999999999998875555543 3333333332211 0 0 001111111111 Q ss_pred CCCCccCcCCCCcc Q lcl|NC_011308. 509 TIEPQPEPLNIDPV 522 (530) Q Consensus 509 ~~~~~~~~~~~~~~ 522 (530) ++.. T Consensus 514 ----------gd~e 517 (517) T protein:vir:98 514 ----------GDEE 517 (517) T ss_pred ----------CCCC Confidence 0000 No 74 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=99.88 E-value=4.6e-21 Score=132.09 Aligned_cols=466 Identities=9% Similarity=-0.006 Sum_probs=264.2 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcce----eecCch Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIK----ISHGFF 76 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~k----i~~n~~ 76 (530) |+ +.+|.+..-.-..+ .......+.+++-|.|...+-..-.. |-.+.....+. .-..| +-.|++ T Consensus 1 m~----~~~~~~v~~~h~~y------~a~~~~W~~ird~~~G~~~~r~~g~~-YLPk~~~E~~~-~Y~~rl~rA~~~n~~ 68 (513) T protein:vir:97 1 MA----DKDPKSPATTSGAY------DQMLPRWHVIETLLGGTEAMREAGET-YLPRHQEETDK-GYQERLASAVLLNMV 68 (513) T ss_pred CC----CCCCCCCCcCCHHH------HHHHHHHHHHHHHhcChHHHHhhccc-CCCCCCCCCHH-HHHHHHhcccCCChH Confidence 22 22233322111111 12345556677777776544321111 11111111000 01112 347999 Q ss_pred hhHHhhhhhhhcccceeeecCCcchHHHHH-HHHHHh--hccHHHHHHHHHHHHhhcCeEEEEEEecCCC---------- Q lcl|NC_011308. 77 AELVDQKTQYLLANGIDVKPTDHDDQKLCY-LIEEYY--NEEFQSAIQELVEGSTIKGYEGIFARTTSED---------- 143 (530) Q Consensus 77 k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~-~l~~~~--~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g---------- 143 (530) +.+++..+|++|-+||+++... -....+ ++.++- +++++.....+.+.+..+|+++.+|=+...+ T Consensus 69 ~~tl~~l~G~vf~k~p~~~~~~--p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~ 146 (513) T protein:vir:97 69 EQTLDTLSGKPFSEPIKLNEDV--PKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTL 146 (513) T ss_pred HHHHHHHhhhhhhcCcccCcCc--hHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhH Confidence 9999999999999999885321 122333 233432 3567788888999999999999999665432 Q ss_pred --------ceEEEEecccceEEEEcC--CCC--ceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccch Q lcl|NC_011308. 144 --------KLTFQTVDALQLLPVFDD--YGT--LQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDE 211 (530) Q Consensus 144 --------~~~~~~~~p~~~~~v~d~--~~~--~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~ 211 (530) .+.+..++|.+++= |+. .+. ..-.+++-......+ ....+.+..+-+++...+..|+....+.... T Consensus 147 Ade~~~~~rPy~~~~~~e~Iin-W~~~~v~G~~~L~~v~l~E~~~~~D-gf~~~~~~q~rvL~~g~~~v~r~~~~~~~~~ 224 (513) T protein:vir:97 147 ADDRREGLRPYWVMIKPECLLF-ARSEVINGVEVLQHVRIIEHYMEQD-GFAEVCKRRIRVLEPGLVQLWEPVKKSNAQK 224 (513) T ss_pred HHHHhhccCceEEEecHhhhcC-cceeccCcceeeeeEEEEEEEeecC-CCcceEEEEEEEEeCceEEEEEeecCCCccc Confidence 25578888887743 321 111 222233333222222 2334555666778887777666544332211 Q ss_pred hhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCc----CCCCcHHHHHHHHHHHHHH Q lcl|NC_011308. 212 YVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK----LGISDIKKVKSIIDDYDLM 287 (530) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~----~~~sd~e~v~~liDa~~~~ 287 (530) ..........|.|+.||||++.... .+.+-|.++-.|--+.=.. T Consensus 225 --------------------------------~e~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~ 272 (513) T protein:vir:97 225 --------------------------------EEWALADEWATGLNYVPLVTFYADRQGFMMGKPPLLDLAHLNVAHWQS 272 (513) T ss_pred --------------------------------cceEEecCCCCcCCceeEEEEecCCCCCCCCccchHHHHHHHHHHHhh Confidence 0011122345789999999988543 3566688888888888899 Q ss_pred HHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCC-CCceeEEEecCC-HHHHHHHHHHHHHHHHHHhcccC Q lcl|NC_011308. 288 NCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKG-EGGLDIQTVDIP-YEARKAKMDIDELNIYRSGMGFN 365 (530) Q Consensus 288 ~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~lt~~~~-~~~~e~~ld~L~~~I~~~s~~p~ 365 (530) .|+.-+.+...+.|++++.|.+.+..+.. -+..+.++.+++ ++++.|+..+.+ .++.+..++.|++.|...+..+ T Consensus 273 ~Sd~~~il~~~~~P~l~~~G~~~~~~~~i--~iG~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~qm~~~Ga~l- 349 (513) T protein:vir:97 273 ASDQRHILTVSRFPILACSGASGEDSDPV--VVGPNKVLYNPDPAGRFYYVEHTGQAIAAGRTDLKDLEEQMAGYGAEF- 349 (513) T ss_pred hhhHHHHHHhcccceeeeecCCcCCCCce--EeeccccccCCCCCCcceeeccCchhHHHHHHHHHHHHHHHHHHHHHh- Confidence 99999999999999999999765432221 234456677775 789999999865 4667889999999998888644 Q ss_pred CCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCH--HHHHHH Q lcl|NC_011308. 366 SSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANE--LDLAMI 443 (530) Q Consensus 366 ~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~--~e~a~~ 443 (530) -....++-||+|.++........-...-..+..+|.+.++++..++.... ..++++.++...... ...++. T Consensus 350 -l~~~~~~~Ta~a~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~wlg~~~------~~~~v~in~dF~~~~~~~~~~~a 422 (513) T protein:vir:97 350 -LKRKTGGQTATARALDSAEATSDLSAMTGLFEDALAQALDITADWLRLGP------NGGTVELVKDYDLEEMDAPGLQA 422 (513) T ss_pred -hccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC------CccEEEeccccCcccCCHHHHHH Confidence 22334678999999998888888888888899999999999988876421 123344444333221 234455 Q ss_pred HHHHHhcCCCcHHHHHHhCC---CCC---CHHHHHHHHHHHHHHHHHHHHhh--hccccccCCccccCCCC--CCCCCCc Q lcl|NC_011308. 444 DKTEAETNQIQINNLLAIAP---RIG---DEETLKAICDTLDLDYEDVVKAL--EDQEVEELEPTVTPIID--PLTIEPQ 513 (530) Q Consensus 444 ~~~~~~~g~iS~et~l~~~~---~vd---d~~~e~~~~e~e~~e~~~~~~~~--~~~~~~~~~~~~~~~~~--~~~~~~~ 513 (530) +..+...|.||++|.+..+- .+. |.+++.++ ++.+.++..... .........+++...++ +++.-+. T Consensus 423 l~~a~~~G~is~~t~~~~L~r~gvl~~d~d~~~~~e~---~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (513) T protein:vir:97 423 LQVAREKRDISRKTYLNGLRLRGVLPEDFDEDEDWEE---LMEEISEAMGRAGLDLDPAQKNPPEGGEGEGEGEGEGGEG 499 (513) T ss_pred HHHHHhCCCCCHHHHHHHHHhccCCCccCCHHHHHHH---HHHhhhhccCCCCccccccCCCCCCCCCCCCCCCCCCCCC Confidence 66778899999999987763 332 23333222 222222211110 00000111111111111 1111111 Q ss_pred cCcCCCCcccccccCC Q lcl|NC_011308. 514 PEPLNIDPVIEEEPVQ 529 (530) Q Consensus 514 ~~~~~~~~~~~~~~~~ 529 (530) .+-.|..+- |--.| T Consensus 500 ~~~~~~~~~--~~~~~ 513 (513) T protein:vir:97 500 GEGGEGGGN--PGGES 513 (513) T ss_pred CCccccCCC--CCCCC Confidence 111111111 11111 No 75 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=99.87 E-value=2.7e-21 Score=133.39 Aligned_cols=428 Identities=10% Similarity=-0.012 Sum_probs=241.7 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceee----cCchhhHHhhhhhh Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKIS----HGFFAELVDQKTQY 86 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~----~n~~k~Ivd~~~~y 86 (530) |+....=. .......+.+..++-|.|...+-.. ...+-.+.... ....-..|+. .|+++.+++..+|+ T Consensus 1 m~V~~~hp------~y~a~~~~W~~~rd~~~G~~~~r~~-g~~YLpk~~~E-~~~~Y~~rl~rA~~~n~~~~t~~~~~G~ 72 (452) T protein:vir:94 1 MPIETKHP------EYLAYENDWIDCRVASLGQREVKKK-GVRFLPKLSGQ-TDDMYNAYKQRALFYSITSKTLSALSGM 72 (452) T ss_pred CCCCCcCH------HHHHHHHHHHHHHHHhcChHHHHcC-CcccCCCCCCC-CHHHHHHHHhhccCCchHHHHHHHHhch Confidence 33221101 1122334555666777776654322 11111111111 1111112443 79999999999999 Q ss_pred hcccceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCC-ceEEEEecccceEEEE--cCCC Q lcl|NC_011308. 87 LLANGIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSED-KLTFQTVDALQLLPVF--DDYG 163 (530) Q Consensus 87 l~G~pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g-~~~~~~~~p~~~~~v~--d~~~ 163 (530) +|.+||+++..+ .......+.-+++++.........+..+|+++.+|=++..| ++.+..++|.+++= | +..+ T Consensus 73 vf~k~p~~~~p~----~l~~~~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii~-W~~~~~g 147 (452) T protein:vir:94 73 VLDQPPVITHPD----AMSKYFEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYTTENILN-WEEDEDG 147 (452) T ss_pred hhcCCceecccH----HHHHHHhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEechhhhcC-ccccccC Confidence 999999986532 22221112224677888888999999999999999877665 68899999998863 3 2333 Q ss_pred Ccee-EEEEEEEEeecccccccceEEEEEEEc--CCceEEEeecCCcccchhhccccccccccceeeeeecccccceecc Q lcl|NC_011308. 164 TLQR-IIRFYTEQRYSDADNKFNSIGHADVWT--DTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDE 240 (530) Q Consensus 164 ~~~~-~~~~y~~~~~~~~~~~~~~~~~~evyt--~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (530) .+.. .+|..............+....+-+++ +..+..++....+... .. T Consensus 148 ~l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~-~~--------------------------- 199 (452) T protein:vir:94 148 RLLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKV-WE--------------------------- 199 (452) T ss_pred CeeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCce-ee--------------------------- Confidence 3322 223222211111112223344444444 3322222221111000 00 Q ss_pred cccccccccccccccCCccceEEeeCCc----CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhH Q lcl|NC_011308. 241 GVEEHEGRQVLGRSYKSRFPFDILYNNK----LGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEI 316 (530) Q Consensus 241 ~~~~~~~~~~~~~~~~~~iPiv~~~nn~----~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~ 316 (530) ...........++|+.||+|++.... .+.+-|.++-.|.-+.-...|+..+.+...++|++++.|.+... . T Consensus 200 --~~~~~~~~~~~~~l~~IP~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~~~~--~- 274 (452) T protein:vir:94 200 --LAKTSTIQNVGVTMDYIPFFCITPSGLSMTPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWITGAESQS--T- 274 (452) T ss_pred --eccceeecCCCcccceeEEEEEcCCCCCCCCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEeecCcCCC--c- Confidence 01112233456899999999986543 35667888888888999999999999999999999999975432 1 Q ss_pred HHHHhhCcceecCC-CCceeEEEecCCH-HHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHH----hhHHHHH Q lcl|NC_011308. 317 KKNIQSKKIIQTKG-EGGLDIQTVDIPY-EARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRY----TLLAMKA 390 (530) Q Consensus 317 ~~~~~~~~~i~~~~-~~~~~~lt~~~~~-~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~----~~l~~ka 390 (530) ..+..+.++.+++ ++++.|+..+.+. ++.+..++.|++.+...+.-. +.....++.||.|..... +.|...| T Consensus 275 -i~iG~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~~m~~~Ga~l-l~~~~~~~~s~ea~~~~~~~~~s~L~~~a 352 (452) T protein:vir:94 275 -MHIGSTKAWVIPEVAAKVGFLEFTGQGLQSLEKALSEKQAQLASLSARL-IDNSTRGSEATETVKLRYMSETASLKSVT 352 (452) T ss_pred -eEecccccccCCCCCCcceEEccCchhHHHHHHHHHHHHHHHHHHHHHh-hccCCCcchHHHHHHHHHHHhhHHHHHHH Confidence 1245566778885 8899999988644 678899999999998887522 112234667887766543 4455555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCC--CCCHHHHHHHHHHHHhcCCCcHHHHHHhCC--CCC Q lcl|NC_011308. 391 QKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYI--LANELDLAMIDKTEAETNQIQINNLLAIAP--RIG 466 (530) Q Consensus 391 ~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~--P~n~~e~a~~~~~~~~~g~iS~et~l~~~~--~vd 466 (530) ...+. ++.+++++++.+++.. . ++.+..++.. +.-....++.+..+..+|.+|++|++..|- .|- T Consensus 353 ~~~e~----al~~~l~~~a~w~g~~----~---~~~v~~n~dF~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~gvl 421 (452) T protein:vir:94 353 RAVEA----LLNKAYSCIMDMESMG----G---TLNIKLNSAFLDSKLTAAELKAWVEAYLSGGISKEIYIHALKVGKVL 421 (452) T ss_pred HHHHH----HHHHHHHHHHHHcCCC----C---ceEEEeccccccccCCHHHHHHHHHHHhcCCCcHHHHHHHHHhCCCC Confidence 55554 4555566666665431 1 2334433322 222233444556778999999999988872 233 Q ss_pred CHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCC Q lcl|NC_011308. 467 DEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTI 510 (530) Q Consensus 467 d~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (530) |++.|.+++..|... + +..+.++|.....+- T Consensus 422 ~~~~e~~~i~~E~~~----------~---~~~~~~~~~~~~~~~ 452 (452) T protein:vir:94 422 PPPGESMGVIPDPPA----------P---EPSPSNTPPNPSSKA 452 (452) T ss_pred CCccCHHHHHHHhhc----------c---CcccCCCCCCCccCC Confidence 455444443332111 1 111111111100000 No 76 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=99.84 E-value=2.2e-19 Score=122.85 Aligned_cols=454 Identities=10% Similarity=0.022 Sum_probs=240.8 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccc----cccccCCcce----ee Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDI----VEDDNASNIK----IS 72 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~----~~~~~~~n~k----i~ 72 (530) |.+. -.-.| .-....++....++-+.|...+-.+-..+ -.+... .+.+..-..| +- T Consensus 1 m~~V-~~~hp--------------~y~~~~~~W~~ird~~~G~~~~r~~g~~Y-LP~~~~e~~~~e~~~~Y~~rl~rA~~ 64 (501) T protein:vir:95 1 MPNV-SFIRP--------------ELGKLLPLYYLIRDAIAGEPTVKGARTTY-LPMPNAEDQSKENKARYEAYLKRAVF 64 (501) T ss_pred CCCC-CCCCH--------------HHHHHHHHHHHHHHHhcChHHHHhccccc-CcCCCCCCCcccchHHHHHHhhcccc Confidence 2210 00000 01123344556666677766543221111 100000 0000000112 24 Q ss_pred cCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHh--hccHHHHHHHHHHHHhhcCeEEEEEEecCCC------- Q lcl|NC_011308. 73 HGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYY--NEEFQSAIQELVEGSTIKGYEGIFARTTSED------- 143 (530) Q Consensus 73 ~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~--~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g------- 143 (530) .|+++.+++..+|.+|.+||++..+ ..+..++.++- +++++.....++..+..+|+++.+|=+...+ T Consensus 65 ~n~~~~t~~~l~G~vf~k~p~~~~p----~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~ 140 (501) T protein:vir:95 65 YNVARRTLFGLVGQVFMRDPVVKVP----ALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASI 140 (501) T ss_pred CchHHHHHHHHhhhhhcCCcceeCc----HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccH Confidence 6999999999999999999998532 23334444442 3567788888999999999999999665322 Q ss_pred --------ceEEEEecccceEEEEcC--CCC--ceeEEEEEEEEeecccccccceEEEEEEEc--CCceEEEe---ecCC Q lcl|NC_011308. 144 --------KLTFQTVDALQLLPVFDD--YGT--LQRIIRFYTEQRYSDADNKFNSIGHADVWT--DTEVWYYV---QKDE 206 (530) Q Consensus 144 --------~~~~~~~~p~~~~~v~d~--~~~--~~~~~~~y~~~~~~~~~~~~~~~~~~evyt--~~~~~~y~---~~~~ 206 (530) .+.+..++|.+++= |+. .+. ..-.+++-......+.....+.+..+-+.+ .++.+.|+ .... T Consensus 141 a~~~~~~~rPy~~~~~~~~Iin-W~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~ 219 (501) T protein:vir:95 141 ADLEAGRIRPTLYVYSPTEIIN-WRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQP 219 (501) T ss_pred HHHHhccCCcEEEEecHhhhcC-cceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCC Confidence 15578888887743 321 111 222233332222222222223333343444 33444333 2111 Q ss_pred cccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcC----CCCcHHHHHHHHH Q lcl|NC_011308. 207 GRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKL----GISDIKKVKSIID 282 (530) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~----~~sd~e~v~~liD 282 (530) +....... ..+ ...............|.++.||+|++..... +.+-|.++-.|.- T Consensus 220 ~~~~~~~~---------------~~~------~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~lA~lni 278 (501) T protein:vir:95 220 TKADGSKI---------------PKG------NYQQYVVYKPTDAQGKRLTEIPFMFIGSENNDSNPDNPNFYDLASLNM 278 (501) T ss_pred cccCccee---------------cCC------cccccceeeeeccCCCcCCeeeEEEEecCCCCCCCCccchHHHHHHHH Confidence 11000000 000 0000001111223458999999998754322 3444555555555 Q ss_pred HHHHHHHHHHHHHHHhccceeeeecCCCCchhh---HHHHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 283 DYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDE---IKKNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYR 359 (530) Q Consensus 283 a~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~---~~~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~ 359 (530) +.=...|+.-+.+...++|.++++|.+.+.... ....+....++.++.+|+++|+..+.+. -.+..++.+++.+.. T Consensus 279 ~hy~~ssd~~~~l~~~~~P~l~i~G~~~~~~~~~~~~~i~~G~~~~~~lP~~~~~~~ie~~~~~-i~~~~l~~l~~~m~~ 357 (501) T protein:vir:95 279 AHYRNSADYEESCYIVGQPTPVLIGLTEEWVTNVLKGSVNFGSRGGIPLPVGADAKLLQASENT-MLKEAMDTKERQMVA 357 (501) T ss_pred HHHhhhhHHHHHHHHcccceeeeeCCcccccccCCCCceeecccccccCCCCCceeEEecChhh-HHHHHHHHHHHHHHH Confidence 555666788888899999999999976542111 0112233456678899999999876433 336778999999888 Q ss_pred HhcccCCCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCC--H Q lcl|NC_011308. 360 SGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILAN--E 437 (530) Q Consensus 360 ~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n--~ 437 (530) .+.. +-..+.++-||+|.++........-...-..+..+|.+.+++++.++.... ..++|..++..+.. . T Consensus 358 ~Ga~--ll~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~w~g~~~------~~~~v~i~~df~~~~~~ 429 (501) T protein:vir:95 358 LGAK--LVEQKEVQRTATEAELEAASEGSTLSSATKNVSAAFEWALKWAARWVGQAD------SGVKFELNTDFDIARMT 429 (501) T ss_pred HHHh--hccCCccchhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCCC------CceEEEEecccccccCC Confidence 8753 334445678899888887777666666666777778888888888765432 12344555554432 2 Q ss_pred HHHHHHHHHHHhcCCCcHHHHHHhC---CCCC-CHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCC Q lcl|NC_011308. 438 LDLAMIDKTEAETNQIQINNLLAIA---PRIG-DEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTI 510 (530) Q Consensus 438 ~e~a~~~~~~~~~g~iS~et~l~~~---~~vd-d~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (530) ...++.+..+.++|.||++|++..+ ..++ +.+.+.++++.+..+.. ........... ..+.... .+.+ T Consensus 430 ~~~~~al~~~~~~G~is~~t~~~~L~~~~v~~~~~~~e~e~i~~~~~~~~---~~~~~~~~~~~-~~gg~~~-~~~~ 501 (501) T protein:vir:95 430 PDERRSLVEEWQKGAITFEEMRTGLRKAGVATEDDSKAKEKIAKDTAEAM---ALATPANVPGD-GSGGDNV-GNSE 501 (501) T ss_pred HHHHHHHHHHHhCCCCcHHHHHHHHHhCCCCChhHHHHHHHHHhhhcCcc---cccccCCCCCC-Ccccccc-cCCC Confidence 3445666788899999999996665 4444 33444444444322211 00001111111 1111000 1111 No 77 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.81 E-value=2.3e-19 Score=122.78 Aligned_cols=511 Identities=12% Similarity=0.058 Sum_probs=238.8 Q ss_pred CCccccc-----CCcc---cHHHHHHHHHHHHHHh-----hhHHHHHHHHHHhcccchhhhcccccccccccccccccCC Q lcl|NC_011308. 1 MTNTLLT-----TAPD---RLGTILSTKIDEYIRS-----QNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNAS 67 (530) Q Consensus 1 ~~~~~~~-----~~~~---~~~~~i~~~i~~~~~~-----~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~ 67 (530) ||+.-.. ..+. ...++..+++..|... .-+....+-.+||.|.|= ........+...+| T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw--------~~~~~~~l~~~g~p 94 (776) T protein:vir:93 23 SPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQW--------SQDEIDELKERGQA 94 (776) T ss_pred CCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCC--------CHHHHHHHHhcCCc Confidence 2332111 1122 2223445555544321 123345677899999861 00001111222333 Q ss_pred cceeecCchhhHHhhhhhhhcccce--eeecCCcchHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCeEEEEEEec Q lcl|NC_011308. 68 NIKISHGFFAELVDQKTQYLLANGI--DVKPTDHDDQKLCYLIEEYYN-----EEFQSAIQELVEGSTIKGYEGIFARTT 140 (530) Q Consensus 68 n~ki~~n~~k~Ivd~~~~yl~G~pv--~~~~~~~~de~~~~~l~~~~~-----~~~~~~~~e~~~~~~~~G~a~~~~y~d 140 (530) .+.+|..+.+|+..+|+...+.+ +|...+.+|.+..+.|+.+++ ++.......+..++.++|.+|.-++++ T Consensus 95 --~~~~N~i~~~i~~v~g~~~~nr~~~~~~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d 172 (776) T protein:vir:93 95 --PTVYNVISQSVNWIIGSEKRGRSDFKVLPRRKDGGKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQ 172 (776) T ss_pred --eEEecchHHHHHHHHHHHHhCCcceEEecCChhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEee Confidence 68999999999999999988754 566666667777777777653 456777888899999999999888876 Q ss_pred CC---CceEEEEecccceEEEEcCCC------CceeEEEEE-E-------------------------EE---------- Q lcl|NC_011308. 141 SE---DKLTFQTVDALQLLPVFDDYG------TLQRIIRFY-T-------------------------EQ---------- 175 (530) Q Consensus 141 ~~---g~~~~~~~~p~~~~~v~d~~~------~~~~~~~~y-~-------------------------~~---------- 175 (530) .+ +.+.+.+++|.++| ||.+. +...+++.. . .. T Consensus 173 ~~~~~~~~~~~~~~p~~i~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~ 250 (776) T protein:vir:93 173 DENDGEPIYAGAESWRNIL--WDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDA 250 (776) T ss_pred ccCCCCceEeeccChhhee--eccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhccccccc Confidence 53 34556778888876 44321 111111110 0 00 Q ss_pred -------------eecccccccceEEEEEEEcCCceEEEeecCC-cccchhhcccc-------ccccccceeeeeecccc Q lcl|NC_011308. 176 -------------RYSDADNKFNSIGHADVWTDTEVWYYVQKDE-GRSDEYVLDTT-------VNPNPSQHVLAVADGVD 234 (530) Q Consensus 176 -------------~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~-~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~ 234 (530) .........+.+..+|+|+...+..++.... +.......+.. +... ...+........ T Consensus 251 ~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g-~~~~~~~~~~~v 329 (776) T protein:vir:93 251 MDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESG-RAVLAVSPMMRM 329 (776) T ss_pred ccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcC-ceeehheeeeee Confidence 0000011123455566666554433322111 11111100000 0000 000000000000 Q ss_pred cceecccccccccccccccccCCccceEEeeCCc-----CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC Q lcl|NC_011308. 235 EAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT 309 (530) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~ 309 (530) ......+ .........+.+++++|+|+++.-. .+.|.+..++++++.+|.++|.+.+.+. +.++++-+|.. T Consensus 330 ~~~~~~g--~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l~--~~~~~~~~gav 405 (776) T protein:vir:93 330 HCAIMTT--RDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYILS--TNKVLMEEGAV 405 (776) T ss_pred EEEEEec--chhhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHhhc--CCceeeccccc Confidence 1111111 1122233455667889999887632 5789999999999999999999998874 56777777764 Q ss_pred CCchhhHHHH-HhhCcceecCCCC--ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccC-CcHHHHHHHHhh Q lcl|NC_011308. 310 NSPVDEIKKN-IQSKKIIQTKGEG--GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGN-ATNVVIKSRYTL 385 (530) Q Consensus 310 ~~~~~~~~~~-~~~~~~i~~~~~~--~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn-~SGvAik~~~~~ 385 (530) .. .+++... .+.+.++.+..++ .+.+.....-..+....+..+...|...|.+-+......|| .||+|+..+... T Consensus 406 ~~-~d~~~~~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~~~~ 484 (776) T protein:vir:93 406 DD-IDEFRREAARPDAVMTVKNGKLGAVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAVSGVAIQARQEQ 484 (776) T ss_pred cc-hHHHHHhcccCCceeeeCCccccccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchhhHHHHHHHHHH Confidence 33 3444432 3456677776654 34444333334667778899999999999776544333344 699999999888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-----------ccc----------------ceeeEEeCCCCCC-CH Q lcl|NC_011308. 386 LAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGD-----------YSS----------------TDIKFDIEPYILA-NE 437 (530) Q Consensus 386 l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~-----------~d~----------------~~i~i~f~~~~P~-n~ 437 (530) ........-+.|..++++++++++.++....... ..+ .+|.+.=.+..+. .. T Consensus 485 ~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~~~s~r~ 564 (776) T protein:vir:93 485 GSVATNKLFDNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEAEWRATMRQ 564 (776) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeecccchhHHH Confidence 8777777777778888777777777654321100 000 0111111111111 11 Q ss_pred HHHHHHHHHHHhcCCCcH----HHHHHhCCCCCCHHHHHHHHHHHHHHH-----------------HHHHHhhhcccccc Q lcl|NC_011308. 438 LDLAMIDKTEAETNQIQI----NNLLAIAPRIGDEETLKAICDTLDLDY-----------------EDVVKALEDQEVEE 496 (530) Q Consensus 438 ~e~a~~~~~~~~~g~iS~----et~l~~~~~vdd~~~e~~~~e~e~~e~-----------------~~~~~~~~~~~~~~ 496 (530) ...++++..+...+--.+ ..+++..++ .+.++..+++++..... +.....+..+.... T Consensus 565 ~~~~~l~ql~~~~~p~~~~~~~~~~~e~~d~-p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~~~q~q~~~~~a 643 (776) T protein:vir:93 565 AAVAELMEVIGKMPPEIALTMLDLLVENMDI-PNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQQQQYNDALAIA 643 (776) T ss_pred HHHHHHHHHHhhcChhhHHHHHHHHHHhcCc-cchHHHHHHHHHhhcccccchhhcchhHHHHHHHhhHHHHHHHHHhhh Confidence 111111111111110000 111222211 12222222222211000 00000000000000 Q ss_pred C--CccccCCC-CCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 497 L--EPTVTPII-DPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 497 ~--~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) . ........ .......+-...-....+....++. T Consensus 644 ~~~~~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~a 680 (776) T protein:vir:93 644 TLEEQQAKARKAAAEAQVAEAKAKHISRMAIREGVGA 680 (776) T ss_pred hhhHhhHHHHHHHHHHHHHhhhhhhhhhcchhhhhhh Confidence 0 00000000 0000000000000000000001111 No 78 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=99.77 E-value=1.1e-16 Score=108.14 Aligned_cols=458 Identities=9% Similarity=0.009 Sum_probs=230.5 Q ss_pred CCcccccCCcccHHHHHHHHHHHH-HHhhhHHHHHHHHHHhcccchhhhcccccccccccccc--cccC--Ccce----e Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEY-IRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVE--DDNA--SNIK----I 71 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~-~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~--~~~~--~n~k----i 71 (530) =+--|...-|+ .+. +| .-.....+....++-+.|...+-.+... |-.+..... .+.+ -..| + T Consensus 24 ~~~~~~~~m~d-V~~-------~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~-YLP~~~~~~~~~E~~~~Y~~rl~rA~ 94 (535) T protein:vir:80 24 PTSGLGPSLPN-VGY-------QRVEFGEMLPKWRKIMDCLSGQEAIKAKREE-YLPMPSVDSRDEEQRRRYETYLQRAI 94 (535) T ss_pred CCCCCCCCCCC-CCc-------CCHHHHHHHHHHHHHHHHhcChHHHHhcccc-cCCCCCcccCCcCCHHHHHHHHhhcc Confidence 11111111110 000 01 0122334455666667776554332211 111110000 0000 0012 3 Q ss_pred ecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHh--hccHHHHHHHHHHHHhhcCeEEEEEEecCCCc----- Q lcl|NC_011308. 72 SHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYY--NEEFQSAIQELVEGSTIKGYEGIFARTTSEDK----- 144 (530) Q Consensus 72 ~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~--~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~----- 144 (530) -.|+++.+|+..+|.+|.+||+++.. ..+..++.++- +++++.....++..+..+|+++.+|-+...+. T Consensus 95 ~~n~~~~tl~~l~G~vfrk~p~~~~p----~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~a 170 (535) T protein:vir:80 95 FYNVTARTLDGMMGQVFSRDPIRQLP----PALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVL 170 (535) T ss_pred CCChhHHHHHHHhchhhcCCcceecc----HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHH Confidence 47999999999999999999988543 23334444332 34677888889999999999999996655443 Q ss_pred --------eEEEEecccceEEEEcCC--C--CceeEEEEEEEEeecccccccceEEEEEEEcCC--ceEE---EeecCCc Q lcl|NC_011308. 145 --------LTFQTVDALQLLPVFDDY--G--TLQRIIRFYTEQRYSDADNKFNSIGHADVWTDT--EVWY---YVQKDEG 207 (530) Q Consensus 145 --------~~~~~~~p~~~~~v~d~~--~--~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~--~~~~---y~~~~~~ 207 (530) +.+..++|++++= |+.. + ...-.+++-......+.....+.+..+-+++.+ +.|. |+....+ T Consensus 171 de~~~~~rPy~~~y~ae~Iin-W~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~~ 249 (535) T protein:vir:80 171 EQKLGLYRPTITLVHPTSIIN-WRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGNYQVERWRRETQE 249 (535) T ss_pred HHHhcCCCcEEEEechhhccC-ccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCceEEEEEEEeecCC Confidence 5578888887743 3221 1 122223333332222223333444444444432 2222 2221111 Q ss_pred ccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC--Cc--CCCCcHHHHHHHHHH Q lcl|NC_011308. 208 RSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN--NK--LGISDIKKVKSIIDD 283 (530) Q Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n--n~--~~~sd~e~v~~liDa 283 (530) .... . ...........|.|+.||||+|.. |. .+.+-|.++-.|.-+ T Consensus 250 ~~~~--~----------------------------~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~ 299 (535) T protein:vir:80 250 EMYY--S----------------------------YSKHVPTDGNGNPFKEIPFQFIGPLDNNADIDHPPLLDLCEVNIG 299 (535) T ss_pred cccc--c----------------------------cceeecccCCCcccCeeEEEEeecCCCCCCCCccchHHHHHHHHH Confidence 0000 0 000011123458999999999854 33 245557777777778 Q ss_pred HHHHHHHHHHHHHHhccceeeeecCCCCchhhHH----HHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 284 YDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIK----KNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYR 359 (530) Q Consensus 284 ~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~----~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~ 359 (530) +=...|+..+.+...++|++++.|.+.+..+... ..+....++.++.+++++|+...-+.-+. ..++.+.+.+.. T Consensus 300 Hy~~ssd~~~il~~~~~P~l~i~G~~~~~~~~~~~~~~i~iG~~~~~~lP~~~~~~~~e~~~~~~a~-~~l~~~e~qM~~ 378 (535) T protein:vir:80 300 HYRNSADYEEMAFVAGQPTAFFTGLTKDWVEDVFKDFKVHLGSRAIIPLPQGATAGILQITPNSVPF-EAMTHKESQMIA 378 (535) T ss_pred HhhchhHHHHHHHHhcCceeeeecCchhhhhcCCCCcceEecCcccccCCCCCCcceeeeccchhHH-HHHHHHHHHHHH Confidence 7788888999999999999999997654322111 12334456778999999999887665554 457888888877 Q ss_pred HhcccCCCcccccCCcHHHHHHH----HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCC-- Q lcl|NC_011308. 360 SGMGFNSSAVGDGNATNVVIKSR----YTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYI-- 433 (530) Q Consensus 360 ~s~~p~~~~~~~gn~SGvAik~~----~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~-- 433 (530) ++...- ....++.+..+-+.. .+.|...|... ..+|.+.++++..+++.. .+...+.|..++.. T Consensus 379 lGa~ll--~~~~~~~Ta~~a~~~~~~~~S~L~~~a~~l----e~al~~aL~~~A~w~G~~----~~~~~~~i~~n~dF~~ 448 (535) T protein:vir:80 379 MGANLL--VKSGGNRTFGEAQQEEASEQSILSACTKNV----SMAFRKALRWANQFQTGI----VNDETVEYNLNTDFPA 448 (535) T ss_pred HHHHhh--ccCcccccHHHHHHHHHHHhHHHHHHHHHH----HHHHHHHHHHHHHHcCCc----cCCCceEEEecccccc Confidence 765431 222344433322333 33444444444 455555566666665432 12233444443322 Q ss_pred C-CCHHHHHHHHHHHHhcCCCcHHHHHHhC---CCCC---CHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCC Q lcl|NC_011308. 434 L-ANELDLAMIDKTEAETNQIQINNLLAIA---PRIG---DEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIID 506 (530) Q Consensus 434 P-~n~~e~a~~~~~~~~~g~iS~et~l~~~---~~vd---d~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (530) + -+. ..++.+..+..+|.||++|++..| +.++ +.++|+.+++.|-.+. ....+...+....+++ T Consensus 449 ~~ld~-~~~~all~~~~~G~Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~-----~~~~g~~~d~~~~g~~--- 519 (535) T protein:vir:80 449 ARLTP-NERAELILEWQQGAITFKEMRAGLRRAGVASEDDAKAETEGKATVEFIAK-----TAAAGKVGDAASGGTN--- 519 (535) T ss_pred ccCCH-HHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhc-----cccCCCCCCCCCCCCC--- Confidence 1 133 334445677778999999998877 3332 1233333333322111 1111111100000100 Q ss_pred CCCCCCccCcCCCCcccc Q lcl|NC_011308. 507 PLTIEPQPEPLNIDPVIE 524 (530) Q Consensus 507 ~~~~~~~~~~~~~~~~~~ 524 (530) +.+.+....+.++.-+ T Consensus 520 --~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:80 520 --KAKLNNGNGGGNQAGN 535 (535) T ss_pred --cCcccCCccccccCCC Confidence 0111111122222222 No 79 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=99.76 E-value=1.5e-16 Score=107.27 Aligned_cols=446 Identities=9% Similarity=0.009 Sum_probs=239.1 Q ss_pred ccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcce----eecCchhhH Q lcl|NC_011308. 4 TLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIK----ISHGFFAEL 79 (530) Q Consensus 4 ~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~k----i~~n~~k~I 79 (530) ++-.+-.- .++-.+ +-.......+.+..++-|.|......|.. +-.......++..-..| +-.|+++.+ T Consensus 1 ~~~~~~~~--~~V~~~---hp~y~a~~~~W~~ird~~~G~~~~~~r~~--yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~t 73 (489) T protein:vir:78 1 MLTENGQG--SGVKTK---HREWLHYAPKWQKVRHALAGELVSYLRNV--GLNEPDKAYGEARQAEYEAGGIVYNFTRRT 73 (489) T ss_pred CccCCCcc--CCCCcc---CHHHHHHHHHHHHHHHHhcCcccccccCC--CCCCCCCCCChHHHHHHHhccccCChHHHH Confidence 22222110 000000 11112234556677777888542111111 11010111111101112 247999999 Q ss_pred HhhhhhhhcccceeeecCCcchHHHHHHHHHHh--hccHHHHHHHHHHHHhhcCeEEEEEEecCCC------------ce Q lcl|NC_011308. 80 VDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYY--NEEFQSAIQELVEGSTIKGYEGIFARTTSED------------KL 145 (530) Q Consensus 80 vd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~--~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g------------~~ 145 (530) ++..+|.+|-+||++... ..+..++.++- +++++.....+...+..+|+++.+|=+...+ .+ T Consensus 74 l~~l~G~vfrk~p~~~~p----~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rP 149 (489) T protein:vir:78 74 LSGMVGSVMRKEPEINIP----KELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLLNP 149 (489) T ss_pred HHHHhchhhcCCcceecc----HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCc Confidence 999999999999998543 22444444442 3567788888999999999999999877655 36 Q ss_pred EEEEecccceEEEEc--CCCC--ceeEEEEEEEEeecc--cccccceEEEEEEEcCC--ce---EEEeecCCcccchhhc Q lcl|NC_011308. 146 TFQTVDALQLLPVFD--DYGT--LQRIIRFYTEQRYSD--ADNKFNSIGHADVWTDT--EV---WYYVQKDEGRSDEYVL 214 (530) Q Consensus 146 ~~~~~~p~~~~~v~d--~~~~--~~~~~~~y~~~~~~~--~~~~~~~~~~~evyt~~--~~---~~y~~~~~~~~~~~~~ 214 (530) .+..++|.+++= |+ ..+. ..-.+++-......+ .....+.+..+-+++.+ +. ..|+....+...... T Consensus 150 y~~~~~~~~Iin-W~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~~~- 227 (489) T protein:vir:78 150 TIAFYTTENIVN-WRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQEDV- 227 (489) T ss_pred EEEEechhhhcC-ceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCccccee- Confidence 688888888743 22 1111 222233333222211 23344555666666654 22 222222221110000 Q ss_pred cccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCc----CCCCcHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 215 DTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK----LGISDIKKVKSIIDDYDLMNCF 290 (530) Q Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~----~~~sd~e~v~~liDa~~~~~S~ 290 (530) .........+.++.|||+++.... .+.+-|.++-.|--+.=...|+ T Consensus 228 ------------------------------~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd 277 (489) T protein:vir:78 228 ------------------------------VEIYPDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSAD 277 (489) T ss_pred ------------------------------eEEeccCCCCccCeeeEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhH Confidence 001112345789999999986433 2445577777777777778888 Q ss_pred HHHHHHHhccceeeeecCCCCchhhHHH-------HHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHh-c Q lcl|NC_011308. 291 LSNNLQDMAEAIYVVRGGTNSPVDEIKK-------NIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSG-M 362 (530) Q Consensus 291 ~~n~~~~~~~~~lvl~g~~~~~~~~~~~-------~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s-~ 362 (530) .-+.+...+.|.+++.|.+....+ ... -+....++.++.+|+++|+....+.. .+..++.+++.+...+ . T Consensus 278 ~~~~l~~~~~P~l~i~G~d~~~~~-~~~~~~~~~i~~g~~~~~~lp~~~~~~~ie~~~~~~-~r~~l~~le~qm~~lGa~ 355 (489) T protein:vir:78 278 NEESSFVVGQPTLFIYPGENLTPQ-AFKEANPNGIKFGSRRGHNLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQIGAQ 355 (489) T ss_pred HHHHHHHcccceeeeecCccCCcc-cccccCccceeeCCcccccCCCCCCcceeccCcchH-HHHHHHHHHHHHHHHhhh Confidence 889999999999999996543211 111 12334456677889999999886544 4667888888877764 3 Q ss_pred ccCCCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHH Q lcl|NC_011308. 363 GFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAM 442 (530) Q Consensus 363 ~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~ 442 (530) +. . ..++-|+.+.+.....-...-...-..+..++.+.+++++.+++........ ..+...|... +-+. ..++ T Consensus 356 l~---~-~~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w~G~~~~~~~~-i~~n~dF~~~-~~d~-~~~~ 428 (489) T protein:vir:78 356 LI---T-PTQQITAQSARIQRGADTSVMATIARNVSQAYTDALRWVAVMLGKPEDTEVE-FRLNMDFFLE-PMTA-QDRA 428 (489) T ss_pred hc---c-CCcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCceE-EEeecccCcc-cCCH-HHHH Confidence 33 2 2356788877777666665555666666777777777887776654221110 1223334221 2233 3344 Q ss_pred HHHHHHhcCCCcHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCC Q lcl|NC_011308. 443 IDKTEAETNQIQINNLLAIAPR--IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNID 520 (530) Q Consensus 443 ~~~~~~~~g~iS~et~l~~~~~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (530) .+..+..+|.||++|.+..|.. |-|++.+ +++.+.++ +..+ .. .+.+.+-|.+.+ T Consensus 429 al~~~~~~G~is~~t~~~~L~~~gv~d~~~e-----~~~~ei~~-------~~~~-~~----------~~~~g~~~~~~q 485 (489) T protein:vir:78 429 AWMADINAGLLPATAYYAALRKAGVTDWTDA-----DIKDAVAD-------QPLP-VA----------TEVQGEIPQSAQ 485 (489) T ss_pred HHHHHHhcCCCCHHHHHHHHHhCCCCCccHH-----HHHHHHhh-------cCCC-cc----------cCCcccCCCCcc Confidence 4566778999999999887642 3333321 11111111 1111 11 111111111111 Q ss_pred cccc Q lcl|NC_011308. 521 PVIE 524 (530) Q Consensus 521 ~~~~ 524 (530) +... T Consensus 486 ~~~~ 489 (489) T protein:vir:78 486 QQEK 489 (489) T ss_pred cccC Confidence 1111 No 80 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.73 E-value=2.7e-16 Score=105.90 Aligned_cols=490 Identities=12% Similarity=0.024 Sum_probs=241.3 Q ss_pred CCc--c----cccCCcccHHHHHHHHHHHHHHh-----hhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcc Q lcl|NC_011308. 1 MTN--T----LLTTAPDRLGTILSTKIDEYIRS-----QNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNI 69 (530) Q Consensus 1 ~~~--~----~~~~~~~~~~~~i~~~i~~~~~~-----~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ 69 (530) |.. + +...+.++..+++.++...|... +.+....+-.+||.|.|= ........+....| T Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw--------~~~~~~~l~~~g~p-- 79 (711) T protein:vir:10 10 VEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQW--------PSQVRTERELEQRP-- 79 (711) T ss_pred ccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCC--------CHHHHHHHHhcCCC-- Confidence 111 1 22334455667777776655432 224456677899999861 00000111222333 Q ss_pred eeecCchhhHHhhhhhhhcccceeeecC------------------------CcchHHHHHHHHHHhh-----ccHHHHH Q lcl|NC_011308. 70 KISHGFFAELVDQKTQYLLANGIDVKPT------------------------DHDDQKLCYLIEEYYN-----EEFQSAI 120 (530) Q Consensus 70 ki~~n~~k~Ivd~~~~yl~G~pv~~~~~------------------------~~~de~~~~~l~~~~~-----~~~~~~~ 120 (530) .+.+|..+.+|+..+|+--.+.+.+... +.+|.+..+.|+.+++ ++..... T Consensus 80 ~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~ 159 (711) T protein:vir:10 80 CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEY 159 (711) T ss_pred cEEEcchHHHHHHHhhhHhhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcChhHHH Confidence 6889999999999999999888876433 2456677777777543 3556667 Q ss_pred HHHHHHHhhcCeEEEEEEec---C---CCceEEEEe-cccceEEEEcCCC------Cce-eEEEEEEEEe---------- Q lcl|NC_011308. 121 QELVEGSTIKGYEGIFARTT---S---EDKLTFQTV-DALQLLPVFDDYG------TLQ-RIIRFYTEQR---------- 176 (530) Q Consensus 121 ~e~~~~~~~~G~a~~~~y~d---~---~g~~~~~~~-~p~~~~~v~d~~~------~~~-~~~~~y~~~~---------- 176 (530) ..+..++.++|.+|.-+++| . +|++++..+ +|.++| ||... +.. .+.+.|.... T Consensus 160 s~af~d~~~~G~G~~ev~~d~~~~d~~~~e~~i~~v~~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~yp~~a 237 (711) T protein:vir:10 160 DIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDAT 237 (711) T ss_pred HHHHHHhhhcCcceEEEEecccCCCCCCCCeEEeeecChhhee--eCccccccChhhhcceeeeecCCHHHHHHhCCchh Confidence 78888999999999766544 2 478898887 798864 55321 111 2222221100 Q ss_pred ----------ecccccccceEEEEEEEcCCceEEEeec-CCcccchhhcccc-----ccccccceeeee-ecccccceec Q lcl|NC_011308. 177 ----------YSDADNKFNSIGHADVWTDTEVWYYVQK-DEGRSDEYVLDTT-----VNPNPSQHVLAV-ADGVDEAILD 239 (530) Q Consensus 177 ----------~~~~~~~~~~~~~~evyt~~~~~~y~~~-~~~~~~~~~~~~~-----~~~~~~~~~~~~-~~~~~~~~~~ 239 (530) ....+.....+..+++|........... ..+.......... ............ .......... T Consensus 238 ~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~ 317 (711) T protein:vir:10 238 AEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKI 317 (711) T ss_pred hhhhhcccccccCcccCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEE Confidence 0011111234555666665443322211 1111111000000 000000000000 0000000011 Q ss_pred ccccccccccccccccCCccceEEeeCC-------cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeee-ecCCCC Q lcl|NC_011308. 240 EGVEEHEGRQVLGRSYKSRFPFDILYNN-------KLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVV-RGGTNS 311 (530) Q Consensus 240 ~~~~~~~~~~~~~~~~~~~iPiv~~~nn-------~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl-~g~~~~ 311 (530) .+.. ......+.+.+++|+|||... ..+.|.+..+++.++.+|.+.|.....+.-.+++-+++ .|... T Consensus 318 ~G~~---~L~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~gai~- 393 (711) T protein:vir:10 318 TGAN---VLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVE- 393 (711) T ss_pred ecce---eecCCCCCCCCcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCcccC- Confidence 1111 112334556678888877532 23567788999999999999999999999988866555 55443 Q ss_pred chhh-HHH-HHhhCcceecCCCC----ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHh Q lcl|NC_011308. 312 PVDE-IKK-NIQSKKIIQTKGEG----GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYT 384 (530) Q Consensus 312 ~~~~-~~~-~~~~~~~i~~~~~~----~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~ 384 (530) +.++ +.. ..+.+.++.+..++ .++++....-..+....++.+...|-..|.+.+..-... ++.||+|+..+.. T Consensus 394 ~~~~~~~e~~~~~~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~q~ 473 (711) T protein:vir:10 394 GREDEWEQANTKNFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQR 473 (711) T ss_pred ChHHHHHhccccCCCeeEecccccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccchHHHHHHHHHH Confidence 3333 322 23456677765543 467766566667788889999999999888765433333 3479999999988 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--------Cc---ccc-------------------------ceeeEE Q lcl|NC_011308. 385 LLAMKAQKTEIALRKTLRWTADLVVEDIRRRGL--------GD---YSS-------------------------TDIKFD 428 (530) Q Consensus 385 ~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~--------~~---~d~-------------------------~~i~i~ 428 (530) .........-..|..+++++.+++++++..... +. .++ .+|.+. T Consensus 474 qg~~~l~~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~ 553 (711) T protein:vir:10 474 QGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVT 553 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccceeeeEEEEe Confidence 877777766677777777777776666543210 00 000 012222 Q ss_pred eCCCCCCCHHHHHHHHHHHHhcCCCcH------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCcccc Q lcl|NC_011308. 429 IEPYILANELDLAMIDKTEAETNQIQI------NNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVT 502 (530) Q Consensus 429 f~~~~P~n~~e~a~~~~~~~~~g~iS~------et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~ 502 (530) =.+..|.-..+.+..++.+ .+.++. ..+++.+++ .+.++..+++.+. ..+.. T Consensus 554 ~~p~~~s~r~~~~~~l~ql--~~~~p~~~~~~~~~il~~~d~-p~~~el~e~lr~~-----------~~~~~-------- 611 (711) T protein:vir:10 554 TGPAFATQRIEAAEAMIQF--AQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKI-----------VPPNV-------- 611 (711) T ss_pred eccCchhHHHHHHHHHHHH--HhhcchhhhHHHHHHHHhcCC-CCHHHHHHHHHhh-----------cCccc-------- Confidence 2223332222222221111 111211 112332222 2222222222111 00000 Q ss_pred CCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 503 PIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 503 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) +..+...+..+..+ ..+....++-.. T Consensus 612 ~~~~~~~~~qq~~~--e~qq~~~~~q~~ 637 (711) T protein:vir:10 612 LSKDEREAIEEDMP--EQTEPTPEQQVE 637 (711) T ss_pred CcchhhhHHHHHHH--HHHHHHHHHHHH Confidence 00000000000000 000000000000 No 81 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.71 E-value=4.1e-16 Score=104.94 Aligned_cols=482 Identities=13% Similarity=0.072 Sum_probs=231.3 Q ss_pred CCcccccCCcccHHHHHHHHHHHHH----Hhh-hHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCc Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYI----RSQ-NVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGF 75 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~----~~~-~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~ 75 (530) ++..-+..+.....++..+.+..|. ++. -+..+.+-.+||.|.|= ........+....| .+.+|. T Consensus 5 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw--------~~~~~~~l~~~g~p--~~~~N~ 74 (714) T protein:vir:10 5 INTTAMKNDHGSTPRFSQRQLLSLCSDIDSQPLWRDAANKACAYYDGDQL--------APEVIQVLKDRGQP--MTIHNL 74 (714) T ss_pred cCcccCCCcchhhhhhhHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCC--------CHHHHHHHHhcCCC--cEEecc Confidence 3332222222223333333333332 211 13456688899999871 01111112222334 688999 Q ss_pred hhhHHhhhhhhhcccceeeecCC--cc--hHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCeEEEEEEecCC---C Q lcl|NC_011308. 76 FAELVDQKTQYLLANGIDVKPTD--HD--DQKLCYLIEEYYN-----EEFQSAIQELVEGSTIKGYEGIFARTTSE---D 143 (530) Q Consensus 76 ~k~Ivd~~~~yl~G~pv~~~~~~--~~--de~~~~~l~~~~~-----~~~~~~~~e~~~~~~~~G~a~~~~y~d~~---g 143 (530) .+.+|+..+|+.-.+.+.+.... .+ +++..+.|+.++. ++.......+..++.++|.+|.-+|+|.+ + T Consensus 75 i~~~v~~v~g~~~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d~d~~~~ 154 (714) T protein:vir:10 75 IAPTVDGVLGMEAKTRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSEPFGP 154 (714) T ss_pred HHHHHHHHHHHHHhCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeeeccCCCCC Confidence 99999999999999988765432 22 2356677766653 35667778888999999999998887754 6 Q ss_pred ceEEEEecccceEEEEcCCC------CceeEE-EEEEE------------------------------------------ Q lcl|NC_011308. 144 KLTFQTVDALQLLPVFDDYG------TLQRII-RFYTE------------------------------------------ 174 (530) Q Consensus 144 ~~~~~~~~p~~~~~v~d~~~------~~~~~~-~~y~~------------------------------------------ 174 (530) .+++..++|.++|+ |... +...++ +.+.. T Consensus 155 ~i~i~~v~p~~v~~--Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (714) T protein:vir:10 155 EFKVSTVSRNEVFW--DWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAW 232 (714) T ss_pred CeEEEecChhheee--ccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhcccccccc Confidence 79999999999864 4311 111111 10000 Q ss_pred --------EeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecc----------cccc Q lcl|NC_011308. 175 --------QRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADG----------VDEA 236 (530) Q Consensus 175 --------~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~ 236 (530) ..........+.+..+|+|.......++..... .....++ .....+...+..+ .... T Consensus 233 ~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~-g~~~~~d----~~~~~~~~~~~~g~~~~~~~~~~rv~~ 307 (714) T protein:vir:10 233 EEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSN-GRVVAFD----KNNLMQAVAVASGRVQVKVGRVSRIRE 307 (714) T ss_pred hhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCC-CCeeeeC----ccCHHHHHHHHhccceecccceeeEEE Confidence 000000112234566777776655554433211 1111000 0000000000000 0000 Q ss_pred eecccccccccccccccccCCccceEEeeCCc-----CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCC Q lcl|NC_011308. 237 ILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNS 311 (530) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~ 311 (530) ....+. ........+.+.+++|+|||+... ...|.+-.+++.++.+|...|.....+. ++..++..|.... T Consensus 308 ~~~~g~--~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~~~~~gav~~ 383 (714) T protein:vir:10 308 AWFVGP--HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ--AKRVIMDEDATQL 383 (714) T ss_pred EEEecc--hhhhcCCCCCCCCceeeEEecceeeeccCccceehhhhhhHHHHHHHHHHHHHHHHh--CCceeeccccccc Confidence 001111 111123344565667777665432 2457778899999999999999888763 4455666676544 Q ss_pred chhhHHHHH-hhCcceecCCC----C----ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccccc-CCcHHHHHH Q lcl|NC_011308. 312 PVDEIKKNI-QSKKIIQTKGE----G----GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDG-NATNVVIKS 381 (530) Q Consensus 312 ~~~~~~~~~-~~~~~i~~~~~----~----~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~g-n~SGvAik~ 381 (530) ...++.... +.++++.+..+ + .++......-..++...+......|-..|.+-+..-...| +.||+||.. T Consensus 384 ~d~~~~e~~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAI~~ 463 (714) T protein:vir:10 384 SDNDLMEQLERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISN 463 (714) T ss_pred cHHHHHHhccCCCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcchhHHHHHHH Confidence 333444433 34556665432 1 1333332333456677788888999998876543322223 469999998 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--------Ccccc----ceeeE---------------------- Q lcl|NC_011308. 382 RYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGL--------GDYSS----TDIKF---------------------- 427 (530) Q Consensus 382 ~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~--------~~~d~----~~i~i---------------------- 427 (530) +-..........-..|+.+.+++.+++++++..... +..+. ..+.+ T Consensus 464 r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i 543 (714) T protein:vir:10 464 LVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIAL 543 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccceeeeEEEEE Confidence 877666665556666666776666666665432110 00000 01111 Q ss_pred EeCCCCCCCHHHHHHHHHHHHhc-----CCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCcccc Q lcl|NC_011308. 428 DIEPYILANELDLAMIDKTEAET-----NQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVT 502 (530) Q Consensus 428 ~f~~~~P~n~~e~a~~~~~~~~~-----g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~ 502 (530) .=.+..|.-..+.++.+..+... +.+....+++.+.+ .+.++..+++.+. +..+ . T Consensus 544 ~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~-p~~~ei~~~ir~~----------~~~~---------~ 603 (714) T protein:vir:10 544 APVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAA----------LGTP---------K 603 (714) T ss_pred eeccCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-cCHHHHHHHHHHH----------cCCC---------C Confidence 11122222222222222222211 11222334444433 3333333333221 0000 0 Q ss_pred CCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 503 PIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 503 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) + +...++.+.+. + ..+++.++ T Consensus 604 ~---~~~~~~e~q~~--q--~~~~~~~~ 624 (714) T protein:vir:10 604 S---PDEMTPEEQEV--A--AQQQALQQ 624 (714) T ss_pred C---ccccCcchhHH--H--HHHHHHHH Confidence 0 00000000000 0 00111111 No 82 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=99.70 E-value=2.5e-15 Score=100.65 Aligned_cols=448 Identities=10% Similarity=-0.006 Sum_probs=234.1 Q ss_pred ccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcce----eecCchhhH Q lcl|NC_011308. 4 TLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIK----ISHGFFAEL 79 (530) Q Consensus 4 ~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~k----i~~n~~k~I 79 (530) ++-++-.- .++-.+ +-......++.+..++-|.|.+.-..|. .+-.......++..-..| +-.|+++.+ T Consensus 1 ~~~~~~~~--~~V~~~---hp~y~a~~~~W~~ird~~~G~~~~~~r~--~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~t 73 (491) T protein:vir:95 1 MLTANGQG--SGVKTK---HREWLHYAPKWQKVRHALAGDLVGYLRN--VGLNEPDKAYGEARQAEYEAGGIVYNFTRRT 73 (491) T ss_pred CcccCCcc--CCCCcc---CHHHHHHHHHHHHHHHHhcCcchhhccc--CCCcCCCCCCCHHHHHHHHhcccCCChHHHH Confidence 33332110 000000 1111233455667777788853211111 111111111111111112 347999999 Q ss_pred HhhhhhhhcccceeeecCCcchHHHHHHHHHHh--hccHHHHHHHHHHHHhhcCeEEEEEEecCCC------------ce Q lcl|NC_011308. 80 VDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYY--NEEFQSAIQELVEGSTIKGYEGIFARTTSED------------KL 145 (530) Q Consensus 80 vd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~--~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g------------~~ 145 (530) ++..+|.+|-+||++...+ .+..++.++- +++++.....+...+..+|+++.+|=+.+.+ .+ T Consensus 74 l~~l~G~vfrk~p~~~~p~----~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade~~~~~rP 149 (491) T protein:vir:95 74 LSGMVGSVMRKEPEINIPK----ELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQNAGLLNP 149 (491) T ss_pred HHHHhchhhcCCceeeccH----HHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHHHHhcCCc Confidence 9999999999999985432 2344444442 3567788888999999999999999776554 35 Q ss_pred EEEEecccceEEEEcC--CC--CceeEEEEEEEEeecc--cccccceEEEEEEEcCC--ceEE---EeecCCcccchhhc Q lcl|NC_011308. 146 TFQTVDALQLLPVFDD--YG--TLQRIIRFYTEQRYSD--ADNKFNSIGHADVWTDT--EVWY---YVQKDEGRSDEYVL 214 (530) Q Consensus 146 ~~~~~~p~~~~~v~d~--~~--~~~~~~~~y~~~~~~~--~~~~~~~~~~~evyt~~--~~~~---y~~~~~~~~~~~~~ 214 (530) .+..++|.+++= |+. .+ ...-.+++-......+ .....+.+..+-+++.+ +.+. |+....+.... T Consensus 150 y~~~~~~~~Iin-W~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g~~~~--- 225 (491) T protein:vir:95 150 TIAFYTTENIVN-WRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEGGAQE--- 225 (491) T ss_pred EEEEechhhhcC-ceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEcCCCccee--- Confidence 678888888743 221 11 1222233333221111 22233344444444432 2222 22111111000 Q ss_pred cccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCc--C--CCCcHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 215 DTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK--L--GISDIKKVKSIIDDYDLMNCF 290 (530) Q Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~--~--~~sd~e~v~~liDa~~~~~S~ 290 (530) ...........+.|+.||||++.... . +.+-|.++-.|--+.=...|+ T Consensus 226 ----------------------------~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd 277 (491) T protein:vir:95 226 ----------------------------EVVEIYPDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSAD 277 (491) T ss_pred ----------------------------eeeeeeecCCCcccCeeEEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhH Confidence 00011112345789999999986433 2 344467777776677778888 Q ss_pred HHHHHHHhccceeeeecCCCCchhhHHHH-------HhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHh-c Q lcl|NC_011308. 291 LSNNLQDMAEAIYVVRGGTNSPVDEIKKN-------IQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSG-M 362 (530) Q Consensus 291 ~~n~~~~~~~~~lvl~g~~~~~~~~~~~~-------~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s-~ 362 (530) .-+.+...+.|.+++.|.+....+ .... +.....+.++.+|++.|+....+.- .+..++.++..+...+ . T Consensus 278 ~~~~l~~~~~P~l~~~G~d~~~~~-~~~~~~~~~i~~g~~~~~~lP~~~~~~~ie~~~~~~-~~~~l~~~e~qm~~~Ga~ 355 (491) T protein:vir:95 278 NEESSFVVGQPTLFIYPGDNLTPQ-SFKEANPNGIKFGSRCGHNLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQIGAQ 355 (491) T ss_pred HHHHHHHcccceeeeecCcccCcc-hhhccCcceeEecCcCCcCCCCCCccceeecCcchH-HHHHHHHHHHHHHHHHHH Confidence 888899999999999996643222 1111 1223345667889999999886554 4666777777766663 3 Q ss_pred ccCCCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHH Q lcl|NC_011308. 363 GFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAM 442 (530) Q Consensus 363 ~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~ 442 (530) +. .. .++-||.+.+.....-...-...-.....++.+.+++++.+++....... ...+...|... +-+ .+.++ T Consensus 356 l~---~~-~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w~G~~~~~~v-~i~~n~dF~~~-~~~-~~~~~ 428 (491) T protein:vir:95 356 LI---TP-SQQITAESARIQRGADTSVMATIARNVSQAYTDALRWVAMMLGKPEDSEV-EFQLNMDFFLQ-PMT-AQDRA 428 (491) T ss_pred hc---cC-CcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCce-EEEeecccccc-cCC-HHHHH Confidence 32 22 25678888887766666665666666777777777788777654321110 01223334222 223 33455 Q ss_pred HHHHHHhcCCCcHHHHHHhCC--CCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccC Q lcl|NC_011308. 443 IDKTEAETNQIQINNLLAIAP--RIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPE 515 (530) Q Consensus 443 ~~~~~~~~g~iS~et~l~~~~--~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (530) .+..+..+|.||++|.+..|. .|-|.+.| ++..+.++....+. ...+.+...++.. ..+++ T Consensus 429 all~~~~~G~is~~t~~~~L~~~~vl~~~~e-----~~~~~ie~~~~~~~-----~~~~~~~~~~~~~--~~~~~ 491 (491) T protein:vir:95 429 AWMADINAGLLPATAYYAALRKAGVTDWTDE-----DILNAIEDAPLPSG-----AVTQVAGEIPQAA--QQQQE 491 (491) T ss_pred HHHHHHhcCCCCHHHHHHHHHhCCCCCccHH-----HHHHHHHhcCCCCC-----ccccccccchhhh--hhccC Confidence 566777889999999988764 23333221 11111111111111 1111111111111 11111 No 83 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.69 E-value=4.7e-15 Score=99.13 Aligned_cols=484 Identities=12% Similarity=0.047 Sum_probs=228.0 Q ss_pred CC-----cccccCCcccHHHHHHHHHHHHHHh-----hhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcce Q lcl|NC_011308. 1 MT-----NTLLTTAPDRLGTILSTKIDEYIRS-----QNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIK 70 (530) Q Consensus 1 ~~-----~~~~~~~~~~~~~~i~~~i~~~~~~-----~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~k 70 (530) |. -.-..+.+.+ -++-.+++..|... .-+..+.+..+||.|.|= ........+....| . T Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw--------~~~~~~~l~~~g~p--~ 69 (714) T protein:vir:99 1 MKNETNTMATKNDNGAT-PRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQL--------PPEVLQVLKDRGQP--M 69 (714) T ss_pred CCcccccccCCCCcchh-HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCC--------CHHHHHHHHhcCCC--c Confidence 21 1111122222 12333333333221 123456788899999871 00111111222333 6 Q ss_pred eecCchhhHHhhhhhhhcccceeeecCC--c--chHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCeEEEEEEecC Q lcl|NC_011308. 71 ISHGFFAELVDQKTQYLLANGIDVKPTD--H--DDQKLCYLIEEYYN-----EEFQSAIQELVEGSTIKGYEGIFARTTS 141 (530) Q Consensus 71 i~~n~~k~Ivd~~~~yl~G~pv~~~~~~--~--~de~~~~~l~~~~~-----~~~~~~~~e~~~~~~~~G~a~~~~y~d~ 141 (530) +.+|..+.+|+..+|+--.+.+.+.... . .+.+..+.|+.++. ++.......+..++.++|.+|.-+|++. T Consensus 70 ~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~ 149 (714) T protein:vir:99 70 TIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS 149 (714) T ss_pred EEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc Confidence 8899999999999999999988765432 2 22346666666553 3566777888899999999998888774 Q ss_pred C---CceEEEEecccceEEEEcCCC------Cc-eeEEEEEEEEee---------------------------------- Q lcl|NC_011308. 142 E---DKLTFQTVDALQLLPVFDDYG------TL-QRIIRFYTEQRY---------------------------------- 177 (530) Q Consensus 142 ~---g~~~~~~~~p~~~~~v~d~~~------~~-~~~~~~y~~~~~---------------------------------- 177 (530) + +.++|..++|.++| ||.+. +. -.+.+.+..... T Consensus 150 d~~~~~i~i~~v~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~ 227 (714) T protein:vir:99 150 DPFGPEFKVSTVSRNEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSP 227 (714) T ss_pred CCCCCCeEEEecchhhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccc Confidence 3 56899999999976 44321 11 111111111000 Q ss_pred ----------------cccccccceEEEEEEEcCCceEEEeecCCcccchhhcccc-------ccccccceeeeeecccc Q lcl|NC_011308. 178 ----------------SDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTT-------VNPNPSQHVLAVADGVD 234 (530) Q Consensus 178 ----------------~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~ 234 (530) .......+.+..+++|.............. .....++.. +... ...+........ T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~-g~~~~~d~~~~~~~~~~~~g-~~~~~~~~~~rv 305 (714) T protein:vir:99 228 LMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSN-GRVVAFDKNNLMQAVAVASG-RVQVKVGRVSRI 305 (714) T ss_pred cccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCC-CceEEeCccCHHHHHHHhhc-chhhhccccceE Confidence 000001123444566655444333322211 110000000 0000 000000000000 Q ss_pred cceecccccccccccccccccCCccceEEeeCCc-----CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC Q lcl|NC_011308. 235 EAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT 309 (530) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~ 309 (530) ......+. .....++.|.+.+++|+|||+... ...|.+-.+++.++.+|+..|.....+ .++..++..|.. T Consensus 306 ~~~~~~g~--~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~ 381 (714) T protein:vir:99 306 REAWFVGP--HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDAT 381 (714) T ss_pred EEEEEecC--cccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcc Confidence 00011111 111122334444556666655432 234667788999999999999988876 355556666655 Q ss_pred CCchhhHHHH-HhhCcceecCCCC--------ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccccc-CCcHHHH Q lcl|NC_011308. 310 NSPVDEIKKN-IQSKKIIQTKGEG--------GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDG-NATNVVI 379 (530) Q Consensus 310 ~~~~~~~~~~-~~~~~~i~~~~~~--------~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~g-n~SGvAi 379 (530) .....++... -+.++++.+..+. .++......-..+.-..+......|-..|.+-+..-...| ..||+|+ T Consensus 382 ~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi 461 (714) T protein:vir:99 382 QLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAI 461 (714) T ss_pred cccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHH Confidence 4433344333 2344566654321 1343333334566667788888888888876543322223 3699999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----------cCCCc-cc---c--------------------cee Q lcl|NC_011308. 380 KSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRR----------RGLGD-YS---S--------------------TDI 425 (530) Q Consensus 380 k~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~----------~~~~~-~d---~--------------------~~i 425 (530) ..+-..........-..++.+.+++.+++++++.. .+..+ .. . .+| T Consensus 462 ~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv 541 (714) T protein:vir:99 462 SNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHI 541 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEE Confidence 88877665555555555666666665555554422 11000 00 0 122 Q ss_pred eEEeCCCCCCCHHHHHHHHHHHHhc-----CCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCcc Q lcl|NC_011308. 426 KFDIEPYILANELDLAMIDKTEAET-----NQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPT 500 (530) Q Consensus 426 ~i~f~~~~P~n~~e~a~~~~~~~~~-----g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~ 500 (530) .+.=.+..|.-..+.++.+..+... +.+....+++.+++ .+.++..+++.+. ..+ . T Consensus 542 ~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~-----------~~~-~------ 602 (714) T protein:vir:99 542 ALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAA-----------LGT-P------ 602 (714) T ss_pred EEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHH-----------cCC-C------ Confidence 2333333444333444333333221 12333455666654 4555554444331 000 0 Q ss_pred ccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 501 VTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 501 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) .+ +...++.+.+. ....+..++ T Consensus 603 -~~---~~~~~~e~q~~----~~~~q~~~~ 624 (714) T protein:vir:99 603 -KS---PDEMTPEEQEV----AAQQQALQQ 624 (714) T ss_pred -CC---ccccchhhHHH----HHHHHHHHH Confidence 00 00000100000 000001111 No 84 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.69 E-value=4.7e-15 Score=99.13 Aligned_cols=484 Identities=12% Similarity=0.047 Sum_probs=228.0 Q ss_pred CC-----cccccCCcccHHHHHHHHHHHHHHh-----hhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcce Q lcl|NC_011308. 1 MT-----NTLLTTAPDRLGTILSTKIDEYIRS-----QNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIK 70 (530) Q Consensus 1 ~~-----~~~~~~~~~~~~~~i~~~i~~~~~~-----~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~k 70 (530) |. -.-..+.+.+ -++-.+++..|... .-+..+.+..+||.|.|= ........+....| . T Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw--------~~~~~~~l~~~g~p--~ 69 (714) T protein:vir:81 1 MKNETNTMATKNDNGAT-PRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQL--------PPEVLQVLKDRGQP--M 69 (714) T ss_pred CCcccccccCCCCcchh-HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCC--------CHHHHHHHHhcCCC--c Confidence 21 1111122222 12333333333221 123456788899999871 00111111222333 6 Q ss_pred eecCchhhHHhhhhhhhcccceeeecCC--c--chHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCeEEEEEEecC Q lcl|NC_011308. 71 ISHGFFAELVDQKTQYLLANGIDVKPTD--H--DDQKLCYLIEEYYN-----EEFQSAIQELVEGSTIKGYEGIFARTTS 141 (530) Q Consensus 71 i~~n~~k~Ivd~~~~yl~G~pv~~~~~~--~--~de~~~~~l~~~~~-----~~~~~~~~e~~~~~~~~G~a~~~~y~d~ 141 (530) +.+|..+.+|+..+|+--.+.+.+.... . .+.+..+.|+.++. ++.......+..++.++|.+|.-+|++. T Consensus 70 ~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~ 149 (714) T protein:vir:81 70 TIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS 149 (714) T ss_pred EEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc Confidence 8899999999999999999988765432 2 22346666666553 3566777888899999999998888774 Q ss_pred C---CceEEEEecccceEEEEcCCC------Cc-eeEEEEEEEEee---------------------------------- Q lcl|NC_011308. 142 E---DKLTFQTVDALQLLPVFDDYG------TL-QRIIRFYTEQRY---------------------------------- 177 (530) Q Consensus 142 ~---g~~~~~~~~p~~~~~v~d~~~------~~-~~~~~~y~~~~~---------------------------------- 177 (530) + +.++|..++|.++| ||.+. +. -.+.+.+..... T Consensus 150 d~~~~~i~i~~v~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~ 227 (714) T protein:vir:81 150 DPFGPEFKVSTVSRNEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSP 227 (714) T ss_pred CCCCCCeEEEecchhhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccc Confidence 3 56899999999976 44321 11 111111111000 Q ss_pred ----------------cccccccceEEEEEEEcCCceEEEeecCCcccchhhcccc-------ccccccceeeeeecccc Q lcl|NC_011308. 178 ----------------SDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTT-------VNPNPSQHVLAVADGVD 234 (530) Q Consensus 178 ----------------~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~ 234 (530) .......+.+..+++|.............. .....++.. +... ...+........ T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~-g~~~~~d~~~~~~~~~~~~g-~~~~~~~~~~rv 305 (714) T protein:vir:81 228 LMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSN-GRVVAFDKNNLMQAVAVASG-RVQVKVGRVSRI 305 (714) T ss_pred cccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCC-CceEEeCccCHHHHHHHhhc-chhhhccccceE Confidence 000001123444566655444333322211 110000000 0000 000000000000 Q ss_pred cceecccccccccccccccccCCccceEEeeCCc-----CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC Q lcl|NC_011308. 235 EAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT 309 (530) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~ 309 (530) ......+. .....++.|.+.+++|+|||+... ...|.+-.+++.++.+|+..|.....+ .++..++..|.. T Consensus 306 ~~~~~~g~--~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~ 381 (714) T protein:vir:81 306 REAWFVGP--HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDAT 381 (714) T ss_pred EEEEEecC--cccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcc Confidence 00011111 111122334444556666655432 234667788999999999999988876 355556666655 Q ss_pred CCchhhHHHH-HhhCcceecCCCC--------ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccccc-CCcHHHH Q lcl|NC_011308. 310 NSPVDEIKKN-IQSKKIIQTKGEG--------GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDG-NATNVVI 379 (530) Q Consensus 310 ~~~~~~~~~~-~~~~~~i~~~~~~--------~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~g-n~SGvAi 379 (530) .....++... -+.++++.+..+. .++......-..+.-..+......|-..|.+-+..-...| ..||+|+ T Consensus 382 ~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi 461 (714) T protein:vir:81 382 QLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAI 461 (714) T ss_pred cccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHH Confidence 4433344333 2344566654321 1343333334566667788888888888876543322223 3699999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----------cCCCc-cc---c--------------------cee Q lcl|NC_011308. 380 KSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRR----------RGLGD-YS---S--------------------TDI 425 (530) Q Consensus 380 k~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~----------~~~~~-~d---~--------------------~~i 425 (530) ..+-..........-..++.+.+++.+++++++.. .+..+ .. . .+| T Consensus 462 ~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv 541 (714) T protein:vir:81 462 SNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHI 541 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEE Confidence 88877665555555555666666665555554422 11000 00 0 122 Q ss_pred eEEeCCCCCCCHHHHHHHHHHHHhc-----CCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCcc Q lcl|NC_011308. 426 KFDIEPYILANELDLAMIDKTEAET-----NQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPT 500 (530) Q Consensus 426 ~i~f~~~~P~n~~e~a~~~~~~~~~-----g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~ 500 (530) .+.=.+..|.-..+.++.+..+... +.+....+++.+++ .+.++..+++.+. ..+ . T Consensus 542 ~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~-----------~~~-~------ 602 (714) T protein:vir:81 542 ALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAA-----------LGT-P------ 602 (714) T ss_pred EEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHH-----------cCC-C------ Confidence 2333333444333444333333221 12333455666654 4555554444331 000 0 Q ss_pred ccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 501 VTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 501 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) .+ +...++.+.+. ....+..++ T Consensus 603 -~~---~~~~~~e~q~~----~~~~q~~~~ 624 (714) T protein:vir:81 603 -KS---PDEMTPEEQEV----AAQQQALQQ 624 (714) T ss_pred -CC---ccccchhhHHH----HHHHHHHHH Confidence 00 00000100000 000001111 No 85 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.69 E-value=4.7e-15 Score=99.13 Aligned_cols=484 Identities=12% Similarity=0.047 Sum_probs=228.0 Q ss_pred CC-----cccccCCcccHHHHHHHHHHHHHHh-----hhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcce Q lcl|NC_011308. 1 MT-----NTLLTTAPDRLGTILSTKIDEYIRS-----QNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIK 70 (530) Q Consensus 1 ~~-----~~~~~~~~~~~~~~i~~~i~~~~~~-----~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~k 70 (530) |. -.-..+.+.+ -++-.+++..|... .-+..+.+..+||.|.|= ........+....| . T Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw--------~~~~~~~l~~~g~p--~ 69 (714) T protein:vir:27 1 MKNETNTMATKNDNGAT-PRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQL--------PPEVLQVLKDRGQP--M 69 (714) T ss_pred CCcccccccCCCCcchh-HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCC--------CHHHHHHHHhcCCC--c Confidence 21 1111122222 12333333333221 123456788899999871 00111111222333 6 Q ss_pred eecCchhhHHhhhhhhhcccceeeecCC--c--chHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCeEEEEEEecC Q lcl|NC_011308. 71 ISHGFFAELVDQKTQYLLANGIDVKPTD--H--DDQKLCYLIEEYYN-----EEFQSAIQELVEGSTIKGYEGIFARTTS 141 (530) Q Consensus 71 i~~n~~k~Ivd~~~~yl~G~pv~~~~~~--~--~de~~~~~l~~~~~-----~~~~~~~~e~~~~~~~~G~a~~~~y~d~ 141 (530) +.+|..+.+|+..+|+--.+.+.+.... . .+.+..+.|+.++. ++.......+..++.++|.+|.-+|++. T Consensus 70 ~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~ 149 (714) T protein:vir:27 70 TIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS 149 (714) T ss_pred EEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc Confidence 8899999999999999999988765432 2 22346666666553 3566777888899999999998888774 Q ss_pred C---CceEEEEecccceEEEEcCCC------Cc-eeEEEEEEEEee---------------------------------- Q lcl|NC_011308. 142 E---DKLTFQTVDALQLLPVFDDYG------TL-QRIIRFYTEQRY---------------------------------- 177 (530) Q Consensus 142 ~---g~~~~~~~~p~~~~~v~d~~~------~~-~~~~~~y~~~~~---------------------------------- 177 (530) + +.++|..++|.++| ||.+. +. -.+.+.+..... T Consensus 150 d~~~~~i~i~~v~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~ 227 (714) T protein:vir:27 150 DPFGPEFKVSTVSRNEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSP 227 (714) T ss_pred CCCCCCeEEEecchhhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccc Confidence 3 56899999999976 44321 11 111111111000 Q ss_pred ----------------cccccccceEEEEEEEcCCceEEEeecCCcccchhhcccc-------ccccccceeeeeecccc Q lcl|NC_011308. 178 ----------------SDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTT-------VNPNPSQHVLAVADGVD 234 (530) Q Consensus 178 ----------------~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~ 234 (530) .......+.+..+++|.............. .....++.. +... ...+........ T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~-g~~~~~d~~~~~~~~~~~~g-~~~~~~~~~~rv 305 (714) T protein:vir:27 228 LMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSN-GRVVAFDKNNLMQAVAVASG-RVQVKVGRVSRI 305 (714) T ss_pred cccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCC-CceEEeCccCHHHHHHHhhc-chhhhccccceE Confidence 000001123444566655444333322211 110000000 0000 000000000000 Q ss_pred cceecccccccccccccccccCCccceEEeeCCc-----CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC Q lcl|NC_011308. 235 EAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT 309 (530) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~ 309 (530) ......+. .....++.|.+.+++|+|||+... ...|.+-.+++.++.+|+..|.....+ .++..++..|.. T Consensus 306 ~~~~~~g~--~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~ 381 (714) T protein:vir:27 306 REAWFVGP--HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDAT 381 (714) T ss_pred EEEEEecC--cccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcc Confidence 00011111 111122334444556666655432 234667788999999999999988876 355556666655 Q ss_pred CCchhhHHHH-HhhCcceecCCCC--------ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccccc-CCcHHHH Q lcl|NC_011308. 310 NSPVDEIKKN-IQSKKIIQTKGEG--------GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDG-NATNVVI 379 (530) Q Consensus 310 ~~~~~~~~~~-~~~~~~i~~~~~~--------~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~g-n~SGvAi 379 (530) .....++... -+.++++.+..+. .++......-..+.-..+......|-..|.+-+..-...| ..||+|+ T Consensus 382 ~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi 461 (714) T protein:vir:27 382 QLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAI 461 (714) T ss_pred cccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHH Confidence 4433344333 2344566654321 1343333334566667788888888888876543322223 3699999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----------cCCCc-cc---c--------------------cee Q lcl|NC_011308. 380 KSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRR----------RGLGD-YS---S--------------------TDI 425 (530) Q Consensus 380 k~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~----------~~~~~-~d---~--------------------~~i 425 (530) ..+-..........-..++.+.+++.+++++++.. .+..+ .. . .+| T Consensus 462 ~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv 541 (714) T protein:vir:27 462 SNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHI 541 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEE Confidence 88877665555555555666666665555554422 11000 00 0 122 Q ss_pred eEEeCCCCCCCHHHHHHHHHHHHhc-----CCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCcc Q lcl|NC_011308. 426 KFDIEPYILANELDLAMIDKTEAET-----NQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPT 500 (530) Q Consensus 426 ~i~f~~~~P~n~~e~a~~~~~~~~~-----g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~ 500 (530) .+.=.+..|.-..+.++.+..+... +.+....+++.+++ .+.++..+++.+. ..+ . T Consensus 542 ~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~-----------~~~-~------ 602 (714) T protein:vir:27 542 ALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAA-----------LGT-P------ 602 (714) T ss_pred EEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHH-----------cCC-C------ Confidence 2333333444333444333333221 12333455666654 4555554444331 000 0 Q ss_pred ccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 501 VTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 501 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) .+ +...++.+.+. ....+..++ T Consensus 603 -~~---~~~~~~e~q~~----~~~~q~~~~ 624 (714) T protein:vir:27 603 -KS---PDEMTPEEQEV----AAQQQALQQ 624 (714) T ss_pred -CC---ccccchhhHHH----HHHHHHHHH Confidence 00 00000100000 000001111 No 86 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.69 E-value=4.7e-15 Score=99.13 Aligned_cols=484 Identities=12% Similarity=0.047 Sum_probs=228.0 Q ss_pred CC-----cccccCCcccHHHHHHHHHHHHHHh-----hhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcce Q lcl|NC_011308. 1 MT-----NTLLTTAPDRLGTILSTKIDEYIRS-----QNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIK 70 (530) Q Consensus 1 ~~-----~~~~~~~~~~~~~~i~~~i~~~~~~-----~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~k 70 (530) |. -.-..+.+.+ -++-.+++..|... .-+..+.+..+||.|.|= ........+....| . T Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw--------~~~~~~~l~~~g~p--~ 69 (714) T protein:vir:32 1 MKNETNTMATKNDNGAT-PRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQL--------PPEVLQVLKDRGQP--M 69 (714) T ss_pred CCcccccccCCCCcchh-HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCC--------CHHHHHHHHhcCCC--c Confidence 21 1111122222 12333333333221 123456788899999871 00111111222333 6 Q ss_pred eecCchhhHHhhhhhhhcccceeeecCC--c--chHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCeEEEEEEecC Q lcl|NC_011308. 71 ISHGFFAELVDQKTQYLLANGIDVKPTD--H--DDQKLCYLIEEYYN-----EEFQSAIQELVEGSTIKGYEGIFARTTS 141 (530) Q Consensus 71 i~~n~~k~Ivd~~~~yl~G~pv~~~~~~--~--~de~~~~~l~~~~~-----~~~~~~~~e~~~~~~~~G~a~~~~y~d~ 141 (530) +.+|..+.+|+..+|+--.+.+.+.... . .+.+..+.|+.++. ++.......+..++.++|.+|.-+|++. T Consensus 70 ~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~ 149 (714) T protein:vir:32 70 TIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS 149 (714) T ss_pred EEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc Confidence 8899999999999999999988765432 2 22346666666553 3566777888899999999998888774 Q ss_pred C---CceEEEEecccceEEEEcCCC------Cc-eeEEEEEEEEee---------------------------------- Q lcl|NC_011308. 142 E---DKLTFQTVDALQLLPVFDDYG------TL-QRIIRFYTEQRY---------------------------------- 177 (530) Q Consensus 142 ~---g~~~~~~~~p~~~~~v~d~~~------~~-~~~~~~y~~~~~---------------------------------- 177 (530) + +.++|..++|.++| ||.+. +. -.+.+.+..... T Consensus 150 d~~~~~i~i~~v~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~ 227 (714) T protein:vir:32 150 DPFGPEFKVSTVSRNEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSP 227 (714) T ss_pred CCCCCCeEEEecchhhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccc Confidence 3 56899999999976 44321 11 111111111000 Q ss_pred ----------------cccccccceEEEEEEEcCCceEEEeecCCcccchhhcccc-------ccccccceeeeeecccc Q lcl|NC_011308. 178 ----------------SDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTT-------VNPNPSQHVLAVADGVD 234 (530) Q Consensus 178 ----------------~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~ 234 (530) .......+.+..+++|.............. .....++.. +... ...+........ T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~-g~~~~~d~~~~~~~~~~~~g-~~~~~~~~~~rv 305 (714) T protein:vir:32 228 LMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSN-GRVVAFDKNNLMQAVAVASG-RVQVKVGRVSRI 305 (714) T ss_pred cccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCC-CceEEeCccCHHHHHHHhhc-chhhhccccceE Confidence 000001123444566655444333322211 110000000 0000 000000000000 Q ss_pred cceecccccccccccccccccCCccceEEeeCCc-----CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC Q lcl|NC_011308. 235 EAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT 309 (530) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~ 309 (530) ......+. .....++.|.+.+++|+|||+... ...|.+-.+++.++.+|+..|.....+ .++..++..|.. T Consensus 306 ~~~~~~g~--~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~ 381 (714) T protein:vir:32 306 REAWFVGP--HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDAT 381 (714) T ss_pred EEEEEecC--cccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcc Confidence 00011111 111122334444556666655432 234667788999999999999988876 355556666655 Q ss_pred CCchhhHHHH-HhhCcceecCCCC--------ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccccc-CCcHHHH Q lcl|NC_011308. 310 NSPVDEIKKN-IQSKKIIQTKGEG--------GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDG-NATNVVI 379 (530) Q Consensus 310 ~~~~~~~~~~-~~~~~~i~~~~~~--------~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~g-n~SGvAi 379 (530) .....++... -+.++++.+..+. .++......-..+.-..+......|-..|.+-+..-...| ..||+|+ T Consensus 382 ~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi 461 (714) T protein:vir:32 382 QLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAI 461 (714) T ss_pred cccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHH Confidence 4433344333 2344566654321 1343333334566667788888888888876543322223 3699999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----------cCCCc-cc---c--------------------cee Q lcl|NC_011308. 380 KSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRR----------RGLGD-YS---S--------------------TDI 425 (530) Q Consensus 380 k~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~----------~~~~~-~d---~--------------------~~i 425 (530) ..+-..........-..++.+.+++.+++++++.. .+..+ .. . .+| T Consensus 462 ~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv 541 (714) T protein:vir:32 462 SNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHI 541 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEE Confidence 88877665555555555666666665555554422 11000 00 0 122 Q ss_pred eEEeCCCCCCCHHHHHHHHHHHHhc-----CCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCcc Q lcl|NC_011308. 426 KFDIEPYILANELDLAMIDKTEAET-----NQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPT 500 (530) Q Consensus 426 ~i~f~~~~P~n~~e~a~~~~~~~~~-----g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~ 500 (530) .+.=.+..|.-..+.++.+..+... +.+....+++.+++ .+.++..+++.+. ..+ . T Consensus 542 ~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~-----------~~~-~------ 602 (714) T protein:vir:32 542 ALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAA-----------LGT-P------ 602 (714) T ss_pred EEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHH-----------cCC-C------ Confidence 2333333444333444333333221 12333455666654 4555554444331 000 0 Q ss_pred ccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 501 VTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 501 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) .+ +...++.+.+. ....+..++ T Consensus 603 -~~---~~~~~~e~q~~----~~~~q~~~~ 624 (714) T protein:vir:32 603 -KS---PDEMTPEEQEV----AAQQQALQQ 624 (714) T ss_pred -CC---ccccchhhHHH----HHHHHHHHH Confidence 00 00000100000 000001111 No 87 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.69 E-value=4.7e-15 Score=99.13 Aligned_cols=484 Identities=12% Similarity=0.047 Sum_probs=228.0 Q ss_pred CC-----cccccCCcccHHHHHHHHHHHHHHh-----hhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcce Q lcl|NC_011308. 1 MT-----NTLLTTAPDRLGTILSTKIDEYIRS-----QNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIK 70 (530) Q Consensus 1 ~~-----~~~~~~~~~~~~~~i~~~i~~~~~~-----~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~k 70 (530) |. -.-..+.+.+ -++-.+++..|... .-+..+.+..+||.|.|= ........+....| . T Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw--------~~~~~~~l~~~g~p--~ 69 (714) T protein:vir:10 1 MKNETNTMATKNDNGAT-PRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQL--------PPEVLQVLKDRGQP--M 69 (714) T ss_pred CCcccccccCCCCcchh-HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCC--------CHHHHHHHHhcCCC--c Confidence 21 1111122222 12333333333221 123456788899999871 00111111222333 6 Q ss_pred eecCchhhHHhhhhhhhcccceeeecCC--c--chHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCeEEEEEEecC Q lcl|NC_011308. 71 ISHGFFAELVDQKTQYLLANGIDVKPTD--H--DDQKLCYLIEEYYN-----EEFQSAIQELVEGSTIKGYEGIFARTTS 141 (530) Q Consensus 71 i~~n~~k~Ivd~~~~yl~G~pv~~~~~~--~--~de~~~~~l~~~~~-----~~~~~~~~e~~~~~~~~G~a~~~~y~d~ 141 (530) +.+|..+.+|+..+|+--.+.+.+.... . .+.+..+.|+.++. ++.......+..++.++|.+|.-+|++. T Consensus 70 ~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~ 149 (714) T protein:vir:10 70 TIHNLIAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS 149 (714) T ss_pred EEeccHHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc Confidence 8899999999999999999988765432 2 22346666666553 3566777888899999999998888774 Q ss_pred C---CceEEEEecccceEEEEcCCC------Cc-eeEEEEEEEEee---------------------------------- Q lcl|NC_011308. 142 E---DKLTFQTVDALQLLPVFDDYG------TL-QRIIRFYTEQRY---------------------------------- 177 (530) Q Consensus 142 ~---g~~~~~~~~p~~~~~v~d~~~------~~-~~~~~~y~~~~~---------------------------------- 177 (530) + +.++|..++|.++| ||.+. +. -.+.+.+..... T Consensus 150 d~~~~~i~i~~v~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~ 227 (714) T protein:vir:10 150 DPFGPEFKVSTVSRNEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSP 227 (714) T ss_pred CCCCCCeEEEecchhhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccc Confidence 3 56899999999976 44321 11 111111111000 Q ss_pred ----------------cccccccceEEEEEEEcCCceEEEeecCCcccchhhcccc-------ccccccceeeeeecccc Q lcl|NC_011308. 178 ----------------SDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTT-------VNPNPSQHVLAVADGVD 234 (530) Q Consensus 178 ----------------~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~ 234 (530) .......+.+..+++|.............. .....++.. +... ...+........ T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~-g~~~~~d~~~~~~~~~~~~g-~~~~~~~~~~rv 305 (714) T protein:vir:10 228 LMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSN-GRVVAFDKNNLMQAVAVASG-RVQVKVGRVSRI 305 (714) T ss_pred cccchhhhccccccccccccccccEEEEEEEEEEEEEEEEeeccCC-CceEEeCccCHHHHHHHhhc-chhhhccccceE Confidence 000001123444566655444333322211 110000000 0000 000000000000 Q ss_pred cceecccccccccccccccccCCccceEEeeCCc-----CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC Q lcl|NC_011308. 235 EAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT 309 (530) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~ 309 (530) ......+. .....++.|.+.+++|+|||+... ...|.+-.+++.++.+|+..|.....+ .++..++..|.. T Consensus 306 ~~~~~~g~--~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~ 381 (714) T protein:vir:10 306 REAWFVGP--HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDAT 381 (714) T ss_pred EEEEEecC--cccccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcc Confidence 00011111 111122334444556666655432 234667788999999999999988876 355556666655 Q ss_pred CCchhhHHHH-HhhCcceecCCCC--------ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccccc-CCcHHHH Q lcl|NC_011308. 310 NSPVDEIKKN-IQSKKIIQTKGEG--------GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDG-NATNVVI 379 (530) Q Consensus 310 ~~~~~~~~~~-~~~~~~i~~~~~~--------~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~g-n~SGvAi 379 (530) .....++... -+.++++.+..+. .++......-..+.-..+......|-..|.+-+..-...| ..||+|+ T Consensus 382 ~~~d~~~~e~~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi 461 (714) T protein:vir:10 382 QLSDNDLMEQIERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAI 461 (714) T ss_pred cccHHHHHHhccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHH Confidence 4433344333 2344566654321 1343333334566667788888888888876543322223 3699999 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----------cCCCc-cc---c--------------------cee Q lcl|NC_011308. 380 KSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRR----------RGLGD-YS---S--------------------TDI 425 (530) Q Consensus 380 k~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~----------~~~~~-~d---~--------------------~~i 425 (530) ..+-..........-..++.+.+++.+++++++.. .+..+ .. . .+| T Consensus 462 ~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv 541 (714) T protein:vir:10 462 SNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHI 541 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEE Confidence 88877665555555555666666665555554422 11000 00 0 122 Q ss_pred eEEeCCCCCCCHHHHHHHHHHHHhc-----CCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCcc Q lcl|NC_011308. 426 KFDIEPYILANELDLAMIDKTEAET-----NQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPT 500 (530) Q Consensus 426 ~i~f~~~~P~n~~e~a~~~~~~~~~-----g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~ 500 (530) .+.=.+..|.-..+.++.+..+... +.+....+++.+++ .+.++..+++.+. ..+ . T Consensus 542 ~i~~~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~-----------~~~-~------ 602 (714) T protein:vir:10 542 ALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAA-----------LGT-P------ 602 (714) T ss_pred EEeeccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHH-----------cCC-C------ Confidence 2333333444333444333333221 12333455666654 4555554444331 000 0 Q ss_pred ccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 501 VTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 501 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) .+ +...++.+.+. ....+..++ T Consensus 603 -~~---~~~~~~e~q~~----~~~~q~~~~ 624 (714) T protein:vir:10 603 -KS---PDEMTPEEQEV----AAQQQALQQ 624 (714) T ss_pred -CC---ccccchhhHHH----HHHHHHHHH Confidence 00 00000100000 000001111 No 88 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=99.69 E-value=1.8e-15 Score=101.44 Aligned_cols=430 Identities=10% Similarity=0.067 Sum_probs=224.1 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHH-HHHHHhcccc--hhhhcccccccccccccccccCCcce----eec Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLAR-VGQRYYNQDN--DIENTRIMWMNDHGDIVEDDNASNIK----ISH 73 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~-~~~~YY~g~~--~I~~r~~~~~~~~~~~~~~~~~~n~k----i~~ 73 (530) |+ .+..--........+. ....--...++ ....|..--+ +........+......... ..-+.| +-. T Consensus 14 m~---V~~~hp~y~a~~~~W~--~~~d~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~-~y~~~~~~rA~~~ 87 (488) T protein:vir:96 14 ML---TPIYHPDYLVNAPQWL--RNLDCVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEK-DWEDLTWRLANYV 87 (488) T ss_pred ec---ccccCHHHHHHhhhhh--HhhhhhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchh-hhHhhhhhccccC Confidence 44 1111111222222221 00000001111 2234433211 1110000000000000000 000111 235 Q ss_pred CchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHh--hccHHHHHHHHHHHHhhcCeEEEEEEecCCC-------- Q lcl|NC_011308. 74 GFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYY--NEEFQSAIQELVEGSTIKGYEGIFARTTSED-------- 143 (530) Q Consensus 74 n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~--~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g-------- 143 (530) |+++..++..+|++|.+||++...+ ...+..++.++- +++++.....+...+..+|+++.+|=+.+.+ T Consensus 88 n~~~~tl~~l~G~vfrk~p~~~~~~--~~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~T~ade~~ 165 (488) T protein:vir:96 88 NIVNPTMNAITGAVMRREPEFDTMD--NPVLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPESATMADWNK 165 (488) T ss_pred chhHHHHHHhcchhhccCceeccCC--cHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCcCCHHHHHH Confidence 9999999999999999999996432 223445455443 3567788889999999999999999877554 Q ss_pred ---ceEEEEecccceEEEEc--CCCC--ceeEEEEEEEEee-cccccccceEEEEEEEcCCceEEEeecCCcccchhhcc Q lcl|NC_011308. 144 ---KLTFQTVDALQLLPVFD--DYGT--LQRIIRFYTEQRY-SDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLD 215 (530) Q Consensus 144 ---~~~~~~~~p~~~~~v~d--~~~~--~~~~~~~y~~~~~-~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~ 215 (530) .+.+..++|.+++= |+ ..+. ..-.+++...... +..+...+....+-.+++..+..|+...++.... T Consensus 166 ~~~rPy~~~~~a~~Iin-W~~~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~~~l~~g~~~v~~~~~~~~~~e---- 240 (488) T protein:vir:96 166 GKKLPTAAFYDALHIID-WEVEYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLINHRLVDGLCEFQEVTDDEYSDE---- 240 (488) T ss_pred hcCCcEEEEechhhhcC-cceeccCCceeeEEEEEEEEEEeccCCCcccceEEEEEEEECcEEEEEEEecCCcccc---- Confidence 25688888888753 22 1121 1222333332221 2222222333333345555444444433322110 Q ss_pred ccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCc----CCCCcHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 216 TTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK----LGISDIKKVKSIIDDYDLMNCFL 291 (530) Q Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~----~~~sd~e~v~~liDa~~~~~S~~ 291 (530) ........+.|+.|||++|.... .+.+-|.++-.|.-+.=...|+. T Consensus 241 ------------------------------~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~Hy~~ssd~ 290 (488) T protein:vir:96 241 ------------------------------WTPVLINSKQSDTIPFFLASSQSNEWCIDSTPLTSLAEISLSIYVMNAYS 290 (488) T ss_pred ------------------------------eEeecCCCcccCeeEEEEEecCCCCCCCCCCchHHHHHHHHHHHhhhhHH Confidence 01112245789999999996543 24555677777777777778888 Q ss_pred HHHHHHhccceeeeecCCCCchhhHHHHHh------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhc-cc Q lcl|NC_011308. 292 SNNLQDMAEAIYVVRGGTNSPVDEIKKNIQ------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGM-GF 364 (530) Q Consensus 292 ~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~-~p 364 (530) -+.+-....|.|++.+.+.+.. . ..... ..+.......|+++|+....+.- .+..++.|++.+...+. ++ T Consensus 291 ~~il~~~~~p~lv~~~~~~~~~-~-~~~~~~~g~~~~~~~~~~~~~g~~~~~e~~~~~l-~~~~l~~l~~qm~~~Ga~l~ 367 (488) T protein:vir:96 291 NKAMILANEAKWMVDMGDMNKT-M-ASEMNPLGFTLAGRMPYYVKNGDVKVIQAQFSPE-TENKVEKLFEQAVKVGASLF 367 (488) T ss_pred HHHHHhcCCceeeeccCCCCcc-c-ccccccceeeecccccccccCCceeecCCchhHH-HHHHHHHHHHHHHHHhHhhc Confidence 7777767788888754433221 1 11111 11122223467889988765543 36668888888877663 33 Q ss_pred CCCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCC---CCHHHHH Q lcl|NC_011308. 365 NSSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYIL---ANELDLA 441 (530) Q Consensus 365 ~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P---~n~~e~a 441 (530) .. .++-||.+.+.........-...-..+..++.+.++++..+++..... .+...+.+..++... -+.. .+ T Consensus 368 ---~~-~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~A~w~g~~~~~-~~~~~~~~~in~dF~~~~ld~~-~~ 441 (488) T protein:vir:96 368 ---TQ-QSNETATGAAIRSGSSTASMATLGNNVEDTVRNMLRFIMRYFEGTNLY-VNPDELVFKLNRDYFDVEVNPQ-ML 441 (488) T ss_pred ---cC-CCcchHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCC-cCccceEEEeccCCCCccCCHH-HH Confidence 22 245678888876666665556666667777777778888877665432 122334444444322 2333 34 Q ss_pred HHHHHHHhcCCCcHHHHHHhCC---CCC-C--HHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011308. 442 MIDKTEAETNQIQINNLLAIAP---RIG-D--EETLKAICDTLDLDYEDVVKAL 489 (530) Q Consensus 442 ~~~~~~~~~g~iS~et~l~~~~---~vd-d--~~~e~~~~e~e~~e~~~~~~~~ 489 (530) +.+..+..+|.||++|.+..+. .++ | .+++++++++ .-..+ T Consensus 442 ~al~~~~~~G~Is~~t~~~~L~~~gvl~~d~~~e~~~~~ie~-------~g~~~ 488 (488) T protein:vir:96 442 QVAYAAMMEGNLPQVSWFELLKRARVVRGDMSKEEFDEHIAE-------LGFGM 488 (488) T ss_pred HHHHHHHhcCCCCHHHHHHHHHhCCcCCccCCHHHHHHHHhh-------cCCCC Confidence 4566777889999999987763 232 2 2333222222 11111 No 89 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.67 E-value=3.3e-15 Score=99.97 Aligned_cols=501 Identities=11% Similarity=0.040 Sum_probs=226.6 Q ss_pred CC----cccccCCcccHHHHHHHHHHHH----HHh-hhHHHHHHHHHHhcccchhhhcccccccccccccccccCCccee Q lcl|NC_011308. 1 MT----NTLLTTAPDRLGTILSTKIDEY----IRS-QNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKI 71 (530) Q Consensus 1 ~~----~~~~~~~~~~~~~~i~~~i~~~----~~~-~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki 71 (530) || +.+..-.+.....+..+.+..| .++ .-+..+.+-.+||.|.|= ........+....| .+ T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~r~~a~~d~~fy~G~QW--------~~~~~~~l~~~g~p--~~ 72 (772) T protein:vir:10 3 ITENDRQYLNGLPPAGDTPLTVDEYADINYEIEDQPAWRAVADKEMDYADGNQL--------DTELLRRQQALGIP--PA 72 (772) T ss_pred cchhhHHhhccCCcccccccCHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCCC--------CHHHHHHHHhcCCC--cE Confidence 33 2333222222222222322222 221 223456678889999871 00111111223334 68 Q ss_pred ecCchhhHHhhhhhhhcccceeeecC---CcchHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCeEEEEEEecCC- Q lcl|NC_011308. 72 SHGFFAELVDQKTQYLLANGIDVKPT---DHDDQKLCYLIEEYYN-----EEFQSAIQELVEGSTIKGYEGIFARTTSE- 142 (530) Q Consensus 72 ~~n~~k~Ivd~~~~yl~G~pv~~~~~---~~~de~~~~~l~~~~~-----~~~~~~~~e~~~~~~~~G~a~~~~y~d~~- 142 (530) .+|..+.+|+..+|+.-.+.+.+... +.++.+..+.|+.++. ++.......+..++.++|.+|.-++++.+ T Consensus 73 ~~N~i~~~v~~v~g~~~~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d~ 152 (772) T protein:vir:10 73 VEDLIGPALLSLQGYEAVTRTDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRESDP 152 (772) T ss_pred EEcchHHHHHHHHHHHHhcCcceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccccCC Confidence 89999999999999999998876542 3356677777777653 45677778888899999999988887754 Q ss_pred --CceEEEEecccceEEEEcCCCC-----ceeEEEE-EE----------------------------------------- Q lcl|NC_011308. 143 --DKLTFQTVDALQLLPVFDDYGT-----LQRIIRF-YT----------------------------------------- 173 (530) Q Consensus 143 --g~~~~~~~~p~~~~~v~d~~~~-----~~~~~~~-y~----------------------------------------- 173 (530) +.++|..++|.++| ||.+.+ ...+++. |. T Consensus 153 ~~~~i~i~~v~p~~v~--~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (772) T protein:vir:10 153 FKFPYRCRPIRRDEIH--WDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTST 230 (772) T ss_pred CCCCeEEEeeCcccce--ecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCccccccccccccc Confidence 46889999999975 554321 1111111 00 Q ss_pred -------------EEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhcccc-------ccccccceeeeeeccc Q lcl|NC_011308. 174 -------------EQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTT-------VNPNPSQHVLAVADGV 233 (530) Q Consensus 174 -------------~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~ 233 (530) ...........+.+.-+|+|......+++.... ....+.++.. +...... ........ T Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~-~g~~~~~~~~~~~~~~~l~~g~~~-~~~~~~~r 308 (772) T protein:vir:10 231 GLHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSP-DGRVVEYDPNNLAHNIALASGRIS-PKKVTVSR 308 (772) T ss_pred ccccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccC-CCceEeeCcccHHHHHHHhhcccc-hheeeeeE Confidence 000000111124455666666555444433221 1111110000 0000000 00000000 Q ss_pred ccceecccccccccccccccccCCccceEEeeCCc-----CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecC Q lcl|NC_011308. 234 DEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGG 308 (530) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~ 308 (530) .......+ ......+..+.+.+++|+|+|+... ...|.+-.+++.++.+|+..|.....+...+ ++.=+|. T Consensus 309 v~~~~~~g--~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~G~vr~~kd~Qr~~N~~~S~~~~~l~~~~--~~~~~ga 384 (772) T protein:vir:10 309 VRRSYWLG--PHCLHDGPTPYTHRHFPYVPFFGFREDATGIPYGYVRGMKYAQDSLNSGVSKLRWGMSVAR--VERTKGA 384 (772) T ss_pred EEEEEEec--ceeeccCCCCCCCCccceEEEeeeEeccCCcccchhhhhhhHHHHHHHHHHHHHHHHhccc--ccccCCC Confidence 00111111 1112223445566677777766432 2346778899999999999999988875543 3333454 Q ss_pred CCCchhhHHHHH-hhCcceecCCC------CceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccC-CcHHHHH Q lcl|NC_011308. 309 TNSPVDEIKKNI-QSKKIIQTKGE------GGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGN-ATNVVIK 380 (530) Q Consensus 309 ~~~~~~~~~~~~-~~~~~i~~~~~------~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn-~SGvAik 380 (530) .....+++.... +.+.++.+..+ +.+++.....-..+....+......|-.+|.+-+..-..-|| .||+||. T Consensus 385 v~~~d~~~~e~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~na~SGvAi~ 464 (772) T protein:vir:10 385 VAMTDAQFRRQIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKGTATSGIQEQ 464 (772) T ss_pred ccchhHHHHHhccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchhhHHHHH Confidence 443333454443 34566666554 224444433335667777888888898888665432222244 5999998 Q ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-------C--Cc-cccc---------------------e----- Q lcl|NC_011308. 381 SRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRG-------L--GD-YSST---------------------D----- 424 (530) Q Consensus 381 ~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~-------~--~~-~d~~---------------------~----- 424 (530) .+-..........-..++.+.++..+++++++.... + .+ .... + T Consensus 465 ~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~ 544 (772) T protein:vir:10 465 QQIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTR 544 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceeccceeee Confidence 876665555555555566666666666655543211 0 00 0000 0 Q ss_pred eeEEeCCCCCCCHH---HHHHHHHHHHhcCCCcHHHHH-------HhCCCCCCHHHHHHHHHHHHHHHHH-HHHhhhccc Q lcl|NC_011308. 425 IKFDIEPYILANEL---DLAMIDKTEAETNQIQINNLL-------AIAPRIGDEETLKAICDTLDLDYED-VVKALEDQE 493 (530) Q Consensus 425 i~i~f~~~~P~n~~---e~a~~~~~~~~~g~iS~et~l-------~~~~~vdd~~~e~~~~e~e~~e~~~-~~~~~~~~~ 493 (530) ..+.. ...|...+ +.++.+..+ .+.++.+... +.+. ..+.++..+++.+....... .+..-..+. T Consensus 545 yDv~i-~~~p~~~t~r~~~~~~m~ql--~~~~~P~~~~~~~~~~le~~D-~p~~~ei~~~ir~~~~~~~peq~~~~~~q~ 620 (772) T protein:vir:10 545 IKVAL-EDVPSTNSYRGQQLNAMSEA--VKSMPPQYQAAVLPFLVSLMD-VPFKRDVVEAIRAVDQQQTPEQIQQQIDQA 620 (772) T ss_pred EEEEe-eccccchHHHHHHHHHHHHH--HhccChhHHHHHHHHHHhhcC-CCChHHHHHHHHHHhccCChHHHHHHHHHH Confidence 01111 11122211 111111111 1223333322 2221 22333344333332110000 000000000 Q ss_pred cccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 494 VEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 494 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) .. ...... ..+.+-...........-..++ T Consensus 621 ~q---q~~~~~----~~el~~~q~~a~~~~~~A~a~~ 650 (772) T protein:vir:10 621 VQ---DALAKA----GNDIKLRELEIKERKADSEISG 650 (772) T ss_pred HH---HHHHHH----HHHHHHHHHHHHHHHHHHHHHH Confidence 00 000000 0000000000000000000000 No 90 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=99.57 E-value=6.5e-14 Score=92.89 Aligned_cols=484 Identities=11% Similarity=0.015 Sum_probs=211.8 Q ss_pred CCcccccCCcccHHHHHHHHHHHHH------HhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecC Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYI------RSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHG 74 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~------~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n 74 (530) |-.-. ....++.+++++-+-.+.. ..+......+..+||.|+..- . ..+. ..++..+ T Consensus 1 ~~k~~-~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~----~----------~~~~--~s~~~~~ 63 (705) T protein:vir:88 1 MAKRR-KIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFG----N----------ERPG--KSGIVSR 63 (705) T ss_pred CCccc-ccccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCC----c----------ccCC--CCccccH Confidence 32221 1234555554443333322 112223445666899997420 0 0111 2366677 Q ss_pred chhhHHhhhhhhhc----c--cceeeecCCcchHHHHHHHHHHhh------ccHHHHHHHHHHHHhhcCeEEEEEEecCC Q lcl|NC_011308. 75 FFAELVDQKTQYLL----A--NGIDVKPTDHDDQKLCYLIEEYYN------EEFQSAIQELVEGSTIKGYEGIFARTTSE 142 (530) Q Consensus 75 ~~k~Ivd~~~~yl~----G--~pv~~~~~~~~de~~~~~l~~~~~------~~~~~~~~e~~~~~~~~G~a~~~~y~d~~ 142 (530) .....|+...++|+ | +.+.|.+...+|....+.++.+++ ++....++...+++.++|.++.-+|++.. T Consensus 64 ~v~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~ 143 (705) T protein:vir:88 64 DVQETVDWIMPSLMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEV 143 (705) T ss_pred HHHHHHHHHHHHHHHhhcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccc Confidence 77777777777654 3 345676666666666666555542 23456677899999999999988877432 Q ss_pred ------------------------------------------------CceEEEEecccceEEEEcCCC-C-ce-eEEEE Q lcl|NC_011308. 143 ------------------------------------------------DKLTFQTVDALQLLPVFDDYG-T-LQ-RIIRF 171 (530) Q Consensus 143 ------------------------------------------------g~~~~~~~~p~~~~~v~d~~~-~-~~-~~~~~ 171 (530) |.+++..|+|+++++=.+... . .. .+.++ T Consensus 144 ~~~~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~~~d~~~~~~~~ 223 (705) T protein:vir:88 144 LKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHRE 223 (705) T ss_pred cchhhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCCcccCcEEEEEE Confidence 678888999998874222111 1 11 11222 Q ss_pred EEEEeec--cc-cc-ccceEEEEEEEcCCceE-EEeecCCcccchhhc-----ccccccccccee-eeee-cccc----c Q lcl|NC_011308. 172 YTEQRYS--DA-DN-KFNSIGHADVWTDTEVW-YYVQKDEGRSDEYVL-----DTTVNPNPSQHV-LAVA-DGVD----E 235 (530) Q Consensus 172 y~~~~~~--~~-~~-~~~~~~~~evyt~~~~~-~y~~~~~~~~~~~~~-----~~~~~~~~~~~~-~~~~-~~~~----~ 235 (530) +.....- .+ +. ....+...+..+.+... .+............. .........+.. ..+. .+.. . T Consensus 224 ~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~~ 303 (705) T protein:vir:88 224 KYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELR 303 (705) T ss_pred eccHHHHHhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEecccCCcceeeE Confidence 2210000 00 00 00000000000000000 000000000000000 000000000000 0000 0000 0 Q ss_pred ceecccccccccccccccccCCccceEE-----eeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeee-ecCC Q lcl|NC_011308. 236 AILDEGVEEHEGRQVLGRSYKSRFPFDI-----LYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVV-RGGT 309 (530) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~iPiv~-----~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl-~g~~ 309 (530) .....+ .+....-++|++|++. .+..-.|.|.++.+.++++.+|.+.|.+.+.+...++|.+.+ .|.. T Consensus 304 ~~~~~g------~~il~~~~~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~v 377 (705) T protein:vir:88 304 RILYVG------DYIISNEPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQV 377 (705) T ss_pred EEEEeC------ccccccccCCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceecccccc Confidence 000011 0111122456666664 444557899999999999999999999999999989876655 3432 Q ss_pred CCchhhHHHHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcc----cc-cCCcHHHHHHHHh Q lcl|NC_011308. 310 NSPVDEIKKNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAV----GD-GNATNVVIKSRYT 384 (530) Q Consensus 310 ~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~----~~-gn~SGvAik~~~~ 384 (530) .. .+.. ..+.++++.+..++.+.++..+.-..+....++.+...|...|.+++.+-. .. ++.|+.|+..+.. T Consensus 378 -~~-~d~~-~~~pg~vv~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~ 454 (705) T protein:vir:88 378 -NL-EDLL-TNEAAGIVRVKSMNSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMT 454 (705) T ss_pred -Cc-cccc-ccCCCeeEEecCCCccccccCCcCcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHH Confidence 11 1111 234566777777777888877766677788899999999999999976432 22 3457778888887 Q ss_pred hHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCC--------cc---cc------ceeeEEeCCCCCCCHHHHHHHHHH Q lcl|NC_011308. 385 LLAMKAQKTEIALRK-TLRWTADLVVEDIRRRGLG--------DY---SS------TDIKFDIEPYILANELDLAMIDKT 446 (530) Q Consensus 385 ~l~~ka~~ke~~f~~-~l~~~~~~i~~~l~~~~~~--------~~---d~------~~i~i~f~~~~P~n~~e~a~~~~~ 446 (530) ..........+.|.+ +++++++++..++...... .+ +. .++.+.-......-+...+++... T Consensus 455 ~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~l 534 (705) T protein:vir:88 455 AAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRI 534 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhccCCceEEeeccccchHHHHHHHHHHH Confidence 777777777777753 5565666665554332211 01 00 011222111110111112222111 Q ss_pred HHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHH-HHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCccccc Q lcl|NC_011308. 447 EAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDY-EDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEE 525 (530) Q Consensus 447 ~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (530) +.....+.. .-...++++ +....+.+.+..... ......+..+ +......+.+. +.... T Consensus 535 l~~~q~l~~--~~~~~~~~~-~~~~~~~~~el~e~~~~k~~~~~~~~--------------~~~~e~~~~~~---~~~q~ 594 (705) T protein:vir:88 535 WEMAQAVVG--GGGLGVLVS-EQNLYNILKEVTENAGYKDPDRFWTN--------------PNSPEALQAKA---IREQK 594 (705) T ss_pred HHHHHHhhc--ccchhhhcC-hHHHHHHHHHHHHhhhhhhHHHHhhh--------------hhhHHHHHHHH---hhhhh Confidence 110000000 000111111 110000000000000 0000000000 00000000000 00000 Q ss_pred ccCCC Q lcl|NC_011308. 526 EPVQE 530 (530) Q Consensus 526 ~~~~~ 530 (530) ++.++ T Consensus 595 e~~~~ 599 (705) T protein:vir:88 595 EAQPK 599 (705) T ss_pred hhhHH Confidence 00000 No 91 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.55 E-value=4.3e-14 Score=93.89 Aligned_cols=427 Identities=12% Similarity=0.061 Sum_probs=200.9 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcc----c-chhhhcccccccccccccccccCCcceeecCc Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQ----D-NDIENTRIMWMNDHGDIVEDDNASNIKISHGF 75 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g----~-~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~ 75 (530) |.. +=+....+ ...+. ..+..|..| . ++-..... +.. .....-......=-.+.+ T Consensus 1 ~~~------------~~~a~~~~-~~~~a----~~~~~~~~~~g~~~~~d~~~~~~--~~~-~~~~~~~~l~~lY~~~~l 60 (461) T protein:vir:80 1 MYS------------IDKAKQAK-IDSKI----VNRNDFMVGHGKANSRDKLTRQT--PGN-GQKLDLKACENLYASNSI 60 (461) T ss_pred Ccc------------chhhhhhh-hhhhh----hhhhHHHhhcCCcchhhhhhccc--cCc-ccccCHHHHHHHHHhCCc Confidence 110 00000000 00000 111122111 1 11111000 000 000000000000014688 Q ss_pred hhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccc Q lcl|NC_011308. 76 FAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQ 154 (530) Q Consensus 76 ~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~ 154 (530) ++.+|+..+..++-+++.+++.+ ++..+.|+..++ -+....+.++.+.+..+|.|+.++-..+.+. .+|.. T Consensus 61 ~r~iVd~~a~d~~r~g~~i~~~~---~~~~~~~~~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~-----~~~~~ 132 (461) T protein:vir:80 61 AMNIVDIISEDMVRAGWSLKTDN---KEMKKNIESKWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNR-----EQADL 132 (461) T ss_pred cchhhccchHHhhcCCeeeecCC---HHHHHHHHHHHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCc-----cccCc Confidence 89999999999999999997754 333444555543 3567788999999999999998887643221 11111 Q ss_pred eEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccc Q lcl|NC_011308. 155 LLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVD 234 (530) Q Consensus 155 ~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (530) .-||......-...+..|............. ..--.++.+ ..|...+.+......... T Consensus 133 ~~pl~~~~~~~~~~l~~~~~~~i~~~~~~~d-p~sp~fg~P---~~y~i~~~~~~~~~~~~~------------------ 190 (461) T protein:vir:80 133 STAIDPKTIKSIPYINTFNTQKVTQLYLNQD-MFSEHFGEV---EFFEVNRVSQLGEEILSG------------------ 190 (461) T ss_pred cCCcccccccceeEEEeccccccchhhhccc-CcCcccccc---eEEEEecccccccccccc------------------ Confidence 2222222211111111111100000000000 000000011 111111110000000000 Q ss_pred cceecccccccccccccccccCCccceEEeeCC-----cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC Q lcl|NC_011308. 235 EAILDEGVEEHEGRQVLGRSYKSRFPFDILYNN-----KLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT 309 (530) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn-----~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~ 309 (530) ......-.+..-++++|.+. -.|.|.++.+...+.+|+.++-..+..+..+..+.+.+.|.. T Consensus 191 -------------~~~~~~~~iH~SRii~~~~~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~ 257 (461) T protein:vir:80 191 -------------TTASTSEQIHRSRIIHEQGLRFEGETKGRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDID 257 (461) T ss_pred -------------ccCccceEEccccEEEecCCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHH Confidence 00000011222344555443 248999999999999999999999998888888888777642 Q ss_pred C---CchhhHHHHH----hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC---CcccccCCcHHH- Q lcl|NC_011308. 310 N---SPVDEIKKNI----QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS---SAVGDGNATNVV- 378 (530) Q Consensus 310 ~---~~~~~~~~~~----~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~---~~~~~gn~SGvA- 378 (530) . +...+....+ ...+++.++.+.+++.++ .+.......++.+...|...+.+|-. +....|++||.. T Consensus 258 ~~~~~~~~~~~~~~~~~~~~~g~~~~d~~e~~e~~~--~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge~D 335 (461) T protein:vir:80 258 ALNKDDKANLTAMLDFMFRTEALAIIKGDEQLTKES--TNVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQYD 335 (461) T ss_pred hhhchHHHHHHHHHHHhcCCceEEEEcCCcceEEEe--cCcCCHHHHHHHHHHHHhhhhcCCeeeeecccCCccccchHH Confidence 2 1111212112 133455566666655444 66678889999999999999999963 222235677764 Q ss_pred HHHHHhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhc-CCCccccceeeEEeCCCCCCCHHHHHHH-------HHHHHh Q lcl|NC_011308. 379 IKSRYTLLAMKAQKTE-IALRKTLRWTADLVVEDIRRR-GLGDYSSTDIKFDIEPYILANELDLAMI-------DKTEAE 449 (530) Q Consensus 379 ik~~~~~l~~ka~~ke-~~f~~~l~~~~~~i~~~l~~~-~~~~~d~~~i~i~f~~~~P~n~~e~a~~-------~~~~~~ 449 (530) ++.-+ ..+..++ ..++..|++++.++..-.... ...+.+..++++.|++-.+-+++|.|++ .++..+ T Consensus 336 ~~~yy----d~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~ 411 (461) T protein:vir:80 336 VMNYY----ARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIV 411 (461) T ss_pred HHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh Confidence 43322 3343333 567888998888775532222 2334556789999999999999998776 444555 Q ss_pred cCCCcHHHHHHhCC-C--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCc Q lcl|NC_011308. 450 TNQIQINNLLAIAP-R--IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDP 521 (530) Q Consensus 450 ~g~iS~et~l~~~~-~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (530) +|++|.+++.+.+- . .++ .....+.+++.++......+..+++ +.++ T Consensus 412 ~g~is~~e~r~~l~~~~~~~~-----------------------~~~~~~~~~~~~~~~~~~~~~~~~e--~~~g 461 (461) T protein:vir:80 412 NGVLDPDEVKETRFGRFGLEN-----------------------SSKFSGDSAEIDKLAKLVYDAYAKK--NADG 461 (461) T ss_pred cCCCCHHHHHHHHHHhcCCCC-----------------------CccCCCCCchhhhhhhhcccccccc--CCCC Confidence 66666666644220 0 000 0000111111111100011111111 1111 No 92 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=99.41 E-value=2.7e-11 Score=78.50 Aligned_cols=464 Identities=12% Similarity=0.105 Sum_probs=204.1 Q ss_pred CCcccccCCc----ccHHHHHHHHHHHHHHhhhH---HHH---------HHHHHHhcccchhhhcccccccccccccccc Q lcl|NC_011308. 1 MTNTLLTTAP----DRLGTILSTKIDEYIRSQNV---SLA---------RVGQRYYNQDNDIENTRIMWMNDHGDIVEDD 64 (530) Q Consensus 1 ~~~~~~~~~~----~~~~~~i~~~i~~~~~~~~~---~~~---------~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~ 64 (530) +|.+-..-+. +.+...+.+..+++.+.... +|. .++.+||.|... + .. ...+.. T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~---~--~~-----~~~~~~ 74 (651) T protein:vir:80 5 TTTTDKNRQTYDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVL---R--SV-----GDVNAD 74 (651) T ss_pred ccccchhhhhhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccc---c--cc-----CCCCCC Confidence 3333322222 22333444444444432211 111 133455555321 0 00 000111 Q ss_pred cCCcceeecCchhhHHhhhhhhhccc----ceeee--cCCcch--HHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcC Q lcl|NC_011308. 65 NASNIKISHGFFAELVDQKTQYLLAN----GIDVK--PTDHDD--QKLCYLIEEYYN-----EEFQSAIQELVEGSTIKG 131 (530) Q Consensus 65 ~~~n~ki~~n~~k~Ivd~~~~yl~G~----pv~~~--~~~~~d--e~~~~~l~~~~~-----~~~~~~~~e~~~~~~~~G 131 (530) .+ +++..+.....|+..++.|+.. +--|. ...+.+ ....+.++.++. .+|......+..++.++| T Consensus 75 ~r--s~~~~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G 152 (651) T protein:vir:80 75 WR--HKITTGKAFEAIETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITG 152 (651) T ss_pred CC--ccccChhHHHHHHHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccC Confidence 12 4788888888888877776652 22233 222222 334455666653 246666677889999999 Q ss_pred eEEEEEEecC-------------------------------CCceEEEEecccceEEEEcCCCC----ceeEEEEEEEEe Q lcl|NC_011308. 132 YEGIFARTTS-------------------------------EDKLTFQTVDALQLLPVFDDYGT----LQRIIRFYTEQR 176 (530) Q Consensus 132 ~a~~~~y~d~-------------------------------~g~~~~~~~~p~~~~~v~d~~~~----~~~~~~~y~~~~ 176 (530) .++..+|++. .|.+++..|||.++|+ |.+.. ...+++.+.... T Consensus 153 ~~i~kv~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~--dp~a~~~~d~~~v~~~~~t~~ 230 (651) T protein:vir:80 153 NSVLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFY--DPNVTDPNRGAFIRKLTKTKA 230 (651) T ss_pred ceEEEEeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeee--cCCCcCccccceeeeeeeeHH Confidence 9998888752 2567899999999875 44321 122223222100 Q ss_pred e------ccccccc--------------------------------ceEEEEEEEcCCceEEEeecCCcccchhhccccc Q lcl|NC_011308. 177 Y------SDADNKF--------------------------------NSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTV 218 (530) Q Consensus 177 ~------~~~~~~~--------------------------------~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~ 218 (530) . .+..... .....++||+ +|... .. T Consensus 231 ~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E-----~~~~~------------d~ 293 (651) T protein:vir:80 231 DILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLE-----YWGDI------------HL 293 (651) T ss_pred HHHHHHhcccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEE-----EEEEe------------ec Confidence 0 0000000 0000011111 00000 00 Q ss_pred cccccceeeeeecccccceeccccccccccccccccc-CCccceEEee-----CCcCCCCcHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 219 NPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSY-KSRFPFDILY-----NNKLGISDIKKVKSIIDDYDLMNCFLS 292 (530) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~iPiv~~~-----nn~~~~sd~e~v~~liDa~~~~~S~~~ 292 (530) .......+..+ ..+... .....++ +...|++.++ ...+|.|..+.+.+.+..+|.+.+... T Consensus 294 e~~~~~~~~v~---------~~g~~i----l~~~~~~~~~~~Pf~~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~l 360 (651) T protein:vir:80 294 ENKTYHDVVVT---------IMGNEV----LRFEQNPYWCGRPFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRL 360 (651) T ss_pred cCCceEEEEEE---------EcCcEE----ecccccCCCCCCCeeeecceecCccccCCChHHHHhHHHHHHHHHHHHHH Confidence 00000000000 000000 0001121 2334665544 345799999999999999999999999 Q ss_pred HHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEecC-CHHHHHHHHHHHHHHHHHHhcccCCCc--- Q lcl|NC_011308. 293 NNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVDI-PYEARKAKMDIDELNIYRSGMGFNSSA--- 368 (530) Q Consensus 293 n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~~-~~~~~e~~ld~L~~~I~~~s~~p~~~~--- 368 (530) +.+.-.++|.+.+........+++. ...++++.++..+++..+.... +.......++.|...+-..+.++++.- T Consensus 361 d~~~~~~~~~~~v~~d~~~~~~~l~--~~pg~vi~~~~~~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~ 438 (651) T protein:vir:80 361 DNLELAIDQMYTLRSDGLLQPEDVY--TEPGKVFLVSDHGDLQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGANA 438 (651) T ss_pred HHHHHHhCCcEEecCCccccHHHhh--cCCCceEEecCCCCceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCCC Confidence 9999999999877543333333332 3567788888889998887643 455667789999999999999887532 Q ss_pred -ccccCCcHHHHHHHHhhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcCC----C-------------ccccceeeEEe Q lcl|NC_011308. 369 -VGDGNATNVVIKSRYTLLAMKAQKTEIALRK-TLRWTADLVVEDIRRRGL----G-------------DYSSTDIKFDI 429 (530) Q Consensus 369 -~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~-~l~~~~~~i~~~l~~~~~----~-------------~~d~~~i~i~f 429 (530) ...++.++.++..+...+...-...-+.|.. +++.+++.++.++...+. . ..+..+++..| T Consensus 439 ~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~ 518 (651) T protein:vir:80 439 ARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEV 518 (651) T ss_pred ccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeee Confidence 2234445556666555555444444444433 344444433333322111 0 01112333333 Q ss_pred CCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCC Q lcl|NC_011308. 430 EPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLT 509 (530) Q Consensus 430 ~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (530) .- .+.......+-.+.. ..+++.-..+...|.+.... ...++ ..+.+....-+.. +... . T Consensus 519 ~i-v~~g~~~~~~r~~~~--~~l~~~~q~~~~~p~~~~~~-~~~~~------~~~l~~~~g~~~~-------~~~l---~ 578 (651) T protein:vir:80 519 RL-VPIGSDHVIERKQYI--EDRLTFIQAVAQVPEMGQLV-DYKRI------LVDLLQHWGFEEP-------EAYL---K 578 (651) T ss_pred ee-eeccHHHHHHHHHHH--HHHHHHHHhhccCCccchhh-hHHHH------HHHHHHHcCCCCc-------HHhc---C Confidence 21 122221111100000 00111111222222222111 00000 0111111111110 0000 0 Q ss_pred CCCccCcCCCCcccccccCCC Q lcl|NC_011308. 510 IEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 510 ~~~~~~~~~~~~~~~~~~~~~ 530 (530) .++++.+.+.++.-..++-.. T Consensus 579 ~~~q~~~~~~~~~~~~q~~~~ 599 (651) T protein:vir:80 579 QQDQQAPANPQEALLSQAKDV 599 (651) T ss_pred CCccchhhhhhHHHHhhHHHH Confidence 111111111111111111110 No 93 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.39 E-value=1.6e-12 Score=85.31 Aligned_cols=404 Identities=13% Similarity=0.131 Sum_probs=192.4 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |...+-+..+....-. .+.. ....|+-+.+..... .. ..=-.+.+++.+|+..+.-++-+ T Consensus 1 ~~~~D~~~~~~~~~g~-~~~~---~~~~~~~~~~~~~~~-----------l~-----a~Y~~~~l~~~~vd~~a~d~~r~ 60 (437) T protein:vir:52 1 MKFFDGIKSLALKLGS-KQEQ---TYYSPSLSLTDDLVQ-----------LE-----ALWRDNWIANKVCIKRPEDMVRN 60 (437) T ss_pred CchhhhhHhHHhcCCC-cccc---ceeecCccccccHHH-----------HH-----HHHHhCchhhHHhhcchHHhhcC Confidence 2222222221111000 0000 000011110000000 00 00013688999999999999999 Q ss_pred ceeeecCCcchHHHHHHHHHHhhc-cHHHHHHHHHHHHhhcCeEEEEEEecCC---------CceE-EEEecccceEEEE Q lcl|NC_011308. 91 GIDVKPTDHDDQKLCYLIEEYYNE-EFQSAIQELVEGSTIKGYEGIFARTTSE---------DKLT-FQTVDALQLLPVF 159 (530) Q Consensus 91 pv~~~~~~~~de~~~~~l~~~~~~-~~~~~~~e~~~~~~~~G~a~~~~y~d~~---------g~~~-~~~~~p~~~~~v~ 159 (530) ++.+++.+.+ ++..+.++..++. ++...+.++.+.+..+|.|+.++-.|.. |.++ +.+++|.++.|+. T Consensus 61 ~~~i~~~d~~-~~~~~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~ 139 (437) T protein:vir:52 61 WREIYSNDLN-SKQLDLFTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTG 139 (437) T ss_pred CceEecCCCC-HHHHHHHHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhccccc Confidence 9999875433 3334455555543 6778889999999999999988877653 2332 5566666655432 Q ss_pred cCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceec Q lcl|NC_011308. 160 DDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILD 239 (530) Q Consensus 160 d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (530) -...+...- .|+... .|...+++.. ..+......|+.. T Consensus 140 ~~~~dp~s~-~fg~p~------------------------~y~v~~~~~~------~~iH~SRii~~~~----------- 177 (437) T protein:vir:52 140 TKDDDVLSP-NFGRYS------------------------EYSILGGSQS------ITVHHSRLIILNA----------- 177 (437) T ss_pred ccccccccc-ccCcce------------------------EEEEecCCcc------eeEccceeEEecC----------- Confidence 111110000 000000 0100000000 0000000000000 Q ss_pred ccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC--C--Cchhh Q lcl|NC_011308. 240 EGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT--N--SPVDE 315 (530) Q Consensus 240 ~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~--~--~~~~~ 315 (530) ..+| ...++-.|.|.++.+..-|.+++.+.-..+..+..+..+.+.++|.. . ..... T Consensus 178 -----------------~~~~--~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~~~~~~ 238 (437) T protein:vir:52 178 -----------------NDAP--LSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKIDIFKIAGLSDKIAAGMENE 238 (437) T ss_pred -----------------ccCC--CccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCCceecchHHHHhcCCcHHH Confidence 0011 01123358999999999999999999998888888888888877631 1 11111 Q ss_pred HHH------HHh-hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc---cCCcHHHHHHHHhh Q lcl|NC_011308. 316 IKK------NIQ-SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD---GNATNVVIKSRYTL 385 (530) Q Consensus 316 ~~~------~~~-~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~---gn~SGvAik~~~~~ 385 (530) ... .++ ..+++.++.+.+++.+ +.+.......++...+.|...+.+|-.--.+. |=+||..=..-|.. T Consensus 239 ~~~~~~~~~~~~~~~~~~~~d~~~~~e~~--~~~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D~~~yyd 316 (437) T protein:vir:52 239 VASVISAVQEIKSATNSLLLDAENEYDRK--ELTFTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGLASGDEDIQNYHE 316 (437) T ss_pred HHHHHHHHHHhcCCCceEEEcCCcceEEE--ecCcCCHHHHHHHHHHHHHHHhcCchhhhcCcCcccccccHHHHHHHHH Confidence 111 112 2455666666555555 45666788889999999999999995321111 22456533333332 Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHH-------HHhcCCCcHHH Q lcl|NC_011308. 386 LAMKAQ-KTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKT-------EAETNQIQINN 457 (530) Q Consensus 386 l~~ka~-~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~-------~~~~g~iS~et 457 (530) .+. ..+..++..+++++.+|..- ..+ ..+ .++++.|++-..-+++|.|++..+ ..++|++|.+. T Consensus 317 ---~i~~~Qe~~l~p~le~l~~~i~~~--~~g--~~~-~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i~~~e 388 (437) T protein:vir:52 317 ---AIRRLQETRLRPIFEIIDPLICNE--LFG--GLP-ADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVLNEYQ 388 (437) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHH--hcC--CCC-CcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCCCHHH Confidence 233 33467888888888876432 112 222 268899998888888887776433 33444444443 Q ss_pred HHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 458 LLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 458 ~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) +.+.| .+ ......+.....++..+. ++. ....++|.+..+...++|-.+ T Consensus 389 ~r~~L-------------~~-----~g~~~~i~~~~~~~~~~~-~~~-----~~~~~~~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 389 IANEL-------------RE-----SGLFANISAEHIEELKNA-DEF-----AGNFEEPEKMEGAQVQNSEDQ 437 (437) T ss_pred HHHHH-------------Hh-----cCCCCCCCccccccccCC-CCC-----CCccCCCCCCCCCCCCCCCCC Confidence 33321 11 000011111100000000 000 001111111112222222222 No 94 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=99.36 E-value=6.7e-12 Score=81.84 Aligned_cols=467 Identities=10% Similarity=0.019 Sum_probs=231.6 Q ss_pred CCcccccCC----cccHHHHHHHHHHHHHHhhhHHHH--HHHHHHhcccchhhhcccccccccccccccccCCcceeecC Q lcl|NC_011308. 1 MTNTLLTTA----PDRLGTILSTKIDEYIRSQNVSLA--RVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHG 74 (530) Q Consensus 1 ~~~~~~~~~----~~~~~~~i~~~i~~~~~~~~~~~~--~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n 74 (530) |+....+-. ++++...+.+....+.+..+.... .++++||.+...- +.... .... -++|-.| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~-----~~~~~-----~~~~--r~~~~~~ 68 (584) T protein:vir:95 1 MSVKVAELNSLLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTT-----TTSNQ-----GLPW--KNSTTLP 68 (584) T ss_pred CCcchhhhhhhccccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhh-----hhhhc-----cccc--ccccchh Confidence 443332221 344555666666666554433333 7899999885531 11111 1112 2478888 Q ss_pred chhhHHhhhhhhhcccce------eeecCCcc--hHHHHHHHHHHhhc-----cHHHHHHHHHHHHhhcCeEEEEEEecC Q lcl|NC_011308. 75 FFAELVDQKTQYLLANGI------DVKPTDHD--DQKLCYLIEEYYNE-----EFQSAIQELVEGSTIKGYEGIFARTTS 141 (530) Q Consensus 75 ~~k~Ivd~~~~yl~G~pv------~~~~~~~~--de~~~~~l~~~~~~-----~~~~~~~e~~~~~~~~G~a~~~~y~d~ 141 (530) .+.-+++..+++|++-=. .+.....+ +....++++.++.+ ++.....++..++.++|.|+..+++.. T Consensus 69 k~~~~~~~i~~~l~~~~Fp~~~w~~~v~~~~~~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~ 148 (584) T protein:vir:95 69 KLCQIRDNLHSNYFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEA 148 (584) T ss_pred HHHHHHHHHHHHHHHhhcCccceeeeecCCCchhhHHHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEee Confidence 888888888888765321 12111112 23336777777643 567788999999999999998887653 Q ss_pred C-------------CceEEEEecccceEEEEcCCC---CceeE-EEEEEEEeec-------------------------- Q lcl|NC_011308. 142 E-------------DKLTFQTVDALQLLPVFDDYG---TLQRI-IRFYTEQRYS-------------------------- 178 (530) Q Consensus 142 ~-------------g~~~~~~~~p~~~~~v~d~~~---~~~~~-~~~y~~~~~~-------------------------- 178 (530) . .++++..++|.++| ||.+. +-.++ +|.+.....- T Consensus 149 ~~~e~~e~~~v~~~~~prieriSP~d~~--~Dpsa~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~ 226 (584) T protein:vir:95 149 KYKEMTDGTLVPDYIGPRLVRISPLDIV--FNPLATSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEICRH 226 (584) T ss_pred cceeeeccccccccccceEEeeChhhee--ecCCCCCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhccC Confidence 3 25889999999998 56543 22222 2222110000 Q ss_pred ccccccceEEEEEEEcCCc-eEEEeecCCcccchhhc-----cccccccccceeeeeecccccceecccccccccccccc Q lcl|NC_011308. 179 DADNKFNSIGHADVWTDTE-VWYYVQKDEGRSDEYVL-----DTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLG 252 (530) Q Consensus 179 ~~~~~~~~~~~~evyt~~~-~~~y~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 252 (530) ............+-++.++ ...|.....+....+.. +.........++..+ .......+.... T Consensus 227 ~~~~~~~~~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v-----------~~g~~iIR~~~n 295 (584) T protein:vir:95 227 LGGYSVEDFDKAAGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITV-----------VDRSTEVRNESI 295 (584) T ss_pred CCCCcccccccccccccccccccccccCCceeEEEeecccccccccCCCcccceEEE-----------EeccEEEEeeec Confidence 0000000000000000000 00111111111110000 000000111111111 111222334456 Q ss_pred cccCCccceEEeeC-----CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCccee Q lcl|NC_011308. 253 RSYKSRFPFDILYN-----NKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQ 327 (530) Q Consensus 253 ~~~~~~iPiv~~~n-----n~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~ 327 (530) +.+++.+|++.... .-.|.|..+-+.++++.+|.+.-.+.|.+..+.+|.++..+...+ + ..+.+..+. T Consensus 296 p~~~~~~PF~~~~~~p~~~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k~~~~~~~----~--~~~pg~~~~ 369 (584) T protein:vir:95 296 PTWFGSAPIYHVGWRFRPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQPPLKIIGEVEE----F--VWGPGAEIH 369 (584) T ss_pred CCCCCCCCEEEEcceeeeccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCcceeeccccch----h--cccCCceee Confidence 77889999987554 337999999999999999999999999999999997776654322 1 234667888 Q ss_pred cCCCCceeEEEecC-CHHHHHHHHHHHHHHHHHHhcccCCCcc--cccCCcHHHHHHHHhhHHHHHHHHHHHHHHHH-HH Q lcl|NC_011308. 328 TKGEGGLDIQTVDI-PYEARKAKMDIDELNIYRSGMGFNSSAV--GDGNATNVVIKSRYTLLAMKAQKTEIALRKTL-RW 403 (530) Q Consensus 328 ~~~~~~~~~lt~~~-~~~~~e~~ld~L~~~I~~~s~~p~~~~~--~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l-~~ 403 (530) ++..|+++++.++. +..+....+..+...+-..|.+|..+-. ..++.+...+.+++..+-.-...+.+.|..+| ++ T Consensus 370 ~~~~~~~q~~~p~a~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~ 449 (584) T protein:vir:95 370 LDQGGDVQEIAKNVNYIINADNQIQMLEDRMELYAGAPREAMGIRTPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEP 449 (584) T ss_pred cCCCCCcceecCchhhhhHHHHHHHHHHHHHHhhhCCChhhcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88899999998874 4455556688888888899999976432 22333444467777777777778888888887 77 Q ss_pred HHHHHHHHHHhcC-CCc----------------cccceeeEEeC--CCC----CCCHHHHHHHHHHHH-h-----cCCCc Q lcl|NC_011308. 404 TADLVVEDIRRRG-LGD----------------YSSTDIKFDIE--PYI----LANELDLAMIDKTEA-E-----TNQIQ 454 (530) Q Consensus 404 ~~~~i~~~l~~~~-~~~----------------~d~~~i~i~f~--~~~----P~n~~e~a~~~~~~~-~-----~g~iS 454 (530) ++.++.++-.... ..+ ....+++-.|. ..- -..+...++....+. . .+-++ T Consensus 450 l~~ll~~~~~~nmd~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~~~~~~i~p~~~ 529 (584) T protein:vir:95 450 VLNAMLETATRNMDGSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNSQIGQMILPHTS 529 (584) T ss_pred HHHHHHHHHHhhccccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHhhhhhhccccch Confidence 6777766532211 000 00011221111 000 000111111111111 1 11123 Q ss_pred HHHHHH------hCC---CCC-CHH-HHHHHHHHHHHHHHHHHHhhhccccccCCccccCCC Q lcl|NC_011308. 455 INNLLA------IAP---RIG-DEE-TLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPII 505 (530) Q Consensus 455 ~et~l~------~~~---~vd-d~~-~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~ 505 (530) +..... .+| +.. ++. ++..+.+....+.++.. .++.+.+ +++.. T Consensus 530 ~~~l~~~ladl~~~p~~~~~~~~~~~~~Q~~~q~~~~~~q~~~-~~~~~~~------~~~~~ 584 (584) T protein:vir:95 530 GKALATFVDDVTGLQGYEIFRPNVAVAEQAETQSLVAQAQEDL-QLQAQMP------AEGAI 584 (584) T ss_pred HHHHHHHHHHHhCCCcccccCCCcccchhHHHHhhhHHHHHHH-HHHHhhh------hccCC Confidence 322222 122 111 111 00000111111111111 1111111 11111 No 95 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.34 E-value=4.8e-11 Score=77.15 Aligned_cols=427 Identities=11% Similarity=0.040 Sum_probs=200.8 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) |+-+........ +.-+. -......... .+..||....-+-...... =-.+.+++.+| T Consensus 71 ~ds~~~~~~~~~----~~~~~-~~~~~~~~~~--~~~~~~~~~~f~gyql~al----------------Y~~~~l~rkiV 127 (765) T protein:vir:96 71 MDSAYGDGPTPA----AKAAA-GGQNPYVVPT--MLQDWYNSQGFIGYQACAI----------------ISQHWLVDKAC 127 (765) T ss_pred ccccccccccch----HHHhh-hccCccchhh--HHHhhhcccCCccHHHHHH----------------HHhCchhhhhh Confidence 333321111111 11100 0000001111 1222332221110000000 01257899999 Q ss_pred hhhhhhhcccceeeecCCcc-hHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCC-C-------------- Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHD-DQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSE-D-------------- 143 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~-de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~-g-------------- 143 (530) +..+.-++-+++.+++.+.+ ..+..+.|+..++ =+....+.++.+.+-.||.+|.++-.+.. + T Consensus 128 d~pAeDa~R~g~~I~~~~~e~~~~~~~~l~~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~~PL~~~~I~k 207 (765) T protein:vir:96 128 SMSGEDAARNGWELKSDGRKLSDEQSALIARRDMEFRVKDNLVELNRFKNVFGVRIALFVVESDDPDYYEKPFNPDGIAP 207 (765) T ss_pred hcchHHhhcCCceeecCccccCHHHHHHHHHHHHHhhHHHHHHHHHHHhhhceeeEEEEEecccCcchhhcccccccccc Confidence 99999999999999875432 2334444554443 25677889999999999999987766422 1 Q ss_pred -ce-EEEEecccceEEEEcCCCCceeEE-EEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccc Q lcl|NC_011308. 144 -KL-TFQTVDALQLLPVFDDYGTLQRII-RFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNP 220 (530) Q Consensus 144 -~~-~~~~~~p~~~~~v~d~~~~~~~~~-~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~ 220 (530) .+ .+.+++|.++.|.........+.- .||... .| ...+ T Consensus 208 g~~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~P~----------------~y--------~i~g--------------- 248 (765) T protein:vir:96 208 GSYKGISQIDPYWAMPQLTAESTADPSAEHFYEPD----------------FW--------IISG--------------- 248 (765) T ss_pred ceeeEEEEechhhcccccchhccccccccccCcce----------------ee--------eecC--------------- Confidence 12 134455555544221100000000 011110 00 0000 Q ss_pred cccceeeeeecccccceecccccccccccccccccCCccceEEee-CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_011308. 221 NPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILY-NNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMA 299 (530) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~-nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~ 299 (530) .. .|.+.+..-.. . .+|-+.-. ++-.|.|.++.+..-|.+++...-..+..+..+. T Consensus 249 ~~-IH~SRli~~~g---------------------~-~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~ 305 (765) T protein:vir:96 249 KK-YHRSHLVVVRG---------------------P-QPPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKR 305 (765) T ss_pred ce-eccceEEEecC---------------------C-CchhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 00 01111000000 0 00110000 1224889999999999999999999999998888 Q ss_pred cceeeeecCCC-CchhhHHHH------Hh-hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCC---- Q lcl|NC_011308. 300 EAIYVVRGGTN-SPVDEIKKN------IQ-SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSS---- 367 (530) Q Consensus 300 ~~~lvl~g~~~-~~~~~~~~~------~~-~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~---- 367 (530) ...+.+.+... .+.+..... .+ ..+++.++.+.+++.+ +.+...+...++...+.|-..+.+|-.- T Consensus 306 ~~v~k~~~~~~l~~~~~l~~r~~~~~~~r~n~g~~~id~ee~~e~~--s~~lsgl~d~l~~~~~~iAaas~IP~t~LfGq 383 (765) T protein:vir:96 306 TSTIHVDVEKAIANEDAFNARLAFWIANRDNHGVKVIGIDETMEQF--DTNLSDFDSVIMNQYQLVAAIAKTPATKLLGT 383 (765) T ss_pred cceeeechHhhhccHHHHHHHHHHHHHhcCCceeEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhhhCCCeeeeccC Confidence 88887766422 222222221 11 2235556666655544 4667788999999999999999999532 Q ss_pred -cccccCCcHH-HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHH-- Q lcl|NC_011308. 368 -AVGDGNATNV-VIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMI-- 443 (530) Q Consensus 368 -~~~~gn~SGv-Aik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~-- 443 (530) ..++ |+||. .++.-|..+. ...+..++..|++++.+|... +. .+ .++++.|++=..-+++|.|++ T Consensus 384 sp~Gl-nATGe~D~~nYyD~I~---s~Qe~~l~p~le~L~~li~~s----~~--i~-~d~~i~FnpL~~~sekEkAei~~ 452 (765) T protein:vir:96 384 SPKGF-NATGEHETISYHEELE---SIQEHIFDPLLERHYLLLAKS----ES--ID-VQLEIVWNPVDSTTSQQQAELNN 452 (765) T ss_pred Ccccc-cCcchHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh----cC--CC-CcceEEeCCCCCCCHHHHHHHHH Confidence 2222 56776 4433333222 233467888898888876542 22 22 268999998888888886665 Q ss_pred -----HHHHHhcCCCcHHHHHHhCC------C--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccC---CccccCCCCC Q lcl|NC_011308. 444 -----DKTEAETNQIQINNLLAIAP------R--IGDEETLKAICDTLDLDYEDVVKALEDQEVEEL---EPTVTPIIDP 507 (530) Q Consensus 444 -----~~~~~~~g~iS~et~l~~~~------~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~---~~~~~~~~~~ 507 (530) .++....|++|...+++.+. + ++|.+.|.+ .-.. .+....+.....++. .++..+...+ T Consensus 453 k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~--~~~~---pe~~~~~~~~~~~~~~~~~e~~~~~a~p 527 (765) T protein:vir:96 453 KKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQAETE--PGMS---PENLAELEKAGAQSAKAKGEAERAEAQA 527 (765) T ss_pred HHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCccccccc--cCCC---ccccccccCCCcccccccCccccccCCC Confidence 56678889999988888762 1 122111100 0000 000011111000000 0000000000 Q ss_pred CCCCCccCcCCCCcc---------------cccccCCC Q lcl|NC_011308. 508 LTIEPQPEPLNIDPV---------------IEEEPVQE 530 (530) Q Consensus 508 ~~~~~~~~~~~~~~~---------------~~~~~~~~ 530 (530) ......++|+..-|+ +.++|-.- T Consensus 528 ~~~eg~~~~~~~~p~~~~p~~~~~~~~~g~~~~~p~~~ 565 (765) T protein:vir:96 528 GAVEGAGDPVPAAPRGTKPLAKAAEEGAGEAATPPSRP 565 (765) T ss_pred CccCCCCcccccCCcccCCccccccccCccccCccccc Confidence 001111111111111 11111000 No 96 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.33 E-value=2.3e-11 Score=78.86 Aligned_cols=487 Identities=9% Similarity=-0.003 Sum_probs=203.1 Q ss_pred ccCCcccHHHHHHHHHHHHHHh-hhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhh Q lcl|NC_011308. 6 LTTAPDRLGTILSTKIDEYIRS-QNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKT 84 (530) Q Consensus 6 ~~~~~~~~~~~i~~~i~~~~~~-~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~ 84 (530) .++...++.++...+-.....+ +-+..+..-.+||.|.+= ........+.. .|..+|..+.+|+.-+ T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw--------~~~~~~~l~~q----~rp~~N~i~~~i~~v~ 68 (725) T protein:vir:77 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQW--------DDWLSQYTTLQ----YRGQFDVVRPVVRKLV 68 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCC--------CHHHHHHHHhc----CCCccccHHHHHHHHH Confidence 3333333443333322222221 223456788899999871 00000011112 2446899999999999 Q ss_pred hhhccccee--eecCCcchHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCeEEEEEEec---CC---CceEEEEe- Q lcl|NC_011308. 85 QYLLANGID--VKPTDHDDQKLCYLIEEYYN-----EEFQSAIQELVEGSTIKGYEGIFARTT---SE---DKLTFQTV- 150 (530) Q Consensus 85 ~yl~G~pv~--~~~~~~~de~~~~~l~~~~~-----~~~~~~~~e~~~~~~~~G~a~~~~y~d---~~---g~~~~~~~- 150 (530) |+---+.+. |.+.+.++....+.|+.++. ++.......+..++.++|.+|.-++.| ++ +++++.+. T Consensus 69 g~~~~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~ 148 (725) T protein:vir:77 69 SEMRQNPIDVLYRPKDGARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREP 148 (725) T ss_pred hhHHhCCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEee Confidence 888877765 45566677777777777663 355666778888999999999766533 22 34444433 Q ss_pred ---cccceEEEEcCCCC------c-eeEEEEEEE------------------------EeecccccccceEEEEEEEcCC Q lcl|NC_011308. 151 ---DALQLLPVFDDYGT------L-QRIIRFYTE------------------------QRYSDADNKFNSIGHADVWTDT 196 (530) Q Consensus 151 ---~p~~~~~v~d~~~~------~-~~~~~~y~~------------------------~~~~~~~~~~~~~~~~evyt~~ 196 (530) ||.++| ||.... . -+++..|.. ..+...+.....+..+++|... T Consensus 149 ~~~~~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~~r~ 226 (725) T protein:vir:77 149 IHSACSHVI--WDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVV 226 (725) T ss_pred cccChhhce--eCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEEEEE Confidence 455554 333211 0 011111110 0011112233456667777754 Q ss_pred ceEE--EeecCCcccchhhccccc-c-------cccccee-eeeecccccc-eecccccccccccccccccCCccceEEe Q lcl|NC_011308. 197 EVWY--YVQKDEGRSDEYVLDTTV-N-------PNPSQHV-LAVADGVDEA-ILDEGVEEHEGRQVLGRSYKSRFPFDIL 264 (530) Q Consensus 197 ~~~~--y~~~~~~~~~~~~~~~~~-~-------~~~~~~~-~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~iPiv~~ 264 (530) .+.. +...+........+.... . ......+ .......... ....+. .......+.+-+++|+||| T Consensus 227 ~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~---~~l~~~~~~~~~~~P~vP~ 303 (725) T protein:vir:77 227 EKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCT---AVLKDKQLIAGEHIPIVPV 303 (725) T ss_pred EEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCc---eeeccCCcCCCCccceEEE Confidence 4432 222221111110000000 0 0000000 0000000000 000111 1111222333344555554 Q ss_pred ---eCCcCCC----CcHHHHHHHHHHHHHHHHHHHHHHHHhccceee-eecCCCCchhhHHHHHhhCcce----ecCCCC Q lcl|NC_011308. 265 ---YNNKLGI----SDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYV-VRGGTNSPVDEIKKNIQSKKII----QTKGEG 332 (530) Q Consensus 265 ---~nn~~~~----sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lv-l~g~~~~~~~~~~~~~~~~~~i----~~~~~~ 332 (530) +....|. |-+..+++.++.+|...|.....+........+ -.|.. +...+..........+ ....+| T Consensus 304 ~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~g 382 (725) T protein:vir:77 304 FGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYYLLNRTDENSG 382 (725) T ss_pred eeeeeccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhh-hHHHHHHHhccCCceecccccccCCC Confidence 3323333 677789999999999999999888765543333 22221 1122222222111111 111222 Q ss_pred -----ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccC-CcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 333 -----GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGN-ATNVVIKSRYTLLAMKAQKTEIALRKTLRWTAD 406 (530) Q Consensus 333 -----~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn-~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~ 406 (530) .+..+..+.=..+....++.....|-..|.+-+..-...|| +||+|+..+-.....-....-..++.+.++..+ T Consensus 383 ~~~~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~ 462 (725) T protein:vir:77 383 DLPTQPLAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGE 462 (725) T ss_pred cccccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23333333333455557888888898888654332222243 699999998877777766666667777776666 Q ss_pred HHHHHHHhcC-------C-Cc---ccc------------------------ceeeEEeCCCCCCCHHHHHHHHHH-HHhc Q lcl|NC_011308. 407 LVVEDIRRRG-------L-GD---YSS------------------------TDIKFDIEPYILANELDLAMIDKT-EAET 450 (530) Q Consensus 407 ~i~~~l~~~~-------~-~~---~d~------------------------~~i~i~f~~~~P~n~~e~a~~~~~-~~~~ 450 (530) ++++++.... + +. .++ .+|.+.=.+..|.=..+.++.++. +... T Consensus 463 ~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~ 542 (725) T protein:vir:77 463 IYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKT 542 (725) T ss_pred HHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHHHHhc Confidence 6665543211 0 00 000 112222222222111111111111 1111 Q ss_pred CC-CcH--HHHHHhCCCCCCH--HHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCccccc Q lcl|NC_011308. 451 NQ-IQI--NNLLAIAPRIGDE--ETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEE 525 (530) Q Consensus 451 g~-iS~--et~l~~~~~vdd~--~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (530) +- .+. -+++..++..+-+ ++..+++.+.... ... .++.+ + ...+...-..+.... T Consensus 543 ~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~------~~~----------~q~~~-~---~e~q~~~~~qq~~~~ 602 (725) T protein:vir:77 543 PQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQ------MGV----------KKPET-P---EEQQWLVEAQQAKQG 602 (725) T ss_pred cccchhHHHHHHHhhccccchHHHHHHHHHHhhhhh------hhc----------cCCCC-h---hhHHHHHHHHHHHHH Confidence 11 111 1111112111111 1111111110000 000 00000 0 000000000000000 Q ss_pred ccCCC Q lcl|NC_011308. 526 EPVQE 530 (530) Q Consensus 526 ~~~~~ 530 (530) .+.-+ T Consensus 603 q~~~e 607 (725) T protein:vir:77 603 QQDPA 607 (725) T ss_pred hHHHH Confidence 00000 No 97 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.29 E-value=1.4e-10 Score=74.53 Aligned_cols=436 Identities=10% Similarity=0.095 Sum_probs=199.0 Q ss_pred CCcccccCCccc------HHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecC Q lcl|NC_011308. 1 MTNTLLTTAPDR------LGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHG 74 (530) Q Consensus 1 ~~~~~~~~~~~~------~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n 74 (530) +.+--.+...+. ..-.+. +..-....+-.+. ...||....-+-. .. ..-.+ .+. T Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~a--~~~g~~~~~~~~~--~~~~~~~~~~~~~-~l--------------~a~Y~-~~~ 95 (532) T protein:vir:94 36 ATAHEIDPTAYSPYERNAAQNAMA--MDYGLQTGRNGRN--ALSFVEATSWPGF-PT--------------LALLA-QLP 95 (532) T ss_pred hhhhhhcccccccccccccccccc--cccccCccccccc--ccccccccccchH-HH--------------HHHHH-cCc Confidence 111111110000 000000 0000000000000 0011111110000 00 00001 257 Q ss_pred chhhHHhhhhhhhcccceeeecCCcc--hHHHHHHHHHHhhc-cHHHHHHHHHHHHhhcCeEEEEEEecCCCc------- Q lcl|NC_011308. 75 FFAELVDQKTQYLLANGIDVKPTDHD--DQKLCYLIEEYYNE-EFQSAIQELVEGSTIKGYEGIFARTTSEDK------- 144 (530) Q Consensus 75 ~~k~Ivd~~~~yl~G~pv~~~~~~~~--de~~~~~l~~~~~~-~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~------- 144 (530) +++.+|+..+.=++-+++++++.+.. .......|...++. +....+.++.+.+-.||.++.++-.+.+|. T Consensus 96 l~r~~Vd~~aed~~r~~~~i~~~~~~~~~~~~~~~i~~~~~~l~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p 175 (532) T protein:vir:94 96 EYRTMHETPADECVRAWGKITCSSKDELAADKATRITQKLEQYNVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAP 175 (532) T ss_pred hhhhhhccchHHHhhCCceEeeCCccccchHHHHHHHHHHHhhhHHHHHHHHHHhhhcccceEEEEEeccCCcccccccc Confidence 78889999999999999999774432 22333444443321 566788889999999999998877654331 Q ss_pred ------------e-EEEEecccceEEE-EcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccc Q lcl|NC_011308. 145 ------------L-TFQTVDALQLLPV-FDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSD 210 (530) Q Consensus 145 ------------~-~~~~~~p~~~~~v-~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~ 210 (530) + .+.+++|.++.|- ++...-..+ .|+....|.-. .+.. |-++.+.+|.. T Consensus 176 ~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp--~fg~P~~y~v~--~g~~------iH~SRli~f~g------- 238 (532) T protein:vir:94 176 LLLSPSFVQRGCLIGFATIEPMWLSPNAYNATDPTLP--SFYKPDSWIAT--SGKK------IHSSRIHTVVG------- 238 (532) T ss_pred ccccccccccceeeEEEeechheeccccccccccccc--ccCCceeEEEc--cCee------eccceEEEecC------- Confidence 1 2445566665552 111111100 01111110000 0000 00111111100 Q ss_pred hhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEee-CCcCCCCcHHHHHHHHHHHHHHHH Q lcl|NC_011308. 211 EYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILY-NNKLGISDIKKVKSIIDDYDLMNC 289 (530) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~-nn~~~~sd~e~v~~liDa~~~~~S 289 (530) ..+|-+..+ ++=.|.|.++.+..-+.+|+.+.- T Consensus 239 ----------------------------------------------~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~ 272 (532) T protein:vir:94 239 ----------------------------------------------RPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQ 272 (532) T ss_pred ----------------------------------------------CCchhhhccccccccccHHHHHHHHHHHHHHHHH Confidence 001111100 122488999999999999999998 Q ss_pred HHHHHHHHhccceeeeecCCC---CchhhHHHHH------h-hCcceecCC-CCceeEEEecCCHHHHHHHHHHHHHHHH Q lcl|NC_011308. 290 FLSNNLQDMAEAIYVVRGGTN---SPVDEIKKNI------Q-SKKIIQTKG-EGGLDIQTVDIPYEARKAKMDIDELNIY 358 (530) Q Consensus 290 ~~~n~~~~~~~~~lvl~g~~~---~~~~~~~~~~------~-~~~~i~~~~-~~~~~~lt~~~~~~~~e~~ld~L~~~I~ 358 (530) ..+..+..+....+++..... +....+...+ + ..+++.++. +.+++.+ ..+...+...++...+.|. T Consensus 273 ~~~~l~~~~~~~v~k~~~a~~ls~~~~~~~~~r~~~~~~~~~n~g~~~id~~~e~~e~~--~~~lsgl~~~l~~~~~~iA 350 (532) T protein:vir:94 273 SVSDTVKQFSMTNLATDMAQLLAPGGAQSLDARLQLFNLYRDNRNIGALDKGTEEIQQT--NTPLSGLDSLQAQSQEQMA 350 (532) T ss_pred HHHHHHHhcCCceeeechHHhhcchhHHHHHHHHHHHHhhcCCccceEEcCCCceeEEE--ecccCCHHHHHHHHHHHHH Confidence 888888888877776532111 1112222211 1 123444554 3444444 4667778889999999999 Q ss_pred HHhcccCCC-----cccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCC Q lcl|NC_011308. 359 RSGMGFNSS-----AVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYI 433 (530) Q Consensus 359 ~~s~~p~~~-----~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~ 433 (530) ..+.+|-.- ..++ |+||..=+.-|.... ..+.+..++..|++++.++..- ..+ ..+ .++++.|++=. T Consensus 351 aa~~IP~t~LfG~sp~Gl-nstGe~D~~~yyd~I--~s~Qe~~l~p~le~l~~~l~~s--~~g--~~~-~d~~~~f~pL~ 422 (532) T protein:vir:94 351 AVSHIPLVKLLGITPNGL-NASSDGEIRVWYDFI--AGYQATNLTPLMEWIIDLIQLS--EYG--QID-PGLAWEWSPLM 422 (532) T ss_pred hHhCCCeeeeecCCcccc-cccchHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHH--hcC--CCC-CCceEEeCCCC Confidence 999999542 2222 355653333333321 1234466778888887776431 112 222 25889999877 Q ss_pred CCCHHHHHHH-------HHHHHhcCCCcHHHHHHhCCCCCCH-----HHHHHHHHHHHHHHHHHHHhhhccccc-cCCcc Q lcl|NC_011308. 434 LANELDLAMI-------DKTEAETNQIQINNLLAIAPRIGDE-----ETLKAICDTLDLDYEDVVKALEDQEVE-ELEPT 500 (530) Q Consensus 434 P~n~~e~a~~-------~~~~~~~g~iS~et~l~~~~~vdd~-----~~e~~~~e~e~~e~~~~~~~~~~~~~~-~~~~~ 500 (530) .-+++|.|++ .+++.+.|++|.+.+...+..-.+. ..+.+.+++...+..+.......+... ...+. T Consensus 423 ~~s~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (532) T protein:vir:94 423 ELDDKELAEVRQLNASTDSTLMELGVIDAKMVQQRLAADPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQTPN 502 (532) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhcCCccccccccccccccccccchhhhhcccccCCCCCCCCCCC Confidence 7777776654 4567888999998888776421110 000001111111111111111111100 00011 Q ss_pred ccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 501 VTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 501 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) ..+...+ -+.+.++-...+|.-.+.||-. T Consensus 503 ~~~~~~~-d~~~~~~~~~~~~~~~~~~~~~ 531 (532) T protein:vir:94 503 PQPDSED-DQTDNQPDAQADPAQNDQPVGN 531 (532) T ss_pred CCCCCCC-CCCCCccCCCccccccCCCcCC Confidence 1111101 1112222333445566666666 No 98 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=99.29 E-value=1.4e-11 Score=80.17 Aligned_cols=485 Identities=9% Similarity=-0.009 Sum_probs=203.5 Q ss_pred ccCCcccHHHHHHHHHHHHHH-hhhHHHHHHHHHHhcccc---hhhhcccccccccccccccccCCcceeecCchhhHHh Q lcl|NC_011308. 6 LTTAPDRLGTILSTKIDEYIR-SQNVSLARVGQRYYNQDN---DIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVD 81 (530) Q Consensus 6 ~~~~~~~~~~~i~~~i~~~~~-~~~~~~~~~~~~YY~g~~---~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd 81 (530) .++....+.++...+-..... ..-+..+.+-.+||.|.| .+.. ..+... |..+|..+.+|+ T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW~~~~~~-----------~l~~q~----rp~~N~i~~~v~ 65 (725) T protein:vir:10 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQ-----------YTTLQY----RGQFDVVRPVVR 65 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHH-----------HHHhcC----CCcccchHHHHH Confidence 333333333333222222222 122345678889999987 1111 111222 346799999999 Q ss_pred hhhhhhccccee--eecCCcchHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCeEEEEEE---ecCC---CceEEE Q lcl|NC_011308. 82 QKTQYLLANGID--VKPTDHDDQKLCYLIEEYYN-----EEFQSAIQELVEGSTIKGYEGIFAR---TTSE---DKLTFQ 148 (530) Q Consensus 82 ~~~~yl~G~pv~--~~~~~~~de~~~~~l~~~~~-----~~~~~~~~e~~~~~~~~G~a~~~~y---~d~~---g~~~~~ 148 (530) .-+||---+.+. |.+.+.++....+.|+.++. ++.......+..++.++|.+|.-+. .+++ +++++. T Consensus 66 ~v~g~e~~nr~d~~v~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~ 145 (725) T protein:vir:10 66 KLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIR 145 (725) T ss_pred HHHhhHHhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCCceeee Confidence 999998877765 45566667777777777653 3556667788889999999997664 3333 334443 Q ss_pred Ee----cccceEEEEcCCCC------ce-eEEEEEEEE---------e---------------ecccccccceEEEEEEE Q lcl|NC_011308. 149 TV----DALQLLPVFDDYGT------LQ-RIIRFYTEQ---------R---------------YSDADNKFNSIGHADVW 193 (530) Q Consensus 149 ~~----~p~~~~~v~d~~~~------~~-~~~~~y~~~---------~---------------~~~~~~~~~~~~~~evy 193 (530) .+ ||.++| ||.... .. .+++.|... + +...+...+.+..+++| T Consensus 146 ~~~i~~~~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vrv~E~~ 223 (725) T protein:vir:10 146 REPIHSACSHVI--WDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQIAEFY 223 (725) T ss_pred eeecccCHhHcc--cCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEEEEEEE Confidence 32 344454 443211 11 111111110 0 00111123344445555 Q ss_pred cCCceE--EEeecCCcccchhhccccc---------cccccceeeeeecccc-cceecccccccccccccccccCCccce Q lcl|NC_011308. 194 TDTEVW--YYVQKDEGRSDEYVLDTTV---------NPNPSQHVLAVADGVD-EAILDEGVEEHEGRQVLGRSYKSRFPF 261 (530) Q Consensus 194 t~~~~~--~y~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~iPi 261 (530) ....+. .|...+........++... ................ ......+.. ......+.+-+++|+ T Consensus 224 ~r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~---~l~~~~~~~~~~fP~ 300 (725) T protein:vir:10 224 EVVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTA---VLKDKQLIAGEHIPI 300 (725) T ss_pred EEEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchh---hhcCCCCCCCCceeE Confidence 533322 1212111111100000000 0000000000000000 000011111 111222333344555 Q ss_pred EEe---eCCcC----CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceec----CC Q lcl|NC_011308. 262 DIL---YNNKL----GISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQT----KG 330 (530) Q Consensus 262 v~~---~nn~~----~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~----~~ 330 (530) ||| +.... +.|.+-.+++.++.+|...|.....+........++.....+...+.........++.. .. T Consensus 301 vP~~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~~~~~~~ 380 (725) T protein:vir:10 301 VPVFGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDEN 380 (725) T ss_pred EEEEeeeeccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCceeeeccccccc Confidence 554 33222 33888899999999999999999888765554443321111111222221111111111 11 Q ss_pred CC-----ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccccc-CCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 331 EG-----GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDG-NATNVVIKSRYTLLAMKAQKTEIALRKTLRWT 404 (530) Q Consensus 331 ~~-----~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~g-n~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~ 404 (530) +| .+.+...+.-..++...++.....|-.+|.+-+..-...| +.||+|+..+-..........-..++.+.++. T Consensus 381 ~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~ 460 (725) T protein:vir:10 381 NGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRD 460 (725) T ss_pred CcccccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22 2333333334456667889999999999976543322234 46999999987777666665666666666666 Q ss_pred HHHHHHHHHhcC-------C-Ccc---cc------------------------ceeeEEeCCCCCCCHHHHHHHH-HHHH Q lcl|NC_011308. 405 ADLVVEDIRRRG-------L-GDY---SS------------------------TDIKFDIEPYILANELDLAMID-KTEA 448 (530) Q Consensus 405 ~~~i~~~l~~~~-------~-~~~---d~------------------------~~i~i~f~~~~P~n~~e~a~~~-~~~~ 448 (530) .+++++++..-. + +.. .+ .++.+.=.+..|.=..+.+..+ ..+. T Consensus 461 g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~ 540 (725) T protein:vir:10 461 GEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELLG 540 (725) T ss_pred HHHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHHH Confidence 665555433210 0 000 00 1222222222222111222111 1111 Q ss_pred hcCC-CcH--HHHHHhCCCC--CCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCccc Q lcl|NC_011308. 449 ETNQ-IQI--NNLLAIAPRI--GDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVI 523 (530) Q Consensus 449 ~~g~-iS~--et~l~~~~~v--dd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (530) ..+. .+. .+++..++.. ...++..+++.+.... +.. .++. ++..++...-..+.. T Consensus 541 ~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~------~~~----------~~~~----~~e~~q~~~e~qq~~ 600 (725) T protein:vir:10 541 KTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQ------MGV----------KKPE----TPEEQQWLVEAQQAK 600 (725) T ss_pred hccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhh------hcc----------CCcc----ccchhHHHHHHHHHH Confidence 1111 111 2222222221 1122222222211000 000 0000 000000000000011 Q ss_pred ccccCCC Q lcl|NC_011308. 524 EEEPVQE 530 (530) Q Consensus 524 ~~~~~~~ 530 (530) ...+..+ T Consensus 601 ~~q~~~e 607 (725) T protein:vir:10 601 QGQQDPA 607 (725) T ss_pred HhhhHHH Confidence 1111111 No 99 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=99.28 E-value=1.2e-10 Score=74.98 Aligned_cols=487 Identities=11% Similarity=0.045 Sum_probs=212.7 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHH-----hhhHHHHHHHHHHhc--ccc---hhhhcccccccccccccccccCCcce Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIR-----SQNVSLARVGQRYYN--QDN---DIENTRIMWMNDHGDIVEDDNASNIK 70 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~-----~~~~~~~~~~~~YY~--g~~---~I~~r~~~~~~~~~~~~~~~~~~n~k 70 (530) |+. +..+++.++...|.. ++.+.+...-.+||. |.| .+...... ......+| . T Consensus 1 m~e--------~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~-------~~q~~grP--~ 63 (706) T protein:vir:10 1 MAE--------SRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKL-------DEQFEKYP--K 63 (706) T ss_pred CCc--------chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHh-------hhhhcCCC--c Confidence 332 234455555544432 233444556667774 554 11110000 00011234 6 Q ss_pred eecCchhhHHhhhhhhhcccceeeecC---CcchHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCeEEEEEEec-- Q lcl|NC_011308. 71 ISHGFFAELVDQKTQYLLANGIDVKPT---DHDDQKLCYLIEEYYN-----EEFQSAIQELVEGSTIKGYEGIFARTT-- 140 (530) Q Consensus 71 i~~n~~k~Ivd~~~~yl~G~pv~~~~~---~~~de~~~~~l~~~~~-----~~~~~~~~e~~~~~~~~G~a~~~~y~d-- 140 (530) +.+|..+.+|+.-+|+.--+.+.+... +.++.+..+.|+.++. ++.......+..++.++|.+|.-++.| T Consensus 64 ~~~N~i~~~v~~v~g~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~ 143 (706) T protein:vir:10 64 FEINKVATELNRIISEYRNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFV 143 (706) T ss_pred eEecchHHHHHHHhhHHHhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccc Confidence 889999999999999999888876532 3345666777776653 356677788889999999999777543 Q ss_pred -C------CCceEEEEe-cccceEEEEcCCC------CceeEEEE-EEEEe--------------------ecccccccc Q lcl|NC_011308. 141 -S------EDKLTFQTV-DALQLLPVFDDYG------TLQRIIRF-YTEQR--------------------YSDADNKFN 185 (530) Q Consensus 141 -~------~g~~~~~~~-~p~~~~~v~d~~~------~~~~~~~~-y~~~~--------------------~~~~~~~~~ 185 (530) + ++++++..+ +|.+.+ +||... +...+++. |.... +...+.... T Consensus 144 ~~~d~~~~~~~i~i~~v~~p~~~v-~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d 222 (706) T protein:vir:10 144 NEYDPMDERQRIAVEPIYDPARSV-WFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPD 222 (706) T ss_pred cccCCCCCCccceeeeeccchhce-ecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCC Confidence 1 123444433 565421 244321 11112221 11000 000111233 Q ss_pred eEEEEEEEcCCceE----EEeecCCcccchhhcccccc------ccccceeeeeecccccceeccccccccccccccccc Q lcl|NC_011308. 186 SIGHADVWTDTEVW----YYVQKDEGRSDEYVLDTTVN------PNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSY 255 (530) Q Consensus 186 ~~~~~evyt~~~~~----~y~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 255 (530) .+...+.|+..... .|+....+.......+.... ......+.. .....................+.+.+ T Consensus 223 ~~~~~eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~-~~~~~~~v~~~~~~g~~~l~~~~p~~ 301 (706) T protein:vir:10 223 VVYIAKYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGR-RSVKRRRIYVAVVDGDGFLEKPRRIP 301 (706) T ss_pred cceecccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhh-cccceeeEEEEeeccccccccCCCCC Confidence 44455555544321 11111111110000000000 000000000 00000000000000011112234455 Q ss_pred CCccceEEeeCCc---C----CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhC----- Q lcl|NC_011308. 256 KSRFPFDILYNNK---L----GISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSK----- 323 (530) Q Consensus 256 ~~~iPiv~~~nn~---~----~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~----- 323 (530) .+++|+|||+... . ..|.+-.+++.++.+|+.+|.+.+.+........+ |... +-..+....... T Consensus 302 ~~~~P~vP~~g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~--~~~~-~i~~~~~~~~~~~~~~~ 378 (706) T protein:vir:10 302 GEHIPLIPVYGKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPI--VDME-QIRGLEQHWEGRNRKRP 378 (706) T ss_pred CCccceEEEeeccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccc--cchh-HHHHHHHHhhhcccccc Confidence 5888888875432 2 35677889999999999999999987555443222 2211 111111111000 Q ss_pred cceec---C-CCC-------ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHH Q lcl|NC_011308. 324 KIIQT---K-GEG-------GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQK 392 (530) Q Consensus 324 ~~i~~---~-~~~-------~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ 392 (530) ..+.+ . .+| ...++..+.-..+....+......|-.+|.+-+.....-||+||+||..+-.....-... T Consensus 379 ~~l~~~~~~~~~g~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~ 458 (706) T protein:vir:10 379 AFLPLRTVTDKTGNVVAPANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPSNVARETVNSLLNRSDMASFI 458 (706) T ss_pred cchhcccccCCCCcccccccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccchHHHHHHHHHHHHHHHHHH Confidence 00111 1 112 223333333445666778888888988887755433334689999999987777777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHh----------cCC--Cc---------c-----------cc----ceeeEEeCCCCCCC Q lcl|NC_011308. 393 TEIALRKTLRWTADLVVEDIRR----------RGL--GD---------Y-----------SS----TDIKFDIEPYILAN 436 (530) Q Consensus 393 ke~~f~~~l~~~~~~i~~~l~~----------~~~--~~---------~-----------d~----~~i~i~f~~~~P~n 436 (530) .-..|+.+.++..+++++++.. .+. .. . |. .+|.+.=.+..|.= T Consensus 459 ~~Dnl~~~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~ 538 (706) T protein:vir:10 459 YLDNMAKSLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSAR 538 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchH Confidence 7777788887777766666542 110 00 0 00 02222222333332 Q ss_pred HHHHHHHHHHHHhc-CCCcHHH------HHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCC Q lcl|NC_011308. 437 ELDLAMIDKTEAET-NQIQINN------LLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLT 509 (530) Q Consensus 437 ~~e~a~~~~~~~~~-g~iS~et------~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (530) ..+..+.++-+... +.....+ +++.+++ ...++..+++.+..- .+.. .+ +.. T Consensus 539 r~~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~d~-p~~~e~~e~irk~~~----------~q~~------~~----~~~ 597 (706) T protein:vir:10 539 RDATVNALTQLLQGMLPQDPMRPALMGIIIDNMEG-EGLDDFKAFNRRQLL----------TQGI------VK----PRN 597 (706) T ss_pred HHHHHHHHHHHHHhcCCcchhhHHHHHHHHhhcCc-cchHHHHHHHHHhhc----------ccCC------cc----ccc Confidence 33333333332222 2222222 2332221 222233333321100 0000 00 000 Q ss_pred CCCccCcCCCCcccccccC-----------------CC Q lcl|NC_011308. 510 IEPQPEPLNIDPVIEEEPV-----------------QE 530 (530) Q Consensus 510 ~~~~~~~~~~~~~~~~~~~-----------------~~ 530 (530) +..++......+....++. ++ T Consensus 598 ~~eq~~~~q~qq~q~~q~~~~~~~~~aq~~~~qA~~~k 635 (706) T protein:vir:10 598 QQEQAIVQQAQQAQATQPDPNMLLAQAQMVVAQAEAQK 635 (706) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000000000000000000 01 No 100 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=99.26 E-value=2.7e-11 Score=78.50 Aligned_cols=487 Identities=9% Similarity=0.004 Sum_probs=197.9 Q ss_pred ccCCcccHHHHHHHHHHHHHH-hhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhh Q lcl|NC_011308. 6 LTTAPDRLGTILSTKIDEYIR-SQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKT 84 (530) Q Consensus 6 ~~~~~~~~~~~i~~~i~~~~~-~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~ 84 (530) .++.-..+.++...+-..... .+-+..+.+-.+||.|.+= ........+... |..+|..+.+|+.-+ T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw--------~~~~~~~l~~q~----rp~~N~i~~~i~~v~ 68 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRISQW--------DDWLSQYTTLQY----RGQFDVVRPVVRKLV 68 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCC--------CHHHHHHHHhcC----CCcccchHHHHHHHH Confidence 233222333333222222211 1223456788899999871 000000111122 446799999999988 Q ss_pred hhhccccee--eecCCcchHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCeEEEEEEec---CC---CceEEEEe- Q lcl|NC_011308. 85 QYLLANGID--VKPTDHDDQKLCYLIEEYYN-----EEFQSAIQELVEGSTIKGYEGIFARTT---SE---DKLTFQTV- 150 (530) Q Consensus 85 ~yl~G~pv~--~~~~~~~de~~~~~l~~~~~-----~~~~~~~~e~~~~~~~~G~a~~~~y~d---~~---g~~~~~~~- 150 (530) ||---+.+. |.+.+.++.+..+.|+.++. ++.......+..++.++|.+|.-+..| ++ +++++... T Consensus 69 g~e~~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~ 148 (725) T protein:vir:92 69 SEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQVIRREP 148 (725) T ss_pred hhHHhCCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEee Confidence 887766665 55666677777777777663 355667778888999999999766433 22 34444432 Q ss_pred --ccc-ceEEEEcCCCC------ce-eEEE-------------EEE-----------EEeecccccccceEEEEEEEcCC Q lcl|NC_011308. 151 --DAL-QLLPVFDDYGT------LQ-RIIR-------------FYT-----------EQRYSDADNKFNSIGHADVWTDT 196 (530) Q Consensus 151 --~p~-~~~~v~d~~~~------~~-~~~~-------------~y~-----------~~~~~~~~~~~~~~~~~evyt~~ 196 (530) +|. ++| ||.... .. .+++ .|- ...+...+...+.+..+++|... T Consensus 149 i~~~~~~V~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~ 226 (725) T protein:vir:92 149 IHSACSHVI--WDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVV 226 (725) T ss_pred ccCChhhcc--cCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEEEEEEEEEE Confidence 233 343 332211 00 1111 110 00011111233455666666644 Q ss_pred ceE--EEeecCCcccchhhcccc-cc-------ccccceeee-eeccccc-ceecccccccccccccccccCCccceEEe Q lcl|NC_011308. 197 EVW--YYVQKDEGRSDEYVLDTT-VN-------PNPSQHVLA-VADGVDE-AILDEGVEEHEGRQVLGRSYKSRFPFDIL 264 (530) Q Consensus 197 ~~~--~y~~~~~~~~~~~~~~~~-~~-------~~~~~~~~~-~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~iPiv~~ 264 (530) .+. .|...+........+... .. ......+.. ....... .....+.. ......+.+-+++|+||| T Consensus 227 ~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~---~l~~~~~~~~~~~P~vP~ 303 (725) T protein:vir:92 227 EKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTA---VLKDKQLIAGEHIPIVPV 303 (725) T ss_pred EEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchh---hhcCCCCCCCCceeeEEE Confidence 332 222222111111000000 00 000000000 0000000 00011111 111222333345555554 Q ss_pred e---CCcC----CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeee-ecCCCCchhhHHHHHhhCccee---c-CCCC Q lcl|NC_011308. 265 Y---NNKL----GISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVV-RGGTNSPVDEIKKNIQSKKIIQ---T-KGEG 332 (530) Q Consensus 265 ~---nn~~----~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl-~g~~~~~~~~~~~~~~~~~~i~---~-~~~~ 332 (530) . .... +.|.+-.+++.++.+|+..|.....+.......+++ .|.. +...+.........++. + ..+| T Consensus 304 ~g~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~g 382 (725) T protein:vir:92 304 FGEWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYYLLNRTDENNG 382 (725) T ss_pred EeeeeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhh-hHHHHHHhccCccceeeccccccccc Confidence 3 2222 337888999999999999999998887655543332 2211 11111111111111111 1 1122 Q ss_pred -----ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 333 -----GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTAD 406 (530) Q Consensus 333 -----~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~ 406 (530) .+++.....-..+....++.....|-.+|.+-+-.-... ++.||+|+..+-..........-..|+.+.++..+ T Consensus 383 ~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~ 462 (725) T protein:vir:92 383 EMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGE 462 (725) T ss_pred cccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 233333334455666788999999999997643222222 34699999988776666555555556666666555 Q ss_pred HHHHHHHhcC-------CCccccceeeEEeCCCCCCC----------------------------HHHHHHHHHH-HHhc Q lcl|NC_011308. 407 LVVEDIRRRG-------LGDYSSTDIKFDIEPYILAN----------------------------ELDLAMIDKT-EAET 450 (530) Q Consensus 407 ~i~~~l~~~~-------~~~~d~~~i~i~f~~~~P~n----------------------------~~e~a~~~~~-~~~~ 450 (530) +++.++.... +..-+...-.+.++...+.. ..+.+..++. +... T Consensus 463 ~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~ 542 (725) T protein:vir:92 463 IYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKT 542 (725) T ss_pred HHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHHHHHHhc Confidence 5555432211 00001111112222222211 1111111111 1111 Q ss_pred CCC-cH--HHHHHhCCCCC--CHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCccccc Q lcl|NC_011308. 451 NQI-QI--NNLLAIAPRIG--DEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEE 525 (530) Q Consensus 451 g~i-S~--et~l~~~~~vd--d~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (530) +.+ +. -+++..++..+ ..++..+++. .+..... ..++. +...++...-..+.... T Consensus 543 ~~~~~~~~~~l~~~~~~~d~~~~~e~~erir--------------kq~~~~~--~~~~~----~~e~~q~~~~~qqa~~~ 602 (725) T protein:vir:92 543 PQGTPEYQLLLLQYFTLLDGKGVEMMRDYAN--------------KQLIQMG--VKKPE----TPEEQQWLVEAQQAKQG 602 (725) T ss_pred ccchhHHHHHHHHHhhcccchHHHHHHHHHH--------------hhhchhc--cCCcc----chhhhHHHHHHHHHHHh Confidence 100 00 01111111100 0111111111 1100000 00000 00000000000000000 Q ss_pred ccCCC Q lcl|NC_011308. 526 EPVQE 530 (530) Q Consensus 526 ~~~~~ 530 (530) .+..+ T Consensus 603 q~~~e 607 (725) T protein:vir:92 603 QQDPA 607 (725) T ss_pred hhHHH Confidence 11110 No 101 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.25 E-value=4.7e-11 Score=77.18 Aligned_cols=429 Identities=11% Similarity=0.063 Sum_probs=198.1 Q ss_pred CCcccccCCcccHHHHHHH--------------------------------HHHHH---HHhhhHHHHHHHHHHhcccch Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILST--------------------------------KIDEY---IRSQNVSLARVGQRYYNQDND 45 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~--------------------------------~i~~~---~~~~~~~~~~~~~~YY~g~~~ 45 (530) |...-.++.-+....-.+. ....+ -...+... ....||.... T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~- 101 (537) T protein:vir:10 25 VGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEG--LVLWYAQQAF- 101 (537) T ss_pred cCCCcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhhhhhhccccccch--hhhhccccCC- Confidence 1100000000000000000 00000 00000000 0111222211 Q ss_pred hhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhcccceeeecCCcc--hHHHHHHHHHHhh-ccHHHHHHH Q lcl|NC_011308. 46 IENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLANGIDVKPTDHD--DQKLCYLIEEYYN-EEFQSAIQE 122 (530) Q Consensus 46 I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~--de~~~~~l~~~~~-~~~~~~~~e 122 (530) +.+ .. .-. . -.+.+++.+|+..+.-++-+++.+++.+.+ +.+..+.|+..++ -+....+.+ T Consensus 102 ~~~-~l----------~a~----Y-~~~~l~r~iVd~~A~d~~r~~~~i~~~~~~~~~~~~~~~l~~~~~~l~~~~~l~~ 165 (537) T protein:vir:10 102 IGH-QM----------CAL----I-ATHWLVNKACSQMPRDAMRKGYKIISDDGNELDPKDAKFIDRYDRAFNIKKHAIQ 165 (537) T ss_pred ccH-HH----------HHH----H-HhCchhhhhhhhhhHHhhcCCceeecCCcccccHHHHHHHHHHHHHhhHHHHHHH Confidence 000 00 000 0 126789999999999999999999876532 2233444554443 256678888 Q ss_pred HHHHHhhcCeEEEEEEecC-CCc---------------e-EEEEecccceEEEEcCCCCceeEE-EEEEEEeeccccccc Q lcl|NC_011308. 123 LVEGSTIKGYEGIFARTTS-EDK---------------L-TFQTVDALQLLPVFDDYGTLQRII-RFYTEQRYSDADNKF 184 (530) Q Consensus 123 ~~~~~~~~G~a~~~~y~d~-~g~---------------~-~~~~~~p~~~~~v~d~~~~~~~~~-~~y~~~~~~~~~~~~ 184 (530) +.+.+-.+|.++.++..+. ++. + .+.+++|.++-|...+.....+.- .|+....|.- .. T Consensus 166 a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v---~g 242 (537) T protein:vir:10 166 FVRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLI---NG 242 (537) T ss_pred HHHhcccccceEEEEeecCcCCcccccccccccccccceeEEEEechhhcccccchhhhccCCccccCCceeeee---cC Confidence 9999999999998887642 221 1 244566666555321111000000 0111100000 00 Q ss_pred ceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEe Q lcl|NC_011308. 185 NSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDIL 264 (530) Q Consensus 185 ~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~ 264 (530) .. |-+..+.+|... . +|-+.- T Consensus 243 ~~------iH~SRli~f~g~--------------------------------------~---------------~p~~~~ 263 (537) T protein:vir:10 243 KK------YHRSHLAIYIND--------------------------------------E---------------VVDFLK 263 (537) T ss_pred eE------ecceeEEEecCC--------------------------------------C---------------Cchhhh Confidence 00 001111111000 0 000000 Q ss_pred e-CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC-CchhhHHHH------Hhh-CcceecCCCCcee Q lcl|NC_011308. 265 Y-NNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN-SPVDEIKKN------IQS-KKIIQTKGEGGLD 335 (530) Q Consensus 265 ~-nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~-~~~~~~~~~------~~~-~~~i~~~~~~~~~ 335 (530) + ++=.|.|.++.+..-+.+++.+.-..+..+..+....+.+.|... .+.+.+... .+. .+++.++.++ .+ T Consensus 264 ~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~~~~r~~~~~~~r~n~g~~~id~e~-e~ 342 (537) T protein:vir:10 264 PSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQFDETMSWWTATRDNYQVRVVDKDN-ED 342 (537) T ss_pred cccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHHHHHHHHHHHhhcCCcceeEecCCC-ce Confidence 0 112488999999999999999999999999999988888776432 111222111 112 2445555542 24 Q ss_pred EEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc---c-CCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 336 IQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD---G-NATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVED 411 (530) Q Consensus 336 ~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~---g-n~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~ 411 (530) |-+...+.......++...+.|...+.+|-.--.+. | |+||..=..-|.... +..+..++..|++++.+|+.. T Consensus 343 ~e~~~~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yyd~I---~~~Qe~l~p~l~~l~~ll~~~ 419 (537) T protein:vir:10 343 VVQIDTTLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYHEEC---ESTQDDMRPLIDRHHQLVCRS 419 (537) T ss_pred eEEEeccCCCHHHHHHHHHHHHHhhhCCCceeeccCCccccccchhHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh Confidence 455556777788999999999999999995422111 2 466764444444432 222224688888888776542 Q ss_pred HHhcCCCccccceeeEEeCCCCCCCHHHHHHH-------HHHHHhcCCCcHHHHHHhCCCCCCH--HHHHHHHHHHHHHH Q lcl|NC_011308. 412 IRRRGLGDYSSTDIKFDIEPYILANELDLAMI-------DKTEAETNQIQINNLLAIAPRIGDE--ETLKAICDTLDLDY 482 (530) Q Consensus 412 l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~-------~~~~~~~g~iS~et~l~~~~~vdd~--~~e~~~~e~e~~e~ 482 (530) ..+. ..++++.|++=..-+++|.|++ .+++...|++|.+.+...|..-.+. ......+..+..+ T Consensus 420 ---~~~~---~~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e- 492 (537) T protein:vir:10 420 ---HLRK---RIRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAE- 492 (537) T ss_pred ---cCCC---CcceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCccccccccCCCChhhhh- Confidence 1222 2368999998888888887765 6678888899998887776421110 0000000000000 Q ss_pred HHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 483 EDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) .. ....+..+ ++... ....+....+.+..+...++|..- T Consensus 493 --~~-~~~~~~~~-----~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 531 (537) T protein:vir:10 493 --DI-DVDDEGKP-----VRIIE-DQPAPSEMFGATSSGESANDPRDS 531 (537) T ss_pred --cc-cCCccCCc-----CCCCC-CCCCccccCCCCccccccCCCccC Confidence 00 00000000 00000 000000011111111111111111 No 102 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.25 E-value=9.7e-11 Score=75.47 Aligned_cols=450 Identities=10% Similarity=0.010 Sum_probs=202.7 Q ss_pred CCccccc---CCcccHHHHHHHHHHHHHHhhh---HHHH----HHHHHHhcccchhhhccccc---ccccccc--ccccc Q lcl|NC_011308. 1 MTNTLLT---TAPDRLGTILSTKIDEYIRSQN---VSLA----RVGQRYYNQDNDIENTRIMW---MNDHGDI--VEDDN 65 (530) Q Consensus 1 ~~~~~~~---~~~~~~~~~i~~~i~~~~~~~~---~~~~----~~~~~YY~g~~~I~~r~~~~---~~~~~~~--~~~~~ 65 (530) -+..+.. +.-......+...+. +.+.-+ ..+. ..+..|-.+-...-...... ....+.. ..... T Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~~~a-~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f~gy 137 (862) T protein:vir:99 59 SVKDFPFVEISDSVNAKSVSGKNFA-MDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGFIGH 137 (862) T ss_pred cccccccccccccccchhhhhhhhc-chhhcchhhhhhhhhhhhcchhhhhhccccccccccccchhccccccccCcccH Confidence 1111110 000001111111000 000000 0000 00111111000000000000 0000000 00000 Q ss_pred -CCcceeecCchhhHHhhhhhhhcccceeeecCCcc---hHHHHHHHHHHhhc-cHHHHHHHHHHHHhhcCeEEEEEEec Q lcl|NC_011308. 66 -ASNIKISHGFFAELVDQKTQYLLANGIDVKPTDHD---DQKLCYLIEEYYNE-EFQSAIQELVEGSTIKGYEGIFARTT 140 (530) Q Consensus 66 -~~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~---de~~~~~l~~~~~~-~~~~~~~e~~~~~~~~G~a~~~~y~d 140 (530) ....=-.+.+++.+|+..+.=++-+++.+++..++ +.+..+.|.+.++. +....+.++.+.+-.+|.++.++-.+ T Consensus 138 ql~alY~~~~larkiVd~pAeDatR~g~~I~~~~d~~e~~~e~~~~ie~~~~rL~v~~~l~eair~~RLyGga~ililv~ 217 (862) T protein:vir:99 138 QACALIAQHWLVDKACSLAGEDAIRNGWHLKSLGEGEEIDEESLEKFKAIDVEFKVKENLIEFNRFKNVFGIRVAIFVVD 217 (862) T ss_pred HHHHHHHhCchhhhhhhhhhHHHhhCCceEeecCcccccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEEec Confidence 00000136789999999999999999999875432 23445555555432 56778888888888999988776554 Q ss_pred CC-C---------------ce-EEEEecccceEEEEcCCCCceeEE-EEEEEEeecccccccceEEEEEEEcCCceEEEe Q lcl|NC_011308. 141 SE-D---------------KL-TFQTVDALQLLPVFDDYGTLQRII-RFYTEQRYSDADNKFNSIGHADVWTDTEVWYYV 202 (530) Q Consensus 141 ~~-g---------------~~-~~~~~~p~~~~~v~d~~~~~~~~~-~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~ 202 (530) .+ + .+ .+.+++|.++.|.--......+.. .||... .|. T Consensus 218 ~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~sp~yGkP~----------------~y~-------- 273 (862) T protein:vir:99 218 SEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMPMLTAESTADPSSQFFYEPE----------------FWI-------- 273 (862) T ss_pred CcCchhhhcCcCcccccccceeEEEEechhhhcccccccccccccccccCCce----------------eee-------- Confidence 22 1 11 245566666554211000000000 011110 000 Q ss_pred ecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEe-eCCcCCCCcHHHHHHHH Q lcl|NC_011308. 203 QKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDIL-YNNKLGISDIKKVKSII 281 (530) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~-~nn~~~~sd~e~v~~li 281 (530) ..+ . ..|.+.+..-.. ..+|-+.- .++=.|.|.++.+.+.+ T Consensus 274 I~g---------------~-~IH~SRliif~g----------------------~~vpd~lk~ay~f~G~SvLe~iyd~L 315 (862) T protein:vir:99 274 ISG---------------Q-KYHRSHLIIARG----------------------PQPADILKPTYIFGGIPLVQRIYERV 315 (862) T ss_pred ecC---------------e-eeccceeEEecC----------------------CCchhhhhccCCccCccHHHHHHHHH Confidence 000 0 000010000000 00000000 01124899999999999 Q ss_pred HHHHHHHHHHHHHHHHhccceeeeecCCC-CchhhHHHH------Hhh-CcceecCCCCceeEEEecCCHHHHHHHHHHH Q lcl|NC_011308. 282 DDYDLMNCFLSNNLQDMAEAIYVVRGGTN-SPVDEIKKN------IQS-KKIIQTKGEGGLDIQTVDIPYEARKAKMDID 353 (530) Q Consensus 282 Da~~~~~S~~~n~~~~~~~~~lvl~g~~~-~~~~~~~~~------~~~-~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L 353 (530) .+|+.+....+..+..+....+.+.+... .+.+.+... .+. .+++.++.+.+++.+ +.+.+.+...++.. T Consensus 316 ~~~d~t~~saa~Ll~ka~l~v~ktd~l~~l~~ed~l~~r~~~~~~~rdN~Gi~liD~eEe~e~l--s~slSGL~dll~~~ 393 (862) T protein:vir:99 316 YAAERTANEAPLLAMNKRTTAIHTDTAKAIANEDKFIQRLMFWVRYRDNHAVKVLGTDETMEQF--DTSLADFDAVIMGQ 393 (862) T ss_pred HHHHHHHHHHHHHHHHhccceeechhHhhhccHHHHHHHHHHHHhccCcceeEEecCCCceeEE--ecccCChHHHHHHH Confidence 99999999999999988888887776532 122222221 112 245566666555544 46677888899999 Q ss_pred HHHHHHHhcccCCCccc---cc-CCcHH-HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEE Q lcl|NC_011308. 354 ELNIYRSGMGFNSSAVG---DG-NATNV-VIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFD 428 (530) Q Consensus 354 ~~~I~~~s~~p~~~~~~---~g-n~SGv-Aik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~ 428 (530) .+.|-..+.+|-.--.+ -| |+||. .++.-|..+.. ..+..++..|++++.++..-+ + . ..+++|. T Consensus 394 ~q~IAaas~IP~tiLfGqspaGlnATGE~D~~nYyD~I~s---~QE~~L~P~LerL~~li~~~l---g-~---~~d~~ie 463 (862) T protein:vir:99 394 YQLVASIAKTPATKLLGTAPKGFNSTGEFETISYHEELES---IQEHVYMPFLQRHYLISRLSL---G-I---QHEIDVV 463 (862) T ss_pred HHHHHhhhCCCceeecccCcccccCchHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhc---C-C---CCcceEE Confidence 99999999999532111 13 46665 44333332222 235678888887766553221 2 1 2468999 Q ss_pred eCCCCCCCHHHHHHH-------HHHHHhcCCCcHHHHHHhC--------CCCCCHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|NC_011308. 429 IEPYILANELDLAMI-------DKTEAETNQIQINNLLAIA--------PRIGDEETLKAICDTLDLDYEDVVKALEDQE 493 (530) Q Consensus 429 f~~~~P~n~~e~a~~-------~~~~~~~g~iS~et~l~~~--------~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~ 493 (530) |++=..-+++|.|++ .+++..+|++|.+.++..| +.+++.+.|.. .....+.+....... T Consensus 464 FnpL~~~sekEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d-----~~~~~e~~~~~e~~g 538 (862) T protein:vir:99 464 MEPVASMTAQQQADLNKTKAEGGKVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEET-----PGASPENLAAYQKAG 538 (862) T ss_pred eCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCcCCCCCCccccccc-----CCCCcccccccccCC Confidence 998888888887766 4567889999998888753 22333322200 000001111111000 Q ss_pred c-ccCCc--cccCCCC-------CC------CCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 494 V-EELEP--TVTPIID-------PL------TIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 494 ~-~~~~~--~~~~~~~-------~~------~~~~~~~~~~~~~~~~~~~~~~ 530 (530) . ....+ +.++... .. ...+......++++|...|... T Consensus 539 ~a~~~ap~de~~aga~~~~~e~d~~~~p~~~~~~~g~~~~~t~~~~a~~p~~~ 591 (862) T protein:vir:99 539 AAQETASAKETQAGAAVTTAEGDQPNVQMVPSMKPGQMVGPEVGITAPMPEDD 591 (862) T ss_pred cccccccccccccccCCccccCCcccccccCCCCCCCccccccccccCCCccc Confidence 0 00000 0000000 00 0011112223344555444433 No 103 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.21 E-value=3.1e-10 Score=72.69 Aligned_cols=508 Identities=11% Similarity=0.026 Sum_probs=209.5 Q ss_pred CcccHHHHHHHHHHHHHH-----hhhHHHHHHHHHHhc--ccc---hhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 9 APDRLGTILSTKIDEYIR-----SQNVSLARVGQRYYN--QDN---DIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 9 ~~~~~~~~i~~~i~~~~~-----~~~~~~~~~~~~YY~--g~~---~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) --+.+.+++.++...|.. +.-+.....-.+||. |.| .+...... ...-..+| .+.+|..+. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~-------~~q~~grP--~~~~N~i~~ 71 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKL-------DEQFEKYP--KFEINKVAT 71 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHH-------hhhhcCCC--ceEEcchHH Confidence 112233455555554432 122333444555664 655 11000000 00011223 588999999 Q ss_pred HHhhhhhhhcccceeeec--CC-cchHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCeEEEEEEec---CC----- Q lcl|NC_011308. 79 LVDQKTQYLLANGIDVKP--TD-HDDQKLCYLIEEYYN-----EEFQSAIQELVEGSTIKGYEGIFARTT---SE----- 142 (530) Q Consensus 79 Ivd~~~~yl~G~pv~~~~--~~-~~de~~~~~l~~~~~-----~~~~~~~~e~~~~~~~~G~a~~~~y~d---~~----- 142 (530) +|+..+|+-..+.+.+.. .+ .++.+..+.|+.++. ++.+.....+..++.++|.+|.-++.| +. T Consensus 72 ~v~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~~ 151 (708) T protein:vir:10 72 ELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDD 151 (708) T ss_pred HHHHHHHHHHhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccccccCCCCC Confidence 999999999998887654 32 335677777777653 456677788889999999999766543 21 Q ss_pred -CceEEE-Eecc-cceEEEEcCCC------C-ceeEEEEEEEEe---------------------ecccccccceEEEEE Q lcl|NC_011308. 143 -DKLTFQ-TVDA-LQLLPVFDDYG------T-LQRIIRFYTEQR---------------------YSDADNKFNSIGHAD 191 (530) Q Consensus 143 -g~~~~~-~~~p-~~~~~v~d~~~------~-~~~~~~~y~~~~---------------------~~~~~~~~~~~~~~e 191 (530) .++++. +.+| ..+| ||... + +-.+.+.|.... +...+...+.+..++ T Consensus 152 ~~~i~i~~~~~p~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~v~e 229 (708) T protein:vir:10 152 RQRIAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAK 229 (708) T ss_pred ccccceEEeecchhhcc--cCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceEEEE Confidence 122222 2334 3333 33211 1 111111110000 000011123344455 Q ss_pred EEcCCceEEEe--ec--CCcccchhhcccc---ccccccceeeee--ecccccceecccccccccccccccccCCccceE Q lcl|NC_011308. 192 VWTDTEVWYYV--QK--DEGRSDEYVLDTT---VNPNPSQHVLAV--ADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFD 262 (530) Q Consensus 192 vyt~~~~~~y~--~~--~~~~~~~~~~~~~---~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv 262 (530) +|......... .. .++....+..+.. ...........+ .......+...............+.+++++|+| T Consensus 230 y~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p~~~fP~v 309 (708) T protein:vir:10 230 YYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLI 309 (708) T ss_pred eeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCCCCceeeE Confidence 55433222211 11 1111111000000 000000000000 000000000000011112234566778888888 Q ss_pred EeeCCc---C----CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCc-hhhHHHHHhhCcce-e----cC Q lcl|NC_011308. 263 ILYNNK---L----GISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSP-VDEIKKNIQSKKII-Q----TK 329 (530) Q Consensus 263 ~~~nn~---~----~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~-~~~~~~~~~~~~~i-~----~~ 329 (530) ||+... . ..|.+-.+++.++.||+..|..+..+......+.++-....+. ..+..........+ . .+ T Consensus 310 P~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 389 (708) T protein:vir:10 310 PVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRD 389 (708) T ss_pred EEeeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhhhccccccc Confidence 875422 2 2567788999999999999999988876665544432111111 11111100000000 0 01 Q ss_pred CCC-------ceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 330 GEG-------GLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLR 402 (530) Q Consensus 330 ~~~-------~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~ 402 (530) ..| ....+....-..+....+.....+|-.+|.+-+.....-||+||+||..+-..........-..++.+.+ T Consensus 390 ~~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~sn~SG~aI~~rq~qg~~~l~~~~Dnl~~~~~ 469 (708) T protein:vir:10 390 KSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLK 469 (708) T ss_pred cccccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 1223333334556677788888888888876543322336789999999877777777766677777777 Q ss_pred HHHHHHHHHHHh----------cCC-Cc----------ccc---------------ceeeEEeCCCCCCCHHHHHHHHHH Q lcl|NC_011308. 403 WTADLVVEDIRR----------RGL-GD----------YSS---------------TDIKFDIEPYILANELDLAMIDKT 446 (530) Q Consensus 403 ~~~~~i~~~l~~----------~~~-~~----------~d~---------------~~i~i~f~~~~P~n~~e~a~~~~~ 446 (530) +..+++++++.. .+. +. .|. .+|.+.=.+..|.-..+.++.++. T Consensus 470 ~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~q 549 (708) T protein:vir:10 470 RAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTN 549 (708) T ss_pred HHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHH Confidence 766666665432 110 00 000 022233233333333333333332 Q ss_pred HHhc-CCCcHHH------HHHhCCCCCCHHHHHHHHHHHHHH-------------HHHHHHhhhccccccCCccccCCC- Q lcl|NC_011308. 447 EAET-NQIQINN------LLAIAPRIGDEETLKAICDTLDLD-------------YEDVVKALEDQEVEELEPTVTPII- 505 (530) Q Consensus 447 ~~~~-g~iS~et------~l~~~~~vdd~~~e~~~~e~e~~e-------------~~~~~~~~~~~~~~~~~~~~~~~~- 505 (530) +... +.....+ +++.+ -+.+.++..+++.+..-. ....+.....+..+-...+..... T Consensus 550 ll~~~~p~~~~~~~~~~~~l~~~-D~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~~~~e~qa~~~ 628 (708) T protein:vir:10 550 VLSSMLPTDPMRPAIQGIILDNI-DGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMV 628 (708) T ss_pred HHHhcCCCchhhHHHHHHHHHhc-CCcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222 2111112 22222 222333333333321100 000000000000000000000000 Q ss_pred CCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 506 DPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 506 ~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) ...++ -++-.....+.... =++. T Consensus 629 ~~qAe-~~ka~a~a~~~~~~-a~q~ 651 (708) T protein:vir:10 629 AAQAE-AQKATNETAQTQIK-AFTA 651 (708) T ss_pred HHHHH-HHHHHHHHHHHHHH-HHHH Confidence 00000 00000000000000 0000 No 104 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.16 E-value=1e-09 Score=69.87 Aligned_cols=439 Identities=9% Similarity=-0.032 Sum_probs=218.4 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcc----------eeecCchhhHH Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNI----------KISHGFFAELV 80 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~----------ki~~n~~k~Iv 80 (530) |.+-+-+...+.= ...-++...+...+-|.+-.. .|........ ........++. =...++++-.| T Consensus 1 mn~~dr~i~~~sP-~~~~~R~~ar~~~~~y~aa~~--~r~~~~~~~~-~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av 76 (502) T protein:vir:79 1 MAILDDVIGVFSP-GWKAARLRSRAVIQAYEAVKT--TRTHKARREN-RTADQLSQYGAVSLREQARYLDNNHDLVIGVF 76 (502) T ss_pred CchHhhHHhhcCh-HHHHHHHhhHHHHhhccccCc--ccccCCCCCC-CChHHHHHHHHHHHHHHHHHHHhcChHHHHHH Confidence 6544433333321 111111222233344666532 1111110000 00000001110 11467899999 Q ss_pred hhhhhhhccc-ceeeecCC-----cchHHHHHHHHHHhh----c-------cHHHHHHHHHHHHhhcCeEEEEEEecCCC Q lcl|NC_011308. 81 DQKTQYLLAN-GIDVKPTD-----HDDQKLCYLIEEYYN----E-------EFQSAIQELVEGSTIKGYEGIFARTTSED 143 (530) Q Consensus 81 d~~~~yl~G~-pv~~~~~~-----~~de~~~~~l~~~~~----~-------~~~~~~~e~~~~~~~~G~a~~~~y~d~~g 143 (530) +..+.+++|. .+++++.. ..++++.+.|...+. + +|...-.-+.+.....|.+|..+.++..+ T Consensus 77 ~~~~~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~ 156 (502) T protein:vir:79 77 DKLEERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRIN 156 (502) T ss_pred HHHHHhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccC Confidence 9999999996 55544321 123445555444442 1 35555555677788999999887776543 Q ss_pred c--------eEEEEecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhcc Q lcl|NC_011308. 144 K--------LTFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLD 215 (530) Q Consensus 144 ~--------~~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~ 215 (530) . +++..++|..+---+++.+.....|. + +..+..+ .+.++.. +. +.. T Consensus 157 ~~~~g~~~~l~lq~iepd~l~~~~~~~~~i~~GVe------~---d~~Gr~~-aY~i~~~--------hP-gd~------ 211 (502) T protein:vir:79 157 SLTPSAGVHFWLEALEPDFIPMTSDESNRLNQGVF------V---DDWGRPE-KYLVYKS--------RP-VSG------ 211 (502) T ss_pred ccCCCcccceEEEEecchhcCCCCCCCCeeEeeeE------E---CCCCceE-EEEEeec--------CC-CCC------ Confidence 2 68889999876211222111111111 0 0011111 1111111 00 000 Q ss_pred ccccccccceeeeeecccccceecccccccccccccccccCCccc---eEEeeCC-----cCCCCcHHHHHHHHHHHHHH Q lcl|NC_011308. 216 TTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFP---FDILYNN-----KLGISDIKKVKSIIDDYDLM 287 (530) Q Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP---iv~~~nn-----~~~~sd~e~v~~liDa~~~~ 287 (530) ....+.+|| |+|+... .-|.|.|..++..+..++.. T Consensus 212 ------------------------------------~~~~~~rvpA~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~ 255 (502) T protein:vir:79 212 ------------------------------------RQMETKEVDAERMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEY 255 (502) T ss_pred ------------------------------------cccceeEechhheEEeecccCCccccCCchHHHHHHHHHHHhHH Confidence 001122344 5555443 35999999988887776655 Q ss_pred HHHHHHHHHHhccceeeeecCCC---------CchhhHHHHHhhCccee-cCCCCceeEEEecCCHHHHHHHHHHHHHHH Q lcl|NC_011308. 288 NCFLSNNLQDMAEAIYVVRGGTN---------SPVDEIKKNIQSKKIIQ-TKGEGGLDIQTVDIPYEARKAKMDIDELNI 357 (530) Q Consensus 288 ~S~~~n~~~~~~~~~lvl~g~~~---------~~~~~~~~~~~~~~~i~-~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I 357 (530) ..-..-...-.+.--.+++.... +..+.....+..+.++. +..|.++++++.+.+...+..++..+.+.| T Consensus 256 ~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~i 335 (502) T protein:vir:79 256 EDSELTAARIAAALGMYIRKGDGQSYEPDGNGSKENERELTIQPGIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQLRAV 335 (502) T ss_pred HHHHHHHHHHhhhheeeeecCCCcccccccCCCCCccccccccCCccccccCCCceeeeeCCCCCCCCHHHHHHHHHHHH Confidence 54333333222222233442111 11111122344444554 788889999999888889999999999999 Q ss_pred HHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCcc----c-cceeeEEe Q lcl|NC_011308. 358 YRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTL-RWTADLVVEDIRRRGLGDY----S-STDIKFDI 429 (530) Q Consensus 358 ~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l-~~~~~~i~~~l~~~~~~~~----d-~~~i~i~f 429 (530) -.-..+|- ++.. ++ .|..+++.-+...-..+...+..|...+ +.+++..+...-..|.... + ...+.+.| T Consensus 336 aaglGi~ye~lt~D-~s-~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W 413 (502) T protein:vir:79 336 AAGSRLSFSSTARN-YN-GTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVY 413 (502) T ss_pred HhhcCCCHHHHhcc-cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceee Confidence 99888883 3222 33 3667777777777777777666666544 3344433332222232110 1 12235555 Q ss_pred CC-CCC-CCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCC Q lcl|NC_011308. 430 EP-YIL-ANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDP 507 (530) Q Consensus 430 ~~-~~P-~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (530) .. ..+ .|-.-.++.......+|+.|.+.++...+. |+++.++++.+|.+..++.--.+.. .+-..+.. . T Consensus 414 ~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~G~--D~~~v~~q~a~e~~~~~~~Gl~~~~------~~~~~~~~-~ 484 (502) T protein:vir:79 414 SGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAGGR--NPDDVKRRRKAEIDENRKLDLVFDT------DPASDKGG-S 484 (502) T ss_pred ecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCC--CHHHHHHHHHHHHHHHHHcCCCCCC------CCCCCCCC-C Confidence 32 222 466666677788999999999999999875 8998888888776665443211111 11111000 0 Q ss_pred CCCCCccCcCCCCcccccccCC Q lcl|NC_011308. 508 LTIEPQPEPLNIDPVIEEEPVQ 529 (530) Q Consensus 508 ~~~~~~~~~~~~~~~~~~~~~~ 529 (530) ....+.++|..+ .+++.+ T Consensus 485 ~~~~~~~e~~~~----~~~~e~ 502 (502) T protein:vir:79 485 SAATKRQEPQHT----DDQSEE 502 (502) T ss_pred CCCCCCCCCCCC----CCCCCC Confidence 001111111111 111111 No 105 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=99.13 E-value=1.4e-09 Score=69.08 Aligned_cols=480 Identities=10% Similarity=0.040 Sum_probs=203.2 Q ss_pred CcccHHHHHHHHHHHHHHhh-----hHHHHHHHHHHhc--ccc---hhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 9 APDRLGTILSTKIDEYIRSQ-----NVSLARVGQRYYN--QDN---DIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 9 ~~~~~~~~i~~~i~~~~~~~-----~~~~~~~~~~YY~--g~~---~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) =-+.+.+++.++...|.... -+..+..-.+||. |.+ .+... .+. ..+...+| .+.+|..+. T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~-~~~------~l~~~~~P--~~~~N~i~~ 71 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAG-SEL------GKHFEKYP--KFEINKIST 71 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHH-HHH------HHhhCCCC--eEEEccHHH Confidence 11233455555555554321 2234555667775 544 11110 000 00112234 578899999 Q ss_pred HHhhhhhhhcccceeeec--C-CcchHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCeEEEEEEecC----C---- Q lcl|NC_011308. 79 LVDQKTQYLLANGIDVKP--T-DHDDQKLCYLIEEYYN-----EEFQSAIQELVEGSTIKGYEGIFARTTS----E---- 142 (530) Q Consensus 79 Ivd~~~~yl~G~pv~~~~--~-~~~de~~~~~l~~~~~-----~~~~~~~~e~~~~~~~~G~a~~~~y~d~----~---- 142 (530) +|+..+|+---+.+.+.. . ..++++..+.|+.++. ++.......+..++.++|.+|.-+++|- + T Consensus 72 ~v~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~~ 151 (720) T protein:vir:35 72 ELNRIISEYRHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMDE 151 (720) T ss_pred HHHHHHhHHHhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccCCCCcc Confidence 999999999888877554 3 2236677777777653 3566677888889999999998776541 1 Q ss_pred -CceEEEEe--cccceEEEEcCCC------C-ceeEEEEEEE--------------------EeecccccccceEEEEEE Q lcl|NC_011308. 143 -DKLTFQTV--DALQLLPVFDDYG------T-LQRIIRFYTE--------------------QRYSDADNKFNSIGHADV 192 (530) Q Consensus 143 -g~~~~~~~--~p~~~~~v~d~~~------~-~~~~~~~y~~--------------------~~~~~~~~~~~~~~~~ev 192 (530) +.+++..+ |+..+| ||... + .-++.+.|.. ..+.....+...+..+|. T Consensus 152 ~~~i~i~~v~~~~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~~v~i~E~ 229 (720) T protein:vir:35 152 RQRICLEPIYDPARSVW--FDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVDVVYIAKY 229 (720) T ss_pred cceeeEecccCchhhee--ecccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccCCCceEEEEe Confidence 12333322 233443 33211 0 1111111100 000111112334455555 Q ss_pred EcCCceEE----EeecCCcccchhhcccc---ccccccceeeeeec--ccccceecccccccccccccccccCCccceEE Q lcl|NC_011308. 193 WTDTEVWY----YVQKDEGRSDEYVLDTT---VNPNPSQHVLAVAD--GVDEAILDEGVEEHEGRQVLGRSYKSRFPFDI 263 (530) Q Consensus 193 yt~~~~~~----y~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~ 263 (530) |....... +....++.......+.. ...........+.. .....+...............+.+++++|+|| T Consensus 230 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP 309 (720) T protein:vir:35 230 YEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIP 309 (720) T ss_pred eEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEE Confidence 54433221 11111111111100000 00000000000000 00000000000011122234556677788888 Q ss_pred eeCCc---CC----CCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhh---Cccee------ Q lcl|NC_011308. 264 LYNNK---LG----ISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQS---KKIIQ------ 327 (530) Q Consensus 264 ~~nn~---~~----~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~---~~~i~------ 327 (530) |+... .| .|.+-.+++.++.||+..|.+++.+.. .+...-.|.... ...+...... .+... T Consensus 310 ~~g~r~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~--~~~~~~~~a~~~-~~~~~~~~a~~~~~~~~~l~~~~~ 386 (720) T protein:vir:35 310 VYGKRWFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQ--DTGSIPIVGKSQ-IKTLEKYWANRNKNRPAFLPLNEI 386 (720) T ss_pred EEeeeeccCCCcccceeeecchhHHHHHHHHHHHHHHHHHc--CCccccccCcch-HHHHHHHhhccccccccccccccc Confidence 76421 22 467788999999999999999999854 444444443322 1222221111 10000 Q ss_pred cCCC-------CceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHH Q lcl|NC_011308. 328 TKGE-------GGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKT 400 (530) Q Consensus 328 ~~~~-------~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~ 400 (530) ...+ +.+.+.....-..+.-..+..-...|-..|.+-+-.....||+||+||..+-..........-..++.+ T Consensus 387 ~~~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~ 466 (720) T protein:vir:35 387 VDKQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPSNIAKETVNHLMHRSDMSSFIYLDNMAKS 466 (720) T ss_pred cccCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0111 233444444444556667777788888887665433333467999999997666666655555666666 Q ss_pred HHHHHHHHHHHHHhcC-------CCcccc---------------------------c--eeeEEeCCCCCCCHHHHHHHH Q lcl|NC_011308. 401 LRWTADLVVEDIRRRG-------LGDYSS---------------------------T--DIKFDIEPYILANELDLAMID 444 (530) Q Consensus 401 l~~~~~~i~~~l~~~~-------~~~~d~---------------------------~--~i~i~f~~~~P~n~~e~a~~~ 444 (530) .++..+++++++..-. +..-|. . +|.+.=.+..+.-..+.++.+ T Consensus 467 ~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m 546 (720) T protein:vir:35 467 LKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVL 546 (720) T ss_pred HHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHHH Confidence 6666665555543211 000000 0 112222222222122222222 Q ss_pred HHHHhcCCCcHH---------HHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccC Q lcl|NC_011308. 445 KTEAETNQIQIN---------NLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPE 515 (530) Q Consensus 445 ~~~~~~g~iS~e---------t~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (530) +.+. +.++.+ .+++.+++ ...++..+++.+... .... ..+ .....++. T Consensus 547 ~qll--~~~~p~~~~~~~~~~~ile~~d~-p~~~e~~erirk~~~-------~~~~---------~~~----~~~e~qq~ 603 (720) T protein:vir:35 547 TNLL--AGMLPQDPMRQVLQGIILDNMEG-EGLDEFKEYNRKQLL-------TQGV---------VKP----RNTEEEQM 603 (720) T ss_pred HHHH--HhcCCCchhHHHHHHHHHHhcCc-hhHHHHHHHHHhhcc-------hhcc---------cCc----cChhHHHH Confidence 2211 112111 11222211 111222222211100 0000 000 00000000 Q ss_pred cCCCCcccccccCCC Q lcl|NC_011308. 516 PLNIDPVIEEEPVQE 530 (530) Q Consensus 516 ~~~~~~~~~~~~~~~ 530 (530) .. ...+..++ T Consensus 604 ~a-----~~qq~~qq 613 (720) T protein:vir:35 604 VA-----QMIQQAQQ 613 (720) T ss_pred HH-----HHHHHHHh Confidence 00 00001111 No 106 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.09 E-value=6.6e-10 Score=70.92 Aligned_cols=383 Identities=13% Similarity=0.088 Sum_probs=177.5 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |+- ..-+..-+.|-++- ...++......... ....=-.+.+++.+|+..+.-++-+ T Consensus 1 ~~~-------------------~D~~~n~~~gg~~~----~~~~~~~~~~~~~~-l~a~Y~~~~l~~~~Vd~~aed~~r~ 56 (422) T protein:vir:10 1 MVK-------------------TDSYANIFLGGSDG----SEIYGSLQNQAPTI-LASLYADNALVRRIIDTIPETALAA 56 (422) T ss_pred Ccc-------------------chhhHHHHcCCCCC----ccccCcccccCHHH-HHHHHHhChhhHHHHhhhhHHHhcC Confidence 110 11111112222210 00000000000000 0000113678899999999999999 Q ss_pred ceeeecCCcchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecC----------CCce-EEEEecccceEEE- Q lcl|NC_011308. 91 GIDVKPTDHDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTS----------EDKL-TFQTVDALQLLPV- 158 (530) Q Consensus 91 pv~~~~~~~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~----------~g~~-~~~~~~p~~~~~v- 158 (530) .+.+++.+. .+.....++++ +....+.++.+.+-.+|.|+.++-.+. .|.+ .+.+++|.++.|. T Consensus 57 g~~i~~~~~-~~~~~~~~~~l---~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~ 132 (422) T protein:vir:10 57 GFHIDGIDD-EPAFWSRWDDL---EMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQT 132 (422) T ss_pred CccccCCCH-HHHHHHHHHHh---hHHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchh Confidence 999976542 23344444443 567788899999999999998887632 2223 2445566555442 Q ss_pred EcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccccee Q lcl|NC_011308. 159 FDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAIL 238 (530) Q Consensus 159 ~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (530) ++. .-..+ .|+.. ..|...+.+.... .. .|.+.+..-... T Consensus 133 ~~~-dp~s~--~fg~P------------------------~~y~v~~~~~~~~----~~------iH~SRli~~~g~--- 172 (422) T protein:vir:10 133 REE-NPRNA--RFGEP------------------------LTYRITTNESDMF----YD------VHYSRIHIIDGE--- 172 (422) T ss_pred ccc-Ccccc--ccCcc------------------------eEEEEecCCCCcc----ee------eccceeEEeCCC--- Confidence 111 00000 00000 0010000000000 00 000000000000 Q ss_pred cccccccccccccccccCCccceE-EeeCCcCCCCcHHH-HHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-----CC Q lcl|NC_011308. 239 DEGVEEHEGRQVLGRSYKSRFPFD-ILYNNKLGISDIKK-VKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT-----NS 311 (530) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~iPiv-~~~nn~~~~sd~e~-v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~-----~~ 311 (530) .+|-+ -..++-.|.|.++. +.+-+.+++.+.-..+..+..+....+.++|.. +. T Consensus 173 -------------------~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~ 233 (422) T protein:vir:10 173 -------------------RIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKRKQQAVWKAKGLAELCDDSE 233 (422) T ss_pred -------------------CchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHhcCCcc Confidence 01110 11123347777876 568888899999988998888888888777621 11 Q ss_pred chhhHHHH------Hh-hCcceecC-CCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC-----CcccccCCcHHH Q lcl|NC_011308. 312 PVDEIKKN------IQ-SKKIIQTK-GEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS-----SAVGDGNATNVV 378 (530) Q Consensus 312 ~~~~~~~~------~~-~~~~i~~~-~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~-----~~~~~gn~SGvA 378 (530) .....+.. .+ ..+.+.+. ++.+++.+ +.+.+.....++...+.|...+.+|-. +..++ |+||.. T Consensus 234 ~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~--~~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~Gl-natgd~ 310 (422) T protein:vir:10 234 GFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVL--NSDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGV-SSSQNT 310 (422) T ss_pred chHHHHHHHHHHHHhcCCccceeEecCCcceEEE--ecccCChHHHHHHHHHHHHhhhCCCeeeeccCCcccc-cccchH Confidence 11111111 11 12223333 33445544 566668889999999999999999953 22222 345554 Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHH-------HhcC Q lcl|NC_011308. 379 IKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTE-------AETN 451 (530) Q Consensus 379 ik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~-------~~~g 451 (530) =..-|..... ...+..++..|.+++.+|.. ..++++.|++=..-+++|.|++..+. .+.| T Consensus 311 d~~~yyd~i~--~~Qe~~l~p~l~~l~~~i~~-----------s~~~~~~f~pL~~~sekekaei~~~~a~a~~~~~~~g 377 (422) T protein:vir:10 311 ALETFHKLVD--RKRNAELLPILEFLIPFIVN-----------AEEWSVEFNPLAQESSKDKAEILEKNVNSIAALIAAG 377 (422) T ss_pred HHHHHHHHHH--HHHHHHHHHHHHHHHHHhcc-----------cCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcC Confidence 4333333211 23456788888888877632 13688999988888888887764433 3333 Q ss_pred CCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCC Q lcl|NC_011308. 452 QIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLN 518 (530) Q Consensus 452 ~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (530) ++|.+.+...| ... . ..........+...++.+.. +.+.+++|.+ T Consensus 378 ~i~~~e~r~~L-------------~~~---~--~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~d 422 (422) T protein:vir:10 378 AMDIDEARDTL-------------RTI---A--PEVKINDGSVETEVTISETS----NDPLEVPTDD 422 (422) T ss_pred CCCHHHHHHHh-------------hhh---c--ccccCCCCCCccccchhhcC----CCCCCCCCCC Confidence 33333333222 110 0 00000000000000000000 0011111111 No 107 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=99.06 E-value=3.2e-09 Score=67.15 Aligned_cols=488 Identities=10% Similarity=0.015 Sum_probs=202.3 Q ss_pred CcccHHHHHHHHHHHHHH-----hhhHHHHHHH--HHHhcccc---hhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 9 APDRLGTILSTKIDEYIR-----SQNVSLARVG--QRYYNQDN---DIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 9 ~~~~~~~~i~~~i~~~~~-----~~~~~~~~~~--~~YY~g~~---~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) =-+.+.+++.++...|.. +..+.....- .+||.|.| .+...... .+ .-..+| .+.+|..+. T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~----~~---q~~~rP--~~~~N~i~~ 71 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKL----DE---QFEKYP--KFEINKVAT 71 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHh----hh---hhcCCC--ceEEcchHH Confidence 112233445555444432 1112222222 26899876 11111000 00 001223 578999999 Q ss_pred HHhhhhhhhcccceeeec--CC-cchHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCeEEEEEE---ecCC----- Q lcl|NC_011308. 79 LVDQKTQYLLANGIDVKP--TD-HDDQKLCYLIEEYYN-----EEFQSAIQELVEGSTIKGYEGIFAR---TTSE----- 142 (530) Q Consensus 79 Ivd~~~~yl~G~pv~~~~--~~-~~de~~~~~l~~~~~-----~~~~~~~~e~~~~~~~~G~a~~~~y---~d~~----- 142 (530) +|+.-+|+---+.+.+.. .+ .++.+..+.|+.+++ ++.......+..++.++|.+|.-++ .++. T Consensus 72 ~i~~v~g~e~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~~~~ 151 (708) T protein:vir:17 72 ELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDD 151 (708) T ss_pred HHHHHHhhHhhCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeecccccCCCCCC Confidence 999999998888776543 32 335667777777653 4566677788889999999986553 3332 Q ss_pred -CceEEEE--ecccceEEEEcCCC---Cc---e-eEEEE----------E-----------EEEeecccccccceEEEEE Q lcl|NC_011308. 143 -DKLTFQT--VDALQLLPVFDDYG---TL---Q-RIIRF----------Y-----------TEQRYSDADNKFNSIGHAD 191 (530) Q Consensus 143 -g~~~~~~--~~p~~~~~v~d~~~---~~---~-~~~~~----------y-----------~~~~~~~~~~~~~~~~~~e 191 (530) ..+.+.. .+|..+| ||... ++ . .+.+. | ....+...+...+.+..++ T Consensus 152 ~~~i~i~~~~~~~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~d~vrv~e 229 (708) T protein:vir:17 152 RQRIAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDADVIYIAK 229 (708) T ss_pred ccccceEeeccchhhee--cCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCCCeEEEEE Confidence 2333333 2344554 44321 10 0 01000 0 0001111122234455555 Q ss_pred EEcCCceE--EEeecC--Ccccchhhccc-------cccccccceeeeeecccccceecccccccccccccccccCCccc Q lcl|NC_011308. 192 VWTDTEVW--YYVQKD--EGRSDEYVLDT-------TVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFP 260 (530) Q Consensus 192 vyt~~~~~--~y~~~~--~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP 260 (530) +|...... .+...+ ++....+.... .............. ....+...............+.+++++| T Consensus 230 ~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~--~r~~v~~~~~~g~~~l~~~~~~p~~~fP 307 (708) T protein:vir:17 230 YYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSV--KRRRVYVSVVDGDGFLEKPRRIPGEHIP 307 (708) T ss_pred EEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeee--eEEEEEEEeecccccccCCCCCCCCccc Confidence 55432221 111111 11110000000 00000000000000 0000000000111122344566777888 Q ss_pred eEEeeCC---cCC----CCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeee-----cCCCCc----hhh-HHHHHh-- Q lcl|NC_011308. 261 FDILYNN---KLG----ISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVR-----GGTNSP----VDE-IKKNIQ-- 321 (530) Q Consensus 261 iv~~~nn---~~~----~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~-----g~~~~~----~~~-~~~~~~-- 321 (530) +|||... ..| .|.+-..++.++.||+..|.+...+.......+++. |....- .+. ...... T Consensus 308 ~vP~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~ 387 (708) T protein:vir:17 308 LIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREV 387 (708) T ss_pred eEEEecccccccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhhhhcc Confidence 8877542 223 355668999999999999999988877665544331 211100 000 000000 Q ss_pred hCcceecCCCCc-eeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHH Q lcl|NC_011308. 322 SKKIIQTKGEGG-LDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKT 400 (530) Q Consensus 322 ~~~~i~~~~~~~-~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~ 400 (530) ...+-.+..++. ...+..+.-..+....++.....|-..|.+-+......||+||+|+..+-..........-..++.+ T Consensus 388 ~~~~g~v~~~a~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~ 467 (708) T protein:vir:17 388 RDKYGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKS 467 (708) T ss_pred CCcccccccccCCcccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 011111112211 1222222333566677888999999988766544434468899999988777777666666666777 Q ss_pred HHHHHHHHHHHHHhcC-------CCccc----c------------------ce-----eeEEeC--CCCCCCHHHHHHHH Q lcl|NC_011308. 401 LRWTADLVVEDIRRRG-------LGDYS----S------------------TD-----IKFDIE--PYILANELDLAMID 444 (530) Q Consensus 401 l~~~~~~i~~~l~~~~-------~~~~d----~------------------~~-----i~i~f~--~~~P~n~~e~a~~~ 444 (530) .++..+++++++.... +..-| . .+ ..+... +..|.=..+..+.+ T Consensus 468 ~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l 547 (708) T protein:vir:17 468 LKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVL 547 (708) T ss_pred HHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchhHHHHHHHHH Confidence 6666666665543211 00000 0 00 111111 11111111222222 Q ss_pred HHHHh-cCCCcHHH------HHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcC Q lcl|NC_011308. 445 KTEAE-TNQIQINN------LLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPL 517 (530) Q Consensus 445 ~~~~~-~g~iS~et------~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 517 (530) +.+.. .+.....+ +++.++ +.+.++..+++.+.... ... .+ +..+..++... T Consensus 548 ~qll~~~~~~~~~~~~~~~l~l~~~D-~p~~~ei~e~ir~~~~~------~~~----------~~----~~~~e~~q~~~ 606 (708) T protein:vir:17 548 TNVLSSMLPADPMRPAIQGIILDNID-GEGLDDFKEYNRNQLLI------SGI----------AK----PRNEKEQQIVQ 606 (708) T ss_pred HHHHHhcCCccchhHHHHHHHHHhcC-CCChHHHHHHHHHHhhc------ccc----------cc----CcchhhHHHHH Confidence 21111 11110011 222221 12222222222211000 000 00 00000000000 Q ss_pred CCCcccc--cccCCC Q lcl|NC_011308. 518 NIDPVIE--EEPVQE 530 (530) Q Consensus 518 ~~~~~~~--~~~~~~ 530 (530) -.++... .+|-+. T Consensus 607 q~qq~~q~q~~~~~~ 621 (708) T protein:vir:17 607 QAQMAAQSQPNPEMV 621 (708) T ss_pred HHHHHHHHHHHHHHH Confidence 0000000 000000 No 108 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=99.03 E-value=3.8e-10 Score=72.25 Aligned_cols=478 Identities=11% Similarity=0.049 Sum_probs=230.7 Q ss_pred CC------cccccCCcccHHHHHHHHHHHHHHhhh-----HHHHHHHHHHhcccchhhhcccccccccccccccccCCcc Q lcl|NC_011308. 1 MT------NTLLTTAPDRLGTILSTKIDEYIRSQN-----VSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNI 69 (530) Q Consensus 1 ~~------~~~~~~~~~~~~~~i~~~i~~~~~~~~-----~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ 69 (530) |. +-++..-+ +.-+++.+.+.+|.+... .+.-+++++|-.... .| ..... .-++ -| T Consensus 1 m~~~~~~~~~~~~~~~-~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~~~---tr---~t~~~----~~~w--~~ 67 (599) T protein:vir:31 1 MSTDIKTLQKMLEGRD-DDRAFIDELVVLFTNMENARAQKDREDKELMDYIDATD---TR---KTSNS----KLPF--KN 67 (599) T ss_pred CccchHHHHHHhhccC-chHHHHHHHHHHHHhhhhhhhhhhcccHHHHHHHhhhc---cc---ccccC----CCCc--cc Confidence 21 22222222 223455555555543222 223367777733222 11 11110 1112 25 Q ss_pred eeecCchhhHHhhhhhhhcccce------eeecCCcc--hHHHHHHHHHHhhc-----cHHHHHHHHHHHHhhcCeEEEE Q lcl|NC_011308. 70 KISHGFFAELVDQKTQYLLANGI------DVKPTDHD--DQKLCYLIEEYYNE-----EFQSAIQELVEGSTIKGYEGIF 136 (530) Q Consensus 70 ki~~n~~k~Ivd~~~~yl~G~pv------~~~~~~~~--de~~~~~l~~~~~~-----~~~~~~~e~~~~~~~~G~a~~~ 136 (530) ++..|..-.+++....|+++--. .|..-+.+ .....+.++.+..+ +|......+..+...+|-|+.. T Consensus 68 s~t~~k~~~~~~~l~a~~~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat 147 (599) T protein:vir:31 68 STTINKLAHLHLMITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAH 147 (599) T ss_pred ccchHHHHHHHHHHHHHHHhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEe Confidence 78888888899999999887532 12222222 34456677777654 5566777788888999988865 Q ss_pred EEec------CCC-------ceEEEEecccceEEEEcCCC---C-ceeEEEEEEEEeec-----------cccccc---- Q lcl|NC_011308. 137 ARTT------SED-------KLTFQTVDALQLLPVFDDYG---T-LQRIIRFYTEQRYS-----------DADNKF---- 184 (530) Q Consensus 137 ~y~d------~~g-------~~~~~~~~p~~~~~v~d~~~---~-~~~~~~~y~~~~~~-----------~~~~~~---- 184 (530) +-+. ++| .+++..++|.++|| |.+. + --.++|-+.....- ...+.. T Consensus 148 ~~~er~~~~~~d~~v~~~~~~P~~ervsP~Di~~--Dp~A~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~ 225 (599) T protein:vir:31 148 TRHVKRMTVTAENQVIKNYSGTVTERLSPSDVFW--DVTADSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLR 225 (599) T ss_pred eeEEEcceeecccccccccccceEEeecccceee--CCCCCCCCcceeeeehhhhHHHHHHHhccCCccccchHHHHHHH Confidence 5422 122 37889999999875 4432 1 21222333210000 000000 Q ss_pred ceEEEEEEEcCCceEEEeecC---------------Ccccchhhcc---ccccccccceeeeeecccccceeccccc-cc Q lcl|NC_011308. 185 NSIGHADVWTDTEVWYYVQKD---------------EGRSDEYVLD---TTVNPNPSQHVLAVADGVDEAILDEGVE-EH 245 (530) Q Consensus 185 ~~~~~~evyt~~~~~~y~~~~---------------~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 245 (530) ....+.--++.+....+...+ .+....+.+. +.....+.. ..+..+... .- T Consensus 226 ~~~~~~~~~~~d~~~~~~g~D~~~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~----------~~~ViTi~g~~~ 295 (599) T protein:vir:31 226 EERRTIREALADGYNGRRKFDSLHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELW----------NNYEITVIDRKI 295 (599) T ss_pred hhccCCCccccchhhhhhhccccccccccchhhhcccchhhhhhhhhhhhcccCCccc----------cceEEEEecCcE Confidence 000111111222221111111 1111111110 001101111 011111111 12 Q ss_pred ccccccccccCCccceEEeeC-----CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHH Q lcl|NC_011308. 246 EGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNI 320 (530) Q Consensus 246 ~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~ 320 (530) ..+....|.+.|..|++.... .-.|.|.+..+.++++.+|.+-....+.++-+.+|+++..|..... + ... T Consensus 296 liR~e~np~~~g~~Pyvv~~~~P~~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p~l~~~~dl~~e--D--~~~ 371 (599) T protein:vir:31 296 IGRKQSKDTWDGSQNLHIAVYEFQKDTLCPIGPLHRLTGMQYKLDKRENFREDLHDRFLHPSLKKVGDVREK--G--MRG 371 (599) T ss_pred EeecccCCCCCCCCCeEEEEeeeeccccCCCCCchhcchHHHHHHHHHHHhhhhhhhhhccccccccccccc--C--ccC Confidence 234555677888888886543 4579999999999999999999999999999999999887752211 1 112 Q ss_pred hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccc--ccCCcHHHHHHHHhhHHHHHHHHHHHHH Q lcl|NC_011308. 321 QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVG--DGNATNVVIKSRYTLLAMKAQKTEIALR 398 (530) Q Consensus 321 ~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~--~gn~SGvAik~~~~~l~~ka~~ke~~f~ 398 (530) ..+.++.+...|++.++..+.+...+.+.+..+....=..|.+|..+... .|..+...++.++..+-....++.+.|. T Consensus 372 ~P~~v~~~~d~~~vq~~~p~s~~~~a~~~is~~e~~mee~sGvp~~~~G~~~ag~~TA~~is~l~naa~~~~~~~vr~~e 451 (599) T protein:vir:31 372 GPNHVFEVEETGDVQYMTPPAEVLQPDNQLSITLQLMEDLSGAPKESIGQRTAGEKTKFEVQLLDQGQNKVFRRKVKKFE 451 (599) T ss_pred CCCcceeecCCCccccccCchhhhhHHHHHHHHHHHHHHhhccchhhcCCcccchhhHHHHHHHHhhhhhhHHHHHHHHH Confidence 35788999999999999999888888888999999888999999765432 2455777777788888777777888888 Q ss_pred HHHHH-HHHHHHHHHHh----cCC-----C--------ccccceeeEEeCCCCCC--C-HHHHHHHHH----HH---Hhc Q lcl|NC_011308. 399 KTLRW-TADLVVEDIRR----RGL-----G--------DYSSTDIKFDIEPYILA--N-ELDLAMIDK----TE---AET 450 (530) Q Consensus 399 ~~l~~-~~~~i~~~l~~----~~~-----~--------~~d~~~i~i~f~~~~P~--n-~~e~a~~~~----~~---~~~ 450 (530) .++-+ +++-+.+.... .+. . +....+++-.|. -+|. + ..+-++..+ .+ .+. T Consensus 452 ~~~lepll~~l~e~~~~f~D~~~tiri~~~e~~~~~f~~i~redl~~~~~-~v~~Ga~~v~ere~~~q~l~~il~~~~~q 530 (599) T protein:vir:31 452 RELLTPVLNDYLEQGRNHLDASDTIKTFNSELGTATFLDITADDLNLNGQ-MVAQGATLFAEKANTLQNLNAILGGPLGA 530 (599) T ss_pred HHHHHHHHHHHHHHHHhhcccccceeeecccccceeeEEeehhhhhCCee-eeechhhHHHHHHHHHHHHHHHhcccCCC Confidence 87643 44433332111 000 0 000011111111 0111 1 112122111 11 011 Q ss_pred C---CCcHHHHHHhCCCCCCH--------HHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCc Q lcl|NC_011308. 451 N---QIQINNLLAIAPRIGDE--------ETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQ 513 (530) Q Consensus 451 g---~iS~et~l~~~~~vdd~--------~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (530) + -+|++.....+-++.+- ...+++.+.+ -.|.+.+-+.....+--.+.+-.|++.+-+ T Consensus 531 ~~~P~~~~k~l~~~l~~~~~l~~~~~~~~~va~~eqq~~-----~~m~Q~~lq~~~~~~~~~~~~~~~~~~~~~ 599 (599) T protein:vir:31 531 ALAPHMSRTKLFNAVEYLGDLDAYGIFTFGIGVQEDQQL-----ARMAQKSTQQTEETALTQEEVGGPTTDTGQ 599 (599) T ss_pred ccchhhHHHHHHHHHHHHHhccccccCCCchhHHHHHHH-----HHHHHHHHHHhHhhhhhhhhcCCCCcccCC Confidence 1 23443333222221111 0011111111 111111111111111111112223333333 No 109 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=98.94 E-value=1.2e-08 Score=64.09 Aligned_cols=396 Identities=12% Similarity=0.100 Sum_probs=182.2 Q ss_pred HHHHHHhhh--HHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhcccceeeecCC Q lcl|NC_011308. 21 IDEYIRSQN--VSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLANGIDVKPTD 98 (530) Q Consensus 21 i~~~~~~~~--~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~ 98 (530) +--|..+++ ......+...+.++-.. .+....... ..... .....=-.+.+++.+|+..+.-++.+++.+++.+ T Consensus 1 ~~~~m~~~~~~~~~~D~~~~~~~~~~g~-~~~~~~~~~-~~~~~--~l~~~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~ 76 (435) T protein:vir:79 1 MGVFMSDKVKAITKEDGYNEIFGSKDGT-FRPNAFYMQ-RAAFK--ALSQFYEEDGMARRIVDVIPEEMVTPGFKVDGVK 76 (435) T ss_pred CCcccccccccchhhcchhhhhcccccc-cccCcccCC-cCCHH--HHHHHHhcCchhhhhhccchHHhhcCCceecCCC Confidence 111221111 11112222222221110 000000000 00000 0000011368899999999999999999997643 Q ss_pred cchHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCC----------Cce-EEEEecccceEEEEcCCCCcee Q lcl|NC_011308. 99 HDDQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSE----------DKL-TFQTVDALQLLPVFDDYGTLQR 167 (530) Q Consensus 99 ~~de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~----------g~~-~~~~~~p~~~~~v~d~~~~~~~ 167 (530) +.+.+...++++ +....+.++.+.+-.+|.|+.++-.... |.+ .+.+++|.++.|-.-+..-..+ T Consensus 77 -~~~~~~~~~~~l---~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i~~~~~~~dp~sp 152 (435) T protein:vir:79 77 -NEKSFKSRWDEL---RLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQITIHERETNARSV 152 (435) T ss_pred -hHHHHHHHHHHh---hHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhccchhhccCCccc Confidence 233344444443 5667888999999999999888875322 222 2344555544331100000000 Q ss_pred EEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccccccc Q lcl|NC_011308. 168 IIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEG 247 (530) Q Consensus 168 ~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (530) .++.+ ..|.....+..... ..|.+.+..-.. T Consensus 153 -----------------------~fg~P---~~y~v~~~~~~~~~----------~iH~SRli~~~g------------- 183 (435) T protein:vir:79 153 -----------------------RYGEP---KLYKISPGGDIPEF----------FVHYSRICIIDG------------- 183 (435) T ss_pred -----------------------ccCcc---eEEEEecCCCCCce----------EEcceeEEEecC------------- Confidence 00000 01111110000000 000000000000 Q ss_pred ccccccccCCccceEE-eeCCcCCCCcH-HHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-----CCchhhHHHH- Q lcl|NC_011308. 248 RQVLGRSYKSRFPFDI-LYNNKLGISDI-KKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT-----NSPVDEIKKN- 319 (530) Q Consensus 248 ~~~~~~~~~~~iPiv~-~~nn~~~~sd~-e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~-----~~~~~~~~~~- 319 (530) ..+|-.. ..++-.|.|.+ +.+.+-+.+++.+....+..+..+....+.++|.. .......... T Consensus 184 ---------~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~ 254 (435) T protein:vir:79 184 ---------ERVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDDEEGRYAARLRL 254 (435) T ss_pred ---------CcchhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcCccchHHHHHHH Confidence 0001110 01222355655 67888889999999999999988888888776631 1111111111 Q ss_pred -----Hh-hCcceecCC-CCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCC---ccccc-CCcHHHHHHHHhhHHH Q lcl|NC_011308. 320 -----IQ-SKKIIQTKG-EGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSS---AVGDG-NATNVVIKSRYTLLAM 388 (530) Q Consensus 320 -----~~-~~~~i~~~~-~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~---~~~~g-n~SGvAik~~~~~l~~ 388 (530) .+ .++.+.+.+ +.+++.+ ..+.......++...+.|...+.+|-.- ....| |+||..=...|..... T Consensus 255 ~~~~~~~~~~~~~~i~~~~e~~e~~--~~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~~yyd~i~ 332 (435) T protein:vir:79 255 AQVDDESGVGKAIGIDATDEEYEVL--NSDVSGVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALETFYKLID 332 (435) T ss_pred HHHHHhcCCCCceeEecCCcceEEE--ecccCCHHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHH Confidence 11 133344433 3445555 4666788999999999999999999532 11113 4667544444443322 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHH-------HhcCCCcHHHHHHh Q lcl|NC_011308. 389 KAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTE-------AETNQIQINNLLAI 461 (530) Q Consensus 389 ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~-------~~~g~iS~et~l~~ 461 (530) ...+..++..|.+++.++.. ..++++.|++=..-+++|.|++..+. .+.|++|.+.+... T Consensus 333 --~~Qe~~l~p~l~~l~~li~~-----------s~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e~r~~ 399 (435) T protein:vir:79 333 --RKRVEDYKPILEFLLPFMIS-----------ETEWSIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQAINLKETRDT 399 (435) T ss_pred --HHHHHHHHHHHHHHHHHhhc-----------CCCCeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHH Confidence 13356788888888777532 13688999999988998887765443 33444443333221 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCc Q lcl|NC_011308. 462 APRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDP 521 (530) Q Consensus 462 ~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (530) +. .......-. .+..++-+...+. +++.+.|.+++. T Consensus 400 -------------L~-------~~~~~~~~~--~~~~~~~~~~~d~--~~~~~~e~g~~~ 435 (435) T protein:vir:79 400 -------------LR-------SICPDLKIM--DNDNIELPEPEDL--DPEPGQEGGLNK 435 (435) T ss_pred -------------HH-------HhccccCCC--CcccccCCccccC--CCCCCCCCCCCC Confidence 11 011111110 1111111110111 111111222222 No 110 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=98.92 E-value=1.4e-08 Score=63.58 Aligned_cols=389 Identities=14% Similarity=0.136 Sum_probs=176.4 Q ss_pred HHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhcccceeeecCCcc Q lcl|NC_011308. 21 IDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLANGIDVKPTDHD 100 (530) Q Consensus 21 i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~ 100 (530) |.-|.. ..+.+..-|.++-..+.. .....+.... ..=-.+.+++.+|+..+.-++.+.+.+++.+ + T Consensus 1 ~~~~~~-------d~~~~~~~~~~~~~~~~~-~~~~~~~~l~-----a~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~-~ 66 (427) T protein:vir:10 1 MKIVKH-------DGYNDIFNGGADGSPKPF-FMSDASYHVG-----SFYNDNATAKRIVDVIPEEMVTAGFKMSGVK-D 66 (427) T ss_pred CCcccc-------chHHHHhhcCCCCcccCc-cccCchHHHH-----HHHHcCchhhhhhccchHHhhcCCccccCcc-H Confidence 111111 111122222211111100 0000000000 0011367899999999999999999997643 2 Q ss_pred hHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCC----------ce-EEEEecccceEEEEcCCCCceeEE Q lcl|NC_011308. 101 DQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSED----------KL-TFQTVDALQLLPVFDDYGTLQRII 169 (530) Q Consensus 101 de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g----------~~-~~~~~~p~~~~~v~d~~~~~~~~~ 169 (530) .+.+...++++ +....+.++.+.+..+|.|+.++-.+... .+ .+.+++|.++-|-.-...-..+ T Consensus 67 ~~~~~~~~~~l---~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~~~~dp~s~-- 141 (427) T protein:vir:10 67 EKEFKSLWDSY---KLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEKRVTNARSP-- 141 (427) T ss_pred HHHHHHHHHHh---hHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccccccCcccc-- Confidence 23333334332 56678889999999999999888765322 22 2344444444331100000000 Q ss_pred EEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccccccccc Q lcl|NC_011308. 170 RFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQ 249 (530) Q Consensus 170 ~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 249 (530) .++.+ ..|...+...... . ..|.+.+..-... T Consensus 142 ---------------------~fg~P---~~y~v~~~~~~~~----~------~iH~SRli~~~g~-------------- 173 (427) T protein:vir:10 142 ---------------------RYGEP---EIYKVSPGDNMQP----Y------LIHHSRVFIADGE-------------- 173 (427) T ss_pred ---------------------ccCcc---eEEEEecCCCCcc----e------EEccccEEEecCC-------------- Confidence 00000 0111111100000 0 0000000000000 Q ss_pred ccccccCCccceEE-eeCCcCCCCcHH-HHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-----CCchhhHHHHH-- Q lcl|NC_011308. 250 VLGRSYKSRFPFDI-LYNNKLGISDIK-KVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT-----NSPVDEIKKNI-- 320 (530) Q Consensus 250 ~~~~~~~~~iPiv~-~~nn~~~~sd~e-~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~-----~~~~~~~~~~~-- 320 (530) .+|-.. ..++-.|.|.+. .+.+-+.+++.+....+..+..+....+.++|.. .......+... T Consensus 174 --------~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~ 245 (427) T protein:vir:10 174 --------RVAQQARKQNQGWGASVLNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQ 245 (427) T ss_pred --------CchhhhcccCCcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHHhcCccchHHHHHHHHH Confidence 001100 012224666664 4667788889988888888888888888777642 11111112111 Q ss_pred ----h-hCcceecCC-CCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCC-----cccccCCcHHHHHHHHhhHHHH Q lcl|NC_011308. 321 ----Q-SKKIIQTKG-EGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSS-----AVGDGNATNVVIKSRYTLLAMK 389 (530) Q Consensus 321 ----~-~~~~i~~~~-~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~-----~~~~gn~SGvAik~~~~~l~~k 389 (530) + ..+.+.+.+ +.+++.+ ..+.......++...+.|-..+.+|-.- ..++ |+||..=..-|..... T Consensus 246 ~~~~~~~~~~~~l~~~~e~~e~~--~~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Gl-nstgd~D~~nyyd~i~- 321 (427) T protein:vir:10 246 VDDNSGVGRAIGIDAETEEYDVL--NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGV-SASQNTALETFYKLVD- 321 (427) T ss_pred HHHhcCcccceeeecCCCceeEE--ecccCChHHHHHHHHHHHHhhhCCCeeeeccCCcccc-ccchhHHHHHHHHHHH- Confidence 1 133344443 3444444 5667788889999999999999999532 2222 4666643333333221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHH-------HHhcCCCcHHHHHHhC Q lcl|NC_011308. 390 AQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKT-------EAETNQIQINNLLAIA 462 (530) Q Consensus 390 a~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~-------~~~~g~iS~et~l~~~ 462 (530) ...+..++..|.+++.+|.. ..++++.|++=..-+++|.|++..+ ..+.|+++.+.+.. T Consensus 322 -~~Qe~~l~p~l~~l~~~i~~-----------s~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~e~r~-- 387 (427) T protein:vir:10 322 -RKREEDYRPLLEFLLPFIVD-----------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARD-- 387 (427) T ss_pred -HHHHHHHHHHHHHHHHHhhc-----------CCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHH-- Confidence 23456788888888877631 1368999999998999898776433 33344444433322 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccc Q lcl|NC_011308. 463 PRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEE 526 (530) Q Consensus 463 ~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (530) ++...- ......... +.+.+. -++..+++|.-....+.++ T Consensus 388 -----------~L~~~~-----~~~~~~~~~--~~~~e~------~~~~~e~~p~~~e~~~d~~ 427 (427) T protein:vir:10 388 -----------TLRSIA-----PEFKLKDGN--NINIRE------PEETTEPEPGLGEKLEDEN 427 (427) T ss_pred -----------HHHhhh-----ccccCCCCc--cccccc------cchhcCCCCCCCCCCCCCC Confidence 221110 000000000 000000 0000001111111111111 No 111 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=98.77 E-value=5.6e-08 Score=60.32 Aligned_cols=503 Identities=12% Similarity=0.084 Sum_probs=200.0 Q ss_pred CC-----cccccCCc-------ccHHHHHHHHHHHHHHhhh--HHHHHHHHHHhcccchhh----hcccccccccccccc Q lcl|NC_011308. 1 MT-----NTLLTTAP-------DRLGTILSTKIDEYIRSQN--VSLARVGQRYYNQDNDIE----NTRIMWMNDHGDIVE 62 (530) Q Consensus 1 ~~-----~~~~~~~~-------~~~~~~i~~~i~~~~~~~~--~~~~~~~~~YY~g~~~I~----~r~~~~~~~~~~~~~ 62 (530) || .+..+.+. +.+...|.+.++.+++... -.+.+.+.+||...-... .+.-.+.+ ... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~---~~~- 76 (641) T protein:vir:94 1 MTIEMPTPIIEDKESAKRKLSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTG---ADD- 76 (641) T ss_pred CccCCCcccccCCcchhhcCCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccc---cch- Confidence 33 22222221 3344445555555554221 123456667765433221 11111111 000 Q ss_pred cccCCcceeecCchhhHHhhhhhhhcc----cce--eeecCCcchHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcC Q lcl|NC_011308. 63 DDNASNIKISHGFFAELVDQKTQYLLA----NGI--DVKPTDHDDQKLCYLIEEYYN-----EEFQSAIQELVEGSTIKG 131 (530) Q Consensus 63 ~~~~~n~ki~~n~~k~Ivd~~~~yl~G----~pv--~~~~~~~~de~~~~~l~~~~~-----~~~~~~~~e~~~~~~~~G 131 (530) -.+ .+||..+-+...++..++.|++ .+. ++.....++.+..+.++.+++ +++.+..++..+++..+| T Consensus 77 ~~~--r~ki~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~~~g 154 (641) T protein:vir:94 77 ADW--RHRINTGHTFEVVETLVAYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEAASIRDIFETYVRNLVLYG 154 (641) T ss_pred hcc--cccccchhHHHHHHHHhhHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHhhcC Confidence 111 2377777777777777766654 332 343444455555555555443 456667778888999999 Q ss_pred eEEEEEEecC----------------------------CCceEEEEecccceEEEEcCCCCc--eeEEEEEEEEee---- Q lcl|NC_011308. 132 YEGIFARTTS----------------------------EDKLTFQTVDALQLLPVFDDYGTL--QRIIRFYTEQRY---- 177 (530) Q Consensus 132 ~a~~~~y~d~----------------------------~g~~~~~~~~p~~~~~v~d~~~~~--~~~~~~y~~~~~---- 177 (530) .++..++++. ...+++..++|.++| +|.+... ..++++...... T Consensus 155 ~~iv~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~--~dps~~~~~~~f~~~r~t~~t~~~l 232 (641) T protein:vir:94 155 VSTYRLGWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVW--LDTSGGKNTGTFVRLRHTREELHEL 232 (641) T ss_pred ceEEEeehhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhhee--ecCCCCcccccceehhhhHHHHHHH Confidence 9988776541 123466777787775 4444322 122222211100 Q ss_pred --cccc--cccceEEEEEE-Ec-CCceEEEeecCCcccchh--hccccccccccceeeeeecccccceeccccccccccc Q lcl|NC_011308. 178 --SDAD--NKFNSIGHADV-WT-DTEVWYYVQKDEGRSDEY--VLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQ 249 (530) Q Consensus 178 --~~~~--~~~~~~~~~ev-yt-~~~~~~y~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 249 (530) ++.. .........+. ++ .+........+.....-+ ..+..........+.....+ +.... T Consensus 233 ~~eg~~~~d~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g------------~~il~ 300 (641) T protein:vir:94 233 VTSGYYDLDLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYG------------KQLIR 300 (641) T ss_pred HhcCCCChhhcchhhcccccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeC------------CEEee Confidence 0000 00000000000 00 000000000000000000 00000000000000000000 00000 Q ss_pred ccccccCCccceEEeeC-----CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCc Q lcl|NC_011308. 250 VLGRSYKSRFPFDILYN-----NKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKK 324 (530) Q Consensus 250 ~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~ 324 (530) ......|...|++.++. .-+|.|-.+.+.+.+..++.+.....+.+....+|.+++.....-....+ ....++ T Consensus 301 ~~~~~~~d~~Pf~~~r~~~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l--~~~PG~ 378 (641) T protein:vir:94 301 LSDSKYWCGSPFVTTTLLPDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDV--KAKPGA 378 (641) T ss_pred cccccccCcCCeEEecceecCCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeecccccccccee--eccCCc Confidence 11223466778886654 34899999999999999999999999999999999887644322121111 234566 Q ss_pred ceecCCCCceeEEEec-CCHHHHHHHHHHHHHHHHHHhcccCC---Cccccc-CCcHHHHHHHHhhHHHHHHHHHHHHH- Q lcl|NC_011308. 325 IIQTKGEGGLDIQTVD-IPYEARKAKMDIDELNIYRSGMGFNS---SAVGDG-NATNVVIKSRYTLLAMKAQKTEIALR- 398 (530) Q Consensus 325 ~i~~~~~~~~~~lt~~-~~~~~~e~~ld~L~~~I~~~s~~p~~---~~~~~g-n~SGvAik~~~~~l~~ka~~ke~~f~- 398 (530) ++.++..+++.++... .+.......++.+...|-....+..+ .....| +.++..+..++..+..+-...-+.|. T Consensus 379 ii~~~~~~~v~pl~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~ 458 (641) T protein:vir:94 379 VFKVAQHGSLQPIDMGRQDFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIED 458 (641) T ss_pred ceeeCCCCcceeecCCccccchhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7777777888887543 23333334455555444433332222 111112 23444555555555555555555555 Q ss_pred HHHHHHHHHHHHHHHhc-----------------CCCccccceeeEEeCCCCCCCHHH-------HHHHHHHHHhcCCCc Q lcl|NC_011308. 399 KTLRWTADLVVEDIRRR-----------------GLGDYSSTDIKFDIEPYILANELD-------LAMIDKTEAETNQIQ 454 (530) Q Consensus 399 ~~l~~~~~~i~~~l~~~-----------------~~~~~d~~~i~i~f~~~~P~n~~e-------~a~~~~~~~~~g~iS 454 (530) ++|..+++-+.+.+... +.......+++..|.- +|-.... +.+........|..+ T Consensus 459 e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~~~~i-v~l~~~q~~~~~~~i~~l~~~~~~~a~~P 537 (641) T protein:vir:94 459 SSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLHYPYKF-LALGANYVVERERMVTDLLQLLDISGRVP 537 (641) T ss_pred HHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCccceeeeeeE-eecchhHHHHHHHHHHHHHHHHHHhhcCh Confidence 34444554444433221 1112223344444422 3433211 111111111112111 Q ss_pred -----------HHHHHHhCCCCCCH-------HHHHHHHHHHHHHHHHHHHhhhc--------cccccCCcc------cc Q lcl|NC_011308. 455 -----------INNLLAIAPRIGDE-------ETLKAICDTLDLDYEDVVKALED--------QEVEELEPT------VT 502 (530) Q Consensus 455 -----------~et~l~~~~~vdd~-------~~e~~~~e~e~~e~~~~~~~~~~--------~~~~~~~~~------~~ 502 (530) .+.+++..++ .++ +...+.....+++.++.+..... +......++ +. T Consensus 538 ~v~d~~d~~~~~~~~~~~~g~-~~p~~~ir~~~~~~~~~~~~~~~~q~~~~~~a~~~~~~~~~~a~~~~~~~~~~~~~~~ 616 (641) T protein:vir:94 538 QIGQSLDYALILEDLLRQMRF-TDPMRYIKKAEAPPAAPPIAPAEPGALPPEMMNSVGGGLNDQAIAGMTPEDVSDLASR 616 (641) T ss_pred hhhhcCCHHHHHHHHHHHhCC-CCchhhccCccCchhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHHHHHHh Confidence 1222333221 111 10000010111111111100000 000000000 00 Q ss_pred CCCCCCCCCCccCcCCCCccccccc Q lcl|NC_011308. 503 PIIDPLTIEPQPEPLNIDPVIEEEP 527 (530) Q Consensus 503 ~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (530) -..++...-++.--+-+.++|.+.. T Consensus 617 ~~~~~~~~~~~~~~~~~~~~~~~~~ 641 (641) T protein:vir:94 617 IGIDTSDVAPEAMAAATQQITSGAL 641 (641) T ss_pred hcCCchhhhHHHHhcccccccccCC Confidence 0000111111111122233333333 No 112 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=98.76 E-value=6e-08 Score=60.18 Aligned_cols=424 Identities=9% Similarity=-0.029 Sum_probs=209.8 Q ss_pred CCcccccCCcccHHHHHHHHHHHHH---------------------------HhhhHHHHHHHHHHhcccchhhhccccc Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYI---------------------------RSQNVSLARVGQRYYNQDNDIENTRIMW 53 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~---------------------------~~~~~~~~~~~~~YY~g~~~I~~r~~~~ 53 (530) +.-..---.+-.... .......|. ......+. ++.| T Consensus 11 ~dr~i~~~~~~~~~~-~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~Ra---RdL~------------- 73 (505) T protein:vir:96 11 AQRMVNWAWYRYVEP-QKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRA---REQS------------- 73 (505) T ss_pred hhcccchhhhhhHHH-HHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHH---HHHH------------- Confidence 000000000000000 000000000 00000111 1111 Q ss_pred ccccccccccccCCcceeecCchhhHHhhhhhhhcc-cceeeecCCc-----chHHHHHHHHHHhhc------------- Q lcl|NC_011308. 54 MNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLA-NGIDVKPTDH-----DDQKLCYLIEEYYNE------------- 114 (530) Q Consensus 54 ~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G-~pv~~~~~~~-----~de~~~~~l~~~~~~------------- 114 (530) ..+++++-+|+..+.+++| ..+++++... .++++.+.+...+.. T Consensus 74 -----------------rNn~~a~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~ 136 (505) T protein:vir:96 74 -----------------INNPYAKRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRY 136 (505) T ss_pred -----------------hcChHHHHHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccC Confidence 1246788899999999999 5888765421 244455555444321 Q ss_pred cHHHHHHHHHHHHhhcCeEEEEEEecCCC--ceEEEEecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEE Q lcl|NC_011308. 115 EFQSAIQELVEGSTIKGYEGIFARTTSED--KLTFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADV 192 (530) Q Consensus 115 ~~~~~~~e~~~~~~~~G~a~~~~y~d~~g--~~~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~ev 192 (530) +|......+.+.....|.+|..+.....+ -+++..++|..+---++....-...|+ .=++ .+ ..+..+ .+.+ T Consensus 137 ~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~lqliepd~l~~~~n~~~~~~~~i~-~GIe-~d---~~Gr~~-aY~i 210 (505) T protein:vir:96 137 HFVTLLHLWMETLARDGEVLVREHRGYPNKWGYALQILECDRLDLNYNADLQNGNRIR-MSIE-LD---AWERPV-AYHL 210 (505) T ss_pred CHHHHHHHHHHHHhhCCceEEEEeecCCCCcceEEEEechhhcCCCCCcccCCcCeEE-eceE-EC---CCCceE-EEEE Confidence 24444555677888999998766554433 268888999876221111000001110 0000 00 011111 1111 Q ss_pred EcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccc---eEEeeC--- Q lcl|NC_011308. 193 WTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFP---FDILYN--- 266 (530) Q Consensus 193 yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP---iv~~~n--- 266 (530) +.. ..++.... .......|.+|| |+|+.. T Consensus 211 ~~~---------hPgd~~~~------------------------------------~~~~~~~~~rvpa~~vlH~f~~~r 245 (505) T protein:vir:96 211 LVN---------HPGDNSYC------------------------------------YHYAGQTYERVPADEIIHTFVPWR 245 (505) T ss_pred eec---------CCCccccc------------------------------------cccccccccccCHhHhhhhhcccC Confidence 111 11100000 000011244444 344433 Q ss_pred --CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC-------CchhhHHHHHhhCcceecCCCCceeEE Q lcl|NC_011308. 267 --NKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN-------SPVDEIKKNIQSKKIIQTKGEGGLDIQ 337 (530) Q Consensus 267 --n~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~-------~~~~~~~~~~~~~~~i~~~~~~~~~~l 337 (530) ..-|.|.|..++..+..++....-......-.+.--.+|+.... +..+.....+..+.+..+..|.+++++ T Consensus 246 ~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~ 325 (505) T protein:vir:96 246 PHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPPEDDQGEIVEEVEAGTYQLLPYGIRFKEH 325 (505) T ss_pred CccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCccccccCccccccCCceeeecCCCCeeeee Confidence 23699999998877766655444333333222222233443211 111222334666777788999999999 Q ss_pred EecCCHHHHHHHHHHHHHHHHHHhcccCCC-cccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhc Q lcl|NC_011308. 338 TVDIPYEARKAKMDIDELNIYRSGMGFNSS-AVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTL-RWTADLVVEDIRRR 415 (530) Q Consensus 338 t~~~~~~~~e~~ld~L~~~I~~~s~~p~~~-~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l-~~~~~~i~~~l~~~ 415 (530) +.+.+...+..++..+.+.|-.-..+|--. ...++++|-.+.+.-+...-..+...+..|...+ +.+++..+..+-.. T Consensus 326 ~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~ 405 (505) T protein:vir:96 326 KIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLT 405 (505) T ss_pred CCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 999999999999999999998888877321 1134556677888888888777777777776543 44455444433233 Q ss_pred CCCc---cccc-eeeEEeCC-CCC-CCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011308. 416 GLGD---YSST-DIKFDIEP-YIL-ANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKAL 489 (530) Q Consensus 416 ~~~~---~d~~-~i~i~f~~-~~P-~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~ 489 (530) |... ++.. -..+.|.. ..| .|-.-.++.......+|+.|.+.++...+. |+++.++++.+|.+..++.--.+ T Consensus 406 G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~--D~~~v~~q~a~e~~~~~~~Gl~~ 483 (505) T protein:vir:96 406 QALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAAGD--DPEDVFDEIAWEEQLMRDKGVNP 483 (505) T ss_pred CCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCC--CHHHHHHHHHHHHHHHHHcCCCC Confidence 3211 1111 12344432 222 466666777788999999999999999875 89988888887766554321100 Q ss_pred hccccccCCccccCCCCCCCCCCccCcCCC Q lcl|NC_011308. 490 EDQEVEELEPTVTPIIDPLTIEPQPEPLNI 519 (530) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (530) .. + .........+++.+++.+. T Consensus 484 ~~-------~-~~~~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 484 TP-------P-EQESKDATTDEEDDSASDD 505 (505) T ss_pred CC-------C-CCCCCCCCCCCCCCCCCCC Confidence 00 0 0000000011111111111 No 113 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=98.69 E-value=1e-07 Score=58.85 Aligned_cols=464 Identities=11% Similarity=-0.035 Sum_probs=208.6 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcce----------eecCchhhHHhh Q lcl|NC_011308. 13 LGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIK----------ISHGFFAELVDQ 82 (530) Q Consensus 13 ~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~k----------i~~n~~k~Ivd~ 82 (530) +.+.+.+.|.-.-...-..+...-...|.|-..- .|....+............++.+ ..+++++-.|+. T Consensus 1 m~~~~~r~~~~~a~~~~~~~~~~~~~~y~gA~~~-~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~ 79 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQSASLGGGGLEGASRL-SRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGY 79 (553) T ss_pred CcchhhhhhcccccccchhhhhhhcccccccccC-CCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH Confidence 1122222222211111111222222334442110 11111000000000001111111 146889999999 Q ss_pred hhhhhcccceeeecCC----------cchHHHHHHHHHHhh----c-----------cHHHHHHHHHHHHhhcCeEEEEE Q lcl|NC_011308. 83 KTQYLLANGIDVKPTD----------HDDQKLCYLIEEYYN----E-----------EFQSAIQELVEGSTIKGYEGIFA 137 (530) Q Consensus 83 ~~~yl~G~pv~~~~~~----------~~de~~~~~l~~~~~----~-----------~~~~~~~e~~~~~~~~G~a~~~~ 137 (530) .+.+++|..++.++.. ..++.+.+.+...|. + +|......+++.....|.+|... T Consensus 80 ~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~ 159 (553) T protein:vir:63 80 QRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATA 159 (553) T ss_pred HHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEEe Confidence 9999999999886542 123444444443331 1 23444455667788999999766 Q ss_pred EecCC-C---ceEEEEecccceEEEEcCCC--CceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccch Q lcl|NC_011308. 138 RTTSE-D---KLTFQTVDALQLLPVFDDYG--TLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDE 211 (530) Q Consensus 138 y~d~~-g---~~~~~~~~p~~~~~v~d~~~--~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~ 211 (530) .+... | -+++.+++|..+-.-++... .+...|. . + ..+.. ..+.++.....-.|.......... T Consensus 160 ~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE-----~-d---~~Gr~-vaY~i~~~hPgd~~~~~~~~~~~~ 229 (553) T protein:vir:63 160 EWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQ-----Y-D---KRGRP-QGYWIQVAHPGDLYQMAPDMYKWK 229 (553) T ss_pred eeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeE-----E-C---CCCce-EEEEeeccCCCcccccccccccee Confidence 55443 2 36788899977633232211 1111111 0 0 11111 112222221111110000000000 Q ss_pred hhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 212 YVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFL 291 (530) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~ 291 (530) ..... ...+ -...-|.|... .....-|.|.|..++..+-.++....-- T Consensus 230 r~~~~--~~v~--------------------------a~~vlH~f~~~----r~gQ~RGis~lapvl~~l~~l~~y~dae 277 (553) T protein:vir:63 230 FVQQS--KPWG--------------------------RRQVIHILEPR----EPDQSRGIADIVSGLKDMRMAKRFKEMS 277 (553) T ss_pred eeccc--cccC--------------------------hhHheeccccc----CCCcccCCchHHHHHHHHHHHhHHHHHH Confidence 00000 0000 00011222211 1123468999998877766555443332 Q ss_pred HHHHHHhccceeeeec-CCCCc-----------------------------hhhHHHHHhhCcceecCCCCceeEEEecC Q lcl|NC_011308. 292 SNNLQDMAEAIYVVRG-GTNSP-----------------------------VDEIKKNIQSKKIIQTKGEGGLDIQTVDI 341 (530) Q Consensus 292 ~n~~~~~~~~~lvl~g-~~~~~-----------------------------~~~~~~~~~~~~~i~~~~~~~~~~lt~~~ 341 (530) .....--+.-.++|+. .+.+. .+.....+..+.+..+..|.++++++... T Consensus 278 L~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~ 357 (553) T protein:vir:63 278 LQNAVINASYAAAIESELPPEFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGT 357 (553) T ss_pred HHHHHHhhhheeeeecCCChhhhhhhcccccccccccccccccccccccccccccceeecCceeeecCCCCeeeecCCCC Confidence 2222221211222321 11000 00111235566777788899999999998 Q ss_pred CHHHHHHHHHHHHHHHHHHhcccCC-CcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCc Q lcl|NC_011308. 342 PYEARKAKMDIDELNIYRSGMGFNS-SAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLR-WTADLVVEDIRRRGLGD 419 (530) Q Consensus 342 ~~~~~e~~ld~L~~~I~~~s~~p~~-~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~-~~~~~i~~~l~~~~~~~ 419 (530) +...+..+...+.+.|-.-..+|-- -...++++|-.+.+.-+...-..+...+..|...+- -+++..+...-..+..+ T Consensus 358 p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~ 437 (553) T protein:vir:63 358 PGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVP 437 (553) T ss_pred CCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcc Confidence 8888889999999988888877721 112245566667777777776666666666655443 23343332211222111 Q ss_pred c------c--------cceeeEEeCCCCC--CCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_011308. 420 Y------S--------STDIKFDIEPYIL--ANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYE 483 (530) Q Consensus 420 ~------d--------~~~i~i~f~~~~P--~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~ 483 (530) . . ...+.+.|...-. .|-.-.++.....+.+|+.|.+.++...+. |+++.++++.+|.+..+ T Consensus 438 ~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~G~--D~~~v~~q~a~e~~~~~ 515 (553) T protein:vir:63 438 MPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIARLGG--DFRKSFAQRAREDALLK 515 (553) T ss_pred CCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHH Confidence 0 0 0112345543322 465555677778999999999999999974 89999888888766554 Q ss_pred HHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccc Q lcl|NC_011308. 484 DVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEE 526 (530) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (530) +.--.+..... ...+.+.. .+..+.+.|...++..++| T Consensus 516 ~~Gl~~~~~~~---~~~~~~~~--~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 516 KYGLTFNLSAK---RSLGDGRD--AATGIAEDPAAAQTSQQGE 553 (553) T ss_pred HcCCCCCCCCc---cccCCCcc--cCCCCCCCCCCCCcccccC Confidence 43111110000 00000000 0000001111111111112 No 114 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=98.64 E-value=1.5e-07 Score=57.93 Aligned_cols=442 Identities=11% Similarity=0.068 Sum_probs=168.4 Q ss_pred CCcccccCCc----ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcc-eee-cC Q lcl|NC_011308. 1 MTNTLLTTAP----DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNI-KIS-HG 74 (530) Q Consensus 1 ~~~~~~~~~~----~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~-ki~-~n 74 (530) =+.+-+.... .++..+-...++|..+.+... |..-.-+...-......+. .+.. ..+... .. ... .+ T Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~-~~~~~~~~~~~~~g~~~~~-~~~~----~~~l~~-l~~~~~~np 87 (547) T protein:vir:63 15 SDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVA-YSQPVIGSMSANPGFKTKP-SIRN----NQDLHG-VLKKFGGNI 87 (547) T ss_pred cccccccccccccchhhhhhhHHHHHHhhcccchh-hhchhhheeecccccccCC-ccCC----hhHHHH-HHHHhhcCH Confidence 0011111111 111222222333433322221 1111111111111111000 0000 000000 00 001 12 Q ss_pred chhhHH----hhhhhhh---------cccceeeecCC----cchHHHHHHHHHHhhc----------cHHHHHHHHHHHH Q lcl|NC_011308. 75 FFAELV----DQKTQYL---------LANGIDVKPTD----HDDQKLCYLIEEYYNE----------EFQSAIQELVEGS 127 (530) Q Consensus 75 ~~k~Iv----d~~~~yl---------~G~pv~~~~~~----~~de~~~~~l~~~~~~----------~~~~~~~e~~~~~ 127 (530) ....+| ++.++|. +|=++++...+ ..+......|.+++.. .+......+..+. T Consensus 88 iv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~f~~~lv~d~ 167 (547) T protein:vir:63 88 ILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDT 167 (547) T ss_pred HHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHHHHHHHHHHHH Confidence 233333 3333332 11122222111 1122233344444321 2334555677788 Q ss_pred hhcCeEEEEEEecCCCceE-EEEecccceEEEEcCCCCc-eeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecC Q lcl|NC_011308. 128 TIKGYEGIFARTTSEDKLT-FQTVDALQLLPVFDDYGTL-QRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKD 205 (530) Q Consensus 128 ~~~G~a~~~~y~d~~g~~~-~~~~~p~~~~~v~d~~~~~-~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~ 205 (530) ..+|.+|.++-++..|++. +..++|..+.++.+..+.. ...++|+.+. . +. ....|..+.+.|++... T Consensus 168 ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~--~-----~~---~~~~~~~~eiih~r~n~ 237 (547) T protein:vir:63 168 YMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVI--D-----QK---IVATFNAREMAFAVRNP 237 (547) T ss_pred HhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCccccccCceEEEEEc--C-----Cc---EEEEeccccEEEecccC Confidence 8999999988889999864 6889999998887765432 1122222210 0 00 11124444555443211 Q ss_pred CcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHH Q lcl|NC_011308. 206 EGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYD 285 (530) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~ 285 (530) . ..+ .....|.|-++.....|.... T Consensus 238 ~--------------------------------------------------~~~-----~~~~~G~Spi~~~~~~i~~~~ 262 (547) T protein:vir:63 238 R--------------------------------------------------SDI-----YATGYGYPELEIALKQFIAHE 262 (547) T ss_pred C--------------------------------------------------CCc-----ccccccccHHHHHHHHHHHHH Confidence 0 000 001136676766555555554 Q ss_pred HHHHHHHHHHHHhccce--eeeecCC-CCc--hhhHHHHHh--------hCcceecCCCCceeEEEecCC--HHHHHHHH Q lcl|NC_011308. 286 LMNCFLSNNLQDMAEAI--YVVRGGT-NSP--VDEIKKNIQ--------SKKIIQTKGEGGLDIQTVDIP--YEARKAKM 350 (530) Q Consensus 286 ~~~S~~~n~~~~~~~~~--lvl~g~~-~~~--~~~~~~~~~--------~~~~i~~~~~~~~~~lt~~~~--~~~~e~~l 350 (530) .+..-..+.+.-.+.|- |.++|.. .++ ...++..+. .+++..+. +++++|....++ +..+.... T Consensus 263 ~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~vl~-~~g~~~~~l~~~~~d~qfle~~ 341 (547) T protein:vir:63 263 NTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVS-AEDVKFVNMTPSARDMEFEKWL 341 (547) T ss_pred HHHHHHHHHHHcCCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCccccccccccc-CCCceEEEcCCChhHHHHHHHH Confidence 44444444444444444 4445532 222 122232221 11222232 344555554443 33334445 Q ss_pred HHHHHHHHHHhcccC--CCcccccCC---cHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccee Q lcl|NC_011308. 351 DIDELNIYRSGMGFN--SSAVGDGNA---TNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDI 425 (530) Q Consensus 351 d~L~~~I~~~s~~p~--~~~~~~gn~---SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i 425 (530) +...+.|...-.+|. ++....+.. ++-.+- ++.+ ......++...|.-+++.|...|+..-...+. ..+ T Consensus 342 ~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t--~sn~---e~~~~~~~~~tL~P~~~~ie~~ln~~L~~~~~-~~~ 415 (547) T protein:vir:63 342 NYLINVISALYGIDPAEINIPNNGGATGSKGGSLN--EGNS---AEKNQASKNKGLQPLLGFIEDFINKHIVAEFG-DKY 415 (547) T ss_pred HHHHHHHHHHhCCCHHHcCcccccccccccccccc--hhhH---HHHHHHHHHHHHHHHHHHHHHHHHhhcccccC-Cce Confidence 666777777777774 221111100 111110 0111 11222344555555555555554443222222 347 Q ss_pred eEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC---CCCHHHHHH-----H----HHHHHHHHHHHHHhhhccc Q lcl|NC_011308. 426 KFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR---IGDEETLKA-----I----CDTLDLDYEDVVKALEDQE 493 (530) Q Consensus 426 ~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~---vdd~~~e~~-----~----~e~e~~e~~~~~~~~~~~~ 493 (530) .+.|+.....+..+.+++. .+...|+++...+++++++ ++.-+..+. . ..+.+.+.+.......... T Consensus 416 ~~~f~~~~~~~~~~~~~~~-~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (547) T protein:vir:63 416 TFQFVGGDIKSELESVKIL-AEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQ 494 (547) T ss_pred EEEeeccccccHHHHHHHH-HHHhCCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCCccccchhhccccc Confidence 7889888888888777654 4566788999999988754 222111110 0 0000000000000000000 Q ss_pred cccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 494 VEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 494 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) ++...+ ...++.++|.+.+.....+..+. T Consensus 495 ~~~~~~--------~~~~~~~~~~~~~~~~~~~~d~~ 523 (547) T protein:vir:63 495 EQTGNR--------VSTDVEDIPDGKDTTGDIGKDGQ 523 (547) T ss_pred cccCCC--------CCCCCCCCCCCcccCCCcCcccc Confidence 000000 00111111111111111111111 No 115 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=98.63 E-value=1.6e-07 Score=57.78 Aligned_cols=442 Identities=10% Similarity=-0.033 Sum_probs=209.1 Q ss_pred cCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcc----------eeecCch Q lcl|NC_011308. 7 TTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNI----------KISHGFF 76 (530) Q Consensus 7 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~----------ki~~n~~ 76 (530) -.-| .-..+. .. .+.........||.|...-- +....+............++. =..++++ T Consensus 1 ~~~p--~~~~~~----~~---~~~~~~~~~~~y~~~a~~~~-~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a 70 (533) T protein:vir:34 1 MKTP--TIPTLL----GP---DGMTSLREYAGYHGGGSGFG-GQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYA 70 (533) T ss_pred CCCc--hhhhhh----cc---cccchHHHHHhhhhccCCCC-CcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHH Confidence 1111 111111 11 11111234456676532111 111110000000000001110 1146889 Q ss_pred hhHHhhhhhhhcccceeeecCC---------cchHHHHHHHHHHhh----c-----------cHHHHHHHHHHHHhhcCe Q lcl|NC_011308. 77 AELVDQKTQYLLANGIDVKPTD---------HDDQKLCYLIEEYYN----E-----------EFQSAIQELVEGSTIKGY 132 (530) Q Consensus 77 k~Ivd~~~~yl~G~pv~~~~~~---------~~de~~~~~l~~~~~----~-----------~~~~~~~e~~~~~~~~G~ 132 (530) +-.|+..+.+++|..++.++.. ..+++..+.++..|. + +|......+++...+.|. T Consensus 71 ~~av~~~~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE 150 (533) T protein:vir:34 71 ANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGE 150 (533) T ss_pred HHHHHHHHHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCc Confidence 9999999999999999987642 234445555544432 1 244444556777889999 Q ss_pred EEEEEEecCCC----ceEEEEecccceEEEEcCC--CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCC Q lcl|NC_011308. 133 EGIFARTTSED----KLTFQTVDALQLLPVFDDY--GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDE 206 (530) Q Consensus 133 a~~~~y~d~~g----~~~~~~~~p~~~~~v~d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~ 206 (530) +|....+...+ -+++.+++|..+---++.. ..+...|. ++ ..+..+. +.++. .... T Consensus 151 ~f~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe------~d---~~Gr~~a-Y~i~~--------~~~~ 212 (533) T protein:vir:34 151 LFVQATWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQ------IN---DSGAALG-YYVSE--------DGYP 212 (533) T ss_pred eEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCceEeeeE------EC---CCCCeEE-EEEee--------cCCC Confidence 99877665543 3678889987653212211 11111111 00 0111111 11111 1111 Q ss_pred cccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccc---eEEeeC-----CcCCCCcHHHHH Q lcl|NC_011308. 207 GRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFP---FDILYN-----NKLGISDIKKVK 278 (530) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP---iv~~~n-----n~~~~sd~e~v~ 278 (530) +........ . -.+..+| |+|+.. ..-|.|.|..++ T Consensus 213 ~~~~~~~~~---------------------------------~----~~~~~v~a~~VlH~f~~~r~gQ~RGis~lapvl 255 (533) T protein:vir:34 213 GWMPQKWTW---------------------------------I----PRELPGGRASFIHVFEPVEDGQTRGANVFYSVM 255 (533) T ss_pred Cccccccce---------------------------------e----eeeeccChhHeeeeccccCCCcccCCchHHHHH Confidence 100000000 0 0011122 344333 346999999887 Q ss_pred HHHHHHHHHHHHHHHHHHHhccceeeeecCC-------------CCch-hhH--------------HHHHhhCcceecCC Q lcl|NC_011308. 279 SIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT-------------NSPV-DEI--------------KKNIQSKKIIQTKG 330 (530) Q Consensus 279 ~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~-------------~~~~-~~~--------------~~~~~~~~~i~~~~ 330 (530) ..+..++.-..-......--+.--.+++... .++. +.+ ...+..+.+..+.. T Consensus 256 ~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~p 335 (533) T protein:vir:34 256 EQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMP 335 (533) T ss_pred HHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhccCcceeeccCceeeecCC Confidence 7666554433222211111111112222110 0000 000 01255667777888 Q ss_pred CCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCC-cccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHH-HHHHHHH Q lcl|NC_011308. 331 EGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSS-AVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTL-RWTADLV 408 (530) Q Consensus 331 ~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~-~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l-~~~~~~i 408 (530) |.++++++.+.+...+..+...+.+.|-.-..+|--. ...++++|-.+++.-+......+...+..|...+ +-+++.. T Consensus 336 Ge~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~w 415 (533) T protein:vir:34 336 GDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCW 415 (533) T ss_pred CCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999888889999999999988888777311 1134556667787777777776766666555543 3333322 Q ss_pred HHHHHhcCC------Cccccc-----eeeEEeCCC--CCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHH Q lcl|NC_011308. 409 VEDIRRRGL------GDYSST-----DIKFDIEPY--ILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAIC 475 (530) Q Consensus 409 ~~~l~~~~~------~~~d~~-----~i~i~f~~~--~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~ 475 (530) +...-..+. ...++. ...+.|... .-.|-.-.++.......+|+.|.+.++...+. |+++.++++ T Consensus 416 l~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G~--D~~ev~~q~ 493 (533) T protein:vir:34 416 LEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRGD--DYQEIFAQQ 493 (533) T ss_pred HHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCC--CHHHHHHHH Confidence 221111221 111111 123444322 22466566677788999999999999999875 899888888 Q ss_pred HHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccc Q lcl|NC_011308. 476 DTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIE 524 (530) Q Consensus 476 e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (530) .+|.+..++.--.+. ..+...... ...++++++. .+.+.. T Consensus 494 a~e~~~~~~~gl~~~------~~~~~~~~s-~~~~~~~~~~--~~~~~~ 533 (533) T protein:vir:34 494 VRETMERRAAGLKPP------AWAAAAFES-GLRQSTEEEK--SDSRAA 533 (533) T ss_pred HHHHHHHHhcCCCCC------CCCCcCccC-CCCCCCCCCc--ccCCCC Confidence 877666544311111 111111100 0011111111 111111 No 116 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=98.53 E-value=3.5e-07 Score=55.95 Aligned_cols=440 Identities=12% Similarity=0.011 Sum_probs=202.8 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcc----------e Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNI----------K 70 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~----------k 70 (530) |. ++...-...... .+-.....-|+|-..--. ....... .......++. = T Consensus 1 m~-~~~~~~~a~~~~---------------~~~~~~~~~y~aa~~~~~--~~~~~~~--s~d~~~~~~~~~lr~RaRdl~ 60 (495) T protein:vir:10 1 MN-MTPSGYQSLASG---------------LLVPVGASAYEGASGGHR--WQDIGDY--GPDTAVASGIQTLRARSHHNV 60 (495) T ss_pred CC-cccccccccchh---------------hhhHHHhhhhhccccCcc--cCCCCCC--ChhHHHHHHHHHHHHHHHHHH Confidence 11 111110000000 000111122333221100 0000000 0000000000 1 Q ss_pred eecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHh----hc-------cHHHHHHHHHHHHhhcCeEEEEEEe Q lcl|NC_011308. 71 ISHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYY----NE-------EFQSAIQELVEGSTIKGYEGIFART 139 (530) Q Consensus 71 i~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~----~~-------~~~~~~~e~~~~~~~~G~a~~~~y~ 139 (530) ..+++++-.|+..+.+++|..++.++.. +++++.+.|...+ ++ +|......+++.....|.+|....+ T Consensus 61 rNn~~a~~av~~~~~~vVG~Gi~p~~~~-~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~ 139 (495) T protein:vir:10 61 RNNPWATNAVATWVAAAVGNGLTPRWRM-KEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKP 139 (495) T ss_pred hcChHHHHHHHHHHHhhcCCCcccccCC-chHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEee Confidence 1367899999999999999999887653 3444555555444 21 4555556677888899999975544 Q ss_pred c--CCC---ceEEEEecccceE-EEEcC---CC-CceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCccc Q lcl|NC_011308. 140 T--SED---KLTFQTVDALQLL-PVFDD---YG-TLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRS 209 (530) Q Consensus 140 d--~~g---~~~~~~~~p~~~~-~v~d~---~~-~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~ 209 (530) . .+| -+++..++|..+- |.-+. .+ .+...|.+ + ..+..+. |++.....++. T Consensus 140 ~~~~~g~~~~~~lqliepd~l~~~~~~~~~~~g~~i~~GIe~------d---~~Gr~va----------Y~i~~~hpgd~ 200 (495) T protein:vir:10 140 RPLSEGLSVPLQLQIIEPDMLASDIPDETLPSGGYVKGGIRF------S---NGGKRKA----------YCFYRNHPAES 200 (495) T ss_pred cccCCCCccceEEEEechhhcCCCCCCCCCCCCCEEEeceEE------C---CCCceEE----------EEEeecCCCcc Confidence 3 333 3788999998862 32111 11 11111110 0 0111111 11111111111 Q ss_pred chhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHH Q lcl|NC_011308. 210 DEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNC 289 (530) Q Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S 289 (530) ..........-.+ -...-|.|.. .+...-|.|.|..++.|-|.-+..-+ T Consensus 201 ~~~~~~~~~~rvp--------------------------A~~vlH~f~~-----r~gQ~RGis~la~i~~l~~l~~y~da 249 (495) T protein:vir:10 201 SLIGDPVDTVWIK--------------------------AEHVLHVTVL-----TVRSDAGAPWFQLLLRLNELDQYEDA 249 (495) T ss_pred cccccccceeeec--------------------------hhheEecccc-----CCCcccCcchhHHHHHHHHhhHHHHH Confidence 0000000000000 0011233321 12344588888776665332222222 Q ss_pred HHHHHHHHhccceeeeecCCCC-------------chhhHHHHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHH Q lcl|NC_011308. 290 FLSNNLQDMAEAIYVVRGGTNS-------------PVDEIKKNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELN 356 (530) Q Consensus 290 ~~~n~~~~~~~~~lvl~g~~~~-------------~~~~~~~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~ 356 (530) .+....-.-..+ .+++....+ ..+.....+..+.+..+..|.++++++...+...+..++..+.+. T Consensus 250 el~~a~i~A~~~-~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~ 328 (495) T protein:vir:10 250 ELVRKKTAALFA-AFIQEATADSTGGPTIGQPKRSKGGKRITGLNPGTLQYLQPGQEVKFSNPADVGTTYEPWLRYQLLS 328 (495) T ss_pred HHHHHHHhhhhe-eeeecCCCccccccccCccccccCcccceecCCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHH Confidence 222221111122 233321110 011112235666777888899999999988888888899999888 Q ss_pred HHHHhcccCCCc-ccccCCcHHHHHHHHhhHHHHHHHHHH-HHHHH-HHHHHHHHHHHHHhcCCCc----cccc--eeeE Q lcl|NC_011308. 357 IYRSGMGFNSSA-VGDGNATNVVIKSRYTLLAMKAQKTEI-ALRKT-LRWTADLVVEDIRRRGLGD----YSST--DIKF 427 (530) Q Consensus 357 I~~~s~~p~~~~-~~~gn~SGvAik~~~~~l~~ka~~ke~-~f~~~-l~~~~~~i~~~l~~~~~~~----~d~~--~i~i 427 (530) |-.-..+|--.- ..++++|-.+++.-+...-..+...+. .+-.. ++.+++..+...-..|... ++.. .+.+ T Consensus 329 iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~ 408 (495) T protein:vir:10 329 IAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRV 408 (495) T ss_pred HHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhcc Confidence 888877772111 123445556777777777666665443 44443 3444444333322233211 1111 1234 Q ss_pred EeCC-CCC-CCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCC Q lcl|NC_011308. 428 DIEP-YIL-ANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPII 505 (530) Q Consensus 428 ~f~~-~~P-~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~ 505 (530) .|.. ..| .|-.-.++.....+.+|+.|.+.++...+. |+++..+++.+|.+..++.--.+.. .+-.... T Consensus 409 ~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~--D~~~v~~q~a~e~~~~~~~Gl~~~~------~p~~~~~- 479 (495) T protein:vir:10 409 SWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAERGY--DMEELFDMISDANQLIDEYDLRLDS------DPRYVNG- 479 (495) T ss_pred ccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCC--CHHHHHHHHHHHHHHHHHcCCCCCC------CCCcCCC- Confidence 4532 222 466666777888999999999999999875 8998888888876655433111110 0000000 Q ss_pred CCCCCCCccCcCCCCc Q lcl|NC_011308. 506 DPLTIEPQPEPLNIDP 521 (530) Q Consensus 506 ~~~~~~~~~~~~~~~~ 521 (530) ....+.+.+++...+. T Consensus 480 ~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 480 SGAEQKSVMEAALNNE 495 (495) T ss_pred ccCCCCCCCCCCCCCC Confidence 0111111111111111 No 117 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=98.49 E-value=4.6e-07 Score=55.33 Aligned_cols=453 Identities=11% Similarity=0.080 Sum_probs=170.2 Q ss_pred CCcccccCCcccHHHHHH------HHHHHHHHhhhHHHHHHHHHHhcccchhhhccccccccccccccc--ccCCcceee Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILS------TKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVED--DNASNIKIS 72 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~------~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~--~~~~n~ki~ 72 (530) |.+.-.++.+.--...+. ..|+|..+.+.+.-..-..+.......+..++. .-........ ....++=|+ T Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~--~~~~~~~l~~~l~~~~~n~i~ 97 (563) T protein:vir:99 20 IAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRS--YMKNEHNLHDVLKKFGNNPIL 97 (563) T ss_pred cceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhccccccccccc--CCCCcccHHHHHHHhhcchHH Confidence 333333222211111111 123333322221111111111111111111100 0000000000 000111111 Q ss_pred cCchhhHHhhhhhhhc---------ccceeeecCCcc----hHHHHHHHHHHhh----c------cHHHHHHHHHHHHhh Q lcl|NC_011308. 73 HGFFAELVDQKTQYLL---------ANGIDVKPTDHD----DQKLCYLIEEYYN----E------EFQSAIQELVEGSTI 129 (530) Q Consensus 73 ~n~~k~Ivd~~~~yl~---------G~pv~~~~~~~~----de~~~~~l~~~~~----~------~~~~~~~e~~~~~~~ 129 (530) ..-...+.+..+.|.+ |=|+.+...+.. .......|..++. + .+......+..+... T Consensus 98 ~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll 177 (563) T protein:vir:99 98 NAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYI 177 (563) T ss_pred HHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHh Confidence 1112223333333322 234544322211 1111222333331 1 234556677888899 Q ss_pred cCeEEEEEE--ecCCCce-EEEEecccceEEEEcCCCCcee-EEEEEEEEeecccccccceEEEEEEEcCCceEEEeecC Q lcl|NC_011308. 130 KGYEGIFAR--TTSEDKL-TFQTVDALQLLPVFDDYGTLQR-IIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKD 205 (530) Q Consensus 130 ~G~a~~~~y--~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~-~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~ 205 (530) +|.+|.++. .+..|++ .+..++|..+.++.+..+.+.. ..+++.... +. ....|..+.+.+++..- T Consensus 178 ~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~-------g~---~~~~~~~~evI~~~~~~ 247 (563) T protein:vir:99 178 YDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVD-------KR---VVASFTSRELAMGIRNP 247 (563) T ss_pred cCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeC-------Cc---eeEEecCcceEEEeccC Confidence 999988765 4556765 4778999999988877654322 122221110 00 11123333332221100 Q ss_pred CcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHH Q lcl|NC_011308. 206 EGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYD 285 (530) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~ 285 (530) .. +. ...-.|.|-++.....|.... T Consensus 248 ~~----------------------------------------------d~---------~~~~~G~Spi~~a~~~i~~~~ 272 (563) T protein:vir:99 248 RT----------------------------------------------EL---------SSSGYGLSEVEIAMKEFIAYN 272 (563) T ss_pred CC----------------------------------------------Cc---------ccCcccchHHHHHHHHHHHHH Confidence 00 00 001146666665555555444 Q ss_pred HHHHHHHHHHHHhccceeee--ecCC-CCc--hhhHHHHHh--------hCcc-eecCCCCceeEEEecCCHHHHHHHHH Q lcl|NC_011308. 286 LMNCFLSNNLQDMAEAIYVV--RGGT-NSP--VDEIKKNIQ--------SKKI-IQTKGEGGLDIQTVDIPYEARKAKMD 351 (530) Q Consensus 286 ~~~S~~~n~~~~~~~~~lvl--~g~~-~~~--~~~~~~~~~--------~~~~-i~~~~~~~~~~lt~~~~~~~~e~~ld 351 (530) .+..-.++.+.-.+.|-.+| .|.. .++ ...++..+. .+++ +.+++|.++.-++.+..+..+..... T Consensus 273 ~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~ 352 (563) T protein:vir:99 273 NTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLN 352 (563) T ss_pred HHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHH Confidence 44444455555555555444 4432 222 122222221 1222 44555555555554444445556667 Q ss_pred HHHHHHHHHhcccC--CCccccc----CCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccee Q lcl|NC_011308. 352 IDELNIYRSGMGFN--SSAVGDG----NATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDI 425 (530) Q Consensus 352 ~L~~~I~~~s~~p~--~~~~~~g----n~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i 425 (530) ...+.|...-.+|. ++...-| +..|..+.. + .-......+++..|.-+++.|...++.+-..++. ..+ T Consensus 353 ~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~--s---n~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~-~~~ 426 (563) T protein:vir:99 353 YLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNE--A---DPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEYG-DKY 426 (563) T ss_pred HHHHHHHHHhCCCHHHccccccccccccccccchhh--c---cHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhcc-ccc Confidence 78888888888885 2221111 111111110 0 0111222344445555455454444432222222 245 Q ss_pred eEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHHHHH--------HH----HHHHHHHHHHHHhhhc Q lcl|NC_011308. 426 KFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEETLKA--------IC----DTLDLDYEDVVKALED 491 (530) Q Consensus 426 ~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~e~~--------~~----e~e~~e~~~~~~~~~~ 491 (530) .+.|.+.-+.+..+..++ ..+...|+++...+++.+++ +++-+.-+. .. ..+....+.....+.. T Consensus 427 ~~~f~r~D~~~~~e~~~~-~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (563) T protein:vir:99 427 TFQFVGGDTKSATDKLNI-LKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMS 505 (563) T ss_pred EEEeccCCHHHHHHHHHH-HHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhccc Confidence 677877766666555443 34577899999998888754 332111100 00 0000001111111111 Q ss_pred cc-cccCCccccCCCCCCCCCCccCcCCCCcccccc------cCCC Q lcl|NC_011308. 492 QE-VEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEE------PVQE 530 (530) Q Consensus 492 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~ 530 (530) +. .+...++.++.+++++. +.+.+.+.-.+.+ +++. T Consensus 506 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~ 548 (563) T protein:vir:99 506 LLEGDNDDSEEGQSTDSSND---DKEIGTDAQIKGDDNVYRTQTSN 548 (563) T ss_pred ccCCCCCCCCCCCCCCCCCC---ccccccccccccccccccccCcc Confidence 11 11111111111111100 0011111111111 1111 No 118 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=98.49 E-value=4.6e-07 Score=55.33 Aligned_cols=453 Identities=11% Similarity=0.080 Sum_probs=170.2 Q ss_pred CCcccccCCcccHHHHHH------HHHHHHHHhhhHHHHHHHHHHhcccchhhhccccccccccccccc--ccCCcceee Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILS------TKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVED--DNASNIKIS 72 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~------~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~--~~~~n~ki~ 72 (530) |.+.-.++.+.--...+. ..|+|..+.+.+.-..-..+.......+..++. .-........ ....++=|+ T Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~--~~~~~~~l~~~l~~~~~n~i~ 97 (563) T protein:vir:95 20 IAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRS--YMKNEHNLHDVLKKFGNNPIL 97 (563) T ss_pred cceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhccccccccccc--CCCCcccHHHHHHHhhcchHH Confidence 333333222211111111 123333322221111111111111111111100 0000000000 000111111 Q ss_pred cCchhhHHhhhhhhhc---------ccceeeecCCcc----hHHHHHHHHHHhh----c------cHHHHHHHHHHHHhh Q lcl|NC_011308. 73 HGFFAELVDQKTQYLL---------ANGIDVKPTDHD----DQKLCYLIEEYYN----E------EFQSAIQELVEGSTI 129 (530) Q Consensus 73 ~n~~k~Ivd~~~~yl~---------G~pv~~~~~~~~----de~~~~~l~~~~~----~------~~~~~~~e~~~~~~~ 129 (530) ..-...+.+..+.|.+ |=|+.+...+.. .......|..++. + .+......+..+... T Consensus 98 ~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll 177 (563) T protein:vir:95 98 NAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYI 177 (563) T ss_pred HHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHh Confidence 1112223333333322 234544322211 1111222333331 1 234556677888899 Q ss_pred cCeEEEEEE--ecCCCce-EEEEecccceEEEEcCCCCcee-EEEEEEEEeecccccccceEEEEEEEcCCceEEEeecC Q lcl|NC_011308. 130 KGYEGIFAR--TTSEDKL-TFQTVDALQLLPVFDDYGTLQR-IIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKD 205 (530) Q Consensus 130 ~G~a~~~~y--~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~-~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~ 205 (530) +|.+|.++. .+..|++ .+..++|..+.++.+..+.+.. ..+++.... +. ....|..+.+.+++..- T Consensus 178 ~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~-------g~---~~~~~~~~evI~~~~~~ 247 (563) T protein:vir:95 178 YDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVD-------KR---VVASFTSRELAMGIRNP 247 (563) T ss_pred cCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeC-------Cc---eeEEecCcceEEEeccC Confidence 999988765 4556765 4778999999988877654322 122221110 00 11123333332221100 Q ss_pred CcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHH Q lcl|NC_011308. 206 EGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYD 285 (530) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~ 285 (530) .. +. ...-.|.|-++.....|.... T Consensus 248 ~~----------------------------------------------d~---------~~~~~G~Spi~~a~~~i~~~~ 272 (563) T protein:vir:95 248 RT----------------------------------------------EL---------SSSGYGLSEVEIAMKEFIAYN 272 (563) T ss_pred CC----------------------------------------------Cc---------ccCcccchHHHHHHHHHHHHH Confidence 00 00 001146666665555555444 Q ss_pred HHHHHHHHHHHHhccceeee--ecCC-CCc--hhhHHHHHh--------hCcc-eecCCCCceeEEEecCCHHHHHHHHH Q lcl|NC_011308. 286 LMNCFLSNNLQDMAEAIYVV--RGGT-NSP--VDEIKKNIQ--------SKKI-IQTKGEGGLDIQTVDIPYEARKAKMD 351 (530) Q Consensus 286 ~~~S~~~n~~~~~~~~~lvl--~g~~-~~~--~~~~~~~~~--------~~~~-i~~~~~~~~~~lt~~~~~~~~e~~ld 351 (530) .+..-.++.+.-.+.|-.+| .|.. .++ ...++..+. .+++ +.+++|.++.-++.+..+..+..... T Consensus 273 ~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~ 352 (563) T protein:vir:95 273 NTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLN 352 (563) T ss_pred HHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHH Confidence 44444455555555555444 4432 222 122222221 1222 44555555555554444445556667 Q ss_pred HHHHHHHHHhcccC--CCccccc----CCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccee Q lcl|NC_011308. 352 IDELNIYRSGMGFN--SSAVGDG----NATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDI 425 (530) Q Consensus 352 ~L~~~I~~~s~~p~--~~~~~~g----n~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i 425 (530) ...+.|...-.+|. ++...-| +..|..+.. + .-......+++..|.-+++.|...++.+-..++. ..+ T Consensus 353 ~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~--s---n~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~-~~~ 426 (563) T protein:vir:95 353 YLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNE--A---DPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEYG-DKY 426 (563) T ss_pred HHHHHHHHHhCCCHHHccccccccccccccccchhh--c---cHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhcc-ccc Confidence 78888888888885 2221111 111111110 0 0111222344445555455454444432222222 245 Q ss_pred eEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHHHHH--------HH----HHHHHHHHHHHHhhhc Q lcl|NC_011308. 426 KFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEETLKA--------IC----DTLDLDYEDVVKALED 491 (530) Q Consensus 426 ~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~e~~--------~~----e~e~~e~~~~~~~~~~ 491 (530) .+.|.+.-+.+..+..++ ..+...|+++...+++.+++ +++-+.-+. .. ..+....+.....+.. T Consensus 427 ~~~f~r~D~~~~~e~~~~-~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (563) T protein:vir:95 427 TFQFVGGDTKSATDKLNI-LKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMS 505 (563) T ss_pred EEEeccCCHHHHHHHHHH-HHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhccc Confidence 677877766666555443 34577899999998888754 332111100 00 0000001111111111 Q ss_pred cc-cccCCccccCCCCCCCCCCccCcCCCCcccccc------cCCC Q lcl|NC_011308. 492 QE-VEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEE------PVQE 530 (530) Q Consensus 492 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~ 530 (530) +. .+...++.++.+++++. +.+.+.+.-.+.+ +++. T Consensus 506 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~ 548 (563) T protein:vir:95 506 LLEGDNDDSEEGQSTDSSND---DKEIGTDAQIKGDDNVYRTQTSN 548 (563) T ss_pred ccCCCCCCCCCCCCCCCCCC---ccccccccccccccccccccCcc Confidence 11 11111111111111100 0011111111111 1111 No 119 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=98.40 E-value=8.1e-07 Score=53.98 Aligned_cols=397 Identities=10% Similarity=0.026 Sum_probs=174.4 Q ss_pred HHHHHHHHHHHHHhhhHHH-----HHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhc Q lcl|NC_011308. 14 GTILSTKIDEYIRSQNVSL-----ARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLL 88 (530) Q Consensus 14 ~~~i~~~i~~~~~~~~~~~-----~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~ 88 (530) ..+++++..-+.+...... ...+..+.-.... +..+.. ..-.|. .-....|+..+.=+. T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~~~------------~~~v~~--~~al~~--~~v~~~i~~ia~~ia 64 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPS------------TISVKG--KNALKV--ATVFACIKILSESVS 64 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHHHHHhcCCCC------------cceech--hhhhcc--HHHHHHHHHHHHhhc Confidence 1233333321111000000 0001111100000 000000 000111 122334555566666 Q ss_pred ccceeee-cCCcchHH-HHHHHHHHhh---c---cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEE Q lcl|NC_011308. 89 ANGIDVK-PTDHDDQK-LCYLIEEYYN---E---EFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVF 159 (530) Q Consensus 89 G~pv~~~-~~~~~de~-~~~~l~~~~~---~---~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~ 159 (530) +-|+++- ..+.+.+. ...-+..+++ | ........+..+...+|.||.++-++..|.+ ....++|..+-++. T Consensus 65 ~l~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~ 144 (429) T protein:vir:10 65 KLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYI 144 (429) T ss_pred cCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEE Confidence 6777742 22111111 1112333332 1 2234556778888999999999999999986 57789999998887 Q ss_pred cCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceec Q lcl|NC_011308. 160 DDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILD 239 (530) Q Consensus 160 d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (530) ++.+......+.|+..... +. ...|.++.+.|++.... T Consensus 145 ~~~~~~~~~~~~~~~~~~~-----g~----~~~~~~~evih~~~~~~--------------------------------- 182 (429) T protein:vir:10 145 DDVGLLNSKTKMWYVVNTG-----GQ----QRVLKPEEILHFKNGIT--------------------------------- 182 (429) T ss_pred cCcccccccceEEEEEccC-----Ce----EEEEccccEEEecCCCC--------------------------------- Confidence 7665443333333221110 00 11244455554431100 Q ss_pred ccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-CCc--hhhH Q lcl|NC_011308. 240 EGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT-NSP--VDEI 316 (530) Q Consensus 240 ~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~-~~~--~~~~ 316 (530) .+.-.|.|.++.....++....+..-..+.+.-.+.|-.+++... .++ .+.+ T Consensus 183 -------------------------~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~ 237 (429) T protein:vir:10 183 -------------------------LDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVF 237 (429) T ss_pred -------------------------CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHH Confidence 011246677776666666555544444555555555666665422 211 1122 Q ss_pred HHHHh--------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhH Q lcl|NC_011308. 317 KKNIQ--------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLL 386 (530) Q Consensus 317 ~~~~~--------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l 386 (530) +..+. .++++.+++|-+++-+.....+.......+...+.|...-.+|. ++....|+-|++. T Consensus 238 ~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e-------- 309 (429) T protein:vir:10 238 RENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIE-------- 309 (429) T ss_pred HHHHHHHhccccccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHH-------- Confidence 22221 23556666666666555433334444556777888888888885 2222222222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc-c--eeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCC Q lcl|NC_011308. 387 AMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSS-T--DIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAP 463 (530) Q Consensus 387 ~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~-~--~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~ 463 (530) .....++...|+-.++.|...++.+-..+... . .+++.++.-+-.|..+.++....+..+|+++...+++.++ T Consensus 310 ----~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g 385 (429) T protein:vir:10 310 ----QQQQQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKED 385 (429) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhC Confidence 11222344455555555555444322111111 1 2333343444568888888888999999999999988875 Q ss_pred C--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCC Q lcl|NC_011308. 464 R--IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIE 511 (530) Q Consensus 464 ~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (530) + +++-+...... ....-+ ...+...++.+..++...+.++.+ T Consensus 386 l~p~~ggD~~~~~~---n~~~~d---~~~~~~~k~g~~~~~~~~~~~e~~ 429 (429) T protein:vir:10 386 LPPEAGGDRLLVNG---NMLPID---MAGQAYLKGGDTNGEVSKEGNEGN 429 (429) T ss_pred CCCCCCcCeeeecc---cccchh---hccccccCCCCCCCCCCCCCCCCC Confidence 4 22211111000 000000 000000011111111111111111 No 120 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=98.40 E-value=8.1e-07 Score=53.97 Aligned_cols=468 Identities=9% Similarity=-0.013 Sum_probs=179.7 Q ss_pred CCcccc--cCCcccHHHHHHHHHHHHHHhhh--HHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCch Q lcl|NC_011308. 1 MTNTLL--TTAPDRLGTILSTKIDEYIRSQN--VSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFF 76 (530) Q Consensus 1 ~~~~~~--~~~~~~~~~~i~~~i~~~~~~~~--~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~ 76 (530) -+.-+. .=++..+...|++.+..+...+. +.+.....+||.+..+-. ... .+ .+ .+++.+-. T Consensus 14 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~--------~~--gr--s~vv~~~v 79 (763) T protein:vir:95 14 PSQATKLTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAK--PPK--------VK--GR--SQVQPKLV 79 (763) T ss_pred ccchhcCCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCc--ccc--------cC--CC--ccccCHHH Confidence 111111 11222333334444443332111 222223344433332111 111 11 11 13444444 Q ss_pred hhHHhhh----hhhhcccc--eeeecCCcchHHHHH----HHHHHhh--ccHHHHHHHHHHHHhhcCeEEEEEEecC--- Q lcl|NC_011308. 77 AELVDQK----TQYLLANG--IDVKPTDHDDQKLCY----LIEEYYN--EEFQSAIQELVEGSTIKGYEGIFARTTS--- 141 (530) Q Consensus 77 k~Ivd~~----~~yl~G~p--v~~~~~~~~de~~~~----~l~~~~~--~~~~~~~~e~~~~~~~~G~a~~~~y~d~--- 141 (530) ...|+.. ...|+|.+ |.|.....+|.+..+ .++.++. ++-.+..+..++++.++|.++..+|++. T Consensus 80 ~~~ve~~~~~l~~~f~~~~~~~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~ 159 (763) T protein:vir:95 80 RRQAEWRYSALTEPFLGSNKLFKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNREIR 159 (763) T ss_pred HHHHHHHHHHHHHhhcCCCcEEEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeeeeee Confidence 4444443 34444433 256666666655444 3444343 3344677899999999999988877641 Q ss_pred ---------------------------------------------------------------------------CCceE Q lcl|NC_011308. 142 ---------------------------------------------------------------------------EDKLT 146 (530) Q Consensus 142 ---------------------------------------------------------------------------~g~~~ 146 (530) .+.++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ 239 (763) T protein:vir:95 160 KEKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPT 239 (763) T ss_pred eeeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceE Confidence 12346 Q ss_pred EEEecccceEEEEcCCCC---ceeE-EEEEEEEeecccccccceEEEEEEEc--CCceEE--------EeecCCcc---c Q lcl|NC_011308. 147 FQTVDALQLLPVFDDYGT---LQRI-IRFYTEQRYSDADNKFNSIGHADVWT--DTEVWY--------YVQKDEGR---S 209 (530) Q Consensus 147 ~~~~~p~~~~~v~d~~~~---~~~~-~~~y~~~~~~~~~~~~~~~~~~evyt--~~~~~~--------y~~~~~~~---~ 209 (530) +..|||+++|+=.+-..+ ...+ .+++.... +-... ......++... ...... +...+..+ . T Consensus 240 ie~V~p~d~~iDp~a~sD~~Da~~~~~~~~~t~~-dL~~~-~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 317 (763) T protein:vir:95 240 VEMLNPENIIIDPSCQGDINKAMFAIVSFETCKA-DLLKE-KDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRK 317 (763) T ss_pred EEeecHHHheecCCCCCchhhCceEeeEEeccHH-HHHhc-cCCccccchhcchhccccccccccccchhhccCCCcccc Confidence 677899888752221111 1121 12221110 00000 00000000000 000000 00000000 0 Q ss_pred chhhcc----ccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHHHH Q lcl|NC_011308. 210 DEYVLD----TTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVKSI 280 (530) Q Consensus 210 ~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~l 280 (530) ...+++ ..+......+.. .+...+ .........+.++|++|++.+.. .-+|.|.+..++++ T Consensus 318 ~V~v~E~y~~~d~~gdg~~~~~--------~v~~~g--~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~~~~d~ 387 (763) T protein:vir:95 318 RVVAYEYWGFWDIEGNGVLEPI--------VATWIG--STLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAELLGDN 387 (763) T ss_pred eEEEEEeeeeeccCCcceeEEE--------EEEEEc--CeeeecccccccCCCcCEEEecceeecCcccCCchHHHhhHH Confidence 000000 000000000000 011111 11222333445567777765443 44689999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcccee-eeecCCCCchhhHHHHHhhCcceecCCCCce----eEEEecCCHHHHHHHHHHHHH Q lcl|NC_011308. 281 IDDYDLMNCFLSNNLQDMAEAIY-VVRGGTNSPVDEIKKNIQSKKIIQTKGEGGL----DIQTVDIPYEARKAKMDIDEL 355 (530) Q Consensus 281 iDa~~~~~S~~~n~~~~~~~~~l-vl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~----~~lt~~~~~~~~e~~ld~L~~ 355 (530) ++.+|.+.|...+.+.-.++|.+ +..|.. +..+... .+.++++.+..+++. ..+..+....+....+..+.. T Consensus 388 Qr~~N~~~~~~~d~l~~~~~~~~~v~~gav-~~~d~~~--~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~~~~ 464 (763) T protein:vir:95 388 QAVLGAVMRGMIDLLGRSANGQRGMPKGML-DALNSRR--YREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATLQNQ 464 (763) T ss_pred HHHHHHHHHHHHHHHHhhcCCcEEeecccc-cchhhhc--ccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHHHHH Confidence 99999999999999998888755 445543 2223222 345566666555443 233333333455555666666 Q ss_pred HHHHHhcccCCCcc----ccc-CCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---------Cccc Q lcl|NC_011308. 356 NIYRSGMGFNSSAV----GDG-NATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGL---------GDYS 421 (530) Q Consensus 356 ~I~~~s~~p~~~~~----~~g-n~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~---------~~~d 421 (530) .+=..|.+++.+.. ..| .+||++. +...........-+.|.++++.+++.+++++..... .++. T Consensus 465 ~~e~~TGv~~~~~G~~~~~~~~tat~v~~--l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v 542 (763) T protein:vir:95 465 EAESLTGVKAFAGGVTGESYGDVAAGIRG--VLDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFV 542 (763) T ss_pred HHHHhhCcchhhcCcCcccccchhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCCccc Confidence 66666766665322 122 2344433 334444444444456666776666666665543211 1110 Q ss_pred c---------ceeeEEeCCCCCCCH-HHHHHHHHHHHhcCCCcHHHHHHhC-CCCCCHHHHHHHHHHH--HHHHHHHHHh Q lcl|NC_011308. 422 S---------TDIKFDIEPYILANE-LDLAMIDKTEAETNQIQINNLLAIA-PRIGDEETLKAICDTL--DLDYEDVVKA 488 (530) Q Consensus 422 ~---------~~i~i~f~~~~P~n~-~e~a~~~~~~~~~g~iS~et~l~~~-~~vdd~~~e~~~~e~e--~~e~~~~~~~ 488 (530) . .+|.+.- . |... .+.++.+. .++..+ |.++ +......+.+. -....+.... T Consensus 543 ~v~~~~~~~~~DV~V~~--~-~as~~~q~~~~l~-----------~ll~~l~~~~~-~~~~~~il~~~~d~~~~~~~~~~ 607 (763) T protein:vir:95 543 TIKREDLKGNFDLEVDI--S-TAEVDNQKSQDLG-----------FMLQTIGPNVD-QQITLNILAEIADLKRMPKLAHD 607 (763) T ss_pred cccHHHhcCCcceEEec--c-cchHHHHHHHHHH-----------HHHHHhccccC-hHHHHHHHHHHHhhhchhhhHHH Confidence 0 0111111 1 1111 01111000 011112 1111 11111111110 0001111111 Q ss_pred hhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 489 LEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) +..... .. ++..+.. .+.....+..+ T Consensus 608 lr~~q~-----~~--------d~~~q~q---aqle~~~~q~e 633 (763) T protein:vir:95 608 LRTWQP-----QP--------DPVQEQL---KQLAVEKAQLE 633 (763) T ss_pred HHhcCC-----Cc--------cchhhhH---HHHHHHHHHHH Confidence 111100 00 0000000 00000000000 No 121 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=98.39 E-value=8.7e-07 Score=53.81 Aligned_cols=473 Identities=8% Similarity=-0.005 Sum_probs=212.1 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |+.+++.+. .+.+.+ +|..+....++||+.--.-.. ...+..........+.+.||..+-+...++..++.|+|- T Consensus 1 ~~~~~l~~r-~~~l~~-~R~~~e~~w~e~~~~~lP~~~---~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~ 75 (547) T protein:vir:10 1 MENSKIVKR-LDFLKT-DRKNVEQIWDCIRKYIMPMRS---DFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGS 75 (547) T ss_pred CCHHHHHHH-HHHHHH-HhhHHHHHHHHHHHHhccccc---ccccCCCCCcccccccccccccchHHHHHHHHHHHHHHh Confidence 888887655 344433 333333333333332111000 000000000000112355777777777777777776652 Q ss_pred --cee-----eecCCcc---hHH-------HHHHHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecC--CCceEEEEe Q lcl|NC_011308. 91 --GID-----VKPTDHD---DQK-------LCYLIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTS--EDKLTFQTV 150 (530) Q Consensus 91 --pv~-----~~~~~~~---de~-------~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~--~g~~~~~~~ 150 (530) |+. +...+.. ... ....+...+ ..+|....+++.++..++|.+..++-.|. .+.++++.+ T Consensus 76 ltPp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~~ 155 (547) T protein:vir:10 76 LTSPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQSS 155 (547) T ss_pred hcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEEEe Confidence 211 2222211 111 222233333 35788889999999999999976665443 357889999 Q ss_pred cccceEEEEcCCCCceeEEEEEEEEeec-------c----------cccccceEEEEEEEcCCceEEEeecCCcccchhh Q lcl|NC_011308. 151 DALQLLPVFDDYGTLQRIIRFYTEQRYS-------D----------ADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYV 213 (530) Q Consensus 151 ~p~~~~~v~d~~~~~~~~~~~y~~~~~~-------~----------~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~ 213 (530) +..++++--|..+....++|.|...... . .......-..+++|+. .|............ T Consensus 156 pl~~~~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~----v~~~~~~~~~~~~~ 231 (547) T protein:vir:10 156 PIQDSYFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMC----VFTRYDKKQNRNAG 231 (547) T ss_pred ecceEEEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEE----EeeccCCCCCcccc Confidence 9999988888888887777765421110 0 0000110112222211 11111100000000 Q ss_pred ccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHHHHHHHHHHHH Q lcl|NC_011308. 214 LDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVKSIIDDYDLMN 288 (530) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~liDa~~~~~ 288 (530) ........+...+.. +...........+|...|++.++- +.+|.|--++..+-+..++.+. T Consensus 232 ~~~~~~~~p~~s~~~--------------e~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~ 297 (547) T protein:vir:10 232 TVLAPTERPFGKKWI--------------LKEGAVQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYV 297 (547) T ss_pred ceeeccccceeEEEE--------------EecCceeeeecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHH Confidence 000000000000000 000011112234566677776654 4589999999999999999999 Q ss_pred HHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCc Q lcl|NC_011308. 289 CFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSA 368 (530) Q Consensus 289 S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~ 368 (530) -......+...+|.+.+.-.+... . .++..++++..++.++++-+....+.......++.++..|=..-+..-+.- T Consensus 298 ~~~l~~~~~~~~pp~~v~~~g~~~--~--~~~~pgg~~~~~~~~~v~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~ 373 (547) T protein:vir:10 298 ELVLRSSEKVIDPAIMVTERGLIS--D--IDLGASGLTVVRDMESMKPFESRARFDVSSIQLTDLRSAVRRIYYVDQLQM 373 (547) T ss_pred HHHHHHHHHHhcCceecccccccc--c--ceecCCeeeecCCcccceeeecccchHHHHHHHHHHHHHHHHHhhhhhhhc Confidence 999999999999988764221111 1 234556777677777788787777777777777777776654333211111 Q ss_pred ccccCCcHHHHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCC--------ccccceeeEEeCCCCCCCH-- Q lcl|NC_011308. 369 VGDGNATNVVIKSRYTLLAMK-AQKTEIALRKTLRWTADLVVEDIRRRGLG--------DYSSTDIKFDIEPYILANE-- 437 (530) Q Consensus 369 ~~~gn~SGvAik~~~~~l~~k-a~~ke~~f~~~l~~~~~~i~~~l~~~~~~--------~~d~~~i~i~f~~~~P~n~-- 437 (530) .+....++..+..+-.-+... .....+.-.+.|.=+++-++.++...+.. +.....++|++...|-+.. T Consensus 374 ~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~ 453 (547) T protein:vir:10 374 KDSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKI 453 (547) T ss_pred CCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHHHH Confidence 122334554444333222221 12222222222222222233334333321 1133466777765554432 Q ss_pred HHHHHHHHHHHhcC-----------CCcHHHHHHh----CC----CCCCHHHHHHHHHHHHHHHHH--HHHhhhcccccc Q lcl|NC_011308. 438 LDLAMIDKTEAETN-----------QIQINNLLAI----AP----RIGDEETLKAICDTLDLDYED--VVKALEDQEVEE 496 (530) Q Consensus 438 ~e~a~~~~~~~~~g-----------~iS~et~l~~----~~----~vdd~~~e~~~~e~e~~e~~~--~~~~~~~~~~~~ 496 (530) .+.+.+.+.+...+ .+.-..++.. ++ .+.. ++|.+.+.+++.+.+. .+.....+.... T Consensus 454 ~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs-~eev~~~r~qr~~~~q~~~qaa~~~~~g~~ 532 (547) T protein:vir:10 454 DQAASIERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQTLMRP-KAKVTSIRKNRSQTQQKAEQAAIAEAEGNA 532 (547) T ss_pred HHHHHHHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCChhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11221111111111 1222333322 21 1221 2343333333322222 212222221111 Q ss_pred CCccccCCCCCCCCCC Q lcl|NC_011308. 497 LEPTVTPIIDPLTIEP 512 (530) Q Consensus 497 ~~~~~~~~~~~~~~~~ 512 (530) ...-+.+.. +-.++. T Consensus 533 m~~~~~~~a-~~~~~~ 547 (547) T protein:vir:10 533 MEAQGKGQA-ALKENQ 547 (547) T ss_pred HHhhcCccc-chhccC Confidence 111111110 001111 No 122 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=98.39 E-value=8.7e-07 Score=53.81 Aligned_cols=442 Identities=9% Similarity=-0.037 Sum_probs=209.3 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcc----------eeecCchhhHH Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNI----------KISHGFFAELV 80 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~----------ki~~n~~k~Iv 80 (530) |..-.++. .. ...-......||.+...- .+....+.............+. =..+++++-.| T Consensus 1 ~~~~~~~~-----~~---~~~~~~~~~~~~~~a~~~-~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av 71 (530) T protein:vir:38 1 MKIPSLVG-----PD---GKTSLREYAGYHGGGGGF-GGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAV 71 (530) T ss_pred Cccceeec-----Cc---cccchHHHhhhhcccCCC-CCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Confidence 22111111 00 011122334455543210 0111000000000000000000 11367899999 Q ss_pred hhhhhhhcccceeeecCC---------cchHHHHHHHHHHhh----c-----------cHHHHHHHHHHHHhhcCeEEEE Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTD---------HDDQKLCYLIEEYYN----E-----------EFQSAIQELVEGSTIKGYEGIF 136 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~---------~~de~~~~~l~~~~~----~-----------~~~~~~~e~~~~~~~~G~a~~~ 136 (530) +..+.+++|..++..+.. ..+++..+.+...+. + +|.....-+.+...+.|.+|.. T Consensus 72 ~~~~~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~ 151 (530) T protein:vir:38 72 QLHQDHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQ 151 (530) T ss_pred HHHHHHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEE Confidence 999999999999876531 234445555554442 1 2344445566778899999987 Q ss_pred EEecCC-C---ceEEEEecccceEEEEcC--CCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccc Q lcl|NC_011308. 137 ARTTSE-D---KLTFQTVDALQLLPVFDD--YGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSD 210 (530) Q Consensus 137 ~y~d~~-g---~~~~~~~~p~~~~~v~d~--~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~ 210 (530) ..+... | -+++.+++|..+---++. ...+...|. ++ ..+..+.| .++. ....+... T Consensus 152 ~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe------~d---~~Gr~~aY-~i~~--------~~~~~~~~ 213 (530) T protein:vir:38 152 ATWDSDSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVK------IN---DSGAALGY-YVSD--------DGYPGWMA 213 (530) T ss_pred eeeccCCCCccceEEEEechhhcCCCCCCCCCCeeEeeeE------EC---CCCceEEE-EEee--------ccCCCccc Confidence 765544 3 267888998775311111 111111111 01 11111111 1111 10010000 Q ss_pred hhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCC-----cCCCCcHHHHHHHHHHHH Q lcl|NC_011308. 211 EYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNN-----KLGISDIKKVKSIIDDYD 285 (530) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn-----~~~~sd~e~v~~liDa~~ 285 (530) .. +.-......++.--|+|+... .-|.|.|..++..+..++ T Consensus 214 ~~----------------------------------~~~~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~ 259 (530) T protein:vir:38 214 QN----------------------------------WTYIPRELPGGRPSFIHVFEPMEDGQTRGANAFYSVMEQMKMLD 259 (530) T ss_pred cc----------------------------------cceeeeeeccChhHeEeeccccCCCcccCCchHHHHHHHHHHHh Confidence 00 000000011111124554443 358999998877766555 Q ss_pred HHHHHHHHHHHHhccceeeeec-------------CCCCchhh---------------HHHHHhhCcceecCCCCceeEE Q lcl|NC_011308. 286 LMNCFLSNNLQDMAEAIYVVRG-------------GTNSPVDE---------------IKKNIQSKKIIQTKGEGGLDIQ 337 (530) Q Consensus 286 ~~~S~~~n~~~~~~~~~lvl~g-------------~~~~~~~~---------------~~~~~~~~~~i~~~~~~~~~~l 337 (530) .-.--......--+.-..+|+. .+..+... ....+..+.+..+..|.+++++ T Consensus 260 ~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~ 339 (530) T protein:vir:38 260 TLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSLNLQ 339 (530) T ss_pred HHHHHHHHHHHHhhhheeeeeccCCccccccccccCCcccccccccccchhhhhcccccceeccCceeeecCCCCeeeee Confidence 4332222111111111122221 11110000 0012456667778889999999 Q ss_pred EecCCHHHHHHHHHHHHHHHHHHhcccCCCc-ccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhc Q lcl|NC_011308. 338 TVDIPYEARKAKMDIDELNIYRSGMGFNSSA-VGDGNATNVVIKSRYTLLAMKAQKTEIALRKTL-RWTADLVVEDIRRR 415 (530) Q Consensus 338 t~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~-~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l-~~~~~~i~~~l~~~ 415 (530) +.+.+...+..++..+.+.|-.-..+|--.- ..++++|-.+.+.-+......+...+..|...+ +.+++..+...-.. T Consensus 340 ~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~ 419 (530) T protein:vir:38 340 SAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVR 419 (530) T ss_pred CCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHc Confidence 9999989999999999999988888773211 124556667788877777777777666655543 33333222211111 Q ss_pred C------CCccccc-----eeeEEeCC--CCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_011308. 416 G------LGDYSST-----DIKFDIEP--YILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDY 482 (530) Q Consensus 416 ~------~~~~d~~-----~i~i~f~~--~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~ 482 (530) + ...+++. .....|.. -.-.|-.-.++.....+.+|+.|.+.++...+. |+++.++++.+|.+.. T Consensus 420 G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G~--D~~~v~~q~a~e~~~~ 497 (530) T protein:vir:38 420 RVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRGD--DYQEIFAQQVRESMER 497 (530) T ss_pred CCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCC--CHHHHHHHHHHHHHHH Confidence 2 1111211 12344432 222466666677788999999999999999875 8999988888877665 Q ss_pred HHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcc Q lcl|NC_011308. 483 EDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPV 522 (530) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (530) ++.--.+.. ........ ...+++++++.+..+. T Consensus 498 ~~~Gl~~~~---~~~~~~~~----~~~~~~~~~~d~~~~a 530 (530) T protein:vir:38 498 RAAGLNPPA---WAAAAFEA----GVKKSNEEEQDGARAA 530 (530) T ss_pred HHcCCCCCC---CcccccCC----CCCCCCCCCCCCCCCC Confidence 443111100 00000011 1112222222222222 No 123 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=98.38 E-value=9.3e-07 Score=53.64 Aligned_cols=391 Identities=8% Similarity=0.026 Sum_probs=175.0 Q ss_pred HHHHHHH--hcccchhhhcccc-ccccc--ccc---c----ccccCCcce--eecCchhhHHhhhhhhhcccceeee-cC Q lcl|NC_011308. 33 ARVGQRY--YNQDNDIENTRIM-WMNDH--GDI---V----EDDNASNIK--ISHGFFAELVDQKTQYLLANGIDVK-PT 97 (530) Q Consensus 33 ~~~~~~Y--Y~g~~~I~~r~~~-~~~~~--~~~---~----~~~~~~n~k--i~~n~~k~Ivd~~~~yl~G~pv~~~-~~ 97 (530) |-...+. .-|.+ .|... ..... ... . ......+.+ +.+.-....|+..++=+.+-|+.+. .. T Consensus 1 M~~~~r~~~~~~~~---~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~ 77 (432) T protein:vir:10 1 MKIVDSVKKFFNFE---KRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQED 77 (432) T ss_pred CChHHHHHHhcCcc---ccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEec Confidence 4444443 10100 00000 00000 000 0 000000000 0011112245555566666777742 21 Q ss_pred Ccch-HHHHHHHHHHhh---c---cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCceeEE Q lcl|NC_011308. 98 DHDD-QKLCYLIEEYYN---E---EFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQRII 169 (530) Q Consensus 98 ~~~d-e~~~~~l~~~~~---~---~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~ 169 (530) +.+. +....-|..+++ | .-......+......+|.+|.++-++..|++ .+..++|..|-++.|+.+....-. T Consensus 78 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~ 157 (432) T protein:vir:10 78 EYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKT 157 (432) T ss_pred CCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccc Confidence 1111 111111333321 1 2234566778888999999999999999986 577899999988887655433322 Q ss_pred EEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccccccccc Q lcl|NC_011308. 170 RFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQ 249 (530) Q Consensus 170 ~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 249 (530) +.|+..... +. ...|.++++.|++.... T Consensus 158 ~~~y~~~~~-----g~----~~~~~~~eiih~r~~~~------------------------------------------- 185 (432) T protein:vir:10 158 KMWYVVNTG-----GQ----QRVLKPEEILHFKNGIT------------------------------------------- 185 (432) T ss_pred eEEEEEecC-----Ce----EEEEccccEEEecCCCC------------------------------------------- Confidence 222221100 00 11244555555431100 Q ss_pred ccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-CCc--hhhHHHHHh----- Q lcl|NC_011308. 250 VLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT-NSP--VDEIKKNIQ----- 321 (530) Q Consensus 250 ~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~-~~~--~~~~~~~~~----- 321 (530) + +.-.|.|.+......|+....+..-.++.+.-.+.|-.+++... .++ .+.++..+. T Consensus 186 ------~---------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g 250 (432) T protein:vir:10 186 ------L---------DGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSG 250 (432) T ss_pred ------C---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcc Confidence 0 11147777776666666655555555555555556766665422 211 112222211 Q ss_pred ---hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHHHHHHH Q lcl|NC_011308. 322 ---SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQKTEIA 396 (530) Q Consensus 322 ---~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~ 396 (530) .++++.+++|.+++-++....+..+....+...+.|...-.+|. ++...-|+-|++ . .....+ T Consensus 251 ~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~--e----------~~~~~~ 318 (432) T protein:vir:10 251 LQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNI--E----------QQQQQF 318 (432) T ss_pred cccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccH--H----------HHHHHH Confidence 23566677766666665444444444556777888888888884 222222222221 1 112223 Q ss_pred HHHHHHHHHHHHHHHHHhcCC--Cccc-cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHHH Q lcl|NC_011308. 397 LRKTLRWTADLVVEDIRRRGL--GDYS-STDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEETL 471 (530) Q Consensus 397 f~~~l~~~~~~i~~~l~~~~~--~~~d-~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~e 471 (530) ++..|+-.++.|...++.+-. .... ...+++.++.-+-.|..+.++....+..+|+++...+++.+++ +++-+.. T Consensus 319 ~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~ 398 (432) T protein:vir:10 319 YTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRL 398 (432) T ss_pred HHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeE Confidence 444555555555554443221 1111 1124444545556788888888889999999999999888764 2221111 Q ss_pred HHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCC Q lcl|NC_011308. 472 KAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIE 511 (530) Q Consensus 472 ~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (530) .-... ... .....+...++.+..++...+.++.+ T Consensus 399 ~~~~n---~~~---~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 399 LVNGN---MLP---IDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred eeccc---ccc---hhhccccccCCCCCCCCCCCCCCCCC Confidence 00000 000 00000000011110011000111111 No 124 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=98.38 E-value=9.3e-07 Score=53.64 Aligned_cols=391 Identities=8% Similarity=0.026 Sum_probs=175.0 Q ss_pred HHHHHHH--hcccchhhhcccc-ccccc--ccc---c----ccccCCcce--eecCchhhHHhhhhhhhcccceeee-cC Q lcl|NC_011308. 33 ARVGQRY--YNQDNDIENTRIM-WMNDH--GDI---V----EDDNASNIK--ISHGFFAELVDQKTQYLLANGIDVK-PT 97 (530) Q Consensus 33 ~~~~~~Y--Y~g~~~I~~r~~~-~~~~~--~~~---~----~~~~~~n~k--i~~n~~k~Ivd~~~~yl~G~pv~~~-~~ 97 (530) |-...+. .-|.+ .|... ..... ... . ......+.+ +.+.-....|+..++=+.+-|+.+. .. T Consensus 1 M~~~~r~~~~~~~~---~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~ 77 (432) T protein:vir:10 1 MKIVDSVKKFFNFE---KRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQED 77 (432) T ss_pred CChHHHHHHhcCcc---ccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEec Confidence 4444443 10100 00000 00000 000 0 000000000 0011112245555566666777742 21 Q ss_pred Ccch-HHHHHHHHHHhh---c---cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCceeEE Q lcl|NC_011308. 98 DHDD-QKLCYLIEEYYN---E---EFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQRII 169 (530) Q Consensus 98 ~~~d-e~~~~~l~~~~~---~---~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~ 169 (530) +.+. +....-|..+++ | .-......+......+|.+|.++-++..|++ .+..++|..|-++.|+.+....-. T Consensus 78 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~ 157 (432) T protein:vir:10 78 EYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKT 157 (432) T ss_pred CCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccc Confidence 1111 111111333321 1 2234566778888999999999999999986 577899999988887655433322 Q ss_pred EEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccccccccc Q lcl|NC_011308. 170 RFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQ 249 (530) Q Consensus 170 ~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 249 (530) +.|+..... +. ...|.++++.|++.... T Consensus 158 ~~~y~~~~~-----g~----~~~~~~~eiih~r~~~~------------------------------------------- 185 (432) T protein:vir:10 158 KMWYVVNTG-----GQ----QRVLKPEEILHFKNGIT------------------------------------------- 185 (432) T ss_pred eEEEEEecC-----Ce----EEEEccccEEEecCCCC------------------------------------------- Confidence 222221100 00 11244555555431100 Q ss_pred ccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-CCc--hhhHHHHHh----- Q lcl|NC_011308. 250 VLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT-NSP--VDEIKKNIQ----- 321 (530) Q Consensus 250 ~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~-~~~--~~~~~~~~~----- 321 (530) + +.-.|.|.+......|+....+..-.++.+.-.+.|-.+++... .++ .+.++..+. T Consensus 186 ------~---------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g 250 (432) T protein:vir:10 186 ------L---------DGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSG 250 (432) T ss_pred ------C---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcc Confidence 0 11147777776666666655555555555555556766665422 211 112222211 Q ss_pred ---hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHHHHHHH Q lcl|NC_011308. 322 ---SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQKTEIA 396 (530) Q Consensus 322 ---~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~ 396 (530) .++++.+++|.+++-++....+..+....+...+.|...-.+|. ++...-|+-|++ . .....+ T Consensus 251 ~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~--e----------~~~~~~ 318 (432) T protein:vir:10 251 LQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNI--E----------QQQQQF 318 (432) T ss_pred cccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccH--H----------HHHHHH Confidence 23566677766666665444444444556777888888888884 222222222221 1 112223 Q ss_pred HHHHHHHHHHHHHHHHHhcCC--Cccc-cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHHH Q lcl|NC_011308. 397 LRKTLRWTADLVVEDIRRRGL--GDYS-STDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEETL 471 (530) Q Consensus 397 f~~~l~~~~~~i~~~l~~~~~--~~~d-~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~e 471 (530) ++..|+-.++.|...++.+-. .... ...+++.++.-+-.|..+.++....+..+|+++...+++.+++ +++-+.. T Consensus 319 ~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~ 398 (432) T protein:vir:10 319 YTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRL 398 (432) T ss_pred HHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeE Confidence 444555555555554443221 1111 1124444545556788888888889999999999999888764 2221111 Q ss_pred HHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCC Q lcl|NC_011308. 472 KAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIE 511 (530) Q Consensus 472 ~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (530) .-... ... .....+...++.+..++...+.++.+ T Consensus 399 ~~~~n---~~~---~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 399 LVNGN---MLP---IDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred eeccc---ccc---hhhccccccCCCCCCCCCCCCCCCCC Confidence 00000 000 00000000011110011000111111 No 125 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=98.38 E-value=9.3e-07 Score=53.64 Aligned_cols=391 Identities=8% Similarity=0.026 Sum_probs=175.0 Q ss_pred HHHHHHH--hcccchhhhcccc-ccccc--ccc---c----ccccCCcce--eecCchhhHHhhhhhhhcccceeee-cC Q lcl|NC_011308. 33 ARVGQRY--YNQDNDIENTRIM-WMNDH--GDI---V----EDDNASNIK--ISHGFFAELVDQKTQYLLANGIDVK-PT 97 (530) Q Consensus 33 ~~~~~~Y--Y~g~~~I~~r~~~-~~~~~--~~~---~----~~~~~~n~k--i~~n~~k~Ivd~~~~yl~G~pv~~~-~~ 97 (530) |-...+. .-|.+ .|... ..... ... . ......+.+ +.+.-....|+..++=+.+-|+.+. .. T Consensus 1 M~~~~r~~~~~~~~---~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~ 77 (432) T protein:vir:10 1 MKIVDSVKKFFNFE---KRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQED 77 (432) T ss_pred CChHHHHHHhcCcc---ccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEec Confidence 4444443 10100 00000 00000 000 0 000000000 0011112245555566666777742 21 Q ss_pred Ccch-HHHHHHHHHHhh---c---cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCceeEE Q lcl|NC_011308. 98 DHDD-QKLCYLIEEYYN---E---EFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQRII 169 (530) Q Consensus 98 ~~~d-e~~~~~l~~~~~---~---~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~ 169 (530) +.+. +....-|..+++ | .-......+......+|.+|.++-++..|++ .+..++|..|-++.|+.+....-. T Consensus 78 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~ 157 (432) T protein:vir:10 78 EYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKT 157 (432) T ss_pred CCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccc Confidence 1111 111111333321 1 2234566778888999999999999999986 577899999988887655433322 Q ss_pred EEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccccccccc Q lcl|NC_011308. 170 RFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQ 249 (530) Q Consensus 170 ~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 249 (530) +.|+..... +. ...|.++++.|++.... T Consensus 158 ~~~y~~~~~-----g~----~~~~~~~eiih~r~~~~------------------------------------------- 185 (432) T protein:vir:10 158 KMWYVVNTG-----GQ----QRVLKPEEILHFKNGIT------------------------------------------- 185 (432) T ss_pred eEEEEEecC-----Ce----EEEEccccEEEecCCCC------------------------------------------- Confidence 222221100 00 11244555555431100 Q ss_pred ccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-CCc--hhhHHHHHh----- Q lcl|NC_011308. 250 VLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT-NSP--VDEIKKNIQ----- 321 (530) Q Consensus 250 ~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~-~~~--~~~~~~~~~----- 321 (530) + +.-.|.|.+......|+....+..-.++.+.-.+.|-.+++... .++ .+.++..+. T Consensus 186 ------~---------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g 250 (432) T protein:vir:10 186 ------L---------DGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSG 250 (432) T ss_pred ------C---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcc Confidence 0 11147777776666666655555555555555556766665422 211 112222211 Q ss_pred ---hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHHHHHHH Q lcl|NC_011308. 322 ---SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQKTEIA 396 (530) Q Consensus 322 ---~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~ 396 (530) .++++.+++|.+++-++....+..+....+...+.|...-.+|. ++...-|+-|++ . .....+ T Consensus 251 ~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~--e----------~~~~~~ 318 (432) T protein:vir:10 251 LQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNI--E----------QQQQQF 318 (432) T ss_pred cccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccH--H----------HHHHHH Confidence 23566677766666665444444444556777888888888884 222222222221 1 112223 Q ss_pred HHHHHHHHHHHHHHHHHhcCC--Cccc-cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHHH Q lcl|NC_011308. 397 LRKTLRWTADLVVEDIRRRGL--GDYS-STDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEETL 471 (530) Q Consensus 397 f~~~l~~~~~~i~~~l~~~~~--~~~d-~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~e 471 (530) ++..|+-.++.|...++.+-. .... ...+++.++.-+-.|..+.++....+..+|+++...+++.+++ +++-+.. T Consensus 319 ~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~ 398 (432) T protein:vir:10 319 YTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRL 398 (432) T ss_pred HHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeE Confidence 444555555555554443221 1111 1124444545556788888888889999999999999888764 2221111 Q ss_pred HHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCC Q lcl|NC_011308. 472 KAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIE 511 (530) Q Consensus 472 ~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (530) .-... ... .....+...++.+..++...+.++.+ T Consensus 399 ~~~~n---~~~---~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 399 LVNGN---MLP---IDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred eeccc---ccc---hhhccccccCCCCCCCCCCCCCCCCC Confidence 00000 000 00000000011110011000111111 No 126 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=98.37 E-value=1e-06 Score=53.47 Aligned_cols=444 Identities=10% Similarity=0.068 Sum_probs=170.7 Q ss_pred cCCcccHHHHHHHHHH---------HHHHhh-------hHHHHHHHHHHhcccchhhhcccc--cccccccccccccCCc Q lcl|NC_011308. 7 TTAPDRLGTILSTKID---------EYIRSQ-------NVSLARVGQRYYNQDNDIENTRIM--WMNDHGDIVEDDNASN 68 (530) Q Consensus 7 ~~~~~~~~~~i~~~i~---------~~~~~~-------~~~~~~~~~~YY~g~~~I~~r~~~--~~~~~~~~~~~~~~~n 68 (530) -+..| -|++.+.. .|.++. +......+.++-.++.....++.- ..-..+...+...+|. T Consensus 1 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~~r~~~~~~ 77 (551) T protein:vir:80 1 MKNKL---GLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNN 77 (551) T ss_pred Cchhh---hhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccccceecCcccccCccccCh Confidence 11111 12222210 111110 011123444554444433222211 0001111111111111 Q ss_pred cee-------e-cCchhhHHhhhhhhhc-----------ccceeeecCC------cchHHHHHHHHHHhhc--------- Q lcl|NC_011308. 69 IKI-------S-HGFFAELVDQKTQYLL-----------ANGIDVKPTD------HDDQKLCYLIEEYYNE--------- 114 (530) Q Consensus 69 ~ki-------~-~n~~k~Ivd~~~~yl~-----------G~pv~~~~~~------~~de~~~~~l~~~~~~--------- 114 (530) ..+ + .+..+.+|+..+.-+. |.+..+...+ ..+......+.+++.. T Consensus 78 ~~l~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~ 157 (551) T protein:vir:80 78 QDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINR 157 (551) T ss_pred hHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCcc Confidence 000 0 1233334444333221 1222222211 1122333344444421 Q ss_pred -cHHHHHHHHHHHHhhcCeEEEEEEecCCCceE-EEEecccceEEEEcCCCCc-eeEEEEEEEEeecccccccceEEEEE Q lcl|NC_011308. 115 -EFQSAIQELVEGSTIKGYEGIFARTTSEDKLT-FQTVDALQLLPVFDDYGTL-QRIIRFYTEQRYSDADNKFNSIGHAD 191 (530) Q Consensus 115 -~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~-~~~~~p~~~~~v~d~~~~~-~~~~~~y~~~~~~~~~~~~~~~~~~e 191 (530) .+......+..+...+|.+|.++-++..|++. +..++|..+.++.+..+.. ...++|+... . +. ... T Consensus 158 ~s~~~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~~~~~~~~y~~~~--~-----g~---~~~ 227 (551) T protein:vir:80 158 DSFSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVI--D-----QK---IVA 227 (551) T ss_pred chHHHHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCccccccCceEEEEEe--C-----Cc---EEE Confidence 22344555677788999999988889999864 7889999998887766532 1122222210 0 00 111 Q ss_pred EEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCC Q lcl|NC_011308. 192 VWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGI 271 (530) Q Consensus 192 vyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~ 271 (530) .|..+.+.|++..... ..+ ..-.|. T Consensus 228 ~~~~~eiiH~~~n~~~--------------------------------------------------~~~-----~~~~G~ 252 (551) T protein:vir:80 228 TFNAREMAFAVRNPRS--------------------------------------------------DIY-----ATGYGY 252 (551) T ss_pred EEcccceEEecccCCC--------------------------------------------------Ccc-----cccccc Confidence 2445555554321100 000 011366 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHhccce--eeeecCC-CCc--hhhHHHHHh--------hCcceecCCCCceeEEE Q lcl|NC_011308. 272 SDIKKVKSIIDDYDLMNCFLSNNLQDMAEAI--YVVRGGT-NSP--VDEIKKNIQ--------SKKIIQTKGEGGLDIQT 338 (530) Q Consensus 272 sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~--lvl~g~~-~~~--~~~~~~~~~--------~~~~i~~~~~~~~~~lt 338 (530) |-++.....|.....+..-..+.+.-.+.|- |.++|.. .++ .+.++..+. .+++..+. +++++|.. T Consensus 253 spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~vl~-~~g~~~~~ 331 (551) T protein:vir:80 253 PELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVS-AEDVKFVN 331 (551) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcccccc-CCCceEEE Confidence 6666555555544444444444444444454 4445532 222 122232221 11222232 33455555 Q ss_pred ecCC--HHHHHHHHHHHHHHHHHHhcccC--CCcccccCC---cHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 339 VDIP--YEARKAKMDIDELNIYRSGMGFN--SSAVGDGNA---TNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVED 411 (530) Q Consensus 339 ~~~~--~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~---SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~ 411 (530) ..++ +..+....+...+.|...-.+|. ++...-+.. .+-.+- ++.+ ......+++..|.-+++.|... T Consensus 332 l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t--~sn~---e~~~~~f~~~tL~P~~~~ie~~ 406 (551) T protein:vir:80 332 MTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLN--EGNS---AEKNQASKNKGLQPLLGFIEDF 406 (551) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccc--hhhH---HHHHHHHHHHHHHHHHHHHHHH Confidence 4443 33344456667777777777774 221111100 011110 0111 1112234455555555555444 Q ss_pred HHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC---CCCHHHHH---------HHHHHHH Q lcl|NC_011308. 412 IRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR---IGDEETLK---------AICDTLD 479 (530) Q Consensus 412 l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~---vdd~~~e~---------~~~e~e~ 479 (530) ++..-...+. ..+.+.|......+..+.+.+. .+...|+++...+++.+++ ++.-+..+ ....+.. T Consensus 407 ln~~L~~~~~-~~~~f~f~~~~~~~~~~~~~~~-~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~ 484 (551) T protein:vir:80 407 INKHIVAEFG-DKYTFQFVGGDIKSELESVKIL-AEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQ 484 (551) T ss_pred HHhhhccccC-CceEEEeeccChhhHHHHHHHH-HHHhcCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccC Confidence 4433222222 3578889888777877777654 4556788999999988754 22111110 0000000 Q ss_pred HHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 480 LDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 480 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) .+.+....++....+. .+.+...+.++++...+.+....+...--++ T Consensus 485 ~~~~~~~~~~~~~~~~----~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~ 531 (551) T protein:vir:80 485 FEHEKQQSNLQMLQEQ----TGNRVSTDVEDIPDGKDTTGDIGKDGQRKDK 531 (551) T ss_pred cchhhhhhccccccCc----CCCCCCCCCCCCCCccccCCCccccccccCc Confidence 0111111111000000 0000000000000000000000000000001 No 127 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=98.33 E-value=1.3e-06 Score=52.92 Aligned_cols=425 Identities=10% Similarity=0.014 Sum_probs=176.0 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccc-hhhhcccccccccccccccccCCcceeecCchhhH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDN-DIENTRIMWMNDHGDIVEDDNASNIKISHGFFAEL 79 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~-~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~I 79 (530) --.|+..|..+..+.|+.. |.+ +..++...+.. +.+.+ ..+.... ..+.. ... ....+......-.... T Consensus 61 ~~~~~~~~~~~kk~~i~~p----fkk-k~~~~~~d~f~-~s~es~s~vtsls-~pdaf-~~v--nVs~~~AlknsaV~sc 130 (945) T protein:vir:10 61 SIIIFRKNQVLKKEKIIVP----YNH-QEPPFKFNLFE-YSPESLMYLPSIS-DPDAF-FLI--NLFRKYRFNNDSKLIK 130 (945) T ss_pred eeeeehhhhHHHhhccccc----ccc-cccchhhhhhh-ccCccceeccccc-Cccce-eee--hhhhhhhhccHHHHHH Confidence 1123444444444433322 222 22233322222 22222 1111000 00000 000 0001111222234446 Q ss_pred Hhhhhhhhcccceee-ecCCcc-----------hHHHHHHHHHHhhccH------H-HHHHHHHHHHhhcCeEEEEEEec Q lcl|NC_011308. 80 VDQKTQYLLANGIDV-KPTDHD-----------DQKLCYLIEEYYNEEF------Q-SAIQELVEGSTIKGYEGIFARTT 140 (530) Q Consensus 80 vd~~~~yl~G~pv~~-~~~~~~-----------de~~~~~l~~~~~~~~------~-~~~~e~~~~~~~~G~a~~~~y~d 140 (530) |+..++=+.+-|+++ ...+.+ +..+...|++ -|.. . .....+..+...+|.+|.++-++ T Consensus 131 I~~IA~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~r--PNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd 208 (945) T protein:vir:10 131 VSEIPKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLER--PDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRD 208 (945) T ss_pred HHHHHhhhccCceEEEEecccCcccccccccccchHHHHHHhC--CCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEEC Confidence 666777777888874 211111 1112222221 1211 1 24455678889999999999999 Q ss_pred CCCce-EEEEecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhcccccc Q lcl|NC_011308. 141 SEDKL-TFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVN 219 (530) Q Consensus 141 ~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~ 219 (530) ..|++ .+..++|..+.|+.++.+.... + |.. ... +. ....|....+.++.... T Consensus 209 ~~G~ii~L~pLdPs~Vti~~ddDG~~~y--~-Yv~-~id-----G~---~~~~v~a~DvIlhirn~-------------- 262 (945) T protein:vir:10 209 EQGNLVAITPVDGTTIKPILSEDTGIVV--G-YVQ-EVD-----GA---IVAHFDKRDVVLFRQNL-------------- 262 (945) T ss_pred CCCcEEEEEEECCcceEEEEcCCCcEEE--E-EEE-ecC-----Cc---eEEEecCCceEEEeccC-------------- Confidence 99986 4788999999888776654321 1 111 000 00 01122222222111100 Q ss_pred ccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHh- Q lcl|NC_011308. 220 PNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDM- 298 (530) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~- 298 (530) ++-|. ..-.|.|.++ .+.+++...++-.....+.| T Consensus 263 ----------------------------------s~DG~-------~~GyGlSPIe---aa~~aI~~alAaek~aar~Fs 298 (945) T protein:vir:10 263 ----------------------------------TPDVY-------MYGYSLPPIE---ILYKVILSDIFIDKGNLDYYR 298 (945) T ss_pred ----------------------------------CCCcc-------cccCCchHHH---HHHHHHHHHHHHHHHHHHHHH Confidence 00000 0002444444 44455544444333333333 Q ss_pred ---ccc--eeeeecCCC-----------CchhhHHHHHh-------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHH Q lcl|NC_011308. 299 ---AEA--IYVVRGGTN-----------SPVDEIKKNIQ-------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDEL 355 (530) Q Consensus 299 ---~~~--~lvl~g~~~-----------~~~~~~~~~~~-------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~ 355 (530) +.| ++.++|... +..+.++..+. .++.+.+++|.+++-++....+.......+...+ T Consensus 299 kNGa~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG~NnG~piVLdeGmef~pLs~s~~DaQfLEsrkfs~e 378 (945) T protein:vir:10 299 KGGSIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMGDYTQVPILSGGKFTWIDFKGKRRDMQFKELAEFVAR 378 (945) T ss_pred hCCCccceEEEecCccccccccccccCHHHHHHHHHHHHHHhCCcccccceecCCCceEEEccCChhHHHHHHHHHHHHH Confidence 234 444443211 11122222221 1223445555555545444444444566677778 Q ss_pred HHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCC Q lcl|NC_011308. 356 NIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYI 433 (530) Q Consensus 356 ~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~ 433 (530) .|...-++|. ++....++-|+ +.- .....+...|+-.+..|...++.+-........+.+.|+... T Consensus 379 eIArAFGVPP~lLG~~e~st~SN--iEq----------q~~~Fv~~tL~Pil~~IEqeLNrkLl~~~eg~~i~fdFd~ld 446 (945) T protein:vir:10 379 KICAVYQVSPQDVGILEGSNKAT--AEV----------MASLTKAKGLEPLMATISKGFDEVVSEFRNEKDIKLWFKEDD 446 (945) T ss_pred HHHHHhCCCHHHcccCCCCCcch--HHH----------HHHHHHHHHHHHHHHHHHHHHHHhccccccCceeEEEecchh Confidence 8888888884 22221112122 111 122333444555555554444433222223346788888777 Q ss_pred CCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccc-cccCCccccCCCCCCCC Q lcl|NC_011308. 434 LANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEETLKAICDTLDLDYEDVVKALEDQE-VEELEPTVTPIIDPLTI 510 (530) Q Consensus 434 P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 510 (530) ..+..+.++....+..+|+++...+++.+++ +++-+.-+... ...++.-..+. .++..+.. -..+. T Consensus 447 l~D~ksraEal~kli~sGiLTiNEvRe~lGLpPIeGGD~lli~~-------nn~~P~d~~~ka~~ga~p~q----~aq~~ 515 (945) T protein:vir:10 447 LEKERDWWNIIQGQLNTGFRSINEARMEKGLEPVPWGDVPFSGL-------RNWKPEDEQAKAQQGAMPPQ----LAQAM 515 (945) T ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecc-------ccccccccccccccCCCCcc----cccCC Confidence 7788888898889999999999999988754 33211111000 00000000000 00000000 00011 Q ss_pred CCccCcCCCCc-ccccccCCC Q lcl|NC_011308. 511 EPQPEPLNIDP-VIEEEPVQE 530 (530) Q Consensus 511 ~~~~~~~~~~~-~~~~~~~~~ 530 (530) .+++.+.+... ...+.|-.. T Consensus 516 ~dqp~~kGGe~dEns~~psE~ 536 (945) T protein:vir:10 516 ADQPSQQGGGVDENSSVPSEQ 536 (945) T ss_pred CCCCCCCCCCCCCCCCCCCcc Confidence 11111111110 111111111 No 128 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=98.25 E-value=2.1e-06 Score=51.77 Aligned_cols=460 Identities=10% Similarity=-0.036 Sum_probs=212.5 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcc----------e Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNI----------K 70 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~----------k 70 (530) |..+-..-.+-.-..-.+. ...+...+-|.+-.. .|........ ........++- = T Consensus 1 Mn~iDr~i~~~sP~~a~~R-----------~~ar~~~~~y~aa~~--~r~~~~~~~~-~s~~~~i~~~~~~lr~RaRdL~ 66 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARR-----------LAAREAIQAYEAARP--GRTHKAKRQP-LGADTSLQKSAVSMREQCRKLD 66 (548) T ss_pred CchHHhHhhhcchHHHHHH-----------HHhHHHhccccccCc--cccccccCCC-CChHHHHHHHHHHHHHHHHHHH Confidence 4333222222111111111 111122233554321 1111100000 00000000000 0 Q ss_pred eecCchhhHHhhhhhhhccc-ceeeecC----C-cchHHHHHHHHHHh----h-------ccHHHHHHHHHHHHhhcCeE Q lcl|NC_011308. 71 ISHGFFAELVDQKTQYLLAN-GIDVKPT----D-HDDQKLCYLIEEYY----N-------EEFQSAIQELVEGSTIKGYE 133 (530) Q Consensus 71 i~~n~~k~Ivd~~~~yl~G~-pv~~~~~----~-~~de~~~~~l~~~~----~-------~~~~~~~~e~~~~~~~~G~a 133 (530) ..+++++-+|+..+.+++|. .+.+++. + ..++++.+.+...| . .+|......+++.....|.+ T Consensus 67 rNn~~a~~av~~~~~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~ 146 (548) T protein:vir:95 67 EDHDLVTGLLDRLEERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEG 146 (548) T ss_pred hcChHHHHHHHHHHHhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCce Confidence 13678888888888889883 4444321 1 12233444444433 2 13555556677888899999 Q ss_pred EEEEEecCCCc--------eEEEEecccceEEEEcCC-CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeec Q lcl|NC_011308. 134 GIFARTTSEDK--------LTFQTVDALQLLPVFDDY-GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQK 204 (530) Q Consensus 134 ~~~~y~d~~g~--------~~~~~~~p~~~~~v~d~~-~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~ 204 (530) +....++..+. +++..++|..+---++.. ..+...|. ++ ..+..+. +.++... T Consensus 147 f~~~~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~i~~GIE------~D---~~Grp~a-Y~i~~~h-------- 208 (548) T protein:vir:95 147 LAQKLMGRVPNYTFATSVPFALELLEPDYLPFSYNNLSKGIVQGIE------RD---TWRRKRA-YHLLKDH-------- 208 (548) T ss_pred EEEeeecccccccCCcccceEEEEechhhcCCCCCCCCCceeeeeE------EC---CCCceEE-EEEeecC-------- Confidence 98776654332 588889998762112111 11111111 11 1111111 1122211 Q ss_pred CCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHH Q lcl|NC_011308. 205 DEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDY 284 (530) Q Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~ 284 (530) .+............-.+. ...-|.|... .....-|.|.|..++..+..+ T Consensus 209 -Pgd~~~~~~~~~~~rvpA--------------------------~~VlHif~~~----r~gQ~RGvs~lapvl~~l~~l 257 (548) T protein:vir:95 209 -PGNLQTLGGSLAVKRVEA--------------------------ERIIHIAYRK----RIGQNRGVPMLHAVLIRLADL 257 (548) T ss_pred -CCcccccccccceeeech--------------------------hHheeccccc----CCccccCcchHHHHHHHHHHH Confidence 111000000000000000 0011222211 112346899999887776655 Q ss_pred HHHHHHHHHHHHHhccceeeeecCCC--------CchhhHHHHHhhCcce-ecCCCCceeEEEecCCHHHHHHHHHHHHH Q lcl|NC_011308. 285 DLMNCFLSNNLQDMAEAIYVVRGGTN--------SPVDEIKKNIQSKKII-QTKGEGGLDIQTVDIPYEARKAKMDIDEL 355 (530) Q Consensus 285 ~~~~S~~~n~~~~~~~~~lvl~g~~~--------~~~~~~~~~~~~~~~i-~~~~~~~~~~lt~~~~~~~~e~~ld~L~~ 355 (530) +....--.....-.+---++++.... +..+.....+..+.++ .+..|.++++++.+.+...++.+...+.+ T Consensus 258 ~~y~dael~~aki~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~pG~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr 337 (548) T protein:vir:95 258 KDYEESERVAARISAALAMYIKKGNPDSYTVEPGKDRKNRTIPIAPGMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLR 337 (548) T ss_pred hHHHHHHHHHHHHhhhheeeeecCCCccccCCCCcccccccccccCCccccccCCCceeeecCCCCCCCCHHHHHHHHHH Confidence 55444333222222222233332111 1111111224444445 47888899999988888899999999999 Q ss_pred HHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCCc----cc-cceeeE Q lcl|NC_011308. 356 NIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRW-TADLVVEDIRRRGLGD----YS-STDIKF 427 (530) Q Consensus 356 ~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~-~~~~i~~~l~~~~~~~----~d-~~~i~i 427 (530) .|-.-..+|- ++.. ++ .|-.+.+.-+...-......+..|...+-+ +++..+...-..+... .+ ...+.+ T Consensus 338 ~IAaglGipYe~ltgD-~s-~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~ 415 (548) T protein:vir:95 338 MIGAGTRSTYSSVSRA-YD-GTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAA 415 (548) T ss_pred HHHhhcCCCHHHHhcc-cc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheee Confidence 9988888773 2221 22 366677777776666666665555544433 4443333322223211 11 123456 Q ss_pred EeC-CCCC-CCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhh---------c--ccc Q lcl|NC_011308. 428 DIE-PYIL-ANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALE---------D--QEV 494 (530) Q Consensus 428 ~f~-~~~P-~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~---------~--~~~ 494 (530) .|. +..| .|-.-.++.....+.+|+.|.+.++...+. |+++.++++.+|.+...+.--.+. . +.. T Consensus 416 ~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~G~--D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~ 493 (548) T protein:vir:95 416 VYQGPVMPWINPMHEANAWELLVKAGFADEAEVARARGR--DPRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPV 493 (548) T ss_pred eeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCC Confidence 664 3333 576666777888999999999999999875 899998888888776544211110 0 000 Q ss_pred ccCCc--------------------cc--cCCCCCCCCCCccCcCCCCccccccc Q lcl|NC_011308. 495 EELEP--------------------TV--TPIIDPLTIEPQPEPLNIDPVIEEEP 527 (530) Q Consensus 495 ~~~~~--------------------~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (530) ...+. ++ =+++.++=.+..+.-.-..|-++++| T Consensus 494 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 548 (548) T protein:vir:95 494 EAVQKVYLGVGKMLTADEARELVNRYGAGLPVPGPDFPNESNNGGADGQPSNPDP 548 (548) T ss_pred CchhhhccccccccccchhHHhhccCCCCCcCCCCCCCcccccCCCCCCCCCCCC Confidence 00000 00 01111111111122222334566666 No 129 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=98.03 E-value=6.5e-06 Score=49.01 Aligned_cols=330 Identities=12% Similarity=0.067 Sum_probs=147.0 Q ss_pred hcccceeeec-CCcchHHHHHHHHHHhhc--cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCC Q lcl|NC_011308. 87 LLANGIDVKP-TDHDDQKLCYLIEEYYNE--EFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDY 162 (530) Q Consensus 87 l~G~pv~~~~-~~~~de~~~~~l~~~~~~--~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~ 162 (530) +.+-|+.+.- .+..+..+...|+.==+. .-......+......+|.||.++-++..|.+ .+..++|..+-++.++. T Consensus 1 ia~lp~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~l~~~~v~~~~~~~ 80 (348) T protein:vir:93 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (348) T ss_pred CcccceEeEecCcCcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCCceEEEEeCC Confidence 4455666422 122233333434311011 1223445667788899999999988988886 46778888887766544 Q ss_pred CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccc Q lcl|NC_011308. 163 GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGV 242 (530) Q Consensus 163 ~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (530) +... +|.+.... +. ...|..+.+.|++.... T Consensus 81 ~~~~----~y~~~~~~-----g~----~~~~~~~eiih~r~~~~------------------------------------ 111 (348) T protein:vir:93 81 SREL----YYSIHAAT-----GN----KLIVHNMDMLHFKHIVA------------------------------------ 111 (348) T ss_pred CcEE----EEEEEcCC-----Ce----EEEEccccEEEecCCCC------------------------------------ Confidence 3211 12111100 10 11244555555432110 Q ss_pred cccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccc-eeee-ecCCCCch--hhHHH Q lcl|NC_011308. 243 EEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEA-IYVV-RGGTNSPV--DEIKK 318 (530) Q Consensus 243 ~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~-~lvl-~g~~~~~~--~~~~~ 318 (530) . +.-.|.|-++-+...|+..+.+ ....+..+..+ -+++ .+...+++ ..++. T Consensus 112 -------------~---------~~~~G~s~~~~~~~~i~~~~~~---~~~~~~~~~~~~~~i~~~~~~l~~e~~~~~~~ 166 (348) T protein:vir:93 112 -------------S---------NMVQGISPIDVLKNTTDFDNAV---RTFNLTEMQKPDSFMLKYGSNVSTEKRQQVLE 166 (348) T ss_pred -------------C---------CceeeccHHHHHHHHHHHHHHH---HHHHHHhcCCCceeEEecCCCCCHHHHHHHHH Confidence 0 0013556555544444432222 22224444443 2332 23222221 11111 Q ss_pred H----H-hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHHH Q lcl|NC_011308. 319 N----I-QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKT 393 (530) Q Consensus 319 ~----~-~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~k 393 (530) . . ..++++.+++|.+++-++....+..+....+.....|...-.+|..--...++++...++- .. T Consensus 167 ~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~----------~~ 236 (348) T protein:vir:93 167 DFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEE----------LN 236 (348) T ss_pred HHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH----------HH Confidence 1 1 1234556666666665654444444455667778888888888852111111122111111 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCccc---cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCH Q lcl|NC_011308. 394 EIALRKTLRWTADLVVEDIRRRGLGDYS---STDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDE 468 (530) Q Consensus 394 e~~f~~~l~~~~~~i~~~l~~~~~~~~d---~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~ 468 (530) ..++...|.-+++.|...++.+-....+ ...+++.+..-+-.|..+.|++..++..+|+++...+++.+++ +++- T Consensus 237 ~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~g~~p~~gg 316 (348) T protein:vir:93 237 RFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGG 316 (348) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCc Confidence 1223334444444444444432211111 1224444445555678888888899999999999999988764 2221 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCC Q lcl|NC_011308. 469 ETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIE 511 (530) Q Consensus 469 ~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (530) +.-+- ......... ..+.+....+..++..+. T Consensus 317 D~~~~---------~~n~~~~~~--~~~~~~~~~gg~~n~~~~ 348 (348) T protein:vir:93 317 DKPLI---------SGDLYPIDT--PLELRKSLKGGDKNVNES 348 (348) T ss_pred CeEee---------ccccccccc--chhhcccccCCCCCcCCC Confidence 11100 000000000 000000000000000011 No 130 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=97.98 E-value=8.3e-06 Score=48.44 Aligned_cols=445 Identities=12% Similarity=0.066 Sum_probs=161.9 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcc--cccccccccccccccCCcc----eeecC Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTR--IMWMNDHGDIVEDDNASNI----KISHG 74 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~--~~~~~~~~~~~~~~~~~n~----ki~~n 74 (530) +..++.-+. .+...+... ++ .-..+.+=-.++..-...+ -...+..+...+.-..... .+... T Consensus 21 ~~~~~~~~~--~~~~~~~~~-~~--------~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~p~~~~~~~~~~~~l~~ 89 (576) T protein:vir:96 21 IIDTVPIDD--GLQANIRNI-EE--------KSKELNKSLYGKQQAYAEPFLEVMDTNPEFRTKRSYMKNSDNLHDVLKQ 89 (576) T ss_pred chhhhhccc--ChhHHHHHh-hh--------hhhhhccccCCccchhhcceeeeeecCCCccccCcchhhhhhhHHHHHH Confidence 111111111 000111100 00 0000000001111000000 0001111111110000000 00111 Q ss_pred -----chhhHHhh----hhhhh---------cccceeeecCCcc--hH--HHHHHHHHHh----hc------cHHHHHHH Q lcl|NC_011308. 75 -----FFAELVDQ----KTQYL---------LANGIDVKPTDHD--DQ--KLCYLIEEYY----NE------EFQSAIQE 122 (530) Q Consensus 75 -----~~k~Ivd~----~~~yl---------~G~pv~~~~~~~~--de--~~~~~l~~~~----~~------~~~~~~~e 122 (530) ....+|+. .+.|. +|-++.....+.. ++ .....+..++ .+ .+...... T Consensus 90 ~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~ 169 (576) T protein:vir:96 90 FGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRK 169 (576) T ss_pred hhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHH Confidence 12222332 33331 2334443322211 11 1111222222 11 23455667 Q ss_pred HHHHHhhcCeEEEEEEecCC--Cce-EEEEecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceE Q lcl|NC_011308. 123 LVEGSTIKGYEGIFARTTSE--DKL-TFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVW 199 (530) Q Consensus 123 ~~~~~~~~G~a~~~~y~d~~--g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~ 199 (530) +..+...+|.+|.++.++.+ |++ .+..++|..|.++.+..+........|.... . + .....+..+.+. T Consensus 170 lv~dlll~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~-~-----~---~~~~~~~~~dii 240 (576) T protein:vir:96 170 IVRDTYTYDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNGKIIKGGKRFVQVI-N-----K---KVVASFTSREMA 240 (576) T ss_pred HHHHHHhcCCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCCceeeeeeEEEEec-C-----C---ceEEEecccceE Confidence 78888999999988776654 443 4778999999988877654322211111100 0 0 011123333333 Q ss_pred EEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHH Q lcl|NC_011308. 200 YYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKS 279 (530) Q Consensus 200 ~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~ 279 (530) +++.....+ ....-.|.|-++.... T Consensus 241 ~~~~~~~~d-------------------------------------------------------~~~~~~G~Spi~~a~~ 265 (576) T protein:vir:96 241 MGIRNPRTE-------------------------------------------------------LSSSGYGLSEVEIAMK 265 (576) T ss_pred EEeecCCCC-------------------------------------------------------cccCcccccHHHHHHH Confidence 322110000 0001136666665555 Q ss_pred HHHHHHHHHHHHHHHHHHhccceee--eecCC-CCc--hhhHHHHHh--------hCc-ceecCCCCceeEEEecCCHHH Q lcl|NC_011308. 280 IIDDYDLMNCFLSNNLQDMAEAIYV--VRGGT-NSP--VDEIKKNIQ--------SKK-IIQTKGEGGLDIQTVDIPYEA 345 (530) Q Consensus 280 liDa~~~~~S~~~n~~~~~~~~~lv--l~g~~-~~~--~~~~~~~~~--------~~~-~i~~~~~~~~~~lt~~~~~~~ 345 (530) .|+....+..-..+.+.-.+.|-.+ ++|.. .++ .+.++..+. .++ .+.+++|.+++-++....+.. T Consensus 266 ~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~ls~~~~d~q 345 (576) T protein:vir:96 266 QFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNMTPTANDMQ 345 (576) T ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEeccCChhhHH Confidence 5555554444444445544555544 44532 222 122232221 123 245566655555554444555 Q ss_pred HHHHHHHHHHHHHHHhcccC--CCcccccCCc----HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_011308. 346 RKAKMDIDELNIYRSGMGFN--SSAVGDGNAT----NVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGD 419 (530) Q Consensus 346 ~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~S----GvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~ 419 (530) +....+...+.|...-.+|. ++....++.+ |.++.+ +.+ ......+++..|.-+++.|...++.+-... T Consensus 346 fle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~--sn~---e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~ 420 (576) T protein:vir:96 346 FEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNE--ADP---GKKQQQSQNKGLQPLLRFIEDLINTHIISE 420 (576) T ss_pred HHHHHHHhHHHHHHHhCCCHHHcccccccccccccccccccc--ccH---HHHHHHHHHHHHHHHHHHHHHHHHhhhchh Confidence 56667788888888888885 3322211111 111111 111 112223444445444444444443322122 Q ss_pred cccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHHHHH-----HHH----HHHHH---HHHH Q lcl|NC_011308. 420 YSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEETLKA-----ICD----TLDLD---YEDV 485 (530) Q Consensus 420 ~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~e~~-----~~e----~e~~e---~~~~ 485 (530) +. ..+.+.|.+.-+.+..+..++. .....|+++...+++.+++ +++-+..+. ... +...+ ..+. T Consensus 421 ~~-~~~~~~f~r~d~~~~~e~~~~~-~~~~~G~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~ 498 (576) T protein:vir:96 421 YS-DKYVFQFVGGDTKSELDKIKIL-QEEVKTYKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDTKQKER 498 (576) T ss_pred cc-CceEEEeccCCHHHHHHHHHHH-HHHhcCccCHHHHHHHhCCCCCCCcceeccccccccccccccCCCCCCcccccc Confidence 22 2466778777666666655433 3445689999888888754 332111100 000 00000 0000 Q ss_pred HHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 486 VKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) +....+.. ..+...+......++..+.+...++...+.+|-+ T Consensus 499 ~~~~~~~~---~~~~~~~~~~~s~~~~~~g~~~~~~~~~~~~~~~ 540 (576) T protein:vir:96 499 FDMIQQFL---NSPDDEEPQQESTEDKVDGRESNDPTKIDSPVGT 540 (576) T ss_pred cccccccc---CCCCCCCCCCCCCCCcccccccccCCCCCCcccc Confidence 00000000 0000000000000111111111111122222222 No 131 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=97.93 E-value=1e-05 Score=47.92 Aligned_cols=405 Identities=9% Similarity=-0.002 Sum_probs=166.8 Q ss_pred HHHHh-cccchhhhcccccccccccccc-------cccCCccee----ec--CchhhHHhhhhhhhcccceeeecCC-cc Q lcl|NC_011308. 36 GQRYY-NQDNDIENTRIMWMNDHGDIVE-------DDNASNIKI----SH--GFFAELVDQKTQYLLANGIDVKPTD-HD 100 (530) Q Consensus 36 ~~~YY-~g~~~I~~r~~~~~~~~~~~~~-------~~~~~n~ki----~~--n~~k~Ivd~~~~yl~G~pv~~~~~~-~~ 100 (530) +-+.+ .++...-............... ..+....++ +. .=....|+..++=+.+-|+.+--.+ .+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~g 80 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVKADPEAVLSFHAVFACISLISQDIAKMRLRLMQTDAQG 80 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchhhcCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEeccCC Confidence 11111 1111000000000000000000 000000011 00 0012245555555566677653211 11 Q ss_pred h-HHH-HHHHHHHhh--ccH---HHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCceeEEEEE Q lcl|NC_011308. 101 D-QKL-CYLIEEYYN--EEF---QSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQRIIRFY 172 (530) Q Consensus 101 d-e~~-~~~l~~~~~--~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~y 172 (530) . +.. ...+..++. |.. ......+......+|.||.++-++..|.+ .+..++|..+-++.++.+.+. | T Consensus 81 ~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~-----y 155 (454) T protein:vir:93 81 IRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWNRVEPLVADDGEVF-----Y 155 (454) T ss_pred ccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCcceEEEEcCCCcEE-----E Confidence 1 111 111222222 222 24456677788999999999988888886 478899999988887665432 2 Q ss_pred EEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccc Q lcl|NC_011308. 173 TEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLG 252 (530) Q Consensus 173 ~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 252 (530) ...... .. .......+..+.+.|++.... T Consensus 156 ~~~~~~-~~----~~~~~~~~~~~eViH~k~~~~---------------------------------------------- 184 (454) T protein:vir:93 156 RITPDR-NC----GITEAVTVPAREVIHDRFNCF---------------------------------------------- 184 (454) T ss_pred EEEecc-cc----ccceeEEecCcceEEeccCCC---------------------------------------------- Confidence 111100 00 001112345555555532110 Q ss_pred cccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-CCc--hhhHHHHHh-------h Q lcl|NC_011308. 253 RSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT-NSP--VDEIKKNIQ-------S 322 (530) Q Consensus 253 ~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~-~~~--~~~~~~~~~-------~ 322 (530) .+.-.|.|.+......|.....+..-..+.+.-.+.|-.+++-.. .++ .+.++..+. . T Consensus 185 ------------~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~n~ 252 (454) T protein:vir:93 185 ------------FHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPGSITEENAKKLKSNWDSGYTGENA 252 (454) T ss_pred ------------CCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHHHHHHHHHhccccc Confidence 011146666665555555444444434444444444555554321 111 122222221 3 Q ss_pred CcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--CcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHH Q lcl|NC_011308. 323 KKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKT 400 (530) Q Consensus 323 ~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~ 400 (530) ++++.++++.+++-++....+..+..........|...-.+|.. +...-++-|.+. ...+.++... T Consensus 253 g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e------------~~~~~f~~~~ 320 (454) T protein:vir:93 253 GKTAILSNGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVE------------ALEQQYYSQC 320 (454) T ss_pred CCceeccCCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHH------------HHHHHHHHHH Confidence 34666777766666665444444445566777788888788752 221111112111 1112223333 Q ss_pred HHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHHHHHHHHHH Q lcl|NC_011308. 401 LRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEETLKAICDTL 478 (530) Q Consensus 401 l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~e~~~~e~e 478 (530) |.-+++.|...++.+-....+ ..+++.++.-+..|..+.++....+..+|+++...+++.+++ +++-++.. +... T Consensus 321 l~P~~~~ie~~ln~~L~~~~~-~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~--~~~~ 397 (454) T protein:vir:93 321 LQTLIESIELLLDEALETGEN-ESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALY--LQQQ 397 (454) T ss_pred HHHHHHHHHHHHHHhhcCCCC-cEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeee--eccC Confidence 333333333333322111111 235566666667888888888889999999999999888754 22211100 0000 Q ss_pred HHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 479 DLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 479 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) .-.+....+++.... +.. + .+....++++.+++.......+.... T Consensus 398 ----~~~~~~~~~~~~~~~-~~~-~-~~~~~~~~~~~~~~d~~~~~~e~~~d 442 (454) T protein:vir:93 398 ----NYSLEALSRRDARED-PFA-S-SGKTASVPQAVAASDGNKAITETEHD 442 (454) T ss_pred ----ccchHhhhccCcccC-CCC-C-CccCCCCCCCCCCCCCCCCccCCccc Confidence 000001111111000 000 0 00111111111111111111111111 No 132 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=97.93 E-value=1.1e-05 Score=47.87 Aligned_cols=399 Identities=9% Similarity=0.070 Sum_probs=174.3 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |- ++..+..+...... .+ ..+.....+.-.........+........+..=+..+-...-|+..++=+.+- T Consensus 1 MG---~f~~lf~~~~~~~~-~~-----~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA~l 71 (422) T protein:vir:13 1 MG---FLRGLFNKKNNNDE-KR-----SNYDEDIGIDISDSNFWEKFGIKLNFSVRGKRALKENTVYVCTKIRAESIGKL 71 (422) T ss_pred Cc---hhhhhhhccCCccc-hh-----hhhhhccccccCcchhhhhccccCCcccchhhhhccHHHHHHHHHHHHhhhhC Confidence 32 23333222111100 00 00000000000000000000000000000000011122334456666666777 Q ss_pred ceeeecCCcc--hHHHHHHHHHHhhc--cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCc Q lcl|NC_011308. 91 GIDVKPTDHD--DQKLCYLIEEYYNE--EFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTL 165 (530) Q Consensus 91 pv~~~~~~~~--de~~~~~l~~~~~~--~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~ 165 (530) |+.+.-.... +..+...|+.-=+. ........+..+...+|.||.++-++..|++ .+..++|..|-++.|+.+.. T Consensus 72 p~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~~~~ 151 (422) T protein:vir:13 72 SLKIYKDKEEYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDDNFL 151 (422) T ss_pred ceEEEecCcccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCcce Confidence 8876322211 12233334321111 1234566778888999999999999988885 57889999999988776543 Q ss_pred eeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccccc Q lcl|NC_011308. 166 QRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEH 245 (530) Q Consensus 166 ~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 245 (530) ...-+.++..... .+. ...+.++.+.|++.... T Consensus 152 ~~~~~~~y~~~~~----~g~----~~~~~~~eiih~~~~~~--------------------------------------- 184 (422) T protein:vir:13 152 SSLSKVWYVVTDK----NGK----EHKLLPDEMLHFIGDIT--------------------------------------- 184 (422) T ss_pred eccceEEEEEEeC----CCe----EEEEcccceEEEcCCCC--------------------------------------- Confidence 3221222111100 000 11233444444431100 Q ss_pred ccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC-Cc--hhhHHHHHh- Q lcl|NC_011308. 246 EGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN-SP--VDEIKKNIQ- 321 (530) Q Consensus 246 ~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~-~~--~~~~~~~~~- 321 (530) . +.-.|.|.++.....|+....+..-..+.+...+.|-.+|+-... ++ .+.++..+. T Consensus 185 ----------~---------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~ 245 (422) T protein:vir:13 185 ----------L---------DGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVGDLDEKAKKIFKKEFES 245 (422) T ss_pred ----------C---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHH Confidence 0 111477777777777766555555555555555567666654221 11 122222221 Q ss_pred -------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--CcccccCCcHHHHHHHHhhHHHHHHH Q lcl|NC_011308. 322 -------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGDGNATNVVIKSRYTLLAMKAQK 392 (530) Q Consensus 322 -------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~SGvAik~~~~~l~~ka~~ 392 (530) .++++.+++|.+++-++....+.......+.....|...-.+|.. +....++-|+ +.- . T Consensus 246 ~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn--~e~----------~ 313 (422) T protein:vir:13 246 MSNGLENAHSISLLPFGYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERATFNN--LTE----------Q 313 (422) T ss_pred HhcCccccCCceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHH----------H Confidence 234666766666665655444444455566777888888888852 2211121111 111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCcccc-ceeeEEe--CCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCC Q lcl|NC_011308. 393 TEIALRKTLRWTADLVVEDIRRRGLGDYSS-TDIKFDI--EPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGD 467 (530) Q Consensus 393 ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~-~~i~i~f--~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd 467 (530) ...++...|.-.++.|...++.+-...... ....|.| ..-+-.|..+.++....+.++|+++...+++.+++ +++ T Consensus 314 ~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~g 393 (422) T protein:vir:13 314 QKDFYVTTLQSSLTVYEQEIQDKLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPPVEG 393 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 122334444444444444333322111111 1234444 34444578888888889999999999999988754 222 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcc Q lcl|NC_011308. 468 EETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPV 522 (530) Q Consensus 468 ~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (530) -+..+.... ... +....++..++. ..++. T Consensus 394 gD~~~~~~n-----~~~-l~~~~~~~~~~g--------------------~~~g~ 422 (422) T protein:vir:13 394 GDRLLVNGN-----MIP-IEMAGEQYKKGG--------------------EKGGK 422 (422) T ss_pred cCeeeeccC-----ccc-hhhcccccccCC--------------------CcCCC Confidence 111100000 000 000000000000 00000 No 133 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=97.90 E-value=1.2e-05 Score=47.61 Aligned_cols=389 Identities=11% Similarity=-0.006 Sum_probs=160.3 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |-+ +.++........... . +.+--... ..+....+...++.=+.+.-....|+..++=+.+- T Consensus 1 Mgl---~~~~f~~~~~~~~~~------~-~~~~~~~~--------~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~l 62 (409) T protein:vir:84 1 MSL---FTRIFSGPSEERTLT------K-ISGIPSPA--------EDWAMHGDRPGANSAMTLGAFYACVTLLADTVASL 62 (409) T ss_pred Cch---hhhhhcCCCcccccc------c-cccccccc--------chhhccCcccchhhhhccHHHHHHHHHHHHhhhhC Confidence 322 222111100000000 0 00000000 00000000000000011112344566666777777 Q ss_pred ceeeecCCcch----HHHHHHHHHHhhc--cHHHHHHHHHHHHhhcCeEEEEE-EecCCCce-EEEEecccceEEEEcCC Q lcl|NC_011308. 91 GIDVKPTDHDD----QKLCYLIEEYYNE--EFQSAIQELVEGSTIKGYEGIFA-RTTSEDKL-TFQTVDALQLLPVFDDY 162 (530) Q Consensus 91 pv~~~~~~~~d----e~~~~~l~~~~~~--~~~~~~~e~~~~~~~~G~a~~~~-y~d~~g~~-~~~~~~p~~~~~v~d~~ 162 (530) |+.+--.+... ..+...|+.--+. .-......+......+|.+|.++ +.+..|.+ .+..++|..+.+..... T Consensus 63 p~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~ 142 (409) T protein:vir:84 63 SIDAYRKKDNVRIPVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKD 142 (409) T ss_pred ceEEEEecCCcccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCC Confidence 88653222211 1122223211011 12345566777888999999776 46777775 47788998886654322 Q ss_pred CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccc Q lcl|NC_011308. 163 GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGV 242 (530) Q Consensus 163 ~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (530) .....+...|. . .+ .+|..+.+.|++.... T Consensus 143 ~~~~~~~~~~~----~----~g------~~~~~~dvih~~~~~~------------------------------------ 172 (409) T protein:vir:84 143 EDGDWIEPVYR----I----DG------KVVPNHRIMHIKRYPV------------------------------------ 172 (409) T ss_pred CcceEEEEEec----C----Cc------eEEchhhEEEecCCCC------------------------------------ Confidence 21111111110 0 00 1234444444421110 Q ss_pred cccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-CCc--hhhHHHH Q lcl|NC_011308. 243 EEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT-NSP--VDEIKKN 319 (530) Q Consensus 243 ~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~-~~~--~~~~~~~ 319 (530) . +.-.|.|.++.....|+....+..-.++.+.-...|-.+|+... .++ .+.++.. T Consensus 173 -------------~---------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~ 230 (409) T protein:vir:84 173 -------------A---------GCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDADLTPDQVKQTQKQ 230 (409) T ss_pred -------------C---------cccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHH Confidence 0 01146777776666555555444444455555556666665422 211 1222221 Q ss_pred H-----hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHHH Q lcl|NC_011308. 320 I-----QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQK 392 (530) Q Consensus 320 ~-----~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~~ 392 (530) . ..++++.++++.+++-++....+.......+...+.|...-.+|. ++...-++.++..++-.... T Consensus 231 ~~~~~~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~------- 303 (409) T protein:vir:84 231 WIQSHHNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGIN------- 303 (409) T ss_pred HHHHhccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHH------- Confidence 1 123455566555554444332333334455677788888888885 23222222222222211111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCC--CCHHH Q lcl|NC_011308. 393 TEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRI--GDEET 470 (530) Q Consensus 393 ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~v--dd~~~ 470 (530) ++...|.-.++.|...++.+-. ....|++.++.-+-.|..+.++....+.++|+++...+++.+++- ++-+. T Consensus 304 ---f~~~~l~P~~~~ie~~l~~~L~---~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~ggD~ 377 (409) T protein:vir:84 304 ---FVRHTLLPWLRCIEQALDTFLP---RGQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIPEGDI 377 (409) T ss_pred ---HHHHHHHHHHHHHHHHHHHhcc---CCCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcce Confidence 1122222222222222222111 112355566565667888889888899999999999998887542 22110 Q ss_pred HHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCC Q lcl|NC_011308. 471 LKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNI 519 (530) Q Consensus 471 e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (530) -.. ...+...... +..++. ..++++.....|. T Consensus 378 ~~~---------~~n~~~~~~~------~~~~~~--~~~~~~~~~~gn~ 409 (409) T protein:vir:84 378 HLQ---------PMNFVPLGYV------PPEEPA--QEPQPNSATEGNK 409 (409) T ss_pred eee---------cccccccccC------CccccC--cCCCCCCccCCCC Confidence 000 0000000000 001110 0111112222222 No 134 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=97.87 E-value=1.3e-05 Score=47.30 Aligned_cols=415 Identities=7% Similarity=0.053 Sum_probs=159.5 Q ss_pred HHHHHHHhhhHHHHHHHHHHhcccc-hhhhcccccccccccccccccCCcc--ee--ecCchhhHHhhhhhhhcccceee Q lcl|NC_011308. 20 KIDEYIRSQNVSLARVGQRYYNQDN-DIENTRIMWMNDHGDIVEDDNASNI--KI--SHGFFAELVDQKTQYLLANGIDV 94 (530) Q Consensus 20 ~i~~~~~~~~~~~~~~~~~YY~g~~-~I~~r~~~~~~~~~~~~~~~~~~n~--ki--~~n~~k~Ivd~~~~yl~G~pv~~ 94 (530) +-+.+.+..- ..+++.-+. ..-.+... ....+.-.+....+.. ++ ..+.....|+..+.-+.+-|+++ T Consensus 1 ~~~~~~~i~s------~~~~~~i~~~~~~s~~~~-~~~~~~~~~pp~~~~~la~l~~~n~~v~scI~~ia~~IA~l~~~~ 73 (542) T protein:vir:41 1 MFNYHLSIRS------LEKYKAIKREEVESQALG-ETRFEEYVEPKVNPLVLLSLLQVNPYHASACSIKANDIIRTGYIL 73 (542) T ss_pred Cccccccccc------cccchhhhhccccccccc-cccCCccccCCCCHHHHHHHHhhcHHHHHHHHHHHHHHhhCceee Confidence 1111111000 111111110 00000000 0000000000000000 11 13455678888888899999887 Q ss_pred ecCCcchHHHHHHHHHHhhc---cHHHHHHHHHHHHhhcCeEEEEEEecCCCceE-EEEecccceEEEEcCCCCceeEEE Q lcl|NC_011308. 95 KPTDHDDQKLCYLIEEYYNE---EFQSAIQELVEGSTIKGYEGIFARTTSEDKLT-FQTVDALQLLPVFDDYGTLQRIIR 170 (530) Q Consensus 95 ~~~~~~de~~~~~l~~~~~~---~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~-~~~~~p~~~~~v~d~~~~~~~~~~ 170 (530) ...+. ..+..++.| +.......+..+...+|.||.++-++..|++. +..++|..+.+..|... ++. T Consensus 74 ~~~~~------~~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~~d~~~----~~~ 143 (542) T protein:vir:41 74 EGDDE------GVVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVHKDGSR----YRQ 143 (542) T ss_pred ecccc------hhhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEEEcCCe----eEe Confidence 53321 223334333 23455667788899999999999999988764 67888888876655331 111 Q ss_pred EEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccc Q lcl|NC_011308. 171 FYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQV 250 (530) Q Consensus 171 ~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 250 (530) ++. .....+...|.......... + T Consensus 144 ~~~----------~~~~~~~~~y~~~~~~~~~~--g-------------------------------------------- 167 (542) T protein:vir:41 144 TWD----------GVNITHFKDYRYEGEINPET--G-------------------------------------------- 167 (542) T ss_pred eec----------CCcceeEEeecccccccccc--c-------------------------------------------- Confidence 110 00011111111110000000 0 Q ss_pred cccccCCccceEEeeCCc-----CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccc--eeeeecCCCCc----------- Q lcl|NC_011308. 251 LGRSYKSRFPFDILYNNK-----LGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEA--IYVVRGGTNSP----------- 312 (530) Q Consensus 251 ~~~~~~~~iPiv~~~nn~-----~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~--~lvl~g~~~~~----------- 312 (530) .....+..=-|+++++.. .|.|.+......|..-..+..-..+.+.-.+.| +|.+.|...++ T Consensus 168 ~~~~~~~~~eIiHir~~~~~~~~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~ 247 (542) T protein:vir:41 168 EDQDSVGANELVFIHIPSPVCSYYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTG 247 (542) T ss_pred ccccccCcccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHH Confidence 000001111234454332 466666654444433222222222222222234 44455532211 Q ss_pred hhhHHH----HHh-----hCcceecC----CCCceeEEEecCC--HHHHHHHHHHHHHHHHHHhcccCC--Ccc--cccC Q lcl|NC_011308. 313 VDEIKK----NIQ-----SKKIIQTK----GEGGLDIQTVDIP--YEARKAKMDIDELNIYRSGMGFNS--SAV--GDGN 373 (530) Q Consensus 313 ~~~~~~----~~~-----~~~~i~~~----~~~~~~~lt~~~~--~~~~e~~ld~L~~~I~~~s~~p~~--~~~--~~gn 373 (530) .+.++. ... .++++.++ .+++++|..-.++ +..+....+...+.|...-.+|.. +.. +.+| T Consensus 248 ~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n 327 (542) T protein:vir:41 248 RTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLG 327 (542) T ss_pred HHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccc Confidence 111111 111 22344443 2345666554443 333444556677778887788742 221 1112 Q ss_pred CcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCC--CCCCHHHHHHHHHHHHhcC Q lcl|NC_011308. 374 ATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPY--ILANELDLAMIDKTEAETN 451 (530) Q Consensus 374 ~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~--~P~n~~e~a~~~~~~~~~g 451 (530) -|. ++- .....++..|.-+++.|...++..-..++. ..+.+.|+.. +..|.. .....+..+| T Consensus 328 ~sn--~Eq----------~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~-~~~~~~f~~~~ll~~d~~---~~~~~~v~~G 391 (542) T protein:vir:41 328 GNF--AEV----------TRRTYYESVVRPQQNIISSILTDFFQVKFN-PKTRFKFNDETLLESDSV---RNCALLVQSG 391 (542) T ss_pred ccc--HHH----------HHHHHHHHHHHHHHHHHHHHHHhhcccccC-CceEEEecchhhcchHHH---HHHHHHHhCC Confidence 222 111 112233444444444444444432222222 2345566533 333322 2345678899 Q ss_pred CCcHHHHHHhCCCCCCHHHHH--------HHHHHHHHHH-HHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcc Q lcl|NC_011308. 452 QIQINNLLAIAPRIGDEETLK--------AICDTLDLDY-EDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPV 522 (530) Q Consensus 452 ~iS~et~l~~~~~vdd~~~e~--------~~~e~e~~e~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (530) +++...+++.++.++.-++.. ..++..+.+. .+.....++.. ..+ .+..++......+.|+.. - T Consensus 392 ilT~NE~Re~L~g~~pgdd~~l~p~~~~~~~~~~~~~n~~~~~~~~~~k~~----~k~-~~~~~~~~~~~~~~~~~~--~ 464 (542) T protein:vir:41 392 VLTPAEARERLFGLDGGPDIFMVPSKGAAKSVKRQERNYEKNQIREIRKIY----AKY-RPRFNEIISSKLSAEEKK--K 464 (542) T ss_pred CCCHHHHHHhhCCCCCCCccccccccccccccccCCcCCCCCchhhhhhcc----ccc-Cccccccccccccchhhc--c Confidence 999999988665444221110 0000000000 00000000000 000 000111111222222222 3 Q ss_pred cccccCCC Q lcl|NC_011308. 523 IEEEPVQE 530 (530) Q Consensus 523 ~~~~~~~~ 530 (530) ..++|+.+ T Consensus 465 ~~~~~~~~ 472 (542) T protein:vir:41 465 KIDESLAE 472 (542) T ss_pred cccchhhh Confidence 33444444 No 135 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=97.74 E-value=2.3e-05 Score=46.01 Aligned_cols=398 Identities=12% Similarity=-0.001 Sum_probs=158.8 Q ss_pred cccccCCcceeecCchhhHHhhhhhhhcccceeeecCCc-----chHHHHHHHHHHhh----c-----------cHHHHH Q lcl|NC_011308. 61 VEDDNASNIKISHGFFAELVDQKTQYLLANGIDVKPTDH-----DDQKLCYLIEEYYN----E-----------EFQSAI 120 (530) Q Consensus 61 ~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~~-----~de~~~~~l~~~~~----~-----------~~~~~~ 120 (530) .++.. =..+.+...|+..++.+.|-|+.+..... ......+.+..++. + .+.... T Consensus 1 l~~l~-----~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~ 75 (467) T protein:vir:31 1 MAELL-----EHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVL 75 (467) T ss_pred Chhhh-----hcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHH Confidence 11110 01466778889999999999988743221 11122222323221 1 122345 Q ss_pred HHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceE Q lcl|NC_011308. 121 QELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVW 199 (530) Q Consensus 121 ~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~ 199 (530) ..+..+...+|.||.++-++..|++ .+..++|..+-+..|...- .... ... ..++.+|...... T Consensus 76 ~~~~~~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~~-------~~~~-------~~~-~~~~~~~~~~~~~ 140 (467) T protein:vir:31 76 QTAWTDYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERGF-------VQLL-------EEK-EKYFGVAGDRYQT 140 (467) T ss_pred HHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeeccee-------Eeec-------CCc-eeeEEecccccee Confidence 5677788899999999888988875 4778888888766543210 0000 000 0011111110000 Q ss_pred EEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCc-----CCCCcH Q lcl|NC_011308. 200 YYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNK-----LGISDI 274 (530) Q Consensus 200 ~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~-----~~~sd~ 274 (530) .. .+....... ....+ .......|..=-|++++... .|.|.+ T Consensus 141 ~~----~~~~~~~~~-------------~~~~~----------------~~~~~~~~~~~diih~r~~~~~~~~~G~s~~ 187 (467) T protein:vir:31 141 NG----NGDLDPVFV-------------DADDG----------------STGTSVSNPANELIFKRNHSPLYPHYGAPDI 187 (467) T ss_pred ec----ccceeeeee-------------eeccc----------------cccceeEeccccEEEecCCCCCCCcccccHH Confidence 00 000000000 00000 00000111112234554332 467666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccce--eeeecCCCCch--hhHHHHHh-------------------hCcceecCCC Q lcl|NC_011308. 275 KKVKSIIDDYDLMNCFLSNNLQDMAEAI--YVVRGGTNSPV--DEIKKNIQ-------------------SKKIIQTKGE 331 (530) Q Consensus 275 e~v~~liDa~~~~~S~~~n~~~~~~~~~--lvl~g~~~~~~--~~~~~~~~-------------------~~~~i~~~~~ 331 (530) ......|+....+..-..+.+.-.+.|- +.++|...+.. +.++..+. ..+...+..+ T Consensus 188 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g 267 (467) T protein:vir:31 188 IPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKGAELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADG 267 (467) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCC Confidence 6544444433333222223333333343 34455433221 11222111 1122333444 Q ss_pred CceeEEEec-------CC-HHHHHHHHHHHHHHHHHHhcccCC--CcccccCC-cHHHHHHHHhhHHHHHHHHHHHHHHH Q lcl|NC_011308. 332 GGLDIQTVD-------IP-YEARKAKMDIDELNIYRSGMGFNS--SAVGDGNA-TNVVIKSRYTLLAMKAQKTEIALRKT 400 (530) Q Consensus 332 ~~~~~lt~~-------~~-~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~-SGvAik~~~~~l~~ka~~ke~~f~~~ 400 (530) .++..+..+ .+ +..+........+.|...-.+|.. +....|+. |++. - ....+++.. T Consensus 268 ~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~s~~e--~----------~~~~f~~~~ 335 (467) T protein:vir:31 268 ADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVESGAFSTDAE--E----------QRKEFAEET 335 (467) T ss_pred CcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcccCHH--H----------HHHHHHHHH Confidence 443332211 11 233345556677777777777742 22111221 2211 0 111222233 Q ss_pred HHHHHHHHHHHHHhcCCC---ccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_011308. 401 LRWTADLVVEDIRRRGLG---DYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDT 477 (530) Q Consensus 401 l~~~~~~i~~~l~~~~~~---~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~ 477 (530) |+-+++.|...++.+-.. ......+++.++.-+..|..+.+++...+..+|+++...+++.+++-.-++.++.- T Consensus 336 l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi~d~~~~~--- 412 (467) T protein:vir:31 336 IQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDEFGFEPFPEEHVYG--- 412 (467) T ss_pred HHHHHHHHHHHHHHhhcchhhccCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcccccC--- Confidence 333333344333322111 11122366666677778999999988899999999999999998652111111000 Q ss_pred HHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 478 LDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 478 e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) ... ....-..+..+....++.+...+..+.++....-.+...+++|.-+ T Consensus 413 --~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (467) T protein:vir:31 413 --GET--LVAEVTGGSGPGGGIGDQIEQLVEDRADEIIDSYQADLETEQLIEI 461 (467) T ss_pred --Ccc--cccccccccCCCCcccCcCCCCCCCcccchHhhhhhccccchhhhh Confidence 000 0000000000000000000000000000000000111111122111 No 136 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=97.72 E-value=2.5e-05 Score=45.80 Aligned_cols=381 Identities=12% Similarity=0.022 Sum_probs=163.7 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |.+....+.. ........ .....+..+.. .+..+. +++-+.+.-....|+..+.=+.+- T Consensus 1 M~~f~~~~~~----~~~~~~~~-~~~~~~~~~~~------------~~~~v~----~~~al~~~~V~~~v~~ia~~ia~~ 59 (397) T protein:vir:38 1 MPLLKLNKSH----SQGFSLND-PDWVNFLTGGE------------AQKYVS----ADTALKNSDIFSLIMQLSGDLAMV 59 (397) T ss_pred Ccchhhhhcc----cCcccCCc-hhhhhhhcCCc------------CCceec----hHHhhccHHHHHHHHHHHHHHhhC Confidence 4333221110 00000000 00011111100 000000 000011111222344444445555 Q ss_pred ceeeecCCcchHHHHHHHHHHhhc-cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCceeE Q lcl|NC_011308. 91 GIDVKPTDHDDQKLCYLIEEYYNE-EFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQRI 168 (530) Q Consensus 91 pv~~~~~~~~de~~~~~l~~~~~~-~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~ 168 (530) |++.. +......+.+-... ......+.+......+|.||.++-++..|.+ .+..++|..+-+..+..+... T Consensus 60 p~~~~-----~~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~~-- 132 (397) T protein:vir:38 60 RYTSE-----SDRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQDGSGL-- 132 (397) T ss_pred ccccc-----ccHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceE-- Confidence 66532 11122222111111 2234556777888999999998888988876 578899999887776544321 Q ss_pred EEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccc Q lcl|NC_011308. 169 IRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGR 248 (530) Q Consensus 169 ~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 248 (530) +|.+...... ......+..+.+.|++..... T Consensus 133 --~y~~~~~~~~------~~~~~~~~~~eiih~~~~~~~----------------------------------------- 163 (397) T protein:vir:38 133 --IYNINFDEPA------IGYMENVPAADVIHIRLLSKN----------------------------------------- 163 (397) T ss_pred --EEEEEecccc------ccceeEecCccEEEecCCCCC----------------------------------------- Confidence 1222110000 001123445555555321100 Q ss_pred cccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchh---hHHHHH----- Q lcl|NC_011308. 249 QVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVD---EIKKNI----- 320 (530) Q Consensus 249 ~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~---~~~~~~----- 320 (530) +.-.|.|.+......|+....+..-..+.+.-.+.|-.+++-......+ .++... T Consensus 164 -----------------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~~~~ 226 (397) T protein:vir:38 164 -----------------GGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGGLLDAETRIARSKEISKQ 226 (397) T ss_pred -----------------CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhc Confidence 0114777777776666665555555555555566666666532221111 111111 Q ss_pred --hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--CcccccCCcHHHHHHHHhhHHHHHHHHHHH Q lcl|NC_011308. 321 --QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGDGNATNVVIKSRYTLLAMKAQKTEIA 396 (530) Q Consensus 321 --~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~SGvAik~~~~~l~~ka~~ke~~ 396 (530) ..++++.++++.+++-++....+.......+.+.+.|...-.+|.. +....++ |..+ ..+.. T Consensus 227 ~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~-~~~e-------------~~~~~ 292 (397) T protein:vir:38 227 IHNSDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQ-SSIT-------------QISGQ 292 (397) T ss_pred ccccCCceecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc-cHHH-------------HHHHH Confidence 1234555666655555554444455556677888888888888742 2211111 2111 11223 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCC--CCHHHHHHH Q lcl|NC_011308. 397 LRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRI--GDEETLKAI 474 (530) Q Consensus 397 f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~v--dd~~~e~~~ 474 (530) |...|+-.++.|..-++.+-..++ ++.+.| .+-.|..+.++....+..+|+++...+++.+++- .+.+ +-. T Consensus 293 ~~~~l~P~~~~ie~~ln~~l~~~~---~~~~~~--~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d--~~~ 365 (397) T protein:vir:38 293 YAKSLNRYVQAIVGELNDKLHANI---SANIRF--AIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYLAKD--LPD 365 (397) T ss_pred HHHHHHHHHHHHHHHHHHhccChh---cccccc--cccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCc--ccc Confidence 444555555555554444322222 122333 3334677778888889999999999998876531 1110 000 Q ss_pred HHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccC Q lcl|NC_011308. 475 CDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPV 528 (530) Q Consensus 475 ~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 528 (530) .+ ..... ... .....+....+.++. +.+.+ |. T Consensus 366 ~~-------~~~~~-~~~---~~~~~~g~~~~~~~~-----e~~~~------~~ 397 (397) T protein:vir:38 366 PE-------KEPQQ-AIQ---LIQQEGGENDGNNSD-----ERGSD------PE 397 (397) T ss_pred cc-------ccccc-ccc---ccccccCCCCCCCCC-----CCCCC------CC Confidence 00 00000 000 000000000000000 01111 11 No 137 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=97.60 E-value=4e-05 Score=44.72 Aligned_cols=419 Identities=11% Similarity=0.024 Sum_probs=170.1 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccc-hhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhcc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDN-DIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLA 89 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~-~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G 89 (530) |- ++..+..+... .... -... +.|.... .+..--... ..+..+.... -.|++. .-.-|+..+.=+.+ T Consensus 1 Mg---~~~~l~~~~~~-~~~~-~~~~-~~~~~~~~~~~~~~~~~--~~g~~v~~~~--al~~~~--v~~~i~~ia~~iA~ 68 (457) T protein:vir:62 1 MG---FWSALFGRGHS-PALD-AAEG-RAWEPYDPSIYNLGATA--SSGERVTPHD--ALQVSA--VFASVRLLSETIAT 68 (457) T ss_pred Cc---hhhhhhccccc-cccc-cccc-cccccchhhhhhccccc--cCCceechHH--hhccHH--HHHHHHHHHHhHhh Confidence 22 22221111000 0000 0000 0000000 000000000 0000000000 011111 12235555555666 Q ss_pred cceeeecCCcchHH-H-HHHHHHHhh--cc---HHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcC Q lcl|NC_011308. 90 NGIDVKPTDHDDQK-L-CYLIEEYYN--EE---FQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDD 161 (530) Q Consensus 90 ~pv~~~~~~~~de~-~-~~~l~~~~~--~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~ 161 (530) -|+++.-...+... . ...+..++. ++ .......+......+|.||.++-.+ .|.+ .+..++|..+.+..+. T Consensus 69 lp~~~~~~~~~~~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~ 147 (457) T protein:vir:62 69 LPLSTYSKRGGTRKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA-GPNIAGLDVLDPTKIHVHMVM 147 (457) T ss_pred CceEEEEecCCccccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEec Confidence 68775332222111 1 111222221 22 2345566777888999999888554 4554 4677889888765543 Q ss_pred CCC-ceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecc Q lcl|NC_011308. 162 YGT-LQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDE 240 (530) Q Consensus 162 ~~~-~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (530) ... ....+..|... .. + .......|+++.+.|++.... T Consensus 148 ~~~~~~~~~~~y~~~---~~---g-~~~~~~~~~~~eiih~r~~~~---------------------------------- 186 (457) T protein:vir:62 148 VDGLRRKVFEAYDID---AD---G-NEVLLGWFTPRDVLHIPGMML---------------------------------- 186 (457) T ss_pred cCCccceeEEEEEEc---cC---C-ceeEEEeeCccceEEecCCCC---------------------------------- Confidence 322 22222223221 00 0 112223455666666542110 Q ss_pred cccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-CCc--hhhHH Q lcl|NC_011308. 241 GVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT-NSP--VDEIK 317 (530) Q Consensus 241 ~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~-~~~--~~~~~ 317 (530) .+ .-.|.|-++.....|.....+..-.++.+.-.+.|-.+|+-.. .++ .+.++ T Consensus 187 ---------------~~---------~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~ 242 (457) T protein:vir:62 187 ---------------PG---------DFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGTMSEEGLARAR 242 (457) T ss_pred ---------------CC---------ceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCCCCHHHHHHHH Confidence 00 0146676766666555555555444555555555655554322 121 12222 Q ss_pred HHH----h----hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHH Q lcl|NC_011308. 318 KNI----Q----SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLA 387 (530) Q Consensus 318 ~~~----~----~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~ 387 (530) ..+ . .++++.+++|.+++-++....+..+..........|...-.+|. +++..-++.+|..++-... T Consensus 243 ~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~--- 319 (457) T protein:vir:62 243 EAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNI--- 319 (457) T ss_pred HHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHH--- Confidence 211 1 24466777777776666544444445556677888888888885 2322223332322221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc--ccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC- Q lcl|NC_011308. 388 MKAQKTEIALRKTLRWTADLVVEDIRRRGLGDY--SSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR- 464 (530) Q Consensus 388 ~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~--d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~- 464 (530) .++...|.-.++.|...++.+-..+. ....+++.++.-+-.|..+.++...++.++|+++...+++.+++ T Consensus 320 -------~f~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~ 392 (457) T protein:vir:62 320 -------AFTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMT 392 (457) T ss_pred -------HHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 12233333333333333333221111 12234455545555688888888889999999999999988754 Q ss_pred -CCCH--HHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCC-CCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 465 -IGDE--ETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIID-PLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 465 -vdd~--~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) +++. +....-. .+.....+.+....+....... ..++.+.+.|.+ +..+|..+ T Consensus 393 pi~~g~~D~~~~~~---------n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~d~~ 449 (457) T protein:vir:62 393 PLPDGLGEKYRVPL---------NLGEIGEEPEPEPAPAPPAIDPPAEEPADDEEPDN----AEGDPDEG 449 (457) T ss_pred CCCCCCcceeeecc---------ccccccccccccccCCCccCCCCccCCCCCCCCCC----CCCCCccc Confidence 4433 1111000 0000000000000000000000 000000111111 12333333 No 138 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=97.57 E-value=4.4e-05 Score=44.49 Aligned_cols=374 Identities=13% Similarity=0.015 Sum_probs=166.6 Q ss_pred cccchhhhcccc----cc---------------cccccccccccCCcceeecCchhhHHhhhhhhhcccceeeecCCcch Q lcl|NC_011308. 41 NQDNDIENTRIM----WM---------------NDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLANGIDVKPTDHDD 101 (530) Q Consensus 41 ~g~~~I~~r~~~----~~---------------~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~d 101 (530) .|- ++|..+ .. +..+..+... .-.|.+. .-.-|+..++=+.+-|+++.... .. T Consensus 1 Mg~---f~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~al~~~~--v~~cv~~Ia~~iA~~p~~~~~~~-~~ 72 (416) T protein:vir:81 1 MGI---FYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDI--EAIRHSD--IFTAVMMIASDLARMPIRVTVNG-QI 72 (416) T ss_pred CCc---ccccccccccCCCcchhHHHHHhccccccCccccchh--hhhcchH--HHHHHHHHHHhhccCceEEecCc-cc Confidence 111 111000 00 0000000000 0011111 11135666666667788764321 11 Q ss_pred HHHHHHHHHHhh---cc---HHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCceeEEEEEEE Q lcl|NC_011308. 102 QKLCYLIEEYYN---EE---FQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQRIIRFYTE 174 (530) Q Consensus 102 e~~~~~l~~~~~---~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~y~~ 174 (530) ....-+..++. |. .......+......+|.||.++-++..|.+ .+..++|..+-++.|+.+.+..... T Consensus 73 -~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~---- 147 (416) T protein:vir:81 73 -NYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQ---- 147 (416) T ss_pred -cccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEE---- Confidence 11111222221 22 223446677778899999999999998886 4778899999988887665322111 Q ss_pred EeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccc Q lcl|NC_011308. 175 QRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRS 254 (530) Q Consensus 175 ~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 254 (530) ..... .......|..+.+.|++... T Consensus 148 -~~~~~-----~~~~~~~~~~~evihir~~~------------------------------------------------- 172 (416) T protein:vir:81 148 -RIDSN-----GNNIERNVKFEDMLDIKFYS------------------------------------------------- 172 (416) T ss_pred -EecCC-----CceeEEEEccccEEEeccCC------------------------------------------------- Confidence 10110 01111234455555443100 Q ss_pred cCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeee--cCCCCch--hhHHHHH----h----h Q lcl|NC_011308. 255 YKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVR--GGTNSPV--DEIKKNI----Q----S 322 (530) Q Consensus 255 ~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~--g~~~~~~--~~~~~~~----~----~ 322 (530) + +.-.|.|.++.....|+....+..-..+.+.-.+.|-.+++ |...++. +.++..+ . . T Consensus 173 -~---------d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~na 242 (416) T protein:vir:81 173 -L---------DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQA 242 (416) T ss_pred -C---------CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCcccc Confidence 0 01146677776666665544444444444555555666554 4322211 1122211 1 2 Q ss_pred CcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--CcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHH Q lcl|NC_011308. 323 KKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKT 400 (530) Q Consensus 323 ~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~ 400 (530) ++++.++++.+++-++.+..+..+.......++.|...-.+|.. +... ++.|-...+ ..|... T Consensus 243 g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~~~~~~~--------------~~~~~~ 307 (416) T protein:vir:81 243 GKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET-ANMSITDAN--------------LDYLST 307 (416) T ss_pred CceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-CCccHHHHH--------------HHHHHH Confidence 34666776666655554444444445556777888888888842 2111 111211111 123334 Q ss_pred HHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHHHHHHHHHH Q lcl|NC_011308. 401 LRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEETLKAICDTL 478 (530) Q Consensus 401 l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~e~~~~e~e 478 (530) |.-+++.|...++.+-........+++.++.-+-.|..+.++....+..+|+++...+++.+++ +++.+...-.+... T Consensus 308 l~P~~~~ie~~ln~~l~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n 387 (416) T protein:vir:81 308 LKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLN 387 (416) T ss_pred HHHHHHHHHHHHhhhccccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccc Confidence 4444444444444332222222334444444444678888888888999999999999888754 44433211110000 Q ss_pred HHHHHHHHHhhhccccccCCccccCCCCCCCCC Q lcl|NC_011308. 479 DLDYEDVVKALEDQEVEELEPTVTPIIDPLTIE 511 (530) Q Consensus 479 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (530) -.. -+.....+............+. ++++ T Consensus 388 ~~~-~~~~~~~~~~~~~~~~~~~kgG---e~n~ 416 (416) T protein:vir:81 388 HVN-IELVDEYQMNKSRATDKKLKGG---EENE 416 (416) T ss_pred ccc-cccccccCcccccccccccCCC---CCCC Confidence 000 0000000000000000001110 0011 No 139 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=97.57 E-value=4.4e-05 Score=44.49 Aligned_cols=374 Identities=13% Similarity=0.015 Sum_probs=166.6 Q ss_pred cccchhhhcccc----cc---------------cccccccccccCCcceeecCchhhHHhhhhhhhcccceeeecCCcch Q lcl|NC_011308. 41 NQDNDIENTRIM----WM---------------NDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLANGIDVKPTDHDD 101 (530) Q Consensus 41 ~g~~~I~~r~~~----~~---------------~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~d 101 (530) .|- ++|..+ .. +..+..+... .-.|.+. .-.-|+..++=+.+-|+++.... .. T Consensus 1 Mg~---f~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~al~~~~--v~~cv~~Ia~~iA~~p~~~~~~~-~~ 72 (416) T protein:vir:45 1 MGI---FYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDI--EAIRHSD--IFTAVMMIASDLARMPIRVTVNG-QI 72 (416) T ss_pred CCc---ccccccccccCCCcchhHHHHHhccccccCccccchh--hhhcchH--HHHHHHHHHHhhccCceEEecCc-cc Confidence 111 111000 00 0000000000 0011111 11135666666667788764321 11 Q ss_pred HHHHHHHHHHhh---cc---HHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCceeEEEEEEE Q lcl|NC_011308. 102 QKLCYLIEEYYN---EE---FQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQRIIRFYTE 174 (530) Q Consensus 102 e~~~~~l~~~~~---~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~y~~ 174 (530) ....-+..++. |. .......+......+|.||.++-++..|.+ .+..++|..+-++.|+.+.+..... T Consensus 73 -~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~---- 147 (416) T protein:vir:45 73 -NYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQ---- 147 (416) T ss_pred -cccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEE---- Confidence 11111222221 22 223446677778899999999999998886 4778899999988887665322111 Q ss_pred EeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccc Q lcl|NC_011308. 175 QRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRS 254 (530) Q Consensus 175 ~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 254 (530) ..... .......|..+.+.|++... T Consensus 148 -~~~~~-----~~~~~~~~~~~evihir~~~------------------------------------------------- 172 (416) T protein:vir:45 148 -RIDSN-----GNNIERNVKFEDMLDIKFYS------------------------------------------------- 172 (416) T ss_pred -EecCC-----CceeEEEEccccEEEeccCC------------------------------------------------- Confidence 10110 01111234455555443100 Q ss_pred cCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeee--cCCCCch--hhHHHHH----h----h Q lcl|NC_011308. 255 YKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVR--GGTNSPV--DEIKKNI----Q----S 322 (530) Q Consensus 255 ~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~--g~~~~~~--~~~~~~~----~----~ 322 (530) + +.-.|.|.++.....|+....+..-..+.+.-.+.|-.+++ |...++. +.++..+ . . T Consensus 173 -~---------d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~na 242 (416) T protein:vir:45 173 -L---------DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQA 242 (416) T ss_pred -C---------CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCcccc Confidence 0 01146677776666665544444444444555555666554 4322211 1122211 1 2 Q ss_pred CcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--CcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHH Q lcl|NC_011308. 323 KKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKT 400 (530) Q Consensus 323 ~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~ 400 (530) ++++.++++.+++-++.+..+..+.......++.|...-.+|.. +... ++.|-...+ ..|... T Consensus 243 g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~~~~~~~--------------~~~~~~ 307 (416) T protein:vir:45 243 GKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET-ANMSITDAN--------------LDYLST 307 (416) T ss_pred CceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-CCccHHHHH--------------HHHHHH Confidence 34666776666655554444444445556777888888888842 2111 111211111 123334 Q ss_pred HHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHHHHHHHHHH Q lcl|NC_011308. 401 LRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEETLKAICDTL 478 (530) Q Consensus 401 l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~e~~~~e~e 478 (530) |.-+++.|...++.+-........+++.++.-+-.|..+.++....+..+|+++...+++.+++ +++.+...-.+... T Consensus 308 l~P~~~~ie~~ln~~l~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n 387 (416) T protein:vir:45 308 LKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLN 387 (416) T ss_pred HHHHHHHHHHHHhhhccccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccc Confidence 4444444444444332222222334444444444678888888888999999999999888754 44433211110000 Q ss_pred HHHHHHHHHhhhccccccCCccccCCCCCCCCC Q lcl|NC_011308. 479 DLDYEDVVKALEDQEVEELEPTVTPIIDPLTIE 511 (530) Q Consensus 479 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (530) -.. -+.....+............+. ++++ T Consensus 388 ~~~-~~~~~~~~~~~~~~~~~~~kgG---e~n~ 416 (416) T protein:vir:45 388 HVN-IELVDEYQMNKSRATDKKLKGG---EENE 416 (416) T ss_pred ccc-cccccccCcccccccccccCCC---CCCC Confidence 000 0000000000000000001110 0011 No 140 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=97.53 E-value=4.9e-05 Score=44.20 Aligned_cols=380 Identities=9% Similarity=-0.002 Sum_probs=165.9 Q ss_pred HHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccc-cccccccccCCcce--eecCchhhHHhhhhhhhcccceeeec Q lcl|NC_011308. 20 KIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMND-HGDIVEDDNASNIK--ISHGFFAELVDQKTQYLLANGIDVKP 96 (530) Q Consensus 20 ~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~-~~~~~~~~~~~n~k--i~~n~~k~Ivd~~~~yl~G~pv~~~~ 96 (530) ++ |. +-++-+..-.......... .+... .....+.+ +...-....|+..++=+.+-|+.+-- T Consensus 1 m~--f~------------~~~~~~~~~~~~~~~~~~~~~g~~~-~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~ 65 (409) T protein:vir:10 1 ML--FR------------KGFKNQSQEISIDDKKILEWLGINP-SETYVNGKSCLKQATVFGCIRILSDNISKLPIKIYQ 65 (409) T ss_pred Cc--cc------------ccccCcCCCCCCChHHHHHHhcCCc-CcceechhhhhccHHHHHHHHHHHHhhhhCceEEEE Confidence 11 00 0000000000000000000 00000 00000000 11111233455556666677876522 Q ss_pred CCcchH-HHHHHHHHHhh---cc---HHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCceeE Q lcl|NC_011308. 97 TDHDDQ-KLCYLIEEYYN---EE---FQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQRI 168 (530) Q Consensus 97 ~~~~de-~~~~~l~~~~~---~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~ 168 (530) ..++.+ ....-+..++. |. ..+....+..+...+|.||.++-++..|.+ .+..++|..+-++.++.+....- T Consensus 66 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~ 145 (409) T protein:vir:10 66 KKDGIKRVPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLLNSE 145 (409) T ss_pred ecCCeeeccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCcccccc Confidence 221111 11111222221 22 234456778888999999999999999986 47788999988887765433221 Q ss_pred EEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccc Q lcl|NC_011308. 169 IRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGR 248 (530) Q Consensus 169 ~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 248 (530) -.+++... .. .+ ....+..+.+.|++.... T Consensus 146 ~~~~y~~~--~~--~g----~~~~~~~~evih~r~~~~------------------------------------------ 175 (409) T protein:vir:10 146 NNVWYLYT--DD--LG----QRHKFMSDEILHFKGLTA------------------------------------------ 175 (409) T ss_pred ceEEEEEE--eC--Cc----eeEEeccccEEEecCcCC------------------------------------------ Confidence 11111000 00 00 011233444444321100 Q ss_pred cccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-CCc--hhhHHHHH----- Q lcl|NC_011308. 249 QVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT-NSP--VDEIKKNI----- 320 (530) Q Consensus 249 ~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~-~~~--~~~~~~~~----- 320 (530) +.-.|.|-++.....++....+..-..+.+.-.+.|-.+++... .++ .+.++..+ T Consensus 176 -----------------d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~ 238 (409) T protein:vir:10 176 -----------------DGLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAGDLNPEAEEVFKENFERMSS 238 (409) T ss_pred -----------------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCHHHHHHHHHHHHHHhc Confidence 01136777776666666655555555555555556766665422 111 12222211 Q ss_pred ---hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--CcccccCCcHHHHHHHHhhHHHHHHHHHH Q lcl|NC_011308. 321 ---QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGDGNATNVVIKSRYTLLAMKAQKTEI 395 (530) Q Consensus 321 ---~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~SGvAik~~~~~l~~ka~~ke~ 395 (530) ..++++.+++|.+++-+.....+.......+...+.|...-.+|.. +....|+ +..+. ..... T Consensus 239 g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~--~~~~e----------~~~~~ 306 (409) T protein:vir:10 239 GLKNAHRIAMLPIGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRAT--HSNIT----------EQNRE 306 (409) T ss_pred cccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCc--cccHH----------HHHHH Confidence 1234666776666666654444444455667788888888888842 2211122 21111 11123 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCccc-cceeeEEeC--CCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHH Q lcl|NC_011308. 396 ALRKTLRWTADLVVEDIRRRGLGDYS-STDIKFDIE--PYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEET 470 (530) Q Consensus 396 ~f~~~l~~~~~~i~~~l~~~~~~~~d-~~~i~i~f~--~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~ 470 (530) ++...|+-.++.|...++.+-....+ .....+.|. .-+-.|..+.++....+..+|+++...+++.+++ +++-+. T Consensus 307 f~~~~l~P~~~~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~ggD~ 386 (409) T protein:vir:10 307 FYIDTLQSILNMYELEINYKLFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEGGDV 386 (409) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe Confidence 34444555555554444432111111 122334443 3334677888888889999999999888888754 222111 Q ss_pred HHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCC Q lcl|NC_011308. 471 LKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLT 509 (530) Q Consensus 471 e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (530) -... ..+..+....++ ...+ ..+ T Consensus 387 ~~~~---------~n~~~~~~~~~~----~~kg---Ge~ 409 (409) T protein:vir:10 387 LLIN---------GNMIPVKMAGEQ----YSKG---GEK 409 (409) T ss_pred eeec---------cCccchhhcccc----cccc---CCC Confidence 1000 000000000000 0000 000 No 141 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=97.51 E-value=5.4e-05 Score=43.98 Aligned_cols=410 Identities=11% Similarity=0.040 Sum_probs=166.1 Q ss_pred HHH---HHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcce----eecCchhhHHhhhhhhhc Q lcl|NC_011308. 16 ILS---TKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIK----ISHGFFAELVDQKTQYLL 88 (530) Q Consensus 16 ~i~---~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~k----i~~n~~k~Ivd~~~~yl~ 88 (530) ++. +.+..-...++..+ ....|.+.-.+ + ..-....... ..++-....|+..+.=+. T Consensus 1 ~~~~~~~~~~~p~~~e~~~~---~~~~~~~~~~~-----------~--~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA 64 (518) T protein:vir:10 1 MLLANGQTLSAPAMAELSPQ---MQDSYYYAPAV-----------G--MQLERQFSLYGGIYKNQPWVRTVIAKRAQALA 64 (518) T ss_pred CcccCceeecCchhhhhhhh---hhccccccccc-----------c--eecccccchhhHHHhhhHHHHHHHHHHHHhhc Confidence 000 00000000000000 01111110000 0 0000000000 011223445666666666 Q ss_pred ccceeeecCC--cchHHHHHHHHHHhh--ccH---HHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEc Q lcl|NC_011308. 89 ANGIDVKPTD--HDDQKLCYLIEEYYN--EEF---QSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFD 160 (530) Q Consensus 89 G~pv~~~~~~--~~de~~~~~l~~~~~--~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d 160 (530) +-|+.+--.. ...+.....+..++. |.. ......+......+|.+|.++-++.+|++ .+..++|..|.+..+ T Consensus 65 ~lpl~l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~ 144 (518) T protein:vir:10 65 RLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRN 144 (518) T ss_pred cCceEEEEEcCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEc Confidence 6677642211 111111122333332 222 23445667788899999999999999986 478899999988776 Q ss_pred CCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecc Q lcl|NC_011308. 161 DYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDE 240 (530) Q Consensus 161 ~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (530) ...... +|....... .. ...-.|..+.+.|++.... T Consensus 145 ~~~~~~----~y~~~~~~~---~~---~~~~~~~~~eViHir~~s~---------------------------------- 180 (518) T protein:vir:10 145 SRTGRY----EYYFQAGAG---VG---TQLVSFADDEVVPIRFFNP---------------------------------- 180 (518) T ss_pred CCCCEE----EEEEEecCC---cc---ceEEEecCCcEEEecCCCC---------------------------------- Confidence 533211 111111000 00 1112334555555432110 Q ss_pred cccccccccccccccCCccceEEeeCC-cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC-Cc--hhhH Q lcl|NC_011308. 241 GVEEHEGRQVLGRSYKSRFPFDILYNN-KLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN-SP--VDEI 316 (530) Q Consensus 241 ~~~~~~~~~~~~~~~~~~iPiv~~~nn-~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~-~~--~~~~ 316 (530) ++ ..|.|-+......|.....+..-.++.+.-.+.|-.+++.... ++ ...+ T Consensus 181 -------------------------dg~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~ 235 (518) T protein:vir:10 181 -------------------------DGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRL 235 (518) T ss_pred -------------------------CcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHH Confidence 01 1366666654444444444444444444444556555543221 11 1222 Q ss_pred HHHHh--------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhH Q lcl|NC_011308. 317 KKNIQ--------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLL 386 (530) Q Consensus 317 ~~~~~--------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l 386 (530) +..+. .++++.+++|.+++-++....+...........+.|...-.+|. ++...-++-|++. T Consensus 236 k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~e-------- 307 (518) T protein:vir:10 236 REQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNIS-------- 307 (518) T ss_pred HHHHHHHhcCccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHH-------- Confidence 22221 23466677666666665443333344445666677777777773 2211112222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc-cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC- Q lcl|NC_011308. 387 AMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYS-STDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR- 464 (530) Q Consensus 387 ~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d-~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~- 464 (530) .....++...|.-.+..|..-++.+-...+. ...+++.++.-+-.|..+.++....+..+|+++...+++.+++ T Consensus 308 ----q~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~~~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~ 383 (518) T protein:vir:10 308 ----AQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLP 383 (518) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 1112223333444444443333322111111 1234444455566788888888999999999999999888764 Q ss_pred -CCCHHHHHHHHHHHHHHHHHHHHh---hhccccccCCccc------cCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 465 -IGDEETLKAICDTLDLDYEDVVKA---LEDQEVEELEPTV------TPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 465 -vdd~~~e~~~~e~e~~e~~~~~~~---~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) ++++-...-.+ ...+.. ...+..++.+... .+..+. .+.+.+.+.+..+.+..++... T Consensus 384 pie~~~gD~~~~-------~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 451 (518) T protein:vir:10 384 RSDDPKADELYA-------NSALQPLGATPDGAVEGEEAPAPKRPASTPVASL-DQSPPTSVPGLSPTNSDRSTDS 451 (518) T ss_pred CCCCCCCCeeee-------cccceecccccccccCCCCCCCCCCCCccccccc-cccccccCCCCCcccccccccc Confidence 44322110000 000011 1111111000000 000000 0111111111112222222222 No 142 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=97.37 E-value=8.2e-05 Score=43.00 Aligned_cols=391 Identities=12% Similarity=0.032 Sum_probs=176.1 Q ss_pred ccHHHHHHHHHHHHHHhhh-HHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhcc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQN-VSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLA 89 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~-~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G 89 (530) |-...++++.- ..+.. .........++-|..-. .+..+ .++.-+...-...-|+..+.=+.+ T Consensus 1 m~~~~~f~~~~---~~~~~~~~~~~~~~~~~~~~~~~----------~~~~v----~~~~al~~~~v~~~i~~Ia~~ia~ 63 (416) T protein:vir:12 1 MLLERMFEKRS---GSSDHEDGFNNILLNMFGGRKTA----------SGERV----SESNSLVQPDIFACVNVLSDDIAK 63 (416) T ss_pred Cccchhccccc---CccccCccchhHHHHhhcCcccc----------cCcee----chhhhhccHHHHHHHHHHHHhhhh Confidence 32222211110 00000 00011223333222100 00000 000111122233456666666667 Q ss_pred cceee-ecCCcchH-----HHHHHHHHHhhc--cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEc Q lcl|NC_011308. 90 NGIDV-KPTDHDDQ-----KLCYLIEEYYNE--EFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFD 160 (530) Q Consensus 90 ~pv~~-~~~~~~de-----~~~~~l~~~~~~--~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d 160 (530) -|+++ ...+.+.+ .....|..-=+. ........+..+...+|.||.++-++..|.+ ....++|..+-++.+ T Consensus 64 l~~~~~~~~~~~~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~ 143 (416) T protein:vir:12 64 LPIHTYKRTDGGIERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDYTNAYVH 143 (416) T ss_pred CceEEEEecCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEe Confidence 78764 32222211 122222211011 1234456677788899999999989888876 477899999887765 Q ss_pred CCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecc Q lcl|NC_011308. 161 DYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDE 240 (530) Q Consensus 161 ~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (530) ...... +|.+.. . +.. ..+.++.+.|++... T Consensus 144 ~~~~~~----~~~~~~-~-----g~~----~~~~~~eiih~~~~~----------------------------------- 174 (416) T protein:vir:12 144 PTTGML----WYQTVL-N-----GKA----IELYDYEVLHFKGLS----------------------------------- 174 (416) T ss_pred CCCcEE----EEEEec-C-----CeE----EEecCccEEEecCcC----------------------------------- Confidence 543221 222110 0 110 123444455443110 Q ss_pred cccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC-Cc--hhhHH Q lcl|NC_011308. 241 GVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN-SP--VDEIK 317 (530) Q Consensus 241 ~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~-~~--~~~~~ 317 (530) + +.-.|.|.++.....++....+..-..+.+.-.+.|-.+++-... ++ .+.++ T Consensus 175 ---------------~---------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~ 230 (416) T protein:vir:12 175 ---------------T---------DGIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPAFLDEKPKENVR 230 (416) T ss_pred ---------------C---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHH Confidence 0 111467777776666666555555555656666667666653221 11 12222 Q ss_pred HH----HhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--CcccccCCcHHHHHHHHhhHHHHHH Q lcl|NC_011308. 318 KN----IQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGDGNATNVVIKSRYTLLAMKAQ 391 (530) Q Consensus 318 ~~----~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~SGvAik~~~~~l~~ka~ 391 (530) .. ...++++.+++|.+++-++....+.......+.....|...-.+|.. +...-|+-|+..= T Consensus 231 ~~~~~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~------------ 298 (416) T protein:vir:12 231 KEWKRVNKVENIAIIDYGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEH------------ 298 (416) T ss_pred HHHHHHhcCCCeeecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHH------------ Confidence 22 23556666777766666654444444445566777777777777742 2221122222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCccc---cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CC Q lcl|NC_011308. 392 KTEIALRKTLRWTADLVVEDIRRRGLGDYS---STDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IG 466 (530) Q Consensus 392 ~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d---~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vd 466 (530) ....++...|.-++..|...++.+-....+ ...+++.++.-+..|..+.++....+...|+++...+++.++. ++ T Consensus 299 ~~~~f~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ 378 (416) T protein:vir:12 299 QSIEYVRNTLQPWIVNFEQELNVKLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIE 378 (416) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 111233445555555555444433211111 1234555555577899999999999999999999999988754 33 Q ss_pred CHHHHHHHHHHHHHHHHHHHHhhhc-cccccCCccccC Q lcl|NC_011308. 467 DEETLKAICDTLDLDYEDVVKALED-QEVEELEPTVTP 503 (530) Q Consensus 467 d~~~e~~~~e~e~~e~~~~~~~~~~-~~~~~~~~~~~~ 503 (530) +-+..+....-...+..+..+.... +..++.++.+++ T Consensus 379 ggd~~~~~~n~~~~~~~~~~~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 379 NGDKYISSLNYVFLDFLEEYQRLKAGGAMKGGDNKNEG 416 (416) T ss_pred CcceeeeccccccccccchhhccccccccCCCCCcCCC Confidence 3221110000000000000000000 001111111111 No 143 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=97.36 E-value=8.6e-05 Score=42.88 Aligned_cols=440 Identities=12% Similarity=0.039 Sum_probs=168.2 Q ss_pred HHHHHHHHHHHHHh-hhHHHHHHHHHHhcccchhhhcccc--------cc-c---ccccccccccCCcceeecCc----- Q lcl|NC_011308. 14 GTILSTKIDEYIRS-QNVSLARVGQRYYNQDNDIENTRIM--------WM-N---DHGDIVEDDNASNIKISHGF----- 75 (530) Q Consensus 14 ~~~i~~~i~~~~~~-~~~~~~~~~~~YY~g~~~I~~r~~~--------~~-~---~~~~~~~~~~~~n~ki~~n~----- 75 (530) .-+++.+-+.|.-. ++...+..+.+|-+ ++++|... .. + ..+....-..++..|-...| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~ 77 (535) T protein:vir:10 1 MAILKDLRNAFSLSNKKSTSYIELGDYDK---DIVNKAIRPGRASARDTVDGIDIADGNVAGQYSVASISDVLSTKKLLK 77 (535) T ss_pred ChhhHHHHHHHHhhhhhhhhhHHHhhhhH---HHHHhhhhhhhhhhhccccccccccCCcccccccCccccccCHHHHHH Confidence 12344433333211 11222333333322 12111110 00 0 00000000011111111111 Q ss_pred -------hhhHH----hhhhhh---------hcccceeeecCCc-ch---HHHHHHHHHHhh---ccH-------HHHHH Q lcl|NC_011308. 76 -------FAELV----DQKTQY---------LLANGIDVKPTDH-DD---QKLCYLIEEYYN---EEF-------QSAIQ 121 (530) Q Consensus 76 -------~k~Iv----d~~~~y---------l~G~pv~~~~~~~-~d---e~~~~~l~~~~~---~~~-------~~~~~ 121 (530) .+.++ +..+.| ..|-|+.+.-.+. .. ......|..++. |.+ ..... T Consensus 78 ~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~ 157 (535) T protein:vir:10 78 AYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLT 157 (535) T ss_pred HhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHHHHHHH Confidence 12222 333333 2355666542221 11 111122344442 221 12344 Q ss_pred HHHHHHhhcC-eEEEEEEecCCCceE-EEEecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceE Q lcl|NC_011308. 122 ELVEGSTIKG-YEGIFARTTSEDKLT-FQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVW 199 (530) Q Consensus 122 e~~~~~~~~G-~a~~~~y~d~~g~~~-~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~ 199 (530) .+..+...+| .+|.++..+..|++. +..++|..+.+..+..+..... .+|.... .. ....|..+.+. T Consensus 158 ~lv~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p~~V~v~~d~~~~~~~~-~~~~~~~-------~~---~~~~~~~~eii 226 (535) T protein:vir:10 158 KIINDMYVQDQINIERIFKNDSNELDHFNAVDASKVVISYSPRSKDQPR-KFEQFVS-------ET---KSVKFSERNLT 226 (535) T ss_pred HHHHHHHhhCCceEEEEEECCCCcEEEEEEeCCceeEEEEcCccccCce-EEEEEec-------Cc---eeEEECcccEE Confidence 4555555554 689999999999875 7889999998887755432111 1121110 00 01124455555 Q ss_pred EEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHH Q lcl|NC_011308. 200 YYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKS 279 (530) Q Consensus 200 ~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~ 279 (530) |++....... ...-.|.|.++.... T Consensus 227 h~~~~~~~~~-------------------------------------------------------~~~~~G~Spi~~~~~ 251 (535) T protein:vir:10 227 FINYWNLSDT-------------------------------------------------------DRRGYGYSPVEASIP 251 (535) T ss_pred EEeccCCCCc-------------------------------------------------------ccccccccHHHHHHH Confidence 5432111000 001136666766666 Q ss_pred HHHHHHHHHHHHHHHHHHhccceeee--ecCC---CC--chhhHHHHHhh--------CcceecCCCCceeEEEecCCH- Q lcl|NC_011308. 280 IIDDYDLMNCFLSNNLQDMAEAIYVV--RGGT---NS--PVDEIKKNIQS--------KKIIQTKGEGGLDIQTVDIPY- 343 (530) Q Consensus 280 liDa~~~~~S~~~n~~~~~~~~~lvl--~g~~---~~--~~~~~~~~~~~--------~~~i~~~~~~~~~~lt~~~~~- 343 (530) .|.....+..-..+.+.-.+.|-.+| .+.. .+ ..+.++..+.. +++..+ .+++++|....++. T Consensus 252 ~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl-~~~g~~~~~l~~~~~ 330 (535) T protein:vir:10 252 LIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRRQWTSQGSGLGGAWKIPIL-AAKDAKFVNMTQNSR 330 (535) T ss_pred HHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCcccccccccc-cCCCceEEecCCChh Confidence 66555555544455555555554444 3321 11 11222222211 122222 33445666555433 Q ss_pred -HHHHHHHHHHHHHHHHHhcccC--CCccc---ccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_011308. 344 -EARKAKMDIDELNIYRSGMGFN--SSAVG---DGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGL 417 (530) Q Consensus 344 -~~~e~~ld~L~~~I~~~s~~p~--~~~~~---~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~ 417 (530) ..+....+...+.|...-.+|. ++... ++|.++....+--..++ .....++...|.-.++.|...++.+-. T Consensus 331 D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E---~~~~~~~~~~L~P~l~~ie~~ln~~Ll 407 (535) T protein:vir:10 331 DMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAK---AKLESSKDKGLTPLLSFIEQVINDKIM 407 (535) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHH---HHHHHHHHHHHHHHHHHHHHHHhhhcc Confidence 3333445566666666667774 22221 12222221111111111 222334444555555555555544332 Q ss_pred CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHHHHHHHHHHHH-----HHHHHHHhhh Q lcl|NC_011308. 418 GDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEETLKAICDTLDL-----DYEDVVKALE 490 (530) Q Consensus 418 ~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~e~~~~e~e~~-----e~~~~~~~~~ 490 (530) ..++ ..+.+.|+.-+..+..+.+++..... .|.++...+++++++ +++-+.-.-.+....- -.+...+... T Consensus 408 ~~~~-~~~~f~f~~l~~~d~~~r~~~~~~~~-~g~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~~~p~~~ 485 (535) T protein:vir:10 408 RYVD-TDYRFSFTLGDAQDKLQEEQVWKLKL-ANGYFINEYRKDHGLKTVDGLDVPGFIGSAENFINATGFGQPNVPDSS 485 (535) T ss_pred cccC-CeEEEEeccccccCHHHHHHHHHHHH-cCCCCHHHHHHHhCCCCCCCccccccccchhhcccccccccccCCCCC Confidence 3333 24778888878888877777665544 566899999888754 3221110000000000 0000000000 Q ss_pred ccccccCCcc------------ccCCCCCCCCCCccCcCCCCcccccccCC Q lcl|NC_011308. 491 DQEVEELEPT------------VTPIIDPLTIEPQPEPLNIDPVIEEEPVQ 529 (530) Q Consensus 491 ~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (530) +.......++ ..+.+++++.. .+++.+.++-..++-+- T Consensus 486 ~~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~-~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:10 486 DDSGSTLGERERQERIQHSKDYEKGKDDPKSPL-PKPSESDDVSNNEDADT 535 (535) T ss_pred CCccccCCccccCcccccccccccCCCCCCCCC-CcCCCCCccccccccCC Confidence 0000000000 00001111110 11111112111111111 No 144 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=97.33 E-value=9.2e-05 Score=42.72 Aligned_cols=404 Identities=13% Similarity=0.061 Sum_probs=164.7 Q ss_pred ccHHHHHHHHHHHHHH-hhhHHHHHHHHHHhcccchhhhccccccccc------------ccccccccCC--cceeecCc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIR-SQNVSLARVGQRYYNQDNDIENTRIMWMNDH------------GDIVEDDNAS--NIKISHGF 75 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~-~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~------------~~~~~~~~~~--n~ki~~n~ 75 (530) |.-..----++ .|++ ++..+.|....-+.+.+. |........ +.... ...+ -.|++. T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~lf~~~e~----R~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~al~~~~-- 72 (441) T protein:vir:79 1 MHWYNTDCYFV-DFKSRKQSRKELVVVGIFYKNEK----RDLQYNEDDLQMMVQTLPGFQGTKLR-QYKDIEAIRHSD-- 72 (441) T ss_pred CccccCccccc-cccccccchhhhhcccccccccc----ccccCCCcchHHHHHHhcccCccccc-ccchhhhhccHH-- Confidence 10000000000 0111 111111111111111110 000000000 00000 0000 001111 Q ss_pred hhhHHhhhhhhhcccceeeecCCc--chHHHHHHHHHHhhccH---HHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEE Q lcl|NC_011308. 76 FAELVDQKTQYLLANGIDVKPTDH--DDQKLCYLIEEYYNEEF---QSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQT 149 (530) Q Consensus 76 ~k~Ivd~~~~yl~G~pv~~~~~~~--~de~~~~~l~~~~~~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~ 149 (530) .-.-|+..++=+.+-|+.+.-... .+..+...|..- -|.. ......+......+|.||.++-++..|++ .+.. T Consensus 73 V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~-PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~ 151 (441) T protein:vir:79 73 IFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTR-PNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTF 151 (441) T ss_pred HHHHHHHHHHhhccCceeeecCccccccchHHHHHhcc-cCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEE Confidence 111345555555566776532111 111222222210 0222 23445677788899999999999988986 4788 Q ss_pred ecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeee Q lcl|NC_011308. 150 VDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAV 229 (530) Q Consensus 150 ~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (530) ++|..+.++.|+.+.+... ........ ......|..+.+.|++... T Consensus 152 i~~~~v~v~~d~~g~~~~~-----~~~~~~~~-----~~~~~~~~~~dvih~k~~~------------------------ 197 (441) T protein:vir:79 152 RKTSEIELKSDARGRLYYF-----HQRIDSNG-----NNIERNVKFEDMLDIKFYS------------------------ 197 (441) T ss_pred EcCceeEEEECCCccEEEE-----EEEeccCC-----ceeEEEEccccEEEeccCC------------------------ Confidence 9999999988876653221 11111100 0112234455555443110 Q ss_pred ecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeee--c Q lcl|NC_011308. 230 ADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVR--G 307 (530) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~--g 307 (530) + +.-.|.|-++.....|+....+..-..+.+.-.+.|-.+|+ | T Consensus 198 --------------------------~---------dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~ 242 (441) T protein:vir:79 198 --------------------------L---------DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKG 242 (441) T ss_pred --------------------------C---------CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCC Confidence 0 00136666665555555444444444444455555666654 4 Q ss_pred CCCCch--hhHHHHH----h----hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--CcccccCCc Q lcl|NC_011308. 308 GTNSPV--DEIKKNI----Q----SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGDGNAT 375 (530) Q Consensus 308 ~~~~~~--~~~~~~~----~----~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~S 375 (530) .-.++. +.++..+ . .++++.+++|.+++-++....+..+........+.|...-.+|.. +... ++.| T Consensus 243 ~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~s 321 (441) T protein:vir:79 243 VLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET-ANMS 321 (441) T ss_pred CCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC-CCcc Confidence 322211 1122211 1 234666776666666654444444455566777778888788742 2111 1112 Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcH Q lcl|NC_011308. 376 NVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQI 455 (530) Q Consensus 376 GvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~ 455 (530) ..... ..|...|.-.+..|..-++.+-..+.....+++.++.-+-.|..+.++....+..+|+++. T Consensus 322 ~~q~~--------------~~~~~tl~P~~~~ie~eln~kl~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~ 387 (441) T protein:vir:79 322 ITDAN--------------LDYLSTLKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNI 387 (441) T ss_pred HHHHH--------------HHHHHHHHHHHHHHHHHHhhhccccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCH Confidence 11111 1222344444444444444332222212234444444355678888888888999999999 Q ss_pred HHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCccccc Q lcl|NC_011308. 456 NNLLAIAPR--IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEE 525 (530) Q Consensus 456 et~l~~~~~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (530) ..+++.+++ +++.+...-.+..--. ..+.+...+.......+... .++...+ T Consensus 388 NE~R~~~gl~Pi~ggd~~~~~~~~n~~-~~~~~~~~~~~~~~~~~~~~-----------------kgGe~~e 441 (441) T protein:vir:79 388 DEIRQRDGLAPIPGGNGSIHRVDLNHV-NIELVDEYQMNKSRATDKKL-----------------KGGEENE 441 (441) T ss_pred HHHHHHhCCCCCCCCCcceEeeccccc-cccccccccccccccccccc-----------------CCCCCCC Confidence 999888754 4433321100000000 00000000000000000000 0111111 No 145 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=97.33 E-value=9.2e-05 Score=42.72 Aligned_cols=404 Identities=13% Similarity=0.061 Sum_probs=164.7 Q ss_pred ccHHHHHHHHHHHHHH-hhhHHHHHHHHHHhcccchhhhccccccccc------------ccccccccCC--cceeecCc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIR-SQNVSLARVGQRYYNQDNDIENTRIMWMNDH------------GDIVEDDNAS--NIKISHGF 75 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~-~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~------------~~~~~~~~~~--n~ki~~n~ 75 (530) |.-..----++ .|++ ++..+.|....-+.+.+. |........ +.... ...+ -.|++. T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~lf~~~e~----R~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~al~~~~-- 72 (441) T protein:vir:94 1 MHWYNTDCYFV-DFKSRKQSRKELVVVGIFYKNEK----RDLQYNEDDLQMMVQTLPGFQGTKLR-QYKDIEAIRHSD-- 72 (441) T ss_pred CccccCccccc-cccccccchhhhhcccccccccc----ccccCCCcchHHHHHHhcccCccccc-ccchhhhhccHH-- Confidence 10000000000 0111 111111111111111110 000000000 00000 0000 001111 Q ss_pred hhhHHhhhhhhhcccceeeecCCc--chHHHHHHHHHHhhccH---HHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEE Q lcl|NC_011308. 76 FAELVDQKTQYLLANGIDVKPTDH--DDQKLCYLIEEYYNEEF---QSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQT 149 (530) Q Consensus 76 ~k~Ivd~~~~yl~G~pv~~~~~~~--~de~~~~~l~~~~~~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~ 149 (530) .-.-|+..++=+.+-|+.+.-... .+..+...|..- -|.. ......+......+|.||.++-++..|++ .+.. T Consensus 73 V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~-PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~ 151 (441) T protein:vir:94 73 IFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTR-PNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTF 151 (441) T ss_pred HHHHHHHHHHhhccCceeeecCccccccchHHHHHhcc-cCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEE Confidence 111345555555566776532111 111222222210 0222 23445677788899999999999988986 4788 Q ss_pred ecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeee Q lcl|NC_011308. 150 VDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAV 229 (530) Q Consensus 150 ~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (530) ++|..+.++.|+.+.+... ........ ......|..+.+.|++... T Consensus 152 i~~~~v~v~~d~~g~~~~~-----~~~~~~~~-----~~~~~~~~~~dvih~k~~~------------------------ 197 (441) T protein:vir:94 152 RKTSEIELKSDARGRLYYF-----HQRIDSNG-----NNIERNVKFEDMLDIKFYS------------------------ 197 (441) T ss_pred EcCceeEEEECCCccEEEE-----EEEeccCC-----ceeEEEEccccEEEeccCC------------------------ Confidence 9999999988876653221 11111100 0112234455555443110 Q ss_pred ecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeee--c Q lcl|NC_011308. 230 ADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVR--G 307 (530) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~--g 307 (530) + +.-.|.|-++.....|+....+..-..+.+.-.+.|-.+|+ | T Consensus 198 --------------------------~---------dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~ 242 (441) T protein:vir:94 198 --------------------------L---------DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKG 242 (441) T ss_pred --------------------------C---------CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCC Confidence 0 00136666665555555444444444444455555666654 4 Q ss_pred CCCCch--hhHHHHH----h----hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--CcccccCCc Q lcl|NC_011308. 308 GTNSPV--DEIKKNI----Q----SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGDGNAT 375 (530) Q Consensus 308 ~~~~~~--~~~~~~~----~----~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~S 375 (530) .-.++. +.++..+ . .++++.+++|.+++-++....+..+........+.|...-.+|.. +... ++.| T Consensus 243 ~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~s 321 (441) T protein:vir:94 243 VLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET-ANMS 321 (441) T ss_pred CCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC-CCcc Confidence 322211 1122211 1 234666776666666654444444455566777778888788742 2111 1112 Q ss_pred HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcH Q lcl|NC_011308. 376 NVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQI 455 (530) Q Consensus 376 GvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~ 455 (530) ..... ..|...|.-.+..|..-++.+-..+.....+++.++.-+-.|..+.++....+..+|+++. T Consensus 322 ~~q~~--------------~~~~~tl~P~~~~ie~eln~kl~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~ 387 (441) T protein:vir:94 322 ITDAN--------------LDYLSTLKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNI 387 (441) T ss_pred HHHHH--------------HHHHHHHHHHHHHHHHHHhhhccccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCH Confidence 11111 1222344444444444444332222212234444444355678888888888999999999 Q ss_pred HHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCccccc Q lcl|NC_011308. 456 NNLLAIAPR--IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEE 525 (530) Q Consensus 456 et~l~~~~~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (530) ..+++.+++ +++.+...-.+..--. ..+.+...+.......+... .++...+ T Consensus 388 NE~R~~~gl~Pi~ggd~~~~~~~~n~~-~~~~~~~~~~~~~~~~~~~~-----------------kgGe~~e 441 (441) T protein:vir:94 388 DEIRQRDGLAPIPGGNGSIHRVDLNHV-NIELVDEYQMNKSRATDKKL-----------------KGGEENE 441 (441) T ss_pred HHHHHHhCCCCCCCCCcceEeeccccc-cccccccccccccccccccc-----------------CCCCCCC Confidence 999888754 4433321100000000 00000000000000000000 0111111 No 146 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=97.33 E-value=9.2e-05 Score=42.72 Aligned_cols=380 Identities=12% Similarity=0.028 Sum_probs=172.5 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHH--HHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLA--RVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLL 88 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~--~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~ 88 (530) |-+.+.+..+ |........+ ..+..+. |-. ...+.+-+.+.-....|+..++=+. T Consensus 1 MG~~~~~~~~---~~~~~~~~~~~~~~~~~~~-g~~-------------------~~~~~~al~~~~V~~~v~~Ia~~iA 57 (411) T protein:vir:81 1 MGWWSRLTRF---FRPRNETVDMTNPLLLQWL-GVD-------------------PDTPRNQLSEATYFACLKILSESLG 57 (411) T ss_pred CchHHHHHhh---ccCcccccccchHHHHHHh-cCc-------------------ccChhhhhccHHHHHHHHHHHHhHh Confidence 5444333322 2111100000 0000111 100 0001111212224446677777777 Q ss_pred ccceeeecCCc-c-----hHHHHHHHHHHhhc---cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEE Q lcl|NC_011308. 89 ANGIDVKPTDH-D-----DQKLCYLIEEYYNE---EFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPV 158 (530) Q Consensus 89 G~pv~~~~~~~-~-----de~~~~~l~~~~~~---~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v 158 (530) +-|+.+--..+ + +......|+.= -| ........+......+|.||.++-++. |.+ .+..++|..+-++ T Consensus 58 ~lp~~~~~~~~~~~~~~~~~~l~~lL~~~-PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~~l~~l~~~~v~~~ 135 (411) T protein:vir:81 58 KLPLKMYQKTERGIVKSDREELYNLLKLR-PNPYMTSSVFWSTVEMNRNHYGNAYVWCQYSG-PQLQALWILPSQYVTIV 135 (411) T ss_pred hCceeEEEecCCceeeecccHHHHHHhhc-cCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CceEEEEEECCceEEEE Confidence 77887632111 1 12223333210 02 123445667778889999999888874 554 4778999999888 Q ss_pred EcCCCCcee-EEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccce Q lcl|NC_011308. 159 FDDYGTLQR-IIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAI 237 (530) Q Consensus 159 ~d~~~~~~~-~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (530) .|+.+.... ...+|....... + ....+..+.+.|++.... T Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~----g----~~~~~~~~eiih~k~~~~------------------------------- 176 (411) T protein:vir:81 136 VDDRGLLGEKNAIWYRYNDPYD----G----KMYVFRNDEILHFKTSVT------------------------------- 176 (411) T ss_pred EcCcccccccceEEEEEEecCC----c----eEEEEccccEEEEcCCCC------------------------------- Confidence 876543211 111222111000 0 011234445554432110 Q ss_pred ecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC-Cch--h Q lcl|NC_011308. 238 LDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN-SPV--D 314 (530) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~-~~~--~ 314 (530) + +.-.|.|.+......|+....+..-..+.+.--+.|-.+|+.... +++ + T Consensus 177 ------------------~---------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~ 229 (411) T protein:vir:81 177 ------------------F---------DGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQEARD 229 (411) T ss_pred ------------------C---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHH Confidence 0 111466767766666666655555555555555557777655322 211 2 Q ss_pred hHHHHHh--------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--CcccccCCcHHHHHHHHh Q lcl|NC_011308. 315 EIKKNIQ--------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGDGNATNVVIKSRYT 384 (530) Q Consensus 315 ~~~~~~~--------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~SGvAik~~~~ 384 (530) .++..+. .++++.+++|.+++-++....+.......+...+.|...-.+|.. +...-|+-|... T Consensus 230 ~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e------ 303 (411) T protein:vir:81 230 RLVKGFEQFANGSKNAGKIIPVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAE------ 303 (411) T ss_pred HHHHHHHHHhcCccccCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHH------ Confidence 2222221 234566666666655554333334445567778888888888842 221112212110 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--cc-cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHh Q lcl|NC_011308. 385 LLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGD--YS-STDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAI 461 (530) Q Consensus 385 ~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~--~d-~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~ 461 (530) .....++...|+-.++.|..-++.+-... .. ...+++.++.-+-.|..+.++....+..+|+++...+++. T Consensus 304 ------~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~ 377 (411) T protein:vir:81 304 ------AQNLAFYVDTLLYVLKQYEEEITYKILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDY 377 (411) T ss_pred ------HHHHHHHHHHHHHHHHHHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 01223444555555555555444321111 11 1224444444456688888888889999999999999888 Q ss_pred CCCC--CCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccc Q lcl|NC_011308. 462 APRI--GDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTV 501 (530) Q Consensus 462 ~~~v--dd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~ 501 (530) +++- ++-+..+.... -.-.+ ...++..+ .+|. T Consensus 378 ~gl~p~~ggD~~~~~~n--~~pl~----~~~~~~~k--gGd~ 411 (411) T protein:vir:81 378 LDMPADDYGNNLMANGN--YIPLS----MLGANYGK--GGDS 411 (411) T ss_pred hCCCCCCCCCeeeeccC--ccchh----hhhhhhcc--CCCC Confidence 7652 21111100000 00000 00000000 0000 No 147 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=97.26 E-value=0.00011 Score=42.23 Aligned_cols=409 Identities=11% Similarity=0.024 Sum_probs=168.1 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhccc-ccccccccccccccCCcceeecCchhhHHhhhhhhhcc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRI-MWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLA 89 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~-~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G 89 (530) |. +-.++.+.++.. ....|.-..-......- ...............++.=+.+.-....|+..++=+.+ T Consensus 1 ~~--~~~~~~~~~~~~--------~~~~~~g~~~s~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~ 70 (437) T protein:vir:10 1 MK--QGKQRALGRIKS--------SFLKWLGVPISLTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIAT 70 (437) T ss_pred CC--cchhhhhhhhHH--------hhhhhcCCcccCCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhh Confidence 22 001111111111 01122111100000000 00000000000000000001111123345555565666 Q ss_pred cceeeec-CCcch------HHHHHHHHHHhhc--cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEE Q lcl|NC_011308. 90 NGIDVKP-TDHDD------QKLCYLIEEYYNE--EFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVF 159 (530) Q Consensus 90 ~pv~~~~-~~~~d------e~~~~~l~~~~~~--~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~ 159 (530) -|+.+-. ...+. ..+...|+.==+. ........+......+|.||.++-++. |.+ .+..++|..|-+.. T Consensus 71 lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-g~~~~L~~l~p~~v~i~~ 149 (437) T protein:vir:10 71 LPLNLYQTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSA-GVLIGLELMLPQRTTVKR 149 (437) T ss_pred CceeEEEEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcEEEEEEEcCcceEEEE Confidence 6776422 11111 1122222210011 223345567778889999999888874 765 46789999988877 Q ss_pred cCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceec Q lcl|NC_011308. 160 DDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILD 239 (530) Q Consensus 160 d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (530) +..+... |...... + ....+..+.+.|++... T Consensus 150 ~~~g~~~-----y~~~~~~-----g----~~~~~~~~dIih~r~~~---------------------------------- 181 (437) T protein:vir:10 150 LTSGALQ-----YTYRNVD-----G----TVSTLAEDDVFHVRGFS---------------------------------- 181 (437) T ss_pred CCCCeEE-----EEEEecC-----c----eEEEEccccEEEecCcC---------------------------------- Confidence 6654321 2111100 1 11234455555543110 Q ss_pred ccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC-C--chhhH Q lcl|NC_011308. 240 EGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN-S--PVDEI 316 (530) Q Consensus 240 ~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~-~--~~~~~ 316 (530) + +.-.|.|-++.+...|+....+..-..+.+.-.+.|-.+|+.... + ..+.+ T Consensus 182 ----------------~---------d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~ 236 (437) T protein:vir:10 182 ----------------L---------DGLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQILQKEKRAEI 236 (437) T ss_pred ----------------C---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHH Confidence 0 011466666655555555444444455555555556666654321 1 11222 Q ss_pred HHHHh--------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhH Q lcl|NC_011308. 317 KKNIQ--------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLL 386 (530) Q Consensus 317 ~~~~~--------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l 386 (530) +..+. .++++.+++|-+++-++....+..+....+.....|...-.+|. ++...-++..+..++- T Consensus 237 ~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~----- 311 (437) T protein:vir:10 237 RTDLAEQFGGAMQAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQ----- 311 (437) T ss_pred HHHHHHHhcCccccCcceeccCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHH----- Confidence 22222 23466666666555555443344445555677778888888874 2222212221222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC Q lcl|NC_011308. 387 AMKAQKTEIALRKTLRWTADLVVEDIRRRG--LGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR 464 (530) Q Consensus 387 ~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~--~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~ 464 (530) ....++...|.-.+..|...++.+- ..+.....+++.+..-+..|..+.++....+..+|+++...+++.+++ T Consensus 312 -----~~~~f~~~tl~P~~~~ie~~l~~kll~~~e~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl 386 (437) T protein:vir:10 312 -----QTLGFLTFTLRPWLTRIEQAARRSLLRPGERDQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKENL 386 (437) T ss_pred -----HHHHHHHHHHHHHHHHHHHHHHhhccCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 1122334444444444444444321 122222235555555567788888888889999999999999888754 Q ss_pred --CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccc Q lcl|NC_011308. 465 --IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEE 526 (530) Q Consensus 465 --vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (530) +++-...+ .+ ...+..+....+......++.. ....+..+++.+.++ +. T Consensus 387 ~pi~gg~~~~-~~-------~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~---e~ 437 (437) T protein:vir:10 387 PPMGGNAAVL-TV-------QSALLPIDKLGEHTTATAAQDA--LKAWLYQEEKTRATQ---ER 437 (437) T ss_pred CCCCCCcceE-ee-------cCcccchhhccCcCCCcchhcc--ccccCCCCCCCCccc---cC Confidence 22111100 00 0000001100000000000000 000000011111111 11 No 148 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=97.19 E-value=0.00014 Score=41.75 Aligned_cols=428 Identities=10% Similarity=-0.031 Sum_probs=172.2 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |- ++..+...+.. ....... .+.+-........-- .....+..+-. ..-.|++. .-..|+..+.=+.+- T Consensus 1 Mg---~~~~l~~r~~~-~~~~~~~-~~~~~~~~~~~~~~~--~~~~~g~~V~~--~~al~~~~--V~~~v~~Ia~~iA~l 69 (457) T protein:vir:13 1 MG---FWSALFGRGHS-PALDGIE-ARAWEPYDPSIYNLG--AVAASGETVTP--HDALQVSA--VFASVRLLSETIATL 69 (457) T ss_pred Cc---hhhhhhccccc-ccccccc-cccccccchHHHhhc--ccccCCceech--HHhhccHH--HHHHHHHHHHhhccC Confidence 22 22222111100 0000000 000000000000000 00000000000 00001111 223456666666677 Q ss_pred ceeeecCCcch--HHHHHHHHHHhh--cc---HHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCC Q lcl|NC_011308. 91 GIDVKPTDHDD--QKLCYLIEEYYN--EE---FQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDY 162 (530) Q Consensus 91 pv~~~~~~~~d--e~~~~~l~~~~~--~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~ 162 (530) |+++--...+. +.....+...++ ++ .......+......+|.||.++-.+ .|++ .+..++|..+-++.+.. T Consensus 70 p~~~~~~~~~~~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~ 148 (457) T protein:vir:13 70 PLSTYSKRGGSRKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMV 148 (457) T ss_pred ceEEEEecCCcccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecC Confidence 88753222111 111122333332 12 2245566777888999999888655 4554 56788898887765433 Q ss_pred C-CceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccc Q lcl|NC_011308. 163 G-TLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEG 241 (530) Q Consensus 163 ~-~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 241 (530) . .....+..|.... . ........|.++.+.|++.... T Consensus 149 ~~~~~~~~~~y~~~~---~----~~~~~~~~~~~~diih~~~~~~----------------------------------- 186 (457) T protein:vir:13 149 DGLRRKVFEAYDIDA---D----GNEVLLGWFTPRDVLHIPGMML----------------------------------- 186 (457) T ss_pred CCccceeEEEEEEec---C----CceeeEEeeCccceEEecCCCC----------------------------------- Confidence 2 2233333332211 0 1112233455666665532110 Q ss_pred ccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC-Cc--hhhHHH Q lcl|NC_011308. 242 VEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN-SP--VDEIKK 318 (530) Q Consensus 242 ~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~-~~--~~~~~~ 318 (530) - +.-.|.|-++.....|+....+..-..+.+.-.+.|-.+|+-... ++ .+.++. T Consensus 187 --------------~---------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~ 243 (457) T protein:vir:13 187 --------------P---------GDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMSEEGLARARE 243 (457) T ss_pred --------------C---------CccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCCHHHHHHHHH Confidence 0 001477777666666655554444444555555556666653221 11 122222 Q ss_pred HHh--------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHH Q lcl|NC_011308. 319 NIQ--------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAM 388 (530) Q Consensus 319 ~~~--------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ 388 (530) .+. .++++.+++|.+++-++....+..+....+.....|...-.+|. ++....++.+|..++-... T Consensus 244 ~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~---- 319 (457) T protein:vir:13 244 AWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNI---- 319 (457) T ss_pred HHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHH---- Confidence 211 24567777777777666554444444555667777877777875 3322222222222221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--cccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC-- Q lcl|NC_011308. 389 KAQKTEIALRKTLRWTADLVVEDIRRRGLGD--YSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR-- 464 (530) Q Consensus 389 ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~--~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~-- 464 (530) .++...|.-.++.|...++.+-..+ .....+++.++.-+-.|..+.++....+..+|+++...+++.+++ T Consensus 320 ------~f~~~tl~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~P 393 (457) T protein:vir:13 320 ------AFTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTP 393 (457) T ss_pred ------HHHHHHHHHHHHHHHHHHHHhhcCccccCceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Confidence 1223334334444444333322111 122234555555566688888888889999999999988888754 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 465 IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 465 vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) +++.....-.+-. ....-......+.+..+..........+...++...++.+ ...+.+.| T Consensus 394 i~~g~~d~~~~~~-n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~----~~~~~~~~ 454 (457) T protein:vir:13 394 LPDGLGEKYRVPL-NLGEVGEEPEPEPAPAPPAIEPPAEEPDEEPEPEGKPDDE----GATEEDDE 454 (457) T ss_pred CCCCcccceeecc-ccccccccccccccCCCCCCCCCccccCCCCCCCCCCccc----cCCCCccc Confidence 3332101000000 0000000000000000000000000000001111111111 11222222 No 149 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=97.17 E-value=0.00014 Score=41.67 Aligned_cols=390 Identities=8% Similarity=-0.028 Sum_probs=170.5 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |-+.++.++- ... .+. . +. .+...........|.. ..+.+-+...-....|+..+.=+.+- T Consensus 1 m~~~~~~~~~-----~~~--~~~----~-~~---~~~~~~~~~~~~~g~~----v~~~~al~~~~v~~~i~~ia~~ia~l 61 (419) T protein:vir:57 1 MFIPQFWKGR-----PSE--NRV----N-WQ---VVPGGMRSSSSQAGVI----ITPETALALSAVRACVTLLAESVAQL 61 (419) T ss_pred CcchhhhccC-----Ccc--ccc----c-cc---ccccccccccccCCce----echHHhhccHHHHHHHHHHHHhhccC Confidence 3222211110 000 000 0 00 0000000000000000 00111122222455666666667777 Q ss_pred ceee-ecCCcch------HHHHHHHHHHhhc--cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEc Q lcl|NC_011308. 91 GIDV-KPTDHDD------QKLCYLIEEYYNE--EFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFD 160 (530) Q Consensus 91 pv~~-~~~~~~d------e~~~~~l~~~~~~--~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d 160 (530) |+.+ .....+. ..+...|+.=-+. ........+..+...+|.||.++-++..|.+ ....++|..+.+..+ T Consensus 62 p~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~ 141 (419) T protein:vir:57 62 PCVLYRRTENGGREIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKG 141 (419) T ss_pred ceEEEEEcCCCceeccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEEC Confidence 8775 2221211 1122333210011 2234456777888999999999999999985 567889988887665 Q ss_pred CCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecc Q lcl|NC_011308. 161 DYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDE 240 (530) Q Consensus 161 ~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (530) ..+.+ +|.+.. .. .++..+.+.|++... T Consensus 142 ~~g~~-----~y~~~~------~~------~~~~~~~vih~r~~~----------------------------------- 169 (419) T protein:vir:57 142 PDGMP-----YYDIPS------IG------EILPMRMVHHIKSFS----------------------------------- 169 (419) T ss_pred CCceE-----EEEEcC------Cc------eEEchhhEEEecCcC----------------------------------- Confidence 54321 222100 00 123344444432100 Q ss_pred cccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecC---CCCch---- Q lcl|NC_011308. 241 GVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGG---TNSPV---- 313 (530) Q Consensus 241 ~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~---~~~~~---- 313 (530) + +.-.|.|-+......|+....+..-..+.+.-.+.|-.+|+-. ..... T Consensus 170 ---------------~---------d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~ 225 (419) T protein:vir:57 170 ---------------L---------DGYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAV 225 (419) T ss_pred ---------------C---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHH Confidence 0 1113667777666666654444433444444445565555421 11111 Q ss_pred hhHHHHHh--------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHH Q lcl|NC_011308. 314 DEIKKNIQ--------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRY 383 (530) Q Consensus 314 ~~~~~~~~--------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~ 383 (530) +.++..+. .++++.+++|-+++-++....+.......+...+.|...-.+|. ++...-|+-|++ + T Consensus 226 ~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~--e--- 300 (419) T protein:vir:57 226 DAILAKWTERYGGVRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNI--E--- 300 (419) T ss_pred HHHHHHHHHHhccccccccceecCCCceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccH--H--- Confidence 22222221 23566677666666555444444555666778888888888885 332222222221 1 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeC--CCCCCCHHHHHHHHHHHHhcCCCcHHHHHHh Q lcl|NC_011308. 384 TLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIE--PYILANELDLAMIDKTEAETNQIQINNLLAI 461 (530) Q Consensus 384 ~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~--~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~ 461 (530) .....++...|.-.++.|...++.+-...-......+.|. .-+-.|..+.++....+...|+++...+++. T Consensus 301 -------~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~ 373 (419) T protein:vir:57 301 -------HQGLQYVIYTMLAILKRHESAMMRDLLLPSERRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRM 373 (419) T ss_pred -------HHHHHHHHHHHHHHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 1112234455555555555444432211111123344443 4455688888888889999999999999988 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCccccccc Q lcl|NC_011308. 462 APRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEP 527 (530) Q Consensus 462 ~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (530) +++=.-+. -++....+..-. .+...+.++.+|++.|.. ++.+...- T Consensus 374 ~gl~p~~g------------gD~~~~~~n~~~-------~~~~~~~~~~~~~~~~~~-~~~~~~~~ 419 (419) T protein:vir:57 374 ENLTPIPG------------GDKYLTPLNMVD-------SKALTGIGKATPQQLKDI-EAILCTRN 419 (419) T ss_pred hCCCCCCC------------cCeeeecccccc-------ccccccccCCCcccCcch-hhhhhccC Confidence 76411110 001111111000 000011111112111111 11111111 No 150 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=97.15 E-value=0.00015 Score=41.53 Aligned_cols=387 Identities=11% Similarity=0.061 Sum_probs=157.2 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccc-hhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhcc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDN-DIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLA 89 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~-~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G 89 (530) |..+.++.++ + .. .+.++....- .... ............ ..+.=+.+.-...-|+..++=+.. T Consensus 1 ~~~~~~~~~~----k-----~~--~~~~~~~~~~~~~~~-~~~~~~~~~~~v----~~~~a~~~~~v~~~i~~Ia~~ia~ 64 (409) T protein:vir:94 1 MAKENIVTRI----K-----KK--LIDNWIDQSASKLYD-FSPWKNKSFWGV----INNTLETNETIFSAITKLSNSMAS 64 (409) T ss_pred Ccccccchhh----h-----hH--HhhhhhcCCcccccc-cccccCcccccc----chhhhhccHHHHHHHHHHHHhhhh Confidence 3333332221 0 00 0111111000 0000 000000000000 000000111123334555555556 Q ss_pred cceeeec-CCcchHHHHHHHHHHhhccH---HHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCC Q lcl|NC_011308. 90 NGIDVKP-TDHDDQKLCYLIEEYYNEEF---QSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGT 164 (530) Q Consensus 90 ~pv~~~~-~~~~de~~~~~l~~~~~~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~ 164 (530) -|+++-- .+..+..+...|+.== |.. ......+......+|.||.++.++..|++ .+..++|..+-++.++.+. T Consensus 65 lp~~~~~~~~~~~~~~~~lL~~~P-N~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~ 143 (409) T protein:vir:94 65 LPLKMYEDYKVVNTEVSDLLTVSP-NNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSR 143 (409) T ss_pred CceeEeecccccchhHHHHHhhhc-ccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEeCCCc Confidence 6776522 2222333333343210 221 23345667788999999999999988885 5778899998877765432 Q ss_pred ceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccc Q lcl|NC_011308. 165 LQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEE 244 (530) Q Consensus 165 ~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (530) .. +|...... +.. ..+..+.+.|++... T Consensus 144 ~~----~y~~~~~~-----g~~----~~~~~~dvih~r~~~--------------------------------------- 171 (409) T protein:vir:94 144 EL----YYSIHAAT-----GNK----LIVHNMDMLHFKHIV--------------------------------------- 171 (409) T ss_pred EE----EEEEEcCC-----ceE----EEEccccEEEecCCC--------------------------------------- Confidence 11 12211100 110 123444555543110 Q ss_pred cccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcc-ceeee-ecCCCCch--hhHHHHH Q lcl|NC_011308. 245 HEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAE-AIYVV-RGGTNSPV--DEIKKNI 320 (530) Q Consensus 245 ~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~-~~lvl-~g~~~~~~--~~~~~~~ 320 (530) ++ +.-.|.|.+......|+....+..- .+..+.. +-+++ .+...++. ..++..+ T Consensus 172 ----------~~---------~~~~G~s~l~~~~~~i~~~~~~~~~---~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~ 229 (409) T protein:vir:94 172 ----------AS---------NMVQGISPIDVLKNTTDFDNAVRTF---NLTEMQKPDSFMLKYGSNVGKEKRQQVLEDF 229 (409) T ss_pred ----------CC---------CccccccHHHHHHHHHHHHHHHHHH---HHHhcCCCCeeEEecCCCCCHHHHHHHHHHH Confidence 00 0113666666555555533322211 2333333 22333 33332221 1111111 Q ss_pred -----hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--CcccccCCcHHHHHHHHhhHHHHHHHH Q lcl|NC_011308. 321 -----QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGDGNATNVVIKSRYTLLAMKAQKT 393 (530) Q Consensus 321 -----~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~SGvAik~~~~~l~~ka~~k 393 (530) ..++++.+++|-+++-++....+.......+...+.|...-++|.. +.. ++.++..++ ... T Consensus 230 ~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~--~~~~~sn~e----------~~~ 297 (409) T protein:vir:94 230 KQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNAR--SNTNFAKNE----------ELN 297 (409) T ss_pred HHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC--CCCCcccHH----------HHH Confidence 2234666666656555544333334444556667778887788742 221 112211111 111 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcccc-ceeeEEeC--CCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCH Q lcl|NC_011308. 394 EIALRKTLRWTADLVVEDIRRRGLGDYSS-TDIKFDIE--PYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDE 468 (530) Q Consensus 394 e~~f~~~l~~~~~~i~~~l~~~~~~~~d~-~~i~i~f~--~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~ 468 (530) ..++...|.-+++.|...++.+-....+. ....+.|. .-+-.|..+.++....+..+|+++.-.+++.+++ +++- T Consensus 298 ~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gg 377 (409) T protein:vir:94 298 RFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGG 377 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc Confidence 22333445445555544444332211111 12334443 4445677888888889999999999988888754 2221 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCC Q lcl|NC_011308. 469 ETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEP 512 (530) Q Consensus 469 ~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (530) +.-+-. ..+..+.... ..+....+.. .++++. T Consensus 378 D~~~~~---------~n~~~~~~~~--~~~~~~kGG~-~n~~e~ 409 (409) T protein:vir:94 378 DKPLIS---------GDLYPIDTPL--ELRKSLKGGD-KNVNES 409 (409) T ss_pred CeEeec---------ccccccccch--hhcccccCCC-CCcCCC Confidence 111000 0000000000 0000000000 000000 No 151 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=97.15 E-value=0.00015 Score=41.51 Aligned_cols=415 Identities=11% Similarity=0.029 Sum_probs=165.1 Q ss_pred HHH---HHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhcccce Q lcl|NC_011308. 16 ILS---TKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLANGI 92 (530) Q Consensus 16 ~i~---~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~pv 92 (530) ++. +.+..-...++.. ...+-|-+.-..-.+..... ......-..++.....|+..+.=+.+-|+ T Consensus 1 ~~~~~~~~~~~p~~~~~~~---~~~~~~~~~~~~g~~~~~~~---------~~~~~~~~~~~~V~acV~~IA~~iA~lp~ 68 (518) T protein:vir:78 1 MLLANGQTLSAPAMAELSP---QMQDSYYYAPAVGMQLERQF---------SLYGGIYKNQPWVRTVIAKRAQALARLPV 68 (518) T ss_pred CcccCceeeccchhhhhhh---hhhhcccccceeceeccccc---------chhhHHhhhhHHHHHHHHHHHHhhccCce Confidence 000 0000000000001 11111111000000000000 00000000112334456666666667777 Q ss_pred eeecCCc-ch-HHHHHHHHHHhh--ccH---HHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCC Q lcl|NC_011308. 93 DVKPTDH-DD-QKLCYLIEEYYN--EEF---QSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGT 164 (530) Q Consensus 93 ~~~~~~~-~d-e~~~~~l~~~~~--~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~ 164 (530) .+--.+. .. +.....+..++. |.. ......+......+|.+|.++-++..|++ .+..++|..|-+..+.... T Consensus 69 ~l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~~~~~~ 148 (518) T protein:vir:78 69 KCMFTSGDTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTG 148 (518) T ss_pred EEEEEcCCccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEEcCCCC Confidence 7522111 11 111112222322 222 23345667788899999999999999886 4778899988887765332 Q ss_pred ceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccc Q lcl|NC_011308. 165 LQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEE 244 (530) Q Consensus 165 ~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (530) .. .|....... .. .....+..+.+.|++.... T Consensus 149 ~~----~y~~~~~~~---~~---~~~~~~~~~eIiHir~~~~-------------------------------------- 180 (518) T protein:vir:78 149 RY----EYYFQAGAG---VG---TQLVSFADDEVVPIRFFNP-------------------------------------- 180 (518) T ss_pred EE----EEEEEecCC---cc---ceeEEecCCcEEEecCCCC-------------------------------------- Confidence 11 111111000 00 1111234455554431110 Q ss_pred cccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC-Cc--hhhHHHHH- Q lcl|NC_011308. 245 HEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN-SP--VDEIKKNI- 320 (530) Q Consensus 245 ~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~-~~--~~~~~~~~- 320 (530) . +...|.|-+......|.....+..-.++.+.-.+.|-.+|+.... ++ ...++..+ T Consensus 181 -----------d---------g~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~ls~e~~~~~k~~~~ 240 (518) T protein:vir:78 181 -----------D---------GLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSPEAQQRLREQFD 240 (518) T ss_pred -----------C---------cccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHH Confidence 0 001356666555555544444444444444445556666654221 21 12222222 Q ss_pred -------hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHH Q lcl|NC_011308. 321 -------QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQ 391 (530) Q Consensus 321 -------~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~ 391 (530) ..++++.+++|.+++-++....+..+..........|...-.+|. ++...-|+-|++. - T Consensus 241 ~~~~G~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e--~---------- 308 (518) T protein:vir:78 241 RAHAGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNIS--A---------- 308 (518) T ss_pred HHhcCcccCCceeEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHH--H---------- Confidence 123466677776666665443333344445566677777777773 2221112222211 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCccc-cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCH Q lcl|NC_011308. 392 KTEIALRKTLRWTADLVVEDIRRRGLGDYS-STDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDE 468 (530) Q Consensus 392 ~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d-~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~ 468 (530) ....++...|+-++..|..-++.+-...+. ...+++..+.-+..|..+.++....+..+|+++...+++.+++ ++++ T Consensus 309 ~~~~f~~~tL~P~~~~ie~eln~~L~~~~~~~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~ 388 (518) T protein:vir:78 309 QMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDP 388 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 112223333433333333333322111111 1134444445566788888889999999999999999888754 4443 Q ss_pred HHHHHHHHHHHHHHHHHHHhh---hccccccCCccc------cCCCCCCCCCCccCcCCCCcccccc-----------cC Q lcl|NC_011308. 469 ETLKAICDTLDLDYEDVVKAL---EDQEVEELEPTV------TPIIDPLTIEPQPEPLNIDPVIEEE-----------PV 528 (530) Q Consensus 469 ~~e~~~~e~e~~e~~~~~~~~---~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~ 528 (530) -...-.+. ..+..+ ..+...+.+... .+..+.+ +.+.+.+....+.+..+ |. T Consensus 389 ~gD~~~v~-------~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (518) T protein:vir:78 389 KADELYAN-------SALQPLGATPDGAVEGEEAPAPKRPASTPVASLD-QSPPASVPGLSPTNSDRSTDSGKTEPRRLM 460 (518) T ss_pred CCceeeec-------ccceecccccccccCCCCCCCCCCCCcccccccc-cCccccCCCCCcccccccccccccchhccc Confidence 21100000 000110 111100000000 0000000 00001111111111111 22 Q ss_pred CC Q lcl|NC_011308. 529 QE 530 (530) Q Consensus 529 ~~ 530 (530) ++ T Consensus 461 ~~ 462 (518) T protein:vir:78 461 QK 462 (518) T ss_pred CC Confidence 22 No 152 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=97.13 E-value=0.00016 Score=41.44 Aligned_cols=262 Identities=11% Similarity=0.066 Sum_probs=130.3 Q ss_pred hcccceeeecC-CcchHHHHHHHHHHhh--ccHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCC Q lcl|NC_011308. 87 LLANGIDVKPT-DHDDQKLCYLIEEYYN--EEFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDY 162 (530) Q Consensus 87 l~G~pv~~~~~-~~~de~~~~~l~~~~~--~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~ 162 (530) +.+-|+.+... ...+......|+.--+ ....+....+..+...+|.||.++-.+.+|.+ .+..++|..|-+..++. T Consensus 1 ia~l~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~v~v~~~~~ 80 (278) T protein:vir:78 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (278) T ss_pred CccceeEEEecCcccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCceeEEEEcCC Confidence 55666654322 2223333333332101 12335567788899999999999988999975 57788999988776654 Q ss_pred CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccc Q lcl|NC_011308. 163 GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGV 242 (530) Q Consensus 163 ~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (530) +... +|.+.. . .+. ...+..+.+.|++.... T Consensus 81 ~~~~----~y~~~~---~--~g~----~~~~~~~evih~~~~~~------------------------------------ 111 (278) T protein:vir:78 81 SREL----YYSIHA---A--TGN----KLIVHNMDMLHFKHIVA------------------------------------ 111 (278) T ss_pred CceE----EEEEEc---C--Cce----EEEEccccEEEECCCCC------------------------------------ Confidence 4321 122111 0 011 11344555555432110 Q ss_pred cccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcc-ceeeeec-CCCCc--hhhHHH Q lcl|NC_011308. 243 EEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAE-AIYVVRG-GTNSP--VDEIKK 318 (530) Q Consensus 243 ~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~-~~lvl~g-~~~~~--~~~~~~ 318 (530) + +.-.|.|.+..+...++....+... .+..+.. |-.+++. ...++ ...++. T Consensus 112 -------------~---------~~~~G~s~~~~~~~~i~~~~~~~~~---~~~~~~~~~~~i~~~~~~l~~e~~~~~~~ 166 (278) T protein:vir:78 112 -------------S---------NMVQGISPIDVLKNTTDFDNAVRTF---NLTEMQKPDSFMLKYGSNVGKEKRQQVLE 166 (278) T ss_pred -------------C---------CCeeeccHHHHHHHHHHHHHHHHHH---HHHHhcCCCcEEEEeCCCCCHHHHHHHHH Confidence 0 0113667666666655544433222 2344443 3333332 22221 111111 Q ss_pred H----H-hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--CcccccCCcHHHHHHHHhhHHHHHH Q lcl|NC_011308. 319 N----I-QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGDGNATNVVIKSRYTLLAMKAQ 391 (530) Q Consensus 319 ~----~-~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~SGvAik~~~~~l~~ka~ 391 (530) . . ..++++.+++|.+++-++....+.......+...+.|...-++|.. +....++-|.. .. T Consensus 167 ~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~------------~~ 234 (278) T protein:vir:78 167 DFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKN------------EE 234 (278) T ss_pred HHHHHhccCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccH------------HH Confidence 1 1 2345666776666666665545555556677888888888888842 22211222221 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcccc-ceeeEEeCCCCC Q lcl|NC_011308. 392 KTEIALRKTLRWTADLVVEDIRRRGLGDYSS-TDIKFDIEPYIL 434 (530) Q Consensus 392 ~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~-~~i~i~f~~~~P 434 (530) .....++..|.-+++.|...++.+-..+.+. ....+.|+.+.- T Consensus 235 ~~~~~~~~~l~P~~~~i~~~ln~~L~~~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 235 LNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKIGILNLTLNLI 278 (278) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCChhHhcCCceEEEecccC Confidence 1234455556666666666655543222221 234567765544 No 153 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=97.10 E-value=0.00017 Score=41.25 Aligned_cols=427 Identities=8% Similarity=0.016 Sum_probs=160.0 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) +.-....+.++...+-+-..+. ..-+ |.+++..-+ ..+....+ ..-..++....| T Consensus 49 ~~~~~~~d~~~~~~~r~g~~~~--------------~~~~-g~~~~~epp-~d~~~l~~---------l~~~np~V~~aI 103 (648) T protein:vir:79 49 GGGSAKRDPKMSLVKRIGLAIM--------------DGGG-GGRDFEEPE-FDFNEITS---------AYNTEGYVRQAV 103 (648) T ss_pred ccccccccchhHHHHHhHHHHH--------------hhcC-CccccccCC-cCHHHHHH---------HHhcChHHHHHH Confidence 2233333333322211111110 0001 222221110 00000000 000134455567 Q ss_pred hhhhhhhcccceeeecCCcch-HHHHHHHHHHhh-c---cHHHHHHHHHHHHhhcCeEEEEEEecCCCce---------- Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDHDD-QKLCYLIEEYYN-E---EFQSAIQELVEGSTIKGYEGIFARTTSEDKL---------- 145 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~~d-e~~~~~l~~~~~-~---~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~---------- 145 (530) +..+.-+.+-|..+...+... +.....+. ... + +.......+..+...+|.||..+-++.+|.. T Consensus 104 ~iia~~ia~l~~~i~~~~~~~~~~~~~~~l-l~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~ 182 (648) T protein:vir:79 104 DKYIEMMFKADWDFVSKNPNAVEYIRMRFT-LMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVG 182 (648) T ss_pred HHHHHHHhhCcceEEecCCccchhhHHHHH-hhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhc Confidence 777777777787775544321 11111111 111 1 2334566778888899999998888887732 Q ss_pred ------EEEEecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhcccccc Q lcl|NC_011308. 146 ------TFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVN 219 (530) Q Consensus 146 ------~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~ 219 (530) ....++|..+-+..++.+... .|..... ... ....|.++.+.|++... T Consensus 183 ~~~~v~~l~pl~p~~v~v~~d~~g~~~----~Y~y~~~----g~~----~~~~~~~~dIIHik~~~-------------- 236 (648) T protein:vir:79 183 DSMPVAGYFPLNLASMKVKRDKFGMIK----GWQQEQE----GQD----KPQKFKPEDIVHIYYKR-------------- 236 (648) T ss_pred cccceeeeEeecCceeEEEEcCCCcee----eeEEEec----CCc----eeEEecCccEEEEccCC-------------- Confidence 112344444444444332111 1110000 000 00012222333221100 Q ss_pred ccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_011308. 220 PNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMA 299 (530) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~ 299 (530) + .+.-.|.|.+......|+....+....++.+.-.+ T Consensus 237 -----------------------------------~---------~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa 272 (648) T protein:vir:79 237 -----------------------------------E---------KGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNL 272 (648) T ss_pred -----------------------------------C---------CCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 0 01124777777666666655555555555566666 Q ss_pred cceeeeec-CCCCchh---hHHHHHh-hCcceecC-CCCceeEEEecC--C--HHHHHHHHHHHHHHHHHHhcccCC--C Q lcl|NC_011308. 300 EAIYVVRG-GTNSPVD---EIKKNIQ-SKKIIQTK-GEGGLDIQTVDI--P--YEARKAKMDIDELNIYRSGMGFNS--S 367 (530) Q Consensus 300 ~~~lvl~g-~~~~~~~---~~~~~~~-~~~~i~~~-~~~~~~~lt~~~--~--~~~~e~~ld~L~~~I~~~s~~p~~--~ 367 (530) .|-.+++- .+....+ .....+. ..+-..+. .+.+.+.+..+. . +..+....+...+.|...-.+|.. + T Consensus 273 ~P~gil~~~~~~~~~e~~k~~~e~~~~~~~~~~i~gg~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG 352 (648) T protein:vir:79 273 HPLWHVKVGLEQEGFGAEEGEVDLVRGEVENMDVEGGMVTTERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVSELMMG 352 (648) T ss_pred CccEEEEeCCCccchHHHHHHHHHHHHhcccccccccccccceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCCHhHcc Confidence 67666652 1111111 1111111 11111222 222333333221 1 112344556777888888888852 2 Q ss_pred cccccCC-cHHHHHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHH Q lcl|NC_011308. 368 AVGDGNA-TNVVIKSRYTLL-AMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDK 445 (530) Q Consensus 368 ~~~~gn~-SGvAik~~~~~l-~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~ 445 (530) ....++- .+.+....|... ..-+......+...+.+.+ ++...+...... + ..+.+.|+.-+..|....++... T Consensus 353 ~~~~ss~stae~~~~~~~~~i~~l~~~i~~~le~~~~~~l-l~e~~l~~~l~~--d-~~ieF~~~~Llr~D~~~~a~~~~ 428 (648) T protein:vir:79 353 RGGTASRSTGDNLSSDFKDRIKALQKVMATFINEFMVKEI-LMEGGFDPVLNP--D-DKVEFRFNEIDMDSKIKLENQAV 428 (648) T ss_pred cCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hhhhhccccccc--c-ceEEEeecccchhhHHHHHHHHH Confidence 2222222 223332222211 1111111111111111110 000000000001 1 23677787766677777778778 Q ss_pred HHHhcCCCcHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccccc---cCCccccCCCCCCCCCCccCcCCCC Q lcl|NC_011308. 446 TEAETNQIQINNLLAIAPR--IGDEETLKAICDTLDLDYEDVVKALEDQEVE---ELEPTVTPIIDPLTIEPQPEPLNID 520 (530) Q Consensus 446 ~~~~~g~iS~et~l~~~~~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (530) .+..+|++|...+++.+++ +++..... .+....-........-..+..+ .....+.... ..+...+..|.|.+ T Consensus 429 ~l~~~GilT~NEaR~~lGlpPi~~g~~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~eg~-~~e~~~~~~~~~~~ 506 (648) T protein:vir:79 429 FLYEHNAISEDEMRELIGRDPVDDGEGRA-KMHLQMVTIAQATALAALAPTPAGGSSASASGDKK-KKATDNKTKPTNQH 506 (648) T ss_pred HHHhCCCcCHHHHHHHhCCCCCCCCCCcc-ccccccccchhccccccCCCCCCCCCCCCcccccc-ccccCCCCCCCCCC Confidence 8889999999999998754 33221100 0000000000000000000000 0000000000 01111111122222 Q ss_pred cccccccCCC Q lcl|NC_011308. 521 PVIEEEPVQE 530 (530) Q Consensus 521 ~~~~~~~~~~ 530 (530) +. .+.|--+ T Consensus 507 g~-~~~~~~~ 515 (648) T protein:vir:79 507 GT-KTSPKKQ 515 (648) T ss_pred Cc-CCCCccc Confidence 21 2222111 No 154 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=97.07 E-value=0.00018 Score=41.08 Aligned_cols=389 Identities=11% Similarity=0.050 Sum_probs=161.8 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |..+.++.+.-..+.++-....- ..... ........+... ..+.=+.+.-...-|+..++=+.+- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~-~~~~~~~~~~~v----~~~~~~~~~~V~~ci~~Ia~~ia~l 65 (409) T protein:vir:93 1 MAKENIVTRIKKKLIDNWIDQST----------SKLYD-FSPWKNRSFWGV----INNTLETNETIFSAITKLSNSMASL 65 (409) T ss_pred CCccchhhhhhhhhhhhhhcccc----------ccccc-cccccCcccccc----chhhhhccHHHHHHHHHHHHhhhhC Confidence 55555544432222211000000 00000 000000000000 0000011122233455555555566 Q ss_pred ceeeecC-CcchHHHHHHHHHHhhcc--HHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCce Q lcl|NC_011308. 91 GIDVKPT-DHDDQKLCYLIEEYYNEE--FQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQ 166 (530) Q Consensus 91 pv~~~~~-~~~de~~~~~l~~~~~~~--~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~ 166 (530) |+++.-. +..+......|..==+.. -......+..+...+|.||.++-++..|.+ .+..++|..+-++.++.+... T Consensus 66 p~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~~~~~~~~~~~ 145 (409) T protein:vir:93 66 PLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSREL 145 (409) T ss_pred ceeEeeccccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEE Confidence 7765322 222333333333210111 223346677788899999999999988875 577889988877665433211 Q ss_pred eEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccc Q lcl|NC_011308. 167 RIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHE 246 (530) Q Consensus 167 ~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 246 (530) +|.+... .+.. ..+.++.+.|++.... T Consensus 146 ----~y~~~~~-----~g~~----~~~~~~eVih~r~~~~---------------------------------------- 172 (409) T protein:vir:93 146 ----YYSIHAA-----TGNK----LIVHNMDMLHFKHIVA---------------------------------------- 172 (409) T ss_pred ----EEEEEcC-----CceE----EEEccccEEEeCCCCC---------------------------------------- Confidence 1221110 0111 1244555555532110 Q ss_pred cccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccc-eeee-ecCCCCch--hhHHHHH-- Q lcl|NC_011308. 247 GRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEA-IYVV-RGGTNSPV--DEIKKNI-- 320 (530) Q Consensus 247 ~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~-~lvl-~g~~~~~~--~~~~~~~-- 320 (530) + +.-.|.|.++.+...|+....+.. -.+..+..+ -+++ .+...++. ..++..+ T Consensus 173 ---------~---------~~~~G~s~i~~~~~~i~~~~~~~~---~~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~ 231 (409) T protein:vir:93 173 ---------S---------NMVQGISPIDVLKNTTDFDNAVRT---FNLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQ 231 (409) T ss_pred ---------C---------CccccccHHHHHHHHHHHHHHHHH---HHHHhcCCCCceEEecCCCCCHHHHHHHHHHHHH Confidence 0 001366666655444443322211 123333332 2233 33333221 1111111 Q ss_pred ---hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--CcccccCCcHHHHHHHHhhHHHHHHHHHH Q lcl|NC_011308. 321 ---QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGDGNATNVVIKSRYTLLAMKAQKTEI 395 (530) Q Consensus 321 ---~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~SGvAik~~~~~l~~ka~~ke~ 395 (530) ..++++.+++|.+++-++....+.......+.....|...-.+|.. +....++-|+. +-. ... T Consensus 232 ~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~--e~~----------~~~ 299 (409) T protein:vir:93 232 YYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKN--EEL----------NRF 299 (409) T ss_pred HhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccH--HHH----------HHH Confidence 1334566666655555543333334444556677788888888842 22111121221 111 123 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcccc-ceeeEEeC--CCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHH Q lcl|NC_011308. 396 ALRKTLRWTADLVVEDIRRRGLGDYSS-TDIKFDIE--PYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEET 470 (530) Q Consensus 396 ~f~~~l~~~~~~i~~~l~~~~~~~~d~-~~i~i~f~--~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~ 470 (530) ++...|+-+++.|...++.+-..+.+. ....+.|. .-+-.|..+.++...++..+|+++.-.+++.+++ +++-+. T Consensus 300 f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~ 379 (409) T protein:vir:93 300 YLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDK 379 (409) T ss_pred HHHHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe Confidence 334445555555554444332211111 12344543 3344678888888899999999999999888764 222111 Q ss_pred HHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCC Q lcl|NC_011308. 471 LKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEP 512 (530) Q Consensus 471 e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (530) -+- ...+..+..... ......+.. .+.++. T Consensus 380 ~~~---------~~n~~~~~~~~~--~~~~~~gG~-~n~~e~ 409 (409) T protein:vir:93 380 PLI---------SGDLYPIDTPLE--LRKSLKGGD-KNVNES 409 (409) T ss_pred eee---------cccccccccchh--hcccccCCC-CCcCCC Confidence 000 000000000000 000000000 000000 No 155 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=97.05 E-value=0.00019 Score=40.98 Aligned_cols=388 Identities=12% Similarity=0.066 Sum_probs=159.2 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccc-hhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhcc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDN-DIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLA 89 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~-~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G 89 (530) |..+.++.++=+.+ +.++-...- .... ............ ..+.=+...-...-|+..+.=+.+ T Consensus 1 ~~~~~~~~~~k~~~-----------~~~~~~~~~~~~~~-~~~~~~~~~~~v----~~~~a~~~~~V~~ci~~ia~~ia~ 64 (409) T protein:vir:96 1 MAKENIVTRIKKKL-----------IDNWIDQSASKLYD-FSPWKNKSFWGV----INNTLETNETIFSAITKLSNSMAS 64 (409) T ss_pred CccccchhhhhhHH-----------hhhhhccccccccc-cccccCcccccc----chhhHhhhHHHHHHHHHHHHhhhh Confidence 44444433321111 111110000 0000 000000000000 000001111122334444455555 Q ss_pred cceeeec-CCcchHHHHHHHHHHhhcc--HHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCc Q lcl|NC_011308. 90 NGIDVKP-TDHDDQKLCYLIEEYYNEE--FQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTL 165 (530) Q Consensus 90 ~pv~~~~-~~~~de~~~~~l~~~~~~~--~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~ 165 (530) -|+++-- .+..+......|+.==+.. -......+..+...+|.||.++-++..|++ .+..++|..+-++.++.... T Consensus 65 lp~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~ 144 (409) T protein:vir:96 65 LPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRE 144 (409) T ss_pred CceEEeecccccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcE Confidence 6766422 2222333333333210111 123345677788999999999988888875 46778998888776654321 Q ss_pred eeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccccc Q lcl|NC_011308. 166 QRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEH 245 (530) Q Consensus 166 ~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 245 (530) . +|.+.... +. ...|.++.+.|++... T Consensus 145 ~----~y~~~~~~-----g~----~~~~~~~evih~r~~~---------------------------------------- 171 (409) T protein:vir:96 145 L----YYSIHAAT-----GN----KLIVHNMDMLHFKHIV---------------------------------------- 171 (409) T ss_pred E----EEEEEcCC-----ce----EEEEccccEEEeCCCC---------------------------------------- Confidence 1 12111100 00 1124445555543110 Q ss_pred ccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccc-eeee-ecCCCCc--hhhHHHHH- Q lcl|NC_011308. 246 EGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEA-IYVV-RGGTNSP--VDEIKKNI- 320 (530) Q Consensus 246 ~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~-~lvl-~g~~~~~--~~~~~~~~- 320 (530) ++ +.-.|+|-+......++....+. .. .+..+..+ -+++ .+...++ .+.++..+ T Consensus 172 ---------~~---------~~~~G~s~l~~~~~~i~~~~~~~-~~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~ 230 (409) T protein:vir:96 172 ---------AS---------NMVQGISPIDVLKNTTDFDNAVR-TF--NLTEMQKPDSFMLKYGSNVSTEKRQQVLEDFK 230 (409) T ss_pred ---------CC---------CccccccHHHHHHHHHHHHHHHH-HH--HHHhcCCCceeEEecCCCCCHHHHHHHHHHHH Confidence 00 11136666665555554332221 11 23333332 2233 3333322 12222221 Q ss_pred ----hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--CcccccCCcHHHHHHHHhhHHHHHHHHH Q lcl|NC_011308. 321 ----QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGDGNATNVVIKSRYTLLAMKAQKTE 394 (530) Q Consensus 321 ----~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~SGvAik~~~~~l~~ka~~ke 394 (530) ..++++.+++|-+++-++....+.......+...+.|...-.+|.. +....++-|+ ++ .... T Consensus 231 ~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~--~e----------~~~~ 298 (409) T protein:vir:96 231 QYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAK--NE----------ELNR 298 (409) T ss_pred HHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HH----------HHHH Confidence 1234666666666666654434444445566677888888888842 2211111111 11 1112 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcccc-ceeeEEeC--CCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHH Q lcl|NC_011308. 395 IALRKTLRWTADLVVEDIRRRGLGDYSS-TDIKFDIE--PYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEE 469 (530) Q Consensus 395 ~~f~~~l~~~~~~i~~~l~~~~~~~~d~-~~i~i~f~--~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~ 469 (530) .++...|.-++..|...++.+-....+. ....+.|. .-+-.|..+.++....+..+|+++.-.+++.+++ +++-+ T Consensus 299 ~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~ggD 378 (409) T protein:vir:96 299 FYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGD 378 (409) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcc Confidence 3334444444444444444332211111 12344443 3344577888888889999999999999888753 22211 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCC Q lcl|NC_011308. 470 TLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEP 512 (530) Q Consensus 470 ~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (530) .-+- ...+..+.... ..+....+.. .++++. T Consensus 379 ~~~~---------~~n~~~~~~~~--~~~~~~~gG~-~n~~e~ 409 (409) T protein:vir:96 379 KPLI---------SGDLYPIDTPL--ELRKSLKGGD-KNVNES 409 (409) T ss_pred eeee---------cccccccccch--hhcccccCCC-CCcCCC Confidence 1110 00000000000 0000001100 011111 No 156 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=96.98 E-value=0.00023 Score=40.56 Aligned_cols=420 Identities=9% Similarity=0.027 Sum_probs=153.9 Q ss_pred HHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCc--cee--ecCchhhHHhhhhhhhcccceeee Q lcl|NC_011308. 20 KIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASN--IKI--SHGFFAELVDQKTQYLLANGIDVK 95 (530) Q Consensus 20 ~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n--~ki--~~n~~k~Ivd~~~~yl~G~pv~~~ 95 (530) +-+-|.......++...+.- +....+ ..+..+.-.+....+. .++ ....+...|+..+..+.|-|..+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~--~~~~~~-----~~~~~~~~~~pp~~~~~La~~~~~n~~v~scI~~ia~~ia~~~~~i~ 73 (540) T protein:vir:41 1 MFNYHLSIKSLEKYRAIKGD--TDSQAL-----KEDRFEEYVEPKVHPLVLLSLLQVNPYHASACSIKANDILRTGYLID 73 (540) T ss_pred CCCcccChhhccchhhhhcc--cccccc-----ccCCCCccccCCCCHHHHHHHHHhcHHHHHHHHHHHHHHhcCCceEe Confidence 11111111111122111110 000000 0000000001000000 011 234456678888888999998886 Q ss_pred cCCcchHHHHHHHHHHhhc---cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCceeEEEE Q lcl|NC_011308. 96 PTDHDDQKLCYLIEEYYNE---EFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQRIIRF 171 (530) Q Consensus 96 ~~~~~de~~~~~l~~~~~~---~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~ 171 (530) ..+.. ... ++.| ........+..+...+|.||.++-++..|++ .+..++|..+-+..+... + T Consensus 74 ~~~~~---~~~----~lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~~~~~-------~ 139 (540) T protein:vir:41 74 GDDGG---VEE----LLRACRPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHRDGSR-------Y 139 (540) T ss_pred cCccc---hhh----hccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeEcCce-------e Confidence 54432 222 2222 2345566777888999999999999988876 467788888866554331 1 Q ss_pred EEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccccccccccc Q lcl|NC_011308. 172 YTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVL 251 (530) Q Consensus 172 y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 251 (530) +... ......++..|.......+. .+. T Consensus 140 ~~~~-------d~~~~~~~~~~~~~~~~~~~--~g~-------------------------------------------- 166 (540) T protein:vir:41 140 MQTW-------DGIHVTYFKDYRYEGEVNPD--NGE-------------------------------------------- 166 (540) T ss_pred Eeee-------cCceeeeeecccccceeecc--ccc-------------------------------------------- Confidence 1100 01111111111111110000 000 Q ss_pred ccccCCccceEEeeCCc-----CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeee--ecCCCCch----------- Q lcl|NC_011308. 252 GRSYKSRFPFDILYNNK-----LGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVV--RGGTNSPV----------- 313 (530) Q Consensus 252 ~~~~~~~iPiv~~~nn~-----~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl--~g~~~~~~----------- 313 (530) ....|..=-|+++++.. .|.|.+......|..-..+..-..+.+.-.+.|-.+| .|.-.+.. T Consensus 167 ~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~ 246 (540) T protein:vir:41 167 DQDGVGANEIIFIHLPSPICSYYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGR 246 (540) T ss_pred cceeecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHH Confidence 00011111234444321 5777666544444443333333333333344454444 34221110 Q ss_pred hhHHHHH---------hhCcceecCC----CCceeEEEecCC--HHHHHHHHHHHHHHHHHHhcccCC--C--cccccCC Q lcl|NC_011308. 314 DEIKKNI---------QSKKIIQTKG----EGGLDIQTVDIP--YEARKAKMDIDELNIYRSGMGFNS--S--AVGDGNA 374 (530) Q Consensus 314 ~~~~~~~---------~~~~~i~~~~----~~~~~~lt~~~~--~~~~e~~ld~L~~~I~~~s~~p~~--~--~~~~gn~ 374 (530) +.+.... ..++++.++. +++++|..-..+ +..+....+...+.|...-++|.. + +.+.+|- T Consensus 247 ~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~ 326 (540) T protein:vir:41 247 TVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGG 326 (540) T ss_pred HHHHHHHHHHhccccccccceEEEecCCCcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCc Confidence 0111111 1123344431 345666544433 333445566777778777788742 2 1112222 Q ss_pred cHH-HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCC Q lcl|NC_011308. 375 TNV-VIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQI 453 (530) Q Consensus 375 SGv-Aik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~i 453 (530) |++ +... .++...|.-+++.|...++..-..+.. ..+.+.|+..-.... +.+.....++.+|++ T Consensus 327 sn~eq~~~-------------~f~~~tL~P~~~~ie~~ln~~L~~~~~-~~~~i~f~~~~ll~~-D~~~~~~~lv~~G~l 391 (540) T protein:vir:41 327 NFAEVARR-------------TYYESVVRPQQEIVSSVLTDFIQLKLD-PGARFVFNEEILMES-EFVHNYALLVQCGVL 391 (540) T ss_pred ccHHHHHH-------------HHHHHHHHHHHHHHHHHHHHhhhhccC-CceEEEecchhhcch-HHHHHHHHHHhCCCC Confidence 221 1111 111222222222222222221111111 235666765433332 344445667889999 Q ss_pred cHHHHHHhCCCCCCHHHH-HHHHHHHHHHHHHHHHhhhccccccCC---ccccCCCCCCCCCCccCcCCCCc-------- Q lcl|NC_011308. 454 QINNLLAIAPRIGDEETL-KAICDTLDLDYEDVVKALEDQEVEELE---PTVTPIIDPLTIEPQPEPLNIDP-------- 521 (530) Q Consensus 454 S~et~l~~~~~vdd~~~e-~~~~e~e~~e~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~-------- 521 (530) +...+++.++.++.-.+. +.-..--..+.......-++.+.+... .+.++...+ ...++.|..... T Consensus 392 T~NE~Re~L~g~e~gdd~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 469 (540) T protein:vir:41 392 TPSEVREKLFGLDGGPDMFMVPSSIGKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQE--IISSESPLEDKKKKIDEVLS 469 (540) T ss_pred CHHHHHHHhCcCcCCCcccccccccccccccccccccCCCCccccccccchhcccccC--cccccccccccccccccccc Confidence 998898765333211111 000000000000000000000000000 000000000 000000000000 Q ss_pred ----ccccccCCC Q lcl|NC_011308. 522 ----VIEEEPVQE 530 (530) Q Consensus 522 ----~~~~~~~~~ 530 (530) ...+-+-++ T Consensus 470 ~~~~~~~~~~~~~ 482 (540) T protein:vir:41 470 DFRAEAYENGKKM 482 (540) T ss_pred ccCCccccchhHH Confidence 000000000 No 157 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=96.96 E-value=0.00024 Score=40.44 Aligned_cols=384 Identities=13% Similarity=0.054 Sum_probs=160.0 Q ss_pred HHHhcccch------------------hhhccc----cccccccccc-------cc----ccCCc--ceeecCchhhHHh Q lcl|NC_011308. 37 QRYYNQDND------------------IENTRI----MWMNDHGDIV-------ED----DNASN--IKISHGFFAELVD 81 (530) Q Consensus 37 ~~YY~g~~~------------------I~~r~~----~~~~~~~~~~-------~~----~~~~n--~ki~~n~~k~Ivd 81 (530) .++|+-.-- ++.+.. .......... .. ...+. .|++. .-.-|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~--V~acv~ 78 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSD--IFTAVM 78 (441) T ss_pred CceecCccceeccccccchhhhhhccccccccccccccCCCcchHHHHHHhhcccccCccccchhhhhccHH--HHHHHH Confidence 222222110 000000 0000000000 00 00000 01111 112355 Q ss_pred hhhhhhcccceeeecCCc--chHHHHHHHHHHhhccH---HHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccce Q lcl|NC_011308. 82 QKTQYLLANGIDVKPTDH--DDQKLCYLIEEYYNEEF---QSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQL 155 (530) Q Consensus 82 ~~~~yl~G~pv~~~~~~~--~de~~~~~l~~~~~~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~ 155 (530) ..++=+.+-|+.+.-... .+..+...|+.- -|.. ......+......+|.||.++-++..|++ .+..++|..+ T Consensus 79 ~Ia~~iA~lpl~~~~~~~~~~~~~~~~lL~~~-PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v 157 (441) T protein:vir:98 79 MIASDLARMPIRVTVNGQINYSDRIVNLLNTR-PNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEI 157 (441) T ss_pred HHHHhhccCceEEecCCcccccchHHHHHhcc-cccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcee Confidence 555555566776532111 111122222210 0222 23445677788899999999988988885 4788999999 Q ss_pred EEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccc Q lcl|NC_011308. 156 LPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDE 235 (530) Q Consensus 156 ~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (530) .+..++.+.+...... .... . ......+..+.+.|++... T Consensus 158 ~v~~~~~g~~~~~~~~-----~~~~---~--~~~~~~~~~~dviHir~~~------------------------------ 197 (441) T protein:vir:98 158 ELKLDARGRLYYFHQR-----IDSN---G--NNIERNVKFEDMLDIKFYS------------------------------ 197 (441) T ss_pred EEEECCCCcEEEEEEE-----eccC---c--ceeeEEEccccEEEeccCC------------------------------ Confidence 9888776653321111 0000 0 0111234445555443110 Q ss_pred ceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeee--cCCCCch Q lcl|NC_011308. 236 AILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVR--GGTNSPV 313 (530) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~--g~~~~~~ 313 (530) + +.-.|.|-++.....|+....+..-..+.+.-.+.|-.+|+ |.-.++. T Consensus 198 --------------------~---------dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~e 248 (441) T protein:vir:98 198 --------------------L---------DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKK 248 (441) T ss_pred --------------------C---------CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHH Confidence 0 00135666665555555444444444444444455555554 4322211 Q ss_pred --hhHHHHH----h----hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--CcccccCCcHHHHHH Q lcl|NC_011308. 314 --DEIKKNI----Q----SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGDGNATNVVIKS 381 (530) Q Consensus 314 --~~~~~~~----~----~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~SGvAik~ 381 (530) +.++..+ . .++++.+++|.+++-++....+...........+.|...-.+|.. +... ++.|-.... T Consensus 249 ~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~-~~~s~~q~~- 326 (441) T protein:vir:98 249 ARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET-ANMSITDAN- 326 (441) T ss_pred HHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC-CCccHHHHH- Confidence 1222222 1 234667777766666654444444445556667777777777742 2111 112211111 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHh Q lcl|NC_011308. 382 RYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAI 461 (530) Q Consensus 382 ~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~ 461 (530) ..|...|.-.+..|...++.+-........+++..+.-+-.|..+.++....+..+|+++...+++. T Consensus 327 -------------~~y~~tl~P~~~~ie~~ln~~L~~~~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~ 393 (441) T protein:vir:98 327 -------------LDYLSTLKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQR 393 (441) T ss_pred -------------HHHHHHHHHHHHHHHHHHHhhccccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 1122233333333333333322111111233443334455688888888888999999999999888 Q ss_pred CCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCccccc Q lcl|NC_011308. 462 APR--IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEE 525 (530) Q Consensus 462 ~~~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (530) +++ +++.+..+-.+-.--.. .+.....+............+. ..++ T Consensus 394 ~gl~pi~gGd~~~~~~~~n~~~-~~~~~~~q~~~~~~~~~~~kgG-----------------e~ne 441 (441) T protein:vir:98 394 DGLAPIPGGNGSIHRVDLNHVN-IELVDEYQMNKSRATDKKLKGG-----------------EENE 441 (441) T ss_pred hCCCCCCCCCcceEeecccccc-cccccccccccccccccccCCC-----------------CCCC Confidence 754 33332211000000000 0000000000000000000100 0000 No 158 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=96.95 E-value=0.00024 Score=40.41 Aligned_cols=388 Identities=10% Similarity=-0.012 Sum_probs=170.5 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |-+ +..+..+... ........+...+.+... ...+..+.... -.|+ .-....|+..+.=+.+- T Consensus 1 Mg~---f~~lf~r~~~-~~~~~~~~~~~~~~~~~~---------~~~g~~v~~~~--al~~--~~v~~~i~~Ia~~ia~~ 63 (414) T protein:vir:44 1 MVF---FSGLFQRKSD-APVTTPAELADAIGLSYD---------TYTGKQISSQR--AMRL--TAVFSCVRVLAESVGML 63 (414) T ss_pred Cch---hhhhhccCcc-CcccchhhHhHhhccCcc---------ccCCceechhh--hhcc--HHHHHHHHHHHHHhccC Confidence 322 2221111000 000000111111111000 00000000000 0111 12334556666666677 Q ss_pred ceeeecCCcc------hHHHHHHHHHHhhc--cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcC Q lcl|NC_011308. 91 GIDVKPTDHD------DQKLCYLIEEYYNE--EFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDD 161 (530) Q Consensus 91 pv~~~~~~~~------de~~~~~l~~~~~~--~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~ 161 (530) |+.+--.+.+ +..+...|+.-=+. ........+......+|.||.++..+ .|.+ .+..++|..+-+.+++ T Consensus 64 p~~~~~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~~~~ 142 (414) T protein:vir:44 64 PCNLYHLNGSLKQRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPKLNS 142 (414) T ss_pred ceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEEECC Confidence 8765322211 12223333211011 22345566778888999999888766 5765 4778999999888877 Q ss_pred CCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccc Q lcl|NC_011308. 162 YGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEG 241 (530) Q Consensus 162 ~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 241 (530) .+.+. |.+.... + ....+..+.+.|++... T Consensus 143 ~~~~~-----y~~~~~~-----g----~~~~~~~~evih~~~~~------------------------------------ 172 (414) T protein:vir:44 143 SWEPV-----YQVTFPD-----G----STDVLSQEDIWHVRTLT------------------------------------ 172 (414) T ss_pred CCcEE-----EEEEecC-----c----eEEEEccccEEEecCCC------------------------------------ Confidence 65431 2221111 1 11234555555543110 Q ss_pred ccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC-C--chhhHHH Q lcl|NC_011308. 242 VEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN-S--PVDEIKK 318 (530) Q Consensus 242 ~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~-~--~~~~~~~ 318 (530) + +.-.|.|-+......++....+..-..+.+.-.+.|-.+++.... + ..+.++. T Consensus 173 --------------~---------d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~ 229 (414) T protein:vir:44 173 --------------L---------DGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKK 229 (414) T ss_pred --------------C---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHH Confidence 0 011366666666666655555544455555555556666654322 1 1122222 Q ss_pred HHh--------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHH Q lcl|NC_011308. 319 NIQ--------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAM 388 (530) Q Consensus 319 ~~~--------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ 388 (530) .+. .++++.+++|.+++-++....+.......+.....|...-.+|. ++....++-|++ + T Consensus 230 ~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~--e-------- 299 (414) T protein:vir:44 230 DFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNI--E-------- 299 (414) T ss_pred HHHHHhcCccccCcceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccH--H-------- Confidence 221 23456666665555554433333344456667777887778875 232222222221 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEe--CCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCC- Q lcl|NC_011308. 389 KAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDI--EPYILANELDLAMIDKTEAETNQIQINNLLAIAPRI- 465 (530) Q Consensus 389 ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f--~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~v- 465 (530) .....++...|+-.++.|...++.+-........+.+.| ..-+-.|..+.++...++..+|+++...+++.+++- T Consensus 300 --~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p 377 (414) T protein:vir:44 300 --ELGLGFINYSLVPYLTRIEQRINTGLVRKSKQGVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNP 377 (414) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHhhcCCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Confidence 111234445555555555555544322222222334444 444456888888888899999999999998887541 Q ss_pred -CCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCC Q lcl|NC_011308. 466 -GDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQ 529 (530) Q Consensus 466 -dd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (530) +.- +.......-...+...... ....++...++|-| T Consensus 378 ~~gg--------------D~~~~~~n~~~~~~~~~~~--------------~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 378 RPGG--------------DVYLTPMNMTTKPSDGSKA--------------GKQKDNANADETTS 414 (414) T ss_pred CCCc--------------ceecccccccccCCccccC--------------CCCCCCCCCCCCCC Confidence 111 0111000000000000000 00111111222222 No 159 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=96.91 E-value=0.00026 Score=40.20 Aligned_cols=455 Identities=13% Similarity=0.069 Sum_probs=168.2 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHH----------------HHHHHHHHhcccchhhhccccccc--ccccccc Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVS----------------LARVGQRYYNQDNDIENTRIMWMN--DHGDIVE 62 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~----------------~~~~~~~YY~g~~~I~~r~~~~~~--~~~~~~~ 62 (530) |..-|.-... +-+.-|++|+++..+. ....+.+.-.++..+...+.-... ..+...+ T Consensus 1 ~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (574) T protein:vir:80 1 MPKWLDKALG-----IEKSSIEETRNMENYKMHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTK 75 (574) T ss_pred Ccchhhhhhc-----cchhhHHHHHhhhhhccccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCc Confidence 2222211111 0011122222221110 011123333333222211100000 0000000 Q ss_pred cccCCcc---eee-----cCchhhHHhhhhhhh-----------cccceeeecCCcc------hHHHHHHHHHHhhc--- Q lcl|NC_011308. 63 DDNASNI---KIS-----HGFFAELVDQKTQYL-----------LANGIDVKPTDHD------DQKLCYLIEEYYNE--- 114 (530) Q Consensus 63 ~~~~~n~---ki~-----~n~~k~Ivd~~~~yl-----------~G~pv~~~~~~~~------de~~~~~l~~~~~~--- 114 (530) ...++.. ++. ......+++..+.-+ .|-|..+...+.+ .......|.+++.+ T Consensus 76 ~~~~~~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~ 155 (574) T protein:vir:80 76 PSIRNSQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQ 155 (574) T ss_pred CccCCcccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCC Confidence 0000000 010 111233444433211 1344443222111 11112223344321 Q ss_pred -------cHHHHHHHHHHHHhhcCeEEEEEEecCCCceE-EEEecccceEEEEcCCCCc-eeEEEEEEEEeecccccccc Q lcl|NC_011308. 115 -------EFQSAIQELVEGSTIKGYEGIFARTTSEDKLT-FQTVDALQLLPVFDDYGTL-QRIIRFYTEQRYSDADNKFN 185 (530) Q Consensus 115 -------~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~-~~~~~p~~~~~v~d~~~~~-~~~~~~y~~~~~~~~~~~~~ 185 (530) .+......+..+...+|.+|.++-++..|++. +..++|..+.++.+..+.. ....+||.... +. T Consensus 156 ~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~G~~~~L~pl~p~~V~v~~d~~~~~~~~~~~y~~~~~-------g~ 228 (574) T protein:vir:80 156 FRDPNRDNFTTFCKKLVRATYMYDQVNFEKVFDKDGNFIKFDTVDPTTIFLATNGEGKLIKNGERFVQVID-------NR 228 (574) T ss_pred CCCCccccHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccccccCceEEEEEeC-------Cc Confidence 22345566777888999999988889888864 6789999998887665422 11223332210 00 Q ss_pred eEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEee Q lcl|NC_011308. 186 SIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILY 265 (530) Q Consensus 186 ~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~ 265 (530) ....|..+.+.|++..... +.+ T Consensus 229 ---~~~~~~~~eiih~~~~~~~----------------------------------------------~~~--------- 250 (574) T protein:vir:80 229 ---IVAKFNERELAFAVRNPRA----------------------------------------------DIE--------- 250 (574) T ss_pred ---eEEEEccccEEEEeccCCC----------------------------------------------Ccc--------- Confidence 1123445555554321110 000 Q ss_pred CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceee--eecCC-CCc--hhhHHHHHh--------hCcc-eecCCC Q lcl|NC_011308. 266 NNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYV--VRGGT-NSP--VDEIKKNIQ--------SKKI-IQTKGE 331 (530) Q Consensus 266 nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lv--l~g~~-~~~--~~~~~~~~~--------~~~~-i~~~~~ 331 (530) ..-.|.|.++.....|+....+..-..+.+.-.+.|-.+ +++.. .++ ...++..+. .+++ +.+++| T Consensus 251 ~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G 330 (574) T protein:vir:80 251 VGQYGYPELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAED 330 (574) T ss_pred cccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCC Confidence 011467777766666665555554455555555555544 44432 222 122222221 1122 333444 Q ss_pred CceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHH-HHhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 332 GGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKS-RYTLLAMKAQKTEIALRKTLRWTADLV 408 (530) Q Consensus 332 ~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~-~~~~l~~ka~~ke~~f~~~l~~~~~~i 408 (530) .++.-++....+..+....+.+.+.|...-.+|. ++...-|...|..... -+..++ ......+...|+=+++.| T Consensus 331 ~~~~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E---~~~~~f~~~tL~P~~~~i 407 (574) T protein:vir:80 331 VKFVNMTPSANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSK---EKMQASQNKGLQPLLRFI 407 (574) T ss_pred ceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHH---HHHHHHHHHHHHHHHHHH Confidence 4444444333344444556667777877777875 2222211111110000 011111 111223333344444444 Q ss_pred HHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHHHHHHH--HHHHHHHHH Q lcl|NC_011308. 409 VEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEETLKAIC--DTLDLDYED 484 (530) Q Consensus 409 ~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~e~~~~--e~e~~e~~~ 484 (530) ...++..-...+. ..+.+.|.+.-.....+.+++ .....+|+++...+++.+++ +++-+.-+.-. ......... T Consensus 408 e~~ln~~Ll~~~~-~~~~~~f~~~d~~~~~~~~~~-~~~~~~G~lT~NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~ 485 (574) T protein:vir:80 408 EDTVNTYIVAEFG-EKYQFQFRGGDLSAQLDKLKI-IEQEGKVFRTVNEIRHDKGLEPIKGGDVILNGVHIQAIGQALQE 485 (574) T ss_pred HHHHHhhhhhhcC-CceEEEecccchhhHHHHHHH-HHHHhCCccCHHHHHHHhCCCCCCCCCEeeeccceeeccccccc Confidence 4434332222222 246778887766666666554 34566899999999988754 33211110000 000000000 Q ss_pred HHHhhhccccccCCccccCCCCCCCC---CCccCcCCCCcccccccCCC Q lcl|NC_011308. 485 VVKALEDQEVEELEPTVTPIIDPLTI---EPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 530 (530) .......+.....+...+...++.+. .|...+.+.+......+... T Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~~~~ 534 (574) T protein:vir:80 486 EQLEYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQNDTDVSFQDEQQGL 534 (574) T ss_pred ccCCccchhccccccccccCCCCCCCCCCCCCCccccccchhhhhhhhh Confidence 00000000000000000000000000 00001111111111111111 No 160 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=96.88 E-value=0.00028 Score=40.07 Aligned_cols=373 Identities=12% Similarity=0.046 Sum_probs=161.6 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccc--hhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDN--DIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLL 88 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~--~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~ 88 (530) |-.. +.+.+........-.....+...-. .++.-. ....+..+ .+..=+.+.=....|+..++-+. T Consensus 1 m~m~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~g~~v----~~~~al~~~~v~~~v~~ia~~ia 68 (392) T protein:vir:74 1 MILP-----ILNFINQTNDPPEAGSVQSYFPDGNDAQIMESL---LGDNNEWV----SARAALRNSDLFSIILQLSSDLA 68 (392) T ss_pred Ccch-----hhhhhhcccCcccccccccccccCchhhhhhhc---cCCCCccc----chhhhhcchHHHHHHHHHHHhhc Confidence 2111 1111111100000000000000000 000000 00000000 00000111223345666666666 Q ss_pred ccceeeecCCcchHHHHHHHHHHhhc-cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCce Q lcl|NC_011308. 89 ANGIDVKPTDHDDQKLCYLIEEYYNE-EFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQ 166 (530) Q Consensus 89 G~pv~~~~~~~~de~~~~~l~~~~~~-~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~ 166 (530) +-|+++.... . ...+.+=... .-......+..+...+|.||.++-++.+|++ .+..++|..+-+..+..+... T Consensus 69 ~lp~~~~~~~--~---~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~ 143 (392) T protein:vir:74 69 IVKINAEKKK--N---QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGM 143 (392) T ss_pred cCceeeccch--h---hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceE Confidence 7787764221 1 1122221110 1123445667788999999999989999986 578889999887776543211 Q ss_pred eEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccc Q lcl|NC_011308. 167 RIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHE 246 (530) Q Consensus 167 ~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 246 (530) +|.+..... .......|..+.+.|++.... T Consensus 144 ----~y~~~~~~~------~~~~~~~~~~~evih~~~~~~---------------------------------------- 173 (392) T protein:vir:74 144 ----YYNITFDDP------KIEPILQAPQSDLIHMKLLSI---------------------------------------- 173 (392) T ss_pred ----EEEEEecCC------ccceeEEEcCccEEEecCCCC---------------------------------------- Confidence 222211110 011122344555555432110 Q ss_pred cccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeee--cCCCCchhhHHHH----H Q lcl|NC_011308. 247 GRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVR--GGTNSPVDEIKKN----I 320 (530) Q Consensus 247 ~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~--g~~~~~~~~~~~~----~ 320 (530) ...-.|.|-++.....|+....+..-..+.+.-...|-.+++ +... ..++.+.. . T Consensus 174 ------------------~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~-~~~~~~~~~~~~~ 234 (392) T protein:vir:74 174 ------------------DGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGL-LSDKDKASRSRSF 234 (392) T ss_pred ------------------CCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC-chHHHHHHHHHHH Confidence 001147777776666665555555445555555555655554 3211 11221211 1 Q ss_pred ----hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHHHHH Q lcl|NC_011308. 321 ----QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQKTE 394 (530) Q Consensus 321 ----~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~~ke 394 (530) ..++++.+++|.+++-++....+.......+.+.+.|...=.+|. +++...++.+..++ + T Consensus 235 ~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~e~~--------------~ 300 (392) T protein:vir:74 235 MKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQI--------------S 300 (392) T ss_pred hccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHH--------------H Confidence 233556677666666665444444445556677777777777774 22221111111111 2 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhC---CCCCCHHHH Q lcl|NC_011308. 395 IALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIA---PRIGDEETL 471 (530) Q Consensus 395 ~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~---~~vdd~~~e 471 (530) ..+...|.-.++.|..-++.+-... +.+.+..-+-.|..+.++.+..+..+|++++..+.+.+ ++..| | T Consensus 301 ~~~~~~l~p~~~~ie~~l~~~l~~~-----~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pn---e 372 (392) T protein:vir:74 301 GMYASALNRYLRPAISELEYKLSDH-----ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK---D 372 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHhccch-----hcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCcc---c Confidence 2344445444444444444332111 22222222234556777778888999999998887654 33221 1 Q ss_pred HHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCc Q lcl|NC_011308. 472 KAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEP 516 (530) Q Consensus 472 ~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (530) ..+ .+.++ ++.+.++++|- | T Consensus 373 ~r~--------~enl~---------------~~~~Gd~~~p~--p 392 (392) T protein:vir:74 373 LPA--------PENTN---------------KKTTGQSNEPV--P 392 (392) T ss_pred cch--------hcCCC---------------CCCCCCCCCCC--C Confidence 100 00111 11111112221 1 No 161 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=96.78 E-value=0.00034 Score=39.57 Aligned_cols=407 Identities=11% Similarity=0.047 Sum_probs=164.1 Q ss_pred HHHHHHHHHHHHHHhhhHHHHH-HHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhcccc Q lcl|NC_011308. 13 LGTILSTKIDEYIRSQNVSLAR-VGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLANG 91 (530) Q Consensus 13 ~~~~i~~~i~~~~~~~~~~~~~-~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~p 91 (530) ..+.+.+.+.......+-.-.. ..+.+......++....-..-..+..+... .-.|+. =.-..|+..+.=+.+-| T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~--~al~~~--~V~~~i~~ia~~ia~lp 76 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPRSSLFGWGGKTIRLTDGAFWSQFLGRESSSGKKVTVD--KAMKLS--AVWACVRLISTSVAGLP 76 (434) T ss_pred CccchhhhhhhcccccchhhhcccccccccCchHHHHHHhcCCccCCceechh--hhhccH--HHHHHHHHHHHhhhhCc Confidence 2233333333322211110000 000111111111110000000000000000 001111 12234566666666778 Q ss_pred eee-ecCCcch--HHHHHHHHHHhh---ccH---HHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcC Q lcl|NC_011308. 92 IDV-KPTDHDD--QKLCYLIEEYYN---EEF---QSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDD 161 (530) Q Consensus 92 v~~-~~~~~~d--e~~~~~l~~~~~---~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~ 161 (530) +++ ....++. +....-+.+++. |.. ......+......+|.+|.++-.+ .|++ ....++|..|-++.++ T Consensus 77 ~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-~G~~~~L~~l~p~~v~~~~~~ 155 (434) T protein:vir:43 77 LGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-AGRPAALDFLLPSRVDLECDE 155 (434) T ss_pred eEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEcC Confidence 775 2221111 111112333331 222 244566777889999999887665 5764 4677899999887776 Q ss_pred CCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccc Q lcl|NC_011308. 162 YGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEG 241 (530) Q Consensus 162 ~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 241 (530) .+.+. |...... + ....|..+.+.|++... T Consensus 156 ~g~~~-----y~~~~~~-----g----~~~~~~~~eVih~~~~~------------------------------------ 185 (434) T protein:vir:43 156 NGRLK-----YFYTTKK-----G----ARREIERTNMLHIPAFT------------------------------------ 185 (434) T ss_pred CCeEE-----EEEEecC-----c----eEEEEccccEEEecCcC------------------------------------ Confidence 65322 1111100 1 01234555555543210 Q ss_pred ccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC-Cc--hhhHHH Q lcl|NC_011308. 242 VEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN-SP--VDEIKK 318 (530) Q Consensus 242 ~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~-~~--~~~~~~ 318 (530) + +.-.|.|-++.....|.....+..-..+.+.-.+.|-.+++-... ++ .+.++. T Consensus 186 --------------~---------dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~r~ 242 (434) T protein:vir:43 186 --------------L---------DGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRILQPAQREEFRE 242 (434) T ss_pred --------------C---------CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCCCCHHHHHHHHH Confidence 0 111355555544444333332222233333333445555543221 11 122222 Q ss_pred HHh-------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHH Q lcl|NC_011308. 319 NIQ-------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMK 389 (530) Q Consensus 319 ~~~-------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~k 389 (530) .++ .++++.+++|.+++-++....+..+....+.....|...-.+|. ++...-++.++..++-. T Consensus 243 ~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~------- 315 (434) T protein:vir:43 243 YVKSVSGAMNSGRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQ------- 315 (434) T ss_pred HHHHhcCccccCCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHH------- Confidence 221 23455566555555444333344445556777888888888884 22222122223222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCC--ccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--C Q lcl|NC_011308. 390 AQKTEIALRKTLRWTADLVVEDIRRRGLG--DYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--I 465 (530) Q Consensus 390 a~~ke~~f~~~l~~~~~~i~~~l~~~~~~--~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--v 465 (530) ...++...|.-.+..|...++.+-.. +.....+++.++.-+-.|..+.++...++..+|+++...+++.+++ + T Consensus 316 ---~~~f~~~~L~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~ 392 (434) T protein:vir:43 316 ---MLAFLTFSISSITNQIQQCVNKRLLTAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPEL 392 (434) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHhhcCChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 11233444544455554444432211 1111234444445556788888888889999999999999888754 2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCc Q lcl|NC_011308. 466 GDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEP 516 (530) Q Consensus 466 dd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (530) ++-+.-. +. ..+..+.........+..++........|+|++ T Consensus 393 ~ggD~~~--~~-------~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 393 PGGDILT--VQ-------SNLVPIDQLGQSNKSQAVRAALMNWFSQPEPQE 434 (434) T ss_pred CCCCeEe--ec-------cCccchhhhhccCCCcchhhhhhccCCCCCCCC Confidence 2211000 00 000000000000000000000000011111111 No 162 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=96.77 E-value=0.00035 Score=39.53 Aligned_cols=389 Identities=7% Similarity=-0.021 Sum_probs=163.7 Q ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhcccceee Q lcl|NC_011308. 15 TILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLANGIDV 94 (530) Q Consensus 15 ~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~pv~~ 94 (530) =++..+ +.. ++.. ...-.++..- +.-.....-..+..+ .+..=+.+.-....|+..++=+.+-|+.+ T Consensus 1 m~~~~~---~~~-~~~~--~s~~~~w~~~---~~~~~~~~~~~g~~v----t~~~al~~~~v~~~i~~Ia~~iA~lp~~~ 67 (421) T protein:vir:10 1 MFIPQM---FEG-KKRS--VSGGGFWEAM---LGGVRSSHSKAGVMI----TPETALALSAVRACVTLLAESVAQLPVEL 67 (421) T ss_pred CCCcch---hcc-cccc--cCcchhhHHH---hhhhccCcccCCcee----chHHhhccHHHHHHHHHHHHhhccCceEE Confidence 000000 000 0000 0000010000 000000000000000 00000111223345666666666778774 Q ss_pred e-cCCcch------HHHHHHHHHHhhcc---HHHHHHHHHHHHhhcCeEEEEEEecCCCceE-EEEecccceEEEEcCCC Q lcl|NC_011308. 95 K-PTDHDD------QKLCYLIEEYYNEE---FQSAIQELVEGSTIKGYEGIFARTTSEDKLT-FQTVDALQLLPVFDDYG 163 (530) Q Consensus 95 ~-~~~~~d------e~~~~~l~~~~~~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~-~~~~~p~~~~~v~d~~~ 163 (530) - ....+. ..+...|+.= -|. -......+..+...+|.||.++-++.+|.+. ...++|..+-++.++.+ T Consensus 68 ~~~~~~g~~~~~~~~~l~~lL~~~-PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~g 146 (421) T protein:vir:10 68 YRRDKNGGRQRATDHPIYDLIHSQ-PNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELIPINPKKVIVLKGPDG 146 (421) T ss_pred EEEcCCCceeecccchHHHHHhhc-ccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEECCCc Confidence 2 211111 1122222210 021 2234456677888999999999899888763 67788888887666554 Q ss_pred CceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccc Q lcl|NC_011308. 164 TLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVE 243 (530) Q Consensus 164 ~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (530) .+ +|.+. . .+. .+..+.+.|.+... T Consensus 147 ~~-----~y~~~--~----~g~------~~~~~eiih~~~~~-------------------------------------- 171 (421) T protein:vir:10 147 MP-----YYEIP--E----IGE------TLPMRMMHHVKVFS-------------------------------------- 171 (421) T ss_pred eE-----EEEEc--C----CCc------EEchhhEEEecCcC-------------------------------------- Confidence 21 22110 0 000 12233333332100 Q ss_pred ccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC---CCchhh----H Q lcl|NC_011308. 244 EHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT---NSPVDE----I 316 (530) Q Consensus 244 ~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~---~~~~~~----~ 316 (530) + +.-.|.|-++.....|+....+..-..+.+.-.+.|-.+|+-.. ....++ + T Consensus 172 ------------~---------d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~ 230 (421) T protein:vir:10 172 ------------L---------DGYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQL 230 (421) T ss_pred ------------C---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHH Confidence 0 11136666665555555444443333444455555655655211 111112 2 Q ss_pred HHHH--------hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--CcccccCCcHHHHHHHHhhH Q lcl|NC_011308. 317 KKNI--------QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGDGNATNVVIKSRYTLL 386 (530) Q Consensus 317 ~~~~--------~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~SGvAik~~~~~l 386 (530) +..+ ..++++.+++|.+++-++....+.......+...+.|...-.+|.. +...-++-|++ T Consensus 231 ~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~--------- 301 (421) T protein:vir:10 231 LAKWTDRYSGINNMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNI--------- 301 (421) T ss_pred HHHHHHHhcCccccCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCccccH--------- Confidence 2211 1235666777766666654444444455566777888888888852 22111221221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEe--CCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC Q lcl|NC_011308. 387 AMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDI--EPYILANELDLAMIDKTEAETNQIQINNLLAIAPR 464 (530) Q Consensus 387 ~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f--~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~ 464 (530) -.....++...|.-.+..|...++.+-..........+.| ..-+-.|..+.++...++..+|+++...+++.+++ T Consensus 302 ---e~~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl 378 (421) T protein:vir:10 302 ---EHQGLQFVMYTLLAWLKRHEGALQRDLLLPSERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMENL 378 (421) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 1111233444555555555554444322111122333444 44455688888888889999999999999988754 Q ss_pred --CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCccccc Q lcl|NC_011308. 465 --IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEE 525 (530) Q Consensus 465 --vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (530) +++-+.- ...+.-...... ..+ ..+...+++.+..+-.+.+ T Consensus 379 ~p~~ggD~~--------------~~~~n~~~~~~~---~~~---~~~~~~~~~~e~d~~~~~~ 421 (421) T protein:vir:10 379 PPIAGGDKY--------------LTPLNMVDSAQI---IPG---DKKPTAQQMAEIDTILSRT 421 (421) T ss_pred CCCCCccee--------------eecccccccccc---ccC---CCCcccccCcccccccccC Confidence 2221111 111110000000 000 0111111111111111111 No 163 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=96.68 E-value=0.00042 Score=39.12 Aligned_cols=476 Identities=9% Similarity=-0.010 Sum_probs=200.5 Q ss_pred ccHHHHHHHHHHHHH--HhhhHH---HHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhh Q lcl|NC_011308. 11 DRLGTILSTKIDEYI--RSQNVS---LARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQ 85 (530) Q Consensus 11 ~~~~~~i~~~i~~~~--~~~~~~---~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~ 85 (530) |......+++...|. ..+|.. +.+.+.+|.--...-+ ....+. ...+.+.|+..+-+...++..++ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~------~~~~~~---~~~~~~~~~~dst~~~a~~~LAa 71 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRF------FVQDRN---RGEKRHNNILDNTGTRALRVLAA 71 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccc------cCCCCC---cchhcccccccccHHHHHHHHHH Confidence 333322223332221 122323 3344444432211000 000000 01123456777777777777777 Q ss_pred hhccc--cee-----eecCCcc---hHH-------HHHHHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEE Q lcl|NC_011308. 86 YLLAN--GID-----VKPTDHD---DQK-------LCYLIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTF 147 (530) Q Consensus 86 yl~G~--pv~-----~~~~~~~---de~-------~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~ 147 (530) .|++- |+. +...+.+ ... ....+...+ ..||....+++.++...+|.|..++-.|..+-+++ T Consensus 72 ~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf 151 (555) T protein:vir:10 72 GMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYH 151 (555) T ss_pred HHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEE Confidence 66642 211 2222211 111 222233333 35788889999999999999987766666677889 Q ss_pred EEecccceEEEEcCCCCceeEEEEEEEEeec-------cc--------ccccceEEEEEEEcCCceEEEeecCCcccchh Q lcl|NC_011308. 148 QTVDALQLLPVFDDYGTLQRIIRFYTEQRYS-------DA--------DNKFNSIGHADVWTDTEVWYYVQKDEGRSDEY 212 (530) Q Consensus 148 ~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~-------~~--------~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~ 212 (530) ..++..++++--|..+....++|.+...... .. ...+..-.+++|++. .|...+..... T Consensus 152 ~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~~~-- 225 (555) T protein:vir:10 152 HSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHA----IEPRADRDPSK-- 225 (555) T ss_pred EEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE----EeeccCcCcCC-- Confidence 9999999988888888888877765421110 00 000000112332221 01111100000 Q ss_pred hccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHHHHHHHHHHH Q lcl|NC_011308. 213 VLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVKSIIDDYDLM 287 (530) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~liDa~~~~ 287 (530) .+. ...+ +..+ ....+. .........+|...|++.++- +.+|.|--++..+-+-.++.+ T Consensus 226 -~~~--~~~p---~~s~-------~~~~~~---d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l 289 (555) T protein:vir:10 226 -RDD--RNMA---WKSV-------YFEPGA---DETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHE 289 (555) T ss_pred -CCc--cccc---eEEE-------EEEecc---CCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHH Confidence 000 0000 0000 000000 000112334566677776654 458999999999999999997 Q ss_pred HHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecC--CCCce--eEEEecCCHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011308. 288 NCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTK--GEGGL--DIQTVDIPYEARKAKMDIDELNIYRSGMG 363 (530) Q Consensus 288 ~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~--~~~~~--~~lt~~~~~~~~e~~ld~L~~~I~~~s~~ 363 (530) .-......+...+|.+.+....... ..++..+++..+. .+++. -.+....+.......++.++..|=..-+. T Consensus 290 ~~~~l~~~~~~~~pp~~v~~~~~~~----~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~ 365 (555) T protein:vir:10 290 QLRKAQAIDYKSNPPLQLPVSAKNQ----DISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYA 365 (555) T ss_pred HHHHHHHHHHHhcCceeeccccccc----cceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhc Confidence 7778888899898877763322111 1123333332222 22322 22233346667777788887777433322 Q ss_pred c---CCCcccccCCcHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCC-----CccccceeeEEeCCCCC Q lcl|NC_011308. 364 F---NSSAVGDGNATNVVIKSRYTLLAM-KAQKTEIALRKTLRWTADLVVEDIRRRGL-----GDYSSTDIKFDIEPYIL 434 (530) Q Consensus 364 p---~~~~~~~gn~SGvAik~~~~~l~~-ka~~ke~~f~~~l~~~~~~i~~~l~~~~~-----~~~d~~~i~i~f~~~~P 434 (530) . .++..+-...++..+..+-.-+.. ......+.-.+.|.=+++-++.++...+. ..+....|++.|...|= T Consensus 366 dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La 445 (555) T protein:vir:10 366 DLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLA 445 (555) T ss_pred chhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHH Confidence 1 133333334565555433222221 12222233333333333333344444332 13344567777766654 Q ss_pred CCH--HHHHHHHHHHHhcCC-----------CcHHHHHHh----CCC----CCCHHHHHHHHHHHHHH--HHHHHHhhhc Q lcl|NC_011308. 435 ANE--LDLAMIDKTEAETNQ-----------IQINNLLAI----APR----IGDEETLKAICDTLDLD--YEDVVKALED 491 (530) Q Consensus 435 ~n~--~e~a~~~~~~~~~g~-----------iS~et~l~~----~~~----vdd~~~e~~~~e~e~~e--~~~~~~~~~~ 491 (530) +.. .+...+.+.+...|. +.-..++.. ++. +.. ++|.+++.+.+.+ ....+.++.. T Consensus 446 ~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs-~eev~~~r~qr~~~~q~~~~a~~~~ 524 (555) T protein:vir:10 446 QAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVP-GNQVALIRKQRADQQQAAQQAALLN 524 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCC-HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 321 111111111111111 222222222 211 121 2333433333222 2222222322 Q ss_pred cccccCCccccCCCCCCCCCCccCcCCCCccc Q lcl|NC_011308. 492 QEVEELEPTVTPIIDPLTIEPQPEPLNIDPVI 523 (530) Q Consensus 492 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (530) +.......=++..++. .+....--...++-| T Consensus 525 q~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 525 QGADTAAKLGSVDTSK-QNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHHhcccccCc-chhHHHHHhhhccCC Confidence 2111110000000000 000000001111112 No 164 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=96.68 E-value=0.00042 Score=39.12 Aligned_cols=476 Identities=9% Similarity=-0.010 Sum_probs=200.5 Q ss_pred ccHHHHHHHHHHHHH--HhhhHH---HHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhh Q lcl|NC_011308. 11 DRLGTILSTKIDEYI--RSQNVS---LARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQ 85 (530) Q Consensus 11 ~~~~~~i~~~i~~~~--~~~~~~---~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~ 85 (530) |......+++...|. ..+|.. +.+.+.+|.--...-+ ....+. ...+.+.|+..+-+...++..++ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~------~~~~~~---~~~~~~~~~~dst~~~a~~~LAa 71 (555) T protein:vir:98 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRF------FVQDRN---RGEKRHNNILDNTGTRALRVLAA 71 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccc------cCCCCC---cchhcccccccccHHHHHHHHHH Confidence 333322223332221 122323 3344444432211000 000000 01123456777777777777777 Q ss_pred hhccc--cee-----eecCCcc---hHH-------HHHHHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEE Q lcl|NC_011308. 86 YLLAN--GID-----VKPTDHD---DQK-------LCYLIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTF 147 (530) Q Consensus 86 yl~G~--pv~-----~~~~~~~---de~-------~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~ 147 (530) .|++- |+. +...+.+ ... ....+...+ ..||....+++.++...+|.|..++-.|..+-+++ T Consensus 72 ~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf 151 (555) T protein:vir:98 72 GMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYH 151 (555) T ss_pred HHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEE Confidence 66642 211 2222211 111 222233333 35788889999999999999987766666677889 Q ss_pred EEecccceEEEEcCCCCceeEEEEEEEEeec-------cc--------ccccceEEEEEEEcCCceEEEeecCCcccchh Q lcl|NC_011308. 148 QTVDALQLLPVFDDYGTLQRIIRFYTEQRYS-------DA--------DNKFNSIGHADVWTDTEVWYYVQKDEGRSDEY 212 (530) Q Consensus 148 ~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~-------~~--------~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~ 212 (530) ..++..++++--|..+....++|.+...... .. ...+..-.+++|++. .|...+..... T Consensus 152 ~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~~~-- 225 (555) T protein:vir:98 152 HSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHA----IEPRADRDPSK-- 225 (555) T ss_pred EEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE----EeeccCcCcCC-- Confidence 9999999988888888888877765421110 00 000000112332221 01111100000 Q ss_pred hccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHHHHHHHHHHH Q lcl|NC_011308. 213 VLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVKSIIDDYDLM 287 (530) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~liDa~~~~ 287 (530) .+. ...+ +..+ ....+. .........+|...|++.++- +.+|.|--++..+-+-.++.+ T Consensus 226 -~~~--~~~p---~~s~-------~~~~~~---d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l 289 (555) T protein:vir:98 226 -RDD--RNMA---WKSV-------YFEPGA---DETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHE 289 (555) T ss_pred -CCc--cccc---eEEE-------EEEecc---CCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHH Confidence 000 0000 0000 000000 000112334566677776654 458999999999999999997 Q ss_pred HHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecC--CCCce--eEEEecCCHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011308. 288 NCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTK--GEGGL--DIQTVDIPYEARKAKMDIDELNIYRSGMG 363 (530) Q Consensus 288 ~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~--~~~~~--~~lt~~~~~~~~e~~ld~L~~~I~~~s~~ 363 (530) .-......+...+|.+.+....... ..++..+++..+. .+++. -.+....+.......++.++..|=..-+. T Consensus 290 ~~~~l~~~~~~~~pp~~v~~~~~~~----~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~ 365 (555) T protein:vir:98 290 QLRKAQAIDYKSNPPLQLPVSAKNQ----DISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYA 365 (555) T ss_pred HHHHHHHHHHHhcCceeeccccccc----cceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhc Confidence 7778888899898877763322111 1123333332222 22322 22233346667777788887777433322 Q ss_pred c---CCCcccccCCcHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCC-----CccccceeeEEeCCCCC Q lcl|NC_011308. 364 F---NSSAVGDGNATNVVIKSRYTLLAM-KAQKTEIALRKTLRWTADLVVEDIRRRGL-----GDYSSTDIKFDIEPYIL 434 (530) Q Consensus 364 p---~~~~~~~gn~SGvAik~~~~~l~~-ka~~ke~~f~~~l~~~~~~i~~~l~~~~~-----~~~d~~~i~i~f~~~~P 434 (530) . .++..+-...++..+..+-.-+.. ......+.-.+.|.=+++-++.++...+. ..+....|++.|...|= T Consensus 366 dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La 445 (555) T protein:vir:98 366 DLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLA 445 (555) T ss_pred chhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHH Confidence 1 133333334565555433222221 12222233333333333333344444332 13344567777766654 Q ss_pred CCH--HHHHHHHHHHHhcCC-----------CcHHHHHHh----CCC----CCCHHHHHHHHHHHHHH--HHHHHHhhhc Q lcl|NC_011308. 435 ANE--LDLAMIDKTEAETNQ-----------IQINNLLAI----APR----IGDEETLKAICDTLDLD--YEDVVKALED 491 (530) Q Consensus 435 ~n~--~e~a~~~~~~~~~g~-----------iS~et~l~~----~~~----vdd~~~e~~~~e~e~~e--~~~~~~~~~~ 491 (530) +.. .+...+.+.+...|. +.-..++.. ++. +.. ++|.+++.+.+.+ ....+.++.. T Consensus 446 ~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs-~eev~~~r~qr~~~~q~~~~a~~~~ 524 (555) T protein:vir:98 446 QAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVP-GNQVALIRKQRADQQQAAQQAALLN 524 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCC-HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 321 111111111111111 222222222 211 121 2333433333222 2222222322 Q ss_pred cccccCCccccCCCCCCCCCCccCcCCCCccc Q lcl|NC_011308. 492 QEVEELEPTVTPIIDPLTIEPQPEPLNIDPVI 523 (530) Q Consensus 492 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (530) +.......=++..++. .+....--...++-| T Consensus 525 q~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 555 (555) T protein:vir:98 525 QGADTAAKLGSVDTSK-QNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHHhcccccCc-chhHHHHHhhhccCC Confidence 2111110000000000 000000001111112 No 165 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=96.68 E-value=0.00042 Score=39.12 Aligned_cols=476 Identities=9% Similarity=-0.010 Sum_probs=200.5 Q ss_pred ccHHHHHHHHHHHHH--HhhhHH---HHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhh Q lcl|NC_011308. 11 DRLGTILSTKIDEYI--RSQNVS---LARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQ 85 (530) Q Consensus 11 ~~~~~~i~~~i~~~~--~~~~~~---~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~ 85 (530) |......+++...|. ..+|.. +.+.+.+|.--...-+ ....+. ...+.+.|+..+-+...++..++ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~------~~~~~~---~~~~~~~~~~dst~~~a~~~LAa 71 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRF------FVQDRN---RGEKRHNNILDNTGTRALRVLAA 71 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccc------cCCCCC---cchhcccccccccHHHHHHHHHH Confidence 333322223332221 122323 3344444432211000 000000 01123456777777777777777 Q ss_pred hhccc--cee-----eecCCcc---hHH-------HHHHHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEE Q lcl|NC_011308. 86 YLLAN--GID-----VKPTDHD---DQK-------LCYLIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTF 147 (530) Q Consensus 86 yl~G~--pv~-----~~~~~~~---de~-------~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~ 147 (530) .|++- |+. +...+.+ ... ....+...+ ..||....+++.++...+|.|..++-.|..+-+++ T Consensus 72 ~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf 151 (555) T protein:vir:10 72 GMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYH 151 (555) T ss_pred HHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEE Confidence 66642 211 2222211 111 222233333 35788889999999999999987766666677889 Q ss_pred EEecccceEEEEcCCCCceeEEEEEEEEeec-------cc--------ccccceEEEEEEEcCCceEEEeecCCcccchh Q lcl|NC_011308. 148 QTVDALQLLPVFDDYGTLQRIIRFYTEQRYS-------DA--------DNKFNSIGHADVWTDTEVWYYVQKDEGRSDEY 212 (530) Q Consensus 148 ~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~-------~~--------~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~ 212 (530) ..++..++++--|..+....++|.+...... .. ...+..-.+++|++. .|...+..... T Consensus 152 ~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~~~-- 225 (555) T protein:vir:10 152 HSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHA----IEPRADRDPSK-- 225 (555) T ss_pred EEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE----EeeccCcCcCC-- Confidence 9999999988888888888877765421110 00 000000112332221 01111100000 Q ss_pred hccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHHHHHHHHHHH Q lcl|NC_011308. 213 VLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVKSIIDDYDLM 287 (530) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~liDa~~~~ 287 (530) .+. ...+ +..+ ....+. .........+|...|++.++- +.+|.|--++..+-+-.++.+ T Consensus 226 -~~~--~~~p---~~s~-------~~~~~~---d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l 289 (555) T protein:vir:10 226 -RDD--RNMA---WKSV-------YFEPGA---DETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHE 289 (555) T ss_pred -CCc--cccc---eEEE-------EEEecc---CCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHH Confidence 000 0000 0000 000000 000112334566677776654 458999999999999999997 Q ss_pred HHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecC--CCCce--eEEEecCCHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011308. 288 NCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTK--GEGGL--DIQTVDIPYEARKAKMDIDELNIYRSGMG 363 (530) Q Consensus 288 ~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~--~~~~~--~~lt~~~~~~~~e~~ld~L~~~I~~~s~~ 363 (530) .-......+...+|.+.+....... ..++..+++..+. .+++. -.+....+.......++.++..|=..-+. T Consensus 290 ~~~~l~~~~~~~~pp~~v~~~~~~~----~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~ 365 (555) T protein:vir:10 290 QLRKAQAIDYKSNPPLQLPVSAKNQ----DISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYA 365 (555) T ss_pred HHHHHHHHHHHhcCceeeccccccc----cceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhc Confidence 7778888899898877763322111 1123333332222 22322 22233346667777788887777433322 Q ss_pred c---CCCcccccCCcHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCC-----CccccceeeEEeCCCCC Q lcl|NC_011308. 364 F---NSSAVGDGNATNVVIKSRYTLLAM-KAQKTEIALRKTLRWTADLVVEDIRRRGL-----GDYSSTDIKFDIEPYIL 434 (530) Q Consensus 364 p---~~~~~~~gn~SGvAik~~~~~l~~-ka~~ke~~f~~~l~~~~~~i~~~l~~~~~-----~~~d~~~i~i~f~~~~P 434 (530) . .++..+-...++..+..+-.-+.. ......+.-.+.|.=+++-++.++...+. ..+....|++.|...|= T Consensus 366 dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La 445 (555) T protein:vir:10 366 DLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLA 445 (555) T ss_pred chhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHH Confidence 1 133333334565555433222221 12222233333333333333344444332 13344567777766654 Q ss_pred CCH--HHHHHHHHHHHhcCC-----------CcHHHHHHh----CCC----CCCHHHHHHHHHHHHHH--HHHHHHhhhc Q lcl|NC_011308. 435 ANE--LDLAMIDKTEAETNQ-----------IQINNLLAI----APR----IGDEETLKAICDTLDLD--YEDVVKALED 491 (530) Q Consensus 435 ~n~--~e~a~~~~~~~~~g~-----------iS~et~l~~----~~~----vdd~~~e~~~~e~e~~e--~~~~~~~~~~ 491 (530) +.. .+...+.+.+...|. +.-..++.. ++. +.. ++|.+++.+.+.+ ....+.++.. T Consensus 446 ~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs-~eev~~~r~qr~~~~q~~~~a~~~~ 524 (555) T protein:vir:10 446 QAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVP-GNQVALIRKQRADQQQAAQQAALLN 524 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCC-HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 321 111111111111111 222222222 211 121 2333433333222 2222222322 Q ss_pred cccccCCccccCCCCCCCCCCccCcCCCCccc Q lcl|NC_011308. 492 QEVEELEPTVTPIIDPLTIEPQPEPLNIDPVI 523 (530) Q Consensus 492 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (530) +.......=++..++. .+....--...++-| T Consensus 525 q~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 525 QGADTAAKLGSVDTSK-QNALTDVTRAFSGYT 555 (555) T ss_pred HHHHHHHHhcccccCc-chhHHHHHhhhccCC Confidence 2111110000000000 000000001111112 No 166 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=96.61 E-value=0.00047 Score=38.84 Aligned_cols=416 Identities=10% Similarity=-0.015 Sum_probs=192.0 Q ss_pred CCcccccC-CcccHHHHHHHH---HH---H----HHHhh-hHHHH-HHHHHHhcccchhhhcccccccccccccccccCC Q lcl|NC_011308. 1 MTNTLLTT-APDRLGTILSTK---ID---E----YIRSQ-NVSLA-RVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNAS 67 (530) Q Consensus 1 ~~~~~~~~-~~~~~~~~i~~~---i~---~----~~~~~-~~~~~-~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~ 67 (530) |..++... .|.....+-+.. +. + |...- -+.++ ..++.--.|.-...... + ++-.. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L---~-------~~m~e- 69 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQAQAEL---F-------MDMEE- 69 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHHHHHH---H-------HHHHh- Confidence 66665322 121111111110 00 0 00000 01111 12222222211100000 0 00000 Q ss_pred cceeecCchhhHHhhhhhhhcccceeeecCCc---chHHHHHHHHHHhhc--cHHHHHHHHHHHHhhcCeEE-EEEEecC Q lcl|NC_011308. 68 NIKISHGFFAELVDQKTQYLLANGIDVKPTDH---DDQKLCYLIEEYYNE--EFQSAIQELVEGSTIKGYEG-IFARTTS 141 (530) Q Consensus 68 n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~~---~de~~~~~l~~~~~~--~~~~~~~e~~~~~~~~G~a~-~~~y~d~ 141 (530) ......-.+.+...-++|.+..+...+. .++...+++++++.+ +|.+++..+ .++.-+|.+. +++|.-. T Consensus 70 ----~D~~i~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~i~~~-lda~~~G~s~~Ei~w~~~ 144 (528) T protein:vir:10 70 ----RDAHLFAEMSKRKRAVLGLDWTIEPPRNASAAEKADAEYLHELLLDLEGIEDLMLDC-MDGVGHGYSAIELDWSLQ 144 (528) T ss_pred ----hChHHHHHHHHHHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhCCccHHHHHHHH-HhhhhhcceeEEEEEeec Confidence 1244566777777888999988876432 345677778888754 577766655 4567788855 6777655 Q ss_pred CCceEE---EEecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccc Q lcl|NC_011308. 142 EDKLTF---QTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTV 218 (530) Q Consensus 142 ~g~~~~---~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~ 218 (530) .|.+.. ...+|..+ .|+..+... ++..++.... . T Consensus 145 ~g~~~~~~~~~r~~~~f--~~~~~~~~~----------------------------------l~~~~~~~~g-~------ 181 (528) T protein:vir:10 145 GREWLPQAFDHRPQSWF--QLNPDDQDE----------------------------------LRLRDNSIAG-E------ 181 (528) T ss_pred CCceeEEEeeeecccce--eeccCCCcE----------------------------------EeccCCCCCc-e------ Confidence 565443 23333321 122222111 1100000000 0 Q ss_pred cccccceeeeeecccccceecccccccccccccccccCCccceEEe--eCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 219 NPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDIL--YNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQ 296 (530) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~--~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~ 296 (530) .-.+++.|=+++- ..+..|.|.+..+-...--=+..+.+.+..++ T Consensus 182 ---------------------------------~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E 228 (528) T protein:vir:10 182 ---------------------------------VLQPFGWIMHKPRSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLE 228 (528) T ss_pred ---------------------------------eecCCCeEEEeecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHH Confidence 0001111111111 12346788888766666666778889999999 Q ss_pred HhccceeeeecCCCCchhh------HHHHHhhCcceecCCCCceeEEEec-CCHHHHHHHHHHHHHHHHHHhcccCCCcc Q lcl|NC_011308. 297 DMAEAIYVVRGGTNSPVDE------IKKNIQSKKIIQTKGEGGLDIQTVD-IPYEARKAKMDIDELNIYRSGMGFNSSAV 369 (530) Q Consensus 297 ~~~~~~lvl~g~~~~~~~~------~~~~~~~~~~i~~~~~~~~~~lt~~-~~~~~~e~~ld~L~~~I~~~s~~p~~~~~ 369 (530) .|..|+.+.+=..+...++ ...++....+..++.+..+++++.. ...+.++..++...+.|-..--+-.++.. T Consensus 229 ~yG~P~~igky~~~a~~~ek~~L~~al~~i~~~~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~ 308 (528) T protein:vir:10 229 IYGLPIRLGKYPPGTPDEEKVTLLRAVTGLGHAAAGIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAILGGTLTSQ 308 (528) T ss_pred HcCCCeEEEecCCCCCHHHHHHHHHHHHHHhhCcEEEecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHHhhhhhhcc Confidence 9999999987322222221 2234556667778999999999854 45566788899888888887766555433 Q ss_pred c----cc-CC-cHHHHHHHHhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCCccc-cceeeEEeCCCCCCCHHHHH Q lcl|NC_011308. 370 G----DG-NA-TNVVIKSRYTLLAMKAQKTEIALRKTLRW-TADLVVEDIRRRGLGDYS-STDIKFDIEPYILANELDLA 441 (530) Q Consensus 370 ~----~g-n~-SGvAik~~~~~l~~ka~~ke~~f~~~l~~-~~~~i~~~l~~~~~~~~d-~~~i~i~f~~~~P~n~~e~a 441 (530) . .| ++ +.+.-+.+..-...-|. .....|.+ ++. .++........+ ..-..+.|...-+.|..+.| T Consensus 309 ~~~g~~gS~Alg~vh~~v~~di~~aDa~----~i~~tln~~li~---~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a 381 (528) T protein:vir:10 309 TSESGGGAYALGQVHNEVRHDLLAADAR----QLAATLSRDLLW---PLLVLNRSGNLDARRAPRLVFDLKDRADLAAMA 381 (528) T ss_pred ccccccchhhhHHHHHHHHHHHHHHHHH----HHHHHHHHHHHH---HHHHhCCCCCCCccccceEEecCCCcccHHHHH Confidence 2 12 11 22333332222222333 33333322 222 233333333322 23467889888899999999 Q ss_pred HHHHHHHhcCC-CcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHH-hhhccccccCCccccCCCCCCCCCCccCcCCC Q lcl|NC_011308. 442 MIDKTEAETNQ-IQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVK-ALEDQEVEELEPTVTPIIDPLTIEPQPEPLNI 519 (530) Q Consensus 442 ~~~~~~~~~g~-iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (530) +.+..+...|+ +|.+.+.+.+++ .-++.. ++... ....+... ....+............+... T Consensus 382 ~~~~~L~~~G~~i~~~~i~e~~gi-p~p~~~-----------e~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~ 446 (528) T protein:vir:10 382 TSLPPLVKLGVQVPVNWVQEQLGI-PLPANG-----------EAVLGDQAGAGIAQ---LSRRPGPRIAALAQVIGPRYR 446 (528) T ss_pred HHHHHHHhCCCCCCHHHHHHHhCC-CCCCCC-----------cccccCCCcccccc---cCccccccccccccccccccc Confidence 99999999997 898888888864 111100 00000 00000000 000000000000000000000 Q ss_pred Cccccc-------------------ccCCC Q lcl|NC_011308. 520 DPVIEE-------------------EPVQE 530 (530) Q Consensus 520 ~~~~~~-------------------~~~~~ 530 (530) ++.... +|+.+ T Consensus 447 ~~~~~d~~~~~~~~~~~~~~~~~~l~~i~~ 476 (528) T protein:vir:10 447 DQEALDQVLASLPAQDMQNQADSLVAPLLD 476 (528) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000 01100 No 167 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=96.55 E-value=0.00052 Score=38.60 Aligned_cols=383 Identities=12% Similarity=0.034 Sum_probs=183.1 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhc----ccchhhhcccccccccccccccccCCcceeecCch Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYN----QDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFF 76 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~----g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~ 76 (530) |..-.+..+.-+... .. .....|.- .+..|+.+... ..-...++. ...... T Consensus 1 v~~~~l~~e~at~~~--------------~~--d~~~~~~~~l~~~~~~il~~a~~---g~~~~y~~l------~~D~~i 55 (488) T protein:vir:99 1 MEKPALGREIATSGD--------------GR--DITRPFISGLQVPNDSILQRRGG---NDLRVYEEI------LSDAQV 55 (488) T ss_pred CCccchhHHHHHHHh--------------hh--hhhccccCCCCCCChHHHHhhcc---CCHHHHHHH------hhChHH Confidence 111100000000000 00 00011111 11223222110 000000000 013456 Q ss_pred hhHHhhhhhhhcccceeeecCCc--chHHHHHHHHHHhhc-cHHHHHHHHHHHHhhcCeEE-EEEEecCCCceEEE---E Q lcl|NC_011308. 77 AELVDQKTQYLLANGIDVKPTDH--DDQKLCYLIEEYYNE-EFQSAIQELVEGSTIKGYEG-IFARTTSEDKLTFQ---T 149 (530) Q Consensus 77 k~Ivd~~~~yl~G~pv~~~~~~~--~de~~~~~l~~~~~~-~~~~~~~e~~~~~~~~G~a~-~~~y~d~~g~~~~~---~ 149 (530) .-.+.+...-++|.+..+.+.+. .+++..+++++.++. +|.+++..+. ++.-+|.+. +++|...+|.+... . T Consensus 56 ~s~l~~rk~av~~~~w~i~p~~~~~~~~~~ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~ 134 (488) T protein:vir:99 56 KTVWGQRQLAVVSREWKVEAGGDRPIDQAAAEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKV 134 (488) T ss_pred HHHHHHHHHHHhcCCceEEcCCCChHHHHHHHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeee Confidence 66778888889999999975432 345677888888765 6777777775 577789855 67776556665433 3 Q ss_pred ecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeee Q lcl|NC_011308. 150 VDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAV 229 (530) Q Consensus 150 ~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (530) .+|..+ .||..+.+. ++..++..... T Consensus 135 r~~~~f--~~d~~~~l~----------------------------------~~~~~~~~~g~------------------ 160 (488) T protein:vir:99 135 RNRRRF--RYDQDGGLR----------------------------------LLTPNNMFEGE------------------ 160 (488) T ss_pred ecccce--eecCCCceE----------------------------------EeccCCCCCcc------------------ Confidence 344322 233322211 00000000000 Q ss_pred ecccccceecccccccccccccccccCCccceEEe--eCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeec Q lcl|NC_011308. 230 ADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDIL--YNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRG 307 (530) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~--~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g 307 (530) ..+.+++.|-+++- ..|..|.|.+..+-...--=+..+.+.+..++.|.-|+++.+- T Consensus 161 ---------------------~lp~~~~~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky 219 (488) T protein:vir:99 161 ---------------------PCPAPYFWHFSTGADNDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRY 219 (488) T ss_pred ---------------------ccccCceEEEEeecCCCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeec Confidence 00001111111111 1234578888876655555566788889999999999999874 Q ss_pred CC-CCchhh------HHHHHhhCcceecCCCCceeEEEec-CCHHHHHHHHHHHHHHHHHHhcccCCCcccc-c-CCcH- Q lcl|NC_011308. 308 GT-NSPVDE------IKKNIQSKKIIQTKGEGGLDIQTVD-IPYEARKAKMDIDELNIYRSGMGFNSSAVGD-G-NATN- 376 (530) Q Consensus 308 ~~-~~~~~~------~~~~~~~~~~i~~~~~~~~~~lt~~-~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-g-n~SG- 376 (530) .. +.+.++ ...++....+..++.+..+++++.. ...+.++..+++..+.|-..--+-.++.++. | .+.| T Consensus 220 ~~~~a~~~ek~~l~~av~~~~~~~~~viP~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLGqtlts~~~~Gs~a~~~ 299 (488) T protein:vir:99 220 DDKTATPEDKAKLLAALHAIQTDSAIIMPAGMQAELLEAGRSGTADYKTLHDTMDATIAKVGLGQVASTQGTPGRLGNDD 299 (488) T ss_pred CCCCCCHHHHHHHHHHHHHHhcCcEEEecCCceeEEeecCCCChHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH Confidence 32 111111 2335566677778999999999864 3445578888888888876654433433322 2 2223 Q ss_pred HHHHHHHhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhc-CC-C Q lcl|NC_011308. 377 VVIKSRYTLLAMKAQKTEIALRKTLR-WTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAET-NQ-I 453 (530) Q Consensus 377 vAik~~~~~l~~ka~~ke~~f~~~l~-~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~-g~-i 453 (530) +.-+.+..-+.. -.+.+...|. +++..+ +..... .. .-..+.|...-|.|.++.|+.+..+... |+ + T Consensus 300 vh~~v~~d~~~a----Da~~i~~tln~~li~~l---~~~N~~-~~--~~p~~~~~~~e~edl~~~a~~~~~l~~~~G~~i 369 (488) T protein:vir:99 300 LQADVRLDLVKA----DADLICESFNLGPARWL---TEWNFP-GA--QPPRVYRVIEEPEDITAKAERDEKVFRMSGFRP 369 (488) T ss_pred HHHHHHHHHHHH----HHHHHHHHHHHHHHHHH---HHhCcC-Cc--CCceeEecCCCcccHHHHHHHHHHHHhhcCCCC Confidence 333222222222 2233334442 233332 223221 11 2356788888889999999998888775 75 7 Q ss_pred cHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 454 QINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 454 S~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) +.+.+.+.+++=. ++.. ++... +.+......+...++|... T Consensus 370 ~~~~i~e~~Gip~-~~~~-----------~~~~~------------------------~~~~~~~~~~~~~~~~~~~ 410 (488) T protein:vir:99 370 TRGYVQETYGVEV-ESTQ-----------AEATA------------------------PTPSTEFAEGDQPSDPAAA 410 (488) T ss_pred CHHHHHHHcCCCC-cccc-----------ccccc------------------------CCCcccCCCCCCCCCchHH Confidence 8777777775411 1100 00000 0000000111111112111 No 168 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=96.51 E-value=0.00056 Score=38.41 Aligned_cols=364 Identities=10% Similarity=0.015 Sum_probs=155.8 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHH----HHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhh Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLA----RVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQY 86 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~----~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~y 86 (530) |-+... . .+...+..... .....+..+.. .+... .+.+-+.+.-....|+..++= T Consensus 1 Mg~~~~---~--~~~k~~~~~~~~~~~~~~~~~~~~~~------------~~~~v----~~~~~l~~~~v~~~i~~ia~~ 59 (383) T protein:vir:10 1 MGLLTP---K--NFSKRNAKNMVYPSNPAFFTTTVGGM------------QLSYV----SALSALQNTNVYSVINRIASD 59 (383) T ss_pred CCcccc---c--ccccccccccccccchhhhhhhccCc------------ccccc----chhHhhcchHHHHHHHHHHHh Confidence 221110 0 00000000000 00000000000 00000 000001111223345555555 Q ss_pred hcccceeeecCCcchHHHHHHHHHHhhc-cHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEcCCCCc Q lcl|NC_011308. 87 LLANGIDVKPTDHDDQKLCYLIEEYYNE-EFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFDDYGTL 165 (530) Q Consensus 87 l~G~pv~~~~~~~~de~~~~~l~~~~~~-~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~~~~ 165 (530) +.+-|+++... .....|++-... ........+..+...+|.||.++..+. +.+..++|..+-++.+..+ T Consensus 60 ia~~~~~~~~~-----~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~~---~~~~p~~~~~v~~~~~~~~-- 129 (383) T protein:vir:10 60 VSSAHFKTENT-----ATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---LEHIPNSDVQINYLPGNMG-- 129 (383) T ss_pred hccCceeeccc-----chhhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---eeEeecCcceEEEEEcCCc-- Confidence 55667765321 122223221111 223445567777888999998775442 3334444544444333221 Q ss_pred eeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccccc Q lcl|NC_011308. 166 QRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEH 245 (530) Q Consensus 166 ~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 245 (530) .+|.+..... . ....|.++.+.|++..... T Consensus 130 ----~~~~~~~~~~----~----~~~~~~~~evih~r~~~~~-------------------------------------- 159 (383) T protein:vir:10 130 ----IVYTVLESND----R----PKMVLRQDQMLHFRLMPDP-------------------------------------- 159 (383) T ss_pred ----eEEEEEEcCC----c----eEEEEcccceEEeccCCCC-------------------------------------- Confidence 1111111000 0 1112344444444311100 Q ss_pred ccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeee--cCCCCc--hhhHHHHHh Q lcl|NC_011308. 246 EGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVR--GGTNSP--VDEIKKNIQ 321 (530) Q Consensus 246 ~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~--g~~~~~--~~~~~~~~~ 321 (530) ++ +--.|.|.++.....|+....+..-.++.+.....|-.+++ |...+. .+.+...++ T Consensus 160 ---------~~---------~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~~ 221 (383) T protein:vir:10 160 ---------QY---------RYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFE 221 (383) T ss_pred ---------cc---------cccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHH Confidence 00 01147777887777777766666666666666666655544 322111 112222221 Q ss_pred -------hCcceecCCCCceeEEEecCCHHH-HHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHH Q lcl|NC_011308. 322 -------SKKIIQTKGEGGLDIQTVDIPYEA-RKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQ 391 (530) Q Consensus 322 -------~~~~i~~~~~~~~~~lt~~~~~~~-~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~ 391 (530) .++++.++++.+++-+..+..... +....+...+.|...-.+|. ++-...++.++..++ T Consensus 222 ~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~e----------- 290 (383) T protein:vir:10 222 KANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNID----------- 290 (383) T ss_pred HHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHH----------- Confidence 234566666666655554433333 34566777888988888885 222112222222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHH Q lcl|NC_011308. 392 KTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETL 471 (530) Q Consensus 392 ~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e 471 (530) .....|...|+-+++.|...++.+-.+ ..+++.++.-+..|..+.++....+..+|+++...+++.++.-.-+ T Consensus 291 q~~~~~~~~l~P~~~~ie~~l~~~l~~----~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~--- 363 (383) T protein:vir:10 291 QIKATYLANLNSYVNPIVDELRLKMNA----PDLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGFL--- 363 (383) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhCC----ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccc--- Confidence 111223334444444444444332211 2467777777788999999999999999999999888876531100 Q ss_pred HHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCc Q lcl|NC_011308. 472 KAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDP 521 (530) Q Consensus 472 ~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (530) .++.+.... +..+.+++.+- T Consensus 364 -------------------~~d~~~~~~-----------~~~~~~gGd~e 383 (383) T protein:vir:10 364 -------------------PDNLPEFKP-----------LTNETKGGDDK 383 (383) T ss_pred -------------------CCcccccCC-----------CcccCCCCCCC Confidence 000000000 00000000000 No 169 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=96.42 E-value=0.00065 Score=38.07 Aligned_cols=355 Identities=10% Similarity=0.012 Sum_probs=154.0 Q ss_pred HHHHHhcccchhhhcccccccccccc-c----------cc-ccC----Ccceee--cCchhhHHhhhhhhhcccceeeec Q lcl|NC_011308. 35 VGQRYYNQDNDIENTRIMWMNDHGDI-V----------ED-DNA----SNIKIS--HGFFAELVDQKTQYLLANGIDVKP 96 (530) Q Consensus 35 ~~~~YY~g~~~I~~r~~~~~~~~~~~-~----------~~-~~~----~n~ki~--~n~~k~Ivd~~~~yl~G~pv~~~~ 96 (530) .+...+ +.+.|........... . .. ... .+.+.+ +.-....|+..++=+.+-|+++.- T Consensus 1 m~m~~f----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~ 76 (392) T protein:vir:10 1 MILPIL----NFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEK 76 (392) T ss_pred Ccchhh----hhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeecc Confidence 111111 1111111100000000 0 00 000 000000 111223444444444455655432 Q ss_pred CCcchHHHHHHHHHHhhcc---HHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCceeEEEEE Q lcl|NC_011308. 97 TDHDDQKLCYLIEEYYNEE---FQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQRIIRFY 172 (530) Q Consensus 97 ~~~~de~~~~~l~~~~~~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~y 172 (530) .. . ...+.+= |. -......+..+...+|.||.++-++..|++ .+..++|..+-++.+..+... +| T Consensus 77 ~~--~---~~l~~~P--N~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~----~y 145 (392) T protein:vir:10 77 KK--N---QGIIDNP--STNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGM----YY 145 (392) T ss_pred ch--h---hhHhhcC--CCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceE----EE Confidence 11 1 1111111 21 123445677788999999999999999986 577889998877766543221 22 Q ss_pred EEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccc Q lcl|NC_011308. 173 TEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLG 252 (530) Q Consensus 173 ~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 252 (530) .+..... .......|..+.+.|++..... T Consensus 146 ~~~~~~~------~~~~~~~~~~~eiih~~~~~~~--------------------------------------------- 174 (392) T protein:vir:10 146 NITFDDP------KIEPILQAPQSDLIHMKLLSID--------------------------------------------- 174 (392) T ss_pred EEEecCc------ccceeEEEccccEEEecCCCCC--------------------------------------------- Confidence 2211110 0111223445555554321100 Q ss_pred cccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeee--ecCCCCchhhHHH----HH----hh Q lcl|NC_011308. 253 RSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVV--RGGTNSPVDEIKK----NI----QS 322 (530) Q Consensus 253 ~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl--~g~~~~~~~~~~~----~~----~~ 322 (530) ..-.|.|-+......|+....+..-..+.+.-.+.|-.++ .+... ..++.+. .. .. T Consensus 175 -------------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~-~~~~~~~~~~~~~~~~~~~ 240 (392) T protein:vir:10 175 -------------GGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGL-LSDKDKASRSRSFMKRSRS 240 (392) T ss_pred -------------CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC-chHHHHHHHHHHHhccccC Confidence 0114677777666666554444444444445555554444 33221 1122121 11 22 Q ss_pred CcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHH Q lcl|NC_011308. 323 KKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKT 400 (530) Q Consensus 323 ~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~ 400 (530) ++++.++++.+++-+.....+.......+.+.+.|...=.+|. +++... +.|.. ...+..+... T Consensus 241 g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~-~~~~~-------------~~~~~f~~~~ 306 (392) T protein:vir:10 241 GGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGD-QQSSI-------------QQISGMYASA 306 (392) T ss_pred CCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-cccHH-------------HHHHHHHHHH Confidence 3556677666666665444444455666777788888877774 222111 12221 1112244445 Q ss_pred HHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhC---CCCCCHHHHHHHHHH Q lcl|NC_011308. 401 LRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIA---PRIGDEETLKAICDT 477 (530) Q Consensus 401 l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~---~~vdd~~~e~~~~e~ 477 (530) |.-.++.|...++.+-..+ +.+.+..-+-.+..+.+..+..+..+|++++..+.+.+ ++..| |+.+ T Consensus 307 l~P~~~~ie~~l~~~L~~~-----~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~---e~r~--- 375 (392) T protein:vir:10 307 LNRYLRPAISELEYKLSDH-----ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK---DLPA--- 375 (392) T ss_pred HHHHHHHHHHHHHHhcccc-----ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcc---ccch--- Confidence 5555555544444332211 11111112223455667777788899999998877654 44322 1100 Q ss_pred HHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCc Q lcl|NC_011308. 478 LDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEP 516 (530) Q Consensus 478 e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (530) .+.++ +..+.++++ |.| T Consensus 376 -----~e~l~---------------~~~~Gd~~~--p~p 392 (392) T protein:vir:10 376 -----PENTN---------------KKTTGQSNE--PVP 392 (392) T ss_pred -----hcCCC---------------CCCCCCCCC--CCC Confidence 00111 111111111 111 No 170 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=96.42 E-value=0.00065 Score=38.07 Aligned_cols=355 Identities=10% Similarity=0.012 Sum_probs=154.0 Q ss_pred HHHHHhcccchhhhcccccccccccc-c----------cc-ccC----Ccceee--cCchhhHHhhhhhhhcccceeeec Q lcl|NC_011308. 35 VGQRYYNQDNDIENTRIMWMNDHGDI-V----------ED-DNA----SNIKIS--HGFFAELVDQKTQYLLANGIDVKP 96 (530) Q Consensus 35 ~~~~YY~g~~~I~~r~~~~~~~~~~~-~----------~~-~~~----~n~ki~--~n~~k~Ivd~~~~yl~G~pv~~~~ 96 (530) .+...+ +.+.|........... . .. ... .+.+.+ +.-....|+..++=+.+-|+++.- T Consensus 1 m~m~~f----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~ 76 (392) T protein:vir:39 1 MILPIL----NFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEK 76 (392) T ss_pred Ccchhh----hhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeecc Confidence 111111 1111111100000000 0 00 000 000000 111223444444444455655432 Q ss_pred CCcchHHHHHHHHHHhhcc---HHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCceeEEEEE Q lcl|NC_011308. 97 TDHDDQKLCYLIEEYYNEE---FQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQRIIRFY 172 (530) Q Consensus 97 ~~~~de~~~~~l~~~~~~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~y 172 (530) .. . ...+.+= |. -......+..+...+|.||.++-++..|++ .+..++|..+-++.+..+... +| T Consensus 77 ~~--~---~~l~~~P--N~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~----~y 145 (392) T protein:vir:39 77 KK--N---QGIIDNP--STNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGM----YY 145 (392) T ss_pred ch--h---hhHhhcC--CCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceE----EE Confidence 11 1 1111111 21 123445677788999999999999999986 577889998877766543221 22 Q ss_pred EEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccc Q lcl|NC_011308. 173 TEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLG 252 (530) Q Consensus 173 ~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 252 (530) .+..... .......|..+.+.|++..... T Consensus 146 ~~~~~~~------~~~~~~~~~~~eiih~~~~~~~--------------------------------------------- 174 (392) T protein:vir:39 146 NITFDDP------KIEPILQAPQSDLIHMKLLSID--------------------------------------------- 174 (392) T ss_pred EEEecCc------ccceeEEEccccEEEecCCCCC--------------------------------------------- Confidence 2211110 0111223445555554321100 Q ss_pred cccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeee--ecCCCCchhhHHH----HH----hh Q lcl|NC_011308. 253 RSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVV--RGGTNSPVDEIKK----NI----QS 322 (530) Q Consensus 253 ~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl--~g~~~~~~~~~~~----~~----~~ 322 (530) ..-.|.|-+......|+....+..-..+.+.-.+.|-.++ .+... ..++.+. .. .. T Consensus 175 -------------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~-~~~~~~~~~~~~~~~~~~~ 240 (392) T protein:vir:39 175 -------------GGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGL-LSDKDKASRSRSFMKRSRS 240 (392) T ss_pred -------------CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC-chHHHHHHHHHHHhccccC Confidence 0114677777666666554444444444445555554444 33221 1122121 11 22 Q ss_pred CcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHH Q lcl|NC_011308. 323 KKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKT 400 (530) Q Consensus 323 ~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~ 400 (530) ++++.++++.+++-+.....+.......+.+.+.|...=.+|. +++... +.|.. ...+..+... T Consensus 241 g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~-~~~~~-------------~~~~~f~~~~ 306 (392) T protein:vir:39 241 GGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGD-QQSSI-------------QQISGMYASA 306 (392) T ss_pred CCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-cccHH-------------HHHHHHHHHH Confidence 3556677666666665444444455666777788888877774 222111 12221 1112244445 Q ss_pred HHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhC---CCCCCHHHHHHHHHH Q lcl|NC_011308. 401 LRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIA---PRIGDEETLKAICDT 477 (530) Q Consensus 401 l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~---~~vdd~~~e~~~~e~ 477 (530) |.-.++.|...++.+-..+ +.+.+..-+-.+..+.+..+..+..+|++++..+.+.+ ++..| |+.+ T Consensus 307 l~P~~~~ie~~l~~~L~~~-----~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~---e~r~--- 375 (392) T protein:vir:39 307 LNRYLRPAISELEYKLSDH-----ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK---DLPA--- 375 (392) T ss_pred HHHHHHHHHHHHHHhcccc-----ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcc---ccch--- Confidence 5555555544444332211 11111112223455667777788899999998877654 44322 1100 Q ss_pred HHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCc Q lcl|NC_011308. 478 LDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEP 516 (530) Q Consensus 478 e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (530) .+.++ +..+.++++ |.| T Consensus 376 -----~e~l~---------------~~~~Gd~~~--p~p 392 (392) T protein:vir:39 376 -----PENTN---------------KKTTGQSNE--PVP 392 (392) T ss_pred -----hcCCC---------------CCCCCCCCC--CCC Confidence 00111 111111111 111 No 171 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=96.41 E-value=0.00065 Score=38.06 Aligned_cols=472 Identities=7% Similarity=-0.023 Sum_probs=185.3 Q ss_pred HHHHHHHHHHHHHHhh--hHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 13 LGTILSTKIDEYIRSQ--NVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 13 ~~~~i~~~i~~~~~~~--~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) ..+-+++..+.+++.. -..+.+.+.+|..-.- ... .+. .....+.++..+-+...++..++.|+|- T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~--~~~-------~~~---~~~~~~~~~~dst~~~a~~~Laa~l~~~ 68 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTLPYI--LTD-------EGH---VQGGYLPTPWQSVGSKGVNVLASKLMLS 68 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--cCC-------CCC---cccccccccccccHHHHHHHHHHHHHHh Confidence 3344666666665422 1234455555543321 100 000 0111233566666777777777666642 Q ss_pred --cee-----eecCCcc------hHH-----------HHHHHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCce Q lcl|NC_011308. 91 --GID-----VKPTDHD------DQK-----------LCYLIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTSEDKL 145 (530) Q Consensus 91 --pv~-----~~~~~~~------de~-----------~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~ 145 (530) |+. +...+.+ +.. ....+...+ ..||....+++.++...+|.+. +|.++++ + T Consensus 69 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--ly~~~~~-~ 145 (555) T protein:vir:17 69 LFPVNTSFFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNAL--LYQGKKN-L 145 (555) T ss_pred hcCCCCcccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEE--EEecCCc-e Confidence 221 2222111 111 112233333 3678889999999999999986 5667664 3 Q ss_pred EEEEecccceEEEEcCCCCceeEEEEEEEEeecccccccc-----eEEEEEEEcCCceEEEeecCCcccchhhccccccc Q lcl|NC_011308. 146 TFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFN-----SIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNP 220 (530) Q Consensus 146 ~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~-----~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~ 220 (530) ++++-.+.++-.|..+....++|.+......-...-+. .+....-..++....... .......... T Consensus 146 --~~~pl~~y~v~~d~~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~-------~~~~~~~~~~ 216 (555) T protein:vir:17 146 --KLYPLDRFVVSRDGEGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVT-------APGGRDKGKS 216 (555) T ss_pred --eEEEcCeEEEeeCCCcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhh-------hhcccccCCC Confidence 44444455555677888777777665322110000000 000000000000000000 0000000000 Q ss_pred cccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 221 NPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVKSIIDDYDLMNCFLSNNL 295 (530) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~liDa~~~~~S~~~n~~ 295 (530) .....++.+.-.......................+|...|++.++- +.+|.|--++..+-+..++.+.-...... T Consensus 217 ~~~~v~t~~~~~~~~~~~~~e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~ 296 (555) T protein:vir:17 217 NDALVYTYVCRKDGQVKWHQECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGS 296 (555) T ss_pred cceeEeecccccCCeeEEEEecCceeccccccccCcccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 0000000000000000000000000000112345777788887664 35899999999999999999988899999 Q ss_pred HHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHHHHHHHHHhcccCCCcccccC Q lcl|NC_011308. 296 QDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDIDELNIYRSGMGFNSSAVGDGN 373 (530) Q Consensus 296 ~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn 373 (530) ....+|.+.+.-.+.....++. ..+.+.+..+..+++..+... .+.......++.++..|-..-+. ++..+.+. T Consensus 297 ~~~~~pp~lv~~~g~~~~~~l~--~~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~--~~~~d~~r 372 (555) T protein:vir:17 297 AASAKVVFMVSPSATTKPQNLA--LAANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLM--LQVRQSER 372 (555) T ss_pred HHHhCCceeeccccccCcceee--cCCCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhh--cCCCCccc Confidence 9999998776322222211111 122344444445566666544 34556666677776666433221 11122233 Q ss_pred CcHHHHHHHHhhHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHhcCCC-ccccceeeEEeCCCCCCCHHHHHHH- Q lcl|NC_011308. 374 ATNVVIKSRYTLLAMKAQKTEIALRKTL--------RWTADLVVEDIRRRGLG-DYSSTDIKFDIEPYILANELDLAMI- 443 (530) Q Consensus 374 ~SGvAik~~~~~l~~ka~~ke~~f~~~l--------~~~~~~i~~~l~~~~~~-~~d~~~i~i~f~~~~P~n~~e~a~~- 443 (530) .++..+..+ +..++..++..+ .=+++-++.++...+.- ......+.+.+.-.+.. ....+++ T Consensus 373 ~TAtEV~~r-------~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~~~l~~-l~r~~~~~ 444 (555) T protein:vir:17 373 TTATEVQAT-------VQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVVAGLWG-VGRGQDKQ 444 (555) T ss_pred chHHHHHHH-------HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhccceeehHHH-HHHHHHHH Confidence 455444333 333333333332 22222233334333321 11111223333222211 1111111 Q ss_pred -----HHHHHhc-C------CCcHHHHHH----hCC-----CCCCHHHHHHHHHHHHHHHHHHHHhhhcccc-------- Q lcl|NC_011308. 444 -----DKTEAET-N------QIQINNLLA----IAP-----RIGDEETLKAICDTLDLDYEDVVKALEDQEV-------- 494 (530) Q Consensus 444 -----~~~~~~~-g------~iS~et~l~----~~~-----~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~-------- 494 (530) +..+.+. | .+.-..++. .++ .+.. +++++++.+.+.+.......+..+.. T Consensus 445 ~l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs-~eev~~~rq~~~~~~~q~~~~~qa~~~~~~~~~~ 523 (555) T protein:vir:17 445 QLMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGIDTLQLINS-PETMKQLGDQQKQDMVQASLINQAGQLAKTPMAE 523 (555) T ss_pred HHHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCChhhhcCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh Confidence 1111111 1 122222222 222 1222 34444443332222221111111100 Q ss_pred ---ccCCccccCCC--CCCCCCCccCcCCCCcc Q lcl|NC_011308. 495 ---EELEPTVTPII--DPLTIEPQPEPLNIDPV 522 (530) Q Consensus 495 ---~~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 522 (530) .....+....+ ..+...-.. |++--+. T Consensus 524 ~~~~~~~~~~~~a~~~~~a~~~~~~-~~~~~~~ 555 (555) T protein:vir:17 524 QAMQLIQQQQEGAQDAGAAESETSS-AEAQAGA 555 (555) T ss_pred hHHhccccchhhhhHHHHHHhhcCC-cccccCC Confidence 00000000000 000111111 1121111 No 172 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=96.41 E-value=0.00066 Score=38.03 Aligned_cols=384 Identities=11% Similarity=0.012 Sum_probs=153.1 Q ss_pred HHHhcccchhhhcc-ccccccccccc--cccc--CCcceeecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHH Q lcl|NC_011308. 37 QRYYNQDNDIENTR-IMWMNDHGDIV--EDDN--ASNIKISHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEY 111 (530) Q Consensus 37 ~~YY~g~~~I~~r~-~~~~~~~~~~~--~~~~--~~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~ 111 (530) .+.|++........ .......+... .... ..-.|++. .-..|+..++=+.+-|+++.-...+.......+..+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~Al~~~~--V~~cv~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~l 78 (417) T protein:vir:38 1 MKLFRGLATEVDPHWADHLLDSGVIPSFRGGYLGISALRNSD--VLTAVSIVSGDVSRFPLVITDSSTDEVIDLANIEYL 78 (417) T ss_pred CccccccccCCCccchhhhcccccccccCCceechhhcccHH--HHHHHHHHHHhhccCeeEEEEcCCcceeccchHHHH Confidence 22233322111100 00000000000 0000 00112222 113466666667677887632221110001112222 Q ss_pred hh---cc---HHHHHHHHHHHHhhcCeEEEEEEecCCCc-eE-EEEecccceEEEEcCCCCceeEEEEEEEEeecccccc Q lcl|NC_011308. 112 YN---EE---FQSAIQELVEGSTIKGYEGIFARTTSEDK-LT-FQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNK 183 (530) Q Consensus 112 ~~---~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~-~~-~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~ 183 (530) +. |. .......+......+|.||.++-++..|. +. +..++|..+-+..++.+... |.+..... T Consensus 79 L~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~~~~~-----y~~~~~~~---- 149 (417) T protein:vir:38 79 MNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDPDNII-----YRFTPYNS---- 149 (417) T ss_pred HhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCCCeEE-----EEEEEcCC---- Confidence 21 21 22344566778889999999998887653 33 55688888877655444321 22111110 Q ss_pred cceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEE Q lcl|NC_011308. 184 FNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDI 263 (530) Q Consensus 184 ~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~ 263 (530) ....++..+.+.|++... + T Consensus 150 ----~~~~~~~~~dviH~r~~~--------------------------------------------------~------- 168 (417) T protein:vir:38 150 ----SMQKVCGFEDVIHWKFFS--------------------------------------------------Y------- 168 (417) T ss_pred ----cEEEEecCcceEEecCCC--------------------------------------------------C------- Confidence 011223344444442100 0 Q ss_pred eeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecC-CCCc--hhhHHHHH-------hhCcceecCCCCc Q lcl|NC_011308. 264 LYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGG-TNSP--VDEIKKNI-------QSKKIIQTKGEGG 333 (530) Q Consensus 264 ~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~-~~~~--~~~~~~~~-------~~~~~i~~~~~~~ 333 (530) +.-.|.|.++.+...|..-..+..-..+.+.-.+.|-.+++-. ..++ .+.++..+ ..++++.+++|.+ T Consensus 169 --d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~ 246 (417) T protein:vir:38 169 --DTIMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKESRLSAEARQKIREDFERAQAGADAGSPIIVDATMD 246 (417) T ss_pred --CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceeccCCce Confidence 0013666666555555444444443444444444565555432 2221 12222221 1234556666555 Q ss_pred eeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccC-CcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 334 LDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGN-ATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDI 412 (530) Q Consensus 334 ~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn-~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l 412 (530) ++-++....+..+....+.....|...-.+|. ..+|. .++..+ ......+++..|.-+++.|..-+ T Consensus 247 ~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp---~~lg~~~~~s~~----------e~~~~~~~~~tl~P~~~~ie~~l 313 (417) T protein:vir:38 247 YQPLEVDTNVLNLINSNNYSTAQIAKALRVPA---YRLAQNSPNQSV----------KQLADDYIRNDLPFYFEPITSEF 313 (417) T ss_pred EEEccCCHHHHHHHHHHHhhHHHHHHHhCCCH---HHhCCCCcchhH----------HHHHHHHHHHHHHHHHHHHHHHH Confidence 55444333232333344555666776666663 22221 122211 11122344555655555555555 Q ss_pred HhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHHH--H-----HHHHHHHHHHH Q lcl|NC_011308. 413 RRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEETL--K-----AICDTLDLDYE 483 (530) Q Consensus 413 ~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~e--~-----~~~e~e~~e~~ 483 (530) +.+-........+.+.|...- -+.+..++ ..++...|+++...+++.+++ +++.... . ..++....... T Consensus 314 ~~~Ll~~~~~~~~~~~fd~~~-l~~~~~~~-~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~~~~d~~~~~~~ 391 (417) T protein:vir:38 314 ELKLLDDAQRHQYCIGFDTKS-VNGLPIAD-VNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLNTVFLDQKEAYQA 391 (417) T ss_pred HhhhcChhhcccceEEechhh-hhHHHHHH-HHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeeccccccccccccccc Confidence 433222222234556674321 22233333 456778999999999988765 4443211 0 00111000000 Q ss_pred HHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCc Q lcl|NC_011308. 484 DVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDP 521 (530) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (530) . +....++.++.+++.. +.... |.+. T Consensus 392 ~-----~~~~~kgg~~~~~~~~--~~~~~-----~~~~ 417 (417) T protein:vir:38 392 E-----HAAELKGGDTNAKGNQ--NGSGT-----NANS 417 (417) T ss_pred c-----cccccCCCCCCCCCCC--cCCCC-----cCCC Confidence 0 0000011111111000 00000 0000 No 173 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=96.39 E-value=0.00067 Score=37.99 Aligned_cols=387 Identities=10% Similarity=0.021 Sum_probs=160.6 Q ss_pred HHHHHHHHHHHHhhhH-HHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccccee Q lcl|NC_011308. 15 TILSTKIDEYIRSQNV-SLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLANGID 93 (530) Q Consensus 15 ~~i~~~i~~~~~~~~~-~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~pv~ 93 (530) =+..+.. .+... ..+. ...|.. .++.-... ..+..+. ++.=+.+.-....|+..++=+.+-|+. T Consensus 1 ~~~~r~~----~~~~~~~~~~-~~~~~~---~~~g~~~s---~~~~~vt----~~~al~~~~v~~~v~~ia~~iA~lp~~ 65 (419) T protein:vir:14 1 MFFSRQL----LSNLGQTQMS-AGGWVS---ALLGSSRS---DSGQVVT----PASALALTVLQNCVTLLAESIAQLPIE 65 (419) T ss_pred Ccccccc----cccccccccC-cchhhH---HhhcCCCc---cCCcccc----hHHhhccHHHHHHHHHHHHhhccCceE Confidence 0000000 00000 0000 000000 00000000 0000000 000011122444566666667777877 Q ss_pred eecCCcch------HHHHHHHHHHhhcc---HHHHHHHHHHHHhhcCeEEEEEEecCCCceE-EEEecccceEEEEcCCC Q lcl|NC_011308. 94 VKPTDHDD------QKLCYLIEEYYNEE---FQSAIQELVEGSTIKGYEGIFARTTSEDKLT-FQTVDALQLLPVFDDYG 163 (530) Q Consensus 94 ~~~~~~~d------e~~~~~l~~~~~~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~-~~~~~p~~~~~v~d~~~ 163 (530) +--.+.+. ..+...|+.- -|. -......+......+|.+|.++-++..|.+. +..++|..|-+..+..+ T Consensus 66 ~~~~~~~~~~~~~~~~l~~lL~~~-PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~~~~~ 144 (419) T protein:vir:14 66 LYERSGEDRKPATDHPLYSILKYE-PNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMRGSDL 144 (419) T ss_pred EEEecCCccccccccHHHHHHHhh-cccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCc Confidence 53222221 1222333211 121 2234456678888999999999999888864 77889988887766544 Q ss_pred CceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccc Q lcl|NC_011308. 164 TLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVE 243 (530) Q Consensus 164 ~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (530) .+ +|.+.. . ..+..+.+.|.+. T Consensus 145 ~~-----~y~~~~---~----------~~~~~~~i~h~~~---------------------------------------- 166 (419) T protein:vir:14 145 KP-----VYRVRG---S----------DPMPQRLVHHVRW---------------------------------------- 166 (419) T ss_pred eE-----EEEEcc---C----------cccchhheeEecC---------------------------------------- Confidence 21 111100 0 0011122222210 Q ss_pred ccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC---CCchhh----H Q lcl|NC_011308. 244 EHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT---NSPVDE----I 316 (530) Q Consensus 244 ~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~---~~~~~~----~ 316 (530) +.+ +.-.|.|-++.....|+....+..-..+.+...+.|-.+|+-.. ....++ + T Consensus 167 ----------~~~---------dg~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~ 227 (419) T protein:vir:14 167 ----------MSI---------NGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRI 227 (419) T ss_pred ----------cCC---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHH Confidence 000 11146676766666665555554444455555556666665321 111122 2 Q ss_pred HHHH--------hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhH Q lcl|NC_011308. 317 KKNI--------QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLL 386 (530) Q Consensus 317 ~~~~--------~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l 386 (530) +..+ ..++++.++++.++.-+.....+.......+...+.|...-.+|. ++....|+-|++ +-. T Consensus 228 ~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~--E~~---- 301 (419) T protein:vir:14 228 TDGWNAKFGGSGNAKKVALLQEGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNI--EHQ---- 301 (419) T ss_pred HHHHHHHhcCccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccH--HHH---- Confidence 2211 114566677666665554433333333445666778877778874 222111222221 111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEe--CCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC Q lcl|NC_011308. 387 AMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDI--EPYILANELDLAMIDKTEAETNQIQINNLLAIAPR 464 (530) Q Consensus 387 ~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f--~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~ 464 (530) ...++...|.-.++.|...++.+-..........+.| +.-+-.|..+.++...++..+|+++...+++.+++ T Consensus 302 ------~~~f~~~~L~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl 375 (419) T protein:vir:14 302 ------SLQFVIYTLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENM 375 (419) T ss_pred ------HHHHHHHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 1223344444444444444443221111112233444 44445688888888889999999999988888754 Q ss_pred --CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCC Q lcl|NC_011308. 465 --IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQ 529 (530) Q Consensus 465 --vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (530) +++-+ .....+.-.. -+.+. ..+..++.|.+..-...+.--| T Consensus 376 ~p~~gGD--------------~~~~~~n~~~------~~~~~---~~~~~~~~~~~~~~~e~~~~l~ 419 (419) T protein:vir:14 376 PPVKGGD--------------IYLSPMNMVD------ASKPQ---QLPVGKSEPTKAAIDEIGRILS 419 (419) T ss_pred CCCCCcC--------------eeeecccccc------ccccc---cccCCCCCCccccccchhcccC Confidence 11111 0000000000 00000 0000111111111112222222 No 174 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=96.31 E-value=0.00076 Score=37.70 Aligned_cols=410 Identities=9% Similarity=0.011 Sum_probs=188.6 Q ss_pred CCcccccCCcccHH-----HHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCc Q lcl|NC_011308. 1 MTNTLLTTAPDRLG-----TILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGF 75 (530) Q Consensus 1 ~~~~~~~~~~~~~~-----~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~ 75 (530) ||--+++-.-.... +-+...|..-. +....-..--.+..-..|+++.. ..-...++- . .... T Consensus 1 m~~~i~~~~g~p~~~~~~~~~~~~~ia~~~---~~~~~~~~~~~~~~~~~iLr~~~----~~~~~y~~m-----~-~D~~ 67 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEPDKSLSSQIATRA---RSIDFFALGMYLPNPDPVLKALG----KDIRVYREL-----R-ADAH 67 (491) T ss_pred CCCceeCCCCCccCcccCChHHHHHHHhhh---cccccccccCCccchHHHHHhcC----CCHHHHHHH-----h-hChH Confidence 77666554332111 11111121100 00000000000111123332110 000001110 0 2355 Q ss_pred hhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhhc-cHHHHHHHHHHHHhhcCeEE-EEEEecCCCceEE---EEe Q lcl|NC_011308. 76 FAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNE-EFQSAIQELVEGSTIKGYEG-IFARTTSEDKLTF---QTV 150 (530) Q Consensus 76 ~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~-~~~~~~~e~~~~~~~~G~a~-~~~y~d~~g~~~~---~~~ 150 (530) ..-.+.+...-++|.+..+...+. ++...+++.+.++. +|.+++..+. ++..+|.+. +++|....|.+.. ..+ T Consensus 68 i~s~l~~Rk~av~~~~w~i~~~~~-~~~~~e~v~e~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r 145 (491) T protein:vir:10 68 VGGCVRRRKAAVKALEWGLDRGKA-KSRVAKSIADVFADLDLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGK 145 (491) T ss_pred HHHHHHHHHHHHhCCCcEEecCCC-CHHHHHHHHHHHhcCCHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEEEeeee Confidence 666777777888899999976543 44566777777754 6777777775 677889854 6777655665543 333 Q ss_pred cccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeee Q lcl|NC_011308. 151 DALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVA 230 (530) Q Consensus 151 ~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (530) +|..+ .||..+.+ .++..+++... T Consensus 146 ~~~~f--~~d~~~~l----------------------------------~~~~~~~~~~g-------------------- 169 (491) T protein:vir:10 146 PADWF--VYDPENQL----------------------------------RFRSKDHWMQG-------------------- 169 (491) T ss_pred cccce--eeccCCce----------------------------------EEecCCCCCCc-------------------- Confidence 44322 23322221 11111100000 Q ss_pred cccccceecccccccccccccccccCCccceEEee--CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecC Q lcl|NC_011308. 231 DGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILY--NNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGG 308 (530) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~--nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~ 308 (530) ..-.+++.|-+.+-. .|..|.|.+..+-...---+..+.+.+..++.|..|+.+.+-. T Consensus 170 --------------------~~l~~~k~i~~~~~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~ 229 (491) T protein:vir:10 170 --------------------EELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHP 229 (491) T ss_pred --------------------ceecCCCEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecC Confidence 000111112111110 2346888888877777777888999999999999999998743 Q ss_pred CCCchhh------HHHHHhhCcceecCCCCceeEEEecC---CHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcH-H Q lcl|NC_011308. 309 TNSPVDE------IKKNIQSKKIIQTKGEGGLDIQTVDI---PYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATN-V 377 (530) Q Consensus 309 ~~~~~~~------~~~~~~~~~~i~~~~~~~~~~lt~~~---~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SG-v 377 (530) .+...++ ...++....++.++.+..+++++... +...++..++...+.|-..--+=.++..+. |.+.| + T Consensus 230 ~~a~~~ek~~l~~al~~~~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~~gs~a~~~v 309 (491) T protein:vir:10 230 RSASDGEKNLLLDCLEDMVQDAVAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLGQNQTTEATSTRASAQA 309 (491) T ss_pred CCCCHHHHHHHHHHHHHHhcCcEEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhhhhcccCcccchhHHHH Confidence 3222221 22345666777889999999997653 334577778877777766654433433333 22222 3 Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCC-CcHH Q lcl|NC_011308. 378 VIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQ-IQIN 456 (530) Q Consensus 378 Aik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~-iS~e 456 (530) .-+.+..-. ..-.+.....+.+++. .++..++. .. ..+.+.|.. .+.+..+.|+.+..+...|+ ++.+ T Consensus 310 h~~v~~di~----~~D~~~i~~tln~li~---~l~~~N~~-~~--~~p~f~~~~-~~e~~~~~a~~~~~L~~~G~~i~~~ 378 (491) T protein:vir:10 310 GLEVTDDIR----DGDKAVVSEAMNMLIR---WICDLNFD-GA--DRPVFDMWE-QEQVDEIQAGRDQKLTQAGARFTPA 378 (491) T ss_pred HHHHHHHHH----HHHHHHHHHHHHHHHH---HHHHhcCC-CC--CcceEEecC-cCchhHHHHHHHHHHHhCCCcCCHH Confidence 333222222 2222344444544333 33334432 22 235566643 23444678888888888886 7888 Q ss_pred HHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCC-Cccc------c----c Q lcl|NC_011308. 457 NLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNI-DPVI------E----E 525 (530) Q Consensus 457 t~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~------~----~ 525 (530) .+.+.+++ ..++.+. ... ..+ .+...+.... .....++++.+... +... . - T Consensus 379 ~i~e~~Gi-p~~~~~~-----------~~~---~~~-~~~~~~~~~~--~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 440 (491) T protein:vir:10 379 YFKRAYNL-QDGDLDE-----------RPL---PVS-AVDTVGAASF--AEFEAPDQDALDAALNTLSARDLNADAQALV 440 (491) T ss_pred HHHHHhCC-CCCCcCc-----------ccc---ccC-CCCCcccccc--cccCCCCCCchHHHHHHHHHHHHHHHHHHHH Confidence 88777754 2221110 000 000 0000000000 00000000000000 0000 0 0 Q ss_pred ccCCC Q lcl|NC_011308. 526 EPVQE 530 (530) Q Consensus 526 ~~~~~ 530 (530) +||.+ T Consensus 441 ~~i~~ 445 (491) T protein:vir:10 441 APLLK 445 (491) T ss_pred HHHHH Confidence 01100 No 175 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=96.27 E-value=0.0008 Score=37.57 Aligned_cols=442 Identities=10% Similarity=-0.001 Sum_probs=184.8 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc-- Q lcl|NC_011308. 13 LGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN-- 90 (530) Q Consensus 13 ~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~-- 90 (530) +-..+++..++++++.-..+.+.+.+|..-. ... .+.... ...-.|+..+-....++..++.|+|- T Consensus 1 mk~~~~~~~~~lkr~~~e~~w~e~a~~tlP~--~~~-----~~~~~~-----~~~~~~~~dstg~~a~~~LAa~l~~~lt 68 (510) T protein:vir:78 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPY--LMV-----DPMSGS-----RGVVEHDFQSAGALLVNNLAAKLARSLF 68 (510) T ss_pred ChhHHHHHHHHHhccchHHHHHHHHHhhccc--ccc-----CCCCcc-----cccccCcccchHHHHHHHHHHHHHHhhc Confidence 3345566666655433334445555554431 110 000000 00111333444445555555544432 Q ss_pred cee-----eecCCcc---------h-HHHH-------HHHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEE Q lcl|NC_011308. 91 GID-----VKPTDHD---------D-QKLC-------YLIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTF 147 (530) Q Consensus 91 pv~-----~~~~~~~---------d-e~~~-------~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~ 147 (530) |+. +...+.. + .++. ..+...+ ..||....+++.++...+|.+. +|.++++. +| T Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--l~~~~~~~-~~ 145 (510) T protein:vir:78 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNAL--LYRNSDEA-TV 145 (510) T ss_pred CCCCcccccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEE--EEEeCCCC-eE Confidence 221 2111110 0 1112 2222333 3588899999999999999985 45666543 45 Q ss_pred EEecccceEEEEcCCCCceeEEEEEEEEeec----------ccccccceEEEEEEEcCCceEEEeecCCcccchhhcccc Q lcl|NC_011308. 148 QTVDALQLLPVFDDYGTLQRIIRFYTEQRYS----------DADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTT 217 (530) Q Consensus 148 ~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~----------~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~ 217 (530) +.++-.+.++-.|..+....++|-+...... ......+.-..+++|+ +........ T Consensus 146 ~~~pl~~y~v~~d~~G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~------~V~~~~~~~-------- 211 (510) T protein:vir:78 146 VAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYT------HVQRRKGTA-------- 211 (510) T ss_pred EEEEcceeEEeeCCCcCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEE------EEEeecCCC-------- Confidence 6665555555557778776776655432110 0000112222334333 221111000 Q ss_pred ccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 218 VNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVKSIIDDYDLMNCFLS 292 (530) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~liDa~~~~~S~~~ 292 (530) ....+.+..+ .+.........+|...|++.++- ..+|.|--++..+-+-.++.+.-... T Consensus 212 --~~~~sv~~e~--------------dg~~i~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l 275 (510) T protein:vir:78 212 --MDYAEMYHEI--------------DGVRVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLG 275 (510) T ss_pred --CcEEEEEEEe--------------cCeeeccccccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHH Confidence 0000000000 00011112334567778777664 35799988999999999998877777 Q ss_pred HHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHHHHHHHHHhcccCCCccc Q lcl|NC_011308. 293 NNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDIDELNIYRSGMGFNSSAVG 370 (530) Q Consensus 293 n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~ 370 (530) ........|.+.+.-.+......+ ...+.+.+..+..+++..+... .+.......++.++..|-..-+. ++..-. T Consensus 276 ~~a~~a~~~~~lv~p~g~~~~~~l--~~~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~-~l~~~~ 352 (510) T protein:vir:78 276 LYELESLEVLNLVDEAKGAVVDDY--QDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRD 352 (510) T ss_pred HHHHHhhcCCcccCCccccchhhh--ccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHHhh-ccccCC Confidence 777777776655432111111111 1122344444444566665433 45566666677666666543222 232212 Q ss_pred ccCCcHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc---ccceeeEEeCCCCCCCH--HHHHHHH Q lcl|NC_011308. 371 DGNATNVVIKSRYTLLAM-KAQKTEIALRKTLRWTADLVVEDIRRRGLGDY---SSTDIKFDIEPYILANE--LDLAMID 444 (530) Q Consensus 371 ~gn~SGvAik~~~~~l~~-ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~---d~~~i~i~f~~~~P~n~--~e~a~~~ 444 (530) .+..++..+..+-.-+.+ .-....+.-.+.|.-+++.++.++...+.-.. ......+++...|=+.. ..+..+. T Consensus 353 ~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~ 432 (510) T protein:vir:78 353 AERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNAS 432 (510) T ss_pred CCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccceeeecccHHHHHHHHHHHHHHH Confidence 233455544443222221 12223333333333333434444433332111 11222334433332221 1122222 Q ss_pred HHHHhcCCC-------cHHHHHH----hCC-----CCCCHHHHHHHHHHHHHHHHHHHH----hhhccccccCCccccCC Q lcl|NC_011308. 445 KTEAETNQI-------QINNLLA----IAP-----RIGDEETLKAICDTLDLDYEDVVK----ALEDQEVEELEPTVTPI 504 (530) Q Consensus 445 ~~~~~~g~i-------S~et~l~----~~~-----~vdd~~~e~~~~e~e~~e~~~~~~----~~~~~~~~~~~~~~~~~ 504 (530) +.+...+.+ .-..++. .++ ++.. +++++.+.+++.+.+..+. .+..|.. ...+--.++ T Consensus 433 q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gv~p~~ivrs-~eev~a~~~~~~~q~~~~~~~~~a~~~~~~-~~~~~~~g~ 510 (510) T protein:vir:78 433 QVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKS-ADELQAEAEEQRRQAAQAQAAQETLLEGAS-DMTNALAGV 510 (510) T ss_pred HHHHHhcChhhhhhcCCHHHHHHHHHHHhCCChhhhcCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-hhcccCCCC Confidence 222222222 2222222 222 2222 3444444443333222222 2222221 111111122 No 176 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=96.25 E-value=0.00082 Score=37.49 Aligned_cols=415 Identities=10% Similarity=-0.013 Sum_probs=190.4 Q ss_pred CCcccccC-CcccHHHHHH----------HHHHHHHHhh-hHHHH-HHHHHHhcccchhhhcccccccccccccccccCC Q lcl|NC_011308. 1 MTNTLLTT-APDRLGTILS----------TKIDEYIRSQ-NVSLA-RVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNAS 67 (530) Q Consensus 1 ~~~~~~~~-~~~~~~~~i~----------~~i~~~~~~~-~~~~~-~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~ 67 (530) |..++... .|...+.+-+ +.+..|...- -+.++ ..++.--.|.-.... ..+ ++-.. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~---~L~-------edm~e- 69 (526) T protein:vir:79 1 MAQIVDVYGNPIRPQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQA---ELF-------MDMEE- 69 (526) T ss_pred CCeeeCCCCCccCccccchhhhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhhCCCHHHHH---HHH-------HHHHh- Confidence 55555432 2222222100 0000000000 00111 222222222110000 000 00000 Q ss_pred cceeecCchhhHHhhhhhhhcccceeeecCC---cchHHHHHHHHHHhhc--cHHHHHHHHHHHHhhcCeEE-EEEEecC Q lcl|NC_011308. 68 NIKISHGFFAELVDQKTQYLLANGIDVKPTD---HDDQKLCYLIEEYYNE--EFQSAIQELVEGSTIKGYEG-IFARTTS 141 (530) Q Consensus 68 n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~---~~de~~~~~l~~~~~~--~~~~~~~e~~~~~~~~G~a~-~~~y~d~ 141 (530) ......-.+.+...-++|.+..+.... ..++...+++++++.+ +|.+++..+. ++.-+|.+. +++|.-. T Consensus 70 ----~D~~i~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-dA~~~G~s~~Ei~w~~~ 144 (526) T protein:vir:79 70 ----RDAHLFAEMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELEWALQ 144 (526) T ss_pred ----hChHHHHHHHHHHHHHhCCCceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHHHH-hhhhhcceeEEEEEeec Confidence 124455667777788889998887643 2356677788888854 5777776664 467788854 6777655 Q ss_pred CCceEEE---EecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccc Q lcl|NC_011308. 142 EDKLTFQ---TVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTV 218 (530) Q Consensus 142 ~g~~~~~---~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~ 218 (530) .|.+... ..+|..+ .|+...... .+..++.... T Consensus 145 ~g~~~~~~l~~r~~~~F--~~~~~~~~~----------------------------------l~~~~~~~~g-------- 180 (526) T protein:vir:79 145 GREWMPLAFHHRPQSWF--QLNPEDQNE----------------------------------LRLRDNSPAG-------- 180 (526) T ss_pred CCceeEEEeeeecccce--EeccCCCcE----------------------------------EEecCCCCCc-------- Confidence 6655433 2333322 122222111 0000000000 Q ss_pred cccccceeeeeecccccceecccccccccccccccccCCccceEEee--CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 219 NPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILY--NNKLGISDIKKVKSIIDDYDLMNCFLSNNLQ 296 (530) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~--nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~ 296 (530) ..-.+++.|-+++-. .+..|.|.+..+-...--=+..+.+.+..++ T Consensus 181 --------------------------------~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E 228 (526) T protein:vir:79 181 --------------------------------EALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLE 228 (526) T ss_pred --------------------------------eeecCCceEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHH Confidence 000112222222111 2345777777654444444557888999999 Q ss_pred HhccceeeeecCCCCchhh------HHHHHhhCcceecCCCCceeEEEec-CCHHHHHHHHHHHHHHHHHHhcccCCCcc Q lcl|NC_011308. 297 DMAEAIYVVRGGTNSPVDE------IKKNIQSKKIIQTKGEGGLDIQTVD-IPYEARKAKMDIDELNIYRSGMGFNSSAV 369 (530) Q Consensus 297 ~~~~~~lvl~g~~~~~~~~------~~~~~~~~~~i~~~~~~~~~~lt~~-~~~~~~e~~ld~L~~~I~~~s~~p~~~~~ 369 (530) .|.-|+.+.+-..+...++ ...++....++.++.+..+++++.. .....++..+++..+.|-+.--+-.++.+ T Consensus 229 ~yG~P~~igky~~~a~~~ek~~L~~av~~i~~da~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~ 308 (526) T protein:vir:79 229 IYGLPIRLGKYPPGTADEEKATLLRAVTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTST 308 (526) T ss_pred HcCCceEEEecCCCCCHHHHHHHHHHHHHHhcCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 9999999987322222222 2234566677788999999999854 45566788889999888877655555432 Q ss_pred c----cc-CCcH-HHHHHHHhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCcccc-ceeeEEeCCCCCCCHHHHH Q lcl|NC_011308. 370 G----DG-NATN-VVIKSRYTLLAMKAQKTEIALRKTLR-WTADLVVEDIRRRGLGDYSS-TDIKFDIEPYILANELDLA 441 (530) Q Consensus 370 ~----~g-n~SG-vAik~~~~~l~~ka~~ke~~f~~~l~-~~~~~i~~~l~~~~~~~~d~-~~i~i~f~~~~P~n~~e~a 441 (530) . .| ++-| +.-+.+..-....|.. ....|. +++. .++.......-+. .-..+.|...-|.|.+..| T Consensus 309 ~~~g~~gS~a~g~vh~~v~~di~~aDa~~----i~~tln~~Li~---~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a 381 (526) T protein:vir:79 309 TSQSGGGAFALGQVHNEVRHDILASDARQ----LAATLSRDLLW---PLLVLNRPGSPDVRRAPRLVFDLREQADITSMA 381 (526) T ss_pred cccCcchhhhhHHHHHHHHHHHHHHHHHH----HHHHHHHHHHH---HHHHhCCCCcCCccccceEEeCCCCcccHHHHH Confidence 1 11 2222 2332222222222332 333332 1222 2333333221111 2357888888899999999 Q ss_pred HHHHHHHhcCC-CcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCC Q lcl|NC_011308. 442 MIDKTEAETNQ-IQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNID 520 (530) Q Consensus 442 ~~~~~~~~~g~-iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (530) +.+..+...|+ +|.+.+.+.+++ ..++.. +..... ...+.......+ ...........+...+ T Consensus 382 ~~~~~L~~~G~~i~~~~i~e~~gi-p~~~~~-----------e~~l~~---~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 445 (526) T protein:vir:79 382 QSIPALVNVGLEIPSAWVYDKLGI-PQPAKN-----------EPVLRP---AAQPAILSRQHG-QRVAALATIVGPRYGD 445 (526) T ss_pred HHHHHHHhCCCcCCHHHHHHHhCC-CCCCCc-----------hhhccc---cCCccccccccc-cccccccccccccCch Confidence 99999999996 899988888865 212110 000000 000000000000 0000000000000001 Q ss_pred ccccc-------------------ccCCC Q lcl|NC_011308. 521 PVIEE-------------------EPVQE 530 (530) Q Consensus 521 ~~~~~-------------------~~~~~ 530 (530) +...+ +|+.+ T Consensus 446 ~~~~d~~l~~~~~~~~~~~~~~~~~~i~~ 474 (526) T protein:vir:79 446 QQALDKALADLPAKDMQNQANDLLAPLLD 474 (526) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00000 00000 No 177 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=96.25 E-value=0.00083 Score=37.48 Aligned_cols=476 Identities=10% Similarity=-0.015 Sum_probs=195.0 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) |.+ +. .++-+.+++..+... .+|-.+....+++|+.--.-..+........ .....+.+.|+..+-+...+ T Consensus 1 m~~----d~-~~~~~~l~~r~~~l~-~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~---~~~~~~~~~~~~dstg~~a~ 71 (549) T protein:vir:10 1 MTN----DD-AKILQALNADHGRMK-EKRQSYEAVWNDVIDYLMPRLDKFGQLPRPD---SEKGRERSQKMFDSTAPLAL 71 (549) T ss_pred CCc----ch-HHHHHHHHHHHHHHH-HHhhhHHHHHHHHHHHhccccccccccCCCC---CCcccccccccccchHHHHH Confidence 433 22 222222333333332 2333333333333332211110000000000 00111223466666677777 Q ss_pred hhhhhhhccc--cee-----eecCCcc---hHH-------HHHHHHHHh---hccHHHHHHHHHHHHhhcCeEEEEEEec Q lcl|NC_011308. 81 DQKTQYLLAN--GID-----VKPTDHD---DQK-------LCYLIEEYY---NEEFQSAIQELVEGSTIKGYEGIFARTT 140 (530) Q Consensus 81 d~~~~yl~G~--pv~-----~~~~~~~---de~-------~~~~l~~~~---~~~~~~~~~e~~~~~~~~G~a~~~~y~d 140 (530) +..++.|+|- |+. +...+.. ... ....+...+ ..||....+++.++...+|.+..++--+ T Consensus 72 ~~LAs~l~~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~ 151 (549) T protein:vir:10 72 RNFVAAMDSMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHD 151 (549) T ss_pred HHHHHHHHhhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeec Confidence 7777666642 221 2222211 111 112222222 3578888999999999999998776555 Q ss_pred CCCceEEEEecccceEEEEcCCCCceeEEEEEEEEeec-------cc-------ccccceEEEEEEEcCCceEEEeecCC Q lcl|NC_011308. 141 SEDKLTFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYS-------DA-------DNKFNSIGHADVWTDTEVWYYVQKDE 206 (530) Q Consensus 141 ~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~-------~~-------~~~~~~~~~~evyt~~~~~~y~~~~~ 206 (530) ..+-++|..++-.++++--|..+....++|.|...... .. ....+.-..++||+ .+.... T Consensus 152 ~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~------~V~pr~ 225 (549) T protein:vir:10 152 VGKGIVYRNVPMQRLWFAENNSGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYH------AVEPRA 225 (549) T ss_pred CCCeeEEEEEEcCeEEEeeCCCCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEE------EeecCC Confidence 55667888899888888888888887777755421110 00 00112223344432 111110 Q ss_pred cccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHHHHH Q lcl|NC_011308. 207 GRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVKSII 281 (530) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~li 281 (530) .....-.. ....+...+.. ..+ ........+|...|++.++- +.+|.|--++..+-+ T Consensus 226 ~~~~~~~~---~~~~pf~sv~~----------e~~-----~~~il~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~ 287 (549) T protein:vir:10 226 DRDPRKLD---GRNMQFASYWL----------DEG-----RDRIVQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDV 287 (549) T ss_pred CCCccccc---cccCceEEEEE----------Eec-----CCEeeccCCcccCCcceeeeeecCCCccccchHHHHHHHH Confidence 00000000 00000000000 000 01112234556667776553 358999999999999 Q ss_pred HHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCccee--cCCCC--ceeEEEecCCHHHHHHHHHHHHHHH Q lcl|NC_011308. 282 DDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQ--TKGEG--GLDIQTVDIPYEARKAKMDIDELNI 357 (530) Q Consensus 282 Da~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~--~~~~~--~~~~lt~~~~~~~~e~~ld~L~~~I 357 (530) ..++.+.-......+...+|.+.+.-.+.... .++..+++.. .+.++ .+.-+....+.......++.++..| T Consensus 288 k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~----~~l~pgg~~~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI 363 (549) T protein:vir:10 288 RMANDMAKTNIRGAQKLVDPPLLANEDGVLDG----FDLRSGALNWGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTI 363 (549) T ss_pred HHHHHHHHHHHHHHHHHhcCceeecccccccc----ceeccCCccccccCCCCccceeeeccccchhHHHHHHHHHHHHH Confidence 99999999999999999999988742222111 1222233321 22223 3454544456566666666666655 Q ss_pred HHHhcccCCCc-ccccCCcHHHHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-----c--ccceeeEE Q lcl|NC_011308. 358 YRSGMGFNSSA-VGDGNATNVVIKSRYTLLAMK-AQKTEIALRKTLRWTADLVVEDIRRRGLGD-----Y--SSTDIKFD 428 (530) Q Consensus 358 ~~~s~~p~~~~-~~~gn~SGvAik~~~~~l~~k-a~~ke~~f~~~l~~~~~~i~~~l~~~~~~~-----~--d~~~i~i~ 428 (530) =..-+.--+.. .+....++..+..+-.-+... -....+.-.+.|.=+++-++.++...+... . ....++++ T Consensus 364 ~~af~~d~~~~~~~~~~~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i~ 443 (549) T protein:vir:10 364 NQWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVE 443 (549) T ss_pred HHHHhhhhhhhhcCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEEE Confidence 44332211111 111234555444433222222 111222222222222222333333333311 1 23456677 Q ss_pred eCCCCCCC--HHHHHHHHHHHHhcCC-----------CcHHHHHHhC----C----CCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 429 IEPYILAN--ELDLAMIDKTEAETNQ-----------IQINNLLAIA----P----RIGDEETLKAICDTLDLDYEDVVK 487 (530) Q Consensus 429 f~~~~P~n--~~e~a~~~~~~~~~g~-----------iS~et~l~~~----~----~vdd~~~e~~~~e~e~~e~~~~~~ 487 (530) |...|-+. ..++..+.+.+...|. +.-..++..+ + .+.. ++|.+++.+..++....+. T Consensus 444 yis~La~aq~~~~~~~i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~~~irs-~eev~~~r~~~~~qqq~~~ 522 (549) T protein:vir:10 444 YDSPLNKAMRAGEGAAILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPVEAMST-DEELQAQQAAEAQAAQMQQ 522 (549) T ss_pred eecHHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCccccCC-HHHHHHHHHHHHHHHHHHH Confidence 75544332 1112111111111111 2222332221 1 1222 2333333332222222221 Q ss_pred hhhccccccCCccccCCCCCCCCCCccCcCCCCcc Q lcl|NC_011308. 488 ALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPV 522 (530) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (530) ........ .+-+.... ...+..+ +.-+ T Consensus 523 ~~~~a~~a--~~~a~~~~--~~~ta~~----~~~~ 549 (549) T protein:vir:10 523 MLAAAPVA--AGAIKDLS--DAQTAAQ----TARV 549 (549) T ss_pred HHHHHHHH--HHHHHhhh--hhcCCCc----ccCC Confidence 11111000 00000000 0011100 0000 No 178 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=96.22 E-value=0.00086 Score=37.40 Aligned_cols=370 Identities=9% Similarity=0.016 Sum_probs=158.3 Q ss_pred ccHHHHHHHHHHHHHHhhhHHH-HHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhcc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSL-ARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLA 89 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~-~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G 89 (530) |.+...+.+...+ ....... .....-.+-+. ...+... .+..-+.+.-.-..|+..++=+.+ T Consensus 1 M~~f~~~~~~~~~--~~~~~~~~~~~~~~~~~~~-----------~~~~~~v----~~~~al~~~~v~~~i~~ia~~ia~ 63 (386) T protein:vir:49 1 MPIFNITNLATES--PPINQESFFDIADSDFLAS-----------LNSSEWV----SAENALKNSDLFSIISQLSNDLAT 63 (386) T ss_pred CchhhhhccCCCC--cccchhhhhhhhhcccccc-----------ccCCcee----chhhhhccHHHHHHHHHHHHHhhh Confidence 4433222211100 0000000 00000000000 0000000 000001111122345556666667 Q ss_pred cceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCcee Q lcl|NC_011308. 90 NGIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQR 167 (530) Q Consensus 90 ~pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~ 167 (530) -|+++.-.. ....+.+=.. -........+......+|.||.++-++..|.+ .+..++|..+-++.++.+... T Consensus 64 ~p~~~~~~~-----~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~~~~~~~~- 137 (386) T protein:vir:49 64 AKITTSRKQ-----LQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGL- 137 (386) T ss_pred Cceeeccch-----hhhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceE- Confidence 777653211 1111111111 12234456677788899999999888888875 577889998877665443211 Q ss_pred EEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccccccc Q lcl|NC_011308. 168 IIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEG 247 (530) Q Consensus 168 ~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (530) +|.+...... ......+..+.+.|++..... T Consensus 138 ---~y~~~~~~~~------~~~~~~~~~~evih~~~~~~~---------------------------------------- 168 (386) T protein:vir:49 138 ---YYNITFDDPH------IAPKQHVPQNDILHFRLLSVD---------------------------------------- 168 (386) T ss_pred ---EEEEEEcCcc------ccceeEEccccEEEecCCCCC---------------------------------------- Confidence 1211110000 001123344555554321100 Q ss_pred ccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhh---HHHH----- Q lcl|NC_011308. 248 RQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDE---IKKN----- 319 (530) Q Consensus 248 ~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~---~~~~----- 319 (530) ..-.|.|-+..+...++....+..-..+.+.-.+.|-.+++-......+. .... T Consensus 169 ------------------~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~ 230 (386) T protein:vir:49 169 ------------------GGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDFKTKVSRSRQAMK 230 (386) T ss_pred ------------------CccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHHHHHHHHHHHHhc Confidence 00146777776666666555444444455555556666654322211111 1111 Q ss_pred HhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--CcccccCCcHHHHHHHHhhHHHHHHHHHHHH Q lcl|NC_011308. 320 IQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGDGNATNVVIKSRYTLLAMKAQKTEIAL 397 (530) Q Consensus 320 ~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f 397 (530) -..++++.+++|.+++-+.....+.......+.+.+.|...-.+|.. +....+..++..++. .+ T Consensus 231 ~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~--------------~~ 296 (386) T protein:vir:49 231 QMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQSSLEMIYN--------------IY 296 (386) T ss_pred cCCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccchHHHHHH--------------HH Confidence 11335566666666666654445555556678888888888888852 221112223333222 23 Q ss_pred HHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCC---CCCCHHHHHHH Q lcl|NC_011308. 398 RKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAP---RIGDEETLKAI 474 (530) Q Consensus 398 ~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~---~vdd~~~e~~~ 474 (530) ...++..++.+..-++.+-.. .+.+..+.-+-.+..+.+.....+..+|++++-.+++.+. +..++ + T Consensus 297 ~~~i~~~l~~i~~~~~~~l~~-----~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~~---~-- 366 (386) T protein:vir:49 297 FKSVSRYLRPFVSEMSKKLSC-----EVDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILPKE---L-- 366 (386) T ss_pred HHHHHHHHHHHHHHHHHHhcc-----hhcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCCc---C-- Confidence 333444444333333322111 1222233333345566777777888999999988877642 22111 0 Q ss_pred HHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCC Q lcl|NC_011308. 475 CDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIE 511 (530) Q Consensus 475 ~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (530) +..... .. +..++.+ .+.++ T Consensus 367 ------------~~~~~~---~~-~~~~gGd-~~~~~ 386 (386) T protein:vir:49 367 ------------PDGKNP---NR-TSLKGGE-INEQD 386 (386) T ss_pred ------------cchhcc---CC-CCCCCCC-CCCCC Confidence 000000 00 0000000 00011 No 179 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=96.20 E-value=0.00088 Score=37.32 Aligned_cols=388 Identities=11% Similarity=0.087 Sum_probs=159.6 Q ss_pred ccHHHHHHH--HHHHHHHhhhHHHHHHHHHHhcccc-hhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhh Q lcl|NC_011308. 11 DRLGTILST--KIDEYIRSQNVSLARVGQRYYNQDN-DIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYL 87 (530) Q Consensus 11 ~~~~~~i~~--~i~~~~~~~~~~~~~~~~~YY~g~~-~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl 87 (530) |. ++.+ +....+. . ...++..... .... .-......+... .++.-+...-...-|+..++=+ T Consensus 1 m~---~~~~~~~~~~~~~----~---~~~~~~~~~~~~~~~-~~~~~~~~~~~v----~~~~a~~~~~v~~~i~~ia~~i 65 (412) T protein:vir:26 1 MN---VIAKENIVTRIKK----K---LIDNWIDQSTSKLYD-FSPWKNRSFWGV----INNTLETNETIFSAITKLSNSM 65 (412) T ss_pred Cc---cchhhhhhhhhhh----h---Hhhhhhccccccccc-ccccCCcccccc----chhhhhccHHHHHHHHHHHHhH Confidence 22 2211 1111000 0 0111110000 0000 000000000000 0001111122333455555556 Q ss_pred cccceeeec-CCcchHHHHHHHHHHhhcc---HHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCC Q lcl|NC_011308. 88 LANGIDVKP-TDHDDQKLCYLIEEYYNEE---FQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDY 162 (530) Q Consensus 88 ~G~pv~~~~-~~~~de~~~~~l~~~~~~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~ 162 (530) .+-|+++.- .+..+......|+.= -|. -......+..+...+|.||.++-++..|++ .+..++|..+-+..++. T Consensus 66 A~lp~~~~~~~~~~~~~~~~lL~~~-PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~ 144 (412) T protein:vir:26 66 ASLPLKMYEDYKVVNTEVSDLLTVS-PNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 144 (412) T ss_pred hhCceeEeeccccccchHHHHHHhh-cccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCC Confidence 667876532 222233333333321 122 223446678888999999999999999985 57788999998877654 Q ss_pred CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccc Q lcl|NC_011308. 163 GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGV 242 (530) Q Consensus 163 ~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (530) .... +|.+.... +. ...+.++.+.|++.... T Consensus 145 ~~~~----~y~~~~~~-----g~----~~~~~~~evih~~~~~~------------------------------------ 175 (412) T protein:vir:26 145 SREL----YYSIHAAT-----GN----KLIVHNMDMLHFKHIVA------------------------------------ 175 (412) T ss_pred CcEE----EEEEEcCC-----ce----EEEEccccEEEeCCCCC------------------------------------ Confidence 3211 12211100 11 11345556665532110 Q ss_pred cccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcc-ceeeee-cCCCCch--hhHHH Q lcl|NC_011308. 243 EEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAE-AIYVVR-GGTNSPV--DEIKK 318 (530) Q Consensus 243 ~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~-~~lvl~-g~~~~~~--~~~~~ 318 (530) . +.-.|.|.++-....|+..+.+. .. .+..+.. +-++++ +...+++ +..+. T Consensus 176 -------------~---------~~~~G~s~i~~~~~~i~~~~a~~-~~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~ 230 (412) T protein:vir:26 176 -------------S---------NMVQGISPIDVLKNTTDFDNAVR-TF--NLTEMQKPDSFMLKYGSNVGKEKRQQVLE 230 (412) T ss_pred -------------C---------CCcccccHHHHHHHHHHHHHHHH-HH--HHHhcCCCCceEEecCCCCCHHHHHHHHH Confidence 0 01136666655544444332221 11 2333332 233333 3232221 11111 Q ss_pred HH-----hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHH Q lcl|NC_011308. 319 NI-----QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQ 391 (530) Q Consensus 319 ~~-----~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~ 391 (530) .+ ..++++.+++|.++.-++....+.......+.....|...-.+|. ++....++-|+.. . T Consensus 231 ~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e------------~ 298 (412) T protein:vir:26 231 DFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNE------------E 298 (412) T ss_pred HHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHH------------H Confidence 11 133455666665555554333333444455667778888878874 2221111112211 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcccc---ceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CC Q lcl|NC_011308. 392 KTEIALRKTLRWTADLVVEDIRRRGLGDYSS---TDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IG 466 (530) Q Consensus 392 ~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~---~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vd 466 (530) ....++...|.-.+..|...++.+-....+. ..+++.+..-+-.|..+.++....+..+|+++...+++.+++ ++ T Consensus 299 ~~~~f~~~~l~P~~~~ie~~ln~kLl~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ 378 (412) T protein:vir:26 299 LNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVE 378 (412) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 1122333344444455544444322111111 224444445556788889999999999999999999888764 22 Q ss_pred CHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCC Q lcl|NC_011308. 467 DEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEP 512 (530) Q Consensus 467 d~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (530) +-+.-+ + ......+... .+......+. +.++++. T Consensus 379 ggD~~~--~-------~~n~~~~~~~--~~~~~~~~gG-~~n~~e~ 412 (412) T protein:vir:26 379 GGDKPL--I-------SGDLYPIDTP--LELRKSLKGG-DKNVNES 412 (412) T ss_pred CcCeee--e-------cccccccccc--hhhcccccCC-CCCcCCC Confidence 211100 0 0000000000 0000000000 0011111 No 180 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=96.19 E-value=0.0009 Score=37.29 Aligned_cols=482 Identities=10% Similarity=0.034 Sum_probs=199.4 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHH---HHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchh Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVS---LARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFA 77 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~---~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k 77 (530) |...+ .++ +++..+.+.+ +|.. +.+.+.+|.--.- .+ ..+.. .....+.+.|+..+-+. T Consensus 1 m~~~~-------~~~-l~~r~~~l~~-~R~~~e~~w~e~~~~~lP~~---~~---~~~~~---~~~~~~~~~~~~dst~~ 62 (559) T protein:vir:95 1 MAETT-------KER-LNKQFAQLES-ERQSFEPHWRELSDYINPRG---SR---FLTSE---VNRNDRRNTRIIDSTGT 62 (559) T ss_pred CChhh-------HHH-HHHHHHHHHH-HhhHHHHHHHHHHHHhcccc---CC---cCCCC---CCcccccccccccchHH Confidence 22111 222 2333333332 3333 3444555532110 00 00000 00011223466666677 Q ss_pred hHHhhhhhhhccc--ce-----eeecCCcc---hHHHHHH-------HHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEe Q lcl|NC_011308. 78 ELVDQKTQYLLAN--GI-----DVKPTDHD---DQKLCYL-------IEEYY-NEEFQSAIQELVEGSTIKGYEGIFART 139 (530) Q Consensus 78 ~Ivd~~~~yl~G~--pv-----~~~~~~~~---de~~~~~-------l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~ 139 (530) ..++..++.|++- |+ ++...+.. ..++.+. +...+ ..+|....+++.++...+|.+..++-. T Consensus 63 ~a~~~Las~l~~~ltpp~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~ 142 (559) T protein:vir:95 63 MAARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLD 142 (559) T ss_pred HHHHHHHHHHHHhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeec Confidence 7777777666642 21 12222211 1122222 23333 357888899999999999999766555 Q ss_pred cCCCceEEEEecccceEEEEcCCCCceeEEEEEEEEeec-------cc--------ccccceEEEEEEEcCCceEEEeec Q lcl|NC_011308. 140 TSEDKLTFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYS-------DA--------DNKFNSIGHADVWTDTEVWYYVQK 204 (530) Q Consensus 140 d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~-------~~--------~~~~~~~~~~evyt~~~~~~y~~~ 204 (530) +..+-+++..++..++++.-|..+....++|.+...... .. ...+..-.++++++. .|... T Consensus 143 d~~~~~r~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~ 218 (559) T protein:vir:95 143 DDEDIIRTMPFPIGSYYLANSPRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHS----VYPNI 218 (559) T ss_pred CCCceeEEEEeecCeEEEeeCCCCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEE----Eeccc Confidence 555568899999999998889888888877765432210 00 000000112222210 01111 Q ss_pred CCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCc-HHHHH Q lcl|NC_011308. 205 DEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISD-IKKVK 278 (530) Q Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd-~e~v~ 278 (530) +..... .+.. . ..+..+ ..+.+.. ........+|...|++.++- ..+|.|- -++.. T Consensus 219 ~~~~~~---~~~~--~---~pf~s~-------~~e~~~~---~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~al 280 (559) T protein:vir:95 219 DRDTSK---LDSK--N---KPFKSV-------YYEVGGD---NDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLAL 280 (559) T ss_pred cccccc---cccc--c---ceEEEE-------EEEecCC---CceeeecCCcccCCccceeeeecCCccccccchHHHhh Confidence 100000 0000 0 000000 0000000 00111223455566666553 3578884 88889 Q ss_pred HHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCC---ceeEE-EecCCHHHHHHHHHHHH Q lcl|NC_011308. 279 SIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEG---GLDIQ-TVDIPYEARKAKMDIDE 354 (530) Q Consensus 279 ~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~---~~~~l-t~~~~~~~~e~~ld~L~ 354 (530) +-+..++.+.-..+...+...+|.+.+-+..... ..++..+++..++..+ .+..+ +.+.+..++...++.++ T Consensus 281 ~d~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~----~~~l~pgg~~~~~~~~~~~~i~p~~~~~~~~~~~~~~i~~~~ 356 (559) T protein:vir:95 281 GPVKALQLLQKRKSQLIDKATNPPMVAPTSLKNQ----RASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTR 356 (559) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCceecccccccc----ceeeeccceeeeCCCCCcccceeecccccchHHHHHHHHHHH Confidence 9999999999999999999999988864322111 1223444444333322 23333 22344555555566666 Q ss_pred HHHHHHhccc---CCCcccccCCcHHHHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCC-----Ccccccee Q lcl|NC_011308. 355 LNIYRSGMGF---NSSAVGDGNATNVVIKSRYTLLAMK-AQKTEIALRKTLRWTADLVVEDIRRRGL-----GDYSSTDI 425 (530) Q Consensus 355 ~~I~~~s~~p---~~~~~~~gn~SGvAik~~~~~l~~k-a~~ke~~f~~~l~~~~~~i~~~l~~~~~-----~~~d~~~i 425 (530) ..|=..-+.- .++.-+.+..|+..+..+-.-+... .....+.-.+.|.=+++-++.++...+. ...+...+ T Consensus 357 ~rI~~af~~d~~~~l~~r~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i 436 (559) T protein:vir:95 357 QIINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPL 436 (559) T ss_pred HHHHHHhhhhhHHHhhcCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcce Confidence 6553333221 1222233445665554443333222 2223333333333333333444444332 23344567 Q ss_pred eEEeCCCCCCCH--HHHHHHHHHHHhcC-----------CCcHHHHHHhC----C----CCCCHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 426 KFDIEPYILANE--LDLAMIDKTEAETN-----------QIQINNLLAIA----P----RIGDEETLKAICDTLDLDYED 484 (530) Q Consensus 426 ~i~f~~~~P~n~--~e~a~~~~~~~~~g-----------~iS~et~l~~~----~----~vdd~~~e~~~~e~e~~e~~~ 484 (530) ++.|.-.|-+-. .+...+.+.+...| .+.-..++..+ + .+.. ++|.+++.+++.+.++ T Consensus 437 ~v~~is~La~aqk~~~~~~i~~~~~~~~~laq~~Pevld~id~d~~~~~~a~~~Gvp~~~irs-~~ev~~~rqqr~~~qq 515 (559) T protein:vir:95 437 KVEYISVMAQAQKSIGLSSLASTVNFIGQLAQVKPEALDKLNVDQAIDAFADMSGVSPTVIVP-QEQVEQARQQRAQQQQ 515 (559) T ss_pred EEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhhhcCCHHHHHHHHHHHhCCchhhcCC-HHHHHHHHHHHHHHHH Confidence 788766554311 11111111111111 12233333221 1 1111 3444444443333333 Q ss_pred HHHhhhcccc-ccCCccccCCCCCCCCCCccCcCCCCccccccc Q lcl|NC_011308. 485 VVKALEDQEV-EELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEP 527 (530) Q Consensus 485 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (530) .+...+.+.. .+......+......+-.+....-..+.+.+.- T Consensus 516 ~~q~~~~~~~aa~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 559 (559) T protein:vir:95 516 QQQMMAMGMAAAQGVKTLSEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) T ss_pred HHHHHHHHHHHHHhhhccccccCCChhHHHHHHHhhcCccccCC Confidence 2222221111 000011111000000111111100000000000 No 181 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=96.15 E-value=0.00094 Score=37.18 Aligned_cols=457 Identities=10% Similarity=0.018 Sum_probs=193.6 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |++.+-...+-++ + +.-..+.+.+.+|....- ..... ... .......|+..+.....++..++.|+|- T Consensus 1 m~~~~r~~~L~~~-R-~~~e~~w~e~~~~tlP~~--~~~~~-----~~~---~~~~~~~~~~dstg~~a~~~LAa~l~~~ 68 (522) T protein:vir:10 1 MKARERYNQLTTA-R-QMFLDKAVECSELTLPYL--IDDDI-----SSR---PNHKSLTVPWQSVGAKCCVTLAAKLMLA 68 (522) T ss_pred CchHHHHHHHHHH-h-hHHHHHHHHHHHHhhhcc--cCCCC-----CCC---cccccccccccchHHHHHHHHHHHHHHh Confidence 7765544443211 1 111233445555543211 00000 000 0011223566666666666666665542 Q ss_pred --cee---e--ecCCcc-----h-------HHHHH----HHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCceE Q lcl|NC_011308. 91 --GID---V--KPTDHD-----D-------QKLCY----LIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLT 146 (530) Q Consensus 91 --pv~---~--~~~~~~-----d-------e~~~~----~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~ 146 (530) |+. | ...+.. + +..++ .+...+ ..||....+++.++...+|.+. +|.++++ T Consensus 69 ltpp~~~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--ly~~~~~--- 143 (522) T protein:vir:10 69 VLPPQTSFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNAL--IFMGKDG--- 143 (522) T ss_pred hcCCCCccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcee--EEEcCCC--- Confidence 221 2 111110 0 11111 122223 4688889999999999999987 5677764 Q ss_pred EEEecccceEEEEcCCCCceeEEEEEEEEeec------------ccccccceEEEEEEEcCCceEEEeecCCcccchhhc Q lcl|NC_011308. 147 FQTVDALQLLPVFDDYGTLQRIIRFYTEQRYS------------DADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVL 214 (530) Q Consensus 147 ~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~------------~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~ 214 (530) +++++-.+.++--|..+....++|.+...... ......+.-..++||+. .|...+.+. T Consensus 144 ~~~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~----v~p~~~~~~------ 213 (522) T protein:vir:10 144 LKTFPLTRYVINRDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTY----VKLDKSSGR------ 213 (522) T ss_pred ceEEEcceEEEeeCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEE----EEeeccCCc------ Confidence 45555556555567788877777766532110 00011122223344331 111111100 Q ss_pred cccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHHHHHHHHHHHHH Q lcl|NC_011308. 215 DTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVKSIIDDYDLMNC 289 (530) Q Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~liDa~~~~~S 289 (530) ...................+|..+|++.++- +.+|.|--++..+-+-.++.+.- T Consensus 214 ---------------------~~~~~~~~~~~~~~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~ 272 (522) T protein:vir:10 214 ---------------------WVWHQEAFDKIIPDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQ 272 (522) T ss_pred ---------------------eEEEEccCCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHH Confidence 0000000001111112245777888877664 35899999999999999999988 Q ss_pred HHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHHHHHHHHHhcccCCC Q lcl|NC_011308. 290 FLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDIDELNIYRSGMGFNSS 367 (530) Q Consensus 290 ~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L~~~I~~~s~~p~~~ 367 (530) ..........+|.+.+.-.+......+. -.+.+.+..+..+++..+... .+.......++.++..|...-+.- + T Consensus 273 ~~~~~~~~a~~p~~lv~~~~~~~~~~l~--~~~~~~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~~--~ 348 (522) T protein:vir:10 273 SLIEGAAAASKVVFLVSPSSTTKPATIA--KAGNGAIVQGRPEDVAVIQVGKTADFSTAANMATAIEKRLLEAFLVM--N 348 (522) T ss_pred HHHHHHHHhcCCceeecccccccccccc--CCCCcceecCCCccceeecccccccchHHHHHHHHHHHHHHHHHhhc--c Confidence 8889999999998887432222211111 123344555556677666533 456666777888888777654321 2 Q ss_pred cccccCCcHHHHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCC-cc--cc-ceeeEEeCCCCCCCH--HHH Q lcl|NC_011308. 368 AVGDGNATNVVIKSRYTLLAMK-AQKTEIALRKTLRWTADLVVEDIRRRGLG-DY--SS-TDIKFDIEPYILANE--LDL 440 (530) Q Consensus 368 ~~~~gn~SGvAik~~~~~l~~k-a~~ke~~f~~~l~~~~~~i~~~l~~~~~~-~~--d~-~~i~i~f~~~~P~n~--~e~ 440 (530) ..+.+..++..+..+-.-+.+. -....+.-.+.|.=+++-++.++...+.- .. +. ....+++...|=+.. ..+ T Consensus 349 ~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~~v~~is~Laraq~~~~l 428 (522) T protein:vir:10 349 VRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRSNQIPKLPKDIVRPTIVAGVNALGRGQDRESL 428 (522) T ss_pred CCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccccccccccchhHHHHHHHHHHH Confidence 2222345665554443322221 11111111222222222233333333321 01 11 112234433332221 111 Q ss_pred HHHHHHHHh-cC------CCcHHHHHHhC---CCCC-----CHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCC Q lcl|NC_011308. 441 AMIDKTEAE-TN------QIQINNLLAIA---PRIG-----DEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPII 505 (530) Q Consensus 441 a~~~~~~~~-~g------~iS~et~l~~~---~~vd-----d~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~ 505 (530) .+.+..+.. .| .+.-..++..+ -.|+ -.+++++++++...+.+...........-. +.+. T Consensus 429 ~~~~~~i~~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~~~~~~a~~~~----~~~~- 503 (522) T protein:vir:10 429 TAFVGTIAQTLGPEALMQYLNPLEAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQSLVDQAGQMT----GSPL- 503 (522) T ss_pred HHHHHHHHHhhCchhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cccc- Confidence 111111110 11 12222222221 1121 123333333333322222211111110000 0011 Q ss_pred CCCCCCCccCcCCCCcccccccC Q lcl|NC_011308. 506 DPLTIEPQPEPLNIDPVIEEEPV 528 (530) Q Consensus 506 ~~~~~~~~~~~~~~~~~~~~~~~ 528 (530) -.+++.|..-++-+.+.+. T Consensus 504 ----~~~~~~~~~~~~~~~~~~~ 522 (522) T protein:vir:10 504 ----MDPTKNPQLMDEEQPPMEE 522 (522) T ss_pred ----cCccccHHHHHHhCCCCCC Confidence 1122222233333333333 No 182 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=96.07 E-value=0.001 Score=36.92 Aligned_cols=391 Identities=12% Similarity=0.030 Sum_probs=161.4 Q ss_pred HhhhHHHHHHHHHHhcccchhhhcccccccccccccccc-----------cCCcceee------cCchhhHHhhhhhhhc Q lcl|NC_011308. 26 RSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDD-----------NASNIKIS------HGFFAELVDQKTQYLL 88 (530) Q Consensus 26 ~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~-----------~~~n~ki~------~n~~k~Ivd~~~~yl~ 88 (530) -. -.+.|-. +.-....+.+........+...... ......|. +.=.-..|+..++=+. T Consensus 1 ~~-~~~~mg~----f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~~ia 75 (432) T protein:vir:81 1 MP-DEKKLGL----FGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIA 75 (432) T ss_pred CC-chhhcch----hhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHHHHhhh Confidence 00 0111111 1111111111111000000000000 00000110 0111224555555566 Q ss_pred ccceee-ecCCcch-----HHHHHHHHHHhhccH---HHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEE Q lcl|NC_011308. 89 ANGIDV-KPTDHDD-----QKLCYLIEEYYNEEF---QSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPV 158 (530) Q Consensus 89 G~pv~~-~~~~~~d-----e~~~~~l~~~~~~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v 158 (530) +-|+.+ .-..++. ..+...|+.- -|.. ......+......+|.||.++..+ +|++ .+..++|..+-+. T Consensus 76 ~lp~~~y~~~~~g~~~~~~~~l~~lL~~~-PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-~g~~~~L~~l~~~~v~v~ 153 (432) T protein:vir:81 76 AMPLTMYMRTPDGRKEAVNHPLYTLLLDG-PNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTIT 153 (432) T ss_pred hCceeeEEecCCcceecccchHHHHHHhc-ccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEE Confidence 667764 2111111 1122223210 0222 234456677888999999888775 4664 4567899999888 Q ss_pred EcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccccee Q lcl|NC_011308. 159 FDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAIL 238 (530) Q Consensus 159 ~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (530) .++.+.+. |...... +. ...+..+.+.|++... T Consensus 154 ~~~~g~~~-----y~~~~~~-----g~----~~~~~~~~iih~r~~~--------------------------------- 186 (432) T protein:vir:81 154 TDPKGNTA-----YRYRRTD-----GQ----MIDIPKQQIWKIMGYS--------------------------------- 186 (432) T ss_pred ECCCCcEE-----EEEEecC-----ce----EEEEccccEEEecCCC--------------------------------- Confidence 87665422 2111100 10 1123344444442110 Q ss_pred cccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-CCc--hhh Q lcl|NC_011308. 239 DEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT-NSP--VDE 315 (530) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~-~~~--~~~ 315 (530) + +.-.|.|-+......|+....+..-.++.+.--..|-.+++-.. .++ -+. T Consensus 187 -----------------~---------dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~ 240 (432) T protein:vir:81 187 -----------------L---------DGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDS 240 (432) T ss_pred -----------------C---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHH Confidence 0 11135565554444444433333333333333334544443221 111 122 Q ss_pred HHHH----HhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCC-cHHHHHHHHhhHHH Q lcl|NC_011308. 316 IKKN----IQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNA-TNVVIKSRYTLLAM 388 (530) Q Consensus 316 ~~~~----~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~-SGvAik~~~~~l~~ 388 (530) ++.. ...++++.+++|.+++-++....+..+....+.....|...-.+|. ++....|+. .|..++- T Consensus 241 ~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq------- 313 (432) T protein:vir:81 241 FAKKVSGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIES------- 313 (432) T ss_pred HHHHHhhhhcCCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHH------- Confidence 2222 2335677787777766665544444445556777888888888875 232222221 1222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEe--CCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC-- Q lcl|NC_011308. 389 KAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDI--EPYILANELDLAMIDKTEAETNQIQINNLLAIAPR-- 464 (530) Q Consensus 389 ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f--~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~-- 464 (530) ....+++..|.-.++.|..-++.+-....+...+.+.| ..-+..|..+.++....+..+|+++...+++++++ T Consensus 314 ---~~~~f~~~tl~P~~~~ie~~l~~kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp 390 (432) T protein:vir:81 314 ---QQLGFLTMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPK 390 (432) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCC Confidence 11122333444444444444443222111122334444 34466788888988889999999999999988754 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 465 IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 465 vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) +++-...+ . ....+..+.... +...+.|.+..+-...+=+|| T Consensus 391 ~~g~~~~~-~-------~~~~~~pl~~~~----------------~~~~~~~~~~~~n~~~~~~~~ 432 (432) T protein:vir:81 391 LGGNAAVL-T-------VQSAMVPLDSIG----------------LQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred CCCCcceE-e-------ecCcccchhhhc----------------cCCCCCCCCCCCCcccccccC Confidence 22110000 0 000000000000 000011111111111111222 No 183 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=96.03 E-value=0.0011 Score=36.82 Aligned_cols=442 Identities=10% Similarity=0.001 Sum_probs=185.5 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc-- Q lcl|NC_011308. 13 LGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN-- 90 (530) Q Consensus 13 ~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~-- 90 (530) +-.-+++..++++++.-..+.+.+.+|..-. ...+ +... ......|+..+-....++..++.|+|- T Consensus 1 mk~~~~~~~~~lkR~~~e~~w~e~a~~tlP~--~~~~-----~~~~-----~~~~~~~~~dstg~~a~~~LAa~l~~~lt 68 (510) T protein:vir:63 1 MKTTAAMLWEKLRDGSVEQRAIEFAKTTLPY--LMVD-----PMSG-----SRGVVEHDFQSAGALLVNNLAAKLARSLF 68 (510) T ss_pred ChhHHHHHHHHHhccchHHHHHHHHHhhccc--cCCC-----CCCc-----cccccCCCccchHHHHHHHHHHHHHhhhc Confidence 3344555555554333334444555554431 1110 0000 011112444455555555555555432 Q ss_pred cee---e--ecCCc---------ch-HHHHH-------HHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEE Q lcl|NC_011308. 91 GID---V--KPTDH---------DD-QKLCY-------LIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTF 147 (530) Q Consensus 91 pv~---~--~~~~~---------~d-e~~~~-------~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~ 147 (530) |+. | ...+. .. .++.. .+...+ ..||....+++-++...+|.+ .+|.++++. +| T Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a--~l~~~~~~~-~~ 145 (510) T protein:vir:63 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNA--LLYRDSDAA-TV 145 (510) T ss_pred CCCCcccccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeE--EEEEcCCCc-EE Confidence 221 2 21110 00 11222 233333 357889999999999999998 566677653 56 Q ss_pred EEecccceEEEEcCCCCceeEEEEEEEEeec----------ccccccceEEEEEEEcCCceEEEeecCCcccchhhcccc Q lcl|NC_011308. 148 QTVDALQLLPVFDDYGTLQRIIRFYTEQRYS----------DADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTT 217 (530) Q Consensus 148 ~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~----------~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~ 217 (530) +.++-.+.++--|..+....++|-+...... ......+....+++|+.- ++.. +.... T Consensus 146 ~~~pl~~y~v~~d~~G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V----~~~~-~~~~~------- 213 (510) T protein:vir:63 146 VAWSLRSYAVRRDATGRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHV----QRKK-GTAME------- 213 (510) T ss_pred EEEEcceeEEeeCCCcCeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEE----Eeec-CCCce------- Confidence 6776556555567778777776655432111 000111122233333311 1111 00000 Q ss_pred ccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 218 VNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVKSIIDDYDLMNCFLS 292 (530) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~liDa~~~~~S~~~ 292 (530) ....+..+ .+.........+|...|++.++- ..+|.|--++..+-+-.++.+.-... T Consensus 214 ----~~sv~~e~--------------dg~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l 275 (510) T protein:vir:63 214 ----YAELYHEI--------------DGVRVGKEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLG 275 (510) T ss_pred ----EEEEEEEe--------------cCceeccccccccccCceeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHH Confidence 00000000 00011122345577778887664 35799988999999999998887777 Q ss_pred HHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHHHHHHHHHhcccCCCccc Q lcl|NC_011308. 293 NNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDIDELNIYRSGMGFNSSAVG 370 (530) Q Consensus 293 n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~ 370 (530) ........|.+.+.-.+......+. ..+.+.+..+..+++..+... .+.......++.++..|-..-+. ++..-. T Consensus 276 ~~a~~a~~~~~lv~p~g~~~~~~~~--~~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~-~l~~~~ 352 (510) T protein:vir:63 276 LYELESLEVLNLVDEAKGAVVDDYQ--DAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRD 352 (510) T ss_pred HHHHHhccCCcccCcccccchhhhc--cCCCceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHHh-hcccCC Confidence 7777777776555321111111111 122234433444556665433 45565566666666666554322 222222 Q ss_pred ccCCcHHHHHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceee---EEeCCCCCCC--HHHHHHHH Q lcl|NC_011308. 371 DGNATNVVIKSRYTLLA-MKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIK---FDIEPYILAN--ELDLAMID 444 (530) Q Consensus 371 ~gn~SGvAik~~~~~l~-~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~---i~f~~~~P~n--~~e~a~~~ 444 (530) .+..++..+..+-.-+. +.-....+.-.+.|.-+++.++.++...+.-..-...+. +++...|=+. ...+..+. T Consensus 353 ~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~ 432 (510) T protein:vir:63 353 AERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNAS 432 (510) T ss_pred CCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCchhcccceecchhHHHHHHHHHHHHHHH Confidence 23345554444322222 112223333333333333434444433332111111122 2232222221 11122222 Q ss_pred HHHHhcCCCc-------HHHHHHh----CC-----CCCCHHHHHHHHHHH----HHHHHHHHHhhhccccccCCccccCC Q lcl|NC_011308. 445 KTEAETNQIQ-------INNLLAI----AP-----RIGDEETLKAICDTL----DLDYEDVVKALEDQEVEELEPTVTPI 504 (530) Q Consensus 445 ~~~~~~g~iS-------~et~l~~----~~-----~vdd~~~e~~~~e~e----~~e~~~~~~~~~~~~~~~~~~~~~~~ 504 (530) +.+...+.+. -..++.. ++ ++.. +++++.+.++ ..++++.+..+..+... ..+.--++ T Consensus 433 q~l~~~~~~aq~~~~id~d~~~~~~a~~~Gv~p~~ivrs-~eev~a~~~~~~qq~~~~~~~~~~~~~~a~~-~~~~~~g~ 510 (510) T protein:vir:63 433 QVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKS-ADELQAEAEQQRQQAAQAQAAQETLLEGASD-MTNALAGV 510 (510) T ss_pred HHHHHhcCchhhhccCCHHHHHHHHHHHhCCChhHhcCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hcccccCC Confidence 2222222222 2222222 21 2222 2333333222 12222222333333221 11111122 No 184 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=96.02 E-value=0.0011 Score=36.79 Aligned_cols=375 Identities=13% Similarity=0.064 Sum_probs=165.5 Q ss_pred HHHHHHH---hcccchhhhcccccccccccccccc---c------------CCcceee--cCchhhHHhhhhhhhcccce Q lcl|NC_011308. 33 ARVGQRY---YNQDNDIENTRIMWMNDHGDIVEDD---N------------ASNIKIS--HGFFAELVDQKTQYLLANGI 92 (530) Q Consensus 33 ~~~~~~Y---Y~g~~~I~~r~~~~~~~~~~~~~~~---~------------~~n~ki~--~n~~k~Ivd~~~~yl~G~pv 92 (530) |. --+| ..++.-++++....+.......... . ..+.+-+ +.=.-..|+..++=+.+-|+ T Consensus 1 ~~-~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~ 79 (424) T protein:vir:18 1 ME-EPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPL 79 (424) T ss_pred CC-CCcceEeecCCCchHHHHHhhhcccccccccccccccccccccccccccccHHHhhccHHHHHHHHHHHHhhccCce Confidence 10 0122 1223333333221111100000000 0 0000000 00122345555555666677 Q ss_pred ee-ecCCcch-------HHHHHHHHHHhhcc---HHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEc Q lcl|NC_011308. 93 DV-KPTDHDD-------QKLCYLIEEYYNEE---FQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFD 160 (530) Q Consensus 93 ~~-~~~~~~d-------e~~~~~l~~~~~~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d 160 (530) .+ .....+. ..+...|+.-= |. -......+......+|.||.++-++..|++ .+..++|..|-+..+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~l~~lL~~~P-N~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~pl~~~~V~v~~~ 158 (424) T protein:vir:18 80 DVFETDQNDNRKKVDLSNPLARLLRYSP-NQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV 158 (424) T ss_pred EEEEeecCCceeeeccccHHHHHHhhcc-CCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEc Confidence 64 2221111 11222232110 21 223445567788899999999888999885 467788988876544 Q ss_pred CCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecc Q lcl|NC_011308. 161 DYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDE 240 (530) Q Consensus 161 ~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (530) + +.. +|.... .+. ...|.++.+.|++.... T Consensus 159 ~-~~~-----~y~~~~------~g~----~~~~~~~eIih~r~~~~---------------------------------- 188 (424) T protein:vir:18 159 G-KKV-----VYRYQR------DSE----YADFSQKEIFHLKGFGF---------------------------------- 188 (424) T ss_pred C-CeE-----EEEEEe------CCe----EEEeccccEEEecCcCC---------------------------------- Confidence 2 221 121110 011 11344555555431100 Q ss_pred cccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC--Cch--hhH Q lcl|NC_011308. 241 GVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN--SPV--DEI 316 (530) Q Consensus 241 ~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~--~~~--~~~ 316 (530) +.-.|.|-++.....++....+..-.++.+.-.+.|-.+++-... ++. +.+ T Consensus 189 -------------------------dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~ 243 (424) T protein:vir:18 189 -------------------------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) T ss_pred -------------------------CCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHHH Confidence 011366666655555554444444444555555667666653222 221 112 Q ss_pred HHHHh-------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHH Q lcl|NC_011308. 317 KKNIQ-------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLA 387 (530) Q Consensus 317 ~~~~~-------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~ 387 (530) +..+. .++++.+++|.+++-++....+.......+...+.|...-.+|. ++...-++..|..++-... T Consensus 244 ~~~~~~~~~g~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~--- 320 (424) T protein:vir:18 244 EENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL--- 320 (424) T ss_pred HHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHH--- Confidence 22111 23466677666666665444444445556677788888888884 2222212222222221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc--ceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC- Q lcl|NC_011308. 388 MKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSS--TDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR- 464 (530) Q Consensus 388 ~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~--~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~- 464 (530) .+++..|.-.++.|..-++.+-....+. ..+++.+..-+..|..+.++....+..+|+++...+++.+++ T Consensus 321 -------~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~ 393 (424) T protein:vir:18 321 -------GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLP 393 (424) T ss_pred -------HHHHHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 2233444444444444444332211121 224444445566788888888889999999999888887653 Q ss_pred -CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCc Q lcl|NC_011308. 465 -IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDP 521 (530) Q Consensus 465 -vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (530) +++-+ .--....-.+..+- ..+++|.+..+ T Consensus 394 pi~gGD------------------------~~~~~~n~~~l~~~---~~~~~p~~~ga 424 (424) T protein:vir:18 394 PLPGGD------------------------VAMRQSQYVPITDL---GTNKEPRNNGA 424 (424) T ss_pred CCCCcC------------------------eeeeccCccchHhh---hccCCCccCCC Confidence 11110 00000000010000 01112222222 No 185 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=96.00 E-value=0.0011 Score=36.71 Aligned_cols=441 Identities=14% Similarity=0.060 Sum_probs=174.1 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHH---HHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchh Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVS---LARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFA 77 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~---~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k 77 (530) |+-++- .+. + -+++..+.+++ +|-. +.+.+.+|.... ...+ .. ....+.|+..+-.. T Consensus 1 ~~~~~~-~e~---~-~l~~r~~~Lk~-~R~~~e~~w~e~~~~~lP~--~~~~------~~------~~~~~~~~~dstg~ 60 (517) T protein:vir:10 1 MDMRFA-GNK---S-KIPKLYEQLVG-KRSPFLSRAENYSRFTLPY--LMAD------VN------DDLSSQNAWQDDGA 60 (517) T ss_pred Cccccc-ccH---H-HHHHHHHHHHH-hhhHHHHHHHHHHHHhccc--cccC------CC------CCccccccccchHH Confidence 332221 111 1 22333333332 2323 344555554331 0110 00 11122355556666 Q ss_pred hHHhhhhhhhccc--cee---e--ecCCcc------h----HHHH-------HHHHHHh-hccHHHHHHHHHHHHhhcCe Q lcl|NC_011308. 78 ELVDQKTQYLLAN--GID---V--KPTDHD------D----QKLC-------YLIEEYY-NEEFQSAIQELVEGSTIKGY 132 (530) Q Consensus 78 ~Ivd~~~~yl~G~--pv~---~--~~~~~~------d----e~~~-------~~l~~~~-~~~~~~~~~e~~~~~~~~G~ 132 (530) ..++..++.|+|- |+. | ...+.. + ..+. ..+...+ ..||....+++.++...+|. T Consensus 61 ~a~~~LAa~l~~~ltpp~~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~ 140 (517) T protein:vir:10 61 SATNFLSNKLSQVLFPAQRSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGN 140 (517) T ss_pred HHHHHHHHHHHHhhcCCCCccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCe Confidence 6666666655542 221 2 211110 0 1111 1222233 35888999999999999999 Q ss_pred EEEEEEecCCCceEEEEecccceEEEEcCCCCceeEEEEEEEEeecc------------cccccceEEEEEEEcCCceEE Q lcl|NC_011308. 133 EGIFARTTSEDKLTFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSD------------ADNKFNSIGHADVWTDTEVWY 200 (530) Q Consensus 133 a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~------------~~~~~~~~~~~evyt~~~~~~ 200 (530) +.. |.++. ..+|+.++-.+.++--|..+....++|-.......- .....+.-..+++||.- T Consensus 141 a~l--y~~~~-~~~~~~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v---- 213 (517) T protein:vir:10 141 VMM--YHPDK-TSPIQAVPLHHYCVRRDNNGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHA---- 213 (517) T ss_pred EEE--EEeCC-CCcEEEEEcCeEEEeeCCCcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEE---- Confidence 864 55543 334555555555555677777655554433111100 00011111233444311 Q ss_pred EeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHH Q lcl|NC_011308. 201 YVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIK 275 (530) Q Consensus 201 y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e 275 (530) +...++.. . ...+. .+.........+|...|++.++- +.+|.|--+ T Consensus 214 ~~~~~~~~---------------~-----------~~~~~---d~~~~~~~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~ 264 (517) T protein:vir:10 214 KRTKDGKY---------------L-----------IRQSA---DDVPVGKESTVTEDKSPFLILTWKRSYGEDYGRGMAE 264 (517) T ss_pred EEeCCCce---------------E-----------EEEEe---CceeeccccccccccCCeeeeeeeecCCCCcccchHH Confidence 11111000 0 00000 00011112334567788877664 358999888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHH Q lcl|NC_011308. 276 KVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDID 353 (530) Q Consensus 276 ~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L 353 (530) +..+-+-.++.+.-...........|.+.+.-........+. ..+.+.+..+..+++..+... .+.......++.+ T Consensus 265 ~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~l~--~~~~g~~~~g~~~~v~~~~~~~~~d~~~~~~~i~~~ 342 (517) T protein:vir:10 265 DHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQFV--EGGSGAVLHGVEGDIHIVQLGKYADYTPIQAVLNDY 342 (517) T ss_pred HhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhcc--CCCccccccCCcccceeeecccccchhHHHHHHHHH Confidence 899999999988777777777777776665321111111111 112233434444566665433 3455556666666 Q ss_pred HHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHhcCCCcccccee Q lcl|NC_011308. 354 ELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRW--------TADLVVEDIRRRGLGDYSSTDI 425 (530) Q Consensus 354 ~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~--------~~~~i~~~l~~~~~~~~d~~~i 425 (530) +..|-..-+.-.+..-.....++..+. .++..++..++..+.+ +++.++..+...... ..+ T Consensus 343 ~~rI~~af~~~~l~~~~~~rvTAtEV~-------~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l~~----~~v 411 (517) T protein:vir:10 343 RQRIGRVFMMEAMTRRDAERVTAYEIQ-------RDAMLVEQSLGGVYSLFATTFQGPLARWFMNGISSILTS----KNV 411 (517) T ss_pred HHHHHHHHhhhhhhccCCccccHHHHH-------HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhhhcCC----CCc Confidence 665554433221221111234554443 3334444444444333 122222222111111 123 Q ss_pred eEEeCCCCCC-----CHHHHHHHHHHHHhcC--------CCcHHHHHHh----C--C--CCCCHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 426 KFDIEPYILA-----NELDLAMIDKTEAETN--------QIQINNLLAI----A--P--RIGDEETLKAICDTLDLDYED 484 (530) Q Consensus 426 ~i~f~~~~P~-----n~~e~a~~~~~~~~~g--------~iS~et~l~~----~--~--~vdd~~~e~~~~e~e~~e~~~ 484 (530) ++.+.-.+.. +...+.+.+....... .+.-..++.. + | ++.. ++|+++..+++.+.+. T Consensus 412 ~~~~~s~la~l~r~~~~~~i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~Gvp~~~irs-~~ev~~~~~~~~~~~~ 490 (517) T protein:vir:10 412 SPTILTGIEALGRMAELDKLGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQISANFPFFKT-QDELNAEAQAQQEQEA 490 (517) T ss_pred cceeeccHHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHhCCChhhcCC-HHHHHHHHHHHHHHHH Confidence 3333322221 1111111111111000 0111111111 1 1 1111 2333322222222111 Q ss_pred HHHhhhccccccCCccccCCCCCCCCCCccCcCCCC Q lcl|NC_011308. 485 VVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNID 520 (530) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (530) +....++. +.... ..-..+++.|.+.+ T Consensus 491 -~~~~~~~a-------g~~~~-~~~~~~~~~~~~~~ 517 (517) T protein:vir:10 491 -TKYAAEQA-------GKAIP-DMVKNGQINPQGGQ 517 (517) T ss_pred -HHHHHHHH-------HHHHH-HHHhCCCCCCCCCC Confidence 11111111 00111 11223444444444 No 186 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=95.86 E-value=0.0013 Score=36.34 Aligned_cols=386 Identities=9% Similarity=-0.016 Sum_probs=165.4 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |-...+.++ +..........+...+-+.... ..+..+.... -.|+ .-.-.-|+..++=+.+- T Consensus 1 ~~f~~~f~r-----~~~~~~~~~~~~~~~~~~~~~~---------~~g~~v~~~~--~l~~--~~v~~~i~~Ia~~iA~~ 62 (413) T protein:vir:48 1 MFFSGLFQR-----KSDAPVTTPAELAEAIGLSYDT---------YTGKRISSQR--AMRL--TAVYSCVRVLAESVGML 62 (413) T ss_pred Cccchhhcc-----CccCCccchHHHHHhhhcCccc---------ccCceechhh--hhcc--HHHHHHHHHHHHhhhhC Confidence 222222111 1111110111111111111000 0000000000 0011 11233566666667777 Q ss_pred ceeeecCCcc------hHHHHHHHHHHhhc--cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcC Q lcl|NC_011308. 91 GIDVKPTDHD------DQKLCYLIEEYYNE--EFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDD 161 (530) Q Consensus 91 pv~~~~~~~~------de~~~~~l~~~~~~--~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~ 161 (530) |+++.-.+.+ +..+...|+.-=+. ........+......+|.||.++..+ .|++ ....++|..+-+..+. T Consensus 63 p~~~~~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~~~L~~l~~~~v~~~~~~ 141 (413) T protein:vir:48 63 PCSLYKISGTLKTRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-LGEVVELLPIDPGCVEPKLNS 141 (413) T ss_pred ceEEEEecCCcceeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-CCcEEEEEEEcCceEEEEEcC Confidence 7775322211 11223333211011 22345566778889999999888765 4664 4667899988888776 Q ss_pred CCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccc Q lcl|NC_011308. 162 YGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEG 241 (530) Q Consensus 162 ~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 241 (530) .+.. +|.+.... + ....|..+.+.|++... T Consensus 142 ~~~~-----~y~~~~~~-----g----~~~~~~~~evih~~~~~------------------------------------ 171 (413) T protein:vir:48 142 QWQP-----VYQVTFPD-----G----SVDVLTQDEIWHVRTLT------------------------------------ 171 (413) T ss_pred CceE-----EEEEEecC-----c----eEEEEccccEEEecCcC------------------------------------ Confidence 5432 12211111 0 11235556666553211 Q ss_pred ccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC---CchhhHHH Q lcl|NC_011308. 242 VEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN---SPVDEIKK 318 (530) Q Consensus 242 ~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~---~~~~~~~~ 318 (530) + +.-.|.|-++.....|+....+..-..+.+.-.+.|-.+++.... +..+.++. T Consensus 172 --------------~---------d~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~~~~~ 228 (413) T protein:vir:48 172 --------------L---------DGLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYERLKK 228 (413) T ss_pred --------------C---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHH Confidence 0 111466766666666665554444444444544556566554322 11222332 Q ss_pred HHh--------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcH-HHHHHHHhhHH Q lcl|NC_011308. 319 NIQ--------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATN-VVIKSRYTLLA 387 (530) Q Consensus 319 ~~~--------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SG-vAik~~~~~l~ 387 (530) .++ .++++.+++|.+++-+.....+.......+.....|...-.+|. ++....++-|. .... T Consensus 229 ~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~------- 301 (413) T protein:vir:48 229 DFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELG------- 301 (413) T ss_pred HHHHHhcCccccCcceecCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHH------- Confidence 221 23455566665555555443344445566777888888888885 23221122121 1111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc--cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC- Q lcl|NC_011308. 388 MKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYS--STDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR- 464 (530) Q Consensus 388 ~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d--~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~- 464 (530) ..++...|.=+++.|...++.+-..... ...+.+.++.-+-.|..+.++...++..+|+++...+++.+++ T Consensus 302 ------~~f~~~~i~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~ 375 (413) T protein:vir:48 302 ------LGFINYSLVPYLTRIEQRINTGLVRESKQGKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMN 375 (413) T ss_pred ------HHHHHHHHHHHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 1223333444444444434332111111 1223444434344577888888889999999999888888754 Q ss_pred -CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCccccc Q lcl|NC_011308. 465 -IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEE 525 (530) Q Consensus 465 -vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (530) ++.-+ ...........+...++. .++...+.+..|.. T Consensus 376 p~~ggD--------------~~~~~~n~~~~~~~~~~~----------~~~~~~~~~~~~~~ 413 (413) T protein:vir:48 376 PRPGGD--------------VYLTPMNMTTSPSAGDDN----------GKKKESGDADKTAS 413 (413) T ss_pred CCCCcc--------------eeeccccccccccccccC----------CCCCCCCCccccCC Confidence 22111 000111000000000000 00111111111111 No 187 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=95.86 E-value=0.0013 Score=36.32 Aligned_cols=474 Identities=10% Similarity=0.048 Sum_probs=200.7 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHH---HHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchh Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVS---LARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFA 77 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~---~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k 77 (530) |-. +.-+.|++..+.+.+ +|-. +.+.+.+|..-.- ....+.. .....+.+.|+..+-+. T Consensus 1 m~~--------~~~~~l~~r~~~l~~-~R~~~e~~w~e~~~~~lP~~------~~~~~~~---~~~~~~~~~~~~dst~~ 62 (556) T protein:vir:73 1 MAE--------TEKERLLKQLAQLKN-ERTSFESHWLDLSDFINPRG------SRFLTSD---VNRDDRRNTKIVDPTGS 62 (556) T ss_pred CCh--------hhHHHHHHHHHHHHH-HhhHHHHHHHHHHHHhcccc------CCcCCCC---CCcchhhcCccccchHH Confidence 111 112223444444433 3333 3344445432110 0000000 00111223466666677 Q ss_pred hHHhhhhhhhccc--ce-----eeecCCcc---hHH-------HHHHHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEe Q lcl|NC_011308. 78 ELVDQKTQYLLAN--GI-----DVKPTDHD---DQK-------LCYLIEEYY-NEEFQSAIQELVEGSTIKGYEGIFART 139 (530) Q Consensus 78 ~Ivd~~~~yl~G~--pv-----~~~~~~~~---de~-------~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~ 139 (530) ..++..++.|++- |+ ++...+.+ ... ....+...+ ..||....+++.++...+|.+..++-. T Consensus 63 ~a~~~Las~l~~~ltpp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~ 142 (556) T protein:vir:73 63 MAQRILSSGMMSGITSPARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVME 142 (556) T ss_pred HHHHHHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeee Confidence 7777777666542 21 12222211 111 222233333 357888899999999999999877666 Q ss_pred cCCCceEEEEecccceEEEEcCCCCceeEEEEEEEEeec-------cc--------ccccceEEEEEEEcCCceEEEeec Q lcl|NC_011308. 140 TSEDKLTFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYS-------DA--------DNKFNSIGHADVWTDTEVWYYVQK 204 (530) Q Consensus 140 d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~-------~~--------~~~~~~~~~~evyt~~~~~~y~~~ 204 (530) +..+-+++..++..+++.--|..+....++|.+...... .. ...+..-.+++++.. .|... T Consensus 143 ~~~~~~r~~~~~l~~~~~~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~----V~pr~ 218 (556) T protein:vir:73 143 DDQDVIRTMPFPIGSYYLANSPRGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHC----ITPNV 218 (556) T ss_pred cCCceEEEEEeecceeEEeeCCCCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEE----Eeccc Confidence 666678899999999988888888887777765432110 00 001110112232210 01111 Q ss_pred CCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCc-HHHHH Q lcl|NC_011308. 205 DEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISD-IKKVK 278 (530) Q Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd-~e~v~ 278 (530) +..... .+ .....+..+. ...+. .........+|...|++.++- +.+|.|- -++.. T Consensus 219 ~~~~~~---~~-----~~~~p~~s~~-------~~~~~---~~~~vl~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~~l 280 (556) T protein:vir:73 219 NRDSGK---MD-----SKNKPYRSVY-------FESGG---DSDKLLRESGFDEFPILAPRWEVNGEDVYASSCPGMLAL 280 (556) T ss_pred cccccc---cC-----cccceEEEEE-------EEecC---CCceecccCCcccCCceeeeeeecCCcccccCccHHHhH Confidence 100000 00 0000000000 00000 000111223566667776553 4579985 88899 Q ss_pred HHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceec--CCC-CceeEEE-ecCCHHHHHHHHHHHH Q lcl|NC_011308. 279 SIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQT--KGE-GGLDIQT-VDIPYEARKAKMDIDE 354 (530) Q Consensus 279 ~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~--~~~-~~~~~lt-~~~~~~~~e~~ld~L~ 354 (530) +-+..++.+.-......+...+|.+.+-...... ..++..++++.. .++ .+++.+. ...+...+...++.++ T Consensus 281 gD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~----~~~~~pgg~~~~~~~~~~~~i~p~~~~~~d~~~~~~~i~~~~ 356 (556) T protein:vir:73 281 GQVKALQVEQKRKAQLIDKATNPPMVAPTSLKNQ----RVSLLPGDVTYLDVISGQDGFKPAYLVNPNTADLLADIQDTR 356 (556) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCceecccccccc----ceeeccCccccccCCCCccceeeeccccccHHHHHHHHHHHH Confidence 9999999999999999999999988864422111 123344444422 222 3345442 2334555666677777 Q ss_pred HHHHHHhcccC----CCcccccCCcHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCC-----Cccccce Q lcl|NC_011308. 355 LNIYRSGMGFN----SSAVGDGNATNVVIKSRYTLLAM-KAQKTEIALRKTLRWTADLVVEDIRRRGL-----GDYSSTD 424 (530) Q Consensus 355 ~~I~~~s~~p~----~~~~~~gn~SGvAik~~~~~l~~-ka~~ke~~f~~~l~~~~~~i~~~l~~~~~-----~~~d~~~ 424 (530) ..|= .+...| ++..+..+.+...+..+-.-+.. ......+.-.+.|.=+++-++.++...+. ....... T Consensus 357 ~rI~-~af~~d~~~~l~~~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~ 435 (556) T protein:vir:73 357 QTIN-SAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMP 435 (556) T ss_pred HHHH-HHhhcchhhhhccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCce Confidence 6663 333322 22333344455544443322222 22223333333333333333444443332 1233456 Q ss_pred eeEEeCCCCCCCH--HHHHHHHHHHHhcCC-----------CcHHHHHHhC----C----CCCCHHHHHHHHHHHHHHHH Q lcl|NC_011308. 425 IKFDIEPYILANE--LDLAMIDKTEAETNQ-----------IQINNLLAIA----P----RIGDEETLKAICDTLDLDYE 483 (530) Q Consensus 425 i~i~f~~~~P~n~--~e~a~~~~~~~~~g~-----------iS~et~l~~~----~----~vdd~~~e~~~~e~e~~e~~ 483 (530) |++.|...|-+.. .+...+.+.+...|. +.-..++..+ + .+.. +++.+.+.+.+.+.. T Consensus 436 i~v~yis~La~aqk~~~~~~i~~~~~~~~~laq~~Pe~~d~id~d~~~~~~a~~~Gvp~~~irs-~eev~~~rq~r~~~q 514 (556) T protein:vir:73 436 LRIEYISVMAQAQKSIGLTSLSQTVGFIGQLAQFKPEALDKLDVDQAIDAFSEMSGVSPTVIVP-QEQVQGIREERAKQA 514 (556) T ss_pred eEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCChhhcCC-HHHHHHHHHHHHHHH Confidence 7777766554321 111111111111111 2223333221 1 1221 233334333332222 Q ss_pred HHHHhhhccc--c---ccCCccccCCCCCCCCCCccCcCCCCcccccccCC Q lcl|NC_011308. 484 DVVKALEDQE--V---EELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQ 529 (530) Q Consensus 484 ~~~~~~~~~~--~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (530) ..+....... . ....+-+ ++.+..+..--...+-|-| T Consensus 515 q~~~~~~~~~~a~~~~~~~~~~~---------~~~~~~l~~~~~~~g~~~~ 556 (556) T protein:vir:73 515 QAAQAMAMGQAAAQGAKTLSETQ---------TSDPSALTAIANAAGAPQQ 556 (556) T ss_pred HHHHHHHHHHHHHHHHHHhhhcc---------CCCHHHHHHHHHhhcCCCC Confidence 2222111110 0 0011110 0011001100011112222 No 188 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=95.81 E-value=0.0014 Score=36.19 Aligned_cols=468 Identities=11% Similarity=0.030 Sum_probs=206.6 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhh--hHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQ--NVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~--~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) |-+.....-. ++-+++..+.+++.. -..+.+.+.+|.... .+.+. . .. ......|+..+-... T Consensus 1 m~~~~~~~~~---~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~--~~~~~-----~--~~---~~~~~~~~~dst~~~ 65 (535) T protein:vir:15 1 MADSKRTGLG---EDGAKATYDRLTNDRRAYETRAENCAQYTIPS--LFPKE-----S--DN---ESTDYTTPWQAVGAR 65 (535) T ss_pred CCccchhccc---hHHHHHHHHHHHHHhhHHHHHHHHHHHHhccc--ccCCC-----C--Cc---ccccccccccccHHH Confidence 5555433323 233445555554421 133445566664432 11110 0 00 001112455555556 Q ss_pred HHhhhhhhhccc--cee--ee--cCC----------cchH-------HHHHHHHHHh-hccHHHHHHHHHHHHhhcCeEE Q lcl|NC_011308. 79 LVDQKTQYLLAN--GID--VK--PTD----------HDDQ-------KLCYLIEEYY-NEEFQSAIQELVEGSTIKGYEG 134 (530) Q Consensus 79 Ivd~~~~yl~G~--pv~--~~--~~~----------~~de-------~~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~ 134 (530) .++..++.|+|- |.+ |. ..+ .+.. .+...+...+ ..||....+++.++...+|.+. T Consensus 66 a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ 145 (535) T protein:vir:15 66 GLNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNAL 145 (535) T ss_pred HHHHHHHHHHHhhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcee Confidence 666666555531 221 21 111 0001 1222233333 3678899999999999999997 Q ss_pred EEEEecCCCceEEEEecccceEEEEcCCCCceeEEEEEEEEeecc----------cccccceEEEEEEEcCCceEEEeec Q lcl|NC_011308. 135 IFARTTSEDKLTFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSD----------ADNKFNSIGHADVWTDTEVWYYVQK 204 (530) Q Consensus 135 ~~~y~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~----------~~~~~~~~~~~evyt~~~~~~y~~~ 204 (530) .++-.+..+.++|+.++-.+.++..|..+....++|.+......- .....+.-..+++|+.- |... T Consensus 146 l~~~~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v----~~~~ 221 (535) T protein:vir:15 146 LYLPEPEGSYNPMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHV----YLDE 221 (535) T ss_pred EEeecCCCCceeeEEEEcCeeEEeeCCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEE----EEec Confidence 665555556678888887777777888888888877665321100 00011112223333311 1111 Q ss_pred CCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHHH Q lcl|NC_011308. 205 DEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVKS 279 (530) Q Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~ 279 (530) +++. ...+..+ ............+|...|++.++- +.+|.|--++..+ T Consensus 222 ~~~~--------------~~~~~e~-------------~g~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~ 274 (535) T protein:vir:15 222 ESGD--------------YLKYEEV-------------EDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLG 274 (535) T ss_pred CCCc--------------EEEEEEe-------------eCccccccccccccccCCceeeeeeecCCCccccchHHHHHH Confidence 1110 0000000 000011112345677788887664 3589999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHHHHHH Q lcl|NC_011308. 280 IIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDIDELNI 357 (530) Q Consensus 280 liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L~~~I 357 (530) -+..++.+.-..........+|.+++.-.+......+. ..+.+.+..+..+++..+... .+.......++.++..| T Consensus 275 D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~l~--~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I 352 (535) T protein:vir:15 275 DLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLT--KAQTGDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARL 352 (535) T ss_pred HHHHHHHHHHHHHHHHHHHhcCceeecccccccchhcc--cCCceeeecCCcccceeeecccccchhHHHHHHHHHHHHH Confidence 99999999999999999999998776322222211111 122344544556677766533 35666666677776666 Q ss_pred HHHhcccCCCcccccCCcHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CccccceeeEEeCCCCCC Q lcl|NC_011308. 358 YRSGMGFNSSAVGDGNATNVVIKSRYTLLAM-KAQKTEIALRKTLRWTADLVVEDIRRRGL-GDYSSTDIKFDIEPYILA 435 (530) Q Consensus 358 ~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~-ka~~ke~~f~~~l~~~~~~i~~~l~~~~~-~~~d~~~i~i~f~~~~P~ 435 (530) -..-+.-.+.....+..++..+..+-.-+.. .-....+.-.+.|.=+++-++.++...+. .......++++|.-.+.+ T Consensus 353 ~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~ 432 (535) T protein:vir:15 353 SYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEA 432 (535) T ss_pred HHHHhhhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHH Confidence 4322211111122233455444433222221 11222222233333334444444444432 223344578888766654 Q ss_pred CH--HHHHHHHHHHHhcCCCcHH---------HHHHh----C--C---CCCCHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|NC_011308. 436 NE--LDLAMIDKTEAETNQIQIN---------NLLAI----A--P---RIGDEETLKAICDTLDLDYEDVVKALEDQEVE 495 (530) Q Consensus 436 n~--~e~a~~~~~~~~~g~iS~e---------t~l~~----~--~---~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~ 495 (530) .. .++..++........++.+ .++.. + | ++.. +++.+++.+.+.+.+........... T Consensus 433 aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~-~eev~~~~~q~~~~~~~~~~a~~~g~- 510 (535) T protein:vir:15 433 IGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLT-DEQKQALMMQDAAQTGIENAAATGGA- 510 (535) T ss_pred HHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCC-HHHHHHHHHHHHHHHHHHHHHHHHHh- Confidence 32 1111111111111112222 22211 1 1 1222 33333333333222222222211100 Q ss_pred cCCccccCCCCCCCCCCccCcCCCCcccc--cccCC Q lcl|NC_011308. 496 ELEPTVTPIIDPLTIEPQPEPLNIDPVIE--EEPVQ 529 (530) Q Consensus 496 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~ 529 (530) .+... ++..|..-++.++ +-|-+ T Consensus 511 -------~~~~~----~~~~p~~~~~~~~~~g~~~~ 535 (535) T protein:vir:15 511 -------GVGAL----ATSSPEAMQGAAAQAGLDAT 535 (535) T ss_pred -------hccch----hccChHHHHHHHhccCCCCC Confidence 00000 0111111111000 00111 No 189 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=95.79 E-value=0.0015 Score=36.14 Aligned_cols=381 Identities=10% Similarity=-0.024 Sum_probs=163.1 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |-+.+.+.. +..............++.+....- +.. . ....=.........|+..++=+.+- T Consensus 1 Mg~f~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~----------~~~--~--~~~~~~~~~~v~~~i~~ia~~ia~~ 62 (406) T protein:vir:95 1 MGLFDRWRR----TKRKSKIRADTGYVGLFMSGEDVS----------FLV--P--GYVRLSDNPEVRMAVHKIADLISSM 62 (406) T ss_pred Ccchhhhcc----ccccccccccchhhhhhccCcccC----------ccc--c--CHHHHhhcHHHHHHHHHHHHhhccC Confidence 333322111 110000000001111111110000 000 0 0000011233455677777777777 Q ss_pred ceeee-cCCcchHHHHHHHHH-Hhh--c---cHHHHHHHHHHHHhhcCeEEEE--EEecCCCce-EEEEecccceEEEEc Q lcl|NC_011308. 91 GIDVK-PTDHDDQKLCYLIEE-YYN--E---EFQSAIQELVEGSTIKGYEGIF--ARTTSEDKL-TFQTVDALQLLPVFD 160 (530) Q Consensus 91 pv~~~-~~~~~de~~~~~l~~-~~~--~---~~~~~~~e~~~~~~~~G~a~~~--~y~d~~g~~-~~~~~~p~~~~~v~d 160 (530) |+.+- ..+.+.+....-+.. ++. | ........+......+|.++.+ +-++..|.+ .+..++|..+-++.+ T Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~~i~~~~v~~~~~ 142 (406) T protein:vir:95 63 TIYLMQNTEDGDIRIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELVPLTPSKVNFLDT 142 (406) T ss_pred ceEEEEecCCcceeecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEEEEcCceeEEEEc Confidence 88752 222211111111222 221 2 2234556677777777766544 445666665 366788888877665 Q ss_pred CCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecc Q lcl|NC_011308. 161 DYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDE 240 (530) Q Consensus 161 ~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (530) ..+ |.... . . ..|..+.+.|++..... T Consensus 143 ~~~--------~~~~~-~-----~------~~~~~~evih~~~~~~~--------------------------------- 169 (406) T protein:vir:95 143 PDG--------YQVLY-G-----G------QTFNYDEVLHFIYNPDP--------------------------------- 169 (406) T ss_pred CCe--------EEEEe-c-----c------EEEchhHEEEeeccCCC--------------------------------- Confidence 432 11100 0 0 12344444444321100 Q ss_pred cccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC---CchhhHH Q lcl|NC_011308. 241 GVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN---SPVDEIK 317 (530) Q Consensus 241 ~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~---~~~~~~~ 317 (530) + +.-.|.|-++.....++....+..-.++.+.-.+.|-.+++-... +..++++ T Consensus 170 ---------------~---------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~ 225 (406) T protein:vir:95 170 ---------------E---------RPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDAATAELSSEEGR 225 (406) T ss_pred ---------------C---------CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHH Confidence 0 001366777766666666666655555555555566555543221 2222222 Q ss_pred HH----Hh----hCcceecCCCC-ceeEEE-ecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhHH Q lcl|NC_011308. 318 KN----IQ----SKKIIQTKGEG-GLDIQT-VDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLA 387 (530) Q Consensus 318 ~~----~~----~~~~i~~~~~~-~~~~lt-~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~ 387 (530) .. .. .++++.+..++ .+.-++ ....+.......+.....|...-.+|. .-+|..++..- . T Consensus 226 ~~~~~~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~---~~lg~~~~~~~--~----- 295 (406) T protein:vir:95 226 NAVFKKYLQATEAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPA---FLLGIGEFNRD--E----- 295 (406) T ss_pred HHHHHHhccccccCCceeecCCCccccccccCChhHHHHHHHHHHHHHHHHHHhCCCH---HHcCCCCchHH--H----- Confidence 22 11 12233444443 232222 222333344556777788887777773 22343343211 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCC-- Q lcl|NC_011308. 388 MKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRI-- 465 (530) Q Consensus 388 ~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~v-- 465 (530) ...++...|.-.++.|...++.+-..+.+ ..+.+.++.-+-.|..+.++....+..+|+++...+++.+++- T Consensus 296 -----~~~~~~~~l~P~~~~ie~~l~~~l~~~~~-~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~ 369 (406) T protein:vir:95 296 -----YNNFINSTILPIAKGIEQELTRKLLISPD-LYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPK 369 (406) T ss_pred -----HHHHHHHHHHHHHHHHHHHHHHhcCCCCC-cEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 12355666666666666655543222211 2355555555567888889988899999999999999988652 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccC Q lcl|NC_011308. 466 GDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPE 515 (530) Q Consensus 466 dd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (530) ++-+.-. +- ..+..+.........+.++. + .+..+.+ T Consensus 370 ~~gd~~~--~~-------~n~~~~~~~~~~~~~k~g~~--~--~~~~~~~ 406 (406) T protein:vir:95 370 EGLSELV--IL-------ENYIPLDKIGDQSKLKGGDN--S--GADGQTD 406 (406) T ss_pred CCcceee--ec-------cCccchhhcccccccCCCCC--C--CCCCCCC Confidence 2211000 00 00000000000000000000 0 0000000 No 190 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=95.71 E-value=0.0016 Score=35.95 Aligned_cols=411 Identities=10% Similarity=-0.006 Sum_probs=190.6 Q ss_pred CCcccccCC-cccHHHH----HHHH--HHHHHHhh-----hHHH-HHHHHHHhcccc----hhhhccccccccccccccc Q lcl|NC_011308. 1 MTNTLLTTA-PDRLGTI----LSTK--IDEYIRSQ-----NVSL-ARVGQRYYNQDN----DIENTRIMWMNDHGDIVED 63 (530) Q Consensus 1 ~~~~~~~~~-~~~~~~~----i~~~--i~~~~~~~-----~~~~-~~~~~~YY~g~~----~I~~r~~~~~~~~~~~~~~ 63 (530) |..++...- |-....+ ...+ +......+ -+.+ ...++.--.|.- ++... . .+ T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~-m---------~e- 69 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMD-M---------EE- 69 (526) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHH-H---------Hh- Confidence 666663321 2221111 0110 00000000 0011 112222111211 01000 0 00 Q ss_pred ccCCcceeecCchhhHHhhhhhhhcccceeeecCC---cchHHHHHHHHHHhhc--cHHHHHHHHHHHHhhcCeEE-EEE Q lcl|NC_011308. 64 DNASNIKISHGFFAELVDQKTQYLLANGIDVKPTD---HDDQKLCYLIEEYYNE--EFQSAIQELVEGSTIKGYEG-IFA 137 (530) Q Consensus 64 ~~~~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~---~~de~~~~~l~~~~~~--~~~~~~~e~~~~~~~~G~a~-~~~ 137 (530) ......-.+.+...-++|.+..+.... ..++...+++++++++ +|.+++..+. ++.-+|.+. +++ T Consensus 70 --------~D~~i~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-da~~~G~s~~Eiv 140 (526) T protein:vir:99 70 --------RDAHLFAEMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELE 140 (526) T ss_pred --------hChHHHHHHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHHH-HhhhhcceeEEEE Confidence 023445566667777889998887543 2356677788888864 5777777665 577788854 677 Q ss_pred EecCCCceEE---EEecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhc Q lcl|NC_011308. 138 RTTSEDKLTF---QTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVL 214 (530) Q Consensus 138 y~d~~g~~~~---~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~ 214 (530) |.-..|.+.. ...+|..+ .|+...... ++..++.... T Consensus 141 w~~~~g~~~~~~l~~r~~~~f--~~~~~~~~~----------------------------------l~~~~~~~~g---- 180 (526) T protein:vir:99 141 WALQGREWMPLAFHHRPQSWF--QLNPEDQNE----------------------------------LRLRDNSPAG---- 180 (526) T ss_pred EeecCCceeEEEeeeecccce--eeccCCCcE----------------------------------EEecCCCCCc---- Confidence 7665665443 33333322 122222111 0000000000 Q ss_pred cccccccccceeeeeecccccceecccccccccccccccccCCccceEEee--CCcCCCCcHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 215 DTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILY--NNKLGISDIKKVKSIIDDYDLMNCFLS 292 (530) Q Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~--nn~~~~sd~e~v~~liDa~~~~~S~~~ 292 (530) ..-.+++.|-+++-. .+..|.|.+..+-...=-=+..+.+.+ T Consensus 181 ------------------------------------~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~ 224 (526) T protein:vir:99 181 ------------------------------------EALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLA 224 (526) T ss_pred ------------------------------------eeecCCCeEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHH Confidence 000112222222111 234677877766554444456888999 Q ss_pred HHHHHhccceeeeecCCCCchhh------HHHHHhhCcceecCCCCceeEEEec-CCHHHHHHHHHHHHHHHHHHhcccC Q lcl|NC_011308. 293 NNLQDMAEAIYVVRGGTNSPVDE------IKKNIQSKKIIQTKGEGGLDIQTVD-IPYEARKAKMDIDELNIYRSGMGFN 365 (530) Q Consensus 293 n~~~~~~~~~lvl~g~~~~~~~~------~~~~~~~~~~i~~~~~~~~~~lt~~-~~~~~~e~~ld~L~~~I~~~s~~p~ 365 (530) ..++.|..|+.+.+=..+...++ ...++....+..++.+..+++++.. ...+.++..++...+.|-+.--+-. T Consensus 225 ~f~E~yG~P~~igky~~~a~~~ek~~L~~av~~i~~d~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqt 304 (526) T protein:vir:99 225 EMLEIYGLPIRLGKYPPGTADEEKATLLRAVTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGT 304 (526) T ss_pred HHHHHcCCceEEEecCCCCCHHHHHHHHHHHHHHhhCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhh Confidence 99999999999987322222221 2234556667778999999999854 4556678888999998887765555 Q ss_pred CCccc-ccCC----cH-HHHHHHHhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCccc-cceeeEEeCCCCCCCH Q lcl|NC_011308. 366 SSAVG-DGNA----TN-VVIKSRYTLLAMKAQKTEIALRKTLR-WTADLVVEDIRRRGLGDYS-STDIKFDIEPYILANE 437 (530) Q Consensus 366 ~~~~~-~gn~----SG-vAik~~~~~l~~ka~~ke~~f~~~l~-~~~~~i~~~l~~~~~~~~d-~~~i~i~f~~~~P~n~ 437 (530) ++.+. -|+. -| +.-+.+..-....| +.....|. +++..+ +.......-+ .....++|...-|.|. T Consensus 305 lTs~~~~g~~gS~a~g~vh~~v~~di~~aDa----~~i~~tln~~Li~~l---~~~N~~~~~~~~~~p~~~~~~~e~eDl 377 (526) T protein:vir:99 305 LTSTTSQSGGGAFALGQVHNEVRHDLLASDA----RQLAATLSRDLLWPL---LVLNRPGSPDVRRAPRLVFDLREQADI 377 (526) T ss_pred hccccccCcchhhhHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHH---HHhCCCCcCCccccceEEeCCCCcccH Confidence 54332 1222 12 22222222222222 33333442 233333 3333322111 1236788888889999 Q ss_pred HHHHHHHHHHHhcCC-CcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCc Q lcl|NC_011308. 438 LDLAMIDKTEAETNQ-IQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEP 516 (530) Q Consensus 438 ~e~a~~~~~~~~~g~-iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (530) ++.|+.+..+...|+ +|.+.+.+.+++=. +... +........... .....+. ..........| T Consensus 378 ~~~a~~~~~L~~~G~~i~~~~i~e~~Gip~-~~~~-----------e~~l~~~~~~~~---~~~~~~~-~~~~~~~~~~~ 441 (526) T protein:vir:99 378 TSMAQSIPALVNVGLEIPSAWVYDKLGIPQ-PAKN-----------EPVLRSAAQPAI---LSRQHGQ-RVAALATIVGP 441 (526) T ss_pred HHHHHHHHHHHhCCCccCHHHHHHHhCCCC-CCCc-----------ccccCCCCCCcc---ccccccc-ccccccccccc Confidence 999999999999996 89999888886511 1100 000100000000 0000000 00000000001 Q ss_pred CCCCcccccccCCC Q lcl|NC_011308. 517 LNIDPVIEEEPVQE 530 (530) Q Consensus 517 ~~~~~~~~~~~~~~ 530 (530) ...++...++.+.. T Consensus 442 ~~~~~~~~d~~l~~ 455 (526) T protein:vir:99 442 RYGDQQALDKALAD 455 (526) T ss_pred cCcchhhHHHHHHH Confidence 11111000000000 No 191 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=95.57 E-value=0.0018 Score=35.61 Aligned_cols=458 Identities=8% Similarity=0.016 Sum_probs=199.0 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhh--hHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQ--NVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~--~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) |-. .+. .+ .+-+++..+.+++.. -..+.+.+.+|....- +. ..... ......|+..+-+.. T Consensus 1 ~~~---~~~-~~-~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~--~~-----~~~~~-----~~~~~~~~~dst~~~ 63 (522) T protein:vir:94 1 MAE---REG-FA-AEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSL--FP-----KESDN-----SSTEYTTPWQAVGAR 63 (522) T ss_pred Ccc---cch-hh-HHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--cC-----CCCCc-----ccccccccccccHHH Confidence 322 111 11 122445555554421 1233445555543311 10 00000 011122455666666 Q ss_pred HHhhhhhhhccc-ce---eeec--CC---------cc-hHHH-------HHHHHHHh-hccHHHHHHHHHHHHhhcCeEE Q lcl|NC_011308. 79 LVDQKTQYLLAN-GI---DVKP--TD---------HD-DQKL-------CYLIEEYY-NEEFQSAIQELVEGSTIKGYEG 134 (530) Q Consensus 79 Ivd~~~~yl~G~-pv---~~~~--~~---------~~-de~~-------~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~ 134 (530) .++..++.|+|- .| =|.. .+ .. ...+ ...+...+ ..||....+++.++...+|.+. T Consensus 64 a~~~Las~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ 143 (522) T protein:vir:94 64 CLNNLAAKLMLALFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCL 143 (522) T ss_pred HHHHHHHHHHhhcCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEe Confidence 666666665542 22 1111 11 01 1112 12222333 3678889999999999999998 Q ss_pred EEEEecCCCc-eEEEEecccceEEEEcCCCCceeEEEEEEEEeec--------ccccccceEEEEEEEcCCceEEEeecC Q lcl|NC_011308. 135 IFARTTSEDK-LTFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYS--------DADNKFNSIGHADVWTDTEVWYYVQKD 205 (530) Q Consensus 135 ~~~y~d~~g~-~~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~--------~~~~~~~~~~~~evyt~~~~~~y~~~~ 205 (530) .++--+..|. .+++.++-.+.++-.|..+....++|.+...... -.....+....++||+. .... T Consensus 144 l~~~~~~~~~~~~~~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~------v~~~ 217 (522) T protein:vir:94 144 LYIPEPEQGTYSPMRMYRLVSYVVQRDAFGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTH------IYRQ 217 (522) T ss_pred EeeeccCCCceeeEEEEEcceEEEeeCCCcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEE------EEee Confidence 6655554444 3466667666666667788777777665432110 00011112234444432 1111 Q ss_pred CcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHHHH Q lcl|NC_011308. 206 EGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVKSI 280 (530) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~l 280 (530) .+.. ..+. .+ ............+|...|++.++- +.+|.|--++..+- T Consensus 218 ~~~~-------------~~~~-~~-------------~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D 270 (522) T protein:vir:94 218 DDEY-------------LRYE-EV-------------EGIEVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGD 270 (522) T ss_pred CCce-------------eEEe-ec-------------cCceecccCCCCccccCCceeeeeeecCCCccccchHHHHHHH Confidence 1100 0000 00 000000111234677788877664 35899999999999 Q ss_pred HHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHHHHHHH Q lcl|NC_011308. 281 IDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDIDELNIY 358 (530) Q Consensus 281 iDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L~~~I~ 358 (530) +..++.+.-..........+|.+++.-.+......+. ..+.+.+..+..+++..+... .+.......++.++..|- T Consensus 271 ~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~~--~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~ 348 (522) T protein:vir:94 271 LNSLETITEAITKMAKVASKVVGLVNPNGITQPRRLN--KAATGEFVAGRVEDINFLQLTKGQDFTIAKSVADAIEQRLG 348 (522) T ss_pred HHHHHHHHHHHHHHHHHHhCCceeecccccccchhee--ccCCceeecCCcccceeeecccccchhHHHHHHHHHHHHHH Confidence 9999999999999999999998877422221211111 112334544555666655433 356666777777777765 Q ss_pred HHhcccCCCcccccCCcHHHHHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CccccceeeEEeCCCCCCC Q lcl|NC_011308. 359 RSGMGFNSSAVGDGNATNVVIKSRYTLLA-MKAQKTEIALRKTLRWTADLVVEDIRRRGL-GDYSSTDIKFDIEPYILAN 436 (530) Q Consensus 359 ~~s~~p~~~~~~~gn~SGvAik~~~~~l~-~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~-~~~d~~~i~i~f~~~~P~n 436 (530) ..-+.-.+..-..+..++..+..+-.-+. +.-....+.-.+.|.=+++-++.++...+. .......+++.|.-.|.+- T Consensus 349 ~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~v~~~s~La~~ 428 (522) T protein:vir:94 349 WAFLLNSAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPDLPKEAVEPTVSTGLEAL 428 (522) T ss_pred HHHhhhhhccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcccEEeeEecHHHHH Confidence 54432222222223445555443322222 112223333333333333334444433332 1223334777776655532 Q ss_pred H--HHHHHHHHHHHhcCCCcHHHH---------H----HhCC-----CCCCHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|NC_011308. 437 E--LDLAMIDKTEAETNQIQINNL---------L----AIAP-----RIGDEETLKAICDTLDLDYEDVVKALEDQEVEE 496 (530) Q Consensus 437 ~--~e~a~~~~~~~~~g~iS~et~---------l----~~~~-----~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~ 496 (530) . .++..+...+...+.++.+.+ + ..++ ++.. ++|.+.+.+++.+.+..+.....+.. T Consensus 429 qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~ivr~-~ee~~~~~~q~~~~~~~~~~~~~~~~-- 505 (522) T protein:vir:94 429 GRGQDLEKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDTAGLLLT-QDEKIQRMAEQSSQQAVVQGASAAGA-- 505 (522) T ss_pred HHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCChhhccCC-HHHHHHHHHHHHHHHHHHHHHHHHHH-- Confidence 1 111111111111111222222 1 1111 1222 23333333332222222211111110 Q ss_pred CCccccCCCCCCCCCCccCcCC Q lcl|NC_011308. 497 LEPTVTPIIDPLTIEPQPEPLN 518 (530) Q Consensus 497 ~~~~~~~~~~~~~~~~~~~~~~ 518 (530) +.+... .++.++.-... T Consensus 506 --~~~a~~---~~~~~~~~~~~ 522 (522) T protein:vir:94 506 --NMGAAV---GQGAGEDMAQA 522 (522) T ss_pred --Hhhhhh---hcccchhhhcC Confidence 000000 00111110000 No 192 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=95.45 E-value=0.0021 Score=35.32 Aligned_cols=370 Identities=9% Similarity=0.005 Sum_probs=152.1 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |.+....++.-..... ...... .... ..++... ..+.. ..+..=+.+.-....|+..++=+.+- T Consensus 1 M~~f~~~~~~~~~~~~-~~~~~~----~~~~--~~~~~~~-----~~~~~----v~~~~~~~~~~v~~~i~~ia~~ia~~ 64 (386) T protein:vir:48 1 MPIFNITNLATESPPI-SQGGFF----DITD--PDFLSTL-----NGSEW----VSAESALRNSDLFSIINQLSNDLATV 64 (386) T ss_pred Cccccccccccccccc-cccccc----cccc--chhcccc-----cCCce----echhhhhcchHHHHHHHHHHHhhccC Confidence 3322221111000000 000000 0000 0000000 00000 00000011122334555555556666 Q ss_pred ceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCceeE Q lcl|NC_011308. 91 GIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQRI 168 (530) Q Consensus 91 pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~ 168 (530) |+++.- . .....+.+-.. -........+..+...+|.||.++-++..|.+ .+..++|..+-+..+..+... T Consensus 65 p~~~~~--~---~~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~-- 137 (386) T protein:vir:48 65 KLTASR--K---QLQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGI-- 137 (386) T ss_pred ceeecc--c---hhHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceE-- Confidence 776531 1 11122221111 12234456677888999999998888888875 467789998877766544321 Q ss_pred EEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccc Q lcl|NC_011308. 169 IRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGR 248 (530) Q Consensus 169 ~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 248 (530) +|.+..... .......+..+.+.|++.... T Consensus 138 --~y~~~~~~~------~~~~~~~~~~~evih~~~~~~------------------------------------------ 167 (386) T protein:vir:48 138 --YYNITFDDP------RIPPKQHVPQGDVLHFKLLSV------------------------------------------ 167 (386) T ss_pred --EEEEEecCc------cccceeEecCccEEEecCCCC------------------------------------------ Confidence 222211110 011122344555555532110 Q ss_pred cccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHH-------- Q lcl|NC_011308. 249 QVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNI-------- 320 (530) Q Consensus 249 ~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~-------- 320 (530) . ..-.|.|.+......|.....+..-..+.+.--+.|-.+++-...... +....+ T Consensus 168 -------~---------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~-e~~~~~~~~~~~~~ 230 (386) T protein:vir:48 168 -------D---------GGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGLL-DFKTKLSRSRQAMK 230 (386) T ss_pred -------C---------CceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCH-HHHHHHHHHHHHhh Confidence 0 001366766665555555544444444444544556666654332221 111111 Q ss_pred -hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCC--Ccccc-cCCcHHHHHHHHhhHHHHHHHHHHH Q lcl|NC_011308. 321 -QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNS--SAVGD-GNATNVVIKSRYTLLAMKAQKTEIA 396 (530) Q Consensus 321 -~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~--~~~~~-gn~SGvAik~~~~~l~~ka~~ke~~ 396 (530) ..++++.+++|.+++-++............+.+.+.|...-.+|.. +..+. ++.....+. + T Consensus 231 ~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~e~~~~~---------------~ 295 (386) T protein:vir:48 231 QMQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSLD---------------L 295 (386) T ss_pred cCCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHH---------------H Confidence 1234455665555555543333334455667778888888888842 22111 111222222 3 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCC--CCCCHHHHHHH Q lcl|NC_011308. 397 LRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAP--RIGDEETLKAI 474 (530) Q Consensus 397 f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~--~vdd~~~e~~~ 474 (530) ++..|.-.++.|..-++.+-..++ .+.+...+-.+....+..+..+..+|++++-.+++.++ .+... +..+ T Consensus 296 ~~~~l~P~~~~ie~~l~~~l~~~~-----~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~--~~~~ 368 (386) T protein:vir:48 296 YNKAVSRYLRPFLSELSQKLSCDV-----DADILPAVDPTGSNSVSRINSMVKSGTLAQNQGLYILQQAEILPK--ELPE 368 (386) T ss_pred HHHHHHHHHHHHHHHHHHhhcchh-----hcchhhhhccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCCCc--cchh Confidence 333344444444333333221111 11111122233345556667788899999988888653 22221 1100 Q ss_pred HHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCC Q lcl|NC_011308. 475 CDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIE 511 (530) Q Consensus 475 ~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (530) . + .+... +-..+..++ ++ T Consensus 369 ~-------~----~~~~~------~~~gGd~~~--~~ 386 (386) T protein:vir:48 369 G-------E----NPNKT------TLKGGEING--ED 386 (386) T ss_pred h-------c----CCCCC------ccCCCCCCC--CC Confidence 0 0 00000 001111111 11 No 193 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=95.31 E-value=0.0023 Score=35.04 Aligned_cols=369 Identities=9% Similarity=-0.010 Sum_probs=156.6 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |-+ +.+.... +.+............+-+-. . ......+..-+.++-....|+..+.=+.+- T Consensus 1 Mg~---f~~~~~~-~~~~~~~~~~~~~~~~~~~~----------~-----~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~ 61 (382) T protein:vir:48 1 MPI---FNLATES-PPDNQGGFFDVVDSDFLASL----------K-----GNEWVSAETALRNSDLFSIINQLSNDLATV 61 (382) T ss_pred Ccc---ccccccC-Ccccccccccchhhhccccc----------c-----CCcccchHhhhccHHHHHHHHHHHHhhccC Confidence 322 2221100 00000000000000000000 0 000000000011112333555566666667 Q ss_pred ceeeecCCcchHHHHHHHHHHhhc-cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCceeE Q lcl|NC_011308. 91 GIDVKPTDHDDQKLCYLIEEYYNE-EFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQRI 168 (530) Q Consensus 91 pv~~~~~~~~de~~~~~l~~~~~~-~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~ 168 (530) |++..-.. .. ..+.+=... ........+..+...+|.||.++-.|.+|.+ .+..++|..+-++.++.+... T Consensus 62 ~~~~~~~~--~~---~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~~~~-- 134 (382) T protein:vir:48 62 KLITSRKK--LQ---GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGI-- 134 (382) T ss_pred ceeeecch--hh---hhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCCeE-- Confidence 77653221 11 111111111 2234456677788999999999989988885 678889999887765544321 Q ss_pred EEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccc Q lcl|NC_011308. 169 IRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGR 248 (530) Q Consensus 169 ~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 248 (530) +|.+..... .......+..+.+.|++.... T Consensus 135 --~y~~~~~~~------~~~~~~~~~~~evih~~~~~~------------------------------------------ 164 (382) T protein:vir:48 135 --YYNITFDDP------RIPPKQHVPQNDVLHFRLLSV------------------------------------------ 164 (382) T ss_pred --EEEEEecCc------cccceeEEcCccEEEecCCCC------------------------------------------ Confidence 122211110 000112344455555432110 Q ss_pred cccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHH---HH---H-- Q lcl|NC_011308. 249 QVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIK---KN---I-- 320 (530) Q Consensus 249 ~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~---~~---~-- 320 (530) - ..-.|.|-+..+...|+....+..-..+.+.-.+.|-.+++-......++.. .. . T Consensus 165 -------~---------~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~ 228 (382) T protein:vir:48 165 -------D---------GGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLDFKTKLSRSRQAMKQ 228 (382) T ss_pred -------C---------CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhhcc Confidence 0 0114677777777777666555555566666666676666432221111111 11 1 Q ss_pred hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHH Q lcl|NC_011308. 321 QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKT 400 (530) Q Consensus 321 ~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~ 400 (530) ..++++.++++.+++-+.....+.......+.+.+.|...-.+|..--...++.+ .. ....+.++... T Consensus 229 n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~--~~----------~~~~~~~~~~~ 296 (382) T protein:vir:48 229 MQGGPLVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQQ--SS----------LEMSSDLYSKA 296 (382) T ss_pred CCCCeeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc--cH----------HHHHHHHHHHH Confidence 1345666776666666654444444456667788888888888842111111111 00 11122344444 Q ss_pred HHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhC---CCCCCHHHHHHHHHH Q lcl|NC_011308. 401 LRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIA---PRIGDEETLKAICDT 477 (530) Q Consensus 401 l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~---~~vdd~~~e~~~~e~ 477 (530) |.-.++.|..-++.+-..++.. ++...+. +. ....+..+..+..+|++++-.+++.+ ++..++.-+ T Consensus 297 l~p~~~~i~~~l~~~l~~~~~~-~~~~~~~---~~-~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~------ 365 (382) T protein:vir:48 297 VSRYLRPFLSELSQKLSCDVDA-DIFPAVD---PT-GSNYISRINSLVKTGTLAQNQGLYILQQAEILPKELPN------ 365 (382) T ss_pred HHHHHHHHHHHHHHHhcChhhh-hhhhhhc---cc-hhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcchhh------ Confidence 5444554444444332222211 1111111 11 22333445567778999988887654 443332101 Q ss_pred HHHHHHHHHHhhhccccccCCccccCCCCCCCCC Q lcl|NC_011308. 478 LDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIE 511 (530) Q Consensus 478 e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (530) .++..+..+.+ + .+.++ T Consensus 366 ----~~~~~~~~~GG------------d-~~~~~ 382 (382) T protein:vir:48 366 ----GENPNSTLKGG------------E-EDGQD 382 (382) T ss_pred ----hhcCCCCCCCC------------C-CCCCC Confidence 00100111100 0 01111 No 194 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=95.18 E-value=0.0026 Score=34.77 Aligned_cols=372 Identities=11% Similarity=0.078 Sum_probs=147.0 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHh-cccchhhhcccccccccccccccccCCcceee-cCchhhHHhhhhhhhc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYY-NQDNDIENTRIMWMNDHGDIVEDDNASNIKIS-HGFFAELVDQKTQYLL 88 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY-~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~-~n~~k~Ivd~~~~yl~ 88 (530) |-+..+.++ +.+.. ......++ .+.- ... -...+..++. .+-...-|+..++-+. T Consensus 1 Mg~~~~f~~-------k~~~~-~~~~~~~~~~~~~---------~~~------~~~~~~~~~~~~~~V~~~I~~ia~~iA 57 (403) T protein:vir:80 1 MGLFNFFRR-------KTRSE-PTNAISWFLTQEA---------YDT------LAIPGYTRLSDNPEVRMAVHKIAELIS 57 (403) T ss_pred Ccccccccc-------ccccc-ccchhhhhccccc---------ccc------cccchhhhhhhhHHHHHHHHHHHHhhh Confidence 333222111 00000 00000000 0000 000 0000001111 1122345677777777 Q ss_pred ccceee-ecCCcchHHHHHHHHHHhh---ccHH---HHHHHHHHHHhh--cCeEEEEEEecCCCce-EEEEecccceEEE Q lcl|NC_011308. 89 ANGIDV-KPTDHDDQKLCYLIEEYYN---EEFQ---SAIQELVEGSTI--KGYEGIFARTTSEDKL-TFQTVDALQLLPV 158 (530) Q Consensus 89 G~pv~~-~~~~~~de~~~~~l~~~~~---~~~~---~~~~e~~~~~~~--~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v 158 (530) +-|+.+ ...+.+......-+...+. |... .....+...... .|.||.++-++..|.+ .+..++|..+-++ T Consensus 58 ~~p~~~~~~~~~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L~~l~p~~v~~~ 137 (403) T protein:vir:80 58 SMTIHLMQNTDNGDIRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDELIPLAPSKVSFV 137 (403) T ss_pred hCceEEEEecCCceeecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEEEEEcCCeeEEE Confidence 778874 2222222222222333332 2221 222334445554 4667877777777876 4667889888776 Q ss_pred EcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccccccee Q lcl|NC_011308. 159 FDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAIL 238 (530) Q Consensus 159 ~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (530) .++.+. ++++. . ..|..+++.|++...... T Consensus 138 ~~~~g~-----~~~y~----~-----------~~~~~~eiih~~~~~~~~------------------------------ 167 (403) T protein:vir:80 138 DTDTGY-----QIWYQ----G-----------KAYNYDEVLHFIVNPDPE------------------------------ 167 (403) T ss_pred EcCCce-----EEEEe----e-----------cccchhhEEEEeccCCCc------------------------------ Confidence 665431 11110 0 123444555443211000 Q ss_pred cccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC---Cchhh Q lcl|NC_011308. 239 DEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN---SPVDE 315 (530) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~---~~~~~ 315 (530) ++ -.|.|-++.+...+.....+..-....+.-...|-.+++-... +..++ T Consensus 168 ---------------~~------------~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~ 220 (403) T protein:vir:80 168 ---------------KP------------YMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAATAELSSEE 220 (403) T ss_pred ---------------Cc------------cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCChHHHHH Confidence 00 0255555544444444333333233333333445555543221 11122 Q ss_pred HHHHH--------hhCcceecCCCC-ce-eEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhh Q lcl|NC_011308. 316 IKKNI--------QSKKIIQTKGEG-GL-DIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTL 385 (530) Q Consensus 316 ~~~~~--------~~~~~i~~~~~~-~~-~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~ 385 (530) .+..+ ..++...++.++ +. ++...+..+.......+.....|...-.+|. .-+|..++..- .+. T Consensus 221 ~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp---~~lg~~~~~~~--~~~- 294 (403) T protein:vir:80 221 GRNAVFKKYLEASEAGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPA---FLLGVGKYDKD--EYN- 294 (403) T ss_pred HHHHHHHHHhhhhhcCCeeeecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCH---HHcCCCCccHH--HHH- Confidence 22211 122333333332 22 2211222223333445566666777666663 22232222110 111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCC Q lcl|NC_011308. 386 LAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRI 465 (530) Q Consensus 386 l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~v 465 (530) ..+...|.-.++.|...++.+-..+-+ ..+++..+.-+..|..+.++....+..+|+++...+++.+++- T Consensus 295 ---------~f~~~~l~P~~~~ie~~l~~kll~~~~-~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~ 364 (403) T protein:vir:80 295 ---------NFINSTILPIAKGIEQELTRKLLISPD-LYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLS 364 (403) T ss_pred ---------HHHHHHHHHHHHHHHHHHHHhccCCCC-cEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 144445555555555555433222211 1233333455667888889998999999999999998887642 Q ss_pred CCH--HHHHHHHHHHHHHHHHHHHhhhc-cc---cccCCccccCCCCCCCC Q lcl|NC_011308. 466 GDE--ETLKAICDTLDLDYEDVVKALED-QE---VEELEPTVTPIIDPLTI 510 (530) Q Consensus 466 dd~--~~e~~~~e~e~~e~~~~~~~~~~-~~---~~~~~~~~~~~~~~~~~ 510 (530) ..+ +..... .....+.. ++ .++.++.++ +.+++ T Consensus 365 p~~ggd~~~~~---------~n~~pl~~~~~~~~~k~ge~~~~---~~~~~ 403 (403) T protein:vir:80 365 PKEGLSELVIL---------ENYIPLDKIGDQNKLKGGEKGGA---DGQTD 403 (403) T ss_pred CCCCCCeEeec---------ccccchhhccchhhccCCCCCCC---CCCCC Confidence 211 110000 00101110 00 000000000 11111 No 195 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=95.14 E-value=0.0027 Score=34.69 Aligned_cols=380 Identities=9% Similarity=0.023 Sum_probs=163.0 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhccccc-c-cccc------cccccccCCc--cee-ecCchhhH Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMW-M-NDHG------DIVEDDNASN--IKI-SHGFFAEL 79 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~-~-~~~~------~~~~~~~~~n--~ki-~~n~~k~I 79 (530) |.+..=+.+ | +..++++.+.......... . .... ...-.....+ +++ ........ T Consensus 1 ~~~~~~~~~-~-------------~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~c 66 (413) T protein:vir:96 1 MPGVSEIRK-D-------------KNLKFFNNKRSPTEESKAKDEIPKAPQVVMTLPNFFKELISDGYTKLSDSPEVRMA 66 (413) T ss_pred CCccchhhh-h-------------hcCCccccCCCcchhhhhhccccccccccccchhhHhhhccchhHHHhhchHHHHH Confidence 211100000 0 0112222222111000000 0 0000 0000000000 111 12344556 Q ss_pred Hhhhhhhhcccceeeec-CCcchHHHHHHHHHHhh---cc---HHHHHHHHHHHHhhcCeEEEEEEecCCCc-e-EEEEe Q lcl|NC_011308. 80 VDQKTQYLLANGIDVKP-TDHDDQKLCYLIEEYYN---EE---FQSAIQELVEGSTIKGYEGIFARTTSEDK-L-TFQTV 150 (530) Q Consensus 80 vd~~~~yl~G~pv~~~~-~~~~de~~~~~l~~~~~---~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~-~-~~~~~ 150 (530) |+..++-+.+-|+.+-- .....+....-+..++. |. .......+..+...+|.||.++-++..|. + ....+ T Consensus 67 I~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~L~~l 146 (413) T protein:vir:96 67 VDCIADLVSNMTIQLMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIGLTPI 146 (413) T ss_pred HHHHHHhhccCceEEEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEEEEEe Confidence 67777777777887522 22112212222333331 22 23445667788889999999999988874 3 57788 Q ss_pred cccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeee Q lcl|NC_011308. 151 DALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVA 230 (530) Q Consensus 151 ~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (530) +|..+-+..+.. .. +|.... . +. .|.++++.|++.... T Consensus 147 ~~~~v~~~~~~~-~~-----~y~~~~-~-----~~------~~~~~evih~k~~~~------------------------ 184 (413) T protein:vir:96 147 SPYKVTFNVSDD-DL-----DYSITF-D-----NK------EYDPSTLLHFVLNPS------------------------ 184 (413) T ss_pred cCceeEEEEcCC-eE-----EEEEee-c-----Cc------EEchhhEEEEeccCC------------------------ Confidence 998887766532 11 121110 0 00 123334443321100 Q ss_pred cccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC- Q lcl|NC_011308. 231 DGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT- 309 (530) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~- 309 (530) .. +.-.|.|-++.....|.....+..-..+.+.-.+.|-.+++... T Consensus 185 ------------------------~~---------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~ 231 (413) T protein:vir:96 185 ------------------------IE---------RPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSD 231 (413) T ss_pred ------------------------CC---------CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC Confidence 00 00036666665555555554444444455555556666665322 Q ss_pred CCc--hhhHHHHHh--------hCcceecCCCCc-eeEEE-ecCCHHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHH Q lcl|NC_011308. 310 NSP--VDEIKKNIQ--------SKKIIQTKGEGG-LDIQT-VDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNV 377 (530) Q Consensus 310 ~~~--~~~~~~~~~--------~~~~i~~~~~~~-~~~lt-~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGv 377 (530) .++ .+.++..+. .++++.+++++. +.-+. ....+.......+...+.|...-.+|. .-+|..++. T Consensus 232 l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~---~~lg~~~~~ 308 (413) T protein:vir:96 232 SDELSDEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPA---FLLGVGTYN 308 (413) T ss_pred CCHHHHHHHHHHHHHHhcCccccCceeeecCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCCCH---HHcCCCcch Confidence 111 122222221 223444544442 22111 122233333455566677777777774 223322221 Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHH Q lcl|NC_011308. 378 VIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINN 457 (530) Q Consensus 378 Aik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et 457 (530) +.. ...++...|.-.++.|...++.+-..+ ...+++.++.-+-.|..+.++....+..+|+++... T Consensus 309 --~~~----------~~~~~~~~l~P~~~~ie~~ln~~ll~~--~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE 374 (413) T protein:vir:96 309 --KDE----------FNNFINTKIMSIAQVIQQTYNKLIVEE--DMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNE 374 (413) T ss_pred --HHH----------HHHHHHHHHHHHHHHHHHHHHHhhCCC--CcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHH Confidence 111 112455556666666666555432211 224555555666678888899888999999999999 Q ss_pred HHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCccc Q lcl|NC_011308. 458 LLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVI 523 (530) Q Consensus 458 ~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (530) +++.+++-..+. .+.......-...+... +.++.+ +++ | T Consensus 375 ~R~~~g~~p~~~------------gd~~~~~~n~~~~~~~~---------~~~~~~----~~d--t 413 (413) T protein:vir:96 375 FRNWVGMPPDAE------------MDDLLVLENYLQQKDLV---------NQKKLI----QDE--T 413 (413) T ss_pred HHHHhCCCCCCC------------cceeeecccccchhhcc---------cccCCC----CCC--C Confidence 998886522110 00110000000000000 000000 000 0 No 196 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=95.08 E-value=0.0028 Score=34.57 Aligned_cols=470 Identities=11% Similarity=-0.001 Sum_probs=205.7 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhh--hHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQ--NVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~--~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) |...+...-.. +-+++..+.+++.. -..+.+.+.+|....- +.+.. .. ......++..+-... T Consensus 1 m~~~~~~~~~~---~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~--~~~~~-------~~---~~~~~~~~~dst~~~ 65 (535) T protein:vir:33 1 MADSKRTGLGE---DGAKATYDRLTNDRRAYETRAENCAQYTIPSL--FPKES-------DN---ESTDYTTPWQAVGAR 65 (535) T ss_pred CChhhhhccCh---hHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--cCCCC-------Cc---ccccccccccccHHH Confidence 66666443332 22444455554321 1334456666644421 11100 00 001112344555555 Q ss_pred HHhhhhhhhccc--cee--ee--cCCc----------ch-------HHHHHHHHHHh-hccHHHHHHHHHHHHhhcCeEE Q lcl|NC_011308. 79 LVDQKTQYLLAN--GID--VK--PTDH----------DD-------QKLCYLIEEYY-NEEFQSAIQELVEGSTIKGYEG 134 (530) Q Consensus 79 Ivd~~~~yl~G~--pv~--~~--~~~~----------~d-------e~~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~ 134 (530) .++..++.|+|- |.+ |. ..+. +. +.+...+...+ ..||....+++.++...+|.+. T Consensus 66 a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ 145 (535) T protein:vir:33 66 GLNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNAL 145 (535) T ss_pred HHHHHHHHHHHhhcCCCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcee Confidence 666666555542 221 11 1110 00 11112233333 4678889999999999999997 Q ss_pred EEEEecCCCceEEEEecccceEEEEcCCCCceeEEEEEEEEeecccc----------cccceEEEEEEEcCCceEEEeec Q lcl|NC_011308. 135 IFARTTSEDKLTFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDAD----------NKFNSIGHADVWTDTEVWYYVQK 204 (530) Q Consensus 135 ~~~y~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~----------~~~~~~~~~evyt~~~~~~y~~~ 204 (530) .++-.+..+.++|+.++-.+.++-.|..+....++|.+......-.. ...+.-..+++| +.... T Consensus 146 l~~~~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~------~~v~~ 219 (535) T protein:vir:33 146 LYLPEPEGSYNPMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVY------THVYL 219 (535) T ss_pred EEeecCCCCceeeEEEEcCeeEEeeCCCCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEE------EEEEe Confidence 66655555667888887777777778888888777766532110000 000111112222 11111 Q ss_pred CCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHHH Q lcl|NC_011308. 205 DEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVKS 279 (530) Q Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~ 279 (530) ..... +...+..+ ............+|...|++.++- +.+|.|--++..+ T Consensus 220 ~~~~~------------~~~~~~~~-------------~~~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~ 274 (535) T protein:vir:33 220 DEESG------------DYLKYEEV-------------EDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLG 274 (535) T ss_pred eCCCC------------cEEEEEEE-------------eCccccccccccccccCCceeeeeeecCCCccccchHHHHHH Confidence 10000 00000000 000111112234677788887664 3589999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHHHHHH Q lcl|NC_011308. 280 IIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDIDELNI 357 (530) Q Consensus 280 liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L~~~I 357 (530) -+..++.+.-..........+|.+++.-.+......+. ..+.+.+..+..+++..+... .+.......++.++..| T Consensus 275 D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~~~--~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I 352 (535) T protein:vir:33 275 DLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLT--KAQTGDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARL 352 (535) T ss_pred HHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcc--cCCceeeecCCcccceeeecccccchhHHHHHHHHHHHHH Confidence 99999999999999999999998776322222211111 122344545556677766533 35666666677776666 Q ss_pred HHHhcccCCCcccccCCcHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CccccceeeEEeCCCCCC Q lcl|NC_011308. 358 YRSGMGFNSSAVGDGNATNVVIKSRYTLLAM-KAQKTEIALRKTLRWTADLVVEDIRRRGL-GDYSSTDIKFDIEPYILA 435 (530) Q Consensus 358 ~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~-ka~~ke~~f~~~l~~~~~~i~~~l~~~~~-~~~d~~~i~i~f~~~~P~ 435 (530) -..-+.-.+.....+..++..+..+-.-+.. .-....+.-.+.|.=+++-++.++...+. .......++++|.-.+.+ T Consensus 353 ~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~ 432 (535) T protein:vir:33 353 SYAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEA 432 (535) T ss_pred HHHHhhhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHH Confidence 4432211111122233455444333222221 11222222233333334444444444432 223344578888766654 Q ss_pred CHH--HHHHHHHHHHhcCCCcHH---------HHHHh----C--C---CCCCHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|NC_011308. 436 NEL--DLAMIDKTEAETNQIQIN---------NLLAI----A--P---RIGDEETLKAICDTLDLDYEDVVKALEDQEVE 495 (530) Q Consensus 436 n~~--e~a~~~~~~~~~g~iS~e---------t~l~~----~--~---~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~ 495 (530) -.. ++..++........++.+ .++.. + | ++.. +++.+++.+++.+.+..++....... T Consensus 433 aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~-~ee~~~~~~q~~~~~~~~~~~~~~g~- 510 (535) T protein:vir:33 433 IGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLT-DEQKQALMMQDAAQTGVENAAAAGGA- 510 (535) T ss_pred HHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHhHhcCC-HHHHHHHHHHHHHHHHHHHHHHhhhh- Confidence 321 111111111111112222 22211 1 1 1222 33333333333332222222221100 Q ss_pred cCCccccCCCCCCCCCCccCcCCCCcccc Q lcl|NC_011308. 496 ELEPTVTPIIDPLTIEPQPEPLNIDPVIE 524 (530) Q Consensus 496 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (530) .-+... ..+++..+.--..+.=.+. T Consensus 511 ---~~~~~~-~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:33 511 ---GVGALA-TSSPEAMQGAAAKAGLNAT 535 (535) T ss_pred ---hhcchh-hcCChhHHHHHHhccCCCC Confidence 000000 0111111111111111111 No 197 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=95.06 E-value=0.0028 Score=34.54 Aligned_cols=465 Identities=11% Similarity=0.027 Sum_probs=190.1 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhh--hHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQ--NVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~--~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) |-..... -..+-+++..+.+++.. -..+.+.+.+|....- ... .+. .......|+..+-+.. T Consensus 1 m~~~~~~----~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~--~~~----~~~------~~~~~~~~~~dst~~~ 64 (536) T protein:vir:10 1 MAEKRTG----LAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSL--FPK----DSD------NASTDYQTPWQAVGAR 64 (536) T ss_pred Ccchhhc----hhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--cCC----CCC------cccccccccccccHHH Confidence 3331111 12234455555554421 1234455556644321 100 000 0111223555666666 Q ss_pred HHhhhhhhhccc--cee--ee--cCCcc-------h----------HHHHHHHHHHh-hccHHHHHHHHHHHHhhcCeEE Q lcl|NC_011308. 79 LVDQKTQYLLAN--GID--VK--PTDHD-------D----------QKLCYLIEEYY-NEEFQSAIQELVEGSTIKGYEG 134 (530) Q Consensus 79 Ivd~~~~yl~G~--pv~--~~--~~~~~-------d----------e~~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~ 134 (530) .++..++.|+|- |.+ |. ..+.+ . +.....+...+ ..||....+++.++...+|.+. T Consensus 65 a~~~Laa~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ 144 (536) T protein:vir:10 65 GLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVL 144 (536) T ss_pred HHHHHHHHHHhhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEe Confidence 666666655541 211 11 11100 0 11222233333 3678889999999999999987 Q ss_pred EEEEecCCCce-EEEEecccceEEEEcCCCCceeEEEEEEEEeec----------ccccccceEEEEEEEcCCceEEEee Q lcl|NC_011308. 135 IFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYS----------DADNKFNSIGHADVWTDTEVWYYVQ 203 (530) Q Consensus 135 ~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~----------~~~~~~~~~~~~evyt~~~~~~y~~ 203 (530) .|+--+..+.. .++.++-.++++--|..+....++|.+...... ......+.-..++||+.- |.. T Consensus 145 ly~~e~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V----~~~ 220 (536) T protein:vir:10 145 LYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHI----YLD 220 (536) T ss_pred EEEeeCCCCceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEE----EEe Confidence 54432322223 366677667776778888887777766532110 000111112223333211 111 Q ss_pred cCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHH Q lcl|NC_011308. 204 KDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVK 278 (530) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~ 278 (530) ..++.. ..| ..+ ............+|...|++.++- +.+|.|--++.. T Consensus 221 ~~~~~~-------------~~~-~e~-------------~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l 273 (536) T protein:vir:10 221 EASGEY-------------LRY-EEV-------------EGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYL 273 (536) T ss_pred cCCCcE-------------EEE-Eee-------------cCccccccccccccccCCceeeeeeecCCCccccchHHHHH Confidence 111100 000 000 000111112345678888887764 358999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEe--cCCHHHHHHHHHHHHHH Q lcl|NC_011308. 279 SIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTV--DIPYEARKAKMDIDELN 356 (530) Q Consensus 279 ~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~--~~~~~~~e~~ld~L~~~ 356 (530) +-+-.++.+.-..........+|.+.+.=.+......+. ..+.+.+..+..+++..+.. ..+.......++.++.. T Consensus 274 ~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~--~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~r 351 (536) T protein:vir:10 274 GDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLT--KAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEAR 351 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhc--cCCCcceecCCcccceeeeccccccchHHHHHHHHHHHH Confidence 999999988877777777777765554211111111111 11223343344455554433 34555566667666666 Q ss_pred HHHHhcccCCCcccccCCcHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ccccceeeEEeCCCCC Q lcl|NC_011308. 357 IYRSGMGFNSSAVGDGNATNVVIKSRYTLLAM-KAQKTEIALRKTLRWTADLVVEDIRRRGLG-DYSSTDIKFDIEPYIL 434 (530) Q Consensus 357 I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~-ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~-~~d~~~i~i~f~~~~P 434 (530) |-..-+.-.+.....+..++..+..+-.-+.+ .-....+.-.+.|.=+++-++.++...+.. ......+++.+.-.+. T Consensus 352 I~~af~~~~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~ 431 (536) T protein:vir:10 352 LSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLE 431 (536) T ss_pred HHHHHhhhhcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHH Confidence 63322221121122223455554443332222 112222333333333333344444333321 1222234555554443 Q ss_pred C-----CHHHHHHHHHHHHhcC------CCcHHHHHHhC----C-----CCCCHHHHHHHHHHHHHHHHH---HHHhhhc Q lcl|NC_011308. 435 A-----NELDLAMIDKTEAETN------QIQINNLLAIA----P-----RIGDEETLKAICDTLDLDYED---VVKALED 491 (530) Q Consensus 435 ~-----n~~e~a~~~~~~~~~g------~iS~et~l~~~----~-----~vdd~~~e~~~~e~e~~e~~~---~~~~~~~ 491 (530) . +...+.+.+..+.+.+ .+.-..++..+ + ++.. ++|++.+.+++.+... .+.++.. T Consensus 432 ~l~r~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv~p~~~irt-~eev~~~r~q~~~~~~~~~~a~~~~~ 510 (536) T protein:vir:10 432 AIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLT-EEQKQQKMAQQSMQMGMDNGAAALAQ 510 (536) T ss_pred HHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCCchhhcCC-HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 1111111111111111 12333333221 1 2222 3333333332222211 1111111 Q ss_pred cccc--cCCccccCCCCCCCCCCccCcCCCCccc Q lcl|NC_011308. 492 QEVE--ELEPTVTPIIDPLTIEPQPEPLNIDPVI 523 (530) Q Consensus 492 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (530) .... ...++. +....+.. +.+|.. T Consensus 511 ~~~~~~~~~~~~---~~~~~~~~-----g~~~~~ 536 (536) T protein:vir:10 511 GMAAQATASPEA---MAAAADSV-----GLQPGI 536 (536) T ss_pred HHHHHHhcCchh---HHhhhhcc-----ccCCCC Confidence 1100 001110 00111111 111222 No 198 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=94.59 E-value=0.004 Score=33.72 Aligned_cols=395 Identities=13% Similarity=0.042 Sum_probs=168.4 Q ss_pred cCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccc-cccccccccCCcce--eecCchhhHHhhh Q lcl|NC_011308. 7 TTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMND-HGDIVEDDNASNIK--ISHGFFAELVDQK 83 (530) Q Consensus 7 ~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~-~~~~~~~~~~~n~k--i~~n~~k~Ivd~~ 83 (530) -.+|+- ..++ . -+..+...++..+.|.............. .+...--....+.+ +.+.=....|+.. T Consensus 1 ~~~~~~----~~~~----~--~~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~I 70 (424) T protein:vir:18 1 MEEPKY----TIDL----R--TNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLI 70 (424) T ss_pred CCCCcc----cccc----C--CCCchHHHHHhhccccccccccchhhccccccccccccccccHHHhhccHHHHHHHHHH Confidence 112211 1110 0 11233333444444432111000000000 00000000000111 1111123356666 Q ss_pred hhhhcccceeee-cCCcch-------HHHHHHHHHHhhc--cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecc Q lcl|NC_011308. 84 TQYLLANGIDVK-PTDHDD-------QKLCYLIEEYYNE--EFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDA 152 (530) Q Consensus 84 ~~yl~G~pv~~~-~~~~~d-------e~~~~~l~~~~~~--~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p 152 (530) +.=+.+-|+.+- ....+. ..+...|+.-=+. .-......+..+...+|.||.++-++..|++ ....++| T Consensus 71 a~~iA~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~ 150 (424) T protein:vir:18 71 STLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQS 150 (424) T ss_pred HHhhccCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecC Confidence 666677787752 211111 1122223211011 1123445567788899999999988888875 4677888 Q ss_pred cceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecc Q lcl|NC_011308. 153 LQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADG 232 (530) Q Consensus 153 ~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (530) ..+-+..+. +.+ +|.... .+ ....|.++.+.|++.... T Consensus 151 ~~v~v~~~~-~~~-----~y~~~~------~g----~~~~~~~~eVihir~~~~-------------------------- 188 (424) T protein:vir:18 151 ANMDVKLVG-KKV-----VYRYQR------DS----EYADFSQKEIFHLKGFGF-------------------------- 188 (424) T ss_pred cceEEEEcC-CeE-----EEEEEe------CC----eEEEeccccEEEecCcCC-------------------------- Confidence 888765432 221 122111 01 111344555555432110 Q ss_pred cccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC-- Q lcl|NC_011308. 233 VDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN-- 310 (530) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~-- 310 (530) +.-.|.|-+......|+....+..-..+.+.-.+.|-.+++-... T Consensus 189 ---------------------------------dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l 235 (424) T protein:vir:18 189 ---------------------------------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL 235 (424) T ss_pred ---------------------------------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCC Confidence 011355655544444444333333334444555556566653221 Q ss_pred Cch--hhHHHHHh-------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHH Q lcl|NC_011308. 311 SPV--DEIKKNIQ-------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVI 379 (530) Q Consensus 311 ~~~--~~~~~~~~-------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAi 379 (530) ++. +.++..++ .++++.+++|.+++-++....+.......+...+.|...-.+|. +++..-++.+|.++ T Consensus 236 ~~e~~~~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~ 315 (424) T protein:vir:18 236 TEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGI 315 (424) T ss_pred CHHHHHHHHHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccH Confidence 211 11221111 23456676666665555443444445556677788888888884 23222223323333 Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--ccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHH Q lcl|NC_011308. 380 KSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLG--DYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINN 457 (530) Q Consensus 380 k~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~--~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et 457 (530) +-... .+++..|.-.++.|..-++.+-.. +.....+++.+..-+..|..+.++....+..+|+++... T Consensus 316 eq~~~----------~f~~~tl~P~~~~ie~~ln~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE 385 (424) T protein:vir:18 316 EQQNL----------GFLQYTLQPYISRWENSIQRWLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINE 385 (424) T ss_pred HHHHH----------HHHHHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHH Confidence 22222 223444444444444444432211 112223555555667788888888888999999999988 Q ss_pred HHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCc Q lcl|NC_011308. 458 LLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDP 521 (530) Q Consensus 458 ~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (530) +++.+++=. ...++.--....-.+..+- ..+++|.|..+ T Consensus 386 ~R~~~gl~p----------------------i~ggD~~~~~~n~~~l~~~---~~~~~~~~n~a 424 (424) T protein:vir:18 386 MRRTDNMPP----------------------LPGGDVAMRQAQYVPITDL---GTNKEPRNNGA 424 (424) T ss_pred HHHHhCCCC----------------------CCCcCeeeeccCccchhhh---hccCCccccCC Confidence 888765311 0000000000000000000 00111112222 No 199 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=94.51 E-value=0.0042 Score=33.60 Aligned_cols=369 Identities=9% Similarity=0.000 Sum_probs=148.1 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |-+.+.+... .............+ ....++.. ... +..+. ...-+...=....|+..+.=+.+- T Consensus 1 Mglf~~~~~~-----~~~~~~~~~~~~~~--~~~~~~~~----~~~-~~~v~----~~~al~~~~V~~~i~~Ia~~ia~l 64 (384) T protein:vir:49 1 MPIFNITNLA-----TESPPSNQDSFFDI--TDPEFLDA----LNG-SEWVS----AETALKNSDLFSIISQLSNDLATA 64 (384) T ss_pred CccccccccC-----cccccccchhhccc--cchhhccc----ccC-Cceec----hhhhhccHHHHHHHHHHHHHHhhC Confidence 3222110000 00000000000000 00000000 000 00000 000011111334566666666677 Q ss_pred ceeeecCCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCCceeE Q lcl|NC_011308. 91 GIDVKPTDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQRI 168 (530) Q Consensus 91 pv~~~~~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~ 168 (530) |+++.-.. . ...+.+=.. -........+..+...+|.+|.++-++..|.+ .+..++|..+-++.++.... T Consensus 65 ~~~~~~~~--~---~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~--- 136 (384) T protein:vir:49 65 KITTSRKQ--L---QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQNG--- 136 (384) T ss_pred ceeeecch--h---hhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCCce--- Confidence 87763211 1 111111110 12234556677888999999999999999885 57788999887766443221 Q ss_pred EEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccc Q lcl|NC_011308. 169 IRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGR 248 (530) Q Consensus 169 ~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 248 (530) .+|.+...... ......+..+.+.|++.... T Consensus 137 -~~y~~~~~~~~------~~~~~~~~~~eVih~~~~~~------------------------------------------ 167 (384) T protein:vir:49 137 -LYYNITFDDPR------IPPKQHVPQGDILHFRLLSV------------------------------------------ 167 (384) T ss_pred -EEEEEEecCcc------ccceeEecCccEEEecCCCC------------------------------------------ Confidence 11222111100 00112244455555432110 Q ss_pred cccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHH--------H Q lcl|NC_011308. 249 QVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKN--------I 320 (530) Q Consensus 249 ~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~--------~ 320 (530) . ..-.|.|-+..+...|+....+..-..+.+...+.|-.+++-.+....++.... - T Consensus 168 -------~---------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~ 231 (384) T protein:vir:49 168 -------D---------GGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQSRSRQAMKQ 231 (384) T ss_pred -------C---------CceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhccc Confidence 0 001366666666666655554444455555555566665543222222221111 1 Q ss_pred hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHHHHHHHHH Q lcl|NC_011308. 321 QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQKTEIALR 398 (530) Q Consensus 321 ~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~ 398 (530) ..++++.+++|.+++-+.....+.......+.+.+.|...-.+|. ++....+..++.+++-.+.. T Consensus 232 n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~------------- 298 (384) T protein:vir:49 232 MQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFK------------- 298 (384) T ss_pred CCccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHH------------- Confidence 234456666666555554443444445566778888888888884 23222222334333332222 Q ss_pred HHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhC---CCCCCHHHHHHHH Q lcl|NC_011308. 399 KTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIA---PRIGDEETLKAIC 475 (530) Q Consensus 399 ~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~---~~vdd~~~e~~~~ 475 (530) .++..++-+...+...-...+. ....+..-.+..........+..+|+.++-.+++.+ |+..| |..++ T Consensus 299 -~i~~~l~pi~~~i~~~l~~~l~-----~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~n---e~r~~ 369 (384) T protein:vir:49 299 -AVSRFLRPFVSELSKKLSCEVD-----ADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILPK---DLPEG 369 (384) T ss_pred -HHHHHHHHHHHHHHHHhchhhh-----hhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCCh---hHHHH Confidence 2222222222222111111110 000111111111223334455667788887776654 44432 11111 Q ss_pred HHHHHHHHHHHHhhhccccccCCccccCC Q lcl|NC_011308. 476 DTLDLDYEDVVKALEDQEVEELEPTVTPI 504 (530) Q Consensus 476 e~e~~e~~~~~~~~~~~~~~~~~~~~~~~ 504 (530) +..+....+++ ++.. T Consensus 370 --------~~~~p~~gGd~------~~~~ 384 (384) T protein:vir:49 370 --------ETDSTLKGGET------NEQY 384 (384) T ss_pred --------cCCCCCCCCCC------CCCC Confidence 11111111111 1111 No 200 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=94.45 E-value=0.0044 Score=33.52 Aligned_cols=378 Identities=11% Similarity=0.073 Sum_probs=159.7 Q ss_pred HHHHhhhHHHH---HHH-HHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhcccceeeecCC Q lcl|NC_011308. 23 EYIRSQNVSLA---RVG-QRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLANGIDVKPTD 98 (530) Q Consensus 23 ~~~~~~~~~~~---~~~-~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~ 98 (530) =|-+....+.. ... ..|.. .++.-... ..+..+ .+..-+.+.-...-|+..++=+.+-|+.+.-.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~---~~~g~~~s---~~~~~v----~~~~al~~~~v~~cv~~ia~~ia~lp~~~~~~~ 70 (419) T protein:vir:80 1 MFFSRQLLSNLGQTQPGSGGWVS---ALLGSARS---EAGQVV----TPASALSLTVLQNCVTLLAESIAQLPVELYERS 70 (419) T ss_pred CCcccccccccCcCCCCcchhhH---Hhhccccc---ccCccc----ChHHhhccHHHHHHHHHHHHhhccCceEEEEec Confidence 00000000000 000 00000 00000000 000000 001111223344466666677777788753222 Q ss_pred cc------hHHHHHHHHHHhhcc---HHHHHHHHHHHHhhcCeEEEEEEecCCCceE-EEEecccceEEEEcCCCCceeE Q lcl|NC_011308. 99 HD------DQKLCYLIEEYYNEE---FQSAIQELVEGSTIKGYEGIFARTTSEDKLT-FQTVDALQLLPVFDDYGTLQRI 168 (530) Q Consensus 99 ~~------de~~~~~l~~~~~~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~-~~~~~p~~~~~v~d~~~~~~~~ 168 (530) .+ +..+...|+.- -|. -......+......+|.||.++-++..|.+. +..++|..+-++.+..+.. T Consensus 71 ~~~~~~~~~~~l~~lL~~~-PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~~~~~~~~--- 146 (419) T protein:vir:80 71 GDDRKPATDHPLYSILKYE-PNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVMKGPDLKP--- 146 (419) T ss_pred CCCcccccccHHHHHHHhh-cccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCceE--- Confidence 11 11222333211 021 2234456777889999999999999999864 7788998887766654321 Q ss_pred EEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccc Q lcl|NC_011308. 169 IRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGR 248 (530) Q Consensus 169 ~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 248 (530) +|.+. .. ..+..+.+.|.+.. T Consensus 147 --~y~~~---~~----------~~~~~~~i~h~~~~-------------------------------------------- 167 (419) T protein:vir:80 147 --MYRVA---GA----------DPLPQRLVHHVRWM-------------------------------------------- 167 (419) T ss_pred --EEEEc---Cc----------cccchhheEEecCC-------------------------------------------- Confidence 11110 00 01122222222110 Q ss_pred cccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeee--cCC-CCchhh----HHHHH- Q lcl|NC_011308. 249 QVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVR--GGT-NSPVDE----IKKNI- 320 (530) Q Consensus 249 ~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~--g~~-~~~~~~----~~~~~- 320 (530) |. +.-.|.|-++.....|+....+..-..+.+.-.+.|-.+|+ +.. .....+ ++..+ T Consensus 168 -----------~~----d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~ 232 (419) T protein:vir:80 168 -----------SI----NGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWN 232 (419) T ss_pred -----------CC----CCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHH Confidence 00 11146666665555555444443333344444455665654 211 111111 22211 Q ss_pred -------hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHH Q lcl|NC_011308. 321 -------QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQ 391 (530) Q Consensus 321 -------~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~ 391 (530) ..++++.+++|.+++-++....+.......+...+.|...-.+|. ++....|+-|+..- T Consensus 233 ~~~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~------------ 300 (419) T protein:vir:80 233 AKFGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEH------------ 300 (419) T ss_pred HHhcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHH------------ Confidence 124567777766666555444444445556677888888888885 23222222222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCC--ccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCC Q lcl|NC_011308. 392 KTEIALRKTLRWTADLVVEDIRRRGLG--DYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGD 467 (530) Q Consensus 392 ~ke~~f~~~l~~~~~~i~~~l~~~~~~--~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd 467 (530) ....+++..|.-.++.|...++.+-.. ......+.+.++.-+..|..+.++...++..+|+++...+++.+++ +++ T Consensus 301 ~~~~f~~~~l~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~g 380 (419) T protein:vir:80 301 QSLQFVIYTLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKG 380 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 112233333444444444433332111 1111223444444455688888888888999999999998888754 222 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 468 EETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 468 ~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) -+.- ...+.-.. - ..+++.+.+..+.+ ..++.+ T Consensus 381 GD~~--------------~~~~n~~~------~---------~~~~~~~~~~~~~~-~~~~~~ 413 (419) T protein:vir:80 381 GDIY--------------LSPMNMVD------A---------SKPQPIPMGKTEPT-KAALDE 413 (419) T ss_pred ccee--------------eecccccc------c---------cccccccCCCCCch-hhhHHH Confidence 1100 00000000 0 00001111111100 111111 No 201 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=94.43 E-value=0.0044 Score=33.48 Aligned_cols=391 Identities=8% Similarity=-0.052 Sum_probs=160.0 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHH-----------HHhcccchh-hhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQ-----------RYYNQDNDI-ENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~-----------~YY~g~~~I-~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) |-+.+++...=. .......++.... +.+.+-++- +..........+..+ .++.=+.+.-... T Consensus 1 Mgl~d~~r~~~~--~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v----~~~~al~~~~V~~ 74 (431) T protein:vir:10 1 MGLFDFIRREKQ--PEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTG----RETRALRNMAVLR 74 (431) T ss_pred CcchhhhhcCcc--cccccccccccccccccccccccccccccccchHHHHhhccCccCccee----chhhhhccHHHHH Confidence 544444322000 0000000000000 011111100 000000000000000 0000011122334 Q ss_pred HHhhhhhhhcccceeeecCC-cchHHHHHHHHHHhh---ccH---HHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEec Q lcl|NC_011308. 79 LVDQKTQYLLANGIDVKPTD-HDDQKLCYLIEEYYN---EEF---QSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVD 151 (530) Q Consensus 79 Ivd~~~~yl~G~pv~~~~~~-~~de~~~~~l~~~~~---~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~ 151 (530) .|+..++=+.+-|+++--.+ ........-+..+++ |.. ......+......+|.+|.++-++...-+....++ T Consensus 75 ci~~Ia~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~g~~~~L~pl~ 154 (431) T protein:vir:10 75 CVTLISGTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSGNRPIRLIPMD 154 (431) T ss_pred HHHHHHHhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCceEEEEEEc Confidence 55666666667788752222 111111112223322 222 23445677888899999999888853334567788 Q ss_pred ccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeec Q lcl|NC_011308. 152 ALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVAD 231 (530) Q Consensus 152 p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (530) |..+-++.++.+.+ +|..... .+. ...+..+.+.|++... T Consensus 155 ~~~v~~~~~~~~~~-----~y~~~~~-----~g~----~~~~~~~dViHir~~~-------------------------- 194 (431) T protein:vir:10 155 RGSAKGRLTSTWQI-----VYDYTTP-----TGD----KIELPAREVFHLRDLS-------------------------- 194 (431) T ss_pred CceeEEEEcCCCeE-----EEEEEeC-----Cce----EEEEchhhEEEecCcC-------------------------- Confidence 99888877655432 1211110 011 0123344444442110 Q ss_pred ccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-C Q lcl|NC_011308. 232 GVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT-N 310 (530) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~-~ 310 (530) + +.-.|.|-++-....|.....+..-..+.+.-.+.|-.+++-.. . T Consensus 195 ------------------------~---------dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l 241 (431) T protein:vir:10 195 ------------------------I---------DGVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVPKEL 241 (431) T ss_pred ------------------------C---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCC Confidence 0 11136666665555554444433333444444455655554322 1 Q ss_pred Cc--hhhHHHHHh--------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHH Q lcl|NC_011308. 311 SP--VDEIKKNIQ--------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVV 378 (530) Q Consensus 311 ~~--~~~~~~~~~--------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvA 378 (530) ++ .+.++..+. .++++.+++|.+++-++....+.......+.....|...-.+|. ++.... .++.. T Consensus 242 s~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~--~t~sn 319 (431) T protein:vir:10 242 SDNAYGRMKASVQENHTGSENAGSWMLLEEGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDT--SWGSG 319 (431) T ss_pred CHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCC--Ccccc Confidence 21 122222221 23566677666666555443343444455666778887778875 332211 12222 Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--ccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCC---- Q lcl|NC_011308. 379 IKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLG--DYSSTDIKFDIEPYILANELDLAMIDKTEAETNQ---- 452 (530) Q Consensus 379 ik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~--~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~---- 452 (530) ++- ....+++..|.-.++.|..-++.+-.. ......+++.++.-+-.|..+.++...++.+.|+ T Consensus 320 ~eq----------~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~ 389 (431) T protein:vir:10 320 IEQ----------LAIFFIQYGLSHWFVSWEQAAARAFLPEKMLGQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPW 389 (431) T ss_pred HHH----------HHHHHHHHHHHHHHHHHHHHHHhhccChhhcCCceEEEechhhhccCHHHHHHHHHHHHhcccccCc Confidence 211 111222333444444444433322111 1111234444444455677787777777776664 Q ss_pred CcHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCC Q lcl|NC_011308. 453 IQINNLLAIAPR--IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNI 519 (530) Q Consensus 453 iS~et~l~~~~~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (530) ++.-.+++.+++ ++++... +....+..+. ....+++|+.+ T Consensus 390 lT~NE~R~~~gl~p~~~~~gD------------~~~~p~n~~~---------------~~~~~~~p~~~ 431 (431) T protein:vir:10 390 MKQNEVREMLDLPRADDPVAD------------QLRNPMTQKQ---------------KGSGDEPPATT 431 (431) T ss_pred cCHHHHHHHhCCCCCCCcccc------------ceeccccccc---------------CCCCCCCCCCC Confidence 788888777643 4443211 0001111100 01111223333 No 202 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=94.34 E-value=0.0047 Score=33.34 Aligned_cols=368 Identities=9% Similarity=-0.003 Sum_probs=151.3 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcce--eecCchhhHHhhhhhhhc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIK--ISHGFFAELVDQKTQYLL 88 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~k--i~~n~~k~Ivd~~~~yl~ 88 (530) |-+... .. ...++.+ . .++........-.. .... ....+.+ +...-....|+..++=+. T Consensus 1 Mg~~~~---~~-~~~~~~~-~------~~~~~~~~~~~~~~--~~~~------~~~v~~~~al~~~~v~~~i~~ia~~ia 61 (385) T protein:vir:10 1 MGLLTP---RN-FNKRKAK-N------MVYPSNPAFFTTTV--GGMQ------LSYVSALSALQNTNVYSVINRIASDVA 61 (385) T ss_pred Cccccc---hh-ccccccc-c------cccccchhhhhhhc--cccC------ccccCHHHhhccHHHHHHHHHHHHHHh Confidence 332211 10 0000000 0 01111111000000 0000 0000111 111223445666666666 Q ss_pred ccceeeecCCcchHHHHHHHHHHhhc-cHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEcCCCCcee Q lcl|NC_011308. 89 ANGIDVKPTDHDDQKLCYLIEEYYNE-EFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFDDYGTLQR 167 (530) Q Consensus 89 G~pv~~~~~~~~de~~~~~l~~~~~~-~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~~~~~~ 167 (530) +-|+++.- ......|++=... ........+..+...+|.||.++..+. +....++|..+.++.+..+ . T Consensus 62 ~~p~~v~~-----~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~---~~~~p~~~~~v~~~~~~~~-~-- 130 (385) T protein:vir:10 62 SAHFKTEN-----TATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---LEHIPNSDVQINYLPGNMG-I-- 130 (385) T ss_pred hCceeeec-----cchhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---eeEeecCCceEEEEEcCCc-e-- Confidence 77877631 1122223221111 122344556778888999998876552 3334445544444433221 1 Q ss_pred EEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccccccc Q lcl|NC_011308. 168 IIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEG 247 (530) Q Consensus 168 ~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (530) +|.+..... . ....+.++.+.|++.... T Consensus 131 ---~~~~~~~~~----~----~~~~~~~~eiihik~~~~----------------------------------------- 158 (385) T protein:vir:10 131 ---VYTVLESND----R----PQMVLRQDQMLHFRLMPD----------------------------------------- 158 (385) T ss_pred ---EEEEEEcCC----c----eEEEEccccEEEeccCCC----------------------------------------- Confidence 111111000 0 011234444444432110 Q ss_pred ccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecC--CCCc--hhhHHHHHh-- Q lcl|NC_011308. 248 RQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGG--TNSP--VDEIKKNIQ-- 321 (530) Q Consensus 248 ~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~--~~~~--~~~~~~~~~-- 321 (530) .++ +.-.|.|.+......|+....+..-.++.+.....|-.+++-. ..++ .+.++..++ T Consensus 159 ------~~~---------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~ 223 (385) T protein:vir:10 159 ------PQY---------RYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKA 223 (385) T ss_pred ------Ccc---------cccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHH Confidence 000 0114677777766666655555544555555555666665432 2111 111222111 Q ss_pred -----hCcceecCCCCceeEEEecCCHHH-HHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHHHH Q lcl|NC_011308. 322 -----SKKIIQTKGEGGLDIQTVDIPYEA-RKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQKT 393 (530) Q Consensus 322 -----~~~~i~~~~~~~~~~lt~~~~~~~-~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~~k 393 (530) .++++.+++|.+++-++....... .....+...+.|...-.+|. ++....++.++..++-. T Consensus 224 ~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~----------- 292 (385) T protein:vir:10 224 NTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQI----------- 292 (385) T ss_pred hCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHH----------- Confidence 234566666655555544333322 23455677788888888885 22211233322222110 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHH Q lcl|NC_011308. 394 EIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKA 473 (530) Q Consensus 394 e~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~ 473 (530) +..|...|.-.++.|..-++.+-.. ..+++.+..-+..|..+.++....+.++|+++...+++.+++-.=+...+ T Consensus 293 ~~~~~~~l~P~~~~ie~~l~~~l~~----~~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~~~- 367 (385) T protein:vir:10 293 KATYLANLNSYVNPIVDELRLKMNA----PDLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLPDNL- 367 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhCC----ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCCCCC- Confidence 1111222333333333333322111 13666666767788889999999999999999988887764311000000 Q ss_pred HHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCC Q lcl|NC_011308. 474 ICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNID 520 (530) Q Consensus 474 ~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (530) ......... ..+. +.|.+ T Consensus 368 ---------~~~~~~~~~-------~~~g-------------~~~dn 385 (385) T protein:vir:10 368 ---------PEFKPLTTQ-------VKGG-------------DEGDN 385 (385) T ss_pred ---------ccccCcccc-------cCCC-------------CCCCC Confidence 000000000 0000 00000 No 203 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=94.28 E-value=0.0049 Score=33.26 Aligned_cols=406 Identities=9% Similarity=0.028 Sum_probs=184.0 Q ss_pred CCcccccCCccc--HHH---HHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCc Q lcl|NC_011308. 1 MTNTLLTTAPDR--LGT---ILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGF 75 (530) Q Consensus 1 ~~~~~~~~~~~~--~~~---~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~ 75 (530) ||--++...... ..+ -+...|..- .+.-......-.+...-.|+++.- ..-...++- . .... T Consensus 1 ~~~~i~~~~g~~~~~~~~~~~~~~~ia~~---~~~~~~~~~~~~~p~~~~il~~~~----~~~~~y~~m-----~-~D~~ 67 (491) T protein:vir:79 1 MSKGLWVSPTEFVKFGEPDKSLSSQIATR---ARSIDFFALGMYLPNPDPVLKALG----KDIRVYREL-----R-ADAH 67 (491) T ss_pred CCCeeeCCCCCcccccccchhHHHHHhhh---ccccccccccccCcchhHHHhhcc----CCHHHHHHH-----h-hChH Confidence 776665543321 111 111111100 000000000011222222222110 000011110 1 2345 Q ss_pred hhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhhc-cHHHHHHHHHHHHhhcCeEE-EEEEecCCCceE---EEEe Q lcl|NC_011308. 76 FAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYNE-EFQSAIQELVEGSTIKGYEG-IFARTTSEDKLT---FQTV 150 (530) Q Consensus 76 ~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~~-~~~~~~~e~~~~~~~~G~a~-~~~y~d~~g~~~---~~~~ 150 (530) ..-.+.+...-++|.+..+...+. ++...+++.+.++. +|.+++..+ .++.-+|.+. +++|...+|.+. +... T Consensus 68 i~s~l~~Rk~av~~~~w~i~~~~~-~~~~a~~i~e~l~~~~~~~~i~~~-lda~~~G~s~~Ei~w~~~~g~~~~~~l~~r 145 (491) T protein:vir:79 68 VGGCVRRRKAAVKALEWGLDRGKA-KSRVAKSIADVFADLDLSRIATEM-LDAVLYGYQPMEITWGKVGNYIVPIDVVGK 145 (491) T ss_pred HHHHHHHHHHHHhCCCcEEecCCC-CHHHHHHHHHHHhcCCHHHHHHHH-HHhhhhcceeEEEEEeecCCeeeEEeeeee Confidence 566777777888899999977654 34456777777753 677777666 4577789865 677765566553 4444 Q ss_pred cccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeee Q lcl|NC_011308. 151 DALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVA 230 (530) Q Consensus 151 ~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (530) +|..+. ||..+.+. ++..++.... T Consensus 146 ~~~~f~--~d~~~~l~----------------------------------l~~~~~~~~g-------------------- 169 (491) T protein:vir:79 146 PADWFV--YDPENQLR----------------------------------FRSKEHWVQG-------------------- 169 (491) T ss_pred ccccee--eccCCceE----------------------------------EeecCCCCCc-------------------- Confidence 554332 33322211 1110000000 Q ss_pred cccccceecccccccccccccccccCCccceEEee--CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecC Q lcl|NC_011308. 231 DGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILY--NNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGG 308 (530) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~--nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~ 308 (530) ..-.+++.|-+.+-. .|..|.|.+..+-...--=+..+.+.+..++.|.-|+.+.+=. T Consensus 170 --------------------~~lp~~k~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~ 229 (491) T protein:vir:79 170 --------------------EELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHP 229 (491) T ss_pred --------------------eeecCCCeEEEEecCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecC Confidence 000112222222111 2346788888776666566677888999999999999988732 Q ss_pred CCCchh------hHHHHHhhCcceecCCCCceeEEEecC---CHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcH-H Q lcl|NC_011308. 309 TNSPVD------EIKKNIQSKKIIQTKGEGGLDIQTVDI---PYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATN-V 377 (530) Q Consensus 309 ~~~~~~------~~~~~~~~~~~i~~~~~~~~~~lt~~~---~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SG-v 377 (530) .+...+ +...++....++.++.+.++++++... +...++..+++..+.|-..--+=.++..+. |.+.| + T Consensus 230 ~~a~~~ek~~l~~al~~~~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~~gs~a~~~v 309 (491) T protein:vir:79 230 RSASDAETNLLLDRLEDMVQDAVAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALLGQNQTTEATSTRASAQA 309 (491) T ss_pred CCCCHHHHHHHHHHHHHHhcCeEEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHhhhhhccCcccchhhHHH Confidence 221211 122345666677789999999997542 344577777777777766553323333332 23323 3 Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCH-HHHHHHHHHHHhcCC-CcH Q lcl|NC_011308. 378 VIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANE-LDLAMIDKTEAETNQ-IQI 455 (530) Q Consensus 378 Aik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~-~e~a~~~~~~~~~g~-iS~ 455 (530) .-+.+. ..+..-.+.....|.+++.- ++..++. +...+.+.|. -+.+. ...|+.+..+...|+ ++. T Consensus 310 h~~v~~----~i~~~D~~~i~~tln~li~~---l~~~N~~---~~~~p~f~~~--e~ee~~~~~a~~~~~L~~~G~~i~~ 377 (491) T protein:vir:79 310 GLEVTD----DIRDGDKAIVVEAMNMLIRW---ICDLNFD---GAARPVFDMW--EQEQVDEIQAGRDEKLTRAGARFTP 377 (491) T ss_pred HHHHHH----HHHHHHHHHHHHHHHHHHHH---HHHhcCC---CCCcceEeec--CcCchhHHHHHHHHHHHhCCCccCH Confidence 333222 22222233444455443333 3333332 1223445553 34443 457888888888886 788 Q ss_pred HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCccccc---------- Q lcl|NC_011308. 456 NNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEE---------- 525 (530) Q Consensus 456 et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------- 525 (530) +.+.+.+++- .++... +..+....+.. .....+. ...+++.+ .+..... T Consensus 378 ~~~~e~~Gip-~~~~~e-----------~~~~~~~~~~~-~~~~~~~------~~~~~~~~--~d~~~~~~~~~~~~~~~ 436 (491) T protein:vir:79 378 AYFKRAYNLQ-DGDLDE-----------RPLPVSAVDAV-GAASFAE------FEAPDQDA--LDAALNALSARDLNADA 436 (491) T ss_pred HHHHHHhCCC-CCCCCc-----------cccCcCccccc-ccccccc------cCCCCCcc--hHHHHHHHHHHHHHHHH Confidence 8888877642 221110 00000000000 0000000 00000000 0000000 Q ss_pred ----ccCCC Q lcl|NC_011308. 526 ----EPVQE 530 (530) Q Consensus 526 ----~~~~~ 530 (530) +||.+ T Consensus 437 ~~~~~~i~~ 445 (491) T protein:vir:79 437 QALVAPLLK 445 (491) T ss_pred HHHHHHHHH Confidence 01111 No 204 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=94.23 E-value=0.005 Score=33.19 Aligned_cols=407 Identities=13% Similarity=0.052 Sum_probs=165.1 Q ss_pred cccCCcccHHHHHHHHHHHHHHhhhHHHHH-HHHHHhccc-chhhhcccccccccccccccccCCcceeecCchhhHHhh Q lcl|NC_011308. 5 LLTTAPDRLGTILSTKIDEYIRSQNVSLAR-VGQRYYNQD-NDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQ 82 (530) Q Consensus 5 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~-~~~~YY~g~-~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~ 82 (530) .+.+..|-.-+.++.+... ........ .......+- +...- ... ..+..+.... -.|+ .-.-..|+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~-~~s---~~g~~v~~~~--al~~--~~V~~~i~~ 69 (432) T protein:vir:10 1 MPDEKKLGLLGQLKAMFVP---PDPVDIGGGQTFTPVNATARDLGI-IIS---DTGAAVNADA--IMRL--DAVAACVKL 69 (432) T ss_pred CCCCcccchhhhhHhhcCC---ccccccccccccccCcchhhhhcc-ccc---ccCcccchhh--hhcc--hHHHHHHHH Confidence 3333333333322222111 00000000 000000000 00000 000 0000000000 0012 112335566 Q ss_pred hhhhhcccceeee-cCCcchHH-HHHHHHHHhh---ccH---HHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEeccc Q lcl|NC_011308. 83 KTQYLLANGIDVK-PTDHDDQK-LCYLIEEYYN---EEF---QSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDAL 153 (530) Q Consensus 83 ~~~yl~G~pv~~~-~~~~~de~-~~~~l~~~~~---~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~ 153 (530) .++=+.+-|+.+- -..++... ...-+..++. |.. ......+......+|.||.++..+ +|++ ....++|. T Consensus 70 Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~~~ 148 (432) T protein:vir:10 70 VSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLAND 148 (432) T ss_pred HHHhhhhCceeEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCC Confidence 6666667787642 11111111 1111222221 222 234456777888999999888775 4664 46678999 Q ss_pred ceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccc Q lcl|NC_011308. 154 QLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGV 233 (530) Q Consensus 154 ~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (530) .+-++.+..+.+. |...... +. ...+..+.+.|++... T Consensus 149 ~v~v~~~~~g~~~-----y~~~~~~-----g~----~~~~~~~~iih~~~~~---------------------------- 186 (432) T protein:vir:10 149 RLTITTDTKGNTA-----YRYRRTD-----GQ----MIDIPKQQIWKIMGYS---------------------------- 186 (432) T ss_pred ceEEEEcCCCcEE-----EEEEecC-----ce----EEEEcCccEEEecCCC---------------------------- Confidence 9988887665422 2111111 10 1123444555442110 Q ss_pred ccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC-Cc Q lcl|NC_011308. 234 DEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN-SP 312 (530) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~-~~ 312 (530) + +.-.|.|-++.....|+....+..-..+.+.-.+.|-.+++.... ++ T Consensus 187 ----------------------~---------dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~ 235 (432) T protein:vir:10 187 ----------------------L---------DGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTD 235 (432) T ss_pred ----------------------C---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCH Confidence 0 011355555544444433333322233333434456666653222 11 Q ss_pred --hhhHHHH----HhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccC-CcHHHHHHHH Q lcl|NC_011308. 313 --VDEIKKN----IQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGN-ATNVVIKSRY 383 (530) Q Consensus 313 --~~~~~~~----~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn-~SGvAik~~~ 383 (530) -+.++.. ...++++.+++|.+++-++....+..+....+.....|...-.+|. ++...-|+ ..|..++- T Consensus 236 e~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~-- 313 (432) T protein:vir:10 236 DQYDSFAKKVSGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIES-- 313 (432) T ss_pred HHHHHHHHHHhhhhhCCCceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHH-- Confidence 1222222 2335667777777666665544444444556777888888888875 22222222 11222211 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeC--CCCCCCHHHHHHHHHHHHhcCCCcHHHHHHh Q lcl|NC_011308. 384 TLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIE--PYILANELDLAMIDKTEAETNQIQINNLLAI 461 (530) Q Consensus 384 ~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~--~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~ 461 (530) ....+++..|.-.++.|...++.+-........+.+.|+ .-+-.|..+.++...++..+|+++...+++. T Consensus 314 --------~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~ 385 (432) T protein:vir:10 314 --------QQLGFLSMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREI 385 (432) T ss_pred --------HHHHHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHH Confidence 112223334444444444444332111111123344554 4455688888888889999999999999988 Q ss_pred CCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 462 APR--IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 462 ~~~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) +++ +++-...+ .+ ...+..+... . +...+.|.+..+-...+=+|| T Consensus 386 ~glppi~g~~~~~-~~-------~~~~~pl~~~-----~-----------~~~~~~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 386 EGLPKLGGNAAVL-TV-------QSAMVPLDSI-----G-----------LQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred hCCCCCCCCcceE-ee-------cCcccchhhh-----c-----------ccCCCCCCCCCCCcccccccC Confidence 754 22111000 00 0000000000 0 000111111111111111222 No 205 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=94.21 E-value=0.0051 Score=33.17 Aligned_cols=383 Identities=11% Similarity=-0.040 Sum_probs=142.4 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |.... + . .+.....-.....+..|.-. ...... . -.|++. .-..|+..+.=+..- T Consensus 1 m~~f~---~----~-~~~~~~~~~~~~~~~~~~~~------------~~~~~~-~--Al~~~~--V~~~i~~Ia~~iA~l 55 (406) T protein:vir:97 1 MSFFQ---P----L-GTSKVSYDDYISSVLAGDVS------------QKYLGV-S--ALKNSD--ILTATSIIAGDIARF 55 (406) T ss_pred Ccccc---c----c-CCCCCCcchHHHHHhcCCCC------------cccccc-h--hhccHH--HHHHHHHHHHhhhhC Confidence 21110 0 0 00000000000011111000 000000 0 001111 111344444445555 Q ss_pred ceeeecCCcchHHHHHHHHHHhh---cc---HHHHHHHHHHHHhhcCeEEEEEEecC-CCce-EEEEecccceEEEEcCC Q lcl|NC_011308. 91 GIDVKPTDHDDQKLCYLIEEYYN---EE---FQSAIQELVEGSTIKGYEGIFARTTS-EDKL-TFQTVDALQLLPVFDDY 162 (530) Q Consensus 91 pv~~~~~~~~de~~~~~l~~~~~---~~---~~~~~~e~~~~~~~~G~a~~~~y~d~-~g~~-~~~~~~p~~~~~v~d~~ 162 (530) |+.+...+.+ .....-+..++. |. .......+......+|.||.++-++. .|.+ .+..++|..|-+..++. T Consensus 56 p~~~~~~~g~-~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~ 134 (406) T protein:vir:97 56 PLVKKDVNGD-IIHDEDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDN 134 (406) T ss_pred eeEEEecCcc-ccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCC Confidence 7765433221 111112233331 22 23445667788889999999888875 4554 57788999887766654 Q ss_pred CCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccc Q lcl|NC_011308. 163 GTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGV 242 (530) Q Consensus 163 ~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (530) +.+. |.+..... .. ...+.++.+.|++... T Consensus 135 ~~~~-----y~~~~~~~----~~----~~~~~~~evih~r~~~------------------------------------- 164 (406) T protein:vir:97 135 HEIV-----YTFTDMLT----AK----QVKCFAHDVIHWKFFS------------------------------------- 164 (406) T ss_pred ceEE-----EEEEecCC----ce----EEEEccccEEEecCCC------------------------------------- Confidence 4321 22111000 00 0123344444442100 Q ss_pred cccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcccee-eeecCCCCch--hhHHHH Q lcl|NC_011308. 243 EEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIY-VVRGGTNSPV--DEIKKN 319 (530) Q Consensus 243 ~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~l-vl~g~~~~~~--~~~~~~ 319 (530) + +.-.|.|.++.....|+.-..+..-.++.+..-..|-+ +..+...++. +.++.. T Consensus 165 -------------~---------dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~ 222 (406) T protein:vir:97 165 -------------H---------DTILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMKGAQLSGDARQRARQE 222 (406) T ss_pred -------------C---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEecCCCCCHHHHHHHHHH Confidence 0 00125666655544444333333323333333333333 3333332221 222221 Q ss_pred Hh-------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccccc-CCcHHHHHHHHhhHHHHHH Q lcl|NC_011308. 320 IQ-------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGDG-NATNVVIKSRYTLLAMKAQ 391 (530) Q Consensus 320 ~~-------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~g-n~SGvAik~~~~~l~~ka~ 391 (530) ++ .++++.+++|.++.-++....+.......+.....|...-.+|. .-.| .+++..+ .. T Consensus 223 ~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp---~~lg~~~~~~~~----------e~ 289 (406) T protein:vir:97 223 FEKMREGSVGGSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPS---YKLGVNSPNQSV----------AQ 289 (406) T ss_pred HHHHhcccccCceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCH---HHcCCCCCcchH----------HH Confidence 11 23455566665555554332222222333444555655555663 2222 1222111 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHH Q lcl|NC_011308. 392 KTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEE 469 (530) Q Consensus 392 ~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~ 469 (530) ..+.++...|.-.++.|...++.+-........+.+.|. +-.+....++....+.++|+++...+++.+++ ++++. T Consensus 290 ~~~~f~~~~l~P~~~~ie~~l~~kll~~~~~~~~~i~fd--~~~~~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~ 367 (406) T protein:vir:97 290 LMEDYVTNDLPFYFDAITSELGLKTLNDKDRRLYHIEFD--TRSVTGRNVDEIVKLVNNQILTPNQGLVELGKQKSTDPN 367 (406) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhcChhhccceeEEEe--cCccchhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 122234444444444444444433222212223445553 22233344455567788999999999888754 22221 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccc--cccCCccccCCCCCCCCCCccCcCCCC Q lcl|NC_011308. 470 TLKAICDTLDLDYEDVVKALEDQE--VEELEPTVTPIIDPLTIEPQPEPLNID 520 (530) Q Consensus 470 ~e~~~~e~e~~e~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (530) ...-.+- ..+..+...+ .+.......+.+..++.+ .+ T Consensus 368 gD~~~~~-------~n~~~~~~~~~~~~~~~~~~~gg~~~~~~~-------~~ 406 (406) T protein:vir:97 368 MDRYQSS-------LNYVFLDKKEEYQDKVGIKGKGGEVNAEED-------KS 406 (406) T ss_pred CCeEeec-------cCccchhcccccccccccccCCCCCCCCCC-------CC Confidence 0000000 0000000000 000000001110000000 00 No 206 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=94.05 E-value=0.0055 Score=32.95 Aligned_cols=468 Identities=10% Similarity=0.056 Sum_probs=199.1 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhh--hHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQ--NVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~--~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) |-+++....+ .+.+++..+.+++.. ...+.+.+.+|..-.- +.+ +.... .....|+..+-... T Consensus 1 ~~~~~~~~~~---~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~--~~~-----~~~~~-----~~~~~~~~dst~~~ 65 (543) T protein:vir:88 1 MAETKREGLA---EEGAKAVYERLKNDRVPYETRAENCAKVTIPSL--FPK-----DSDNS-----STDYTTPWQAVGAR 65 (543) T ss_pred CcccccCcch---HHHHHHHHHHHHHHHhHHHHHHHHHHHHhcccc--CCC-----CCCcc-----cccccccccchHHH Confidence 7666654443 334555555555422 2244556666654421 110 00000 01112455555556 Q ss_pred HHhhhhhhhccc--cee--ee--cCCc----------chHHH-------HHHHHHHh-hccHHHHHHHHHHHHhhcCeEE Q lcl|NC_011308. 79 LVDQKTQYLLAN--GID--VK--PTDH----------DDQKL-------CYLIEEYY-NEEFQSAIQELVEGSTIKGYEG 134 (530) Q Consensus 79 Ivd~~~~yl~G~--pv~--~~--~~~~----------~de~~-------~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~ 134 (530) .++..++.|+|- |.+ |. ..+. +-..+ .+.+...+ ..||....+++.++...+|.+. T Consensus 66 a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ 145 (543) T protein:vir:88 66 GLNNLSAKVMLALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTAL 145 (543) T ss_pred HHHHHHHHHHHhhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcee Confidence 666666555542 221 11 1110 00111 12233333 3578889999999999999996 Q ss_pred EEEEecCCC--ceE---EEEecccceEEEEcCCCCceeEEEEEEEEeecc---------cccccceEEEEEEEcCCceEE Q lcl|NC_011308. 135 IFARTTSED--KLT---FQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSD---------ADNKFNSIGHADVWTDTEVWY 200 (530) Q Consensus 135 ~~~y~d~~g--~~~---~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~---------~~~~~~~~~~~evyt~~~~~~ 200 (530) . |.+++. ..+ ++.++-.+.+.-.|..+....++|.+......- .....+.-..++||+. . T Consensus 146 l--y~~~~~~~~~~~~~~~~~pl~~y~v~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~----V 219 (543) T protein:vir:88 146 I--YLPPPDASSNSYNPMKLYTLHNHVVQRDAFGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTH----I 219 (543) T ss_pred e--eeccCccccceecceEEeEcceEEEeeCCCCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEEEE----E Confidence 4 544443 222 333433333333477777777777665322211 0011222234455432 1 Q ss_pred EeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHH Q lcl|NC_011308. 201 YVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIK 275 (530) Q Consensus 201 y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e 275 (530) |...+.+... . ...+. ...........+|...|++.++- +.+|.|--+ T Consensus 220 ~pr~~~~~~~---~--------~~~~~----------------~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~ 272 (543) T protein:vir:88 220 YIDDESGDFL---S--------YQEIE----------------GVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVE 272 (543) T ss_pred EeecCCCccc---c--------ccccc----------------CeeeecCCCccccccCCceeeeeeecCCCccccchHH Confidence 2222211100 0 00000 00000111234466778777664 358999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHH Q lcl|NC_011308. 276 KVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDID 353 (530) Q Consensus 276 ~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L 353 (530) +..+-+-.++.+.-..........+|.+++.-.+......+. ..+.+.+..+..+++..+... .+.......++.+ T Consensus 273 ~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~~~--~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~ 350 (543) T protein:vir:88 273 EYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVRRLV--KAQTGDFVAGRKADIEFLQLEKTADFTVAKSVADAI 350 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcc--cCCCceeecCCCCcceeeecccccchhHHHHHHHHH Confidence 999999999999999999999999998886422222222221 122344544556777666444 3666677777777 Q ss_pred HHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ccccceeeEEeCC Q lcl|NC_011308. 354 ELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMK-AQKTEIALRKTLRWTADLVVEDIRRRGLG-DYSSTDIKFDIEP 431 (530) Q Consensus 354 ~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~k-a~~ke~~f~~~l~~~~~~i~~~l~~~~~~-~~d~~~i~i~f~~ 431 (530) +..|-..-+.-.+.....+..++..+..+-.-+... -....+.-.+.|.=+++-++.++...+.- ......+++.|.. T Consensus 351 ~~rI~~af~~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs 430 (543) T protein:vir:88 351 EARLSYVFMLNSAVQRSGERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNLPQEAVEPTVTT 430 (543) T ss_pred HHHHHHHHhhhhhccCCCCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeEEe Confidence 776643222211211122334555444332222211 12222222222322333333444333321 1222345666653 Q ss_pred CCCC-CH-HHHHHHHHHHHhcCCCcHH---------HHHHhC----C-----CCCCHHHHHHHHHHHHHHHHHHHHhhh- Q lcl|NC_011308. 432 YILA-NE-LDLAMIDKTEAETNQIQIN---------NLLAIA----P-----RIGDEETLKAICDTLDLDYEDVVKALE- 490 (530) Q Consensus 432 ~~P~-n~-~e~a~~~~~~~~~g~iS~e---------t~l~~~----~-----~vdd~~~e~~~~e~e~~e~~~~~~~~~- 490 (530) .+.. .. .+...+...+...|.+++. .++..+ + ++.. +++.+++.+++.+.+..+.... T Consensus 431 ~l~~l~r~~~~~~l~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~~i~r~-~~e~~~~~~q~~~q~~~~~~~~~ 509 (543) T protein:vir:88 431 GAEALGRGQDLDKLTQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTAGLLLT-EAEKAQAQSQEMLKQGGLNAAAG 509 (543) T ss_pred cHHHHHHHHHHHHHHHHHHHHHhccchhhhccCCHHHHHHHHHHHhCCChhhhcCC-HHHHHHHHHHHHHHHHHHHHHHH Confidence 3321 00 1111112222222333332 222221 1 1222 3344444433332222222111 Q ss_pred ccccc--c--CCccc-cCCCCCCCCCCccCcCCCCc Q lcl|NC_011308. 491 DQEVE--E--LEPTV-TPIIDPLTIEPQPEPLNIDP 521 (530) Q Consensus 491 ~~~~~--~--~~~~~-~~~~~~~~~~~~~~~~~~~~ 521 (530) .+... + ..++. +... +....++-|.+.+- T Consensus 510 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~~p~~~~~ 543 (543) T protein:vir:88 510 IGSGVAAQATASPEAMESAM--DTAGVQPGPIATQV 543 (543) T ss_pred HhhchhhhhccChHHHHHHh--hhcCCCCCCCCCCC Confidence 11100 0 01111 0000 01112222222222 No 207 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=93.03 E-value=0.009 Score=31.79 Aligned_cols=415 Identities=11% Similarity=0.061 Sum_probs=149.5 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhc-ccchhhhcccccccccccccccc--cC-CcceeecCchhhHHhhhhhh Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYN-QDNDIENTRIMWMNDHGDIVEDD--NA-SNIKISHGFFAELVDQKTQY 86 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~-g~~~I~~r~~~~~~~~~~~~~~~--~~-~n~ki~~n~~k~Ivd~~~~y 86 (530) |+ +-+.-.+++..++.+..+.+ +.+. .--..-.++.+.+...+....-. .. .-+| ....++.||+..+.= T Consensus 1 ~~--~~~~~~~~~~~~~~~~~~~r---d~l~~~~~glg~~r~~~~~~~g~~~~~~~~~l~~~Yr-~~~ia~~iVd~~~d~ 74 (449) T protein:vir:10 1 MT--DKLTLAVNHALNDARMARAR---MGLMVPTMGLDNKRHSAWCEYGFPELVTYENLYSLYR-RGGIAHGAVEKLVGK 74 (449) T ss_pred Cc--hhhHHHHhhhcchhHHHHHH---HHHHHHHhcCCcccchhhhhcCCcccCCHHHHHHHHh-cCchhHHHHHhhhhh Confidence 33 23444455555444433322 2111 10001011111111111110000 00 0001 134566778877664 Q ss_pred hc-ccceeeecCCcchH----HHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEcC Q lcl|NC_011308. 87 LL-ANGIDVKPTDHDDQ----KLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFDD 161 (530) Q Consensus 87 l~-G~pv~~~~~~~~de----~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d~ 161 (530) ++ ..|..+...+.+.. .....+++++..++.+.+.++.+.+..+|.|+.++-.+. |+.-- .| +- . T Consensus 75 ~~~~~~~i~~g~~~~~~~~~~~~e~~~~~l~~~~~~~~l~ea~~~~rl~Gga~i~i~v~d-~~~l~---~P-----l~-~ 144 (449) T protein:vir:10 75 CWQTNPEIIEGDDADDSEDETSWEKKSKQVFTNRLWRSFAEADRRRLVGRYAGILLHIRD-EKDWN---LP-----AT-K 144 (449) T ss_pred hhhcCcccccCccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcEEEEEEecC-CCCCC---cc-----cc-c Confidence 43 22332322222111 123344454445566678888888888998988776642 32111 12 11 1 Q ss_pred CCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccc Q lcl|NC_011308. 162 YGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEG 241 (530) Q Consensus 162 ~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 241 (530) ...+..+.-+|.......... .+ ..--.++-+ ..|.+...-.+. T Consensus 145 ~~~i~~i~v~~~~~i~~~~~~-~d-p~sp~yg~P-~~y~v~~~~~g~--------------------------------- 188 (449) T protein:vir:10 145 GRGLQKVSVSWAGSLKVAEWD-TG-INSKTYGQP-KLWKYTERLPNG--------------------------------- 188 (449) T ss_pred CcceeeEEeeccccCChhhhh-cC-CCCCCCCCc-eEEEEeeeccCC--------------------------------- Confidence 111111111111000000000 00 000000000 011111000000 Q ss_pred ccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHH-----HHHHHHhccce---------eeeec Q lcl|NC_011308. 242 VEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFL-----SNNLQDMAEAI---------YVVRG 307 (530) Q Consensus 242 ~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~-----~n~~~~~~~~~---------lvl~g 307 (530) ...+ ..-|+=..+.++... ..|.|.++.+-.-+-.++.+.-.. .+..+...-.. .-+.+ T Consensus 189 ~~~~-----~~iH~SRl~~~~~~~--~~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~~ 261 (449) T protein:vir:10 189 SSRR-----VDIHPDRVFILGDYS--EDAIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLYG 261 (449) T ss_pred Cccc-----eeeccceeEeecCCC--CCChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHhh Confidence 0000 001111111111110 124444544322222222221111 12111111000 01112 Q ss_pred CCCCchhh-HH---HHH-hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccc---cc-CCcHHH Q lcl|NC_011308. 308 GTNSPVDE-IK---KNI-QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVG---DG-NATNVV 378 (530) Q Consensus 308 ~~~~~~~~-~~---~~~-~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~---~g-n~SGvA 378 (530) .+.+...+ +. ..+ +....+.++.++++ -+.+.+.......++...+.+-..+.+|-.--.+ .| |++| - T Consensus 262 ~~~e~~~~~~~~~~~~~~~~~~~~~i~~~~d~--~~~~~~~sgl~d~l~~~~q~iaaa~~IP~t~L~Gqsp~glnst~-D 338 (449) T protein:vir:10 262 VSIDELQDKFNEVAGEINRGNDVLMTTQGATV--TPLVTSVADPTATYNVNLQTAAAGVDIPTRILIGNQQAERSSTE-D 338 (449) T ss_pred CCchHHHHHHHHHHHHHhccchheeecCCcce--EEEecccCChhHHHHHHHHHHHHHhCCCeeeeeccCccccccch-h Confidence 22222111 11 111 12223345555554 4555667778888898889999999999532111 13 2344 3 Q ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHH Q lcl|NC_011308. 379 IKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNL 458 (530) Q Consensus 379 ik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~ 458 (530) ++. |.... .+ ++..++..|++++.++... +.++-+ .+++|.|++=..-+++|.|++..+...+ ..++ T Consensus 339 ~~n-yyd~i-~~--~Q~~l~p~le~l~~~l~~s----~~g~~~-~d~~i~f~pL~~~t~kEkAei~k~~A~a----~~~~ 405 (449) T protein:vir:10 339 QKY-FNARC-QS--RRVDLSFEIEDFCDKLIEL----KIIDAV-AKKAVIWDDLNEQTGTEKLTNAKTMGEI----NQTM 405 (449) T ss_pred HHH-HHHHH-HH--HHHhhhHHHHHHHHHHHHh----hcCCCC-CceeEEeCCCCCCCHHHHHHHHHHHHHH----HHHH Confidence 433 33332 33 3335789999988876432 222222 3699999998888999998876554432 1222 Q ss_pred HHhC--CCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCccc Q lcl|NC_011308. 459 LAIA--PRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVI 523 (530) Q Consensus 459 l~~~--~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (530) +..- +-+ ++++.-+.. .... .+..+..+++ ++.++ .+.++.- T Consensus 406 ~~ag~~~~~-~~~EiR~~~------------~~~~---~~~~~~~~e~-~de~~------~~~d~~a 449 (449) T protein:vir:10 406 LGSGDNPAF-SREEIRTAA------------GYDN---DDEEPLGEED-GDEED------KATDSAA 449 (449) T ss_pred HHccccCCc-CHHHHHHHh------------cccC---CCCCCCCCCC-Ccccc------ccCCcCC Confidence 2211 111 222110000 0000 0000000000 00000 0111111 No 208 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=92.80 E-value=0.0099 Score=31.57 Aligned_cols=457 Identities=9% Similarity=0.007 Sum_probs=193.9 Q ss_pred ccHHHHHHHHHHHHHHhh--hHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQ--NVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLL 88 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~--~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~ 88 (530) |. +..++..+.+++.. -..+.+.+.+|..-.- .. .+ +.. ......|+..+-....++..++.|+ T Consensus 1 mk--~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~--~~-----~~--~~~---~~~~~~~~~dstg~~a~~~Laa~l~ 66 (542) T protein:vir:78 1 MK--GLAQARYSAMRADREDFLDMARRCAALTLPYL--LT-----ED--GHA---SGGRLQQPYQSLGSKGVNALSSKLM 66 (542) T ss_pred Ch--hHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--CC-----CC--CCc---ccccccccccchHHHHHHHHHHHHH Confidence 32 33445555554421 1234455555543211 00 00 000 0111235556666667777766665 Q ss_pred cc--cee---e--ecCCcc-------h----HHHH-------HHHHHH-hhccHHHHHHHHHHHHhhcCeEEEEEEecCC Q lcl|NC_011308. 89 AN--GID---V--KPTDHD-------D----QKLC-------YLIEEY-YNEEFQSAIQELVEGSTIKGYEGIFARTTSE 142 (530) Q Consensus 89 G~--pv~---~--~~~~~~-------d----e~~~-------~~l~~~-~~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~ 142 (530) |- |+. | ...+.. + ..+. ..+... ...||....+++.++...+|.+. +|.+++ T Consensus 67 ~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--l~~~~~ 144 (542) T protein:vir:78 67 LSLFPIQTSFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVL--VFAGKK 144 (542) T ss_pred HhhcCCCCccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEE--EEecCC Confidence 42 221 2 111100 1 1111 122233 34688899999999999999995 566765 Q ss_pred CceEEEEecccceEEEEcCCCCceeEEEEEEEEeecccc-----------------cccceEEEE-EEEcCCceEEEeec Q lcl|NC_011308. 143 DKLTFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDAD-----------------NKFNSIGHA-DVWTDTEVWYYVQK 204 (530) Q Consensus 143 g~~~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~-----------------~~~~~~~~~-evyt~~~~~~y~~~ 204 (530) . +++++-.+.++--|..+....++|.+......-.. ........+ .++..+....|... T Consensus 145 ~---~~~~pl~~y~v~~d~~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~ 221 (542) T protein:vir:78 145 T---LKVYPLDRYVIERDGDGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCC 221 (542) T ss_pred C---ceEEecceeEEeeCCCCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCcccccc Confidence 3 44555555555567788887777776532110000 000000000 01111111111100 Q ss_pred CCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHHH Q lcl|NC_011308. 205 DEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVKS 279 (530) Q Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~ 279 (530) ... ......+..+ ............+|...|++.++- +.+|.|--++..+ T Consensus 222 ~~~------------~~~~s~~~e~-------------~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~ 276 (542) T protein:vir:78 222 KLV------------DGQHRWHQEC-------------DGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFG 276 (542) T ss_pred ccC------------CCeEEEEEEe-------------ccccccccccccccccCCceeeeeeecCCCccccchHHHHHH Confidence 000 0000000000 000000112345777778877664 3589999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEE--ecCCHHHHHHHHHHHHHHH Q lcl|NC_011308. 280 IIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQT--VDIPYEARKAKMDIDELNI 357 (530) Q Consensus 280 liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt--~~~~~~~~e~~ld~L~~~I 357 (530) -+-.++.+.-..........+|.+.+.-.+......+. -.+.+.+..+..+++..+. ...+.......++.++..| T Consensus 277 D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~~~~~~~~--~~~~g~iv~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI 354 (542) T protein:vir:78 277 DLSSLDALTRSLIEGSAAAAKVVFMVSPSATTKPQSLA--RAGTGAIIQGRAEDVSVVQANKGADFRTVQEMIRDLSQRI 354 (542) T ss_pred HHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcc--cCCCceeecCCccceeeeecccccchhHHHHHHHHHHHHH Confidence 99999999999999999999998776322221111111 1223444445556676554 3345666777788777777 Q ss_pred HHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHhcCC-CccccceeeEE Q lcl|NC_011308. 358 YRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRW--------TADLVVEDIRRRGL-GDYSSTDIKFD 428 (530) Q Consensus 358 ~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~--------~~~~i~~~l~~~~~-~~~d~~~i~i~ 428 (530) -..-+.=+ .-+....++..+.. ++..++..++..+.+ +++-++.++...+. .......+++. T Consensus 355 ~~aFl~~~--~~d~~rvTAtEV~~-------r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~lv~~~ 425 (542) T protein:vir:78 355 SDAFLILN--VRQSERTTATEVRE-------VQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLPKGLVMPT 425 (542) T ss_pred HHHhcccc--cCCcccccHHHHHH-------HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeee Confidence 44322111 11112335544433 233333334433332 22223333433332 11222236777 Q ss_pred eCCCCCCCH-----HHHHHHHHHHHh-cC------CCcHHHHHHhC------C---CCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 429 IEPYILANE-----LDLAMIDKTEAE-TN------QIQINNLLAIA------P---RIGDEETLKAICDTLDLDYEDVVK 487 (530) Q Consensus 429 f~~~~P~n~-----~e~a~~~~~~~~-~g------~iS~et~l~~~------~---~vdd~~~e~~~~e~e~~e~~~~~~ 487 (530) |.-.|.+-. .-+.+.+..... .| .+.-..++..+ | ++..+ ++++++.++..+ +..+. T Consensus 426 ~~s~La~~~r~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~~i~~s~-e~~~~~~~q~q~-~~~~~ 503 (542) T protein:vir:78 426 VVAGLGGVGRGEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTLNLVKSP-ETMANEAQQAQQ-QQMTA 503 (542) T ss_pred eechHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHhhccCCH-HHHHHHHHHHHH-HHHHH Confidence 766654311 111111111111 11 12222222221 1 22322 333333322222 22333 Q ss_pred hhhccccc-cCCccccCCC-CCCCCCCccCcCCCCccccccc Q lcl|NC_011308. 488 ALEDQEVE-ELEPTVTPII-DPLTIEPQPEPLNIDPVIEEEP 527 (530) Q Consensus 488 ~~~~~~~~-~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 527 (530) .+..+... ...+-+++.. ..+++. +. ...+|.|-++. T Consensus 504 al~~~a~~~a~~~~~~~~~~~~~a~~-~~--~~~~~~~~~~~ 542 (542) T protein:vir:78 504 SLMGQAGQLAKSPIGEKMMQQINAPG-QE--APAGPQTGEDL 542 (542) T ss_pred HHHHhhhhccccccccchhhhcCCCC-cC--CCCCCcccccC Confidence 33222211 1111111111 001111 11 11233344444 No 209 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=92.78 E-value=0.01 Score=31.54 Aligned_cols=465 Identities=11% Similarity=0.027 Sum_probs=190.6 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhh--hHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQ--NVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~--~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) |-..... -..+-+++..+.+++.. -..+.+.+.+|....- ... .+.. ......|+..+-+.. T Consensus 1 m~~~~~~----~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~--~~~----~~~~------~~~~~~~~~dst~~~ 64 (536) T protein:vir:21 1 MAEKRTG----LAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSL--FPK----DSDN------ASTDYQTPWQAVGAR 64 (536) T ss_pred Ccchhhc----hhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--cCC----CCCc------ccccccccccccHHH Confidence 3331111 12234555555554421 1234455555543321 100 0000 111223566666666 Q ss_pred HHhhhhhhhccc--cee--ee--cCCcc-------h----------HHHHHHHHHHh-hccHHHHHHHHHHHHhhcCeEE Q lcl|NC_011308. 79 LVDQKTQYLLAN--GID--VK--PTDHD-------D----------QKLCYLIEEYY-NEEFQSAIQELVEGSTIKGYEG 134 (530) Q Consensus 79 Ivd~~~~yl~G~--pv~--~~--~~~~~-------d----------e~~~~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~ 134 (530) .++..++.|+|- |.+ |. ..+.+ . +.....+...+ ..||....+++.++...+|.+. T Consensus 65 a~~~Laa~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ 144 (536) T protein:vir:21 65 GLNNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVL 144 (536) T ss_pred HHHHHHHHHHHhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEe Confidence 666666655541 211 11 11100 0 11222233333 3678889999999999999987 Q ss_pred EEEEecCCCce-EEEEecccceEEEEcCCCCceeEEEEEEEEeec----------ccccccceEEEEEEEcCCceEEEee Q lcl|NC_011308. 135 IFARTTSEDKL-TFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYS----------DADNKFNSIGHADVWTDTEVWYYVQ 203 (530) Q Consensus 135 ~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~----------~~~~~~~~~~~~evyt~~~~~~y~~ 203 (530) .|+--+..+.. .++.++-.++++--|..+....++|.+...... ......+.-..+++|+.- |.. T Consensus 145 ly~~e~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v----~~~ 220 (536) T protein:vir:21 145 LYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHI----YLD 220 (536) T ss_pred EEEeeCCCCceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEE----EEe Confidence 54433332223 366677667776778888887777766532210 000111222223333210 111 Q ss_pred cCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHH Q lcl|NC_011308. 204 KDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVK 278 (530) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~ 278 (530) .+++.. ..+..+ ...........++|...|++.++- +.+|.|--++.. T Consensus 221 ~~~~~~--------------~~~~e~-------------~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l 273 (536) T protein:vir:21 221 EDSGEY--------------LRYEEV-------------EGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYL 273 (536) T ss_pred cCCCcE--------------EEEecc-------------CCeeeccccCccccccCCeeeeeeeecCCCccccchHHHHH Confidence 111100 000000 000011122335688888887764 358999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEe--cCCHHHHHHHHHHHHHH Q lcl|NC_011308. 279 SIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTV--DIPYEARKAKMDIDELN 356 (530) Q Consensus 279 ~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~--~~~~~~~e~~ld~L~~~ 356 (530) +-+-.++.+.-..........+|.+.+.=.+......+. ..+.+.+..+..+++..+.. ..+.......++.++.. T Consensus 274 ~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~--~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~r 351 (536) T protein:vir:21 274 GDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRRLT--KAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEAR 351 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhhhc--cCCCcceecCCcccceeeeccccccchHHHHHHHHHHHH Confidence 999999988877777777777765554211111111111 11223343344455554433 34555566667666666 Q ss_pred HHHHhcccCCCcccccCCcHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ccccceeeEEeCCCCC Q lcl|NC_011308. 357 IYRSGMGFNSSAVGDGNATNVVIKSRYTLLAM-KAQKTEIALRKTLRWTADLVVEDIRRRGLG-DYSSTDIKFDIEPYIL 434 (530) Q Consensus 357 I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~-ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~-~~d~~~i~i~f~~~~P 434 (530) |-..-+.-.+.....+..++..+..+-.-+.+ .-....+.-.+.|.=+++-++.++...+.. ......+++.+.-.+. T Consensus 352 I~~af~~~~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~ 431 (536) T protein:vir:21 352 LSFAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLE 431 (536) T ss_pred HHHHHhhhhcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHH Confidence 63322221121122223455554443332222 112222333333333333344444333321 1222234555554443 Q ss_pred CC-----HHHHHHHHHHHHhcC------CCcHHHHHHhC----C-----CCCCHHHHHHHHHHHHHHHHH---HHHhhhc Q lcl|NC_011308. 435 AN-----ELDLAMIDKTEAETN------QIQINNLLAIA----P-----RIGDEETLKAICDTLDLDYED---VVKALED 491 (530) Q Consensus 435 ~n-----~~e~a~~~~~~~~~g------~iS~et~l~~~----~-----~vdd~~~e~~~~e~e~~e~~~---~~~~~~~ 491 (530) .- ...+.+.+..+.+.+ .+.-..++..+ + ++.. ++|++.+.+++.+... .+..+.. T Consensus 432 ~l~r~~~~~~l~~~~~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~~irt-~eev~~~r~q~~~~~~~~~~a~~~~~ 510 (536) T protein:vir:21 432 AIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLT-EEQKQQKMAQQSMQMGMDNGAAALAQ 510 (536) T ss_pred HHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCChhhhcCC-HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 21 111111111111111 12333333221 1 2222 3333333333222211 1111111 Q ss_pred cccc--cCCccccCCCCCCCCCCccCcCCCCccc Q lcl|NC_011308. 492 QEVE--ELEPTVTPIIDPLTIEPQPEPLNIDPVI 523 (530) Q Consensus 492 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (530) .... -..++. +....+.. +.+|.. T Consensus 511 ~~~~~~~~~~~~---~~~~~~~~-----g~~~~~ 536 (536) T protein:vir:21 511 GMAAQATASPEA---MAAAADSV-----GLQPGI 536 (536) T ss_pred HHHHHHhcChhh---HHhhhhcc-----ccCCCC Confidence 1100 000100 00111111 111222 No 210 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=92.31 E-value=0.012 Score=31.13 Aligned_cols=178 Identities=10% Similarity=0.096 Sum_probs=85.9 Q ss_pred eeeeecCC---CCchhhHHHHH------hh-CcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCC---- Q lcl|NC_011308. 302 IYVVRGGT---NSPVDEIKKNI------QS-KKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSS---- 367 (530) Q Consensus 302 ~lvl~g~~---~~~~~~~~~~~------~~-~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~---- 367 (530) .+.++|.. ..+.+..+..+ +. ...+.++++ +-+|-+.+.+..++...++.....|-..+.+|-.- T Consensus 1 V~k~~~l~~~~~~~~~~~~~r~~~~~~~~~~~~~~~ld~~-~e~~e~~~~~lsGl~d~l~~~~~~iaa~s~iP~t~LfG~ 79 (201) T protein:vir:10 1 MWKAKGLADLCDDSDGAARLRLAQVDNNSGVGQAIGIDAD-SEEYNVLNSDIGGIDTFLSQKFDRIVALSGIHEIILKGK 79 (201) T ss_pred CccchHHHHHhcCChHHHHHHHHHHHHhhhhhhhheeecC-CcceeeeecCcCChHHHHHHHHHHHHhHhcCchhhhcCC Confidence 22222210 01111111111 11 122223322 23466777888899999999999999999999532 Q ss_pred -cccccCCcHH-HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHH Q lcl|NC_011308. 368 -AVGDGNATNV-VIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDK 445 (530) Q Consensus 368 -~~~~gn~SGv-Aik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~ 445 (530) ..++ |+||. .++--|-.+. ...++.++..|++++.++.. ..+++|.|++=..-+++|.|++.. T Consensus 80 sp~Gl-natge~d~~nyyd~i~---~~Qe~~l~p~le~l~~~~~~-----------~~~~~~~f~pL~~~s~kekAei~~ 144 (201) T protein:vir:10 80 NVGGV-SASQNTALETFYGYVD---RKRKAELLPLLEFLLPFIVT-----------EQEWSVEFNPLSQVSDKDKSEILE 144 (201) T ss_pred CCccc-cccchhHHHHHHHHHH---HHHHHHHHHHHHHHHHhhcC-----------CCCceEeeCCCCCCCHHHHHHHHH Confidence 2222 45665 4433333332 33357788888887775421 246899999999999989887754 Q ss_pred HHH-------hcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCC Q lcl|NC_011308. 446 TEA-------ETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLN 518 (530) Q Consensus 446 ~~~-------~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (530) +.. ..|++|.+.+... +... .....-.....+++.+. ...++|...|.| T Consensus 145 ~~a~a~~~~~~~g~i~~~e~r~~-------------L~~~--------~~~~~~~~~~~~~~~~~---~e~~dp~~~~~~ 200 (201) T protein:vir:10 145 KNVNSVAALIAAGIIDADEARDT-------------LRAI--------STEVKIGEGSIQTEVVI---NESEDPLDVSAN 200 (201) T ss_pred HHHHHHHHHHHcCCCCHHHHHHH-------------HHhc--------CCcCCCCCCCCCccccc---cccCCCCCCCCC Confidence 433 3333333333222 1111 00000000111111110 111222222333 Q ss_pred C Q lcl|NC_011308. 519 I 519 (530) Q Consensus 519 ~ 519 (530) . T Consensus 201 ~ 201 (201) T protein:vir:10 201 N 201 (201) T ss_pred C Confidence 3 No 211 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=92.13 E-value=0.013 Score=30.98 Aligned_cols=423 Identities=11% Similarity=0.089 Sum_probs=172.2 Q ss_pred CCcccccCCcccHHHHHHHHHHHH-HHhhhHHHHHHHHH----------HhcccchhhhcccccccccccccccccCCcc Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEY-IRSQNVSLARVGQR----------YYNQDNDIENTRIMWMNDHGDIVEDDNASNI 69 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~-~~~~~~~~~~~~~~----------YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ 69 (530) .+.-+..+.- -+ .++. +-+....+ +|.+..=|-.-.- . T Consensus 69 ~~~~~~~~~~------------~~~~~~~--~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~l-----------------a 117 (695) T protein:vir:36 69 LARQFEVDVS------------NYTPRER--RAASYALDFNGTSMDALSFVTSSGFPGFPTL-----------------V 117 (695) T ss_pred cceeceeccc------------ccCcccc--chhhhhhcccccccccchhhhccCcchHHHH-----------------H Confidence 2221111100 00 0000 00000000 1111000000000 0 Q ss_pred ee-ecCchhhHHhhhhhhhcccceeeec-----------------CCcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhc Q lcl|NC_011308. 70 KI-SHGFFAELVDQKTQYLLANGIDVKP-----------------TDHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIK 130 (530) Q Consensus 70 ki-~~n~~k~Ivd~~~~yl~G~pv~~~~-----------------~~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~ 130 (530) -| .++-.+.++...+..+.=+-+..+. ....+-+..+.|+.-++ =+....+.+..+.+-.| T Consensus 118 ~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V~~~l~eaik~aRlf 197 (695) T protein:vir:36 118 LLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAF 197 (695) T ss_pred HHhhccchhhHHHHHHHHhhcccceecccchhhhhhccccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 00 0111122222222222211111110 01011123344444332 25567888999999999 Q ss_pred CeEEEEEEecCCC-----------------ceE-EEEecccceEEEE-cCCCCceeEEEEEEEEeecccccccceEEEEE Q lcl|NC_011308. 131 GYEGIFARTTSED-----------------KLT-FQTVDALQLLPVF-DDYGTLQRIIRFYTEQRYSDADNKFNSIGHAD 191 (530) Q Consensus 131 G~a~~~~y~d~~g-----------------~~~-~~~~~p~~~~~v~-d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~e 191 (530) |.+..++-++.++ .++ +.+++|.++.|-. +-...+.+ .||....|.-. +.. T Consensus 198 GGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~sp--dfgkP~~y~V~---G~k----- 267 (695) T protein:vir:36 198 GRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVAD--DFYKPSTWWMI---GTE----- 267 (695) T ss_pred cceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhhccchhh--ccCCCceEEEe---ceE----- Confidence 9998777665433 111 5677888887722 11111111 12211111000 000 Q ss_pred EEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCC Q lcl|NC_011308. 192 VWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGI 271 (530) Q Consensus 192 vyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~ 271 (530) +|.. ++..+ . +.+.... .+ |. ++-.|. T Consensus 268 -------IH~S----------------------RL~~f---~-------g~plPd~-LK---------p~----y~~~Gi 294 (695) T protein:vir:36 268 -------VHAT----------------------RLHTI---V-------SRPVGDM-LK---------PT----YSFAGI 294 (695) T ss_pred -------Eeee----------------------eEEEe---c-------CCCchhh-hh---------cc----cccCcc Confidence 1100 00000 0 0000000 00 00 122467 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHhccceee------eecCCCCchh---hHHHHHhh-CcceecCCCCceeEEEecC Q lcl|NC_011308. 272 SDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYV------VRGGTNSPVD---EIKKNIQS-KKIIQTKGEGGLDIQTVDI 341 (530) Q Consensus 272 sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lv------l~g~~~~~~~---~~~~~~~~-~~~i~~~~~~~~~~lt~~~ 341 (530) |....+.+-+++++...-..+..+..+.-..+. +.+....+.. +.....+. .+++.++ +++=+|.+++. T Consensus 295 Sv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk~dla~aL~~g~~~~l~~R~eli~~~Rsn~G~~llD-k~~Eefeq~st 373 (695) T protein:vir:36 295 SMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGILMDLAQALMPGANVDLSMRAELINRYRDNRNILFLD-KATEEFFQFNT 373 (695) T ss_pred cHHHHHHHHHHHHHHHHhHHHHHHHhhhHHHHHHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEe-cCCcceEEEec Confidence 777777888888877766666666444333221 1111100100 01111222 2233444 33446778889 Q ss_pred CHHHHHHHHHHHHHHHHHHhcccCCC-----cccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_011308. 342 PYEARKAKMDIDELNIYRSGMGFNSS-----AVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRG 416 (530) Q Consensus 342 ~~~~~e~~ld~L~~~I~~~s~~p~~~-----~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~ 416 (530) +..++...+.+....|-..+.+|-.. ..+| |+||..=..-|.+.. ....+..++..|++++.+|. ++..+ T Consensus 374 slSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGl-NATGE~D~rnYYD~I--~s~Qe~~L~p~L~rl~~ii~--rS~~G 448 (695) T protein:vir:36 374 PLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGL-NASSEGEIRVWYDYV--RAYQRNALQQLMNDVIVMIQ--LSLFG 448 (695) T ss_pred ccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccc-cccchhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHH--HHhcC Confidence 99999999999999999999999643 3333 678875555555442 34558899999999888873 33334 Q ss_pred CCccccceeeEEeCCCCCCCHHHHHHHHHH-------HHhcCCCcHHHHHHhC------CCCC--CHHHHHHHHHHHHHH Q lcl|NC_011308. 417 LGDYSSTDIKFDIEPYILANELDLAMIDKT-------EAETNQIQINNLLAIA------PRIG--DEETLKAICDTLDLD 481 (530) Q Consensus 417 ~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~-------~~~~g~iS~et~l~~~------~~vd--d~~~e~~~~e~e~~e 481 (530) . .+. +|+++|++=.--++.|.|++..+ ....|+|+...+...+ +|.. |...+--...+.+.+ T Consensus 449 ~--idp-di~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~ 525 (695) T protein:vir:36 449 A--VDP-SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDID 525 (695) T ss_pred C--CCC-cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhh Confidence 3 344 68899998777777888776433 4456777666655553 2211 100000000000000 Q ss_pred HHHHHHhhhc-cccccCCccc---cCCCCCCCCCCccCcCCCCccccc--ccCCC Q lcl|NC_011308. 482 YEDVVKALED-QEVEELEPTV---TPIIDPLTIEPQPEPLNIDPVIEE--EPVQE 530 (530) Q Consensus 482 ~~~~~~~~~~-~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~ 530 (530) -.+...+. .+.++..+.+ .+.+ ++.+..+.+.+.++...+ .|.-+ T Consensus 526 --~~~~~~~~~~~~~~~~~~~~~~~g~~--~~~~v~~~~~~~~~~~ag~~~~~~~ 576 (695) T protein:vir:36 526 --GVLTYVQRLAEGGDTGAPGGARAGAT--APPTVANVNANVNPREAGAQDAAMR 576 (695) T ss_pred --hhHhhhcCcccccccCCCCccccccc--CCCcccccccccCccccCCCCccce Confidence 00000000 0000000000 1100 011111222222221111 11101 No 212 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=91.70 E-value=0.015 Score=30.64 Aligned_cols=471 Identities=9% Similarity=0.009 Sum_probs=188.1 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhh--hHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQ--NVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~--~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) |.-.-+.+. . .++-.++..+.+++.. -..+.+.+.+|..-.- .. .... . ......++..+-+.. T Consensus 1 ~~~~~~~~~-~-~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~--~~-----~~~~--~---~~~~~~~~~dst~~~ 66 (535) T protein:vir:94 1 MASSQKREG-F-AENGAKAVYDALKNDRNSYETRAENCAKYTIPSL--FP-----KDSD--N---ASTDYTTPWQAVGAR 66 (535) T ss_pred CCchhhhhh-H-HHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--CC-----CCCC--c---cccccCCcccccHHH Confidence 211111110 0 1112233333333211 1223334444433210 00 0000 0 111223455566666 Q ss_pred HHhhhhhhhccc--cee--ee--cCC----------cchHHHHHHH-------HHHh-hccHHHHHHHHHHHHhhcCeEE Q lcl|NC_011308. 79 LVDQKTQYLLAN--GID--VK--PTD----------HDDQKLCYLI-------EEYY-NEEFQSAIQELVEGSTIKGYEG 134 (530) Q Consensus 79 Ivd~~~~yl~G~--pv~--~~--~~~----------~~de~~~~~l-------~~~~-~~~~~~~~~e~~~~~~~~G~a~ 134 (530) .++..++.|+|- |.+ |. ..+ .+-.++.+.| ...+ ..||....+++.++...+|.+. T Consensus 67 a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ 146 (535) T protein:vir:94 67 GLNNLASKLMLALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNAL 146 (535) T ss_pred HHHHHHHHHHhhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEe Confidence 666666555541 221 21 111 0001222222 2223 4688889999999999999996 Q ss_pred EEEEecCCCceEEEEecccceEEEEcCCCCceeEEEEEEEEeecccc---------cccceEEEEEEEcCCceEEEeecC Q lcl|NC_011308. 135 IFARTTSEDKLTFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDAD---------NKFNSIGHADVWTDTEVWYYVQKD 205 (530) Q Consensus 135 ~~~y~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~---------~~~~~~~~~evyt~~~~~~y~~~~ 205 (530) .|+-.+.....+|+.++-.+.++-.|..+....++|-+......-.. ...+....+++|+.- |.... T Consensus 147 l~~~~~~~~~~~f~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~v----~~~~~ 222 (535) T protein:vir:94 147 LYIPEPEGTYNPMKLYRLSSYVVQRDAFGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHI----YLDEE 222 (535) T ss_pred EeeccCcCcccceEEEEcCeEEEeeCCCCCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEEE----EeeCC Confidence 65544433335677776666666667788877777665432211000 011222334444321 12111 Q ss_pred CcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHHHH Q lcl|NC_011308. 206 EGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVKSI 280 (530) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~l 280 (530) ++.. ..| ..+ . ...........+|...|++.++- +.+|.|--++..+- T Consensus 223 ~~~~-------------~~~-~e~-~------------g~~~~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D 275 (535) T protein:vir:94 223 SGEY-------------LKY-EEI-D------------GVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGD 275 (535) T ss_pred CCcE-------------EEE-EEe-c------------CeeeccccccCccccCCceeeeeeecCCCccccchHHHHHHH Confidence 1100 000 000 0 00001112345788888887764 35899988999999 Q ss_pred HHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHHHHHHH Q lcl|NC_011308. 281 IDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDIDELNIY 358 (530) Q Consensus 281 iDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L~~~I~ 358 (530) +-.++.+.-..........+|.+++.-.+......+. ..+.+.+..+..+++..+... .+.......++.++..|- T Consensus 276 ~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~--~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~ 353 (535) T protein:vir:94 276 LRSLENLQEAIVKMSMISAKVIGLVNPAGITQVRRLT--KAQTGDFVSGRPEDISFLQLEKAADFSVARAVSEQIEGRLS 353 (535) T ss_pred HHHHHHHHHHHHHHHHHhccCCcccccccccchhhcc--cCCCceeecCCcccceeeecccccchhHHHHHHHHHHHHHH Confidence 9999988777777777777776655321111111111 122344444555666665444 355666666777666664 Q ss_pred HHhcccCCCcccccCCcHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ccccceeeEEeCCCCCC- Q lcl|NC_011308. 359 RSGMGFNSSAVGDGNATNVVIKSRYTLLAM-KAQKTEIALRKTLRWTADLVVEDIRRRGLG-DYSSTDIKFDIEPYILA- 435 (530) Q Consensus 359 ~~s~~p~~~~~~~gn~SGvAik~~~~~l~~-ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~-~~d~~~i~i~f~~~~P~- 435 (530) ..-+.-.+.....+..++..+..+-.-+.. .-....+.-.+.|.=+++-++.++...+.. ......+++.|.-.+.. T Consensus 354 ~af~~~~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~la~l 433 (535) T protein:vir:94 354 YAFMLNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLKQLQATNQIPELPKEAVEPTISTGMEAL 433 (535) T ss_pred HHHhHhhhccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEeehHHHH Confidence 433322222222233455554433222221 112222222233333333334444333321 12222345555443321 Q ss_pred ----CHHHHHHHHHHHHhcC------CCcHHHHHHhC------C---CCCCHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|NC_011308. 436 ----NELDLAMIDKTEAETN------QIQINNLLAIA------P---RIGDEETLKAICDTLDLDYEDVVKALEDQEVEE 496 (530) Q Consensus 436 ----n~~e~a~~~~~~~~~g------~iS~et~l~~~------~---~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~ 496 (530) +..-+.+.+..+.+.+ .+.-..++..+ | ++.. ++|++++.+++.+++..+........ . T Consensus 434 ~r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs-~eev~~~~~q~~~~~~~~~~~~~~g~-~ 511 (535) T protein:vir:94 434 GRGQDLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAIGIDTSGILKT-PEEKQQEMAEAAQGTAMQNAAASAGA-G 511 (535) T ss_pred HHHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCChhhhcCC-HHHHHHHHHHHHHHHHHHHHHHHHHH-h Confidence 1111111111111111 12222222221 1 2222 33333333333333222222221110 0 Q ss_pred CCccccCCCCCCCCCCccCcCCCCcccccc Q lcl|NC_011308. 497 LEPTVTPIIDPLTIEPQPEPLNIDPVIEEE 526 (530) Q Consensus 497 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (530) ..+.+. .+++..+..+ .+.-..|. T Consensus 512 ~~~~~~----~~~~~~~~~~--~~~g~~~~ 535 (535) T protein:vir:94 512 AGTMAT----ASPENMKAAA--AQAGMAPN 535 (535) T ss_pred hhcccc----cChHHHHHHH--HHhccCCC Confidence 000111 0111110000 00011111 No 213 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=91.22 E-value=0.017 Score=30.30 Aligned_cols=390 Identities=11% Similarity=0.012 Sum_probs=160.2 Q ss_pred CCcccccCCcc---------------cHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhccccccccccccccccc Q lcl|NC_011308. 1 MTNTLLTTAPD---------------RLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDN 65 (530) Q Consensus 1 ~~~~~~~~~~~---------------~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~ 65 (530) |- -..+.|. ++...-...+. |...-.. ...-..|++... . -...++- T Consensus 1 m~--kk~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~---------~~~~~~iLr~~~-~----~~ly~~m- 62 (448) T protein:vir:77 1 MA--KRGRKPKELVPGPGSIDPSDVPKLEGASVPVMS-TSYDVVV---------DREFDELLQGKD-G----LLVYHKM- 62 (448) T ss_pred CC--CCCCCCcccCCcccccchhhhhhhccchhhhcc-ccccccc---------ccchhHhhcccc-c----hHHHHHH- Confidence 11 0111110 00000000000 0000000 000001111000 0 0000000 Q ss_pred CCcceeecCchhhHHhhhhhhhcccceeeecCC--cchHHHHHHHHHHhhc--------cHHHHHHHHHHHHhhcCeEE- Q lcl|NC_011308. 66 ASNIKISHGFFAELVDQKTQYLLANGIDVKPTD--HDDQKLCYLIEEYYNE--------EFQSAIQELVEGSTIKGYEG- 134 (530) Q Consensus 66 ~~n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~--~~de~~~~~l~~~~~~--------~~~~~~~e~~~~~~~~G~a~- 134 (530) . ......-.+.+...-+.|.+..+.... ..+.+..+++.+.+.. +|.+++..+ .++..+|.+. T Consensus 63 ----~-~D~hi~s~l~~Rk~av~~~~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~-lda~~~G~s~~ 136 (448) T protein:vir:77 63 ----L-SDGTVKNALNYIFGRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGMAAG 136 (448) T ss_pred ----h-hChHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHH-HHhhhhcceeE Confidence 0 023344556666677888898886432 2345566777776542 466666666 5788889865 Q ss_pred EEEEe-cCCCceEEEEe---cccce-EEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCccc Q lcl|NC_011308. 135 IFART-TSEDKLTFQTV---DALQL-LPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRS 209 (530) Q Consensus 135 ~~~y~-d~~g~~~~~~~---~p~~~-~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~ 209 (530) +++|. ..+|.+....+ +|... +..|+..+.+. ++...+... T Consensus 137 Eivw~~~~dg~~~~~~l~~r~~~~~~~f~~~~~~~l~----------------------------------~~~~~~~~~ 182 (448) T protein:vir:77 137 EIVLTLGADGKLILDKIVPIHPFNIDEVLYDEEGGPK----------------------------------ALKLSGEVK 182 (448) T ss_pred EEEEeecCCCceeeccccccCCCccceeeeecCCceE----------------------------------EEecCCccc Confidence 67774 45676643322 33211 11222222211 111000000 Q ss_pred chhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEee--CCcCCCCcHHHHHHHHHHHHHH Q lcl|NC_011308. 210 DEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILY--NNKLGISDIKKVKSIIDDYDLM 287 (530) Q Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~--nn~~~~sd~e~v~~liDa~~~~ 287 (530) .. . ......+.|++++=+.... .|-.|.|.+..+--..=-=... T Consensus 183 ~~-~---------------------------------~~~~~~~lP~~~~i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~ 228 (448) T protein:vir:77 183 GG-S---------------------------------QFVNGLEIPIWKTVVFLHNDDGSFTGQSALRAAVPHWLAKRAL 228 (448) T ss_pred cc-c---------------------------------cCCCccccccceEEEEecCCcCCcccchHHHHHHHHHHHHHhh Confidence 00 0 0000001123333111111 1235677777654444444567 Q ss_pred HHHHHHHHHHhccceeeeecCCCCc--hh------hHHHHHh--hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHH Q lcl|NC_011308. 288 NCFLSNNLQDMAEAIYVVRGGTNSP--VD------EIKKNIQ--SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNI 357 (530) Q Consensus 288 ~S~~~n~~~~~~~~~lvl~g~~~~~--~~------~~~~~~~--~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I 357 (530) +.+.+..++.|..|+.+.+-..+.+ .+ ....+++ ...++.++.+..+++++...........+++..+.| T Consensus 229 ~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~g~~ie~~ea~~~~~~~~~~i~~~d~~I 308 (448) T protein:vir:77 229 ILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGI 308 (448) T ss_pred HHHHHHHHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCccCHHHHHHHHHHHH Confidence 7888999999999999988432221 11 1222332 223456788899999987765555666788888888 Q ss_pred HHHhcccCCCcccccCCcHHHHHH---HHhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCC Q lcl|NC_011308. 358 YRSGMGFNSSAVGDGNATNVVIKS---RYTL-LAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYI 433 (530) Q Consensus 358 ~~~s~~p~~~~~~~gn~SGvAik~---~~~~-l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~ 433 (530) -..--+--++.++.|..+..+..- .... +...|.. +...|.+ .+|-.++..+... +..-..+.|...- T Consensus 309 sk~iLGqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~----i~~tln~--~Li~~l~~lNfg~--~~~~P~~~f~~~e 380 (448) T protein:vir:77 309 ARALGIDFNTVQLNMGVQAVNIGEFVSLTQQTIISLQRE----FASAVNL--YLIPKLVLPNWPG--ATRFPRLTFEMEE 380 (448) T ss_pred HHHHhccccccccccchhhhhhhhHHHHHHHHHHHHHHH----HHHHHHH--HHHHHHHHhcCCC--CCCCCEEEecCCC Confidence 776655555433333222222211 1111 1122222 2333322 1222233333221 2223578887777 Q ss_pred CCCHHHHHHHHHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCc Q lcl|NC_011308. 434 LANELDLAMIDKTEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQ 513 (530) Q Consensus 434 P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (530) |.|.+..|+.+..+++. .. +.. .++. .+ ++ ..++.+..++ T Consensus 381 ~eDl~~~a~~~~~l~~~-------~~---------------------~~~-~ip~--~~------~~---~~~~~~~~~~ 420 (448) T protein:vir:77 381 RNDFSAAANLMGMLINA-------VK---------------------DSE-DIPT--EL------KA---LIDALPSKMR 420 (448) T ss_pred hhhHHHHHHHhHHHHHH-------HH---------------------HHh-cCCc--cC------Cc---CCCCCchhcc Confidence 77777777665444310 00 000 0000 00 00 0111111111 Q ss_pred cCcCC--CCcccccccCCC Q lcl|NC_011308. 514 PEPLN--IDPVIEEEPVQE 530 (530) Q Consensus 514 ~~~~~--~~~~~~~~~~~~ 530 (530) +.+.. .++....+|..- T Consensus 421 ~~~~~~~~~~~~~~~~~~~ 439 (448) T protein:vir:77 421 RALGVVDEVREAVRQPADS 439 (448) T ss_pred cccCCCCCCCchhhcchhh Confidence 11111 111111112211 No 214 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=91.02 E-value=0.018 Score=30.16 Aligned_cols=423 Identities=11% Similarity=0.086 Sum_probs=173.5 Q ss_pred CCcccccCCcccHHHHHHHHHHHH-HHhhhHHHHHHHHH----------HhcccchhhhcccccccccccccccccCCcc Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEY-IRSQNVSLARVGQR----------YYNQDNDIENTRIMWMNDHGDIVEDDNASNI 69 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~-~~~~~~~~~~~~~~----------YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ 69 (530) .+.-+..+.- -+ .++.+ -+....+ +|.+..=|-.-.- . T Consensus 69 ~~~~~~~~~~------------~~~~~~~~--~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~l-----------------a 117 (695) T protein:vir:78 69 LARQFEVDVS------------NYTPRERR--AASYALDFNGTSMDALSFVTSSGFPGFPTL-----------------V 117 (695) T ss_pred cceeceeccc------------cCCccccc--hhhhhhcccccccccchhhhccCcchHHHH-----------------H Confidence 2221111100 00 00000 0100001 1111000000000 0 Q ss_pred ee-ecCchhhHHhhhhhhhcccceeeecC-----------------CcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhc Q lcl|NC_011308. 70 KI-SHGFFAELVDQKTQYLLANGIDVKPT-----------------DHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIK 130 (530) Q Consensus 70 ki-~~n~~k~Ivd~~~~yl~G~pv~~~~~-----------------~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~ 130 (530) -| .++-.+.++...+..+.=+-+..+.. ...+-+..+.|..-++ =+....+.+..+.+-.| T Consensus 118 ~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l~eaik~aRlf 197 (695) T protein:vir:78 118 LLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAF 197 (695) T ss_pred HHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 00 01112222333333332111111100 0011122333443332 25667888999999999 Q ss_pred CeEEEEEEecCCC-----------------ceE-EEEecccceEEEE-cCCCCceeEEEEEEEEeecccccccceEEEEE Q lcl|NC_011308. 131 GYEGIFARTTSED-----------------KLT-FQTVDALQLLPVF-DDYGTLQRIIRFYTEQRYSDADNKFNSIGHAD 191 (530) Q Consensus 131 G~a~~~~y~d~~g-----------------~~~-~~~~~p~~~~~v~-d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~e 191 (530) |.+..++-++.++ .++ +.+++|.++.|-. +-...+.+ .||....|.-. +.. T Consensus 198 GGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~sp--dfgkP~~y~V~---G~k----- 267 (695) T protein:vir:78 198 GRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVAD--DFYKPSTWWMI---GTE----- 267 (695) T ss_pred cceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhhccchhh--ccCCCceEEEe---ceE----- Confidence 9998777665433 111 5677888887722 11111111 12211111000 000 Q ss_pred EEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCC Q lcl|NC_011308. 192 VWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGI 271 (530) Q Consensus 192 vyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~ 271 (530) +|.. ++..+ . +.+.... .+ |. ++-.|. T Consensus 268 -------IH~S----------------------RL~~f---~-------g~plPd~-LK---------p~----y~~~Gi 294 (695) T protein:vir:78 268 -------VHAT----------------------RLHTI---V-------SRPVGDM-LK---------PT----YSFAGI 294 (695) T ss_pred -------Eeee----------------------eEEEe---c-------CCCchhh-hh---------cc----cccCcc Confidence 1100 00000 0 0000000 00 00 122467 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeee---c-CCCCchhhHHH------HHhh-CcceecCCCCceeEEEec Q lcl|NC_011308. 272 SDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVR---G-GTNSPVDEIKK------NIQS-KKIIQTKGEGGLDIQTVD 340 (530) Q Consensus 272 sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~---g-~~~~~~~~~~~------~~~~-~~~i~~~~~~~~~~lt~~ 340 (530) |....+.+-+++++...-..+..+..+.-..+..- . ..+.. .+... ..+. .+++.++ +++=+|.+++ T Consensus 295 Sv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk~dla~~L~~g~~-~~l~~R~eli~~~Rsn~G~~llD-k~~Eefeq~s 372 (695) T protein:vir:78 295 SMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGILMDLAQALMPGAN-VDLSMRAELINRYRDNRNILFLD-KATEEFFQFN 372 (695) T ss_pred cHHHHHHHHHHHHHHHHhHHHHHHHhhhhHHHHHHHHHhhcChhH-HHHHHHHHHHHHhcCccceEEEe-cCCcceEEEe Confidence 77778888888888777666666654443322110 0 01111 11111 1222 2233444 3344677888 Q ss_pred CCHHHHHHHHHHHHHHHHHHhcccCCC-----cccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_011308. 341 IPYEARKAKMDIDELNIYRSGMGFNSS-----AVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRR 415 (530) Q Consensus 341 ~~~~~~e~~ld~L~~~I~~~s~~p~~~-----~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~ 415 (530) .+...+...+.+....|-..+.+|-.. ..+| |+||..=..-|.+.. ....+..++..|++++.+|. ++.. T Consensus 373 tslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGl-NATGE~D~rnYYD~I--~s~Qe~~L~p~L~rl~~ii~--rS~~ 447 (695) T protein:vir:78 373 TPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGL-NASSEGEIRVWYDYV--RAYQRNALQQLMNDVIVMIQ--LSLF 447 (695) T ss_pred cccCCHHHHHHHHHHHHHhhhcCchhhhhccCCccc-cccchhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHH--HHhc Confidence 999999999999999999999999643 3333 678875555555442 34558899999999888873 3333 Q ss_pred CCCccccceeeEEeCCCCCCCHHHHHHHHHH-------HHhcCCCcHHHHHHhC------CCCC--CHHHHHHHHHHHHH Q lcl|NC_011308. 416 GLGDYSSTDIKFDIEPYILANELDLAMIDKT-------EAETNQIQINNLLAIA------PRIG--DEETLKAICDTLDL 480 (530) Q Consensus 416 ~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~-------~~~~g~iS~et~l~~~------~~vd--d~~~e~~~~e~e~~ 480 (530) +. .+. +|++.|++=.--++.|.|++..+ ....|+|+...+...+ +|.. |...+--...+.+. T Consensus 448 G~--idp-di~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~ 524 (695) T protein:vir:78 448 GA--VDP-SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDI 524 (695) T ss_pred CC--CCC-cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchh Confidence 43 344 58899998777777888776433 4456677666555553 2211 10000000000000 Q ss_pred HHHHHHHhhhc-cccccCCccccCCCCCCCCCCccCcC--CCCccccc--ccCCC Q lcl|NC_011308. 481 DYEDVVKALED-QEVEELEPTVTPIIDPLTIEPQPEPL--NIDPVIEE--EPVQE 530 (530) Q Consensus 481 e~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~--~~~~~ 530 (530) + -.+...+. .+.++..+.+++ --..+.+|.=.++ |..+...+ .|.-+ T Consensus 525 ~--~~~~~~~~~~~~~~~~~~~~~-~~g~~~~~~~~~~~~~~~~~~ag~~~~~~~ 576 (695) T protein:vir:78 525 D--GVLTYVQRLAEGGDTGAPGGA-RAGATAPPTVANVNANVKPREAGAQDAAMR 576 (695) T ss_pred h--hhHhhhcCcccccccCCCCCC-CCCCCCCCceeeeeccccccccCCCCcccc Confidence 0 00000000 000011111110 0111222222121 11111111 11101 No 215 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=90.76 E-value=0.019 Score=30.00 Aligned_cols=443 Identities=11% Similarity=0.078 Sum_probs=175.2 Q ss_pred CCccccc--CCcccHHHHHHHH---HHHH-HHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCc-cee-e Q lcl|NC_011308. 1 MTNTLLT--TAPDRLGTILSTK---IDEY-IRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASN-IKI-S 72 (530) Q Consensus 1 ~~~~~~~--~~~~~~~~~i~~~---i~~~-~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n-~ki-~ 72 (530) ||.--.+ -+|-.-..+.+.+ ..-+ .++. +-+ .|=.+-+.+.......+...+.. ..+- .-| . T Consensus 51 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~----~~~~~~~~~~~~~l~~~~~~~F~----Gy~~la~laQ 120 (694) T protein:vir:10 51 LNALDAAPVAEPSPSLRLARQFEVDVSNYTPRER--RAA----SYALDFNGTSMDALSFVTSSGFP----GFPTLVLLAQ 120 (694) T ss_pred chhhcccccCCCCcchhhhhhccccccCCCcccc--chh----hhhhccCcccccchhhhhccCcc----hHHHHHHHhh Confidence 2221111 1111111110000 0000 0000 000 11111010000000000000000 0000 000 0 Q ss_pred cCchhhHHhhhhhhhcccceeeecC-----------------CcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEE Q lcl|NC_011308. 73 HGFFAELVDQKTQYLLANGIDVKPT-----------------DHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEG 134 (530) Q Consensus 73 ~n~~k~Ivd~~~~yl~G~pv~~~~~-----------------~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~ 134 (530) ++-.+.++...+..+.=+-+..+.. ...+-+..+.|..-++ =+....+.+..+.+-.||.+. T Consensus 121 ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eaik~aRlfGGa~ 200 (694) T protein:vir:10 121 LPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAH 200 (694) T ss_pred ccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceE Confidence 1112223333333332211111110 0011122333443332 256678889999999999998 Q ss_pred EEEEecCCC-----------------ceE-EEEecccceEEEE-cCCCCceeEEEEEEEEeecccccccceEEEEEEEcC Q lcl|NC_011308. 135 IFARTTSED-----------------KLT-FQTVDALQLLPVF-DDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTD 195 (530) Q Consensus 135 ~~~y~d~~g-----------------~~~-~~~~~p~~~~~v~-d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~ 195 (530) .++-++.++ .++ +.+++|.++.|-. +-...+.+ .||....|.-. +.. T Consensus 201 ~~i~I~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~sp--dfgkP~~y~V~---G~~--------- 266 (694) T protein:vir:10 201 PYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVAD--DFYKPSTWWMI---GTE--------- 266 (694) T ss_pred EEEEeecCccccccccccccccccCcceeeeEeecccccccchhhhccchhh--ccCCCceEEEe---ceE--------- Confidence 766655433 111 5677888887722 11111111 12211111000 000 Q ss_pred CceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHH Q lcl|NC_011308. 196 TEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIK 275 (530) Q Consensus 196 ~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e 275 (530) +|.. ++..+ . +.+.... .+ |. ++-.|.|... T Consensus 267 ---IH~S----------------------RL~~f---~-------g~plPd~-LK---------p~----y~~~G~Sv~q 297 (694) T protein:vir:10 267 ---VHAT----------------------RLHTI---V-------SRPVGDM-LK---------PT----YSFAGISMTQ 297 (694) T ss_pred ---Eeee----------------------eEEEe---c-------CCCchhh-hh---------cc----cccCcccHHH Confidence 1100 00000 0 0000000 00 00 1224677777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccceeeee---c-CCCCchhhHHH------HHhh-CcceecCCCCceeEEEecCCHH Q lcl|NC_011308. 276 KVKSIIDDYDLMNCFLSNNLQDMAEAIYVVR---G-GTNSPVDEIKK------NIQS-KKIIQTKGEGGLDIQTVDIPYE 344 (530) Q Consensus 276 ~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~---g-~~~~~~~~~~~------~~~~-~~~i~~~~~~~~~~lt~~~~~~ 344 (530) .+.+-+++++...-..+..+..+.-..+..- . ..+.. .+... ..+. .+++.++ +++=+|.+++.+.. T Consensus 298 ~~~e~V~~~~rT~~~v~~Li~~~~v~~lk~dla~~L~~g~~-~~l~~R~eli~~~Rsn~G~~llD-k~~Eefeq~stslS 375 (694) T protein:vir:10 298 LAMPYIDNWLRTRQSVSDIVKQFSVSGILMDLAQALMPGAN-VDLSMRAELINRYRDNRNILFLD-KATEEFFQFNTPLS 375 (694) T ss_pred HHHHHHHHHHHHHhHHHHHHHhhhhHHHHHHHHHhhcChhH-HHHHHHHHHHHHhcCccceEEEe-cCCcceEEEecccC Confidence 7888888887776666666654443322110 0 01111 11111 1222 2233444 33446778889999 Q ss_pred HHHHHHHHHHHHHHHHhcccCCC-----cccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Q lcl|NC_011308. 345 ARKAKMDIDELNIYRSGMGFNSS-----AVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGD 419 (530) Q Consensus 345 ~~e~~ld~L~~~I~~~s~~p~~~-----~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~ 419 (530) ++...+.+....|-..+.+|-.. ..+| |+||..=..-|.+.. ....+..++..|++++.+|. ++..+. T Consensus 376 GLddVi~qf~q~VAgaa~IPltkLfGqSPkGl-NATGE~D~rnYYD~I--~s~Qe~~L~p~L~rl~~ii~--rS~~G~-- 448 (694) T protein:vir:10 376 GLDALQAQAQEQMSAVSHIPLIKLLGITPTGL-NASSEGEIRVWYDYV--RAYQRNALQQLMNDVIVMIQ--LSLFGA-- 448 (694) T ss_pred CHHHHHHHHHHHHHhhhcCchhhhhccCcccc-cccchhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHH--HHhcCC-- Confidence 99999999999999999999643 3333 678875555555442 34558899999999888873 333343 Q ss_pred cccceeeEEeCCCCCCCHHHHHHHHHH-------HHhcCCCcHHHHHHhC------CCCC--CHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 420 YSSTDIKFDIEPYILANELDLAMIDKT-------EAETNQIQINNLLAIA------PRIG--DEETLKAICDTLDLDYED 484 (530) Q Consensus 420 ~d~~~i~i~f~~~~P~n~~e~a~~~~~-------~~~~g~iS~et~l~~~------~~vd--d~~~e~~~~e~e~~e~~~ 484 (530) .+. +|++.|++=.--++.|.|++..+ ....|+|+...+...+ +|.. |...+--...+.+.+ - T Consensus 449 idp-~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~--~ 525 (694) T protein:vir:10 449 VDP-SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDID--G 525 (694) T ss_pred CCC-cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhh--h Confidence 344 68899998777777888776433 4456777666655553 2211 100000000000000 0 Q ss_pred HHHhhhc-cccccCCccc---cCCCCCCCCCCccCcCCCCccccc--ccCCC Q lcl|NC_011308. 485 VVKALED-QEVEELEPTV---TPIIDPLTIEPQPEPLNIDPVIEE--EPVQE 530 (530) Q Consensus 485 ~~~~~~~-~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~ 530 (530) .+...+. .+.++..+.+ .+.+ ++.+..+.+.+.++...+ .|.-+ T Consensus 526 ~~~~~~~~~~~~~~~~~~~~~~g~~--~~~~v~~~~~~~~~~~ag~~~~~~~ 575 (694) T protein:vir:10 526 VLTYVQRLAEGGDTGAPGGARAGAT--APPTVANVNANVNPREAGAQDAAMR 575 (694) T ss_pred hHhhhcCcccccccCCCCccccccc--CCCcccccccccCccccCCCCccce Confidence 0000000 0000000000 1100 011111222222221111 11101 No 216 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=90.05 E-value=0.023 Score=29.58 Aligned_cols=372 Identities=10% Similarity=0.003 Sum_probs=158.7 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |-+-..+...+. .......+ +....-+.... .+ ..+ .+.+=+.+.-....|+..++=+.+- T Consensus 1 MGl~~~~~~~~~--~~~~~~~~---~~~~~~~~~~~-------~~---~~v----t~~~al~~~~v~~~i~~Ia~~iA~l 61 (394) T protein:vir:62 1 MGLRDRFSNYLF--KKAEKRGY---LDNVLGKSIRY-------SG---VYV----TDSNILQSSDVYELLQDISNQMVLA 61 (394) T ss_pred Cchhhhhhhhcc--CCCCchhh---hhhhhhccccc-------Cc---ccc----ChhhhhccHHHHHHHHHHHHhhccc Confidence 554444332211 11111111 11111111100 00 000 0000012233556677777777788 Q ss_pred ceeeecCCcchHHHHHHHHHHhh--cc---HHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEcCCCCc Q lcl|NC_011308. 91 GIDVKPTDHDDQKLCYLIEEYYN--EE---FQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFDDYGTL 165 (530) Q Consensus 91 pv~~~~~~~~de~~~~~l~~~~~--~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~~~~ 165 (530) |+.+...+.+ +.....+..++. |. .......+..+...+|.+|.++-.+..+ . |..+.|..++.+. T Consensus 62 p~~v~~~~g~-~~~~~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~-----~--~~~~~~~~~~~~~- 132 (394) T protein:vir:62 62 DIVVEDEFGN-EIKDDIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIH-----L--ASNVFTELDDNLV- 132 (394) T ss_pred ceEEEcCCCc-ccchhhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecceee-----c--cccceEEECCceE- Confidence 8876543321 111111222221 22 2234456777888999998765322111 1 2234444443210 Q ss_pred eeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccccc Q lcl|NC_011308. 166 QRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEH 245 (530) Q Consensus 166 ~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 245 (530) .+|.. .. ..|..+.+.|++... T Consensus 133 ----~~~~~--------~~------~~~~~~eiih~r~~~---------------------------------------- 154 (394) T protein:vir:62 133 ----EHFNI--------GG------HEIPPCMIRHVKNIG---------------------------------------- 154 (394) T ss_pred ----EEEee--------CC------EEechhheEEecCcC---------------------------------------- Confidence 01100 00 012333333322100 Q ss_pred ccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeee--cCCC-Cch--hhHHHH- Q lcl|NC_011308. 246 EGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVR--GGTN-SPV--DEIKKN- 319 (530) Q Consensus 246 ~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~--g~~~-~~~--~~~~~~- 319 (530) + +.-.|.|.++.....|+....+..-..+.+.-.+.|-.+++ +... ++. +.++.. T Consensus 155 ----------~---------d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~ 215 (394) T protein:vir:62 155 ----------A---------DHLRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAI 215 (394) T ss_pred ----------C---------CCccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHH Confidence 0 01136666666655555555554444444555455655543 3221 111 122221 Q ss_pred ---H----hhCcceecCCCCceeEEEecCC--HHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHH Q lcl|NC_011308. 320 ---I----QSKKIIQTKGEGGLDIQTVDIP--YEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKA 390 (530) Q Consensus 320 ---~----~~~~~i~~~~~~~~~~lt~~~~--~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka 390 (530) . ..++++.++.+.+.++.....+ +.......+.....|...-.+|.. -+|..++..+ . T Consensus 216 ~~~~~g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~---~lg~~~~sn~----------e 282 (394) T protein:vir:62 216 LDQLESIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVD---TYTELIKEDI----------E 282 (394) T ss_pred HHHhccccccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHH---HcCCCCCcCH----------H Confidence 1 1234556677777877665543 222334456667777777777742 2222221111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCH Q lcl|NC_011308. 391 QKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDE 468 (530) Q Consensus 391 ~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~ 468 (530) .....++...|.-+++.|...++.+-...-....+.+.|+.....+..+.++....+..+|+++...+++.+++ ++++ T Consensus 283 ~~~~~~~~~~l~P~~~~ie~~l~~kll~~~~~~~~~~~fd~~~~~~~~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~ 362 (394) T protein:vir:62 283 KAMMYIHNKAVRPIMKNFEDHLSLLFYAQNSGKRIKFKINILDFVTYSNKTNIGYNLVRTAITSPDNVADMLGFPKQNTK 362 (394) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhcCccccCceEEEechhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 12233445555555555555555332222223457888887777777777888888999999999999888754 3332 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCC Q lcl|NC_011308. 469 ETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNID 520 (530) Q Consensus 469 ~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (530) ....-.+. ..+..+...........+. +.|++ T Consensus 363 ~gd~~~~~-------~n~~~~~~~~~~~~~~kgg-------------e~~en 394 (394) T protein:vir:62 363 ESQAIYIS-------NDVTEIGKKEATDGSLGGG-------------EENEN 394 (394) T ss_pred CCCeeecc-------cccccccccccccccCCCC-------------CCCCC Confidence 21100000 0011111111000000000 00111 No 217 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=90.02 E-value=0.023 Score=29.55 Aligned_cols=448 Identities=9% Similarity=0.008 Sum_probs=159.8 Q ss_pred HHHHHHhhhHHHHHH--------------------HHHHhcccchhhhcccccccccccccccccCCc----ceeecCch Q lcl|NC_011308. 21 IDEYIRSQNVSLARV--------------------GQRYYNQDNDIENTRIMWMNDHGDIVEDDNASN----IKISHGFF 76 (530) Q Consensus 21 i~~~~~~~~~~~~~~--------------------~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n----~ki~~n~~ 76 (530) +...+.+-+ ...-+ -.++|.++. + .+....|. ..-..+++ T Consensus 1 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--------------~~p~~~~~~L~~~~e~~~~~ 64 (651) T protein:vir:99 1 MTDTTGETQ-ETKVHVEGLGGEADLAKSPNSTQIPDHRIQSHNV-G--------------VNPPYNPDRLAAFLELNETL 64 (651) T ss_pred CCCccceee-eeEEEeecccccccccccccccccchhhhcccCC-C--------------CCCCCCHHHHHHHHhcChHH Confidence 111000000 00000 001111110 0 00111111 11136788 Q ss_pred hhHHhhhhhhhcccceeeec-----CCcchHHHHHHHHHHhh----------------ccHHHHHHHHHHHHhhcCeEEE Q lcl|NC_011308. 77 AELVDQKTQYLLANGIDVKP-----TDHDDQKLCYLIEEYYN----------------EEFQSAIQELVEGSTIKGYEGI 135 (530) Q Consensus 77 k~Ivd~~~~yl~G~pv~~~~-----~~~~de~~~~~l~~~~~----------------~~~~~~~~e~~~~~~~~G~a~~ 135 (530) ...|+..+..+.|-++.+.. .+..+.+..+....++. ..+..++..+..+...+|.+|. T Consensus 65 ~~~i~~~~~~iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~i 144 (651) T protein:vir:99 65 ATGIRKKSRYEVGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLAL 144 (651) T ss_pred HHHHHHHhhhhhccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhh Confidence 88999999999998876543 11223333334434331 2344566666777778898887 Q ss_pred EEEecCCCce-EEEEecccceEEEEcCCCC----ceeEEEEEEEEeecccc-------cccceEEEEEEEcCCc--eEEE Q lcl|NC_011308. 136 FARTTSEDKL-TFQTVDALQLLPVFDDYGT----LQRIIRFYTEQRYSDAD-------NKFNSIGHADVWTDTE--VWYY 201 (530) Q Consensus 136 ~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~----~~~~~~~y~~~~~~~~~-------~~~~~~~~~evyt~~~--~~~y 201 (530) -+..+..|.+ ++..++|..+- +...... +..++... ........ .......++.+|-... +... T Consensus 145 eiIrn~~g~pv~L~~lp~~~~R-v~~~~~~~~~~~~~ll~~~-pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~ 222 (651) T protein:vir:99 145 EMLTDIEGRPVGLAYVPARTVR-VRRPQNRFDQPRHPEEGRY-VDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVV 222 (651) T ss_pred hhhhcCccchhhhhhcChhhee-eecccccccchhhhhhhcc-cccccchhHHHHHHHHHhcCcceEEEeeccccceeee Confidence 6766665542 22223333221 1111000 00000000 00000000 0000000111111000 0000 Q ss_pred eecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccc---eEEeeCCc-----CCCCc Q lcl|NC_011308. 202 VQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFP---FDILYNNK-----LGISD 273 (530) Q Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iP---iv~~~nn~-----~~~sd 273 (530) . ...+.......................++. ...........+| |++|++.. .|.|. T Consensus 223 ~-~~~~~~v~~~~~~d~~~~~~~~~~~~~~g~--------------~~~~~~~~~~~~~~~eViHir~~~~~~g~~G~sp 287 (651) T protein:vir:99 223 I-DESGDEPTIRYREDEESEREPIFVDRETGD--------------VTTGDANGLENRPANELIFIPNPSILEDDYGVPD 287 (651) T ss_pred e-ccCCcceeEEeccCcceeeeeecccceeee--------------EEEcCCCceeEecccceEEecCCCCCCCcccccH Confidence 0 000000000000000000000000000000 0000001111122 56666432 46776 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccceeeee--cCCCCc--hhhHHHHHh-----hCcceecCC---------CCcee Q lcl|NC_011308. 274 IKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVR--GGTNSP--VDEIKKNIQ-----SKKIIQTKG---------EGGLD 335 (530) Q Consensus 274 ~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~--g~~~~~--~~~~~~~~~-----~~~~i~~~~---------~~~~~ 335 (530) +......|+....+..-..+.+.-.+.|-.+|+ |...++ .+.++..++ .++++.++. +.+++ T Consensus 288 l~~a~~~i~~a~~a~~~~~~~f~NG~~p~gil~~~~~~ls~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~ 367 (651) T protein:vir:99 288 WVSAIRTISADEAAKDYNRDFFDNDTIPRMVIKVTGGELSEESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIE 367 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHHhccCCceEEeecccccccccccCCce Confidence 665555554444333334444444445656654 432222 122222221 234444443 23566 Q ss_pred EEEecCC---HHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 336 IQTVDIP---YEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVE 410 (530) Q Consensus 336 ~lt~~~~---~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~ 410 (530) |...... +..+....+.....|...-.+|. ++...-++-|++.- .....+...|+-+++.|.. T Consensus 368 ~~pls~~~~~D~qfle~r~~~~~eIa~afgVPp~~lG~~~~~~~sn~E~------------~~~~f~~~tL~P~~~~ie~ 435 (651) T protein:vir:99 368 LEPMGQGISEEMDFRQFREKNEHEIAKVLEVPPVKIGVTDSANRSNSDQ------------QDKDFALEVIQPEQHTFAE 435 (651) T ss_pred EEEcCcCchhhHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCcccHHH------------HHHHHHHHHHHHHHHHHHH Confidence 6655432 33344455667777877777874 22221122121110 0112233344444444444 Q ss_pred HHHhcCCCc---cccceeeEEeCC--CCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHHHHHHHHHHHHHHH Q lcl|NC_011308. 411 DIRRRGLGD---YSSTDIKFDIEP--YILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEETLKAICDTLDLDYE 483 (530) Q Consensus 411 ~l~~~~~~~---~d~~~i~i~f~~--~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~e~~~~e~e~~e~~ 483 (530) .++.+-... ..-..+.+.|.. -+-.|....++....+..+|+++...+++.+++ ++++... T Consensus 436 eln~kLl~~~e~~~~~~i~~ef~~~~llr~D~~~~~e~~~~~i~~G~~T~NE~R~~lglppi~~~~gd------------ 503 (651) T protein:vir:99 436 WLYQIIHQQALGVTDWTIEYELRGADQPKQEAQLAEQRVRAMRLAGVGLVDEAREELGLDPLGEPYGE------------ 503 (651) T ss_pred HHHHhhcCccccccCceEEEEeccchhhhccHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccc------------ Confidence 443321111 111134455542 344677788888888999999999999988754 4432100 Q ss_pred HHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcc----------cccccCCC Q lcl|NC_011308. 484 DVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPV----------IEEEPVQE 530 (530) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~ 530 (530) ..+...+... ..+....+. +....++++.+..+. +..+|... T Consensus 504 ~~l~~~~~~~--~g~~~~gge---~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~~ 555 (651) T protein:vir:99 504 MTLSEFEAEV--AGDVAGGGE---TEAVHEPPEENKIGEREWDTVKSELTTKDPIEQ 555 (651) T ss_pred cccccccccc--ccccccCCC---CcccccCccccccccchhhhhhhhhcccchhhh Confidence 0000000000 000000000 001111111111111 11222222 No 218 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=89.07 E-value=0.029 Score=29.05 Aligned_cols=421 Identities=10% Similarity=0.056 Sum_probs=170.0 Q ss_pred CCcccccCCcccHHHHHHHHHHHH-HHhhhHHHHHHHHH----------HhcccchhhhcccccccccccccccccCCcc Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEY-IRSQNVSLARVGQR----------YYNQDNDIENTRIMWMNDHGDIVEDDNASNI 69 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~-~~~~~~~~~~~~~~----------YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ 69 (530) .+.-+..+.- -+ .++. +-+....+ +|.+..=|-.-. . . T Consensus 69 ~~~~~~~~~~------------~~~~~~~--~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~---------l--------a 117 (698) T protein:vir:10 69 LARQFEVDVS------------NYTPRER--RAASYALDFNGTSMDALSFVTSSGFPGFPT---------L--------V 117 (698) T ss_pred ccccceeccc------------cCCcccc--chhhhhhcccccccccchhhhccCcchHHH---------H--------H Confidence 2221111100 00 0000 00110001 111100000000 0 0 Q ss_pred ee-ecCchhhHHhhhhhhhcccceeeecC-----------------CcchHHHHHHHHHHhh-ccHHHHHHHHHHHHhhc Q lcl|NC_011308. 70 KI-SHGFFAELVDQKTQYLLANGIDVKPT-----------------DHDDQKLCYLIEEYYN-EEFQSAIQELVEGSTIK 130 (530) Q Consensus 70 ki-~~n~~k~Ivd~~~~yl~G~pv~~~~~-----------------~~~de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~ 130 (530) -| .++-.+.++...+..+.=+-+..+.. ...+-+..+.|..-++ =+....+.+..+.+-.| T Consensus 118 ~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eai~~aRlf 197 (698) T protein:vir:10 118 LLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAF 197 (698) T ss_pred HHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 00 01112222333333332111111100 0011122333443332 25667888999999999 Q ss_pred CeEEEEEEecCCC-----------------ceE-EEEecccceEEEE-cCCCCceeEEEEEEEEeecccccccceEEEEE Q lcl|NC_011308. 131 GYEGIFARTTSED-----------------KLT-FQTVDALQLLPVF-DDYGTLQRIIRFYTEQRYSDADNKFNSIGHAD 191 (530) Q Consensus 131 G~a~~~~y~d~~g-----------------~~~-~~~~~p~~~~~v~-d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~e 191 (530) |.+..++-++.++ .++ +.+++|.++.|-. +....+.+ .||....|. T Consensus 198 GGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~~~n~~dP~sp--dfgkP~~y~------------- 262 (698) T protein:vir:10 198 GRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVAD--DFYKPSTWW------------- 262 (698) T ss_pred cceEEEEEeecCccccccccccccccccCccceeeeeecccccccchhhhccchhh--ccCCCceEE------------- Confidence 9998666654433 111 5667777777621 11111111 111111100 Q ss_pred EEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEE-eeCCcCC Q lcl|NC_011308. 192 VWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDI-LYNNKLG 270 (530) Q Consensus 192 vyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~-~~nn~~~ 270 (530) |-.. . +|. +++..+. + ..+|-+. -.++-.| T Consensus 263 V~G~-~-IH~----------------------SRL~~~v----------g---------------~pvpd~LKp~y~f~G 293 (698) T protein:vir:10 263 MIGS-E-VHA----------------------TRLHTIV----------S---------------RPVGDMLKPTYSFAG 293 (698) T ss_pred Eecc-e-ecc----------------------eeEEEec----------C---------------CCchhhhcchhccCC Confidence 0000 0 000 0000000 0 0011100 0012246 Q ss_pred CCcHHHHHHHHHHHHHHHHHHHHHHHHhccceee------eecCCCCchhhHHH------HHhh-CcceecCCCCceeEE Q lcl|NC_011308. 271 ISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYV------VRGGTNSPVDEIKK------NIQS-KKIIQTKGEGGLDIQ 337 (530) Q Consensus 271 ~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lv------l~g~~~~~~~~~~~------~~~~-~~~i~~~~~~~~~~l 337 (530) .|..+.+.+-+++++...-..+..+..+.-..+. +.+ ++ . .+... ..+. .+++.++ +++=+|. T Consensus 294 ~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~~~~l~~dla~aL~~-g~-~-~~l~~R~eli~~~Rsn~G~~llD-k~~Eefe 369 (698) T protein:vir:10 294 ISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGILMDLAQALTP-GA-N-VDLSMRAELINRYRDNRNILFLD-KATEEFF 369 (698) T ss_pred ccHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHHHHHHHhcCC-hh-h-HHHHHHHHHHHHhcCccceEEEe-cCCcceE Confidence 7777888888888877766666665444433321 111 11 1 11111 1222 2333444 3444677 Q ss_pred EecCCHHHHHHHHHHHHHHHHHHhcccCCC-----cccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 338 TVDIPYEARKAKMDIDELNIYRSGMGFNSS-----AVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDI 412 (530) Q Consensus 338 t~~~~~~~~e~~ld~L~~~I~~~s~~p~~~-----~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l 412 (530) +++.+...+...+.+....|--.+.+|-.- ..+| |+||..=..-|.+.. ....+..++..|++++.+|.. T Consensus 370 q~st~lSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGl-NATGE~D~rnYYD~I--~s~Qe~~L~p~L~rl~~ii~r-- 444 (698) T protein:vir:10 370 QFNTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGL-NASSEGEIRVWYDYV--RAYQRNALQQLMNDVIVMIQL-- 444 (698) T ss_pred EEecCcCCHHHHHHHHHHHHHhhhcCchhhhhccCCccc-CccchhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHH-- Confidence 888999999999999999999999999643 3333 678875555555442 345588999999998888733 Q ss_pred HhcCCCccccceeeEEeCCCCCCCHHHHHHHHHH-------HHhcCCCcHHHHHHhC------CCC--CCHHHHHHHHHH Q lcl|NC_011308. 413 RRRGLGDYSSTDIKFDIEPYILANELDLAMIDKT-------EAETNQIQINNLLAIA------PRI--GDEETLKAICDT 477 (530) Q Consensus 413 ~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~-------~~~~g~iS~et~l~~~------~~v--dd~~~e~~~~e~ 477 (530) +..+. .+. +|++.|++=.--++.|.|++..+ ....|+|+...+...| +|. -|+..+--.- T Consensus 445 S~~G~--idp-~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~d~~d~p~~~-- 519 (698) T protein:vir:10 445 SLFGA--VDP-SIKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGAP-- 519 (698) T ss_pred HhcCC--CCC-cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCccccccCCcccCCCC-- Confidence 33343 344 58999998777788888877544 3345666555544443 121 0100000000 Q ss_pred HHHHHHHHHHhhhccccccC--CccccCCCC---CCCCCCccCcCCCCcccc-cccCCC Q lcl|NC_011308. 478 LDLDYEDVVKALEDQEVEEL--EPTVTPIID---PLTIEPQPEPLNIDPVIE-EEPVQE 530 (530) Q Consensus 478 e~~e~~~~~~~~~~~~~~~~--~~~~~~~~~---~~~~~~~~~~~~~~~~~~-~~~~~~ 530 (530) ++...+......+...+.+. .+......- ..+...-..++|.+|... .++.+- T Consensus 520 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 578 (698) T protein:vir:10 520 ADDDIDGVLTYVQRMAEGGDTGAPTAPGGARAGATAPPAAANVNANANPREAGAQDAAM 578 (698) T ss_pred CCCcchHHHhhhcCCcCCCCcccccccccccCCCCCCcccccccCCCCccccCccccee Confidence 00000001111100000000 000000000 000011111112222111 111111 No 219 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=88.86 E-value=0.03 Score=28.95 Aligned_cols=367 Identities=11% Similarity=0.013 Sum_probs=137.8 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |-+-+.+..+..+ ..+... ....- ....... . ....-+.+.-....|+..++=+.+- T Consensus 1 Mg~~~~~~~~~~~---~~~~~~---~~~~~--~~~~~~~-------------~--~~~~~l~~~~v~~~v~~Ia~~ia~~ 57 (395) T protein:vir:40 1 MGFKSWVSGFFNE---EQRTLN---LTDTV--WCSIPSE-------------K--LKELSIKKWAIDSCANKIANTLSCA 57 (395) T ss_pred CchHHHHHhhhcc---cccccc---cccch--hhccccc-------------c--chhhhhhhHHHHHHHHHHHHHHhhC Confidence 6554444443221 111000 00000 0000000 0 0000011122344566666666666 Q ss_pred ceeeecCCcc-hHHHHHHHHHHhhccH---HHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEcCCCCce Q lcl|NC_011308. 91 GIDVKPTDHD-DQKLCYLIEEYYNEEF---QSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFDDYGTLQ 166 (530) Q Consensus 91 pv~~~~~~~~-de~~~~~l~~~~~~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~~~~~ 166 (530) |+.+...+.. .......|+. --|.. ......+......+|.||.++..+.. ..|..+.+....... T Consensus 58 p~~~~~~~~~~~~~~~~lL~~-~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~~-------~~~~~~~~~~~~~~~-- 127 (395) T protein:vir:40 58 EVLTYEKGEEVRKKNWYMFNV-EANQNQNATEFWKKAIYKLVYDNEALIFMQDEYI-------YVADSFTKNDKSLYE-- 127 (395) T ss_pred ceeeccCCccccchHHHHHHh-cCCCCCCHHHHHHHHHHHHhhcCceEEEEecCce-------eecCCcccccccccc-- Confidence 7765332211 1122222221 01222 23345567788889999966544321 011111000000000 Q ss_pred eEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccc Q lcl|NC_011308. 167 RIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHE 246 (530) Q Consensus 167 ~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 246 (530) .+++.+.. . . ..+-..|..+.+.|+ T Consensus 128 --~~~~~v~~-~-----~--~~~~~~~~~~evih~--------------------------------------------- 152 (395) T protein:vir:40 128 --NTYTEVTL-K-----D--LTLKKEFKESEVLHL--------------------------------------------- 152 (395) T ss_pred --ceeeeeee-c-----C--ceeeeeeccccEEEe--------------------------------------------- Confidence 00000000 0 0 000011223333332 Q ss_pred cccccccccCCccceEEeeCCc-CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhc--cceeeeecCCC-Cch--hhHHHHH Q lcl|NC_011308. 247 GRQVLGRSYKSRFPFDILYNNK-LGISDIKKVKSIIDDYDLMNCFLSNNLQDMA--EAIYVVRGGTN-SPV--DEIKKNI 320 (530) Q Consensus 247 ~~~~~~~~~~~~iPiv~~~nn~-~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~--~~~lvl~g~~~-~~~--~~~~~~~ 320 (530) +.+. .+.+. +.++...+...++...+...... .+.++++.... ++. ...+..+ T Consensus 153 ------------------r~~~~~~~~~---~~~l~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 211 (395) T protein:vir:40 153 ------------------TLNNESIKSI---IDGFYLLYGDLLTAAVNKYKKLNSRKIIVKLKAMFGQTPEAEEKLRLML 211 (395) T ss_pred ------------------ecCCCCcccc---chhHHHHHHHHHHHHHHHHHhcCCCCceEEEecccCCCHHHHHHHHHHH Confidence 2211 11111 22333444444444444333322 45666643322 211 1111111 Q ss_pred ---------hhCcceecCCCCceeEEEecCCHH-HHH--HHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhhHH Q lcl|NC_011308. 321 ---------QSKKIIQTKGEGGLDIQTVDIPYE-ARK--AKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTLLA 387 (530) Q Consensus 321 ---------~~~~~i~~~~~~~~~~lt~~~~~~-~~e--~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~l~ 387 (530) ..++++.+++|.+++-+.....+. ..+ .+.+.+.+.|...=.+|. .-+ |+-|++ T Consensus 212 ~~~~~~~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp---~~l~~~~sn~---------- 278 (395) T protein:vir:40 212 SERMKKFLAEGDSALPVEDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPL---GLAKGDTVGL---------- 278 (395) T ss_pred HHHHHHhhccCCceeecCCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCH---HHhcCCCcCH---------- Confidence 123355566666555554332222 221 112233445655656663 222 222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--ccc-cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC Q lcl|NC_011308. 388 MKAQKTEIALRKTLRWTADLVVEDIRRRGLG--DYS-STDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR 464 (530) Q Consensus 388 ~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~--~~d-~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~ 464 (530) ......+++..|.-.++.|..-++.+-.. ++. ...+.+.+..-+-.|..+.++....+..+|+++.-.+++.+++ T Consensus 279 --e~~~~~f~~~~L~P~~~~ie~~l~~kLl~~~~~~~g~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~ 356 (395) T protein:vir:40 279 --SEQVNSFLMFSINPIAEMFTDEGNRKFYGRDSVLERTYMKLDTTRIKVQDIQEIASSMDVLFHIGVNTIDDNLRMIGR 356 (395) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhcCCceEEEechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCC Confidence 11223455666666666666655543222 111 1345666666667788888888888999999999999988754 Q ss_pred --CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCC Q lcl|NC_011308. 465 --IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEP 512 (530) Q Consensus 465 --vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (530) ++++....-.+- ..+..+... ++.. .++..+++..+. T Consensus 357 ~pi~~~~gD~~~~~-------~n~~~~~~~--~~~~--kgge~~~~~~~~ 395 (395) T protein:vir:40 357 EPVMSPETQERFVT-------KNYAPLGEN--EEDL--KGGDINENKGDS 395 (395) T ss_pred CCCCCCCCceeeec-------ccccccccc--cccc--CCCCCCCCcCCC Confidence 333321100000 000000000 0000 000000000111 No 220 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=88.27 E-value=0.033 Score=28.68 Aligned_cols=435 Identities=10% Similarity=0.064 Sum_probs=152.7 Q ss_pred CCcccccCCcccHHH-----------------------HHHHH---HHHHH-HhhhHHHHHHHHHHhcccchhhhccccc Q lcl|NC_011308. 1 MTNTLLTTAPDRLGT-----------------------ILSTK---IDEYI-RSQNVSLARVGQRYYNQDNDIENTRIMW 53 (530) Q Consensus 1 ~~~~~~~~~~~~~~~-----------------------~i~~~---i~~~~-~~~~~~~~~~~~~YY~g~~~I~~r~~~~ 53 (530) |.+-|+.=+...... ..... -..+. ..+.+++|+.+-.+++-...|.. T Consensus 1 ~~~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~g~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~e----- 75 (537) T protein:vir:10 1 MAQQLFGFSLQRAKKVPKGPSFVQKDSLDGSQPIVGGGYFGYSVDFDGTIRNDHELITRYREMVLNPECDSAVDD----- 75 (537) T ss_pred CccccccceeecccccccCCcccCCCcccccceeecccccccccccccccchHHHHHHHHHHHhhccchhhHHHH----- Confidence 111111100000000 00000 00000 12334556666666555553321 Q ss_pred ccccccccccccCCcceeecCchhhHHhhhh-hhhcccceeeecCCc-----chHHHHHHHHHHhh-ccHHHHHHHHHHH Q lcl|NC_011308. 54 MNDHGDIVEDDNASNIKISHGFFAELVDQKT-QYLLANGIDVKPTDH-----DDQKLCYLIEEYYN-EEFQSAIQELVEG 126 (530) Q Consensus 54 ~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~-~yl~G~pv~~~~~~~-----~de~~~~~l~~~~~-~~~~~~~~e~~~~ 126 (530) ||+-.+ .=....||.+...+- -.+.+.+.++.+++ =+|+...++..+. T Consensus 76 -------------------------IVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~ 130 (537) T protein:vir:10 76 -------------------------VVNETICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRR 130 (537) T ss_pred -------------------------hhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhh Confidence 111111 112244555433221 12335555555553 3788899999999 Q ss_pred HhhcCeEEEEEEecC----CCceEEEEecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEe Q lcl|NC_011308. 127 STIKGYEGIFARTTS----EDKLTFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYV 202 (530) Q Consensus 127 ~~~~G~a~~~~y~d~----~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~ 202 (530) +-+.|+-|.|..+|. +|-.....+||..+..|.--..+....+++... .........-+-+|.+.+.+ + T Consensus 131 WYVDgRi~fhKiid~k~pk~GI~ELr~lDPr~i~~vR~i~~~~~~~~~~~~~-----~~~v~~~~~eyf~ynp~g~~-~- 203 (537) T protein:vir:10 131 WYVDGRLFFHKVIDPKKPRQGLVELRYVDPRKIRKVTEYEAKRPEALRTQDL-----NQQLTQQSASYFLYNPKGLK-N- 203 (537) T ss_pred heeeeEEEEEEEEeCCCccccceeeeeeCCccceeeEeecccCCccceEEec-----ceeeeecccceeeecccccc-c- Confidence 999999999999874 366778889999886654211111111111100 00000000111223332211 0 Q ss_pred ecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHH Q lcl|NC_011308. 203 QKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIID 282 (530) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liD 282 (530) +....+..+. ....+++ . |.++. |+....|-++..+.... T Consensus 204 ----~~~~~vkI~~-----dAI~y~h---------S------------------Gl~d~----n~~~i~syLhkAiKp~N 243 (537) T protein:vir:10 204 ----STNQGMKIAP-----DSIAYCH---------S------------------GIQDL----NKNMVLSHLHKAIKAVN 243 (537) T ss_pred ----cCCCceeccH-----hheeeec---------c------------------cceeC----CCCeeeeeehhhhHHHH Confidence 0000000000 0000000 0 00000 11122233333222211 Q ss_pred HHHHHHHHH-------------------------------HHHHHHhccceeeeecC-CC--CchhhHHHHHhhCcceec Q lcl|NC_011308. 283 DYDLMNCFL-------------------------------SNNLQDMAEAIYVVRGG-TN--SPVDEIKKNIQSKKIIQT 328 (530) Q Consensus 283 a~~~~~S~~-------------------------------~n~~~~~~~~~lvl~g~-~~--~~~~~~~~~~~~~~~i~~ 328 (530) .+- ++-|. .+.+..|.+- || ... .+ .+.-.++.-+....+--= T Consensus 244 QLk-m~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNk-lV-YDa~TGev~ddrk~msMlEDyWLPRR 320 (537) T protein:vir:10 244 QLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNK-LV-YDANTGEIKDDKKFMSMLEDFWLPRR 320 (537) T ss_pred hhH-HHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccce-EE-EeccCceecccchhhhhhhhhccccc Confidence 111 11111 1111222221 11 110 00 011111111111111111 Q ss_pred CCCCceeEEEec--CCHHHHHHHHHHHHHHHHHHhcccC--CCccc---ccCCcHHHHH-HHHhhHHHHHHHHHHHHHHH Q lcl|NC_011308. 329 KGEGGLDIQTVD--IPYEARKAKMDIDELNIYRSGMGFN--SSAVG---DGNATNVVIK-SRYTLLAMKAQKTEIALRKT 400 (530) Q Consensus 329 ~~~~~~~~lt~~--~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~---~gn~SGvAik-~~~~~l~~ka~~ke~~f~~~ 400 (530) +++.+-+.-|-+ .+.. .-.-+.=.++.+|..-.+|- +..++ +|.+|.++.. ++|. --+.+.+..|..- T Consensus 321 eGgrgTEItTLpGgqnlg-em~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~---KFI~RLR~rFs~l 396 (537) T protein:vir:10 321 EGGRGTEISTLPGGQNLG-ELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQ---KFIARLRKRFSEL 396 (537) T ss_pred CCCcccceeeccccCCcC-hHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHH---HHHHHHHHHHHHH Confidence 122222222323 2222 12234455667777778883 33332 2444433221 1111 1234444444444 Q ss_pred HHHHHHHHHHHHHhcCCCcccc--ceeeEEeCCCCCCCHHHHHHHHHHH---------HhcCCCcHHHHHHhCCCCCCHH Q lcl|NC_011308. 401 LRWTADLVVEDIRRRGLGDYSS--TDIKFDIEPYILANELDLAMIDKTE---------AETNQIQINNLLAIAPRIGDEE 469 (530) Q Consensus 401 l~~~~~~i~~~l~~~~~~~~d~--~~i~i~f~~~~P~n~~e~a~~~~~~---------~~~g~iS~et~l~~~~~vdd~~ 469 (530) |.++|+.=+-+=++....+|+- ..|.+.|.+.---.+...++++... .-+..+|.+++.+.+=..+|.+ T Consensus 397 F~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDee 476 (537) T protein:vir:10 397 FVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTESE 476 (537) T ss_pred HHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCHHH Confidence 4444332111111223345554 4677888776554444444443221 1122469999998854444433 Q ss_pred HHHHHHHHHHHHHHHHHHh--hhcccccc----CCccccCCCCCCCCCCccCcCCCCcccccccCC-C Q lcl|NC_011308. 470 TLKAICDTLDLDYEDVVKA--LEDQEVEE----LEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQ-E 530 (530) Q Consensus 470 ~e~~~~e~e~~e~~~~~~~--~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 530 (530) +++++++-+++++. +.+.++.. ..+++++.+.... .| -.+|.....|+- | T Consensus 477 -----I~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~-~~-----~~~~~~~~~~~~~~ 533 (537) T protein:vir:10 477 -----IKEIDKEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGE-EP-----QTDPNSAVSPADQK 533 (537) T ss_pred -----HHHHHHHHHHHhhCCCCCCcccccccccCCCCcccCCCCCC-Cc-----ccCCccCCCCCCcc Confidence 22233222222222 22111111 1122222221111 11 111111122222 2 No 221 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=88.00 E-value=0.035 Score=28.56 Aligned_cols=443 Identities=13% Similarity=0.062 Sum_probs=182.4 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhH---HHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchh Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNV---SLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFA 77 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~---~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k 77 (530) |-+........+.+ .+++..+.+++ +|- .+.+.+.+|..-. .+.+. .+ .....|+..+-.. T Consensus 1 ~~~~~~~~~~~~~~-~l~~r~~~L~~-~R~~~e~~w~e~a~~~lP~--~~~~~------~~------~~~~~~~~dstg~ 64 (516) T protein:vir:96 1 MKQSIDLEYGGKRS-KIPKLWEKFSN-KRSSFLDRAKHYSKLTLPY--LMNDK------GD------NETSQNGWQGVGA 64 (516) T ss_pred CcchhhhhhhhhHH-HHHHHHHHHHH-HhhHHHHHHHHHHHhhccc--ccCCC------CC------ccccCCcccchHH Confidence 55554444444443 34444555443 222 3445555554441 11110 00 1112245455555 Q ss_pred hHHhhhhhhhccc--cee-----eecCCc----------chHHH-------HHHHHHHh-hccHHHHHHHHHHHHhhcCe Q lcl|NC_011308. 78 ELVDQKTQYLLAN--GID-----VKPTDH----------DDQKL-------CYLIEEYY-NEEFQSAIQELVEGSTIKGY 132 (530) Q Consensus 78 ~Ivd~~~~yl~G~--pv~-----~~~~~~----------~de~~-------~~~l~~~~-~~~~~~~~~e~~~~~~~~G~ 132 (530) ..++..++-|+|- |+. +...+. +..++ ...+...+ ..||....+++-++...+|. T Consensus 65 ~a~~~LAa~l~~~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~ 144 (516) T protein:vir:96 65 QATNHLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGS 144 (516) T ss_pred HHHHHHHHHHHhhhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCe Confidence 5566555555432 211 222111 00111 12233333 35888999999999999999 Q ss_pred EEEEEEecCCCceEEEEecccceEEEEcCCCCceeEEEEEEEEeec------------ccccccceEEEEEEEcCCceEE Q lcl|NC_011308. 133 EGIFARTTSEDKLTFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYS------------DADNKFNSIGHADVWTDTEVWY 200 (530) Q Consensus 133 a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~------------~~~~~~~~~~~~evyt~~~~~~ 200 (530) +. +|.++++.++ .++-.+.++--|..+....+++-....... ......+.-..++||+... T Consensus 145 a~--l~~d~~~~~~--~~pl~~y~v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~--- 217 (516) T protein:vir:96 145 CM--LYKPSKGAIS--AIPMHHYVVNRDTNGDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAK--- 217 (516) T ss_pred Ee--EEecCCCCEE--EEEcCeEEEeeCCCCCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeee--- Confidence 85 6678877554 444444444456666654444332211000 0000111122334443211 Q ss_pred EeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHH Q lcl|NC_011308. 201 YVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIK 275 (530) Q Consensus 201 y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e 275 (530) +.. ++. ..++..+ ...........+|...|++.++- ..+|.|--+ T Consensus 218 -~~~-~~~--------------~~~~~~~--------------d~~~~~~es~~~~~e~P~~~~Rw~~~~ge~YGrgp~~ 267 (516) T protein:vir:96 218 -YLG-DGF--------------WELKQSA--------------DDIPVGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAE 267 (516) T ss_pred -eeC-Cce--------------eEEEEEe--------------CceeeccccccccccCCeeeeeeeecCCCCcccchHH Confidence 000 000 0000000 00011112334566778777664 358999888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHH Q lcl|NC_011308. 276 KVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDID 353 (530) Q Consensus 276 ~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L 353 (530) +..+-+-.++.+.-...........|.+.+.-.+....... ...+.+.+..+..+++..+... .+.......++.+ T Consensus 268 ~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l--~~~~~g~i~~g~~~~v~~~q~~~~~d~~~~~~~i~~~ 345 (516) T protein:vir:96 268 DYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHF--VNSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVY 345 (516) T ss_pred HhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhhh--ccCCCceeecCCcccceeeecCcccchhHHHHHHHHH Confidence 88999989988888888888888888776532111111111 1123345544555666666533 3556666777777 Q ss_pred HHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH-HHH----HHHHhcCCCccccceeeEE Q lcl|NC_011308. 354 ELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTAD-LVV----EDIRRRGLGDYSSTDIKFD 428 (530) Q Consensus 354 ~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~-~i~----~~l~~~~~~~~d~~~i~i~ 428 (530) +..|-..-+.-.+..-.....++..+. .++..++..++..+.++-. ++. ..+...+. ..--..+++. T Consensus 346 ~~rI~~af~~~~l~~r~~~rvTAtEV~-------~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~p-~lp~~~v~~~ 417 (516) T protein:vir:96 346 TRRIGVVFMMETMTRRDAERVTAVEIQ-------RDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGE-SFTSDLVDPV 417 (516) T ss_pred HHHHHHHHhhhhhccCCCccccHHHHH-------HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcCC-CCccccccce Confidence 766644322211111112234554443 3444455555554444211 111 11111111 1111123333 Q ss_pred eCCCCCC-----CHHHHHHHHHHH---HhcC-----CCcHHHHHHhC----C----CCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 429 IEPYILA-----NELDLAMIDKTE---AETN-----QIQINNLLAIA----P----RIGDEETLKAICDTLDLDYEDVVK 487 (530) Q Consensus 429 f~~~~P~-----n~~e~a~~~~~~---~~~g-----~iS~et~l~~~----~----~vdd~~~e~~~~e~e~~e~~~~~~ 487 (530) +...+.. +...+.+.++.. .+.. .+.-..++..+ + ++.. ++|.+++.+.+.+.+..+. T Consensus 418 ~vs~l~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~irs-~eev~~~~~~~~~~q~~~~ 496 (516) T protein:vir:96 418 IITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKS-AEEMAQEQEAQMQAQQAQM 496 (516) T ss_pred eechHHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHhCCCccccCC-HHHHHHHHHHHHHHHHHHH Confidence 3322211 111111111111 1000 11112222211 1 2222 2333333333222222211 Q ss_pred hhhccccccCCccccCCCCCCCCCCccC Q lcl|NC_011308. 488 ALEDQEVEELEPTVTPIIDPLTIEPQPE 515 (530) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (530) ... +. +.++++......++. T Consensus 497 ~a~-~~-------~~~~~~~~~~~~~~~ 516 (516) T protein:vir:96 497 LEE-GV-------AKAVPGVIQQELKEA 516 (516) T ss_pred HHH-Hh-------hhhhhHHhhcccccC Confidence 111 11 112222222222222 No 222 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=86.86 E-value=0.043 Score=28.09 Aligned_cols=414 Identities=13% Similarity=0.026 Sum_probs=185.8 Q ss_pred CCccccc-CCcccHHHHHHHHHH----------HHHHhh-hHHHH-HHHHHHhcccchhhhcccccccccccccccccCC Q lcl|NC_011308. 1 MTNTLLT-TAPDRLGTILSTKID----------EYIRSQ-NVSLA-RVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNAS 67 (530) Q Consensus 1 ~~~~~~~-~~~~~~~~~i~~~i~----------~~~~~~-~~~~~-~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~ 67 (530) |..++.. ..|...+.+.+...- .|...- -+.++ ..++.--.|........ + .+-. T Consensus 1 m~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~~~L---~-------~dm~-- 68 (512) T protein:vir:19 1 MGRILDISGQPFDFDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQADL---A-------FDME-- 68 (512) T ss_pred CcceeCCCCCccccccccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHHHHHHH---H-------HHHH-- Confidence 5444422 112222221111100 000000 01111 11111111111000000 0 0000 Q ss_pred cceeecCchhhHHhhhhhhhcccceeeecCCc---chHHHHHHHHHHhhc--cHHHHHHHHHHHHhhcCeEE-EEEEecC Q lcl|NC_011308. 68 NIKISHGFFAELVDQKTQYLLANGIDVKPTDH---DDQKLCYLIEEYYNE--EFQSAIQELVEGSTIKGYEG-IFARTTS 141 (530) Q Consensus 68 n~ki~~n~~k~Ivd~~~~yl~G~pv~~~~~~~---~de~~~~~l~~~~~~--~~~~~~~e~~~~~~~~G~a~-~~~y~d~ 141 (530) -......-.+.+...-+.|.+..+....+ .+++..+++++++.+ +|.+++..+. ++.-+|.+. +++|.-. T Consensus 69 ---~~D~hi~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~~~~ll-dA~~~G~s~~Ei~w~~~ 144 (512) T protein:vir:19 69 ---EKDTHLFSELSKRRLAIQALEWRIAPARDASAQEKKDADMLNEYLHDAAWFEDALFDAG-DAILKGYSMQEIEWGWL 144 (512) T ss_pred ---hhChHHHHHHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcCCCHHHHHHHHH-hhhhhcceeeeeEeeee Confidence 01234555667777788899988875432 345677888888854 5777777664 577888855 6777544 Q ss_pred CCceE---EEEecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccc Q lcl|NC_011308. 142 EDKLT---FQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTV 218 (530) Q Consensus 142 ~g~~~---~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~ 218 (530) +|.+. +...+|..+ .|+..+... ++.. .....+ ..+ T Consensus 145 ~g~~~~~~~~~r~~~~f--~~~~~~~~~--lr~~------~~~~~G-----~~l-------------------------- 183 (512) T protein:vir:19 145 GKMRVPVALHHRDPALF--CANPDNLNE--LRLR------DASYHG-----LEL-------------------------- 183 (512) T ss_pred CCceeeeeeeeeccccc--eeccCCCcE--EEec------CCCCCc-----eee-------------------------- Confidence 55443 334444432 122211110 0000 000000 000 Q ss_pred cccccceeeeeecccccceecccccccccccccccccCCccceEEe--eCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 219 NPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDIL--YNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQ 296 (530) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~--~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~ 296 (530) .+++.|-+++- ..+..|.|.+..+-...--=+..+.+.+..++ T Consensus 184 -----------------------------------~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E 228 (512) T protein:vir:19 184 -----------------------------------QPFGWFMHRAKSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLE 228 (512) T ss_pred -----------------------------------cCCceEEEeccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 01222222111 12446888888776666666778889999999 Q ss_pred HhccceeeeecCCCCchh------hHHHHHhhCcceecCCCCceeEEEec-CCHHHHHHHHHHHHHHHHHHhcccCCCcc Q lcl|NC_011308. 297 DMAEAIYVVRGGTNSPVD------EIKKNIQSKKIIQTKGEGGLDIQTVD-IPYEARKAKMDIDELNIYRSGMGFNSSAV 369 (530) Q Consensus 297 ~~~~~~lvl~g~~~~~~~------~~~~~~~~~~~i~~~~~~~~~~lt~~-~~~~~~e~~ld~L~~~I~~~s~~p~~~~~ 369 (530) .|..|+.+.+-..+...+ +...++....++.++.+..+++++.. .....++..+++..+.|-+.--+-.++.+ T Consensus 229 ~yG~P~~igky~~~a~~~ek~~L~~al~~~~~~a~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~iLGqtlTs~ 308 (512) T protein:vir:19 229 IYGLPMRVGKYPTGSTNREKATLMQAVMDIGRRAGGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAILGGTLTTE 308 (512) T ss_pred HcCCCeeEEecCCCCCHHHHHHHHHHHHHHhhCcEEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 999999988733222211 22335566777778999999999854 44556788889888888877554444433 Q ss_pred c--cc-CCcH-HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc-cceeeEEeCCCCCCCHHHHHHHH Q lcl|NC_011308. 370 G--DG-NATN-VVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYS-STDIKFDIEPYILANELDLAMID 444 (530) Q Consensus 370 ~--~g-n~SG-vAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d-~~~i~i~f~~~~P~n~~e~a~~~ 444 (530) . .| ++.| +.-+.+..-....| +.+...|.+ .+|-.++........+ ..-..+.|...-|.|....++.+ T Consensus 309 ~g~~Gs~a~~~vh~ev~~di~~aDa----~~i~~tln~--~li~~l~~~N~~~~~~~~~~p~~~f~~~e~eDl~~~a~~~ 382 (512) T protein:vir:19 309 AGDKGARSLGEVHDEVRREIRNADV----GQLARSINR--DLIYPLLALNSDSTIDINRLPGIVFDTSEAGDITALSDAI 382 (512) T ss_pred ccccchhhHHHHHHHHHHHHHHHHH----HHHHHHHHH--HHHHHHHHhCCCCCCCccccceEEecCCChhhHHHHHHHH Confidence 2 12 2222 22222222222222 223333321 0222333333322222 12367888888889988888888 Q ss_pred HHHHhcC-CCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCccc Q lcl|NC_011308. 445 KTEAETN-QIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVI 523 (530) Q Consensus 445 ~~~~~~g-~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (530) ..+. .| .+|.+.+.+.+++ .-++.. +.....-... ....+........ ...+.+. ..+... T Consensus 383 ~~l~-~G~~i~~~~i~e~~Gi-p~~~~~-----------e~~~~~~~~~--~~~~~~~~~~~~~-~~~~~~~--~~d~~~ 444 (512) T protein:vir:19 383 PKLA-AGMRIPVSWIQEKLHI-PQPVGD-----------EAVFTIQPVV--PDNGSQKEAALSA-EDIPQED--DIDRMG 444 (512) T ss_pred HHHh-cCCCCCHHHHHHHhCC-CCCCCc-----------cccccCCCcc--ccccccccccccc-cCCCchh--hHhHHh Confidence 7776 56 4788888888764 111110 0000000000 0000000000000 0000000 000000 Q ss_pred cc--------ccCCC Q lcl|NC_011308. 524 EE--------EPVQE 530 (530) Q Consensus 524 ~~--------~~~~~ 530 (530) .. +|.-+ T Consensus 445 ~~~~~~~~~~~~~~~ 459 (512) T protein:vir:19 445 VSPEDWQRSVDPLLK 459 (512) T ss_pred hhHHHHHHHHHHHHH Confidence 00 00000 No 223 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=85.10 E-value=0.055 Score=27.48 Aligned_cols=335 Identities=13% Similarity=0.100 Sum_probs=135.5 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCc--hhhHHhhhhhhhc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGF--FAELVDQKTQYLL 88 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~--~k~Ivd~~~~yl~ 88 (530) |-... .|...... ....|.. .+ ...+.... ....+.+-+... .-.-|+..++=+. T Consensus 1 M~~~~-------~f~~r~~~----~~~~~~~----~~-------~~~~~~~~-~~~v~~~~al~~~av~~cv~~ia~~ia 57 (359) T protein:vir:10 1 MSILN-------PFERRSSI----TPNNYYP----FM-------VQNGSIVP-NSLVDATEALKNSDLYAVTSLISSDIA 57 (359) T ss_pred Ccccc-------hhhccccC----CCCcchh----hh-------hccccccC-CcccCHHHhhcchHHHHHHHHHHHhhh Confidence 21111 11110000 0000000 00 00000000 000011111000 1123333444444 Q ss_pred ccceeeecCCcchHHHHHHHHHHhhccH---HHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcCCCC Q lcl|NC_011308. 89 ANGIDVKPTDHDDQKLCYLIEEYYNEEF---QSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDDYGT 164 (530) Q Consensus 89 G~pv~~~~~~~~de~~~~~l~~~~~~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~~~~ 164 (530) +-|+. ++......+.+= |.. ......+......+|.||.++-++..|.+ .+..++|..+-+..++. . T Consensus 58 ~~p~~------~~~~~~~L~~~P--N~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~~~~-~ 128 (359) T protein:vir:10 58 GTRFI------GNQVFTSVLNNP--SHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDLTDD-T 128 (359) T ss_pred cCccc------cchHHHHHhhcc--cccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEEEcCC-e Confidence 55542 111122222211 211 12334556677889999999988988875 46778888876655432 1 Q ss_pred ceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccc Q lcl|NC_011308. 165 LQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEE 244 (530) Q Consensus 165 ~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (530) + +|.+..... .....|..+++.|++...... T Consensus 129 ----~-~y~~~~~~~--------~~~~~~~~~evih~~~~~~~~------------------------------------ 159 (359) T protein:vir:10 129 ----L-TYEVNQFDD--------YPSAKYNASEMIHVKIMAYGV------------------------------------ 159 (359) T ss_pred ----E-EEEEEecCC--------ceEEEEcccceEEeccCCCCC------------------------------------ Confidence 1 122111000 011234555565554321100 Q ss_pred cccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC--CCc--hhhHHHHH Q lcl|NC_011308. 245 HEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT--NSP--VDEIKKNI 320 (530) Q Consensus 245 ~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~--~~~--~~~~~~~~ 320 (530) +++ +.-.|.|-++.....|.....+..-.++.+.--+.|-.+++-.. .+. .+.++..+ T Consensus 160 ---------~~~---------dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~ 221 (359) T protein:vir:10 160 ---------DTL---------HNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRKEF 221 (359) T ss_pred ---------Ccc---------CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHH Confidence 000 01136666666555555555444445555555555666665322 121 12222222 Q ss_pred h-------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHHHHHH Q lcl|NC_011308. 321 Q-------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLAMKAQ 391 (530) Q Consensus 321 ~-------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~~ka~ 391 (530) . .++++.+++|-+++-++....+.......+.....|...-.+|. ++..+.++.+...++-.+.. T Consensus 222 ~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~------ 295 (359) T protein:vir:10 222 EKANGGNNSGRVMVLDQSADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVN------ 295 (359) T ss_pred HHHhCccccCCceecCCCcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHH------ Confidence 1 22456666665555554332222333455666777777777875 23322223343333332222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCC--CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCC---CC Q lcl|NC_011308. 392 KTEIALRKTLRWTADLVVEDIRRRGL--GDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAP---RI 465 (530) Q Consensus 392 ~ke~~f~~~l~~~~~~i~~~l~~~~~--~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~---~v 465 (530) .|...+.-+..-++.+-. ..++... -+.| |.......+..+..+|+++...+++.++ .. T Consensus 296 --------~l~~~l~p~~~~l~~~l~~~~~~~~~~-~~~~------d~~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 296 --------ALNRFIEPLISELRIKCDSSIGVDMSP-ITDY------SNSVFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred --------HHHHHHHHHHHHHHHHhhhhhcccchh-hhhc------CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 222222211111111100 0111110 1112 2223334456688899999999988863 33 No 224 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=84.47 E-value=0.06 Score=27.27 Aligned_cols=440 Identities=13% Similarity=0.056 Sum_probs=179.8 Q ss_pred ccCCcccHH---HHHHHHHHHHHHhhhH---HHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhH Q lcl|NC_011308. 6 LTTAPDRLG---TILSTKIDEYIRSQNV---SLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAEL 79 (530) Q Consensus 6 ~~~~~~~~~---~~i~~~i~~~~~~~~~---~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~I 79 (530) .++..|... +-+++..+.+++ +|- .+.+.+.+|..-. ++.+. .. ....-|+..+-...- T Consensus 1 ~~~~~~~~~~~~~~l~~r~~~Lk~-~R~~~e~~w~e~~~~tlP~--~~~~~-----~~-------~~~~~~~~dstg~~a 65 (515) T protein:vir:70 1 MQDTILEYGGQRSKIPKLWEKFSK-KRSPYLDRAKHFAKLTLPY--LMNNK-----GD-------NETSQNGWQGVGAQA 65 (515) T ss_pred CcchhhhhcCCHHHHHHHHHHHHH-hhhHHHHHHHHHHHHhccc--ccCCC-----CC-------cccccccccchHHHH Confidence 333333221 122333333332 222 2445555554441 11110 00 111113444445555 Q ss_pred Hhhhhhhhccc--cee---e--ecCCcc---------hH-HHH-------HHHHHHh-hccHHHHHHHHHHHHhhcCeEE Q lcl|NC_011308. 80 VDQKTQYLLAN--GID---V--KPTDHD---------DQ-KLC-------YLIEEYY-NEEFQSAIQELVEGSTIKGYEG 134 (530) Q Consensus 80 vd~~~~yl~G~--pv~---~--~~~~~~---------de-~~~-------~~l~~~~-~~~~~~~~~e~~~~~~~~G~a~ 134 (530) ++..++.|+|- |+. | ...+.. .. .+. ..+...+ ..||....+++-++...+|.+. T Consensus 66 ~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ 145 (515) T protein:vir:70 66 TNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCL 145 (515) T ss_pred HHHHHHHHHHhhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEE Confidence 55555555432 221 2 111110 11 111 1222223 4588889999999999999985 Q ss_pred EEEEecCCCceEEEEecccceEEEEcCCCCceeEEEEEEEEeeccc------------ccccceEEEEEEEcCCceEEEe Q lcl|NC_011308. 135 IFARTTSEDKLTFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDA------------DNKFNSIGHADVWTDTEVWYYV 202 (530) Q Consensus 135 ~~~y~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~------------~~~~~~~~~~evyt~~~~~~y~ 202 (530) +|.|+++.+++ ++-.+.++--|..+....++|.+......-. ....+.-..+++||. . T Consensus 146 --l~~d~~~~~~~--~pl~~y~v~~d~~G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~------v 215 (515) T protein:vir:70 146 --LYKPSKGAMSA--VPMHHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTH------A 215 (515) T ss_pred --EEEeCCCCeEE--EEcCeEEEeeCCCcCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEE------E Confidence 56687776554 4445555556777877777765542211000 001111112233321 1 Q ss_pred ecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHH Q lcl|NC_011308. 203 QKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKV 277 (530) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v 277 (530) .... .+......+. .+.........+|...|++.++- +.+|.|--++. T Consensus 216 ~~~~------------------------~~~~~~~~e~---d~~~~~~es~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~ 268 (515) T protein:vir:70 216 QYAG------------------------EGFWKINQSA---DDIPVGKESRIKSEKLPFIPLTWKRSYGEDWGRPLAEDY 268 (515) T ss_pred EecC------------------------CCceEEEEec---CceeeccccccccccCCceeeeeeecCCCCcccchHHHh Confidence 0000 0000000000 00111122334577778877664 45899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHHHH Q lcl|NC_011308. 278 KSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDIDEL 355 (530) Q Consensus 278 ~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L~~ 355 (530) .+-+-.++.+.-..........+|.+++.-.+......+ ...+.+.+..+..+++..+... .+.......++.++. T Consensus 269 l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~l--~~~~~g~iv~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ 346 (515) T protein:vir:70 269 SGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDHF--VNSGTGEVITGVAEDIHIVQLGKYADLTPISAVLEVYTR 346 (515) T ss_pred hHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhhc--cccCCceeecCCcccceeeecCcccchhHHHHHHHHHHH Confidence 999999999888888888888888877642222111111 1123345545555666666533 356666777777777 Q ss_pred HHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHH-HHH--hcCCC-ccccceeeEEeCC Q lcl|NC_011308. 356 NIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVE-DIR--RRGLG-DYSSTDIKFDIEP 431 (530) Q Consensus 356 ~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~-~l~--~~~~~-~~d~~~i~i~f~~ 431 (530) .|-..-+.-.+..-.....++..+. .++..++..++..+.++-.-++. ++. ..+.. ..-...+.+.+.. T Consensus 347 rI~~af~~~~l~~rd~~rvTAtEV~-------~r~~E~~~~LGpv~srL~~Ell~Pli~r~~~~~~p~~P~~~v~~~~vs 419 (515) T protein:vir:70 347 RIGVIFMMETMTRRDAERVTAVEIQ-------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVT 419 (515) T ss_pred HHHHHHhhhhhhccCCccccHHHHH-------HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhCCCCChhhcccceeh Confidence 7744332211111111123444443 34455555566555443221111 110 01110 0001113333322 Q ss_pred CCC---C--CHHHHHHHHHHHH---hcC-----CCcH----HHHHHhCC----CCCCHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011308. 432 YIL---A--NELDLAMIDKTEA---ETN-----QIQI----NNLLAIAP----RIGDEETLKAICDTLDLDYEDVVKALE 490 (530) Q Consensus 432 ~~P---~--n~~e~a~~~~~~~---~~g-----~iS~----et~l~~~~----~vdd~~~e~~~~e~e~~e~~~~~~~~~ 490 (530) .+. + +...+.++++.+. ..+ .+.- +.+...++ ++.. +++++.+.+..++. ..+..+. T Consensus 420 ~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~~~rs-~eev~~~r~q~~~~-~~~~~~~ 497 (515) T protein:vir:70 420 GIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKS-EEEMQQEMAQQAQA-QQEAMLN 497 (515) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhhCCHHHHHHHHHHHhCCCccccCC-HHHHHHHHHHHHHH-HHHHHHH Confidence 221 0 1111111111111 111 0111 11111111 2222 23333333322222 2222222 Q ss_pred ccccccCCccccCCCCCCCCCCccC Q lcl|NC_011308. 491 DQEVEELEPTVTPIIDPLTIEPQPE 515 (530) Q Consensus 491 ~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (530) .+... ...+...+..++. T Consensus 498 ~~~~~-------a~~~~~~~~~~~~ 515 (515) T protein:vir:70 498 EGVAK-------AVPGVIQQEMKEG 515 (515) T ss_pred Hhhhh-------hcccchhhhhccC Confidence 22111 1111111222221 No 225 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=84.30 E-value=0.062 Score=27.22 Aligned_cols=407 Identities=13% Similarity=0.057 Sum_probs=164.3 Q ss_pred cccCCcccHHHHHHHHHHHHHHhhhHHHH-HHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhh Q lcl|NC_011308. 5 LLTTAPDRLGTILSTKIDEYIRSQNVSLA-RVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQK 83 (530) Q Consensus 5 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~-~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~ 83 (530) .+.+..|-....++.+ |......... ........+-..-+.-... ..+..+.... -.|++ =.-..|+.. T Consensus 1 ~~~~~~~g~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~g~~v~~~~--a~~~~--aV~~~v~~I 70 (432) T protein:vir:97 1 MPDEKKLGLLGQLKAM---FVPPDPVDIGGGQTFTPVNATARDLGIIIS---DTGAAVNADA--IMRLD--AVAACVKLV 70 (432) T ss_pred CCCcccCchhhhhHhh---cCCccccccccccccccCchhhhhhccccc---ccCcccchHh--hhcch--HHHHHHHHH Confidence 2333333322222222 1110000000 0000000000000000000 0000000000 00121 122345555 Q ss_pred hhhhcccceeeec-CCcch-----HHHHHHHHHHhhccH---HHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEeccc Q lcl|NC_011308. 84 TQYLLANGIDVKP-TDHDD-----QKLCYLIEEYYNEEF---QSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDAL 153 (530) Q Consensus 84 ~~yl~G~pv~~~~-~~~~d-----e~~~~~l~~~~~~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~ 153 (530) ++=+.+-|+.+-- ..++. ..+...|+.- -|.. ......+......+|.||.++..+ +|++ .+..++|. T Consensus 71 a~~ia~lp~~~y~~~~~g~~~~~~~pl~~lL~~~-PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~p~ 148 (432) T protein:vir:97 71 SQAVAAMPLMMYMRTPDGRKEAVNHPLYTLLLDG-PNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLAND 148 (432) T ss_pred HHhhccCceEEEEecCCCcccccccHHHHHHHhc-ccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCc Confidence 5566666776421 11111 1122223211 1222 234456777888999999888776 4664 45678999 Q ss_pred ceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeeccc Q lcl|NC_011308. 154 QLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGV 233 (530) Q Consensus 154 ~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (530) .+-++.+..+.+. |...... +. ...+..+.+.|++... T Consensus 149 ~v~v~~~~~g~~~-----y~~~~~~-----g~----~~~~~~~~iih~r~~~---------------------------- 186 (432) T protein:vir:97 149 RLTITTDTKGNTA-----YRYRRTD-----GQ----MIDIPRQQIWKIMGYS---------------------------- 186 (432) T ss_pred ceEEEEcCCCcEE-----EEEEecC-----ce----EEEEccccEEEecCcC---------------------------- Confidence 9988887665421 2211100 10 1123444555443110 Q ss_pred ccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC-Cc Q lcl|NC_011308. 234 DEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN-SP 312 (530) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~-~~ 312 (530) + +.-.|.|-++.....++....+..-.++.+...+.|-.+++-... ++ T Consensus 187 ----------------------~---------dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~l~~ 235 (432) T protein:vir:97 187 ----------------------L---------DGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTD 235 (432) T ss_pred ----------------------C---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCCCCCH Confidence 0 011355555544444443333333333444444455555543221 11 Q ss_pred --hhhHHHHH----hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCC-cHHHHHHHH Q lcl|NC_011308. 313 --VDEIKKNI----QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNA-TNVVIKSRY 383 (530) Q Consensus 313 --~~~~~~~~----~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~-SGvAik~~~ 383 (530) -+.+...+ ..++++.+++|.+++-++....+..+.+..+.....|...-.+|. ++....|+. .|..++- T Consensus 236 e~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~-- 313 (432) T protein:vir:97 236 DQYDSFSKKVSGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIES-- 313 (432) T ss_pred HHHHHHHHHHhhhhcCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHH-- Confidence 11222221 234567777777666665544444445556777788888878874 222222221 1222211 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHh Q lcl|NC_011308. 384 TLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGL--GDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAI 461 (530) Q Consensus 384 ~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~--~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~ 461 (530) ....++...|.-.++.|..-++.+-. .+.....+++.++.-+-.|..+.++...++..+|+++...+++. T Consensus 314 --------~~~~f~~~tl~P~~~~ie~~ln~kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~ 385 (432) T protein:vir:97 314 --------QQLGFLTMTLSPWLRRIEQSIALNLLTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREI 385 (432) T ss_pred --------HHHHHHHHHHHHHHHHHHHHHhhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHH Confidence 11222333444444444444433211 11111234444444455688888888889999999999999888 Q ss_pred CCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 462 APR--IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 462 ~~~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) +++ +++-...+ . ....+..+.... ....+.|.+..+-...+=+|| T Consensus 386 ~glpp~~g~~~~~-~-------~~~~~~pl~~~~----------------~~~~~~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:97 386 EGLPKLGGNAAVL-T-------VQSAMVPLDSIG----------------LQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred hCCCCCCCCcceE-e-------ecccccchhhhc----------------ccCCCCCCCCCCCcccccccC Confidence 754 22110000 0 000000000000 000111111111111122222 No 226 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=83.36 E-value=0.069 Score=26.95 Aligned_cols=413 Identities=9% Similarity=0.012 Sum_probs=176.6 Q ss_pred CCcccccCCcc-cHHHHHHHHHHHHHHhhhHHHHHHHHHHhc--ccchhhhcccccccccccccccccCCcceeecCchh Q lcl|NC_011308. 1 MTNTLLTTAPD-RLGTILSTKIDEYIRSQNVSLARVGQRYYN--QDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFA 77 (530) Q Consensus 1 ~~~~~~~~~~~-~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~--g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k 77 (530) ||..--...|- ...++...-+..+ + ..|+ ...+.+.+.. .-...++- .-...... T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~~~~--------~----~~~~~~e~~~~lr~~~-----~~~ly~~m-----~e~D~~i~ 58 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGVVDG--------W----TVWDPFEQTPELQWPQ-----SVAVYSRM-----DNEDSRVT 58 (469) T ss_pred CCCcccCCCCccchhhhhhcccccc--------h----hhcccccccccccccc-----chHHHHHH-----HhhChHHH Confidence 66665555552 4444333211100 0 0111 0001111000 00000000 00124455 Q ss_pred hHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh------------------ccHHHHHHHHHHHHhhcCeEE-EEEE Q lcl|NC_011308. 78 ELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN------------------EEFQSAIQELVEGSTIKGYEG-IFAR 138 (530) Q Consensus 78 ~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~------------------~~~~~~~~e~~~~~~~~G~a~-~~~y 138 (530) -.+.+....+.|-+.++...... ++..+++.+.+. ..+.+.+.++...+.-+|.++ +++| T Consensus 59 s~l~~rk~av~~~~w~v~p~~~~-~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw 137 (469) T protein:vir:10 59 SLLEAISLPIRSTPWRIRANGAS-DEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVY 137 (469) T ss_pred HHHHHHHHHHhcCCceEecCCCC-HHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeee Confidence 56666777788888888765433 333333333221 134556666666677789865 6777 Q ss_pred ecC----CCceEEEEecc---cce-EEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccc Q lcl|NC_011308. 139 TTS----EDKLTFQTVDA---LQL-LPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSD 210 (530) Q Consensus 139 ~d~----~g~~~~~~~~p---~~~-~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~ 210 (530) ... +|.+....+.| ..+ .-.|++.+.+... ....... ......|. .+.. T Consensus 138 ~~~~~~~dG~~~~~~l~~rp~~~i~~~~~~~~~~l~~~----~~~~~~~---------------~~~~~~~~--~~~~-- 194 (469) T protein:vir:10 138 RPRNQSPDGRFWLRKLAPRPQWTISKFNVAPDGGLESI----EQIAPPA---------------RTRGSLYV--ANIA-- 194 (469) T ss_pred ecccccCCCceeeeeeeecCcccceeeeeccCCceeee----eecCccc---------------cccccccc--CCCC-- Confidence 532 35554433322 211 1112222211100 0000000 00000000 0000 Q ss_pred hhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEee--CCcCCCCcHHHHHHHHHHHHHHH Q lcl|NC_011308. 211 EYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILY--NNKLGISDIKKVKSIIDDYDLMN 288 (530) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~--nn~~~~sd~e~v~~liDa~~~~~ 288 (530) ...-.+++.|-+++-. .|..|.|.+..+-..---=+..+ T Consensus 195 ---------------------------------------~~~lp~~k~i~~~~~~~~g~p~g~gLlr~~~~~~~fK~~~~ 235 (469) T protein:vir:10 195 ---------------------------------------PPEIPVNRLVVYTRNKRPGQWQGKSILRSAYKHWLLKDKLL 235 (469) T ss_pred ---------------------------------------ccccccCcEEEEEecCCCCCcccchhHHHHHHHHHHHHHHH Confidence 0000011111111111 24467888877655544445578 Q ss_pred HHHHHHHHHhccceeeeecCCCCchhh------HHHHHh--hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 289 CFLSNNLQDMAEAIYVVRGGTNSPVDE------IKKNIQ--SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRS 360 (530) Q Consensus 289 S~~~n~~~~~~~~~lvl~g~~~~~~~~------~~~~~~--~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~ 360 (530) .+.+..++.|..|+.+.+-..+.+.++ ...++. ...++.++.+.+++++....+....+..+++..+.|-+. T Consensus 236 ~~w~~f~EryG~P~~vgky~~~a~~~ek~~l~~a~~~~~~g~~a~~iip~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~ 315 (469) T protein:vir:10 236 RIEAATAERNGMGIPVGTASSATDEDEVRKMAALARSVRGGINAGVGLAQGQILELLGVSGNLPDIRRAIEGHDRSIALS 315 (469) T ss_pred HHHHHHHHHcCCcceEEecCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEeecCCCchHHHHHHHHHHHHHHHH Confidence 889999999999999987544333222 222332 233456788999999998888788888999999998887 Q ss_pred hcccCCCcccc-cC-C-cHHHHHHHHhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCccccceeeEEeCCCCCCC Q lcl|NC_011308. 361 GMGFNSSAVGD-GN-A-TNVVIKSRYTLLAMKAQKTEIALRKTLR-WTADLVVEDIRRRGLGDYSSTDIKFDIEPYILAN 436 (530) Q Consensus 361 s~~p~~~~~~~-gn-~-SGvAik~~~~~l~~ka~~ke~~f~~~l~-~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n 436 (530) --+..++.++- |. + +.+.-+.+-.-+...|. .+...|. +++. .++..... .+..-..+.|. ....+ T Consensus 316 iLG~tlTs~~~gGS~a~~~vh~ev~~d~~~sDa~----~i~~tln~~li~---~l~~lN~g--~~~~~P~~~~~-~~e~~ 385 (469) T protein:vir:10 316 GLAHFLNLDGKGGSYALASVLEDPFTQAVHAYAT----SICRIANQHIIE---DLVDINFG--VDTPAPVLTFD-PIGSR 385 (469) T ss_pred HhcccccccCccchhhHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHH---HHHHhcCC--CCCCccEEEec-CCCCc Confidence 76666654422 22 2 22333322222222233 2333332 1222 22222222 12223567774 34455 Q ss_pred HHHHHHHHHHHHhcCC-----CcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCC Q lcl|NC_011308. 437 ELDLAMIDKTEAETNQ-----IQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIE 511 (530) Q Consensus 437 ~~e~a~~~~~~~~~g~-----iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (530) ....|+.++.++..|+ ++.+.+.+.+++ ..++.....+.. ..+...... ...++ T Consensus 386 ~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gi-p~~~~~~~~~~~-------~~~~~~~~~--~~~~~----------- 444 (469) T protein:vir:10 386 QDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNL-PSELNDTPSAEP-------EEPAAVPNQ--SAAPA----------- 444 (469) T ss_pred HHHHHHHHHHHHhcCCccCccccHHHHHHHhCC-CCCCCCcccccc-------hhcccCCCC--Ccccc----------- Confidence 6677888888888887 345556666543 111111000000 000000000 00000 Q ss_pred CccCcCCCCcccccccCCC Q lcl|NC_011308. 512 PQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 512 ~~~~~~~~~~~~~~~~~~~ 530 (530) ...+.+.+....+.|-.. T Consensus 445 -~~~~~~~~~~~~~~~~~~ 462 (469) T protein:vir:10 445 -RTRSSGNADARARAPKAD 462 (469) T ss_pred -ccCCCCCcccccccCCCh Confidence 011111111111111111 No 227 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=82.46 E-value=0.077 Score=26.70 Aligned_cols=441 Identities=10% Similarity=0.048 Sum_probs=158.2 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) ......+.+. .++ ...+.+++|+.+..+++-...|.. || T Consensus 39 ~~~~~~~~~~-----~~~------~~~eLI~~YR~ma~~pEvd~Av~e------------------------------IV 77 (558) T protein:vir:10 39 FYGQYVDIEG-----AYR------SEYDLIRRYREMALHPEADGAIED------------------------------VV 77 (558) T ss_pred eeeeeecccc-----hhh------hHHHHHHHHHHHhhccchhhHHHH------------------------------hh Confidence 0011111111 000 112345556666666655553321 11 Q ss_pred hhhh-hhhcccceeeecCCcc-----hHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecC----CCceEEEE Q lcl|NC_011308. 81 DQKT-QYLLANGIDVKPTDHD-----DQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTS----EDKLTFQT 149 (530) Q Consensus 81 d~~~-~yl~G~pv~~~~~~~~-----de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~----~g~~~~~~ 149 (530) +-.+ .=....||.+..++.+ .+.+.+.++.+++ =+|+...++..+.+-+.|+-|.|..+|. +|-..... T Consensus 78 neaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRiyfHKiid~k~pk~GI~ELr~ 157 (558) T protein:vir:10 78 NEAIVSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVDGRVFYLKVIDTKNPQEGIQDLRY 157 (558) T ss_pred cceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeee Confidence 1111 1122445544332222 2344555555543 3788899999999999999999999874 36667888 Q ss_pred ecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeee Q lcl|NC_011308. 150 VDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAV 229 (530) Q Consensus 150 ~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (530) +||..+-.|..-..+..-.-....+.... .......+..+-+|++...+.....+. ... . ..+... T Consensus 158 lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~-~~~~~~~~~eyy~Y~~~~~~~~~~~~~-~~~----------~--~~vkI~ 223 (558) T protein:vir:10 158 IDPLKIKFIRQEKRKPGNQDPAIRVRSEQ-DVVPNPEFEEFYIYTPKVQHPTGMVGQ-MGG----------K--NSIKIA 223 (558) T ss_pred eCcccceeeeeeccccccccceeeeeccc-ceeeccceeEeeeecCCccccccccee-ecC----------C--Cceeec Confidence 99998866553211110000000000000 000001111112233322221111000 000 0 000000 Q ss_pred ecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHH------------------ Q lcl|NC_011308. 230 ADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFL------------------ 291 (530) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~------------------ 291 (530) ..+...+.. |-+++ |+..-.|-++..+.....+- ++-|. T Consensus 224 ~dAI~y~hS------------------GL~d~----~~~~i~syLhkAIKp~NQLk-mlEDAlVIYRitRAPERRvFYID 280 (558) T protein:vir:10 224 KDSITMCTS------------------GLVDR----NKNRVLSYLHKAIKALNQLR-MIEDSLVIYRLSRAPERRIFYID 280 (558) T ss_pred hhheeeecc------------------cceec----CCCeeeecchHhhHhHHhhH-HHHhhHHHHhhhccccceEEEEe Confidence 000000000 00000 01111122332211111111 11111 Q ss_pred -------------HHHHHHhccceeeeecC-CC--CchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHH Q lcl|NC_011308. 292 -------------SNNLQDMAEAIYVVRGG-TN--SPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDID 353 (530) Q Consensus 292 -------------~n~~~~~~~~~lvl~g~-~~--~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L 353 (530) .+.+..|.+- || ... .+ .+.-.++.-+....+-.=+++.+-+.-|-+ .+... ..-+.=. T Consensus 281 VGnLPk~KAeqYlr~iM~k~KNk-lV-YDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnLge-m~DV~YF 357 (558) T protein:vir:10 281 VGNLPKVKAEQYLKEVMSRYRNK-LV-YDANTGEVRDDRKFMSMMEDFWLPRREGGRGTEITTLPGGQNLGE-LSDVDYF 357 (558) T ss_pred cCCCCchhHHHHHHHHHHhccce-EE-EeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcch-HHHHHHH Confidence 1111222211 11 110 00 011111111111111111122222222322 22221 2234556 Q ss_pred HHHHHHHhcccC--CCccc-c--cCCcHHHHH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc--cee Q lcl|NC_011308. 354 ELNIYRSGMGFN--SSAVG-D--GNATNVVIK-SRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSS--TDI 425 (530) Q Consensus 354 ~~~I~~~s~~p~--~~~~~-~--gn~SGvAik-~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~--~~i 425 (530) ++.+|..-.+|- +..++ | |.+|.++.. ++|. --+.+.+..|..-|.++|+.=+-+=++....+|+- ..| T Consensus 358 ~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~---KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I 434 (558) T protein:vir:10 358 QKKLYRALGVPESRIAAEGGFNLGRSSEILRDELKFA---KFVGRLRKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHI 434 (558) T ss_pred HHHHHHHhCCCccccCCCCcccccccchhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcc Confidence 677777778883 33332 2 444433221 1121 12344444444444444332111112223345554 467 Q ss_pred eEEeCCCCCCCHHHHHHHHHHHH---------hcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|NC_011308. 426 KFDIEPYILANELDLAMIDKTEA---------ETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEE 496 (530) Q Consensus 426 ~i~f~~~~P~n~~e~a~~~~~~~---------~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~ 496 (530) .+.|.+.---.+...++++.... -+..+|++++.+.+=...|.+ +++++++-+++++..--++.+. T Consensus 435 ~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDee-----I~~~~kqI~~E~k~~~~~~p~~ 509 (558) T protein:vir:10 435 QYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQTDME-----IEEIDTQIEDEIQKGIIPDPSQ 509 (558) T ss_pred eEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHH-----HHHHHHHHHHHHhCCCCCCccc Confidence 78887765544444444433211 122469999998854444433 2333333333333222221111 Q ss_pred CCccc--------cCC-CCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 497 LEPTV--------TPI-IDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 497 ~~~~~--------~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) .++.. ++. .+...+++.|.+....+....+-..+ T Consensus 510 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (558) T protein:vir:10 510 IDPITGEPLPQEGDPAMEGMGEQPVDPDLEAQAQAVDAQYSKD 552 (558) T ss_pred cChhhccccCccCCchhccCCCCCcccccccchhhhhhhhhhh Confidence 11111 111 12223333333333333333222222 No 228 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=80.85 E-value=0.091 Score=26.29 Aligned_cols=407 Identities=9% Similarity=-0.051 Sum_probs=152.6 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccc------------ccc--------ccccccccccCCcce Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIM------------WMN--------DHGDIVEDDNASNIK 70 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~------------~~~--------~~~~~~~~~~~~n~k 70 (530) |- ++-.+........+. .+..+-...+........ ... ..+..+. +..= T Consensus 1 M~---~~~~l~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~----~~~a 69 (466) T protein:vir:81 1 MR---LIDRLLSTRGAAPRM----SIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLA----TQAY 69 (466) T ss_pred Cc---hhHHHhhccCccccc----chhhhhhhhhhhhccccccccccccHHHHHhhccccccccCccccccc----hhhh Confidence 43 333333222211100 011111111110000000 000 0000000 0000 Q ss_pred eecCchhhHHhhhhhhhcccceeeecCCcch-HHH-HHHHHHHhh--cc---HHHHHHHHHHHHhhcCeEEEEEEecCCC Q lcl|NC_011308. 71 ISHGFFAELVDQKTQYLLANGIDVKPTDHDD-QKL-CYLIEEYYN--EE---FQSAIQELVEGSTIKGYEGIFARTTSED 143 (530) Q Consensus 71 i~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~d-e~~-~~~l~~~~~--~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g 143 (530) +...-....|+..+.=+.+-|+.+.-..++. +.. ...+..++. |. .......+..+...+|.||.++.+++.| T Consensus 70 ~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g 149 (466) T protein:vir:81 70 QANGPVFACMLVRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFV 149 (466) T ss_pred hccHHHHHHHHHHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCcc Confidence 1123345567777777777888764332211 111 111222222 22 2234466777889999999999888766 Q ss_pred ce---------EEEEecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhc Q lcl|NC_011308. 144 KL---------TFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVL 214 (530) Q Consensus 144 ~~---------~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~ 214 (530) .+ .+..++|..+.+..+..+......++. .. ..... .....|..+.+.|++... T Consensus 150 ~l~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~y~~~--~~--~~~~~----~~~~~~~~~dviHir~~~--------- 212 (466) T protein:vir:81 150 RMRPDWVDVVVEERMVRGGRGELGGGQLGWRKVGYLYT--EG--GRQSG----NESVGFLAEDVVHFAPIP--------- 212 (466) T ss_pred ccccccCcceeEEEEecCcceEEEEcCCCceEEEEEEE--ec--Ccccc----cceeeeccccEEEEcCCC--------- Confidence 53 356677777777665544332221111 00 00000 001123344444432110 Q ss_pred cccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 215 DTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNN 294 (530) Q Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~ 294 (530) ++++ .-.|.|-+......|+....+..-..+. T Consensus 213 ---------------------------------------~~~d---------~~~G~s~i~~~~~~i~~~~a~~~~~~~~ 244 (466) T protein:vir:81 213 ---------------------------------------DPLA---------SYRGMSWLTPILREIRADQAMSKHQAKF 244 (466) T ss_pred ---------------------------------------Cccc---------ccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 0136666665555555444444444455 Q ss_pred HHHhccceeeeecCC-CC--chhhHHHHHh--------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011308. 295 LQDMAEAIYVVRGGT-NS--PVDEIKKNIQ--------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMG 363 (530) Q Consensus 295 ~~~~~~~~lvl~g~~-~~--~~~~~~~~~~--------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~ 363 (530) +.....|-.+++-.. .+ ..+.++..+. .++++.++++.+++-++....+.......+...+.|...-.+ T Consensus 245 f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgV 324 (466) T protein:vir:81 245 FDNGATVNLVIKHNPMADPAAVKKWADEVNSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGETRIAAAAGV 324 (466) T ss_pred HhcCCCcceEEecCCCCCHHHHHHHHHHHHHHhcCccccccceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCC Confidence 555566766665322 11 1222222221 234667777776666655444444445566777888888788 Q ss_pred cCC--Ccc-cccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCC--CCCCHH Q lcl|NC_011308. 364 FNS--SAV-GDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPY--ILANEL 438 (530) Q Consensus 364 p~~--~~~-~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~--~P~n~~ 438 (530) |.. +.. +.+.+++..++-. .+.++...|.-.++.|...++.+-........+.+.|+.. +=.|.. T Consensus 325 Pp~~lG~~~~~~~st~sn~eq~----------~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~llr~d~~ 394 (466) T protein:vir:81 325 PPVIVGLSEGLAAATYSNYGQA----------RRRLADGTAHPLWQNLSGCIGHVMPDMGPDVRLWYDADDVPFLREDEK 394 (466) T ss_pred CHHHcccccCCCccccccHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcCCcccCcceEEEecchhhhccCHH Confidence 842 211 1122222222111 1122233333333333333332211111112344555432 223444 Q ss_pred HHHHH-------HHHHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCC Q lcl|NC_011308. 439 DLAMI-------DKTEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIE 511 (530) Q Consensus 439 e~a~~-------~~~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (530) +.+++ ...+..+|+ ....++...+.-|.+- +........+. +..+... .. T Consensus 395 ~r~~~~~~~~~~~~~~~~~g~-t~nE~r~~~~~gd~~~-----~~~~~~~~~~~---~~~~~~~--------------~~ 451 (466) T protein:vir:81 395 DAADIQKVRAETINTLITAGY-EPESVVAAVNSGDLRL-----LKHTGLTSVQL---LPPGVSA--------------SA 451 (466) T ss_pred HHHHHHHHHHHHHHHHHHcCC-ChhhccccccCCcccc-----ccCCCcchhhh---ccccccc--------------cc Confidence 44333 223344452 3333332221111100 00000000000 0000000 00 Q ss_pred CccCcCCCCcccccc Q lcl|NC_011308. 512 PQPEPLNIDPVIEEE 526 (530) Q Consensus 512 ~~~~~~~~~~~~~~~ 526 (530) ..++|.+..+..+++ T Consensus 452 ~~~~~~~~Gg~~ngn 466 (466) T protein:vir:81 452 SSDTPTSGGADDNGN 466 (466) T ss_pred CCCCcccCCCCcCCC Confidence 000011111112222 No 229 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=80.69 E-value=0.092 Score=26.25 Aligned_cols=443 Identities=13% Similarity=0.046 Sum_probs=178.2 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhh---HHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchh Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQN---VSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFA 77 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~---~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k 77 (530) |-+.-......+.+. +++..+.+++ +| ..+.+.+.+|..-. ++.+. .. .....|+..+-.. T Consensus 1 ~~~~~~~~~~~~~~~-l~~r~~~L~~-~R~~~e~~w~e~a~~~lP~--~~~~~-----~~-------~~~~~~~~dstg~ 64 (516) T protein:vir:10 1 MKQSTDLEYGGKRSK-IPKLWEKFST-KRSSFLDRAKHYSKLTLPY--LMNDK-----GD-------NETSQNGWQGVGA 64 (516) T ss_pred CCchhhHhhhhHHHH-HHHHHHHHHH-hhhHHHHHHHHHHHhhccc--ccCCC-----CC-------cccccccccchHH Confidence 444333333333333 3444444432 23 23344555554431 11110 00 1111245555555 Q ss_pred hHHhhhhhhhccc--cee-----eecCCcc----------hHHHHH-------HHHHHh-hccHHHHHHHHHHHHhhcCe Q lcl|NC_011308. 78 ELVDQKTQYLLAN--GID-----VKPTDHD----------DQKLCY-------LIEEYY-NEEFQSAIQELVEGSTIKGY 132 (530) Q Consensus 78 ~Ivd~~~~yl~G~--pv~-----~~~~~~~----------de~~~~-------~l~~~~-~~~~~~~~~e~~~~~~~~G~ 132 (530) ..++.-++-|+|- |+. +...+.. ...+.+ .+...+ ..||....+++-++...+|. T Consensus 65 ~a~~~LAa~l~~~ltpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~ 144 (516) T protein:vir:10 65 QATNHLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGS 144 (516) T ss_pred HHHHHHHHHHHhhhcCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCe Confidence 5555555555432 221 2221110 011111 222233 46888999999999999999 Q ss_pred EEEEEEecCCCceEEEEecccceEEEEcCCCCceeEEEEEEEEeeccc------------ccccceEEEEEEEcCCceEE Q lcl|NC_011308. 133 EGIFARTTSEDKLTFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDA------------DNKFNSIGHADVWTDTEVWY 200 (530) Q Consensus 133 a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~------------~~~~~~~~~~evyt~~~~~~ 200 (530) +. +|.|+++.+++ ++-.+.++--|..+....+++........-. ....+....+++||.- T Consensus 145 a~--l~~d~~~~~~~--~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v---- 216 (516) T protein:vir:10 145 CM--LYKPSKGAISA--IPMHHYVVNRDTNGDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHA---- 216 (516) T ss_pred Ee--EEecCCCCeEE--EEcCeEEEeeCCCCCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEE---- Confidence 85 67788776554 4444555555767766555543321111000 0001112233343310 Q ss_pred EeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHH Q lcl|NC_011308. 201 YVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIK 275 (530) Q Consensus 201 y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e 275 (530) ++ ...+. .... . .. ...........+|...|++.++- ..+|.|--+ T Consensus 217 ~~-~~~~~--------------~~~~--~---------~~---d~~~~~~~s~~~~~e~P~~~~Rw~~~~ge~YGrgp~~ 267 (516) T protein:vir:10 217 KY-LGEGF--------------WELK--Q---------SA---DDIPVGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAE 267 (516) T ss_pred Ee-cCCCc--------------eEEE--E---------ee---CceeeccccccccccCCeeeeeeeecCCCCcccchHH Confidence 00 00000 0000 0 00 00011112234566777777664 358999888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHH Q lcl|NC_011308. 276 KVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDID 353 (530) Q Consensus 276 ~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L 353 (530) +..+-+-.++.+.-...........|.+.+.-.+....... ...+.+.+..+..+++..+... .+.......++.+ T Consensus 268 ~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l--~~~~~g~~~~g~~~~v~~~q~~~~~d~~~~~~~i~~~ 345 (516) T protein:vir:10 268 DYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHF--VNSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVY 345 (516) T ss_pred HhhHHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhhh--ccCCCceeecCCcccceeeecCcccchHHHHHHHHHH Confidence 88899989988888888888888888776632111111111 1123344544555666666533 3556666777777 Q ss_pred HHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHh----cCCCccccceeeEE Q lcl|NC_011308. 354 ELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTAD-LVVEDIRR----RGLGDYSSTDIKFD 428 (530) Q Consensus 354 ~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~-~i~~~l~~----~~~~~~d~~~i~i~ 428 (530) +..|-..-+.-.+..-.....++..+. .++..++..++..+.++-. ++.-++.. ... ..=...+.+. T Consensus 346 ~~rI~~af~~~~l~~rd~~rvTAtEV~-------~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~~p-~~P~~lv~~~ 417 (516) T protein:vir:10 346 TRRIGVVFMMETMTRRDAERVTAVEIQ-------RDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGD-SFTSDLVDPV 417 (516) T ss_pred HHHHHHHHhhhhhhccCCccccHHHHH-------HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhCC-CCChhhcCcc Confidence 776644332211111111223444443 3445555555555544321 11111111 110 0000001111 Q ss_pred eCCCC---CC--CHHHHHHHHHHHHhcCCCcH------------HHHHHhC----CCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 429 IEPYI---LA--NELDLAMIDKTEAETNQIQI------------NNLLAIA----PRIGDEETLKAICDTLDLDYEDVVK 487 (530) Q Consensus 429 f~~~~---P~--n~~e~a~~~~~~~~~g~iS~------------et~l~~~----~~vdd~~~e~~~~e~e~~e~~~~~~ 487 (530) ..-.+ -+ +...+.+.++.......++. +.+...+ .++.. ++|++.+.+++.+.+.... T Consensus 418 ~v~~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~~~~~a~~~gvp~~~irs-~eev~~~r~~~~~~q~~~~ 496 (516) T protein:vir:10 418 IITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKS-AEEMEQEQEAQMQAQQAQM 496 (516) T ss_pred eehhHHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHHHHHHHHHhCCChhccCC-HHHHHHHHHHHHHHHHHHH Confidence 11111 01 11111111111100000111 1222222 13333 3444444444433333222 Q ss_pred hhhccccccCCccccCCCCCCCCCCccC Q lcl|NC_011308. 488 ALEDQEVEELEPTVTPIIDPLTIEPQPE 515 (530) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (530) ..+.... .-+.. -.++.++. T Consensus 497 ~~~~~~~----~~~~~----~~~~~~~~ 516 (516) T protein:vir:10 497 LEEGVAK----AVPGV----IQQELKEA 516 (516) T ss_pred HHHHhhh----cccch----hhhhhhcC Confidence 2111100 00111 11122221 No 230 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=79.74 E-value=0.1 Score=26.03 Aligned_cols=392 Identities=9% Similarity=-0.008 Sum_probs=156.7 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) +--.....+.-++...-...+.+--.--...+ ...|++... .-...++ .. ......-.+ T Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~----------~~~iLr~~~-----~~~ly~~-----m~-~D~hi~s~l 72 (448) T protein:vir:79 14 GPGSIDPSDVPKLEGASVPVMSTSYDVVVDRE----------FDELLQGKD-----GLLVYHK-----ML-SDGTVKNAL 72 (448) T ss_pred cccccccccchhhhhhhhhhcccccccccccc----------hhHhhcccc-----chHHHHH-----Hh-hChHHHHHH Confidence 10011111111111111111111000000000 001111000 0000000 00 023344566 Q ss_pred hhhhhhhcccceeeecCCc--chHHHHHHHHHHhhc--------cHHHHHHHHHHHHhhcCeEE-EEEEe-cCCCceEEE Q lcl|NC_011308. 81 DQKTQYLLANGIDVKPTDH--DDQKLCYLIEEYYNE--------EFQSAIQELVEGSTIKGYEG-IFART-TSEDKLTFQ 148 (530) Q Consensus 81 d~~~~yl~G~pv~~~~~~~--~de~~~~~l~~~~~~--------~~~~~~~e~~~~~~~~G~a~-~~~y~-d~~g~~~~~ 148 (530) .+...-+.|.+..+..... .+.+..+++.+.+.. +|.+++.++ .++..+|.++ +++|. ..+|.+... T Consensus 73 ~~Rk~av~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~~~-lda~~~G~s~~Eivw~~~~~g~~~~~ 151 (448) T protein:vir:79 73 NYIFGRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGKLILD 151 (448) T ss_pred HHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHHHH-HHhhhhcceeEEEEeeecCCCceecc Confidence 6667778899988864322 344556666665532 355555443 5677888865 67774 456765433 Q ss_pred Ee---cccce-EEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccc Q lcl|NC_011308. 149 TV---DALQL-LPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQ 224 (530) Q Consensus 149 ~~---~p~~~-~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~ 224 (530) .+ +|... ...|+..+.+.. +...+....... T Consensus 152 ~l~~r~~~~~~~f~~~~d~~l~~----------------------------------~~~~~~~~~~~~----------- 186 (448) T protein:vir:79 152 KIVPIHPFNIDEVLYDEEGGPKA----------------------------------LKLSGEVKGGSQ----------- 186 (448) T ss_pred cccccCCccccceeeecCCceEE----------------------------------eecCCccccccc----------- Confidence 22 33211 112332222111 100000000000 Q ss_pred eeeeeecccccceecccccccccccccccccCCccceEEeeC----CcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011308. 225 HVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN----NKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAE 300 (530) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n----n~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~ 300 (530) .......+++++ +++.+ |-.|.|.+..+--..=-=+..+.+.+..++.|.. T Consensus 187 -----------------------~~~~~~lP~~~~--i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~ 241 (448) T protein:vir:79 187 -----------------------FVSGLEIPIWKT--VVFLHNDDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMI 241 (448) T ss_pred -----------------------CCCccccccceE--EEEecCccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCC Confidence 000001122222 22222 2356777776655555556778889999999999 Q ss_pred ceeeeecCCCCc--hh------hHHHHHh--hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCccc Q lcl|NC_011308. 301 AIYVVRGGTNSP--VD------EIKKNIQ--SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVG 370 (530) Q Consensus 301 ~~lvl~g~~~~~--~~------~~~~~~~--~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~ 370 (530) |+.+.+-..+.+ .. ....+++ ...++.++.+..+++++...........+++..+.|-..--+-.++.++ T Consensus 242 P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~~~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLGqtlTs~~ 321 (448) T protein:vir:79 242 GVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARALGIDFNTVQL 321 (448) T ss_pred ceEEEecCCCCCcCHHHHHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCcccHHHHHHHHHHHHHHHHhhhhhcccc Confidence 999988432221 11 1222332 2334557888999999877655555557777777776655444444333 Q ss_pred ccCCc----HHHHHHHHhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHH Q lcl|NC_011308. 371 DGNAT----NVVIKSRYTLLAMKAQKTEIALRKTLRW-TADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDK 445 (530) Q Consensus 371 ~gn~S----GvAik~~~~~l~~ka~~ke~~f~~~l~~-~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~ 445 (530) .|..+ |+.-..+-..+...|.. +...|.+ ++. .++..+... +..-..+.|...-|.|.+..|+.+. T Consensus 322 ~~g~~~~~~~~~~~v~~~~~~aDa~~----i~~tln~~li~---~l~~lNfg~--~~~~P~~~f~~~e~~Dl~~~a~~~~ 392 (448) T protein:vir:79 322 NMGVQAINIGEFVSLTQQTIISLQRE----FASAVNLYLIP---KLVLPNWPS--ATRFPRLTFEMEERNDFSAAANLMG 392 (448) T ss_pred ccchhhhhhhhHHHHHHHHHHHHHHH----HHHHHHHHHHH---HHHHhcCCC--cCCCcEEEecCCChHHHHHHHHHhh Confidence 22222 22122111111122222 3333322 222 223333221 1123577887776777777776654 Q ss_pred HHHhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCccccc Q lcl|NC_011308. 446 TEAETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEE 525 (530) Q Consensus 446 ~~~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (530) +++..+ .+.+ .+.++ + ... ++..+...+.+ ...+++.+.. ..+ T Consensus 393 ~l~~~~--------------~~~~----~~~~~---~----~~~---------p~~~~~~~~~a-~~~~~~~~~~--~~~ 435 (448) T protein:vir:79 393 MLINAV--------------KDSE----DIPTE---L----KAL---------IDALPSKMRRA-LGVVDEVREA--VRQ 435 (448) T ss_pred hhhccc--------------hhhH----HHHHH---h----hcC---------CCCCCCccccc-cCCCCccccc--ccC Confidence 443211 1111 01000 0 000 00111000101 1111111111 111 Q ss_pred ccCCC Q lcl|NC_011308. 526 EPVQE 530 (530) Q Consensus 526 ~~~~~ 530 (530) .+.++ T Consensus 436 ~~~~~ 440 (448) T protein:vir:79 436 PADSR 440 (448) T ss_pred Ccccc Confidence 22222 No 231 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=78.35 E-value=0.12 Score=25.73 Aligned_cols=400 Identities=8% Similarity=-0.022 Sum_probs=149.4 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) +.+.+..+.... -..+ .+++|+.+-.+++-...|.. || T Consensus 54 ~~~~~d~~~~~~---~~~~---------LI~~YR~ma~~pEvd~Av~e------------------------------Iv 91 (516) T protein:vir:10 54 MQQFFGIDNNIS---GTKD---------LINTYRQLTNNPEVERAVAN------------------------------IV 91 (516) T ss_pred eeeeecccCccc---cHHH---------HHHHHHHhhhccchhHHHHH------------------------------hh Confidence 333332222221 1122 24556666666666654322 11 Q ss_pred hhhh-hhhcccceeeecCCcc-----hHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEec--CCCceEEEEec Q lcl|NC_011308. 81 DQKT-QYLLANGIDVKPTDHD-----DQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTT--SEDKLTFQTVD 151 (530) Q Consensus 81 d~~~-~yl~G~pv~~~~~~~~-----de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d--~~g~~~~~~~~ 151 (530) +-.+ .=-...||.+..++.+ .+.+.+.++.+++ =+|+...++..+.+-+.|+-|.|-..| ++|-.....+| T Consensus 92 neaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~elr~lD 171 (516) T protein:vir:10 92 NEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICRLLDASRKLDTLFRRWYIDSRIFFHKIMPNPKEGIVELRRLD 171 (516) T ss_pred cceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhhhhcceEEEEEEecCcccceeeeeeeC Confidence 1111 1122455555433321 2334555555543 378888999999999999999987776 34656678899 Q ss_pred ccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeec Q lcl|NC_011308. 152 ALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVAD 231 (530) Q Consensus 152 p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (530) |..+..|.---.+. ..+....+.+...-+|+... ..|...+....+... +..... T Consensus 172 Pr~i~~vR~i~~~~------------~~~~~v~~~~~e~~~Y~~~~-~~~~~~g~~~~~~~~------------ikI~~d 226 (516) T protein:vir:10 172 PRHVEYYREIVTSD------------VGGTSVVKGYREFFVYTTGN-EGYAYNGRLFEPNTR------------IKIPRS 226 (516) T ss_pred CcceeeEEeeeccc------------CcchhhhhceeeeeeeecCc-cceeccccccCCCCc------------eecchh Confidence 99886654210000 00011111122222333221 111111110000000 000000 Q ss_pred ccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHH-------------------- Q lcl|NC_011308. 232 GVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFL-------------------- 291 (530) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~-------------------- 291 (530) +...+ +.+| .|+ ++..=.|-++..+.....+- ++-|. T Consensus 227 aI~y~----------------hSGl--~d~----~~~~i~syLhkAiKp~NQLk-m~EDAlVIYRitRAPeRRvFYIDVG 283 (516) T protein:vir:10 227 AIVYA----------------HSGL--QDC----SDRGIVGYLHNAVKPANQLK-LLEDALVIYRITRAPERRVFYIDVG 283 (516) T ss_pred heeee----------------ecCc--ccC----CCCceeceehhhhHhHHhhH-HHHhhHHHHhhhccccceEEEEecC Confidence 00000 0000 000 00000122222111111111 11111 Q ss_pred -----------HHHHHHhccceeeeecCCCC--chhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHHHHH Q lcl|NC_011308. 292 -----------SNNLQDMAEAIYVVRGGTNS--PVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDIDELN 356 (530) Q Consensus 292 -----------~n~~~~~~~~~lvl~g~~~~--~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L~~~ 356 (530) .+.+..|.+-+.. -...++ +.-.++.-+....+--=+++.+-+.-|-+ .+.. .-.-+.=.++. T Consensus 284 nLPk~KAeqYl~~iM~k~KNklvY-Da~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlg-em~DV~YF~kk 361 (516) T protein:vir:10 284 NMPNRKATEYVNGIMQSLKNRVVY-DSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVTSLPGAQTMG-EMDDVRWFNKK 361 (516) T ss_pred CCCchhHHHHHHHHHHhcCceeEE-eCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcC-hHHHHHHHHHH Confidence 1122222222111 000110 11111111111111111122222322333 2222 22334556677 Q ss_pred HHHHhcccC--CCccc-c----cCCcHHHHH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc--eee Q lcl|NC_011308. 357 IYRSGMGFN--SSAVG-D----GNATNVVIK-SRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSST--DIK 426 (530) Q Consensus 357 I~~~s~~p~--~~~~~-~----gn~SGvAik-~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~--~i~ 426 (530) +|..-.+|- +..++ | |..|-+... ++|.-. +.+.+..|...+..+|+.=+-+=++....+|+-. .|. T Consensus 362 Ly~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiKF~KF---I~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~ 438 (516) T protein:vir:10 362 LYEALRIPLSRMPRDDGGMVIGGQDMAITRDELDFRKF---IVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIK 438 (516) T ss_pred HHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcce Confidence 788888883 33322 2 333332221 223222 3334444443333333211111112223455543 577 Q ss_pred EEeCCCCCCCHHHHHHHHHHH---------HhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccC Q lcl|NC_011308. 427 FDIEPYILANELDLAMIDKTE---------AETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEEL 497 (530) Q Consensus 427 i~f~~~~P~n~~e~a~~~~~~---------~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~ 497 (530) +.|.+.---.+...++++... .-++.+|++++.+.+=...|.+ +++++++-+++++..--++.+.. T Consensus 439 ~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~yi~k~ILr~tDee-----i~~~~k~I~~E~~~~~~~~p~~e 513 (516) T protein:vir:10 439 VNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVSHDYVMKNILQMTDEQ-----IAQEEKQIEKEANVKRFQNPENE 513 (516) T ss_pred EEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhH-----HHHHHHHHHHhhhCCCCCCCCcc Confidence 778766544444444433221 1235789999998864444432 33333333333332211111111 Q ss_pred Ccc Q lcl|NC_011308. 498 EPT 500 (530) Q Consensus 498 ~~~ 500 (530) ++. T Consensus 514 ~~f 516 (516) T protein:vir:10 514 DDF 516 (516) T ss_pred ccC Confidence 111 No 232 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=77.23 E-value=0.13 Score=25.50 Aligned_cols=338 Identities=9% Similarity=0.014 Sum_probs=132.0 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCccee--ecCchhhHHhhhhhhhc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKI--SHGFFAELVDQKTQYLL 88 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki--~~n~~k~Ivd~~~~yl~ 88 (530) |-+..-+.. |.+ ....... ..-....+..+ -.......|+..++=+. T Consensus 1 Mg~f~~~~~----~~~-----------------------~~~~~~~----~~~~~~~~~~~~~~~~~v~~~i~~Ia~~iA 49 (378) T protein:vir:16 1 MNLFGKVVS----FSR-----------------------GKLNNDT----QRVTAWQNEAVEYTSAFVTNIHNKIANEIT 49 (378) T ss_pred Cccchhhhh----hhc-----------------------ccccCCc----ceeeecccchhhHHHHHHHHHHHHHHhhhh Confidence 221111100 000 0000000 00000000011 11223345566666666 Q ss_pred ccceee-ecCCcc--h----HHHHHHHHHHhh---cc---HHHHHHHHHHHHhhcCeEEEEEEecCC-CceEEEEecccc Q lcl|NC_011308. 89 ANGIDV-KPTDHD--D----QKLCYLIEEYYN---EE---FQSAIQELVEGSTIKGYEGIFARTTSE-DKLTFQTVDALQ 154 (530) Q Consensus 89 G~pv~~-~~~~~~--d----e~~~~~l~~~~~---~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~-g~~~~~~~~p~~ 154 (530) +-|+.+ .....+ . +....-|..+++ |. .......+......+|.||.+.-++.. |++. T Consensus 50 ~l~~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~-------- 121 (378) T protein:vir:16 50 KVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELL-------- 121 (378) T ss_pred hCceeEEEEcccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEE-------- Confidence 677753 211110 0 001122333332 21 223345567788889999976543321 2211 Q ss_pred eEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccc Q lcl|NC_011308. 155 LLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVD 234 (530) Q Consensus 155 ~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (530) .++-..+ . ..|..+.+.|++ T Consensus 122 --~l~~~~~----------------------~----~~~~~~diih~r-------------------------------- 141 (378) T protein:vir:16 122 --DLLFADD----------------------K----KEYKPEELVRLT-------------------------------- 141 (378) T ss_pred --EEEecCC----------------------e----eEecccceEEec-------------------------------- Confidence 1100000 0 011223333332 Q ss_pred cceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-CCch Q lcl|NC_011308. 235 EAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT-NSPV 313 (530) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~-~~~~ 313 (530) +.-.+......+..+.++++..++.- ....+|.+.+.- .+.. T Consensus 142 -------------------------------~~~~~~~~~s~l~~~~~~i~~~~~~~------~~~g~l~~~~~l~~~~~ 184 (378) T protein:vir:16 142 -------------------------------SPFYINEDTSILDNALASIQTKLEQG------KLRGLLKINAFLDIDNT 184 (378) T ss_pred -------------------------------CccCccchhHHHHHHHHHHHHHHhcC------ccceeeEeCCcCCHHHH Confidence 11111222233445555554433321 112333333321 1111 Q ss_pred h----hHHHHH-------hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHH Q lcl|NC_011308. 314 D----EIKKNI-------QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKS 381 (530) Q Consensus 314 ~----~~~~~~-------~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~ 381 (530) + .+.... ..++++.++++.+++-++.+..+... ..++.+.+.|...-.+|.. -+ |+.|.. T Consensus 185 ~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~---~l~g~~~e~---- 256 (378) T protein:vir:16 185 QEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNEN---ILLGTASQE---- 256 (378) T ss_pred HHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHH---HhcCCchHH---- Confidence 1 122222 12356777777766666554333333 3456677788877777741 11 222211 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---------CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCC Q lcl|NC_011308. 382 RYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGL---------GDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQ 452 (530) Q Consensus 382 ~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~---------~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~ 452 (530) ....++...|.-.++.|..-++.+-. .......+.+.++.-+-.|..+.++....+..+|+ T Consensus 257 ----------~~~~f~~~tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~ 326 (378) T protein:vir:16 257 ----------QQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPI 326 (378) T ss_pred ----------HHHHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCC Confidence 11234555666666665555543211 11122345566666666788888888889999999 Q ss_pred CcHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCC Q lcl|NC_011308. 453 IQINNLLAIAPR--IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTI 510 (530) Q Consensus 453 iS~et~l~~~~~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (530) ++...+++.+++ +++-+.-+-... ....+.....+ +.. ....+..+.+++ T Consensus 327 ~T~NE~R~~~g~~p~~ggD~~~~~~n---~~~~~~~~~~~-~~~----~~~~~~~e~~ne 378 (378) T protein:vir:16 327 FTQNQLLVKMGEQPIEGGDVYIANLN---AVAVKNLSDLQ-GSR----KDVTSTDETNNQ 378 (378) T ss_pred cCHHHHHHHhCCCCCCCCCeEeeccc---cccccchhhhc-Ccc----CCCCCCCCCCCC Confidence 999999988754 322111100000 00000000000 000 000000000011 No 233 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=75.01 E-value=0.15 Score=25.08 Aligned_cols=401 Identities=9% Similarity=0.031 Sum_probs=154.3 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhcccce Q lcl|NC_011308. 13 LGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLANGI 92 (530) Q Consensus 13 ~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~pv 92 (530) +..+|.+++.+...... ......-++. |.. +.. ....+..+- ...=...+-.-..|+..++=+.+-|+ T Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~-g~~--~~~----~~~~~~~~~----~~~a~~~~~v~~~v~~ia~~iA~lp~ 68 (460) T protein:vir:10 1 MANRIIRALRELTGLDN-KFNDAFIKYI-GQT--FTK----YDNNGKTYL----EQGYNINPDVYSCISQMAAKTVAVPY 68 (460) T ss_pred CchhHHHHHhhhhccCC-CchHHHHHhh-ccc--cCC----Cccchhhhh----HHHHhcchHHHHHHHHHHHhhhhCce Confidence 33444554433211110 0000011111 110 000 000000000 00001112233445666666667777 Q ss_pred eeecCCcch--HH----------------------------HHHHHHHHhh--cc---HHHHHHHHHHHHhhcCeEEEEE Q lcl|NC_011308. 93 DVKPTDHDD--QK----------------------------LCYLIEEYYN--EE---FQSAIQELVEGSTIKGYEGIFA 137 (530) Q Consensus 93 ~~~~~~~~d--e~----------------------------~~~~l~~~~~--~~---~~~~~~e~~~~~~~~G~a~~~~ 137 (530) .+--...+. .. ....+..++. |. .......+......+|.||.++ T Consensus 69 ~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i 148 (460) T protein:vir:10 69 TIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYL 148 (460) T ss_pred EEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEE Confidence 653211110 00 0000111221 21 2234456677888999999888 Q ss_pred EecCC----CceE-EEEecccceEEEEcCCCCceeE---EEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCccc Q lcl|NC_011308. 138 RTTSE----DKLT-FQTVDALQLLPVFDDYGTLQRI---IRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRS 209 (530) Q Consensus 138 y~d~~----g~~~-~~~~~p~~~~~v~d~~~~~~~~---~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~ 209 (530) -++.. |.+. +..++|..+-+..++.+..... ++.|... .. .....|.++.+.|++....... T Consensus 149 ~r~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~--~~--------g~~~~~~~~evih~r~~~~~~~ 218 (460) T protein:vir:10 149 MSPDDGINAGVPSQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLI--QG--------DQFIEFNEDEVIHTKYANPNFD 218 (460) T ss_pred EecCCCccCceeEEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEe--cC--------ceeEEecccceEEEecCCCCcc Confidence 87654 4443 6778898887776654432111 1111110 00 0112345555555543221100 Q ss_pred chhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHH Q lcl|NC_011308. 210 DEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNC 289 (530) Q Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S 289 (530) . . ...-.|.|.+......|.....+.. T Consensus 219 ~----------------------------------------------~-------~~~~~G~sp~~~~~~~i~~~~~~~~ 245 (460) T protein:vir:10 219 L----------------------------------------------Q-------GSHLYGMSPIRAILRNINSQNSTID 245 (460) T ss_pred c----------------------------------------------c-------cCccccccHHHHHHHHHHHHHHHHH Confidence 0 0 0001356666655555555444443 Q ss_pred HHHHHHHHhccceeeeec-CCCCc--hhhHHHHH--------hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHH Q lcl|NC_011308. 290 FLSNNLQDMAEAIYVVRG-GTNSP--VDEIKKNI--------QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIY 358 (530) Q Consensus 290 ~~~n~~~~~~~~~lvl~g-~~~~~--~~~~~~~~--------~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~ 358 (530) -..+.+.-.+.|-.+++. ...++ .+.++..+ ..++++.+++|.+++-++....+.......+...+.|. T Consensus 246 ~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia 325 (460) T protein:vir:10 246 NNVKTMQNGGVFGFIHGGSTGLTQPQADSLKQRLTEMDKSPDRLSQIAGASGEIAFTKISLNTDELKPFDYLKYDQKAIC 325 (460) T ss_pred HHHHHHhcCCCcceeeecCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHH Confidence 334444444445444432 22222 12222221 12346667766666555544444444555677778888 Q ss_pred HHhcccCC--CcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc-cceeeEEeCCCCCC Q lcl|NC_011308. 359 RSGMGFNS--SAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYS-STDIKFDIEPYILA 435 (530) Q Consensus 359 ~~s~~p~~--~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d-~~~i~i~f~~~~P~ 435 (530) ..-.+|.. +...-++.++..++- ....++...|.-.+..|...++.+-..+.. .....+.|...-.. T Consensus 326 ~~fgVPp~~lg~~~~~t~~~sn~e~----------~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~ 395 (460) T protein:vir:10 326 NALGWSDKLLNNNEGGGLNTGNLEE----------ERKRVVTDNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELP 395 (460) T ss_pred HHhCCCHHHhCCCCCCCCccccHHH----------HHHHHHHHHHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhh Confidence 87777742 211111111111111 112223334444444444433332111111 11233444322211 Q ss_pred CHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCc Q lcl|NC_011308. 436 NELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQ 513 (530) Q Consensus 436 n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (530) ...+.......+...|+++...+++.+++ ++++-. +.......-+.-+.. .+... T Consensus 396 ~l~~d~~~~~~~~~~g~~T~NE~R~~~g~~pi~~~~g------------D~~~~~~n~~~~~~~-----------~~~~~ 452 (460) T protein:vir:10 396 EMQTDMVAMASWLNTIPVTPNEIRIAMKYETLNQDGM------------DIVFMPSNKVRIDDV-----------SNNLI 452 (460) T ss_pred hHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCC------------Ceeeecccccchhhc-----------ccccC Confidence 12222233345677899999888888743 222100 000000000000000 01111 Q ss_pred cCcCCCCc Q lcl|NC_011308. 514 PEPLNIDP 521 (530) Q Consensus 514 ~~~~~~~~ 521 (530) +.+.|.++ T Consensus 453 ~~~~nq~~ 460 (460) T protein:vir:10 453 DSAFNQNQ 460 (460) T ss_pred CCcccCCC Confidence 11112222 No 234 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=73.47 E-value=0.17 Score=24.80 Aligned_cols=338 Identities=9% Similarity=0.013 Sum_probs=130.7 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceee--cCchhhHHhhhhhhhc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKIS--HGFFAELVDQKTQYLL 88 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~--~n~~k~Ivd~~~~yl~ 88 (530) |-...-+.. | .+....... ..-....+..+. .......|+..++=+. T Consensus 1 Mg~f~~~~~----f-----------------------~~~~~~~~~----~~~~~~~~~~~~~~~~~v~~~i~~Ia~~iA 49 (378) T protein:vir:93 1 MNLFGKVVS----F-----------------------SRGKLNNDT----QRVTAWQNEAVEYTSAFVTNIHNKIANEIT 49 (378) T ss_pred Cccchhhhh----h-----------------------hccccCCCc----ceeeecccchhHHHHHHHHHHHHHHHhhhh Confidence 111111000 0 000000000 000000001111 1123344666667777 Q ss_pred ccceeee-cCCcc--hHH----HHHHHHHHhh---cc---HHHHHHHHHHHHhhcCeEEEEEEecC-CCceEEEEecccc Q lcl|NC_011308. 89 ANGIDVK-PTDHD--DQK----LCYLIEEYYN---EE---FQSAIQELVEGSTIKGYEGIFARTTS-EDKLTFQTVDALQ 154 (530) Q Consensus 89 G~pv~~~-~~~~~--de~----~~~~l~~~~~---~~---~~~~~~e~~~~~~~~G~a~~~~y~d~-~g~~~~~~~~p~~ 154 (530) +-|+++- ....+ .+. ...-|..+++ |. .......+..+...+|.||.+.-++. .|++. T Consensus 50 ~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~-------- 121 (378) T protein:vir:93 50 KVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELL-------- 121 (378) T ss_pred hCceeeEEEcccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEE-------- Confidence 7787642 11111 010 1122333332 22 22344567778889999997654332 22211 Q ss_pred eEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccc Q lcl|NC_011308. 155 LLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVD 234 (530) Q Consensus 155 ~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (530) .++-... . ..|..+.+.|++ T Consensus 122 --~l~~~~~--------------------~------~~~~~~diih~r-------------------------------- 141 (378) T protein:vir:93 122 --DLLFADD--------------------K------KEYKTEELVRLT-------------------------------- 141 (378) T ss_pred --EEEecCC--------------------e------eEeccceeEEec-------------------------------- Confidence 1110000 0 012223333332 Q ss_pred cceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-CCch Q lcl|NC_011308. 235 EAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT-NSPV 313 (530) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~-~~~~ 313 (530) +.-.+......+..+..+++..++. . ....++.+.|.- .+.. T Consensus 142 -------------------------------~~~~~~~~~s~l~~~~~~i~~~~~~---~---~~~g~l~~~~~l~~~~~ 184 (378) T protein:vir:93 142 -------------------------------SPFYINEDTSILDNALASIQTKLEQ---G---KLRGLLKINAFLDIDNT 184 (378) T ss_pred -------------------------------CccccchhhHHHHHHHHHHHHHHhc---C---cccceeeeCCcCCHHHH Confidence 1101111222333444444433321 1 112233333321 1111 Q ss_pred hh----HHHHH-------hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHH Q lcl|NC_011308. 314 DE----IKKNI-------QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKS 381 (530) Q Consensus 314 ~~----~~~~~-------~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~ 381 (530) ++ +.... ..++++.++++.+++-++.+..+... ...+.+.+.|...-.+|.. -. |+.|.. T Consensus 185 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~---~l~g~~~e~---- 256 (378) T protein:vir:93 185 QEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNEN---ILLGTATQE---- 256 (378) T ss_pred HHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHH---HhcCCcHHH---- Confidence 11 21211 12356777776666666544333333 3456677778877777741 11 222211 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---------ccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCC Q lcl|NC_011308. 382 RYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLG---------DYSSTDIKFDIEPYILANELDLAMIDKTEAETNQ 452 (530) Q Consensus 382 ~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~---------~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~ 452 (530) ....++...|.-.++.|...++.+-.. ......+.+.++.-+-.|..+.++....+..+|+ T Consensus 257 ----------~~~~f~~~tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~ 326 (378) T protein:vir:93 257 ----------QQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPI 326 (378) T ss_pred ----------HHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCC Confidence 112345556666666666555432211 1112335555555666788888888889999999 Q ss_pred CcHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCC Q lcl|NC_011308. 453 IQINNLLAIAPR--IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNI 519 (530) Q Consensus 453 iS~et~l~~~~~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (530) ++...+++.+++ +++.+.-.-.. .....+.....+.. .. + . ++..+.-|. T Consensus 327 ~t~NE~R~~~gl~p~~ggD~~~~~~---n~~~~~~~~~~~~~-~~---~-~---------~~~~e~~n~ 378 (378) T protein:vir:93 327 FTQNQLLVKMGEQPIEGGDVYIANL---NAVAVKNLSDLQGS-RK---D-V---------TSTDETNNQ 378 (378) T ss_pred cCHHHHHHHhCCCCCCCCCeeeecc---ccccccchhhhcCc-cC---C-C---------CCCCCCCCC Confidence 999999888764 22211100000 00000000000000 00 0 0 000000000 No 235 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=72.59 E-value=0.18 Score=24.66 Aligned_cols=387 Identities=9% Similarity=-0.000 Sum_probs=158.3 Q ss_pred HHHHHHHhhhH-H-HHHHHHHHhcccchhhhcccccccccccccccccCCcceee------cCchhhHHhhhhhhhcccc Q lcl|NC_011308. 20 KIDEYIRSQNV-S-LARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKIS------HGFFAELVDQKTQYLLANG 91 (530) Q Consensus 20 ~i~~~~~~~~~-~-~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~------~n~~k~Ivd~~~~yl~G~p 91 (530) ++-++-.-... . .+-....+++.+. . ..+................++..+. +.=....|+..++=+.+-| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~lf~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~vs~~~al~~~~v~~cv~~Ia~~iA~lp 78 (424) T protein:vir:45 1 MLYCWWAHWLWPEGGRVLLDALFRSKS-L-ENPSTPITGDAVDTDGLFRADVYVSPETAMKLAAVYSCIYVLSSSLAQMP 78 (424) T ss_pred CeeEeeeceecCcchhHHHHhhccccC-C-CCCccccchhhhhhhccccCCceechHHhhccHHHHHHHHHHHHHHhhCc Confidence 11111000000 0 0111222222221 0 0000000000000000011111111 1112235666666677778 Q ss_pred eeeecCCcch------HHHHHHHHHHhhccH---HHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEecccceEEEEcC Q lcl|NC_011308. 92 IDVKPTDHDD------QKLCYLIEEYYNEEF---QSAIQELVEGSTIKGYEGIFARTTSEDKL-TFQTVDALQLLPVFDD 161 (530) Q Consensus 92 v~~~~~~~~d------e~~~~~l~~~~~~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~-~~~~~~p~~~~~v~d~ 161 (530) +++--.+.+. ......|+.- -|.. ......+...+..+|.+|.++-++..|++ ....++|..+.+..+ T Consensus 79 ~~v~~~~~~~~~~~~~~~l~~lL~~~-PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~i~~~- 156 (424) T protein:vir:45 79 LHVMRRHKGKVEPARDHPAFYLVHDE-PNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWETTLMNT- 156 (424) T ss_pred eEEEEecCCceeecccchHHHHHHhh-cccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEEc- Confidence 8753222111 1222223221 0221 23345577788899999999888888886 467788887765433 Q ss_pred CCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccc Q lcl|NC_011308. 162 YGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEG 241 (530) Q Consensus 162 ~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 241 (530) .+.. +|.+.. . .. ...+.++.+.|++.... T Consensus 157 ~~~~-----~y~~~~-~-----~~----~~~~~~~eVih~r~~~~----------------------------------- 186 (424) T protein:vir:45 157 GGRY-----TYGLYN-E-----YG----AFAISPDDMIHIRALGN----------------------------------- 186 (424) T ss_pred CCeE-----EEEEEe-c-----Cc----eEEECcccEEEecCcCC----------------------------------- Confidence 2211 111110 0 00 01234444444431100 Q ss_pred ccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC-Cc--hhhHHH Q lcl|NC_011308. 242 VEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN-SP--VDEIKK 318 (530) Q Consensus 242 ~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~-~~--~~~~~~ 318 (530) +.-.|.|.++.....|+....+..-.++.+.-.+.|-.+++-... ++ .+.++. T Consensus 187 ------------------------d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~ 242 (424) T protein:vir:45 187 ------------------------NQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGLNKESWGWLKD 242 (424) T ss_pred ------------------------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHH Confidence 111366666655555544333333333444444556666653222 21 111222 Q ss_pred HH----h-----hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHHHHHHHHhhHH Q lcl|NC_011308. 319 NI----Q-----SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNVVIKSRYTLLA 387 (530) Q Consensus 319 ~~----~-----~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGvAik~~~~~l~ 387 (530) .+ . .++++.+++|.+++-++....+.......+...+.|...-.+|. ++....|+-|++. T Consensus 243 ~~~~~~~g~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e--------- 313 (424) T protein:vir:45 243 QWQKASQALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNIS--------- 313 (424) T ss_pred HHHHHhccccccCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHH--------- Confidence 11 1 23456666666665555433333334455666677777777774 2221112222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc---ceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC Q lcl|NC_011308. 388 MKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSS---TDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR 464 (530) Q Consensus 388 ~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~---~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~ 464 (530) .....++...|.-.++.|..-++.+-....+. ..+++.+..-+-.|..+.++...++.++|+++...+++.+++ T Consensus 314 ---q~~~~f~~~tL~P~~~~ie~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl 390 (424) T protein:vir:45 314 ---AQAIQFVRYTMMPWVTNWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDM 390 (424) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 11122344445555555544444322111111 123343434444678888888889999999999988887754 Q ss_pred --CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCccccc Q lcl|NC_011308. 465 --IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEE 525 (530) Q Consensus 465 --vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (530) +++-+ +....+.-.. +..+..+.+.++ +.+++ T Consensus 391 ~pi~ggD--------------~~~~~~n~~~---------~~~~~~~~~~~~------~~~~~ 424 (424) T protein:vir:45 391 NPVEGLD--------------EMLVSVNAAN---------PAGDFKPPKNDE------GKTNE 424 (424) T ss_pred CCCCCcc--------------eeeecccccc---------cccccCCCCCCC------CCCCC Confidence 22211 1111111000 000000011111 11111 No 236 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=71.30 E-value=0.2 Score=24.45 Aligned_cols=400 Identities=7% Similarity=-0.020 Sum_probs=151.0 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) +.+.+ +.++.- +-.. +.+++|+.+..+++-...|.. || T Consensus 54 ~~~~~-~~~~~~--~~~~---------eLI~~YR~ma~~pEvd~Av~e------------------------------IV 91 (516) T protein:vir:10 54 MQQFF-GIDNNI--SGTK---------DLINTYRQLINNPEVERAVAN------------------------------IV 91 (516) T ss_pred eeeee-cccccc--chHH---------HHHHHHHHHhhccchhhHHHH------------------------------hh Confidence 11122 222211 0112 234556666666655553321 11 Q ss_pred hhhh-hhhcccceeeecCCcc-----hHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEec--CCCceEEEEec Q lcl|NC_011308. 81 DQKT-QYLLANGIDVKPTDHD-----DQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTT--SEDKLTFQTVD 151 (530) Q Consensus 81 d~~~-~yl~G~pv~~~~~~~~-----de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d--~~g~~~~~~~~ 151 (530) +-.+ .=-...||.+...+.+ .+.+.+.++.+++ =+|+...++..+.+-+.|+-|.|-..| ++|-.....+| T Consensus 92 neaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~Elr~lD 171 (516) T protein:vir:10 92 NEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMPNPKKGIAELRRLD 171 (516) T ss_pred cceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCccccceeeeeeC Confidence 1111 1122445555443222 2335555555543 378889999999999999999987776 34656678899 Q ss_pred ccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeec Q lcl|NC_011308. 152 ALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVAD 231 (530) Q Consensus 152 p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (530) |..+..|.---.+ . ..+....+.+...-+|+... ..|...+....+... +..... T Consensus 172 Pr~i~~vR~i~~~-----------~-~~~~~v~~~~~e~~~Y~~~~-~~~~~~g~~~~~~~~------------ikI~~d 226 (516) T protein:vir:10 172 PRFMEYYREIVTS-----------D-IGGTTIVKGYREFFIYTTGN-EGYSYNGRIFEPNTR------------IKIPRS 226 (516) T ss_pred CcceeeEeeeccc-----------c-cccchhhhhhhheeeeccCc-cccccccceeCCCcc------------eeechh Confidence 9988665421000 0 00000011111122333221 122111111000000 000000 Q ss_pred ccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHH-------------------- Q lcl|NC_011308. 232 GVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFL-------------------- 291 (530) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~-------------------- 291 (530) +... .-.|.++. +...-.|-++..+.....+- ++-|. T Consensus 227 AI~y------------------~hSGL~d~----~~~~i~syLhkAiKp~NQLk-m~EDAlVIYRitRAPeRRvFYIDvG 283 (516) T protein:vir:10 227 AVVY------------------ASSGLMDC----SDRGIIGYLHNAVKPANQLK-LLEDAMVIYRITRAPERRVFYIDVG 283 (516) T ss_pred heee------------------ecccceeC----CCCceeeeehhhhHhHHhhH-HHHhhHHHHhhhccccceEEEEecC Confidence 0000 00011110 00011222332211111111 11111 Q ss_pred -----------HHHHHHhccceeeeecCCCC--chhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHHHHH Q lcl|NC_011308. 292 -----------SNNLQDMAEAIYVVRGGTNS--PVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDIDELN 356 (530) Q Consensus 292 -----------~n~~~~~~~~~lvl~g~~~~--~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L~~~ 356 (530) .+.+..|.+-+.. -...++ +.-.++.-+....+--=+++.+-+.-|-+ .+.. .-.-+.=.++. T Consensus 284 nlPk~KAeqYl~~im~k~kNklvY-Da~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlg-em~DV~YF~kk 361 (516) T protein:vir:10 284 NMNNRKATEYVNGIMQSLKNRVVY-DSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMG-DMDDVRWFNKK 361 (516) T ss_pred CCCchhHHHHHHHHHHhcCceeEE-eCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcC-hHHHHHHHHHH Confidence 1222222222111 001110 11111111111111111222222322333 2222 22334556677 Q ss_pred HHHHhcccC--CCccc-c----cCCcHHHHH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc--ceee Q lcl|NC_011308. 357 IYRSGMGFN--SSAVG-D----GNATNVVIK-SRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSS--TDIK 426 (530) Q Consensus 357 I~~~s~~p~--~~~~~-~----gn~SGvAik-~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~--~~i~ 426 (530) +|..-.+|- +..++ | |.+|.+... .+|.- -+.+.+..|..-+.++|+.=+-+=++....+|+- ..|. T Consensus 362 Ly~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~K---FI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~ 438 (516) T protein:vir:10 362 LYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRK---FVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIK 438 (516) T ss_pred HHHHhCCCcccccCCCCceeeccccchhhHHHHHHHH---HHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcce Confidence 788888883 33322 1 333333222 22222 2344444555544444432111112223345554 4677 Q ss_pred EEeCCCCCCCHHHHHHHHHHH---------HhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccC Q lcl|NC_011308. 427 FDIEPYILANELDLAMIDKTE---------AETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEEL 497 (530) Q Consensus 427 i~f~~~~P~n~~e~a~~~~~~---------~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~ 497 (530) +.|.+.---.+...++++... .-++.+|.+++.+.+=...|.+ +++++++-+++++..--+..++. T Consensus 439 ~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDee-----i~~e~k~I~~E~~~~~~~~p~~~ 513 (516) T protein:vir:10 439 VNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQ-----IAQEEKQIEQEAGIKRFQNPENE 513 (516) T ss_pred EEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhh-----HHHHHHHHHHhhhCCCCCCCCcc Confidence 888776554444444443221 1235789999998854444432 33333333333333222211111 Q ss_pred Ccc Q lcl|NC_011308. 498 EPT 500 (530) Q Consensus 498 ~~~ 500 (530) ++. T Consensus 514 ~~f 516 (516) T protein:vir:10 514 DDF 516 (516) T ss_pred ccC Confidence 111 No 237 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=71.30 E-value=0.2 Score=24.45 Aligned_cols=400 Identities=7% Similarity=-0.020 Sum_probs=151.0 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) +.+.+ +.++.- +-.. +.+++|+.+..+++-...|.. || T Consensus 54 ~~~~~-~~~~~~--~~~~---------eLI~~YR~ma~~pEvd~Av~e------------------------------IV 91 (516) T protein:vir:10 54 MQQFF-GIDNNI--SGTK---------DLINTYRQLINNPEVERAVAN------------------------------IV 91 (516) T ss_pred eeeee-cccccc--chHH---------HHHHHHHHHhhccchhhHHHH------------------------------hh Confidence 11122 222211 0112 234556666666655553321 11 Q ss_pred hhhh-hhhcccceeeecCCcc-----hHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEec--CCCceEEEEec Q lcl|NC_011308. 81 DQKT-QYLLANGIDVKPTDHD-----DQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTT--SEDKLTFQTVD 151 (530) Q Consensus 81 d~~~-~yl~G~pv~~~~~~~~-----de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d--~~g~~~~~~~~ 151 (530) +-.+ .=-...||.+...+.+ .+.+.+.++.+++ =+|+...++..+.+-+.|+-|.|-..| ++|-.....+| T Consensus 92 neaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~Elr~lD 171 (516) T protein:vir:10 92 NEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMPNPKKGIAELRRLD 171 (516) T ss_pred cceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCccccceeeeeeC Confidence 1111 1122445555443222 2335555555543 378889999999999999999987776 34656678899 Q ss_pred ccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeec Q lcl|NC_011308. 152 ALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVAD 231 (530) Q Consensus 152 p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (530) |..+..|.---.+ . ..+....+.+...-+|+... ..|...+....+... +..... T Consensus 172 Pr~i~~vR~i~~~-----------~-~~~~~v~~~~~e~~~Y~~~~-~~~~~~g~~~~~~~~------------ikI~~d 226 (516) T protein:vir:10 172 PRFMEYYREIVTS-----------D-IGGTTIVKGYREFFIYTTGN-EGYSYNGRIFEPNTR------------IKIPRS 226 (516) T ss_pred CcceeeEeeeccc-----------c-cccchhhhhhhheeeeccCc-cccccccceeCCCcc------------eeechh Confidence 9988665421000 0 00000011111122333221 122111111000000 000000 Q ss_pred ccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHH-------------------- Q lcl|NC_011308. 232 GVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFL-------------------- 291 (530) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~-------------------- 291 (530) +... .-.|.++. +...-.|-++..+.....+- ++-|. T Consensus 227 AI~y------------------~hSGL~d~----~~~~i~syLhkAiKp~NQLk-m~EDAlVIYRitRAPeRRvFYIDvG 283 (516) T protein:vir:10 227 AVVY------------------ASSGLMDC----SDRGIIGYLHNAVKPANQLK-LLEDAMVIYRITRAPERRVFYIDVG 283 (516) T ss_pred heee------------------ecccceeC----CCCceeeeehhhhHhHHhhH-HHHhhHHHHhhhccccceEEEEecC Confidence 0000 00011110 00011222332211111111 11111 Q ss_pred -----------HHHHHHhccceeeeecCCCC--chhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHHHHH Q lcl|NC_011308. 292 -----------SNNLQDMAEAIYVVRGGTNS--PVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDIDELN 356 (530) Q Consensus 292 -----------~n~~~~~~~~~lvl~g~~~~--~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L~~~ 356 (530) .+.+..|.+-+.. -...++ +.-.++.-+....+--=+++.+-+.-|-+ .+.. .-.-+.=.++. T Consensus 284 nlPk~KAeqYl~~im~k~kNklvY-Da~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlg-em~DV~YF~kk 361 (516) T protein:vir:10 284 NMNNRKATEYVNGIMQSLKNRVVY-DSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMG-DMDDVRWFNKK 361 (516) T ss_pred CCCchhHHHHHHHHHHhcCceeEE-eCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcC-hHHHHHHHHHH Confidence 1222222222111 001110 11111111111111111222222322333 2222 22334556677 Q ss_pred HHHHhcccC--CCccc-c----cCCcHHHHH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc--ceee Q lcl|NC_011308. 357 IYRSGMGFN--SSAVG-D----GNATNVVIK-SRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSS--TDIK 426 (530) Q Consensus 357 I~~~s~~p~--~~~~~-~----gn~SGvAik-~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~--~~i~ 426 (530) +|..-.+|- +..++ | |.+|.+... .+|.- -+.+.+..|..-+.++|+.=+-+=++....+|+- ..|. T Consensus 362 Ly~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~K---FI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~ 438 (516) T protein:vir:10 362 LYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRK---FVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIK 438 (516) T ss_pred HHHHhCCCcccccCCCCceeeccccchhhHHHHHHHH---HHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcce Confidence 788888883 33322 1 333333222 22222 2344444555544444432111112223345554 4677 Q ss_pred EEeCCCCCCCHHHHHHHHHHH---------HhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccccC Q lcl|NC_011308. 427 FDIEPYILANELDLAMIDKTE---------AETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVEEL 497 (530) Q Consensus 427 i~f~~~~P~n~~e~a~~~~~~---------~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~ 497 (530) +.|.+.---.+...++++... .-++.+|.+++.+.+=...|.+ +++++++-+++++..--+..++. T Consensus 439 ~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDee-----i~~e~k~I~~E~~~~~~~~p~~~ 513 (516) T protein:vir:10 439 VNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQ-----IAQEEKQIEQEAGIKRFQNPENE 513 (516) T ss_pred EEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhh-----HHHHHHHHHHhhhCCCCCCCCcc Confidence 888776554444444443221 1235789999998854444432 33333333333333222211111 Q ss_pred Ccc Q lcl|NC_011308. 498 EPT 500 (530) Q Consensus 498 ~~~ 500 (530) ++. T Consensus 514 ~~f 516 (516) T protein:vir:10 514 DDF 516 (516) T ss_pred ccC Confidence 111 No 238 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=70.67 E-value=0.21 Score=24.35 Aligned_cols=304 Identities=9% Similarity=-0.054 Sum_probs=126.2 Q ss_pred EEEEEEecCCCceEE---EEecccce-EEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcc Q lcl|NC_011308. 133 EGIFARTTSEDKLTF---QTVDALQL-LPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGR 208 (530) Q Consensus 133 a~~~~y~d~~g~~~~---~~~~p~~~-~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~ 208 (530) .+|++|.-.+|.+.. ...+|..+ +-.+++.+.+... +... ...... +++ T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f~~~~~~~l~~~-~~~~-----~~g~~~-----~~l---------------- 53 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRFDVAPDGGLVAI-EQWG-----VFGKAT-----VRI---------------- 53 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeeeeeccCCceeEE-EecC-----CCCCCc-----cee---------------- Confidence 888888766665543 33344321 1122322221110 0000 000000 000 Q ss_pred cchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEe--eCCcCCCCcHHHHHHHHHHHHH Q lcl|NC_011308. 209 SDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDIL--YNNKLGISDIKKVKSIIDDYDL 286 (530) Q Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~--~nn~~~~sd~e~v~~liDa~~~ 286 (530) -+++.|-+.+- ..|-.|.|.+..+--..--=.. T Consensus 54 ---------------------------------------------p~~kfi~~~~~~~~g~p~G~gLlr~~~w~~~fK~~ 88 (355) T protein:vir:78 54 ---------------------------------------------PVDRLVVFVNEREGANWLGQSLLRQAYKNWLLKDR 88 (355) T ss_pred ---------------------------------------------ccCCEEEEEeCCCCCCccchhhHHHHHHHHHHHHh Confidence 01111111110 0233567777665444444456 Q ss_pred HHHHHHHHHHHhccceeeeecCCCCc---hh----------------hHHHHHhh--CcceecCCCCceeEEEecCCHHH Q lcl|NC_011308. 287 MNCFLSNNLQDMAEAIYVVRGGTNSP---VD----------------EIKKNIQS--KKIIQTKGEGGLDIQTVDIPYEA 345 (530) Q Consensus 287 ~~S~~~n~~~~~~~~~lvl~g~~~~~---~~----------------~~~~~~~~--~~~i~~~~~~~~~~lt~~~~~~~ 345 (530) .+.+.+..++.|..|+.+.+|..+.. .+ ....++.. ..++.++.+..++++........ T Consensus 89 ~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a~~iip~g~~ie~~ea~g~~~~ 168 (355) T protein:vir:78 89 FLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAAGGYIPHGANFTLTGVQGKLPE 168 (355) T ss_pred hHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcceeEeecCCceEEEeecCCCccc Confidence 77788889999988888887743211 00 01111111 13455788889999987766556 Q ss_pred HHHHHHHHHHHHHHHhcccCCCcccc---c-CC-cHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc Q lcl|NC_011308. 346 RKAKMDIDELNIYRSGMGFNSSAVGD---G-NA-TNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDY 420 (530) Q Consensus 346 ~e~~ld~L~~~I~~~s~~p~~~~~~~---g-n~-SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~ 420 (530) ....+++..+.|-..--+..++.++. | .+ +.+..+.+..-+...|......+.+ +++.-+ +..+... T Consensus 169 ~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~vh~~v~~~~~~aD~~~i~~~ln~---~li~~l---~~lN~~~-- 240 (355) T protein:vir:78 169 MDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGDTFASFFTGSLNAVMKHIADVTQQ---HVVEDL---VDQNWGP-- 240 (355) T ss_pred HHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHH---HHhcCCC-- Confidence 66788888888877766655543322 2 12 2233433333333333333333322 233322 2222221 Q ss_pred ccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCC-cHH----HHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhh-hcccc Q lcl|NC_011308. 421 SSTDIKFDIEPYILANELDLAMIDKTEAETNQI-QIN----NLLAIAPRIGDEETLKAICDTLDLDYEDVVKAL-EDQEV 494 (530) Q Consensus 421 d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~i-S~e----t~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~-~~~~~ 494 (530) +..-..+.|. ..+.++...|+.+..+...|+. +.+ .+.+.+++ ..+.... + ......... ..+.. T Consensus 241 ~~~~P~~~~~-~~~~~~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gi-p~p~~~~----~---~~~~~~~~~~~~~~~ 311 (355) T protein:vir:78 241 EEPAPRLVPA-QLGKEQPVTAEAIRALVECGAFTADPELEKDLRARYGL-PAPAERD----D---GADAAAAKAAGRRRA 311 (355) T ss_pred CCCCCEEEec-CcChhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCC-CCCCCCC----c---ccCCccccccccccc Confidence 1223567784 4566767788888888888863 432 23344432 1111000 0 000000000 00000 Q ss_pred ccCCccccCCCCCCCCCCccCcCCCCcccccc-cCCC Q lcl|NC_011308. 495 EELEPTVTPIIDPLTIEPQPEPLNIDPVIEEE-PVQE 530 (530) Q Consensus 495 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 530 (530) ....+... ....+...|...++...+. -++. T Consensus 312 ~~~~~~~~-----~~~~~a~~~~a~~~~~~~~~~~~~ 343 (355) T protein:vir:78 312 KRLPGQRQ-----GAALPSRSPRADPPRRRGPLRRRP 343 (355) T ss_pred cccCCccc-----cccccccCCCCCChhhhHHHHHHh Confidence 00000000 0011111111111111110 1111 No 239 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=68.95 E-value=0.23 Score=24.08 Aligned_cols=464 Identities=9% Similarity=0.013 Sum_probs=189.2 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhh--hHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhh Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQ--NVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAE 78 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~--~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~ 78 (530) |..++...... +-+++..+.+++.. -..+.+.+.+|....- +.+ + +. ...+...|+..+-+.. T Consensus 1 m~~~~~~~~~~---~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~--~~~-----~--~~---~~~~~~~~~~dst~~~ 65 (532) T protein:vir:99 1 MAEVEKTGFAA---DGAAAAYNRLKNDRGAYETRAEDCATYTIPSV--FPS-----A--TA---DGSTSYTTPWQSIGAR 65 (532) T ss_pred CcchhhccccH---HHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcc--cCC-----C--CC---cchhhccccccchHHH Confidence 76666443322 22444455544321 1233445555543321 100 0 00 1112234666666777 Q ss_pred HHhhhhhhhccc--cee---ee--cCCcc----------hHHHH-------HHHHHHh-hccHHHHHHHHHHHHhhcCeE Q lcl|NC_011308. 79 LVDQKTQYLLAN--GID---VK--PTDHD----------DQKLC-------YLIEEYY-NEEFQSAIQELVEGSTIKGYE 133 (530) Q Consensus 79 Ivd~~~~yl~G~--pv~---~~--~~~~~----------de~~~-------~~l~~~~-~~~~~~~~~e~~~~~~~~G~a 133 (530) .++..++.|+|- |+. |. ..+.. -.++. ..+...+ ..||....+++.++...+|.| T Consensus 66 a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a 145 (532) T protein:vir:99 66 GLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNV 145 (532) T ss_pred HHHHHHHHHHHhhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcE Confidence 777777666642 221 21 11110 01122 2222333 468889999999999999999 Q ss_pred EEEEEecCC---CceEEEEecccceEEEEcCCCCceeEEEEEEEEeecc----------cccccceEEEEEEEcCCceEE Q lcl|NC_011308. 134 GIFARTTSE---DKLTFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSD----------ADNKFNSIGHADVWTDTEVWY 200 (530) Q Consensus 134 ~~~~y~d~~---g~~~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~----------~~~~~~~~~~~evyt~~~~~~ 200 (530) ..|+-.++. ...+|+.++-.+.+.--|..+....++|-+..-...- .....+.-..++||+. T Consensus 146 ~l~~~~~~~~~~~~~~f~~~pl~~y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~----- 220 (532) T protein:vir:99 146 LLYIPSTEQVEGQSNAPKLYKLHNFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTH----- 220 (532) T ss_pred eEEecccccccCcccceEEEEcCeEEEeeCCCCCeeeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEE----- Confidence 876654332 3446677776666666677887767666443221100 0001112223444331 Q ss_pred EeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHH Q lcl|NC_011308. 201 YVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIK 275 (530) Q Consensus 201 y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e 275 (530) ......... +.. .... ............+|...|++.++- ..+|.|--+ T Consensus 221 -v~~~~~~~~---------------~~~--------~~~~--~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~ 274 (532) T protein:vir:99 221 -VYRDPEAMV---------------FRS--------YQEI--DGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVE 274 (532) T ss_pred -EEecCCCCe---------------eEE--------EEee--cCceecccccccccccCCceeeeeeecCCCccccchHH Confidence 111110000 000 0000 000011111233466778777664 358999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHH Q lcl|NC_011308. 276 KVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDID 353 (530) Q Consensus 276 ~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L 353 (530) +..+-+-.++.+.-..........+|.+.+.-.+......+. ..+.+.+..+..+++..+... .+.......++.+ T Consensus 275 ~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~--~~~~g~~v~g~~~~i~~~~~~~~~~~~~~~~~i~~~ 352 (532) T protein:vir:99 275 EYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVA--KANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDI 352 (532) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCCceeccccccchhhhc--cCCCcceecCCcccceeeecccccchhHHHHHHHHH Confidence 999999999988888888888888887665321211111111 122344444444556655433 3566666677766 Q ss_pred HHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---ccccceee-EE Q lcl|NC_011308. 354 ELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAM-KAQKTEIALRKTLRWTADLVVEDIRRRGLG---DYSSTDIK-FD 428 (530) Q Consensus 354 ~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~-ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~---~~d~~~i~-i~ 428 (530) +..|-..-+.-.+.....+..++..+..+-.-+.+ .-....+.-.+.|.=+++-++.++...+.- .-+...+. ++ T Consensus 353 ~~rI~~af~~~~~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~iv~ 432 (532) T protein:vir:99 353 EKRLSYAFMLNSAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIAT 432 (532) T ss_pred HHHHHHHHhhhhcccCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhcccceee Confidence 66664322211121122233455444433222221 112222222233333333333444333321 11111122 22 Q ss_pred eCCCCCCC--HHHHHHHHHHHHhc-C----CCcHHHHHHhC----C-----CCCCHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_011308. 429 IEPYILAN--ELDLAMIDKTEAET-N----QIQINNLLAIA----P-----RIGDEETLKAICDTLDLDYEDVVKALEDQ 492 (530) Q Consensus 429 f~~~~P~n--~~e~a~~~~~~~~~-g----~iS~et~l~~~----~-----~vdd~~~e~~~~e~e~~e~~~~~~~~~~~ 492 (530) +...|=+. ...+++.+..+.+. + .+.-..++..+ + ++...++..++.++.+.+....+...+.+ T Consensus 433 ~is~Laraq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~GV~~~~i~r~~ee~~~~~~q~~~~~~~~~a~~~~~ 512 (532) T protein:vir:99 433 GLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMG 512 (532) T ss_pred cchHHHHHHHHHHHHHHHHHHHhhcchhhhhCCHHHHHHHHHHHhCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22112111 11111111111111 1 12222232221 1 23333333222222222221111111111 Q ss_pred ccccCCccccCCCCCCCCCCccCcCCCC Q lcl|NC_011308. 493 EVEELEPTVTPIIDPLTIEPQPEPLNID 520 (530) Q Consensus 493 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (530) . .... + ..+...++.-.++. T Consensus 513 ~-~~~~--~-----~~~~~~~~~~~~~~ 532 (532) T protein:vir:99 513 A-AGGQ--A-----AAAMMQQQAGMPTQ 532 (532) T ss_pred H-HHHH--h-----cchhHHhhcCCCCC Confidence 0 0000 0 00001111000110 No 240 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=66.93 E-value=0.26 Score=23.79 Aligned_cols=367 Identities=10% Similarity=0.008 Sum_probs=130.3 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |-+ +.+.+..+.+... ...+..... .....-+...-....|+..++=+.+- T Consensus 1 Mg~----------------------f~~lf~~~~~~~~------~~~~~~~~~-v~~~~~~~~~~v~~~i~~Ia~~iA~~ 51 (395) T protein:vir:95 1 MSI----------------------LEKIFKTRKDITY------MLDLDMIED-LSQQAYVKRLAIDSCIEFVARAVAQS 51 (395) T ss_pred Cch----------------------hhhhhccCccccc------cccchhccc-cchhhhhhhHHHHHHHHHHHHhhccc Confidence 222 1111111111000 000000000 00011112233455666666667777 Q ss_pred ceeeecCCc-chHHHHHHHHHHhhccH---HHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEE--EEcCCCC Q lcl|NC_011308. 91 GIDVKPTDH-DDQKLCYLIEEYYNEEF---QSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLP--VFDDYGT 164 (530) Q Consensus 91 pv~~~~~~~-~de~~~~~l~~~~~~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~--v~d~~~~ 164 (530) |+.+..... .+......|..= -|.+ ......+.......|.+|.++..+ +.+ ..+++..+-+ +++.. T Consensus 52 p~~~~~~~~~~~~~~~~ll~~~-PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~--~~~--~~~~~~~~~~~~~~~~~-- 124 (395) T protein:vir:95 52 HFKVLEGNRIQKNDVYYKLNIK-PNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS--KEL--LIADSFYREEYALYDDI-- 124 (395) T ss_pred eeEeccCCccccchHHHHHHhc-cCcCCCHHHHHHHHHHHHhhCCceEEEEecC--CCe--EecCCccceeEeecCcc-- Confidence 776533221 122222222210 0222 223344556666677776544332 222 2222222211 22111 Q ss_pred ceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccc Q lcl|NC_011308. 165 LQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEE 244 (530) Q Consensus 165 ~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (530) ++.+.... ..+...+..+.+.|++.... T Consensus 125 ------~~~~~~~~--------~~~~~~~~~~evih~~~~~~-------------------------------------- 152 (395) T protein:vir:95 125 ------FKDVTVKD--------YTYQRTFTMQEVIYLKYNNN-------------------------------------- 152 (395) T ss_pred ------eeEEEEcC--------ceeeeeeccccEEEEccCCC-------------------------------------- Confidence 00000000 00011233333333321100 Q ss_pred cccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeee--ecCCCCch--hhHHHHH Q lcl|NC_011308. 245 HEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVV--RGGTNSPV--DEIKKNI 320 (530) Q Consensus 245 ~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl--~g~~~~~~--~~~~~~~ 320 (530) .....|.|-++....+++. ..+.....+.+--+| .+...++. +.+...+ T Consensus 153 --------------------~~~~~G~spi~~~~~~~~~-------~~~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~ 205 (395) T protein:vir:95 153 --------------------KVTHFVESLFEDYGKIFGR-------MIGAQLKNYQIRGILKSASSAYDEKNIEKLQAFT 205 (395) T ss_pred --------------------CcccccchHHHHHHHHHHH-------HHHHHHhcCCCceEEEeCCCCCCHHHHHHHHHHH Confidence 0111344444443333332 222233333333333 23222221 1111111 Q ss_pred h---------hCcceecCCCCceeEEEecCC-----HHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhh Q lcl|NC_011308. 321 Q---------SKKIIQTKGEGGLDIQTVDIP-----YEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTL 385 (530) Q Consensus 321 ~---------~~~~i~~~~~~~~~~lt~~~~-----~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~ 385 (530) . ..+++.+++|.+++-++..-. ...+.+..+...+.|...-.+|.. -+ |+.|++ T Consensus 206 ~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~---~l~~~~sn~-------- 274 (395) T protein:vir:95 206 NKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPG---LIYGETADL-------- 274 (395) T ss_pred HHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHH---HhcCcccCH-------- Confidence 1 112333455545444432211 112344455666777777777742 22 222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc-ceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC Q lcl|NC_011308. 386 LAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSS-TDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR 464 (530) Q Consensus 386 l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~-~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~ 464 (530) ......++...|.-.+..|...++.+-...... ..+.+.++.-+-.|..+.++....+..+|+++...+++.+++ T Consensus 275 ----e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~ 350 (395) T protein:vir:95 275 ----EKNTLVFEKFCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGE 350 (395) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 122223344455555555544444321111110 123455555556788888888888999999999999888754 Q ss_pred --CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccc Q lcl|NC_011308. 465 --IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEE 526 (530) Q Consensus 465 --vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (530) ++++....-.+ ........... . ..+...+..+.+++.-.+++ T Consensus 351 ~p~~~g~~d~~~~-------~~n~~~~~~~~-----~-------~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:95 351 EPSDNPELDEYLI-------TKNYEKANSGE-----N-------DEKEKDENTLKGGDEDESGD 395 (395) T ss_pred CCCCCCCCceeee-------ccccccccccc-----c-------ccCcccccccCCCCCCCCCC Confidence 33321000000 00000000000 0 00000001111111111111 No 241 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=66.93 E-value=0.26 Score=23.79 Aligned_cols=367 Identities=10% Similarity=0.008 Sum_probs=130.3 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |-+ +.+.+..+.+... ...+..... .....-+...-....|+..++=+.+- T Consensus 1 Mg~----------------------f~~lf~~~~~~~~------~~~~~~~~~-v~~~~~~~~~~v~~~i~~Ia~~iA~~ 51 (395) T protein:vir:10 1 MSI----------------------LEKIFKTRKDITY------MLDLDMIED-LSQQAYVKRLAIDSCIEFVARAVAQS 51 (395) T ss_pred Cch----------------------hhhhhccCccccc------cccchhccc-cchhhhhhhHHHHHHHHHHHHhhccc Confidence 222 1111111111000 000000000 00011112233455666666667777 Q ss_pred ceeeecCCc-chHHHHHHHHHHhhccH---HHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEE--EEcCCCC Q lcl|NC_011308. 91 GIDVKPTDH-DDQKLCYLIEEYYNEEF---QSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLP--VFDDYGT 164 (530) Q Consensus 91 pv~~~~~~~-~de~~~~~l~~~~~~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~--v~d~~~~ 164 (530) |+.+..... .+......|..= -|.+ ......+.......|.+|.++..+ +.+ ..+++..+-+ +++.. T Consensus 52 p~~~~~~~~~~~~~~~~ll~~~-PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~--~~~--~~~~~~~~~~~~~~~~~-- 124 (395) T protein:vir:10 52 HFKVLEGNRIQKNDVYYKLNIK-PNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS--KEL--LIADSFYREEYALYDDI-- 124 (395) T ss_pred eeEeccCCccccchHHHHHHhc-cCcCCCHHHHHHHHHHHHhhCCceEEEEecC--CCe--EecCCccceeEeecCcc-- Confidence 776533221 122222222210 0222 223344556666677776544332 222 2222222211 22111 Q ss_pred ceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccc Q lcl|NC_011308. 165 LQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEE 244 (530) Q Consensus 165 ~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (530) ++.+.... ..+...+..+.+.|++.... T Consensus 125 ------~~~~~~~~--------~~~~~~~~~~evih~~~~~~-------------------------------------- 152 (395) T protein:vir:10 125 ------FKDVTVKD--------YTYQRTFTMQEVIYLKYNNN-------------------------------------- 152 (395) T ss_pred ------eeEEEEcC--------ceeeeeeccccEEEEccCCC-------------------------------------- Confidence 00000000 00011233333333321100 Q ss_pred cccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeee--ecCCCCch--hhHHHHH Q lcl|NC_011308. 245 HEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVV--RGGTNSPV--DEIKKNI 320 (530) Q Consensus 245 ~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl--~g~~~~~~--~~~~~~~ 320 (530) .....|.|-++....+++. ..+.....+.+--+| .+...++. +.+...+ T Consensus 153 --------------------~~~~~G~spi~~~~~~~~~-------~~~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~ 205 (395) T protein:vir:10 153 --------------------KVTHFVESLFEDYGKIFGR-------MIGAQLKNYQIRGILKSASSAYDEKNIEKLQAFT 205 (395) T ss_pred --------------------CcccccchHHHHHHHHHHH-------HHHHHHhcCCCceEEEeCCCCCCHHHHHHHHHHH Confidence 0111344444443333332 222233333333333 23222221 1111111 Q ss_pred h---------hCcceecCCCCceeEEEecCC-----HHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhh Q lcl|NC_011308. 321 Q---------SKKIIQTKGEGGLDIQTVDIP-----YEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTL 385 (530) Q Consensus 321 ~---------~~~~i~~~~~~~~~~lt~~~~-----~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~ 385 (530) . ..+++.+++|.+++-++..-. ...+.+..+...+.|...-.+|.. -+ |+.|++ T Consensus 206 ~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~---~l~~~~sn~-------- 274 (395) T protein:vir:10 206 NKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPG---LIYGETADL-------- 274 (395) T ss_pred HHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHH---HhcCcccCH-------- Confidence 1 112333455545444432211 112344455666777777777742 22 222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc-ceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC Q lcl|NC_011308. 386 LAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSS-TDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR 464 (530) Q Consensus 386 l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~-~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~ 464 (530) ......++...|.-.+..|...++.+-...... ..+.+.++.-+-.|..+.++....+..+|+++...+++.+++ T Consensus 275 ----e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~ 350 (395) T protein:vir:10 275 ----EKNTLVFEKFCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGE 350 (395) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 122223344455555555544444321111110 123455555556788888888888999999999999888754 Q ss_pred --CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccc Q lcl|NC_011308. 465 --IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEE 526 (530) Q Consensus 465 --vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (530) ++++....-.+ ........... . ..+...+..+.+++.-.+++ T Consensus 351 ~p~~~g~~d~~~~-------~~n~~~~~~~~-----~-------~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 351 EPSDNPELDEYLI-------TKNYEKANSGE-----N-------DEKEKDENTLKGGDEDESGD 395 (395) T ss_pred CCCCCCCCceeee-------ccccccccccc-----c-------ccCcccccccCCCCCCCCCC Confidence 33321000000 00000000000 0 00000001111111111111 No 242 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=66.93 E-value=0.26 Score=23.79 Aligned_cols=367 Identities=10% Similarity=0.008 Sum_probs=130.3 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |-+ +.+.+..+.+... ...+..... .....-+...-....|+..++=+.+- T Consensus 1 Mg~----------------------f~~lf~~~~~~~~------~~~~~~~~~-v~~~~~~~~~~v~~~i~~Ia~~iA~~ 51 (395) T protein:vir:10 1 MSI----------------------LEKIFKTRKDITY------MLDLDMIED-LSQQAYVKRLAIDSCIEFVARAVAQS 51 (395) T ss_pred Cch----------------------hhhhhccCccccc------cccchhccc-cchhhhhhhHHHHHHHHHHHHhhccc Confidence 222 1111111111000 000000000 00011112233455666666667777 Q ss_pred ceeeecCCc-chHHHHHHHHHHhhccH---HHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEE--EEcCCCC Q lcl|NC_011308. 91 GIDVKPTDH-DDQKLCYLIEEYYNEEF---QSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLP--VFDDYGT 164 (530) Q Consensus 91 pv~~~~~~~-~de~~~~~l~~~~~~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~--v~d~~~~ 164 (530) |+.+..... .+......|..= -|.+ ......+.......|.+|.++..+ +.+ ..+++..+-+ +++.. T Consensus 52 p~~~~~~~~~~~~~~~~ll~~~-PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~--~~~--~~~~~~~~~~~~~~~~~-- 124 (395) T protein:vir:10 52 HFKVLEGNRIQKNDVYYKLNIK-PNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS--KEL--LIADSFYREEYALYDDI-- 124 (395) T ss_pred eeEeccCCccccchHHHHHHhc-cCcCCCHHHHHHHHHHHHhhCCceEEEEecC--CCe--EecCCccceeEeecCcc-- Confidence 776533221 122222222210 0222 223344556666677776544332 222 2222222211 22111 Q ss_pred ceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccc Q lcl|NC_011308. 165 LQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEE 244 (530) Q Consensus 165 ~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (530) ++.+.... ..+...+..+.+.|++.... T Consensus 125 ------~~~~~~~~--------~~~~~~~~~~evih~~~~~~-------------------------------------- 152 (395) T protein:vir:10 125 ------FKDVTVKD--------YTYQRTFTMQEVIYLKYNNN-------------------------------------- 152 (395) T ss_pred ------eeEEEEcC--------ceeeeeeccccEEEEccCCC-------------------------------------- Confidence 00000000 00011233333333321100 Q ss_pred cccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeee--ecCCCCch--hhHHHHH Q lcl|NC_011308. 245 HEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVV--RGGTNSPV--DEIKKNI 320 (530) Q Consensus 245 ~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl--~g~~~~~~--~~~~~~~ 320 (530) .....|.|-++....+++. ..+.....+.+--+| .+...++. +.+...+ T Consensus 153 --------------------~~~~~G~spi~~~~~~~~~-------~~~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~ 205 (395) T protein:vir:10 153 --------------------KVTHFVESLFEDYGKIFGR-------MIGAQLKNYQIRGILKSASSAYDEKNIEKLQAFT 205 (395) T ss_pred --------------------CcccccchHHHHHHHHHHH-------HHHHHHhcCCCceEEEeCCCCCCHHHHHHHHHHH Confidence 0111344444443333332 222233333333333 23222221 1111111 Q ss_pred h---------hCcceecCCCCceeEEEecCC-----HHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhh Q lcl|NC_011308. 321 Q---------SKKIIQTKGEGGLDIQTVDIP-----YEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTL 385 (530) Q Consensus 321 ~---------~~~~i~~~~~~~~~~lt~~~~-----~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~ 385 (530) . ..+++.+++|.+++-++..-. ...+.+..+...+.|...-.+|.. -+ |+.|++ T Consensus 206 ~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~---~l~~~~sn~-------- 274 (395) T protein:vir:10 206 NKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPG---LIYGETADL-------- 274 (395) T ss_pred HHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHH---HhcCcccCH-------- Confidence 1 112333455545444432211 112344455666777777777742 22 222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc-ceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC Q lcl|NC_011308. 386 LAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSS-TDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR 464 (530) Q Consensus 386 l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~-~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~ 464 (530) ......++...|.-.+..|...++.+-...... ..+.+.++.-+-.|..+.++....+..+|+++...+++.+++ T Consensus 275 ----e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~ 350 (395) T protein:vir:10 275 ----EKNTLVFEKFCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGE 350 (395) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 122223344455555555544444321111110 123455555556788888888888999999999999888754 Q ss_pred --CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccc Q lcl|NC_011308. 465 --IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEE 526 (530) Q Consensus 465 --vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (530) ++++....-.+ ........... . ..+...+..+.+++.-.+++ T Consensus 351 ~p~~~g~~d~~~~-------~~n~~~~~~~~-----~-------~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 351 EPSDNPELDEYLI-------TKNYEKANSGE-----N-------DEKEKDENTLKGGDEDESGD 395 (395) T ss_pred CCCCCCCCceeee-------ccccccccccc-----c-------ccCcccccccCCCCCCCCCC Confidence 33321000000 00000000000 0 00000001111111111111 No 243 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=60.63 E-value=0.37 Score=22.96 Aligned_cols=426 Identities=10% Similarity=0.051 Sum_probs=154.0 Q ss_pred CCcccccCCcccHHHHHHHHHHHHH-HhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYI-RSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAEL 79 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~I 79 (530) .+....+-++ .+. ..+.+++|+.+-.+++-...|.. | T Consensus 38 ~~~~~~~~e~------------~~~~~~eLI~~YR~ma~~pEvd~Av~e------------------------------I 75 (533) T protein:vir:10 38 YYGYTVDFDG------------QVRNEYQLISRYREMVLQPECDSAVDD------------------------------I 75 (533) T ss_pred ccceeeeccc------------ccchHHHHHHHHHHHhhccchhhHHHH------------------------------h Confidence 1111111111 011 12345566666666666554322 1 Q ss_pred Hhhhh-hhhcccceeeecCCcc-----hHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecC----CCceEEE Q lcl|NC_011308. 80 VDQKT-QYLLANGIDVKPTDHD-----DQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTS----EDKLTFQ 148 (530) Q Consensus 80 vd~~~-~yl~G~pv~~~~~~~~-----de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~----~g~~~~~ 148 (530) |+-.+ .=....||.+..++.+ .+.+.+.+..+++ =+|+...++..+.+-+.|+-|.|.-+|. +|-.... T Consensus 76 Vneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~ELr 155 (533) T protein:vir:10 76 VNETICGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPDNPQGGLIELR 155 (533) T ss_pred hcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCCCccccceeee Confidence 11111 1122344554433211 2335555555553 3788889999999999999998887763 3556678 Q ss_pred EecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeee Q lcl|NC_011308. 149 TVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLA 228 (530) Q Consensus 149 ~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (530) .+||..+-+|.--..+.....+.+.... ........+-+|++.+.. + .+ ...+..+ . ....++ T Consensus 156 ~lDPr~i~~vr~i~~~~~~~~~~~~~~~-----~v~~~~~eyf~Ynp~g~~-~---~~--~~~vkI~----~-dAI~y~- 218 (533) T protein:vir:10 156 YIDPRKIRKINETEQKRPEQLRGLPLNQ-----QLSPKSAEYFLYDPKGLK-N---ST--TQGLKIA----P-DSICYV- 218 (533) T ss_pred eccccceeeeeeeeccCCCccceeecch-----hhhccceeeeeecccccc-c---cC--CCceecc----h-hheeee- Confidence 8999988665421111111111000000 000111112233332211 0 00 0000000 0 000000 Q ss_pred eecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHH----------------- Q lcl|NC_011308. 229 VADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFL----------------- 291 (530) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~----------------- 291 (530) +.+| +|. ++..-.|-++..+.....+- ++-|. T Consensus 219 ------------------------hSGl--~d~----~~~~i~syLhkAiKp~NQLk-m~EDAlVIYRitRAPeRRvFYI 267 (533) T protein:vir:10 219 ------------------------HSGI--MDL----NKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYI 267 (533) T ss_pred ------------------------eccc--eeC----CCCceeccchHhHHHHHhhH-HHHhhHHHHhhhccccceEEEE Confidence 0011 111 11111233333222111111 11111 Q ss_pred --------------HHHHHHhccceeeeecC-CC--CchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHH Q lcl|NC_011308. 292 --------------SNNLQDMAEAIYVVRGG-TN--SPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDI 352 (530) Q Consensus 292 --------------~n~~~~~~~~~lvl~g~-~~--~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~ 352 (530) .+.+..|.+- || ... .+ .+.-.++.-+....+--=+++.+-+.-|-+ .+.. .-.-+.= T Consensus 268 DVGnLPk~KAeqYlr~iM~k~KNk-lV-YDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLg-em~DV~Y 344 (533) T protein:vir:10 268 DVGNLPKNKAEQYLREVMGRYRNK-LV-YDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEITTLPGGQNLG-ELEDVKY 344 (533) T ss_pred ecCCCCchhHHHHHHHHHHhccce-EE-EeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcC-hHHHHHH Confidence 1111222221 11 110 00 011111111111111111122222322333 2222 1223455 Q ss_pred HHHHHHHHhcccC--CCccc---ccCCcHHHHH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc--ce Q lcl|NC_011308. 353 DELNIYRSGMGFN--SSAVG---DGNATNVVIK-SRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSS--TD 424 (530) Q Consensus 353 L~~~I~~~s~~p~--~~~~~---~gn~SGvAik-~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~--~~ 424 (530) .++.+|..-.+|- +..++ +|.+|.++.. ++|. --+.+.+..|..-|.++|+.=+-+=++....+|+- .. T Consensus 345 F~kKLY~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~---KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~ 421 (533) T protein:vir:10 345 FQKKLYKSLNVPGSRLETETTFNVGRAAEITRDEVKFQ---KFVARLRKRFSELFTDLLKTQLVLKGVISIEEWDQMKEH 421 (533) T ss_pred HHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhc Confidence 5667777778883 33332 2444433221 1111 22344444444444444332111111223345554 46 Q ss_pred eeEEeCCCCCCCHHHHHHHHHHH---------HhcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|NC_011308. 425 IKFDIEPYILANELDLAMIDKTE---------AETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVE 495 (530) Q Consensus 425 i~i~f~~~~P~n~~e~a~~~~~~---------~~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~ 495 (530) |.+.|.+.---.+...++++... .-+..+|++++.+.+=...|.+ +++++++-+++++..--++.+ T Consensus 422 I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDee-----i~~~~kqI~~E~k~~~~~~p~ 496 (533) T protein:vir:10 422 IQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVE-----MKEIDKQIESEMESGIIADPA 496 (533) T ss_pred ceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHH-----HHHHHHHHHHHHhCCCCCCCc Confidence 77888776554444444443221 1122469999998854444433 233333333333222211111 Q ss_pred cCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 496 ELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 496 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) + +-++...+...+..-.+.+..+...|+|.-| T Consensus 497 ~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 528 (533) T protein:vir:10 497 A---EMDPAMAAGDPDAGGAPAEEVAPEGPDPSDE 528 (533) T ss_pred c---hhhHHhcCCCCCcCCcccccCCCCCCCcchh Confidence 0 0000000000011111111222233444444 No 244 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=60.17 E-value=0.38 Score=22.90 Aligned_cols=338 Identities=9% Similarity=0.010 Sum_probs=129.6 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceee--cCchhhHHhhhhhhhc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKIS--HGFFAELVDQKTQYLL 88 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~--~n~~k~Ivd~~~~yl~ 88 (530) |-+..-+.. | .+.....+ ...-....+..+. .......|+..++=+. T Consensus 1 Mg~f~~~~~----~-----------------------~~~~~~~~----~~~~~~~~~~~~~~~~~~v~~~v~~IA~~iA 49 (378) T protein:vir:94 1 MNLFGKVVS----F-----------------------SRGKLNND----TQRVTAWQNEAVEYTSAFVTNIHNKIANEIT 49 (378) T ss_pred CCccccchh----c-----------------------ccccccCC----cceeeeeccchhHHHHHHHHHHHHHHHhhhh Confidence 211111000 0 00000000 0000001111111 1123445666666666 Q ss_pred ccceee-ecCCcch--HH----HHHHHHHHhh---ccH---HHHHHHHHHHHhhcCeEEEEE-EecCCCceEEEEecccc Q lcl|NC_011308. 89 ANGIDV-KPTDHDD--QK----LCYLIEEYYN---EEF---QSAIQELVEGSTIKGYEGIFA-RTTSEDKLTFQTVDALQ 154 (530) Q Consensus 89 G~pv~~-~~~~~~d--e~----~~~~l~~~~~---~~~---~~~~~e~~~~~~~~G~a~~~~-y~d~~g~~~~~~~~p~~ 154 (530) +-|+.+ .....+. +. ...-|.++++ |.. ......+...+..+|.||.+. +.+..|++.. T Consensus 50 ~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~------- 122 (378) T protein:vir:94 50 KVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELLD------- 122 (378) T ss_pred hCceeeEEEcccCcccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEEE------- Confidence 778763 2111110 00 1112333332 222 234456677889999999764 3333333211 Q ss_pred eEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccc Q lcl|NC_011308. 155 LLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVD 234 (530) Q Consensus 155 ~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (530) +|-..+ . . .|..+.+.|++ T Consensus 123 ---l~p~~~--------------------~--~----~~~~~diiH~~-------------------------------- 141 (378) T protein:vir:94 123 ---LLFADD--------------------K--K----EYKPEELVRLT-------------------------------- 141 (378) T ss_pred ---EEecCC--------------------e--e----EeeeeeeEEec-------------------------------- Confidence 110000 0 0 01122333332 Q ss_pred cceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCC-ch Q lcl|NC_011308. 235 EAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNS-PV 313 (530) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~-~~ 313 (530) +--.+......+..+..+++..++.. ....+|.+.|.-.+ .. T Consensus 142 -------------------------------~~~~~~~g~s~l~~~~~~i~~~~~~~------~~~gil~~~~~l~~~~~ 184 (378) T protein:vir:94 142 -------------------------------SPFYINEDTSILDNALASIQTKLEQG------KLRGLLKINAFLDIDNT 184 (378) T ss_pred -------------------------------CcCCccchhHHHHHHHHHHHHHHhcc------cccceeeeCCcCCHHHH Confidence 10011111223334444444433321 11223333332111 11 Q ss_pred h----hHHHHH-------hhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHH Q lcl|NC_011308. 314 D----EIKKNI-------QSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKS 381 (530) Q Consensus 314 ~----~~~~~~-------~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~ 381 (530) + .+...+ +.++++.+++|.+++-++....+... ...+.+.+.|...-.+|.. -+ |+.|.. T Consensus 185 ~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVP~~---~l~~~~se~---- 256 (378) T protein:vir:94 185 QEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNEN---ILLGTASQE---- 256 (378) T ss_pred HHHHHHHHHHHHHhhcccccccceecCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHH---HhcCChHHH---- Confidence 1 122222 12346667766666555544333222 3445667777777777641 11 222211 Q ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---------CccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCC Q lcl|NC_011308. 382 RYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGL---------GDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQ 452 (530) Q Consensus 382 ~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~---------~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~ 452 (530) ....++...|.-.+..|..-++.+-. .......+.+.+..-+-.|..+.++....+.++|+ T Consensus 257 ----------~~~~f~~~tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~ 326 (378) T protein:vir:94 257 ----------QQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPI 326 (378) T ss_pred ----------HHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhhhcCHHHHHHHHHHHHhCCC Confidence 11234555565555555554443211 11122335555556666788888888889999999 Q ss_pred CcHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCC Q lcl|NC_011308. 453 IQINNLLAIAPR--IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNI 519 (530) Q Consensus 453 iS~et~l~~~~~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (530) ++.-.+++.+++ +++.+.-+-.+ .+........ .++ + .+..++..++-|. T Consensus 327 ~T~NE~R~~~gl~p~~gGD~~~~~~---------n~~~~~~~~~--~~~-~-----~~~~~~~~e~~n~ 378 (378) T protein:vir:94 327 FTQNQLLVKMGEQPIEGGDVYIANL---------NAVAVKNLSD--LQG-S-----RKDVTSTDETNNQ 378 (378) T ss_pred cCHHHHHHHhCCCCCCCCCeeeecc---------cccccccchh--hcC-C-----cCCCCCCCCCCCC Confidence 999988888754 33322110000 0000000000 000 0 0000000000011 No 245 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=52.98 E-value=0.54 Score=22.05 Aligned_cols=449 Identities=11% Similarity=0.048 Sum_probs=159.0 Q ss_pred CCcccccCCcccHHHHHHHH----H-----HHHH-HhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcce Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTK----I-----DEYI-RSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIK 70 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~----i-----~~~~-~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~k 70 (530) ..-.-..+ .+....+.... + .++. ..+.+++|+.+..+++-...|.. T Consensus 18 ~S~vpp~~-~~~~~~i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~~pEVd~Av~e---------------------- 74 (564) T protein:vir:10 18 QSPVPPND-EASVSTVAGGYFGTYVDTSGGQNSRNEYELIRRYRDMSLHPEVDSAIDE---------------------- 74 (564) T ss_pred CCcccCCc-CCChhhhhccccceeeecccccchhhHHHHHHHHHHHhhccchhhHHHH---------------------- Confidence 00000000 01111110000 0 0111 12334556666555555543321 Q ss_pred eecCchhhHHhhhh-hhhcccceeeecCCcc-h----HHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecC-- Q lcl|NC_011308. 71 ISHGFFAELVDQKT-QYLLANGIDVKPTDHD-D----QKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTS-- 141 (530) Q Consensus 71 i~~n~~k~Ivd~~~-~yl~G~pv~~~~~~~~-d----e~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~-- 141 (530) ||+-.+ .=-...||.+...+.+ . +.+.+.++.+++ =+|+...++..+.+-+.|+-|.|.-+|. T Consensus 75 --------IVneaIv~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~ 146 (564) T protein:vir:10 75 --------IVNEFVVNDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDLDN 146 (564) T ss_pred --------hhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeCCC Confidence 111111 1122445544332222 1 335555555553 3788899999999999999998887762 Q ss_pred --CCceEEEEecccceEEEEcCCC----CceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhcc Q lcl|NC_011308. 142 --EDKLTFQTVDALQLLPVFDDYG----TLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLD 215 (530) Q Consensus 142 --~g~~~~~~~~p~~~~~v~d~~~----~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~ 215 (530) +|-..+..+||..+=.|+-.-. ....+.+.+. ..+...+-.+++.|...+ ........ T Consensus 147 pk~GI~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~~~--------------~~~~y~~~~Eyy~Ynp~~--~~g~~~~~ 210 (564) T protein:vir:10 147 PKKGILELRYIDSLKIRKVRQKLKDVDPNRKEIEKGTA--------------LQYDYGDFIEYYIYNPKG--FAGNIPMV 210 (564) T ss_pred hhhhhhhhhhhcccceeeeeeeccccccccceeeeeee--------------eeccccccccceeecccc--ccCccccc Confidence 3545577899998766662111 1111111111 011111111222221110 00000000 Q ss_pred ccccccccceeeeeecccccceecccccccccccccccccCCccceEEe----eCCcCCCCcHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 216 TTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDIL----YNNKLGISDIKKVKSIIDDYDLMNCFL 291 (530) Q Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~----~nn~~~~sd~e~v~~liDa~~~~~S~~ 291 (530) ... ...+....+ .-+-..|+.++. .++..-.|-++..+.....+- ++-|. T Consensus 211 ~~~--~~~~~~~~i-----------------------kI~~daI~y~hSGL~d~~~~~i~gyLhkAIKp~NQLk-mlEDA 264 (564) T protein:vir:10 211 TGS--MDWSNQEGI-----------------------KIASDAIAQSTSGLMDLNKKMTLSFLHKAIKSLNQLR-MIEDS 264 (564) T ss_pred ccc--cccccccce-----------------------eechhhcceecccceeCCCCceeccchhhhHhHHhhH-HHHhh Confidence 000 000000000 000111111110 011111222332211111111 11111 Q ss_pred H-------------------------------HHHHHhccceeeeecC-CC--CchhhHHHHHhhCcceecCCCCceeEE Q lcl|NC_011308. 292 S-------------------------------NNLQDMAEAIYVVRGG-TN--SPVDEIKKNIQSKKIIQTKGEGGLDIQ 337 (530) Q Consensus 292 ~-------------------------------n~~~~~~~~~lvl~g~-~~--~~~~~~~~~~~~~~~i~~~~~~~~~~l 337 (530) . +.+..|.+- || ... .+ .+.-.++.-+....+--=+++.+-+.- T Consensus 265 lVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNk-lV-YDa~TGevrddrk~msMlEDyWLPRReGgrgTEIt 342 (564) T protein:vir:10 265 LVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRNK-LV-YDGQTGEIRDDKKHMSMLEDFWLPRREGGRGTEIT 342 (564) T ss_pred HHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCce-EE-EeccCceecccchhhhhHhhhcccccCCCccccee Confidence 1 111122211 11 100 00 001111111111111111112222222 Q ss_pred Eec--CCHHHHHHHHHHHHHHHHHHhcccC--CCccc--c--cCCcHHHH-HHHHhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 338 TVD--IPYEARKAKMDIDELNIYRSGMGFN--SSAVG--D--GNATNVVI-KSRYTLLAMKAQKTEIALRKTLRWTADLV 408 (530) Q Consensus 338 t~~--~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~--~--gn~SGvAi-k~~~~~l~~ka~~ke~~f~~~l~~~~~~i 408 (530) |-+ .+... ..-+.=.++.+|..-.+|- +..++ | |..|-++. .++|. --+.+.+..|..-|.++|+.= T Consensus 343 TLpGgqnLge-m~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~EItRDEiKF~---KFI~RLR~rFs~lF~~~Lk~q 418 (564) T protein:vir:10 343 TLPGGQNLGE-LKDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTEILRDELKFT---KFIGRLRKRFAQLFHDILKTQ 418 (564) T ss_pred eccccCCcch-HHHHHHHHHHHHHHhCCCcccccCCCceeecccccchhHHHHHHH---HHHHHHHHHHHHHHHHHHHHh Confidence 322 22221 2234556677778788883 43332 2 44333322 11121 123444444444444443321 Q ss_pred HHHHHhcCCCcccc--ceeeEEeCCCCCCCHHHHHHHHHHH---------HhcCCCcHHHHHHhCCCCCCHH--HHHHHH Q lcl|NC_011308. 409 VEDIRRRGLGDYSS--TDIKFDIEPYILANELDLAMIDKTE---------AETNQIQINNLLAIAPRIGDEE--TLKAIC 475 (530) Q Consensus 409 ~~~l~~~~~~~~d~--~~i~i~f~~~~P~n~~e~a~~~~~~---------~~~g~iS~et~l~~~~~vdd~~--~e~~~~ 475 (530) +-+=++....+|+- ..|.+.|.+.---.+...++++... .-+..+|++++.+.+=...|.+ ++.++| T Consensus 419 LiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI 498 (564) T protein:vir:10 419 LILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEYIRRKILMQTENEFKEIDKQM 498 (564) T ss_pred hhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHH Confidence 11112223345554 4678888776555554444443221 1122469999998854444433 333333 Q ss_pred HHHHHHH------HHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 476 DTLDLDY------EDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 476 e~e~~e~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) ++|.++. +..+-...+++.....|+.++..+. ..+++++....+++.+.|-+. T Consensus 499 ~~E~k~~~~~~P~e~~~~~~~~~~~~~~~p~~~~~~~~--~~~~~~~~~~~~a~~~~~~~~ 557 (564) T protein:vir:10 499 KSDIESGLAIDPIQVNMLDDMEKQNQAFAPELQAAQDD--LAAEREIKKLNSAPKPPPSQQ 557 (564) T ss_pred HHHhhcCCCCCchhhhcCCCccCCCCcCCcchhhhccc--cccccChhhhccCCCCCCCCC Confidence 3333321 0000001111111222333333222 233333343344444444443 No 246 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=50.82 E-value=0.6 Score=21.81 Aligned_cols=373 Identities=10% Similarity=-0.009 Sum_probs=140.9 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |-+..++.+.++-- ++. +.++ ..+-.+........+ ..-.|. .-...-|+..++-+..- T Consensus 1 mg~~~~~~~~~~~~---~~~-----~~~~----~~~~~~~~~~~~~t~-------~~~~~~--~~v~~cv~~Ia~~ia~~ 59 (403) T protein:vir:10 1 MGFKSWITEKLNPG---QRI-----IRDM----EPVSHRTNRKPFTTG-------QAYSKI--EILNRTANMVIDSAAEC 59 (403) T ss_pred Ccchhhhhhccchh---hhh-----hhcc----cccccccCCcccccH-------HHHHHH--HHHHHHHHHHHHHHhhC Confidence 55444443322210 000 0111 001000000000000 000011 11222344444555566 Q ss_pred ceeeecCC----cchHHHHHHHHHHhh---ccH---HHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEc Q lcl|NC_011308. 91 GIDVKPTD----HDDQKLCYLIEEYYN---EEF---QSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFD 160 (530) Q Consensus 91 pv~~~~~~----~~de~~~~~l~~~~~---~~~---~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d 160 (530) |+++.... ..+.....-+..++. |.. ......+...+..+|.||.+. +.. ....++|..+-+..+ T Consensus 60 p~~v~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~--~~~---~l~~l~~~~~~v~~~ 134 (403) T protein:vir:10 60 SYTVGDKYNIVTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYW--DGT---SLYHVPAALMQVEAD 134 (403) T ss_pred ceeEeecccccccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEE--eCc---eeEeecCcceEEEEc Confidence 66543211 111111112333332 222 234455677888899998543 322 234566665544333 Q ss_pred CCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecc Q lcl|NC_011308. 161 DYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDE 240 (530) Q Consensus 161 ~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (530) ... . +..|.. . . . ..|..+.+.|++ T Consensus 135 ~~~-~---~~~~~~---~-----~-~----~~~~~~eiih~~-------------------------------------- 159 (403) T protein:vir:10 135 ANK-F---IKKFIF---N-----N-Q----INYRVDEIIFIK-------------------------------------- 159 (403) T ss_pred CCc-e---EEEEEe---c-----C-c----eeecccceEEec-------------------------------------- Confidence 211 1 111100 0 0 0 001112222221 Q ss_pred cccccccccccccccCCccceEEee-CCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-CCc--hhhH Q lcl|NC_011308. 241 GVEEHEGRQVLGRSYKSRFPFDILY-NNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT-NSP--VDEI 316 (530) Q Consensus 241 ~~~~~~~~~~~~~~~~~~iPiv~~~-nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~-~~~--~~~~ 316 (530) ...+++.. +.-.|.|.+......++....+..-.++.+.-.+.|-.+++... .++ .+.+ T Consensus 160 -----------------~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~ 222 (403) T protein:vir:10 160 -----------------DNSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDEILNKKLRERK 222 (403) T ss_pred -----------------ccccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHH Confidence 00001111 12246666665555555544444444444544445666666432 221 1222 Q ss_pred HHHHh--------hCcceecCCCCceeEEEecCC--HHHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhH Q lcl|NC_011308. 317 KKNIQ--------SKKIIQTKGEGGLDIQTVDIP--YEARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLL 386 (530) Q Consensus 317 ~~~~~--------~~~~i~~~~~~~~~~lt~~~~--~~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l 386 (530) +..+. .++++.+++|-+++.++...+ +.......+.....|...-.+|.. -+|...+..+ T Consensus 223 ~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~---~lg~~~~sn~------- 292 (403) T protein:vir:10 223 QEELQLDYNPSTGQSSVLILDGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQV---LLDGGNNANI------- 292 (403) T ss_pred HHHHHHHhCCcccCcceeecCCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHH---HcCCCCCcCH------- Confidence 22221 233556666666666654333 222334445667777777677742 1221111100 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCC--CCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC Q lcl|NC_011308. 387 AMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPY--ILANELDLAMIDKTEAETNQIQINNLLAIAPR 464 (530) Q Consensus 387 ~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~--~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~ 464 (530) ....+..+...|.-.++.|..-++.+-. ..+.+.|+.- +-.|....++....+...|+++...+++.+++ T Consensus 293 ---e~~~~~f~~~tl~P~~~~ie~~l~~~L~-----~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl 364 (403) T protein:vir:10 293 ---RPNIELFYYMTIIPMLNKLTSSLTFFFG-----YKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNL 364 (403) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHhcC-----ceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 1112333344444444444444433221 1233333322 33366666777778899999999999988754 Q ss_pred --CCCHHHHHHHHHHHHHHHHHHHHhhhc-ccccc-CCccccCCCCCCCCCCccCcCCCCcccccc Q lcl|NC_011308. 465 --IGDEETLKAICDTLDLDYEDVVKALED-QEVEE-LEPTVTPIIDPLTIEPQPEPLNIDPVIEEE 526 (530) Q Consensus 465 --vdd~~~e~~~~e~e~~e~~~~~~~~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (530) ++++. .+.....+.- +.... ..+++. .+ +. .|+.+ T Consensus 365 ~pi~~~~------------~d~~~~p~n~~~~~~~~~~~e~~--------~~---~~----~~~g~ 403 (403) T protein:vir:10 365 EPLDDEQ------------MNKIRIPANVAGSATGVSGQEGG--------RP---KG----STEGD 403 (403) T ss_pred CCCCccc------------ccccccccccccccccCCCCcCC--------CC---CC----CcCCC Confidence 32211 0111111110 00000 000000 00 00 11111 No 247 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=47.08 E-value=0.71 Score=21.39 Aligned_cols=412 Identities=8% Similarity=0.016 Sum_probs=148.2 Q ss_pred CCcccc---cC---------CcccHHHHHHHHHH---HHHHhhhHHHHHHHHHHhcccchhhhccccccccccccccccc Q lcl|NC_011308. 1 MTNTLL---TT---------APDRLGTILSTKID---EYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDN 65 (530) Q Consensus 1 ~~~~~~---~~---------~~~~~~~~i~~~i~---~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~ 65 (530) ...... +. .+.....+....+. .+.+++.+++|+.+-.+++-...|.. T Consensus 21 ~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~e----------------- 83 (511) T protein:vir:56 21 RSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPVKELIKSYRALAEYHEVDDAIQE----------------- 83 (511) T ss_pred ccccCCCCCCCceEEecccccceecceeccccccccCccchHHHHHHHHHHhhccchhhHHHH----------------- Confidence 000000 00 00000000000000 01122345566666666666554322 Q ss_pred CCcceeecCchhhHHhhhh-hhhcccceeeecCCcc-----hHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEE Q lcl|NC_011308. 66 ASNIKISHGFFAELVDQKT-QYLLANGIDVKPTDHD-----DQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFAR 138 (530) Q Consensus 66 ~~n~ki~~n~~k~Ivd~~~-~yl~G~pv~~~~~~~~-----de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y 138 (530) ||+-.+ .=-...||.+..++.+ .+.+.+.++.+++ =+|+...++..+.+-+.|+-|.|.- T Consensus 84 -------------Ivne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHki 150 (511) T protein:vir:56 84 -------------IVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKI 150 (511) T ss_pred -------------hhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEE Confidence 111111 1122445555443222 2335555555553 3788889999999999999998887 Q ss_pred ecCC-CceEEEEecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhcccc Q lcl|NC_011308. 139 TTSE-DKLTFQTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTT 217 (530) Q Consensus 139 ~d~~-g~~~~~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~ 217 (530) .|.+ |-.....+||..+-+|..--.+.... ......+...-+|++................ T Consensus 151 id~k~GI~eLr~lDPr~i~~vr~i~~~~~~~------------~~v~~~~~ey~~Y~~~~~~~~~~~~~~~~~~------ 212 (511) T protein:vir:56 151 LDKDNNIIELRPLNPMKMELVREIQKETIDG------------VEVVKGTLEYYVYKQSDYKMPSWMSATNRAQ------ 212 (511) T ss_pred eccccceeehhhcCcccchhhhhhhcccccc------------cccccceeeeeEecCCCcccCcccccccccc------ Confidence 7754 54556778998875554311111000 0001111122233332211110000000000 Q ss_pred ccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHH------ Q lcl|NC_011308. 218 VNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFL------ 291 (530) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~------ 291 (530) ..+..... ++.. .+.++ +- +..++....|-++..+.....+- ++-|. T Consensus 213 ------~~vkI~~d----aI~y------------~hSGL--~d--~~~~~g~i~syLhkAiKp~NQLk-m~EDAlVIYRi 265 (511) T protein:vir:56 213 ------TSFRIPKD----AIVF------------AHSGL--MR--GCADDPYIIGYLDRAIKPANQLK-MLEDALVIYRL 265 (511) T ss_pred ------cceeechh----heee------------ecccc--ee--ccCCCCeeeccchhhhHHHHhhH-HHHhhHHHHhh Confidence 00000000 0000 00000 00 00122223333443222222211 11111 Q ss_pred -------------------------HHHHHHhccceee--eecCCCCchhhHHHHHhhCcceecCCCCceeEEEec--CC Q lcl|NC_011308. 292 -------------------------SNNLQDMAEAIYV--VRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IP 342 (530) Q Consensus 292 -------------------------~n~~~~~~~~~lv--l~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~ 342 (530) .+.+..|.+-+.- -+| ...+.-.++.-+....+--=+++.+-+.-|-+ .+ T Consensus 266 tRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TG-ev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqn 344 (511) T protein:vir:56 266 ARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTG-QVKNTTNAMSMLEDYYLPRREGSKGTEVSTLPGGQS 344 (511) T ss_pred hccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCc-eeccchhhhhhHhhhcccccCCCCccceeeccccCC Confidence 1111222221110 001 00111111111111111111122222322333 22 Q ss_pred HHHHHHHHHHHHHHHHHHhcccC--CCcc----cc--cCCcHHHHH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 343 YEARKAKMDIDELNIYRSGMGFN--SSAV----GD--GNATNVVIK-SRYTLLAMKAQKTEIALRKTLRWTADLVVEDIR 413 (530) Q Consensus 343 ~~~~e~~ld~L~~~I~~~s~~p~--~~~~----~~--gn~SGvAik-~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~ 413 (530) .. ...-+.=.++.+|..-.+|- +..+ +| |.+|.++.. ++|. --+.+.+..|..-+.++|+.=+-+=+ T Consensus 345 lg-em~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiKF~---KFI~RLR~rFs~lF~~~Lk~qLilKg 420 (511) T protein:vir:56 345 LG-DIEDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELKFT---KFVKRLQTKFETVITDPLKHQLIVNN 420 (511) T ss_pred cC-hHHHHHHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhcc Confidence 22 22334556677788888883 3322 12 333322221 1111 22344444444444444332111111 Q ss_pred hcCCCcccc--ceeeEEeCCCCCCCHHHHHHHHHHHH-----h----cCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_011308. 414 RRGLGDYSS--TDIKFDIEPYILANELDLAMIDKTEA-----E----TNQIQINNLLAIAPRIGDEETLKAICDTLDLDY 482 (530) Q Consensus 414 ~~~~~~~d~--~~i~i~f~~~~P~n~~e~a~~~~~~~-----~----~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~ 482 (530) +....+|+- ..|.+.|.+.---.+...++++.... . +..+|.+++.+.+=...|.+ +++++++- T Consensus 421 iit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~ILr~tDee-----i~~~~k~I 495 (511) T protein:vir:56 421 IITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHKYIQKNILRLSDDQ-----ITAMQSEI 495 (511) T ss_pred CCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccchHHHHHHHhccCHHH-----HHHHHHHH Confidence 222345554 46778887765555544444433221 1 22469999998854444433 33333333 Q ss_pred HHHHHhhhccc-cccC Q lcl|NC_011308. 483 EDVVKALEDQE-VEEL 497 (530) Q Consensus 483 ~~~~~~~~~~~-~~~~ 497 (530) +++++..-.+. +++- T Consensus 496 ~~E~k~~~~~~~e~~f 511 (511) T protein:vir:56 496 DEEETNPRFQQDDQGF 511 (511) T ss_pred HHhhcCCCCCCcccCC Confidence 33332222111 1111 No 248 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=38.20 E-value=1.1 Score=20.40 Aligned_cols=354 Identities=10% Similarity=0.063 Sum_probs=140.4 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |- ++..+..+ ++. ...+ |.+ .++... ....-+...-....|+..++=+.+- T Consensus 1 Mg---~f~~~f~~--~~~-~~~~------~~~--~~~~~~---------------~~~~a~~~~~v~~~i~~ia~~ia~~ 51 (385) T protein:vir:95 1 MG---LFDSVFKR--HSE-LSWM------YDL--EFLQDK---------------SKKAYLKQIALNTVVEMVARTISQS 51 (385) T ss_pred Cc---hhhhhhcc--Ccc-cccc------cch--hhhhcc---------------chhhhhhhHHHHHHHHHHHHHHccc Confidence 22 22221111 000 0000 000 000000 0000011223445677777777778 Q ss_pred ceeeecCCc-chHHHHHHHHHHhhcc---HHHHHHHHHHHHhhcCeEEEEEEecCCCceEEE--EecccceEEEEcCCCC Q lcl|NC_011308. 91 GIDVKPTDH-DDQKLCYLIEEYYNEE---FQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQ--TVDALQLLPVFDDYGT 164 (530) Q Consensus 91 pv~~~~~~~-~de~~~~~l~~~~~~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~--~~~p~~~~~v~d~~~~ 164 (530) |+.+.-... .+......|+.= -|. -......+..+...+|.||.+. +.+|.+... .+.|..+ .++.. T Consensus 52 p~~~~~~~~~~~~~l~~lL~~~-PN~~~t~~~f~~~~~~~l~l~Gna~i~~--~~~~~~~~~~~~~~~~~~-~~~~~--- 124 (385) T protein:vir:95 52 EFRVMKNNTKEKGTLYYLLNVR-PNRNQNAVDFWQKFIFKLIMDNEVLVVK--NDEGHFFVADDFEKEDEL-GLYSH--- 124 (385) T ss_pred ceeeeecCccccchHHHHHhcc-cCcCCCHHHHHHHHHHHHhhcCceEEEE--ecCCCeeecccccccccc-ccccc--- Confidence 887632221 122233333210 022 2334566778888999998544 444432110 0111110 00000 Q ss_pred ceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccc Q lcl|NC_011308. 165 LQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEE 244 (530) Q Consensus 165 ~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (530) .++.+... .......+..+.+.|++.... T Consensus 125 -----~~~~~~~~--------~~~~~~~~~~~eiih~~~~~~-------------------------------------- 153 (385) T protein:vir:95 125 -----RFTNVLVN--------DFEFKRVFTMDDVIYLKYNNQ-------------------------------------- 153 (385) T ss_pred -----cceeeeec--------ccceeeeeccccEEEecCCCC-------------------------------------- Confidence 01111000 000111223333333321000 Q ss_pred cccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhc--cceeeeecCCC-Cch--hhHHHH Q lcl|NC_011308. 245 HEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMA--EAIYVVRGGTN-SPV--DEIKKN 319 (530) Q Consensus 245 ~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~--~~~lvl~g~~~-~~~--~~~~~~ 319 (530) .....|.|.++.....+ +...+...... .+++++.+... ++. +.++.. T Consensus 154 --------------------~~~~~G~s~~~~~~~~i-------~~~~~~~~~~~~~~g~l~~~~~~~~~~e~~~~~~~~ 206 (385) T protein:vir:95 154 --------------------KLDAFSLGLFEDYGEIF-------GRMIDLQMLNNQIRGILKVDATKFYNKEKQKELQAY 206 (385) T ss_pred --------------------CcccccchHHHHHHHHH-------HHHHHHHHhcCCCceEEEeCCccCCCHHHHHHHHHH Confidence 00012444443333322 22222222223 34444443222 111 111111 Q ss_pred H---------hhCcceecCCCCceeEEEecC------CHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHH Q lcl|NC_011308. 320 I---------QSKKIIQTKGEGGLDIQTVDI------PYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRY 383 (530) Q Consensus 320 ~---------~~~~~i~~~~~~~~~~lt~~~------~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~ 383 (530) + ...+++.+++|.+++-++... .+..+....+.....|...-.+|.. -+ |+-|. T Consensus 207 ~~~~~~g~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~---~l~~~~sn------- 276 (385) T protein:vir:95 207 IDTLFDAFQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPS---LVLGEMAD------- 276 (385) T ss_pred HHHHhhhhhhcCCceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHH---HhcCCCcC------- Confidence 1 123356677777666665322 1334445566677777777777742 12 22221 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc--cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHh Q lcl|NC_011308. 384 TLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYS--STDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAI 461 (530) Q Consensus 384 ~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d--~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~ 461 (530) .......++...|.-+++.|...++.+-..... ...+.+.+..-+..|..+.++....+..+|+++...+++. T Consensus 277 -----~e~~~~~~~~~~l~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~ 351 (385) T protein:vir:95 277 -----LEKTIESYLQFCINPLLRKIEAELNSKFFYQDEYLNDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIM 351 (385) T ss_pred -----HHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhcccceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 112344556666766666666666543222111 1235555556666788888888889999999999999988 Q ss_pred CCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccc Q lcl|NC_011308. 462 APR--IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEE 526 (530) Q Consensus 462 ~~~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (530) +++ ++++... .......-..-+ +.++.+ .++| T Consensus 352 ~g~~p~~~~~gd------------~~~~~~n~~~~~--~~kgge-------------------~~~e 385 (385) T protein:vir:95 352 TGEEPADDPELD------------KFIITKNLQSAD--AFKGGE-------------------SNEE 385 (385) T ss_pred hCCCCCCCCCCc------------eeeecccceecc--cccCCC-------------------CCCC Confidence 764 2221100 000000000000 000000 0000 No 249 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=35.02 E-value=1.3 Score=20.04 Aligned_cols=383 Identities=8% Similarity=-0.011 Sum_probs=141.5 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccc-hhhhcccccccccccccccccCCccee--ecCchhhHHhhhhhhh Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDN-DIENTRIMWMNDHGDIVEDDNASNIKI--SHGFFAELVDQKTQYL 87 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~-~I~~r~~~~~~~~~~~~~~~~~~n~ki--~~n~~k~Ivd~~~~yl 87 (530) |-+-+.|.. +.... -.+.. .+..-.. ....+.. .......+ .++.....|+..++=+ T Consensus 1 Mg~~~~~~~---------~~~~~------~~~~~~~~~~~~~--~~~~~~~---~~~~~~~~~~~~~~v~~~i~~ia~~i 60 (423) T protein:vir:81 1 MGFLQKLGL---------APSVV------ATPEPIELVGPIF--ESLKLST---KNMTVEQIWEDQPHLRTVTTFIARNV 60 (423) T ss_pred CchhHhhcc---------ccccc------cCccccccccccc--ccccccc---chhhHHHHHHhhhHHHHHHHHHHHhH Confidence 222111100 00000 00000 0000000 0000000 00000000 1233445677777777 Q ss_pred cccceee-ecCCcch-HHH-HHHHHHHhh--cc---HHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEec---ccceE Q lcl|NC_011308. 88 LANGIDV-KPTDHDD-QKL-CYLIEEYYN--EE---FQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVD---ALQLL 156 (530) Q Consensus 88 ~G~pv~~-~~~~~~d-e~~-~~~l~~~~~--~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~---p~~~~ 156 (530) .+-|+++ .-..++. +.. ..-+..++. |. ..+....+..+...+|.||.++..|..+......+- +..+. T Consensus 61 a~lp~~~~~~~~dg~~~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~~v~ 140 (423) T protein:vir:81 61 ASLQLQAFERVEDGGRERVREGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVSWVQ 140 (423) T ss_pred hhCceEEEEEecCCceeeeccchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecccceee Confidence 7778764 2211111 111 111222322 22 233445667788899999988877754433333333 32222 Q ss_pred EEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccc Q lcl|NC_011308. 157 PVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEA 236 (530) Q Consensus 157 ~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (530) +.....+......+++.. .. ..+. ...+..+.+.|++.... T Consensus 141 ~~~~~~~~~~~~Y~~~~~---~~--~~g~----~~~~~~~evih~r~~~~------------------------------ 181 (423) T protein:vir:81 141 RRAYKDGWGSLDYIIIES---GD--NDGR----SVKVPGERVIHRHGYNP------------------------------ 181 (423) T ss_pred eeeccCCCcceEEEEEEe---cC--CCce----EEEEcccceEEecCCCC------------------------------ Confidence 211111111111111100 00 0000 01122333333321000 Q ss_pred eecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC------C Q lcl|NC_011308. 237 ILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGT------N 310 (530) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~------~ 310 (530) - ..-.|.|.+......++....+..-..+.+.-.+.|-.+|+-.. . T Consensus 182 -------------------~---------~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l 233 (423) T protein:vir:81 182 -------------------K---------TMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKW 233 (423) T ss_pred -------------------C---------CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccC Confidence 0 00136666665555544444433333344444445655554211 1 Q ss_pred Cc--hhhHHHHHh---------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccC--CCcccccCCcHH Q lcl|NC_011308. 311 SP--VDEIKKNIQ---------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFN--SSAVGDGNATNV 377 (530) Q Consensus 311 ~~--~~~~~~~~~---------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~--~~~~~~gn~SGv 377 (530) ++ -+.++..++ .++++.+++|.++.-++....+...-.........|...-.+|. +++..-++-|+ T Consensus 234 ~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn- 312 (423) T protein:vir:81 234 DAESRTRFMANLRASFSPKSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSN- 312 (423) T ss_pred CHHHHHHHHHHHHHHhccccccCCcceecCCCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCccc- Confidence 11 112222221 23466677666666555433333333344566777777777774 22211122122 Q ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--ccccceeeEEeCCC--CCCCHHHHHHHHHHHH-hcCC Q lcl|NC_011308. 378 VIKSRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLG--DYSSTDIKFDIEPY--ILANELDLAMIDKTEA-ETNQ 452 (530) Q Consensus 378 Aik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~--~~d~~~i~i~f~~~--~P~n~~e~a~~~~~~~-~~g~ 452 (530) ++ .....++...|.-.+..|..-++.+-.. ..+.....+.|... +-.|..+.++...++. +.|+ T Consensus 313 -~e----------~~~~~f~~~~L~P~~~~ie~~l~~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~ 381 (423) T protein:vir:81 313 -VR----------EFRKALYGDNLGSWIRIIQDVMNLFLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAW 381 (423) T ss_pred -HH----------HHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCC Confidence 11 1122233334444444444444332211 12233344555433 4567777776666554 5688 Q ss_pred CcHHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCccc Q lcl|NC_011308. 453 IQINNLLAIAPR--IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVI 523 (530) Q Consensus 453 iS~et~l~~~~~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (530) ++...+++.+++ +++-+ +....+.- .+-+ ++++.++...| T Consensus 382 ~T~NE~R~~~gl~p~~gGD--------------~~~~p~n~------~~~~-----------~~~~~~~~~~t 423 (423) T protein:vir:81 382 MTINEVRAMDNLPSIDGGD--------------DLARPLNT------EFGD-----------SEDAPGEEVET 423 (423) T ss_pred cCHHHHHHHhCCCCCCCcc--------------eeeccccc------ccCc-----------cCCCCCCCCCC Confidence 898888887654 22110 00100000 0000 00000111111 No 250 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=34.72 E-value=1.3 Score=20.01 Aligned_cols=430 Identities=10% Similarity=-0.003 Sum_probs=140.5 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccc----hhhhcccccccccccccccccCCcceeecCch Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDN----DIENTRIMWMNDHGDIVEDDNASNIKISHGFF 76 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~----~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~ 76 (530) |...+--..|.....- ..|..-.+ +-....-....||-|.- +..++.+... ...| -+.+=. T Consensus 20 ~~~~~~~~~p~~~dG~--s~i~~~~~-~~~~~~~~~~~~~gg~~~n~~eLI~~YR~ma---------~~~p---EVd~Ai 84 (533) T protein:vir:58 20 LSPMYGMGAPHGAGGS--SMIPINMY-HPFATAGYASRFYGGIEFNRFFLYDMYDRMD---------YTDP---LISTVL 84 (533) T ss_pred hchhhcccCccCCCCC--ccccCCCC-cchhhhhhhhhhhccccccHHHHHHHHHHhh---------ccCc---chhhHH Confidence 1111100111000000 00000000 00000111122222210 0000000000 0000 001112 Q ss_pred hhHHhhh-hhhhcccceeeecCCcc-hHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeEEEEEEec--CCCceEEEEecc Q lcl|NC_011308. 77 AELVDQK-TQYLLANGIDVKPTDHD-DQKLCYLIEEYYNEEFQSAIQELVEGSTIKGYEGIFARTT--SEDKLTFQTVDA 152 (530) Q Consensus 77 k~Ivd~~-~~yl~G~pv~~~~~~~~-de~~~~~l~~~~~~~~~~~~~e~~~~~~~~G~a~~~~y~d--~~g~~~~~~~~p 152 (530) ..||+-. +..-...||.+...+.+ .+++.+.|..+ -+|+...++..+.+-+.|+.|.|.-.+ +.|-..+..+|| T Consensus 85 deIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~~l--ldf~~~~~~~fR~WYVDGriy~Hkiik~~k~GI~elr~lDP 162 (533) T protein:vir:58 85 DIIADECTIPNENGNIVDVVTKDIELAKAILSYLDYV--INIEKNAYPIIRNMIKYGDMFLHILEKGSDGTIEKFQVVSP 162 (533) T ss_pred HhhhceeeEecCCCceeEeecccccccHHHHHHHHHH--hcchhhhhHHHHhhhhcceeEEEeccCCcccchhhheecCC Confidence 2233322 23345677776544333 34444555444 358889999999999999999888543 334446788999 Q ss_pred cceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecc Q lcl|NC_011308. 153 LQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADG 232 (530) Q Consensus 153 ~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (530) ..+=+|++--.+. .+ =+|++....... ..... .+....+.+ T Consensus 163 r~i~~vr~~~t~~----------------------ey-yvy~~~~~~~~s-----~~~~~----kI~~daI~y------- 203 (533) T protein:vir:58 163 YIFSKRYNPETDT----------------------WY-YVITDVYRNVVS-----GYFNE----DIPEEDVIH------- 203 (533) T ss_pred eeeEEEEeeccce----------------------EE-Eeeccccccccc-----Ccccc----ccchhheee------- Confidence 9887666432211 11 133333211110 00000 000000000 Q ss_pred cccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccc----eeee--e Q lcl|NC_011308. 233 VDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEA----IYVV--R 306 (530) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~----~lvl--~ 306 (530) -.+++ +.. +.....|-++..+.....+-. +-|.. .+...+.| ++-+ - T Consensus 204 -------------------~~SGl--~d~----~~~~iisyLhkAiKp~NQLkm-iEDAl-VIYRisRAPeRRvFYIDVG 256 (533) T protein:vir:58 204 -------------------FSHKI--DTN----FFPYGRSYLESARAIWNQLRL-MEDAL-MLYRVVRSVDRRVFYVDVG 256 (533) T ss_pred -------------------eeecc--ccC----CCCceehhhhHHHHHHHHHHH-HHHHH-HHHhhcCChhheEEEEeec Confidence 00111 000 112233445443222222211 11111 11112222 1111 1 Q ss_pred cCCCCchhhHHHH------------------------------HhhCcceecCCCCceeEEEecCCHHHHHHHHHHHHHH Q lcl|NC_011308. 307 GGTNSPVDEIKKN------------------------------IQSKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELN 356 (530) Q Consensus 307 g~~~~~~~~~~~~------------------------------~~~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~ 356 (530) +..-..-.+.+.+ +....+-.=+++.+-+.-|-+...-+.-.-+.=+++. T Consensus 257 Nlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLpGg~lgemeDV~YF~kk 336 (533) T protein:vir:58 257 NVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQGSKVDLAEDVEYMLNR 336 (533) T ss_pred CCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhcccccCCCccceeeecCCCCCCcHHHHHHHHHH Confidence 1000000111111 1111111001122223333332212333445667788 Q ss_pred HHHHhcccC--CC-cccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCCccccceeeEEeCCC Q lcl|NC_011308. 357 IYRSGMGFN--SS-AVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVVE-DIRRRGLGDYSSTDIKFDIEPY 432 (530) Q Consensus 357 I~~~s~~p~--~~-~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~-~l~~~~~~~~d~~~i~i~f~~~ 432 (530) +|..-.+|- +. +.+||.+|-++. =..|-..|-..|+.++.-++. -|-.+ +.....+.++.|.+. T Consensus 337 Ly~ALnVP~sRl~~e~~fgr~~eItR----------DEiKF~KFI~rLR~rF~~ll~~qLilk--~iit~eew~~~f~~D 404 (533) T protein:vir:58 337 LISALKVPKAFIGYEGDVNAKNTLAT----------QDIKFNNTIKRIQGFFVEELERMVRMN--KEFADQDFRLVMNRS 404 (533) T ss_pred HHHHhCCCeeecCCCCCCccchhhhH----------HHHHHHHHHHHHHHHHHHHHhcccccc--cCcchhheeeeeecc Confidence 888888884 32 334454433221 111222222233332221111 11122 223344556777766 Q ss_pred CCCCHHHHHHHHH-----HHHhcCCCcHHHHHHhC-CCCCCHHHHHHHHHHHHHHHHHHHHhhhcccc-ccCCcc--ccC Q lcl|NC_011308. 433 ILANELDLAMIDK-----TEAETNQIQINNLLAIA-PRIGDEETLKAICDTLDLDYEDVVKALEDQEV-EELEPT--VTP 503 (530) Q Consensus 433 ~P~n~~e~a~~~~-----~~~~~g~iS~et~l~~~-~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~-~~~~~~--~~~ 503 (530) ---.+...++++. .....+.+++.++.+.+ -..||...+.+.+++|. ....+...+. .+..+. ... T Consensus 405 n~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~tdei~~q~e~ie~E~-----~~~~~~~~~~~~e~~~~~~~~~ 479 (533) T protein:vir:58 405 NSIVEGERFAVIEQRIGIAERLKGWVREDWIYSNILQIPYDLKPQEEVAEAAG-----GGGLFDTGGFGEETTPADFLGE 479 (533) T ss_pred chHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCChhhhHHHHHHHHhh-----cCCCCCCCCcccccCCcccCcc Confidence 5444444344332 23344678888888774 33333222212222221 1111111110 000000 000 Q ss_pred CCCCCCCCCccCcCC--------------------------CCcccc-cccCCC Q lcl|NC_011308. 504 IIDPLTIEPQPEPLN--------------------------IDPVIE-EEPVQE 530 (530) Q Consensus 504 ~~~~~~~~~~~~~~~--------------------------~~~~~~-~~~~~~ 530 (530) ..+|...+..+.... +++..+ +=|..+ T Consensus 480 ~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g~~~~~~~~p~~~ 533 (533) T protein:vir:58 480 RGSPIESPRGRTEFDFGTEGGEELGGELNLGGAFEEFEEETGGGEEELPFPEEE 533 (533) T ss_pred ccCcccCCCChhhHhcccCCcccccccccccccchhhhhhcCCcccCCCCCCCC Confidence 001111111111100 000000 011111 No 251 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=34.41 E-value=1.3 Score=19.97 Aligned_cols=395 Identities=10% Similarity=-0.032 Sum_probs=148.4 Q ss_pred hhhcccccccccccccccc-cCC-cce--eecCchhhHHhhhhhhhcccceeeecCCcchHHHHHHHHHHhh---ccH-- Q lcl|NC_011308. 46 IENTRIMWMNDHGDIVEDD-NAS-NIK--ISHGFFAELVDQKTQYLLANGIDVKPTDHDDQKLCYLIEEYYN---EEF-- 116 (530) Q Consensus 46 I~~r~~~~~~~~~~~~~~~-~~~-n~k--i~~n~~k~Ivd~~~~yl~G~pv~~~~~~~~de~~~~~l~~~~~---~~~-- 116 (530) +.. .. .....+..+-.. ..+ ..+ ....-...-|+..++=+.+-|+.+.-.+..-... .-+-.++. |.. T Consensus 1 ~~~-~~-~~~g~~~~~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~~~~~-~~l~~lL~~~PN~~~t 77 (723) T protein:vir:94 1 MTT-FP-SGAGGWNAWSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDGELDEL-HPLSQLWNVMPNRAMP 77 (723) T ss_pred Ccc-cc-cCCCccccccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCCccchh-hHHHHHHhhCCCCCCC Confidence 100 00 000000000000 000 000 0111123345555555556677764322211111 12223332 222 Q ss_pred -HHHHHHHHHHHhhcCeEEEEEEecC---CCce-EEEEecccceEEEEcCCCCceeE--EEEEEEEeecccccccceEEE Q lcl|NC_011308. 117 -QSAIQELVEGSTIKGYEGIFARTTS---EDKL-TFQTVDALQLLPVFDDYGTLQRI--IRFYTEQRYSDADNKFNSIGH 189 (530) Q Consensus 117 -~~~~~e~~~~~~~~G~a~~~~y~d~---~g~~-~~~~~~p~~~~~v~d~~~~~~~~--~~~y~~~~~~~~~~~~~~~~~ 189 (530) ......+..+...+|.+|.++-++. .|.+ .+..++|..+.++.....+.... ...|.+.... +. T Consensus 78 ~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~~~~y~~~~~~-----G~---- 148 (723) T protein:vir:94 78 AQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQAQIIGYVIERTD-----GV---- 148 (723) T ss_pred HHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceeeeeeEEEEEecC-----ce---- Confidence 2233445667888999998876543 2433 35566776665554333221110 1111110000 00 Q ss_pred EEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccccccccccccccCCccceEEeeCCcC Q lcl|NC_011308. 190 ADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKL 269 (530) Q Consensus 190 ~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~ 269 (530) ...+..+.+.|++.. +++ +.-. T Consensus 149 ~~~~~~~dIiHir~~-------------------------------------------------~~~---------dg~~ 170 (723) T protein:vir:94 149 RVPVLADEMLWLRFS-------------------------------------------------DPY---------DPLA 170 (723) T ss_pred eEEecccceEEecCC-------------------------------------------------CCC---------CCcc Confidence 001222222222110 000 1114 Q ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCc--hhhHHHHHh--------hCcceecCCC-------- Q lcl|NC_011308. 270 GISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTNSP--VDEIKKNIQ--------SKKIIQTKGE-------- 331 (530) Q Consensus 270 ~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~~~--~~~~~~~~~--------~~~~i~~~~~-------- 331 (530) |.|.++.....|+.-..+..-..+.+.-.+.|-.+|+-...++ .+.++..++ .++++.++++ T Consensus 171 G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~~l~~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~ 250 (723) T protein:vir:94 171 VMAPWKAARAAVDADFYAATWQRQSFKNGARPGGVVNLGDMDEQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGGAAG 250 (723) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCHHHHHHHHHHHHHHhhchhhcCcceeeccccccccccc Confidence 6666665444444333333333333333444656665322211 112222221 2345555432 Q ss_pred CceeEEEecCCH--HHHHHHHHHHHHHHHHHhcccCCCcccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 332 GGLDIQTVDIPY--EARKAKMDIDELNIYRSGMGFNSSAVGDGNATNVVIKSRYTLLAMKAQKTEIALRKTLRWTADLVV 409 (530) Q Consensus 332 ~~~~~lt~~~~~--~~~e~~ld~L~~~I~~~s~~p~~~~~~~gn~SGvAik~~~~~l~~ka~~ke~~f~~~l~~~~~~i~ 409 (530) .+++|....++. ..+..........|...-++|..-- .|.+++..+ ......++...|.-.++.|. T Consensus 251 ~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i--~~~st~sN~----------e~~~~~f~~~tL~P~~~~ie 318 (723) T protein:vir:94 251 KGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKDAL--LGGSTYENQ----------AEAKAAVWTETLIPQMEVMA 318 (723) T ss_pred CCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChhHc--CCCCCcccH----------HHHHHHHHHHHHHHHHHHHH Confidence 245555544443 2333344555666777777774211 111111111 11111233444544445555 Q ss_pred HHHHhcCCCccccceeeEEeCCC--CCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC--CCCHHHHHHHH--H------- Q lcl|NC_011308. 410 EDIRRRGLGDYSSTDIKFDIEPY--ILANELDLAMIDKTEAETNQIQINNLLAIAPR--IGDEETLKAIC--D------- 476 (530) Q Consensus 410 ~~l~~~~~~~~d~~~i~i~f~~~--~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~--vdd~~~e~~~~--e------- 476 (530) ..++.+-..++. ..+.+.|+.. +-.|..+.++....+..+|+++...+++.+++ +.+-+..+-.. . T Consensus 319 ~~ln~~Ll~~~g-~~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~~~~~~p~~~~~a~~~ 397 (723) T protein:vir:94 319 SITDLQLLPDIG-WTVEWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDPLPGGIGQMTLTPYRAQFAPAP 397 (723) T ss_pred HHHhHhhccccc-CceEEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeccccccccCCC Confidence 444433222221 2466777642 44677888888888999999999999988754 33222110000 0 Q ss_pred ---HHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCCCcccccccCCC Q lcl|NC_011308. 477 ---TLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNIDPVIEEEPVQE 530 (530) Q Consensus 477 ---~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (530) -...+....+.++.+... ...+.+.....-.++.+.|+-..+++.+- T Consensus 398 ~~~p~~~e~~~~~~~~~~~~~-------~~~p~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (723) T protein:vir:94 398 APAPAVEEGAARMLALLERVA-------ADRPLPELPVRATTVLHHDPGPDPQQTLY 447 (723) T ss_pred CCCccchhhhHhhhhhccccc-------cccCcCCCCCCCCCCCCCCcccCCchhHH Confidence 000000000001111100 00000000001111122222222221111 No 252 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=29.16 E-value=1.7 Score=19.34 Aligned_cols=340 Identities=11% Similarity=0.053 Sum_probs=129.4 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |-+.. ++. .|.. ..-+.+..... ...+. ... .........|+..++-+.+- T Consensus 1 M~if~---~~~-~~~~----------~~~~~~~~~~~----~~~~~---------~~~--~~~~~v~~~v~~Ia~~iA~l 51 (378) T protein:vir:94 1 MNLFG---KVV-SFSR----------GKLNNDTQRVT----AWQNE---------AVE--YTSAFVTNIHNKIANEITKV 51 (378) T ss_pred CchhH---HhH-hhhh----------cccccCcceee----eeecc---------hhh--hhhHHHHHHHHHHHHhHhhC Confidence 33222 211 0100 00011111000 00000 000 11123455777777778788 Q ss_pred ceee-ecCCc--chH----HHHHHHHHHhh---ccH---HHHHHHHHHHHhhcCeEEEE-EEecCCCceEEEEecccceE Q lcl|NC_011308. 91 GIDV-KPTDH--DDQ----KLCYLIEEYYN---EEF---QSAIQELVEGSTIKGYEGIF-ARTTSEDKLTFQTVDALQLL 156 (530) Q Consensus 91 pv~~-~~~~~--~de----~~~~~l~~~~~---~~~---~~~~~e~~~~~~~~G~a~~~-~y~d~~g~~~~~~~~p~~~~ 156 (530) |+.. ..... ... ....-|..+++ |.. ......+.......|.||.+ ++.+..|++... T Consensus 52 p~~~~~~~~~~~~~~~~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~-------- 123 (378) T protein:vir:94 52 EFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELLDL-------- 123 (378) T ss_pred ceeeeeecccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEEEE-------- Confidence 8753 21111 000 01111223332 221 23334566778889999865 344444433110 Q ss_pred EEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccc Q lcl|NC_011308. 157 PVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEA 236 (530) Q Consensus 157 ~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (530) +|.. .. ..|..+.+.|++.. T Consensus 124 --------------~~~~--------~~------~~~~~~dvih~~~~-------------------------------- 143 (378) T protein:vir:94 124 --------------LFAN--------DK------KEYKPEELVRLTSP-------------------------------- 143 (378) T ss_pred --------------EEec--------Cc------EEechhceeeecCc-------------------------------- Confidence 1100 00 01222233332100 Q ss_pred eecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCC-Cch-- Q lcl|NC_011308. 237 ILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQDMAEAIYVVRGGTN-SPV-- 313 (530) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~~~~~~lvl~g~~~-~~~-- 313 (530) .+ .....+. +..+..+++..+... ....++...+.-. +.. T Consensus 144 -----------------~~-----------~~~~~~~---~~~~~~~~~~~~~~~------~~~g~l~~~~~l~~~~~~~ 186 (378) T protein:vir:94 144 -----------------FY-----------INEDTSI---LDNALASIQTKLEQG------KLRGLLKINAFLDIDNTQE 186 (378) T ss_pred -----------------CC-----------cccchhH---HHHHHHHHHHHHhhC------CcccceeeCCcCCHHHHHH Confidence 00 0011111 222333333222111 1122333322111 111 Q ss_pred --hhHHHHHh-------hCcceecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHH Q lcl|NC_011308. 314 --DEIKKNIQ-------SKKIIQTKGEGGLDIQTVDIPYEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRY 383 (530) Q Consensus 314 --~~~~~~~~-------~~~~i~~~~~~~~~~lt~~~~~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~ 383 (530) +.+...++ .++++.++++.+++-++.+..... ...++.+.+.|...-.+|. .-+ |+.+... T Consensus 187 ~~e~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~-~~~~~~~~~~Ia~~fgvPp---~~l~g~~~e~~----- 257 (378) T protein:vir:94 187 YREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLN-KDEIDLIKSELLTGYFMNE---NILLGTATQEQ----- 257 (378) T ss_pred HHHHHHHHHHHhhcccccccceeccCCceEEEccCChHHhh-HHHHHHHHHHHHHHhCCCH---HHhcCCchHHH----- Confidence 11222221 234667776666665554332222 2445667777877777763 111 2222111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---------CCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCc Q lcl|NC_011308. 384 TLLAMKAQKTEIALRKTLRWTADLVVEDIRRRG---------LGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQ 454 (530) Q Consensus 384 ~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~---------~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS 454 (530) ...++...|.-.+..|...|+.+- -.......+.+.+++-+-.|..+.++....+...|+++ T Consensus 258 ---------~~~f~~~tl~P~~~~ie~~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t 328 (378) T protein:vir:94 258 ---------QIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFT 328 (378) T ss_pred ---------HHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcC Confidence 112344445555544444444321 11112234556666767778889999999999999999 Q ss_pred HHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCCCccCcCCC Q lcl|NC_011308. 455 INNLLAIAPR--IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIEPQPEPLNI 519 (530) Q Consensus 455 ~et~l~~~~~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (530) .-.+++.+++ +++-+.-+-. ..+..+....+......+ .++..+..|. T Consensus 329 ~NE~R~~~g~~p~~ggd~~~~~---------~n~~~~~~~~~~~~~~~~--------~~~~~e~~n~ 378 (378) T protein:vir:94 329 QNQLLVKMGEQPIEGGDVYIAN---------LNAVAVKNLSDLQGNRKD--------VTSTDETNNQ 378 (378) T ss_pred HHHHHHHhCCCCCCCCCeeeec---------ccccchhcchhcccccCC--------CCCCCCCCCC Confidence 9999888754 3332111000 000000000000000000 0000000011 No 253 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=28.29 E-value=1.8 Score=19.24 Aligned_cols=441 Identities=10% Similarity=-0.047 Sum_probs=174.0 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc-- Q lcl|NC_011308. 13 LGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN-- 90 (530) Q Consensus 13 ~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~-- 90 (530) +.+-...+-.+..++.-..+.+.+.+|.-.. +...+.......+. -.+...+-...-++..++-|+|- T Consensus 1 m~~~~~~l~~k~~R~~~e~~w~e~a~~~lP~--~~~~~~~~~~~~~~--------~~~~~dstg~~a~~~LAa~l~~~lt 70 (514) T protein:vir:80 1 MRQQASAMWAEYRDSTAIRKAEDFAKFTIAS--LMVDPLDKTHQAEV--------VEYDFQSAGAFLVNNLTAKLALTLF 70 (514) T ss_pred CccchHHHHHHhhcchHHHHHHHHHHHhccc--ccCCCCCCcccccc--------cccccchhHHHHHHHHHHHHHhhhc Confidence 2233334433443332233445555554331 11111000000000 01222233333444444433321 Q ss_pred ce-----eeecCCcc----------hHHHHH-------HHHHHh-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEE Q lcl|NC_011308. 91 GI-----DVKPTDHD----------DQKLCY-------LIEEYY-NEEFQSAIQELVEGSTIKGYEGIFARTTSEDKLTF 147 (530) Q Consensus 91 pv-----~~~~~~~~----------de~~~~-------~l~~~~-~~~~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~ 147 (530) |+ ++...+.. ..++.. .+...+ ..||....+++-++...+|.+. +|.+++ ..++ T Consensus 71 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--l~~~~~-~~~~ 147 (514) T protein:vir:80 71 PPGRPSFQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNAL--FYREPG-TGKM 147 (514) T ss_pred CCCCcccccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEE--EEEecC-CCcE Confidence 21 12221110 011222 122222 4688899999999999999985 444443 2345 Q ss_pred EEecccceEEEEcCCCCceeEEEEEEEEeeccc----------ccccceEEEEEEEcCCceEEEeecCCcccchhhcccc Q lcl|NC_011308. 148 QTVDALQLLPVFDDYGTLQRIIRFYTEQRYSDA----------DNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTT 217 (530) Q Consensus 148 ~~~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~----------~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~ 217 (530) +.++-.+.++--|..+....++|-+......-. ....+....++||+.- ++..+. .... T Consensus 148 ~~~pl~~y~v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v----~~~~~~-~~~~------ 216 (514) T protein:vir:80 148 LVWTMQSYTVRRTSHGDPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVI----EWQPTP-NGKR------ 216 (514) T ss_pred EEEEcCeEEEeeCCCcCeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEE----EeecCC-CCeE------ Confidence 666555555556777777666655432111100 0011112233343310 111110 0000 Q ss_pred ccccccceeeeeecccccceecccccccccccccccccCCccceEEeeC-----CcCCCCcHHHHHHHHHHHHHHHHHHH Q lcl|NC_011308. 218 VNPNPSQHVLAVADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYN-----NKLGISDIKKVKSIIDDYDLMNCFLS 292 (530) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~n-----n~~~~sd~e~v~~liDa~~~~~S~~~ 292 (530) ...+..+ .+.........+|...|++.++- ..+|.|--++..+-+-.++.+.-... T Consensus 217 -----~sv~~e~--------------~g~~i~~es~y~~~e~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l 277 (514) T protein:vir:80 217 -----CAVWHEL--------------EGKRVGPESSYPAHLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLG 277 (514) T ss_pred -----EEEEEec--------------cceeecccCccccccCCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHH Confidence 0000000 00000111223345567776654 35899988999999999998877777 Q ss_pred HHHHHhccceeeeecCCCCchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHHHHHHHHHhccc-CCCcc Q lcl|NC_011308. 293 NNLQDMAEAIYVVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDIDELNIYRSGMGF-NSSAV 369 (530) Q Consensus 293 n~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L~~~I~~~s~~p-~~~~~ 369 (530) ........|.+.+.-.+....... ...+.+.+..+..+++..+... .+.......++.++..|-..-+.- ++.+ T Consensus 278 ~~~~~a~~~~~~v~~~g~~~~~~l--~~~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFml~~~~rd- 354 (514) T protein:vir:80 278 LYEFEALSLLNLVDEAKGGAVDDY--RDAETGDFVPGQVGSVASYERGDYNKIAQASASVESIVMRLNRAFMYTGQVRD- 354 (514) T ss_pred HHHHHhcCCCceeCcccccchhhh--cccCCceeecCCCccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhccCCC- Confidence 777777777766532111111111 1223344544555667776543 356666777777777774322211 1111 Q ss_pred cccCCcHHHHHHHHhhHHHH----HHHHHHHHHH-HHHHHHHHHHHHHHhc--CC-CccccceeeEEeCCCCCC-----C Q lcl|NC_011308. 370 GDGNATNVVIKSRYTLLAMK----AQKTEIALRK-TLRWTADLVVEDIRRR--GL-GDYSSTDIKFDIEPYILA-----N 436 (530) Q Consensus 370 ~~gn~SGvAik~~~~~l~~k----a~~ke~~f~~-~l~~~~~~i~~~l~~~--~~-~~~d~~~i~i~f~~~~P~-----n 436 (530) .++.++..+..+-.-+.+. -.+....|-. -+.|.+.+ +... +. -......+++.+.-.+.. + T Consensus 355 -~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~i----l~r~~~g~lP~~p~~l~~~~~vs~la~l~r~~~ 429 (514) T protein:vir:80 355 -AERVTVEEIRTVAEEAENLLGGVYSLLAETLQAPLAYLTMYE----ASRGNGGMLLGIAQGVYRPSIITGIPALTRNIE 429 (514) T ss_pred -CCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH----HhhhccCCCCCCCchhhcceeeecHHHHHHHHH Confidence 1334555554332222211 1122222222 22332333 2211 11 111111234444333211 1 Q ss_pred HHH---HHHHHHHHHhcC-----CCcHHHHHHhC------C---CCCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCc Q lcl|NC_011308. 437 ELD---LAMIDKTEAETN-----QIQINNLLAIA------P---RIGDEETLKAICDTLDLDYEDVVKALEDQEVEELEP 499 (530) Q Consensus 437 ~~e---~a~~~~~~~~~g-----~iS~et~l~~~------~---~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~ 499 (530) ... .++.+..+.... .+.-..++..+ | ++.+++....+.++++++..+.+..-+-. . ..+ T Consensus 430 ~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~-~--~~~ 506 (514) T protein:vir:80 430 TANILRATQEASAIVPALVQLSKRFDPEKLVERIFANNSVDLSTLSKDPDVVAAEAEQEAALAQQQLDVASGA-L--AAE 506 (514) T ss_pred HHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHhCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHH-H--HHh Confidence 111 111111111110 12233333331 1 34455544444455444433322221111 0 111 Q ss_pred cccCCCCC Q lcl|NC_011308. 500 TVTPIIDP 507 (530) Q Consensus 500 ~~~~~~~~ 507 (530) -+.+..-. T Consensus 507 ~~~~~~~~ 514 (514) T protein:vir:80 507 TSAGVLTS 514 (514) T ss_pred hhccccCC Confidence 11111100 No 254 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=27.18 E-value=1.9 Score=19.09 Aligned_cols=349 Identities=7% Similarity=-0.025 Sum_probs=134.2 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |- ++..+..+ ....... .++ ..... . ..+.-+...-....|+..++-+.+- T Consensus 1 Mg---~f~~l~~~---~~~~~~~---~~~-----~~~~~-------------~--~~~~~l~~~~v~~~i~~Ia~~ia~~ 51 (376) T protein:vir:78 1 MG---FFSELFKR---NKEIEWM---WDL-----DFLED-------------K--TTKVYLKKMALNTCVKHIARTIAKS 51 (376) T ss_pred Cc---hhhhhhcc---CCccccc---cch-----hhccc-------------c--chhhhhhhHHHHHHHHHHHHhhccc Confidence 32 11211100 0000000 000 00000 0 0000011223455677777777778 Q ss_pred ceeeecCCc-chHHHHHHHHHHhhcc---HHHHHHHHHHHHhhcCeEEEEEEecCCCceE-EEEecccceEEEEcCCCCc Q lcl|NC_011308. 91 GIDVKPTDH-DDQKLCYLIEEYYNEE---FQSAIQELVEGSTIKGYEGIFARTTSEDKLT-FQTVDALQLLPVFDDYGTL 165 (530) Q Consensus 91 pv~~~~~~~-~de~~~~~l~~~~~~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~-~~~~~p~~~~~v~d~~~~~ 165 (530) |+.+...+. .+......|..- -|. .......+......+|.+|.++..+..|.+. ...+.|..+.+. T Consensus 52 p~~~~~~~~~~~~~l~~ll~~~-PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~~~~~~~~~~~~------- 123 (376) T protein:vir:78 52 DFRLKNGETSVRDKLYYKLNIR-PNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIADSYVRKEFAFFPD------- 123 (376) T ss_pred ceeeccccccccchHHHHHhhc-cccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeeccceeecccceeee------- Confidence 887642221 122222323210 122 2334566777888899999887776665331 111112111110 Q ss_pred eeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceeccccccc Q lcl|NC_011308. 166 QRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEEH 245 (530) Q Consensus 166 ~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 245 (530) .++.+... ...+...|..+.+.|++. T Consensus 124 ----~~~~~~~~--------~~~~~~~~~~~evih~~~------------------------------------------ 149 (376) T protein:vir:78 124 ----VFEGVTVK--------DYRYNRNFSMDDVIFLEY------------------------------------------ 149 (376) T ss_pred ----eeeeeeee--------cceeeeeeccccEEEecc------------------------------------------ Confidence 01111000 000111233333333321 Q ss_pred ccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHH--hccceeeeecCC-CCchh--hHHH-- Q lcl|NC_011308. 246 EGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLSNNLQD--MAEAIYVVRGGT-NSPVD--EIKK-- 318 (530) Q Consensus 246 ~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~n~~~~--~~~~~lvl~g~~-~~~~~--~~~~-- 318 (530) +..| +. ....+++..+..+++...+.... ...+.+++.... .++.. .++. T Consensus 150 -----------~~~~---------~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~ 206 (376) T protein:vir:78 150 -----------GNER---------LS---AFTDGMFEDYGELFGKMIRAQMRNFQIRGAVNFKMAGVADKDKQTKLQEYI 206 (376) T ss_pred -----------CCCC---------ch---hhhhHHHHHHHHHHHHHHHHHHhcCCCceeEEEccCCCCCHHHHHHHHHHH Confidence 0011 11 12223444555444444333322 223555554222 22211 1111 Q ss_pred --HHh-----hCcceecCCCCceeEEEecCC-----HHHHHHHHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHHHhh Q lcl|NC_011308. 319 --NIQ-----SKKIIQTKGEGGLDIQTVDIP-----YEARKAKMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSRYTL 385 (530) Q Consensus 319 --~~~-----~~~~i~~~~~~~~~~lt~~~~-----~~~~e~~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~~~~ 385 (530) ..+ ..+++.+++|.+..-++.... ...+....+...+.|...-.+|.. -+ |+-|++. T Consensus 207 ~~~~~g~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~---~l~~~~s~~e------- 276 (376) T protein:vir:78 207 DKVYASFNNNEIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSS---LLHGDMADLS------- 276 (376) T ss_pred HHHhccccccCcceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHH---HhCCCCCCHH------- Confidence 111 122444565655554443221 123344455666677777777742 22 2222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHhCCC- Q lcl|NC_011308. 386 LAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSSTDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAIAPR- 464 (530) Q Consensus 386 l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~~~~- 464 (530) ......+...|.-.+..|...++.+--.... ..+.+.|..-+-.|..+.++....+...|+++...+++.+++ T Consensus 277 -----~~~~~f~~~~l~P~~~~ie~~l~~kll~~~~-~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~ 350 (376) T protein:vir:78 277 -----NNMKAYMEYCIDPLTKKLEDELNAKLFTFSE-FLAGEHIKIIHKKDIIENAEAVDKLVASGSFNRNEVRELLGAE 350 (376) T ss_pred -----HHHHHHHHHHHHHHHHHHHHHHHhhhCCccc-ceecccchhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 1112344444555555554444432211111 112222222334577888888889999999999888887654 Q ss_pred -CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccc Q lcl|NC_011308. 465 -IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTV 501 (530) Q Consensus 465 -vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~ 501 (530) +++.....-.+ ...+..+.++.++ + T Consensus 351 p~~~g~~d~~~~-------~~n~~~~~~~~e~-----g 376 (376) T protein:vir:78 351 RVDNPELDKYLI-------TKNYQSADEGGED-----G 376 (376) T ss_pred CCCCCCCceeee-------ccCceehhccccC-----C Confidence 22221000000 0001111111111 1 No 255 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=25.45 E-value=2.1 Score=18.87 Aligned_cols=364 Identities=10% Similarity=-0.032 Sum_probs=129.4 Q ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHHhhhhhhhccc Q lcl|NC_011308. 11 DRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELVDQKTQYLLAN 90 (530) Q Consensus 11 ~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Ivd~~~~yl~G~ 90 (530) |-+-+++.. ++..-... ...+..... ..++.=+.+.-....|+..++-+.+- T Consensus 1 Mgl~d~~~~----------------------~~~~~~~~-----~~~~~~~~~-~~~~~~l~~~~v~~~i~~Ia~~ia~l 52 (395) T protein:vir:96 1 MGILDFFSF----------------------KKSGTLSD-----DDSGSTTSE-KLTNVVLKEDALYKCVNYLARIISKS 52 (395) T ss_pred CcchhhhcC----------------------CCCccccc-----cccccchhh-hcchhhhhhHHHHHHHHHHHHhhccc Confidence 332222111 11000000 000000000 00000011122334567777777777 Q ss_pred ceeeecCCcchHHHHHHHHHHhh---cc---HHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEecccceEEEEcCCCC Q lcl|NC_011308. 91 GIDVKPTDHDDQKLCYLIEEYYN---EE---FQSAIQELVEGSTIKGYEGIFARTTSEDKLTFQTVDALQLLPVFDDYGT 164 (530) Q Consensus 91 pv~~~~~~~~de~~~~~l~~~~~---~~---~~~~~~e~~~~~~~~G~a~~~~y~d~~g~~~~~~~~p~~~~~v~d~~~~ 164 (530) |+.+...+. ......-+..+++ |. .......+......+|.||.++-.+..+ .+.+. +++... T Consensus 53 p~~v~~~~~-~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~-----~~~~~--~~~~~~--- 121 (395) T protein:vir:96 53 TFRIKAPEK-LTENQKDWLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGI-----YVADA--FTQDKK--- 121 (395) T ss_pred eeEEEeCCc-cccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCce-----ecCCc--cccccc--- Confidence 887643321 1111112233332 22 2234556778888899998777655321 11111 111000 Q ss_pred ceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeeeecccccceecccccc Q lcl|NC_011308. 165 LQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAVADGVDEAILDEGVEE 244 (530) Q Consensus 165 ~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (530) ..+ .+++.+.. . . ...-..+..+.+.|++.... T Consensus 122 ~~~-~~~~~v~~-~-----~--~~~~~~~~~~dvih~k~~~~-------------------------------------- 154 (395) T protein:vir:96 122 LSG-NKFKVSRV-Q-----G--QTYEKIFTFDQVIYLKNDNS-------------------------------------- 154 (395) T ss_pred ccc-ceeeeeee-c-----c--ceeeeEeccCceEEecccCC-------------------------------------- Confidence 000 00111100 0 0 00011233333443321110 Q ss_pred cccccccccccCCccceEEeeCCcCCCCc---HHHHHHHHHHHHHHHHH---HHHHHHHhccceeeeecCCCCchhhHH- Q lcl|NC_011308. 245 HEGRQVLGRSYKSRFPFDILYNNKLGISD---IKKVKSIIDDYDLMNCF---LSNNLQDMAEAIYVVRGGTNSPVDEIK- 317 (530) Q Consensus 245 ~~~~~~~~~~~~~~iPiv~~~nn~~~~sd---~e~v~~liDa~~~~~S~---~~n~~~~~~~~~lvl~g~~~~~~~~~~- 317 (530) +.. ..+.+. ...+..+.-+.....+. ..+...-...+..++.-.+....+... T Consensus 155 ---------------~~~-----~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (395) T protein:vir:96 155 ---------------DLM-----LKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVRERAQENSDGGRQPKSDKD 214 (395) T ss_pred ---------------ccc-----cccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccccccceeeccCchhhHHHHHH Confidence 000 011122 22333333222211111 112222222222333221211111111 Q ss_pred --HH----Hh--hCcceecCCCCceeEEEecC-CHHHHHH-----HHHHHHHHHHHHhcccCCCcccc-cCCcHHHHHHH Q lcl|NC_011308. 318 --KN----IQ--SKKIIQTKGEGGLDIQTVDI-PYEARKA-----KMDIDELNIYRSGMGFNSSAVGD-GNATNVVIKSR 382 (530) Q Consensus 318 --~~----~~--~~~~i~~~~~~~~~~lt~~~-~~~~~e~-----~ld~L~~~I~~~s~~p~~~~~~~-gn~SGvAik~~ 382 (530) .. .. ..+++.+++|.+.+-++... +....+. ......+.|...=.+|. .-+ |+.|+.. T Consensus 215 ~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp---~~l~~~~sn~e---- 287 (395) T protein:vir:96 215 FFKRTIEKIRTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPI---SLLHGDIADNQ---- 287 (395) T ss_pred HHHHHHHHhhcCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCH---HHhcCCCccHH---- Confidence 11 11 22233444444443333221 1111221 11222344555555553 222 2222211 Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc-cceeeEEeCCCCCCCHHHHHHHHHHHHhcCCCcHHHHHHh Q lcl|NC_011308. 383 YTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYS-STDIKFDIEPYILANELDLAMIDKTEAETNQIQINNLLAI 461 (530) Q Consensus 383 ~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d-~~~i~i~f~~~~P~n~~e~a~~~~~~~~~g~iS~et~l~~ 461 (530) ...+.++...|.-.+..|...++.+-..+.. ...+.+.|+.-+..|..+.++....+..+|+++...+++. T Consensus 288 --------~~~~~f~~~~L~P~~~~ie~~l~~~Ll~~~e~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~ 359 (395) T protein:vir:96 288 --------KNYELLLEGPIESLITNIVDGLEYAIFDKSETLEGSFIKVTGLKNYDLFSISSQADKLISSGFVFIDEVREE 359 (395) T ss_pred --------HHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhcCceeEeecchhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 1122345555555555555555432211111 1124567777788899999999999999999999889888 Q ss_pred CCC--CCCHHHHHHHHHHHHHHHHHHHHhhhccccccCCccccCCCCCCCCC Q lcl|NC_011308. 462 APR--IGDEETLKAICDTLDLDYEDVVKALEDQEVEELEPTVTPIIDPLTIE 511 (530) Q Consensus 462 ~~~--vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (530) +++ ++++... +......-..-+. ..+..++..++ T Consensus 360 ~gl~pi~~~~gD------------~~~~~~N~~~~~~----~gge~~~~~~~ 395 (395) T protein:vir:96 360 IGLPELPDGLGK------------VLYMTKNYESVLE----RGGEVDEEVET 395 (395) T ss_pred hCCCCCCCCCCc------------eeeecccceechh----ccCCCCCCCCC Confidence 753 4442110 0000000000000 00000111111 No 256 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=24.33 E-value=2.2 Score=18.72 Aligned_cols=402 Identities=11% Similarity=0.011 Sum_probs=147.6 Q ss_pred CCcccccCCcccHHHHHHHHHHHHHHhhhHHHHHHHHHHhcccchhhhcccccccccccccccccCCcceeecCchhhHH Q lcl|NC_011308. 1 MTNTLLTTAPDRLGTILSTKIDEYIRSQNVSLARVGQRYYNQDNDIENTRIMWMNDHGDIVEDDNASNIKISHGFFAELV 80 (530) Q Consensus 1 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~~YY~g~~~I~~r~~~~~~~~~~~~~~~~~~n~ki~~n~~k~Iv 80 (530) |.+.+..+ +-..+-+ ..+.+++|+.+..+++-...|.. || T Consensus 57 ~~q~~y~~-------~e~~~~~---~~eLI~~YR~ma~~pEvd~Av~e------------------------------IV 96 (523) T protein:vir:68 57 MFQRMFGS-------QEPGLKS---TRELIDTYRNLMTNYEVDNAVSE------------------------------IV 96 (523) T ss_pred hhhhhhhc-------cccccch---HHHHHHHHHHHhhccchhhHHHH------------------------------hh Confidence 11111111 0001000 12334556666665555553321 22 Q ss_pred hhhhhh-hcccceeeecCCcc-----hHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeEEEEEEecC----CCceEEEE Q lcl|NC_011308. 81 DQKTQY-LLANGIDVKPTDHD-----DQKLCYLIEEYYN-EEFQSAIQELVEGSTIKGYEGIFARTTS----EDKLTFQT 149 (530) Q Consensus 81 d~~~~y-l~G~pv~~~~~~~~-----de~~~~~l~~~~~-~~~~~~~~e~~~~~~~~G~a~~~~y~d~----~g~~~~~~ 149 (530) +-.+-+ -...||.+..++.+ .+.+.+.++.+++ =+|+...++..+.+-+.|+-|.|..+|. +|-..... T Consensus 97 neaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~ 176 (523) T protein:vir:68 97 SDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRR 176 (523) T ss_pred cceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhheeeeEEEEEEEeeCCCccccceeeee Confidence 111111 13445555433221 2345555555553 3788899999999999999999999873 36667788 Q ss_pred ecccceEEEEcCCCCceeEEEEEEEEeecccccccceEEEEEEEcCCceEEEeecCCcccchhhccccccccccceeeee Q lcl|NC_011308. 150 VDALQLLPVFDDYGTLQRIIRFYTEQRYSDADNKFNSIGHADVWTDTEVWYYVQKDEGRSDEYVLDTTVNPNPSQHVLAV 229 (530) Q Consensus 150 ~~p~~~~~v~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (530) +||..+-.|.--..+...... ....+...-+|++.... |...+. .... +. .+... T Consensus 177 lDPr~i~~vr~i~~~~~~g~~------------vi~~~~e~f~Y~~~~~~-~~~~g~--~~~~--------~~--~ikI~ 231 (523) T protein:vir:68 177 LDPRQVQYVREVITTTEAGVK------------IVKGYKEYFIYDTSHES-YACDGR--IYEA--------GT--KIKIP 231 (523) T ss_pred eCCcceeEEEeecCCCCcchh------------hhhhhhhheeecccccc-cccccc--ccCC--------Cc--ceecc Confidence 999877443310000000000 00001111123322211 111100 0000 00 00000 Q ss_pred ecccccceecccccccccccccccccCCccceEEeeCCcCCCCcHHHHHHHHHHHHHHHHHHH----------------- Q lcl|NC_011308. 230 ADGVDEAILDEGVEEHEGRQVLGRSYKSRFPFDILYNNKLGISDIKKVKSIIDDYDLMNCFLS----------------- 292 (530) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPiv~~~nn~~~~sd~e~v~~liDa~~~~~S~~~----------------- 292 (530) ..+... .-.|.+|+ +...-.|-++..+.....+- ++-|.. T Consensus 232 ~dAI~y------------------~hSGL~d~----~~~~i~gyLhkAiKp~NQLk-mlEDAlVIYRitRAPeRRvFYID 288 (523) T protein:vir:68 232 KAAIVY------------------AHSGLVDC----CGKNIIGYLHRAIKPANQLK-LLEDAVVIYRITRAPDRRVWYVD 288 (523) T ss_pred hhheee------------------eeccceeC----CCCceeccchhhhHHHHhhH-HHHhhHHHHhhhccccceEEEEe Confidence 000000 00111111 01111233332222111111 111111 Q ss_pred --------------HHHHHhcccee--eeecCCCCchhhHHHHHhhCcceecCCCCceeEEEec--CCHHHHHHHHHHHH Q lcl|NC_011308. 293 --------------NNLQDMAEAIY--VVRGGTNSPVDEIKKNIQSKKIIQTKGEGGLDIQTVD--IPYEARKAKMDIDE 354 (530) Q Consensus 293 --------------n~~~~~~~~~l--vl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~lt~~--~~~~~~e~~ld~L~ 354 (530) +.+..|.+-+. .-+|- ..+.-.++.-+....+--=+++.+-+.-|-+ .+.. .-.-+.=.+ T Consensus 289 vGnlPk~KAeqYl~~im~k~kNKlvYDa~TGe-v~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlg-em~DV~YF~ 366 (523) T protein:vir:68 289 TGNMPSRKAAEHMQHVMNTMKNRIAYDATTGK-IKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTG-NMEDVRWFR 366 (523) T ss_pred cCCCCchhHHHHHHHHHHhhcceeEEeccCCe-eccchhhhhhHhhhcccccCCCcccceeeccccCCcC-hHHHHHHHH Confidence 11111222110 01110 0011111111111111111122222322333 2222 222345566 Q ss_pred HHHHHHhcccC--CCcc----cccCCcHHHHH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc--cee Q lcl|NC_011308. 355 LNIYRSGMGFN--SSAV----GDGNATNVVIK-SRYTLLAMKAQKTEIALRKTLRWTADLVVEDIRRRGLGDYSS--TDI 425 (530) Q Consensus 355 ~~I~~~s~~p~--~~~~----~~gn~SGvAik-~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~l~~~~~~~~d~--~~i 425 (530) +.+|..-.+|- +..+ .+|..|.++.. ++|. --+.+.+..|..-+.++|+.=+-+=++....+|+- ..| T Consensus 367 kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEikF~---KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I 443 (523) T protein:vir:68 367 NALYMALRIPITRIPSDQGGIQFDAGTSITRDELSFG---KFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNI 443 (523) T ss_pred HHHHHHhCCcceeecCCCcceecccccchhHHHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcc Confidence 77777778883 3222 22433322221 1111 22344444444444444332111111222345554 467 Q ss_pred eEEeCCCCCCCHHHHHHHHHHHH---------hcCCCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccccc- Q lcl|NC_011308. 426 KFDIEPYILANELDLAMIDKTEA---------ETNQIQINNLLAIAPRIGDEETLKAICDTLDLDYEDVVKALEDQEVE- 495 (530) Q Consensus 426 ~i~f~~~~P~n~~e~a~~~~~~~---------~~g~iS~et~l~~~~~vdd~~~e~~~~e~e~~e~~~~~~~~~~~~~~- 495 (530) .+.|.+.---.+...++++.... -+..+|++++.+.+=...|.+ +++++++-+++++..--++.+ T Consensus 444 ~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDee-----i~~~~kqI~~E~k~~~~~~p~~ 518 (523) T protein:vir:68 444 KIKFHRDSYFSELKDAEILERRINMLQMAEPFIGKYISHRTAMKDILQMSDEE-----IEQEAKQIEEESKEARFQDPDQ 518 (523) T ss_pred eEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHH-----HHHHHHHHHHHhhcCCCCCCch Confidence 88887765555544444433211 122469999998854444433 334443333333332222111 Q ss_pred cCCcc Q lcl|NC_011308. 496 ELEPT 500 (530) Q Consensus 496 ~~~~~ 500 (530) +.++. T Consensus 519 e~~~f 523 (523) T protein:vir:68 519 EQEDF 523 (523) T ss_pred hhhcC Confidence 11111 Done!