Query lcl|NC_010808.1_cdsid_YP_001949841.1 [gene=orf43] [protein=putative portal protein] [protein_id=YP_001949841.1] [location=18561..20099] Match_columns 512 No_of_seqs 159 out of 551 Neff 9.8 Searched_HMMs 1612 Date Thu Nov 7 13:26:02 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_43 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_43_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:97171 Length: 512 100.0 6E-124 4E-127 696.2 56.2 512 1-512 1-512 (512) 2 protein:vir:9306 Length: 511 # 100.0 1E-120 9E-124 677.5 55.1 511 1-512 1-511 (511) 3 protein:vir:96240 Length: 511 100.0 4E-120 2E-123 675.3 55.4 511 1-512 1-511 (511) 4 protein:vir:103951 Length: 511 100.0 4E-120 2E-123 675.2 55.2 511 1-512 1-511 (511) 5 protein:vir:78805 Length: 511 100.0 4E-120 2E-123 675.5 54.5 511 1-512 1-511 (511) 6 protein:vir:96366 Length: 511 100.0 4E-120 2E-123 675.5 54.5 511 1-512 1-511 (511) 7 protein:vir:99781 Length: 511 100.0 2E-119 1E-122 671.2 54.3 511 1-512 1-511 (511) 8 protein:vir:4898 Length: 502 # 100.0 3E-107 2E-110 604.4 52.3 493 1-512 1-498 (502) 9 protein:vir:2732 Length: 501 # 100.0 2E-105 1E-108 595.0 52.3 495 1-512 1-501 (501) 10 protein:vir:96494 Length: 501 100.0 2E-104 1E-107 589.7 51.0 495 2-512 1-500 (501) 11 protein:vir:94546 Length: 506 100.0 3E-100 2E-103 566.3 50.0 477 1-512 1-504 (506) 12 protein:vir:106571 Length: 499 100.0 8E-99 5E-102 558.4 53.4 468 11-512 1-485 (499) 13 protein:vir:106639 Length: 481 100.0 1.2E-98 8E-102 557.4 53.5 478 9-512 1-480 (481) 14 protein:vir:3964 Length: 453 # 100.0 1.9E-98 1E-101 556.3 52.8 453 11-512 1-453 (453) 15 protein:vir:94805 Length: 492 100.0 9.8E-99 6E-102 557.9 50.7 467 11-511 1-492 (492) 16 protein:vir:97336 Length: 492 100.0 3.5E-98 2E-101 554.8 51.0 467 11-512 1-492 (492) 17 protein:vir:99522 Length: 470 100.0 8.5E-98 5E-101 552.8 53.0 466 11-503 1-470 (470) 18 protein:vir:1236 Length: 483 # 100.0 2.9E-97 2E-100 549.9 52.1 472 1-511 1-483 (483) 19 protein:vir:733 Length: 453 # 100.0 4.4E-97 3E-100 548.8 51.4 453 21-499 1-453 (453) 20 protein:vir:3609 Length: 452 # 100.0 7E-97 4E-100 547.7 52.4 452 11-512 1-452 (452) 21 protein:vir:105461 Length: 470 100.0 2.7E-96 1.7E-99 544.5 50.5 445 37-512 1-470 (470) 22 protein:vir:96266 Length: 474 100.0 5.8E-96 3.6E-99 542.7 49.6 462 14-512 1-474 (474) 23 protein:vir:95899 Length: 474 100.0 5.8E-96 3.6E-99 542.7 49.6 462 14-512 1-474 (474) 24 protein:vir:95806 Length: 440 100.0 6.1E-96 3.8E-99 542.6 48.4 434 48-503 1-440 (440) 25 protein:vir:9871 Length: 429 # 100.0 3.5E-95 2.1E-98 538.4 51.3 429 40-498 1-429 (429) 26 protein:vir:5961 Length: 503 # 100.0 6.7E-95 4.1E-98 536.9 51.2 479 1-512 1-499 (503) 27 protein:vir:93747 Length: 472 100.0 9.1E-95 5.7E-98 536.1 51.7 461 7-511 1-472 (472) 28 protein:vir:94101 Length: 474 100.0 1.1E-94 6.9E-98 535.7 49.5 454 14-505 1-474 (474) 29 protein:vir:105889 Length: 474 100.0 1.1E-94 6.9E-98 535.7 49.5 454 14-505 1-474 (474) 30 protein:vir:102330 Length: 451 100.0 1.7E-94 1.1E-97 534.6 50.3 434 40-493 1-451 (451) 31 protein:vir:107112 Length: 478 100.0 3E-94 1.9E-97 533.3 51.3 463 1-512 1-478 (478) 32 protein:vir:97447 Length: 474 100.0 2.2E-94 1.4E-97 534.0 50.3 463 1-512 1-474 (474) 33 protein:vir:94498 Length: 474 100.0 2.2E-94 1.4E-97 534.0 50.3 463 1-512 1-474 (474) 34 protein:vir:9922 Length: 489 # 100.0 2.8E-94 1.7E-97 533.5 50.4 472 1-508 1-489 (489) 35 protein:vir:105292 Length: 478 100.0 4.2E-94 2.6E-97 532.5 51.2 463 1-512 1-478 (478) 36 protein:vir:79043 Length: 479 100.0 1.3E-93 8.1E-97 529.8 52.0 456 1-504 1-479 (479) 37 protein:vir:102950 Length: 471 100.0 5.7E-94 3.6E-97 531.8 49.7 441 36-505 1-471 (471) 38 protein:vir:95113 Length: 474 100.0 5.3E-93 3.3E-96 526.5 50.4 460 17-510 1-474 (474) 39 protein:vir:96179 Length: 468 100.0 2.9E-92 1.8E-95 522.5 49.7 452 1-504 1-468 (468) 40 protein:vir:96839 Length: 474 100.0 5.6E-92 3.5E-95 520.9 49.6 459 1-509 1-474 (474) 41 protein:vir:78083 Length: 537 100.0 2.3E-90 1.4E-93 512.0 49.9 463 30-512 1-522 (537) 42 protein:vir:4223 Length: 486 # 100.0 1.5E-80 9.4E-84 458.2 45.7 456 27-512 1-478 (486) 43 protein:vir:78537 Length: 480 100.0 4.9E-80 3.1E-83 455.4 46.9 450 39-512 1-470 (480) 44 protein:vir:78227 Length: 480 100.0 1.5E-79 9.1E-83 452.8 46.6 447 39-512 1-468 (480) 45 protein:vir:2427 Length: 485 # 100.0 1.2E-79 7.4E-83 453.3 45.6 455 27-512 1-483 (485) 46 protein:vir:7768 Length: 484 # 100.0 3.9E-79 2.4E-82 450.4 44.8 457 30-512 1-480 (484) 47 protein:vir:2500 Length: 501 # 100.0 1.4E-78 8.7E-82 447.4 45.5 468 13-512 1-494 (501) 48 protein:vir:2341 Length: 488 # 100.0 2.4E-78 1.5E-81 446.2 46.0 454 30-512 1-485 (488) 49 protein:vir:104082 Length: 485 100.0 3.3E-78 2E-81 445.4 45.6 458 27-512 1-483 (485) 50 protein:vir:105819 Length: 456 100.0 2.9E-78 1.8E-81 445.6 43.4 440 37-502 1-456 (456) 51 protein:vir:102602 Length: 456 100.0 2.9E-78 1.8E-81 445.6 43.4 440 37-502 1-456 (456) 52 protein:vir:7987 Length: 456 # 100.0 8.9E-77 5.5E-80 437.5 43.8 440 37-506 1-456 (456) 53 protein:vir:80680 Length: 441 100.0 3.2E-76 2E-79 434.5 45.2 424 38-498 1-441 (441) 54 protein:vir:99072 Length: 479 100.0 9.4E-75 5.8E-78 426.4 43.8 445 27-512 1-470 (479) 55 protein:vir:99916 Length: 504 100.0 2.7E-71 1.7E-74 407.4 45.5 468 17-512 1-493 (504) 56 protein:vir:98444 Length: 434 100.0 3.6E-69 2.2E-72 395.8 41.4 409 75-511 1-434 (434) 57 protein:vir:8184 Length: 474 # 100.0 1.9E-67 1.2E-70 386.4 42.8 453 20-500 1-474 (474) 58 protein:vir:9568 Length: 410 # 100.0 8.6E-68 5.3E-71 388.3 37.6 394 53-482 1-410 (410) 59 protein:vir:9751 Length: 422 # 100.0 2.4E-67 1.5E-70 385.8 38.0 404 36-480 1-422 (422) 60 protein:vir:94742 Length: 409 100.0 1.1E-66 6.5E-70 382.3 39.6 393 41-466 1-409 (409) 61 protein:vir:1634 Length: 409 # 100.0 1.3E-65 7.9E-69 376.4 38.1 392 41-466 1-409 (409) 62 protein:vir:38 Length: 496 # N 100.0 8.4E-62 5.2E-65 355.4 44.3 461 13-503 1-496 (496) 63 protein:vir:80959 Length: 499 100.0 8.2E-59 5.1E-62 339.0 47.1 455 30-503 1-499 (499) 64 protein:vir:79703 Length: 505 100.0 8.7E-56 5.4E-59 322.4 43.3 468 11-491 1-505 (505) 65 protein:vir:1587 Length: 508 # 100.0 2E-55 1.3E-58 320.4 42.4 467 11-512 1-508 (508) 66 protein:vir:9815 Length: 500 # 100.0 2.2E-53 1.4E-56 309.3 41.2 466 11-506 1-500 (500) 67 protein:vir:3028 Length: 500 # 100.0 2.2E-53 1.4E-56 309.3 41.2 466 11-506 1-500 (500) 68 protein:vir:4782 Length: 522 # 100.0 9E-50 5.6E-53 289.5 46.1 477 11-508 1-522 (522) 69 protein:vir:101494 Length: 527 100.0 4.1E-48 2.5E-51 280.4 38.7 475 1-512 1-525 (527) 70 protein:vir:102239 Length: 527 100.0 4.7E-48 2.9E-51 280.1 38.7 475 1-512 1-525 (527) 71 protein:vir:78907 Length: 518 100.0 3.1E-47 1.9E-50 275.6 40.9 449 11-505 1-518 (518) 72 protein:vir:98883 Length: 517 100.0 3.9E-46 2.4E-49 269.6 45.0 472 11-512 1-517 (517) 73 protein:vir:7430 Length: 563 # 100.0 9.9E-42 6.1E-45 245.4 37.7 472 1-512 1-541 (563) 74 protein:vir:97265 Length: 513 100.0 2.6E-30 1.6E-33 182.8 35.6 449 38-512 1-501 (513) 75 protein:vir:94956 Length: 452 100.0 1.1E-29 6.8E-33 179.3 33.0 424 40-508 1-452 (452) 76 protein:vir:95149 Length: 501 99.9 3E-26 1.9E-29 160.5 37.3 438 21-512 1-501 (501) 77 protein:vir:80453 Length: 535 99.9 3.3E-25 2.1E-28 154.8 38.3 472 1-512 1-527 (535) 78 protein:vir:78393 Length: 489 99.9 8E-25 4.9E-28 152.7 37.3 445 1-510 1-489 (489) 79 protein:vir:95014 Length: 491 99.9 1.1E-24 6.6E-28 152.0 34.7 445 1-507 1-491 (491) 80 protein:vir:96783 Length: 488 99.9 5.8E-22 3.6E-25 137.0 32.9 433 11-490 1-488 (488) 81 protein:vir:93630 Length: 776 99.8 2E-20 1.2E-23 128.6 29.7 492 1-512 1-679 (776) 82 protein:vir:108295 Length: 711 99.8 4.9E-19 3E-22 121.0 36.1 484 1-512 1-665 (711) 83 protein:vir:3296 Length: 714 # 99.8 4.1E-18 2.6E-21 115.9 38.2 465 21-512 1-642 (714) 84 protein:vir:2764 Length: 714 # 99.8 4.1E-18 2.6E-21 115.9 38.2 465 21-512 1-642 (714) 85 protein:vir:9950 Length: 714 # 99.8 4.1E-18 2.6E-21 115.9 38.2 465 21-512 1-642 (714) 86 protein:vir:10117 Length: 714 99.8 4.1E-18 2.6E-21 115.9 38.2 465 21-512 1-642 (714) 87 protein:vir:817 Length: 714 # 99.8 4.1E-18 2.6E-21 115.9 38.2 465 21-512 1-642 (714) 88 protein:vir:104437 Length: 714 99.8 1.4E-17 8.8E-21 112.9 37.3 466 11-512 1-633 (714) 89 protein:vir:105619 Length: 772 99.8 4.3E-19 2.7E-22 121.3 28.6 481 11-512 1-661 (772) 90 protein:vir:80040 Length: 461 99.7 2.3E-17 1.4E-20 111.8 26.5 423 28-512 1-458 (461) 91 protein:vir:79538 Length: 502 99.7 4E-14 2.5E-17 94.0 40.3 435 39-512 1-501 (502) 92 protein:vir:105429 Length: 708 99.7 5.9E-15 3.7E-18 98.6 33.7 475 30-512 1-636 (708) 93 protein:vir:80165 Length: 651 99.6 2.2E-14 1.3E-17 95.5 35.3 458 14-512 1-634 (651) 94 protein:vir:100920 Length: 725 99.6 1.4E-14 8.5E-18 96.6 32.7 469 31-512 1-631 (725) 95 protein:vir:77597 Length: 725 99.6 2.3E-14 1.4E-17 95.4 33.8 470 31-512 1-631 (725) 96 protein:vir:8846 Length: 705 # 99.6 3.4E-14 2.1E-17 94.4 33.4 459 1-512 1-629 (705) 97 protein:vir:172 Length: 708 # 99.6 3.4E-14 2.1E-17 94.4 32.2 471 30-512 1-641 (708) 98 protein:vir:9263 Length: 725 # 99.6 3.8E-14 2.3E-17 94.2 32.3 471 31-512 1-631 (725) 99 protein:vir:95449 Length: 584 99.5 1.7E-12 1.1E-15 85.1 37.1 444 1-493 1-584 (584) 100 protein:vir:105520 Length: 706 99.5 6E-13 3.7E-16 87.6 32.6 473 30-512 1-637 (706) 101 protein:vir:96738 Length: 505 99.5 3.1E-12 1.9E-15 83.7 37.8 445 16-510 1-505 (505) 102 protein:vir:389 Length: 530 # 99.5 3.9E-12 2.4E-15 83.1 34.0 447 39-512 1-528 (530) 103 protein:vir:6382 Length: 553 # 99.5 7.2E-12 4.5E-15 81.7 35.1 445 38-512 1-553 (553) 104 protein:vir:3420 Length: 533 # 99.5 7.5E-12 4.7E-15 81.6 34.7 447 39-512 1-531 (533) 105 protein:vir:3520 Length: 720 # 99.5 4.2E-12 2.6E-15 83.0 32.9 470 30-512 1-641 (720) 106 protein:vir:5249 Length: 437 # 99.4 7.6E-13 4.7E-16 87.0 25.3 401 39-510 1-437 (437) 107 protein:vir:95542 Length: 548 99.4 2.9E-11 1.8E-14 78.3 36.7 437 39-512 1-515 (548) 108 protein:vir:10321 Length: 495 99.4 6.1E-11 3.8E-14 76.6 34.2 444 14-512 1-495 (495) 109 protein:vir:79647 Length: 435 99.4 1.5E-12 9.4E-16 85.4 23.4 399 11-511 1-435 (435) 110 protein:vir:95821 Length: 763 99.3 5.3E-12 3.3E-15 82.4 26.2 475 7-512 1-666 (763) 111 protein:vir:107662 Length: 427 99.3 1.5E-11 9E-15 80.0 24.3 394 52-512 1-427 (427) 112 protein:vir:107742 Length: 537 99.3 6.8E-11 4.2E-14 76.3 27.2 443 1-512 41-529 (537) 113 protein:vir:63755 Length: 547 99.2 3.6E-10 2.3E-13 72.3 30.3 459 11-512 1-534 (547) 114 protein:vir:104338 Length: 422 99.2 1E-10 6.4E-14 75.3 26.9 389 57-510 1-422 (422) 115 protein:vir:94049 Length: 532 99.2 4.3E-10 2.7E-13 71.9 29.0 430 1-512 35-515 (532) 116 protein:vir:99563 Length: 862 99.0 1.9E-09 1.2E-12 68.4 24.9 447 1-512 66-572 (862) 117 protein:vir:3139 Length: 599 # 99.0 7.7E-09 4.8E-12 65.1 29.1 453 9-500 1-599 (599) 118 protein:vir:80644 Length: 551 99.0 8.2E-09 5.1E-12 64.9 27.1 459 7-512 1-534 (551) 119 protein:vir:96068 Length: 765 99.0 1.4E-09 8.7E-13 69.1 22.3 440 14-512 1-563 (765) 120 protein:vir:3843 Length: 397 # 98.8 5.9E-08 3.7E-11 60.2 27.1 391 46-512 1-397 (397) 121 protein:vir:98506 Length: 555 98.7 7.7E-08 4.8E-11 59.6 40.0 457 30-512 1-547 (555) 122 protein:vir:107404 Length: 555 98.7 7.7E-08 4.8E-11 59.6 40.0 457 30-512 1-547 (555) 123 protein:vir:107822 Length: 555 98.7 7.7E-08 4.8E-11 59.6 40.0 457 30-512 1-547 (555) 124 protein:vir:95315 Length: 559 98.7 7.8E-08 4.8E-11 59.5 40.2 451 30-512 1-557 (559) 125 protein:vir:94599 Length: 641 98.7 8.2E-08 5.1E-11 59.4 31.2 471 11-512 1-628 (641) 126 protein:vir:6240 Length: 457 # 98.7 8.5E-08 5.3E-11 59.3 31.1 409 60-512 1-450 (457) 127 protein:vir:103765 Length: 549 98.7 1.3E-07 8.1E-11 58.3 41.0 452 31-507 1-549 (549) 128 protein:vir:4952 Length: 386 # 98.6 2.3E-07 1.4E-10 57.0 29.3 379 11-511 1-386 (386) 129 protein:vir:1326 Length: 457 # 98.6 2.3E-07 1.4E-10 57.0 29.3 418 39-512 1-455 (457) 130 protein:vir:7321 Length: 556 # 98.6 2.5E-07 1.5E-10 56.8 38.9 453 30-511 1-556 (556) 131 protein:vir:3153 Length: 467 # 98.5 3.1E-07 1.9E-10 56.3 32.6 385 87-512 1-450 (467) 132 protein:vir:102080 Length: 429 98.5 5E-07 3.1E-10 55.1 29.0 410 39-512 1-429 (429) 133 protein:vir:107605 Length: 432 98.5 5.7E-07 3.5E-10 54.8 29.9 411 39-512 1-432 (432) 134 protein:vir:105002 Length: 432 98.5 5.7E-07 3.5E-10 54.8 29.9 411 39-512 1-432 (432) 135 protein:vir:102855 Length: 432 98.5 5.7E-07 3.5E-10 54.8 29.9 411 39-512 1-432 (432) 136 protein:vir:100150 Length: 437 98.4 6.9E-07 4.3E-10 54.4 30.1 407 49-510 1-437 (437) 137 protein:vir:102668 Length: 547 98.4 8.3E-07 5.2E-10 53.9 41.7 438 40-508 1-547 (547) 138 protein:vir:1380 Length: 422 # 98.3 1.2E-06 7.4E-10 53.1 29.4 402 39-512 1-422 (422) 139 protein:vir:4156 Length: 542 # 98.3 1.3E-06 8.2E-10 52.8 23.2 428 20-512 1-472 (542) 140 protein:vir:81152 Length: 411 98.3 1.3E-06 8.4E-10 52.8 28.9 395 39-507 1-411 (411) 141 protein:vir:3361 Length: 535 # 98.2 2.2E-06 1.4E-09 51.6 42.0 452 1-508 1-535 (535) 142 protein:vir:94709 Length: 522 98.2 2.4E-06 1.5E-09 51.4 43.6 450 1-511 1-522 (522) 143 protein:vir:1538 Length: 535 # 98.2 2.6E-06 1.6E-09 51.2 41.7 452 1-509 1-535 (535) 144 protein:vir:8418 Length: 409 # 98.2 2.6E-06 1.6E-09 51.2 31.0 384 60-512 1-409 (409) 145 protein:vir:10447 Length: 536 98.2 2.9E-06 1.8E-09 51.0 39.8 451 31-512 1-534 (536) 146 protein:vir:1266 Length: 416 # 98.2 3E-06 1.8E-09 50.9 28.4 398 15-511 1-416 (416) 147 protein:vir:4454 Length: 414 # 98.2 3E-06 1.8E-09 50.9 32.6 397 39-512 1-411 (414) 148 protein:vir:96579 Length: 576 98.2 3.3E-06 2.1E-09 50.6 30.0 430 1-512 27-533 (576) 149 protein:vir:80796 Length: 574 98.1 3.8E-06 2.3E-09 50.3 31.6 452 1-512 1-525 (574) 150 protein:vir:93610 Length: 454 98.1 4E-06 2.5E-09 50.2 31.2 410 45-512 1-440 (454) 151 protein:vir:4854 Length: 386 # 98.0 7.1E-06 4.4E-09 48.8 26.8 381 11-506 1-386 (386) 152 protein:vir:483 Length: 413 # 98.0 7.3E-06 4.5E-09 48.7 29.8 395 15-512 1-410 (413) 153 protein:vir:4598 Length: 416 # 98.0 8E-06 5E-09 48.5 28.3 399 46-512 1-416 (416) 154 protein:vir:81095 Length: 416 98.0 8E-06 5E-09 48.5 28.3 399 46-512 1-416 (416) 155 protein:vir:9359 Length: 348 # 98.0 9.3E-06 5.8E-09 48.2 29.1 335 106-512 1-347 (348) 156 protein:vir:8883 Length: 543 # 97.9 1.1E-05 6.6E-09 47.9 38.3 458 11-512 1-535 (543) 157 protein:vir:81072 Length: 432 97.9 1.1E-05 6.8E-09 47.8 28.0 396 21-511 1-432 (432) 158 protein:vir:79984 Length: 441 97.9 1.2E-05 7.5E-09 47.6 29.9 411 27-512 1-441 (441) 159 protein:vir:9408 Length: 441 # 97.9 1.2E-05 7.5E-09 47.6 29.9 411 27-512 1-441 (441) 160 protein:vir:2198 Length: 536 # 97.9 1.4E-05 8.7E-09 47.2 42.2 451 31-512 1-534 (536) 161 protein:vir:100691 Length: 535 97.9 1.5E-05 9E-09 47.1 35.5 448 11-512 1-532 (535) 162 protein:vir:2683 Length: 412 # 97.9 1.5E-05 9E-09 47.1 32.6 395 46-512 1-411 (412) 163 protein:vir:100039 Length: 522 97.9 1.5E-05 9.1E-09 47.1 39.1 440 39-512 1-522 (522) 164 protein:vir:99312 Length: 563 97.8 1.5E-05 9.2E-09 47.1 32.4 428 1-512 55-531 (563) 165 protein:vir:95599 Length: 563 97.8 1.5E-05 9.2E-09 47.1 32.4 428 1-512 55-531 (563) 166 protein:vir:78696 Length: 542 97.8 1.5E-05 9.4E-09 47.0 41.5 434 38-508 1-542 (542) 167 protein:vir:98396 Length: 441 97.8 1.6E-05 1E-08 46.9 29.9 411 27-512 1-441 (441) 168 protein:vir:4828 Length: 382 # 97.8 1.9E-05 1.2E-08 46.5 28.0 376 11-511 1-382 (382) 169 protein:vir:1785 Length: 555 # 97.8 2.1E-05 1.3E-08 46.2 38.9 440 38-512 1-553 (555) 170 protein:vir:4194 Length: 540 # 97.8 2.1E-05 1.3E-08 46.2 27.5 425 28-512 1-480 (540) 171 protein:vir:7407 Length: 392 # 97.8 2.2E-05 1.4E-08 46.1 31.5 381 38-503 1-392 (392) 172 protein:vir:10362 Length: 432 97.7 2.3E-05 1.4E-08 46.0 27.6 395 21-511 1-432 (432) 173 protein:vir:102727 Length: 945 97.7 2.5E-05 1.6E-08 45.8 29.7 460 1-512 12-537 (945) 174 protein:vir:105064 Length: 421 97.7 2.7E-05 1.7E-08 45.6 24.3 398 14-511 1-421 (421) 175 protein:vir:102118 Length: 409 97.6 4.2E-05 2.6E-08 44.6 32.0 395 44-507 1-409 (409) 176 protein:vir:99853 Length: 488 97.6 4.6E-05 2.8E-08 44.4 30.2 390 39-512 1-416 (488) 177 protein:vir:3989 Length: 392 # 97.5 4.7E-05 2.9E-08 44.3 28.6 384 12-503 1-392 (392) 178 protein:vir:1023 Length: 392 # 97.5 4.7E-05 2.9E-08 44.3 28.6 384 12-503 1-392 (392) 179 protein:vir:79772 Length: 648 97.5 4.8E-05 3E-08 44.2 32.2 438 1-512 1-509 (648) 180 protein:vir:101648 Length: 518 97.5 5E-05 3.1E-08 44.2 30.2 418 36-512 1-453 (518) 181 protein:vir:7853 Length: 518 # 97.5 5.3E-05 3.3E-08 44.0 29.8 420 36-512 1-453 (518) 182 protein:vir:4995 Length: 384 # 97.5 5.4E-05 3.4E-08 44.0 21.6 377 11-508 1-384 (384) 183 protein:vir:189 Length: 424 # 97.5 5.5E-05 3.4E-08 43.9 26.3 396 39-508 1-424 (424) 184 protein:vir:5737 Length: 419 # 97.5 5.5E-05 3.4E-08 43.9 26.9 395 43-512 1-416 (419) 185 protein:vir:99232 Length: 526 97.5 6.3E-05 3.9E-08 43.6 35.4 418 17-512 1-460 (526) 186 protein:vir:99672 Length: 532 97.5 6.4E-05 4E-08 43.6 39.0 447 1-512 1-532 (532) 187 protein:vir:3868 Length: 417 # 97.4 7.1E-05 4.4E-08 43.3 29.0 385 64-512 1-417 (417) 188 protein:vir:94572 Length: 535 97.4 8.4E-05 5.2E-08 42.9 39.1 450 31-509 1-535 (535) 189 protein:vir:97060 Length: 432 97.3 9.7E-05 6E-08 42.6 27.9 396 21-511 1-432 (432) 190 protein:vir:95378 Length: 406 97.3 9.7E-05 6E-08 42.6 27.4 390 39-512 1-406 (406) 191 protein:vir:105782 Length: 449 97.3 0.0001 6.2E-08 42.5 26.1 409 38-512 1-445 (449) 192 protein:vir:960 Length: 413 # 97.3 0.00011 6.6E-08 42.4 27.7 379 31-507 1-413 (413) 193 protein:vir:103860 Length: 528 97.3 0.00011 7E-08 42.2 31.4 414 17-512 1-450 (528) 194 protein:vir:1884 Length: 424 # 97.2 0.00014 8.4E-08 41.8 26.5 396 39-508 1-424 (424) 195 protein:vir:100187 Length: 385 97.2 0.00014 8.8E-08 41.7 32.6 380 11-510 1-385 (385) 196 protein:vir:101647 Length: 460 97.1 0.00016 1E-07 41.3 28.6 412 41-510 1-460 (460) 197 protein:vir:78641 Length: 278 97.1 0.00017 1E-07 41.3 25.2 271 106-434 1-278 (278) 198 protein:vir:4337 Length: 434 # 97.1 0.00018 1.1E-07 41.1 28.4 397 1-512 1-434 (434) 199 protein:vir:94426 Length: 409 97.1 0.00018 1.1E-07 41.1 31.9 394 36-508 1-409 (409) 200 protein:vir:93943 Length: 409 97.1 0.00019 1.2E-07 41.0 31.3 397 21-511 1-409 (409) 201 protein:vir:100650 Length: 395 97.0 0.00024 1.5E-07 40.4 23.3 376 60-512 1-395 (395) 202 protein:vir:9507 Length: 395 # 97.0 0.00024 1.5E-07 40.4 23.3 376 60-512 1-395 (395) 203 protein:vir:101289 Length: 395 97.0 0.00024 1.5E-07 40.4 23.3 376 60-512 1-395 (395) 204 protein:vir:9702 Length: 406 # 96.9 0.00026 1.6E-07 40.2 27.8 391 46-512 1-404 (406) 205 protein:vir:6322 Length: 510 # 96.9 0.00028 1.7E-07 40.1 39.0 436 38-505 1-510 (510) 206 protein:vir:100882 Length: 383 96.9 0.00029 1.8E-07 40.0 29.7 377 27-511 1-383 (383) 207 protein:vir:6210 Length: 394 # 96.6 0.00051 3.1E-07 38.6 23.5 380 11-510 1-394 (394) 208 protein:vir:80333 Length: 419 96.5 0.00052 3.3E-07 38.6 29.3 393 36-512 1-415 (419) 209 protein:vir:79063 Length: 491 96.5 0.00057 3.6E-07 38.4 32.6 406 16-512 1-428 (491) 210 protein:vir:101541 Length: 694 96.5 0.00058 3.6E-07 38.3 20.6 416 1-512 56-544 (694) 211 protein:vir:107880 Length: 491 96.4 0.00063 3.9E-07 38.1 32.9 401 16-512 1-421 (491) 212 protein:vir:80134 Length: 403 96.4 0.0007 4.3E-07 37.9 27.4 385 11-512 1-403 (403) 213 protein:vir:8100 Length: 466 # 96.4 0.0007 4.4E-07 37.9 29.0 424 14-510 1-466 (466) 214 protein:vir:78161 Length: 355 96.3 0.00078 4.8E-07 37.6 20.6 313 150-512 1-335 (355) 215 protein:vir:79233 Length: 526 96.3 0.00079 4.9E-07 37.6 38.1 415 17-512 1-450 (526) 216 protein:vir:96980 Length: 409 96.3 0.00081 5E-07 37.5 32.5 394 36-511 1-409 (409) 217 protein:vir:3648 Length: 695 # 96.2 0.00089 5.5E-07 37.3 20.6 413 1-512 67-545 (695) 218 protein:vir:1431 Length: 419 # 96.1 0.00098 6.1E-07 37.1 29.1 395 36-512 1-416 (419) 219 protein:vir:104259 Length: 403 96.1 0.00099 6.1E-07 37.1 26.5 384 39-512 1-403 (403) 220 protein:vir:78589 Length: 695 96.0 0.0011 6.9E-07 36.8 21.0 413 1-512 67-545 (695) 221 protein:vir:4509 Length: 424 # 95.9 0.0012 7.6E-07 36.5 29.8 396 27-510 1-424 (424) 222 protein:vir:103219 Length: 201 95.9 0.00075 4.7E-07 37.7 12.2 189 286-511 1-201 (201) 223 protein:vir:1986 Length: 512 # 95.7 0.0015 9.4E-07 36.0 34.2 420 1-512 1-444 (512) 224 protein:vir:105641 Length: 516 95.7 0.0016 9.9E-07 35.9 39.4 431 30-507 1-516 (516) 225 protein:vir:7017 Length: 515 # 95.7 0.0016 1E-06 35.9 41.2 430 31-507 1-515 (515) 226 protein:vir:100249 Length: 431 95.5 0.0019 1.2E-06 35.5 31.4 395 39-504 1-431 (431) 227 protein:vir:103330 Length: 517 95.4 0.0021 1.3E-06 35.3 40.7 428 21-506 1-517 (517) 228 protein:vir:99452 Length: 651 95.3 0.0022 1.4E-06 35.1 18.7 452 11-512 1-539 (651) 229 protein:vir:106716 Length: 698 95.2 0.0025 1.5E-06 34.9 23.6 421 1-512 67-555 (698) 230 protein:vir:96988 Length: 516 95.2 0.0026 1.6E-06 34.8 39.5 431 30-507 1-516 (516) 231 protein:vir:98816 Length: 446 94.8 0.0036 2.2E-06 34.0 25.0 411 14-470 1-446 (446) 232 protein:vir:94666 Length: 723 94.3 0.0048 3E-06 33.3 32.5 394 70-512 1-446 (723) 233 protein:vir:4089 Length: 395 # 94.1 0.0053 3.3E-06 33.1 24.4 380 39-510 1-395 (395) 234 protein:vir:95965 Length: 385 94.1 0.0053 3.3E-06 33.0 25.9 366 39-512 1-385 (385) 235 protein:vir:78942 Length: 510 93.9 0.0061 3.8E-06 32.7 41.9 435 38-505 1-510 (510) 236 protein:vir:1082 Length: 359 # 93.8 0.0063 3.9E-06 32.7 28.2 347 46-470 1-359 (359) 237 protein:vir:81218 Length: 423 92.9 0.0096 6E-06 31.6 30.7 395 14-507 1-423 (423) 238 protein:vir:108215 Length: 469 89.4 0.027 1.7E-05 29.2 31.7 413 49-512 1-465 (469) 239 protein:vir:98643 Length: 395 89.0 0.029 1.8E-05 29.0 22.6 375 39-512 1-394 (395) 240 protein:vir:80211 Length: 514 88.0 0.035 2.2E-05 28.6 40.0 428 38-499 1-514 (514) 241 protein:vir:77981 Length: 448 85.7 0.051 3.1E-05 27.7 28.9 416 1-512 1-440 (448) 242 protein:vir:8317 Length: 409 # 84.4 0.061 3.8E-05 27.3 26.8 362 27-498 1-409 (409) 243 protein:vir:78310 Length: 376 83.4 0.069 4.3E-05 27.0 28.3 360 11-511 1-376 (376) 244 protein:vir:94002 Length: 378 83.1 0.071 4.4E-05 26.9 22.0 348 39-511 1-378 (378) 245 protein:vir:93867 Length: 378 82.8 0.074 4.6E-05 26.8 21.1 350 39-511 1-378 (378) 246 protein:vir:1661 Length: 378 # 80.6 0.093 5.8E-05 26.2 23.1 350 57-511 1-378 (378) 247 protein:vir:104892 Length: 558 79.6 0.1 6.4E-05 26.0 25.7 462 1-512 1-552 (558) 248 protein:vir:95254 Length: 488 77.8 0.12 7.5E-05 25.6 25.7 430 1-512 1-487 (488) 249 protein:vir:104500 Length: 537 76.9 0.13 8.1E-05 25.4 23.7 451 1-512 1-536 (537) 250 protein:vir:9641 Length: 395 # 76.6 0.13 8.3E-05 25.4 27.1 369 60-512 1-394 (395) 251 protein:vir:79511 Length: 448 75.7 0.14 8.9E-05 25.2 31.5 396 48-512 1-447 (448) 252 protein:vir:345 Length: 663 # 70.4 0.21 0.00013 24.3 32.0 457 1-512 1-599 (663) 253 protein:vir:5839 Length: 533 # 66.7 0.26 0.00016 23.8 20.1 430 1-512 1-526 (533) 254 protein:vir:78191 Length: 351 64.6 0.3 0.00018 23.5 22.5 317 62-441 1-351 (351) 255 protein:vir:4698 Length: 251 # 60.4 0.37 0.00023 22.9 19.3 242 46-341 1-251 (251) 256 protein:vir:858 Length: 378 # 56.9 0.44 0.00028 22.5 23.8 350 46-511 1-378 (378) 257 protein:vir:94869 Length: 378 46.4 0.74 0.00046 21.3 23.0 350 46-512 1-378 (378) 258 protein:vir:3780 Length: 345 # 43.5 0.84 0.00052 21.0 23.6 311 60-435 1-345 (345) 259 protein:vir:6058 Length: 344 # 41.9 0.91 0.00056 20.8 21.7 311 49-439 1-344 (344) 260 protein:vir:5665 Length: 511 # 36.8 1.2 0.00071 20.2 20.7 427 1-497 4-511 (511) 261 protein:vir:267 Length: 348 # 34.3 1.3 0.00081 20.0 23.3 322 49-441 1-348 (348) 262 protein:vir:100328 Length: 346 29.2 1.7 0.001 19.4 22.2 319 49-439 1-346 (346) 263 protein:vir:79207 Length: 351 28.0 1.8 0.0011 19.2 23.1 317 62-447 1-351 (351) 264 protein:vir:79150 Length: 368 24.4 2.2 0.0014 18.7 21.1 330 53-448 1-368 (368) 265 protein:vir:106999 Length: 564 23.0 2.4 0.0015 18.5 23.0 467 1-512 1-560 (564) 266 protein:vir:5691 Length: 344 # 22.1 2.5 0.0015 18.4 22.4 299 49-439 1-344 (344) 267 protein:vir:98853 Length: 219 20.4 2.8 0.0017 18.2 16.2 214 180-438 1-219 (219) No 1 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=100.00 E-value=5.8e-124 Score=696.23 Aligned_cols=512 Identities=99% Similarity=1.412 Sum_probs=491.1 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |++.|.|+.+.+++.|..++|++++|++|.+.+.+.++.++.+.|.++|.+|...+++||+++.+||.|+|+++++.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~ 80 (512) T protein:vir:97 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) T ss_pred CccceeccCceeeeeCceeeeccccccccccCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999888888 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCC Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g 160 (512) +++.++++|+++||+++||++.++|++|+|++++++++++++.|++||+.|+++.++.++++++++||+||+++|.|++| T Consensus 81 ~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~ 160 (512) T protein:vir:97 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (512) T ss_pred cccccCcceeecchHHHHHHHHhhhhcccCceeccCChHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCC Confidence 88899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccc Q lcl|NC_010808. 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSF 240 (512) Q Consensus 161 ~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (512) ++++++++|.++||+||++..++++++||+|.+...++.....++++++||++.+++|....+++...........+|+| T Consensus 161 ~~~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (512) T protein:vir:97 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSF 240 (512) T ss_pred ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccC Confidence 99999999999999999998899999999999998888888889999999999999999988887777777788899999 Q ss_pred cccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcc Q lcl|NC_010808. 241 ERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRD 320 (512) Q Consensus 241 ~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (512) |.||||+|+|+++|+|+|+++++|||+||.++|++++.+++|++|++|++|+.+.+..++..++.++++.+.+....+.. T Consensus 241 g~vPvv~~~nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (512) T protein:vir:97 241 ERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYENRD 320 (512) T ss_pred cccceEeecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhhhhhhhcccccccccchhhcc Confidence 99999999999999999999999999999999999999999999999999999888889999999999998888888888 Q ss_pred cccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 321 TGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 400 (512) Q Consensus 321 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~ 400 (512) .....+++++++|++++.+.+++++++++|.+.|+.+|++|++++++++||+||+||++++.++.++|+++++.|+++|+ T Consensus 321 ~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~ 400 (512) T protein:vir:97 321 TGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 400 (512) T ss_pred cccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88888999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_010808. 401 RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKES 480 (512) Q Consensus 401 ~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~ 480 (512) +++++|++++...+....+.++.+++++|++++|.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++. T Consensus 401 ~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl~giiS~et~~~~l~~v~d~~~E~eri~~E~~~~ 480 (512) T protein:vir:97 401 RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKES 480 (512) T ss_pred HHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHH Confidence 99999999999888777778888999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 481 IKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ++..+.....++++.+++++++++++..+++| T Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 481 IKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) T ss_pred HHHHhhcccCCCCCCCCCCCCCCccccccccC Confidence 98888888888988888889999999999999 No 2 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=100.00 E-value=1.5e-120 Score=677.52 Aligned_cols=511 Identities=97% Similarity=1.368 Sum_probs=482.3 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |+++|.|++..+++.+++++|++.+|.+|.+.+.+.+...+.+.|.++|.+|...+++||+++++||.|+|+++.+.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~ 80 (511) T protein:vir:93 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCcccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999888888 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCC Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g 160 (512) +.+.++++|+++||+++||++.++||+|+||++++++++.++.|++||+.|+++.++.++++++++||+||++||.|++| T Consensus 81 ~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~ 160 (511) T protein:vir:93 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (511) T ss_pred cccccCcceeecchHHHHHHHHhhhhcccCeeeccCChHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCC Confidence 88899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccc Q lcl|NC_010808. 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSF 240 (512) Q Consensus 161 ~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (512) ++++++++|.+++|+||++..++++++||||.+...++...+.++++++||++.+++|....+++...........+|+| T Consensus 161 ~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (511) T protein:vir:93 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSF 240 (511) T ss_pred ceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccCC Confidence 99999999999999999998889999999999988888888889999999999999999888877777777788899999 Q ss_pred cccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcc Q lcl|NC_010808. 241 ERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRD 320 (512) Q Consensus 241 ~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (512) |.||||+|+|+++|+|+|+++++|||+||.++|++++.+++|++|+++++|+.+.+.++....+..+++.+.... .... T Consensus 241 g~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 319 (511) T protein:vir:93 241 ERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTV-YADS 319 (511) T ss_pred CccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchhhcccccccceeccccc-cccc Confidence 999999999999999999999999999999999999999999999999999998888888888887777665543 3334 Q ss_pred cccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 321 TGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 400 (512) Q Consensus 321 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~ 400 (512) .+...+++++++|++++.+.+++++++++|.+.|+.+|++|++++++++||+||+||+++++++.++|+++++.|+++|+ T Consensus 320 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~ 399 (511) T protein:vir:93 320 EGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 399 (511) T ss_pred ccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 45566788999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_010808. 401 RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKES 480 (512) Q Consensus 401 ~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~ 480 (512) +++++|+++++..+....+.++.+++++|++++|.|.++.++++++++|++|+||+++++|+++||++|++||++|+++. T Consensus 400 ~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~ 479 (511) T protein:vir:93 400 RRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKES 479 (511) T ss_pred HHHHHHHHHHHhccCcccccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHH Confidence 99999999999888777778888999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 481 IKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) .+..+.....++++.+++++++++++.++++| T Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 480 IKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHhhhcccCCCCCCCCCCCCcccccccccC Confidence 88888888888888888888888999999888 No 3 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=100.00 E-value=3.7e-120 Score=675.34 Aligned_cols=511 Identities=97% Similarity=1.370 Sum_probs=482.7 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |+++|.|++..+++.+++++|++.+|..|.+.+.+.+...+++.|.++|.+|...+++||+++++||.|+|+++++.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~ 80 (511) T protein:vir:96 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999888888 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCC Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g 160 (512) +.+.++++|+++||+++||++.++|++|+||++++++++.++.|++||+.|+++.++.++++++++||+||+++|+|++| T Consensus 81 ~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~ 160 (511) T protein:vir:96 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (511) T ss_pred cccccCcceeecchHHHHHHHHHhhhccCCceeecCchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCC Confidence 88899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccc Q lcl|NC_010808. 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSF 240 (512) Q Consensus 161 ~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (512) ++++++++|.+++|+|+++...+++++||+|.+...++.....++++++||++.+++|....+++...........+|+| T Consensus 161 ~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (511) T protein:vir:96 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSF 240 (511) T ss_pred ceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccC Confidence 99999999999999999998899999999999988888888888999999999999999988887777777888899999 Q ss_pred cccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcc Q lcl|NC_010808. 241 ERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRD 320 (512) Q Consensus 241 ~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (512) |.||||+|+|+++|+|+|+++++|||+||.++|++++.+++|++|++|++|....+..++..++.++.+.+.... .... T Consensus 241 ~~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 319 (511) T protein:vir:96 241 ERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTV-YADS 319 (511) T ss_pred CceeeEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhcccccccceeccccc-cccc Confidence 999999999999999999999999999999999999999999999999999988888888888877777664433 3334 Q ss_pred cccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 321 TGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 400 (512) Q Consensus 321 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~ 400 (512) .+...+++++++||+++.+.+++++++++|.+.|+.+|++|++++++++||+||+||+++++++.++|+++++.|+++|+ T Consensus 320 ~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~ 399 (511) T protein:vir:96 320 EGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 399 (511) T ss_pred ccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 45566778999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_010808. 401 RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKES 480 (512) Q Consensus 401 ~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~ 480 (512) +++++|+++++..+....+.++.+++++|++++|.|.++.++++++++|++|+||+++++|+++||++|++||++|+++. T Consensus 400 ~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~G~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~ 479 (511) T protein:vir:96 400 RRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKES 479 (511) T ss_pred HHHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH Confidence 99999999999888777788889999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 481 IKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) .+..+......+.+.++++++++++++.++|| T Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 480 IKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHhhccccCCCCCCCCCCCCcccccccccC Confidence 88888888888888999999999999999999 No 4 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=100.00 E-value=4e-120 Score=675.16 Aligned_cols=511 Identities=97% Similarity=1.375 Sum_probs=483.1 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |+++|.|++..+++.+++.+|++.+|.+|.+.+.+.+...+++.|.++|.+|...+++||+++.+||.|+|+++++.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~ 80 (511) T protein:vir:10 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhcCCccCchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999888888 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCC Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g 160 (512) ..+.++++|+++||+++||++.++|++|+|++++++++++++.|++||+.|+++.++.++++++++||+||+++|.|++| T Consensus 81 ~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg 160 (511) T protein:vir:10 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQDD 160 (511) T ss_pred cccccCcceeecchHHHHHHHHhhhhcccCceeecCchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCC Confidence 88899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccc Q lcl|NC_010808. 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSF 240 (512) Q Consensus 161 ~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (512) ++++++++|.+++|+|+++..++++++||+|.+...++...+.++++++||++.+++|....+++...........+|+| T Consensus 161 ~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (511) T protein:vir:10 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSF 240 (511) T ss_pred ceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccC Confidence 99999999999999999998889999999999998888888889999999999999999988887777777888899999 Q ss_pred cccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcc Q lcl|NC_010808. 241 ERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRD 320 (512) Q Consensus 241 ~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (512) |.||||+|+|+++|+|+|+++++|||+||.++|++++.+++|++|++|++|+...+.+++..++..+++.+.+.... .. T Consensus 241 ~~vPvv~f~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 319 (511) T protein:vir:10 241 ERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYA-DS 319 (511) T ss_pred cceeEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchhhccchhccceeccccccc-cc Confidence 99999999999999999999999999999999999999999999999999998888888888888887776554333 34 Q ss_pred cccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 321 TGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 400 (512) Q Consensus 321 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~ 400 (512) .+...+++++++|++++.+.+++++++++|.++|+.+|++|++++++++||+||+||+++++++.+||.++++.|+++|+ T Consensus 320 ~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~ 399 (511) T protein:vir:10 320 EGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 399 (511) T ss_pred ccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44566778999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_010808. 401 RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKES 480 (512) Q Consensus 401 ~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~ 480 (512) +++++|+++++..+....+.++.+++++|++++|+|.++.+++++++.|++|+||+++++|+++||++|++||++|+++. T Consensus 400 ~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~G~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~ 479 (511) T protein:vir:10 400 RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKES 479 (511) T ss_pred HHHHHHHHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHH Confidence 99999999999888777778888999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 481 IKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) .+..+.....++++.++++++++++++.+++| T Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 480 IKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHhhhcccCCCCCCCCCCCCcccCcccccC Confidence 88888888888888888889999999999999 No 5 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=100.00 E-value=3.5e-120 Score=675.47 Aligned_cols=511 Identities=97% Similarity=1.373 Sum_probs=482.1 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |+++|.|+++.+++.+++++|.+.+|..|.+.+.+.+...+++.|.++|.+|...+++||+++++||.|+|+++++.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~ 80 (511) T protein:vir:78 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999888888 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCC Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g 160 (512) ..+.++++|+++||+++||++.++||+|+||++++++++.++.|++||+.|+++.++.++++++++||+||+++|+|++| T Consensus 81 ~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg 160 (511) T protein:vir:78 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (511) T ss_pred cccccCcceeecchHHHHHHHHhhhhcccCceeecCchHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCC Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccc Q lcl|NC_010808. 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSF 240 (512) Q Consensus 161 ~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (512) ++++++++|.+++|+||++..+++++|||+|.+...++...+.++++++||++.+++|....+++........+..+|+| T Consensus 161 ~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (511) T protein:vir:78 161 ETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSF 240 (511) T ss_pred ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcC Confidence 99999999999999999988889999999999998888888888999999999999999988877777777788899999 Q ss_pred cccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcc Q lcl|NC_010808. 241 ERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRD 320 (512) Q Consensus 241 ~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (512) |.||||+|+|+++|+|+|+++++|||+||.++|++++.+++|++|++|++|....+.+++...+..+++....... ... T Consensus 241 g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 319 (511) T protein:vir:78 241 ERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVY-VDA 319 (511) T ss_pred cccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccce-ecc Confidence 9999999999999999999999999999999999999999999999999999888888888887777776654332 233 Q ss_pred cccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 321 TGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 400 (512) Q Consensus 321 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~ 400 (512) .+....++++++|++++.+.+++++++++|.+.|+.+|++|+++++++++|+||+||+++++++.++|+++++.|+++|+ T Consensus 320 ~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~ 399 (511) T protein:vir:78 320 EGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 399 (511) T ss_pred ccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44556778999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_010808. 401 RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKES 480 (512) Q Consensus 401 ~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~ 480 (512) +++++|+++++..+....+.++.+++++|++++|.|.++.++++++++|++|+||+++++|+++||++|++||++|+++. T Consensus 400 ~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~ 479 (511) T protein:vir:78 400 RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKES 479 (511) T ss_pred HHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH Confidence 99999999999888777778888999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 481 IKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) .+..+.....++.+.++++++++++++.+++| T Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 480 IKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHhhccccCCCCCCCCCCCCCccCcccccC Confidence 99888888888888889999999999999999 No 6 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=100.00 E-value=3.5e-120 Score=675.47 Aligned_cols=511 Identities=97% Similarity=1.373 Sum_probs=482.1 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |+++|.|+++.+++.+++++|.+.+|..|.+.+.+.+...+++.|.++|.+|...+++||+++++||.|+|+++++.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~ 80 (511) T protein:vir:96 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999888888 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCC Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g 160 (512) ..+.++++|+++||+++||++.++||+|+||++++++++.++.|++||+.|+++.++.++++++++||+||+++|+|++| T Consensus 81 ~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg 160 (511) T protein:vir:96 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (511) T ss_pred cccccCcceeecchHHHHHHHHhhhhcccCceeecCchHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCC Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccc Q lcl|NC_010808. 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSF 240 (512) Q Consensus 161 ~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (512) ++++++++|.+++|+||++..+++++|||+|.+...++...+.++++++||++.+++|....+++........+..+|+| T Consensus 161 ~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (511) T protein:vir:96 161 ETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSF 240 (511) T ss_pred ceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcC Confidence 99999999999999999988889999999999998888888888999999999999999988877777777788899999 Q ss_pred cccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcc Q lcl|NC_010808. 241 ERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRD 320 (512) Q Consensus 241 ~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (512) |.||||+|+|+++|+|+|+++++|||+||.++|++++.+++|++|++|++|....+.+++...+..+++....... ... T Consensus 241 g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 319 (511) T protein:vir:96 241 ERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVY-VDA 319 (511) T ss_pred cccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccce-ecc Confidence 9999999999999999999999999999999999999999999999999999888888888887777776654332 233 Q ss_pred cccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 321 TGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 400 (512) Q Consensus 321 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~ 400 (512) .+....++++++|++++.+.+++++++++|.+.|+.+|++|+++++++++|+||+||+++++++.++|+++++.|+++|+ T Consensus 320 ~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~ 399 (511) T protein:vir:96 320 EGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 399 (511) T ss_pred ccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44556778999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_010808. 401 RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKES 480 (512) Q Consensus 401 ~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~ 480 (512) +++++|+++++..+....+.++.+++++|++++|.|.++.++++++++|++|+||+++++|+++||++|++||++|+++. T Consensus 400 ~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~ 479 (511) T protein:vir:96 400 RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKES 479 (511) T ss_pred HHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH Confidence 99999999999888777778888999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 481 IKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) .+..+.....++.+.++++++++++++.+++| T Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 480 IKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred HHHHhhccccCCCCCCCCCCCCCccCcccccC Confidence 99888888888888889999999999999999 No 7 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=100.00 E-value=2.1e-119 Score=671.17 Aligned_cols=511 Identities=95% Similarity=1.359 Sum_probs=481.4 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |+++|.|+...+++.|++.+|.+.+|.+|.+.+.+.+...+.+.|.++|.+|...+++||+++++||.|+|+++++.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~ 80 (511) T protein:vir:99 1 MLKVNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (511) T ss_pred CccccchhhhhhhhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999888888 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCC Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g 160 (512) +.+.++++|+++||+++||++.++|++|+|++++++++++++.|++||+.|+++.++.+++++++++|+||+++|.|++| T Consensus 81 ~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~ 160 (511) T protein:vir:99 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (511) T ss_pred cccccCcceeecchHHHHHHHHHhhhcccCceeecCchHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCC Confidence 88899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccc Q lcl|NC_010808. 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSF 240 (512) Q Consensus 161 ~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (512) ++++++++|.++||+||++...+++++||+|.+...++...+.++++++||++.+++|.....+............+|+| T Consensus 161 ~~~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (511) T protein:vir:99 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSF 240 (511) T ss_pred ceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCC Confidence 99999999999999999998889999999999998888888889999999999999999988877777777788899999 Q ss_pred cccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcc Q lcl|NC_010808. 241 ERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRD 320 (512) Q Consensus 241 ~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (512) |.||||+|+|+++|+|+|+++++|||+||.++|++++.+++|++|+++++|....+.++...++..+++...... .... T Consensus 241 g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 319 (511) T protein:vir:99 241 ERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTV-YADS 319 (511) T ss_pred CccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchhhcccccccceeccccc-cccc Confidence 999999999999999999999999999999999999999999999999999988888888888887777664433 3334 Q ss_pred cccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 321 TGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 400 (512) Q Consensus 321 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~ 400 (512) .+....++++++|++++.+.+++++++++|.+.|+.+|++|++++++++||+||+||+++++++.+||+++++.|+++|+ T Consensus 320 ~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~ 399 (511) T protein:vir:99 320 EGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 399 (511) T ss_pred ccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 45566778999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_010808. 401 RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKES 480 (512) Q Consensus 401 ~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~ 480 (512) +++++|+++++..+......++.+++++|++++|.|.++.+++++++.|++|+||+++++|+++||++|++||++|++++ T Consensus 400 ~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl~GiiS~et~l~~l~~v~D~~~E~~ri~~E~~~~ 479 (511) T protein:vir:99 400 RRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKES 479 (511) T ss_pred HHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhCCCCCCHHHHHHHHHHHHHHH Confidence 99999999999888777778888999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 481 IKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ++..+.....++++.++++.+++++++.|++| T Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~e 511 (511) T protein:vir:99 480 IKKAQKNMYQDPRNINDDEQDDSTKDSIDKKE 511 (511) T ss_pred HHHHhhcccccCCCCCCCCCCCCCcCcccccC Confidence 99888888888888888888888888888888 No 8 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=100.00 E-value=3.2e-107 Score=604.42 Aligned_cols=493 Identities=39% Similarity=0.622 Sum_probs=421.9 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |++.+.|..+.+...+.+++|++++|+.|+.+..+.+.......|.++|.+|...+++||+++.+||.|+|..+.+.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~~ 80 (502) T protein:vir:48 1 MMEQTLFTDSTGQDLVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRR 80 (502) T ss_pred CceeEEEEecchhHHHhhcccChhHHhhhcccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccc Confidence 99999999999999999999999999999999999888888899999999999999999999999999986555555566 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecCCch----hHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEE Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDK----DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR 156 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~ 156 (512) ....++++|+++||+++||++.++|++|+|++++++++ .+++.|+++|+.|+|+.++.++++++++||+||+++|. T Consensus 81 ~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ 160 (502) T protein:vir:48 81 KDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNEDNSQNDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYR 160 (502) T ss_pred cccccccceeecchHHHHHHHHhhhhcccCeeEecCCccchhHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEe Confidence 77788999999999999999999999999999998753 45678999999999999999999999999999999999 Q ss_pred CCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccc Q lcl|NC_010808. 157 NQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFE 236 (512) Q Consensus 157 d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (512) |++|++++++++|.+++|+||++..++++++||+|.....+ ...+++++||++.+++|...+.. ...... T Consensus 161 dedg~~~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~~----~~~~~~~iyt~~~i~~~~~~~~~------~~~~~~ 230 (502) T protein:vir:48 161 SEYDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQ----NAKDVVEIYTNQHIYTLDASDSF------NEISVT 230 (502) T ss_pred CCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEEeecC----CcEEEEEEEeCCeEEEEEeCCce------eeccce Confidence 99999999999999999999998888999999999775433 34678899999999998765433 345678 Q ss_pred cccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChh-hhhhhhhccccccchhh Q lcl|NC_010808. 237 SHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPD-EVKKQKEANVLFLEPTV 315 (512) Q Consensus 237 ~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~-~~~~~~~~~~~~~~~~~ 315 (512) +|+||.||||+|+|+++|+|+|+++++|||+||+++|++++.+++|++|+++++|....+.+ ....++..+.+.+.... T Consensus 231 ~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (502) T protein:vir:48 231 PHAFGTVPITEFLNNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQASDMKRTRLMQLKPPK 310 (502) T ss_pred ecCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchhhhhhcceeeccccc Confidence 99999999999999999999999999999999999999999999999999999997655433 33444455554443221 Q ss_pred hhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 316 YENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLF 395 (512) Q Consensus 316 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~ 395 (512) ...+..++++++|++++.+.+++++++++|.++|+.+|++|++++++++||+||+||++++++|.+||+.+++.| T Consensus 311 -----~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~ 385 (502) T protein:vir:48 311 -----SADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLFGLDQDRVDTQSQF 385 (502) T ss_pred -----cccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHHHHHHHHHHHHHHH Confidence 122345678999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_010808. 396 TKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEE 475 (512) Q Consensus 396 ~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~ 475 (512) +++|++++++|+++++..+.. .++++.+|+++|++++|+|.++.+++++|++|++|+||+++++|+++||++|++||++ T Consensus 386 ~~~l~~~~~li~~~~~~~~~~-~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~l~~l~~v~D~~~E~~ri~~ 464 (502) T protein:vir:48 386 TQGLKRRYRLAARIGSLVNEF-KDFDESRLKITFTPNLPKSLYEQVSILNDLGGQVSQETALSLSGLVENPTEELDKINE 464 (502) T ss_pred HHHHHHHHHHHHHHHhhcccc-cccccccceEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHH Confidence 999999999999999876543 3567788999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 476 DEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 476 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) |+++...........+.. .++.++...+..+++| T Consensus 465 E~~~~~~~~~~~~~~~~~---~~~~d~~~e~~~~~~~ 498 (502) T protein:vir:48 465 ESSKIDFKGYPSYFYDNV---GKYTDEVKETHTDDFE 498 (502) T ss_pred HHHhhhhhcccccccccc---cccCCCccCCCCcCcC Confidence 987543222222222111 1122222223333333 No 9 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=100.00 E-value=1.7e-105 Score=595.00 Aligned_cols=495 Identities=39% Similarity=0.616 Sum_probs=420.6 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |- .--|..+.....+++++|++++|..|+++..+.+....+..|.++|.+|...+++||+++.+||.|+|.++...... T Consensus 1 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~ 79 (501) T protein:vir:27 1 ME-QTLFTDSTGQDLVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFGRR 79 (501) T ss_pred CC-ceeEEeccchhhhhhcccChhHHHhhccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCcc Confidence 32 22388888899999999999999999999999988888999999999999999999999999999997666666667 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecCCch----hHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEE Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDK----DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR 156 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~ 156 (512) ..+.++++|+++||+++||++.++|++|+||+++++++ .+++.|++||+.|+|+..+.++++++++||+||+++|+ T Consensus 80 ~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ 159 (501) T protein:vir:27 80 KDREMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNDNNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYR 159 (501) T ss_pred CccccccceeccchHHHHHHHHhhhhcccCeeEecCCccchHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEe Confidence 77888999999999999999999999999999998763 45678999999999999999999999999999999999 Q ss_pred CCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccc Q lcl|NC_010808. 157 NQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFE 236 (512) Q Consensus 157 d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (512) +++|++++++++|.+++|+||++..++++++||+|.....+ ..+.++++||++.+++|...+.. ...... T Consensus 160 ded~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~----~~~~~~~vyt~~~v~~~~~~~~~------~~~~~~ 229 (501) T protein:vir:27 160 NEYDETRIKRLNPLETFVIYDNSLEDNSIAAVRYYNRGTLQ----NAKDVVEIYTNEHIYTLDASDDF------NEISVT 229 (501) T ss_pred CCCCceEEEEEccceeEEEecCCCCCceEEEEEEEEeeecC----CcEEEEEEEeCCeEEEEEeCCce------eecccc Confidence 99999999999999999999999889999999999865443 34678899999999998876543 245678 Q ss_pred cccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChh-hhhhhhhccccccchhh Q lcl|NC_010808. 237 SHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPD-EVKKQKEANVLFLEPTV 315 (512) Q Consensus 237 ~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~-~~~~~~~~~~~~~~~~~ 315 (512) +|+||.||||+|+|+++|+|+|+++++|||+||+++|++++.+++|++|+++++|....+.+ ....++..+.+.+... T Consensus 230 ~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~~- 308 (501) T protein:vir:27 230 THAFGTVPITEFLNNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLKPP- 308 (501) T ss_pred ccCCCcccEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhhhhcCceeeccc- Confidence 99999999999999999999999999999999999999999999999999999997655433 3334444555544322 Q ss_pred hhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 316 YENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLF 395 (512) Q Consensus 316 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~ 395 (512) ....+..++++++|++++.+.+++++++++|.+.|+.+|++|+++++++++|+||+||++++.+|.+||..+++.| T Consensus 309 ----~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~ 384 (501) T protein:vir:27 309 ----KSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQF 384 (501) T ss_pred ----ccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHHHHHHHHHHHHHHHHHHHHHH Confidence 2223345678899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_010808. 396 TKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEE 475 (512) Q Consensus 396 ~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~ 475 (512) +++|++++++|+++++..+.. ...++.+|+++|++++|.|.++.+++++|++|++|+||+++++|+++||++|++||++ T Consensus 385 ~~~l~~~~~li~~~~~~~~~~-~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl~g~iS~et~l~~l~~v~D~~~E~eri~~ 463 (501) T protein:vir:27 385 TQGLKRRYRLAARIGSLVNEF-KDFDESLLKITFTPNLPKSLNEQVSILTGLGGQVSQETALSLSGLVESPNEELDKINK 463 (501) T ss_pred HHHHHHHHHHHHHHHhhcccc-cccccccceEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHH Confidence 999999999999998876543 3556788999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhcccCC-CCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 476 DEKESIKKAQKGIYKDP-RDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 476 E~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~e 512 (512) |+++...........+. +...+...+..+.+..+..| T Consensus 464 E~~e~~~~~~~~~~~~~~~~~~d~~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 464 EVSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFERAYE 501 (501) T ss_pred HHHhhhHhhhcCccccccccccCCCCCCccccccccCC Confidence 98765443333222222 22222222222333333333 No 10 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=100.00 E-value=1.5e-104 Score=589.71 Aligned_cols=495 Identities=39% Similarity=0.608 Sum_probs=416.6 Q ss_pred CcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccc Q lcl|NC_010808. 2 LKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRK 81 (512) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~ 81 (512) ..-.-|..+.-..+..+.+|++++++.|.+...+.........|.++|.+|...+.+||+++.+||.|++..+......+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~~~ 80 (501) T protein:vir:96 1 MEQTLFTDSTGQERVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRK 80 (501) T ss_pred CceeeeeecccceeccccccchhHHhhhcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCccccC Confidence 23344777777778888999999999999999998888888899999999999999999999999999876555556667 Q ss_pred cccccceeeecchHHHHHHHHHhhhhccCceecCCc----hhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEEC Q lcl|NC_010808. 82 EEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDD----KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRN 157 (512) Q Consensus 82 ~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d----~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d 157 (512) ...++++|+++||+++||++.++|++|+|+++++.+ +.+++.|+++|+.|+|+.++.++++++++||+||+++|+| T Consensus 81 ~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d 160 (501) T protein:vir:96 81 DNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNDDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRS 160 (501) T ss_pred ccccccceeecchHHHHHHHHhhhhcccCeeEeeCCccchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEc Confidence 778899999999999999999999999999998865 4467789999999999999999999999999999999999 Q ss_pred CCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccccccccc Q lcl|NC_010808. 158 QDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFES 237 (512) Q Consensus 158 ~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (512) ++|++++++++|.+++|+||++..++++++||+|.....+ ..+.++++||++.+++|...+..+ .....+ T Consensus 161 edg~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~----~~~~~~~vyt~~~i~~~~~~~~~~------~~~~~~ 230 (501) T protein:vir:96 161 EYDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQ----SAKDVVEIYTDEHIYTLDASDDFN------EISVTT 230 (501) T ss_pred CCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeecCC----CcEEEEEEEcCCcEEEEeeCCCce------eccccc Confidence 9999999999999999999998888999999999765433 345788999999999997655432 456789 Q ss_pred ccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChh-hhhhhhhccccccchhhh Q lcl|NC_010808. 238 HSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPD-EVKKQKEANVLFLEPTVY 316 (512) Q Consensus 238 ~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~-~~~~~~~~~~~~~~~~~~ 316 (512) |+||.||||+|+|+++|+|+|+++++|||+||+++|++++.++++++|++|++|+...+.+ ....++..+.+.+... T Consensus 231 ~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~-- 308 (501) T protein:vir:96 231 HAFGTVPITEYLNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLKPP-- 308 (501) T ss_pred cCCCccceEEecCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchhhhhhcCeeeeccc-- Confidence 9999999999999999999999999999999999999999999999999999998655543 2344444554444321 Q ss_pred hhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 317 ENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFT 396 (512) Q Consensus 317 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~ 396 (512) ....+...+++++|++++.+.+++++++++|.+.|+.+|++|+++++++++|+||+||++++.++.+||+.+++.|+ T Consensus 309 ---~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~ 385 (501) T protein:vir:96 309 ---KSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFT 385 (501) T ss_pred ---ccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 12223456778999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHH Q lcl|NC_010808. 397 KGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEED 476 (512) Q Consensus 397 ~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E 476 (512) .+|++++++|+++++..+.. ...++.+|+++|++++|.|.++.+++++|++|++|+||+++++|+++||++|++||++| T Consensus 386 ~~l~~~~~li~~~~~~~~~~-~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E 464 (501) T protein:vir:96 386 KGLKRRYRLAARIGSLVNEF-KDFDESLLKITFTPNLPKSLNEQVSILTGLGGQVSQETALSLSGLVESPNEELDKINKE 464 (501) T ss_pred HHHHHHHHHHHHHHHhcccc-cccccccceEEeCCCCCcCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHH Confidence 99999999999999877543 34567789999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 477 EKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) +++.................++++.+...++.++++ T Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~d~~e~~~ 500 (501) T protein:vir:96 465 MSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFEREY 500 (501) T ss_pred HHHhhccccccchhhcccccCCcCCCCCCCcccccc Confidence 886543222222222222222222222222222222 No 11 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=100.00 E-value=2.9e-100 Score=566.28 Aligned_cols=477 Identities=35% Similarity=0.580 Sum_probs=394.1 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-ccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNL-VELTR 79 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~-~~~~~ 79 (512) |+. ++++.++++.+|.. ..+..+ .+.|.++|.+|+.++++||+++.+||+|+|+++ .+... T Consensus 1 ~~~--------------~~~~~~~~~~~~~~-~~~~l~---~~~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i~~~~~~ 62 (506) T protein:vir:94 1 MDY--------------DLTEHKQANLIYQE-SLENLT---PNKIMKFITHHFNYQRPRLEMLDDYYQGYNLKILDKQSR 62 (506) T ss_pred CCc--------------chhhhhcceeeccc-chhcCC---HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccc Confidence 333 25556666666542 222222 467899999999999999999999999999765 44455 Q ss_pred cccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCC Q lcl|NC_010808. 80 RKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD 159 (512) Q Consensus 80 ~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~ 159 (512) .....++++|+++||+++||++.++||+|+|++|++++++.++.|++||+.|+++..+.+++++++++|+||+++|+|++ T Consensus 63 ~~~~~~~~~ki~~n~~~~Iv~~~~~~l~G~p~~~~~~d~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded 142 (506) T protein:vir:94 63 RHEDGKADHRATHSFAKYIADFQTSYSVGNPINVKLPDDGSNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGED 142 (506) T ss_pred cccccCCcceeecchHHHHHHHhhhhhcccCceeecCcchHHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCC Confidence 56778899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcc-eEEEEEEEcCCcEEEEEecCCccccccccccccccc Q lcl|NC_010808. 160 DETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDED-EVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESH 238 (512) Q Consensus 160 g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~-~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (512) |++++++++|.+++|+||+++..+++++||+|.....++.... ..+++++||++.+++|.....++ ......+| T Consensus 143 ~~~~i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~-----~~~~~~~~ 217 (506) T protein:vir:94 143 NEEHLAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIMG-----KMQVDTTK 217 (506) T ss_pred CeeEEEEEcccceEEEecCCCCCceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCcc-----ceeccccc Confidence 9999999999999999999888899999999988776655443 35678899999988887665443 23456789 Q ss_pred cccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChh-------------------- Q lcl|NC_010808. 239 SFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPD-------------------- 298 (512) Q Consensus 239 ~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~-------------------- 298 (512) +||.||||+|+|++.|.|+|+++++|||+||.++|++++.+++|++|+++++|....... T Consensus 218 ~~g~vPvv~~~n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (506) T protein:vir:94 218 PITTFPVVEFKNSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLA 297 (506) T ss_pred cCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999996433221 Q ss_pred -----hhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccch Q lcl|NC_010808. 299 -----EVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS 373 (512) Q Consensus 299 -----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~S 373 (512) ....++..+++.+.... ...+.+.+++++||+++.+.+++++++++|.+.|+.+|++|+++++++++|+| T Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~S 372 (506) T protein:vir:94 298 KDKLELIKEMKDANMLLLKSGM-----TVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFASNSS 372 (506) T ss_pred cchhHHHhhhhhcCeeeecccc-----cccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccch Confidence 12223333444333221 12234567899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCCh Q lcl|NC_010808. 374 GEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQ 453 (512) Q Consensus 374 g~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~ 453 (512) |+||++++.++.+||+++++.|+++|++++++|+++++..+. ..+.++.+++|+|++++|.|.++.|++++|++|++|+ T Consensus 373 g~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~-~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~ 451 (506) T protein:vir:94 373 GVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENSIHG-DWTFDPQELTFTFRDNLPADNISQIKALVQAGATLPQ 451 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-ccccccccceEEeCCCCCcCHHHHHHHHHHHhccCCh Confidence 999999999999999999999999999999999999887543 3456778899999999999999999999999999999 Q ss_pred HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 454 TTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 454 et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ||+++++|+++||++|++||++|+++..+..... .. ....+..+..+++.++| T Consensus 452 et~~~~lp~v~d~~~E~~ri~~E~~~~~~~~~~~-~~-----~~~~~~~~~~~~~~~~e 504 (506) T protein:vir:94 452 KYLYQQLPGVTNPQDIVDMMKEQSANGDYSFDQN-GV-----ISNDGQTNTTATQTDEE 504 (506) T ss_pred HHHHHhCCCCCCHHHHHHHHHHHHHHHhhcchhh-cC-----CCcccCccccccccccC Confidence 9999999999999999999999987754432211 11 11111222233333444 No 12 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=100.00 E-value=8e-99 Score=558.39 Aligned_cols=468 Identities=26% Similarity=0.401 Sum_probs=388.5 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri 90 (512) |.+-...+++.. ....+++.|.++|.+| ..+.+||+++++||+|+|+++.+. .+.+.++++|+ T Consensus 1 ~~~~~~~~~~~~--------------~~~~~~~~i~~~i~~~-~~~~~~~~~l~~Yy~g~~~i~~~~--~~~~~~~~~ki 63 (499) T protein:vir:10 1 MAVVIDKDLLDD--------------VNEPNIEAINYAIREL-QNRKKRLDKLSDYYNGKQEIEKHE--FDNATVEAANV 63 (499) T ss_pred CccchhhhHHhh--------------hhcCCHHHHHHHHHHH-HHHHHHHHHHHHHhccccchhcCC--cCcCCCCccee Confidence 433333333211 1122367788888877 467899999999999999987543 34567789999 Q ss_pred ecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCc--------- Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDE--------- 161 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~--------- 161 (512) ++||+++||++.++|++|+|++|++++++..+.|+++|+.|+|+..+.++++++++||+||+++|.+++|. T Consensus 64 ~~n~~~~Iv~~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~ 143 (499) T protein:vir:10 64 MVNHAKYITDMNVGFMTGNPVKYVAEKGKNIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGN 143 (499) T ss_pred ecchHHHHHHHHhhhhcccCceeecCChhHHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999873 Q ss_pred --------eEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccccc Q lcl|NC_010808. 162 --------TRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPREN 233 (512) Q Consensus 162 --------~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~ 233 (512) ++++.++|.++||+|++++..++++++|+|...+.++ ...++++++||++++++|.....+......... T Consensus 144 ~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~--~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~ 221 (499) T protein:vir:10 144 EKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEG--NTNGYSITVYMPQRIVEYRTKTTMEVSANDPIV 221 (499) T ss_pred cccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecCC--CceEEEEEEEeCCeEEEEEecCCccccCcceec Confidence 5689999999999999999899999999998776553 456788999999999999987776655555667 Q ss_pred ccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccch Q lcl|NC_010808. 234 GFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEP 313 (512) Q Consensus 234 ~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~ 313 (512) ...+|+||.||||+|+|++.|+|+|+++++|||+||.++|++++.++++++|+++++|+...+.......... T Consensus 222 ~~~~~~~g~vPvv~~~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~------- 294 (499) T protein:vir:10 222 YDGENLFGAVPIIEFRNNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDKDDIQRLKR------- 294 (499) T ss_pred ccccCCCCccceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccchhhhhhh------- Confidence 8889999999999999999999999999999999999999999999999999999999765443322211111 Q ss_pred hhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 314 TVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEG 393 (512) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~ 393 (512) +.......+++++++|++++.+.+++++++++|.+.|+.+|++|++++++++||+||+||+++++++.+||+++++ T Consensus 295 ----~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~k~~~k~~ 370 (499) T protein:vir:10 295 ----GAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKFMGNVSGEAMKFKLFGLENLLSIKQR 370 (499) T ss_pred ----cceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhhcccchHHHHHHHHHHHHHHHHHHHH Confidence 1122234567789999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHH Q lcl|NC_010808. 394 LFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKI 473 (512) Q Consensus 394 ~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri 473 (512) .|+.+|++++++|+++++..+ ...++.+++++|++++|.|.++.++++++++|++|+||+++++|+++|+++|++|| T Consensus 371 ~~~~~l~~~~~li~~~~~~~~---~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~~ri 447 (499) T protein:vir:10 371 YFFDGLRRRLKLIQTIVNIKG---ANDDASGCKISLVANIPSNLSDVVNNVKNADGIIPRKYTYSWLPDVDNPQDVIDEM 447 (499) T ss_pred HHHHHHHHHHHHHHHHHhccC---CccccccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHH Confidence 999999999999999987654 34677899999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 474 EEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 474 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ++|+++..+..+........+....++. +.+.+..++| T Consensus 448 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 485 (499) T protein:vir:10 448 NQQDAETIKKNQEALRGQDPDRLELEDK-QDDSSENDKE 485 (499) T ss_pred HHHHHHHHHHHHhhhccCCCCCCCCCCC-CcccCCCCCC Confidence 9999887766655543332222111111 1111111222 No 13 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=100.00 E-value=1.2e-98 Score=557.37 Aligned_cols=478 Identities=40% Similarity=0.646 Sum_probs=411.5 Q ss_pred cccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc--cccccccc Q lcl|NC_010808. 9 TDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELT--RRKEEYMA 86 (512) Q Consensus 9 ~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~--~~~~~~~~ 86 (512) --.-+|.+++-+|..-.|-.|.++-....+ ....|.++|.+|...+.+||+++.+||+|+|+++.... ......++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ 78 (481) T protein:vir:10 1 MTVYTINNINTKFSPLANDDFVVSDLAELL--KEENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKA 78 (481) T ss_pred CeeEeeehhchhcccccCceeeeecchhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCccccccccccc Confidence 112577888888888888888887665444 35779999999999999999999999999987654332 33445678 Q ss_pred ceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEE Q lcl|NC_010808. 87 DNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYK 166 (512) Q Consensus 87 ~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~ 166 (512) ++|+++||+++||++.++|++|+|+++++++++.++.|+++|+.|+++.++.+++++++++|+||+++|.+++|++++++ T Consensus 79 ~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~~i~~ 158 (481) T protein:vir:10 79 DHRAVHNYAKYVSRFIVGYLTGNPITITHQDNQTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFEDRDTFKV 158 (481) T ss_pred cceeecchHHHHHHHHHhhhccCCceEecCChhHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCeEEEEE Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceE Q lcl|NC_010808. 167 SDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPIT 246 (512) Q Consensus 167 ~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv 246 (512) ++|.+++|+||+....++++++|+|...+.+ ...+.++++||++.+++|...++.+ ..++..+|+||.|||| T Consensus 159 ~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~---~~~~~~~~~y~~~~i~~~~~~~~~~-----~~~~~~~~~~g~vPvv 230 (481) T protein:vir:10 159 LDPKSTFVVYDQTLDKKVVAGVRYFEKQDKD---KVPVQHVEVYTTDKIYYIEIKGGTY-----HRVEEVEHYYNDVPII 230 (481) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEEeeCC---CceEEEEEEEecCeEEEEEecCCce-----eecccccccCCceeEE Confidence 9999999999998888999999999765433 3456789999999999998776544 3456789999999999 Q ss_pred eecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCC Q lcl|NC_010808. 247 EFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETE 326 (512) Q Consensus 247 ~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (512) +|+|+++|+|+|+++++|||+||+++|++++.++++++|+++++|+.+.+.+.+...+..+.+.+.... ...+.+ T Consensus 231 ~~~n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~ 305 (481) T protein:vir:10 231 EYLNDQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDAKAFRDANMIHLEPGT-----NANGSE 305 (481) T ss_pred EeecCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCccchhhhhhccceeccccc-----cccCCC Confidence 999999999999999999999999999999999999999999999888887777777766665553332 223445 Q ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 327 GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLL 406 (512) Q Consensus 327 ~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li 406 (512) ++++++|++++.+.+++++++++|.+.|+.+|++|+++++.+++|+||+|+++++++|.+||+++++.|+.+|+++++++ T Consensus 306 ~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li 385 (481) T protein:vir:10 306 GKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLL 385 (481) T ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 67899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010808. 407 ETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQK 486 (512) Q Consensus 407 ~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~ 486 (512) +++++..+. ...+..+++++|++++|+|.++.++++++++|++|+||+++++|+++|+++|++||++|+++..+..+. T Consensus 386 ~~~~~~~~~--~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl~g~is~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~~~ 463 (481) T protein:vir:10 386 LNNVNLTGL--KQHNYAELTITFTPNLPKSMMESINAFNALSGGVSESTRLSLLDFIDNPKEELEKMQEEEAQREKQADK 463 (481) T ss_pred HHHHhccCC--CccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhhh Confidence 999887654 345677899999999999999999999999999999999999999999999999999999887765544 Q ss_pred hcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 487 GIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 487 ~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ...+++.+. ++++|+++ T Consensus 464 ~~~~~~~~~---------~~~~dd~~ 480 (481) T protein:vir:10 464 RGYGEAFEN---------HLNVDDSN 480 (481) T ss_pred ccCCccCCC---------CCCCCCCC Confidence 433333222 11112222 No 14 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=100.00 E-value=1.9e-98 Score=556.30 Aligned_cols=453 Identities=25% Similarity=0.392 Sum_probs=383.6 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri 90 (512) |--.-|..|+||.+..+. .+.|.++|.+|. .+++|++++++||+|+|+++.++. +.+.++++|+ T Consensus 1 ~~~~~~~~~~~p~d~~~~-------------~~~l~~~i~~~~-~~~~r~~~~~~yy~g~~~i~~~~~--~~~~~~~~ki 64 (453) T protein:vir:39 1 MKYKPPKLMTFPKDEPIT-------------NEVVTKFMEKHR-LEVARYEYLKNMYRGIMAIDAEPT--KDLWKPDNRL 64 (453) T ss_pred CeecCCcceEcCCCCCCC-------------HHHHHHHHHHHH-HHHHHHHHHHHHhhccCchhcCCC--ccccCcccee Confidence 444445555555555443 467888888885 567899999999999999876543 4567889999 Q ss_pred ecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccc Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAM 170 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~ 170 (512) ++||+++||++.++|++|+|++|++++++.++.|+++|+.|+|+..+.+++++++++|+||++||+|++|++++++++|. T Consensus 65 ~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~ 144 (453) T protein:vir:39 65 TVNFTKYIVDTFTGYFNGIPVKKSHSDKETLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEETQTNVIYNTPE 144 (453) T ss_pred ecchHHHHHHHHhhhhcccCceeccCChHHHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecC Q lcl|NC_010808. 171 STFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSN 250 (512) Q Consensus 171 ~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n 250 (512) +++|+|+++..+++++++|+|.. .....++++||++.+++|....+.+ ..++..+|+||.||||+|+| T Consensus 145 ~~~~v~d~~~~~~~~~~ir~~~~-------~~~~~~~~~yt~~~i~~~~~~~~~~-----~~~~~~~~~~g~vPvv~~~n 212 (453) T protein:vir:39 145 NMFMVYDDTIKQEPLFAVRYGYD-------DDYKLYGEVYTKETTYALNGTMGFY-----NMTEQAPNPFDDLPVVEFYF 212 (453) T ss_pred ceEEEecCCCCCeEEEEEEEEEe-------CCeEEEEEEEeCCeEEEEEecCCce-----eeecccccCCCceeEEEecC Confidence 99999999888899999998853 2346789999999999998765543 34567899999999999999 Q ss_pred CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcc Q lcl|NC_010808. 251 NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVD 330 (512) Q Consensus 251 ~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (512) +++|+|+|+++++|||+||+++|++++.++++++|+++++|.... .+.....+..+++... ...+.+.+++ T Consensus 213 ~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~~~-~~~~~~~~~~~~~~~~--------~~~~~~~~~~ 283 (453) T protein:vir:39 213 NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVE-EEDLKNIRSNRVINYY--------GESSEAKNVD 283 (453) T ss_pred CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCCC-chhhhhhhhcceeeec--------CCCCCCCCCc Confidence 999999999999999999999999999999999999999996433 3344444444443322 2233456789 Q ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 331 GGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETIL 410 (512) Q Consensus 331 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l 410 (512) ++|++++.+.+++++++++|.+.|+.+|++|+++++.+ ||+||+||++++++|.+||+++++.|+.+|++++++|++++ T Consensus 284 ~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~ 362 (453) T protein:vir:39 284 VKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESF-GSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELS 362 (453) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-cCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999998887 68999999999999999999999999999999999999988 Q ss_pred HhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_010808. 411 KNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYK 490 (512) Q Consensus 411 ~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~ 490 (512) +..+ ...++.+|+|+|++++|+|.++.+++++|++|++|+||+++++|+++|+++|++||++|+++..+........ T Consensus 363 ~~~~---~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl~g~is~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~ 439 (453) T protein:vir:39 363 TNVS---NKEAWKDIEYTFTRNEPKDIKEQAETANILMGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPS 439 (453) T ss_pred hccC---CccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCC Confidence 7654 3456788999999999999999999999999999999999999999999999999999999877655444333 Q ss_pred CCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 491 DPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 491 ~~~~~~~~~~~~~~~~~~~~~e 512 (512) ..+..++ ...+++| T Consensus 440 ~~~~~~~--------~~~~~~e 453 (453) T protein:vir:39 440 EKGTDTV--------VPETNEE 453 (453) T ss_pred CCCCCCC--------CCCcCCC Confidence 2221111 1112222 No 15 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=100.00 E-value=9.8e-99 Score=557.90 Aligned_cols=467 Identities=18% Similarity=0.269 Sum_probs=385.2 Q ss_pred cchhhcccccc---CCCcCeeecccchhHHhhhc-----------HHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc Q lcl|NC_010808. 11 TDLRENRNYLF---NDEANVVYTYDGTESDLLQN-----------INEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVE 76 (512) Q Consensus 11 ~~~~~~~~~~f---~~~~~~~~~~~~~~~~~~~~-----------~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~ 76 (512) |++..++..+- -+..|+.|.+....+...+. .+.|.++|.+|. .+++|++++.+||+|+|+++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~r~~~l~~YY~g~~~I~~~ 79 (492) T protein:vir:94 1 MQFIQLISQVAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHL-EKLPEISIGQEYYEQRPDIVKE 79 (492) T ss_pred ChHHHHHHHHHHHHhcCCceeecCccchhhhhhcccccCCchhhHHHHHHHHHHHHH-HHHHHHHHHHHHhccccccccc Confidence 55555544432 35677777776665544321 356788888886 5679999999999999998765 Q ss_pred ccc-----cccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEE Q lcl|NC_010808. 77 LTR-----RKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAY 151 (512) Q Consensus 77 ~~~-----~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~ 151 (512) +.+ .....++++|+++||+++||++.++|++|+|+++++++++..+.|++||+ |+++..+.++++++++||++| T Consensus 80 ~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~a~~~G~a~ 158 (492) T protein:vir:94 80 PKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEW 158 (492) T ss_pred cccccccccccccccccccccchHHHHHHHHHhhhcccCceeccCchHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEE Confidence 543 24456788999999999999999999999999999999999999999986 789999999999999999999 Q ss_pred EEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccc----- Q lcl|NC_010808. 152 ELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGL----- 226 (512) Q Consensus 152 ~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~----- 226 (512) +++|.|++|++++++++|.+++|+||++..+++++++|+|.... ..++++|++..+++|....+... T Consensus 159 ~~v~~d~dg~~~~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~--------~~~~~~y~~~~v~~~~~~~~~~~~~~~~ 230 (492) T protein:vir:94 159 LHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN--------ETKVEYWDKVTVNYYVYENGSLIPDYSN 230 (492) T ss_pred EEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecc--------ceeEEEEecCeEEEEEEecCeeeecccc Confidence 99999999999999999999999999988899999999997542 23679999999999876554321 Q ss_pred cccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhh-hh Q lcl|NC_010808. 227 KLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQ-KE 305 (512) Q Consensus 227 ~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~-~~ 305 (512) ..........+|+||.||||+|+|+++|+|+|+++++|||+||+++|++++.+++|++|++|++|+.+.+..+.... +. T Consensus 231 ~~~~~~~~~~~~~~g~vPvv~~~nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~ 310 (492) T protein:vir:94 231 NLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRLLRY 310 (492) T ss_pred ccccccccccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhHHHHhh Confidence 11234456788999999999999999999999999999999999999999999999999999999866555443321 11 Q ss_pred ccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHH Q lcl|NC_010808. 306 ANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLE 385 (512) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~ 385 (512) .+++ ..+++++++|++++.+.+++++++++|.+.|+.+|++|+++++++++|+||+||++++.+|. T Consensus 311 ~~~~--------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~ 376 (492) T protein:vir:94 311 YGAI--------------KVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLN 376 (492) T ss_pred ccce--------------ecCCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHHHH Confidence 1111 23557789999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCC Q lcl|NC_010808. 386 QRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQD 465 (512) Q Consensus 386 ~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d 465 (512) +||+++++.|+.+|++++++|+++++.. .++.+++++|++++|+|.++.++++++++|++|+||+++++|+++| T Consensus 377 ~k~~~k~~~f~~~l~~~~~li~~~~~~~------~~~~~i~v~f~~~~p~~~~e~~~~~~kl~giiS~et~~~~l~~v~d 450 (492) T protein:vir:94 377 LKADKLARKAKVAIQELLWFVFEHFDIK------GEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVED 450 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCC------cccceeeEEecCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCC Confidence 9999999999999999999999987653 2466899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCC Q lcl|NC_010808. 466 PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 511 (512) Q Consensus 466 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (512) +++|++||++|+++.++..+......+....+++++ ++.++| T Consensus 451 ~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~----~~~e~e 492 (492) T protein:vir:94 451 LQAELERIEQEQMEYNKQLPNLDDGGADSAQQQERS----NNKESE 492 (492) T ss_pred HHHHHHHHHHHHHHHHhhccccccccCCCCccccCC----ccccCC Confidence 999999999999887765544332222222211111 111111 No 16 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=100.00 E-value=3.5e-98 Score=554.85 Aligned_cols=467 Identities=18% Similarity=0.272 Sum_probs=384.9 Q ss_pred cchhhcccccc---CCCcCeeecccchhHHhhh-----------cHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc Q lcl|NC_010808. 11 TDLRENRNYLF---NDEANVVYTYDGTESDLLQ-----------NINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVE 76 (512) Q Consensus 11 ~~~~~~~~~~f---~~~~~~~~~~~~~~~~~~~-----------~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~ 76 (512) |++..++..+- -+..|+.|....+.....+ ..+.|.++|.+|. .+++|++++.+||+|+|+++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~r~~~l~~YY~g~~~i~~~ 79 (492) T protein:vir:97 1 MQFIQLISQVAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHL-EKLPEISIGQEYYEQRPDIVKE 79 (492) T ss_pred ChHHHHHHHHHHHHhcCCceeeccchhhhhHhhhcccCCCchhhHHHHHHHHHHHHH-HHHHHHHHHHHHhcccCccccc Confidence 65555544432 3567888777665554322 1345778888886 5779999999999999998765 Q ss_pred ccc-----cccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEE Q lcl|NC_010808. 77 LTR-----RKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAY 151 (512) Q Consensus 77 ~~~-----~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~ 151 (512) +.+ ...+.++++|+++||+++||++.++|++|+|+++++++++..+.|++||+ |+++..+.++++++++||+|| T Consensus 80 ~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~a~ 158 (492) T protein:vir:97 80 PKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEW 158 (492) T ss_pred cccccccccccccccccccccchHHHHHHHHhhhhcccCceeccCchHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCeEE Confidence 443 23456788999999999999999999999999999999999999999986 789999999999999999999 Q ss_pred EEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccc----- Q lcl|NC_010808. 152 ELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGL----- 226 (512) Q Consensus 152 ~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~----- 226 (512) +++|.+++|++++++++|.+++|+||++..+++++++|+|.... ..++++|+++.+++|....+... T Consensus 159 ~~v~~d~dg~~~~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~--------~~~~~~y~~~~v~~~~~~~~~~~~~~~~ 230 (492) T protein:vir:97 159 LHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN--------ETKVEYWDKVTVNYYVYENGSLIPDYSN 230 (492) T ss_pred EEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecc--------ceeEEEEecCeEEEEEEecCeeeecccc Confidence 99999999999999999999999999988899999999997542 23678999999999876654321 Q ss_pred cccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh-hhh Q lcl|NC_010808. 227 KLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK-QKE 305 (512) Q Consensus 227 ~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-~~~ 305 (512) ..........+|+||.||||+|+|+++|+|+|+++++|||+||.++|++++.++++++|++|++|....+..+... .+. T Consensus 231 ~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~ 310 (492) T protein:vir:97 231 NLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRLLRY 310 (492) T ss_pred cccccccccccCCCCCcceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhHHHHHhh Confidence 1233455678999999999999999999999999999999999999999999999999999999986655444322 222 Q ss_pred ccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHH Q lcl|NC_010808. 306 ANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLE 385 (512) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~ 385 (512) .+++ ..+++++++|++++.+.+++++++++|.++|+.+|++|+++++++++|+||+||++++.+|. T Consensus 311 ~~~~--------------~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~ 376 (492) T protein:vir:97 311 YGAI--------------KVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLN 376 (492) T ss_pred ccce--------------ecCCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHH Confidence 2222 22456789999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCC Q lcl|NC_010808. 386 QRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQD 465 (512) Q Consensus 386 ~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d 465 (512) .||+++++.|+.+|++++++|+++++.. .++.+++++|++++|+|.++.+++++|++|++|+||+++++|+++| T Consensus 377 ~ka~~~~~~f~~~l~~~~~li~~~~~~~------~~~~~i~v~f~~~~p~~~~e~a~~~~kl~G~iS~et~l~~l~~v~d 450 (492) T protein:vir:97 377 LKADKLARKAKVAIQELLWFVFEHFDIK------GEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVED 450 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCC------cccceeeEEecCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCC Confidence 9999999999999999999999987643 3567899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 466 PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 466 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) +++|++||++|+++..+..+.....+.......+.+++. .+| T Consensus 451 ~~~Eleri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~e 492 (492) T protein:vir:97 451 LQAELERIEQEQTEYNKQLPNLDDGGADSAQQQERSNNK-----ESE 492 (492) T ss_pred HHHHHHHHHHHHHHHHHhhhccccCCCCCCccccccccc-----ccC Confidence 999999999999877665544332222222111111111 111 No 17 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=100.00 E-value=8.5e-98 Score=552.76 Aligned_cols=466 Identities=23% Similarity=0.361 Sum_probs=394.0 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri 90 (512) |+ .....+|..++|..|.++..+..+. +.|.++|.+|..++++||+++++||+|+|++++.. ..+.++++|+ T Consensus 1 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~---~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~~---~~~~~~~~ki 72 (470) T protein:vir:99 1 MK--DINYGRDKVTGNSSFIFPKGEKLTS---NELLGFIAYNETVLKPRYRENMKLYLGKHKILTAP---EKETGADNRI 72 (470) T ss_pred Cc--cccCCcccccCCceEEeCCCCCcCH---HHHHHHHHHHHHhhHHHHHHHHHHhccccccccCc---ccccCCccee Confidence 32 2346688999999999988777664 57889999999999999999999999999987543 3457789999 Q ss_pred ecchHHHHHHHHHhhhhccCceecCCc-hhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEcc Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGNPIQCQDDD-KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDA 169 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~~~~~~~~d-~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p 169 (512) ++||+++||++.++|++|+|+++++.+ ....+.|+++|+.|+|+.++.+++++++++|++|+++|.+++|++++++++| T Consensus 73 ~~n~~~~Ivd~~~~~l~g~p~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p 152 (470) T protein:vir:99 73 VVNSAKYVVDVYNGYFCGIEPKLALLNDSSKIDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDARPHLMYSSP 152 (470) T ss_pred ecchHHHHHHHHhhhhccCCeeEeeCCchhHHHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeEEEEEEcc Confidence 999999999999999999999998865 4567889999999999999999999999999999999999999999999999 Q ss_pred ceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeec Q lcl|NC_010808. 170 MSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFS 249 (512) Q Consensus 170 ~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~ 249 (512) .+++|+||++...++++++|+|.... ......++.+|+++.+++|.....++. ....+..+|+||.||||+|+ T Consensus 153 ~~~~~i~d~~~~~~~~~~vr~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~g~vPvv~~~ 225 (470) T protein:vir:99 153 NHAFIIYDDTVQRQPLAFVHYQIDNS----NNWTDAYGVIQYADKFYKFKGYDIEED---TNAAGYAINPYGLVPAVEFF 225 (470) T ss_pred ceeEEEEcCCCCcceEEEEEEEEEec----CCeeEEEEEEEecCeEEEEEecccccc---cccccccccCCCccceEeec Confidence 99999999988888999999987542 334567888999999998887655432 23456788999999999999 Q ss_pred CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh---hhhccccccchhhhhhcccccCCC Q lcl|NC_010808. 250 NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK---QKEANVLFLEPTVYENRDTGIETE 326 (512) Q Consensus 250 n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~ 326 (512) |+++|+|+|+++++|||+||+++|++++.++++++|++|++|+.....+.+.. ....+++.+ .....+ T Consensus 226 n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~~~~~~~~~~~~~---------~~~~~~ 296 (470) T protein:vir:99 226 ENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEGNPKFDFKNNRVLYV---------SQLDPD 296 (470) T ss_pred CCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccccchhhhhhhcceeee---------cCCCCC Confidence 99999999999999999999999999999999999999999986554433221 122222211 122346 Q ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 327 GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLL 406 (512) Q Consensus 327 ~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li 406 (512) .+++++|++++.+.+++++++++|.+.|+.+|++|++++++++||+||+||++++.+|.+||+++++.|+.+|++++++| T Consensus 297 ~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li 376 (470) T protein:vir:99 297 TNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIV 376 (470) T ss_pred CCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 67899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010808. 407 ETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQK 486 (512) Q Consensus 407 ~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~ 486 (512) +++++..+.. ..++.+++++|++++|.|.++.++++++++|++|+||+++++|++ |+++|++||++|+++..+..+. T Consensus 377 ~~~~~~~~~~--~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~giis~et~l~~l~~v-d~~~E~eri~~E~~~~~~~~~~ 453 (470) T protein:vir:99 377 LATLFNNKQD--QELWSELDFKFTRNLPEDMASAIDNAKNAEGIVSKKTQLGMIPDI-EPDAEMKQIAKEKADAIKQTQQ 453 (470) T ss_pred HHHHhccCCc--ccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhCCCC-CHHHHHHHHHHHHHHHHHHHHh Confidence 9998776543 456778999999999999999999999999999999999999998 7999999999999887765554 Q ss_pred hcccCCCCCCCCCCCCC Q lcl|NC_010808. 487 GIYKDPRDINDDEQDDD 503 (512) Q Consensus 487 ~~~~~~~~~~~~~~~~~ 503 (512) .....+....+.+.+++ T Consensus 454 ~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 454 LSMPIDILKRDNNAEEE 470 (470) T ss_pred hcCCCCcCCCCCCccCC Confidence 44332222211111111 No 18 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=100.00 E-value=2.9e-97 Score=549.86 Aligned_cols=472 Identities=17% Similarity=0.243 Sum_probs=374.7 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc- Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR- 79 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~- 79 (512) ||.+--= ..-+++.....-....|-++..+...+.+ .+.|.++|.+|. .+++||.++.+||+|+|+++.+... T Consensus 1 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~e~~---~~~i~~~i~~~~-~~~~r~~~l~~YY~g~~~i~~~~~~~ 74 (483) T protein:vir:12 1 MAQALIK--GGNILYPSQPTQTEIFDAIVRTNNKPETL---EEMIVRYIKQHL-EKLPEISIGQEYYEQRPDIVKEPKPV 74 (483) T ss_pred Cccchhc--CCceeecCcchhhhhhhcccccCCchhhH---HHHHHHHHHHHH-HHHHHHHHHHHHhccccccccccccc Confidence 4322100 00001111111111222222222222222 357788888886 5678999999999999998765433 Q ss_pred ----cccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEE Q lcl|NC_010808. 80 ----RKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMI 155 (512) Q Consensus 80 ----~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~ 155 (512) ...+.++++|+++||+++||++.++|++|+|+++++++++..+.|++||+ |+++..+.++++++++||+||+++| T Consensus 75 ~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~y~~v~ 153 (483) T protein:vir:12 75 DATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPY 153 (483) T ss_pred cccccccccccccccccchHHHHHHHHhhhhcccCceeccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEE Confidence 24556788999999999999999999999999999999999999999986 6899999999999999999999999 Q ss_pred ECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcc-----ccccc Q lcl|NC_010808. 156 RNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNG-----LKLTP 230 (512) Q Consensus 156 ~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~-----~~~~~ 230 (512) .|++|++++++++|.+++|+||++..+++++++|+|.... ..++++|++..+++|....+.. ..... T Consensus 154 ~d~d~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~--------~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~ 225 (483) T protein:vir:12 154 LDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN--------ETKVEYWDKVTVNYYVYENGSLIPDYSNNLEN 225 (483) T ss_pred EcCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeec--------ceEEEEEecCeEEEEEEeCCeeeecccccccc Confidence 9999999999999999999999988899999999997642 2357999999999887655432 12233 Q ss_pred cccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh-hhhcccc Q lcl|NC_010808. 231 RENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK-QKEANVL 309 (512) Q Consensus 231 ~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-~~~~~~~ 309 (512) ......+|+||.||||+|+|+++|+|+|+++++|||+||.++|++++.+++|++|++|++|....+..+... .+..+++ T Consensus 226 ~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~ 305 (483) T protein:vir:12 226 SKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAI 305 (483) T ss_pred cccccccCCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHHHhhhhcccc Confidence 445678899999999999999999999999999999999999999999999999999999976655444322 2222222 Q ss_pred ccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHH Q lcl|NC_010808. 310 FLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTK 389 (512) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~ 389 (512) ..+++++++|++++.+.+++++++++|.++|+.+|++|+++++++++|+||+||++++.++.+||+ T Consensus 306 --------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~ 371 (483) T protein:vir:12 306 --------------KVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKAD 371 (483) T ss_pred --------------ccCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHH Confidence 224577899999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHH Q lcl|NC_010808. 390 TKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELE 469 (512) Q Consensus 390 ~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E 469 (512) ++++.|+.+|++++++|+++++.. .++.+++++|++++|+|.++.++++++++|++|+||+++++|+++|+++| T Consensus 372 ~~~~~f~~~l~~~~~li~~~~~~~------~~~~~i~v~f~~~~p~~~~~~a~~~~kl~GiiS~et~~~~~~~v~d~~~E 445 (483) T protein:vir:12 372 KLARKAKVAIQELLWFVFEHFDIK------GEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAE 445 (483) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCC------CccceeeEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHH Confidence 999999999999999999987643 35678999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCC Q lcl|NC_010808. 470 VKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 511 (512) Q Consensus 470 ~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (512) ++||++|+++..+..+......+. .+.+++...+.++| T Consensus 446 ~~ri~~E~~~~~~~~~~~~~~~~d----~~~~~~~~~~~e~e 483 (483) T protein:vir:12 446 LERIEQEQMEYNKQLPNLDDGGAD----GAQQQERSNNKESE 483 (483) T ss_pred HHHHHHHHHHHHhhcccccccccC----CcccCCCCCcccCC Confidence 999999998876654433222211 11111111111111 No 19 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=100.00 E-value=4.4e-97 Score=548.85 Aligned_cols=453 Identities=25% Similarity=0.403 Sum_probs=386.0 Q ss_pred cCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHH Q lcl|NC_010808. 21 FNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISD 100 (512) Q Consensus 21 f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~ 100 (512) ++..++..+.++..+..+ .+.|.++|++|. .+++||+++.+||+|+|+++.. ....+.++++|+++||+++||+ T Consensus 1 ~~~~~~~~~~~~~~~~~~---~~~i~~~i~~~~-~~~~r~~~~~~yy~g~~~i~~~--~~~~~~~~~~ki~~n~~~~ivd 74 (453) T protein:vir:73 1 MNLKPIKLMTYSRDEEIT---DKVVNDFMKKHQ-EEVERYEYLGNMYKGIMEISSQ--KAKDSWKPDNRLTNNFAKYIVD 74 (453) T ss_pred CccccceeeeccccccCC---HHHHHHHHHHHH-HHHHHHHHHHHHhccccchhcC--CCCCccCccceeecchHHHHHH Confidence 455555555555444433 457888898885 6789999999999999998754 3455678899999999999999 Q ss_pred HHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCC Q lcl|NC_010808. 101 FINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTI 180 (512) Q Consensus 101 ~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~ 180 (512) +.++|++|+|++++++++..++.|++||+.|+|+..+.++++++++||+||+++|++++|.+++++++|.+++|+|+++. T Consensus 75 ~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~dd~~ 154 (453) T protein:vir:73 75 TFVGYFNGIPIKKTHDDKSVLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNESTESEVIYCSPLNVFMVYDDSI 154 (453) T ss_pred HhhhhhcccCceeecCChHHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEEEEeCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred CceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHH Q lcl|NC_010808. 181 ERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEK 260 (512) Q Consensus 181 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~ 260 (512) +.++++++++|... ....++++||++.+++|....+.+ ......+|+||.||||+|+|+++|+|+|++ T Consensus 155 ~~~~~~~i~~~~~~-------~~~~~~~vyt~~~i~~~~~~~~~~-----~~~~~~~~~~g~vPvv~~~n~~~g~s~~~~ 222 (453) T protein:vir:73 155 KQKPLFAVYYGFDE-------EGNLSGTVYTLLETISITGKAGEV-----KFGESTYNVYSDLPIVEYNFNEERQSIFEP 222 (453) T ss_pred CceeEEEEEEEEec-------CceEEEEEEeCCeEEEEEecCCce-----EEccceeccCCceeEEEecCCCCCCcchhh Confidence 88899999987532 234678999999999998765543 345678899999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCH Q lcl|NC_010808. 261 VITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDV 340 (512) Q Consensus 261 v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 340 (512) +++|||+||+++|++++.+++|++|++|++|.... .+.....+..+.+...... .......+.+++++|++++.+. T Consensus 223 v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~-~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~d~~~l~~~~~~ 298 (453) T protein:vir:73 223 VHSLINSYNKVTSEKANDVEYFSDQYLVFLGAEVD-EEDAKNIKDNRLINFFDKN---SNGQGTNAAKVDVKFLDKPDSD 298 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCC-chhhhcccccccccccccc---cccccccccCceeEEeeecCCH Confidence 99999999999999999999999999999997443 3444444444444332221 1122234567889999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccc Q lcl|NC_010808. 341 QGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANK 420 (512) Q Consensus 341 ~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~ 420 (512) +++++++++|.+.|+.+|++|+++++.+ ||+||+||++++.+|.+||+++++.|+.+|++++++|+++++..+. .. T Consensus 299 ~~~~~~~~~l~~~I~~~s~~p~~~~~~~-gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~---~~ 374 (453) T protein:vir:73 299 VQTENLLNRLERSIFQFTMAANISDENF-GNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASN---KD 374 (453) T ss_pred HHHHHHHHHHHHHHHHHhCCcccCcccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC---cc Confidence 9999999999999999999999999887 7899999999999999999999999999999999999998776543 45 Q ss_pred ccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCC Q lcl|NC_010808. 421 DFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDE 499 (512) Q Consensus 421 d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 499 (512) ++.+++++|++++|.|.++.+++++|+.|++|+||+++++|+++||++|++||++|+++.++.++......+.+..++= T Consensus 375 ~~~~i~v~f~~~~p~~~~~~a~~~~k~~giis~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 453 (453) T protein:vir:73 375 AWKDIEYTFTRNEPKDIKEQAETANILKGITSEETALSVISVIPDVQAEMEKIKKKKLLQLSLTRTSNLVRMKQMRGNL 453 (453) T ss_pred ccccceEEeCCCCCCCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCcchhhhcCC Confidence 6788999999999999999999999999999999999999999999999999999999988877765544443332222 No 20 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=100.00 E-value=7e-97 Score=547.74 Aligned_cols=452 Identities=25% Similarity=0.400 Sum_probs=379.0 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri 90 (512) |-..-..-+.|+++..+. .+.|.++|.+|. .+++||+++++||+|+|+++.+.. ..+.++++|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~-------------~~~i~~~i~~~~-~~~~r~~~~~~Yy~g~~~i~~~~~--~~~~~~~~ki 64 (452) T protein:vir:36 1 MKYKPPKLMTFSKDEPIT-------------VEVVTKFMEKHK-LEVARYEYLKNMYLGIMAIDDEPA--KDSWKPDNRL 64 (452) T ss_pred CcccCceeEEcCCccCCC-------------HHHHHHHHHHHH-HHHHHHHHHHHHhccccccccCcc--ccccCcccee Confidence 322223334444444432 567889999886 567999999999999999876543 4567789999 Q ss_pred ecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccc Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAM 170 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~ 170 (512) ++||+++||++.++|++|+|+++++++++.++.|+++|+.|+|+..+.+++++++++|+||+++|+|++|++++++++|. T Consensus 65 ~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~ 144 (452) T protein:vir:36 65 AVNFTKYIVDTFTGYFNGIPVKKSHSDKEILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDEDTQTNVVYNSPE 144 (452) T ss_pred ecchHHHHHHHHhhhhcccCceeecCChhHHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecC Q lcl|NC_010808. 171 STFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSN 250 (512) Q Consensus 171 ~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n 250 (512) +++|+||++...++++++|+|... ....++++||++.+++|....+++. .....+|+||.||||+|+| T Consensus 145 ~~~~v~d~~~~~~~~~~i~~~~~~-------~~~~~~~vyt~~~i~~~~~~~~~~~-----~~~~~~~~~g~iPvv~~~n 212 (452) T protein:vir:36 145 NMFMVYDDTVKQEPLFAVRYGVDE-------DKKLQGEVYTLLETIKISGENDEIS-----FGEGTYNPYPDLPVVEFYF 212 (452) T ss_pred ceEEEEcCCCCCceEEEEEEEEec-------CceEEEEEEecCeEEEEEEcCCceE-----EecceeccCCcccEEEecC Confidence 999999998888999999998632 2356889999999999987665443 4556889999999999999 Q ss_pred CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcc Q lcl|NC_010808. 251 NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVD 330 (512) Q Consensus 251 ~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (512) +++|+|+|+++++|||+||+++|++++.++++++|++|++|..... +.....+..+++.+.. .+...+++ T Consensus 213 ~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~~~~-~~~~~~~~~~~~~~~~---------~~~~~~~~ 282 (452) T protein:vir:36 213 NEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEE-EDLKNIRSNRVINYYA---------DGEGKNVD 282 (452) T ss_pred CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcCc-hhhhhhhhcceEEecC---------CCCccCCc Confidence 9999999999999999999999999999999999999999975433 3334444444433321 23455678 Q ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 331 GGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETIL 410 (512) Q Consensus 331 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l 410 (512) ++|++++.+.+++++++++|.+.|+.+|++|+++++++ ||+||+||++++++|.+||+++++.|+.+|++++++|++++ T Consensus 283 ~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~ 361 (452) T protein:vir:36 283 VKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESF-GSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELS 361 (452) T ss_pred ceeEeecCCHHHHHHHHHHHHHHHHHHhCccccCcccc-cCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999998887 68999999999999999999999999999999999999998 Q ss_pred HhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_010808. 411 KNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYK 490 (512) Q Consensus 411 ~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~ 490 (512) +..+. ..++.+|+|+|++++|.|.++.+++++|++|++|+||+++++|+++|+++|++||++|+++..+........ T Consensus 362 ~~~~~---~~~~~~i~i~f~~~~p~d~~~~a~~~~k~~g~iS~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~ 438 (452) T protein:vir:36 362 TNVSN---KDSWKDIEYTFTRNEPKDIKEQAETANILMGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPS 438 (452) T ss_pred hccCC---ccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccCC Confidence 87642 456778999999999999999999999999999999999999999999999999999998765544332222 Q ss_pred CCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 491 DPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 491 ~~~~~~~~~~~~~~~~~~~~~e 512 (512) ++ ..++..+.+++| T Consensus 439 ~~--------~~~~~~~~~~~e 452 (452) T protein:vir:36 439 EK--------GTDTVVSETNEE 452 (452) T ss_pred CC--------cccccCccccCC Confidence 11 111222222222 No 21 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=100.00 E-value=2.7e-96 Score=544.53 Aligned_cols=445 Identities=16% Similarity=0.165 Sum_probs=370.3 Q ss_pred HhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc---------ccccccceeeecchHHHHHHHHHhhhh Q lcl|NC_010808. 37 DLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR---------KEEYMADNRVAHDYASYISDFINGYFL 107 (512) Q Consensus 37 ~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~---------~~~~~~~~ri~~n~~~~iv~~~a~~l~ 107 (512) ...+.++.|.+....++..+.+||+++.+||+|+|+++.+.... ....++++|+++||+++||++.++|++ T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 80 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVA 80 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhhee Confidence 22333455555544555778899999999999999987664332 234567899999999999999999999 Q ss_pred ccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCCceeEEE Q lcl|NC_010808. 108 GNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAG 187 (512) Q Consensus 108 g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~ 187 (512) |+||+|++++++..+.|+++++. +++..+.+++++++++|++|+++|+|++|++++++++|.++||+|++++.++++++ T Consensus 81 G~p~~~~~~d~~~~~~l~~~~~~-~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~~~~~~~p~~~~~v~d~~~~~~~~a~ 159 (470) T protein:vir:10 81 SVFPDIDVGKDADNKKIIDVLGD-DRALTLNGLLVDSSNAGRAWLHYWIDEDGNFRYGIIQPDQITPIYATTLDNKLLGI 159 (470) T ss_pred ccceeeecCchHHHHHHHHHHhh-hHHHHHHHHHHHHhhcCeeEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 99999999999999999999974 68888899999999999999999999999999999999999999999998999999 Q ss_pred EEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccc---------------cccccccccccccccceEeecCCC Q lcl|NC_010808. 188 VRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKL---------------TPRENGFESHSFERMPITEFSNNE 252 (512) Q Consensus 188 v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~vPvv~~~n~~ 252 (512) ||+|...+.++ ...+.++++||++.+++|.......... ........+|+||.||||+|+||+ T Consensus 160 ir~y~~~~~~~--~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~ 237 (470) T protein:vir:10 160 LRSYKQLDPDS--GKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSKNK 237 (470) T ss_pred EEEEEeeecCC--ceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccCCCeeeEEEeecCC Confidence 99998765543 3456778999999999998765543221 123345678999999999999999 Q ss_pred CCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhh-hhhhccccccchhhhhhcccccCCCCCcce Q lcl|NC_010808. 253 RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVK-KQKEANVLFLEPTVYENRDTGIETEGSVDG 331 (512) Q Consensus 253 ~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (512) +|+|+|+++++|||+||.++|++++.+++|++|+++++|+.+.+..+.. ..+..+.+.+. ..+.+.++++ T Consensus 238 ~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~---------~~~~~~~~~~ 308 (470) T protein:vir:10 238 YRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMNDLRKYKSIKIN---------NTGNGDNSGV 308 (470) T ss_pred CCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhhhhhhcCeEecc---------CCCCCcCcee Confidence 9999999999999999999999999999999999999998665544332 23333333221 1234567889 Q ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 332 GYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILK 411 (512) Q Consensus 332 ~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~ 411 (512) +|++++.+.++++.++++|.+.|+.+|++|++++..+ ||+||+||+++++++.+||+++++.|+++|++++++|+++++ T Consensus 309 ~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~-gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~ 387 (470) T protein:vir:10 309 DKLQIDIPVEARDDALKITRKNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLN 387 (470) T ss_pred EEEeecCChHHHHHHHHHHHHHHHHHhCCCCCCcccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 9999999999999999999999999999999999887 689999999999999999999999999999999999999886 Q ss_pred hccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_010808. 412 NTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKD 491 (512) Q Consensus 412 ~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~ 491 (512) .. ..++.+|+++|++++|.|.++.++++++++|++|+||+++++|+++||++|++||++|+++..+........ T Consensus 388 ~~-----~~d~~~i~i~f~~~~p~d~~e~~~~~~~~~g~iS~et~l~~~p~v~D~~~E~eri~~E~~e~~~~~~~~~~~- 461 (470) T protein:vir:10 388 FS-----DADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDKEENDPYSNQADEL- 461 (470) T ss_pred cc-----CcccceeeEEeccCCCCCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhcccccc- Confidence 53 346778999999999999999999999999999999999999999999999999999988876544321111 Q ss_pred CCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 492 PRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 492 ~~~~~~~~~~~~~~~~~~~~e 512 (512) .+..++++| T Consensus 462 ------------~~~~~dde~ 470 (470) T protein:vir:10 462 ------------NGKGVNDEQ 470 (470) T ss_pred ------------CCCCCCCCC Confidence 111111122 No 22 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=100.00 E-value=5.8e-96 Score=542.69 Aligned_cols=462 Identities=17% Similarity=0.216 Sum_probs=373.5 Q ss_pred hhccccccCCCcCeeecc-cchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc-----cccccccc Q lcl|NC_010808. 14 RENRNYLFNDEANVVYTY-DGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR-----RKEEYMAD 87 (512) Q Consensus 14 ~~~~~~~f~~~~~~~~~~-~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~-----~~~~~~~~ 87 (512) |.+ ...+|-..+-.-.+ ...+.+.....+.|.+++.+|. .+++|++++++||+|+|+++.+... ...+.+++ T Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~ 78 (474) T protein:vir:96 1 MIN-IIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHK-QKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPD 78 (474) T ss_pred Ccc-cccCCCCCCCCcchhhhccccccchHHHHHHHHHHHH-HHHHHHHHHHHHhcccCccccccchhhhcccccccccc Confidence 222 33444443333222 2222222222455788888886 5789999999999999998765432 23345688 Q ss_pred eeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEE Q lcl|NC_010808. 88 NRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKS 167 (512) Q Consensus 88 ~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~ 167 (512) +|+++||+++||++.++||+|+|+++++++++..+.|++|++ |+++..+.+++++++++|+||+++|.|++|+++++++ T Consensus 79 ~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~ 157 (474) T protein:vir:96 79 WRITTNFHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRV 157 (474) T ss_pred cccccchHHHHHHhhhhhhcccCceeccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEE Confidence 899999999999999999999999999999999999999986 7899999999999999999999999999999999999 Q ss_pred ccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccc-----cccccccccccccccc Q lcl|NC_010808. 168 DAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGL-----KLTPRENGFESHSFER 242 (512) Q Consensus 168 ~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~ 242 (512) +|.++||+||++...++++++|+|... ...++++|+++++++|....+... ..........+|+||. T Consensus 158 ~p~~~~~v~d~~~~~~~~a~ir~~~~~--------~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (474) T protein:vir:96 158 PAEQAIPIWTDKEREQLNAFIRIFTFN--------GETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWER 229 (474) T ss_pred cccceEEEEcCCCCCceEEEEEEEeec--------CeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCc Confidence 999999999999889999999998642 245789999999999987654321 1123345667899999 Q ss_pred cceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhh-hhhhhccccccchhhhhhccc Q lcl|NC_010808. 243 MPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEV-KKQKEANVLFLEPTVYENRDT 321 (512) Q Consensus 243 vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 321 (512) ||||+|+|+++|.|+|+++++|||+||.++|++++.+++|++|++|++|+.+.+..+. ..++..+++ T Consensus 230 vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i------------ 297 (474) T protein:vir:96 230 VPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEGLKYYKAI------------ 297 (474) T ss_pred cceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhhhhcccee------------ Confidence 9999999999999999999999999999999999999999999999999765543332 222222222 Q ss_pred ccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 322 GIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRR 401 (512) Q Consensus 322 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~ 401 (512) ..+++++++|++++.+.+++++++++|.++|+.+|++|+++++++++|+||+||+++++++.+||+++++.|+++|++ T Consensus 298 --~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~ 375 (474) T protein:vir:96 298 --NVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQE 375 (474) T ss_pred --eccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 234577899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_010808. 402 RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESI 481 (512) Q Consensus 402 ~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~ 481 (512) ++++|+++++. ..++.+|+++|++++|.|.++.++++++ +|++|+||+++++|+++|+++|++||++|+++.. T Consensus 376 ~~~~i~~~~g~------~~d~~~i~i~f~~~~p~~~~e~a~~~~~-~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~ 448 (474) T protein:vir:96 376 LMQFILDFNKI------KLDAKEIEITFNFNVMVNDLEQSQIGAQ-SQYLSKETLVRHHPWVDDPKAELERLDEEQLELN 448 (474) T ss_pred HHHHHHHHhCC------CcccceeeEEecCCCccCHHHHHHHHHH-cCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH Confidence 99999998653 3467789999999999999999999877 5999999999999999999999999999998776 Q ss_pred HHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 482 KKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) +.........+... .++.+.+.++.| T Consensus 449 ~~~~~~~~~~~~~~-----~~~~~~~~~e~~ 474 (474) T protein:vir:96 449 KQLPNLDDGGADGA-----QQQQQSENNQSK 474 (474) T ss_pred hhccccccccCCCC-----CCcCCCCccccC Confidence 54433332222211 111222222222 No 23 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=100.00 E-value=5.8e-96 Score=542.69 Aligned_cols=462 Identities=17% Similarity=0.216 Sum_probs=373.5 Q ss_pred hhccccccCCCcCeeecc-cchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc-----cccccccc Q lcl|NC_010808. 14 RENRNYLFNDEANVVYTY-DGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR-----RKEEYMAD 87 (512) Q Consensus 14 ~~~~~~~f~~~~~~~~~~-~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~-----~~~~~~~~ 87 (512) |.+ ...+|-..+-.-.+ ...+.+.....+.|.+++.+|. .+++|++++++||+|+|+++.+... ...+.+++ T Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~ 78 (474) T protein:vir:95 1 MIN-IIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHK-QKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPD 78 (474) T ss_pred Ccc-cccCCCCCCCCcchhhhccccccchHHHHHHHHHHHH-HHHHHHHHHHHHhcccCccccccchhhhcccccccccc Confidence 222 33444443333222 2222222222455788888886 5789999999999999998765432 23345688 Q ss_pred eeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEE Q lcl|NC_010808. 88 NRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKS 167 (512) Q Consensus 88 ~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~ 167 (512) +|+++||+++||++.++||+|+|+++++++++..+.|++|++ |+++..+.+++++++++|+||+++|.|++|+++++++ T Consensus 79 ~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~ 157 (474) T protein:vir:95 79 WRITTNFHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRV 157 (474) T ss_pred cccccchHHHHHHhhhhhhcccCceeccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEE Confidence 899999999999999999999999999999999999999986 7899999999999999999999999999999999999 Q ss_pred ccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccc-----cccccccccccccccc Q lcl|NC_010808. 168 DAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGL-----KLTPRENGFESHSFER 242 (512) Q Consensus 168 ~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~ 242 (512) +|.++||+||++...++++++|+|... ...++++|+++++++|....+... ..........+|+||. T Consensus 158 ~p~~~~~v~d~~~~~~~~a~ir~~~~~--------~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (474) T protein:vir:95 158 PAEQAIPIWTDKEREQLNAFIRIFTFN--------GETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWER 229 (474) T ss_pred cccceEEEEcCCCCCceEEEEEEEeec--------CeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCc Confidence 999999999999889999999998642 245789999999999987654321 1123345667899999 Q ss_pred cceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhh-hhhhhccccccchhhhhhccc Q lcl|NC_010808. 243 MPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEV-KKQKEANVLFLEPTVYENRDT 321 (512) Q Consensus 243 vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 321 (512) ||||+|+|+++|.|+|+++++|||+||.++|++++.+++|++|++|++|+.+.+..+. ..++..+++ T Consensus 230 vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i------------ 297 (474) T protein:vir:95 230 VPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEGLKYYKAI------------ 297 (474) T ss_pred cceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhhhhcccee------------ Confidence 9999999999999999999999999999999999999999999999999765543332 222222222 Q ss_pred ccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 322 GIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRR 401 (512) Q Consensus 322 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~ 401 (512) ..+++++++|++++.+.+++++++++|.++|+.+|++|+++++++++|+||+||+++++++.+||+++++.|+++|++ T Consensus 298 --~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~ 375 (474) T protein:vir:95 298 --NVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQE 375 (474) T ss_pred --eccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 234577899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_010808. 402 RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESI 481 (512) Q Consensus 402 ~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~ 481 (512) ++++|+++++. ..++.+|+++|++++|.|.++.++++++ +|++|+||+++++|+++|+++|++||++|+++.. T Consensus 376 ~~~~i~~~~g~------~~d~~~i~i~f~~~~p~~~~e~a~~~~~-~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~ 448 (474) T protein:vir:95 376 LMQFILDFNKI------KLDAKEIEITFNFNVMVNDLEQSQIGAQ-SQYLSKETLVRHHPWVDDPKAELERLDEEQLELN 448 (474) T ss_pred HHHHHHHHhCC------CcccceeeEEecCCCccCHHHHHHHHHH-cCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH Confidence 99999998653 3467789999999999999999999877 5999999999999999999999999999998776 Q ss_pred HHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 482 KKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) +.........+... .++.+.+.++.| T Consensus 449 ~~~~~~~~~~~~~~-----~~~~~~~~~e~~ 474 (474) T protein:vir:95 449 KQLPNLDDGGADGA-----QQQQQSENNQSK 474 (474) T ss_pred hhccccccccCCCC-----CCcCCCCccccC Confidence 54433332222211 111222222222 No 24 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=100.00 E-value=6.1e-96 Score=542.58 Aligned_cols=434 Identities=39% Similarity=0.614 Sum_probs=371.7 Q ss_pred HHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCCc---hhHHHHH Q lcl|NC_010808. 48 YIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDD---KDVLEAI 124 (512) Q Consensus 48 ~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d---~~~~~~l 124 (512) ||..|+..+++||+++.+||+|+|+++........+.++++|+++||+++||++.++|++|+|+++++.+ ++..+.| T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~~~~~~l 80 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGVMEGGSADQLSTI 80 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheeccCceEeeCCCccHHHHHHH Confidence 8888899999999999999999999988887888889999999999999999999999999999998754 4456689 Q ss_pred HHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceE Q lcl|NC_010808. 125 EAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEV 204 (512) Q Consensus 125 ~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~ 204 (512) +++|+.|+++..+.++++++++||++|+++|+|++|++++++++|.+++|+||++..+++++++|+|... .. T Consensus 81 ~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~~--------~~ 152 (440) T protein:vir:95 81 KDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVDRVVLISPLEMFVIRDLTVEQNIIAAVHLPIYA--------DK 152 (440) T ss_pred HHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEec--------Cc Confidence 9999999999999999999999999999999999999999999999999999998888999999998643 23 Q ss_pred EEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_010808. 205 FTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLND 284 (512) Q Consensus 205 ~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~ 284 (512) .++++||++.+++|.....+.. ....++..+|+||.||||+|+|+++|+|+|+++++|||+||+++|++++.+++|++ T Consensus 153 ~~~~vyt~~~~~~~~~~~~~~~--~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~ 230 (440) T protein:vir:95 153 VNMTVYTKDKVITYKPYSNNSV--RLVVDDVKKHSYNDVPVVEWWNNRFRMGDYESEISLIDAYDAGQSDTANYMSDLND 230 (440) T ss_pred eEEEEEeCCeEEEEEEecCCcc--ceeecceeeccCceeeEEEeeCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhc Confidence 4678999999999886544322 33456788999999999999999999999999999999999999999999999999 Q ss_pred ceeeeecCCcC---ChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_010808. 285 AMLLIKGNLSL---DPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTP 361 (512) Q Consensus 285 ~~lv~~g~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 361 (512) |++|++|.... +.+....++..+.+..... ......+++++++|++++++.+++++++++|.++|+.+|++| T Consensus 231 ~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p 305 (440) T protein:vir:95 231 AMLLVKGDLDGIKLSPEDAAKMKDANMLFLKTG-----ISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIP 305 (440) T ss_pred ceeeeecccccCCCCccchhhhhhccceecccc-----cccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCc Confidence 99999996433 4444455555555443221 222344667899999999999999999999999999999999 Q ss_pred ccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHH Q lcl|NC_010808. 362 NMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEEL 441 (512) Q Consensus 362 ~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~ 441 (512) +++++.+++|+||+||++++++|.+||+++++.|+++|++++++|+++++...+. ..+..+++++|++++|+|.++.+ T Consensus 306 ~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~--~~~~~~v~i~f~~~~p~~~~~~a 383 (440) T protein:vir:95 306 NLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGP--VIEANKLTFTFHPNIPQDVWTEI 383 (440) T ss_pred ccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc--ccccccceEEeCCCCCCCHHHHH Confidence 9999999999999999999999999999999999999999999999998876543 45677899999999999999999 Q ss_pred HHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCC Q lcl|NC_010808. 442 KAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDD 503 (512) Q Consensus 442 ~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (512) ++++|++|++|+||+++++|++++ ++|++||++|+++........... .++.+++++ T Consensus 384 d~~~kl~g~iS~et~~~~l~~~d~-~~E~~ri~~E~~~~~~~~~~~~~~----~~~~~~~~e 440 (440) T protein:vir:95 384 KAYIEAGGEISQETLMENASFTDY-KTEHSRILKQGGSSDLEIGQIVGD----ADVGQADTE 440 (440) T ss_pred HHHHHHhccCcHHHHHHhCCCCCc-HHHHHHHHHHHHHhhhhHHhhccC----CCCCCcCCC Confidence 999999999999999999999854 679999999988765443222211 111111111 No 25 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=100.00 E-value=3.5e-95 Score=538.45 Aligned_cols=429 Identities=27% Similarity=0.453 Sum_probs=372.2 Q ss_pred hcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCCchh Q lcl|NC_010808. 40 QNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKD 119 (512) Q Consensus 40 ~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~ 119 (512) -..++|.++|.+|. .+.+||+++++||+|+|+++.+. ...+.++++|+++||+++||++.++|++|+|++++++++. T Consensus 1 l~~~~l~~~i~~~~-~~~~r~~~l~~yy~g~~~il~~~--~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~ 77 (429) T protein:vir:98 1 MTKDLLSELIQKHR-SFNLSYSAYKQLYEGDHAILQQK--QKEQYKPDNRLVVNFAKYIVDTFNGYFIGVPVQTSHENKQ 77 (429) T ss_pred CCHHHHHHHHHHHH-HHHHHHHHHHHHhcccccccccc--ccccCCCcceeecchHHHHHHHHhhhhcccCceeecCChH Confidence 23677889998886 55699999999999999987543 3566788999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccC Q lcl|NC_010808. 120 VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKT 199 (512) Q Consensus 120 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~ 199 (512) .++.|++||+.|+++..+.++++++++||+||+++|.+++|.+++++++|.+++|+||++...++++++|+|... T Consensus 78 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~~~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~----- 152 (429) T protein:vir:98 78 VSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDENAEAGITYLTPLEAFIVYDDSIRQKPLFAVRYFYNK----- 152 (429) T ss_pred HHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEEEEEEcccceEEEEeCCCCCceEEEEEEEEec----- Confidence 999999999999999999999999999999999999999999999999999999999998888999999998542 Q ss_pred CcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 200 DEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYM 279 (512) Q Consensus 200 ~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~ 279 (512) ....+..+|+.+.+++|.....+. ...+..+|+||.||||+|+|+++|+|+|+++++|+|+||+++|++++.+ T Consensus 153 --~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liD~~d~~~s~~~~~~ 225 (429) T protein:vir:98 153 --GGVLEGSYSDASNITYFKDGEKGI-----EIGESEPHPFDGVPMIEYVENEERQSLLASVVTLINAFNKAISEKANDV 225 (429) T ss_pred --CceEEEEEEeCceEEEEEecCCce-----EecccccccCCccceEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHH Confidence 245677889999988887655443 3456789999999999999999999999999999999999999999999 Q ss_pred HHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhc Q lcl|NC_010808. 280 SDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTN 359 (512) Q Consensus 280 ~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~ 359 (512) ++|++|+++++|.... .+....+...+++.+.. +.+.+++++|++++.+.+++++++++|.+.|+.+|+ T Consensus 226 ~~~~~p~~~i~g~~~~-~~~~~~~~~~~~~~~~~----------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~ 294 (429) T protein:vir:98 226 EYFADAYLKILGAELD-DETLKSLRDTRIINLKD----------TDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAM 294 (429) T ss_pred HHhcCceeeeecCCCC-cchhhhHhhCceeeccC----------CCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhC Confidence 9999999999997543 33444454455443321 224567899999999999999999999999999999 Q ss_pred ccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHH Q lcl|NC_010808. 360 TPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIE 439 (512) Q Consensus 360 ~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~ 439 (512) +|+++++++ ||+||+||+++++++.+|++++++.|+++|++++++|+++++..+. ..++.+|++.|++++|+|.++ T Consensus 295 ~p~~~~~~~-gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~---~~d~~~i~v~f~~~~p~~~~~ 370 (429) T protein:vir:98 295 VANISDESF-GTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIG---PKDWIGIKYKFTRNLPANLLE 370 (429) T ss_pred ccccCcccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC---ccccccceEEeCCCCCcCHHH Confidence 999998887 7899999999999999999999999999999999999999876543 356778999999999999999 Q ss_pred HHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCC Q lcl|NC_010808. 440 ELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDD 498 (512) Q Consensus 440 ~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 498 (512) .+++++|++|++|+||+++++|+++|+++|++||++|+++..+.++.....+......+ T Consensus 371 ~a~~~~kl~g~is~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 429 (429) T protein:vir:98 371 ESQIAGNLAGIVSEETQVGVLSIVENPQKEIERKNSDKSTLISRQAGGLNGQNTTTILE 429 (429) T ss_pred HHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhcCCCCCCCCC Confidence 99999999999999999999999999999999999999987765554443332222111 No 26 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=100.00 E-value=6.7e-95 Score=536.89 Aligned_cols=479 Identities=20% Similarity=0.263 Sum_probs=383.2 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc- Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR- 79 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~- 79 (512) || +-|..+-..+.. .+.. ...........+...|.+++.+| +.++++++++||.|+|+++.+... T Consensus 1 ~~--~~~~~~~~~~~~--------~~~~-~~~~~~~~~~~~~~~i~~~i~~~---~~~~~~~~~~YY~g~~~i~~~~~~~ 66 (503) T protein:vir:59 1 MA--DIYPLGKTHTEE--------LNEI-IVESAKEIAEPDTTMIQKLIDEH---NPEPLLKGVRYYMCENDIEKKRRTY 66 (503) T ss_pred Cc--ccccCChhhHHh--------HHHh-hhhhhhhccchhHHHHHHHHHhh---cHHHHHHHHHHhccccchhhccchh Confidence 33 223222211111 1111 11112222222345677777766 357899999999999998765433 Q ss_pred -------cccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEE Q lcl|NC_010808. 80 -------RKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYE 152 (512) Q Consensus 80 -------~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~ 152 (512) .....++++|+++||+++||++.++|++|+|++++++++++.+.|+.|++ |+++..+.+++++++++|++|+ T Consensus 67 ~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~yl~g~~~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~ 145 (503) T protein:vir:59 67 YDAAGQQLVDDTKTNNRTSHAWHKLFVDQKTQYLVGEPVTFTSDNKTLLEYVNELAD-DDFDDILNETVKNMSNKGIEYW 145 (503) T ss_pred cccccccccccccccceeecchHHHHHHHHHhhhhcCCeeeccCcHHHHHHHHHHHh-cCHHHHHHHHHHHHhhCCeEEE Confidence 23455778999999999999999999999999999999999999988875 8999999999999999999999 Q ss_pred EEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccc--- Q lcl|NC_010808. 153 LMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLT--- 229 (512) Q Consensus 153 ~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~--- 229 (512) +||.|++|++++++++|.+++|+|++....++.++||+|.....+ ...+.++++||++.+++|......+.... T Consensus 146 ~v~~d~dg~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~---~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~ 222 (503) T protein:vir:59 146 HPFVDEEGEFDYVIFPAEEMIVVYKDNTRRDILFALRYYSYKGIM---GEETQKAELYTDTHVYYYEKIDGVYQMDYSYG 222 (503) T ss_pred EEeecCCCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEecCC---CceEEEEEEEeCCcEEEEEEcCCccccccccc Confidence 999999999999999999999999999889999999999865433 34567899999999999987765543211 Q ss_pred ------ccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh- Q lcl|NC_010808. 230 ------PRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK- 302 (512) Q Consensus 230 ------~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~- 302 (512) .......+|+|+.||||+|+|+++|.|+|+++++|||+||+++|++++.++++++|+++++|..+.+..+... T Consensus 223 ~~~~~~~~~~~~~~~~~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~ 302 (503) T protein:vir:59 223 ENNPRPHMTKGGQAIGWGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTAN 302 (503) T ss_pred ccccccceeecceeccCCccceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhhh Confidence 1223557899999999999999999999999999999999999999999999999999999986665443322 Q ss_pred hhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHH Q lcl|NC_010808. 303 QKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLF 382 (512) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~ 382 (512) +...+++ ..+++++++|++++++.++++.++++|.+.|+.++++|+++++.++||+||+||++++. T Consensus 303 ~~~~~~~--------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~ 368 (503) T protein:vir:59 303 LRYHSVI--------------KVSGDGGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETIGGGATGPALENLYA 368 (503) T ss_pred hhcccce--------------eccCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHHH Confidence 2222222 23456779999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC Q lcl|NC_010808. 383 GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLF 460 (512) Q Consensus 383 ~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~ 460 (512) ++.++|+++++.|+.+|++++++|+++++..+.... .+..+|+++|++++|.|.++.+++++++ +|++|+||+++++ T Consensus 369 ~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~-~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~l 447 (503) T protein:vir:59 369 LLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDF-NPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAVARN 447 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc-ccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHhC Confidence 999999999999999999999999999987665432 2456799999999999999999999998 6899999999999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 461 SFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 461 ~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) |+++||++|++||++|+++..+.........+...+..+++++.++...++. T Consensus 448 ~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (503) T protein:vir:59 448 PFVQDPEEELARIEEEMNQYAEMQGNLLDDEGGDDDLEEDDPNAGAAESGGA 499 (503) T ss_pred CCCCCHHHHHHHHHHHHHHHHhhhccccCccCCCCCCCcCCCCCCcccCCCC Confidence 9999999999999999988777665554444433333333333333333333 No 27 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=100.00 E-value=9.1e-95 Score=536.14 Aligned_cols=461 Identities=18% Similarity=0.262 Sum_probs=370.1 Q ss_pred eccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc-----cc Q lcl|NC_010808. 7 FETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR-----RK 81 (512) Q Consensus 7 ~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~-----~~ 81 (512) -+..|. ..+-+|.. .+......+.+ .+.|.++|.+|. .+++|++++.+||+|+|+++.+... .. T Consensus 1 ~~~~~~---~~~~~~~~----~~~~~~~~~~~---~~~i~~~i~~~~-~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~ 69 (472) T protein:vir:93 1 MYPSQP---TQTEIFDA----IVRTNNKPETL---EEMIVRYIKQHL-EKLPEISIGQEYYEQRPDIVKEPKPVDATGAV 69 (472) T ss_pred CCCCCC---cchhhhhc----eeeecCchhhH---HHHHHHHHHHHH-HHHHHHHHHHHHhccccccccccchhhccccc Confidence 111110 01112211 11122222222 356778888775 5679999999999999998765433 24 Q ss_pred cccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCc Q lcl|NC_010808. 82 EEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDE 161 (512) Q Consensus 82 ~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~ 161 (512) .+.++++|+++||+++||++.++|++|+|+++++++++..+.|++||+ |+++..+.++++++++||+||++||.|++|+ T Consensus 70 ~~~~~~~ri~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~ 148 (472) T protein:vir:93 70 DPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGE 148 (472) T ss_pred cccccccccccchHHHHHHHHhhhhcccCeeeccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCeEEEEEEECCCCc Confidence 456788899999999999999999999999999999999999999986 6899999999999999999999999999999 Q ss_pred eEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcc-----ccccccccccc Q lcl|NC_010808. 162 TRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNG-----LKLTPRENGFE 236 (512) Q Consensus 162 ~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~-----~~~~~~~~~~~ 236 (512) +++++++|.+++|+||++..+++++++|+|.... ..++++|++..+++|....... ........... T Consensus 149 ~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (472) T protein:vir:93 149 FKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN--------ETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFS 220 (472) T ss_pred eEEEEEcccceEEEEcCCCCCceEEEEEEEEeec--------ceeEEEEecCeEEEEEEecCeeeecccccccccccccc Confidence 9999999999999999988899999999997542 2357899999998887654432 12233445678 Q ss_pred cccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh-hhhccccccchhh Q lcl|NC_010808. 237 SHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK-QKEANVLFLEPTV 315 (512) Q Consensus 237 ~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-~~~~~~~~~~~~~ 315 (512) +|+||.||||+|+|+++|+|+|+++++|||+||+++|++++.+++|++|++|++|....+..+... ++..++ T Consensus 221 ~~~~~~vPvv~~~nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~------- 293 (472) T protein:vir:93 221 TGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGA------- 293 (472) T ss_pred cCCCCCcceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccchhhHHHHhhccc------- Confidence 899999999999999999999999999999999999999999999999999999986554443322 111121 Q ss_pred hhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 316 YENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLF 395 (512) Q Consensus 316 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~ 395 (512) ...+++++++|++++++.+++++++++|.++|+.+|++|+++++.+++|+||+||++++.+|.+||+++++.| T Consensus 294 -------~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~ 366 (472) T protein:vir:93 294 -------IKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKA 366 (472) T ss_pred -------cccCCCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHHHHHHHHHHHH Confidence 1235577899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_010808. 396 TKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEE 475 (512) Q Consensus 396 ~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~ 475 (512) +++|++++++|+++++.. .++.+++++|++++|+|.++.+++++|++|++|+||+++++|+++|+++|++||++ T Consensus 367 ~~~l~~~~~li~~~~~~~------~~~~~i~v~f~~~~p~~~~~~~~~~~k~~giis~et~l~~l~~~~d~~~E~~ri~~ 440 (472) T protein:vir:93 367 KVAIQELLWFVFEHFDIK------GEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQ 440 (472) T ss_pred HHHHHHHHHHHHHHhCCC------cccceeeEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHH Confidence 999999999999987543 35678999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCC Q lcl|NC_010808. 476 DEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 511 (512) Q Consensus 476 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (512) |+++..+..+......+.... +++..++.+++ T Consensus 441 E~~~~~~~~~~~~~~~~d~~~----~~~~~~~~~~e 472 (472) T protein:vir:93 441 EQMEYNKQLPNLDDGGADGAQ----QQERSNNKESE 472 (472) T ss_pred HHHHHHHhccCcCcccCCCCC----CCCCCCcccCC Confidence 998876655443222222111 11111111111 No 28 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=100.00 E-value=1.1e-94 Score=535.69 Aligned_cols=454 Identities=23% Similarity=0.321 Sum_probs=373.9 Q ss_pred hhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---cccc------------c Q lcl|NC_010808. 14 RENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKN---LVEL------------T 78 (512) Q Consensus 14 ~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~---~~~~------------~ 78 (512) |.+-+++.+-.... + ..+.|.++|..|. ..++|+.++.+||+|.+.. ..++ . T Consensus 1 ~~~~~~~~~~~~~~----------~--~~e~i~~~i~~~~-~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~ 67 (474) T protein:vir:94 1 MTLYKLIDDIEAQG----------I--LPKHIEALIESHK-DDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGN 67 (474) T ss_pred CchHHHHhhccccC----------C--CHHHHHHHHHHhh-hhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhccc Confidence 44444443332211 1 2356888898885 5688999999999996542 2111 1 Q ss_pred ccccccccceeeecchHHHHHHHHHhhhhccCceecCC-----chhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEE Q lcl|NC_010808. 79 RRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDD-----DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEL 153 (512) Q Consensus 79 ~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~-----d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~ 153 (512) ......++++|+++||+++||++.++|++|+|++|+++ ++.+.+.|++||+.|+++.++.+++++++++|+||++ T Consensus 68 ~~~~~~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~ 147 (474) T protein:vir:94 68 VRRLDVSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARL 147 (474) T ss_pred ccccccCcccccccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEE Confidence 12445678899999999999999999999999999874 3566789999999999999999999999999999999 Q ss_pred EEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccccc Q lcl|NC_010808. 154 MIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPREN 233 (512) Q Consensus 154 v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~ 233 (512) +|.+++|++++++++|.+++|+||++ .++++++|+|...... .....+++++||++.++.|...+.+. +... T Consensus 148 ~~~d~~~~~~~~~i~p~~~~~v~d~~--~~~~~~i~~~~~~~~~--~~~~~~~~~~y~~~~~~~~~~~~~~~----~~~~ 219 (474) T protein:vir:94 148 AYIDTNGDIRIKNIDPYNVIFVGDNI--LEPTYSLRYFYEKDDD--NGTDYVYAEFYDNAYYYVFRGEGIDA----LQEV 219 (474) T ss_pred EEeCCCCeeEEEEEcccceEEEEcCC--CceEEEEEEEEEeeCC--CceEEEEEEEEcCceEEEEeecCCCc----cccc Confidence 99999999999999999999999875 4678999999876533 34567789999999999998765443 3456 Q ss_pred ccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccch Q lcl|NC_010808. 234 GFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEP 313 (512) Q Consensus 234 ~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~ 313 (512) +..+|+||.||||+|+|+++|.|+|+++++|||+||.++|++++.++++++|+++++|+... .+....+...+.+.+ T Consensus 220 ~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~-~~~~~~~~~~~~i~~-- 296 (474) T protein:vir:94 220 GRYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMS-EEMIQETQKSGAFEL-- 296 (474) T ss_pred ccccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCC-chhhhhhhhcceeEe-- Confidence 77899999999999999999999999999999999999999999999999999999997443 344444444444332 Q ss_pred hhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 314 TVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEG 393 (512) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~ 393 (512) .+++++++|++++.+.+++++++++|.+.|+.+|++|++++++++||+||+||+++++++.+||+++++ T Consensus 297 -----------~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~ 365 (474) T protein:vir:94 297 -----------FDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFER 365 (474) T ss_pred -----------cCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHH Confidence 245678999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHH Q lcl|NC_010808. 394 LFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKI 473 (512) Q Consensus 394 ~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri 473 (512) .|+++|++++++|+++++..+....+.++.+++++|++++|.|.++.|++++++.|++|+||+++++|+++|+++|++|| T Consensus 366 ~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri 445 (474) T protein:vir:94 366 KMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLKGQVSERTRLGQSQLVDDVDYELDEM 445 (474) T ss_pred HHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHH Confidence 99999999999999999988766566778899999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhcccCCCCCCCCCCCCCCc Q lcl|NC_010808. 474 EEDEKESIKKAQKGIYKDPRDINDDEQDDDTK 505 (512) Q Consensus 474 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (512) ++|+++..+.......+ ..++.++.++++ T Consensus 446 ~~E~~e~~~~~~~~~~~---~~~~~~~~~~s~ 474 (474) T protein:vir:94 446 EKESLEFNDKLPDIDEG---DANDKSQNNQSE 474 (474) T ss_pred HHHHHHHHhhcccccCC---CcCCCCccccCC Confidence 99987765543222111 111111111111 No 29 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=100.00 E-value=1.1e-94 Score=535.69 Aligned_cols=454 Identities=23% Similarity=0.321 Sum_probs=373.9 Q ss_pred hhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---cccc------------c Q lcl|NC_010808. 14 RENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKN---LVEL------------T 78 (512) Q Consensus 14 ~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~---~~~~------------~ 78 (512) |.+-+++.+-.... + ..+.|.++|..|. ..++|+.++.+||+|.+.. ..++ . T Consensus 1 ~~~~~~~~~~~~~~----------~--~~e~i~~~i~~~~-~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~ 67 (474) T protein:vir:10 1 MTLYKLIDDIEAQG----------I--LPKHIEALIESHK-DDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGN 67 (474) T ss_pred CchHHHHhhccccC----------C--CHHHHHHHHHHhh-hhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhccc Confidence 44444443332211 1 2356888898885 5688999999999996542 2111 1 Q ss_pred ccccccccceeeecchHHHHHHHHHhhhhccCceecCC-----chhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEE Q lcl|NC_010808. 79 RRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDD-----DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEL 153 (512) Q Consensus 79 ~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~-----d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~ 153 (512) ......++++|+++||+++||++.++|++|+|++|+++ ++.+.+.|++||+.|+++.++.+++++++++|+||++ T Consensus 68 ~~~~~~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~ 147 (474) T protein:vir:10 68 VRRLDVSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARL 147 (474) T ss_pred ccccccCcccccccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEE Confidence 12445678899999999999999999999999999874 3566789999999999999999999999999999999 Q ss_pred EEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccccc Q lcl|NC_010808. 154 MIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPREN 233 (512) Q Consensus 154 v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~ 233 (512) +|.+++|++++++++|.+++|+||++ .++++++|+|...... .....+++++||++.++.|...+.+. +... T Consensus 148 ~~~d~~~~~~~~~i~p~~~~~v~d~~--~~~~~~i~~~~~~~~~--~~~~~~~~~~y~~~~~~~~~~~~~~~----~~~~ 219 (474) T protein:vir:10 148 AYIDTNGDIRIKNIDPYNVIFVGDNI--LEPTYSLRYFYEKDDD--NGTDYVYAEFYDNAYYYVFRGEGIDA----LQEV 219 (474) T ss_pred EEeCCCCeeEEEEEcccceEEEEcCC--CceEEEEEEEEEeeCC--CceEEEEEEEEcCceEEEEeecCCCc----cccc Confidence 99999999999999999999999875 4678999999876533 34567789999999999998765443 3456 Q ss_pred ccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccch Q lcl|NC_010808. 234 GFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEP 313 (512) Q Consensus 234 ~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~ 313 (512) +..+|+||.||||+|+|+++|.|+|+++++|||+||.++|++++.++++++|+++++|+... .+....+...+.+.+ T Consensus 220 ~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~-~~~~~~~~~~~~i~~-- 296 (474) T protein:vir:10 220 GRYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMS-EEMIQETQKSGAFEL-- 296 (474) T ss_pred ccccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCC-chhhhhhhhcceeEe-- Confidence 77899999999999999999999999999999999999999999999999999999997443 344444444444332 Q ss_pred hhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 314 TVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEG 393 (512) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~ 393 (512) .+++++++|++++.+.+++++++++|.+.|+.+|++|++++++++||+||+||+++++++.+||+++++ T Consensus 297 -----------~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~ 365 (474) T protein:vir:10 297 -----------FDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFER 365 (474) T ss_pred -----------cCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHH Confidence 245678999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHH Q lcl|NC_010808. 394 LFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKI 473 (512) Q Consensus 394 ~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri 473 (512) .|+++|++++++|+++++..+....+.++.+++++|++++|.|.++.|++++++.|++|+||+++++|+++|+++|++|| T Consensus 366 ~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri 445 (474) T protein:vir:10 366 KMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLKGQVSERTRLGQSQLVDDVDYELDEM 445 (474) T ss_pred HHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHH Confidence 99999999999999999988766566778899999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhcccCCCCCCCCCCCCCCc Q lcl|NC_010808. 474 EEDEKESIKKAQKGIYKDPRDINDDEQDDDTK 505 (512) Q Consensus 474 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (512) ++|+++..+.......+ ..++.++.++++ T Consensus 446 ~~E~~e~~~~~~~~~~~---~~~~~~~~~~s~ 474 (474) T protein:vir:10 446 EKESLEFNDKLPDIDEG---DANDKSQNNQSE 474 (474) T ss_pred HHHHHHHHhhcccccCC---CcCCCCccccCC Confidence 99987765543222111 111111111111 No 30 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=100.00 E-value=1.7e-94 Score=534.61 Aligned_cols=434 Identities=20% Similarity=0.249 Sum_probs=369.8 Q ss_pred hcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc-----cccccccceeeecchHHHHHHHHHhhhhccCceec Q lcl|NC_010808. 40 QNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR-----RKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQ 114 (512) Q Consensus 40 ~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~-----~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~ 114 (512) -++..|.++|.+|.. +++||+++++||.|+|+++.+... .....++++|+++||+++||++.++|++|+|++|+ T Consensus 1 l~~~~i~~~i~~~~~-~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~ 79 (451) T protein:vir:10 1 MELEKIRAIISADAA-RRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTYPVLFD 79 (451) T ss_pred CCHHHHHHHHHHHHH-HHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheecccceee Confidence 247889999999874 689999999999999998765433 23345678899999999999999999999999998 Q ss_pred CCch-hHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCC--------CceEEEEEccceeEEEEeCCCCceeE Q lcl|NC_010808. 115 DDDK-DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD--------DETRLYKSDAMSTFVIYDNTIERNSI 185 (512) Q Consensus 115 ~~d~-~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~--------g~~~i~~~~p~~~~~i~d~~~~~~~~ 185 (512) ++++ +..+.|+.++ .|+++..+.+++++++++|+||+++|++++ |.+++++++|.+++|+|++++.+++. T Consensus 80 ~~~~~~~~~~~~~~~-~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vydd~~~~~~~ 158 (451) T protein:vir:10 80 IDNNKELNEKVTDVL-GNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYRNGIERELE 158 (451) T ss_pred cCCcHHHHHHHHHHh-ccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEcCCCCCceE Confidence 7664 4556666555 589999999999999999999999999986 78899999999999999999889999 Q ss_pred EEEEEeeeeeeccCC--cceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHH Q lcl|NC_010808. 186 AGVRYLRTKPIDKTD--EDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVIT 263 (512) Q Consensus 186 ~~v~~~~~~~~~~~~--~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~ 263 (512) ++||+|.....+... ....+++++||++.+++|.....+. ..........+|+||.||||+|+|++.|.|+|+++++ T Consensus 159 ~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~vPvv~~~nn~~~~~d~e~v~~ 237 (451) T protein:vir:10 159 AVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSC-CGSQIEHITVQHRFNSVPFVEFSNNIKKQSDLSKYKK 237 (451) T ss_pred EEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCc-cccccccccccCCCCeeeEEEeccCCCCCCchhhHHH Confidence 999999876655433 2456789999999999988755443 2233456778999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhh-hhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHH Q lcl|NC_010808. 264 LIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDE-VKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQG 342 (512) Q Consensus 264 liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 342 (512) |||+||.++|++++.+++|++|+++++|+.+.+..+ ...++..+++.+.. .....+++++|++++.+.++ T Consensus 238 liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~i~~~~---------~~~~~~~~~~~l~~~~~~~~ 308 (451) T protein:vir:10 238 ILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKELKRYKTIKTET---------DSEGDSGGLKTMQIEIPTEA 308 (451) T ss_pred HHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHHHhhCCeEEecC---------cCCccCCcceEEeecCCHHH Confidence 999999999999999999999999999986655443 34444455544322 23456789999999999999 Q ss_pred HHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccc Q lcl|NC_010808. 343 TEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF 422 (512) Q Consensus 343 ~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~ 422 (512) +++++++|.++|+.+|++|+++++++ ||+||+||++++.++.+||+++++.|+++|++++++|+++++. .++ T Consensus 309 ~~~~~~~l~~~I~~~s~~p~~~~~~~-gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~-------~d~ 380 (451) T protein:vir:10 309 RKIILEILKKQIYESGQGLQQDTENF-GNASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLGV-------TDY 380 (451) T ss_pred HHHHHHHHHHHHHHHhCccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-------CCc Confidence 99999999999999999999999887 6899999999999999999999999999999999999998753 256 Q ss_pred ceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|NC_010808. 423 NTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPR 493 (512) Q Consensus 423 ~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 493 (512) .+++++|++++|.|.++.++++++++|++|+||+++++|+++||++|++++++|+++.....+.....-.. T Consensus 381 ~~i~i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~~p~v~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 381 KKIQQTYTRNMMSNDLEDADIATKSVGIIPTKIILRHHPWVDDVEEAEKLYLEEKKIQASKVSDDYNNFTE 451 (451) T ss_pred cceeEEecCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCCC Confidence 78999999999999999999999999999999999999999999999999999988877665544333221 No 31 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=100.00 E-value=3e-94 Score=533.32 Aligned_cols=463 Identities=17% Similarity=0.202 Sum_probs=371.9 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc- Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR- 79 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~- 79 (512) |+..|.=-.-.=+-.+.+.+-++. .. ..+.|.+++.+|. .+++|++++.+||.|+|+++.+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-~~-------------~~~~i~~~i~~~~-~~~~r~~~~~~Yy~g~~~i~~~~~~~ 65 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKY-ET-------------QEEMILRLVREHK-ENIDNITMGERYYNHHPDILDAPFKR 65 (478) T ss_pred CccccccCCchhhhHHHHHhhhcc-CC-------------hHHHHHHHHHHHH-HHHHHHHHHHHHhcccccccccchhh Confidence 554443211111111111111110 00 1356778888885 5678999999999999998765433 Q ss_pred ----cccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEE Q lcl|NC_010808. 80 ----RKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMI 155 (512) Q Consensus 80 ----~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~ 155 (512) .....++++|+++||+++||++.++|++|+||++++++++..+.|+++|+ |+++..+.++++.++++|++|++|| T Consensus 66 ~~~~~~~~~~~~~ki~~n~~k~ivd~~~~yl~g~p~~~~~~~~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~ 144 (478) T protein:vir:10 66 DVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPY 144 (478) T ss_pred hcccccccccccceeccchHHHHHHHHhhhhcccCceeecCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEE Confidence 23456788899999999999999999999999999999999999999986 8999999999999999999999999 Q ss_pred ECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccc--------- Q lcl|NC_010808. 156 RNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGL--------- 226 (512) Q Consensus 156 ~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~--------- 226 (512) .|++|++++++++|.+++|+|+++..+++.+++|+|.... ..++++|+++.+++|........ T Consensus 145 ~d~~~~~~~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~--------~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~ 216 (478) T protein:vir:10 145 VDEEGEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELDG--------AERVEYWTKDDVTFYELKEGQLIPDFYRSEDH 216 (478) T ss_pred ecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeeeC--------ceEEEEEeCCcEEEEEecCCeeeccccccccc Confidence 9999999999999999999999988899999999996532 34689999999998877554321 Q ss_pred cccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhh-hhhhh Q lcl|NC_010808. 227 KLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEV-KKQKE 305 (512) Q Consensus 227 ~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~-~~~~~ 305 (512) ..........+|+||.||||+|+|++.|+|+|+++++|||+||.++|++++.++++++|+++++|..+.+..+. ..++. T Consensus 217 ~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 296 (478) T protein:vir:10 217 IQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLKY 296 (478) T ss_pred cccceecccccccCCcceEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccccchhhhhhh Confidence 11222355678999999999999999999999999999999999999999999999999999999866554432 22222 Q ss_pred ccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHH Q lcl|NC_010808. 306 ANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLE 385 (512) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~ 385 (512) .+++. ...+.+++++|++++.+.+++++++++|.+.|+.+|++|++++++++||+||+||++++.+|. T Consensus 297 ~~~~~------------~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~ 364 (478) T protein:vir:10 297 YKAIS------------VAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLD 364 (478) T ss_pred CceeE------------ecCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCccccccchHHHHHHHHHHHHH Confidence 32222 234567889999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCC Q lcl|NC_010808. 386 QRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQD 465 (512) Q Consensus 386 ~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d 465 (512) +||+++++.|+++|++++++|+++++. .+++.+|+++|++++|.|.++.++++++++|++|+||+++++|+++| T Consensus 365 ~k~~~~~~~~~~~l~~~~~li~~~~~~------~~d~~~i~i~f~~~~p~~~~e~~~~~~~~~g~iS~et~i~~~~~v~d 438 (478) T protein:vir:10 365 LKANKLKNKTLTALQELLQYIIDFYRL------DVRVQDIEITFNFNVMVNELENSQIAMNSTGLLSKETILGNHSWVQD 438 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCC------CcccccceEEeCCCCCCCHHHHHHHHHHHhCCCChHHHHHhCCCCCC Confidence 999999999999999999999998643 35677899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 466 PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 466 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) +++|++||++|+++..+........ . ..+++..++.+++| T Consensus 439 ~~~E~~ri~~E~~~~~~~~~~~~~~-----~--~d~~~~~~~d~~~e 478 (478) T protein:vir:10 439 PVAEMERIEQENIELNQQLPDIEEG-----L--NDEQQRQSEDNQSE 478 (478) T ss_pred HHHHHHHHHHHHHHHHHhccccCCC-----C--cccccccCcCCCCC Confidence 9999999999998765433211111 1 11111222222222 No 32 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=100.00 E-value=2.2e-94 Score=534.03 Aligned_cols=463 Identities=18% Similarity=0.233 Sum_probs=369.3 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc- Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR- 79 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~- 79 (512) |. |.|....+-+++.. ++- ..........+.|.+++..|. .+++||+++++||+|+|+++.+..+ T Consensus 1 ~~--~~~~~~~~~~~~~~--------~~~---~~~~~~~~~~~~i~~~i~~~~-~~~~~~~~~~~YY~g~~~i~~~~~~~ 66 (474) T protein:vir:97 1 MF--NIIRMPWDKPYGEE--------VVE---QLKPQFETQEEMIVRLIDDHR-KQLDKITVGQRYYDKDNDIVKQMKKV 66 (474) T ss_pred Cc--ccccccCCCchhhH--------HHH---hhhhcccCHHHHHHHHHHHHH-HHHHHHHHHHHHhccccchhcccchh Confidence 11 11111111111111 111 111111123467888888885 5689999999999999998765432 Q ss_pred ----cccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEE Q lcl|NC_010808. 80 ----RKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMI 155 (512) Q Consensus 80 ----~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~ 155 (512) ..+..++++|+++||+++||++.++|++|+|+++++++++..+.|+.|++ |+++..+.+++++++++|+||+++| T Consensus 67 ~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~~~ 145 (474) T protein:vir:97 67 DVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVLD-TRWDNKLIDILTATSNKGIDWLQVY 145 (474) T ss_pred ccccccccccCcceeecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCceEEEEE Confidence 24556788999999999999999999999999999999999999999886 7899999999999999999999999 Q ss_pred ECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccc-----cc Q lcl|NC_010808. 156 RNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKL-----TP 230 (512) Q Consensus 156 ~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~-----~~ 230 (512) .|++|.+++++++|.+++|+||++..+++++++|+|... ...++++||++.+++|...+++.... .. T Consensus 146 ~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~--------~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~ 217 (474) T protein:vir:97 146 INENGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFN--------NEEKVEFWTDTTVTYYVLENGGLIPDYYYGANH 217 (474) T ss_pred ecCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEec--------CeEEEEEEeCCeEEEEEEcCCccccccccCcCc Confidence 999999999999999999999998889999999999753 23478999999999998766543221 22 Q ss_pred cccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh-hhhcccc Q lcl|NC_010808. 231 RENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK-QKEANVL 309 (512) Q Consensus 231 ~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-~~~~~~~ 309 (512) ......+|+||+||||+|+|+++|+|+|+++++|||+||+++|++++.++++++|+++++|..+.+..+... ++..+++ T Consensus 218 ~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i 297 (474) T protein:vir:97 218 VQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKAI 297 (474) T ss_pred ccccccccCCCccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhcccee Confidence 234567899999999999999999999999999999999999999999999999999999986555443222 2222222 Q ss_pred ccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHH Q lcl|NC_010808. 310 FLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTK 389 (512) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~ 389 (512) ..+++++++|++++.+.+++++++++|.+.|+.+|++|++++++++||+||+||++++.++.+||+ T Consensus 298 --------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~ 363 (474) T protein:vir:97 298 --------------NVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKAN 363 (474) T ss_pred --------------eccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHH Confidence 234567899999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHH Q lcl|NC_010808. 390 TKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELE 469 (512) Q Consensus 390 ~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E 469 (512) ++++.|+++|++++++|+++++. ..++.+|+++|++++|.|.++.|++++++ |++|+||+++++|+++|+++| T Consensus 364 ~k~~~~~~~l~~~~~li~~~~~~------~~d~~~i~v~f~~~~p~~~~e~a~~~~~~-g~iS~et~l~~l~~v~D~~~E 436 (474) T protein:vir:97 364 KLKNKATVAIQELISFIIDFNNL------KTDVKDIEISFNFNRMMNDAEQSQIIAQS-QYLSRETLVKSSPLVDDYKAE 436 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhCC------CcccceeeEEeccCcccCHHHHHHHHHHc-CCCCHHHHHHhCCCCCCHHHH Confidence 99999999999999999998754 24677899999999999999999999886 899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 470 VKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 470 ~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ++||++|+++..+........+.... ++ +.+.+....| T Consensus 437 ~eri~~E~~~~~~~~~~~~~~~~~~~----~~-~~~~~~~~~e 474 (474) T protein:vir:97 437 LERIEQEQMEYNKQLPNLDDGGADGA----QQ-QEGSNNKESE 474 (474) T ss_pred HHHHHHHHHHHHhhccccCCCCCCCc----cc-CCCCcccccC Confidence 99999999876554333222111111 11 1111111122 No 33 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=100.00 E-value=2.2e-94 Score=534.03 Aligned_cols=463 Identities=18% Similarity=0.233 Sum_probs=369.3 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc- Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR- 79 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~- 79 (512) |. |.|....+-+++.. ++- ..........+.|.+++..|. .+++||+++++||+|+|+++.+..+ T Consensus 1 ~~--~~~~~~~~~~~~~~--------~~~---~~~~~~~~~~~~i~~~i~~~~-~~~~~~~~~~~YY~g~~~i~~~~~~~ 66 (474) T protein:vir:94 1 MF--NIIRMPWDKPYGEE--------VVE---QLKPQFETQEEMIVRLIDDHR-KQLDKITVGQRYYDKDNDIVKQMKKV 66 (474) T ss_pred Cc--ccccccCCCchhhH--------HHH---hhhhcccCHHHHHHHHHHHHH-HHHHHHHHHHHHhccccchhcccchh Confidence 11 11111111111111 111 111111123467888888885 5689999999999999998765432 Q ss_pred ----cccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEE Q lcl|NC_010808. 80 ----RKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMI 155 (512) Q Consensus 80 ----~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~ 155 (512) ..+..++++|+++||+++||++.++|++|+|+++++++++..+.|+.|++ |+++..+.+++++++++|+||+++| T Consensus 67 ~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~~~ 145 (474) T protein:vir:94 67 DVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVLD-TRWDNKLIDILTATSNKGIDWLQVY 145 (474) T ss_pred ccccccccccCcceeecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCceEEEEE Confidence 24556788999999999999999999999999999999999999999886 7899999999999999999999999 Q ss_pred ECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccc-----cc Q lcl|NC_010808. 156 RNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKL-----TP 230 (512) Q Consensus 156 ~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~-----~~ 230 (512) .|++|.+++++++|.+++|+||++..+++++++|+|... ...++++||++.+++|...+++.... .. T Consensus 146 ~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~--------~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~ 217 (474) T protein:vir:94 146 INENGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFN--------NEEKVEFWTDTTVTYYVLENGGLIPDYYYGANH 217 (474) T ss_pred ecCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEec--------CeEEEEEEeCCeEEEEEEcCCccccccccCcCc Confidence 999999999999999999999998889999999999753 23478999999999998766543221 22 Q ss_pred cccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh-hhhcccc Q lcl|NC_010808. 231 RENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK-QKEANVL 309 (512) Q Consensus 231 ~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-~~~~~~~ 309 (512) ......+|+||+||||+|+|+++|+|+|+++++|||+||+++|++++.++++++|+++++|..+.+..+... ++..+++ T Consensus 218 ~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i 297 (474) T protein:vir:94 218 VQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKAI 297 (474) T ss_pred ccccccccCCCccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhcccee Confidence 234567899999999999999999999999999999999999999999999999999999986555443222 2222222 Q ss_pred ccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHH Q lcl|NC_010808. 310 FLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTK 389 (512) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~ 389 (512) ..+++++++|++++.+.+++++++++|.+.|+.+|++|++++++++||+||+||++++.++.+||+ T Consensus 298 --------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~ 363 (474) T protein:vir:94 298 --------------NVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKAN 363 (474) T ss_pred --------------eccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHH Confidence 234567899999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHH Q lcl|NC_010808. 390 TKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELE 469 (512) Q Consensus 390 ~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E 469 (512) ++++.|+++|++++++|+++++. ..++.+|+++|++++|.|.++.|++++++ |++|+||+++++|+++|+++| T Consensus 364 ~k~~~~~~~l~~~~~li~~~~~~------~~d~~~i~v~f~~~~p~~~~e~a~~~~~~-g~iS~et~l~~l~~v~D~~~E 436 (474) T protein:vir:94 364 KLKNKATVAIQELISFIIDFNNL------KTDVKDIEISFNFNRMMNDAEQSQIIAQS-QYLSRETLVKSSPLVDDYKAE 436 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhCC------CcccceeeEEeccCcccCHHHHHHHHHHc-CCCCHHHHHHhCCCCCCHHHH Confidence 99999999999999999998754 24677899999999999999999999886 899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 470 VKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 470 ~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ++||++|+++..+........+.... ++ +.+.+....| T Consensus 437 ~eri~~E~~~~~~~~~~~~~~~~~~~----~~-~~~~~~~~~e 474 (474) T protein:vir:94 437 LERIEQEQMEYNKQLPNLDDGGADGA----QQ-QEGSNNKESE 474 (474) T ss_pred HHHHHHHHHHHHhhccccCCCCCCCc----cc-CCCCcccccC Confidence 99999999876554333222111111 11 1111111122 No 34 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=100.00 E-value=2.8e-94 Score=533.46 Aligned_cols=472 Identities=33% Similarity=0.505 Sum_probs=379.1 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |+..+.| .++.+.++ .+++|+++|.+|+..+++||+++++||+|+|+++.++ .+ T Consensus 1 ~~~~~~~------------~~~~~~~~-------------~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~~~~-~~ 54 (489) T protein:vir:99 1 MLQEDFE------------AIDYESKL-------------WIDQLKNYISRFKAEQLERLKELKRYYLGDNNIKYRP-AK 54 (489) T ss_pred CCcccee------------eeCCCCCC-------------CHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccc-cc Confidence 3333322 22222222 3678999999999889999999999999999987654 44 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEE---- Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR---- 156 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~---- 156 (512) ..+.++++|+++||+++||++.++|++|+|+++++++++.++.|++||+.|+|+..+.+++++++++|++|+++|. T Consensus 55 ~~~~~~~~ki~~n~~~~iv~~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~ 134 (489) T protein:vir:99 55 TDKYAADNRIASDFAKYITVFEQGYMLGVPVEYKNENKDLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKID 134 (489) T ss_pred ccccCCcceeecchHHHHHHHHhhhhccCCceeecCChhHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCc Confidence 6667889999999999999999999999999999999999999999999999999999999999999999999986 Q ss_pred CCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccc Q lcl|NC_010808. 157 NQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFE 236 (512) Q Consensus 157 d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (512) |++|++++.+++|.+++|+|++....+++++||+|..... ......++++|+++.+++|........ +....... T Consensus 135 d~~~~~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~---~~~~~~~~~~y~~~~i~~~~~~~~~~~--~~~~~~~~ 209 (489) T protein:vir:99 135 DKKTEVKLYQLPAEQTFVIYDDTYQRNSLMAVHFYDIDYG---SGKRKQIIKAYTSDTIYTYEDYNLETK--GMRLKDYE 209 (489) T ss_pred CCCcceEEEEEcccceEEEEcCCCCCceEEEEEEEEEecC---CCceEEEEEEEeCCcEEEEEecCCCcc--cceecccc Confidence 5678999999999999999999888899999999876433 334567899999999999987654332 23456778 Q ss_pred cccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhcc-----cccc Q lcl|NC_010808. 237 SHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEAN-----VLFL 311 (512) Q Consensus 237 ~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~-----~~~~ 311 (512) +|+||.||||+|+|+++|+|+|+++++|||+||.++|++++.++++++|+++++|......+......... .... T Consensus 210 ~~~~g~vPvv~~~n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~~~ 289 (489) T protein:vir:99 210 GHFFKGVPVNEYANNEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRLAI 289 (489) T ss_pred cccCCceeEEEeecCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccccchhhhhhccccccccccc Confidence 99999999999999999999999999999999999999999999999999999997544433211111000 0000 Q ss_pred chhh-----hhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHH Q lcl|NC_010808. 312 EPTV-----YENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQ 386 (512) Q Consensus 312 ~~~~-----~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~ 386 (512) .... ..........+.+++++|++++.+.+++++++++|.+.|+.+|++|+++++++++|+||+||+++++++.+ T Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ 369 (489) T protein:vir:99 290 SIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKFSGVQSGESMKYKLMASDN 369 (489) T ss_pred ccccccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHH Confidence 0000 00111122234467899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCC-CcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCC- Q lcl|NC_010808. 387 RTKTKEGLFTKGLRRRAKLLETILKNTRSI-DANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQ- 464 (512) Q Consensus 387 k~~~~~~~~~~~l~~~~~li~~~l~~~~~~-~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~- 464 (512) ||.++++.|+.+|++++++|+++++..+.. .....+.+++++|++++|.|.++.++++++++|++|+||+++++|+++ T Consensus 370 k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~giis~et~~~~l~~v~~ 449 (489) T protein:vir:99 370 YREKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLYGIVSDQTIFEILNTVTG 449 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhcCCCCc Confidence 999999999999999999999998866543 334456789999999999999999999999999999999999999997 Q ss_pred -CHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcc Q lcl|NC_010808. 465 -DPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTV 508 (512) Q Consensus 465 -d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (512) |+++|++||++|+++.....+....++.. ++.+++.+.. T Consensus 450 ~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~-----~~~~~~~~~p 489 (489) T protein:vir:99 450 VDAEAELKRLKEEADKKQSLPEPRLVGDAS-----GQEEPTAEKP 489 (489) T ss_pred hhHHHHHHHHHHHHHHHhccccccccCCCC-----CCcCCCCCCC Confidence 78899999999987655433222221111 1111111111 No 35 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=100.00 E-value=4.2e-94 Score=532.49 Aligned_cols=463 Identities=17% Similarity=0.227 Sum_probs=370.5 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |++-|. .....|..+- ..+....-....+.|.+++.+|. .+.++++++++||+|+|+++.+..+. T Consensus 1 ~~~~~~---------~~~~~~~~e~-----~~~~~~~~~~~~~~i~~~i~~~~-~~~~~~~~~~~yY~g~~~i~~~~~~~ 65 (478) T protein:vir:10 1 MISINW---------PWDKPYHEQV-----VEQIKPKYETQEEMILRLVREHK-ENIDNITMGERYYNHHPDILDAPPKR 65 (478) T ss_pred CccccC---------CCCchhHHHH-----HHHHhhccCCcHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCchhcccccc Confidence 443332 2222221110 00000000001456778888776 56799999999999999987654332 Q ss_pred -----ccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEE Q lcl|NC_010808. 81 -----KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMI 155 (512) Q Consensus 81 -----~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~ 155 (512) ..+.++++|+++||+++||++.++|++|+||++++++++..+.|+++++ |+++..+.+++++++++|+||+++| T Consensus 66 ~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~d~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~~ 144 (478) T protein:vir:10 66 DVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPY 144 (478) T ss_pred ccccccccccccceeccchHHHHHHHHHhhhccCCeeeecCChHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeEEEEEE Confidence 3355678899999999999999999999999999999999999999986 7899999999999999999999999 Q ss_pred ECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccc-------- Q lcl|NC_010808. 156 RNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLK-------- 227 (512) Q Consensus 156 ~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~-------- 227 (512) .|++|++++++++|.+++|+|+++..+++++++|+|... ...++++||++++++|......... T Consensus 145 ~d~~g~~~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~--------~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~ 216 (478) T protein:vir:10 145 VDEEGEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELD--------GAERVEYWTKDDVTYYELKEGQLIPDFYRSDDH 216 (478) T ss_pred ecCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEec--------CceEEEEEeCCeEEEEEEcCCeeeccccccccc Confidence 999999999999999999999998888999999999653 2346899999999988775443211 Q ss_pred -ccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh-hhh Q lcl|NC_010808. 228 -LTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK-QKE 305 (512) Q Consensus 228 -~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-~~~ 305 (512) .........+|+||.||||+|+|+++|+|+|+++++|||+||.++|++++.+++|++|+++++|+...+..+... ++. T Consensus 217 ~~~~~~~~~~~~~~~~vPvv~~~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~ 296 (478) T protein:vir:10 217 IQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLKY 296 (478) T ss_pred cccceecccccccCCccceEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhhhhhhh Confidence 112234567899999999999999999999999999999999999999999999999999999986655433322 222 Q ss_pred ccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHH Q lcl|NC_010808. 306 ANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLE 385 (512) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~ 385 (512) .+ ......+.+++++|++++++.+++++++++|.+.|+.+|++|++++++++||+||+||++++++|. T Consensus 297 ~~------------~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~ 364 (478) T protein:vir:10 297 YK------------AISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLD 364 (478) T ss_pred cc------------eEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHH Confidence 22 222334567889999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCC Q lcl|NC_010808. 386 QRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQD 465 (512) Q Consensus 386 ~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d 465 (512) +||+++++.|+++|++++++|+++++. .+++.+|+++|++++|+|.++.|+++++++|++|+||+++++|+++| T Consensus 365 ~k~~~~~~~~~~~l~~~~~li~~~~g~------~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D 438 (478) T protein:vir:10 365 LKANKLKNKTLTALQELLQYIIDFYRL------DVKVQDIEITFNFNVMVNELENSQIAMNSTGLLSKETILSNHAWVED 438 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCC------CcccccceEEecCCCCCCHHHHHHHHHHHhCCCChHHHHHhCCCCCC Confidence 999999999999999999999998642 35677899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 466 PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 466 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) +++|++||++|+++..+..... ..+.. .+.+.+++.++.| T Consensus 439 ~~~E~~ri~~E~~~~~~~~~~~-~~~~~------~~~~~~~~~~~~~ 478 (478) T protein:vir:10 439 PVAEMERIEQENIELNQQLPDI-EEGLN------GEQQRQSENNQPE 478 (478) T ss_pred HHHHHHHHHHHHHHHHhhcccc-ccccC------CCCCCCCCCCCCC Confidence 9999999999987655432222 11111 1222222233333 No 36 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=100.00 E-value=1.3e-93 Score=529.82 Aligned_cols=456 Identities=21% Similarity=0.282 Sum_probs=371.7 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHH-HHHHHHHHHHHhcccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDY-QRPRLKVLSDYYEGKTKNLVELTR 79 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~-~~~r~~~~~~yy~G~~~~~~~~~~ 79 (512) |+..+-+..++ +.+..... ...++.++|.+|... +.++|+++++||+|+|+++.++.. T Consensus 1 ~~~~~~~~~~~-----------------~~~~~~~~----~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~ 59 (479) T protein:vir:79 1 MLNIYISETDL-----------------IKVQLKKE----STINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRY 59 (479) T ss_pred CCCceecccce-----------------EeeccccC----ChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccc Confidence 44333222222 11111111 123444555554433 568899999999999998876543 Q ss_pred c-------ccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEE Q lcl|NC_010808. 80 R-------KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYE 152 (512) Q Consensus 80 ~-------~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~ 152 (512) . ....++++|+++||+++||++.++|++|+|+++++++++.++.++.|++ |+|+..+.++++.++++|++|+ T Consensus 60 ~~~~~~~~~~~~~~~~ki~~~~~~~Ivd~~~~~l~g~p~~~~~~~~~~~~~~~~~~~-n~~~~~~~~~~~~~~~~G~~~~ 138 (479) T protein:vir:79 60 YLLDGAKVDDFTKVNNKAINNYHKLLVDQKVGYSVGNPIVFNADDDNLTKLLNDLLG-EEFDDTITELYLNASNKGVEWL 138 (479) T ss_pred cccccccccccccCcceeecchHHHHHHHHHhhhhcCCceeccCCHHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeEEE Confidence 3 3445788899999999999999999999999999999988888776665 8999999999999999999999 Q ss_pred EEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccc----- Q lcl|NC_010808. 153 LMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLK----- 227 (512) Q Consensus 153 ~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~----- 227 (512) ++|.|++|++++++++|.+++|+||++...++++++|+|.....+ .+.+.++++|+++.+++|......... T Consensus 139 ~v~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~~~---~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~ 215 (479) T protein:vir:79 139 HPYINRKGEFKYVIIPAEEAIPIWDSKRQRELVAFIRFYYIEDID---GNKIKRVEYYTENDITYFIERGNSFIQEFLYD 215 (479) T ss_pred EEEeCCCCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEeecC---CceEEEEEEEeCCcEEEEEecCCccccccccc Confidence 999999999999999999999999998888999999999876544 345678999999999999876654321 Q ss_pred ---------ccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChh Q lcl|NC_010808. 228 ---------LTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPD 298 (512) Q Consensus 228 ---------~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~ 298 (512) .........+|+||.||||+|+|+++|+|+|+++++|||+||.++|++++.+++|++|+++++|..+.+.. T Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~ 295 (479) T protein:vir:79 216 EYGKMTDIQEGHFRINNKEQGWGKVPFIPFKNNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQ 295 (479) T ss_pred ccccccccccccccccccccCCCcccEEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccc Confidence 12234566799999999999999999999999999999999999999999999999999999997655443 Q ss_pred hh-hhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHH Q lcl|NC_010808. 299 EV-KKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAM 377 (512) Q Consensus 299 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai 377 (512) +. ..++..+++ ..+++++++|++++.+.+++++++++|.+.|+.+|++|+++++.+ ||+||+|+ T Consensus 296 ~~~~~~~~~~~i--------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-gn~Sg~Ai 360 (479) T protein:vir:79 296 EFIDNIRYYKSI--------------KVDGGGGVDKLEINIPVEAKKELLDRLEKNIIIFGQGVNPESQNT-GDKSGVAL 360 (479) T ss_pred cchhhhhhccce--------------ecCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCccccccccc-cchhHHHH Confidence 32 222222222 224567899999999999999999999999999999999998876 78999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHH Q lcl|NC_010808. 378 KYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLM 457 (512) Q Consensus 378 ~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~ 457 (512) +++++++.++|+.+++.|+++|++++++|+++++..+. ..++..+++|+|++++|.|.++.|+++++++|++|+||++ T Consensus 361 ~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~--~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl~g~iS~et~l 438 (479) T protein:vir:79 361 KFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISGN--KSYDYKTVQITFNHSMIINEAEKIDMAAKSTGIVSDETIV 438 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC--CccccccceEEeCCCCCcCHHHHHHHHHHHhccCcHHHHH Confidence 99999999999999999999999999999999887654 3456778999999999999999999999999999999999 Q ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCC Q lcl|NC_010808. 458 SLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDT 504 (512) Q Consensus 458 ~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (512) +++|+++|+++|++||++|+++..+........ .+...+++ T Consensus 439 ~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~------~~~~~~e~ 479 (479) T protein:vir:79 439 SNHPWVEDVNDELERLKKQEDTQKEYDDLIPNN------QDGVIDET 479 (479) T ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHHhccCcc------cCCCcCcC Confidence 999999999999999999988766544333211 11112222 No 37 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=100.00 E-value=5.7e-94 Score=531.77 Aligned_cols=441 Identities=18% Similarity=0.213 Sum_probs=366.1 Q ss_pred HHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc-------------ccccccceeeecchHHHHHHHH Q lcl|NC_010808. 36 SDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-------------KEEYMADNRVAHDYASYISDFI 102 (512) Q Consensus 36 ~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~-------------~~~~~~~~ri~~n~~~~iv~~~ 102 (512) .++....+.|.+++.+|. .++++|.++++||+|+|+++.+.... ....++++|+++||+++||++. T Consensus 1 ~~~e~~~~~i~~~~~~~~-~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~ 79 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHG-KFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQK 79 (471) T ss_pred CCHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhh Confidence 222222344666666664 57889999999999999987654322 1234577899999999999999 Q ss_pred HhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECC-CCceEEEEEccceeEEEEeCCCC Q lcl|NC_010808. 103 NGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ-DDETRLYKSDAMSTFVIYDNTIE 181 (512) Q Consensus 103 a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~-~g~~~i~~~~p~~~~~i~d~~~~ 181 (512) ++|++|+|+++++++++.++.|+.|++ |+++..+.++++.++++|+||+++|.++ +|++++.+++|.+++|+|+++.. T Consensus 80 ~~yl~G~p~~~~~~~~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~~~~~~~~p~~~~~i~d~~~~ 158 (471) T protein:vir:10 80 KAYALTYPPTFDVDDKKVNDMIVDVLG-DDYERISKQLCVNAGNAGIAWLHVWKDASDNSFRYACVDSKEVIPIYSKSLD 158 (471) T ss_pred hhhhcccCceeccCChHHHHHHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEEeeCCCCeeEEEEEcccceEEEEcCCCC Confidence 999999999999999999999999986 7999999999999999999999999985 69999999999999999999888 Q ss_pred ceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccc---------------ccccccccccccccccceE Q lcl|NC_010808. 182 RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLK---------------LTPRENGFESHSFERMPIT 246 (512) Q Consensus 182 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~vPvv 246 (512) .++++++|+|...... ......++++|+++.+++|.....+... .........+|+||.|||| T Consensus 159 ~~~~~~ir~~~~~~~~--~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 236 (471) T protein:vir:10 159 KKSIGVLRVYSSIDET--DGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFI 236 (471) T ss_pred CceEEEEEEEEeeccC--CCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCCceeEE Confidence 8999999999776543 3456778999999999999876554221 1233456679999999999 Q ss_pred eecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhh-hhhhhccccccchhhhhhcccccCC Q lcl|NC_010808. 247 EFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEV-KKQKEANVLFLEPTVYENRDTGIET 325 (512) Q Consensus 247 ~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (512) +|+|+..|+|+|+++++|||+||.++|++++.+++|++|+++++|+.+.+..+. ..++..+++.+. ..+. T Consensus 237 ~~~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~---------~~~~ 307 (471) T protein:vir:10 237 PFKNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLEDLKRYKMIKMD---------NDGM 307 (471) T ss_pred EeccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhHHHhhcCCeEEec---------CCCC Confidence 999999999999999999999999999999999999999999999865554433 334444444331 1234 Q ss_pred CCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 326 EGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKL 405 (512) Q Consensus 326 ~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~l 405 (512) +.+++++|++++.+.++++.++++|.++|+.+|++|+++++.+ ||+||+||++++.++.+||+.+++.|+++|++++++ T Consensus 308 ~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~-gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~l 386 (471) T protein:vir:10 308 GDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDKL-GNSSGVALKFLYSLLELKAGNMETQFRSGYATLVKM 386 (471) T ss_pred ccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCcccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5678899999999999999999999999999999999999887 689999999999999999999999999999999999 Q ss_pred HHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 406 LETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQ 485 (512) Q Consensus 406 i~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~ 485 (512) |+++++.. ++.+++++|++++|.|.++.++++++++|++|+||+++++|+++||++|++||++|+++..+... T Consensus 387 i~~~~~~~-------d~~~i~i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~~p~v~D~~~E~eri~~E~~~~~~~~~ 459 (471) T protein:vir:10 387 ILKHLGLS-------DKLKIKQTWTRNSINNDTEMAQVVSTLATITSRENVAKSNPIVEDWQDELRLQKAEQEGRSEKLY 459 (471) T ss_pred HHHHhccC-------CCceeEEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhccc Confidence 99987542 45688999999999999999999999999999999999999999999999999999877644222 Q ss_pred hhcccCCCCCCCCCCCCCCc Q lcl|NC_010808. 486 KGIYKDPRDINDDEQDDDTK 505 (512) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~ 505 (512) ... + . .++++.+ T Consensus 460 ~~~----~-~---~~~~e~~ 471 (471) T protein:vir:10 460 DME----E-V---EHESEVE 471 (471) T ss_pred ccC----C-C---CCccccC Confidence 111 1 1 1111111 No 38 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=100.00 E-value=5.3e-93 Score=526.46 Aligned_cols=460 Identities=18% Similarity=0.228 Sum_probs=371.9 Q ss_pred cccccCCCcCeeec---ccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc-----cccccccce Q lcl|NC_010808. 17 RNYLFNDEANVVYT---YDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR-----RKEEYMADN 88 (512) Q Consensus 17 ~~~~f~~~~~~~~~---~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~-----~~~~~~~~~ 88 (512) -..+|+..-...+. ............+.|.++|.+|. .+.+|++++.+||.|+|+++++..+ .....++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~-~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ 79 (474) T protein:vir:95 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIIRLIDDHR-KQLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDW 79 (474) T ss_pred CcceeecCCCCchhhHHHHhhhhccCChHHHHHHHHHHHH-HHHHHHHHHHHHhcccCchhccccccccccccccccccc Confidence 01222222222222 22333333333567888888875 6788999999999999998765433 234466788 Q ss_pred eeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEc Q lcl|NC_010808. 89 RVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSD 168 (512) Q Consensus 89 ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~ 168 (512) |+++||+++||++.++|++|+|+++++++++..+.|+.|++ |+++..+.++++.++++|+||+++|.+++|++++++++ T Consensus 80 ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~~i~~~~ 158 (474) T protein:vir:95 80 RITTNFHQNLVDQKVSYVASKPVTYSCEDESVLKIIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVP 158 (474) T ss_pred eeccchHHHHHHHHHhhhccCCceeccCchHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCcEEEEEEecCCCceEEEEEc Confidence 99999999999999999999999999999999999999986 78999999999999999999999999999999999999 Q ss_pred cceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccc-----ccccccccccccccc Q lcl|NC_010808. 169 AMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKL-----TPRENGFESHSFERM 243 (512) Q Consensus 169 p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~v 243 (512) |.+++|+|+++..+++++++|+|.... ..++++|+++++++|.....+.... ........+|+||.| T Consensus 159 p~~~~~v~d~~~~~~~~~~i~~~~~~~--------~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~i 230 (474) T protein:vir:95 159 AEQAIPIWVDKEREELKSFIRYYKFNN--------EEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRV 230 (474) T ss_pred ccceEEEEcCCCCCceEEEEEEEEEcC--------eeEEEEEeCCeEEEEEEcCCccccccccCcccccccccccCCCcc Confidence 999999999988889999999996532 3468999999999998766543221 223445678999999 Q ss_pred ceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh-hhhccccccchhhhhhcccc Q lcl|NC_010808. 244 PITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK-QKEANVLFLEPTVYENRDTG 322 (512) Q Consensus 244 Pvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 322 (512) |||+|+|++.|+|+|+++++|||+||.++|++++.++++++|++|++|+.+.+..+... +...+++ T Consensus 231 Pvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i------------- 297 (474) T protein:vir:95 231 PFIAFKNNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMRGLKYYKAI------------- 297 (474) T ss_pred ceEeecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhcccee------------- Confidence 99999999999999999999999999999999999999999999999987655443222 2222222 Q ss_pred cCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 323 IETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRR 402 (512) Q Consensus 323 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~ 402 (512) ..+++++++|++++.+.+++++++++|.++|+..|++|+++++++++|+||+||++++.++.+||+++++.|+++|+++ T Consensus 298 -~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~ 376 (474) T protein:vir:95 298 -NVDGDGGVETIQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQEL 376 (474) T ss_pred -eccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2355678999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 403 AKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIK 482 (512) Q Consensus 403 ~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~ 482 (512) +++|+++++. ..++.+|+++|++++|.|.++.|++++++ |++|+||+++++|+++|+++|++||++|+++..+ T Consensus 377 ~~li~~~~g~------~~d~~~i~v~f~~~~p~d~~e~a~~~~~~-g~iS~et~i~~l~~v~d~~~E~~ri~~E~~~~~~ 449 (474) T protein:vir:95 377 IGFIIDFNNL------KMDVKDIEISFNFNRMMNDAEQSQIIAQS-QYLSRETLVKSSPLVDDYKAELERIEQEQMEYNK 449 (474) T ss_pred HHHHHHHhCC------CcccceeeEEeccCCCcCHHHHHHHHHhc-CCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHh Confidence 9999998753 34677899999999999999999999885 8999999999999999999999999999988766 Q ss_pred HHHhhcccCCCCCCCCCCCCCCcCcccC Q lcl|NC_010808. 483 KAQKGIYKDPRDINDDEQDDDTKDTVDK 510 (512) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (512) ........++...+ +.+++.+...+ T Consensus 450 ~~~~~~~~~~d~~~---~~~~~~~~~~~ 474 (474) T protein:vir:95 450 QLPNLDDGGADGAQ---QQERSNDKESE 474 (474) T ss_pred cccccccccCCCCc---CCCCCccCCCC Confidence 54333222221111 11111111111 No 39 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=100.00 E-value=2.9e-92 Score=522.46 Aligned_cols=452 Identities=19% Similarity=0.248 Sum_probs=367.1 Q ss_pred CCcceeeccccchhhcccc-ccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNY-LFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR 79 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~-~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~ 79 (512) |+ .+.++++. .|-.... +.+....-..+.|.+++.+|. .+.+||+++++||.|+|+++.+... T Consensus 1 ~~---------~~~~~~~~~~~~~~~~------~~~~~~~~~~~~i~~~i~~~~-~~~~~~~~~~~yY~g~~~i~~~~~~ 64 (468) T protein:vir:96 1 MI---------DIFWPNEKPYHERVVE------QIKPQYETQEEMILRLITKHK-ENVEDITVGERYYNHQPDVLFNAPK 64 (468) T ss_pred Cc---------cccCCcCceeehheee------cccccccCcHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCcccccccc Confidence 32 22122222 1111111 111111123456778888886 5678999999999999998776544 Q ss_pred c-----ccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEE Q lcl|NC_010808. 80 R-----KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELM 154 (512) Q Consensus 80 ~-----~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v 154 (512) . ..+.++++|+++||++.||++.++|++|+|+++++++++.++.|+++|+ |+++..+.+++++++++|++|++| T Consensus 65 ~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v 143 (468) T protein:vir:96 65 RNVKGEIDPFKPDWRMYTNYHQNLVDQKVAYAVANPVTYGTEDEKSLKTIQEVLN-HKWDDKLVDILTAASNKGVEWIQP 143 (468) T ss_pred ccccccccccccccccccchHHHHHHHHHhhhccCCceeccCChHHHHHHHHHHh-cCHHHHHHHHHHHHhhcCeEEEEE Confidence 3 2345678899999999999999999999999999999999999999996 789999999999999999999999 Q ss_pred EECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccc-------- Q lcl|NC_010808. 155 IRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGL-------- 226 (512) Q Consensus 155 ~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~-------- 226 (512) |.|++|++++++++|.+++|+|+++..+++++++|+|.... ..++++|+++.+++|....+... T Consensus 144 ~~d~~~~~~i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 215 (468) T protein:vir:96 144 YVDEQGEFKTFRVPAEQAIPIWTNKERDELKAFIRLYELDG--------GERVEYWTANDVTFYELKDGQLIPDYYQGEE 215 (468) T ss_pred EEcCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEecC--------ceEEEEEeCCeEEEEEEcCCceeeccccccc Confidence 99999999999999999999999988889999999986532 24679999999998887554321 Q ss_pred -cccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh-hh Q lcl|NC_010808. 227 -KLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK-QK 304 (512) Q Consensus 227 -~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-~~ 304 (512) ..........+|+||+||||+|+|++.|.|+|+++++|||+||.++|++++.++++++|++|++|+.+.+...... ++ T Consensus 216 ~~~~~~~~~~~~~~~~~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~ 295 (468) T protein:vir:96 216 HVQAHYYVGNKSMSWNRVPFIPFKNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDLEEFMYNLK 295 (468) T ss_pred ccccceeeccccccCCcccEEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccchhhhhhh Confidence 1122345667899999999999999999999999999999999999999999999999999999986654443322 22 Q ss_pred hccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHH Q lcl|NC_010808. 305 EANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGL 384 (512) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l 384 (512) ..+++ ....+++++++|++++.+.+++++++++|.++|+.+|++|++++++++||+||+||+++++++ T Consensus 296 ~~~~i------------~~~~d~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l 363 (468) T protein:vir:96 296 YYKAI------------NVDGDGSGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNL 363 (468) T ss_pred cCceE------------EecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCcccccccccccchHHHHHHHHHHHH Confidence 22222 223456778999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCC Q lcl|NC_010808. 385 EQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQ 464 (512) Q Consensus 385 ~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~ 464 (512) .+||+.+++.|+++|++++++|+++++. ..++.+++++|++++|.|.++.|++++++ |++|+||+++++|+++ T Consensus 364 ~~k~~~k~~~~~~~l~~~~~li~~~~g~------~~d~~~i~i~f~~~~p~d~~e~a~~~~~~-g~iS~et~i~~l~~v~ 436 (468) T protein:vir:96 364 DLKANKLKNKTLTALQELLQYIIDFYKL------SIKVQDVEITFNFNVMVNELEQSQIGVNS-QYLSKETVVTNHPWVD 436 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCC------CcccceeeEEecCCCCcCHHHHHHHHHhc-CCCchHHHHHhCCCCC Confidence 9999999999999999999999998643 34677899999999999999999988764 9999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCC Q lcl|NC_010808. 465 DPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDT 504 (512) Q Consensus 465 d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (512) ||++|++||++|+++..+.... . . +.++++.+ T Consensus 437 D~~~E~~ri~~E~~~~~~~~~~-~----~---~~~~~~~~ 468 (468) T protein:vir:96 437 DPVAEMERIDQEELALPSIEEG-L----N---GKENNEPT 468 (468) T ss_pred CHHHHHHHHHHHHHHHHHHhhc-c----C---CCCCCCCC Confidence 9999999999999876654322 1 1 11122222 No 40 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=100.00 E-value=5.6e-92 Score=520.87 Aligned_cols=459 Identities=19% Similarity=0.231 Sum_probs=366.0 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |+. +....+..+..+- + .............|.++|.+|. .+.+|++++.+||+|+|+++.+..+. T Consensus 1 ~~~---------~~~~~~~~~~~~~--~---~~~~~~~~~~~~~i~~~i~~~~-~~~~~~~~~~~Yy~g~~~i~~~~~~~ 65 (474) T protein:vir:96 1 MIV---------IFWPNEKPYHERV--V---EQIKPKYETQEEMIIRLINDHK-PKIDDITVGERYYNHDPDVLRLAPKL 65 (474) T ss_pred Cee---------eccCCCchhhhhH--H---HHhhhccCChHHHHHHHHHHHH-HHHHHHHHHHHHhccCCcchhccchh Confidence 332 2222222211110 0 0111111112456778888876 56899999999999999987765432 Q ss_pred -----ccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEE Q lcl|NC_010808. 81 -----KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMI 155 (512) Q Consensus 81 -----~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~ 155 (512) ..+.++++|+++||+++||++.++||+|+|+++++++++..+.|++|++ |+++..+.+++++++++|++|+++| T Consensus 66 ~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~y 144 (474) T protein:vir:96 66 DNKGEIDPLKPDWRMFTNYHQNLVDQKVAYAVANPVTFSSDDDKSLKTIQEVLN-HKWDDKLVDILTAASNKGIEWLQPY 144 (474) T ss_pred cccccccccccchhcccchHHHHHHhhhhhhcccCceeecCchHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeeEEEEE Confidence 2345788899999999999999999999999999999999999999986 6789999999999999999999999 Q ss_pred ECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccc------- Q lcl|NC_010808. 156 RNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKL------- 228 (512) Q Consensus 156 ~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~------- 228 (512) .|++|++++++++|.+++|+|+++...++++++|+|.... ..++++||++++++|....+..... T Consensus 145 ~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~~--------~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~ 216 (474) T protein:vir:96 145 IDENGEFKTFRVPAEQAIPIWTNKERDTLKAFIRYYRLDG--------AERVEYWTDSDVTYYEYQDGILIPDYYHGEEH 216 (474) T ss_pred ecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecC--------ceEEEEEeCCeEEEEEecCCceeecccccccc Confidence 9999999999999999999999988889999999996532 3467999999999988755432211 Q ss_pred --cccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhh-hhhhh Q lcl|NC_010808. 229 --TPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEV-KKQKE 305 (512) Q Consensus 229 --~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~-~~~~~ 305 (512) ........+|+||.||||+|+|+++|+|+|+++++|||+||.++|++++.++++++|++|++|+.+.+..+. ..++. T Consensus 217 ~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~ 296 (474) T protein:vir:96 217 IQSHYYVGNKRVSWGRVPFIPFKNNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDEFMRNLKY 296 (474) T ss_pred ccccccccccccCCCceeEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccccchhhhhhc Confidence 112345678999999999999999999999999999999999999999999999999999999876554332 22333 Q ss_pred ccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHH Q lcl|NC_010808. 306 ANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLE 385 (512) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~ 385 (512) .+++.+ .+++++++|++++.+.+++++++++|.++|+.+|++|+++++++++|+||+||+++++++. T Consensus 297 ~~~i~~-------------~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~ 363 (474) T protein:vir:96 297 YKAINV-------------DGDGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLD 363 (474) T ss_pred CceEEe-------------cCCCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHH Confidence 333322 2456789999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCC Q lcl|NC_010808. 386 QRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQD 465 (512) Q Consensus 386 ~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v~d 465 (512) +||+++++.|+++|++++++|+++++. ..++.+++++|++++|.|.++.++++.+ +|++|+||+++++|+++| T Consensus 364 ~k~~~k~~~~~~~l~~~~~~i~~~~~~------~~~~~~i~i~f~~~~p~~~~e~~~~~~~-ag~iS~et~~~~~~~v~d 436 (474) T protein:vir:96 364 LKANKLKNKTLTALQELLQYIIDFYKL------NIKVQDVEITFNFNVMVNELEQSQIGVQ-SQYLSKETVVTNHPWVDD 436 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCC------CcccceeeEEeccCCCcCHHHHHHHHHh-cCCCchHHHHHhCCCCCC Confidence 999999999999999999999998643 3467789999999999999999998755 699999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCccc Q lcl|NC_010808. 466 PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVD 509 (512) Q Consensus 466 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (512) +++|++||++|+++..+.......+ ..+. ..+..++.+ T Consensus 437 ~~~E~~ri~~E~~e~~~~~~~~~~~-~~~~-----~~d~~~e~~ 474 (474) T protein:vir:96 437 PVAELERIEQDNIDFNKQLPPLEGD-ANGR-----AQDNESETN 474 (474) T ss_pred HHHHHHHHHHHHHHHHhcccccccc-cccc-----cCCCcccCC Confidence 9999999999987765433222111 1111 111111111 No 41 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=100.00 E-value=2.3e-90 Score=512.01 Aligned_cols=463 Identities=13% Similarity=0.147 Sum_probs=350.9 Q ss_pred cccc-hhHHhhhcHHHHHH-HHHHHHHHHHHHHHHHHHHhcccccccccccc--------cccccccceeeecchHHHHH Q lcl|NC_010808. 30 TYDG-TESDLLQNINEVSK-YIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR--------RKEEYMADNRVAHDYASYIS 99 (512) Q Consensus 30 ~~~~-~~~~~~~~~~~l~~-~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~--------~~~~~~~~~ri~~n~~~~iv 99 (512) ..+. ....+......+.+ ++.++.++++++++++++||+|+|+|+.++.. .....++++|+++||+++|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~Iv 80 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTELV 80 (537) T ss_pred CCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHHH Confidence 1111 01111111122333 33444467789999999999999998865533 24456788999999999999 Q ss_pred HHHHhhhhccCceecCCch---hHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEE Q lcl|NC_010808. 100 DFINGYFLGNPIQCQDDDK---DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIY 176 (512) Q Consensus 100 ~~~a~~l~g~~~~~~~~d~---~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~ 176 (512) ++.++||+|+||+|++.++ +..+.|++++ .|+++..+.+++++++++|+||+++|.|++|++++++++|.++||+| T Consensus 81 d~~~~yl~G~Pv~~~~~d~~~~e~~~~l~~~~-~~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~~~~~i~p~~~~pv~ 159 (537) T protein:vir:78 81 DQLAQYLLSNGVEVKVKDEDNTQLDEILQEYF-DEDFQATIDTLVTNASKKGFEGIFARTTSEGKLKFQTVDGLTLIPVF 159 (537) T ss_pred HHHhhhhcccCceeecCcchhHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCeeEEEeeecCCCceEEEEEccceeEEEE Confidence 9999999999999998764 4555677766 48999999999999999999999999999999999999999999999 Q ss_pred eCCCCceeEEEEEEeeeeeecc--CCcceEEEEEEEcCCcEEEEEecCCcccc--------------------------- Q lcl|NC_010808. 177 DNTIERNSIAGVRYLRTKPIDK--TDEDEVFTVDLFTSHGVYRYLTSRTNGLK--------------------------- 227 (512) Q Consensus 177 d~~~~~~~~~~v~~~~~~~~~~--~~~~~~~~~~~yt~~~~~~~~~~~~~~~~--------------------------- 227 (512) |++ .++.+++|+|....... .....++++++||++.+++|.....+... T Consensus 160 d~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 237 (537) T protein:vir:78 160 DDY--GVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADF 237 (537) T ss_pred cCC--CCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeeccccccccc Confidence 975 45777888887655443 34467889999999999999876543211 Q ss_pred ccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhh-hhhhhc Q lcl|NC_010808. 228 LTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEV-KKQKEA 306 (512) Q Consensus 228 ~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~-~~~~~~ 306 (512) .........+|+||.||||+|+||++|.|+|+++++|||+||.++|++++.+++|++|++|++|+.+.+.++. ..++.. T Consensus 238 ~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l~~~ 317 (537) T protein:vir:78 238 EDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNIKAK 317 (537) T ss_pred cccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHHHhhc Confidence 1123345678999999999999999999999999999999999999999999999999999999876554443 333333 Q ss_pred cccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHH Q lcl|NC_010808. 307 NVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQ 386 (512) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~ 386 (512) +++.+ .+++++++|++++.+.+++++++++|.+.||.+|++|+.+. .++||+||+||++++++|.+ T Consensus 318 ~~i~v-------------~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~-~~~gn~SGvAlk~~~~~l~~ 383 (537) T protein:vir:78 318 KMIGV-------------NGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNSTA-VGDGNVTNVVIKSRYTLLAM 383 (537) T ss_pred Cceee-------------cCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCCcc-ccccCCcHHHHHHHHhhHHH Confidence 33322 23567899999999999999999999999999999999765 46789999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC Q lcl|NC_010808. 387 RTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQ 464 (512) Q Consensus 387 k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~ 464 (512) ||+.+++.|+++|++++++|+++++..+. ..+++.+|.++|++++|.|.++.|++++++ +|++|+||+++++|+++ T Consensus 384 ka~~ke~~f~~~l~~~~~~i~~~~~~~~~--~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT~l~~~p~vd 461 (537) T protein:vir:78 384 KARKMETSLRKVLRWCADMVVSDIALRGL--GEYDSNDICFEIEPHVLANELDIATTRKTEAETEALKIGNIMTVAPRIG 461 (537) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCC--cccccceeeEEeccCCCCCHHHHHHHHHHHHhcCcchHHHHHHhCCCCC Confidence 99999999999999999999999987654 345678999999999999999999999987 48999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHHhhcccCCCCCC----C----------CCCCCCCcCcccCCC Q lcl|NC_010808. 465 DPELEVKKIEEDEKESIKKAQKGIYKDPRDIN----D----------DEQDDDTKDTVDKKE 512 (512) Q Consensus 465 d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~----~----------~~~~~~~~~~~~~~e 512 (512) |++.| +++++|.+...............+.. + ..+++...++.-.++ T Consensus 462 d~e~e-k~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 522 (537) T protein:vir:78 462 DDETL-KLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPVAD 522 (537) T ss_pred CHHHH-HHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCCCC Confidence 98433 33343332222111111111000000 0 000000011111111 No 42 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=100.00 E-value=1.5e-80 Score=458.18 Aligned_cols=456 Identities=11% Similarity=0.082 Sum_probs=339.0 Q ss_pred eeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhh Q lcl|NC_010808. 27 VVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF 106 (512) Q Consensus 27 ~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l 106 (512) +++.-.+.++ .+....+.+.+.+++..+++|++++.+||+|+|++.+.+...+.+. .+.++++||+++||++.++|+ T Consensus 1 ~~~~~~~~~e--~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~~~~~-~~~~~v~n~~~~iVd~~~~~l 77 (486) T protein:vir:42 1 MTAPLPGMEE--IEDPAVVREEMISAFEDASKDLASNTSYYDAERRPEAIGVTVPREM-QQLLAHVGYPRLYVDSVAERQ 77 (486) T ss_pred CCCCCCCCCC--cccHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcccccchhH-hhhhhccchHHHHHHHHHhhh Confidence 2222222222 2233444444444456778999999999999999876555444433 356788999999999999999 Q ss_pred hccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECC--------CCceEEEEEccceeEEEEeC Q lcl|NC_010808. 107 LGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ--------DDETRLYKSDAMSTFVIYDN 178 (512) Q Consensus 107 ~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~--------~g~~~i~~~~p~~~~~i~d~ 178 (512) .+.++++. ++++..+.++++|+.|+|+..+.++++++++||+||++||.++ ++.+++++++|.+++++||+ T Consensus 78 ~~~g~~~~-~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i~d~ 156 (486) T protein:vir:42 78 AVEGFRLG-DADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHAEIDP 156 (486) T ss_pred cccceecC-CCchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEecccceEEEEeC Confidence 99888754 3455667799999999999999999999999999999999875 45678999999999999997 Q ss_pred CCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCC-----C Q lcl|NC_010808. 179 TIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----R 253 (512) Q Consensus 179 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~-----~ 253 (512) .. .++.+++++|... ..+.+.++++|+++.+++|....+.+. .....+|+||.||||+|+|++ + T Consensus 157 ~~-~~~~~~~~~~~~~-----~~~~~~~~~~y~~~~~~~~~~~~~~~~-----~~~~~~h~~g~vPvv~~~n~~~~~~~~ 225 (486) T protein:vir:42 157 RI-NRVSKAIRVAYDK-----EGNEIQAATLYTPMETIGWFRADGEWA-----EWFNVPHGLGVVPVVPLPNRTRLSDLY 225 (486) T ss_pred CC-CCeEEEEEEEEec-----CCCeEEEEEEEcCCcEEEEEecCCcEE-----eecceecCCCCceEEEeccccccCCCC Confidence 64 5799999887532 234567889999999999987665442 345678999999999999974 5 Q ss_pred CCcchHH-HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCccee Q lcl|NC_010808. 254 RKGDYEK-VITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGG 332 (512) Q Consensus 254 g~s~~~~-v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (512) |.|+|++ |++|||+||+++|++++.++++++|+++++|........ .......++....+. ....++++++ T Consensus 226 G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~-~~~~~~~~~~~~~~~-------~~~~~~~~~~ 297 (486) T protein:vir:42 226 GTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGV-DSETGQTLFDAYLAR-------ILAFEDAEGK 297 (486) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCcccccc-ccccccchhhhhhch-------hcccCCCCce Confidence 7899985 899999999999999999999999999999964332211 111111111111111 1112234566 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhccccccccccccc----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 333 YIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGT----QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 408 (512) Q Consensus 333 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n----~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 408 (512) +.+. +....++++++|+..|++++.+|++.+..|+++ +||+||++++.+|.+||+++++.|+++|+++++++++ T Consensus 298 ~~q~--~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~ 375 (486) T protein:vir:42 298 IQQF--SAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYR 375 (486) T ss_pred EEee--cccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6443 456788999999999999998888877766644 6999999999999999999999999999999999988 Q ss_pred HHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 409 ILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKA 484 (512) Q Consensus 409 ~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~----g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~ 484 (512) +++.. ..+.+..+|+++|+++.|+|.++.+++++|+. |++|++|+++++|+++|+.+|++|+++|+++..... T Consensus 376 ~~~~~---~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~ 452 (486) T protein:vir:42 376 IMKGG---DVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEEAAMGLGL 452 (486) T ss_pred HhcCC---CccccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHH Confidence 76532 23446678999999999999999999999984 789999999999999999999999998887655544 Q ss_pred HhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 485 QKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ...........+. ++..+++...+.+ T Consensus 453 ~~~~~~~~~~~~~--~~~~~~~~~~~~~ 478 (486) T protein:vir:42 453 LGTMVDADPTVPG--SPSPTAPPKPQPA 478 (486) T ss_pred HHHhhcCCCCCCC--CCCCCCCCCCCcc Confidence 3332222111111 1111111111111 No 43 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=100.00 E-value=4.9e-80 Score=455.37 Aligned_cols=450 Identities=12% Similarity=0.100 Sum_probs=332.4 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCCch Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDK 118 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~ 118 (512) .....++++.+.+.+..+++|++++.+||+|+|++.+.+...++ ...++|+++||+++||++.++|++++++... +++ T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~-~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~-~d~ 78 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPP-ELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDS 78 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccch-hhhhhhhhcchHHHHHHHHHhhhccCceecC-CCc Confidence 22233444444444567789999999999999997665544443 3346789999999999999999999998654 456 Q ss_pred hHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEE------CCCCceEEEEEccceeEEEEeCCCCceeEEEEEEee Q lcl|NC_010808. 119 DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR------NQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLR 192 (512) Q Consensus 119 ~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~------d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~ 192 (512) +..+.|+++|+.|+++.++.++++++++||+||++||. |++|.+++++++|.+++|+||+...+++.+++|+|. T Consensus 79 ~~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~~p~~~~~i~D~~~~~~~~~~i~~~~ 158 (480) T protein:vir:78 79 EGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) T ss_pred hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEEcccceEEEEcCCCccceEEEEEEEE Confidence 77889999999999999999999999999999999996 467899999999999999999988899999999986 Q ss_pred eeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCC-----CCCcchHH-HHHHHH Q lcl|NC_010808. 193 TKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----RRKGDYEK-VITLID 266 (512) Q Consensus 193 ~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~-----~g~s~~~~-v~~liD 266 (512) ..+ ....++++++|+++.+++|...++..... ....+..+|+||.||||+|+|++ +|.|+++. |++|+| T Consensus 159 ~~d----~~~~~~~~~~y~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~D 233 (480) T protein:vir:78 159 TRD----DVAVPDRATLYLPDETVPLRRNGGLNDQW-VVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) T ss_pred eec----CCcceEEEEEEeCCeEEEEEecCCCcccc-cccccccccCCCCcceEEeecccccCCccCccchhHHHHHHHH Confidence 543 23356788999999999988765543221 12346679999999999999874 58899985 999999 Q ss_pred HHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHH Q lcl|NC_010808. 267 LYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAY 346 (512) Q Consensus 267 a~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 346 (512) +||+++|++++.+++|++|+++++|........- ....++....+ ......++++++.+++ ....+++ T Consensus 234 a~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~---~~~~~~~~~~~-------~~~~~~~~~~~~~~~~--~~~~~~~ 301 (480) T protein:vir:78 234 AASRTLMNLQSASQILGTPLRVISGVTTDELTND---GENTTLDIYYG-------RILTLASEAAKISEFK--AAELRNF 301 (480) T ss_pred HHHHHHHHHHHHHHhhcchhhhhhCCCccccccc---cccchhhhhhh-------hhccCCCCCceEEecC--ccCHHHH Confidence 9999999999999999999999999643322111 11111111111 1112234556676654 3445666 Q ss_pred HHHHHHHHHHHhcccccccccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccc Q lcl|NC_010808. 347 KDRLNSDIHMFTNTPNMKDDNFSG----TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF 422 (512) Q Consensus 347 ~~~l~~~i~~~s~~p~~~~~~~~~----n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~ 422 (512) ++.++..|+.++.+|++.+..|++ ++||+||++++.+|.+||+++++.|+.+|++++++++++++.. ...++ T Consensus 302 ~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~~~----~~~~~ 377 (480) T protein:vir:78 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE----VTEEY 377 (480) T ss_pred HHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCC----ccccc Confidence 677777777766666555544443 3699999999999999999999999999999999999876432 23466 Q ss_pred ceeeEEeCCCCCcCHHHHHHHHHHH----hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCC Q lcl|NC_010808. 423 NTVRYVYNRNLPKSLIEELKAYIDS----GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDD 498 (512) Q Consensus 423 ~~i~i~f~~~~p~d~~~~~~~~~kl----~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 498 (512) ..++++|+++.|+|..+.+++++|+ .|++|++|+++++|+++|+.+|++++++++.+....+.........+...+ T Consensus 378 ~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (480) T protein:vir:78 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPK 457 (480) T ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccCCCccccC Confidence 7899999999999999999999987 247999999999999999999988887777665543333222211111101 Q ss_pred CCCCCCcCcccCCC Q lcl|NC_010808. 499 EQDDDTKDTVDKKE 512 (512) Q Consensus 499 ~~~~~~~~~~~~~e 512 (512) .......+..+.. T Consensus 458 -~~~~~~~~~~~~~ 470 (480) T protein:vir:78 458 -PTVTETKTETQTS 470 (480) T ss_pred -CCCCCCCCccCCC Confidence 0111111111111 No 44 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=100.00 E-value=1.5e-79 Score=452.78 Aligned_cols=447 Identities=12% Similarity=0.104 Sum_probs=330.8 Q ss_pred hhcHHH-HHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCCc Q lcl|NC_010808. 39 LQNINE-VSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDD 117 (512) Q Consensus 39 ~~~~~~-l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d 117 (512) .-...+ |..++.+| ..+++|++++.+||+|+|++.+.+.. .++...++|+++||+++||++.++|+++++++.. ++ T Consensus 1 ~~t~~~~i~~L~~~~-~~~~~r~~~l~~Yy~G~~~i~~~~~~-~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~-~d 77 (480) T protein:vir:78 1 MTTYHEHVERLQGLL-ARDLPNLLEAEAYRNGTRRLKTIGIG-APPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-ED 77 (480) T ss_pred CCCHHHHHHHHHHHH-HHHHHHHHHHHHHHhccccccccccc-cchhHhhhhhhcchHHHHHHHHHhhhccCceecC-CC Confidence 112233 44455545 66789999999999999987655443 3344457789999999999999999999998754 45 Q ss_pred hhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEE------CCCCceEEEEEccceeEEEEeCCCCceeEEEEEEe Q lcl|NC_010808. 118 KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR------NQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYL 191 (512) Q Consensus 118 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~------d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~ 191 (512) ++..+.|+++|+.|+++.++.++++++++||+||++||. |++|.+++.+++|.+++|+||+...+++.+++++| T Consensus 78 ~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~~~~~D~~~~~~~~~~i~~~ 157 (480) T protein:vir:78 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLY 157 (480) T ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEEcccceEEEEcCCCccceEEEEEEE Confidence 677889999999999999999999999999999999997 45788999999999999999998889999999998 Q ss_pred eeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCC-----CCCcchHH-HHHHH Q lcl|NC_010808. 192 RTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----RRKGDYEK-VITLI 265 (512) Q Consensus 192 ~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~-----~g~s~~~~-v~~li 265 (512) ...+ ....+.++++|+++.+++|....+...... ...+..+|+||.||||+|+|++ +|+|+|++ |++|+ T Consensus 158 ~~~~----~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~ 232 (480) T protein:vir:78 158 TTRD----DVAVPDRATLYLPDETVPLRRNGGLNDQWV-VDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) T ss_pred Eeec----CCCceEEEEEEeCCeEEEEEecCCCccccc-cccccccCCCCCcceEEeecccccCCccCcccchhhHHHHH Confidence 6443 233567889999999999887655432221 2345678999999999999874 68899985 99999 Q ss_pred HHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHH Q lcl|NC_010808. 266 DLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEA 345 (512) Q Consensus 266 Da~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 345 (512) |+||+++|++++.+++|++|+++++|........- ....++....+. +....++++++.+++. ...++ T Consensus 233 Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~---~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~--~~~~~ 300 (480) T protein:vir:78 233 DAASRTLMNLQSASQILGTPLRVISGVTTDELTND---GENTTLDIYYGR-------ILTLASEAAKISEFKA--AELRN 300 (480) T ss_pred HHHHHHHHHHHHHHHhhcchhhhhhcCCccccccc---cccchhhhhhhh-------hccCCCCCceEEecCc--cCHHH Confidence 99999999999999999999999999643332211 111111111111 1112345666766543 44666 Q ss_pred HHHHHHHHHHHHhcccccccccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccc Q lcl|NC_010808. 346 YKDRLNSDIHMFTNTPNMKDDNFSG----TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKD 421 (512) Q Consensus 346 ~~~~l~~~i~~~s~~p~~~~~~~~~----n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d 421 (512) ++++++..|+.++.+|++.+..+++ ++||+||++++.+|..||+++++.|+.+|++++++++++.+. ....+ T Consensus 301 ~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~g~----~~~~~ 376 (480) T protein:vir:78 301 FAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR----EVTEE 376 (480) T ss_pred HHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC----Ccccc Confidence 6777777777766655555444432 369999999999999999999999999999999999987653 22346 Q ss_pred cceeeEEeCCCCCcCHHHHHHHHHHH----hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCC Q lcl|NC_010808. 422 FNTVRYVYNRNLPKSLIEELKAYIDS----GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDIND 497 (512) Q Consensus 422 ~~~i~i~f~~~~p~d~~~~~~~~~kl----~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 497 (512) +..+++.|+++.++|..+.+++++|+ .|++|++|+++++|+++|+.++++++++|+.+..............+. T Consensus 377 ~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~-- 454 (480) T protein:vir:78 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADA-- 454 (480) T ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccccCCC-- Confidence 67899999999999999999999987 347999999999999998888888777776554432222221111111 Q ss_pred CCCCCCCcCcccCCC Q lcl|NC_010808. 498 DEQDDDTKDTVDKKE 512 (512) Q Consensus 498 ~~~~~~~~~~~~~~e 512 (512) ..+++.+++..+.+ T Consensus 455 -~~~~~~~~~~~~~~ 468 (480) T protein:vir:78 455 -TPKPTVTETKTETQ 468 (480) T ss_pred -CCCCCCCCCCCccc Confidence 11111222211111 No 45 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=100.00 E-value=1.2e-79 Score=453.27 Aligned_cols=455 Identities=12% Similarity=0.106 Sum_probs=337.0 Q ss_pred eeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhh Q lcl|NC_010808. 27 VVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF 106 (512) Q Consensus 27 ~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l 106 (512) ..+.-++.++... ....+..++.+| ..+++|++++.+||+|+|++.+.+...+. ...++|+++||+++||++.++|| T Consensus 1 ~~~~i~~~~~~~~-~~~~~~~L~~~~-~~~~~r~~~~~~YY~G~~~i~~~~~~~~~-~~~~~~~~~n~~~~ivd~~~~~l 77 (485) T protein:vir:24 1 MTAPLPGQEEIAD-PAIARDEMVSAF-EDQNQNLRSNTSYYEAERRPEAIGVTVPV-QMQSLLAHVGYPRLYVDSIAERQ 77 (485) T ss_pred CCCCCCCCCcccc-hHHHHHHHHHHH-HHHHHHHHHHHHHHhccCchhhcCcccch-hhhhhhhccchHHHHHHHHhhhh Confidence 3333333333221 123333455555 56679999999999999998665544443 34577889999999999999999 Q ss_pred hccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCC--------CceEEEEEccceeEEEEeC Q lcl|NC_010808. 107 LGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD--------DETRLYKSDAMSTFVIYDN 178 (512) Q Consensus 107 ~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~--------g~~~i~~~~p~~~~~i~d~ 178 (512) ++++++.. ++++.++.++++|+.|+|+.++.++++++++||+||++||.+++ |.+++++++|.+++++||+ T Consensus 78 ~~~g~~~~-~~~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~i~D~ 156 (485) T protein:vir:24 78 AVEGFRLG-DADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMYAEIDP 156 (485) T ss_pred ccCceecC-CCchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeEEEeeC Confidence 99998754 45566778999999999999999999999999999999999875 5578999999999999998 Q ss_pred CCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCC-----C Q lcl|NC_010808. 179 TIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----R 253 (512) Q Consensus 179 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~-----~ 253 (512) .+ .++.+++++|... ....+.++++|+++.+++|....+.+ ......+|+||.||||+|.|++ + T Consensus 157 ~~-~~~~~~~~~~~~~-----~~~~~~~~~~y~~~~~~~~~~~~~~~-----~~~~~~~h~~g~vPvv~f~n~~~~~~~~ 225 (485) T protein:vir:24 157 RI-GRPAKAIRVAYDA-----EGNEIQAATLYTPNETFGWFRAEGEW-----VEWFSDPHGLGAVPVVPLPNRTRLSDLY 225 (485) T ss_pred Cc-CceeEEEEEEEee-----cCCeEEEEEEEcCCcEEEEEecCCce-----EeecccccCCCcccEEEeccCcccCCcC Confidence 76 5677777766432 23456788999999999998766543 2345678999999999999874 6 Q ss_pred CCcchH-HHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCccee Q lcl|NC_010808. 254 RKGDYE-KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGG 332 (512) Q Consensus 254 g~s~~~-~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (512) |.|+++ .|++|||+||+++|++++.+++|++|+++++|........ .......++....+. .....+++++ T Consensus 226 G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~-~~~~~~~~~~~~~~~-------i~~~~~~~~~ 297 (485) T protein:vir:24 226 GTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGV-DPETGQTLFDAYLAR-------ILAFEDAEGK 297 (485) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCcccccc-ccccccchhhhcccc-------eeccCCCCce Confidence 889997 5999999999999999999999999999999964332211 111111111111111 1112234455 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhccccccccccccc----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 333 YIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGT----QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 408 (512) Q Consensus 333 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n----~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 408 (512) +. +.+.+.+++++++|+..|++++.+|++.+..|+++ +||+||++++.+|.+||+++++.|+++|+++++++++ T Consensus 298 ~~--q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~ 375 (485) T protein:vir:24 298 IQ--QFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYR 375 (485) T ss_pred EE--eecccchHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 54 34556788999999999999998888877776643 6999999999999999999999999999999999988 Q ss_pred HHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 409 ILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKA 484 (512) Q Consensus 409 ~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~----g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~ 484 (512) +.+.. ....+...|+++|+++.|+|..+.+++++|+. |++|+||+++++|+++|+.+|++++++|+.+..... T Consensus 376 ~~~~~---~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ee~~~~~~~~ 452 (485) T protein:vir:24 376 LMKGG---DVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGL 452 (485) T ss_pred HhcCC---CCccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHHHHHhhhhhhH Confidence 75532 23457789999999999999999999999983 589999999999999998899999988877654433 Q ss_pred HhhcccCCCCCCCCCCCCCCcCcccC------CC Q lcl|NC_010808. 485 QKGIYKDPRDINDDEQDDDTKDTVDK------KE 512 (512) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~------~e 512 (512) ............+. +++++..+. ++ T Consensus 453 ~~~~~~~~~~~~~~---~~~~e~~~~~~~~~~~~ 483 (485) T protein:vir:24 453 LGTMVDADPTVPGS---PNPTPAPKPQPAIEGGD 483 (485) T ss_pred HHhhcccCCCCCCC---CCCCCCCCCccCCCCCC Confidence 33332222211111 111111111 11 No 46 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=100.00 E-value=3.9e-79 Score=450.43 Aligned_cols=457 Identities=10% Similarity=0.088 Sum_probs=337.0 Q ss_pred cccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhcc Q lcl|NC_010808. 30 TYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGN 109 (512) Q Consensus 30 ~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~ 109 (512) .......+..-+.+++.+.+.+++..+.+|++++.+||+|+|++.+.+...++ ...+.++++||+++||+++++|++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~~~i~~~~~~~~~-~~~~~~~~~n~~~~ivd~~~~~l~~~ 79 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQDLGDNTAYYESERRPDAVGVTVPQ-QMQKLLAHVGYPRLYIDAIAARQELE 79 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccch-hHHhhhhhcCcHHHHHHHHHhhhccC Confidence 11122222222355666666666677788999999999999997655444333 33455678999999999999999999 Q ss_pred CceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCc--------eEEEEEccceeEEEEeCCCC Q lcl|NC_010808. 110 PIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDE--------TRLYKSDAMSTFVIYDNTIE 181 (512) Q Consensus 110 ~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~--------~~i~~~~p~~~~~i~d~~~~ 181 (512) +++.. ++++.++.++++|+.|+|+.++.++++++++||+||++||.+++|. ++|++++|.+++++||+. . T Consensus 80 g~~~~-~~~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~D~~-~ 157 (484) T protein:vir:77 80 GFRLG-GADKADEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNLYAQIDPR-T 157 (484) T ss_pred ceecC-CcchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEeccceeEEEecCC-C Confidence 98764 4456678899999999999999999999999999999999998874 568999999999999876 5 Q ss_pred ceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCC-----CCCc Q lcl|NC_010808. 182 RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----RRKG 256 (512) Q Consensus 182 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~-----~g~s 256 (512) +++.+++++|.... ...+.++++|+++.+++|....+.+. ..+..+|+||.||||+|.|+. +|+| T Consensus 158 ~~~~~a~~~~~~~~-----~~~~~~~~~y~~~~~~~~~~~~~~~~-----~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s 227 (484) T protein:vir:77 158 RQVMRAIRAIEDEE-----GNEVIGATLYLPNNTVIWNREDGQWV-----QVANVAHNLEMVPVIPIPNRTRLSDLYGTT 227 (484) T ss_pred CceEEEEEEEEeec-----CCcEEEEEEEecCeEEEEEecCCceE-----eeccccCCCCCcceEEeccccccCccCCcc Confidence 67999999886432 23467788999999988877665432 345678999999999999874 5899 Q ss_pred chH-HHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEe Q lcl|NC_010808. 257 DYE-KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIY 335 (512) Q Consensus 257 ~~~-~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 335 (512) +|+ .|++|+|+||+++|++++.++++++|+++++|....+. ......+...+.... + .. ....++++++.+ T Consensus 228 ~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~---~~~~~~~~~~~~~~~--~-~~--~~~~~~~~~~~q 299 (484) T protein:vir:77 228 EITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEEL---GVDPETGQTLFDAYL--A-RI--LAFEDHESKAQQ 299 (484) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchh---cccccccchhhhhhh--h-hh--cccCCCCceeEe Confidence 997 59999999999999999999999999999999644322 111222222221111 0 01 111233455543 Q ss_pred ecCCHHHHHHHHHHHHHHHHHHhccccccccccccc----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 336 KQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGT----QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILK 411 (512) Q Consensus 336 ~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n----~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~ 411 (512) .+....++++++|+..|+.++.+|++.+..|+++ +||+||++++.+|.+||+++++.|+++|++++++++++.+ T Consensus 300 --~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~ 377 (484) T protein:vir:77 300 --FSAAELRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVMN 377 (484) T ss_pred --ecCCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Confidence 4455677888888888888887776666555433 6999999999999999999999999999999999988754 Q ss_pred hccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_010808. 412 NTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKG 487 (512) Q Consensus 412 ~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~----g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~ 487 (512) . .....+...++++|+++.|+|.++.+++++|++ |++|++|+++++|+++++.+|++++++|+.......... T Consensus 378 ~---~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~~~ 454 (484) T protein:vir:77 378 G---GDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEEQAQGLGLMGT 454 (484) T ss_pred C---CCcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHhh Confidence 3 233456778999999999999999999999983 589999999999999999999999988876654333222 Q ss_pred cccC-CCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 488 IYKD-PRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 488 ~~~~-~~~~~~~~~~~~~~~~~~~~e 512 (512) .... +....+.+.+++.+...+-++ T Consensus 455 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (484) T protein:vir:77 455 MFGTDPSGGGNPDNPETPEPQPNPAE 480 (484) T ss_pred hccccccCCCCCCCCCcccccCCCcc Confidence 2111 111111111111111222222 No 47 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=100.00 E-value=1.4e-78 Score=447.41 Aligned_cols=468 Identities=10% Similarity=0.051 Sum_probs=334.1 Q ss_pred hhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccee-ee Q lcl|NC_010808. 13 LRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR-VA 91 (512) Q Consensus 13 ~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~r-i~ 91 (512) .+-.-+.+-+.- ...+..+.+..+.......+.+++..| ..+++|++++.+||+|+|++...+...+++.+..++ ++ T Consensus 1 ~~~~~~~~~~~~-~~~~~~p~~~~~~~~~~~l~~~l~~~~-~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v 78 (501) T protein:vir:25 1 MTVPVDVIADAP-AADVEFPEDSMSREQLGALVADMWRLH-ISERQWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSV 78 (501) T ss_pred CcccchhhhccC-cccccCCcccCChHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCchhccccCChhhhhhHhhhh Confidence 122222222222 122222222222222233345556555 467899999999999999987767666777776554 66 Q ss_pred cchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccce Q lcl|NC_010808. 92 HDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMS 171 (512) Q Consensus 92 ~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~ 171 (512) +|||++||+++++|++.++++ +++++..+.++++|+.|+|+..+.++++++++||+||++||.+++| +++++++|.+ T Consensus 79 ~n~~~~ivd~~a~~l~~~gf~--~~d~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~-~~i~~~sp~~ 155 (501) T protein:vir:25 79 KNVLSLVRDSFAQNLSVVGYR--NALAKENDPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEG-PVFRTRSPRQ 155 (501) T ss_pred cChHHHHHHHHHhhhccccee--cCCccchHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCC-CeEEEecccc Confidence 799999999999999988865 4555566779999999999999999999999999999999999888 5899999999 Q ss_pred eEEEEeC-CCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcc----------------ccccccccc Q lcl|NC_010808. 172 TFVIYDN-TIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNG----------------LKLTPRENG 234 (512) Q Consensus 172 ~~~i~d~-~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~----------------~~~~~~~~~ 234 (512) ++++|++ ..+.++++++++|..... .+...++++|++..+++|....... ......... T Consensus 156 ~~~iy~D~~~~~~~~~ai~~~~~~~~----~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (501) T protein:vir:25 156 ILAVYADPSVDAWPQYALETWVAQKD----AKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHG 231 (501) T ss_pred EEEEEecCCCCcceeEEEEEEeeccc----cCcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccc Confidence 9999965 445579999999875432 2344567889998887775432110 001111234 Q ss_pred cccccccccceEeecCC----CCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccc Q lcl|NC_010808. 235 FESHSFERMPITEFSNN----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLF 310 (512) Q Consensus 235 ~~~~~~~~vPvv~~~n~----~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~ 310 (512) ..+|+|+.||||+|+|+ ++|+|+|+++++|+|+||+++|++++.++++++|+++++|+...+.+.. ....++++. T Consensus 232 ~~~~~~~~vPiv~f~N~~~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~-~~~~~~i~~ 310 (501) T protein:vir:25 232 ATFEGKPVCPVVRFVNGRDADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVISGWTGSKAEVL-KASALRVWT 310 (501) T ss_pred cccCCccceeeEeccCccccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhCCCCCccchh-hhcccceec Confidence 56899999999999994 5689999999999999999999999999999999999999865443322 222222211 Q ss_pred cchhhhhhcccccCCCCCcceeEEeec-CCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHH Q lcl|NC_010808. 311 LEPTVYENRDTGIETEGSVDGGYIYKQ-YDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTK 389 (512) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~ 389 (512) ..++++++.+.+ .+.+.+..+++.+...|+..|++|+.+++.+++|+||+||++++.+|.+++. T Consensus 311 ---------------~~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~l~~ka~ 375 (501) T protein:vir:25 311 ---------------FEDPEVKAQAFPPASVEPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAEALAAAEANQQRKLA 375 (501) T ss_pred ---------------cCCCCceEEEecccChHHHHHHHHHHHHHHHhhcCCChhhhccccCChHHHHHHHHHHHHHHHHH Confidence 112334444443 4567788888899999999999999999988899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhcc-CChHHHHHhCCCCCCHHH Q lcl|NC_010808. 390 TKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGK-ISQTTLMSLFSFFQDPEL 468 (512) Q Consensus 390 ~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~-~s~et~~~~~~~v~d~~~ 468 (512) ++++.|+++|++++++++.+.+.. ...+..++++.|+++.|+|.++.+|+++|+.|+ +|.+|++.+++++++++ T Consensus 376 ~k~~~f~~~l~~~~rl~~~~~~~~----~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gis~et~~~~~~g~~~~~- 450 (501) T protein:vir:25 376 AKRESFGESWEQLLRLAAEMDDDP----DTAADSGAEVLWRDTEARSFGAVVDGITKLASAGIPIEHLLSMVPGMTQQT- 450 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCC----ccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCHHHHHHHcCCCCHHH- Confidence 999999999999999998876532 233456799999999999999999999999765 89999999999998654 Q ss_pred HHHHHHHHHHHHHH--HHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 469 EVKKIEEDEKESIK--KAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 469 E~~ri~~E~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ++++++++++... ........++.+. .+.+.++..+..+.++ T Consensus 451 -ie~~~~~~~e~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 494 (501) T protein:vir:25 451 -IQAIKDSLRGGEVKSLVDKLLSNEPAPV-PPPPPQAAAQALNEGG 494 (501) T ss_pred -HHHHHHHHHHHhHHHHHHHhhccCcCCC-CCCCCCCCcccccccc Confidence 4444444333222 1112222222111 1111111111111122 No 48 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=100.00 E-value=2.4e-78 Score=446.17 Aligned_cols=454 Identities=12% Similarity=0.108 Sum_probs=334.7 Q ss_pred cccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhcc Q lcl|NC_010808. 30 TYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGN 109 (512) Q Consensus 30 ~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~ 109 (512) +...... + ...+++.+..++..+++|++++.+||+|+|++.+.+...+.+. .++|+++||+++||+++++|++.+ T Consensus 1 ~~~~~~~---d-~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~-~~~~~~~n~~~~ivd~~a~~l~~~ 75 (488) T protein:vir:23 1 MAETESI---D-PEKLRDQLLDAFENKQNELKSSKAYYDAERRPDAIGLAVPLDM-RKYLAHVGYPRTYVDAIAERQELE 75 (488) T ss_pred CCcccCC---C-HHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhcCcccchhh-hhhhhhcchHHHHHHHHHHhhhcc Confidence 1111111 1 2234444445556778999999999999999876655544444 477899999999999999988877 Q ss_pred Cceec---------CCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEEC--------CCCceEEEEEcccee Q lcl|NC_010808. 110 PIQCQ---------DDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRN--------QDDETRLYKSDAMST 172 (512) Q Consensus 110 ~~~~~---------~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d--------~~g~~~i~~~~p~~~ 172 (512) ++.+. +++++..+.|+++|+.|+|+.++.++++++++||+||++||.+ +++.+++++++|.++ T Consensus 76 Gf~~~~~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~ 155 (488) T protein:vir:23 76 GFRIPSANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPLIRVEPPTAL 155 (488) T ss_pred ceeccCCcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcceEEEecccee Confidence 77553 3456778889999999999999999999999999999999874 456789999999999 Q ss_pred EEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCC Q lcl|NC_010808. 173 FVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE 252 (512) Q Consensus 173 ~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~ 252 (512) +++||+. .+++.+++++|... ....++++++|+++.+++|....+++. .....+|+||.||||+|.|++ T Consensus 156 ~~~~d~~-~~~~~~~~~~~~~~-----~~~~~~~~~~y~~~~~~~~~~~~~~~~-----~~~~~~h~~g~vPvv~f~n~~ 224 (488) T protein:vir:23 156 YAEVDPR-TRKVLYAIRAIYGA-----DGNEIVSATLYLPDTTMTWLRAEGEWE-----APTSTPHGLEMVPVIPISNRT 224 (488) T ss_pred EEEEecC-CCceEEEEEEEEec-----CCCcEEEEEEEecCcEEEEEecCCceE-----eccccccCCCCcceEEecccc Confidence 9999975 45788888887532 223467789999999999987665442 345678999999999999875 Q ss_pred -----CCCcchH-HHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCC Q lcl|NC_010808. 253 -----RRKGDYE-KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETE 326 (512) Q Consensus 253 -----~g~s~~~-~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (512) +|+|+++ .|++|+|+||+++|++++.++++++|+++++|....+... .......++....+ . ....+ T Consensus 225 ~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~-~~~~~~~~~~~~~~----~--v~~~~ 297 (488) T protein:vir:23 225 RLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGI-NAETGQRMFDAYMA----R--ILAFE 297 (488) T ss_pred ccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCcccccc-cccccchhhhhhhh----h--hccCC Confidence 5889997 5899999999999999999999999999999965333221 11111111111111 1 11123 Q ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 327 GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG----TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRR 402 (512) Q Consensus 327 ~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~----n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~ 402 (512) ++.++++.+. +....++++++|+..|+.++.+|++.+..+++ ++||+||++++.+|.+||+++++.|+.+|+++ T Consensus 298 ~g~~~~~~q~--~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~ 375 (488) T protein:vir:23 298 GGEGAHAEQF--SAAELRNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVERKNKIFGGAWEQA 375 (488) T ss_pred CCCCceeEec--CCCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4455666544 44567788888888888887777666555543 36999999999999999999999999999999 Q ss_pred HHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH----hccCChHHHHHhCCCCCCHHHHHHHHHHHHH Q lcl|NC_010808. 403 AKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS----GGKISQTTLMSLFSFFQDPELEVKKIEEDEK 478 (512) Q Consensus 403 ~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl----~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~ 478 (512) +++++++++.. ....++.+++++|+++.|+|..+.+++++|+ .|++|+||+++++|+++|+.+|+++++++++ T Consensus 376 ~~l~~~~~~~~---~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~~~~~~~~~~~~ 452 (488) T protein:vir:23 376 MRLAYKMVKGG---DIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVEREQMRQWLEQDQ 452 (488) T ss_pred HHHHHHHhcCC---CcchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHHHHHHHHHHHHH Confidence 99999876532 2345677899999999999999999999997 2479999999999999999999999877765 Q ss_pred HHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 479 ESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) +.............. ..+..++.++++..+.+. T Consensus 453 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~e~ 485 (488) T protein:vir:23 453 KQGLGLIGSLYGAST-PEGKPGEAPVGEPPAPEP 485 (488) T ss_pred HHHHHHHHHHhccCC-CcccCCCCCCCCCCCCCC Confidence 543333332222221 222223333333333333 No 49 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=100.00 E-value=3.3e-78 Score=445.39 Aligned_cols=458 Identities=11% Similarity=0.085 Sum_probs=333.9 Q ss_pred eeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhh Q lcl|NC_010808. 27 VVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF 106 (512) Q Consensus 27 ~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l 106 (512) ++--..+.. ..+....+++.+.+++..+++||+++.+||+|+|++.+.+...+...+ ++++++||+++||++.++|| T Consensus 1 ~~~~i~~~~--~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~-~~~~~~n~~~~ivd~~~~~l 77 (485) T protein:vir:10 1 MTAPLPGQE--EIEDPAIARDEMVSAFEDSTQNLKTNTSYYEAERRPEAIGVTVPIQMQ-SLLAHVGYPRLYVDSIAERQ 77 (485) T ss_pred CCCCCCCCC--CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCCCCChhhh-hhhhhcCcHHHHHHHHHhhh Confidence 221122211 123345566666667778889999999999999998766655555444 56778899999999999999 Q ss_pred hccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECC--------CCceEEEEEccceeEEEEeC Q lcl|NC_010808. 107 LGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ--------DDETRLYKSDAMSTFVIYDN 178 (512) Q Consensus 107 ~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~--------~g~~~i~~~~p~~~~~i~d~ 178 (512) ++++++.. ++++.++.++++|+.|+|+.++.++++++++||+||++||.++ ++.++|++++|.+++++||+ T Consensus 78 ~~~g~~~~-~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~~~D~ 156 (485) T protein:vir:10 78 AVEGFRFG-DADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPTRMYAEIDP 156 (485) T ss_pred cccceecC-CCchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEccceeEEEEcC Confidence 99988643 4556778899999999999999999999999999999999985 46788999999999999987 Q ss_pred CCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCC-----C Q lcl|NC_010808. 179 TIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----R 253 (512) Q Consensus 179 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~-----~ 253 (512) .. +++.+++++|... ......++++|+++.+++|....+++. .....+|+||.||||+|+|++ + T Consensus 157 ~~-~~~~~~~~~~~~~-----~~~~~~~~~~y~~~~~~~~~~~~~~~~-----~~~~~~~~~g~vPvv~~~n~~~~~~~~ 225 (485) T protein:vir:10 157 RI-GRVSKAIRVAYDA-----EGNEIQAATLYTPNDIFGWYRVENEWQ-----EWFNNPHGLGVVPVVPIPNRTRLSDLY 225 (485) T ss_pred CC-CceeEEEEEEEee-----CCCeEEEEEEEeCCeEEEEEEcCCceE-----EeccccCCCCcccEEEeccccccCCCC Confidence 65 4567777666421 234567789999999999987665542 345678999999999999974 4 Q ss_pred CCcchHH-HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCccee Q lcl|NC_010808. 254 RKGDYEK-VITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGG 332 (512) Q Consensus 254 g~s~~~~-v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (512) |.|+|++ |++|||+||+++|++++.+++|++|+++++|........ ........+....+ .....++++++ T Consensus 226 G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~-~~~~~~~~~~~~~~-------~i~~~~~~d~k 297 (485) T protein:vir:10 226 GTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGV-DPETGQTLFDAYLA-------RILAFEDAEGK 297 (485) T ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCcccccc-cccccchhhhhccc-------ceeccCCCCce Confidence 7899985 899999999999999999999999999999964332211 11111111111111 11112344566 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 333 YIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG----TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 408 (512) Q Consensus 333 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~----n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 408 (512) +.+. +....+.++++|+..|++++.+|++.+..|++ ++||+||++++.+|.+||+++++.|+.+|+++++++++ T Consensus 298 ~~q~--~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~ 375 (485) T protein:vir:10 298 IQQF--SAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYR 375 (485) T ss_pred EEee--cccchHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6544 44567788888888888887777665555543 37999999999999999999999999999999999988 Q ss_pred HHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 409 ILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKA 484 (512) Q Consensus 409 ~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~----g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~ 484 (512) +.+. .....+...++++|+++.|+|.++.+++++||. |++|+||+++++|+++++.+|++++++|+....... T Consensus 376 ~~~~---~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~ 452 (485) T protein:vir:10 376 MMKG---GDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGL 452 (485) T ss_pred HhCC---CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHH Confidence 6542 223456778999999999999999999999982 489999999999999888889999888776543333 Q ss_pred HhhcccCCCC---CCCCCCCCCCcCcccCCC Q lcl|NC_010808. 485 QKGIYKDPRD---INDDEQDDDTKDTVDKKE 512 (512) Q Consensus 485 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~e 512 (512) .......... +.+.++..+.....+.+. T Consensus 453 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 483 (485) T protein:vir:10 453 IGTMVDPNPTVPGSPSPAPAPKPAALESGGD 483 (485) T ss_pred HHHhhccCCCCCCCCCccccccCcCCCCCCC Confidence 2222221111 111111111111122222 No 50 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=100.00 E-value=2.9e-78 Score=445.64 Aligned_cols=440 Identities=14% Similarity=0.067 Sum_probs=325.4 Q ss_pred HhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc-ceeeecchHHHHHHHHHhhhhccCceecC Q lcl|NC_010808. 37 DLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMA-DNRVAHDYASYISDFINGYFLGNPIQCQD 115 (512) Q Consensus 37 ~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~-~~ri~~n~~~~iv~~~a~~l~g~~~~~~~ 115 (512) .+...+.++++.+..++..+++|++++++||+|+|++++.+...+++.+. ++|+++||+++||++.++|++|+|+++.+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~ 80 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 22333545445444555678899999999999999987666666666665 67899999999999999999999999976 Q ss_pred Cc-hhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeee Q lcl|NC_010808. 116 DD-KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTK 194 (512) Q Consensus 116 ~d-~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~ 194 (512) ++ .+..+.++++|+.|+++..+.++++++++||+||++||.+++|.+++++++|.+++++||+...+++.+++++|... T Consensus 81 ~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~ 160 (456) T protein:vir:10 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRAAMRWWRDL 160 (456) T ss_pred CCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCCcceEEEEEEEEec Confidence 54 45667899999999999999999999999999999999999999999999999999999999989999999998643 Q ss_pred eeccCCcceEEEEEEEcCCcEEEEEe-----cCCc-----cccccccccccccccccccceEeecCCCCCCcchHHHHHH Q lcl|NC_010808. 195 PIDKTDEDEVFTVDLFTSHGVYRYLT-----SRTN-----GLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITL 264 (512) Q Consensus 195 ~~~~~~~~~~~~~~~yt~~~~~~~~~-----~~~~-----~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~l 264 (512) + . ...+..+|.++.+..+.. .... .....+......+|+++.|||+++ +|++|.|+|+++++| T Consensus 161 d-----~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~N~~g~gd~e~vi~l 233 (456) T protein:vir:10 161 D-----A-ESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDI 233 (456) T ss_pred C-----C-ceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEe-cCCCCCchhhhhHHH Confidence 2 1 122333444443322211 0000 001112234556899999999887 567899999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHH Q lcl|NC_010808. 265 IDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTE 344 (512) Q Consensus 265 iDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 344 (512) ||+||+++|++++.++++++|+++++|.....+.. ...+..+................+++++++.+. +.+.+.+. T Consensus 234 iDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~---d~~g~~~~~~~~~~~~~~~~~~~~~~~~~~q~~-~~~~~~~~ 309 (456) T protein:vir:10 234 INRINRAELQLLSTMAIQAFRQRALKSTEHGLPNV---DENGNAIDYASIFEAAPGALWELPPGVDIWESQ-ANDFTPML 309 (456) T ss_pred HHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccc---cccccccchhhhhhhhccccccCCCCcceEEec-ccChhHHH Confidence 99999999999999999999999999964332110 001111111101111111122234566666554 45677788 Q ss_pred HHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccce Q lcl|NC_010808. 345 AYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNT 424 (512) Q Consensus 345 ~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~ 424 (512) ..++.+...|+..+++|+..++.+++|+||+||++++.+|.+||+.+++.|+++|++++++++++.+ ..+... T Consensus 310 ~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g-------~~~~~~ 382 (456) T protein:vir:10 310 SAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEG-------ESVEDT 382 (456) T ss_pred HHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-------CCcccc Confidence 8888888888899999998888888899999999999999999999999999999999999877532 223457 Q ss_pred eeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCC Q lcl|NC_010808. 425 VRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQ 500 (512) Q Consensus 425 i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 500 (512) +++.|+++.|+|.++.+|+++|+ +|++|.+++++++|++++ .++|++|+++|+.......... + +.+.+ T Consensus 383 ~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~----~-~~~~~-- 455 (456) T protein:vir:10 383 VDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQR----P-QEDGS-- 455 (456) T ss_pred eeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhc----C-CCCCC-- Confidence 89999999999999999999998 488999999999988654 3356777766654322111111 1 11111 Q ss_pred CC Q lcl|NC_010808. 501 DD 502 (512) Q Consensus 501 ~~ 502 (512) . T Consensus 456 -~ 456 (456) T protein:vir:10 456 -R 456 (456) T ss_pred -C Confidence 1 No 51 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=100.00 E-value=2.9e-78 Score=445.64 Aligned_cols=440 Identities=14% Similarity=0.067 Sum_probs=325.4 Q ss_pred HhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc-ceeeecchHHHHHHHHHhhhhccCceecC Q lcl|NC_010808. 37 DLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMA-DNRVAHDYASYISDFINGYFLGNPIQCQD 115 (512) Q Consensus 37 ~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~-~~ri~~n~~~~iv~~~a~~l~g~~~~~~~ 115 (512) .+...+.++++.+..++..+++|++++++||+|+|++++.+...+++.+. ++|+++||+++||++.++|++|+|+++.+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~ 80 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 22333545445444555678899999999999999987666666666665 67899999999999999999999999976 Q ss_pred Cc-hhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeee Q lcl|NC_010808. 116 DD-KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTK 194 (512) Q Consensus 116 ~d-~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~ 194 (512) ++ .+..+.++++|+.|+++..+.++++++++||+||++||.+++|.+++++++|.+++++||+...+++.+++++|... T Consensus 81 ~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~ 160 (456) T protein:vir:10 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRAAMRWWRDL 160 (456) T ss_pred CCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCCcceEEEEEEEEec Confidence 54 45667899999999999999999999999999999999999999999999999999999999989999999998643 Q ss_pred eeccCCcceEEEEEEEcCCcEEEEEe-----cCCc-----cccccccccccccccccccceEeecCCCCCCcchHHHHHH Q lcl|NC_010808. 195 PIDKTDEDEVFTVDLFTSHGVYRYLT-----SRTN-----GLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITL 264 (512) Q Consensus 195 ~~~~~~~~~~~~~~~yt~~~~~~~~~-----~~~~-----~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~l 264 (512) + . ...+..+|.++.+..+.. .... .....+......+|+++.|||+++ +|++|.|+|+++++| T Consensus 161 d-----~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~N~~g~gd~e~vi~l 233 (456) T protein:vir:10 161 D-----A-ESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDI 233 (456) T ss_pred C-----C-ceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEe-cCCCCCchhhhhHHH Confidence 2 1 122333444443322211 0000 001112234556899999999887 567899999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHH Q lcl|NC_010808. 265 IDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTE 344 (512) Q Consensus 265 iDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 344 (512) ||+||+++|++++.++++++|+++++|.....+.. ...+..+................+++++++.+. +.+.+.+. T Consensus 234 iDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~---d~~g~~~~~~~~~~~~~~~~~~~~~~~~~~q~~-~~~~~~~~ 309 (456) T protein:vir:10 234 INRINRAELQLLSTMAIQAFRQRALKSTEHGLPNV---DENGNAIDYASIFEAAPGALWELPPGVDIWESQ-ANDFTPML 309 (456) T ss_pred HHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccc---cccccccchhhhhhhhccccccCCCCcceEEec-ccChhHHH Confidence 99999999999999999999999999964332110 001111111101111111122234566666554 45677788 Q ss_pred HHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccce Q lcl|NC_010808. 345 AYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNT 424 (512) Q Consensus 345 ~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~ 424 (512) ..++.+...|+..+++|+..++.+++|+||+||++++.+|.+||+.+++.|+++|++++++++++.+ ..+... T Consensus 310 ~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g-------~~~~~~ 382 (456) T protein:vir:10 310 SAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEG-------ESVEDT 382 (456) T ss_pred HHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-------CCcccc Confidence 8888888888899999998888888899999999999999999999999999999999999877532 223457 Q ss_pred eeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCC Q lcl|NC_010808. 425 VRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQ 500 (512) Q Consensus 425 i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 500 (512) +++.|+++.|+|.++.+|+++|+ +|++|.+++++++|++++ .++|++|+++|+.......... + +.+.+ T Consensus 383 ~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~----~-~~~~~-- 455 (456) T protein:vir:10 383 VDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQR----P-QEDGS-- 455 (456) T ss_pred eeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhc----C-CCCCC-- Confidence 89999999999999999999998 488999999999988654 3356777766654322111111 1 11111 Q ss_pred CC Q lcl|NC_010808. 501 DD 502 (512) Q Consensus 501 ~~ 502 (512) . T Consensus 456 -~ 456 (456) T protein:vir:10 456 -R 456 (456) T ss_pred -C Confidence 1 No 52 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=100.00 E-value=8.9e-77 Score=437.53 Aligned_cols=440 Identities=13% Similarity=0.057 Sum_probs=327.0 Q ss_pred HhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccce-eeecchHHHHHHHHHhhhhccCceecC Q lcl|NC_010808. 37 DLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADN-RVAHDYASYISDFINGYFLGNPIQCQD 115 (512) Q Consensus 37 ~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~-ri~~n~~~~iv~~~a~~l~g~~~~~~~ 115 (512) .+.....++++.+.+++..+++|++++.+||+|+|++.+.+...+++.+..+ ++++||+++||++.++|++|+|+++++ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~ 80 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 1222344455555555677899999999999999998776666666676654 577899999999999999999999876 Q ss_pred Cc-hhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeee Q lcl|NC_010808. 116 DD-KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTK 194 (512) Q Consensus 116 ~d-~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~ 194 (512) .+ .+..+.++++|+.|+|+..+.++++++++||+||+++|.+++|.+++++++|.+++++||+....++.+++++|... T Consensus 81 ~~d~~~~~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~ 160 (456) T protein:vir:79 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDL 160 (456) T ss_pred CCCccHHHHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEeccceeEEEEcCCCCCceEEEEEEEEec Confidence 54 45677899999999999999999999999999999999999999999999999999999999889999999998643 Q ss_pred eeccCCcceEEEEEEEcCCcEEEEEecCC-----cc-----ccccccccccccccccccceEeecCCCCCCcchHHHHHH Q lcl|NC_010808. 195 PIDKTDEDEVFTVDLFTSHGVYRYLTSRT-----NG-----LKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITL 264 (512) Q Consensus 195 ~~~~~~~~~~~~~~~yt~~~~~~~~~~~~-----~~-----~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~l 264 (512) + ....+..+|+.+.++++..... .. ....+..+...+|+++.|||++| +|++|.|+|+++++| T Consensus 161 d------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~N~~~~gd~e~v~~l 233 (456) T protein:vir:79 161 D------AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDI 233 (456) T ss_pred C------CceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEEEe-cCCCCCchhhhhHHH Confidence 2 2234555777776655432110 00 00112234456899999999998 568899999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHH Q lcl|NC_010808. 265 IDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTE 344 (512) Q Consensus 265 iDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 344 (512) ||+||+++|++++.++++++|+++++|........ ...+..+................+++++++.+. +.+.+.+. T Consensus 234 iD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~---d~~g~~i~~~~~~~~~~~~~~~~~~~~~~~q~~-~~~~~~~~ 309 (456) T protein:vir:79 234 INRINRAELQLLSTMAIQAFRQRALKSSEHRLPKV---DENGNAIDYASIFEAAPGALWELPPGVDIWESQ-TNDFTPML 309 (456) T ss_pred HHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccc---cccccccchhhhhhhhccccccCCCCcceeeec-ccChHHHH Confidence 99999999999999999999999999964322110 011111111111111111112334556664433 45667777 Q ss_pred HHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccce Q lcl|NC_010808. 345 AYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNT 424 (512) Q Consensus 345 ~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~ 424 (512) ..++.+...|+..+++|...++.+++|+||+||++++.+|.+||+.+++.|+++|++++++++++.+. .+... T Consensus 310 ~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g~-------~~~~~ 382 (456) T protein:vir:79 310 SAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE-------SVEDT 382 (456) T ss_pred HHHHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-------Ccccc Confidence 77888888888888899888888888999999999999999999999999999999999998876431 23457 Q ss_pred eeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCC Q lcl|NC_010808. 425 VRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQ 500 (512) Q Consensus 425 i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 500 (512) ++++|+++.|+|.++.||+++|+ +|++|.+++++.++++.+ +++|++|+++|........ .+. . T Consensus 383 i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~~~i~~~e~~r~~~e~~~~~~~~----~~~--------~ 450 (456) T protein:vir:79 383 VDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNP----VQR--------P 450 (456) T ss_pred ceEEeCCCCCcCHHHHHHHHHHHHhcCCChHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhhhH----hhc--------C Confidence 89999999999999999999998 578999999999888654 3456666666644321111 111 0 Q ss_pred CCCCcC Q lcl|NC_010808. 501 DDDTKD 506 (512) Q Consensus 501 ~~~~~~ 506 (512) ++++.. T Consensus 451 ~~~~~~ 456 (456) T protein:vir:79 451 QEDGSR 456 (456) T ss_pred CCCCCC Confidence 111111 No 53 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=100.00 E-value=3.2e-76 Score=434.47 Aligned_cols=424 Identities=12% Similarity=0.085 Sum_probs=314.4 Q ss_pred hhhcH-HHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCC Q lcl|NC_010808. 38 LLQNI-NEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDD 116 (512) Q Consensus 38 ~~~~~-~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~ 116 (512) +.++. .+|.+++.+| ..+++|++++.+||+|+|++...+...++ ...++|+++|||++||++.++|+.+++++ ++ T Consensus 1 ~~~~~~~~i~~l~~~~-~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~-~~~~~k~~~n~~~~ivd~~~~~l~~~g~~--~~ 76 (441) T protein:vir:80 1 MNSDELALIEGMYDRI-QRLSSWHCCIEGYYEGSNRVRDLGVAIPP-ELQRVQTVVSWPGIAVDALEERLDWLGWT--NG 76 (441) T ss_pred CCccHHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCcchhcCcccch-hhhhhhhhcchHHHHHHHHHhhhcccccc--CC Confidence 22333 3344555544 56679999999999999997665544443 34578999999999999999999877765 33 Q ss_pred chhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeee Q lcl|NC_010808. 117 DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPI 196 (512) Q Consensus 117 d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~ 196 (512) ++ +.|+++|+.|+|+.++.+++++++++|+||++||.|++|.+++++++|.+++++||+.....+.++++++.. T Consensus 77 d~---~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~--- 150 (441) T protein:vir:80 77 DG---YGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPHGDGTVSVRPQSPKNCTGKFSADGSRLDAGLVVQQTC--- 150 (441) T ss_pred Ch---HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCceEEEEEccceEEEEEeCCCCceeEEEEEEEEe--- Confidence 32 458999999999999999999999999999999999999999999999999999998766555555555432 Q ss_pred ccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCC-----CCCcchH-HHHHHHHHHHH Q lcl|NC_010808. 197 DKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----RRKGDYE-KVITLIDLYDN 270 (512) Q Consensus 197 ~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~-----~g~s~~~-~v~~liDa~~~ 270 (512) .....++++|+++.+++|...+.+.+ ..++..+|+||+||||+|.|++ +|.|+|. .|++|||+||+ T Consensus 151 ----~~~~~~~~vy~~~~~~~~~~~~~~~~----~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~ 222 (441) T protein:vir:80 151 ----DPEVVEAELLLPDVIVQVERRGSREW----VEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVR 222 (441) T ss_pred ----cCceEEEEEEecCeEEEEEEcCCcce----eeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHH Confidence 12356788999999999877665543 3456789999999999999875 4788886 59999999999 Q ss_pred HHHHHHHHHHHhcCceeeeecCCcCChhhhh-hhhhccccccchhhhhhcccccCC-CCCcceeEEeecCCHHHHHHHHH Q lcl|NC_010808. 271 AESDTANYMSDLNDAMLLIKGNLSLDPDEVK-KQKEANVLFLEPTVYENRDTGIET-EGSVDGGYIYKQYDVQGTEAYKD 348 (512) Q Consensus 271 ~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~~~~~~~~~~ 348 (512) ++|++++.+++|++|+++++|....+..... .....+++. ... .++..+++.+ .+.+..+.+++ T Consensus 223 ~~s~~~~~~~~~~~~~~~i~G~~~~~~~~~~~~~~~~~i~~------------~~~~~~~~~~~~~~--~~~~~~~~~~~ 288 (441) T protein:vir:80 223 TLLGQSVNRDFYAYPQRWVTGVSADEFSQPGWVLSMASVWA------------VDKDDDGDTPNVGS--FPVNSPTPYSD 288 (441) T ss_pred HHHHHHHHHHhhcCceeeeecCCccccccchhhhccccccc------------CCCCCCCCcceeEe--cCccchHHHHH Confidence 9999999999999999999997544332211 111111111 111 1222244433 34456777788 Q ss_pred HHHHHHHHHhcccccccccccc---c-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccce Q lcl|NC_010808. 349 RLNSDIHMFTNTPNMKDDNFSG---T-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNT 424 (512) Q Consensus 349 ~l~~~i~~~s~~p~~~~~~~~~---n-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~ 424 (512) +|+..|+.++.+|++.+..+++ | +||+||++++.+|.++|+++++.|+++|++++++++++++..... ...+.+ T Consensus 289 ~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~--~~~~~~ 366 (441) T protein:vir:80 289 QMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDE--ADFFGD 366 (441) T ss_pred HHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc--ccccee Confidence 8877777777666665444432 3 599999999999999999999999999999999999987765432 334678 Q ss_pred eeEEeCCCCCcCHHHHHHHHHHH--hcc--CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCC Q lcl|NC_010808. 425 VRYVYNRNLPKSLIEELKAYIDS--GGK--ISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDD 498 (512) Q Consensus 425 i~i~f~~~~p~d~~~~~~~~~kl--~g~--~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 498 (512) ++++|+++.|+|.++.+++++|+ +|+ +|++++++++|++++ |++||++|+++..+.........+.+.+.. T Consensus 367 i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~---e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~ 441 (441) T protein:vir:80 367 VGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLEMLGLDDV---QVEAVMRHRAESSDPLAVLAGAISRQTNEV 441 (441) T ss_pred eeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhCCCCHH---HHHHHHHHHHHHHHHHHHHhhhhhcccccC Confidence 99999999999999999999998 343 588999999998754 455555555554443333322222222222 No 54 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=100.00 E-value=9.4e-75 Score=426.43 Aligned_cols=445 Identities=11% Similarity=0.061 Sum_probs=306.6 Q ss_pred eeecccchhHHhhhcHHH-HH-HHHHHHHHHHHHHHHHHHHHhcccccccccccccccc--cccceeeecchHHHHHHHH Q lcl|NC_010808. 27 VVYTYDGTESDLLQNINE-VS-KYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEE--YMADNRVAHDYASYISDFI 102 (512) Q Consensus 27 ~~~~~~~~~~~~~~~~~~-l~-~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~--~~~~~ri~~n~~~~iv~~~ 102 (512) +++ -+.+....+.+.. |. +++.+| ..+++|++++.+||+|+|+++........+ .+..+++++||+++||+++ T Consensus 1 ~~~--~p~~~l~~~~~~~~~~~~l~~~~-~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~ 77 (479) T protein:vir:99 1 MID--LPDEDLSSEGLAKYLETKVFPKM-NTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSF 77 (479) T ss_pred Ccc--CCcccCChhHHHHHHHHHHHHHH-HHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHH Confidence 221 1222223333322 33 344444 577899999999999999987655443322 1223456789999999999 Q ss_pred HhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEE-----CCCCceEEEEEccceeEEEEe Q lcl|NC_010808. 103 NGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR-----NQDDETRLYKSDAMSTFVIYD 177 (512) Q Consensus 103 a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~-----d~~g~~~i~~~~p~~~~~i~d 177 (512) ++|+++++++ +.+.+..+.++++|+.|+|+..+.++++++++||+||++||+ |++|.+++++++|.+++++|+ T Consensus 78 ~~~l~~~gf~--~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~~i~~~~p~~~~~iyd 155 (479) T protein:vir:99 78 AQQLIVDGYR--KTGTNENAKGWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVARIKCIDPRDAFAIWE 155 (479) T ss_pred Hhhccccccc--CCCchhhHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCceEEEEechhheEEEec Confidence 9999888765 456666778999999999999999999999999999999996 567899999999999999998 Q ss_pred CCCCce-eEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCC----C Q lcl|NC_010808. 178 NTIERN-SIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNN----E 252 (512) Q Consensus 178 ~~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~----~ 252 (512) ++.... +++++++. ....+.+|+.+.++.|....+.+ ...+..+|+||+||||+|.|+ + T Consensus 156 d~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~h~~g~vPvv~f~n~~~~~~ 219 (479) T protein:vir:99 156 DPYWDEWPKYLLERQ-----------PNGQYWWWTEEDYSIFEFKQGKF-----IYRETVSHDYGHIPFVRYVNVMDLRG 219 (479) T ss_pred CCcccceeeEEEeec-----------CceeEEEEecceEEEEEecCCce-----eeccccccCCCCcceEEeecCCCcCc Confidence 765443 22222221 11245678888777776654433 345678999999999999998 5 Q ss_pred CCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCccee Q lcl|NC_010808. 253 RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGG 332 (512) Q Consensus 253 ~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (512) +|+|+|+++++|||+||+++|++++.+++|++|+++++|....+.+.....+.. ....+.. ...+.+++ T Consensus 220 ~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~--------~~~~~i~---~~~~~~~~ 288 (479) T protein:vir:99 220 VCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGLMLPEGANADQEKMR--------FAQESML---ISQNEKAS 288 (479) T ss_pred CCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCCCcccccccchhccc--------cccccce---eecCCCce Confidence 799999999999999999999999999999999999999765443332211100 0000111 11233455 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHH---hcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 333 YIYKQYDVQGTEAYKDRLNSDIHMF---TNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETI 409 (512) Q Consensus 333 ~l~~~~~~~~~~~~~~~l~~~i~~~---s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~ 409 (512) +.+.+ ....++++++|+..|+.+ +++|...++ +.+|+||+||++++.+|.++|+.+++.|+.+|++++++++++ T Consensus 289 ~~q~~--~~~~~~~~~~l~~~i~~i~~~t~~p~~~~g-~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~ 365 (479) T protein:vir:99 289 FGAIP--AAPLDGLLNAYKESLLEFLALAQLPPHIAG-QIVNVAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKI 365 (479) T ss_pred EEEec--ccchHHHHHHHHHHHHHHhccCCCCHHHcc-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 54443 344556666655555555 555655544 357899999999999999999999999999999999999887 Q ss_pred HHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHH--HHH Q lcl|NC_010808. 410 LKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIK--KAQ 485 (512) Q Consensus 410 l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~--~~~ 485 (512) .+... ..+..+++++|.++.++|..+.+++++|| +|++|+||++++++++++++ ++++++++++... ... T Consensus 366 ~~~~~----~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~is~et~l~~l~gv~~~~--~e~~~~~~~~~~~~~~~~ 439 (479) T protein:vir:99 366 EGRTE----EATDLDFTITWQDVTIQSLAQFADAWAKMVESLKIPAEGVWDMIPNLDQST--VNGWKEIYDREGDFGKYM 439 (479) T ss_pred cCCCc----cccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHH--HHHHHHHHHHHHHHHHHH Confidence 64322 23445789999999999999999999998 58999999999999998755 3444444333221 112 Q ss_pred hhcccCCCCCCCCCC----CCCCcCcccCCC Q lcl|NC_010808. 486 KGIYKDPRDINDDEQ----DDDTKDTVDKKE 512 (512) Q Consensus 486 ~~~~~~~~~~~~~~~----~~~~~~~~~~~e 512 (512) ......+.+....+. .+..+.+...++ T Consensus 440 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (479) T protein:vir:99 440 RKLQNGPDPAEQRGGPNGATNMQQANNKTGE 470 (479) T ss_pred HHHhcccCcccccCCCCCCCCCCCCCCCCcc Confidence 222222211111111 111111111222 No 55 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=100.00 E-value=2.7e-71 Score=407.44 Aligned_cols=468 Identities=10% Similarity=0.027 Sum_probs=315.0 Q ss_pred cccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHH Q lcl|NC_010808. 17 RNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYAS 96 (512) Q Consensus 17 ~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~ 96 (512) -..++.-...-++.-.+... +...+|..++..| ..+++|++++.+||+|+|.+.+.+...+++.+ ++++++||++ T Consensus 1 ~~~~~~~~~~~~~~~~~l~~---~e~~~i~~L~~~~-~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~-~~~~v~n~~~ 75 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELND---DVVDKVNGLYQQL-VDRTPRNLLRASFYDGKYAIRQIGNLIPPEYL-RTATVLGWSA 75 (504) T ss_pred CCccCCcccccccccCCCCH---HHHHHHHHHHHHH-HHHhHHHHHHHHHHhccccchhccccccHHHH-HHhhccCcHH Confidence 01111111111111111111 1123344555545 55679999999999999998766655555544 6678899999 Q ss_pred HHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCc--eEEEEEccceeEE Q lcl|NC_010808. 97 YISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDE--TRLYKSDAMSTFV 174 (512) Q Consensus 97 ~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~--~~i~~~~p~~~~~ 174 (512) +||+++++++..+++... ++++.+..|++||+.|+|+....++++++++|||||++||.+++|+ +.|+++||.++|+ T Consensus 76 ~iVd~~a~rl~~~Gf~~~-d~~~~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~sP~~~~~ 154 (504) T protein:vir:99 76 KAVDTLARRCNLESFVWP-DGDYGSIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKSAMQATG 154 (504) T ss_pred HHHHHHHhhhccceeeCC-CCChhhHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceeEE Confidence 999999999999998754 3445567799999999999999999999999999999999999886 5688999999999 Q ss_pred EEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCC--- Q lcl|NC_010808. 175 IYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNN--- 251 (512) Q Consensus 175 i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~--- 251 (512) +||+. .+++.+++++|... .......+++|+++.++++.....+.+ ..+..+|++| ||||+|.|+ T Consensus 155 iyD~~-~~~~~~a~~~~~~d-----~~g~~~~~~~y~~~~~~~~~~~~~~~~-----~~~~~~~~~g-vPvV~~~n~~~~ 222 (504) T protein:vir:99 155 EWNSR-RNAMDSLLSITSRD-----AEGHPTGIALYEDGVTVTADMDDDGDW-----HADVRTHKLG-VPVEVLPYKPRE 222 (504) T ss_pred EEeCC-CCceeEEEEEEEec-----CCCeEEEEEEEcCCcEEEEEEcCCcee-----eeccccCCCC-cceEEecccccC Confidence 99975 46778888776532 123456788999999998877655433 2456789998 899999987 Q ss_pred --CCCCcchH-HHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCC--- Q lcl|NC_010808. 252 --ERRKGDYE-KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIET--- 325 (512) Q Consensus 252 --~~g~s~~~-~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 325 (512) ++|.|++. +|++|+|++|++++++++..++|++|+++++|....+......... ..+. ...+.....+. T Consensus 223 ~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~~~-~~~~----~~~~~i~~~~~~~~ 297 (504) T protein:vir:99 223 DRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGADAKNFRNKDGSMK-PAWQ----IALARVFALPDDED 297 (504) T ss_pred ccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccCCcccccccccccc-chhh----hhhhhhhcCCCccc Confidence 36888885 8999999999999999999999999999999976543221110000 0000 00011111111 Q ss_pred ---CCCcceeEEee-cCCHHHHHHHHHHHHHHHHHHhcccccccccc--cccchHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 326 ---EGSVDGGYIYK-QYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNF--SGTQSGEAMKYKLFGLEQRTKTKEGLFTKGL 399 (512) Q Consensus 326 ---~~~~~~~~l~~-~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l 399 (512) ..+.++++-+. +.+.+.+...++.+..+|+..|++|..+++.. .+|+||+||++++.+|.+++.++++.|+.+| T Consensus 298 ~~~~~~~~~~~~q~~~~~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l 377 (504) T protein:vir:99 298 EPDAARARADVKQFPASSPQPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAF 377 (504) T ss_pred cccccCccceeeecCCCChHHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11223433322 22344444555555555566678887776533 4678999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhc--c---CChHHHHHhCCCCCCHHHHHHHHH Q lcl|NC_010808. 400 RRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGG--K---ISQTTLMSLFSFFQDPELEVKKIE 474 (512) Q Consensus 400 ~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g--~---~s~et~~~~~~~v~d~~~E~~ri~ 474 (512) ++++++++++...... ...++..+++.|+++.+++.++.+|+++|+.+ . .+.+++++++|+. ++|++|++ T Consensus 378 ~~~~rla~~~~~~~~~--~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~---~~ei~r~~ 452 (504) T protein:vir:99 378 RRSMIRALAIKNGLDR--IPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLT---PQQAKRAL 452 (504) T ss_pred HHHHHHHHHHhcCCCc--cccccccceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcCCC---HHHHHHHH Confidence 9999999887664432 34456789999999999999999999999844 2 2358899999874 34666766 Q ss_pred HHHHHHHHHH--HhhcccCCCC-CCCCCCCCCCcCcccCCC Q lcl|NC_010808. 475 EDEKESIKKA--QKGIYKDPRD-INDDEQDDDTKDTVDKKE 512 (512) Q Consensus 475 ~E~~~~~~~~--~~~~~~~~~~-~~~~~~~~~~~~~~~~~e 512 (512) +|+++..... ......+... ..++.+++...+....+- T Consensus 453 ~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~e~a~~~~ 493 (504) T protein:vir:99 453 AERRRASSVSIIEALNRRQQEAATAGEDQDQGAGEPPANEP 493 (504) T ss_pred HHHHHHhhHHHHHHHhcccCCCCCCCCCCCcCCCCCCCCCC Confidence 6654332211 1111111111 111111111111111111 No 56 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=100.00 E-value=3.6e-69 Score=395.85 Aligned_cols=409 Identities=10% Similarity=0.071 Sum_probs=287.0 Q ss_pred cccccccccccccee-eecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEE Q lcl|NC_010808. 75 VELTRRKEEYMADNR-VAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEL 153 (512) Q Consensus 75 ~~~~~~~~~~~~~~r-i~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~ 153 (512) +.+...++..+..+| +++|||++||+++++++.+++++ +.|.+.++.++++|+.|+|+..+.++++++++||+||++ T Consensus 1 ~l~~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~--~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~ 78 (434) T protein:vir:98 1 MLPKNAEQAFLDFQRKARTNFCGLIANASVHRLLALGVT--GPDGEPDTRASRWWQANRLDSRQKLVWRMAMAQSAGYML 78 (434) T ss_pred CCCCCccHHHHHhhhhhhccchHHHHHHHHhhhccCcee--cCCCchHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEE Confidence 345555667777665 46899999999999999998865 566677888999999999999999999999999999999 Q ss_pred EEECCCC-------ceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEE--ecCCc Q lcl|NC_010808. 154 MIRNQDD-------ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYL--TSRTN 224 (512) Q Consensus 154 v~~d~~g-------~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~--~~~~~ 224 (512) ||.++++ .+.|++++|.+++++||+.. +++.+++++|.... + .. .+..+|+.+.++.|. ..... T Consensus 79 v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~-~~~~~ai~~~~~~~-~---~~--~~~~~~~~~~~~~~~~~~~~~~ 151 (434) T protein:vir:98 79 VGAHPTRTEDNGRPSPLITMEHPSECIVEYDPET-GEPLVGLKVWHNDI-D---GF--GYARVFFDDTSFPYRTRERTGA 151 (434) T ss_pred EecCCCcccccCCceeEEEEeccceeEEEEeCCC-CceEEEEEEEEecc-C---Cc--eEEEEEEeCcEEEEEEeecccc Confidence 9987654 46789999999999999775 46999999886432 1 11 223344444433332 22211 Q ss_pred cc-------cccccccccccccccccceEeecCC----CCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC Q lcl|NC_010808. 225 GL-------KLTPRENGFESHSFERMPITEFSNN----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNL 293 (512) Q Consensus 225 ~~-------~~~~~~~~~~~~~~~~vPvv~~~n~----~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~ 293 (512) .. ..........+|+||+||||+|.|+ .+|.|+|+++++|||+||+++|++++.+++|++|+++++|.. T Consensus 152 ~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~ 231 (434) T protein:vir:98 152 RLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGEDPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHK 231 (434) T ss_pred ccccccccceecccccccccCCCCccceEEeccCCCcCcCCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCC Confidence 11 1112234567899999999999998 678999999999999999999999999999999999999975 Q ss_pred cCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHh---cccccccccccc Q lcl|NC_010808. 294 SLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFT---NTPNMKDDNFSG 370 (512) Q Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s---~~p~~~~~~~~~ 370 (512) ..+..+... +.+.... ........+...+++++++.+. +....++++++|+..|+.++ ++|...++...+ T Consensus 232 ~~~~~~~~~----~~~~~~~-~~~~~~~~i~~~~~~~~~~~q~--~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~~ 304 (434) T protein:vir:98 232 FAKRTDPAT----GMTVVDQ-PFVPSPSAVWASEGENTQFGQL--DATDLSGFLKEHASDVRDMLTISQTPTYLYATDLV 304 (434) T ss_pred ccccccccc----ccchhhh-hhhccccccccCCCCCceEEEe--cCcchHHHHHHHHHHHHHHhcccCCCHHHhccccC Confidence 444332111 1110000 0000111111223445555443 33445556565555555555 555545544446 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhcc Q lcl|NC_010808. 371 TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGK 450 (512) Q Consensus 371 n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~ 450 (512) |+||+||++++.+|.+||+++++.|+++|++++++++++.+. ..+..++++.|+++.|+|.++++|+++||.|+ T Consensus 305 n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~------~~~~~~~~v~w~~~~~~s~~~~ada~~kl~~~ 378 (434) T protein:vir:98 305 NISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQAGV------PEDYTEAEVRWANPAHVTMAVKADAATKLKSI 378 (434) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC------ChhheeeeEEecCCCCCCHHHHHHHHHHHHhc Confidence 899999999999999999999999999999999998876321 23556799999999999999999999999775 Q ss_pred -CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCC Q lcl|NC_010808. 451 -ISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 511 (512) Q Consensus 451 -~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (512) +|.+++++++|+. ++|++|+++|+++............+.+ .++..++.++..+| T Consensus 379 g~~~e~~~~~lg~~---~~e~~r~~~e~~~~~~~~~~~~~~~~~~---~~g~~~~~~~~~dg 434 (434) T protein:vir:98 379 GYPLDVIAEELDES---PARVRRIVAGAASQALLAASLLPAPGAP---SAGNVPDSGGAVDG 434 (434) T ss_pred CCcHHHHHHhCCCC---HHHHHHHHHHHHHHHHHHHhhhccCCCC---CCCCCCcccCCCCC Confidence 8999999999985 3578888887665443333222221111 11112222233333 No 57 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=100.00 E-value=1.9e-67 Score=386.41 Aligned_cols=453 Identities=11% Similarity=0.046 Sum_probs=311.1 Q ss_pred ccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHH Q lcl|NC_010808. 20 LFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYIS 99 (512) Q Consensus 20 ~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv 99 (512) +-+...-. -.+... ....++..+.+.+..+++|++++.+||+|+|++.+.+...++..+ +.++++||++++| T Consensus 1 ~~~~~~~~---~~gl~~----~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r-~~~~v~nw~~~~V 72 (474) T protein:vir:81 1 MIQQQTVR---IPSLSN----DENALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYF-NLGLVLGWTGKAV 72 (474) T ss_pred CcCCCcCc---CCCCCh----hHHHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHH-HHHhhcChHHHHH Confidence 11221111 112111 122233333334466678999999999999998777766666665 5678899999999 Q ss_pred HHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCc--eEEEEEccceeEEEEe Q lcl|NC_010808. 100 DFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDE--TRLYKSDAMSTFVIYD 177 (512) Q Consensus 100 ~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~--~~i~~~~p~~~~~i~d 177 (512) +++++++..++++.. +++.....++++|+.|+|+....++++++++|||||++|+.+++|+ +.+++++|.+++++|| T Consensus 73 d~~a~rl~~~Gf~~~-d~~~~~~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~~~~D 151 (474) T protein:vir:81 73 DALARRCNLEGFVWP-DGDLDSLGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEATGEWN 151 (474) T ss_pred HHHHhhhcccceECC-CCCccchHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEEEEEe Confidence 999999999999854 3334456789999999999999999999999999999999987765 7789999999999998 Q ss_pred CCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCC----- Q lcl|NC_010808. 178 NTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE----- 252 (512) Q Consensus 178 ~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~----- 252 (512) +.. +++.++++++... .........+|+++.++++....+.+. ...+..+|++| ||||+|.|++ T Consensus 152 ~~~-~~~~~al~~~~~~-----~~g~~~~~~ly~~~~~~~~~~~~~~~~----w~~~~~~~~~g-vPvV~~~n~~~~~~~ 220 (474) T protein:vir:81 152 RRR-RGLNNLLSIIDKD-----KEGKVLSLALYLDNETVTAQRDKATLK----WQVDRDEHVYG-VPAQVLPYKPAPKRP 220 (474) T ss_pred CCC-CcceeeeEEEEEc-----CCCcEEEEEEEeCCcEEEEEEcCccce----eeeccCCCCCC-cceEEecccccccCc Confidence 864 5677777665432 122345667999999988876554321 12456789998 7999999864 Q ss_pred CCCcch-HHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCc-- Q lcl|NC_010808. 253 RRKGDY-EKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSV-- 329 (512) Q Consensus 253 ~g~s~~-~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 329 (512) +|+|++ +++++|+|++|++++++.+..+++++|+++++|....+..+....+. ..+. ...........++++ T Consensus 221 ~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~-~~~~----~~~~~i~~~~~d~d~~~ 295 (474) T protein:vir:81 221 FGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESALKNADGTIK-SVWE----ARLGRIKGLPDDADADI 295 (474) T ss_pred CCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhccccccccc-chhh----hhHHHHhcCCCcccccc Confidence 688887 69999999999999999999999999999999976544322111110 0111 111111111111111 Q ss_pred ----ceeEEe-ecCCHHHHHHHHHHHHHHHHHHhccccccccc--ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 330 ----DGGYIY-KQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN--FSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRR 402 (512) Q Consensus 330 ----~~~~l~-~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~ 402 (512) .+++-+ .+.+.+.+...++.+..++...|++|..+++. +.+++||+||++++.+|..|++++++.|+.+|+++ T Consensus 296 ~~~~~~~~~q~~~a~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~ 375 (474) T protein:vir:81 296 PQLARADVKQFPAASPDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKA 375 (474) T ss_pred cccccccccccCCCChhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 122221 12344444455555555666667888877763 35668999999999999999999999999999999 Q ss_pred HHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHH Q lcl|NC_010808. 403 AKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEK 478 (512) Q Consensus 403 ~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~----g~~s~et~~~~~~~v~d~~~E~~ri~~E~~ 478 (512) +++++++.+.........++..+++.|.++..++.++.+|+++|+. |+.+.+++++++++. ++++++++.+++ T Consensus 376 ~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t---~~~i~~~~~~~~ 452 (474) T protein:vir:81 376 FIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLT---PQQARRAMADKR 452 (474) T ss_pred HHHHHHHhCCCCccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCC---HHHHHHHHHHHH Confidence 9999987655443344455678999999999999999999999984 356667888887765 346666665543 Q ss_pred HHHHHHHhhcccCCCCCCCCCC Q lcl|NC_010808. 479 ESIKKAQKGIYKDPRDINDDEQ 500 (512) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~ 500 (512) +......-............++ T Consensus 453 ~~~~~~~~~~l~~~~~~~~~aq 474 (474) T protein:vir:81 453 RVQGRGTLQALIDRSNNGATAQ 474 (474) T ss_pred HHhHHHHHHHHHhcCCCCCCCC Confidence 3222111111111111111111 No 58 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=100.00 E-value=8.6e-68 Score=388.26 Aligned_cols=394 Identities=11% Similarity=0.012 Sum_probs=301.8 Q ss_pred HHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccC Q lcl|NC_010808. 53 MDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLND 132 (512) Q Consensus 53 ~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~ 132 (512) ++..++|++++.+||+|+|++.+.+...+++.+.++|+++||+++||+++++++..++++ .+|+ .+++||+.|+ T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~--~~d~----~l~~i~~~N~ 74 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEAPTGITIPAHIRAKYQAVLGWAAKGVDSLADRLIFRAFA--NDDF----NVTEIFDRNN 74 (410) T ss_pred CCcchhhHHHHHHHhcCCCCccccchhccHHHHhHHHhhcchhHHHHHHhHhhhcccccc--CCCc----hHHHHHhhcC Confidence 344478899999999999998777777777888888999999999999999999999975 3443 3889999999 Q ss_pred hhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcC Q lcl|NC_010808. 133 VESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTS 212 (512) Q Consensus 133 ~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~ 212 (512) |+..+.++++++++|||||++||.+++|.++|+++||.+++++||+ ..+++.++++++... .......+.+|++ T Consensus 75 ld~~~~~~~~~al~~G~sf~~v~~~~d~~~~i~~~sP~~~~~i~Dp-~~~~~~~al~~~~~~-----~~~~~~~~~~~~~ 148 (410) T protein:vir:95 75 PDIFFDSAILSALIGSCSFVYISKGEDDEVRLQVIESSNATGVIDP-ITGLLVEGYAVLARD-----DYNRPTLEAYFEP 148 (410) T ss_pred hHHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEEeC-CCCceEEEEEEEEec-----CCCeEEEEEEEeC Confidence 9999999999999999999999999999999999999999999987 567899999876432 2234567789999 Q ss_pred CcEEEEEecCCccccccccccccccccccccceEeecCCC-----CCCcch-HHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_010808. 213 HGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----RRKGDY-EKVITLIDLYDNAESDTANYMSDLNDAM 286 (512) Q Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~-----~g~s~~-~~v~~liDa~~~~~s~~~~~~~~~~~~~ 286 (512) +.++++...+..+ ..+|++|.||||+|.|++ +|+|++ ++|++|+|++|++++++.+..++|++|+ T Consensus 149 ~~~~~~~~~~~~~---------~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pq 219 (410) T protein:vir:95 149 NATHFIPKDGEPY---------SVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQ 219 (410) T ss_pred CcEEEEeeCCccc---------cccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchh Confidence 9999887754432 357999999999999853 588887 6899999999999999999999999999 Q ss_pred eeeecCCcCChhhhhhhhhccccccchhhhhhcccccCC-CCCcceeEEe-ecCCHHHHHHHHHHHHHHHHHHhcccccc Q lcl|NC_010808. 287 LLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIET-EGSVDGGYIY-KQYDVQGTEAYKDRLNSDIHMFTNTPNMK 364 (512) Q Consensus 287 lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~-~~~~~~~~~~~~~~l~~~i~~~s~~p~~~ 364 (512) ++++|...... ... .+. ...+.....+. .++..+++-+ .+.+.+.+.+.++.+..+++..|++|... T Consensus 220 r~i~G~d~d~~-~~~------~~~----~~~~~i~~~~~~~~~~~~~v~q~~~~~l~~~~~~l~~l~~~~a~~s~lP~~~ 288 (410) T protein:vir:95 220 KYILGLDPDAE-PME------KWK----ATVSSLLTISSSDKGVKPSVGQFTTASMSPFTEQLRTAAAGFAGEMGLTLDD 288 (410) T ss_pred heeeccCCCCC-cCc------hhh----hhhhhheeccCCCCCCcceEEecCCCChHHHHHHHHHHHHHHhhhcCCCHHH Confidence 99999732111 110 011 11111111111 1222233322 23355566666666666777777888877 Q ss_pred ccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeC---CCCCcCHHHH Q lcl|NC_010808. 365 DDNFSGT-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYN---RNLPKSLIEE 440 (512) Q Consensus 365 ~~~~~~n-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~---~~~p~d~~~~ 440 (512) ++..+.| +||+||++++.+|..|++++++.|+.+|++++++++++...... .+.++..+++.|. ++..++.++. T Consensus 289 lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~--~~~~~~~~~v~W~p~~d~~~~s~a~~ 366 (410) T protein:vir:95 289 LGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRY--TRSQFVRTAVKWEPLFEADANTMTMI 366 (410) T ss_pred hccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--cccccceeeEEeeecCCcchhhHHHH Confidence 7755555 69999999999999999999999999999999999887654432 2345667899998 4555689999 Q ss_pred HHHHHHH--h--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 441 LKAYIDS--G--GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIK 482 (512) Q Consensus 441 ~~~~~kl--~--g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~ 482 (512) +|+++|+ + |+++.+++++++|++++ ++..++.+|++..-+ T Consensus 367 aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~--~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 367 GDGVVKLNQALPGYINAETIRDLTGIAGD--MSAKPVVSEGGSNGE 410 (410) T ss_pred HHHHHHHHHhccCCccHHHHHHhcCCChH--HHHHHHHHHHHhCCC Confidence 9999998 2 68899999999999654 233333333322211 No 59 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=100.00 E-value=2.4e-67 Score=385.78 Aligned_cols=404 Identities=9% Similarity=-0.005 Sum_probs=301.1 Q ss_pred HHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecC Q lcl|NC_010808. 36 SDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQD 115 (512) Q Consensus 36 ~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~ 115 (512) .+ ...++.|. ++...+++|++++.+||+|+|++.+.+...+++.+..+|+++||++++|+++++++..++++ + T Consensus 1 m~-~~~i~~L~----~~~~~~~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~~Gf~--~ 73 (422) T protein:vir:97 1 MN-YMGMGYLR----RKLALFKTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRIIFREFT--N 73 (422) T ss_pred CC-hHHHHHHH----HHHHHHHHHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhccccceee--C Confidence 11 11233443 44456678999999999999998877777778888888888899999999999999999875 3 Q ss_pred CchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECC-CCceEEEEEccceeEEEEeCCCCceeEEEEEEeeee Q lcl|NC_010808. 116 DDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ-DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTK 194 (512) Q Consensus 116 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~-~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~ 194 (512) +|. .++++|+.|+|+..+.+++++|++|||||++|+.++ +|.++++++||.+++++||+.. +++.+++++|... T Consensus 74 ~d~----~l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p~i~~~sp~~~~~i~D~~~-~~~~~a~~~~~~~ 148 (422) T protein:vir:97 74 DDF----NAWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAEDGLPKMQVIEASKATGILDPTT-FLLTEGYAILESD 148 (422) T ss_pred Cch----hHHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCCCeeEEEEechhhEEEEEeCCC-CcceeeEEEEEec Confidence 443 378999999999999999999999999999999986 6889999999999999998764 5677777766432 Q ss_pred eeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCC-----CCCcch-HHHHHHHHHH Q lcl|NC_010808. 195 PIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----RRKGDY-EKVITLIDLY 268 (512) Q Consensus 195 ~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~-----~g~s~~-~~v~~liDa~ 268 (512) . ......+.+|++..++++...+.+ ...+|++|.||||+|.|++ +|.|++ ++|++|+|++ T Consensus 149 ~-----~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~ 214 (422) T protein:vir:97 149 S-----NGNPTLEAYFTDKDIWYYPKKGKP---------YNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAA 214 (422) T ss_pred C-----CCcEEEEEEEcCceEEEEcCCCcc---------ccccCCCCCcceEEecccCCCccccCccccchhHHHHHHHH Confidence 1 122334455666655555443222 2358999999999999863 688988 7899999999 Q ss_pred HHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCC---CCCcceeEEeecCCHHHHHH Q lcl|NC_010808. 269 DNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIET---EGSVDGGYIYKQYDVQGTEA 345 (512) Q Consensus 269 ~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~l~~~~~~~~~~~ 345 (512) |++++++.+..+++++|+++++|...... .... +. .........+. +.+++++.+ .+.+.+.+.+ T Consensus 215 ~r~~~~~~~~~e~~a~pqr~i~G~d~d~~-~~~~------~~----~~~~~i~~~~~de~~~~~~v~q~-~~~~l~~~~~ 282 (422) T protein:vir:97 215 KRTLERAEVTAEFYSFPQKYVLGMDPDAK-PMEK------WR----ATVSTLLEISKDEDGDKPTVGQF-TTASMAPFME 282 (422) T ss_pred HHHHHHHHHHHHHhcchhhhhcccCcccc-cCch------hh----hhhhhhhccCCCCCCCcceeeec-CCCChhHHHH Confidence 99999999999999999999999732111 1100 00 11111111111 122333322 1234444555 Q ss_pred HHHHHHHHHHHHhccccccccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccce Q lcl|NC_010808. 346 YKDRLNSDIHMFTNTPNMKDDNFSGT-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNT 424 (512) Q Consensus 346 ~~~~l~~~i~~~s~~p~~~~~~~~~n-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~ 424 (512) .++.+..+++..|++|...++..+.| +||+||++++.+|.+|++++++.|+.+|++++++++++.+.... ...++.+ T Consensus 283 ~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~--~~~~~~~ 360 (422) T protein:vir:97 283 HLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPY--LRNQFMD 360 (422) T ss_pred HHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc--cchhhcc Confidence 55555555556667787777665555 69999999999999999999999999999999999887664432 3456778 Q ss_pred eeEEeCCCCCcC---HHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_010808. 425 VRYVYNRNLPKS---LIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKES 480 (512) Q Consensus 425 i~i~f~~~~p~d---~~~~~~~~~kl~----g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~ 480 (512) +++.|.++.|.+ .++.+|+++|+. |+++.+++++++|+ ++++.|+.++++++.+- T Consensus 361 ~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~-~~~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 361 TVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTGV-KGADKPIPAITEVTTDG 422 (422) T ss_pred ceEEEccCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcCC-CchhHHHHHHHhhhccC Confidence 999999888888 677889999973 67899999999988 77888899998886554 No 60 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=100.00 E-value=1.1e-66 Score=382.30 Aligned_cols=393 Identities=11% Similarity=0.016 Sum_probs=297.8 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCCchhH Q lcl|NC_010808. 41 NINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDV 120 (512) Q Consensus 41 ~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~ 120 (512) -...++..+.+....+++|++++.+||+|+|++.+.+...+++.+.++|+++||+++||+++++++..++++ ++|. T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~--~~d~-- 76 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHKRRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFE--NDDF-- 76 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcccCccc--CCch-- Confidence 012333333444566778999999999999998776666677777788999999999999999999988875 4443 Q ss_pred HHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCC Q lcl|NC_010808. 121 LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTD 200 (512) Q Consensus 121 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~ 200 (512) .+++||+.|+|+....++++++++|||||++||.+++|.++|+++||.+++++||+. .+++.++++++... . T Consensus 77 --~l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~~i~~~sp~~~~~i~D~~-~~~~~~a~~~~~~d-----~ 148 (409) T protein:vir:94 77 --TVNEIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAVRLQVIEAVNATGIIDPI-TGLLTEGYAVLERD-----E 148 (409) T ss_pred --HHHHHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCceEEEEeccceEEEEEecC-CCceeeeEEEEEec-----C Confidence 478999999999999999999999999999999999999999999999999999874 56799999887432 1 Q ss_pred cceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCC-----CCCcch-HHHHHHHHHHHHHHHH Q lcl|NC_010808. 201 EDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----RRKGDY-EKVITLIDLYDNAESD 274 (512) Q Consensus 201 ~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~-----~g~s~~-~~v~~liDa~~~~~s~ 274 (512) ........+|+++.++.+....+.+ ...+|++|.||||+|.|++ +|+|++ ++|++|+|++|+++++ T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~ 220 (409) T protein:vir:94 149 NNNVVLEAHFLPDRTDYYYRDSRNN--------ISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLER 220 (409) T ss_pred CCceEEEEEEecCcEEEEEecCcee--------EeeeCCCCCcceEEeccccccccccCccccchhHHHHHHHHHHHHHH Confidence 2234556789999998887765543 2358999999999999864 688988 6899999999999999 Q ss_pred HHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccC-CCCCcceeEEee-cCCHHHHHHHHHHHHH Q lcl|NC_010808. 275 TANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIE-TEGSVDGGYIYK-QYDVQGTEAYKDRLNS 352 (512) Q Consensus 275 ~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~l~~-~~~~~~~~~~~~~l~~ 352 (512) +.+..++|++|+++++|..... ..... +.. ..+.....+ ..++..+++-+. +.+.+.+...++.+.. T Consensus 221 ~~~~~e~~a~pqr~i~G~d~d~-~~~~~------~~~----~~~~i~~~~~d~dg~~~~v~q~~~~~l~~~~~~l~~~~~ 289 (409) T protein:vir:94 221 ADVTAEFYSFPQKYVTGLSDDA-EPMET------WKA----TVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAA 289 (409) T ss_pred HHHHHHHhcChhheeEecCCCC-cccch------hhh----hHHHhhcCCCCCCCCCceEEecCCCChhHHHHHHHHHHH Confidence 9999999999999999973211 11111 111 111111111 122223333222 2344555555555666 Q ss_pred HHHHHhccccccccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCC Q lcl|NC_010808. 353 DIHMFTNTPNMKDDNFSGT-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNR 431 (512) Q Consensus 353 ~i~~~s~~p~~~~~~~~~n-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~ 431 (512) +++..|++|...++..+.| +||+||++++.+|..+++++++.|+.+|++++++++++.+.... ...++.++++.|.+ T Consensus 290 ~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~--~~~~~~~~~v~W~p 367 (409) T protein:vir:94 290 GFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPY--LREQFRKTKPKWEP 367 (409) T ss_pred HHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc--cccccccceEEecc Confidence 6666667777777655445 79999999999999999999999999999999999887665432 34567789999997 Q ss_pred CCCcC---HHHHHHHHHHHh--c--cCChHHHHHhCCCCCCH Q lcl|NC_010808. 432 NLPKS---LIEELKAYIDSG--G--KISQTTLMSLFSFFQDP 466 (512) Q Consensus 432 ~~p~d---~~~~~~~~~kl~--g--~~s~et~~~~~~~v~d~ 466 (512) ..|++ .++.||+++|+. | +.+.++++.++|+.++. T Consensus 368 ~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 368 LFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred CCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 76666 567889999984 4 46679999999987542 No 61 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=100.00 E-value=1.3e-65 Score=376.37 Aligned_cols=392 Identities=11% Similarity=0.027 Sum_probs=298.4 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCCchhH Q lcl|NC_010808. 41 NINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDV 120 (512) Q Consensus 41 ~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~ 120 (512) -...++..+.+....+++|+.++.+||+|+|++.+.+...+++.+.++|+++||+++||+++++++..++++ ++|. T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~--~~d~-- 76 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHKRRAEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFE--NDDF-- 76 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhccccccc--Ccch-- Confidence 112334444455567789999999999999998776666777777888899999999999999999988875 3443 Q ss_pred HHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCC Q lcl|NC_010808. 121 LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTD 200 (512) Q Consensus 121 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~ 200 (512) .+++||+.|+|+....+++++|++|||||++||.+++|.++|+++||.+++++||+. .+++.+++++|... . T Consensus 77 --~l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~~i~~~sP~~~~~i~D~~-~~~~~~a~~~~~~d-----~ 148 (409) T protein:vir:16 77 --TVNEIFEENNPDIFFDSTVLSALIASCSFTYISKGENDAVRLQVIEATNATGIIDPI-TGLLTEGYAVLERD-----E 148 (409) T ss_pred --HHHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEeecc-cccceeeeEEEEec-----C Confidence 488999999999999999999999999999999999999999999999999999874 67788888877432 1 Q ss_pred cceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCC-----CCCcch-HHHHHHHHHHHHHHHH Q lcl|NC_010808. 201 EDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----RRKGDY-EKVITLIDLYDNAESD 274 (512) Q Consensus 201 ~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~-----~g~s~~-~~v~~liDa~~~~~s~ 274 (512) ........+|+++.++.+......+ ...+|++|.||||+|.|++ +|+|++ ++|++|+|++|+++++ T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~ 220 (409) T protein:vir:16 149 NNNVVLEAHFLPDRTDYYYRDSRNN--------ISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLER 220 (409) T ss_pred CCceEEEEEEecCcEEEEEecCccc--------cceecCCCCcceEEecccccccccCCccccchhHHHHHHHHHHHHHH Confidence 2233456789999988887655443 2367999999999999863 688988 6799999999999999 Q ss_pred HHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccC---CCCCcceeEEeecCCHHHHHHHHHHHH Q lcl|NC_010808. 275 TANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIE---TEGSVDGGYIYKQYDVQGTEAYKDRLN 351 (512) Q Consensus 275 ~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~l~~~~~~~~~~~~~~~l~ 351 (512) +.+..++|++|+++++|..... ..... +. ...+.....+ .+.+++++.+ .+.+.+.+.+.++.+. T Consensus 221 ~~~~~e~~a~pqr~i~G~d~d~-~~~~~------~~----~~~~~i~~~~~d~~g~~~~v~q~-~~~~l~~~~~~l~~~~ 288 (409) T protein:vir:16 221 ADVTAEFYSFPQKYVTGLSDDA-EPMET------WK----ATVSSMLQFTKDEDGDKPTLGQF-TQPSMSPFTEQLRTAA 288 (409) T ss_pred HHHHHHHhcChhheeEecCCCC-Cccch------hh----hhhhHhhccCCCCCCCCceEEec-CCCChhHHHHHHHHHH Confidence 9999999999999999974211 11110 11 1111111111 1223344322 2234455556666666 Q ss_pred HHHHHHhccccccccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeC Q lcl|NC_010808. 352 SDIHMFTNTPNMKDDNFSGT-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYN 430 (512) Q Consensus 352 ~~i~~~s~~p~~~~~~~~~n-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~ 430 (512) .+++..|++|...++..+.| +||+||++++.+|..+++++++.|+.+|++++++++.+.+.... ....+..+++.|. T Consensus 289 ~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~--~~~~~~~~~v~W~ 366 (409) T protein:vir:16 289 AGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPY--LREQFSKTKPKWE 366 (409) T ss_pred HHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--cchhhccceEEec Confidence 66666777887777755555 69999999999999999999999999999999999887655432 2344578899999 Q ss_pred CCCCcC---HHHHHHHHHHHhc----cCChHHHHHhCCCCCCH Q lcl|NC_010808. 431 RNLPKS---LIEELKAYIDSGG----KISQTTLMSLFSFFQDP 466 (512) Q Consensus 431 ~~~p~d---~~~~~~~~~kl~g----~~s~et~~~~~~~v~d~ 466 (512) ++.+++ .++.+|+++|+.+ +...+++++++|+..+. T Consensus 367 ~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 367 PLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred CCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 776555 7899999999843 34568999999986542 No 62 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=100.00 E-value=8.4e-62 Score=355.42 Aligned_cols=461 Identities=9% Similarity=0.058 Sum_probs=307.3 Q ss_pred hhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHH----HHHHHHHHHHHHHHhccccccccccccc-ccccccc Q lcl|NC_010808. 13 LRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHH----MDYQRPRLKVLSDYYEGKTKNLVELTRR-KEEYMAD 87 (512) Q Consensus 13 ~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~----~~~~~~r~~~~~~yy~G~~~~~~~~~~~-~~~~~~~ 87 (512) -+.+.+..|+.-.+..+. .+.+.+++.+. ...+..+++++++||+|+|+++++.... ..+.+.+ T Consensus 1 m~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~ 69 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGL-----------LKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNR 69 (496) T ss_pred ChhHHHHHHHHHHHHhcc-----------chhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCcccc Confidence 011111111111111111 11122222111 2455678999999999999987654433 2234456 Q ss_pred eeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEE Q lcl|NC_010808. 88 NRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKS 167 (512) Q Consensus 88 ~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~ 167 (512) +|+++|||+.||++.|+|++|+|++++++++..++.|+++++.|+|...+.++++.|+++|.+|+++|.|++|++++.++ T Consensus 70 ~~~~~n~~k~i~~~~a~~l~~~p~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v 149 (496) T protein:vir:38 70 RQLSMNLPKVTAKYMSKLLFNEKVKINIDDKAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFA 149 (496) T ss_pred ceeecchHHHHHHHHhhhhhCCcceEeeCChHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcEEEEEE Confidence 77899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCc----EEE--EEecCCc--cccc------ccccc Q lcl|NC_010808. 168 DAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG----VYR--YLTSRTN--GLKL------TPREN 233 (512) Q Consensus 168 ~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~----~~~--~~~~~~~--~~~~------~~~~~ 233 (512) +|.+++|+|+++.+-..+++++.|.. ..+..+.++.|+... +.+ |...... +..+ ..... T Consensus 150 ~~~~~~P~~~~~~~~~~~~f~~~~~~------~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~ 223 (496) T protein:vir:38 150 TADCMYPLSNDSENVDECVIANSFHK------NNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEP 223 (496) T ss_pred cccceEEEEecCCcEEEEEEEEEEEe------CCeEEEEEEEEEEeCceEEEEEEEEecCCccccCcccccccccccccc Confidence 99999999987643333444433321 223444566655321 111 2221111 0000 01112 Q ss_pred ccccccccccceEeecCC---------CCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhh Q lcl|NC_010808. 234 GFESHSFERMPITEFSNN---------ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQK 304 (512) Q Consensus 234 ~~~~~~~~~vPvv~~~n~---------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~ 304 (512) ....+++.++|+++|+++ +.|+|+|++++++||+||.++|++++.++....++.+...+.....+ ... T Consensus 224 ~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~~~~i~v~~~~l~~~~~-~~g-- 300 (496) T protein:vir:38 224 VVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVN-LDG-- 300 (496) T ss_pred ceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhcccceecchHHhhccCC-CCC-- Confidence 223456778888888653 46899999999999999999999999999877776663222111100 000 Q ss_pred hccccccchhhhhhccccc-CCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccc-cccccchHHHHHHHHH Q lcl|NC_010808. 305 EANVLFLEPTVYENRDTGI-ETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDD-NFSGTQSGEAMKYKLF 382 (512) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n~Sg~Ai~~~~~ 382 (512) .................. ..+.+..++.++.++..+.+.+.++.+.+.|...++++...++ ..+|+.||.+++++++ T Consensus 301 -~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~ 379 (496) T protein:vir:38 301 -STTQYFDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKS 379 (496) T ss_pred -ccccCCCCccceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHH Confidence 000011111111111111 2223335677777788888888888888888777777655443 2346679999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-cccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHh Q lcl|NC_010808. 383 GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID-ANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSL 459 (512) Q Consensus 383 ~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~-~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~ 459 (512) .|.++++.+++.|+.+|++++++|+.+........ ...+...+++.|++++|.|..+++++++++ +|++|++|++.. T Consensus 380 ~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~ 459 (496) T protein:vir:38 380 ETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQR 459 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHh Confidence 99999999999999999999999987655332211 223445689999999999999999999987 699999999999 Q ss_pred CCCCCCHH--HHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCC Q lcl|NC_010808. 460 FSFFQDPE--LEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDD 503 (512) Q Consensus 460 ~~~v~d~~--~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (512) +++++|++ +|++||++|++..++ ..+.....+ +++ T Consensus 460 ~~~~~d~ea~~el~ri~~E~~~~~~------~~d~~~~~~---~~e 496 (496) T protein:vir:38 460 AWNITEAEADEWAEMLAKEKQAEMP------NNDMNGIFG---EEE 496 (496) T ss_pred cCCCChHHHHHHHHHHHHhhhccCc------cccccCCCC---CCC Confidence 99998755 488888888755321 111111111 111 No 63 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=100.00 E-value=8.2e-59 Score=339.02 Aligned_cols=455 Identities=9% Similarity=0.062 Sum_probs=307.3 Q ss_pred cccchhHHhhhcHH------HHHHHHHH----HHHHHHHHHHHHHHHhccccccccccccc-ccccccceeeecchHHHH Q lcl|NC_010808. 30 TYDGTESDLLQNIN------EVSKYIEH----HMDYQRPRLKVLSDYYEGKTKNLVELTRR-KEEYMADNRVAHDYASYI 98 (512) Q Consensus 30 ~~~~~~~~~~~~~~------~l~~~i~~----~~~~~~~r~~~~~~yy~G~~~~~~~~~~~-~~~~~~~~ri~~n~~~~i 98 (512) +......++.+.++ .|.+.+.+ ....+..++.++++||.|+|+.+++.... ....+.++|+++|+++.| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~~i 80 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNLPKVT 80 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecchHHHH Confidence 11111111111111 11111110 12445578999999999999877654332 233445778999999999 Q ss_pred HHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeC Q lcl|NC_010808. 99 SDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDN 178 (512) Q Consensus 99 v~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~ 178 (512) |++.|+|++|+|++++++++..++.|+++++.|+|...+.++++.|+++|.+|+++|.|++|++++.+++|.+++|++.+ T Consensus 81 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~a~~~~Pi~~d 160 (499) T protein:vir:80 81 AKYMSKLLFNEKVKINIDDETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATADCMYPLSND 160 (499) T ss_pred HHHHHHhhhCCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcEEEEEEcCCceEEEEec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999876 Q ss_pred CCCceeEEEEEEeeeeeeccCCcceEEEEEEEc--CCcE--EE-----EEecCCc--cccc------ccccccccccccc Q lcl|NC_010808. 179 TIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFT--SHGV--YR-----YLTSRTN--GLKL------TPRENGFESHSFE 241 (512) Q Consensus 179 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt--~~~~--~~-----~~~~~~~--~~~~------~~~~~~~~~~~~~ 241 (512) +.+-..++++..+.. + .+..+.++.|+ .... ++ |...... +..+ .........++++ T Consensus 161 ~~~~~~~~f~~~~~~---~---~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~ 234 (499) T protein:vir:80 161 SENVDECLIANSFHK---N---NKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPSLT 234 (499) T ss_pred CCCeEEEEEEEEEee---c---CeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCceeecCCC Confidence 544334444433322 1 12333344332 2211 11 1111111 1110 0111222334678 Q ss_pred ccceEeecCC---------CCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccc Q lcl|NC_010808. 242 RMPITEFSNN---------ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLE 312 (512) Q Consensus 242 ~vPvv~~~n~---------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~ 312 (512) ++|+++|+++ +.|+|+|++++++||+||+++|++++.++....++.|...+.....+. .+ ......+ T Consensus 235 ~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~-~g---~~~~~~~ 310 (499) T protein:vir:80 235 RPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNL-DG---STTQYFD 310 (499) T ss_pred ccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhcccceecchhhhhccCCC-CC---CcccCCC Confidence 8899988754 458999999999999999999999999999888888744433211110 00 0000111 Q ss_pred hhhhhhcccccC-CCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccc-cccccchHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 313 PTVYENRDTGIE-TEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDD-NFSGTQSGEAMKYKLFGLEQRTKT 390 (512) Q Consensus 313 ~~~~~~~~~~~~-~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n~Sg~Ai~~~~~~l~~k~~~ 390 (512) ............ .+.+..++.+++++..+.+...++.+.+.|...++++...++ ..+|+.||.+++++++.+.+++.. T Consensus 311 ~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~~ 390 (499) T protein:vir:80 311 STDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNS 390 (499) T ss_pred cccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHHH Confidence 111111111112 222334777778888888888888888888887777654443 234667999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCC-cccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHH Q lcl|NC_010808. 391 KEGLFTKGLRRRAKLLETILKNTRSID-ANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPE 467 (512) Q Consensus 391 ~~~~~~~~l~~~~~li~~~l~~~~~~~-~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~ 467 (512) +++.|+.+|++++++|+.+........ ...+...+++.|+++++.|..+++++.+++ +|++|++|++..+++++|++ T Consensus 391 ~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d~e 470 (499) T protein:vir:80 391 HSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIALQRAWNITEAE 470 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHhhcCCCChHH Confidence 999999999999999998765543222 223456799999999999999999999886 69999999999999988855 Q ss_pred --HHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCC Q lcl|NC_010808. 468 --LEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDD 503 (512) Q Consensus 468 --~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (512) +|++||++|+.... +..+..+..+. ++ T Consensus 471 a~~el~~i~~E~~~~~------~~~d~~g~~ge---~e 499 (499) T protein:vir:80 471 ADEWAEMLAKEKQAEI------PNNDMTGIFGE---EE 499 (499) T ss_pred HHHHHHHHHHHhhcCC------CCCCccccCCC---CC Confidence 56888888765432 11111111111 11 No 64 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=100.00 E-value=8.7e-56 Score=322.44 Aligned_cols=468 Identities=10% Similarity=0.028 Sum_probs=317.3 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri 90 (512) |++....+.+|++-.+..-.-.. +. .+.+-.+. ..-..+..+++++++||.|+++.+.... .....+.++++ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~----~~-~i~d~~~i--~~~~~~~~~i~~~~~~Y~g~~~~l~~~~-~~~~~~~~~~~ 72 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKS----LG-QIIDDPRI--NLPADEVERIARDKRYYMDDFKQVTHKN-SYGDTQKHELQ 72 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhh----hh-hhhcccCC--CCCHHHHHHHHHHHHHhcCCCccccccc-cCCCcccccee Confidence 88888888888876543211111 00 00000000 0114556788899999999998765433 23444556778 Q ss_pred ecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccc Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAM 170 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~ 170 (512) ++|+++.||+..|++++|+|++++++++..++.|+++++.|+|...+.++++.++++|.+++.+|+| .|++++.+++|. T Consensus 73 slnl~~~i~~~~A~ll~~e~~~i~~~d~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D-~~~~~i~~v~ad 151 (505) T protein:vir:79 73 SVNVTKLASAKLASLIFNEQCQVTVSDETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVD-SGKIKLAWATAD 151 (505) T ss_pred ecchHHHHHHHHHhhhcCCCceeecCChHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEe-CCceEEEEEcCC Confidence 8999999999999999999999999999999999999999999999999999999999999999997 578999999999 Q ss_pred eeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCC----cEEE--EEecCCc--ccc--------ccccccc Q lcl|NC_010808. 171 STFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSH----GVYR--YLTSRTN--GLK--------LTPRENG 234 (512) Q Consensus 171 ~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~----~~~~--~~~~~~~--~~~--------~~~~~~~ 234 (512) +++|++.++.+...++++..|.... .......+.++.|+.+ .|.+ |...... +.. ....... T Consensus 152 ~~~P~~~d~~~~~~~a~~~~~~~~~--~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~ 229 (505) T protein:vir:79 152 QVYPLQADTNQVNELAIASRTTEVE--NHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQ 229 (505) T ss_pred eeEEEEEcCCCeEEEEEEEEEEEec--CCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccCcc Confidence 9999965555554555554443322 2222234456777532 2222 2221111 100 0001112 Q ss_pred cccccccccceEeecC----C-----CCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhh Q lcl|NC_010808. 235 FESHSFERMPITEFSN----N-----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKE 305 (512) Q Consensus 235 ~~~~~~~~vPvv~~~n----~-----~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~ 305 (512) ....+++++|+++|++ + +.|.|+|++++++||++|.++|++++.++....++.|...+............. T Consensus 230 ~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~~ 309 (505) T protein:vir:79 230 VKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYGGQASE 309 (505) T ss_pred eeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccCCCCccccc Confidence 2234566667777754 2 468999999999999999999999999999888888744332221111000000 Q ss_pred ccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccccccccc-ccccchHHHHHHHHHHH Q lcl|NC_010808. 306 ANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN-FSGTQSGEAMKYKLFGL 384 (512) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~n~Sg~Ai~~~~~~l 384 (512) .+....+........... .++++.++.+++++..+.+...++.+.+.|...++.+...++. ..+..||++++++.+.+ T Consensus 310 ~~~~~fd~~~~~y~~~~~-~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l 388 (505) T protein:vir:79 310 THPPMFDPDETVYQAMYG-DASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQT 388 (505) T ss_pred ccccCCCccceeeeeccC-CCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhHH Confidence 000001111111111111 2334557788888888999988888888888877765444332 33567999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-------CcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHH Q lcl|NC_010808. 385 EQRTKTKEGLFTKGLRRRAKLLETILKNTRSI-------DANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTT 455 (512) Q Consensus 385 ~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~-------~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et 455 (512) .++++.+++.|+.+|++++++|+.+....... ....+...+++.|+++++.|..+.++..+++ +|++|+++ T Consensus 389 ~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~Gi~s~e~ 468 (505) T protein:vir:79 389 YQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQAQVMPKKQ 468 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHH Confidence 99999999999999999999999876554321 1222334689999999999999999988886 79999999 Q ss_pred HHHhCCCCCC--HHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_010808. 456 LMSLFSFFQD--PELEVKKIEEDEKESIKKAQKGIYKD 491 (512) Q Consensus 456 ~~~~~~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~ 491 (512) +++.+++++| +++|++||++|+...++. ....+++ T Consensus 469 ~l~~~~~~~eeea~~el~ri~~E~~~~~p~-~~~~gg~ 505 (505) T protein:vir:79 469 FLMRNYGLDEEEADEWLAQIDAENSTAEPE-FNQFGGD 505 (505) T ss_pred HHHhcCCCChHHHHHHHHHHHHhccccCCC-chhccCC Confidence 9999999987 557899999986542211 0111111 No 65 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=100.00 E-value=2e-55 Score=320.44 Aligned_cols=467 Identities=9% Similarity=0.002 Sum_probs=322.1 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri 90 (512) |+...-.+.+|++..+..+...+....+.. +.| ..-..+..|++++.+||+|+++.++... ..+..+...|+ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~------~~i-~~~~~~~~ri~~~~~~y~g~~~~~~~~~-~~~~~~~~~~~ 72 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDD------PRI-SIDPDEYVRIQTDLDYYSDKLQYIHYQA-SDGIKKKRLKN 72 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcc------ccc-ccCHHHHHHHHHHHHHhcCCCccccccc-CCCCcccccee Confidence 888888889999877776666553332210 001 1124567889999999999998664332 22333345568 Q ss_pred ecchHHHHHHHHHhhhhccCceecCC-chhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEcc Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGNPIQCQDD-DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDA 169 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~~~~~~~~-d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p 169 (512) ++|+++.||+.+|+++++++++++++ ++..++.|+++++.|+|...+.+++..|+++|.+++.+|+|. +.++|.+++| T Consensus 73 sln~~~~i~~~~A~lv~~e~~~i~v~~~~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~i~~v~a 151 (508) T protein:vir:15 73 TINMAKTAARRIASVVFNEKAEIHVKDNNEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDG-NHIKIAWVRA 151 (508) T ss_pred ecchHHHHHHHHHhhhhCCCceEEeCCchHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeC-CeeEEEEEcC Confidence 89999999999999999999999984 455667899999999999999999999999999999999984 6799999999 Q ss_pred ceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEc-----CCcEEEEEecCCc----ccccc--------ccc Q lcl|NC_010808. 170 MSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFT-----SHGVYRYLTSRTN----GLKLT--------PRE 232 (512) Q Consensus 170 ~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt-----~~~~~~~~~~~~~----~~~~~--------~~~ 232 (512) .+++|+..++.+...+++++.+. ..+.......+.++.|+ +..|.+....... +..+. ... T Consensus 152 d~~~P~~~d~~~~~~~af~~~~~--~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~ 229 (508) T protein:vir:15 152 DQFYPLQSNTNDISEAAIASRTQ--RTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELA 229 (508) T ss_pred CeeEEEEEcCCCeEEEEEEEEEE--eecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCC Confidence 99999854443332333333222 22223334455566665 2233222212111 11110 011 Q ss_pred cccccccccccceEeecCC---------CCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhh Q lcl|NC_010808. 233 NGFESHSFERMPITEFSNN---------ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQ 303 (512) Q Consensus 233 ~~~~~~~~~~vPvv~~~n~---------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~ 303 (512) .....+++.++|+++|+++ +.|+|+|++++++||++|.++|++++.++....++.|..++...+.+... T Consensus 230 ~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~d~~~~~-- 307 (508) T protein:vir:15 230 PQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRLGQKHIAVQPGMLRFDDEHKP-- 307 (508) T ss_pred cceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcCCCCCcc-- Confidence 1223356777888887652 46999999999999999999999999998777777775554433322111 Q ss_pred hhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccccccccc-ccccchHHHHHHHHH Q lcl|NC_010808. 304 KEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN-FSGTQSGEAMKYKLF 382 (512) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~n~Sg~Ai~~~~~ 382 (512) ..+...........+.+.+..++.+++++..+.+.+.++.+.+.|...++++...++. .++..||.+++++.+ T Consensus 308 ------~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~ 381 (508) T protein:vir:15 308 ------TFDTEQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNS 381 (508) T ss_pred ------ccCCCCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHH Confidence 1122222223333344445568888889999999999999888888888776544432 235579999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC---------cccccceeeEEeCCCCCcCHHHHHHHHHHH--hccC Q lcl|NC_010808. 383 GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID---------ANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKI 451 (512) Q Consensus 383 ~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~---------~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~ 451 (512) .+.++++.+++.|+.+|++++++|+.++....... ...+...++|.|++++++|..++++..+++ +|++ T Consensus 382 ~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi~ 461 (508) T protein:vir:15 382 MTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGAL 461 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCCC Confidence 99999999999999999999999998766433211 112234588999999999999999998886 6999 Q ss_pred ChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 452 SQTTLMSLFSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 452 s~et~~~~~~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) |++++++++++++| +++|++||++|+.+.... .. ..+-.+.+++| T Consensus 462 s~e~~i~~~~g~~deea~~el~ri~~E~~~~~~~-----~~-----------~~~~~~g~~ge 508 (508) T protein:vir:15 462 SKQTFLQRNYGMTDEQAAEELAKIQSEAPTDTFE-----GG-----------RSAILNGGDGE 508 (508) T ss_pred CHHHHHHhcCCCChHHHHHHHHHHHHhccccCcc-----cc-----------ccccCCCCCCC Confidence 99999999988877 456899999996442110 00 00111111111 No 66 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=100.00 E-value=2.2e-53 Score=309.27 Aligned_cols=466 Identities=11% Similarity=0.025 Sum_probs=308.1 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHH--HHHHHHHHHHHHHHHHHHHhcccccccccccccccccccce Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSK--YIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADN 88 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~--~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ 88 (512) |++..-.+.+|++..+.... +..+.+.. .| ..-..+..||+++.+||+|+++.+..... .+..+.++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~---------~~~~~~~~~~~i-~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~-~~~~~~~~ 69 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTT---------QSLTNITDHPKI-AISKLEYDRITTNLKYYKSDWDSVLYLNT-DGETKKRD 69 (500) T ss_pred CchHHHHHHHHHHHHHHhhc---------chhhhhhccccc-cCCHHHHHHHHHHHHHhcCCCCCcccccC-CCCcccCc Confidence 88888889999876543211 11111110 00 12246678899999999999876543332 33445667 Q ss_pred eeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEc Q lcl|NC_010808. 89 RVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSD 168 (512) Q Consensus 89 ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~ 168 (512) ++++|+++.||+.+|++++|++++++++++..++.|+++++.|+|...+.++++.+++.|.+++.+|+|. ++++|.+++ T Consensus 70 ~~slnl~~~i~~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~I~~v~ 148 (500) T protein:vir:98 70 LNHLPIARTAAKKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKVRVAFVQ 148 (500) T ss_pred eeecchHHHHHHHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEc Confidence 8899999999999999999999999999999999999999999999999999999999999999999984 679999999 Q ss_pred cceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEc--CCc---EEE--EEecCCc--ccc--c----ccccc Q lcl|NC_010808. 169 AMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFT--SHG---VYR--YLTSRTN--GLK--L----TPREN 233 (512) Q Consensus 169 p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt--~~~---~~~--~~~~~~~--~~~--~----~~~~~ 233 (512) |.+++|+..++... ..+++.++.....++ .....+.++.|+ ... |.+ |...... +.. + ..... T Consensus 149 ad~~~P~~~d~~~~-~~~a~~~~~~~~~~~-~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~ 226 (500) T protein:vir:98 149 APVFLPLQSNTQDV-SSAAVVIKSVKTING-KEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKD 226 (500) T ss_pred CCeeEEEEEcCCCe-EEEEEEEEEeeeecC-CceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCc Confidence 99999986665443 334443332222222 223444566654 222 222 2221110 100 0 01111 Q ss_pred ccccccccccceEeecC---------CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh-h Q lcl|NC_010808. 234 GFESHSFERMPITEFSN---------NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK-Q 303 (512) Q Consensus 234 ~~~~~~~~~vPvv~~~n---------~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-~ 303 (512) .....+++++|+++|++ .+.|.|+|++++++||++|..+|++++.++....++.|...+...+...... . T Consensus 227 ~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~ 306 (500) T protein:vir:98 227 EAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDV 306 (500) T ss_pred ceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccc Confidence 22334566667776643 2469999999999999999999999999999888877755543322221110 0 Q ss_pred hhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccc-cccccchHHHHHHHHH Q lcl|NC_010808. 304 KEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDD-NFSGTQSGEAMKYKLF 382 (512) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n~Sg~Ai~~~~~ 382 (512) .....+.++ ...+.......+++..++.+++++..+.+.+.++.+.+.|...++.+...++ ..++..||.+++++++ T Consensus 307 ~~~~~~d~~--~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~ 384 (500) T protein:vir:98 307 VPRPRFESD--QNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENS 384 (500) T ss_pred cCCcccCCC--cceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHH Confidence 001111111 1111222222333445777788887777777777766666555544433322 3346679999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC--CCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHH Q lcl|NC_010808. 383 GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRS--IDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMS 458 (512) Q Consensus 383 ~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~--~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~ 458 (512) .+.++++.+++.|+.+|++++++|+.+...... ...+. ...+++.|+++++.|..++++.++++ +|++|++++++ T Consensus 385 ~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~-~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~ 463 (500) T protein:vir:98 385 DTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPS-MDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQ 463 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCC-CcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHH Confidence 999999999999999999999999987654321 11222 23589999999999999999999886 79999999998 Q ss_pred hCCCCCC--HHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcC Q lcl|NC_010808. 459 LFSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKD 506 (512) Q Consensus 459 ~~~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (512) ++.++++ +++|+++|++|+.... ...+...+--+| T Consensus 464 ~~~g~~eeea~~~l~~i~~E~~~~~-------------~~~~~~~~~~g~ 500 (500) T protein:vir:98 464 KVLNVTEEKAQEIAAEINTGIVDEI-------------NQQRTDTHLYGE 500 (500) T ss_pred hcCCCCHHHHHHHHHHHHHhccccC-------------CCCCccccccCC Confidence 8866665 3355777776632211 001111111111 No 67 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=100.00 E-value=2.2e-53 Score=309.27 Aligned_cols=466 Identities=11% Similarity=0.025 Sum_probs=308.1 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHH--HHHHHHHHHHHHHHHHHHHhcccccccccccccccccccce Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSK--YIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADN 88 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~--~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ 88 (512) |++..-.+.+|++..+.... +..+.+.. .| ..-..+..||+++.+||+|+++.+..... .+..+.++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~---------~~~~~~~~~~~i-~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~-~~~~~~~~ 69 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTT---------QSLTNITDHPKI-AISKLEYDRITTNLKYYKSDWDSVLYLNT-DGETKKRD 69 (500) T ss_pred CchHHHHHHHHHHHHHHhhc---------chhhhhhccccc-cCCHHHHHHHHHHHHHhcCCCCCcccccC-CCCcccCc Confidence 88888889999876543211 11111110 00 12246678899999999999876543332 33445667 Q ss_pred eeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEc Q lcl|NC_010808. 89 RVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSD 168 (512) Q Consensus 89 ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~ 168 (512) ++++|+++.||+.+|++++|++++++++++..++.|+++++.|+|...+.++++.+++.|.+++.+|+|. ++++|.+++ T Consensus 70 ~~slnl~~~i~~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~I~~v~ 148 (500) T protein:vir:30 70 LNHLPIARTAAKKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKVRVAFVQ 148 (500) T ss_pred eeecchHHHHHHHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEc Confidence 8899999999999999999999999999999999999999999999999999999999999999999984 679999999 Q ss_pred cceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEc--CCc---EEE--EEecCCc--ccc--c----ccccc Q lcl|NC_010808. 169 AMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFT--SHG---VYR--YLTSRTN--GLK--L----TPREN 233 (512) Q Consensus 169 p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt--~~~---~~~--~~~~~~~--~~~--~----~~~~~ 233 (512) |.+++|+..++... ..+++.++.....++ .....+.++.|+ ... |.+ |...... +.. + ..... T Consensus 149 ad~~~P~~~d~~~~-~~~a~~~~~~~~~~~-~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~ 226 (500) T protein:vir:30 149 APVFLPLQSNTQDV-SSAAVVIKSVKTING-KEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKD 226 (500) T ss_pred CCeeEEEEEcCCCe-EEEEEEEEEeeeecC-CceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCc Confidence 99999986665443 334443332222222 223444566654 222 222 2221110 100 0 01111 Q ss_pred ccccccccccceEeecC---------CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh-h Q lcl|NC_010808. 234 GFESHSFERMPITEFSN---------NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK-Q 303 (512) Q Consensus 234 ~~~~~~~~~vPvv~~~n---------~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-~ 303 (512) .....+++++|+++|++ .+.|.|+|++++++||++|..+|++++.++....++.|...+...+...... . T Consensus 227 ~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~ 306 (500) T protein:vir:30 227 EAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDV 306 (500) T ss_pred ceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccc Confidence 22334566667776643 2469999999999999999999999999999888877755543322221110 0 Q ss_pred hhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccc-cccccchHHHHHHHHH Q lcl|NC_010808. 304 KEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDD-NFSGTQSGEAMKYKLF 382 (512) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n~Sg~Ai~~~~~ 382 (512) .....+.++ ...+.......+++..++.+++++..+.+.+.++.+.+.|...++.+...++ ..++..||.+++++++ T Consensus 307 ~~~~~~d~~--~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~ 384 (500) T protein:vir:30 307 VPRPRFESD--QNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENS 384 (500) T ss_pred cCCcccCCC--cceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHH Confidence 001111111 1111222222333445777788887777777777766666555544433322 3346679999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC--CCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHH Q lcl|NC_010808. 383 GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRS--IDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMS 458 (512) Q Consensus 383 ~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~--~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~ 458 (512) .+.++++.+++.|+.+|++++++|+.+...... ...+. ...+++.|+++++.|..++++.++++ +|++|++++++ T Consensus 385 ~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~-~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~ 463 (500) T protein:vir:30 385 DTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPS-MDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQ 463 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCC-CcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHH Confidence 999999999999999999999999987654321 11222 23589999999999999999999886 79999999998 Q ss_pred hCCCCCC--HHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcC Q lcl|NC_010808. 459 LFSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKD 506 (512) Q Consensus 459 ~~~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (512) ++.++++ +++|+++|++|+.... ...+...+--+| T Consensus 464 ~~~g~~eeea~~~l~~i~~E~~~~~-------------~~~~~~~~~~g~ 500 (500) T protein:vir:30 464 KVLNVTEEKAQEIAAEINTGIVDEI-------------NQQRTDTHLYGE 500 (500) T ss_pred hcCCCCHHHHHHHHHHHHHhccccC-------------CCCCccccccCC Confidence 8866665 3355777776632211 001111111111 No 68 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=100.00 E-value=9e-50 Score=289.46 Aligned_cols=477 Identities=10% Similarity=0.007 Sum_probs=303.1 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHH--HHHHHHHHHHHHHHHHHHHhcccccccccccccccccccce Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSK--YIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADN 88 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~--~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ 88 (512) |++..-.+.+|++-..... . +..+.+.. .|. ....+..+|.++.+||+|+++.+.... ...+...+. T Consensus 1 m~~~~~~k~~~~k~~~~~~-~--------~~~~~i~~~~~i~-~~~~~~~~i~~~~~~y~g~~~~~~~~~-~~~~~~~~~ 69 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQ-T--------SNLNSILEHPKIA-VTQEEYDRIKRNLVYYQSKWDDVQYKN-TDGDIKSRP 69 (522) T ss_pred CchHHHHHHHHHHHHHHhh-c--------ccchhccccCCCC-CCHHHHHHHHHHHHHhcCCcccccccc-cCcchhccc Confidence 6666666666665332210 0 00000000 011 135667889999999999987654332 233444566 Q ss_pred eeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEc Q lcl|NC_010808. 89 RVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSD 168 (512) Q Consensus 89 ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~ 168 (512) |+++|+++.||+..|+++++++++++++++..++.|+++++.|+|...+.+++..+++.|..++.+|+| .|++++.+++ T Consensus 70 ~~slnl~~~i~~~~A~lv~~e~~~i~v~d~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d-~~~~~i~~v~ 148 (522) T protein:vir:47 70 MNHLPIARTASKKIASLVYNEQATITTKNEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYID-GDKVRVAFIQ 148 (522) T ss_pred ceecchHHHHHHHHhhhhcCCcceeecCChHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEc-CCceEEEEEc Confidence 888999999999999999999999999999999999999999999999999999999999999999997 5789999999 Q ss_pred cceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEc----------------CCcEEE--EEecCCc--ccc- Q lcl|NC_010808. 169 AMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFT----------------SHGVYR--YLTSRTN--GLK- 227 (512) Q Consensus 169 p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt----------------~~~~~~--~~~~~~~--~~~- 227 (512) |.+++|+..++.. ...+++-. ......+......+.++.++ +..|.+ |...... +.. T Consensus 149 ad~~~P~~~~~~~-~~e~a~~~-~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v 226 (522) T protein:vir:47 149 APVFFPLESNTQD-VSSAAILT-KTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRV 226 (522) T ss_pred CCceEEEEEcCCc-eEEEEEEE-EEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCccc Confidence 9999998554432 22223221 11111111111122233321 111211 2211110 000 Q ss_pred -------ccccccccccccccccceEeecCC---------CCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec Q lcl|NC_010808. 228 -------LTPRENGFESHSFERMPITEFSNN---------ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKG 291 (512) Q Consensus 228 -------~~~~~~~~~~~~~~~vPvv~~~n~---------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g 291 (512) ...........++.++++++|+++ +.|+|+|+++++++|++|.++|++++.++....++.|... T Consensus 227 ~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~ 306 (522) T protein:vir:47 227 NLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEH 306 (522) T ss_pred cccccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchH Confidence 011111222345666677777553 5699999999999999999999999999999888887544 Q ss_pred CCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccc-ccccccccc Q lcl|NC_010808. 292 NLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTP-NMKDDNFSG 370 (512) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p-~~~~~~~~~ 370 (512) +....... ...........+.....+........++..++.+++.+..+.+.+.++.+.+.|...++.. .......++ T Consensus 307 ~l~~~~~~-~~g~~~~~~~fd~~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~ 385 (522) T protein:vir:47 307 LTQRQYQR-PDGTIDFRPRFDVEQNVYMQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQG 385 (522) T ss_pred HhccCCCC-CCcccccccccCcccceEeecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccc Confidence 32221110 0000000001111111222233333445567788888888877777777766665555443 322223345 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-cccccceeeEEeCCCCCcCHHHHHHHHHHH-- Q lcl|NC_010808. 371 TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID-ANKDFNTVRYVYNRNLPKSLIEELKAYIDS-- 447 (512) Q Consensus 371 n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~-~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl-- 447 (512) ..||.+++++.+.+.++++++++.|+.+|+++++.|+.+........ .......++|.|+++++.|..++++..+++ T Consensus 386 ~kTAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~ 465 (522) T protein:vir:47 386 MKTATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVA 465 (522) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHHHh Confidence 67899999999999999999999999999999999998765433211 112334689999999999999999998885 Q ss_pred hccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcc Q lcl|NC_010808. 448 GGKISQTTLMSLFSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTV 508 (512) Q Consensus 448 ~g~~s~et~~~~~~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (512) +|++|+++++++++++++ +++|++||++|+.+..+... +..+.+ +.+...+++++ T Consensus 466 aG~~s~e~~i~~~~g~~eeea~~el~ri~~E~~~~~~~~~-----~~~~~~-~~~~~~~d~~~ 522 (522) T protein:vir:47 466 AGFSTKKRAIGKTLNISGVEAEKELNAINSELLPMNDAEL-----AIYGMH-DQNEEKADDKG 522 (522) T ss_pred cCCCCHHHHHHhcCCCChHHHHHHHHHHHHhhccCCCCCC-----CCCCCC-CcccccCCCCC Confidence 799999999999877776 45689999888654321110 111111 11111122222 No 69 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=100.00 E-value=4.1e-48 Score=280.40 Aligned_cols=475 Identities=14% Similarity=0.090 Sum_probs=324.5 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |++..+=+... +.+|+-+.|+- +.+..|..+++.+|+.|.+||.+.+..+....+. T Consensus 1 ~~~~~~~~~~~------~~~~~g~~~~p------------------~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg 56 (527) T protein:vir:10 1 MGQDKRQYGST------QQLRAGEANFP------------------NAVTDFDKARLASYRLYEDMYLTNTSDYQVILRG 56 (527) T ss_pred CCccccccCCC------cCcCCccccCc------------------ccCCHHHHHHHHHHHHHHHHhcCchhheeeecCC Confidence 88777766665 33444444432 1145677888999999999999987765432222 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCcee--cCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECC Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQC--QDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ 158 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~--~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~ 158 (512) ..-+-..++.++-.++|+.....| .+.+..+ +..++.+++.|..|++.|++..++.+..+++++.|.+..++-+|+ T Consensus 57 -~~~~~~r~~~~ps~~~~~~~~~~~-~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~ 134 (527) T protein:vir:10 57 -GDEGDQRPIYVPNGEKLIEAKMRF-LGQGLKWEFSKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDD 134 (527) T ss_pred -ccccccceeeehhhHHhhCCccee-eccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeecc Confidence 222223457788888888776554 4566654 344678899999999999999999999999999998666555554 Q ss_pred C----CceEEEEEccceeEEEEeCCCCceeEEE--EEEeeeeeeccCCcc----------------------eEEEEEEE Q lcl|NC_010808. 159 D----DETRLYKSDAMSTFVIYDNTIERNSIAG--VRYLRTKPIDKTDED----------------------EVFTVDLF 210 (512) Q Consensus 159 ~----g~~~i~~~~p~~~~~i~d~~~~~~~~~~--v~~~~~~~~~~~~~~----------------------~~~~~~~y 210 (512) + ++++++.+||.+.||+.|+.....+... ++-|....-...+.+ ..+..+.| T Consensus 135 ~k~~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w 214 (527) T protein:vir:10 135 EKDEGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELY 214 (527) T ss_pred CCCcCCCceEeecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeecee Confidence 2 4799999999999999887554444432 222322211110000 00011112 Q ss_pred cCCcEEEEEecC---Cc-cccccccccccccccccccceEeecCC-----CCCCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 211 TSHGVYRYLTSR---TN-GLKLTPRENGFESHSFERMPITEFSNN-----ERRKGDYEKVITLIDLYDNAESDTANYMSD 281 (512) Q Consensus 211 t~~~~~~~~~~~---~~-~~~~~~~~~~~~~~~~~~vPvv~~~n~-----~~g~s~~~~v~~liDa~~~~~s~~~~~~~~ 281 (512) +...+....... .. -......+....+++++.||||+|+|- .+|+|+++++++++|++|+++|+.+.++.+ T Consensus 215 ~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~ 294 (527) T protein:vir:10 215 EPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVF 294 (527) T ss_pred eccccccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHH Confidence 221111000000 00 001123345678999999999999653 479999999999999999999999999999 Q ss_pred hcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_010808. 282 LNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTP 361 (512) Q Consensus 282 ~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 361 (512) ...|+.++.|....+. .++. .. ........+..++++++..+.-....+.++.|++.|.+.|+.++++| T Consensus 295 sG~Pi~~~tg~~~vd~-~G~~----~~------~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~P 363 (527) T protein:vir:10 295 GGLGFYATDSAPPRDS-RGNM----VP------WTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIP 363 (527) T ss_pred hCCceeeecccccccc-cCCc----Cc------cccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCC Confidence 9999999999764432 1111 11 11222233445777888877766688999999999999999999999 Q ss_pred cccccc--ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhccCCCc--ccccceeeEEeCCCCCcC Q lcl|NC_010808. 362 NMKDDN--FSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAK-LLETILKNTRSIDA--NKDFNTVRYVYNRNLPKS 436 (512) Q Consensus 362 ~~~~~~--~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~-li~~~l~~~~~~~~--~~d~~~i~i~f~~~~p~d 436 (512) ...++. .++++||.||+..+++|.+++.+++..++...++..+ ++..+|.+...... ......+.+.|.+.+|.| T Consensus 364 avA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D 443 (527) T protein:vir:10 364 DIAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVN 443 (527) T ss_pred eeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCC Confidence 999994 4567899999999999999999999999999887654 55555554332211 122346789999999999 Q ss_pred HHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCC-CCCCCCCCCCcCcccC Q lcl|NC_010808. 437 LIEELKAYIDS--GGKISQTTLMSLF---SFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRD-INDDEQDDDTKDTVDK 510 (512) Q Consensus 437 ~~~~~~~~~kl--~g~~s~et~~~~~---~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 510 (512) ..+.++.++++ +|++|.+||+++| +++.|+++|+++|.+++.............-.-. +.+.+-+++..++.-+ T Consensus 444 ~~avie~v~tL~~aGi~S~~tAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~~~d~~~~ 523 (527) T protein:vir:10 444 SEKRFNQLLQLWEAGLIPAKKLTEELSKIMGFELTEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALN 523 (527) T ss_pred HHHHHHHHHHHHHcCchhHHHHHHHHHhccCCCChHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccC Confidence 99999999987 7999999998887 7789999999999999887766555544433222 1222222222222233 Q ss_pred CC Q lcl|NC_010808. 511 KE 512 (512) Q Consensus 511 ~e 512 (512) .- T Consensus 524 ~~ 525 (527) T protein:vir:10 524 GQ 525 (527) T ss_pred CC Confidence 33 No 70 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=100.00 E-value=4.7e-48 Score=280.07 Aligned_cols=475 Identities=14% Similarity=0.091 Sum_probs=324.7 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |++..+=+... +.+|+-+.|+- +.+..|..+++.+|+.|.+||.+.+..+....+. T Consensus 1 ~~~~~~~~~~~------~~~~~g~~~~p------------------~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg 56 (527) T protein:vir:10 1 MGQDKRQYGST------QQLRAGEANFP------------------NAVTDFDKARLASYRLYEDMYLTNTSDYQVILRG 56 (527) T ss_pred CCccccccCCC------cCcCCccccCc------------------ccCCHHHHHHHHHHHHHHHHhcCchhheeeecCC Confidence 88777666665 33444444432 1145677888999999999999987765432222 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCcee--cCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECC Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQC--QDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ 158 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~--~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~ 158 (512) ..-+-..++.++-.++|+.....| .+.+..+ +..++.+++.|..|++.|++..++.+..+++++.|.+..++-+|+ T Consensus 57 -~~~~~~r~~~~ps~~~~~~~~~~~-~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~ 134 (527) T protein:vir:10 57 -GDEGDQRPIYVPNGEKLIEAKMRF-LGQGLKWEFSKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDD 134 (527) T ss_pred -ccccccceeeehhhHHhhCCccee-eccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeecc Confidence 222223457788888888776554 4566654 344678899999999999999999999999999998666555554 Q ss_pred C----CceEEEEEccceeEEEEeCCCCceeEEE--EEEeeeeeeccCCcc----------------------eEEEEEEE Q lcl|NC_010808. 159 D----DETRLYKSDAMSTFVIYDNTIERNSIAG--VRYLRTKPIDKTDED----------------------EVFTVDLF 210 (512) Q Consensus 159 ~----g~~~i~~~~p~~~~~i~d~~~~~~~~~~--v~~~~~~~~~~~~~~----------------------~~~~~~~y 210 (512) + ++++++.+||.+.||+.|+.....+... ++-|....-...+.+ ..+..+.| T Consensus 135 ~k~~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w 214 (527) T protein:vir:10 135 EKDEGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELY 214 (527) T ss_pred CCCcCCCceEeecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeecee Confidence 2 4799999999999999887554444432 222322211110000 00011112 Q ss_pred cCCcEEEEEecC---Cc-cccccccccccccccccccceEeecCC-----CCCCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 211 TSHGVYRYLTSR---TN-GLKLTPRENGFESHSFERMPITEFSNN-----ERRKGDYEKVITLIDLYDNAESDTANYMSD 281 (512) Q Consensus 211 t~~~~~~~~~~~---~~-~~~~~~~~~~~~~~~~~~vPvv~~~n~-----~~g~s~~~~v~~liDa~~~~~s~~~~~~~~ 281 (512) +...+....... .. -......+....+++++.||||+|+|- .+|+|+++++++++|++|+++|+.+.++.+ T Consensus 215 ~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~ 294 (527) T protein:vir:10 215 EPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVF 294 (527) T ss_pred eccccccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHH Confidence 221111000000 00 001123345678999999999999653 479999999999999999999999999999 Q ss_pred hcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_010808. 282 LNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTP 361 (512) Q Consensus 282 ~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 361 (512) ...|+.++.|....+. .++. .. ........+..++++++..+.-....+.++.|++.|.+.|+.++++| T Consensus 295 sG~Pi~~~tg~~~vd~-~G~~----~~------~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~P 363 (527) T protein:vir:10 295 GGLGFYATDSAPPRDS-RGNM----VP------WTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIP 363 (527) T ss_pred hCCceeeecccccccc-cCCc----Cc------cccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCC Confidence 9999999999764432 1111 11 11222233445777888877766688999999999999999999999 Q ss_pred cccccc--ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhccCCC--cccccceeeEEeCCCCCcC Q lcl|NC_010808. 362 NMKDDN--FSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAK-LLETILKNTRSID--ANKDFNTVRYVYNRNLPKS 436 (512) Q Consensus 362 ~~~~~~--~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~-li~~~l~~~~~~~--~~~d~~~i~i~f~~~~p~d 436 (512) ...++. .++++||.||+..+++|.+++.+++..++...++..+ ++..+|.+..... .......+.+.|.+.+|.| T Consensus 364 avA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D 443 (527) T protein:vir:10 364 DIAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVN 443 (527) T ss_pred eeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCC Confidence 999994 4567899999999999999999999999999887654 5555555433221 1122346789999999999 Q ss_pred HHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCC-CCCCCCCCCCcCcccC Q lcl|NC_010808. 437 LIEELKAYIDS--GGKISQTTLMSLF---SFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRD-INDDEQDDDTKDTVDK 510 (512) Q Consensus 437 ~~~~~~~~~kl--~g~~s~et~~~~~---~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 510 (512) ..+.++.++++ +|++|.+||+++| +++.|+++|+++|.+++.............-.-. +.+.+-+++..++.-+ T Consensus 444 ~~avie~v~tL~~aGiiS~etAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~~d~~~~ 523 (527) T protein:vir:10 444 NEKRFAQLLELWEAGLIPAKKLTEELSKIMGFELTEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALN 523 (527) T ss_pred HHHHHHHHHHHHHcCchhHHHHHHHHHhccCCCchHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccC Confidence 99999999987 7999999998887 7789999999999999888766555554443322 1222222222222233 Q ss_pred CC Q lcl|NC_010808. 511 KE 512 (512) Q Consensus 511 ~e 512 (512) .- T Consensus 524 ~~ 525 (527) T protein:vir:10 524 GQ 525 (527) T ss_pred CC Confidence 33 No 71 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=100.00 E-value=3.1e-47 Score=275.57 Aligned_cols=449 Identities=7% Similarity=-0.032 Sum_probs=279.8 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHH-----HHHHHHHHHHHHHhccccccc-------cccc Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHM-----DYQRPRLKVLSDYYEGKTKNL-------VELT 78 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~-----~~~~~r~~~~~~yy~G~~~~~-------~~~~ 78 (512) |++... +.++|.... ..+..++..+.++|.+..... .... T Consensus 1 ~~~~~~----------------------------~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~ 52 (518) T protein:vir:78 1 MGVWSV----------------------------MTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWA 52 (518) T ss_pred Ccchhh----------------------------HHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcc Confidence 433222 222322222 011122223223332221110 0112 Q ss_pred ccccccccceeeecchHHHHHHHHHhhhhccCceecC------CchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEE Q lcl|NC_010808. 79 RRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQD------DDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYE 152 (512) Q Consensus 79 ~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~------~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~ 152 (512) ..+.+....+++++|+++.||+.+|+++++++++++. +++.+++.|++++++|+|...+.+++..+++.|..++ T Consensus 53 ~~~~~~~~~~~~~~~l~~~i~~~~A~ll~~e~~~i~v~~~~~~d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~ 132 (518) T protein:vir:78 53 QGYVPTVHDKLMNSGTGNEIVVVAAEYISGKPLSIDVTGVNGSKDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAV 132 (518) T ss_pred cCCCCccccccccCChHHHHHHHHHHhhcCCCceEEecCccccCcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEE Confidence 2233455567889999999999999999999998864 4566789999999999999999999999999999999 Q ss_pred EEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEE------------cCCcEEE--E Q lcl|NC_010808. 153 LMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLF------------TSHGVYR--Y 218 (512) Q Consensus 153 ~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~y------------t~~~~~~--~ 218 (512) .+|++ +|++++.+++|.+++|+|++.. ...++.+..... . ......+.++.+ .+..|.+ | T Consensus 133 k~~~d-~~~~~i~~v~ad~~~P~~~~g~---~~~~~f~~~~~~-~-~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly 206 (518) T protein:vir:78 133 KINIL-NGRPSISVHSSSQFWIDFKNNE---PFRFNFFEEIPT-S-NKADIYYLVESREIKQWDKEGKKLSGGFVTYSVI 206 (518) T ss_pred EEEEE-CCeeEEEEEcCCeeEEEeecCc---EEEEEEEEEeec-C-CcceeEEEEEeeccccccceeecccceeEEEEEe Confidence 99986 5889999999999999998743 333222211111 1 111112223332 2222222 1 Q ss_pred EecCCcccccc---------------ccccccccccccccceEee-c----C-----CCCCCcchHHHHHHHHHHHHHHH Q lcl|NC_010808. 219 LTSRTNGLKLT---------------PRENGFESHSFERMPITEF-S----N-----NERRKGDYEKVITLIDLYDNAES 273 (512) Q Consensus 219 ~~~~~~~~~~~---------------~~~~~~~~~~~~~vPvv~~-~----n-----~~~g~s~~~~v~~liDa~~~~~s 273 (512) ..+........ ........+.....|++.| + | .+.|.|+|++++++||++|.++| T Consensus 207 ~~~~~~~v~~~~~~~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s 286 (518) T protein:vir:78 207 KIDGDKTTPISAERLPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFT 286 (518) T ss_pred eecCcccccccccccccccccccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHH Confidence 12111111000 0000001111233455554 2 2 23499999999999999999999 Q ss_pred HHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCc----ceeEEeecCCHHHHHHHHHH Q lcl|NC_010808. 274 DTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSV----DGGYIYKQYDVQGTEAYKDR 349 (512) Q Consensus 274 ~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~l~~~~~~~~~~~~~~~ 349 (512) ++++.++....++.|...+......... ... ....+.............+.++ .++.+++.+..+.+.+.++. T Consensus 287 ~~~~e~~~g~~~i~v~~~~l~~~~~~~~-~~~--~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~ 363 (518) T protein:vir:78 287 VYMREGEKTKTKIAASERMFRKKVNKST-DKE--EWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEY 363 (518) T ss_pred HHHHHHHhCCceeeechhHhccCCCCCC-Ccc--ccccCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHH Confidence 9999999877777776554332221110 000 1111111111222211222222 26777888888888888888 Q ss_pred HHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc---ccccceee Q lcl|NC_010808. 350 LNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA---NKDFNTVR 426 (512) Q Consensus 350 l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~---~~d~~~i~ 426 (512) +.+.|...++.+...++..++..||.++++..+.+.++++.++..++.+|+++++.++.++........ ..+...++ T Consensus 364 ~l~~~~~~~G~s~~tfg~~~~~~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~ 443 (518) T protein:vir:78 364 FAQKAVSKSGYNPATFNLGNREVKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVI 443 (518) T ss_pred HHHHHHHhhCCChhhcCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEE Confidence 888887777666555544456789999999999999999999999999999999999998776543221 22334689 Q ss_pred EEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC-CCCCC--HHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCC Q lcl|NC_010808. 427 YVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLF-SFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQD 501 (512) Q Consensus 427 i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~-~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (512) |.|++.+++|..+.++..+++ +|++|++++++++ +.++| +++|++||++|+...... .+.+-...+ T Consensus 444 i~f~D~i~~D~~~~~~~~~~~v~aGimS~e~~i~~~~~~~~deea~~e~~ri~~E~~~~~~~---------~p~~~~g~~ 514 (518) T protein:vir:78 444 IEFPDPMSVNLNELSSTLNNMNSALAMSVEEKVKLIHPKWEDEEIQAEVKRIYLENAIGEVP---------DPEAIGGME 514 (518) T ss_pred EEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHHhcccCCC---------CCccccCCC Confidence 999999999999999998875 7999999999874 56665 457899999997543211 111111111 Q ss_pred CCCc Q lcl|NC_010808. 502 DDTK 505 (512) Q Consensus 502 ~~~~ 505 (512) .+.+ T Consensus 515 ~~~g 518 (518) T protein:vir:78 515 TKGG 518 (518) T ss_pred CCCC Confidence 1111 No 72 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=100.00 E-value=3.9e-46 Score=269.55 Aligned_cols=472 Identities=10% Similarity=0.017 Sum_probs=306.3 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri 90 (512) |++..-.+.+|++-...... .+....+ ...++ .--..+..|+.++.+||+|+++.++... .....+.+.++ T Consensus 1 m~~~~~ik~~~~~~~~~~~~-~~~~~~~-----~~~~i--~~~~~~~~~I~~w~~~Y~g~~~~~~~~~-~~~~~~~~~~~ 71 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSG-QTLKSIN-----DHEKI--NIDPNELARIERNLRQYEGDYPQVEYIN-SQGKIQERDYM 71 (517) T ss_pred CchHHHHHHHHHHHHHHhcc-cchhHhh-----cCCce--ecCHHHHHHHHHHHHHhcCCCccccccc-cccccccccee Confidence 77777777777664443321 1111100 00000 0113456788899999999998664322 22334455678 Q ss_pred ecchHHHHHHHHHhhhhccCceecCCch-----------hHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCC Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGNPIQCQDDDK-----------DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD 159 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~~~~~~~~d~-----------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~ 159 (512) ++|+++.|+..+|++++++++++++++. ..++.|+++++.|+|...+.+++..+++.|.+++.+|+| . T Consensus 72 sl~~~~~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d-~ 150 (517) T protein:vir:98 72 TLNLRKLSADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVD-N 150 (517) T ss_pred ecCcHHHHHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEe-C Confidence 8999999999999999999999987653 357889999999999999999999999999999999998 4 Q ss_pred CceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcE------EE-----EEecCCc--cc Q lcl|NC_010808. 160 DETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGV------YR-----YLTSRTN--GL 226 (512) Q Consensus 160 g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~------~~-----~~~~~~~--~~ 226 (512) |.++|.+++|.+++|+-.++ .+...+++.+....... ......+.++.|+.+.+ ++ |...... +. T Consensus 151 ~~~~I~~v~ad~~~Pl~~~~-~~v~~~ai~~~~~~~~~-~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~ 228 (517) T protein:vir:98 151 GEIEFSWALANAFYPLRSNS-NGISEGVMKSVTTKVIG-NKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGK 228 (517) T ss_pred CeeEEEEEcCCeeEEEEecC-CCeEEEEEEEEEEEeec-CCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCccccc Confidence 77899999999999965443 33444444333232222 22333445666665432 11 2211111 00 Q ss_pred cc------cccccccccccccccceEeecC---------CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec Q lcl|NC_010808. 227 KL------TPRENGFESHSFERMPITEFSN---------NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKG 291 (512) Q Consensus 227 ~~------~~~~~~~~~~~~~~vPvv~~~n---------~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g 291 (512) .+ ........-.++.+.++++|++ .+.|+|+|+++++++|++|..+|++++.++....++.|... T Consensus 229 ~v~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~~i~vp~~ 308 (517) T protein:vir:98 229 RIPLEELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQRTVFVSDV 308 (517) T ss_pred cccccccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCcceecChh Confidence 00 0001111223445544556543 25699999999999999999999999999998888887666 Q ss_pred CCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccc-cc Q lcl|NC_010808. 292 NLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNF-SG 370 (512) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~ 370 (512) +...+.+... ......+..+ ...+.....+ .+++.++..++++..+.+.+.++.+.+.|...++.+...++.. .+ T Consensus 309 ~l~~~~~~~g-~~~~~~~d~~--~~~y~~~~~~-~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~ 384 (517) T protein:vir:98 309 MLRTVPDESG-MPPPQVFDPD--VNVYKSIRMG-TDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRS 384 (517) T ss_pred hhccccCCCC-cccCCCCCcc--cceeeeccCC-CCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccc Confidence 5433332211 1111111111 1112222222 2233466667777888888888888888877777765554432 34 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcc-cccceeeEEeCCCCCcCHHHHHHHHHHH-- Q lcl|NC_010808. 371 TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDAN-KDFNTVRYVYNRNLPKSLIEELKAYIDS-- 447 (512) Q Consensus 371 n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~-~d~~~i~i~f~~~~p~d~~~~~~~~~kl-- 447 (512) ..+|.+++++.+.+.++++++++.++.+|++++++|+.+.......... .....++|.|.+.+++|..+.++..+++ T Consensus 385 ~kTATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~ 464 (517) T protein:vir:98 385 MKTATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRFYGQAKT 464 (517) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHHHHHHHh Confidence 5689999999999999999999999999999999998776543322111 1234689999999999999999999886 Q ss_pred hccCChHHHHHhCCCCCCH--HHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 448 GGKISQTTLMSLFSFFQDP--ELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 448 ~g~~s~et~~~~~~~v~d~--~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) +|++|.+++++++.++++. ++|++||++|+....+ .+. ..+..+....+|| T Consensus 465 aG~ms~~~~i~~~~g~~eeeA~~e~~~i~~E~~~~~~----------~~~----~~~~~~~~~gd~e 517 (517) T protein:vir:98 465 FGFIPTVEAIQRIFKVPKKTAEQWLEEIRKDQIELDP----------VTI----SQRAQKRMFGDEE 517 (517) T ss_pred cCCCCHHHHHHHhCCCChHHHHHHHHHHHHhccccCC----------CCc----cccccCCCCCCCC Confidence 7999999998888666653 5678888887643211 100 1111222222222 No 73 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=100.00 E-value=9.9e-42 Score=245.40 Aligned_cols=472 Identities=11% Similarity=0.073 Sum_probs=284.9 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |++.-.=+... +..|+..+. .++..+..++..+|+.|.+||.|++..+.... T Consensus 1 m~~~~~q~~p~------~~~fp~~~a--------------------~wV~~~D~~RlaaY~ly~d~y~n~~~el~~il-- 52 (563) T protein:vir:74 1 MPYNHKQYDPA------KPFLRGGDD--------------------NIVDENDKNRVRAYDLYENIYLNSAETLKLVL-- 52 (563) T ss_pred CCccccccCCC------ccccccccc--------------------ccCCHHHHHHHHHHHHHHHhhcCchhhhhhhc-- Confidence 54433322211 222333111 12344566788999999999999987643211 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecCCc--------hhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEE Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDD--------KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYE 152 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d--------~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~ 152 (512) ... ...-+..+.++++|++.+ +++|+++.|+++. +.++..|.+|.+.+++..++.+..++|++.|.+.. T Consensus 53 ~G~--dr~~~~~ps~r~~V~~~~-~~Lg~~~~~~Ve~~~~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf 129 (563) T protein:vir:74 53 RGD--DSVPILMPSGRKIVEAVH-RFLGVGFDYLVEPDMGDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHF 129 (563) T ss_pred CCC--ceeeeccchHHHHHHHHH-HhcCCCcEEecCccccCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeE Confidence 111 112244567889999966 5559999996542 23567789999999999999999999999998665 Q ss_pred EEEECC----CCceEEEEEccceeEEEEeCCCCcee------------------EEEEEEeeeeeeccC--CcceEEEEE Q lcl|NC_010808. 153 LMIRNQ----DDETRLYKSDAMSTFVIYDNTIERNS------------------IAGVRYLRTKPIDKT--DEDEVFTVD 208 (512) Q Consensus 153 ~v~~d~----~g~~~i~~~~p~~~~~i~d~~~~~~~------------------~~~v~~~~~~~~~~~--~~~~~~~~~ 208 (512) ++-+|. .+++++..++|.+.||+-|++..... ++.+|.|.....+.. .....+.++ T Consensus 130 ~l~wDp~K~~g~R~rv~~vDP~~~fp~~dpd~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae 209 (563) T protein:vir:74 130 YIHADPNKKAGERISVDEVDPRQIFLIEDGSTVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELT 209 (563) T ss_pred EEeeccccccCCCceEeecCCceeeeccCCCCcccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccc Confidence 555553 24899999999999996554332111 122221111111100 000111112 Q ss_pred EEcCCcEEE-------EEecCCcc-ccccccccccccccccccceEeecC-----CCCCCcchHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 209 LFTSHGVYR-------YLTSRTNG-LKLTPRENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDT 275 (512) Q Consensus 209 ~yt~~~~~~-------~~~~~~~~-~~~~~~~~~~~~~~~~~vPvv~~~n-----~~~g~s~~~~v~~liDa~~~~~s~~ 275 (512) .|+.+.+-. +.....+. ..-...++...|++++.||||.|+| ..+|+|.+++++++++++|+++|+. T Consensus 210 ~w~lg~wd~r~~~~~~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~ 289 (563) T protein:vir:74 210 HWTLGNWDDRGAISDEQARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDE 289 (563) T ss_pred hhccccccccCccchhhhcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHH Confidence 222111000 00000000 0001223455689999999999865 3479999999999999999999999 Q ss_pred HHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCC-CCcceeEEeecCCHHHHHHHHHHHHH-H Q lcl|NC_010808. 276 ANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETE-GSVDGGYIYKQYDVQGTEAYKDRLNS-D 353 (512) Q Consensus 276 ~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~~~~~l~~-~ 353 (512) +.++.+...|+.++.|....+....+.- .+.+.+ +.....+.. .++-+..+.--.+.+.++.|++.|.. . T Consensus 290 s~i~~~tG~pi~vl~~~~p~d~~~g~~~----~w~vgp----G~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~era 361 (563) T protein:vir:74 290 DATIVFQGLGMYVTNASAPVDPNTGELT----DWNIGP----MQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKG 361 (563) T ss_pred HHHHHhcCCCeEEecccccccccccccc----ccccCC----ceeEeccCCccccceeeecchhhhHHHHHHHHHHHHHH Confidence 9999999999999987653332111100 011111 111111111 22334444433456888888888777 7 Q ss_pred HHHHhcccccccc--cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHhc---cCC-----Ccc Q lcl|NC_010808. 354 IHMFTNTPNMKDD--NFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRR----RAKLLETILKNT---RSI-----DAN 419 (512) Q Consensus 354 i~~~s~~p~~~~~--~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~----~~~li~~~l~~~---~~~-----~~~ 419 (512) |+.++++|...++ ..+..+||.||+..+.+|.+++.++++.+..++++ .+++++...... +.. ..+ T Consensus 362 l~~~s~tPavA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~ 441 (563) T protein:vir:74 362 IAEGSGTPEVAIGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASAD 441 (563) T ss_pred HHhhccCcceeecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccc Confidence 8999999999998 45567899999999999999999999988888887 344444332221 110 011 Q ss_pred cc-cceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC---CCC-CCHHHHHHHHHHHHHHHHHHHHhhcccCC Q lcl|NC_010808. 420 KD-FNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLF---SFF-QDPELEVKKIEEDEKESIKKAQKGIYKDP 492 (512) Q Consensus 420 ~d-~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~---~~v-~d~~~E~~ri~~E~~~~~~~~~~~~~~~~ 492 (512) .. ...++|+|.+.+|.|..+.++.++.+ +|++|+|||+++| +|. +|++.|+++|+.++=..+..++... .++ T Consensus 442 ~~~~~~v~ivf~p~~P~d~~~vv~~~~tl~~aGiiSretAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~a-d~~ 520 (563) T protein:vir:74 442 LLNECSVVCIFADPMPVNKTQVTQDTLLLQQAHLILRKMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEA-DAS 520 (563) T ss_pred cCCceEEEEEeCCCCCccHHHHHHHHHHHHHcCchhHHHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhc-cCc Confidence 11 23478999999999999999998876 7999999998887 654 4778888887776544322221111 111 Q ss_pred CCCC-CCCCCCCCcCcccCCC Q lcl|NC_010808. 493 RDIN-DDEQDDDTKDTVDKKE 512 (512) Q Consensus 493 ~~~~-~~~~~~~~~~~~~~~e 512 (512) .+.. ..++.-++++.-|+|- T Consensus 521 ~~~~a~~~~g~~~~~~dd~g~ 541 (563) T protein:vir:74 521 LGLSAMDNGGAGEQQFDDQGN 541 (563) T ss_pred ccceecccCCCCcccccccCC Confidence 1100 0111111112222233 No 74 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=100.00 E-value=2.6e-30 Score=182.76 Aligned_cols=449 Identities=13% Similarity=0.073 Sum_probs=275.3 Q ss_pred hhhc-HHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-----ccccccccc----cccce-e-eecchHHHHHHHHHhh Q lcl|NC_010808. 38 LLQN-INEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNL-----VELTRRKEE----YMADN-R-VAHDYASYISDFINGY 105 (512) Q Consensus 38 ~~~~-~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~-----~~~~~~~~~----~~~~~-r-i~~n~~~~iv~~~a~~ 105 (512) +.+. ++. ++..........++++.+.+-|.|....- +.+. .+.+ ++... | +-.|+++.+++.++++ T Consensus 1 m~~~~~~~-v~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk-~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~ 78 (513) T protein:vir:97 1 MADKDPKS-PATTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPR-HQEETDKGYQERLASAVLLNMVEQTLDTLSGK 78 (513) T ss_pred CCCCCCCC-CCcCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCC-CCCCCHHHHHHHHhcccCCChHHHHHHHHhhh Confidence 1110 111 11112223466778888888888863321 1111 1112 21111 2 2369999999999999 Q ss_pred hhccCceecCCc-hhHHH-HHHHH-HhccChhHHHHHHHHHHHhCCeEEEEEEECCCC------------------ceEE Q lcl|NC_010808. 106 FLGNPIQCQDDD-KDVLE-AIEAF-NDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD------------------ETRL 164 (512) Q Consensus 106 l~g~~~~~~~~d-~~~~~-~l~~~-~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g------------------~~~i 164 (512) +|-++|+++.+. ....+ .+.++ .+.++++.+.+.+++.++.+|+|+++|-....+ +|.+ T Consensus 79 vf~k~p~~~~~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~rPy~ 158 (513) T protein:vir:97 79 PFSEPIKLNEDVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDRREGLRPYW 158 (513) T ss_pred hhhcCcccCcCchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHHhhccCceE Confidence 999999886432 22222 23444 345799999999999999999999999664321 4889 Q ss_pred EEEccceeEEEEeCCCCc-eeEEEEEEee-eeeeccCCcceEEEEEEEcCCcEEEEEecCCccc-ccccccccccccccc Q lcl|NC_010808. 165 YKSDAMSTFVIYDNTIER-NSIAGVRYLR-TKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGL-KLTPRENGFESHSFE 241 (512) Q Consensus 165 ~~~~p~~~~~i~d~~~~~-~~~~~v~~~~-~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 241 (512) ..+.|.+++-.-...+.. ..+.-+++-. ....|+...+.+..+.+++++.+..|+....+.. ...+.......|+++ T Consensus 159 ~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~~~~~e~~~~~~g~~~l~ 238 (513) T protein:vir:97 159 VMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSNAQKEEWALADEWATGLN 238 (513) T ss_pred EEecHhhhcCcceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCCccccceEEecCCCCcCC Confidence 999999987664333332 2233333322 2234455566666777889887766655433322 222344455678999 Q ss_pred ccceEeecCCC----CCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhh Q lcl|NC_010808. 242 RMPITEFSNNE----RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYE 317 (512) Q Consensus 242 ~vPvv~~~n~~----~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (512) .||||.+.... .+.+.|.++..+.-++.+..|++..++...++|++++.|....+.+. +. . T Consensus 239 ~IP~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~~~~~~~~---------i~------i 303 (513) T protein:vir:97 239 YVPLVTFYADRQGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACSGASGEDSDP---------VV------V 303 (513) T ss_pred ceeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeeecCCcCCCCc---------eE------e Confidence 99999986432 25677899999999999999999999999999999999864332211 11 1 Q ss_pred hcccccCCC-CCcceeEEeecCC-HHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 318 NRDTGIETE-GSVDGGYIYKQYD-VQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLF 395 (512) Q Consensus 318 ~~~~~~~~~-~~~~~~~l~~~~~-~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~ 395 (512) +....+..+ .+++++|++++.+ .+.....++.+.+.|...+..+- ...+++.||++.+.......+........+ T Consensus 304 G~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~qm~~~Ga~ll---~~~~~~~Ta~a~~~~~~~~~S~L~~~a~~l 380 (513) T protein:vir:97 304 GPNKVLYNPDPAGRFYYVEHTGQAIAAGRTDLKDLEEQMAGYGAEFL---KRKTGGQTATARALDSAEATSDLSAMTGLF 380 (513) T ss_pred eccccccCCCCCCcceeeccCchhHHHHHHHHHHHHHHHHHHHHHhh---ccCCccccHHHHHHHHHHHHHHHHHHHHHH Confidence 111112233 4788999998754 46678889999999988775442 223567999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC---CCCC---CHH Q lcl|NC_010808. 396 TKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLF---SFFQ---DPE 467 (512) Q Consensus 396 ~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~---~~v~---d~~ 467 (512) +.++++++++++.+++.... .. ...+.-.|..... ..+.++++.++ +|.+|++|.++.+ +.+. |++ T Consensus 381 e~al~~~l~~~a~wlg~~~~-~~---~v~in~dF~~~~~--~~~~~~al~~a~~~G~is~~t~~~~L~r~gvl~~d~d~~ 454 (513) T protein:vir:97 381 EDALAQALDITADWLRLGPN-GG---TVELVKDYDLEEM--DAPGLQALQVAREKRDISRKTYLNGLRLRGVLPEDFDED 454 (513) T ss_pred HHHHHHHHHHHHHHhCCCCC-cc---EEEeccccCcccC--CHHHHHHHHHHHhCCCCCHHHHHHHHHhccCCCccCCHH Confidence 99999999999999864211 00 1122223422211 23456666665 7899999987766 3332 345 Q ss_pred HHHHHHHHHHHHHHHHHHhh--cccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 468 LEVKKIEEDEKESIKKAQKG--IYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 468 ~E~~ri~~E~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ++.++++++-++..-..... ...+..+.......+..++..+++| T Consensus 455 ~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (513) T protein:vir:97 455 EDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEGEGEGEGEGGEGGE 501 (513) T ss_pred HHHHHHHHhhhhccCCCCccccccCCCCCCCCCCCCCCCCCCCCCCC Confidence 55666655543332111000 0111111111222222333334433 No 75 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=99.96 E-value=1.1e-29 Score=179.34 Aligned_cols=424 Identities=10% Similarity=-0.003 Sum_probs=260.2 Q ss_pred hcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc----cccccccccc--ee----eecchHHHHHHHHHhhhhcc Q lcl|NC_010808. 40 QNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVEL----TRRKEEYMAD--NR----VAHDYASYISDFINGYFLGN 109 (512) Q Consensus 40 ~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~----~~~~~~~~~~--~r----i~~n~~~~iv~~~a~~l~g~ 109 (512) -+ +...........++++...+-|.|....-... .+.+.+.... .| +-.|+++.+++.+++++|-+ T Consensus 1 m~----V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~vf~k 76 (452) T protein:vir:94 1 MP----IETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMVLDQ 76 (452) T ss_pred CC----CCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchhhcC Confidence 00 11112223566778888888888864421111 1111222221 12 23699999999999999999 Q ss_pred CceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCC-ceEEEEEccceeEEEEeCCCCceeEEEE Q lcl|NC_010808. 110 PIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD-ETRLYKSDAMSTFVIYDNTIERNSIAGV 188 (512) Q Consensus 110 ~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g-~~~i~~~~p~~~~~i~d~~~~~~~~~~v 188 (512) +++++..+. . ..+..=.+.++++.+...+.+.++.+|+|+++|..+..| +|.+..++|.+++-.--+.+.+-....+ T Consensus 77 ~p~~~~p~~-l-~~~~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii~W~~~~~g~l~~v~l 154 (452) T protein:vir:94 77 PPVITHPDA-M-SKYFEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYTTENILNWEEDEDGRLLMVVL 154 (452) T ss_pred CceecccHH-H-HHHHhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEechhhhcCccccccCCeeEEEE Confidence 999876532 2 223222557899999999999999999999999877655 7999999999987654333333222233 Q ss_pred EEeeeee--eccCCcceEEEEEEEc--CCcEE--EEEecCCcccc-ccccccccccccccccceEeecCCC----CCCcc Q lcl|NC_010808. 189 RYLRTKP--IDKTDEDEVFTVDLFT--SHGVY--RYLTSRTNGLK-LTPRENGFESHSFERMPITEFSNNE----RRKGD 257 (512) Q Consensus 189 ~~~~~~~--~~~~~~~~~~~~~~yt--~~~~~--~~~~~~~~~~~-~~~~~~~~~~~~~~~vPvv~~~n~~----~g~s~ 257 (512) |...... .+....+....+.+++ ++.+. +|.....+.+. ..........|+++.||+|.+.... .+.+. T Consensus 155 re~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~IP~v~~~~~~~~~~~~~pP 234 (452) T protein:vir:94 155 REFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTSTIQNVGVTMDYIPFFCITPSGLSMTPAKPP 234 (452) T ss_pred EEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccceeecCCCcccceeEEEEEcCCCCCCCCCccc Confidence 4332211 1122333444444444 44332 33322222221 2233445567899999999886433 25677 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCC-CCcceeEEee Q lcl|NC_010808. 258 YEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETE-GSVDGGYIYK 336 (512) Q Consensus 258 ~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~ 336 (512) |.++..+.-++.+..|++.+.+...++|++|+.|....+. +. .+....+..+ .+++++|+++ T Consensus 235 Ll~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~~~~~-----------i~------iG~~~~~~lpe~~~~~~yie~ 297 (452) T protein:vir:94 235 MIDIVDINYSHYRTSADLEHGRHFTGLPTPWITGAESQST-----------MH------IGSTKAWVIPEVAAKVGFLEF 297 (452) T ss_pred hHHHHHHHHHHhcchhHHHHHHHHcccceeEeecCcCCCc-----------eE------ecccccccCCCCCCcceEEcc Confidence 8999999999999999999999999999999998642221 11 1111222334 4778999998 Q ss_pred cCC-HHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_010808. 337 QYD-VQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRS 415 (512) Q Consensus 337 ~~~-~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~ 415 (512) +.+ .+..+..++.+.+.+...+.- +......++.|++|.........+........++.++++++++++.+++... T Consensus 298 ~g~~i~~~~~~l~~le~~m~~~Ga~--ll~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l~~~a~w~g~~~- 374 (452) T protein:vir:94 298 TGQGLQSLEKALSEKQAQLASLSAR--LIDNSTRGSEATETVKLRYMSETASLKSVTRAVEALLNKAYSCIMDMESMGG- 374 (452) T ss_pred CchhHHHHHHHHHHHHHHHHHHHHH--hhccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCC- Confidence 754 467788888998888776642 2222334677888877666666666677777788889999999999876321 Q ss_pred CCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC--CCCCCHHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_010808. 416 IDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLF--SFFQDPELEVKKIEEDEKESIKKAQKGIYKD 491 (512) Q Consensus 416 ~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~--~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~ 491 (512) + ..+++.-.-....-..+.++++.++ +|.+|++|++..+ ..+-|+++|.+++..|.+.. . T Consensus 375 -----~-~~v~~n~dF~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~~e~~~i~~E~~~~----------~ 438 (452) T protein:vir:94 375 -----T-LNIKLNSAFLDSKLTAAELKAWVEAYLSGGISKEIYIHALKVGKVLPPPGESMGVIPDPPAP----------E 438 (452) T ss_pred -----c-eEEEeccccccccCCHHHHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCccCHHHHHHHhhcc----------C Confidence 1 1222221112222234566666664 7899999998877 34567778888888774431 1 Q ss_pred CCCCCCCCCCCCCcCcc Q lcl|NC_010808. 492 PRDINDDEQDDDTKDTV 508 (512) Q Consensus 492 ~~~~~~~~~~~~~~~~~ 508 (512) +.+.+... +.+++. T Consensus 439 ~~~~~~~~---~~~~~~ 452 (452) T protein:vir:94 439 PSPSNTPP---NPSSKA 452 (452) T ss_pred cccCCCCC---CCccCC Confidence 11111110 111111 No 76 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=99.94 E-value=3e-26 Score=160.48 Aligned_cols=438 Identities=12% Similarity=0.058 Sum_probs=260.0 Q ss_pred cCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-----cccccc----cccc----cccc Q lcl|NC_010808. 21 FNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKN-----LVELTR----RKEE----YMAD 87 (512) Q Consensus 21 f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~-----~~~~~~----~~~~----~~~~ 87 (512) ++ |+ +..........++++...+-+.|.... .+.+.. .+.+ ++.. T Consensus 1 m~---~V-------------------~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~r 58 (501) T protein:vir:95 1 MP---NV-------------------SFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAY 58 (501) T ss_pred CC---CC-------------------CCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHH Confidence 11 01 111122356677888888888886432 111111 1111 1111 Q ss_pred e-e-eecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHH-HhccChhHHHHHHHHHHHhCCeEEEEEEECCCC---- Q lcl|NC_010808. 88 N-R-VAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAF-NDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD---- 160 (512) Q Consensus 88 ~-r-i~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~-~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g---- 160 (512) . | +-.|+++.+++.+++++|-++|+++.+ +.....+.++ .+.++++.+...+++.++.+|+|+++|-....+ T Consensus 59 l~rA~~~n~~~~t~~~l~G~vf~k~p~~~~p-~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~ 137 (501) T protein:vir:95 59 LKRAVFYNVARRTLFGLVGQVFMRDPVVKVP-ALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGG 137 (501) T ss_pred hhccccCchHHHHHHHHhhhhhcCCcceeCc-HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCccc Confidence 1 2 236999999999999999999998643 3334444444 345799999999999999999999999764321 Q ss_pred -----------ceEEEEEccceeEEEEeCCCCc-eeEEEEEEeeee--eeccCCcceEEEEEEEcCC--cEE---EEEec Q lcl|NC_010808. 161 -----------ETRLYKSDAMSTFVIYDNTIER-NSIAGVRYLRTK--PIDKTDEDEVFTVDLFTSH--GVY---RYLTS 221 (512) Q Consensus 161 -----------~~~i~~~~p~~~~~i~d~~~~~-~~~~~v~~~~~~--~~~~~~~~~~~~~~~yt~~--~~~---~~~~~ 221 (512) +|.+..++|.+++-.-...+.+ +.+.-+++-... ..++...+.+..+.+.+.+ ..+ .|+.. T Consensus 138 ~t~a~~~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~ 217 (501) T protein:vir:95 138 ASIADLEAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREP 217 (501) T ss_pred ccHHHHHhccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEec Confidence 3889999999986665333332 222223322221 1121233333333344332 222 23322 Q ss_pred CCcc------------ccccccccccccccccccceEeecCCC----CCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_010808. 222 RTNG------------LKLTPRENGFESHSFERMPITEFSNNE----RRKGDYEKVITLIDLYDNAESDTANYMSDLNDA 285 (512) Q Consensus 222 ~~~~------------~~~~~~~~~~~~~~~~~vPvv~~~n~~----~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~ 285 (512) .... ............|+++.||+|.+.... .+.+.|.++..+.-++-+..|++.+.+...++| T Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~~~l~~~~~P 297 (501) T protein:vir:95 218 QPTKADGSKIPKGNYQQYVVYKPTDAQGKRLTEIPFMFIGSENNDSNPDNPNFYDLASLNMAHYRNSADYEESCYIVGQP 297 (501) T ss_pred CCcccCcceecCCcccccceeeeeccCCCcCCeeeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhHHHHHHHHcccc Confidence 1111 001112223345899999999874332 235678888888888888899999999999999 Q ss_pred eeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccccccc Q lcl|NC_010808. 286 MLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKD 365 (512) Q Consensus 286 ~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~ 365 (512) ++|++|......+.... +. +..+.......+.+++++|+..+.+ .-.+..++.+.+.|...+..+ . T Consensus 298 ~l~i~G~~~~~~~~~~~----~~------i~~G~~~~~~lP~~~~~~~ie~~~~-~i~~~~l~~l~~~m~~~Ga~l---l 363 (501) T protein:vir:95 298 TPVLIGLTEEWVTNVLK----GS------VNFGSRGGIPLPVGADAKLLQASEN-TMLKEAMDTKERQMVALGAKL---V 363 (501) T ss_pred eeeeeCCcccccccCCC----Cc------eeecccccccCCCCCceeEEecChh-hHHHHHHHHHHHHHHHHHHhh---c Confidence 99999865433222111 11 1222233445667889999997643 334667888888887775432 1 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCC-CcC-HHHHHHH Q lcl|NC_010808. 366 DNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNL-PKS-LIEELKA 443 (512) Q Consensus 366 ~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~-p~d-~~~~~~~ 443 (512) ...+++.||++.+.......+........++.++.+++++++.+++.... .++|..++.. ... ..+.+++ T Consensus 364 ~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~w~g~~~~--------~~~v~i~~df~~~~~~~~~~~a 435 (501) T protein:vir:95 364 EQKEVQRTATEAELEAASEGSTLSSATKNVSAAFEWALKWAARWVGQADS--------GVKFELNTDFDIARMTPDERRS 435 (501) T ss_pred cCCccchhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC--------ceEEEEecccccccCCHHHHHH Confidence 23346789999998888888888889999999999999999999764211 1222222222 212 3445666 Q ss_pred HHHH--hccCChHHHHHhC---CCCC-CHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 444 YIDS--GGKISQTTLMSLF---SFFQ-DPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 444 ~~kl--~g~~s~et~~~~~---~~v~-d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) +.++ +|.+|.+|+++.+ +.++ +.+.|.++|+.|..+... .+.... ...+..+++++.+.| T Consensus 436 l~~~~~~G~is~~t~~~~L~~~~v~~~~~~~e~e~i~~~~~~~~~--------~~~~~~-~~~~~~gg~~~~~~~ 501 (501) T protein:vir:95 436 LVEEWQKGAITFEEMRTGLRKAGVATEDDSKAKEKIAKDTAEAMA--------LATPAN-VPGDGSGGDNVGNSE 501 (501) T ss_pred HHHHHhCCCCcHHHHHHHHHhCCCCChhHHHHHHHHHhhhcCccc--------ccccCC-CCCCCcccccccCCC Confidence 6665 7899999996665 4443 345566777666332111 011111 122233444455555 No 77 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=99.94 E-value=3.3e-25 Score=154.78 Aligned_cols=472 Identities=9% Similarity=-0.007 Sum_probs=264.4 Q ss_pred CCccee-eccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc----- Q lcl|NC_010808. 1 MLKANE-FETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNL----- 74 (512) Q Consensus 1 ~~~~~~-~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~----- 74 (512) ||+..+ --++-.-...+-.-.++-.....-.+. +...........++++...+-+.|....- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~d------------V~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~ 68 (535) T protein:vir:80 1 MARKRTTIRRDVQSKVLIPPQAPPTSGLGPSLPN------------VGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREE 68 (535) T ss_pred CCcchhhhhhhhhhhcccCCCCcCCCCCCCCCCC------------CCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccc Confidence 776543 222211111111112222333222211 11112233566778888888888863321 Q ss_pred cccccc----ccccccce-----e-eecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHH-HhccChhHHHHHHHHH Q lcl|NC_010808. 75 VELTRR----KEEYMADN-----R-VAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAF-NDLNDVESHNRSLGLD 143 (512) Q Consensus 75 ~~~~~~----~~~~~~~~-----r-i~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~-~~~n~~~~~~~~~~~~ 143 (512) +.+... .++.+..+ | +-.|+++.+++.+++++|-+++.+... +.....+.++ .+.++++.+...+++. T Consensus 69 YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~~~p-~~l~~l~~d~D~~G~~L~~f~~~~~~~ 147 (535) T protein:vir:80 69 YLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDGMMGQVFSRDPIRQLP-PALEAIVEDIDGEGVSLDQQAKKALGY 147 (535) T ss_pred cCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHHHhchhhcCCcceecc-HHHHHHHhccCCCCCCHHHHHHHHHHH Confidence 111110 01111111 2 336999999999999999999988643 3344444444 3457999999999999 Q ss_pred HHhCCeEEEEEEECCCC-------------ceEEEEEccceeEEEEeCCCCc-eeEEEEEEee--eeeeccCCcceEEEE Q lcl|NC_010808. 144 LSIYGKAYELMIRNQDD-------------ETRLYKSDAMSTFVIYDNTIER-NSIAGVRYLR--TKPIDKTDEDEVFTV 207 (512) Q Consensus 144 ~~~~G~a~~~v~~d~~g-------------~~~i~~~~p~~~~~i~d~~~~~-~~~~~v~~~~--~~~~~~~~~~~~~~~ 207 (512) ++.+|+|+++|-....+ +|.+..++|.+++-.-.+.+.. ..+.-+++-. ....++...+.+..+ T Consensus 148 ~l~~G~~~iLVD~P~~~~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~ 227 (535) T protein:vir:80 148 TMGFGRAAIFTDYPNVGRPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQW 227 (535) T ss_pred HHhcCeEEEEEeecCCCCcccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEecCCCcccceeEEE Confidence 99999999999665543 3889999999987665443332 2222233322 222233344455455 Q ss_pred EEEcCC-----cEEEEEecCCcccccc---ccccccccccccccceEeecCC--C--CCCcchHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 208 DLFTSH-----GVYRYLTSRTNGLKLT---PRENGFESHSFERMPITEFSNN--E--RRKGDYEKVITLIDLYDNAESDT 275 (512) Q Consensus 208 ~~yt~~-----~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~vPvv~~~n~--~--~g~s~~~~v~~liDa~~~~~s~~ 275 (512) .+++++ .+.+|+....+..... ........|+++.||||.+... . .+.+.|.++..+.-++.+..|++ T Consensus 228 RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~ 307 (535) T protein:vir:80 228 RVLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTDGNGNPFKEIPFQFIGPLDNNADIDHPPLLDLCEVNIGHYRNSADY 307 (535) T ss_pred EEEEecCCceEEEEEEEeecCCccccccceeecccCCCcccCeeEEEEeecCCCCCCCCccchHHHHHHHHHHhhchhHH Confidence 555553 2333433332211111 1223446689999999988532 2 24567889999999999999999 Q ss_pred HHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHH Q lcl|NC_010808. 276 ANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIH 355 (512) Q Consensus 276 ~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~ 355 (512) .+.+...++|++|+.|......++... ...+. .+....+..+.+++++|+..+.+.-+. ..++.+.+.+. T Consensus 308 ~~il~~~~~P~l~i~G~~~~~~~~~~~---~~~i~------iG~~~~~~lP~~~~~~~~e~~~~~~a~-~~l~~~e~qM~ 377 (535) T protein:vir:80 308 EEMAFVAGQPTAFFTGLTKDWVEDVFK---DFKVH------LGSRAIIPLPQGATAGILQITPNSVPF-EAMTHKESQMI 377 (535) T ss_pred HHHHHHhcCceeeeecCchhhhhcCCC---CcceE------ecCcccccCCCCCCcceeeeccchhHH-HHHHHHHHHHH Confidence 999999999999999975433222111 11111 122233345667888888876544443 45777777776 Q ss_pred HHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCC-CC Q lcl|NC_010808. 356 MFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRN-LP 434 (512) Q Consensus 356 ~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~-~p 434 (512) ..+...- ....++.++.+.+.......+........++.++++++++++.+++.... -..+.+..++. .. T Consensus 378 ~lGa~ll---~~~~~~~Ta~~a~~~~~~~~S~L~~~a~~le~al~~aL~~~A~w~G~~~~------~~~~~i~~n~dF~~ 448 (535) T protein:vir:80 378 AMGANLL---VKSGGNRTFGEAQQEEASEQSILSACTKNVSMAFRKALRWANQFQTGIVN------DETVEYNLNTDFPA 448 (535) T ss_pred HHHHHhh---ccCcccccHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCccC------CCceEEEecccccc Confidence 6654332 22234565555555555666667777788889999999999998764211 11122222111 11 Q ss_pred cC-HHHHHHHHHHH--hccCChHHHHHhC---CCCC---CHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCc Q lcl|NC_010808. 435 KS-LIEELKAYIDS--GGKISQTTLMSLF---SFFQ---DPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTK 505 (512) Q Consensus 435 ~d-~~~~~~~~~kl--~g~~s~et~~~~~---~~v~---d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (512) .+ ..+.++++.++ +|.+|.+|++..| +.++ +.++|+.||+.|-.+... .++...+.....+.+ T Consensus 449 ~~ld~~~~~all~~~~~G~Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~--------~~g~~~d~~~~g~~~ 520 (535) T protein:vir:80 449 ARLTPNERAELILEWQQGAITFKEMRAGLRRAGVASEDDAKAETEGKATVEFIAKTA--------AAGKVGDAASGGTNK 520 (535) T ss_pred ccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhccc--------cCCCCCCCCCCCCCc Confidence 11 23456666665 7899999998776 3332 235667777776543211 111111222222222 Q ss_pred CcccCCC Q lcl|NC_010808. 506 DTVDKKE 512 (512) Q Consensus 506 ~~~~~~e 512 (512) .-.++++ T Consensus 521 ~~~~~~~ 527 (535) T protein:vir:80 521 AKLNNGN 527 (535) T ss_pred CcccCCc Confidence 2222222 No 78 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=99.93 E-value=8e-25 Score=152.70 Aligned_cols=445 Identities=9% Similarity=0.026 Sum_probs=258.2 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------c Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKN------L 74 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~------~ 74 (512) |+..| .+..|+. ..........++++..++-|.|..-. + T Consensus 1 ~~~~~----------------~~~~~V~-------------------~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl 45 (489) T protein:vir:78 1 MLTEN----------------GQGSGVK-------------------TKHREWLHYAPKWQKVRHALAGELVSYLRNVGL 45 (489) T ss_pred CccCC----------------CccCCCC-------------------ccCHHHHHHHHHHHHHHHHhcCcccccccCCCC Confidence 32222 1222221 11112245667888888989885311 0 Q ss_pred ccccccccc--cccc-ee-eecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHH-HhccChhHHHHHHHHHHHhCCe Q lcl|NC_010808. 75 VELTRRKEE--YMAD-NR-VAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAF-NDLNDVESHNRSLGLDLSIYGK 149 (512) Q Consensus 75 ~~~~~~~~~--~~~~-~r-i~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~-~~~n~~~~~~~~~~~~~~~~G~ 149 (512) .++.+.+.+ ++.. .| +-.|+++.+++.+++++|-++|.++.. +.....+.++ .+.++++.+...+++.++.+|+ T Consensus 46 ~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~~~p-~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~ 124 (489) T protein:vir:78 46 NEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGSVMRKEPEINIP-KELEYLLKNADGSGVGLIQHAQDTLMEIDSVGR 124 (489) T ss_pred CCCCCCCChHHHHHHHhccccCChHHHHHHHHhchhhcCCcceecc-HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCe Confidence 111111111 1111 12 236999999999999999999998654 3344444444 3457999999999999999999 Q ss_pred EEEEEEECCCC------------ceEEEEEccceeEEEEeCCCCc-eeEEEEEEeeee--e--eccCCcceEEEEEEEcC Q lcl|NC_010808. 150 AYELMIRNQDD------------ETRLYKSDAMSTFVIYDNTIER-NSIAGVRYLRTK--P--IDKTDEDEVFTVDLFTS 212 (512) Q Consensus 150 a~~~v~~d~~g------------~~~i~~~~p~~~~~i~d~~~~~-~~~~~v~~~~~~--~--~~~~~~~~~~~~~~yt~ 212 (512) |+++|-.+..+ +|.+..++|.+++-.-...+.+ ..+.-|++-... . .++...+.+..+.+++. T Consensus 125 ~~ilVD~P~~~~~T~ade~~~~~rPy~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~ 204 (489) T protein:vir:78 125 GGLLVDAPETGAATAAEQNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDI 204 (489) T ss_pred EEEEEeeCCCCCcCHHHHHHhcCCcEEEEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEec Confidence 99999876554 5889999999987664333222 222223322221 1 12334455666667766 Q ss_pred Cc-----EEEEEecCCcc--ccccccccccccccccccceEeecCCC----CCCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 213 HG-----VYRYLTSRTNG--LKLTPRENGFESHSFERMPITEFSNNE----RRKGDYEKVITLIDLYDNAESDTANYMSD 281 (512) Q Consensus 213 ~~-----~~~~~~~~~~~--~~~~~~~~~~~~~~~~~vPvv~~~n~~----~g~s~~~~v~~liDa~~~~~s~~~~~~~~ 281 (512) +. +..|+....+. ............|+++.||+|.+.... .+.+.|.++..+.-++-+..|++..++.. T Consensus 205 ~~~g~~~~~~~r~~~~g~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~ 284 (489) T protein:vir:78 205 DSDGNYRQRLFRFDAEGGAQEDVVEIYPDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFV 284 (489) T ss_pred CCCcceEEEEEEeecCCcccceeeEEeccCCCCccCeeeEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHH Confidence 42 22233222221 111112223456889999999885432 24567889999988999999999999999 Q ss_pred hcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHh-cc Q lcl|NC_010808. 282 LNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFT-NT 360 (512) Q Consensus 282 ~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s-~~ 360 (512) .+.|+++++|....+.+........+ ...+.......+.+++++|+..+.... ....++.+.+.+...+ .+ T Consensus 285 ~~~P~l~i~G~d~~~~~~~~~~~~~~-------i~~g~~~~~~lp~~~~~~~ie~~~~~~-~r~~l~~le~qm~~lGa~l 356 (489) T protein:vir:78 285 VGQPTLFIYPGENLTPQAFKEANPNG-------IKFGSRRGHNLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQIGAQL 356 (489) T ss_pred cccceeeeecCccCCcccccccCccc-------eeeCCcccccCCCCCCcceeccCcchH-HHHHHHHHHHHHHHHhhhh Confidence 99999999986544333322221111 112222334556788889998875433 4566777777666653 33 Q ss_pred cccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHH Q lcl|NC_010808. 361 PNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEE 440 (512) Q Consensus 361 p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~ 440 (512) . .. +++.||++.+.......+........++.++.+++++++.+++...... ..+ .+...|... +. ..+. T Consensus 357 ~----~~-~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w~G~~~~~~--~~i-~~n~dF~~~-~~-d~~~ 426 (489) T protein:vir:78 357 I----TP-TQQITAQSARIQRGADTSVMATIARNVSQAYTDALRWVAVMLGKPEDTE--VEF-RLNMDFFLE-PM-TAQD 426 (489) T ss_pred c----cC-CcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCc--eEE-EeecccCcc-cC-CHHH Confidence 3 22 3578999988888888888888899999999999999999976432111 111 123334211 11 2345 Q ss_pred HHHHHHH--hccCChHHHHHhC--CCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccC Q lcl|NC_010808. 441 LKAYIDS--GGKISQTTLMSLF--SFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDK 510 (512) Q Consensus 441 ~~~~~kl--~g~~s~et~~~~~--~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (512) ++++.++ +|.+|.+|.+..| ..+-|+. .+.++.|-+++ . .+...+++++-+++....++ T Consensus 427 ~~al~~~~~~G~is~~t~~~~L~~~gv~d~~--~e~~~~ei~~~------~---~~~~~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 427 RAAWMADINAGLLPATAYYAALRKAGVTDWT--DADIKDAVADQ------P---LPVATEVQGEIPQSAQQQEK 489 (489) T ss_pred HHHHHHHHhcCCCCHHHHHHHHHhCCCCCcc--HHHHHHHHhhc------C---CCcccCCcccCCCCcccccC Confidence 6666665 7899999988766 2333322 12222222210 0 01111122222222211112 No 79 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=99.92 E-value=1.1e-24 Score=152.01 Aligned_cols=445 Identities=10% Similarity=0.015 Sum_probs=258.3 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccc----ccc- Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTK----NLV- 75 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~----~~~- 75 (512) |+..| .+..|+. ..........++++..++-|.|..- ..+ T Consensus 1 ~~~~~----------------~~~~~V~-------------------~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl 45 (491) T protein:vir:95 1 MLTAN----------------GQGSGVK-------------------TKHREWLHYAPKWQKVRHALAGDLVGYLRNVGL 45 (491) T ss_pred CcccC----------------CccCCCC-------------------ccCHHHHHHHHHHHHHHHHhcCcchhhcccCCC Confidence 33322 1222221 1111224566788888888888421 011 Q ss_pred -cccccccc--cccc-ee-eecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHH-HhccChhHHHHHHHHHHHhCCe Q lcl|NC_010808. 76 -ELTRRKEE--YMAD-NR-VAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAF-NDLNDVESHNRSLGLDLSIYGK 149 (512) Q Consensus 76 -~~~~~~~~--~~~~-~r-i~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~-~~~n~~~~~~~~~~~~~~~~G~ 149 (512) ++.+.+.+ ++.. .| +-.|+++.+++.+++++|-++|+++.. +.....+.++ .+.++++.+...+.+.++.+|+ T Consensus 46 ~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~~~p-~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~ 124 (491) T protein:vir:95 46 NEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGSVMRKEPEINIP-KELEYLLKNADGSGVGLIQHAQDTLMEIDSVGR 124 (491) T ss_pred cCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhchhhcCCceeecc-HHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCe Confidence 11111111 1111 12 236999999999999999999998654 3344444444 3457999999999999999999 Q ss_pred EEEEEEECCCC------------ceEEEEEccceeEEEEeCCCCc-eeEEEEEEeeeee----eccCCcceEEEEEEEcC Q lcl|NC_010808. 150 AYELMIRNQDD------------ETRLYKSDAMSTFVIYDNTIER-NSIAGVRYLRTKP----IDKTDEDEVFTVDLFTS 212 (512) Q Consensus 150 a~~~v~~d~~g------------~~~i~~~~p~~~~~i~d~~~~~-~~~~~v~~~~~~~----~~~~~~~~~~~~~~yt~ 212 (512) |+++|-.+..+ +|.+..++|.+++-.-...+.+ ..+.-+++-.... .++...+.+..+.+++. T Consensus 125 ~~ilVD~P~~~~~T~Ade~~~~~rPy~~~~~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l 204 (491) T protein:vir:95 125 GGLLVDAPETAAATAAEQNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDI 204 (491) T ss_pred EEEEEecCCCcccCHHHHHHhcCCcEEEEechhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEee Confidence 99999776543 5889999999987664332222 2233333322211 12333444555555543 Q ss_pred ---C--cEEEEEecCCccc--cccccccccccccccccceEeecCC--CC--CCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 213 ---H--GVYRYLTSRTNGL--KLTPRENGFESHSFERMPITEFSNN--ER--RKGDYEKVITLIDLYDNAESDTANYMSD 281 (512) Q Consensus 213 ---~--~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~vPvv~~~n~--~~--g~s~~~~v~~liDa~~~~~s~~~~~~~~ 281 (512) + .+..|+....++. ...........|+++.||+|.+... .+ +.+.|.++..+.-++-+..|++.+.+.. T Consensus 205 ~~~g~~~~~v~r~~~~g~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~ 284 (491) T protein:vir:95 205 DTDGNYRQRLFRFDAEGGAQEEVVEIYPDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFV 284 (491) T ss_pred cCCCceEEEEEEEcCCCcceeeeeeeeecCCCcccCeeEEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHH Confidence 2 2233333222221 1112222345678999999988532 22 4567888999988999999999999999 Q ss_pred hcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHH-hcc Q lcl|NC_010808. 282 LNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMF-TNT 360 (512) Q Consensus 282 ~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~-s~~ 360 (512) .+.|+++++|......+........ .+..+.......+.+++++|+..+.+.- .+..++.+...+... +.+ T Consensus 285 ~~~P~l~~~G~d~~~~~~~~~~~~~-------~i~~g~~~~~~lP~~~~~~~ie~~~~~~-~~~~l~~~e~qm~~~Ga~l 356 (491) T protein:vir:95 285 VGQPTLFIYPGDNLTPQSFKEANPN-------GIKFGSRCGHNLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQIGAQL 356 (491) T ss_pred cccceeeeecCcccCcchhhccCcc-------eeEecCcCCcCCCCCCccceeecCcchH-HHHHHHHHHHHHHHHHHHh Confidence 9999999998654333322211111 1122222334556788999999875443 455566666666554 333 Q ss_pred cccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHH Q lcl|NC_010808. 361 PNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEE 440 (512) Q Consensus 361 p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~ 440 (512) . . .+++.||++.+.......+........++.++.+++++++.+++...... .. -.+...|... +. ..+. T Consensus 357 ~----~-~~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w~G~~~~~~--v~-i~~n~dF~~~-~~-~~~~ 426 (491) T protein:vir:95 357 I----T-PSQQITAESARIQRGADTSVMATIARNVSQAYTDALRWVAMMLGKPEDSE--VE-FQLNMDFFLQ-PM-TAQD 426 (491) T ss_pred c----c-CCcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCc--eE-EEeecccccc-cC-CHHH Confidence 2 1 23578999998888888888888999999999999999999976432111 11 1123334211 12 2345 Q ss_pred HHHHHHH--hccCChHHHHHhCC--CCCC--HHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCc Q lcl|NC_010808. 441 LKAYIDS--GGKISQTTLMSLFS--FFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDT 507 (512) Q Consensus 441 ~~~~~kl--~g~~s~et~~~~~~--~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (512) ++++.++ +|.+|++|.+..|- .+.| .++++++|++|.- ........+.+-.+..+++++ T Consensus 427 ~~all~~~~~G~is~~t~~~~L~~~~vl~~~~e~~~~~ie~~~~--------~~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 427 RAAWMADINAGLLPATAYYAALRKAGVTDWTDEDILNAIEDAPL--------PSGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred HHHHHHHHhcCCCCHHHHHHHHHhCCCCCccHHHHHHHHHhcCC--------CCCccccccccchhhhhhccC Confidence 6666665 78999999887652 3333 3444555544421 111111111111222222222 No 80 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=99.88 E-value=5.8e-22 Score=137.03 Aligned_cols=433 Identities=9% Similarity=-0.047 Sum_probs=239.4 Q ss_pred cchhhccccc---cCCC-cCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc------- Q lcl|NC_010808. 11 TDLRENRNYL---FNDE-ANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR------- 79 (512) Q Consensus 11 ~~~~~~~~~~---f~~~-~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~------- 79 (512) |---.++||+ ++.. .+..|. .. ...| +++..--. ..++. .|....++.... T Consensus 1 ~~~~~~~~~~~~~m~V~~~hp~y~------a~--~~~W--~~~~d~g~---~~~k~-----~g~~YLPk~~~~~~~~~~d 62 (488) T protein:vir:96 1 MLKCLYIKHRGFFMLTPIYHPDYL------VN--APQW--LRNLDCVM---DNIKR-----KKQTYLPNLGAIPPEAKTD 62 (488) T ss_pred CceeEEEeecceeecccccCHHHH------HH--hhhh--hHhhhhhh---HHHHH-----hhhhcCCCCCCccccccCc Confidence 3222344442 2210 000000 00 0111 01110000 01111 121111110000 Q ss_pred -ccccccc-------ce---ee-ecchHHHHHHHHHhhhhccCceecCCc-hhHHHHHHHH-HhccChhHHHHHHHHHHH Q lcl|NC_010808. 80 -RKEEYMA-------DN---RV-AHDYASYISDFINGYFLGNPIQCQDDD-KDVLEAIEAF-NDLNDVESHNRSLGLDLS 145 (512) Q Consensus 80 -~~~~~~~-------~~---ri-~~n~~~~iv~~~a~~l~g~~~~~~~~d-~~~~~~l~~~-~~~n~~~~~~~~~~~~~~ 145 (512) ....++. ++ |. -.|+.+.+++.+++++|-++|+++.++ ......+.++ .+.++++.+...+.+.++ T Consensus 63 ~~y~~~~~~~~~~y~~~~~~rA~~~n~~~~tl~~l~G~vfrk~p~~~~~~~~~l~~l~~d~D~~G~~L~~f~~~~~~~~l 142 (488) T protein:vir:96 63 PKVTALAAKIEKDWEDLTWRLANYVNIVNPTMNAITGAVMRREPEFDTMDNPVLIGLRDNIDGKGNGIDQECKQALNALQ 142 (488) T ss_pred chhhhhhccchhhhHhhhhhccccCchhHHHHHHhcchhhccCceeccCCcHHHHHHHhccCCCCCCHHHHHHHHHHHHH Confidence 0000111 11 22 259999999999999999999998764 3444555555 345899999999999999 Q ss_pred hCCeEEEEEEECCCC-----------ceEEEEEccceeEEEEeCCCCce-eEEEEEEee-eeeeccC--CcceEEEEEEE Q lcl|NC_010808. 146 IYGKAYELMIRNQDD-----------ETRLYKSDAMSTFVIYDNTIERN-SIAGVRYLR-TKPIDKT--DEDEVFTVDLF 210 (512) Q Consensus 146 ~~G~a~~~v~~d~~g-----------~~~i~~~~p~~~~~i~d~~~~~~-~~~~v~~~~-~~~~~~~--~~~~~~~~~~y 210 (512) .+|+++++|-..+++ +|.+..++|.+++-.--+.+.++ .+.-+++-. +...|+. .......+-.+ T Consensus 143 ~~G~~~ilVD~P~~~~T~ade~~~~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~~~l 222 (488) T protein:vir:96 143 WGSRCGWLVRSHPESATMADWNKGKKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLINHRL 222 (488) T ss_pred hcCeEEEEEecCCCcCCHHHHHHhcCCcEEEEechhhhcCcceeccCCceeeEEEEEEEEEEeccCCCcccceEEEEEEE Confidence 999999999876543 48899999999877644333332 222233322 2222332 22334444446 Q ss_pred cCCcEEEEEecCCccccccccccccccccccccceEeecCCC----CCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_010808. 211 TSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE----RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAM 286 (512) Q Consensus 211 t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~----~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~ 286 (512) ++..+..++.....+. ..........|+++.||||.+.... .+.+.|.++..+.-++-+..|++...+...+.|+ T Consensus 223 ~~g~~~v~~~~~~~~~-~e~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~ 301 (488) T protein:vir:96 223 VDGLCEFQEVTDDEYS-DEWTPVLINSKQSDTIPFFLASSQSNEWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAK 301 (488) T ss_pred ECcEEEEEEEecCCcc-cceEeecCCCcccCeeEEEEEecCCCCCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCce Confidence 6654333333222211 1112223356789999999885432 2466788999999999999999999998888998 Q ss_pred eeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhc-cccccc Q lcl|NC_010808. 287 LLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTN-TPNMKD 365 (512) Q Consensus 287 lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~-~p~~~~ 365 (512) +++ |..+.+.+........+ ...+..... ....++++|+..+.+.- .+..++.+.+.+...+. ++. T Consensus 302 lv~-~~~~~~~~~~~~~~~~g-------~~~~~~~~~-~~~~g~~~~~e~~~~~l-~~~~l~~l~~qm~~~Ga~l~~--- 368 (488) T protein:vir:96 302 WMV-DMGDMNKTMASEMNPLG-------FTLAGRMPY-YVKNGDVKVIQAQFSPE-TENKVEKLFEQAVKVGASLFT--- 368 (488) T ss_pred eee-ccCCCCcccccccccce-------eeecccccc-cccCCceeecCCchhHH-HHHHHHHHHHHHHHHhHhhcc--- Confidence 876 43333332221111111 111111111 12346788877654322 36668888877766543 231 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCC-CCcC-HHHHHHH Q lcl|NC_010808. 366 DNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRN-LPKS-LIEELKA 443 (512) Q Consensus 366 ~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~-~p~d-~~~~~~~ 443 (512) .+++.||++.+.......+........++.++++++++++.+++.......+ ..+++..++. .... ..+.+++ T Consensus 369 --~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~A~w~g~~~~~~~~---~~~~~~in~dF~~~~ld~~~~~a 443 (488) T protein:vir:96 369 --QQSNETATGAAIRSGSSTASMATLGNNVEDTVRNMLRFIMRYFEGTNLYVNP---DELVFKLNRDYFDVEVNPQMLQV 443 (488) T ss_pred --CCCcchHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCcCc---cceEEEeccCCCCccCCHHHHHH Confidence 2346789998888888888888889999999999999999998765432111 1122322221 1111 3446677 Q ss_pred HHHH--hccCChHHHHHhC--CCCC----CHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_010808. 444 YIDS--GGKISQTTLMSLF--SFFQ----DPELEVKKIEEDEKESIKKAQKGIYK 490 (512) Q Consensus 444 ~~kl--~g~~s~et~~~~~--~~v~----d~~~E~~ri~~E~~~~~~~~~~~~~~ 490 (512) +.++ +|.+|++|.+..+ ..+- +.++|.++|+++ .++- T Consensus 444 l~~~~~~G~Is~~t~~~~L~~~gvl~~d~~~e~~~~~ie~~----------g~~~ 488 (488) T protein:vir:96 444 AYAAMMEGNLPQVSWFELLKRARVVRGDMSKEEFDEHIAEL----------GFGM 488 (488) T ss_pred HHHHHhcCCCCHHHHHHHHHhCCcCCccCCHHHHHHHHhhc----------CCCC Confidence 7766 7899999987766 2332 234445555432 1111 No 81 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.83 E-value=2e-20 Score=128.64 Aligned_cols=492 Identities=11% Similarity=0.041 Sum_probs=229.4 Q ss_pred CCcceeeccc--cchhhccccccCCCcCeeecc---cchhHHhhhcHHHHHHHHHHHH---HHHHHHHHHHHHHhccccc Q lcl|NC_010808. 1 MLKANEFETD--TDLRENRNYLFNDEANVVYTY---DGTESDLLQNINEVSKYIEHHM---DYQRPRLKVLSDYYEGKTK 72 (512) Q Consensus 1 ~~~~~~~~~~--~~~~~~~~~~f~~~~~~~~~~---~~~~~~~~~~~~~l~~~i~~~~---~~~~~r~~~~~~yy~G~~~ 72 (512) |-..|-=+.+ ..-......+.++....---- ..+.....+....|+..+.... ..-+....+-.+||.|.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw 80 (776) T protein:vir:93 1 MFDLNDKDSTQLVPARTDEGELSPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQW 80 (776) T ss_pred CCCccccccccccccccccccCCCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCC Confidence 3322211100 000000011112211111001 1111222222233333222221 1222234466799999986 Q ss_pred ccccccccccccccceeeecchHHHHHHHHHhhhhccCceec--CC---chh----HHHHHHHHHhccChhHHHHHHHHH Q lcl|NC_010808. 73 NLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQ--DD---DKD----VLEAIEAFNDLNDVESHNRSLGLD 143 (512) Q Consensus 73 ~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~--~~---d~~----~~~~l~~~~~~n~~~~~~~~~~~~ 143 (512) ...........++ -.+.+|..+.+|+...++...+.+.+. .. |.+ ....++.+++.|+++.....+..+ T Consensus 81 ~~~~~~~l~~~g~--p~~~~N~i~~~i~~v~g~~~~nr~~~~~~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d 158 (776) T protein:vir:93 81 SQDEIDELKERGQ--APTVYNVISQSVNWIIGSEKRGRSDFKVLPRRKDGGKAAERKTALLKYLSDVNHTPFERSMAFEE 158 (776) T ss_pred CHHHHHHHHhcCC--ceEEecchHHHHHHHHHHHHhCCcceEEecCChhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHH Confidence 4333222222222 347899999999999999888766543 22 222 244567778889999999999999 Q ss_pred HHhCCeEEEEEEECCC--C-ceEEEEEccceeEEEEeCCCCc----eeEEEEE-Eee----------------------- Q lcl|NC_010808. 144 LSIYGKAYELMIRNQD--D-ETRLYKSDAMSTFVIYDNTIER----NSIAGVR-YLR----------------------- 192 (512) Q Consensus 144 ~~~~G~a~~~v~~d~~--g-~~~i~~~~p~~~~~i~d~~~~~----~~~~~v~-~~~----------------------- 192 (512) ++++|.+|+.|+++.+ + .+++.+++|.+++ ||..... ...+.++ .|. T Consensus 159 ~~~~G~G~~~v~~d~~~~~~~~~~~~~~p~~i~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~ 236 (776) T protein:vir:93 159 TTKAGIGWLESQVQDENDGEPIYAGAESWRNIL--WDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDN 236 (776) T ss_pred hhhcCcceEEEEeeccCCCCceEeeccChhhee--eccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhc Confidence 9999999999987653 3 3455677888764 3331110 0001100 000 Q ss_pred -----ee----------------------eeccCCcceEEEEEEEcCCcEEEEEecC--C-------------------- Q lcl|NC_010808. 193 -----TK----------------------PIDKTDEDEVFTVDLFTSHGVYRYLTSR--T-------------------- 223 (512) Q Consensus 193 -----~~----------------------~~~~~~~~~~~~~~~yt~~~~~~~~~~~--~-------------------- 223 (512) +. ...+...+.+..+++|....+....... + T Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~ 316 (776) T protein:vir:93 237 FETWGTDDIDGDDAMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVES 316 (776) T ss_pred ccccchhcccccccccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhc Confidence 00 0000112344456666543221111000 0 Q ss_pred ccc--------------ccccc--ccccccccccccceEeecC-----CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010808. 224 NGL--------------KLTPR--ENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDL 282 (512) Q Consensus 224 ~~~--------------~~~~~--~~~~~~~~~~~vPvv~~~n-----~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~ 282 (512) +.. ..+.. .....+.+.+.+|+|+++. ...|.|.+..+++.++.+|...|.+.+.+- T Consensus 317 g~~~~~~~~~~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l~-- 394 (776) T protein:vir:93 317 GRAVLAVSPMMRMHCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYILS-- 394 (776) T ss_pred CceeehheeeeeeEEEEEecchhhhccCCCCCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHhhc-- Confidence 000 00000 1122344557888887654 234789999999999999999999988763 Q ss_pred cCceeeeecCCcCChhhhhh--hhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010808. 283 NDAMLLIKGNLSLDPDEVKK--QKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNT 360 (512) Q Consensus 283 ~~~~lv~~g~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~ 360 (512) +.++.+..|..... +++.. .+.+.++.+.+ +..+.+.+.....-..++..++..+...|..+|++ T Consensus 395 ~~~~~~~~gav~~~-d~~~~~~~rp~~vi~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi 461 (776) T protein:vir:93 395 TNKVLMEEGAVDDI-DEFRREAARPDAVMTVKN------------GKLGAVKMDVDRDLAPAHLELASRSIQMIQQVGGV 461 (776) T ss_pred CCceeeccccccch-HHHHHhcccCCceeeeCC------------ccccccccccCcCccHHHHHHHHHHHHHHHHhhCc Confidence 56666666654332 22221 12222222211 11112333322223466777888899999999999 Q ss_pred cccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcc---------ccc--------- Q lcl|NC_010808. 361 PNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDAN---------KDF--------- 422 (512) Q Consensus 361 p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~---------~d~--------- 422 (512) .+.+.|..+++.||+|+...............+.|..+++++.++++.++......... ..+ T Consensus 462 ~~~~~G~~~n~~Sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~ 541 (776) T protein:vir:93 462 TDEMLGRTTNAVSGVAIQARQEQGSVATNKLFDNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPE 541 (776) T ss_pred ChHHhCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchh Confidence 98888887788999999988777777777777777888888777777665443211100 001 Q ss_pred -------ceeeEEeCCCCCcCHHHHHHHHHHHhccCChH-------HHHHhCCCCCCHHHHHHHHHHHHH---------- Q lcl|NC_010808. 423 -------NTVRYVYNRNLPKSLIEELKAYIDSGGKISQT-------TLMSLFSFFQDPELEVKKIEEDEK---------- 478 (512) Q Consensus 423 -------~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~e-------t~~~~~~~v~d~~~E~~ri~~E~~---------- 478 (512) .+|.+.=.+..+.-..+..+.++.+.+.+..+ .+++..++ .+.++-.+++++... T Consensus 542 nd~~~~~~dv~v~~~~~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~~e~~d~-p~~~e~~~~l~~~~~~~~p~q~~~~ 620 (776) T protein:vir:93 542 NDITRTKADFIIDEAEWRATMRQAAVAELMEVIGKMPPEIALTMLDLLVENMDI-PNRDELVKRIRAVNGQKDPDQDEPT 620 (776) T ss_pred hhhccceeeEEEeecccchhHHHHHHHHHHHHHhhcChhhHHHHHHHHHHhcCc-cchHHHHHHHHHhhcccccchhhcc Confidence 02222222222221233333344433322221 11222211 111111112211100 Q ss_pred -----------HHHHHHHhhc--ccCCCCCC-------C---CCCCCC--CcCcccCCC Q lcl|NC_010808. 479 -----------ESIKKAQKGI--YKDPRDIN-------D---DEQDDD--TKDTVDKKE 512 (512) Q Consensus 479 -----------~~~~~~~~~~--~~~~~~~~-------~---~~~~~~--~~~~~~~~e 512 (512) ......+... ........ - ..+... ..-.....+ T Consensus 621 ~e~~~~qq~q~~~~q~q~~~~~a~~~~~qa~a~~~~aea~~~~aqa~~~~~~a~~~~~~ 679 (776) T protein:vir:93 621 PEEIAREQAQQQQQQYNDALAIATLEEQQAKARKAAAEAQVAEAKAKHISRMAIREGVG 679 (776) T ss_pred hhHHHHHHHhhHHHHHHHHHhhhhhhHhhHHHHHHHHHHHHHhhhhhhhhhcchhhhhh Confidence 0000000000 00000000 0 000000 000000000 No 82 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.83 E-value=4.9e-19 Score=120.96 Aligned_cols=484 Identities=13% Similarity=0.075 Sum_probs=238.1 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHH---HHHHHHHHHHHHHhcccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHM---DYQRPRLKVLSDYYEGKTKNLVEL 77 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~---~~~~~r~~~~~~yy~G~~~~~~~~ 77 (512) ||+.- ....+=..-+-..-.....+.+.......+.+.+.... ...+....+-.+||.|.+...... T Consensus 1 ~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~ 70 (711) T protein:vir:10 1 MAKKQ----------KKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVR 70 (711) T ss_pred CCccc----------ccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHH Confidence 33221 10000000000000011111111112233333322222 222333456679999987644332 Q ss_pred cccccccccceeeecchHHHHHHHHHhhhhccCceecC---------------------------CchhH----HHHHHH Q lcl|NC_010808. 78 TRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQD---------------------------DDKDV----LEAIEA 126 (512) Q Consensus 78 ~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~---------------------------~d~~~----~~~l~~ 126 (512) ......++ -.+.+|..+.+|+..+++-..+.+.+.. +|.+. ...+.. T Consensus 71 ~~l~~~g~--p~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~ 148 (711) T protein:vir:10 71 TERELEQR--PCLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKN 148 (711) T ss_pred HHHHhcCC--CcEEEcchHHHHHHHhhhHhhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHH Confidence 22222222 2578999999999999999887776522 12222 334555 Q ss_pred HHhccChhHHHHHHHHHHHhCCeEEEEEEECC------CCceEEEEE-ccceeEEEEeCCC------CceeEEEEEEeee Q lcl|NC_010808. 127 FNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ------DDETRLYKS-DAMSTFVIYDNTI------ERNSIAGVRYLRT 193 (512) Q Consensus 127 ~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~------~g~~~i~~~-~p~~~~~i~d~~~------~~~~~~~v~~~~~ 193 (512) +.+.|+.+.....+..+++++|.+|+-|+.|. +|++++..+ +|.++ +||+.. +.+-+ +.+.|.. T Consensus 149 ~~~~~~~~~~~s~af~d~~~~G~G~~ev~~d~~~~d~~~~e~~i~~v~~p~~v--~~Dp~a~~~D~sDar~~-~~~~~~~ 225 (711) T protein:vir:10 149 IEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSV--TIDPDAKKRDRSDMNWC-LIDDTMS 225 (711) T ss_pred HHHhcChhHHHHHHHHHhhhcCcceEEEEecccCCCCCCCCeEEeeecChhhe--eeCccccccChhhhcce-eeeecCC Confidence 67789999999999999999999998886542 478888777 68885 555421 11111 1111110 Q ss_pred e----------e---e---------ccCCcceEEEEEEEcCCcEEEEEe--cCCccc----------------------- Q lcl|NC_010808. 194 K----------P---I---------DKTDEDEVFTVDLFTSHGVYRYLT--SRTNGL----------------------- 226 (512) Q Consensus 194 ~----------~---~---------~~~~~~~~~~~~~yt~~~~~~~~~--~~~~~~----------------------- 226 (512) . . . .+...+.+..+++|....+.+... ..+.+. T Consensus 226 ~~~~~~~yp~~a~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 305 (711) T protein:vir:10 226 KEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTR 305 (711) T ss_pred HHHHHHhCCchhhhhhhcccccccCcccCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhh Confidence 0 0 0 000122334455554432211110 000000 Q ss_pred -----------cccc-cccccccccccccceEeecC-------CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCcee Q lcl|NC_010808. 227 -----------KLTP-RENGFESHSFERMPITEFSN-------NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAML 287 (512) Q Consensus 227 -----------~~~~-~~~~~~~~~~~~vPvv~~~n-------~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~l 287 (512) ..+. ....+.|.+.+.+|+|+|.- ...+.|.+..+++.++.+|...|.+...+...+.+.+ T Consensus 306 ~~~~~~v~~~~~~G~~~L~~~~p~~~~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~ 385 (711) T protein:vir:10 306 KVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPF 385 (711) T ss_pred hhceeeEEEEEEecceeecCCCCCCCCcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCce Confidence 0000 01123344556788877531 2234678899999999999999999999988777555 Q ss_pred ee-ecCCcCChhhhhh--hhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccc Q lcl|NC_010808. 288 LI-KGNLSLDPDEVKK--QKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMK 364 (512) Q Consensus 288 v~-~g~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~ 364 (512) ++ .|......+.... .+.++++.+.++ ....+.++++..+.-..++...+......|-..|++.+.+ T Consensus 386 ~~~~gai~~~~~~~~e~~~~~~~vi~~~~~----------~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~ 455 (711) T protein:vir:10 386 IGSEGNVEGREDEWEQANTKNFSLLTYIPQ----------YQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDAS 455 (711) T ss_pred eecCcccCChHHHHHhccccCCCeeEeccc----------ccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHH Confidence 44 5554332222221 122233333221 1122345555555556778888999999999999998888 Q ss_pred cccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC---------ccccc------------- Q lcl|NC_010808. 365 DDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID---------ANKDF------------- 422 (512) Q Consensus 365 ~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~---------~~~d~------------- 422 (512) .|..+++.||+|+......-..........+..+.+++.++++.++....... ...++ T Consensus 456 ~G~~~n~~Sg~ai~~~q~qg~~~l~~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G 535 (711) T protein:vir:10 456 LGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESG 535 (711) T ss_pred cCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccc Confidence 88888889999999988777777777777777777777777666554322110 00011 Q ss_pred ------------ceeeEEeCCCCCcCHHHHHHHHHHHhccCCh------HHHHHhCCCCCCHHHHHHHHH---------- Q lcl|NC_010808. 423 ------------NTVRYVYNRNLPKSLIEELKAYIDSGGKISQ------TTLMSLFSFFQDPELEVKKIE---------- 474 (512) Q Consensus 423 ------------~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~------et~~~~~~~v~d~~~E~~ri~---------- 474 (512) .+|.+.=.+..+.-..+.+..++.+.+.+|. ..+++.+++ .+.++-.++++ T Consensus 536 ~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~~l~ql~~~~p~~~~~~~~~il~~~d~-p~~~el~e~lr~~~~~~~~~~ 614 (711) T protein:vir:10 536 EWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSK 614 (711) T ss_pred cceeeeccceeeeEEEEeeccCchhHHHHHHHHHHHHHhhcchhhhHHHHHHHHhcCC-CCHHHHHHHHHhhcCcccCcc Confidence 0223333344444444555555555554443 122333322 22221122221 Q ss_pred ----------HHHHHHHHH-HHhhcccCCCCCCCCCCCCCCcCccc----CCC Q lcl|NC_010808. 475 ----------EDEKESIKK-AQKGIYKDPRDINDDEQDDDTKDTVD----KKE 512 (512) Q Consensus 475 ----------~E~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~----~~e 512 (512) .|++..... ............. .+.+..+...+ ..| T Consensus 615 ~~~~~~qq~~~e~qq~~~~~q~~~~~~q~~~~q--a~ae~~~Aqae~~qa~~e 665 (711) T protein:vir:10 615 DEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQ--AEADTAQAQADMLKAQLE 665 (711) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH Confidence 111100000 0000000000000 00000000000 000 No 83 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.81 E-value=4.1e-18 Score=115.90 Aligned_cols=465 Identities=11% Similarity=0.049 Sum_probs=232.8 Q ss_pred cCCCcCeeecccchhHHhhhcHHHHHHHHHHHH---HHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHH Q lcl|NC_010808. 21 FNDEANVVYTYDGTESDLLQNINEVSKYIEHHM---DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASY 97 (512) Q Consensus 21 f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~---~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~ 97 (512) +.++.++. ........+.+.-..++..+.... ..-+....+..+||.|.+............++ ..+.+|..+. T Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~--p~~~~N~i~~ 77 (714) T protein:vir:32 1 MKNETNTM-ATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQ--PMTIHNLIAP 77 (714) T ss_pred CCcccccc-cCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCC--CcEEeccHHH Confidence 44444433 111111111111122222211111 12233445777999998875433333333332 3578999999 Q ss_pred HHHHHHhhhhccCceecC-----Cch--hH----HHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCC---CceE Q lcl|NC_010808. 98 ISDFINGYFLGNPIQCQD-----DDK--DV----LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD---DETR 163 (512) Q Consensus 98 iv~~~a~~l~g~~~~~~~-----~d~--~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~---g~~~ 163 (512) +|+..+++...+.+.+.. ++. +. ...+..+++.|+.+.....+..+++++|.+|+-++.+.| +.++ T Consensus 78 ~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~ 157 (714) T protein:vir:32 78 TVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFK 157 (714) T ss_pred HHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeE Confidence 999999999888776532 112 12 344666778899999999999999999999999988753 5688 Q ss_pred EEEEccceeEEEEeCCCC----ceeEE-EEEEeeeee----------------------------e-------------- Q lcl|NC_010808. 164 LYKSDAMSTFVIYDNTIE----RNSIA-GVRYLRTKP----------------------------I-------------- 196 (512) Q Consensus 164 i~~~~p~~~~~i~d~~~~----~~~~~-~v~~~~~~~----------------------------~-------------- 196 (512) +..++|.++ +||+... ....+ +.+.|...+ . T Consensus 158 i~~v~p~~v--~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (714) T protein:vir:32 158 VSTVSRNEV--FWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEY 235 (714) T ss_pred EEecchhhe--eeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhh Confidence 999999996 4443211 01111 111111100 0 Q ss_pred ---c-------cCCcceEEEEEEEcCCcEEEEEecCCcccc----------------------------------cccc- Q lcl|NC_010808. 197 ---D-------KTDEDEVFTVDLFTSHGVYRYLTSRTNGLK----------------------------------LTPR- 231 (512) Q Consensus 197 ---~-------~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~----------------------------------~~~~- 231 (512) + ......+..+++|..............+.. .+.. T Consensus 236 ~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~ 315 (714) T protein:vir:32 236 QSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHF 315 (714) T ss_pred ccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcc Confidence 0 000122334455543222111111100000 0000 Q ss_pred -ccccccccccccceEeecC---CC--CCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh--h Q lcl|NC_010808. 232 -ENGFESHSFERMPITEFSN---NE--RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK--Q 303 (512) Q Consensus 232 -~~~~~~~~~~~vPvv~~~n---~~--~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~--~ 303 (512) ..++.|-+.+.+|+|++.- .. ...|.+..+++.++.+|...|.+...+. ++..++..|........... . T Consensus 316 L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~~~~~~a~~~~d~~~~e~~a 393 (714) T protein:vir:32 316 IVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ--AKRVIMDEDATQLSDNDLMEQIE 393 (714) T ss_pred cccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhhc--CCceeeecCcccccHHHHHHhcc Confidence 0122233334566665432 11 1247788899999999999999988763 55555555554333222211 1 Q ss_pred hhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHH Q lcl|NC_010808. 304 KEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFG 383 (512) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~ 383 (512) +.++++.+.+.. ..+...+..++......-..++-..+......|..+|++-+.+.|..+++.||+|+...... T Consensus 394 rp~~vi~~~p~~------~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~q 467 (714) T protein:vir:32 394 RPDGIIKLNPVR------KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQ 467 (714) T ss_pred CCCCceeecccc------cccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHH Confidence 122233322211 01111122233333233456666778888889999999888888887888999999887776 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc---------cc-------------------cc----ceeeEEeCC Q lcl|NC_010808. 384 LEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA---------NK-------------------DF----NTVRYVYNR 431 (512) Q Consensus 384 l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~---------~~-------------------d~----~~i~i~f~~ 431 (512) -..........+..+.+++.++++.+......... .. |. .+|.+.=.+ T Consensus 468 g~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p 547 (714) T protein:vir:32 468 GATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQ 547 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeecc Confidence 66666666666677777766666554432111000 00 00 123333344 Q ss_pred CCCcCHHHHHHHHHHHhccCCh-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCC Q lcl|NC_010808. 432 NLPKSLIEELKAYIDSGGKISQ-------TTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDT 504 (512) Q Consensus 432 ~~p~d~~~~~~~~~kl~g~~s~-------et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (512) ..|.-..+.+..++.+.+.++. ..+++.+.+ .+.++-.++|++... ..++ .+.. ..++. T Consensus 548 ~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~~----------~~~~--~~~~-~~e~q 613 (714) T protein:vir:32 548 QTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAALG----------TPKS--PDEM-TPEEQ 613 (714) T ss_pred CchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHHcC----------CCCC--cccc-chhhH Confidence 4444455666666666544443 344555544 444444555544210 0000 0000 00000 Q ss_pred cC---------------------cccCCC Q lcl|NC_010808. 505 KD---------------------TVDKKE 512 (512) Q Consensus 505 ~~---------------------~~~~~e 512 (512) .. ...+.+ T Consensus 614 ~~~~~~q~~~~~q~~lq~~~~~a~~~k~e 642 (714) T protein:vir:32 614 EVAAQQQALQQQQAELQMREMAGRVAKLE 642 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 000000 No 84 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.81 E-value=4.1e-18 Score=115.90 Aligned_cols=465 Identities=11% Similarity=0.049 Sum_probs=232.8 Q ss_pred cCCCcCeeecccchhHHhhhcHHHHHHHHHHHH---HHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHH Q lcl|NC_010808. 21 FNDEANVVYTYDGTESDLLQNINEVSKYIEHHM---DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASY 97 (512) Q Consensus 21 f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~---~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~ 97 (512) +.++.++. ........+.+.-..++..+.... ..-+....+..+||.|.+............++ ..+.+|..+. T Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~--p~~~~N~i~~ 77 (714) T protein:vir:27 1 MKNETNTM-ATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQ--PMTIHNLIAP 77 (714) T ss_pred CCcccccc-cCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCC--CcEEeccHHH Confidence 44444433 111111111111122222211111 12233445777999998875433333333332 3578999999 Q ss_pred HHHHHHhhhhccCceecC-----Cch--hH----HHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCC---CceE Q lcl|NC_010808. 98 ISDFINGYFLGNPIQCQD-----DDK--DV----LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD---DETR 163 (512) Q Consensus 98 iv~~~a~~l~g~~~~~~~-----~d~--~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~---g~~~ 163 (512) +|+..+++...+.+.+.. ++. +. ...+..+++.|+.+.....+..+++++|.+|+-++.+.| +.++ T Consensus 78 ~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~ 157 (714) T protein:vir:27 78 TVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFK 157 (714) T ss_pred HHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeE Confidence 999999999888776532 112 12 344666778899999999999999999999999988753 5688 Q ss_pred EEEEccceeEEEEeCCCC----ceeEE-EEEEeeeee----------------------------e-------------- Q lcl|NC_010808. 164 LYKSDAMSTFVIYDNTIE----RNSIA-GVRYLRTKP----------------------------I-------------- 196 (512) Q Consensus 164 i~~~~p~~~~~i~d~~~~----~~~~~-~v~~~~~~~----------------------------~-------------- 196 (512) +..++|.++ +||+... ....+ +.+.|...+ . T Consensus 158 i~~v~p~~v--~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (714) T protein:vir:27 158 VSTVSRNEV--FWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEY 235 (714) T ss_pred EEecchhhe--eeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhh Confidence 999999996 4443211 01111 111111100 0 Q ss_pred ---c-------cCCcceEEEEEEEcCCcEEEEEecCCcccc----------------------------------cccc- Q lcl|NC_010808. 197 ---D-------KTDEDEVFTVDLFTSHGVYRYLTSRTNGLK----------------------------------LTPR- 231 (512) Q Consensus 197 ---~-------~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~----------------------------------~~~~- 231 (512) + ......+..+++|..............+.. .+.. T Consensus 236 ~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~ 315 (714) T protein:vir:27 236 QSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHF 315 (714) T ss_pred ccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcc Confidence 0 000122334455543222111111100000 0000 Q ss_pred -ccccccccccccceEeecC---CC--CCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh--h Q lcl|NC_010808. 232 -ENGFESHSFERMPITEFSN---NE--RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK--Q 303 (512) Q Consensus 232 -~~~~~~~~~~~vPvv~~~n---~~--~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~--~ 303 (512) ..++.|-+.+.+|+|++.- .. ...|.+..+++.++.+|...|.+...+. ++..++..|........... . T Consensus 316 L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~~~~~~a~~~~d~~~~e~~a 393 (714) T protein:vir:27 316 IVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ--AKRVIMDEDATQLSDNDLMEQIE 393 (714) T ss_pred cccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhhc--CCceeeecCcccccHHHHHHhcc Confidence 0122233334566665432 11 1247788899999999999999988763 55555555554333222211 1 Q ss_pred hhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHH Q lcl|NC_010808. 304 KEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFG 383 (512) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~ 383 (512) +.++++.+.+.. ..+...+..++......-..++-..+......|..+|++-+.+.|..+++.||+|+...... T Consensus 394 rp~~vi~~~p~~------~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~q 467 (714) T protein:vir:27 394 RPDGIIKLNPVR------KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQ 467 (714) T ss_pred CCCCceeecccc------cccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHH Confidence 122233322211 01111122233333233456666778888889999999888888887888999999887776 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc---------cc-------------------cc----ceeeEEeCC Q lcl|NC_010808. 384 LEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA---------NK-------------------DF----NTVRYVYNR 431 (512) Q Consensus 384 l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~---------~~-------------------d~----~~i~i~f~~ 431 (512) -..........+..+.+++.++++.+......... .. |. .+|.+.=.+ T Consensus 468 g~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p 547 (714) T protein:vir:27 468 GATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQ 547 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeecc Confidence 66666666666677777766666554432111000 00 00 123333344 Q ss_pred CCCcCHHHHHHHHHHHhccCCh-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCC Q lcl|NC_010808. 432 NLPKSLIEELKAYIDSGGKISQ-------TTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDT 504 (512) Q Consensus 432 ~~p~d~~~~~~~~~kl~g~~s~-------et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (512) ..|.-..+.+..++.+.+.++. ..+++.+.+ .+.++-.++|++... ..++ .+.. ..++. T Consensus 548 ~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~~----------~~~~--~~~~-~~e~q 613 (714) T protein:vir:27 548 QTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAALG----------TPKS--PDEM-TPEEQ 613 (714) T ss_pred CchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHHcC----------CCCC--cccc-chhhH Confidence 4444455666666666544443 344555544 444444555544210 0000 0000 00000 Q ss_pred cC---------------------cccCCC Q lcl|NC_010808. 505 KD---------------------TVDKKE 512 (512) Q Consensus 505 ~~---------------------~~~~~e 512 (512) .. ...+.+ T Consensus 614 ~~~~~~q~~~~~q~~lq~~~~~a~~~k~e 642 (714) T protein:vir:27 614 EVAAQQQALQQQQAELQMREMAGRVAKLE 642 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 000000 No 85 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.81 E-value=4.1e-18 Score=115.90 Aligned_cols=465 Identities=11% Similarity=0.049 Sum_probs=232.8 Q ss_pred cCCCcCeeecccchhHHhhhcHHHHHHHHHHHH---HHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHH Q lcl|NC_010808. 21 FNDEANVVYTYDGTESDLLQNINEVSKYIEHHM---DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASY 97 (512) Q Consensus 21 f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~---~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~ 97 (512) +.++.++. ........+.+.-..++..+.... ..-+....+..+||.|.+............++ ..+.+|..+. T Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~--p~~~~N~i~~ 77 (714) T protein:vir:99 1 MKNETNTM-ATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQ--PMTIHNLIAP 77 (714) T ss_pred CCcccccc-cCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCC--CcEEeccHHH Confidence 44444433 111111111111122222211111 12233445777999998875433333333332 3578999999 Q ss_pred HHHHHHhhhhccCceecC-----Cch--hH----HHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCC---CceE Q lcl|NC_010808. 98 ISDFINGYFLGNPIQCQD-----DDK--DV----LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD---DETR 163 (512) Q Consensus 98 iv~~~a~~l~g~~~~~~~-----~d~--~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~---g~~~ 163 (512) +|+..+++...+.+.+.. ++. +. ...+..+++.|+.+.....+..+++++|.+|+-++.+.| +.++ T Consensus 78 ~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~ 157 (714) T protein:vir:99 78 TVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFK 157 (714) T ss_pred HHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeE Confidence 999999999888776532 112 12 344666778899999999999999999999999988753 5688 Q ss_pred EEEEccceeEEEEeCCCC----ceeEE-EEEEeeeee----------------------------e-------------- Q lcl|NC_010808. 164 LYKSDAMSTFVIYDNTIE----RNSIA-GVRYLRTKP----------------------------I-------------- 196 (512) Q Consensus 164 i~~~~p~~~~~i~d~~~~----~~~~~-~v~~~~~~~----------------------------~-------------- 196 (512) +..++|.++ +||+... ....+ +.+.|...+ . T Consensus 158 i~~v~p~~v--~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (714) T protein:vir:99 158 VSTVSRNEV--FWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEY 235 (714) T ss_pred EEecchhhe--eeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhh Confidence 999999996 4443211 01111 111111100 0 Q ss_pred ---c-------cCCcceEEEEEEEcCCcEEEEEecCCcccc----------------------------------cccc- Q lcl|NC_010808. 197 ---D-------KTDEDEVFTVDLFTSHGVYRYLTSRTNGLK----------------------------------LTPR- 231 (512) Q Consensus 197 ---~-------~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~----------------------------------~~~~- 231 (512) + ......+..+++|..............+.. .+.. T Consensus 236 ~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~ 315 (714) T protein:vir:99 236 QSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHF 315 (714) T ss_pred ccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcc Confidence 0 000122334455543222111111100000 0000 Q ss_pred -ccccccccccccceEeecC---CC--CCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh--h Q lcl|NC_010808. 232 -ENGFESHSFERMPITEFSN---NE--RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK--Q 303 (512) Q Consensus 232 -~~~~~~~~~~~vPvv~~~n---~~--~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~--~ 303 (512) ..++.|-+.+.+|+|++.- .. ...|.+..+++.++.+|...|.+...+. ++..++..|........... . T Consensus 316 L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~~~~~~a~~~~d~~~~e~~a 393 (714) T protein:vir:99 316 IVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ--AKRVIMDEDATQLSDNDLMEQIE 393 (714) T ss_pred cccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhhc--CCceeeecCcccccHHHHHHhcc Confidence 0122233334566665432 11 1247788899999999999999988763 55555555554333222211 1 Q ss_pred hhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHH Q lcl|NC_010808. 304 KEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFG 383 (512) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~ 383 (512) +.++++.+.+.. ..+...+..++......-..++-..+......|..+|++-+.+.|..+++.||+|+...... T Consensus 394 rp~~vi~~~p~~------~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~q 467 (714) T protein:vir:99 394 RPDGIIKLNPVR------KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQ 467 (714) T ss_pred CCCCceeecccc------cccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHH Confidence 122233322211 01111122233333233456666778888889999999888888887888999999887776 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc---------cc-------------------cc----ceeeEEeCC Q lcl|NC_010808. 384 LEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA---------NK-------------------DF----NTVRYVYNR 431 (512) Q Consensus 384 l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~---------~~-------------------d~----~~i~i~f~~ 431 (512) -..........+..+.+++.++++.+......... .. |. .+|.+.=.+ T Consensus 468 g~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p 547 (714) T protein:vir:99 468 GATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQ 547 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeecc Confidence 66666666666677777766666554432111000 00 00 123333344 Q ss_pred CCCcCHHHHHHHHHHHhccCCh-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCC Q lcl|NC_010808. 432 NLPKSLIEELKAYIDSGGKISQ-------TTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDT 504 (512) Q Consensus 432 ~~p~d~~~~~~~~~kl~g~~s~-------et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (512) ..|.-..+.+..++.+.+.++. ..+++.+.+ .+.++-.++|++... ..++ .+.. ..++. T Consensus 548 ~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~~----------~~~~--~~~~-~~e~q 613 (714) T protein:vir:99 548 QTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAALG----------TPKS--PDEM-TPEEQ 613 (714) T ss_pred CchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHHcC----------CCCC--cccc-chhhH Confidence 4444455666666666544443 344555544 444444555544210 0000 0000 00000 Q ss_pred cC---------------------cccCCC Q lcl|NC_010808. 505 KD---------------------TVDKKE 512 (512) Q Consensus 505 ~~---------------------~~~~~e 512 (512) .. ...+.+ T Consensus 614 ~~~~~~q~~~~~q~~lq~~~~~a~~~k~e 642 (714) T protein:vir:99 614 EVAAQQQALQQQQAELQMREMAGRVAKLE 642 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 000000 No 86 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.81 E-value=4.1e-18 Score=115.90 Aligned_cols=465 Identities=11% Similarity=0.049 Sum_probs=232.8 Q ss_pred cCCCcCeeecccchhHHhhhcHHHHHHHHHHHH---HHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHH Q lcl|NC_010808. 21 FNDEANVVYTYDGTESDLLQNINEVSKYIEHHM---DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASY 97 (512) Q Consensus 21 f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~---~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~ 97 (512) +.++.++. ........+.+.-..++..+.... ..-+....+..+||.|.+............++ ..+.+|..+. T Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~--p~~~~N~i~~ 77 (714) T protein:vir:10 1 MKNETNTM-ATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQ--PMTIHNLIAP 77 (714) T ss_pred CCcccccc-cCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCC--CcEEeccHHH Confidence 44444433 111111111111122222211111 12233445777999998875433333333332 3578999999 Q ss_pred HHHHHHhhhhccCceecC-----Cch--hH----HHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCC---CceE Q lcl|NC_010808. 98 ISDFINGYFLGNPIQCQD-----DDK--DV----LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD---DETR 163 (512) Q Consensus 98 iv~~~a~~l~g~~~~~~~-----~d~--~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~---g~~~ 163 (512) +|+..+++...+.+.+.. ++. +. ...+..+++.|+.+.....+..+++++|.+|+-++.+.| +.++ T Consensus 78 ~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~ 157 (714) T protein:vir:10 78 TVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFK 157 (714) T ss_pred HHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeE Confidence 999999999888776532 112 12 344666778899999999999999999999999988753 5688 Q ss_pred EEEEccceeEEEEeCCCC----ceeEE-EEEEeeeee----------------------------e-------------- Q lcl|NC_010808. 164 LYKSDAMSTFVIYDNTIE----RNSIA-GVRYLRTKP----------------------------I-------------- 196 (512) Q Consensus 164 i~~~~p~~~~~i~d~~~~----~~~~~-~v~~~~~~~----------------------------~-------------- 196 (512) +..++|.++ +||+... ....+ +.+.|...+ . T Consensus 158 i~~v~p~~v--~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (714) T protein:vir:10 158 VSTVSRNEV--FWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEY 235 (714) T ss_pred EEecchhhe--eeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhh Confidence 999999996 4443211 01111 111111100 0 Q ss_pred ---c-------cCCcceEEEEEEEcCCcEEEEEecCCcccc----------------------------------cccc- Q lcl|NC_010808. 197 ---D-------KTDEDEVFTVDLFTSHGVYRYLTSRTNGLK----------------------------------LTPR- 231 (512) Q Consensus 197 ---~-------~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~----------------------------------~~~~- 231 (512) + ......+..+++|..............+.. .+.. T Consensus 236 ~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~ 315 (714) T protein:vir:10 236 QSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHF 315 (714) T ss_pred ccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcc Confidence 0 000122334455543222111111100000 0000 Q ss_pred -ccccccccccccceEeecC---CC--CCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh--h Q lcl|NC_010808. 232 -ENGFESHSFERMPITEFSN---NE--RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK--Q 303 (512) Q Consensus 232 -~~~~~~~~~~~vPvv~~~n---~~--~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~--~ 303 (512) ..++.|-+.+.+|+|++.- .. ...|.+..+++.++.+|...|.+...+. ++..++..|........... . T Consensus 316 L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~~~~~~a~~~~d~~~~e~~a 393 (714) T protein:vir:10 316 IVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ--AKRVIMDEDATQLSDNDLMEQIE 393 (714) T ss_pred cccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhhc--CCceeeecCcccccHHHHHHhcc Confidence 0122233334566665432 11 1247788899999999999999988763 55555555554333222211 1 Q ss_pred hhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHH Q lcl|NC_010808. 304 KEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFG 383 (512) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~ 383 (512) +.++++.+.+.. ..+...+..++......-..++-..+......|..+|++-+.+.|..+++.||+|+...... T Consensus 394 rp~~vi~~~p~~------~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~q 467 (714) T protein:vir:10 394 RPDGIIKLNPVR------KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQ 467 (714) T ss_pred CCCCceeecccc------cccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHH Confidence 122233322211 01111122233333233456666778888889999999888888887888999999887776 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc---------cc-------------------cc----ceeeEEeCC Q lcl|NC_010808. 384 LEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA---------NK-------------------DF----NTVRYVYNR 431 (512) Q Consensus 384 l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~---------~~-------------------d~----~~i~i~f~~ 431 (512) -..........+..+.+++.++++.+......... .. |. .+|.+.=.+ T Consensus 468 g~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p 547 (714) T protein:vir:10 468 GATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQ 547 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeecc Confidence 66666666666677777766666554432111000 00 00 123333344 Q ss_pred CCCcCHHHHHHHHHHHhccCCh-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCC Q lcl|NC_010808. 432 NLPKSLIEELKAYIDSGGKISQ-------TTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDT 504 (512) Q Consensus 432 ~~p~d~~~~~~~~~kl~g~~s~-------et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (512) ..|.-..+.+..++.+.+.++. ..+++.+.+ .+.++-.++|++... ..++ .+.. ..++. T Consensus 548 ~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~~----------~~~~--~~~~-~~e~q 613 (714) T protein:vir:10 548 QTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAALG----------TPKS--PDEM-TPEEQ 613 (714) T ss_pred CchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHHcC----------CCCC--cccc-chhhH Confidence 4444455666666666544443 344555544 444444555544210 0000 0000 00000 Q ss_pred cC---------------------cccCCC Q lcl|NC_010808. 505 KD---------------------TVDKKE 512 (512) Q Consensus 505 ~~---------------------~~~~~e 512 (512) .. ...+.+ T Consensus 614 ~~~~~~q~~~~~q~~lq~~~~~a~~~k~e 642 (714) T protein:vir:10 614 EVAAQQQALQQQQAELQMREMAGRVAKLE 642 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 000000 No 87 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.81 E-value=4.1e-18 Score=115.90 Aligned_cols=465 Identities=11% Similarity=0.049 Sum_probs=232.8 Q ss_pred cCCCcCeeecccchhHHhhhcHHHHHHHHHHHH---HHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHH Q lcl|NC_010808. 21 FNDEANVVYTYDGTESDLLQNINEVSKYIEHHM---DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASY 97 (512) Q Consensus 21 f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~---~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~ 97 (512) +.++.++. ........+.+.-..++..+.... ..-+....+..+||.|.+............++ ..+.+|..+. T Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~--p~~~~N~i~~ 77 (714) T protein:vir:81 1 MKNETNTM-ATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQ--PMTIHNLIAP 77 (714) T ss_pred CCcccccc-cCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCC--CcEEeccHHH Confidence 44444433 111111111111122222211111 12233445777999998875433333333332 3578999999 Q ss_pred HHHHHHhhhhccCceecC-----Cch--hH----HHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCC---CceE Q lcl|NC_010808. 98 ISDFINGYFLGNPIQCQD-----DDK--DV----LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD---DETR 163 (512) Q Consensus 98 iv~~~a~~l~g~~~~~~~-----~d~--~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~---g~~~ 163 (512) +|+..+++...+.+.+.. ++. +. ...+..+++.|+.+.....+..+++++|.+|+-++.+.| +.++ T Consensus 78 ~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~ 157 (714) T protein:vir:81 78 TVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFK 157 (714) T ss_pred HHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeE Confidence 999999999888776532 112 12 344666778899999999999999999999999988753 5688 Q ss_pred EEEEccceeEEEEeCCCC----ceeEE-EEEEeeeee----------------------------e-------------- Q lcl|NC_010808. 164 LYKSDAMSTFVIYDNTIE----RNSIA-GVRYLRTKP----------------------------I-------------- 196 (512) Q Consensus 164 i~~~~p~~~~~i~d~~~~----~~~~~-~v~~~~~~~----------------------------~-------------- 196 (512) +..++|.++ +||+... ....+ +.+.|...+ . T Consensus 158 i~~v~p~~v--~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 235 (714) T protein:vir:81 158 VSTVSRNEV--FWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEY 235 (714) T ss_pred EEecchhhe--eeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhh Confidence 999999996 4443211 01111 111111100 0 Q ss_pred ---c-------cCCcceEEEEEEEcCCcEEEEEecCCcccc----------------------------------cccc- Q lcl|NC_010808. 197 ---D-------KTDEDEVFTVDLFTSHGVYRYLTSRTNGLK----------------------------------LTPR- 231 (512) Q Consensus 197 ---~-------~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~----------------------------------~~~~- 231 (512) + ......+..+++|..............+.. .+.. T Consensus 236 ~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~ 315 (714) T protein:vir:81 236 QSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHF 315 (714) T ss_pred ccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcc Confidence 0 000122334455543222111111100000 0000 Q ss_pred -ccccccccccccceEeecC---CC--CCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh--h Q lcl|NC_010808. 232 -ENGFESHSFERMPITEFSN---NE--RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK--Q 303 (512) Q Consensus 232 -~~~~~~~~~~~vPvv~~~n---~~--~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~--~ 303 (512) ..++.|-+.+.+|+|++.- .. ...|.+..+++.++.+|...|.+...+. ++..++..|........... . T Consensus 316 L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~~~~~~a~~~~d~~~~e~~a 393 (714) T protein:vir:81 316 IVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ--AKRVIMDEDATQLSDNDLMEQIE 393 (714) T ss_pred cccCCCCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhhc--CCceeeecCcccccHHHHHHhcc Confidence 0122233334566665432 11 1247788899999999999999988763 55555555554333222211 1 Q ss_pred hhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHH Q lcl|NC_010808. 304 KEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFG 383 (512) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~ 383 (512) +.++++.+.+.. ..+...+..++......-..++-..+......|..+|++-+.+.|..+++.||+|+...... T Consensus 394 rp~~vi~~~p~~------~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~q 467 (714) T protein:vir:81 394 RPDGIIKLNPVR------KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQ 467 (714) T ss_pred CCCCceeecccc------cccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHH Confidence 122233322211 01111122233333233456666778888889999999888888887888999999887776 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc---------cc-------------------cc----ceeeEEeCC Q lcl|NC_010808. 384 LEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA---------NK-------------------DF----NTVRYVYNR 431 (512) Q Consensus 384 l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~---------~~-------------------d~----~~i~i~f~~ 431 (512) -..........+..+.+++.++++.+......... .. |. .+|.+.=.+ T Consensus 468 g~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p 547 (714) T protein:vir:81 468 GATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQ 547 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeecc Confidence 66666666666677777766666554432111000 00 00 123333344 Q ss_pred CCCcCHHHHHHHHHHHhccCCh-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCC Q lcl|NC_010808. 432 NLPKSLIEELKAYIDSGGKISQ-------TTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDT 504 (512) Q Consensus 432 ~~p~d~~~~~~~~~kl~g~~s~-------et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (512) ..|.-..+.+..++.+.+.++. ..+++.+.+ .+.++-.++|++... ..++ .+.. ..++. T Consensus 548 ~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~~----------~~~~--~~~~-~~e~q 613 (714) T protein:vir:81 548 QTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAALG----------TPKS--PDEM-TPEEQ 613 (714) T ss_pred CchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHHcC----------CCCC--cccc-chhhH Confidence 4444455666666666544443 344555544 444444555544210 0000 0000 00000 Q ss_pred cC---------------------cccCCC Q lcl|NC_010808. 505 KD---------------------TVDKKE 512 (512) Q Consensus 505 ~~---------------------~~~~~e 512 (512) .. ...+.+ T Consensus 614 ~~~~~~q~~~~~q~~lq~~~~~a~~~k~e 642 (714) T protein:vir:81 614 EVAAQQQALQQQQAELQMREMAGRVAKLE 642 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 000000 No 88 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.79 E-value=1.4e-17 Score=112.95 Aligned_cols=466 Identities=11% Similarity=0.047 Sum_probs=232.5 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhh-cHHHHHHHHHHHH--HHHHHHHHHHHHHhcccccccccccccccccccc Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQ-NINEVSKYIEHHM--DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMAD 87 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~-~~~~l~~~i~~~~--~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~ 87 (512) |..=.....+=++.. ...+ ....+..+..... ..-+....+-.+||.|.+............++ T Consensus 1 ~~~~~~~~~~~~~~~-----------~~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~-- 67 (714) T protein:vir:10 1 MKNEINTTAMKNDHG-----------STPRFSQRQLLSLCSDIDSQPLWRDAANKACAYYDGDQLAPEVIQVLKDRGQ-- 67 (714) T ss_pred CCcCcCcccCCCcch-----------hhhhhhHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCC-- Confidence 221111111211111 1111 1222333333221 12223455777999998865433333333333 Q ss_pred eeeecchHHHHHHHHHhhhhccCceecC-----Cch--h----HHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEE Q lcl|NC_010808. 88 NRVAHDYASYISDFINGYFLGNPIQCQD-----DDK--D----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR 156 (512) Q Consensus 88 ~ri~~n~~~~iv~~~a~~l~g~~~~~~~-----~d~--~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~ 156 (512) ..+.+|..+.+|+..+++...+.+.+.. +++ + ....+..+++.|+.+.....+..+++++|.+|+-++. T Consensus 68 p~~~~N~i~~~v~~v~g~~~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~ 147 (714) T protein:vir:10 68 PMTIHNLIAPTVDGVLGMEAKTRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRR 147 (714) T ss_pred CcEEeccHHHHHHHHHHHHHhCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeee Confidence 3578999999999999999888776532 111 1 2334566678899999999999999999999999988 Q ss_pred CCC---CceEEEEEccceeEEEEeCCCCc----eeEEEE-EEeee----------------------------------- Q lcl|NC_010808. 157 NQD---DETRLYKSDAMSTFVIYDNTIER----NSIAGV-RYLRT----------------------------------- 193 (512) Q Consensus 157 d~~---g~~~i~~~~p~~~~~i~d~~~~~----~~~~~v-~~~~~----------------------------------- 193 (512) +.| +.+++..++|.+++ ||+.... ...+.+ +.|.. T Consensus 148 d~d~~~~~i~i~~v~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~ 225 (714) T protein:vir:10 148 NSEPFGPEFKVSTVSRNEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQP 225 (714) T ss_pred ccCCCCCCeEEEecChhhee--eccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhc Confidence 754 67999999998864 4431100 000111 00000 Q ss_pred ---------------eee--ccCCcceEEEEEEEcCCcEEEEEecCCcccc----------------------------- Q lcl|NC_010808. 194 ---------------KPI--DKTDEDEVFTVDLFTSHGVYRYLTSRTNGLK----------------------------- 227 (512) Q Consensus 194 ---------------~~~--~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~----------------------------- 227 (512) ... .......+..+++|..............+.. T Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv 305 (714) T protein:vir:10 226 SPLMSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRI 305 (714) T ss_pred ccccccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccceeeE Confidence 000 0011223455666654332222211110000 Q ss_pred -----ccc--cccccccccccccceEeecCC---C--CCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcC Q lcl|NC_010808. 228 -----LTP--RENGFESHSFERMPITEFSNN---E--RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSL 295 (512) Q Consensus 228 -----~~~--~~~~~~~~~~~~vPvv~~~n~---~--~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~ 295 (512) .+. ...++.|-+.+.+|+|++.-. . ...|.+..+++.++.+|...|.+...+. ++..++..|.... T Consensus 306 ~~~~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~~~~~gav~~ 383 (714) T protein:vir:10 306 REAWFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ--AKRVIMDEDATQL 383 (714) T ss_pred EEEEEecchhhhcCCCCCCCCceeeEEecceeeeccCccceehhhhhhHHHHHHHHHHHHHHHHh--CCceeeccccccc Confidence 000 011223444455666665321 1 2357888999999999999999988763 4455555555433 Q ss_pred Chhhhhh--hhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccch Q lcl|NC_010808. 296 DPDEVKK--QKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS 373 (512) Q Consensus 296 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~S 373 (512) ....... .+.++++.+.+.. ..+...+..++......-..++...+......|..+|++-+.+.|..+++.| T Consensus 384 ~d~~~~e~~~rp~~vi~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~S 457 (714) T protein:vir:10 384 SDNDLMEQLERPDGIIKLNPVR------KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATS 457 (714) T ss_pred cHHHHHHhccCCCCeEEecccc------cccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcchhH Confidence 3322221 1112233322211 0011111223332223334667778888889999999988888888788899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccc------------cc------------------- Q lcl|NC_010808. 374 GEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANK------------DF------------------- 422 (512) Q Consensus 374 g~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~------------d~------------------- 422 (512) |+||......-..........+..+.+++.++++.+..........+ .+ T Consensus 458 GvAI~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~ 537 (714) T protein:vir:10 458 GVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRL 537 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccceee Confidence 99999887776666666667777777777776666553321110000 00 Q ss_pred -ceeeEEeCCCCCcCHHHHHHHHHHHhccCCh-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCC Q lcl|NC_010808. 423 -NTVRYVYNRNLPKSLIEELKAYIDSGGKISQ-------TTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRD 494 (512) Q Consensus 423 -~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~-------et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 494 (512) .+|.+.=.+..+.-..+.++.++.+.+.++. ..+++.+.+ .+.++-+++|.+.... +.+ T Consensus 538 ~~dv~i~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~-p~~~ei~~~ir~~~~~------------~~~ 604 (714) T protein:vir:10 538 NTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAALGT------------PKS 604 (714) T ss_pred eEEEEEeeccCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCC-cCHHHHHHHHHHHcCC------------CCC Confidence 0111111223333344455555555433332 334444433 3444445555433100 000 Q ss_pred CCCCCCCCCC----cCcccC-------CC Q lcl|NC_010808. 495 INDDEQDDDT----KDTVDK-------KE 512 (512) Q Consensus 495 ~~~~~~~~~~----~~~~~~-------~e 512 (512) .+.....+.. ...... +| T Consensus 605 ~~~~~~e~q~~q~~~~~~~~~q~~l~~~e 633 (714) T protein:vir:10 605 PDEMTPEEQEVAAQQQALQQQQAELQMRE 633 (714) T ss_pred ccccCcchhHHHHHHHHHHHHHHHHHHHH Confidence 0000000000 000000 00 No 89 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.79 E-value=4.3e-19 Score=121.25 Aligned_cols=481 Identities=13% Similarity=0.058 Sum_probs=228.7 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri 90 (512) |++|.+..-......... ....+.+....+..-+..+.+. +..-.+..+||.|.+............++ -.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~q~~~-r~~a~~d~~fy~G~QW~~~~~~~l~~~g~--p~~ 72 (772) T protein:vir:10 1 MQITENDRQYLNGLPPAG-----DTPLTVDEYADINYEIEDQPAW-RAVADKEMDYADGNQLDTELLRRQQALGI--PPA 72 (772) T ss_pred CCcchhhHHhhccCCccc-----ccccCHHHHHHHHHHHhccHHH-HHHHHHHHHhhcCCCCCHHHHHHHHhcCC--CcE Confidence 777766655443221111 0111111122222223333322 33445677899999875433333333332 347 Q ss_pred ecchHHHHHHHHHhhhhccCceecC--C----chhH----HHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCC- Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGNPIQCQD--D----DKDV----LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD- 159 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~~~~~~~--~----d~~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~- 159 (512) .+|..+.+|+..+++...+.+.+.. . +.+. ...+..+++.|+++.....+..+++++|.+|+-++.+.+ T Consensus 73 ~~N~i~~~v~~v~g~~~~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d~ 152 (772) T protein:vir:10 73 VEDLIGPALLSLQGYEAVTRTDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRESDP 152 (772) T ss_pred EEcchHHHHHHHHHHHHhcCcceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccccCC Confidence 8999999999999999888776532 1 1222 334566678899999999999999999999999988754 Q ss_pred --CceEEEEEccceeEEEEeCCCCceeE---EEE-EE----------eee------------------------ee---- Q lcl|NC_010808. 160 --DETRLYKSDAMSTFVIYDNTIERNSI---AGV-RY----------LRT------------------------KP---- 195 (512) Q Consensus 160 --g~~~i~~~~p~~~~~i~d~~~~~~~~---~~v-~~----------~~~------------------------~~---- 195 (512) +.+++..++|.++ +||+..+.... +.+ +. |.. .. T Consensus 153 ~~~~i~i~~v~p~~v--~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (772) T protein:vir:10 153 FKFPYRCRPIRRDEI--HWDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTST 230 (772) T ss_pred CCCCeEEEeeCcccc--eecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCccccccccccccc Confidence 3678999999885 55553321100 010 00 000 00 Q ss_pred ------------------eccCCcceEEEEEEEcCCcEEEEEecCCcc--c----------------------------- Q lcl|NC_010808. 196 ------------------IDKTDEDEVFTVDLFTSHGVYRYLTSRTNG--L----------------------------- 226 (512) Q Consensus 196 ------------------~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~--~----------------------------- 226 (512) ..+...+.+..+++|....+.........+ . T Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~ 310 (772) T protein:vir:10 231 GLHNAWNEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVSRVR 310 (772) T ss_pred ccccccchhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeeeEEE Confidence 000112334455556443221111111000 0 Q ss_pred ---cccc--cccccccccccccceEeecC-----CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCC Q lcl|NC_010808. 227 ---KLTP--RENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLD 296 (512) Q Consensus 227 ---~~~~--~~~~~~~~~~~~vPvv~~~n-----~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~ 296 (512) ..+. ...++.|.+.+.+|+|++.- .....|.+..+++.++.+|...|.+...+... ++..-.|..... T Consensus 311 ~~~~~g~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~G~vr~~kd~Qr~~N~~~S~~~~~l~~~--~~~~~~gav~~~ 388 (772) T protein:vir:10 311 RSYWLGPHCLHDGPTPYTHRHFPYVPFFGFREDATGIPYGYVRGMKYAQDSLNSGVSKLRWGMSVA--RVERTKGAVAMT 388 (772) T ss_pred EEEEecceeeccCCCCCCCCccceEEEeeeEeccCCcccchhhhhhhHHHHHHHHHHHHHHHHhcc--cccccCCCccch Confidence 0001 11123344445677776532 11234788899999999999999998887543 333334444332 Q ss_pred hhhhhh--hhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchH Q lcl|NC_010808. 297 PDEVKK--QKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSG 374 (512) Q Consensus 297 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg 374 (512) ...+.. .+.+.++.+.++. ....+..++....+.-..++..++......|..++++-+.+.|..++..|| T Consensus 389 d~~~~e~~arp~~vi~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~na~SG 460 (772) T protein:vir:10 389 DAQFRRQIARPDADIVLDENH--------MAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKGTATSG 460 (772) T ss_pred hHHHHHhccCCCCeEEeCCcc--------ccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchhhH Confidence 222211 1112222222110 111223343333333356777788888888999998888888887788899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccc--------------------------------cc Q lcl|NC_010808. 375 EAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANK--------------------------------DF 422 (512) Q Consensus 375 ~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~--------------------------------d~ 422 (512) +||......-.........-+..+.+++.++++.+..........+ |. T Consensus 461 vAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi 540 (772) T protein:vir:10 461 IQEQQQIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDL 540 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceeccc Confidence 9998877666666666666677777777666666553322110000 00 Q ss_pred c----eeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHH-------HhCCCCCCHHHHHHHHHHH--------------- Q lcl|NC_010808. 423 N----TVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLM-------SLFSFFQDPELEVKKIEED--------------- 476 (512) Q Consensus 423 ~----~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~-------~~~~~v~d~~~E~~ri~~E--------------- 476 (512) . +|.+.=.+..+.=..+.++.++.+.+.++.+... +.+. ....++-.++|++- T Consensus 541 ~~g~yDv~i~~~p~~~t~r~~~~~~m~ql~~~~~P~~~~~~~~~~le~~D-~p~~~ei~~~ir~~~~~~~peq~~~~~~q 619 (772) T protein:vir:10 541 LRTRIKVALEDVPSTNSYRGQQLNAMSEAVKSMPPQYQAAVLPFLVSLMD-VPFKRDVVEAIRAVDQQQTPEQIQQQIDQ 619 (772) T ss_pred eeeeEEEEeeccccchHHHHHHHHHHHHHHhccChhHHHHHHHHHHhhcC-CCChHHHHHHHHHHhccCChHHHHHHHHH Confidence 0 1111111111111233444444444333333211 1111 11111111222211 Q ss_pred -HHHHHHHHHhhcccCCCCCCCCCCCCCC-----cCcccCCC Q lcl|NC_010808. 477 -EKESIKKAQKGIYKDPRDINDDEQDDDT-----KDTVDKKE 512 (512) Q Consensus 477 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~e 512 (512) .+......+...............+.+. +.....-+ T Consensus 620 ~~qq~~~~~~~el~~~q~~a~~~~~~A~a~~~~aqa~~~~~~ 661 (772) T protein:vir:10 620 AVQDALAKAGNDIKLRELEIKERKADSEISGLNAKAVQIGVQ 661 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1100000000000000000000000000 00000000 No 90 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.71 E-value=2.3e-17 Score=111.82 Aligned_cols=423 Identities=12% Similarity=0.090 Sum_probs=201.6 Q ss_pred eecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccce------eeecchHHHHHHH Q lcl|NC_010808. 28 VYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADN------RVAHDYASYISDF 101 (512) Q Consensus 28 ~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~------ri~~n~~~~iv~~ 101 (512) .+..+.+. ...+..+-. .+......+=.|......-....-.....+. --.+.+++.||+. T Consensus 1 ~~~~~~a~----------~~~~~~~a~---~~~~~~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~iVd~ 67 (461) T protein:vir:80 1 MYSIDKAK----------QAKIDSKIV---NRNDFMVGHGKANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMNIVDI 67 (461) T ss_pred Cccchhhh----------hhhhhhhhh---hhhHHHhhcCCcchhhhhhccccCcccccCHHHHHHHHHhCCccchhhcc Confidence 11110000 011111000 0010010111111110000000000000010 1245788999999 Q ss_pred HHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCC Q lcl|NC_010808. 102 INGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIE 181 (512) Q Consensus 102 ~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~ 181 (512) .+..++-+++.+++++++..+.+..+|+.-++...+.++.+.+..||.+++++-..+.+. ..|....|+...... T Consensus 68 ~a~d~~r~g~~i~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~-----~~~~~~~pl~~~~~~ 142 (461) T protein:vir:80 68 ISEDMVRAGWSLKTDNKEMKKNIESKWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNR-----EQADLSTAIDPKTIK 142 (461) T ss_pred chHHhhcCCeeeecCCHHHHHHHHHHHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCc-----cccCccCCccccccc Confidence 999999999999999988889999999988899999999999999999999886532211 011111111111000 Q ss_pred ceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccc---ccccccccccccccceEeecC-----CCC Q lcl|NC_010808. 182 RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLT---PRENGFESHSFERMPITEFSN-----NER 253 (512) Q Consensus 182 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~vPvv~~~n-----~~~ 253 (512) .+....-+|............+..-..+.|. .|++........... .......-|+ -++++|.+ .-. T Consensus 143 -~~~~l~~~~~~~i~~~~~~~dp~sp~fg~P~-~y~i~~~~~~~~~~~~~~~~~~~~~iH~---SRii~~~~~~~~~~~~ 217 (461) T protein:vir:80 143 -SIPYINTFNTQKVTQLYLNQDMFSEHFGEVE-FFEVNRVSQLGEEILSGTTASTSEQIHR---SRIIHEQGLRFEGETK 217 (461) T ss_pred -ceeEEEeccccccchhhhcccCcCcccccce-EEEEeccccccccccccccCccceEEcc---ccEEEecCCCCCcccc Confidence 0000000010000000000000000111111 111111110000000 0001112233 24555543 335 Q ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeE Q lcl|NC_010808. 254 RKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGY 333 (512) Q Consensus 254 g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (512) |+|.++.+.+.+.+++++.-..+..+..+..+.+...+......+..... .. .+.. ...+..... .+.+.+++ T Consensus 218 G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~-~~-~~~~---~~~~~g~~~-~d~~e~~e- 290 (461) T protein:vir:80 218 GRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDDKANL-TA-MLDF---MFRTEALAI-IKGDEQLT- 290 (461) T ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhhchHHHHH-HH-HHHH---hcCCceEEE-EcCCcceE- Confidence 89999999999999999998888888777777766554221111111000 00 0000 000111111 22333444 Q ss_pred EeecCCHHHHHHHHHHHHHHHHHHhcccccccc--cccccchHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 334 IYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDD--NFSGTQSGEAMKYKLFGLEQRTKTKE-GLFTKGLRRRAKLLETIL 410 (512) Q Consensus 334 l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~~~~n~Sg~Ai~~~~~~l~~k~~~~~-~~~~~~l~~~~~li~~~l 410 (512) ..+.+.......++.+.+.|...+.+|-.-+. ..+++.||..=. .....+++.++ ..++..|++++++++... T Consensus 291 -~~~~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge~D~---~~yyd~i~~~qe~~l~p~le~l~~~i~~s~ 366 (461) T protein:vir:80 291 -KESTNVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQYDV---MNYYARVSSIQENRLRPQLEYLTRLLMWAS 366 (461) T ss_pred -EEecCcCCHHHHHHHHHHHHhhhhcCCeeeeecccCCccccchHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 44456778899999999999999999975432 235566776532 23445555555 467889999998887533 Q ss_pred HhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH---------hccCChHHHHHhC----CCCCC-----HHHHHHH Q lcl|NC_010808. 411 KNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS---------GGKISQTTLMSLF----SFFQD-----PELEVKK 472 (512) Q Consensus 411 ~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl---------~g~~s~et~~~~~----~~v~d-----~~~E~~r 472 (512) ...... ...+..++++.|++-.+.+..+.|+...+. +|++|.+++.+.+ +..++ ...|++. T Consensus 367 ~~~~~~-~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~ 445 (461) T protein:vir:80 367 DDCGPS-IDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGLENSSKFSGDSAEIDK 445 (461) T ss_pred cccccc-cCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCCCCccCCCCCchhhh Confidence 222221 122345788999999999999998875543 4566665554422 10000 0001111 Q ss_pred HHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 473 IEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 473 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) +.++ ..+...+++ T Consensus 446 ~~~~---------------------------~~~~~~~e~ 458 (461) T protein:vir:80 446 LAKL---------------------------VYDAYAKKN 458 (461) T ss_pred hhhh---------------------------ccccccccC Confidence 1110 000111111 No 91 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.68 E-value=4e-14 Score=94.04 Aligned_cols=435 Identities=13% Similarity=0.088 Sum_probs=239.2 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc-----------e--e---eecchHHHHHHHH Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMAD-----------N--R---VAHDYASYISDFI 102 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~-----------~--r---i~~n~~~~iv~~~ 102 (512) .+.++.+..+++.....++.+.+...+-|.|-...... .-.+....++ . | ...+|++-+|+.. T Consensus 1 mn~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~-~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~ 79 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTH-KARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKL 79 (502) T ss_pred CchHhhHHhhcChHHHHHHHhhHHHHhhccccCccccc-CCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 34456667777666656555555555667765321110 0001111010 0 1 2368999999999 Q ss_pred Hhhhhcc-CceecC----C----chhHHHHHHHHHh----------ccChhHHHHHHHHHHHhCCeEEEEEEECCCCc-- Q lcl|NC_010808. 103 NGYFLGN-PIQCQD----D----DKDVLEAIEAFND----------LNDVESHNRSLGLDLSIYGKAYELMIRNQDDE-- 161 (512) Q Consensus 103 a~~l~g~-~~~~~~----~----d~~~~~~l~~~~~----------~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~-- 161 (512) ++.++|. ++++.. . +++..+.|...|+ ..+|......+.+..+..|.+|+.+..++.+. T Consensus 80 ~~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~ 159 (502) T protein:vir:79 80 EERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLT 159 (502) T ss_pred HHhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccC Confidence 9999996 555432 2 2344555555553 23688888889999999999999987765432 Q ss_pred ------eEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccccccc Q lcl|NC_010808. 162 ------TRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGF 235 (512) Q Consensus 162 ------~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~ 235 (512) .++..++|..+-.-+++ ...+..+|.+ +..+.. ..+.++... .... T Consensus 160 ~g~~~~l~lq~iepd~l~~~~~~--~~~i~~GVe~------d~~Gr~--~aY~i~~~h--------Pgd~---------- 211 (502) T protein:vir:79 160 PSAGVHFWLEALEPDFIPMTSDE--SNRLNQGVFV------DDWGRP--EKYLVYKSR--------PVSG---------- 211 (502) T ss_pred CCcccceEEEEecchhcCCCCCC--CCeeEeeeEE------CCCCce--EEEEEeecC--------CCCC---------- Confidence 57899999886322222 3345555532 212221 112222110 0000 Q ss_pred ccccccccc---eEeecC-----CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhh--hhhh Q lcl|NC_010808. 236 ESHSFERMP---ITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVK--KQKE 305 (512) Q Consensus 236 ~~~~~~~vP---vv~~~n-----~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~--~~~~ 305 (512) ....+.+|| |+++.. ...|.|.|.+++..+..++....-........+.-..+++...+....... .... T Consensus 212 ~~~~~~rvpA~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~ 291 (502) T protein:vir:79 212 RQMETKEVDAERMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDGNGSKEN 291 (502) T ss_pred cccceeEechhheEEeecccCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccccccCCCCCc Confidence 011133455 555433 346899999999988888776655555544444444445432211111100 0001 Q ss_pred ccccccchhhhhhccccc-CCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccc-cccccccchHHHHHHHHHH Q lcl|NC_010808. 306 ANVLFLEPTVYENRDTGI-ETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMK-DDNFSGTQSGEAMKYKLFG 383 (512) Q Consensus 306 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~-~~~~~~n~Sg~Ai~~~~~~ 383 (512) .....+.+ +..+ ....|-++++.+++.+...+..+...+...|....++|-.. .+.+++ |-.+++..+.. T Consensus 292 ~~~~~l~p------G~i~~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~--nySs~R~~~~e 363 (502) T protein:vir:79 292 ERELTIQP------GIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYNG--TYSAQRQELVE 363 (502) T ss_pred cccccccC------CccccccCCCceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccc--hHHHHHHHHHH Confidence 11111111 1112 24567789999988888899999999999998888888322 233432 56667777777 Q ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHHHhccCCCcc---cccceeeEEeCCCC--CcCHHHHHHHHHHH--hccCChHH Q lcl|NC_010808. 384 LEQRTKTKEGLFTK-GLRRRAKLLETILKNTRSIDAN---KDFNTVRYVYNRNL--PKSLIEELKAYIDS--GGKISQTT 455 (512) Q Consensus 384 l~~k~~~~~~~~~~-~l~~~~~li~~~l~~~~~~~~~---~d~~~i~i~f~~~~--p~d~~~~~~~~~kl--~g~~s~et 455 (512) ....+...+..|.. .++.+++..+...-..+.++.+ .......+.|..+. ..|....+++...+ +|+.|.+. T Consensus 364 ~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~ 443 (502) T protein:vir:79 364 STDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESD 443 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHH Confidence 77777777766654 3333555443333333333221 11223456774443 36777777776665 79999999 Q ss_pred HHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcc---cCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 456 LMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIY---KDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 456 ~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~e 512 (512) .+...| .|+++.++++.+|.+...+.-..... ..+.......+.++..+..++.| T Consensus 444 ~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~e~~~~~~~~e 501 (502) T protein:vir:79 444 WVRAGG--RNPDDVKRRRKAEIDENRKLDLVFDTDPASDKGGSSAATKRQEPQHTDDQSE 501 (502) T ss_pred HHHHcC--CCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 999987 48999999999998776553221111 11111111111122222222222 No 92 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.66 E-value=5.9e-15 Score=98.59 Aligned_cols=475 Identities=11% Similarity=0.036 Sum_probs=210.1 Q ss_pred cccchhHHhhhcHHHHHHHHHH---HHHHHHHHHHHHHHHh--ccccccccccccccccccc--ceeeecchHHHHHHHH Q lcl|NC_010808. 30 TYDGTESDLLQNINEVSKYIEH---HMDYQRPRLKVLSDYY--EGKTKNLVELTRRKEEYMA--DNRVAHDYASYISDFI 102 (512) Q Consensus 30 ~~~~~~~~~~~~~~~l~~~i~~---~~~~~~~r~~~~~~yy--~G~~~~~~~~~~~~~~~~~--~~ri~~n~~~~iv~~~ 102 (512) ........ ...++..+.. +....+.....-.+|| .|.+............... .-.+.+|..+.+|+.. T Consensus 1 m~~~~~~~----~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v 76 (708) T protein:vir:10 1 MAETLEKK----HERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) T ss_pred CchhHHHH----HHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHHHHH Confidence 11111111 2222332222 1122223333344555 5776543221111111111 1247889999999999 Q ss_pred HhhhhccCceecC--C----chhH----HHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEEC---CC------CceE Q lcl|NC_010808. 103 NGYFLGNPIQCQD--D----DKDV----LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRN---QD------DETR 163 (512) Q Consensus 103 a~~l~g~~~~~~~--~----d~~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d---~~------g~~~ 163 (512) +++-..+.+.+.. . +.+. ...+..+++.|+.+.....+..+++++|.+|+.+..| +. .++. T Consensus 77 ~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~~~~~i~ 156 (708) T protein:vir:10 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) T ss_pred HHHHHhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccccccCCCCCccccc Confidence 9999888776532 1 2222 3345666778999999999999999999999877553 11 1222 Q ss_pred E-EEEccceeEEEEeCCCCc-ee----EEEEEE----------eeee-------------eeccCCcceEEEEEEEcCCc Q lcl|NC_010808. 164 L-YKSDAMSTFVIYDNTIER-NS----IAGVRY----------LRTK-------------PIDKTDEDEVFTVDLFTSHG 214 (512) Q Consensus 164 i-~~~~p~~~~~i~d~~~~~-~~----~~~v~~----------~~~~-------------~~~~~~~~~~~~~~~yt~~~ 214 (512) + .+.+|... ++||+.... .. .++.+. |... ..++...+.+..+++|.... T Consensus 157 i~~~~~p~~~-v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~v~ey~~r~~ 235 (708) T protein:vir:10 157 IEPIYDPSRS-VWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRK 235 (708) T ss_pred eEEeecchhh-cccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceEEEEeeeEEE Confidence 2 23344321 233321100 00 001111 1000 00000112233333332221 Q ss_pred E---------------EEEEecCC----------cc----------------ccccc-cccccccccccccceEeecC-- Q lcl|NC_010808. 215 V---------------YRYLTSRT----------NG----------------LKLTP-RENGFESHSFERMPITEFSN-- 250 (512) Q Consensus 215 ~---------------~~~~~~~~----------~~----------------~~~~~-~~~~~~~~~~~~vPvv~~~n-- 250 (512) + ..|..... +. ...+. ......+-|++.+|+|++.- T Consensus 236 ~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p~~~fP~vP~~g~r 315 (708) T protein:vir:10 236 ESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKR 315 (708) T ss_pred EEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCCCCceeeEEEeeee Confidence 1 11110000 00 00000 01233555667788887632 Q ss_pred -----CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec-CCcCChhhhhhhhhccccccchhhhhhcccccC Q lcl|NC_010808. 251 -----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKG-NLSLDPDEVKKQKEANVLFLEPTVYENRDTGIE 324 (512) Q Consensus 251 -----~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 324 (512) .+.+.|.+.++++.++.+|...|.+.+.+-.......++.. ...............+...+.........+... T Consensus 316 ~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~ 395 (708) T protein:vir:10 316 WFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNII 395 (708) T ss_pred eccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhhhccccccccccccc Confidence 12235788999999999999999999887655444333211 100000000000000000000000000000000 Q ss_pred CCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 325 TEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAK 404 (512) Q Consensus 325 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~ 404 (512) ......+.+..+.-..++..++......|..+|++-+.+.|. .+|.||+||......-..........+..+.+++.+ T Consensus 396 -~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~-~sn~SG~aI~~rq~qg~~~l~~~~Dnl~~~~~~~g~ 473 (708) T protein:vir:10 396 -AGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGE 473 (708) T ss_pred -cccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccC-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111112222333345667777888888899998887777775 567899999988777777777777778888887777 Q ss_pred HHHHHHHhccCCC------c---c---------------------ccc----ceeeEEeCCCCCcCHHHHHHHHHHHhcc Q lcl|NC_010808. 405 LLETILKNTRSID------A---N---------------------KDF----NTVRYVYNRNLPKSLIEELKAYIDSGGK 450 (512) Q Consensus 405 li~~~l~~~~~~~------~---~---------------------~d~----~~i~i~f~~~~p~d~~~~~~~~~kl~g~ 450 (512) +++.+........ . . .|. .+|.+.=.|..+.-..+.++.++.+.+. T Consensus 474 ~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~qll~~ 553 (708) T protein:vir:10 474 VWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSS 553 (708) T ss_pred HHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHHHHh Confidence 7666554321100 0 0 000 0333333445555555666777666443 Q ss_pred CCh---HH------HHHhCCCCCCHHHHHHHHHHHHHH-------------HHHHHHhhcccCCCCCCCCCCCCCCcCcc Q lcl|NC_010808. 451 ISQ---TT------LMSLFSFFQDPELEVKKIEEDEKE-------------SIKKAQKGIYKDPRDINDDEQDDDTKDTV 508 (512) Q Consensus 451 ~s~---et------~~~~~~~v~d~~~E~~ri~~E~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (512) ++. .+ +++.+. +...++-.++|++.... .....+..............+....+-.. T Consensus 554 ~~p~~~~~~~~~~~~l~~~D-~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~~~~e~qa~~~~~qA 632 (708) T protein:vir:10 554 MLPTDPMRPAIQGIILDNID-GEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQA 632 (708) T ss_pred cCCCchhhHHHHHHHHHhcC-CcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 332 11 222222 22333334455432110 00000000000000000000000000000 Q ss_pred cCCC Q lcl|NC_010808. 509 DKKE 512 (512) Q Consensus 509 ~~~e 512 (512) +-.+ T Consensus 633 e~~k 636 (708) T protein:vir:10 633 EAQK 636 (708) T ss_pred HHHH Confidence 0000 No 93 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=99.64 E-value=2.2e-14 Score=95.50 Aligned_cols=458 Identities=12% Similarity=0.076 Sum_probs=209.0 Q ss_pred hhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHH---HHH----------HHHHHhccccccccccccc Q lcl|NC_010808. 14 RENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRP---RLK----------VLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 14 ~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~---r~~----------~~~~yy~G~~~~~~~~~~~ 80 (512) |.+......++-+.-+.. -.....|.+....+.+.+.+ ++. ++.+||.|.... ... T Consensus 1 ~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~----~~~ 69 (651) T protein:vir:80 1 MKLATTTTDKNRQTYDET-------HDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLR----SVG 69 (651) T ss_pred Ccccccccchhhhhhhhh-------HHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhcccccc----ccC Confidence 111122222221111111 11123344555555544432 232 334566554321 111 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhcc-----C-ceec-CCchh----HHHHHHHHH----hccChhHHHHHHHHHHH Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGN-----P-IQCQ-DDDKD----VLEAIEAFN----DLNDVESHNRSLGLDLS 145 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~-----~-~~~~-~~d~~----~~~~l~~~~----~~n~~~~~~~~~~~~~~ 145 (512) .++..-..+++.+.....|+.+...|+.. . +.+. .++++ ..+.+..++ ..++|......+..+++ T Consensus 70 ~~~~~~rs~~~~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l 149 (651) T protein:vir:80 70 DVNADWRHKITTGKAFEAIETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLL 149 (651) T ss_pred CCCCCCCccccChhHHHHHHHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhc Confidence 12212234688899999999888877542 1 1221 11222 334466554 36789988999999999 Q ss_pred hCCeEEEEEEECCC-------------------------------CceEEEEEccceeEEEEeCCCCc--eeEEEEEEee Q lcl|NC_010808. 146 IYGKAYELMIRNQD-------------------------------DETRLYKSDAMSTFVIYDNTIER--NSIAGVRYLR 192 (512) Q Consensus 146 ~~G~a~~~v~~d~~-------------------------------g~~~i~~~~p~~~~~i~d~~~~~--~~~~~v~~~~ 192 (512) ++|.|++.||++.. |.|++..++|.++++ |++... ...+.+|.+. T Consensus 150 ~~G~~i~kv~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~--dp~a~~~~d~~~v~~~~~ 227 (651) T protein:vir:80 150 ITGNSVLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFY--DPNVTDPNRGAFIRKLTK 227 (651) T ss_pred ccCceEEEEeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeee--cCCCcCccccceeeeeee Confidence 99999999887521 457889999998764 443221 1112223321 Q ss_pred eeee-------------------cc---------C------------C---cceEEEEEEEcCCcEEEEEecCCcccc-- Q lcl|NC_010808. 193 TKPI-------------------DK---------T------------D---EDEVFTVDLFTSHGVYRYLTSRTNGLK-- 227 (512) Q Consensus 193 ~~~~-------------------~~---------~------------~---~~~~~~~~~yt~~~~~~~~~~~~~~~~-- 227 (512) +... +. . + ...+..+++|.+ +...+.+... T Consensus 228 t~~~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~-----~d~e~~~~~~~~ 302 (651) T protein:vir:80 228 TKADILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGD-----IHLENKTYHDVV 302 (651) T ss_pred eHHHHHHHHhcccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEE-----eeccCCceEEEE Confidence 1100 00 0 0 001122333321 1111111100 Q ss_pred c---ccccccccccc-ccccceEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChh Q lcl|NC_010808. 228 L---TPRENGFESHS-FERMPITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPD 298 (512) Q Consensus 228 ~---~~~~~~~~~~~-~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~ 298 (512) . +........++ +..+|++.+ +...+|+|..+.+.+.+..+|.+...+...+...+.|.+.+........+ T Consensus 303 v~~~g~~il~~~~~~~~~~~Pf~~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~ 382 (651) T protein:vir:80 303 VTIMGNEVLRFEQNPYWCGRPFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPE 382 (651) T ss_pred EEEcCcEEecccccCCCCCCCeeeecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHH Confidence 0 00011111222 223466543 33457999999999999999999999999999999999776432223333 Q ss_pred hhhhhhhccccccchhhhhhcccccCCCCCcceeEEeec-CCHHHHHHHHHHHHHHHHHHhccccccccc---ccccchH Q lcl|NC_010808. 299 EVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQ-YDVQGTEAYKDRLNSDIHMFTNTPNMKDDN---FSGTQSG 374 (512) Q Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~~~n~Sg 374 (512) ++.. ..++++. ....+++..+... .+.......+..+...+...++++....+. ..++.+| T Consensus 383 ~l~~-~pg~vi~--------------~~~~~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TA 447 (651) T protein:vir:80 383 DVYT-EPGKVFL--------------VSDHGDLQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTA 447 (651) T ss_pred Hhhc-CCCceEE--------------ecCCCCceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccH Confidence 3221 1122211 1223445555543 244556678889999999999988766543 2245577 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhccCCCcc---------------cccceeeEEeCCCCCcC-- Q lcl|NC_010808. 375 EAMKYKLFGLEQRTKTKEGLFTK-GLRRRAKLLETILKNTRSIDAN---------------KDFNTVRYVYNRNLPKS-- 436 (512) Q Consensus 375 ~Ai~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~~l~~~~~~~~~---------------~d~~~i~i~f~~~~p~d-- 436 (512) .++......+.......-+.|.. +++.+++.++.++......+.. ....+++..+.- .+.. T Consensus 448 teI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~i-v~~g~~ 526 (651) T protein:vir:80 448 AEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRL-VPIGSD 526 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeee-eeccHH Confidence 77777766666666666666655 5666665555555432211100 001233333311 1112 Q ss_pred -HHHHHHHHHHH------hccCC---h-----HH---HHHhCCCCCCHHHHH--------H-----HHHHHHHHHHHHHH Q lcl|NC_010808. 437 -LIEELKAYIDS------GGKIS---Q-----TT---LMSLFSFFQDPELEV--------K-----KIEEDEKESIKKAQ 485 (512) Q Consensus 437 -~~~~~~~~~kl------~g~~s---~-----et---~~~~~~~v~d~~~E~--------~-----ri~~E~~~~~~~~~ 485 (512) ..+....+.++ .+..| . .. +++.+| +.++..=+ . ++.+.+....+... T Consensus 527 ~~~~r~~~~~~l~~~~q~~~~~p~~~~~~~~~~~~~~l~~~~g-~~~~~~~l~~~~q~~~~~~~~~~~~q~~~~~~~a~~ 605 (651) T protein:vir:80 527 HVIERKQYIEDRLTFIQAVAQVPEMGQLVDYKRILVDLLQHWG-FEEPEAYLKQQDQQAPANPQEALLSQAKDVGGQAMS 605 (651) T ss_pred HHHHHHHHHHHHHHHHHhhccCCccchhhhHHHHHHHHHHHcC-CCCcHHhcCCCccchhhhhhHHHHhhHHHHHHHHHH Confidence 12222222222 12222 1 11 233333 33322100 0 00000000000000 Q ss_pred hhcccCCCCCCCCCCCCCCcCcccCC--------C Q lcl|NC_010808. 486 KGIYKDPRDINDDEQDDDTKDTVDKK--------E 512 (512) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~~--------e 512 (512) .... .......+.+-..+.+ | T Consensus 606 ~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~ 634 (651) T protein:vir:80 606 NMLQ------NQLQADGGTQMMSEMYGTPNADQMQ 634 (651) T ss_pred HHHH------HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 0000000111111111 1 No 94 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=99.63 E-value=1.4e-14 Score=96.58 Aligned_cols=469 Identities=9% Similarity=0.005 Sum_probs=216.3 Q ss_pred ccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccC Q lcl|NC_010808. 31 YDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNP 110 (512) Q Consensus 31 ~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~ 110 (512) +......+.+.-.++...+...... +....+-.+||.|.+............. |..+|..+.+|+..+++-.-+. T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~-R~~a~~d~~fy~G~QW~~~~~~~l~~q~----rp~~N~i~~~v~~v~g~e~~nr 75 (725) T protein:vir:10 1 MADNENRLESILSRFDADWTASDEA-RREAKNDLFFSRVSQWDDWLSQYTTLQY----RGQFDVVRPVVRKLVSEMRQNP 75 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHH-HHHHHHHHHhhcCCCCCHHHHHHHHhcC----CCcccchHHHHHHHHhhHHhCC Confidence 2223333433334444444444333 3355677799999987543333333333 3457999999999999987776 Q ss_pred ceecC-----CchhH----HHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEE---CCC---CceEEEEE----ccce Q lcl|NC_010808. 111 IQCQD-----DDKDV----LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR---NQD---DETRLYKS----DAMS 171 (512) Q Consensus 111 ~~~~~-----~d~~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~---d~~---g~~~i~~~----~p~~ 171 (512) +.+.+ ++.+. ...+..+.+.++.+.....+..+++++|.+|+-|.. +++ +++.|... +|.+ T Consensus 76 ~d~~v~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~~~ 155 (725) T protein:vir:10 76 IDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH 155 (725) T ss_pred cceEEecCCcchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCCceeeeeeecccCHhH Confidence 65532 22333 334555567799999999999999999999988743 333 33444332 3444 Q ss_pred eEEEEeCCCC------ceeEEEEEEeeee--------------------------eeccCCcceEEEEEEEcCCcEE--E Q lcl|NC_010808. 172 TFVIYDNTIE------RNSIAGVRYLRTK--------------------------PIDKTDEDEVFTVDLFTSHGVY--R 217 (512) Q Consensus 172 ~~~i~d~~~~------~~~~~~v~~~~~~--------------------------~~~~~~~~~~~~~~~yt~~~~~--~ 217 (512) ++ ||+... .+-+ +++.|... ..++...+.+..+++|....+. . T Consensus 156 v~--~Dp~a~~~D~sDar~~-~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~ 232 (725) T protein:vir:10 156 VI--WDSNSKLMDKSDARHC-TVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETA 232 (725) T ss_pred cc--cCchhhccChhhhhhh-hhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEEEEEEEEEEEEeeEE Confidence 43 443210 1100 11111100 0000112233444444332111 1 Q ss_pred EEe-c-CCccc-------------------------------------ccccc-ccccccccccccceEee---cCC--- Q lcl|NC_010808. 218 YLT-S-RTNGL-------------------------------------KLTPR-ENGFESHSFERMPITEF---SNN--- 251 (512) Q Consensus 218 ~~~-~-~~~~~-------------------------------------~~~~~-~~~~~~~~~~~vPvv~~---~n~--- 251 (512) +.. . ..+.. ..+.. ...+.+.+-+.+|+|+| ... T Consensus 233 ~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~~g~r~~~~g 312 (725) T protein:vir:10 233 FIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVED 312 (725) T ss_pred EEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEEEEEeeeeccCC Confidence 100 0 00000 00000 01222333344566553 221 Q ss_pred -CCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCcee-eeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCc Q lcl|NC_010808. 252 -ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAML-LIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSV 329 (512) Q Consensus 252 -~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~l-v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (512) +.+.|.+.++++.++.+|...|.+...+-....-.. +-.+........ - ....+...+..+...... +.-... T Consensus 313 ~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~-~-~~~~~~~~~~~~~~~~~~---g~~~~~ 387 (725) T protein:vir:10 313 KEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHM-Y-DGNDDYPYYLLNRTDENN---GEMPTQ 387 (725) T ss_pred cceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHH-H-hccCCceeeecccccccC---cccccc Confidence 223488999999999999999999988764433222 222211110000 0 001111111000000000 000111 Q ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 330 DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETI 409 (512) Q Consensus 330 ~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~ 409 (512) .+++...+.-..++...+......|..++++-+.+.|..+++.||+|+.................+..+.+++.++++.+ T Consensus 388 ~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~l 467 (725) T protein:vir:10 388 PLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSI 467 (725) T ss_pred cCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23333333334566778889999999999988888888888899999998877777776767777777777776666665 Q ss_pred HHhccCCC---------ccccc------------------------ceeeEEeCCCCCcCHHHHHHHHHHHhccCCh--- Q lcl|NC_010808. 410 LKNTRSID---------ANKDF------------------------NTVRYVYNRNLPKSLIEELKAYIDSGGKISQ--- 453 (512) Q Consensus 410 l~~~~~~~---------~~~d~------------------------~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~--- 453 (512) ........ ...++ .++.|.-.|..+.=..+.+..++.+...++. T Consensus 468 I~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~ 547 (725) T protein:vir:10 468 VNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELLGKTPQGTP 547 (725) T ss_pred HHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHHHhccccch Confidence 43211100 00000 1233333333333234455555554333331 Q ss_pred ---HHHHHhCCC--CCCHHHHHHHHHHHHHHHHH-------------HHHhhcccC--CCCCCC-----CCCCCCCcCcc Q lcl|NC_010808. 454 ---TTLMSLFSF--FQDPELEVKKIEEDEKESIK-------------KAQKGIYKD--PRDIND-----DEQDDDTKDTV 508 (512) Q Consensus 454 ---et~~~~~~~--v~d~~~E~~ri~~E~~~~~~-------------~~~~~~~~~--~~~~~~-----~~~~~~~~~~~ 508 (512) .+++..++. +...++-.++|.++...... ..+...... ....+- ..+.+-.+-.. T Consensus 548 ~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~~~~~qae~~ka~a 627 (725) T protein:vir:10 548 EYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQN 627 (725) T ss_pred hHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 233333332 22223334555543211100 000000000 000000 00000000000 Q ss_pred cCCC Q lcl|NC_010808. 509 DKKE 512 (512) Q Consensus 509 ~~~e 512 (512) +..+ T Consensus 628 E~~k 631 (725) T protein:vir:10 628 QTLS 631 (725) T ss_pred HHHH Confidence 0000 No 95 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.63 E-value=2.3e-14 Score=95.40 Aligned_cols=470 Identities=10% Similarity=0.014 Sum_probs=215.3 Q ss_pred ccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccC Q lcl|NC_010808. 31 YDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNP 110 (512) Q Consensus 31 ~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~ 110 (512) +......+.+.-.++...+...... +....+-.+||.|.+............. |..+|..+.+|+...++-.-+. T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~-r~~a~~d~~fy~G~Qw~~~~~~~l~~q~----rp~~N~i~~~i~~v~g~~~~nr 75 (725) T protein:vir:77 1 MADNENRLESILSRFDADWTASDEA-RREAKNDLFFSRVSQWDDWLSQYTTLQY----RGQFDVVRPVVRKLVSEMRQNP 75 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHH-HHHHHHHHHhhCCCCCCHHHHHHHHhcC----CCccccHHHHHHHHHhhHHhCC Confidence 2223333433334444444444333 3345566799999987543333333333 3467999999999999887776 Q ss_pred ceecC-----CchhH----HHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEEC---CC---CceEEEEE----ccce Q lcl|NC_010808. 111 IQCQD-----DDKDV----LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRN---QD---DETRLYKS----DAMS 171 (512) Q Consensus 111 ~~~~~-----~d~~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d---~~---g~~~i~~~----~p~~ 171 (512) +.+.+ ++.+. ...+..+.+.++.+.....+..+++++|.+|+-|..| ++ ++++|... +|.+ T Consensus 76 ~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~~~~~~~~ 155 (725) T protein:vir:77 76 IDVLYRPKDGARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH 155 (725) T ss_pred cceEEecCCccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeecccChhh Confidence 65532 22233 3345556677999999999999999999999887543 22 33444332 3444 Q ss_pred eEEEEeCCCCc-ee----EEEEEEeeee--------------------------eeccCCcceEEEEEEEcCCcEEEEE- Q lcl|NC_010808. 172 TFVIYDNTIER-NS----IAGVRYLRTK--------------------------PIDKTDEDEVFTVDLFTSHGVYRYL- 219 (512) Q Consensus 172 ~~~i~d~~~~~-~~----~~~v~~~~~~--------------------------~~~~~~~~~~~~~~~yt~~~~~~~~- 219 (512) + +||+.... .. .++++.|... ..+....+.+..+++|....+.... T Consensus 156 v--~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~~r~~~~~~~~ 233 (725) T protein:vir:77 156 V--IWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAF 233 (725) T ss_pred c--eeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEEEEEEEeeEEE Confidence 3 33331110 00 0001111000 0001112334445555433221111 Q ss_pred -ecC--Cc--------------------cc-----------------ccccc-ccccccccccccceEee---cCC---- Q lcl|NC_010808. 220 -TSR--TN--------------------GL-----------------KLTPR-ENGFESHSFERMPITEF---SNN---- 251 (512) Q Consensus 220 -~~~--~~--------------------~~-----------------~~~~~-~~~~~~~~~~~vPvv~~---~n~---- 251 (512) ... ++ +. ..+.. ..++.+.+-+.+|+|++ ... T Consensus 234 ~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~g~ 313 (725) T protein:vir:77 234 IYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDK 313 (725) T ss_pred EecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEEeeeeeccCCc Confidence 000 00 00 00000 01222333344566653 221 Q ss_pred CCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCce-eeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcc Q lcl|NC_010808. 252 ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAM-LLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVD 330 (512) Q Consensus 252 ~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~-lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (512) +.+.|.+.++++.++.+|...|.+...+-....-. .+..|.......... ..++...+.........+ .-..+. T Consensus 314 ~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~--~~~~~~~~~~~~~~~~~g---~~~~~~ 388 (725) T protein:vir:77 314 EVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYD--GNDDYPYYLLNRTDENSG---DLPTQP 388 (725) T ss_pred ccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHH--hccCCceecccccccCCC---cccccC Confidence 22348888999999999999999988876543322 222222111111100 001111110000000000 001112 Q ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 331 GGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETIL 410 (512) Q Consensus 331 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l 410 (512) +.....+.-..++...+......|...+++-+.+.|..++++||+|+..........+......+..+.+++.++++.+. T Consensus 389 i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI 468 (725) T protein:vir:77 389 LAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIV 468 (725) T ss_pred ccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22333333335566688888889999998888888887788999999998877777777777778888888777766654 Q ss_pred HhccC---------CCccccc------------------------ceeeEEeCCCCCcCHHHHHHHHHHHhccCCh---- Q lcl|NC_010808. 411 KNTRS---------IDANKDF------------------------NTVRYVYNRNLPKSLIEELKAYIDSGGKISQ---- 453 (512) Q Consensus 411 ~~~~~---------~~~~~d~------------------------~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~---- 453 (512) ..... .+...++ .+|.+.=.+..+.=..+.+..++.+...++. T Consensus 469 ~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~ 548 (725) T protein:vir:77 469 NDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPE 548 (725) T ss_pred HHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHHHHhccccchh Confidence 33211 0000000 1222322333322233444444444332322 Q ss_pred --HHHHHhCCCCC--CHHHHHHHHHHHHHHHHHHH-------------Hhhccc--CCCCCC-----CCCCCCCCcCccc Q lcl|NC_010808. 454 --TTLMSLFSFFQ--DPELEVKKIEEDEKESIKKA-------------QKGIYK--DPRDIN-----DDEQDDDTKDTVD 509 (512) Q Consensus 454 --et~~~~~~~v~--d~~~E~~ri~~E~~~~~~~~-------------~~~~~~--~~~~~~-----~~~~~~~~~~~~~ 509 (512) .++...+...+ ..++..+++.++........ +..... .+.... -..+.+-.+-..+ T Consensus 549 ~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~~~~qa~~~kaq~e 628 (725) T protein:vir:77 549 YQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQ 628 (725) T ss_pred HHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12222222222 12333455544322110000 000000 000000 0000000000000 Q ss_pred CCC Q lcl|NC_010808. 510 KKE 512 (512) Q Consensus 510 ~~e 512 (512) ..+ T Consensus 629 ~~k 631 (725) T protein:vir:77 629 TLS 631 (725) T ss_pred HHH Confidence 000 No 96 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=99.61 E-value=3.4e-14 Score=94.39 Aligned_cols=459 Identities=11% Similarity=0.076 Sum_probs=200.0 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHH-HHHHHHHHHHHHhcccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMD-YQRPRLKVLSDYYEGKTKNLVELTR 79 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~-~~~~r~~~~~~yy~G~~~~~~~~~~ 79 (512) ||+--. .+.. ++..+........+.-..+.+ .......+..+||.|+...... T Consensus 1 ~~k~~~-------------~~~~----------~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~~~--- 54 (705) T protein:vir:88 1 MAKRRK-------------IKPM----------DDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGNER--- 54 (705) T ss_pred CCcccc-------------cccC----------CHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCccc--- Confidence 333211 1111 111111112221111111112 1222345566899997543211 Q ss_pred cccccccceeeecchHHHHHHHHHhhhh----ccC--ceecC---CchhH----HHHHHHH-HhccChhHHHHHHHHHHH Q lcl|NC_010808. 80 RKEEYMADNRVAHDYASYISDFINGYFL----GNP--IQCQD---DDKDV----LEAIEAF-NDLNDVESHNRSLGLDLS 145 (512) Q Consensus 80 ~~~~~~~~~ri~~n~~~~iv~~~a~~l~----g~~--~~~~~---~d~~~----~~~l~~~-~~~n~~~~~~~~~~~~~~ 145 (512) ++ ..+++.+.....|+.+...|. +.+ +++.. +|.+. ...++.+ .+.|+....+..++++++ T Consensus 55 ---~~--~s~~~~~~v~~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal 129 (705) T protein:vir:88 55 ---PG--KSGIVSRDVQETVDWIMPSLMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTL 129 (705) T ss_pred ---CC--CCccccHHHHHHHHHHHHHHHHhhcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHh Confidence 12 235667777777787777664 322 23322 33332 3344443 456777778889999999 Q ss_pred hCCeEEEEEEECCC------------------------------------------------CceEEEEEccceeEEEEe Q lcl|NC_010808. 146 IYGKAYELMIRNQD------------------------------------------------DETRLYKSDAMSTFVIYD 177 (512) Q Consensus 146 ~~G~a~~~v~~d~~------------------------------------------------g~~~i~~~~p~~~~~i~d 177 (512) ++|.+++.||++.. |++++..++|.++++--+ T Consensus 130 ~~g~gi~kv~we~~~~~~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~ 209 (705) T protein:vir:88 130 MMKTGVVKVYVEEVLKPTFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRL 209 (705) T ss_pred hcCCeEEEeccccccchhhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCC Confidence 99999999988432 567888889988754322 Q ss_pred CC-CCceeEEEEEEeeee-ee---------------c-----------------cC----------CcceEEEEEEEcCC Q lcl|NC_010808. 178 NT-IERNSIAGVRYLRTK-PI---------------D-----------------KT----------DEDEVFTVDLFTSH 213 (512) Q Consensus 178 ~~-~~~~~~~~v~~~~~~-~~---------------~-----------------~~----------~~~~~~~~~~yt~~ 213 (512) .. ...-...+.+++.+. +. + +. .......|++|. T Consensus 210 a~~~~d~~~~~~~~~~t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E-- 287 (705) T protein:vir:88 210 ATCIDDARFLCHREKYTVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASE-- 287 (705) T ss_pred CCCcccCcEEEEEEeccHHHHHhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEE-- Confidence 11 111111122221110 00 0 00 000001122210 Q ss_pred cEEEEEecCCcccc------ccccccccccccccccceEe-----ecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010808. 214 GVYRYLTSRTNGLK------LTPRENGFESHSFERMPITE-----FSNNERRKGDYEKVITLIDLYDNAESDTANYMSDL 282 (512) Q Consensus 214 ~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~vPvv~-----~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~ 282 (512) .+.++...+.+... .+.... ..-+++.+|++. .+...+|.|.++.+.++++.+|...+.+.+.+... T Consensus 288 ~y~~~d~~~d~~~~~~~~~~~g~~il--~~~~~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~ 365 (705) T protein:vir:88 288 CYTLLDVDGDGISELRRILYVGDYII--SNEPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRT 365 (705) T ss_pred eeeEecccCCcceeeEEEEEeCcccc--ccccCCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhc Confidence 00001111111000 000000 112455666664 44556789999999999999999999999999888 Q ss_pred cCceeee-ecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_010808. 283 NDAMLLI-KGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTP 361 (512) Q Consensus 283 ~~~~lv~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 361 (512) .+|.+.+ .|.. +..+....+.++++ ....++.+.++..+.........+..+...+...|+++ T Consensus 366 ~~~~~~~~~g~v--~~~d~~~~~pg~vv--------------~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGi~ 429 (705) T protein:vir:88 366 NQGRSVVLDGQV--NLEDLLTNEAAGIV--------------RVKSMNSITPLETPQLSGEVYGMLDRLEADRGKRTGIT 429 (705) T ss_pred cCCceecccccc--CcccccccCCCeeE--------------EecCCCccccccCCcCcHHHHHHHHHHHHHHHHhhCCc Confidence 8876554 3322 11111111111111 11223446666555556667778899999999999999 Q ss_pred cccccc----ccccchHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhccCCCcccc--------------- Q lcl|NC_010808. 362 NMKDDN----FSGTQSGEAMKYKLFGLEQRTKTKEGLFT-KGLRRRAKLLETILKNTRSIDANKD--------------- 421 (512) Q Consensus 362 ~~~~~~----~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~-~~l~~~~~li~~~l~~~~~~~~~~d--------------- 421 (512) +.+.|. ..++.|+.|+...............+.|. .++++++++++.++........-+. T Consensus 430 ~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~ 509 (705) T protein:vir:88 430 DRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRE 509 (705) T ss_pred hHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhcc Confidence 888763 23456778887777777777777777775 4566666666665543322110000 Q ss_pred cceeeEEeCCCCCcCHHHHHHHHHHHh----c---------cCChHHH-------HHhCCCCCCHHH------HHHHHHH Q lcl|NC_010808. 422 FNTVRYVYNRNLPKSLIEELKAYIDSG----G---------KISQTTL-------MSLFSFFQDPEL------EVKKIEE 475 (512) Q Consensus 422 ~~~i~i~f~~~~p~d~~~~~~~~~kl~----g---------~~s~et~-------~~~~~~v~d~~~------E~~ri~~ 475 (512) -.++.+.-... ..+..+....+..+. . +++.... .+.++ +.++++ .++..+. T Consensus 510 ~~~v~v~v~~~-~~~~eq~~a~l~~ll~~~q~l~~~~~~~~~~~~~~~~~~~~el~e~~~-~k~~~~~~~~~~~~e~~~~ 587 (705) T protein:vir:88 510 RSDLTVTVGIG-NMNKDQQMLHLMRIWEMAQAVVGGGGLGVLVSEQNLYNILKEVTENAG-YKDPDRFWTNPNSPEALQA 587 (705) T ss_pred CCceEEeeccc-cchHHHHHHHHHHHHHHHHHhhcccchhhhcChHHHHHHHHHHHHhhh-hhhHHHHhhhhhhHHHHHH Confidence 01122221111 111222211111110 0 1111110 11111 011000 0000000 Q ss_pred HH----HHHHHHHHhhcccC-CCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 476 DE----KESIKKAQKGIYKD-PRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 476 E~----~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~e 512 (512) +. .+............ ......+.+-...+-...+.| T Consensus 588 ~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~~q~e~q~~q~E 629 (705) T protein:vir:88 588 KAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVE 629 (705) T ss_pred HHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 00000000000000 000000000000000000000 No 97 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=99.60 E-value=3.4e-14 Score=94.40 Aligned_cols=471 Identities=9% Similarity=0.019 Sum_probs=202.4 Q ss_pred cccchhHHhhhcHHHHHHHHHHH----HHHHHHHHHHH-HHHhccccccccccccccccccc--ceeeecchHHHHHHHH Q lcl|NC_010808. 30 TYDGTESDLLQNINEVSKYIEHH----MDYQRPRLKVL-SDYYEGKTKNLVELTRRKEEYMA--DNRVAHDYASYISDFI 102 (512) Q Consensus 30 ~~~~~~~~~~~~~~~l~~~i~~~----~~~~~~r~~~~-~~yy~G~~~~~~~~~~~~~~~~~--~~ri~~n~~~~iv~~~ 102 (512) +....... ...+...+... ...+....... ..||.|.+............... .-.+.+|..+.+|+.. T Consensus 1 ma~~~~~~----~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v 76 (708) T protein:vir:17 1 MAETLEKK----HERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRI 76 (708) T ss_pred CchhHHHH----HHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHH Confidence 11111111 22223322221 11111111111 36899987643322222211111 1247889999999999 Q ss_pred HhhhhccCceecC--C----chhH----HHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEEC---CC------CceE Q lcl|NC_010808. 103 NGYFLGNPIQCQD--D----DKDV----LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRN---QD------DETR 163 (512) Q Consensus 103 a~~l~g~~~~~~~--~----d~~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d---~~------g~~~ 163 (512) +++---+.+.+.. . +.+. ...+..+.+.|+.+.....+..+++++|.+|+.+..| ++ .++. T Consensus 77 ~g~e~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~~~~~~~i~ 156 (708) T protein:vir:17 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) T ss_pred HhhHhhCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeecccccCCCCCCccccc Confidence 9998777665532 2 2222 3345556678999999999999999999999877432 22 2333 Q ss_pred EEEE--ccceeEEEEeCCCCc-e---eE-EEEEE----------ee-------------eeeeccCCcceEEEEEEEcCC Q lcl|NC_010808. 164 LYKS--DAMSTFVIYDNTIER-N---SI-AGVRY----------LR-------------TKPIDKTDEDEVFTVDLFTSH 213 (512) Q Consensus 164 i~~~--~p~~~~~i~d~~~~~-~---~~-~~v~~----------~~-------------~~~~~~~~~~~~~~~~~yt~~ 213 (512) +..+ ++.+++ ||+.... . .. ++++. |. ....++...+.+..+++|... T Consensus 157 i~~~~~~~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~ 234 (708) T protein:vir:17 157 IEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDADVIYIAKYYEVR 234 (708) T ss_pred eEeeccchhhee--cCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCCCeEEEEEEEEEe Confidence 3332 234442 4432100 0 00 00000 00 000111112333334444211 Q ss_pred ---------------cEEEEEecCC----------cc---------------c-cccc-cccccccccccccceEeecCC Q lcl|NC_010808. 214 ---------------GVYRYLTSRT----------NG---------------L-KLTP-RENGFESHSFERMPITEFSNN 251 (512) Q Consensus 214 ---------------~~~~~~~~~~----------~~---------------~-~~~~-~~~~~~~~~~~~vPvv~~~n~ 251 (512) .++.|..... +. . ..+. ....+.+-|++.+|+|+|.-. T Consensus 235 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~ 314 (708) T protein:vir:17 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) T ss_pred eeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccCCCCCCCCccceEEEecc Confidence 1111111000 00 0 0000 112344555667777775321 Q ss_pred ---CC----CCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeee-ecCCcCChhh-hhhhhhccccccchhhhhhcccc Q lcl|NC_010808. 252 ---ER----RKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI-KGNLSLDPDE-VKKQKEANVLFLEPTVYENRDTG 322 (512) Q Consensus 252 ---~~----g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~-~g~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 322 (512) .. ..|.+.++++.++.+|...|.+...+-.......++ .+....-... .....+....... ......... T Consensus 315 r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~-~~~~~~~g~ 393 (708) T protein:vir:17 315 RWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPL-REVRDKYGN 393 (708) T ss_pred cccccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhhh-hccCCcccc Confidence 11 247788999999999999999998876554443332 1111110000 0000011100000 000000010 Q ss_pred cCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 323 IETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRR 402 (512) Q Consensus 323 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~ 402 (512) +...... ......+.-..++...+......|..+|++-+.+.|. .+|+||+|+.................+..+.+++ T Consensus 394 v~~~a~~-~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~-~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~ 471 (708) T protein:vir:17 394 IIAGATP-AGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRA 471 (708) T ss_pred cccccCC-cccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccC-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111111 1112222334667777888888999999888877775 5679999999887777777777777777777776 Q ss_pred HHHHHHHHHhccCCC---------ccc---------------------cc----ceeeEEeCCCCCcCHHHHHHHHHHHh Q lcl|NC_010808. 403 AKLLETILKNTRSID---------ANK---------------------DF----NTVRYVYNRNLPKSLIEELKAYIDSG 448 (512) Q Consensus 403 ~~li~~~l~~~~~~~---------~~~---------------------d~----~~i~i~f~~~~p~d~~~~~~~~~kl~ 448 (512) .++++.+........ ... |. .+|.+.=.+..+.-..+..+.++++. T Consensus 472 g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l~qll 551 (708) T protein:vir:17 472 GEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVL 551 (708) T ss_pred HHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchhHHHHHHHHHHHHH Confidence 666666543321100 000 00 01222222233333334455555543 Q ss_pred ccCChH---H------HHHhCCCCCCHHHHHHHHHHHHHHH--------------------HHHHHhhcccCCCCCCCCC Q lcl|NC_010808. 449 GKISQT---T------LMSLFSFFQDPELEVKKIEEDEKES--------------------IKKAQKGIYKDPRDINDDE 499 (512) Q Consensus 449 g~~s~e---t------~~~~~~~v~d~~~E~~ri~~E~~~~--------------------~~~~~~~~~~~~~~~~~~~ 499 (512) +.++.. + +++.+.+ ...++-.++|++..... ............ ...... T Consensus 552 ~~~~~~~~~~~~~~~l~l~~~D~-p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~~~~ea-qa~~~~ 629 (708) T protein:vir:17 552 SSMLPADPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLA-QAQMVA 629 (708) T ss_pred HhcCCccchhHHHHHHHHHhcCC-CChHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHHHHHHHHHHHHH-HHHHHH Confidence 322211 1 2222221 22222234443321100 000000000000 000000 Q ss_pred CCCCCcCcccCCC Q lcl|NC_010808. 500 QDDDTKDTVDKKE 512 (512) Q Consensus 500 ~~~~~~~~~~~~e 512 (512) ...+- .....+. T Consensus 630 ~qAe~-~ka~aea 641 (708) T protein:vir:17 630 AQAEA-QKATNET 641 (708) T ss_pred HHHHH-HHHHHHH Confidence 00000 0000000 No 98 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=99.60 E-value=3.8e-14 Score=94.17 Aligned_cols=471 Identities=10% Similarity=0.024 Sum_probs=210.1 Q ss_pred ccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccC Q lcl|NC_010808. 31 YDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNP 110 (512) Q Consensus 31 ~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~ 110 (512) +......+.+.-.++...+...... +....+-.+||.|.+............. |..+|..+.+|+...++-.-+. T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~-r~~a~~d~~fy~G~Qw~~~~~~~l~~q~----rp~~N~i~~~i~~v~g~e~~nr 75 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTASDEA-RREAKNDLFFSRISQWDDWLSQYTTLQY----RGQFDVVRPVVRKLVSEMRQNP 75 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHH-HHHHHHHHHhhcCCCCCHHHHHHHHhcC----CCcccchHHHHHHHHhhHHhCC Confidence 2222333333334444444444333 3345577799999987543333333333 3467999999999999877776 Q ss_pred ceecC-----CchhH----HHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEEC---CC---CceEEEEE---cccee Q lcl|NC_010808. 111 IQCQD-----DDKDV----LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRN---QD---DETRLYKS---DAMST 172 (512) Q Consensus 111 ~~~~~-----~d~~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d---~~---g~~~i~~~---~p~~~ 172 (512) +.+.+ ++.+. ...+..+.+.++.+.....+..+++++|.+|+-|..| ++ ++++|... +|... T Consensus 76 ~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~~~ 155 (725) T protein:vir:92 76 IDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH 155 (725) T ss_pred cceEEecCCccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeeccCChhh Confidence 65432 22333 3345556677999999999999999999999877543 22 34444332 23321 Q ss_pred EEEEeCCCCc-ee----EEEEEEeeee--------------------------eeccCCcceEEEEEEEcCCcEE--EEE Q lcl|NC_010808. 173 FVIYDNTIER-NS----IAGVRYLRTK--------------------------PIDKTDEDEVFTVDLFTSHGVY--RYL 219 (512) Q Consensus 173 ~~i~d~~~~~-~~----~~~v~~~~~~--------------------------~~~~~~~~~~~~~~~yt~~~~~--~~~ 219 (512) ++||+.... .. .++++.|... ..+....+.+..+++|....+. .+. T Consensus 156 -V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~~~~~~~~~ 234 (725) T protein:vir:92 156 -VIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFI 234 (725) T ss_pred -cccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEEEEEEEEEEEEeeeEEe Confidence 123321100 00 0000000000 0000112233344444322111 110 Q ss_pred e-c-CCcc------------------------------------c-ccccc-ccccccccccccceEee---cC----CC Q lcl|NC_010808. 220 T-S-RTNG------------------------------------L-KLTPR-ENGFESHSFERMPITEF---SN----NE 252 (512) Q Consensus 220 ~-~-~~~~------------------------------------~-~~~~~-~~~~~~~~~~~vPvv~~---~n----~~ 252 (512) . . .++. . ..+.. ...+.+.+-+.+|+|+| +. .+ T Consensus 235 ~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~g~~ 314 (725) T protein:vir:92 235 YQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKE 314 (725) T ss_pred ecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeEEEEeeeeccCCcc Confidence 0 0 0000 0 00000 01222333344566554 22 12 Q ss_pred CCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCcee-eeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcce Q lcl|NC_010808. 253 RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAML-LIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDG 331 (512) Q Consensus 253 ~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~l-v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (512) .+.|.+.++++.++.+|...|.+...+-..+.-.. +-.|.......... .......+......... +.-....+ T Consensus 315 ~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~--~~~~~~~~~~~~~~~~~---g~~~~~~i 389 (725) T protein:vir:92 315 VYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYD--GNDDYPYYLLNRTDENN---GEMPTQPL 389 (725) T ss_pred cccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHh--ccCccceeecccccccc---ccccccCC Confidence 23488999999999999999999888764433222 22222211111000 00111111000000000 00011123 Q ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 332 GYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILK 411 (512) Q Consensus 332 ~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~ 411 (512) ++...+.-..++...+......|..++++-+-+.|..+++.||+|+..+..............+..+.+++.++++.+.. T Consensus 390 ~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~lI~ 469 (725) T protein:vir:92 390 AYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVN 469 (725) T ss_pred cccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33333334566777889999999999998888888878889999999887776666666666677777776666665543 Q ss_pred hccCCCc---------cccc------------------------ceeeEEeCCCCCcCHHHHHHHHHHHhccCCh----- Q lcl|NC_010808. 412 NTRSIDA---------NKDF------------------------NTVRYVYNRNLPKSLIEELKAYIDSGGKISQ----- 453 (512) Q Consensus 412 ~~~~~~~---------~~d~------------------------~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~----- 453 (512) ....... ..++ .++.|.=.|..+.-..+....++.+...++. T Consensus 470 ~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~~~~~~~~ 549 (725) T protein:vir:92 470 DIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEY 549 (725) T ss_pred HhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHHHHHHhcccchhHH Confidence 2211000 0000 1222222233222233444444444333332 Q ss_pred -HHHHHhCCCC--CCHHHHHHHHHHHHHHHHH-------------HHHhhccc--CCCCCCC-----CCCCCCCcCcccC Q lcl|NC_010808. 454 -TTLMSLFSFF--QDPELEVKKIEEDEKESIK-------------KAQKGIYK--DPRDIND-----DEQDDDTKDTVDK 510 (512) Q Consensus 454 -et~~~~~~~v--~d~~~E~~ri~~E~~~~~~-------------~~~~~~~~--~~~~~~~-----~~~~~~~~~~~~~ 510 (512) -++...+... ....+..+++.++...... ..+..... ....... ..+.+-.+-..+. T Consensus 550 ~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~qa~~~~~qae~~kaqaE~ 629 (725) T protein:vir:92 550 QLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQT 629 (725) T ss_pred HHHHHHHhhcccchHHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1222222211 1122334445433211100 00000000 0000000 0000000000000 Q ss_pred CC Q lcl|NC_010808. 511 KE 512 (512) Q Consensus 511 ~e 512 (512) ++ T Consensus 630 ~k 631 (725) T protein:vir:92 630 LS 631 (725) T ss_pred HH Confidence 00 No 99 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=99.55 E-value=1.7e-12 Score=85.10 Aligned_cols=444 Identities=14% Similarity=0.096 Sum_probs=220.4 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHH---HHHHHHHHhcccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRP---RLKVLSDYYEGKTKNLVEL 77 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~---r~~~~~~yy~G~~~~~~~~ 77 (512) |+. .+....+-+-++ + ..-.|.+......+.+.+ ++.++++||.+...... T Consensus 1 ~~~--------~~~~~~~~~~~~---------~-------~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~~~-- 54 (584) T protein:vir:95 1 MSV--------KVAELNSLLVRD---------S-------SAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTTTT-- 54 (584) T ss_pred CCc--------chhhhhhhcccc---------c-------hHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhhhh-- Confidence 100 000000000000 0 012222322333333333 35688899988543211 Q ss_pred cccccccccceeeecchHHHHHHHHHhhhhccCc------ee---cCCchh--HHHHHHHHH----hccChhHHHHHHHH Q lcl|NC_010808. 78 TRRKEEYMADNRVAHDYASYISDFINGYFLGNPI------QC---QDDDKD--VLEAIEAFN----DLNDVESHNRSLGL 142 (512) Q Consensus 78 ~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~------~~---~~~d~~--~~~~l~~~~----~~n~~~~~~~~~~~ 142 (512) ...+....+++.+|.+.-+++.+..+++.--+ .+ ..++.+ ..+.++.+. ...++...+..++. T Consensus 55 --~~~~~~~r~~~~~~k~~~~~~~i~~~l~~~~Fp~~~w~~~v~~~~~~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~ 132 (584) T protein:vir:95 55 --SNQGLPWKNSTTLPKLCQIRDNLHSNYFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIY 132 (584) T ss_pred --hhcccccccccchhHHHHHHHHHHHHHHHhhcCccceeeeecCCCchhhHHHHHHHHHHHhhhhhhccHHHHHHHHHH Confidence 11111123477888888888888888754211 11 112222 255566664 55689999999999 Q ss_pred HHHhCCeEEEEEEECCC-------------CceEEEEEccceeEEEEeCCCCcee--EEEEEEee--------------- Q lcl|NC_010808. 143 DLSIYGKAYELMIRNQD-------------DETRLYKSDAMSTFVIYDNTIERNS--IAGVRYLR--------------- 192 (512) Q Consensus 143 ~~~~~G~a~~~v~~d~~-------------g~~~i~~~~p~~~~~i~d~~~~~~~--~~~v~~~~--------------- 192 (512) ++.++|.|++.+++... .++++.-++|.++| +|++..... -+.+|.+. T Consensus 133 d~~~~G~~~~k~~~~~~~~e~~e~~~v~~~~~prieriSP~d~~--~Dpsa~~i~d~~fivrs~~T~~~L~~l~~~~~~~ 210 (584) T protein:vir:95 133 DYIDYGNAFATVSFEAKYKEMTDGTLVPDYIGPRLVRISPLDIV--FNPLATSISDTFKIVRSVKTKGELMRLAQDEPEQ 210 (584) T ss_pred hhccCCceEEEEeEeecceeeeccccccccccceEEeeChhhee--ecCCCCCccchhhhhhhhhhHHHHHHHHhhcCcc Confidence 99999999999877432 26889999999886 565432111 11122111 Q ss_pred ---------------------eeeeccC---Ccce-EEEEEEEcCCcEEEEEe-------cCCccc----ccc---ccc- Q lcl|NC_010808. 193 ---------------------TKPIDKT---DEDE-VFTVDLFTSHGVYRYLT-------SRTNGL----KLT---PRE- 232 (512) Q Consensus 193 ---------------------~~~~~~~---~~~~-~~~~~~yt~~~~~~~~~-------~~~~~~----~~~---~~~- 232 (512) ..+.+.. ..+. ....+.|.+..+.-+.. ...... ..+ ... T Consensus 211 ~y~~d~v~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iI 290 (584) T protein:vir:95 211 SYWLEALKRREEICRHLGGYSVEDFDKAAGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEV 290 (584) T ss_pred ccchHHHHHHHHhccCCCCCcccccccccccccccccccccccCCceeEEEeecccccccccCCCcccceEEEEeccEEE Confidence 1101100 0000 00112222222211110 000000 000 001 Q ss_pred -cccccccccccceEeec-----CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhc Q lcl|NC_010808. 233 -NGFESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEA 306 (512) Q Consensus 233 -~~~~~~~~~~vPvv~~~-----n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~ 306 (512) ....+.+.+.+|++..+ ..-+|.|+.+.+.++++.+|.+.-.+.+.+..+.+|.+...+.. . +. T Consensus 291 R~~~np~~~~~~PF~~~~~~p~~~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k~~~~~--~--~~------ 360 (584) T protein:vir:95 291 RNESIPTWFGSAPIYHVGWRFRPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQPPLKIIGEV--E--EF------ 360 (584) T ss_pred EeeecCCCCCCCCEEEEcceeeeccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCcceeecccc--c--hh------ Confidence 12345666888887653 34579999999999999999999999999999999966554432 1 10 Q ss_pred cccccchhhhhhcccccCCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhcccccccccc-cccchHHHHHHHHHHH Q lcl|NC_010808. 307 NVLFLEPTVYENRDTGIETEGSVDGGYIYKQY-DVQGTEAYKDRLNSDIHMFTNTPNMKDDNF-SGTQSGEAMKYKLFGL 384 (512) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Ai~~~~~~l 384 (512) .+.....+..+..+.++++.++. +.....+.+..+...+-..+++|..+.|.- .++.++..+.....++ T Consensus 361 ---------~~~pg~~~~~~~~~~~q~~~p~a~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~naa 431 (584) T protein:vir:95 361 ---------VWGPGAEIHLDQGGDVQEIAKNVNYIINADNQIQMLEDRMELYAGAPREAMGIRTPGEKTAFEVQQLGNAA 431 (584) T ss_pred ---------cccCCceeecCCCCCcceecCchhhhhHHHHHHHHHHHHHHhhhCCChhhcccccchhhhHHHHHHHHHHH Confidence 01111222334555678888764 334445567778888889999998877643 3456777788888888 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHhccCCCccc---------------ccceeeEEe--CCCCCcCHHHHHHHHHH Q lcl|NC_010808. 385 EQRTKTKEGLFTKGL-RRRAKLLETILKNTRSIDANK---------------DFNTVRYVY--NRNLPKSLIEELKAYID 446 (512) Q Consensus 385 ~~k~~~~~~~~~~~l-~~~~~li~~~l~~~~~~~~~~---------------d~~~i~i~f--~~~~p~d~~~~~~~~~k 446 (512) ......+.+.|..++ ++++.++.++........... ...+++-.| ..-...-..+.++..+. T Consensus 432 ~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~ 511 (584) T protein:vir:95 432 GRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQN 511 (584) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHHHHHHH Confidence 888888888888877 777888877643321111000 001222221 11111111222232222 Q ss_pred H------------hccCChHHHHH------hCCC---CC-----CHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|NC_010808. 447 S------------GGKISQTTLMS------LFSF---FQ-----DPELEVKKIEEDEKESIKKAQKGIYKDPR 493 (512) Q Consensus 447 l------------~g~~s~et~~~------~~~~---v~-----d~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 493 (512) + .+.++...+.. .+|. .+ .++.|.+....+.++.....+.......- T Consensus 512 l~~ilq~~~~~~i~p~~~~~~l~~~ladl~~~p~~~~~~~~~~~~~Q~~~q~~~~~~q~~~~~~~~~~~~~~~ 584 (584) T protein:vir:95 512 LVGIFNSQIGQMILPHTSGKALATFVDDVTGLQGYEIFRPNVAVAEQAETQSLVAQAQEDLQLQAQMPAEGAI 584 (584) T ss_pred HHHHHHhhhhhhccccchHHHHHHHHHHHhCCCcccccCCCcccchhHHHHhhhHHHHHHHHHHHhhhhccCC Confidence 2 11233322222 1221 11 12223333322322222222222111111 No 100 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=99.53 E-value=6e-13 Score=87.58 Aligned_cols=473 Identities=12% Similarity=0.053 Sum_probs=212.0 Q ss_pred cccchhHHhhhcHHHHHHHHHHHH---HHHHHHHHHHHHHh--cccccccccccccccccc--cceeeecchHHHHHHHH Q lcl|NC_010808. 30 TYDGTESDLLQNINEVSKYIEHHM---DYQRPRLKVLSDYY--EGKTKNLVELTRRKEEYM--ADNRVAHDYASYISDFI 102 (512) Q Consensus 30 ~~~~~~~~~~~~~~~l~~~i~~~~---~~~~~r~~~~~~yy--~G~~~~~~~~~~~~~~~~--~~~ri~~n~~~~iv~~~ 102 (512) +....... ...+...+.... ...+.+...-.+|| .|.+.............. -.-.+.+|..+.+|+.. T Consensus 1 m~e~~~~~----~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v 76 (706) T protein:vir:10 1 MAESRQKQ----HERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRI 76 (706) T ss_pred CCcchHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHH Confidence 12211112 223333332222 23333444555676 566654332221111111 11257899999999999 Q ss_pred HhhhhccCceecC---C---chhH----HHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECC---------CCceE Q lcl|NC_010808. 103 NGYFLGNPIQCQD---D---DKDV----LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ---------DDETR 163 (512) Q Consensus 103 a~~l~g~~~~~~~---~---d~~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~---------~g~~~ 163 (512) +++..-+.+.+.. . +.+. ...+..+.+.|+.+.....+..+++++|.+|+-+..|- ++++. T Consensus 77 ~g~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~d~~~~~~~i~ 156 (706) T protein:vir:10 77 ISEYRNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFVNEYDPMDERQRIA 156 (706) T ss_pred hhHHHhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccccccCCCCCCccce Confidence 9998887776531 1 2222 33455566789999999999999999999998885431 12344 Q ss_pred EEE-EccceeEEEEeCCCC---ce-eEEE-EEEeeeee----------------------eccCCcceEEEEEEEcCCcE Q lcl|NC_010808. 164 LYK-SDAMSTFVIYDNTIE---RN-SIAG-VRYLRTKP----------------------IDKTDEDEVFTVDLFTSHGV 215 (512) Q Consensus 164 i~~-~~p~~~~~i~d~~~~---~~-~~~~-v~~~~~~~----------------------~~~~~~~~~~~~~~yt~~~~ 215 (512) +.. .+|.+. ++||+... .. ..++ .+.|...+ .++...+.+...+.|+.... T Consensus 157 i~~v~~p~~~-v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d~~~~~eyy~~~~~ 235 (706) T protein:vir:10 157 VEPIYDPARS-VWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPDVVYIAKYYEVRKE 235 (706) T ss_pred eeeeccchhc-eecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCCcceecccccccce Confidence 433 356542 34454210 00 1111 11110000 00111222333343433221 Q ss_pred ----EEEEecCCc--------------------cc-c---------------ccc--cccccccccccccceEeecCC-- Q lcl|NC_010808. 216 ----YRYLTSRTN--------------------GL-K---------------LTP--RENGFESHSFERMPITEFSNN-- 251 (512) Q Consensus 216 ----~~~~~~~~~--------------------~~-~---------------~~~--~~~~~~~~~~~~vPvv~~~n~-- 251 (512) .+|.....+ +. . ... ....+.+-+.+.+|+|++.-. T Consensus 236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~~~P~vP~~g~r~ 315 (706) T protein:vir:10 236 SVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGEHIPLIPVYGKRW 315 (706) T ss_pred eEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCCCccceEEEeeccc Confidence 111110000 00 0 000 001222333467788775321 Q ss_pred -----CCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhh---ccccc Q lcl|NC_010808. 252 -----ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYEN---RDTGI 323 (512) Q Consensus 252 -----~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~ 323 (512) ....|.+.++++.++.+|..+|.+.+.+-.... ....|... +-+.+...-..........+... ...+. T Consensus 316 ~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~--~~~~~~~~-~i~~~~~~~~~~~~~~~~~l~~~~~~~~~g~ 392 (706) T protein:vir:10 316 FIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPG--QTPIVDME-QIRGLEQHWEGRNRKRPAFLPLRTVTDKTGN 392 (706) T ss_pred cccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCC--cccccchh-HHHHHHHHhhhcccccccchhcccccCCCCc Confidence 223578889999999999999999988754333 22222211 11111100000000000000000 00000 Q ss_pred CCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 324 ETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRA 403 (512) Q Consensus 324 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~ 403 (512) .........++..+.-..++...+......|..++++-+.+.|. .+|.||+||................-+..+.+++. T Consensus 393 i~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~-~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g 471 (706) T protein:vir:10 393 VVAPANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQM-PSNVARETVNSLLNRSDMASFIYLDNMAKSLKRAG 471 (706) T ss_pred ccccccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCC-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00111222333343444566777888888899999888777775 45689999999888777777888888888888887 Q ss_pred HHHHHHHHhccCCC---------ccc---------------------cc----ceeeEEeCCCCCcCHHHHHHHHHHHhc Q lcl|NC_010808. 404 KLLETILKNTRSID---------ANK---------------------DF----NTVRYVYNRNLPKSLIEELKAYIDSGG 449 (512) Q Consensus 404 ~li~~~l~~~~~~~---------~~~---------------------d~----~~i~i~f~~~~p~d~~~~~~~~~kl~g 449 (512) ++++.+........ ... |. .+|.+.=.+..+.-..+..+.++.+.+ T Consensus 472 ~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~~ 551 (706) T protein:vir:10 472 EIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLLQ 551 (706) T ss_pred HHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHHH Confidence 77776654221100 000 11 122333334444445566666666533 Q ss_pred c-CCh--HH------HHHhCCCCCCHHHHHHHHHHHHH-------------HHHHHHHhhcccCC--CCCCCCCCCCCCc Q lcl|NC_010808. 450 K-ISQ--TT------LMSLFSFFQDPELEVKKIEEDEK-------------ESIKKAQKGIYKDP--RDINDDEQDDDTK 505 (512) Q Consensus 450 ~-~s~--et------~~~~~~~v~d~~~E~~ri~~E~~-------------~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 505 (512) . .|. .+ +++.+.+ ...++-.++|++... .....++....... ....-..+..... T Consensus 552 ~~~p~~~~~~~l~~~~~~~~d~-p~~~e~~e~irk~~~~q~~~~~~~~~eq~~~~q~qq~q~~q~~~~~~~~~aq~~~~q 630 (706) T protein:vir:10 552 GMLPQDPMRPALMGIIIDNMEG-EGLDDFKAFNRRQLLTQGIVKPRNQQEQAIVQQAQQAQATQPDPNMLLAQAQMVVAQ 630 (706) T ss_pred hcCCcchhhHHHHHHHHhhcCc-cchHHHHHHHHHhhcccCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 221 12 2332221 222222344432111 00000000000000 0000000000000 Q ss_pred CcccCCC Q lcl|NC_010808. 506 DTVDKKE 512 (512) Q Consensus 506 ~~~~~~e 512 (512) -+..+-+ T Consensus 631 A~~~k~~ 637 (706) T protein:vir:10 631 AEAQKSQ 637 (706) T ss_pred HHHHHHH Confidence 0000001 No 101 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.52 E-value=3.1e-12 Score=83.67 Aligned_cols=445 Identities=12% Similarity=0.025 Sum_probs=232.6 Q ss_pred ccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc--ccccccccccc------ Q lcl|NC_010808. 16 NRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVE--LTRRKEEYMAD------ 87 (512) Q Consensus 16 ~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~--~~~~~~~~~~~------ 87 (512) +....+.. +.+-+.+....-.+....+...+.|+|-...... ....+....++ T Consensus 1 ~~r~~~~~-------------------~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~ 61 (505) T protein:vir:96 1 MKRAEKKP-------------------SLAQRMVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYAD 61 (505) T ss_pred CCCCcccc-------------------chhhcccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHH Confidence 11111111 1111112111111222233334566653321100 00011111000 Q ss_pred -----e--e---eecchHHHHHHHHHhhhhc-cCceecCC--------chhHHHHHHHHHhc------------cChhHH Q lcl|NC_010808. 88 -----N--R---VAHDYASYISDFINGYFLG-NPIQCQDD--------DKDVLEAIEAFNDL------------NDVESH 136 (512) Q Consensus 88 -----~--r---i~~n~~~~iv~~~a~~l~g-~~~~~~~~--------d~~~~~~l~~~~~~------------n~~~~~ 136 (512) . | ...+|++-+|+..+..++| .+++.... +++..+.|+..|+. .+|... T Consensus 62 ~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~l 141 (505) T protein:vir:96 62 LASLVQRAREQSINNPYAKRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTL 141 (505) T ss_pred HHHHHHHHHHHHhcChHHHHHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHH Confidence 0 1 2368999999999999999 68877642 45555555555432 136777 Q ss_pred HHHHHHHHHhCCeEEEEEEECCCC--ceEEEEEccceeEEEEeC--CCCceeEEEEEEeeeeeeccCCcceEEEEEEEc- Q lcl|NC_010808. 137 NRSLGLDLSIYGKAYELMIRNQDD--ETRLYKSDAMSTFVIYDN--TIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFT- 211 (512) Q Consensus 137 ~~~~~~~~~~~G~a~~~v~~d~~g--~~~i~~~~p~~~~~i~d~--~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt- 211 (512) ...+.+..+..|.||+.......+ ..++..++|..+-.-++. .....+..+|.+ +..+.. ..+.++. T Consensus 142 q~l~~r~~~~dGE~f~~~~~~~~~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~------d~~Gr~--~aY~i~~~ 213 (505) T protein:vir:96 142 LHLWMETLARDGEVLVREHRGYPNKWGYALQILECDRLDLNYNADLQNGNRIRMSIEL------DAWERP--VAYHLLVN 213 (505) T ss_pred HHHHHHHHhhCCceEEEEeecCCCCcceEEEEechhhcCCCCCcccCCcCeEEeceEE------CCCCce--EEEEEeec Confidence 888999999999999888665444 257889999886333321 112334556533 111211 1222221 Q ss_pred -CCcEEEEEecCCccccccccccccccccccccc---eEeec-----CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010808. 212 -SHGVYRYLTSRTNGLKLTPRENGFESHSFERMP---ITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDL 282 (512) Q Consensus 212 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP---vv~~~-----n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~ 282 (512) |...+.. . ......+.+|| |+++- ....|.|.|.+++..+..++............. T Consensus 214 hPgd~~~~-~-------------~~~~~~~~rvpa~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~ 279 (505) T protein:vir:96 214 HPGDNSYC-Y-------------HYAGQTYERVPADEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELG 279 (505) T ss_pred CCCccccc-c-------------ccccccccccCHhHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHh Confidence 1111000 0 00111234454 34432 234689999999998888777666555555554 Q ss_pred cCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_010808. 283 NDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPN 362 (512) Q Consensus 283 ~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 362 (512) +.-..++++..+................+.+ +.......|-++++++++.+...+..+...+...|....++|- T Consensus 280 A~~a~fi~~~~~~~~~~~~~~~~~~~~~l~p------G~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~y 353 (505) T protein:vir:96 280 AKKVGFYEQDPEAYDQPPEDDQGEIVEEVEA------GTYQLLPYGIRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAY 353 (505) T ss_pred hhheeeeecCCccCCCccccccCccccccCC------ceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCH Confidence 4444455543222111111111111111111 1223346677899999988889999999999888888888874 Q ss_pred cccccccccchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhccCCCccc-c-cceeeEEeCCCC--CcCH Q lcl|NC_010808. 363 MKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTK-GLRRRAKLLETILKNTRSIDANK-D-FNTVRYVYNRNL--PKSL 437 (512) Q Consensus 363 ~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~~l~~~~~~~~~~-d-~~~i~i~f~~~~--p~d~ 437 (512) ..+..--++.|-.+.+..+......+...+..|.. .++.+++..++..-..+..+.+. + ..-..+.|..+. ..|. T Consensus 354 e~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP 433 (505) T protein:vir:96 354 NRLAHDLEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDP 433 (505) T ss_pred HHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccCh Confidence 33322113345556777777777777777766665 44445555444433334332211 1 112356675443 3577 Q ss_pred HHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccC Q lcl|NC_010808. 438 IEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDK 510 (512) Q Consensus 438 ~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (512) ...+++...+ +|+.|.+..+...| .|+++.++++.+|++...+.-.. ....+........++++....|+ T Consensus 434 ~Ke~~a~~~~i~~G~~t~~~~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl~-~~~~~~~~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 434 AKDSKAHSESIKNRTRSRSSIIRAAG--DDPEDVFDEIAWEEQLMRDKGVN-PTPPEQESKDATTDEEDDSASDD 505 (505) T ss_pred HHHHHHHHHHHHcCCCCHHHHHHHcC--CCHHHHHHHHHHHHHHHHHcCCC-CCCCCCCCCCCCCCCCCCCCCCC Confidence 7777777665 79999999999986 48999999999998776543221 11111111111222222222222 No 102 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.48 E-value=3.9e-12 Score=83.13 Aligned_cols=447 Identities=9% Similarity=-0.053 Sum_probs=221.4 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-cccccccccccc-------------cee---eecchHHHHHHH Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNL-VELTRRKEEYMA-------------DNR---VAHDYASYISDF 101 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~-~~~~~~~~~~~~-------------~~r---i~~n~~~~iv~~ 101 (512) .+. -.+... + .++-.+....||.|-...- ....-.+....+ .-| ...+|++-+|+. T Consensus 1 ~~~-~~~~~~-----~-~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~ 73 (530) T protein:vir:38 1 MKI-PSLVGP-----D-GKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQL 73 (530) T ss_pred Ccc-ceeecC-----c-cccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH Confidence 010 001110 1 1122344556665532100 000000000000 001 236899999999 Q ss_pred HHhhhhccCceecCC------------chhHHHHHHHHHhc--------------cChhHHHHHHHHHHHhCCeEEEEEE Q lcl|NC_010808. 102 INGYFLGNPIQCQDD------------DKDVLEAIEAFNDL--------------NDVESHNRSLGLDLSIYGKAYELMI 155 (512) Q Consensus 102 ~a~~l~g~~~~~~~~------------d~~~~~~l~~~~~~--------------n~~~~~~~~~~~~~~~~G~a~~~v~ 155 (512) .+..++|.+++.... +++..+.+...|+. .+|......+.+..++.|.+|+.+. T Consensus 74 ~~~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~ 153 (530) T protein:vir:38 74 HQDHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQAT 153 (530) T ss_pred HHHHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEee Confidence 999999999987542 23334555555531 2577888889999999999999886 Q ss_pred ECCCC----ceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccc Q lcl|NC_010808. 156 RNQDD----ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPR 231 (512) Q Consensus 156 ~d~~g----~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~ 231 (512) .++.+ .+++..++|..+-.-++......+..+|.+ +..+...-+ .++... ..+.....+. T Consensus 154 ~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~------d~~Gr~~aY--~i~~~~--------~~~~~~~~~~ 217 (530) T protein:vir:38 154 WDSDSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKI------NDSGAALGY--YVSDDG--------YPGWMAQNWT 217 (530) T ss_pred eccCCCCccceEEEEechhhcCCCCCCCCCCeeEeeeEE------CCCCceEEE--EEeecc--------CCCccccccc Confidence 65543 367899999876433333334456666643 222222222 222110 0000000000 Q ss_pred ccccccccccccceEeecC-----CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhh------- Q lcl|NC_010808. 232 ENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDE------- 299 (512) Q Consensus 232 ~~~~~~~~~~~vPvv~~~n-----~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~------- 299 (512) ... ....++.--|+++.. ...|.|+|.+++..+..++.............+.-..+|+...+..... T Consensus 218 ~~~-~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~ 296 (530) T protein:vir:38 218 YIP-RELPGGRPSFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGAD 296 (530) T ss_pred eee-eeeccChhHeEeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCC Confidence 000 001111112555433 3468999999888887777665544444444333333444322111000 Q ss_pred hhhhhhccccc--------cchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 300 VKKQKEANVLF--------LEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGT 371 (512) Q Consensus 300 ~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n 371 (512) .........-. .........+.......|.++++.+++.+...+..+.+.+...|....++|-..+..--++ T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~ 376 (530) T protein:vir:38 297 NKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQ 376 (530) T ss_pred cccccccccccchhhhhcccccceeccCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccc Confidence 00000000000 0000001111222345677899999888888999999999888888888774433221133 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhccCCCcc----ccc-----ceeeEEeCCC--CCcCHHH Q lcl|NC_010808. 372 QSGEAMKYKLFGLEQRTKTKEGLFTKG-LRRRAKLLETILKNTRSIDAN----KDF-----NTVRYVYNRN--LPKSLIE 439 (512) Q Consensus 372 ~Sg~Ai~~~~~~l~~k~~~~~~~~~~~-l~~~~~li~~~l~~~~~~~~~----~d~-----~~i~i~f~~~--~p~d~~~ 439 (512) .|-.+.+..+......+...+..|... ++.+++..+...-..+..+.+ .++ ....+.|..+ ...|... T Consensus 377 ~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~K 456 (530) T protein:vir:38 377 MSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLK 456 (530) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHH Confidence 455566777777777777777666553 344444333322222222211 111 1234566433 3467777 Q ss_pred HHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 440 ELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 440 ~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) .+++...+ +|+.|.+.++...| .|+++.++++.+|.+...+.-..... .+...........++++.|..- T Consensus 457 e~~a~~~~i~~G~~s~~~~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl~~~~-~~~~~~~~~~~~~~~~~~d~~~ 528 (530) T protein:vir:38 457 EVQEAVMLIEAGLSTYEKECAKRG--DDYQEIFAQQVRESMERRAAGLNPPA-WAAAAFEAGVKKSNEEEQDGAR 528 (530) T ss_pred HHHHHHHHHHcCCCCHHHHHHHcC--CCHHHHHHHHHHHHHHHHHcCCCCCC-CcccccCCCCCCCCCCCCCCCC Confidence 77776654 79999999999886 48999999999998776543221111 1110100111111111111111 No 103 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.48 E-value=7.2e-12 Score=81.66 Aligned_cols=445 Identities=8% Similarity=-0.022 Sum_probs=225.4 Q ss_pred hhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccc---cccc--cccc-cc-ccccc-----ee-----eecchHHHHHH Q lcl|NC_010808. 38 LLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTK---NLVE--LTRR-KE-EYMAD-----NR-----VAHDYASYISD 100 (512) Q Consensus 38 ~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~---~~~~--~~~~-~~-~~~~~-----~r-----i~~n~~~~iv~ 100 (512) +.+....+...+....-.++.+ .-...|.|-.. .... +... .+ ..... .| ...+|++-+|+ T Consensus 1 m~~~~~r~~~~~a~~~~~~~~~--~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~ 78 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQSAS--LGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVG 78 (553) T ss_pred Ccchhhhhhcccccccchhhhh--hhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 2223333333333232222221 11234444211 1100 0000 00 00000 01 23689999999 Q ss_pred HHHhhhhccCceecCC-------------chhHHHHHHHHHh---c-----------cChhHHHHHHHHHHHhCCeEEEE Q lcl|NC_010808. 101 FINGYFLGNPIQCQDD-------------DKDVLEAIEAFND---L-----------NDVESHNRSLGLDLSIYGKAYEL 153 (512) Q Consensus 101 ~~a~~l~g~~~~~~~~-------------d~~~~~~l~~~~~---~-----------n~~~~~~~~~~~~~~~~G~a~~~ 153 (512) .++..++|.+++..+. +++.++.+...|+ . .+|......+.+..+..|.+|+. T Consensus 79 ~~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~ 158 (553) T protein:vir:63 79 YQRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLAT 158 (553) T ss_pred HHHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEE Confidence 9999999999987542 1223333333332 1 24777888899999999999988 Q ss_pred EEECCCC----ceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEc--CCcEEEEEecCCcccc Q lcl|NC_010808. 154 MIRNQDD----ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFT--SHGVYRYLTSRTNGLK 227 (512) Q Consensus 154 v~~d~~g----~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt--~~~~~~~~~~~~~~~~ 227 (512) ..+.+.+ .+++..++|..+-.-++......+..+|.+ +..+... .+.++. |...+........+.. T Consensus 159 ~~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~------d~~Gr~v--aY~i~~~hPgd~~~~~~~~~~~~r 230 (553) T protein:vir:63 159 AEWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQY------DKRGRPQ--GYWIQVAHPGDLYQMAPDMYKWKF 230 (553) T ss_pred eeeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEE------CCCCceE--EEEeeccCCCccccccccccceee Confidence 7655432 357889999877555544444556667643 2222222 222322 2222211111111000 Q ss_pred ccccccccccccccccc---eEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhh Q lcl|NC_010808. 228 LTPRENGFESHSFERMP---ITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDE 299 (512) Q Consensus 228 ~~~~~~~~~~~~~~~vP---vv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~ 299 (512) + . .+..+| |+++ +....|.|.|.+++..+..++.............+.-..+++...+... . T Consensus 231 ~-~--------~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~~~~-~ 300 (553) T protein:vir:63 231 V-Q--------QSKPWGRRQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELPPEF-I 300 (553) T ss_pred e-c--------cccccChhHheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCChhh-h Confidence 0 0 011111 3332 2345689999999888877777665554444443333334442211100 0 Q ss_pred hhhhh------------------------hccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHH Q lcl|NC_010808. 300 VKKQK------------------------EANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIH 355 (512) Q Consensus 300 ~~~~~------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~ 355 (512) ..... ....+.+. .+.......|-+++++++..+...+..+.+.+...|. T Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~------pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~ia 374 (553) T protein:vir:63 301 HSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQID------GAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLA 374 (553) T ss_pred hhhcccccccccccccccccccccccccccccceeec------CceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHH Confidence 00000 00011111 1122234567789999988888889999999888888 Q ss_pred HHhccccccc-ccccccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhccCCCccc------------c Q lcl|NC_010808. 356 MFTNTPNMKD-DNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKG-LRRRAKLLETILKNTRSIDANK------------D 421 (512) Q Consensus 356 ~~s~~p~~~~-~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~-l~~~~~li~~~l~~~~~~~~~~------------d 421 (512) ...++|-..+ +.+ ++.|-.+.+..+..........+..|... .+.+++..++..-..+.++.+. . T Consensus 375 aglGi~Ye~lt~D~-s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~ 453 (553) T protein:vir:63 375 SAFGMSYEEFTRDF-SKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMK 453 (553) T ss_pred hhcCCCHHHHhhhc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhh Confidence 8887773332 222 23444566666666666666666656543 3334554443322323222111 0 Q ss_pred cceeeEEeCCCC--CcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCC----- Q lcl|NC_010808. 422 FNTVRYVYNRNL--PKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDP----- 492 (512) Q Consensus 422 ~~~i~i~f~~~~--p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~----- 492 (512) ...+.+.|..+. ..|....+++...+ +|+.|.+.++...| .|+++-++++++|.+...+.-... ..++ T Consensus 454 ~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl~~-~~~~~~~~~ 530 (553) T protein:vir:63 454 EALSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIARLG--GDFRKSFAQRAREDALLKKYGLTF-NLSAKRSLG 530 (553) T ss_pred hhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhC--CCHHHHHHHHHHHHHHHHHcCCCC-CCCCccccC Confidence 112346675543 35777777776665 79999999999986 488988999999877655431110 0011 Q ss_pred ---CCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 493 ---RDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 493 ---~~~~~~~~~~~~~~~~~~~e 512 (512) ......+++...++..+++| T Consensus 531 ~~~~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 531 DGRDAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred CCcccCCCCCCCCCCCCcccccC Confidence 01111222233344444444 No 104 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.47 E-value=7.5e-12 Score=81.57 Aligned_cols=447 Identities=9% Similarity=-0.033 Sum_probs=221.3 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccc---cccc--cccc-ccc-ccc-----ce--e---eecchHHHHHHH Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTK---NLVE--LTRR-KEE-YMA-----DN--R---VAHDYASYISDF 101 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~---~~~~--~~~~-~~~-~~~-----~~--r---i~~n~~~~iv~~ 101 (512) ++.+...... . .... ........||.|-.. .... +... .+. ... .. | ...+|++-+|+. T Consensus 1 ~~~p~~~~~~--~-~~~~-~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~ 76 (533) T protein:vir:34 1 MKTPTIPTLL--G-PDGM-TSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQL 76 (533) T ss_pred CCCchhhhhh--c-cccc-chHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH Confidence 2222111100 0 0111 122344567765321 1110 0000 000 000 00 1 236899999999 Q ss_pred HHhhhhccCceecCC------------chhHHHHHHHHHh----c----------cChhHHHHHHHHHHHhCCeEEEEEE Q lcl|NC_010808. 102 INGYFLGNPIQCQDD------------DKDVLEAIEAFND----L----------NDVESHNRSLGLDLSIYGKAYELMI 155 (512) Q Consensus 102 ~a~~l~g~~~~~~~~------------d~~~~~~l~~~~~----~----------n~~~~~~~~~~~~~~~~G~a~~~v~ 155 (512) .+++++|.+++..+. +++..+.++..|+ . .+|......+.+..++.|.+|+... T Consensus 77 ~~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~ 156 (533) T protein:vir:34 77 HQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQAT 156 (533) T ss_pred HHHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEee Confidence 999999999987652 2233344444432 1 2577778889999999999999887 Q ss_pred ECCCC----ceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccc Q lcl|NC_010808. 156 RNQDD----ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPR 231 (512) Q Consensus 156 ~d~~g----~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~ 231 (512) +++.+ ..++..++|..+-.-++......+..+|.+ +..+.. .-+.++... ..+....... T Consensus 157 ~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~------d~~Gr~--~aY~i~~~~--------~~~~~~~~~~ 220 (533) T protein:vir:34 157 WDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQI------NDSGAA--LGYYVSEDG--------YPGWMPQKWT 220 (533) T ss_pred eccCCCCccceEEEEechhhcCCCCCCCCCCceEeeeEE------CCCCCe--EEEEEeecC--------CCCccccccc Confidence 65543 367899999876544443333445666643 112222 222222110 0000000000 Q ss_pred ccccccccccccc---eEeec-----CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChh-hh-- Q lcl|NC_010808. 232 ENGFESHSFERMP---ITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPD-EV-- 300 (512) Q Consensus 232 ~~~~~~~~~~~vP---vv~~~-----n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~-~~-- 300 (512) ... .+..+| |+++. ....|.|.|.+++..+..++.............+.-..+++...+.... +. T Consensus 221 ~~~----~~~~v~a~~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~ 296 (533) T protein:vir:34 221 WIP----RELPGGRASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFIL 296 (533) T ss_pred eee----eeeccChhHeeeeccccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCccccccccc Confidence 000 011122 44432 2356899999988888777766554444444333333344422111000 00 Q ss_pred ----hhhhhcccc--------ccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccc Q lcl|NC_010808. 301 ----KKQKEANVL--------FLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNF 368 (512) Q Consensus 301 ----~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~ 368 (512) ....+...- ..........+.......|.++++++++.+...+..+...+...|....++|-..+..- T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D 376 (533) T protein:vir:34 297 GANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRN 376 (533) T ss_pred CCCcccccccccccchhhhhccCcceeeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhh Confidence 000000000 00000011112222346677899999888888999999999888888887774333221 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccCCCcc----ccc-----ceeeEEeCCC--CCcC Q lcl|NC_010808. 369 SGTQSGEAMKYKLFGLEQRTKTKEGLFTKGL-RRRAKLLETILKNTRSIDAN----KDF-----NTVRYVYNRN--LPKS 436 (512) Q Consensus 369 ~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l-~~~~~li~~~l~~~~~~~~~----~d~-----~~i~i~f~~~--~p~d 436 (512) -++.|-.+++..+......+...+..|...+ +.+++..+...-..+..+.+ .++ ....+.|..+ ...| T Consensus 377 ~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iD 456 (533) T protein:vir:34 377 YAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAID 456 (533) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccC Confidence 1334555666666666666666666555433 33444333322222322211 011 1234567444 3467 Q ss_pred HHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 437 LIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 437 ~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ....+++...+ +|+.|.+.++...| .|+++.++++.+|++...+.-... ..++......+...++.+..++.. T Consensus 457 P~Ke~~a~~~~i~~G~~s~~~~~a~~G--~D~~ev~~q~a~e~~~~~~~gl~~-~~~~~~~~~s~~~~~~~~~~~~~~ 531 (533) T protein:vir:34 457 GLKEVQEAVMLIEAGLSTYEKECAKRG--DDYQEIFAQQVRETMERRAAGLKP-PAWAAAAFESGLRQSTEEEKSDSR 531 (533) T ss_pred hHHHHHHHHHHHHcCCCCHHHHHHHcC--CCHHHHHHHHHHHHHHHHhcCCCC-CCCCCcCccCCCCCCCCCCcccCC Confidence 77777777665 79999999999987 489999999999987755432111 111111111111111222222222 No 105 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=99.47 E-value=4.2e-12 Score=82.96 Aligned_cols=470 Identities=13% Similarity=0.030 Sum_probs=202.9 Q ss_pred cccchhHHhhhcHHHHHHHHHHHHH---HHHHHHHHHHHHhc--cccccccccc----ccccccccceeeecchHHHHHH Q lcl|NC_010808. 30 TYDGTESDLLQNINEVSKYIEHHMD---YQRPRLKVLSDYYE--GKTKNLVELT----RRKEEYMADNRVAHDYASYISD 100 (512) Q Consensus 30 ~~~~~~~~~~~~~~~l~~~i~~~~~---~~~~r~~~~~~yy~--G~~~~~~~~~----~~~~~~~~~~ri~~n~~~~iv~ 100 (512) +.+.....+ ..+...+....+ .-+.....-.+||. |.+....... .....++| .+.+|..+.+|+ T Consensus 1 ma~~~~~~l----~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P--~~~~N~i~~~v~ 74 (720) T protein:vir:35 1 MAETLQKRH----EQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYP--KFEINKISTELN 74 (720) T ss_pred CchHHHHHH----HHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCC--eEEEccHHHHHH Confidence 121212222 222222222221 12222334556775 6654321111 01111222 377899999999 Q ss_pred HHHhhhhccCceecC--C----chhH----HHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECC----C-----Cc Q lcl|NC_010808. 101 FINGYFLGNPIQCQD--D----DKDV----LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ----D-----DE 161 (512) Q Consensus 101 ~~a~~l~g~~~~~~~--~----d~~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~----~-----g~ 161 (512) ..+++-.-+.+.+.. . +.+. ...+..+.+.|+.+.....++.+++++|.+|+-|..|- + +. T Consensus 75 ~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~~~~~ 154 (720) T protein:vir:35 75 RIISEYRHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMDERQR 154 (720) T ss_pred HHHhHHHhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccCCCCcccce Confidence 999998777765532 1 2222 23455566789999999999999999999999886641 1 12 Q ss_pred eEEEEE-cc-ceeEEEEeCCCCc-e---eE-EEEEEe----------e------------eeeeccCCcceEEEEEEEcC Q lcl|NC_010808. 162 TRLYKS-DA-MSTFVIYDNTIER-N---SI-AGVRYL----------R------------TKPIDKTDEDEVFTVDLFTS 212 (512) Q Consensus 162 ~~i~~~-~p-~~~~~i~d~~~~~-~---~~-~~v~~~----------~------------~~~~~~~~~~~~~~~~~yt~ 212 (512) +++..+ +| .++ +||+.... . .. ++++.| . ....++.+...+..+++|.. T Consensus 155 i~i~~v~~~~~~v--~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~~v~i~E~~~~ 232 (720) T protein:vir:35 155 ICLEPIYDPARSV--WFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVDVVYIAKYYEV 232 (720) T ss_pred eeEecccCchhhe--eecccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccCCCceEEEEeeEE Confidence 333321 22 222 23321100 0 00 001000 0 00000111122333333322 Q ss_pred CcE---------------EEEEecCC-------------------------ccccccc--cccccccccccccceEeecC Q lcl|NC_010808. 213 HGV---------------YRYLTSRT-------------------------NGLKLTP--RENGFESHSFERMPITEFSN 250 (512) Q Consensus 213 ~~~---------------~~~~~~~~-------------------------~~~~~~~--~~~~~~~~~~~~vPvv~~~n 250 (512) ..+ ..|..... .+..+.. ....+.+-|++.+|+|+|.- T Consensus 233 ~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g 312 (720) T protein:vir:35 233 KKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPVYG 312 (720) T ss_pred EEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEEEEe Confidence 211 11100000 0000000 01123444566677777532 Q ss_pred -------CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccc---cccchhhhhhcc Q lcl|NC_010808. 251 -------NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANV---LFLEPTVYENRD 320 (512) Q Consensus 251 -------~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~ 320 (512) .+...|.+.++++.++.+|...|.+.+.+. ..+...-.|...............+. ..+....... . T Consensus 313 ~r~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~--~~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~~~~~~~-~ 389 (720) T protein:vir:35 313 KRWFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSAT--QDTGSIPIVGKSQIKTLEKYWANRNKNRPAFLPLNEIVD-K 389 (720) T ss_pred eeeccCCCcccceeeecchhHHHHHHHHHHHHHHHHH--cCCccccccCcchHHHHHHHhhccccccccccccccccc-c Confidence 112357888999999999999999999985 34444434432221111111000000 0000000000 0 Q ss_pred cccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 321 TGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 400 (512) Q Consensus 321 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~ 400 (512) .+........+.....+.-..+...++..-...|..+|++-+-+.|.. +|+||+||...-..-.........-+..+.+ T Consensus 390 ~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~-sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~ 468 (720) T protein:vir:35 390 QGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMP-SNIAKETVNHLMHRSDMSSFIYLDNMAKSLK 468 (720) T ss_pred CcccccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000112334444444455666777777888889988887777764 4689999998766666666666666777777 Q ss_pred HHHHHHHHHHHhccCCC---------ccc---------------------cc----ceeeEEeCCCCCcCHHHHHHHHHH Q lcl|NC_010808. 401 RRAKLLETILKNTRSID---------ANK---------------------DF----NTVRYVYNRNLPKSLIEELKAYID 446 (512) Q Consensus 401 ~~~~li~~~l~~~~~~~---------~~~---------------------d~----~~i~i~f~~~~p~d~~~~~~~~~k 446 (512) ++.++++.+........ ... |. .+|.+.=.+..+.-..+..+.++. T Consensus 469 ~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m~q 548 (720) T protein:vir:35 469 RAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVLTN 548 (720) T ss_pred HHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHHHHH Confidence 76666665543221100 000 11 122333333444445555666666 Q ss_pred HhccCChHH---------HHHhCCCCCCHHHHHHHHHHHHHHHHH--------------HHHhhcccCCCCCCCCC---- Q lcl|NC_010808. 447 SGGKISQTT---------LMSLFSFFQDPELEVKKIEEDEKESIK--------------KAQKGIYKDPRDINDDE---- 499 (512) Q Consensus 447 l~g~~s~et---------~~~~~~~v~d~~~E~~ri~~E~~~~~~--------------~~~~~~~~~~~~~~~~~---- 499 (512) +.+.++... +++.+.+ ...++-.+++.+....... ..+.............. T Consensus 549 ll~~~~p~~~~~~~~~~~ile~~d~-p~~~e~~erirk~~~~~~~~~~~~~e~qq~~a~~qq~~qq~~~e~~~aqa~l~q 627 (720) T protein:vir:35 549 LLAGMLPQDPMRQVLQGIILDNMEG-EGLDEFKEYNRKQLLTQGVVKPRNTEEEQMVAQMIQQAQQPNAELVAAQGVLMQ 627 (720) T ss_pred HHHhcCCCchhHHHHHHHHHHhcCc-hhHHHHHHHHHhhcchhcccCccChhHHHHHHHHHHHHHhHhHHHHHHHHHHHH Confidence 544343221 2222221 2222333444332211000 00000000000000000 Q ss_pred -CCCCCcCcccCCC Q lcl|NC_010808. 500 -QDDDTKDTVDKKE 512 (512) Q Consensus 500 -~~~~~~~~~~~~e 512 (512) +.+..+-..+... T Consensus 628 aqae~~kaqa~~~~ 641 (720) T protein:vir:35 628 GQAEVQKAKNEELA 641 (720) T ss_pred HHHHHHHHHHHHHH Confidence 0000000000000 No 106 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.41 E-value=7.6e-13 Score=87.01 Aligned_cols=401 Identities=10% Similarity=0.036 Sum_probs=195.7 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCCch Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDK 118 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~ 118 (512) ....+-+.++...--.. |-+....|+.+...... .... .-..+.+++.+|+..+.-++-+++.+++++. T Consensus 1 ~~~~D~~~~~~~~~g~~---~~~~~~~~~~~~~~~~~-------~l~a-~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~d~ 69 (437) T protein:vir:52 1 MKFFDGIKSLALKLGSK---QEQTYYSPSLSLTDDLV-------QLEA-LWRDNWIANKVCIKRPEDMVRNWREIYSNDL 69 (437) T ss_pred CchhhhhHhHHhcCCCc---cccceeecCccccccHH-------HHHH-HHHhCchhhHHhhcchHHhhcCCceEecCCC Confidence 01111111111000000 00000001111000000 0000 0124689999999999999999999988643 Q ss_pred --hHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCC---------Cce-EEEEEccceeEEEEeC-CCCceeE Q lcl|NC_010808. 119 --DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD---------DET-RLYKSDAMSTFVIYDN-TIERNSI 185 (512) Q Consensus 119 --~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~---------g~~-~i~~~~p~~~~~i~d~-~~~~~~~ 185 (512) +..+.++..|+.-++...+.++.+.+-.||.+++++-.+.. |.+ .+.+++|.++.|..-. .....+- T Consensus 70 ~~~~~~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s~~ 149 (437) T protein:vir:52 70 NSKQLDLFTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLSPN 149 (437) T ss_pred CHHHHHHHHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhccccccccccccccc Confidence 34457888888888999999999999999999999877542 222 2556666665543211 1111111 Q ss_pred EE-EEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHH Q lcl|NC_010808. 186 AG-VRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITL 264 (512) Q Consensus 186 ~~-v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~l 264 (512) ++ ..+|.+.. .... ..|.+.++.+|.... +| ...++-.|.|.++.+.+- T Consensus 150 fg~p~~y~v~~----~~~~----~~iH~SRii~~~~~~--------------------~~--~~~~~~~G~s~le~~~~~ 199 (437) T protein:vir:52 150 FGRYSEYSILG----GSQS----ITVHHSRLIILNAND--------------------AP--LSDNDIWGVSDLEKIIDV 199 (437) T ss_pred cCcceEEEEec----CCcc----eeEccceeEEecCcc--------------------CC--CccccccCCchHHHHHHH Confidence 11 11121110 0000 012223333332210 12 122345689999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCceeeeecCCc---CC-hhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCH Q lcl|NC_010808. 265 IDLYDNAESDTANYMSDLNDAMLLIKGNLS---LD-PDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDV 340 (512) Q Consensus 265 iDa~~~~~s~~~~~~~~~~~~~lv~~g~~~---~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 340 (512) +.+++++.-.....+..+..+.+.+.|... .. .+....... . . ............+.+.+++. .+.+. T Consensus 200 i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~~~~~~~~~~~~--~--~--~~~~~~~~~~~~d~~~~~e~--~~~~~ 271 (437) T protein:vir:52 200 LKRFDSASVNVGDLIFESKIDIFKIAGLSDKIAAGMENEVASVIS--A--V--QEIKSATNSLLLDAENEYDR--KELTF 271 (437) T ss_pred HHHHHHHHHHHHHHHHHcCCCceecchHHHHhcCCcHHHHHHHHH--H--H--HHhcCCCceEEEcCCcceEE--EecCc Confidence 999999888877777777777666655211 11 111111000 0 0 00011111111223334444 44566 Q ss_pred HHHHHHHHHHHHHHHHHhcccccccccc--cccchHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhccCCC Q lcl|NC_010808. 341 QGTEAYKDRLNSDIHMFTNTPNMKDDNF--SGTQSGEAMKYKLFGLEQRTKTKE-GLFTKGLRRRAKLLETILKNTRSID 417 (512) Q Consensus 341 ~~~~~~~~~l~~~i~~~s~~p~~~~~~~--~~n~Sg~Ai~~~~~~l~~k~~~~~-~~~~~~l~~~~~li~~~l~~~~~~~ 417 (512) ......++...+.|...+++|-.-+... +|=.||..-. ...+..++..+ ..+...+++++.+++.- ..+. T Consensus 272 sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D~---~~yyd~i~~~Qe~~l~p~le~l~~~i~~~--~~g~-- 344 (437) T protein:vir:52 272 TGLKDLLTEFRNAVAGAADMPVTILFGQSVSGLASGDEDI---QNYHEAIRRLQETRLRPIFEIIDPLICNE--LFGG-- 344 (437) T ss_pred CCHHHHHHHHHHHHHHHhcCchhhhcCcCcccccccHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHH--hcCC-- Confidence 7788899999999999999997554322 1223455333 33344455554 45788888888876542 1121 Q ss_pred cccccceeeEEeCCCCCcCHHHHHHHHHHH---------hccCChHHHHHhC------CCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 418 ANKDFNTVRYVYNRNLPKSLIEELKAYIDS---------GGKISQTTLMSLF------SFFQDPELEVKKIEEDEKESIK 482 (512) Q Consensus 418 ~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl---------~g~~s~et~~~~~------~~v~d~~~E~~ri~~E~~~~~~ 482 (512) .+ .++++.|++-...+..+.|+...+. +|+++.+.+.+.| +.+++ ++++ ..+ T Consensus 345 ~~---~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i~~~e~r~~L~~~g~~~~i~~--~~~~--------~~~ 411 (437) T protein:vir:52 345 LP---ADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVLNEYQIANELRESGLFANISA--EHIE--------ELK 411 (437) T ss_pred CC---CcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCCCCc--cccc--------ccc Confidence 11 2578999999888988887764432 4667766655543 11221 0000 000 Q ss_pred HHHhhcccCCCCCCCCCCCCCCcCcccC Q lcl|NC_010808. 483 KAQKGIYKDPRDINDDEQDDDTKDTVDK 510 (512) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (512) ... ...++.. ..+...+.+..+..++ T Consensus 412 ~~~-~~~~~~~-~~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 412 NAD-EFAGNFE-EPEKMEGAQVQNSEDQ 437 (437) T ss_pred CCC-CCCCccC-CCCCCCCCCCCCCCCC Confidence 000 0000000 0011111112222222 No 107 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.40 E-value=2.9e-11 Score=78.33 Aligned_cols=437 Identities=13% Similarity=0.051 Sum_probs=228.3 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccc-----------ee-----eecchHHHHHHHH Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMAD-----------NR-----VAHDYASYISDFI 102 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~-----------~r-----i~~n~~~~iv~~~ 102 (512) .+.++.+...++.....++.+-+...+-|+|-..--..... .....++ .| ...+|++-+|+.+ T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~-~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~ 79 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAK-RQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRL 79 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCcccccccc-CCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 34455566666555544444444445556664321100000 0111010 01 1368999999999 Q ss_pred Hhhhhcc-CceecC----Cch----hHHHHHHHHH----hc------cChhHHHHHHHHHHHhCCeEEEEEEECCCC--- Q lcl|NC_010808. 103 NGYFLGN-PIQCQD----DDK----DVLEAIEAFN----DL------NDVESHNRSLGLDLSIYGKAYELMIRNQDD--- 160 (512) Q Consensus 103 a~~l~g~-~~~~~~----~d~----~~~~~l~~~~----~~------n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g--- 160 (512) ++.++|. ++.++. .+. +..+.++..| +. .+|......+.+..+..|.+|+...++..+ T Consensus 80 ~~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~ 159 (548) T protein:vir:95 80 EERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYT 159 (548) T ss_pred HHhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeeccccccc Confidence 9999984 444432 222 2233333333 22 347888888999999999999988765433 Q ss_pred -----ceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEc--CCcEEEEEecCCcccccccccc Q lcl|NC_010808. 161 -----ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFT--SHGVYRYLTSRTNGLKLTPREN 233 (512) Q Consensus 161 -----~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt--~~~~~~~~~~~~~~~~~~~~~~ 233 (512) ..++..++|..+-.-++.. ...+..+|.+ +..+...-| .++. |....... .. T Consensus 160 ~g~~~~~~lqliepd~l~~~~~~~-~~~i~~GIE~------D~~Grp~aY--~i~~~hPgd~~~~~--~~---------- 218 (548) T protein:vir:95 160 FATSVPFALELLEPDYLPFSYNNL-SKGIVQGIER------DTWRRKRAY--HLLKDHPGNLQTLG--GS---------- 218 (548) T ss_pred CCcccceEEEEechhhcCCCCCCC-CCceeeeeEE------CCCCceEEE--EEeecCCCcccccc--cc---------- Confidence 1478899998863323222 2345555532 222222222 2222 11111100 00 Q ss_pred ccccccccccc---eEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCCh-hhhhhhh Q lcl|NC_010808. 234 GFESHSFERMP---ITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDP-DEVKKQK 304 (512) Q Consensus 234 ~~~~~~~~~vP---vv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~-~~~~~~~ 304 (512) ..+.+|| |+++ +....|.|.|.+++..+..++.............+.-..+++...+... ....... T Consensus 219 ----~~~~rvpA~~VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~~~~~~~ 294 (548) T protein:vir:95 219 ----LAVKRVEAERIIHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSYTVEPGKDR 294 (548) T ss_pred ----cceeeechhHheecccccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccCCCCccc Confidence 0112222 3332 2334689999999888877777665555555544444444443211111 1111111 Q ss_pred hccccccchhhhhhccccc-CCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccc-cccccccchHHHHHHHHH Q lcl|NC_010808. 305 EANVLFLEPTVYENRDTGI-ETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMK-DDNFSGTQSGEAMKYKLF 382 (512) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~-~~~~~~n~Sg~Ai~~~~~ 382 (512) ....+.+.+ +..+ ....|-++++++++.+...++.+...+...|....++|-.. .+.++ .|-.+.+..+. T Consensus 295 ~~~~~~~~p------G~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s--~nYSS~R~~l~ 366 (548) T protein:vir:95 295 KNRTIPIAP------GMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYD--GTYSAQRQELV 366 (548) T ss_pred ccccccccC------CccccccCCCceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc--hhHHHHHHHHH Confidence 111111211 1112 24566789999988888899999999989898888888332 23332 35667777777 Q ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCCccc---ccceeeEEeCCCC--CcCHHHHHHHHHHH--hccCChH Q lcl|NC_010808. 383 GLEQRTKTKEGLFTKGLRR-RAKLLETILKNTRSIDANK---DFNTVRYVYNRNL--PKSLIEELKAYIDS--GGKISQT 454 (512) Q Consensus 383 ~l~~k~~~~~~~~~~~l~~-~~~li~~~l~~~~~~~~~~---d~~~i~i~f~~~~--p~d~~~~~~~~~kl--~g~~s~e 454 (512) .........+..|...+-+ +++..++..-..+.++.+. ....+.+.|..+. ..|....+++...+ +|+.|.+ T Consensus 367 e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~ 446 (548) T protein:vir:95 367 EGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEA 446 (548) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHH Confidence 7766777666666544333 5554444333333332211 1123567785443 36888888877665 7999999 Q ss_pred HHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCC----CCCCCCCCCCcCcc--------cC--CC Q lcl|NC_010808. 455 TLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRD----INDDEQDDDTKDTV--------DK--KE 512 (512) Q Consensus 455 t~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~--------~~--~e 512 (512) .++...| .|+++.++++.+|.+...+.-... ..++.. ...+..+++.++.. |+ || T Consensus 447 ~~~a~~G--~D~~ev~~q~a~E~~~~~~~GL~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (548) T protein:vir:95 447 EVARARG--RDPRELKKSRETEIKANRAAGLVF-SSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEAREL 515 (548) T ss_pred HHHHHhC--CCHHHHHHHHHHHHHHHHHcCCCC-CCcccccccccccCCCCchhhhccccccccccchhHHh Confidence 9999986 489988999999987654432111 111100 00000011111111 11 11 No 108 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.36 E-value=6.1e-11 Score=76.60 Aligned_cols=444 Identities=8% Similarity=-0.027 Sum_probs=216.4 Q ss_pred hhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc---ccccccc-ccc--- Q lcl|NC_010808. 14 RENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVE---LTRRKEE-YMA--- 86 (512) Q Consensus 14 ~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~---~~~~~~~-~~~--- 86 (512) |+.+.. .++.... ..+-....+-|+|-...-.. +...++. ... T Consensus 1 m~~~~~---------------------------~~~a~~~---~~~~~~~~~~y~aa~~~~~~~~~~~~s~d~~~~~~~~ 50 (495) T protein:vir:10 1 MNMTPS---------------------------GYQSLAS---GLLVPVGASAYEGASGGHRWQDIGDYGPDTAVASGIQ 50 (495) T ss_pred CCcccc---------------------------cccccch---hhhhHHHhhhhhccccCcccCCCCCCChhHHHHHHHH Confidence 111111 0000000 00111112334443221000 0000000 000 Q ss_pred ----cee---eecchHHHHHHHHHhhhhccCceecC--CchhHHHHHHHHHh---c-------cChhHHHHHHHHHHHhC Q lcl|NC_010808. 87 ----DNR---VAHDYASYISDFINGYFLGNPIQCQD--DDKDVLEAIEAFND---L-------NDVESHNRSLGLDLSIY 147 (512) Q Consensus 87 ----~~r---i~~n~~~~iv~~~a~~l~g~~~~~~~--~d~~~~~~l~~~~~---~-------n~~~~~~~~~~~~~~~~ 147 (512) .-| ...+|++-+|+..++.++|.+++... ++++..+.|+..|+ . .+|......+++..... T Consensus 51 ~lr~RaRdl~rNn~~a~~av~~~~~~vVG~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~d 130 (495) T protein:vir:10 51 TLRARSHHNVRNNPWATNAVATWVAAAVGNGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINS 130 (495) T ss_pred HHHHHHHHHHhcChHHHHHHHHHHHhhcCCCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhC Confidence 001 23689999999999999999998754 45555555555442 1 36777888899999999 Q ss_pred CeEEEEEEECC--CC---ceEEEEEccceeEEEEeC---CCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEE Q lcl|NC_010808. 148 GKAYELMIRNQ--DD---ETRLYKSDAMSTFVIYDN---TIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYL 219 (512) Q Consensus 148 G~a~~~v~~d~--~g---~~~i~~~~p~~~~~i~d~---~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~ 219 (512) |.+|+.+.+.+ +| .+++..++|..+---++. .....+..+|.+ +..+...-|++.--.|.... + T Consensus 131 GE~f~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~~g~~i~~GIe~------d~~Gr~vaY~i~~~hpgd~~-~- 202 (495) T protein:vir:10 131 GEAFVIKKPRPLSEGLSVPLQLQIIEPDMLASDIPDETLPSGGYVKGGIRF------SNGGKRKAYCFYRNHPAESS-L- 202 (495) T ss_pred CceEEEEeecccCCCCccceEEEEechhhcCCCCCCCCCCCCCEEEeceEE------CCCCceEEEEEeecCCCccc-c- Confidence 99998775543 33 368999999987422221 222345666643 11222222221111111100 0 Q ss_pred ecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChh- Q lcl|NC_010808. 220 TSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPD- 298 (512) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~- 298 (512) ......+ ....-....|.| | ..+....|.|.+..++.| ..++.............+.-..+++...+.... T Consensus 203 ~~~~~~~--~rvpA~~vlH~f---~--~r~gQ~RGis~la~i~~l-~~l~~y~dael~~a~i~A~~~~fi~~~~~~~~~~ 274 (495) T protein:vir:10 203 IGDPVDT--VWIKAEHVLHVT---V--LTVRSDAGAPWFQLLLRL-NELDQYEDAELVRKKTAALFAAFIQEATADSTGG 274 (495) T ss_pred cccccce--eeechhheEecc---c--cCCCcccCcchhHHHHHH-HHhhHHHHHHHHHHHHhhhheeeeecCCCccccc Confidence 0000000 000001122332 1 112345688888877664 333333333222233333333344432111110 Q ss_pred hhh---hhhhc--cccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccch Q lcl|NC_010808. 299 EVK---KQKEA--NVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS 373 (512) Q Consensus 299 ~~~---~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~S 373 (512) ... ..... ....+. .+.......|.++++.++..+...+..+...+...|....++|-.....--++.| T Consensus 275 ~~~~~~~~~~~~~~~~~l~------pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~n 348 (495) T protein:vir:10 275 PTIGQPKRSKGGKRITGLN------PGTLQYLQPGQEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVN 348 (495) T ss_pred cccCccccccCcccceecC------CceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccccc Confidence 000 00000 011111 1122234667789999988888889999999888888888777333221112344 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHH-HHHHHHHHHHHHHHHhccCCCccccc----ceeeEEeCCCC--CcCHHHHHHHHH Q lcl|NC_010808. 374 GEAMKYKLFGLEQRTKTKEG-LFT-KGLRRRAKLLETILKNTRSIDANKDF----NTVRYVYNRNL--PKSLIEELKAYI 445 (512) Q Consensus 374 g~Ai~~~~~~l~~k~~~~~~-~~~-~~l~~~~~li~~~l~~~~~~~~~~d~----~~i~i~f~~~~--p~d~~~~~~~~~ 445 (512) -.+++..+......+...+. .+. ..++.+++..++..-..+.+..+..+ .-+.+.|..+. ..|....+++.. T Consensus 349 YSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~ 428 (495) T protein:vir:10 349 YSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADL 428 (495) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHH Confidence 55677776666666666543 343 34455555544433333433322111 11345674443 367777777776 Q ss_pred HH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCC-CCCCCCCcCcccCCC Q lcl|NC_010808. 446 DS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDIND-DEQDDDTKDTVDKKE 512 (512) Q Consensus 446 kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~e 512 (512) .+ +|+.|.+..+...| .|+++.++++++|++...+.-. .+..++..... ....+.+++..++.| T Consensus 429 ~~i~~G~~s~~~~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl-~~~~~p~~~~~~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 429 GDVRAGFAPISDKQAERG--YDMEELFDMISDANQLIDEYDL-RLDSDPRYVNGSGAEQKSVMEAALNNE 495 (495) T ss_pred HHHHcCCCCHHHHHHHcC--CCHHHHHHHHHHHHHHHHHcCC-CCCCCCCcCCCccCCCCCCCCCCCCCC Confidence 65 79999999999987 4888889999988876544321 11122222222 222233444444444 No 109 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.35 E-value=1.5e-12 Score=85.38 Aligned_cols=399 Identities=9% Similarity=0.021 Sum_probs=185.2 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccc---ccc- Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEE---YMA- 86 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~---~~~- 86 (512) |+. .+.+..-.+-..+ -+...+.+.-... ........ +.. T Consensus 1 ~~~------~m~~~~~~~~~~D-----------------------------~~~~~~~~~~g~~-~~~~~~~~~~~~~~l 44 (435) T protein:vir:79 1 MGV------FMSDKVKAITKED-----------------------------GYNEIFGSKDGTF-RPNAFYMQRAAFKAL 44 (435) T ss_pred CCc------ccccccccchhhc-----------------------------chhhhhccccccc-ccCcccCCcCCHHHH Confidence 321 1111110000000 0111111111100 00000000 000 Q ss_pred -ceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCC------ Q lcl|NC_010808. 87 -DNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD------ 159 (512) Q Consensus 87 -~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~------ 159 (512) ..-..+.+++.||+..+.-++.+++.+++++++ +.+...|+.-++...+.++.+.+..||.|++++-..+. T Consensus 45 ~~~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~~~--~~~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~P 122 (435) T protein:vir:79 45 SQFYEEDGMARRIVDVIPEEMVTPGFKVDGVKNE--KSFKSRWDELRLNAKIIDALSWSRLFGGSAILAVVADNKMLKSP 122 (435) T ss_pred HHHHhcCchhhhhhccchHHhhcCCceecCCChH--HHHHHHHHHhhHHHHHHHHHHhhhccccEEEEEEecCCCCcccc Confidence 001245889999999999999999999876432 45777777778889999999999999999988875322 Q ss_pred ----Cce-EEEEEccceeEEEEeCCCCceeEEE-EEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccccc Q lcl|NC_010808. 160 ----DET-RLYKSDAMSTFVIYDNTIERNSIAG-VRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPREN 233 (512) Q Consensus 160 ----g~~-~i~~~~p~~~~~i~d~~~~~~~~~~-v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~ 233 (512) |.+ .+.+++|.++.|-.-+.....+-++ ..+|.+....+.. -.. +.+.++.++... T Consensus 123 l~~~g~i~~i~v~d~~~i~~~~~~~dp~sp~fg~P~~y~v~~~~~~~-----~~~-iH~SRli~~~g~------------ 184 (435) T protein:vir:79 123 VKPGAQLEDIRVYDRYQITIHERETNARSVRYGEPKLYKISPGGDIP-----EFF-VHYSRICIIDGE------------ 184 (435) T ss_pred cccCCceeeEEeechhhccchhhccCCcccccCcceEEEEecCCCCC-----ceE-EcceeEEEecCC------------ Confidence 112 2344455444332111111111110 0112111000000 000 111122222110 Q ss_pred ccccccccccceE-eecCCCCCCcch-HHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcC---Chhhhhhhhhccc Q lcl|NC_010808. 234 GFESHSFERMPIT-EFSNNERRKGDY-EKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSL---DPDEVKKQKEANV 308 (512) Q Consensus 234 ~~~~~~~~~vPvv-~~~n~~~g~s~~-~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~---~~~~~~~~~~~~~ 308 (512) .+|-. ...++-+|.|.+ +.+.+.+.+++++....+..+..++.+.+.+.|.... ......... +. T Consensus 185 --------~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~--r~ 254 (435) T protein:vir:79 185 --------RVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDDEEGRYAARL--RL 254 (435) T ss_pred --------cchhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcCccchHHHHH--HH Confidence 01111 112345678876 6888888888888888888777777776666542111 111100000 00 Q ss_pred cccchhhhhhcccccCC-CCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccccccc-ccc-c-ccchHHHHHHHHHHH Q lcl|NC_010808. 309 LFLEPTVYENRDTGIET-EGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKD-DNF-S-GTQSGEAMKYKLFGL 384 (512) Q Consensus 309 ~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~-~-~n~Sg~Ai~~~~~~l 384 (512) .... ...+....... .++.+++. .+.+..+....++...+.|...+++|-.-+ +.. + -|.||..-...+... T Consensus 255 ~~~~--~~~~~~~~~~i~~~~e~~e~--~~~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~~yyd~ 330 (435) T protein:vir:79 255 AQVD--DESGVGKAIGIDATDEEYEV--LNSDVSGVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALETFYKL 330 (435) T ss_pred HHHH--HhcCCCCceeEecCCcceEE--EecccCCHHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHH Confidence 0000 00111111111 22223444 345677889999999999999999997543 222 2 245666544444433 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH---------hccCChHH Q lcl|NC_010808. 385 EQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS---------GGKISQTT 455 (512) Q Consensus 385 ~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl---------~g~~s~et 455 (512) +... .+..++..|++++.+++. ..++++.|+|-...+..+.|+...+. .|+++.+. T Consensus 331 i~~~--Qe~~l~p~l~~l~~li~~-------------s~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e 395 (435) T protein:vir:79 331 IDRK--RVEDYKPILEFLLPFMIS-------------ETEWSIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQAINLKE 395 (435) T ss_pred HHHH--HHHHHHHHHHHHHHHhhc-------------CCCCeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHH Confidence 3322 235577788887777542 02568999999999999888866554 34455444 Q ss_pred HHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccc-CCCCCCCCCCCCCCcCcccCC Q lcl|NC_010808. 456 LMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYK-DPRDINDDEQDDDTKDTVDKK 511 (512) Q Consensus 456 ~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 511 (512) +.+.+ +. ..+........ .+-+...+.+.++..+..+++ T Consensus 396 ~r~~L-------------~~----~~~~~~~~~~~~~~~~~~~d~~~~~~~e~g~~~ 435 (435) T protein:vir:79 396 TRDTL-------------RS----ICPDLKIMDNDNIELPEPEDLDPEPGQEGGLNK 435 (435) T ss_pred HHHHH-------------HH----hccccCCCCcccccCCccccCCCCCCCCCCCCC Confidence 43322 10 00000000000 000011111112222222222 No 110 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=99.35 E-value=5.3e-12 Score=82.42 Aligned_cols=475 Identities=12% Similarity=0.040 Sum_probs=188.2 Q ss_pred eccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccc Q lcl|NC_010808. 7 FETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMA 86 (512) Q Consensus 7 ~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~ 86 (512) .+.+.+.+. .+=+.-...-....|-+..+...-+++-.+...|- ..+.+...+.+||.|..... +. ..+++ T Consensus 1 ~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~--~~~gr- 71 (763) T protein:vir:95 1 MEQNTDSMV---PLPDPSQATKLTSWKNELSLQALKADLDAAKPSHT-AMMIKVKEWNDLMRIEGKAK--PP--KVKGR- 71 (763) T ss_pred CCcCccCcC---CCccccchhcCCCCCChHHHHHHHHHHHhhhcchh-HHHHHHHHHHHhhhccccCc--cc--ccCCC- Confidence 222222221 11111111111344444444444444444433333 33444445566654443221 11 11222 Q ss_pred ceeeecchHHHHHHHHHhhh----hccCc--eec---CCchh----HHHHHHH-HHhccChhHHHHHHHHHHHhCCeEEE Q lcl|NC_010808. 87 DNRVAHDYASYISDFINGYF----LGNPI--QCQ---DDDKD----VLEAIEA-FNDLNDVESHNRSLGLDLSIYGKAYE 152 (512) Q Consensus 87 ~~ri~~n~~~~iv~~~a~~l----~g~~~--~~~---~~d~~----~~~~l~~-~~~~n~~~~~~~~~~~~~~~~G~a~~ 152 (512) .+++.+-....|+.....| ++.+- .+. .+|.+ ....++. ++..|+-...+..++++++.+|.+++ T Consensus 72 -s~vv~~~v~~~ve~~~~~l~~~f~~~~~~~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~ 150 (763) T protein:vir:95 72 -SQVQPKLVRRQAEWRYSALTEPFLGSNKLFKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIV 150 (763) T ss_pred -ccccCHHHHHHHHHHHHHHHHhhcCCCcEEEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceE Confidence 2456666666666555544 33222 332 22222 2335555 35567666778899999999999999 Q ss_pred EEEECC-------------------------------------------------------------------------- Q lcl|NC_010808. 153 LMIRNQ-------------------------------------------------------------------------- 158 (512) Q Consensus 153 ~v~~d~-------------------------------------------------------------------------- 158 (512) .||++. T Consensus 151 k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (763) T protein:vir:95 151 RVGWNREIRKEKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEV 230 (763) T ss_pred EEeeeeeeeeeeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEE Confidence 988741 Q ss_pred ----CCceEEEEEccceeEEEEeCCCC-ceeEEEE-EEeeeee-e-------------c----c---------------- Q lcl|NC_010808. 159 ----DDETRLYKSDAMSTFVIYDNTIE-RNSIAGV-RYLRTKP-I-------------D----K---------------- 198 (512) Q Consensus 159 ----~g~~~i~~~~p~~~~~i~d~~~~-~~~~~~v-~~~~~~~-~-------------~----~---------------- 198 (512) .+.|+|..++|.++++-.+-..+ ....+.+ +++.+.. + + . T Consensus 231 ~~~~k~~p~ie~V~p~d~~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (763) T protein:vir:95 231 EVPLANHPTVEMLNPENIIIDPSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQ 310 (763) T ss_pred EEEecCceEEEeecHHHheecCCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhcc Confidence 02346667888887643321111 1111211 2111100 0 0 0 Q ss_pred -CC--cceEEEEEEEcC-----CcEEEEEe-cCCccccccccccccccccccccceEee-----cCCCCCCcchHHHHHH Q lcl|NC_010808. 199 -TD--EDEVFTVDLFTS-----HGVYRYLT-SRTNGLKLTPRENGFESHSFERMPITEF-----SNNERRKGDYEKVITL 264 (512) Q Consensus 199 -~~--~~~~~~~~~yt~-----~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~l 264 (512) .+ .+.+...++|.. +.+.++.. ...+... ......|.+.+.+|++.+ +...+|.|.++.++++ T Consensus 311 ~~d~~~~~V~v~E~y~~~d~~gdg~~~~~~v~~~g~~i---L~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~~~~d~ 387 (763) T protein:vir:95 311 ISDPMRKRVVAYEYWGFWDIEGNGVLEPIVATWIGSTL---IRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAELLGDN 387 (763) T ss_pred CCCcccceEEEEEeeeeeccCCcceeEEEEEEEEcCee---eecccccccCCCcCEEEecceeecCcccCCchHHHhhHH Confidence 00 011222233321 11111100 0001100 111223334456666643 3456789999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCcee-eeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHH Q lcl|NC_010808. 265 IDLYDNAESDTANYMSDLNDAML-LIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGT 343 (512) Q Consensus 265 iDa~~~~~s~~~~~~~~~~~~~l-v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 343 (512) ++.+|...+.+.+.+...++|.+ +..|.. +..+....+.+.++.+.++... ...+.....+...... T Consensus 388 Qr~~N~~~~~~~d~l~~~~~~~~~v~~gav--~~~d~~~~~pg~v~~v~~g~~~----------~~~~~~~~~p~~~~~~ 455 (763) T protein:vir:95 388 QAVLGAVMRGMIDLLGRSANGQRGMPKGML--DALNSRRYREGEDYEYNPTQNP----------AQMIIEHKFPELPQSA 455 (763) T ss_pred HHHHHHHHHHHHHHHHhhcCCcEEeecccc--cchhhhcccCCceEEeeCCCCh----------hhhcccccCCCCcchH Confidence 99999999999999988877655 334433 2222222222223322221111 0111222222223445 Q ss_pred HHHHHHHHHHHHHHhcccccccccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC---- Q lcl|NC_010808. 344 EAYKDRLNSDIHMFTNTPNMKDDNFSG--TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID---- 417 (512) Q Consensus 344 ~~~~~~l~~~i~~~s~~p~~~~~~~~~--n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~---- 417 (512) ...+..+...+-..++++..+.+..+. +.++.++...............+.|.++++.+++.++.++....... T Consensus 456 ~~~l~~~~~~~e~~TGv~~~~~G~~~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviR 535 (763) T protein:vir:95 456 LTMATLQNQEAESLTGVKAFAGGVTGESYGDVAAGIRGVLDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVR 535 (763) T ss_pred HHHHHHHHHHHHHhhCcchhhcCcCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEE Confidence 555666666666777777766543221 12222333333334444445556666677777777666654322111 Q ss_pred --c----cc---c---cceeeEEeCCCCCcCHH-HHHHHHHHH----hccCChH---HHH----HhCC---CC------- Q lcl|NC_010808. 418 --A----NK---D---FNTVRYVYNRNLPKSLI-EELKAYIDS----GGKISQT---TLM----SLFS---FF------- 463 (512) Q Consensus 418 --~----~~---d---~~~i~i~f~~~~p~d~~-~~~~~~~kl----~g~~s~e---t~~----~~~~---~v------- 463 (512) . +. + -.+|.+.-. +.+.. +.+..+..+ ...++.. -++ +... .+ T Consensus 536 I~g~e~v~v~~~~~~~~~DV~V~~~---~as~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q 612 (763) T protein:vir:95 536 ITNEEFVTIKREDLKGNFDLEVDIS---TAEVDNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQ 612 (763) T ss_pred EeCCccccccHHHhcCCcceEEecc---cchHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcC Confidence 0 00 0 012222222 12221 222222221 1112211 111 1110 00 Q ss_pred --CCH------HHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 464 --QDP------ELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 464 --~d~------~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) .++ +.+.++++.+.+......+... ........+.+...-+....+ T Consensus 613 ~~~d~~~q~qaqle~~~~q~e~~~~~akaq~~q---aqa~~~~aq~e~~~~d~~~~e 666 (763) T protein:vir:95 613 PQPDPVQEQLKQLAVEKAQLENEELRSKIRLND---AQAQKAMAERDNKNLDYLEQE 666 (763) T ss_pred CCccchhhhHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH Confidence 011 0111111111111000000000 000000000000000000000 No 111 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.27 E-value=1.5e-11 Score=80.00 Aligned_cols=394 Identities=13% Similarity=0.073 Sum_probs=183.1 Q ss_pred HHHHHHHHHHHHHHHhccccccccccccc-cccccc-ceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHh Q lcl|NC_010808. 52 HMDYQRPRLKVLSDYYEGKTKNLVELTRR-KEEYMA-DNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFND 129 (512) Q Consensus 52 ~~~~~~~r~~~~~~yy~G~~~~~~~~~~~-~~~~~~-~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~ 129 (512) .+......|. ...-|.++-...+... ...+.. ..-..+.+++.||+..+.-++.+++.+++++++ +.+...|+ T Consensus 1 ~~~~~~d~~~---~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~~~--~~~~~~~~ 75 (427) T protein:vir:10 1 MKIVKHDGYN---DIFNGGADGSPKPFFMSDASYHVGSFYNDNATAKRIVDVIPEEMVTAGFKMSGVKDE--KEFKSLWD 75 (427) T ss_pred CCccccchHH---HHhhcCCCCcccCccccCchHHHHHHHHcCchhhhhhccchHHhhcCCccccCccHH--HHHHHHHH Confidence 0001111111 1112211111000000 000000 001246889999999999999999999886533 45777777 Q ss_pred ccChhHHHHHHHHHHHhCCeEEEEEEECCCC----------c-eEEEEEccceeEEEEeCCCCceeEEE-EEEeeeeeec Q lcl|NC_010808. 130 LNDVESHNRSLGLDLSIYGKAYELMIRNQDD----------E-TRLYKSDAMSTFVIYDNTIERNSIAG-VRYLRTKPID 197 (512) Q Consensus 130 ~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g----------~-~~i~~~~p~~~~~i~d~~~~~~~~~~-v~~~~~~~~~ 197 (512) .-++...+.++.+.+..||.+++++-.+... . ..+.+++|.++.|-..+.....+-++ ..+|.+.. T Consensus 76 ~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~~~~dp~s~~fg~P~~y~v~~-- 153 (427) T protein:vir:10 76 SYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEKRVTNARSPRYGEPEIYKVSP-- 153 (427) T ss_pred HhhHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccccccCccccccCcceEEEEec-- Confidence 7788899999999999999999988664322 1 12344444443332111111111110 01111110 Q ss_pred cCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceE-eecCCCCCCcchH-HHHHHHHHHHHHHHHH Q lcl|NC_010808. 198 KTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPIT-EFSNNERRKGDYE-KVITLIDLYDNAESDT 275 (512) Q Consensus 198 ~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv-~~~n~~~g~s~~~-~v~~liDa~~~~~s~~ 275 (512) ... ...+ .+.+.++.++... .+|-. ...++-+|.|.+. .+.+.+..++++.-.. T Consensus 154 -~~~--~~~~-~iH~SRli~~~g~--------------------~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~ 209 (427) T protein:vir:10 154 -GDN--MQPY-LIHHSRVFIADGE--------------------RVAQQARKQNQGWGASVLNKSLIDAICDYDYCESLA 209 (427) T ss_pred -CCC--Ccce-EEccccEEEecCC--------------------CchhhhcccCCcccchhhhHHHHHHHHHHHHHHHHH Confidence 000 0000 0111122222110 01111 1123456888876 4677788888888777 Q ss_pred HHHHHHhcCceeeeecCCcC---ChhhhhhhhhccccccchhhhhhcccccCC-CCCcceeEEeecCCHHHHHHHHHHHH Q lcl|NC_010808. 276 ANYMSDLNDAMLLIKGNLSL---DPDEVKKQKEANVLFLEPTVYENRDTGIET-EGSVDGGYIYKQYDVQGTEAYKDRLN 351 (512) Q Consensus 276 ~~~~~~~~~~~lv~~g~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~~~~~~~~~~~l~ 351 (512) ...+..++..++.+.|.... ...... ... +.... ............ ..+.++ -..+.+..+....++... T Consensus 210 ~~l~~k~~~~v~k~~~l~~~~~~~~~~~~-~~~-r~~~~--~~~~~~~~~~~l~~~~e~~--e~~~~~lsgl~~~~~~~~ 283 (427) T protein:vir:10 210 TQILRRKQQAVWKVKGLAEMCDDDDAQYA-ARL-RLAQV--DDNSGVGRAIGIDAETEEY--DVLNSDISGVPEFLSSKM 283 (427) T ss_pred HHHHHHhccccccchhHHHHhcCccchHH-HHH-HHHHH--HHhcCcccceeeecCCCce--eEEecccCChHHHHHHHH Confidence 77777777776666553211 111100 000 00000 000111111111 222333 334566778889999999 Q ss_pred HHHHHHhcccccccccc---cccchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeE Q lcl|NC_010808. 352 SDIHMFTNTPNMKDDNF---SGTQSGEAMKYKLFGLEQRTKTK-EGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRY 427 (512) Q Consensus 352 ~~i~~~s~~p~~~~~~~---~~n~Sg~Ai~~~~~~l~~k~~~~-~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i 427 (512) +.|...+++|-.-+... +-|.||..=...+... ++.+ +..+...+++++.+++. ..++++ T Consensus 284 ~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~nyyd~---i~~~Qe~~l~p~l~~l~~~i~~-------------s~~~~~ 347 (427) T protein:vir:10 284 DRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKL---VDRKREEDYRPLLEFLLPFIVD-------------EEEWSI 347 (427) T ss_pred HHHHhhhCCCeeeeccCCccccccchhHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhhc-------------CCCcEE Confidence 99999999996644222 1245666543333333 3333 35578888888777642 025689 Q ss_pred EeCCCCCcCHHHHHHHHHHH---------hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCC Q lcl|NC_010808. 428 VYNRNLPKSLIEELKAYIDS---------GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDD 498 (512) Q Consensus 428 ~f~~~~p~d~~~~~~~~~kl---------~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 498 (512) .|+|-...+..+.|+...+. +|+++.+.+.+.+ ...-.. ...............+.. T Consensus 348 ~f~pL~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~e~r~~L-------------~~~~~~-~~~~~~~~~~~e~~~~~~ 413 (427) T protein:vir:10 348 EFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTL-------------RSIAPE-FKLKDGNNINIREPEETT 413 (427) T ss_pred EeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHH-------------Hhhhcc-ccCCCCccccccccchhc Confidence 99999999999988765443 3556665554433 110000 000000000000001111 Q ss_pred CCCCCCcCcccCCC Q lcl|NC_010808. 499 EQDDDTKDTVDKKE 512 (512) Q Consensus 499 ~~~~~~~~~~~~~e 512 (512) +.++..+|+.+++- T Consensus 414 e~~p~~~e~~~d~~ 427 (427) T protein:vir:10 414 EPEPGLGEKLEDEN 427 (427) T ss_pred CCCCCCCCCCCCCC Confidence 11111111111111 No 112 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.25 E-value=6.8e-11 Score=76.32 Aligned_cols=443 Identities=8% Similarity=0.011 Sum_probs=203.1 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |...++|...-.....+-..-+--.+.. ..++... ....+-..- ...+-..+..||....-+-+.. T Consensus 41 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-a~d~~~~-------~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~l--- 106 (537) T protein:vir:10 41 QLVHQTMMAIRDHAIAMMPKVDGSHPDM-AMDGLDV-------EGGTFSAYA---NPNLSEGLVLWYAQQAFIGHQM--- 106 (537) T ss_pred HhhhhccCCCCCccCcccccccccccch-hcccccc-------chhhhhhhc---cccccchhhhhccccCCccHHH--- Confidence 2222221111100000000000000000 0000000 000000000 0000111112222211110000 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecCCch-----hHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEE Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDK-----DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMI 155 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~-----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~ 155 (512) .+-+ ..+.+++.||+..+.-++-+++.++++++ +..+.|...|+.-++...+.++.+.+..||.+++++. T Consensus 107 ----~a~Y-~~~~l~r~iVd~~A~d~~r~~~~i~~~~~~~~~~~~~~~l~~~~~~l~~~~~l~~a~~~~rlyG~~~i~i~ 181 (537) T protein:vir:10 107 ----CALI-ATHWLVNKACSQMPRDAMRKGYKIISDDGNELDPKDAKFIDRYDRAFNIKKHAIQFVRKGRIFGIRIALFK 181 (537) T ss_pred ----HHHH-HhCchhhhhhhhhhHHhhcCCceeecCCcccccHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEe Confidence 0001 13478999999999999999999988643 3446678888888899999999999999999998876 Q ss_pred ECC-CCc----------------eEEEEEccceeEEEEeCCC---CceeEEE-EEEeeeeeeccCCcceEEEEEEEcCCc Q lcl|NC_010808. 156 RNQ-DDE----------------TRLYKSDAMSTFVIYDNTI---ERNSIAG-VRYLRTKPIDKTDEDEVFTVDLFTSHG 214 (512) Q Consensus 156 ~d~-~g~----------------~~i~~~~p~~~~~i~d~~~---~~~~~~~-v~~~~~~~~~~~~~~~~~~~~~yt~~~ 214 (512) .+. |+. ..+.+++|..+.|...+.. ...+-++ ..+|.+. . ..|.+.+ T Consensus 182 v~~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v~------g------~~iH~SR 249 (537) T protein:vir:10 182 VDSPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLIN------G------KKYHRSH 249 (537) T ss_pred ecCcCCcccccccccccccccceeEEEEechhhcccccchhhhccCCccccCCceeeeec------C------eEeccee Confidence 542 221 1245566666555321110 0111111 0111110 0 0122333 Q ss_pred EEEEEecCCccccccccccccccccccccceEee-cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC Q lcl|NC_010808. 215 VYRYLTSRTNGLKLTPRENGFESHSFERMPITEF-SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNL 293 (512) Q Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~-~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~ 293 (512) +.++.... +|-..- .++-.|+|.++.+.+.+.+++++.-..+..+..++.+++.+.+.. T Consensus 250 li~f~g~~--------------------~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~ 309 (537) T protein:vir:10 250 LAIYINDE--------------------VVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQ 309 (537) T ss_pred EEEecCCC--------------------CchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHH Confidence 33332111 111111 123359999999999999999998888888888888877665532 Q ss_pred cC-ChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccccccc-ccc--c Q lcl|NC_010808. 294 SL-DPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKD-DNF--S 369 (512) Q Consensus 294 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~--~ 369 (512) .. +.+.+.. +.-.+. ...........+.+ .-.|.....+.......++...+.|...+++|-.-+ +.. + T Consensus 310 ~l~~~~~~~~----r~~~~~--~~r~n~g~~~id~e-~e~~e~~~~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~G 382 (537) T protein:vir:10 310 VLANKQQFDE----TMSWWT--ATRDNYQVRVVDKD-NEDVVQIDTTLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTG 382 (537) T ss_pred hhcCHHHHHH----HHHHHH--hhcCCcceeEecCC-CceeEEEeccCCCHHHHHHHHHHHHHhhhCCCceeeccCCccc Confidence 11 1111111 000000 00111111111221 122334456677788899999999999999997643 321 2 Q ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHH---- Q lcl|NC_010808. 370 GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYI---- 445 (512) Q Consensus 370 ~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~---- 445 (512) -|.||..=...+... ++.+|..++..+++++.+|+... . + .+ .++++.|++-...+..+.|+... T Consensus 383 lnatGe~D~~~yyd~---I~~~Qe~l~p~l~~l~~ll~~~~-~---~-~~---~~~~i~f~pL~~~s~kEkAei~~~~a~ 451 (537) T protein:vir:10 383 FNSTGDYEEASYHEE---CESTQDDMRPLIDRHHQLVCRSH-L---R-KR---IRVKVEFPPMDAPKESERADTFLKKMQ 451 (537) T ss_pred cccchhHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhc-C---C-CC---cceEEEeCCCCCCCHHHHHHHHHHHHH Confidence 345677554444443 44444457889998888876532 1 1 11 25789999998899998877533 Q ss_pred ---HH--hccCChHHHHHhCCCCCCH-HHHH-HHHHHHHHHHHHH--HHhhcccCCCCCC--CCCCCCCCcCcccCCC Q lcl|NC_010808. 446 ---DS--GGKISQTTLMSLFSFFQDP-ELEV-KKIEEDEKESIKK--AQKGIYKDPRDIN--DDEQDDDTKDTVDKKE 512 (512) Q Consensus 446 ---kl--~g~~s~et~~~~~~~v~d~-~~E~-~ri~~E~~~~~~~--~~~~~~~~~~~~~--~~~~~~~~~~~~~~~e 512 (512) ++ .|+++.+.++..|..-.+. ...+ ..+..|..+.... ........+.+.+ +..+....+++..+.. T Consensus 452 a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (537) T protein:vir:10 452 AAKLAFEMGAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDIDVDDEGKPVRIIEDQPAPSEMFGATSSGESANDPR 529 (537) T ss_pred HHHHHHHcCCCCHHHHHHHHhccCccccccccCCCChhhhhcccCCccCCcCCCCCCCCCccccCCCCccccccCCCc Confidence 33 5789998888776321100 0000 0001111110000 0000000000000 0000111111111111 No 113 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=99.23 E-value=3.6e-10 Score=72.33 Aligned_cols=459 Identities=11% Similarity=0.064 Sum_probs=187.2 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHH----HHhccccccccccccccccccc Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLS----DYYEGKTKNLVELTRRKEEYMA 86 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~----~yy~G~~~~~~~~~~~~~~~~~ 86 (512) |+|..-...-|+.-...+=...++...-....+.-...+++........+..-. ++..|... ++...+..... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~---~~~~~~~~~l~ 77 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKT---KPSIRNNQDLH 77 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeeccccccc---CCccCChhHHH Confidence 655544444444322222222221111111111111122222211111121111 11222111 11111111111 Q ss_pred c-ee--eecchHHHHHHHHHhhhhc-----------cCceecC---------CchhHHHHHHHHHhcc---------Chh Q lcl|NC_010808. 87 D-NR--VAHDYASYISDFINGYFLG-----------NPIQCQD---------DDKDVLEAIEAFNDLN---------DVE 134 (512) Q Consensus 87 ~-~r--i~~n~~~~iv~~~a~~l~g-----------~~~~~~~---------~d~~~~~~l~~~~~~n---------~~~ 134 (512) . .+ ...++...+|++.+..+.+ -++.+.. .+......|.+++..- .+. T Consensus 78 ~l~~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~ 157 (547) T protein:vir:63 78 GVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFS 157 (547) T ss_pred HHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHH Confidence 0 01 1234556666655544331 1112211 1122223455554331 244 Q ss_pred HHHHHHHHHHHhCCeEEEEEEECCCCceE-EEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCC Q lcl|NC_010808. 135 SHNRSLGLDLSIYGKAYELMIRNQDDETR-LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSH 213 (512) Q Consensus 135 ~~~~~~~~~~~~~G~a~~~v~~d~~g~~~-i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~ 213 (512) .+...+..+.+.+|.+|+.+-++.+|++. +..++|..+.++.+... ......++|+.+. + .. ....+.++ T Consensus 158 ~f~~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~--~---~~---~~~~~~~~ 228 (547) T protein:vir:63 158 SFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADG-KIPDNGNRFVQVI--D---QK---IVATFNAR 228 (547) T ss_pred HHHHHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCcc-ccccCceEEEEEc--C---Cc---EEEEeccc Confidence 56777888999999999999999999764 67889998888765432 1111112222211 0 00 01123444 Q ss_pred cEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceee--eec Q lcl|NC_010808. 214 GVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLL--IKG 291 (512) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv--~~g 291 (512) .+++++... .........|.|.++.+...+.....+..-....+.-.+.|--+ +.| T Consensus 229 eiih~r~n~----------------------~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~ 286 (547) T protein:vir:63 229 EMAFAVRNP----------------------RSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKA 286 (547) T ss_pred cEEEecccC----------------------CCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecC Confidence 444443211 00000123477777777776666655554444445555555533 345 Q ss_pred CCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 292 NLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGT 371 (512) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n 371 (512) ....+++....++..-.-... +..-.....+..+++.+..-++.......+.+..+...+.|+..-++|....+-...+ T Consensus 287 ~~~ls~e~~~~lk~~~~~~~~-G~~nagk~~vl~~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~ 365 (547) T protein:vir:63 287 AQQQSQHALEIFKREWKNSLS-GINGSWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNG 365 (547) T ss_pred CCCCCHHHHHHHHHHHHHHhc-CcccccccccccCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCccccc Confidence 433454444443322110000 0000000112223444555555444455566677778888988889997766532211 Q ss_pred c----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHH- Q lcl|NC_010808. 372 Q----SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYID- 446 (512) Q Consensus 372 ~----Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~k- 446 (512) . ++..+-. +.. .......+...|.-+++.|...++..-... .. ..+.+.|......+..+.+..... T Consensus 366 ~~~~~~~~s~t~--sn~---e~~~~~~~~~tL~P~~~~ie~~ln~~L~~~--~~-~~~~~~f~~~~~~~~~~~~~~~~~~ 437 (547) T protein:vir:63 366 GATGSKGGSLNE--GNS---AEKNQASKNKGLQPLLGFIEDFINKHIVAE--FG-DKYTFQFVGGDIKSELESVKILAEK 437 (547) T ss_pred ccccccccccch--hhH---HHHHHHHHHHHHHHHHHHHHHHHHhhcccc--cC-CceEEEeeccccccHHHHHHHHHHH Confidence 1 0111100 000 111223345566666666655554432111 11 246778887777777776654433 Q ss_pred HhccCChHHHHHhCCCCCC-HH-HH------H----HHHHHH-----HHH-HHHHHHhhcccCCCCC--CCCCCCCCCcC Q lcl|NC_010808. 447 SGGKISQTTLMSLFSFFQD-PE-LE------V----KKIEED-----EKE-SIKKAQKGIYKDPRDI--NDDEQDDDTKD 506 (512) Q Consensus 447 l~g~~s~et~~~~~~~v~d-~~-~E------~----~ri~~E-----~~~-~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 506 (512) ..|+++.-.++++++.-.. +- .+ + +...++ .+. ..............+. ......+..++ T Consensus 438 ~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 517 (547) T protein:vir:63 438 AKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGD 517 (547) T ss_pred hCCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCCccccchhhccccccccCCCCCCCCCCCCCCcccCCC Confidence 2688999888888754221 00 00 0 001111 000 0000000000000000 00000000010 Q ss_pred ----cc-------cCCC Q lcl|NC_010808. 507 ----TV-------DKKE 512 (512) Q Consensus 507 ----~~-------~~~e 512 (512) +. +.+| T Consensus 518 ~~~d~~~~~~~~~~~~~ 534 (547) T protein:vir:63 518 IGKDGQRKDKDNANAGK 534 (547) T ss_pred cCccccccCccccchhh Confidence 00 0010 No 114 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.22 E-value=1e-10 Score=75.34 Aligned_cols=389 Identities=13% Similarity=0.067 Sum_probs=183.5 Q ss_pred HHHHHHHHHHhcccccc-ccccccc-ccccc-cceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccCh Q lcl|NC_010808. 57 RPRLKVLSDYYEGKTKN-LVELTRR-KEEYM-ADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDV 133 (512) Q Consensus 57 ~~r~~~~~~yy~G~~~~-~~~~~~~-~~~~~-~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~ 133 (512) ..+..-+...+-|-+.- ....... ..... ...-..+.+++.||+..+.-++-+++.+++++++ ..+..-|+.-++ T Consensus 1 ~~~~D~~~n~~~gg~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~~~~~~--~~~~~~~~~l~~ 78 (422) T protein:vir:10 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE--PAFWSRWDDLEM 78 (422) T ss_pred CccchhhHHHHcCCCCCccccCcccccCHHHHHHHHHhChhhHHHHhhhhHHHhcCCccccCCCHH--HHHHHHHHHhhH Confidence 11111111122231210 0000000 00000 0011246889999999999999999999887643 345666666688 Q ss_pred hHHHHHHHHHHHhCCeEEEEEEECCC----------Cce-EEEEEccceeEEEEeCCCCceeEEE-EEEeeeeeeccCCc Q lcl|NC_010808. 134 ESHNRSLGLDLSIYGKAYELMIRNQD----------DET-RLYKSDAMSTFVIYDNTIERNSIAG-VRYLRTKPIDKTDE 201 (512) Q Consensus 134 ~~~~~~~~~~~~~~G~a~~~v~~d~~----------g~~-~i~~~~p~~~~~i~d~~~~~~~~~~-v~~~~~~~~~~~~~ 201 (512) ...+.++.+.+..||.|++++-.+.. |.+ .+.+++|.++.|..-+.....+-++ ..+|.+...... T Consensus 79 ~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~~dp~s~~fg~P~~y~v~~~~~~-- 156 (422) T protein:vir:10 79 TQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTREENPRNARFGEPLTYRITTNESD-- 156 (422) T ss_pred HHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhcccCccccccCcceEEEEecCCCC-- Confidence 89999999999999999998876321 222 2444555554432111111111111 011111100000 Q ss_pred ceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccce-EeecCCCCCCcchHH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 202 DEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPI-TEFSNNERRKGDYEK-VITLIDLYDNAESDTANYM 279 (512) Q Consensus 202 ~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPv-v~~~n~~~g~s~~~~-v~~liDa~~~~~s~~~~~~ 279 (512) . -.. +.+.++.++... .+|- ....++-+|.|.+.. +.+.+.+++++.-.....+ T Consensus 157 -~--~~~-iH~SRli~~~g~--------------------~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~ 212 (422) T protein:vir:10 157 -M--FYD-VHYSRIHIIDGE--------------------RIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLL 212 (422) T ss_pred -c--cee-eccceeEEeCCC--------------------CchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 000 111222222110 0121 123345568898886 6788888888888877777 Q ss_pred HHhcCceeeeecCCcC--ChhhhhhhhhccccccchhhhhhcccccCC-CCCcceeEEeecCCHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 280 SDLNDAMLLIKGNLSL--DPDEVKKQKEANVLFLEPTVYENRDTGIET-EGSVDGGYIYKQYDVQGTEAYKDRLNSDIHM 356 (512) Q Consensus 280 ~~~~~~~lv~~g~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~ 356 (512) ..++...+.+.|.... ..+....... +... .....+....... +++.+++. .+.+.++....++...+.|.. T Consensus 213 ~~~~~~v~~~~~l~~~~~~~~~~~~~~~-r~~~--~~~~~~~~~~~~l~~~~e~~e~--~~~~lsgl~~~~~~~~~~iaa 287 (422) T protein:vir:10 213 KRKQQAVWKAKGLAELCDDSEGFGAARL-RLAQ--VDNNSGVGQAIGIDAESEEYSV--LNSDIGGIDAFLDKKFDRIVA 287 (422) T ss_pred HHhccccccchhHHHhcCCccchHHHHH-HHHH--HHHhcCCccceeEecCCcceEE--EecccCChHHHHHHHHHHHHh Confidence 7777776666542111 0100000000 0000 0000111111111 22233433 456677889999999999999 Q ss_pred Hhcccccccccc-cc--cchHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCC Q lcl|NC_010808. 357 FTNTPNMKDDNF-SG--TQSGEAMKYKLFGLEQRTKTKE-GLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRN 432 (512) Q Consensus 357 ~s~~p~~~~~~~-~~--n~Sg~Ai~~~~~~l~~k~~~~~-~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~ 432 (512) .+++|-.-+... .+ |.||..-...+.. .++..| ..++..+++++.+|+. ..++++.|+|- T Consensus 288 a~~IP~t~L~G~s~~Glnatgd~d~~~yyd---~i~~~Qe~~l~p~l~~l~~~i~~-------------s~~~~~~f~pL 351 (422) T protein:vir:10 288 LSGIHEIILKNKNVGGVSSSQNTALETFHK---LVDRKRNAELLPILEFLIPFIVN-------------AEEWSVEFNPL 351 (422) T ss_pred hhCCCeeeeccCCcccccccchHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhcc-------------cCCcEEEeCCC Confidence 999996644222 12 3455554433333 334343 4578888888877652 12567899999 Q ss_pred CCcCHHHHHHHHHHH---------hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCC Q lcl|NC_010808. 433 LPKSLIEELKAYIDS---------GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDD 503 (512) Q Consensus 433 ~p~d~~~~~~~~~kl---------~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (512) ...+..+.|+...+. .|+++.+.+.+.|-. .- ............+....+. +++. T Consensus 352 ~~~sekekaei~~~~a~a~~~~~~~g~i~~~e~r~~L~~-------------~~-~~~~~~~~~~~~~~~~~~~--~~~~ 415 (422) T protein:vir:10 352 AQESSKDKAEILEKNVNSIAALIAAGAMDIDEARDTLRT-------------IA-PEVKINDGSVETEVTISET--SNDP 415 (422) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHhhh-------------hc-ccccCCCCCCccccchhhc--CCCC Confidence 999999888875543 355555554444311 00 0000000000000000000 0000 Q ss_pred CcCcccC Q lcl|NC_010808. 504 TKDTVDK 510 (512) Q Consensus 504 ~~~~~~~ 510 (512) ..+..++ T Consensus 416 ~~~~~~d 422 (422) T protein:vir:10 416 LEVPTDD 422 (422) T ss_pred CCCCCCC Confidence 0000000 No 115 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.20 E-value=4.3e-10 Score=71.92 Aligned_cols=430 Identities=10% Similarity=0.017 Sum_probs=196.3 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) ||+...+.... + .-++-+.-.|..-...+... .. . .+. ...||.....+-+.. T Consensus 35 ~~~~~~~~~~~--~--~~~~~~~~~~~~a~~~g~~~-----~~----------~---~~~--~~~~~~~~~~~~~~l--- 87 (532) T protein:vir:94 35 LATAHEIDPTA--Y--SPYERNAAQNAMAMDYGLQT-----GR----------N---GRN--ALSFVEATSWPGFPT--- 87 (532) T ss_pred hhhhhhhcccc--c--ccccccccccccccccccCc-----cc----------c---ccc--ccccccccccchHHH--- Confidence 55443222211 0 01111111111100000000 00 0 000 001211111100000 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecCCc-----hhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEE Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDD-----KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMI 155 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d-----~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~ 155 (512) . +-+ -.+.+++.+|+..+.=++-+++.+++++ .+....|...|+.-++...+.++.+.+..||.+++++- T Consensus 88 ~----a~Y-~~~~l~r~~Vd~~aed~~r~~~~i~~~~~~~~~~~~~~~i~~~~~~l~v~~~l~~a~~~~rlyG~a~i~i~ 162 (532) T protein:vir:94 88 L----ALL-AQLPEYRTMHETPADECVRAWGKITCSSKDELAADKATRITQKLEQYNVRTLVRTVVIHDQAYGGAHVFPH 162 (532) T ss_pred H----HHH-HcCchhhhhhccchHHHhhCCceEeeCCccccchHHHHHHHHHHHhhhHHHHHHHHHHhhhcccceEEEEE Confidence 0 000 1246778999999999999999997743 23445667777766788899999999999999998876 Q ss_pred ECCCCc--------------------eEEEEEccceeEEEE-eCCCCceeEEE-EEEeeeeeeccCCcceEEEEEEEcCC Q lcl|NC_010808. 156 RNQDDE--------------------TRLYKSDAMSTFVIY-DNTIERNSIAG-VRYLRTKPIDKTDEDEVFTVDLFTSH 213 (512) Q Consensus 156 ~d~~g~--------------------~~i~~~~p~~~~~i~-d~~~~~~~~~~-v~~~~~~~~~~~~~~~~~~~~~yt~~ 213 (512) .+.+|. ..+.+++|.++.|-. +......+-++ ..+|.+. . ..-|.+. T Consensus 163 v~~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp~fg~P~~y~v~-----~------g~~iH~S 231 (532) T protein:vir:94 163 LKMDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLSPNAYNATDPTLPSFYKPDSWIAT-----S------GKKIHSS 231 (532) T ss_pred eccCCccccccccccccccccccceeeEEEeechheecccccccccccccccCCceeEEEc-----c------Ceeeccc Confidence 653331 124455555554431 11111111111 0011100 0 0012233 Q ss_pred cEEEEEecCCccccccccccccccccccccceEee-cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC Q lcl|NC_010808. 214 GVYRYLTSRTNGLKLTPRENGFESHSFERMPITEF-SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGN 292 (512) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~-~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~ 292 (512) ++.+|.... +|-... .++-+|.|.++.+.+.+..++++.-..+..+..++...+.+ +. T Consensus 232 Rli~f~g~~--------------------~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~-~~ 290 (532) T protein:vir:94 232 RIHTVVGRP--------------------VGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSMTNLAT-DM 290 (532) T ss_pred eEEEecCCC--------------------chhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeee-ch Confidence 333332110 111111 12235899999999999999998888877777777666554 32 Q ss_pred CcC-Chhhhhhhhhccccccchhhhhhc-ccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccc-- Q lcl|NC_010808. 293 LSL-DPDEVKKQKEANVLFLEPTVYENR-DTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNF-- 368 (512) Q Consensus 293 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-- 368 (512) ... ..+......+. .-.. ...... .......++.+++.+ ..+.+.+...++...+.|...+++|-.-+... T Consensus 291 a~~ls~~~~~~~~~r-~~~~--~~~~~n~g~~~id~~~e~~e~~--~~~lsgl~~~l~~~~~~iAaa~~IP~t~LfG~sp 365 (532) T protein:vir:94 291 AQLLAPGGAQSLDAR-LQLF--NLYRDNRNIGALDKGTEEIQQT--NTPLSGLDSLQAQSQEQMAAVSHIPLVKLLGITP 365 (532) T ss_pred HHhhcchhHHHHHHH-HHHH--HhhcCCccceEEcCCCceeEEE--ecccCCHHHHHHHHHHHHHhHhCCCeeeeecCCc Confidence 111 11111111110 0000 000000 111111122334443 45667788899999999999999997643221 Q ss_pred -cccchHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHH Q lcl|NC_010808. 369 -SGTQSGEAMKYKLFGLEQRTKTKE-GLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYID 446 (512) Q Consensus 369 -~~n~Sg~Ai~~~~~~l~~k~~~~~-~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~k 446 (512) +-|.||+.-...+ +..++.++ ..+...+++++.+++... .+ ..+ .++++.|++-...+..+.|+...+ T Consensus 366 ~GlnstGe~D~~~y---yd~I~s~Qe~~l~p~le~l~~~l~~s~--~g--~~~---~d~~~~f~pL~~~s~kEkAei~~~ 435 (532) T protein:vir:94 366 NGLNASSDGEIRVW---YDFIAGYQATNLTPLMEWIIDLIQLSE--YG--QID---PGLAWEWSPLMELDDKELAEVRQL 435 (532) T ss_pred ccccccchHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHh--cC--CCC---CCceEEeCCCCCCCHHHHHHHHHH Confidence 2245566433333 33444444 446788888887775421 11 111 257899999888888887775432 Q ss_pred -------H--hccCChHHHHHhCCCCCC--------HHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCccc Q lcl|NC_010808. 447 -------S--GGKISQTTLMSLFSFFQD--------PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVD 509 (512) Q Consensus 447 -------l--~g~~s~et~~~~~~~v~d--------~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (512) + .|++|.+.+.+.+..-.+ ..++++....+..+...........++ .+.....+++++.. T Consensus 436 ~a~a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~d~~ 512 (532) T protein:vir:94 436 NASTDSTLMELGVIDAKMVQQRLAADPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAP---QTPNPQPDSEDDQT 512 (532) T ss_pred HHHHHHHHHhcCCCCHHHHHHHHhcCCccccccccccccccccccchhhhhcccccCCCCCCC---CCCCCCCCCCCCCC Confidence 2 578999888887642111 111121111111111111111111100 00011111111111 Q ss_pred CCC Q lcl|NC_010808. 510 KKE 512 (512) Q Consensus 510 ~~e 512 (512) +.+ T Consensus 513 ~~~ 515 (532) T protein:vir:94 513 DNQ 515 (532) T ss_pred CCc Confidence 111 No 116 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.01 E-value=1.9e-09 Score=68.44 Aligned_cols=447 Identities=9% Similarity=-0.007 Sum_probs=199.2 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccch-hHHhhhcH-HHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGT-ESDLLQNI-NEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELT 78 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~-~~~~~~~~-~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~ 78 (512) |.....+...-.+ ...+..+... +..... -....+.+ +.+..+-..+.......-..+..||.....+-+ T Consensus 66 ~~~~~~~~~~~~~----~~~~a~~~a~-~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f~gy--- 137 (862) T protein:vir:99 66 VEISDSVNAKSVS----GKNFAMDSAV-RSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGFIGH--- 137 (862) T ss_pred ccccccccchhhh----hhhhcchhhc-chhhhhhhhhhhhcchhhhhhccccccccccccchhccccccccCcccH--- Confidence 4444444432211 0011111110 000000 00000000 001111000000000000111222211100000 Q ss_pred ccccccccceeeecchHHHHHHHHHhhhhccCceecCCc------hhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEE Q lcl|NC_010808. 79 RRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDD------KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYE 152 (512) Q Consensus 79 ~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d------~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~ 152 (512) ...+. -..+.+++.||+..+.-++-+++.+.+.+ .+..+.|...|+.-++...+.++.+.+-.||++++ T Consensus 138 ----ql~al-Y~~~~larkiVd~pAeDatR~g~~I~~~~d~~e~~~e~~~~ie~~~~rL~v~~~l~eair~~RLyGga~i 212 (862) T protein:vir:99 138 ----QACAL-IAQHWLVDKACSLAGEDAIRNGWHLKSLGEGEEIDEESLEKFKAIDVEFKVKENLIEFNRFKNVFGIRVA 212 (862) T ss_pred ----HHHHH-HHhCchhhhhhhhhhHHHhhCCceEeecCcccccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEE Confidence 00011 12357899999999999999999998742 34456788888888888999999999999999877 Q ss_pred EEEECC-CCc----------------eEEEEEccceeEEEEe---CCCCceeEEE-EEEeeeeeeccCCcceEEEEEEEc Q lcl|NC_010808. 153 LMIRNQ-DDE----------------TRLYKSDAMSTFVIYD---NTIERNSIAG-VRYLRTKPIDKTDEDEVFTVDLFT 211 (512) Q Consensus 153 ~v~~d~-~g~----------------~~i~~~~p~~~~~i~d---~~~~~~~~~~-v~~~~~~~~~~~~~~~~~~~~~yt 211 (512) ++-.+. |+. ..+.+++|..+.|+-. ......+-++ ..+|.+. . ..+. T Consensus 213 lilv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~sp~yGkP~~y~I~------g------~~IH 280 (862) T protein:vir:99 213 IFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMPMLTAESTADPSSQFFYEPEFWIIS------G------QKYH 280 (862) T ss_pred EEEecCcCchhhhcCcCcccccccceeEEEEechhhhcccccccccccccccccCCceeeeec------C------eeec Confidence 765432 221 1245556555544210 0000111100 0011100 0 0011 Q ss_pred CCcEEEEEecCCccccccccccccccccccccceEe-ecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee Q lcl|NC_010808. 212 SHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITE-FSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIK 290 (512) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~-~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~ 290 (512) +.++.++.... +|-.. -.++-+|.|.++.+.+.+.+++++.......+..++..++.+. T Consensus 281 ~SRliif~g~~--------------------vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~v~ktd 340 (862) T protein:vir:99 281 RSHLIIARGPQ--------------------PADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTTAIHTD 340 (862) T ss_pred cceeEEecCCC--------------------chhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeech Confidence 22222221100 11110 0123469999999999999999998888887777777776665 Q ss_pred cCCcCCh-hhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccccccc-cc- Q lcl|NC_010808. 291 GNLSLDP-DEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKD-DN- 367 (512) Q Consensus 291 g~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~- 367 (512) +...... +.+.. ....+.. ..........+.+.+++ ..+.+.+.+...++...+.|...+.+|-.-+ +. T Consensus 341 ~l~~l~~ed~l~~--r~~~~~~----~rdN~Gi~liD~eEe~e--~ls~slSGL~dll~~~~q~IAaas~IP~tiLfGqs 412 (862) T protein:vir:99 341 TAKAIANEDKFIQ--RLMFWVR----YRDNHAVKVLGTDETME--QFDTSLADFDAVIMGQYQLVASIAKTPATKLLGTA 412 (862) T ss_pred hHhhhccHHHHHH--HHHHHHh----ccCcceeEEecCCCcee--EEecccCChHHHHHHHHHHHHhhhCCCceeecccC Confidence 5322111 11111 0000110 00100011112233343 3456677888999999999999999997643 32 Q ss_pred -ccccchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHH Q lcl|NC_010808. 368 -FSGTQSGEAMKYKLFGLEQRTKTK-EGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYI 445 (512) Q Consensus 368 -~~~n~Sg~Ai~~~~~~l~~k~~~~-~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~ 445 (512) .|-|.||..=...+... +..+ +..+...|++++.++..-+ .. ..++++.|++-...+..+.|+... T Consensus 413 paGlnATGE~D~~nYyD~---I~s~QE~~L~P~LerL~~li~~~l----g~-----~~d~~ieFnpL~~~sekEkAEi~k 480 (862) T protein:vir:99 413 PKGFNSTGEFETISYHEE---LESIQEHVYMPFLQRHYLISRLSL----GI-----QHEIDVVMEPVASMTAQQQADLNK 480 (862) T ss_pred cccccCchHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhc----CC-----CCcceEEeCCCCCCCHHHHHHHHH Confidence 23346777433333333 3333 3557778887766553221 11 135789999999999998887643 Q ss_pred -------HH--hccCChHHHHHhC--------CCCCCHHHHHH-HHHHHHHHHHHHHHhhcccCCCCCCC-------CCC Q lcl|NC_010808. 446 -------DS--GGKISQTTLMSLF--------SFFQDPELEVK-KIEEDEKESIKKAQKGIYKDPRDIND-------DEQ 500 (512) Q Consensus 446 -------kl--~g~~s~et~~~~~--------~~v~d~~~E~~-ri~~E~~~~~~~~~~~~~~~~~~~~~-------~~~ 500 (512) ++ +|+++.+.++..| ..+++.+.|-. -+..+...............+.+... .+. T Consensus 481 k~Aea~~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~a~~~ap~de~~aga~~~~~e~ 560 (862) T protein:vir:99 481 TKAEGGKVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEETPGASPENLAAYQKAGAAQETASAKETQAGAAVTTAEG 560 (862) T ss_pred HHHHHHHHHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccccCCCCcccccccccCCcccccccccccccccCCccccC Confidence 33 5889888887753 22232221100 01111111100000000000000000 000 Q ss_pred CCCCcCcccCCC Q lcl|NC_010808. 501 DDDTKDTVDKKE 512 (512) Q Consensus 501 ~~~~~~~~~~~e 512 (512) +.+....+..+. T Consensus 561 d~~~~p~~~~~~ 572 (862) T protein:vir:99 561 DQPNVQMVPSMK 572 (862) T ss_pred CcccccccCCCC Confidence 000000000000 No 117 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=98.98 E-value=7.7e-09 Score=65.05 Aligned_cols=453 Identities=12% Similarity=0.100 Sum_probs=219.5 Q ss_pred cccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHH---HHHHHHHHHhcccccccccccccccccc Q lcl|NC_010808. 9 TDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQR---PRLKVLSDYYEGKTKNLVELTRRKEEYM 85 (512) Q Consensus 9 ~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~---~r~~~~~~yy~G~~~~~~~~~~~~~~~~ 85 (512) -+..++-..+.+=.. ...-.+ +.+|...-....+.+. ..++++++|-... + .+.....+ T Consensus 1 m~~~~~~~~~~~~~~--------~~~~~~----~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~~-~-----tr~t~~~~ 62 (599) T protein:vir:31 1 MSTDIKTLQKMLEGR--------DDDRAF----IDELVVLFTNMENARAQKDREDKELMDYIDAT-D-----TRKTSNSK 62 (599) T ss_pred CccchHHHHHHhhcc--------CchHHH----HHHHHHHHHhhhhhhhhhhcccHHHHHHHhhh-c-----ccccccCC Confidence 111222222222111 000111 1222222222222222 3456667774321 1 11122222 Q ss_pred c--ceeeecchHHHHHHHHHhhhhccCc------eec---CC--chhHHHHHHHH----HhccChhHHHHHHHHHHHhCC Q lcl|NC_010808. 86 A--DNRVAHDYASYISDFINGYFLGNPI------QCQ---DD--DKDVLEAIEAF----NDLNDVESHNRSLGLDLSIYG 148 (512) Q Consensus 86 ~--~~ri~~n~~~~iv~~~a~~l~g~~~------~~~---~~--d~~~~~~l~~~----~~~n~~~~~~~~~~~~~~~~G 148 (512) . .+++.+|-...+++.+..++++--+ .+. .+ ..+....++.+ +...+|...+..++.+...+| T Consensus 63 ~~w~~s~t~~k~~~~~~~l~a~~~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G 142 (599) T protein:vir:31 63 LPFKNSTTINKLAHLHLMITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRG 142 (599) T ss_pred CCcccccchHHHHHHHHHHHHHHHhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccC Confidence 2 3467778888899999888876422 111 11 12233444444 556789999999999999999 Q ss_pred eEEEEEEEC------CCC-------ceEEEEEccceeEEEEeCCCC--ceeEEEEEEeeeee------------------ Q lcl|NC_010808. 149 KAYELMIRN------QDD-------ETRLYKSDAMSTFVIYDNTIE--RNSIAGVRYLRTKP------------------ 195 (512) Q Consensus 149 ~a~~~v~~d------~~g-------~~~i~~~~p~~~~~i~d~~~~--~~~~~~v~~~~~~~------------------ 195 (512) -|+..+..- +|| .|++..++|.++|+ |++.. ....+.+|-+.++. T Consensus 143 ~~vat~~~er~~~~~~d~~v~~~~~~P~~ervsP~Di~~--Dp~A~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~ 220 (599) T protein:vir:31 143 FCVAHTRHVKRMTVTAENQVIKNYSGTVTERLSPSDVFW--DVTADSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMED 220 (599) T ss_pred ceeEeeeEEEcceeecccccccccccceEEeecccceee--CCCCCCCCcceeeeehhhhHHHHHHHhccCCccccchHH Confidence 998876421 233 47888999988754 44322 22233344332110 Q ss_pred e---------------ccCC---------cceEEE-EEEEcCCcE--E-----EEEecCCccc-----cccc--c--ccc Q lcl|NC_010808. 196 I---------------DKTD---------EDEVFT-VDLFTSHGV--Y-----RYLTSRTNGL-----KLTP--R--ENG 234 (512) Q Consensus 196 ~---------------~~~~---------~~~~~~-~~~yt~~~~--~-----~~~~~~~~~~-----~~~~--~--~~~ 234 (512) . ++.. .+.... .+.|.+.-+ . .|...+.+.. .+.. . ..+ T Consensus 221 ~~~~~~~~~~~~~~~~d~~~~~~g~D~~~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e 300 (599) T protein:vir:31 221 FQKLREERRTIREALADGYNGRRKFDSLHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQ 300 (599) T ss_pred HHHHHhhccCCCccccchhhhhhhccccccccccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecc Confidence 0 0000 000000 001111100 0 0011111110 0000 0 112 Q ss_pred cccccccccceEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhcccc Q lcl|NC_010808. 235 FESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVL 309 (512) Q Consensus 235 ~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~ 309 (512) ..|.+.|..|++.. +.+.+|.|.+..+..+++.+|.+.-.+.+.+..+..|+++..|... ..+.. T Consensus 301 ~np~~~g~~Pyvv~~~~P~~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p~l~~~~dl~--~eD~~-------- 370 (599) T protein:vir:31 301 SKDTWDGSQNLHIAVYEFQKDTLCPIGPLHRLTGMQYKLDKRENFREDLHDRFLHPSLKKVGDVR--EKGMR-------- 370 (599) T ss_pred cCCCCCCCCCeEEEEeeeeccccCCCCCchhcchHHHHHHHHHHHhhhhhhhhhccccccccccc--ccCcc-------- Confidence 23455666777653 3456789999999999999999999999989999999888777521 11111 Q ss_pred ccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccc-cccchHHHHHHHHHHHHHHH Q lcl|NC_010808. 310 FLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNF-SGTQSGEAMKYKLFGLEQRT 388 (512) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Ai~~~~~~l~~k~ 388 (512) +.+....-....++++++.++.+.......+..+...+-..+++|..+.|.- .+..++..++....+..... T Consensus 371 -------~~P~~v~~~~d~~~vq~~~p~s~~~~a~~~is~~e~~mee~sGvp~~~~G~~~ag~~TA~~is~l~naa~~~~ 443 (599) T protein:vir:31 371 -------GGPNHVFEVEETGDVQYMTPPAEVLQPDNQLSITLQLMEDLSGAPKESIGQRTAGEKTKFEVQLLDQGQNKVF 443 (599) T ss_pred -------CCCCcceeecCCCccccccCchhhhhHHHHHHHHHHHHHHhhccchhhcCCcccchhhHHHHHHHHhhhhhhH Confidence 1111222345677888888888877788888888888889999998877643 35678888888888888887 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHhccCCC-------c---ccccce-----ee--EEeCCCCCcCHHHHHHHHHHHhc- Q lcl|NC_010808. 389 KTKEGLFTKGLRR-RAKLLETILKNTRSID-------A---NKDFNT-----VR--YVYNRNLPKSLIEELKAYIDSGG- 449 (512) Q Consensus 389 ~~~~~~~~~~l~~-~~~li~~~l~~~~~~~-------~---~~d~~~-----i~--i~f~~~~p~d~~~~~~~~~kl~g- 449 (512) ..+.+.|.+++-+ +++-+.++....-... . ...+.+ +. ..+.+--..-..+..+.++++.. T Consensus 444 ~~~vr~~e~~~lepll~~l~e~~~~f~D~~~tiri~~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~l~~i 523 (599) T protein:vir:31 444 RRKVKKFERELLTPVLNDYLEQGRNHLDASDTIKTFNSELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAI 523 (599) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeecccccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHHHHHHH Confidence 8888888776544 5554444433111000 0 000111 11 11211122223455555555422 Q ss_pred --------cC---ChHHH---HHhC------CCCC-C-----HHHHHHHHHHHHHHHHHHHHhhc-ccCCCCCCCCCC Q lcl|NC_010808. 450 --------KI---SQTTL---MSLF------SFFQ-D-----PELEVKKIEEDEKESIKKAQKGI-YKDPRDINDDEQ 500 (512) Q Consensus 450 --------~~---s~et~---~~~~------~~v~-d-----~~~E~~ri~~E~~~~~~~~~~~~-~~~~~~~~~~~~ 500 (512) +. +++.. +..+ +.-. . .+.+...++++-++..+.+.... .+.|+. +..+ T Consensus 524 l~~~~~q~~~P~~~~k~l~~~l~~~~~l~~~~~~~~~va~~eqq~~~~m~Q~~lq~~~~~~~~~~~~~~~~~--~~~~ 599 (599) T protein:vir:31 524 LGGPLGAALAPHMSRTKLFNAVEYLGDLDAYGIFTFGIGVQEDQQLARMAQKSTQQTEETALTQEEVGGPTT--DTGQ 599 (599) T ss_pred hcccCCCccchhhHHHHHHHHHHHHHhccccccCCCchhHHHHHHHHHHHHHHHHHhHhhhhhhhhcCCCCc--ccCC Confidence 22 33222 2221 1111 1 11122222222222222222111 111111 1111 No 118 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=98.98 E-value=8.2e-09 Score=64.93 Aligned_cols=459 Identities=11% Similarity=0.051 Sum_probs=179.9 Q ss_pred eccccchhhccccccCC-CcCeeec-ccchhHHhhhc----HHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 7 FETDTDLRENRNYLFND-EANVVYT-YDGTESDLLQN----INEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 7 ~~~~~~~~~~~~~~f~~-~~~~~~~-~~~~~~~~~~~----~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) ....|+|.. .++|.. +.+..|. ...+...-++. .+.|.+......... .+-......+.+-+. .++... T Consensus 1 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~-~~~~~~~~~~~~~~~--~r~~~~ 75 (551) T protein:vir:80 1 MKNKLGLFE--SIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAY-SQPVIGSMSANPGFK--TKPSIR 75 (551) T ss_pred CchhhhhHH--HhhhccCChhhcccccccccceeeecccccHHHHHHhhccCccee-ecccccceecCcccc--cCcccc Confidence 112222211 111111 1111110 01100000000 011111111100000 000000000000000 000111 Q ss_pred ccccccce--ee-ecchHHHHHHHHHhhhhc-----------cCceecCC---------chhHHHHHHHHHhcc------ Q lcl|NC_010808. 81 KEEYMADN--RV-AHDYASYISDFINGYFLG-----------NPIQCQDD---------DKDVLEAIEAFNDLN------ 131 (512) Q Consensus 81 ~~~~~~~~--ri-~~n~~~~iv~~~a~~l~g-----------~~~~~~~~---------d~~~~~~l~~~~~~n------ 131 (512) +....+.. .+ ..+..+.+|+..+..+.. -++.+... +....+.+.+++..- T Consensus 76 ~~~~l~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p 155 (551) T protein:vir:80 76 NNQDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDI 155 (551) T ss_pred ChhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCC Confidence 11000000 11 135555666666554431 22222211 122223455555432 Q ss_pred ---ChhHHHHHHHHHHHhCCeEEEEEEECCCCceE-EEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEE Q lcl|NC_010808. 132 ---DVESHNRSLGLDLSIYGKAYELMIRNQDDETR-LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTV 207 (512) Q Consensus 132 ---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~-i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~ 207 (512) .+..+...+..+.+.+|.+|+.+.++.+|++. +..++|..+.++.++.. ......++|+... .+ . .. T Consensus 156 ~~~s~~~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g-~~~~~~~~y~~~~--~g-----~-~~ 226 (551) T protein:vir:80 156 NRDSFSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADG-KIPDNGNRFVQVI--DQ-----K-IV 226 (551) T ss_pred ccchHHHHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCcc-ccccCceEEEEEe--CC-----c-EE Confidence 23456777888899999999999899999864 77889999888765532 1111112222211 00 0 01 Q ss_pred EEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCcee Q lcl|NC_010808. 208 DLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAML 287 (512) Q Consensus 208 ~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~l 287 (512) ..|.++.+++++... .........|.|.++.+...+.....+..-....+.-.+.|-. T Consensus 227 ~~~~~~eiiH~~~n~----------------------~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~g 284 (551) T protein:vir:80 227 ATFNAREMAFAVRNP----------------------RSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRG 284 (551) T ss_pred EEEcccceEEecccC----------------------CCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcce Confidence 123444444443211 0000011247777777777776666555555555555556654 Q ss_pred ee--ecCCcCChhhhhhhhhccccccchhhhhhccc--ccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccccc Q lcl|NC_010808. 288 LI--KGNLSLDPDEVKKQKEANVLFLEPTVYENRDT--GIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM 363 (512) Q Consensus 288 v~--~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~ 363 (512) ++ .|....+++.....+..-.-.. ....... .+-.+++.+++-++.......+.+..+...+.|...-++|.. T Consensus 285 iL~~~~~~~lt~e~~~~lk~~~~~~~---~G~~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~ 361 (551) T protein:vir:80 285 ILQIKAAQQQSQHALEIFKREWKNSL---SGINGSWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPA 361 (551) T ss_pred EEEEcCCCCCCHHHHHHHHHHHHHHh---cCccccCccccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHH Confidence 44 4443344444443332210000 0000111 112234445555554444555666778888889998899977 Q ss_pred ccccccccc----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHH Q lcl|NC_010808. 364 KDDNFSGTQ----SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIE 439 (512) Q Consensus 364 ~~~~~~~n~----Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~ 439 (512) ..+-...+. .+..+-.. .. .......+...|.-+++.|...++..-.. .. -..+.+.|......+..+ T Consensus 362 ~lG~~~~~~~~~~~~~s~t~s--n~---e~~~~~f~~~tL~P~~~~ie~~ln~~L~~--~~-~~~~~f~f~~~~~~~~~~ 433 (551) T protein:vir:80 362 EINIPNNGGATGSKGGSLNEG--NS---AEKNQASKNKGLQPLLGFIEDFINKHIVA--EF-GDKYTFQFVGGDIKSELE 433 (551) T ss_pred HcCcccccccccccccccchh--hH---HHHHHHHHHHHHHHHHHHHHHHHHhhhcc--cc-CCceEEEeeccChhhHHH Confidence 665322111 01111000 00 01122344555555555555544432111 11 134677888777777766 Q ss_pred HHHHHHHH-hccCChHHHHHhCCCCCC-H--HH---------HHHHHHHHHHHHHHHH------HhhcccCCCC--CCCC Q lcl|NC_010808. 440 ELKAYIDS-GGKISQTTLMSLFSFFQD-P--EL---------EVKKIEEDEKESIKKA------QKGIYKDPRD--INDD 498 (512) Q Consensus 440 ~~~~~~kl-~g~~s~et~~~~~~~v~d-~--~~---------E~~ri~~E~~~~~~~~------~~~~~~~~~~--~~~~ 498 (512) .+...... .|+++.-.++++++.-.. + +. ..+...+++.+..... .........+ .... T Consensus 434 ~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p 513 (551) T protein:vir:80 434 SVKILAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIP 513 (551) T ss_pred HHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCcchhhhhhccccccCcCCCCCCCCCCCCC Confidence 66544322 688999888888864221 0 00 0000111100000000 0000000000 0000 Q ss_pred CCCCCCcCcc-------cCCC Q lcl|NC_010808. 499 EQDDDTKDTV-------DKKE 512 (512) Q Consensus 499 ~~~~~~~~~~-------~~~e 512 (512) .+.+..++.. ++++ T Consensus 514 ~~~~~~~~~~~~~~~~~~~~~ 534 (551) T protein:vir:80 514 DGKDTTGDIGKDGQRKDKDNA 534 (551) T ss_pred CccccCCCccccccccCcccc Confidence 0000000000 1110 No 119 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=98.96 E-value=1.4e-09 Score=69.13 Aligned_cols=440 Identities=9% Similarity=0.024 Sum_probs=202.1 Q ss_pred hhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccc--ccccccccccc--c------- Q lcl|NC_010808. 14 RENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGK--TKNLVELTRRK--E------- 82 (512) Q Consensus 14 ~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~--~~~~~~~~~~~--~------- 82 (512) |+-..-+|...-+..--..++ +..+.+...+.--..++|.-+...+-.-. .++.......+ . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds 73 (765) T protein:vir:96 1 MFKLSWIFGRKKDNAACSESA-------PEKVARIPQHDPLDPMIKLGKIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDS 73 (765) T ss_pred CceeeeecccccccccccccC-------chhhhhcCCCCCcccchhHHHHhhcccccccCCCCCCCCcccCcccceeccc Confidence 555556665432111111111 11111111111111122333222221100 01000000000 0 Q ss_pred ---c-----------cccce-----------------------eeecchHHHHHHHHHhhhhccCceecCCch----hHH Q lcl|NC_010808. 83 ---E-----------YMADN-----------------------RVAHDYASYISDFINGYFLGNPIQCQDDDK----DVL 121 (512) Q Consensus 83 ---~-----------~~~~~-----------------------ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~----~~~ 121 (512) . ..... -..+.+++.||+..+.-++-+++.++++++ +.. T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gyql~alY~~~~l~rkiVd~pAeDa~R~g~~I~~~~~e~~~~~~ 153 (765) T protein:vir:96 74 AYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGYQACAIISQHWLVDKACSMSGEDAARNGWELKSDGRKLSDEQS 153 (765) T ss_pred cccccccchHHHhhhccCccchhhHHHhhhcccCCccHHHHHHHHhCchhhhhhhcchHHhhcCCceeecCccccCHHHH Confidence 0 00000 113578899999999999999999988643 334 Q ss_pred HHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECC-CCc---------------e-EEEEEccceeEEEEe---CCCC Q lcl|NC_010808. 122 EAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ-DDE---------------T-RLYKSDAMSTFVIYD---NTIE 181 (512) Q Consensus 122 ~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~-~g~---------------~-~i~~~~p~~~~~i~d---~~~~ 181 (512) +.|+..|+.-++...+.++.+.+-.||.+|+++-.+. ++. + .+.+++|..+.|.-. .... T Consensus 154 ~~l~~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~~PL~~~~I~kg~~kgl~vldp~~~~~~~v~e~~~Dp 233 (765) T protein:vir:96 154 ALIARRDMEFRVKDNLVELNRFKNVFGVRIALFVVESDDPDYYEKPFNPDGIAPGSYKGISQIDPYWAMPQLTAESTADP 233 (765) T ss_pred HHHHHHHHHhhHHHHHHHHHHHhhhceeeEEEEEecccCcchhhccccccccccceeeEEEEechhhcccccchhccccc Confidence 5677778777889999999999999999998876542 221 1 134444444443210 0000 Q ss_pred ceeEEE-EEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEe-ecCCCCCCcchH Q lcl|NC_010808. 182 RNSIAG-VRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITE-FSNNERRKGDYE 259 (512) Q Consensus 182 ~~~~~~-v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~-~~n~~~g~s~~~ 259 (512) ..+-++ ..+|.+. . .-+.+.+++++... .+|-.. -.++-.|.|.++ T Consensus 234 ~sp~fg~P~~y~i~------g------~~IH~SRli~~~g~--------------------~lpd~lk~~~~~~G~Svlq 281 (765) T protein:vir:96 234 SAEHFYEPDFWIIS------G------KKYHRSHLVVVRGP--------------------QPPDILKPTYIFGGIPLTQ 281 (765) T ss_pred cccccCcceeeeec------C------ceeccceEEEecCC--------------------CchhhhccccCccCccHHH Confidence 000000 0001000 0 00111222221110 011111 112335999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcC-ChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecC Q lcl|NC_010808. 260 KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSL-DPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQY 338 (512) Q Consensus 260 ~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 338 (512) .+...+.+++++.-.....+..++..++.+.+.... +.+.+... .-... ...........+.+.+++ ..+. T Consensus 282 ~~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~~l~~~~~l~~r----~~~~~--~~r~n~g~~~id~ee~~e--~~s~ 353 (765) T protein:vir:96 282 RIYERVYAAERTANEAPLLAMSKRTSTIHVDVEKAIANEDAFNAR----LAFWI--ANRDNHGVKVIGIDETME--QFDT 353 (765) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccceeeechHhhhccHHHHHHH----HHHHH--HhcCCceeEEecCCccee--EEec Confidence 999999999998888887777777777665543211 11111110 00000 011111111122333443 4456 Q ss_pred CHHHHHHHHHHHHHHHHHHhccccccccc---ccccchHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010808. 339 DVQGTEAYKDRLNSDIHMFTNTPNMKDDN---FSGTQSGEAMKYKLFGLEQRTKTKE-GLFTKGLRRRAKLLETILKNTR 414 (512) Q Consensus 339 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~~~n~Sg~Ai~~~~~~l~~k~~~~~-~~~~~~l~~~~~li~~~l~~~~ 414 (512) +...+...++...+.|...+.+|-.-+.. .|-|.||..=...+.. .++.+| ..+...|++++.+++.. + T Consensus 354 ~lsgl~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYyD---~I~s~Qe~~l~p~le~L~~li~~s----~ 426 (765) T protein:vir:96 354 NLSDFDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYHE---ELESIQEHIFDPLLERHYLLLAKS----E 426 (765) T ss_pred ccCCHHHHHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHh----c Confidence 77889999999999999999999644332 2335677743333333 333333 55788888888887643 2 Q ss_pred CCCcccccceeeEEeCCCCCcCHHHHHHHHHH-------H--hccCChHHHHHhCC------C--CCCHHHHH-HHHHHH Q lcl|NC_010808. 415 SIDANKDFNTVRYVYNRNLPKSLIEELKAYID-------S--GGKISQTTLMSLFS------F--FQDPELEV-KKIEED 476 (512) Q Consensus 415 ~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~k-------l--~g~~s~et~~~~~~------~--v~d~~~E~-~ri~~E 476 (512) .. + .++++.|++-...+..+.|+...+ + .|+++...+++.+. + ++|.+.|. .-+..+ T Consensus 427 ~i----~-~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe 501 (765) T protein:vir:96 427 SI----D-VQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQAETEPGMSPE 501 (765) T ss_pred CC----C-CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCccccccccCCCcc Confidence 22 1 257899999998999888776433 2 58899888887762 1 12211110 001100 Q ss_pred HHHHHHHHHhhcc---c-------CCCCCCCCCCCC-----C-----------CcCcccCCC Q lcl|NC_010808. 477 EKESIKKAQKGIY---K-------DPRDINDDEQDD-----D-----------TKDTVDKKE 512 (512) Q Consensus 477 ~~~~~~~~~~~~~---~-------~~~~~~~~~~~~-----~-----------~~~~~~~~e 512 (512) .....+....... . .+...++.++.. . .++...... T Consensus 502 ~~~~~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~~~p~~~~p~~~~~~~~~g~~~~~p~ 563 (765) T protein:vir:96 502 NLAELEKAGAQSAKAKGEAERAEAQAGAVEGAGDPVPAAPRGTKPLAKAAEEGAGEAATPPS 563 (765) T ss_pred ccccccCCCcccccccCccccccCCCCccCCCCcccccCCcccCCccccccccCccccCccc Confidence 0000000000000 0 000000000000 0 000000000 No 120 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.76 E-value=5.9e-08 Score=60.22 Aligned_cols=391 Identities=9% Similarity=0.000 Sum_probs=173.3 Q ss_pred HHHHHHHHHHHHHH---HHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHH Q lcl|NC_010808. 46 SKYIEHHMDYQRPR---LKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLE 122 (512) Q Consensus 46 ~~~i~~~~~~~~~r---~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~ 122 (512) ..|........... -..+..++.+...-. ... . ..-+.+.-....|+.+++-+.+-|++. .+..... T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~v~-~---~~al~~~~V~~~v~~ia~~ia~~p~~~--~~~~~~~ 70 (397) T protein:vir:38 1 MPLLKLNKSHSQGFSLNDPDWVNFLTGGEAQK----YVS-A---DTALKNSDIFSLIMQLSGDLAMVRYTS--ESDRSQS 70 (397) T ss_pred CcchhhhhcccCcccCCchhhhhhhcCCcCCc----eec-h---HHhhccHHHHHHHHHHHHHHhhCcccc--cccHHHH Confidence 11110000000000 001111111110000 000 0 000112223334555555555555543 2322222 Q ss_pred HHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCc Q lcl|NC_010808. 123 AIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDE 201 (512) Q Consensus 123 ~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~ 201 (512) .+.+-...-....+...+..+.+.+|.||+.+-++.+|++ .+..++|..+.+..+... ..+. |.+........ T Consensus 71 l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~-~~~~-----y~~~~~~~~~~ 144 (397) T protein:vir:38 71 IISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQDG-SGLI-----YNINFDEPAIG 144 (397) T ss_pred HHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC-ceEE-----EEEEecccccc Confidence 2222222224456778888899999999999988988876 577889998877765432 2111 11111000000 Q ss_pred ceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 202 DEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSD 281 (512) Q Consensus 202 ~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~ 281 (512) ....+.++.+++++... ......|.|.+..+...++....+..-..+.+.- T Consensus 145 ----~~~~~~~~eiih~~~~~-------------------------~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~n 195 (397) T protein:vir:38 145 ----YMENVPAADVIHIRLLS-------------------------KNGGKTGISPLSALINEQQIKDASNELTLKALKQ 195 (397) T ss_pred ----ceeEecCccEEEecCCC-------------------------CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 01123444444442211 0011257788887777777666665555555666 Q ss_pred hcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_010808. 282 LNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTP 361 (512) Q Consensus 282 ~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 361 (512) .+.|-.+++-.....++..+..+...... ....+.......+.+.++.-++.......+.+..+.....|+..-++| T Consensus 196 g~~~~~il~~~~~~~~e~~~~~~~~~~~~---~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp 272 (397) T protein:vir:38 196 SVTASAVLTIQKGGLLDAETRIARSKEIS---KQIHNSDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVP 272 (397) T ss_pred cCCccEEEEeCCCCCHHHHHHHHHHHHHH---hcccccCCceecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCC Confidence 67777777654444444433332211100 001111112223455566666555455666777888889999988998 Q ss_pred ccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHH Q lcl|NC_010808. 362 NMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEEL 441 (512) Q Consensus 362 ~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~ 441 (512) ....+...+..|..+ .....+...|+.++..|...++.+-... +++.+...+-.|..+.+ T Consensus 273 ~~~lg~~~~~~~~~e-------------~~~~~~~~~l~P~~~~ie~~ln~~l~~~-------~~~~~~~~~~~d~~~~~ 332 (397) T protein:vir:38 273 DSYLNGQGDQQSSIT-------------QISGQYAKSLNRYVQAIVGELNDKLHAN-------ISANIRFAIDAMGDQYA 332 (397) T ss_pred HHHhCCCCCcccHHH-------------HHHHHHHHHHHHHHHHHHHHHHHhccCh-------hcccccccccCCHHHHH Confidence 776654322222211 0112334455555555555444332211 11222223445778888 Q ss_pred HHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 442 KAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 442 ~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) +.+.++ .|+++...+++.++.-.-+..++-..... .............+..+.++++...+-| T Consensus 333 ~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~--------~~~~~~~~~~~~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 333 STISSSVKGGTIAGNQARFILQNSGYLAKDLPDPEKE--------PQQAIQLIQQEGGENDGNNSDERGSDPE 397 (397) T ss_pred HHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccccccc--------ccccccccccccCCCCCCCCCCCCCCCC Confidence 888776 68999999988875421000000000000 0000000000111111111111111111 No 121 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=98.73 E-value=7.7e-08 Score=59.58 Aligned_cols=457 Identities=9% Similarity=-0.002 Sum_probs=207.5 Q ss_pred cccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhcc Q lcl|NC_010808. 30 TYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGN 109 (512) Q Consensus 30 ~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~ 109 (512) +....+ .+....-.+.+..++..-..+++.+.+|..=.-.... ........+...++.-+-+...++.+++.|++- T Consensus 1 M~~~~~---~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~-~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ 76 (555) T protein:vir:98 1 MAEQTE---RKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFF-VQDRNRGEKRHNNILDNTGTRALRVLAAGMMAG 76 (555) T ss_pred CCCccc---HHHHHHHHHHHHHHhhHHHHHHHHHHHHhCccccccc-CCCCCcchhcccccccccHHHHHHHHHHHHHHh Confidence 111111 0111122223333333444566666666521100000 001111122234566788888888888887653 Q ss_pred --Cc-----eecCCch-------------hHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEcc Q lcl|NC_010808. 110 --PI-----QCQDDDK-------------DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDA 169 (512) Q Consensus 110 --~~-----~~~~~d~-------------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p 169 (512) |+ ++...+. .....+...+..++|.....++.++..++|.+.+++-.|..+.+++..++. T Consensus 77 ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl 156 (555) T protein:vir:98 77 MTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTA 156 (555) T ss_pred hcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeec Confidence 21 2222221 123345566777899999999999999999999988777777788888887 Q ss_pred ceeEEEEeCCCCceeEEEEEEeeeeeec--------cC--------Cc-ceEEEEEEEc---C-Cc-------------- Q lcl|NC_010808. 170 MSTFVIYDNTIERNSIAGVRYLRTKPID--------KT--------DE-DEVFTVDLFT---S-HG-------------- 214 (512) Q Consensus 170 ~~~~~i~d~~~~~~~~~~v~~~~~~~~~--------~~--------~~-~~~~~~~~yt---~-~~-------------- 214 (512) .+.+.-.|. .+++...+|.++..... .. .. .....+++++ + .. T Consensus 157 ~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~ 234 (555) T protein:vir:98 157 GEYAIAADN--QGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWK 234 (555) T ss_pred ceeEEeeCC--CCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceE Confidence 776665544 35666666654332110 00 00 0011233221 1 10 Q ss_pred EEEEEecCCccccccccccccccccccccceEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeee Q lcl|NC_010808. 215 VYRYLTSRTNGLKLTPRENGFESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI 289 (512) Q Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~ 289 (512) .+++.... .+..+ ...-+|..+|++.+ ..+.+|+|..+...+-+..++.+.-.....++...+|.+.+ T Consensus 235 s~~~~~~~-d~~~v------l~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v 307 (555) T protein:vir:98 235 SVYFEPGA-DETRT------LRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQL 307 (555) T ss_pred EEEEEecc-CCccc------cccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee Confidence 01111111 10000 11123445666543 34568999999999999999987777777788777776654 Q ss_pred ecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccc--cccccc Q lcl|NC_010808. 290 KGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTP--NMKDDN 367 (512) Q Consensus 290 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p--~~~~~~ 367 (512) ........ +.+.++.... ...+..+++-.-.+.+..+.......++.++..|...-... ...... T Consensus 308 ~~~~~~~~-----------~~~~pgg~~~--v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~ 374 (555) T protein:vir:98 308 PVSAKNQD-----------ISTVPGGLSY--VDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANG 374 (555) T ss_pred cccccccc-----------ceeccccccc--cccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCC Confidence 32111110 1111111000 00111111112222334466777777888888775543222 111112 Q ss_pred ccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCc---ccccceeeEEeCCCCCcCHHH---- Q lcl|NC_010808. 368 FSGTQSGEAMKYKLFGLEQRTKT-KEGLFTKGLRRRAKLLETILKNTRSIDA---NKDFNTVRYVYNRNLPKSLIE---- 439 (512) Q Consensus 368 ~~~n~Sg~Ai~~~~~~l~~k~~~-~~~~~~~~l~~~~~li~~~l~~~~~~~~---~~d~~~i~i~f~~~~p~d~~~---- 439 (512) .+...||..+........+...- ..++-.+.+.-+++-++.++...+..+. ......|+|++..++.+.... T Consensus 375 ~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~ 454 (555) T protein:vir:98 375 TNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATN 454 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHH Confidence 23456777776643333333222 2222233343444444455554443322 223345777776665432111 Q ss_pred ----HHHHHHHHhccCC-------hHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHHhh--cccCCCCCCCCC Q lcl|NC_010808. 440 ----ELKAYIDSGGKIS-------QTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQKG--IYKDPRDINDDE 499 (512) Q Consensus 440 ----~~~~~~kl~g~~s-------~et~~~~~---~~v~----d~~~E~~ri~~E~~~~~~~~~~~--~~~~~~~~~~~~ 499 (512) .++.+..++++-| ...++..+ -+++ -.++|+++|++++++....++.. ..+.......-+ T Consensus 455 ~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~ 534 (555) T protein:vir:98 455 SVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLG 534 (555) T ss_pred HHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 1122222233222 12222222 1222 23567777777655443322222 111111111222 Q ss_pred CCCCCcCcccCCC Q lcl|NC_010808. 500 QDDDTKDTVDKKE 512 (512) Q Consensus 500 ~~~~~~~~~~~~e 512 (512) +.+.++.++=..= T Consensus 535 ~~~~~~~~~~~~~ 547 (555) T protein:vir:98 535 SVDTSKQNALTDV 547 (555) T ss_pred ccccCcchhHHHH Confidence 2222111000000 No 122 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=98.73 E-value=7.7e-08 Score=59.58 Aligned_cols=457 Identities=9% Similarity=-0.002 Sum_probs=207.5 Q ss_pred cccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhcc Q lcl|NC_010808. 30 TYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGN 109 (512) Q Consensus 30 ~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~ 109 (512) +....+ .+....-.+.+..++..-..+++.+.+|..=.-.... ........+...++.-+-+...++.+++.|++- T Consensus 1 M~~~~~---~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~-~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ 76 (555) T protein:vir:10 1 MAEQTE---RKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFF-VQDRNRGEKRHNNILDNTGTRALRVLAAGMMAG 76 (555) T ss_pred CCCccc---HHHHHHHHHHHHHHhhHHHHHHHHHHHHhCccccccc-CCCCCcchhcccccccccHHHHHHHHHHHHHHh Confidence 111111 0111122223333333444566666666521100000 001111122234566788888888888887653 Q ss_pred --Cc-----eecCCch-------------hHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEcc Q lcl|NC_010808. 110 --PI-----QCQDDDK-------------DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDA 169 (512) Q Consensus 110 --~~-----~~~~~d~-------------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p 169 (512) |+ ++...+. .....+...+..++|.....++.++..++|.+.+++-.|..+.+++..++. T Consensus 77 ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl 156 (555) T protein:vir:10 77 MTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTA 156 (555) T ss_pred hcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeec Confidence 21 2222221 123345566777899999999999999999999988777777788888887 Q ss_pred ceeEEEEeCCCCceeEEEEEEeeeeeec--------cC--------Cc-ceEEEEEEEc---C-Cc-------------- Q lcl|NC_010808. 170 MSTFVIYDNTIERNSIAGVRYLRTKPID--------KT--------DE-DEVFTVDLFT---S-HG-------------- 214 (512) Q Consensus 170 ~~~~~i~d~~~~~~~~~~v~~~~~~~~~--------~~--------~~-~~~~~~~~yt---~-~~-------------- 214 (512) .+.+.-.|. .+++...+|.++..... .. .. .....+++++ + .. T Consensus 157 ~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~ 234 (555) T protein:vir:10 157 GEYAIAADN--QGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWK 234 (555) T ss_pred ceeEEeeCC--CCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceE Confidence 776665544 35666666654332110 00 00 0011233221 1 10 Q ss_pred EEEEEecCCccccccccccccccccccccceEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeee Q lcl|NC_010808. 215 VYRYLTSRTNGLKLTPRENGFESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI 289 (512) Q Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~ 289 (512) .+++.... .+..+ ...-+|..+|++.+ ..+.+|+|..+...+-+..++.+.-.....++...+|.+.+ T Consensus 235 s~~~~~~~-d~~~v------l~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v 307 (555) T protein:vir:10 235 SVYFEPGA-DETRT------LRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQL 307 (555) T ss_pred EEEEEecc-CCccc------cccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee Confidence 01111111 10000 11123445666543 34568999999999999999987777777788777776654 Q ss_pred ecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccc--cccccc Q lcl|NC_010808. 290 KGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTP--NMKDDN 367 (512) Q Consensus 290 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p--~~~~~~ 367 (512) ........ +.+.++.... ...+..+++-.-.+.+..+.......++.++..|...-... ...... T Consensus 308 ~~~~~~~~-----------~~~~pgg~~~--v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~ 374 (555) T protein:vir:10 308 PVSAKNQD-----------ISTVPGGLSY--VDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANG 374 (555) T ss_pred cccccccc-----------ceeccccccc--cccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCC Confidence 32111110 1111111000 00111111112222334466777777888888775543222 111112 Q ss_pred ccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCc---ccccceeeEEeCCCCCcCHHH---- Q lcl|NC_010808. 368 FSGTQSGEAMKYKLFGLEQRTKT-KEGLFTKGLRRRAKLLETILKNTRSIDA---NKDFNTVRYVYNRNLPKSLIE---- 439 (512) Q Consensus 368 ~~~n~Sg~Ai~~~~~~l~~k~~~-~~~~~~~~l~~~~~li~~~l~~~~~~~~---~~d~~~i~i~f~~~~p~d~~~---- 439 (512) .+...||..+........+...- ..++-.+.+.-+++-++.++...+..+. ......|+|++..++.+.... T Consensus 375 ~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~ 454 (555) T protein:vir:10 375 TNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATN 454 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHH Confidence 23456777776643333333222 2222233343444444455554443322 223345777776665432111 Q ss_pred ----HHHHHHHHhccCC-------hHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHHhh--cccCCCCCCCCC Q lcl|NC_010808. 440 ----ELKAYIDSGGKIS-------QTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQKG--IYKDPRDINDDE 499 (512) Q Consensus 440 ----~~~~~~kl~g~~s-------~et~~~~~---~~v~----d~~~E~~ri~~E~~~~~~~~~~~--~~~~~~~~~~~~ 499 (512) .++.+..++++-| ...++..+ -+++ -.++|+++|++++++....++.. ..+.......-+ T Consensus 455 ~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~ 534 (555) T protein:vir:10 455 SVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLG 534 (555) T ss_pred HHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 1122222233222 12222222 1222 23567777777655443322222 111111111222 Q ss_pred CCCCCcCcccCCC Q lcl|NC_010808. 500 QDDDTKDTVDKKE 512 (512) Q Consensus 500 ~~~~~~~~~~~~e 512 (512) +.+.++.++=..= T Consensus 535 ~~~~~~~~~~~~~ 547 (555) T protein:vir:10 535 SVDTSKQNALTDV 547 (555) T ss_pred ccccCcchhHHHH Confidence 2222111000000 No 123 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=98.73 E-value=7.7e-08 Score=59.58 Aligned_cols=457 Identities=9% Similarity=-0.002 Sum_probs=207.5 Q ss_pred cccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhcc Q lcl|NC_010808. 30 TYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGN 109 (512) Q Consensus 30 ~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~ 109 (512) +....+ .+....-.+.+..++..-..+++.+.+|..=.-.... ........+...++.-+-+...++.+++.|++- T Consensus 1 M~~~~~---~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~-~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ 76 (555) T protein:vir:10 1 MAEQTE---RKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFF-VQDRNRGEKRHNNILDNTGTRALRVLAAGMMAG 76 (555) T ss_pred CCCccc---HHHHHHHHHHHHHHhhHHHHHHHHHHHHhCccccccc-CCCCCcchhcccccccccHHHHHHHHHHHHHHh Confidence 111111 0111122223333333444566666666521100000 001111122234566788888888888887653 Q ss_pred --Cc-----eecCCch-------------hHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEcc Q lcl|NC_010808. 110 --PI-----QCQDDDK-------------DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDA 169 (512) Q Consensus 110 --~~-----~~~~~d~-------------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p 169 (512) |+ ++...+. .....+...+..++|.....++.++..++|.+.+++-.|..+.+++..++. T Consensus 77 ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl 156 (555) T protein:vir:10 77 MTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTA 156 (555) T ss_pred hcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeec Confidence 21 2222221 123345566777899999999999999999999988777777788888887 Q ss_pred ceeEEEEeCCCCceeEEEEEEeeeeeec--------cC--------Cc-ceEEEEEEEc---C-Cc-------------- Q lcl|NC_010808. 170 MSTFVIYDNTIERNSIAGVRYLRTKPID--------KT--------DE-DEVFTVDLFT---S-HG-------------- 214 (512) Q Consensus 170 ~~~~~i~d~~~~~~~~~~v~~~~~~~~~--------~~--------~~-~~~~~~~~yt---~-~~-------------- 214 (512) .+.+.-.|. .+++...+|.++..... .. .. .....+++++ + .. T Consensus 157 ~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~ 234 (555) T protein:vir:10 157 GEYAIAADN--QGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWK 234 (555) T ss_pred ceeEEeeCC--CCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceE Confidence 776665544 35666666654332110 00 00 0011233221 1 10 Q ss_pred EEEEEecCCccccccccccccccccccccceEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeee Q lcl|NC_010808. 215 VYRYLTSRTNGLKLTPRENGFESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI 289 (512) Q Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~ 289 (512) .+++.... .+..+ ...-+|..+|++.+ ..+.+|+|..+...+-+..++.+.-.....++...+|.+.+ T Consensus 235 s~~~~~~~-d~~~v------l~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v 307 (555) T protein:vir:10 235 SVYFEPGA-DETRT------LRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQL 307 (555) T ss_pred EEEEEecc-CCccc------cccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee Confidence 01111111 10000 11123445666543 34568999999999999999987777777788777776654 Q ss_pred ecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccc--cccccc Q lcl|NC_010808. 290 KGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTP--NMKDDN 367 (512) Q Consensus 290 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p--~~~~~~ 367 (512) ........ +.+.++.... ...+..+++-.-.+.+..+.......++.++..|...-... ...... T Consensus 308 ~~~~~~~~-----------~~~~pgg~~~--v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~ 374 (555) T protein:vir:10 308 PVSAKNQD-----------ISTVPGGLSY--VDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANG 374 (555) T ss_pred cccccccc-----------ceeccccccc--cccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCC Confidence 32111110 1111111000 00111111112222334466777777888888775543222 111112 Q ss_pred ccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCc---ccccceeeEEeCCCCCcCHHH---- Q lcl|NC_010808. 368 FSGTQSGEAMKYKLFGLEQRTKT-KEGLFTKGLRRRAKLLETILKNTRSIDA---NKDFNTVRYVYNRNLPKSLIE---- 439 (512) Q Consensus 368 ~~~n~Sg~Ai~~~~~~l~~k~~~-~~~~~~~~l~~~~~li~~~l~~~~~~~~---~~d~~~i~i~f~~~~p~d~~~---- 439 (512) .+...||..+........+...- ..++-.+.+.-+++-++.++...+..+. ......|+|++..++.+.... T Consensus 375 ~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~ 454 (555) T protein:vir:10 375 TNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATN 454 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHH Confidence 23456777776643333333222 2222233343444444455554443322 223345777776665432111 Q ss_pred ----HHHHHHHHhccCC-------hHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHHhh--cccCCCCCCCCC Q lcl|NC_010808. 440 ----ELKAYIDSGGKIS-------QTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQKG--IYKDPRDINDDE 499 (512) Q Consensus 440 ----~~~~~~kl~g~~s-------~et~~~~~---~~v~----d~~~E~~ri~~E~~~~~~~~~~~--~~~~~~~~~~~~ 499 (512) .++.+..++++-| ...++..+ -+++ -.++|+++|++++++....++.. ..+.......-+ T Consensus 455 ~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~ 534 (555) T protein:vir:10 455 SVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLG 534 (555) T ss_pred HHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 1122222233222 12222222 1222 23567777777655443322222 111111111222 Q ss_pred CCCCCcCcccCCC Q lcl|NC_010808. 500 QDDDTKDTVDKKE 512 (512) Q Consensus 500 ~~~~~~~~~~~~e 512 (512) +.+.++.++=..= T Consensus 535 ~~~~~~~~~~~~~ 547 (555) T protein:vir:10 535 SVDTSKQNALTDV 547 (555) T ss_pred ccccCcchhHHHH Confidence 2222111000000 No 124 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=98.73 E-value=7.8e-08 Score=59.55 Aligned_cols=451 Identities=10% Similarity=0.035 Sum_probs=207.4 Q ss_pred cccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhc---ccccccccccccccccccceeeecchHHHHHHHHHhhh Q lcl|NC_010808. 30 TYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYE---GKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF 106 (512) Q Consensus 30 ~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~---G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l 106 (512) +.... .+......+.+..++..-..+++.+.+|.. +... ........+.+.|+.-+-+...++++++.| T Consensus 1 m~~~~----~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~----~~~~~~~~~~~~~~~dst~~~a~~~Las~l 72 (559) T protein:vir:95 1 MAETT----KERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFL----TSEVNRNDRRNTRIIDSTGTMAARTLASGM 72 (559) T ss_pred CChhh----HHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcC----CCCCCcccccccccccchHHHHHHHHHHHH Confidence 11111 112233334444455555566777777742 2110 000111112234566678888888888887 Q ss_pred hcc--Cc-----eecCCch------h-------HHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEE Q lcl|NC_010808. 107 LGN--PI-----QCQDDDK------D-------VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYK 166 (512) Q Consensus 107 ~g~--~~-----~~~~~d~------~-------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~ 166 (512) ++- |+ ++...++ + ..+.+...+..++|.....++.++..++|.+.+++-.+..+.+++.. T Consensus 73 ~~~ltpp~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~~~~r~~~ 152 (559) T protein:vir:95 73 MSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDDDEDIIRTMP 152 (559) T ss_pred HHhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecCCCceeEEEE Confidence 653 21 2222221 2 23335556777889999999999999999999888777667778888 Q ss_pred EccceeEEEEeCCCCceeEEEEEEeeeeeec-------cC---------Ccce-EEEEEEEc---C-C-----cE----- Q lcl|NC_010808. 167 SDAMSTFVIYDNTIERNSIAGVRYLRTKPID-------KT---------DEDE-VFTVDLFT---S-H-----GV----- 215 (512) Q Consensus 167 ~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~-------~~---------~~~~-~~~~~~yt---~-~-----~~----- 215 (512) ++..+.+.--|. .+++...+|.++..... .. ..+. -..++++. + . .. T Consensus 153 ~~l~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~ 230 (559) T protein:vir:95 153 FPIGSYYLANSP--RGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNK 230 (559) T ss_pred eecCeEEEeeCC--CCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEEEeccccccccccccccc Confidence 888887666554 45666666654332210 00 0000 11222221 1 0 00 Q ss_pred ----EEEEecCCccccccccccccccccccccceEee-----cCCCCCCc-chHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_010808. 216 ----YRYLTSRTNGLKLTPRENGFESHSFERMPITEF-----SNNERRKG-DYEKVITLIDLYDNAESDTANYMSDLNDA 285 (512) Q Consensus 216 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~n~~~g~s-~~~~v~~liDa~~~~~s~~~~~~~~~~~~ 285 (512) +++.....+ ..+ ....+|..+|++.+ ....+|+| ......+-+..++.+.-......+...+| T Consensus 231 pf~s~~~e~~~~~-~~~------l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~p 303 (559) T protein:vir:95 231 PFKSVYYEVGGDN-DKL------LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNP 303 (559) T ss_pred eEEEEEEEecCCC-cee------eecCCcccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcC Confidence 111111110 000 11223444565543 24567999 58899999999999988888888988888 Q ss_pred eeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCC-CcceeEEe-ecCCHHHHHHHHHHHHHHHHHHhcccc- Q lcl|NC_010808. 286 MLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEG-SVDGGYIY-KQYDVQGTEAYKDRLNSDIHMFTNTPN- 362 (512) Q Consensus 286 ~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~l~-~~~~~~~~~~~~~~l~~~i~~~s~~p~- 362 (512) .+.+.+...... ..+.++... .+.... ...++.+. .+.+...+...++.++..|...-..-. T Consensus 304 p~~v~~~~~~~~-----------~~l~pgg~~----~~~~~~~~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~af~~d~~ 368 (559) T protein:vir:95 304 PMVAPTSLKNQR-----------ASLLPGDIT----YIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLF 368 (559) T ss_pred ceeccccccccc-----------eeeecccee----eeCCCCCcccceeecccccchHHHHHHHHHHHHHHHHHhhhhhH Confidence 776543211111 111111111 011111 11222221 233455556666777776644432211 Q ss_pred -cccccccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCC---cccccceeeEEeCCCCCcCH Q lcl|NC_010808. 363 -MKDDNFSGTQSGEAMKYKLFGLEQRTKT-KEGLFTKGLRRRAKLLETILKNTRSID---ANKDFNTVRYVYNRNLPKSL 437 (512) Q Consensus 363 -~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~-~~~~~~~~l~~~~~li~~~l~~~~~~~---~~~d~~~i~i~f~~~~p~d~ 437 (512) ......+...||..+......+.+...- ..++-.+.|.-++.-++.++...+..+ .......++|++..++.+-. T Consensus 369 ~~l~~r~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aq 448 (559) T protein:vir:95 369 MMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQ 448 (559) T ss_pred HHhhcCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceEEEeecHHHHHH Confidence 1112233456777776654433333222 223333344444444445555444322 22334567777765554321 Q ss_pred -HHH-------HHHHHHHhccCC-------hHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHHhhccc--CCC Q lcl|NC_010808. 438 -IEE-------LKAYIDSGGKIS-------QTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQKGIYK--DPR 493 (512) Q Consensus 438 -~~~-------~~~~~kl~g~~s-------~et~~~~~---~~v~----d~~~E~~ri~~E~~~~~~~~~~~~~~--~~~ 493 (512) .+. ++.+..++++-| ...++..+ -+++ -.++|++++++++++....++..... ... T Consensus 449 k~~~~~~i~~~~~~~~~laq~~Pevld~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~rqqr~~~qq~~q~~~~~~~aa~ 528 (559) T protein:vir:95 449 KSIGLSSLASTVNFIGQLAQVKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQMMAMGMAAAQ 528 (559) T ss_pred HHHHHHHHHHHHHHHHHHhccChhhhhcCCHHHHHHHHHHHhCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 112222233222 23333322 1222 23566776666655443322211111 011 Q ss_pred CCCCCCCCCCCcCcccCC----------C Q lcl|NC_010808. 494 DINDDEQDDDTKDTVDKK----------E 512 (512) Q Consensus 494 ~~~~~~~~~~~~~~~~~~----------e 512 (512) ....-++...++.++=++ + T Consensus 529 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 557 (559) T protein:vir:95 529 GVKTLSEAKTSDPSVLSAMANAVSGQGGQ 557 (559) T ss_pred hhhccccccCCChhHHHHHHHhhcCcccc Confidence 010100111000000000 0 No 125 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=98.72 E-value=8.2e-08 Score=59.44 Aligned_cols=471 Identities=10% Similarity=0.031 Sum_probs=188.7 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhh--cHHHHHHHHHHHHHHH---HHHHHHHHHHhcccccc----cccccccc Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQ--NINEVSKYIEHHMDYQ---RPRLKVLSDYYEGKTKN----LVELTRRK 81 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~--~~~~l~~~i~~~~~~~---~~r~~~~~~yy~G~~~~----~~~~~~~~ 81 (512) |-+--....+=+++.. +....+ ..-.|.+.+...++.+ .++++.+.+||...... .....+.. T Consensus 1 ~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~ 72 (641) T protein:vir:94 1 MTIEMPTPIIEDKESA--------KRKLSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTT 72 (641) T ss_pred CccCCCcccccCCcch--------hhcCCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhccccccccc Confidence 2111111112122211 111111 1222334333333333 24566777777542211 11111111 Q ss_pred c--ccccceeeecchHHHHHHHHHhhhhcc----C--ceec---CCchhHHH----HHHHHHhccChhHHHHHHHHHHHh Q lcl|NC_010808. 82 E--EYMADNRVAHDYASYISDFINGYFLGN----P--IQCQ---DDDKDVLE----AIEAFNDLNDVESHNRSLGLDLSI 146 (512) Q Consensus 82 ~--~~~~~~ri~~n~~~~iv~~~a~~l~g~----~--~~~~---~~d~~~~~----~l~~~~~~n~~~~~~~~~~~~~~~ 146 (512) . ....+.|+..+.+...++.+++.|++- + +++. .++.+..+ .+...+..+++......+.++++. T Consensus 73 ~~~~~~~r~ki~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~~ 152 (641) T protein:vir:94 73 GADDADWRHRINTGHTFEVVETLVAYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEAASIRDIFETYVRNLVL 152 (641) T ss_pred ccchhcccccccchhHHHHHHHHhhHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHhh Confidence 1 111134688888888888888877652 2 1221 23333333 344445567888888899999999 Q ss_pred CCeEEEEEEECC------------C----------------CceEEEEEccceeEEEEeCCCCceeEEEEEEeeeee--- Q lcl|NC_010808. 147 YGKAYELMIRNQ------------D----------------DETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKP--- 195 (512) Q Consensus 147 ~G~a~~~v~~d~------------~----------------g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~--- 195 (512) +|.+++.++++. . ..+++..++|.+++ +|++....-..++++..+.. T Consensus 153 ~g~~iv~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~--~dps~~~~~~~f~~~r~t~~t~~ 230 (641) T protein:vir:94 153 YGVSTYRLGWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVW--LDTSGGKNTGTFVRLRHTREELH 230 (641) T ss_pred cCceEEEeehhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhhee--ecCCCCcccccceehhhhHHHHH Confidence 999999887541 1 12344555555543 34332221111222111100 Q ss_pred --------------------eccCCcce-----------EEEEEEE----cCCc-EEEEEecCCccccccccccccccc- Q lcl|NC_010808. 196 --------------------IDKTDEDE-----------VFTVDLF----TSHG-VYRYLTSRTNGLKLTPRENGFESH- 238 (512) Q Consensus 196 --------------------~~~~~~~~-----------~~~~~~y----t~~~-~~~~~~~~~~~~~~~~~~~~~~~~- 238 (512) +....... ...+++| .++. ...+.....+.. ......+ T Consensus 231 ~l~~eg~~~~d~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~-----il~~~~~~ 305 (641) T protein:vir:94 231 ELVTSGYYDLDLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQ-----LIRLSDSK 305 (641) T ss_pred HHHhcCCCChhhcchhhcccccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeCCE-----Eeeccccc Confidence 00000000 0011111 0110 000111111111 1111122 Q ss_pred cccccceEeec-----CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccch Q lcl|NC_010808. 239 SFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEP 313 (512) Q Consensus 239 ~~~~vPvv~~~-----n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~ 313 (512) .|..+|++.++ ...+|+|....+.+.+..+|.+.-...+.+....+|.+.+..........+.-. .++++ T Consensus 306 ~~d~~Pf~~~r~~~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l~~~-PG~ii---- 380 (641) T protein:vir:94 306 YWCGSPFVTTTLLPDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDVKAK-PGAVF---- 380 (641) T ss_pred ccCcCCeEEecceecCCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeeccccccccceeecc-CCcce---- Confidence 35566876543 345799999999999999999999999999998888876543221222211100 11111 Q ss_pred hhhhhcccccCCCCCcceeEEeec-CCHHHHHHHHHHHHHHHHHHhccccccc---ccccccchHHHHHHHHHHHHHHHH Q lcl|NC_010808. 314 TVYENRDTGIETEGSVDGGYIYKQ-YDVQGTEAYKDRLNSDIHMFTNTPNMKD---DNFSGTQSGEAMKYKLFGLEQRTK 389 (512) Q Consensus 314 ~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~---~~~~~n~Sg~Ai~~~~~~l~~k~~ 389 (512) ..+..++++++... .+.......++.+...|-...++..+.. ...+.+.+|..+.........+.. T Consensus 381 ----------~~~~~~~v~pl~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~ 450 (641) T protein:vir:94 381 ----------KVAQHGSLQPIDMGRQDFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLS 450 (641) T ss_pred ----------eeCCCCcceeecCCccccchhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHH Confidence 11223345555432 2222333445555555544443332221 111224577777766666666666 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHHhccCCC---------------cccccceeeEEeCCCCCcCHH---HHHHHHHHHh-- Q lcl|NC_010808. 390 TKEGLFT-KGLRRRAKLLETILKNTRSID---------------ANKDFNTVRYVYNRNLPKSLI---EELKAYIDSG-- 448 (512) Q Consensus 390 ~~~~~~~-~~l~~~~~li~~~l~~~~~~~---------------~~~d~~~i~i~f~~~~p~d~~---~~~~~~~kl~-- 448 (512) ...+.|. ++++.+++-+++++......+ .+....++...|.- +|.... +.++.+..+. T Consensus 451 ~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~~~~i-v~l~~~q~~~~~~~i~~l~~~ 529 (641) T protein:vir:94 451 SVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLHYPYKF-LALGANYVVERERMVTDLLQL 529 (641) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCccceeeeeeE-eecchhHHHHHHHHHHHHHHH Confidence 6666665 466666666666554321110 01111233333321 233322 2222332221 Q ss_pred ----ccCCh-----------HHHHHhCCC-C-------CCHHHHHHHH-HHHHHHHHHHHHhhcccCCC--------CCC Q lcl|NC_010808. 449 ----GKISQ-----------TTLMSLFSF-F-------QDPELEVKKI-EEDEKESIKKAQKGIYKDPR--------DIN 496 (512) Q Consensus 449 ----g~~s~-----------et~~~~~~~-v-------~d~~~E~~ri-~~E~~~~~~~~~~~~~~~~~--------~~~ 496 (512) |..|. +.+++..+. + .+.+.+-..+ .+|.++.+...+....+... +.+ T Consensus 530 ~~~~a~~P~v~d~~d~~~~~~~~~~~~g~~~p~~~ir~~~~~~~~~~~~~~~~q~~~~~~a~~~~~~~~~~a~~~~~~~~ 609 (641) T protein:vir:94 530 LDISGRVPQIGQSLDYALILEDLLRQMRFTDPMRYIKKAEAPPAAPPIAPAEPGALPPEMMNSVGGGLNDQAIAGMTPED 609 (641) T ss_pred HHHhhcChhhhhcCCHHHHHHHHHHHhCCCCchhhccCccCchhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHhhHHH Confidence 22221 222333331 1 1111111111 11111111111111000000 000 Q ss_pred CCCCCCC---CcCcccCCC Q lcl|NC_010808. 497 DDEQDDD---TKDTVDKKE 512 (512) Q Consensus 497 ~~~~~~~---~~~~~~~~e 512 (512) -.+..+. ...+.=..| T Consensus 610 ~~~~~~~~~~~~~~~~~~~ 628 (641) T protein:vir:94 610 VSDLASRIGIDTSDVAPEA 628 (641) T ss_pred HHHHHHhhcCCchhhhHHH Confidence 0000000 000000000 No 126 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=98.72 E-value=8.5e-08 Score=59.33 Aligned_cols=409 Identities=12% Similarity=0.039 Sum_probs=175.5 Q ss_pred HHHHHHHhc-ccccccc----------cccc--cccccccceee------ecchHHHHHHHHHhhhhccCceecCCc--- Q lcl|NC_010808. 60 LKVLSDYYE-GKTKNLV----------ELTR--RKEEYMADNRV------AHDYASYISDFINGYFLGNPIQCQDDD--- 117 (512) Q Consensus 60 ~~~~~~yy~-G~~~~~~----------~~~~--~~~~~~~~~ri------~~n~~~~iv~~~a~~l~g~~~~~~~~d--- 117 (512) +-.+.+++. +..+... .+.. ......+..++ .+.=..-.|+.+++-+.+-|+.+--.+ T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGT 80 (457) T ss_pred CchhhhhhccccccccccccccccccchhhhhhccccccCCceechHHhhccHHHHHHHHHHHHhHhhCceEEEEecCCc Confidence 111111110 0000000 0000 00000000111 111122345666666666676642111 Q ss_pred -hhH-HHHHHHHHh-cc---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEE Q lcl|NC_010808. 118 -KDV-LEAIEAFND-LN---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRY 190 (512) Q Consensus 118 -~~~-~~~l~~~~~-~n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~ 190 (512) ... ...+..++. .| ....+...+..+.+.+|.||+.+-.+ .|++ .+..++|..+.+..+...... ....+. T Consensus 81 ~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~-~~~~~~ 158 (457) T protein:vir:62 81 RKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA-GPNIAGLDVLDPTKIHVHMVMVDGLR-RKVFEA 158 (457) T ss_pred cccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEeccCCcc-ceeEEE Confidence 111 122333332 23 24557777888899999999988554 5554 466788888766554322111 111111 Q ss_pred eeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHH Q lcl|NC_010808. 191 LRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDN 270 (512) Q Consensus 191 ~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~ 270 (512) |... .. ........|.++.+++++.... . ....|.|.++.+...+..... T Consensus 159 y~~~---~~--g~~~~~~~~~~~eiih~r~~~~----------------~---------~~~~G~sp~~~~~~~i~~~~~ 208 (457) T protein:vir:62 159 YDID---AD--GNEVLLGWFTPRDVLHIPGMML----------------P---------GDFVGCSPISYARESIGLALA 208 (457) T ss_pred EEEc---cC--CceeEEEeeCccceEEecCCCC----------------C---------CceecccHHHHHHHHHHHHHH Confidence 2111 01 1112223355566655532110 0 012477777777776666666 Q ss_pred HHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHH Q lcl|NC_010808. 271 AESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRL 350 (512) Q Consensus 271 ~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l 350 (512) +..-..+.+.-.+.|-.+++-....+++..+..++.-.-... + ..+.......+.+.+++.++.......+.+..+.. T Consensus 209 ~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~-G-~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~ 286 (457) T protein:vir:62 209 AQKYGAHFFRNGAMPGAVVEVPGTMSEEGLARAREAWRAANS-G-VDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQ 286 (457) T ss_pred HHHHHHHHHhccCCcceEEEcCCCCCHHHHHHHHHHHHHHhc-C-ccccCcceecCCCceEEEccCChhHHHHHHHHHHH Confidence 555555556666677766665444455444443321110000 0 00111112234555666665444444556667778 Q ss_pred HHHHHHHhcccccccccccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEe Q lcl|NC_010808. 351 NSDIHMFTNTPNMKDDNFSGTQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVY 429 (512) Q Consensus 351 ~~~i~~~s~~p~~~~~~~~~n~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f 429 (512) ...|+..-++|....+...++. ++..++.... ..+...|.-+++.|...++..-..........+++.+ T Consensus 287 ~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~----------~f~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~ 356 (457) T protein:vir:62 287 VPEIARIFGVPPHLISDATNSTSWGSGLAEQNI----------AFTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNL 356 (457) T ss_pred HHHHHHHhCCCHHHcCCCCCcccccchHHHHHH----------HHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeec Confidence 8889999999987665443322 2322222111 2223344444444444444322111111122344444 Q ss_pred CCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCH--HHH-----HHHHHHHHHHHHHHHHhhcccCCCCCCCC Q lcl|NC_010808. 430 NRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSF--FQDP--ELE-----VKKIEEDEKESIKKAQKGIYKDPRDINDD 498 (512) Q Consensus 430 ~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~--v~d~--~~E-----~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 498 (512) ..-+-.|..+.++++.++ +|+++.-.++++++. +++. +.- +..+..+.+..................++ T Consensus 357 ~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 436 (457) T protein:vir:62 357 DEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEPADD 436 (457) T ss_pred hhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccccccccccccccccCCCccCCCCccCCCCC Confidence 455567888999998887 689999999988754 2222 110 11111110000000000000000001111 Q ss_pred CCCCCCcCcccCCC Q lcl|NC_010808. 499 EQDDDTKDTVDKKE 512 (512) Q Consensus 499 ~~~~~~~~~~~~~e 512 (512) .+.+.+..+.|++| T Consensus 437 ~~~~~~~~~~d~~~ 450 (457) T protein:vir:62 437 EEPDNAEGDPDEGE 450 (457) T ss_pred CCCCCCCCCCcccc Confidence 11111112222222 No 127 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=98.66 E-value=1.3e-07 Score=58.32 Aligned_cols=452 Identities=10% Similarity=0.029 Sum_probs=206.7 Q ss_pred ccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhc---ccccccccccccccccccceeeecchHHHHHHHHHhhhh Q lcl|NC_010808. 31 YDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYE---GKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (512) Q Consensus 31 ~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~---G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~ 107 (512) +..+..-+.+..+...+.+..++..-..+++.+.+|.. |......... .....+.+.++.-+-+...++.+++.|+ T Consensus 1 m~~d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~-~~~~~~~~~~~~dstg~~a~~~LAs~l~ 79 (549) T protein:vir:10 1 MTNDDAKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPD-SEKGRERSQKMFDSTAPLALRNFVAAMD 79 (549) T ss_pred CCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCC-CCcccccccccccchHHHHHHHHHHHHH Confidence 22233333333344444444455555556666666632 2111100011 1111112345666778888888888876 Q ss_pred cc--Cc-----eecCCchh------HHHH-------HHHHH--hccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEE Q lcl|NC_010808. 108 GN--PI-----QCQDDDKD------VLEA-------IEAFN--DLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLY 165 (512) Q Consensus 108 g~--~~-----~~~~~d~~------~~~~-------l~~~~--~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~ 165 (512) +- |+ ++...++. +... +...+ ...+|.....++.++...+|.+.+++-.+..+.+++. T Consensus 80 ~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~~~~~~f~ 159 (549) T protein:vir:10 80 SMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVGKGIVYR 159 (549) T ss_pred hhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeecCCCeeEEE Confidence 53 22 23333321 1122 22322 2467888899999999999999998877766777777 Q ss_pred EEccceeEEEEeCCCCceeEEEEEEeeeeeec--------c--------CCcceEEEEEEEcC---Cc------------ Q lcl|NC_010808. 166 KSDAMSTFVIYDNTIERNSIAGVRYLRTKPID--------K--------TDEDEVFTVDLFTS---HG------------ 214 (512) Q Consensus 166 ~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~--------~--------~~~~~~~~~~~yt~---~~------------ 214 (512) .++-.+.+.-.|. .+++...+|.++..... . ...+....+++|+. .. T Consensus 160 ~~pl~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~~~~ 237 (549) T protein:vir:10 160 NVPMQRLWFAENN--SGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADRDPRKLDGRNM 237 (549) T ss_pred EEEcCeEEEeeCC--CCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEEEeecCCCCCccccccccC Confidence 7777776555554 35666666654322110 0 00111223343321 00 Q ss_pred ---EEEEEecCCccccccccccccccccccccceEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_010808. 215 ---VYRYLTSRTNGLKLTPRENGFESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAM 286 (512) Q Consensus 215 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~ 286 (512) .+++.. .. . . -....+|..+|++.. .+..+|+|..+...+-+..++.+.-......+...+|. T Consensus 238 pf~sv~~e~-~~-~-~------il~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~ 308 (549) T protein:vir:10 238 QFASYWLDE-GR-D-R------IVQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPP 308 (549) T ss_pred ceEEEEEEe-cC-C-E------eeccCCcccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Confidence 111111 10 0 0 011223445666543 34568999999999999999999888888888888888 Q ss_pred eeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccc Q lcl|NC_010808. 287 LLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDD 366 (512) Q Consensus 287 lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~ 366 (512) +.+.-........ ...++.. ....+..+...+..+....+.......++.++..|...-....+... T Consensus 309 ~~v~~~g~~~~~~---l~pgg~~----------~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~ 375 (549) T protein:vir:10 309 LLANEDGVLDGFD---LRSGALN----------WGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQIL 375 (549) T ss_pred eeeccccccccce---eccCCcc----------ccccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhh Confidence 7753211111111 1111110 01111222334555555556667777777777776554332211111 Q ss_pred cccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCcc-c----ccceeeEEeCCCCCcCHH-H Q lcl|NC_010808. 367 NFSGTQSGEAMKYKLFGLEQRTKT-KEGLFTKGLRRRAKLLETILKNTRSIDAN-K----DFNTVRYVYNRNLPKSLI-E 439 (512) Q Consensus 367 ~~~~n~Sg~Ai~~~~~~l~~k~~~-~~~~~~~~l~~~~~li~~~l~~~~~~~~~-~----d~~~i~i~f~~~~p~d~~-~ 439 (512) ..+...||..+......+.+...- ..++-.+.+.-+++-++.++...+..+.. . ....+.+++..++.+... + T Consensus 376 ~~~~~~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i~yis~La~aq~~~ 455 (549) T protein:vir:10 376 VDSGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVEYDSPLNKAMRAG 455 (549) T ss_pred cCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEEEeecHHHHHHHHH Confidence 223456777766654333333222 12222233333333334444444433221 1 233566776554433211 1 Q ss_pred H-------HHHHHHHhccCC-------hHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHH-----hhcccCCC Q lcl|NC_010808. 440 E-------LKAYIDSGGKIS-------QTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQ-----KGIYKDPR 493 (512) Q Consensus 440 ~-------~~~~~kl~g~~s-------~et~~~~~---~~v~----d~~~E~~ri~~E~~~~~~~~~-----~~~~~~~~ 493 (512) . ++.+..++++-| ...++..+ -+++ -.++|++++.+++++....++ ....+... T Consensus 456 ~~~~i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~~qqq~~~~~~~a~~a~~~a~ 535 (549) T protein:vir:10 456 EGAAILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPVEAMSTDEELQAQQAAEAQAAQMQQMLAAAPVAAGAIK 535 (549) T ss_pred HHHHHHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 112222222212 22333222 1222 134566665544333222111 11111111 Q ss_pred CCCCCCCCCCCcCc Q lcl|NC_010808. 494 DINDDEQDDDTKDT 507 (512) Q Consensus 494 ~~~~~~~~~~~~~~ 507 (512) +.-+......+.+. T Consensus 536 ~~~~~~ta~~~~~~ 549 (549) T protein:vir:10 536 DLSDAQTAAQTARV 549 (549) T ss_pred hhhhhcCCCcccCC Confidence 11111112222222 No 128 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=98.59 E-value=2.3e-07 Score=57.00 Aligned_cols=379 Identities=9% Similarity=0.008 Sum_probs=169.4 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri 90 (512) |+++ ++++..-+... ........ .+.-.+-+. ........ ...-+ T Consensus 1 M~~f---~~~~~~~~~~~-----------~~~~~~~~--------------~~~~~~~~~----~~~~~~v~---~~~al 45 (386) T protein:vir:49 1 MPIF---NITNLATESPP-----------INQESFFD--------------IADSDFLAS----LNSSEWVS---AENAL 45 (386) T ss_pred Cchh---hhhccCCCCcc-----------cchhhhhh--------------hhhcccccc----ccCCceec---hhhhh Confidence 4443 33333221110 00000000 000000000 00000000 00111 Q ss_pred ecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEcc Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDA 169 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p 169 (512) ...-....|+.+++-+.+-|+.+.- ......+.+-........+...+..+.+.+|.||+.+-++.+|++ .+..++| T Consensus 46 ~~~~v~~~i~~ia~~ia~~p~~~~~--~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~ 123 (386) T protein:vir:49 46 KNSDLFSIISQLSNDLATAKITTSR--KQLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRP 123 (386) T ss_pred ccHHHHHHHHHHHHHhhhCceeecc--chhhhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecC Confidence 2223334556666666666766532 222222222222234456777888899999999999988888876 5677888 Q ss_pred ceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeec Q lcl|NC_010808. 170 MSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFS 249 (512) Q Consensus 170 ~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~ 249 (512) ..+.+..++.. ..+ +|.....+..... . ..+.+..+++++... +. T Consensus 124 ~~v~v~~~~~~-~~~-----~y~~~~~~~~~~~---~-~~~~~~evih~~~~~-------------------------~~ 168 (386) T protein:vir:49 124 SQVSFNRLDNQ-NGL-----YYNITFDDPHIAP---K-QHVPQNDILHFRLLS-------------------------VD 168 (386) T ss_pred ceeEEEEcCCC-ceE-----EEEEEEcCccccc---e-eEEccccEEEecCCC-------------------------CC Confidence 88877765432 111 1111111111110 1 123344444442210 00 Q ss_pred CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCc Q lcl|NC_010808. 250 NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSV 329 (512) Q Consensus 250 n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (512) ....|.|.+..+...++....+..-..+.+.-.+.|-.+++-.....++.......... ....+.......+.+. T Consensus 169 ~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~-----~~~~n~g~~~vl~~g~ 243 (386) T protein:vir:49 169 GGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDFKTKVSRSRQ-----AMKQMQGGPLVLDDLE 243 (386) T ss_pred CccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHHHHHHHHHHH-----HhccCCCCceecCCCc Confidence 11247788877777777666555555555566667777665433333333333322211 1111111222234555 Q ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 330 DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 408 (512) Q Consensus 330 ~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 408 (512) +++.++.......+.+..+.....|+..-++|....+...+ ..++..++.. +...++.+++.+.. T Consensus 244 ~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~--------------~~~~i~~~l~~i~~ 309 (386) T protein:vir:49 244 DFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQSSLEMIYNI--------------YFKSVSRYLRPFVS 309 (386) T ss_pred eEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccchHHHHHHH--------------HHHHHHHHHHHHHH Confidence 66666554455566677888888999999999777653222 2233333222 22233333333333 Q ss_pred HHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCC---CCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 409 ILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFS---FFQDPELEVKKIEEDEKESIKK 483 (512) Q Consensus 409 ~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~---~v~d~~~E~~ri~~E~~~~~~~ 483 (512) .++..-. ..+.+.....+-.|..+.+..+.++ +|+++.-.++++++ +..++ +.+ T Consensus 310 ~~~~~l~-------~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~~---~~~----------- 368 (386) T protein:vir:49 310 EMSKKLS-------CEVDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILPKE---LPD----------- 368 (386) T ss_pred HHHHHhc-------chhcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCCc---Ccc----------- Confidence 3322111 1122233334445666777777776 67888877777652 22221 100 Q ss_pred HHhhcccCCCCCCCCCCCCCCcCcccCC Q lcl|NC_010808. 484 AQKGIYKDPRDINDDEQDDDTKDTVDKK 511 (512) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (512) ..++.. ...+++|+.++. T Consensus 369 -----~~~~~~-----~~~~gGd~~~~~ 386 (386) T protein:vir:49 369 -----GKNPNR-----TSLKGGEINEQD 386 (386) T ss_pred -----hhccCC-----CCCCCCCCCCCC Confidence 000000 001111111111 No 129 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=98.59 E-value=2.3e-07 Score=56.96 Aligned_cols=418 Identities=10% Similarity=-0.000 Sum_probs=176.9 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc--cccccccceee------ecchHHHHHHHHHhhhhccC Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR--RKEEYMADNRV------AHDYASYISDFINGYFLGNP 110 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~--~~~~~~~~~ri------~~n~~~~iv~~~a~~l~g~~ 110 (512) ......|.+.... . .... ..++..-+..... ......+...+ .+.=..-.|+.+++-+.+-| T Consensus 1 Mg~~~~l~~r~~~---~---~~~~----~~~~~~~~~~~~~~~~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp 70 (457) T protein:vir:13 1 MGFWSALFGRGHS---P---ALDG----IEARAWEPYDPSIYNLGAVAASGETVTPHDALQVSAVFASVRLLSETIATLP 70 (457) T ss_pred Cchhhhhhccccc---c---cccc----cccccccccchHHHhhcccccCCceechHHhhccHHHHHHHHHHHHhhccCc Confidence 1111111110000 0 0000 0000000000000 00000000011 11112234666666666667 Q ss_pred ceecC---Cc--hhHHHHHHHHHhc--c--ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCC Q lcl|NC_010808. 111 IQCQD---DD--KDVLEAIEAFNDL--N--DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTI 180 (512) Q Consensus 111 ~~~~~---~d--~~~~~~l~~~~~~--n--~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~ 180 (512) +++-- +. +.....+..++.. | ....+...+..+.+.+|.+|+.+-.+ .|++ .+..++|..+.++.+... T Consensus 71 ~~~~~~~~~~~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~ 149 (457) T protein:vir:13 71 LSTYSKRGGSRKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMVD 149 (457) T ss_pred eEEEEecCCcccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecCC Confidence 66421 11 1112234444432 2 23356777888899999999988554 4554 567788888777654322 Q ss_pred CceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHH Q lcl|NC_010808. 181 ERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEK 260 (512) Q Consensus 181 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~ 260 (512) .... ...+.|.... . ........|.++.++++.... ..+...|.|.+.. T Consensus 150 ~~~~-~~~~~y~~~~---~--~~~~~~~~~~~~diih~~~~~-------------------------~~~~~~G~s~i~~ 198 (457) T protein:vir:13 150 GLRR-KVFEAYDIDA---D--GNEVLLGWFTPRDVLHIPGMM-------------------------LPGDFVGCSPISY 198 (457) T ss_pred Cccc-eeEEEEEEec---C--CceeeEEeeCccceEEecCCC-------------------------CCCccccccHHHH Confidence 1111 1111121110 1 111122334555555543210 0011247787877 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCH Q lcl|NC_010808. 261 VITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDV 340 (512) Q Consensus 261 v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 340 (512) +...|.....+..-..+.+.-.+.|-.+++.....+++..+..++.-.-.... ..+.......+++.+++.++..... T Consensus 199 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g--~~nag~~~vl~~g~~~~~l~~~~~d 276 (457) T protein:vir:13 199 ARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMSEEGLARAREAWRAANSG--VDNAHRVALLTEGAKFSKVAMSPDE 276 (457) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCCHHHHHHHHHHHHHHhcC--ccccCcceecCCCceEEEccCChhH Confidence 77777666655555555556666777777654444554444433221111000 0111111223455666666554444 Q ss_pred HHHHHHHHHHHHHHHHHhcccccccccccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcc Q lcl|NC_010808. 341 QGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDAN 419 (512) Q Consensus 341 ~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~ 419 (512) ..+.+..+.....|+..-++|....+...++. ++..++-.. ...+...|..+++.|...+..+-..... T Consensus 277 ~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~----------~~f~~~tl~P~~~~ie~~ln~~L~~~~~ 346 (457) T protein:vir:13 277 AQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQN----------IAFTMFSLRPWLERIEAGFNRLLFAETA 346 (457) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcCccc Confidence 44556667778889888899987665443322 222222211 1223444444444444444432221111 Q ss_pred cccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCH--HHHH-----HHHHHHHHHHHHHHHhhc Q lcl|NC_010808. 420 KDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFF--QDP--ELEV-----KKIEEDEKESIKKAQKGI 488 (512) Q Consensus 420 ~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v--~d~--~~E~-----~ri~~E~~~~~~~~~~~~ 488 (512) .....+++.++.-+-.|..+.++++.++ +|+++.-.++++++.- ++. +.-+ ..+.+.-+. +...... T Consensus 347 ~~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~~g~~d~~~~~~n~~~~~~~~~~--~~~~~~~ 424 (457) T protein:vir:13 347 DRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEVGEEPEP--EPAPAPP 424 (457) T ss_pred cCceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeeccccccccccccc--cccCCCC Confidence 1222345555566677888999998887 7899998888887542 222 1100 001110000 0000000 Q ss_pred ccCCCCCC-------CCCCCCCCcCcccCCC Q lcl|NC_010808. 489 YKDPRDIN-------DDEQDDDTKDTVDKKE 512 (512) Q Consensus 489 ~~~~~~~~-------~~~~~~~~~~~~~~~e 512 (512) ...+...+ .+..+++++-+.++++ T Consensus 425 ~~~~~~~~~~~~~~~~g~~d~~~~~~~~~~~ 455 (457) T protein:vir:13 425 AIEPPAEEPDEEPEPEGKPDDEGATEEDDED 455 (457) T ss_pred CCCCCccccCCCCCCCCCCccccCCCCcccc Confidence 00000000 1111111111112222 No 130 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=98.58 E-value=2.5e-07 Score=56.81 Aligned_cols=453 Identities=10% Similarity=0.052 Sum_probs=205.8 Q ss_pred cccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhcc Q lcl|NC_010808. 30 TYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGN 109 (512) Q Consensus 30 ~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~ 109 (512) +..++...+ ....+.+..++..-..+++.+.+|..-.-... .........+...++.-+-+...++++++.|++- T Consensus 1 m~~~~~~~l----~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~-~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ 75 (556) T protein:vir:73 1 MAETEKERL----LKQLAQLKNERTSFESHWLDLSDFINPRGSRF-LTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMSG 75 (556) T ss_pred CChhhHHHH----HHHHHHHHHHhhHHHHHHHHHHHHhccccCCc-CCCCCCcchhhcCccccchHHHHHHHHHHHHHHh Confidence 122222212 12223333344444556777777742110000 0001111112233566678888888888887653 Q ss_pred --Cc-----eecCCch-------------hHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEcc Q lcl|NC_010808. 110 --PI-----QCQDDDK-------------DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDA 169 (512) Q Consensus 110 --~~-----~~~~~d~-------------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p 169 (512) |+ ++...+. .+...+.+.+..++|.....++.++..++|.+.+++-.+..+.+++..++. T Consensus 76 ltpp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~r~~~~~l 155 (556) T protein:vir:73 76 ITSPARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDDQDVIRTMPFPI 155 (556) T ss_pred hcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecCCceEEEEEeec Confidence 21 2222221 133445566777889999999999999999999988777777788888888 Q ss_pred ceeEEEEeCCCCceeEEEEEEeeeeee--------ccC--------Ccce-EEEEEE----EcCCc-----E-------- Q lcl|NC_010808. 170 MSTFVIYDNTIERNSIAGVRYLRTKPI--------DKT--------DEDE-VFTVDL----FTSHG-----V-------- 215 (512) Q Consensus 170 ~~~~~i~d~~~~~~~~~~v~~~~~~~~--------~~~--------~~~~-~~~~~~----yt~~~-----~-------- 215 (512) .+.+.--|. .+++...+|.++.... +.. ..+. ...+++ |.... . T Consensus 156 ~~~~~~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~~~~p~~ 233 (556) T protein:vir:73 156 GSYYLANSP--RGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHCITPNVNRDSGKMDSKNKPYR 233 (556) T ss_pred ceeEEeeCC--CCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEEEeccccccccccCcccceEE Confidence 777665554 4566666665543310 000 0000 112222 21110 0 Q ss_pred -EEEEecCCccccccccccccccccccccceEee-----cCCCCCCc-chHHHHHHHHHHHHHHHHHHHHHHHhcCceee Q lcl|NC_010808. 216 -YRYLTSRTNGLKLTPRENGFESHSFERMPITEF-----SNNERRKG-DYEKVITLIDLYDNAESDTANYMSDLNDAMLL 288 (512) Q Consensus 216 -~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~n~~~g~s-~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv 288 (512) +++.. ......+ ...-+|..+|++.+ .++.+|+| ..+...+-+..++.+.-......+...+|.+. T Consensus 234 s~~~~~-~~~~~~v------l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~ 306 (556) T protein:vir:73 234 SVYFES-GGDSDKL------LRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPMV 306 (556) T ss_pred EEEEEe-cCCCcee------cccCCcccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 11111 1010000 01123445666543 34568999 59999999999999988888888888888776 Q ss_pred eecCCcCChhhhhhhhhccccccchhhhhhcccccCC-CCCcceeEEe-ecCCHHHHHHHHHHHHHHHHHHhcccc--cc Q lcl|NC_010808. 289 IKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIET-EGSVDGGYIY-KQYDVQGTEAYKDRLNSDIHMFTNTPN--MK 364 (512) Q Consensus 289 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~-~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~ 364 (512) +....... .+.+.++.. ..... .+...++.+. .+.+.....+.++.++..|...-.... .. T Consensus 307 v~~~~~~~-----------~~~~~pgg~----~~~~~~~~~~~i~p~~~~~~d~~~~~~~i~~~~~rI~~af~~d~~~~l 371 (556) T protein:vir:73 307 APTSLKNQ-----------RVSLLPGDV----TYLDVISGQDGFKPAYLVNPNTADLLADIQDTRQTINSAYFVDLFMML 371 (556) T ss_pred cccccccc-----------ceeeccCcc----ccccCCCCccceeeeccccccHHHHHHHHHHHHHHHHHHhhcchhhhh Confidence 54321111 111111110 00111 1122334332 223456666667777777754332221 11 Q ss_pred cccccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCc---ccccceeeEEeCCCCCcCHHH- Q lcl|NC_010808. 365 DDNFSGTQSGEAMKYKLFGLEQRTKT-KEGLFTKGLRRRAKLLETILKNTRSIDA---NKDFNTVRYVYNRNLPKSLIE- 439 (512) Q Consensus 365 ~~~~~~n~Sg~Ai~~~~~~l~~k~~~-~~~~~~~~l~~~~~li~~~l~~~~~~~~---~~d~~~i~i~f~~~~p~d~~~- 439 (512) ....+.+.||..+......+.+...- ..++-.+.|.-++.-++.++...+..+. ......|+|++..++...... T Consensus 372 ~~~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aqk~~ 451 (556) T protein:vir:73 372 QNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRIEYISVMAQAQKSI 451 (556) T ss_pred ccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeecHHHHHHHHH Confidence 11223456777776654433333222 2233333444444444455554443322 223456777776655432111 Q ss_pred HH-------HHHHHHhccCC-------hHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHHhhccc--CCCCCC Q lcl|NC_010808. 440 EL-------KAYIDSGGKIS-------QTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQKGIYK--DPRDIN 496 (512) Q Consensus 440 ~~-------~~~~kl~g~~s-------~et~~~~~---~~v~----d~~~E~~ri~~E~~~~~~~~~~~~~~--~~~~~~ 496 (512) .+ +.+..++++-| ...++..+ -+++ -.++|++.+++++++....++..... ..+... T Consensus 452 ~~~~i~~~~~~~~~laq~~Pe~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~r~~~qq~~~~~~~~~~a~~~~~ 531 (556) T protein:vir:73 452 GLTSLSQTVGFIGQLAQFKPEALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQAQAAQAMAMGQAAAQGAK 531 (556) T ss_pred HHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 12222223222 23333322 1222 23456666655544332222211000 000000 Q ss_pred CCCCCCCCcCc----------ccCC Q lcl|NC_010808. 497 DDEQDDDTKDT----------VDKK 511 (512) Q Consensus 497 ~~~~~~~~~~~----------~~~~ 511 (512) .-++....+.+ .-.. T Consensus 532 ~~~~~~~~~~~~l~~~~~~~g~~~~ 556 (556) T protein:vir:73 532 TLSETQTSDPSALTAIANAAGAPQQ 556 (556) T ss_pred HhhhccCCCHHHHHHHHHhhcCCCC Confidence 00000000000 0000 No 131 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=98.55 E-value=3.1e-07 Score=56.27 Aligned_cols=385 Identities=12% Similarity=0.057 Sum_probs=161.8 Q ss_pred ceee--ecchHHHHHHHHHhhhhccCceecCC--------chhHHHHHHHHHhc---c-----------ChhHHHHHHHH Q lcl|NC_010808. 87 DNRV--AHDYASYISDFINGYFLGNPIQCQDD--------DKDVLEAIEAFNDL---N-----------DVESHNRSLGL 142 (512) Q Consensus 87 ~~ri--~~n~~~~iv~~~a~~l~g~~~~~~~~--------d~~~~~~l~~~~~~---n-----------~~~~~~~~~~~ 142 (512) ...+ ..++....|+.+++.+.+-|+.+... .....+.+.+++.. | .+..+...+.. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 1111 24788888999999998888876311 11122333333321 2 23456677888 Q ss_pred HHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCC-------c Q lcl|NC_010808. 143 DLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSH-------G 214 (512) Q Consensus 143 ~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~-------~ 214 (512) +...+|.||+.+.++..|++ .+..++|..+.+.-|... .+..+. ... .++.+|... . T Consensus 81 ~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~------~~~~~~--------~~~-~~~~~~~~~~~~~~~~~ 145 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERG------FVQLLE--------EKE-KYFGVAGDRYQTNGNGD 145 (467) T ss_pred HHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecce------eEeecC--------Cce-eeEEeccccceeecccc Confidence 99999999999999988875 467788888766654321 000000 000 000011100 0 Q ss_pred EEE-EEecCCccccccccccccccccccccceEeecC-----CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceee Q lcl|NC_010808. 215 VYR-YLTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLL 288 (512) Q Consensus 215 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv 288 (512) ... +....... .......+.. -|++++. ...|.|.+......++....+..-....+.-.+.|-.+ T Consensus 146 ~~~~~~~~~~~~------~~~~~~~~~~--diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi 217 (467) T protein:vir:31 146 LDPVFVDADDGS------TGTSVSNPAN--ELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIA 217 (467) T ss_pred eeeeeeeecccc------ccceeEeccc--cEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE Confidence 000 00000000 0000000111 1455432 22577777666555554443333333333334445444 Q ss_pred e--ecCCcCChhhhhhhhhcccc---------ccchhhhhhcccccCCCCCcc-----eeE--Eeec-CCHHHHHHHHHH Q lcl|NC_010808. 289 I--KGNLSLDPDEVKKQKEANVL---------FLEPTVYENRDTGIETEGSVD-----GGY--IYKQ-YDVQGTEAYKDR 349 (512) Q Consensus 289 ~--~g~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~-----~~~--l~~~-~~~~~~~~~~~~ 349 (512) + .|. ..+++..+..+..-.- ........+.........+.+ +++ ++.. .....+.+..+. T Consensus 218 l~~~~~-~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~ 296 (467) T protein:vir:31 218 IIVKGA-ELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGR 296 (467) T ss_pred EEecCc-CCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHH Confidence 4 332 1233333332221000 000000000000011112222 222 1111 123445667777 Q ss_pred HHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc-ccccceeeEE Q lcl|NC_010808. 350 LNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA-NKDFNTVRYV 428 (512) Q Consensus 350 l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~-~~d~~~i~i~ 428 (512) ..+.|...-++|....+...++..+..++.. ....+...|.-+++.+...++..-.... ......+++. T Consensus 297 ~~~~Ia~~fgVpp~~lG~~~~~~~~s~~e~~----------~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~ 366 (467) T protein:vir:31 297 NEHDILKVHDVPPVIAGVVESGAFSTDAEEQ----------RKEFAEETIQPKQHDFGELLYELVHKQGLDAPDWTIEFE 366 (467) T ss_pred HHHHHHHHhCCCHHHcccCCCCCcccCHHHH----------HHHHHHHHHHHHHHHHHHHHHHhhcchhhccCCceEEEe Confidence 7888888888886655432211111111111 1122233344444444444432211000 0011235566 Q ss_pred eCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccc-CCCCCCCCCCCCCCc Q lcl|NC_010808. 429 YNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYK-DPRDINDDEQDDDTK 505 (512) Q Consensus 429 f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 505 (512) +......|..+.++++.++ .|+++.-.+++++++-.-++.++.- ... .......+ .+....+++..+..+ T Consensus 367 ~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi~d~~~~~------~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 439 (467) T protein:vir:31 367 LAKPDTKLQDVEIASQRVQAMQGLLTVNELRDEFGFEPFPEEHVYG------GET-LVAEVTGGSGPGGGIGDQIEQLVE 439 (467) T ss_pred cchhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcccccC------Ccc-cccccccccCCCCcccCcCCCCCC Confidence 6677778999999988876 6899999999998652211111100 000 00000001 011111111111111 Q ss_pred CcccCC----C Q lcl|NC_010808. 506 DTVDKK----E 512 (512) Q Consensus 506 ~~~~~~----e 512 (512) +..++. + T Consensus 440 ~~~~~~~~~~~ 450 (467) T protein:vir:31 440 DRADEIIDSYQ 450 (467) T ss_pred CcccchHhhhh Confidence 111111 0 No 132 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=98.48 E-value=5e-07 Score=55.15 Aligned_cols=410 Identities=10% Similarity=0.056 Sum_probs=176.1 Q ss_pred hhcHHHHHHHHHHHHHHHHH---HHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceec- Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRP---RLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQ- 114 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~---r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~- 114 (512) ....+.+..+.+........ ....+..+.-+...-. ..... .-+.+.-....|+.+++-+..-|+.+- T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~-----~v~~~---~al~~~~v~~~i~~ia~~ia~l~~~~~~ 72 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTI-----SVKGK---NALKVATVFACIKILSESVSKLPLKIYQ 72 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHHHHHhcCCCCcc-----eechh---hhhccHHHHHHHHHHHHhhccCceEEEE Confidence 00011000100000000000 0011111111000000 00000 001223344456666666666676641 Q ss_pred -CCc---hhHHHHHHHHHhc--c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCcee Q lcl|NC_010808. 115 -DDD---KDVLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNS 184 (512) Q Consensus 115 -~~d---~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~ 184 (512) .++ ......+..++.. | ....+...+..+.+.+|.+|+++-.+..|++ .+..++|..+.+..++...... T Consensus 73 ~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~ 152 (429) T protein:vir:10 73 EDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNS 152 (429) T ss_pred ecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccc Confidence 111 1122345555532 2 3446777888899999999999999998986 5778888888777664321110 Q ss_pred EEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHH Q lcl|NC_010808. 185 IAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITL 264 (512) Q Consensus 185 ~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~l 264 (512) . ...+|.. .. .+ . . ..|.++.++++.... ..+...|.|.+..+... T Consensus 153 ~-~~~~~~~-~~--~g-~--~--~~~~~~evih~~~~~-------------------------~~~~~~G~s~i~~~~~~ 198 (429) T protein:vir:10 153 K-TKMWYVV-NT--GG-Q--Q--RVLKPEEILHFKNGI-------------------------TLDGLVGVPTMEYLKST 198 (429) T ss_pred c-ceEEEEE-cc--CC-e--E--EEEccccEEEecCCC-------------------------CCCCcccccHHHHHHHH Confidence 0 0111111 00 00 0 0 123333333332110 01123477778777777 Q ss_pred HHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHH Q lcl|NC_010808. 265 IDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTE 344 (512) Q Consensus 265 iDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 344 (512) ++....+..-....++-.+.|-.+++.....+++..+..++.-.-... + ..+.....-.+++.+++.++.......+. T Consensus 199 i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~~-g-~~n~~~~~vl~~g~~~~~l~~~~~d~q~~ 276 (429) T protein:vir:10 199 LENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSS-G-LQNSHRIALMPVGYQFQPISLNMSDAQFL 276 (429) T ss_pred HHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhc-c-ccccCceeecCCCceEEEccCChhHHHHH Confidence 766665555555555555667666664333444433333222110000 0 01111111234455666555443344455 Q ss_pred HHHHHHHHHHHHHhccccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccc Q lcl|NC_010808. 345 AYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFN 423 (512) Q Consensus 345 ~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~ 423 (512) +..+...+.|+..-++|....+... ++-|. ++ ......+...|..+++.|...++.+--..... .. T Consensus 277 e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn--~e----------~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~-~~ 343 (429) T protein:vir:10 277 ENTELTIRQIATAFGIKMHQLNDLSKATLNN--IE----------QQQQQFYTDTLQATLTMYEQEMTYKLFLDSEL-DK 343 (429) T ss_pred HHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HH----------HHHHHHHHHHHHHHHHHHHHHHHHhhcChhhc-CC Confidence 6667778889999999977665332 22222 11 11222345555555555555554321111111 11 Q ss_pred eeeEEe--CCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCC Q lcl|NC_010808. 424 TVRYVY--NRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDE 499 (512) Q Consensus 424 ~i~i~f--~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 499 (512) ...+.| ..-+..|..+.++++.++ .|+++...++++++.-.-+. .++..--.. ..+........ ...++. T Consensus 344 g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~g--gD~~~~~~n-~~~~d~~~~~~-~k~g~~-- 417 (429) T protein:vir:10 344 GFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAG--GDRLLVNGN-MLPIDMAGQAY-LKGGDT-- 417 (429) T ss_pred CcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCeeeeccc-ccchhhccccc-cCCCCC-- Confidence 223444 455567889999999887 68999998888886522110 000000000 00000000000 011111 Q ss_pred CCCCCcCcccCCC Q lcl|NC_010808. 500 QDDDTKDTVDKKE 512 (512) Q Consensus 500 ~~~~~~~~~~~~e 512 (512) +.+...+.+++- T Consensus 418 -~~~~~~~~~e~~ 429 (429) T protein:vir:10 418 -NGEVSKEGNEGN 429 (429) T ss_pred -CCCCCCCCCCCC Confidence 111111111111 No 133 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=98.46 E-value=5.7e-07 Score=54.82 Aligned_cols=411 Identities=12% Similarity=0.073 Sum_probs=181.9 Q ss_pred hhcHHHHHHHHHHHHHHHHHH------HHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCce Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPR------LKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQ 112 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r------~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~ 112 (512) ...++.+.++..-+.....+. ...+..|. |..+. ....... .-+.+.-....|+.+++-+..-|+. T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~-g~~~~----~~~v~~~---~al~~~~v~~~i~~ia~~ia~lp~~ 72 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWL-GISPS----TISVKGK---NALKVATVFACIKILSESVSKLPLK 72 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHh-CCCcC----ccccchh---hhhccHHHHHHHHHHHHhhccCceE Confidence 222333333321110000000 00111111 10000 0000000 0122233344566666666666776 Q ss_pred ec--CCc---hhHHHHHHHHHhc--c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCC Q lcl|NC_010808. 113 CQ--DDD---KDVLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIE 181 (512) Q Consensus 113 ~~--~~d---~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~ 181 (512) +- .++ ......+..++.. | ....+...+..+.+.+|.+|+++.++..|++ .+..++|..+.++.++... T Consensus 73 ~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~ 152 (432) T protein:vir:10 73 IYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGL 152 (432) T ss_pred EEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccc Confidence 41 111 1122335555532 2 3456777888899999999999999988986 5678889888777664311 Q ss_pred ceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHH Q lcl|NC_010808. 182 RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKV 261 (512) Q Consensus 182 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v 261 (512) .... ...+|... . . .. . ..+.+..++++.... | .+...|.|.+..+ T Consensus 153 ~~~~-~~~~y~~~-~--~-g~---~-~~~~~~eiih~r~~~---------------------~----~~~~~G~s~~~~~ 198 (432) T protein:vir:10 153 LNSK-TKMWYVVN-T--G-GQ---Q-RVLKPEEILHFKNGI---------------------T----LDGLVGVPTMEYL 198 (432) T ss_pred cccc-ceEEEEEe-c--C-Ce---E-EEEccccEEEecCCC---------------------C----CCCcccccHHHHH Confidence 1000 11111110 0 0 00 0 123334444332110 0 1122477888777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHH Q lcl|NC_010808. 262 ITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQ 341 (512) Q Consensus 262 ~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 341 (512) ...++....+..-....+.-.+.|-.+++.....+++..+..++.-.-... + ..+.......+.+.+++.++.+.... T Consensus 199 ~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~-g-~~n~~~~~vl~~g~~~~~l~~~~~d~ 276 (432) T protein:vir:10 199 KSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSS-G-LQNSHRIALMPVGYQFQPISLNMSDA 276 (432) T ss_pred HHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhc-c-cccCCcceecCCCceEEEccCChhHH Confidence 777776666555555555666677777765444444443333322110000 0 01111112234455666665443444 Q ss_pred HHHHHHHHHHHHHHHHhccccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccc Q lcl|NC_010808. 342 GTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANK 420 (512) Q Consensus 342 ~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~ 420 (512) .+.+..+...+.|+..-++|....+... ++-|. ++ ......+...|+.+++.|...++.+--..... T Consensus 277 q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~--~e----------~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~ 344 (432) T protein:vir:10 277 QFLENTELTIRQIATAFGIKMHQLNDLSKATLNN--IE----------QQQQQFYTDTLQATLTMYEQEMTYKLFLDSEL 344 (432) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HH----------HHHHHHHHHHHHHHHHHHHHHHHHhhcChhhc Confidence 5556677778889999999977765332 22221 11 11222344555555555555554321111111 Q ss_pred c-cceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCC Q lcl|NC_010808. 421 D-FNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDIND 497 (512) Q Consensus 421 d-~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 497 (512) . ...+++.+..-+..|..+.++++.++ .|+++.-.+++.+++-.-+....-.+..-. ...+...+ . ....++ T Consensus 345 ~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~~~~~n~-~~~~~~~~--~-~~k~~~- 419 (432) T protein:vir:10 345 DKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLLVNGNM-LPIDMAGQ--A-YLKGGD- 419 (432) T ss_pred CCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecccc-cchhhccc--c-ccCCCC- Confidence 1 11234444455677899999998887 689999888888865321100000000000 00000000 0 000011 Q ss_pred CCCCCCCcCcccCCC Q lcl|NC_010808. 498 DEQDDDTKDTVDKKE 512 (512) Q Consensus 498 ~~~~~~~~~~~~~~e 512 (512) +..+...+.+++- T Consensus 420 --~~~~~~~~~~~~~ 432 (432) T protein:vir:10 420 --TNGEVSKEGNEGN 432 (432) T ss_pred --CCCCCCCCCCCCC Confidence 1111111111111 No 134 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=98.46 E-value=5.7e-07 Score=54.82 Aligned_cols=411 Identities=12% Similarity=0.073 Sum_probs=181.9 Q ss_pred hhcHHHHHHHHHHHHHHHHHH------HHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCce Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPR------LKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQ 112 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r------~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~ 112 (512) ...++.+.++..-+.....+. ...+..|. |..+. ....... .-+.+.-....|+.+++-+..-|+. T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~-g~~~~----~~~v~~~---~al~~~~v~~~i~~ia~~ia~lp~~ 72 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWL-GISPS----TISVKGK---NALKVATVFACIKILSESVSKLPLK 72 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHh-CCCcC----ccccchh---hhhccHHHHHHHHHHHHhhccCceE Confidence 222333333321110000000 00111111 10000 0000000 0122233344566666666666776 Q ss_pred ec--CCc---hhHHHHHHHHHhc--c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCC Q lcl|NC_010808. 113 CQ--DDD---KDVLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIE 181 (512) Q Consensus 113 ~~--~~d---~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~ 181 (512) +- .++ ......+..++.. | ....+...+..+.+.+|.+|+++.++..|++ .+..++|..+.++.++... T Consensus 73 ~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~ 152 (432) T protein:vir:10 73 IYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGL 152 (432) T ss_pred EEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccc Confidence 41 111 1122335555532 2 3456777888899999999999999988986 5678889888777664311 Q ss_pred ceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHH Q lcl|NC_010808. 182 RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKV 261 (512) Q Consensus 182 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v 261 (512) .... ...+|... . . .. . ..+.+..++++.... | .+...|.|.+..+ T Consensus 153 ~~~~-~~~~y~~~-~--~-g~---~-~~~~~~eiih~r~~~---------------------~----~~~~~G~s~~~~~ 198 (432) T protein:vir:10 153 LNSK-TKMWYVVN-T--G-GQ---Q-RVLKPEEILHFKNGI---------------------T----LDGLVGVPTMEYL 198 (432) T ss_pred cccc-ceEEEEEe-c--C-Ce---E-EEEccccEEEecCCC---------------------C----CCCcccccHHHHH Confidence 1000 11111110 0 0 00 0 123334444332110 0 1122477888777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHH Q lcl|NC_010808. 262 ITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQ 341 (512) Q Consensus 262 ~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 341 (512) ...++....+..-....+.-.+.|-.+++.....+++..+..++.-.-... + ..+.......+.+.+++.++.+.... T Consensus 199 ~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~-g-~~n~~~~~vl~~g~~~~~l~~~~~d~ 276 (432) T protein:vir:10 199 KSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSS-G-LQNSHRIALMPVGYQFQPISLNMSDA 276 (432) T ss_pred HHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhc-c-cccCCcceecCCCceEEEccCChhHH Confidence 777776666555555555666677777765444444443333322110000 0 01111112234455666665443444 Q ss_pred HHHHHHHHHHHHHHHHhccccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccc Q lcl|NC_010808. 342 GTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANK 420 (512) Q Consensus 342 ~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~ 420 (512) .+.+..+...+.|+..-++|....+... ++-|. ++ ......+...|+.+++.|...++.+--..... T Consensus 277 q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~--~e----------~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~ 344 (432) T protein:vir:10 277 QFLENTELTIRQIATAFGIKMHQLNDLSKATLNN--IE----------QQQQQFYTDTLQATLTMYEQEMTYKLFLDSEL 344 (432) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HH----------HHHHHHHHHHHHHHHHHHHHHHHHhhcChhhc Confidence 5556677778889999999977765332 22221 11 11222344555555555555554321111111 Q ss_pred c-cceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCC Q lcl|NC_010808. 421 D-FNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDIND 497 (512) Q Consensus 421 d-~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 497 (512) . ...+++.+..-+..|..+.++++.++ .|+++.-.+++.+++-.-+....-.+..-. ...+...+ . ....++ T Consensus 345 ~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~~~~~n~-~~~~~~~~--~-~~k~~~- 419 (432) T protein:vir:10 345 DKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLLVNGNM-LPIDMAGQ--A-YLKGGD- 419 (432) T ss_pred CCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecccc-cchhhccc--c-ccCCCC- Confidence 1 11234444455677899999998887 689999888888865321100000000000 00000000 0 000011 Q ss_pred CCCCCCCcCcccCCC Q lcl|NC_010808. 498 DEQDDDTKDTVDKKE 512 (512) Q Consensus 498 ~~~~~~~~~~~~~~e 512 (512) +..+...+.+++- T Consensus 420 --~~~~~~~~~~~~~ 432 (432) T protein:vir:10 420 --TNGEVSKEGNEGN 432 (432) T ss_pred --CCCCCCCCCCCCC Confidence 1111111111111 No 135 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=98.46 E-value=5.7e-07 Score=54.82 Aligned_cols=411 Identities=12% Similarity=0.073 Sum_probs=181.9 Q ss_pred hhcHHHHHHHHHHHHHHHHHH------HHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCce Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPR------LKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQ 112 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r------~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~ 112 (512) ...++.+.++..-+.....+. ...+..|. |..+. ....... .-+.+.-....|+.+++-+..-|+. T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~-g~~~~----~~~v~~~---~al~~~~v~~~i~~ia~~ia~lp~~ 72 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWL-GISPS----TISVKGK---NALKVATVFACIKILSESVSKLPLK 72 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHh-CCCcC----ccccchh---hhhccHHHHHHHHHHHHhhccCceE Confidence 222333333321110000000 00111111 10000 0000000 0122233344566666666666776 Q ss_pred ec--CCc---hhHHHHHHHHHhc--c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCC Q lcl|NC_010808. 113 CQ--DDD---KDVLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIE 181 (512) Q Consensus 113 ~~--~~d---~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~ 181 (512) +- .++ ......+..++.. | ....+...+..+.+.+|.+|+++.++..|++ .+..++|..+.++.++... T Consensus 73 ~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~ 152 (432) T protein:vir:10 73 IYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGL 152 (432) T ss_pred EEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccc Confidence 41 111 1122335555532 2 3456777888899999999999999988986 5678889888777664311 Q ss_pred ceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHH Q lcl|NC_010808. 182 RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKV 261 (512) Q Consensus 182 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v 261 (512) .... ...+|... . . .. . ..+.+..++++.... | .+...|.|.+..+ T Consensus 153 ~~~~-~~~~y~~~-~--~-g~---~-~~~~~~eiih~r~~~---------------------~----~~~~~G~s~~~~~ 198 (432) T protein:vir:10 153 LNSK-TKMWYVVN-T--G-GQ---Q-RVLKPEEILHFKNGI---------------------T----LDGLVGVPTMEYL 198 (432) T ss_pred cccc-ceEEEEEe-c--C-Ce---E-EEEccccEEEecCCC---------------------C----CCCcccccHHHHH Confidence 1000 11111110 0 0 00 0 123334444332110 0 1122477888777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHH Q lcl|NC_010808. 262 ITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQ 341 (512) Q Consensus 262 ~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 341 (512) ...++....+..-....+.-.+.|-.+++.....+++..+..++.-.-... + ..+.......+.+.+++.++.+.... T Consensus 199 ~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~-g-~~n~~~~~vl~~g~~~~~l~~~~~d~ 276 (432) T protein:vir:10 199 KSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSS-G-LQNSHRIALMPVGYQFQPISLNMSDA 276 (432) T ss_pred HHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhc-c-cccCCcceecCCCceEEEccCChhHH Confidence 777776666555555555666677777765444444443333322110000 0 01111112234455666665443444 Q ss_pred HHHHHHHHHHHHHHHHhccccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccc Q lcl|NC_010808. 342 GTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANK 420 (512) Q Consensus 342 ~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~ 420 (512) .+.+..+...+.|+..-++|....+... ++-|. ++ ......+...|+.+++.|...++.+--..... T Consensus 277 q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~--~e----------~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~ 344 (432) T protein:vir:10 277 QFLENTELTIRQIATAFGIKMHQLNDLSKATLNN--IE----------QQQQQFYTDTLQATLTMYEQEMTYKLFLDSEL 344 (432) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HH----------HHHHHHHHHHHHHHHHHHHHHHHHhhcChhhc Confidence 5556677778889999999977765332 22221 11 11222344555555555555554321111111 Q ss_pred c-cceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCC Q lcl|NC_010808. 421 D-FNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDIND 497 (512) Q Consensus 421 d-~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 497 (512) . ...+++.+..-+..|..+.++++.++ .|+++.-.+++.+++-.-+....-.+..-. ...+...+ . ....++ T Consensus 345 ~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~~~~~n~-~~~~~~~~--~-~~k~~~- 419 (432) T protein:vir:10 345 DKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLLVNGNM-LPIDMAGQ--A-YLKGGD- 419 (432) T ss_pred CCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecccc-cchhhccc--c-ccCCCC- Confidence 1 11234444455677899999998887 689999888888865321100000000000 00000000 0 000011 Q ss_pred CCCCCCCcCcccCCC Q lcl|NC_010808. 498 DEQDDDTKDTVDKKE 512 (512) Q Consensus 498 ~~~~~~~~~~~~~~e 512 (512) +..+...+.+++- T Consensus 420 --~~~~~~~~~~~~~ 432 (432) T protein:vir:10 420 --TNGEVSKEGNEGN 432 (432) T ss_pred --CCCCCCCCCCCCC Confidence 1111111111111 No 136 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=98.43 E-value=6.9e-07 Score=54.38 Aligned_cols=407 Identities=11% Similarity=0.028 Sum_probs=179.3 Q ss_pred HHHHHHHHHHHHHHH-HHHhcccccccccc------c---cccccc-ccceeeecchHHHHHHHHHhhhhccCceec-CC Q lcl|NC_010808. 49 IEHHMDYQRPRLKVL-SDYYEGKTKNLVEL------T---RRKEEY-MADNRVAHDYASYISDFINGYFLGNPIQCQ-DD 116 (512) Q Consensus 49 i~~~~~~~~~r~~~~-~~yy~G~~~~~~~~------~---~~~~~~-~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~-~~ 116 (512) +++-+.+...+.+.- ..| .|........ . ...... .++.-+.+.-....|+.+++-+..-|+.+- .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~-~g~~~s~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~lp~~~~~~~ 79 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKW-LGVPISLTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIATLPLNLYQTK 79 (437) T ss_pred CCcchhhhhhhhHHhhhhh-cCCcccCCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhhCceeEEEEc Confidence 111112222222221 122 2221100000 0 000000 000111223344456666666666666541 11 Q ss_pred -c----hhHHHHHHHHHhc--c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeE Q lcl|NC_010808. 117 -D----KDVLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSI 185 (512) Q Consensus 117 -d----~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~ 185 (512) + ......+..+|.. | ....+...+..+++.+|.+|+++-.+. |++ .+..++|..+.+..+.. +.+ T Consensus 80 ~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-g~~~~L~~l~p~~v~i~~~~~--g~~- 155 (437) T protein:vir:10 80 PDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSA-GVLIGLELMLPQRTTVKRLTS--GAL- 155 (437) T ss_pred CCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcEEEEEEEcCcceEEEECCC--CeE- Confidence 1 1112234444432 2 445667778888999999999998874 765 46778888887776542 111 Q ss_pred EEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHH Q lcl|NC_010808. 186 AGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLI 265 (512) Q Consensus 186 ~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~li 265 (512) . |.+.... + ....+.+..|++++... .+...|.|.+..+...+ T Consensus 156 ---~-y~~~~~~--g-----~~~~~~~~dIih~r~~~--------------------------~d~~~G~spi~~~~~~i 198 (437) T protein:vir:10 156 ---Q-YTYRNVD--G-----TVSTLAEDDVFHVRGFS--------------------------LDGLMGLTPIQYAREVL 198 (437) T ss_pred ---E-EEEEecC--c-----eEEEEccccEEEecCcC--------------------------CCCcccccHHHHHHHHH Confidence 1 1111111 0 01133444444432110 01224777777777666 Q ss_pred HHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHH Q lcl|NC_010808. 266 DLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEA 345 (512) Q Consensus 266 Da~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 345 (512) +....+..-..+.+.-.+.|-.+++.....+++..+..+..-.-.... ..+.....-.+++.+++.++.......+.+ T Consensus 199 ~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g--~~nag~~~vl~~g~~~~~l~~~~~d~q~~e 276 (437) T protein:vir:10 199 GNSTAANKTSASVFRNGLRPSGVLSTDQILQKEKRAEIRTDLAEQFGG--AMQAGKTMVLEAGMKYQAITMNPGDVQLLE 276 (437) T ss_pred HHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcC--ccccCcceeccCCceEEeccCChhhHHHHH Confidence 665555555555556666677777654444555444433221100000 011111122344556655554444445566 Q ss_pred HHHHHHHHHHHHhccccccccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccce Q lcl|NC_010808. 346 YKDRLNSDIHMFTNTPNMKDDNFSGT-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNT 424 (512) Q Consensus 346 ~~~~l~~~i~~~s~~p~~~~~~~~~n-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~ 424 (512) ..+.....|+..-++|....+...++ ..+..++. .....+...|.-.+..|...+..+-.......-.. T Consensus 277 ~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~----------~~~~f~~~tl~P~~~~ie~~l~~kll~~~e~~~~~ 346 (437) T protein:vir:10 277 TRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQ----------QTLGFLTFTLRPWLTRIEQAARRSLLRPGERDQFY 346 (437) T ss_pred HHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHH----------HHHHHHHHHHHHHHHHHHHHHHhhccCccccCceE Confidence 66777888999999997766543322 21222221 12233445555555555554443221111111112 Q ss_pred eeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC-CHHHHHHHHHHHHHHHHHHHHh-hcccCC--CCCCCC Q lcl|NC_010808. 425 VRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQ-DPELEVKKIEEDEKESIKKAQK-GIYKDP--RDINDD 498 (512) Q Consensus 425 i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~-d~~~E~~ri~~E~~~~~~~~~~-~~~~~~--~~~~~~ 498 (512) +++.+..-+..|..+.++++.++ +|+++.-.+++.++.-. +...+.-.+...-. ..+.... ...... ...+.+ T Consensus 347 ~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 425 (437) T protein:vir:10 347 AEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKENLPPMGGNAAVLTVQSALL-PIDKLGEHTTATAAQDALKAWL 425 (437) T ss_pred EEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcceEeecCccc-chhhccCcCCCcchhccccccC Confidence 44444555677888999988887 68999988888875422 00011100000000 0000000 000000 001111 Q ss_pred CCCCCCcCcccC Q lcl|NC_010808. 499 EQDDDTKDTVDK 510 (512) Q Consensus 499 ~~~~~~~~~~~~ 510 (512) .+.++++.++++ T Consensus 426 ~~~~~~~~~~e~ 437 (437) T protein:vir:10 426 YQEEKTRATQER 437 (437) T ss_pred CCCCCCCccccC Confidence 111111111111 No 137 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=98.40 E-value=8.3e-07 Score=53.91 Aligned_cols=438 Identities=9% Similarity=0.047 Sum_probs=207.0 Q ss_pred hcHHHHHHHHHHHHHHH---HHHHHHHHHHhcccccccc--ccccccc---ccccceeeecchHHHHHHHHHhhhhcc-- Q lcl|NC_010808. 40 QNINEVSKYIEHHMDYQ---RPRLKVLSDYYEGKTKNLV--ELTRRKE---EYMADNRVAHDYASYISDFINGYFLGN-- 109 (512) Q Consensus 40 ~~~~~l~~~i~~~~~~~---~~r~~~~~~yy~G~~~~~~--~~~~~~~---~~~~~~ri~~n~~~~iv~~~a~~l~g~-- 109 (512) -+...|.+....-+..+ ..+++.+.+|.. +... ....... ..+.+.|+--+-+...++++++.|++- T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~l---P~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~lt 77 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIM---PMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLT 77 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhc---ccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhc Confidence 12334444444433444 344555555532 1100 0000011 112345667788889999998888753 Q ss_pred Cc-----eecCCch-------------hHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECC--CCceEEEEEcc Q lcl|NC_010808. 110 PI-----QCQDDDK-------------DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ--DDETRLYKSDA 169 (512) Q Consensus 110 ~~-----~~~~~d~-------------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~--~g~~~i~~~~p 169 (512) |+ +++..|. +....+...+..++|.....++.++..++|.+.+++-.|+ .+.+++..++. T Consensus 78 Pp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~~pl 157 (547) T protein:vir:10 78 SPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQSSPI 157 (547) T ss_pred CCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEEEeec Confidence 21 2222221 2334455667778899999999999999999988876554 35678888888 Q ss_pred ceeEEEEeCCCCceeEEEEEEeeeeeec-------------------cCCcceEEEEEEEcC---Cc------------- Q lcl|NC_010808. 170 MSTFVIYDNTIERNSIAGVRYLRTKPID-------------------KTDEDEVFTVDLFTS---HG------------- 214 (512) Q Consensus 170 ~~~~~i~d~~~~~~~~~~v~~~~~~~~~-------------------~~~~~~~~~~~~yt~---~~------------- 214 (512) .+.+.--|. .+++...+|.++..... .........+++|+. .. T Consensus 158 ~~~~v~~d~--~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~~~~~ 235 (547) T protein:vir:10 158 QDSYFEEDS--RGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAGTVLA 235 (547) T ss_pred ceEEEeeCC--CcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCccccceee Confidence 777666554 35566666654332100 000010112222210 00 Q ss_pred -------EEEEEecCCccccccccccccccccccccceEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010808. 215 -------VYRYLTSRTNGLKLTPRENGFESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDL 282 (512) Q Consensus 215 -------~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~ 282 (512) .+++... +... -....+|..+|++.+ ..+.+|+|..+...+-+..++.+.-......+.. T Consensus 236 ~~~~p~~s~~~e~~--~~~~------~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~ 307 (547) T protein:vir:10 236 PTERPFGKKWILKE--GAVQ------LGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKV 307 (547) T ss_pred ccccceeEEEEEec--Ccee------eeecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0011110 0000 011223555676654 3456899999999999999999988888888888 Q ss_pred cCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_010808. 283 NDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPN 362 (512) Q Consensus 283 ~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 362 (512) ..|.+.+.-...... +.+.++ .. +..++..+++.+....+.......++.++..|...-.... T Consensus 308 ~~pp~~v~~~g~~~~-----------~~~~pg----g~--~~~~~~~~v~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~ 370 (547) T protein:vir:10 308 IDPAIMVTERGLISD-----------IDLGAS----GL--TVVRDMESMKPFESRARFDVSSIQLTDLRSAVRRIYYVDQ 370 (547) T ss_pred hcCceeccccccccc-----------ceecCC----ee--eecCCcccceeeecccchHHHHHHHHHHHHHHHHHhhhhh Confidence 888875421100110 111111 10 1112334555566666777777778888777755432221 Q ss_pred cccccccccchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccCCCcc------cccceeeEEeCCCCCc Q lcl|NC_010808. 363 MKDDNFSGTQSGEAMKYKLFGLEQRTKTK-EGLFTKGLRRRAKLLETILKNTRSIDAN------KDFNTVRYVYNRNLPK 435 (512) Q Consensus 363 ~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~-~~~~~~~l~~~~~li~~~l~~~~~~~~~------~d~~~i~i~f~~~~p~ 435 (512) +.. ..+...+|..+......+.+...-. .++-.+.+.-++.-++.++...+..+.. .....++|++..++.+ T Consensus 371 ~~~-~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Lar 449 (547) T protein:vir:10 371 LQM-KDSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSR 449 (547) T ss_pred hhc-CCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHH Confidence 111 1234567777766533333332221 1222233333444344455444433221 1234567777666544 Q ss_pred CHHH--------HHHHHHHHhccCC-------hHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHHhhcc--cC Q lcl|NC_010808. 436 SLIE--------ELKAYIDSGGKIS-------QTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQKGIY--KD 491 (512) Q Consensus 436 d~~~--------~~~~~~kl~g~~s-------~et~~~~~---~~v~----d~~~E~~ri~~E~~~~~~~~~~~~~--~~ 491 (512) .... .++.+..++++-| ...++..+ -+++ -.++|++.+.+++++....++.... .. T Consensus 450 aq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~qaa~~~~~ 529 (547) T protein:vir:10 450 AQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVTSIRKNRSQTQQKAEQAAIAEAE 529 (547) T ss_pred HHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3211 1122222333222 22333322 1232 2356777777665543332221110 00 Q ss_pred CCCCCCCCCCCCC-cCcc Q lcl|NC_010808. 492 PRDINDDEQDDDT-KDTV 508 (512) Q Consensus 492 ~~~~~~~~~~~~~-~~~~ 508 (512) ......-+..... ++.. T Consensus 530 g~~m~~~~~~~a~~~~~~ 547 (547) T protein:vir:10 530 GNAMEAQGKGQAALKENQ 547 (547) T ss_pred HHHHHhhcCcccchhccC Confidence 0000000000000 1111 No 138 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=98.34 E-value=1.2e-06 Score=53.05 Aligned_cols=402 Identities=9% Similarity=0.022 Sum_probs=178.4 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHH---------HHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhcc Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKV---------LSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGN 109 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~---------~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~ 109 (512) ....+.|.+ ........+... ...++.+. .. ....... ...-+..+-....|+.+++-+..- T Consensus 1 MG~f~~lf~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~--~~~~~v~---~~~al~~~~v~~ci~~ia~~iA~l 71 (422) T protein:vir:13 1 MGFLRGLFN---KKNNNDEKRSNYDEDIGIDISDSNFWEKF-GI--KLNFSVR---GKRALKENTVYVCTKIRAESIGKL 71 (422) T ss_pred Cchhhhhhh---ccCCccchhhhhhhccccccCcchhhhhc-cc--cCCcccc---hhhhhccHHHHHHHHHHHHhhhhC Confidence 111111110 000000000000 00011000 00 0000000 000122333445566777777777 Q ss_pred CceecCCc-hhHHHHHHHHHh--ccC---hhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCc Q lcl|NC_010808. 110 PIQCQDDD-KDVLEAIEAFND--LND---VESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIER 182 (512) Q Consensus 110 ~~~~~~~d-~~~~~~l~~~~~--~n~---~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~ 182 (512) |+.+--.. +.....+..++. -|. ...+...+..+.+.+|.||+++.++..|++ .+..++|..+.+++++.... T Consensus 72 p~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~~~~ 151 (422) T protein:vir:13 72 SLKIYKDKEEYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDDNFL 151 (422) T ss_pred ceEEEecCcccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCcce Confidence 77652221 111223444443 232 346778888999999999999999988875 57788999998888654211 Q ss_pred eeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHH Q lcl|NC_010808. 183 NSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVI 262 (512) Q Consensus 183 ~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~ 262 (512) ....-+ +|...... + . ...+.++.+.++.... ..+...|.|.+..+. T Consensus 152 ~~~~~~-~y~~~~~~--g--~---~~~~~~~eiih~~~~~-------------------------~~~~~~G~s~~~~~~ 198 (422) T protein:vir:13 152 SSLSKV-WYVVTDKN--G--K---EHKLLPDEMLHFIGDI-------------------------TLDGLIGIKPLDYLR 198 (422) T ss_pred eccceE-EEEEEeCC--C--e---EEEEcccceEEEcCCC-------------------------CCCCcccccHHHHHH Confidence 111111 11111100 0 0 0112333333332110 011224778888777 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHH Q lcl|NC_010808. 263 TLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQG 342 (512) Q Consensus 263 ~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 342 (512) ..++....+..-....+.-.+.|-.+++-....+++..+..++.-.-... + ..+.......+.+.+++.++....... T Consensus 199 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~-g-~~n~~~~~vl~~g~~~~~l~~~~~d~q 276 (422) T protein:vir:13 199 CTIENGRATQEFINKFFKNGLSIKGIVQYVGDLDEKAKKIFKKEFESMSN-G-LENAHSISLLPFGYQFQPISLSMADAQ 276 (422) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhc-C-ccccCCceecCCCceeeeccCChhHHH Confidence 77776666655555556666667777765434444444333322111100 0 001111122244556655654444455 Q ss_pred HHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccc Q lcl|NC_010808. 343 TEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF 422 (512) Q Consensus 343 ~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~ 422 (512) +.+..+.....|+..-++|....+...+ .+...++. .....+...|..+++.|...+..+-...... . T Consensus 277 ~le~~~~~~~~Ia~~fgVpp~~lg~~~~-~~~sn~e~----------~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~-~ 344 (422) T protein:vir:13 277 FLENSKLTKRELAATFGMKSYHLNDLER-ATFNNLTE----------QQKDFYVTTLQSSLTVYEQEIQDKLFSQYET-L 344 (422) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhCCCCC-CCcccHHH----------HHHHHHHHHHHHHHHHHHHHHHHhhCChhhh-c Confidence 5666777788899999999776653321 11111111 1122334445555555544444321111110 1 Q ss_pred ceeeEEe--CCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCC Q lcl|NC_010808. 423 NTVRYVY--NRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDD 498 (512) Q Consensus 423 ~~i~i~f--~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 498 (512) .+..+.| ..-+-.|..+.++++.++ .|+++.-.++++++.-.-+. -+++.. .......+. .+ T Consensus 345 ~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~g--gD~~~~--------~~n~~~l~~----~~ 410 (422) T protein:vir:13 345 QDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPPVEG--GDRLLV--------NGNMIPIEM----AG 410 (422) T ss_pred CCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCeeee--------ccCccchhh----cc Confidence 2334455 344556888889998887 68999999998886532110 000000 000000000 00 Q ss_pred CCCCCCcCcccCCC Q lcl|NC_010808. 499 EQDDDTKDTVDKKE 512 (512) Q Consensus 499 ~~~~~~~~~~~~~e 512 (512) + ...+...+++| T Consensus 411 ~--~~~~~g~~~g~ 422 (422) T protein:vir:13 411 E--QYKKGGEKGGK 422 (422) T ss_pred c--ccccCCCcCCC Confidence 0 01111112222 No 139 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=98.33 E-value=1.3e-06 Score=52.83 Aligned_cols=428 Identities=11% Similarity=0.067 Sum_probs=166.5 Q ss_pred ccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHH Q lcl|NC_010808. 20 LFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYIS 99 (512) Q Consensus 20 ~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv 99 (512) +|+.+..|. .+ ...+.+.+ +....+.-......+||. +.... ....+..-..++..-+| T Consensus 1 ~~~~~~~i~--------s~-~~~~~i~~---~~~~s~~~~~~~~~~~~~--------pp~~~-~~la~l~~~n~~v~scI 59 (542) T protein:vir:41 1 MFNYHLSIR--------SL-EKYKAIKR---EEVESQALGETRFEEYVE--------PKVNP-LVLLSLLQVNPYHASAC 59 (542) T ss_pred Ccccccccc--------cc-ccchhhhh---ccccccccccccCCcccc--------CCCCH-HHHHHHHhhcHHHHHHH Confidence 566555554 11 11111111 000000000000111111 00000 00001111235667888 Q ss_pred HHHHhhhhccCceecCCchhHHHHHHHHHhc--cChhHHHHHHHHHHHhCCeEEEEEEECCCCceE-EEEEccceeEEEE Q lcl|NC_010808. 100 DFINGYFLGNPIQCQDDDKDVLEAIEAFNDL--NDVESHNRSLGLDLSIYGKAYELMIRNQDDETR-LYKSDAMSTFVIY 176 (512) Q Consensus 100 ~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~--n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~-i~~~~p~~~~~i~ 176 (512) +.++..+.+-|+++...+.. .+..++-+ -....+...+..+.+.+|.||+.+..+..|++. +..++|..+.+.. T Consensus 60 ~~ia~~IA~l~~~~~~~~~~---~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~~ 136 (542) T protein:vir:41 60 SIKANDIIRTGYILEGDDEG---VVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVHK 136 (542) T ss_pred HHHHHHHhhCceeeecccch---hhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEEE Confidence 99999998889887655433 23444322 235566778888999999999999999888764 6778888776655 Q ss_pred eCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCC----- Q lcl|NC_010808. 177 DNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNN----- 251 (512) Q Consensus 177 d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~----- 251 (512) |... .++++. .... .+...|.....+.... +. ....+..=-|+++++. T Consensus 137 d~~~------~~~~~~-------~~~~-~~~~~y~~~~~~~~~~--g~-----------~~~~~~~~eIiHir~~~~~~~ 189 (542) T protein:vir:41 137 DGSR------YRQTWD-------GVNI-THFKDYRYEGEINPET--GE-----------DQDSVGANELVFIHIPSPVCS 189 (542) T ss_pred cCCe------eEeeec-------CCcc-eeEEeecccccccccc--cc-----------cccccCcccEEEecCCCCCCC Confidence 4321 111110 0011 1111111111000000 00 0000000014454422 Q ss_pred CCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeee--ecCCcC--------Chhhhhhhhhccccccch-hhhhhcc Q lcl|NC_010808. 252 ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI--KGNLSL--------DPDEVKKQKEANVLFLEP-TVYENRD 320 (512) Q Consensus 252 ~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~--~g~~~~--------~~~~~~~~~~~~~~~~~~-~~~~~~~ 320 (512) ..|.|.+..+...+.....+..-..+.+.-.+.|-.++ .|.... ..+..+..+..-.-.... ....+.. T Consensus 190 ~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~ 269 (542) T protein:vir:41 190 YYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTP 269 (542) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCce Confidence 35777777666555444333333333333334454443 333211 111111111100000000 0000000 Q ss_pred cc--cC--CCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccch-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 321 TG--IE--TEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS-GEAMKYKLFGLEQRTKTKEGLF 395 (512) Q Consensus 321 ~~--~~--~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~S-g~Ai~~~~~~l~~k~~~~~~~~ 395 (512) .. .. .+++.++.-++.......+.+..+...+.|+..-++|....+...++.+ +.-++. .....+ T Consensus 270 ~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq----------~~~~f~ 339 (542) T protein:vir:41 270 LVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEV----------TRRTYY 339 (542) T ss_pred eEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHH----------HHHHHH Confidence 00 11 1223334344433344555666777788899988999776654332221 111111 112233 Q ss_pred HHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCC--CCcCHHHHHHHHHHHhccCChHHHHHhCCCC---CCHH--- Q lcl|NC_010808. 396 TKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRN--LPKSLIEELKAYIDSGGKISQTTLMSLFSFF---QDPE--- 467 (512) Q Consensus 396 ~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~--~p~d~~~~~~~~~kl~g~~s~et~~~~~~~v---~d~~--- 467 (512) ...|.-+++.|...++..-... .. .++.+.|+.. +..|..+.++.+ -.+|+++...+++.++.+ +|+- T Consensus 340 ~~tL~P~~~~ie~~ln~~L~~~--~~-~~~~~~f~~~~ll~~d~~~~~~~~-v~~GilT~NE~Re~L~g~~pgdd~~l~p 415 (542) T protein:vir:41 340 ESVVRPQQNIISSILTDFFQVK--FN-PKTRFKFNDETLLESDSVRNCALL-VQSGVLTPAEARERLFGLDGGPDIFMVP 415 (542) T ss_pred HHHHHHHHHHHHHHHHhhcccc--cC-CceEEEecchhhcchHHHHHHHHH-HhCCCCCHHHHHHhhCCCCCCCcccccc Confidence 4444444444444444321111 11 2345666533 333333322221 127899998888766433 2221 Q ss_pred -----HHHHHHHHHHH-HHHHHHHhh-cccCCCC-----CCCCCCCCCCcCcccCCC Q lcl|NC_010808. 468 -----LEVKKIEEDEK-ESIKKAQKG-IYKDPRD-----INDDEQDDDTKDTVDKKE 512 (512) Q Consensus 468 -----~E~~ri~~E~~-~~~~~~~~~-~~~~~~~-----~~~~~~~~~~~~~~~~~e 512 (512) +.+...+++.+ ...+..... ....+.. ......+..++-+..+.| T Consensus 416 ~~~~~~~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (542) T protein:vir:41 416 SKGAAKSVKRQERNYEKNQIREIRKIYAKYRPRFNEIISSKLSAEEKKKKIDESLAE 472 (542) T ss_pred ccccccccccCCcCCCCCchhhhhhcccccCccccccccccccchhhcccccchhhh Confidence 00000000000 000000000 0000000 001111111222222222 No 140 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=98.32 E-value=1.3e-06 Score=52.77 Aligned_cols=395 Identities=8% Similarity=0.017 Sum_probs=178.5 Q ss_pred hhcHHHHHHHHHHHHHHHHH-HHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceec--C Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRP-RLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQ--D 115 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~-r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~--~ 115 (512) ......+.++..... .... .-..+..+.-|.. ..+..-+.+.-....|+.+++-+..-|+.+- . T Consensus 1 MG~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~~~------------~~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~ 67 (411) T protein:vir:81 1 MGWWSRLTRFFRPRN-ETVDMTNPLLLQWLGVDP------------DTPRNQLSEATYFACLKILSESLGKLPLKMYQKT 67 (411) T ss_pred CchHHHHHhhccCcc-cccccchHHHHHHhcCcc------------cChhhhhccHHHHHHHHHHHHhHhhCceeEEEec Confidence 111122222111100 0000 0001111111110 0011112223344556777776666677651 1 Q ss_pred Cc---hhHHHHHHHHHhc--c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEE Q lcl|NC_010808. 116 DD---KDVLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIA 186 (512) Q Consensus 116 ~d---~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~ 186 (512) ++ +.....+..++.. | ....+...+..+.+.+|.||+++..+. |++ .+..++|..+.++.++........ T Consensus 68 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~~l~~l~~~~v~~~~~~~~~~~~~~ 146 (411) T protein:vir:81 68 ERGIVKSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYSG-PQLQALWILPSQYVTIVVDDRGLLGEKN 146 (411) T ss_pred CCceeeecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CceEEEEEECCceEEEEEcCcccccccc Confidence 11 1112234444432 3 345677778888999999999988874 554 467789999888876532111111 Q ss_pred EEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHH Q lcl|NC_010808. 187 GVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLID 266 (512) Q Consensus 187 ~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liD 266 (512) .+ +|......+ . ..+ .+.++.+++++... | .+...|.|.+..+...++ T Consensus 147 ~~-~~~~~~~~~----g-~~~-~~~~~eiih~k~~~---------------------~----~~~~~G~s~~~~~~~~i~ 194 (411) T protein:vir:81 147 AI-WYRYNDPYD----G-KMY-VFRNDEILHFKTSV---------------------T----FDGITGLSVRDVLKHTVD 194 (411) T ss_pred eE-EEEEEecCC----c-eEE-EEccccEEEEcCCC---------------------C----CCCcccccHHHHHHHHHH Confidence 11 111111000 0 001 13344444432110 0 012246777777777776 Q ss_pred HHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHH Q lcl|NC_010808. 267 LYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAY 346 (512) Q Consensus 267 a~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 346 (512) ....+..-..+.+.-.+.|-.+++.....+++..+..+..-.-... + ..+.......+++.+++.++.......+.+. T Consensus 195 ~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~-g-~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~ 272 (411) T protein:vir:81 195 GALESQKFMNNLYKTGLTGKAVLEYTGDLNQEARDRLVKGFEQFAN-G-SKNAGKIIPVPLGMKLVPLDIKLTDSQFFEL 272 (411) T ss_pred HHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhc-C-ccccCCceecCCCceEEEccCCHHHHHHHHH Confidence 6666655555555666678877766444444443333321110000 0 0011111223455566555543334455566 Q ss_pred HHHHHHHHHHHhccccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccc-ccce Q lcl|NC_010808. 347 KDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANK-DFNT 424 (512) Q Consensus 347 ~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~-d~~~ 424 (512) .+...+.|+..-++|....+... ++-|.. + ......+...|..++..|...+..+-...... .-.. T Consensus 273 ~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~--e----------~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~~ 340 (411) T protein:vir:81 273 KKYTALQIAAAFGIKPNQINDYEKSSYASA--E----------AQNLAFYVDTLLYVLKQYEEEITYKILSNDLISQGHY 340 (411) T ss_pred HHHHHHHHHHHhCCCHHHhCCCCCCCchhH--H----------HHHHHHHHHHHHHHHHHHHHHHHhhcCChhhcCCCcE Confidence 77788899999999977665432 221111 0 11223445566666666655554322111110 0112 Q ss_pred eeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCC Q lcl|NC_010808. 425 VRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDD 502 (512) Q Consensus 425 i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (512) +++.+..-+-.|..+.++++.++ +|+++.-.++++++.-..+.. +..... ...... .. ..++.. T Consensus 341 ~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~gg--D~~~~~------~n~~pl---~~---~~~~~~ 406 (411) T protein:vir:81 341 FKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPADDYG--NNLMAN------GNYIPL---SM---LGANYG 406 (411) T ss_pred EEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC--Ceeeec------cCccch---hh---hhhhhc Confidence 34444555677888999998887 689999888888875322110 000000 000000 00 000001 Q ss_pred CCcCc Q lcl|NC_010808. 503 DTKDT 507 (512) Q Consensus 503 ~~~~~ 507 (512) .++|. T Consensus 407 kgGd~ 411 (411) T protein:vir:81 407 KGGDS 411 (411) T ss_pred cCCCC Confidence 11111 No 141 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=98.24 E-value=2.2e-06 Score=51.61 Aligned_cols=452 Identities=10% Similarity=0.013 Sum_probs=204.4 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHH-hhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESD-LLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR 79 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~-~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~ 79 (512) ||.+ ..+- -.+..+...+.+..++..-..+++.+.+|..-.. . ... T Consensus 1 m~~~-----------------------------~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~--~--~~~ 47 (535) T protein:vir:33 1 MADS-----------------------------KRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSL--F--PKE 47 (535) T ss_pred CChh-----------------------------hhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--c--CCC Confidence 1111 0000 1112333444555555555667777777754321 1 011 Q ss_pred cccccccceeeecchHHHHHHHHHhhhhcc--Cc----eecCCch--------------------hHHHHHHHHHhccCh Q lcl|NC_010808. 80 RKEEYMADNRVAHDYASYISDFINGYFLGN--PI----QCQDDDK--------------------DVLEAIEAFNDLNDV 133 (512) Q Consensus 80 ~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~--~~----~~~~~d~--------------------~~~~~l~~~~~~n~~ 133 (512) .........++--+-+...++.+++.|++- |. ++...+. .+...+...+..++| T Consensus 48 ~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf 127 (535) T protein:vir:33 48 SDNESTDYTTPWQAVGARGLNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSY 127 (535) T ss_pred CCcccccccccccccHHHHHHHHHHHHHHhhcCCCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCc Confidence 111111223444566777888888777652 21 1222211 122334455777889 Q ss_pred hHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeec------------cCCc Q lcl|NC_010808. 134 ESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID------------KTDE 201 (512) Q Consensus 134 ~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~------------~~~~ 201 (512) .....++.++..++|.+.+++-.+..+.++++.++-.+ |.+-.+. .+++...+|.++..... .... T Consensus 128 ~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~~-~~v~~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k 205 (535) T protein:vir:33 128 RVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYRLSS-YVVQRDA-YGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEK 205 (535) T ss_pred HHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEEcCe-eEEeeCC-CCCeeEEEeeEeecHHHHHHHhhhhhccccccc Confidence 99999999999999999888877766667777775444 4444333 45566666655433100 0000 Q ss_pred ceEEEEEEEc-----CCc-EEEEEecCCccccccccccccccccccccceEee-----cCCCCCCcchHHHHHHHHHHHH Q lcl|NC_010808. 202 DEVFTVDLFT-----SHG-VYRYLTSRTNGLKLTPRENGFESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDN 270 (512) Q Consensus 202 ~~~~~~~~yt-----~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~ 270 (512) +....+++|+ .+. -+.+.....+. .. .......+|..+|++.. ..+.+|+|..++..+-+..++. T Consensus 206 ~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~-~~---~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~ 281 (535) T protein:vir:33 206 KMDEMVDVYTHVYLDEESGDYLKYEEVEDV-EI---DGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLEN 281 (535) T ss_pred ccccCCeEEEEEEeeCCCCcEEEEEEEeCc-cc---cccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHH Confidence 0011112221 111 01111100000 00 01112224666776654 2456899999999999999999 Q ss_pred HHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEe--ecCCHHHHHHHHH Q lcl|NC_010808. 271 AESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIY--KQYDVQGTEAYKD 348 (512) Q Consensus 271 ~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~ 348 (512) +.-......+...+|.+.+.-........+. .++ ...+..+...+++.+. ...+.......++ T Consensus 282 l~~~~l~~~~~~~~p~~lv~~~g~~~~~~~~---~~~------------~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~ 346 (535) T protein:vir:33 282 LQEAIVKMSMISAKVIGLVNPAGITQPRRLT---KAQ------------TGDFVPGRREDIDFLQLEKQADFTVAKAVSD 346 (535) T ss_pred HHHHHHHHHHHHhcCceeeccccccchhhcc---cCC------------ceeeecCCcccceeeecccccchhHHHHHHH Confidence 9888888888888888664311111111110 011 1111112223333332 3345667777777 Q ss_pred HHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeE Q lcl|NC_010808. 349 RLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTK-TKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRY 427 (512) Q Consensus 349 ~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~-~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i 427 (512) .++..|...-. .+......+...+|..+........+... ...++-.+.|.-+++.++.++...+.... ..-..+++ T Consensus 347 ~~~~~I~~af~-~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~-~p~~~v~~ 424 (535) T protein:vir:33 347 QIEARLSYAFM-LNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPE-LPKEAVEP 424 (535) T ss_pred HHHHHHHHHHh-hhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC-CCccceeE Confidence 77777754322 22121122344677766654333322222 12222233344445555555554444332 22345778 Q ss_pred EeCCCCCcCH-HHHHHHH----HHHhccCC--------hHHHHHhC---CCCC-----CHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010808. 428 VYNRNLPKSL-IEELKAY----IDSGGKIS--------QTTLMSLF---SFFQ-----DPELEVKKIEEDEKESIKKAQK 486 (512) Q Consensus 428 ~f~~~~p~d~-~~~~~~~----~kl~g~~s--------~et~~~~~---~~v~-----d~~~E~~ri~~E~~~~~~~~~~ 486 (512) +|..++..-- .+.++.+ ..++++-| ...++..+ -+++ ..++|++++.+++++.....+. T Consensus 425 ~yis~La~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~q~~~~~~~~~~ 504 (535) T protein:vir:33 425 TISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGVENA 504 (535) T ss_pred EEecHHHHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHhHhcCCHHHHHHHHHHHHHHHHHHHH Confidence 8866554321 1112221 12222211 12222222 1222 2356666666665443333222 Q ss_pred hcccCCCCCCCCCCCCC---------CcCcc Q lcl|NC_010808. 487 GIYKDPRDINDDEQDDD---------TKDTV 508 (512) Q Consensus 487 ~~~~~~~~~~~~~~~~~---------~~~~~ 508 (512) ........+.......+ +=+.. T Consensus 505 ~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:33 505 AAAGGAGVGALATSSPEAMQGAAAKAGLNAT 535 (535) T ss_pred HHhhhhhhcchhhcCChhHHHHHHhccCCCC Confidence 22111111000000000 00000 No 142 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=98.22 E-value=2.4e-06 Score=51.38 Aligned_cols=450 Identities=10% Similarity=0.030 Sum_probs=199.9 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |+. .+-...+..+...+.++.++..-..+++.+.+|..-.-. . .. . T Consensus 1 ~~~------------------------------~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~-~--~~-~ 46 (522) T protein:vir:94 1 MAE------------------------------REGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLF-P--KE-S 46 (522) T ss_pred Ccc------------------------------cchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccc-C--CC-C Confidence 111 011111112222333333333335566677776543211 0 00 1 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhcc--C--c--eecCCc---------h-----------hHHHHHHHHHhccChh Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGN--P--I--QCQDDD---------K-----------DVLEAIEAFNDLNDVE 134 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~--~--~--~~~~~d---------~-----------~~~~~l~~~~~~n~~~ 134 (512) ........++.-+-+...++.+++.|++- | + ++...+ . ++...+...+..++|. T Consensus 47 ~~~~~~~~~~~dst~~~a~~~Las~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~ 126 (522) T protein:vir:94 47 DNSSTEYTTPWQAVGARCLNNLAAKLMLALFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFR 126 (522) T ss_pred CcccccccccccccHHHHHHHHHHHHHhhcCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcH Confidence 11112223455677778888888877652 1 1 122111 1 1223344456668899 Q ss_pred HHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeec----------cCCcce Q lcl|NC_010808. 135 SHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID----------KTDEDE 203 (512) Q Consensus 135 ~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~----------~~~~~~ 203 (512) ....++.++..++|.+.+++-.+.+|.+ .++.++-.+ |.+--+. .+++...+|.++..... ....+. T Consensus 127 ~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~~~pl~~-y~v~~d~-~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p 204 (522) T protein:vir:94 127 VPLFEALKQLIVSGNCLLYIPEPEQGTYSPMRMYRLVS-YVVQRDA-FGNILQIVTIDKVAFSALPEDVKSQLNADDYEP 204 (522) T ss_pred HHHHHHHHHHHhhCcEeEeeeccCCCceeeEEEEEcce-EEEeeCC-CcCeEEEeeeeeccHHhcchHHHHHHhcccCCc Confidence 9999999999999999988766665543 355555444 4444332 45666666655432110 011111 Q ss_pred EEEEEEEcC-----CcEEEEEecCCcccccccccccc-ccccccccceEee-----cCCCCCCcchHHHHHHHHHHHHHH Q lcl|NC_010808. 204 VFTVDLFTS-----HGVYRYLTSRTNGLKLTPRENGF-ESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDNAE 272 (512) Q Consensus 204 ~~~~~~yt~-----~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~ 272 (512) ...+++|+. ++...|.... + ...... ...+|..+|++.+ ..+.+|+|..+...+-+..++.+. T Consensus 205 ~~~v~v~~~v~~~~~~~~~~~~~~-g-----~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~ 278 (522) T protein:vir:94 205 DTELEVYTHIYRQDDEYLRYEEVE-G-----IEVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETIT 278 (522) T ss_pred cceEEEEEEEEeeCCceeEEeecc-C-----ceecccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHH Confidence 233444431 1111111110 0 011111 1234666786654 245689999999999999999999 Q ss_pred HHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEE--eecCCHHHHHHHHHHH Q lcl|NC_010808. 273 SDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYI--YKQYDVQGTEAYKDRL 350 (512) Q Consensus 273 s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l 350 (512) -......+...+|.+.+.-........+. .++ ...+..+...+++.+ ....+.......++.+ T Consensus 279 ~~~l~~~~~~~~p~~~v~~~g~~~~~~~~---~~~------------~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~ 343 (522) T protein:vir:94 279 EAITKMAKVASKVVGLVNPNGITQPRRLN---KAA------------TGEFVAGRVEDINFLQLTKGQDFTIAKSVADAI 343 (522) T ss_pred HHHHHHHHHHhCCceeecccccccchhee---ccC------------CceeecCCcccceeeecccccchhHHHHHHHHH Confidence 88888888888888765311111111111 000 001111222233333 2333566677777777 Q ss_pred HHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEe Q lcl|NC_010808. 351 NSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTK-TKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVY 429 (512) Q Consensus 351 ~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~-~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f 429 (512) +..|...-..-. .....+...||..+......+.+... ...++-.+.|.-+++..+.++...+..+.. .-..+++.+ T Consensus 344 ~~rI~~af~~~~-~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~-p~~~v~v~~ 421 (522) T protein:vir:94 344 EQRLGWAFLLNS-AVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPDL-PKEAVEPTV 421 (522) T ss_pred HHHHHHHHhhhh-hccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC-CcccEEeeE Confidence 777755432221 11122345677766654333332222 222223334444444444555444433221 123467777 Q ss_pred CCCCCcC-HHHHHHHHHH----Hhcc--------CChHHHH----HhCCC-CC---CHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_010808. 430 NRNLPKS-LIEELKAYID----SGGK--------ISQTTLM----SLFSF-FQ---DPELEVKKIEEDEKESIKKAQKGI 488 (512) Q Consensus 430 ~~~~p~d-~~~~~~~~~k----l~g~--------~s~et~~----~~~~~-v~---d~~~E~~ri~~E~~~~~~~~~~~~ 488 (512) ..++.+- ..+.++.+.. ++.+ +....++ ..+|. .. -.++|++.+.++++.....++... T Consensus 422 ~s~La~~qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~ivr~~ee~~~~~~q~~~~~~~~~~~~ 501 (522) T protein:vir:94 422 STGLEALGRGQDLEKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDTAGLLLTQDEKIQRMAEQSSQQAVVQGAS 501 (522) T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHH Confidence 5544321 1111111111 1221 1112222 22232 11 125666666665443332222221 Q ss_pred ccCCCCCCCCCCCCCCcCcccCC Q lcl|NC_010808. 489 YKDPRDINDDEQDDDTKDTVDKK 511 (512) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~ 511 (512) ..... ..........++-... T Consensus 502 ~~~~~--~~a~~~~~~~~~~~~~ 522 (522) T protein:vir:94 502 AAGAN--MGAAVGQGAGEDMAQA 522 (522) T ss_pred HHHHH--hhhhhhcccchhhhcC Confidence 11111 1111111222221112 No 143 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=98.21 E-value=2.6e-06 Score=51.25 Aligned_cols=452 Identities=10% Similarity=0.004 Sum_probs=205.5 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhH-HhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTES-DLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR 79 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~-~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~ 79 (512) ||.+. .+ .-.+..+...+.+..++..-..+++.+.+|..-.- . ... T Consensus 1 m~~~~-----------------------------~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~--~--~~~ 47 (535) T protein:vir:15 1 MADSK-----------------------------RTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSL--F--PKE 47 (535) T ss_pred CCccc-----------------------------hhccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--c--CCC Confidence 11111 00 01112333444455555555667777777754321 1 011 Q ss_pred cccccccceeeecchHHHHHHHHHhhhhcc--Cc----eecCCch--------------------hHHHHHHHHHhccCh Q lcl|NC_010808. 80 RKEEYMADNRVAHDYASYISDFINGYFLGN--PI----QCQDDDK--------------------DVLEAIEAFNDLNDV 133 (512) Q Consensus 80 ~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~--~~----~~~~~d~--------------------~~~~~l~~~~~~n~~ 133 (512) .........++--+-+...++.+++.|++- |. ++...+. .+...+...+..++| T Consensus 48 ~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf 127 (535) T protein:vir:15 48 SDNESTDYTTPWQAVGARGLNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSY 127 (535) T ss_pred CCcccccccccccccHHHHHHHHHHHHHHhhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCc Confidence 111112223455566777888888877652 21 1222111 122334455777899 Q ss_pred hHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeec------------cCCc Q lcl|NC_010808. 134 ESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID------------KTDE 201 (512) Q Consensus 134 ~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~------------~~~~ 201 (512) .....++.++..++|.+.+++-.+..+.++++.++-.+.+...|. .+++...+|.++..... .... T Consensus 128 ~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~~~~v~~d~--~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~ 205 (535) T protein:vir:15 128 RVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYRLSSYVVQRDA--YGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEK 205 (535) T ss_pred HHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEEcCeeEEeeCC--CCCeeEEEEeEeecHHHHHHHHhHhhhcccccc Confidence 999999999999999998887666666677777765554444343 45666666665433100 0011 Q ss_pred ceEEEEEEEcC-----Cc--EEEEEecCCccccccccccccccccccccceEee-----cCCCCCCcchHHHHHHHHHHH Q lcl|NC_010808. 202 DEVFTVDLFTS-----HG--VYRYLTSRTNGLKLTPRENGFESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYD 269 (512) Q Consensus 202 ~~~~~~~~yt~-----~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~ 269 (512) +....+++|+. +. +..|.. ..+. .. .......+|..+|++.. .++.+|+|..++..+-+..++ T Consensus 206 ~~~~~v~v~~~v~~~~~~~~~~~~~e-~~g~-~~---~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~ 280 (535) T protein:vir:15 206 KMDEMVDVYTHVYLDEESGDYLKYEE-VEDV-EI---DGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLE 280 (535) T ss_pred CCCCceeEEEEEEEecCCCcEEEEEE-eeCc-cc---cccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHH Confidence 11112333331 11 111111 0000 00 00112234566776654 345689999999999999999 Q ss_pred HHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEe--ecCCHHHHHHHH Q lcl|NC_010808. 270 NAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIY--KQYDVQGTEAYK 347 (512) Q Consensus 270 ~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~ 347 (512) .+.-......+...+|.+.+.-........+. .+ ....+..+...+++.+. ...+.......+ T Consensus 281 ~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~l~---~~------------~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i 345 (535) T protein:vir:15 281 NLQEAIVKMSMISAKVIGLVNPAGITQPRRLT---KA------------QTGDFVPGRREDIDFLQLEKQADFTVAKAVS 345 (535) T ss_pred HHHHHHHHHHHHHhcCceeecccccccchhcc---cC------------CceeeecCCcccceeeecccccchhHHHHHH Confidence 99888888888888888664311111111110 01 11111112223333332 334567777777 Q ss_pred HHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceee Q lcl|NC_010808. 348 DRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTK-TKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVR 426 (512) Q Consensus 348 ~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~-~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~ 426 (512) +.++..|...-. .+......+...+|..+........+... ...++-.+.|.-+++.++.++...+.... ..-..++ T Consensus 346 ~~~~~~I~~af~-~~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~-~p~~~v~ 423 (535) T protein:vir:15 346 DQIEARLSYAFM-LNSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPE-LPKEAVE 423 (535) T ss_pred HHHHHHHHHHHh-hhhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC-CCcccee Confidence 777777754322 22121122344677766654333332222 12222233344445455555554444332 2234577 Q ss_pred EEeCCCCCcCH-HHHHHHH----HHHhccCC--------hHHHHHhC---CCCC-----CHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 427 YVYNRNLPKSL-IEELKAY----IDSGGKIS--------QTTLMSLF---SFFQ-----DPELEVKKIEEDEKESIKKAQ 485 (512) Q Consensus 427 i~f~~~~p~d~-~~~~~~~----~kl~g~~s--------~et~~~~~---~~v~-----d~~~E~~ri~~E~~~~~~~~~ 485 (512) ++|..++..-- .+.++.+ ..++++-| ...++..+ -+++ ..++|++++.+++++.....+ T Consensus 424 ~~yis~La~aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~eev~~~~~q~~~~~~~~~ 503 (535) T protein:vir:15 424 PTISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGIEN 503 (535) T ss_pred EEEecHHHHHHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHH Confidence 77765554321 1112221 22222212 12222222 1222 235666666655444333222 Q ss_pred hhcccCCCCCCCCCCCCC--------CcCccc Q lcl|NC_010808. 486 KGIYKDPRDINDDEQDDD--------TKDTVD 509 (512) Q Consensus 486 ~~~~~~~~~~~~~~~~~~--------~~~~~~ 509 (512) .........+.......+ -+-+.. T Consensus 504 ~a~~~g~~~~~~~~~~p~~~~~~~~~~g~~~~ 535 (535) T protein:vir:15 504 AAATGGAGVGALATSSPEAMQGAAAQAGLDAT 535 (535) T ss_pred HHHHHHhhccchhccChHHHHHHHhccCCCCC Confidence 221111110000000000 000000 No 144 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=98.21 E-value=2.6e-06 Score=51.20 Aligned_cols=384 Identities=11% Similarity=0.063 Sum_probs=174.0 Q ss_pred HHHHHHHhcccccc---------c---ccccccccccccceeeecchHHHHHHHHHhhhhccCceecCC--ch-hHHHHH Q lcl|NC_010808. 60 LKVLSDYYEGKTKN---------L---VELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDD--DK-DVLEAI 124 (512) Q Consensus 60 ~~~~~~yy~G~~~~---------~---~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~--d~-~~~~~l 124 (512) +..+.+.+.+.... + ............+.-+.+......|+.+++-+..-|+.+--. +. .....+ T Consensus 1 Mgl~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~l 80 (409) T protein:vir:84 1 MSLFTRIFSGPSEERTLTKISGIPSPAEDWAMHGDRPGANSAMTLGAFYACVTLLADTVASLSIDAYRKKDNVRIPVSPA 80 (409) T ss_pred CchhhhhhcCCCcccccccccccccccchhhccCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcccccchH Confidence 11122222221100 0 000000000000011223345566777777777777754211 11 111223 Q ss_pred HHHHh-----ccChhHHHHHHHHHHHhCCeEEEEE-EECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeec Q lcl|NC_010808. 125 EAFND-----LNDVESHNRSLGLDLSIYGKAYELM-IRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID 197 (512) Q Consensus 125 ~~~~~-----~n~~~~~~~~~~~~~~~~G~a~~~v-~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~ 197 (512) .+++. .-....+...+..+.+.+|.+|+++ +.+..|++ .+..++|..+.+.......... ++..... T Consensus 81 ~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~~~~~~-----~~~~~~~- 154 (409) T protein:vir:84 81 PKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKDEDGDW-----IEPVYRI- 154 (409) T ss_pred HHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCCCcceE-----EEEEecC- Confidence 44442 1244567777888999999999876 45667765 5777888887665432211110 1111000 Q ss_pred cCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 198 KTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTAN 277 (512) Q Consensus 198 ~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~ 277 (512) .. ..|.++.++++.... ......|.|.++.+...++....+..-..+ T Consensus 155 --~g------~~~~~~dvih~~~~~-------------------------~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~ 201 (409) T protein:vir:84 155 --DG------KVVPNHRIMHIKRYP-------------------------VAGCALGMSPIEKAASAIGLGLAAERYGLR 201 (409) T ss_pred --Cc------eEEchhhEEEecCCC-------------------------CCcccccccHHHHHHHHHHHHHHHHHHHHH Confidence 00 123344444432210 001124778787777777666655555555 Q ss_pred HHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 278 YMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMF 357 (512) Q Consensus 278 ~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~ 357 (512) .+.-...|-.+++.....+++..+..++...- . ..+.....-.+++.+++-++.......+.+..+...+.|+.. T Consensus 202 ~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~---~--~~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~ 276 (409) T protein:vir:84 202 WFRDSANPSGILSSDADLTPDQVKQTQKQWIQ---S--HHNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMW 276 (409) T ss_pred HHhcCCCccEEEecCCCCCHHHHHHHHHHHHH---H--hccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHH Confidence 55656677777765444555555544432211 1 111111122344555555544333444556667778889998 Q ss_pred hcccccccccccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcC Q lcl|NC_010808. 358 TNTPNMKDDNFSG-TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKS 436 (512) Q Consensus 358 s~~p~~~~~~~~~-n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d 436 (512) -++|....+...+ +.++..++.... ..+...|..+++.|...++..-. .-..|++.+..-+-.| T Consensus 277 fgVPp~~lg~~~~~~~~~sn~e~~~~----------~f~~~~l~P~~~~ie~~l~~~L~-----~g~~i~fd~~~l~~~d 341 (409) T protein:vir:84 277 FRIPPHMIGDVEKSTSWGTGIEEQGI----------NFVRHTLLPWLRCIEQALDTFLP-----RGQFVKFNVDGLMRGD 341 (409) T ss_pred hCCCHHHhCCCCCcccccchHHHHHH----------HHHHHHHHHHHHHHHHHHHHhcc-----CCCeEEEechhhhccC Confidence 8999776654332 222222222211 11233333334334333332210 0123455555666778 Q ss_pred HHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 437 LIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 437 ~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ..+.++++.++ +|+++.-.+++.++.-.-+. -+... ......... ...+.+.. ++.+...+.+..| T Consensus 342 ~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~g--gD~~~------~~~n~~~~~-~~~~~~~~-~~~~~~~~~~gn~ 409 (409) T protein:vir:84 342 VTARFTAYQMGLQNGIWSVNEVRAWEDAPPIPE--GDIHL------QPMNFVPLG-YVPPEEPA-QEPQPNSATEGNK 409 (409) T ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cceee------ecccccccc-cCCccccC-cCCCCCCccCCCC Confidence 89999988887 68999988888886532111 00000 000000111 11111111 1111111112222 No 145 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=98.19 E-value=2.9e-06 Score=50.96 Aligned_cols=451 Identities=10% Similarity=0.026 Sum_probs=191.3 Q ss_pred ccchhH-HhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhcc Q lcl|NC_010808. 31 YDGTES-DLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGN 109 (512) Q Consensus 31 ~~~~~~-~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~ 109 (512) +.-+.+ ...+..+...+.++.++..-..+++.+.+|..-.-.. . ..........|+.-+-+...++.+++.|++- T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~---~-~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 76 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFP---K-DSDNASTDYQTPWQAVGARGLNNLASKLMLA 76 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccC---C-CCCcccccccccccccHHHHHHHHHHHHHhh Confidence 111111 1111223333333333333355666777775432110 0 1111112223556677788888888877652 Q ss_pred --Cc----eecCCch--------------------hHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceE Q lcl|NC_010808. 110 --PI----QCQDDDK--------------------DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETR 163 (512) Q Consensus 110 --~~----~~~~~d~--------------------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~ 163 (512) |. ++...+. .....+...+..++|.....++.++..++|.+.+++-.+.++.++ T Consensus 77 ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~ 156 (536) T protein:vir:10 77 LFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYN 156 (536) T ss_pred hcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCcee Confidence 21 1222211 122345556677889999999999999999988877554444333 Q ss_pred -EEEEccceeEEEEeCCCCceeEEEEEEeeeeee------------ccCCcceEEEEEEEcC------C-cEEEEEecCC Q lcl|NC_010808. 164 -LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPI------------DKTDEDEVFTVDLFTS------H-GVYRYLTSRT 223 (512) Q Consensus 164 -i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~------------~~~~~~~~~~~~~yt~------~-~~~~~~~~~~ 223 (512) ++.++-.+ |.+-.+. .+++...+|.++.... .....+....+++|+. + ....|.. .. T Consensus 157 ~~~~~pl~~-~~v~~d~-~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~~~e-~~ 233 (536) T protein:vir:10 157 PMKLYRLSS-YVVQRDA-FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEASGEYLRYEE-VE 233 (536) T ss_pred eEEEEEcCe-EEEeeCC-CCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecCCCcEEEEEe-ec Confidence 44544444 4443332 4566666665443310 0011111122333321 1 1111111 00 Q ss_pred ccccccccccccccccccccceEeec-----CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeee-ecCCcCCh Q lcl|NC_010808. 224 NGLKLTPRENGFESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI-KGNLSLDP 297 (512) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~vPvv~~~-----n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~-~g~~~~~~ 297 (512) + ..+ .......+|..+|++.++ .+.+|+|..+...+-+..++.+.-...........|.+.+ .+- .... T Consensus 234 g-~~v---~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g-~~~~ 308 (536) T protein:vir:10 234 G-MEV---QGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAG-ITQP 308 (536) T ss_pred C-ccc---cccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCccc-ccch Confidence 0 000 011122356677776543 4568999999999999999987666666555555544332 111 0111 Q ss_pred hhhhhhhhccccccchhhhhhcccccCCCCCccee--EEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHH Q lcl|NC_010808. 298 DEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGG--YIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGE 375 (512) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~ 375 (512) ..+. .++ .+.+..+...+++ .+....+.......++.++..|...-..-. .....+...||. T Consensus 309 ~~~~---~~~------------~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-l~~~~~~r~TAt 372 (536) T protein:vir:10 309 RRLT---KAQ------------TGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGERVTAE 372 (536) T ss_pred hhhc---cCC------------CcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhh-cccCCCCCccHH Confidence 1110 010 0001111112222 233344556666777777776644332211 111223446777 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHH-------HHH Q lcl|NC_010808. 376 AMKYKLFGLEQRTKT-KEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAY-------IDS 447 (512) Q Consensus 376 Ai~~~~~~l~~k~~~-~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~-------~kl 447 (512) .+......+.+...- ..++-.+.|.-+++.++.++...+..... .-..+++.+..++. .....+.+ ..+ T Consensus 373 EV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~-p~~~v~~~~vs~l~--~l~r~~~~~~l~~~~~~l 449 (536) T protein:vir:10 373 EIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPEL-PKEAVEPTISTGLE--AIGRGQDLDKLERCVTAW 449 (536) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCC-ChhhccceEEecHH--HHHHHHHHHHHHHHHHHH Confidence 766653333332221 22222333343444445555444433211 11234555544432 22222222 112 Q ss_pred hccC--------ChHHHHH----hCCCCC----CHHHHHHHHHHHHHHHHHHHHhhcc-cCCCCCCCCCCC---CCCcCc Q lcl|NC_010808. 448 GGKI--------SQTTLMS----LFSFFQ----DPELEVKKIEEDEKESIKKAQKGIY-KDPRDINDDEQD---DDTKDT 507 (512) Q Consensus 448 ~g~~--------s~et~~~----~~~~v~----d~~~E~~ri~~E~~~~~~~~~~~~~-~~~~~~~~~~~~---~~~~~~ 507 (512) +++- ....++. .+|..+ -.++|++++.++++......+.... ............ ..--+. T Consensus 450 a~~~P~~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 529 (536) T protein:vir:10 450 AALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADS 529 (536) T ss_pred HhhchhhhcccCCHHHHHHHHHHHcCCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhHHhhhhc Confidence 2221 2222332 233211 2457777777665443332221111 000000000000 001111 Q ss_pred ccCCC Q lcl|NC_010808. 508 VDKKE 512 (512) Q Consensus 508 ~~~~e 512 (512) ..-++ T Consensus 530 ~g~~~ 534 (536) T protein:vir:10 530 VGLQP 534 (536) T ss_pred cccCC Confidence 11111 No 146 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=98.18 E-value=3e-06 Score=50.89 Aligned_cols=398 Identities=11% Similarity=0.025 Sum_probs=188.4 Q ss_pred hccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecch Q lcl|NC_010808. 15 ENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDY 94 (512) Q Consensus 15 ~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~ 94 (512) -..+++|.+...-. +... .-...+..+|-|.... ........ .-+.... T Consensus 1 m~~~~~f~~~~~~~-----------~~~~--------------~~~~~~~~~~~~~~~~---~~~~v~~~---~al~~~~ 49 (416) T protein:vir:12 1 MLLERMFEKRSGSS-----------DHED--------------GFNNILLNMFGGRKTA---SGERVSES---NSLVQPD 49 (416) T ss_pred CccchhcccccCcc-----------ccCc--------------cchhHHHHhhcCcccc---cCceechh---hhhccHH Confidence 23344443332110 0000 0011122344332211 00100000 1122344 Q ss_pred HHHHHHHHHhhhhccCcee-cCCch---h-HHHHHHHHH-h-cc---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-E Q lcl|NC_010808. 95 ASYISDFINGYFLGNPIQC-QDDDK---D-VLEAIEAFN-D-LN---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-R 163 (512) Q Consensus 95 ~~~iv~~~a~~l~g~~~~~-~~~d~---~-~~~~l~~~~-~-~n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~ 163 (512) ....|+.+++-+..-|+.+ ...++ . ....+..++ . -| ....+...+..+.+.+|.||+++.++..|.+ . T Consensus 50 v~~~i~~Ia~~ia~l~~~~~~~~~~~~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~ 129 (416) T protein:vir:12 50 IFACVNVLSDDIAKLPIHTYKRTDGGIERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEA 129 (416) T ss_pred HHHHHHHHHHhhhhCceEEEEecCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEE Confidence 5566777777776777654 11111 1 111233333 2 13 3446677888899999999999999888876 4 Q ss_pred EEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccccc Q lcl|NC_010808. 164 LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERM 243 (512) Q Consensus 164 i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 243 (512) +..++|..+.++.++.. .. .+|.+. .. + . .+ .+.+..+++++... T Consensus 130 L~~l~~~~v~v~~~~~~-~~-----~~~~~~-~~-g---~--~~-~~~~~eiih~~~~~--------------------- 174 (416) T protein:vir:12 130 LFPLRPDYTNAYVHPTT-GM-----LWYQTV-LN-G---K--AI-ELYDYEVLHFKGLS--------------------- 174 (416) T ss_pred EEEECCcceEEEEeCCC-cE-----EEEEEe-cC-C---e--EE-EecCccEEEecCcC--------------------- Confidence 77889988887765432 11 122211 01 0 0 11 23344444442110 Q ss_pred ceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhccccc Q lcl|NC_010808. 244 PITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGI 323 (512) Q Consensus 244 Pvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (512) .+...|.|.+..+...++....+..-..+.++-.+.|-.+++-....+++..+..+..-. ......... T Consensus 175 -----~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~------~~~~~~~~~ 243 (416) T protein:vir:12 175 -----TDGIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPAFLDEKPKENVRKEWK------RVNKVENIA 243 (416) T ss_pred -----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHHHHHH------HHhcCCCee Confidence 012247788887777777766666666666666677777776443444544444433210 001111122 Q ss_pred CCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 324 ETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRR 402 (512) Q Consensus 324 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~ 402 (512) ..+++.+++.++.......+.+..+.....|+..-++|....+... ++-|... ......+...|..+ T Consensus 244 vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e------------~~~~~f~~~~l~P~ 311 (416) T protein:vir:12 244 IIDYGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIE------------HQSIEYVRNTLQPW 311 (416) T ss_pred ecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHH------------HHHHHHHHHHHHHH Confidence 2345556666654444445556677778889888899877665432 2222111 11122345556666 Q ss_pred HHHHHHHHHhccCCCccc-ccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC--CHHHHHHHHHHHH Q lcl|NC_010808. 403 AKLLETILKNTRSIDANK-DFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQ--DPELEVKKIEEDE 477 (512) Q Consensus 403 ~~li~~~l~~~~~~~~~~-d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~--d~~~E~~ri~~E~ 477 (512) ++.|...++..-...... ....+++.+..-+..|..+.++++.++ .|+++.-.++++++.-. +-+.-+.. T Consensus 312 ~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ggd~~~~~----- 386 (416) T protein:vir:12 312 IVNFEQELNVKLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIENGDKYISS----- 386 (416) T ss_pred HHHHHHHHHHhhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeec----- Confidence 655555554322111111 011244444555778999999998887 68999999888886422 11100000 Q ss_pred HHHHHHHHhhcccCCCC-CCCCCCCCCCcCcccCC Q lcl|NC_010808. 478 KESIKKAQKGIYKDPRD-INDDEQDDDTKDTVDKK 511 (512) Q Consensus 478 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 511 (512) ............ ....+.+..++|..++| T Consensus 387 -----~n~~~~~~~~~~~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 387 -----LNYVFLDFLEEYQRLKAGGAMKGGDNKNEG 416 (416) T ss_pred -----cccccccccchhhccccccccCCCCCcCCC Confidence 000000000000 00011111222333333 No 147 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=98.18 E-value=3e-06 Score=50.88 Aligned_cols=397 Identities=11% Similarity=0.036 Sum_probs=177.8 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCCc- Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDD- 117 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d- 117 (512) ....+.| ..++..........+..++.+..... ........ .-+...-....|+.+++-+.+-|+.+-..+ T Consensus 1 Mg~f~~l---f~r~~~~~~~~~~~~~~~~~~~~~~~--~g~~v~~~---~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~ 72 (414) T protein:vir:44 1 MVFFSGL---FQRKSDAPVTTPAELADAIGLSYDTY--TGKQISSQ---RAMRLTAVFSCVRVLAESVGMLPCNLYHLNG 72 (414) T ss_pred Cchhhhh---hccCccCcccchhhHhHhhccCcccc--CCceechh---hhhccHHHHHHHHHHHHHhccCceEEEEecC Confidence 1111111 00000000000111112211111000 00000000 011223345556667776666676542111 Q ss_pred ----hhHHHHHHHHHh-----ccChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEE Q lcl|NC_010808. 118 ----KDVLEAIEAFND-----LNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAG 187 (512) Q Consensus 118 ----~~~~~~l~~~~~-----~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~ 187 (512) ......+..++. ......+...+....+.+|.||+++..+ .|++ .+..++|..+.+.++++ +.+. T Consensus 73 ~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~--~~~~-- 147 (414) T protein:vir:44 73 SLKQRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPKLNSS--WEPV-- 147 (414) T ss_pred CceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEEECCC--CcEE-- Confidence 111223344432 2245567777888899999999998765 5766 57788899988887653 2221 Q ss_pred EEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHH Q lcl|NC_010808. 188 VRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDL 267 (512) Q Consensus 188 v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa 267 (512) |.+....+ ....+.++.+++++... + +...|.|.+..+...++. T Consensus 148 ---y~~~~~~g-------~~~~~~~~evih~~~~~-----------------~---------d~~~G~s~i~~~~~~i~~ 191 (414) T protein:vir:44 148 ---YQVTFPDG-------STDVLSQEDIWHVRTLT-----------------L---------DGLVGLNPIAYAREAISL 191 (414) T ss_pred ---EEEEecCc-------eEEEEccccEEEecCCC-----------------C---------CCcccccHHHHHHHHHHH Confidence 11111110 01124444554432110 0 112477777777777766 Q ss_pred HHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHH Q lcl|NC_010808. 268 YDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYK 347 (512) Q Consensus 268 ~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 347 (512) ...+..-..+.+.-.+.|-.++......+++..+..+..-.-.... . .+.......+++.+++.++.+.....+.+.. T Consensus 192 ~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g-~-~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~ 269 (414) T protein:vir:44 192 AAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKDFEERHTG-L-GNAHRPMILEMGLDWKSMALNAEDSQFLETR 269 (414) T ss_pred HHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcC-c-cccCcceecCCCceEEEccCChHHHHHHHHH Confidence 6665555555566666677766654444555444433321111100 0 0111111224455555555433344455667 Q ss_pred HHHHHHHHHHhccccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceee Q lcl|NC_010808. 348 DRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVR 426 (512) Q Consensus 348 ~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~ 426 (512) +.....|+..-++|....+... ++-|. ++ ......+...|+.+++.|...++.+-..........++ T Consensus 270 ~~~~~~Ia~~fgVpp~~l~~~~~~t~~n--~e----------~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~ 337 (414) T protein:vir:44 270 KFQLEEICRLFRVPLHMVQNTDRATFNN--IE----------ELGLGFINYSLVPYLTRIEQRINTGLVRKSKQGVFYAK 337 (414) T ss_pred HHHHHHHHHHhCCCHHHhCCCCCCCccc--HH----------HHHHHHHHHHHHHHHHHHHHHHHhhcCCccccCceEEE Confidence 7777888888889876654332 12121 11 11123345556666665555554332111111111233 Q ss_pred EEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCC Q lcl|NC_010808. 427 YVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDT 504 (512) Q Consensus 427 i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (512) +.+..-+..|..+.++++.++ +|+++.-.++++++.-.-+. -+.. ........ ...+..+..+. T Consensus 338 fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g--gD~~------~~~~n~~~------~~~~~~~~~~~ 403 (414) T protein:vir:44 338 FNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPG--GDVY------LTPMNMTT------KPSDGSKAGKQ 403 (414) T ss_pred EechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--ccee------cccccccc------cCCccccCCCC Confidence 334455567888999998887 68999999998886532110 0000 00000000 00011111112 Q ss_pred cCcccCCC Q lcl|NC_010808. 505 KDTVDKKE 512 (512) Q Consensus 505 ~~~~~~~e 512 (512) +++.+.+| T Consensus 404 ~~~~~~d~ 411 (414) T protein:vir:44 404 KDNANADE 411 (414) T ss_pred CCCCCCCC Confidence 22222222 No 148 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=98.16 E-value=3.3e-06 Score=50.61 Aligned_cols=430 Identities=13% Similarity=0.102 Sum_probs=161.2 Q ss_pred CCccee-eccccchh--hccc------------cccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 1 MLKANE-FETDTDLR--ENRN------------YLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSD 65 (512) Q Consensus 1 ~~~~~~-~~~~~~~~--~~~~------------~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~ 65 (512) |+.+-+ |-..+.-. -..+ +++....+..|...+......++...++ .. T Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~p~~~~~~~~~~~~l-----------------~~ 89 (576) T protein:vir:96 27 IDDGLQANIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTNPEFRTKRSYMKNSDNLHDVL-----------------KQ 89 (576) T ss_pred cccChhHHHHHhhhhhhhhccccCCccchhhcceeeeeecCCCccccCcchhhhhhhHHHH-----------------HH Confidence 433111 00000000 0000 0000000000000000000000000000 00 Q ss_pred HhcccccccccccccccccccceeeecchHHHHHHHHHhhhh-------------ccCceecC-----Cchh--HHHHHH Q lcl|NC_010808. 66 YYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL-------------GNPIQCQD-----DDKD--VLEAIE 125 (512) Q Consensus 66 yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~-------------g~~~~~~~-----~d~~--~~~~l~ 125 (512) |- ..++...+|+..+.-+. +=++.... .+.+ ....+. T Consensus 90 ~~-----------------------~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~ 146 (576) T protein:vir:96 90 FG-----------------------NNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIE 146 (576) T ss_pred hh-----------------------cCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHH Confidence 00 01122222222222111 11111110 0111 111122 Q ss_pred HHH----hc-c----ChhHHHHHHHHHHHhCCeEEEEEEECCC--Cce-EEEEEccceeEEEEeCCCCceeEEEEEEeee Q lcl|NC_010808. 126 AFN----DL-N----DVESHNRSLGLDLSIYGKAYELMIRNQD--DET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRT 193 (512) Q Consensus 126 ~~~----~~-n----~~~~~~~~~~~~~~~~G~a~~~v~~d~~--g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~ 193 (512) .++ .. | .+..+...+..+.+.+|.+|+++..+.+ |++ .+..++|..+.++.+... .......+++.. T Consensus 147 ~~l~~~~~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg-~~~~~~~~~~~~ 225 (576) T protein:vir:96 147 NFILNTGRDKDIDRDSFQSFCRKIVRDTYTYDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNG-KIIKGGKRFVQV 225 (576) T ss_pred hhHhhccCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCC-ceeeeeeEEEEe Confidence 222 11 1 3456778888899999999998876555 443 577789999888876532 111111122111 Q ss_pred eeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHH Q lcl|NC_010808. 194 KPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (512) Q Consensus 194 ~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s 273 (512) . .. .....+.++.++++..... + .......|.|.++.+...+.....+.. T Consensus 226 ~-----~~---~~~~~~~~~dii~~~~~~~--------------------~--d~~~~~~G~Spi~~a~~~i~~~~~~~~ 275 (576) T protein:vir:96 226 I-----NK---KVVASFTSREMAMGIRNPR--------------------T--ELSSSGYGLSEVEIAMKQFIAYNNTET 275 (576) T ss_pred c-----CC---ceEEEecccceEEEeecCC--------------------C--CcccCcccccHHHHHHHHHHHHHHHHH Confidence 0 00 0111223333333222100 0 000122477777777766666665555 Q ss_pred HHHHHHHHhcCceeeee--cCCcCChhhhhhhhhccccccchhhhhhccc--ccCCCCCcceeEEeecCCHHHHHHHHHH Q lcl|NC_010808. 274 DTANYMSDLNDAMLLIK--GNLSLDPDEVKKQKEANVLFLEPTVYENRDT--GIETEGSVDGGYIYKQYDVQGTEAYKDR 349 (512) Q Consensus 274 ~~~~~~~~~~~~~lv~~--g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 349 (512) -..+.+.-.+.|-.++. |....+++..+..+..-.-... ...... ....+++.++.-++.......+.+..+. T Consensus 276 ~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~---G~~nag~~p~vl~~G~~~~~ls~~~~d~qfle~~~~ 352 (576) T protein:vir:96 276 FNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFS---GINGSWQVPVVMADDIKFVNMTPTANDMQFEKWLTY 352 (576) T ss_pred HHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhc---cccccccceeecCCCceEEeccCChhhHHHHHHHHH Confidence 55555555556654443 4333344444443322110000 000111 1223455566666555556666777888 Q ss_pred HHHHHHHHhccccccccccccc-chH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccce Q lcl|NC_010808. 350 LNSDIHMFTNTPNMKDDNFSGT-QSG----EAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNT 424 (512) Q Consensus 350 l~~~i~~~s~~p~~~~~~~~~n-~Sg----~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~ 424 (512) ..+.|+..-++|....+...+. .+| .++.+. .. -......+...|..+++.|...++..-.. .+ ... T Consensus 353 ~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~s--n~---e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~--~~-~~~ 424 (576) T protein:vir:96 353 LINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEA--DP---GKKQQQSQNKGLQPLLRFIEDLINTHIIS--EY-SDK 424 (576) T ss_pred hHHHHHHHhCCCHHHccccccccccccccccccccc--cH---HHHHHHHHHHHHHHHHHHHHHHHHhhhch--hc-cCc Confidence 8889999999997666532211 111 111100 00 11122334445555555555444432111 11 124 Q ss_pred eeEEeCCCCCcCHHHHHHHHHHH-hccCChHHHHHhCCCC--CCHHHH-----HHHHH----HH------HHHHHHHHHh Q lcl|NC_010808. 425 VRYVYNRNLPKSLIEELKAYIDS-GGKISQTTLMSLFSFF--QDPELE-----VKKIE----ED------EKESIKKAQK 486 (512) Q Consensus 425 i~i~f~~~~p~d~~~~~~~~~kl-~g~~s~et~~~~~~~v--~d~~~E-----~~ri~----~E------~~~~~~~~~~ 486 (512) +.+.|.+.-+.+..+..+..... .|+++.-.++++++.- ++-+.- +..+. ++ +++....... T Consensus 425 ~~~~f~r~d~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~ 504 (576) T protein:vir:96 425 YVFQFVGGDTKSELDKIKILQEEVKTYKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDTKQKERFDMIQQ 504 (576) T ss_pred eEEEeccCCHHHHHHHHHHHHHHhcCccCHHHHHHHhCCCCCCCcceeccccccccccccccCCCCCCcccccccccccc Confidence 56678776655555555443332 5899998888887542 110000 00000 00 0000000000 Q ss_pred h-cccCC--CCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 487 G-IYKDP--RDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 487 ~-~~~~~--~~~~~~~~~~~~~~~~~~~e 512 (512) . ....+ +.....+...++++..|.++ T Consensus 505 ~~~~~~~~~~~~~s~~~~~~g~~~~~~~~ 533 (576) T protein:vir:96 505 FLNSPDDEEPQQESTEDKVDGRESNDPTK 533 (576) T ss_pred ccCCCCCCCCCCCCCCCcccccccccCCC Confidence 0 00000 00000011111111111111 No 149 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=98.14 E-value=3.8e-06 Score=50.32 Aligned_cols=452 Identities=10% Similarity=0.069 Sum_probs=176.2 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhc------HHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQN------INEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNL 74 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~------~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~ 74 (512) |-+ -.++.-.+ ..+.-+++.+- .+...+.+.. .+...-..+.+.-+++.... T Consensus 1 ~~~----------------~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 58 (574) T protein:vir:80 1 MPK----------------WLDKALGI---EKSSIEETRNMENYKMHLREIDTNVVN---NEPYSMESIEKGMNGKTTAY 58 (574) T ss_pred Ccc----------------hhhhhhcc---chhhHHHHHhhhhhccccchhhhhhhh---ccCCCHHHHHHhHhhhcccc Confidence 100 00000000 00000000000 0000000000 00000111222222222111 Q ss_pred cc---------------cccccccccc-cee-e-ecchHHHHHHHHHhhhh-----------ccCceecCC--------- Q lcl|NC_010808. 75 VE---------------LTRRKEEYMA-DNR-V-AHDYASYISDFINGYFL-----------GNPIQCQDD--------- 116 (512) Q Consensus 75 ~~---------------~~~~~~~~~~-~~r-i-~~n~~~~iv~~~a~~l~-----------g~~~~~~~~--------- 116 (512) .. +...+....+ ..+ + .......++++.++.++ +-|+.+-.. T Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~ 138 (574) T protein:vir:80 59 MQPIIGEMSVNPGYKTKPSIRNSQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSH 138 (574) T ss_pred cchhhhhccccccccCcCccCCcccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccch Confidence 00 0000000000 000 1 12334444454443322 233333111 Q ss_pred chhHHHHHHHHHhcc---------ChhHHHHHHHHHHHhCCeEEEEEEECCCCceE-EEEEccceeEEEEeCCCCceeEE Q lcl|NC_010808. 117 DKDVLEAIEAFNDLN---------DVESHNRSLGLDLSIYGKAYELMIRNQDDETR-LYKSDAMSTFVIYDNTIERNSIA 186 (512) Q Consensus 117 d~~~~~~l~~~~~~n---------~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~-i~~~~p~~~~~i~d~~~~~~~~~ 186 (512) .......|.+++... .+..+...+..+.+.+|.+|+.+-.+.+|+|. +..++|..+.+..+... ..... T Consensus 139 ~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~G~~~~L~pl~p~~V~v~~d~~~-~~~~~ 217 (574) T protein:vir:80 139 DIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKVFDKDGNFIKFDTVDPTTIFLATNGEG-KLIKN 217 (574) T ss_pred hhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcc-ccccC Confidence 111223455555321 23457777888899999999998888888864 67789999888765432 11111 Q ss_pred EEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHH Q lcl|NC_010808. 187 GVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLID 266 (512) Q Consensus 187 ~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liD 266 (512) ..+||.... + . ....+.++.+++++.... ........|.|.++.+...|+ T Consensus 218 ~~~y~~~~~--g---~---~~~~~~~~eiih~~~~~~----------------------~~~~~~~~G~spi~~a~~~i~ 267 (574) T protein:vir:80 218 GERFVQVID--N---R---IVAKFNERELAFAVRNPR----------------------ADIEVGQYGYPELEIALKQFI 267 (574) T ss_pred ceEEEEEeC--C---c---eEEEEccccEEEEeccCC----------------------CCcccccccccHHHHHHHHHH Confidence 223332211 0 0 112234444444432110 000012257788877777777 Q ss_pred HHHHHHHHHHHHHHHhcCceeeee--cCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHH Q lcl|NC_010808. 267 LYDNAESDTANYMSDLNDAMLLIK--GNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTE 344 (512) Q Consensus 267 a~~~~~s~~~~~~~~~~~~~lv~~--g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 344 (512) ....+..-..+.+.-.+.|-.++. +....+++....++..-.-... +..-.....+..+++.++.-++.......+. T Consensus 268 ~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~-G~~n~g~~~vl~~~G~~~~~l~~s~~D~qfl 346 (574) T protein:vir:80 268 AHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLA-GINGSWQIPVVSAEDVKFVNMTPSANDMQFE 346 (574) T ss_pred HHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhc-cccccccceeecCCCceEEEccCChhHHHHH Confidence 666655555555555566665443 3333344444433322110000 0000000112223455655555444455566 Q ss_pred HHHHHHHHHHHHHhcccccccccccccc-hHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccc Q lcl|NC_010808. 345 AYKDRLNSDIHMFTNTPNMKDDNFSGTQ-SGEAMKYK-LFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF 422 (512) Q Consensus 345 ~~~~~l~~~i~~~s~~p~~~~~~~~~n~-Sg~Ai~~~-~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~ 422 (512) ...+.....|+..-++|....+...... .|...... +... -......+...|.-+++.|...++..-.. ... T Consensus 347 e~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~---E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~--~~~- 420 (574) T protein:vir:80 347 KWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNS---KEKMQASQNKGLQPLLRFIEDTVNTYIVA--EFG- 420 (574) T ss_pred HHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhH---HHHHHHHHHHHHHHHHHHHHHHHHhhhhh--hcC- Confidence 6777788889999899877665322111 11100000 0000 01112333445555555555444432111 111 Q ss_pred ceeeEEeCCCCCcCHHHHHHHHHHH-hccCChHHHHHhCCC--CCCHH--------HHHHHHHHHHH----HHHHHHHhh Q lcl|NC_010808. 423 NTVRYVYNRNLPKSLIEELKAYIDS-GGKISQTTLMSLFSF--FQDPE--------LEVKKIEEDEK----ESIKKAQKG 487 (512) Q Consensus 423 ~~i~i~f~~~~p~d~~~~~~~~~kl-~g~~s~et~~~~~~~--v~d~~--------~E~~ri~~E~~----~~~~~~~~~ 487 (512) ..+.+.|.+.-..+.++...+.... +|+++.-.++++++. ++.-+ ..+........ ......... T Consensus 421 ~~~~~~f~~~d~~~~~~~~~~~~~~~~G~lT~NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (574) T protein:vir:80 421 EKYQFQFRGGDLSAQLDKLKIIEQEGKVFRTVNEIRHDKGLEPIKGGDVILNGVHIQAIGQALQEEQLEYQRSQDRLNRL 500 (574) T ss_pred CceEEEecccchhhHHHHHHHHHHHhCCccCHHHHHHHhCCCCCCCCCEeeeccceeecccccccccCCccchhcccccc Confidence 2456778776666555555443322 689999888888744 22100 00000000000 000000000 Q ss_pred cccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 488 IYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ........+.+..++..+.+.|+.| T Consensus 501 ~~~~~~~~~~~~~~~p~~~~~d~~~ 525 (574) T protein:vir:80 501 LELSGGDVEQPEPEEPKDSQNDTDV 525 (574) T ss_pred ccccCCCCCCCCCCCCCCccccccc Confidence 0000111111111121222222222 No 150 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=98.13 E-value=4e-06 Score=50.17 Aligned_cols=410 Identities=8% Similarity=-0.014 Sum_probs=174.2 Q ss_pred HHHHHHHHHHHHHHH--H--HHHHHHhccccccc---ccccccccccccceeeecchHHHHHHHHHhhhhccCceecC-- Q lcl|NC_010808. 45 VSKYIEHHMDYQRPR--L--KVLSDYYEGKTKNL---VELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQD-- 115 (512) Q Consensus 45 l~~~i~~~~~~~~~r--~--~~~~~yy~G~~~~~---~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~-- 115 (512) +-+++...+...+.. . .-+...+..-.... ........... -+.+.=....|+.+++-+..-|+.+-- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~~~---al~~~~V~~~v~~Ia~~iA~lp~~~~~~~ 77 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVKADPEA---VLSFHAVFACISLISQDIAKMRLRLMQTD 77 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchhhcCcccChHH---hhccHHHHHHHHHHHHhhccCceEEEEec Confidence 122221111111000 0 00001110000000 00000000000 011112233466666666666776411 Q ss_pred Cc---hh-HHHHHHHHHhc-c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEE Q lcl|NC_010808. 116 DD---KD-VLEAIEAFNDL-N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIA 186 (512) Q Consensus 116 ~d---~~-~~~~l~~~~~~-n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~ 186 (512) .+ .. ....+..++.. | ....+...+..+++.+|.+|+++-.+.+|++ .+..++|..+-++.++. +++. T Consensus 78 ~~g~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~--g~~~- 154 (454) T protein:vir:93 78 AQGIRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWNRVEPLVADD--GEVF- 154 (454) T ss_pred cCCccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCcceEEEEcCC--CcEE- Confidence 11 11 12223444433 3 2346777788899999999999999888886 57888999988887653 2221 Q ss_pred EEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHH Q lcl|NC_010808. 187 GVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLID 266 (512) Q Consensus 187 ~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liD 266 (512) |......... . .....+.++.++++.... ..+...|.|.+......+. T Consensus 155 ----y~~~~~~~~~--~-~~~~~~~~~eViH~k~~~-------------------------~~~~~~G~sp~~~~~~~i~ 202 (454) T protein:vir:93 155 ----YRITPDRNCG--I-TEAVTVPAREVIHDRFNC-------------------------FFHPLIGLPPVYAAGLAAT 202 (454) T ss_pred ----EEEEeccccc--c-ceeEEecCcceEEeccCC-------------------------CCCCceeccHHHHHHHHHH Confidence 1111110000 0 001124445555443210 0112247777776666666 Q ss_pred HHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHH Q lcl|NC_010808. 267 LYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAY 346 (512) Q Consensus 267 a~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 346 (512) ....+..-....+.-.+.|-.+++-....+++..+.+++.-.-... + .+.....-.+.+.+++.++.......+.+. T Consensus 203 ~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~-g--~n~g~~~vl~~g~~~~~l~~~~~d~q~le~ 279 (454) T protein:vir:93 203 QGHHIQENSTSFFRNGGRPSGVIEIPGSITEENAKKLKSNWDSGYT-G--ENAGKTAILSNGAKYNPTTFSPVDSQTVEQ 279 (454) T ss_pred HHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHHHHHHHHHhc-c--cccCCceeccCCceEEEcccChhHHHHHHH Confidence 5555544444445555566566553333344444443322110000 0 111111122455566666554444555566 Q ss_pred HHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceee Q lcl|NC_010808. 347 KDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVR 426 (512) Q Consensus 347 ~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~ 426 (512) .+.....|+..-++|....+...+.. ...++ ......+...|.-++..|...++..-.... -..++ T Consensus 280 ~~~~~~~Ia~~fgVPp~~lg~~~~~t-~sn~e----------~~~~~f~~~~l~P~~~~ie~~ln~~L~~~~---~~~~~ 345 (454) T protein:vir:93 280 LKMTAEIVCSVFRVPAYKIGVGQPPS-SDNVE----------ALEQQYYSQCLQTLIESIELLLDEALETGE---NESTE 345 (454) T ss_pred HHHHHHHHHHHhCCCHHHcCCCCCCc-chhHH----------HHHHHHHHHHHHHHHHHHHHHHHHhhcCCC---CcEEE Confidence 77777889998899987665432221 11111 111223344444444444444433211111 12345 Q ss_pred EEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC--CHHH--------HHHHHHHHHHHHHHHHHhhcccCCCC Q lcl|NC_010808. 427 YVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQ--DPEL--------EVKKIEEDEKESIKKAQKGIYKDPRD 494 (512) Q Consensus 427 i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~--d~~~--------E~~ri~~E~~~~~~~~~~~~~~~~~~ 494 (512) +.+..-+..|..+.++.+.++ .|+++.-.++++++.-. +-++ -+..+.+.... .......+.+.. T Consensus 346 f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~ 422 (454) T protein:vir:93 346 FDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALYLQQQNYSLEALSRRDAR---EDPFASSGKTAS 422 (454) T ss_pred eechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccCccchHhhhccCcc---cCCCCCCccCCC Confidence 555666678889999988887 68999988888875422 1000 01111111000 000000000000 Q ss_pred CCCCCCCCCCcCcccCCC Q lcl|NC_010808. 495 INDDEQDDDTKDTVDKKE 512 (512) Q Consensus 495 ~~~~~~~~~~~~~~~~~e 512 (512) ......+.+.+++..+.| T Consensus 423 ~~~~~~~~d~~~~~~e~~ 440 (454) T protein:vir:93 423 VPQAVAASDGNKAITETE 440 (454) T ss_pred CCCCCCCCCCCCCccCCc Confidence 000000000111111111 No 151 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=98.01 E-value=7.1e-06 Score=48.81 Aligned_cols=381 Identities=10% Similarity=0.008 Sum_probs=164.8 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri 90 (512) |+++ +..+..-.... ..... .+ .+.. + .+..+..+... .. ...-+ T Consensus 1 M~~f---~~~~~~~~~~~--~~~~~-~~--------~~~~-------~---~~~~~~~~~~~--------v~---~~~~~ 45 (386) T protein:vir:48 1 MPIF---NITNLATESPP--ISQGG-FF--------DITD-------P---DFLSTLNGSEW--------VS---AESAL 45 (386) T ss_pred Cccc---ccccccccccc--ccccc-cc--------cccc-------c---hhcccccCCce--------ec---hhhhh Confidence 4333 23332211100 00000 00 0000 0 00000000000 00 00001 Q ss_pred ecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEcc Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDA 169 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p 169 (512) ...-....|+.+++-+.+-|+.+. +......+.+-...-....+...+..+.+.+|.+|+++-++..|++ .+..++| T Consensus 46 ~~~~v~~~i~~ia~~ia~~p~~~~--~~~~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~ 123 (386) T protein:vir:48 46 RNSDLFSIINQLSNDLATVKLTAS--RKQLQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRP 123 (386) T ss_pred cchHHHHHHHHHHHhhccCceeec--cchhHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecC Confidence 122333455555555555566543 2333333333333334556677788899999999999999988875 5677888 Q ss_pred ceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeec Q lcl|NC_010808. 170 MSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFS 249 (512) Q Consensus 170 ~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~ 249 (512) ..+.+..+... .. + +|.+...... ......+.++.+++++... .. T Consensus 124 ~~v~v~~~~~~-~~-~----~y~~~~~~~~----~~~~~~~~~~evih~~~~~-------------------------~~ 168 (386) T protein:vir:48 124 SQVSFNRLDNK-DG-I----YYNITFDDPR----IPPKQHVPQGDVLHFKLLS-------------------------VD 168 (386) T ss_pred ceeEEEEcCCC-ce-E----EEEEEecCcc----ccceeEecCccEEEecCCC-------------------------CC Confidence 88877665421 11 1 1211110000 0011123444444432110 00 Q ss_pred CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCc Q lcl|NC_010808. 250 NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSV 329 (512) Q Consensus 250 n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (512) ....|.|.+..+...+.....+..-....+.-.+.|-.+++-......+.....++... ....+.......+++. T Consensus 169 ~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~e~~~~~~~~~~-----~~~~n~g~~~vl~~g~ 243 (386) T protein:vir:48 169 GGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKLSRSRQ-----AMKQMQGGPLVLDDLE 243 (386) T ss_pred CceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHH-----HhhcCCCCceecCCCc Confidence 01247777777666666655555555555555666777776544444444333332211 0111111122234455 Q ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 330 DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETI 409 (512) Q Consensus 330 ~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~ 409 (512) +++.++.......+.+..+...+.|+..-++|....+..+.+.+.... ....+...|..+++.|... T Consensus 244 ~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~e~~-------------~~~~~~~~l~P~~~~ie~~ 310 (386) T protein:vir:48 244 EFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEM-------------SLDLYNKAVSRYLRPFLSE 310 (386) T ss_pred eEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHH-------------HHHHHHHHHHHHHHHHHHH Confidence 665555443444456777888888999989997766532222222111 1113344444444444444 Q ss_pred HHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 410 LKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSF--FQDPELEVKKIEEDEKESIKKAQ 485 (512) Q Consensus 410 l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~--v~d~~~E~~ri~~E~~~~~~~~~ 485 (512) ++..-.. .+.+.+...+-.+....+..+.++ +|+++.-.+++.++. +.. .|+... . T Consensus 311 l~~~l~~-------~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~--~~~~~~-----------~ 370 (386) T protein:vir:48 311 LSQKLSC-------DVDADILPAVDPTGSNSVSRINSMVKSGTLAQNQGLYILQQAEILP--KELPEG-----------E 370 (386) T ss_pred HHHhhcc-------hhhcchhhhhccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCCC--ccchhh-----------c Confidence 4332111 111222223334555666666665 689999888877642 221 111110 0 Q ss_pred hhcccCCCCCCCCCCCCCCcC Q lcl|NC_010808. 486 KGIYKDPRDINDDEQDDDTKD 506 (512) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~~ 506 (512) .........++++ .+| T Consensus 371 ~~~~~~~~gGd~~-----~~~ 386 (386) T protein:vir:48 371 NPNKTTLKGGEIN-----GED 386 (386) T ss_pred CCCCCccCCCCCC-----CCC Confidence 0000000011111 111 No 152 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=98.01 E-value=7.3e-06 Score=48.74 Aligned_cols=395 Identities=12% Similarity=0.031 Sum_probs=182.1 Q ss_pred hccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecch Q lcl|NC_010808. 15 ENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDY 94 (512) Q Consensus 15 ~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~ 94 (512) ...+.+|++........ ..++.. .+-+.... ..... ... ..-+.+.. T Consensus 1 ~~f~~~f~r~~~~~~~~----------~~~~~~------------------~~~~~~~~-~~g~~-v~~---~~~l~~~~ 47 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTT----------PAELAE------------------AIGLSYDT-YTGKR-ISS---QRAMRLTA 47 (413) T ss_pred CccchhhccCccCCccc----------hHHHHH------------------hhhcCccc-ccCce-ech---hhhhccHH Confidence 23345565533222100 111111 11110000 00000 000 01122334 Q ss_pred HHHHHHHHHhhhhccCceecCCc-----hhHHHHHHHHHh-----ccChhHHHHHHHHHHHhCCeEEEEEEECCCCce-E Q lcl|NC_010808. 95 ASYISDFINGYFLGNPIQCQDDD-----KDVLEAIEAFND-----LNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-R 163 (512) Q Consensus 95 ~~~iv~~~a~~l~g~~~~~~~~d-----~~~~~~l~~~~~-----~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~ 163 (512) ..-.|+.+++-+.+-|+.+-..+ ......+..++. .-....+...+..+.+.+|.||+++..+ .|++ . T Consensus 48 v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~~~ 126 (413) T protein:vir:48 48 VYSCVRVLAESVGMLPCSLYKISGTLKTRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-LGEVVE 126 (413) T ss_pred HHHHHHHHHHhhhhCceEEEEecCCcceeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-CCcEEE Confidence 45567777777766676542111 111223445543 1244567777888999999999998775 5664 4 Q ss_pred EEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccccc Q lcl|NC_010808. 164 LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERM 243 (512) Q Consensus 164 i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 243 (512) +..++|..+.+..+.. ..+. |......+ ....|.++.+++++... T Consensus 127 L~~l~~~~v~~~~~~~--~~~~-----y~~~~~~g-------~~~~~~~~evih~~~~~--------------------- 171 (413) T protein:vir:48 127 LLPIDPGCVEPKLNSQ--WQPV-----YQVTFPDG-------SVDVLTQDEIWHVRTLT--------------------- 171 (413) T ss_pred EEEEcCceEEEEEcCC--ceEE-----EEEEecCc-------eEEEEccccEEEecCcC--------------------- Confidence 6778888888777643 2222 11111110 01234455555543210 Q ss_pred ceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhccccc Q lcl|NC_010808. 244 PITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGI 323 (512) Q Consensus 244 Pvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (512) .+...|.|.+..+...++....+..-..+.+.-.+.|-.+++.....+++..+..++.-.-.... ..+..... T Consensus 172 -----~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~~~~~~~~~~~~g--~~n~g~~~ 244 (413) T protein:vir:48 172 -----LDGLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYERLKKDFEERHTG--LGNAHRPM 244 (413) T ss_pred -----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhcC--ccccCcce Confidence 01234778777777777766665555555555566677776654444444444433321111100 01111112 Q ss_pred CCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccccccccccc-ccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 324 ETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRR 401 (512) Q Consensus 324 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~ 401 (512) ..+.+.+++-++.......+.+..+.....|+..-++|....+..+ ++- +.+... ...+...|.- T Consensus 245 vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~-------------~~f~~~~i~P 311 (413) T protein:vir:48 245 ILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELG-------------LGFINYSLVP 311 (413) T ss_pred ecCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHH-------------HHHHHHHHHH Confidence 2344556665554444445566777788889998899876655332 111 111111 2233334555 Q ss_pred HHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHH Q lcl|NC_010808. 402 RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKE 479 (512) Q Consensus 402 ~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~ 479 (512) +++.|...+...-..........+++.+..-+-.|..+.++++.++ +|+++.-.++++++.-.-+. -+... T Consensus 312 ~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~g--gD~~~----- 384 (413) T protein:vir:48 312 YLTRIEQRINTGLVRESKQGKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPG--GDVYL----- 384 (413) T ss_pred HHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cceee----- Confidence 5555544444321111111111234444455567888899998887 68999988888886522110 00000 Q ss_pred HHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 480 SIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) .... . .......++....+++.+..| T Consensus 385 -~~~n----~--~~~~~~~~~~~~~~~~~~~~~ 410 (413) T protein:vir:48 385 -TPMN----M--TTSPSAGDDNGKKKESGDADK 410 (413) T ss_pred -cccc----c--cccccccccCCCCCCCCCccc Confidence 0000 0 000011111122222233333 No 153 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=97.99 E-value=8e-06 Score=48.52 Aligned_cols=399 Identities=10% Similarity=0.008 Sum_probs=173.4 Q ss_pred HHHHHHHHHHHH--H--HHHHHHHHhccccccccccccccccccccee-eecchHHHHHHHHHhhhhccCceecCCch-h Q lcl|NC_010808. 46 SKYIEHHMDYQR--P--RLKVLSDYYEGKTKNLVELTRRKEEYMADNR-VAHDYASYISDFINGYFLGNPIQCQDDDK-D 119 (512) Q Consensus 46 ~~~i~~~~~~~~--~--r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~r-i~~n~~~~iv~~~a~~l~g~~~~~~~~d~-~ 119 (512) .-+..+...+.. + .....-....|.... ......... +.+.-.--.|+.+++-+.+-|+++..+.. . T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~~ 73 (416) T protein:vir:45 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGT-------KLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQIN 73 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhcccccc-------CccccchhhhhcchHHHHHHHHHHHhhccCceEEecCcccc Confidence 111100000000 0 000111111111100 000000000 11111222466677767777776643221 1 Q ss_pred HHHHHHHHHh--cc---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeee Q lcl|NC_010808. 120 VLEAIEAFND--LN---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRT 193 (512) Q Consensus 120 ~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~ 193 (512) ....+..++. -| ....+...+....+.+|.||+++.++..|++ .+..++|..+.++.+.. ..+. |+ . T Consensus 74 ~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~--g~~~----~~-~ 146 (416) T protein:vir:45 74 YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDAR--GRLY----YF-H 146 (416) T ss_pred ccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCC--ccEE----EE-E Confidence 2233444443 23 2345667788888999999999999998986 47788999988887653 2221 11 1 Q ss_pred eeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHH Q lcl|NC_010808. 194 KPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (512) Q Consensus 194 ~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s 273 (512) ...+.... .....+.+..+.+++.. |. +...|.|.++.+...++....... T Consensus 147 ~~~~~~~~---~~~~~~~~~evihir~~----------------------~~----d~~~G~s~i~~~~~~i~~~~~~~~ 197 (416) T protein:vir:45 147 QRIDSNGN---NIERNVKFEDMLDIKFY----------------------SL----DGINGLSLLDTLSRTIESDNNGKD 197 (416) T ss_pred EEecCCCc---eeEEEEccccEEEeccC----------------------CC----CCccccCHHHHHHHHHHHHHHHHH Confidence 11111111 11123444444443210 10 112477777777777766555555 Q ss_pred HHHHHHHHhcCceeeeecCCcC-ChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHH Q lcl|NC_010808. 274 DTANYMSDLNDAMLLIKGNLSL-DPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNS 352 (512) Q Consensus 274 ~~~~~~~~~~~~~lv~~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~ 352 (512) -..+.+.-...|-.+++-.... +++..+..+..-.-... + ..+.......+++.+.+.++.......+.+..+.... T Consensus 198 ~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~-g-~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~ 275 (416) T protein:vir:45 198 FLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFS-G-TKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTR 275 (416) T ss_pred HHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhc-C-ccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHH Confidence 5555555566676666522112 22222222211000000 0 0011111223445566555544444455566677778 Q ss_pred HHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCC Q lcl|NC_010808. 353 DIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRN 432 (512) Q Consensus 353 ~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~ 432 (512) .|+..-++|....+...++.|.+... ..|...|..++..|...++.+-... .....+++.+..- T Consensus 276 ~Ia~~fgVPp~~lg~~~~~~~~~~~~--------------~~~~~~l~P~~~~ie~~ln~~l~~~--~~~~~~~f~~~~l 339 (416) T protein:vir:45 276 EIAGVFGIPLHKFGIETANMSITDAN--------------LDYLSTLKPYITCVCAELNFKFNDE--YVNREFKFDTTEI 339 (416) T ss_pred HHHHHhCCCHHHcCCCCCCccHHHHH--------------HHHHHHHHHHHHHHHHHHhhhcccc--ccCceEEEechhh Confidence 89998899876554322222222111 1122345555555554444332111 1112344444444 Q ss_pred CCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcc Q lcl|NC_010808. 433 LPKSLIEELKAYIDS--GGKISQTTLMSLFSFF--QDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTV 508 (512) Q Consensus 433 ~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (512) +-.|..+.++++.++ .|+++.-.++++++.- ++.+...-.+...-.. . +. .+..+.+.....+.+...+ T Consensus 340 ~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~---~--~~--~~~~~~~~~~~~~~~~kgG 412 (416) T protein:vir:45 340 RVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVN---I--EL--VDEYQMNKSRATDKKLKGG 412 (416) T ss_pred hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccc---c--cc--ccccCcccccccccccCCC Confidence 567888888888887 6899999998888542 2322211111000000 0 00 0000111111111111122 Q ss_pred cCCC Q lcl|NC_010808. 509 DKKE 512 (512) Q Consensus 509 ~~~e 512 (512) |+.| T Consensus 413 e~n~ 416 (416) T protein:vir:45 413 EENE 416 (416) T ss_pred CCCC Confidence 2222 No 154 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=97.99 E-value=8e-06 Score=48.52 Aligned_cols=399 Identities=10% Similarity=0.008 Sum_probs=173.4 Q ss_pred HHHHHHHHHHHH--H--HHHHHHHHhccccccccccccccccccccee-eecchHHHHHHHHHhhhhccCceecCCch-h Q lcl|NC_010808. 46 SKYIEHHMDYQR--P--RLKVLSDYYEGKTKNLVELTRRKEEYMADNR-VAHDYASYISDFINGYFLGNPIQCQDDDK-D 119 (512) Q Consensus 46 ~~~i~~~~~~~~--~--r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~r-i~~n~~~~iv~~~a~~l~g~~~~~~~~d~-~ 119 (512) .-+..+...+.. + .....-....|.... ......... +.+.-.--.|+.+++-+.+-|+++..+.. . T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~~ 73 (416) T protein:vir:81 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGT-------KLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQIN 73 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhcccccc-------CccccchhhhhcchHHHHHHHHHHHhhccCceEEecCcccc Confidence 111100000000 0 000111111111100 000000000 11111222466677767777776643221 1 Q ss_pred HHHHHHHHHh--cc---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeee Q lcl|NC_010808. 120 VLEAIEAFND--LN---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRT 193 (512) Q Consensus 120 ~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~ 193 (512) ....+..++. -| ....+...+....+.+|.||+++.++..|++ .+..++|..+.++.+.. ..+. |+ . T Consensus 74 ~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~--g~~~----~~-~ 146 (416) T protein:vir:81 74 YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDAR--GRLY----YF-H 146 (416) T ss_pred ccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCC--ccEE----EE-E Confidence 2233444443 23 2345667788888999999999999998986 47788999988887653 2221 11 1 Q ss_pred eeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHH Q lcl|NC_010808. 194 KPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (512) Q Consensus 194 ~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s 273 (512) ...+.... .....+.+..+.+++.. |. +...|.|.++.+...++....... T Consensus 147 ~~~~~~~~---~~~~~~~~~evihir~~----------------------~~----d~~~G~s~i~~~~~~i~~~~~~~~ 197 (416) T protein:vir:81 147 QRIDSNGN---NIERNVKFEDMLDIKFY----------------------SL----DGINGLSLLDTLSRTIESDNNGKD 197 (416) T ss_pred EEecCCCc---eeEEEEccccEEEeccC----------------------CC----CCccccCHHHHHHHHHHHHHHHHH Confidence 11111111 11123444444443210 10 112477777777777766555555 Q ss_pred HHHHHHHHhcCceeeeecCCcC-ChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHH Q lcl|NC_010808. 274 DTANYMSDLNDAMLLIKGNLSL-DPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNS 352 (512) Q Consensus 274 ~~~~~~~~~~~~~lv~~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~ 352 (512) -..+.+.-...|-.+++-.... +++..+..+..-.-... + ..+.......+++.+.+.++.......+.+..+.... T Consensus 198 ~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~-g-~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~ 275 (416) T protein:vir:81 198 FLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFS-G-TKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTR 275 (416) T ss_pred HHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhc-C-ccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHH Confidence 5555555566676666522112 22222222211000000 0 0011111223445566555544444455566677778 Q ss_pred HHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCC Q lcl|NC_010808. 353 DIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRN 432 (512) Q Consensus 353 ~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~ 432 (512) .|+..-++|....+...++.|.+... ..|...|..++..|...++.+-... .....+++.+..- T Consensus 276 ~Ia~~fgVPp~~lg~~~~~~~~~~~~--------------~~~~~~l~P~~~~ie~~ln~~l~~~--~~~~~~~f~~~~l 339 (416) T protein:vir:81 276 EIAGVFGIPLHKFGIETANMSITDAN--------------LDYLSTLKPYITCVCAELNFKFNDE--YVNREFKFDTTEI 339 (416) T ss_pred HHHHHhCCCHHHcCCCCCCccHHHHH--------------HHHHHHHHHHHHHHHHHHhhhcccc--ccCceEEEechhh Confidence 89998899876554322222222111 1122345555555554444332111 1112344444444 Q ss_pred CCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcc Q lcl|NC_010808. 433 LPKSLIEELKAYIDS--GGKISQTTLMSLFSFF--QDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTV 508 (512) Q Consensus 433 ~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (512) +-.|..+.++++.++ .|+++.-.++++++.- ++.+...-.+...-.. . +. .+..+.+.....+.+...+ T Consensus 340 ~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~---~--~~--~~~~~~~~~~~~~~~~kgG 412 (416) T protein:vir:81 340 RVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVN---I--EL--VDEYQMNKSRATDKKLKGG 412 (416) T ss_pred hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccc---c--cc--ccccCcccccccccccCCC Confidence 567888888888887 6899999998888542 2322211111000000 0 00 0000111111111111122 Q ss_pred cCCC Q lcl|NC_010808. 509 DKKE 512 (512) Q Consensus 509 ~~~e 512 (512) |+.| T Consensus 413 e~n~ 416 (416) T protein:vir:81 413 EENE 416 (416) T ss_pred CCCC Confidence 2222 No 155 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=97.95 E-value=9.3e-06 Score=48.16 Aligned_cols=335 Identities=11% Similarity=0.085 Sum_probs=151.3 Q ss_pred hhccCceecCCchhHHHHHHHHHh--cc---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCC Q lcl|NC_010808. 106 FLGNPIQCQDDDKDVLEAIEAFND--LN---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNT 179 (512) Q Consensus 106 l~g~~~~~~~~d~~~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~ 179 (512) +..-|+.+.-.++.....+.+++. -| ....+...+....+.+|.||+++-++..|++ .+..++|..+.++.++. T Consensus 1 ia~lp~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~l~~~~v~~~~~~~ 80 (348) T protein:vir:93 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (348) T ss_pred CcccceEeEecCcCcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCCceEEEEeCC Confidence 233344432222233334455543 13 3445567778888999999999999999986 46778888877776543 Q ss_pred CCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchH Q lcl|NC_010808. 180 IERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYE 259 (512) Q Consensus 180 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~ 259 (512) . +. ++ |.+.... . ..+ .|.+..+++++... | .+.-.|.|.++ T Consensus 81 ~-~~-~~----y~~~~~~----g--~~~-~~~~~eiih~r~~~---------------------~----~~~~~G~s~~~ 122 (348) T protein:vir:93 81 S-RE-LY----YSIHAAT----G--NKL-IVHNMDMLHFKHIV---------------------A----SNMVQGISPID 122 (348) T ss_pred C-cE-EE----EEEEcCC----C--eEE-EEccccEEEecCCC---------------------C----CCceeeccHHH Confidence 2 11 11 1111100 0 011 23444444432210 0 01123666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCc-eeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecC Q lcl|NC_010808. 260 KVITLIDLYDNAESDTANYMSDLNDA-MLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQY 338 (512) Q Consensus 260 ~v~~liDa~~~~~s~~~~~~~~~~~~-~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 338 (512) .+...++..+.+... .+..+..+ ..++.-....+++..+..++.-.- ...+.......+.+.+++.++.+. T Consensus 123 ~~~~~i~~~~~~~~~---~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~-----~~~n~~~~~vl~~g~~~~~l~~~~ 194 (348) T protein:vir:93 123 VLKNTTDFDNAVRTF---NLTEMQKPDSFMLKYGSNVSTEKRQQVLEDFKQ-----YYEENGGILFQEPGVEIEPLPKKY 194 (348) T ss_pred HHHHHHHHHHHHHHH---HHHhcCCCceeEEecCCCCCHHHHHHHHHHHHH-----HhhcCCCeeecCCCceEEEcCCCh Confidence 666555544333211 23333333 333332223344443333322110 111111122334555666665544 Q ss_pred CHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Q lcl|NC_010808. 339 DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA 418 (512) Q Consensus 339 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~ 418 (512) ....+.+..+.....|+..-++|....+... ..+...++.. ....+...|.-+++.|...+...-.... T Consensus 195 ~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~-~~~~~~~e~~----------~~~~~~~~l~P~~~~ie~~l~~~l~~~~ 263 (348) T protein:vir:93 195 VSEDIVASENLTRERVANVFQLPSIFLNARS-NTNFAKNEEL----------NRFYLQHTLLPIVKQYEEEFNRKLLTKT 263 (348) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC-CCCcccHHHH----------HHHHHHHHHHHHHHHHHHHHHHhhCCcc Confidence 4445566677788889999899877665322 2222222111 1122333444444444444443211111 Q ss_pred ccc-cceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC--CHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|NC_010808. 419 NKD-FNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQ--DPELEVKKIEEDEKESIKKAQKGIYKDPR 493 (512) Q Consensus 419 ~~d-~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~--d~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 493 (512) ... ...+++.+..-+-.|..+.++++.++ +|+++.-.+++.++.-. +-+.=+ + ....... T Consensus 264 ~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~--~--------~~n~~~~----- 328 (348) T protein:vir:93 264 DREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPL--I--------SGDLYPI----- 328 (348) T ss_pred cccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcCeEe--e--------ccccccc----- Confidence 100 11233334455567888889998887 68999999998886521 100000 0 0000000 Q ss_pred CCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 494 DINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 494 ~~~~~~~~~~~~~~~~~~e 512 (512) +...+.+....+.+.+.+| T Consensus 329 ~~~~~~~~~~~gg~~n~~~ 347 (348) T protein:vir:93 329 DTPLELRKSLKGGDKNVNE 347 (348) T ss_pred ccchhhcccccCCCCCcCC Confidence 0000000000111111111 No 156 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=97.93 E-value=1.1e-05 Score=47.86 Aligned_cols=458 Identities=9% Similarity=0.009 Sum_probs=201.0 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri 90 (512) |+.|. ... +-.+..+...+.+..++..-..+++.+.+|..-... . ...........++ T Consensus 1 ~~~~~--------------~~~----~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~-~---~~~~~~~~~~~~~ 58 (543) T protein:vir:88 1 MAETK--------------REG----LAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLF-P---KDSDNSSTDYTTP 58 (543) T ss_pred Ccccc--------------cCc----chHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccC-C---CCCCccccccccc Confidence 11110 000 011112333344444444545566677777643211 0 0011111122345 Q ss_pred ecchHHHHHHHHHhhhhcc--Cc----eecCCch-------------h-------HHHHHHHHHhccChhHHHHHHHHHH Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGN--PI----QCQDDDK-------------D-------VLEAIEAFNDLNDVESHNRSLGLDL 144 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~--~~----~~~~~d~-------------~-------~~~~l~~~~~~n~~~~~~~~~~~~~ 144 (512) .-+-+...++.+++.|++- |. ++...+. + +.+.+...+..++|.....++.++. T Consensus 59 ~dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L 138 (543) T protein:vir:88 59 WQAVGARGLNNLSAKVMLALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQL 138 (543) T ss_pred ccchHHHHHHHHHHHHHHhhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHH Confidence 5677778888888877652 22 1122211 1 1223444556688999999999999 Q ss_pred HhCCeEEEEEEECCCCceE---EEEEccceeEEEEeCCCCceeEEEEEEeeeeeec-----------cCCcceEEEEEEE Q lcl|NC_010808. 145 SIYGKAYELMIRNQDDETR---LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID-----------KTDEDEVFTVDLF 210 (512) Q Consensus 145 ~~~G~a~~~v~~d~~g~~~---i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~-----------~~~~~~~~~~~~y 210 (512) .++|.+.+++-.+....++ ++.+ |..-|.+..+. .+++...+|.++..... ....+....+++| T Consensus 139 ~~~G~a~ly~~~~~~~~~~~~~~~~~-pl~~y~v~~d~-~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~ 216 (543) T protein:vir:88 139 ALAGTALIYLPPPDASSNSYNPMKLY-TLHNHVVQRDA-FGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVY 216 (543) T ss_pred HhhCceeeeeccCccccceecceEEe-EcceEEEeeCC-CCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEE Confidence 9999998776554432222 2222 44445554333 45666666655432111 0011111234444 Q ss_pred cCCcEEEEEecCCc---cccccccc--cccccccccccceEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 211 TSHGVYRYLTSRTN---GLKLTPRE--NGFESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMS 280 (512) Q Consensus 211 t~~~~~~~~~~~~~---~~~~~~~~--~~~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~ 280 (512) +. ++ .+.+... ........ ......++..+|++.+ ..+.+|+|..+...+-+..+|.+.-......+ T Consensus 217 ~~--V~-pr~~~~~~~~~~~~~~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~ 293 (543) T protein:vir:88 217 TH--IY-IDDESGDFLSYQEIEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAM 293 (543) T ss_pred EE--EE-eecCCCcccccccccCeeeecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 32 11 1111110 00000011 1112233556776654 24568999999999999999999888888888 Q ss_pred HhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEE--eecCCHHHHHHHHHHHHHHHHHHh Q lcl|NC_010808. 281 DLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYI--YKQYDVQGTEAYKDRLNSDIHMFT 358 (512) Q Consensus 281 ~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~i~~~s 358 (512) ...+|.+.+.-........+. .++ ...+..+..++++.+ ....+.......++.++..|...- T Consensus 294 ~~~~pp~~v~~~g~~~~~~~~---~~~------------~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af 358 (543) T protein:vir:88 294 ISSKVVGLVNPNGITQVRRLV---KAQ------------TGDFVAGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVF 358 (543) T ss_pred HHhcCceeeccccccchhhcc---cCC------------CceeecCCCCcceeeecccccchhHHHHHHHHHHHHHHHHH Confidence 888888765321111111111 111 111111222333333 233466777777887777775433 Q ss_pred cccccccccccccchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCC-CcC Q lcl|NC_010808. 359 NTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKT-KEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNL-PKS 436 (512) Q Consensus 359 ~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~-~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~-p~d 436 (512) ..-. .....+...+|..+........+...- ..++-.+.|.-+++.++.++...+..... .-..+++.+...+ +-. T Consensus 359 ~~~~-~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~-p~~~v~~~~vs~l~~l~ 436 (543) T protein:vir:88 359 MLNS-AVQRSGERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNL-PQEAVEPTVTTGAEALG 436 (543) T ss_pred hhhh-hccCCCCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC-chhceeeeEEecHHHHH Confidence 2211 111223446777766543333322221 12222333333444444555444433221 1224566664332 111 Q ss_pred HHHHHHHHHHH---hccCCh---------HHHHHhC---CCCC-----CHHHHHHHHHHHHHHHHHHHHhhcccCCCCCC Q lcl|NC_010808. 437 LIEELKAYIDS---GGKISQ---------TTLMSLF---SFFQ-----DPELEVKKIEEDEKESIKKAQKGIYKDPRDIN 496 (512) Q Consensus 437 ~~~~~~~~~kl---~g~~s~---------et~~~~~---~~v~-----d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 496 (512) ....++.+... .|.+++ ..++..+ -+++ -.++|+++++++++......+.......+... T Consensus 437 r~~~~~~l~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~~~ 516 (543) T protein:vir:88 437 RGQDLDKLTQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQEMLKQGGLNAAAGIGSGVAA 516 (543) T ss_pred HHHHHHHHHHHHHHHHhccchhhhccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhchhh Confidence 22222222221 122222 2333322 1231 23567777777665444333332222222111 Q ss_pred CCCCCCCCcCcccC---CC Q lcl|NC_010808. 497 DDEQDDDTKDTVDK---KE 512 (512) Q Consensus 497 ~~~~~~~~~~~~~~---~e 512 (512) +.....+..+..-+ .+ T Consensus 517 ~~~~~~~~~~~~~~~~~~~ 535 (543) T protein:vir:88 517 QATASPEAMESAMDTAGVQ 535 (543) T ss_pred hhccChHHHHHHhhhcCCC Confidence 11111111011100 11 No 157 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=97.92 E-value=1.1e-05 Score=47.77 Aligned_cols=396 Identities=11% Similarity=0.014 Sum_probs=170.6 Q ss_pred cCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc--cccc--------c--ccccce Q lcl|NC_010808. 21 FNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVEL--TRRK--------E--EYMADN 88 (512) Q Consensus 21 f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~--~~~~--------~--~~~~~~ 88 (512) +++. +......+....+....+.-... ...+ . ...+.. T Consensus 1 ~~~~------------------------------~~mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 50 (432) T protein:vir:81 1 MPDE------------------------------KKLGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGA 50 (432) T ss_pred CCch------------------------------hhcchhhhhhhhcccccccccccccccccCccchhhhcccccccCc Confidence 1111 01111222222222211100000 0000 0 000000 Q ss_pred e------eecchHHHHHHHHHhhhhccCcee--cCCc---hhHHHHHHHHHh--ccC---hhHHHHHHHHHHHhCCeEEE Q lcl|NC_010808. 89 R------VAHDYASYISDFINGYFLGNPIQC--QDDD---KDVLEAIEAFND--LND---VESHNRSLGLDLSIYGKAYE 152 (512) Q Consensus 89 r------i~~n~~~~iv~~~a~~l~g~~~~~--~~~d---~~~~~~l~~~~~--~n~---~~~~~~~~~~~~~~~G~a~~ 152 (512) . +.+.-....|+.+++-+..-|+.+ ...+ +.....+..++. -|. -..+...+..+++.+|.||+ T Consensus 51 ~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv 130 (432) T protein:vir:81 51 AVNADAIMRLDAVAACVKLVSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYV 130 (432) T ss_pred ccchHhhhccHHHHHHHHHHHHhhhhCceeeEEecCCcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEE Confidence 1 111223345666666666667654 1111 112233445543 232 33566778888999999999 Q ss_pred EEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccc Q lcl|NC_010808. 153 LMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPR 231 (512) Q Consensus 153 ~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~ 231 (512) ++..+ +|++ .+..++|..+.+..++. .++. |.....++ ..+ .+.++.+++++.. T Consensus 131 ~i~~~-~g~~~~L~~l~~~~v~v~~~~~--g~~~-----y~~~~~~g------~~~-~~~~~~iih~r~~---------- 185 (432) T protein:vir:81 131 RKVVT-DGRIESLQYLANDRLTITTDPK--GNTA-----YRYRRTDG------QMI-DIPKQQIWKIMGY---------- 185 (432) T ss_pred EEEec-CCcEEEEEEEcCCceEEEECCC--CcEE-----EEEEecCc------eEE-EEccccEEEecCC---------- Confidence 98775 4664 46678888888777643 2211 11111110 011 2233444433211 Q ss_pred ccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhcccccc Q lcl|NC_010808. 232 ENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFL 311 (512) Q Consensus 232 ~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~ 311 (512) | .+...|.|.+..+...|+.......-..+.+.-...|-.++.-....+++..+..++.- . T Consensus 186 ------------~----~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~--~- 246 (432) T protein:vir:81 186 ------------S----LDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFAKKV--S- 246 (432) T ss_pred ------------C----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHHHH--h- Confidence 0 01123667776666666555544444444444445665555543334444433333211 0 Q ss_pred chhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccc--hHHHHHHHHHHHHHHHH Q lcl|NC_010808. 312 EPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQ--SGEAMKYKLFGLEQRTK 389 (512) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~--Sg~Ai~~~~~~l~~k~~ 389 (512) ...+.......+++.+++.++.......+.+..+.....|+..-++|....+....+. .+..++-. T Consensus 247 ---~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~--------- 314 (432) T protein:vir:81 247 ---GSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQ--------- 314 (432) T ss_pred ---hhhcCCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHH--------- Confidence 0111111222345556666655444455556677788889998899987765433222 22323221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEe--CCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCC Q lcl|NC_010808. 390 TKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVY--NRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQD 465 (512) Q Consensus 390 ~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f--~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d 465 (512) ....+...|..++..|...++.+-.... +.....+.| ..-+..|..+.++++.++ +|+++.-.++++++.-.= T Consensus 315 -~~~f~~~tl~P~~~~ie~~l~~kLl~~~--~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~ 391 (432) T protein:vir:81 315 -QLGFLTMTLSPWLRRIEQSIALNLLSPA--ERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKL 391 (432) T ss_pred -HHHHHHHHHHHHHHHHHHHHHhhccCcc--ccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC Confidence 1122334455555555444443221111 112334455 444667888999998887 689999999988865220 Q ss_pred H-HHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCC Q lcl|NC_010808. 466 P-ELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 511 (512) Q Consensus 466 ~-~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (512) + ...+-.+.. ..... ........+....+... .+++.+.+ T Consensus 392 ~g~~~~~~~~~---~~~pl--~~~~~~~~~~~~~~~~n-~~~~~~~~ 432 (432) T protein:vir:81 392 GGNAAVLTVQS---AMVPL--DSIGLQASPEPASGLGN-QQQDKVSK 432 (432) T ss_pred CCCcceEeecC---cccch--hhhccCCCCCCCCCCCC-cccccccC Confidence 0 001000000 00000 00001111110011111 11111111 No 158 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=97.90 E-value=1.2e-05 Score=47.56 Aligned_cols=411 Identities=10% Similarity=0.024 Sum_probs=170.9 Q ss_pred eeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHH--HHHHHhcccccccccc---------------ccccccccccee Q lcl|NC_010808. 27 VVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLK--VLSDYYEGKTKNLVEL---------------TRRKEEYMADNR 89 (512) Q Consensus 27 ~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~--~~~~yy~G~~~~~~~~---------------~~~~~~~~~~~r 89 (512) +.. ++.+- ..+.+ +.++.+|-. ..--+...+......+ ...........- T Consensus 1 ~~~-~~~~~--------~~~~~----~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 67 (441) T protein:vir:79 1 MHW-YNTDC--------YFVDF----KSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEA 67 (441) T ss_pred Ccc-ccCcc--------ccccc----cccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhh Confidence 000 00000 00000 001111000 0000000000000000 000000000000 Q ss_pred eecchHHHHHHHHHhhhhccCceecCCc-hhHHHHHHHHHh--ccC---hhHHHHHHHHHHHhCCeEEEEEEECCCCce- Q lcl|NC_010808. 90 VAHDYASYISDFINGYFLGNPIQCQDDD-KDVLEAIEAFND--LND---VESHNRSLGLDLSIYGKAYELMIRNQDDET- 162 (512) Q Consensus 90 i~~n~~~~iv~~~a~~l~g~~~~~~~~d-~~~~~~l~~~~~--~n~---~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~- 162 (512) +...-.--.|+.+++-+..-|+.+.-+. ......+..++. -|. ...+...+..+.+.+|.||+++.++..|++ T Consensus 68 l~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~ 147 (441) T protein:vir:79 68 IRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPM 147 (441) T ss_pred hccHHHHHHHHHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE Confidence 0111111235555555555666653221 112223444442 232 245667788889999999999999988986 Q ss_pred EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccc Q lcl|NC_010808. 163 RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFER 242 (512) Q Consensus 163 ~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (512) .+..++|..+.+..++. ..+.+. ....+.... .....|.+..+.+++.. ++ T Consensus 148 ~L~~i~~~~v~v~~d~~--g~~~~~-----~~~~~~~~~---~~~~~~~~~dvih~k~~-----------------~~-- 198 (441) T protein:vir:79 148 NLTFRKTSEIELKSDAR--GRLYYF-----HQRIDSNGN---NIERNVKFEDMLDIKFY-----------------SL-- 198 (441) T ss_pred EEEEEcCceeEEEECCC--ccEEEE-----EEEeccCCc---eeEEEEccccEEEeccC-----------------CC-- Confidence 57889999998887653 222111 111111111 11123444444443210 00 Q ss_pred cceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcC-Chhhhhhhhhccccccchhhhhhccc Q lcl|NC_010808. 243 MPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSL-DPDEVKKQKEANVLFLEPTVYENRDT 321 (512) Q Consensus 243 vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (512) +...|.|.++.+...++....+..-..+.+.-.+.|-.++.-.... +++.....+..-.-... + ..+... T Consensus 199 -------dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~-G-~~nag~ 269 (441) T protein:vir:79 199 -------DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFS-G-TKQAGK 269 (441) T ss_pred -------CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhc-C-ccccCc Confidence 1124777777767666655554444444555566677666522222 22222222211000000 0 001111 Q ss_pred ccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 322 GIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRR 401 (512) Q Consensus 322 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~ 401 (512) ....+++.+++.++.......+.+..+.....|+..-++|....+...++.|..... ..|...|.. T Consensus 270 ~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~--------------~~~~~tl~P 335 (441) T protein:vir:79 270 VVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDAN--------------LDYLSTLKP 335 (441) T ss_pred ceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHH--------------HHHHHHHHH Confidence 122345556666654444445566677778889888899877665322222222111 112234444 Q ss_pred HHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHH Q lcl|NC_010808. 402 RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFF--QDPELEVKKIEEDE 477 (512) Q Consensus 402 ~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v--~d~~~E~~ri~~E~ 477 (512) +++.|..-++.+-.. ......+++.+..-+-.|..+.++.+.++ +|+++...++++++.- ++.+..+-.+... T Consensus 336 ~~~~ie~eln~kl~~--~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n- 412 (441) T protein:vir:79 336 YITCVCAELNFKFND--EYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLN- 412 (441) T ss_pred HHHHHHHHHhhhccc--cccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccc- Confidence 444444444332111 11111233333444567888888888886 7899999988887652 2222111100000 Q ss_pred HHHHHHHHhhcc-cCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 478 KESIKKAQKGIY-KDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 478 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ..... .+..+.+..+..+......|+.| T Consensus 413 -------~~~~~~~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:79 413 -------HVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred -------ccccccccccccccccccccccCCCCCCC Confidence 00000 00011111111222223333333 No 159 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=97.90 E-value=1.2e-05 Score=47.56 Aligned_cols=411 Identities=10% Similarity=0.024 Sum_probs=170.9 Q ss_pred eeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHH--HHHHHhcccccccccc---------------ccccccccccee Q lcl|NC_010808. 27 VVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLK--VLSDYYEGKTKNLVEL---------------TRRKEEYMADNR 89 (512) Q Consensus 27 ~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~--~~~~yy~G~~~~~~~~---------------~~~~~~~~~~~r 89 (512) +.. ++.+- ..+.+ +.++.+|-. ..--+...+......+ ...........- T Consensus 1 ~~~-~~~~~--------~~~~~----~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 67 (441) T protein:vir:94 1 MHW-YNTDC--------YFVDF----KSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEA 67 (441) T ss_pred Ccc-ccCcc--------ccccc----cccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhh Confidence 000 00000 00000 001111000 0000000000000000 000000000000 Q ss_pred eecchHHHHHHHHHhhhhccCceecCCc-hhHHHHHHHHHh--ccC---hhHHHHHHHHHHHhCCeEEEEEEECCCCce- Q lcl|NC_010808. 90 VAHDYASYISDFINGYFLGNPIQCQDDD-KDVLEAIEAFND--LND---VESHNRSLGLDLSIYGKAYELMIRNQDDET- 162 (512) Q Consensus 90 i~~n~~~~iv~~~a~~l~g~~~~~~~~d-~~~~~~l~~~~~--~n~---~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~- 162 (512) +...-.--.|+.+++-+..-|+.+.-+. ......+..++. -|. ...+...+..+.+.+|.||+++.++..|++ T Consensus 68 l~~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~ 147 (441) T protein:vir:94 68 IRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPM 147 (441) T ss_pred hccHHHHHHHHHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE Confidence 0111111235555555555666653221 112223444442 232 245667788889999999999999988986 Q ss_pred EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccc Q lcl|NC_010808. 163 RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFER 242 (512) Q Consensus 163 ~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (512) .+..++|..+.+..++. ..+.+. ....+.... .....|.+..+.+++.. ++ T Consensus 148 ~L~~i~~~~v~v~~d~~--g~~~~~-----~~~~~~~~~---~~~~~~~~~dvih~k~~-----------------~~-- 198 (441) T protein:vir:94 148 NLTFRKTSEIELKSDAR--GRLYYF-----HQRIDSNGN---NIERNVKFEDMLDIKFY-----------------SL-- 198 (441) T ss_pred EEEEEcCceeEEEECCC--ccEEEE-----EEEeccCCc---eeEEEEccccEEEeccC-----------------CC-- Confidence 57889999998887653 222111 111111111 11123444444443210 00 Q ss_pred cceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcC-Chhhhhhhhhccccccchhhhhhccc Q lcl|NC_010808. 243 MPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSL-DPDEVKKQKEANVLFLEPTVYENRDT 321 (512) Q Consensus 243 vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (512) +...|.|.++.+...++....+..-..+.+.-.+.|-.++.-.... +++.....+..-.-... + ..+... T Consensus 199 -------dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~-G-~~nag~ 269 (441) T protein:vir:94 199 -------DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFS-G-TKQAGK 269 (441) T ss_pred -------CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhc-C-ccccCc Confidence 1124777777767666655554444444555566677666522222 22222222211000000 0 001111 Q ss_pred ccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 322 GIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRR 401 (512) Q Consensus 322 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~ 401 (512) ....+++.+++.++.......+.+..+.....|+..-++|....+...++.|..... ..|...|.. T Consensus 270 ~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~--------------~~~~~tl~P 335 (441) T protein:vir:94 270 VVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDAN--------------LDYLSTLKP 335 (441) T ss_pred ceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHH--------------HHHHHHHHH Confidence 122345556666654444445566677778889888899877665322222222111 112234444 Q ss_pred HHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHH Q lcl|NC_010808. 402 RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFF--QDPELEVKKIEEDE 477 (512) Q Consensus 402 ~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v--~d~~~E~~ri~~E~ 477 (512) +++.|..-++.+-.. ......+++.+..-+-.|..+.++.+.++ +|+++...++++++.- ++.+..+-.+... T Consensus 336 ~~~~ie~eln~kl~~--~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n- 412 (441) T protein:vir:94 336 YITCVCAELNFKFND--EYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLN- 412 (441) T ss_pred HHHHHHHHHhhhccc--cccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccc- Confidence 444444444332111 11111233333444567888888888886 7899999988887652 2222111100000 Q ss_pred HHHHHHHHhhcc-cCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 478 KESIKKAQKGIY-KDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 478 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ..... .+..+.+..+..+......|+.| T Consensus 413 -------~~~~~~~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:94 413 -------HVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred -------ccccccccccccccccccccccCCCCCCC Confidence 00000 00011111111222223333333 No 160 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=97.86 E-value=1.4e-05 Score=47.18 Aligned_cols=451 Identities=11% Similarity=0.025 Sum_probs=191.4 Q ss_pred ccchhH-HhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhcc Q lcl|NC_010808. 31 YDGTES-DLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGN 109 (512) Q Consensus 31 ~~~~~~-~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~ 109 (512) +.-+.+ ...+..+...+.++.++..-..+++.+.+|..-.-.. . ..........|+.-+-+...++.+++.|++- T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~---~-~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 76 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFP---K-DSDNASTDYQTPWQAVGARGLNNLASKLMLA 76 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccC---C-CCCcccccccccccccHHHHHHHHHHHHHHh Confidence 111111 1111222333333333333455666777775432110 0 1111112223566677778888888877652 Q ss_pred --Cc----eecCCch--------------------hHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceE Q lcl|NC_010808. 110 --PI----QCQDDDK--------------------DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETR 163 (512) Q Consensus 110 --~~----~~~~~d~--------------------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~ 163 (512) |. ++...+. .....+...+..++|.....++.++..++|.+.+++-.+.++.++ T Consensus 77 ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~ 156 (536) T protein:vir:21 77 LFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYN 156 (536) T ss_pred hcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCcee Confidence 21 1222211 122345556777889999999999999999988877555444443 Q ss_pred -EEEEccceeEEEEeCCCCceeEEEEEEeeeeeec------------cCCcceEEEEEEEc-----CCc--EEEEEecCC Q lcl|NC_010808. 164 -LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID------------KTDEDEVFTVDLFT-----SHG--VYRYLTSRT 223 (512) Q Consensus 164 -i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~------------~~~~~~~~~~~~yt-----~~~--~~~~~~~~~ 223 (512) ++.++-.+ |.+-.+. .+++...+|.++..... ....+....+++|+ ++. ...|. ... T Consensus 157 ~f~~~pl~~-~~v~~d~-~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~-e~~ 233 (536) T protein:vir:21 157 PMKLYRLSS-YVVQRDA-FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRYE-EVE 233 (536) T ss_pred eEEEEEcCe-EEEeeCC-CCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecCCCcEEEEe-ccC Confidence 44544444 4444332 45666666654432110 00111112233332 111 11111 110 Q ss_pred ccccccccccccccccccccceEeec-----CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeee-ecCCcCCh Q lcl|NC_010808. 224 NGLKLTPRENGFESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI-KGNLSLDP 297 (512) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~vPvv~~~-----n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~-~g~~~~~~ 297 (512) + ..+ .......+|..+|++.++ .+.+|+|..+...+-+..++.+.-...........+.+.+ .+- .... T Consensus 234 g-~~v---~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g-~~~~ 308 (536) T protein:vir:21 234 G-MEV---QGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAG-ITQP 308 (536) T ss_pred C-eee---ccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCccc-ccch Confidence 0 000 111223356677877643 4568999999999999999987666666555555544332 111 0111 Q ss_pred hhhhhhhhccccccchhhhhhcccccCCCCCccee--EEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHH Q lcl|NC_010808. 298 DEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGG--YIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGE 375 (512) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~ 375 (512) ..+. .++ .+.+..+...+++ .+....+.......++.++..|...-..-. .....+...||. T Consensus 309 ~~~~---~~~------------~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-l~~~~~~r~TAt 372 (536) T protein:vir:21 309 RRLT---KAQ------------TGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-AVQRTGERVTAE 372 (536) T ss_pred hhhc---cCC------------CcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhh-cccCCCCCccHH Confidence 1110 010 0001111112222 233344556666777777776644332211 111223446777 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHH-------HH Q lcl|NC_010808. 376 AMKYKLFGLEQRTKT-KEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYI-------DS 447 (512) Q Consensus 376 Ai~~~~~~l~~k~~~-~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~-------kl 447 (512) .+......+.+...- ..++-.+.|.-+++.++.++...+..... .-..+++.+..++. .....+.+. .+ T Consensus 373 EV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~-p~~~v~~~~vs~l~--~l~r~~~~~~l~~~~~~l 449 (536) T protein:vir:21 373 EIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPEL-PKEAVEPTISTGLE--AIGRGQDLDKLERCVTAW 449 (536) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCC-ChhhccceEEecHH--HHHHHHHHHHHHHHHHHH Confidence 766653333332221 22222333343444445555444433211 11234555544432 222222221 12 Q ss_pred hccCC--------hHHHHH----hCCCCC----CHHHHHHHHHHHHHHHHHHHHhhcc-cCCCCCCC---CCCCCCCcCc Q lcl|NC_010808. 448 GGKIS--------QTTLMS----LFSFFQ----DPELEVKKIEEDEKESIKKAQKGIY-KDPRDIND---DEQDDDTKDT 507 (512) Q Consensus 448 ~g~~s--------~et~~~----~~~~v~----d~~~E~~ri~~E~~~~~~~~~~~~~-~~~~~~~~---~~~~~~~~~~ 507 (512) +++-| ...++. .+|... -.++|++++.++++......+.... ........ .+.-..--+. T Consensus 450 a~~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 529 (536) T protein:vir:21 450 AALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADS 529 (536) T ss_pred HhhchhhhcccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhHHhhhhc Confidence 22212 222332 233211 2457777777665443332221111 00000000 0000001111 Q ss_pred ccCCC Q lcl|NC_010808. 508 VDKKE 512 (512) Q Consensus 508 ~~~~e 512 (512) ..-++ T Consensus 530 ~g~~~ 534 (536) T protein:vir:21 530 VGLQP 534 (536) T ss_pred cccCC Confidence 11111 No 161 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=97.85 E-value=1.5e-05 Score=47.11 Aligned_cols=448 Identities=9% Similarity=0.017 Sum_probs=169.4 Q ss_pred cchhhccccccC---CCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-----cccccccc Q lcl|NC_010808. 11 TDLRENRNYLFN---DEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNL-----VELTRRKE 82 (512) Q Consensus 11 ~~~~~~~~~~f~---~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~-----~~~~~~~~ 82 (512) |.+..-...-|. +..+. ..+..++=.+.|.+....-+.. ..+.+.|-.-.. ++...... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~g~~~~~~~~ 67 (535) T protein:vir:10 1 MAILKDLRNAFSLSNKKSTS----------YIELGDYDKDIVNKAIRPGRAS---ARDTVDGIDIADGNVAGQYSVASIS 67 (535) T ss_pred ChhhHHHHHHHHhhhhhhhh----------hHHHhhhhHHHHHhhhhhhhhh---hhccccccccccCCcccccccCccc Confidence 322211111111 10000 0000000001111111111111 122222311000 00000000 Q ss_pred cccc--c-eee--ecchHHHHHHHHHhhhh-------------ccCceec-----CCch--hHHHHHHHHHhc--cCh-- Q lcl|NC_010808. 83 EYMA--D-NRV--AHDYASYISDFINGYFL-------------GNPIQCQ-----DDDK--DVLEAIEAFNDL--NDV-- 133 (512) Q Consensus 83 ~~~~--~-~ri--~~n~~~~iv~~~a~~l~-------------g~~~~~~-----~~d~--~~~~~l~~~~~~--n~~-- 133 (512) .... . .+. .....+.++++.+..+. +=++.+. .... .....|..++.. |.+ T Consensus 68 ~~~~~~~l~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~ 147 (535) T protein:vir:10 68 DVLSTKKLLKAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYE 147 (535) T ss_pred cccCHHHHHHHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCC Confidence 0000 0 011 12233444444443321 2233321 1111 122335555532 332 Q ss_pred -----hHHHHHHHHHHHhCC-eEEEEEEECCCCceE-EEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEE Q lcl|NC_010808. 134 -----ESHNRSLGLDLSIYG-KAYELMIRNQDDETR-LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFT 206 (512) Q Consensus 134 -----~~~~~~~~~~~~~~G-~a~~~v~~d~~g~~~-i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~ 206 (512) ..+...+..+++.+| .+|+++..+..|++. +..++|..+.+..+........ ++|.... ... T Consensus 148 ~~~~~~~~~~~lv~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p~~V~v~~d~~~~~~~~---~~~~~~~-----~~~--- 216 (535) T protein:vir:10 148 WRDTFPRLLTKIINDMYVQDQINIERIFKNDSNELDHFNAVDASKVVISYSPRSKDQPR---KFEQFVS-----ETK--- 216 (535) T ss_pred hhHHHHHHHHHHHHHHHhhCCceEEEEEECCCCcEEEEEEeCCceeEEEEcCccccCce---EEEEEec-----Cce--- Confidence 234556677777776 589999898889875 7889999988877654322111 1121110 000 Q ss_pred EEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_010808. 207 VDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAM 286 (512) Q Consensus 207 ~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~ 286 (512) ...+.++.+.+++..... .......|.|.++.+...|.....+..-..+.+.-.+.|- T Consensus 217 ~~~~~~~eiih~~~~~~~----------------------~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~ 274 (535) T protein:vir:10 217 SVKFSERNLTFINYWNLS----------------------DTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTR 274 (535) T ss_pred eEEECcccEEEEeccCCC----------------------CcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcc Confidence 112344444444321100 0001124777777777777666665555555555556665 Q ss_pred eeee--cCC--cCChhhhhhhhhccccccchhhhhhcccc--cCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010808. 287 LLIK--GNL--SLDPDEVKKQKEANVLFLEPTVYENRDTG--IETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNT 360 (512) Q Consensus 287 lv~~--g~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~ 360 (512) .++. +.. ....+..+.++..- ............ +..+++.++.-++.......+.+..+...+.|...-++ T Consensus 275 giL~~~~~~~~~ls~e~~e~lk~~~---~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~eIa~afgV 351 (535) T protein:vir:10 275 GILVIDQDGDAQANQMMLAGIRRQW---TSQGSGLGGAWKIPILAAKDAKFVNMTQNSRDMEFDKFLNFMIYDTAAIFQM 351 (535) T ss_pred EEEEecCCCCcccCHHHHHHHHHHH---HHHhcCcccccccccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCC Confidence 5544 211 12223333222210 000000001111 11223444444444334455566677788888888899 Q ss_pred cccccccccc-cchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCH Q lcl|NC_010808. 361 PNMKDDNFSG-TQSGEA--MKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSL 437 (512) Q Consensus 361 p~~~~~~~~~-n~Sg~A--i~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~ 437 (512) |....+-... +-|... -...+.... -......+...|..+++.|...++..-... .+ ..+.+.|+.....|. T Consensus 352 Pp~~lG~~~~at~sn~~~~~~~~~~s~~--E~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~--~~-~~~~f~f~~l~~~d~ 426 (535) T protein:vir:10 352 QPEEINFPNNGGSTGKSGTKSVNEGSTA--KAKLESSKDKGLTPLLSFIEQVINDKIMRY--VD-TDYRFSFTLGDAQDK 426 (535) T ss_pred CHHHhccccCcccccchhhhhhhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhhcccc--cC-CeEEEEeccccccCH Confidence 9776654321 111111 111111110 012222334556666666655554332111 11 246778877777777 Q ss_pred HHHHHHHHHH-hccCChHHHHHhCCCCC--CHHHHHHHHHHHHHHHH--HHHHhhccc-CC---CCCCCCCC-------- Q lcl|NC_010808. 438 IEELKAYIDS-GGKISQTTLMSLFSFFQ--DPELEVKKIEEDEKESI--KKAQKGIYK-DP---RDINDDEQ-------- 500 (512) Q Consensus 438 ~~~~~~~~kl-~g~~s~et~~~~~~~v~--d~~~E~~ri~~E~~~~~--~~~~~~~~~-~~---~~~~~~~~-------- 500 (512) .+.+++.... .|+++.-.++++++.-. .-+.-.-.+..+.--.. ......... ++ ...+.+.+ T Consensus 427 ~~r~~~~~~~~~g~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~ 506 (535) T protein:vir:10 427 LQEEQVWKLKLANGYFINEYRKDHGLKTVDGLDVPGFIGSAENFINATGFGQPNVPDSSDDSGSTLGERERQERIQHSKD 506 (535) T ss_pred HHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCccccccccchhhcccccccccccCCCCCCCccccCCccccCcccccccc Confidence 7766655432 67789988888875422 10100000000000000 000000000 00 00000000 Q ss_pred --------------CCCCcCcccCCC Q lcl|NC_010808. 501 --------------DDDTKDTVDKKE 512 (512) Q Consensus 501 --------------~~~~~~~~~~~e 512 (512) +.+.++...+++ T Consensus 507 ~~~g~~~~~~~~~~~~~~~~~~~~~~ 532 (535) T protein:vir:10 507 YEKGKDDPKSPLPKPSESDDVSNNED 532 (535) T ss_pred cccCCCCCCCCCCcCCCCCccccccc Confidence 000111111111 No 162 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=97.85 E-value=1.5e-05 Score=47.10 Aligned_cols=395 Identities=11% Similarity=0.078 Sum_probs=169.5 Q ss_pred HHHHHHHHHHHHHHHH--HHHHHhccccccccccccccc-ccc---cceeeecchHHHHHHHHHhhhhccCceecCCchh Q lcl|NC_010808. 46 SKYIEHHMDYQRPRLK--VLSDYYEGKTKNLVELTRRKE-EYM---ADNRVAHDYASYISDFINGYFLGNPIQCQDDDKD 119 (512) Q Consensus 46 ~~~i~~~~~~~~~r~~--~~~~yy~G~~~~~~~~~~~~~-~~~---~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~ 119 (512) .+|+.+.... .|.+ ...++.......+........ ... ...-+..+.....|+.+++-+..-|+.+--..+. T Consensus 1 m~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~~~~~~~~ 78 (412) T protein:vir:26 1 MNVIAKENIV--TRIKKKLIDNWIDQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKV 78 (412) T ss_pred Cccchhhhhh--hhhhhhHhhhhhcccccccccccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCceeEeecccc Confidence 1222110000 0111 111111100000000000000 000 0111223444555666666666667765322223 Q ss_pred HHHHHHHHHh--cc---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeee Q lcl|NC_010808. 120 VLEAIEAFND--LN---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRT 193 (512) Q Consensus 120 ~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~ 193 (512) ....+..++. -| ....+...+..+++.+|.+|+++.++..|++ .+..++|..+.+..++.. +. ++ |.. T Consensus 79 ~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~-~~-~~----y~~ 152 (412) T protein:vir:26 79 VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQS-RE-LY----YSI 152 (412) T ss_pred ccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCC-cE-EE----EEE Confidence 3333444443 23 2344567788899999999999999999986 577788998888876532 21 11 111 Q ss_pred eeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHH Q lcl|NC_010808. 194 KPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (512) Q Consensus 194 ~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s 273 (512) .... . . . ..+.++.+.+++... ..+.-.|.|.++-+...++..+.+.. T Consensus 153 ~~~~---g-~--~-~~~~~~evih~~~~~-------------------------~~~~~~G~s~i~~~~~~i~~~~a~~~ 200 (412) T protein:vir:26 153 HAAT---G-N--K-LIVHNMDMLHFKHIV-------------------------ASNMVQGISPIDVLKNTTDFDNAVRT 200 (412) T ss_pred EcCC---c-e--E-EEEccccEEEeCCCC-------------------------CCCCcccccHHHHHHHHHHHHHHHHH Confidence 1110 0 0 1 124455555442210 00112466766655555554333321 Q ss_pred HHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHH Q lcl|NC_010808. 274 DTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSD 353 (512) Q Consensus 274 ~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~ 353 (512) . +.......+-.++......+++..+..++.-.- ...+.......+++.++..++.......+.+..+..... T Consensus 201 ~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~-----~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~ 273 (412) T protein:vir:26 201 F--NLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQ-----YYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRER 273 (412) T ss_pred H--HHHhcCCCCceEEecCCCCCHHHHHHHHHHHHH-----HhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHH Confidence 1 122222233334433334444444433332110 011111122334555666555433344555666667788 Q ss_pred HHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEe--CC Q lcl|NC_010808. 354 IHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVY--NR 431 (512) Q Consensus 354 i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f--~~ 431 (512) |+..-++|....+...+ .+...++ ......+...|.-++..|...++..-...... .....+.| .. T Consensus 274 Ia~afgVPp~~lg~~~~-~~~sn~e----------~~~~~f~~~~l~P~~~~ie~~ln~kLl~~~~~-~~~~~~~fd~~~ 341 (412) T protein:vir:26 274 VANVFQLPSVFLNARSN-TNFAKNE----------ELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDR-EKNRYFKFNVKS 341 (412) T ss_pred HHHHhCCCHHHhCCCCC-CCcccHH----------HHHHHHHHHHHHHHHHHHHHHHHhhcCCcccc-cCcceEEeechh Confidence 88888898766653221 1111111 11122334445555555555454322111111 11233444 45 Q ss_pred CCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCccc Q lcl|NC_010808. 432 NLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVD 509 (512) Q Consensus 432 ~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (512) -+..|..+.++++.++ +|+++.-.+++.++.-.-+. -+++. + .... .+-+...+.+....+.+.+ T Consensus 342 l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g--gD~~~------~--~~n~---~~~~~~~~~~~~~~gG~~n 408 (412) T protein:vir:26 342 YLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEG--GDKPL------I--SGDL---YPIDTPLELRKSLKGGDKN 408 (412) T ss_pred hhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCeee------e--cccc---cccccchhhcccccCCCCC Confidence 5667899999998887 68999999988886532110 00000 0 0000 0000000000001111111 Q ss_pred CCC Q lcl|NC_010808. 510 KKE 512 (512) Q Consensus 510 ~~e 512 (512) ..| T Consensus 409 ~~e 411 (412) T protein:vir:26 409 VNE 411 (412) T ss_pred cCC Confidence 111 No 163 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=97.85 E-value=1.5e-05 Score=47.09 Aligned_cols=440 Identities=10% Similarity=0.048 Sum_probs=189.8 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhcc--Cc----- Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGN--PI----- 111 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~--~~----- 111 (512) .+ .+...+.+..++..-..+++.+.+|..-.-.. ... ......+...++.-+-+...++.+++-|.+- |+ T Consensus 1 m~-~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~-~~~-~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 77 (522) T protein:vir:10 1 MK-ARERYNQLTTARQMFLDKAVECSELTLPYLID-DDI-SSRPNHKSLTVPWQSVGAKCCVTLAAKLMLAVLPPQTSFF 77 (522) T ss_pred Cc-hHHHHHHHHHHhhHHHHHHHHHHHHhhhcccC-CCC-CCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCccc Confidence 12 23334444444444456677777775321110 000 1111112223566677788888888887652 22 Q ss_pred eecCCchh-------------------HHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEcccee Q lcl|NC_010808. 112 QCQDDDKD-------------------VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMST 172 (512) Q Consensus 112 ~~~~~d~~-------------------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~ 172 (512) ++...+.+ ....+...+..++|.....++.++..++|.+.++ .++++ +++++-.+ T Consensus 78 ~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly--~~~~~---~~~~pl~~- 151 (522) T protein:vir:10 78 KLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIF--MGKDG---LKTFPLTR- 151 (522) T ss_pred cccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEE--EcCCC---ceEEEcce- Confidence 22222211 1223344466788999999999999999998865 45554 33443333 Q ss_pred EEEEeCCCCceeEEEEEEeeeeee--------c------cCCcceEEEEEEEcC-----Cc-EEEEEecCCccccccccc Q lcl|NC_010808. 173 FVIYDNTIERNSIAGVRYLRTKPI--------D------KTDEDEVFTVDLFTS-----HG-VYRYLTSRTNGLKLTPRE 232 (512) Q Consensus 173 ~~i~d~~~~~~~~~~v~~~~~~~~--------~------~~~~~~~~~~~~yt~-----~~-~~~~~~~~~~~~~~~~~~ 232 (512) |.+--+. .+++...+|.++.... + ....+....+++|+. +. -+.+.....+.. + . T Consensus 152 y~v~~d~-~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~-~---~ 226 (522) T protein:vir:10 152 YVINRDG-DGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKSSGRWVWHQEAFDKI-I---P 226 (522) T ss_pred EEEeeCC-CCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeeccCCceEEEEccCCcc-c---c Confidence 4444333 4566666665443210 0 000111122333321 10 011111111000 0 0 Q ss_pred cccccccccccceEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhcc Q lcl|NC_010808. 233 NGFESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEAN 307 (512) Q Consensus 233 ~~~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~ 307 (512) .....-+|..+|++.+ ..+.+|+|..+...+-+..++.+.-......+....|.+.+.-........+. .++ T Consensus 227 ~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~l~---~~~ 303 (522) T protein:vir:10 227 DSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPATIA---KAG 303 (522) T ss_pred ccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeecccccccccccc---CCC Confidence 0112235666777654 34568999999999999999998888888888888888765321111111111 011 Q ss_pred ccccchhhhhhcccccCCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHH Q lcl|NC_010808. 308 VLFLEPTVYENRDTGIETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLE 385 (512) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~ 385 (512) . ..+..+...++..+. ...+.......++.++..|...-.. .....+...+|..+......+. T Consensus 304 ~------------~~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~---~~~~d~~rvTAtEV~~r~~E~~ 368 (522) T protein:vir:10 304 N------------GAIVQGRPEDVAVIQVGKTADFSTAANMATAIEKRLLEAFLV---MNVRNAERVTAEEVRLTQLELE 368 (522) T ss_pred C------------cceecCCCccceeecccccccchHHHHHHHHHHHHHHHHHhh---ccCCCCCCCCHHHHHHHHHHHH Confidence 0 111112222333332 3345666677777777777654321 1122234567777766533333 Q ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCc-cccc-ceeeEEeCCCCCcCHHHHHHHH----HHHhccCChHHH-- Q lcl|NC_010808. 386 QRTKT-KEGLFTKGLRRRAKLLETILKNTRSIDA-NKDF-NTVRYVYNRNLPKSLIEELKAY----IDSGGKISQTTL-- 456 (512) Q Consensus 386 ~k~~~-~~~~~~~~l~~~~~li~~~l~~~~~~~~-~~d~-~~i~i~f~~~~p~d~~~~~~~~----~kl~g~~s~et~-- 456 (512) +...- ..++-.+.+.-+++-++.++...+.... +.+. ....+++..++-+ .+.++.+ ..++.++..+.+ T Consensus 369 ~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~~v~~is~Lar--aq~~~~l~~~~~~i~~~~~p~~~~~ 446 (522) T protein:vir:10 369 QQLGGIFSLLVIEFLIPYLNRTLLVLQRSNQIPKLPKDIVRPTIVAGVNALGR--GQDRESLTAFVGTIAQTLGPEALMQ 446 (522) T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccccccccccchhHHHH--HHHHHHHHHHHHHHHHhhCchhhhh Confidence 32221 1111122233333333444444433221 1111 1222334333322 1112221 112222222222 Q ss_pred -------HHhC---CCCC-----CHHHHHHHHHHHHHHHHHHHHhhcccC---CCCCCCCCCC--CCCcCcccCCC Q lcl|NC_010808. 457 -------MSLF---SFFQ-----DPELEVKKIEEDEKESIKKAQKGIYKD---PRDINDDEQD--DDTKDTVDKKE 512 (512) Q Consensus 457 -------~~~~---~~v~-----d~~~E~~ri~~E~~~~~~~~~~~~~~~---~~~~~~~~~~--~~~~~~~~~~e 512 (512) +..+ -+|+ ..++|++.+++++++....++...... .....+.... --..-...++| T Consensus 447 ~id~d~~~~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (522) T protein:vir:10 447 YLNPLEAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQSLVDQAGQMTGSPLMDPTKNPQLMDEEQPPMEE 522 (522) T ss_pred cCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCccccHHHHHHhCCCCCC Confidence 2222 1222 234566655555544433222111110 0001010000 01111222222 No 164 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=97.85 E-value=1.5e-05 Score=47.05 Aligned_cols=428 Identities=13% Similarity=0.090 Sum_probs=163.6 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchh--HHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTE--SDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELT 78 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~--~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~ 78 (512) +|.+..|-..|+.-. .+..+...+.+.++... ..+.+ ..++..+..-.......+.-...+... T Consensus 55 ~a~~~~~~~~~~~~~---~~~~~~~~~~~~~~l~~~l~~~~~--n~i~~~~I~t~~~~vA~~~~~~~~~~~--------- 120 (563) T protein:vir:99 55 QAYAEPFIEMMDTNP---EFRDKRSYMKNEHNLHDVLKKFGN--NPILNAIILTRSNQVAMYCQPARYSEK--------- 120 (563) T ss_pred CcchhhhHhhhcccc---cccccccCCCCcccHHHHHHHhhc--chHHHHHHHHHHHHHHHHhhhhhhhcc--------- Confidence 555555554443221 11111111111111000 00000 011111101111111111111011000 Q ss_pred ccccccccceeeecchHHHHHHHHHhhhhccCcee-----cCCch--hHHHHHHHHHh-----c----cChhHHHHHHHH Q lcl|NC_010808. 79 RRKEEYMADNRVAHDYASYISDFINGYFLGNPIQC-----QDDDK--DVLEAIEAFND-----L----NDVESHNRSLGL 142 (512) Q Consensus 79 ~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~-----~~~d~--~~~~~l~~~~~-----~----n~~~~~~~~~~~ 142 (512) . .|=++.+ +..+. .....|..++. . ..+..+...+.. T Consensus 121 -----------------------~----~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~ 173 (563) T protein:vir:99 121 -----------------------G----LGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVR 173 (563) T ss_pred -----------------------c----ccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHH Confidence 0 0000000 00000 01111222211 0 124466777888 Q ss_pred HHHhCCeEEEEEE--ECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEE Q lcl|NC_010808. 143 DLSIYGKAYELMI--RNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYL 219 (512) Q Consensus 143 ~~~~~G~a~~~v~--~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~ 219 (512) +.+.+|.+|+++. .+..|++ .+..++|..+.+..+.... ......+|+.... + . ....+.+..++++. T Consensus 174 ~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~-~~~~~~~y~~~~~--g---~---~~~~~~~~evI~~~ 244 (563) T protein:vir:99 174 DTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGK-IIKGGKRFVQVVD--K---R---VVASFTSRELAMGI 244 (563) T ss_pred HHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCc-eeccceeEEEEeC--C---c---eeEEecCcceEEEe Confidence 9999999988765 4556665 4778899998888765321 1111122221110 0 0 11122333332221 Q ss_pred ecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCcCCh Q lcl|NC_010808. 220 TSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIK--GNLSLDP 297 (512) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~--g~~~~~~ 297 (512) ... -........|.|.+..+...+.....+..-..+.+.-.+.|-.+++ |....++ T Consensus 245 ~~~----------------------~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~ 302 (563) T protein:vir:99 245 RNP----------------------RTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQ 302 (563) T ss_pred ccC----------------------CCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCH Confidence 110 0000012357777777776666555555555555565666665543 4333344 Q ss_pred hhhhhhhhccccccchhhhhhccc--ccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccc----- Q lcl|NC_010808. 298 DEVKKQKEANVLFLEPTVYENRDT--GIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG----- 370 (512) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~----- 370 (512) +..+.++..-.-... ...... ..-.+++.++.-++.+.....+.+..+.....|+..-++|....+-... T Consensus 303 e~~~~~~~~~~~~~~---G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~ 379 (563) T protein:vir:99 303 HALENFKREWKSSLS---GINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATG 379 (563) T ss_pred HHHHHHHHHHHHHhc---cccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccc Confidence 444433322110000 000111 1123455566666554455566777888888899999999766643211 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--h Q lcl|NC_010808. 371 TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--G 448 (512) Q Consensus 371 n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~ 448 (512) ...|..+... .+ .......+...|..+++.|...++..-... . ...+.+.|.+.-+.+..+..+. .++ + T Consensus 380 ~~~~ss~~~s--n~---e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~--~-~~~~~~~f~r~D~~~~~e~~~~-~~~~~~ 450 (563) T protein:vir:99 380 SKGGSTLNEA--DP---GKKQQQSQNKGLQPLLRFIEDLVNRHIISE--Y-GDKYTFQFVGGDTKSATDKLNI-LKLETQ 450 (563) T ss_pred cccccchhhc--cH---HHHHHHHHHHHHHHHHHHHHHHHHhhhchh--c-ccccEEEeccCCHHHHHHHHHH-HHHhcC Confidence 1111111100 00 111223344455555555554444321111 1 1245667876655555444432 333 5 Q ss_pred ccCChHHHHHhCCCCC--CHHH--------HHHHH----HHHHHHHHHHHHhhcccCCCCCCCCCCCCC---CcCcccCC Q lcl|NC_010808. 449 GKISQTTLMSLFSFFQ--DPEL--------EVKKI----EEDEKESIKKAQKGIYKDPRDINDDEQDDD---TKDTVDKK 511 (512) Q Consensus 449 g~~s~et~~~~~~~v~--d~~~--------E~~ri----~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 511 (512) |+++.-.++++++.-. +-+. -+... ..+.+...............+....+.+.+ ..++.+.+ T Consensus 451 G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (563) T protein:vir:99 451 IFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIG 530 (563) T ss_pred CccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCccccc Confidence 8999988888775422 1000 00000 000000000000000111111111111111 11111111 Q ss_pred C Q lcl|NC_010808. 512 E 512 (512) Q Consensus 512 e 512 (512) + T Consensus 531 ~ 531 (563) T protein:vir:99 531 T 531 (563) T ss_pred c Confidence 1 No 165 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=97.85 E-value=1.5e-05 Score=47.05 Aligned_cols=428 Identities=13% Similarity=0.090 Sum_probs=163.6 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchh--HHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTE--SDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELT 78 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~--~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~ 78 (512) +|.+..|-..|+.-. .+..+...+.+.++... ..+.+ ..++..+..-.......+.-...+... T Consensus 55 ~a~~~~~~~~~~~~~---~~~~~~~~~~~~~~l~~~l~~~~~--n~i~~~~I~t~~~~vA~~~~~~~~~~~--------- 120 (563) T protein:vir:95 55 QAYAEPFIEMMDTNP---EFRDKRSYMKNEHNLHDVLKKFGN--NPILNAIILTRSNQVAMYCQPARYSEK--------- 120 (563) T ss_pred CcchhhhHhhhcccc---cccccccCCCCcccHHHHHHHhhc--chHHHHHHHHHHHHHHHHhhhhhhhcc--------- Confidence 555555554443221 11111111111111000 00000 011111101111111111111011000 Q ss_pred ccccccccceeeecchHHHHHHHHHhhhhccCcee-----cCCch--hHHHHHHHHHh-----c----cChhHHHHHHHH Q lcl|NC_010808. 79 RRKEEYMADNRVAHDYASYISDFINGYFLGNPIQC-----QDDDK--DVLEAIEAFND-----L----NDVESHNRSLGL 142 (512) Q Consensus 79 ~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~-----~~~d~--~~~~~l~~~~~-----~----n~~~~~~~~~~~ 142 (512) . .|=++.+ +..+. .....|..++. . ..+..+...+.. T Consensus 121 -----------------------~----~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~ 173 (563) T protein:vir:95 121 -----------------------G----LGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVR 173 (563) T ss_pred -----------------------c----ccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHH Confidence 0 0000000 00000 01111222211 0 124466777888 Q ss_pred HHHhCCeEEEEEE--ECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEE Q lcl|NC_010808. 143 DLSIYGKAYELMI--RNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYL 219 (512) Q Consensus 143 ~~~~~G~a~~~v~--~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~ 219 (512) +.+.+|.+|+++. .+..|++ .+..++|..+.+..+.... ......+|+.... + . ....+.+..++++. T Consensus 174 ~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~-~~~~~~~y~~~~~--g---~---~~~~~~~~evI~~~ 244 (563) T protein:vir:95 174 DTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGK-IIKGGKRFVQVVD--K---R---VVASFTSRELAMGI 244 (563) T ss_pred HHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCc-eeccceeEEEEeC--C---c---eeEEecCcceEEEe Confidence 9999999988765 4556665 4778899998888765321 1111122221110 0 0 11122333332221 Q ss_pred ecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCcCCh Q lcl|NC_010808. 220 TSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIK--GNLSLDP 297 (512) Q Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~--g~~~~~~ 297 (512) ... -........|.|.+..+...+.....+..-..+.+.-.+.|-.+++ |....++ T Consensus 245 ~~~----------------------~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~ 302 (563) T protein:vir:95 245 RNP----------------------RTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQ 302 (563) T ss_pred ccC----------------------CCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCH Confidence 110 0000012357777777776666555555555555565666665543 4333344 Q ss_pred hhhhhhhhccccccchhhhhhccc--ccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccc----- Q lcl|NC_010808. 298 DEVKKQKEANVLFLEPTVYENRDT--GIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG----- 370 (512) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~----- 370 (512) +..+.++..-.-... ...... ..-.+++.++.-++.+.....+.+..+.....|+..-++|....+-... T Consensus 303 e~~~~~~~~~~~~~~---G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~ 379 (563) T protein:vir:95 303 HALENFKREWKSSLS---GINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATG 379 (563) T ss_pred HHHHHHHHHHHHHhc---cccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccc Confidence 444433322110000 000111 1123455566666554455566777888888899999999766643211 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--h Q lcl|NC_010808. 371 TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--G 448 (512) Q Consensus 371 n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~ 448 (512) ...|..+... .+ .......+...|..+++.|...++..-... . ...+.+.|.+.-+.+..+..+. .++ + T Consensus 380 ~~~~ss~~~s--n~---e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~--~-~~~~~~~f~r~D~~~~~e~~~~-~~~~~~ 450 (563) T protein:vir:95 380 SKGGSTLNEA--DP---GKKQQQSQNKGLQPLLRFIEDLVNRHIISE--Y-GDKYTFQFVGGDTKSATDKLNI-LKLETQ 450 (563) T ss_pred cccccchhhc--cH---HHHHHHHHHHHHHHHHHHHHHHHHhhhchh--c-ccccEEEeccCCHHHHHHHHHH-HHHhcC Confidence 1111111100 00 111223344455555555554444321111 1 1245667876655555444432 333 5 Q ss_pred ccCChHHHHHhCCCCC--CHHH--------HHHHH----HHHHHHHHHHHHhhcccCCCCCCCCCCCCC---CcCcccCC Q lcl|NC_010808. 449 GKISQTTLMSLFSFFQ--DPEL--------EVKKI----EEDEKESIKKAQKGIYKDPRDINDDEQDDD---TKDTVDKK 511 (512) Q Consensus 449 g~~s~et~~~~~~~v~--d~~~--------E~~ri----~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 511 (512) |+++.-.++++++.-. +-+. -+... ..+.+...............+....+.+.+ ..++.+.+ T Consensus 451 G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (563) T protein:vir:95 451 IFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIG 530 (563) T ss_pred CccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCccccc Confidence 8999988888775422 1000 00000 000000000000000111111111111111 11111111 Q ss_pred C Q lcl|NC_010808. 512 E 512 (512) Q Consensus 512 e 512 (512) + T Consensus 531 ~ 531 (563) T protein:vir:95 531 T 531 (563) T ss_pred c Confidence 1 No 166 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=97.85 E-value=1.5e-05 Score=47.01 Aligned_cols=434 Identities=9% Similarity=0.012 Sum_probs=189.7 Q ss_pred hhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhcc--Cc---- Q lcl|NC_010808. 38 LLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGN--PI---- 111 (512) Q Consensus 38 ~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~--~~---- 111 (512) +....+...+.+..++..-..+++.+.+|..-.-. .. ..........|+--+-+...++.+++.|++- |+ T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~-~~---~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~W 76 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLPYLL-TE---DGHASGGRLQQPYQSLGSKGVNALSSKLMLSLFPIQTSF 76 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccccC-CC---CCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCcc Confidence 23333444444444544445667777777532110 00 0011111223455677788888888887653 22 Q ss_pred -eecCCc----------hh-----------HHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEcc Q lcl|NC_010808. 112 -QCQDDD----------KD-----------VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDA 169 (512) Q Consensus 112 -~~~~~d----------~~-----------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p 169 (512) ++...+ ++ +...+...+..++|.....++.++..++|.+.++ .++++ +++++- T Consensus 77 F~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~--~~~~~---~~~~pl 151 (542) T protein:vir:78 77 FKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVF--AGKKT---LKVYPL 151 (542) T ss_pred ccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEE--ecCCC---ceEEec Confidence 222221 11 1223445566788999999999999999998665 45543 333433 Q ss_pred ceeEEEEeCCCCceeEEEEEEeeeeeec--------c-----------CCcceEEEEE-EEcCCc--EEEEEecCCccc- Q lcl|NC_010808. 170 MSTFVIYDNTIERNSIAGVRYLRTKPID--------K-----------TDEDEVFTVD-LFTSHG--VYRYLTSRTNGL- 226 (512) Q Consensus 170 ~~~~~i~d~~~~~~~~~~v~~~~~~~~~--------~-----------~~~~~~~~~~-~yt~~~--~~~~~~~~~~~~- 226 (512) .+ |.+--+. .+++...+|.++..... . ........++ ++.... ++.+.......+ T Consensus 152 ~~-y~v~~d~-~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~~~~~~~s 229 (542) T protein:vir:78 152 DR-YVIERDG-DGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCKLVDGQHR 229 (542) T ss_pred ce-eEEeeCC-CCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccccccCCCeEE Confidence 33 4444333 45566666655433110 0 0000111111 111111 111110000000 Q ss_pred -c--cccccc-c-cccccccccceEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCC Q lcl|NC_010808. 227 -K--LTPREN-G-FESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLD 296 (512) Q Consensus 227 -~--~~~~~~-~-~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~ 296 (512) . ...... . ....+|..+|++.. ..+.+|+|..+...+-+..++.+.-......+...+|.+.+.-..... T Consensus 230 ~~~e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~~~ 309 (542) T protein:vir:78 230 WHQECDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSPSATTK 309 (542) T ss_pred EEEEeccccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccc Confidence 0 000100 0 12235666776654 245689999999999999999998888888888888876652111111 Q ss_pred hhhhhhhhhccccccchhhhhhcccccCCCCCcceeEE--eecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchH Q lcl|NC_010808. 297 PDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYI--YKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSG 374 (512) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg 374 (512) ...+.. + ....+..+...+++.+ ....+.......++.++..|...-..-+ ...+...+| T Consensus 310 ~~~~~~---~------------~~g~iv~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~~~---~~d~~rvTA 371 (542) T protein:vir:78 310 PQSLAR---A------------GTGAIIQGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFLILN---VRQSERTTA 371 (542) T ss_pred hhhccc---C------------CCceeecCCccceeeeecccccchhHHHHHHHHHHHHHHHHhcccc---cCCcccccH Confidence 111110 0 0000111222333333 3344666677778887777754332111 112334577 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHhccCCCcccccceeeEEeCCCCCcC-HHHHHHHH- Q lcl|NC_010808. 375 EAMKYKLFGLEQRTKTKEGLFTKGLRR--------RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKS-LIEELKAY- 444 (512) Q Consensus 375 ~Ai~~~~~~l~~k~~~~~~~~~~~l~~--------~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d-~~~~~~~~- 444 (512) ..+.... .++...++..+.+ +++-++.++...+..+...+ ..+++.+..++..- ..+.++.+ T Consensus 372 tEV~~r~-------~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~-~lv~~~~~s~La~~~r~~~~~~l~ 443 (542) T protein:vir:78 372 TEVREVQ-------MELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLPK-GLVMPTVVAGLGGVGRGEDRAALI 443 (542) T ss_pred HHHHHHH-------HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCch-hceeeeeechHHHHHHHHHHHHHH Confidence 6665543 3333334433333 33333444444443332211 23567676555321 11111111 Q ss_pred ---HHHhccCChHH---------HHHhC---CCCC-----CHHHHHHHHHHHHHHHHHHHH-----hhcccCC----CC- Q lcl|NC_010808. 445 ---IDSGGKISQTT---------LMSLF---SFFQ-----DPELEVKKIEEDEKESIKKAQ-----KGIYKDP----RD- 494 (512) Q Consensus 445 ---~kl~g~~s~et---------~~~~~---~~v~-----d~~~E~~ri~~E~~~~~~~~~-----~~~~~~~----~~- 494 (512) ..++.++..+. ++..+ -+++ ..++|+++.+++.+.....+. ......+ .. T Consensus 444 ~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al~~~a~~~a~~~~~~~~~~ 523 (542) T protein:vir:78 444 EFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQQQMTASLMGQAGQLAKSPIGEKMMQ 523 (542) T ss_pred HHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhh Confidence 11111121222 22222 1222 224555555544333222111 1111100 00 Q ss_pred -----CCCCCCCCCCcCcc Q lcl|NC_010808. 495 -----INDDEQDDDTKDTV 508 (512) Q Consensus 495 -----~~~~~~~~~~~~~~ 508 (512) ++.....+.++++. T Consensus 524 ~~~a~~~~~~~~~~~~~~~ 542 (542) T protein:vir:78 524 QINAPGQEAPAGPQTGEDL 542 (542) T ss_pred hcCCCCcCCCCCCcccccC Confidence 11111112233333 No 167 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=97.83 E-value=1.6e-05 Score=46.87 Aligned_cols=411 Identities=10% Similarity=0.031 Sum_probs=168.2 Q ss_pred eeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHH--HHHhcccccccc---cc------------ccccccccccee Q lcl|NC_010808. 27 VVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVL--SDYYEGKTKNLV---EL------------TRRKEEYMADNR 89 (512) Q Consensus 27 ~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~--~~yy~G~~~~~~---~~------------~~~~~~~~~~~r 89 (512) +.. ++.+ -..+++.. ++.+|-... --+...+..... .. ...........- T Consensus 1 ~~~-~~~~--------~~~~~~~~----~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 67 (441) T protein:vir:98 1 MHW-YNTD--------CYFVDFKS----RKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEA 67 (441) T ss_pred Cce-ecCc--------cceecccc----ccchhhhhhccccccccccccccCCCcchHHHHHHhhcccccCccccchhhh Confidence 100 0000 00011100 000000000 000000000000 00 000000000000 Q ss_pred eecchHHHHHHHHHhhhhccCceecCCc-hhHHHHHHHHHh--ccC---hhHHHHHHHHHHHhCCeEEEEEEECCCCce- Q lcl|NC_010808. 90 VAHDYASYISDFINGYFLGNPIQCQDDD-KDVLEAIEAFND--LND---VESHNRSLGLDLSIYGKAYELMIRNQDDET- 162 (512) Q Consensus 90 i~~n~~~~iv~~~a~~l~g~~~~~~~~d-~~~~~~l~~~~~--~n~---~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~- 162 (512) +.+.=.-..|+.+++-+..-|+.+.-+. ......+..++. -|. ...+...+..+++.+|.||+++.++.+|++ T Consensus 68 l~~~~V~acv~~Ia~~iA~lpl~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~ 147 (441) T protein:vir:98 68 IRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPM 147 (441) T ss_pred hccHHHHHHHHHHHHhhccCceEEecCCcccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEE Confidence 0011111235555555555666653221 112223444442 232 335667788889999999999999988875 Q ss_pred EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccc Q lcl|NC_010808. 163 RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFER 242 (512) Q Consensus 163 ~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (512) .+..++|..+.+..++. .++...+ ...+.... .....+.+..+.+++.. ++ T Consensus 148 ~L~~i~~~~v~v~~~~~--g~~~~~~-----~~~~~~~~---~~~~~~~~~dviHir~~-----------------~~-- 198 (441) T protein:vir:98 148 NLTFRKTSEIELKLDAR--GRLYYFH-----QRIDSNGN---NIERNVKFEDMLDIKFY-----------------SL-- 198 (441) T ss_pred EEEEEcCceeEEEECCC--CcEEEEE-----EEeccCcc---eeeEEEccccEEEeccC-----------------CC-- Confidence 47788999988887653 2222211 11111110 11123444444443210 00 Q ss_pred cceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcC-Chhhhhhhhhccccccchhhhhhccc Q lcl|NC_010808. 243 MPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSL-DPDEVKKQKEANVLFLEPTVYENRDT 321 (512) Q Consensus 243 vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (512) +.-.|.|.+..+...++....+..-....+.-...|-.+++-.... +++..+..+..-.-... + ..+... T Consensus 199 -------dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~-G-~~nag~ 269 (441) T protein:vir:98 199 -------DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFS-G-TKQAGK 269 (441) T ss_pred -------CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhc-C-ccccCc Confidence 1123677777766666655555444445555556666666422121 22222222211100000 0 001111 Q ss_pred ccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 322 GIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRR 401 (512) Q Consensus 322 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~ 401 (512) ....+++.+++.++.......+.+..+.....|+..-++|....+...++.|.+.... .|...|.. T Consensus 270 ~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~--------------~y~~tl~P 335 (441) T protein:vir:98 270 VVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANL--------------DYLSTLKP 335 (441) T ss_pred ceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHH--------------HHHHHHHH Confidence 1223455566666554444455566677778888888999777653333333222111 12223444 Q ss_pred HHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHH Q lcl|NC_010808. 402 RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFF--QDPELEVKKIEEDE 477 (512) Q Consensus 402 ~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v--~d~~~E~~ri~~E~ 477 (512) ++..|..-++.+-... ..-..+++....-+-.|..+.++++.++ .|+++.-.++++++.- ++.+..+-.+... T Consensus 336 ~~~~ie~~ln~~L~~~--~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGd~~~~~~~~n- 412 (441) T protein:vir:98 336 YITCVCAELNFKFNDE--YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLN- 412 (441) T ss_pred HHHHHHHHHHhhcccc--ccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccc- Confidence 4443333333321110 1111233333444667888888888887 6899999998887542 2222111000000 Q ss_pred HHHHHHHHhhcc-cCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 478 KESIKKAQKGIY-KDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 478 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ..... .+.-+.+..+..+.....+|+.| T Consensus 413 -------~~~~~~~~~~q~~~~~~~~~~~kgGe~ne 441 (441) T protein:vir:98 413 -------HVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred -------ccccccccccccccccccccccCCCCCCC Confidence 00000 00000011111111112222222 No 168 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=97.79 E-value=1.9e-05 Score=46.51 Aligned_cols=376 Identities=9% Similarity=-0.001 Sum_probs=166.8 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri 90 (512) |++. +++|..-..-. ...... ....+-+... ...... ...-+ T Consensus 1 Mg~f---~~~~~~~~~~~--------------~~~~~~--------------~~~~~~~~~~----~~~~v~---~~~~l 42 (382) T protein:vir:48 1 MPIF---NLATESPPDNQ--------------GGFFDV--------------VDSDFLASLK----GNEWVS---AETAL 42 (382) T ss_pred Cccc---cccccCCcccc--------------cccccc--------------hhhhcccccc----CCcccc---hHhhh Confidence 4333 33332211000 000000 0000000000 000000 00001 Q ss_pred ecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEcc Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDA 169 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p 169 (512) ...-....|+.+++-+..-|+++.... ....+.+=...-....+...+..+.+.+|.||+++-.|.+|++ .+..++| T Consensus 43 ~~~~v~~~i~~ia~~ia~~~~~~~~~~--~~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~ 120 (382) T protein:vir:48 43 RNSDLFSIINQLSNDLATVKLITSRKK--LQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRP 120 (382) T ss_pred ccHHHHHHHHHHHHhhccCceeeecch--hhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcC Confidence 223344556666666666676653322 2222222222224466777788899999999999999988875 6777889 Q ss_pred ceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeec Q lcl|NC_010808. 170 MSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFS 249 (512) Q Consensus 170 ~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~ 249 (512) ..+.++.++.. ..+ +|.+........ ....+.++.+++++... .. T Consensus 121 ~~v~v~~~~~~-~~~-----~y~~~~~~~~~~----~~~~~~~~evih~~~~~-------------------------~~ 165 (382) T protein:vir:48 121 SQVSFNRLDNK-DGI-----YYNITFDDPRIP----PKQHVPQNDVLHFRLLS-------------------------VD 165 (382) T ss_pred ceeEEEEcCCC-CeE-----EEEEEecCcccc----ceeEEcCccEEEecCCC-------------------------CC Confidence 88877765432 111 122111110000 01123344444432110 00 Q ss_pred CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCc Q lcl|NC_010808. 250 NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSV 329 (512) Q Consensus 250 n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (512) ....|.|.+..+...++....+..-..+.+.-.+.|-.+++-....+.+..........- ...+.......+++. T Consensus 166 ~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~-----~~~n~g~~~vl~~g~ 240 (382) T protein:vir:48 166 GGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLDFKTKLSRSRQA-----MKQMQGGPLVLDDLE 240 (382) T ss_pred CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHh-----hccCCCCeeEcCCCc Confidence 113577878877777776666655566666666777776654333344433332221110 111111122234555 Q ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 330 DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETI 409 (512) Q Consensus 330 ~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~ 409 (512) ++..++.......+.+..+...+.|+..-++|....+..+.+.+.. ......+...|..+++.|... T Consensus 241 ~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~-------------~~~~~~~~~~l~p~~~~i~~~ 307 (382) T protein:vir:48 241 DFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQQSSL-------------EMSSDLYSKAVSRYLRPFLSE 307 (382) T ss_pred eEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHH-------------HHHHHHHHHHHHHHHHHHHHH Confidence 6666654444555567778888889998899977765433322211 111223344444444444444 Q ss_pred HHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 410 LKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLF---SFFQDPELEVKKIEEDEKESIKKA 484 (512) Q Consensus 410 l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~---~~v~d~~~E~~ri~~E~~~~~~~~ 484 (512) ++..-...... ++...+ -.+.......+.++ +|++++-.+++.+ ++.+++..+.+ T Consensus 308 l~~~l~~~~~~---~~~~~~----~~~~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~~------------- 367 (382) T protein:vir:48 308 LSQKLSCDVDA---DIFPAV----DPTGSNYISRINSLVKTGTLAQNQGLYILQQAEILPKELPNGE------------- 367 (382) T ss_pred HHHHhcChhhh---hhhhhh----ccchhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcchhhhh------------- Confidence 43321111111 111111 12233344455555 6788887777655 44443211100 Q ss_pred HhhcccCCCCCCCCCCCCCCcCcccCC Q lcl|NC_010808. 485 QKGIYKDPRDINDDEQDDDTKDTVDKK 511 (512) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (512) ... +. -+++|+.+++ T Consensus 368 --~~~--~~--------~~GGd~~~~~ 382 (382) T protein:vir:48 368 --NPN--ST--------LKGGEEDGQD 382 (382) T ss_pred --cCC--CC--------CCCCCCCCCC Confidence 000 00 0111111111 No 169 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=97.76 E-value=2.1e-05 Score=46.19 Aligned_cols=440 Identities=10% Similarity=0.056 Sum_probs=185.7 Q ss_pred hhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhcc--Cc---- Q lcl|NC_010808. 38 LLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGN--PI---- 111 (512) Q Consensus 38 ~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~--~~---- 111 (512) +.+..+...+.+..++..-..+++.+.+|..-.-. . ...........++.-+-+...++.+++.|++- |+ T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~-~---~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~W 76 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTLPYIL-T---DEGHVQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNTSF 76 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccccc-C---CCCCcccccccccccccHHHHHHHHHHHHHHhhcCCCCcc Confidence 22223333333444444445566667666533110 0 00111112223455677888888888887653 22 Q ss_pred -eecCCch---------h-----------HHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccc Q lcl|NC_010808. 112 -QCQDDDK---------D-----------VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAM 170 (512) Q Consensus 112 -~~~~~d~---------~-----------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~ 170 (512) ++...+. + ....+...+..++|.....++.++..++|.+.++ .++++ + +++ |. T Consensus 77 F~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly--~~~~~-~--~~~-pl 150 (555) T protein:vir:17 77 FKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLY--QGKKN-L--KLY-PL 150 (555) T ss_pred cccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEE--ecCCc-e--eEE-Ec Confidence 2222221 1 1223444456688999999999999999998754 45553 3 333 33 Q ss_pred eeEEEEeCCCCceeEEEEEEeeeeeec-----cC----------------------------CcceEEEEEEEcC----C Q lcl|NC_010808. 171 STFVIYDNTIERNSIAGVRYLRTKPID-----KT----------------------------DEDEVFTVDLFTS----H 213 (512) Q Consensus 171 ~~~~i~d~~~~~~~~~~v~~~~~~~~~-----~~----------------------------~~~~~~~~~~yt~----~ 213 (512) .-|.+--+. .+++...+|.++..... +. .......+++|+. . T Consensus 151 ~~y~v~~d~-~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~~~~~~ 229 (555) T protein:vir:17 151 DRFVVSRDG-EGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTYVCRKD 229 (555) T ss_pred CeEEEeeCC-CcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcceeEeecccccC Confidence 334444333 35566666654422110 00 0001111223321 1 Q ss_pred cEEEEEecCCccccccccccccccccccccceEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceee Q lcl|NC_010808. 214 GVYRYLTSRTNGLKLTPRENGFESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLL 288 (512) Q Consensus 214 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv 288 (512) .-+.|.....+.... .....-+|..+|++.+ .++.+|+|..++..+-+..++.+.-......+...+|.+. T Consensus 230 ~~~~~~~e~~~~~v~----~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~l 305 (555) T protein:vir:17 230 GQVKWHQECDGKVIP----GSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVVFM 305 (555) T ss_pred CeeEEEEecCceecc----ccccccCcccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCcee Confidence 111111111110000 0012345666777654 3456899999999999999999988888888888888866 Q ss_pred eecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhcccccccc Q lcl|NC_010808. 289 IKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMFTNTPNMKDD 366 (512) Q Consensus 289 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~ 366 (512) +.-.......++. .++ ...+..+...+++.+. ...+.......++.++..|...-..- +. T Consensus 306 v~~~g~~~~~~l~---~~~------------~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~~--~~- 367 (555) T protein:vir:17 306 VSPSATTKPQNLA---LAA------------NGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLML--QV- 367 (555) T ss_pred eccccccCcceee---cCC------------CceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhhc--CC- Confidence 5211111111111 011 0011112222333333 23345666667777766664432211 11 Q ss_pred cccccchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccCCC-cccccceeeEEeCCCCCc-CHHHHHH- Q lcl|NC_010808. 367 NFSGTQSGEAMKYKLFGLEQRTKTK-EGLFTKGLRRRAKLLETILKNTRSID-ANKDFNTVRYVYNRNLPK-SLIEELK- 442 (512) Q Consensus 367 ~~~~n~Sg~Ai~~~~~~l~~k~~~~-~~~~~~~l~~~~~li~~~l~~~~~~~-~~~d~~~i~i~f~~~~p~-d~~~~~~- 442 (512) ..+...+|..+........+...-. .++-.+.|.-+++-++.++...+..+ .+.+.. .+.+.-.+.. ...+.++ T Consensus 368 ~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v--~~~i~~~l~~l~r~~~~~~ 445 (555) T protein:vir:17 368 RQSERTTATEVQATVQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLPQLPKDLV--QPTVVAGLWGVGRGQDKQQ 445 (555) T ss_pred CCcccchHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhh--ccceeehHHHHHHHHHHHH Confidence 2234567776665433333322221 11112333333333444444444332 222222 2222212111 0111111 Q ss_pred ---HHHHHhcc---------CChHHH----HHhCCC----CCCHHHHHHHHHHHHHHHHHHHHhhcc-----cCCC---- Q lcl|NC_010808. 443 ---AYIDSGGK---------ISQTTL----MSLFSF----FQDPELEVKKIEEDEKESIKKAQKGIY-----KDPR---- 493 (512) Q Consensus 443 ---~~~kl~g~---------~s~et~----~~~~~~----v~d~~~E~~ri~~E~~~~~~~~~~~~~-----~~~~---- 493 (512) .+..++.+ +....+ ...+|. +-..++|+++++++++.....++.... +.+. T Consensus 446 l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~~~qa~~~~~~~~~~~~ 525 (555) T protein:vir:17 446 LMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASLINQAGQLAKTPMAEQA 525 (555) T ss_pred HHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhH Confidence 12222222 112222 233332 112456666666554433322221111 0000 Q ss_pred --CCCCCCCCC-------CCcCcccCCC Q lcl|NC_010808. 494 --DINDDEQDD-------DTKDTVDKKE 512 (512) Q Consensus 494 --~~~~~~~~~-------~~~~~~~~~e 512 (512) ....+.... ..+.+.+.-+ T Consensus 526 ~~~~~~~~~~a~~~~~a~~~~~~~~~~~ 553 (555) T protein:vir:17 526 MQLIQQQQEGAQDAGAAESETSSAEAQA 553 (555) T ss_pred HhccccchhhhhHHHHHHhhcCCccccc Confidence 000000000 0111111111 No 170 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=97.76 E-value=2.1e-05 Score=46.18 Aligned_cols=425 Identities=14% Similarity=0.106 Sum_probs=159.8 Q ss_pred eecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhh Q lcl|NC_010808. 28 VYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (512) Q Consensus 28 ~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~ 107 (512) .|.+.- .+..+.+.- ..... ......+.+.......+.. .....++.--...+..-.|+..+..+. T Consensus 1 ~~~~~~-------~~~~~~~~~-----~~~~~-~~~~~~~~~~~~~~~~pp~-~~~~La~~~~~n~~v~scI~~ia~~ia 66 (540) T protein:vir:41 1 MFNYHL-------SIKSLEKYR-----AIKGD-TDSQALKEDRFEEYVEPKV-HPLVLLSLLQVNPYHASACSIKANDIL 66 (540) T ss_pred CCCccc-------Chhhccchh-----hhhcc-ccccccccCCCCccccCCC-CHHHHHHHHHhcHHHHHHHHHHHHHHh Confidence 111110 011111110 00000 0011111111111110000 000000000123566778899999999 Q ss_pred ccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEE Q lcl|NC_010808. 108 GNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIA 186 (512) Q Consensus 108 g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~ 186 (512) +-|+.+...+......+-.. .-....+...+..+.+.+|.||+.+..+..|++ .+..++|..+-+..+.. T Consensus 67 ~~~~~i~~~~~~~~~~lpN~--~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~~~~------- 137 (540) T protein:vir:41 67 RTGYLIDGDDGGVEELLRAC--RPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHRDGS------- 137 (540) T ss_pred cCCceEecCccchhhhccCC--CCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeEcCc------- Confidence 99998877766554433211 113566777888899999999999999888875 46778888876654432 Q ss_pred EEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecC-----CCCCCcchHHH Q lcl|NC_010808. 187 GVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSN-----NERRKGDYEKV 261 (512) Q Consensus 187 ~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----~~~g~s~~~~v 261 (512) +++... ......++..|.......... .. ....+..=-|+++++ ...|.|.+... T Consensus 138 --~~~~~~-----d~~~~~~~~~~~~~~~~~~~~--g~-----------~~~~~~~~eViHir~~~~~~~~~G~Spi~~~ 197 (540) T protein:vir:41 138 --RYMQTW-----DGIHVTYFKDYRYEGEVNPDN--GE-----------DQDGVGANEIIFIHLPSPICSYYGVPRYLSA 197 (540) T ss_pred --eeEeee-----cCceeeeeecccccceeeccc--cc-----------cceeecccceEEecCCCCCCCcccccHHHHH Confidence 111111 011111222221111111000 00 000011112455532 22577777665 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCcCChh----hhhhhhhccccccch---hh--hhhccc--ccC--CC Q lcl|NC_010808. 262 ITLIDLYDNAESDTANYMSDLNDAMLLIK--GNLSLDPD----EVKKQKEANVLFLEP---TV--YENRDT--GIE--TE 326 (512) Q Consensus 262 ~~liDa~~~~~s~~~~~~~~~~~~~lv~~--g~~~~~~~----~~~~~~~~~~~~~~~---~~--~~~~~~--~~~--~~ 326 (512) ...+.....+..-..+.+.-.+.|-.+++ |....... .....++...-.... +. ...... ... .+ T Consensus 198 ~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~ 277 (540) T protein:vir:41 198 APSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDT 277 (540) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCcc Confidence 55555444443333333444455655543 32211110 000000000000000 00 000000 011 12 Q ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccc---cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 327 GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG---TQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRR 402 (512) Q Consensus 327 ~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~---n~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~ 402 (512) ++.++.-++.......+.+..+...+.|+..-++|....+...+ +- +.+.... ..+...|.-+ T Consensus 278 ~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~-------------~f~~~tL~P~ 344 (540) T protein:vir:41 278 VEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARR-------------TYYESVVRPQ 344 (540) T ss_pred cceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHH-------------HHHHHHHHHH Confidence 23344444443344556677778888899999999776643221 11 1121111 1122223333 Q ss_pred HHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC---CCHH--------HH Q lcl|NC_010808. 403 AKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFF---QDPE--------LE 469 (512) Q Consensus 403 ~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v---~d~~--------~E 469 (512) ++.|...++..-... . ..++.+.|+..-.... +.+..+.++ +|+++.-.+++.++.+ +|+- .+ T Consensus 345 ~~~ie~~ln~~L~~~--~-~~~~~i~f~~~~ll~~-D~~~~~~~lv~~G~lT~NE~Re~L~g~e~gdd~~l~p~n~~~~~ 420 (540) T protein:vir:41 345 QEIVSSVLTDFIQLK--L-DPGARFVFNEEILMES-EFVHNYALLVQCGVLTPSEVREKLFGLDGGPDMFMVPSSIGKSA 420 (540) T ss_pred HHHHHHHHHHhhhhc--c-CCceEEEecchhhcch-HHHHHHHHHHhCCCCCHHHHHHHhCcCcCCCccccccccccccc Confidence 333333332211111 1 1234566754433222 223333333 6899988887755322 2321 11 Q ss_pred HHHHHHHHHHHHHHHHhhccc--CCCCC---CCCCCCCCCcCcccC------------CC Q lcl|NC_010808. 470 VKKIEEDEKESIKKAQKGIYK--DPRDI---NDDEQDDDTKDTVDK------------KE 512 (512) Q Consensus 470 ~~ri~~E~~~~~~~~~~~~~~--~~~~~---~~~~~~~~~~~~~~~------------~e 512 (512) +..-.+..+............ ++... +.+...++++.+-++ +| T Consensus 421 ~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (540) T protein:vir:41 421 MKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLEDKKKKIDEVLSDFRAEAYENGK 480 (540) T ss_pred ccccccccCCCCccccccccchhcccccCccccccccccccccccccccccCCccccchh Confidence 110000000000000000000 00000 000000111111111 11 No 171 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=97.76 E-value=2.2e-05 Score=46.13 Aligned_cols=381 Identities=9% Similarity=0.053 Sum_probs=165.7 Q ss_pred hhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccc--cccccccccccccc-cceeeecchHHHHHHHHHhhhhccCceec Q lcl|NC_010808. 38 LLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKT--KNLVELTRRKEEYM-ADNRVAHDYASYISDFINGYFLGNPIQCQ 114 (512) Q Consensus 38 ~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~--~~~~~~~~~~~~~~-~~~ri~~n~~~~iv~~~a~~l~g~~~~~~ 114 (512) +. .-+.+++++.... +.-.....+...-. .+...........- ...-+...-....|+.+++-+.+-|+.+. T Consensus 1 m~---m~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~ 75 (392) T protein:vir:74 1 MI---LPILNFINQTNDP--PEAGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAE 75 (392) T ss_pred Cc---chhhhhhhcccCc--ccccccccccccCchhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHhhccCceeec Confidence 00 0011111110000 00000000000000 00000000000000 00001223344456667776666676653 Q ss_pred CCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeee Q lcl|NC_010808. 115 DDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRT 193 (512) Q Consensus 115 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~ 193 (512) -.. ....+.+=........+...+..+++.+|.||+++-++.+|++ .+..++|..+.+..+... ..+ +|.+ T Consensus 76 ~~~--~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~-~~~-----~y~~ 147 (392) T protein:vir:74 76 KKK--NQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYE-NGM-----YYNI 147 (392) T ss_pred cch--hhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC-ceE-----EEEE Confidence 222 2222332222223456677788899999999999999988886 577888888877765432 211 1221 Q ss_pred eeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHH Q lcl|NC_010808. 194 KPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (512) Q Consensus 194 ~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s 273 (512) ...... ......+.++.++++.... ......|.|.+..+...|+....+.. T Consensus 148 ~~~~~~----~~~~~~~~~~evih~~~~~-------------------------~~~~~~G~s~i~~~~~~i~~~~~~~~ 198 (392) T protein:vir:74 148 TFDDPK----IEPILQAPQSDLIHMKLLS-------------------------IDGGKTGISPLYSLRRESKIQRASDR 198 (392) T ss_pred EecCCc----cceeEEEcCccEEEecCCC-------------------------CCCccccccHHHHHHHHHHHHHHHHH Confidence 111100 0011224444444442210 00112477878777777766666555 Q ss_pred HHHHHHHHhcCceeeeecCCc-CChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHH Q lcl|NC_010808. 274 DTANYMSDLNDAMLLIKGNLS-LDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNS 352 (512) Q Consensus 274 ~~~~~~~~~~~~~lv~~g~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~ 352 (512) -....+.-...|-.+++-... ...++.+... ...+. +. .+.......+++.+++-++.......+.+..+.... T Consensus 199 ~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~-~~~~~---~~-~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~ 273 (392) T protein:vir:74 199 LTISSLNSSLNVPGVLTVKGGGLLSDKDKASR-SRSFM---KR-SRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSK 273 (392) T ss_pred HHHHHHhccCCCceEEEeCCCCCchHHHHHHH-HHHHh---cc-ccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHH Confidence 555555666666655542111 1111111110 00000 00 111111223455566666544445556677777888 Q ss_pred HHHHHhcccccccccccccchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCC Q lcl|NC_010808. 353 DIHMFTNTPNMKDDNFSGTQSG-EAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNR 431 (512) Q Consensus 353 ~i~~~s~~p~~~~~~~~~n~Sg-~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~ 431 (512) .|+..-++|....+..+.+.|. .+. +..+...|..+++.|...++.+-.. .+++.+.. T Consensus 274 ~Ia~~fgVPp~~lg~~~~~~~~~e~~--------------~~~~~~~l~p~~~~ie~~l~~~l~~-------~~~~~~~~ 332 (392) T protein:vir:74 274 QYAKVYGLPDSYIGGQGDQQSSIQQI--------------SGMYASALNRYLRPAISELEYKLSD-------HISVNMRP 332 (392) T ss_pred HHHHHhCCCHHHhCCCCCcccHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhccc-------hhcccchh Confidence 8988889987766543333322 222 1233444555555554444432211 12222233 Q ss_pred CCCcCHHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCC Q lcl|NC_010808. 432 NLPKSLIEELKAYIDS--GGKISQTTLMSLF---SFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDD 503 (512) Q Consensus 432 ~~p~d~~~~~~~~~kl--~g~~s~et~~~~~---~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (512) -+-.|..+.++.+.++ +|+++...+.+++ |+..| |+.+ .........+ +..++-+ T Consensus 333 ~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pn---e~r~-----------~enl~~~~~G---d~~~p~p 392 (392) T protein:vir:74 333 AIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK---DLPA-----------PENTNKKTTG---QSNEPVP 392 (392) T ss_pred hhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCcc---ccch-----------hcCCCCCCCC---CCCCCCC Confidence 3345667777777776 6899998877654 44332 2211 0001100111 1111111 No 172 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=97.75 E-value=2.3e-05 Score=46.04 Aligned_cols=395 Identities=12% Similarity=0.041 Sum_probs=173.0 Q ss_pred cCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---------ccccc------ccccccc Q lcl|NC_010808. 21 FNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKN---------LVELT------RRKEEYM 85 (512) Q Consensus 21 f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~---------~~~~~------~~~~~~~ 85 (512) ++++--+. .+.+++..+....+. +.... .....+. T Consensus 1 ~~~~~~~~------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~ 50 (432) T protein:vir:10 1 MPDEKKLG------------------------------LLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGA 50 (432) T ss_pred CCCCcccc------------------------------hhhhhHhhcCCccccccccccccccCcchhhhhcccccccCc Confidence 23322222 122222222221110 00000 0000000 Q ss_pred ---cceeeecchHHHHHHHHHhhhhccCcee--cCCc---hhHHHHHHHHHh--cc---ChhHHHHHHHHHHHhCCeEEE Q lcl|NC_010808. 86 ---ADNRVAHDYASYISDFINGYFLGNPIQC--QDDD---KDVLEAIEAFND--LN---DVESHNRSLGLDLSIYGKAYE 152 (512) Q Consensus 86 ---~~~ri~~n~~~~iv~~~a~~l~g~~~~~--~~~d---~~~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~ 152 (512) .+.-+.+.-....|+.+++-+.+-|+.+ ...+ ......+..++. -| ....+...+..+++.+|.||+ T Consensus 51 ~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~ 130 (432) T protein:vir:10 51 AVNADAIMRLDAVAACVKLVSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYV 130 (432) T ss_pred ccchhhhhcchHHHHHHHHHHHhhhhCceeEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEE Confidence 0001122344455666666666667754 1111 112233445542 23 334566778889999999999 Q ss_pred EEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccc Q lcl|NC_010808. 153 LMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPR 231 (512) Q Consensus 153 ~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~ 231 (512) ++..+ +|++ .+..++|..+.++.+.. .++. |.....++ ..+ .+.++.+++++.. T Consensus 131 ~~~~~-~g~~~~L~~l~~~~v~v~~~~~--g~~~-----y~~~~~~g------~~~-~~~~~~iih~~~~---------- 185 (432) T protein:vir:10 131 RKVVT-DGRIESLQYLANDRLTITTDTK--GNTA-----YRYRRTDG------QMI-DIPKQQIWKIMGY---------- 185 (432) T ss_pred EEEec-CCcEEEEEEEcCCceEEEEcCC--CcEE-----EEEEecCc------eEE-EEcCccEEEecCC---------- Confidence 88775 4664 46778888888887643 2211 11111110 011 2334444433210 Q ss_pred ccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhcccccc Q lcl|NC_010808. 232 ENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFL 311 (512) Q Consensus 232 ~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~ 311 (512) ++ +...|.|.+..+...++.......-..+.+.-.+.|-.+++.....+++..+..++.- . T Consensus 186 ------------~~----dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~--~- 246 (432) T protein:vir:10 186 ------------SL----DGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFAKKV--S- 246 (432) T ss_pred ------------CC----CCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHHHH--h- Confidence 00 1123667676666555554444333344445555677777654444444443333211 0 Q ss_pred chhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccc--hHHHHHHHHHHHHHHHH Q lcl|NC_010808. 312 EPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQ--SGEAMKYKLFGLEQRTK 389 (512) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~--Sg~Ai~~~~~~l~~k~~ 389 (512) ...+.......+++.+++.++.......+.+..+.....|+..-++|....+....+. .+..++.. T Consensus 247 ---~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~--------- 314 (432) T protein:vir:10 247 ---GSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQ--------- 314 (432) T ss_pred ---hhhhCCCceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHH--------- Confidence 0111111222345556666654444445556677788889999999987765433222 22323221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeC--CCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC-- Q lcl|NC_010808. 390 TKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYN--RNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFF-- 463 (512) Q Consensus 390 ~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~--~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v-- 463 (512) ....+...|...++.|...++.+-.... +.....+.|+ .-+..|..+.++.+.++ +|+++.-.++++++.- T Consensus 315 -~~~f~~~tl~P~~~~ie~~ln~kL~~~~--~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi 391 (432) T protein:vir:10 315 -QLGFLSMTLSPWLRRIEQSIALNLLSPA--ERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKL 391 (432) T ss_pred -HHHHHHHHHHHHHHHHHHHHHhhhcCcc--ccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC Confidence 1122333444444444444433211111 1123345553 44567888899988887 6899999999888642 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCC Q lcl|NC_010808. 464 QDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 511 (512) Q Consensus 464 ~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (512) ++.. .+-.+..- .... ......+.+ ++....+..++..+.+ T Consensus 392 ~g~~-~~~~~~~~---~~pl--~~~~~~~~~-~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 392 GGNA-AVLTVQSA---MVPL--DSIGLQASP-EPASGLGNQQQDKVSK 432 (432) T ss_pred CCCc-ceEeecCc---ccch--hhhcccCCC-CCCCCCCCcccccccC Confidence 2110 10000000 0000 000011100 0011111111111111 No 173 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=97.72 E-value=2.5e-05 Score=45.79 Aligned_cols=460 Identities=10% Similarity=0.025 Sum_probs=178.3 Q ss_pred CCccee---eccccchhhccccccCCCcCeee----cccchhH-------HhhhcHHHHHH---HHHHHHHHHHHHHHHH Q lcl|NC_010808. 1 MLKANE---FETDTDLRENRNYLFNDEANVVY----TYDGTES-------DLLQNINEVSK---YIEHHMDYQRPRLKVL 63 (512) Q Consensus 1 ~~~~~~---~~~~~~~~~~~~~~f~~~~~~~~----~~~~~~~-------~~~~~~~~l~~---~i~~~~~~~~~r~~~~ 63 (512) ...+|. =.++..+.-|-+.+.+-...-.| +|..... .++=..+.+++ ++.....+.-+....+ T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~ 91 (945) T protein:vir:10 12 IVNANEQKRPSFSSNIKANVDSLSRGKDYPGFKPLLTYRALAWNSTVVYSIIIFRKNQVLKKEKIIVPYNHQEPPFKFNL 91 (945) T ss_pred eeccccccCccccccchhchhhhhcccCCCCcchhhhhhhhhccceeeeeeeeehhhhHHHhhcccccccccccchhhhh Confidence 222222 11122233333333322211111 1111000 00001122111 1111111111111112 Q ss_pred HHHhcccccccccccccccc------cccceeeecchHHHHHHHHHhhhhccCceec--CCch---------hHHHHHHH Q lcl|NC_010808. 64 SDYYEGKTKNLVELTRRKEE------YMADNRVAHDYASYISDFINGYFLGNPIQCQ--DDDK---------DVLEAIEA 126 (512) Q Consensus 64 ~~yy~G~~~~~~~~~~~~~~------~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~--~~d~---------~~~~~l~~ 126 (512) .. +.|..... .+....+. ...+..+...-....|+.+++-+.+-|+++- ..+. .....+.. T Consensus 92 f~-~s~es~s~-vtsls~pdaf~~vnVs~~~AlknsaV~scI~~IA~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~ 169 (945) T protein:vir:10 92 FE-YSPESLMY-LPSISDPDAFFLINLFRKYRFNNDSKLIKVSEIPKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILE 169 (945) T ss_pred hh-ccCcccee-cccccCccceeeehhhhhhhhccHHHHHHHHHHHhhhccCceEEEEecccCcccccccccccchHHHH Confidence 11 22222100 00000000 0001112233455567777777777787641 1111 11234555 Q ss_pred HHhc-cCh-------hHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeec Q lcl|NC_010808. 127 FNDL-NDV-------ESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID 197 (512) Q Consensus 127 ~~~~-n~~-------~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~ 197 (512) ++.. |.. ..+...+..+.+.+|.+|+.+.++.+|++ .+..++|..+.+..++... .. .+ |.. ..+ T Consensus 170 LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~ii~L~pLdPs~Vti~~ddDG~--~~--y~-Yv~-~id 243 (945) T protein:vir:10 170 FLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQGNLVAITPVDGTTIKPILSEDTG--IV--VG-YVQ-EVD 243 (945) T ss_pred HHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCCc--EE--EE-EEE-ecC Confidence 6543 322 12445677899999999999999999986 4788999998887765321 11 11 111 000 Q ss_pred cCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 198 KTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTAN 277 (512) Q Consensus 198 ~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~ 277 (512) . . ....|.+..++++.... ++-|. ....|.|.++.+...++....+.....+ T Consensus 244 ---G--~-~~~~v~a~DvIlhirn~---------------s~DG~-------~~GyGlSPIeaa~~aI~~alAaek~aar 295 (945) T protein:vir:10 244 ---G--A-IVAHFDKRDVVLFRQNL---------------TPDVY-------MYGYSLPPIEILYKVILSDIFIDKGNLD 295 (945) T ss_pred ---C--c-eEEEecCCceEEEeccC---------------CCCcc-------cccCCchHHHHHHHHHHHHHHHHHHHHH Confidence 0 0 11122333322221110 00000 0113555555544444433333222222 Q ss_pred HHHH-hcCceeee--ecCC--------cCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHH Q lcl|NC_010808. 278 YMSD-LNDAMLLI--KGNL--------SLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAY 346 (512) Q Consensus 278 ~~~~-~~~~~lv~--~g~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 346 (512) .+.. .+.|-.++ .+.. ..+.+..+..+..-. ...........+..+++.+++.++.......+.+. T Consensus 296 ~FskNGa~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~we---e~~sG~NnG~piVLdeGmef~pLs~s~~DaQfLEs 372 (945) T protein:vir:10 296 YYRKGGSIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQ---AIMMGDYTQVPILSGGKFTWIDFKGKRRDMQFKEL 372 (945) T ss_pred HHHhCCCccceEEEecCccccccccccccCHHHHHHHHHHHH---HHhCCcccccceecCCCceEEEccCChhHHHHHHH Confidence 2221 23453333 3221 112222222221110 00000111111223555666666655555666677 Q ss_pred HHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceee Q lcl|NC_010808. 347 KDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVR 426 (512) Q Consensus 347 ~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~ 426 (512) .+.....|+..-++|....+...+ .++..++.. ....+...|+.++..+...++..-. .......+. T Consensus 373 rkfs~eeIArAFGVPP~lLG~~e~-st~SNiEqq----------~~~Fv~~tL~Pil~~IEqeLNrkLl--~~~eg~~i~ 439 (945) T protein:vir:10 373 AEFVARKICAVYQVSPQDVGILEG-SNKATAEVM----------ASLTKAKGLEPLMATISKGFDEVVS--EFRNEKDIK 439 (945) T ss_pred HHHHHHHHHHHhCCCHHHcccCCC-CCcchHHHH----------HHHHHHHHHHHHHHHHHHHHHHhcc--ccccCceeE Confidence 888888899999999776653322 111111111 1122334444444444444432211 011223567 Q ss_pred EEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC--CHHHHHH---HHHH--HHHHHHHHHH-hhcccC--CCC Q lcl|NC_010808. 427 YVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQ--DPELEVK---KIEE--DEKESIKKAQ-KGIYKD--PRD 494 (512) Q Consensus 427 i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~--d~~~E~~---ri~~--E~~~~~~~~~-~~~~~~--~~~ 494 (512) +.|+.....+..+.++++.++ .|+++.-.++++++.-. +-+.-+- .+.- +.....+... ....+. +.+ T Consensus 440 fdFd~ldl~D~ksraEal~kli~sGiLTiNEvRe~lGLpPIeGGD~lli~~nn~~P~d~~~ka~~ga~p~q~aq~~~dqp 519 (945) T protein:vir:10 440 LWFKEDDLEKERDWWNIIQGQLNTGFRSINEARMEKGLEPVPWGDVPFSGLRNWKPEDEQAKAQQGAMPPQLAQAMADQP 519 (945) T ss_pred EEecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccccccccCCCCcccccCCCCCC Confidence 888777777888888888876 68999999988885422 1010000 0000 0000000000 000000 000 Q ss_pred CCCCCCCCCCcCcccCCC Q lcl|NC_010808. 495 INDDEQDDDTKDTVDKKE 512 (512) Q Consensus 495 ~~~~~~~~~~~~~~~~~e 512 (512) ....++.++..+..++.. T Consensus 520 ~~kGGe~dEns~~psE~k 537 (945) T protein:vir:10 520 SQQGGGVDENSSVPSEQK 537 (945) T ss_pred CCCCCCCCCCCCCCCccc Confidence 000000011111111111 No 174 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=97.70 E-value=2.7e-05 Score=45.60 Aligned_cols=398 Identities=8% Similarity=-0.024 Sum_probs=171.4 Q ss_pred hhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecc Q lcl|NC_010808. 14 RENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHD 93 (512) Q Consensus 14 ~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n 93 (512) |+....++...-.+. + ... +..+...+.+... ......... .-+.+. T Consensus 1 m~~~~~~~~~~~~~s----~--------~~~---------------w~~~~~~~~~~~~---~~g~~vt~~---~al~~~ 47 (421) T protein:vir:10 1 MFIPQMFEGKKRSVS----G--------GGF---------------WEAMLGGVRSSHS---KAGVMITPE---TALALS 47 (421) T ss_pred CCCcchhcccccccC----c--------chh---------------hHHHhhhhccCcc---cCCceechH---HhhccH Confidence 443333333322210 0 000 0011111111100 000000000 001222 Q ss_pred hHHHHHHHHHhhhhccCcee-cC-Cch----hHHHHHHHHHh--cc---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce Q lcl|NC_010808. 94 YASYISDFINGYFLGNPIQC-QD-DDK----DVLEAIEAFND--LN---DVESHNRSLGLDLSIYGKAYELMIRNQDDET 162 (512) Q Consensus 94 ~~~~iv~~~a~~l~g~~~~~-~~-~d~----~~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~ 162 (512) -....|+.+++-+..-|+.+ .. .+. .....+..++. -| ....+...+..+.+.+|.||+++-++.+|++ T Consensus 48 ~v~~~i~~Ia~~iA~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~ 127 (421) T protein:vir:10 48 AVRACVTLLAESVAQLPVELYRRDKNGGRQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYP 127 (421) T ss_pred HHHHHHHHHHHhhccCceEEEEEcCCCceeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcE Confidence 34445666666666667654 11 111 11223444442 23 3445667778899999999999999988876 Q ss_pred -EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccc Q lcl|NC_010808. 163 -RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFE 241 (512) Q Consensus 163 -~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 241 (512) .+..++|..+.++.++. + ..+|.+.. .+ ..+..+.+++++.. T Consensus 128 ~~L~~l~~~~v~v~~~~~--g-----~~~y~~~~---~g-------~~~~~~eiih~~~~-------------------- 170 (421) T protein:vir:10 128 KELIPINPKKVIVLKGPD--G-----MPYYEIPE---IG-------ETLPMRMMHHVKVF-------------------- 170 (421) T ss_pred EEEEEecCceEEEEECCC--c-----eEEEEEcC---CC-------cEEchhhEEEecCc-------------------- Confidence 46677888877665432 1 12222110 00 01222333322110 Q ss_pred ccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcC----Chhhhhhhhhccccccchhhhh Q lcl|NC_010808. 242 RMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSL----DPDEVKKQKEANVLFLEPTVYE 317 (512) Q Consensus 242 ~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~----~~~~~~~~~~~~~~~~~~~~~~ 317 (512) + .+...|.|.+..+...++....+..-..+.+.-.+.|-.+++..... ..+.....+..-.-... + .. T Consensus 171 --~----~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~-g-~~ 242 (421) T protein:vir:10 171 --S----LDGYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKWTDRYS-G-IN 242 (421) T ss_pred --C----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHHHHHhc-C-cc Confidence 0 01224677777666666655554444444455556676666531111 22222222211000000 0 00 Q ss_pred hcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccc-cchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 318 NRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQSGEAMKYKLFGLEQRTKTKEGLFT 396 (512) Q Consensus 318 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~Sg~Ai~~~~~~l~~k~~~~~~~~~ 396 (512) +.......+.+.+++.++.......+.+..+...+.|+..-++|....+.... +-|.. + ......+. T Consensus 243 n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~--e----------~~~~~f~~ 310 (421) T protein:vir:10 243 NMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNI--E----------HQGLQFVM 310 (421) T ss_pred ccCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCccccH--H----------HHHHHHHH Confidence 01111223455566666544444455666677788898888998766543321 11211 1 11123344 Q ss_pred HHHHHHHHHHHHHHHhccCCCcccccceeeEEe--CCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHH Q lcl|NC_010808. 397 KGLRRRAKLLETILKNTRSIDANKDFNTVRYVY--NRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKK 472 (512) Q Consensus 397 ~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f--~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~r 472 (512) ..|..++..|...++.+-.... ......+.| ..-+..|..+.++.+.++ +|+++.-.+++.++.-.-+. -+. T Consensus 311 ~tl~P~~~~ie~~ln~kL~~~~--~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~g--gD~ 386 (421) T protein:vir:10 311 YTLLAWLKRHEGALQRDLLLPS--ERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLPPIAG--GDK 386 (421) T ss_pred HHHHHHHHHHHHHHhhhccCcc--ccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cce Confidence 4555555555555544321111 122334445 444567888899988887 78999999998886522100 000 Q ss_pred HHHHHHHHHHHHHhhccc-CCCC-CCCCCCCCCCcCcccCC Q lcl|NC_010808. 473 IEEDEKESIKKAQKGIYK-DPRD-INDDEQDDDTKDTVDKK 511 (512) Q Consensus 473 i~~E~~~~~~~~~~~~~~-~~~~-~~~~~~~~~~~~~~~~~ 511 (512) .. .......... .+.+ .....+..+.++...+. T Consensus 387 ~~------~~~n~~~~~~~~~~~~~~~~~~~~e~d~~~~~~ 421 (421) T protein:vir:10 387 YL------TPLNMVDSAQIIPGDKKPTAQQMAEIDTILSRT 421 (421) T ss_pred ee------eccccccccccccCCCCcccccCcccccccccC Confidence 00 0000000000 0000 00011111111111111 No 175 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=97.58 E-value=4.2e-05 Score=44.55 Aligned_cols=395 Identities=12% Similarity=0.017 Sum_probs=173.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCcee--cCCc--hh Q lcl|NC_010808. 44 EVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQC--QDDD--KD 119 (512) Q Consensus 44 ~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~--~~~d--~~ 119 (512) .+-+.+.+.+...........-.+-|..+. ...... ..-+.+.-....|+.+++-+.+-|+.+ ..++ .. T Consensus 1 m~f~~~~~~~~~~~~~~~~~~~~~~g~~~~----~~~v~~---~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~~ 73 (409) T protein:vir:10 1 MLFRKGFKNQSQEISIDDKKILEWLGINPS----ETYVNG---KSCLKQATVFGCIRILSDNISKLPIKIYQKKDGIKRV 73 (409) T ss_pred CcccccccCcCCCCCCChHHHHHHhcCCcC----cceech---hhhhccHHHHHHHHHHHHhhhhCceEEEEecCCeeec Confidence 000000000000000000000001111000 000000 000222334455666666666667654 1111 11 Q ss_pred HHHHHHHHHh--cc---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeee Q lcl|NC_010808. 120 VLEAIEAFND--LN---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRT 193 (512) Q Consensus 120 ~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~ 193 (512) ....+..++. -| ....+...+..+.+.+|.||+++.++..|++ .+..++|..+-++.++........-+.|. . T Consensus 74 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~~~~~y~-~ 152 (409) T protein:vir:10 74 PDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLLNSENNVWYL-Y 152 (409) T ss_pred cCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCccccccceEEEE-E Confidence 1223444443 23 3345677788899999999999999998875 46778888887777653211111111111 0 Q ss_pred eeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHH Q lcl|NC_010808. 194 KPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (512) Q Consensus 194 ~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s 273 (512) ... . .. ...+.++.+++++... .+...|.|.++.+...++....+.. T Consensus 153 ~~~--~-g~----~~~~~~~evih~r~~~--------------------------~d~~~G~s~i~~~~~~i~~~~~~~~ 199 (409) T protein:vir:10 153 TDD--L-GQ----RHKFMSDEILHFKGLT--------------------------ADGLAGLSVIELLNHLIENGKSSET 199 (409) T ss_pred EeC--C-ce----eEEeccccEEEecCcC--------------------------CCCcccccHHHHHHHHHHHHHHHHH Confidence 000 0 00 0123334444332110 0112477777777777766555555 Q ss_pred HHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHH Q lcl|NC_010808. 274 DTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSD 353 (512) Q Consensus 274 ~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~ 353 (512) .....+.-.+.|-.+++.....+++..+..+..-.-... + ..+.....-.+.+.+++-+........+.+..+...+. T Consensus 200 ~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~-g-~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~ 277 (409) T protein:vir:10 200 YLNNFFKNGLQVKGLVQYAGDLNPEAEEVFKENFERMSS-G-LKNAHRIAMLPIGYKFEPISQKLVDAQFLENSQLTIRQ 277 (409) T ss_pred HHHHHHhccCCCcEEEEcCCCCCHHHHHHHHHHHHHHhc-c-ccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHH Confidence 555555656667666664333344433333221110000 0 00111112234455666555444445556667778888 Q ss_pred HHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccce--eeEEeCC Q lcl|NC_010808. 354 IHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNT--VRYVYNR 431 (512) Q Consensus 354 i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~--i~i~f~~ 431 (512) |+..-++|....+.... .++..++. .....+...|+.+++.|...++.+-..... ...+ +++.+.. T Consensus 278 Ia~~fgVPp~~lg~~~~-~~~~~~e~----------~~~~f~~~~l~P~~~~ie~~ln~kL~~~~~-~~~~~~~~fd~~~ 345 (409) T protein:vir:10 278 IASVFGVKMHQLNDLDR-ATHSNITE----------QNREFYIDTLQSILNMYELEINYKLFLISE-IKNGFYSKFNVDT 345 (409) T ss_pred HHHHhCCCHHHcCCCCC-CccccHHH----------HHHHHHHHHHHHHHHHHHHHHHHhhcCchh-ccCCcEEEEechh Confidence 99999999776653221 11111111 112334455555555555555432111111 1122 3333444 Q ss_pred CCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCc Q lcl|NC_010808. 432 NLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDT 507 (512) Q Consensus 432 ~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (512) -+-.|..+.++++.++ +|+++.-.++++++.-.-+- -+.+ . ......+ ....+++...+++. T Consensus 346 ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~g--gD~~----------~-~~~n~~~-~~~~~~~~~kgGe~ 409 (409) T protein:vir:10 346 ILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEG--GDVL----------L-INGNMIP-VKMAGEQYSKGGEK 409 (409) T ss_pred hhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCee----------e-eccCccc-hhhccccccccCCC Confidence 4567888899988887 68999988888886522100 0000 0 0000000 00001111111111 No 176 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=97.56 E-value=4.6e-05 Score=44.39 Aligned_cols=390 Identities=10% Similarity=-0.003 Sum_probs=179.8 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc--cccccccee-----eecchHHHHHHHHHhhhhccCc Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR--KEEYMADNR-----VAHDYASYISDFINGYFLGNPI 111 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~--~~~~~~~~r-----i~~n~~~~iv~~~a~~l~g~~~ 111 (512) ++.. .| ..+........+++.+...-+..+... .....-+.+ .......-.+.+....+.+.++ T Consensus 1 v~~~-~l--------~~e~at~~~~~d~~~~~~~~l~~~~~~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk~av~~~~w 71 (488) T protein:vir:99 1 MEKP-AL--------GREIATSGDGRDITRPFISGLQVPNDSILQRRGGNDLRVYEEILSDAQVKTVWGQRQLAVVSREW 71 (488) T ss_pred CCcc-ch--------hHHHHHHHhhhhhhccccCCCCCCChHHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHHhcCCc Confidence 0000 00 111111111122222111100000000 000000011 1235666777788888889998 Q ss_pred eecCCc-----hhHHHHHHHHHhccChhHHHHHHHHHHHhCCeE-EEEEEECCCCceEEEE---EccceeEEEEeCCCCc Q lcl|NC_010808. 112 QCQDDD-----KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKA-YELMIRNQDDETRLYK---SDAMSTFVIYDNTIER 182 (512) Q Consensus 112 ~~~~~d-----~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a-~~~v~~d~~g~~~i~~---~~p~~~~~i~d~~~~~ 182 (512) .+...+ .+..+.+.++++.-+|+..+..+. ++.-||.+ ++.+|...+|...+.. .+|... .|+... T Consensus 72 ~i~p~~~~~~~~~~ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f--~~d~~~-- 146 (488) T protein:vir:99 72 KVEAGGDRPIDQAAAEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRF--RYDQDG-- 146 (488) T ss_pred eEEcCCCChHHHHHHHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeecccce--eecCCC-- Confidence 886433 233467888887777887777765 68889975 5667765556544332 222211 111110 Q ss_pred eeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEee--cCCCCCCcchHH Q lcl|NC_010808. 183 NSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEF--SNNERRKGDYEK 260 (512) Q Consensus 183 ~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~--~n~~~g~s~~~~ 260 (512) . .++....... .....+.+++.+-.++. ..++.|.|.+.. T Consensus 147 ~--------------------------------l~~~~~~~~~------~g~~lp~~~~~i~~~~~~~~g~p~g~gLl~~ 188 (488) T protein:vir:99 147 G--------------------------------LRLLTPNNMF------EGEPCPAPYFWHFSTGADNDDEPYGLGLAHW 188 (488) T ss_pred c--------------------------------eEEeccCCCC------CccccccCceEEEEeecCCCCCcccchHHHH Confidence 0 0011000000 00111122222211111 235678899988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC-cCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeec-C Q lcl|NC_010808. 261 VITLIDLYDNAESDTANYMSDLNDAMLLIKGNL-SLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQ-Y 338 (512) Q Consensus 261 v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~ 338 (512) +....=.-+..+.+++..++.|+.|+++.+-.. +.++++...+... ............+.+.++++++.. . T Consensus 189 ~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~~ek~~l~~a-------v~~~~~~~~~viP~~~~ie~~ea~~~ 261 (488) T protein:vir:99 189 LYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATPEDKAKLLAA-------LHAIQTDSAIIMPAGMQAELLEAGRS 261 (488) T ss_pred HHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCHHHHHHHHHH-------HHHHhcCcEEEecCCceeEEeecCCC Confidence 777666667778889999999999999876321 2233332222111 011112222233556778888853 4 Q ss_pred CHHHHHHHHHHHHHHHHHHhccccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhccCC Q lcl|NC_010808. 339 DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR-RRAKLLETILKNTRSI 416 (512) Q Consensus 339 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~l~~~~~~ 416 (512) ....++.+++.+.+.|.+.--.-.++.+..+ +...|. .. ..-....+..-.+.+...+. +++..++.+ + .. T Consensus 262 ~~~~~~~li~~~d~~Isk~iLGqtlts~~~~Gs~a~~~-vh--~~v~~d~~~aDa~~i~~tln~~li~~l~~~-N---~~ 334 (488) T protein:vir:99 262 GTADYKTLHDTMDATIAKVGLGQVASTQGTPGRLGNDD-LQ--ADVRLDLVKADADLICESFNLGPARWLTEW-N---FP 334 (488) T ss_pred ChHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH-HH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-C---cC Confidence 5566899999999988775422222222222 222222 11 11223334444555666664 355555543 1 11 Q ss_pred CcccccceeeEEeCCCCCcCHHHHHHHHHHH---hcc-CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCC Q lcl|NC_010808. 417 DANKDFNTVRYVYNRNLPKSLIEELKAYIDS---GGK-ISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDP 492 (512) Q Consensus 417 ~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl---~g~-~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~ 492 (512) . . ....+.|...-+.|..+.++.+.++ .|+ ++.+.+.+.++.-. ++.+ +.. . ...... T Consensus 335 ~--~--~~p~~~~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~Gip~-~~~~--------~~~---~--~~~~~~ 396 (488) T protein:vir:99 335 G--A--QPPRVYRVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQETYGVEV-ESTQ--------AEA---T--APTPST 396 (488) T ss_pred C--c--CCceeEecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcCCCC-cccc--------ccc---c--cCCCcc Confidence 1 1 1235678778889999999998887 365 88888888886532 1110 000 0 000000 Q ss_pred CCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 493 RDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 493 ~~~~~~~~~~~~~~~~~~~e 512 (512) .........+......++-+ T Consensus 397 ~~~~~~~~~~~~~~~~~~~~ 416 (488) T protein:vir:99 397 EFAEGDQPSDPAAAMAPQLA 416 (488) T ss_pred cCCCCCCCCCchHHHHHHHH Confidence 00110000100000101100 No 177 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=97.55 E-value=4.7e-05 Score=44.33 Aligned_cols=384 Identities=8% Similarity=0.005 Sum_probs=165.0 Q ss_pred chhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeee Q lcl|NC_010808. 12 DLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVA 91 (512) Q Consensus 12 ~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~ 91 (512) =.|-..+++|+...-.. . .+........ ........+.|..... .. ...-+. T Consensus 1 m~m~~f~~~~~~~~~~~-~-~~~~~~~~~~-----------------~~~~~~~~~~~~~~~~------v~---~~~al~ 52 (392) T protein:vir:39 1 MILPILNFINQTNDPPE-V-GSVQSYFPDG-----------------NDAQIMESLLGDNNEW------VS---ARAALR 52 (392) T ss_pred Ccchhhhhhhccccccc-c-cccccccccC-----------------chhhhhhhhcCCCCce------ec---hHHhhc Confidence 12222223332211100 0 0000000000 0000000111110000 00 000012 Q ss_pred cchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccc Q lcl|NC_010808. 92 HDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAM 170 (512) Q Consensus 92 ~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~ 170 (512) .+-....|+.+++-+..-|+++.-. .....+.+=...-....+...+..+.+.+|.+|+++.++.+|++ .+..++|. T Consensus 53 ~~~v~~~i~~ia~~ia~lp~~~~~~--~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~ 130 (392) T protein:vir:39 53 NSDLFSIILQLSSDLAIVKINAEKK--KNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPS 130 (392) T ss_pred cHHHHHHHHHHHHhhccCceeeccc--hhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCc Confidence 2334456666666666666665322 22222222112223356667788899999999999999998986 57778888 Q ss_pred eeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecC Q lcl|NC_010808. 171 STFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSN 250 (512) Q Consensus 171 ~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n 250 (512) .+.+..+... ..+ +|.+....... .....+.++.++++.... ... T Consensus 131 ~v~~~~~~~~-~~~-----~y~~~~~~~~~----~~~~~~~~~eiih~~~~~-------------------------~~~ 175 (392) T protein:vir:39 131 QVNTYYFEYE-NGM-----YYNITFDDPKI----EPILQAPQSDLIHMKLLS-------------------------IDG 175 (392) T ss_pred eeEEEEcCCC-ceE-----EEEEEecCccc----ceeEEEccccEEEecCCC-------------------------CCC Confidence 8877765422 111 12111111000 011223444444432210 001 Q ss_pred CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCcCChhhhhhhhhccccccchhhhhhcccccCCCCC Q lcl|NC_010808. 251 NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIK--GNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGS 328 (512) Q Consensus 251 ~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~--g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (512) ...|.|.+..+...++....+..-....+.-...|-.+++ +....++.......+. +. + ..+.....-.+++ T Consensus 176 ~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~--~~---~-~~~~g~~~vl~~g 249 (392) T protein:vir:39 176 GKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRS--FM---K-RSRSGGPVVLDDL 249 (392) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHH--Hh---c-cccCCCeeecCCC Confidence 1247777777777666555554444444555556655543 2211111111111110 00 0 0111111223455 Q ss_pred cceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 329 VDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 408 (512) Q Consensus 329 ~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 408 (512) .+++.++.......+.+..+...+.|+..-++|....+..+.+.|... ..+..+...|..+++.|.. T Consensus 250 ~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~-------------~~~~f~~~~l~P~~~~ie~ 316 (392) T protein:vir:39 250 EEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQ-------------QISGMYASALNRYLRPAIS 316 (392) T ss_pred ceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHH-------------HHHHHHHHHHHHHHHHHHH Confidence 566666544445556677788888899988898776654333232211 1122344455555555544 Q ss_pred HHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 409 ILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLF---SFFQDPELEVKKIEEDEKESIKK 483 (512) Q Consensus 409 ~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~---~~v~d~~~E~~ri~~E~~~~~~~ 483 (512) .++.+-.. .+++....-.-.|..+.+..+.++ +|+++...+.+.+ |+..| |+.+. T Consensus 317 ~l~~~L~~-------~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~---e~r~~---------- 376 (392) T protein:vir:39 317 ELEYKLSD-------HISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK---DLPAP---------- 376 (392) T ss_pred HHHHhccc-------cccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcc---ccchh---------- Confidence 44432211 112222222334566777777776 6889987776654 65543 22210 Q ss_pred HHhhcccCCCCCCCCCCCCC Q lcl|NC_010808. 484 AQKGIYKDPRDINDDEQDDD 503 (512) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~ 503 (512) .... +.++.+..++.+ T Consensus 377 -e~l~---~~~~Gd~~~p~p 392 (392) T protein:vir:39 377 -ENTN---KKTTGQSNEPVP 392 (392) T ss_pred -cCCC---CCCCCCCCCCCC Confidence 0011 101111111111 No 178 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=97.55 E-value=4.7e-05 Score=44.33 Aligned_cols=384 Identities=8% Similarity=0.005 Sum_probs=165.0 Q ss_pred chhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeee Q lcl|NC_010808. 12 DLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVA 91 (512) Q Consensus 12 ~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~ 91 (512) =.|-..+++|+...-.. . .+........ ........+.|..... .. ...-+. T Consensus 1 m~m~~f~~~~~~~~~~~-~-~~~~~~~~~~-----------------~~~~~~~~~~~~~~~~------v~---~~~al~ 52 (392) T protein:vir:10 1 MILPILNFINQTNDPPE-V-GSVQSYFPDG-----------------NDAQIMESLLGDNNEW------VS---ARAALR 52 (392) T ss_pred Ccchhhhhhhccccccc-c-cccccccccC-----------------chhhhhhhhcCCCCce------ec---hHHhhc Confidence 12222223332211100 0 0000000000 0000000111110000 00 000012 Q ss_pred cchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccc Q lcl|NC_010808. 92 HDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAM 170 (512) Q Consensus 92 ~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~ 170 (512) .+-....|+.+++-+..-|+++.-. .....+.+=...-....+...+..+.+.+|.+|+++.++.+|++ .+..++|. T Consensus 53 ~~~v~~~i~~ia~~ia~lp~~~~~~--~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~ 130 (392) T protein:vir:10 53 NSDLFSIILQLSSDLAIVKINAEKK--KNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPS 130 (392) T ss_pred cHHHHHHHHHHHHhhccCceeeccc--hhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCc Confidence 2334456666666666666665322 22222222112223356667788899999999999999998986 57778888 Q ss_pred eeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecC Q lcl|NC_010808. 171 STFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSN 250 (512) Q Consensus 171 ~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n 250 (512) .+.+..+... ..+ +|.+....... .....+.++.++++.... ... T Consensus 131 ~v~~~~~~~~-~~~-----~y~~~~~~~~~----~~~~~~~~~eiih~~~~~-------------------------~~~ 175 (392) T protein:vir:10 131 QVNTYYFEYE-NGM-----YYNITFDDPKI----EPILQAPQSDLIHMKLLS-------------------------IDG 175 (392) T ss_pred eeEEEEcCCC-ceE-----EEEEEecCccc----ceeEEEccccEEEecCCC-------------------------CCC Confidence 8877765422 111 12111111000 011223444444432210 001 Q ss_pred CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCcCChhhhhhhhhccccccchhhhhhcccccCCCCC Q lcl|NC_010808. 251 NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIK--GNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGS 328 (512) Q Consensus 251 ~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~--g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (512) ...|.|.+..+...++....+..-....+.-...|-.+++ +....++.......+. +. + ..+.....-.+++ T Consensus 176 ~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~--~~---~-~~~~g~~~vl~~g 249 (392) T protein:vir:10 176 GKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRS--FM---K-RSRSGGPVVLDDL 249 (392) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHH--Hh---c-cccCCCeeecCCC Confidence 1247777777777666555554444444555556655543 2211111111111110 00 0 0111111223455 Q ss_pred cceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 329 VDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 408 (512) Q Consensus 329 ~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 408 (512) .+++.++.......+.+..+...+.|+..-++|....+..+.+.|... ..+..+...|..+++.|.. T Consensus 250 ~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~-------------~~~~f~~~~l~P~~~~ie~ 316 (392) T protein:vir:10 250 EEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQ-------------QISGMYASALNRYLRPAIS 316 (392) T ss_pred ceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHH-------------HHHHHHHHHHHHHHHHHHH Confidence 566666544445556677788888899988898776654333232211 1122344455555555544 Q ss_pred HHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 409 ILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLF---SFFQDPELEVKKIEEDEKESIKK 483 (512) Q Consensus 409 ~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~---~~v~d~~~E~~ri~~E~~~~~~~ 483 (512) .++.+-.. .+++....-.-.|..+.+..+.++ +|+++...+.+.+ |+..| |+.+. T Consensus 317 ~l~~~L~~-------~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~---e~r~~---------- 376 (392) T protein:vir:10 317 ELEYKLSD-------HISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK---DLPAP---------- 376 (392) T ss_pred HHHHhccc-------cccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcc---ccchh---------- Confidence 44432211 112222222334566777777776 6889987776654 65543 22210 Q ss_pred HHhhcccCCCCCCCCCCCCC Q lcl|NC_010808. 484 AQKGIYKDPRDINDDEQDDD 503 (512) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~ 503 (512) .... +.++.+..++.+ T Consensus 377 -e~l~---~~~~Gd~~~p~p 392 (392) T protein:vir:10 377 -ENTN---KKTTGQSNEPVP 392 (392) T ss_pred -cCCC---CCCCCCCCCCCC Confidence 0011 101111111111 No 179 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=97.54 E-value=4.8e-05 Score=44.24 Aligned_cols=438 Identities=12% Similarity=0.058 Sum_probs=169.2 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecc----------c--------chhHHhhhcHHHHHHHHHHHHHHHHHHH-- Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTY----------D--------GTESDLLQNINEVSKYIEHHMDYQRPRL-- 60 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~----------~--------~~~~~~~~~~~~l~~~i~~~~~~~~~r~-- 60 (512) ||+.-- .-++-.+....|++..+....- . +.......+.+ .....|. T Consensus 1 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~~----------~~~~~r~g~ 67 (648) T protein:vir:79 1 MARKVW---GRGFWSRISLMWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDPK----------MSLVKRIGL 67 (648) T ss_pred Cccchh---cchhhhhhhhhccCccccccccccccccccCCCccccCCCCcccccccccch----------hHHHHHhHH Confidence 554211 1122233344455443332000 0 00000111111 0011111 Q ss_pred HHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHH--HHH-HHhcc---Chh Q lcl|NC_010808. 61 KVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEA--IEA-FNDLN---DVE 134 (512) Q Consensus 61 ~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~--l~~-~~~~n---~~~ 134 (512) .-...-+ |.++. ..+..-.... ....-..++....|+..+.-+.+-++.+...++...+. ... +..-| ... T Consensus 68 ~~~~~~~-g~~~~-~epp~d~~~l-~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~~~~~~~~~~ll~rPn~~~t~~ 144 (648) T protein:vir:79 68 AIMDGGG-GGRDF-EEPEFDFNEI-TSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPNAVEYIRMRFTLMAEATQIPTN 144 (648) T ss_pred HHHhhcC-Ccccc-ccCCcCHHHH-HHHHhcChHHHHHHHHHHHHHhhCcceEEecCCccchhhHHHHHhhccCCCCCHH Confidence 0011111 21221 1111100000 00111346677788888888888887776544321111 111 12222 445 Q ss_pred HHHHHHHHHHHhCCeEEEEEEECCCCceE----------------EEEEccceeEEEEeCCCCceeEEEEEEeeeeeecc Q lcl|NC_010808. 135 SHNRSLGLDLSIYGKAYELMIRNQDDETR----------------LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDK 198 (512) Q Consensus 135 ~~~~~~~~~~~~~G~a~~~v~~d~~g~~~----------------i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~ 198 (512) .+...+..+.+.+|.+|+.+-.+.+|.+- +..++|..+.+..++.. .+ .+|..... T Consensus 145 ~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g--~~----~~Y~y~~~-- 216 (648) T protein:vir:79 145 QLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFG--MI----KGWQQEQE-- 216 (648) T ss_pred HHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEEcCCC--ce----eeeEEEec-- Confidence 67788899999999999999888887431 11123333322222110 00 00100000 Q ss_pred CCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeec-----CCCCCCcchHHHHHHHHHHHHHHH Q lcl|NC_010808. 199 TDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAES 273 (512) Q Consensus 199 ~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~-----n~~~g~s~~~~v~~liDa~~~~~s 273 (512) +. ...+ .|.++. |++++ +...|.|.+..+...|+....+.. T Consensus 217 -g~--~~~~-~~~~~d------------------------------IIHik~~~~~d~~~GlSpi~~a~~aI~l~~aa~~ 262 (648) T protein:vir:79 217 -GQ--DKPQ-KFKPED------------------------------IVHIYYKREKGRAFGTPWLLPALDDIRALRQVEE 262 (648) T ss_pred -CC--ceeE-EecCcc------------------------------EEEEccCCCCCCceeccHHHHHHHHHHHHHHHHH Confidence 00 0000 011222 33432 223588888877777766666555 Q ss_pred HHHHHHHHhcCceeeeecCCcCC-hhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeec--CC--HHHHHHHHH Q lcl|NC_010808. 274 DTANYMSDLNDAMLLIKGNLSLD-PDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQ--YD--VQGTEAYKD 348 (512) Q Consensus 274 ~~~~~~~~~~~~~lv~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~--~~--~~~~~~~~~ 348 (512) .....+.-.+.|-.+++--.+.. .+..+...+. +... ... .. ..+++.+.+.+... .. ...+.+..+ T Consensus 263 ~~~~fF~NGa~P~gil~~~~~~~~~e~~k~~~e~----~~~~--~~~-~~-i~gg~v~~~~~~i~~~~s~~dlqfle~rk 334 (648) T protein:vir:79 263 NVLRLVYRNLHPLWHVKVGLEQEGFGAEEGEVDL----VRGE--VEN-MD-VEGGMVTTERVNISSIASNQIIDAKEYLK 334 (648) T ss_pred HHHHHHhccCCccEEEEeCCCccchHHHHHHHHH----HHHh--ccc-cc-ccccccccceeeccccCCHHHHHHHHHHH Confidence 55555666677766664211111 1111111110 0000 000 00 01111122222111 11 223556667 Q ss_pred HHHHHHHHHhcccccccccccc-c-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC--cccccce Q lcl|NC_010808. 349 RLNSDIHMFTNTPNMKDDNFSG-T-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID--ANKDFNT 424 (512) Q Consensus 349 ~l~~~i~~~s~~p~~~~~~~~~-n-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~--~~~d~~~ 424 (512) ...+.|...-++|....+...+ + .++.+....+.. .+.-.+..+...+...+. -.++ .....+ ...+ .. T Consensus 335 ~~~~eIa~aFgVPP~lLG~~~~ss~stae~~~~~~~~---~i~~l~~~i~~~le~~~~--~~ll-~e~~l~~~l~~d-~~ 407 (648) T protein:vir:79 335 HFEQRAFTVLGVSELMMGRGGTASRSTGDNLSSDFKD---RIKALQKVMATFINEFMV--KEIL-MEGGFDPVLNPD-DK 407 (648) T ss_pred HHHHHHHHHhCCCHhHcccCCCccchHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH--HHHh-hhhhcccccccc-ce Confidence 7788899999999876653221 1 223333222211 122222222222222110 0000 000000 0111 23 Q ss_pred eeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHH-HHHhhcccCCCC-CCCC Q lcl|NC_010808. 425 VRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFF--QDPELEVKKIEEDEKESIK-KAQKGIYKDPRD-INDD 498 (512) Q Consensus 425 i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v--~d~~~E~~ri~~E~~~~~~-~~~~~~~~~~~~-~~~~ 498 (512) +++.|++-...|....++.+.++ +|++|...++++++.- ++.... ..+....-.... .........+.. ...+ T Consensus 408 ieF~~~~Llr~D~~~~a~~~~~l~~~GilT~NEaR~~lGlpPi~~g~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (648) T protein:vir:79 408 VEFRFNEIDMDSKIKLENQAVFLYEHNAISEDEMRELIGRDPVDDGEGR-AKMHLQMVTIAQATALAALAPTPAGGSSAS 486 (648) T ss_pred EEEeecccchhhHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCc-cccccccccchhccccccCCCCCCCCCCCC Confidence 66777777777888888888776 7999999999988652 221110 011110000000 000000000000 0000 Q ss_pred CC--------CCCCcCcccCC-C Q lcl|NC_010808. 499 EQ--------DDDTKDTVDKK-E 512 (512) Q Consensus 499 ~~--------~~~~~~~~~~~-e 512 (512) +. +..++.+...+ + T Consensus 487 a~~eg~~~e~~~~~~~~~~~g~~ 509 (648) T protein:vir:79 487 ASGDKKKKATDNKTKPTNQHGTK 509 (648) T ss_pred ccccccccccCCCCCCCCCCCcC Confidence 00 11111111111 1 No 180 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=97.53 E-value=5e-05 Score=44.18 Aligned_cols=418 Identities=10% Similarity=0.052 Sum_probs=175.8 Q ss_pred HHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCcee-- Q lcl|NC_010808. 36 SDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQC-- 113 (512) Q Consensus 36 ~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~-- 113 (512) .++.+ .+. ..-. -.....+.+. ..|.+....-........ .........+.....|+.+++-+..-|+.+ T Consensus 1 ~~~~~-~~~-~~~p--~~~e~~~~~~---~~~~~~~~~~~~~~~~~~-~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~ 72 (518) T protein:vir:10 1 MLLAN-GQT-LSAP--AMAELSPQMQ---DSYYYAPAVGMQLERQFS-LYGGIYKNQPWVRTVIAKRAQALARLPVKCMF 72 (518) T ss_pred CcccC-cee-ecCc--hhhhhhhhhh---cccccccccceecccccc-hhhHHHhhhHHHHHHHHHHHHhhccCceEEEE Confidence 11111 110 0000 0001111111 111111100000000000 000001122344566777777666666654 Q ss_pred -cCCc--hhHHHHHHHHHhc-cC---hhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeE Q lcl|NC_010808. 114 -QDDD--KDVLEAIEAFNDL-ND---VESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSI 185 (512) Q Consensus 114 -~~~d--~~~~~~l~~~~~~-n~---~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~ 185 (512) +.+. +.....+..++.. |. ...+...+..+.+.+|.+|+++.++.+|++ .+..++|..+.+..+... ... T Consensus 73 ~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~~~~-~~~- 150 (518) T protein:vir:10 73 TSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT-GRY- 150 (518) T ss_pred EcCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEcCCC-CEE- Confidence 1111 1122334445433 32 335666788889999999999999999986 478889998888776432 111 Q ss_pred EEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCC-CCCCcchHHHHHH Q lcl|NC_010808. 186 AGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNN-ERRKGDYEKVITL 264 (512) Q Consensus 186 ~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~-~~g~s~~~~v~~l 264 (512) +|......... ... ..+.++.+++++... .+. ..|.|.+..+... T Consensus 151 ----~y~~~~~~~~~---~~~-~~~~~~eViHir~~s--------------------------~dg~~~G~spi~~a~~~ 196 (518) T protein:vir:10 151 ----EYYFQAGAGVG---TQL-VSFADDEVVPIRFFN--------------------------PDGLERGLSLMESLKST 196 (518) T ss_pred ----EEEEEecCCcc---ceE-EEecCCcEEEecCCC--------------------------CCcccccccHHHHHHHH Confidence 11111111111 011 123444555442210 011 2467777666666 Q ss_pred HHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHH Q lcl|NC_010808. 265 IDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTE 344 (512) Q Consensus 265 iDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 344 (512) +.....+.....+.+.-.+.|-.+++.....+++..+..+..-.-.... ..+.......+.+.++..++.......+. T Consensus 197 i~~~~a~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~k~~~~~~~~G--~~nag~v~vL~~G~~~~~l~~s~~D~q~l 274 (518) T protein:vir:10 197 IFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSG--SSNTGKTMVVEEGMEPIPLQLTAVEMQFI 274 (518) T ss_pred HHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcC--ccccCcceEcCCCceEEEccCChhHHHHH Confidence 6555555555555555566676666654334444433333211100000 01111112234455665555433344455 Q ss_pred HHHHHHHHHHHHHhcccccccccccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccc Q lcl|NC_010808. 345 AYKDRLNSDIHMFTNTPNMKDDNFSG-TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFN 423 (512) Q Consensus 345 ~~~~~l~~~i~~~s~~p~~~~~~~~~-n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~ 423 (512) +..+.....|...-++|....+...+ +-|. ++. .....+...|.-+++.|...++..-..... ... T Consensus 275 e~r~~~~~eIa~afgVPp~~lg~~~~~t~sn--~eq----------~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~-~~~ 341 (518) T protein:vir:10 275 EARQLNREEVCGVYDIAPPIVHILDRATFSN--ISA----------QMRAFYRDTMAIPIARIQSAMDKYVGQYWV-RKN 341 (518) T ss_pred HHHHHHHHHHHHHhCCCHHHhccCCCCCchh--HHH----------HHHHHHHHHHHHHHHHHHHHHHHhhccccc-CCc Confidence 66677778888888998766653322 1221 111 112233444444444444444332111100 111 Q ss_pred eeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHH-H------HHHHHHHHHHHHHHHHhhcccCC Q lcl|NC_010808. 424 TVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSF--FQDPEL-E------VKKIEEDEKESIKKAQKGIYKDP 492 (512) Q Consensus 424 ~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~--v~d~~~-E------~~ri~~E~~~~~~~~~~~~~~~~ 492 (512) .+++....-+..|..+.++++.++ +|+++.-.++++++. ++++.. + +..+..-.....+.......+++ T Consensus 342 ~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD~~~~~~n~~pl~~~~~~~~~g~~~~~~~~~ 421 (518) T protein:vir:10 342 RMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRP 421 (518) T ss_pred eEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccceecccccccccCCCCCCCCCCC Confidence 244444455678889999988886 689999888888864 232211 1 01111000000000000000000 Q ss_pred CC----CCCCC--C------CCCCcCcccCCC Q lcl|NC_010808. 493 RD----INDDE--Q------DDDTKDTVDKKE 512 (512) Q Consensus 493 ~~----~~~~~--~------~~~~~~~~~~~e 512 (512) .. ..++. + ...+++..+.++ T Consensus 422 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (518) T protein:vir:10 422 ASTPVASLDQSPPTSVPGLSPTNSDRSTDSGK 453 (518) T ss_pred CccccccccccccccCCCCCcccccccccccc Confidence 00 00000 0 000111111111 No 181 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=97.51 E-value=5.3e-05 Score=44.04 Aligned_cols=420 Identities=10% Similarity=0.050 Sum_probs=176.1 Q ss_pred HHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecC Q lcl|NC_010808. 36 SDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQD 115 (512) Q Consensus 36 ~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~ 115 (512) .++.+ . .++.-.. .....+ ...+.|-|....-........ .........+.....|+.+++-+.+-|+.+-- T Consensus 1 ~~~~~-~-~~~~~p~--~~~~~~---~~~~~~~~~~~~g~~~~~~~~-~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~ 72 (518) T protein:vir:78 1 MLLAN-G-QTLSAPA--MAELSP---QMQDSYYYAPAVGMQLERQFS-LYGGIYKNQPWVRTVIAKRAQALARLPVKCMF 72 (518) T ss_pred CcccC-c-eeeccch--hhhhhh---hhhhcccccceeceecccccc-hhhHHhhhhHHHHHHHHHHHHhhccCceEEEE Confidence 21211 1 1111100 001111 111211111100000000000 00000112234456677777777666766411 Q ss_pred --Cc---hhHHHHHHHHHhc-c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeE Q lcl|NC_010808. 116 --DD---KDVLEAIEAFNDL-N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSI 185 (512) Q Consensus 116 --~d---~~~~~~l~~~~~~-n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~ 185 (512) ++ +.....+..++.. | ....+...+..+.+.+|.+|+++-++..|++ .+..++|..+.+..+... ... T Consensus 73 ~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~~~~~-~~~- 150 (518) T protein:vir:78 73 TSGDTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT-GRY- 150 (518) T ss_pred EcCCccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEEcCCC-CEE- Confidence 11 1112234444433 3 2335667788888899999999999998886 477888988888776432 111 Q ss_pred EEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHH Q lcl|NC_010808. 186 AGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLI 265 (512) Q Consensus 186 ~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~li 265 (512) +|......... ...+ .+.++.|++++... | .....|.|.+..+...+ T Consensus 151 ----~y~~~~~~~~~---~~~~-~~~~~eIiHir~~~---------------------~----dg~~~G~Spi~~~~~~i 197 (518) T protein:vir:78 151 ----EYYFQAGAGVG---TQLV-SFADDEVVPIRFFN---------------------P----DGLERGLSLMESLKSTI 197 (518) T ss_pred ----EEEEEecCCcc---ceeE-EecCCcEEEecCCC---------------------C----CcccccccHHHHHHHHH Confidence 11111111110 0111 23444444432210 0 00113667776666666 Q ss_pred HHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHH Q lcl|NC_010808. 266 DLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEA 345 (512) Q Consensus 266 Da~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 345 (512) .....+.....+.+.-.+.|-.+++.....+++..+..+..-.-.... ..+.......+.+.++..++.......+.+ T Consensus 198 ~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~ls~e~~~~~k~~~~~~~~G--~~nag~~~vL~~G~~~~~l~~~~~d~q~le 275 (518) T protein:vir:78 198 FSEDSSRNATAAMWKNAGRPNLVLRHEKRLSPEAQQRLREQFDRAHAG--SSNTGKTMVVEEGMEPIPLQLTAVEMQFIE 275 (518) T ss_pred HHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcC--cccCCceeEcCCCceEEeccCChhHHHHHH Confidence 655555555555556666777777654444444443333211100000 001111122344556655554333444556 Q ss_pred HHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccee Q lcl|NC_010808. 346 YKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTV 425 (512) Q Consensus 346 ~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i 425 (512) ..+.....|+..-++|....+...+. +...++. .....+...|.-++..|...++..-..... ....+ T Consensus 276 ~r~~~~~eIa~afgVPp~~lg~~~~s-t~sn~e~----------~~~~f~~~tL~P~~~~ie~eln~~L~~~~~-~~~~~ 343 (518) T protein:vir:78 276 ARQLNREEVCGVYDIAPPIVHILDRA-TFSNISA----------QMRAFYRDTMAIPIARIQSAMDKYVGQYWV-RKNRM 343 (518) T ss_pred HHHHHHHHHHHHhCCCHHHhccCCCC-CchhHHH----------HHHHHHHHHHHHHHHHHHHHHHHhhccccc-CcceE Confidence 66677788888888987665433221 1111111 112233334444444444434322111000 11123 Q ss_pred eEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHH-H------HHHHHHHHHHHHHHHHhhcccCCCC Q lcl|NC_010808. 426 RYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSF--FQDPEL-E------VKKIEEDEKESIKKAQKGIYKDPRD 494 (512) Q Consensus 426 ~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~--v~d~~~-E------~~ri~~E~~~~~~~~~~~~~~~~~~ 494 (512) ++..+.-+..|..+.++++.++ +|+++.-.++++++. ++++.. + +..+..-.....+........++.. T Consensus 344 ~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~gD~~~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~ 423 (518) T protein:vir:78 344 KFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPAS 423 (518) T ss_pred EeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccceecccccccccCCCCCCCCCCCCc Confidence 4444455678889999998887 689999888888764 332211 1 0111000000000000000000000 Q ss_pred ------------CCCCCCCCCCcCcccCCC Q lcl|NC_010808. 495 ------------INDDEQDDDTKDTVDKKE 512 (512) Q Consensus 495 ------------~~~~~~~~~~~~~~~~~e 512 (512) .........+++..+.++ T Consensus 424 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (518) T protein:vir:78 424 TPVASLDQSPPASVPGLSPTNSDRSTDSGK 453 (518) T ss_pred ccccccccCccccCCCCCcccccccccccc Confidence 000000001111111111 No 182 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=97.51 E-value=5.4e-05 Score=43.97 Aligned_cols=377 Identities=10% Similarity=-0.001 Sum_probs=155.2 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri 90 (512) |+++ ++++....... .. ........... ...+..+...+ ... .-+ T Consensus 1 Mglf---~~~~~~~~~~~----~~-------~~~~~~~~~~~----------~~~~~~~~~~v--------~~~---~al 45 (384) T protein:vir:49 1 MPIF---NITNLATESPP----SN-------QDSFFDITDPE----------FLDALNGSEWV--------SAE---TAL 45 (384) T ss_pred Cccc---cccccCccccc----cc-------chhhccccchh----------hcccccCCcee--------chh---hhh Confidence 5443 33332211110 00 00000000000 00011110000 000 001 Q ss_pred ecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEcc Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDA 169 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p 169 (512) ...-....|+.+++-+.+-|+++.- ......+.+=........+...+..+++.+|.+|+++-++..|++ .+..++| T Consensus 46 ~~~~V~~~i~~Ia~~ia~l~~~~~~--~~~~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~ 123 (384) T protein:vir:49 46 KNSDLFSIISQLSNDLATAKITTSR--KQLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRP 123 (384) T ss_pred ccHHHHHHHHHHHHHHhhCceeeec--chhhhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcC Confidence 2223345567777766666776532 222222222222234566777888899999999999999998875 5777888 Q ss_pred ceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeec Q lcl|NC_010808. 170 MSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFS 249 (512) Q Consensus 170 ~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~ 249 (512) ..+.+..++.. ..+ +|.+...+.... .. ..+.++.|++++... .. T Consensus 124 ~~v~v~~~~~~-~~~-----~y~~~~~~~~~~---~~-~~~~~~eVih~~~~~-------------------------~~ 168 (384) T protein:vir:49 124 SQVSFNRLDNQ-NGL-----YYNITFDDPRIP---PK-QHVPQGDILHFRLLS-------------------------VD 168 (384) T ss_pred ceeEEEEcCCC-ceE-----EEEEEecCcccc---ce-eEecCccEEEecCCC-------------------------CC Confidence 88877765422 111 121111110000 00 123444444442210 00 Q ss_pred CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCc Q lcl|NC_010808. 250 NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSV 329 (512) Q Consensus 250 n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (512) ....|.|.+..+...++....+.....+.+...+.|-.+++--.....++......... ....+.......+++. T Consensus 169 ~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~-----~~~~n~~~~~vl~~g~ 243 (384) T protein:vir:49 169 GGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQSRSRQ-----AMKQMQGGPLVLDDLE 243 (384) T ss_pred CceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHH-----hcccCCccceecCCCc Confidence 11247777777777776666555555555666667766665322222222111111110 0111112222334555 Q ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhccccccccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 330 DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGT-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 408 (512) Q Consensus 330 ~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 408 (512) ++..++.......+.+..+.+.+.|+..-++|....+..+++ .++..++..+...+ ...++.+...+.. T Consensus 244 ~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i----------~~~l~pi~~~i~~ 313 (384) T protein:vir:49 244 DFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAV----------SRFLRPFVSELSK 313 (384) T ss_pred eEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHH----------HHHHHHHHHHHHH Confidence 655555444445556677788889999999997766543322 33443333222211 1111111111111 Q ss_pred HHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 409 ILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLF---SFFQDPELEVKKIEEDEKESIKK 483 (512) Q Consensus 409 ~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~---~~v~d~~~E~~ri~~E~~~~~~~ 483 (512) .+... +.....+..-.+.......+..+ +|+.++-.+++.+ |+.++ |+.++. T Consensus 314 ~l~~~-----------l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~n---e~r~~~--------- 370 (384) T protein:vir:49 314 KLSCE-----------VDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILPK---DLPEGE--------- 370 (384) T ss_pred Hhchh-----------hhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCCh---hHHHHc--------- Confidence 11100 00000011111111222222222 5677776666554 44432 222110 Q ss_pred HHhhcccCCCCCCCCCCCCCCcCcc Q lcl|NC_010808. 484 AQKGIYKDPRDINDDEQDDDTKDTV 508 (512) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (512) ...+ .++. ++.|.- T Consensus 371 -----~~~p--~~gG----d~~~~~ 384 (384) T protein:vir:49 371 -----TDST--LKGG----ETNEQY 384 (384) T ss_pred -----CCCC--CCCC----CCCCCC Confidence 0000 0000 011111 No 183 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=97.50 E-value=5.5e-05 Score=43.95 Aligned_cols=396 Identities=9% Similarity=0.044 Sum_probs=176.7 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc---------ccee--eecchHHHHHHHHHhhhh Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYM---------ADNR--VAHDYASYISDFINGYFL 107 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~---------~~~r--i~~n~~~~iv~~~a~~l~ 107 (512) .+.++.-.++ ..++..+.+++.++.|................ ...+ +.+.-....|+.+++-+. T Consensus 1 ~~~~~~~~~~-----~~~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA 75 (424) T protein:vir:18 1 MEEPKYTIDL-----RTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTA 75 (424) T ss_pred CCCCcccccc-----CCCCchHHHHHhhccccccccccchhhccccccccccccccccHHHhhccHHHHHHHHHHHHhhc Confidence 1112211111 12233455566666664321110000000000 0011 112223345666777666 Q ss_pred ccCcee-cC-Cch---h--HHHHHHHHHhc--c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEE Q lcl|NC_010808. 108 GNPIQC-QD-DDK---D--VLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFV 174 (512) Q Consensus 108 g~~~~~-~~-~d~---~--~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~ 174 (512) +-|+.+ .. .+. . ....+..++.. | .-..+...+..+.+.+|.+|+++-++..|++ .+..++|..+.+ T Consensus 76 ~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v 155 (424) T protein:vir:18 76 CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDV 155 (424) T ss_pred cCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEE Confidence 667654 11 111 1 22334555432 3 3345667788899999999999988888875 466778888766 Q ss_pred EEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCC Q lcl|NC_010808. 175 IYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERR 254 (512) Q Consensus 175 i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g 254 (512) ..+.. ... |.+.. + + ..+ .|.++.+++++... .+...| T Consensus 156 ~~~~~---~~~-----y~~~~-~-g-----~~~-~~~~~eVihir~~~--------------------------~dg~~G 193 (424) T protein:vir:18 156 KLVGK---KVV-----YRYQR-D-S-----EYA-DFSQKEIFHLKGFG--------------------------FTGLVG 193 (424) T ss_pred EEcCC---eEE-----EEEEe-C-C-----eEE-EeccccEEEecCcC--------------------------CCCccc Confidence 54421 111 11110 0 0 111 24444554442210 011236 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc-CChhhhhhhhhccccccchhhhhhcccccCCCCCcceeE Q lcl|NC_010808. 255 KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLS-LDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGY 333 (512) Q Consensus 255 ~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (512) .|.+..+...++....+.....+.+.-.+.|-.+++.... .+++.....+..-. ..... .+.......+++.+++. T Consensus 194 ~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~-~~~~~--~nag~~~vl~~g~~~~~ 270 (424) T protein:vir:18 194 LSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFK-EIAGG--PVKKRLWILEAGFSTSA 270 (424) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCHHHHHHHHHHHH-HHhCC--cccCCceeccCCceEEe Confidence 6666655555554444444444445555566556553222 23333222222110 00000 11111122344556665 Q ss_pred EeecCCHHHHHHHHHHHHHHHHHHhccccccccccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010808. 334 IYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGT-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKN 412 (512) Q Consensus 334 l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~ 412 (512) ++.......+.+..+.....|+..-++|....+...++ .+|..++.... ..+...|..+++.|..-++. T Consensus 271 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~----------~f~~~tl~P~~~~ie~~ln~ 340 (424) T protein:vir:18 271 IGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL----------GFLQYTLQPYISRWENSIQR 340 (424) T ss_pred cCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHH----------HHHHHHHHHHHHHHHHHHHh Confidence 55444445556667777888988889997776544332 22333332221 23344555555555554443 Q ss_pred ccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_010808. 413 TRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYK 490 (512) Q Consensus 413 ~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~ 490 (512) +-.........-+++.+..-+..|..+.++.+.++ .|+++.-.++++++.-.-+....-.+...- .+. ..... T Consensus 341 ~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~~~~n~---~~l--~~~~~ 415 (424) T protein:vir:18 341 WLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPLPGGDVAMRQAQY---VPI--TDLGT 415 (424) T ss_pred hcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccCc---cch--hhhhc Confidence 22111111112244444555678889999998887 689999888888865321100000000000 000 00000 Q ss_pred CCCCCCCCCCCCCCcCcc Q lcl|NC_010808. 491 DPRDINDDEQDDDTKDTV 508 (512) Q Consensus 491 ~~~~~~~~~~~~~~~~~~ 508 (512) ...+.+.. . T Consensus 416 ~~~~~~n~---------a 424 (424) T protein:vir:18 416 NKEPRNNG---------A 424 (424) T ss_pred cCCccccC---------C Confidence 00001000 0 No 184 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=97.50 E-value=5.5e-05 Score=43.93 Aligned_cols=395 Identities=8% Similarity=-0.007 Sum_probs=170.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCcee-c-CCch-- Q lcl|NC_010808. 43 NEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQC-Q-DDDK-- 118 (512) Q Consensus 43 ~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~-~-~~d~-- 118 (512) --+.++........ +. .......|-.......... ..+..-+...-....|+.++.-+.+-|+.+ . ..+. T Consensus 1 m~~~~~~~~~~~~~--~~-~~~~~~~~~~~~~~~~g~~---v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~ 74 (419) T protein:vir:57 1 MFIPQFWKGRPSEN--RV-NWQVVPGGMRSSSSQAGVI---ITPETALALSAVRACVTLLAESVAQLPCVLYRRTENGGR 74 (419) T ss_pred CcchhhhccCCccc--cc-cccccccccccccccCCce---echHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCce Confidence 00111111000000 00 0000000000000000000 000011222334566777777666667664 1 1111 Q ss_pred h--HHHHHHHHHh--cc---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEE Q lcl|NC_010808. 119 D--VLEAIEAFND--LN---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRY 190 (512) Q Consensus 119 ~--~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~ 190 (512) + ....|.+++. -| ....+...+..+.+.+|.+|+++.++.+|++ .+..++|..+.+..+.. . ..+ T Consensus 75 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~~~--g-----~~~ 147 (419) T protein:vir:57 75 EIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKGPD--G-----MPY 147 (419) T ss_pred eccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEECCC--c-----eEE Confidence 1 1223555543 22 3455667788899999999999999998985 56778888887665432 1 122 Q ss_pred eeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHH Q lcl|NC_010808. 191 LRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDN 270 (512) Q Consensus 191 ~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~ 270 (512) |.... . . .++..+.+++++.. | .+...|.|.+..+...++.... T Consensus 148 y~~~~---~--~-----~~~~~~~vih~r~~----------------------~----~d~~~G~s~i~~~~~~i~~~~~ 191 (419) T protein:vir:57 148 YDIPS---I--G-----EILPMRMVHHIKSF----------------------S----LDGYIGTSPIQTNPDVLGLGIA 191 (419) T ss_pred EEEcC---C--c-----eEEchhhEEEecCc----------------------C----CCCcccccHHHHHHHHHHHHHH Confidence 22210 0 0 12333333333210 0 0122477877777777766555 Q ss_pred HHHHHHHHHHHhcCceeeeecCC----cCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHH Q lcl|NC_010808. 271 AESDTANYMSDLNDAMLLIKGNL----SLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAY 346 (512) Q Consensus 271 ~~s~~~~~~~~~~~~~lv~~g~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 346 (512) +..-....+.-.+.|-.+++.-. ..+++.....+..-.-.... ..+.......+.+.+++-++.......+.+. T Consensus 192 ~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g--~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~ 269 (419) T protein:vir:57 192 VEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTERYGG--VRNAFSVGMLQEGMTYKQLSQDNEKAQLLQS 269 (419) T ss_pred HHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHHHhcc--ccccccceecCCCceEEEcCCChhhHHHHHH Confidence 54444444555556665554211 11222222222110000000 0001111123445566555544445556667 Q ss_pred HHHHHHHHHHHhccccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccee Q lcl|NC_010808. 347 KDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTV 425 (512) Q Consensus 347 ~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i 425 (512) .+...+.|+..-++|....+... ++-|+ ++ ......+...|..++..|...+..+-.... ...+. T Consensus 270 ~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn--~e----------~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~--~~~~~ 335 (419) T protein:vir:57 270 RQYTVNEVCRLYKVPPHMIQDLQKSTNNN--IE----------HQGLQYVIYTMLAILKRHESAMMRDLLLPS--ERRDF 335 (419) T ss_pred HHHHHHHHHHHhCCCHHHhCCCCCCcccc--HH----------HHHHHHHHHHHHHHHHHHHHHHHhhccCcc--ccCCe Confidence 77788889999999976654322 11121 11 111233355566655555554443221111 11233 Q ss_pred eEEe--CCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCC Q lcl|NC_010808. 426 RYVY--NRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQD 501 (512) Q Consensus 426 ~i~f--~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (512) .+.| ..-+..|..+.++++.++ .|+++.-.++++++.-.-+.. +.+. ............. .+.+.+ T Consensus 336 ~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gg--D~~~------~~~n~~~~~~~~~--~~~~~~ 405 (419) T protein:vir:57 336 YIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLTPIPGG--DKYL------TPLNMVDSKALTG--IGKATP 405 (419) T ss_pred EEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--Ceee------ecccccccccccc--ccCCCc Confidence 4445 444567888899988886 689999999998865321110 0000 0000000000000 000001 Q ss_pred CCCcCcccCCC Q lcl|NC_010808. 502 DDTKDTVDKKE 512 (512) Q Consensus 502 ~~~~~~~~~~e 512 (512) +.+++..-... T Consensus 406 ~~~~~~~~~~~ 416 (419) T protein:vir:57 406 QQLKDIEAILC 416 (419) T ss_pred ccCcchhhhhh Confidence 11110000000 No 185 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=97.46 E-value=6.3e-05 Score=43.62 Aligned_cols=418 Identities=10% Similarity=-0.056 Sum_probs=183.8 Q ss_pred cccccCCCcCeeecccchhHHhhh----------------cHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 17 RNYLFNDEANVVYTYDGTESDLLQ----------------NINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 17 ~~~~f~~~~~~~~~~~~~~~~~~~----------------~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) -..|.+...+-+-.....+..+.. .+..+...++........++..+.+...- T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m~e----------- 69 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEE----------- 69 (526) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHh----------- Confidence 022222222211000000000000 12222222222111111111111111000 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecCC------chhHHHHHHHHHhcc-ChhHHHHHHHHHHHhCCeE-EE Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDD------DKDVLEAIEAFNDLN-DVESHNRSLGLDLSIYGKA-YE 152 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~------d~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~~G~a-~~ 152 (512) ......-.+.+...-+.+.++.+... +....+.+++++..- +|......+. ++.-||.+ ++ T Consensus 70 ----------~D~~i~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-da~~~G~s~~E 138 (526) T protein:vir:99 70 ----------RDAHLFAEMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIE 138 (526) T ss_pred ----------hChHHHHHHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHHH-HhhhhcceeEE Confidence 01344445555566677777777532 223456788888653 5777666655 68889974 56 Q ss_pred EEEECCCCceEEE---EEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccc Q lcl|NC_010808. 153 LMIRNQDDETRLY---KSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLT 229 (512) Q Consensus 153 ~v~~d~~g~~~i~---~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~ 229 (512) .+|.-.+|..... ..+|..+ . |+..... ..++...... T Consensus 139 ivw~~~~g~~~~~~l~~r~~~~f-~-~~~~~~~--------------------------------~l~~~~~~~~----- 179 (526) T protein:vir:99 139 LEWALQGREWMPLAFHHRPQSWF-Q-LNPEDQN--------------------------------ELRLRDNSPA----- 179 (526) T ss_pred EEEeecCCceeEEEeeeecccce-e-eccCCCc--------------------------------EEEecCCCCC----- Confidence 7776555544332 2222211 1 1111100 0111111000 Q ss_pred ccccccccccccccceEee--cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhcc Q lcl|NC_010808. 230 PRENGFESHSFERMPITEF--SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEAN 307 (512) Q Consensus 230 ~~~~~~~~~~~~~vPvv~~--~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~ 307 (512) +....+++.|-.++. ..++.|.|.+..+.-..=.-+..+.+++..++.|+.|+++.+--.+.++++...+... T Consensus 180 ----g~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~L~~a- 254 (526) T protein:vir:99 180 ----GEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTADEEKATLLRA- 254 (526) T ss_pred ----ceeecCCCeEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCCHHHHHHHHHH- Confidence 001112232222221 1356788888877666666666888999999999999998763223333333222211 Q ss_pred ccccchhhhhhcccccCCCCCcceeEEeec-CCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHH-HHHHHHHH Q lcl|NC_010808. 308 VLFLEPTVYENRDTGIETEGSVDGGYIYKQ-YDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAM-KYKLFGLE 385 (512) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai-~~~~~~l~ 385 (512) ............+.+.++++++.. .....++.+++.+.+.|.+.--.-.++.+...|+.+.-|+ +....-.. T Consensus 255 ------v~~i~~d~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~ 328 (526) T protein:vir:99 255 ------VTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNEVRH 328 (526) T ss_pred ------HHHHhhCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhHHHHHHHHHH Confidence 111112222333556778888853 4567789999999999877643333333222222222222 11112223 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hcc-CChHHHHHhCC Q lcl|NC_010808. 386 QRTKTKEGLFTKGLR-RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGK-ISQTTLMSLFS 461 (512) Q Consensus 386 ~k~~~~~~~~~~~l~-~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~-~s~et~~~~~~ 461 (512) ..+..-.+.+...+. ++++.++.+ + .....+ ......+.|...-+.|..+.++.+.++ .|+ +|.+.+.+.++ T Consensus 329 di~~aDa~~i~~tln~~Li~~l~~~-N--~~~~~~-~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~G 404 (526) T protein:vir:99 329 DLLASDARQLAATLSRDLLWPLLVL-N--RPGSPD-VRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKLG 404 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh-C--CCCcCC-ccccceEEeCCCCcccHHHHHHHHHHHHhCCCccCHHHHHHHhC Confidence 334444555666664 466666553 2 111111 122346788888999999999999988 465 89999999987 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccC-------CC Q lcl|NC_010808. 462 FFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDK-------KE 512 (512) Q Consensus 462 ~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~e 512 (512) . ..+...-.-+....... ......................+.+..++ ++ T Consensus 405 i-p~~~~~e~~l~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~l~~~~~~~ 460 (526) T protein:vir:99 405 I-PQPAKNEPVLRSAAQPA-ILSRQHGQRVAALATIVGPRYGDQQALDKALADLPAKD 460 (526) T ss_pred C-CCCCCcccccCCCCCCc-ccccccccccccccccccccCcchhhHHHHHHHHHHHH Confidence 5 22221000000000000 00000000000000000000000000000 00 No 186 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=97.45 E-value=6.4e-05 Score=43.57 Aligned_cols=447 Identities=9% Similarity=0.004 Sum_probs=186.2 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) ||- +. ... +-.+..+...+.+..++..-..+++.+.+|..-.-- . ... T Consensus 1 m~~----------~~--------------~~~----~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~-~---~~~ 48 (532) T protein:vir:99 1 MAE----------VE--------------KTG----FAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVF-P---SAT 48 (532) T ss_pred Ccc----------hh--------------hcc----ccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhccc-C---CCC Confidence 111 00 000 001112233333434444445566677777543211 0 111 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhcc--Cc-----eecCCchh-------------H-------HHHHHHHHhccCh Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGN--PI-----QCQDDDKD-------------V-------LEAIEAFNDLNDV 133 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~--~~-----~~~~~d~~-------------~-------~~~l~~~~~~n~~ 133 (512) ....+...|+.-+-+...++++++.|++- |+ ++...+.+ + ...+...+..++| T Consensus 49 ~~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf 128 (532) T protein:vir:99 49 ADGSTSYTTPWQSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSF 128 (532) T ss_pred CcchhhccccccchHHHHHHHHHHHHHHhhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCc Confidence 11222334566677888888888887653 22 22222211 1 2233445666899 Q ss_pred hHHHHHHHHHHHhCCeEEEEEEECCC---CceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeec-----------c- Q lcl|NC_010808. 134 ESHNRSLGLDLSIYGKAYELMIRNQD---DETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID-----------K- 198 (512) Q Consensus 134 ~~~~~~~~~~~~~~G~a~~~v~~d~~---g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~-----------~- 198 (512) .....++.++..++|.+.+++..++. ...+++.++-.+ |.+--+. .+++...+|.++..... . T Consensus 129 ~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~f~~~pl~~-y~v~~d~-~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~ 206 (532) T protein:vir:99 129 RPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKLHN-FVVERDA-YDNVLQIVTEDKIARAALPEDVRKSLEDAQ 206 (532) T ss_pred HHHHHHHHHHHHhHCcEeEEecccccccCcccceEEEEcCe-EEEeeCC-CCCeeeEeeeeeecHHhcChHHHHHhhccc Confidence 99999999999999999998865432 334555555444 4443332 45566666543322100 0 Q ss_pred CCcceEEEEEEEcC-----CcE-EEEEecCCccccccccccccccccccccceEeec-----CCCCCCcchHHHHHHHHH Q lcl|NC_010808. 199 TDEDEVFTVDLFTS-----HGV-YRYLTSRTNGLKLTPRENGFESHSFERMPITEFS-----NNERRKGDYEKVITLIDL 267 (512) Q Consensus 199 ~~~~~~~~~~~yt~-----~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~-----n~~~g~s~~~~v~~liDa 267 (512) ...+....+++|+. +.. +.+.....+.. . .......+|..+|++.++ ...+|+|......+-+.. T Consensus 207 ~~~~p~~~v~v~~~v~~~~~~~~~~~~~~~~g~~-~---~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~ 282 (532) T protein:vir:99 207 GDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEI-V---AGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKS 282 (532) T ss_pred cccCCCcceEEEEEEEecCCCCeeEEEEeecCce-e---cccccccccccCCceeeeeeecCCCccccchHHHHHHHHHH Confidence 00112223444431 110 11111100100 0 001112235556766542 456899999999999999 Q ss_pred HHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEe--ecCCHHHHHH Q lcl|NC_010808. 268 YDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIY--KQYDVQGTEA 345 (512) Q Consensus 268 ~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~ 345 (512) ++.+.-...........|.+.+.-........+. .++ ...+..+...+++.+. ...+...... T Consensus 283 L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~---~~~------------~g~~v~g~~~~i~~~~~~~~~~~~~~~~ 347 (532) T protein:vir:99 283 LENLYEAIVKMSMISSKVLFFVNPNGVTQIRRVA---KAN------------TGDFVAGRKQDVEVFQLEKYNDFQVAKA 347 (532) T ss_pred HHHHHHHHHHHHHHHcCCCceeccccccchhhhc---cCC------------CcceecCCcccceeeecccccchhHHHH Confidence 9988766666666666666544211111111111 000 0001111122233332 3335666677 Q ss_pred HHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccCCCc-ccccc Q lcl|NC_010808. 346 YKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTK-TKEGLFTKGLRRRAKLLETILKNTRSIDA-NKDFN 423 (512) Q Consensus 346 ~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~-~~~~~~~~~l~~~~~li~~~l~~~~~~~~-~~d~~ 423 (512) .++.++..|...-..- ......+...+|..+......+.+... ...++-.+.|.-++...+.++...+.... +.+.. T Consensus 348 ~i~~~~~rI~~af~~~-~~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~ 426 (532) T protein:vir:99 348 TADDIEKRLSYAFMLN-SAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAV 426 (532) T ss_pred HHHHHHHHHHHHHhhh-hcccCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhc Confidence 7777777664432211 111122344677766654333322222 11222233333344444455544443321 11222 Q ss_pred eee-EEeCCCCCcCHHHHHHHH-------HHHhccCC-------hHHHHH----hCCC----CCCHHHHHHHHHHHHHHH Q lcl|NC_010808. 424 TVR-YVYNRNLPKSLIEELKAY-------IDSGGKIS-------QTTLMS----LFSF----FQDPELEVKKIEEDEKES 480 (512) Q Consensus 424 ~i~-i~f~~~~p~d~~~~~~~~-------~kl~g~~s-------~et~~~----~~~~----v~d~~~E~~ri~~E~~~~ 480 (512) .+. +++ .+..+.++.+ ..++.+.| ...++. .+|. +-..++|++.++++++.. T Consensus 427 ~~~iv~~-----is~Laraq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~GV~~~~i~r~~ee~~~~~~q~~~~ 501 (532) T protein:vir:99 427 EPAIATG-----LEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTA 501 (532) T ss_pred ccceeec-----chHHHHHHHHHHHHHHHHHHHhhcchhhhhCCHHHHHHHHHHHhCCChhhccCCHHHHHHHHHHHHHH Confidence 222 222 2222222222 22222222 222332 2232 112345555555443322 Q ss_pred HHHHH-----hhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 481 IKKAQ-----KGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 481 ~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ....+ ....+.........+ ..-+.| T Consensus 502 ~~~~~a~~~~~~~~~~~~~~~~~~~------~~~~~~ 532 (532) T protein:vir:99 502 AGMVTAGQQMGAAGGQAAAAMMQQQ------AGMPTQ 532 (532) T ss_pred HHHHHHHHHHHHHHHHhcchhHHhh------cCCCCC Confidence 22111 111111110000001 111111 No 187 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=97.42 E-value=7.1e-05 Score=43.32 Aligned_cols=385 Identities=11% Similarity=0.079 Sum_probs=159.6 Q ss_pred HHHhcccccccc--------ccccccc---ccccceeeecchHHHHHHHHHhhhhccCceec-CC-chhH-HHHHHHHHh Q lcl|NC_010808. 64 SDYYEGKTKNLV--------ELTRRKE---EYMADNRVAHDYASYISDFINGYFLGNPIQCQ-DD-DKDV-LEAIEAFND 129 (512) Q Consensus 64 ~~yy~G~~~~~~--------~~~~~~~---~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~-~~-d~~~-~~~l~~~~~ 129 (512) .++|++...... .....+. .+....-+.+.=.--.|+.+++-+..-|+.+- .. +... ...+..++. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~Al~~~~V~~cv~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~lL~ 80 (417) T protein:vir:38 1 MKLFRGLATEVDPHWADHLLDSGVIPSFRGGYLGISALRNSDVLTAVSIVSGDVSRFPLVITDSSTDEVIDLANIEYLMN 80 (417) T ss_pred CccccccccCCCccchhhhcccccccccCCceechhhcccHHHHHHHHHHHHhhccCeeEEEEcCCcceeccchHHHHHh Confidence 222222221100 0000000 00000001111112346677776666677642 11 1111 123444443 Q ss_pred c--c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCc-eE-EEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcc Q lcl|NC_010808. 130 L--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDE-TR-LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDED 202 (512) Q Consensus 130 ~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~-~~-i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~ 202 (512) . | ....+...+..+++.+|.||+++.++..|. +. +..+.|..+.+..++. .++. |.+....+ . T Consensus 81 ~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~--~~~~-----y~~~~~~~--~- 150 (417) T protein:vir:38 81 TKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDP--DNII-----YRFTPYNS--S- 150 (417) T ss_pred cccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCC--CeEE-----EEEEEcCC--c- Confidence 2 3 234566778888999999999999887653 33 4567788876655432 1111 11111110 0 Q ss_pred eEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010808. 203 EVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDL 282 (512) Q Consensus 203 ~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~ 282 (512) ...++.++.+.+++.. + .+.-.|.|.+..+...+.....+..-....+.-. T Consensus 151 ---~~~~~~~~dviH~r~~----------------------~----~d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng 201 (417) T protein:vir:38 151 ---MQKVCGFEDVIHWKFF----------------------S----YDTIMGRSPLLSLGDEIGLQESGVSTLQKFFKSG 201 (417) T ss_pred ---EEEEecCcceEEecCC----------------------C----CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 1112333444433210 0 0112367777666666655454444444445555 Q ss_pred cCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_010808. 283 NDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPN 362 (512) Q Consensus 283 ~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 362 (512) +.|-.++......+++..+..++.-.-.. .+ .+.....-.+.+.+++.++.......+.+..+.....|+..-++|. T Consensus 202 ~~p~~il~~~~~l~~e~~~~~~~~~~~~~-~g--~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp 278 (417) T protein:vir:38 202 LKGSIIKAKESRLSAEARQKIREDFERAQ-AG--ADAGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALRVPA 278 (417) T ss_pred CCCcEEEEeCCCCCHHHHHHHHHHHHHHh-cc--cccCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCH Confidence 66766665444445444444433211111 00 0111111123455555554333333444556666778888888887 Q ss_pred cccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHH Q lcl|NC_010808. 363 MKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELK 442 (512) Q Consensus 363 ~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~ 442 (512) ...+..+...|.+. .....+...|+.+++.|..-++.+-... .......+.|+... .+....++ T Consensus 279 ~~lg~~~~~s~~e~-------------~~~~~~~~tl~P~~~~ie~~l~~~Ll~~--~~~~~~~~~fd~~~-l~~~~~~~ 342 (417) T protein:vir:38 279 YRLAQNSPNQSVKQ-------------LADDYIRNDLPFYFEPITSEFELKLLDD--AQRHQYCIGFDTKS-VNGLPIAD 342 (417) T ss_pred HHhCCCCcchhHHH-------------HHHHHHHHHHHHHHHHHHHHHHhhhcCh--hhcccceEEechhh-hhHHHHHH Confidence 76653222222211 1122344556666655555544322111 11224456674221 12222233 Q ss_pred HHHHH--hccCChHHHHHhCCC--CCCHHH-HH------HHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCC Q lcl|NC_010808. 443 AYIDS--GGKISQTTLMSLFSF--FQDPEL-EV------KKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 511 (512) Q Consensus 443 ~~~kl--~g~~s~et~~~~~~~--v~d~~~-E~------~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (512) +.++ .|+++.-.++++++. +++... ++ -.+....+..... ...... ++...+.....++.++. T Consensus 343 -~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~~~~d~~~~~~~~~---~~~~kg--g~~~~~~~~~~~~~~~~ 416 (417) T protein:vir:38 343 -VNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLNTVFLDQKEAYQAEH---AAELKG--GDTNAKGNQNGSGTNAN 416 (417) T ss_pred -HHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeeccccccccccccccccc---ccccCC--CCCCCCCCCcCCCCcCC Confidence 2333 689999999888855 333211 00 0111110000000 111111 11111111111111111 Q ss_pred C Q lcl|NC_010808. 512 E 512 (512) Q Consensus 512 e 512 (512) - T Consensus 417 ~ 417 (417) T protein:vir:38 417 S 417 (417) T ss_pred C Confidence 1 No 188 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=97.36 E-value=8.4e-05 Score=42.93 Aligned_cols=450 Identities=9% Similarity=0.000 Sum_probs=187.1 Q ss_pred ccchhHHhhhcH-HHHHH---HHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhh Q lcl|NC_010808. 31 YDGTESDLLQNI-NEVSK---YIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF 106 (512) Q Consensus 31 ~~~~~~~~~~~~-~~l~~---~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l 106 (512) +.+ .....+.. ..+.+ .+..++..-..+++.+.+|..-.-. . ...........++.-+-+...++.+++.| T Consensus 1 ~~~-~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~-~---~~~~~~~~~~~~~~dst~~~a~~~Laa~l 75 (535) T protein:vir:94 1 MAS-SQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSLF-P---KDSDNASTDYTTPWQAVGARGLNNLASKL 75 (535) T ss_pred CCc-hhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccC-C---CCCCccccccCCcccccHHHHHHHHHHHH Confidence 111 11111111 11333 3333333334566666666543110 0 00111112223455677777888888777 Q ss_pred hcc--Cc----eecCCch-------------hHHH-------HHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCC Q lcl|NC_010808. 107 LGN--PI----QCQDDDK-------------DVLE-------AIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (512) Q Consensus 107 ~g~--~~----~~~~~d~-------------~~~~-------~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g 160 (512) ++- |. ++...+. ++.+ .+...+..++|.....++.++..++|.+.+++-.+.+. T Consensus 76 ~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~ 155 (535) T protein:vir:94 76 MLALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPEPEGT 155 (535) T ss_pred HhhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeccCcCc Confidence 642 21 1222211 1222 23334566899999999999999999998877655444 Q ss_pred ceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeec-----------cCCcceEEEEEEEcCCcEEEEEecCCcccc-- Q lcl|NC_010808. 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID-----------KTDEDEVFTVDLFTSHGVYRYLTSRTNGLK-- 227 (512) Q Consensus 161 ~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~-----------~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~-- 227 (512) ..+++.++-.+ |.+-.+. .+++...+|.++..... ....+....+++|+.- +....+..+.. T Consensus 156 ~~~f~~~pl~~-y~v~~d~-~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~v---~~~~~~~~~~~~~ 230 (535) T protein:vir:94 156 YNPMKLYRLSS-YVVQRDA-FGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHI---YLDEESGEYLKYE 230 (535) T ss_pred ccceEEEEcCe-EEEeeCC-CCCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEEE---EeeCCCCcEEEEE Confidence 44566554434 4444333 45666666654432100 0011112234444310 00101000000 Q ss_pred -cccccc--ccccccccccceEeec-----CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhh Q lcl|NC_010808. 228 -LTPREN--GFESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDE 299 (512) Q Consensus 228 -~~~~~~--~~~~~~~~~vPvv~~~-----n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~ 299 (512) ...... .....+|..+|++.++ .+.+|+|..++..+-+-.++.+.-...........|.+.+.-........ T Consensus 231 e~~g~~~~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~ 310 (535) T protein:vir:94 231 EIDGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQVRR 310 (535) T ss_pred EecCeeeccccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccccchhh Confidence 000001 1123356677876543 45689999999999998888876666665555555554432100111111 Q ss_pred hhhhhhccccccchhhhhhcccccCCCCCcceeEE--eecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHH Q lcl|NC_010808. 300 VKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYI--YKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAM 377 (512) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai 377 (512) +. ..+ ...+..+...+++.+ ....+.......++.++..|...-..-. .....+...+|..+ T Consensus 311 ~~---~~~------------~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~d~~rvTAtEV 374 (535) T protein:vir:94 311 LT---KAQ------------TGDFVSGRPEDISFLQLEKAADFSVARAVSEQIEGRLSYAFMLNS-AVQRTGERVTAEEI 374 (535) T ss_pred cc---cCC------------CceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhHhh-hccCCCCCccHHHH Confidence 11 000 000111112233333 2334556666777777776654332211 11222344677776 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHH-------Hhc Q lcl|NC_010808. 378 KYKLFGLEQRTKT-KEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYID-------SGG 449 (512) Q Consensus 378 ~~~~~~l~~k~~~-~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~k-------l~g 449 (512) +.....+.+...- ..++-.+.|.-+++..+.++...+..... .-..+++.+..++. .....+.+.+ +++ T Consensus 375 ~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g~lP~~-p~~~v~~~~vs~la--~l~r~~~~~~l~~~~~~laq 451 (535) T protein:vir:94 375 RYVASELEDTLGGVYSILSQELQLPMVRVLLKQLQATNQIPEL-PKEAVEPTISTGME--ALGRGQDLDKLERCIAAWSA 451 (535) T ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCCCC-ChhhccceEeehHH--HHHHHHHHHHHHHHHHHHHh Confidence 6643333322221 12222333333444444555444433211 11224555543332 2222222222 222 Q ss_pred cCC--------hHHHHHhC---CCCC-----CHHHHHHHHHHHHHHHHHHHHhhccc-CCCCC----CCCC---CCCCCc Q lcl|NC_010808. 450 KIS--------QTTLMSLF---SFFQ-----DPELEVKKIEEDEKESIKKAQKGIYK-DPRDI----NDDE---QDDDTK 505 (512) Q Consensus 450 ~~s--------~et~~~~~---~~v~-----d~~~E~~ri~~E~~~~~~~~~~~~~~-~~~~~----~~~~---~~~~~~ 505 (512) +-| ...++..+ .+++ ..++|++++.+++++....+...... ..... +... ..+.-+ T Consensus 452 ~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs~eev~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g 531 (535) T protein:vir:94 452 LAPMQGDPDINIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQGTAMQNAAASAGAGAGTMATASPENMKAAAAQAG 531 (535) T ss_pred hChHHhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccChHHHHHHHHHhc Confidence 222 22222222 1222 23566666665554433322111110 00000 0000 000000 Q ss_pred Cccc Q lcl|NC_010808. 506 DTVD 509 (512) Q Consensus 506 ~~~~ 509 (512) =.-+ T Consensus 532 ~~~~ 535 (535) T protein:vir:94 532 MAPN 535 (535) T ss_pred cCCC Confidence 0000 No 189 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=97.32 E-value=9.7e-05 Score=42.59 Aligned_cols=396 Identities=12% Similarity=0.029 Sum_probs=168.7 Q ss_pred cCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------cccc------ccccccccc Q lcl|NC_010808. 21 FNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTK---------NLVE------LTRRKEEYM 85 (512) Q Consensus 21 f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~---------~~~~------~~~~~~~~~ 85 (512) ++++--+. .+.+++..+....+ .+.. .......+. T Consensus 1 ~~~~~~~g------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 50 (432) T protein:vir:97 1 MPDEKKLG------------------------------LLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGA 50 (432) T ss_pred CCCcccCc------------------------------hhhhhHhhcCCccccccccccccccCchhhhhhcccccccCc Confidence 22222221 11122222211100 0000 000000000 Q ss_pred c---ceeeecchHHHHHHHHHhhhhccCceec--CCc---hhHHHHHHHHHh--cc---ChhHHHHHHHHHHHhCCeEEE Q lcl|NC_010808. 86 A---DNRVAHDYASYISDFINGYFLGNPIQCQ--DDD---KDVLEAIEAFND--LN---DVESHNRSLGLDLSIYGKAYE 152 (512) Q Consensus 86 ~---~~ri~~n~~~~iv~~~a~~l~g~~~~~~--~~d---~~~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~ 152 (512) . ..-+.+.-....|+.+++-+..-|+.+- ..+ ......+..++. -| ....+...+..+.+.+|.||+ T Consensus 51 ~v~~~~a~~~~aV~~~v~~Ia~~ia~lp~~~y~~~~~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~ 130 (432) T protein:vir:97 51 AVNADAIMRLDAVAACVKLVSQAVAAMPLMMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYV 130 (432) T ss_pred ccchHhhhcchHHHHHHHHHHHhhccCceEEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEE Confidence 0 0001122223345556665555566531 111 112233445542 23 334566778889999999999 Q ss_pred EEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccc Q lcl|NC_010808. 153 LMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPR 231 (512) Q Consensus 153 ~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~ 231 (512) ++..+ +|++ .+..++|..+.++.+.. .++ +|.+...++ .. ..+.++.+.+++.. T Consensus 131 ~~~~~-~g~~~~L~~l~p~~v~v~~~~~--g~~-----~y~~~~~~g------~~-~~~~~~~iih~r~~---------- 185 (432) T protein:vir:97 131 RKVVT-DGRIESLQYLANDRLTITTDTK--GNT-----AYRYRRTDG------QM-IDIPRQQIWKIMGY---------- 185 (432) T ss_pred EEEec-CCcEEEEEEEcCcceEEEEcCC--CcE-----EEEEEecCc------eE-EEEccccEEEecCc---------- Confidence 98876 4664 45678898888887643 221 121111110 01 12334444443210 Q ss_pred ccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhcccccc Q lcl|NC_010808. 232 ENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFL 311 (512) Q Consensus 232 ~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~ 311 (512) ++ +...|.|.++.+...++....+..-..+.+.-.+.|-.+++.....+++..+..++.- . T Consensus 186 ------------~~----dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~--~- 246 (432) T protein:vir:97 186 ------------SL----DGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFSKKV--S- 246 (432) T ss_pred ------------CC----CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCCCCCHHHHHHHHHHH--h- Confidence 00 1123667776666555554444444444445556666666543333444333322211 0 Q ss_pred chhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccch--HHHHHHHHHHHHHHHH Q lcl|NC_010808. 312 EPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS--GEAMKYKLFGLEQRTK 389 (512) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~S--g~Ai~~~~~~l~~k~~ 389 (512) ...+.......+++.+++.++.......+.+..+.....|+..-++|....+....+.+ +..++.. T Consensus 247 ---~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~--------- 314 (432) T protein:vir:97 247 ---GSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQ--------- 314 (432) T ss_pred ---hhhcCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHH--------- Confidence 00111112223455566666544444555566777888899988998776654322221 2222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEe--CCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCC Q lcl|NC_010808. 390 TKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVY--NRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQD 465 (512) Q Consensus 390 ~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f--~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d 465 (512) ....+...|..+++.|...++.+-.... +.....+.| ..-+-.|..+.++++.++ .|+++.-.++++++.-.- T Consensus 315 -~~~f~~~tl~P~~~~ie~~ln~kLl~~~--e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~ 391 (432) T protein:vir:97 315 -QLGFLTMTLSPWLRRIEQSIALNLLTPA--ERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKL 391 (432) T ss_pred -HHHHHHHHHHHHHHHHHHHHhhhccCcc--ccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC Confidence 1122334455555544444443221111 112233444 445567888999998887 689999888888764210 Q ss_pred H-HHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCC Q lcl|NC_010808. 466 P-ELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 511 (512) Q Consensus 466 ~-~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (512) + ...+-.+..- .... ........+ +..+..+..+++...+ T Consensus 392 ~g~~~~~~~~~~---~~pl--~~~~~~~~~-~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:97 392 GGNAAVLTVQSA---MVPL--DSIGLQASP-EPASGLGNQQQDKVSK 432 (432) T ss_pred CCCcceEeeccc---ccch--hhhcccCCC-CCCCCCCCcccccccC Confidence 0 0000000000 0000 000000000 0011111111111111 No 190 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=97.31 E-value=9.7e-05 Score=42.58 Aligned_cols=390 Identities=9% Similarity=-0.015 Sum_probs=169.5 Q ss_pred hhcHHHHHHHHHHHHHHHHHHH-HHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceec--C Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRL-KVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQ--D 115 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~-~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~--~ 115 (512) ....+.+.++ ..+...+. .....++-+..... ....... .-+........|+.+++-+..-|+.+- . T Consensus 1 Mg~f~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~---~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~ 70 (406) T protein:vir:95 1 MGLFDRWRRT----KRKSKIRADTGYVGLFMSGEDVS---FLVPGYV---RLSDNPEVRMAVHKIADLISSMTIYLMQNT 70 (406) T ss_pred Ccchhhhccc----cccccccccchhhhhhccCcccC---ccccCHH---HHhhcHHHHHHHHHHHHhhccCceEEEEec Confidence 1111111110 00000000 00011111110000 0000000 012345667778888888877777651 1 Q ss_pred Cc--hhHHHHHHHHH-h-cc---ChhHHHHHHHHHHHhCCeE--EEEEEECCCCce-EEEEEccceeEEEEeCCCCceeE Q lcl|NC_010808. 116 DD--KDVLEAIEAFN-D-LN---DVESHNRSLGLDLSIYGKA--YELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSI 185 (512) Q Consensus 116 ~d--~~~~~~l~~~~-~-~n---~~~~~~~~~~~~~~~~G~a--~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~ 185 (512) ++ ......+...+ . -| ....+...+..+.+.+|.+ |+.+-.+..|++ .+..++|..+.++.+... T Consensus 71 ~~~~~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~~i~~~~v~~~~~~~~----- 145 (406) T protein:vir:95 71 EDGDIRIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELVPLTPSKVNFLDTPDG----- 145 (406) T ss_pred CCcceeecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEEEEcCceeEEEEcCCe----- Confidence 11 11112222222 2 12 3456677788888888765 545556666765 466788888777655421 Q ss_pred EEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHH Q lcl|NC_010808. 186 AGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLI 265 (512) Q Consensus 186 ~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~li 265 (512) .++. . . ...|.+..+++++.... |. +.-.|.|.+..+...+ T Consensus 146 --~~~~-~---~---------~~~~~~~evih~~~~~~--------------------~~----~~~~G~s~i~~~~~~i 186 (406) T protein:vir:95 146 --YQVL-Y---G---------GQTFNYDEVLHFIYNPD--------------------PE----RPYIGRGYRVVLKDIA 186 (406) T ss_pred --EEEE-e---c---------cEEEchhHEEEeeccCC--------------------CC----CCccccCHHHHHHHHH Confidence 1110 0 0 01233444444322100 00 0113777777777777 Q ss_pred HHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEee-cCCHHHHH Q lcl|NC_010808. 266 DLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYK-QYDVQGTE 344 (512) Q Consensus 266 Da~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~ 344 (512) +....+..-....+.-.+.|-.++.-....+++..+..+..-.-........+ ...+-..++.+..-++. ......+. T Consensus 187 ~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~n~~-~~~v~~~~~~~~~~~~~~~~~d~q~~ 265 (406) T protein:vir:95 187 DNLKQATATKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLQATEAG-QPWIIPAELLEVEQVKPLSLKDIAIN 265 (406) T ss_pred HHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhccccccC-CceeecCCCccccccccCChhHHHHH Confidence 76666655555556666677666654444444443333322111111000000 01111122222222221 22234445 Q ss_pred HHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccce Q lcl|NC_010808. 345 AYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNT 424 (512) Q Consensus 345 ~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~ 424 (512) +..+.....|+..-++|....+... +.+.. ....+...|..+++.|...+...-..+. + .. T Consensus 266 e~~~~~~~~Ia~~fgVp~~~lg~~~-~~~~~---------------~~~~~~~~l~P~~~~ie~~l~~~l~~~~--~-~~ 326 (406) T protein:vir:95 266 EAVELDKRTVAGMFGVPAFLLGIGE-FNRDE---------------YNNFINSTILPIAKGIEQELTRKLLISP--D-LY 326 (406) T ss_pred HHHHHHHHHHHHHhCCCHHHcCCCC-chHHH---------------HHHHHHHHHHHHHHHHHHHHHHhcCCCC--C-cE Confidence 6677778888888889876554221 11111 1124556666666666665554322111 1 13 Q ss_pred eeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCC Q lcl|NC_010808. 425 VRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDD 502 (512) Q Consensus 425 i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (512) +++.++.-+..|..+.++.+.++ .|+++...++++++.-.-+. .+++.-- ...... .........++.+ T Consensus 327 ~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~~~--gd~~~~~------~n~~~~-~~~~~~~~~k~g~ 397 (406) T protein:vir:95 327 FKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPKEG--LSELVIL------ENYIPL-DKIGDQSKLKGGD 397 (406) T ss_pred EEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cceeeec------cCccch-hhcccccccCCCC Confidence 45555566677888889888887 68999999999986632111 1111000 000000 0000000001111 Q ss_pred CCcCcccCCC Q lcl|NC_010808. 503 DTKDTVDKKE 512 (512) Q Consensus 503 ~~~~~~~~~e 512 (512) .+++ .+++| T Consensus 398 ~~~~-~~~~~ 406 (406) T protein:vir:95 398 NSGA-DGQTD 406 (406) T ss_pred CCCC-CCCCC Confidence 1111 11111 No 191 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=97.31 E-value=0.0001 Score=42.52 Aligned_cols=409 Identities=13% Similarity=0.023 Sum_probs=148.3 Q ss_pred hhhcHHHHHHHHHHHHHHHHHHHHHHHHHhc----c--c-cccccc----cccccc-ccccceeeecchHHHHHHHHHhh Q lcl|NC_010808. 38 LLQNINEVSKYIEHHMDYQRPRLKVLSDYYE----G--K-TKNLVE----LTRRKE-EYMADNRVAHDYASYISDFINGY 105 (512) Q Consensus 38 ~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~----G--~-~~~~~~----~~~~~~-~~~~~~ri~~n~~~~iv~~~a~~ 105 (512) .. +. +.+...|... ..|+.+..+.+. | . .+.... +..... .....+| ...+++.||+..++- T Consensus 1 ~~---~~-~~~~~~~~~~-~~~~~~~rd~l~~~~~glg~~r~~~~~~~g~~~~~~~~~l~~~Yr-~~~ia~~iVd~~~d~ 74 (449) T protein:vir:10 1 MT---DK-LTLAVNHALN-DARMARARMGLMVPTMGLDNKRHSAWCEYGFPELVTYENLYSLYR-RGGIAHGAVEKLVGK 74 (449) T ss_pred Cc---hh-hHHHHhhhcc-hhHHHHHHHHHHHHHhcCCcccchhhhhcCCcccCCHHHHHHHHh-cCchhHHHHHhhhhh Confidence 00 01 1121122111 112222222111 1 0 000000 000000 0000011 235678999999887 Q ss_pred hhccCceec-CCchh---HHHHHHHHHh---ccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeC Q lcl|NC_010808. 106 FLGNPIQCQ-DDDKD---VLEAIEAFND---LNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDN 178 (512) Q Consensus 106 l~g~~~~~~-~~d~~---~~~~l~~~~~---~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~ 178 (512) ..-+.+.+. +.+.+ ....+...++ .+++...+.++.+.+..+|.+++++-.+ +|+.- -.|.. . T Consensus 75 ~~~~~~~i~~g~~~~~~~~~~~~e~~~~~l~~~~~~~~l~ea~~~~rl~Gga~i~i~v~-d~~~l---~~Pl~------~ 144 (449) T protein:vir:10 75 CWQTNPEIIEGDDADDSEDETSWEKKSKQVFTNRLWRSFAEADRRRLVGRYAGILLHIR-DEKDW---NLPAT------K 144 (449) T ss_pred hhhcCcccccCccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcEEEEEEec-CCCCC---Ccccc------c Confidence 654544332 22211 1112222222 2356667888888888999998887664 33321 11211 0 Q ss_pred CCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcch Q lcl|NC_010808. 179 TIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDY 258 (512) Q Consensus 179 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~ 258 (512) ...+....-+|........-...+..-..+-|. .|++.....+.. .....-|+--.+.++..+ ..|.|.+ T Consensus 145 --~~~i~~i~v~~~~~i~~~~~~~dp~sp~yg~P~-~y~v~~~~~g~~-----~~~~~iH~SRl~~~~~~~--~~g~~~L 214 (449) T protein:vir:10 145 --GRGLQKVSVSWAGSLKVAEWDTGINSKTYGQPK-LWKYTERLPNGS-----SRRVDIHPDRVFILGDYS--EDAIGFL 214 (449) T ss_pred --CcceeeEEeeccccCChhhhhcCCCCCCCCCce-EEEEeeeccCCC-----ccceeeccceeEeecCCC--CCChhHH Confidence 011111111111000000000000111112222 222222111110 011123433323332221 1255555 Q ss_pred HHHHHHHHHHHHHHHH-----HHHHHHHhcCc---eeeeecC---CcCChhhhh-hhhhccccccchhhhhhcccccCCC Q lcl|NC_010808. 259 EKVITLIDLYDNAESD-----TANYMSDLNDA---MLLIKGN---LSLDPDEVK-KQKEANVLFLEPTVYENRDTGIETE 326 (512) Q Consensus 259 ~~v~~liDa~~~~~s~-----~~~~~~~~~~~---~lv~~g~---~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 326 (512) +++-.-+-.++.+.-. +.+..+..... ..-+.|. .....++.. .+. ..+-. +..+.. ..... T Consensus 215 ~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~~~~~e~~~~~~~-~~~~~----~~~~~~-~~~i~ 288 (449) T protein:vir:10 215 EPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLYGVSIDELQDKFN-EVAGE----INRGND-VLMTT 288 (449) T ss_pred HHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHhhCCchHHHHHHH-HHHHH----Hhccch-heeec Confidence 5543322222222111 11111111000 0001111 111111111 110 00000 000110 11112 Q ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccc--cc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 327 GSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNF--SG-TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRA 403 (512) Q Consensus 327 ~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--~~-n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~ 403 (512) .+.+ |...+.+..+....++...+.+...+++|-.-+... +| |.+++ ++ --+..+..++..++..|++++ T Consensus 289 ~~~d--~~~~~~~~sgl~d~l~~~~q~iaaa~~IP~t~L~Gqsp~glnst~D-~~----nyyd~i~~~Q~~l~p~le~l~ 361 (449) T protein:vir:10 289 QGAT--VTPLVTSVADPTATYNVNLQTAAAGVDIPTRILIGNQQAERSSTED-QK----YFNARCQSRRVDLSFEIEDFC 361 (449) T ss_pred CCcc--eEEEecccCChhHHHHHHHHHHHHHhCCCeeeeeccCccccccchh-HH----HHHHHHHHHHHhhhHHHHHHH Confidence 3334 334455667788888888888999999996544322 22 22332 32 233444445556788999988 Q ss_pred HHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhC--CCCCCHHHHHHHHHHHHHHHH Q lcl|NC_010808. 404 KLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLF--SFFQDPELEVKKIEEDEKESI 481 (512) Q Consensus 404 ~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~~~~~--~~v~d~~~E~~ri~~E~~~~~ 481 (512) .+++.. . .++.+ .+++|.|+|-...+..+.|+...+.+... .+++... +-+ ++ .|+.... T Consensus 362 ~~l~~s-~---~g~~~---~d~~i~f~pL~~~t~kEkAei~k~~A~a~--~~~~~ag~~~~~-~~-~EiR~~~------- 423 (449) T protein:vir:10 362 DKLIEL-K---IIDAV---AKKAVIWDDLNEQTGTEKLTNAKTMGEIN--QTMLGSGDNPAF-SR-EEIRTAA------- 423 (449) T ss_pred HHHHHh-h---cCCCC---CceeEEeCCCCCCCHHHHHHHHHHHHHHH--HHHHHccccCCc-CH-HHHHHHh------- Confidence 876543 1 12211 35899999999999999988876654211 1111111 111 11 1221100 Q ss_pred HHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 482 KKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ..++... ...+++..++.+.+. T Consensus 424 -------~~~~~~~--~~~~~e~~de~~~~~ 445 (449) T protein:vir:10 424 -------GYDNDDE--EPLGEEDGDEEDKAT 445 (449) T ss_pred -------cccCCCC--CCCCCCCCccccccC Confidence 0011000 011111111111111 No 192 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=97.28 E-value=0.00011 Score=42.37 Aligned_cols=379 Identities=10% Similarity=0.042 Sum_probs=170.8 Q ss_pred ccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc----------c--ccccccc----cc--cee-ee Q lcl|NC_010808. 31 YDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVE----------L--TRRKEEY----MA--DNR-VA 91 (512) Q Consensus 31 ~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~----------~--~~~~~~~----~~--~~r-i~ 91 (512) +++ ..++++ .+...+|..+...... + ....+.. .. ..+ .. T Consensus 1 ~~~--------~~~~~~-------------~~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 59 (413) T protein:vir:96 1 MPG--------VSEIRK-------------DKNLKFFNNKRSPTEESKAKDEIPKAPQVVMTLPNFFKELISDGYTKLSD 59 (413) T ss_pred CCc--------cchhhh-------------hhcCCccccCCCcchhhhhhccccccccccccchhhHhhhccchhHHHhh Confidence 111 111111 0111222222111000 0 0000000 00 011 11 Q ss_pred cchHHHHHHHHHhhhhccCceecCC----chhHHHHHHHHHh--cc---ChhHHHHHHHHHHHhCCeEEEEEEECCCCc- Q lcl|NC_010808. 92 HDYASYISDFINGYFLGNPIQCQDD----DKDVLEAIEAFND--LN---DVESHNRSLGLDLSIYGKAYELMIRNQDDE- 161 (512) Q Consensus 92 ~n~~~~iv~~~a~~l~g~~~~~~~~----d~~~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~- 161 (512) .......|+.+++-+..-|+.+--. .......+..++. -| ....+...+..+.+.+|.||+++..+..|. T Consensus 60 ~~~v~~cI~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~ 139 (413) T protein:vir:96 60 SPEVRMAVDCIADLVSNMTIQLMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDK 139 (413) T ss_pred chHHHHHHHHHHHhhccCceEEEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCc Confidence 3555666777777777777765111 1122233444442 23 335677888899999999999999988874 Q ss_pred e-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccc Q lcl|NC_010808. 162 T-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSF 240 (512) Q Consensus 162 ~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (512) + .+..++|..+.+.+++. . ++ |.... . . ..+.++.+++++... T Consensus 140 ~~~L~~l~~~~v~~~~~~~---~----~~-y~~~~-~---~------~~~~~~evih~k~~~------------------ 183 (413) T protein:vir:96 140 IIGLTPISPYKVTFNVSDD---D----LD-YSITF-D---N------KEYDPSTLLHFVLNP------------------ 183 (413) T ss_pred eEEEEEecCceeEEEEcCC---e----EE-EEEee-c---C------cEEchhhEEEEeccC------------------ Confidence 3 57788888887776532 1 11 11110 0 0 012233333322100 Q ss_pred cccceEeecCC-CCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhc Q lcl|NC_010808. 241 ERMPITEFSNN-ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENR 319 (512) Q Consensus 241 ~~vPvv~~~n~-~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (512) ..++ -.|.|.+..+...+.....+.......+.-.+.|-.+++.....+++..+..++.-.-... +..-.. T Consensus 184 -------~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~-g~~n~g 255 (413) T protein:vir:96 184 -------SIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSDSDELSDEEGRENFEEMYL-KRKEAG 255 (413) T ss_pred -------CCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhc-CccccC Confidence 0001 1367777766666665555555555556666677777765443444443333322111000 000000 Q ss_pred ccccCCCCCcceeEEee-cCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 320 DTGIETEGSVDGGYIYK-QYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKG 398 (512) Q Consensus 320 ~~~~~~~~~~~~~~l~~-~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 398 (512) ...+...++.+..-+.. ......+.+..+.....|+..-++|....+... +.++.+ ...+... T Consensus 256 ~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~-~~~~~~---------------~~~~~~~ 319 (413) T protein:vir:96 256 KPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLGVGT-YNKDEF---------------NNFINTK 319 (413) T ss_pred ceeeecCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCc-chHHHH---------------HHHHHHH Confidence 00111122222222111 222344455666777888888889876654221 111111 1234555 Q ss_pred HHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHH Q lcl|NC_010808. 399 LRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEED 476 (512) Q Consensus 399 l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E 476 (512) |..+++.|...++..-. + +-..+++.++.-+..|..+.++++.++ +|+++.-.++++++.-..+. -+.+. T Consensus 320 l~P~~~~ie~~ln~~ll-~---~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~~--gd~~~-- 391 (413) T protein:vir:96 320 IMSIAQVIQQTYNKLIV-E---EDMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNWVGMPPDAE--MDDLL-- 391 (413) T ss_pred HHHHHHHHHHHHHHhhC-C---CCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cceee-- Confidence 66666666665554321 1 112345555566678889999998887 78999999999987643211 00000 Q ss_pred HHHHHHHHHhhcccCCCCCCCCCCCCCCcCc Q lcl|NC_010808. 477 EKESIKKAQKGIYKDPRDINDDEQDDDTKDT 507 (512) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (512) . .....+-+.....+....+|. T Consensus 392 --------~-~~n~~~~~~~~~~~~~~~~dt 413 (413) T protein:vir:96 392 --------V-LENYLQQKDLVNQKKLIQDET 413 (413) T ss_pred --------e-cccccchhhcccccCCCCCCC Confidence 0 000000000000001111111 No 193 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=97.26 E-value=0.00011 Score=42.23 Aligned_cols=414 Identities=10% Similarity=-0.059 Sum_probs=181.2 Q ss_pred cccccCCCcCeeecccchhHHhhh----------------cHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 17 RNYLFNDEANVVYTYDGTESDLLQ----------------NINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 17 ~~~~f~~~~~~~~~~~~~~~~~~~----------------~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) -..+.+...+-+-...-....... .+..+...++........++..+.+...- T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e----------- 69 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEE----------- 69 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHh----------- Confidence 012222211111000000000000 12222222222111111111111111100 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecCC------chhHHHHHHHHHhcc-ChhHHHHHHHHHHHhCCeE-EE Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDD------DKDVLEAIEAFNDLN-DVESHNRSLGLDLSIYGKA-YE 152 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~------d~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~~G~a-~~ 152 (512) ......-.+.+....+.+.++.+... +....+.+.+++..- +|+..+.. ..++.-||.+ ++ T Consensus 70 ----------~D~~i~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~i~~-~lda~~~G~s~~E 138 (528) T protein:vir:10 70 ----------RDAHLFAEMSKRKRAVLGLDWTIEPPRNASAAEKADAEYLHELLLDLEGIEDLMLD-CMDGVGHGYSAIE 138 (528) T ss_pred ----------hChHHHHHHHHHHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhCCccHHHHHHH-HHhhhhhcceeEE Confidence 12345556666667778888887542 123345677777552 46654444 4568889975 56 Q ss_pred EEEECCCCceEEEEE---ccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccc Q lcl|NC_010808. 153 LMIRNQDDETRLYKS---DAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLT 229 (512) Q Consensus 153 ~v~~d~~g~~~i~~~---~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~ 229 (512) .+|.-.+|...+..+ +|.. |- |+.... .. .+....... T Consensus 139 i~w~~~~g~~~~~~~~~r~~~~-f~-~~~~~~--~~------------------------------l~~~~~~~~----- 179 (528) T protein:vir:10 139 LDWSLQGREWLPQAFDHRPQSW-FQ-LNPDDQ--DE------------------------------LRLRDNSIA----- 179 (528) T ss_pred EEEeecCCceeEEEeeeecccc-ee-eccCCC--cE------------------------------EeccCCCCC----- Confidence 667655555433222 2211 11 111000 00 000000000 Q ss_pred ccccccccccccccceEee--cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhcc Q lcl|NC_010808. 230 PRENGFESHSFERMPITEF--SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEAN 307 (512) Q Consensus 230 ~~~~~~~~~~~~~vPvv~~--~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~ 307 (512) +..-.+++.+=.++. ..++.|.|.+..+....=--+..+.+++..++.|+.|+++.+-..+.++++...+... T Consensus 180 ----g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~L~~a- 254 (528) T protein:vir:10 180 ----GEVLQPFGWIMHKPRSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYPPGTPDEEKVTLLRA- 254 (528) T ss_pred ----ceeecCCCeEEEeecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHH- Confidence 000012222111111 2355688888887777777777889999999999999998763333344433322211 Q ss_pred ccccchhhhhhcccccCCCCCcceeEEeec-CCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHH-HHHHHHHH Q lcl|NC_010808. 308 VLFLEPTVYENRDTGIETEGSVDGGYIYKQ-YDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAM-KYKLFGLE 385 (512) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai-~~~~~~l~ 385 (512) ............+.+.++++++.. .....++.+++.+.+.|.+.--.-.++.+...+..+.-|+ +....-.. T Consensus 255 ------l~~i~~~~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~Alg~vh~~v~~ 328 (528) T protein:vir:10 255 ------VTGLGHAAAGIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAILGGTLTSQTSESGGGAYALGQVHNEVRH 328 (528) T ss_pred ------HHHHhhCcEEEecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHHhhhhhhccccccccchhhhHHHHHHHHH Confidence 011111222233556778888853 4567789999999998887653333322221111111111 11111222 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hcc-CChHHHHHhCC Q lcl|NC_010808. 386 QRTKTKEGLFTKGLR-RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGK-ISQTTLMSLFS 461 (512) Q Consensus 386 ~k~~~~~~~~~~~l~-~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~-~s~et~~~~~~ 461 (512) ..+..-.+.+...+. ++++.++.+ + ..... ....-..+.|...-+.|..+.++.+.++ .|+ +|.+.+.+.++ T Consensus 329 di~~aDa~~i~~tln~~li~~l~~~-N--~~~~~-~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~g 404 (528) T protein:vir:10 329 DLLAADARQLAATLSRDLLWPLLVL-N--RSGNL-DARRAPRLVFDLKDRADLAAMATSLPPLVKLGVQVPVNWVQEQLG 404 (528) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh-C--CCCCC-CccccceEEecCCCcccHHHHHHHHHHHHhCCCCCCHHHHHHHhC Confidence 334444455666664 356555553 2 11111 1122346788889999999999999888 466 89999999987 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCc-CcccCCC Q lcl|NC_010808. 462 FFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTK-DTVDKKE 512 (512) Q Consensus 462 ~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~e 512 (512) . ..++.. +.+...+... ........+... .....+..+ ...++.+ T Consensus 405 i-p~p~~~-e~~~~~~~~~---~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 450 (528) T protein:vir:10 405 I-PLPANG-EAVLGDQAGA---GIAQLSRRPGPR-IAALAQVIGPRYRDQEA 450 (528) T ss_pred C-CCCCCC-cccccCCCcc---cccccCcccccc-cccccccccccccccch Confidence 5 222110 0000000000 000000000000 000000000 0111111 No 194 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=97.19 E-value=0.00014 Score=41.79 Aligned_cols=396 Identities=9% Similarity=0.037 Sum_probs=179.1 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccc--------cc-ccee--eecchHHHHHHHHHhhhh Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEE--------YM-ADNR--VAHDYASYISDFINGYFL 107 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~--------~~-~~~r--i~~n~~~~iv~~~a~~l~ 107 (512) .+.++.-.++ ..++..+.+++.++.|.............+ .. ...+ +.+.-....|+.+++-+. T Consensus 1 ~~~~~~~~~~-----~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA 75 (424) T protein:vir:18 1 MEEPKYTIDL-----RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTA 75 (424) T ss_pred CCCCcceEee-----cCCCchHHHHHhhhcccccccccccccccccccccccccccccHHHhhccHHHHHHHHHHHHhhc Confidence 1111111111 223344556666665543211100000000 00 0011 122223345667777666 Q ss_pred ccCcee-cCC-ch-----hHHHHHHHHHhc--c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEE Q lcl|NC_010808. 108 GNPIQC-QDD-DK-----DVLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFV 174 (512) Q Consensus 108 g~~~~~-~~~-d~-----~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~ 174 (512) +-|+.+ ..+ +. .....+..++.. | ....+...+..+.+.+|.+|+++-++.+|++ .+..++|..+.+ T Consensus 76 ~lp~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~pl~~~~V~v 155 (424) T protein:vir:18 76 CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDV 155 (424) T ss_pred cCceEEEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEE Confidence 667654 111 11 112234555432 3 2345667788899999999999989988875 467788888776 Q ss_pred EEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCC Q lcl|NC_010808. 175 IYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERR 254 (512) Q Consensus 175 i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g 254 (512) ..++. ... |.... + . ..+ .|.++.|++++... .+...| T Consensus 156 ~~~~~---~~~-----y~~~~-~---g---~~~-~~~~~eIih~r~~~--------------------------~dg~~G 193 (424) T protein:vir:18 156 KLVGK---KVV-----YRYQR-D---S---EYA-DFSQKEIFHLKGFG--------------------------FTGLVG 193 (424) T ss_pred EEcCC---eEE-----EEEEe-C---C---eEE-EeccccEEEecCcC--------------------------CCCccc Confidence 55431 111 11110 0 0 011 24455555442110 011246 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc-CChhhhhhhhhccccccchhhhhhcccccCCCCCcceeE Q lcl|NC_010808. 255 KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLS-LDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGY 333 (512) Q Consensus 255 ~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (512) .|.++.+...++....+..-..+.+.-.+.|-.++..... .+++.....+..-. ..... .+.....-.+++.+++. T Consensus 194 ~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~-~~~~g--~nag~~~vl~~g~~~~~ 270 (424) T protein:vir:18 194 LSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFK-EIAGG--PVKKRLWILEAGFSTSA 270 (424) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHHHHHHHH-HHhCC--cccCCceeccCCceEEe Confidence 7777666666655444444444445556667666653222 23333222221110 00000 11111122345556666 Q ss_pred EeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010808. 334 IYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKN 412 (512) Q Consensus 334 l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~ 412 (512) ++.......+.+..+...+.|+..-++|....+...++. .+..++... ...+...|..+++.|...++. T Consensus 271 l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~----------~~f~~~tl~P~~~~ie~~l~~ 340 (424) T protein:vir:18 271 IGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFLQYTLQPYISRWENSIQR 340 (424) T ss_pred cCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHH----------HHHHHHHHHHHHHHHHHHHHh Confidence 654444455566677778889999999977765443332 222222211 123344555555555555543 Q ss_pred ccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_010808. 413 TRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYK 490 (512) Q Consensus 413 ~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~ 490 (512) +-.......-.-+++.+..-+..|..+.++++.++ +|+++.-.++++++.-.-+. -+... ........ T Consensus 341 ~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~g--GD~~~------~~~n~~~l-- 410 (424) T protein:vir:18 341 WLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVAM------RQSQYVPI-- 410 (424) T ss_pred hcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCeee------eccCccch-- Confidence 32111111111244444555678889999998887 68999988888875432100 00000 00000000 Q ss_pred CCCCCCCCCCCCCCcCcc Q lcl|NC_010808. 491 DPRDINDDEQDDDTKDTV 508 (512) Q Consensus 491 ~~~~~~~~~~~~~~~~~~ 508 (512) .+..++.+...+.. T Consensus 411 ----~~~~~~~~p~~~ga 424 (424) T protein:vir:18 411 ----TDLGTNKEPRNNGA 424 (424) T ss_pred ----HhhhccCCCccCCC Confidence 00000000000001 No 195 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=97.17 E-value=0.00014 Score=41.67 Aligned_cols=380 Identities=10% Similarity=0.037 Sum_probs=159.8 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri 90 (512) |+++.-.. |.+............. ...+.-|.-... . .... .-+ T Consensus 1 Mg~~~~~~--~~~~~~~~~~~~~~~~--------------------------~~~~~~~~~~~~----~-v~~~---~al 44 (385) T protein:vir:10 1 MGLLTPRN--FNKRKAKNMVYPSNPA--------------------------FFTTTVGGMQLS----Y-VSAL---SAL 44 (385) T ss_pred Cccccchh--cccccccccccccchh--------------------------hhhhhccccCcc----c-cCHH---Hhh Confidence 77664322 2222111111111000 001111100000 0 0000 001 Q ss_pred ecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccc Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAM 170 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~ 170 (512) ...-....|+.+++-+..-|+++. +......|.+=...-....+...+..+.+.+|.||+++..+. ..+..++|. T Consensus 45 ~~~~v~~~i~~ia~~ia~~p~~v~--~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~---~~~~p~~~~ 119 (385) T protein:vir:10 45 QNTNVYSVINRIASDVASAHFKTE--NTATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---LEHIPNSDV 119 (385) T ss_pred ccHHHHHHHHHHHHHHhhCceeee--ccchhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---eeEeecCCc Confidence 122344556666666666677653 222222222211112345566667778889999999986542 223333444 Q ss_pred eeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecC Q lcl|NC_010808. 171 STFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSN 250 (512) Q Consensus 171 ~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n 250 (512) .+.+..+.. . +.|+ +....+ . ....+.+..+++++.... |. .+ T Consensus 120 ~v~~~~~~~---~----~~~~-~~~~~~--~----~~~~~~~~eiihik~~~~--------------------~~---~~ 162 (385) T protein:vir:10 120 QINYLPGNM---G----IVYT-VLESND--R----PQMVLRQDQMLHFRLMPD--------------------PQ---YR 162 (385) T ss_pred eEEEEEcCC---c----eEEE-EEEcCC--c----eEEEEccccEEEeccCCC--------------------Cc---cc Confidence 443332211 0 1111 100000 0 001233444443322110 00 01 Q ss_pred CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcC-ChhhhhhhhhccccccchhhhhhcccccCCCCCc Q lcl|NC_010808. 251 NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSL-DPDEVKKQKEANVLFLEPTVYENRDTGIETEGSV 329 (512) Q Consensus 251 ~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (512) ...|.|.+..+...++....+..-..+.+.....|-.+++--... +++.....+..-.-.. .+ .+.......+++. T Consensus 163 ~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~-~~--~n~~~~~vl~~g~ 239 (385) T protein:vir:10 163 YLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKAN-TG--DNSGRLMVLPDGF 239 (385) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHh-Cc--cccCCccccCCCc Confidence 224778888777777766666555555566666676666532122 2333332221111000 00 1111112234455 Q ss_pred ceeEEeecCCHHH-HHHHHHHHHHHHHHHhcccccccccc-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 330 DGGYIYKQYDVQG-TEAYKDRLNSDIHMFTNTPNMKDDNF-SGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLE 407 (512) Q Consensus 330 ~~~~l~~~~~~~~-~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~ 407 (512) +++.++....... +.+..+.....|+..-++|....+.. .++.++..++.. ...|...|...++.|. T Consensus 240 ~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~-----------~~~~~~~l~P~~~~ie 308 (385) T protein:vir:10 240 DYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQI-----------KATYLANLNSYVNPIV 308 (385) T ss_pred eEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHH-----------HHHHHHHHHHHHHHHH Confidence 5555554322222 34566777788888889987665432 222222222211 1112223444444444 Q ss_pred HHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 408 TILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQ 485 (512) Q Consensus 408 ~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~ 485 (512) ..+...-.. ..+++.+..-+..|..+.++++.++ .|+++.-.+++.++.-.=+...+.... T Consensus 309 ~~l~~~l~~------~~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~~~~~~~----------- 371 (385) T protein:vir:10 309 DELRLKMNA------PDLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFK----------- 371 (385) T ss_pred HHHHHhhCC------ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCCCCCcccc----------- Confidence 433332111 1355556666778999999999887 689998888877643210000000000 Q ss_pred hhcccCCCCCCCCCCCCCCcCcccC Q lcl|NC_010808. 486 KGIYKDPRDINDDEQDDDTKDTVDK 510 (512) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (512) .+.+ .-+.+++.|+ T Consensus 372 -------~~~~----~~~~g~~~dn 385 (385) T protein:vir:10 372 -------PLTT----QVKGGDEGDN 385 (385) T ss_pred -------Cccc----ccCCCCCCCC Confidence 0000 0111222222 No 196 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=97.12 E-value=0.00016 Score=41.34 Aligned_cols=412 Identities=8% Similarity=0.019 Sum_probs=164.5 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCC--ch Q lcl|NC_010808. 41 NINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDD--DK 118 (512) Q Consensus 41 ~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~--d~ 118 (512) -.+.+.+++++.....-.....+.+|. |...-. ....-...-...-+..+.....|+.+++-+.+-|+.+--. +. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~~~~~--~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~~g 77 (460) T protein:vir:10 1 MANRIIRALRELTGLDNKFNDAFIKYI-GQTFTK--YDNNGKTYLEQGYNINPDVYSCISQMAAKTVAVPYTIKVVKDTK 77 (460) T ss_pred CchhHHHHHhhhhccCCCchHHHHHhh-ccccCC--CccchhhhhHHHHhcchHHHHHHHHHHHhhhhCceEEEeccCCc Confidence 112222222211111111111222222 211000 0000000000011223455555677777666666654111 10 Q ss_pred h-------------------------------HHHHHHHHHhc-c---ChhHHHHHHHHHHHhCCeEEEEEEECCC---- Q lcl|NC_010808. 119 D-------------------------------VLEAIEAFNDL-N---DVESHNRSLGLDLSIYGKAYELMIRNQD---- 159 (512) Q Consensus 119 ~-------------------------------~~~~l~~~~~~-n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~---- 159 (512) . ....+..++.. | ....+...+..+.+.+|.||+++.++.. T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~~~~ 157 (460) T protein:vir:10 78 AYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDGINA 157 (460) T ss_pred cchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCccC Confidence 0 00111223222 2 3445566778899999999999987644 Q ss_pred CceE-EEEEccceeEEEEeCCCCceeE-EEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccccccccc Q lcl|NC_010808. 160 DETR-LYKSDAMSTFVIYDNTIERNSI-AGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFES 237 (512) Q Consensus 160 g~~~-i~~~~p~~~~~i~d~~~~~~~~-~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (512) |.+. +..++|..+.+..++....... ..++.|... .+ . ....+.++.+.+++..... T Consensus 158 G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~--~~--g----~~~~~~~~evih~r~~~~~------------- 216 (460) T protein:vir:10 158 GVPSQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLI--QG--D----QFIEFNEDEVIHTKYANPN------------- 216 (460) T ss_pred ceeEEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEe--cC--c----eeEEecccceEEEecCCCC------------- Confidence 4443 6678888887766543211100 011111110 00 0 0112344444444321110 Q ss_pred ccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhh Q lcl|NC_010808. 238 HSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYE 317 (512) Q Consensus 238 ~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (512) .-.......|.|.+..+...+........-..+.+.-...|-.+++.....+++..+..+..-.-.... .. T Consensus 217 -------~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g--~~ 287 (460) T protein:vir:10 217 -------FDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFGFIHGGSTGLTQPQADSLKQRLTEMDKS--PD 287 (460) T ss_pred -------cccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceeeecCCCCCHHHHHHHHHHHHHHhcC--cc Confidence 000001124677777766666665555544454555556666665544444555544443221110000 00 Q ss_pred hcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccccccccccccc-chHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 318 NRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGT-QSGEAMKYKLFGLEQRTKTKEGLFT 396 (512) Q Consensus 318 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n-~Sg~Ai~~~~~~l~~k~~~~~~~~~ 396 (512) +.......+++.+++.++.......+.+..+...+.|+..-++|....+...+. .+...++. .....+. T Consensus 288 n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~----------~~~~f~~ 357 (460) T protein:vir:10 288 RLSQIAGASGEIAFTKISLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGNLEE----------ERKRVVT 357 (460) T ss_pred ccCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCccccHHH----------HHHHHHH Confidence 111112334555665555444445556667778888988888987765533221 12221211 1122334 Q ss_pred HHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHH Q lcl|NC_010808. 397 KGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFF--QDPELEVKK 472 (512) Q Consensus 397 ~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v--~d~~~E~~r 472 (512) ..|..++..|...++.+-..+.. ......+.|+........+...+...+ .|+++.-.+++.++.- +++-. ++ T Consensus 358 ~~l~P~~~~ie~~ln~kl~~~~~-~~~~~~i~~d~~~l~~l~~d~~~~~~~~~~g~~T~NE~R~~~g~~pi~~~~g--D~ 434 (460) T protein:vir:10 358 DNIQPDLVILKQAFDKKFIKRFK-GYENAVIEWDISELPEMQTDMVAMASWLNTIPVTPNEIRIAMKYETLNQDGM--DI 434 (460) T ss_pred HHHHHHHHHHHHHHHHhhcCccc-ccCCceEEeecchhhhHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCC--Ce Confidence 44555555554444432111111 111233445322211112222222222 6899998888887542 22100 00 Q ss_pred HHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccC Q lcl|NC_010808. 473 IEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDK 510 (512) Q Consensus 473 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (512) +. .. ....+- ....++..++++...+ T Consensus 435 ~~------~~-----~n~~~~-~~~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 435 VF------MP-----SNKVRI-DDVSNNLIDSAFNQNQ 460 (460) T ss_pred ee------ec-----ccccch-hhcccccCCCcccCCC Confidence 00 00 000000 0000011111111111 No 197 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=97.11 E-value=0.00017 Score=41.27 Aligned_cols=271 Identities=12% Similarity=0.071 Sum_probs=133.2 Q ss_pred hhccCceecCCchhHHHHHHHHHhc--c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCC Q lcl|NC_010808. 106 FLGNPIQCQDDDKDVLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNT 179 (512) Q Consensus 106 l~g~~~~~~~~d~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~ 179 (512) +..-|+.+...++.....+..++.. | ....+...+..+.+.+|.||+.+..+.+|.+ .+..++|..+.+..++. T Consensus 1 ia~l~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~v~v~~~~~ 80 (278) T protein:vir:78 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (278) T ss_pred CccceeEEEecCcccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCceeEEEEcCC Confidence 3333444322222223334444321 2 3456778888899999999999999988875 56778888887776543 Q ss_pred CCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchH Q lcl|NC_010808. 180 IERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYE 259 (512) Q Consensus 180 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~ 259 (512) . . .+ +|..... . . . . ..+.+..+.+++... +. +...|.|.+. T Consensus 81 ~-~-~~----~y~~~~~--~-g-~--~-~~~~~~evih~~~~~----------------~~---------~~~~G~s~~~ 122 (278) T protein:vir:78 81 S-R-EL----YYSIHAA--T-G-N--K-LIVHNMDMLHFKHIV----------------AS---------NMVQGISPID 122 (278) T ss_pred C-c-eE----EEEEEcC--C-c-e--E-EEEccccEEEECCCC----------------CC---------CCeeeccHHH Confidence 2 1 11 1111110 0 0 0 1 123344444432110 00 1124777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCC Q lcl|NC_010808. 260 KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYD 339 (512) Q Consensus 260 ~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 339 (512) .+...++....+... +.......|-.++......+++..+..++.-. ... .+.......+++.+++.++.... T Consensus 123 ~~~~~i~~~~~~~~~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~----~~~-~~~g~~~vl~~g~~~~~l~~~~~ 195 (278) T protein:vir:78 123 VLKNTTDFDNAVRTF--NLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFK----QYY-EENGGILFQEPGVEIEPLPKKYV 195 (278) T ss_pred HHHHHHHHHHHHHHH--HHHHhcCCCcEEEEeCCCCCHHHHHHHHHHHH----HHh-ccCCCceecCCCceEEEccCChh Confidence 777666654443322 22222234444444333444444444433211 111 11112223345566666665545 Q ss_pred HHHHHHHHHHHHHHHHHHhccccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Q lcl|NC_010808. 340 VQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA 418 (512) Q Consensus 340 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~ 418 (512) ...+.+..+...+.|+..-++|....+... ++-|.. + ......+...|+.+++.|...++.+-.... T Consensus 196 d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~--~----------~~~~~~~~~~l~P~~~~i~~~ln~~L~~~~ 263 (278) T protein:vir:78 196 SEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKN--E----------ELNRFYLQHTLLPIVKQYEEEFNRKLLTKT 263 (278) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccH--H----------HHHHHHHHHHHHHHHHHHHHHHHhhcCChh Confidence 556667777888889888899876655432 221211 0 111234444566666666665554322111 Q ss_pred ccccceeeEEeCCCCC Q lcl|NC_010808. 419 NKDFNTVRYVYNRNLP 434 (512) Q Consensus 419 ~~d~~~i~i~f~~~~p 434 (512) .. .....+.|+-+.- T Consensus 264 e~-~~g~~~~f~~~~l 278 (278) T protein:vir:78 264 DR-EKIGILNLTLNLI 278 (278) T ss_pred Hh-cCCceEEEecccC Confidence 11 1234566754333 No 198 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=97.08 E-value=0.00018 Score=41.13 Aligned_cols=397 Identities=10% Similarity=0.028 Sum_probs=169.4 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHH------------HHHHHHhc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRL------------KVLSDYYE 68 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~------------~~~~~yy~ 68 (512) |++ .+ -+.+.......+..+ ......+- T Consensus 1 ~~~---------------~l-------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 40 (434) T protein:vir:43 1 MSK---------------SL-------------------------GKVLSSATSAPRSSLFGWGGKTIRLTDGAFWSQFL 40 (434) T ss_pred Ccc---------------ch-------------------------hhhhhhcccccchhhhcccccccccCchHHHHHHh Confidence 111 11 111111111110000 00111122 Q ss_pred ccccccccccccccccccceeeecchHHHHHHHHHhhhhccCcee-cCC-c----hhHHHHHHHHHh--ccC---hhHHH Q lcl|NC_010808. 69 GKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQC-QDD-D----KDVLEAIEAFND--LND---VESHN 137 (512) Q Consensus 69 G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~-~~~-d----~~~~~~l~~~~~--~n~---~~~~~ 137 (512) |.... ........ .-+.+.=....|+.+++-+..-|+.+ ..+ + ......+..++. -|. -..+. T Consensus 41 g~~~~---~g~~v~~~---~al~~~~V~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~ 114 (434) T protein:vir:43 41 GRESS---SGKKVTVD---KAMKLSAVWACVRLISTSVAGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFW 114 (434) T ss_pred cCCcc---CCceechh---hhhccHHHHHHHHHHHHhhhhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHH Confidence 21000 00000000 00112222344666666666666654 111 1 112234555553 243 34667 Q ss_pred HHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEE Q lcl|NC_010808. 138 RSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVY 216 (512) Q Consensus 138 ~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~ 216 (512) ..+..+.+.+|.+|+++..+ .|++ .+..++|..+.+..+.. .. ++|+ ....+ . . ...+.++.+. T Consensus 115 ~~~~~~lll~Gnay~~i~~~-~G~~~~L~~l~p~~v~~~~~~~--g~----~~y~-~~~~~---g-~---~~~~~~~eVi 179 (434) T protein:vir:43 115 QAMVASMLLWGNAYAEIRRA-AGRPAALDFLLPSRVDLECDEN--GR----LKYF-YTTKK---G-A---RREIERTNML 179 (434) T ss_pred HHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEcCC--Ce----EEEE-EEecC---c-e---EEEEccccEE Confidence 77888999999999988665 5765 46778898888777543 21 1111 11110 0 0 1123444444 Q ss_pred EEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCC Q lcl|NC_010808. 217 RYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLD 296 (512) Q Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~ 296 (512) ++... | .+...|.|.+..+...+........-....+.-.+.|-.+++.....+ T Consensus 180 h~~~~----------------------~----~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~ 233 (434) T protein:vir:43 180 HIPAF----------------------T----LDGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRILQ 233 (434) T ss_pred EecCc----------------------C----CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCCCC Confidence 43211 0 012246666666555555444443333334444556666665443344 Q ss_pred hhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccc-hHH Q lcl|NC_010808. 297 PDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQ-SGE 375 (512) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~-Sg~ 375 (512) ++..+..+... -......+.....-.+++.+++-++.......+.+..+.....|+..-++|....+...+.. ++. T Consensus 234 ~e~~~~~r~~~---~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s 310 (434) T protein:vir:43 234 PAQREEFREYV---KSVSGAMNSGRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGT 310 (434) T ss_pred HHHHHHHHHHH---HHhcCccccCCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccc Confidence 44333332211 00000011111112244555555554444455566677788889999999976654432222 222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCCh Q lcl|NC_010808. 376 AMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQ 453 (512) Q Consensus 376 Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~ 453 (512) .++.. ....+...|..++..|...++.+-..........+++.+..-+..|..+.++.+.++ +|+++. T Consensus 311 ~~e~~----------~~~f~~~~L~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~ 380 (434) T protein:vir:43 311 GLEQQ----------MLAFLTFSISSITNQIQQCVNKRLLTAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTR 380 (434) T ss_pred hHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcCChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCCCcCH Confidence 22221 122334455555555555554332111111111234444455667888999998887 689999 Q ss_pred HHHHHhCCCCCCHHH----------HHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 454 TTLMSLFSFFQDPEL----------EVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 454 et~~~~~~~v~d~~~----------E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) -.++++++.-.-+.. -++.+.+.+.... . ...... .....+-+| T Consensus 381 NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~--------~-~~~~~~------~~~~~~~~~ 434 (434) T protein:vir:43 381 NEGRRKENLPELPGGDILTVQSNLVPIDQLGQSNKSQA--------V-RAALMN------WFSQPEPQE 434 (434) T ss_pred HHHHHHhCCCCCCCCCeEeeccCccchhhhhccCCCcc--------h-hhhhhc------cCCCCCCCC Confidence 888888754321110 0111111000000 0 000000 000001111 No 199 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=97.07 E-value=0.00018 Score=41.07 Aligned_cols=394 Identities=10% Similarity=0.055 Sum_probs=166.3 Q ss_pred HHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhc----ccccccccccccccccccceeeecchHHHHHHHHHhhhhccCc Q lcl|NC_010808. 36 SDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYE----GKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPI 111 (512) Q Consensus 36 ~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~----G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~ 111 (512) ....+...++.+.+ +.++.. +-.....-........-.+.-+.+.-....|+.+++-+..-|+ T Consensus 1 ~~~~~~~~~~k~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~ 67 (409) T protein:vir:94 1 MAKENIVTRIKKKL-------------IDNWIDQSASKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPL 67 (409) T ss_pred CcccccchhhhhHH-------------hhhhhcCCcccccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCce Confidence 00000111122111 111110 1000000000000000000012223444556666666666666 Q ss_pred eecCCchhHHHHHHHHHh--cc---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeE Q lcl|NC_010808. 112 QCQDDDKDVLEAIEAFND--LN---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSI 185 (512) Q Consensus 112 ~~~~~d~~~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~ 185 (512) ++--..+.....+..++. -| .-..+...+..+++.+|.+|+++.++..|++ .+..++|..+.++.++.. .. + T Consensus 68 ~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~-~~-~ 145 (409) T protein:vir:94 68 KMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQS-RE-L 145 (409) T ss_pred eEeecccccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEeCCC-cE-E Confidence 652222222233444442 23 3445566778889999999999999988875 577788988887776532 11 1 Q ss_pred EEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHH Q lcl|NC_010808. 186 AGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLI 265 (512) Q Consensus 186 ~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~li 265 (512) +|...... . . .+ .+.++.+.+++... | .+.-.|.|.+..+...+ T Consensus 146 ----~y~~~~~~---g-~--~~-~~~~~dvih~r~~~---------------------~----~~~~~G~s~l~~~~~~i 189 (409) T protein:vir:94 146 ----YYSIHAAT---G-N--KL-IVHNMDMLHFKHIV---------------------A----SNMVQGISPIDVLKNTT 189 (409) T ss_pred ----EEEEEcCC---c-e--EE-EEccccEEEecCCC---------------------C----CCccccccHHHHHHHHH Confidence 12111110 0 0 11 23344444442100 0 01124677776666666 Q ss_pred HHHHHHHHHHHHHHHHhc-CceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHH Q lcl|NC_010808. 266 DLYDNAESDTANYMSDLN-DAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTE 344 (512) Q Consensus 266 Da~~~~~s~~~~~~~~~~-~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 344 (512) +....+... + +..+. .+-.++......+++..+..++.-.- ...+.......+++.++..++.......+. T Consensus 190 ~~~~~~~~~--~-~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~-----~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~ 261 (409) T protein:vir:94 190 DFDNAVRTF--N-LTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQ-----YYEENGGILFQEPGVEIEPLPKKYVSEDIV 261 (409) T ss_pred HHHHHHHHH--H-HHhcCCCCeeEEecCCCCCHHHHHHHHHHHHH-----HhhcCCCeeecCCCceEEEcCCChhHHHHH Confidence 544433221 1 22333 33334433334444444443332111 111111122334555666665443344555 Q ss_pred HHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccce Q lcl|NC_010808. 345 AYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNT 424 (512) Q Consensus 345 ~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~ 424 (512) +..+.....|+..-++|....+... +.+...++. .....+...|..+++.|...++..-......+ .. T Consensus 262 e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~~sn~e~----------~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~-~~ 329 (409) T protein:vir:94 262 ASENLTRERVANVFQLPSVFLNARS-NTNFAKNEE----------LNRFYLQHTLLPIVKQYEEEFNRKLLTKTDRE-KN 329 (409) T ss_pred HHHHHHHHHHHHHhCCCHHHhCCCC-CCCcccHHH----------HHHHHHHHHHHHHHHHHHHHHHHhhCCccccc-Cc Confidence 5666677888888899876665322 222111111 11123344455555555444443221111111 12 Q ss_pred eeEEe--CCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCC Q lcl|NC_010808. 425 VRYVY--NRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQ 500 (512) Q Consensus 425 i~i~f--~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 500 (512) ..+.| ..-+-.|..+.++++.++ +|+++.-.+++.++.-.-+-. +.+.- .................++ T Consensus 330 ~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gg--D~~~~------~~n~~~~~~~~~~~~~~kG 401 (409) T protein:vir:94 330 RYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGG--DKPLI------SGDLYPIDTPLELRKSLKG 401 (409) T ss_pred ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--CeEee------cccccccccchhhcccccC Confidence 33445 344567888889998887 789999888888764221100 00000 0000000000000000111 Q ss_pred CCCCcCcc Q lcl|NC_010808. 501 DDDTKDTV 508 (512) Q Consensus 501 ~~~~~~~~ 508 (512) .++.+++. T Consensus 402 G~~n~~e~ 409 (409) T protein:vir:94 402 GDKNVNES 409 (409) T ss_pred CCCCcCCC Confidence 11111111 No 200 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=97.05 E-value=0.00019 Score=40.97 Aligned_cols=397 Identities=11% Similarity=0.049 Sum_probs=165.6 Q ss_pred cCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHH-hcccccccccccccccccccceeeecchHHHHH Q lcl|NC_010808. 21 FNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDY-YEGKTKNLVELTRRKEEYMADNRVAHDYASYIS 99 (512) Q Consensus 21 f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~y-y~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv 99 (512) +. ..+....+..-+ .. .+.+. ..+-.....-........-.+.-+.+....-.| T Consensus 1 ~~---------------~~~~~~~~~~~~---~~-------~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~V~~ci 55 (409) T protein:vir:93 1 MA---------------KENIVTRIKKKL---ID-------NWIDQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAI 55 (409) T ss_pred CC---------------ccchhhhhhhhh---hh-------hhhccccccccccccccCccccccchhhhhccHHHHHHH Confidence 00 000011111110 00 00000 000000000000000000000011233344556 Q ss_pred HHHHhhhhccCceecCCchhHHHHHHHHHh--cc---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeE Q lcl|NC_010808. 100 DFINGYFLGNPIQCQDDDKDVLEAIEAFND--LN---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTF 173 (512) Q Consensus 100 ~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~ 173 (512) +.+++-+..-|+.+--..+.....+..++. -| ....+...+..+++.+|.||+++.++..|++ .+..++|..+. T Consensus 56 ~~Ia~~ia~lp~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~ 135 (409) T protein:vir:93 56 TKLSNSMASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVE 135 (409) T ss_pred HHHHHhhhhCceeEeeccccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeE Confidence 666666666677653222333333444443 23 3455567788889999999999999988875 56778888887 Q ss_pred EEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCC Q lcl|NC_010808. 174 VIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNER 253 (512) Q Consensus 174 ~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~ 253 (512) +..++.. +.+. |.+.... . ..+ .+.++.+.+++... + .+.-. T Consensus 136 ~~~~~~~-~~~~-----y~~~~~~---g---~~~-~~~~~eVih~r~~~----------------~---------~~~~~ 177 (409) T protein:vir:93 136 MLIENQS-RELY-----YSIHAAT---G---NKL-IVHNMDMLHFKHIV----------------A---------SNMVQ 177 (409) T ss_pred EEEeCCC-cEEE-----EEEEcCC---c---eEE-EEccccEEEeCCCC----------------C---------CCccc Confidence 7765432 1111 2111110 0 011 23444444442210 0 01124 Q ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHhcC-ceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCccee Q lcl|NC_010808. 254 RKGDYEKVITLIDLYDNAESDTANYMSDLND-AMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGG 332 (512) Q Consensus 254 g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~-~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (512) |.|.+..+...++....+... + +..+.. +-.++......+++..+..++.-.- ...+.......+++.++. T Consensus 178 G~s~i~~~~~~i~~~~~~~~~--~-~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~-----~~~~~g~~~vl~~g~~~~ 249 (409) T protein:vir:93 178 GISPIDVLKNTTDFDNAVRTF--N-LTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQ-----YYEENGGILFQEPGVEIE 249 (409) T ss_pred cccHHHHHHHHHHHHHHHHHH--H-HHhcCCCCceEEecCCCCCHHHHHHHHHHHHH-----HhhcCCCeeecCCCceEE Confidence 667776665555544333211 1 233333 3333333333444444433322110 011111122334555666 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010808. 333 YIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKN 412 (512) Q Consensus 333 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~ 412 (512) .++.......+.+..+.....|+..-++|....+...+ .+...++. .....+...|..++..|...++. T Consensus 250 ~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~-~~~sn~e~----------~~~~f~~~~l~P~~~~ie~~l~~ 318 (409) T protein:vir:93 250 PLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSN-TNFAKNEE----------LNRFYLQHTLLPIVKQYEEEFNR 318 (409) T ss_pred EcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-CCcccHHH----------HHHHHHHHHHHHHHHHHHHHHHh Confidence 55543334455556666778898888998776654322 11111111 11123344455555555544443 Q ss_pred ccCCCcccccceeeEEe--CCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_010808. 413 TRSIDANKDFNTVRYVY--NRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGI 488 (512) Q Consensus 413 ~~~~~~~~d~~~i~i~f--~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~ 488 (512) .-...... .....+.| ..-+-.|..+.++++.++ +|+++.-.+++.++.-.-+- -++.. ........ T Consensus 319 ~Ll~~~~~-~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~g--gD~~~------~~~n~~~~ 389 (409) T protein:vir:93 319 KLLTKTDR-EKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEG--GDKPL------ISGDLYPI 389 (409) T ss_pred hcCCcccc-cCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCeee------eccccccc Confidence 22111111 11233445 344556888889988887 68999999888886532110 00000 00000000 Q ss_pred ccCCCCCCCCCCCCCCcCcccCC Q lcl|NC_010808. 489 YKDPRDINDDEQDDDTKDTVDKK 511 (512) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~ 511 (512) .. .....+....++++.+++ T Consensus 390 ~~---~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:93 390 DT---PLELRKSLKGGDKNVNES 409 (409) T ss_pred cc---chhhcccccCCCCCcCCC Confidence 00 000000000111111111 No 201 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=96.96 E-value=0.00024 Score=40.45 Aligned_cols=376 Identities=11% Similarity=0.039 Sum_probs=143.4 Q ss_pred HHHHHHHhccccccccccc-ccccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhc--c---Ch Q lcl|NC_010808. 60 LKVLSDYYEGKTKNLVELT-RRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDL--N---DV 133 (512) Q Consensus 60 ~~~~~~yy~G~~~~~~~~~-~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~--n---~~ 133 (512) +..+.+.+..+........ ..........-+........|+.+++-+..-|+.+-.........+..++.. | .. T Consensus 1 Mg~f~~lf~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll~~~PN~~~t~ 80 (395) T protein:vir:10 1 MSILEKIFKTRKDITYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKLNIKPNTDLSS 80 (395) T ss_pred CchhhhhhccCccccccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHHHhccCcCCCH Confidence 1111112221111000000 0000000011123345566677777766666776433333333334444432 3 23 Q ss_pred hHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEcccee--EEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEc Q lcl|NC_010808. 134 ESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMST--FVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFT 211 (512) Q Consensus 134 ~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~--~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt 211 (512) ..+...+..+.+..|.+|+++..+ +.. ..+++... ..++++. ...+... . . .....+. T Consensus 81 ~~f~~~~~~~lll~g~~~~~~~~~--~~~--~~~~~~~~~~~~~~~~~-----~~~~~~~-----~-~-----~~~~~~~ 140 (395) T protein:vir:10 81 DSFWQQVIYKLIYDNEVLIVVSDS--KEL--LIADSFYREEYALYDDI-----FKDVTVK-----D-Y-----TYQRTFT 140 (395) T ss_pred HHHHHHHHHHHhhCCceEEEEecC--CCe--EecCCccceeEeecCcc-----eeEEEEc-----C-c-----eeeeeec Confidence 445556667777777777665432 221 12222221 1122110 0001000 0 0 0011233 Q ss_pred CCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec Q lcl|NC_010808. 212 SHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKG 291 (512) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g 291 (512) ++.+++++... ......|.|.++.....++.... .....+.+--++.. T Consensus 141 ~~evih~~~~~-------------------------~~~~~~G~spi~~~~~~~~~~~~-------~~~~~~~~~gii~~ 188 (395) T protein:vir:10 141 MQEVIYLKYNN-------------------------NKVTHFVESLFEDYGKIFGRMIG-------AQLKNYQIRGILKS 188 (395) T ss_pred cccEEEEccCC-------------------------CCcccccchHHHHHHHHHHHHHH-------HHHhcCCCceEEEe Confidence 33333332110 00112355655555554443332 22233333333322 Q ss_pred -CCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecC-----CHHHHHHHHHHHHHHHHHHhccccccc Q lcl|NC_010808. 292 -NLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQY-----DVQGTEAYKDRLNSDIHMFTNTPNMKD 365 (512) Q Consensus 292 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-----~~~~~~~~~~~l~~~i~~~s~~p~~~~ 365 (512) ....+++..+..+.. .-................+++.+.+.++... ....+.+..+...+.|+..-++|.... T Consensus 189 ~~~~~~~e~~~~~~~~-~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l 267 (395) T protein:vir:10 189 ASSAYDEKNIEKLQAF-TNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLI 267 (395) T ss_pred CCCCCCHHHHHHHHHH-HHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHh Confidence 112233332222211 0000000000111111234444555444221 122455666777788888888987665 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHH Q lcl|NC_010808. 366 DNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYI 445 (512) Q Consensus 366 ~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~ 445 (512) +...++.+ .....++...|..++..|...+..+-... ......+++.++.-+-.|..+.++++. T Consensus 268 ~~~~sn~e---------------~~~~~~~~~~l~P~~~~ie~~l~~kL~~~-~~~~~~~~f~~~~l~~~D~~~~~~~~~ 331 (395) T protein:vir:10 268 YGETADLE---------------KNTLVFEKFCLTPLLKKIQNELNAKLITQ-SMYLKDTRIEIVGVNKKDPLQYAEAID 331 (395) T ss_pred cCcccCHH---------------HHHHHHHHHHHHHHHHHHHHHHHHhhcCh-hhhcccceecchhhhccCHHHHHHHHH Confidence 42211111 11222334455555555555444321111 111123445555666778888999988 Q ss_pred HH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHHHHhhccc-CCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 446 DS--GGKISQTTLMSLFSFF--QDPELEVKKIEEDEKESIKKAQKGIYK-DPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 446 kl--~g~~s~et~~~~~~~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~e 512 (512) ++ +|+++.-.++++++.- +++.. ++.. .......... ........+....++++.++|. T Consensus 332 ~~~~~G~lt~NE~R~~~g~~p~~~g~~--d~~~------~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 332 KLVSSGSFTRNEVRIMLGEEPSDNPEL--DEYL------ITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred HHHhCCCcCHHHHHHHhCCCCCCCCCC--ceee------eccccccccccccccCcccccccCCCCCCCCCC Confidence 76 6899998888888653 22210 0000 0000000000 0000001111112233333333 No 202 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=96.96 E-value=0.00024 Score=40.45 Aligned_cols=376 Identities=11% Similarity=0.039 Sum_probs=143.4 Q ss_pred HHHHHHHhccccccccccc-ccccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhc--c---Ch Q lcl|NC_010808. 60 LKVLSDYYEGKTKNLVELT-RRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDL--N---DV 133 (512) Q Consensus 60 ~~~~~~yy~G~~~~~~~~~-~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~--n---~~ 133 (512) +..+.+.+..+........ ..........-+........|+.+++-+..-|+.+-.........+..++.. | .. T Consensus 1 Mg~f~~lf~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll~~~PN~~~t~ 80 (395) T protein:vir:95 1 MSILEKIFKTRKDITYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKLNIKPNTDLSS 80 (395) T ss_pred CchhhhhhccCccccccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHHHhccCcCCCH Confidence 1111112221111000000 0000000011123345566677777766666776433333333334444432 3 23 Q ss_pred hHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEcccee--EEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEc Q lcl|NC_010808. 134 ESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMST--FVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFT 211 (512) Q Consensus 134 ~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~--~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt 211 (512) ..+...+..+.+..|.+|+++..+ +.. ..+++... ..++++. ...+... . . .....+. T Consensus 81 ~~f~~~~~~~lll~g~~~~~~~~~--~~~--~~~~~~~~~~~~~~~~~-----~~~~~~~-----~-~-----~~~~~~~ 140 (395) T protein:vir:95 81 DSFWQQVIYKLIYDNEVLIVVSDS--KEL--LIADSFYREEYALYDDI-----FKDVTVK-----D-Y-----TYQRTFT 140 (395) T ss_pred HHHHHHHHHHHhhCCceEEEEecC--CCe--EecCCccceeEeecCcc-----eeEEEEc-----C-c-----eeeeeec Confidence 445556667777777777665432 221 12222221 1122110 0001000 0 0 0011233 Q ss_pred CCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec Q lcl|NC_010808. 212 SHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKG 291 (512) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g 291 (512) ++.+++++... ......|.|.++.....++.... .....+.+--++.. T Consensus 141 ~~evih~~~~~-------------------------~~~~~~G~spi~~~~~~~~~~~~-------~~~~~~~~~gii~~ 188 (395) T protein:vir:95 141 MQEVIYLKYNN-------------------------NKVTHFVESLFEDYGKIFGRMIG-------AQLKNYQIRGILKS 188 (395) T ss_pred cccEEEEccCC-------------------------CCcccccchHHHHHHHHHHHHHH-------HHHhcCCCceEEEe Confidence 33333332110 00112355655555554443332 22233333333322 Q ss_pred -CCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecC-----CHHHHHHHHHHHHHHHHHHhccccccc Q lcl|NC_010808. 292 -NLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQY-----DVQGTEAYKDRLNSDIHMFTNTPNMKD 365 (512) Q Consensus 292 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-----~~~~~~~~~~~l~~~i~~~s~~p~~~~ 365 (512) ....+++..+..+.. .-................+++.+.+.++... ....+.+..+...+.|+..-++|.... T Consensus 189 ~~~~~~~e~~~~~~~~-~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l 267 (395) T protein:vir:95 189 ASSAYDEKNIEKLQAF-TNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLI 267 (395) T ss_pred CCCCCCHHHHHHHHHH-HHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHh Confidence 112233332222211 0000000000111111234444555444221 122455666777788888888987665 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHH Q lcl|NC_010808. 366 DNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYI 445 (512) Q Consensus 366 ~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~ 445 (512) +...++.+ .....++...|..++..|...+..+-... ......+++.++.-+-.|..+.++++. T Consensus 268 ~~~~sn~e---------------~~~~~~~~~~l~P~~~~ie~~l~~kL~~~-~~~~~~~~f~~~~l~~~D~~~~~~~~~ 331 (395) T protein:vir:95 268 YGETADLE---------------KNTLVFEKFCLTPLLKKIQNELNAKLITQ-SMYLKDTRIEIVGVNKKDPLQYAEAID 331 (395) T ss_pred cCcccCHH---------------HHHHHHHHHHHHHHHHHHHHHHHHhhcCh-hhhcccceecchhhhccCHHHHHHHHH Confidence 42211111 11222334455555555555444321111 111123445555666778888999988 Q ss_pred HH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHHHHhhccc-CCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 446 DS--GGKISQTTLMSLFSFF--QDPELEVKKIEEDEKESIKKAQKGIYK-DPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 446 kl--~g~~s~et~~~~~~~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~e 512 (512) ++ +|+++.-.++++++.- +++.. ++.. .......... ........+....++++.++|. T Consensus 332 ~~~~~G~lt~NE~R~~~g~~p~~~g~~--d~~~------~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:95 332 KLVSSGSFTRNEVRIMLGEEPSDNPEL--DEYL------ITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred HHHhCCCcCHHHHHHHhCCCCCCCCCC--ceee------eccccccccccccccCcccccccCCCCCCCCCC Confidence 76 6899998888888653 22210 0000 0000000000 0000001111112233333333 No 203 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=96.96 E-value=0.00024 Score=40.45 Aligned_cols=376 Identities=11% Similarity=0.039 Sum_probs=143.4 Q ss_pred HHHHHHHhccccccccccc-ccccccccceeeecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhc--c---Ch Q lcl|NC_010808. 60 LKVLSDYYEGKTKNLVELT-RRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDL--N---DV 133 (512) Q Consensus 60 ~~~~~~yy~G~~~~~~~~~-~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~--n---~~ 133 (512) +..+.+.+..+........ ..........-+........|+.+++-+..-|+.+-.........+..++.. | .. T Consensus 1 Mg~f~~lf~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll~~~PN~~~t~ 80 (395) T protein:vir:10 1 MSILEKIFKTRKDITYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKLNIKPNTDLSS 80 (395) T ss_pred CchhhhhhccCccccccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHHHhccCcCCCH Confidence 1111112221111000000 0000000011123345566677777766666776433333333334444432 3 23 Q ss_pred hHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEcccee--EEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEc Q lcl|NC_010808. 134 ESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMST--FVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFT 211 (512) Q Consensus 134 ~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~--~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt 211 (512) ..+...+..+.+..|.+|+++..+ +.. ..+++... ..++++. ...+... . . .....+. T Consensus 81 ~~f~~~~~~~lll~g~~~~~~~~~--~~~--~~~~~~~~~~~~~~~~~-----~~~~~~~-----~-~-----~~~~~~~ 140 (395) T protein:vir:10 81 DSFWQQVIYKLIYDNEVLIVVSDS--KEL--LIADSFYREEYALYDDI-----FKDVTVK-----D-Y-----TYQRTFT 140 (395) T ss_pred HHHHHHHHHHHhhCCceEEEEecC--CCe--EecCCccceeEeecCcc-----eeEEEEc-----C-c-----eeeeeec Confidence 445556667777777777665432 221 12222221 1122110 0001000 0 0 0011233 Q ss_pred CCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec Q lcl|NC_010808. 212 SHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKG 291 (512) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g 291 (512) ++.+++++... ......|.|.++.....++.... .....+.+--++.. T Consensus 141 ~~evih~~~~~-------------------------~~~~~~G~spi~~~~~~~~~~~~-------~~~~~~~~~gii~~ 188 (395) T protein:vir:10 141 MQEVIYLKYNN-------------------------NKVTHFVESLFEDYGKIFGRMIG-------AQLKNYQIRGILKS 188 (395) T ss_pred cccEEEEccCC-------------------------CCcccccchHHHHHHHHHHHHHH-------HHHhcCCCceEEEe Confidence 33333332110 00112355655555554443332 22233333333322 Q ss_pred -CCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecC-----CHHHHHHHHHHHHHHHHHHhccccccc Q lcl|NC_010808. 292 -NLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQY-----DVQGTEAYKDRLNSDIHMFTNTPNMKD 365 (512) Q Consensus 292 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-----~~~~~~~~~~~l~~~i~~~s~~p~~~~ 365 (512) ....+++..+..+.. .-................+++.+.+.++... ....+.+..+...+.|+..-++|.... T Consensus 189 ~~~~~~~e~~~~~~~~-~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l 267 (395) T protein:vir:10 189 ASSAYDEKNIEKLQAF-TNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLI 267 (395) T ss_pred CCCCCCHHHHHHHHHH-HHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHh Confidence 112233332222211 0000000000111111234444555444221 122455666777788888888987665 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHH Q lcl|NC_010808. 366 DNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYI 445 (512) Q Consensus 366 ~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~ 445 (512) +...++.+ .....++...|..++..|...+..+-... ......+++.++.-+-.|..+.++++. T Consensus 268 ~~~~sn~e---------------~~~~~~~~~~l~P~~~~ie~~l~~kL~~~-~~~~~~~~f~~~~l~~~D~~~~~~~~~ 331 (395) T protein:vir:10 268 YGETADLE---------------KNTLVFEKFCLTPLLKKIQNELNAKLITQ-SMYLKDTRIEIVGVNKKDPLQYAEAID 331 (395) T ss_pred cCcccCHH---------------HHHHHHHHHHHHHHHHHHHHHHHHhhcCh-hhhcccceecchhhhccCHHHHHHHHH Confidence 42211111 11222334455555555555444321111 111123445555666778888999988 Q ss_pred HH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHHHHhhccc-CCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 446 DS--GGKISQTTLMSLFSFF--QDPELEVKKIEEDEKESIKKAQKGIYK-DPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 446 kl--~g~~s~et~~~~~~~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~e 512 (512) ++ +|+++.-.++++++.- +++.. ++.. .......... ........+....++++.++|. T Consensus 332 ~~~~~G~lt~NE~R~~~g~~p~~~g~~--d~~~------~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 332 KLVSSGSFTRNEVRIMLGEEPSDNPEL--DEYL------ITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred HHHhCCCcCHHHHHHHhCCCCCCCCCC--ceee------eccccccccccccccCcccccccCCCCCCCCCC Confidence 76 6899998888888653 22210 0000 0000000000 0000001111112233333333 No 204 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=96.91 E-value=0.00026 Score=40.21 Aligned_cols=391 Identities=10% Similarity=0.057 Sum_probs=153.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCCch--hHHHH Q lcl|NC_010808. 46 SKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDK--DVLEA 123 (512) Q Consensus 46 ~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~--~~~~~ 123 (512) ..|.++........-.-+..+.-|.... ..-...-+.+.-.-..|+.+++-+..-|+.+...+. ..... T Consensus 1 m~~f~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~Al~~~~V~~~i~~Ia~~iA~lp~~~~~~~g~~~~~~~ 71 (406) T protein:vir:97 1 MSFFQPLGTSKVSYDDYISSVLAGDVSQ---------KYLGVSALKNSDILTATSIIAGDIARFPLVKKDVNGDIIHDED 71 (406) T ss_pred CccccccCCCCCCcchHHHHHhcCCCCc---------ccccchhhccHHHHHHHHHHHHhhhhCeeEEEecCccccccch Confidence 1111000000000000011111111110 000000011111122355555555444665432222 12233 Q ss_pred HHHHHh--cc---ChhHHHHHHHHHHHhCCeEEEEEEECC-CCce-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeee Q lcl|NC_010808. 124 IEAFND--LN---DVESHNRSLGLDLSIYGKAYELMIRNQ-DDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPI 196 (512) Q Consensus 124 l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~v~~d~-~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~ 196 (512) +..++. -| ....+...+..+.+.+|.||+++.++. .|.+ .+..++|..+.+..++. .++ . |.+... T Consensus 72 ~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~--~~~----~-y~~~~~ 144 (406) T protein:vir:97 72 INYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDN--HEI----V-YTFTDM 144 (406) T ss_pred HHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCC--ceE----E-EEEEec Confidence 555553 23 334667778888999999999998875 4554 56778888887765542 111 1 111111 Q ss_pred ccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 197 DKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTA 276 (512) Q Consensus 197 ~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~ 276 (512) . ... .+ .+.+..+++++.. | .+.-.|.|.+..+...++....+..-.. T Consensus 145 ~--~~~---~~-~~~~~evih~r~~----------------------~----~dg~~G~spi~~~~~~i~~~~a~~~~~~ 192 (406) T protein:vir:97 145 L--TAK---QV-KCFAHDVIHWKFF----------------------S----HDTILGRSPLLSLGDEIDLQTGGINTLI 192 (406) T ss_pred C--Cce---EE-EEccccEEEecCC----------------------C----CCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 1 000 11 2334444443210 0 0111367777666665554444433333 Q ss_pred HHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 277 NYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHM 356 (512) Q Consensus 277 ~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~ 356 (512) ..++....|-.++......+++..+..++.-.-.. .+ .+.......+.+.+...++.......+.+..+.....|.. T Consensus 193 ~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~-~g--~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~ 269 (406) T protein:vir:97 193 KFFKDGFSSGILTMKGAQLSGDARQRARQEFEKMR-EG--SVGGSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAK 269 (406) T ss_pred HHHhccCCCceEEecCCCCCHHHHHHHHHHHHHHh-cc--cccCceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHH Confidence 34444445544444333334444444332211000 00 0111111224455555554333333344455556777888 Q ss_pred HhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcC Q lcl|NC_010808. 357 FTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKS 436 (512) Q Consensus 357 ~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d 436 (512) .-++|....+..+.+ |..+ ......+...|..+++.|...+..+-... .+.....+.|. +..+ T Consensus 270 afgVPp~~lg~~~~~-~~~e------------~~~~~f~~~~l~P~~~~ie~~l~~kll~~--~~~~~~~i~fd--~~~~ 332 (406) T protein:vir:97 270 ALRVPSYKLGVNSPN-QSVA------------QLMEDYVTNDLPFYFDAITSELGLKTLND--KDRRLYHIEFD--TRSV 332 (406) T ss_pred HhCCCHHHcCCCCCc-chHH------------HHHHHHHHHHHHHHHHHHHHHHhhhhcCh--hhccceeEEEe--cCcc Confidence 888887776532221 2111 11112334445555555544444322111 11223345553 1223 Q ss_pred HHHHHHHHHHH--hccCChHHHHHhCCCCC--CHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 437 LIEELKAYIDS--GGKISQTTLMSLFSFFQ--DPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 437 ~~~~~~~~~kl--~g~~s~et~~~~~~~v~--d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ....++++.++ .|+++...+++.++.-. ++.. ++..- ...........+..+.......+++...++. T Consensus 333 ~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~g--D~~~~------~~n~~~~~~~~~~~~~~~~~~~gg~~~~~~~ 404 (406) T protein:vir:97 333 TGRNVDEIVKLVNNQILTPNQGLVELGKQKSTDPNM--DRYQS------SLNYVFLDKKEEYQDKVGIKGKGGEVNAEED 404 (406) T ss_pred chhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC--CeEee------ccCccchhcccccccccccccCCCCCCCCCC Confidence 44445555565 68999999888886432 2110 00000 0000000000000000000000111000000 No 205 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=96.89 E-value=0.00028 Score=40.08 Aligned_cols=436 Identities=10% Similarity=-0.009 Sum_probs=175.8 Q ss_pred hhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhcc--Cc---- Q lcl|NC_010808. 38 LLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGN--PI---- 111 (512) Q Consensus 38 ~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~--~~---- 111 (512) +......+.+.++ ++.-..+++.+.+|..-.- . ............|+.-+-+...++.+++.|.+- |+ T Consensus 1 mk~~~~~~~~~lk--R~~~e~~w~e~a~~tlP~~---~-~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~W 74 (510) T protein:vir:63 1 MKTTAAMLWEKLR--DGSVEQRAIEFAKTTLPYL---M-VDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPF 74 (510) T ss_pred ChhHHHHHHHHHh--ccchHHHHHHHHHhhcccc---C-CCCCCccccccCCCccchHHHHHHHHHHHHHhhhcCCCCcc Confidence 2221222222221 1122234455555543210 0 000111112223455567777888888877652 22 Q ss_pred -eecCCch-------------hHH-------HHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccc Q lcl|NC_010808. 112 -QCQDDDK-------------DVL-------EAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAM 170 (512) Q Consensus 112 -~~~~~d~-------------~~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~ 170 (512) ++...++ ++. ..+...+..++|.....++.++...+|.+.++ .++++. +++.++ . T Consensus 75 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~--~~~~~~-~~~~~p-l 150 (510) T protein:vir:63 75 FRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLY--RDSDAA-TVVAWS-L 150 (510) T ss_pred cccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEE--EcCCCc-EEEEEE-c Confidence 2222221 122 23555667789999999999999999997555 566654 455553 3 Q ss_pred eeEEEEeCCCCceeEEEEEEeeeeeec------------cCCcceEEEEEEEcCCcEEEEEecCCc-c-----ccccccc Q lcl|NC_010808. 171 STFVIYDNTIERNSIAGVRYLRTKPID------------KTDEDEVFTVDLFTSHGVYRYLTSRTN-G-----LKLTPRE 232 (512) Q Consensus 171 ~~~~i~d~~~~~~~~~~v~~~~~~~~~------------~~~~~~~~~~~~yt~~~~~~~~~~~~~-~-----~~~~~~~ 232 (512) .-|.+.-|. .+++...+|.++..... ....+....+++|+.- + .....+ . ....... T Consensus 151 ~~y~v~~d~-~G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V--~--~~~~~~~~~~sv~~e~dg~~ 225 (510) T protein:vir:63 151 RSYAVRRDA-TGRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHV--Q--RKKGTAMEYAELYHEIDGVR 225 (510) T ss_pred ceeEEeeCC-CcCeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEE--E--eecCCCceEEEEEEEecCce Confidence 335554443 34555555544332100 0011112223343311 0 111100 0 0000011 Q ss_pred -cccccccccccceEeec-----CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhc Q lcl|NC_010808. 233 -NGFESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEA 306 (512) Q Consensus 233 -~~~~~~~~~~vPvv~~~-----n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~ 306 (512) ......+|..+|++.++ ...+|+|..+...+-+..++.+.-...........|.+.+.-........+. .+ T Consensus 226 ~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~~~---~~ 302 (510) T protein:vir:63 226 VGKEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQ---DA 302 (510) T ss_pred eccccccccccCceeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhhc---cC Confidence 11122335556766542 4568999999999999999987666666655555554433210011111111 11 Q ss_pred cccccchhhhhhcccccCCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHH Q lcl|NC_010808. 307 NVLFLEPTVYENRDTGIETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGL 384 (512) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l 384 (512) +. ..+..+...+++.+. +..+.......++.++..|...-.. +.. ...+...||..+......+ T Consensus 303 ~~------------g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~-~l~-~~~~~rvTAtEV~~r~~E~ 368 (510) T protein:vir:63 303 EM------------GDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GAN-QRDAERVTAEEVRITAEEA 368 (510) T ss_pred CC------------ceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHHh-hcc-cCCCCCcCHHHHHHHHHHH Confidence 10 001111122233332 3345566666777776666554221 111 1223446777766653333 Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCcccc-cceeeEEeCCCCCcCHH-HHHHH----HHHHhc---c---C Q lcl|NC_010808. 385 EQRTKT-KEGLFTKGLRRRAKLLETILKNTRSIDANKD-FNTVRYVYNRNLPKSLI-EELKA----YIDSGG---K---I 451 (512) Q Consensus 385 ~~k~~~-~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d-~~~i~i~f~~~~p~d~~-~~~~~----~~kl~g---~---~ 451 (512) .+...- ..++-.+.+.-+++..+.++...+....+.+ .....+++..++-+... +.+.. +..+.+ + + T Consensus 369 ~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~aq~~~~i 448 (510) T protein:vir:63 369 ENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPRI 448 (510) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCchhcccceecchhHHHHHHHHHHHHHHHHHHHHhcCchhhhccC Confidence 322221 2333333444455555555544332222222 12222333222222111 00111 111111 1 1 Q ss_pred ChHHHHH----hCCCCC----CHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCc Q lcl|NC_010808. 452 SQTTLMS----LFSFFQ----DPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTK 505 (512) Q Consensus 452 s~et~~~----~~~~v~----d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (512) -...++. .+|.-+ -.++|++++.+++......++........+...-....-+- T Consensus 449 d~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~qq~~~~~~~~~~~~~~a~~~~~~~~g~ 510 (510) T protein:vir:63 449 SLPKMMDTIWAAFSVDTSQFYKSADELQAEAEQQRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred CHHHHHHHHHHHhCCChhHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC Confidence 1122322 333211 23556666655533222221111100000000000000000 No 206 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=96.87 E-value=0.00029 Score=39.99 Aligned_cols=377 Identities=9% Similarity=0.014 Sum_probs=158.7 Q ss_pred eeecccchhHHhhhcHHHHHHHHHHHHH-HHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhh Q lcl|NC_010808. 27 VVYTYDGTESDLLQNINEVSKYIEHHMD-YQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGY 105 (512) Q Consensus 27 ~~~~~~~~~~~~~~~~~~l~~~i~~~~~-~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~ 105 (512) +.. .+.. .+-+.... ...+.........-|..... . .. ...-+.+.-....|+.+++- T Consensus 1 Mg~------------~~~~-~~~k~~~~~~~~~~~~~~~~~~~~~~~~~----~-v~---~~~~l~~~~v~~~i~~ia~~ 59 (383) T protein:vir:10 1 MGL------------LTPK-NFSKRNAKNMVYPSNPAFFTTTVGGMQLS----Y-VS---ALSALQNTNVYSVINRIASD 59 (383) T ss_pred CCc------------cccc-ccccccccccccccchhhhhhhccCcccc----c-cc---hhHhhcchHHHHHHHHHHHh Confidence 110 0000 00000000 00000000001111100000 0 00 00112223334456666666 Q ss_pred hhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCCceeE Q lcl|NC_010808. 106 FLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSI 185 (512) Q Consensus 106 l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~ 185 (512) +..-|+++. +......|..-+..-....+...+..+++.+|.||+++..+. ..+..++|..+.+..+.. ... T Consensus 60 ia~~~~~~~--~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~~---~~~~p~~~~~v~~~~~~~---~~~ 131 (383) T protein:vir:10 60 VSSAHFKTE--NTATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---LEHIPNSDVQINYLPGNM---GIV 131 (383) T ss_pred hccCceeec--ccchhhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---eeEeecCcceEEEEEcCC---ceE Confidence 666666553 222222333222222456667778888999999999886542 223333444433332211 100 Q ss_pred EEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHH Q lcl|NC_010808. 186 AGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLI 265 (512) Q Consensus 186 ~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~li 265 (512) |.+.... . . ....|.++.+++++.... +. .+...|.|.++.+...+ T Consensus 132 -----~~~~~~~-~-~----~~~~~~~~evih~r~~~~--------------------~~---~~~~~G~s~l~~~~~~i 177 (383) T protein:vir:10 132 -----YTVLESN-D-R----PKMVLRQDQMLHFRLMPD--------------------PQ---YRYLIGRSPLESLQNAL 177 (383) T ss_pred -----EEEEEcC-C-c----eEEEEcccceEEeccCCC--------------------Cc---ccccccccHHHHHHHHH Confidence 1110000 0 0 011133344443321100 00 01124788888888888 Q ss_pred HHHHHHHHHHHHHHHHhcCceeeeecCCcC-ChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHH-H Q lcl|NC_010808. 266 DLYDNAESDTANYMSDLNDAMLLIKGNLSL-DPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQG-T 343 (512) Q Consensus 266 Da~~~~~s~~~~~~~~~~~~~lv~~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~-~ 343 (512) +....+..-....+.-...|-.++.-.... +++..+..+..-. ....+ .+.......+.+.+++.++.+..... + T Consensus 178 ~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~~-~~~~~--~n~~~~~vl~~g~~~~~l~~~~~d~~~l 254 (383) T protein:vir:10 178 NLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFE-KANTG--DNSGRLMVLPDGFDYTQLEMKTDVFKAL 254 (383) T ss_pred HHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHH-HHhCc--cccCCccccCCCceEEecCCChhHHHHH Confidence 777776666666666666666555432111 2332222221110 00000 01111122244555555554333322 3 Q ss_pred HHHHHHHHHHHHHHhccccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccc Q lcl|NC_010808. 344 EAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF 422 (512) Q Consensus 344 ~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~ 422 (512) .+..+...+.|+..-++|....+... ++.++..++ .....|...|..+++.|...+...-. . T Consensus 255 ~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~e-----------q~~~~~~~~l~P~~~~ie~~l~~~l~------~ 317 (383) T protein:vir:10 255 ADNSAYSADQISKAFGVPSDILGGGTSTESQHSNID-----------QIKATYLANLNSYVNPIVDELRLKMN------A 317 (383) T ss_pred HHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHH-----------HHHHHHHHHHHHHHHHHHHHHHHhhC------C Confidence 45667778889998899876654321 222111111 11122333455555555554443211 1 Q ss_pred ceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCC Q lcl|NC_010808. 423 NTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQ 500 (512) Q Consensus 423 ~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 500 (512) ..+++.+..-+..|..+.++++.++ .|+++...+++.++.-.-+..++ ........ T Consensus 318 ~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~-----------------~~~~~~~~----- 375 (383) T protein:vir:10 318 PDLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGFLPDNL-----------------PEFKPLTN----- 375 (383) T ss_pred ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCcc-----------------cccCCCcc----- Confidence 2356667777788999999999887 68999988888875422100000 00000000 Q ss_pred CCCCcCcccCC Q lcl|NC_010808. 501 DDDTKDTVDKK 511 (512) Q Consensus 501 ~~~~~~~~~~~ 511 (512) +- +..|+| T Consensus 376 ~~---~gGd~e 383 (383) T protein:vir:10 376 ET---KGGDDK 383 (383) T ss_pred cC---CCCCCC Confidence 00 111111 No 207 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=96.57 E-value=0.00051 Score=38.65 Aligned_cols=380 Identities=11% Similarity=0.000 Sum_probs=160.5 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri 90 (512) |+++......+...... ..++ ...+-+..... ..... ...-+ T Consensus 1 MGl~~~~~~~~~~~~~~--------------~~~~------------------~~~~~~~~~~~---~~~vt---~~~al 42 (394) T protein:vir:62 1 MGLRDRFSNYLFKKAEK--------------RGYL------------------DNVLGKSIRYS---GVYVT---DSNIL 42 (394) T ss_pred CchhhhhhhhccCCCCc--------------hhhh------------------hhhhhcccccC---ccccC---hhhhh Confidence 66554332221111100 0000 01111111100 00000 00012 Q ss_pred ecchHHHHHHHHHhhhhccCceecCCch--hHHHHHHHHHhc-c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCceEE Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGNPIQCQDDDK--DVLEAIEAFNDL-N---DVESHNRSLGLDLSIYGKAYELMIRNQDDETRL 164 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~~~~~~~~d~--~~~~~l~~~~~~-n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i 164 (512) .+.-....|+.+++-+..-|+.+...+. .....+..++.. | ....+...+..+.+.+|.+|+++-.+..+ T Consensus 43 ~~~~v~~~i~~Ia~~iA~lp~~v~~~~g~~~~~~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~---- 118 (394) T protein:vir:62 43 QSSDVYELLQDISNQMVLADIVVEDEFGNEIKDDIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIH---- 118 (394) T ss_pred ccHHHHHHHHHHHHhhcccceEEEcCCCcccchhhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecceee---- Confidence 2344566677777777777776532221 112233444433 3 23456667888899999999987432211 Q ss_pred EEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccc Q lcl|NC_010808. 165 YKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMP 244 (512) Q Consensus 165 ~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP 244 (512) . |..+.|..++.. ..+|.. .. ..|.++.+.+++.. + T Consensus 119 -~--~~~~~~~~~~~~-------~~~~~~--------~~----~~~~~~eiih~r~~----------------------~ 154 (394) T protein:vir:62 119 -L--ASNVFTELDDNL-------VEHFNI--------GG----HEIPPCMIRHVKNI----------------------G 154 (394) T ss_pred -c--cccceEEECCce-------EEEEee--------CC----EEechhheEEecCc----------------------C Confidence 1 223334433210 111110 00 01222333222110 0 Q ss_pred eEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCcCChhhhhhhhhccccccchhhhhhcccc Q lcl|NC_010808. 245 ITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIK--GNLSLDPDEVKKQKEANVLFLEPTVYENRDTG 322 (512) Q Consensus 245 vv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~--g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 322 (512) .+.-.|.|.+..+...++....+..-....+.-.+.|-.+++ +....+.+..+..++.-.-........+.. T Consensus 155 ----~d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~-- 228 (394) T protein:vir:62 155 ----ADHLRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILDQLESIDEARSV-- 228 (394) T ss_pred ----CCCccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHHHhccccccCce-- Confidence 011246777776666666555555545555555566765554 222222222222221110000000000000 Q ss_pred cCCCCCcceeEEeec--CCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 323 IETEGSVDGGYIYKQ--YDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 400 (512) Q Consensus 323 ~~~~~~~~~~~l~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~ 400 (512) .-.+.+.+.++.... .....+.+..+.....|+..-++|....+... ..+.+ ......+...|. T Consensus 229 ~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~sn~e-------------~~~~~~~~~~l~ 294 (394) T protein:vir:62 229 KMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELI-KEDIE-------------KAMMYIHNKAVR 294 (394) T ss_pred eEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-CcCHH-------------HHHHHHHHHHHH Confidence 011234444443322 23344555667777888888889876664322 11111 112333455566 Q ss_pred HHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHH Q lcl|NC_010808. 401 RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFF--QDPELEVKKIEED 476 (512) Q Consensus 401 ~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v--~d~~~E~~ri~~E 476 (512) .+++.|...+..+-... .+...+.+.|+.....+....++++.++ +|+++.-.++++++.- +++.. ..+.- T Consensus 295 P~~~~ie~~l~~kll~~--~~~~~~~~~fd~~~~~~~~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~g--d~~~~- 369 (394) T protein:vir:62 295 PIMKNFEDHLSLLFYAQ--NSGKRIKFKINILDFVTYSNKTNIGYNLVRTAITSPDNVADMLGFPKQNTKES--QAIYI- 369 (394) T ss_pred HHHHHHHHHHhhhhcCc--cccCceEEEechhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC--Ceeec- Confidence 66666655555322111 1223567888777767777788887776 6899999999888653 22211 10000 Q ss_pred HHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccC Q lcl|NC_010808. 477 EKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDK 510 (512) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (512) .......+..+..+...+++++.++ T Consensus 370 ---------~~n~~~~~~~~~~~~~~kgge~~en 394 (394) T protein:vir:62 370 ---------SNDVTEIGKKEATDGSLGGGEENEN 394 (394) T ss_pred ---------ccccccccccccccccCCCCCCCCC Confidence 0000000001111111122222222 No 208 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=96.55 E-value=0.00052 Score=38.57 Aligned_cols=393 Identities=12% Similarity=0.028 Sum_probs=163.8 Q ss_pred HHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceec- Q lcl|NC_010808. 36 SDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQ- 114 (512) Q Consensus 36 ~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~- 114 (512) ..+ .+..........+.-.-+.....|-... ........ ..-+.+.-....|+.+++-+.+-|+.+- T Consensus 1 m~~-------~~~~~~~~~~~~~~~~~~~~~~~g~~~s--~~~~~v~~---~~al~~~~v~~cv~~ia~~ia~lp~~~~~ 68 (419) T protein:vir:80 1 MFF-------SRQLLSNLGQTQPGSGGWVSALLGSARS--EAGQVVTP---ASALSLTVLQNCVTLLAESIAQLPVELYE 68 (419) T ss_pred CCc-------ccccccccCcCCCCcchhhHHhhccccc--ccCcccCh---HHhhccHHHHHHHHHHHHhhccCceEEEE Confidence 000 0000000000000000000001110000 00000000 0112233444567777777777777641 Q ss_pred --CCch--hHHHHHHHHHh--cc---ChhHHHHHHHHHHHhCCeEEEEEEECCCCceE-EEEEccceeEEEEeCCCCcee Q lcl|NC_010808. 115 --DDDK--DVLEAIEAFND--LN---DVESHNRSLGLDLSIYGKAYELMIRNQDDETR-LYKSDAMSTFVIYDNTIERNS 184 (512) Q Consensus 115 --~~d~--~~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~-i~~~~p~~~~~i~d~~~~~~~ 184 (512) .+.. .....+..++. -| ....+...+..+.+.+|.||+++.++..|++. +..++|..+.+..+... .+ T Consensus 69 ~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~~~~~~--~~ 146 (419) T protein:vir:80 69 RSGDDRKPATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVMKGPDL--KP 146 (419) T ss_pred ecCCCcccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCc--eE Confidence 1111 11223455543 23 34456677888899999999999999989864 67788888776654321 11 Q ss_pred EEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHH Q lcl|NC_010808. 185 IAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITL 264 (512) Q Consensus 185 ~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~l 264 (512) +|... +. ..+..+.++++.. .| .+...|.|.+..+... T Consensus 147 -----~y~~~-----~~------~~~~~~~i~h~~~----------------------~~----~d~~~G~s~i~~~~~~ 184 (419) T protein:vir:80 147 -----MYRVA-----GA------DPLPQRLVHHVRW----------------------MS----INGYTGLSPVLLHANA 184 (419) T ss_pred -----EEEEc-----Cc------cccchhheEEecC----------------------CC----CCCcccccHHHHHHHH Confidence 11110 00 0011111111110 00 0123477777766666 Q ss_pred HHHHHHHHHHHHHHHHHhcCceeeeecC--C--cCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCH Q lcl|NC_010808. 265 IDLYDNAESDTANYMSDLNDAMLLIKGN--L--SLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDV 340 (512) Q Consensus 265 iDa~~~~~s~~~~~~~~~~~~~lv~~g~--~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 340 (512) ++....+..-..+.+.-.+.|-.+++-. . ..+.+..+..+..-.-.... ..+.......+++.+++-++..... T Consensus 185 i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g--~~n~g~~~vl~~g~~~~~l~~s~~d 262 (419) T protein:vir:80 185 IGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWNAKFGG--SGNAKKVALLQEGMKFKPLSMTNVD 262 (419) T ss_pred HHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcC--ccccCCceecCCCceEEeccCChhh Confidence 6655554444444455556676666521 1 11222222222110000000 0000111223445556555544344 Q ss_pred HHHHHHHHHHHHHHHHHhccccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcc Q lcl|NC_010808. 341 QGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDAN 419 (512) Q Consensus 341 ~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~ 419 (512) ..+.+..+...+.|+..-++|....+... ++-|+. + ......+...|.-+++.|...++.+-.... T Consensus 263 ~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~--e----------~~~~~f~~~~l~P~~~~ie~~l~~kll~~~- 329 (419) T protein:vir:80 263 AALIDALRLSALDIARIYKIPAHMVNELERATFSNI--E----------HQSLQFVIYTLLPWVKRHEQAKTRDLLLPS- 329 (419) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccH--H----------HHHHHHHHHHHHHHHHHHHHHHhhhccCcc- Confidence 45556667778889888899876654332 221221 1 111223344455555555444443221111 Q ss_pred cccceeeEEe--CCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC--CHHHHHHHHHHHHHHHHHHHHhhcccCCC Q lcl|NC_010808. 420 KDFNTVRYVY--NRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQ--DPELEVKKIEEDEKESIKKAQKGIYKDPR 493 (512) Q Consensus 420 ~d~~~i~i~f--~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~--d~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 493 (512) +.....+.| ..-+..|..+.++.+.++ .|+++.-.++++++.-. +-+ ++ -.+..... ...+. T Consensus 330 -~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gGD-~~---------~~~~n~~~-~~~~~ 397 (419) T protein:vir:80 330 -ERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGD-IY---------LSPMNMVD-ASKPQ 397 (419) T ss_pred -ccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc-ee---------eecccccc-ccccc Confidence 112233444 444567888888888887 78999999888876421 100 00 00000000 00000 Q ss_pred CCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 494 DINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 494 ~~~~~~~~~~~~~~~~~~e 512 (512) +. ..++.++++...++-+ T Consensus 398 ~~-~~~~~~~~~~~~~~~~ 415 (419) T protein:vir:80 398 PI-PMGKTEPTKAALDEIG 415 (419) T ss_pred cc-cCCCCCchhhhHHHHH Confidence 00 0111111111111111 No 209 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=96.49 E-value=0.00057 Score=38.35 Aligned_cols=406 Identities=11% Similarity=-0.010 Sum_probs=180.3 Q ss_pred ccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHH-hccccccccccccccccccccee----- Q lcl|NC_010808. 16 NRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDY-YEGKTKNLVELTRRKEEYMADNR----- 89 (512) Q Consensus 16 ~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~y-y~G~~~~~~~~~~~~~~~~~~~r----- 89 (512) +...|.+...+-+-...... .+..-|. ..+ +.+..+ +.|-.+.. ........+ +.+ T Consensus 1 ~~~~i~~~~g~~~~~~~~~~--------~~~~~ia----~~~---~~~~~~~~~~~~p~~--~~il~~~~~-~~~~y~~m 62 (491) T protein:vir:79 1 MSKGLWVSPTEFVKFGEPDK--------SLSSQIA----TRA---RSIDFFALGMYLPNP--DPVLKALGK-DIRVYREL 62 (491) T ss_pred CCCeeeCCCCCcccccccch--------hHHHHHh----hhc---cccccccccccCcch--hHHHhhccC-CHHHHHHH Confidence 23344444443331111100 0111110 000 000010 11111100 000000000 000 Q ss_pred eecchHHHHHHHHHhhhhccCceecCC--chhHHHHHHHHHhccChhHHHHHHHHHHHhCCeE-EEEEEECCCCceE--- Q lcl|NC_010808. 90 VAHDYASYISDFINGYFLGNPIQCQDD--DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKA-YELMIRNQDDETR--- 163 (512) Q Consensus 90 i~~n~~~~iv~~~a~~l~g~~~~~~~~--d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a-~~~v~~d~~g~~~--- 163 (512) .......-.+.+...-+.+.++.+... ++...+.+.++++.-+|+.....+ .++.-||.+ ++.+|...+|... T Consensus 63 ~~D~~i~s~l~~Rk~av~~~~w~i~~~~~~~~~a~~i~e~l~~~~~~~~i~~~-lda~~~G~s~~Ei~w~~~~g~~~~~~ 141 (491) T protein:vir:79 63 RADAHVGGCVRRRKAAVKALEWGLDRGKAKSRVAKSIADVFADLDLSRIATEM-LDAVLYGYQPMEITWGKVGNYIVPID 141 (491) T ss_pred hhChHHHHHHHHHHHHHhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHH-HHhhhhcceeEEEEEeecCCeeeEEe Confidence 123566666777777788888888643 344557888888877787777666 468889975 4566765555543 Q ss_pred EEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccccc Q lcl|NC_010808. 164 LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERM 243 (512) Q Consensus 164 i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 243 (512) +...+|..+ .|+.. .-.++....... .+....+++.| T Consensus 142 l~~r~~~~f--~~d~~----------------------------------~~l~l~~~~~~~-------~g~~lp~~k~i 178 (491) T protein:vir:79 142 VVGKPADWF--VYDPE----------------------------------NQLRFRSKEHWV-------QGEELPARKFL 178 (491) T ss_pred eeeecccce--eeccC----------------------------------CceEEeecCCCC-------CceeecCCCeE Confidence 333333222 12211 001111110000 00111122222 Q ss_pred ceEee--cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhccc Q lcl|NC_010808. 244 PITEF--SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDT 321 (512) Q Consensus 244 Pvv~~--~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (512) -.++. ..++.|.|.+..+....=--+..+.+++..++.|+.|+++.+-..+.++++.+.+... ........ T Consensus 179 ~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~a~~~ek~~l~~a-------l~~~~~~a 251 (491) T protein:vir:79 179 VPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASDAETNLLLDR-------LEDMVQDA 251 (491) T ss_pred EEEecCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHH-------HHHHhcCe Confidence 22211 1356788999887777767777788999999999999998763333344333332211 01111222 Q ss_pred ccCCCCCcceeEEeec---CCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 322 GIETEGSVDGGYIYKQ---YDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKG 398 (512) Q Consensus 322 ~~~~~~~~~~~~l~~~---~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 398 (512) ....+.+.++++++.. .+...++.+++.+.+.|...--.-.++.++.++...|.. .. .-....+..-.+.+... T Consensus 252 ~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~~gs~a~~~v-h~--~v~~~i~~~D~~~i~~t 328 (491) T protein:vir:79 252 VAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALLGQNQTTEATSTRASAQA-GL--EVTDDIRDGDKAIVVEA 328 (491) T ss_pred EEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHhhhhhccCcccchhhHHH-HH--HHHHHHHHHHHHHHHHH Confidence 2334566788888653 234568888888888887744222223333233322221 11 12233344445566677 Q ss_pred HHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCH-HHHHHHHHHH--hcc-CChHHHHHhCCCCCCHHHHHHHHH Q lcl|NC_010808. 399 LRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSL-IEELKAYIDS--GGK-ISQTTLMSLFSFFQDPELEVKKIE 474 (512) Q Consensus 399 l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~-~~~~~~~~kl--~g~-~s~et~~~~~~~v~d~~~E~~ri~ 474 (512) +.++++.++.+ + .. . .....+.|.. +.+. .+.++.+.++ .|+ +|.+.+.+.++. ..++.+.+ . T Consensus 329 ln~li~~l~~~-N--~~-~----~~~p~f~~~e--~ee~~~~~a~~~~~L~~~G~~i~~~~~~e~~Gi-p~~~~~e~-~- 395 (491) T protein:vir:79 329 MNMLIRWICDL-N--FD-G----AARPVFDMWE--QEQVDEIQAGRDEKLTRAGARFTPAYFKRAYNL-QDGDLDER-P- 395 (491) T ss_pred HHHHHHHHHHh-c--CC-C----CCcceEeecC--cCchhHHHHHHHHHHHhCCCccCHHHHHHHhCC-CCCCCCcc-c- Confidence 77766665553 1 11 1 1122344433 3333 4567777777 465 888999898874 32221100 0 Q ss_pred HHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCC-C Q lcl|NC_010808. 475 EDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK-E 512 (512) Q Consensus 475 ~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-e 512 (512) .+..................+.+..+..-+. + T Consensus 396 ------~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 428 (491) T protein:vir:79 396 ------LPVSAVDAVGAASFAEFEAPDQDALDAALNALS 428 (491) T ss_pred ------cCcCcccccccccccccCCCCCcchHHHHHHHH Confidence 0000000000000000000000000000000 0 No 210 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=96.48 E-value=0.00058 Score=38.32 Aligned_cols=416 Identities=11% Similarity=0.103 Sum_probs=172.7 Q ss_pred CCc----------ceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_010808. 1 MLK----------ANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGK 70 (512) Q Consensus 1 ~~~----------~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~ 70 (512) .|. +-+|+-+.+.- -.++.++...-+ +...--+.-+ .||.|. T Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~---------------~~~~~~~~~l-~~~~~~ 107 (694) T protein:vir:10 56 AAPVAEPSPSLRLARQFEVDVSNY------------TPRERRAASYAL---------------DFNGTSMDAL-SFVTSS 107 (694) T ss_pred ccccCCCCcchhhhhhccccccCC------------Cccccchhhhhh---------------ccCcccccch-hhhhcc Confidence 111 11232221110 000000000000 0000000000 122221 Q ss_pred ccccccccccccccccceeeecchHHHHHHHHHhhhhccC---------------ce----ecC-CchhHHHHHHHHHhc Q lcl|NC_010808. 71 TKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNP---------------IQ----CQD-DDKDVLEAIEAFNDL 130 (512) Q Consensus 71 ~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~---------------~~----~~~-~d~~~~~~l~~~~~~ 130 (512) ..+-+.. -+.- ..++..+.++.+++..+.-+- ++ ... .+.+..+.|..-++. T Consensus 108 ~F~Gy~~-la~l-------aQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~er 179 (694) T protein:vir:10 108 GFPGFPT-LVLL-------AQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIER 179 (694) T ss_pred CcchHHH-HHHH-------hhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHH Confidence 1110000 0000 000111111111111111111 11 111 123555678888888 Q ss_pred cChhHHHHHHHHHHHhCCeEEEEEEECCCCc-----------------e-EEEEEccceeEEEEeCCCCceeEEEEEEee Q lcl|NC_010808. 131 NDVESHNRSLGLDLSIYGKAYELMIRNQDDE-----------------T-RLYKSDAMSTFVIYDNTIERNSIAGVRYLR 192 (512) Q Consensus 131 n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~-----------------~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~ 192 (512) -++...+.++.+.+-.||.+..++-.+.++. + .+.+++|..+.|-.-+. ..+.. T Consensus 180 l~V~~~l~eaik~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~--~dP~s------ 251 (694) T protein:vir:10 180 LRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNS--INPVA------ 251 (694) T ss_pred HHHHHHHHHHHHhhccccceEEEEEeecCccccccccccccccccCcceeeeEeecccccccchhhh--ccchh------ Confidence 8889999999999999999987776544331 0 14555666655532110 01110 Q ss_pred eeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEee---cCCCCCCcchHHHHHHHHHHH Q lcl|NC_010808. 193 TKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEF---SNNERRKGDYEKVITLIDLYD 269 (512) Q Consensus 193 ~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~---~n~~~g~s~~~~v~~liDa~~ 269 (512) -.+|-|+.++ +.. .. .. ...-+.|...|+-.. ..+-.|.|....+.+.+++.+ T Consensus 252 --------------pdfgkP~~y~--V~G-~~-IH------~SRL~~f~g~plPd~LKp~y~~~G~Sv~q~~~e~V~~~~ 307 (694) T protein:vir:10 252 --------------DDFYKPSTWW--MIG-TE-VH------ATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWL 307 (694) T ss_pred --------------hccCCCceEE--Eec-eE-Ee------eeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHH Confidence 0111111100 000 00 00 000000111111110 113368888888999999988 Q ss_pred HHHHHHHHHHHHhcCceeeeecCC--cCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHH Q lcl|NC_010808. 270 NAESDTANYMSDLNDAMLLIKGNL--SLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYK 347 (512) Q Consensus 270 ~~~s~~~~~~~~~~~~~lv~~g~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 347 (512) ++.-.....+..++...+.. ++. ........ .. .+..... ....+.... -.+ +.+=+|.+.+.+..++...+ T Consensus 308 rT~~~v~~Li~~~~v~~lk~-dla~~L~~g~~~~-l~-~R~eli~-~~Rsn~G~~-llD-k~~Eefeq~stslSGLddVi 381 (694) T protein:vir:10 308 RTRQSVSDIVKQFSVSGILM-DLAQALMPGANVD-LS-MRAELIN-RYRDNRNIL-FLD-KATEEFFQFNTPLSGLDALQ 381 (694) T ss_pred HHHhHHHHHHHhhhhHHHHH-HHHHhhcChhHHH-HH-HHHHHHH-HhcCccceE-EEe-cCCcceEEEecccCCHHHHH Confidence 88777776665444433211 100 00000000 00 0000000 000111111 112 11223556677889999999 Q ss_pred HHHHHHHHHHhccccccccccc---ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccce Q lcl|NC_010808. 348 DRLNSDIHMFTNTPNMKDDNFS---GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNT 424 (512) Q Consensus 348 ~~l~~~i~~~s~~p~~~~~~~~---~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~ 424 (512) ....+.|...+++|-.-+...+ =|.||++=...+...+.- ..+..+...|++++.+|.. +..+.. +. + T Consensus 382 ~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s--~Qe~~L~p~L~rl~~ii~r--S~~G~i----dp-~ 452 (694) T protein:vir:10 382 AQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRA--YQRNALQQLMNDVIVMIQL--SLFGAV----DP-S 452 (694) T ss_pred HHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHH--HhcCCC----CC-c Confidence 9999999999999976654432 367888765555555433 3366788899888877743 222222 22 5 Q ss_pred eeEEeCCCCCcCHHHHHHHHHHH---------hccCChHHHHHhC------CCCC--CHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_010808. 425 VRYVYNRNLPKSLIEELKAYIDS---------GGKISQTTLMSLF------SFFQ--DPELEVKKIEEDEKESIKKAQKG 487 (512) Q Consensus 425 i~i~f~~~~p~d~~~~~~~~~kl---------~g~~s~et~~~~~------~~v~--d~~~E~~ri~~E~~~~~~~~~~~ 487 (512) |.+.|+|--..+..+.|+.-.|- .|+|+...+..++ ++.. |...+=-...++...- .. T Consensus 453 i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~---~~-- 527 (694) T protein:vir:10 453 IKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDG---VL-- 527 (694) T ss_pred ceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhh---hH-- Confidence 78899998888899888874431 4666665555553 1210 1000000000000000 00 Q ss_pred cccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 488 IYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ... +...+.++..+.++ T Consensus 528 -----~~~---~~~~~~~~~~~~~~ 544 (694) T protein:vir:10 528 -----TYV---QRLAEGGDTGAPGG 544 (694) T ss_pred -----hhh---cCcccccccCCCCc Confidence 000 00111111111111 No 211 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=96.44 E-value=0.00063 Score=38.15 Aligned_cols=401 Identities=10% Similarity=-0.011 Sum_probs=184.8 Q ss_pred ccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHh-ccccccccccccc-cccccc--ce-e- Q lcl|NC_010808. 16 NRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYY-EGKTKNLVELTRR-KEEYMA--DN-R- 89 (512) Q Consensus 16 ~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy-~G~~~~~~~~~~~-~~~~~~--~~-r- 89 (512) +...|.+...+-+-..... +.+...|... ....++. .|-. +...... ...++. .+ . T Consensus 1 m~~~i~~~~g~p~~~~~~~--------~~~~~~ia~~--------~~~~~~~~~~~~--~~~~~~iLr~~~~~~~~y~~m 62 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEPD--------KSLSSQIATR--------ARSIDFFALGMY--LPNPDPVLKALGKDIRVYREL 62 (491) T ss_pred CCCceeCCCCCccCcccCC--------hHHHHHHHhh--------hcccccccccCC--ccchHHHHHhcCCCHHHHHHH Confidence 3334555544443211111 1111111100 0000110 0100 0000000 000000 00 0 Q ss_pred eecchHHHHHHHHHhhhhccCceecC--CchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeE-EEEEEECCCCceEEE- Q lcl|NC_010808. 90 VAHDYASYISDFINGYFLGNPIQCQD--DDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKA-YELMIRNQDDETRLY- 165 (512) Q Consensus 90 i~~n~~~~iv~~~a~~l~g~~~~~~~--~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a-~~~v~~d~~g~~~i~- 165 (512) .......-.+.+...-+.+.++.+.. +++...+.+.++++.-.|+.....+. ++.-||.+ ++.+|...+|...+. T Consensus 63 ~~D~~i~s~l~~Rk~av~~~~w~i~~~~~~~~~~e~v~e~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~ 141 (491) T protein:vir:10 63 RADAHVGGCVRRRKAAVKALEWGLDRGKAKSRVAKSIADVFADLDLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPID 141 (491) T ss_pred hhChHHHHHHHHHHHHHhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEEE Confidence 12356666777777778889988864 34455678889888878888887775 78889975 566676555554332 Q ss_pred --EEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccccc Q lcl|NC_010808. 166 --KSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERM 243 (512) Q Consensus 166 --~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 243 (512) ..+|..+ .|+.. .-.++....... .+..-.+++.| T Consensus 142 l~~r~~~~f--~~d~~----------------------------------~~l~~~~~~~~~-------~g~~l~~~k~i 178 (491) T protein:vir:10 142 VVGKPADWF--VYDPE----------------------------------NQLRFRSKDHWM-------QGEELPARKFL 178 (491) T ss_pred eeeecccce--eeccC----------------------------------CceEEecCCCCC-------CcceecCCCEE Confidence 2333221 12211 111111111000 00011122222 Q ss_pred ceEee--cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhccc Q lcl|NC_010808. 244 PITEF--SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDT 321 (512) Q Consensus 244 Pvv~~--~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (512) -.++- ..++.|.|.+..+....-.-+..+.+++..++.|+.|+++.+--.+.++++...+... ........ T Consensus 179 ~~~~~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~l~~a-------l~~~~~~a 251 (491) T protein:vir:10 179 VPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASDGEKNLLLDC-------LEDMVQDA 251 (491) T ss_pred EEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHH-------HHHHhcCc Confidence 11111 1356788999988887777888889999999999999998774333344433332221 11112222 Q ss_pred ccCCCCCcceeEEeecC---CHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 322 GIETEGSVDGGYIYKQY---DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKG 398 (512) Q Consensus 322 ~~~~~~~~~~~~l~~~~---~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 398 (512) ....+.+.++++++... +...++.+++.+.+.|...--.=.++.++.++...|.. ... -....++.-.+.+... T Consensus 252 ~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~~gs~a~~~v-h~~--v~~di~~~D~~~i~~t 328 (491) T protein:vir:10 252 VAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLGQNQTTEATSTRASAQA-GLE--VTDDIRDGDKAVVSEA 328 (491) T ss_pred EEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhhhhcccCcccchhHHHH-HHH--HHHHHHHHHHHHHHHH Confidence 33345667888887542 34568888888888887754322223333222222221 111 1222333344566677 Q ss_pred HHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hcc-CChHHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_010808. 399 LRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGK-ISQTTLMSLFSFFQDPELEVKKIEE 475 (512) Q Consensus 399 l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~-~s~et~~~~~~~v~d~~~E~~ri~~ 475 (512) +.++++-++.+ ... . .+ ...+.|... ..+..+.++.+.++ .|+ ++.+.+.+.++. ..++.+.. T Consensus 329 ln~li~~l~~~---N~~-~--~~--~p~f~~~~~-~e~~~~~a~~~~~L~~~G~~i~~~~i~e~~Gi-p~~~~~~~---- 394 (491) T protein:vir:10 329 MNMLIRWICDL---NFD-G--AD--RPVFDMWEQ-EQVDEIQAGRDQKLTQAGARFTPAYFKRAYNL-QDGDLDER---- 394 (491) T ss_pred HHHHHHHHHHh---cCC-C--CC--cceEEecCc-CchhHHHHHHHHHHHhCCCcCCHHHHHHHhCC-CCCCcCcc---- Confidence 77666655543 111 1 11 234556433 23335678888887 465 888998888874 32221100 Q ss_pred HHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 476 DEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 476 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) ..+ ...+.........+......+..+ T Consensus 395 ----~~~------~~~~~~~~~~~~~~~~~~~~~~~d 421 (491) T protein:vir:10 395 ----PLP------VSAVDTVGAASFAEFEAPDQDALD 421 (491) T ss_pred ----ccc------cCCCCCcccccccccCCCCCCchH Confidence 000 000000000000000110000001 No 212 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=96.37 E-value=0.0007 Score=37.89 Aligned_cols=385 Identities=10% Similarity=0.058 Sum_probs=154.5 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri 90 (512) |++.. +|....... ...... .++. ...+.. .. .. ...++ T Consensus 1 Mg~~~----~f~~k~~~~---------~~~~~~---~~~~-------------~~~~~~----~~-----~~---~~~~~ 39 (403) T protein:vir:80 1 MGLFN----FFRRKTRSE---------PTNAIS---WFLT-------------QEAYDT----LA-----IP---GYTRL 39 (403) T ss_pred Ccccc----ccccccccc---------ccchhh---hhcc-------------cccccc----cc-----cc---hhhhh Confidence 66653 233322110 000000 0000 000000 00 00 00011 Q ss_pred e-cchHHHHHHHHHhhhhccCcee-cC-Cc--hhHHHHHHHHHh--ccCh---hHHHHHHHHHHHh--CCeEEEEEEECC Q lcl|NC_010808. 91 A-HDYASYISDFINGYFLGNPIQC-QD-DD--KDVLEAIEAFND--LNDV---ESHNRSLGLDLSI--YGKAYELMIRNQ 158 (512) Q Consensus 91 ~-~n~~~~iv~~~a~~l~g~~~~~-~~-~d--~~~~~~l~~~~~--~n~~---~~~~~~~~~~~~~--~G~a~~~v~~d~ 158 (512) . .+.....|+.+++-+..-|+.+ .. ++ ......+..++. -|.. ..+...+..+++. .|.||+++..+. T Consensus 40 ~~~~~V~~~I~~ia~~iA~~p~~~~~~~~~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~ 119 (403) T protein:vir:80 40 SDNPEVRMAVHKIAELISSMTIHLMQNTDNGDIRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTT 119 (403) T ss_pred hhhHHHHHHHHHHHHhhhhCceEEEEecCCceeecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcC Confidence 1 1233445666666666666654 11 11 112223444443 2322 2444455556666 467888888888 Q ss_pred CCce-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccccccccc Q lcl|NC_010808. 159 DDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFES 237 (512) Q Consensus 159 ~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (512) .|++ .+..++|..+.++.+++. . .+. |. ...|.++.+.++.... T Consensus 120 ~g~~~~L~~l~p~~v~~~~~~~g-~----~~~-y~--------------~~~~~~~eiih~~~~~--------------- 164 (403) T protein:vir:80 120 SGLIDELIPLAPSKVSFVDTDTG-Y----QIW-YQ--------------GKAYNYDEVLHFIVNP--------------- 164 (403) T ss_pred CCcEEEEEEEcCCeeEEEEcCCc-e----EEE-Ee--------------ecccchhhEEEEeccC--------------- Confidence 8876 466788887766655431 0 011 10 0112333333332100 Q ss_pred ccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhh Q lcl|NC_010808. 238 HSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYE 317 (512) Q Consensus 238 ~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (512) .|.. .-.|.|.+..+...+........-....+.-...|-.++.-.....++..+..++.. ...-..... T Consensus 165 -----~~~~----~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 234 (403) T protein:vir:80 165 -----DPEK----PYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAV-FKKYLEASE 234 (403) T ss_pred -----CCcC----ccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCChHHHHHHHHHH-HHHHhhhhh Confidence 0110 013666666555555555444333334444455666666443323332222222111 100000000 Q ss_pred hcccccCCCCCcceeEEee-cCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 318 NRDTGIETEGSVDGGYIYK-QYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFT 396 (512) Q Consensus 318 ~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~ 396 (512) .....+-..+..+..-+++ ......+.+..+.....|+..-++|....+. +.+.+.... ..+. T Consensus 235 ~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~-~~~~~~~~~---------------~f~~ 298 (403) T protein:vir:80 235 AGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPAFLLGV-GKYDKDEYN---------------NFIN 298 (403) T ss_pred cCCeeeecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHcCC-CCccHHHHH---------------HHHH Confidence 0011111111222222221 1222344456667777888888888755542 212222111 1334 Q ss_pred HHHHHHHHHHHHHHHhccCCCcccccceeeEEe--CCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHH Q lcl|NC_010808. 397 KGLRRRAKLLETILKNTRSIDANKDFNTVRYVY--NRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKK 472 (512) Q Consensus 397 ~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f--~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~r 472 (512) ..|..+++.|...+..+-..+. +..+.| ..-+..|..+.++++.++ +|+++.-.+++.++.-..+... + T Consensus 299 ~~l~P~~~~ie~~l~~kll~~~-----~~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~ggd--~ 371 (403) T protein:vir:80 299 STILPIAKGIEQELTRKLLISP-----DLYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKEGLS--E 371 (403) T ss_pred HHHHHHHHHHHHHHHHhccCCC-----CcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--e Confidence 4566666555555543221111 123445 455677888899988886 6899999998888653321100 0 Q ss_pred HHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 473 IEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 473 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) +.. .......... ...+..+..+. +.+.++.| T Consensus 372 ~~~------~~n~~pl~~~-~~~~~~k~ge~-~~~~~~~~ 403 (403) T protein:vir:80 372 LVI------LENYIPLDKI-GDQNKLKGGEK-GGADGQTD 403 (403) T ss_pred Eee------cccccchhhc-cchhhccCCCC-CCCCCCCC Confidence 000 0000000000 00000011111 11111111 No 213 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=96.36 E-value=0.0007 Score=37.88 Aligned_cols=424 Identities=9% Similarity=-0.014 Sum_probs=165.3 Q ss_pred hhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHH--------HHHHHHHHHHHHHhcccccccccccccccccc Q lcl|NC_010808. 14 RENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHM--------DYQRPRLKVLSDYYEGKTKNLVELTRRKEEYM 85 (512) Q Consensus 14 ~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~--------~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~ 85 (512) |...++++.....-.-. ............. ..-.+ .+..+..|..... .+..+. T Consensus 1 M~~~~~l~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~g~~~~~-----~~~~g~ 62 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRM----------SIDDYAQMLNEFAFNGIGYGFGGGVP---RIQQTLAGPSTEL-----APDTFV 62 (466) T ss_pred CchhHHHhhccCccccc----------chhhhhhhhhhhhccccccccccccH---HHHHhhccccccc-----cCcccc Confidence 33334433332211000 0000000000000 00000 1111112211100 000011 Q ss_pred c---ceeeecchHHHHHHHHHhhhhccCceecCCch----h-HHHHHHHHHhc-c---ChhHHHHHHHHHHHhCCeEEEE Q lcl|NC_010808. 86 A---DNRVAHDYASYISDFINGYFLGNPIQCQDDDK----D-VLEAIEAFNDL-N---DVESHNRSLGLDLSIYGKAYEL 153 (512) Q Consensus 86 ~---~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~----~-~~~~l~~~~~~-n---~~~~~~~~~~~~~~~~G~a~~~ 153 (512) . ..-+.+......|+.++.-+..-|+.+.-.++ + ....+..++.. | ....+...+..+++.+|.||++ T Consensus 63 ~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~ 142 (466) T protein:vir:81 63 GLATQAYQANGPVFACMLVRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWT 142 (466) T ss_pred ccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEE Confidence 0 01133456667788888877777876532111 1 11233444432 2 3445667788899999999999 Q ss_pred EEECCCCc---------eEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCc Q lcl|NC_010808. 154 MIRNQDDE---------TRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTN 224 (512) Q Consensus 154 v~~d~~g~---------~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~ 224 (512) +..++.|. ..+..++|..+.+..+....... ..+ |... ..........+.++.+++++... T Consensus 143 i~r~~~g~l~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~--~y~-~~~~-----~~~~~~~~~~~~~~dviHir~~~-- 212 (466) T protein:vir:81 143 IVDGEFVRMRPDWVDVVVEERMVRGGRGELGGGQLGWRKV--GYL-YTEG-----GRQSGNESVGFLAEDVVHFAPIP-- 212 (466) T ss_pred EEecCccccccccCcceeEEEEecCcceEEEEcCCCceEE--EEE-EEec-----CcccccceeeeccccEEEEcCCC-- Confidence 98877654 23555666666665543221111 111 1000 00000000112333333321100 Q ss_pred cccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhh Q lcl|NC_010808. 225 GLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQK 304 (512) Q Consensus 225 ~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~ 304 (512) +++ +...|.|.+..+...|+....+..-....+.-...|-.+++-....+++.....+ T Consensus 213 -------------~~~---------d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~ 270 (466) T protein:vir:81 213 -------------DPL---------ASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHNPMADPAAVKKWA 270 (466) T ss_pred -------------Ccc---------cccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHH Confidence 000 0114677777666666655555444454555566677666643334444444333 Q ss_pred hccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccccccccccc--ccchHHHHHHHHH Q lcl|NC_010808. 305 EANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS--GTQSGEAMKYKLF 382 (512) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~--~n~Sg~Ai~~~~~ 382 (512) +.-.-.... ..+.......+++.+++.++.......+.+..+...+.|+..-++|....+... +..++..++.. T Consensus 271 ~~~~~~~~g--~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~~~st~sn~eq~-- 346 (466) T protein:vir:81 271 DEVNSKHAG--VDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGLAAATYSNYGQA-- 346 (466) T ss_pred HHHHHHhcC--ccccccceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCCCccccccHHHH-- Confidence 221111000 011111123355666666665444555566777788889999999977665321 11222222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeC--CCCCcCHHHHHHH-------HHHH--hccC Q lcl|NC_010808. 383 GLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYN--RNLPKSLIEELKA-------YIDS--GGKI 451 (512) Q Consensus 383 ~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~--~~~p~d~~~~~~~-------~~kl--~g~~ 451 (512) ....+...|..+++.|...++.+-.. .. +...+.+.|+ .-+-.|..+.+++ +..+ +| + T Consensus 347 --------~~~f~~~tl~P~~~~ie~~l~~~L~~-~~-~~~~~~~~f~~~~llr~d~~~r~~~~~~~~~~~~~~~~~g-~ 415 (466) T protein:vir:81 347 --------RRRLADGTAHPLWQNLSGCIGHVMPD-MG-PDVRLWYDADDVPFLREDEKDAADIQKVRAETINTLITAG-Y 415 (466) T ss_pred --------HHHHHHHHHHHHHHHHHHHHHhhcCC-cc-cCcceEEEecchhhhccCHHHHHHHHHHHHHHHHHHHHcC-C Confidence 11223444444444444444332111 11 1112345553 3334455554443 2222 34 3 Q ss_pred ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccC Q lcl|NC_010808. 452 SQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDK 510 (512) Q Consensus 452 s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (512) ....++..+..-+.+ .+...... .............+.++....++++.++ T Consensus 416 t~nE~r~~~~~gd~~-----~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~Gg~~ngn 466 (466) T protein:vir:81 416 EPESVVAAVNSGDLR-----LLKHTGLT---SVQLLPPGVSASASSDTPTSGGADDNGN 466 (466) T ss_pred ChhhccccccCCccc-----cccCCCcc---hhhhcccccccccCCCCcccCCCCcCCC Confidence 444444433221110 00000000 0000111111111111111122222222 No 214 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=96.29 E-value=0.00078 Score=37.62 Aligned_cols=313 Identities=11% Similarity=-0.022 Sum_probs=123.1 Q ss_pred EEEEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccc Q lcl|NC_010808. 150 AYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLT 229 (512) Q Consensus 150 a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~ 229 (512) .++.+|.-.+|...+..+. |-+ .+. +.+| .+...++...+......+. T Consensus 1 v~Eivw~~~~g~~~~~~l~-------~r~---~~~---~~~f----------------~~~~~~~l~~~~~~~~~g~--- 48 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLA-------WRP---PRT---ISRF----------------DVAPDGGLVAIEQWGVFGK--- 48 (355) T ss_pred CeEEEEEeeCCeEEEeeee-------ecC---ccc---eeee----------------eeccCCceeEEEecCCCCC--- Confidence 5555554333322111110 100 000 0001 0011111111111110000 Q ss_pred ccccccccccccccceEee--cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCCh---hh----- Q lcl|NC_010808. 230 PRENGFESHSFERMPITEF--SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDP---DE----- 299 (512) Q Consensus 230 ~~~~~~~~~~~~~vPvv~~--~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~---~~----- 299 (512) ......+.+.|-.++- ..++.|.|.+..+.-..---...+.+++..++.|..|+.+.+|-.+... +. T Consensus 49 ---~~~~lp~~kfi~~~~~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~ 125 (355) T protein:vir:78 49 ---ATVRIPVDRLVVFVNEREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQ 125 (355) T ss_pred ---CcceeccCCEEEEEeCCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHH Confidence 0001111121111111 2356788888876666655677788888899999888888777433221 10 Q ss_pred -hhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccc---cccchHH Q lcl|NC_010808. 300 -VKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNF---SGTQSGE 375 (512) Q Consensus 300 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~---~~n~Sg~ 375 (512) ....++. ...+...+..+...+...+.+.++++++.......+..+++.+.+.|.+.-....++.+.. ++...|. T Consensus 126 ~~~~~~~~-l~~~~~~i~~g~~a~~iip~g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~ 204 (355) T protein:vir:78 126 WLNDQKEE-GLQLAKEFRAGEAAGGYIPHGANFTLTGVQGKLPEMDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGD 204 (355) T ss_pred HHHHHHHH-HHHHHHHhhCCcceeEeecCCceEEEeecCCCcccHHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHH Confidence 0111110 0011011111111222345567888988776666677889999888877654443333221 1111122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hcc-C Q lcl|NC_010808. 376 AMKYKLFGLEQRTKTKEGLFTKGLR-RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGK-I 451 (512) Q Consensus 376 Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~-~ 451 (512) . ...-....++.-.+.+...+. ++++.++.+ +. +. ...-..+.|.. .+.+..+.++.+.++ .|+ + T Consensus 205 v---h~~v~~~~~~aD~~~i~~~ln~~li~~l~~l-N~--~~----~~~~P~~~~~~-~~~~~~~~a~~~~~l~~~G~~~ 273 (355) T protein:vir:78 205 T---FASFFTGSLNAVMKHIADVTQQHVVEDLVDQ-NW--GP----EEPAPRLVPAQ-LGKEQPVTAEAIRALVECGAFT 273 (355) T ss_pred H---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cC--CC----CCCCCEEEecC-cChhHHHHHHHHHHHHhCCCcc Confidence 1 111222333333455555563 355555542 21 11 11123456743 455666778888877 454 5 Q ss_pred ChH----HHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 452 SQT----TLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 452 s~e----t~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) +.+ .+.+.++. ..+.+.-+-+... +........... .+......+.........+..+ T Consensus 274 ~~~~~~~~~~e~~gi-p~p~~~~~~~~~~-~~~~~~~~~~~~-~~~~~~~~~~~a~~~~a~~~~~ 335 (355) T protein:vir:78 274 ADPELEKDLRARYGL-PAPAERDDGADAA-AAKAAGRRRAKR-LPGQRQGAALPSRSPRADPPRR 335 (355) T ss_pred ccHHHHHHHHHHhCC-CCCCCCCcccCCc-cccccccccccc-cCCccccccccccCCCCCChhh Confidence 543 45666654 3221100000000 000000000000 0000000011111111112222 No 215 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=96.28 E-value=0.00079 Score=37.60 Aligned_cols=415 Identities=10% Similarity=-0.044 Sum_probs=181.0 Q ss_pred cccccCCCcCeeecccchhHHhhh----------------cHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 17 RNYLFNDEANVVYTYDGTESDLLQ----------------NINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 17 ~~~~f~~~~~~~~~~~~~~~~~~~----------------~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) -..|.+...+-+-.....+..+.. .+..+...++..... .+..+.+.|+ ++. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~g---d~~~~~~L~e---dm~------ 68 (526) T protein:vir:79 1 MAQIVDVYGNPIRPQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQG---NLQAQAELFM---DME------ 68 (526) T ss_pred CCeeeCCCCCccCccccchhhhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhhCC---CHHHHHHHHH---HHH------ Confidence 022222222211000000000000 012222222211111 1111111111 000 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecCC------chhHHHHHHHHHhcc-ChhHHHHHHHHHHHhCCeE-EE Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDD------DKDVLEAIEAFNDLN-DVESHNRSLGLDLSIYGKA-YE 152 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~------d~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~~G~a-~~ 152 (512) . ......-.+.+....+.+.++.+... +....+.+.+++..- +|......+. ++.-||.+ ++ T Consensus 69 -e--------~D~~i~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-dA~~~G~s~~E 138 (526) T protein:vir:79 69 -E--------RDAHLFAEMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIE 138 (526) T ss_pred -h--------hChHHHHHHHHHHHHHhCCCceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHHHH-hhhhhcceeEE Confidence 0 12344555666666677888777532 223455688888653 5776665554 58889974 56 Q ss_pred EEEECCCCceEEE---EEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccc Q lcl|NC_010808. 153 LMIRNQDDETRLY---KSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLT 229 (512) Q Consensus 153 ~v~~d~~g~~~i~---~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~ 229 (512) .+|.-.+|...+. ..+|.. |- |+..... ..++...... T Consensus 139 i~w~~~~g~~~~~~l~~r~~~~-F~-~~~~~~~--------------------------------~l~~~~~~~~----- 179 (526) T protein:vir:79 139 LEWALQGREWMPLAFHHRPQSW-FQ-LNPEDQN--------------------------------ELRLRDNSPA----- 179 (526) T ss_pred EEEeecCCceeEEEeeeecccc-eE-eccCCCc--------------------------------EEEecCCCCC----- Confidence 6676555544332 222221 11 1111100 0011110000 Q ss_pred ccccccccccccccceEee--cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhcc Q lcl|NC_010808. 230 PRENGFESHSFERMPITEF--SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEAN 307 (512) Q Consensus 230 ~~~~~~~~~~~~~vPvv~~--~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~ 307 (512) +....+++.|-.++- ..++.|.|.+..+.-..=--+..+.+++..++.|+.|+++.+--.+.++++...+... T Consensus 180 ----g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~~~a~~~ek~~L~~a- 254 (526) T protein:vir:79 180 ----GEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTADEEKATLLRA- 254 (526) T ss_pred ----ceeecCCceEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCCHHHHHHHHHH- Confidence 001112222222221 1356788888877666655566888999999999999998763333333333222211 Q ss_pred ccccchhhhhhcccccCCCCCcceeEEee-cCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHH-HHHHHHH Q lcl|NC_010808. 308 VLFLEPTVYENRDTGIETEGSVDGGYIYK-QYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMK-YKLFGLE 385 (512) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~-~~~~~l~ 385 (512) ............+.+.++++++. ......++.+++.+.+.|.+.--.-.++.+...++.+.-|+. ....-.. T Consensus 255 ------v~~i~~da~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~ 328 (526) T protein:vir:79 255 ------VTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNEVRH 328 (526) T ss_pred ------HHHHhcCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhhHHHHHHHHH Confidence 11111222333455677888885 345677899999999999876433333322211111111211 1111122 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHh--cc-CChHHHHHhCC Q lcl|NC_010808. 386 QRTKTKEGLFTKGLR-RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-ISQTTLMSLFS 461 (512) Q Consensus 386 ~k~~~~~~~~~~~l~-~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~--g~-~s~et~~~~~~ 461 (512) ..+..-.+.+...+. ++++.++.+ + .....+. ..-..+.|...-+.|..+.++.+.++. |+ +|.+.+.+.++ T Consensus 329 di~~aDa~~i~~tln~~Li~~l~~~-N--~~~~~~~-~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~g 404 (526) T protein:vir:79 329 DILASDARQLAATLSRDLLWPLLVL-N--RPGSPDV-RRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKLG 404 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh-C--CCCcCCc-cccceEEeCCCCcccHHHHHHHHHHHHhCCCcCCHHHHHHHhC Confidence 233444455566664 466655553 1 1111111 123467888889999999999999884 65 89999999987 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 462 FFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 462 ~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) . ..+++ -+.+........... . ..................+.+.-+ T Consensus 405 i-p~~~~-~e~~l~~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~d 450 (526) T protein:vir:79 405 I-PQPAK-NEPVLRPAAQPAILS-R--QHGQRVAALATIVGPRYGDQQALD 450 (526) T ss_pred C-CCCCC-chhhccccCCccccc-c--ccccccccccccccccCchhhHHH Confidence 5 32221 111110000000000 0 000000000000000000000000 No 216 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=96.26 E-value=0.00081 Score=37.53 Aligned_cols=394 Identities=10% Similarity=0.071 Sum_probs=163.7 Q ss_pred HHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---ccccccccccccc-ceeeecchHHHHHHHHHhhhhccCc Q lcl|NC_010808. 36 SDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKN---LVELTRRKEEYMA-DNRVAHDYASYISDFINGYFLGNPI 111 (512) Q Consensus 36 ~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~---~~~~~~~~~~~~~-~~ri~~n~~~~iv~~~a~~l~g~~~ 111 (512) ....+....+.+.+ +.++-...... +..........-. +.-+...-....|+.+++-+..-|+ T Consensus 1 ~~~~~~~~~~k~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~V~~ci~~ia~~ia~lp~ 67 (409) T protein:vir:96 1 MAKENIVTRIKKKL-------------IDNWIDQSASKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPL 67 (409) T ss_pred CccccchhhhhhHH-------------hhhhhccccccccccccccCccccccchhhHhhhHHHHHHHHHHHHhhhhCce Confidence 00000011111110 11111100000 0000000000000 0011223334445556665555566 Q ss_pred eecCCchhHHHHHHHHHh--cc---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeE Q lcl|NC_010808. 112 QCQDDDKDVLEAIEAFND--LN---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSI 185 (512) Q Consensus 112 ~~~~~d~~~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~ 185 (512) .+--..+.....+.+++. -| .-..+...+..+++.+|.||+++-++..|++ .+..++|..+.++.++.. .. + T Consensus 68 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~-~~-~ 145 (409) T protein:vir:96 68 KMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQS-RE-L 145 (409) T ss_pred EEeecccccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCC-cE-E Confidence 542222222233444443 23 2345567788889999999999999988875 566788888877765432 11 1 Q ss_pred EEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHH Q lcl|NC_010808. 186 AGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLI 265 (512) Q Consensus 186 ~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~li 265 (512) +|...... . . . ..+.++.+.+++... | .+.-.|.|.+..+...+ T Consensus 146 ----~y~~~~~~---g-~--~-~~~~~~evih~r~~~---------------------~----~~~~~G~s~l~~~~~~i 189 (409) T protein:vir:96 146 ----YYSIHAAT---G-N--K-LIVHNMDMLHFKHIV---------------------A----SNMVQGISPIDVLKNTT 189 (409) T ss_pred ----EEEEEcCC---c-e--E-EEEccccEEEeCCCC---------------------C----CCccccccHHHHHHHHH Confidence 11111100 0 0 1 123344444432100 0 01124677776666655 Q ss_pred HHHHHHHHHHHHHHHHhcC-ceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHH Q lcl|NC_010808. 266 DLYDNAESDTANYMSDLND-AMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTE 344 (512) Q Consensus 266 Da~~~~~s~~~~~~~~~~~-~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 344 (512) +....+... .+..++. +-.++......+++..+..++.-.- ...+.......+++.++..++.......+. T Consensus 190 ~~~~~~~~~---~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~-----~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~ 261 (409) T protein:vir:96 190 DFDNAVRTF---NLTEMQKPDSFMLKYGSNVSTEKRQQVLEDFKQ-----YYEENGGILFQEPGVEIEPLPKKYVSEDIV 261 (409) T ss_pred HHHHHHHHH---HHHhcCCCceeEEecCCCCCHHHHHHHHHHHHH-----HhhcCCCeeecCCCceEEEcCCChhHHHHH Confidence 544333211 1222333 2233332223344444443332111 011111222334556666665444444555 Q ss_pred HHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccce Q lcl|NC_010808. 345 AYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNT 424 (512) Q Consensus 345 ~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~ 424 (512) +..+.....|+..-++|....+...+ .+...++ ......+...|..++..|...++..-...... ... T Consensus 262 e~~~~~~~~Ia~~fgVPp~~lg~~~~-~~~s~~e----------~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~-~~g 329 (409) T protein:vir:96 262 ASENLTRERVANVFQLPSIFLNARSN-TNFAKNE----------ELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDR-EKN 329 (409) T ss_pred HHHHHHHHHHHHHhCCCHHHhCCCCC-CCcccHH----------HHHHHHHHHHHHHHHHHHHHHHHhhcCCcccc-cCc Confidence 66667778898888998776653221 1111111 11123334445555555544444321111111 112 Q ss_pred eeEEe--CCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCC Q lcl|NC_010808. 425 VRYVY--NRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQ 500 (512) Q Consensus 425 i~i~f--~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 500 (512) ..+.| ..-+-.|..+.++++.++ +|+++.-.+++.++.-.-+- -+.+. +.................++ T Consensus 330 ~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~g--gD~~~------~~~n~~~~~~~~~~~~~~~g 401 (409) T protein:vir:96 330 RYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEG--GDKPL------ISGDLYPIDTPLELRKSLKG 401 (409) T ss_pred ceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--cceee------ecccccccccchhhcccccC Confidence 33445 344556888889998887 68999988888886422110 00000 00000000000000000111 Q ss_pred CCCCcCcccCC Q lcl|NC_010808. 501 DDDTKDTVDKK 511 (512) Q Consensus 501 ~~~~~~~~~~~ 511 (512) .++. .+++ T Consensus 402 G~~n---~~e~ 409 (409) T protein:vir:96 402 GDKN---VNES 409 (409) T ss_pred CCCC---cCCC Confidence 1111 1111 No 217 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=96.20 E-value=0.00089 Score=37.32 Aligned_cols=413 Identities=11% Similarity=0.097 Sum_probs=174.8 Q ss_pred CCcceeeccccchhhccc---cccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRN---YLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVEL 77 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~---~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~ 77 (512) .--+-+|+-+-+.-.... .-+.-+.|-+. +.-+ .||.|...+-+.. T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------------------~~~l-~~~~~~~F~Gy~~ 115 (695) T protein:vir:36 67 LRLARQFEVDVSNYTPRERRAASYALDFNGTS------------------------------MDAL-SFVTSSGFPGFPT 115 (695) T ss_pred cccceeceecccccCccccchhhhhhcccccc------------------------------cccc-hhhhccCcchHHH Confidence 111223443321111000 00111111110 1111 1222221110000 Q ss_pred cccccccccceeeecchHHHHHHHHHhhhhccC---------------cee----cC-CchhHHHHHHHHHhccChhHHH Q lcl|NC_010808. 78 TRRKEEYMADNRVAHDYASYISDFINGYFLGNP---------------IQC----QD-DDKDVLEAIEAFNDLNDVESHN 137 (512) Q Consensus 78 ~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~---------------~~~----~~-~d~~~~~~l~~~~~~n~~~~~~ 137 (512) -+.- ..++-.+.++.+++..+.-+- ++. .. .+.+..+.|..-++.-++...+ T Consensus 116 -la~l-------aQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V~~~l 187 (695) T protein:vir:36 116 -LVLL-------AQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAV 187 (695) T ss_pred -HHHH-------hhccchhhHHHHHHHHhhcccceecccchhhhhhccccccccccccCchHHHHHHHHHHHHHHHHHHH Confidence 0000 000111111111111111111 111 11 1235566788888888889999 Q ss_pred HHHHHHHHhCCeEEEEEEECCCCc-----------------e-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccC Q lcl|NC_010808. 138 RSLGLDLSIYGKAYELMIRNQDDE-----------------T-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKT 199 (512) Q Consensus 138 ~~~~~~~~~~G~a~~~v~~d~~g~-----------------~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~ 199 (512) .++.+.+-.||.+..++-.+.++. + .+.+++|..+.|-.-+. ..+.. T Consensus 188 ~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~--~dP~s------------- 252 (695) T protein:vir:36 188 RTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNS--INPVA------------- 252 (695) T ss_pred HHHHHhhccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhh--ccchh------------- Confidence 999999999999987776644331 0 14555666655532110 01110 Q ss_pred CcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEee---cCCCCCCcchHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 200 DEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEF---SNNERRKGDYEKVITLIDLYDNAESDTA 276 (512) Q Consensus 200 ~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~---~n~~~g~s~~~~v~~liDa~~~~~s~~~ 276 (512) -.+|-|+.++ +.. .. .. ...-+.|...|+-.. ..+-.|.|....+.+.+++.+++.-... T Consensus 253 -------pdfgkP~~y~--V~G-~k-IH------~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~ 315 (695) T protein:vir:36 253 -------DDFYKPSTWW--MIG-TE-VH------ATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVS 315 (695) T ss_pred -------hccCCCceEE--Eec-eE-Ee------eeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHH Confidence 0111111100 000 00 00 000000111111110 1133588888889999999888877776 Q ss_pred HHHHHhcCceeeeecCC--cCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHH Q lcl|NC_010808. 277 NYMSDLNDAMLLIKGNL--SLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDI 354 (512) Q Consensus 277 ~~~~~~~~~~lv~~g~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i 354 (512) ..+..++...+.. ++. ........ .. .+..... ....+.... -.+ +.+=+|.+.+.+..++...+....+.| T Consensus 316 ~Li~~~~v~~lk~-dla~aL~~g~~~~-l~-~R~eli~-~~Rsn~G~~-llD-k~~Eefeq~stslSGLddVi~qf~q~V 389 (695) T protein:vir:36 316 DIVKQFSVSGILM-DLAQALMPGANVD-LS-MRAELIN-RYRDNRNIL-FLD-KATEEFFQFNTPLSGLDALQAQAQEQM 389 (695) T ss_pred HHHHhhhHHHHHH-HHHHhhcChhHHH-HH-HHHHHHH-HhcCccceE-EEe-cCCcceEEEecccCCHHHHHHHHHHHH Confidence 6665444333211 100 00000000 00 0000000 000111111 112 112235566778899999999999999 Q ss_pred HHHhccccccccccc---ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCC Q lcl|NC_010808. 355 HMFTNTPNMKDDNFS---GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNR 431 (512) Q Consensus 355 ~~~s~~p~~~~~~~~---~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~ 431 (512) ...+++|-.-+...+ =|.||++=...+...+.- ..+..+...|++++.+|.. +..+.. +. +|.+.|+| T Consensus 390 Agaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s--~Qe~~L~p~L~rl~~ii~r--S~~G~i----dp-di~~~fnP 460 (695) T protein:vir:36 390 SAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRA--YQRNALQQLMNDVIVMIQL--SLFGAV----DP-SIKWQWNA 460 (695) T ss_pred HhhhcCchhhhhccCcccccccchhhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHH--HhcCCC----CC-cceEEeCC Confidence 999999976654432 367888765555555433 3366788899888877743 222222 22 57889999 Q ss_pred CCCcCHHHHHHHHHHH---------hccCChHHHHHhC------CCCC--CHHHHHHHHHHHHHHHHHHHHhhcccCCCC Q lcl|NC_010808. 432 NLPKSLIEELKAYIDS---------GGKISQTTLMSLF------SFFQ--DPELEVKKIEEDEKESIKKAQKGIYKDPRD 494 (512) Q Consensus 432 ~~p~d~~~~~~~~~kl---------~g~~s~et~~~~~------~~v~--d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 494 (512) --..+..+.|+.-.|- .|+|+...+..++ ++.. |...+=-...++...- . . .. T Consensus 461 L~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~---~--~-----~~ 530 (695) T protein:vir:36 461 LRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDG---V--L-----TY 530 (695) T ss_pred CCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhh---h--H-----hh Confidence 8888999888874431 4666665555553 1110 1000000000000000 0 0 00 Q ss_pred CCCCCCCCCCcCcccCCC Q lcl|NC_010808. 495 INDDEQDDDTKDTVDKKE 512 (512) Q Consensus 495 ~~~~~~~~~~~~~~~~~e 512 (512) . +...+.++..+.++ T Consensus 531 ~---~~~~~~~~~~~~~~ 545 (695) T protein:vir:36 531 V---QRLAEGGDTGAPGG 545 (695) T ss_pred h---cCcccccccCCCCc Confidence 0 00111111111111 No 218 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=96.12 E-value=0.00098 Score=37.08 Aligned_cols=395 Identities=10% Similarity=-0.005 Sum_probs=164.9 Q ss_pred HHhhhcHHHHHHHHHHHHHHHHH-HHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceec Q lcl|NC_010808. 36 SDLLQNINEVSKYIEHHMDYQRP-RLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQ 114 (512) Q Consensus 36 ~~~~~~~~~l~~~i~~~~~~~~~-r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~ 114 (512) .+. . +........... ....+... -|-... ........ ..-+.+.-....|+.+++-+..-|+.+- T Consensus 1 ~~~----~---r~~~~~~~~~~~~~~~~~~~~-~g~~~s--~~~~~vt~---~~al~~~~v~~~v~~ia~~iA~lp~~~~ 67 (419) T protein:vir:14 1 MFF----S---RQLLSNLGQTQMSAGGWVSAL-LGSSRS--DSGQVVTP---ASALALTVLQNCVTLLAESIAQLPIELY 67 (419) T ss_pred Ccc----c---ccccccccccccCcchhhHHh-hcCCCc--cCCcccch---HHhhccHHHHHHHHHHHHhhccCceEEE Confidence 000 0 000000000000 00000000 000000 00000000 0012223345567777776666676542 Q ss_pred ---CCc--hhHHHHHHHHHh--cc---ChhHHHHHHHHHHHhCCeEEEEEEECCCCceE-EEEEccceeEEEEeCCCCce Q lcl|NC_010808. 115 ---DDD--KDVLEAIEAFND--LN---DVESHNRSLGLDLSIYGKAYELMIRNQDDETR-LYKSDAMSTFVIYDNTIERN 183 (512) Q Consensus 115 ---~~d--~~~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~-i~~~~p~~~~~i~d~~~~~~ 183 (512) .++ ......|..++. -| ....+...+..+.+.+|.+|+++.++.+|++. +..++|..+.+..+.. .. T Consensus 68 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~~~~--~~ 145 (419) T protein:vir:14 68 ERSGEDRKPATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMRGSD--LK 145 (419) T ss_pred EecCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCC--ce Confidence 111 111223555543 23 34456666788899999999999999888864 7778888887665432 11 Q ss_pred eEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHH Q lcl|NC_010808. 184 SIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVIT 263 (512) Q Consensus 184 ~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~ 263 (512) + +|.+... . .+..+.++++.. .+ .+.-.|.|.+..+.. T Consensus 146 ~-----~y~~~~~-----~------~~~~~~i~h~~~----------------------~~----~dg~~G~s~i~~~~~ 183 (419) T protein:vir:14 146 P-----VYRVRGS-----D------PMPQRLVHHVRW----------------------MS----INGYTGLSPVLLHAN 183 (419) T ss_pred E-----EEEEccC-----c------ccchhheeEecC----------------------cC----CCCcccccHHHHHHH Confidence 1 1111000 0 001111111110 00 012347777777776 Q ss_pred HHHHHHHHHHHHHHHHHHhcCceeeeecCCcC----ChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCC Q lcl|NC_010808. 264 LIDLYDNAESDTANYMSDLNDAMLLIKGNLSL----DPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYD 339 (512) Q Consensus 264 liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 339 (512) .++....+..-..+.+.-.+.|-.+++..... +++..+..+..-.-.... ..+.......+.+.++..++.... T Consensus 184 ~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g--~~nag~~~vl~~g~~~~~l~~~~~ 261 (419) T protein:vir:14 184 AIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNAKFGG--SGNAKKVALLQEGMTFRPLSMTNV 261 (419) T ss_pred HHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcC--ccccCCceecCCCceEEEccCChh Confidence 66665555544444555556676666532111 222222222110000000 000011112244555555554333 Q ss_pred HHHHHHHHHHHHHHHHHHhccccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Q lcl|NC_010808. 340 VQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA 418 (512) Q Consensus 340 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~ 418 (512) ...+.+..+.....|+..-++|....+... ++-|+ ++. .....+...|.-.++.|...+..+-.... T Consensus 262 d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~--~E~----------~~~~f~~~~L~P~~~~ie~~l~~kll~~~ 329 (419) T protein:vir:14 262 DAALIDALRLSALDIARIYKIPAHMVNELERATFSN--IEH----------QSLQFVIYTLLPWVKRHEQAKTRDLLLPS 329 (419) T ss_pred hHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccc--HHH----------HHHHHHHHHHHHHHHHHHHHHhhhccCcc Confidence 334455566677888888889876665332 22222 111 11233444555555555444443221111 Q ss_pred ccccceeeEEe--CCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCC Q lcl|NC_010808. 419 NKDFNTVRYVY--NRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRD 494 (512) Q Consensus 419 ~~d~~~i~i~f--~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 494 (512) +.....+.| ..-+-.|..+.++++.++ .|+++.-.++++++.-.-+.- +.. -..... .....+.. T Consensus 330 --~~~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gG--D~~------~~~~n~-~~~~~~~~ 398 (419) T protein:vir:14 330 --ERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGG--DIY------LSPMNM-VDASKPQQ 398 (419) T ss_pred --ccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--Cee------eecccc-cccccccc Confidence 112233444 444567888899998887 789999988888765221100 000 000000 00001111 Q ss_pred CCCCCCCCCCcCcccCCC Q lcl|NC_010808. 495 INDDEQDDDTKDTVDKKE 512 (512) Q Consensus 495 ~~~~~~~~~~~~~~~~~e 512 (512) ....+.++......+.+. T Consensus 399 ~~~~~~~~~~~~~~e~~~ 416 (419) T protein:vir:14 399 LPVGKSEPTKAAIDEIGR 416 (419) T ss_pred ccCCCCCCccccccchhc Confidence 111111111111111111 No 219 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=96.11 E-value=0.00099 Score=37.06 Aligned_cols=384 Identities=10% Similarity=0.027 Sum_probs=156.2 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecC--- Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQD--- 115 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~--- 115 (512) .....||.+.+..-. +..+.+ .+........+.... ..-....-....|+.+++-+..-|+++.. T Consensus 1 mg~~~~~~~~~~~~~--------~~~~~~---~~~~~~~~~~~~~t~-~~~~~~~~v~~cv~~Ia~~ia~~p~~v~~~~~ 68 (403) T protein:vir:10 1 MGFKSWITEKLNPGQ--------RIIRDM---EPVSHRTNRKPFTTG-QAYSKIEILNRTANMVIDSAAECSYTVGDKYN 68 (403) T ss_pred Ccchhhhhhccchhh--------hhhhcc---cccccccCCcccccH-HHHHHHHHHHHHHHHHHHHHhhCceeEeeccc Confidence 222333332221100 000101 111000000000000 00011112223344555555555554421 Q ss_pred ----CchhHHHHHHHHHhc--c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCCceeEE Q lcl|NC_010808. 116 ----DDKDVLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIA 186 (512) Q Consensus 116 ----~d~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~ 186 (512) .+.-....+..++.. | ....+...+..+++.+|.||++.- +. .+..++|..+.+..+.. .. T Consensus 69 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~----~~-~l~~l~~~~~~v~~~~~---~~-- 138 (403) T protein:vir:10 69 IVTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWD----GT-SLYHVPAALMQVEADAN---KF-- 138 (403) T ss_pred ccccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEe----Cc-eeEeecCcceEEEEcCC---ce-- Confidence 111112234455532 3 234566677888889999997652 11 23445554443332211 11 Q ss_pred EEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEee-cCCCCCCcchHHHHHHH Q lcl|NC_010808. 187 GVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEF-SNNERRKGDYEKVITLI 265 (512) Q Consensus 187 ~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~-~n~~~g~s~~~~v~~li 265 (512) +++|... .. + .|..+.+.++.. ..+++. .+...|.|.+..+...+ T Consensus 139 -~~~~~~~-------~~---~-~~~~~eiih~~~----------------------~~~~~~~~~~~~G~s~i~~~~~~i 184 (403) T protein:vir:10 139 -IKKFIFN-------NQ---I-NYRVDEIIFIKD----------------------NSYVCGTNSQISGQSRVATVIDSL 184 (403) T ss_pred -EEEEEec-------Cc---e-eecccceEEecc----------------------cccccCCCCCcccccHHHHHHHHH Confidence 1111000 00 0 011222222210 111111 23345777877777777 Q ss_pred HHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCC--HHHH Q lcl|NC_010808. 266 DLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYD--VQGT 343 (512) Q Consensus 266 Da~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~--~~~~ 343 (512) +....+..-..+.+.-...|-.+++.....+++.....++.-.-... + ..+.......+++.+++.++...+ ...+ T Consensus 185 ~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~-g-~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~ 262 (403) T protein:vir:10 185 EKRSKMLNFKEKFLDNGTVIGLILETDEILNKKLRERKQEELQLDYN-P-STGQSSVLILDGGMKAKPYSQISSFKDLDF 262 (403) T ss_pred HHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhC-C-cccCcceeecCCCceeEEecccCCHHHHHH Confidence 76666665555556656667777765444555544443322110000 0 011111112344545555553222 2334 Q ss_pred HHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccc Q lcl|NC_010808. 344 EAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFN 423 (512) Q Consensus 344 ~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~ 423 (512) .+..+.....|+..-++|....+.. ...+.. ......+...|...++.|...++..-.. T Consensus 263 ~e~~~~~~~~Ia~~fgVPp~~lg~~-~~sn~e-------------~~~~~f~~~tl~P~~~~ie~~l~~~L~~------- 321 (403) T protein:vir:10 263 KEDIEGFNKSICLAFGVPQVLLDGG-NNANIR-------------PNIELFYYMTIIPMLNKLTSSLTFFFGY------- 321 (403) T ss_pred HHHHHHHHHHHHHHhCCCHHHcCCC-CCcCHH-------------HHHHHHHHHHHHHHHHHHHHHHHHhcCc------- Confidence 5556667788888888887766432 111111 1122333444555555555544433211 Q ss_pred eeeEEeCC--CCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCC Q lcl|NC_010808. 424 TVRYVYNR--NLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDE 499 (512) Q Consensus 424 ~i~i~f~~--~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 499 (512) .+.+.+.. .+-.|..+.++++.++ .|+++.-.+++.++.-.=+++...+.. .+ .+..........++. T Consensus 322 ~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~~~~~d~~~------~p--~n~~~~~~~~~~~e~ 393 (403) T protein:vir:10 322 KITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLDDEQMNKIR------IP--ANVAGSATGVSGQEG 393 (403) T ss_pred eeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcccccccc------cc--cccccccccCCCCcC Confidence 12233332 2445777888888776 689999999998865321111111100 00 011111111111111 Q ss_pred CCCCCcCcccCCC Q lcl|NC_010808. 500 QDDDTKDTVDKKE 512 (512) Q Consensus 500 ~~~~~~~~~~~~e 512 (512) +++ ....++| T Consensus 394 ~~~---~~~~~g~ 403 (403) T protein:vir:10 394 GRP---KGSTEGD 403 (403) T ss_pred CCC---CCCcCCC Confidence 122 2222333 No 220 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=96.02 E-value=0.0011 Score=36.78 Aligned_cols=413 Identities=11% Similarity=0.099 Sum_probs=175.6 Q ss_pred CCcceeeccccchhhcccc---ccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNY---LFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVEL 77 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~---~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~ 77 (512) .--+-+|+-+-+.-..... -+.-+.|.+. +.-+ .||.|...+-+.. T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------------------~~~l-~~~~~~~F~Gy~~ 115 (695) T protein:vir:78 67 LRLARQFEVDVSNYTPRERRAASYALDFNGTS------------------------------MDAL-SFVTSSGFPGFPT 115 (695) T ss_pred cccceeceeccccCCccccchhhhhhcccccc------------------------------cccc-hhhhccCcchHHH Confidence 1112234433311110000 0111111110 1111 1222221110000 Q ss_pred cccccccccceeeecchHHHHHHHHHhhhhccC---------------ce----ecC-CchhHHHHHHHHHhccChhHHH Q lcl|NC_010808. 78 TRRKEEYMADNRVAHDYASYISDFINGYFLGNP---------------IQ----CQD-DDKDVLEAIEAFNDLNDVESHN 137 (512) Q Consensus 78 ~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~---------------~~----~~~-~d~~~~~~l~~~~~~n~~~~~~ 137 (512) -+.- ..++..+.++.+++..+.-+- ++ ... .+.+..+.|..-+++-++...+ T Consensus 116 -la~l-------aQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l 187 (695) T protein:vir:78 116 -LVLL-------AQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAV 187 (695) T ss_pred -HHHH-------hhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHH Confidence 0000 000111111222222111111 11 111 1234556788888888889999 Q ss_pred HHHHHHHHhCCeEEEEEEECCCCc-----------------e-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccC Q lcl|NC_010808. 138 RSLGLDLSIYGKAYELMIRNQDDE-----------------T-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKT 199 (512) Q Consensus 138 ~~~~~~~~~~G~a~~~v~~d~~g~-----------------~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~ 199 (512) .++.+.+-.||.+..++-.+.++. + .+.+++|..+.|-.-+. ..+.. T Consensus 188 ~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~--~dP~s------------- 252 (695) T protein:vir:78 188 RTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNS--INPVA------------- 252 (695) T ss_pred HHHHHhhccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhh--ccchh------------- Confidence 999999999999987776644331 0 14555666655532110 01110 Q ss_pred CcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEee---cCCCCCCcchHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 200 DEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEF---SNNERRKGDYEKVITLIDLYDNAESDTA 276 (512) Q Consensus 200 ~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~---~n~~~g~s~~~~v~~liDa~~~~~s~~~ 276 (512) -.+|-|+.++ +.. .. .. ...-+.|...|+-.. ..+-.|.|....+.+.+++.+++.-... T Consensus 253 -------pdfgkP~~y~--V~G-~k-IH------~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~ 315 (695) T protein:vir:78 253 -------DDFYKPSTWW--MIG-TE-VH------ATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVS 315 (695) T ss_pred -------hccCCCceEE--Eec-eE-Ee------eeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHH Confidence 0111111100 000 00 00 000000111111110 1133688888899999999988877777 Q ss_pred HHHHHhcCceeeeecCC--cCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHH Q lcl|NC_010808. 277 NYMSDLNDAMLLIKGNL--SLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDI 354 (512) Q Consensus 277 ~~~~~~~~~~lv~~g~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i 354 (512) ..+..++...+.. ++. ........ .. .+..... ....+.... -.+ +.+=+|.+.+.+..++...+....+.| T Consensus 316 ~Li~~~~v~~lk~-dla~~L~~g~~~~-l~-~R~eli~-~~Rsn~G~~-llD-k~~Eefeq~stslSGLddVi~qf~q~V 389 (695) T protein:vir:78 316 DIVKQFSVSGILM-DLAQALMPGANVD-LS-MRAELIN-RYRDNRNIL-FLD-KATEEFFQFNTPLSGLDALQAQAQEQM 389 (695) T ss_pred HHHHhhhhHHHHH-HHHHhhcChhHHH-HH-HHHHHHH-HhcCccceE-EEe-cCCcceEEEecccCCHHHHHHHHHHHH Confidence 6665544433211 100 00000000 00 0000000 000111111 112 112235566778899999999999999 Q ss_pred HHHhccccccccccc---ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCC Q lcl|NC_010808. 355 HMFTNTPNMKDDNFS---GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNR 431 (512) Q Consensus 355 ~~~s~~p~~~~~~~~---~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~ 431 (512) ...+++|-.-+...+ =|.||++=...+...+.- ..+..+...|++++.+|.. +..+.. +. +|.+.|+| T Consensus 390 Agaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s--~Qe~~L~p~L~rl~~ii~r--S~~G~i----dp-di~~~fnP 460 (695) T protein:vir:78 390 SAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRA--YQRNALQQLMNDVIVMIQL--SLFGAV----DP-SIKWQWNA 460 (695) T ss_pred HhhhcCchhhhhccCCccccccchhhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHH--HhcCCC----CC-cceEEeCC Confidence 999999976654432 367888765555555433 3366788899888877743 222222 22 57889999 Q ss_pred CCCcCHHHHHHHHHHH---------hccCChHHHHHhC------CCCC--CHHHHHHHHHHHHHHHHHHHHhhcccCCCC Q lcl|NC_010808. 432 NLPKSLIEELKAYIDS---------GGKISQTTLMSLF------SFFQ--DPELEVKKIEEDEKESIKKAQKGIYKDPRD 494 (512) Q Consensus 432 ~~p~d~~~~~~~~~kl---------~g~~s~et~~~~~------~~v~--d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 494 (512) --..+..+.|+.-.|- .|+|+...+..++ ++.. |...+=-...++...- .... T Consensus 461 L~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~---~~~~------- 530 (695) T protein:vir:78 461 LRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDG---VLTY------- 530 (695) T ss_pred CCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhh---hHhh------- Confidence 8888899888874431 4666665555553 1210 1000000000000000 0000 Q ss_pred CCCCCCCCCCcCcccCCC Q lcl|NC_010808. 495 INDDEQDDDTKDTVDKKE 512 (512) Q Consensus 495 ~~~~~~~~~~~~~~~~~e 512 (512) . +...+.++..+.++ T Consensus 531 ~---~~~~~~~~~~~~~~ 545 (695) T protein:vir:78 531 V---QRLAEGGDTGAPGG 545 (695) T ss_pred h---cCcccccccCCCCC Confidence 0 00111111111111 No 221 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=95.94 E-value=0.0012 Score=36.53 Aligned_cols=396 Identities=9% Similarity=-0.028 Sum_probs=170.1 Q ss_pred eeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-cc--ccc---ccccccccee------eecch Q lcl|NC_010808. 27 VVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNL-VE--LTR---RKEEYMADNR------VAHDY 94 (512) Q Consensus 27 ~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~-~~--~~~---~~~~~~~~~r------i~~n~ 94 (512) +.|--.. -+.- .+-+ +-.+..++..+.... .. ... ......+... +.+.= T Consensus 1 ~~~~~~~--~~~~--~~~~--------------~~~~~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~al~~~~ 62 (424) T protein:vir:45 1 MLYCWWA--HWLW--PEGG--------------RVLLDALFRSKSLENPSTPITGDAVDTDGLFRADVYVSPETAMKLAA 62 (424) T ss_pred CeeEeee--ceec--Ccch--------------hHHHHhhccccCCCCCccccchhhhhhhccccCCceechHHhhccHH Confidence 2221111 0000 0000 111112222211000 00 000 0000001111 11122 Q ss_pred HHHHHHHHHhhhhccCceec-CCc---hh-HHHHHHHHHh--ccC---hhHHHHHHHHHHHhCCeEEEEEEECCCCce-E Q lcl|NC_010808. 95 ASYISDFINGYFLGNPIQCQ-DDD---KD-VLEAIEAFND--LND---VESHNRSLGLDLSIYGKAYELMIRNQDDET-R 163 (512) Q Consensus 95 ~~~iv~~~a~~l~g~~~~~~-~~d---~~-~~~~l~~~~~--~n~---~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~ 163 (512) ....|+.+++-+.+-|+++- ..+ +. ....+.+++. -|. ...+...+..+++.+|.+|+++-++..|++ . T Consensus 63 v~~cv~~Ia~~iA~lp~~v~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~ 142 (424) T protein:vir:45 63 VYSCIYVLSSSLAQMPLHVMRRHKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVIS 142 (424) T ss_pred HHHHHHHHHHHHhhCceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEE Confidence 33456667776666677641 111 11 1223455542 232 334566788889999999999999888886 4 Q ss_pred EEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccccc Q lcl|NC_010808. 164 LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERM 243 (512) Q Consensus 164 i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 243 (512) +..++|..+.+..++ .+.. |.+... . . ...+.++.+.+++... T Consensus 143 L~~l~~~~v~i~~~~---~~~~-----y~~~~~--~---~---~~~~~~~eVih~r~~~--------------------- 185 (424) T protein:vir:45 143 LDCCMPWETTLMNTG---GRYT-----YGLYNE--Y---G---AFAISPDDMIHIRALG--------------------- 185 (424) T ss_pred EEEecCceEEEEEcC---CeEE-----EEEEec--C---c---eEEECcccEEEecCcC--------------------- Confidence 677788776554322 1111 111100 0 0 0123444444432110 Q ss_pred ceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhccccc Q lcl|NC_010808. 244 PITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGI 323 (512) Q Consensus 244 Pvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (512) .+...|.|.+......|+....+..-..+.+.-.+.|-.+++-....+++..+..+..-.-... +...+..... T Consensus 186 -----~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~-g~~~n~g~~~ 259 (424) T protein:vir:45 186 -----NNQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGLNKESWGWLKDQWQKASQ-ALRRQENKTM 259 (424) T ss_pred -----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhc-cccccCCcee Confidence 0122467777766666655544444444445555667777764434444443333321110000 0001111112 Q ss_pred CCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 324 ETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRR 402 (512) Q Consensus 324 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~ 402 (512) ..+.+.+++-++.......+.+..+.....|...-++|....+.... +-|+. + ......+...|... T Consensus 260 vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~--e----------q~~~~f~~~tL~P~ 327 (424) T protein:vir:45 260 LLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNI--S----------AQAIQFVRYTMMPW 327 (424) T ss_pred EcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccH--H----------HHHHHHHHHHHHHH Confidence 23455566555543333445566677778888888999776654322 21221 1 11223344555555 Q ss_pred HHHHHHHHHhccCCCcccccceeeEEe--CCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHH Q lcl|NC_010808. 403 AKLLETILKNTRSIDANKDFNTVRYVY--NRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEK 478 (512) Q Consensus 403 ~~li~~~l~~~~~~~~~~d~~~i~i~f--~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~ 478 (512) ++.|...++.+-..... ...+..+.| ..-+-.|..+.++++.++ +|+++.-.++++++.-.-+. - T Consensus 328 ~~~ie~~ln~kLl~~~e-~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi~g--g-------- 396 (424) T protein:vir:45 328 VTNWEQELNRRLFTRAE-LAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPVEG--L-------- 396 (424) T ss_pred HHHHHHHHHHhcCChhh-hcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--c-------- Confidence 55555544432211111 011223444 444567888889998887 58999988888886532100 0 Q ss_pred HHHHHHHhhcccCCCCCCCCCCCCCCcCcccC Q lcl|NC_010808. 479 ESIKKAQKGIYKDPRDINDDEQDDDTKDTVDK 510 (512) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (512) + ......... + ..++......++...++ T Consensus 397 D--~~~~~~n~~-~-~~~~~~~~~~~~~~~~~ 424 (424) T protein:vir:45 397 D--EMLVSVNAA-N-PAGDFKPPKNDEGKTNE 424 (424) T ss_pred c--eeeeccccc-c-cccccCCCCCCCCCCCC Confidence 0 000000000 0 01111111111111111 No 222 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=95.89 E-value=0.00075 Score=37.71 Aligned_cols=189 Identities=12% Similarity=0.045 Sum_probs=81.3 Q ss_pred eeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccccccc Q lcl|NC_010808. 286 MLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKD 365 (512) Q Consensus 286 ~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~ 365 (512) ++.++|....-.......+. +.-.... ..+........+ .+=+|-..+.+.+++...+....+.|...+++|-.-+ T Consensus 1 V~k~~~l~~~~~~~~~~~~~-r~~~~~~--~~~~~~~~~ld~-~~e~~e~~~~~lsGl~d~l~~~~~~iaa~s~iP~t~L 76 (201) T protein:vir:10 1 MWKAKGLADLCDDSDGAARL-RLAQVDN--NSGVGQAIGIDA-DSEEYNVLNSDIGGIDTFLSQKFDRIVALSGIHEIIL 76 (201) T ss_pred CccchHHHHHhcCChHHHHH-HHHHHHH--hhhhhhhheeec-CCcceeeeecCcCChHHHHHHHHHHHHhHhcCchhhh Confidence 11111110000000000000 0000000 000000000111 1112555677888999999999999999999996544 Q ss_pred cccc---ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHH Q lcl|NC_010808. 366 DNFS---GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELK 442 (512) Q Consensus 366 ~~~~---~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~ 442 (512) ...+ =|.||..-..-|...+.- ..+..+...|++++.+++. ..++++.|+|-...+..+.|+ T Consensus 77 fG~sp~Glnatge~d~~nyyd~i~~--~Qe~~l~p~le~l~~~~~~-------------~~~~~~~f~pL~~~s~kekAe 141 (201) T protein:vir:10 77 KGKNVGGVSASQNTALETFYGYVDR--KRKAELLPLLEFLLPFIVT-------------EQEWSVEFNPLSQVSDKDKSE 141 (201) T ss_pred cCCCCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHhhcC-------------CCCceEeeCCCCCCCHHHHHH Confidence 3321 245676544333333222 2235677788877765421 125789999999999999888 Q ss_pred HHHHH---------hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCC Q lcl|NC_010808. 443 AYIDS---------GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 511 (512) Q Consensus 443 ~~~kl---------~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (512) ...+. +|++|.+.+...| .+.-.. ...............++++..+..+++ T Consensus 142 i~~~~a~a~~~~~~~g~i~~~e~r~~L-------------~~~~~~-----~~~~~~~~~~~~~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 142 ILEKNVNSVAALIAAGIIDADEARDTL-------------RAISTE-----VKIGEGSIQTEVVINESEDPLDVSANN 201 (201) T ss_pred HHHHHHHHHHHHHHcCCCCHHHHHHHH-------------HhcCCc-----CCCCCCCCCccccccccCCCCCCCCCC Confidence 76553 3556555544433 111000 000000000000000111111111111 No 223 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=95.75 E-value=0.0015 Score=36.03 Aligned_cols=420 Identities=10% Similarity=-0.006 Sum_probs=184.0 Q ss_pred CCc-----ceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Q lcl|NC_010808. 1 MLK-----ANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLV 75 (512) Q Consensus 1 ~~~-----~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~ 75 (512) |.. ++.+.+..-.....-.+-. -...+......-. .+..|...++........++..| ||+- . T Consensus 1 m~~~~d~~g~p~~~~~~~~~~~~~~~~--~~~~~~~~~~~gl---tp~~l~~iL~~a~~gd~~~~~~L--~~dm----~- 68 (512) T protein:vir:19 1 MGRILDISGQPFDFDDEMQSRSDELAM--VMKRTQEHPSSGV---TPNRAAQMLRDAERGDLTAQADL--AFDM----E- 68 (512) T ss_pred CcceeCCCCCccccccccccccchhcc--cchhhccccccCC---CHHHHHHHHHHhhCCCHHHHHHH--HHHH----H- Confidence 221 1112111100000000000 0000000000001 13333444333322222222222 1110 0 Q ss_pred cccccccccccceeeecchHHHHHHHHHhhhhccCceecCC------chhHHHHHHHHHhcc-ChhHHHHHHHHHHHhCC Q lcl|NC_010808. 76 ELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDD------DKDVLEAIEAFNDLN-DVESHNRSLGLDLSIYG 148 (512) Q Consensus 76 ~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~------d~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~~G 148 (512) .......-.+.+....+.+.++.+... +.+..+.+.+++..- +|+..+..+. ++.-|| T Consensus 69 --------------~~D~hi~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~~~~ll-dA~~~G 133 (512) T protein:vir:19 69 --------------EKDTHLFSELSKRRLAIQALEWRIAPARDASAQEKKDADMLNEYLHDAAWFEDALFDAG-DAILKG 133 (512) T ss_pred --------------hhChHHHHHHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcCCCHHHHHHHHH-hhhhhc Confidence 012344555666666677888877532 123445677877543 5777666654 688899 Q ss_pred eE-EEEEEECCCCceE---EEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCc Q lcl|NC_010808. 149 KA-YELMIRNQDDETR---LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTN 224 (512) Q Consensus 149 ~a-~~~v~~d~~g~~~---i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~ 224 (512) .+ ++.+|.-.+|... +...+|... .|+...... .++...... T Consensus 134 ~s~~Ei~w~~~~g~~~~~~~~~r~~~~f--~~~~~~~~~--------------------------------lr~~~~~~~ 179 (512) T protein:vir:19 134 YSMQEIEWGWLGKMRVPVALHHRDPALF--CANPDNLNE--------------------------------LRLRDASYH 179 (512) T ss_pred ceeeeeEeeeeCCceeeeeeeeeccccc--eeccCCCcE--------------------------------EEecCCCCC Confidence 75 5666754444433 223333221 111111000 001010000 Q ss_pred cccccccccccccccccccceEe--ecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhh Q lcl|NC_010808. 225 GLKLTPRENGFESHSFERMPITE--FSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKK 302 (512) Q Consensus 225 ~~~~~~~~~~~~~~~~~~vPvv~--~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~ 302 (512) +....+++.|-.++ ...++.|.|.+..+....=.-+..+.+++..++.|+.|+++.+-..+...++... T Consensus 180 ---------G~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~ 250 (512) T protein:vir:19 180 ---------GLELQPFGWFMHRAKSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYPTGSTNREKAT 250 (512) T ss_pred ---------ceeecCCceEEEeccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecCCCCCHHHHHH Confidence 00111222222222 1235678899988777777777888999999999999999876333333333322 Q ss_pred hhhccccccchhhhhhcccccCCCCCcceeEEeec-CCHHHHHHHHHHHHHHHHHHhccccccccc--ccccchHHHHHH Q lcl|NC_010808. 303 QKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQ-YDVQGTEAYKDRLNSDIHMFTNTPNMKDDN--FSGTQSGEAMKY 379 (512) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~~~n~Sg~Ai~~ 379 (512) +... ............+.+.++++++.. .....++.+++.+.+.|.+.--.-.++.+. .+++..|. + T Consensus 251 L~~a-------l~~~~~~a~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~iLGqtlTs~~g~~Gs~a~~~-v-- 320 (512) T protein:vir:19 251 LMQA-------VMDIGRRAGGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAILGGTLTTEAGDKGARSLGE-V-- 320 (512) T ss_pred HHHH-------HHHHhhCcEEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHH-H-- Confidence 2211 111112222333556778888853 455668999999999888754332233332 11121111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH-hcc-CChHHH Q lcl|NC_010808. 380 KLFGLEQRTKTKEGLFTKGLR-RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS-GGK-ISQTTL 456 (512) Q Consensus 380 ~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl-~g~-~s~et~ 456 (512) ...-....+..-.+.+...+. ++++.++.+ + .+...+ ...-..+.|...-+.|..+.++.+.++ .|+ +|.+.+ T Consensus 321 h~ev~~di~~aDa~~i~~tln~~li~~l~~~-N--~~~~~~-~~~~p~~~f~~~e~eDl~~~a~~~~~l~~G~~i~~~~i 396 (512) T protein:vir:19 321 HDEVRREIRNADVGQLARSINRDLIYPLLAL-N--SDSTID-INRLPGIVFDTSEAGDITALSDAIPKLAAGMRIPVSWI 396 (512) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-C--CCCCCC-ccccceEEecCCChhhHHHHHHHHHHHhcCCCCCHHHH Confidence 111222333444455666663 456555543 2 111111 112346778888899999999988876 465 899999 Q ss_pred HHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 457 MSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 457 ~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) .+.++. ..++.+-.-+..... .. . ...........+.....+..|... T Consensus 397 ~e~~Gi-p~~~~~e~~~~~~~~-----~~-~-~~~~~~~~~~~~~~~~~~~~d~~~ 444 (512) T protein:vir:19 397 QEKLHI-PQPVGDEAVFTIQPV-----VP-D-NGSQKEAALSAEDIPQEDDIDRMG 444 (512) T ss_pred HHHhCC-CCCCCccccccCCCc-----cc-c-ccccccccccccCCCchhhHhHHh Confidence 999874 322211000000000 00 0 000000000000011111111110 No 224 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=95.71 E-value=0.0016 Score=35.93 Aligned_cols=431 Identities=10% Similarity=0.024 Sum_probs=185.6 Q ss_pred cccchhHHhhhcHHHHHHH---HHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhh Q lcl|NC_010808. 30 TYDGTESDLLQNINEVSKY---IEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF 106 (512) Q Consensus 30 ~~~~~~~~~~~~~~~l~~~---i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l 106 (512) +-++.++.+.-.-..+.+. +...+..-..+++.+.+|..-. .... ....+...|+--+-+...++.+++-| T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~-----~~~~-~~~~~~~~~~~dstg~~a~~~LAa~l 74 (516) T protein:vir:10 1 MKQSTDLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPY-----LMND-KGDNETSQNGWQGVGAQATNHLANKL 74 (516) T ss_pred CCchhhHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhccc-----ccCC-CCCcccccccccchHHHHHHHHHHHH Confidence 2233333332112233333 3333333345566666665431 1111 11112223555677778888888877 Q ss_pred hcc--Cc-----eecCCch-------------hHH-------HHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCC Q lcl|NC_010808. 107 LGN--PI-----QCQDDDK-------------DVL-------EAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD 159 (512) Q Consensus 107 ~g~--~~-----~~~~~d~-------------~~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~ 159 (512) .+- |+ ++...+. ++. ..+...+..++|.....++.++...+|.+. +|.|++ T Consensus 75 ~~~ltpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--l~~d~~ 152 (516) T protein:vir:10 75 AQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCM--LYKPSK 152 (516) T ss_pred HhhhcCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEe--EEecCC Confidence 652 22 2222221 122 234445667899999999999999999986 455777 Q ss_pred CceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeec------c--------CCcceEEEEEEEc-----CCcEEEEEe Q lcl|NC_010808. 160 DETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID------K--------TDEDEVFTVDLFT-----SHGVYRYLT 220 (512) Q Consensus 160 g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~------~--------~~~~~~~~~~~yt-----~~~~~~~~~ 220 (512) +.++ .+ |..-|.+..+. .+++...+|..+..... . ...+....+++|+ ++..+.+.. T Consensus 153 ~~~~--~~-pl~~y~v~~d~-~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~~~~~~~~~~~ 228 (516) T protein:vir:10 153 GAIS--AI-PMHHYVVNRDT-NGDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKYLGEGFWELKQ 228 (516) T ss_pred CCeE--EE-EcCeEEEeeCC-CCCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEecCCCceEEEE Confidence 6654 33 33345554443 34555555443211100 0 0001112233332 222121111 Q ss_pred cCCccccccccccccccccccccceEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcC Q lcl|NC_010808. 221 SRTNGLKLTPRENGFESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSL 295 (512) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~ 295 (512) ...+ ... ......+|..+|++.+ ....+|+|..+...+-+..++.+.-...........|.+.+.-.... T Consensus 229 ~~d~-~~~----~~~s~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~ 303 (516) T protein:vir:10 229 SADD-IPV----GKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQT 303 (516) T ss_pred eeCc-eee----ccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccCccccc Confidence 1111 111 1112223555676554 24568999999999988888887777776666666666544211111 Q ss_pred ChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhcccccccccccccch Q lcl|NC_010808. 296 DPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS 373 (512) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~S 373 (512) ....+. .++. ..+..+...+++.+. +..+.......++.++..|...-..-.+.. ..+...+ T Consensus 304 ~~~~l~---~~~~------------g~~~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~-rd~~rvT 367 (516) T protein:vir:10 304 DVDHFV---NSGT------------GEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTR-RDAERVT 367 (516) T ss_pred chhhhc---cCCC------------ceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhhhhc-cCCcccc Confidence 111111 1110 011112222333332 334566667777777777754332211111 1123457 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH----HHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHH-- Q lcl|NC_010808. 374 GEAMKYKLFGLEQRTKTKEGLFTKGLRRRA-KLLETI----LKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYID-- 446 (512) Q Consensus 374 g~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~-~li~~~----l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~k-- 446 (512) |..+... ..+++..++..+.++- +++.-+ +.... ...+.....++ .. .+.+....++-+.. T Consensus 368 AtEV~~r-------~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~~-p~~P~~lv~~~--~v--~~i~~L~raq~~~~i~ 435 (516) T protein:vir:10 368 AVEIQRD-------ALEIEQNMGGVYSLFATTMQSPVAMWGLLEAG-DSFTSDLVDPV--II--TGIEALGRMAELDKLA 435 (516) T ss_pred HHHHHHH-------HHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhC-CCCChhhcCcc--ee--hhHHHHHHHHHHHHHH Confidence 7766543 4555556666555432 111111 11111 11111121222 11 12222222222111 Q ss_pred --------HhccCC-----------hHHHHHhCCC---CCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCC Q lcl|NC_010808. 447 --------SGGKIS-----------QTTLMSLFSF---FQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDT 504 (512) Q Consensus 447 --------l~g~~s-----------~et~~~~~~~---v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (512) ++++-+ .+.+...++. +-..++|++.+.+++++.....+..... ....+..-.++- T Consensus 436 ~~~q~i~~~~q~~p~v~d~id~d~~~~~~a~~~gvp~~~irs~eev~~~r~~~~~~q~~~~~~~~~--~~~~~~~~~~~~ 513 (516) T protein:vir:10 436 NFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMEQEQEAQMQAQQAQMLEEGV--AKAVPGVIQQEL 513 (516) T ss_pred HHHHHHHHHhcCChHHHhhcCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHh--hhcccchhhhhh Confidence 122211 1222333321 1134567777777765544433322211 112222223333 Q ss_pred cCc Q lcl|NC_010808. 505 KDT 507 (512) Q Consensus 505 ~~~ 507 (512) ++. T Consensus 514 ~~~ 516 (516) T protein:vir:10 514 KEA 516 (516) T ss_pred hcC Confidence 333 No 225 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=95.69 E-value=0.0016 Score=35.89 Aligned_cols=430 Identities=11% Similarity=0.024 Sum_probs=185.6 Q ss_pred ccchhHHhhhcHHHHHHHHHHHHHHH---HHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhh Q lcl|NC_010808. 31 YDGTESDLLQNINEVSKYIEHHMDYQ---RPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFL 107 (512) Q Consensus 31 ~~~~~~~~~~~~~~l~~~i~~~~~~~---~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~ 107 (512) .+++.....-.-..|.+....-+..+ ..+++.+.+|..-. .... ....+...|+--+-+...++.+++.|. T Consensus 1 ~~~~~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~-----~~~~-~~~~~~~~~~~dstg~~a~~~LAa~l~ 74 (515) T protein:vir:70 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY-----LMNN-KGDNETSQNGWQGVGAQATNHLANKLA 74 (515) T ss_pred CcchhhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccc-----ccCC-CCCcccccccccchHHHHHHHHHHHHH Confidence 22222222112233333333333333 34566666665431 1111 111222234555677778888888776 Q ss_pred cc--Cc-----eecCCch-------------h-------HHHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCC Q lcl|NC_010808. 108 GN--PI-----QCQDDDK-------------D-------VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (512) Q Consensus 108 g~--~~-----~~~~~d~-------------~-------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g 160 (512) +- |+ ++...+. + ....+...+..++|.....++.++...+|.+.+++ |+++ T Consensus 75 ~~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~--d~~~ 152 (515) T protein:vir:70 75 QVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK--PSKG 152 (515) T ss_pred HhhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEE--eCCC Confidence 52 22 2222111 1 11234444667899999999999999999986554 6666 Q ss_pred ceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeec--------------cCCcceEEEEEEE-----cCCcEEEEEec Q lcl|NC_010808. 161 ETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID--------------KTDEDEVFTVDLF-----TSHGVYRYLTS 221 (512) Q Consensus 161 ~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~--------------~~~~~~~~~~~~y-----t~~~~~~~~~~ 221 (512) .++ +++ ..-|.+.-+. .+++...+|.++..... ....+....+++| .++..+.+... T Consensus 153 ~~~--~~p-l~~y~v~~d~-~G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~~~e 228 (515) T protein:vir:70 153 AMS--AVP-MHHYVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQS 228 (515) T ss_pred CeE--EEE-cCeEEEeeCC-CcCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecCCCceEEEEe Confidence 544 333 3334554433 35555555543322100 0000111122333 22322211111 Q ss_pred CCccccccccccccccccccccceEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCC Q lcl|NC_010808. 222 RTNGLKLTPRENGFESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLD 296 (512) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~ 296 (512) ..+. .+ ......+|..+|++.+ ..+.+|+|..+...+-+..+|.+.-...........|.+.+.-..... T Consensus 229 ~d~~-~~----~~es~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~ 303 (515) T protein:vir:70 229 ADDI-PV----GKESRIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTD 303 (515) T ss_pred cCce-ee----ccccccccccCCceeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccc Confidence 1111 11 1112223455666554 245689999999999999999888777777777777776553111111 Q ss_pred hhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchH Q lcl|NC_010808. 297 PDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSG 374 (512) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg 374 (512) ...+. .++ ...+..+...+++.+. +..+.......++.++..|...-..-... ...+...+| T Consensus 304 ~~~l~---~~~------------~g~iv~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~-~rd~~rvTA 367 (515) T protein:vir:70 304 VDHFV---NSG------------TGEVITGVAEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMT-RRDAERVTA 367 (515) T ss_pred hhhcc---ccC------------CceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhh-ccCCccccH Confidence 11111 011 0111112222333332 33456667777777777775533221111 111234577 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-H----HHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHH---HHHH Q lcl|NC_010808. 375 EAMKYKLFGLEQRTKTKEGLFTKGLRRRAK-L----LETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELK---AYID 446 (512) Q Consensus 375 ~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~-l----i~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~---~~~k 446 (512) ..+... ..+++..++..+.++-. + +...+.... ...+.+ .+.+.+.. +.+.+..++ .+.. T Consensus 368 tEV~~r-------~~E~~~~LGpv~srL~~Ell~Pli~r~~~~~~-p~~P~~--~v~~~~vs--~l~~L~r~q~~~~i~~ 435 (515) T protein:vir:70 368 VEIQRD-------ALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAG-DSFTSE--LVDPVIVT--GIEALGRMAELDKLAN 435 (515) T ss_pred HHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhC-CCCChh--hcccceeh--hHHHHHHHHHHHHHHH Confidence 666543 45555566665555321 1 111111111 111111 23333322 222222222 1111 Q ss_pred H-------hcc-------CCh----HHHHHhCCC---CCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCc Q lcl|NC_010808. 447 S-------GGK-------ISQ----TTLMSLFSF---FQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTK 505 (512) Q Consensus 447 l-------~g~-------~s~----et~~~~~~~---v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (512) . +++ +-. +.+...++. +-..++|++.+.++++.....++.. ........+.-.++.+ T Consensus 436 ~~q~i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~~~rs~eev~~~r~q~~~~~~~~~~~--~~~~~a~~~~~~~~~~ 513 (515) T protein:vir:70 436 FAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLN--EGVAKAVPGVIQQEMK 513 (515) T ss_pred HHHHHHHHhccChhHHhhCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHH--Hhhhhhcccchhhhhc Confidence 1 111 111 122222221 1124567777776655543322211 1111122222222222 Q ss_pred Cc Q lcl|NC_010808. 506 DT 507 (512) Q Consensus 506 ~~ 507 (512) +. T Consensus 514 ~~ 515 (515) T protein:vir:70 514 EG 515 (515) T ss_pred cC Confidence 22 No 226 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=95.51 E-value=0.0019 Score=35.46 Aligned_cols=395 Identities=12% Similarity=0.050 Sum_probs=166.8 Q ss_pred hhcHHHHHHHHHHHHH---HHHHHHHHHH-----------HHhccccc-cccc----ccccccccccceeeecchHHHHH Q lcl|NC_010808. 39 LQNINEVSKYIEHHMD---YQRPRLKVLS-----------DYYEGKTK-NLVE----LTRRKEEYMADNRVAHDYASYIS 99 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~---~~~~r~~~~~-----------~yy~G~~~-~~~~----~~~~~~~~~~~~ri~~n~~~~iv 99 (512) .. |.++++.... ...++.+--. +.+.|-.+ .+.. ....-.......-+.+.-..-.| T Consensus 1 Mg----l~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~ci 76 (431) T protein:vir:10 1 MG----LFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRETRALRNMAVLRCV 76 (431) T ss_pred Cc----chhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcceechhhhhccHHHHHHH Confidence 11 1111111100 0011110000 00111000 0000 00000000000001122334456 Q ss_pred HHHHhhhhccCcee-cCCc---hhHHHHHHHHHhc--cC---hhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccc Q lcl|NC_010808. 100 DFINGYFLGNPIQC-QDDD---KDVLEAIEAFNDL--ND---VESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAM 170 (512) Q Consensus 100 ~~~a~~l~g~~~~~-~~~d---~~~~~~l~~~~~~--n~---~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~ 170 (512) +.+++-+..-|+.+ ..++ ......+..++.. |. -..+...+..+++.+|.+|+++.++....+.+..++|. T Consensus 77 ~~Ia~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~g~~~~L~pl~~~ 156 (431) T protein:vir:10 77 TLISGTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSGNRPIRLIPMDRG 156 (431) T ss_pred HHHHHhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCceEEEEEEcCc Confidence 66666666667754 2111 1122345555432 32 33456678889999999999998875333456778888 Q ss_pred eeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecC Q lcl|NC_010808. 171 STFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSN 250 (512) Q Consensus 171 ~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n 250 (512) .+.+..++. ..+. |......+ . . ..+.+..|.+++... .+ T Consensus 157 ~v~~~~~~~--~~~~-----y~~~~~~g---~---~-~~~~~~dViHir~~~--------------------------~d 196 (431) T protein:vir:10 157 SAKGRLTST--WQIV-----YDYTTPTG---D---K-IELPAREVFHLRDLS--------------------------ID 196 (431) T ss_pred eeEEEEcCC--CeEE-----EEEEeCCc---e---E-EEEchhhEEEecCcC--------------------------CC Confidence 887766542 2211 11111110 0 0 112334444332110 01 Q ss_pred CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcc Q lcl|NC_010808. 251 NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVD 330 (512) Q Consensus 251 ~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (512) ...|.|.+.-+...+........-..+.+.-.+.|-.+++.....+++..+..++.-.-.... ..+.......+++.+ T Consensus 197 g~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g--~~n~g~~~vl~~g~~ 274 (431) T protein:vir:10 197 GVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVPKELSDNAYGRMKASVQENHTG--SENAGSWMLLEEGAT 274 (431) T ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHHHHHHHHHhcC--ccccCCceecCCCce Confidence 224667776666555544444444444445556676666544444444444433221111100 011111122345556 Q ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 331 GGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETIL 410 (512) Q Consensus 331 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l 410 (512) ++.++.......+.+..+.....|+..-++|....+...+ .++..++.. ....+...|..+++.|...+ T Consensus 275 ~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~-~t~sn~eq~----------~~~f~~~tL~P~~~~ie~~l 343 (431) T protein:vir:10 275 AKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDT-SWGSGIEQL----------AIFFIQYGLSHWFVSWEQAA 343 (431) T ss_pred EEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCC-CccccHHHH----------HHHHHHHHHHHHHHHHHHHH Confidence 6666554444455556666778899988999776654321 122222211 11223334444444444444 Q ss_pred HhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--h----ccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 411 KNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--G----GKISQTTLMSLFSF--FQDPELEVKKIEEDEKESIK 482 (512) Q Consensus 411 ~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~----g~~s~et~~~~~~~--v~d~~~E~~ri~~E~~~~~~ 482 (512) +.+-..........+++.+..-+-.|..+.++.+.++ . |+++.-.++++++. ++++... ++ T Consensus 344 n~~Ll~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~~~gD--~~--------- 412 (431) T protein:vir:10 344 ARAFLPEKMLGQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQNEVREMLDLPRADDPVAD--QL--------- 412 (431) T ss_pred HhhccChhhcCCceEEEechhhhccCHHHHHHHHHHHHhcccccCccCHHHHHHHhCCCCCCCcccc--ce--------- Confidence 3321111111111234444444667888888888776 2 35888888888754 4332221 00 Q ss_pred HHHhhcccCCCCCCCCCCCCCC Q lcl|NC_010808. 483 KAQKGIYKDPRDINDDEQDDDT 504 (512) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~ 504 (512) . ......+.+. .++.+..+ T Consensus 413 -~-~p~n~~~~~~-~~~~p~~~ 431 (431) T protein:vir:10 413 -R-NPMTQKQKGS-GDEPPATT 431 (431) T ss_pred -e-cccccccCCC-CCCCCCCC Confidence 0 0111111111 11111112 No 227 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=95.43 E-value=0.0021 Score=35.28 Aligned_cols=428 Identities=10% Similarity=0.044 Sum_probs=174.9 Q ss_pred cCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHH---HHHHHHHHHHhcccccccccccccccccccceeeecchHHH Q lcl|NC_010808. 21 FNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQ---RPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASY 97 (512) Q Consensus 21 f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~---~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~ 97 (512) ++. .....-..+.+....-+.++ ..+++.+.+|..- .... .........|+.-+-+.. T Consensus 1 ~~~-------------~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP-----~~~~-~~~~~~~~~~~~dstg~~ 61 (517) T protein:vir:10 1 MDM-------------RFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLP-----YLMA-DVNDDLSSQNAWQDDGAS 61 (517) T ss_pred Ccc-------------cccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcc-----cccc-CCCCCccccccccchHHH Confidence 111 11111223333333333333 4466666666533 1111 111222334566677788 Q ss_pred HHHHHHhhhhcc--Cc-----eecCCch-------------hH-------HHHHHHHHhccChhHHHHHHHHHHHhCCeE Q lcl|NC_010808. 98 ISDFINGYFLGN--PI-----QCQDDDK-------------DV-------LEAIEAFNDLNDVESHNRSLGLDLSIYGKA 150 (512) Q Consensus 98 iv~~~a~~l~g~--~~-----~~~~~d~-------------~~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a 150 (512) .++++++.|.+- |+ ++...+. ++ ...+...+..++|.....++.++...+|.+ T Consensus 62 a~~~LAa~l~~~ltpp~~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a 141 (517) T protein:vir:10 62 ATNFLSNKLSQVLFPAQRSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNV 141 (517) T ss_pred HHHHHHHHHHHhhcCCCCccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeE Confidence 888888887653 22 2222221 11 223444566789999999999999999998 Q ss_pred EEEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeec-----c---------CCcceEEEEEEEc----- Q lcl|NC_010808. 151 YELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID-----K---------TDEDEVFTVDLFT----- 211 (512) Q Consensus 151 ~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~-----~---------~~~~~~~~~~~yt----- 211 (512) .++ .++ +...++.++- .-|.+..+. .+++...+|..+..... + ...+....+++|+ T Consensus 142 ~ly--~~~-~~~~~~~~pl-~~y~v~~d~-~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~ 216 (517) T protein:vir:10 142 MMY--HPD-KTSPIQAVPL-HHYCVRRDN-NGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRT 216 (517) T ss_pred EEE--EeC-CCCcEEEEEc-CeEEEeeCC-CcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEEe Confidence 654 443 3334455433 334444333 34455555443221100 0 0001112233333 Q ss_pred CCcEEEEEecCCccccccccccccccccccccceEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_010808. 212 SHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAM 286 (512) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~ 286 (512) ++..+.+.....+ .. .......++..+|++.+ ....+|+|..+...+-+..++.+.-...........|. T Consensus 217 ~~~~~~~~~~~d~-~~----~~~~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~ 291 (517) T protein:vir:10 217 KDGKYLIRQSADD-VP----VGKESTVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVK 291 (517) T ss_pred CCCceEEEEEeCc-ee----eccccccccccCCeeeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCC Confidence 2221111111100 00 11112223456776654 24568999999999999888887666666555555555 Q ss_pred eeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhcccccc Q lcl|NC_010808. 287 LLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMFTNTPNMK 364 (512) Q Consensus 287 lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~ 364 (512) +.+.-........+. .++. ..+..+...++..+. ...+.......++.++..|...-..-.+. T Consensus 292 ~lv~~~~~~~~~~l~---~~~~------------g~~~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~ 356 (517) T protein:vir:10 292 YLVKPGSYTDINQFV---EGGS------------GAVLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEAMT 356 (517) T ss_pred cccCcccccchhhcc---CCCc------------cccccCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhh Confidence 443211111111111 1110 001111122333332 33355666677777776665543322211 Q ss_pred cccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHhccCCCcccccceeeEEeCCCCCc- Q lcl|NC_010808. 365 DDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRR--------RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPK- 435 (512) Q Consensus 365 ~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~--------~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~- 435 (512) ...+...||..+... ..++...++..+.+ ++..++..+..... ...+.+....++.. T Consensus 357 -~~~~~rvTAtEV~~r-------~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l~------~~~v~~~~~s~la~l 422 (517) T protein:vir:10 357 -RRDAERVTAYEIQRD-------AMLVEQSLGGVYSLFATTFQGPLARWFMNGISSILT------SKNVSPTILTGIEAL 422 (517) T ss_pred -ccCCccccHHHHHHH-------HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhhhcC------CCCccceeeccHHHH Confidence 111234577666543 44445555555444 22222222221111 11233433222211 Q ss_pred CHHHHHHHHHHH-------hcc-------CChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHHhh--ccc-C Q lcl|NC_010808. 436 SLIEELKAYIDS-------GGK-------ISQTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQKG--IYK-D 491 (512) Q Consensus 436 d~~~~~~~~~kl-------~g~-------~s~et~~~~~---~~v~----d~~~E~~ri~~E~~~~~~~~~~~--~~~-~ 491 (512) .....++.+... +.+ +-...++..+ -+++ -.++|+++.+++++......+.. ... . T Consensus 423 ~r~~~~~~i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~~~~~~~~~~~~~~~~~ag~~~ 502 (517) T protein:vir:10 423 GRMAELDKLGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQISANFPFFKTQDELNAEAQAQQEQEATKYAAEQAGKAI 502 (517) T ss_pred HHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHhCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111112222211 111 1112222222 1122 12456655555544322221111 000 0 Q ss_pred CCCCCCCCCCCCCcC Q lcl|NC_010808. 492 PRDINDDEQDDDTKD 506 (512) Q Consensus 492 ~~~~~~~~~~~~~~~ 506 (512) +..........+++. T Consensus 503 ~~~~~~~~~~~~~~~ 517 (517) T protein:vir:10 503 PDMVKNGQINPQGGQ 517 (517) T ss_pred HHHHhCCCCCCCCCC Confidence 011111111122222 No 228 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=95.35 E-value=0.0022 Score=35.10 Aligned_cols=452 Identities=10% Similarity=-0.002 Sum_probs=167.6 Q ss_pred cchhhccccccCCCcCeeecccc-hhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDG-TESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR 89 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~-~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~r 89 (512) |.-+.. ...+-++....+ .+.++.+ .....+.+ -.++|.++.-+ .+...+..- .... T Consensus 1 ~~~~~~-----~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~----~~~~~~~~~~~--~p~~~~~~L-~~~~ 58 (651) T protein:vir:99 1 MTDTTG-----ETQETKVHVEGLGGEADLAK----------SPNSTQIP----DHRIQSHNVGV--NPPYNPDRL-AAFL 58 (651) T ss_pred CCCccc-----eeeeeEEEeecccccccccc----------cccccccc----hhhhcccCCCC--CCCCCHHHH-HHHH Confidence 110000 000111111100 0000000 00011111 11344443322 122211111 1111 Q ss_pred eecchHHHHHHHHHhhhhccCceecC------C--chhHHHHHHHHHhc---------------cChhHHHHHHHHHHHh Q lcl|NC_010808. 90 VAHDYASYISDFINGYFLGNPIQCQD------D--DKDVLEAIEAFNDL---------------NDVESHNRSLGLDLSI 146 (512) Q Consensus 90 i~~n~~~~iv~~~a~~l~g~~~~~~~------~--d~~~~~~l~~~~~~---------------n~~~~~~~~~~~~~~~ 146 (512) -..++.+..|+..+..+.|-++.+.. + +.+-.+.++.+|.. ..+......+..+... T Consensus 59 e~~~~~~~~i~~~~~~iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~ 138 (651) T protein:vir:99 59 ELNETLATGIRKKSRYEVGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHG 138 (651) T ss_pred hcChHHHHHHHHHhhhhhccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHH Confidence 23689999999999999998876532 1 12223345555433 1344566677778888 Q ss_pred CCeEEEEEEECCCCce-EEEEEccceeEEEEeCCC--------------CceeEE--EEEE---------eeeeeeccCC Q lcl|NC_010808. 147 YGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTI--------------ERNSIA--GVRY---------LRTKPIDKTD 200 (512) Q Consensus 147 ~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~--------------~~~~~~--~v~~---------~~~~~~~~~~ 200 (512) +|-+|+-+..+..|.+ .+..++|..+ -+..... ...+.. +.++ |-....+... T Consensus 139 tGna~ieiIrn~~g~pv~L~~lp~~~~-Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~ 217 (651) T protein:vir:99 139 VGWLALEMLTDIEGRPVGLAYVPARTV-RVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYR 217 (651) T ss_pred HhhHhhhhhhcCccchhhhhhcChhhe-eeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeecccc Confidence 9988887777665543 1222222211 1100000 000000 0000 0000000000 Q ss_pred c--------ceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccc---eEeecCC-----CCCCcchHHHHHH Q lcl|NC_010808. 201 E--------DEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMP---ITEFSNN-----ERRKGDYEKVITL 264 (512) Q Consensus 201 ~--------~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP---vv~~~n~-----~~g~s~~~~v~~l 264 (512) . ...... +|..+.............. .............+| |++++.. ..|.|.+..+... T Consensus 218 ~~~~~~~~~~~~v~~-~~~~d~~~~~~~~~~~~~~--g~~~~~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~ 294 (651) T protein:vir:99 218 GQEVVIDESGDEPTI-RYREDEESEREPIFVDRET--GDVTTGDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRT 294 (651) T ss_pred ceeeeeccCCcceeE-EeccCcceeeeeeccccee--eeEEEcCCCceeEecccceEEecCCCCCCCcccccHHHHHHHH Confidence 0 000000 0011100000000000000 000000000111122 5565432 2577878777766 Q ss_pred HHHHHHHHHHHHHHHHHhcCceeeeec-CCcCChhhhhhhhhccccccchhhhhh-----cccccCCCCCcceeEEeecC Q lcl|NC_010808. 265 IDLYDNAESDTANYMSDLNDAMLLIKG-NLSLDPDEVKKQKEANVLFLEPTVYEN-----RDTGIETEGSVDGGYIYKQY 338 (512) Q Consensus 265 iDa~~~~~s~~~~~~~~~~~~~lv~~g-~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~l~~~~ 338 (512) +.....+..-..+.+.-.+.|-.++.- ....+++....++..-.-.. .+.... .........+..++|..-.. T Consensus 295 i~~a~~a~~~~~~~f~NG~~p~gil~~~~~~ls~e~~~~lr~~~~~~~-~nagk~~vL~~~~~~~~~~~~~g~~~~pls~ 373 (651) T protein:vir:99 295 ISADEAAKDYNRDFFDNDTIPRMVIKVTGGELSEESKRDLRQMLNGLR-EESHRAVVLEVEKFQSQLDEDVEIELEPMGQ 373 (651) T ss_pred HHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHHh-ccCCceEEeecccccccccccCCceEEEcCc Confidence 665555554445555555666666641 11234444333332110000 000000 00000011122333433221 Q ss_pred ---CHHHHHHHHHHHHHHHHHHhccccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010808. 339 ---DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTR 414 (512) Q Consensus 339 ---~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~ 414 (512) ....+.+..+.....|...-++|....+... ++-|. ++. .....+...|+.+++.|...++..- T Consensus 374 ~~~~D~qfle~r~~~~~eIa~afgVPp~~lG~~~~~~~sn--~E~----------~~~~f~~~tL~P~~~~ie~eln~kL 441 (651) T protein:vir:99 374 GISEEMDFRQFREKNEHEIAKVLEVPPVKIGVTDSANRSN--SDQ----------QDKDFALEVIQPEQHTFAEWLYQII 441 (651) T ss_pred CchhhHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCccc--HHH----------HHHHHHHHHHHHHHHHHHHHHHHhh Confidence 2345566777788889999999976654322 12111 111 1112234445555555555444321 Q ss_pred CCCcc-cccceeeEEeCC--CCCcCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHH--HHHHHHHHHHHHHHHH Q lcl|NC_010808. 415 SIDAN-KDFNTVRYVYNR--NLPKSLIEELKAYIDS--GGKISQTTLMSLFSF--FQDPELE--VKKIEEDEKESIKKAQ 485 (512) Q Consensus 415 ~~~~~-~d~~~i~i~f~~--~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~--v~d~~~E--~~ri~~E~~~~~~~~~ 485 (512) ..... ..-..+.+.|+. -+-.|..+.++.+.++ +|+++.-.++++++. ++++... +..+ .. T Consensus 442 l~~~e~~~~~~i~~ef~~~~llr~D~~~~~e~~~~~i~~G~~T~NE~R~~lglppi~~~~gd~~l~~~----------~~ 511 (651) T protein:vir:99 442 HQQALGVTDWTIEYELRGADQPKQEAQLAEQRVRAMRLAGVGLVDEAREELGLDPLGEPYGEMTLSEF----------EA 511 (651) T ss_pred cCccccccCceEEEEeccchhhhccHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcccccccccc----------cc Confidence 11100 001134455543 3456788888888776 689999999988864 3321110 1000 00 Q ss_pred hhcccCCCCC--CCCCCCCCCcCcccCCC Q lcl|NC_010808. 486 KGIYKDPRDI--NDDEQDDDTKDTVDKKE 512 (512) Q Consensus 486 ~~~~~~~~~~--~~~~~~~~~~~~~~~~e 512 (512) ........++ +..++.+++.... ++| T Consensus 512 ~~~g~~~~gge~~~~~~~~~~~~~~-~~e 539 (651) T protein:vir:99 512 EVAGDVAGGGETEAVHEPPEENKIG-ERE 539 (651) T ss_pred ccccccccCCCCcccccCccccccc-cch Confidence 0000000001 0111111111111 111 No 229 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=95.23 E-value=0.0025 Score=34.86 Aligned_cols=421 Identities=10% Similarity=0.065 Sum_probs=173.4 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) .--+-+|+-+-+.- ..|+.++...-+ +...--+.-+ .||.|...+-+.. -+ T Consensus 67 ~~~~~~~~~~~~~~------------~~~~~~~~~~~~---------------~~~~~~~~~l-~~~~~~~F~Gy~~-la 117 (698) T protein:vir:10 67 LRLARQFEVDVSNY------------TPRERRAASYAL---------------DFNGTSMDAL-SFVTSSGFPGFPT-LV 117 (698) T ss_pred ccccccceeccccC------------Cccccchhhhhh---------------cccccccccc-hhhhccCcchHHH-HH Confidence 11122344333111 111111110000 0000001111 1222221110000 00 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccC---------------ce----ecC-CchhHHHHHHHHHhccChhHHHHHH Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNP---------------IQ----CQD-DDKDVLEAIEAFNDLNDVESHNRSL 140 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~---------------~~----~~~-~d~~~~~~l~~~~~~n~~~~~~~~~ 140 (512) .- ..++..+.++.+++..+.-+- ++ ... .+.+..+.|..-++.-++...+.++ T Consensus 118 ~l-------aQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~ea 190 (698) T protein:vir:10 118 LL-------AQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTT 190 (698) T ss_pred HH-------hhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 000111111222221111111 11 111 1234556788888877888999999 Q ss_pred HHHHHhCCeEEEEEEECCCCc----e--------------EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcc Q lcl|NC_010808. 141 GLDLSIYGKAYELMIRNQDDE----T--------------RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDED 202 (512) Q Consensus 141 ~~~~~~~G~a~~~v~~d~~g~----~--------------~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~ 202 (512) .+.+-.||.+..++-.+.++. | .+.+++|..+.|-.-+. ..++ T Consensus 191 i~~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~~~n~--~dP~----------------- 251 (698) T protein:vir:10 191 VIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNS--INPV----------------- 251 (698) T ss_pred HHhcccccceEEEEEeecCccccccccccccccccCccceeeeeecccccccchhhh--ccch----------------- Confidence 999999999887765544331 0 14455555554421110 0010 Q ss_pred eEEEEEEEcCCcEEEEEecCCccccccccccccccccccc--cceEe-ecCCCCCCcchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 203 EVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFER--MPITE-FSNNERRKGDYEKVITLIDLYDNAESDTANYM 279 (512) Q Consensus 203 ~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--vPvv~-~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~ 279 (512) .-.+|-|..+ ++.. .. .. ...-|.|.. +|-.. -..+-.|.|....+.+.+++++++.-.....+ T Consensus 252 ---spdfgkP~~y--~V~G-~~-IH------~SRL~~~vg~pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li 318 (698) T protein:vir:10 252 ---ADDFYKPSTW--WMIG-SE-VH------ATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIV 318 (698) T ss_pred ---hhccCCCceE--EEec-ce-ec------ceeEEEecCCCchhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHHHH Confidence 0011222110 0000 00 00 000000100 12111 01233588888889999999888877776666 Q ss_pred HHhcCceeeeecCCc--CChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 280 SDLNDAMLLIKGNLS--LDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMF 357 (512) Q Consensus 280 ~~~~~~~lv~~g~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~ 357 (512) ..++...+. +++.. .+.... .... +. .+-.....+.... -.+ +.+=+|.+.+.+..++...+.+..+.|... T Consensus 319 ~~~~~~~l~-~dla~aL~~g~~~-~l~~-R~-eli~~~Rsn~G~~-llD-k~~Eefeq~st~lSGLddVi~qf~q~VAga 392 (698) T protein:vir:10 319 KQFSVSGIL-MDLAQALTPGANV-DLSM-RA-ELINRYRDNRNIL-FLD-KATEEFFQFNTPLSGLDALQAQAQEQMSAV 392 (698) T ss_pred HHhhHHHHH-HHHHHhcCChhhH-HHHH-HH-HHHHHhcCccceE-EEe-cCCcceEEEecCcCCHHHHHHHHHHHHHhh Confidence 544433321 11100 000000 0000 00 0000000111111 112 122235566778899999999999999999 Q ss_pred hccccccccccc---ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCC Q lcl|NC_010808. 358 TNTPNMKDDNFS---GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLP 434 (512) Q Consensus 358 s~~p~~~~~~~~---~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p 434 (512) +++|-.-+...+ =|.||++=...+...+.- ..+..+...|++++.+|+. +..+.. +. +|.+.|+|--. T Consensus 393 a~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s--~Qe~~L~p~L~rl~~ii~r--S~~G~i----dp-~i~~~fnPL~q 463 (698) T protein:vir:10 393 SHIPLIKLLGITPTGLNASSEGEIRVWYDYVRA--YQRNALQQLMNDVIVMIQL--SLFGAV----DP-SIKWQWNALRE 463 (698) T ss_pred hcCchhhhhccCCcccCccchhhHHHHHHHHHH--HHHHHHHHHHHHHHHHHHH--HhcCCC----CC-cceEEeCCCCC Confidence 999976654432 367888755555555433 3366788899888877643 222322 22 57889999988 Q ss_pred cCHHHHHHHHHHH---------hccCChHHHHHhC------CCC--CCHHHHH-----HHHHHHHHHHHHHHHhhcccCC Q lcl|NC_010808. 435 KSLIEELKAYIDS---------GGKISQTTLMSLF------SFF--QDPELEV-----KKIEEDEKESIKKAQKGIYKDP 492 (512) Q Consensus 435 ~d~~~~~~~~~kl---------~g~~s~et~~~~~------~~v--~d~~~E~-----~ri~~E~~~~~~~~~~~~~~~~ 492 (512) .+..+.|+.-.|- .|+|+...+..++ ++. .|++++- ..++.+... ....+ T Consensus 464 mtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~d~~d~p~~~~~~~~~~~~~~--------~~~~~ 535 (698) T protein:vir:10 464 LDDLEVAEARYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGAPADDDIDGVLTY--------VQRMA 535 (698) T ss_pred cCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCccccccCCcccCCCCCCCcchHHHhh--------hcCCc Confidence 8999988874432 4666655555444 121 1111100 000000000 00000 Q ss_pred CCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 493 RDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 493 ~~~~~~~~~~~~~~~~~~~e 512 (512) ..++..+..+..+--...+- T Consensus 536 ~~~~~~~~~~~~~~~~~~~~ 555 (698) T protein:vir:10 536 EGGDTGAPTAPGGARAGATA 555 (698) T ss_pred CCCCcccccccccccCCCCC Confidence 00000000000000000000 No 230 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=95.17 E-value=0.0026 Score=34.75 Aligned_cols=431 Identities=10% Similarity=-0.003 Sum_probs=183.4 Q ss_pred cccchhHHhh---hcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhh Q lcl|NC_010808. 30 TYDGTESDLL---QNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF 106 (512) Q Consensus 30 ~~~~~~~~~~---~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l 106 (512) +-++.+..+- +..+...+.+...+..-..+++.+.+|..-. .... ....+...|+--+-+...++.+++-| T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~-----~~~~-~~~~~~~~~~~dstg~~a~~~LAa~l 74 (516) T protein:vir:96 1 MKQSIDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPY-----LMND-KGDNETSQNGWQGVGAQATNHLANKL 74 (516) T ss_pred CcchhhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhccc-----ccCC-CCCccccCCcccchHHHHHHHHHHHH Confidence 2222222221 1122233333333333345666666665431 1111 11122223555677778888888877 Q ss_pred hcc--Cc-----eecCCch-------------hH-------HHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCC Q lcl|NC_010808. 107 LGN--PI-----QCQDDDK-------------DV-------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD 159 (512) Q Consensus 107 ~g~--~~-----~~~~~d~-------------~~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~ 159 (512) .+- |+ +++..++ ++ ...+...+..++|.....++.++...+|.+.+ |.+++ T Consensus 75 ~~~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--~~d~~ 152 (516) T protein:vir:96 75 AQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCML--YKPSK 152 (516) T ss_pred HhhhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeE--EecCC Confidence 652 22 2222221 12 22344556778899999999999999999865 45776 Q ss_pred CceEEEEEccceeEEEEeCCCCceeEEEEEEeeeee------ec--------cCCcceEEEEEEE-----cCCcEEEEEe Q lcl|NC_010808. 160 DETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKP------ID--------KTDEDEVFTVDLF-----TSHGVYRYLT 220 (512) Q Consensus 160 g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~------~~--------~~~~~~~~~~~~y-----t~~~~~~~~~ 220 (512) +.++ .+ |..-|.+.-+. .+++...+|...... .. ....+....+++| .++..+.+.. T Consensus 153 ~~~~--~~-pl~~y~v~~d~-~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~ 228 (516) T protein:vir:96 153 GAIS--AI-PMHHYVVNRDT-NGDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDGFWELKQ 228 (516) T ss_pred CCEE--EE-EcCeEEEeeCC-CCCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCceeEEEE Confidence 6544 33 33335554443 234444443221100 00 0000111123333 3332221111 Q ss_pred cCCccccccccccccccccccccceEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcC Q lcl|NC_010808. 221 SRTNGLKLTPRENGFESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSL 295 (512) Q Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~ 295 (512) ...+. . .......+|..+|++.+ ....+|+|..+...+-+..++.+.-...........|.+.+.-.... T Consensus 229 ~~d~~-~----~~~es~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~ 303 (516) T protein:vir:96 229 SADDI-P----VGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQT 303 (516) T ss_pred EeCce-e----eccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCccccc Confidence 11111 1 11112233455676654 24568999999999999888887777777777666666544211111 Q ss_pred ChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhcccccccccccccch Q lcl|NC_010808. 296 DPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS 373 (512) Q Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~S 373 (512) ....+. .++. ..+..+...+++.+. +..+.......++.++..|...-..-.+.. ..+...+ T Consensus 304 ~~~~l~---~~~~------------g~i~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~-r~~~rvT 367 (516) T protein:vir:96 304 DVDHFV---NSGT------------GEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTR-RDAERVT 367 (516) T ss_pred chhhhc---cCCC------------ceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhcc-CCCcccc Confidence 111111 1110 011112222333332 334566667777777777755332211111 1233457 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-H----HHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHH----- Q lcl|NC_010808. 374 GEAMKYKLFGLEQRTKTKEGLFTKGLRRRAK-L----LETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKA----- 443 (512) Q Consensus 374 g~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~-l----i~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~----- 443 (512) |..+... ..+++..++..+.++-. + |...+...+. ..+ -..+.+.+...+ +.+..++. T Consensus 368 AtEV~~r-------~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~p-~lp--~~~v~~~~vs~l--~~l~r~~~~~~i~ 435 (516) T protein:vir:96 368 AVEIQRD-------ALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGE-SFT--SDLVDPVIITGI--EALGRMAELDKLA 435 (516) T ss_pred HHHHHHH-------HHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcCC-CCc--cccccceeechH--HHHHHHHHHHHHH Confidence 7766553 45555556655555321 1 1111111111 111 112333332222 22222211 Q ss_pred -----HHHHhccCC-------hHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCC Q lcl|NC_010808. 444 -----YIDSGGKIS-------QTTLMSLF---SFFQ----DPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDT 504 (512) Q Consensus 444 -----~~kl~g~~s-------~et~~~~~---~~v~----d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (512) +..++++-| ...++..+ -+++ -.++|++++.+++++.....+...... +.-+..-..+- T Consensus 436 ~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~~~~~~~~q~~~~~a~~~~--~~~~~~~~~~~ 513 (516) T protein:vir:96 436 NFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMAQEQEAQMQAQQAQMLEEGVA--KAVPGVIQQEL 513 (516) T ss_pred HHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHhh--hhhhHHhhccc Confidence 111122212 12233322 1121 235667666666555443332221111 11111122222 Q ss_pred cCc Q lcl|NC_010808. 505 KDT 507 (512) Q Consensus 505 ~~~ 507 (512) +|. T Consensus 514 ~~~ 516 (516) T protein:vir:96 514 KEA 516 (516) T ss_pred ccC Confidence 222 No 231 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=94.76 E-value=0.0036 Score=34.00 Aligned_cols=411 Identities=10% Similarity=0.011 Sum_probs=167.9 Q ss_pred hhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccc-ccceee-- Q lcl|NC_010808. 14 RENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEY-MADNRV-- 90 (512) Q Consensus 14 ~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~-~~~~ri-- 90 (512) |++ -+.+.+.+... .+ +..- ..+ +....-|. +--+++.+........ ..-.++ T Consensus 1 ~~~-~~~~~p~~~~~----------~~----~~~~-~~~-------~~~~~g~~-~~D~~lr~~gg~~~~~~~l~~~m~e 56 (446) T protein:vir:98 1 MNM-EVRNAPTPAIR----------RR----TIYA-MEH-------LGLATSYL-SEDGGYKRAGKPTYQQLSAWDEAAQ 56 (446) T ss_pred Ccc-cccCCCchhhh----------hh----hhhc-ccc-------chhhcccC-CcchHhhhcCCChHHHHHHHHHHHh Confidence 111 12221111111 00 0000 000 11111111 1111111000000000 000011 Q ss_pred ecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHhccChhHHHHHHHHHHHhCCeE-EEEEEECCCCceE-EEEE- Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKA-YELMIRNQDDETR-LYKS- 167 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a-~~~v~~d~~g~~~-i~~~- 167 (512) ......-...+....+.+-+.++...+++..+++.+++..-.+...... ..++..||.+ .+.+|.-..|.-. .+++ T Consensus 57 ~D~~v~s~l~~Rk~av~~~~w~V~p~~~~~a~~v~~~l~~~~~~~~~~~-~ldai~~G~s~~Eivw~~~~g~~~p~~~~d 135 (446) T protein:vir:98 57 TEPIIAQGLDSIALSVLNKVGPYQHGDKRIKKFIDDQLRNRAKTWISHC-VKSIMTYGFSLSEQIYAHGARDNMPATVLD 135 (446) T ss_pred cchHHHHHHHHHHHHhhcCCceecCccHHHHHHHHHHHhhcCchhHHHH-HHHHHhhCceeeeEEEeecccccccchhhc Confidence 1345555666666667788888888888888999999987666555544 5788999975 5677764444211 1111 Q ss_pred -----ccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEE-EEEcCCcEEEEEecCCcccccccccccccccccc Q lcl|NC_010808. 168 -----DAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTV-DLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFE 241 (512) Q Consensus 168 -----~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~-~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 241 (512) .|...--.++.. .....+. ........ -.|.+-...++....... ....+..+-|.. T Consensus 136 ~~~~~~~~~~r~~~~~~--~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~g~~~~iP~~ 198 (446) T protein:vir:98 136 DIVNYHPLQVMLIANDN--GRIVDGD-----------TVTASQYKSGYWVPLPPYRIGDPPKKV----DVVGSHVRLPSH 198 (446) T ss_pred cccccccccceeeeccC--Ccccccc-----------ccchhhcccccccCcccchhhhhhhhc----ccCccccccccc Confidence 221111011111 0000000 00000000 000000000000000000 000000111111 Q ss_pred ccceEee---cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc---CChhhhhhhhhcccc--ccch Q lcl|NC_010808. 242 RMPITEF---SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLS---LDPDEVKKQKEANVL--FLEP 313 (512) Q Consensus 242 ~vPvv~~---~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~---~~~~~~~~~~~~~~~--~~~~ 313 (512) ++=+..+ ..++.|.|.+..+--.-=--+..+-+++..++.|..|+++.+--.+ .+.+........+.. .+.. T Consensus 199 kfi~~~~~~~~~~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~ 278 (446) T protein:vir:98 199 KRLFINYNTKGNNPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAED 278 (446) T ss_pred ceEEEEecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHH Confidence 1111111 2356788888776555555566778888889999999998763222 111111000000000 0000 Q ss_pred ---hhhhhccc---ccCCCCCcceeEEeecCC-HHHHHHHHHHHHHHHHHHhcccccccccccc-cchHHHH-HHHHHHH Q lcl|NC_010808. 314 ---TVYENRDT---GIETEGSVDGGYIYKQYD-VQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQSGEAM-KYKLFGL 384 (512) Q Consensus 314 ---~~~~~~~~---~~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~Sg~Ai-~~~~~~l 384 (512) ........ ....+.+..+++++.... ...++.+++.+.+.|...-....++.+...+ .-|. |+ +....-. T Consensus 279 av~~~~~da~~ii~~~~~P~g~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~-ala~vh~~V~ 357 (446) T protein:vir:98 279 ALRRLSTDSGLVLTQLSKEQPVQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTG-RASEIQLELF 357 (446) T ss_pred HHHhccccceeeeecccCCCCceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchh-hhHHHHHHHH Confidence 00011111 112256778888887644 3458899999999998865444333222111 1111 11 1111111 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hcc-CC--hHHHHH Q lcl|NC_010808. 385 EQRTKTKEGLFTKGLR-RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGK-IS--QTTLMS 458 (512) Q Consensus 385 ~~k~~~~~~~~~~~l~-~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~-~s--~et~~~ 458 (512) ...++.-.+.+...+. ++++-++.+ +.- ..........-.+.|...-+.|..+.++++.++ .|+ ++ .+.+.+ T Consensus 358 ~d~~~aDa~~i~~tln~~Li~~l~~l-Nf~-~~~~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire 435 (446) T protein:vir:98 358 DGKINSIFDTVIHAFTEQVIGNLIRL-NFD-PALYPLASNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDGDKDHIRS 435 (446) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh-CCC-ccccccccccccceeccCChhhHHHHHHHHHHHHhCCccccccHHHHHH Confidence 1222333344555553 455555542 211 111111111113456556788889999999888 454 44 455666 Q ss_pred hCCCCCCHHHHH Q lcl|NC_010808. 459 LFSFFQDPELEV 470 (512) Q Consensus 459 ~~~~v~d~~~E~ 470 (512) .++. .+++.-- T Consensus 436 ~~gi-P~~~~~~ 446 (446) T protein:vir:98 436 ITGL-PDAISST 446 (446) T ss_pred HhCc-CCCCCCC Confidence 6654 2211100 No 232 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=94.31 E-value=0.0048 Score=33.32 Aligned_cols=394 Identities=9% Similarity=-0.028 Sum_probs=154.8 Q ss_pred ccccccccccc----cccccc-ce--eeecchHHHHHHHHHhhhhccCceecCCch--hHHHHHHHHHhc--c---ChhH Q lcl|NC_010808. 70 KTKNLVELTRR----KEEYMA-DN--RVAHDYASYISDFINGYFLGNPIQCQDDDK--DVLEAIEAFNDL--N---DVES 135 (512) Q Consensus 70 ~~~~~~~~~~~----~~~~~~-~~--ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~--~~~~~l~~~~~~--n---~~~~ 135 (512) -..++...... ...... .. -+........|+.+++-+.+-|+.+--.+. .....+..++.. | .... T Consensus 1 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~~~~~~~l~~lL~~~PN~~~t~~~ 80 (723) T protein:vir:94 1 MTTFPSGAGGWNAWSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDGELDELHPLSQLWNVMPNRAMPAQV 80 (723) T ss_pred CcccccCCCccccccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCCccchhhHHHHHHhhCCCCCCCHHH Confidence 11111100000 000000 00 012233444566666666666776532221 122234555532 3 2344 Q ss_pred HHHHHHHHHHhCCeEEEEEEECC---CCce-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEc Q lcl|NC_010808. 136 HNRSLGLDLSIYGKAYELMIRNQ---DDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFT 211 (512) Q Consensus 136 ~~~~~~~~~~~~G~a~~~v~~d~---~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt 211 (512) +...+..+.+.+|.+|+++..+. .|.| .+..++|..+.++..+............|.....++ .. ..+. T Consensus 81 f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~~~~y~~~~~~G------~~-~~~~ 153 (723) T protein:vir:94 81 LKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQAQIIGYVIERTDG------VR-VPVL 153 (723) T ss_pred HHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceeeeeeEEEEEecCc------ee-EEec Confidence 56667778889999999987643 3443 345555555444433221110000111111110000 00 0112 Q ss_pred CCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec Q lcl|NC_010808. 212 SHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKG 291 (512) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g 291 (512) ++.+++++.. + | .+.-.|.|.+......|.....+..-....+.-.+.|-.+++. T Consensus 154 ~~dIiHir~~----------------~-----~----~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~ 208 (723) T protein:vir:94 154 ADEMLWLRFS----------------D-----P----YDPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPGGVVNL 208 (723) T ss_pred ccceEEecCC----------------C-----C----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEc Confidence 2222222110 0 0 1122477777766665555444444334444445556666652 Q ss_pred CCcCChhhhhhhhhccccccchhhhhhccccc--------CCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhccccc Q lcl|NC_010808. 292 NLSLDPDEVKKQKEANVLFLEPTVYENRDTGI--------ETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNM 363 (512) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~ 363 (512) ...+++..+..+..-.-........+..... ..+.|.+++.++.......+.+..+.....|...-++|.. T Consensus 209 -~~l~~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~ 287 (723) T protein:vir:94 209 -GDMDEQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKD 287 (723) T ss_pred -CCCCHHHHHHHHHHHHHHhhchhhcCcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChh Confidence 3344444333332111000000000000000 0112334444443323344555666677788888899865 Q ss_pred ccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCC--CCCcCHHHHH Q lcl|NC_010808. 364 KDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNR--NLPKSLIEEL 441 (512) Q Consensus 364 ~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~--~~p~d~~~~~ 441 (512) ..+..+...+.... ....+...|...++.|...++..-... . -..+.+.|+. -+-.|..+.+ T Consensus 288 ~i~~~st~sN~e~~-------------~~~f~~~tL~P~~~~ie~~ln~~Ll~~--~-g~~~~~~f~~~~lLr~D~~~r~ 351 (723) T protein:vir:94 288 ALLGGSTYENQAEA-------------KAAVWTETLIPQMEVMASITDLQLLPD--I-GWTVEWDFNSVPALQEDLEAQA 351 (723) T ss_pred HcCCCCCcccHHHH-------------HHHHHHHHHHHHHHHHHHHHhHhhccc--c-cCceEEeecchhhhhcCHHHHH Confidence 44322211111110 111233444444444444444321111 1 1235677754 3457888888 Q ss_pred HHHHHH--hccCChHHHHHhCCC--CCCHHHHH--H-------------HHHHHHHHHHHHHHhhcccCCCCCC-CCCC- Q lcl|NC_010808. 442 KAYIDS--GGKISQTTLMSLFSF--FQDPELEV--K-------------KIEEDEKESIKKAQKGIYKDPRDIN-DDEQ- 500 (512) Q Consensus 442 ~~~~kl--~g~~s~et~~~~~~~--v~d~~~E~--~-------------ri~~E~~~~~~~~~~~~~~~~~~~~-~~~~- 500 (512) +.+.++ +|+++.-.+++.++. +.+-...+ . --.+|....+.........+....+ .... T Consensus 352 ~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~~~~~~p~~~~~a~~~~~~p~~~e~~~~~~~~~~~~~~~~p~~~~~~~~~ 431 (723) T protein:vir:94 352 GRNQGYLVNDVLMVDEVRATIGLDPLPGGIGQMTLTPYRAQFAPAPAPAPAVEEGAARMLALLERVAADRPLPELPVRAT 431 (723) T ss_pred HHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeccccccccCCCCCCccchhhhHhhhhhccccccccCcCCCCCCCC Confidence 888876 689999888887744 22111110 0 0011111111111111111110000 0000 Q ss_pred ---CCCCcCcccCCC Q lcl|NC_010808. 501 ---DDDTKDTVDKKE 512 (512) Q Consensus 501 ---~~~~~~~~~~~e 512 (512) ..+.+.+.++++ T Consensus 432 ~~~~~~~~~~~~~~~ 446 (723) T protein:vir:94 432 TVLHHDPGPDPQQTL 446 (723) T ss_pred CCCCCCcccCCchhH Confidence 011112222222 No 233 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=94.13 E-value=0.0053 Score=33.06 Aligned_cols=380 Identities=12% Similarity=0.035 Sum_probs=143.1 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCCch Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDK 118 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~ 118 (512) ...++++..+..+. .... ....+-. ...... .....-+...-....|+.+++-+..-|+.+--.++ T Consensus 1 Mg~~~~~~~~~~~~----~~~~-~~~~~~~--------~~~~~~-~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~~ 66 (395) T protein:vir:40 1 MGFKSWVSGFFNEE----QRTL-NLTDTVW--------CSIPSE-KLKELSIKKWAIDSCANKIANTLSCAEVLTYEKGE 66 (395) T ss_pred CchHHHHHhhhccc----cccc-ccccchh--------hccccc-cchhhhhhhHHHHHHHHHHHHHHhhCceeeccCCc Confidence 22223333322111 0000 0000000 000000 00011122233445566666666666776543444 Q ss_pred hHHHHHHHHHhc--c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeee Q lcl|NC_010808. 119 DVLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRT 193 (512) Q Consensus 119 ~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~ 193 (512) .....+..++.. | ....+...+..+++.+|.||+++..+.. . -|..+... .....-.+++.+ T Consensus 67 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~~---~----~~~~~~~~------~~~~~~~~~~~v 133 (395) T protein:vir:40 67 EVRKKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEYI---Y----VADSFTKN------DKSLYENTYTEV 133 (395) T ss_pred cccchHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCce---e----ecCCcccc------ccccccceeeee Confidence 444445555432 3 2345556678888999999987754321 1 11111000 000000111100 Q ss_pred eeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHH Q lcl|NC_010808. 194 KPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (512) Q Consensus 194 ~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s 273 (512) . .+ ....-..|.+..+++++. ++..+.+... .+...+....+ T Consensus 134 ~-~~-----~~~~~~~~~~~evih~r~-----------------------------~~~~~~~~~~---~l~~~~~~~~~ 175 (395) T protein:vir:40 134 T-LK-----DLTLKKEFKESEVLHLTL-----------------------------NNESIKSIID---GFYLLYGDLLT 175 (395) T ss_pred e-ec-----CceeeeeeccccEEEeec-----------------------------CCCCccccch---hHHHHHHHHHH Confidence 0 00 000001123333333321 1111112111 12222222222 Q ss_pred HHHHHHHHh--cCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHH---HHH Q lcl|NC_010808. 274 DTANYMSDL--NDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEA---YKD 348 (512) Q Consensus 274 ~~~~~~~~~--~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~---~~~ 348 (512) ...+..... ..+.+++......+++..+..+..-.-.... ...+.......+++.+++.++.......+.+ +.+ T Consensus 176 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~ 254 (395) T protein:vir:40 176 AAVNKYKKLNSRKIIVKLKAMFGQTPEAEEKLRLMLSERMKK-FLAEGDSALPVEDGMEIDELAGDSKIAESRDIKKMID 254 (395) T ss_pred HHHHHHHhcCCCCceEEEecccCCCHHHHHHHHHHHHHHHHH-hhccCCceeecCCCceEEeccCChhhhhHHHHHHHHH Confidence 222222222 3455555444334443333322211111110 0011111223455666666554333222222 223 Q ss_pred HHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccc-ccceeeE Q lcl|NC_010808. 349 RLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANK-DFNTVRY 427 (512) Q Consensus 349 ~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~-d~~~i~i 427 (512) .+...|+..-++|....+...+|.+. .....+...|..+++.|..-+..+--..... ....+++ T Consensus 255 ~~~~~Ia~~fgVPp~~l~~~~sn~e~---------------~~~~f~~~~L~P~~~~ie~~l~~kLl~~~~~~~g~~i~f 319 (395) T protein:vir:40 255 DVFEMVANSFNIPLGLAKGDTVGLSE---------------QVNSFLMFSINPIAEMFTDEGNRKFYGRDSVLERTYMKL 319 (395) T ss_pred HHHHHHHHHhCCCHHHhcCCCcCHHH---------------HHHHHHHHHHHHHHHHHHHHHHHhcCChhhhcCCceEEE Confidence 34566777788887655322222211 1223445566666666655555332111111 1123455 Q ss_pred EeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCC Q lcl|NC_010808. 428 VYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFF--QDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDD 503 (512) Q Consensus 428 ~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (512) .+..-+-.|..+.++++.++ .|+++.-.+++.++.- +++... +.. ........... .+..++.++ T Consensus 320 d~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~gD--~~~------~~~n~~~~~~~---~~~~kgge~ 388 (395) T protein:vir:40 320 DTTRIKVQDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPETQ--ERF------VTKNYAPLGEN---EEDLKGGDI 388 (395) T ss_pred echhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCc--eee------ecccccccccc---ccccCCCCC Confidence 55566778889999998887 6899999988888652 222210 000 00000000000 000111111 Q ss_pred CcCcccC Q lcl|NC_010808. 504 TKDTVDK 510 (512) Q Consensus 504 ~~~~~~~ 510 (512) .+++.+. T Consensus 389 ~~~~~~~ 395 (395) T protein:vir:40 389 NENKGDS 395 (395) T ss_pred CCCcCCC Confidence 1111111 No 234 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=94.12 E-value=0.0053 Score=33.04 Aligned_cols=366 Identities=10% Similarity=0.034 Sum_probs=145.9 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCCch Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDK 118 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d~ 118 (512) ....+ ++.. |-......+.+.. .. ......-+.......+|+.+++-+..-|+.+--.+. T Consensus 1 Mg~f~---~~f~--------~~~~~~~~~~~~~--~~-------~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~ 60 (385) T protein:vir:95 1 MGLFD---SVFK--------RHSELSWMYDLEF--LQ-------DKSKKAYLKQIALNTVVEMVARTISQSEFRVMKNNT 60 (385) T ss_pred Cchhh---hhhc--------cCcccccccchhh--hh-------ccchhhhhhhHHHHHHHHHHHHHHcccceeeeecCc Confidence 00000 0000 0000000111100 00 000011122345566677777777777776532333 Q ss_pred hHHHHHHHHHhc--c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCceEE--EEEccceeEEEEeCCCCceeEEEEEEe Q lcl|NC_010808. 119 DVLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDETRL--YKSDAMSTFVIYDNTIERNSIAGVRYL 191 (512) Q Consensus 119 ~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i--~~~~p~~~~~i~d~~~~~~~~~~v~~~ 191 (512) .....+..++.. | ....+...+..+.+.+|.||++.. .++...+ .+..|.. ..++.. .++ T Consensus 61 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~--~~~~~~~~~~~~~~~~-~~~~~~----------~~~ 127 (385) T protein:vir:95 61 KEKGTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKN--DEGHFFVADDFEKEDE-LGLYSH----------RFT 127 (385) T ss_pred cccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEe--cCCCeeeccccccccc-cccccc----------cce Confidence 333345555532 3 335566778888999999997653 3333211 1111111 111100 001 Q ss_pred eeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHH Q lcl|NC_010808. 192 RTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNA 271 (512) Q Consensus 192 ~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~ 271 (512) .... . .......+.++.+.+++... ......|.|.+..+...+.. T Consensus 128 ~~~~-~-----~~~~~~~~~~~eiih~~~~~-------------------------~~~~~~G~s~~~~~~~~i~~---- 172 (385) T protein:vir:95 128 NVLV-N-----DFEFKRVFTMDDVIYLKYNN-------------------------QKLDAFSLGLFEDYGEIFGR---- 172 (385) T ss_pred eeee-c-----ccceeeeeccccEEEecCCC-------------------------CCcccccchHHHHHHHHHHH---- Confidence 0000 0 00001112222222221100 00011245544444333322 Q ss_pred HHHHHHHHHHhcCc--eeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecC------CHHHH Q lcl|NC_010808. 272 ESDTANYMSDLNDA--MLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQY------DVQGT 343 (512) Q Consensus 272 ~s~~~~~~~~~~~~--~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~------~~~~~ 343 (512) ......+.+.| ++++.+....+++..+..+..-.-.. .+...........+++.+++.++... ....+ T Consensus 173 ---~~~~~~~~~~~~g~l~~~~~~~~~~e~~~~~~~~~~~~~-~g~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~ 248 (385) T protein:vir:95 173 ---MIDLQMLNNQIRGILKVDATKFYNKEKQKELQAYIDTLF-DAFQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSEL 248 (385) T ss_pred ---HHHHHHhcCCCceEEEeCCccCCCHHHHHHHHHHHHHHh-hhhhhcCCceEEcCCCceeEeecccccccCCHHHHHH Confidence 22222223333 22332332333333222221110000 00000111112245566666554321 23456 Q ss_pred HHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccc Q lcl|NC_010808. 344 EAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFN 423 (512) Q Consensus 344 ~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~ 423 (512) .+..+.....|+..-++|....+...+|. + ......+...|..+++.|...++.+-......... T Consensus 249 ~e~~~~~~~~Ia~~fgVpp~~l~~~~sn~--e-------------~~~~~~~~~~l~P~~~~ie~~l~~~L~~~~~~~~~ 313 (385) T protein:vir:95 249 NELKKTVLTDVARMIGVPPSLVLGEMADL--E-------------KTIESYLQFCINPLLRKIEAELNSKFFYQDEYLND 313 (385) T ss_pred HHHHHHHHHHHHHHhCCCHHHhcCCCcCH--H-------------HHHHHHHHHHHHHHHHHHHHHHHhhcCChhhcccc Confidence 66777788889888899876553211111 1 12334455566666666666665432211111111 Q ss_pred eeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC--CHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCC Q lcl|NC_010808. 424 TVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQ--DPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDE 499 (512) Q Consensus 424 ~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~--d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 499 (512) .+++.+..-+..|..+.++++.++ .|+++.-.+++.++.-. ++.. ++.. ..... ...+ T Consensus 314 ~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~g--d~~~--------~~~n~-----~~~~--- 375 (385) T protein:vir:95 314 DMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPEL--DKFI--------ITKNL-----QSAD--- 375 (385) T ss_pred eEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC--ceee--------ecccc-----eecc--- Confidence 345555566778888999998887 68999999888886532 1110 0000 00000 0000 Q ss_pred CCCCCcCcccCCC Q lcl|NC_010808. 500 QDDDTKDTVDKKE 512 (512) Q Consensus 500 ~~~~~~~~~~~~e 512 (512) ..+...+++| T Consensus 376 ---~~kgge~~~e 385 (385) T protein:vir:95 376 ---AFKGGESNEE 385 (385) T ss_pred ---cccCCCCCCC Confidence 0111111111 No 235 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=93.86 E-value=0.0061 Score=32.71 Aligned_cols=435 Identities=10% Similarity=-0.007 Sum_probs=176.2 Q ss_pred hhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhcc--Cc---- Q lcl|NC_010808. 38 LLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGN--PI---- 111 (512) Q Consensus 38 ~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~--~~---- 111 (512) +.+..+.+.+.++ ++.-..+++.+.+|..-.- . ............|..-+-+...++.+++.|.+- |+ T Consensus 1 mk~~~~~~~~~lk--r~~~e~~w~e~a~~tlP~~---~-~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~W 74 (510) T protein:vir:78 1 MKSTAAMLWEKLR--DGSVEQRAIEFAKTTLPYL---M-VDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPF 74 (510) T ss_pred ChhHHHHHHHHHh--ccchHHHHHHHHHhhcccc---c-cCCCCcccccccCcccchHHHHHHHHHHHHHHhhcCCCCcc Confidence 2222233332221 1222334555555543211 0 000111111112344566777788888777652 22 Q ss_pred -eecCCch-------------hHH-------HHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccc Q lcl|NC_010808. 112 -QCQDDDK-------------DVL-------EAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAM 170 (512) Q Consensus 112 -~~~~~d~-------------~~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~ 170 (512) ++...++ ++. ..+...+..++|.....++.++...+|.+.+++ ++++. +++.++ . T Consensus 75 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~--~~~~~-~~~~~p-l 150 (510) T protein:vir:78 75 FRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEA-TVVAWS-L 150 (510) T ss_pred cccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEE--eCCCC-eEEEEE-c Confidence 2222221 122 234445667899999999999999999986554 45543 344443 3 Q ss_pred eeEEEEeCCCCceeEEEEEEeeeeee-----------c-cCCcceEEEEEEEcC-----Cc---EEEEEecCCccccccc Q lcl|NC_010808. 171 STFVIYDNTIERNSIAGVRYLRTKPI-----------D-KTDEDEVFTVDLFTS-----HG---VYRYLTSRTNGLKLTP 230 (512) Q Consensus 171 ~~~~i~d~~~~~~~~~~v~~~~~~~~-----------~-~~~~~~~~~~~~yt~-----~~---~~~~~~~~~~~~~~~~ 230 (512) .-|.+.-|. .+++...+|.++.... . ....+....+++|+. +. .+.+..+..+ . T Consensus 151 ~~y~v~~d~-~G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg-~---- 224 (510) T protein:vir:78 151 RSYAVRRDA-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDG-V---- 224 (510) T ss_pred ceeEEeeCC-CcCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEEEEeecCCCCcEEEEEEEecC-e---- Confidence 435554443 3455555554433210 0 001112223334331 00 0000000000 0 Q ss_pred cccccccccccccceEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhh Q lcl|NC_010808. 231 RENGFESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKE 305 (512) Q Consensus 231 ~~~~~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~ 305 (512) ........++..+|++.+ ....+|+|..+...+-+..++.+.-...........|.+.+.-........+. . T Consensus 225 ~i~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~l~---~ 301 (510) T protein:vir:78 225 RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDDYQ---D 301 (510) T ss_pred eeccccccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCCccccchhhhc---c Confidence 011112223455676654 24568999999999999999887666655555445544332210011111111 1 Q ss_pred ccccccchhhhhhcccccCCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHH Q lcl|NC_010808. 306 ANVLFLEPTVYENRDTGIETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFG 383 (512) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~ 383 (512) ++. ..+..+...+++.+. +..+.......++.++..|...-.. +.. ...+...||..+...... T Consensus 302 ~~~------------g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~-~l~-~~~~~rvTAtEV~~r~~E 367 (510) T protein:vir:78 302 AEM------------GDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GAN-QRDAERVTAEEVRITAEE 367 (510) T ss_pred CCC------------ceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHHhh-ccc-cCCCCCcCHHHHHHHHHH Confidence 110 000111122233332 2334565666677776666543221 111 122344677776665333 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCccc-ccceeeEEeCCCCCcCHH-HHH----HHHHHHhc---c--- Q lcl|NC_010808. 384 LEQRTKT-KEGLFTKGLRRRAKLLETILKNTRSIDANK-DFNTVRYVYNRNLPKSLI-EEL----KAYIDSGG---K--- 450 (512) Q Consensus 384 l~~k~~~-~~~~~~~~l~~~~~li~~~l~~~~~~~~~~-d~~~i~i~f~~~~p~d~~-~~~----~~~~kl~g---~--- 450 (512) +.+...- ..++-.+.+.-+++..+.++...+....+. ......+++..++-+... +.+ +.+..+.+ + T Consensus 368 ~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~q~l~~~~~~~q~~~~ 447 (510) T protein:vir:78 368 AENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIAGLAPIAQLDPR 447 (510) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccceeeecccHHHHHHHHHHHHHHHHHHHHhcChhhhhhc Confidence 3332221 233333344445555555554433222221 222223344333322111 001 11111111 1 Q ss_pred CChHHHH----HhCCCCC----CHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCc Q lcl|NC_010808. 451 ISQTTLM----SLFSFFQ----DPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTK 505 (512) Q Consensus 451 ~s~et~~----~~~~~v~----d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (512) +....++ ..+|... -.++|++.+.++++.....++........+..+......+- T Consensus 448 id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~~q~~~~~~~~~a~~~~~~~~~~~~~g~ 510 (510) T protein:vir:78 448 ISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred CCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccCCCC Confidence 1122222 2333201 23567777766654333222211111011111111111110 No 236 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=93.82 E-value=0.0063 Score=32.66 Aligned_cols=347 Identities=10% Similarity=0.045 Sum_probs=142.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccee--eecchHHHHHHHHHhhhhccCceecCCchhHHHH Q lcl|NC_010808. 46 SKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR--VAHDYASYISDFINGYFLGNPIQCQDDDKDVLEA 123 (512) Q Consensus 46 ~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~r--i~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~ 123 (512) ..++.....+.......+.....+... ..... ..+.+ +.+.=..-.|+.+++-+.+-|+. ++.. T Consensus 1 M~~~~~f~~r~~~~~~~~~~~~~~~~~------~~~~~-~v~~~~al~~~av~~cv~~ia~~ia~~p~~---~~~~---- 66 (359) T protein:vir:10 1 MSILNPFERRSSITPNNYYPFMVQNGS------IVPNS-LVDATEALKNSDLYAVTSLISSDIAGTRFI---GNQV---- 66 (359) T ss_pred CcccchhhccccCCCCcchhhhhcccc------ccCCc-ccCHHHhhcchHHHHHHHHHHHhhhcCccc---cchH---- Confidence 111110000000000000000000000 00000 00111 11111223455566655555653 2222 Q ss_pred HHHHHhc-c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeecc Q lcl|NC_010808. 124 IEAFNDL-N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDK 198 (512) Q Consensus 124 l~~~~~~-n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~ 198 (512) +..++.. | .-..+...+..+.+.+|.+|+++-++..|.+ .+..++|..+.+..++. . ++|. +....+ T Consensus 67 ~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~~~~---~----~~y~-~~~~~~ 138 (359) T protein:vir:10 67 FTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDLTDD---T----LTYE-VNQFDD 138 (359) T ss_pred HHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEEEcCC---e----EEEE-EEecCC Confidence 2333332 3 3344556677788889999999999988875 46677777776655432 1 2111 111110 Q ss_pred CCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 199 TDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANY 278 (512) Q Consensus 199 ~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~ 278 (512) .....+.++.+.+++...... + |. +.-.|.|.++.+...+.....+..-..+. T Consensus 139 ------~~~~~~~~~evih~~~~~~~~------------~-----~~----dg~~G~spi~~~~~~i~~~~~~~~~~~~~ 191 (359) T protein:vir:10 139 ------YPSAKYNASEMIHVKIMAYGV------------D-----TL----HNLVGHSPLESLTSEIGQQKEANRLSLST 191 (359) T ss_pred ------ceEEEEcccceEEeccCCCCC------------C-----cc----CccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 011234455555543321100 0 00 11247777777777776666655555555 Q ss_pred HHHhcCceeeeecC-CcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 279 MSDLNDAMLLIKGN-LSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMF 357 (512) Q Consensus 279 ~~~~~~~~lv~~g~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~ 357 (512) ++-.+.|-.+++-- ...+.+....++..-. ..... .+.....-.+++.+.+.++.......+.+..+.....|+.. T Consensus 192 f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~-~~~~~--~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~ 268 (359) T protein:vir:10 192 LKGALNPTSVVKVPQGTLSSEAKDSIRKEFE-KANGG--NNSGRVMVLDQSADFSTVSINADVANYLNSMNWGRTQIAKA 268 (359) T ss_pred HhccCCcceEEEeCCCCCCHHHHHHHHHHHH-HHhCc--cccCCceecCCCcceeeecCCHHHHHHHHHHHHHHHHHHHH Confidence 56566676666531 1233333332222110 00000 11111122344555555543333334456667777888888 Q ss_pred hccccccccccc-ccchHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCc Q lcl|NC_010808. 358 TNTPNMKDDNFS-GTQSGEAMKYKLFGLE-QRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPK 435 (512) Q Consensus 358 s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~-~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~ 435 (512) -++|....+..+ .+.+...++..+.... ..+.- +...|++. +... . ..+... -+.| T Consensus 269 fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~l~p----~~~~l~~~-------l~~~--~--~~~~~~-~~~~------ 326 (359) T protein:vir:10 269 FGVSDSYLNGTGDQQSSLDQIKDLYVNALNRFIEP----LISELRIK-------CDSS--I--GVDMSP-ITDY------ 326 (359) T ss_pred hCCCHHHhCCCCcccccHHHHHHHHHHHHHHHHHH----HHHHHHHH-------hhhh--h--cccchh-hhhc------ Confidence 899987665433 2234333333222111 11111 11111110 0000 0 011110 1112 Q ss_pred CHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHH Q lcl|NC_010808. 436 SLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEV 470 (512) Q Consensus 436 d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~ 470 (512) |.......+.++ +|+++.-.++++++.-. - | T Consensus 327 d~~~~~~~~~~~~~~G~~t~NE~R~~l~~~p--v--~ 359 (359) T protein:vir:10 327 SNSVFKADILNWVKEGIIEPTEAKTLLESKG--I--I 359 (359) T ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC--C--C Confidence 223333334444 68999988888773211 0 0 No 237 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=92.88 E-value=0.0096 Score=31.64 Aligned_cols=395 Identities=10% Similarity=0.047 Sum_probs=154.1 Q ss_pred hhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHH-Hhccccccccccccccccccccee--e Q lcl|NC_010808. 14 RENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSD-YYEGKTKNLVELTRRKEEYMADNR--V 90 (512) Q Consensus 14 ~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~-yy~G~~~~~~~~~~~~~~~~~~~r--i 90 (512) |-+.++++..... ...... +..|.. . .............. . T Consensus 1 Mg~~~~~~~~~~~---------------------------------~~~~~~~~~~~~~--~-~~~~~~~~~~~~~~~~~ 44 (423) T protein:vir:81 1 MGFLQKLGLAPSV---------------------------------VATPEPIELVGPI--F-ESLKLSTKNMTVEQIWE 44 (423) T ss_pred CchhHhhcccccc---------------------------------ccCcccccccccc--c-cccccccchhhHHHHHH Confidence 1111111110000 000000 000000 0 00000000000000 1 Q ss_pred ecchHHHHHHHHHhhhhccCcee---cCCc--hh-HHHHHHHHHhc-c---ChhHHHHHHHHHHHhCCeEEEEEEECCCC Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGNPIQC---QDDD--KD-VLEAIEAFNDL-N---DVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~~~~~---~~~d--~~-~~~~l~~~~~~-n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g 160 (512) .++.....|+.+++-+..-|+.+ +.+. +. ....+..++.. | ....+...+..+.+.+|.+|+++..+..+ T Consensus 45 ~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg~~~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~ 124 (423) T protein:vir:81 45 DQPHLRTVTTFIARNVASLQLQAFERVEDGGRERVREGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGV 124 (423) T ss_pred hhhHHHHHHHHHHHhHhhCceEEEEEecCCceeeeccchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCc Confidence 13455567788888777777754 1111 11 12234455532 3 34556666778899999999998876544 Q ss_pred ceEEEEEccceeEEEE---eCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccccccccccc Q lcl|NC_010808. 161 ETRLYKSDAMSTFVIY---DNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFES 237 (512) Q Consensus 161 ~~~i~~~~p~~~~~i~---d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (512) ...+..+.|..+..+. ...... .++|........ ....++ +.+..+.+++... T Consensus 125 ~~~~~~l~p~~~~~v~~~~~~~~~~----~~~Y~~~~~~~~----~g~~~~-~~~~evih~r~~~--------------- 180 (423) T protein:vir:81 125 DTPTLDIRPIPVSWVQRRAYKDGWG----SLDYIIIESGDN----DGRSVK-VPGERVIHRHGYN--------------- 180 (423) T ss_pred CcceEEEeecccceeeeeeccCCCc----ceEEEEEEecCC----CceEEE-EcccceEEecCCC--------------- Confidence 3333334333222221 000000 011110000000 000011 1222222221100 Q ss_pred ccccccceEeecCC-CCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC-----cCChhhhhhhhhcccccc Q lcl|NC_010808. 238 HSFERMPITEFSNN-ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNL-----SLDPDEVKKQKEANVLFL 311 (512) Q Consensus 238 ~~~~~vPvv~~~n~-~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~-----~~~~~~~~~~~~~~~~~~ 311 (512) .+. ..|.|.+..+...++....+..-....+.-.+.|-.+++... ..+++..+..+..-.-.. T Consensus 181 -----------~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~ 249 (423) T protein:vir:81 181 -----------PKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASF 249 (423) T ss_pred -----------CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHh Confidence 011 247777776666666555544444444455556666664211 122222222221100000 Q ss_pred chhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccc-hHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 312 EPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQ-SGEAMKYKLFGLEQRTKT 390 (512) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~-Sg~Ai~~~~~~l~~k~~~ 390 (512) .. ...+.....-.+++.++..++.......+.+..+.....|+..-++|....+...+.. |. ++. . T Consensus 250 ~~-~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn--~e~----------~ 316 (423) T protein:vir:81 250 SP-KSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSN--VRE----------F 316 (423) T ss_pred cc-ccccCCcceecCCCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCccc--HHH----------H Confidence 00 0000011112344556655554333334445556677788888899977665432221 22 111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeC--CCCCcCHHHHHHHHHHH---hccCChHHHHHhCCCCCC Q lcl|NC_010808. 391 KEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYN--RNLPKSLIEELKAYIDS---GGKISQTTLMSLFSFFQD 465 (512) Q Consensus 391 ~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~--~~~p~d~~~~~~~~~kl---~g~~s~et~~~~~~~v~d 465 (512) ....+...|.-.+..|...++..-......+.....+.|+ .-+..|..+.++++.++ .|+++.-.+++.++.-.. T Consensus 317 ~~~f~~~~L~P~~~~ie~~l~~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl~p~ 396 (423) T protein:vir:81 317 RKALYGDNLGSWIRIIQDVMNLFLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAMDNLPSI 396 (423) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHHhCCCCC Confidence 1223333455555445444443221111112223345553 44567888888887764 488998888888765321 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCc Q lcl|NC_010808. 466 PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDT 507 (512) Q Consensus 466 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (512) +. -+.+ .......+.+.++ ...++.+. T Consensus 397 ~g--GD~~-----------~~p~n~~~~~~~~--~~~~~~~t 423 (423) T protein:vir:81 397 DG--GDDL-----------ARPLNTEFGDSED--APGEEVET 423 (423) T ss_pred CC--ccee-----------ecccccccCccCC--CCCCCCCC Confidence 11 0000 0000111111100 00111111 No 238 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=89.39 E-value=0.027 Score=29.22 Aligned_cols=413 Identities=11% Similarity=-0.011 Sum_probs=172.7 Q ss_pred HHHHHHHHHHHHHHHHHHhcccc---cccccccccccccc--c---ce-ee--ecchHHHHHHHHHhhhhccCceecCC- Q lcl|NC_010808. 49 IEHHMDYQRPRLKVLSDYYEGKT---KNLVELTRRKEEYM--A---DN-RV--AHDYASYISDFINGYFLGNPIQCQDD- 116 (512) Q Consensus 49 i~~~~~~~~~r~~~~~~yy~G~~---~~~~~~~~~~~~~~--~---~~-ri--~~n~~~~iv~~~a~~l~g~~~~~~~~- 116 (512) +..+.....||.+..+---.|-. .++..... .+.-+ . .. .+ ......-.+.+....+.+-++++... T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~~~~~~~~~~~e~-~~~lr~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~~w~v~p~~ 79 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGVVDGWTVWDPFEQ-TPELQWPQSVAVYSRMDNEDSRVTSLLEAISLPIRSTPWRIRANG 79 (469) T ss_pred CCCcccCCCCccchhhhhhcccccchhhcccccc-ccccccccchHHHHHHHhhChHHHHHHHHHHHHHhcCCceEecCC Confidence 11111222222222111011111 11110000 00000 0 00 11 13555566666777788888877533 Q ss_pred -chhHHHHHHHHHhc-----------------cChhHHHHHHHHHHHhCCeEE-EEEEECC----CCceEEEEEccc--e Q lcl|NC_010808. 117 -DKDVLEAIEAFNDL-----------------NDVESHNRSLGLDLSIYGKAY-ELMIRNQ----DDETRLYKSDAM--S 171 (512) Q Consensus 117 -d~~~~~~l~~~~~~-----------------n~~~~~~~~~~~~~~~~G~a~-~~v~~d~----~g~~~i~~~~p~--~ 171 (512) +++..+++.+.+.. ..+...+.++..++.-||.++ +.+|... +|...+..+.+. . T Consensus 80 ~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~~~~~~l~~rp~~ 159 (469) T protein:vir:10 80 ASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGRFWLRKLAPRPQW 159 (469) T ss_pred CCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCceeeeeeeecCcc Confidence 33333444443321 135566666777788899754 5777532 355444333222 1 Q ss_pred eEE--EEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEee- Q lcl|NC_010808. 172 TFV--IYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEF- 248 (512) Q Consensus 172 ~~~--i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~- 248 (512) .+- .|++. ...+ .++ ...+..-........ ........+...|-.++- T Consensus 160 ~i~~~~~~~~--~~l~-~~~-------------------~~~~~~~~~~~~~~~-------~~~~~~lp~~k~i~~~~~~ 210 (469) T protein:vir:10 160 TISKFNVAPD--GGLE-SIE-------------------QIAPPARTRGSLYVA-------NIAPPEIPVNRLVVYTRNK 210 (469) T ss_pred cceeeeeccC--Ccee-eee-------------------ecCcccccccccccC-------CCCccccccCcEEEEEecC Confidence 110 11111 0000 000 000000000000000 000001111121111211 Q ss_pred -cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCC Q lcl|NC_010808. 249 -SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEG 327 (512) Q Consensus 249 -~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (512) ..++.|.|.+..+-...=--+..+.+++..++.|+.|+++.+-..+.+.++...+... ...+..+...+...+. T Consensus 211 ~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~~a~~~ek~~l~~a-----~~~~~~g~~a~~iip~ 285 (469) T protein:vir:10 211 RPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASSATDEDEVRKMAAL-----ARSVRGGINAGVGLAQ 285 (469) T ss_pred CCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCCCCCHHHHHHHHHH-----HHHHhcCCceEEEccC Confidence 1356788888887666666666888899999999999998775444444433222111 0011111122233456 Q ss_pred CcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccch-HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_010808. 328 SVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQS-GEAMKYKLFGLEQRTKTKEGLFTKGLR-RRAKL 405 (512) Q Consensus 328 ~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~S-g~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~l 405 (512) +.++++++...+...++.+++.+.+.|.+.-....++.+..+|.-+ |.. . ..-....++.-.+.+...+. ++++- T Consensus 286 ~~~ie~~ea~g~~~~~~~li~~~d~~Isk~iLG~tlTs~~~gGS~a~~~v-h--~ev~~d~~~sDa~~i~~tln~~li~~ 362 (469) T protein:vir:10 286 GQILELLGVSGNLPDIRRAIEGHDRSIALSGLAHFLNLDGKGGSYALASV-L--EDPFTQAVHAYATSICRIANQHIIED 362 (469) T ss_pred CceEEEeecCCCchHHHHHHHHHHHHHHHHHhcccccccCccchhhHHHH-H--HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7789999988888889999999999998765544444432222211 211 1 11122233333445556663 35555 Q ss_pred HHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hcc-----CChHHHHHhCCCCCCHHHHHHHHHHHHH Q lcl|NC_010808. 406 LETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGK-----ISQTTLMSLFSFFQDPELEVKKIEEDEK 478 (512) Q Consensus 406 i~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~-----~s~et~~~~~~~v~d~~~E~~ri~~E~~ 478 (512) ++.+ +. + .+..-..+.|... ..+....++.+.++ .|+ ++.+.+.+.++. ..++.+ +.+....+ T Consensus 363 l~~l-N~--g----~~~~~P~~~~~~~-e~~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gi-p~~~~~-~~~~~~~~ 432 (469) T protein:vir:10 363 LVDI-NF--G----VDTPAPVLTFDPI-GSRQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNL-PSELND-TPSAEPEE 432 (469) T ss_pred HHHh-cC--C----CCCCccEEEecCC-CCcHHHHHHHHHHHHhcCCccCccccHHHHHHHhCC-CCCCCC-cccccchh Confidence 5442 21 1 1112235667543 34556678888776 465 455677777764 322221 11111111 Q ss_pred HHHHHHHhhcccCCCCCCCCCCCCCCcCccc--CCC Q lcl|NC_010808. 479 ESIKKAQKGIYKDPRDINDDEQDDDTKDTVD--KKE 512 (512) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~e 512 (512) ..... .....+......+..+....+.. ..+ T Consensus 433 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 465 (469) T protein:vir:10 433 PAAVP---NQSAAPARTRSSGNADARARAPKADQGV 465 (469) T ss_pred cccCC---CCCccccccCCCCCcccccccCCChHHh Confidence 00000 00000000000001110111111 111 No 239 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=88.97 E-value=0.029 Score=29.01 Aligned_cols=375 Identities=10% Similarity=0.010 Sum_probs=134.3 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCCc- Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDD- 117 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d- 117 (512) .... +++..... ......+.|. . ....-...-+........|+.+++-+..-|+.+-..+ T Consensus 1 MGlf----~~~~~~~~------~~~~~~~~~~--------~-~~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~ 61 (395) T protein:vir:98 1 MGIL----DFFSFKKS------GTLSDDDSGS--------T-TSEKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEK 61 (395) T ss_pred Ccch----hhhcCCCc------ccccccccch--------h-hhhhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCC Confidence 0001 11000000 0000000000 0 0000000001223444556777776666677542221 Q ss_pred -hhHHHHHHHHHhc--c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEe Q lcl|NC_010808. 118 -KDVLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYL 191 (512) Q Consensus 118 -~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~ 191 (512) ......+..++.. | ....+...+....+.+|.||+++-.+..+. + |......+. ....+++ T Consensus 62 ~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~~~-----~-~~~~~~~~~-------~~~~~~~ 128 (395) T protein:vir:98 62 LTENQKDWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKGIY-----V-ADSFTQDKK-------ISGSQFK 128 (395) T ss_pred cccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCcee-----c-CCccccccc-------ccCcccc Confidence 1222234555532 3 234556777888999999998876553211 1 111111100 0000011 Q ss_pred eeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHH-HHHH Q lcl|NC_010808. 192 RTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLID-LYDN 270 (512) Q Consensus 192 ~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liD-a~~~ 270 (512) .+.. + ....-..|.++.+++++..... . ...+.|.+.....++. +++. T Consensus 129 ~~~~-~-----~~~~~~~~~~~evih~k~~~~~--------------------~-----~~~~~~~~~~~~~~~~~~~~~ 177 (395) T protein:vir:98 129 VSRV-Q-----GQTYEKTFTFDQVIYLKNDNSD--------------------L-----MSKVESLWEEYGELLGHVINN 177 (395) T ss_pred eeee-c-----CceeeeEecCccEEEecCCCCC--------------------c-----cccccchhhhHHHHHHHHHHH Confidence 0000 0 0000012233333333211000 0 0111222222222211 1111 Q ss_pred H-HHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeec------CCHHHH Q lcl|NC_010808. 271 A-ESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQ------YDVQGT 343 (512) Q Consensus 271 ~-~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~------~~~~~~ 343 (512) . ..........+..+...+.+......+...+..+...-........+.......+.|.+..-++.. .....+ T Consensus 178 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~~q~ 257 (395) T protein:vir:98 178 QKIANQIRFTMIPPKDKVRERAQENSDGGRQSKSDKDFFKRTVEKIRTESVVGIPVTANTNYEEYGSKNTGAVKSYVDDI 257 (395) T ss_pred HHHHHHHHHhhccccccccccccccCCcHHHHHHHHHHHHHHHhhhhcCCcceeecCCCceeEecccccccccChhHHHH Confidence 1 111111122233333333322222222111111111111001111111111223444454444321 122345 Q ss_pred HHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccc Q lcl|NC_010808. 344 EAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFN 423 (512) Q Consensus 344 ~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~ 423 (512) .+..+.....|+..-++|....+...++.+... ...+...|...++.|...++.+--... .-.. T Consensus 258 ~e~~~~~~~~Ia~~fgVP~~~l~~~~sn~e~~~---------------~~f~~~tl~P~~~~ie~~l~~kll~~~-~~~~ 321 (395) T protein:vir:98 258 KKLKDQYMAEFAEMLGIPISLLHGDIADNQKNY---------------ELLLEGPIESLITNIVDGLEYAIFDKS-ETLQ 321 (395) T ss_pred HHHHHHHHHHHHHHhCCCHHHhcCCcccHHHHH---------------HHHHHHHHHHHHHHHHHHHHHhcCChh-hhcC Confidence 555566677788888888766542212221111 123334444444444443332211110 0122 Q ss_pred eeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCC Q lcl|NC_010808. 424 TVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFF--QDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDE 499 (512) Q Consensus 424 ~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 499 (512) .+.+.|..-+..|..+.++++.++ .|+++.-.+++.++.- +++.. +++ .. .....+ T Consensus 322 g~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~~~g--D~~----------~~-~~n~~~------- 381 (395) T protein:vir:98 322 GSFIKVTGLKNYDLFSISNQADKLISSGFVFIDEVREEIGLPELPDGLG--KVL----------YM-TKNYES------- 381 (395) T ss_pred cceeeehhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC--cee----------ee-ccccee------- Confidence 345677777888999999999887 6899999998888542 22110 000 00 000000 Q ss_pred CCCCCcCcccCCC Q lcl|NC_010808. 500 QDDDTKDTVDKKE 512 (512) Q Consensus 500 ~~~~~~~~~~~~e 512 (512) -++.+++..++.| T Consensus 382 ~~~~gge~~~~~~ 394 (395) T protein:vir:98 382 VLERGGEVDEEVE 394 (395) T ss_pred cccccCCCCCCCC Confidence 0011111111111 No 240 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=88.03 E-value=0.035 Score=28.57 Aligned_cols=428 Identities=9% Similarity=-0.019 Sum_probs=163.8 Q ss_pred hhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhcc--Cc---- Q lcl|NC_010808. 38 LLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGN--PI---- 111 (512) Q Consensus 38 ~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~--~~---- 111 (512) +......+.. +..+..-..+++.+.+|..-.- ...+...........+..-+-+...++.+++-|++- |+ T Consensus 1 m~~~~~~l~~--k~~R~~~e~~w~e~a~~~lP~~--~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~W 76 (514) T protein:vir:80 1 MRQQASAMWA--EYRDSTAIRKAEDFAKFTIASL--MVDPLDKTHQAEVVEYDFQSAGAFLVNNLTAKLALTLFPPGRPS 76 (514) T ss_pred CccchHHHHH--HhhcchHHHHHHHHHHHhcccc--cCCCCCCcccccccccccchhHHHHHHHHHHHHHhhhcCCCCcc Confidence 1111222211 1111222345666666653210 000001011111112233455666777777776642 22 Q ss_pred -eecCCch-------------hH-------HHHHHHHHhccChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccc Q lcl|NC_010808. 112 -QCQDDDK-------------DV-------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAM 170 (512) Q Consensus 112 -~~~~~d~-------------~~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~ 170 (512) ++..+|+ ++ ...+...+..++|.....++.++...+|.+.+++- ++.. .++.++ . T Consensus 77 F~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~--~~~~-~~~~~p-l 152 (514) T protein:vir:80 77 FQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYRE--PGTG-KMLVWT-M 152 (514) T ss_pred cccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEe--cCCC-cEEEEE-c Confidence 2222221 12 22344456678999999999999999999876653 3222 344443 3 Q ss_pred eeEEEEeCCCCceeEEEEEEeeeeeec------------cCCcceEEEEEEEcC-----Cc----EEEEEecCCcccccc Q lcl|NC_010808. 171 STFVIYDNTIERNSIAGVRYLRTKPID------------KTDEDEVFTVDLFTS-----HG----VYRYLTSRTNGLKLT 229 (512) Q Consensus 171 ~~~~i~d~~~~~~~~~~v~~~~~~~~~------------~~~~~~~~~~~~yt~-----~~----~~~~~~~~~~~~~~~ 229 (512) .-|.+..+. .+++...+|..+..... ....+....+++|+. ++ ...|.. . .+..+ T Consensus 153 ~~y~v~~d~-~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e-~-~g~~i- 228 (514) T protein:vir:80 153 QSYTVRRTS-HGDPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVIEWQPTPNGKRCAVWHE-L-EGKRV- 228 (514) T ss_pred CeEEEeeCC-CcCeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEEEeecCCCCeEEEEEEe-c-cceee- Confidence 334554443 34555555443322100 001111122333321 11 011111 0 11111 Q ss_pred ccccccccccccccceEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhh Q lcl|NC_010808. 230 PRENGFESHSFERMPITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQK 304 (512) Q Consensus 230 ~~~~~~~~~~~~~vPvv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~ 304 (512) ......++..+|++.+ ....+|+|..+...+-+..++.+.-...........|.+.+.-........+.. T Consensus 229 ---~~es~y~~~e~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~~~~~~l~~-- 303 (514) T protein:vir:80 229 ---GPESSYPAHLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNLVDEAKGGAVDDYRD-- 303 (514) T ss_pred ---cccCccccccCCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCcccccchhhhcc-- Confidence 1111122334666543 245689999999999999998876666666665555555432111111111111 Q ss_pred hccccccchhhhhhcccccCCCCCcceeEEe--ecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHH Q lcl|NC_010808. 305 EANVLFLEPTVYENRDTGIETEGSVDGGYIY--KQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLF 382 (512) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~ 382 (512) ++ ...+..+...+++.+. ...+.......++.++..|...-..-. .. ..+.+.+|..+..... T Consensus 304 -~~------------~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFml~~-~~-rd~~rvTAtEV~~r~~ 368 (514) T protein:vir:80 304 -AE------------TGDFVPGQVGSVASYERGDYNKIAQASASVESIVMRLNRAFMYTG-QV-RDAERVTVEEIRTVAE 368 (514) T ss_pred -cC------------CceeecCCCccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhc-cC-CCCCCCCHHHHHHHHH Confidence 00 0001111223333433 234566667777777777754322111 11 1234467777765433 Q ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhc--cCCC-cccccceeeEEeCCCC-CcCHHHHHH-------HHHHHhcc Q lcl|NC_010808. 383 GLEQRTKT-KEGLFTKGLRRRAKLLETILKNT--RSID-ANKDFNTVRYVYNRNL-PKSLIEELK-------AYIDSGGK 450 (512) Q Consensus 383 ~l~~k~~~-~~~~~~~~l~~~~~li~~~l~~~--~~~~-~~~d~~~i~i~f~~~~-p~d~~~~~~-------~~~kl~g~ 450 (512) .+.+...- ..++-.+.+.-+++..+.++... +... .+.+. +.+.+.-++ +......++ .+..+++. T Consensus 369 E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~~g~lP~~p~~l--~~~~~vs~la~l~r~~~~~~l~~~~~~i~~l~~~ 446 (514) T protein:vir:80 369 EAENLLGGVYSLLAETLQAPLAYLTMYEASRGNGGMLLGIAQGV--YRPSIITGIPALTRNIETANILRATQEASAIVPA 446 (514) T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCchh--hcceeeecHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 33222111 11111222222233333333221 1111 12222 334443222 111111122 12222232 Q ss_pred CC-------hHHHHHhC---CCC------CCHHHHH---HHHHHHHHHHHHHHHhhcccCCCCCCCCC Q lcl|NC_010808. 451 IS-------QTTLMSLF---SFF------QDPELEV---KKIEEDEKESIKKAQKGIYKDPRDINDDE 499 (512) Q Consensus 451 ~s-------~et~~~~~---~~v------~d~~~E~---~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 499 (512) .| ...++..+ -++ .+++... +|.+++++......+.......+.+--.+ T Consensus 447 ~p~v~d~id~d~~~~~~a~~~Gvp~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (514) T protein:vir:80 447 LVQLSKRFDPEKLVERIFANNSVDLSTLSKDPDVVAAEAEQEAALAQQQLDVASGALAAETSAGVLTS 514 (514) T ss_pred chhhhhcCCHHHHHHHHHHHhCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccCC Confidence 22 23333332 112 2222111 11111111111111111111111111111 No 241 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=85.73 E-value=0.051 Score=27.69 Aligned_cols=416 Identities=8% Similarity=-0.061 Sum_probs=165.8 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----cccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKT-----KNLV 75 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~-----~~~~ 75 (512) ||+..+=-..+ -+.-+.+ .+.+.-+. ..+..-+..|- |.|-. +++. T Consensus 1 m~kk~~k~~~~---------~~~~~~~----~~~~~~~~---~~~~~~~~~~~-------------~~g~~~~~~~~iLr 51 (448) T protein:vir:77 1 MAKRGRKPKEL---------VPGPGSI----DPSDVPKL---EGASVPVMSTS-------------YDVVVDREFDELLQ 51 (448) T ss_pred CCCCCCCCccc---------CCccccc----chhhhhhh---ccchhhhcccc-------------cccccccchhHhhc Confidence 65554321111 0000000 00000000 00000000000 01100 0000 Q ss_pred cccccccccccceeeecchHHHHHHHHHhhhhccCceecCCc-----hhHHHHHHHHHhcc-------ChhHHHHHHHHH Q lcl|NC_010808. 76 ELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDD-----KDVLEAIEAFNDLN-------DVESHNRSLGLD 143 (512) Q Consensus 76 ~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d-----~~~~~~l~~~~~~n-------~~~~~~~~~~~~ 143 (512) .. ....-+ ... .......-.+.+....+.+.++.+...+ ....+++.+++... .|...+..+ .+ T Consensus 52 ~~-~~~~ly-~~m-~~D~hi~s~l~~Rk~av~~~~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~-ld 127 (448) T protein:vir:77 52 GK-DGLLVY-HKM-LSDGTVKNALNYIFGRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-EN 127 (448) T ss_pred cc-cchHHH-HHH-hhChHHHHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHH-HH Confidence 00 000000 000 0124455556666667778888875321 22344566665432 466666655 58 Q ss_pred HHhCCeE-EEEEEE-CCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEec Q lcl|NC_010808. 144 LSIYGKA-YELMIR-NQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTS 221 (512) Q Consensus 144 ~~~~G~a-~~~v~~-d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~ 221 (512) +..||.+ ++.+|. ..+|...+..+.+... +. ++++. |+++.-.++... T Consensus 128 a~~~G~s~~Eivw~~~~dg~~~~~~l~~r~~----------~~---~~~f~-----------------~~~~~~l~~~~~ 177 (448) T protein:vir:77 128 AYIYGMAAGEIVLTLGADGKLILDKIVPIHP----------FN---IDEVL-----------------YDEEGGPKALKL 177 (448) T ss_pred hhhhcceeEEEEEeecCCCceeeccccccCC----------Cc---cceee-----------------eecCCceEEEec Confidence 9999975 457774 4567654332221110 00 00110 011110111110 Q ss_pred CCccccccccccccccccccccceEeec--CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCCh-- Q lcl|NC_010808. 222 RTNGLKLTPRENGFESHSFERMPITEFS--NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDP-- 297 (512) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~vPvv~~~--n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~-- 297 (512) .... .......+..+.|++.+=+.... -++.|.|.+..+.-..=--+..+.+++..++.|+.|+++.+-..+.+. T Consensus 178 ~~~~-~~~~~~~~~~~lP~~~~i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~ 256 (448) T protein:vir:77 178 SGEV-KGGSQFVNGLEIPIWKTVVFLHNDDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGT 256 (448) T ss_pred CCcc-cccccCCCccccccceEEEEecCCcCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCCCCCCCH Confidence 0000 00000011122234432111111 245678888876666555667778899999999999998774333222 Q ss_pred hhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHH Q lcl|NC_010808. 298 DEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAM 377 (512) Q Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai 377 (512) .+...... ....+..+...+...+.+.++++++.......+..+++.+.+.|...-..--++.+..+ ..++.+. T Consensus 257 ~~~~~l~~-----av~~i~~g~~a~~iiP~g~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~-g~~~~~~ 330 (448) T protein:vir:77 257 KQWEAAKE-----IVKNFVQKPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARALGIDFNTVQLNM-GVQAVNI 330 (448) T ss_pred HHHHHHHH-----HHHHHhcCCceEEEecCCceEEEEecCCCccCHHHHHHHHHHHHHHHHhcccccccccc-chhhhhh Confidence 12111110 00011111222233456678889987766566777888888888776544333333322 2222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHH Q lcl|NC_010808. 378 KYKLFGLEQRTKTKEGLFTKGLR-RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTL 456 (512) Q Consensus 378 ~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s~et~ 456 (512) .....-....+..-.+.+...+. ++++-++. ++ .+. +..-..+.|...-+.|..+.++.+.++.+.+ T Consensus 331 ~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~-lN--fg~----~~~~P~~~f~~~e~eDl~~~a~~~~~l~~~~----- 398 (448) T protein:vir:77 331 GEFVSLTQQTIISLQREFASAVNLYLIPKLVL-PN--WPG----ATRFPRLTFEMEERNDFSAAANLMGMLINAV----- 398 (448) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hc--CCC----CCCCCEEEecCCChhhHHHHHHHhHHHHHHH----- Confidence 21111111112223333444443 34444443 22 111 1112357788888889888898888876532 Q ss_pred HHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 457 MSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 457 ~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) .+.++ +..+..+ ..+ .......+... ..+++.+...+..+.. T Consensus 399 ~~~~~-ip~~~~~----------~~~--~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 440 (448) T protein:vir:77 399 KDSED-IPTELKA----------LID--ALPSKMRRALG-VVDEVREAVRQPADSR 440 (448) T ss_pred HHHhc-CCccCCc----------CCC--CCchhcccccC-CCCCCCchhhcchhhH Confidence 12221 1110000 000 00000000011 1111111111111111 No 242 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=84.43 E-value=0.061 Score=27.26 Aligned_cols=362 Identities=12% Similarity=0.030 Sum_probs=146.5 Q ss_pred eeecccchhHHhhhcHHHHHHHHHHHHHHHHHHH---HHHHHHhcccccccccccccccc-------------------- Q lcl|NC_010808. 27 VVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRL---KVLSDYYEGKTKNLVELTRRKEE-------------------- 83 (512) Q Consensus 27 ~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~---~~~~~yy~G~~~~~~~~~~~~~~-------------------- 83 (512) +.| .+ -|..+ ...+++ .-..+|..|+.+........... T Consensus 1 ~~~--~~----------~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 62 (409) T protein:vir:83 1 MGF--WS----------NLFGI------PSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESW 62 (409) T ss_pred Cch--hh----------hhccc------ccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccc Confidence 110 00 00000 000000 01113333332222111110000 Q ss_pred ----ccc-cee--eecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHh--ccC---hhHHHHHHHHHHHhCCeEE Q lcl|NC_010808. 84 ----YMA-DNR--VAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFND--LND---VESHNRSLGLDLSIYGKAY 151 (512) Q Consensus 84 ----~~~-~~r--i~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~--~n~---~~~~~~~~~~~~~~~G~a~ 151 (512) +.. ..+ +.+......|+.+++-+..-|+.+--..... +.+..++. -|. ...+...+..+ +..|.+| T Consensus 63 ~~~~~~~~t~~~~~~~~~v~acV~~Ia~~iA~lpl~~~~~~~~~-~~~~~ll~~~PN~~~t~~~f~~~l~~~-lllGnay 140 (409) T protein:vir:83 63 ATPSWGSAQDKLRTLIDVAWACIDLNASVLSSMPIYRMRNGRII-DSVAWMSNPDPEVYTSWQEFAKQLFWD-FQLGEAF 140 (409) T ss_pred cccCccccchhhHhhhHHHHHHHHHHHHhhccCceEEeeCCccc-cchhhhcccCCCCCCCHHHHHHHHHHH-HhhCCcE Confidence 000 000 1122334456666666666666542111111 11222222 122 22333334444 4458898 Q ss_pred EE-EEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCcccccc Q lcl|NC_010808. 152 EL-MIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLT 229 (512) Q Consensus 152 ~~-v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~ 229 (512) ++ +..+.+|.+ .+..++|..+.+.++... . .+|.+. ..+.++ T Consensus 141 ~~~i~r~~~G~~~~L~pl~p~~v~v~~~~~g--~-----~~y~~~-------------~~~~~~---------------- 184 (409) T protein:vir:83 141 VLPMAHGSDGYPIRFRVVPPWLVNVELKKGA--R-----REYRIG-------------GLNVTD---------------- 184 (409) T ss_pred EEEEEECCCCcEEEEEEECCcceEEEEcCCc--e-----EEEEEc-------------cccCcc---------------- Confidence 76 457888875 467788887766554321 1 111110 001111 Q ss_pred ccccccccccccccceEeecC-----CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhh Q lcl|NC_010808. 230 PRENGFESHSFERMPITEFSN-----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQK 304 (512) Q Consensus 230 ~~~~~~~~~~~~~vPvv~~~n-----~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~ 304 (512) +|++++. ...|.|.++.+...++..+....-..+.+.-.+.|-.+++-....+++.....+ T Consensus 185 --------------eiiHir~~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~nga~p~gil~~~~~ls~e~~~~~~ 250 (409) T protein:vir:83 185 --------------EILHIRYQGNTADAHGHGPLESAAPRQVVIGLLQKYVQNLAETGGVPLYWLGVERRLSETEAVDLM 250 (409) T ss_pred --------------ceEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEeecCCCCCHHHHHHHH Confidence 2333321 224778787777777766655444444445556677776544444444444433 Q ss_pred hccccccchhhhhhcccccCCCCCcce-eEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchH--HHHHHHH Q lcl|NC_010808. 305 EANVLFLEPTVYENRDTGIETEGSVDG-GYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSG--EAMKYKL 381 (512) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg--~Ai~~~~ 381 (512) +.-.-... .+.....-..++.+. +.++-......+.+..+.....|...-++|.+..+....+.|+ ..++... T Consensus 251 ~~~~~~~~----~nag~~~il~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp~llg~~~~~~~~tysn~eq~~ 326 (409) T protein:vir:83 251 DRWIESRS----KYAGHPALVTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVPPFLVGLPGATGSLTYSNIEQLF 326 (409) T ss_pred HHHHHhhC----CccCccceecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCHHHccCCCCccccccccHHHHH Confidence 32110000 011111112233332 2222221222344445566778888889997766532211111 1111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHh Q lcl|NC_010808. 382 FGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSL 459 (512) Q Consensus 382 ~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~ 459 (512) ...+...|..++..|...++..-... ...+++.+..-+-.|..+.++++.++ +|+++.-.++++ T Consensus 327 ----------~~f~~~tL~P~~~~ie~~l~~~Ll~~----~~~~~f~~~~llr~d~~~r~~~~~~~~~~G~lT~NE~R~~ 392 (409) T protein:vir:83 327 ----------SFHDRSSLRPKATAVMAALDRWALPS----PQHLELNRDDYTRPSLVERATAYKIMIEAGVMEPNEARAM 392 (409) T ss_pred ----------HHHHHHHHHHHHHHHHHHHHHhhCCC----CcEEEeehhhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 11122233333333333333211111 11244444455567788888888776 688888777776 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCC Q lcl|NC_010808. 460 FSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDD 498 (512) Q Consensus 460 ~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 498 (512) .+.-.. .+.+.-...+. T Consensus 393 ~glpp~----------------------~ggd~l~~~gv 409 (409) T protein:vir:83 393 ERLHSE----------------------AAAVRLSGGGV 409 (409) T ss_pred hCCCCC----------------------CCCcccCCCCC Confidence 543110 00000011111 No 243 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=83.43 E-value=0.069 Score=26.97 Aligned_cols=360 Identities=11% Similarity=-0.004 Sum_probs=142.0 Q ss_pred cchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee Q lcl|NC_010808. 11 TDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) Q Consensus 11 ~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri 90 (512) |+ +.+++|+++.-..+..... .+. . . . ...-+ T Consensus 1 Mg---~f~~l~~~~~~~~~~~~~~-------------------------------~~~--~-~-~----------~~~~l 32 (376) T protein:vir:78 1 MG---FFSELFKRNKEIEWMWDLD-------------------------------FLE--D-K-T----------TKVYL 32 (376) T ss_pred Cc---hhhhhhccCCccccccchh-------------------------------hcc--c-c-c----------hhhhh Confidence 44 3456776643322111110 000 0 0 0 00001 Q ss_pred ecchHHHHHHHHHhhhhccCceecCCchhHHHHHHHHHh--cc---ChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEE Q lcl|NC_010808. 91 AHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFND--LN---DVESHNRSLGLDLSIYGKAYELMIRNQDDETRLY 165 (512) Q Consensus 91 ~~n~~~~iv~~~a~~l~g~~~~~~~~d~~~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~ 165 (512) ........|+.+++-+..-|+.+-..+......+..++. -| ....+...+..+.+.+|.||+++..+..|.+.- T Consensus 33 ~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~- 111 (376) T protein:vir:78 33 KKMALNTCVKHIARTIAKSDFRLKNGETSVRDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIAD- 111 (376) T ss_pred hhHHHHHHHHHHHHhhcccceeeccccccccchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeecc- Confidence 112334556666666666666553333333334444443 13 345566778888889999999988776654311 Q ss_pred EEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccce Q lcl|NC_010808. 166 KSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPI 245 (512) Q Consensus 166 ~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPv 245 (512) .+|+.... ....+++.+...+ ......+.++.+++++. +..| T Consensus 112 ------~~~~~~~~-----~~~~~~~~~~~~~------~~~~~~~~~~evih~~~--------------------~~~~- 153 (376) T protein:vir:78 112 ------SYVRKEFA-----FFPDVFEGVTVKD------YRYNRNFSMDDVIFLEY--------------------GNER- 153 (376) T ss_pred ------ceeecccc-----eeeeeeeeeeeec------ceeeeeeccccEEEecc--------------------CCCC- Confidence 11221100 0001111110000 00011223333333321 0011 Q ss_pred EeecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHH--hcCceeeeecCCcCChhhhhhhhhccccccchhhhhhccccc Q lcl|NC_010808. 246 TEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSD--LNDAMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGI 323 (512) Q Consensus 246 v~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~--~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (512) +.+-. .+++..+..+.......... ...+.+++......+++..+..+..-.-... .......... T Consensus 154 --------~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~-g~~~~~~~v~ 221 (376) T protein:vir:78 154 --------LSAFT---DGMFEDYGELFGKMIRAQMRNFQIRGAVNFKMAGVADKDKQTKLQEYIDKVYA-SFNNNEIAIV 221 (376) T ss_pred --------chhhh---hHHHHHHHHHHHHHHHHHHhcCCCceeEEEccCCCCCHHHHHHHHHHHHHHhc-cccccCcceE Confidence 11111 12223333333333222222 2234444432222333333322221110100 0000111112 Q ss_pred CCCCCcceeEEeecC-----CHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 324 ETEGSVDGGYIYKQY-----DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKG 398 (512) Q Consensus 324 ~~~~~~~~~~l~~~~-----~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 398 (512) ..+++.+...++... ....+.+..+.....|+..-++|....+...++.+... ...+... T Consensus 222 ~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~~~s~~e~~~---------------~~f~~~~ 286 (376) T protein:vir:78 222 PQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHGDMADLSNNM---------------KAYMEYC 286 (376) T ss_pred EcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCHHHHH---------------HHHHHHH Confidence 234555555544322 12345666677778888888898766542222222111 2233444 Q ss_pred HHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC--CHHHHHHHHH Q lcl|NC_010808. 399 LRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQ--DPELEVKKIE 474 (512) Q Consensus 399 l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~--d~~~E~~ri~ 474 (512) |..++..|...++.+--... + ..+...+..-+-.|..+.++++.++ .|+++.-.+++.++.-. ++.. ++ T Consensus 287 l~P~~~~ie~~l~~kll~~~--~-~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~~--d~-- 359 (376) T protein:vir:78 287 IDPLTKKLEDELNAKLFTFS--E-FLAGEHIKIIHKKDIIENAEAVDKLVASGSFNRNEVRELLGAERVDNPEL--DK-- 359 (376) T ss_pred HHHHHHHHHHHHHhhhCCcc--c-ceecccchhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC--ce-- Confidence 55555444444443211111 1 1122233334456888889998887 68899988888875422 1110 00 Q ss_pred HHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCC Q lcl|NC_010808. 475 EDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 511 (512) Q Consensus 475 ~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (512) .... .+...-+ +..++| T Consensus 360 --------~~~~-----~n~~~~~-------~~~e~g 376 (376) T protein:vir:78 360 --------YLIT-----KNYQSAD-------EGGEDG 376 (376) T ss_pred --------eeec-----cCceehh-------ccccCC Confidence 0000 0000000 001111 No 244 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=83.11 E-value=0.071 Score=26.88 Aligned_cols=348 Identities=12% Similarity=0.054 Sum_probs=133.7 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeee--cchHHHHHHHHHhhhhccCcee-c- Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVA--HDYASYISDFINGYFLGNPIQC-Q- 114 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~--~n~~~~iv~~~a~~l~g~~~~~-~- 114 (512) ... ......+..+... ....... ......+. ......+|+.+|+-+..-|+.+ . T Consensus 1 Mg~------------------f~~~~~~~~~~~~--~~~~~~~--~~~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~ 58 (378) T protein:vir:94 1 MNL------------------FGKVVSFSRGKLN--NDTQRVT--AWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKY 58 (378) T ss_pred CCc------------------cccchhccccccc--CCcceee--eeccchhHHHHHHHHHHHHHHHhhhhhCceeeEEE Confidence 000 0001011111000 0000000 00001111 1244556677777666677653 1 Q ss_pred -CCc-------hhHHHHHHHHHhc--c---ChhHHHHHHHHHHHhCCeEEEEEE-ECCCCceEEEEEccceeEEEEeCCC Q lcl|NC_010808. 115 -DDD-------KDVLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMI-RNQDDETRLYKSDAMSTFVIYDNTI 180 (512) Q Consensus 115 -~~d-------~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~-~d~~g~~~i~~~~p~~~~~i~d~~~ 180 (512) ..+ ......+.+++.. | ....+...+..+++.+|.+|+++. .+..|++.. +-|. .. T Consensus 59 ~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~--l~p~--------~~ 128 (378) T protein:vir:94 59 KKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELLD--LLFA--------DD 128 (378) T ss_pred cccCcccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEEE--EEec--------CC Confidence 111 0112345666542 3 234666778888999999998753 333333211 1010 00 Q ss_pred CceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHH Q lcl|NC_010808. 181 ERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEK 260 (512) Q Consensus 181 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~ 260 (512) .+ -|.++.+++++. ++ +-..|.|. T Consensus 129 ----------------------~~----~~~~~diiH~~~------------------~~---------~~~~g~s~--- 152 (378) T protein:vir:94 129 ----------------------KK----EYKPEELVRLTS------------------PF---------YINEDTSI--- 152 (378) T ss_pred ----------------------ee----EeeeeeeEEecC------------------cC---------CccchhHH--- Confidence 00 011222333210 00 00112232 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccch-hhhhhcccccCCCCCcceeEEeecCC Q lcl|NC_010808. 261 VITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEP-TVYENRDTGIETEGSVDGGYIYKQYD 339 (512) Q Consensus 261 v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~~~ 339 (512) +..+..++....+. +.+-.+++-....+++..+..++.-.-.+.. ....+.......+++.+++.++.... T Consensus 153 l~~~~~~i~~~~~~--------~~~~gil~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~ 224 (378) T protein:vir:94 153 LDNALASIQTKLEQ--------GKLRGLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYS 224 (378) T ss_pred HHHHHHHHHHHHhc--------ccccceeeeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEEccCChh Confidence 33333333332221 2222222211112222222222111101000 00001111223345556655554333 Q ss_pred HHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-- Q lcl|NC_010808. 340 VQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID-- 417 (512) Q Consensus 340 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~-- 417 (512) ...+ ...+.+.+.|+..-++|..... +..|.. .....+...|..+++.|...+..+-..+ T Consensus 225 ~~~~-~~~~~~~~~Ia~~fgVP~~~l~---~~~se~--------------~~~~f~~~tL~P~~~~ie~~l~~~Ll~~~e 286 (378) T protein:vir:94 225 VLNK-DEIDLIKSELLTGYFMNENILL---GTASQE--------------QQIYFYNSTIIPLLIQLEKELTYKLISTNR 286 (378) T ss_pred hhhH-HHHHHHHHHHHHHhCCCHHHhc---CChHHH--------------HHHHHHHHHHHHHHHHHHHHHHhhcCChhH Confidence 3333 3456677788888888865442 211111 1123455566666665555554322111 Q ss_pred -----cccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC--CHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_010808. 418 -----ANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQ--DPELEVKKIEEDEKESIKKAQKGI 488 (512) Q Consensus 418 -----~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~--d~~~E~~ri~~E~~~~~~~~~~~~ 488 (512) .......+.+.+..-.-.|..+.++++.++ .|+++.-.++++++.-. +.+.=+ + ..... . T Consensus 287 r~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD~~~--~--------~~n~~-~ 355 (378) T protein:vir:94 287 RRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYI--A--------NLNAV-A 355 (378) T ss_pred hhhhhhcccccceeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeee--e--------ccccc-c Confidence 011122344555566677888999998887 68999988888876432 211000 0 00000 0 Q ss_pred ccCCCCCCCCCCCCCCcCcccCC Q lcl|NC_010808. 489 YKDPRDINDDEQDDDTKDTVDKK 511 (512) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~ 511 (512) .......+..+.+...+++.+++ T Consensus 356 ~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 356 VKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred cccchhhcCCcCCCCCCCCCCCC Confidence 00011111111122222222222 No 245 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=82.81 E-value=0.074 Score=26.79 Aligned_cols=350 Identities=12% Similarity=0.039 Sum_probs=135.0 Q ss_pred hhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceee--ecchHHHHHHHHHhhhhccCceec-- Q lcl|NC_010808. 39 LQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV--AHDYASYISDFINGYFLGNPIQCQ-- 114 (512) Q Consensus 39 ~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri--~~n~~~~iv~~~a~~l~g~~~~~~-- 114 (512) ......+. .+-.+.-. ....... ......+ ........|+.+++-+..-|+++- T Consensus 1 Mg~f~~~~------------------~f~~~~~~--~~~~~~~--~~~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~ 58 (378) T protein:vir:93 1 MNLFGKVV------------------SFSRGKLN--NDTQRVT--AWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKY 58 (378) T ss_pred Cccchhhh------------------hhhccccC--CCcceee--ecccchhHHHHHHHHHHHHHHHhhhhhCceeeEEE Confidence 00011100 10000000 0000000 0000001 112344456677777767777541 Q ss_pred -CCc---h----hHHHHHHHHHhc--c---ChhHHHHHHHHHHHhCCeEEEEEEECC-CCceEEEEEccceeEEEEeCCC Q lcl|NC_010808. 115 -DDD---K----DVLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQ-DDETRLYKSDAMSTFVIYDNTI 180 (512) Q Consensus 115 -~~d---~----~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~-~g~~~i~~~~p~~~~~i~d~~~ 180 (512) ..+ + .....|..++.. | ....+...+..+.+.+|.+|+++..+. .|++.. +|... T Consensus 59 ~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~~----------l~~~~- 127 (378) T protein:vir:93 59 KKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLD----------LLFAD- 127 (378) T ss_pred cccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEE----------EEecC- Confidence 110 0 122345666642 3 234666678889999999998754432 222211 11000 Q ss_pred CceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHH Q lcl|NC_010808. 181 ERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEK 260 (512) Q Consensus 181 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~ 260 (512) . . + .|.++.++++ ++.-.+...... T Consensus 128 ------------------~---~---~-~~~~~diih~------------------------------r~~~~~~~~~s~ 152 (378) T protein:vir:93 128 ------------------D---K---K-EYKTEELVRL------------------------------TSPFYINEDTSI 152 (378) T ss_pred ------------------C---e---e-EeccceeEEe------------------------------cCccccchhhHH Confidence 0 0 0 1122233332 211111111222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccch-hhhhhcccccCCCCCcceeEEeecCC Q lcl|NC_010808. 261 VITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEP-TVYENRDTGIETEGSVDGGYIYKQYD 339 (512) Q Consensus 261 v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~~~ 339 (512) +..+..++....+ .+.+--+++.....+++.....++.-.-.+.. ............+.+.+++.++.... T Consensus 153 l~~~~~~i~~~~~--------~~~~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~ 224 (378) T protein:vir:93 153 LDNALASIQTKLE--------QGKLRGLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYS 224 (378) T ss_pred HHHHHHHHHHHHh--------cCcccceeeeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChh Confidence 3333333332221 12232333221122222222222111111000 00001111223345556655554433 Q ss_pred HHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc- Q lcl|NC_010808. 340 VQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA- 418 (512) Q Consensus 340 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~- 418 (512) ...+ ...+.+.+.|+..-++|..... +..|.. .....+...|..+++.|...+..+-..+. T Consensus 225 ~~~~-~~~~~~~~~Ia~~fgVPp~~l~---g~~~e~--------------~~~~f~~~tl~P~~~~ie~~l~~kLl~~~e 286 (378) T protein:vir:93 225 VLNK-DEIDLIKSELLTGYFMNENILL---GTATQE--------------QQIYFYNSTIIPLLIQLEKELTYKLISTNR 286 (378) T ss_pred hhhH-HHHHHHHHHHHHHhCCCHHHhc---CCcHHH--------------HHHHHHHHHHHHHHHHHHHHHHhhcCChhH Confidence 3333 4456677888888888864432 211111 11234455666666666665543221110 Q ss_pred ------ccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_010808. 419 ------NKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYK 490 (512) Q Consensus 419 ------~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~ 490 (512) ......+.+.+..-.-.|..+.++++.++ +|+++.-.++++++.-.-+. -+.+. ...... +.. T Consensus 287 r~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~g--gD~~~------~~~n~~-~~~ 357 (378) T protein:vir:93 287 RRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEG--GDVYI------ANLNAV-AVK 357 (378) T ss_pred hhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--CCeee------eccccc-ccc Confidence 01112344555666778888999998887 68999998888886532110 00000 000000 000 Q ss_pred CCCCCCCCCCCCCCcCcccCC Q lcl|NC_010808. 491 DPRDINDDEQDDDTKDTVDKK 511 (512) Q Consensus 491 ~~~~~~~~~~~~~~~~~~~~~ 511 (512) ........+..+..+++.+++ T Consensus 358 ~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:93 358 NLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred chhhhcCccCCCCCCCCCCCC Confidence 011111111222222233333 No 246 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=80.59 E-value=0.093 Score=26.23 Aligned_cols=350 Identities=12% Similarity=0.042 Sum_probs=135.4 Q ss_pred HHHHHHHHHHhcccccccccccccccccccceee--ecchHHHHHHHHHhhhhccCcee-c--CCc-------hhHHHHH Q lcl|NC_010808. 57 RPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV--AHDYASYISDFINGYFLGNPIQC-Q--DDD-------KDVLEAI 124 (512) Q Consensus 57 ~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri--~~n~~~~iv~~~a~~l~g~~~~~-~--~~d-------~~~~~~l 124 (512) ..-..+...+-.+.-.- ....... .....+ ........|+.+++-+..-|+.+ . ..+ ......| T Consensus 1 Mg~f~~~~~~~~~~~~~--~~~~~~~--~~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~~~~~~~~~l 76 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLNN--DTQRVTA--WQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDL 76 (378) T ss_pred CccchhhhhhhcccccC--Ccceeee--cccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEcccccccccccccccchH Confidence 00011111111110000 0000000 000001 12334455666666666667653 1 110 1122345 Q ss_pred HHHHhc--c---ChhHHHHHHHHHHHhCCeEEEEEEECC-CCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeecc Q lcl|NC_010808. 125 EAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQ-DDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDK 198 (512) Q Consensus 125 ~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~-~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~ 198 (512) .+++.. | ....+...+..+.+.+|.+|++..++. .|++. .+-|.. . T Consensus 77 ~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~--~l~~~~--------~------------------ 128 (378) T protein:vir:16 77 DEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELL--DLLFAD--------D------------------ 128 (378) T ss_pred HHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEE--EEEecC--------C------------------ Confidence 666542 2 345666778888999999998754432 12221 110000 0 Q ss_pred CCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 199 TDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANY 278 (512) Q Consensus 199 ~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~ 278 (512) . ..|.++.++++ ++.-.+......+..+.++++..++ T Consensus 129 ----~----~~~~~~diih~------------------------------r~~~~~~~~~s~l~~~~~~i~~~~~----- 165 (378) T protein:vir:16 129 ----K----KEYKPEELVRL------------------------------TSPFYINEDTSILDNALASIQTKLE----- 165 (378) T ss_pred ----e----eEecccceEEe------------------------------cCccCccchhHHHHHHHHHHHHHHh----- Confidence 0 01122333333 2111111122233334444433221 Q ss_pred HHHhcCceeeeecCCcCChhhhhhhhhccccccch-hhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 279 MSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEP-TVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMF 357 (512) Q Consensus 279 ~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~ 357 (512) .+.+-.+++.....+++..+..++.-.-.+.. ....+.......+++.+++.++.......+ ...+.+.+.|+.. T Consensus 166 ---~~~~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~ 241 (378) T protein:vir:16 166 ---QGKLRGLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTG 241 (378) T ss_pred ---cCccceeeEeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHHHH Confidence 12222233222222222222222111111000 000011112233455566655544333333 3456677888888 Q ss_pred hcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-------cccccceeeEEeC Q lcl|NC_010808. 358 TNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID-------ANKDFNTVRYVYN 430 (512) Q Consensus 358 s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~-------~~~d~~~i~i~f~ 430 (512) -++|..... +..+.. .....+...|..+++.|...+..+-..+ .......+.+.+. T Consensus 242 fgVPp~~l~---g~~~e~--------------~~~~f~~~tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~ 304 (378) T protein:vir:16 242 YFMNENILL---GTASQE--------------QQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQ 304 (378) T ss_pred hCCCHHHhc---CCchHH--------------HHHHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccc Confidence 888865432 211111 1113445556666655555554321110 0111223455556 Q ss_pred CCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcc Q lcl|NC_010808. 431 RNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTV 508 (512) Q Consensus 431 ~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (512) .-.-.|..+.++++.++ .|+++.-.++++++.-.-+. -+++.- ....... .........+.+...+++. T Consensus 305 ~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~g--gD~~~~------~~n~~~~-~~~~~~~~~~~~~~~~~e~ 375 (378) T protein:vir:16 305 LFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEG--GDVYIA------NLNAVAV-KNLSDLQGSRKDVTSTDET 375 (378) T ss_pred hhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--CCeEee------ccccccc-cchhhhcCccCCCCCCCCC Confidence 66778888999988887 68999999888886422100 000000 0000000 0001111111222222222 Q ss_pred cCC Q lcl|NC_010808. 509 DKK 511 (512) Q Consensus 509 ~~~ 511 (512) ++| T Consensus 376 ~ne 378 (378) T protein:vir:16 376 NNQ 378 (378) T ss_pred CCC Confidence 222 No 247 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=79.64 E-value=0.1 Score=26.01 Aligned_cols=462 Identities=12% Similarity=0.069 Sum_probs=170.6 Q ss_pred CCcceeecccc--chhhccccccCCCc---CeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Q lcl|NC_010808. 1 MLKANEFETDT--DLRENRNYLFNDEA---NVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLV 75 (512) Q Consensus 1 ~~~~~~~~~~~--~~~~~~~~~f~~~~---~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~ 75 (512) |+.---|...- ..-....++-+++. ...+...+-..-... +-.-+ +.......+|+.+..+++=+.. T Consensus 1 m~~lfgf~~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~----~~~~~-~~~~eLI~~YR~ma~~pEvd~A--- 72 (558) T protein:vir:10 1 MAKLFGFSIEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVD----IEGAY-RSEYDLIRRYREMALHPEADGA--- 72 (558) T ss_pred CcchhcchhhhhhhhccCCccccCCCccccccceeccceeeeeec----ccchh-hhHHHHHHHHHHHhhccchhhH--- Confidence 66544443311 11111112212211 111111111100000 00000 0011222344444433332221 Q ss_pred cccccccccccceeeecchHHHHHHHHH-hhhhccCceecCCc--------hhHHHHHHHHHhccChhHHHHHHHHHHHh Q lcl|NC_010808. 76 ELTRRKEEYMADNRVAHDYASYISDFIN-GYFLGNPIQCQDDD--------KDVLEAIEAFNDLNDVESHNRSLGLDLSI 146 (512) Q Consensus 76 ~~~~~~~~~~~~~ri~~n~~~~iv~~~a-~~l~g~~~~~~~~d--------~~~~~~l~~~~~~n~~~~~~~~~~~~~~~ 146 (512) ...||+..+ .-...+|+.+..++ +...+.++.+++--+|+....+..+.+.+ T Consensus 73 -------------------v~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYV 133 (558) T protein:vir:10 73 -------------------IEDVVNEAIVSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYV 133 (558) T ss_pred -------------------HHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhee Confidence 111222111 11223344443322 23344556666666899999999999999 Q ss_pred CCeEEEEEEECCC----CceEEEEEccceeEEEEeCCCC---ceeEEEEEEeeeeeeccC-CcceEEEEEEEcCCcEEEE Q lcl|NC_010808. 147 YGKAYELMIRNQD----DETRLYKSDAMSTFVIYDNTIE---RNSIAGVRYLRTKPIDKT-DEDEVFTVDLFTSHGVYRY 218 (512) Q Consensus 147 ~G~a~~~v~~d~~----g~~~i~~~~p~~~~~i~d~~~~---~~~~~~v~~~~~~~~~~~-~~~~~~~~~~yt~~~~~~~ 218 (512) -|+.|.+...|.+ |-..+..++|..+-.|..-... .+....++. .-+. ........-+|.+...+.. T Consensus 134 DgRiyfHKiid~k~pk~GI~ELr~lDPr~i~~Vr~i~~~~~~~~~~~~~~~-----~~~~~~~~~~~eyy~Y~~~~~~~~ 208 (558) T protein:vir:10 134 DGRVFYLKVIDTKNPQEGIQDLRYIDPLKIKFIRQEKRKPGNQDPAIRVRS-----EQDVVPNPEFEEFYIYTPKVQHPT 208 (558) T ss_pred eeEEEEEEEEeCCCccccceeeeeeCcccceeeeeeccccccccceeeeec-----ccceeeccceeEeeeecCCccccc Confidence 9999999988743 6677889999998776542111 111111110 0000 0001111123444322221 Q ss_pred EecCCccccccccccccccccccccc--eEeec-------CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeee Q lcl|NC_010808. 219 LTSRTNGLKLTPRENGFESHSFERMP--ITEFS-------NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI 289 (512) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~vP--vv~~~-------n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~ 289 (512) ...+ . . + ..+++ +|| .|.|. |...-.|-+...+.-.+.+ +++-|.+...+..+.|-+=+ T Consensus 209 ~~~~--~--~-----~-~~~~v-kI~~dAI~y~hSGL~d~~~~~i~syLhkAIKp~NQL-kmlEDAlVIYRitRAPERRv 276 (558) T protein:vir:10 209 GMVG--Q--M-----G-GKNSI-KIAKDSITMCTSGLVDRNKNRVLSYLHKAIKALNQL-RMIEDSLVIYRLSRAPERRI 276 (558) T ss_pred ccce--e--e-----c-CCCce-eechhheeeecccceecCCCeeeecchHhhHhHHhh-HHHHhhHHHHhhhccccceE Confidence 1100 0 0 0 00111 122 11121 1111123333322222211 12333333334444443311 Q ss_pred ----ecCCcCC----------------------hhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHH Q lcl|NC_010808. 290 ----KGNLSLD----------------------PDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGT 343 (512) Q Consensus 290 ----~g~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 343 (512) .|..+.. ..+++..+....+ ++.-...-+. .+.+..+..|-...+...+ T Consensus 277 FYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msM-lEDyWLpRRe----GgrgTEItTLpGgqnLgem 351 (558) T protein:vir:10 277 FYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRDDRKFMSM-MEDFWLPRRE----GGRGTEITTLPGGQNLGEL 351 (558) T ss_pred EEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhh-HhhhcccccC----CCCccceeeccccCCcchH Confidence 1221111 1111111111111 1111111110 1122234344333332222 Q ss_pred HHHHHHHHHHHHHHhcccccccccccc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccc Q lcl|NC_010808. 344 EAYKDRLNSDIHMFTNTPNMKDDNFSG-TQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKD 421 (512) Q Consensus 344 ~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d 421 (512) .-+.-+.+.++..-++|-.-.+.-++ +. -+..|---+.....-+.+.+..|..-+.++++.=+-+-+.+..-+++.- T Consensus 352 -~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i 430 (558) T protein:vir:10 352 -SDVDYFQKKLYRALGVPESRIAAEGGFNLGRSSEILRDELKFAKFVGRLRKRFAAMFNDMLKTQLVLKNIVTPEDWKTM 430 (558) T ss_pred -HHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHH Confidence 22344445566666666433222111 11 1223444444455566777777777777766543222222222222222 Q ss_pred cceeeEEeCCCCCcCHHHHHH-------HHHHHh---c-cCChHHHHHhCCCCCCH--HHHHHHHHHHHHHHHHH---HH Q lcl|NC_010808. 422 FNTVRYVYNRNLPKSLIEELK-------AYIDSG---G-KISQTTLMSLFSFFQDP--ELEVKKIEEDEKESIKK---AQ 485 (512) Q Consensus 422 ~~~i~i~f~~~~p~d~~~~~~-------~~~kl~---g-~~s~et~~~~~~~v~d~--~~E~~ri~~E~~~~~~~---~~ 485 (512) ...|.+.|...-.-.+...++ ++..+. | .+|.+++++.+-..+|. .++-++|++|..+..-. +. T Consensus 431 ~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~ 510 (558) T protein:vir:10 431 EDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQI 510 (558) T ss_pred hhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCcccc Confidence 246777785544433443333 333343 3 47999999987555543 34556676665432110 11 Q ss_pred hhcccCCCCCCCCCCCCC---------------CcCcccCCC Q lcl|NC_010808. 486 KGIYKDPRDINDDEQDDD---------------TKDTVDKKE 512 (512) Q Consensus 486 ~~~~~~~~~~~~~~~~~~---------------~~~~~~~~e 512 (512) +...+++.+.+.+..-+. ..+..-+.| T Consensus 511 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (558) T protein:vir:10 511 DPITGEPLPQEGDPAMEGMGEQPVDPDLEAQAQAVDAQYSKD 552 (558) T ss_pred ChhhccccCccCCchhccCCCCCcccccccchhhhhhhhhhh Confidence 111111111110100000 111111111 No 248 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=77.83 E-value=0.12 Score=25.62 Aligned_cols=430 Identities=11% Similarity=0.017 Sum_probs=153.6 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) ||.++.=.... .+..|.+..+.-... ..-.+|+-.-+++. .... T Consensus 1 ~~~~~~~~~gl-----------------------------~p~rl~~i~~~~~~~------~~~~~~~~~~~~Lr-~~~~ 44 (488) T protein:vir:95 1 MADITETQESL-----------------------------PPFRMGEVGSLGLKV------KNGRIYEEPRQALR-FPES 44 (488) T ss_pred CCCccccCCCC-----------------------------CHHHHHHHHHHhhcc------ccchhhccchhhhc-ccch Confidence 33222211111 122233332211000 00012211001110 0000 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecCCc---h-----hHHHHHHHHHhcc--ChhHHHHHHHHHHHhCCeE Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDD---K-----DVLEAIEAFNDLN--DVESHNRSLGLDLSIYGKA 150 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d---~-----~~~~~l~~~~~~n--~~~~~~~~~~~~~~~~G~a 150 (512) -.-+. ..+ ......-.+.+....+.+.++.+...+ + +..+.+++++..- .|...+..+ .++.-+|.+ T Consensus 45 ~~ly~-~m~-~D~hi~s~l~~Rk~av~~~~w~v~p~~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~-lda~~~G~s 121 (488) T protein:vir:95 45 IKTFQ-LMM-RDPAVAASVNIIKMFVRKVNWRFVPPKGKEQDPKMLERADFFNSLMDDMEHDWADFINSV-MSFCTYGFC 121 (488) T ss_pred HHHHH-HHh-hChHHHHHHHHHHHHHhcCCceEecCCCCchhHHHHHHHHHHHHHHhccCccHHHHHHHH-HHhhcccce Confidence 00000 000 135566667777777888888775321 1 1235677777543 355555555 478889974 Q ss_pred -EEEEEECCCCceEEEE-------EccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecC Q lcl|NC_010808. 151 -YELMIRNQDDETRLYK-------SDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSR 222 (512) Q Consensus 151 -~~~v~~d~~g~~~i~~-------~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~ 222 (512) ++.+|....+...... +.|....+. +-.-+++|... .++. .. +.+........... T Consensus 122 ~~Eivw~~~~~~~~~~~~~~~dg~~~~~~i~~R--------pq~~~~~f~~d-~d~~---l~----~~~~~~~~~~~~~~ 185 (488) T protein:vir:95 122 VNEKVYKKRQGKKGKYQSKFDDGLIGWAKLPIR--------NQSTLDKWYFD-EDFR---RV----TGVRQNLRNVSHIA 185 (488) T ss_pred eeeeeeeccccccccccccccCCeeeeeeeeec--------Ccccccceeec-cCCC---ce----eecccccccccccc Confidence 5677754322111110 111111110 00000111100 0000 00 00000000000000 Q ss_pred Cccccccccccccccccccccc---eEee-----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc Q lcl|NC_010808. 223 TNGLKLTPRENGFESHSFERMP---ITEF-----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLS 294 (512) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~vP---vv~~-----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~ 294 (512) ... ........ ..+| ++.+ ..++.|.|.+..+--.-=--+..+..++..++.|..|+.+..|-.. T Consensus 186 --~~~----~~~~~~~~-~~lP~~kfi~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~ 258 (488) T protein:vir:95 186 --GAI----NLGERPLT-RKLPRAKFMLFKYDDEYGNPEGRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPD 258 (488) T ss_pred --ccc----cccccccc-ccccccceEEEeecCCCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccC Confidence 000 00000000 0123 1222 1356678888776554444456667778888888888888776322 Q ss_pred ---C-ChhhhhhhhhccccccchhhhhhcccccCCCCCcc---------eeEEeec-CCHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010808. 295 ---L-DPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVD---------GGYIYKQ-YDVQGTEAYKDRLNSDIHMFTNT 360 (512) Q Consensus 295 ---~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~ 360 (512) . ..++........ .............+...+.+.+ +..+... .....+..+++.+.+.|.+.--. T Consensus 259 ~~~~~~~~e~~~l~~a~-~~i~~~~~~~~~ag~iiP~g~~~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~iLG 337 (488) T protein:vir:95 259 YLDENAEPEKKAFVQYC-KTVVNDMIANDRAGLIWPRYIDPDTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAFMS 337 (488) T ss_pred CCCCcccHHHHHHHHHH-HHHHHHhhccchhheeeccccccccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHHhc Confidence 1 111111111110 0000000000000001111111 1122221 23345677788888888765433 Q ss_pred ccccc--ccccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcC Q lcl|NC_010808. 361 PNMKD--DNFSGTQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLR-RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKS 436 (512) Q Consensus 361 p~~~~--~~~~~n~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d 436 (512) -.++. +..++.. +.+..+. ....+..-.+.+...+. +++.-++.+ + .+. ...-..+.|...-+.| T Consensus 338 qtLT~~~~~~Gs~Al~~vh~ev----~~~i~~aDa~~i~~tln~~li~~l~~~-N--fg~----~~~~P~~~~~~~e~~D 406 (488) T protein:vir:95 338 DVLAMGQSKYGSFSLADSKTSL----LAMSVDILLKQIKNVINRDLVAQTYAL-N--MWD----DEEHVQITYDDIETPD 406 (488) T ss_pred cccccccCcchhhhHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHh-c--CCC----CCCccEEEecCcChhh Confidence 32222 2212211 1112222 22223333344445553 344444432 1 111 1122467788888899 Q ss_pred HHHHHHHHHHH--hcc-CCh----HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcccCC------CCCCCCCCCCC Q lcl|NC_010808. 437 LIEELKAYIDS--GGK-ISQ----TTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDP------RDINDDEQDDD 503 (512) Q Consensus 437 ~~~~~~~~~kl--~g~-~s~----et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~------~~~~~~~~~~~ 503 (512) ..+.++++.+| .|+ ++. +.+.+.++. ..++.. + ..............+ +.........+ T Consensus 407 l~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gi-p~~~~~-e------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (488) T protein:vir:95 407 LEAIGSYIQKTVAVGALEVDKELSNKLREHIGL-PPADES-Q------PVSEKLSPNSQSRSGDGYKTAGEGTAKTPSAK 478 (488) T ss_pred HHHHHHHHHHHHhCCCccccHHHHHHHHHHhCC-CCCCCC-c------cccccCCCCCCCCCCcccCCCcccCCcccccc Confidence 88999999887 465 663 456666654 221110 0 000000000000000 00000000111 Q ss_pred CcCcccCCC Q lcl|NC_010808. 504 TKDTVDKKE 512 (512) Q Consensus 504 ~~~~~~~~e 512 (512) ......+.+ T Consensus 479 ~~~~a~~~~ 487 (488) T protein:vir:95 479 DPSTANKAN 487 (488) T ss_pred cchhhhhcc Confidence 111111111 No 249 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=76.92 E-value=0.13 Score=25.44 Aligned_cols=451 Identities=11% Similarity=0.071 Sum_probs=167.3 Q ss_pred CCcceeeccccchhhc----cccccCCCcCeeeccc-----chhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_010808. 1 MLKANEFETDTDLREN----RNYLFNDEANVVYTYD-----GTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKT 71 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~----~~~~f~~~~~~~~~~~-----~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~ 71 (512) ||. .-|-++..-... .++.=+++.+..+... +...++...+... .....+|+.+..+++=+. T Consensus 1 ~~~-~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~g~~~~~e~~~~~~--------~eLI~~YR~ma~~pEvd~ 71 (537) T protein:vir:10 1 MAQ-QLFGFSLQRAKKVPKGPSFVQKDSLDGSQPIVGGGYFGYSVDFDGTIRND--------HELITRYREMVLNPECDS 71 (537) T ss_pred Ccc-ccccceeecccccccCCcccCCCcccccceeecccccccccccccccchH--------HHHHHHHHHHhhccchhh Confidence 442 113322211110 0111111111111000 0000000001111 112234444444433222 Q ss_pred cccccccccccccccceeeecchHHHHHHHHH-hhhhccCceecCCc--------hhHHHHHHHHHhccChhHHHHHHHH Q lcl|NC_010808. 72 KNLVELTRRKEEYMADNRVAHDYASYISDFIN-GYFLGNPIQCQDDD--------KDVLEAIEAFNDLNDVESHNRSLGL 142 (512) Q Consensus 72 ~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a-~~l~g~~~~~~~~d--------~~~~~~l~~~~~~n~~~~~~~~~~~ 142 (512) . ...||+..+ .-...+|+.+..++ +...+.++.+++--+|+....+..+ T Consensus 72 A----------------------v~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR 129 (537) T protein:vir:10 72 A----------------------VDDVVNETICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRLLDFDNRAYEIFR 129 (537) T ss_pred H----------------------HHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHh Confidence 1 111222111 11223344443332 2234455666666689999999999 Q ss_pred HHHhCCeEEEEEEECCC----CceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEE Q lcl|NC_010808. 143 DLSIYGKAYELMIRNQD----DETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRY 218 (512) Q Consensus 143 ~~~~~G~a~~~v~~d~~----g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~ 218 (512) .+.+-|+.|.+...|.+ |-..+..++|..+-.+.--.. +....++...... .-.......-+|.+...+. T Consensus 130 ~WYVDgRi~fhKiid~k~pk~GI~ELr~lDPr~i~~vR~i~~--~~~~~~~~~~~~~---~v~~~~~eyf~ynp~g~~~- 203 (537) T protein:vir:10 130 RWYVDGRLFFHKVIDPKKPRQGLVELRYVDPRKIRKVTEYEA--KRPEALRTQDLNQ---QLTQQSASYFLYNPKGLKN- 203 (537) T ss_pred hheeeeEEEEEEEEeCCCccccceeeeeeCCccceeeEeecc--cCCccceEEecce---eeeecccceeeeccccccc- Confidence 99999999999888743 667788999999876653110 0011111110000 0000001112344433210 Q ss_pred EecCCccccccccccccccccccccc--eEee-------cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeee Q lcl|NC_010808. 219 LTSRTNGLKLTPRENGFESHSFERMP--ITEF-------SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI 289 (512) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~~~~~~vP--vv~~-------~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~ 289 (512) ..... -+|| .|.| .|.....|-+...+.-.+.+ +++-|.+...+..+.|-+=+ T Consensus 204 --~~~~~---------------vkI~~dAI~y~hSGl~d~n~~~i~syLhkAiKp~NQL-km~EDAlVIYRitRAPeRRv 265 (537) T protein:vir:10 204 --STNQG---------------MKIAPDSIAYCHSGIQDLNKNMVLSHLHKAIKAVNQL-RMIEDSLVIYRLSRAPERRI 265 (537) T ss_pred --cCCCc---------------eeccHhheeeecccceeCCCCeeeeeehhhhHHHHhh-HHHHhhHHHHhhhccccceE Confidence 00000 0111 0111 12222334444332222222 12233333333333332211 Q ss_pred ----ecCCcC----------------------ChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHH Q lcl|NC_010808. 290 ----KGNLSL----------------------DPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGT 343 (512) Q Consensus 290 ----~g~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 343 (512) .|..+. ...+++..+....+ ++.-...-+. .+.+..+..|-...+...+ T Consensus 266 FYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msM-lEDyWLPRRe----GgrgTEItTLpGgqnlgem 340 (537) T protein:vir:10 266 FYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSM-LEDFWLPRRE----GGRGTEISTLPGGQNLGEL 340 (537) T ss_pred EEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhh-hhhhcccccC----CCcccceeeccccCCcChH Confidence 121111 11111111111111 1111111110 1122234344333332222 Q ss_pred HHHHHHHHHHHHHHhcccccccccccc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccc Q lcl|NC_010808. 344 EAYKDRLNSDIHMFTNTPNMKDDNFSG-TQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKD 421 (512) Q Consensus 344 ~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d 421 (512) .-+.-+.+.++..-++|-.-.+.-++ +. -|..|---+.....-+.+.+..|..-+.++++.=+-+-+.+...+++.- T Consensus 341 -~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i 419 (537) T protein:vir:10 341 -EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFVDLLKTQLILKGICSIEEWEEM 419 (537) T ss_pred -HHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHH Confidence 22344444556666666433222111 11 1223444444555566777777777777776543222222222222222 Q ss_pred cceeeEEeCCCCCcCHHHHHH-------HHHHHh---c-cCChHHHHHhCCCCCCH--HHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_010808. 422 FNTVRYVYNRNLPKSLIEELK-------AYIDSG---G-KISQTTLMSLFSFFQDP--ELEVKKIEEDEKESIKKAQKGI 488 (512) Q Consensus 422 ~~~i~i~f~~~~p~d~~~~~~-------~~~kl~---g-~~s~et~~~~~~~v~d~--~~E~~ri~~E~~~~~~~~~~~~ 488 (512) ...|.+.|...-.-.....++ ++..+. | .+|.+++++.+-..+|. +++-++|++|..+..-...... T Consensus 420 ~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~~~ 499 (537) T protein:vir:10 420 KEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTESEIKEIDKEIKQEIADGVIMDPQAM 499 (537) T ss_pred hhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCcccc Confidence 246777785544433443333 333332 3 47999999987555542 3445666666543211100000 Q ss_pred ---c---cCCCCCCCCCCCC-------CCcCcccCCC Q lcl|NC_010808. 489 ---Y---KDPRDINDDEQDD-------DTKDTVDKKE 512 (512) Q Consensus 489 ---~---~~~~~~~~~~~~~-------~~~~~~~~~e 512 (512) . ++........+++ +.-.....|| T Consensus 500 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 536 (537) T protein:vir:10 500 QAMEMGIGDEEPVPEGGEEPQTDPNSAVSPADQKRGE 536 (537) T ss_pred cccccCCCCcccCCCCCCCcccCCccCCCCCCccCCC Confidence 0 0000000000000 0111222233 No 250 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=76.60 E-value=0.13 Score=25.37 Aligned_cols=369 Identities=9% Similarity=0.009 Sum_probs=132.9 Q ss_pred HHHHHHHhccccccccccc--ccccccccceeeecchHHHHHHHHHhhhhccCceecCCc--hhHHHHHHHHHhc--c-- Q lcl|NC_010808. 60 LKVLSDYYEGKTKNLVELT--RRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDD--KDVLEAIEAFNDL--N-- 131 (512) Q Consensus 60 ~~~~~~yy~G~~~~~~~~~--~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d--~~~~~~l~~~~~~--n-- 131 (512) +-.+.....++........ ......-...-+........|+.+++-+..-|+.+...+ ......+..++.. | T Consensus 1 Mgl~d~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~~~~~~~~~~lL~~~PN~~ 80 (395) T protein:vir:96 1 MGILDFFSFKKSGTLSDDDSGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEKLTENQKDWLYWINTKANPN 80 (395) T ss_pred CcchhhhcCCCCccccccccccchhhhcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCccccccchHHHHHhhcCCCC Confidence 1111111111110000000 000000000111223444556777777666677653222 1122335555532 3 Q ss_pred -ChhHHHHHHHHHHHhCCeEEEEEEECCCCceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEE Q lcl|NC_010808. 132 -DVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLF 210 (512) Q Consensus 132 -~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~y 210 (512) ....+...+..+.+.+|.+|+++..+.... +. ..++.... +...+++.+.. . ....-..+ T Consensus 81 ~t~~~f~~~l~~~lll~Gna~~~~~~~~~~~-----~~--~~~~~~~~------~~~~~~~~v~~-~-----~~~~~~~~ 141 (395) T protein:vir:96 81 QSASQFWVEVVQKLLVDGETLIFVIPGKGIY-----VA--DAFTQDKK------LSGNKFKVSRV-Q-----GQTYEKIF 141 (395) T ss_pred CCHHHHHHHHHHHHhhcCceEEEEEcCCcee-----cC--Cccccccc------cccceeeeeee-c-----cceeeeEe Confidence 334556678888899999999887654211 11 11111000 00001111100 0 00001123 Q ss_pred cCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcc---hHHHHHHHHHHHHHHHH---HHHHHHHhcC Q lcl|NC_010808. 211 TSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGD---YEKVITLIDLYDNAESD---TANYMSDLND 284 (512) Q Consensus 211 t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~---~~~v~~liDa~~~~~s~---~~~~~~~~~~ 284 (512) .+..+.+++.... +.. ..+.|. ...++.+.-+.....+. ..+....... T Consensus 142 ~~~dvih~k~~~~--------------------~~~-----~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 196 (395) T protein:vir:96 142 TFDQVIYLKNDNS--------------------DLM-----LKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVR 196 (395) T ss_pred ccCceEEecccCC--------------------ccc-----cccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccccc Confidence 3444444322110 000 111222 22333333222211111 1111112222 Q ss_pred ceeeeecCCcCChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecC-CHH-----HHHHHHHHHHHHHHHHh Q lcl|NC_010808. 285 AMLLIKGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQY-DVQ-----GTEAYKDRLNSDIHMFT 358 (512) Q Consensus 285 ~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~-----~~~~~~~~l~~~i~~~s 358 (512) +..++.-......+..+.... ........+.......+.+.+..-++... ..+ .+........+.|...- T Consensus 197 ~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~f 272 (395) T protein:vir:96 197 ERAQENSDGGRQPKSDKDFFK----RTIEKIRTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEML 272 (395) T ss_pred cceeeccCchhhHHHHHHHHH----HHHHHhhcCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHh Confidence 222221111111111111100 00000111111111233444444443321 111 22222333456677777 Q ss_pred cccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHH Q lcl|NC_010808. 359 NTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLI 438 (512) Q Consensus 359 ~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~ 438 (512) ++|....+...++.+. .....+...|...+..|...+..+-..... -...+.+.|+.-+..|.. T Consensus 273 gVPp~~l~~~~sn~e~---------------~~~~f~~~~L~P~~~~ie~~l~~~Ll~~~e-~~~~~~f~~~~l~~~d~~ 336 (395) T protein:vir:96 273 GIPISLLHGDIADNQK---------------NYELLLEGPIESLITNIVDGLEYAIFDKSE-TLEGSFIKVTGLKNYDLF 336 (395) T ss_pred CCCHHHhcCCCccHHH---------------HHHHHHHHHHHHHHHHHHHHHHhhcCChhh-hcCceeEeecchhccCHH Confidence 8887665421111111 112344445555555555444432111111 122345677777889999 Q ss_pred HHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 439 EELKAYIDS--GGKISQTTLMSLFSF--FQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 439 ~~~~~~~kl--~g~~s~et~~~~~~~--v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) +.++++.++ .|+++.-.+++.++. ++++.. +++ .. +.+...-++.++|..++.| T Consensus 337 ~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~~~g--D~~----------~~--------~~N~~~~~~~gge~~~~~~ 394 (395) T protein:vir:96 337 SISSQADKLISSGFVFIDEVREEIGLPELPDGLG--KVL----------YM--------TKNYESVLERGGEVDEEVE 394 (395) T ss_pred HHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC--cee----------ee--------cccceechhccCCCCCCCC Confidence 999999887 688999888888754 223211 000 00 0000000011122111122 No 251 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=75.71 E-value=0.14 Score=25.20 Aligned_cols=396 Identities=10% Similarity=-0.021 Sum_probs=155.3 Q ss_pred HHHHHHHHH--HHH-----------HHHHHHHhccccccccccccccccccc------ceee-----ecchHHHHHHHHH Q lcl|NC_010808. 48 YIEHHMDYQ--RPR-----------LKVLSDYYEGKTKNLVELTRRKEEYMA------DNRV-----AHDYASYISDFIN 103 (512) Q Consensus 48 ~i~~~~~~~--~~r-----------~~~~~~yy~G~~~~~~~~~~~~~~~~~------~~ri-----~~n~~~~iv~~~a 103 (512) +..++.... .+. +.....-+.++... ...+.+..+ ..++ ......-.+.+.. T Consensus 1 m~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----g~~~~~~~~iLr~~~~~~ly~~m~~D~hi~s~l~~Rk 76 (448) T protein:vir:79 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYD----VVVDREFDELLQGKDGLLVYHKMLSDGTVKNALNYIF 76 (448) T ss_pred CCCCCCCCccccCcccccccccchhhhhhhhhhcccccc----cccccchhHhhccccchHHHHHHhhChHHHHHHHHHH Confidence 111110000 000 00000000000000 000000000 0000 1234444566666 Q ss_pred hhhhccCceecCC--ch---hHHHHHHHHHhcc-------ChhHHHHHHHHHHHhCCeEE-EEEEE-CCCCceEEEEE-- Q lcl|NC_010808. 104 GYFLGNPIQCQDD--DK---DVLEAIEAFNDLN-------DVESHNRSLGLDLSIYGKAY-ELMIR-NQDDETRLYKS-- 167 (512) Q Consensus 104 ~~l~g~~~~~~~~--d~---~~~~~l~~~~~~n-------~~~~~~~~~~~~~~~~G~a~-~~v~~-d~~g~~~i~~~-- 167 (512) ..+.+.++.+... +. ...+.+.+++... .|..++ .-..++.-||.++ +.+|. ..+|...+..+ T Consensus 77 ~av~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~-~~~lda~~~G~s~~Eivw~~~~~g~~~~~~l~~ 155 (448) T protein:vir:79 77 GRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLF-AIYENAYIYGMAAGEIVLTLGADGKLILDKIVP 155 (448) T ss_pred HHHhcCCceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHH-HHHHHhhhhcceeEEEEeeecCCCceecccccc Confidence 6778888887532 21 2334455555421 344433 3345688899755 56674 45666433222 Q ss_pred -cccee-EEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccce Q lcl|NC_010808. 168 -DAMST-FVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPI 245 (512) Q Consensus 168 -~p~~~-~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPv 245 (512) +|... .-.|+. +.-..+........ ......+..+.|++. + T Consensus 156 r~~~~~~~f~~~~----------------------------------d~~l~~~~~~~~~~-~~~~~~~~~~lP~~~--~ 198 (448) T protein:vir:79 156 IHPFNIDEVLYDE----------------------------------EGGPKALKLSGEVK-GGSQFVSGLEIPIWK--T 198 (448) T ss_pred cCCccccceeeec----------------------------------CCceEEeecCCccc-ccccCCCccccccce--E Confidence 11110 001111 10001110000000 000000111223333 2 Q ss_pred EeecC----CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChh--hhhhhhhccccccchhhhhhc Q lcl|NC_010808. 246 TEFSN----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPD--EVKKQKEANVLFLEPTVYENR 319 (512) Q Consensus 246 v~~~n----~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 319 (512) +.+.+ ++.|.|.+..+.-..=--+..+.+++..++.|+.|+++.+-..+.+.+ +...... ....+..+. T Consensus 199 i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~-----av~~i~~g~ 273 (448) T protein:vir:79 199 VVFLHNDDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKE-----IVKNFVQKP 273 (448) T ss_pred EEEecCccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCCCCcCHHHHHHHHH-----HHHHHhcCC Confidence 33322 456788888877766666778889999999999999987743333321 1111110 000111112 Q ss_pred ccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 320 DTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGL 399 (512) Q Consensus 320 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l 399 (512) ..+...+.+.++++++.......+..+++.+.+.|...--.-.++.+..+| .+..+......-....+..-.+.+...+ T Consensus 274 ~a~~iiP~~~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~g-~~~~~~~~~~~v~~~~~~aDa~~i~~tl 352 (448) T protein:vir:79 274 RHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARALGIDFNTVQLNMG-VQAINIGEFVSLTQQTIISLQREFASAV 352 (448) T ss_pred ceEEEecCCceEEEEecCCCcccHHHHHHHHHHHHHHHHhhhhhccccccc-hhhhhhhhHHHHHHHHHHHHHHHHHHHH Confidence 222334566788898877655556678888888887754333333333222 2222222111111122223334445555 Q ss_pred H-HHHHHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHHHHHHHHHHhccCC--hHHHHHhCCCCCCHHHHHHHHHHH Q lcl|NC_010808. 400 R-RRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKIS--QTTLMSLFSFFQDPELEVKKIEED 476 (512) Q Consensus 400 ~-~~~~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~g~~s--~et~~~~~~~v~d~~~E~~ri~~E 476 (512) . +++.-++.+ + .+. +..-..+.|...-+.|..+.++.+.++.+... .....+.+ .+.++... T Consensus 353 n~~li~~l~~l-N--fg~----~~~~P~~~f~~~e~~Dl~~~a~~~~~l~~~~~~~~~~~~~~~-~~p~~~~~------- 417 (448) T protein:vir:79 353 NLYLIPKLVLP-N--WPS----ATRFPRLTFEMEERNDFSAAANLMGMLINAVKDSEDIPTELK-ALIDALPS------- 417 (448) T ss_pred HHHHHHHHHHh-c--CCC----cCCCcEEEecCCChHHHHHHHHHhhhhhccchhhHHHHHHhh-cCCCCCCC------- Confidence 4 345444442 2 111 11123677877788888888888887765321 11111111 11110000 Q ss_pred HHHHHHHHHhhcccCCCCCCCCCCCCCCcCcccCCC Q lcl|NC_010808. 477 EKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 512 (512) . ........+...+...+..++.-.-+..- T Consensus 418 ---~---~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (448) T protein:vir:79 418 ---K---MRRALGVVDEVREAVRQPADSRYLYTRRR 447 (448) T ss_pred ---c---cccccCCCCcccccccCCccccchhhccc Confidence 0 00000000000000000000000000000 No 252 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=70.37 E-value=0.21 Score=24.30 Aligned_cols=457 Identities=12% Similarity=0.046 Sum_probs=189.6 Q ss_pred CCcceeeccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~ 80 (512) |...+ -..|...+.. +.+-.-...+.-.+|.+.-..|.+...+-|.+...... T Consensus 1 m~~~~--------------------~~~~~~tpe~--la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~~~----- 53 (663) T protein:vir:34 1 MNESQ--------------------PTDFADTPQG--WAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERDSAH----- 53 (663) T ss_pred CCccc--------------------cccchhcchh--HHHHHHHHHHHHHhccchHHHHHHHHHHHhhccccCCC----- Confidence 22211 1112222211 11111112223334555555666777777766432211 Q ss_pred ccccccceeeecchHHHHHHHHHhhhhccCceecC------Cchh----HHHHHHHHH------hccChhHHHHHHHHHH Q lcl|NC_010808. 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQD------DDKD----VLEAIEAFN------DLNDVESHNRSLGLDL 144 (512) Q Consensus 81 ~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~------~d~~----~~~~l~~~~------~~n~~~~~~~~~~~~~ 144 (512) . ...|. |+.=--|.++.--+++.+|..+. -++. +.+.+.+.+ +.++|+.......+++ T Consensus 54 ~----~~~r~--nl~~sni~~i~P~iYar~P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ 127 (663) T protein:vir:34 54 D----AETRW--NLFSTNIQTQMASLYGQTPKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDR 127 (663) T ss_pred c----ccccc--chhhhhHHHHhhhhhcCCCcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhh Confidence 1 11122 33322333344445666665432 1222 223344433 4456888889999999 Q ss_pred HhCCeEEEEEEEC--------------CCC-----------------ceEEEEEccceeEEEEeCCCCceeE--EEEEEe Q lcl|NC_010808. 145 SIYGKAYELMIRN--------------QDD-----------------ETRLYKSDAMSTFVIYDNTIERNSI--AGVRYL 191 (512) Q Consensus 145 ~~~G~a~~~v~~d--------------~~g-----------------~~~i~~~~p~~~~~i~d~~~~~~~~--~~v~~~ 191 (512) +.+|++-+.|-+- +.+ .++|-.+.-.++ ++++...+.-. .+-|.| T Consensus 128 ll~~rG~~~v~Ye~~~~~~~~~~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~df--l~~pAr~W~ev~wva~r~~ 205 (663) T protein:vir:34 128 LLPGFGLCRIRYEVEWEEVAGVDAILDEATGAELAAAVPPTQRKAYECVETDYLHWQDV--LWSPARVWHEVRWLAFRNL 205 (663) T ss_pred hccccceEEEEeecccchhccccccCCCccccchhcccccchhhcccceeeeeechhhc--ccchhhccccccceeeecc Confidence 9999876666331 100 122222221211 22222111111 111111 Q ss_pred e--------------------eee--ecc----CC----cceEEEEEEEcCC--cEEEEEecCCcccccccccccccccc Q lcl|NC_010808. 192 R--------------------TKP--IDK----TD----EDEVFTVDLFTSH--GVYRYLTSRTNGLKLTPRENGFESHS 239 (512) Q Consensus 192 ~--------------------~~~--~~~----~~----~~~~~~~~~yt~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (512) - ... ... ++ .+....-|||.+. +||++.....-+.. +.+.+.+ T Consensus 206 mtk~e~~~rf~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~VwEIWdK~~~~V~w~~eg~~~~L~-----~~~p~lg 280 (663) T protein:vir:34 206 LDMREFNARFDADGSRNLWASVPKVGKPKDGKDGQSCHPWDRAEVWEIWDKGGRKVDWYVEGYSAVLD-----TQPDPLG 280 (663) T ss_pred CCHHHHHHhhcCChhhhhhhhccCcCCccccCCCCCcchhcCcceeEEEecCCcEEEEEEcCcceecc-----cCCCCCC Confidence 0 000 000 00 1233345778664 44444333221111 1222222 Q ss_pred cc---ccceEeecC----CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcCChhh-hhhhhhcccccc Q lcl|NC_010808. 240 FE---RMPITEFSN----NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLSLDPDE-VKKQKEANVLFL 311 (512) Q Consensus 240 ~~---~vPvv~~~n----~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~-~~~~~~~~~~~~ 311 (512) +. -||...+++ +-...++|.--..+++.+|.+-..+ |.+.+.-.+-.+..+-.+.+... +.+...+ .+ T Consensus 281 l~~ffPcPrpl~~~~~~ds~ipvpd~~~y~~~~~E~n~~t~Ri-n~l~d~ikv~gvy~~~~g~~i~~~l~~a~~n-~l-- 356 (663) T protein:vir:34 281 LESFFPCPKPLLANWTTDKVVPRPDFVLAQDLYKEIDLVSTRI-TLLERAIRVVGVYDKSSGLTIGRLLSEAAQN-DL-- 356 (663) T ss_pred CCCCCCCcccccceecCCCeecCCcHHHHHHHHHHHHHHHHHH-HHHHhhhhhceeeccccchhHHHHHHHhhCC-Cc-- Confidence 21 245544443 2345688888888888888764443 33333222222221111111111 1111111 11 Q ss_pred chhhhhhcccccCCCCC--cceeEEeecCC---HHHHHHHHHHHHHHHHHHhcccccccccccccchHHHHHHHHHHHHH Q lcl|NC_010808. 312 EPTVYENRDTGIETEGS--VDGGYIYKQYD---VQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQ 386 (512) Q Consensus 312 ~~~~~~~~~~~~~~~~~--~~~~~l~~~~~---~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~ 386 (512) -+...+... ...++ +.+.|+--+-- ....-..-..++.+++++|++-+..=+....+-++.|-+.+-+.+.. T Consensus 357 vpV~~~~~~---~~~gg~~k~I~~~pi~~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rga~~a~ETatAQ~IKsq~gS~ 433 (663) T protein:vir:34 357 IPVENWLTF---ADKGGLRGVVDWFPLEPVVAALTSLRDYRRELVDALHQVTGMADIMRGASDPRETAMAQGVKAKFGSI 433 (663) T ss_pred eecchhhhh---hhhcCccchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhcccCcchhhHHHHHHHHHHhH Confidence 111111111 11111 12333321111 23333555678889999999887776666666788888888889999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccC---------CCcc----------------cccceeeEEeCCCCCcCHHHHH Q lcl|NC_010808. 387 RTKTKEGLFTKGLRRRAKLLETILKNTRS---------IDAN----------------KDFNTVRYVYNRNLPKSLIEEL 441 (512) Q Consensus 387 k~~~~~~~~~~~l~~~~~li~~~l~~~~~---------~~~~----------------~d~~~i~i~f~~~~p~d~~~~~ 441 (512) ++.+++..+.+..+.++++..+++...-. ...+ ...-.+.|+=....-.|.++.- T Consensus 434 RIqe~qdevqR~arDi~ql~AEIl~~~~~~etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~D~~~eK 513 (663) T protein:vir:34 434 RLQRLQDEVARFASDIQRLKAEVIAEHYDVASILAQANAEFTFDKELAPKAAELIKSRFSMYRVEVKPEAVSLQDFAALR 513 (663) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHhcCCCCcccchhHHHHHHhcCCCcceeeeeccCCCCcCChHHHH Confidence 99999999999999999998888753211 1111 1222345555555666665544 Q ss_pred HHHHHH-hcc--CCh-------------HHHHHhC-----CCC--CCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCC Q lcl|NC_010808. 442 KAYIDS-GGK--ISQ-------------TTLMSLF-----SFF--QDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDD 498 (512) Q Consensus 442 ~~~~kl-~g~--~s~-------------et~~~~~-----~~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 498 (512) +.++++ .++ +++ ..+.+++ ++- .+.+.-++++..- ++..+. .+.+.... T Consensus 514 ~~~~E~l~~i~~~~qq~~pl~~q~p~~~p~l~Ellk~~~~~f~~~~qie~ai~~~~~~----~e~aa~----~~~~~~pa 585 (663) T protein:vir:34 514 NEKMEVLSGIASFMQGVAPLAQQVPGSAPFLLQMLKWSVSGLRGSSTIEGVLDKAIAA----AEEAQK----QAAQQSPA 585 (663) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHhhcCChhhhHHHHHHHHHhh----hHHHhh----ccCCCCcc Confidence 444332 111 111 1122221 110 1111112222221 111111 11111111 Q ss_pred CCCCCCcCcccCCC Q lcl|NC_010808. 499 EQDDDTKDTVDKKE 512 (512) Q Consensus 499 ~~~~~~~~~~~~~e 512 (512) ....+.+-..+... T Consensus 586 ~~~~~~k~~~~q~k 599 (663) T protein:vir:34 586 PQQPDPKVVAQAMK 599 (663) T ss_pred cchhhHHHHHHHHH Confidence 11111111111111 No 253 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=66.72 E-value=0.26 Score=23.76 Aligned_cols=430 Identities=11% Similarity=0.033 Sum_probs=147.3 Q ss_pred CCccee---------eccccchhhccccccCCCcCeeeccc-chhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_010808. 1 MLKANE---------FETDTDLRENRNYLFNDEANVVYTYD-GTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGK 70 (512) Q Consensus 1 ~~~~~~---------~~~~~~~~~~~~~~f~~~~~~~~~~~-~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~yy~G~ 70 (512) |-+-.. |+.=|+.+... -.++.+.--...+ +........ -.-..+..........-+++++..+.- T Consensus 1 ~~~~~~w~~~de~~~~~~~~~~~~~~--~~p~~~dG~s~i~~~~~~~~~~~-~~~~~~~gg~~~n~~eLI~~YR~ma~~- 76 (533) T protein:vir:58 1 MPSLEKYKKLNEAVNFTNFLSPMYGM--GAPHGAGGSSMIPINMYHPFATA-GYASRFYGGIEFNRFFLYDMYDRMDYT- 76 (533) T ss_pred CCCcchhhhhhHHHHHHHhhchhhcc--cCccCCCCCccccCCCCcchhhh-hhhhhhhccccccHHHHHHHHHHhhcc- Confidence 322111 11111111111 0111111100000 000000000 000000000000000001111110000 Q ss_pred ccccccccccccccccceeeecchHHHHHHHH-HhhhhccCceecCCchhHHHHHHH-HHhccChhHHHHHHHHHHHhCC Q lcl|NC_010808. 71 TKNLVELTRRKEEYMADNRVAHDYASYISDFI-NGYFLGNPIQCQDDDKDVLEAIEA-FNDLNDVESHNRSLGLDLSIYG 148 (512) Q Consensus 71 ~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~-a~~l~g~~~~~~~~d~~~~~~l~~-~~~~n~~~~~~~~~~~~~~~~G 148 (512) |+. +.+-...||+.. +..-..+|+.+..++.+..+.+.+ +..-.+|+....+.++.+.+.| T Consensus 77 ~pE-----------------Vd~AideIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~~lldf~~~~~~~fR~WYVDG 139 (533) T protein:vir:58 77 DPL-----------------ISTVLDIIADECTIPNENGNIVDVVTKDIELAKAILSYLDYVINIEKNAYPIIRNMIKYG 139 (533) T ss_pred Ccc-----------------hhhHHHhhhceeeEecCCCceeEeecccccccHHHHHHHHHHhcchhhhhHHHHhhhhcc Confidence 000 001112222222 122344566665554443333322 2333579999999999999999 Q ss_pred eEEEEEEEC-CC-CceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccc Q lcl|NC_010808. 149 KAYELMIRN-QD-DETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGL 226 (512) Q Consensus 149 ~a~~~v~~d-~~-g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~ 226 (512) +.|.+.-.+ ++ |-..+..+||..+-.+|+.-.. ..| -+|++..... .++. T Consensus 140 riy~Hkiik~~k~GI~elr~lDPr~i~~vr~~~t~------~ey-----------------yvy~~~~~~~----~s~~- 191 (533) T protein:vir:58 140 DMFLHILEKGSDGTIEKFQVVSPYIFSKRYNPETD------TWY-----------------YVITDVYRNV----VSGY- 191 (533) T ss_pred eeEEEeccCCcccchhhheecCCeeeEEEEeeccc------eEE-----------------Eeeccccccc----ccCc- Confidence 999888542 33 3347889999998777754221 111 1333321110 0000 Q ss_pred cccccccccccccccccc---eEeec------CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCcee---ee-ecCC Q lcl|NC_010808. 227 KLTPRENGFESHSFERMP---ITEFS------NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAML---LI-KGNL 293 (512) Q Consensus 227 ~~~~~~~~~~~~~~~~vP---vv~~~------n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~l---v~-~g~~ 293 (512) ... .|| |+++. +.+.+.|-+...+.-.+.+- ++-+.+...+..+.|-+ .+ .|.. T Consensus 192 ------~~~------kI~~daI~y~~SGl~d~~~~~iisyLhkAiKp~NQLk-miEDAlVIYRisRAPeRRvFYIDVGNl 258 (533) T protein:vir:58 192 ------FNE------DIPEEDVIHFSHKIDTNFFPYGRSYLESARAIWNQLR-LMEDALMLYRVVRSVDRRVFYVDVGNV 258 (533) T ss_pred ------ccc------ccchhheeeeeeccccCCCCceehhhhHHHHHHHHHH-HHHHHHHHHhhcCChhheEEEEeecCC Confidence 000 111 22221 22333444544333322221 12333333333333222 11 2221 Q ss_pred cCCh----------------------hhhhhhhhcc--ccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHHH Q lcl|NC_010808. 294 SLDP----------------------DEVKKQKEAN--VLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDR 349 (512) Q Consensus 294 ~~~~----------------------~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 349 (512) +... .++.+.+... .-.++.-...-+. .+.+..+..|-. .+. +...-+.- T Consensus 259 pk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRRe----GgrgTEI~TLpG-g~l-gemeDV~Y 332 (533) T protein:vir:58 259 PPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRG----DRRAVEIDILQG-SKV-DLAEDVEY 332 (533) T ss_pred CccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhcccccC----CCccceeeecCC-CCC-CcHHHHHH Confidence 1111 1111111110 0001100001000 111223333322 222 22344555 Q ss_pred HHHHHHHHhccccccccccc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeE Q lcl|NC_010808. 350 LNSDIHMFTNTPNMKDDNFS--GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRY 427 (512) Q Consensus 350 l~~~i~~~s~~p~~~~~~~~--~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i 427 (512) +.+.++..-.+|-.-.+.-+ |..| .|---......-+.+.+..|..-|++-+ + .++.+. ..+| .+ T Consensus 333 F~kkLy~ALnVP~sRl~~e~~fgr~~--eItRDEiKF~KFI~rLR~rF~~ll~~qL--i-----lk~iit-~eew---~~ 399 (533) T protein:vir:58 333 MLNRLISALKVPKAFIGYEGDVNAKN--TLATQDIKFNNTIKRIQGFFVEELERMV--R-----MNKEFA-DQDF---RL 399 (533) T ss_pred HHHHHHHHhCCCeeecCCCCCCccch--hhhHHHHHHHHHHHHHHHHHHHHHhccc--c-----cccCcc-hhhe---ee Confidence 66667777778754333221 2222 2222222233344444555655554422 1 122221 2233 56 Q ss_pred EeCCCCCcCHHHH-------HHHHHHHhccCChHHHHHhCCCCC-CHHHHHHHHHHHHHHHHHHH------HhhcccCCC Q lcl|NC_010808. 428 VYNRNLPKSLIEE-------LKAYIDSGGKISQTTLMSLFSFFQ-DPELEVKKIEEDEKESIKKA------QKGIYKDPR 493 (512) Q Consensus 428 ~f~~~~p~d~~~~-------~~~~~kl~g~~s~et~~~~~~~v~-d~~~E~~ri~~E~~~~~~~~------~~~~~~~~~ 493 (512) .|...-.-..... ++++..+.+.+++.++++.+-..+ |...+.+.|++|.....-.. ......++. T Consensus 400 ~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~tdei~~q~e~ie~E~~~~~~~~~~~~~e~~~~~~~~~ 479 (533) T protein:vir:58 400 VMNRSNSIVEGERFAVIEQRIGIAERLKGWVREDWIYSNILQIPYDLKPQEEVAEAAGGGGLFDTGGFGEETTPADFLGE 479 (533) T ss_pred eeeccchHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCChhhhHHHHHHHHhhcCCCCCCCCcccccCCcccCcc Confidence 6754433333333 334445567899999988864444 43444455665543211000 000000000 Q ss_pred CCCCCCCCCCCcCcccCC-----------------------------C Q lcl|NC_010808. 494 DINDDEQDDDTKDTVDKK-----------------------------E 512 (512) Q Consensus 494 ~~~~~~~~~~~~~~~~~~-----------------------------e 512 (512) .. +..+.+...+..+.+ | T Consensus 480 ~~-~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g~~~~~ 526 (533) T protein:vir:58 480 RG-SPIESPRGRTEFDFGTEGGEELGGELNLGGAFEEFEEETGGGEEE 526 (533) T ss_pred cc-CcccCCCChhhHhcccCCcccccccccccccchhhhhhcCCcccC Confidence 00 000000000000000 0 No 254 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=64.55 E-value=0.3 Score=23.47 Aligned_cols=317 Identities=10% Similarity=0.028 Sum_probs=111.5 Q ss_pred HHHHHhcccccccccccccccccccceee-ecch--HHH------HHHHHHhhhhcc----CceecCC------chhHHH Q lcl|NC_010808. 62 VLSDYYEGKTKNLVELTRRKEEYMADNRV-AHDY--ASY------ISDFINGYFLGN----PIQCQDD------DKDVLE 122 (512) Q Consensus 62 ~~~~yy~G~~~~~~~~~~~~~~~~~~~ri-~~n~--~~~------iv~~~a~~l~g~----~~~~~~~------d~~~~~ 122 (512) +-++-+...+.-...... ....+...++ +..| +.. +.+-.--+..|+ |+.+.+= +..... T Consensus 1 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~ 79 (351) T protein:vir:78 1 MSKRRSRAPRTFAAAPNP-SAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSS 79 (351) T ss_pred CCCCCCCCCCCCCCCCch-hhhhcccceeEEEEcCCceeecCcchhhhhhhhhccCceecCCCCHHHHHHHHhhhHhhhh Confidence 000001110000000000 0000000111 1111 100 111111111121 1111100 000000 Q ss_pred HH-------HHHHhccCh--hHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEee Q lcl|NC_010808. 123 AI-------EAFNDLNDV--ESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLR 192 (512) Q Consensus 123 ~l-------~~~~~~n~~--~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~ 192 (512) .| ...+.-|.. ...+.+++.+.+.+|.||+.+-.+..|++ .+..++|..+.+..+.. +||. T Consensus 80 ~l~~k~n~l~~~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~---------~~~~ 150 (351) T protein:vir:78 80 ALFFKANVLASTFRPHRWLSRHAFERWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFS---------GFVY 150 (351) T ss_pred hhhhhhhHHhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCCEEEEEEecCcceEEeeeCC---------eEEE Confidence 11 111122221 23356788899999999999999888864 45566666654443221 1111 Q ss_pred eeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHH Q lcl|NC_010808. 193 TKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAE 272 (512) Q Consensus 193 ~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~ 272 (512) +.. . . . ...|.++.|.++..-. | .+...|.|++...+.-+..-+.+. T Consensus 151 ~~~---~-~-~---~~~~~~~eVihir~~~---------------------~----~~~~yGl~~~~~a~~si~l~~~a~ 197 (351) T protein:vir:78 151 VNG---W-Q-E---RHEFAPDSVFQLVRPD---------------------I----NQEVYGLPEYLSSLHSAWLNESST 197 (351) T ss_pred Eec---C-C-e---EEEEccccEEEEcCCC---------------------C----CCCcccccHHHHHHHHHHHHHHHH Confidence 110 0 0 0 0123333333332100 0 012246666665555444333322 Q ss_pred HHHHHHHHHhcCceeeee-cCCcCChhhhhhhhhccccccchhhhhhcc-cccCC--CCCcceeEEeecCCHHHHHHHHH Q lcl|NC_010808. 273 SDTANYMSDLNDAMLLIK-GNLSLDPDEVKKQKEANVLFLEPTVYENRD-TGIET--EGSVDGGYIYKQYDVQGTEAYKD 348 (512) Q Consensus 273 s~~~~~~~~~~~~~lv~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~--~~~~~~~~l~~~~~~~~~~~~~~ 348 (512) .-..+.+.-.+.|-.++. .....++++...+++.-. ........... +..+. +++.++..++.......+.+..+ T Consensus 198 ~~~~~~f~NGa~pggIl~~~~~~ls~e~~~~lr~~~~-~~~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~ 276 (351) T protein:vir:78 198 LFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDALK-NAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKN 276 (351) T ss_pred HHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHH-HhcCcccccceeeecCCCCccceeEEEcCCChhHHHHHHHHH Confidence 222233333445544443 112234444444333211 00000000000 00111 22334444443334455667777 Q ss_pred HHHHHHHHHhcccccccccccccchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeE Q lcl|NC_010808. 349 RLNSDIHMFTNTPNMKDDNFSGTQSG-EAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRY 427 (512) Q Consensus 349 ~l~~~i~~~s~~p~~~~~~~~~n~Sg-~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i 427 (512) ...+.|...-++|....+...++.++ .-++ ...+..+...|..+++.+.++....+ .+ -+ T Consensus 277 ~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e----------~~~~~f~~~~l~P~~~~iee~n~~l~-----~~----~~ 337 (351) T protein:vir:78 277 VTRDDLLAAHRVPPQLLGIVPSNSGGFGTPD----------TAARVFGRNEIRPLQARFAELNDWLG-----DE----VV 337 (351) T ss_pred HhHHHHHHHhCCCHHHhcccCCCCCCcccHH----------HHHHHHHHHHHHHHHHHHHHHHhhcC-----cc----ce Confidence 78888999999987665543322211 1111 01112223333333333333221111 01 14 Q ss_pred EeCCCCCcCHHHHH Q lcl|NC_010808. 428 VYNRNLPKSLIEEL 441 (512) Q Consensus 428 ~f~~~~p~d~~~~~ 441 (512) .|++..-...++.+ T Consensus 338 ~F~~~~Llr~d~ka 351 (351) T protein:vir:78 338 RFDDYEIPPAPVAA 351 (351) T ss_pred ecChhhhccccccC Confidence 56443322222222 No 255 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=60.37 E-value=0.37 Score=22.93 Aligned_cols=242 Identities=9% Similarity=-0.028 Sum_probs=102.0 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCCc-hhHHHH Q lcl|NC_010808. 46 SKYIEHHMDYQR-PRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDD-KDVLEA 123 (512) Q Consensus 46 ~~~i~~~~~~~~-~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d-~~~~~~ 123 (512) ..+......+.. +.-.....+.. ..+..............-+...-..-.|+.+++-+..-|+.+.-.. ...... T Consensus 1 MglF~~~~~r~~~~~~~~~~~~~~---~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~~~~~ 77 (251) T protein:vir:46 1 MGIFYKNEKRDLQYNEDDLQMMVQ---TLPSFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDR 77 (251) T ss_pred CCccccccccccCCCccchhhhhh---hhccccCcCcceechhhhhccHHHHHHHHHHHHhHhhCceEEeeCccccccch Confidence 000000000000 00000000000 0000000000000001112223344567777777777777653322 112223 Q ss_pred HHHHHh-c-c---ChhHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEeeeeeec Q lcl|NC_010808. 124 IEAFND-L-N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPID 197 (512) Q Consensus 124 l~~~~~-~-n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~ 197 (512) +..++. . | ....+...+..+.+.+|.||+++.++.+|++ .+..++|..+.+..++. ..+.+ +|...... T Consensus 78 ~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~--g~~~~---~~~~~~~~ 152 (251) T protein:vir:46 78 IVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDAR--GRLYY---FHQRIDSN 152 (251) T ss_pred HHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCceEEEEECCC--CcEEE---EEEEeccC Confidence 344442 1 3 3445677788899999999999999999875 58889999998887653 22221 11111111 Q ss_pred cCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 198 KTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTAN 277 (512) Q Consensus 198 ~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~s~~~~ 277 (512) .. .....|.++.+.+++.. |. +.-.|.|.++.+...+.....+..-..+ T Consensus 153 --~~---g~~~~~~~~diiH~r~~----------------------~~----dg~~G~spi~~~~~~i~~~~~~~~~~~~ 201 (251) T protein:vir:46 153 --GN---NIERNVKFEDMLDIKFY----------------------SL----DGINGLSLLDTLSRTIESDNNGKDFLNN 201 (251) T ss_pred --Cc---ceeEEECCccEEEecCc----------------------CC----CCeeecCHHHHHHHHHHHHHHHHHHHHH Confidence 00 01123455555554321 00 1124777777777766666655555555 Q ss_pred HHHHhcCceeeeecCCcC-ChhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHH Q lcl|NC_010808. 278 YMSDLNDAMLLIKGNLSL-DPDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQ 341 (512) Q Consensus 278 ~~~~~~~~~lv~~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 341 (512) .+.-.+.|-.+++-.... +++.....+..-. ... . +....+. +... ..+ T Consensus 202 ~f~ng~~p~gil~~~~~l~~~e~~~~~~~~~~------~~~---~--g~~n~g~---~~~g-m~~ 251 (251) T protein:vir:46 202 FLRNGTHAGGILKMKGVLDNKKARDRAREEFP------KVL---V--ELNKLGK---LSYS-MNQ 251 (251) T ss_pred HHHccCCCcEEEEeCCCCCCHHHHHHHHHHHH------HHh---c--Ccccccc---cccc-cCC Confidence 566666676666543222 2222222221100 000 0 0000111 0000 001 No 256 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=56.95 E-value=0.44 Score=22.51 Aligned_cols=350 Identities=12% Similarity=0.054 Sum_probs=130.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceec---CCc----- Q lcl|NC_010808. 46 SKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQ---DDD----- 117 (512) Q Consensus 46 ~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~---~~d----- 117 (512) ..+ +.++..++.++...-..+.... .....-+......-.|+.+++-+..-|+.+- .++ T Consensus 1 M~~-----------f~k~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:85 1 MNL-----------FGKVVSFSRGKLNNDTQRVTAW--QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred Cch-----------hhhhhhhhhcccccCCcceeee--eccchhhhhHHHHHHHHHHHHhHhhCceeEEEEecccccccc Confidence 111 1111111211111000000000 0000111223344566777776666676431 110 Q ss_pred --hhHHHHHHHHHhc--c---ChhHHHHHHHHHHHhCCeEEEEE-EECCCCceEEEEEccceeEEEEeCCCCceeEEEEE Q lcl|NC_010808. 118 --KDVLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELM-IRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVR 189 (512) Q Consensus 118 --~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v-~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~ 189 (512) +.....|.+++.. | ....+...+..+.+.+|.||++. +.+..|++...+ |.+ T Consensus 68 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~---------~~~----------- 127 (378) T protein:vir:85 68 LISMAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSETGELLDLL---------FAN----------- 127 (378) T ss_pred ccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCCceEEEEE---------ecC----------- Confidence 1123345666542 3 23445666778888999999763 344444322111 000 Q ss_pred EeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHH Q lcl|NC_010808. 190 YLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYD 269 (512) Q Consensus 190 ~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~ 269 (512) . . ..|.++.+.+++. | -...+....+....++++ T Consensus 128 ----------~--~----~~~~~~dvih~~~-----------------------~-------~~~~~~~~~~~~a~~~~~ 161 (378) T protein:vir:85 128 ----------D--K----KEYKPEELVRLVS-----------------------P-------FYINEDTSILDNALASIQ 161 (378) T ss_pred ----------C--C----EEEcccceEEEec-----------------------C-------cCccchhhHHHHHHHHHH Confidence 0 0 0122233333221 0 000011111222222222 Q ss_pred HHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhcccccc-chhhhhhcccccCCCCCcceeEEeecCCHHHHHHHHH Q lcl|NC_010808. 270 NAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFL-EPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKD 348 (512) Q Consensus 270 ~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 348 (512) .. +. .+.+--+++-....+++..+..++.-.-.. ..............+++.+++.++.......+ ...+ T Consensus 162 ~~-------~~-~~~~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~ 232 (378) T protein:vir:85 162 TK-------LE-QGKLRGLLKINAFLDIDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIE 232 (378) T ss_pred HH-------Hh-cCCcceEEEeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEeccCChhhhhH-HHHH Confidence 11 11 122222222111122222222221110000 00000011112233455566555543322333 3456 Q ss_pred HHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-------cccc Q lcl|NC_010808. 349 RLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID-------ANKD 421 (512) Q Consensus 349 ~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~-------~~~d 421 (512) .+.+.|+..-++|..... +..+.. .....+...|..++..|...+..+-..+ .... T Consensus 233 ~~~~~Ia~~fgVPp~~l~---~s~~e~--------------~~~~f~~~tL~P~~~~ie~~l~~kLl~~~er~~~~~~~~ 295 (378) T protein:vir:85 233 LIKSELLTGYFMNENILL---GTATQE--------------QQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLY 295 (378) T ss_pred HHHHHHHHHhCCCHHHhc---CCchHH--------------HHHHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhccc Confidence 667788888888865442 111110 0112445556665555555544321110 0001 Q ss_pred cceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCC Q lcl|NC_010808. 422 FNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFF--QDPELEVKKIEEDEKESIKKAQKGIYKDPRDIND 497 (512) Q Consensus 422 ~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 497 (512) ..++.+.+..-+-.|..+.++.+.++ .|+++.-.++++++.- ++-+ ++ .+ ....... ......+. T Consensus 296 ~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~gGD-~~-~~--------~~N~~~~-~~~~~~~~ 364 (378) T protein:vir:85 296 YERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD-IY-IA--------NLNAVAV-KNLSDLQG 364 (378) T ss_pred cceeeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-eE-ee--------ccccccc-ccchhhcC Confidence 12234444555677888999988887 6899998888888542 2111 00 00 0000000 00111111 Q ss_pred CCCCCCCcCcccCC Q lcl|NC_010808. 498 DEQDDDTKDTVDKK 511 (512) Q Consensus 498 ~~~~~~~~~~~~~~ 511 (512) .+.....+++.+++ T Consensus 365 ~~~~~~~~~e~~n~ 378 (378) T protein:vir:85 365 SRKDVASTDETNNQ 378 (378) T ss_pred ccCCCCCCCCCCCC Confidence 11112222222222 No 257 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=46.39 E-value=0.74 Score=21.31 Aligned_cols=350 Identities=11% Similarity=0.060 Sum_probs=133.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCcee-c--CCc----- Q lcl|NC_010808. 46 SKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQC-Q--DDD----- 117 (512) Q Consensus 46 ~~~i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~-~--~~d----- 117 (512) ..+..+..... + ...+.|..+..... . ...-+........|+.+++-+..-|+.+ . ..+ T Consensus 1 M~if~~~~~~~--~----~~~~~~~~~~~~~~----~---~~~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:94 1 MNLFGKVVSFS--R----GKLNNDTQRVTAWQ----N---EAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CchhHHhHhhh--h----cccccCcceeeeee----c---chhhhhhHHHHHHHHHHHHhHhhCceeeeeeccccccccc Confidence 11111111100 0 01111111111000 0 0000112345566777777777767643 1 111 Q ss_pred --hhHHHHHHHHHhc--c---ChhHHHHHHHHHHHhCCeEEEE-EEECCCCceEEEEEccceeEEEEeCCCCceeEEEEE Q lcl|NC_010808. 118 --KDVLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYEL-MIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVR 189 (512) Q Consensus 118 --~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~-v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~ 189 (512) ......|..+|.. | ....+...+..+.+..|.||++ ++.+..|++... T Consensus 68 ~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~------------------------ 123 (378) T protein:vir:94 68 LISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELLDL------------------------ 123 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEEEE------------------------ Confidence 1122345555542 3 3345666688889999999976 344444433211 Q ss_pred EeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHH Q lcl|NC_010808. 190 YLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYD 269 (512) Q Consensus 190 ~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~ 269 (512) ++.. +. ..|.+..+.+++. |. .. ..+.+.+. .+.++++ T Consensus 124 ~~~~------~~------~~~~~~dvih~~~-----------------------~~---~~-~~~~~~~~---~~~~~~~ 161 (378) T protein:vir:94 124 LFAN------DK------KEYKPEELVRLTS-----------------------PF---YI-NEDTSILD---NALASIQ 161 (378) T ss_pred EEec------Cc------EEechhceeeecC-----------------------cC---Cc-ccchhHHH---HHHHHHH Confidence 0000 00 0122222322210 00 00 01112222 2222222 Q ss_pred HHHHHHHHHHHHhcCceeeeecCCcCChhhhhhhhhccccccch-hhhhhcccccCCCCCcceeEEeecCCHHHHHHHHH Q lcl|NC_010808. 270 NAESDTANYMSDLNDAMLLIKGNLSLDPDEVKKQKEANVLFLEP-TVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKD 348 (512) Q Consensus 270 ~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 348 (512) ... .. +.+-.+++-....+++..+..++.-.-.+.. ....+.......+++.+++.++.......+ ..++ T Consensus 162 ~~~-------~~-~~~~g~l~~~~~l~~~~~~~~~e~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~~-~~~~ 232 (378) T protein:vir:94 162 TKL-------EQ-GKLRGLLKINAFLDIDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEID 232 (378) T ss_pred HHH-------hh-CCcccceeeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceeccCCceEEEccCChHHhhH-HHHH Confidence 211 11 1222222222222222222222111100000 000111112334555666656543332333 4456 Q ss_pred HHHHHHHHHhcccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-------cccc Q lcl|NC_010808. 349 RLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSID-------ANKD 421 (512) Q Consensus 349 ~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~-------~~~d 421 (512) .+.+.|+..-++|..... +..+.. .....+...|..++..|...++..-... .... T Consensus 233 ~~~~~Ia~~fgvPp~~l~---g~~~e~--------------~~~~f~~~tl~P~~~~ie~~l~~~Ll~~~e~~~g~~~~~ 295 (378) T protein:vir:94 233 LIKSELLTGYFMNENILL---GTATQE--------------QQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLY 295 (378) T ss_pred HHHHHHHHHhCCCHHHhc---CCchHH--------------HHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhhcc Confidence 677888888888864432 211110 0112334455555555554444321100 0111 Q ss_pred cceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHHHHhhcccCCCCCCC Q lcl|NC_010808. 422 FNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFF--QDPELEVKKIEEDEKESIKKAQKGIYKDPRDIND 497 (512) Q Consensus 422 ~~~i~i~f~~~~p~d~~~~~~~~~kl--~g~~s~et~~~~~~~v--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 497 (512) ...+.+.++.-+-.|..+.++++.++ .|+++.-.++++++.- ++-+ ++- + ...... .......+. T Consensus 296 ~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~~~g~~p~~ggd-~~~-~--------~~n~~~-~~~~~~~~~ 364 (378) T protein:vir:94 296 YERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD-VYI-A--------NLNAVA-VKNLSDLQG 364 (378) T ss_pred cceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-eee-e--------cccccc-hhcchhccc Confidence 23455556677778899999999887 6899998888887542 2111 000 0 000000 000111111 Q ss_pred CCCCCCCcCcccCCC Q lcl|NC_010808. 498 DEQDDDTKDTVDKKE 512 (512) Q Consensus 498 ~~~~~~~~~~~~~~e 512 (512) .+.+...+++.+ .| T Consensus 365 ~~~~~~~~~e~~-n~ 378 (378) T protein:vir:94 365 NRKDVTSTDETN-NQ 378 (378) T ss_pred ccCCCCCCCCCC-CC Confidence 111112222222 22 No 258 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=43.50 E-value=0.84 Score=20.99 Aligned_cols=311 Identities=13% Similarity=0.057 Sum_probs=113.2 Q ss_pred HHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhh------hc---cCc-eecC------Cc------ Q lcl|NC_010808. 60 LKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYF------LG---NPI-QCQD------DD------ 117 (512) Q Consensus 60 ~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l------~g---~~~-~~~~------~d------ 117 (512) +++... ......... .+..-.+..|.-.+-....+|+ .| +|+ .+.+ .+ T Consensus 1 ~~~~~~-------~~~~~~~~~---~~~~~~~f~~~~~~~~~~~~y~~~~~~~~~~~~epp~~~~~la~l~~~~~~h~~~ 70 (345) T protein:vir:37 1 MKTNVK-------TDNKKGIVI---APINDRTFSLNEISASPALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGI 70 (345) T ss_pred CCCCcc-------ccchhhccc---CcceeEEeecCCcccccchhhhhhhhcCCccccCCCCCHHHHHHHhhcccccccc Confidence 000000 000000000 0001111222221111222222 01 111 1000 00 Q ss_pred --hhHHHHHHHHHhccCh--hHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEee Q lcl|NC_010808. 118 --KDVLEAIEAFNDLNDV--ESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLR 192 (512) Q Consensus 118 --~~~~~~l~~~~~~n~~--~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~ 192 (512) -+. ..|...+.-|.. ...+.+++.+.+.+|.||+.+.++..|++ .+..++|..+.+..+.. ....++++. T Consensus 71 i~~k~-n~l~~~~~Pn~~lt~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~d~~----~~~~~~~~~ 145 (345) T protein:vir:37 71 LHSRA-NMVSSLYEGGKALSRMDMRALCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVRKDGG----YSYLMKKSL 145 (345) T ss_pred eeeec-hHHHhhccCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEEeCC----eeEEEEEeE Confidence 000 011122222321 12356788899999999999999888875 45667777665443321 111122111 Q ss_pred eeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHH Q lcl|NC_010808. 193 TKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAE 272 (512) Q Consensus 193 ~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~ 272 (512) .. .. .. ...|.++.+++++... | .+...|.|.+...+..+..-. .. T Consensus 146 ~~----~~-g~---~~~~~~~dVihir~~~---------------------~----~~~~~Gls~~~~a~~si~l~~-~a 191 (345) T protein:vir:37 146 YD----TA-QE---IYRYDAKDIIFIKLYD---------------------P----MQQVYGSPDYVGGIQSALLNS-DA 191 (345) T ss_pred ec----CC-ce---EEEEccccEEEecCCC---------------------C----CCCcccccHHHHHHHHHHHHH-HH Confidence 10 00 00 0012333333221100 0 011246666665444433222 22 Q ss_pred HHH-HHHHHHhcCceeeee-cCCcCChhhhhhhhhccccccchhhhhhcccccC--CCCCcceeEEeecCCHHHHHHHHH Q lcl|NC_010808. 273 SDT-ANYMSDLNDAMLLIK-GNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIE--TEGSVDGGYIYKQYDVQGTEAYKD 348 (512) Q Consensus 273 s~~-~~~~~~~~~~~lv~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~l~~~~~~~~~~~~~~ 348 (512) +.+ ....+-.+.|-.++. .....++++...+++.-.-....+.........+ .+++.++..++.......+.+..+ T Consensus 192 ~~~~~~~f~NG~~p~~Il~~~d~~l~~e~~~~lk~~~~~~~g~~n~~~~~i~~p~g~~~G~~~~pls~~~~d~qf~e~k~ 271 (345) T protein:vir:37 192 TVFRRRYFSNGAHMGFILYSTDPDLTEEMEEEIARKISESKGVGNFRSMFVNIANGHPDGLKVIPIGDTGTKDEFANIKN 271 (345) T ss_pred HHHHHHHHhccCCcceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceEEEcCCCcccceEEEEccCChhHHHHHHHHH Confidence 222 222233345555543 1122334444333322100000000000001111 123333444433333455666777 Q ss_pred HHHHHHHHHhcccccccccccccchH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceee Q lcl|NC_010808. 349 RLNSDIHMFTNTPNMKDDNFSGTQSG--EAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVR 426 (512) Q Consensus 349 ~l~~~i~~~s~~p~~~~~~~~~n~Sg--~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~ 426 (512) ...+.|...-++|....+....+.++ .+-+. ....+...|.-+++.+...++..... ..... T Consensus 272 ~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~~-----------~~~f~~~~l~P~~~~ie~~ln~~~~~-----~~~~~ 335 (345) T protein:vir:37 272 ISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKY-----------REVYHYDEVMPLQEIIAETINQDPEI-----KNLLK 335 (345) T ss_pred HhHHHHHHHhCCCHHHhCccCCCCCCcccHHHH-----------HHHHHHHHHHHHHHHHHHHhhhhccC-----CCcce Confidence 88888999999997765533222211 11111 11223334444444444444322111 11235 Q ss_pred EEeCC-CCCc Q lcl|NC_010808. 427 YVYNR-NLPK 435 (512) Q Consensus 427 i~f~~-~~p~ 435 (512) +.|++ .+.. T Consensus 336 i~F~~~~L~~ 345 (345) T protein:vir:37 336 IKFREQNFAK 345 (345) T ss_pred EEecchhhcC Confidence 66753 3333 No 259 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=41.92 E-value=0.91 Score=20.82 Aligned_cols=311 Identities=11% Similarity=0.022 Sum_probs=103.4 Q ss_pred HHHHHHHHHHHHHHHHHHhcccccccccccccccccccce-eeecchHH------HHHHHHH----hhhhccCceecCCc Q lcl|NC_010808. 49 IEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADN-RVAHDYAS------YISDFIN----GYFLGNPIQCQDDD 117 (512) Q Consensus 49 i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~-ri~~n~~~------~iv~~~a----~~l~g~~~~~~~~d 117 (512) +++++ ............. ..... -+..+=|. .+.+-.- +..+.-|+.+.+=. T Consensus 1 m~~~~--------------~~~~~~~~~~~~~---~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la 63 (344) T protein:vir:60 1 MSKKK--------------GKTLQPAAKKMTA---SAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPISFTGLA 63 (344) T ss_pred CCccc--------------CCCCCchHHhhcC---CcCcEEEEEcCCceeecCCcchhHHHHhhhcCccccCCCCHHHHH Confidence 00000 0000000000000 00000 00011110 0111110 11111122221100 Q ss_pred h----------hH---HHHHHHHHhccCh--hHHHHHHHHHHHhCCeEEEEEEECCCCceE-EEEEccceeEEEEeCCCC Q lcl|NC_010808. 118 K----------DV---LEAIEAFNDLNDV--ESHNRSLGLDLSIYGKAYELMIRNQDDETR-LYKSDAMSTFVIYDNTIE 181 (512) Q Consensus 118 ~----------~~---~~~l~~~~~~n~~--~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~-i~~~~p~~~~~i~d~~~~ 181 (512) . .. ...|...+.-|.. ...+..++.+.+.+|.||+.+-.+..|++. +..++|..+-...+.. T Consensus 64 ~~~~a~~~h~~~i~~k~n~l~~~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~i~rn~~G~~~~L~~l~~~~vr~~~~~~-- 141 (344) T protein:vir:60 64 KSLRAAVHHSSPIYVKRNILASTFIPHPWLSQQDFSRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEED-- 141 (344) T ss_pred HHHHhhhhhccchhhhhhHHHhhccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCcceEEEeecCC-- Confidence 0 00 0012222233321 123567888999999999999888888753 4555555443322211 Q ss_pred ceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHH Q lcl|NC_010808. 182 RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKV 261 (512) Q Consensus 182 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v 261 (512) +||.+.. . .. ...|.++.|.++.... | .+.-.|.|.+... T Consensus 142 -------~~~~v~~---~--~~---~~~~~~~eIiHir~~~---------------------~----~~~~yGlsp~~~a 181 (344) T protein:vir:60 142 -------VYWWVPS---F--NE---PTAFAPGSVFHLLEPD---------------------I----NQELYGLPEYLSA 181 (344) T ss_pred -------eEEEEcc---C--Ce---EEEEcCccEEEEcCCC---------------------C----CCCcccccHHHHH Confidence 1111110 0 00 0012222222221100 0 0112466666544 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCcCChhhhhhhhhccccccchhhhhhcccccCC--CCCcceeEEeec Q lcl|NC_010808. 262 ITLIDLYDNAESDTANYMSDLNDAMLLIK--GNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIET--EGSVDGGYIYKQ 337 (512) Q Consensus 262 ~~liDa~~~~~s~~~~~~~~~~~~~lv~~--g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~l~~~ 337 (512) +.-++.-..+..-......-.+.|-.++. |. ..+++..+.+++.-.-....+.........+. .++.++..++.. T Consensus 182 ~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~~-~ls~e~~~~ik~~~~~~~g~~~~r~~~l~~p~g~~~g~~~~pis~~ 260 (344) T protein:vir:60 182 LNSAWLNESATLFRRKYYENGAHAGYIMYVTDA-VQDRNDIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEV 260 (344) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEecCc-CCCHHHHHHHHHHHHHhcCCCCCcceEEecCCCCccceeEEEcCCC Confidence 43333222211111222232344444443 32 23444433333221000001110111111111 223334444333 Q ss_pred CCHHHHHHHHHHHHHHHHHHhcccccccccccccchH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_010808. 338 YDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSG--EAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRS 415 (512) Q Consensus 338 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg--~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~ 415 (512) .....+.+..+...+.|...-++|....+...++.++ .+-+. .+......|.-+++.+.++-...+. T Consensus 261 ~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~e~~-----------~~~f~~~~L~Pl~~~~e~ln~~lg~ 329 (344) T protein:vir:60 261 ATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKV-----------AKVFVRNELIPLQDRIREINGWLGQ 329 (344) T ss_pred hhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHH-----------HHHHHHHHHHHHHHHHHHHHHhcCC Confidence 3445567778888889999999997766543333221 11111 1111222223222222222111111 Q ss_pred CCcccccceeeEEeCCCCCcCHHH Q lcl|NC_010808. 416 IDANKDFNTVRYVYNRNLPKSLIE 439 (512) Q Consensus 416 ~~~~~d~~~i~i~f~~~~p~d~~~ 439 (512) . .++|.+..-....+ T Consensus 330 -------~--~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 330 -------E--VIRFKNYSLDTDNG 344 (344) T ss_pred -------c--ccccCccccCCCCC Confidence 0 13344332222222 No 260 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=36.83 E-value=1.2 Score=20.25 Aligned_cols=427 Identities=13% Similarity=0.075 Sum_probs=159.3 Q ss_pred CCcceeeccccchhhccccccCCCcCe---eecc-----------cchhHHhhhcHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 1 MLKANEFETDTDLRENRNYLFNDEANV---VYTY-----------DGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDY 66 (512) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~f~~~~~~---~~~~-----------~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~~~~y 66 (512) -++.++=+..-......-++-+++.+. .... .+...+....+ +. .....+|+.+..+ T Consensus 4 w~~~de~~~~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~-------~~--~eLI~~YR~ma~~ 74 (511) T protein:vir:56 4 WTKEEEQDIQKIEKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTI-------PV--KELIKSYRALAEY 74 (511) T ss_pred ccchhhhhhhhhccCCcccccCCCCCCCceEEecccccceecceeccccccccCcc-------ch--HHHHHHHHHHhhc Confidence 111111000000011111111111111 0000 00000000000 00 1222345555444 Q ss_pred hcccccccccccccccccccceeeecchHHHHHHHHH-hhhhccCceecCCc--------hhHHHHHHHHHhccChhHHH Q lcl|NC_010808. 67 YEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFIN-GYFLGNPIQCQDDD--------KDVLEAIEAFNDLNDVESHN 137 (512) Q Consensus 67 y~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a-~~l~g~~~~~~~~d--------~~~~~~l~~~~~~n~~~~~~ 137 (512) ++=+..+ ..||+..+ .--..+|+.+..++ +...+..+.+++--+|+... T Consensus 75 pEvd~Av----------------------~eIvne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~ 132 (511) T protein:vir:56 75 HEVDDAI----------------------QEIVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVSLLQMRKHG 132 (511) T ss_pred cchhhHH----------------------HHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhh Confidence 4333221 11111111 11122333333322 22344556666666899999 Q ss_pred HHHHHHHHhCCeEEEEEEECCC-CceEEEEEccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEE Q lcl|NC_010808. 138 RSLGLDLSIYGKAYELMIRNQD-DETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVY 216 (512) Q Consensus 138 ~~~~~~~~~~G~a~~~v~~d~~-g~~~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~ 216 (512) .+..+.+.+-|+.|.+.-.|++ |-..+..+||..+-.|..-- .+.+..+... ......-+|.+.... T Consensus 133 ~~~fR~WYVDgRi~fHkiid~k~GI~eLr~lDPr~i~~vr~i~--~~~~~~~~v~----------~~~~ey~~Y~~~~~~ 200 (511) T protein:vir:56 133 YKWFRKWYVDSRIYFHKILDKDNNIIELRPLNPMKMELVREIQ--KETIDGVEVV----------KGTLEYYVYKQSDYK 200 (511) T ss_pred hHHHhhhhhcceEEEEEEeccccceeehhhcCcccchhhhhhh--cccccccccc----------cceeeeeEecCCCcc Confidence 9999999999999988776654 54567889998876664311 1111111100 111122234332211 Q ss_pred EEEecCCccccccccccccccccccccc---eEee--------cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_010808. 217 RYLTSRTNGLKLTPRENGFESHSFERMP---ITEF--------SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDA 285 (512) Q Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~~~~~vP---vv~~--------~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~ 285 (512) ....... .+ .+..--+|| |++. .|+....|-+...+.-.+.+ +++-|.+...+..+.| T Consensus 201 ~~~~~~~----~~------~~~~~vkI~~daI~y~hSGL~d~~~~~g~i~syLhkAiKp~NQL-km~EDAlVIYRitRAP 269 (511) T protein:vir:56 201 MPSWMSA----TN------RAQTSFRIPKDAIVFAHSGLMRGCADDPYIIGYLDRAIKPANQL-KMLEDALVIYRLARAP 269 (511) T ss_pred cCccccc----cc------ccccceeechhheeeecccceeccCCCCeeeccchhhhHHHHhh-HHHHhhHHHHhhhccc Confidence 0000000 00 000000122 1110 12222344444332222222 1223333333333333 Q ss_pred eeee----ecCCcCC----------------------hhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCC Q lcl|NC_010808. 286 MLLI----KGNLSLD----------------------PDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYD 339 (512) Q Consensus 286 ~lv~----~g~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 339 (512) -+=+ .|..+.. ..++...+....+ ++.-..+-+. .+.+..+..|-...+ T Consensus 270 eRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msM-lEDyWLpRRe----GgrgTEItTLpGgqn 344 (511) T protein:vir:56 270 ERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQVKNTTNAMSM-LEDYYLPRRE----GSKGTEVSTLPGGQS 344 (511) T ss_pred cceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhh-HhhhcccccC----CCCccceeeccccCC Confidence 3211 1221111 1111111111111 0100111110 112223434433333 Q ss_pred HHHHHHHHHHHHHHHHHHhccccccccc------cc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010808. 340 VQGTEAYKDRLNSDIHMFTNTPNMKDDN------FS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKN 412 (512) Q Consensus 340 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~------~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~ 412 (512) ...+ .-+.-+.+.++..-++|-.-.+. |+ | -|..|---+.....-+.+.+..|..-+.++++.=+-+-+. T Consensus 345 lgem-~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~G--r~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgi 421 (511) T protein:vir:56 345 LGDI-EDVLYFNRKLYKAMRIPTSRAASEDQTGGINFG--QGAEITRDELKFTKFVKRLQTKFETVITDPLKHQLIVNNI 421 (511) T ss_pred cChH-HHHHHHHHHHHHHhCCCcccccCCCCccccccc--cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccC Confidence 2222 23444555566666777433321 21 1 1234444444555566777777777777776543222222 Q ss_pred ccCCCcccccceeeEEeCCCCCcCHHHHHH-------HHHHHh---c-cCChHHHHHhCCCCCCH--HHHHHHHHHHHHH Q lcl|NC_010808. 413 TRSIDANKDFNTVRYVYNRNLPKSLIEELK-------AYIDSG---G-KISQTTLMSLFSFFQDP--ELEVKKIEEDEKE 479 (512) Q Consensus 413 ~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~-------~~~kl~---g-~~s~et~~~~~~~v~d~--~~E~~ri~~E~~~ 479 (512) +...+++.-...|.+.|...-.-.+...++ ++..+. | .+|.+++++.+-..+|. .++-++|++|..+ T Consensus 422 it~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~ILr~tDeei~~~~k~I~~E~k~ 501 (511) T protein:vir:56 422 ITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHKYIQKNILRLSDDQITAMQSEIDEEETN 501 (511) T ss_pred CCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccchHHHHHHHhccCHHHHHHHHHHHHHhhcC Confidence 222222222246777785544433443333 333343 3 36999999987655542 2333444444332 Q ss_pred HHHHHHhhcccCCCCCCC Q lcl|NC_010808. 480 SIKKAQKGIYKDPRDIND 497 (512) Q Consensus 480 ~~~~~~~~~~~~~~~~~~ 497 (512) ...+.+. .+. T Consensus 502 -------~~~~~~e-~~f 511 (511) T protein:vir:56 502 -------PRFQQDD-QGF 511 (511) T ss_pred -------CCCCCcc-cCC Confidence 2222111 111 No 261 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=34.27 E-value=1.3 Score=19.95 Aligned_cols=322 Identities=12% Similarity=0.003 Sum_probs=105.5 Q ss_pred HHHHHHHHHHHHHHHHHHhccccccccccccccccc-cc----ceeeecchHHHHHHHHHhhhhccCceecCC------c Q lcl|NC_010808. 49 IEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEY-MA----DNRVAHDYASYISDFINGYFLGNPIQCQDD------D 117 (512) Q Consensus 49 i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~-~~----~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~------d 117 (512) +.+ .........-........-+ .+ ..+-...+...+-+.. +..+--|+.+.+= + T Consensus 1 ~~~-------------~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~-~~~~epp~~~~~La~l~~~n 66 (348) T protein:vir:26 1 MTE-------------QLIHSHTTDGTESKSVYSFDPNPEPVDTNSWMTRYCELFYNDF-DDYWEPPISLKGLAEIANAN 66 (348) T ss_pred CCc-------------cccchhhccccCCceEEEecCCCeeecCcchHHHHHHHHhcCC-CccccCCCCHHHHHHHHhhh Confidence 000 00000000000000000000 00 0000011111111000 0011111111000 0 Q ss_pred hh----HHH---HHHHHHhccCh--hHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEE Q lcl|NC_010808. 118 KD----VLE---AIEAFNDLNDV--ESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAG 187 (512) Q Consensus 118 ~~----~~~---~l~~~~~~n~~--~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~ 187 (512) .- ... .|...+.-|.. ...+.+++.+.+.+|.||+.+-++..|++ .+..++|..+-+.-| . T Consensus 67 ~~h~~~i~~k~N~l~~~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~v~~~~d----~----- 137 (348) T protein:vir:26 67 GYHGSLLKARANYVAGRFMNGGGLPMYKMNSACWDYFGLGMSAFVKIRSYLKNVIALEPLPMVHMRKRKN----G----- 137 (348) T ss_pred hhhhhhHhhhhhHHhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCcEEEEEEecCceeEeeec----C----- Confidence 00 000 00011112221 24456778899999999999999888875 355555554422211 0 Q ss_pred EEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHH Q lcl|NC_010808. 188 VRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDL 267 (512) Q Consensus 188 v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa 267 (512) ++|.+.. . .. ...|.++.+.++..-. | .....|.|.+...+.-+.. T Consensus 138 -~~~~~~~-~---g~----~~~f~~~dIiHir~~~---------------------~----~~~~~Gls~~~~a~~si~l 183 (348) T protein:vir:26 138 -DFVQLLR-N---NE----QKVFKAKDVIFIPQYD---------------------P----QQQIYGLPDYLGSIQSSLL 183 (348) T ss_pred -cEEEEEe-c---Ce----EEEEcCccEEEEcCCC---------------------C----CCCcccccHHHHHHHHHHH Confidence 0111100 0 00 0113333333321100 0 0112466666554443332 Q ss_pred HHHHHHHHHHHHHHhcCceeeee-cCCcCChhhhhhhhhccccccchhhhhhcccccCC--CCCcceeEEeecCCHHHHH Q lcl|NC_010808. 268 YDNAESDTANYMSDLNDAMLLIK-GNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIET--EGSVDGGYIYKQYDVQGTE 344 (512) Q Consensus 268 ~~~~~s~~~~~~~~~~~~~lv~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~l~~~~~~~~~~ 344 (512) -+.+..-.....+-.+.|-.++. .....++++...+++.-.-....+......+..+. +++.++..++.......+. T Consensus 184 ~~~a~~~~~~~f~NGa~pg~Il~~~~~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~Gi~~~pis~~~~d~qf~ 263 (348) T protein:vir:26 184 NRDATLFRRRYYLNGAHMGFIFYATDPNLSEADEKALKEKIASSKGIGNFRSMFVNIPNGKEKGIQLIPVGDIATKDEFE 263 (348) T ss_pred HHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceeEEcCCCCccceeEEEccCChhHHHHH Confidence 22222122222333445555553 21223333333333211100000000000111111 2233344443333344566 Q ss_pred HHHHHHHHHHHHHhcccccccccccccch--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccc Q lcl|NC_010808. 345 AYKDRLNSDIHMFTNTPNMKDDNFSGTQS--GEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF 422 (512) Q Consensus 345 ~~~~~l~~~i~~~s~~p~~~~~~~~~n~S--g~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~ 422 (512) +..+.-...|...-++|....+....+.+ +.+-+. ....+...|.-+++.+...++..-... .. T Consensus 264 e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~~-----------~~~f~~~~l~P~~~~ie~~ln~~l~~~---~~ 329 (348) T protein:vir:26 264 RIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLKV-----------SQVYDFYEVIPVCKRFMDAVNNDPEIP---DN 329 (348) T ss_pred HHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHHH-----------HHHHHHHHHHHHHHHHHHHHhhhhCCC---Cc Confidence 67777778899998999766543222111 111111 112223334444444433333221111 11 Q ss_pred ceeeEEeCCCCCcCHHHHH Q lcl|NC_010808. 423 NTVRYVYNRNLPKSLIEEL 441 (512) Q Consensus 423 ~~i~i~f~~~~p~d~~~~~ 441 (512) ..+++.|++..-++..+.+ T Consensus 330 ~~~~fdl~~~~e~~~~~a~ 348 (348) T protein:vir:26 330 LKLKFNLNPGVESANGSAV 348 (348) T ss_pred cEEEEecCcccccchhhcC Confidence 1233333333222222221 No 262 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=29.24 E-value=1.7 Score=19.35 Aligned_cols=319 Identities=10% Similarity=0.051 Sum_probs=108.9 Q ss_pred HHHHHHHHHHHHHHHHHHhccccccccccc--ccccccccceeeecc--hHHHHHHHH--HhhhhccCceecC------- Q lcl|NC_010808. 49 IEHHMDYQRPRLKVLSDYYEGKTKNLVELT--RRKEEYMADNRVAHD--YASYISDFI--NGYFLGNPIQCQD------- 115 (512) Q Consensus 49 i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~--~~~~~~~~~~ri~~n--~~~~iv~~~--a~~l~g~~~~~~~------- 115 (512) ++++.. | ....+....... ....-+.+. .+++ ..-.-+... .+..+--|+...+ T Consensus 1 m~~~~~----~-------~~~~~~~~~~~~~~~~~~~~~p~--~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~~ 67 (346) T protein:vir:10 1 MKKQLR----K-------NLTQNDRLQPQAQTEIFSFGDPI--PVLDRADILNYLECSAMYEKWYNPPMSFDGLAKSLRS 67 (346) T ss_pred CCcccC----C-------CCCcccccccccCeEEEecCCcc--eecCchhHHHHHHHhhcCCceEecCCCHHHHHHHHHh Confidence 000000 0 000000000000 000000000 0000 000001100 0000001111100 Q ss_pred ----C-c-hhHHHHHHHHHhc-cCh--hHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeE Q lcl|NC_010808. 116 ----D-D-KDVLEAIEAFNDL-NDV--ESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSI 185 (512) Q Consensus 116 ----~-d-~~~~~~l~~~~~~-n~~--~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~ 185 (512) . - ..-...|..+++. |.. ...+.+++.+.+.+|.||+.+.++..|++ .+..++|..+.+..++. .. T Consensus 68 ~~~h~~~i~~k~n~l~~l~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~i~r~~~G~~~~L~pl~~~~v~~~~~~~---~~- 143 (346) T protein:vir:10 68 STHHESAIITKANILLSTCEVDSRYLSRRDLSSFVKDYLVFGNAYFEVVRNRLGQVQRIESPLAKYVRKGLEAG---QF- 143 (346) T ss_pred hhhcchhhhhhhhhHHHHHhCCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCcEEEEEEecCCceEEEEcCC---eE- Confidence 0 0 0001123334432 211 23456678889999999999999888875 45666776665433221 00 Q ss_pred EEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHH Q lcl|NC_010808. 186 AGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLI 265 (512) Q Consensus 186 ~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~li 265 (512) +|.....+ .. . ..|.++.|++++... ......|.|.+...+..+ T Consensus 144 ----~~~~~~~~----g~--~-~~~~~~dIih~r~~~-------------------------~~~~~~G~~~~~~a~~si 187 (346) T protein:vir:10 144 ----YYVPQRFD----HQ--E-HEFAKGSIYHLLEPD-------------------------INQDIYGLPQYLSALQSA 187 (346) T ss_pred ----EEEEEccC----Ce--E-EEEecccEEEecCCC-------------------------CCCCeeeccHHHHHHHHH Confidence 01110000 00 0 112333333321110 001124666665544444 Q ss_pred HHHHHHHHHHHHHHHHhcCceeeee-cCCcCChhhhhhhhhccccccchhhhhhccccc--CCCCCcceeEEeecCCHHH Q lcl|NC_010808. 266 DLYDNAESDTANYMSDLNDAMLLIK-GNLSLDPDEVKKQKEANVLFLEPTVYENRDTGI--ETEGSVDGGYIYKQYDVQG 342 (512) Q Consensus 266 Da~~~~~s~~~~~~~~~~~~~lv~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~l~~~~~~~~ 342 (512) .....+..-..+...-.+.|-.++. .....++++.+.+++.-.-....+......... +..++.++..++....... T Consensus 188 ~l~~~a~~~~~~~~~NG~~~~~il~~~d~~l~~e~~~~i~~~~~~~~g~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~q 267 (346) T protein:vir:10 188 WLNESATLFRRKYFLNGAHAGFVFYMSDASQKQEDVENIRQQLKQSKGVGNFKNLFVHAPNGKKDGIQIIPIADVSAKDE 267 (346) T ss_pred HHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEecCCChhHHH Confidence 4322222222222333344555442 112234444444433211000000000000000 1112223333333233455 Q ss_pred HHHHHHHHHHHHHHHhcccccccccccccchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccc Q lcl|NC_010808. 343 TEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSG-EAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKD 421 (512) Q Consensus 343 ~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg-~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d 421 (512) +.+..+...++|...-++|....+-..++.++ ..++ ......+...|..+++.|.++....+. + T Consensus 268 f~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e----------~~~~~f~~~~l~P~~~~iee~n~~L~~-----e 332 (346) T protein:vir:10 268 FFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVA----------DAAEVFFITEIEPLQERLKEFNQWLGQ-----E 332 (346) T ss_pred HHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHH----------HHHHHHHHHHHHHHHHHHHHHHhhccc-----c Confidence 66677778888999889987765533222211 1111 011122233333333333332221110 0 Q ss_pred cceeeEEeCCCCCcCHHH Q lcl|NC_010808. 422 FNTVRYVYNRNLPKSLIE 439 (512) Q Consensus 422 ~~~i~i~f~~~~p~d~~~ 439 (512) .+.|++...-..++ T Consensus 333 ----~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 333 ----VIKFKPSKLLQRTQ 346 (346) T ss_pred ----eeeechhhhcccCC Confidence 24565443333333 No 263 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=27.95 E-value=1.8 Score=19.19 Aligned_cols=317 Identities=11% Similarity=0.051 Sum_probs=107.2 Q ss_pred HHHHHhcccccccccccccccccccceee-ecch--HH------HHHHHHHhhhhcc----CceecCC------chhHHH Q lcl|NC_010808. 62 VLSDYYEGKTKNLVELTRRKEEYMADNRV-AHDY--AS------YISDFINGYFLGN----PIQCQDD------DKDVLE 122 (512) Q Consensus 62 ~~~~yy~G~~~~~~~~~~~~~~~~~~~ri-~~n~--~~------~iv~~~a~~l~g~----~~~~~~~------d~~~~~ 122 (512) +-++-+...+.-...... ....+...++ +..| +. -+.+-.--+..|+ |+.+.+= +..... T Consensus 1 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~ 79 (351) T protein:vir:79 1 MSKRRSRAPRTFAAAPNP-SAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSS 79 (351) T ss_pred CCCCCCCCCCCCCCCCch-hhhhcccceeEEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHhhhh Confidence 000001110000000000 0000000111 1110 10 1111111111122 1111100 000000 Q ss_pred HH-------HHHHhccCh--hHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCCceeEEEEEEee Q lcl|NC_010808. 123 AI-------EAFNDLNDV--ESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLR 192 (512) Q Consensus 123 ~l-------~~~~~~n~~--~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~v~~~~ 192 (512) .| ...+.-|.. ...+.+++.+.+.+|.||+.+-.+..|++ .+..++|..+-+..+.. +||. T Consensus 80 ~l~~k~n~l~~~~~Pnp~~t~~~f~~~v~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~~~---------~~~~ 150 (351) T protein:vir:79 80 ALFFKANVLASTFRPHRWLSRHAFERWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFS---------GFVY 150 (351) T ss_pred hhhhhhhHHhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCCEEEEEEeCCcceeeeecCC---------eEEE Confidence 11 111112211 22356788899999999999999888874 45666666654432221 1111 Q ss_pred eeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHHHHHHHHHHHHH Q lcl|NC_010808. 193 TKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAE 272 (512) Q Consensus 193 ~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~ 272 (512) +.. . .. ...|.++.|.+++... | ...-.|.|.+.....-+..-..+. T Consensus 151 ~~~---~--g~---~~~~~~~eIihir~~~---------------------~----~~~~yGl~~~~~a~~si~l~~~a~ 197 (351) T protein:vir:79 151 VNG---W--QE---RHEFEPDSVFQLVRPD---------------------I----NQEVYGLPEYLSSLHSAWLNESST 197 (351) T ss_pred Eec---C--ce---EEEEcCccEEEeCCCC---------------------C----CCCcccccHHHHHHHHHHHHHHHH Confidence 110 0 00 0123333333332110 0 011236666654443333222211 Q ss_pred HHHHHHHHHhcCceeee--ecCCcCChhhhhhhhhccccccchhhhhhcccccCC--CCCcceeEEeecCCHHHHHHHHH Q lcl|NC_010808. 273 SDTANYMSDLNDAMLLI--KGNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIET--EGSVDGGYIYKQYDVQGTEAYKD 348 (512) Q Consensus 273 s~~~~~~~~~~~~~lv~--~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~l~~~~~~~~~~~~~~ 348 (512) .-....+.-.+.|-.++ ++ ...++++.+.+++.-.-...........+..+. +++.++..++.......+.+..+ T Consensus 198 ~~~~~~f~NGa~pg~il~~~~-~~ls~e~~~~lk~~~~~~~G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~k~ 276 (351) T protein:vir:79 198 LFRRKYYENGSHAGFILYMTD-AAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKN 276 (351) T ss_pred HHHHHHHhccCCCceEEEecC-CCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHH Confidence 11122223334444444 33 223444443333221100000000000011111 22333444443334455667777 Q ss_pred HHHHHHHHHhcccccccccccccchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccceeeE Q lcl|NC_010808. 349 RLNSDIHMFTNTPNMKDDNFSGTQSG-EAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRY 427 (512) Q Consensus 349 ~l~~~i~~~s~~p~~~~~~~~~n~Sg-~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~i 427 (512) ...+.|...-++|....+-..++.++ .-++- ..+..+...|.-+++.+.++-...+ .+ -+ T Consensus 277 ~s~~eI~~a~~VPp~llGi~~~~t~~~~n~e~----------~~~~f~~~~l~Pl~~~ie~ln~~lg-----~~----~~ 337 (351) T protein:vir:79 277 VTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDT----------AARVFGRNEIRPLQARFAELNDWLG-----DE----VV 337 (351) T ss_pred HhHHHHHHHhCCCHHHhcccCCCCCCcccHHH----------HHHHHHHHHHHHHHHHHHHHHhhcC-----cc----ee Confidence 78888999989997665543222221 11110 1112223333333333333211111 01 14 Q ss_pred EeCCCCCcCHHHHHHHHHHH Q lcl|NC_010808. 428 VYNRNLPKSLIEELKAYIDS 447 (512) Q Consensus 428 ~f~~~~p~d~~~~~~~~~kl 447 (512) .|++..- .....++ T Consensus 338 ~F~~~~l------lr~d~~a 351 (351) T protein:vir:79 338 TFDDYEI------PPAPVAA 351 (351) T ss_pred eeChhhh------ccccccC Confidence 5654321 1111111 No 264 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=24.38 E-value=2.2 Score=18.72 Aligned_cols=330 Identities=12% Similarity=0.020 Sum_probs=112.9 Q ss_pred HHHHHHHHHHHHHHhccccccccccccccccccc--cee---eecchHH------HHHHH----HHhhhhccCceecCC- Q lcl|NC_010808. 53 MDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMA--DNR---VAHDYAS------YISDF----INGYFLGNPIQCQDD- 116 (512) Q Consensus 53 ~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~--~~r---i~~n~~~------~iv~~----~a~~l~g~~~~~~~~- 116 (512) ..+++.|..+ +-..+..... ......+.+. ..+ +..+=+. .+.+. ..+..+..|+.+.+= T Consensus 1 m~~~~~~~~~--~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~fg~p~~~~~~~~~~~~~~~~~~~~~~~~pi~~~~la 76 (368) T protein:vir:79 1 MSRNKTRRAA--RAASAHVRTA--NTDAPTEHHTDRAAQAEVFSFGDPVEVLDRRELLDYVECMRMGQWYEPPMPWDGLA 76 (368) T ss_pred CCccccccch--hccCcccccc--cccCcchhhccccCceEEEEcCCceeecchhhHHHHHHHHhccchhccCcCHHHHH Confidence 0111100000 0001100000 0000000000 000 0000000 01110 111112223322110 Q ss_pred ---------ch--hHH-HHHHHHHhccCh--hHHHHHHHHHHHhCCeEEEEEEECCCCce-EEEEEccceeEEEEeCCCC Q lcl|NC_010808. 117 ---------DK--DVL-EAIEAFNDLNDV--ESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIE 181 (512) Q Consensus 117 ---------d~--~~~-~~l~~~~~~n~~--~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~ 181 (512) .. ... ..+.-+..-|.. ...+.+++.+.+.+|.||+.+..+..|++ .+..++|..+-..-+. T Consensus 77 ~~~~~~~~h~~~~~~~~n~l~l~~~Pn~~~t~~~f~~l~~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~~--- 153 (368) T protein:vir:79 77 RSFRAAAHHSSAVYVKRNILVSTFIPHPLLSRATFERLVLDWQVFGNAYLERRENVLGGTIRLDTPLAKYVRRGLDL--- 153 (368) T ss_pred HHHhhccccchhhhhhcchhhhhcCCCcCCCHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEeCcccceeeccC--- Confidence 00 000 011112223321 13356788899999999999999888875 3555666654322211 Q ss_pred ceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcchHHH Q lcl|NC_010808. 182 RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKV 261 (512) Q Consensus 182 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~~~v 261 (512) . ++|.+.. .. . . ..|.++.|.+++.-. | .+.-.|.|.+... T Consensus 154 ~------~~~~~~~---~~--~--~-~~~~~~dIihir~~~---------------------~----~~~~yGlsp~~~a 194 (368) T protein:vir:79 154 N------TYFFVQN---WQ--Q--P-YTFAAGSVFHLQEPD---------------------I----NQEVYGLPEYLSA 194 (368) T ss_pred C------EEEEEec---CC--e--E-EEEccccEEEecCCC---------------------C----CCCcccccHHHHH Confidence 0 1111100 00 0 0 012223222221100 0 0012467777665 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeee-cCCcCChhhhhhhhhccccccchhhhhhcccc-c--CCCCCcceeEEeec Q lcl|NC_010808. 262 ITLIDLYDNAESDTANYMSDLNDAMLLIK-GNLSLDPDEVKKQKEANVLFLEPTVYENRDTG-I--ETEGSVDGGYIYKQ 337 (512) Q Consensus 262 ~~liDa~~~~~s~~~~~~~~~~~~~lv~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~~~~~~~~~~l~~~ 337 (512) +.-++.-+.+..-....++-.+.|-.++. .....+++....+++.-.- ............ . +.+++.++..++.. T Consensus 195 ~~si~l~~aa~~~~~~~~~NGa~~~gil~~~~~~l~~e~~~~lk~~~~~-~~G~~N~g~~~vl~~~g~~~g~~~~pls~~ 273 (368) T protein:vir:79 195 LNATWLNESATLFRRRYYKNGSHAGFILYMTDAAQKQEDVDTLREAMKS-AKGPGNFRNLFMYAPNGKKDGIQLLPVSEV 273 (368) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHHHHHHHHH-hcCCcccCceeEecCCCCccceeEEEcCCC Confidence 55554433322222333333444554442 1122344444333322110 000000000000 1 11233344444433 Q ss_pred CCHHHHHHHHHHHHHHHHHHhcccccccccccccchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC Q lcl|NC_010808. 338 YDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSG-EAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSI 416 (512) Q Consensus 338 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg-~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~ 416 (512) .....+.+..+...+.|...-++|....+...++.++ .-++- .....+...|.-+++.+.++....+. T Consensus 274 ~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~e~----------~~~~f~~~~l~Pl~~~ie~ln~~l~~- 342 (368) T protein:vir:79 274 AAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIPNNTGGFGDVEK----------AAMVFARNEVKPLQDRLLAINDWIGD- 342 (368) T ss_pred HHHHHHHHHHHHhHHHHHHHhCCCHHHccccCCCCCccccHHH----------HHHHHHHHHHHHHHHHHHHHHhccCc- Confidence 3445566677788888999999997766543333321 11111 11122333344444443332211110 Q ss_pred CcccccceeeEEeCCCC--CcCHHHHHHHHHHHh Q lcl|NC_010808. 417 DANKDFNTVRYVYNRNL--PKSLIEELKAYIDSG 448 (512) Q Consensus 417 ~~~~d~~~i~i~f~~~~--p~d~~~~~~~~~kl~ 448 (512) + .++|++.. -.|..+.++...+.+ T Consensus 343 ----e----~~rF~~~~l~~~D~~a~a~~~~rsa 368 (368) T protein:vir:79 343 ----E----VVRFAPYALGGHDQPAAAPGGQRSA 368 (368) T ss_pred ----c----eeeechhHhhcccccccCCcccccC Confidence 0 13454321 122222222222222 No 265 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=22.96 E-value=2.4 Score=18.53 Aligned_cols=467 Identities=8% Similarity=-0.010 Sum_probs=164.2 Q ss_pred CCccee--eccccchhhccccccCCCcCeeecccchhHHhhhcHHHHHHHH----HHHHHHHHHHHHHHHHHhccccccc Q lcl|NC_010808. 1 MLKANE--FETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYI----EHHMDYQRPRLKVLSDYYEGKTKNL 74 (512) Q Consensus 1 ~~~~~~--~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~l~~~i----~~~~~~~~~r~~~~~~yy~G~~~~~ 74 (512) |+.--- |+..++.. ..++-+++.+.. .....-......+.+- .+.......+|+.+..+++=+.. T Consensus 1 m~~lfgf~i~~~~~~~--~~S~vpp~~~~~-----~~~i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~~pEVd~A-- 71 (564) T protein:vir:10 1 MSQLFGFLINEKEGQK--GQSPVPPNDEAS-----VSTVAGGYFGTYVDTSGGQNSRNEYELIRRYRDMSLHPEVDSA-- 71 (564) T ss_pred CcchhcceeeeeccCC--CCCcccCCcCCC-----hhhhhccccceeeecccccchhhHHHHHHHHHHHhhccchhhH-- Confidence 543222 22222211 111112221111 0000000000000000 00011222234443333222211 Q ss_pred ccccccccccccceeeecchHHHHHHHHHh-hhhccCceecCCc--------hhHHHHHHHHHhccChhHHHHHHHHHHH Q lcl|NC_010808. 75 VELTRRKEEYMADNRVAHDYASYISDFING-YFLGNPIQCQDDD--------KDVLEAIEAFNDLNDVESHNRSLGLDLS 145 (512) Q Consensus 75 ~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~-~l~g~~~~~~~~d--------~~~~~~l~~~~~~n~~~~~~~~~~~~~~ 145 (512) ...||+..+- --..+|+.+..++ +...+.++.+++--+|+....+..+.+. T Consensus 72 --------------------v~eIVneaIv~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WY 131 (564) T protein:vir:10 72 --------------------IDEIVNEFVVNDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWY 131 (564) T ss_pred --------------------HHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhh Confidence 1122222111 1123344443332 2244556666666689999999999999 Q ss_pred hCCeEEEEEEECC----CCceEEEEEccceeEEEEeCCCCc--eeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEE Q lcl|NC_010808. 146 IYGKAYELMIRNQ----DDETRLYKSDAMSTFVIYDNTIER--NSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYL 219 (512) Q Consensus 146 ~~G~a~~~v~~d~----~g~~~i~~~~p~~~~~i~d~~~~~--~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~ 219 (512) +.|+.|.+.-.|. +|-..+..+||..+-.++..-.+. .....++-+.... . .......-+|.+.. |. T Consensus 132 VDgRi~fHkiid~~~pk~GI~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~-~---y~~~~Eyy~Ynp~~---~~ 204 (564) T protein:vir:10 132 VDGRSHYHKVIDLDNPKKGILELRYIDSLKIRKVRQKLKDVDPNRKEIEKGTALQY-D---YGDFIEYYIYNPKG---FA 204 (564) T ss_pred hcceEEEEEEeeCCChhhhhhhhhhhcccceeeeeeeccccccccceeeeeeeeec-c---ccccccceeecccc---cc Confidence 9999998877663 354567889999887777322111 1111111111000 0 00000111222211 00 Q ss_pred ecCCcccccccc---ccccccccccccceEee----cCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeee--- Q lcl|NC_010808. 220 TSRTNGLKLTPR---ENGFESHSFERMPITEF----SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLI--- 289 (512) Q Consensus 220 ~~~~~~~~~~~~---~~~~~~~~~~~vPvv~~----~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~--- 289 (512) .... ...+.. ......-+-.-|+.++. +++..-.|-+...+.-.+.+ +++-|.+...+..+.|-+=+ T Consensus 205 g~~~--~~~~~~~~~~~~~ikI~~daI~y~hSGL~d~~~~~i~gyLhkAIKp~NQL-kmlEDAlVIYRitRAPeRRvFYI 281 (564) T protein:vir:10 205 GNIP--MVTGSMDWSNQEGIKIASDAIAQSTSGLMDLNKKMTLSFLHKAIKSLNQL-RMIEDSLVIYRLSRAPERRIFYI 281 (564) T ss_pred Cccc--ccccccccccccceeechhhcceecccceeCCCCceeccchhhhHhHHhh-HHHHhhHHHHhhhccccceEEEE Confidence 0000 000000 00000000111111111 01111122333222221111 12333333334444443311 Q ss_pred -ecCCcCC----------------------hhhhhhhhhccccccchhhhhhcccccCCCCCcceeEEeecCCHHHHHHH Q lcl|NC_010808. 290 -KGNLSLD----------------------PDEVKKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAY 346 (512) Q Consensus 290 -~g~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 346 (512) .|..+.. ..++...+....+ ++.-...-+. .+.+..+..|-...+...++ - T Consensus 282 DVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGevrddrk~msM-lEDyWLPRRe----GgrgTEItTLpGgqnLgem~-D 355 (564) T protein:vir:10 282 DVGNLPKVKAEQYLRDVMSRYRNKLVYDGQTGEIRDDKKHMSM-LEDFWLPRRE----GGRGTEITTLPGGQNLGELK-D 355 (564) T ss_pred ecCCCCchhHHHHHHHHHHhcCceEEEeccCceecccchhhhh-HhhhcccccC----CCcccceeeccccCCcchHH-H Confidence 1221111 1111111111111 1111111110 11222343443333322222 2 Q ss_pred HHHHHHHHHHHhcccccccccc--cccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccccc Q lcl|NC_010808. 347 KDRLNSDIHMFTNTPNMKDDNF--SGTQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFN 423 (512) Q Consensus 347 ~~~l~~~i~~~s~~p~~~~~~~--~~n~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~d~~ 423 (512) +.-+.+.++..-++|-.-.+.- +-+. -+..|---+.....-+.+.+..|..-+.++++.=+-+-+.+...+++.-.. T Consensus 356 V~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~ 435 (564) T protein:vir:10 356 VEYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTEILRDELKFTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEE 435 (564) T ss_pred HHHHHHHHHHHhCCCcccccCCCceeecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhh Confidence 3444455666666764322211 1111 122343334445555667777777777776654322222222222222224 Q ss_pred eeeEEeCCCCCcCHHHHHHH-------HHHH---hc-cCChHHHHHhCCCCCCH--HHHHHHHHHHHHHHHHHHH-hhcc Q lcl|NC_010808. 424 TVRYVYNRNLPKSLIEELKA-------YIDS---GG-KISQTTLMSLFSFFQDP--ELEVKKIEEDEKESIKKAQ-KGIY 489 (512) Q Consensus 424 ~i~i~f~~~~p~d~~~~~~~-------~~kl---~g-~~s~et~~~~~~~v~d~--~~E~~ri~~E~~~~~~~~~-~~~~ 489 (512) .|.+.|...-.-.+...++. +..+ .| .+|.+++++.+-..+|. .++-++|++|..+..-... +... T Consensus 436 ~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~ 515 (564) T protein:vir:10 436 HIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNM 515 (564) T ss_pred cceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhc Confidence 67777855444434433333 3333 23 47999999987555542 3455667776653211000 0000 Q ss_pred cCCCCCCCCCC-------------CC---------CCcCcccCCC Q lcl|NC_010808. 490 KDPRDINDDEQ-------------DD---------DTKDTVDKKE 512 (512) Q Consensus 490 ~~~~~~~~~~~-------------~~---------~~~~~~~~~e 512 (512) .++.+.+...- +. +.....+.+| T Consensus 516 ~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 560 (564) T protein:vir:10 516 LDDMEKQNQAFAPELQAAQDDLAAEREIKKLNSAPKPPPSQQSKS 560 (564) T ss_pred CCCccCCCCcCCcchhhhccccccccChhhhccCCCCCCCCCCcC Confidence 00000000000 00 0001111111 No 266 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=22.12 E-value=2.5 Score=18.41 Aligned_cols=299 Identities=11% Similarity=0.026 Sum_probs=103.5 Q ss_pred HHHHHHHHHHHHHHHHHHhcccccccccccccccccccceeeecchHHHHHHHHHhhhhccCceecCCc----------- Q lcl|NC_010808. 49 IEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDD----------- 117 (512) Q Consensus 49 i~~~~~~~~~r~~~~~~yy~G~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~a~~l~g~~~~~~~~d----------- 117 (512) +++++.+ .+.+..... ....- ....|.||+|.-+-..- T Consensus 1 ~~~~~~~---------------~~~~~~~~~----~~~~~------------~~~~~~~~~p~~v~~~~~~~~~~~~~~~ 49 (344) T protein:vir:56 1 MSKKKGK---------------TPQPAAKTM----TASAP------------KMEAFTFGEPVPVLDRRDILDYVECISN 49 (344) T ss_pred CCCCCCC---------------CCchhhHHh----hcCCC------------ceEEEEcCCceeecCcchhhhHHHhhhc Confidence 1110000 000000000 00000 01112222221110000 Q ss_pred ----------hhH-----------------HHHHHHHHhccCh--hHHHHHHHHHHHhCCeEEEEEEECCCCceE-EEEE Q lcl|NC_010808. 118 ----------KDV-----------------LEAIEAFNDLNDV--ESHNRSLGLDLSIYGKAYELMIRNQDDETR-LYKS 167 (512) Q Consensus 118 ----------~~~-----------------~~~l~~~~~~n~~--~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~-i~~~ 167 (512) ..+ ...|...+.-|.. ...+..++.+.+.+|.||+.+-.+..|++. +..+ T Consensus 50 ~~~~~pp~~~~~la~~~~a~~~h~s~i~~k~n~l~~~~~Pnp~~t~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl 129 (344) T protein:vir:56 50 GRWYEPPVSFTGLAKSLRAAVHHSSPIYVKRNILASTFIPHPWLSQQDFSRFVLDFLVFGNAFLEKRYSTTGKVIRLETS 129 (344) T ss_pred CccccCCCCHHHHHHHHhhhhhhCccceehhhhHHhhcCCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEe Confidence 000 0011122222321 134567788999999999999888888753 4445 Q ss_pred ccceeEEEEeCCCCceeEEEEEEeeeeeeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEe Q lcl|NC_010808. 168 DAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITE 247 (512) Q Consensus 168 ~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~ 247 (512) +|..+-...+.. +||.+.. . . . . ..|.++.|.++.... | T Consensus 130 ~~~~v~~~~~~~---------~~~~~~~---~-g-~--~-~~~~~~dIiHir~~~---------------------~--- 168 (344) T protein:vir:56 130 PAKYTRRGVEED---------VYWWVPS---F-N-E--P-TAFAPGSVFHLLEPD---------------------I--- 168 (344) T ss_pred CCceeEEeecCC---------EEEEEec---C-C-e--E-EEEcCccEEEECCCC---------------------C--- Confidence 555443221110 1111100 0 0 0 0 012333333221100 0 Q ss_pred ecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee-cCCcCChhhhhhhhhccccccchhhhhhcccccCC- Q lcl|NC_010808. 248 FSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIK-GNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIET- 325 (512) Q Consensus 248 ~~n~~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 325 (512) .+.-.|.|.+...+.-++.-..+..-......-.+.|-.++. .....++++.+.+++.-.-....+.........+. T Consensus 169 -~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d~~ls~e~~~~lk~~~~~~~g~~~~r~l~l~~p~g 247 (344) T protein:vir:56 169 -NQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLRENMVKSKGRNNFKNLFLYAPQG 247 (344) T ss_pred -CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCCCCccceEEecCCC Confidence 011246666654433333222211111222222345555543 12223444444333221100000100001111111 Q ss_pred -CCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhcccccccccccccchHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010808. 326 -EGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEA-MKYKLFGLEQRTKTKEGLFTKGLRRRA 403 (512) Q Consensus 326 -~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~A-i~~~~~~l~~k~~~~~~~~~~~l~~~~ 403 (512) .++.++..++-......+.+..+...+.|...-++|....+....+.++-+ ++.. ....+...|.-++ T Consensus 248 ~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~eq~----------~~~f~~~tL~Pl~ 317 (344) T protein:vir:56 248 KADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKV----------AKVFVRNELIPLQ 317 (344) T ss_pred CccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHH----------HHHHHHHHHHHHH Confidence 233344444433344556777888888899999999877654333332111 1110 0111222222222 Q ss_pred HHHHHHHHhccCCCcccccceeeEEeCCCCCcCHHH Q lcl|NC_010808. 404 KLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIE 439 (512) Q Consensus 404 ~li~~~l~~~~~~~~~~d~~~i~i~f~~~~p~d~~~ 439 (512) +.+.++....+. + .+.|.+..-.+..+ T Consensus 318 ~~ie~~n~~l~~-----~----~~~F~~y~l~~~~~ 344 (344) T protein:vir:56 318 DRIREINGWIGQ-----E----VIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHhhhcc-----c----cccCCCccccccCC Confidence 222222111110 0 13344333332222 No 267 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=20.42 E-value=2.8 Score=18.15 Aligned_cols=214 Identities=11% Similarity=0.017 Sum_probs=77.8 Q ss_pred CCceeEEEEEEeeee-eeccCCcceEEEEEEEcCCcEEEEEecCCccccccccccccccccccccceEeecCCCCCCcch Q lcl|NC_010808. 180 IERNSIAGVRYLRTK-PIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDY 258 (512) Q Consensus 180 ~~~~~~~~v~~~~~~-~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s~~ 258 (512) .+...-.-++|.... ..+..+ ....|.++.+.+++.- + |. ..-.|.|.+ T Consensus 1 ~r~~~dg~~~y~~~~~~~~~~g-----~~~~~~~~eilH~r~~----------------~-----~~----~~~~Glspi 50 (219) T protein:vir:98 1 MRVCKDGNYKYLMKKSLYDTKS-----EIYEYNKNDVIFIKLY----------------D-----PM----QQVYGSPDY 50 (219) T ss_pred CceeecCeEEEEEecceecCCc-----eeEEeccccEEEecCC----------------C-----CC----CCcceecHH Confidence 000000000000000 000000 0011222332222110 0 00 112467766 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCceeeee-cCCcCChhhhhhhhhccccccchhhhhhcccccC--CCCCcceeEEe Q lcl|NC_010808. 259 EKVITLIDLYDNAESDTANYMSDLNDAMLLIK-GNLSLDPDEVKKQKEANVLFLEPTVYENRDTGIE--TEGSVDGGYIY 335 (512) Q Consensus 259 ~~v~~liDa~~~~~s~~~~~~~~~~~~~lv~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~l~ 335 (512) ......+.....+..-....+.-.+.|-.++. .....+++....++..-.-...........+..+ ..++.+.+.++ T Consensus 51 ~~a~~~i~~~~aa~~~~~~~f~Ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~ 130 (219) T protein:vir:98 51 VGGITSALLNSDATIFRRRYYSNGAHMGFILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIG 130 (219) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEcc Confidence 65544444322222222223344556665553 2112333333333321100000000000000001 12334444444 Q ss_pred ecCCHHHHHHHHHHHHHHHHHHhccccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010808. 336 KQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTR 414 (512) Q Consensus 336 ~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~ 414 (512) .......+.+..+.....|...-++|....+-.. +..++..++- .....+...|...+..|...++..- T Consensus 131 ~~~~d~qfle~rk~~~~eIa~~fgVPp~~lG~~~~~~~~~sn~eq----------~~~~f~~~tL~P~~~~ie~~ln~~~ 200 (219) T protein:vir:98 131 DTGQKDEFANIKNISAQDVLTSHRFPPGLSGIIPVNTAGLGDPLK----------IREAYQADEVLPLQEIIAESINSDY 200 (219) T ss_pred CCHHHHHHHHHHHhhHHHHHHHhCCCHHHcccccCCCCCccCHHH----------HHHHHHHHHHHHHHHHHHHHhhhhh Confidence 3333445555666667788888899987765322 1111111111 1112334444444444444443221 Q ss_pred CCCcccccceeeEEeCCCCCcCHH Q lcl|NC_010808. 415 SIDANKDFNTVRYVYNRNLPKSLI 438 (512) Q Consensus 415 ~~~~~~d~~~i~i~f~~~~p~d~~ 438 (512) .. ...+.+.|....+.|.. T Consensus 201 ~~-----~~~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 201 EI-----KSALKVNFKQPEKRDKN 219 (219) T ss_pred cC-----CCccEEeecCcccccCC Confidence 11 12457788888887777 Done!